BLASTX nr result

ID: Mentha27_contig00001143 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00001143
         (1713 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006349291.1| PREDICTED: uncharacterized protein LOC102579...   428   e-117
ref|XP_006349292.1| PREDICTED: uncharacterized protein LOC102579...   427   e-117
ref|XP_004230425.1| PREDICTED: uncharacterized protein LOC101259...   424   e-116
gb|EYU35025.1| hypothetical protein MIMGU_mgv1a021997mg [Mimulus...   407   e-111
gb|EYU33290.1| hypothetical protein MIMGU_mgv1a022895mg [Mimulus...   407   e-111
ref|XP_006470988.1| PREDICTED: uncharacterized protein LOC102624...   405   e-110
ref|XP_006431494.1| hypothetical protein CICLE_v10001347mg [Citr...   405   e-110
ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255...   402   e-109
ref|XP_007041142.1| Uncharacterized protein isoform 1 [Theobroma...   402   e-109
ref|XP_007041143.1| Uncharacterized protein isoform 2 [Theobroma...   394   e-107
ref|XP_007222756.1| hypothetical protein PRUPE_ppa006529mg [Prun...   388   e-105
ref|XP_004301559.1| PREDICTED: uncharacterized protein LOC101306...   381   e-103
ref|XP_006431495.1| hypothetical protein CICLE_v10001347mg [Citr...   380   e-103
ref|XP_003548344.1| PREDICTED: uncharacterized protein LOC100797...   377   e-102
ref|XP_002528866.1| conserved hypothetical protein [Ricinus comm...   377   e-101
ref|XP_006598790.1| PREDICTED: uncharacterized protein LOC100797...   376   e-101
ref|XP_007041144.1| Uncharacterized protein isoform 3, partial [...   373   e-100
ref|XP_006592649.1| PREDICTED: uncharacterized protein LOC100526...   371   e-100
ref|XP_006589225.1| PREDICTED: uncharacterized protein LOC100807...   370   e-100
ref|NP_193588.5| uncharacterized protein [Arabidopsis thaliana] ...   370   e-100

>ref|XP_006349291.1| PREDICTED: uncharacterized protein LOC102579538 isoform X1 [Solanum
            tuberosum]
          Length = 427

 Score =  428 bits (1100), Expect = e-117
 Identities = 223/430 (51%), Positives = 280/430 (65%), Gaps = 22/430 (5%)
 Frame = +2

Query: 260  MKSYDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEI 439
            MKS++ VS+  D ++RSCI SLF+T++LI   +F  SA ++KD    +   G  +N+T+ 
Sbjct: 1    MKSWNIVSVP-DAKSRSCICSLFLTLALICAVYFTGSALMAKD---FRAFSGFTMNSTKQ 56

Query: 440  YTENVKCEEPTVLNIRVDKAE----DETQLEKCNDECRPVGTEALPKGIISSTSNLEMYP 607
              +  KC+ P     R +K E    +  Q  KC  +CRP+G+EALP+GIIS TSNLEM P
Sbjct: 57   NGQCGKCKVPPP---REEKQESHVTENVQNNKCQKKCRPLGSEALPEGIISKTSNLEMRP 113

Query: 608  LSGPVSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELE 787
            L G V  +  S H  +LL +AVG KQK  VN+IVKKFL+ DFVVMLFHYDG+VD+W +LE
Sbjct: 114  LWGDV--EKKSPHSVNLLGIAVGIKQKEMVNKIVKKFLEHDFVVMLFHYDGVVDEWNDLE 171

Query: 788  WSDSVIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGF 967
            WS+  IHVSA NQTKWWFAKRFLHPDIV+EYDYIFLWDEDLGVENFHP +Y+SI++EEG 
Sbjct: 172  WSNRAIHVSAMNQTKWWFAKRFLHPDIVSEYDYIFLWDEDLGVENFHPEKYISIVREEGL 231

Query: 968  EISQPALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAP 1147
            EISQP LD   SE                 +Y+ + GG   CD NST PPCVGWVEMMAP
Sbjct: 232  EISQPGLDASKSEVHHHITVRRGRSKVHRRFYRLNRGGR-TCDNNSTEPPCVGWVEMMAP 290

Query: 1148 VFSRAAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVS 1273
            VFS+AAWRCAWYM+QNDLIHAW                  G VD EY+ HLA+P+LGG S
Sbjct: 291  VFSKAAWRCAWYMVQNDLIHAWGLDMKLGYCAQGDRTKKVGVVDAEYITHLAVPSLGGNS 350

Query: 1274 EKTKTXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVEN 1453
            +  +T                          K +NR  +R++SY EM+VF  RW+KA++ 
Sbjct: 351  D-VETVIKELDNNSLQGKNLSDSDTLAAPVEKFDNRSLVRRQSYIEMKVFRERWRKAIKQ 409

Query: 1454 DQCWVDPFKT 1483
            DQCWVDPF++
Sbjct: 410  DQCWVDPFQS 419


>ref|XP_006349292.1| PREDICTED: uncharacterized protein LOC102579538 isoform X2 [Solanum
            tuberosum]
          Length = 426

 Score =  427 bits (1098), Expect = e-117
 Identities = 223/430 (51%), Positives = 280/430 (65%), Gaps = 22/430 (5%)
 Frame = +2

Query: 260  MKSYDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEI 439
            MKS++ VS+  D ++RSCI SLF+T++LI   +F  SA ++KD    +   G  +N+T+ 
Sbjct: 1    MKSWNIVSVP-DAKSRSCICSLFLTLALICAVYFTGSALMAKD---FRAFSGFTMNSTKQ 56

Query: 440  YTENVKCEEPTVLNIRVDKAE----DETQLEKCNDECRPVGTEALPKGIISSTSNLEMYP 607
              +  KC+ P     R +K E    +  Q  KC  +CRP+G+EALP+GIIS TSNLEM P
Sbjct: 57   NGQCGKCKVPPP---REEKQESHVTENVQNNKCQKKCRPLGSEALPEGIISKTSNLEMRP 113

Query: 608  LSGPVSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELE 787
            L G V +   S H  +LL +AVG KQK  VN+IVKKFL+ DFVVMLFHYDG+VD+W +LE
Sbjct: 114  LWGDVEK---SPHSVNLLGIAVGIKQKEMVNKIVKKFLEHDFVVMLFHYDGVVDEWNDLE 170

Query: 788  WSDSVIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGF 967
            WS+  IHVSA NQTKWWFAKRFLHPDIV+EYDYIFLWDEDLGVENFHP +Y+SI++EEG 
Sbjct: 171  WSNRAIHVSAMNQTKWWFAKRFLHPDIVSEYDYIFLWDEDLGVENFHPEKYISIVREEGL 230

Query: 968  EISQPALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAP 1147
            EISQP LD   SE                 +Y+ + GG   CD NST PPCVGWVEMMAP
Sbjct: 231  EISQPGLDASKSEVHHHITVRRGRSKVHRRFYRLNRGGR-TCDNNSTEPPCVGWVEMMAP 289

Query: 1148 VFSRAAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVS 1273
            VFS+AAWRCAWYM+QNDLIHAW                  G VD EY+ HLA+P+LGG S
Sbjct: 290  VFSKAAWRCAWYMVQNDLIHAWGLDMKLGYCAQGDRTKKVGVVDAEYITHLAVPSLGGNS 349

Query: 1274 EKTKTXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVEN 1453
            +  +T                          K +NR  +R++SY EM+VF  RW+KA++ 
Sbjct: 350  D-VETVIKELDNNSLQGKNLSDSDTLAAPVEKFDNRSLVRRQSYIEMKVFRERWRKAIKQ 408

Query: 1454 DQCWVDPFKT 1483
            DQCWVDPF++
Sbjct: 409  DQCWVDPFQS 418


>ref|XP_004230425.1| PREDICTED: uncharacterized protein LOC101259678 [Solanum
            lycopersicum]
          Length = 428

 Score =  424 bits (1089), Expect = e-116
 Identities = 216/426 (50%), Positives = 275/426 (64%), Gaps = 18/426 (4%)
 Frame = +2

Query: 260  MKSYDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEI 439
            MKS+++VS+  D ++RS I SLF+T++LI   +F  SA ++KD    +   G  +N+T+ 
Sbjct: 1    MKSWNTVSVP-DPKSRSFICSLFLTLALICAVYFTGSALMAKD---FRAFSGFTINSTKQ 56

Query: 440  YTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLSGP 619
              +  KCE P     +     +  Q  KC  +CRP+G+EALP+GI+S TSNLEM PL G 
Sbjct: 57   NGQCGKCEVPPREEKQESHVTENVQNNKCQKKCRPLGSEALPEGIVSKTSNLEMRPLWGD 116

Query: 620  VSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDS 799
            V  +  S H  +LL +AVG KQK  VN+IVK+FL+ DFVVMLFHYDG+VD+W +LEWS+ 
Sbjct: 117  V--EKKSPHSVNLLGIAVGIKQKELVNKIVKRFLEHDFVVMLFHYDGVVDEWNDLEWSNR 174

Query: 800  VIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQ 979
             IHVSA NQTKWWFAKRFLHPDIV+EYDYIFLWDEDLGVENFHP +Y+SI++EEG EISQ
Sbjct: 175  AIHVSAMNQTKWWFAKRFLHPDIVSEYDYIFLWDEDLGVENFHPEKYISIVREEGLEISQ 234

Query: 980  PALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVFSR 1159
            P LD   SE                 +Y+ + GG   CD NST PPCVGWVEMMAPVFS+
Sbjct: 235  PGLDASKSEVHHHITVRRGRSKVHRRFYRLNRGGR-TCDNNSTEPPCVGWVEMMAPVFSK 293

Query: 1160 AAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVSEKTK 1285
            AAWRCAWYM+QNDLIHAW                  G VD EY+ HLA+P+LG  S+  +
Sbjct: 294  AAWRCAWYMVQNDLIHAWGLDMKLGYCAQGDRTKKVGVVDAEYITHLAIPSLGANSD-VE 352

Query: 1286 TXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQCW 1465
            T                          K +NR  +R++SY EM++F  RW KA++ DQCW
Sbjct: 353  TVIKELDNNSPQGKNLSDSDTLAAPVEKFDNRSLVRRQSYIEMKIFRERWGKAIKQDQCW 412

Query: 1466 VDPFKT 1483
            VDPF++
Sbjct: 413  VDPFQS 418


>gb|EYU35025.1| hypothetical protein MIMGU_mgv1a021997mg [Mimulus guttatus]
          Length = 339

 Score =  407 bits (1047), Expect = e-111
 Identities = 200/335 (59%), Positives = 229/335 (68%), Gaps = 18/335 (5%)
 Frame = +2

Query: 530  DECRPVGTEALPKGIISSTSNLEMYPLSGPVSEDSNSKHKSSLLVMAVGFKQKGFVNEIV 709
            ++CRP+G+EALPKGI+S+TSNL+M+PLSGP+ ED+NSKH +SLL +AVG  QK  VNEIV
Sbjct: 35   EKCRPIGSEALPKGIVSATSNLKMHPLSGPIPEDNNSKHTTSLLALAVGINQKQLVNEIV 94

Query: 710  KKFLQEDFVVMLFHYDGIVDQWRELEWSDSVIHVSASNQTKWWFAKRFLHPDIVAEYDYI 889
            KKF+ + F +M FHYD  VD+W E EW DSVIHVSA NQTKWWFAKRFLHPDIVAEY+YI
Sbjct: 95   KKFMGKKFAIMFFHYDDHVDEWHEFEWCDSVIHVSAVNQTKWWFAKRFLHPDIVAEYEYI 154

Query: 890  FLWDEDLGVENFHPGRYLSIIKEEGFEISQPALDPHNSEXXXXXXXXXXXXXXXXXYYKF 1069
            FLWDEDLGV+NFHP RYLSIIKEEG EISQP LD  +SE                 YYK 
Sbjct: 155  FLWDEDLGVDNFHPERYLSIIKEEGLEISQPGLDRSSSEIHHQITVRGRRSRVHRRYYK- 213

Query: 1070 SSGGSGRCDGNSTAPPCVGWVEMMAPVFSRAAWRCAWYMIQNDLIHAW------------ 1213
                 G+CD NST+PPCVGWVEMMAPVFSRAAWRCAWYMIQNDLIHAW            
Sbjct: 214  ----PGKCDNNSTSPPCVGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQG 269

Query: 1214 ------GXVDQEYLIHLALPTLGGVSEKTKTXXXXXXXXXXXXXXXXXXXXXXXXXYKLN 1375
                  G VD EY++HL LPTLGG SEK K                              
Sbjct: 270  DRTLKVGVVDHEYIVHLGLPTLGGASEKNK------------------------------ 299

Query: 1376 NRFAIRQRSYAEMRVFNNRWKKAVENDQCWVDPFK 1480
                IRQRSY EMRVF +RW KAV++D+CW+DPF+
Sbjct: 300  ----IRQRSYEEMRVFKSRWSKAVKDDKCWIDPFE 330


>gb|EYU33290.1| hypothetical protein MIMGU_mgv1a022895mg [Mimulus guttatus]
          Length = 336

 Score =  407 bits (1047), Expect = e-111
 Identities = 204/380 (53%), Positives = 240/380 (63%), Gaps = 18/380 (4%)
 Frame = +2

Query: 395  TMQLSRGIEVNNTEIYTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGI 574
            TMQLS G  +N T+ +T+  KCE+                      +CRPVG+EALP GI
Sbjct: 7    TMQLSPGFRMNTTQQHTDTYKCEQ----------------------KCRPVGSEALPNGI 44

Query: 575  ISSTSNLEMYPLSGPVSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHY 754
            IS  +N+EM PL GPVSED+  KH + LL +AVG  QK  VN+IVKKFL+ DFVVMLFHY
Sbjct: 45   ISIHANMEMRPLWGPVSEDNKPKHGTGLLAVAVGINQKELVNKIVKKFLENDFVVMLFHY 104

Query: 755  DGIVDQWRELEWSDSVIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPG 934
            DG VD+W + +WS+ V+H+S  NQTKWWFAKRFLHPDIVAEY+YIFLWDEDLGVE+FHP 
Sbjct: 105  DGFVDKWHDFDWSNRVLHISVKNQTKWWFAKRFLHPDIVAEYEYIFLWDEDLGVEDFHPK 164

Query: 935  RYLSIIKEEGFEISQPALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAP 1114
            RY+SI+KEEG EISQPALDP  SE                 YYKF   GSGRCD  ST+P
Sbjct: 165  RYISIVKEEGLEISQPALDPGKSEVHHPITARRHKSRVHRRYYKFK--GSGRCDEKSTSP 222

Query: 1115 PCVGWVEMMAPVFSRAAWRCAWYMIQNDLIHAW------------------GXVDQEYLI 1240
            PCVGWVEMMAPVFSRAAWRC WYMIQNDLIHAW                  G VD+EY++
Sbjct: 223  PCVGWVEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLGYCAQGDRTVKIGVVDEEYIV 282

Query: 1241 HLALPTLGGVSEKTKTXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRV 1420
            HL LPTLG  S++ K                                  +R++SY EM +
Sbjct: 283  HLGLPTLGVFSDRNK----------------------------------VRRQSYTEMGI 308

Query: 1421 FNNRWKKAVENDQCWVDPFK 1480
            F NRW KAV  D+CW+DPFK
Sbjct: 309  FRNRWAKAVNEDECWIDPFK 328


>ref|XP_006470988.1| PREDICTED: uncharacterized protein LOC102624954 [Citrus sinensis]
          Length = 407

 Score =  405 bits (1040), Expect = e-110
 Identities = 210/424 (49%), Positives = 265/424 (62%), Gaps = 18/424 (4%)
 Frame = +2

Query: 260  MKSYDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEI 439
            MK+ +S+S+ SD  +RSC+ SLFI  +LI + +FI S++++K++    +  G+  +    
Sbjct: 2    MKATNSISVLSDPPSRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSA 61

Query: 440  YTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLSGP 619
              E  K                       N +CR  GTEALP+GI+S TSNLEM PL   
Sbjct: 62   KPETCK-----------------------NQQCRLPGTEALPEGIVSKTSNLEMRPLWSS 98

Query: 620  VSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDS 799
             S+ +N +   +LL +A G KQK  V++IV+KF  +DFVVMLFHYDG+VD+W++L W+D 
Sbjct: 99   PSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADR 158

Query: 800  VIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQ 979
             IHVSA+NQTKWWFAKRFLHPDIVAEY+YIFLWDED+GVENF+P RYLSI+K+EG EISQ
Sbjct: 159  AIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQ 218

Query: 980  PALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVFSR 1159
            PALDP  SE                  YK+   GSGRCD  STAPPC+GWVEMMAPVFSR
Sbjct: 219  PALDPVKSEVHHPITARRRNSKAHRRMYKYK--GSGRCDDYSTAPPCIGWVEMMAPVFSR 276

Query: 1160 AAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVSEKTK 1285
            AAWRCAWYMIQNDLIHAW                  G VD EY++HL LPTLG  +E   
Sbjct: 277  AAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPEL 336

Query: 1286 TXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQCW 1465
                                      Y  +NR  +R++SY EM++F NRWK AVE+D+CW
Sbjct: 337  NTVGQASDDLEQIANPVALAPSQSRRY--DNRPEVRRQSYIEMQIFRNRWKHAVEDDKCW 394

Query: 1466 VDPF 1477
            VDP+
Sbjct: 395  VDPY 398


>ref|XP_006431494.1| hypothetical protein CICLE_v10001347mg [Citrus clementina]
            gi|557533616|gb|ESR44734.1| hypothetical protein
            CICLE_v10001347mg [Citrus clementina]
          Length = 407

 Score =  405 bits (1040), Expect = e-110
 Identities = 210/424 (49%), Positives = 265/424 (62%), Gaps = 18/424 (4%)
 Frame = +2

Query: 260  MKSYDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEI 439
            MK+ +S+S+ SD  +RSC+ SLFI  +LI + +FI S++++K++    +  G+  +    
Sbjct: 2    MKTTNSISVLSDPPSRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSA 61

Query: 440  YTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLSGP 619
              E  K                       N +CR  GTEALP+GI+S TSNLEM PL   
Sbjct: 62   KPETCK-----------------------NQQCRLPGTEALPEGIVSKTSNLEMRPLWSS 98

Query: 620  VSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDS 799
             S+ +N +   +LL +A G KQK  V++IV+KF  +DFVVMLFHYD +VD+W++L W+D 
Sbjct: 99   PSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDSVVDEWKDLVWADR 158

Query: 800  VIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQ 979
             IHVSA+NQTKWWFAKRFLHPDIVAEY+YIFLWDED+GVENF+P RYLSI+K+EGFEISQ
Sbjct: 159  AIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGFEISQ 218

Query: 980  PALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVFSR 1159
            PALDP  SE                  YK+   GSGRCD  STAPPC+GWVEMMAPVFSR
Sbjct: 219  PALDPVKSEVHHPITARRRNSKAHRRMYKYK--GSGRCDDYSTAPPCIGWVEMMAPVFSR 276

Query: 1160 AAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVSEKTK 1285
            AAWRCAWYMIQNDLIHAW                  G VD EY++HL LPTLG  +E   
Sbjct: 277  AAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPEL 336

Query: 1286 TXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQCW 1465
                                      Y  +NR  +R++SY EM++F NRWK AVE+D+CW
Sbjct: 337  NAVGQASDDLEQIANPVALAPSQSRRY--DNRPEVRRQSYIEMQIFRNRWKHAVEDDKCW 394

Query: 1466 VDPF 1477
            VDP+
Sbjct: 395  VDPY 398


>ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255698 [Vitis vinifera]
            gi|297739491|emb|CBI29673.3| unnamed protein product
            [Vitis vinifera]
          Length = 413

 Score =  402 bits (1034), Expect = e-109
 Identities = 208/429 (48%), Positives = 260/429 (60%), Gaps = 18/429 (4%)
 Frame = +2

Query: 260  MKSYDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEI 439
            MK+   +SL SD ++RS + SLFI   L    +FIAS +  KD                 
Sbjct: 1    MKTLSCISLPSDPKSRSYLCSLFIGACLFCGVYFIASEFTVKD----------------- 43

Query: 440  YTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLSGP 619
            Y +     + +V       +   TQ  KC ++CRP G+EALP+GI+  TSNLE+ PL G 
Sbjct: 44   YKDRSSRWQISVFQNAHSNSIQNTQSSKCKNQCRPSGSEALPEGIVVKTSNLEVQPLWGA 103

Query: 620  VSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDS 799
                  S    SLL MAVG KQK  VN+IV+KF+  +FVVMLFHYDG+VD+WRE  WSD 
Sbjct: 104  TLNGEKSSPSKSLLAMAVGIKQKEIVNQIVEKFILSNFVVMLFHYDGVVDEWREFAWSDH 163

Query: 800  VIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQ 979
             IHV+  NQTKWWFAKRFLHPDIVAEY+YIFLWDEDLGVENFHPGRY+SI+++EG EISQ
Sbjct: 164  AIHVTVVNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFHPGRYVSIVEDEGLEISQ 223

Query: 980  PALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVFSR 1159
            PALDP  S                   YK    GSGRCD  STAPPCVGWVEMMAPVFS+
Sbjct: 224  PALDPKKSRVHHQITARVRNSRVHRRTYKHR--GSGRCDDQSTAPPCVGWVEMMAPVFSK 281

Query: 1160 AAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVSEKTK 1285
            AAWRC W+MIQN+LIHAW                  G VD EY++HLALPTL GV ++ +
Sbjct: 282  AAWRCVWHMIQNELIHAWGVDMQLGYCAQGDRTKNVGVVDSEYVVHLALPTL-GVLDENE 340

Query: 1286 TXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQCW 1465
                                      +K++NR A+R++S+ EM++F +RW  AV+ D+CW
Sbjct: 341  LRGEGHDHSSLREKLPKSVALAQSEFHKVDNRSAVRRQSFIEMQIFRSRWANAVKEDKCW 400

Query: 1466 VDPFKTVAD 1492
            +DP+   A+
Sbjct: 401  IDPYAQPAE 409


>ref|XP_007041142.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508705077|gb|EOX96973.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 405

 Score =  402 bits (1033), Expect = e-109
 Identities = 209/425 (49%), Positives = 263/425 (61%), Gaps = 18/425 (4%)
 Frame = +2

Query: 260  MKSYDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEI 439
            MK+++  S+ SD +TRSC+  LF+  SLI   +FI+ A+++K+    +LSR   +N  + 
Sbjct: 1    MKAFNCASVVSDPKTRSCLCRLFVVASLICGAYFISGAFIAKEYKD-RLSRWEVINMLQN 59

Query: 440  YTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLSGP 619
               N+                       C   CRP G+EALP+GI+  TSNLEM PL   
Sbjct: 60   SKSNI-----------------------CKIRCRPPGSEALPQGIVVKTSNLEMRPLWSD 96

Query: 620  VSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDS 799
              ++ N +  S+LL +AVG KQK  VN+I+KKF   DFVVMLFHYDGIVD+WR+LEWSD 
Sbjct: 97   TVKNGNLEPSSNLLAIAVGIKQKEIVNQIIKKFPSSDFVVMLFHYDGIVDEWRDLEWSDH 156

Query: 800  VIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQ 979
             IHVSA NQTKWWFAKRFLHPDIVA+Y Y+FLWDEDLGV+NF P +YLSI+++EG EISQ
Sbjct: 157  AIHVSAVNQTKWWFAKRFLHPDIVADYKYLFLWDEDLGVDNFDPKQYLSIVEDEGLEISQ 216

Query: 980  PALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVFSR 1159
            PALDP  SE                  YKF   GSGRCDG STAPPC+GWVEMMAPVFSR
Sbjct: 217  PALDPVKSEVHHQITARRRNSRVHRRMYKFK--GSGRCDGRSTAPPCIGWVEMMAPVFSR 274

Query: 1160 AAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVSEKTK 1285
            AAWRCAWYMIQNDLIHAW                  G VD EY++HL L TLG ++E   
Sbjct: 275  AAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAE--N 332

Query: 1286 TXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQCW 1465
                                      +K++NR  +R++S+ EM++F  RW+ AV  D+CW
Sbjct: 333  ELNSTRVNITRRQPSSDSETLAPSESHKVDNRPEVRRQSFIEMQMFRKRWENAVNQDKCW 392

Query: 1466 VDPFK 1480
            VDP++
Sbjct: 393  VDPYQ 397


>ref|XP_007041143.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508705078|gb|EOX96974.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 416

 Score =  394 bits (1011), Expect = e-107
 Identities = 209/436 (47%), Positives = 263/436 (60%), Gaps = 29/436 (6%)
 Frame = +2

Query: 260  MKSYDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEI 439
            MK+++  S+ SD +TRSC+  LF+  SLI   +FI+ A+++K+    +LSR   +N  + 
Sbjct: 1    MKAFNCASVVSDPKTRSCLCRLFVVASLICGAYFISGAFIAKEYKD-RLSRWEVINMLQN 59

Query: 440  YTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLSGP 619
               N+                       C   CRP G+EALP+GI+  TSNLEM PL   
Sbjct: 60   SKSNI-----------------------CKIRCRPPGSEALPQGIVVKTSNLEMRPLWSD 96

Query: 620  VSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDS 799
              ++ N +  S+LL +AVG KQK  VN+I+KKF   DFVVMLFHYDGIVD+WR+LEWSD 
Sbjct: 97   TVKNGNLEPSSNLLAIAVGIKQKEIVNQIIKKFPSSDFVVMLFHYDGIVDEWRDLEWSDH 156

Query: 800  VIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQ 979
             IHVSA NQTKWWFAKRFLHPDIVA+Y Y+FLWDEDLGV+NF P +YLSI+++EG EISQ
Sbjct: 157  AIHVSAVNQTKWWFAKRFLHPDIVADYKYLFLWDEDLGVDNFDPKQYLSIVEDEGLEISQ 216

Query: 980  PALDPHNSE-----------XXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVG 1126
            PALDP  SE                             YKF   GSGRCDG STAPPC+G
Sbjct: 217  PALDPVKSEVHHQITARRRNSRVHSYDTINPSRLNRRMYKFK--GSGRCDGRSTAPPCIG 274

Query: 1127 WVEMMAPVFSRAAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLAL 1252
            WVEMMAPVFSRAAWRCAWYMIQNDLIHAW                  G VD EY++HL L
Sbjct: 275  WVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRMKNVGVVDAEYIVHLGL 334

Query: 1253 PTLGGVSEKTKTXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNR 1432
             TLG ++E                             +K++NR  +R++S+ EM++F  R
Sbjct: 335  STLGVLAE--NELNSTRVNITRRQPSSDSETLAPSESHKVDNRPEVRRQSFIEMQMFRKR 392

Query: 1433 WKKAVENDQCWVDPFK 1480
            W+ AV  D+CWVDP++
Sbjct: 393  WENAVNQDKCWVDPYQ 408


>ref|XP_007222756.1| hypothetical protein PRUPE_ppa006529mg [Prunus persica]
            gi|462419692|gb|EMJ23955.1| hypothetical protein
            PRUPE_ppa006529mg [Prunus persica]
          Length = 407

 Score =  388 bits (996), Expect = e-105
 Identities = 205/430 (47%), Positives = 255/430 (59%), Gaps = 22/430 (5%)
 Frame = +2

Query: 269  YDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEIYTE 448
            ++  S   D + RS   SLFI  SLI   +FI  A ++K+                 Y E
Sbjct: 5    FNPASALPDPKNRSFYCSLFIVASLICGAYFIGGASIAKE-----------------YKE 47

Query: 449  NVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLSGPVSE 628
             +          +V      T+ + C + C+P+G+EALP+GI++ TS+LE+ PL G    
Sbjct: 48   RLT-------RWKVIYTRQNTKFDTCKNRCQPLGSEALPEGIVAKTSDLEVRPLWGSSVN 100

Query: 629  DSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDSVIH 808
            + NSK   SLL +AVG KQK  V+ IVKKFL  DFVVMLFHYDG VD+WR+L WSD  IH
Sbjct: 101  NENSKPSMSLLAIAVGIKQKEIVDRIVKKFLSSDFVVMLFHYDGAVDKWRDLNWSDRAIH 160

Query: 809  VSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQPAL 988
            VS  NQTKWWFAKRFLHPDIV+EY+YIFLWDEDLGVENF P RYLSI++EEG EISQPAL
Sbjct: 161  VSVMNQTKWWFAKRFLHPDIVSEYEYIFLWDEDLGVENFDPKRYLSIVREEGLEISQPAL 220

Query: 989  DPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVFSRAAW 1168
            DP  S+                 +YKF   GSGRCD +S+APPC GWVEMMAPVFS+AAW
Sbjct: 221  DPDKSDVYHPITARVKKLKVHRRFYKFK--GSGRCDNHSSAPPCAGWVEMMAPVFSKAAW 278

Query: 1169 RCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVSEKTKT-- 1288
            +C WYMIQNDLIHAW                  G VD EY++HL LPTL GVS+  K   
Sbjct: 279  QCVWYMIQNDLIHAWGLDVQLGYCAQGDRTKNVGVVDSEYIVHLGLPTL-GVSDGNKAIM 337

Query: 1289 --XXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQC 1462
                                        K+N+R  +R +S+ +M++F  RW  AV+ D+C
Sbjct: 338  LKTRLDFYCLSPIHLSLCNIISAPSASDKVNDRAKVRMQSFIDMQIFKERWSNAVKEDKC 397

Query: 1463 WVDPFKTVAD 1492
            WVDPF+  A+
Sbjct: 398  WVDPFQLSAN 407


>ref|XP_004301559.1| PREDICTED: uncharacterized protein LOC101306243 [Fragaria vesca
            subsp. vesca]
          Length = 397

 Score =  381 bits (979), Expect = e-103
 Identities = 200/418 (47%), Positives = 248/418 (59%), Gaps = 18/418 (4%)
 Frame = +2

Query: 278  VSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEIYTENVK 457
            VS+ SD + RS   SLFI +SL++  +FI  A ++K+                 Y E + 
Sbjct: 12   VSVLSDPKNRSFYCSLFIVVSLVTGAYFIGGASIAKE-----------------YKEKLT 54

Query: 458  CEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLSGPVSEDSN 637
                     +V      T L+ C   C+P GTEALP+GI++ TS+ ++ PL G   +D N
Sbjct: 55   -------RWKVTYTMQNTNLDTCKKRCQPSGTEALPEGIVAKTSDFKIRPLWGTSKKDKN 107

Query: 638  SKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDSVIHVSA 817
            S    SLL +AVG KQK  V++IV+KFL  DFVVMLFHYDG VD+WR+L WSD+ IHVS 
Sbjct: 108  STPSKSLLAIAVGIKQKEIVDKIVRKFLSSDFVVMLFHYDGAVDKWRDLHWSDTAIHVSV 167

Query: 818  SNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQPALDPH 997
             NQTKWWFAKRFLHPDIV EY +IFLWDEDLGVENF P RYLS+I +EG EISQPALDP 
Sbjct: 168  MNQTKWWFAKRFLHPDIVTEYKHIFLWDEDLGVENFDPERYLSVIWDEGLEISQPALDPV 227

Query: 998  NSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVFSRAAWRCA 1177
             SE                 +YKF   GSGRCD  S+ PPC+GWVEMMAPVFSRAAWRC 
Sbjct: 228  KSEVYHPITARVKKSKVHRRFYKFK--GSGRCDDQSSGPPCIGWVEMMAPVFSRAAWRCV 285

Query: 1178 WYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVSEKTKTXXXXX 1303
            WYMIQNDL+HAW                  G VD EY++HL LPTL GV++  K      
Sbjct: 286  WYMIQNDLVHAWGLDEQLGYCAQGDRMKNVGVVDSEYIVHLGLPTL-GVTDDNKGINNMV 344

Query: 1304 XXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQCWVDPF 1477
                                   ++R  +R +S+ +MR+F  RW+ AV+ D CWVDP+
Sbjct: 345  HSQKEDSKALAPSGPPIP-----SDRAKVRMQSFIDMRIFKERWRSAVKEDNCWVDPY 397


>ref|XP_006431495.1| hypothetical protein CICLE_v10001347mg [Citrus clementina]
            gi|557533617|gb|ESR44735.1| hypothetical protein
            CICLE_v10001347mg [Citrus clementina]
          Length = 358

 Score =  380 bits (977), Expect = e-103
 Identities = 190/339 (56%), Positives = 227/339 (66%), Gaps = 19/339 (5%)
 Frame = +2

Query: 518  EKC-NDECRPVGTEALPKGIISSTSNLEMYPLSGPVSEDSNSKHKSSLLVMAVGFKQKGF 694
            E C N +CR  GTEALP+GI+S TSNLEM PL    S+ +N +   +LL +A G KQK  
Sbjct: 15   ETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKI 74

Query: 695  VNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDSVIHVSASNQTKWWFAKRFLHPDIVA 874
            V++IV+KF  +DFVVMLFHYD +VD+W++L W+D  IHVSA+NQTKWWFAKRFLHPDIVA
Sbjct: 75   VDQIVRKFPSKDFVVMLFHYDSVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVA 134

Query: 875  EYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQPALDPHNSEXXXXXXXXXXXXXXXX 1054
            EY+YIFLWDED+GVENF+P RYLSI+K+EGFEISQPALDP  SE                
Sbjct: 135  EYNYIFLWDEDIGVENFNPRRYLSIVKDEGFEISQPALDPVKSEVHHPITARRRNSKAHR 194

Query: 1055 XYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVFSRAAWRCAWYMIQNDLIHAW------- 1213
              YK+   GSGRCD  STAPPC+GWVEMMAPVFSRAAWRCAWYMIQNDLIHAW       
Sbjct: 195  RMYKYK--GSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLG 252

Query: 1214 -----------GXVDQEYLIHLALPTLGGVSEKTKTXXXXXXXXXXXXXXXXXXXXXXXX 1360
                       G VD EY++HL LPTLG  +E                            
Sbjct: 253  YCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNAVGQASDDLEQIANPVALAPSQSR 312

Query: 1361 XYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQCWVDPF 1477
             Y  +NR  +R++SY EM++F NRWK AVE+D+CWVDP+
Sbjct: 313  RY--DNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPY 349


>ref|XP_003548344.1| PREDICTED: uncharacterized protein LOC100797710 isoform X1 [Glycine
            max]
          Length = 385

 Score =  377 bits (968), Expect = e-102
 Identities = 200/427 (46%), Positives = 250/427 (58%), Gaps = 20/427 (4%)
 Frame = +2

Query: 260  MKSYDSVS--LSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNT 433
            MKS+DSV+  +  D + R  +WS+F+ +SLIS  +F+ +A+ +K+        G+     
Sbjct: 1    MKSFDSVTGFILPDPKNRLFLWSVFLVVSLISGAYFVGNAFFAKEYKQRLARWGL----- 55

Query: 434  EIYTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLS 613
             I+T                    +++   C  +C P G+EALP+GII+ TSNLEM PL 
Sbjct: 56   -IHTM------------------PDSKFNSCKRQCLPFGSEALPEGIIARTSNLEMRPLW 96

Query: 614  GPVSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWS 793
                ++   K   +LL MAVG +QK  VN+IV+KFL  DFVVMLFHYDG VD W+ L WS
Sbjct: 97   DSGKDNGILKRPLNLLAMAVGLEQKEIVNKIVEKFLSSDFVVMLFHYDGFVDGWKSLAWS 156

Query: 794  DSVIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEI 973
               IHVSA NQTKWWFAKRFLHPDIV EY+YIFLWDEDL V+NF P RYLSI+KEEG EI
Sbjct: 157  SRAIHVSAINQTKWWFAKRFLHPDIVVEYNYIFLWDEDLLVDNFDPKRYLSIVKEEGLEI 216

Query: 974  SQPALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVF 1153
            SQPALDP  SE                 YYK    GSGRCD  STAPPC+GWVEMMAPVF
Sbjct: 217  SQPALDPTKSEVHHPLTVHKAGSKVHRRYYKLK--GSGRCDDKSTAPPCIGWVEMMAPVF 274

Query: 1154 SRAAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVSEK 1279
            S+ +W+C W++IQNDLIHAW                  G VD EY++HL LPTLGG +  
Sbjct: 275  SKKSWQCVWHLIQNDLIHAWGLDRQLGYCAQGDRMQNVGVVDSEYIVHLGLPTLGGSNGN 334

Query: 1280 TKTXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQ 1459
                                           +NR  +R +SY EM+VF  RWK A E D+
Sbjct: 335  EAPSGSSG-----------------------DNRAKVRMQSYIEMQVFGKRWKDAAEKDK 371

Query: 1460 CWVDPFK 1480
            CW+DP++
Sbjct: 372  CWIDPYE 378


>ref|XP_002528866.1| conserved hypothetical protein [Ricinus communis]
            gi|223531717|gb|EEF33540.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 370

 Score =  377 bits (967), Expect = e-101
 Identities = 201/425 (47%), Positives = 249/425 (58%), Gaps = 18/425 (4%)
 Frame = +2

Query: 260  MKSYDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEI 439
            MKS    S S D ++RS + +LF+  SLI + +FI  +++ K+                 
Sbjct: 1    MKSLYCASASPDPKSRSYLCTLFVVASLICSAYFIGGSFIGKE----------------- 43

Query: 440  YTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLSGP 619
            Y E +          +V +    T+   C D+C+P GT+ALP+GI+  TS+ EM PL   
Sbjct: 44   YKERLA-------RWQVIETVQSTKSTNCEDQCKPTGTKALPQGIVRKTSDFEMRPLWNS 96

Query: 620  VSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDS 799
              ED+  K   SLL +AVG  QK  V++IVKKF   DFVVMLFHYDG+VD+WR+L WSD 
Sbjct: 97   SLEDNKQKLSKSLLALAVGINQKVVVDQIVKKFPLSDFVVMLFHYDGVVDKWRDLPWSDH 156

Query: 800  VIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQ 979
             IHVSA NQTKWWFAKRFLHPDIV+EYDY+FLWDEDLGVENF+P RYLSII++EG EISQ
Sbjct: 157  AIHVSAVNQTKWWFAKRFLHPDIVSEYDYLFLWDEDLGVENFNPKRYLSIIRDEGLEISQ 216

Query: 980  PALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVFSR 1159
            PALDP  S                   YKF   GSGRC GNST+PPC+GWVEMMAPVFS 
Sbjct: 217  PALDPTKSAVYHPITARQPKSTVHRRIYKFK--GSGRCYGNSTSPPCIGWVEMMAPVFST 274

Query: 1160 AAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVSEKTK 1285
            AAWRCAW+MIQNDLIHAW                  G VD EY++HL L TLG       
Sbjct: 275  AAWRCAWHMIQNDLIHAWGLDFQLGYCAQGDRTKNVGVVDSEYIVHLGLLTLG------- 327

Query: 1286 TXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQCW 1465
                                        + N   +R++S  EM++F +RWK A + D+CW
Sbjct: 328  ----------------------------VFNGTEVRKQSSVEMQIFLDRWKNAAKEDKCW 359

Query: 1466 VDPFK 1480
            VDPF+
Sbjct: 360  VDPFQ 364


>ref|XP_006598790.1| PREDICTED: uncharacterized protein LOC100797710 isoform X4 [Glycine
            max]
          Length = 387

 Score =  376 bits (966), Expect = e-101
 Identities = 200/429 (46%), Positives = 250/429 (58%), Gaps = 22/429 (5%)
 Frame = +2

Query: 260  MKSYDSVS----LSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVN 427
            MKS+DSV+    +  D + R  +WS+F+ +SLIS  +F+ +A+ +K+        G+   
Sbjct: 1    MKSFDSVTHQGFILPDPKNRLFLWSVFLVVSLISGAYFVGNAFFAKEYKQRLARWGL--- 57

Query: 428  NTEIYTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYP 607
               I+T                    +++   C  +C P G+EALP+GII+ TSNLEM P
Sbjct: 58   ---IHTM------------------PDSKFNSCKRQCLPFGSEALPEGIIARTSNLEMRP 96

Query: 608  LSGPVSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELE 787
            L     ++   K   +LL MAVG +QK  VN+IV+KFL  DFVVMLFHYDG VD W+ L 
Sbjct: 97   LWDSGKDNGILKRPLNLLAMAVGLEQKEIVNKIVEKFLSSDFVVMLFHYDGFVDGWKSLA 156

Query: 788  WSDSVIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGF 967
            WS   IHVSA NQTKWWFAKRFLHPDIV EY+YIFLWDEDL V+NF P RYLSI+KEEG 
Sbjct: 157  WSSRAIHVSAINQTKWWFAKRFLHPDIVVEYNYIFLWDEDLLVDNFDPKRYLSIVKEEGL 216

Query: 968  EISQPALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAP 1147
            EISQPALDP  SE                 YYK    GSGRCD  STAPPC+GWVEMMAP
Sbjct: 217  EISQPALDPTKSEVHHPLTVHKAGSKVHRRYYKLK--GSGRCDDKSTAPPCIGWVEMMAP 274

Query: 1148 VFSRAAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVS 1273
            VFS+ +W+C W++IQNDLIHAW                  G VD EY++HL LPTLGG +
Sbjct: 275  VFSKKSWQCVWHLIQNDLIHAWGLDRQLGYCAQGDRMQNVGVVDSEYIVHLGLPTLGGSN 334

Query: 1274 EKTKTXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVEN 1453
                                             +NR  +R +SY EM+VF  RWK A E 
Sbjct: 335  GNEAPSGSSG-----------------------DNRAKVRMQSYIEMQVFGKRWKDAAEK 371

Query: 1454 DQCWVDPFK 1480
            D+CW+DP++
Sbjct: 372  DKCWIDPYE 380


>ref|XP_007041144.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
            gi|508705079|gb|EOX96975.1| Uncharacterized protein
            isoform 3, partial [Theobroma cacao]
          Length = 438

 Score =  373 bits (958), Expect = e-100
 Identities = 193/363 (53%), Positives = 235/363 (64%), Gaps = 18/363 (4%)
 Frame = +2

Query: 242  FFRSCTMKSYDSVSLSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIE 421
            FF+   MK+++  S+ SD +TRSC+  LF+  SLI   +FI+ A+++K+    +LSR   
Sbjct: 77   FFQCKKMKAFNCASVVSDPKTRSCLCRLFVVASLICGAYFISGAFIAKEYKD-RLSRWEV 135

Query: 422  VNNTEIYTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEM 601
            +N  +    N+                       C   CRP G+EALP+GI+  TSNLEM
Sbjct: 136  INMLQNSKSNI-----------------------CKIRCRPPGSEALPQGIVVKTSNLEM 172

Query: 602  YPLSGPVSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRE 781
             PL     ++ N +  S+LL +AVG KQK  VN+I+KKF   DFVVMLFHYDGIVD+WR+
Sbjct: 173  RPLWSDTVKNGNLEPSSNLLAIAVGIKQKEIVNQIIKKFPSSDFVVMLFHYDGIVDEWRD 232

Query: 782  LEWSDSVIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEE 961
            LEWSD  IHVSA NQTKWWFAKRFLHPDIVA+Y Y+FLWDEDLGV+NF P +YLSI+++E
Sbjct: 233  LEWSDHAIHVSAVNQTKWWFAKRFLHPDIVADYKYLFLWDEDLGVDNFDPKQYLSIVEDE 292

Query: 962  GFEISQPALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMM 1141
            G EISQPALDP  SE                  YKF   GSGRCDG STAPPC+GWVEMM
Sbjct: 293  GLEISQPALDPVKSEVHHQITARRRNSRVHRRMYKFK--GSGRCDGRSTAPPCIGWVEMM 350

Query: 1142 APVFSRAAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGG 1267
            APVFSRAAWRCAWYMIQNDLIHAW                  G VD EY++HL L TLG 
Sbjct: 351  APVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRMKNVGVVDAEYIVHLGLSTLGV 410

Query: 1268 VSE 1276
            ++E
Sbjct: 411  LAE 413


>ref|XP_006592649.1| PREDICTED: uncharacterized protein LOC100526994 isoform X1 [Glycine
            max]
          Length = 385

 Score =  371 bits (953), Expect = e-100
 Identities = 200/427 (46%), Positives = 247/427 (57%), Gaps = 20/427 (4%)
 Frame = +2

Query: 260  MKSYDSVS--LSSDHQTRSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNT 433
            MKS+ SV+  +  D + R  +WS+ I +SLIS  +F+ +A+ +K+        G+     
Sbjct: 1    MKSFGSVTGFVLPDPKNRLLLWSVLILVSLISGAYFVGNAFFAKEYKQRLARWGL----- 55

Query: 434  EIYTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPLS 613
             I+T                     ++   C  +C P G+EALP+GII+ TSNLEM PL 
Sbjct: 56   -IHTM------------------PHSKFNACKRQCLPFGSEALPEGIIARTSNLEMRPLW 96

Query: 614  GPVSEDSNSKHKSSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWS 793
                ++   K   +LL MAVG KQK  VN+IV+KFL   FVVMLFHYDG VD W+ L WS
Sbjct: 97   DSGKDNRILKRPLNLLAMAVGLKQKEIVNKIVEKFLSSGFVVMLFHYDGFVDGWKSLAWS 156

Query: 794  DSVIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEI 973
               IHVSA NQTKWWFAKRFLHPDIVAEY+YIFLWDEDL V+NF P RYLSI+KEEG EI
Sbjct: 157  SCAIHVSAINQTKWWFAKRFLHPDIVAEYNYIFLWDEDLLVDNFDPKRYLSIVKEEGLEI 216

Query: 974  SQPALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVF 1153
            SQPALDP  SE                 YYK    GSGRCD  STAPPC+GWVEMMAPVF
Sbjct: 217  SQPALDPTKSEVHHPLTVHKAVSKVHRRYYKLK--GSGRCDDKSTAPPCIGWVEMMAPVF 274

Query: 1154 SRAAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPTLGGVSEK 1279
            S+ +W+C W++IQNDLIHAW                  G VD EY++HL LPTLGG +  
Sbjct: 275  SKKSWQCVWHLIQNDLIHAWGLDRQLGYCAQGDRMRNVGVVDSEYIVHLGLPTLGGSNGN 334

Query: 1280 TKTXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQ 1459
                                           +NR  +R +SY EM+VF  RWK A E D+
Sbjct: 335  EAPSDSPG-----------------------DNRAKVRMQSYIEMQVFGKRWKDAAEKDK 371

Query: 1460 CWVDPFK 1480
            CW+DP++
Sbjct: 372  CWIDPYE 378


>ref|XP_006589225.1| PREDICTED: uncharacterized protein LOC100807140 isoform X1 [Glycine
            max]
          Length = 389

 Score =  370 bits (951), Expect = e-100
 Identities = 201/434 (46%), Positives = 264/434 (60%), Gaps = 27/434 (6%)
 Frame = +2

Query: 260  MKSYDSVSLSSDHQTR-SCIWSLFITMSLI-STTFFIASAWLSKDSLTMQLSRGIEVNNT 433
            MKS       +D ++R SC++++F   SLI +  FF+ S++  +  L      G++++  
Sbjct: 1    MKSPSRAFSGADTKSRKSCLYAIFPAASLICAVLFFMTSSFTHEPKLP-----GLKMDTA 55

Query: 434  EIYTENVKCEEPTVLNIRVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPL- 610
                +NV  +               T ++KC ++CRP G+EALP GI+S+TS+LE+ PL 
Sbjct: 56   VDADQNVIVD---------------TVIDKCKNQCRPNGSEALPAGIVSTTSSLELRPLW 100

Query: 611  SGPVSEDSNSKHK------SSLLVMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQ 772
            + PV++  + K +      ++L  MAVG KQK  V+++VKKF+  +FVVMLFHYDGIVD+
Sbjct: 101  NPPVTKKGHHKIELKVNASTNLFAMAVGIKQKDLVSKMVKKFIDSNFVVMLFHYDGIVDE 160

Query: 773  WRELEWSDSVIHVSASNQTKWWFAKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSII 952
            W++LEWS  VIHVSA +Q+KWWFAKRFLHPDIV EYDYIFLWDEDLGVE+FHP +Y+SII
Sbjct: 161  WKDLEWSSLVIHVSAIDQSKWWFAKRFLHPDIVTEYDYIFLWDEDLGVEHFHPDKYVSII 220

Query: 953  KEEGFEISQPALDPHNSEXXXXXXXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWV 1132
            K EG EISQPALDP  SE                  YK S+ G G CD +STAPPC GW+
Sbjct: 221  KREGLEISQPALDPKKSEVHHQITARGRRSSVHRRTYKASNDGKG-CDKSSTAPPCTGWI 279

Query: 1133 EMMAPVFSRAAWRCAWYMIQNDLIHAW------------------GXVDQEYLIHLALPT 1258
            EMMAPVFSRAAWRC WYMIQNDLIHAW                  G VD EY++H   PT
Sbjct: 280  EMMAPVFSRAAWRCVWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDAEYIVHYNRPT 339

Query: 1259 LGGVSEKTKTXXXXXXXXXXXXXXXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWK 1438
            LGG+     +                          + ++R  +R+ SY E+ VF  RW+
Sbjct: 340  LGGIDNTMVS------------------------SQEKDHRVDVRRLSYQELDVFRKRWE 375

Query: 1439 KAVENDQCWVDPFK 1480
            KAVE D+CWVDPF+
Sbjct: 376  KAVEEDKCWVDPFQ 389


>ref|NP_193588.5| uncharacterized protein [Arabidopsis thaliana]
            gi|332658658|gb|AEE84058.1| uncharacterized protein
            AT4G18530 [Arabidopsis thaliana]
          Length = 389

 Score =  370 bits (951), Expect = e-100
 Identities = 197/410 (48%), Positives = 242/410 (59%), Gaps = 19/410 (4%)
 Frame = +2

Query: 305  RSCIWSLFITMSLISTTFFIASAWLSKDSLTMQLSRGIEVNNTEIYTENVKCEEPTVLNI 484
            RSC+ S+ IT +LI   +FI +A+L+KD     L                K E    ++ 
Sbjct: 8    RSCLCSVLITTALICGAYFICNAYLAKDFKEKLL----------------KWEITDKMHN 51

Query: 485  RVDKAEDETQLEKCNDECRPVGTEALPKGIISSTSNLEMYPL-SGPVSEDSNSKHKSSLL 661
              DK ++ T    C +  +PVGTEALP+GII  TSNLE   L +   ++     H  SLL
Sbjct: 52   STDKMQNATTTSTCKNFNKPVGTEALPQGIIEKTSNLETQHLWNYDDTKKRRPNHSMSLL 111

Query: 662  VMAVGFKQKGFVNEIVKKFLQEDFVVMLFHYDGIVDQWRELEWSDSVIHVSASNQTKWWF 841
             MAVG KQK  VN++++KF   DF VMLFHYDG+VD W++  W++  IHVS  NQTKWWF
Sbjct: 112  AMAVGIKQKELVNKVIQKFPPRDFAVMLFHYDGVVDDWKQYPWNNHAIHVSVMNQTKWWF 171

Query: 842  AKRFLHPDIVAEYDYIFLWDEDLGVENFHPGRYLSIIKEEGFEISQPALDPHNSEXXXXX 1021
            AKRFLHPDIVAEY+YIFLWDEDLGV +F+P RYLSI+KEEG EISQPALD   SE     
Sbjct: 172  AKRFLHPDIVAEYEYIFLWDEDLGVGHFNPQRYLSIVKEEGLEISQPALDTSKSEVHHPI 231

Query: 1022 XXXXXXXXXXXXYYKFSSGGSGRCDGNSTAPPCVGWVEMMAPVFSRAAWRCAWYMIQNDL 1201
                         YK+   GSGRCD +ST PPC+GWVEMMAPVFSRAAWRC+WYMIQNDL
Sbjct: 232  TARRKKSKVHRRMYKYK--GSGRCDDHSTNPPCIGWVEMMAPVFSRAAWRCSWYMIQNDL 289

Query: 1202 IHAW------------------GXVDQEYLIHLALPTLGGVSEKTKTXXXXXXXXXXXXX 1327
            IHAW                  G VD EY+IH  LPTLG V   +               
Sbjct: 290  IHAWGLDTQLGYCAQGDRKKNVGVVDAEYIIHYGLPTLGVVETASSALRNETDSKSTESL 349

Query: 1328 XXXXXXXXXXXXYKLNNRFAIRQRSYAEMRVFNNRWKKAVENDQCWVDPF 1477
                         +++NR  +R +S+ EM+ F  RWKKAV +D CWVDP+
Sbjct: 350  ESR----------EVDNRPEVRMKSFVEMKRFKERWKKAVRDDTCWVDPY 389

