BLASTX nr result

ID: Mentha22_contig00004040 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00004040
         (3721 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXC30509.1| hypothetical protein L484_010758 [Morus notabilis]      86   1e-13
ref|XP_006470479.1| PREDICTED: uncharacterized protein LOC102611...    80   5e-12
ref|XP_006495054.1| PREDICTED: uncharacterized protein LOC102626...    80   9e-12
ref|XP_007049260.1| Uncharacterized protein TCM_002293 [Theobrom...    79   2e-11
ref|XP_006482789.1| PREDICTED: uncharacterized protein LOC102618...    75   3e-10
ref|XP_007010437.1| Uncharacterized protein TCM_044253 [Theobrom...    75   3e-10
ref|XP_007009514.1| Uncharacterized protein TCM_042921 [Theobrom...    70   1e-08
ref|XP_006484773.1| PREDICTED: uncharacterized protein LOC102628...    69   2e-08
ref|XP_006478664.1| PREDICTED: uncharacterized protein LOC102607...    69   2e-08
ref|XP_006494018.1| PREDICTED: uncharacterized protein LOC102609...    68   4e-08
ref|XP_004167400.1| PREDICTED: uncharacterized protein LOC101226...    67   6e-08
ref|XP_007010629.1| Uncharacterized protein TCM_044555 [Theobrom...    67   8e-08
ref|XP_006396927.1| hypothetical protein EUTSA_v10029511mg [Eutr...    66   1e-07
ref|XP_006418926.1| hypothetical protein EUTSA_v100031140mg, par...    64   5e-07
ref|XP_006391727.1| hypothetical protein EUTSA_v100238800mg, par...    64   5e-07
ref|XP_007031827.1| Uncharacterized protein TCM_017149 [Theobrom...    64   5e-07
ref|XP_006401890.1| hypothetical protein EUTSA_v100159020mg, par...    63   1e-06
ref|XP_007014370.1| Uncharacterized protein TCM_039370 [Theobrom...    63   1e-06
ref|XP_004240477.1| PREDICTED: uncharacterized protein LOC101262...    62   2e-06
ref|XP_006364558.1| PREDICTED: uncharacterized protein LOC102586...    62   2e-06

>gb|EXC30509.1| hypothetical protein L484_010758 [Morus notabilis]
          Length = 698

 Score = 86.3 bits (212), Expect = 1e-13
 Identities = 86/358 (24%), Positives = 161/358 (44%), Gaps = 9/358 (2%)
 Frame = +2

Query: 560  PLPNKRGEATVCTTLIGIGVMSPRFEPDESYDDYEILPENEYGISFERIPRQILESRPPW 739
            P P KR         + + V+ P+ +P          P NE  +  +     ++  +  W
Sbjct: 54   PAPKKRK--------LDVAVLQPQPQPQP--------PTNEVDLEAQ----SLIVPKKQW 93

Query: 740  FRRPRTINRNMHPNHYYRPAILDDLTSVLSEDFENGELAYMFQEGPFGHFLGWSPGQNSR 919
                   N     N Y +  ++D L   L+   +      +F++G FGH L +   +   
Sbjct: 94   -------NTKQKINLYSKAKVVDILNEKLTARQKE-----LFRKGCFGHLLDFKIKKFPS 141

Query: 920  KLLNAVISRCI--FKEQGIWFHIRGRDVEYTLEDHALITGLSFGSTDF--DPRVARNPND 1087
            +L++ +I R     K+  +WF I G  V++ +++ ALITGL+  +  F  + ++  +   
Sbjct: 142  QLIHHLILRQCPQAKKNELWFDIEGAIVKFGMKEFALITGLNCSNYPFIFEKQLPESTTK 201

Query: 1088 SSLFQIVCGGLNESANTLCSVFQ-HKPLSSEVRVKLANLLTANCFVLGHDAGKTIFPWLW 1264
               F+    G +     L  VF+ ++  + E  VKLA L      ++       I P   
Sbjct: 202  RKFFR---KGKSVQRIKLNDVFRANRGGTDEDIVKLAKLYCLESLLIPKKIENNIDPNHL 258

Query: 1265 ELVNDTDAFYSFPWGAYSYK-TLLYYLRTFR-RTGQRYHIYGPSWALVIWAFEVMPGLGD 1438
            ++V++ + F ++PWG  SY+ T+ Y  R+ + +  + Y I G  +A+++WA+E +P L  
Sbjct: 259  KMVDNPELFDNYPWGRLSYEMTIAYIKRSIKSQEAEAYGIGGFPYAVIVWAYETIPTLIK 318

Query: 1439 SFGLQVSTDDAPRCLRW--NFLSTFRGVSDAQLYAQFDDDELEILMIMPEGQEFRQDY 1606
                +   +  PR + W  +   +FR ++D      FD  ELE+  I+P  +E  Q +
Sbjct: 319  KNIAKRIGNGIPRIINWEADQQPSFREITD----RVFDSLELEVRQIIPSKEEMEQPF 372


>ref|XP_006470479.1| PREDICTED: uncharacterized protein LOC102611939 [Citrus sinensis]
          Length = 401

 Score = 80.5 bits (197), Expect = 5e-12
 Identities = 80/331 (24%), Positives = 138/331 (41%), Gaps = 30/331 (9%)
 Frame = +2

Query: 809  DLTSVLSEDFENGELAYMFQEGPFGHFLGWSPGQNSRKLLNAVISRCIFKEQG-IWFHIR 985
            D+   LSE  +N     +F+   FGHFL     + S +LL+ ++ R +  ++  +WF I 
Sbjct: 29   DIKDKLSEAQKN-----IFRRSCFGHFLDVKELKFSAQLLHIILLREVKSDENTMWFRIG 83

Query: 986  GRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLF-QIVCGGLNESANTLCSVF-QH 1159
             +++ ++LE+ AL+TGL   S  ++P    N +D ++  + + G    +   L + F Q 
Sbjct: 84   RKNIRFSLEEFALVTGLDC-SPSYEPDTENNDDDYTIVDEFLDGNCAITTQELRTKFLQA 142

Query: 1160 KPLSSEVRVKLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYKTLLYY 1339
            K       VKLA L      +LG +    I      LV++   F  FPWG  S+K  +  
Sbjct: 143  KSTDDMKMVKLAMLYFVESVLLGKENRNHINETNVLLVDNFTEFNEFPWGRISFKMTIDS 202

Query: 1340 LR----------------TFRRTGQRYHIYGPSWALVIWAFEVMPGLGDSFGLQVSTDDA 1471
            LR                  +  G  Y ++G   A ++WA+E +  +GD    +   D  
Sbjct: 203  LRKGVAERVVKPKKKSIADSKYKGATYSVHGFPHAFMVWAYEAVSVIGDKCAKRYG-DLF 261

Query: 1472 PRCLRWNFLSTFRGVSDAQLYAQFDDDELEIL------MIMPE-----GQEFRQDYYLSV 1618
            PR LRW                +F++ ++ +L      M+  E     G      +Y+S 
Sbjct: 262  PRILRWKSTKP----------KEFEEIQMNVLNKKASRMVFIEKLHATGDVEANKHYMSY 311

Query: 1619 HQRDALVVKYHVPSSTYIRGDDGPREDPEQE 1711
               D  + +    +   I  D    ++P  E
Sbjct: 312  FSEDEWLEEIGEENDNAIESDSHAEQNPSHE 342


>ref|XP_006495054.1| PREDICTED: uncharacterized protein LOC102626871 [Citrus sinensis]
          Length = 401

 Score = 79.7 bits (195), Expect = 9e-12
 Identities = 79/331 (23%), Positives = 139/331 (41%), Gaps = 30/331 (9%)
 Frame = +2

Query: 809  DLTSVLSEDFENGELAYMFQEGPFGHFLGWSPGQNSRKLLNAVISRCIFKEQG-IWFHIR 985
            D+   LSE  +N     +F+   FGHFL     + S +LL++++ R +  ++  +WF I 
Sbjct: 29   DIKDKLSEAQKN-----IFRRSCFGHFLDVKELKFSAQLLHSILLREVKSDENTMWFRIG 83

Query: 986  GRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLF-QIVCGGLNESANTLCSVF-QH 1159
             +++ ++LE+ AL+TGL   +  ++P    N +D ++  + + G    +   L + F Q 
Sbjct: 84   RKNIRFSLEEFALVTGLDC-NPSYEPDTENNDDDYTIVDEFLDGNCAITTQELRTKFLQA 142

Query: 1160 KPLSSEVRVKLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYKTLLYY 1339
            K       VKLA L      +LG +    I      LV++   F  FPWG  S+K  +  
Sbjct: 143  KSTDDMKMVKLAMLYFVESVLLGKENRNHINEINVLLVDNFTEFNEFPWGRISFKMTIDS 202

Query: 1340 LR----------------TFRRTGQRYHIYGPSWALVIWAFEVMPGLGDSFGLQVSTDDA 1471
            LR                  +  G  Y ++G   A ++WA+E +  +GD    +   D  
Sbjct: 203  LRKGVAERVAKPKKKSTADSKYKGATYSVHGFPHAFMVWAYEAVSVIGDKCAKRYG-DLF 261

Query: 1472 PRCLRWNFLSTFRGVSDAQLYAQFDDDELEIL------MIMPE-----GQEFRQDYYLSV 1618
            PR LRW                +F++ ++ +L      M+  E     G      +Y+S 
Sbjct: 262  PRILRWKSTKP----------KEFEEIQMNVLNKKASRMVFIEKLHATGDVEANKHYMSY 311

Query: 1619 HQRDALVVKYHVPSSTYIRGDDGPREDPEQE 1711
               D  + +    +   I  D    ++P  E
Sbjct: 312  FSEDEWLEEIGEENDNAIESDSHAEQNPSHE 342


>ref|XP_007049260.1| Uncharacterized protein TCM_002293 [Theobroma cacao]
            gi|508701521|gb|EOX93417.1| Uncharacterized protein
            TCM_002293 [Theobroma cacao]
          Length = 791

 Score = 78.6 bits (192), Expect = 2e-11
 Identities = 66/216 (30%), Positives = 97/216 (44%), Gaps = 26/216 (12%)
 Frame = +2

Query: 923  LLNAVISRCIFKEQG----IWFHIRGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDS 1090
            LL++++   I + Q     +WF I       + ++  LITGL FGS    P V R     
Sbjct: 100  LLHSIMIHRITERQSMDHELWFTIGKSKARLSKQEFCLITGLKFGSM---PDVFRR---- 152

Query: 1091 SLFQIVCGGLNE---------SANTLCSVFQ---HKPLSSEVRVKLANLLTANCFVLGHD 1234
             L+++   G++              L   F+    + L  E   K+A +L AN  + G D
Sbjct: 153  -LYEVAADGIHARYWNGEDSVKLQALLDTFRGGNFQRLGDES--KMALVLIANNILFGQD 209

Query: 1235 AGKTIFPWLWELVNDTDAFYSFPWGAYSYK-TLLYYLRTF--------RRTGQRYHIYGP 1387
              + + PWL  LV D DA+  FPWG Y +K TL Y L+ F        + T  RY+IYG 
Sbjct: 210  YRRRMTPWLLSLVEDIDAWNVFPWGHYVWKLTLDYLLKGFEVLDLSVTKETRLRYNIYGF 269

Query: 1388 SWALVIWAFEVMPGLGDSFGLQVSTDDA-PRCLRWN 1492
            +W +  WA E +  L          D+  PR  RW+
Sbjct: 270  AWVIQFWAMEAISTLRKIVAPSGLKDNVHPRMCRWD 305


>ref|XP_006482789.1| PREDICTED: uncharacterized protein LOC102618257 [Citrus sinensis]
          Length = 638

 Score = 74.7 bits (182), Expect = 3e-10
 Identities = 88/372 (23%), Positives = 160/372 (43%), Gaps = 51/372 (13%)
 Frame = +2

Query: 776  PNHYYRPAILDDLTSVLSEDFENGELAYMFQEGPFGHFLGWSPGQNSRKLLNAVISRCIF 955
            P       +L  +   + E   N +L  MF++  FGHFL       S  +L+ ++ R + 
Sbjct: 23   PGRVLSMCMLSSVVKAIEEKLTNRQLR-MFKKDIFGHFLECRSFPFSGVILHNLLLRQVA 81

Query: 956  KEQG-----IWFHIRGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLFQIVCGGL 1120
             E+      +WF I    +  ++ +  L+TGLSFG    D     +  +  L     GG+
Sbjct: 82   HEEDSREDQLWFQIGKHVIRLSIVEWCLVTGLSFG---VDTNQKNDEMEQRLRNTYFGGV 138

Query: 1121 NESANTLCSVFQHKPLSSEVRVKLANLLTANCFVLGHDAGKTI----------FPWLWEL 1270
            +   N    V Q   +  E++++  N + A    L + A + +          F WL + 
Sbjct: 139  HRKIN----VKQFDAVFKELKLEEMNDMDALKIALFYFADRVLNARKNHCQINFDWL-DQ 193

Query: 1271 VNDTDAFYSFPWGAYSYKTLLYYL--------RTFRRTG--------QRYHIYGPSWALV 1402
            V+D   F   PWG  S++ +   L          F++T         ++Y++YG +  + 
Sbjct: 194  VDDIQYFRKRPWGLLSWEIIYESLDNALFEKDEKFKKTRLKNSDHNIEKYNLYGFTSGVQ 253

Query: 1403 IWAFEVMPGLGDSFGLQVSTDDAPRCLRWNFLSTFRGVSDAQLYAQFDDDEL--EILMIM 1576
             W  E + GL  ++ ++ + +  P  L+W  +++ R ++ A++Y+ F+D+    ++L  +
Sbjct: 254  AWIHEAIGGLPSTWVVK-TKNKIPHILQWKPMASSR-INFAEVYSFFNDESRLGDVLQTL 311

Query: 1577 -PEGQEFRQDYYLSVHQRDALV-----VKYHVPSSTYIRG-----------DDGPREDPE 1705
             P  +E  + Y+LSV  +D L      V  H PS   +             DD P   PE
Sbjct: 312  EPNSKESSRKYWLSV--KDYLPSIPDWVHKHQPSINAMPSVTRQSDEHDDHDDIPNPIPE 369

Query: 1706 QE-HGVEDVYWY 1738
            Q  H V+   +Y
Sbjct: 370  QRLHSVQSCKYY 381


>ref|XP_007010437.1| Uncharacterized protein TCM_044253 [Theobroma cacao]
            gi|508727350|gb|EOY19247.1| Uncharacterized protein
            TCM_044253 [Theobroma cacao]
          Length = 547

 Score = 74.7 bits (182), Expect = 3e-10
 Identities = 53/168 (31%), Positives = 75/168 (44%), Gaps = 11/168 (6%)
 Frame = +2

Query: 968  IWFHIRGRDVEYTLEDHALITGLSFGST-DFDPRVARNPNDSSLFQIVCGGLNESANTLC 1144
            +WF I       + ++  LITGL FG   D   R      D    +   G  +     L 
Sbjct: 5    LWFAIGKSKARLSKQEFCLITGLKFGPMLDVFKRPYEVAVDGIHARYWNGEDSVKLQALL 64

Query: 1145 SVFQHKPLSSEV-RVKLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSY 1321
              F+           K+A +L AN  + G D  + + PWL  LV D DA+  FPWG Y +
Sbjct: 65   DTFREGNFQRPGDATKMALILIANNILFGQDYRRRVTPWLLSLVEDIDAWNVFPWGHYIW 124

Query: 1322 K-TLLYYLRTF--------RRTGQRYHIYGPSWALVIWAFEVMPGLGD 1438
            K TL Y L+ F        + T  RY+IYG +W + +WA E +  + D
Sbjct: 125  KLTLDYLLKGFEVPDLSVTKETRLRYNIYGFAWVIQLWALETLEPIAD 172


>ref|XP_007009514.1| Uncharacterized protein TCM_042921 [Theobroma cacao]
            gi|508726427|gb|EOY18324.1| Uncharacterized protein
            TCM_042921 [Theobroma cacao]
          Length = 715

 Score = 69.7 bits (169), Expect = 1e-08
 Identities = 56/187 (29%), Positives = 77/187 (41%), Gaps = 12/187 (6%)
 Frame = +2

Query: 968  IWFHIRGRDVEYTLEDHALITGLSFGST-DFDPRVARNPNDSSLFQIVCGGLNESANTLC 1144
            +WF I       + ++  LITGL FG   D   R      D    +   G  +     L 
Sbjct: 5    LWFAIGKSKARLSKQEFCLITGLKFGPMLDVFRRPYEVAADGIHARYWNGQDSVKLQALL 64

Query: 1145 SVFQHKPLSS-EVRVKLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSY 1321
              F+           K+A +L AN  + G      + PWL  LV D DA+  FPWG Y +
Sbjct: 65   DTFRRSNFKRPRDATKMAFVLIANNILFGQYYRIRVTPWLLSLVEDIDAWNVFPWGHYVW 124

Query: 1322 K-TLLYYLRTF--------RRTGQRYHIYGPSWALVIWAFEVMPGLGDSFGLQVSTDDA- 1471
            K TL Y L+ F        + T   Y+IYG +W +  WA E +P            D+  
Sbjct: 125  KLTLDYLLKGFKVPDLSVTKETRLHYNIYGFAWVIQFWAMEAIPAFQKIVAPFGPKDNVH 184

Query: 1472 PRCLRWN 1492
            PR  RW+
Sbjct: 185  PRMCRWD 191


>ref|XP_006484773.1| PREDICTED: uncharacterized protein LOC102628928 [Citrus sinensis]
          Length = 672

 Score = 68.9 bits (167), Expect = 2e-08
 Identities = 71/315 (22%), Positives = 142/315 (45%), Gaps = 34/315 (10%)
 Frame = +2

Query: 776  PNHYYRPAILDDLTSVLSEDFENGELAYMFQEGPFGHFLGWSPGQNSRKLLNAVISRCIF 955
            P+      +L  +   + E   N +L  MF++  FGHFL       S  +L+ ++   + 
Sbjct: 10   PSRISSMCMLSSVVKAIEEKLTNRQLR-MFKKDIFGHFLECRSFPFSGVILHNLLLWQVA 68

Query: 956  KEQG-----IWFHIRGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLFQIVCGGL 1120
             E+      +WF I    +  ++ +  L+TGLSFG    D     +  +  L     GG+
Sbjct: 69   HEEDSREDQLWFQIGKHVIRLSIVEWCLVTGLSFG---VDTNQKNDEMEQRLRNTYFGGV 125

Query: 1121 NESANTLCSVFQHKPLSSEVRVKLANLLTANCFVLGHDAGKTI----------FPWLWEL 1270
            +   N    V Q   +  E++ +  + + A    L + A + +          F WL + 
Sbjct: 126  HRKIN----VKQFDAVFKEIKFEEIDDIDALKIALFYFADRVLNARKNHCQINFDWL-DQ 180

Query: 1271 VNDTDAFYSFPWGAYSYKTLLYYL--------RTFRRTG--------QRYHIYGPSWALV 1402
            V+D   F   PW   S++ +   L          F++T         ++Y++YG +  + 
Sbjct: 181  VDDIQYFRKRPWDLLSWEMIYESLDNALFEKDEKFKKTRLKNPDHNIEKYNLYGFTSGVQ 240

Query: 1403 IWAFEVMPGLGDSFGLQVSTDDAPRCLRWNFLSTFRGVSDAQLYAQFDDDEL--EILMIM 1576
             W +E + GL  ++ ++ + +  PR L+W  +++ R ++ A++Y+ F+D+    ++L  +
Sbjct: 241  AWIYEAIGGLPSTWVVK-TKNKIPRILQWKPMASSR-INFAEVYSFFNDESRLGDVLQTL 298

Query: 1577 -PEGQEFRQDYYLSV 1618
             P  +E  ++Y+LSV
Sbjct: 299  EPNSKESSRNYWLSV 313


>ref|XP_006478664.1| PREDICTED: uncharacterized protein LOC102607154 [Citrus sinensis]
          Length = 343

 Score = 68.9 bits (167), Expect = 2e-08
 Identities = 72/315 (22%), Positives = 142/315 (45%), Gaps = 34/315 (10%)
 Frame = +2

Query: 776  PNHYYRPAILDDLTSVLSEDFENGELAYMFQEGPFGHFLGWSPGQNSRKLLNAVISRCIF 955
            P       +L  +   + E     +L+ MF++  FGHFL       S  +L+ ++ R + 
Sbjct: 23   PGRISSMCMLSSVVKAIEEKLTMRQLS-MFKKDIFGHFLECRNFLFSGVILHNLLLRQVA 81

Query: 956  -----KEQGIWFHIRGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLFQIVCGGL 1120
                 +E  +WF I    +  ++ +  L+TGLSFG    D     +  +  L     GG+
Sbjct: 82   HEDDSREDQLWFQIGKHLIRLSIVEWCLVTGLSFG---VDTNEKNDEVEQRLQNTYFGGV 138

Query: 1121 NESANTLCSVFQHKPLSSEVRVKLANLLTANCFVLGHDAGKTI----------FPWLWEL 1270
            +   N    V Q   +  E++ +  + + A    L + A + +          F WL + 
Sbjct: 139  HREIN----VKQFDAVFKELKFEEMDDMDALKIALFYFADRVLNARKNHCQINFDWL-DQ 193

Query: 1271 VNDTDAFYSFPWGAYSYKTLLYYL--------RTFRRTG--------QRYHIYGPSWALV 1402
            V+D   F   PWG  S++ +   L          F++T         ++Y++YG +  + 
Sbjct: 194  VDDIQYFRKRPWGLLSWEMIYESLDNALFEKDEKFKKTRLKNPDHNIEKYNLYGFTSGVQ 253

Query: 1403 IWAFEVMPGLGDSFGLQVSTDDAPRCLRWNFLSTFRGVSDAQLYAQFDDDEL--EILMIM 1576
             W +E + GL  ++ ++ + +  PR L+W  +++ R ++ A++Y+ F+D+    ++L  +
Sbjct: 254  AWIYEAIGGLPSTWVVK-TKNKIPRILQWKPMASSR-INFAEVYSFFNDESRLGDVLQTL 311

Query: 1577 -PEGQEFRQDYYLSV 1618
             P  +E  + Y+LSV
Sbjct: 312  EPNSKESSRKYWLSV 326


>ref|XP_006494018.1| PREDICTED: uncharacterized protein LOC102609172 [Citrus sinensis]
          Length = 345

 Score = 67.8 bits (164), Expect = 4e-08
 Identities = 70/306 (22%), Positives = 139/306 (45%), Gaps = 25/306 (8%)
 Frame = +2

Query: 776  PNHYYRPAILDDLTSVLSEDFENGELAYMFQEGPFGHFLGWSPGQNSRKLLNAVISRCIF 955
            P       +L  +   + E     +L+ MF++  FGHFL       S  +L+ ++ R + 
Sbjct: 23   PGRISSMCMLSSVVKAIEEKLTKRQLS-MFKKDIFGHFLECRSFPFSGVILHNLLLRQVA 81

Query: 956  KEQG-----IWFHIRGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLFQIVCGGL 1120
             E+      +WF I    +  ++ +  L+TGLSFG    D     +  +  L     GG+
Sbjct: 82   HEEDSREDQLWFQIGKHLIRLSIVEWCLVTGLSFG---VDTNQKNDEMEQRLRNTYFGGV 138

Query: 1121 NESANTLCSVFQHKPLSSEVRVKLANLLTANCFVLGHDAGKTI-FPWLWELVNDTDAFYS 1297
            +     L    + + +     +K+A    A+  +        I F WL + V+D   F  
Sbjct: 139  H-----LFKELKFEEMDDMDALKIALFYFADRVLNARKNHCQINFDWL-DQVDDIQYFRK 192

Query: 1298 FPWGAYSYKTLLYYL--------RTFRRTG--------QRYHIYGPSWALVIWAFEVMPG 1429
             PWG  S++ +   L          F++T         ++Y++YG +  +  W +E + G
Sbjct: 193  CPWGLLSWEMVYESLDNALFEKDEKFKKTRLKNSDHNIEKYNLYGFTSGVQAWIYEAIGG 252

Query: 1430 LGDSFGLQVSTDDAPRCLRWNFLSTFRGVSDAQLYAQFDDDEL--EILMIM-PEGQEFRQ 1600
            L  ++ ++ + +  PR L+W  +++ R ++ A++Y+ F+D+    ++L  + P  +E  +
Sbjct: 253  LPSTWVVK-TKNKIPRILQWKPMASSR-INFAEVYSFFNDESRLGDVLQTLEPNSKESSK 310

Query: 1601 DYYLSV 1618
            +Y+LSV
Sbjct: 311  NYWLSV 316


>ref|XP_004167400.1| PREDICTED: uncharacterized protein LOC101226019, partial [Cucumis
            sativus]
          Length = 504

 Score = 67.0 bits (162), Expect = 6e-08
 Identities = 62/237 (26%), Positives = 106/237 (44%), Gaps = 10/237 (4%)
 Frame = +2

Query: 809  DLTSVLSEDFENGELAYMFQEGPFGHFLGWSPGQNSRKLLNAVISR--CIFKEQGIWFHI 982
            D+ S++       +L   F++  FG+FL     + S +L   +I R  C      +WF++
Sbjct: 181  DVISIIKNTLNERQLK-KFKKSCFGNFLDLKISKFSSQLFYHLIRRQCCSKNRNELWFNL 239

Query: 983  RGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLFQIVCGGLNESAN--TLCSVF- 1153
             GR  ++ ++D ALITGL+ G     P +  +      F     G  ++     L  VF 
Sbjct: 240  EGRIHKFGMKDFALITGLNCGEL---PAIDMSKIQKGKFNKRYFGGEKTIRRAKLHKVFT 296

Query: 1154 -QHKPLSSEVRVKLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYKTL 1330
               K  + +V VK+A L     F+LG      I      L++D   F S+PWG  SY+  
Sbjct: 297  EMDKGRNKDV-VKMAKLYILEMFILGKQIRTGINHEYTLLIDDKKQFDSYPWGRISYEIT 355

Query: 1331 LYYLRTFRRT--GQRYHIYGPSWALVIWAFEVMP--GLGDSFGLQVSTDDAPRCLRW 1489
            + +++   ++       + G  +AL++WA+E +P   L  +F     +   PR   W
Sbjct: 356  VDFVKKSIKSNDASAIGVGGFPYALLVWAYETIPLLALNSNFLAMRISFGTPRMNNW 412


>ref|XP_007010629.1| Uncharacterized protein TCM_044555 [Theobroma cacao]
            gi|508727542|gb|EOY19439.1| Uncharacterized protein
            TCM_044555 [Theobroma cacao]
          Length = 697

 Score = 66.6 bits (161), Expect = 8e-08
 Identities = 40/106 (37%), Positives = 57/106 (53%), Gaps = 9/106 (8%)
 Frame = +2

Query: 1187 KLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYK-TLLYYLRTF---- 1351
            K+A +L AN  +   D  + + PWL  LV D DA+  FPWG Y +K TL Y L+ F    
Sbjct: 90   KMAFILIANNILFDQDYRRRVTPWLLSLVEDIDAWNVFPWGHYVWKLTLDYLLKGFKVLN 149

Query: 1352 ----RRTGQRYHIYGPSWALVIWAFEVMPGLGDSFGLQVSTDDAPR 1477
                + T  RY+IYG +W + +WA E+         L+ +TD+A R
Sbjct: 150  LSVTKETRLRYNIYGFAWVIQLWALEM---------LEPTTDEAFR 186


>ref|XP_006396927.1| hypothetical protein EUTSA_v10029511mg [Eutrema salsugineum]
            gi|557097944|gb|ESQ38380.1| hypothetical protein
            EUTSA_v10029511mg [Eutrema salsugineum]
          Length = 997

 Score = 66.2 bits (160), Expect = 1e-07
 Identities = 68/236 (28%), Positives = 106/236 (44%), Gaps = 19/236 (8%)
 Frame = +2

Query: 971  WFHIRGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLFQIVCGGLNESANTLCSV 1150
            W  I GR V ++L ++A ITGL+    D + +V  +  +      V GG   + N L + 
Sbjct: 93   WTLIGGRPVRFSLLEYAEITGLNCDPIDPNDKVEIDHTEFWAEIGVRGGEGPNWNELEAR 152

Query: 1151 FQH-KPLSSEVRVKLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYKT 1327
             +  +  + E +  LA L   +  VLG      I   L + V D  AF   PWG Y +  
Sbjct: 153  MKTCQAWTYEKKKMLALLFILHVGVLGLHRNSRIPLALAKTVMDEAAFERQPWGRYGFHE 212

Query: 1328 LLYYLRTFRRTGQRYHIYGPSWALVIWAFEVMPGLGDSFG-LQVSTDDA-PRCLRWNFLS 1501
            L+Y L+     G RY ++G   A+++W +E +P L +  G ++   DD+    LRW   S
Sbjct: 213  LIYSLKIANLGGARYTLHGCVQAMLVWGYECIPILAERAGKIRPDVDDSVVPLLRWT-SS 271

Query: 1502 TFRGVSDAQLYAQFDDDELE--------ILMIMP--------EGQEFRQDYYLSVH 1621
              R V +  L  + D  E E        +L++ P        EG+  R D  + VH
Sbjct: 272  RCRNVFETLL--EMDKSETEDHKVRVRHLLVVKPLEEIYPVWEGEPRRVDVDIKVH 325


>ref|XP_006418926.1| hypothetical protein EUTSA_v100031140mg, partial [Eutrema
            salsugineum] gi|557096854|gb|ESQ37362.1| hypothetical
            protein EUTSA_v100031140mg, partial [Eutrema salsugineum]
          Length = 242

 Score = 63.9 bits (154), Expect = 5e-07
 Identities = 60/200 (30%), Positives = 93/200 (46%), Gaps = 3/200 (1%)
 Frame = +2

Query: 971  WFHIRGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLFQIVCGGLNESANTLCSV 1150
            W  I GR V ++L ++A ITGL+    D + +V  +  +      V GG   + N L + 
Sbjct: 41   WTLIGGRPVRFSLLEYAEITGLNCDPIDPNDKVEIDHTEFWAEIGVRGGEGPNWNELEAR 100

Query: 1151 FQH-KPLSSEVRVKLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYKT 1327
             +  +  + E +  LA L   +  VLG      I   L + V D  AF   PWG Y +  
Sbjct: 101  MKTCQAWTYEKKKMLALLFILHVGVLGLHRNSRIPLALAKTVMDEAAFERQPWGRYGFHE 160

Query: 1328 LLYYLRTFRRTGQRYHIYGPSWALVIWAFEVMPGLGDSFG-LQVSTDDA-PRCLRWNFLS 1501
            L+Y L+     G RY ++G   A+++W +E +P L +  G ++   DD+    LRW   S
Sbjct: 161  LIYSLKIANLGGARYTLHGCVQAMLVWGYECIPILAERAGKIRPDVDDSVVPLLRWT-SS 219

Query: 1502 TFRGVSDAQLYAQFDDDELE 1561
              R V +  L  + D  E E
Sbjct: 220  RCRNVFETLL--EMDKSETE 237


>ref|XP_006391727.1| hypothetical protein EUTSA_v100238800mg, partial [Eutrema
            salsugineum] gi|557088233|gb|ESQ29013.1| hypothetical
            protein EUTSA_v100238800mg, partial [Eutrema salsugineum]
          Length = 285

 Score = 63.9 bits (154), Expect = 5e-07
 Identities = 53/176 (30%), Positives = 84/176 (47%), Gaps = 3/176 (1%)
 Frame = +2

Query: 971  WFHIRGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLFQIVCGGLNESANTLCSV 1150
            W  I GR V ++L ++A ITGL+    D + +V  +  +      V GG   + N L + 
Sbjct: 93   WTLIGGRPVRFSLLEYAEITGLNCDPIDPNDKVEIDHTEFWAEIGVRGGEGPNWNELEAR 152

Query: 1151 FQH-KPLSSEVRVKLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYKT 1327
             +  +  + E +  LA L   +  VLG      I   L + V D  AF   PWG Y +  
Sbjct: 153  MKTCQAWTYEKKKMLALLFILHVGVLGLHRNSRIPLTLAKTVMDEAAFERQPWGRYGFHE 212

Query: 1328 LLYYLRTFRRTGQRYHIYGPSWALVIWAFEVMPGLGDSFG-LQVSTDDA-PRCLRW 1489
            L+Y L+     G RY ++G   A+++W +E +P L +  G ++   DD+    LRW
Sbjct: 213  LIYSLKIANLGGARYTLHGCVQAMLVWGYECIPILAERAGKIRPDVDDSVVPLLRW 268


>ref|XP_007031827.1| Uncharacterized protein TCM_017149 [Theobroma cacao]
            gi|508710856|gb|EOY02753.1| Uncharacterized protein
            TCM_017149 [Theobroma cacao]
          Length = 249

 Score = 63.9 bits (154), Expect = 5e-07
 Identities = 66/232 (28%), Positives = 105/232 (45%), Gaps = 27/232 (11%)
 Frame = +2

Query: 767  NMHPNHYYRPAILDDLTSVLSEDFENGELAYMFQEGPFGHFLGWSP-GQNSRKLLNAVIS 943
            N+  N +Y+ + L  +T  L +  E   +    +   FG  LG++P G     LL +++ 
Sbjct: 30   NVTINTHYKWSQLHYITKTLQQKGEYDAV----KRTCFGMLLGFNPQGYFCAGLLYSIMI 85

Query: 944  RCIFKEQG----IWFHIRGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLFQIVC 1111
              I + Q     +WF I   +V  + ++  LIT L FG     P V R P     +++  
Sbjct: 86   HRITERQSMDHELWFAIGKSNVRLSKQEFCLITRLKFGPM---PDVFRRP-----YEVAT 137

Query: 1112 GGLN-------ESAN--TLCSVFQ----HKPLSSEVRVKLANLLTANCFVLGHDAGKTIF 1252
             G++       ESA    L   F+     +P  +    K+A +L  N  + G D  + + 
Sbjct: 138  EGIHDRYWNRQESAKLQALLDTFRGGNFQRPGDA---TKMALVLITNNILFGQDYRRRVT 194

Query: 1253 PWLWELVNDTDAFYSFPWGAYSYK-TLLYYLRTF--------RRTGQRYHIY 1381
            PWL  L+ D DA+  FPWG Y +K TL Y L+ F        ++T   Y+IY
Sbjct: 195  PWLLSLMEDIDAWNVFPWGHYVWKLTLDYLLKEFEVPDSSVTKKTRLHYNIY 246


>ref|XP_006401890.1| hypothetical protein EUTSA_v100159020mg, partial [Eutrema
            salsugineum] gi|557102980|gb|ESQ43343.1| hypothetical
            protein EUTSA_v100159020mg, partial [Eutrema salsugineum]
          Length = 292

 Score = 62.8 bits (151), Expect = 1e-06
 Identities = 51/176 (28%), Positives = 85/176 (48%), Gaps = 3/176 (1%)
 Frame = +2

Query: 971  WFHIRGRDVEYTLEDHALITGLSFGSTDFDPRVARNPNDSSLFQIVCGGLNESANTLCSV 1150
            W  I GR V ++L ++A ITGL+    D + ++  +  +      V GG + + N L + 
Sbjct: 93   WTLIGGRPVRFSLLEYAEITGLNCDPIDPNDKLEIDHTEFWAEIGVRGGEDPNWNELEAR 152

Query: 1151 FQH-KPLSSEVRVKLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYKT 1327
             +  +  + E +  LA L   +  VLG      I   L + V D  AF   PWG Y +  
Sbjct: 153  MKTCQAWTYEKKKMLALLFILHVGVLGLHRNSRISLALAKTVMDEAAFERQPWGRYGFHE 212

Query: 1328 LLYYLRTFRRTGQRYHIYGPSWALVIWAFEVMPGLGDSFG-LQVSTDDA-PRCLRW 1489
            L+Y ++     G RY ++G   A+++W +E +P L +  G ++   DD+    LRW
Sbjct: 213  LIYSIKIANLGGARYTLHGCVQAMLVWGYECIPILAERSGKIRPDVDDSVVPLLRW 268


>ref|XP_007014370.1| Uncharacterized protein TCM_039370 [Theobroma cacao]
            gi|508784733|gb|EOY31989.1| Uncharacterized protein
            TCM_039370 [Theobroma cacao]
          Length = 653

 Score = 62.8 bits (151), Expect = 1e-06
 Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 10/113 (8%)
 Frame = +2

Query: 1187 KLANLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYK-TLLYYLRTF---- 1351
            K+A +L  N  + G D  + + PWL  LV D DA+  FPWG Y +K TL Y L+ F    
Sbjct: 128  KMALVLITNNILFGQDYRRRVTPWLLSLVEDIDAWNVFPWGYYVWKLTLDYLLKGFEVPD 187

Query: 1352 ----RRTGQRYHIYGPSWALVIWAFEVMPGLGDSFGLQVSTDDA-PRCLRWNF 1495
                + T  RY+IYG +W  VI   E +           S D+  PR  RW++
Sbjct: 188  LSVTKETRLRYNIYGFAW--VIQTMEAISAFRKIVAPSGSKDNVHPRMCRWDY 238


>ref|XP_004240477.1| PREDICTED: uncharacterized protein LOC101262922 [Solanum
            lycopersicum]
          Length = 654

 Score = 62.4 bits (150), Expect = 2e-06
 Identities = 72/311 (23%), Positives = 133/311 (42%), Gaps = 7/311 (2%)
 Frame = +2

Query: 668  LPENEYGISFERIPRQILESRPPWFRRPRTINRNMHPNHYYRPAILDDLTSVLSEDFENG 847
            LP+   G + E + + ++    P     R + R  HP    +  +  D   +L+E  +  
Sbjct: 89   LPKRRKGNAGEAMDKALVVDLEPGEFCARPVRR-YHPRVGAQTNV--DAVKLLNEKLDAK 145

Query: 848  ELAYMFQEGPFGHFLGWSPGQNSRKLLNAVISRCIFKEQ--GIWFHIRGRDVEYTLEDHA 1021
            ++  MF+E  FGHFL   P     +L ++++ R + +E+   +W  ++   + + L +  
Sbjct: 146  QIQ-MFRETCFGHFLDLPPVVVQHQLAHSLLLREVVEEEEDALWISMKDISLRFGLVEFG 204

Query: 1022 LITGLSFGSTDFDPRVARNPNDSSLFQIVCGGLNE-SANTLCSVFQHKP-LSSEVRVKLA 1195
            +ITGL      +  + + +     L       L      +L   F +K   S E  VK+A
Sbjct: 205  IITGLKCTGDAY--KCSDSDGTGQLMNTYFAELTRVPKQSLIECFHNKRWKSDEDAVKIA 262

Query: 1196 NLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYKTLLYYLR-TFRRTGQRY 1372
             L   + F+    + K I    +ELV D+ A+ ++PWG   +K  L  ++   +     Y
Sbjct: 263  VLYFIHTFLFSTVSRKHISRDDFELV-DSGAYATYPWGKAVFKATLKSVKGKLQGKPSMY 321

Query: 1373 HIYGPSWALVIWAFEVMPGLGDSFGLQVSTDDAPRCLRWNFLS--TFRGVSDAQLYAQFD 1546
             + G   A   W +E  P + +    +V  D  PR L W      +F+ + D     +  
Sbjct: 322  RLGGLPLAFQCWFYECCPYVNNKIAFRVD-DKVPRILSWKVTKQPSFKELLDG--IFRLS 378

Query: 1547 DDELEILMIMP 1579
             D+L++  I P
Sbjct: 379  QDQLKLRNISP 389


>ref|XP_006364558.1| PREDICTED: uncharacterized protein LOC102586575 [Solanum tuberosum]
          Length = 656

 Score = 62.0 bits (149), Expect = 2e-06
 Identities = 72/311 (23%), Positives = 131/311 (42%), Gaps = 7/311 (2%)
 Frame = +2

Query: 668  LPENEYGISFERIPRQILESRPPWFRRPRTINRNMHPNHYYRPAILDDLTSVLSEDFENG 847
            LP+   G   E + + ++    P     R + R  HP    +  +  D   +L+E  +  
Sbjct: 89   LPKRRKGNDDEAMEKALVVDLEPGEFCARPVRR-YHPRVGAQTNV--DAVKLLNEKLDAK 145

Query: 848  ELAYMFQEGPFGHFLGWSPGQNSRKLLNAVISRCIFKEQ--GIWFHIRGRDVEYTLEDHA 1021
            ++  MF+E  FGHFL   P     +L ++++ R + +E+   +W  ++   + + L +  
Sbjct: 146  QIQ-MFRETCFGHFLDLPPVVVQHQLAHSLLLREVVEEEEDALWISMKNVSLRFGLVEFG 204

Query: 1022 LITGLSFGSTDFDPRVARNPNDSSLFQIVCGGLNE-SANTLCSVFQHKP-LSSEVRVKLA 1195
            +ITGL      +  +   +     L       L      +L   F +K   S E  VK+A
Sbjct: 205  IITGLKCTGDAY--KCCDSDGTGQLMNTYFSELTRVPKQSLIECFHNKRWKSDEDAVKIA 262

Query: 1196 NLLTANCFVLGHDAGKTIFPWLWELVNDTDAFYSFPWGAYSYKTLLYYLR-TFRRTGQRY 1372
             L   + F+    + K I    +ELV D+ A+ ++PWG   +K  L  ++   +     Y
Sbjct: 263  VLYFIHTFLFSTVSRKHISRDDFELV-DSGAYETYPWGKAVFKATLKSVKGKLQGKPSMY 321

Query: 1373 HIYGPSWALVIWAFEVMPGLGDSFGLQVSTDDAPRCLRWNFLS--TFRGVSDAQLYAQFD 1546
             + G   A   W +E  P + +    +V  D  PR L W      +F+ + D     +  
Sbjct: 322  RLGGLPLAFQCWFYECCPYVNNKIAFRVD-DKVPRILSWKVTKQPSFKELLDG--IFRLS 378

Query: 1547 DDELEILMIMP 1579
             D+L++  I P
Sbjct: 379  QDQLKLRNISP 389

