BLASTX nr result

ID: Akebia27_contig00002351 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00002351
         (1307 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002271455.1| PREDICTED: uncharacterized protein LOC100249...   549   e-153
ref|XP_007019945.1| Galactose-binding protein isoform 4 [Theobro...   531   e-148
ref|XP_007019942.1| Galactose-binding protein isoform 1 [Theobro...   531   e-148
emb|CBI17031.3| unnamed protein product [Vitis vinifera]              527   e-147
ref|XP_006441747.1| hypothetical protein CICLE_v10019431mg [Citr...   515   e-143
ref|XP_006371384.1| hypothetical protein POPTR_0019s09690g [Popu...   514   e-143
ref|XP_006492474.1| PREDICTED: uncharacterized protein slp1-like...   511   e-142
ref|XP_007199767.1| hypothetical protein PRUPE_ppa003178mg [Prun...   508   e-141
ref|XP_002523463.1| conserved hypothetical protein [Ricinus comm...   508   e-141
ref|XP_004516033.1| PREDICTED: uncharacterized protein LOC101491...   496   e-137
ref|XP_004516032.1| PREDICTED: uncharacterized protein LOC101491...   496   e-137
ref|XP_004290252.1| PREDICTED: uncharacterized protein SLP1-like...   492   e-136
gb|EXC32470.1| putative glycosyltransferase [Morus notabilis]         490   e-136
ref|XP_007019951.1| Galactose-binding protein isoform 10, partia...   482   e-133
ref|XP_007019947.1| Galactose-binding protein isoform 6, partial...   482   e-133
ref|XP_004141528.1| PREDICTED: uncharacterized protein LOC101220...   481   e-133
ref|XP_003534427.1| PREDICTED: uncharacterized protein LOC100783...   481   e-133
ref|XP_006356930.1| PREDICTED: uncharacterized protein LOC102595...   478   e-132
gb|AAU04771.1| membrane protein-like [Cucumis melo]                   476   e-131
ref|XP_007019949.1| Galactose-binding protein isoform 8 [Theobro...   475   e-131

>ref|XP_002271455.1| PREDICTED: uncharacterized protein LOC100249908 [Vitis vinifera]
          Length = 586

 Score =  549 bits (1414), Expect = e-153
 Identities = 281/413 (68%), Positives = 333/413 (80%), Gaps = 6/413 (1%)
 Frame = +2

Query: 2    ASSPKGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYL 181
            A S K K V GQAG++IHRVE GG +YNYASASKGAKVLA NKEAKGASNILGKDKDKYL
Sbjct: 162  AISYKSKSVTGQAGNVIHRVEPGGADYNYASASKGAKVLASNKEAKGASNILGKDKDKYL 221

Query: 182  RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIA 361
            RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSN KD +LLGS V+PTD WV LG+F A
Sbjct: 222  RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNPKDFELLGSSVFPTDEWVKLGNFTA 281

Query: 362  GNVKHSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDN 541
             NVKH+QRF L EPKWVRYLKLNLLSHHG+EFYCTLS +EVYGVDAVE MLEDLISVQDN
Sbjct: 282  ANVKHAQRFALHEPKWVRYLKLNLLSHHGTEFYCTLSVVEVYGVDAVERMLEDLISVQDN 341

Query: 542  QFGSEELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILE 721
             F  EE+  E   +  Q EPT G++L+Q  V+E ++    +    K + +K+N+P+ + E
Sbjct: 342  PFVPEEITAEKKSIPSQPEPTEGNNLYQKPVSETESDPLLD----KPEAIKSNMPDPVEE 397

Query: 722  TRPQQVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVI 901
             R QQVGRMPGDTVLKILMQKV+S+DLSLSVLE+YLE+LNSRYGNIF+E D EI  KDV+
Sbjct: 398  IRHQQVGRMPGDTVLKILMQKVQSLDLSLSVLERYLEDLNSRYGNIFKEFDKEIEEKDVL 457

Query: 902  LEKIRSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHM 1081
            LE IRSD++N  D +E+I KDVSDL++WKSLVSLQL+NL++DNA+LR+EV+ VQ +Q HM
Sbjct: 458  LENIRSDIRNFLDSKEIITKDVSDLISWKSLVSLQLDNLLKDNALLRAEVQKVQEDQTHM 517

Query: 1082 ENKGIAIFLVSFVFS------CITLMILFIDMVVSVCSRTEKSGKFCGMRSSW 1222
            ENKGIA+FL+  +F        +  M+L + M VSV +R++KS  FCG  SSW
Sbjct: 518  ENKGIAVFLICLIFGFWAFARLLVDMMLSVYMAVSVNNRSDKSRNFCGTSSSW 570


>ref|XP_007019945.1| Galactose-binding protein isoform 4 [Theobroma cacao]
            gi|508725273|gb|EOY17170.1| Galactose-binding protein
            isoform 4 [Theobroma cacao]
          Length = 553

 Score =  531 bits (1368), Expect = e-148
 Identities = 266/408 (65%), Positives = 329/408 (80%), Gaps = 5/408 (1%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            + K   GQAG + HRVE GG+EYNYASASKGAKVL  NKEAKGASNILGKDKDKYLRNPC
Sbjct: 132  RSKSGTGQAG-VKHRVEPGGKEYNYASASKGAKVLLCNKEAKGASNILGKDKDKYLRNPC 190

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAEEKFV+IELSEETLVDTIEIANFEHYSS LKD +LLGSL +PTD W+ LG+F AGNVK
Sbjct: 191  SAEEKFVIIELSEETLVDTIEIANFEHYSSKLKDFELLGSLFFPTDVWIKLGNFTAGNVK 250

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGS 553
            H+QRF L+EPKWVRYLKLNLLSH+GSEFYCTLS +EVYGVDAVE MLEDLISVQDN F S
Sbjct: 251  HAQRFVLKEPKWVRYLKLNLLSHYGSEFYCTLSVIEVYGVDAVERMLEDLISVQDNLFAS 310

Query: 554  EELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQ 733
            ++   +   +  +LEPT G+ ++QN   E+ ++S  ENSN++ D   N VP+ + +   Q
Sbjct: 311  DDGTRDQKQMPSKLEPTQGNSVYQNSHKEMGSESSVENSNLQHDVFNNIVPSPVEDIHHQ 370

Query: 734  QVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKI 913
            QVGR+PGD+VLKILMQKVR++DL+LSVLE+YLEELNS+YGNIF+E D +IG KD +LEKI
Sbjct: 371  QVGRVPGDSVLKILMQKVRALDLNLSVLERYLEELNSKYGNIFKEFDEDIGEKDKLLEKI 430

Query: 914  RSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHMENKG 1093
            +SD+K+L D Q+++AKD+ D+ +WKSLVS+QL+ ++RDNA LRS+VE V+  Q+ MENKG
Sbjct: 431  KSDIKDLLDSQKIMAKDIGDVASWKSLVSIQLDTILRDNADLRSKVEKVREKQISMENKG 490

Query: 1094 IAIFLVSFVFSCITLMILFIDMVVSVC-----SRTEKSGKFCGMRSSW 1222
            IA+F+VS +F  +  + L +DM++SV       +TEK  KFC   SSW
Sbjct: 491  IAVFVVSLIFGFLAFVRLLVDMLLSVSMSLSDEKTEKPRKFCSFSSSW 538


>ref|XP_007019942.1| Galactose-binding protein isoform 1 [Theobroma cacao]
            gi|590603203|ref|XP_007019948.1| Galactose-binding
            protein isoform 1 [Theobroma cacao]
            gi|590603215|ref|XP_007019950.1| Galactose-binding
            protein isoform 1 [Theobroma cacao]
            gi|508725270|gb|EOY17167.1| Galactose-binding protein
            isoform 1 [Theobroma cacao] gi|508725276|gb|EOY17173.1|
            Galactose-binding protein isoform 1 [Theobroma cacao]
            gi|508725278|gb|EOY17175.1| Galactose-binding protein
            isoform 1 [Theobroma cacao]
          Length = 586

 Score =  531 bits (1368), Expect = e-148
 Identities = 266/408 (65%), Positives = 329/408 (80%), Gaps = 5/408 (1%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            + K   GQAG + HRVE GG+EYNYASASKGAKVL  NKEAKGASNILGKDKDKYLRNPC
Sbjct: 165  RSKSGTGQAG-VKHRVEPGGKEYNYASASKGAKVLLCNKEAKGASNILGKDKDKYLRNPC 223

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAEEKFV+IELSEETLVDTIEIANFEHYSS LKD +LLGSL +PTD W+ LG+F AGNVK
Sbjct: 224  SAEEKFVIIELSEETLVDTIEIANFEHYSSKLKDFELLGSLFFPTDVWIKLGNFTAGNVK 283

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGS 553
            H+QRF L+EPKWVRYLKLNLLSH+GSEFYCTLS +EVYGVDAVE MLEDLISVQDN F S
Sbjct: 284  HAQRFVLKEPKWVRYLKLNLLSHYGSEFYCTLSVIEVYGVDAVERMLEDLISVQDNLFAS 343

Query: 554  EELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQ 733
            ++   +   +  +LEPT G+ ++QN   E+ ++S  ENSN++ D   N VP+ + +   Q
Sbjct: 344  DDGTRDQKQMPSKLEPTQGNSVYQNSHKEMGSESSVENSNLQHDVFNNIVPSPVEDIHHQ 403

Query: 734  QVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKI 913
            QVGR+PGD+VLKILMQKVR++DL+LSVLE+YLEELNS+YGNIF+E D +IG KD +LEKI
Sbjct: 404  QVGRVPGDSVLKILMQKVRALDLNLSVLERYLEELNSKYGNIFKEFDEDIGEKDKLLEKI 463

Query: 914  RSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHMENKG 1093
            +SD+K+L D Q+++AKD+ D+ +WKSLVS+QL+ ++RDNA LRS+VE V+  Q+ MENKG
Sbjct: 464  KSDIKDLLDSQKIMAKDIGDVASWKSLVSIQLDTILRDNADLRSKVEKVREKQISMENKG 523

Query: 1094 IAIFLVSFVFSCITLMILFIDMVVSVC-----SRTEKSGKFCGMRSSW 1222
            IA+F+VS +F  +  + L +DM++SV       +TEK  KFC   SSW
Sbjct: 524  IAVFVVSLIFGFLAFVRLLVDMLLSVSMSLSDEKTEKPRKFCSFSSSW 571


>emb|CBI17031.3| unnamed protein product [Vitis vinifera]
          Length = 544

 Score =  527 bits (1357), Expect = e-147
 Identities = 275/413 (66%), Positives = 318/413 (76%), Gaps = 6/413 (1%)
 Frame = +2

Query: 2    ASSPKGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYL 181
            A S K K V GQAG++IHRVE GG +YNYASASKGAKVLA NKEAKGASNILGKDKDKYL
Sbjct: 144  AISYKSKSVTGQAGNVIHRVEPGGADYNYASASKGAKVLASNKEAKGASNILGKDKDKYL 203

Query: 182  RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIA 361
            RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSN KD +LLGS V+PTD WV LG+F A
Sbjct: 204  RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNPKDFELLGSSVFPTDEWVKLGNFTA 263

Query: 362  GNVKHSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDN 541
             NVKH+QRF L EPKWVRYLKLNLLSHHG+EFYCTLS +EVYGVDAVE MLEDLISVQDN
Sbjct: 264  ANVKHAQRFALHEPKWVRYLKLNLLSHHGTEFYCTLSVVEVYGVDAVERMLEDLISVQDN 323

Query: 542  QFGSEELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILE 721
             F  EE+  E   +  Q EPT G++L+Q  V                            +
Sbjct: 324  PFVPEEITAEKKSIPSQPEPTEGNNLYQKPV----------------------------K 355

Query: 722  TRPQQVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVI 901
             R QQVGRMPGDTVLKILMQKV+S+DLSLSVLE+YLE+LNSRYGNIF+E D EI  KDV+
Sbjct: 356  IRHQQVGRMPGDTVLKILMQKVQSLDLSLSVLERYLEDLNSRYGNIFKEFDKEIEEKDVL 415

Query: 902  LEKIRSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHM 1081
            LE IRSD++N  D +E+I KDVSDL++WKSLVSLQL+NL++DNA+LR+EV+ VQ +Q HM
Sbjct: 416  LENIRSDIRNFLDSKEIITKDVSDLISWKSLVSLQLDNLLKDNALLRAEVQKVQEDQTHM 475

Query: 1082 ENKGIAIFLVSFVFS------CITLMILFIDMVVSVCSRTEKSGKFCGMRSSW 1222
            ENKGIA+FL+  +F        +  M+L + M VSV +R++KS  FCG  SSW
Sbjct: 476  ENKGIAVFLICLIFGFWAFARLLVDMMLSVYMAVSVNNRSDKSRNFCGTSSSW 528


>ref|XP_006441747.1| hypothetical protein CICLE_v10019431mg [Citrus clementina]
            gi|557544009|gb|ESR54987.1| hypothetical protein
            CICLE_v10019431mg [Citrus clementina]
          Length = 587

 Score =  515 bits (1327), Expect = e-143
 Identities = 261/408 (63%), Positives = 319/408 (78%), Gaps = 5/408 (1%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            + K   GQ G +IHRVET G EYNYASA+KGAKVL++NKEAKGA+NIL +DKDKYLRNPC
Sbjct: 165  RSKSATGQPGGVIHRVETEGTEYNYASAAKGAKVLSYNKEAKGATNILSRDKDKYLRNPC 224

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAEEK+VVIELSEETLVD+ EIANFEH+SSNL++ +L GSLVYPTD WV LG+F A NVK
Sbjct: 225  SAEEKYVVIELSEETLVDSFEIANFEHHSSNLREFELHGSLVYPTDVWVKLGNFTAANVK 284

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGS 553
             +QRF L EPKWVRYLKLNLLSH+GSEFYCTLS +EVYGVDAVE MLEDLI VQ+N F  
Sbjct: 285  LAQRFRLDEPKWVRYLKLNLLSHYGSEFYCTLSVVEVYGVDAVERMLEDLIPVQENVFVP 344

Query: 554  EELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQ 733
            E+   +  P  P  E + GD+  QN+  E+++ S  E+ +VKR   K+NVP+ + E R  
Sbjct: 345  EKGRGDLKPTSPPQESSQGDEFFQNLYIELESDSSEESFDVKRAVTKSNVPDPVGEVR-H 403

Query: 734  QVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKI 913
            QVGRMP DTVLKIL+QKVRS+DL+LSVLE+YLEELNSRYGNIF E D E+G KD ILEKI
Sbjct: 404  QVGRMPADTVLKILVQKVRSLDLNLSVLERYLEELNSRYGNIFNEFDEEMGEKDRILEKI 463

Query: 914  RSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHMENKG 1093
            RSD+ N+ + QE IAKDV DL +WKSLVS+QL  L++DN++LR +VE VQ NQV +ENKG
Sbjct: 464  RSDIANILNSQETIAKDVGDLNSWKSLVSMQLETLLKDNSVLRQKVEKVQENQVTLENKG 523

Query: 1094 IAIFLVSFVFSCITLMILFIDMVVSVC-----SRTEKSGKFCGMRSSW 1222
            I +FL+  +F    ++ LF+D+++SV        T+K GKFC + SSW
Sbjct: 524  IIVFLICLIFGIFAILRLFVDILLSVYMALSERTTQKPGKFCSVNSSW 571


>ref|XP_006371384.1| hypothetical protein POPTR_0019s09690g [Populus trichocarpa]
            gi|550317140|gb|ERP49181.1| hypothetical protein
            POPTR_0019s09690g [Populus trichocarpa]
          Length = 587

 Score =  514 bits (1325), Expect = e-143
 Identities = 265/411 (64%), Positives = 324/411 (78%), Gaps = 4/411 (0%)
 Frame = +2

Query: 2    ASSPKGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYL 181
            A S K KP  GQ G +IHR+E GG+EYNYASASKGAKVLAFNKEAKGASNIL  DKDKYL
Sbjct: 162  AFSSKSKPGTGQVGGVIHRMEPGGKEYNYASASKGAKVLAFNKEAKGASNILVGDKDKYL 221

Query: 182  RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIA 361
            RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNLK  +LLGSLVYPT  WV LG+F A
Sbjct: 222  RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNLKHFELLGSLVYPTGDWVKLGNFTA 281

Query: 362  GNVKHSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDN 541
             NVKH+QRFTLQ    VRYL+LNLLSH+GSEFYCTLS +E+YGVDAVE MLED+IS QDN
Sbjct: 282  ANVKHAQRFTLQVLIGVRYLRLNLLSHYGSEFYCTLSVIEIYGVDAVEQMLEDMISDQDN 341

Query: 542  QFGSEELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILE 721
             FG E    E  P    LE T  DD + ++ +++++ S  ENSN K + +KN +P+ + E
Sbjct: 342  LFGYEVGAGEQKPPSSHLESTQDDDTYTDLYSDMED-SSVENSNAKNEVVKNKLPDPVEE 400

Query: 722  TRPQQVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVI 901
             R QQVGRMPGD+VLKILMQKVRS+DLSLS+LE+YLEE+NS+YGNIF+E+D ++G KD++
Sbjct: 401  VRHQQVGRMPGDSVLKILMQKVRSLDLSLSILERYLEEVNSKYGNIFKEIDKDLGEKDIL 460

Query: 902  LEKIRSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHM 1081
            LEK+RSD+K+L   Q++IAKDV+DL++WKSL S QL+ L+RDN ILRS++E V   Q  M
Sbjct: 461  LEKMRSDVKSLHSSQDLIAKDVNDLISWKSLASTQLDGLLRDNLILRSKIERVLEIQKSM 520

Query: 1082 ENKGIAIFLVSFVFSCITLMILFIDMVVSVCS----RTEKSGKFCGMRSSW 1222
            ENKGIA+FL+  +F  +  + LF+D+++SV      +  +S KFC   SSW
Sbjct: 521  ENKGIAVFLICLIFGILAFVRLFVDLLLSVYMAFNVQGTESRKFCWTGSSW 571


>ref|XP_006492474.1| PREDICTED: uncharacterized protein slp1-like [Citrus sinensis]
          Length = 587

 Score =  511 bits (1316), Expect = e-142
 Identities = 258/408 (63%), Positives = 318/408 (77%), Gaps = 5/408 (1%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            + K    Q G +IHRVET G EYNYASA+KGAKVL++NKEAKGA+NIL +DKDKYLRNPC
Sbjct: 165  RSKSATDQPGGVIHRVETEGTEYNYASATKGAKVLSYNKEAKGATNILSRDKDKYLRNPC 224

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAEEK+VVIELSEETLVD+ EIANFEH+SSNL++ +L GSLVYPTD WV LG+F A NVK
Sbjct: 225  SAEEKYVVIELSEETLVDSFEIANFEHHSSNLREFELHGSLVYPTDVWVKLGNFTAANVK 284

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGS 553
             +QRF L EPKWVRYLKLNLLSH+GSEFYCTLS +EVYGVDAVE MLEDLI VQ+N F  
Sbjct: 285  LAQRFRLDEPKWVRYLKLNLLSHYGSEFYCTLSVLEVYGVDAVERMLEDLIPVQENVFVP 344

Query: 554  EELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQ 733
            E+   +  P  P  E + GD+  QN+  E+++ S  E+ +VKR   K+NVP+ + E R  
Sbjct: 345  EKGRGDLNPTSPPQESSQGDEFFQNLYIELESDSSEESFDVKRAVTKSNVPDPVGEVR-H 403

Query: 734  QVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKI 913
            QVGRMP DTVLKIL+QKVRS+DL+LSVLE+YLEELNSRYGNIF+E D E+G KD +LE+I
Sbjct: 404  QVGRMPADTVLKILVQKVRSLDLNLSVLERYLEELNSRYGNIFKEFDEEMGEKDRVLERI 463

Query: 914  RSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHMENKG 1093
            RSD+ N+ + QE IAKDV DL +WKS+VS+QL  L++DN++LR +VE VQ NQV +ENKG
Sbjct: 464  RSDITNILNSQETIAKDVGDLNSWKSIVSMQLETLLKDNSVLRLKVEKVQENQVSLENKG 523

Query: 1094 IAIFLVSFVFSCITLMILFIDMVVSVCS-----RTEKSGKFCGMRSSW 1222
            I +FL+  +F    L+ LF+D++ SV        T+K GKFC + SSW
Sbjct: 524  IIVFLICLIFGIFALLRLFVDILSSVYGALSERTTQKPGKFCSVNSSW 571


>ref|XP_007199767.1| hypothetical protein PRUPE_ppa003178mg [Prunus persica]
            gi|595792039|ref|XP_007199768.1| hypothetical protein
            PRUPE_ppa003178mg [Prunus persica]
            gi|462395167|gb|EMJ00966.1| hypothetical protein
            PRUPE_ppa003178mg [Prunus persica]
            gi|462395168|gb|EMJ00967.1| hypothetical protein
            PRUPE_ppa003178mg [Prunus persica]
          Length = 596

 Score =  508 bits (1309), Expect = e-141
 Identities = 263/408 (64%), Positives = 317/408 (77%), Gaps = 5/408 (1%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            K K   G+AG I HRVE GG EYNYASA+KGAKVLAFNKEAKGASNILG+DKDKYLRNPC
Sbjct: 173  KTKSGNGEAGGIKHRVEPGGAEYNYASAAKGAKVLAFNKEAKGASNILGRDKDKYLRNPC 232

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAE KFV IELSEETLVDTI+IAN EHYSSNLK  +LLGSLVYPTD WV LG+F A N K
Sbjct: 233  SAEGKFVDIELSEETLVDTIQIANHEHYSSNLKAFELLGSLVYPTDEWVLLGNFTAANNK 292

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGS 553
             +QRF LQEPKWVRY+KLNLLSHHGSEFYCTLS +E+YGVDAVE MLEDLISV+++ F S
Sbjct: 293  LAQRFDLQEPKWVRYIKLNLLSHHGSEFYCTLSVVEIYGVDAVERMLEDLISVENSPFVS 352

Query: 554  EELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQ 733
            E    +  P     +    D+ + NIV E++ +    +S++  + +K+ VP+ I E R  
Sbjct: 353  EGATVDQKPTSSNPDSPEVDEFYHNIVKELEPEYAVGHSDLNNEIMKSEVPDPIKEVRHL 412

Query: 734  QVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKI 913
            QV RMPGDTVLKILMQKVRS+D SLSVLE+YLEE NSRYG+IF+E D ++G KD+ ++KI
Sbjct: 413  QVNRMPGDTVLKILMQKVRSLDFSLSVLERYLEESNSRYGSIFREFDKDLGEKDLDVQKI 472

Query: 914  RSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHMENKG 1093
            R D++NL + QE+IAKDV +L++W+SLVS+QL NLVRDNAILRSEVE V+  Q  ++NKG
Sbjct: 473  REDIRNLLESQEIIAKDVRNLISWQSLVSMQLGNLVRDNAILRSEVEKVREKQQSVDNKG 532

Query: 1094 IAIFLVSFVFSCITLMILFIDMVVSV-----CSRTEKSGKFCGMRSSW 1222
            I IFLV  +FS + L+ LFIDM VSV       RT++S KFC +  SW
Sbjct: 533  IIIFLVCLIFSLLALVKLFIDMAVSVYMAFSVHRTDQSRKFCRLSPSW 580


>ref|XP_002523463.1| conserved hypothetical protein [Ricinus communis]
            gi|223537291|gb|EEF38922.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 484

 Score =  508 bits (1309), Expect = e-141
 Identities = 260/388 (67%), Positives = 308/388 (79%)
 Frame = +2

Query: 2    ASSPKGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYL 181
            A S K K    QAG +IHRVE GG+EYNYASASKGAKVL FNKEAKGASNILGKDKDKYL
Sbjct: 97   AFSSKSKLGTDQAGGVIHRVEPGGKEYNYASASKGAKVLDFNKEAKGASNILGKDKDKYL 156

Query: 182  RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIA 361
            RNPCSAEEKFV+IELSEETLV TIEIANFEHYSSNLKD +LLGSLVYPTD W+ LG+F A
Sbjct: 157  RNPCSAEEKFVIIELSEETLVATIEIANFEHYSSNLKDFELLGSLVYPTDTWIRLGNFTA 216

Query: 362  GNVKHSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDN 541
             NVK +QRF LQEP+WVRYLKLNLLSH+GSEFYCTLS +EV GVDAVE MLEDLISVQ+N
Sbjct: 217  ANVKLAQRFPLQEPQWVRYLKLNLLSHYGSEFYCTLSIVEVLGVDAVERMLEDLISVQNN 276

Query: 542  QFGSEELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILE 721
             F  +E   +   +  Q E T  DD  Q +  E+ + S  ENSNVK +  KN VP+ + E
Sbjct: 277  VFVPKEETGDQKQLSSQTESTQVDDCDQELCMEMGSSSSVENSNVKHEVPKNKVPDPVDE 336

Query: 722  TRPQQVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVI 901
             R QQ GRMPGD+VLKILMQKVRS+DLSLSVLE+YLEELN RYGNIF+  D ++  KD +
Sbjct: 337  IRQQQGGRMPGDSVLKILMQKVRSLDLSLSVLERYLEELNYRYGNIFKGFDKDLVEKDTL 396

Query: 902  LEKIRSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHM 1081
            LEK+RSD+KNL D +E++AKDV DL++WKSLVS Q++NL++DN  LRS VE VQ NQ+ M
Sbjct: 397  LEKVRSDIKNLYDSKELMAKDVEDLLSWKSLVSTQMDNLLKDNFALRSMVEGVQKNQISM 456

Query: 1082 ENKGIAIFLVSFVFSCITLMILFIDMVV 1165
            ENKGIA+F +  +F  +  + L +D+++
Sbjct: 457  ENKGIAVFFICLIFGTLAFVRLLVDILL 484


>ref|XP_004516033.1| PREDICTED: uncharacterized protein LOC101491550 isoform X2 [Cicer
            arietinum] gi|502177227|ref|XP_004516034.1| PREDICTED:
            uncharacterized protein LOC101491550 isoform X3 [Cicer
            arietinum]
          Length = 564

 Score =  496 bits (1276), Expect = e-137
 Identities = 258/414 (62%), Positives = 318/414 (76%), Gaps = 7/414 (1%)
 Frame = +2

Query: 2    ASSPKGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYL 181
            A S K K   GQ+GS+IHR+E GG EYNYASASKGAKVL  NKEAKGASNIL +DKDKYL
Sbjct: 136  AISSKVKSGTGQSGSVIHRLEPGGAEYNYASASKGAKVLGSNKEAKGASNILSRDKDKYL 195

Query: 182  RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIA 361
            RNPCS EEKFV+IELSEETLVDTIEIANFEH+SSNLKD ++ GSL +PTD WV LG+F A
Sbjct: 196  RNPCSVEEKFVIIELSEETLVDTIEIANFEHHSSNLKDFEIHGSLSFPTDVWVFLGNFTA 255

Query: 362  GNVKHSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDN 541
             NV+H+QRF L+EPKWVRYLKLNL SH+GSEFYCTLS +E+YGVDAVE MLEDLI+ QDN
Sbjct: 256  SNVRHAQRFVLKEPKWVRYLKLNLQSHYGSEFYCTLSVVELYGVDAVERMLEDLINTQDN 315

Query: 542  QFGSEELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLK-NNVPNTIL 718
             F S E+N +   V P  +P   + +HQN V  +++    E ++   + +K N+VP+ I 
Sbjct: 316  LFTSGEVNDDKKTVFPHPDPAESEHVHQNTVGGVNSDPSSEITSANHETVKSNSVPDPIE 375

Query: 719  ETRPQQVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDV 898
            E R QQVGRMPGDTVLKILMQKVRS+DL+L VLE+YLE+LNSRY NIF+E   +IG KD+
Sbjct: 376  EIR-QQVGRMPGDTVLKILMQKVRSLDLNLFVLERYLEDLNSRYVNIFKEYSKDIGEKDI 434

Query: 899  ILEKIRSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVH 1078
            +L+KI+ D+KNL D Q+VIAKD SDL +WKS  SLQL++L+ DNA+LRSEVE V+  QV 
Sbjct: 435  LLQKIKEDIKNLIDQQDVIAKDASDLNSWKSQASLQLDHLLWDNAVLRSEVEKVREKQVS 494

Query: 1079 MENKGIAIFLVSFVFSCITLMILFIDMVVSVC------SRTEKSGKFCGMRSSW 1222
            +ENKG+ +FL+  +FS I ++ L +++  +VC       RT  S  FC   SSW
Sbjct: 495  LENKGVIVFLLCCIFSSIAVLWLSLEIAKNVCRALISVDRTVYSRNFCVCSSSW 548


>ref|XP_004516032.1| PREDICTED: uncharacterized protein LOC101491550 isoform X1 [Cicer
            arietinum]
          Length = 602

 Score =  496 bits (1276), Expect = e-137
 Identities = 258/414 (62%), Positives = 318/414 (76%), Gaps = 7/414 (1%)
 Frame = +2

Query: 2    ASSPKGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYL 181
            A S K K   GQ+GS+IHR+E GG EYNYASASKGAKVL  NKEAKGASNIL +DKDKYL
Sbjct: 174  AISSKVKSGTGQSGSVIHRLEPGGAEYNYASASKGAKVLGSNKEAKGASNILSRDKDKYL 233

Query: 182  RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIA 361
            RNPCS EEKFV+IELSEETLVDTIEIANFEH+SSNLKD ++ GSL +PTD WV LG+F A
Sbjct: 234  RNPCSVEEKFVIIELSEETLVDTIEIANFEHHSSNLKDFEIHGSLSFPTDVWVFLGNFTA 293

Query: 362  GNVKHSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDN 541
             NV+H+QRF L+EPKWVRYLKLNL SH+GSEFYCTLS +E+YGVDAVE MLEDLI+ QDN
Sbjct: 294  SNVRHAQRFVLKEPKWVRYLKLNLQSHYGSEFYCTLSVVELYGVDAVERMLEDLINTQDN 353

Query: 542  QFGSEELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLK-NNVPNTIL 718
             F S E+N +   V P  +P   + +HQN V  +++    E ++   + +K N+VP+ I 
Sbjct: 354  LFTSGEVNDDKKTVFPHPDPAESEHVHQNTVGGVNSDPSSEITSANHETVKSNSVPDPIE 413

Query: 719  ETRPQQVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDV 898
            E R QQVGRMPGDTVLKILMQKVRS+DL+L VLE+YLE+LNSRY NIF+E   +IG KD+
Sbjct: 414  EIR-QQVGRMPGDTVLKILMQKVRSLDLNLFVLERYLEDLNSRYVNIFKEYSKDIGEKDI 472

Query: 899  ILEKIRSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVH 1078
            +L+KI+ D+KNL D Q+VIAKD SDL +WKS  SLQL++L+ DNA+LRSEVE V+  QV 
Sbjct: 473  LLQKIKEDIKNLIDQQDVIAKDASDLNSWKSQASLQLDHLLWDNAVLRSEVEKVREKQVS 532

Query: 1079 MENKGIAIFLVSFVFSCITLMILFIDMVVSVC------SRTEKSGKFCGMRSSW 1222
            +ENKG+ +FL+  +FS I ++ L +++  +VC       RT  S  FC   SSW
Sbjct: 533  LENKGVIVFLLCCIFSSIAVLWLSLEIAKNVCRALISVDRTVYSRNFCVCSSSW 586


>ref|XP_004290252.1| PREDICTED: uncharacterized protein SLP1-like [Fragaria vesca subsp.
            vesca]
          Length = 595

 Score =  492 bits (1266), Expect = e-136
 Identities = 256/410 (62%), Positives = 320/410 (78%), Gaps = 5/410 (1%)
 Frame = +2

Query: 8    SPKGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRN 187
            S K K ++G AGSI HRVE GG EYNYASA+KGAKVLAFNKEAKGASNI+ +DKDKYLRN
Sbjct: 172  SSKSKSLIGLAGSIKHRVEPGGTEYNYASAAKGAKVLAFNKEAKGASNIISRDKDKYLRN 231

Query: 188  PCSAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGN 367
            PCSAEEKFV IELSEETLVDTI+I N EHYSSNL+D +LLGSLVYPTD WV LG+F A N
Sbjct: 232  PCSAEEKFVDIELSEETLVDTIKIGNLEHYSSNLRDFELLGSLVYPTDEWVKLGNFTAAN 291

Query: 368  VKHSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQF 547
            +K +QRF L+ PKWVRY+KL +L+H+GSEFYCT+S +E+YGVDAVE MLEDLISV+   +
Sbjct: 292  IKLAQRFDLEVPKWVRYIKLKILNHYGSEFYCTVSVIEIYGVDAVERMLEDLISVESGAY 351

Query: 548  GSEELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETR 727
             S+ +  +  PV    +   GDD   +I  E++ Q+  E SNV  + +KN+VP+ I E  
Sbjct: 352  VSDGVTVDQKPVTSHSDSPEGDDFF-DINKEMEPQAAVE-SNVNNEVIKNDVPDPIKEVL 409

Query: 728  PQQVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILE 907
             QQ  RMPGDTVLKILMQKV S+D SLS+LE+YLEE N RYG+IF+E D ++  K++ L+
Sbjct: 410  HQQGSRMPGDTVLKILMQKVHSLDFSLSLLERYLEESNLRYGSIFKEFDTDMDGKELELQ 469

Query: 908  KIRSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHMEN 1087
            KI+ +++NL + QEVIAKDV++L++W+SLVS+QL+NLVRDNAILRSEVE V+  QV ++N
Sbjct: 470  KIKENMRNLLESQEVIAKDVNNLMSWQSLVSVQLDNLVRDNAILRSEVEKVREKQVSVDN 529

Query: 1088 KGIAIFLVSFVFSCITLMILFIDMVVSVCS-----RTEKSGKFCGMRSSW 1222
            KGI IF+V  +FS + L  LF+D++VSV S      TEKS KFC M SSW
Sbjct: 530  KGIVIFVVCVLFSLLALARLFVDILVSVYSAFSVRTTEKSRKFCLMSSSW 579


>gb|EXC32470.1| putative glycosyltransferase [Morus notabilis]
          Length = 827

 Score =  490 bits (1262), Expect = e-136
 Identities = 264/414 (63%), Positives = 313/414 (75%), Gaps = 12/414 (2%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            K K   GQAG I HRVE GG+EYNYASASKGAKVLAFNKEAKGASNILGKD+DKYLRNPC
Sbjct: 193  KSKSGNGQAGGIKHRVEPGGKEYNYASASKGAKVLAFNKEAKGASNILGKDEDKYLRNPC 252

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAEEKFVVIELSEETLVD+IEIANFEHYSSNLKD +LLGSLVYPTD WV LG F A NVK
Sbjct: 253  SAEEKFVVIELSEETLVDSIEIANFEHYSSNLKDFELLGSLVYPTDEWVKLGEFRANNVK 312

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQD--NQF 547
             +QRF L EPKWVRYLKLNLLSH+GSEFYCTLS +EVYGVDAVE MLEDLI V+   +  
Sbjct: 313  LAQRFVLSEPKWVRYLKLNLLSHYGSEFYCTLSVIEVYGVDAVERMLEDLIFVEGSVSVS 372

Query: 548  GSEELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETR 727
             SE    +  P+  Q E   G DL Q++  E  +Q+         + +K+NVP+ I E R
Sbjct: 373  VSEGATADQKPLLSQPETLAGYDLDQHMDKETSSQT---------EIMKSNVPDPIEEVR 423

Query: 728  PQQVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILE 907
             QQ GRMPGD VLKIL+QKVRS+DL+LSVLE+YLEEL S+YGNIF+E+D +IG KDV+LE
Sbjct: 424  HQQTGRMPGDAVLKILVQKVRSLDLNLSVLERYLEELTSKYGNIFKEIDKDIGDKDVLLE 483

Query: 908  KIRSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHMEN 1087
             IR+D+++L + + +IAKDV DL +WKSLVS Q++N+VRDNAILR EVE V+  Q+ +EN
Sbjct: 484  NIRTDIRDLLESRRIIAKDVDDLTSWKSLVSFQMDNIVRDNAILRYEVEKVREKQMSIEN 543

Query: 1088 KGIAIFLVSFVFSCITLMILFIDMVVSV-----CSRTEKS-----GKFCGMRSS 1219
            K I IF+V  +FS + ++ LFID+  SV       RT         KFC + SS
Sbjct: 544  KNIIIFIVCLIFSSLAVVRLFIDVAASVYKALSAERTNNCHSNSWKKFCWISSS 597


>ref|XP_007019951.1| Galactose-binding protein isoform 10, partial [Theobroma cacao]
            gi|508725279|gb|EOY17176.1| Galactose-binding protein
            isoform 10, partial [Theobroma cacao]
          Length = 515

 Score =  482 bits (1241), Expect = e-133
 Identities = 240/350 (68%), Positives = 293/350 (83%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            + K   GQAG + HRVE GG+EYNYASASKGAKVL  NKEAKGASNILGKDKDKYLRNPC
Sbjct: 165  RSKSGTGQAG-VKHRVEPGGKEYNYASASKGAKVLLCNKEAKGASNILGKDKDKYLRNPC 223

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAEEKFV+IELSEETLVDTIEIANFEHYSS LKD +LLGSL +PTD W+ LG+F AGNVK
Sbjct: 224  SAEEKFVIIELSEETLVDTIEIANFEHYSSKLKDFELLGSLFFPTDVWIKLGNFTAGNVK 283

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGS 553
            H+QRF L+EPKWVRYLKLNLLSH+GSEFYCTLS +EVYGVDAVE MLEDLISVQDN F S
Sbjct: 284  HAQRFVLKEPKWVRYLKLNLLSHYGSEFYCTLSVIEVYGVDAVERMLEDLISVQDNLFAS 343

Query: 554  EELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQ 733
            ++   +   +  +LEPT G+ ++QN   E+ ++S  ENSN++ D   N VP+ + +   Q
Sbjct: 344  DDGTRDQKQMPSKLEPTQGNSVYQNSHKEMGSESSVENSNLQHDVFNNIVPSPVEDIHHQ 403

Query: 734  QVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKI 913
            QVGR+PGD+VLKILMQKVR++DL+LSVLE+YLEELNS+YGNIF+E D +IG KD +LEKI
Sbjct: 404  QVGRVPGDSVLKILMQKVRALDLNLSVLERYLEELNSKYGNIFKEFDEDIGEKDKLLEKI 463

Query: 914  RSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQ 1063
            +SD+K+L D Q+++AKD+ D+ +WKSLVS+QL+ ++RDNA LRS+VE V+
Sbjct: 464  KSDIKDLLDSQKIMAKDIGDVASWKSLVSIQLDTILRDNADLRSKVEKVR 513


>ref|XP_007019947.1| Galactose-binding protein isoform 6, partial [Theobroma cacao]
            gi|508725275|gb|EOY17172.1| Galactose-binding protein
            isoform 6, partial [Theobroma cacao]
          Length = 482

 Score =  482 bits (1241), Expect = e-133
 Identities = 240/350 (68%), Positives = 293/350 (83%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            + K   GQAG + HRVE GG+EYNYASASKGAKVL  NKEAKGASNILGKDKDKYLRNPC
Sbjct: 132  RSKSGTGQAG-VKHRVEPGGKEYNYASASKGAKVLLCNKEAKGASNILGKDKDKYLRNPC 190

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAEEKFV+IELSEETLVDTIEIANFEHYSS LKD +LLGSL +PTD W+ LG+F AGNVK
Sbjct: 191  SAEEKFVIIELSEETLVDTIEIANFEHYSSKLKDFELLGSLFFPTDVWIKLGNFTAGNVK 250

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGS 553
            H+QRF L+EPKWVRYLKLNLLSH+GSEFYCTLS +EVYGVDAVE MLEDLISVQDN F S
Sbjct: 251  HAQRFVLKEPKWVRYLKLNLLSHYGSEFYCTLSVIEVYGVDAVERMLEDLISVQDNLFAS 310

Query: 554  EELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQ 733
            ++   +   +  +LEPT G+ ++QN   E+ ++S  ENSN++ D   N VP+ + +   Q
Sbjct: 311  DDGTRDQKQMPSKLEPTQGNSVYQNSHKEMGSESSVENSNLQHDVFNNIVPSPVEDIHHQ 370

Query: 734  QVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKI 913
            QVGR+PGD+VLKILMQKVR++DL+LSVLE+YLEELNS+YGNIF+E D +IG KD +LEKI
Sbjct: 371  QVGRVPGDSVLKILMQKVRALDLNLSVLERYLEELNSKYGNIFKEFDEDIGEKDKLLEKI 430

Query: 914  RSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQ 1063
            +SD+K+L D Q+++AKD+ D+ +WKSLVS+QL+ ++RDNA LRS+VE V+
Sbjct: 431  KSDIKDLLDSQKIMAKDIGDVASWKSLVSIQLDTILRDNADLRSKVEKVR 480


>ref|XP_004141528.1| PREDICTED: uncharacterized protein LOC101220988 [Cucumis sativus]
            gi|449481474|ref|XP_004156194.1| PREDICTED:
            uncharacterized protein LOC101230695 [Cucumis sativus]
          Length = 584

 Score =  481 bits (1237), Expect = e-133
 Identities = 242/403 (60%), Positives = 307/403 (76%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            +GK   GQAG+ IHR+E GG EYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC
Sbjct: 169  QGKSETGQAGNTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 228

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAEEKFVVIELSEETLV TIEIANFEH+SSNLK+ ++ GSLVYPTD W  LG+F A N K
Sbjct: 229  SAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAK 288

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGS 553
            H+ RF L++PKWVRYLKLN L+H+GSEFYCTLS +EVYG+DAVE+MLEDLIS Q     S
Sbjct: 289  HAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSIS 348

Query: 554  EELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQ 733
            +E   +   +  Q  P + +  H+  +  + N+ G +  +++    K+N P  + E+  Q
Sbjct: 349  DEATHDKRVIPSQPGP-IDEVSHRRELQSVANEEGDDGVDIELS--KSNTPEPVEESHHQ 405

Query: 734  QVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKI 913
            Q GRMPGDTVLKIL QKVRS+DLSLSVLE+YLE+L S+YGNIF+E D +IG  ++++EK 
Sbjct: 406  QPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKT 465

Query: 914  RSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHMENKG 1093
            ++D++N+   Q+   KD+ DL++WKS+VSLQL+ L R N+ILRSE+E VQ NQ+ +ENKG
Sbjct: 466  QADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKG 525

Query: 1094 IAIFLVSFVFSCITLMILFIDMVVSVCSRTEKSGKFCGMRSSW 1222
            I +FLV  +FS + +  LF+ +V+ V  RT  S KFC +  SW
Sbjct: 526  IVVFLVCLIFSSLAIFRLFLHIVLRVYERTNNSRKFCCISPSW 568


>ref|XP_003534427.1| PREDICTED: uncharacterized protein LOC100783254 [Glycine max]
          Length = 541

 Score =  481 bits (1237), Expect = e-133
 Identities = 242/403 (60%), Positives = 304/403 (75%), Gaps = 6/403 (1%)
 Frame = +2

Query: 32   GQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKF 211
            G +GS++HRVE GG EYNYASAS GAK+L  NKEAKGASNIL +DKDKYLRNPCSAE+KF
Sbjct: 124  GSSGSVMHRVEPGGAEYNYASASMGAKLLGSNKEAKGASNILSRDKDKYLRNPCSAEDKF 183

Query: 212  VVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVKHSQRFT 391
            V+IELSEETLVDTIEIANFEH+SSNLK  +LLGSL +PTD WV LG+F A NV+H+QRF 
Sbjct: 184  VIIELSEETLVDTIEIANFEHHSSNLKAFELLGSLSFPTDVWVFLGNFTASNVRHAQRFV 243

Query: 392  LQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGSEELNTE 571
            LQ+PKWVRYLKLNL SH+GSEFYCTLS +EVYGVDAVE MLEDLI  QDN     + N +
Sbjct: 244  LQQPKWVRYLKLNLQSHYGSEFYCTLSVVEVYGVDAVERMLEDLIHTQDNLLAPGDGNAD 303

Query: 572  ATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQQVGRMP 751
               V P   P   +D HQN    I++    + S+   + L +NVP+ + E R QQVGRMP
Sbjct: 304  KMTVSPHPNPPESEDAHQNTFGGINSYPASDISSANHEKLNSNVPDPVEEIR-QQVGRMP 362

Query: 752  GDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKIRSDLKN 931
            GDTVLKILMQKVR++DL+L VLE+Y+E+LN+RY NIF+E   +IG KD++++ I+ D++N
Sbjct: 363  GDTVLKILMQKVRTLDLNLFVLERYMEDLNTRYVNIFKEYSKDIGGKDILIQNIKEDIRN 422

Query: 932  LADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHMENKGIAIFLV 1111
            L D Q+ I KD SDL +WKS +S+Q  +L+RDNA+LRSEV  V+  Q  +ENKG+ +FLV
Sbjct: 423  LVDQQDAITKDGSDLKSWKSHISMQFGHLLRDNAVLRSEVNEVRRKQASLENKGVLVFLV 482

Query: 1112 SFVFSCITLMILFIDMVVSV------CSRTEKSGKFCGMRSSW 1222
              +FS + ++ L +DM  SV       +RT+ S KFC + SSW
Sbjct: 483  CCIFSMLVILRLSLDMATSVYRVLQSVNRTDCSRKFCAVSSSW 525


>ref|XP_006356930.1| PREDICTED: uncharacterized protein LOC102595355 isoform X1 [Solanum
            tuberosum] gi|565381125|ref|XP_006356931.1| PREDICTED:
            uncharacterized protein LOC102595355 isoform X2 [Solanum
            tuberosum] gi|565381127|ref|XP_006356932.1| PREDICTED:
            uncharacterized protein LOC102595355 isoform X3 [Solanum
            tuberosum]
          Length = 574

 Score =  478 bits (1229), Expect = e-132
 Identities = 252/407 (61%), Positives = 305/407 (74%)
 Frame = +2

Query: 2    ASSPKGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYL 181
            A + K    +G A  IIHR+E GG EYNYASASKGAKVLA+NKEAKGASNILG+DKDKYL
Sbjct: 163  AFNAKNHNKIGHAEGIIHRLEPGGSEYNYASASKGAKVLAYNKEAKGASNILGRDKDKYL 222

Query: 182  RNPCSAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIA 361
            RNPCSAEEKFVVIELSEETLVDT+E+ANFEH+SSNLKD +LLGS +YPTD W+ LG+F A
Sbjct: 223  RNPCSAEEKFVVIELSEETLVDTVEVANFEHHSSNLKDFELLGSPIYPTDTWIKLGNFTA 282

Query: 362  GNVKHSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDN 541
             NV+H+QRF L EPKWVRYLKLNLL H+GSEFYCTLS +EVYGVDAVEIML+DLIS QD 
Sbjct: 283  VNVRHAQRFLLPEPKWVRYLKLNLLGHYGSEFYCTLSILEVYGVDAVEIMLDDLISDQDK 342

Query: 542  QFGSEELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILE 721
             F  E+ + E   V  Q     G+   QN   E++           +  +  +VP+ + E
Sbjct: 343  LFVPEQTSNEDKSVPTQHVSNHGETF-QNANDEMEKD--------LQGVMTTDVPDPVEE 393

Query: 722  TRPQQVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVI 901
             R QQV RMPGD+ LKILM+KVRS+D++LSVLE+YLEELNSRYG IF++ D+E+G KDV+
Sbjct: 394  IRRQQVNRMPGDS-LKILMKKVRSLDINLSVLERYLEELNSRYGKIFKDFDSEMGEKDVL 452

Query: 902  LEKIRSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHM 1081
            L+ IRSD++ L+  ++ + K+V DLV+WKSLVS QL  ++R NAILR EVE VQ NQVHM
Sbjct: 453  LQNIRSDIRGLSHSKDALGKEVVDLVSWKSLVSTQLEEIIRGNAILRKEVEKVQRNQVHM 512

Query: 1082 ENKGIAIFLVSFVFSCITLMILFIDMVVSVCSRTEKSGKFCGMRSSW 1222
            ENKGI IFLV   F  + L  L +D V+S   R+E S KFC    SW
Sbjct: 513  ENKGIVIFLVCSFFGLLALFKLLVDTVLS-NYRSENSRKFCSESYSW 558


>gb|AAU04771.1| membrane protein-like [Cucumis melo]
          Length = 584

 Score =  476 bits (1224), Expect = e-131
 Identities = 242/403 (60%), Positives = 304/403 (75%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            +GK   GQAG+ IHR+E GG EYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC
Sbjct: 169  RGKSETGQAGNTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 228

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAEEKFVVIELSEETLV TIEIANFEH+SSNLK+ ++ GSLVYPTD W  LG+F A N K
Sbjct: 229  SAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAK 288

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGS 553
            H+ RF L++PKWVRYLKLN L+H+GSEFYCTLS +EVYG+DAVE+MLEDLIS Q     S
Sbjct: 289  HAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSIS 348

Query: 554  EELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQ 733
            +E   +   +  Q  P + +  H   +  + N+ G +  +++    K+N P+ + E+  Q
Sbjct: 349  DEATPDKRVIPSQPGP-IDEVSHGRELQSLANEEGGDGVDLELS--KSNTPDPVEESHHQ 405

Query: 734  QVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKI 913
            Q GRMPGDTVLKIL QKVRS+DLSLSVLE+YLE+L S+YGNIF+E D +IG  ++++EK 
Sbjct: 406  QPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKT 465

Query: 914  RSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILRSEVEMVQLNQVHMENKG 1093
            + D++N+   Q+   KD+ DL++WKS+VSLQL+ L R N+ILRSE+E VQ NQ  +ENKG
Sbjct: 466  QEDIRNILKIQDNTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQTSLENKG 525

Query: 1094 IAIFLVSFVFSCITLMILFIDMVVSVCSRTEKSGKFCGMRSSW 1222
            I +FLV  +FS   +  LF+ +V+ V  RT  S KFC +  SW
Sbjct: 526  IVVFLVCLIFSSFAIFRLFLHIVLRVYERTNNSRKFCCISPSW 568


>ref|XP_007019949.1| Galactose-binding protein isoform 8 [Theobroma cacao]
            gi|508725277|gb|EOY17174.1| Galactose-binding protein
            isoform 8 [Theobroma cacao]
          Length = 513

 Score =  475 bits (1223), Expect = e-131
 Identities = 236/343 (68%), Positives = 287/343 (83%)
 Frame = +2

Query: 14   KGKPVMGQAGSIIHRVETGGREYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPC 193
            + K   GQAG + HRVE GG+EYNYASASKGAKVL  NKEAKGASNILGKDKDKYLRNPC
Sbjct: 165  RSKSGTGQAG-VKHRVEPGGKEYNYASASKGAKVLLCNKEAKGASNILGKDKDKYLRNPC 223

Query: 194  SAEEKFVVIELSEETLVDTIEIANFEHYSSNLKDLDLLGSLVYPTDRWVNLGSFIAGNVK 373
            SAEEKFV+IELSEETLVDTIEIANFEHYSS LKD +LLGSL +PTD W+ LG+F AGNVK
Sbjct: 224  SAEEKFVIIELSEETLVDTIEIANFEHYSSKLKDFELLGSLFFPTDVWIKLGNFTAGNVK 283

Query: 374  HSQRFTLQEPKWVRYLKLNLLSHHGSEFYCTLSAMEVYGVDAVEIMLEDLISVQDNQFGS 553
            H+QRF L+EPKWVRYLKLNLLSH+GSEFYCTLS +EVYGVDAVE MLEDLISVQDN F S
Sbjct: 284  HAQRFVLKEPKWVRYLKLNLLSHYGSEFYCTLSVIEVYGVDAVERMLEDLISVQDNLFAS 343

Query: 554  EELNTEATPVDPQLEPTVGDDLHQNIVTEIDNQSGPENSNVKRDGLKNNVPNTILETRPQ 733
            ++   +   +  +LEPT G+ ++QN   E+ ++S  ENSN++ D   N VP+ + +   Q
Sbjct: 344  DDGTRDQKQMPSKLEPTQGNSVYQNSHKEMGSESSVENSNLQHDVFNNIVPSPVEDIHHQ 403

Query: 734  QVGRMPGDTVLKILMQKVRSIDLSLSVLEQYLEELNSRYGNIFQELDNEIGIKDVILEKI 913
            QVGR+PGD+VLKILMQKVR++DL+LSVLE+YLEELNS+YGNIF+E D +IG KD +LEKI
Sbjct: 404  QVGRVPGDSVLKILMQKVRALDLNLSVLERYLEELNSKYGNIFKEFDEDIGEKDKLLEKI 463

Query: 914  RSDLKNLADGQEVIAKDVSDLVAWKSLVSLQLNNLVRDNAILR 1042
            +SD+K+L D Q+++AKD+ D+ +WKSLVS+QL+ ++RDNA LR
Sbjct: 464  KSDIKDLLDSQKIMAKDIGDVASWKSLVSIQLDTILRDNADLR 506

