BLASTX nr result
ID: Akebia24_contig00023728
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00023728
         (1140 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_003634489.1| PREDICTED: uncharacterized protein LOC100254...   335   2e-89
emb|CBI19286.3| unnamed protein product [Vitis vinifera]              335   2e-89
ref|XP_002285638.1| PREDICTED: uncharacterized protein LOC100254...   335   2e-89
ref|XP_004501666.1| PREDICTED: uncharacterized protein LOC101490...   330   7e-88
ref|XP_006472434.1| PREDICTED: uncharacterized protein LOC102612...   325   3e-86
ref|XP_006581514.1| PREDICTED: uncharacterized protein LOC100785...   324   5e-86
ref|XP_003526559.1| PREDICTED: uncharacterized protein LOC100785...   324   5e-86
ref|XP_006433793.1| hypothetical protein CICLE_v10000004mg [Citr...   322   1e-85
ref|XP_004152743.1| PREDICTED: uncharacterized protein LOC101207...   318   3e-84
ref|XP_002527368.1| SAB, putative [Ricinus communis] gi|22353328...   317   8e-84
ref|XP_007136305.1| hypothetical protein PHAVU_009G035200g [Phas...   315   2e-83
ref|XP_002301119.2| hypothetical protein POPTR_0002s11130g [Popu...   315   3e-83
ref|XP_006386459.1| SABRE family protein [Populus trichocarpa] g...   315   3e-83
ref|XP_004238014.1| PREDICTED: uncharacterized protein LOC101260...   313   9e-83
gb|EYU36461.1| hypothetical protein MIMGU_mgv1a000017mg [Mimulus...   312   1e-82
ref|XP_007018271.1| Golgi-body localization protein domain isofo...   305   2e-80
ref|XP_007018270.1| Golgi-body localization protein domain isofo...   305   2e-80
ref|XP_007018269.1| Golgi-body localization protein domain isofo...   305   2e-80
ref|XP_007018268.1| Golgi-body localization protein domain isofo...   305   2e-80
ref|XP_003602872.1| hypothetical protein MTR_3g099870 [Medicago ...   303   1e-79

>ref|XP_003634489.1| PREDICTED: uncharacterized protein LOC100254031 isoform 2 [Vitis vinifera] Length = 2618 Score = 335 bits (860), Expect = 2e-89 Identities = 174/258 (67%), Positives = 204/258 (79%), Gaps = 3/258 (1%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SP KFLF FLL+SII W++FIFAA+LLAW LS+IMGASV FRV GW CLRDVVVKF Sbjct: 1 MAASPAKFLFGFLLVSIILWLIFIFAARLLAWILSQIMGASVGFRVGGWKCLRDVVVKFN 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGA+ES+SVGEI+LSLRQSLVKL GFIS+DPKLQ+LICDLEVV+RP Sbjct: 61 KGAIESVSVGEIRLSLRQSLVKL-FGFISKDPKLQVLICDLEVVMRPSGKSTKKIRSQKP 119 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWM+VANMAR+LSVS+++LV+K KATIEVKDLRVDISKDGGSKPTLFV+L ++ Sbjct: 120 RSSGRGKWMVVANMARFLSVSISDLVLKTPKATIEVKDLRVDISKDGGSKPTLFVKLQVL 179 Query: 226 PFLVHIGESRLSYDQSSSLNQGDDRA---SLTMMERASDPFICEELSLLCEFGHDREAGV 56 P +VH+G+ RL+ DQSS+ NQG A S MMER+S PF CEELSL CEFGHD E GV Sbjct: 180 PLVVHVGDPRLTCDQSSNFNQGSVSAGQPSFCMMERSSAPFYCEELSLSCEFGHDSEVGV 239 Query: 55 VIRNVDVTSGVVTCNLNE 2 +I+NVD+ G V NLNE Sbjct: 240 IIKNVDIAIGEVAVNLNE 257 >emb|CBI19286.3| unnamed protein product [Vitis vinifera] Length = 2465 Score = 335 bits (860), Expect = 2e-89 Identities = 174/258 (67%), Positives = 204/258 (79%), Gaps = 3/258 (1%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SP KFLF FLL+SII W++FIFAA+LLAW LS+IMGASV FRV GW CLRDVVVKF Sbjct: 1 MAASPAKFLFGFLLVSIILWLIFIFAARLLAWILSQIMGASVGFRVGGWKCLRDVVVKFN 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGA+ES+SVGEI+LSLRQSLVKL GFIS+DPKLQ+LICDLEVV+RP Sbjct: 61 KGAIESVSVGEIRLSLRQSLVKL-FGFISKDPKLQVLICDLEVVMRPSGKSTKKIRSQKP 119 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWM+VANMAR+LSVS+++LV+K KATIEVKDLRVDISKDGGSKPTLFV+L ++ Sbjct: 120
RSSGRGKWMVVANMARFLSVSISDLVLKTPKATIEVKDLRVDISKDGGSKPTLFVKLQVL 179 Query: 226 PFLVHIGESRLSYDQSSSLNQGDDRA---SLTMMERASDPFICEELSLLCEFGHDREAGV 56 P +VH+G+ RL+ DQSS+ NQG A S MMER+S PF CEELSL CEFGHD E GV Sbjct: 180 PLVVHVGDPRLTCDQSSNFNQGSVSAGQPSFCMMERSSAPFYCEELSLSCEFGHDSEVGV 239 Query: 55 VIRNVDVTSGVVTCNLNE 2 +I+NVD+ G V NLNE Sbjct: 240 IIKNVDIAIGEVAVNLNE 257 >ref|XP_002285638.1| PREDICTED: uncharacterized protein LOC100254031 isoform 1 [Vitis vinifera] Length = 2641 Score = 335 bits (860), Expect = 2e-89 Identities = 174/258 (67%), Positives = 204/258 (79%), Gaps = 3/258 (1%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SP KFLF FLL+SII W++FIFAA+LLAW LS+IMGASV FRV GW CLRDVVVKF Sbjct: 1 MAASPAKFLFGFLLVSIILWLIFIFAARLLAWILSQIMGASVGFRVGGWKCLRDVVVKFN 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGA+ES+SVGEI+LSLRQSLVKL GFIS+DPKLQ+LICDLEVV+RP Sbjct: 61 KGAIESVSVGEIRLSLRQSLVKL-FGFISKDPKLQVLICDLEVVMRPSGKSTKKIRSQKP 119 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWM+VANMAR+LSVS+++LV+K KATIEVKDLRVDISKDGGSKPTLFV+L ++ Sbjct: 120 RSSGRGKWMVVANMARFLSVSISDLVLKTPKATIEVKDLRVDISKDGGSKPTLFVKLQVL 179 Query: 226 PFLVHIGESRLSYDQSSSLNQGDDRA---SLTMMERASDPFICEELSLLCEFGHDREAGV 56 P +VH+G+ RL+ DQSS+ NQG A S MMER+S PF CEELSL CEFGHD E GV Sbjct: 180 PLVVHVGDPRLTCDQSSNFNQGSVSAGQPSFCMMERSSAPFYCEELSLSCEFGHDSEVGV 239 Query: 55 VIRNVDVTSGVVTCNLNE 2 +I+NVD+ G V NLNE Sbjct: 240 IIKNVDIAIGEVAVNLNE 257 >ref|XP_004501666.1| PREDICTED: uncharacterized protein LOC101490938 [Cicer arietinum] Length = 2630 Score = 330 bits (846), Expect = 7e-88 Identities = 172/259 (66%), Positives = 203/259 (78%), Gaps = 4/259 (1%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPV FLF FLLLSI W++FIFA+ LLAW LS I+GASV FRV GW CLRDVVVKFK Sbjct: 1 MAASPVNFLFGFLLLSITLWLLFIFASGLLAWILSWILGASVGFRVGGWKCLRDVVVKFK 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 
KGAVES+SVGEIKLSLRQSLVKLGVGFISRDPKLQ+LICDLEVV+RP Sbjct: 61 KGAVESVSVGEIKLSLRQSLVKLGVGFISRDPKLQVLICDLEVVMRPSNKIPRKKKTRKS 120 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWMIV N+ARYLSV VT+LV+KM K T+E+K+L VDISKDGGSK +L VRL ++ Sbjct: 121 RASGRGKWMIVGNIARYLSVCVTDLVLKMPKCTVEIKELNVDISKDGGSKSSLLVRLQVL 180 Query: 226 PFLVHIGESRLSYDQSSSLNQG----DDRASLTMMERASDPFICEELSLLCEFGHDREAG 59 P LVHIGE R+SYDQ S+L+ G +AS+ +ER+S PFICE+ S+ EFGHDRE G Sbjct: 181 PILVHIGEPRVSYDQLSNLSGGGCSSSYQASIASIERSSAPFICEKFSVSSEFGHDREVG 240 Query: 58 VVIRNVDVTSGVVTCNLNE 2 ++I+NVD++SG VT NLNE Sbjct: 241 IIIKNVDISSGEVTLNLNE 259 >ref|XP_006472434.1| PREDICTED: uncharacterized protein LOC102612548 [Citrus sinensis] Length = 2648 Score = 325 bits (832), Expect = 3e-86 Identities = 168/262 (64%), Positives = 208/262 (79%), Gaps = 7/262 (2%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPVKFLF FL++SI W++FIFA++L+AW LSRIMGASV FRV GW CLRDVVVKFK Sbjct: 1 MAASPVKFLFGFLIVSITLWLLFIFASRLVAWILSRIMGASVGFRVGGWKCLRDVVVKFK 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIR---PXXXXXXXXXX 416 KG++ES+SVGEIKLSLRQSLVKLGVGFIS+DPKLQ+LICDLE+V+R Sbjct: 61 KGSIESVSVGEIKLSLRQSLVKLGVGFISKDPKLQVLICDLEIVMRTASKSSSKPKVRKP 120 Query: 415 XXXXXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRL 236 KWM+VA++AR+LSVSVT++V+K KAT+EVK+L VDISKDGGSKP L V+L Sbjct: 121 RSSSSSGRGKWMVVASIARFLSVSVTDMVVKNPKATVEVKELIVDISKDGGSKPNLVVKL 180 Query: 235 HLIPFLVHIGESRLSYDQSSSLNQGD----DRASLTMMERASDPFICEELSLLCEFGHDR 68 H++P VHIGE R+S DQS++LN G+ +AS MME+ S PF CEELSL CEFGH+R Sbjct: 181 HILPIYVHIGEPRISCDQSANLNTGETFSAGQASFPMMEKYSAPFSCEELSLSCEFGHNR 240 Query: 67 EAGVVIRNVDVTSGVVTCNLNE 2 EAGVVI+N+D++ G V+ +LNE Sbjct: 241 EAGVVIQNLDISCGEVSVSLNE 262 >ref|XP_006581514.1| PREDICTED: uncharacterized protein LOC100785854 isoform X2 [Glycine max] Length = 2638 Score = 324 bits (830), Expect = 5e-86 Identities = 169/259 (65%), Positives = 199/259 (76%), Gaps = 4/259 (1%) Frame = -3 Query: 766 
MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPV FLF FLLLSI W+VFIFA+ LLAW LSRI+GASV FRV GW CLRDVVVKFK Sbjct: 1 MAASPVNFLFGFLLLSITLWLVFIFASGLLAWILSRILGASVGFRVGGWKCLRDVVVKFK 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGA+ES+SVGEIKLSLRQSLVKLGVGFISRDPKLQ+LICDLEVV+RP Sbjct: 61 KGAIESVSVGEIKLSLRQSLVKLGVGFISRDPKLQVLICDLEVVMRPSNKSPGKKKTRKS 120 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWMIV N+ARYLSV VT+LV+K K T+E+K+L VDISKDGGSK L V L ++ Sbjct: 121 RASGRGKWMIVGNIARYLSVCVTDLVLKTPKFTVEIKELNVDISKDGGSKSNLLVGLQIL 180 Query: 226 PFLVHIGESRLSYDQSSSLNQG----DDRASLTMMERASDPFICEELSLLCEFGHDREAG 59 P VHIGE R+S D S+L+ G +AS+T +ER+S PFICE S+ CEFGHDRE G Sbjct: 181 PIFVHIGEPRVSCDFLSNLSGGGCSSSGQASITALERSSAPFICEMFSVSCEFGHDREVG 240 Query: 58 VVIRNVDVTSGVVTCNLNE 2 +VI+N+D++SG +T NLNE Sbjct: 241 IVIKNMDISSGEMTVNLNE 259 >ref|XP_003526559.1| PREDICTED: uncharacterized protein LOC100785854 isoformX1 [Glycine max] Length = 2632 Score = 324 bits (830), Expect = 5e-86 Identities = 169/259 (65%), Positives = 199/259 (76%), Gaps = 4/259 (1%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPV FLF FLLLSI W+VFIFA+ LLAW LSRI+GASV FRV GW CLRDVVVKFK Sbjct: 1 MAASPVNFLFGFLLLSITLWLVFIFASGLLAWILSRILGASVGFRVGGWKCLRDVVVKFK 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGA+ES+SVGEIKLSLRQSLVKLGVGFISRDPKLQ+LICDLEVV+RP Sbjct: 61 KGAIESVSVGEIKLSLRQSLVKLGVGFISRDPKLQVLICDLEVVMRPSNKSPGKKKTRKS 120 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWMIV N+ARYLSV VT+LV+K K T+E+K+L VDISKDGGSK L V L ++ Sbjct: 121 RASGRGKWMIVGNIARYLSVCVTDLVLKTPKFTVEIKELNVDISKDGGSKSNLLVGLQIL 180 Query: 226 PFLVHIGESRLSYDQSSSLNQG----DDRASLTMMERASDPFICEELSLLCEFGHDREAG 59 P VHIGE R+S D S+L+ G +AS+T +ER+S PFICE S+ CEFGHDRE G Sbjct: 181 PIFVHIGEPRVSCDFLSNLSGGGCSSSGQASITALERSSAPFICEMFSVSCEFGHDREVG 240 Query: 58 VVIRNVDVTSGVVTCNLNE 2 +VI+N+D++SG +T NLNE Sbjct: 
241 IVIKNMDISSGEMTVNLNE 259 >ref|XP_006433793.1| hypothetical protein CICLE_v10000004mg [Citrus clementina] gi|557535915|gb|ESR47033.1| hypothetical protein CICLE_v10000004mg [Citrus clementina] Length = 2648 Score = 322 bits (826), Expect = 1e-85 Identities = 167/262 (63%), Positives = 206/262 (78%), Gaps = 7/262 (2%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPVKFLF FL++SI W++FIFA++L+AW LSRIMGASV FRV GW CLRDVVVKFK Sbjct: 1 MAASPVKFLFGFLIVSITLWLLFIFASRLVAWILSRIMGASVGFRVGGWKCLRDVVVKFK 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIR---PXXXXXXXXXX 416 KG++ES+SVGEIKLSLRQSLVKLGVGFIS+DPKLQ+LICDLE+V+R Sbjct: 61 KGSIESVSVGEIKLSLRQSLVKLGVGFISKDPKLQVLICDLEIVMRTASKSSSKPKVRKP 120 Query: 415 XXXXXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRL 236 KWM+VA++AR+LSVSVT++V+K KAT+EVK+L VDISKDGGSKP L V+L Sbjct: 121 RSSSSSGRGKWMVVASIARFLSVSVTDMVVKNPKATVEVKELIVDISKDGGSKPNLVVKL 180 Query: 235 HLIPFLVHIGESRLSYDQSSSLNQGD----DRASLTMMERASDPFICEELSLLCEFGHDR 68 H++P VHIGE R+S DQS +LN G+ +AS MME+ S PF CEE SL CEFGH+R Sbjct: 181 HILPIYVHIGEPRISCDQSPNLNTGETFSAGQASFPMMEKYSAPFSCEEFSLSCEFGHNR 240 Query: 67 EAGVVIRNVDVTSGVVTCNLNE 2 EAGVVI+N+D++ G V+ +LNE Sbjct: 241 EAGVVIQNLDISCGEVSVSLNE 262 >ref|XP_004152743.1| PREDICTED: uncharacterized protein LOC101207547 [Cucumis sativus] gi|449516195|ref|XP_004165133.1| PREDICTED: uncharacterized LOC101207547 [Cucumis sativus] Length = 2606 Score = 318 bits (815), Expect = 3e-84 Identities = 162/260 (62%), Positives = 200/260 (76%), Gaps = 5/260 (1%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPV FLF FLL+SI W+ F+FA++L+AW LSR++GASV FRV GW CLRDVV+KF+ Sbjct: 1 MAASPVNFLFGFLLISITLWLFFMFASRLVAWVLSRVVGASVAFRVGGWKCLRDVVIKFR 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRP-XXXXXXXXXXXX 410 KGA+ESISVGEIKLSLRQSLVKLGVGFISRDPKLQ+LICDLEV +RP Sbjct: 61 KGAIESISVGEIKLSLRQSLVKLGVGFISRDPKLQILICDLEVCMRPSSKGRPKSSKPRR 120 Query: 409 
XXXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHL 230 KWM+VAN+ARYLSVS+T+LV+K KAT+EVKD +DISK+GG++P LFV+L + Sbjct: 121 TRSSGRGKWMVVANIARYLSVSITDLVVKTPKATVEVKDFSIDISKNGGTRPNLFVKLQI 180 Query: 229 IPFLVHIGESRLSYDQSSSLNQG----DDRASLTMMERASDPFICEELSLLCEFGHDREA 62 +P VHIGE R+S +QSS+L+ G +S ME++S PF CEE SL EFGHDREA Sbjct: 181 LPIFVHIGEPRVSCEQSSNLSSGGCISTVNSSFATMEKSSAPFSCEEFSLYGEFGHDREA 240 Query: 61 GVVIRNVDVTSGVVTCNLNE 2 G++++NVDVT G V NLNE Sbjct: 241 GIIVKNVDVTFGEVNLNLNE 260 >ref|XP_002527368.1| SAB, putative [Ricinus communis] gi|223533287|gb|EEF35040.1| SAB, putative [Ricinus communis] Length = 2626 Score = 317 bits (811), Expect = 8e-84 Identities = 165/263 (62%), Positives = 200/263 (76%), Gaps = 8/263 (3%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPVKFLF FL++SI W+VFIFA++LLAW LSRI+GASV FRV GW CLRDV+VKFK Sbjct: 1 MAASPVKFLFGFLMISITLWMVFIFASRLLAWILSRIVGASVGFRVGGWKCLRDVIVKFK 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRP----XXXXXXXXX 419 KG +ESISVGEI+LSLRQSLVKLGVGFISRDPKLQ+LICDLE+V+R Sbjct: 61 KGPLESISVGEIRLSLRQSLVKLGVGFISRDPKLQVLICDLEIVMRTSSKGTQKKKTRRV 120 Query: 418 XXXXXXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVR 239 KWM++AN+AR+LSVSVT+L +K KA IEVK+L++DI+KDGGSKP LFV+ Sbjct: 121 RSRSSGSGRGKWMVLANIARFLSVSVTDLAVKTPKAMIEVKELKLDITKDGGSKPNLFVK 180 Query: 238 LHLIPFLVHIGESRLSYDQSSSLNQGD----DRASLTMMERASDPFICEELSLLCEFGHD 71 LH++P ++H GE R+S DQSS+++ G S +E S F CE+ SL CEFGHD Sbjct: 181 LHILPIVIHTGEPRVSCDQSSNIDSGGCITAGETSYGSVEGPSASFSCEDFSLSCEFGHD 240 Query: 70 REAGVVIRNVDVTSGVVTCNLNE 2 RE GV+IRNVDVTSG VT NLNE Sbjct: 241 REVGVIIRNVDVTSGEVTVNLNE 263 >ref|XP_007136305.1| hypothetical protein PHAVU_009G035200g [Phaseolus vulgaris] gi|561009392|gb|ESW08299.1| hypothetical protein PHAVU_009G035200g [Phaseolus vulgaris] Length = 2631 Score = 315 bits (808), Expect = 2e-83 Identities = 162/257 (63%), Positives = 196/257 (76%), Gaps = 2/257 (0%) Frame = -3 Query: 766 
MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPV FLF FLLLSI W++FIFA+ L+AW LSRI+GASV FRV GW CLRDVVVKFK Sbjct: 1 MAASPVNFLFGFLLLSITLWLLFIFASGLVAWILSRILGASVGFRVGGWKCLRDVVVKFK 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGAVES+SVGEIKLSLRQSLVKLGVGF+SRDPKLQ+LICDLEVV+RP Sbjct: 61 KGAVESVSVGEIKLSLRQSLVKLGVGFMSRDPKLQVLICDLEVVLRPPDKTPGKKKTRKS 120 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWMIV N+ARYLSV VT+LV+K K+T+E+K+L +DISKDGGSK L VRLH++ Sbjct: 121 RASGRGKWMIVGNIARYLSVCVTDLVLKTPKSTVEIKELNLDISKDGGSKSNLLVRLHIL 180 Query: 226 PFLVHIGESRLSYDQSSSLN--QGDDRASLTMMERASDPFICEELSLLCEFGHDREAGVV 53 P VHIGE R+S D + S+ +AS+T +ER+S PF CE + CEF HDRE G+V Sbjct: 181 PIFVHIGEPRVSCDLNLSVGGCSSSGQASITAIERSSAPFFCEMFFVSCEFDHDREVGIV 240 Query: 52 IRNVDVTSGVVTCNLNE 2 I+++D++SG V NLNE Sbjct: 241 IKSMDISSGEVNVNLNE 257 >ref|XP_002301119.2| hypothetical protein POPTR_0002s11130g [Populus trichocarpa] gi|550344765|gb|EEE80392.2| hypothetical protein POPTR_0002s11130g [Populus trichocarpa] Length = 2621 Score = 315 bits (806), Expect = 3e-83 Identities = 161/261 (61%), Positives = 199/261 (76%), Gaps = 6/261 (2%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPVKFLF FL LS+ W++FIFA++L+AW LSRI+GASV FRV GW CLRDVVVKF+ Sbjct: 1 MAASPVKFLFGFLSLSVTLWLLFIFASRLMAWILSRILGASVGFRVGGWKCLRDVVVKFR 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRP--XXXXXXXXXXX 413 KG VESISVGE++LS+RQSLVKLGVGFISRDPKLQ+LICDLE+V+RP Sbjct: 61 KGPVESISVGEVRLSIRQSLVKLGVGFISRDPKLQVLICDLEIVMRPSSRGTQKTKTQRP 120 Query: 412 XXXXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLH 233 KWM++AN+AR+LSVSVT+L +K KATI+VK+LR+DISKDGGSKP L+V+L+ Sbjct: 121 RPRTSGRGKWMVLANVARFLSVSVTDLAVKTPKATIDVKELRLDISKDGGSKPNLYVKLN 180 Query: 232 LIPFLVHIGESRLSYDQSSSLNQG----DDRASLTMMERASDPFICEELSLLCEFGHDRE 65 + P L+H+GESR+ DQ + N G + M+R+S F CEELSL CEF HDRE Sbjct: 181 
ISPVLIHMGESRIISDQMPNFNNGGCISSGEVAFGNMDRSSAAFFCEELSLSCEFNHDRE 240 Query: 64 AGVVIRNVDVTSGVVTCNLNE 2 GV+I+NVD+ SG VT NLNE Sbjct: 241 VGVIIQNVDINSGEVTVNLNE 261 >ref|XP_006386459.1| SABRE family protein [Populus trichocarpa] gi|550344764|gb|ERP64256.1| SABRE family protein [Populus trichocarpa] Length = 2255 Score = 315 bits (806), Expect = 3e-83 Identities = 161/261 (61%), Positives = 199/261 (76%), Gaps = 6/261 (2%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPVKFLF FL LS+ W++FIFA++L+AW LSRI+GASV FRV GW CLRDVVVKF+ Sbjct: 1 MAASPVKFLFGFLSLSVTLWLLFIFASRLMAWILSRILGASVGFRVGGWKCLRDVVVKFR 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRP--XXXXXXXXXXX 413 KG VESISVGE++LS+RQSLVKLGVGFISRDPKLQ+LICDLE+V+RP Sbjct: 61 KGPVESISVGEVRLSIRQSLVKLGVGFISRDPKLQVLICDLEIVMRPSSRGTQKTKTQRP 120 Query: 412 XXXXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLH 233 KWM++AN+AR+LSVSVT+L +K KATI+VK+LR+DISKDGGSKP L+V+L+ Sbjct: 121 RPRTSGRGKWMVLANVARFLSVSVTDLAVKTPKATIDVKELRLDISKDGGSKPNLYVKLN 180 Query: 232 LIPFLVHIGESRLSYDQSSSLNQG----DDRASLTMMERASDPFICEELSLLCEFGHDRE 65 + P L+H+GESR+ DQ + N G + M+R+S F CEELSL CEF HDRE Sbjct: 181 ISPVLIHMGESRIISDQMPNFNNGGCISSGEVAFGNMDRSSAAFFCEELSLSCEFNHDRE 240 Query: 64 AGVVIRNVDVTSGVVTCNLNE 2 GV+I+NVD+ SG VT NLNE Sbjct: 241 VGVIIQNVDINSGEVTVNLNE 261 >ref|XP_004238014.1| PREDICTED: uncharacterized protein LOC101260131 [Solanum lycopersicum] Length = 2636 Score = 313 bits (802), Expect = 9e-83 Identities = 164/258 (63%), Positives = 193/258 (74%), Gaps = 3/258 (1%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 M SP KFLF FL SII W +F+FA+++LAW LSR MGASV FRV GW CLRD+ VKF Sbjct: 1 MDVSPAKFLFGFLFASIILWSIFVFASRMLAWILSRAMGASVSFRVGGWKCLRDIGVKFN 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGAVES+S+GEI+LS+RQSLVKLGVGFISRDPKLQ+LICDLEVV+R Sbjct: 61 KGAVESVSIGEIRLSIRQSLVKLGVGFISRDPKLQVLICDLEVVMRASNKISKKAKSRKS 120 Query: 406 
XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWM+VANMAR+LSVSVTE+V+K KAT+EVK+L +D+SKDGGSKP LFV+L L Sbjct: 121 RKSGRGKWMVVANMARFLSVSVTEVVVKTPKATVEVKELTLDLSKDGGSKPELFVKLLLA 180 Query: 226 PFLVHIGESRLSYDQSSSLN---QGDDRASLTMMERASDPFICEELSLLCEFGHDREAGV 56 P VH GESR+SYDQ S +DR L M ER S PF CEE SL+C FGHDREAGV Sbjct: 181 PIFVHFGESRVSYDQLSMHGGSFPSNDRL-LAMTERISAPFSCEEFSLMCGFGHDREAGV 239 Query: 55 VIRNVDVTSGVVTCNLNE 2 V+RNV++ +G V+ NLNE Sbjct: 240 VVRNVEIGTGDVSINLNE 257 >gb|EYU36461.1| hypothetical protein MIMGU_mgv1a000017mg [Mimulus guttatus] Length = 2637 Score = 312 bits (800), Expect = 1e-82 Identities = 158/255 (61%), Positives = 195/255 (76%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 M +SP KFLF FL SI+ WI+F+FA++LLAW LSR MGASV FRV GW CLRD+V+KF Sbjct: 1 MGASPAKFLFGFLFCSIVLWIIFMFASRLLAWILSRFMGASVGFRVGGWKCLRDIVLKFN 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGA+ESIS+GEI+LSLRQSLVKLGVGFISRDPKLQ+LICDLEVVIR Sbjct: 61 KGAIESISIGEIRLSLRQSLVKLGVGFISRDPKLQVLICDLEVVIRSSTKSTQKTRSKKS 120 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWM++ANMAR+LS+S+TELV+K KAT+++K+LRVDISKDGGS+ LFV+L L Sbjct: 121 RSSGRGKWMVLANMARFLSISLTELVLKTPKATLDIKELRVDISKDGGSEAGLFVKLQLF 180 Query: 226 PFLVHIGESRLSYDQSSSLNQGDDRASLTMMERASDPFICEELSLLCEFGHDREAGVVIR 47 P VH+GESR+ D S+ G + +++ S PF CEE SLLCEFGH+REAGVV+R Sbjct: 181 PINVHLGESRVISDH--SVTSGGTFSDNQLVDGVSAPFSCEEFSLLCEFGHNREAGVVVR 238 Query: 46 NVDVTSGVVTCNLNE 2 N+D+TSG V+ N+NE Sbjct: 239 NLDITSGEVSININE 253 >ref|XP_007018271.1| Golgi-body localization protein domain isoform 4, partial [Theobroma cacao] gi|508723599|gb|EOY15496.1| Golgi-body localization protein domain isoform 4, partial [Theobroma cacao] Length = 2164 Score = 305 bits (781), Expect = 2e-80 Identities = 159/255 (62%), Positives = 188/255 (73%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPVKFLF FL++SI W+VFIFA++LLAW LSRI+GASV 
FRV GW CLRDVVVKF Sbjct: 1 MAASPVKFLFGFLMISITLWMVFIFASRLLAWILSRIVGASVGFRVGGWKCLRDVVVKFN 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGA+ESI VGEIKLSLRQSLVKLG G IS+DPKLQ+LICDLE+V+RP Sbjct: 61 KGAIESILVGEIKLSLRQSLVKLGFGIISKDPKLQVLICDLEIVLRPSTKSSQKAKSRKP 120 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWM+VAN+AR+LSVS+T+LV+K KAT+EVK+L+VDISKDGGSKP LFV+LH++ Sbjct: 121 RTSGRGKWMVVANIARFLSVSITDLVLKTPKATVEVKELKVDISKDGGSKPNLFVKLHIL 180 Query: 226 PFLVHIGESRLSYDQSSSLNQGDDRASLTMMERASDPFICEELSLLCEFGHDREAGVVIR 47 P VH R+ +ME+ S PF CEE SL CEFGHDREAGVV+R Sbjct: 181 PISVHA-----------------IRSLSGIMEKFSAPFSCEEFSLSCEFGHDREAGVVVR 223 Query: 46 NVDVTSGVVTCNLNE 2 NVD+ G V NLNE Sbjct: 224 NVDINCGEVVVNLNE 238 >ref|XP_007018270.1| Golgi-body localization protein domain isoform 3, partial [Theobroma cacao] gi|508723598|gb|EOY15495.1| Golgi-body localization protein domain isoform 3, partial [Theobroma cacao] Length = 2591 Score = 305 bits (781), Expect = 2e-80 Identities = 159/255 (62%), Positives = 188/255 (73%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPVKFLF FL++SI W+VFIFA++LLAW LSRI+GASV FRV GW CLRDVVVKF Sbjct: 1 MAASPVKFLFGFLMISITLWMVFIFASRLLAWILSRIVGASVGFRVGGWKCLRDVVVKFN 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGA+ESI VGEIKLSLRQSLVKLG G IS+DPKLQ+LICDLE+V+RP Sbjct: 61 KGAIESILVGEIKLSLRQSLVKLGFGIISKDPKLQVLICDLEIVLRPSTKSSQKAKSRKP 120 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWM+VAN+AR+LSVS+T+LV+K KAT+EVK+L+VDISKDGGSKP LFV+LH++ Sbjct: 121 RTSGRGKWMVVANIARFLSVSITDLVLKTPKATVEVKELKVDISKDGGSKPNLFVKLHIL 180 Query: 226 PFLVHIGESRLSYDQSSSLNQGDDRASLTMMERASDPFICEELSLLCEFGHDREAGVVIR 47 P VH R+ +ME+ S PF CEE SL CEFGHDREAGVV+R Sbjct: 181 PISVHA-----------------IRSLSGIMEKFSAPFSCEEFSLSCEFGHDREAGVVVR 223 Query: 46 NVDVTSGVVTCNLNE 2 NVD+ G V NLNE Sbjct: 224 NVDINCGEVVVNLNE 238 >ref|XP_007018269.1| Golgi-body localization 
protein domain isoform 2 [Theobroma cacao] gi|508723597|gb|EOY15494.1| Golgi-body localization protein domain isoform 2 [Theobroma cacao] Length = 2155 Score = 305 bits (781), Expect = 2e-80 Identities = 159/255 (62%), Positives = 188/255 (73%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPVKFLF FL++SI W+VFIFA++LLAW LSRI+GASV FRV GW CLRDVVVKF Sbjct: 1 MAASPVKFLFGFLMISITLWMVFIFASRLLAWILSRIVGASVGFRVGGWKCLRDVVVKFN 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGA+ESI VGEIKLSLRQSLVKLG G IS+DPKLQ+LICDLE+V+RP Sbjct: 61 KGAIESILVGEIKLSLRQSLVKLGFGIISKDPKLQVLICDLEIVLRPSTKSSQKAKSRKP 120 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWM+VAN+AR+LSVS+T+LV+K KAT+EVK+L+VDISKDGGSKP LFV+LH++ Sbjct: 121 RTSGRGKWMVVANIARFLSVSITDLVLKTPKATVEVKELKVDISKDGGSKPNLFVKLHIL 180 Query: 226 PFLVHIGESRLSYDQSSSLNQGDDRASLTMMERASDPFICEELSLLCEFGHDREAGVVIR 47 P VH R+ +ME+ S PF CEE SL CEFGHDREAGVV+R Sbjct: 181 PISVHA-----------------IRSLSGIMEKFSAPFSCEEFSLSCEFGHDREAGVVVR 223 Query: 46 NVDVTSGVVTCNLNE 2 NVD+ G V NLNE Sbjct: 224 NVDINCGEVVVNLNE 238 >ref|XP_007018268.1| Golgi-body localization protein domain isoform 1 [Theobroma cacao] gi|508723596|gb|EOY15493.1| Golgi-body localization protein domain isoform 1 [Theobroma cacao] Length = 2621 Score = 305 bits (781), Expect = 2e-80 Identities = 159/255 (62%), Positives = 188/255 (73%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFIFAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFK 587 MA+SPVKFLF FL++SI W+VFIFA++LLAW LSRI+GASV FRV GW CLRDVVVKF Sbjct: 1 MAASPVKFLFGFLMISITLWMVFIFASRLLAWILSRIVGASVGFRVGGWKCLRDVVVKFN 60 Query: 586 KGAVESISVGEIKLSLRQSLVKLGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXX 407 KGA+ESI VGEIKLSLRQSLVKLG G IS+DPKLQ+LICDLE+V+RP Sbjct: 61 KGAIESILVGEIKLSLRQSLVKLGFGIISKDPKLQVLICDLEIVLRPSTKSSQKAKSRKP 120 Query: 406 XXXXXXKWMIVANMARYLSVSVTELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLI 227 KWM+VAN+AR+LSVS+T+LV+K KAT+EVK+L+VDISKDGGSKP LFV+LH++ Sbjct: 121 
RTSGRGKWMVVANIARFLSVSITDLVLKTPKATVEVKELKVDISKDGGSKPNLFVKLHIL 180 Query: 226 PFLVHIGESRLSYDQSSSLNQGDDRASLTMMERASDPFICEELSLLCEFGHDREAGVVIR 47 P VH R+ +ME+ S PF CEE SL CEFGHDREAGVV+R Sbjct: 181 PISVHA-----------------IRSLSGIMEKFSAPFSCEEFSLSCEFGHDREAGVVVR 223 Query: 46 NVDVTSGVVTCNLNE 2 NVD+ G V NLNE Sbjct: 224 NVDINCGEVVVNLNE 238 >ref|XP_003602872.1| hypothetical protein MTR_3g099870 [Medicago truncatula] gi|355491920|gb|AES73123.1| hypothetical protein MTR_3g099870 [Medicago truncatula] Length = 369 Score = 303 bits (775), Expect = 1e-79 Identities = 167/297 (56%), Positives = 200/297 (67%), Gaps = 42/297 (14%) Frame = -3 Query: 766 MASSPVKFLFVFLLLSIIGWIVFI------------------------------------ 695 MA+SPV FLF FLLLS+ W++FI Sbjct: 1 MAASPVNFLFGFLLLSVTLWLLFISFIVFVAALRCLVHGYFQGCLQTFDHLVLDFDYAFF 60 Query: 694 --FAAKLLAWFLSRIMGASVVFRVAGWNCLRDVVVKFKKGAVESISVGEIKLSLRQSLVK 521 FA+ LLAW LS+I+GASV FRV GW CLRDVVVKF+KGAVES+SVGEIKLSLRQSLVK Sbjct: 61 YRFASTLLAWILSKILGASVGFRVGGWKCLRDVVVKFEKGAVESVSVGEIKLSLRQSLVK 120 Query: 520 LGVGFISRDPKLQLLICDLEVVIRPXXXXXXXXXXXXXXXXXXXKWMIVANMARYLSVSV 341 LGVGFISRDPKLQ+LICDLEVV+RP KWMI+ N+ARYLSV V Sbjct: 121 LGVGFISRDPKLQVLICDLEVVMRPSNKSPGKKKTRKSRASGRGKWMIIGNIARYLSVFV 180 Query: 340 TELVIKMSKATIEVKDLRVDISKDGGSKPTLFVRLHLIPFLVHIGESRLSYDQSSSLNQG 161 T+LV+K K T+E+K+L VDISKDGGSK +L VRL ++P LVHIGE R S DQ S+L G Sbjct: 181 TDLVLKTPKYTLEIKELNVDISKDGGSKSSLLVRLQILPILVHIGEPRDSCDQLSNLGGG 240 Query: 160 ----DDRASLTMMERASDPFICEELSLLCEFGHDREAGVVIRNVDVTSGVVTCNLNE 2 +AS +ER+S PFICE+ S+ CEFGHDRE G+VI+++D++SG VT NLNE Sbjct: 241 GCSSSCQASFAAIERSSAPFICEKFSISCEFGHDREVGIVIKSLDISSGEVTLNLNE 297