BLASTX nr result
ID: Sinomenium22_contig00011418
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00011418
         (2317 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CAN71826.1| hypothetical protein VITISV_013841 [Vitis vinifera]    729   0.0
ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246...    697   0.0
ref|XP_006465456.1| PREDICTED: uncharacterized protein LOC102612...    682   0.0
ref|XP_006427083.1| hypothetical protein CICLE_v10024994mg [Citr...    682   0.0
ref|XP_007217014.1| hypothetical protein PRUPE_ppa002059mg [Prun...    681   0.0
emb|CBI36173.3| unnamed protein product [Vitis vinifera]               675   0.0
ref|XP_007024055.1| UDP-Glycosyltransferase superfamily protein ...    672   0.0
ref|XP_007024056.1| UDP-Glycosyltransferase superfamily protein ...    668   0.0
ref|XP_007024057.1| UDP-Glycosyltransferase superfamily protein ...    663   0.0
ref|XP_004302927.1| PREDICTED: uncharacterized protein LOC101300...    656   0.0
gb|EXC25804.1| Putative glycosyltransferase ytcC [Morus notabilis]     646   0.0
ref|XP_002528176.1| glycosyltransferase, putative [Ricinus commu...    643   0.0
ref|XP_006343109.1| PREDICTED: uncharacterized protein LOC102601...    640   e-180
ref|XP_006597141.1| PREDICTED: uncharacterized protein LOC100793...    639   e-180
ref|XP_002298139.1| glycosyl transferase family 1 family protein...    638   e-180
ref|XP_004235700.1| PREDICTED: uncharacterized protein LOC101247...    635   e-179
ref|XP_004486717.1| PREDICTED: uncharacterized protein LOC101501...    629   e-177
ref|XP_003542107.1| PREDICTED: uncharacterized protein LOC100795...    629   e-177
ref|XP_007150675.1| hypothetical protein PHAVU_005G172300g [Phas...
628 e-177 ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207... 625 e-176 >emb|CAN71826.1| hypothetical protein VITISV_013841 [Vitis vinifera] Length = 734 Score = 729 bits (1882), Expect = 0.0 Identities = 401/640 (62%), Positives = 467/640 (72%), Gaps = 8/640 (1%) Frame = -1 Query: 1897 DLSGNTVRQXXXXXXXXXXXXXXXXSTPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSN 1718 D GN VRQ STPR+SPSFRR+HSSRTPRRE R S QW R+N Sbjct: 9 DFHGNVVRQSSLRPGGSLKSTLSGRSTPRNSPSFRRSHSSRTPRREARSSGVGSQWFRNN 68 Query: 1717 RVLLWLILITLWAYLGFYVQSKWAHGDNDKEAF-VGYKSKVSIPATKQNQQAEVDTNEDS 1541 RV+ WLILITLWAYLGFYVQSKWAHGDN+++ G K I ++ N++A + N+ Sbjct: 69 RVVFWLILITLWAYLGFYVQSKWAHGDNNEDIIGFGGKPNNGISDSELNRKAPLIANDKL 128 Query: 1540 VAHNNASSEATQSQKELNLKNMTVSLAKKRH-ISLRRI----KRSTMEXXXXXXXXXXXX 1376 +A N S + K + V LAKK + + RR KRS Sbjct: 129 LAVKNGSDKNPVGSG----KKVDVVLAKKGNSVPSRRSASSKKRSKKSERSLRGKTRKQK 184 Query: 1375 XXXXXXXXVLDEGEEEIPKRNTSYGLLVGPFGLTEDRILEWNAEKRSGTCDREGGFARLV 1196 +DE E+EIPK NTSYGLLVGPFG TEDRILEW+ EKRSGTCDR G ARLV Sbjct: 185 TKTEVEVTEMDEQEQEIPKLNTSYGLLVGPFGSTEDRILEWSPEKRSGTCDRRGELARLV 244 Query: 1195 WSRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKGGLMGELNRRRIKVLED 1016 WSRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLS+KGGLM EL RRRIKVLED Sbjct: 245 WSRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKGGLMPELARRRIKVLED 304 Query: 1015 KGELSYKTAMKSDLVIAGSAVCASWIEQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLG 836 + +LS+KTAMK+DLVIAGSAVCASWIEQYIAH TAGS QI WWIMENRREYFDR+K ++ Sbjct: 305 RADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGSSQIVWWIMENRREYFDRSKLVIN 364 Query: 835 QVKMLIFLSESQSDQWLAWCKEEGITLKSPPELVPLSVNDELAFVAGISCSLNTPSFSEE 656 +VKMLIFLSESQS QWL WCKEE I L S P +VPLSVNDELAFVAGI+CSLNTPSF+ E Sbjct: 365 RVKMLIFLSESQSKQWLTWCKEENIRLISQPAVVPLSVNDELAFVAGITCSLNTPSFTTE 424 Query: 655 KMLEKRRLLRDAVRKEMGLTDNDMLVMSLSSINPGKGQLLLLESARLLVDGNISMESSGR 476 KM EKRRLLRD++RKEMGLTD DML++SLSSINPGKGQ LLES R +++ S + Sbjct: 425 KMQEKRRLLRDSIRKEMGLTDTDMLLLSLSSINPGKGQFFLLESVRSMIEQEPSQDDP-- 482 Query: 475 KIKDVSNISIEKRSSTLSRKQKSRALFQKVKHFGKSANGSYQPDESRVDMNEPKRKK-AI 
299 ++KD++ I ++ S S K SRAL Q V HF S++G +ES +++N PK K + Sbjct: 483 ELKDLAKIGQDQ--SNFSGKHYSRALLQNVNHFSVSSSGLRLSNESFIELNGPKSKNLML 540 Query: 298 TSLF-SNNHTEAISHGNTYKARKMFSDSTGIQKQALKVLIGSVGSKSNKVPYVKVLLRYL 122 SLF S + ++A+S G+ YK RK+ S++ G Q+QALKVLIGSVGSKSNKVPYVK LLR+L Sbjct: 541 PSLFPSISPSDAVSIGSGYKRRKVLSENEGTQEQALKVLIGSVGSKSNKVPYVKGLLRFL 600 Query: 121 SQHPELSRVVLWTPATTRVASLYSAADVYSINSQGPGETF 2 +H LS+ VLWTPATTRVASLYSAADVY INSQG GETF Sbjct: 601 XRHSNLSKSVLWTPATTRVASLYSAADVYVINSQGMGETF 640 >ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246448 [Vitis vinifera] Length = 691 Score = 697 bits (1799), Expect = 0.0 Identities = 382/612 (62%), Positives = 441/612 (72%), Gaps = 6/612 (0%) Frame = -1 Query: 1819 TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHG 1640 TPR+SPSFRR+HSSRTPRRE R S QW R+NRV+ WLILITLWAYLGFYVQSKWAHG Sbjct: 24 TPRNSPSFRRSHSSRTPRREARSSGVGSQWFRNNRVVFWLILITLWAYLGFYVQSKWAHG 83 Query: 1639 DNDKEAF-VGYKSKVSIPATKQNQQAEVDTNEDSVAHNNASSEATQSQKELNLKNMTVSL 1463 DN+++ G K I ++ N++A + N+ +A N S + K + V L Sbjct: 84 DNNEDIIGFGGKPNNGISDSELNRKAPLIANDKLLAVKNGSDKNPVGSG----KKVDVVL 139 Query: 1462 AKKRH-ISLRRI----KRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYGL 1298 AKK + + RR KRS +DE E+EIPK NTSYGL Sbjct: 140 AKKGNSVPSRRSASSKKRSKKSERSLRGKTRKQKTKTEVEVTEMDEQEQEIPKLNTSYGL 199 Query: 1297 LVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELAT 1118 LVGPFG TEDRILEW+ EKRSGTCDR G ARLVWSRKFVLIFHELSMTGAPLSMMELAT Sbjct: 200 LVGPFGSTEDRILEWSPEKRSGTCDRRGELARLVWSRKFVLIFHELSMTGAPLSMMELAT 259 Query: 1117 ELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASWI 938 ELLSCGATVSAVVLS+KGGLM EL RRRIKVLED+ +LS+KTAMK+DLVIAGSAVCASWI Sbjct: 260 ELLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCASWI 319 Query: 937 EQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGIT 758 EQYIAH TAGS QI WWIMENRREYFDR+K ++ +VKMLIFLSESQS QWL WCKEE I Sbjct: 320 EQYIAHFTAGSSQIVWWIMENRREYFDRSKLVINRVKMLIFLSESQSKQWLTWCKEENIR 379 Query: 757 
LKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDMLV 578 L S P +VPLSVNDELAFVAGI+CSLNTPSF+ EKM EKRRLLRD++RKEMGLTD DML+ Sbjct: 380 LISQPAVVPLSVNDELAFVAGITCSLNTPSFTTEKMQEKRRLLRDSIRKEMGLTDTDMLL 439 Query: 577 MSLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRAL 398 +SLSSINPGKGQ LLES R +++ S + ++KD+ + I + S S K SRAL Sbjct: 440 LSLSSINPGKGQFFLLESVRSMIEQEPSQDDP--ELKDL--VKIGQDQSNFSGKHYSRAL 495 Query: 397 FQKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSNNHTEAISHGNTYKARKMFSDS 218 Q V HF S+ ++ +S G+ YK RK+ S++ Sbjct: 496 LQNVNHFSVSS------------------------------SDEVSIGSGYKRRKVLSEN 525 Query: 217 TGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAADV 38 G Q+QALKVLIGSVGSKSNKVPYVK LLR+L++H LS+ VLWTPATTRVASLYSAADV Sbjct: 526 EGTQEQALKVLIGSVGSKSNKVPYVKGLLRFLTRHSNLSKSVLWTPATTRVASLYSAADV 585 Query: 37 YSINSQGPGETF 2 Y INSQG GETF Sbjct: 586 YVINSQGMGETF 597 >ref|XP_006465456.1| PREDICTED: uncharacterized protein LOC102612096 isoform X1 [Citrus sinensis] gi|568822059|ref|XP_006465457.1| PREDICTED: uncharacterized protein LOC102612096 isoform X2 [Citrus sinensis] Length = 732 Score = 682 bits (1759), Expect = 0.0 Identities = 370/638 (57%), Positives = 451/638 (70%), Gaps = 6/638 (0%) Frame = -1 Query: 1897 DLSGNTVRQXXXXXXXXXXXXXXXXSTPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSN 1718 DL N RQ STP++SPSFRR ++SRTPRRE R + S QW RSN Sbjct: 9 DLHVNVARQSSFRQGGSLKSSLSGRSTPKNSPSFRRLNASRTPRREVRSA--SLQWFRSN 66 Query: 1717 RVLLWLILITLWAYLGFYVQSKWAHGDN-DKEAFVGYKSKVSIPATKQNQQAEVDTNEDS 1541 R++ WL+LITLW YLGFYVQS+WAHG+N DK G K + I + QN++ ++ N Sbjct: 67 RLVYWLLLITLWTYLGFYVQSRWAHGENNDKFLGFGGKRRNEIVDSNQNKRRDLIANHSD 126 Query: 1540 VAHNNASSEATQSQKELNLKNMTVSLAKKRHISLRR---IKRSTMEXXXXXXXXXXXXXX 1370 + NN + + + K M + L ++R+ R KR + Sbjct: 127 LDINNGTIKTLGADS----KKMDMVLTQRRNNDASRRSVAKRKKSKRSSRGKGRGKQKAK 182 Query: 1369 XXXXXXVLDEGEEEIPKRNTSYGLLVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWS 1190 ++ EIP N SYGLLVGPFGLTEDRILEW+ EKRSGTCDR+G FAR VWS Sbjct: 183 LDVESNYMEAQLPEIPMTNASYGLLVGPFGLTEDRILEWSPEKRSGTCDRKGDFARFVWS 242 
Query: 1189 RKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKG 1010 RKF+LIFHELSMTGAPLSMMELATELLSCGATVSAVVLS++GGLM EL RR+IKVLED+G Sbjct: 243 RKFILIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKRGGLMPELARRKIKVLEDRG 302 Query: 1009 ELSYKTAMKSDLVIAGSAVCASWIEQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQV 830 E S+KT+MK+DLVIAGSAVCA+WI+QYI AG Q+ WWIMENRREYFDRAK +L +V Sbjct: 303 EPSFKTSMKADLVIAGSAVCATWIDQYITRFPAGGSQVVWWIMENRREYFDRAKLVLDRV 362 Query: 829 KMLIFLSESQSDQWLAWCKEEGITLKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKM 650 K+L+FLSESQ+ QWL WC+EE + L+S P +VPLSVNDELAFVAG +CSLNTP+ S EKM Sbjct: 363 KLLVFLSESQTKQWLTWCEEEKLKLRSQPAVVPLSVNDELAFVAGFTCSLNTPTSSPEKM 422 Query: 649 LEKRRLLRDAVRKEMGLTDNDMLVMSLSSINPGKGQLLLLESARLLVDGNISMESSGRKI 470 EKR LLRD+VRKEMGLTD DMLV+SLSSINPGKGQLLL+ESA+L+++ SM+ S KI Sbjct: 423 REKRNLLRDSVRKEMGLTDQDMLVLSLSSINPGKGQLLLVESAQLMIEQEPSMDDS--KI 480 Query: 469 KDVSNISIEKRSSTLSRKQKSRALFQKVKHFGKSANGSYQPDESRVDMNEPKRKKAIT-S 293 + N+ +K S T + R L Q G S+N ES +NEP RK ++ S Sbjct: 481 RKSRNVGRKKSSLTSRHHLRGRGLLQMSDDVGLSSNELSVSSESFTQLNEPVRKNLLSPS 540 Query: 292 LFSN-NHTEAISHGNTYKARKMFSDSTGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQ 116 LF++ +T+A+S G+ + RK+ S S G Q+QALK+LIGSVGSKSNKVPYVK +L +LSQ Sbjct: 541 LFTSIGNTDAVSFGSGHLRRKVLSKSDGKQQQALKILIGSVGSKSNKVPYVKEILEFLSQ 600 Query: 115 HPELSRVVLWTPATTRVASLYSAADVYSINSQGPGETF 2 H LS+ +LWTPATTRVASLYSAADVY INSQG GETF Sbjct: 601 HSNLSKAMLWTPATTRVASLYSAADVYVINSQGLGETF 638 >ref|XP_006427083.1| hypothetical protein CICLE_v10024994mg [Citrus clementina] gi|557529073|gb|ESR40323.1| hypothetical protein CICLE_v10024994mg [Citrus clementina] Length = 732 Score = 682 bits (1759), Expect = 0.0 Identities = 370/638 (57%), Positives = 451/638 (70%), Gaps = 6/638 (0%) Frame = -1 Query: 1897 DLSGNTVRQXXXXXXXXXXXXXXXXSTPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSN 1718 DL N RQ STP++SPSFRR ++SRTPRRE R + S QW RSN Sbjct: 9 DLHVNVARQSSFRQGGSLKSSLSGRSTPKNSPSFRRLNASRTPRREVRSA--SLQWFRSN 66 Query: 1717 RVLLWLILITLWAYLGFYVQSKWAHGDN-DKEAFVGYKSKVSIPATKQNQQAEVDTNEDS 1541 R++ WL+LITLW YLGFYVQS+WAHG+N 
DK G K + I + QN++ ++ N Sbjct: 67 RLVYWLLLITLWTYLGFYVQSRWAHGENNDKFLGFGGKRRNEIVDSNQNKRRDLIANHSD 126 Query: 1540 VAHNNASSEATQSQKELNLKNMTVSLAKKRHISLRR---IKRSTMEXXXXXXXXXXXXXX 1370 + NN + + + K + + L ++R+ R KR + Sbjct: 127 LDINNGTIKTLGADS----KKIDMVLTQRRNNDASRRSVAKRKKSKRSSRGKGRGKQKAK 182 Query: 1369 XXXXXXVLDEGEEEIPKRNTSYGLLVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWS 1190 ++ EIP N SYGLLVGPFGLTEDRILEW+ EKRSGTCDR+G FAR VWS Sbjct: 183 LDVESNYMEAQLPEIPMTNASYGLLVGPFGLTEDRILEWSPEKRSGTCDRKGDFARFVWS 242 Query: 1189 RKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKG 1010 RKF+LIFHELSMTGAPLSMMELATELLSCGATVSAVVLS++GGLM EL RR+IKVLED+G Sbjct: 243 RKFILIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKRGGLMPELARRKIKVLEDRG 302 Query: 1009 ELSYKTAMKSDLVIAGSAVCASWIEQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQV 830 E S+KT+MK+DLVIAGSAVCA+WI+QYI AG Q+ WWIMENRREYFDRAK +L +V Sbjct: 303 EPSFKTSMKADLVIAGSAVCATWIDQYITRFPAGGSQVVWWIMENRREYFDRAKLVLDRV 362 Query: 829 KMLIFLSESQSDQWLAWCKEEGITLKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKM 650 KML+FLSESQ+ QWL WC+EE + L+S P +VPLSVNDELAFVAG +CSLNTP+ S EKM Sbjct: 363 KMLVFLSESQTKQWLTWCEEEKLKLRSQPAVVPLSVNDELAFVAGFTCSLNTPTSSPEKM 422 Query: 649 LEKRRLLRDAVRKEMGLTDNDMLVMSLSSINPGKGQLLLLESARLLVDGNISMESSGRKI 470 EKR LLRD+VRKEMGLTD DMLV+SLSSINPGKGQLLL+ESA+L+++ SM+ S KI Sbjct: 423 CEKRNLLRDSVRKEMGLTDQDMLVLSLSSINPGKGQLLLVESAQLMIEQEPSMDDS--KI 480 Query: 469 KDVSNISIEKRSSTLSRKQKSRALFQKVKHFGKSANGSYQPDESRVDMNEPKRKKAIT-S 293 + N+ +K S T + R L Q G S+N ES +NEP RK ++ S Sbjct: 481 RKSRNVGRKKSSLTSRHHLRGRGLLQMSDDVGLSSNELSVSSESFTQLNEPVRKNLLSPS 540 Query: 292 LFSN-NHTEAISHGNTYKARKMFSDSTGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQ 116 LF++ +T+A+S G+ + RK+ S S G Q+QALK+LIGSVGSKSNKVPYVK +L +LSQ Sbjct: 541 LFTSIGNTDAVSFGSGHLRRKVLSKSDGKQQQALKILIGSVGSKSNKVPYVKEILEFLSQ 600 Query: 115 HPELSRVVLWTPATTRVASLYSAADVYSINSQGPGETF 2 H LS+ +LWTPATTRVASLYSAADVY INSQG GETF Sbjct: 601 HSNLSKAMLWTPATTRVASLYSAADVYVINSQGLGETF 638 >ref|XP_007217014.1| hypothetical protein PRUPE_ppa002059mg [Prunus persica] gi|462413164|gb|EMJ18213.1| 
hypothetical protein PRUPE_ppa002059mg [Prunus persica] Length = 723 Score = 681 bits (1758), Expect = 0.0 Identities = 383/618 (61%), Positives = 460/618 (74%), Gaps = 12/618 (1%) Frame = -1 Query: 1819 TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHG 1640 +PR+SPSFRR +SSRTPRRE R S G QW RSNR+L WL+LITLWAYLGFY QS WAH Sbjct: 27 SPRNSPSFRRLNSSRTPRREARSS-GGVQWFRSNRLLFWLLLITLWAYLGFYFQSSWAH- 84 Query: 1639 DNDKEAFVGYKSKVSI--PATKQNQQAEVDTNEDSVAHNNASSEATQSQKELNLKNMTVS 1466 N+KE F+G+ +K S T+QN + ++ ++ S+A N E Q+Q + K++ V Sbjct: 85 -NNKENFLGFGNKASNGNSDTEQNARRDLLASDSSMAVKN---ETNQNQVKAG-KSIDVV 139 Query: 1465 LAKKRH-ISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLD-----EGEEEIPKRNTSY 1304 L KK + +S RR S ++ E E +IPK NTSY Sbjct: 140 LTKKENGVSSRRSASSKKRSKKSARSLRGKVHGKQKKTVEVEGHETEEQELDIPKTNTSY 199 Query: 1303 GLLVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMEL 1124 G+LVGPFG EDR LEW+ + RSGTCDR+G FARLVWSR+F+LIFHELSMTGAPLSMMEL Sbjct: 200 GMLVGPFGFVEDRTLEWSPKTRSGTCDRKGDFARLVWSRRFLLIFHELSMTGAPLSMMEL 259 Query: 1123 ATELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCAS 944 ATELLSCGATVSAVVLS+KGGLM EL RRRIKVLEDK E S+KTAMK+DLVIAGSAVCAS Sbjct: 260 ATELLSCGATVSAVVLSKKGGLMPELARRRIKVLEDKVEQSFKTAMKADLVIAGSAVCAS 319 Query: 943 WIEQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEG 764 WI+QY+ H AG+ QIAWWIMENRREYFDRAK +L +VKML FLSESQS QWL WC+EE Sbjct: 320 WIDQYMDHFPAGASQIAWWIMENRREYFDRAKVVLNRVKMLAFLSESQSKQWLDWCEEEK 379 Query: 763 ITLKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDM 584 I L+S P +VPLS+NDELAFVAGI CSLNTPS S EKMLEKR+LLRD+VRKEMGLTDNDM Sbjct: 380 IKLRSQPAVVPLSINDELAFVAGIGCSLNTPSSSTEKMLEKRQLLRDSVRKEMGLTDNDM 439 Query: 583 LVMSLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRS--STLSRKQK 410 LVMSLSSINPGKGQLLLLESARL+++ + S KIK+ + KR STL+RK Sbjct: 440 LVMSLSSINPGKGQLLLLESARLVIEEPLKYNS---KIKN----PVRKRQARSTLARKHH 492 Query: 409 SRALFQKVKHFGKSANGSYQPDESRVDMNEPKRKK-AITSLFSN-NHTEAISHGNTYKAR 236 RALFQ++ G S+N +ES V +NEP++KK + SL+++ + T ++ T+K R Sbjct: 493 
LRALFQELNDDGVSSNELPLSNESDVQLNEPQKKKLRLRSLYTSFDDTGDLTFNVTHK-R 551 Query: 235 KMFSDSTGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASL 56 K+ SD+ G +Q++K LIGSVGSKSNKV YVK LL +LSQH +S+ VLWTPATTRVA+L Sbjct: 552 KVLSDNGGTLEQSVKFLIGSVGSKSNKVLYVKELLGFLSQHSNMSKSVLWTPATTRVAAL 611 Query: 55 YSAADVYSINSQGPGETF 2 YSAADVY +NSQG GETF Sbjct: 612 YSAADVYVMNSQGLGETF 629 >emb|CBI36173.3| unnamed protein product [Vitis vinifera] Length = 683 Score = 675 bits (1741), Expect = 0.0 Identities = 381/638 (59%), Positives = 435/638 (68%), Gaps = 6/638 (0%) Frame = -1 Query: 1897 DLSGNTVRQXXXXXXXXXXXXXXXXSTPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSN 1718 D GN VRQ STPR+SPSFRR+HSSRTPRRE R S QW R+N Sbjct: 9 DFHGNVVRQSSLRPGGSLKSTLSGRSTPRNSPSFRRSHSSRTPRREARSSGVGSQWFRNN 68 Query: 1717 RVLLWLILITLWAYLGFYVQSKWAHGDNDKEAF-VGYKSKVSIPATKQNQQAEVDTNEDS 1541 RV+ WLILITLWAYLGFYVQSKWAHGDN+++ G K I ++ N++A + N+ Sbjct: 69 RVVFWLILITLWAYLGFYVQSKWAHGDNNEDIIGFGGKPNNGISDSELNRKAPLIANDKL 128 Query: 1540 VAHNNASSEATQSQKELNLKNMTVSLAKKRH-ISLRRI----KRSTMEXXXXXXXXXXXX 1376 +A N S + K + V LAKK + + RR KRS Sbjct: 129 LAVKNGSDKNPVGSG----KKVDVVLAKKGNSVPSRRSASSKKRSKKSERSLRGKTRKQK 184 Query: 1375 XXXXXXXXVLDEGEEEIPKRNTSYGLLVGPFGLTEDRILEWNAEKRSGTCDREGGFARLV 1196 +DE E+EIPK NTSYGLLVGPFG TEDRILEW+ EKRSGTCDR G ARLV Sbjct: 185 TKTEVEVTEMDEQEQEIPKLNTSYGLLVGPFGSTEDRILEWSPEKRSGTCDRRGELARLV 244 Query: 1195 WSRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKGGLMGELNRRRIKVLED 1016 WSRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLS+KGGLM EL RRRIKVLED Sbjct: 245 WSRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKGGLMPELARRRIKVLED 304 Query: 1015 KGELSYKTAMKSDLVIAGSAVCASWIEQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLG 836 + +LS+KTAMK+DLVIAGSAVCASWIEQYIAH TAGS QI WWIMENRREYFDR+K ++ Sbjct: 305 RADLSFKTAMKADLVIAGSAVCASWIEQYIAHFTAGSSQIVWWIMENRREYFDRSKLVIN 364 Query: 835 QVKMLIFLSESQSDQWLAWCKEEGITLKSPPELVPLSVNDELAFVAGISCSLNTPSFSEE 656 +VKMLIFLSESQS QWL WCKEE I L S P +VPLSVNDELAFVAGI+CSLNTPSF+ E Sbjct: 365 RVKMLIFLSESQSKQWLTWCKEENIRLISQPAVVPLSVNDELAFVAGITCSLNTPSFTTE 424 Query: 655 
KMLEKRRLLRDAVRKEMGLTDNDMLVMSLSSINPGKGQLLLLESARLLVDGNISMESSGR 476 KM EKRRLLRD++RKEMGLTD DML++SLSSINPGKGQ LLES R +++ S + Sbjct: 425 KMQEKRRLLRDSIRKEMGLTDTDMLLLSLSSINPGKGQFFLLESVRSMIEQEPSQDDP-- 482 Query: 475 KIKDVSNISIEKRSSTLSRKQKSRALFQKVKHFGKSANGSYQPDESRVDMNEPKRKKAIT 296 ++KD+ + I + S S K SRAL Q ++N PK K + Sbjct: 483 ELKDL--VKIGQDQSNFSGKHYSRALLQ--------------------NLNGPKSKNLM- 519 Query: 295 SLFSNNHTEAISHGNTYKARKMFSDSTGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQ 116 + KQALKVLIGSVGSKSNKVPYVK LLR+L++ Sbjct: 520 ----------------------------LPKQALKVLIGSVGSKSNKVPYVKGLLRFLTR 551 Query: 115 HPELSRVVLWTPATTRVASLYSAADVYSINSQGPGETF 2 H LS+ VLWTPATTRVASLYSAADVY INSQG GETF Sbjct: 552 HSNLSKSVLWTPATTRVASLYSAADVYVINSQGMGETF 589 >ref|XP_007024055.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma cacao] gi|508779421|gb|EOY26677.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma cacao] Length = 702 Score = 672 bits (1735), Expect = 0.0 Identities = 368/612 (60%), Positives = 442/612 (72%), Gaps = 6/612 (0%) Frame = -1 Query: 1819 TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHG 1640 TP+ SP+FRR +SSRTPRRE R G QW RSNR++ WL+LITLWAYLGFYVQS+WAHG Sbjct: 26 TPKSSPTFRRLNSSRTPRREARSGAGGIQWFRSNRLVYWLLLITLWAYLGFYVQSRWAHG 85 Query: 1639 DNDKEAFVGYKS--KVSIPATKQNQQAEVDTNEDSVAHNNASSEATQSQKELNLKNMTVS 1466 N KE F+G+ + + +QN + ++ ++ VA NN +++ TQ + + V Sbjct: 86 HN-KEEFLGFSGNPRNGLIDAEQNPRRDLLADDSLVAVNNGTNK-TQVYSD---RKFDVI 140 Query: 1465 LAKKRH---ISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYGLL 1295 LAKKR+ + +R +RS + E EI ++N++YGLL Sbjct: 141 LAKKRNEVSFNKKRSRRSKRAGRNLSKMRGKRKATINIENGETEGQEHEILQKNSTYGLL 200 Query: 1294 VGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELATE 1115 VGPFG EDRILEW+ EKRSGTCDR+G FARLVWSR+ VL+FHELSMTGAP+SMMELATE Sbjct: 201 VGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWSRRLVLVFHELSMTGAPISMMELATE 260 Query: 1114 LLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASWIE 935 LLSCGATVSAVVLS+KGGLM EL RRRIKV+ED+ +LS+KTAMK+DLVIAGSAVCASWI+ Sbjct: 261 
LLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRADLSFKTAMKADLVIAGSAVCASWID 320 Query: 934 QYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGITL 755 QYIAH AG QIAWWIMENRREYFDR+K +L +VKMLIFLSE QS QWL WC+EE I L Sbjct: 321 QYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKL 380 Query: 754 KSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDMLVM 575 +S P LVPL+VNDELAFVAGI CSLNTPS S EKMLEKR+LLRDAVRKEMGLTDNDMLVM Sbjct: 381 RSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVM 440 Query: 574 SLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRALF 395 SLSSIN GKGQLLLLE+A L++D + S + ++ I + STL+ K R L Sbjct: 441 SLSSINTGKGQLLLLEAAGLMIDQDPLQTDS----EVTKSLDIRQDQSTLTVKHHLRGLL 496 Query: 394 QKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSN-NHTEAISHGNTYKARKMFSDS 218 QK S D S D+ LF++ N T A+S ++++ R M DS Sbjct: 497 QK----------SSDVDVSSTDLR----------LFASVNGTNAVSIDSSHRRRNMLFDS 536 Query: 217 TGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAADV 38 G Q+QALK+LIGSVGSKSNK+PYVK +LR+LSQH +LS VLWTPATT VASLYSAADV Sbjct: 537 KGTQEQALKILIGSVGSKSNKMPYVKEILRFLSQHAKLSESVLWTPATTHVASLYSAADV 596 Query: 37 YSINSQGPGETF 2 Y +NSQG GETF Sbjct: 597 YVMNSQGLGETF 608 >ref|XP_007024056.1| UDP-Glycosyltransferase superfamily protein isoform 2 [Theobroma cacao] gi|508779422|gb|EOY26678.1| UDP-Glycosyltransferase superfamily protein isoform 2 [Theobroma cacao] Length = 703 Score = 668 bits (1723), Expect = 0.0 Identities = 368/613 (60%), Positives = 442/613 (72%), Gaps = 7/613 (1%) Frame = -1 Query: 1819 TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHG 1640 TP+ SP+FRR +SSRTPRRE R G QW RSNR++ WL+LITLWAYLGFYVQS+WAHG Sbjct: 26 TPKSSPTFRRLNSSRTPRREARSGAGGIQWFRSNRLVYWLLLITLWAYLGFYVQSRWAHG 85 Query: 1639 DNDKEAFVGYKS--KVSIPATKQNQQAEVDTNEDSVAHNNASSEATQSQKELNLKNMTVS 1466 N KE F+G+ + + +QN + ++ ++ VA NN +++ TQ + + V Sbjct: 86 HN-KEEFLGFSGNPRNGLIDAEQNPRRDLLADDSLVAVNNGTNK-TQVYSD---RKFDVI 140 Query: 1465 LAKKRH---ISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYGLL 1295 LAKKR+ + +R +RS + E EI 
++N++YGLL Sbjct: 141 LAKKRNEVSFNKKRSRRSKRAGRNLSKMRGKRKATINIENGETEGQEHEILQKNSTYGLL 200 Query: 1294 VGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELATE 1115 VGPFG EDRILEW+ EKRSGTCDR+G FARLVWSR+ VL+FHELSMTGAP+SMMELATE Sbjct: 201 VGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWSRRLVLVFHELSMTGAPISMMELATE 260 Query: 1114 LLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASWIE 935 LLSCGATVSAVVLS+KGGLM EL RRRIKV+ED+ +LS+KTAMK+DLVIAGSAVCASWI+ Sbjct: 261 LLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRADLSFKTAMKADLVIAGSAVCASWID 320 Query: 934 QYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGITL 755 QYIAH AG QIAWWIMENRREYFDR+K +L +VKMLIFLSE QS QWL WC+EE I L Sbjct: 321 QYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKL 380 Query: 754 KSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDMLVM 575 +S P LVPL+VNDELAFVAGI CSLNTPS S EKMLEKR+LLRDAVRKEMGLTDNDMLVM Sbjct: 381 RSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVM 440 Query: 574 SLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRALF 395 SLSSIN GKGQLLLLE+A L++D + S + ++ I + STL+ K R L Sbjct: 441 SLSSINTGKGQLLLLEAAGLMIDQDPLQTDS----EVTKSLDIRQDQSTLTVKHHLRGLL 496 Query: 394 QKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSN-NHTEAISHGNTYKARKMFSDS 218 QK S D S D+ LF++ N T A+S ++++ R M DS Sbjct: 497 QK----------SSDVDVSSTDLR----------LFASVNGTNAVSIDSSHRRRNMLFDS 536 Query: 217 TGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAADV 38 G Q+QALK+LIGSVGSKSNK+PYVK +LR+LSQH +LS VLWTPATT VASLYSAADV Sbjct: 537 KGTQEQALKILIGSVGSKSNKMPYVKEILRFLSQHAKLSESVLWTPATTHVASLYSAADV 596 Query: 37 YSINS-QGPGETF 2 Y +NS QG GETF Sbjct: 597 YVMNSQQGLGETF 609 >ref|XP_007024057.1| UDP-Glycosyltransferase superfamily protein isoform 3 [Theobroma cacao] gi|508779423|gb|EOY26679.1| UDP-Glycosyltransferase superfamily protein isoform 3 [Theobroma cacao] Length = 608 Score = 663 bits (1710), Expect = 0.0 Identities = 363/606 (59%), Positives = 437/606 (72%), Gaps = 6/606 (0%) Frame = -1 Query: 1819 
TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHG 1640 TP+ SP+FRR +SSRTPRRE R G QW RSNR++ WL+LITLWAYLGFYVQS+WAHG Sbjct: 26 TPKSSPTFRRLNSSRTPRREARSGAGGIQWFRSNRLVYWLLLITLWAYLGFYVQSRWAHG 85 Query: 1639 DNDKEAFVGYKS--KVSIPATKQNQQAEVDTNEDSVAHNNASSEATQSQKELNLKNMTVS 1466 N KE F+G+ + + +QN + ++ ++ VA NN +++ TQ + + V Sbjct: 86 HN-KEEFLGFSGNPRNGLIDAEQNPRRDLLADDSLVAVNNGTNK-TQVYSD---RKFDVI 140 Query: 1465 LAKKRH---ISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYGLL 1295 LAKKR+ + +R +RS + E EI ++N++YGLL Sbjct: 141 LAKKRNEVSFNKKRSRRSKRAGRNLSKMRGKRKATINIENGETEGQEHEILQKNSTYGLL 200 Query: 1294 VGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELATE 1115 VGPFG EDRILEW+ EKRSGTCDR+G FARLVWSR+ VL+FHELSMTGAP+SMMELATE Sbjct: 201 VGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWSRRLVLVFHELSMTGAPISMMELATE 260 Query: 1114 LLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASWIE 935 LLSCGATVSAVVLS+KGGLM EL RRRIKV+ED+ +LS+KTAMK+DLVIAGSAVCASWI+ Sbjct: 261 LLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRADLSFKTAMKADLVIAGSAVCASWID 320 Query: 934 QYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGITL 755 QYIAH AG QIAWWIMENRREYFDR+K +L +VKMLIFLSE QS QWL WC+EE I L Sbjct: 321 QYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRVKMLIFLSELQSKQWLTWCQEENIKL 380 Query: 754 KSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDMLVM 575 +S P LVPL+VNDELAFVAGI CSLNTPS S EKMLEKR+LLRDAVRKEMGLTDNDMLVM Sbjct: 381 RSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKMLEKRQLLRDAVRKEMGLTDNDMLVM 440 Query: 574 SLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRALF 395 SLSSIN GKGQLLLLE+A L++D + S + ++ I + STL+ K R L Sbjct: 441 SLSSINTGKGQLLLLEAAGLMIDQDPLQTDS----EVTKSLDIRQDQSTLTVKHHLRGLL 496 Query: 394 QKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSN-NHTEAISHGNTYKARKMFSDS 218 QK S D S D+ LF++ N T A+S ++++ R M DS Sbjct: 497 QK----------SSDVDVSSTDLR----------LFASVNGTNAVSIDSSHRRRNMLFDS 536 Query: 217 TGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAADV 38 G Q+QALK+LIGSVGSKSNK+PYVK +LR+LSQH +LS VLWTPATT VASLYSAADV Sbjct: 537 
KGTQEQALKILIGSVGSKSNKMPYVKEILRFLSQHAKLSESVLWTPATTHVASLYSAADV 596 Query: 37 YSINSQ 20 Y +NSQ Sbjct: 597 YVMNSQ 602 >ref|XP_004302927.1| PREDICTED: uncharacterized protein LOC101300160 [Fragaria vesca subsp. vesca] Length = 720 Score = 656 bits (1693), Expect = 0.0 Identities = 368/611 (60%), Positives = 437/611 (71%), Gaps = 5/611 (0%) Frame = -1 Query: 1819 TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHG 1640 +PR SPSF+R HSSRTPRRE R S G QW RSNR+L WL+LITLWAYLGFY QS WAH Sbjct: 27 SPRSSPSFKRLHSSRTPRREARSS-GGVQWFRSNRLLFWLLLITLWAYLGFYFQSSWAHS 85 Query: 1639 DNDKEAFVGYKSKVSIPATKQNQQAEVDTNEDSVAHNNASSEATQSQKELNLKNMTVSLA 1460 +N K F+G ++ S + Q D + V N E Q+Q E K + V LA Sbjct: 86 NN-KVNFLGVGNEASNDKSDAEQNQRRDLLDSPVKLKN---ETGQNQPEAG-KTIDVVLA 140 Query: 1459 KKRH-ISLRRI--KRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYGLLVG 1289 KK ++ RR + + ++E E +IPK N SYG+LVG Sbjct: 141 KKDDGVASRRSLSSKKKSKKAARGKSHGKPKKTVAIEIHEIEEQEPDIPKTNASYGMLVG 200 Query: 1288 PFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELATELL 1109 PFG TEDRILEWN + R+GTCDR+G F+RLVWSR+F+LIFHELSMTGAPLSMMELATELL Sbjct: 201 PFGSTEDRILEWNPKTRTGTCDRKGDFSRLVWSRRFLLIFHELSMTGAPLSMMELATELL 260 Query: 1108 SCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASWIEQY 929 SCGATVSA+VLS+KGGLM EL RRRIKVLEDK + S+KTAMK DLVIAGSAVCASWI+QY Sbjct: 261 SCGATVSAIVLSKKGGLMPELTRRRIKVLEDKADHSFKTAMKQDLVIAGSAVCASWIDQY 320 Query: 928 IAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGITLKS 749 I AG+ QIAWWIMENRREYFDRAK +L +VKML FLSESQS QWL WC+EE I L+S Sbjct: 321 IDKFPAGASQIAWWIMENRREYFDRAKVVLDRVKMLAFLSESQSKQWLDWCEEEKIKLRS 380 Query: 748 PPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDMLVMSL 569 P +VPLS+NDELAFVAGI CSLNTPS S EKMLEK +LLRDAVRKEMGLTDNDML +SL Sbjct: 381 QPAIVPLSINDELAFVAGIGCSLNTPSSSIEKMLEKMKLLRDAVRKEMGLTDNDMLAISL 440 Query: 568 SSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRALFQK 389 SSINPGKGQLL+L SARL+++ ++S KIK +++ + S L+RK RAL Q Sbjct: 441 SSINPGKGQLLVLNSARLVIEEEPQPDNS--KIK--NSVRKGRVRSALARKHHIRALLQG 496 
Query: 388 VKHFGKSANGSYQPDESRVDMNEPKRKKA-ITSLFSN-NHTEAISHGNTYKARKMFSDST 215 S NG ES V E ++K + + F++ + T+A++ TYK RK+ +D+ Sbjct: 497 SNDHSASLNGFPLSTESSVHFKEDQKKHLHLHNRFASVDDTDAMNFDVTYK-RKVLADNG 555 Query: 214 GIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAADVY 35 G KQ+ K LIGSVGSKSNKV YVK LL YLSQH LS+ VLWTP+TTRVA+LYSAADVY Sbjct: 556 GTVKQSAKFLIGSVGSKSNKVAYVKELLSYLSQHSNLSKSVLWTPSTTRVAALYSAADVY 615 Query: 34 SINSQGPGETF 2 +NSQG GETF Sbjct: 616 VMNSQGLGETF 626 >gb|EXC25804.1| Putative glycosyltransferase ytcC [Morus notabilis] Length = 688 Score = 646 bits (1667), Expect = 0.0 Identities = 348/610 (57%), Positives = 427/610 (70%), Gaps = 4/610 (0%) Frame = -1 Query: 1819 TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAH- 1643 TPR+SPSFRR+ SSRTPRREGRGS QW RSNR+L WL+LITLWAYLGF+VQS+WAH Sbjct: 28 TPRNSPSFRRSQSSRTPRREGRGSARGLQWFRSNRLLFWLLLITLWAYLGFFVQSRWAHD 87 Query: 1642 GDNDKEAFVGYKSKVSIPATKQNQQAEVDTNEDSVAHNNASSEATQS---QKELNLKNMT 1472 DND G K K T+QN + ++ + S+A N + + S + ++ L Sbjct: 88 NDNDNVMGFGKKPKNWNSETEQNLRRDLIATDISLAVKNGTGKNQVSDGKRMDVVLAGRN 147 Query: 1471 VSLAKKRHISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYGLLV 1292 ++ R ++ ++ K ++E E +IPK N SYG+LV Sbjct: 148 DGISSHRKLNSKKKKTKRANRSLRSKVHGKQKMTMEVKNVEIEEQEPDIPKTNASYGMLV 207 Query: 1291 GPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELATEL 1112 GPFG EDRILEW+ EKRSGTCDR+G FAR+VWSR+FVLIFHELSMTG+PLSMMELATEL Sbjct: 208 GPFGSLEDRILEWSPEKRSGTCDRKGDFARIVWSRRFVLIFHELSMTGSPLSMMELATEL 267 Query: 1111 LSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASWIEQ 932 LSCGATVSAV LS+KGGLM EL RRRIKVLEDK +LS+KTAMK+DLVIAGSAVCASWI+Q Sbjct: 268 LSCGATVSAVALSKKGGLMSELARRRIKVLEDKADLSFKTAMKADLVIAGSAVCASWIDQ 327 Query: 931 YIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGITLK 752 +I H AG+ Q+AWWIMENRREYFDRAK +L +VKML+F+SE Q QWLAW +EE I L+ Sbjct: 328 FIEHFPAGASQVAWWIMENRREYFDRAKVVLNRVKMLVFISELQWKQWLAWAEEEKIYLR 387 Query: 751 SPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDMLVMS 572 S P 
LVPLS+NDE+AFVAGI+C+LNTPSF+ EKM+EKR+LLRD+ RKEMGL DNDMLVMS Sbjct: 388 SQPVLVPLSINDEMAFVAGIACTLNTPSFTTEKMIEKRQLLRDSARKEMGLKDNDMLVMS 447 Query: 571 LSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRALFQ 392 LSSINPGKGQ LLL S RL+++ E S K + + I+ S +RK + + +FQ Sbjct: 448 LSSINPGKGQHLLLGSGRLMIEKEAFEEKSNIK----NPVDIKHHQSKSTRKHRLKTVFQ 503 Query: 391 KVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSNNHTEAISHGNTYKARKMFSDSTG 212 K+ NGS ++ G T+ RK DS G Sbjct: 504 KL-------NGS------------------------------MAFGGTH--RKEMLDSGG 524 Query: 211 IQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAADVYS 32 ++++++K+LIGSVGSKSNKV YVK LL YLSQHP S+ VLWTPA+TRVA+LY+AADVY Sbjct: 525 MRERSVKILIGSVGSKSNKVVYVKELLNYLSQHPNTSKSVLWTPASTRVAALYAAADVYV 584 Query: 31 INSQGPGETF 2 INSQG GETF Sbjct: 585 INSQGLGETF 594 >ref|XP_002528176.1| glycosyltransferase, putative [Ricinus communis] gi|223532388|gb|EEF34183.1| glycosyltransferase, putative [Ricinus communis] Length = 686 Score = 643 bits (1659), Expect = 0.0 Identities = 362/644 (56%), Positives = 437/644 (67%), Gaps = 12/644 (1%) Frame = -1 Query: 1897 DLSGNTVRQXXXXXXXXXXXXXXXXSTPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSN 1718 DL N VRQ ST ++SP+FRR HSSRTPR E R G QW RS Sbjct: 9 DLHVNVVRQSPLRSGGSFRSTLSGRSTAKNSPTFRRLHSSRTPRGEARSIGGGVQWFRST 68 Query: 1717 RVLLWLILITLWAYLGFYVQSKWAHGDNDKEAFVGY----KSKVSIPATKQNQQAEVDTN 1550 R++ WL+LITLWAYLGFYVQS+WAHGDN KE F+G+ ++++S+P +QN + ++ N Sbjct: 69 RLVYWLLLITLWAYLGFYVQSRWAHGDN-KEDFLGFGGQNRNEISVP--EQNTRRDLLAN 125 Query: 1549 EDSVAHNNASSEATQSQKE-----LNLKNMTVSL-AKKRHISLRRIKRSTMEXXXXXXXX 1388 + SVA N+ + L K TVS KK S +R KR+ Sbjct: 126 DSSVAVNDGTDNVQVEDDRRIGVVLAKKGNTVSSNQKKNSFSKKRSKRAGRRLRSKTRDK 185 Query: 1387 XXXXXXXXXXXXVLDEGEEEIPKRNTSYGLLVGPFGLTEDRILEWNAEKRSGTCDREGGF 1208 + E +IP++NT+YG LVGPFG TEDRILEW+ EKR+GTCDR+G F Sbjct: 186 QKATVEVESEDVEVQE--PDIPQKNTTYGFLVGPFGSTEDRILEWSPEKRTGTCDRKGDF 243 Query: 1207 ARLVWSRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKGGLMGELNRRRIK 1028 ARLVWSRKFVLIFHELSMTGAPLSMMELATE LSCGATVSAVVLS+KGGLM ELNRRRIK Sbjct: 244 
ARLVWSRKFVLIFHELSMTGAPLSMMELATEFLSCGATVSAVVLSKKGGLMSELNRRRIK 303 Query: 1027 VLEDKGELSYKTAMKSDLVIAGSAVCASWIEQYIAHSTAGSGQIAWWIMENRREYFDRAK 848 VLEDK +LS+KTAMK+DLVIAGSAVCASWI+QY+ AG QI WWIMENRREYFDR+K Sbjct: 304 VLEDKADLSFKTAMKADLVIAGSAVCASWIDQYMTRFPAGGSQIVWWIMENRREYFDRSK 363 Query: 847 TMLGQVKMLIFLSESQSDQWLAWCKEEGITLKSPPELVPLSVNDELAFVAGISCSLNTPS 668 +L +VKML+FLSESQ++QWL+WC EE I L++PP +VPLS+NDELAFVAGI+CSLNTPS Sbjct: 364 IVLNRVKMLVFLSESQTEQWLSWCDEEKIKLRAPPAIVPLSINDELAFVAGIACSLNTPS 423 Query: 667 FSEEKMLEKRRLLRDAVRKEMGLTDNDMLVMSLSSINPGKGQLLLLESARLLVDGNISME 488 S EKMLEKRRLL D+VRKEMGLTD+D+L++SLSSINPGKGQLL+LESA+LL++ Sbjct: 424 SSPEKMLEKRRLLADSVRKEMGLTDDDVLLVSLSSINPGKGQLLILESAKLLIE-----P 478 Query: 487 SSGRKIKDVSNISIEKRSSTLSRKQKSRALFQKVKHFGKSANGSYQPDESRVDMNEPKRK 308 +K++ S++ I + S ++ K RAL Q ++ Sbjct: 479 EPLQKLR--SSVGIGEEQSRIAVKHHLRALLQ-------------------------EKS 511 Query: 307 KAITSLFSNNHTEAISHGNTYKARKMFSDSTGIQK--QALKVLIGSVGSKSNKVPYVKVL 134 KA++ L G +K +ALKVLIGSVGSKSNKVPYVK + Sbjct: 512 KAVSDL-----------------------KEGQEKYLKALKVLIGSVGSKSNKVPYVKEM 548 Query: 133 LRYLSQHPELSRVVLWTPATTRVASLYSAADVYSINSQGPGETF 2 L YL+QH LS+ VLWTPATTRVASLYSAAD Y INSQG GETF Sbjct: 549 LSYLTQHSNLSKSVLWTPATTRVASLYSAADAYVINSQGLGETF 592 >ref|XP_006343109.1| PREDICTED: uncharacterized protein LOC102601346 [Solanum tuberosum] Length = 711 Score = 640 bits (1650), Expect = e-180 Identities = 353/614 (57%), Positives = 425/614 (69%), Gaps = 8/614 (1%) Frame = -1 Query: 1819 TPRD-SPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAH 1643 TPR SPSFRR +S RTPRR+G+ S QW RSNR+LLWL+LITLWAY GFYVQS+WAH Sbjct: 29 TPRGGSPSFRRLNSGRTPRRDGKSSAFGSQWFRSNRILLWLLLITLWAYGGFYVQSRWAH 88 Query: 1642 GDNDKEAFVGYKSKVSIPATK--QNQQAEVDTNEDSVAHNNASSEATQSQKELNL---KN 1478 GDN + F G V+ ++ + Q + NE+S+A S++ + +L++ K Sbjct: 89 GDNKEGIFGGTGGDVANGTSQPEEKNQRILVANEESLAVKPPSNKTQGNSMDLDVVLAKQ 148 Query: 1477 MTVSLAKKRHISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYGL 1298 ++ K S ++ K+ST ++ EEEIPKRNT+YGL Sbjct: 149 
GNSVVSDKVSSSKKKSKKSTRASRRKTHGKKKVVAEVKTDD--IEVQEEEIPKRNTTYGL 206 Query: 1297 LVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELAT 1118 LVGPFG ED+ILEW+ EKRSGTCDR+ FARLVWSRKFVLI HELSMTGAPL+M+ELAT Sbjct: 207 LVGPFGSIEDKILEWSPEKRSGTCDRKSQFARLVWSRKFVLILHELSMTGAPLAMLELAT 266 Query: 1117 ELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASWI 938 ELLSCGATV V LS++GGLM EL+RR+IKVLEDK +LS+KTAMK+DL+IAGSAVCASWI Sbjct: 267 ELLSCGATVYVVPLSKRGGLMSELSRRKIKVLEDKSDLSFKTAMKADLIIAGSAVCASWI 326 Query: 937 EQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGIT 758 EQY A + GS QI WWIMENRREYFDRAK +VK LIFLSESQS +WLAWC+EE I Sbjct: 327 EQYAARTVLGSSQITWWIMENRREYFDRAKLAFNRVKKLIFLSESQSKRWLAWCEEEHIK 386 Query: 757 LKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDMLV 578 LK+ P LVPLS++DELAFVAGI CSL+TP FS EKMLEKR+LLRD VRKEMGLTDNDMLV Sbjct: 387 LKTQPALVPLSISDELAFVAGIPCSLSTPLFSPEKMLEKRQLLRDFVRKEMGLTDNDMLV 446 Query: 577 MSLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRAL 398 MSLSSINPGKGQ LLLE+ RLL++G + S K R+ + R L Sbjct: 447 MSLSSINPGKGQFLLLETTRLLIEGAPPLNGSAVK----------------RREYQKRTL 490 Query: 397 FQKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSN--NHTEAISHGNTYKARKMFS 224 K FG+ ++ + S + N + LF N+T I N RK+FS Sbjct: 491 LYNWKQFGE-----WKKESSTLSNNPQTETLQVPQLFIKGVNYTAGIE--NDRGTRKLFS 543 Query: 223 DSTGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAA 44 + G Q + LKVLIGSVGSKSNKVPYVK LL +L+QH LS VLWTP+TTRVA+LY+AA Sbjct: 544 LTEGKQGEKLKVLIGSVGSKSNKVPYVKALLNFLNQHSNLSNTVLWTPSTTRVAALYAAA 603 Query: 43 DVYSINSQGPGETF 2 D Y +NSQG GETF Sbjct: 604 DAYVMNSQGLGETF 617 >ref|XP_006597141.1| PREDICTED: uncharacterized protein LOC100793827 isoform X1 [Glycine max] gi|571514725|ref|XP_006597142.1| PREDICTED: uncharacterized protein LOC100793827 isoform X2 [Glycine max] Length = 701 Score = 639 bits (1648), Expect = e-180 Identities = 356/613 (58%), Positives = 424/613 (69%), Gaps = 7/613 (1%) Frame = -1 Query: 1819 TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHG 1640 TPR+SPSFRR 
+S RTPR+EGR SVG W RSNR+LLWL+LITLWAYLGF+VQS+WAH Sbjct: 35 TPRNSPSFRRLNSGRTPRKEGRSSVGGALWFRSNRLLLWLLLITLWAYLGFFVQSRWAHS 94 Query: 1639 DNDKEAFVGYKSKVSIPATKQNQQAEVDTNEDSVAHNNASSEATQSQKELN--LKNMTVS 1466 D KE F GY + N AE D +A N + S + ++ K + V+ Sbjct: 95 DK-KEEFSGYGTG----PRNTNSDAEQIQRRDLLASNKSLSANNDTDADIAGISKTINVA 149 Query: 1465 LAKK-----RHISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYG 1301 LAK H RS ++E E EIP N++YG Sbjct: 150 LAKNDNDVPSHRKTSSKNRSKGRRSSKGKSRGKLKPTTEIKNTDIEEQEPEIPTTNSTYG 209 Query: 1300 LLVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELA 1121 LLVGPFG EDRILEW+ EKRSGTC+R+ FARLVWSR+F+LIFHELSMTGAPLSMMELA Sbjct: 210 LLVGPFGPMEDRILEWSPEKRSGTCNRKEDFARLVWSRRFILIFHELSMTGAPLSMMELA 269 Query: 1120 TELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASW 941 TELLSCGATVSAVVLSRKGGLM EL RRRIKVLEDK +LS+KTAMK+DLVIAGSAVCASW Sbjct: 270 TELLSCGATVSAVVLSRKGGLMSELARRRIKVLEDKADLSFKTAMKADLVIAGSAVCASW 329 Query: 940 IEQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGI 761 IEQYI H AG+ Q+AWWIMENRREYFDR+K +L +VKML+FLSESQS QW WC+EE I Sbjct: 330 IEQYIEHFPAGASQVAWWIMENRREYFDRSKDVLHRVKMLVFLSESQSKQWQKWCEEESI 389 Query: 760 TLKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDML 581 L+S PE+VPLSVNDELAFVAGI +LNTPSFS EKM+EK++LLR++VRKEMGLTDNDML Sbjct: 390 KLRSHPEIVPLSVNDELAFVAGIPSTLNTPSFSTEKMVEKKQLLRESVRKEMGLTDNDML 449 Query: 580 VMSLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRA 401 V+SLSSINPGKGQLLLLES +++ S +K+K+VSN I++ S+L+RK + R Sbjct: 450 VISLSSINPGKGQLLLLESVSSVLEQGQS--PGDKKMKEVSN--IKEGLSSLARKHRIRK 505 Query: 400 LFQKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSNNHTEAISHGNTYKARKMFSD 221 L + + GK A+ S IS + + +++ + Sbjct: 506 LLPLMSN-GKVASNS------------------------------ISSNSLSRRKQVLPN 534 Query: 220 STGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAAD 41 G +Q+LK+LIGSV SKSNK YVK LL +L QHP S + WTPATTRVASLYSAAD Sbjct: 535 DKGTIQQSLKLLIGSVRSKSNKADYVKSLLSFLEQHPNTSTSIFWTPATTRVASLYSAAD 594 Query: 40 VYSINSQGPGETF 2 VY INSQG GETF Sbjct: 595 VYVINSQGLGETF 607 
>ref|XP_002298139.1| glycosyl transferase family 1 family protein [Populus trichocarpa] gi|222845397|gb|EEE82944.1| glycosyl transferase family 1 family protein [Populus trichocarpa] Length = 681 Score = 638 bits (1645), Expect = e-180 Identities = 354/615 (57%), Positives = 421/615 (68%), Gaps = 9/615 (1%) Frame = -1 Query: 1819 TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHG 1640 TPR+SP+ R HSSRTPRREGRGS G QW RSNR++ WL+LITLW YLGFYVQS+WAHG Sbjct: 36 TPRNSPTHRLLHSSRTPRREGRGS-GGIQWFRSNRLIYWLLLITLWTYLGFYVQSRWAHG 94 Query: 1639 DNDKEAFVGY--KSKVSIPATKQNQQAEVDTNEDSVAHNNASSEATQSQKELNLKNMTVS 1466 DN K+ F+G+ KS + +Q+ + ++ N+ V NN +++ + N K + V Sbjct: 95 DN-KDEFLGFGGKSSNGLLDAEQHTRRDLLANDSLVVVNNGTNKI----QVRNAKKIDVV 149 Query: 1465 LAKK-------RHISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTS 1307 LAKK R + ++ K ++ E ++PK N S Sbjct: 150 LAKKGNGVSSNRRATPKKKKSKRGGRRSRAKAHDKQKATVVVESDDVEVAEPDVPKNNAS 209 Query: 1306 YGLLVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMME 1127 YGLLVGPFG EDRILEW+ EKRSGTCDR+G FARLVWSRKFVLIFHELSMTGAPLSM+E Sbjct: 210 YGLLVGPFGPIEDRILEWSPEKRSGTCDRKGAFARLVWSRKFVLIFHELSMTGAPLSMLE 269 Query: 1126 LATELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCA 947 LATE LSCGATVSAVVLS+KGGLM EL RRRIKVLED+ +LS+KTAMK+DLVIAGSAVC Sbjct: 270 LATEFLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFKTAMKADLVIAGSAVCT 329 Query: 946 SWIEQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEE 767 SWI+QYIA AG Q+ WWIMENRREYFDR+K +L +VKML+FLSESQ QW WC+EE Sbjct: 330 SWIDQYIARFPAGGSQVVWWIMENRREYFDRSKIILNRVKMLVFLSESQMKQWQTWCEEE 389 Query: 766 GITLKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDND 587 I L+SPP +V LSVNDELAFVAGI+CSLNTP+ S EKMLEKR+LLR++VRKEMGLTDND Sbjct: 390 NIRLRSPPAVVQLSVNDELAFVAGIACSLNTPTSSSEKMLEKRQLLRESVRKEMGLTDND 449 Query: 586 MLVMSLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKS 407 MLVMSLSSIN GKGQLLLLESA L+++ + S + ++N + STL+ K Sbjct: 450 MLVMSLSSINAGKGQLLLLESANLVIEPDPSPK--------ITNSVDKGNQSTLAAKHHL 501 Query: 406 
RALFQKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSNNHTEAISHGNTYKARKMF 227 RAL + KRK + Sbjct: 502 RALSHR------------------------KRK-------------------------LL 512 Query: 226 SDSTGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSA 47 +DS G +QALKVLIGSVGSKSNKVPYVK +LR++SQH LS+ VLWT ATTRVASLYSA Sbjct: 513 ADSEGTHEQALKVLIGSVGSKSNKVPYVKEILRFISQHSNLSKSVLWTSATTRVASLYSA 572 Query: 46 ADVYSINSQGPGETF 2 ADVY NSQG GETF Sbjct: 573 ADVYITNSQGLGETF 587 >ref|XP_004235700.1| PREDICTED: uncharacterized protein LOC101247116 [Solanum lycopersicum] Length = 711 Score = 635 bits (1639), Expect = e-179 Identities = 350/614 (57%), Positives = 424/614 (69%), Gaps = 8/614 (1%) Frame = -1 Query: 1819 TPRD-SPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAH 1643 TPR SPSFRR +S RTPRR+G+ SV QW RSNR++LWL+LITLWAY GFYVQS+WAH Sbjct: 29 TPRGGSPSFRRLNSGRTPRRDGKSSVFGSQWFRSNRIVLWLLLITLWAYGGFYVQSRWAH 88 Query: 1642 GDNDKEAFVGYKSKVSIPATK--QNQQAEVDTNEDSVAHNNASSEATQSQKELNL---KN 1478 GDN + F G V+ ++ + Q + NE+S+A S++ + +L++ K Sbjct: 89 GDNKEGIFGGSGGDVANGTSQPEEKNQRILVANEESLAVKPPSNKTQGNSMDLDVVLAKQ 148 Query: 1477 MTVSLAKKRHISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYGL 1298 ++ K ++ K+ST + E EEIPKRNT+YGL Sbjct: 149 GNSVVSDKGASPKKKSKKSTRASRRKTRGKKKVVAEVKSDDIEIQE--EEIPKRNTTYGL 206 Query: 1297 LVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELAT 1118 LVGPFG ED+ILEW+ EKR+GTCDR+ FARLVWSRKFVLI HELSMTGAPL+M+ELAT Sbjct: 207 LVGPFGSIEDKILEWSPEKRTGTCDRKSQFARLVWSRKFVLILHELSMTGAPLAMLELAT 266 Query: 1117 ELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASWI 938 ELLSCGATV V LS++GGLM EL+RR+IKVLEDK +LS+KTAMK+DL+IAGSAVCASWI Sbjct: 267 ELLSCGATVYVVPLSKRGGLMSELSRRKIKVLEDKSDLSFKTAMKADLIIAGSAVCASWI 326 Query: 937 EQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGIT 758 EQY A + GS QI WWIMENRREYFDRAK +VK LIFLSESQS +WLAWC+EE I Sbjct: 327 EQYAARTVLGSTQITWWIMENRREYFDRAKLAFNRVKKLIFLSESQSKRWLAWCEEEHIK 386 Query: 757 LKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDMLV 578 LK+ P L+PLS++DELAFVAGI CSL+TP FS 
EKMLEKR+LLRD VRKEMGLTDNDMLV Sbjct: 387 LKTQPALIPLSISDELAFVAGIPCSLSTPLFSPEKMLEKRQLLRDFVRKEMGLTDNDMLV 446 Query: 577 MSLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRAL 398 MSLSSINPGKGQ LLLE+ RLL++G + S K R+ + R L Sbjct: 447 MSLSSINPGKGQFLLLETTRLLIEGAPPLYGSAVK----------------RREYQKRTL 490 Query: 397 FQKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSN--NHTEAISHGNTYKARKMFS 224 K FG+ ++ + S + N+ + LF N+T I N RK+FS Sbjct: 491 LYNWKQFGE-----WKKESSTLSNNQETEALQVPQLFIKGVNYTAGIE--NDRGTRKLFS 543 Query: 223 DSTGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAA 44 G Q + LKVLIGSVGSKSNKVPYVK LL +L+QH LS VLWTP+TTRVA+LY+AA Sbjct: 544 LPEGKQGEKLKVLIGSVGSKSNKVPYVKALLNFLNQHSNLSNTVLWTPSTTRVAALYAAA 603 Query: 43 DVYSINSQGPGETF 2 D Y +NSQG GETF Sbjct: 604 DAYVMNSQGLGETF 617 >ref|XP_004486717.1| PREDICTED: uncharacterized protein LOC101501726 [Cicer arietinum] Length = 709 Score = 629 bits (1621), Expect = e-177 Identities = 355/618 (57%), Positives = 436/618 (70%), Gaps = 12/618 (1%) Frame = -1 Query: 1819 TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHG 1640 TPR+SP+FRR ++SRTPR++GR SVGS W RSNRVLLWL+LITLWAYLGF+VQS+WAH Sbjct: 38 TPRNSPTFRRLNTSRTPRKDGR-SVGSSLWFRSNRVLLWLLLITLWAYLGFFVQSRWAHS 96 Query: 1639 DNDKEAFVGYKSKVSIPATKQNQ---QAEVDTNEDSVAHNNASSEATQSQKELNLKNMTV 1469 D KE F G+ + + + + ++ +EDS++ NN T K + + V Sbjct: 97 DK-KEEFSGFGTGPRNTGSNDDSTSLRRDLIASEDSLSVNN----ETVINKGGVGRTINV 151 Query: 1468 SLAKKRH------ISLRR---IKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKR 1316 +LA K + + RR K+ + ++E E EIP+ Sbjct: 152 ALAMKGNDDDDDDVPSRRKASSKKKKSKRSSRGKARGKNKPKVEIKNNDIEEQEPEIPET 211 Query: 1315 NTSYGLLVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLS 1136 N++YGLLVGPFG TEDRILEW+ +KRSGTC+R+G FARLVWSR+F+LIFHELSMTGAPLS Sbjct: 212 NSTYGLLVGPFGSTEDRILEWSPQKRSGTCNRKGDFARLVWSRRFILIFHELSMTGAPLS 271 Query: 1135 MMELATELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSA 956 MMELATELLSCGATVSAV LSRKGGLM EL RRRIK+LEDK +LS+KTAMK+DLVIAGSA Sbjct: 272 
MMELATELLSCGATVSAVALSRKGGLMSELARRRIKLLEDKADLSFKTAMKADLVIAGSA 331 Query: 955 VCASWIEQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWC 776 VCASWIEQYI H AG+ Q+AWWIMENRREYF+R K +L +VKML+FLSESQS QW WC Sbjct: 332 VCASWIEQYIEHFPAGASQVAWWIMENRREYFNRTKGVLDRVKMLVFLSESQSKQWQKWC 391 Query: 775 KEEGITLKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLT 596 +EE I L+S PE++PLSVNDELAFVAGI +LNTPSF +KM+EK++LLR++VRKEMGLT Sbjct: 392 EEENIKLRSRPEIIPLSVNDELAFVAGIPSTLNTPSFDTDKMIEKKQLLRESVRKEMGLT 451 Query: 595 DNDMLVMSLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRK 416 D+DMLV+SLSSINPGKGQLLLLESA +V+ + +K+K SN I++ STL+RK Sbjct: 452 DHDMLVISLSSINPGKGQLLLLESAISVVEHGQLQDD--KKMKKSSN--IKEGLSTLTRK 507 Query: 415 QKSRALFQKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSNNHTEAISHGNTYKAR 236 Q+ R L +K GK A + +N R+K + NN T Sbjct: 508 QRIRKLLPMLKD-GKVA-------LKDISINSLSRRKQV---LPNNKTTT---------- 546 Query: 235 KMFSDSTGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASL 56 +Q+LKVLIGSVGSKSNK YVK LL +L+QHP S+ VLWTP+TT+VASL Sbjct: 547 ----------QQSLKVLIGSVGSKSNKADYVKSLLSFLAQHPNTSKTVLWTPSTTQVASL 596 Query: 55 YSAADVYSINSQGPGETF 2 YSAADVY INSQG GETF Sbjct: 597 YSAADVYVINSQGLGETF 614 >ref|XP_003542107.1| PREDICTED: uncharacterized protein LOC100795000 isoform X1 [Glycine max] gi|571503664|ref|XP_006595144.1| PREDICTED: uncharacterized protein LOC100795000 isoform X2 [Glycine max] Length = 701 Score = 629 bits (1621), Expect = e-177 Identities = 354/613 (57%), Positives = 425/613 (69%), Gaps = 8/613 (1%) Frame = -1 Query: 1816 PRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHGD 1637 PR+SPSFRR +S RTPR+EGR SVG W RSN +LLWL+LITLWAYLGF+VQS+WAH D Sbjct: 36 PRNSPSFRRLNSVRTPRKEGRISVGGALWFRSNHLLLWLLLITLWAYLGFFVQSRWAHSD 95 Query: 1636 NDKEAFVGYKSKVSIPATKQNQQAEVDTNEDSVAHNNASSEATQSQKELN--LKNMTVSL 1463 KE F G+ + N AE D +A + + S ++ ++ K ++V+L Sbjct: 96 K-KEEFSGFGTG----PRNTNTDAEQIQRRDLLASDKSLSANNETGADIAGISKTISVAL 150 Query: 1462 AKK-----RHISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYGL 1298 AKK H KRS ++E E EIP N 
+YGL Sbjct: 151 AKKDNDVPSHRKTSSKKRSKSRRSSKGKSRGKLKPTTEIKNTDIEEQEPEIPTTNNTYGL 210 Query: 1297 LVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELAT 1118 LVGPFG EDRILEW+ EKRSGTC+R+ FARLVWSR+F+LIFHELSMTGAPLSMMELAT Sbjct: 211 LVGPFGPMEDRILEWSPEKRSGTCNRKEDFARLVWSRRFILIFHELSMTGAPLSMMELAT 270 Query: 1117 ELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASWI 938 ELLSCGATVSAVVLSRKGGLM EL RRRIKVLEDK +LS+KTAMK+DLVIAGSAVCASWI Sbjct: 271 ELLSCGATVSAVVLSRKGGLMSELARRRIKVLEDKSDLSFKTAMKADLVIAGSAVCASWI 330 Query: 937 EQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGIT 758 EQYI H AG+ Q+AWWIMENRREYFDR+K +L +VKML+FLSESQS QW WC+EE I Sbjct: 331 EQYIDHFPAGASQVAWWIMENRREYFDRSKDILHRVKMLVFLSESQSKQWQKWCEEESIK 390 Query: 757 LKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDMLV 578 L+S PE+V LSVN+ELAFVAGI +LNTPSFS EKM+EK++LLR++VRKEMGLTDNDMLV Sbjct: 391 LRSLPEIVALSVNEELAFVAGIPSTLNTPSFSTEKMVEKKQLLRESVRKEMGLTDNDMLV 450 Query: 577 MSLSSINPGKGQLLLLES-ARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRA 401 +SLSSINPGKGQLLLLES + +L G + +K+K VSN I++ S+L+RK + R Sbjct: 451 ISLSSINPGKGQLLLLESVSSVLEQGQL---QDDKKMKKVSN--IKEGLSSLTRKHRIRK 505 Query: 400 LFQKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSNNHTEAISHGNTYKARKMFSD 221 L +K+ GK A+ S IS + + +++ + Sbjct: 506 LLPLMKN-GKVASNS------------------------------ISSNSLSRRKQVLPN 534 Query: 220 STGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAAD 41 G +Q+LK+LIGSV SKSNK YVK LL +L QHP S + WTPATTRVASLYSAAD Sbjct: 535 GKGTIQQSLKLLIGSVRSKSNKADYVKSLLSFLEQHPNASTSIFWTPATTRVASLYSAAD 594 Query: 40 VYSINSQGPGETF 2 VY INSQG GETF Sbjct: 595 VYVINSQGLGETF 607 >ref|XP_007150675.1| hypothetical protein PHAVU_005G172300g [Phaseolus vulgaris] gi|593700475|ref|XP_007150676.1| hypothetical protein PHAVU_005G172300g [Phaseolus vulgaris] gi|561023939|gb|ESW22669.1| hypothetical protein PHAVU_005G172300g [Phaseolus vulgaris] gi|561023940|gb|ESW22670.1| hypothetical protein PHAVU_005G172300g [Phaseolus vulgaris] Length = 701 Score = 628 bits (1620), Expect = e-177 
Identities = 346/613 (56%), Positives = 428/613 (69%), Gaps = 7/613 (1%) Frame = -1 Query: 1819 TPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSNRVLLWLILITLWAYLGFYVQSKWAHG 1640 TPR+SPSFRR +S RTPR+EGR +G W RSNR+L WL+LITLWAYLGF+VQS+WAH Sbjct: 35 TPRNSPSFRRQNSGRTPRKEGRSGIGGALWFRSNRLLFWLLLITLWAYLGFFVQSRWAHS 94 Query: 1639 DNDKEAFVGYKS--KVSIPATKQNQQAEVDTNEDSVAHNNASSEATQSQKELNLKNMTVS 1466 D KE F G+ + + + +Q Q+ ++ ++ S++ NN T + L+ K + V Sbjct: 95 DK-KEEFSGFGTGPRNTGSDAEQVQRRDLLASDHSLSANNE----TDANIALSSKTINVV 149 Query: 1465 LAKK-----RHISLRRIKRSTMEXXXXXXXXXXXXXXXXXXXXVLDEGEEEIPKRNTSYG 1301 LAK+ H KRS ++E + EIP N +YG Sbjct: 150 LAKRGNDVPSHRKTSSKKRSRRRRASKGKSSGKLKPSTDVKDADIEEQKPEIPTANGTYG 209 Query: 1300 LLVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVWSRKFVLIFHELSMTGAPLSMMELA 1121 LLVGPFG EDRILEW+ EKRSGTC+R+G FARLVWSR+F+L+FHELSMTGAPLSMMELA Sbjct: 210 LLVGPFGPVEDRILEWSPEKRSGTCNRKGDFARLVWSRRFILVFHELSMTGAPLSMMELA 269 Query: 1120 TELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDKGELSYKTAMKSDLVIAGSAVCASW 941 TELLSCGATVSAVVLS+KGGLM EL RRRIKVLEDK +LS+KTAMK+DLVIAGSAVCASW Sbjct: 270 TELLSCGATVSAVVLSKKGGLMSELARRRIKVLEDKADLSFKTAMKADLVIAGSAVCASW 329 Query: 940 IEQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQVKMLIFLSESQSDQWLAWCKEEGI 761 I+QYI AG+ Q+ WWIMENRREYFD +K L +VKML+FLSESQS QWL WC+EE I Sbjct: 330 IDQYIERFPAGASQVVWWIMENRREYFDLSKDALDRVKMLVFLSESQSKQWLKWCEEESI 389 Query: 760 TLKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEKMLEKRRLLRDAVRKEMGLTDNDML 581 L+S PE++PLSVNDELAFVAGI +LNTPSFS +KM+EKR+LLR++VRKE+GL D+DML Sbjct: 390 KLRSYPEIIPLSVNDELAFVAGIPSTLNTPSFSTDKMVEKRQLLRESVRKEIGLNDSDML 449 Query: 580 VMSLSSINPGKGQLLLLESARLLVDGNISMESSGRKIKDVSNISIEKRSSTLSRKQKSRA 401 V+SLSSINPGKGQLLLLES +++ + +K+K VSN I++ STL+RK + R Sbjct: 450 VISLSSINPGKGQLLLLESVSSVLEQGWLQDD--KKMKKVSN--IKEGISTLARKHRIRK 505 Query: 400 LFQKVKHFGKSANGSYQPDESRVDMNEPKRKKAITSLFSNNHTEAISHGNTYKARKMFSD 221 L +K NG + SN+ IS + + +++ D Sbjct: 506 LLPVLK------NG---------------------KVVSND----ISSNSLSRRKQVLPD 534 Query: 220 STGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQHPELSRVVLWTPATTRVASLYSAAD 41 G +++LK+LIGSVGSKSNK 
YVK LL +L QHP S+ + WTPATTRVASLYSAAD Sbjct: 535 DKGTIQKSLKLLIGSVGSKSNKADYVKSLLNFLEQHPNTSKSIFWTPATTRVASLYSAAD 594 Query: 40 VYSINSQGPGETF 2 VY INSQG GETF Sbjct: 595 VYVINSQGLGETF 607 >ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207532 [Cucumis sativus] gi|449496350|ref|XP_004160111.1| PREDICTED: uncharacterized protein LOC101223486 [Cucumis sativus] Length = 682 Score = 625 bits (1611), Expect = e-176 Identities = 346/637 (54%), Positives = 419/637 (65%), Gaps = 5/637 (0%) Frame = -1 Query: 1897 DLSGNTVRQXXXXXXXXXXXXXXXXSTPRDSPSFRRAHSSRTPRREGRGSVGSFQWIRSN 1718 D GN V+ STPR SPSFRR HSSRTPRRE R + S WIR+N Sbjct: 8 DFLGNVVKPSSLRPSGSFKPSVSGKSTPRGSPSFRRLHSSRTPRREARSTGFSLHWIRNN 67 Query: 1717 RVLLWLILITLWAYLGFYVQSKWAHGDNDKE--AFVGYKSKVSIPATKQNQQAEVDTNED 1544 +VL WL+LITLWAYLGFYVQS+WAHG+N E F G +S + + + + + TN Sbjct: 68 KVLFWLLLITLWAYLGFYVQSRWAHGENKDEFLGFGGQQSNQKLDSEQNQSLSLISTNNR 127 Query: 1543 SVAHNNASSEATQSQKELNL---KNMTVSLAKKRHISLRRIKRSTMEXXXXXXXXXXXXX 1373 V N + +N+ K A K+ +R KRS + Sbjct: 128 LVVENRSGENDRSDGGVVNVVLAKKANGVSASKKTKPRKRSKRSKRDKVHKGKIPAEVTN 187 Query: 1372 XXXXXXXVLDEGEEEIPKRNTSYGLLVGPFGLTEDRILEWNAEKRSGTCDREGGFARLVW 1193 ++E E EIP +N+SYG+LVGPFG TEDRILEW+ EKRSGTCDR+G FARLVW Sbjct: 188 HD------IEEQEPEIPLKNSSYGMLVGPFGSTEDRILEWSPEKRSGTCDRKGDFARLVW 241 Query: 1192 SRKFVLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKGGLMGELNRRRIKVLEDK 1013 SR+FVLIFHELSMTGAP+SMMELATELLSCGA+VSAV LS+KGGLM EL+RRRIKVL+DK Sbjct: 242 SRRFVLIFHELSMTGAPISMMELATELLSCGASVSAVALSKKGGLMSELSRRRIKVLDDK 301 Query: 1012 GELSYKTAMKSDLVIAGSAVCASWIEQYIAHSTAGSGQIAWWIMENRREYFDRAKTMLGQ 833 +LS+KTAMK+DLVIAGSAVCASWI+ YI H AG+ Q+AWWIMENRREYF+R+K +L + Sbjct: 302 ADLSFKTAMKADLVIAGSAVCASWIDGYIEHFPAGASQVAWWIMENRREYFNRSKVVLDR 361 Query: 832 VKMLIFLSESQSDQWLAWCKEEGITLKSPPELVPLSVNDELAFVAGISCSLNTPSFSEEK 653 VKMLIF+SE QS QWL W +EE I L+S P +VPLSVNDELAFVAGISCSLNT S S EK Sbjct: 362 VKMLIFISELQSKQWLNWSQEENIKLRSQPAIVPLSVNDELAFVAGISCSLNTESSSPEK 421 Query: 652 MLEKRRLLRDAVRKEMGLTDNDMLVMSLSSINPGKGQLLLLESARLLVDGNISMESSGRK 
473 MLEK++LLR+ RKEMG+ DND++VM+LSSINPGKG LLLES+ LL+D + + + Sbjct: 422 MLEKKQLLRNTTRKEMGVGDNDVVVMTLSSINPGKGHFLLLESSNLLIDRGLKRDDPKIR 481 Query: 472 IKDVSNISIEKRSSTLSRKQKSRALFQKVKHFGKSANGSYQPDESRVDMNEPKRKKAITS 293 D S+ S K L+R++ RAL QK+ Sbjct: 482 NPDDSSPSRPK----LARRRYMRALLQKLN------------------------------ 507 Query: 292 LFSNNHTEAISHGNTYKARKMFSDSTGIQKQALKVLIGSVGSKSNKVPYVKVLLRYLSQH 113 R++ +D + + + K+LIGSVGSKSNKV YVK LLR+LSQH Sbjct: 508 ----------------DRRRLLADGGELPETSFKLLIGSVGSKSNKVVYVKRLLRFLSQH 551 Query: 112 PELSRVVLWTPATTRVASLYSAADVYSINSQGPGETF 2 LS+ VLWTPATTRVASLYSAAD+Y INSQG GETF Sbjct: 552 SNLSQSVLWTPATTRVASLYSAADIYVINSQGIGETF 588