BLASTX nr result
ID: Catharanthus23_contig00015025
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00015025 (2635 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004235700.1| PREDICTED: uncharacterized protein LOC101247... 823 0.0 emb|CBI36173.3| unnamed protein product [Vitis vinifera] 785 0.0 ref|XP_002528176.1| glycosyltransferase, putative [Ricinus commu... 780 0.0 ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246... 777 0.0 ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207... 753 0.0 ref|XP_002298139.1| glycosyl transferase family 1 family protein... 748 0.0 ref|XP_006597141.1| PREDICTED: uncharacterized protein LOC100793... 748 0.0 gb|EXC25804.1| Putative glycosyltransferase ytcC [Morus notabilis] 746 0.0 gb|EOY26677.1| UDP-Glycosyltransferase superfamily protein isofo... 744 0.0 gb|ESW22669.1| hypothetical protein PHAVU_005G172300g [Phaseolus... 744 0.0 gb|EOY26678.1| UDP-Glycosyltransferase superfamily protein isofo... 740 0.0 ref|XP_003542107.1| PREDICTED: uncharacterized protein LOC100795... 734 0.0 ref|XP_004486717.1| PREDICTED: uncharacterized protein LOC101501... 733 0.0 ref|XP_006427083.1| hypothetical protein CICLE_v10024994mg [Citr... 717 0.0 ref|XP_006465456.1| PREDICTED: uncharacterized protein LOC102612... 715 0.0 ref|NP_188215.1| UDP-glycosyltransferase-like protein [Arabidops... 694 0.0 ref|XP_006297092.1| hypothetical protein CARUB_v10013095mg [Caps... 694 0.0 ref|XP_006406901.1| hypothetical protein EUTSA_v10020188mg [Eutr... 691 0.0 ref|XP_002885116.1| glycosyl transferase family 1 protein [Arabi... 689 0.0 ref|XP_006583137.1| PREDICTED: uncharacterized protein LOC100796... 679 0.0 >ref|XP_004235700.1| PREDICTED: uncharacterized protein LOC101247116 [Solanum lycopersicum] Length = 711 Score = 823 bits (2125), Expect = 0.0 Identities = 437/711 (61%), Positives = 521/711 (73%), Gaps = 39/711 (5%) Frame = +2 Query: 476 MDEINLVRPSSLRTNGAL--KSTLSGKSTPRG-SPSFRRINSGRTPRREGRSSGIRFYCF 646 M+E+N+VR S LR NG + KSTLSG+STPRG SPSFRR+NSGRTPRR+G+SS F Sbjct: 1 MEELNVVRLSPLRLNGPVPAKSTLSGRSTPRGGSPSFRRLNSGRTPRRDGKSSVFGSQWF 60 Query: 647 GGNRXXXXXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTA 826 NR YGGFY+QSRWAHGDNKEGIFG + + ++ ++K++R L A Sbjct: 61 RSNRIVLWLLLITLWAYGGFYVQSRWAHGDNKEGIFGGSGGDVANGTSQPEEKNQRILVA 120 Query: 827 NEDSLAVNNHVDTRQSDSKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXX 1006 NE+SLAV + Q +S +D++LAK+G+S Sbjct: 121 NEESLAVKPPSNKTQGNSMDLDVVLAKQGNSVVSDKGASPKKKSKKSTRASRRKTRGKKK 180 Query: 1007 EMVELQNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLV 1186 + E+++ + EEIP +N+TYGLLVGPFG++EDKILEWSPE+RTGTCDR QFARLV Sbjct: 181 VVAEVKSDDIEIQEEEIPKRNTTYGLLVGPFGSIEDKILEWSPEKRTGTCDRKSQFARLV 240 Query: 1187 WSRKFVLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLED 1366 WSRKFVLI HELSMTGAPLAM+ELATELLSCGATV VV LS++GGLM ELSRRKIKVLED Sbjct: 241 WSRKFVLILHELSMTGAPLAMLELATELLSCGATVYVVPLSKRGGLMSELSRRKIKVLED 300 Query: 1367 KLDLSFKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALT 1546 K DLSFKTAMK+DLIIAGSAVCASWIE+Y TVLG++QI WWIMENRREYFDRAKLA Sbjct: 301 KSDLSFKTAMKADLIIAGSAVCASWIEQYAARTVLGSTQITWWIMENRREYFDRAKLAFN 360 Query: 1547 LVKKIIFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKE 1726 VKK+IFLSE QSK+WLAWCEEE I+LK++PALIPLS++DELAFVAGI CSL+TP FS E Sbjct: 361 RVKKLIFLSESQSKRWLAWCEEEHIKLKTQPALIPLSISDELAFVAGIPCSLSTPLFSPE 420 Query: 1727 KMLEKRQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFP------ 1888 KMLEKRQLLR VR+EMGLTD DML +SLSSINPGKGQFLLLE+ R++IE P Sbjct: 421 KMLEKRQLLRDFVRKEMGLTDNDMLVMSLSSINPGKGQFLLLETTRLLIEGAPPLYGSAV 480 Query: 1889 ----------------------QNNSLTKSFKHRRINLPLRRTKASN---GI-----IXX 1978 ++++L+ + + + +P K N GI Sbjct: 481 KRREYQKRTLLYNWKQFGEWKKESSTLSNNQETEALQVPQLFIKGVNYTAGIENDRGTRK 540 Query: 1979 XXXXXXXXXXXXXXXXXXXVGSKSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLY 2158 VGSKSNKVPYVK+LL FL+QHSNLS++VLWTP+TTRVA+LY Sbjct: 541 LFSLPEGKQGEKLKVLIGSVGSKSNKVPYVKALLNFLNQHSNLSNTVLWTPSTTRVAALY 600 Query: 2159 AAADAYVMNAQGMGETFGRVTIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAE 2338 AAADAYVMN+QG+GETFGRVTIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLH LGRPG + Sbjct: 601 AAADAYVMNSQGLGETFGRVTIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHSLGRPGTQ 660 Query: 2339 VLAKHLKYLLENPSARQQMGTEGRKKVEKMFLKKDMYKKFGEVLYNCMRIK 2491 VLA++L+YLL NPS RQ++G+ GRKKV+ M+LKK MY++FGEVLY+CMRIK Sbjct: 661 VLAQNLQYLLNNPSERQRLGSNGRKKVKDMYLKKHMYRRFGEVLYDCMRIK 711 >emb|CBI36173.3| unnamed protein product [Vitis vinifera] Length = 683 Score = 785 bits (2027), Expect = 0.0 Identities = 419/684 (61%), Positives = 498/684 (72%), Gaps = 16/684 (2%) Frame = +2 Query: 488 NLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXX 667 N+VR SSLR G+LKSTLSG+STPR SPSFRR +S RTPRRE RSSG+ F NR Sbjct: 13 NVVRQSSLRPGGSLKSTLSGRSTPRNSPSFRRSHSSRTPRREARSSGVGSQWFRNNRVVF 72 Query: 668 XXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESG-EKTELQQKDRRELTANEDSLA 844 Y GFY+QS+WAHGDN E I G +G +EL +K L AN+ LA Sbjct: 73 WLILITLWAYLGFYVQSKWAHGDNNEDIIGFGGKPNNGISDSELNRK--APLIANDKLLA 130 Query: 845 VNNHVDTRQSDS-KRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVEL 1021 V N D S K+VD++LAK+G+S P + + E+ Sbjct: 131 VKNGSDKNPVGSGKKVDVVLAKKGNSVPSRRSASSKKRSKKSERSLRGKTRKQKTK-TEV 189 Query: 1022 QNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKF 1201 + T+ D +EIP N++YGLLVGPFG+ ED+ILEWSPE+R+GTCDR G+ ARLVWSRKF Sbjct: 190 EVTEMDEQEQEIPKLNTSYGLLVGPFGSTEDRILEWSPEKRSGTCDRRGELARLVWSRKF 249 Query: 1202 VLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLS 1381 VLIFHELSMTGAPL+MMELATELLSCGATVS VVLS+KGGLM EL+RR+IKVLED+ DLS Sbjct: 250 VLIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLS 309 Query: 1382 FKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKI 1561 FKTAMK+DL+IAGSAVCASWIE+Y H G+SQI WWIMENRREYFDR+KL + VK + Sbjct: 310 FKTAMKADLVIAGSAVCASWIEQYIAHFTAGSSQIVWWIMENRREYFDRSKLVINRVKML 369 Query: 1562 IFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEK 1741 IFLSE QSKQWL WC+EE IRL S+PA++PLSVNDELAFVAGI CSLNTP+F+ EKM EK Sbjct: 370 IFLSESQSKQWLTWCKEENIRLISQPAVVPLSVNDELAFVAGITCSLNTPSFTTEKMQEK 429 Query: 1742 RQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQNNSLTK---- 1909 R+LLR +R+EMGLTD DML +SLSSINPGKGQF LLES R MIEQ Q++ K Sbjct: 430 RRLLRDSIRKEMGLTDTDMLLLSLSSINPGKGQFFLLESVRSMIEQEPSQDDPELKDLVK 489 Query: 1910 --------SFKH--RRINLPLRRTKASNGIIXXXXXXXXXXXXXXXXXXXXXVGSKSNKV 2059 S KH R + L K+ N ++ VGSKSNKV Sbjct: 490 IGQDQSNFSGKHYSRALLQNLNGPKSKNLML----------PKQALKVLIGSVGSKSNKV 539 Query: 2060 PYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRVTIEAMAF 2239 PYVK LL FL++HSNLS SVLWTPATTRVASLY+AAD YV+N+QGMGETFGRVTIEAMAF Sbjct: 540 PYVKGLLRFLTRHSNLSKSVLWTPATTRVASLYSAADVYVINSQGMGETFGRVTIEAMAF 599 Query: 2240 GLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMGTEGRKKV 2419 GLPVLGTDAGGTKE+VE NVTGLLHP+G G ++L++++++LL+NPS+R+QMG GRKKV Sbjct: 600 GLPVLGTDAGGTKEVVEQNVTGLLHPVGHLGTQILSENIRFLLKNPSSREQMGKRGRKKV 659 Query: 2420 EKMFLKKDMYKKFGEVLYNCMRIK 2491 E+M+LK+ MYK+ EVLY CMRIK Sbjct: 660 ERMYLKRHMYKRLAEVLYKCMRIK 683 >ref|XP_002528176.1| glycosyltransferase, putative [Ricinus communis] gi|223532388|gb|EEF34183.1| glycosyltransferase, putative [Ricinus communis] Length = 686 Score = 780 bits (2014), Expect = 0.0 Identities = 406/682 (59%), Positives = 495/682 (72%), Gaps = 13/682 (1%) Frame = +2 Query: 485 INLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXX 664 +N+VR S LR+ G+ +STLSG+ST + SP+FRR++S RTPR E RS G F R Sbjct: 12 VNVVRQSPLRSGGSFRSTLSGRSTAKNSPTFRRLHSSRTPRGEARSIGGGVQWFRSTRLV 71 Query: 665 XXXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLA 844 Y GFY+QSRWAHGDNKE G Q E + +Q RR+L AN+ S+A Sbjct: 72 YWLLLITLWAYLGFYVQSRWAHGDNKEDFLGFG-GQNRNEISVPEQNTRRDLLANDSSVA 130 Query: 845 VNNHVDTRQ-SDSKRVDMILAKRGS--SDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMV 1015 VN+ D Q D +R+ ++LAK+G+ S ++ V Sbjct: 131 VNDGTDNVQVEDDRRIGVVLAKKGNTVSSNQKKNSFSKKRSKRAGRRLRSKTRDKQKATV 190 Query: 1016 ELQNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSR 1195 E+++ + +IP +N+TYG LVGPFG+ ED+ILEWSPE+RTGTCDR G FARLVWSR Sbjct: 191 EVESEDVEVQEPDIPQKNTTYGFLVGPFGSTEDRILEWSPEKRTGTCDRKGDFARLVWSR 250 Query: 1196 KFVLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLD 1375 KFVLIFHELSMTGAPL+MMELATE LSCGATVS VVLS+KGGLM EL+RR+IKVLEDK D Sbjct: 251 KFVLIFHELSMTGAPLSMMELATEFLSCGATVSAVVLSKKGGLMSELNRRRIKVLEDKAD 310 Query: 1376 LSFKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVK 1555 LSFKTAMK+DL+IAGSAVCASWI++Y G SQI WWIMENRREYFDR+K+ L VK Sbjct: 311 LSFKTAMKADLVIAGSAVCASWIDQYMTRFPAGGSQIVWWIMENRREYFDRSKIVLNRVK 370 Query: 1556 KIIFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKML 1735 ++FLSE Q++QWL+WC+EEKI+L++ PA++PLS+NDELAFVAGI CSLNTP+ S EKML Sbjct: 371 MLVFLSESQTEQWLSWCDEEKIKLRAPPAIVPLSINDELAFVAGIACSLNTPSSSPEKML 430 Query: 1736 EKRQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQ----------GF 1885 EKR+LL VR+EMGLTD+D+L VSLSSINPGKGQ L+LESA+++IE G Sbjct: 431 EKRRLLADSVRKEMGLTDDDVLLVSLSSINPGKGQLLILESAKLLIEPEPLQKLRSSVGI 490 Query: 1886 PQNNSLTKSFKHRRINLPLRRTKASNGIIXXXXXXXXXXXXXXXXXXXXXVGSKSNKVPY 2065 + S + KH L ++KA + + VGSKSNKVPY Sbjct: 491 GEEQSRI-AVKHHLRALLQEKSKAVSDL-----KEGQEKYLKALKVLIGSVGSKSNKVPY 544 Query: 2066 VKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRVTIEAMAFGL 2245 VK +L +L+QHSNLS SVLWTPATTRVASLY+AADAYV+N+QG+GETFGRVTIEAMAFGL Sbjct: 545 VKEMLSYLTQHSNLSKSVLWTPATTRVASLYSAADAYVINSQGLGETFGRVTIEAMAFGL 604 Query: 2246 PVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMGTEGRKKVEK 2425 PVLGTDAGGTKEIVEHNVTGLLHP+GRPG VLA++L++LL NPS R+QMG GRKKVE+ Sbjct: 605 PVLGTDAGGTKEIVEHNVTGLLHPVGRPGTHVLAQNLRFLLRNPSVREQMGMAGRKKVER 664 Query: 2426 MFLKKDMYKKFGEVLYNCMRIK 2491 M+LK+ MYKKF EVLY CMR+K Sbjct: 665 MYLKRHMYKKFSEVLYKCMRVK 686 >ref|XP_002284822.1| PREDICTED: uncharacterized protein LOC100246448 [Vitis vinifera] Length = 691 Score = 777 bits (2007), Expect = 0.0 Identities = 415/691 (60%), Positives = 494/691 (71%), Gaps = 25/691 (3%) Frame = +2 Query: 494 VRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXXXX 673 VR SSLR G+LKSTLSG+STPR SPSFRR +S RTPRRE RSSG+ F NR Sbjct: 4 VRQSSLRPGGSLKSTLSGRSTPRNSPSFRRSHSSRTPRREARSSGVGSQWFRNNRVVFWL 63 Query: 674 XXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESG-EKTELQQKDRRELTANEDSLAVN 850 Y GFY+QS+WAHGDN E I G +G +EL +K L AN+ LAV Sbjct: 64 ILITLWAYLGFYVQSKWAHGDNNEDIIGFGGKPNNGISDSELNRK--APLIANDKLLAVK 121 Query: 851 NHVDTRQSDS-KRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVELQN 1027 N D S K+VD++LAK+G+S P + + E++ Sbjct: 122 NGSDKNPVGSGKKVDVVLAKKGNSVPSRRSASSKKRSKKSERSLRGKTRKQKTK-TEVEV 180 Query: 1028 TQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKFVL 1207 T+ D +EIP N++YGLLVGPFG+ ED+ILEWSPE+R+GTCDR G+ ARLVWSRKFVL Sbjct: 181 TEMDEQEQEIPKLNTSYGLLVGPFGSTEDRILEWSPEKRSGTCDRRGELARLVWSRKFVL 240 Query: 1208 IFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLSFK 1387 IFHELSMTGAPL+MMELATELLSCGATVS VVLS+KGGLM EL+RR+IKVLED+ DLSFK Sbjct: 241 IFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRADLSFK 300 Query: 1388 TAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKIIF 1567 TAMK+DL+IAGSAVCASWIE+Y H G+SQI WWIMENRREYFDR+KL + VK +IF Sbjct: 301 TAMKADLVIAGSAVCASWIEQYIAHFTAGSSQIVWWIMENRREYFDRSKLVINRVKMLIF 360 Query: 1568 LSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEKRQ 1747 LSE QSKQWL WC+EE IRL S+PA++PLSVNDELAFVAGI CSLNTP+F+ EKM EKR+ Sbjct: 361 LSESQSKQWLTWCKEENIRLISQPAVVPLSVNDELAFVAGITCSLNTPSFTTEKMQEKRR 420 Query: 1748 LLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQ---------------G 1882 LLR +R+EMGLTD DML +SLSSINPGKGQF LLES R MIEQ G Sbjct: 421 LLRDSIRKEMGLTDTDMLLLSLSSINPGKGQFFLLESVRSMIEQEPSQDDPELKDLVKIG 480 Query: 1883 FPQNN--------SLTKSFKHRRINLPLRRTKASNGIIXXXXXXXXXXXXXXXXXXXXXV 2038 Q+N +L ++ H ++ + S V Sbjct: 481 QDQSNFSGKHYSRALLQNVNHFSVSSSDEVSIGSGYKRRKVLSENEGTQEQALKVLIGSV 540 Query: 2039 GSKSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRV 2218 GSKSNKVPYVK LL FL++HSNLS SVLWTPATTRVASLY+AAD YV+N+QGMGETFGRV Sbjct: 541 GSKSNKVPYVKGLLRFLTRHSNLSKSVLWTPATTRVASLYSAADVYVINSQGMGETFGRV 600 Query: 2219 TIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMG 2398 TIEAMAFGLPVLGTDAGGTKE+VE NVTGLLHP+G G ++L++++++LL+NPS+R+QMG Sbjct: 601 TIEAMAFGLPVLGTDAGGTKEVVEQNVTGLLHPVGHLGTQILSENIRFLLKNPSSREQMG 660 Query: 2399 TEGRKKVEKMFLKKDMYKKFGEVLYNCMRIK 2491 GRKKVE+M+LK+ MYK+ EVLY CMRIK Sbjct: 661 KRGRKKVERMYLKRHMYKRLAEVLYKCMRIK 691 >ref|XP_004149847.1| PREDICTED: uncharacterized protein LOC101207532 [Cucumis sativus] gi|449496350|ref|XP_004160111.1| PREDICTED: uncharacterized protein LOC101223486 [Cucumis sativus] Length = 682 Score = 753 bits (1944), Expect = 0.0 Identities = 394/682 (57%), Positives = 492/682 (72%), Gaps = 14/682 (2%) Frame = +2 Query: 488 NLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXX 667 N+V+PSSLR +G+ K ++SGKSTPRGSPSFRR++S RTPRRE RS+G + N+ Sbjct: 12 NVVKPSSLRPSGSFKPSVSGKSTPRGSPSFRRLHSSRTPRREARSTGFSLHWIRNNKVLF 71 Query: 668 XXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLAV 847 Y GFY+QSRWAHG+NK+ G Q+S +K + +Q L + + L V Sbjct: 72 WLLLITLWAYLGFYVQSRWAHGENKDEFLGFG-GQQSNQKLDSEQNQSLSLISTNNRLVV 130 Query: 848 NNHV-DTRQSDSKRVDMILAKR--GSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVE 1018 N + +SD V+++LAK+ G S K+ E Sbjct: 131 ENRSGENDRSDGGVVNVVLAKKANGVSASKKTKPRKRSKRSKRDKVHKGKIP------AE 184 Query: 1019 LQNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRK 1198 + N + EIP +NS+YG+LVGPFG+ ED+ILEWSPE+R+GTCDR G FARLVWSR+ Sbjct: 185 VTNHDIEEQEPEIPLKNSSYGMLVGPFGSTEDRILEWSPEKRSGTCDRKGDFARLVWSRR 244 Query: 1199 FVLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDL 1378 FVLIFHELSMTGAP++MMELATELLSCGA+VS V LS+KGGLM ELSRR+IKVL+DK DL Sbjct: 245 FVLIFHELSMTGAPISMMELATELLSCGASVSAVALSKKGGLMSELSRRRIKVLDDKADL 304 Query: 1379 SFKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKK 1558 SFKTAMK+DL+IAGSAVCASWI+ Y EH GASQ+AWWIMENRREYF+R+K+ L VK Sbjct: 305 SFKTAMKADLVIAGSAVCASWIDGYIEHFPAGASQVAWWIMENRREYFNRSKVVLDRVKM 364 Query: 1559 IIFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLE 1738 +IF+SE QSKQWL W +EE I+L+S+PA++PLSVNDELAFVAGI CSLNT + S EKMLE Sbjct: 365 LIFISELQSKQWLNWSQEENIKLRSQPAIVPLSVNDELAFVAGISCSLNTESSSPEKMLE 424 Query: 1739 KRQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQNN------- 1897 K+QLLR+ R+EMG+ D D++ ++LSSINPGKG FLLLES+ ++I++G +++ Sbjct: 425 KKQLLRNTTRKEMGVGDNDVVVMTLSSINPGKGHFLLLESSNLLIDRGLKRDDPKIRNPD 484 Query: 1898 ----SLTKSFKHRRINLPLRRTKASNGIIXXXXXXXXXXXXXXXXXXXXXVGSKSNKVPY 2065 S K + R + L++ ++ VGSKSNKV Y Sbjct: 485 DSSPSRPKLARRRYMRALLQKLNDRRRLL----ADGGELPETSFKLLIGSVGSKSNKVVY 540 Query: 2066 VKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRVTIEAMAFGL 2245 VK LL FLSQHSNLS SVLWTPATTRVASLY+AAD YV+N+QG+GETFGRVTIEAMAFGL Sbjct: 541 VKRLLRFLSQHSNLSQSVLWTPATTRVASLYSAADIYVINSQGIGETFGRVTIEAMAFGL 600 Query: 2246 PVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMGTEGRKKVEK 2425 PVLGTDAGGTKEIVEHNVTGLLHPLGRPG +VLA++L++LL+NP R++MG EGRKKV+K Sbjct: 601 PVLGTDAGGTKEIVEHNVTGLLHPLGRPGTQVLAQNLEFLLKNPQVREKMGAEGRKKVKK 660 Query: 2426 MFLKKDMYKKFGEVLYNCMRIK 2491 ++LK+ MYKKF EV+ CMR K Sbjct: 661 IYLKRHMYKKFVEVIVKCMRTK 682 >ref|XP_002298139.1| glycosyl transferase family 1 family protein [Populus trichocarpa] gi|222845397|gb|EEE82944.1| glycosyl transferase family 1 family protein [Populus trichocarpa] Length = 681 Score = 748 bits (1931), Expect = 0.0 Identities = 393/680 (57%), Positives = 479/680 (70%), Gaps = 11/680 (1%) Frame = +2 Query: 485 INLVRPSSLRTNGALKST-LSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRX 661 +N+++ + R G+ KST LSG+STPR SP+ R ++S RTPRREGR SG F NR Sbjct: 12 VNVLKQTPSRQGGSFKSTTLSGRSTPRNSPTHRLLHSSRTPRREGRGSG-GIQWFRSNRL 70 Query: 662 XXXXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSL 841 Y GFY+QSRWAHGDNK+ G +G + +Q RR+L AN+ + Sbjct: 71 IYWLLLITLWTYLGFYVQSRWAHGDNKDEFLGFGGKSSNG-LLDAEQHTRRDLLANDSLV 129 Query: 842 AVNNHVDTRQ-SDSKRVDMILAKRGSS-DPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMV 1015 VNN + Q ++K++D++LAK+G+ + V Sbjct: 130 VVNNGTNKIQVRNAKKIDVVLAKKGNGVSSNRRATPKKKKSKRGGRRSRAKAHDKQKATV 189 Query: 1016 ELQNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSR 1195 +++ + + ++P N++YGLLVGPFG +ED+ILEWSPE+R+GTCDR G FARLVWSR Sbjct: 190 VVESDDVEVAEPDVPKNNASYGLLVGPFGPIEDRILEWSPEKRSGTCDRKGAFARLVWSR 249 Query: 1196 KFVLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLD 1375 KFVLIFHELSMTGAPL+M+ELATE LSCGATVS VVLS+KGGLM EL+RR+IKVLED+ D Sbjct: 250 KFVLIFHELSMTGAPLSMLELATEFLSCGATVSAVVLSKKGGLMPELARRRIKVLEDRAD 309 Query: 1376 LSFKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVK 1555 LSFKTAMK+DL+IAGSAVC SWI++Y G SQ+ WWIMENRREYFDR+K+ L VK Sbjct: 310 LSFKTAMKADLVIAGSAVCTSWIDQYIARFPAGGSQVVWWIMENRREYFDRSKIILNRVK 369 Query: 1556 KIIFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKML 1735 ++FLSE Q KQW WCEEE IRL+S PA++ LSVNDELAFVAGI CSLNTP S EKML Sbjct: 370 MLVFLSESQMKQWQTWCEEENIRLRSPPAVVQLSVNDELAFVAGIACSLNTPTSSSEKML 429 Query: 1736 EKRQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIE--------QGFPQ 1891 EKRQLLR VR+EMGLTD DML +SLSSIN GKGQ LLLESA ++IE + Sbjct: 430 EKRQLLRESVRKEMGLTDNDMLVMSLSSINAGKGQLLLLESANLVIEPDPSPKITNSVDK 489 Query: 1892 NNSLTKSFKHRRINLPLRRTKASNGIIXXXXXXXXXXXXXXXXXXXXXVGSKSNKVPYVK 2071 N T + KH L R+ K VGSKSNKVPYVK Sbjct: 490 GNQSTLAAKHHLRALSHRKRK--------LLADSEGTHEQALKVLIGSVGSKSNKVPYVK 541 Query: 2072 SLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRVTIEAMAFGLPV 2251 +L F+SQHSNLS SVLWT ATTRVASLY+AAD Y+ N+QG+GETFGRVTIEAMAFGLPV Sbjct: 542 EILRFISQHSNLSKSVLWTSATTRVASLYSAADVYITNSQGLGETFGRVTIEAMAFGLPV 601 Query: 2252 LGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMGTEGRKKVEKMF 2431 LGTDAGGT+EIVEHN+TGLLHP+GRPG+ VLA++++ LL+NPS R+QMG +GRKKVEKM+ Sbjct: 602 LGTDAGGTQEIVEHNITGLLHPVGRPGSRVLAQNIELLLKNPSVRKQMGIKGRKKVEKMY 661 Query: 2432 LKKDMYKKFGEVLYNCMRIK 2491 LK+ MYKK EVLY CMR+K Sbjct: 662 LKRHMYKKIWEVLYKCMRVK 681 >ref|XP_006597141.1| PREDICTED: uncharacterized protein LOC100793827 isoform X1 [Glycine max] gi|571514725|ref|XP_006597142.1| PREDICTED: uncharacterized protein LOC100793827 isoform X2 [Glycine max] Length = 701 Score = 748 bits (1930), Expect = 0.0 Identities = 408/691 (59%), Positives = 477/691 (69%), Gaps = 23/691 (3%) Frame = +2 Query: 488 NLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXX 667 NL + SSLR G+ KSTLSG+STPR SPSFRR+NSGRTPR+EGRSS F NR Sbjct: 13 NLAKQSSLRLGGSFKSTLSGRSTPRNSPSFRRLNSGRTPRKEGRSSVGGALWFRSNRLLL 72 Query: 668 XXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLAV 847 Y GF++QSRWAH D KE G + ++ +Q RR+L A+ SL+ Sbjct: 73 WLLLITLWAYLGFFVQSRWAHSDKKEEFSGYGTGPRN-TNSDAEQIQRRDLLASNKSLSA 131 Query: 848 NNHVDTRQSD-SKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVELQ 1024 NN D + SK +++ LAK + P E++ Sbjct: 132 NNDTDADIAGISKTINVALAKNDNDVPSH-RKTSSKNRSKGRRSSKGKSRGKLKPTTEIK 190 Query: 1025 NTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKFV 1204 NT + EIP+ NSTYGLLVGPFG +ED+ILEWSPE+R+GTC+R FARLVWSR+F+ Sbjct: 191 NTDIEEQEPEIPTTNSTYGLLVGPFGPMEDRILEWSPEKRSGTCNRKEDFARLVWSRRFI 250 Query: 1205 LIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLSF 1384 LIFHELSMTGAPL+MMELATELLSCGATVS VVLSRKGGLM EL+RR+IKVLEDK DLSF Sbjct: 251 LIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKGGLMSELARRRIKVLEDKADLSF 310 Query: 1385 KTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKII 1564 KTAMK+DL+IAGSAVCASWIE+Y EH GASQ+AWWIMENRREYFDR+K L VK ++ Sbjct: 311 KTAMKADLVIAGSAVCASWIEQYIEHFPAGASQVAWWIMENRREYFDRSKDVLHRVKMLV 370 Query: 1565 FLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEKR 1744 FLSE QSKQW WCEEE I+L+S P ++PLSVNDELAFVAGI +LNTP+FS EKM+EK+ Sbjct: 371 FLSESQSKQWQKWCEEESIKLRSHPEIVPLSVNDELAFVAGIPSTLNTPSFSTEKMVEKK 430 Query: 1745 QLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQNNSLTKSF--- 1915 QLLR VR+EMGLTD DML +SLSSINPGKGQ LLLES ++EQG + K Sbjct: 431 QLLRESVRKEMGLTDNDMLVISLSSINPGKGQLLLLESVSSVLEQGQSPGDKKMKEVSNI 490 Query: 1916 ---------KHR-RINLPLRRT--KASNGII-------XXXXXXXXXXXXXXXXXXXXXV 2038 KHR R LPL ASN I V Sbjct: 491 KEGLSSLARKHRIRKLLPLMSNGKVASNSISSNSLSRRKQVLPNDKGTIQQSLKLLIGSV 550 Query: 2039 GSKSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRV 2218 SKSNK YVKSLL FL QH N S S+ WTPATTRVASLY+AAD YV+N+QG+GETFGRV Sbjct: 551 RSKSNKADYVKSLLSFLEQHPNTSTSIFWTPATTRVASLYSAADVYVINSQGLGETFGRV 610 Query: 2219 TIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMG 2398 TIEAMAFGLPVLGTDAGGT+EIVEHNVTGLLHP+G PG VLA++L +LL+N SAR+QMG Sbjct: 611 TIEAMAFGLPVLGTDAGGTQEIVEHNVTGLLHPVGHPGNLVLAQNLWFLLKNQSARKQMG 670 Query: 2399 TEGRKKVEKMFLKKDMYKKFGEVLYNCMRIK 2491 GRKKV+KM+LK+ MYK F EV+ CMR K Sbjct: 671 VVGRKKVQKMYLKQQMYKNFVEVIARCMRSK 701 >gb|EXC25804.1| Putative glycosyltransferase ytcC [Morus notabilis] Length = 688 Score = 746 bits (1927), Expect = 0.0 Identities = 394/688 (57%), Positives = 488/688 (70%), Gaps = 17/688 (2%) Frame = +2 Query: 479 DEINLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNR 658 ++ ++ SLR G+ KSTLSG+STPR SPSFRR S RTPRREGR S F NR Sbjct: 3 EDSKILELKSLRIGGSFKSTLSGRSTPRNSPSFRRSQSSRTPRREGRGSARGLQWFRSNR 62 Query: 659 XXXXXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDS 838 Y GF++QSRWAH ++ + + G + ++ +E +Q RR+L A + S Sbjct: 63 LLFWLLLITLWAYLGFFVQSRWAHDNDNDNVMGFGKKPKNWN-SETEQNLRRDLIATDIS 121 Query: 839 LAVNNHVDTRQ-SDSKRVDMILAKR--GSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXE 1009 LAV N Q SD KR+D++LA R G S ++L Sbjct: 122 LAVKNGTGKNQVSDGKRMDVVLAGRNDGISSHRKLNSKKKKTKRANRSLRSKVHGKQKMT 181 Query: 1010 MVELQNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVW 1189 M E++N + + +IP N++YG+LVGPFG+LED+ILEWSPE+R+GTCDR G FAR+VW Sbjct: 182 M-EVKNVEIEEQEPDIPKTNASYGMLVGPFGSLEDRILEWSPEKRSGTCDRKGDFARIVW 240 Query: 1190 SRKFVLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDK 1369 SR+FVLIFHELSMTG+PL+MMELATELLSCGATVS V LS+KGGLM EL+RR+IKVLEDK Sbjct: 241 SRRFVLIFHELSMTGSPLSMMELATELLSCGATVSAVALSKKGGLMSELARRRIKVLEDK 300 Query: 1370 LDLSFKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTL 1549 DLSFKTAMK+DL+IAGSAVCASWI+++ EH GASQ+AWWIMENRREYFDRAK+ L Sbjct: 301 ADLSFKTAMKADLVIAGSAVCASWIDQFIEHFPAGASQVAWWIMENRREYFDRAKVVLNR 360 Query: 1550 VKKIIFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEK 1729 VK ++F+SE Q KQWLAW EEEKI L+S+P L+PLS+NDE+AFVAGI C+LNTP+F+ EK Sbjct: 361 VKMLVFISELQWKQWLAWAEEEKIYLRSQPVLVPLSINDEMAFVAGIACTLNTPSFTTEK 420 Query: 1730 MLEKRQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIE-QGFPQNNSL- 1903 M+EKRQLLR R+EMGL D DML +SLSSINPGKGQ LLL S R+MIE + F + +++ Sbjct: 421 MIEKRQLLRDSARKEMGLKDNDMLVMSLSSINPGKGQHLLLGSGRLMIEKEAFEEKSNIK 480 Query: 1904 ---------TKSFKHRRINLPLRRTKAS---NGIIXXXXXXXXXXXXXXXXXXXXXVGSK 2047 +KS + R+ ++ S G VGSK Sbjct: 481 NPVDIKHHQSKSTRKHRLKTVFQKLNGSMAFGGTHRKEMLDSGGMRERSVKILIGSVGSK 540 Query: 2048 SNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRVTIE 2227 SNKV YVK LL +LSQH N S SVLWTPA+TRVA+LYAAAD YV+N+QG+GETFGRVTIE Sbjct: 541 SNKVVYVKELLNYLSQHPNTSKSVLWTPASTRVAALYAAADVYVINSQGLGETFGRVTIE 600 Query: 2228 AMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMGTEG 2407 AMAF LPVLGTDAGGTKEIVEHNVTGLLHP G PGA VLA +L++LL+NP R++MG +G Sbjct: 601 AMAFSLPVLGTDAGGTKEIVEHNVTGLLHPTGSPGAPVLAGNLEFLLKNPVTRKEMGMKG 660 Query: 2408 RKKVEKMFLKKDMYKKFGEVLYNCMRIK 2491 R+KVE+M+LK+ +YKKF +VL CMR K Sbjct: 661 REKVERMYLKRHLYKKFVDVLVKCMRPK 688 >gb|EOY26677.1| UDP-Glycosyltransferase superfamily protein isoform 1 [Theobroma cacao] Length = 702 Score = 744 bits (1922), Expect = 0.0 Identities = 408/707 (57%), Positives = 491/707 (69%), Gaps = 35/707 (4%) Frame = +2 Query: 476 MDEINLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGN 655 M+E PSSLR G+ KS+LSG+STP+ SP+FRR+NS RTPRRE RS F N Sbjct: 1 MEESVSKGPSSLR-QGSFKSSLSGRSTPKSSPTFRRLNSSRTPRREARSGAGGIQWFRSN 59 Query: 656 RXXXXXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANED 835 R Y GFY+QSRWAHG NKE G + + +G + +Q RR+L A++ Sbjct: 60 RLVYWLLLITLWAYLGFYVQSRWAHGHNKEEFLGFSGNPRNG-LIDAEQNPRRDLLADDS 118 Query: 836 SLAVNNHVDTRQSDSKR-VDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEM 1012 +AVNN + Q S R D+ILAK+ + + Sbjct: 119 LVAVNNGTNKTQVYSDRKFDVILAKKRN---EVSFNKKRSRRSKRAGRNLSKMRGKRKAT 175 Query: 1013 VELQNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWS 1192 + ++N + + EI +NSTYGLLVGPFG++ED+ILEWSPE+R+GTCDR G FARLVWS Sbjct: 176 INIENGETEGQEHEILQKNSTYGLLVGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWS 235 Query: 1193 RKFVLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKL 1372 R+ VL+FHELSMTGAP++MMELATELLSCGATVS VVLS+KGGLM EL+RR+IKV+ED+ Sbjct: 236 RRLVLVFHELSMTGAPISMMELATELLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRA 295 Query: 1373 DLSFKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLV 1552 DLSFKTAMK+DL+IAGSAVCASWI++Y H G SQIAWWIMENRREYFDR+KL L V Sbjct: 296 DLSFKTAMKADLVIAGSAVCASWIDQYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRV 355 Query: 1553 KKIIFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKM 1732 K +IFLSE QSKQWL WC+EE I+L+S+PAL+PL+VNDELAFVAGI CSLNTP+ S EKM Sbjct: 356 KMLIFLSELQSKQWLTWCQEENIKLRSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKM 415 Query: 1733 LEKRQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQNNS-LTK 1909 LEKRQLLR VR+EMGLTD DML +SLSSIN GKGQ LLLE+A +MI+Q Q +S +TK Sbjct: 416 LEKRQLLRDAVRKEMGLTDNDMLVMSLSSINTGKGQLLLLEAAGLMIDQDPLQTDSEVTK 475 Query: 1910 SF-----------KHRRINL------------PLRRTKASNGI-------IXXXXXXXXX 1999 S KH L LR + NG Sbjct: 476 SLDIRQDQSTLTVKHHLRGLLQKSSDVDVSSTDLRLFASVNGTNAVSIDSSHRRRNMLFD 535 Query: 2000 XXXXXXXXXXXXVGS---KSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAAD 2170 +GS KSNK+PYVK +L FLSQH+ LS SVLWTPATT VASLY+AAD Sbjct: 536 SKGTQEQALKILIGSVGSKSNKMPYVKEILRFLSQHAKLSESVLWTPATTHVASLYSAAD 595 Query: 2171 AYVMNAQGMGETFGRVTIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAK 2350 YVMN+QG+GETFGRVT+EAMAFGLPVLGTDAGGTKEIVE+NVTGL HP+G PGA+ LA Sbjct: 596 VYVMNSQGLGETFGRVTVEAMAFGLPVLGTDAGGTKEIVENNVTGLFHPMGHPGAQALAG 655 Query: 2351 HLKYLLENPSARQQMGTEGRKKVEKMFLKKDMYKKFGEVLYNCMRIK 2491 +L++LL+NPSAR+QMG EGRKKVE+ +LK+ MYK+F EVL CMRIK Sbjct: 656 NLRFLLKNPSARKQMGMEGRKKVERKYLKRHMYKRFVEVLTRCMRIK 702 >gb|ESW22669.1| hypothetical protein PHAVU_005G172300g [Phaseolus vulgaris] gi|561023940|gb|ESW22670.1| hypothetical protein PHAVU_005G172300g [Phaseolus vulgaris] Length = 701 Score = 744 bits (1921), Expect = 0.0 Identities = 400/691 (57%), Positives = 482/691 (69%), Gaps = 23/691 (3%) Frame = +2 Query: 488 NLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXX 667 NL + +SLR G+ KSTLSG+STPR SPSFRR NSGRTPR+EGRS F NR Sbjct: 13 NLAKQTSLRLGGSFKSTLSGRSTPRNSPSFRRQNSGRTPRKEGRSGIGGALWFRSNRLLF 72 Query: 668 XXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLAV 847 Y GF++QSRWAH D KE G + ++ +Q RR+L A++ SL+ Sbjct: 73 WLLLITLWAYLGFFVQSRWAHSDKKEEFSGFGTGPRN-TGSDAEQVQRRDLLASDHSLSA 131 Query: 848 NNHVDTRQS-DSKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVELQ 1024 NN D + SK ++++LAKRG+ P +++ Sbjct: 132 NNETDANIALSSKTINVVLAKRGNDVPSHRKTSSKKRSRRRRASKGKSSGKLKPS-TDVK 190 Query: 1025 NTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKFV 1204 + + EIP+ N TYGLLVGPFG +ED+ILEWSPE+R+GTC+R G FARLVWSR+F+ Sbjct: 191 DADIEEQKPEIPTANGTYGLLVGPFGPVEDRILEWSPEKRSGTCNRKGDFARLVWSRRFI 250 Query: 1205 LIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLSF 1384 L+FHELSMTGAPL+MMELATELLSCGATVS VVLS+KGGLM EL+RR+IKVLEDK DLSF Sbjct: 251 LVFHELSMTGAPLSMMELATELLSCGATVSAVVLSKKGGLMSELARRRIKVLEDKADLSF 310 Query: 1385 KTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKII 1564 KTAMK+DL+IAGSAVCASWI++Y E GASQ+ WWIMENRREYFD +K AL VK ++ Sbjct: 311 KTAMKADLVIAGSAVCASWIDQYIERFPAGASQVVWWIMENRREYFDLSKDALDRVKMLV 370 Query: 1565 FLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEKR 1744 FLSE QSKQWL WCEEE I+L+S P +IPLSVNDELAFVAGI +LNTP+FS +KM+EKR Sbjct: 371 FLSESQSKQWLKWCEEESIKLRSYPEIIPLSVNDELAFVAGIPSTLNTPSFSTDKMVEKR 430 Query: 1745 QLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQNNSLTKSF--- 1915 QLLR VR+E+GL D DML +SLSSINPGKGQ LLLES ++EQG+ Q++ K Sbjct: 431 QLLRESVRKEIGLNDSDMLVISLSSINPGKGQLLLLESVSSVLEQGWLQDDKKMKKVSNI 490 Query: 1916 ---------KHR-RINLPLRRT--KASNGII-------XXXXXXXXXXXXXXXXXXXXXV 2038 KHR R LP+ + SN I V Sbjct: 491 KEGISTLARKHRIRKLLPVLKNGKVVSNDISSNSLSRRKQVLPDDKGTIQKSLKLLIGSV 550 Query: 2039 GSKSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRV 2218 GSKSNK YVKSLL FL QH N S S+ WTPATTRVASLY+AAD YV+N+QG+GETFGRV Sbjct: 551 GSKSNKADYVKSLLNFLEQHPNTSKSIFWTPATTRVASLYSAADVYVINSQGLGETFGRV 610 Query: 2219 TIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMG 2398 TIEAMAFGLPVLGT+AGGTKEIVEHNVTGLLHP+G PG VLA++L++LL+N AR+QMG Sbjct: 611 TIEAMAFGLPVLGTEAGGTKEIVEHNVTGLLHPVGHPGNLVLAQNLRFLLKNQLARKQMG 670 Query: 2399 TEGRKKVEKMFLKKDMYKKFGEVLYNCMRIK 2491 EGRKKV++M+LK+ MYKKF EV+ CMR K Sbjct: 671 VEGRKKVQQMYLKQHMYKKFVEVIVRCMRSK 701 >gb|EOY26678.1| UDP-Glycosyltransferase superfamily protein isoform 2 [Theobroma cacao] Length = 703 Score = 740 bits (1910), Expect = 0.0 Identities = 408/708 (57%), Positives = 491/708 (69%), Gaps = 36/708 (5%) Frame = +2 Query: 476 MDEINLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGN 655 M+E PSSLR G+ KS+LSG+STP+ SP+FRR+NS RTPRRE RS F N Sbjct: 1 MEESVSKGPSSLR-QGSFKSSLSGRSTPKSSPTFRRLNSSRTPRREARSGAGGIQWFRSN 59 Query: 656 RXXXXXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANED 835 R Y GFY+QSRWAHG NKE G + + +G + +Q RR+L A++ Sbjct: 60 RLVYWLLLITLWAYLGFYVQSRWAHGHNKEEFLGFSGNPRNG-LIDAEQNPRRDLLADDS 118 Query: 836 SLAVNNHVDTRQSDSKR-VDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEM 1012 +AVNN + Q S R D+ILAK+ + + Sbjct: 119 LVAVNNGTNKTQVYSDRKFDVILAKKRN---EVSFNKKRSRRSKRAGRNLSKMRGKRKAT 175 Query: 1013 VELQNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWS 1192 + ++N + + EI +NSTYGLLVGPFG++ED+ILEWSPE+R+GTCDR G FARLVWS Sbjct: 176 INIENGETEGQEHEILQKNSTYGLLVGPFGSVEDRILEWSPEKRSGTCDRKGDFARLVWS 235 Query: 1193 RKFVLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKL 1372 R+ VL+FHELSMTGAP++MMELATELLSCGATVS VVLS+KGGLM EL+RR+IKV+ED+ Sbjct: 236 RRLVLVFHELSMTGAPISMMELATELLSCGATVSAVVLSKKGGLMSELARRRIKVIEDRA 295 Query: 1373 DLSFKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLV 1552 DLSFKTAMK+DL+IAGSAVCASWI++Y H G SQIAWWIMENRREYFDR+KL L V Sbjct: 296 DLSFKTAMKADLVIAGSAVCASWIDQYIAHFPAGGSQIAWWIMENRREYFDRSKLVLHRV 355 Query: 1553 KKIIFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKM 1732 K +IFLSE QSKQWL WC+EE I+L+S+PAL+PL+VNDELAFVAGI CSLNTP+ S EKM Sbjct: 356 KMLIFLSELQSKQWLTWCQEENIKLRSQPALVPLAVNDELAFVAGIPCSLNTPSASPEKM 415 Query: 1733 LEKRQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQNNS-LTK 1909 LEKRQLLR VR+EMGLTD DML +SLSSIN GKGQ LLLE+A +MI+Q Q +S +TK Sbjct: 416 LEKRQLLRDAVRKEMGLTDNDMLVMSLSSINTGKGQLLLLEAAGLMIDQDPLQTDSEVTK 475 Query: 1910 SF-----------KHRRINL------------PLRRTKASNGI-------IXXXXXXXXX 1999 S KH L LR + NG Sbjct: 476 SLDIRQDQSTLTVKHHLRGLLQKSSDVDVSSTDLRLFASVNGTNAVSIDSSHRRRNMLFD 535 Query: 2000 XXXXXXXXXXXXVGS---KSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAAD 2170 +GS KSNK+PYVK +L FLSQH+ LS SVLWTPATT VASLY+AAD Sbjct: 536 SKGTQEQALKILIGSVGSKSNKMPYVKEILRFLSQHAKLSESVLWTPATTHVASLYSAAD 595 Query: 2171 AYVMNA-QGMGETFGRVTIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLA 2347 YVMN+ QG+GETFGRVT+EAMAFGLPVLGTDAGGTKEIVE+NVTGL HP+G PGA+ LA Sbjct: 596 VYVMNSQQGLGETFGRVTVEAMAFGLPVLGTDAGGTKEIVENNVTGLFHPMGHPGAQALA 655 Query: 2348 KHLKYLLENPSARQQMGTEGRKKVEKMFLKKDMYKKFGEVLYNCMRIK 2491 +L++LL+NPSAR+QMG EGRKKVE+ +LK+ MYK+F EVL CMRIK Sbjct: 656 GNLRFLLKNPSARKQMGMEGRKKVERKYLKRHMYKRFVEVLTRCMRIK 703 >ref|XP_003542107.1| PREDICTED: uncharacterized protein LOC100795000 isoform X1 [Glycine max] gi|571503664|ref|XP_006595144.1| PREDICTED: uncharacterized protein LOC100795000 isoform X2 [Glycine max] Length = 701 Score = 734 bits (1894), Expect = 0.0 Identities = 398/692 (57%), Positives = 480/692 (69%), Gaps = 24/692 (3%) Frame = +2 Query: 488 NLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXX 667 NL + SSLR G+ KSTLSG+S PR SPSFRR+NS RTPR+EGR S F N Sbjct: 13 NLAKQSSLRLGGSFKSTLSGRSNPRNSPSFRRLNSVRTPRKEGRISVGGALWFRSNHLLL 72 Query: 668 XXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLAV 847 Y GF++QSRWAH D KE G + T+ +Q RR+L A++ SL+ Sbjct: 73 WLLLITLWAYLGFFVQSRWAHSDKKEEFSGFGTGPRN-TNTDAEQIQRRDLLASDKSLSA 131 Query: 848 NNHVDTRQSD-SKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVELQ 1024 NN + SK + + LAK+ + P E++ Sbjct: 132 NNETGADIAGISKTISVALAKKDNDVPSH-RKTSSKKRSKSRRSSKGKSRGKLKPTTEIK 190 Query: 1025 NTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKFV 1204 NT + EIP+ N+TYGLLVGPFG +ED+ILEWSPE+R+GTC+R FARLVWSR+F+ Sbjct: 191 NTDIEEQEPEIPTTNNTYGLLVGPFGPMEDRILEWSPEKRSGTCNRKEDFARLVWSRRFI 250 Query: 1205 LIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLSF 1384 LIFHELSMTGAPL+MMELATELLSCGATVS VVLSRKGGLM EL+RR+IKVLEDK DLSF Sbjct: 251 LIFHELSMTGAPLSMMELATELLSCGATVSAVVLSRKGGLMSELARRRIKVLEDKSDLSF 310 Query: 1385 KTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKII 1564 KTAMK+DL+IAGSAVCASWIE+Y +H GASQ+AWWIMENRREYFDR+K L VK ++ Sbjct: 311 KTAMKADLVIAGSAVCASWIEQYIDHFPAGASQVAWWIMENRREYFDRSKDILHRVKMLV 370 Query: 1565 FLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEKR 1744 FLSE QSKQW WCEEE I+L+S P ++ LSVN+ELAFVAGI +LNTP+FS EKM+EK+ Sbjct: 371 FLSESQSKQWQKWCEEESIKLRSLPEIVALSVNEELAFVAGIPSTLNTPSFSTEKMVEKK 430 Query: 1745 QLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQN---------- 1894 QLLR VR+EMGLTD DML +SLSSINPGKGQ LLLES ++EQG Q+ Sbjct: 431 QLLRESVRKEMGLTDNDMLVISLSSINPGKGQLLLLESVSSVLEQGQLQDDKKMKKVSNI 490 Query: 1895 ----NSLTKSFKHRRINLPLRRT--KASNGII-------XXXXXXXXXXXXXXXXXXXXX 2035 +SLT+ + R++ LPL + ASN I Sbjct: 491 KEGLSSLTRKHRIRKL-LPLMKNGKVASNSISSNSLSRRKQVLPNGKGTIQQSLKLLIGS 549 Query: 2036 VGSKSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGR 2215 V SKSNK YVKSLL FL QH N S S+ WTPATTRVASLY+AAD YV+N+QG+GETFGR Sbjct: 550 VRSKSNKADYVKSLLSFLEQHPNASTSIFWTPATTRVASLYSAADVYVINSQGLGETFGR 609 Query: 2216 VTIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQM 2395 VTIEAMA+GLPVLGTDAGGT+EIVE+NVTGLLHP+G PG +VLA++L++LL+N AR+QM Sbjct: 610 VTIEAMAYGLPVLGTDAGGTREIVENNVTGLLHPVGHPGNDVLAQNLRFLLKNQLARKQM 669 Query: 2396 GTEGRKKVEKMFLKKDMYKKFGEVLYNCMRIK 2491 G EGRKKV+KM+LK+ MYK F EV+ CMR K Sbjct: 670 GVEGRKKVQKMYLKQHMYKNFVEVITRCMRSK 701 >ref|XP_004486717.1| PREDICTED: uncharacterized protein LOC101501726 [Cicer arietinum] Length = 709 Score = 733 bits (1893), Expect = 0.0 Identities = 396/697 (56%), Positives = 485/697 (69%), Gaps = 27/697 (3%) Frame = +2 Query: 482 EINLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRX 661 + +L + SSLR+ G+ KSTLSG+STPR SP+FRR+N+ RTPR++GRS G + F NR Sbjct: 14 QASLAKLSSLRSGGSFKSTLSGRSTPRNSPTFRRLNTSRTPRKDGRSVGSSLW-FRSNRV 72 Query: 662 XXXXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSL 841 Y GF++QSRWAH D KE G + + RR+L A+EDSL Sbjct: 73 LLWLLLITLWAYLGFFVQSRWAHSDKKEEFSGFGTGPRNTGSNDDSTSLRRDLIASEDSL 132 Query: 842 AVNNH-VDTRQSDSKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXE--- 1009 +VNN V + + +++ LA +G+ D + Sbjct: 133 SVNNETVINKGGVGRTINVALAMKGNDDDDDDVPSRRKASSKKKKSKRSSRGKARGKNKP 192 Query: 1010 MVELQNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVW 1189 VE++N + EIP NSTYGLLVGPFG+ ED+ILEWSP++R+GTC+R G FARLVW Sbjct: 193 KVEIKNNDIEEQEPEIPETNSTYGLLVGPFGSTEDRILEWSPQKRSGTCNRKGDFARLVW 252 Query: 1190 SRKFVLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDK 1369 SR+F+LIFHELSMTGAPL+MMELATELLSCGATVS V LSRKGGLM EL+RR+IK+LEDK Sbjct: 253 SRRFILIFHELSMTGAPLSMMELATELLSCGATVSAVALSRKGGLMSELARRRIKLLEDK 312 Query: 1370 LDLSFKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTL 1549 DLSFKTAMK+DL+IAGSAVCASWIE+Y EH GASQ+AWWIMENRREYF+R K L Sbjct: 313 ADLSFKTAMKADLVIAGSAVCASWIEQYIEHFPAGASQVAWWIMENRREYFNRTKGVLDR 372 Query: 1550 VKKIIFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEK 1729 VK ++FLSE QSKQW WCEEE I+L+S P +IPLSVNDELAFVAGI +LNTP+F +K Sbjct: 373 VKMLVFLSESQSKQWQKWCEEENIKLRSRPEIIPLSVNDELAFVAGIPSTLNTPSFDTDK 432 Query: 1730 MLEKRQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQN----- 1894 M+EK+QLLR VR+EMGLTD DML +SLSSINPGKGQ LLLESA ++E G Q+ Sbjct: 433 MIEKKQLLRESVRKEMGLTDHDMLVISLSSINPGKGQLLLLESAISVVEHGQLQDDKKMK 492 Query: 1895 ---------NSLTKSFKHRRINLPLRRTKASNGII--------XXXXXXXXXXXXXXXXX 2023 ++LT+ + R++ L+ K + I Sbjct: 493 KSSNIKEGLSTLTRKQRIRKLLPMLKDGKVALKDISINSLSRRKQVLPNNKTTTQQSLKV 552 Query: 2024 XXXXVGSKSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGE 2203 VGSKSNK YVKSLL FL+QH N S +VLWTP+TT+VASLY+AAD YV+N+QG+GE Sbjct: 553 LIGSVGSKSNKADYVKSLLSFLAQHPNTSKTVLWTPSTTQVASLYSAADVYVINSQGLGE 612 Query: 2204 TFGRVTIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGR-PGAEVLAKHLKYLLENPS 2380 TFGRVTIEAMAFGLPVLGTDAGGTKEIVE+NVTGLLHP+GR G +VLA++L YLL+N Sbjct: 613 TFGRVTIEAMAFGLPVLGTDAGGTKEIVENNVTGLLHPVGRAAGNDVLAQNLVYLLKNQL 672 Query: 2381 ARQQMGTEGRKKVEKMFLKKDMYKKFGEVLYNCMRIK 2491 AR+QMG EGRKKVE+M+LK+ MYKKF EV+ CMR K Sbjct: 673 ARKQMGMEGRKKVERMYLKQHMYKKFVEVIVRCMRNK 709 >ref|XP_006427083.1| hypothetical protein CICLE_v10024994mg [Citrus clementina] gi|557529073|gb|ESR40323.1| hypothetical protein CICLE_v10024994mg [Citrus clementina] Length = 732 Score = 717 bits (1850), Expect = 0.0 Identities = 382/727 (52%), Positives = 479/727 (65%), Gaps = 58/727 (7%) Frame = +2 Query: 485 INLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXX 664 +N+ R SS R G+LKS+LSG+STP+ SPSFRR+N+ RTPRRE RS+ +++ F NR Sbjct: 12 VNVARQSSFRQGGSLKSSLSGRSTPKNSPSFRRLNASRTPRREVRSASLQW--FRSNRLV 69 Query: 665 XXXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLA 844 Y GFY+QSRWAHG+N + G + + E + Q RR+L AN L Sbjct: 70 YWLLLITLWTYLGFYVQSRWAHGENNDKFLGFGGKRRN-EIVDSNQNKRRDLIANHSDLD 128 Query: 845 VNNH-VDTRQSDSKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVEL 1021 +NN + T +DSK++DM+L +R ++D + +++ Sbjct: 129 INNGTIKTLGADSKKIDMVLTQRRNNDASR---RSVAKRKKSKRSSRGKGRGKQKAKLDV 185 Query: 1022 QNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKF 1201 ++ +A EIP N++YGLLVGPFG ED+ILEWSPE+R+GTCDR G FAR VWSRKF Sbjct: 186 ESNYMEAQLPEIPMTNASYGLLVGPFGLTEDRILEWSPEKRSGTCDRKGDFARFVWSRKF 245 Query: 1202 VLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLS 1381 +LIFHELSMTGAPL+MMELATELLSCGATVS VVLS++GGLM EL+RRKIKVLED+ + S Sbjct: 246 ILIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKRGGLMPELARRKIKVLEDRGEPS 305 Query: 1382 FKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKI 1561 FKT+MK+DL+IAGSAVCA+WI++Y G SQ+ WWIMENRREYFDRAKL L VK + Sbjct: 306 FKTSMKADLVIAGSAVCATWIDQYITRFPAGGSQVVWWIMENRREYFDRAKLVLDRVKML 365 Query: 1562 IFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEK 1741 +FLSE Q+KQWL WCEEEK++L+S+PA++PLSVNDELAFVAG CSLNTP S EKM EK Sbjct: 366 VFLSESQTKQWLTWCEEEKLKLRSQPAVVPLSVNDELAFVAGFTCSLNTPTSSPEKMCEK 425 Query: 1742 RQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQ-------------- 1879 R LLR VR+EMGLTD+DML +SLSSINPGKGQ LL+ESA++MIEQ Sbjct: 426 RNLLRDSVRKEMGLTDQDMLVLSLSSINPGKGQLLLVESAQLMIEQEPSMDDSKIRKSRN 485 Query: 1880 --------------------------GFPQNNSLTKSFKHRRINLPLRRTKASNGIIXXX 1981 G N S ++N P+R+ S + Sbjct: 486 VGRKKSSLTSRHHLRGRGLLQMSDDVGLSSNELSVSSESFTQLNEPVRKNLLSPSLFTSI 545 Query: 1982 XXXXXXXXXXXXXXXXXXVGSKSNKVPYVKSLLEFLSQHSN-----------------LS 2110 S + +K L+ + SN LS Sbjct: 546 GNTDAVSFGSGHLRRKVLSKSDGKQQQALKILIGSVGSKSNKVPYVKEILEFLSQHSNLS 605 Query: 2111 HSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRVTIEAMAFGLPVLGTDAGGTKEIVE 2290 ++LWTPATTRVASLY+AAD YV+N+QG+GETFGRVTIEAMAFG+PVLGTDAGGTKEIVE Sbjct: 606 KAMLWTPATTRVASLYSAADVYVINSQGLGETFGRVTIEAMAFGVPVLGTDAGGTKEIVE 665 Query: 2291 HNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMGTEGRKKVEKMFLKKDMYKKFGEVL 2470 HNVTGLLHP G PGA+VLA++L+YLL+NPS R++M EGRKKVE+M+LKK MYKK +V+ Sbjct: 666 HNVTGLLHPPGHPGAQVLAQNLRYLLKNPSVRERMAMEGRKKVERMYLKKQMYKKLSQVI 725 Query: 2471 YNCMRIK 2491 Y CM+ K Sbjct: 726 YKCMKPK 732 >ref|XP_006465456.1| PREDICTED: uncharacterized protein LOC102612096 isoform X1 [Citrus sinensis] gi|568822059|ref|XP_006465457.1| PREDICTED: uncharacterized protein LOC102612096 isoform X2 [Citrus sinensis] Length = 732 Score = 715 bits (1845), Expect = 0.0 Identities = 382/727 (52%), Positives = 479/727 (65%), Gaps = 58/727 (7%) Frame = +2 Query: 485 INLVRPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXX 664 +N+ R SS R G+LKS+LSG+STP+ SPSFRR+N+ RTPRRE RS+ +++ F NR Sbjct: 12 VNVARQSSFRQGGSLKSSLSGRSTPKNSPSFRRLNASRTPRREVRSASLQW--FRSNRLV 69 Query: 665 XXXXXXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLA 844 Y GFY+QSRWAHG+N + G + + E + Q RR+L AN L Sbjct: 70 YWLLLITLWTYLGFYVQSRWAHGENNDKFLGFGGKRRN-EIVDSNQNKRRDLIANHSDLD 128 Query: 845 VNNH-VDTRQSDSKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVEL 1021 +NN + T +DSK++DM+L +R ++D + +++ Sbjct: 129 INNGTIKTLGADSKKMDMVLTQRRNNDASR---RSVAKRKKSKRSSRGKGRGKQKAKLDV 185 Query: 1022 QNTQADASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKF 1201 ++ +A EIP N++YGLLVGPFG ED+ILEWSPE+R+GTCDR G FAR VWSRKF Sbjct: 186 ESNYMEAQLPEIPMTNASYGLLVGPFGLTEDRILEWSPEKRSGTCDRKGDFARFVWSRKF 245 Query: 1202 VLIFHELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLS 1381 +LIFHELSMTGAPL+MMELATELLSCGATVS VVLS++GGLM EL+RRKIKVLED+ + S Sbjct: 246 ILIFHELSMTGAPLSMMELATELLSCGATVSAVVLSKRGGLMPELARRKIKVLEDRGEPS 305 Query: 1382 FKTAMKSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKI 1561 FKT+MK+DL+IAGSAVCA+WI++Y G SQ+ WWIMENRREYFDRAKL L VK + Sbjct: 306 FKTSMKADLVIAGSAVCATWIDQYITRFPAGGSQVVWWIMENRREYFDRAKLVLDRVKLL 365 Query: 1562 IFLSEPQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEK 1741 +FLSE Q+KQWL WCEEEK++L+S+PA++PLSVNDELAFVAG CSLNTP S EKM EK Sbjct: 366 VFLSESQTKQWLTWCEEEKLKLRSQPAVVPLSVNDELAFVAGFTCSLNTPTSSPEKMREK 425 Query: 1742 RQLLRSLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQ-------------- 1879 R LLR VR+EMGLTD+DML +SLSSINPGKGQ LL+ESA++MIEQ Sbjct: 426 RNLLRDSVRKEMGLTDQDMLVLSLSSINPGKGQLLLVESAQLMIEQEPSMDDSKIRKSRN 485 Query: 1880 --------------------------GFPQNNSLTKSFKHRRINLPLRRTKASNGIIXXX 1981 G N S ++N P+R+ S + Sbjct: 486 VGRKKSSLTSRHHLRGRGLLQMSDDVGLSSNELSVSSESFTQLNEPVRKNLLSPSLFTSI 545 Query: 1982 XXXXXXXXXXXXXXXXXXVGSKSNKVPYVKSLLEFLSQHSN-----------------LS 2110 S + +K L+ + SN LS Sbjct: 546 GNTDAVSFGSGHLRRKVLSKSDGKQQQALKILIGSVGSKSNKVPYVKEILEFLSQHSNLS 605 Query: 2111 HSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRVTIEAMAFGLPVLGTDAGGTKEIVE 2290 ++LWTPATTRVASLY+AAD YV+N+QG+GETFGRVTIEAMAFG+PVLGTDAGGTKEIVE Sbjct: 606 KAMLWTPATTRVASLYSAADVYVINSQGLGETFGRVTIEAMAFGVPVLGTDAGGTKEIVE 665 Query: 2291 HNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMGTEGRKKVEKMFLKKDMYKKFGEVL 2470 HNVTGLLHP G PGA+VLA++L+YLL+NPS R++M EGRKKVE+M+LKK MYKK +V+ Sbjct: 666 HNVTGLLHPPGHPGAQVLAQNLRYLLKNPSVRERMAMEGRKKVERMYLKKHMYKKLSQVI 725 Query: 2471 YNCMRIK 2491 Y CM+ K Sbjct: 726 YKCMKPK 732 >ref|NP_188215.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana] gi|334185383|ref|NP_001189906.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana] gi|9294599|dbj|BAB02880.1| glycosyl transferases-like protein [Arabidopsis thaliana] gi|20147191|gb|AAM10311.1| AT3g15940/MVC8_7 [Arabidopsis thaliana] gi|22796166|emb|CAD45267.1| putative glycosyltransferase [Arabidopsis thaliana] gi|332642228|gb|AEE75749.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana] gi|332642229|gb|AEE75750.1| UDP-glycosyltransferase-like protein [Arabidopsis thaliana] Length = 697 Score = 694 bits (1792), Expect = 0.0 Identities = 370/687 (53%), Positives = 468/687 (68%), Gaps = 32/687 (4%) Frame = +2 Query: 521 GALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXXXXXXXXXXXYG 700 G+ KS+LSG+STPRGSP+ R+++SGRTPRREG+ SG F NR Y Sbjct: 12 GSFKSSLSGRSTPRGSPTLRKVHSGRTPRREGKGSGGAVQWFRSNRLLYWLLLITLWTYL 71 Query: 701 GFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLAVNNHVD-TRQSD 877 GFY+QSRWAH D+ + F + + ++Q RR+L A+E S AV +H + Sbjct: 72 GFYVQSRWAHDDDNKVEFLRFGGKLREDVLHVEQNKRRDLVADESSHAVVDHTNIVHLGV 131 Query: 878 SKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVELQNTQADASAEEI 1057 +KR+ + LAK+ S ++ V ++ + D +E+ Sbjct: 132 NKRMHVTLAKKEDSTSRRSVSPRRRTRKASRSSRTRIRSTQKVRKV-METKELDEQDQEL 190 Query: 1058 PSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKFVLIFHELSMTGA 1237 P+ N TYG L GPFG+LED+ILEWSP++R+GTCDR F RLVWSR+FVL+FHELSMTGA Sbjct: 191 PNINVTYGKLFGPFGSLEDRILEWSPQKRSGTCDRKSDFKRLVWSRRFVLLFHELSMTGA 250 Query: 1238 PLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLSFKTAMKSDLIIA 1417 P++MMELA+ELLSCGATV VVLSR+GGL+QEL+RR+IKV+EDK +LSFKTAMK+DL+IA Sbjct: 251 PISMMELASELLSCGATVYAVVLSRRGGLLQELTRRRIKVVEDKGELSFKTAMKADLVIA 310 Query: 1418 GSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKIIFLSEPQSKQWL 1597 GSAVCASWI++Y +H G SQIAWW+MENRREYFDRAK L VK +IFLSE QSKQWL Sbjct: 311 GSAVCASWIDQYMDHHPAGGSQIAWWVMENRREYFDRAKPVLDRVKLLIFLSEVQSKQWL 370 Query: 1598 AWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEKRQLLRSLVRREM 1777 WCEE+ ++L+S+P ++PLSVNDELAFVAG+ SLNTP ++E M EKRQ LR VR E Sbjct: 371 TWCEEDHVKLRSQPVIVPLSVNDELAFVAGVSSSLNTPTLTQETMKEKRQKLRESVRTEF 430 Query: 1778 GLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQ-------------------NNS 1900 GLTD+DML +SLSSINPGKGQ LLLES + +E+ Q Sbjct: 431 GLTDKDMLVMSLSSINPGKGQLLLLESVALALEREQTQEQVAKRNQSKIIKNLNGIRKEK 490 Query: 1901 LTKSFKHRRINLPLRRTKASNGII------------XXXXXXXXXXXXXXXXXXXXXVGS 2044 ++ S +H R+ R+ K ++ + VGS Sbjct: 491 ISLSARH-RLRGSSRKMKITSPAVDNHPSVLSATGRRKLLLSGNVTQKQDLKLLLGSVGS 549 Query: 2045 KSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRVTI 2224 KSNKV YVK +L FLS + NLS+SVLWTPATTRVASLY+AAD YV N+QG+GETFGRVTI Sbjct: 550 KSNKVAYVKEMLSFLSNNGNLSNSVLWTPATTRVASLYSAADVYVTNSQGVGETFGRVTI 609 Query: 2225 EAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMGTE 2404 EAMA+GLPVLGTDAGGTKEIVEHNVTGLLHP+GR G +VLA++L +LL NPS R Q+G++ Sbjct: 610 EAMAYGLPVLGTDAGGTKEIVEHNVTGLLHPVGRAGNKVLAQNLLFLLRNPSTRLQLGSQ 669 Query: 2405 GRKKVEKMFLKKDMYKKFGEVLYNCMR 2485 GR+ VEKM++K+ MYK+F +VL CMR Sbjct: 670 GREIVEKMYMKQHMYKRFVDVLVKCMR 696 >ref|XP_006297092.1| hypothetical protein CARUB_v10013095mg [Capsella rubella] gi|482565801|gb|EOA29990.1| hypothetical protein CARUB_v10013095mg [Capsella rubella] Length = 699 Score = 694 bits (1791), Expect = 0.0 Identities = 372/689 (53%), Positives = 465/689 (67%), Gaps = 34/689 (4%) Frame = +2 Query: 521 GALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXXXXXXXXXXXYG 700 G+ KS+LSG+STP+GSP+FRR++SGRTPRR+G+ SG F NR Y Sbjct: 12 GSFKSSLSGRSTPKGSPTFRRVHSGRTPRRDGKGSGGAVQWFRSNRLLYWLLLITLWTYL 71 Query: 701 GFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLAVNNHVD-TRQSD 877 GFY+QSRWAH D+ + F + + ++Q R + ANE S AV ++ + Sbjct: 72 GFYVQSRWAHDDDNKVEFLRFGGKLREDVLHVEQNKRLDSVANESSHAVVDNTNIVHIGV 131 Query: 878 SKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVELQNTQADASAEEI 1057 +KR+ + LAK+ + V ++ +D +E+ Sbjct: 132 NKRMHVTLAKKEDVTSRPSLSSRRRTRKASRSSRTRIRSKQKVRKV-METKDSDDQDQEL 190 Query: 1058 PSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKFVLIFHELSMTGA 1237 P N TYG + GPFG+LEDK+LEWSP++R+GTCDR F RLVWSR+FVL+FHELSMTGA Sbjct: 191 PKTNVTYGKIFGPFGSLEDKVLEWSPQKRSGTCDRKSDFKRLVWSRRFVLLFHELSMTGA 250 Query: 1238 PLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLSFKTAMKSDLIIA 1417 P++MMELA+ELLSCGATV VVLSR+GGL+QEL+RR+IKV+EDK +LSFKTAMK+DL+IA Sbjct: 251 PISMMELASELLSCGATVYAVVLSRRGGLLQELTRRRIKVVEDKGELSFKTAMKADLVIA 310 Query: 1418 GSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKIIFLSEPQSKQWL 1597 GSAVCASWI++Y +H G SQIAWW+MENRREYFDRAK L VK +IFLSE QSKQWL Sbjct: 311 GSAVCASWIDQYMDHHPAGGSQIAWWVMENRREYFDRAKPVLDRVKLLIFLSEVQSKQWL 370 Query: 1598 AWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEKRQLLRSLVRREM 1777 AWCEE+ I+L+S+P ++PLSVNDELAFVAGI SLNTP ++E M +KR LR VR E Sbjct: 371 AWCEEDHIKLRSQPVIVPLSVNDELAFVAGISSSLNTPTLTQEMMRKKRHTLRESVRTEF 430 Query: 1778 GLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQ---------------------- 1891 GLTD DML +SLSSINPGKGQ LLLESA + +E+ Q Sbjct: 431 GLTDTDMLVMSLSSINPGKGQLLLLESAALALERQQEQEQEPVAKTKSSQSKIKNLNGIK 490 Query: 1892 NNSLTKSFKHRRINLPLRRTKASNGII-----------XXXXXXXXXXXXXXXXXXXXXV 2038 ++ S +HR P R+ K ++ I V Sbjct: 491 KEKISLSVRHRLRGSP-RKMKITSPAIENPSVLTATGKRKLLLSGNVTQKQDLKLLLGSV 549 Query: 2039 GSKSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRV 2218 GSKSNKV YVK +L FLS + NLS+SVLWTPATTRVASLY+AAD YV N+QG+GETFGRV Sbjct: 550 GSKSNKVAYVKEMLSFLSNNGNLSNSVLWTPATTRVASLYSAADVYVTNSQGVGETFGRV 609 Query: 2219 TIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMG 2398 TIEAMA+GLPVLGTDAGGTKEIVEHNVTGLLHP+GRPG +VLA++L +LL NPS R Q+G Sbjct: 610 TIEAMAYGLPVLGTDAGGTKEIVEHNVTGLLHPVGRPGNKVLAQNLLFLLRNPSTRLQLG 669 Query: 2399 TEGRKKVEKMFLKKDMYKKFGEVLYNCMR 2485 +GR+KVEKM++K+ MYK+F +VL CMR Sbjct: 670 NQGREKVEKMYMKQHMYKRFVDVLVKCMR 698 >ref|XP_006406901.1| hypothetical protein EUTSA_v10020188mg [Eutrema salsugineum] gi|557108047|gb|ESQ48354.1| hypothetical protein EUTSA_v10020188mg [Eutrema salsugineum] Length = 691 Score = 691 bits (1783), Expect = 0.0 Identities = 372/680 (54%), Positives = 464/680 (68%), Gaps = 25/680 (3%) Frame = +2 Query: 521 GALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXXXXXXXXXXXYG 700 G+ KS LSGKSTPRGSP+FRR++SGRTPRREG+ SG F NR Y Sbjct: 12 GSFKSPLSGKSTPRGSPNFRRVHSGRTPRREGKGSGGAVQWFRSNRLFYWLLLITLWTYL 71 Query: 701 GFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLAVNNHVD-TRQSD 877 GFY+QSRWAH D+ + F + + ++Q R + ANE S +V ++ + Sbjct: 72 GFYVQSRWAHDDDSKVEFLRFGGKLREDVLHVEQNKRLDSVANESSHSVVDNTNIVHIGV 131 Query: 878 SKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVELQNTQADASAEEI 1057 +KR+ + L K+ S ++ V +++ D +E+ Sbjct: 132 NKRMHVTLVKKEDSTSRRSLSSRRRTRKSGRGSRTKTRSKQNVRKV-VESKDLDDQDQEL 190 Query: 1058 PSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKFVLIFHELSMTGA 1237 P N T+ L GPFG+LEDKILEWSP++R+GTCDR F RLVWSR+FVL+FHELSMTGA Sbjct: 191 PKTNVTFSKLFGPFGSLEDKILEWSPQKRSGTCDRKSDFKRLVWSRRFVLLFHELSMTGA 250 Query: 1238 PLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLSFKTAMKSDLIIA 1417 P++MMELA+ELLSCGATV VVLSR+GGL+ EL+RR+IKV+EDK +LSFKTAMK+DL+IA Sbjct: 251 PISMMELASELLSCGATVYAVVLSRRGGLLHELTRRRIKVVEDKGELSFKTAMKADLVIA 310 Query: 1418 GSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKIIFLSEPQSKQWL 1597 GSAVCASWI++Y +H G SQIAWW+MENRREYFDRAK L VK +IFLSE QSKQWL Sbjct: 311 GSAVCASWIDQYMDHFPAGGSQIAWWVMENRREYFDRAKPVLNRVKLLIFLSEIQSKQWL 370 Query: 1598 AWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEKRQLLRSLVRREM 1777 WCEE+ I+L+S+P ++PLSVNDELAFVAGI SLNTP ++E M EKRQ LR VR E+ Sbjct: 371 TWCEEDHIKLRSQPVIVPLSVNDELAFVAGISSSLNTPTLTQEMMKEKRQKLRESVRTEL 430 Query: 1778 GLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQ------GFPQNNSLTKSFKHR----- 1924 GLTD DML +SLSSINPGKGQ LLLESA + +E+ PQ +L K + Sbjct: 431 GLTDRDMLVMSLSSINPGKGQLLLLESAALALEKEQEAESNQPQIKNLNGIRKQKMSLSV 490 Query: 1925 --RINLPLRRTKASNGII-----------XXXXXXXXXXXXXXXXXXXXXVGSKSNKVPY 2065 R+ R+ K ++ ++ VGSKSNKV Y Sbjct: 491 RHRLRGSSRKMKIASPVLDNPSVLSATGKRKLLLSGNVTQKQDFKLLLGSVGSKSNKVAY 550 Query: 2066 VKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRVTIEAMAFGL 2245 VK +L FLS + NLS+SVLWT ATTRVASLY+AAD YV N+QG+GETFGRVTIEAMA+GL Sbjct: 551 VKEMLSFLSNNGNLSNSVLWTLATTRVASLYSAADVYVTNSQGIGETFGRVTIEAMAYGL 610 Query: 2246 PVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMGTEGRKKVEK 2425 PVLGTDAGGTKEIVEHNVTGLLHP+GRPG +VLA++L +LL NPS R Q+G+ GR+KVEK Sbjct: 611 PVLGTDAGGTKEIVEHNVTGLLHPVGRPGNKVLAQNLLFLLRNPSTRLQLGSIGREKVEK 670 Query: 2426 MFLKKDMYKKFGEVLYNCMR 2485 M++K+ MYK+F +VL CMR Sbjct: 671 MYMKQHMYKRFVDVLVKCMR 690 >ref|XP_002885116.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp. lyrata] gi|297330956|gb|EFH61375.1| glycosyl transferase family 1 protein [Arabidopsis lyrata subsp. lyrata] Length = 696 Score = 689 bits (1779), Expect = 0.0 Identities = 374/689 (54%), Positives = 463/689 (67%), Gaps = 34/689 (4%) Frame = +2 Query: 521 GALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXXXXXXXXXXXYG 700 G+ KS+LSGKSTPRGSP+ RR++SGRTPRR+G+ SG F NR Y Sbjct: 12 GSFKSSLSGKSTPRGSPTSRRVHSGRTPRRDGKGSGGAVQWFRSNRLLYWLLLITLWTYL 71 Query: 701 GFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLAVNNHVDTRQ--- 871 GFY+QSRWAH D+ + F + + ++Q R + ANE+S AV VDT Sbjct: 72 GFYVQSRWAHDDDNKVEFLRFGGKLREDVLHVEQNKRLDSVANENSHAV---VDTTNIVH 128 Query: 872 -SDSKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVELQNTQADASA 1048 +KR+ + LAK+ D Q ++ D Sbjct: 129 IGVNKRMHVTLAKK-EDDTSQRSLSSRRRTRKASRSSRTRIRSKQKVRKVMETKDLDEQD 187 Query: 1049 EEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKFVLIFHELSM 1228 +E+P+ N TYG + GPFG+LED++LEWSP++R+GTCDR F RLVWSR+FVL+FHELSM Sbjct: 188 QELPNTNVTYGKIFGPFGSLEDRVLEWSPQKRSGTCDRKSDFKRLVWSRRFVLLFHELSM 247 Query: 1229 TGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLSFKTAMKSDL 1408 TGAP++MMELA+ELLSCGATV VVLSR+GGL+QEL+RR+IKV+EDK +LSFKTAMK+DL Sbjct: 248 TGAPISMMELASELLSCGATVYAVVLSRRGGLLQELTRRRIKVVEDKGELSFKTAMKADL 307 Query: 1409 IIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKIIFLSEPQSK 1588 +IAGSAVCASWI++Y +H G SQIAWW+MENRREYFDRAK L VK +IFLSE QSK Sbjct: 308 VIAGSAVCASWIDQYMDHHPAGGSQIAWWVMENRREYFDRAKPVLDRVKLLIFLSEVQSK 367 Query: 1589 QWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEKRQLLRSLVR 1768 QWL WCEE+ I+L+S+P ++PLSVNDELAFVAGI SLNTP ++E M EKRQ LR VR Sbjct: 368 QWLTWCEEDHIKLRSQPVIVPLSVNDELAFVAGIYSSLNTPTLTQEMMKEKRQKLRESVR 427 Query: 1769 REMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQGFPQ------------------- 1891 E GLTD+DML +SLSSINPGKGQ LLLES + +E+ Q Sbjct: 428 TEFGLTDKDMLVMSLSSINPGKGQLLLLESVALALEREQEQEQVAKSNQQPKIKNLNGIR 487 Query: 1892 NNSLTKSFKHRRINLPLRRTKASNGII-----------XXXXXXXXXXXXXXXXXXXXXV 2038 ++ S KH R+ LR+ K + V Sbjct: 488 KEKISLSVKH-RLRGSLRKMKITTPATDNSSVLSATGKRKLLFSGNVTQKQDLKLLLGSV 546 Query: 2039 GSKSNKVPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRV 2218 GSKSNKV YVK +L FLS + NLS+SVLWTPATTRVASLY+AAD YV N+QG+GETFGRV Sbjct: 547 GSKSNKVAYVKEMLSFLSNNGNLSNSVLWTPATTRVASLYSAADVYVTNSQGIGETFGRV 606 Query: 2219 TIEAMAFGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMG 2398 TIEAMA+GLPVLGTDAGGTKEIVEHNVTGLLHP+GR G +VLA++L +LL NPS R Q+G Sbjct: 607 TIEAMAYGLPVLGTDAGGTKEIVEHNVTGLLHPVGRAGNKVLAQNLLFLLRNPSTRLQLG 666 Query: 2399 TEGRKKVEKMFLKKDMYKKFGEVLYNCMR 2485 ++GR+ VEKM++K+ MYK+F +VL CMR Sbjct: 667 SQGREIVEKMYMKQHMYKRFVDVLVKCMR 695 >ref|XP_006583137.1| PREDICTED: uncharacterized protein LOC100796443 [Glycine max] Length = 693 Score = 679 bits (1753), Expect = 0.0 Identities = 375/683 (54%), Positives = 458/683 (67%), Gaps = 20/683 (2%) Frame = +2 Query: 497 RPSSLRTNGALKSTLSGKSTPRGSPSFRRINSGRTPRREGRSSGIRFYCFGGNRXXXXXX 676 + SS R+ +LK+ LSG+S+P+ PSF+R S TPRRE + C+G NR Sbjct: 18 KQSSSRSGISLKAALSGRSSPQHFPSFQRPYSTLTPRRESKGDA---QCYGSNRLLLWLL 74 Query: 677 XXXXXXYGGFYIQSRWAHGDNKEGIFGSNESQESGEKTELQQKDRRELTANEDSLAVNNH 856 Y GFY+QSRWAH D +E G Q + + Q +L A SL+VN Sbjct: 75 LITLWAYLGFYVQSRWAHDDKEEEFSGFGSRQSDTTNSYVGQNQHLDLIAKNISLSVNIE 134 Query: 857 VDTRQSDSKRVDMILAKRGSSDPKQLXXXXXXXXXXXXXXXXXXXXXXXXEMVELQNTQA 1036 + ++K VD+ LAK+ QL ++E ++ Sbjct: 135 L----VENKTVDVALAKKEYGVLSQLKASSKKRNRRKRSTHALRGTRRRKHILE--SSDI 188 Query: 1037 DASAEEIPSQNSTYGLLVGPFGTLEDKILEWSPERRTGTCDRTGQFARLVWSRKFVLIFH 1216 + EIP +N TYG LVGPFG++ED+IL+WSP+RR TCD+ G+FARLVWSR+FVLIFH Sbjct: 189 EEQEPEIPLRNDTYGFLVGPFGSIEDRILQWSPQRRYETCDKKGEFARLVWSRRFVLIFH 248 Query: 1217 ELSMTGAPLAMMELATELLSCGATVSVVVLSRKGGLMQELSRRKIKVLEDKLDLSFKTAM 1396 ELSMTGAPL+MMELATELLSCGA+VS VVLSRKGGLMQEL+RR+IKVL+DK LSFK A Sbjct: 249 ELSMTGAPLSMMELATELLSCGASVSAVVLSRKGGLMQELARRRIKVLDDKAYLSFKIAN 308 Query: 1397 KSDLIIAGSAVCASWIEKYREHTVLGASQIAWWIMENRREYFDRAKLALTLVKKIIFLSE 1576 K+DL+IAGSAVC SWIE+Y EH GA+Q+AWWIMENRREYFDRAK L V ++FLSE Sbjct: 309 KADLVIAGSAVCTSWIEQYIEHFPAGANQVAWWIMENRREYFDRAKDVLQRVNTLVFLSE 368 Query: 1577 PQSKQWLAWCEEEKIRLKSEPALIPLSVNDELAFVAGIRCSLNTPAFSKEKMLEKRQLLR 1756 QS+QW WC EE I+L S+ AL+PLSVNDELAFVAGI +L P+FS KM E+R+LLR Sbjct: 369 SQSRQWQKWCVEEGIKLSSQLALVPLSVNDELAFVAGIPSTLKVPSFSAAKMDERRKLLR 428 Query: 1757 SLVRREMGLTDEDMLAVSLSSINPGKGQFLLLESARIMIEQG--------FPQNNS---- 1900 +RREMGL D D+L ++LSSIN GKGQ LLLESAR M+E G P+++ Sbjct: 429 DSIRREMGLNDNDILVMTLSSINRGKGQLLLLESARSMVEHGPLQQDDKKIPESSDDGEY 488 Query: 1901 -LTKSFKHRRINLPLRRTKASNGI-------IXXXXXXXXXXXXXXXXXXXXXVGSKSNK 2056 T + +H NL + A N I VGSKSNK Sbjct: 489 LSTLARRHHIRNLLKDNSVALNNISSNFINRTREVLSQNNGTMAQSLKILIGSVGSKSNK 548 Query: 2057 VPYVKSLLEFLSQHSNLSHSVLWTPATTRVASLYAAADAYVMNAQGMGETFGRVTIEAMA 2236 V YVK LL FL++HSNLS SVLWT ATTRVASLY+AAD Y +N+QG+GETFGRVTIEAMA Sbjct: 549 VDYVKGLLSFLARHSNLSKSVLWTSATTRVASLYSAADVYAINSQGLGETFGRVTIEAMA 608 Query: 2237 FGLPVLGTDAGGTKEIVEHNVTGLLHPLGRPGAEVLAKHLKYLLENPSARQQMGTEGRKK 2416 FGLPVLGTDAGGT+EIVEHNVTGLLHP+GR G VLA++L++LLEN AR+QMG EGRKK Sbjct: 609 FGLPVLGTDAGGTQEIVEHNVTGLLHPIGRAGNRVLAQNLRFLLENRLAREQMGMEGRKK 668 Query: 2417 VEKMFLKKDMYKKFGEVLYNCMR 2485 V++MFLK+ MY+K EVL CMR Sbjct: 669 VQRMFLKQHMYEKLVEVLVKCMR 691