BLASTX nr result
ID: Mentha29_contig00011778
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha29_contig00011778 (2414 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EPS63888.1| exostosin-like protein, partial [Genlisea aurea] 574 e-161 gb|EYU25911.1| hypothetical protein MIMGU_mgv1a009897mg [Mimulus... 524 e-146 ref|XP_002512333.1| catalytic, putative [Ricinus communis] gi|22... 499 e-138 ref|XP_006438045.1| hypothetical protein CICLE_v10031600mg [Citr... 498 e-138 ref|XP_006391283.1| hypothetical protein EUTSA_v10018590mg [Eutr... 497 e-138 ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata... 496 e-137 ref|XP_007227458.1| hypothetical protein PRUPE_ppa005995mg [Prun... 495 e-137 ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group] g... 493 e-136 ref|XP_006848395.1| hypothetical protein AMTR_s00013p00218260 [A... 491 e-136 ref|XP_004310070.1| PREDICTED: probable glycosyltransferase At3g... 491 e-136 ref|XP_007044049.1| Exostosin family protein isoform 1 [Theobrom... 491 e-136 ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g... 491 e-136 ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable gly... 489 e-135 ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable gly... 488 e-135 ref|XP_006302174.1| hypothetical protein CARUB_v10020184mg [Caps... 487 e-134 ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana] g... 486 e-134 ref|XP_004971083.1| PREDICTED: probable glycosyltransferase At3g... 484 e-134 ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [S... 479 e-132 gb|EXC06151.1| putative glycosyltransferase [Morus notabilis] 478 e-132 ref|XP_003524893.1| PREDICTED: probable glucuronosyltransferase ... 478 e-132 >gb|EPS63888.1| exostosin-like protein, partial [Genlisea aurea] Length = 386 Score = 574 bits (1480), Expect = e-161 Identities = 271/383 (70%), Positives = 320/383 (83%), Gaps = 2/383 (0%) Frame = +1 Query: 1048 SPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGLRKQHSVEYWMVASLLY 1227 S PLRVYMYDLP RFN+G+MDP+F D T V+A N P+WRWNDGLR+QHSVEYWM+ASL+ Sbjct: 8 SSPLRVYMYDLPARFNLGLMDPSFRDGTRVSAANFPAWRWNDGLRRQHSVEYWMMASLM- 66 Query: 1228 GGGNDDGSAS--TREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDTVDEKLQLEMV 1401 NDD S T EAVRV DP+SAD +VRNM E DT+DE+LQ+E+V Sbjct: 67 ---NDDDSPEEFTPEAVRVWDPNSADVFFVPFFASLSFNLYVRNMTEVDTIDEQLQVEIV 123 Query: 1402 NILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRIMNISTLAKDVVAP 1581 N LR+S YWKRS GRDHVI +HHPNAFRH+R +NASIFIVADFGRIM IS L+KDVVAP Sbjct: 124 NFLRSSKYWKRSQGRDHVIAVHHPNAFRHHRGSVNASIFIVADFGRIMKISRLSKDVVAP 183 Query: 1582 YPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTKDIIYEEAHAS 1761 YPHMVES++++ DPY+SRKTLLFFRGRT RKDEG IR +LHK+L+GT+ +IY+EA+AS Sbjct: 184 YPHMVESYLNDAVDDPYESRKTLLFFRGRTRRKDEGKIRTRLHKLLHGTEGVIYDEAYAS 243 Query: 1762 EEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIELPFESEIDYKEF 1941 EEGF+ STE MR+SKFCLHPAGDTPSSCRLFDAIVSHCVPVI+SDKIELPFESE+DYKEF Sbjct: 244 EEGFRTSTEQMRASKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFESELDYKEF 303 Query: 1942 SIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKKEDAVNMIWKEV 2121 SIFFS E L PGY+V+ELR+VSK++W MW KL+ ++HHFEFQYP KK+DAVNMIW++V Sbjct: 304 SIFFSDEEALTPGYMVSELRKVSKQEWTKMWSKLRSVAHHFEFQYPTKKDDAVNMIWRQV 363 Query: 2122 KHKIPAVKLAVHRSRRLKIADWW 2190 + K+P VKLA+HRSRRLKI DWW Sbjct: 364 RQKVPTVKLAIHRSRRLKIPDWW 386 >gb|EYU25911.1| hypothetical protein MIMGU_mgv1a009897mg [Mimulus guttatus] Length = 328 Score = 524 bits (1349), Expect = e-146 Identities = 252/331 (76%), Positives = 288/331 (87%), Gaps = 2/331 (0%) Frame = +1 Query: 1207 MVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDTVDEKL 1386 M+ASLL+ G +GS TREAVRVTDP SA+ HVRNMAE +TVDEKL Sbjct: 1 MMASLLHEG---NGSGLTREAVRVTDPDSAEVFFVPFFSSLSFNVHVRNMAELNTVDEKL 57 Query: 1387 QLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRIMNISTLAK 1566 QLEM+NIL+ASDYWK+S GRDHVIPMHHPNAFRHYRD++NASIFIVADFGRIMNIS LAK Sbjct: 58 QLEMINILKASDYWKKSGGRDHVIPMHHPNAFRHYRDEVNASIFIVADFGRIMNISKLAK 117 Query: 1567 DVVAPYPHMVESFISEDSP--DPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTKDII 1740 DVVAPYPHMVES+I+E+ DPYKSR+TLL FRGRT RKDEG IRAQLHK+LN TKD+I Sbjct: 118 DVVAPYPHMVESYIAEEENHVDPYKSRQTLLVFRGRTKRKDEGKIRAQLHKMLNDTKDVI 177 Query: 1741 YEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIELPFES 1920 YEE ASEEGFKAS E MRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+SD++ELPFES Sbjct: 178 YEEGAASEEGFKASAEQMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDRLELPFES 237 Query: 1921 EIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKKEDAV 2100 EIDYKEFS+FFSVNE L+PGY++++LR VS+++W+ MW ++K I+HHFEFQYPPK EDAV Sbjct: 238 EIDYKEFSMFFSVNEALQPGYLIDKLRAVSEDQWLKMWSRVKSITHHFEFQYPPKDEDAV 297 Query: 2101 NMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193 NMIW++VKHK+PAVKLAVHRSRRLKI DWWR Sbjct: 298 NMIWRQVKHKVPAVKLAVHRSRRLKIPDWWR 328 >ref|XP_002512333.1| catalytic, putative [Ricinus communis] gi|223548294|gb|EEF49785.1| catalytic, putative [Ricinus communis] Length = 434 Score = 499 bits (1285), Expect = e-138 Identities = 247/412 (59%), Positives = 309/412 (75%), Gaps = 7/412 (1%) Frame = +1 Query: 979 DYKSQFLNFIPQL---TNNAPPCRARSPPLRVYMYDLPPRFNVGMMDP--NFPDATPVTA 1143 D +S F + Q T A A PPL+VYMYDLP RF+VGMMD + + TPVT Sbjct: 27 DMRSYFFPLLQQQQSPTTGARSLCATGPPLKVYMYDLPRRFHVGMMDHGGDAKNDTPVTG 86 Query: 1144 LNIPSWRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXX 1323 N+P+W N GLRKQHSVEYW++ASLLY G ++ REAVRV DP AD Sbjct: 87 ENLPTWPKNSGLRKQHSVEYWLMASLLYEGADE------REAVRVLDPEKADAFFVPFFS 140 Query: 1324 XXXXXXHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQ 1500 H M + +T +D +LQ++++++L S YW++S GRDHVIPM HPNAFR R Q Sbjct: 141 SLSFNTHGHTMTDPETEIDRQLQVDVIDMLYKSKYWQKSGGRDHVIPMTHPNAFRFLRQQ 200 Query: 1501 INASIFIVADFGRI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVR 1677 +NASI IVADFGR ++STL+KDVVAPY H+V+SF ++ +P++SR TLLFFRG T+R Sbjct: 201 LNASILIVADFGRYPKSMSTLSKDVVAPYVHVVDSFTDDEVSNPFESRTTLLFFRGNTIR 260 Query: 1678 KDEGIIRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFD 1857 KDEG +RA+L KIL G DI +E + A+ E KASTE MRSSKFCLHPAGDTPSSCRLFD Sbjct: 261 KDEGKVRAKLAKILTGYDDIHFERSSATAETIKASTEGMRSSKFCLHPAGDTPSSCRLFD 320 Query: 1858 AIVSHCVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWL 2037 AIVSHCVPVI+SD+IELP+E EIDY +FS+FFSVNE ++PGY+V++LR++ KE+W+ MW Sbjct: 321 AIVSHCVPVIVSDQIELPYEDEIDYSQFSVFFSVNEAIQPGYMVDQLRQLPKERWLEMWR 380 Query: 2038 KLKEISHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193 KLK ISHHFEFQYPP+KEDAV+M+W+EVKHK+P +LAVHRSRRLKI DWW+ Sbjct: 381 KLKSISHHFEFQYPPEKEDAVDMLWREVKHKLPGAQLAVHRSRRLKIQDWWQ 432 >ref|XP_006438045.1| hypothetical protein CICLE_v10031600mg [Citrus clementina] gi|567891051|ref|XP_006438046.1| hypothetical protein CICLE_v10031600mg [Citrus clementina] gi|568861185|ref|XP_006484086.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X1 [Citrus sinensis] gi|568861187|ref|XP_006484087.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X2 [Citrus sinensis] gi|568861189|ref|XP_006484088.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X3 [Citrus sinensis] gi|557540241|gb|ESR51285.1| hypothetical protein CICLE_v10031600mg [Citrus clementina] gi|557540242|gb|ESR51286.1| hypothetical protein CICLE_v10031600mg [Citrus clementina] Length = 431 Score = 498 bits (1283), Expect = e-138 Identities = 241/399 (60%), Positives = 301/399 (75%), Gaps = 2/399 (0%) Frame = +1 Query: 1000 NFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGL 1179 +F P L + A C A PLRVYMYDLP RF+VGM+D + PD PVT+ N+P W + G+ Sbjct: 39 HFFPLLQSTAQSCSA---PLRVYMYDLPRRFHVGMLDHSSPDGLPVTSENLPRWPRSSGI 95 Query: 1180 RKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMA 1359 ++QHSVEYW++ASLLY DG + REAVRV+DP +A H NM Sbjct: 96 KRQHSVEYWLMASLLY-----DGESEEREAVRVSDPDTAQAFFVPFFSSLSFNTHGHNMT 150 Query: 1360 ETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFG 1536 + DT D +LQ+E++ LR S YW++S GRDHVIPM HPNAFR R Q+NASI IVADFG Sbjct: 151 DPDTEFDRQLQIEILEFLRNSKYWQKSGGRDHVIPMTHPNAFRFLRQQLNASILIVADFG 210 Query: 1537 RI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHK 1713 R ++S L+KDVVAPY H+VESF ++ PDP+ +RKTLLFF+G T+RKDEG +RA+L K Sbjct: 211 RYPRSMSNLSKDVVAPYVHVVESFTDDNPPDPFVARKTLLFFQGNTIRKDEGKVRAKLAK 270 Query: 1714 ILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIIS 1893 IL G D+ YE + + + K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVI+S Sbjct: 271 ILTGYDDVHYERSAPTTKSIKESTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVS 330 Query: 1894 DKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQ 2073 D+IELPFE EIDY EFS+FFS+ E +PGY++++LR++ K +WI MW +LK ISH++EFQ Sbjct: 331 DRIELPFEDEIDYSEFSVFFSIKEAGQPGYMIDQLRQIPKARWIEMWQRLKSISHYYEFQ 390 Query: 2074 YPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190 YPPKKEDAVNM+W++VK+KIP V+LAVHR RRLKI DWW Sbjct: 391 YPPKKEDAVNMVWRQVKNKIPGVQLAVHRHRRLKIPDWW 429 >ref|XP_006391283.1| hypothetical protein EUTSA_v10018590mg [Eutrema salsugineum] gi|557087717|gb|ESQ28569.1| hypothetical protein EUTSA_v10018590mg [Eutrema salsugineum] Length = 432 Score = 497 bits (1280), Expect = e-138 Identities = 238/393 (60%), Positives = 290/393 (73%), Gaps = 5/393 (1%) Frame = +1 Query: 1027 APPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGLRKQHSVEYW 1206 A PC PLRV+MYDLP +FNV MMDP D P+T N+PSW G+++QHSVEYW Sbjct: 42 ASPCSITGRPLRVFMYDLPRKFNVAMMDPQSSDVEPLTGKNLPSWPQTSGIKRQHSVEYW 101 Query: 1207 MVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VDEK 1383 ++ASLL+GGG G +EA RV DP AD H +NM + DT D + Sbjct: 102 LMASLLHGGG---GGEEEKEAFRVFDPELADAFYVPFFSSLSFNTHGKNMTDPDTEFDRQ 158 Query: 1384 LQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRI-MNISTL 1560 LQ+E++ L S YW+RS GRDHVIPM HPNAFR R Q+NASI +V DFGR ++ L Sbjct: 159 LQVELMEYLENSKYWQRSGGRDHVIPMTHPNAFRFLRQQVNASILVVVDFGRYPREMARL 218 Query: 1561 AKDVVAPYPHMVESFISE---DSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTK 1731 KDVV+PY H+VESF + D+PDP+++R TLL+FRG TVRK EG IR +L K+L G Sbjct: 219 GKDVVSPYVHVVESFTEDGGVDTPDPFEARTTLLYFRGNTVRKAEGKIRLRLEKLLAGNS 278 Query: 1732 DIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIELP 1911 D+ YE++ A+ + K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVIISD+IELP Sbjct: 279 DVHYEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDRIELP 338 Query: 1912 FESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKKE 2091 FE EIDY EFS+FFS+ E L PGYI+N LR+ KEKW+ MW LK +SHHFEFQYPPK+E Sbjct: 339 FEDEIDYSEFSVFFSIKEALEPGYILNNLRQFPKEKWLQMWENLKNVSHHFEFQYPPKRE 398 Query: 2092 DAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190 DAVNM+W++VKHKIP+VKLAVHR+RRLK+ DWW Sbjct: 399 DAVNMLWRQVKHKIPSVKLAVHRNRRLKVPDWW 431 >ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata] gi|297334437|gb|EFH64855.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata] Length = 429 Score = 496 bits (1277), Expect = e-137 Identities = 238/395 (60%), Positives = 295/395 (74%), Gaps = 5/395 (1%) Frame = +1 Query: 1021 NNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGLRKQHSVE 1200 N A PC + PLRV+MYDLP +FNV MMDP+ D P+T N+PSW G+++QHSVE Sbjct: 40 NVASPCSSTGKPLRVFMYDLPRKFNVAMMDPHSSDVEPLTGKNLPSWPQTSGIKRQHSVE 99 Query: 1201 YWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VD 1377 YW++ASLL GG +D+ EA+RV DP AD H +NM + DT D Sbjct: 100 YWLMASLLNGGDDDN------EAIRVFDPDLADAFYVPFFSSLSFNTHGKNMTDPDTEFD 153 Query: 1378 EKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRIM-NIS 1554 +LQ+E++ L S+YW RS G+DHVIPM HPNAFR R Q+NASI IV DFGR +++ Sbjct: 154 RQLQVELMEFLEGSEYWNRSGGKDHVIPMTHPNAFRFLRQQVNASILIVVDFGRYAKDMA 213 Query: 1555 TLAKDVVAPYPHMVESFISEDSP---DPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNG 1725 L+KDVV+PY H+VES ED DP+++R TLL+FRG TVRKDEG IR +L K+L G Sbjct: 214 RLSKDVVSPYVHVVESLNEEDDDGLTDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAG 273 Query: 1726 TKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIE 1905 D+ +E++ A+ + K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVIISDKIE Sbjct: 274 NSDVHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIE 333 Query: 1906 LPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPK 2085 LPFE EIDY EFS+FFS+ E+L PGYI+N+LR+ KEKW+ MW +LK +SHHFEFQYPPK Sbjct: 334 LPFEDEIDYSEFSLFFSIKESLEPGYILNKLRQFPKEKWLEMWKRLKNVSHHFEFQYPPK 393 Query: 2086 KEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190 +EDAVNM+W++VKHKIP VKLAVHR+RRLK+ DWW Sbjct: 394 REDAVNMLWRQVKHKIPNVKLAVHRNRRLKVPDWW 428 >ref|XP_007227458.1| hypothetical protein PRUPE_ppa005995mg [Prunus persica] gi|462424394|gb|EMJ28657.1| hypothetical protein PRUPE_ppa005995mg [Prunus persica] Length = 433 Score = 495 bits (1274), Expect = e-137 Identities = 241/407 (59%), Positives = 300/407 (73%), Gaps = 2/407 (0%) Frame = +1 Query: 979 DYKSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPS 1158 D +S FL +P A P RA PPL+VYMYDLP RFNVGM++ + PVTA P+ Sbjct: 28 DIRSYFLPLLPSPPPGAQPPRATGPPLKVYMYDLPRRFNVGMLNRKSTEQAPVTARTWPT 87 Query: 1159 WRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXX 1338 W N GL++QHSVEYWM+ SLL+ G DG R AVRV+DP AD Sbjct: 88 WPRNSGLKRQHSVEYWMMGSLLFDGDGGDG----RAAVRVSDPELADAFFVPFFSSLSFN 143 Query: 1339 XHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASI 1515 H +M + T +D +LQ++++ IL S YW+RS GRDHVIP+ HPNAFR R QINASI Sbjct: 144 THGHHMTDPATEIDHQLQIDVLKILGESKYWQRSGGRDHVIPLTHPNAFRFLRPQINASI 203 Query: 1516 FIVADFGRIMNI-STLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGI 1692 IV DFGR ++ S L+KDVV+PY H+V+SF ++ +PY+SR TLLFF+GRT RKDEGI Sbjct: 204 QIVVDFGRYPHVMSNLSKDVVSPYVHVVDSFTDDNHSNPYESRTTLLFFQGRTFRKDEGI 263 Query: 1693 IRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSH 1872 +R +L KIL G D+ YE + A+ + KAS++ MRSSKFCLHPAGDTPSSCRLFDAIVSH Sbjct: 264 VRVKLAKILAGYDDVHYERSVATGDNIKASSQRMRSSKFCLHPAGDTPSSCRLFDAIVSH 323 Query: 1873 CVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEI 2052 CVPVI+SD+IELPFE EIDY +FS+FFS E L PGY+V++LR+ K++WI MW +L I Sbjct: 324 CVPVIVSDEIELPFEDEIDYTKFSLFFSFKEALEPGYMVDQLRKFPKDRWIEMWRQLNSI 383 Query: 2053 SHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193 SHHFEF YPP+KEDAVNM+W++VKHK+PAVKLA+HR+RRLKI DWWR Sbjct: 384 SHHFEFHYPPEKEDAVNMLWRQVKHKLPAVKLAIHRNRRLKIPDWWR 430 >ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group] gi|19386797|dbj|BAB86176.1| OJ1485_B09.5 [Oryza sativa Japonica Group] gi|57899432|dbj|BAD88370.1| exostosin-like [Oryza sativa Japonica Group] gi|113534757|dbj|BAF07140.1| Os01g0921300 [Oryza sativa Japonica Group] gi|125573139|gb|EAZ14654.1| hypothetical protein OsJ_04578 [Oryza sativa Japonica Group] gi|215741014|dbj|BAG97509.1| unnamed protein product [Oryza sativa Japonica Group] gi|215767487|dbj|BAG99715.1| unnamed protein product [Oryza sativa Japonica Group] Length = 437 Score = 493 bits (1268), Expect = e-136 Identities = 243/391 (62%), Positives = 296/391 (75%), Gaps = 5/391 (1%) Frame = +1 Query: 1033 PCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWN-DGLRKQHSVEYWM 1209 P A +PPLRV+MYDLP RF+VGMMD +A P+W + G+R+QHSVEYWM Sbjct: 54 PAAAAAPPLRVFMYDLPRRFHVGMMD--------ASASGFPAWPPSAGGIRRQHSVEYWM 105 Query: 1210 VASLLYGGGNDDGSAST--REAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VDE 1380 +ASL GGG +GS+S REAVRVTDP +A+ H RNM + +T D Sbjct: 106 MASLQGGGGGGNGSSSEEGREAVRVTDPDAAEAFFVPFFSSLSFNVHGRNMTDPETEADR 165 Query: 1381 KLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRIMN-IST 1557 LQ+E++ IL S YW+RS GRDHVIPMHHPNAFR RD +NASI IVADFGR +++ Sbjct: 166 LLQVELMEILWKSKYWQRSAGRDHVIPMHHPNAFRFLRDMVNASILIVADFGRYTKELAS 225 Query: 1558 LAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTKDI 1737 L KDVVAPY H+V+SF+++D PDP+ R TLLFFRGRTVRKDEG IRA+L KIL G + Sbjct: 226 LRKDVVAPYVHVVDSFLNDDPPDPFDDRPTLLFFRGRTVRKDEGKIRAKLAKILKGKDGV 285 Query: 1738 IYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIELPFE 1917 +E++ A+ EG K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVI+S +IELPFE Sbjct: 286 RFEDSLATGEGIKTSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFE 345 Query: 1918 SEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKKEDA 2097 EIDY EFS+FFSV E LRP Y++N+LR++ K KW+ +W KLK +SHH+EFQ PP+K DA Sbjct: 346 DEIDYSEFSLFFSVEEALRPDYLLNQLRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGDA 405 Query: 2098 VNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190 VNMIW++VKHK+PAV LA+HR+RRLKI DWW Sbjct: 406 VNMIWRQVKHKVPAVNLAIHRNRRLKIPDWW 436 >ref|XP_006848395.1| hypothetical protein AMTR_s00013p00218260 [Amborella trichopoda] gi|548851701|gb|ERN09976.1| hypothetical protein AMTR_s00013p00218260 [Amborella trichopoda] Length = 422 Score = 491 bits (1265), Expect = e-136 Identities = 242/406 (59%), Positives = 305/406 (75%), Gaps = 2/406 (0%) Frame = +1 Query: 979 DYKSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPS 1158 D +SQF F P + AP + PL++YMY+LP FN+GM+ + P IP Sbjct: 26 DLRSQF--FAPTII--APS----NSPLKIYMYNLPRHFNIGMLRRSDPHQDLPFTGQIPP 77 Query: 1159 WRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXX 1338 W N GL+KQHSVEYWM+ASLLY +DG EA+RV+DP AD Sbjct: 78 WPQNSGLKKQHSVEYWMMASLLY----EDGEGRDMEAIRVSDPEEADAFFVPFFSSLSFN 133 Query: 1339 XHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASI 1515 H NM + +T VD +LQ+E++ LR S +W++S GRDHVIPMHHPNAFR R+++NASI Sbjct: 134 THGHNMTDPETEVDRQLQIELLEFLRISKFWEQSGGRDHVIPMHHPNAFRFLREKVNASI 193 Query: 1516 FIVADFGRI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGI 1692 +VADFGR NIS+L+KDVVAPY H+ +SFI +DS DP++SR TLLFFRGRTVRK EGI Sbjct: 194 LVVADFGRCPKNISSLSKDVVAPYVHVGDSFIDDDSSDPFESRTTLLFFRGRTVRKAEGI 253 Query: 1693 IRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSH 1872 +R++L KIL G + + +EE+ A+ E KAS+ MRSSKFCL+PAGDTPSSCRLFDAIVSH Sbjct: 254 VRSKLAKILRGQEGVHFEESVATGESIKASSLGMRSSKFCLNPAGDTPSSCRLFDAIVSH 313 Query: 1873 CVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEI 2052 C+PVI+SD+IELP+E EIDY+ FS+FFSV E LRPGY++ ELR++ +EKW+ MW +LKEI Sbjct: 314 CIPVIVSDRIELPYEDEIDYRTFSLFFSVEEALRPGYMLKELRQIKREKWVEMWRRLKEI 373 Query: 2053 SHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190 SHHFEFQ+PPK++DAVNMIWK+V+HK+PA KLAVHRSRRLKI DWW Sbjct: 374 SHHFEFQFPPKRDDAVNMIWKQVRHKLPAAKLAVHRSRRLKIPDWW 419 >ref|XP_004310070.1| PREDICTED: probable glycosyltransferase At3g07620-like [Fragaria vesca subsp. vesca] Length = 446 Score = 491 bits (1265), Expect = e-136 Identities = 245/402 (60%), Positives = 299/402 (74%), Gaps = 5/402 (1%) Frame = +1 Query: 1003 FIPQLTNN--APPCR-ARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWND 1173 FIP L ++ AP A PPL+V+MYDLP RFNVGM++ + PVTA P W N Sbjct: 46 FIPLLKSSPLAPQSLCATGPPLKVFMYDLPRRFNVGMLNRKSAEEAPVTAREWPPWPRNS 105 Query: 1174 GLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRN 1353 GL+KQHSVEYWM+ S+L+ G +GS E VRV+DP AD H N Sbjct: 106 GLKKQHSVEYWMMGSVLWEGNGGEGS----EVVRVSDPEVADAFFVPFFSSLSFNTHGHN 161 Query: 1354 MAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVAD 1530 M + +T VD +LQ+++V +L S YW RS GRDHVIPM HPNAFR R QINASI IV D Sbjct: 162 MNDPETEVDHQLQIDLVKLLHESKYWNRSGGRDHVIPMTHPNAFRFLRPQINASIQIVVD 221 Query: 1531 FGRIMNI-STLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQL 1707 FGR ++ S L+KDVV PY H+VESF ++S DPY+SR TLLFF+GRT RKDEGI+RA+L Sbjct: 222 FGRYPHVMSNLSKDVVTPYVHVVESFTDDNSSDPYESRTTLLFFQGRTHRKDEGIVRAKL 281 Query: 1708 HKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVI 1887 K+L G D+ YE + A+ E K ST+ MR+SKFCLHPAGDTPSSCRLFDAIVSHC+PVI Sbjct: 282 AKVLAGYDDVHYERSVATGENIKLSTQRMRASKFCLHPAGDTPSSCRLFDAIVSHCIPVI 341 Query: 1888 ISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFE 2067 +SD+IELPFE E+DY +FS+FFS E L+PGY+VNELR++SKEKW+ M+ LK ISHHFE Sbjct: 342 VSDEIELPFEDELDYNQFSVFFSFKEALQPGYMVNELRKLSKEKWMEMYRHLKSISHHFE 401 Query: 2068 FQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193 F YPP+KEDAVNM+W++VK K+PAVKLAVHRS+RLKI DWWR Sbjct: 402 FHYPPEKEDAVNMLWRQVKRKVPAVKLAVHRSQRLKIPDWWR 443 >ref|XP_007044049.1| Exostosin family protein isoform 1 [Theobroma cacao] gi|590692416|ref|XP_007044050.1| Exostosin family protein isoform 1 [Theobroma cacao] gi|590692424|ref|XP_007044051.1| Exostosin family protein isoform 1 [Theobroma cacao] gi|508707984|gb|EOX99880.1| Exostosin family protein isoform 1 [Theobroma cacao] gi|508707985|gb|EOX99881.1| Exostosin family protein isoform 1 [Theobroma cacao] gi|508707986|gb|EOX99882.1| Exostosin family protein isoform 1 [Theobroma cacao] Length = 432 Score = 491 bits (1264), Expect = e-136 Identities = 241/387 (62%), Positives = 293/387 (75%), Gaps = 3/387 (0%) Frame = +1 Query: 1042 ARSPPLRVYMYDLPPRFNVGMMDP-NFPDATPVTALNIPSWRWNDGLRKQHSVEYWMVAS 1218 A PLRVYMYDLP +F+VGM+D + +A PVT N+P W N G+++QHSVEYW++AS Sbjct: 47 ATGRPLRVYMYDLPRKFHVGMLDRRSSEEAAPVTMENLPPWPSNSGIKRQHSVEYWLMAS 106 Query: 1219 LLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VDEKLQLE 1395 LLY G ++DG REAVRV DP AD H NM + +T +D LQ+E Sbjct: 107 LLYDGQDEDG----REAVRVLDPEKADAFFVPFFSSLSFNTHGHNMTDPETEIDRHLQVE 162 Query: 1396 MVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRI-MNISTLAKDV 1572 ++ L+ S Y++RS GRDHVIPM HPNAFR R+Q+NASI IV DFGR +S+L+KDV Sbjct: 163 LLEFLQQSKYYQRSGGRDHVIPMTHPNAFRFLREQLNASILIVVDFGRYPKTMSSLSKDV 222 Query: 1573 VAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTKDIIYEEA 1752 VAPY H+V+SF +D DPY+SR TLLFFRG TVRKDEG IR +L KIL G+ D+ YE++ Sbjct: 223 VAPYVHVVDSFTDDDPLDPYESRTTLLFFRGNTVRKDEGKIRVKLAKILAGSDDVHYEKS 282 Query: 1753 HASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIELPFESEIDY 1932 A+ + K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVI+SDKIELP+E EIDY Sbjct: 283 VATPKNIKMSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPYEDEIDY 342 Query: 1933 KEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKKEDAVNMIW 2112 EFSIFFS+ E L PGY+VN LR+ K +W+ MW LK IS H+EFQYPPKKEDAVNM+W Sbjct: 343 TEFSIFFSMKEALEPGYLVNHLRQFPKNRWVQMWKLLKNISRHYEFQYPPKKEDAVNMLW 402 Query: 2113 KEVKHKIPAVKLAVHRSRRLKIADWWR 2193 ++VKHK+P V+LAVHRSRRLK+ DWWR Sbjct: 403 RQVKHKLPGVQLAVHRSRRLKVPDWWR 429 >ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis sativus] Length = 429 Score = 491 bits (1264), Expect = e-136 Identities = 242/407 (59%), Positives = 298/407 (73%), Gaps = 2/407 (0%) Frame = +1 Query: 979 DYKSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPS 1158 D +S F + + PC PPLRVYMYDLP RFNVG+++ D TPVTA P Sbjct: 27 DIRSYFFPLLQSQPISPFPCTT-DPPLRVYMYDLPRRFNVGILNRRNLDQTPVTASTWPP 85 Query: 1159 WRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXX 1338 W N GL++QHSVEYWM+ SLL+ D R+AVRV DP +AD Sbjct: 86 WPRNSGLKRQHSVEYWMMGSLLHEATGDG-----RDAVRVMDPENADAFFVPFFSSLSFN 140 Query: 1339 XHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASI 1515 H RNM + T VD +LQ++++ L S YW+RS GRDHVIPM HPNAFR R+Q+NASI Sbjct: 141 SHGRNMTDPATEVDHQLQIDLMKFLSESKYWQRSKGRDHVIPMTHPNAFRFLRNQVNASI 200 Query: 1516 FIVADFGRI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGI 1692 IV DFGR +S L KDVVAPY H+V SFI ++ PDP++SR TLLFF+G+T RKD+GI Sbjct: 201 QIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKDDGI 260 Query: 1693 IRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSH 1872 IR +L KIL+G D+ YE + A+E+ K S++ MRSSKFCLHPAGDTPSSCRLFDAIVSH Sbjct: 261 IRVKLAKILDGYDDVHYERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSH 320 Query: 1873 CVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEI 2052 CVPVI+SD+IELP+E EIDY +F++FFS E L+PGY+V +LR KE+WI MW +LKEI Sbjct: 321 CVPVIVSDQIELPYEDEIDYSQFTLFFSFEEALQPGYMVEKLREFPKERWIEMWKQLKEI 380 Query: 2053 SHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193 S H+EFQYPPKKEDAVNM+W++VKHK+PAVKLAVHRSRRLK+ DWW+ Sbjct: 381 SRHYEFQYPPKKEDAVNMLWRQVKHKLPAVKLAVHRSRRLKVPDWWQ 427 >ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase At3g07620-like [Cucumis sativus] Length = 429 Score = 489 bits (1259), Expect = e-135 Identities = 241/407 (59%), Positives = 297/407 (72%), Gaps = 2/407 (0%) Frame = +1 Query: 979 DYKSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPS 1158 D +S F + + PC PPLRVYMYDLP RFNVG+++ D TPVTA P Sbjct: 27 DIRSYFFPLLQSQPISPFPCTT-DPPLRVYMYDLPRRFNVGILNRRNLDQTPVTASTWPP 85 Query: 1159 WRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXX 1338 W N GL++QHSVEYWM+ SLL+ D R+AVRV DP +AD Sbjct: 86 WPRNSGLKRQHSVEYWMMGSLLHEATGDG-----RDAVRVMDPENADAFFVPFFSSLSFN 140 Query: 1339 XHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASI 1515 H RNM + T VD +LQ++++ L S YW+RS GRDHVIPM HPNAFR R+Q+NASI Sbjct: 141 SHGRNMTDPATEVDHQLQIDLMKFLSESKYWQRSKGRDHVIPMTHPNAFRFLRNQVNASI 200 Query: 1516 FIVADFGRI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGI 1692 IV DFGR +S L KDVVAPY H+V SFI ++ PDP++SR TLLFF+G+T RKD+GI Sbjct: 201 QIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKDDGI 260 Query: 1693 IRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSH 1872 IR +L KIL+G D+ YE + A+E+ K S++ MRSSKFCLHPAGDTPSSCRLFDAIVSH Sbjct: 261 IRVKLAKILDGYDDVHYERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSH 320 Query: 1873 CVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEI 2052 CVPVI+SD+IELP+E EIDY +F++FF E L+PGY+V +LR KE+WI MW +LKEI Sbjct: 321 CVPVIVSDQIELPYEDEIDYSQFTLFFXFEEALQPGYMVEKLREFPKERWIEMWKQLKEI 380 Query: 2053 SHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193 S H+EFQYPPKKEDAVNM+W++VKHK+PAVKLAVHRSRRLK+ DWW+ Sbjct: 381 SRHYEFQYPPKKEDAVNMLWRQVKHKLPAVKLAVHRSRRLKVPDWWQ 427 >ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase At5g25310-like [Vitis vinifera] Length = 437 Score = 488 bits (1257), Expect = e-135 Identities = 241/394 (61%), Positives = 289/394 (73%), Gaps = 8/394 (2%) Frame = +1 Query: 1033 PCRARSPPLRVYMYDLPPRFNVGMMDPNFP-DATPVTALNIPSWRWNDGLRKQHSVEYWM 1209 PC PL VYMYDLP RF+VGM+ P D +PVTA N+P W N GL+KQHSVEYWM Sbjct: 45 PCSTGGGPLMVYMYDLPRRFHVGMLRRRSPADESPVTAENLPPWPSNSGLKKQHSVEYWM 104 Query: 1210 VASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VDEKL 1386 +ASLLY GG G TREAVRV DP AD H NM + DT D +L Sbjct: 105 MASLLYDGG---GGNETREAVRVWDPEMADAFFVPFFSSLSFNTHGHNMTDPDTEFDRQL 161 Query: 1387 QLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRI-MNISTLA 1563 Q++++ ILR S YW+RS GRDHVIPMHHPNAFR +R+Q+N SI IVADFGR IS L Sbjct: 162 QIDILKILRESKYWQRSGGRDHVIPMHHPNAFRFFREQVNTSILIVADFGRYPKEISNLR 221 Query: 1564 KDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTKDIIY 1743 KDVVAPY H+V+SF ++SPDPY+SR TLLFFRGRT+RKDEGI+R +L K+L G D Y Sbjct: 222 KDVVAPYVHVVDSFTDDNSPDPYESRTTLLFFRGRTIRKDEGIVRDKLVKLLAGXDD--Y 279 Query: 1744 EEAHASEEGFKA-----STEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIEL 1908 + H + + ST+ MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVI+SD+IEL Sbjct: 280 LQLHFHHRSYLSFLVXQSTQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIEL 339 Query: 1909 PFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKK 2088 P+E EIDY +FSIFFS E L PGY++ +LR++ KE+W+ MW LK ISHH+EFQYPPKK Sbjct: 340 PYEDEIDYTQFSIFFSDKEALEPGYMIEQLRQIPKERWVEMWRHLKYISHHYEFQYPPKK 399 Query: 2089 EDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190 DA++M+W++VKHK+P L VHRSRRLK+ DWW Sbjct: 400 GDAIDMLWRQVKHKLPRANLDVHRSRRLKVPDWW 433 >ref|XP_006302174.1| hypothetical protein CARUB_v10020184mg [Capsella rubella] gi|482570884|gb|EOA35072.1| hypothetical protein CARUB_v10020184mg [Capsella rubella] Length = 494 Score = 487 bits (1253), Expect = e-134 Identities = 237/395 (60%), Positives = 292/395 (73%), Gaps = 7/395 (1%) Frame = +1 Query: 1027 APPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGLRKQHSVEYW 1206 A PC + PLRV+MYDLP +FNV MMDP D P+T N+PSW G+++QHSVEYW Sbjct: 104 ASPCSSNGRPLRVFMYDLPRKFNVAMMDPRSSDVEPLTGKNLPSWPQTSGIKRQHSVEYW 163 Query: 1207 MVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VDEK 1383 ++ASLL GG DG EA+RV DP AD H +NM + DT D K Sbjct: 164 LMASLLQRGG--DGGDD--EAIRVFDPDLADAFYVPFFSSLSFNTHGKNMTDPDTEFDRK 219 Query: 1384 LQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRI-MNISTL 1560 LQ+E++ L S+YWKRS G+DHVIPM HPNAFR R Q+NASI IV DFGR +++ L Sbjct: 220 LQVELMEFLENSEYWKRSGGKDHVIPMTHPNAFRFLRQQVNASILIVVDFGRYPKDMARL 279 Query: 1561 AKDVVAPYPHMVESFISEDSPD-----PYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNG 1725 +KDVV+PY H+VE+ ++ED D P+++R TLL+FRG T RKDEG IR +L K+L Sbjct: 280 SKDVVSPYVHVVET-LTEDGDDDGMTDPFEARTTLLYFRGNTARKDEGKIRLRLEKLLAN 338 Query: 1726 TKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIE 1905 D+ YE++ A+ + K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVIISDKIE Sbjct: 339 NSDVHYEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIE 398 Query: 1906 LPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPK 2085 LPFE EIDY EFS+FFS+ E+L PGYI+N LR+ K+KW+ MW +LK +SHHFEFQYPPK Sbjct: 399 LPFEDEIDYSEFSVFFSIKESLEPGYILNNLRQFPKDKWLEMWKRLKNVSHHFEFQYPPK 458 Query: 2086 KEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190 +EDAVNM+W++VKHKIP VKLAVHR+RRLK+ DWW Sbjct: 459 REDAVNMLWRQVKHKIPNVKLAVHRNRRLKVPDWW 493 >ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana] gi|115311405|gb|ABI93883.1| At1g67410 [Arabidopsis thaliana] gi|332196520|gb|AEE34641.1| exostosin-like protein [Arabidopsis thaliana] gi|591402328|gb|AHL38891.1| glycosyltransferase, partial [Arabidopsis thaliana] Length = 430 Score = 486 bits (1252), Expect = e-134 Identities = 236/409 (57%), Positives = 294/409 (71%), Gaps = 5/409 (1%) Frame = +1 Query: 979 DYKSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPS 1158 D + F Q + PC + PLRV+MYDLP +FN+ MMDP+ D P+T N+PS Sbjct: 27 DPRPYFYLLQSQPNGASSPCSSSGKPLRVFMYDLPRKFNIAMMDPHSSDVEPITGKNLPS 86 Query: 1159 WRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXX 1338 W G+++QHSVEYW++ASLL GG +++ EA+RV DP AD Sbjct: 87 WPQTSGIKRQHSVEYWLMASLLNGGEDEN------EAIRVFDPDLADVFYVPFFSSLSFN 140 Query: 1339 XHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASI 1515 H +NM + DT D LQ+E++ L S YW RS G+DHVIPM HPNAFR R Q+NASI Sbjct: 141 THGKNMTDPDTEFDRLLQVELMEFLENSKYWNRSGGKDHVIPMTHPNAFRFLRQQVNASI 200 Query: 1516 FIVADFGRIM-NISTLAKDVVAPYPHMVESFISEDSP---DPYKSRKTLLFFRGRTVRKD 1683 IV DFGR +++ L+KDVV+PY H+VES E DP+++R TLL+FRG TVRKD Sbjct: 201 LIVVDFGRYSKDMARLSKDVVSPYVHVVESLNEEGDDGMGDPFEARTTLLYFRGNTVRKD 260 Query: 1684 EGIIRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAI 1863 EG IR +L K+L G D+ +E++ A+ + K STE MRSSKFCLHPAGDTPSSCRLFDAI Sbjct: 261 EGKIRLRLEKLLAGNSDVHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAI 320 Query: 1864 VSHCVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKL 2043 VSHC+PVIISDKIELPFE EIDY EFS+FFS+ E+L PGYI+N LR+ KEKW+ MW +L Sbjct: 321 VSHCIPVIISDKIELPFEDEIDYSEFSLFFSIKESLEPGYILNNLRQFPKEKWLEMWKRL 380 Query: 2044 KEISHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190 K +SHHFEFQYPPK+EDAVNM+W++VKHKIP VKLAVHR+RRLK+ DWW Sbjct: 381 KNVSHHFEFQYPPKREDAVNMLWRQVKHKIPYVKLAVHRNRRLKVPDWW 429 >ref|XP_004971083.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X1 [Setaria italica] Length = 445 Score = 484 bits (1246), Expect = e-134 Identities = 244/409 (59%), Positives = 301/409 (73%), Gaps = 7/409 (1%) Frame = +1 Query: 985 KSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWR 1164 ++ LN P + PP A +PPLRV+MYDLPPRF+V MM DA+ TA P+W Sbjct: 39 RAALLNLKP-FSARCPPSAA-APPLRVFMYDLPPRFHVAMMT-GAADASNATAGPFPAWP 95 Query: 1165 WN-DGLRKQHSVEYWMVASLLYGGGNDDGSAST----REAVRVTDPHSADXXXXXXXXXX 1329 + G+++QHSVEYWM+ASL GGG G REAVRV DP A+ Sbjct: 96 PSAGGIKRQHSVEYWMMASLQDGGGGGGGGGGVGSERREAVRVRDPDDAEAFFVPFFSSL 155 Query: 1330 XXXXHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQIN 1506 H RNM + DT D LQ+E+++IL S YW+RS GRDHVIPMHHPNAFR R+ +N Sbjct: 156 SFNVHGRNMTDPDTEADRLLQVELMDILWKSKYWQRSAGRDHVIPMHHPNAFRFLRNMVN 215 Query: 1507 ASIFIVADFGRIMN-ISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKD 1683 ASI IVADFGR +++L KDVVAPY H+V SFI +D+PDP+++R TLLFFRGRTVRKD Sbjct: 216 ASILIVADFGRYTKELASLRKDVVAPYVHVVASFIDDDAPDPFEARHTLLFFRGRTVRKD 275 Query: 1684 EGIIRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAI 1863 EG IRA+L IL G + +E + A+ +G K STE MRSSKFCLHPAGDTPSSCRLFDAI Sbjct: 276 EGKIRAKLANILKGKDGVRFENSFATGDGIKISTEGMRSSKFCLHPAGDTPSSCRLFDAI 335 Query: 1864 VSHCVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKL 2043 VSHCVPVI+S +IELPFE EIDY EFS+FFSV E LRP Y++N+LR++ K+KW+ MWLKL Sbjct: 336 VSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRPDYLLNQLRQIPKKKWMEMWLKL 395 Query: 2044 KEISHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190 K +S H+EFQ+PP++ DAVNMIW++V+HKIPAV LA+HR+RRLKI DWW Sbjct: 396 KNVSRHYEFQHPPREGDAVNMIWRQVRHKIPAVNLAIHRNRRLKIPDWW 444 >ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor] gi|241928830|gb|EES01975.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor] Length = 432 Score = 479 bits (1233), Expect = e-132 Identities = 240/405 (59%), Positives = 297/405 (73%), Gaps = 3/405 (0%) Frame = +1 Query: 985 KSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWR 1164 ++ LN P AP A + PLRV+MYDLP RF+V MM + P+W Sbjct: 40 RATLLNLKPFSARCAP---AAAAPLRVFMYDLPARFHVAMMGAD-------DGAGFPAWP 89 Query: 1165 WN-DGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXX 1341 + G+R+QHSVEYWM+ASL G DG REAVRV DP +AD Sbjct: 90 PSAGGIRRQHSVEYWMMASLQDGAAGPDGG---REAVRVRDPDAADAFFVPFFSSLSFNV 146 Query: 1342 HVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIF 1518 H RNM + DT D LQ+E+V+IL S YW+RS GRDHVIPMHHPNAFR R +NASI Sbjct: 147 HGRNMTDPDTEADRLLQVEIVDILWKSKYWQRSAGRDHVIPMHHPNAFRFLRAMVNASIL 206 Query: 1519 IVADFGRIMN-ISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGII 1695 IV+DFGR +++L KDVVAPY H+V+SF+ +D PDP+++R TLLFFRGRTVRKDEG I Sbjct: 207 IVSDFGRYTKELASLRKDVVAPYVHVVDSFLDDDPPDPFEARHTLLFFRGRTVRKDEGKI 266 Query: 1696 RAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHC 1875 RA+L K+L G + + +E++ A+ +G K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHC Sbjct: 267 RAKLGKVLKGKEGVRFEDSIATGDGIKISTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHC 326 Query: 1876 VPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEIS 2055 VPVI+S +IELPFE EIDY EFS+FFSV E LRP Y++N+LR++ K+KW++MW KLK +S Sbjct: 327 VPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRPDYLLNQLRQIPKKKWVDMWSKLKNVS 386 Query: 2056 HHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190 HH+EFQYPP+K DAVNMIW++V+HKIPAV LA+HR+RRLKI DWW Sbjct: 387 HHYEFQYPPRKGDAVNMIWRQVRHKIPAVNLAIHRNRRLKIPDWW 431 >gb|EXC06151.1| putative glycosyltransferase [Morus notabilis] Length = 469 Score = 478 bits (1230), Expect = e-132 Identities = 241/409 (58%), Positives = 288/409 (70%), Gaps = 4/409 (0%) Frame = +1 Query: 979 DYKSQFLNFIPQLTNNAPPCRA-RSP-PLRVYMYDLPPRFNVGMMDPNFPDATPVTALNI 1152 D +S F + P C SP PLRV+MYDLP RFNVGM++ D PVTA Sbjct: 65 DLRSYFFPLLQSPPGARPLCATIASPLPLRVFMYDLPRRFNVGMLNRRSSDQAPVTAQTW 124 Query: 1153 PSWRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXX 1332 P W N GL++QHSVEYWM+ SLLY G DG RE VRV+DP A+ Sbjct: 125 PPWPKNSGLKRQHSVEYWMMGSLLYDG---DG----REVVRVSDPEMAEAFFVPFFSSLS 177 Query: 1333 XXXHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINA 1509 H NM + T +D +LQ++++ L S YWKR GRDHVIPM HPNAFR R ++NA Sbjct: 178 FNTHGHNMTDPKTRIDHQLQIDLLEFLGESKYWKRYGGRDHVIPMTHPNAFRFLRAELNA 237 Query: 1510 SIFIVADFGRI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDE 1686 SI IV DFGR +S L KDVVAPY H+V+SF +D DPY+SR TLLFFRGRT RKDE Sbjct: 238 SIQIVVDFGRHPRTMSNLGKDVVAPYVHVVDSFTDDDLSDPYESRTTLLFFRGRTFRKDE 297 Query: 1687 GIIRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIV 1866 GI+R +L K+L G D+ YE + A+ E KAS+ MR SKFCLHPAGDTPSSCRLFDAIV Sbjct: 298 GIVRVKLAKVLAGYDDVHYERSVATGENIKASSLGMRLSKFCLHPAGDTPSSCRLFDAIV 357 Query: 1867 SHCVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLK 2046 SHCVPVI+SD+IELPFE EIDY +FS+FFS E L PGY+V +LR+ KEKW+ MW +LK Sbjct: 358 SHCVPVIVSDQIELPFEDEIDYSQFSLFFSFKEALEPGYMVEQLRKFPKEKWVEMWRRLK 417 Query: 2047 EISHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193 ISHHFEFQYPP KEDAV+M+W++VKHK+P V LAVHRSRRLK+ DWW+ Sbjct: 418 NISHHFEFQYPPNKEDAVDMLWRQVKHKVPGVNLAVHRSRRLKVPDWWK 466 >ref|XP_003524893.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like isoform X1 [Glycine max] Length = 427 Score = 478 bits (1229), Expect = e-132 Identities = 232/396 (58%), Positives = 291/396 (73%), Gaps = 2/396 (0%) Frame = +1 Query: 1012 QLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGLRKQH 1191 +L + AP A PPLRV+MYDLP RFNVGM+D PVT + P+W N GL+KQH Sbjct: 37 KLPSGAPAPCAPDPPLRVFMYDLPRRFNVGMIDRRSAAEMPVTVEDWPAWPVNWGLKKQH 96 Query: 1192 SVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT 1371 SVEYWM+ SLL GG RE VRV+DP A H M + T Sbjct: 97 SVEYWMMGSLLNVGGG-------REVVRVSDPELAQAFFVPFFSSLSFNTHGHTMKDPAT 149 Query: 1372 -VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRI-M 1545 +D +LQ++++ +L+ S+YW+RS GRDHV PM HPNAFR RDQ+N SI +V DFGR Sbjct: 150 QIDRQLQVDLMELLKKSNYWQRSGGRDHVFPMTHPNAFRFLRDQLNESIQVVVDFGRYPR 209 Query: 1546 NISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNG 1725 +S L KDVV+PY H+V+SF ++ DPY+SR TLLFFRGRT RKDEGI+R +L KIL G Sbjct: 210 GMSNLNKDVVSPYVHVVDSFTDDEPQDPYESRSTLLFFRGRTYRKDEGIVRVKLAKILAG 269 Query: 1726 TKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIE 1905 D+ YE + A+EE KAS++ MRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+SD+IE Sbjct: 270 YDDVHYERSVATEENIKASSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDQIE 329 Query: 1906 LPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPK 2085 LPFE EIDY +FS+FFS E L+PGY++++LR+ KEKW MW +LK ISHH+EF+YPPK Sbjct: 330 LPFEDEIDYSQFSVFFSFKEALQPGYMIDQLRKFPKEKWTEMWRQLKSISHHYEFRYPPK 389 Query: 2086 KEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193 +EDAV+M+W++VKHK+P VKL+VHR+RRLKI DWW+ Sbjct: 390 REDAVDMLWRQVKHKLPGVKLSVHRNRRLKIPDWWQ 425