BLASTX nr result

ID: Mentha29_contig00011778 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00011778
         (2414 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS63888.1| exostosin-like protein, partial [Genlisea aurea]       574   e-161
gb|EYU25911.1| hypothetical protein MIMGU_mgv1a009897mg [Mimulus...   524   e-146
ref|XP_002512333.1| catalytic, putative [Ricinus communis] gi|22...   499   e-138
ref|XP_006438045.1| hypothetical protein CICLE_v10031600mg [Citr...   498   e-138
ref|XP_006391283.1| hypothetical protein EUTSA_v10018590mg [Eutr...   497   e-138
ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata...   496   e-137
ref|XP_007227458.1| hypothetical protein PRUPE_ppa005995mg [Prun...   495   e-137
ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group] g...   493   e-136
ref|XP_006848395.1| hypothetical protein AMTR_s00013p00218260 [A...   491   e-136
ref|XP_004310070.1| PREDICTED: probable glycosyltransferase At3g...   491   e-136
ref|XP_007044049.1| Exostosin family protein isoform 1 [Theobrom...   491   e-136
ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g...   491   e-136
ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable gly...   489   e-135
ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable gly...   488   e-135
ref|XP_006302174.1| hypothetical protein CARUB_v10020184mg [Caps...   487   e-134
ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana] g...   486   e-134
ref|XP_004971083.1| PREDICTED: probable glycosyltransferase At3g...   484   e-134
ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [S...   479   e-132
gb|EXC06151.1| putative glycosyltransferase [Morus notabilis]         478   e-132
ref|XP_003524893.1| PREDICTED: probable glucuronosyltransferase ...   478   e-132

>gb|EPS63888.1| exostosin-like protein, partial [Genlisea aurea]
          Length = 386

 Score =  574 bits (1480), Expect = e-161
 Identities = 271/383 (70%), Positives = 320/383 (83%), Gaps = 2/383 (0%)
 Frame = +1

Query: 1048 SPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGLRKQHSVEYWMVASLLY 1227
            S PLRVYMYDLP RFN+G+MDP+F D T V+A N P+WRWNDGLR+QHSVEYWM+ASL+ 
Sbjct: 8    SSPLRVYMYDLPARFNLGLMDPSFRDGTRVSAANFPAWRWNDGLRRQHSVEYWMMASLM- 66

Query: 1228 GGGNDDGSAS--TREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDTVDEKLQLEMV 1401
               NDD S    T EAVRV DP+SAD              +VRNM E DT+DE+LQ+E+V
Sbjct: 67   ---NDDDSPEEFTPEAVRVWDPNSADVFFVPFFASLSFNLYVRNMTEVDTIDEQLQVEIV 123

Query: 1402 NILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRIMNISTLAKDVVAP 1581
            N LR+S YWKRS GRDHVI +HHPNAFRH+R  +NASIFIVADFGRIM IS L+KDVVAP
Sbjct: 124  NFLRSSKYWKRSQGRDHVIAVHHPNAFRHHRGSVNASIFIVADFGRIMKISRLSKDVVAP 183

Query: 1582 YPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTKDIIYEEAHAS 1761
            YPHMVES++++   DPY+SRKTLLFFRGRT RKDEG IR +LHK+L+GT+ +IY+EA+AS
Sbjct: 184  YPHMVESYLNDAVDDPYESRKTLLFFRGRTRRKDEGKIRTRLHKLLHGTEGVIYDEAYAS 243

Query: 1762 EEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIELPFESEIDYKEF 1941
            EEGF+ STE MR+SKFCLHPAGDTPSSCRLFDAIVSHCVPVI+SDKIELPFESE+DYKEF
Sbjct: 244  EEGFRTSTEQMRASKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFESELDYKEF 303

Query: 1942 SIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKKEDAVNMIWKEV 2121
            SIFFS  E L PGY+V+ELR+VSK++W  MW KL+ ++HHFEFQYP KK+DAVNMIW++V
Sbjct: 304  SIFFSDEEALTPGYMVSELRKVSKQEWTKMWSKLRSVAHHFEFQYPTKKDDAVNMIWRQV 363

Query: 2122 KHKIPAVKLAVHRSRRLKIADWW 2190
            + K+P VKLA+HRSRRLKI DWW
Sbjct: 364  RQKVPTVKLAIHRSRRLKIPDWW 386


>gb|EYU25911.1| hypothetical protein MIMGU_mgv1a009897mg [Mimulus guttatus]
          Length = 328

 Score =  524 bits (1349), Expect = e-146
 Identities = 252/331 (76%), Positives = 288/331 (87%), Gaps = 2/331 (0%)
 Frame = +1

Query: 1207 MVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDTVDEKL 1386
            M+ASLL+ G   +GS  TREAVRVTDP SA+              HVRNMAE +TVDEKL
Sbjct: 1    MMASLLHEG---NGSGLTREAVRVTDPDSAEVFFVPFFSSLSFNVHVRNMAELNTVDEKL 57

Query: 1387 QLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRIMNISTLAK 1566
            QLEM+NIL+ASDYWK+S GRDHVIPMHHPNAFRHYRD++NASIFIVADFGRIMNIS LAK
Sbjct: 58   QLEMINILKASDYWKKSGGRDHVIPMHHPNAFRHYRDEVNASIFIVADFGRIMNISKLAK 117

Query: 1567 DVVAPYPHMVESFISEDSP--DPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTKDII 1740
            DVVAPYPHMVES+I+E+    DPYKSR+TLL FRGRT RKDEG IRAQLHK+LN TKD+I
Sbjct: 118  DVVAPYPHMVESYIAEEENHVDPYKSRQTLLVFRGRTKRKDEGKIRAQLHKMLNDTKDVI 177

Query: 1741 YEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIELPFES 1920
            YEE  ASEEGFKAS E MRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+SD++ELPFES
Sbjct: 178  YEEGAASEEGFKASAEQMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDRLELPFES 237

Query: 1921 EIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKKEDAV 2100
            EIDYKEFS+FFSVNE L+PGY++++LR VS+++W+ MW ++K I+HHFEFQYPPK EDAV
Sbjct: 238  EIDYKEFSMFFSVNEALQPGYLIDKLRAVSEDQWLKMWSRVKSITHHFEFQYPPKDEDAV 297

Query: 2101 NMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193
            NMIW++VKHK+PAVKLAVHRSRRLKI DWWR
Sbjct: 298  NMIWRQVKHKVPAVKLAVHRSRRLKIPDWWR 328


>ref|XP_002512333.1| catalytic, putative [Ricinus communis] gi|223548294|gb|EEF49785.1|
            catalytic, putative [Ricinus communis]
          Length = 434

 Score =  499 bits (1285), Expect = e-138
 Identities = 247/412 (59%), Positives = 309/412 (75%), Gaps = 7/412 (1%)
 Frame = +1

Query: 979  DYKSQFLNFIPQL---TNNAPPCRARSPPLRVYMYDLPPRFNVGMMDP--NFPDATPVTA 1143
            D +S F   + Q    T  A    A  PPL+VYMYDLP RF+VGMMD   +  + TPVT 
Sbjct: 27   DMRSYFFPLLQQQQSPTTGARSLCATGPPLKVYMYDLPRRFHVGMMDHGGDAKNDTPVTG 86

Query: 1144 LNIPSWRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXX 1323
             N+P+W  N GLRKQHSVEYW++ASLLY G ++      REAVRV DP  AD        
Sbjct: 87   ENLPTWPKNSGLRKQHSVEYWLMASLLYEGADE------REAVRVLDPEKADAFFVPFFS 140

Query: 1324 XXXXXXHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQ 1500
                  H   M + +T +D +LQ++++++L  S YW++S GRDHVIPM HPNAFR  R Q
Sbjct: 141  SLSFNTHGHTMTDPETEIDRQLQVDVIDMLYKSKYWQKSGGRDHVIPMTHPNAFRFLRQQ 200

Query: 1501 INASIFIVADFGRI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVR 1677
            +NASI IVADFGR   ++STL+KDVVAPY H+V+SF  ++  +P++SR TLLFFRG T+R
Sbjct: 201  LNASILIVADFGRYPKSMSTLSKDVVAPYVHVVDSFTDDEVSNPFESRTTLLFFRGNTIR 260

Query: 1678 KDEGIIRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFD 1857
            KDEG +RA+L KIL G  DI +E + A+ E  KASTE MRSSKFCLHPAGDTPSSCRLFD
Sbjct: 261  KDEGKVRAKLAKILTGYDDIHFERSSATAETIKASTEGMRSSKFCLHPAGDTPSSCRLFD 320

Query: 1858 AIVSHCVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWL 2037
            AIVSHCVPVI+SD+IELP+E EIDY +FS+FFSVNE ++PGY+V++LR++ KE+W+ MW 
Sbjct: 321  AIVSHCVPVIVSDQIELPYEDEIDYSQFSVFFSVNEAIQPGYMVDQLRQLPKERWLEMWR 380

Query: 2038 KLKEISHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193
            KLK ISHHFEFQYPP+KEDAV+M+W+EVKHK+P  +LAVHRSRRLKI DWW+
Sbjct: 381  KLKSISHHFEFQYPPEKEDAVDMLWREVKHKLPGAQLAVHRSRRLKIQDWWQ 432


>ref|XP_006438045.1| hypothetical protein CICLE_v10031600mg [Citrus clementina]
            gi|567891051|ref|XP_006438046.1| hypothetical protein
            CICLE_v10031600mg [Citrus clementina]
            gi|568861185|ref|XP_006484086.1| PREDICTED: probable
            glycosyltransferase At3g07620-like isoform X1 [Citrus
            sinensis] gi|568861187|ref|XP_006484087.1| PREDICTED:
            probable glycosyltransferase At3g07620-like isoform X2
            [Citrus sinensis] gi|568861189|ref|XP_006484088.1|
            PREDICTED: probable glycosyltransferase At3g07620-like
            isoform X3 [Citrus sinensis] gi|557540241|gb|ESR51285.1|
            hypothetical protein CICLE_v10031600mg [Citrus
            clementina] gi|557540242|gb|ESR51286.1| hypothetical
            protein CICLE_v10031600mg [Citrus clementina]
          Length = 431

 Score =  498 bits (1283), Expect = e-138
 Identities = 241/399 (60%), Positives = 301/399 (75%), Gaps = 2/399 (0%)
 Frame = +1

Query: 1000 NFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGL 1179
            +F P L + A  C A   PLRVYMYDLP RF+VGM+D + PD  PVT+ N+P W  + G+
Sbjct: 39   HFFPLLQSTAQSCSA---PLRVYMYDLPRRFHVGMLDHSSPDGLPVTSENLPRWPRSSGI 95

Query: 1180 RKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMA 1359
            ++QHSVEYW++ASLLY     DG +  REAVRV+DP +A               H  NM 
Sbjct: 96   KRQHSVEYWLMASLLY-----DGESEEREAVRVSDPDTAQAFFVPFFSSLSFNTHGHNMT 150

Query: 1360 ETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFG 1536
            + DT  D +LQ+E++  LR S YW++S GRDHVIPM HPNAFR  R Q+NASI IVADFG
Sbjct: 151  DPDTEFDRQLQIEILEFLRNSKYWQKSGGRDHVIPMTHPNAFRFLRQQLNASILIVADFG 210

Query: 1537 RI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHK 1713
            R   ++S L+KDVVAPY H+VESF  ++ PDP+ +RKTLLFF+G T+RKDEG +RA+L K
Sbjct: 211  RYPRSMSNLSKDVVAPYVHVVESFTDDNPPDPFVARKTLLFFQGNTIRKDEGKVRAKLAK 270

Query: 1714 ILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIIS 1893
            IL G  D+ YE +  + +  K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVI+S
Sbjct: 271  ILTGYDDVHYERSAPTTKSIKESTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVS 330

Query: 1894 DKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQ 2073
            D+IELPFE EIDY EFS+FFS+ E  +PGY++++LR++ K +WI MW +LK ISH++EFQ
Sbjct: 331  DRIELPFEDEIDYSEFSVFFSIKEAGQPGYMIDQLRQIPKARWIEMWQRLKSISHYYEFQ 390

Query: 2074 YPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190
            YPPKKEDAVNM+W++VK+KIP V+LAVHR RRLKI DWW
Sbjct: 391  YPPKKEDAVNMVWRQVKNKIPGVQLAVHRHRRLKIPDWW 429


>ref|XP_006391283.1| hypothetical protein EUTSA_v10018590mg [Eutrema salsugineum]
            gi|557087717|gb|ESQ28569.1| hypothetical protein
            EUTSA_v10018590mg [Eutrema salsugineum]
          Length = 432

 Score =  497 bits (1280), Expect = e-138
 Identities = 238/393 (60%), Positives = 290/393 (73%), Gaps = 5/393 (1%)
 Frame = +1

Query: 1027 APPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGLRKQHSVEYW 1206
            A PC     PLRV+MYDLP +FNV MMDP   D  P+T  N+PSW    G+++QHSVEYW
Sbjct: 42   ASPCSITGRPLRVFMYDLPRKFNVAMMDPQSSDVEPLTGKNLPSWPQTSGIKRQHSVEYW 101

Query: 1207 MVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VDEK 1383
            ++ASLL+GGG   G    +EA RV DP  AD              H +NM + DT  D +
Sbjct: 102  LMASLLHGGG---GGEEEKEAFRVFDPELADAFYVPFFSSLSFNTHGKNMTDPDTEFDRQ 158

Query: 1384 LQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRI-MNISTL 1560
            LQ+E++  L  S YW+RS GRDHVIPM HPNAFR  R Q+NASI +V DFGR    ++ L
Sbjct: 159  LQVELMEYLENSKYWQRSGGRDHVIPMTHPNAFRFLRQQVNASILVVVDFGRYPREMARL 218

Query: 1561 AKDVVAPYPHMVESFISE---DSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTK 1731
             KDVV+PY H+VESF  +   D+PDP+++R TLL+FRG TVRK EG IR +L K+L G  
Sbjct: 219  GKDVVSPYVHVVESFTEDGGVDTPDPFEARTTLLYFRGNTVRKAEGKIRLRLEKLLAGNS 278

Query: 1732 DIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIELP 1911
            D+ YE++ A+ +  K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVIISD+IELP
Sbjct: 279  DVHYEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDRIELP 338

Query: 1912 FESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKKE 2091
            FE EIDY EFS+FFS+ E L PGYI+N LR+  KEKW+ MW  LK +SHHFEFQYPPK+E
Sbjct: 339  FEDEIDYSEFSVFFSIKEALEPGYILNNLRQFPKEKWLQMWENLKNVSHHFEFQYPPKRE 398

Query: 2092 DAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190
            DAVNM+W++VKHKIP+VKLAVHR+RRLK+ DWW
Sbjct: 399  DAVNMLWRQVKHKIPSVKLAVHRNRRLKVPDWW 431


>ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297334437|gb|EFH64855.1| exostosin family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 429

 Score =  496 bits (1277), Expect = e-137
 Identities = 238/395 (60%), Positives = 295/395 (74%), Gaps = 5/395 (1%)
 Frame = +1

Query: 1021 NNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGLRKQHSVE 1200
            N A PC +   PLRV+MYDLP +FNV MMDP+  D  P+T  N+PSW    G+++QHSVE
Sbjct: 40   NVASPCSSTGKPLRVFMYDLPRKFNVAMMDPHSSDVEPLTGKNLPSWPQTSGIKRQHSVE 99

Query: 1201 YWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VD 1377
            YW++ASLL GG +D+      EA+RV DP  AD              H +NM + DT  D
Sbjct: 100  YWLMASLLNGGDDDN------EAIRVFDPDLADAFYVPFFSSLSFNTHGKNMTDPDTEFD 153

Query: 1378 EKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRIM-NIS 1554
             +LQ+E++  L  S+YW RS G+DHVIPM HPNAFR  R Q+NASI IV DFGR   +++
Sbjct: 154  RQLQVELMEFLEGSEYWNRSGGKDHVIPMTHPNAFRFLRQQVNASILIVVDFGRYAKDMA 213

Query: 1555 TLAKDVVAPYPHMVESFISEDSP---DPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNG 1725
             L+KDVV+PY H+VES   ED     DP+++R TLL+FRG TVRKDEG IR +L K+L G
Sbjct: 214  RLSKDVVSPYVHVVESLNEEDDDGLTDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAG 273

Query: 1726 TKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIE 1905
              D+ +E++ A+ +  K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVIISDKIE
Sbjct: 274  NSDVHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIE 333

Query: 1906 LPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPK 2085
            LPFE EIDY EFS+FFS+ E+L PGYI+N+LR+  KEKW+ MW +LK +SHHFEFQYPPK
Sbjct: 334  LPFEDEIDYSEFSLFFSIKESLEPGYILNKLRQFPKEKWLEMWKRLKNVSHHFEFQYPPK 393

Query: 2086 KEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190
            +EDAVNM+W++VKHKIP VKLAVHR+RRLK+ DWW
Sbjct: 394  REDAVNMLWRQVKHKIPNVKLAVHRNRRLKVPDWW 428


>ref|XP_007227458.1| hypothetical protein PRUPE_ppa005995mg [Prunus persica]
            gi|462424394|gb|EMJ28657.1| hypothetical protein
            PRUPE_ppa005995mg [Prunus persica]
          Length = 433

 Score =  495 bits (1274), Expect = e-137
 Identities = 241/407 (59%), Positives = 300/407 (73%), Gaps = 2/407 (0%)
 Frame = +1

Query: 979  DYKSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPS 1158
            D +S FL  +P     A P RA  PPL+VYMYDLP RFNVGM++    +  PVTA   P+
Sbjct: 28   DIRSYFLPLLPSPPPGAQPPRATGPPLKVYMYDLPRRFNVGMLNRKSTEQAPVTARTWPT 87

Query: 1159 WRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXX 1338
            W  N GL++QHSVEYWM+ SLL+ G   DG    R AVRV+DP  AD             
Sbjct: 88   WPRNSGLKRQHSVEYWMMGSLLFDGDGGDG----RAAVRVSDPELADAFFVPFFSSLSFN 143

Query: 1339 XHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASI 1515
             H  +M +  T +D +LQ++++ IL  S YW+RS GRDHVIP+ HPNAFR  R QINASI
Sbjct: 144  THGHHMTDPATEIDHQLQIDVLKILGESKYWQRSGGRDHVIPLTHPNAFRFLRPQINASI 203

Query: 1516 FIVADFGRIMNI-STLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGI 1692
             IV DFGR  ++ S L+KDVV+PY H+V+SF  ++  +PY+SR TLLFF+GRT RKDEGI
Sbjct: 204  QIVVDFGRYPHVMSNLSKDVVSPYVHVVDSFTDDNHSNPYESRTTLLFFQGRTFRKDEGI 263

Query: 1693 IRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSH 1872
            +R +L KIL G  D+ YE + A+ +  KAS++ MRSSKFCLHPAGDTPSSCRLFDAIVSH
Sbjct: 264  VRVKLAKILAGYDDVHYERSVATGDNIKASSQRMRSSKFCLHPAGDTPSSCRLFDAIVSH 323

Query: 1873 CVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEI 2052
            CVPVI+SD+IELPFE EIDY +FS+FFS  E L PGY+V++LR+  K++WI MW +L  I
Sbjct: 324  CVPVIVSDEIELPFEDEIDYTKFSLFFSFKEALEPGYMVDQLRKFPKDRWIEMWRQLNSI 383

Query: 2053 SHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193
            SHHFEF YPP+KEDAVNM+W++VKHK+PAVKLA+HR+RRLKI DWWR
Sbjct: 384  SHHFEFHYPPEKEDAVNMLWRQVKHKLPAVKLAIHRNRRLKIPDWWR 430


>ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group]
            gi|19386797|dbj|BAB86176.1| OJ1485_B09.5 [Oryza sativa
            Japonica Group] gi|57899432|dbj|BAD88370.1|
            exostosin-like [Oryza sativa Japonica Group]
            gi|113534757|dbj|BAF07140.1| Os01g0921300 [Oryza sativa
            Japonica Group] gi|125573139|gb|EAZ14654.1| hypothetical
            protein OsJ_04578 [Oryza sativa Japonica Group]
            gi|215741014|dbj|BAG97509.1| unnamed protein product
            [Oryza sativa Japonica Group]
            gi|215767487|dbj|BAG99715.1| unnamed protein product
            [Oryza sativa Japonica Group]
          Length = 437

 Score =  493 bits (1268), Expect = e-136
 Identities = 243/391 (62%), Positives = 296/391 (75%), Gaps = 5/391 (1%)
 Frame = +1

Query: 1033 PCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWN-DGLRKQHSVEYWM 1209
            P  A +PPLRV+MYDLP RF+VGMMD         +A   P+W  +  G+R+QHSVEYWM
Sbjct: 54   PAAAAAPPLRVFMYDLPRRFHVGMMD--------ASASGFPAWPPSAGGIRRQHSVEYWM 105

Query: 1210 VASLLYGGGNDDGSAST--REAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VDE 1380
            +ASL  GGG  +GS+S   REAVRVTDP +A+              H RNM + +T  D 
Sbjct: 106  MASLQGGGGGGNGSSSEEGREAVRVTDPDAAEAFFVPFFSSLSFNVHGRNMTDPETEADR 165

Query: 1381 KLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRIMN-IST 1557
             LQ+E++ IL  S YW+RS GRDHVIPMHHPNAFR  RD +NASI IVADFGR    +++
Sbjct: 166  LLQVELMEILWKSKYWQRSAGRDHVIPMHHPNAFRFLRDMVNASILIVADFGRYTKELAS 225

Query: 1558 LAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTKDI 1737
            L KDVVAPY H+V+SF+++D PDP+  R TLLFFRGRTVRKDEG IRA+L KIL G   +
Sbjct: 226  LRKDVVAPYVHVVDSFLNDDPPDPFDDRPTLLFFRGRTVRKDEGKIRAKLAKILKGKDGV 285

Query: 1738 IYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIELPFE 1917
             +E++ A+ EG K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVI+S +IELPFE
Sbjct: 286  RFEDSLATGEGIKTSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFE 345

Query: 1918 SEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKKEDA 2097
             EIDY EFS+FFSV E LRP Y++N+LR++ K KW+ +W KLK +SHH+EFQ PP+K DA
Sbjct: 346  DEIDYSEFSLFFSVEEALRPDYLLNQLRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGDA 405

Query: 2098 VNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190
            VNMIW++VKHK+PAV LA+HR+RRLKI DWW
Sbjct: 406  VNMIWRQVKHKVPAVNLAIHRNRRLKIPDWW 436


>ref|XP_006848395.1| hypothetical protein AMTR_s00013p00218260 [Amborella trichopoda]
            gi|548851701|gb|ERN09976.1| hypothetical protein
            AMTR_s00013p00218260 [Amborella trichopoda]
          Length = 422

 Score =  491 bits (1265), Expect = e-136
 Identities = 242/406 (59%), Positives = 305/406 (75%), Gaps = 2/406 (0%)
 Frame = +1

Query: 979  DYKSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPS 1158
            D +SQF  F P +   AP     + PL++YMY+LP  FN+GM+  + P         IP 
Sbjct: 26   DLRSQF--FAPTII--APS----NSPLKIYMYNLPRHFNIGMLRRSDPHQDLPFTGQIPP 77

Query: 1159 WRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXX 1338
            W  N GL+KQHSVEYWM+ASLLY    +DG     EA+RV+DP  AD             
Sbjct: 78   WPQNSGLKKQHSVEYWMMASLLY----EDGEGRDMEAIRVSDPEEADAFFVPFFSSLSFN 133

Query: 1339 XHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASI 1515
             H  NM + +T VD +LQ+E++  LR S +W++S GRDHVIPMHHPNAFR  R+++NASI
Sbjct: 134  THGHNMTDPETEVDRQLQIELLEFLRISKFWEQSGGRDHVIPMHHPNAFRFLREKVNASI 193

Query: 1516 FIVADFGRI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGI 1692
             +VADFGR   NIS+L+KDVVAPY H+ +SFI +DS DP++SR TLLFFRGRTVRK EGI
Sbjct: 194  LVVADFGRCPKNISSLSKDVVAPYVHVGDSFIDDDSSDPFESRTTLLFFRGRTVRKAEGI 253

Query: 1693 IRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSH 1872
            +R++L KIL G + + +EE+ A+ E  KAS+  MRSSKFCL+PAGDTPSSCRLFDAIVSH
Sbjct: 254  VRSKLAKILRGQEGVHFEESVATGESIKASSLGMRSSKFCLNPAGDTPSSCRLFDAIVSH 313

Query: 1873 CVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEI 2052
            C+PVI+SD+IELP+E EIDY+ FS+FFSV E LRPGY++ ELR++ +EKW+ MW +LKEI
Sbjct: 314  CIPVIVSDRIELPYEDEIDYRTFSLFFSVEEALRPGYMLKELRQIKREKWVEMWRRLKEI 373

Query: 2053 SHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190
            SHHFEFQ+PPK++DAVNMIWK+V+HK+PA KLAVHRSRRLKI DWW
Sbjct: 374  SHHFEFQFPPKRDDAVNMIWKQVRHKLPAAKLAVHRSRRLKIPDWW 419


>ref|XP_004310070.1| PREDICTED: probable glycosyltransferase At3g07620-like [Fragaria
            vesca subsp. vesca]
          Length = 446

 Score =  491 bits (1265), Expect = e-136
 Identities = 245/402 (60%), Positives = 299/402 (74%), Gaps = 5/402 (1%)
 Frame = +1

Query: 1003 FIPQLTNN--APPCR-ARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWND 1173
            FIP L ++  AP    A  PPL+V+MYDLP RFNVGM++    +  PVTA   P W  N 
Sbjct: 46   FIPLLKSSPLAPQSLCATGPPLKVFMYDLPRRFNVGMLNRKSAEEAPVTAREWPPWPRNS 105

Query: 1174 GLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRN 1353
            GL+KQHSVEYWM+ S+L+ G   +GS    E VRV+DP  AD              H  N
Sbjct: 106  GLKKQHSVEYWMMGSVLWEGNGGEGS----EVVRVSDPEVADAFFVPFFSSLSFNTHGHN 161

Query: 1354 MAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVAD 1530
            M + +T VD +LQ+++V +L  S YW RS GRDHVIPM HPNAFR  R QINASI IV D
Sbjct: 162  MNDPETEVDHQLQIDLVKLLHESKYWNRSGGRDHVIPMTHPNAFRFLRPQINASIQIVVD 221

Query: 1531 FGRIMNI-STLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQL 1707
            FGR  ++ S L+KDVV PY H+VESF  ++S DPY+SR TLLFF+GRT RKDEGI+RA+L
Sbjct: 222  FGRYPHVMSNLSKDVVTPYVHVVESFTDDNSSDPYESRTTLLFFQGRTHRKDEGIVRAKL 281

Query: 1708 HKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVI 1887
             K+L G  D+ YE + A+ E  K ST+ MR+SKFCLHPAGDTPSSCRLFDAIVSHC+PVI
Sbjct: 282  AKVLAGYDDVHYERSVATGENIKLSTQRMRASKFCLHPAGDTPSSCRLFDAIVSHCIPVI 341

Query: 1888 ISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFE 2067
            +SD+IELPFE E+DY +FS+FFS  E L+PGY+VNELR++SKEKW+ M+  LK ISHHFE
Sbjct: 342  VSDEIELPFEDELDYNQFSVFFSFKEALQPGYMVNELRKLSKEKWMEMYRHLKSISHHFE 401

Query: 2068 FQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193
            F YPP+KEDAVNM+W++VK K+PAVKLAVHRS+RLKI DWWR
Sbjct: 402  FHYPPEKEDAVNMLWRQVKRKVPAVKLAVHRSQRLKIPDWWR 443


>ref|XP_007044049.1| Exostosin family protein isoform 1 [Theobroma cacao]
            gi|590692416|ref|XP_007044050.1| Exostosin family protein
            isoform 1 [Theobroma cacao]
            gi|590692424|ref|XP_007044051.1| Exostosin family protein
            isoform 1 [Theobroma cacao] gi|508707984|gb|EOX99880.1|
            Exostosin family protein isoform 1 [Theobroma cacao]
            gi|508707985|gb|EOX99881.1| Exostosin family protein
            isoform 1 [Theobroma cacao] gi|508707986|gb|EOX99882.1|
            Exostosin family protein isoform 1 [Theobroma cacao]
          Length = 432

 Score =  491 bits (1264), Expect = e-136
 Identities = 241/387 (62%), Positives = 293/387 (75%), Gaps = 3/387 (0%)
 Frame = +1

Query: 1042 ARSPPLRVYMYDLPPRFNVGMMDP-NFPDATPVTALNIPSWRWNDGLRKQHSVEYWMVAS 1218
            A   PLRVYMYDLP +F+VGM+D  +  +A PVT  N+P W  N G+++QHSVEYW++AS
Sbjct: 47   ATGRPLRVYMYDLPRKFHVGMLDRRSSEEAAPVTMENLPPWPSNSGIKRQHSVEYWLMAS 106

Query: 1219 LLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VDEKLQLE 1395
            LLY G ++DG    REAVRV DP  AD              H  NM + +T +D  LQ+E
Sbjct: 107  LLYDGQDEDG----REAVRVLDPEKADAFFVPFFSSLSFNTHGHNMTDPETEIDRHLQVE 162

Query: 1396 MVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRI-MNISTLAKDV 1572
            ++  L+ S Y++RS GRDHVIPM HPNAFR  R+Q+NASI IV DFGR    +S+L+KDV
Sbjct: 163  LLEFLQQSKYYQRSGGRDHVIPMTHPNAFRFLREQLNASILIVVDFGRYPKTMSSLSKDV 222

Query: 1573 VAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTKDIIYEEA 1752
            VAPY H+V+SF  +D  DPY+SR TLLFFRG TVRKDEG IR +L KIL G+ D+ YE++
Sbjct: 223  VAPYVHVVDSFTDDDPLDPYESRTTLLFFRGNTVRKDEGKIRVKLAKILAGSDDVHYEKS 282

Query: 1753 HASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIELPFESEIDY 1932
             A+ +  K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVI+SDKIELP+E EIDY
Sbjct: 283  VATPKNIKMSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPYEDEIDY 342

Query: 1933 KEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKKEDAVNMIW 2112
             EFSIFFS+ E L PGY+VN LR+  K +W+ MW  LK IS H+EFQYPPKKEDAVNM+W
Sbjct: 343  TEFSIFFSMKEALEPGYLVNHLRQFPKNRWVQMWKLLKNISRHYEFQYPPKKEDAVNMLW 402

Query: 2113 KEVKHKIPAVKLAVHRSRRLKIADWWR 2193
            ++VKHK+P V+LAVHRSRRLK+ DWWR
Sbjct: 403  RQVKHKLPGVQLAVHRSRRLKVPDWWR 429


>ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
            sativus]
          Length = 429

 Score =  491 bits (1264), Expect = e-136
 Identities = 242/407 (59%), Positives = 298/407 (73%), Gaps = 2/407 (0%)
 Frame = +1

Query: 979  DYKSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPS 1158
            D +S F   +     +  PC    PPLRVYMYDLP RFNVG+++    D TPVTA   P 
Sbjct: 27   DIRSYFFPLLQSQPISPFPCTT-DPPLRVYMYDLPRRFNVGILNRRNLDQTPVTASTWPP 85

Query: 1159 WRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXX 1338
            W  N GL++QHSVEYWM+ SLL+    D      R+AVRV DP +AD             
Sbjct: 86   WPRNSGLKRQHSVEYWMMGSLLHEATGDG-----RDAVRVMDPENADAFFVPFFSSLSFN 140

Query: 1339 XHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASI 1515
             H RNM +  T VD +LQ++++  L  S YW+RS GRDHVIPM HPNAFR  R+Q+NASI
Sbjct: 141  SHGRNMTDPATEVDHQLQIDLMKFLSESKYWQRSKGRDHVIPMTHPNAFRFLRNQVNASI 200

Query: 1516 FIVADFGRI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGI 1692
             IV DFGR    +S L KDVVAPY H+V SFI ++ PDP++SR TLLFF+G+T RKD+GI
Sbjct: 201  QIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKDDGI 260

Query: 1693 IRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSH 1872
            IR +L KIL+G  D+ YE + A+E+  K S++ MRSSKFCLHPAGDTPSSCRLFDAIVSH
Sbjct: 261  IRVKLAKILDGYDDVHYERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSH 320

Query: 1873 CVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEI 2052
            CVPVI+SD+IELP+E EIDY +F++FFS  E L+PGY+V +LR   KE+WI MW +LKEI
Sbjct: 321  CVPVIVSDQIELPYEDEIDYSQFTLFFSFEEALQPGYMVEKLREFPKERWIEMWKQLKEI 380

Query: 2053 SHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193
            S H+EFQYPPKKEDAVNM+W++VKHK+PAVKLAVHRSRRLK+ DWW+
Sbjct: 381  SRHYEFQYPPKKEDAVNMLWRQVKHKLPAVKLAVHRSRRLKVPDWWQ 427


>ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
            At3g07620-like [Cucumis sativus]
          Length = 429

 Score =  489 bits (1259), Expect = e-135
 Identities = 241/407 (59%), Positives = 297/407 (72%), Gaps = 2/407 (0%)
 Frame = +1

Query: 979  DYKSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPS 1158
            D +S F   +     +  PC    PPLRVYMYDLP RFNVG+++    D TPVTA   P 
Sbjct: 27   DIRSYFFPLLQSQPISPFPCTT-DPPLRVYMYDLPRRFNVGILNRRNLDQTPVTASTWPP 85

Query: 1159 WRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXX 1338
            W  N GL++QHSVEYWM+ SLL+    D      R+AVRV DP +AD             
Sbjct: 86   WPRNSGLKRQHSVEYWMMGSLLHEATGDG-----RDAVRVMDPENADAFFVPFFSSLSFN 140

Query: 1339 XHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASI 1515
             H RNM +  T VD +LQ++++  L  S YW+RS GRDHVIPM HPNAFR  R+Q+NASI
Sbjct: 141  SHGRNMTDPATEVDHQLQIDLMKFLSESKYWQRSKGRDHVIPMTHPNAFRFLRNQVNASI 200

Query: 1516 FIVADFGRI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGI 1692
             IV DFGR    +S L KDVVAPY H+V SFI ++ PDP++SR TLLFF+G+T RKD+GI
Sbjct: 201  QIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKDDGI 260

Query: 1693 IRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSH 1872
            IR +L KIL+G  D+ YE + A+E+  K S++ MRSSKFCLHPAGDTPSSCRLFDAIVSH
Sbjct: 261  IRVKLAKILDGYDDVHYERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSH 320

Query: 1873 CVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEI 2052
            CVPVI+SD+IELP+E EIDY +F++FF   E L+PGY+V +LR   KE+WI MW +LKEI
Sbjct: 321  CVPVIVSDQIELPYEDEIDYSQFTLFFXFEEALQPGYMVEKLREFPKERWIEMWKQLKEI 380

Query: 2053 SHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193
            S H+EFQYPPKKEDAVNM+W++VKHK+PAVKLAVHRSRRLK+ DWW+
Sbjct: 381  SRHYEFQYPPKKEDAVNMLWRQVKHKLPAVKLAVHRSRRLKVPDWWQ 427


>ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
            At5g25310-like [Vitis vinifera]
          Length = 437

 Score =  488 bits (1257), Expect = e-135
 Identities = 241/394 (61%), Positives = 289/394 (73%), Gaps = 8/394 (2%)
 Frame = +1

Query: 1033 PCRARSPPLRVYMYDLPPRFNVGMMDPNFP-DATPVTALNIPSWRWNDGLRKQHSVEYWM 1209
            PC     PL VYMYDLP RF+VGM+    P D +PVTA N+P W  N GL+KQHSVEYWM
Sbjct: 45   PCSTGGGPLMVYMYDLPRRFHVGMLRRRSPADESPVTAENLPPWPSNSGLKKQHSVEYWM 104

Query: 1210 VASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VDEKL 1386
            +ASLLY GG   G   TREAVRV DP  AD              H  NM + DT  D +L
Sbjct: 105  MASLLYDGG---GGNETREAVRVWDPEMADAFFVPFFSSLSFNTHGHNMTDPDTEFDRQL 161

Query: 1387 QLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRI-MNISTLA 1563
            Q++++ ILR S YW+RS GRDHVIPMHHPNAFR +R+Q+N SI IVADFGR    IS L 
Sbjct: 162  QIDILKILRESKYWQRSGGRDHVIPMHHPNAFRFFREQVNTSILIVADFGRYPKEISNLR 221

Query: 1564 KDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNGTKDIIY 1743
            KDVVAPY H+V+SF  ++SPDPY+SR TLLFFRGRT+RKDEGI+R +L K+L G  D  Y
Sbjct: 222  KDVVAPYVHVVDSFTDDNSPDPYESRTTLLFFRGRTIRKDEGIVRDKLVKLLAGXDD--Y 279

Query: 1744 EEAHASEEGFKA-----STEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIEL 1908
             + H     + +     ST+ MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVI+SD+IEL
Sbjct: 280  LQLHFHHRSYLSFLVXQSTQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIEL 339

Query: 1909 PFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPKK 2088
            P+E EIDY +FSIFFS  E L PGY++ +LR++ KE+W+ MW  LK ISHH+EFQYPPKK
Sbjct: 340  PYEDEIDYTQFSIFFSDKEALEPGYMIEQLRQIPKERWVEMWRHLKYISHHYEFQYPPKK 399

Query: 2089 EDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190
             DA++M+W++VKHK+P   L VHRSRRLK+ DWW
Sbjct: 400  GDAIDMLWRQVKHKLPRANLDVHRSRRLKVPDWW 433


>ref|XP_006302174.1| hypothetical protein CARUB_v10020184mg [Capsella rubella]
            gi|482570884|gb|EOA35072.1| hypothetical protein
            CARUB_v10020184mg [Capsella rubella]
          Length = 494

 Score =  487 bits (1253), Expect = e-134
 Identities = 237/395 (60%), Positives = 292/395 (73%), Gaps = 7/395 (1%)
 Frame = +1

Query: 1027 APPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGLRKQHSVEYW 1206
            A PC +   PLRV+MYDLP +FNV MMDP   D  P+T  N+PSW    G+++QHSVEYW
Sbjct: 104  ASPCSSNGRPLRVFMYDLPRKFNVAMMDPRSSDVEPLTGKNLPSWPQTSGIKRQHSVEYW 163

Query: 1207 MVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT-VDEK 1383
            ++ASLL  GG  DG     EA+RV DP  AD              H +NM + DT  D K
Sbjct: 164  LMASLLQRGG--DGGDD--EAIRVFDPDLADAFYVPFFSSLSFNTHGKNMTDPDTEFDRK 219

Query: 1384 LQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRI-MNISTL 1560
            LQ+E++  L  S+YWKRS G+DHVIPM HPNAFR  R Q+NASI IV DFGR   +++ L
Sbjct: 220  LQVELMEFLENSEYWKRSGGKDHVIPMTHPNAFRFLRQQVNASILIVVDFGRYPKDMARL 279

Query: 1561 AKDVVAPYPHMVESFISEDSPD-----PYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNG 1725
            +KDVV+PY H+VE+ ++ED  D     P+++R TLL+FRG T RKDEG IR +L K+L  
Sbjct: 280  SKDVVSPYVHVVET-LTEDGDDDGMTDPFEARTTLLYFRGNTARKDEGKIRLRLEKLLAN 338

Query: 1726 TKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIE 1905
              D+ YE++ A+ +  K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVIISDKIE
Sbjct: 339  NSDVHYEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIE 398

Query: 1906 LPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPK 2085
            LPFE EIDY EFS+FFS+ E+L PGYI+N LR+  K+KW+ MW +LK +SHHFEFQYPPK
Sbjct: 399  LPFEDEIDYSEFSVFFSIKESLEPGYILNNLRQFPKDKWLEMWKRLKNVSHHFEFQYPPK 458

Query: 2086 KEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190
            +EDAVNM+W++VKHKIP VKLAVHR+RRLK+ DWW
Sbjct: 459  REDAVNMLWRQVKHKIPNVKLAVHRNRRLKVPDWW 493


>ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana]
            gi|115311405|gb|ABI93883.1| At1g67410 [Arabidopsis
            thaliana] gi|332196520|gb|AEE34641.1| exostosin-like
            protein [Arabidopsis thaliana]
            gi|591402328|gb|AHL38891.1| glycosyltransferase, partial
            [Arabidopsis thaliana]
          Length = 430

 Score =  486 bits (1252), Expect = e-134
 Identities = 236/409 (57%), Positives = 294/409 (71%), Gaps = 5/409 (1%)
 Frame = +1

Query: 979  DYKSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPS 1158
            D +  F     Q    + PC +   PLRV+MYDLP +FN+ MMDP+  D  P+T  N+PS
Sbjct: 27   DPRPYFYLLQSQPNGASSPCSSSGKPLRVFMYDLPRKFNIAMMDPHSSDVEPITGKNLPS 86

Query: 1159 WRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXX 1338
            W    G+++QHSVEYW++ASLL GG +++      EA+RV DP  AD             
Sbjct: 87   WPQTSGIKRQHSVEYWLMASLLNGGEDEN------EAIRVFDPDLADVFYVPFFSSLSFN 140

Query: 1339 XHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASI 1515
             H +NM + DT  D  LQ+E++  L  S YW RS G+DHVIPM HPNAFR  R Q+NASI
Sbjct: 141  THGKNMTDPDTEFDRLLQVELMEFLENSKYWNRSGGKDHVIPMTHPNAFRFLRQQVNASI 200

Query: 1516 FIVADFGRIM-NISTLAKDVVAPYPHMVESFISEDSP---DPYKSRKTLLFFRGRTVRKD 1683
             IV DFGR   +++ L+KDVV+PY H+VES   E      DP+++R TLL+FRG TVRKD
Sbjct: 201  LIVVDFGRYSKDMARLSKDVVSPYVHVVESLNEEGDDGMGDPFEARTTLLYFRGNTVRKD 260

Query: 1684 EGIIRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAI 1863
            EG IR +L K+L G  D+ +E++ A+ +  K STE MRSSKFCLHPAGDTPSSCRLFDAI
Sbjct: 261  EGKIRLRLEKLLAGNSDVHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAI 320

Query: 1864 VSHCVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKL 2043
            VSHC+PVIISDKIELPFE EIDY EFS+FFS+ E+L PGYI+N LR+  KEKW+ MW +L
Sbjct: 321  VSHCIPVIISDKIELPFEDEIDYSEFSLFFSIKESLEPGYILNNLRQFPKEKWLEMWKRL 380

Query: 2044 KEISHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190
            K +SHHFEFQYPPK+EDAVNM+W++VKHKIP VKLAVHR+RRLK+ DWW
Sbjct: 381  KNVSHHFEFQYPPKREDAVNMLWRQVKHKIPYVKLAVHRNRRLKVPDWW 429


>ref|XP_004971083.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X1
            [Setaria italica]
          Length = 445

 Score =  484 bits (1246), Expect = e-134
 Identities = 244/409 (59%), Positives = 301/409 (73%), Gaps = 7/409 (1%)
 Frame = +1

Query: 985  KSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWR 1164
            ++  LN  P  +   PP  A +PPLRV+MYDLPPRF+V MM     DA+  TA   P+W 
Sbjct: 39   RAALLNLKP-FSARCPPSAA-APPLRVFMYDLPPRFHVAMMT-GAADASNATAGPFPAWP 95

Query: 1165 WN-DGLRKQHSVEYWMVASLLYGGGNDDGSAST----REAVRVTDPHSADXXXXXXXXXX 1329
             +  G+++QHSVEYWM+ASL  GGG   G        REAVRV DP  A+          
Sbjct: 96   PSAGGIKRQHSVEYWMMASLQDGGGGGGGGGGVGSERREAVRVRDPDDAEAFFVPFFSSL 155

Query: 1330 XXXXHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQIN 1506
                H RNM + DT  D  LQ+E+++IL  S YW+RS GRDHVIPMHHPNAFR  R+ +N
Sbjct: 156  SFNVHGRNMTDPDTEADRLLQVELMDILWKSKYWQRSAGRDHVIPMHHPNAFRFLRNMVN 215

Query: 1507 ASIFIVADFGRIMN-ISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKD 1683
            ASI IVADFGR    +++L KDVVAPY H+V SFI +D+PDP+++R TLLFFRGRTVRKD
Sbjct: 216  ASILIVADFGRYTKELASLRKDVVAPYVHVVASFIDDDAPDPFEARHTLLFFRGRTVRKD 275

Query: 1684 EGIIRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAI 1863
            EG IRA+L  IL G   + +E + A+ +G K STE MRSSKFCLHPAGDTPSSCRLFDAI
Sbjct: 276  EGKIRAKLANILKGKDGVRFENSFATGDGIKISTEGMRSSKFCLHPAGDTPSSCRLFDAI 335

Query: 1864 VSHCVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKL 2043
            VSHCVPVI+S +IELPFE EIDY EFS+FFSV E LRP Y++N+LR++ K+KW+ MWLKL
Sbjct: 336  VSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRPDYLLNQLRQIPKKKWMEMWLKL 395

Query: 2044 KEISHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190
            K +S H+EFQ+PP++ DAVNMIW++V+HKIPAV LA+HR+RRLKI DWW
Sbjct: 396  KNVSRHYEFQHPPREGDAVNMIWRQVRHKIPAVNLAIHRNRRLKIPDWW 444


>ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
            gi|241928830|gb|EES01975.1| hypothetical protein
            SORBIDRAFT_03g044100 [Sorghum bicolor]
          Length = 432

 Score =  479 bits (1233), Expect = e-132
 Identities = 240/405 (59%), Positives = 297/405 (73%), Gaps = 3/405 (0%)
 Frame = +1

Query: 985  KSQFLNFIPQLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWR 1164
            ++  LN  P     AP   A + PLRV+MYDLP RF+V MM  +            P+W 
Sbjct: 40   RATLLNLKPFSARCAP---AAAAPLRVFMYDLPARFHVAMMGAD-------DGAGFPAWP 89

Query: 1165 WN-DGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXX 1341
             +  G+R+QHSVEYWM+ASL  G    DG    REAVRV DP +AD              
Sbjct: 90   PSAGGIRRQHSVEYWMMASLQDGAAGPDGG---REAVRVRDPDAADAFFVPFFSSLSFNV 146

Query: 1342 HVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIF 1518
            H RNM + DT  D  LQ+E+V+IL  S YW+RS GRDHVIPMHHPNAFR  R  +NASI 
Sbjct: 147  HGRNMTDPDTEADRLLQVEIVDILWKSKYWQRSAGRDHVIPMHHPNAFRFLRAMVNASIL 206

Query: 1519 IVADFGRIMN-ISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGII 1695
            IV+DFGR    +++L KDVVAPY H+V+SF+ +D PDP+++R TLLFFRGRTVRKDEG I
Sbjct: 207  IVSDFGRYTKELASLRKDVVAPYVHVVDSFLDDDPPDPFEARHTLLFFRGRTVRKDEGKI 266

Query: 1696 RAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHC 1875
            RA+L K+L G + + +E++ A+ +G K STE MRSSKFCLHPAGDTPSSCRLFDAIVSHC
Sbjct: 267  RAKLGKVLKGKEGVRFEDSIATGDGIKISTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHC 326

Query: 1876 VPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEIS 2055
            VPVI+S +IELPFE EIDY EFS+FFSV E LRP Y++N+LR++ K+KW++MW KLK +S
Sbjct: 327  VPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRPDYLLNQLRQIPKKKWVDMWSKLKNVS 386

Query: 2056 HHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWW 2190
            HH+EFQYPP+K DAVNMIW++V+HKIPAV LA+HR+RRLKI DWW
Sbjct: 387  HHYEFQYPPRKGDAVNMIWRQVRHKIPAVNLAIHRNRRLKIPDWW 431


>gb|EXC06151.1| putative glycosyltransferase [Morus notabilis]
          Length = 469

 Score =  478 bits (1230), Expect = e-132
 Identities = 241/409 (58%), Positives = 288/409 (70%), Gaps = 4/409 (0%)
 Frame = +1

Query: 979  DYKSQFLNFIPQLTNNAPPCRA-RSP-PLRVYMYDLPPRFNVGMMDPNFPDATPVTALNI 1152
            D +S F   +       P C    SP PLRV+MYDLP RFNVGM++    D  PVTA   
Sbjct: 65   DLRSYFFPLLQSPPGARPLCATIASPLPLRVFMYDLPRRFNVGMLNRRSSDQAPVTAQTW 124

Query: 1153 PSWRWNDGLRKQHSVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXX 1332
            P W  N GL++QHSVEYWM+ SLLY G   DG    RE VRV+DP  A+           
Sbjct: 125  PPWPKNSGLKRQHSVEYWMMGSLLYDG---DG----REVVRVSDPEMAEAFFVPFFSSLS 177

Query: 1333 XXXHVRNMAETDT-VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINA 1509
               H  NM +  T +D +LQ++++  L  S YWKR  GRDHVIPM HPNAFR  R ++NA
Sbjct: 178  FNTHGHNMTDPKTRIDHQLQIDLLEFLGESKYWKRYGGRDHVIPMTHPNAFRFLRAELNA 237

Query: 1510 SIFIVADFGRI-MNISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDE 1686
            SI IV DFGR    +S L KDVVAPY H+V+SF  +D  DPY+SR TLLFFRGRT RKDE
Sbjct: 238  SIQIVVDFGRHPRTMSNLGKDVVAPYVHVVDSFTDDDLSDPYESRTTLLFFRGRTFRKDE 297

Query: 1687 GIIRAQLHKILNGTKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIV 1866
            GI+R +L K+L G  D+ YE + A+ E  KAS+  MR SKFCLHPAGDTPSSCRLFDAIV
Sbjct: 298  GIVRVKLAKVLAGYDDVHYERSVATGENIKASSLGMRLSKFCLHPAGDTPSSCRLFDAIV 357

Query: 1867 SHCVPVIISDKIELPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLK 2046
            SHCVPVI+SD+IELPFE EIDY +FS+FFS  E L PGY+V +LR+  KEKW+ MW +LK
Sbjct: 358  SHCVPVIVSDQIELPFEDEIDYSQFSLFFSFKEALEPGYMVEQLRKFPKEKWVEMWRRLK 417

Query: 2047 EISHHFEFQYPPKKEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193
             ISHHFEFQYPP KEDAV+M+W++VKHK+P V LAVHRSRRLK+ DWW+
Sbjct: 418  NISHHFEFQYPPNKEDAVDMLWRQVKHKVPGVNLAVHRSRRLKVPDWWK 466


>ref|XP_003524893.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like isoform
            X1 [Glycine max]
          Length = 427

 Score =  478 bits (1229), Expect = e-132
 Identities = 232/396 (58%), Positives = 291/396 (73%), Gaps = 2/396 (0%)
 Frame = +1

Query: 1012 QLTNNAPPCRARSPPLRVYMYDLPPRFNVGMMDPNFPDATPVTALNIPSWRWNDGLRKQH 1191
            +L + AP   A  PPLRV+MYDLP RFNVGM+D       PVT  + P+W  N GL+KQH
Sbjct: 37   KLPSGAPAPCAPDPPLRVFMYDLPRRFNVGMIDRRSAAEMPVTVEDWPAWPVNWGLKKQH 96

Query: 1192 SVEYWMVASLLYGGGNDDGSASTREAVRVTDPHSADXXXXXXXXXXXXXXHVRNMAETDT 1371
            SVEYWM+ SLL  GG        RE VRV+DP  A               H   M +  T
Sbjct: 97   SVEYWMMGSLLNVGGG-------REVVRVSDPELAQAFFVPFFSSLSFNTHGHTMKDPAT 149

Query: 1372 -VDEKLQLEMVNILRASDYWKRSDGRDHVIPMHHPNAFRHYRDQINASIFIVADFGRI-M 1545
             +D +LQ++++ +L+ S+YW+RS GRDHV PM HPNAFR  RDQ+N SI +V DFGR   
Sbjct: 150  QIDRQLQVDLMELLKKSNYWQRSGGRDHVFPMTHPNAFRFLRDQLNESIQVVVDFGRYPR 209

Query: 1546 NISTLAKDVVAPYPHMVESFISEDSPDPYKSRKTLLFFRGRTVRKDEGIIRAQLHKILNG 1725
             +S L KDVV+PY H+V+SF  ++  DPY+SR TLLFFRGRT RKDEGI+R +L KIL G
Sbjct: 210  GMSNLNKDVVSPYVHVVDSFTDDEPQDPYESRSTLLFFRGRTYRKDEGIVRVKLAKILAG 269

Query: 1726 TKDIIYEEAHASEEGFKASTEHMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIISDKIE 1905
              D+ YE + A+EE  KAS++ MRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+SD+IE
Sbjct: 270  YDDVHYERSVATEENIKASSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDQIE 329

Query: 1906 LPFESEIDYKEFSIFFSVNETLRPGYIVNELRRVSKEKWINMWLKLKEISHHFEFQYPPK 2085
            LPFE EIDY +FS+FFS  E L+PGY++++LR+  KEKW  MW +LK ISHH+EF+YPPK
Sbjct: 330  LPFEDEIDYSQFSVFFSFKEALQPGYMIDQLRKFPKEKWTEMWRQLKSISHHYEFRYPPK 389

Query: 2086 KEDAVNMIWKEVKHKIPAVKLAVHRSRRLKIADWWR 2193
            +EDAV+M+W++VKHK+P VKL+VHR+RRLKI DWW+
Sbjct: 390  REDAVDMLWRQVKHKLPGVKLSVHRNRRLKIPDWWQ 425


Top