BLASTX nr result

ID: Paeonia22_contig00003537 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00003537
         (1981 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268...   572   e-160
ref|XP_007204617.1| hypothetical protein PRUPE_ppa002387mg [Prun...   566   e-158
ref|XP_006451253.1| hypothetical protein CICLE_v10007651mg [Citr...   556   e-155
emb|CBI28020.3| unnamed protein product [Vitis vinifera]              541   e-151
ref|XP_002324438.2| hypothetical protein POPTR_0018s09250g [Popu...   540   e-151
ref|XP_004287457.1| PREDICTED: probable glycosyltransferase At3g...   535   e-149
ref|XP_006353481.1| PREDICTED: probable glycosyltransferase At5g...   528   e-147
gb|EXB59796.1| putative glycosyltransferase [Morus notabilis]         525   e-146
ref|XP_007013073.1| Exostosin family protein [Theobroma cacao] g...   525   e-146
ref|XP_002514248.1| catalytic, putative [Ricinus communis] gi|22...   518   e-144
ref|XP_004251626.1| PREDICTED: probable glycosyltransferase At5g...   514   e-143
emb|CAN76867.1| hypothetical protein VITISV_012309 [Vitis vinifera]   514   e-143
ref|XP_004148727.1| PREDICTED: probable glycosyltransferase At5g...   495   e-137
gb|EYU27286.1| hypothetical protein MIMGU_mgv1a002540mg [Mimulus...   491   e-136
ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]...   469   e-129
ref|XP_006400529.1| hypothetical protein EUTSA_v10013011mg [Eutr...   460   e-127
ref|XP_003524401.1| PREDICTED: probable glycosyltransferase At5g...   460   e-127
ref|XP_007160303.1| hypothetical protein PHAVU_002G310300g [Phas...   458   e-126
ref|XP_006287301.1| hypothetical protein CARUB_v10000494mg [Caps...   451   e-124
ref|XP_002871898.1| exostosin family protein [Arabidopsis lyrata...   443   e-121

>ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268163 [Vitis vinifera]
          Length = 738

 Score =  572 bits (1473), Expect = e-160
 Identities = 315/576 (54%), Positives = 384/576 (66%), Gaps = 68/576 (11%)
 Frame = +1

Query: 457  ISFQRLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRFPVRP 636
            + FQ+   +E R+++F+VG+VAIT+LLCQSL+LPYG AL SL+ + +VP  +    P R 
Sbjct: 5    LKFQKFCLVETRRWIFMVGLVAITYLLCQSLLLPYGNALLSLLPDRDVPIYDNFSSPTRQ 64

Query: 637  SQ----QFGNSIPMNASDFTNTTLFVELVKVANSSNGGEEIG------------------ 750
            S         S+  NASD T+T+LFVE+V+    SN   E G                  
Sbjct: 65   SSVRSFMVNKSLLSNASDLTDTSLFVEVVEDVEKSNVTVEFGDDNGTEGTDEDIEDGLAL 124

Query: 751  --------IEFR-----PKGMDSRQSDFAPEVRNMDNAFQLVENRNGVNGMQSEKIEDVD 891
                    +EF      PK       +FA E + MD+  +  ++ N   G+  +K+ D+D
Sbjct: 125  EREDLENIVEFNEDDNGPKEKGGDTENFASESKGMDHVVEFTKDNNISKGLPFKKVVDMD 184

Query: 892  --------EKQDNSFILERASDARHSLSLEQTVKPNHE-ISADNHLEEDVSRVSSELESR 1044
                      Q+NS  L++ S+ RH  S    VKP +E IS DN ++ D S   S   S 
Sbjct: 185  GISALEYVNNQENSSDLKKDSEMRHIGSAVHIVKPPNEGISTDNIVKADASLTPSTPGSL 244

Query: 1045 DSAFQSSLLGSSPPS--YNITYLKNSTSDASSA----------------------VNSAV 1152
             + F+S LL S      +N TY++   S+ +++                       N  V
Sbjct: 245  GTTFKSHLLASPGVDSLFNTTYIEKMASNGNASNHLTATDISSVGKPEKEILSKDENLLV 304

Query: 1153 LRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWSSIR 1332
            L+SDL              RKKM+SE+PPKSVTSI DMN  LVRHRASSR++RPRW+S R
Sbjct: 305  LQSDLADLNNNSAMTSNPGRKKMQSEMPPKSVTSIYDMNRRLVRHRASSRAMRPRWASPR 364

Query: 1333 DKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPIFHQ 1512
            D+E+LAAK QI NAP VKND EL+APLFRNVSMFKRSYELMER+LK+Y+YK+GEKPIFHQ
Sbjct: 365  DQEMLAAKLQIQNAPRVKNDPELHAPLFRNVSMFKRSYELMERILKVYVYKDGEKPIFHQ 424

Query: 1513 PILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLRQ 1692
            PILKGLYASEGWFMKLME NK +VVKDPR+A LFYMPFSSRMLEY LYVRNSHNRTNLRQ
Sbjct: 425  PILKGLYASEGWFMKLMERNKHFVVKDPRQAQLFYMPFSSRMLEYKLYVRNSHNRTNLRQ 484

Query: 1693 HLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGFKIG 1872
            +LK+Y+EKI+AKY F+NRT GADHFLVACHDWAPYETRHHME+CIKALCNADVT+GFKIG
Sbjct: 485  YLKQYSEKIAAKYRFWNRTGGADHFLVACHDWAPYETRHHMEQCIKALCNADVTAGFKIG 544

Query: 1873 RDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            RDVSLPET VRSARNPLRDLGGKPPSERHILAF+AG
Sbjct: 545  RDVSLPETYVRSARNPLRDLGGKPPSERHILAFYAG 580


>ref|XP_007204617.1| hypothetical protein PRUPE_ppa002387mg [Prunus persica]
            gi|462400148|gb|EMJ05816.1| hypothetical protein
            PRUPE_ppa002387mg [Prunus persica]
          Length = 678

 Score =  566 bits (1459), Expect = e-158
 Identities = 302/528 (57%), Positives = 383/528 (72%), Gaps = 18/528 (3%)
 Frame = +1

Query: 451  MEISFQ--RLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVP--FREGN 618
            M+ SFQ  ++  +E  ++LF++G++A+T++  QSL+LPYG AL+SL+ +NEV   F+   
Sbjct: 1    MKYSFQFPKICHVETGRWLFLLGVLAVTYVSFQSLLLPYGNALRSLLPQNEVQEQFKGSG 60

Query: 619  RFPVRPSQQ---FGNSIPMNAS-DFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMD-SR 783
             F +  S +     N + +++S DF + ++F  + K A +S  G EIG +   KG D  +
Sbjct: 61   VFSIHSSAKSVMVRNPLTVHSSSDFIDVSMFSGVEKAAGNSGLGGEIGHDRGRKGKDVHK 120

Query: 784  QSDFAPEVRNMDNAFQLVENRNGVNGMQSEKIEDVD--------EKQDNSFILERASDAR 939
            + D   E + +DN F    +RN  +   SE + D +        E Q+N  + ++A+ A+
Sbjct: 121  EIDLILEEKGIDNTFANTIHRNVDHNFPSENVVDTNGSLALVSIENQENGSVQDKANVAK 180

Query: 940  HSLSLEQTVKPNHEISADNHLEEDVSRVSSELESRDSAFQSSLLGSSPPSYNITYLKNST 1119
            +   LE+ V PN+E S +N L+E+ +  + + +   + F SS L        I     S 
Sbjct: 181  YGFPLERIVLPNYETSTENTLKENSNLTAKKSDGVKTGFPSSPL--------ILPAAASL 232

Query: 1120 SDASSA-VNSAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRAS 1296
            ++A++A V S   +SD+              RKKM+SELPPKS+TSI +MNHILVRHRAS
Sbjct: 233  ANATNASVGSTSFKSDVVTSKNGSVVMTNPGRKKMKSELPPKSITSIYEMNHILVRHRAS 292

Query: 1297 SRSVRPRWSSIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIY 1476
            SRS+RPRWSS+RD++ILA KSQI + P+  ND+ELYAPLFRNVSMFKRSYELMER LKIY
Sbjct: 293  SRSLRPRWSSVRDQDILAVKSQIEHPPVAINDRELYAPLFRNVSMFKRSYELMERTLKIY 352

Query: 1477 IYKEGEKPIFHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLY 1656
            IYK+G KPIFHQPILKGLYASEGWFMKLM+G KR+VVKDPRKAHLFYMPFSSRMLEYSLY
Sbjct: 353  IYKDGNKPIFHQPILKGLYASEGWFMKLMQGYKRFVVKDPRKAHLFYMPFSSRMLEYSLY 412

Query: 1657 VRNSHNRTNLRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKAL 1836
            VRNSHNRTNLRQ LKEY+EKI+AKYP++NRT GADHFLVACHDWAPYETRHHMERC+KAL
Sbjct: 413  VRNSHNRTNLRQFLKEYSEKIAAKYPYWNRTGGADHFLVACHDWAPYETRHHMERCMKAL 472

Query: 1837 CNADVTSGFKIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            CNADVT GFKIGRDVSLPET VRSARNPLRDLGGKPPS+R ILAF+AG
Sbjct: 473  CNADVTGGFKIGRDVSLPETYVRSARNPLRDLGGKPPSQRQILAFYAG 520


>ref|XP_006451253.1| hypothetical protein CICLE_v10007651mg [Citrus clementina]
            gi|568883066|ref|XP_006494321.1| PREDICTED: probable
            glycosyltransferase At5g03795-like [Citrus sinensis]
            gi|557554479|gb|ESR64493.1| hypothetical protein
            CICLE_v10007651mg [Citrus clementina]
          Length = 677

 Score =  556 bits (1434), Expect = e-155
 Identities = 288/519 (55%), Positives = 372/519 (71%), Gaps = 13/519 (2%)
 Frame = +1

Query: 463  FQRLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRFPV---- 630
            F ++F+++ R++LFVV +VA+THLL QSL+LPYG AL+SLM ++EV   + +  P     
Sbjct: 7    FLKVFRVQTRRWLFVVLVVAVTHLLFQSLLLPYGKALRSLMPDSEVGVHDESGLPALKSF 66

Query: 631  RPSQQFGNSIPMNASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDSRQSD-FAPEV 807
              S    N + +NASD  + ++F   ++    S  G + G +   + +D   ++    E 
Sbjct: 67   SKSVMVRNPLTVNASDLMSDSVFKGSLEDDEDSKFGSDTGDDSGLREVDGDTNNGIVSEG 126

Query: 808  RNMDNAFQLVENRNGVNGMQSEKIEDVDEKQD--------NSFILERASDARHSLSLEQT 963
            +  DN  +LV +R   +   +E ++D+++  +        NS  +E A +A+ SL L+Q 
Sbjct: 127  KGQDNPIELVTDREVDDDSVAENVKDLNDLSELEIERIGENSATVEPAGEAKQSLPLKQI 186

Query: 964  VKPNHEISADNHLEEDVSRVSSELESRDSAFQSSLLGSSPPSYNITYLKNSTSDASSAVN 1143
            V+PN EI +D   E+  S+  + +    +      L    P  NIT+LK   S+ASSA  
Sbjct: 187  VQPNLEIVSDGVPEQHTSQSIANIGGEKT------LSIVSPLTNITHLKTEESNASSAAR 240

Query: 1144 SAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWS 1323
            SAV +SD+              +KKMR  +PPK+VTSI +MN IL+RH  SSR++RPRWS
Sbjct: 241  SAVPKSDIATSVNISALIGSPGKKKMRCNMPPKTVTSIFEMNDILMRHHRSSRAMRPRWS 300

Query: 1324 SIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPI 1503
            S+RDKE+LAAK++I  A +  +DQEL+APLFRNVSMFKRSYELM+R LK+Y+Y++G+KPI
Sbjct: 301  SVRDKEVLAAKTEIEKASVSVSDQELHAPLFRNVSMFKRSYELMDRTLKVYVYRDGKKPI 360

Query: 1504 FHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTN 1683
            FHQPILKGLYASEGWFMKLMEGNK + VKDPRKAHLFYMPFSSRMLEY+LYVRNSHNRTN
Sbjct: 361  FHQPILKGLYASEGWFMKLMEGNKHFAVKDPRKAHLFYMPFSSRMLEYALYVRNSHNRTN 420

Query: 1684 LRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGF 1863
            LRQ+LKEY E I+AKY ++NRT GADHFLVACHDWAPYETRHHME CIKALCNADVT+GF
Sbjct: 421  LRQYLKEYAESIAAKYRYWNRTGGADHFLVACHDWAPYETRHHMEHCIKALCNADVTAGF 480

Query: 1864 KIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            K+GRDVSLPET VRSARNPLRDLGGKPPS+RHILAF+AG
Sbjct: 481  KLGRDVSLPETYVRSARNPLRDLGGKPPSQRHILAFYAG 519


>emb|CBI28020.3| unnamed protein product [Vitis vinifera]
          Length = 665

 Score =  541 bits (1395), Expect = e-151
 Identities = 302/532 (56%), Positives = 357/532 (67%), Gaps = 44/532 (8%)
 Frame = +1

Query: 517  VAITHLLCQSLILPYGTALKSLMAENEVPFREGNRFPVRPSQ----QFGNSIPMNASDFT 684
            +AIT+LLCQSL+LPYG AL SL+ + +VP  +    P R S         S+  NASD T
Sbjct: 57   LAITYLLCQSLLLPYGNALLSLLPDRDVPIYDNFSSPTRQSSVRSFMVNKSLLSNASDLT 116

Query: 685  NTTLFVELVKVANSSNGGEEIG--------------------------IEFR-----PKG 771
            +T+LFVE+V+    SN   E G                          +EF      PK 
Sbjct: 117  DTSLFVEVVEDVEKSNVTVEFGDDNGTEGTDEDIEDGLALEREDLENIVEFNEDDNGPKE 176

Query: 772  MDSRQSDFAPEVRNMDNAFQLVENRNGVNGMQSEKIEDVD--------EKQDNSFILERA 927
                  +FA E + MD+  +  ++ N   G+  +K+ D+D          Q+NS  L++ 
Sbjct: 177  KGGDTENFASESKGMDHVVEFTKDNNISKGLPFKKVVDMDGISALEYVNNQENSSDLKKD 236

Query: 928  SDARHSLSLEQTVKPNHE-ISADNHLEEDVSRVSSELESRDSAFQSSLLGSSPPSYNITY 1104
            S+ RH  S    VKP +E IS DN ++ D                +SL  S+P S     
Sbjct: 237  SEMRHIGSAVHIVKPPNEGISTDNIVKAD----------------ASLTPSTPGSLEKEI 280

Query: 1105 LKNSTSDASSAVNSAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVR 1284
            L       S   N  VL+SDL              RKKM+SE+PPKSVTSI DMN  LVR
Sbjct: 281  L-------SKDENLLVLQSDLADLNNNSAMTSNPGRKKMQSEMPPKSVTSIYDMNRRLVR 333

Query: 1285 HRASSRSVRPRWSSIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERM 1464
            HRASSR++RPRW+S RD+E+LAAK QI NAP VKND EL+APLFRNVSMFKRSYELMER+
Sbjct: 334  HRASSRAMRPRWASPRDQEMLAAKLQIQNAPRVKNDPELHAPLFRNVSMFKRSYELMERI 393

Query: 1465 LKIYIYKEGEKPIFHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLE 1644
            LK+Y+YK+GEKPIFHQPILKGLYASEGWFMKLME NK +VVKDPR+A LFYMPFSSRMLE
Sbjct: 394  LKVYVYKDGEKPIFHQPILKGLYASEGWFMKLMERNKHFVVKDPRQAQLFYMPFSSRMLE 453

Query: 1645 YSLYVRNSHNRTNLRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERC 1824
            Y LYVRNSHNRTNLRQ+LK+Y+EKI+AKY F+NRT GADHFLVACHDWAPYETRHHME+C
Sbjct: 454  YKLYVRNSHNRTNLRQYLKQYSEKIAAKYRFWNRTGGADHFLVACHDWAPYETRHHMEQC 513

Query: 1825 IKALCNADVTSGFKIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            IKALCNADVT+GFKIGRDVSLPET VRSARNPLRDLGGKPPSERHILAF+AG
Sbjct: 514  IKALCNADVTAGFKIGRDVSLPETYVRSARNPLRDLGGKPPSERHILAFYAG 565


>ref|XP_002324438.2| hypothetical protein POPTR_0018s09250g [Populus trichocarpa]
            gi|550318376|gb|EEF03003.2| hypothetical protein
            POPTR_0018s09250g [Populus trichocarpa]
          Length = 682

 Score =  540 bits (1391), Expect = e-151
 Identities = 296/530 (55%), Positives = 371/530 (70%), Gaps = 20/530 (3%)
 Frame = +1

Query: 451  MEISFQ--RLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRF 624
            ME+ FQ  + FQ   R++L V+G+VA+TH L Q L+LPYG AL+SL         + + F
Sbjct: 1    MELCFQLPKFFQNVNRRWLLVLGVVAVTHTLFQFLLLPYGNALRSLFPNVNDSMYDKSSF 60

Query: 625  PV----RPSQQFGNSIPMNASDFTNTTLFVELVKVANSSNGGEEIGIEF-RPKGMDSRQS 789
             V    + S      + ++ S   N   F  +++ A+ SNGG E G +    K  +    
Sbjct: 61   AVIQSSKKSVMVRYPLTVDKSSLNNYFKFDGVLENADDSNGGVEEGHDDGTKKNTEDTDH 120

Query: 790  DFAPEVRNM---DNAFQLVENRNGVNGMQSEKIEDVDEK--------QDNSFILERASDA 936
            DF+ E  +M   D+  QL  +R+  +   SE ++D  E         ++++ +L+ A++A
Sbjct: 121  DFSSEEGDMEVLDDVIQLEVDRDLEDDFPSEDVKDRHETFASGGVKTEESNPVLKLANEA 180

Query: 937  RHSLSLEQTVKPNHEISADNHLEEDVSRVSSELESRDSAF--QSSLLGSSPPSYNITYLK 1110
            R +L LE+ VK +H+I  DN L+++ S+   E E  +S     S  + SS  +   TYLK
Sbjct: 181  RFNLPLERNVKSDHDIPTDNVLQQNKSQAHKEFEHVNSTLPVDSQAVASSTKA---TYLK 237

Query: 1111 NSTSDASSAVNSAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHR 1290
               S+ SS++  A L+SD               +KKMR E+PPKSVT ID+MN ILVRHR
Sbjct: 238  ---SNGSSSIGPAALKSDSAAAKNYSVVLAKPGKKKMRCEMPPKSVTLIDEMNSILVRHR 294

Query: 1291 ASSRSVRPRWSSIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLK 1470
             SSRS+RPRWSS RD+EILAA+SQI +AP V +D++LYAPLFRNVS FKRSYELMER LK
Sbjct: 295  RSSRSMRPRWSSARDQEILAARSQIESAPAVVHDRDLYAPLFRNVSKFKRSYELMERTLK 354

Query: 1471 IYIYKEGEKPIFHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYS 1650
            IYIYK+G+KPIFH PILKGLYASEGWFMKLM+GNK +VVKDPRKAHLFYMPFSSRMLEY+
Sbjct: 355  IYIYKDGKKPIFHLPILKGLYASEGWFMKLMQGNKHFVVKDPRKAHLFYMPFSSRMLEYT 414

Query: 1651 LYVRNSHNRTNLRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIK 1830
            LYVRNSHNRTNLR ++K Y E I+AKY F+NRT GADHFLVACHDWAPYETRHHME CIK
Sbjct: 415  LYVRNSHNRTNLRLYMKRYAESIAAKYSFWNRTGGADHFLVACHDWAPYETRHHMEHCIK 474

Query: 1831 ALCNADVTSGFKIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            ALCNADVT+GFKIGRDVS PET VRSARNPLRDLGGKPPS+R+ILAF+AG
Sbjct: 475  ALCNADVTAGFKIGRDVSFPETYVRSARNPLRDLGGKPPSQRNILAFYAG 524


>ref|XP_004287457.1| PREDICTED: probable glycosyltransferase At3g07620-like [Fragaria
            vesca subsp. vesca]
          Length = 686

 Score =  535 bits (1379), Expect = e-149
 Identities = 286/526 (54%), Positives = 357/526 (67%), Gaps = 18/526 (3%)
 Frame = +1

Query: 457  ISFQRLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRFPV-- 630
            + F +L  IE R+ L V+G+VA+T+L+ Q L+LPY  AL+SL+  ++VP      F    
Sbjct: 5    VQFLKLCHIETRRRLLVLGVVAVTYLMFQWLLLPYENALQSLLPRSQVPDHATGSFLTIH 64

Query: 631  --RPSQQFGNSIPMNASDFTNTTLFVELVKVA-NSSNGGEEIGIEFRPKGMDSRQSDFAP 801
                S    N + +N+SD  +   F  + K A NSS GGE +      +    ++ D   
Sbjct: 65   SSAKSVMVRNPLTVNSSDLIDAPRFGGVEKYADNSSLGGETVDKSEPNEKEGFKEIDSVL 124

Query: 802  EVRNMDNAFQLVENRNGVNGMQSEKIEDVD--------EKQDNSFILERASDARHSLSLE 957
            E + MDN F+   +RN      S    D D         K++N   L + ++A +    E
Sbjct: 125  EEKEMDNTFEHAADRNVDENFPSGNGVDTDASLTLVSISKEENGSNLVKTNEASYDFP-E 183

Query: 958  QTVKPNHEISADNHLEEDVSRVSSELESRDSAFQSSLL-----GSSPPSYNITYLKNSTS 1122
             TV    E+S +N LE +++  +   E   + F SS L      S     ++TY+    S
Sbjct: 184  PTVLSKDEVSTENTLEVNMTMAAKHSEGVKTIFPSSPLILPATASFTHQTDVTYVSYLVS 243

Query: 1123 DASSAVNSAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSR 1302
            +ASS+V SA L SD+              +K M+  +PPKS+TSID+MN  LVRH A  R
Sbjct: 244  NASSSVGSAFLESDIVTIKNDSLTRTSPGKKMMKCNMPPKSITSIDEMNLTLVRHHAKPR 303

Query: 1303 SVRPRWSSIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIY 1482
            ++RPRWSS+RD++ILA KSQI + P+ KND+ELYAPL+RNVSMFKRSYELMER LK+YIY
Sbjct: 304  ALRPRWSSVRDQDILAVKSQIQHPPVAKNDRELYAPLYRNVSMFKRSYELMERTLKVYIY 363

Query: 1483 KEGEKPIFHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVR 1662
            KEG KPIFHQPI+KGLYASEGWFMKLMEG+KR+VVKDPRKAHLFYMPFSSRMLE++LYVR
Sbjct: 364  KEGNKPIFHQPIMKGLYASEGWFMKLMEGDKRFVVKDPRKAHLFYMPFSSRMLEFTLYVR 423

Query: 1663 NSHNRTNLRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCN 1842
            NSHNRT LRQ+LKEY+E I+AKYPF+NRT GADHFLVACHDWAPYETRHHMERCIKALCN
Sbjct: 424  NSHNRTKLRQYLKEYSETIAAKYPFWNRTGGADHFLVACHDWAPYETRHHMERCIKALCN 483

Query: 1843 ADVTSGFKIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            ADVT GFKIGRD+SLPET VRSARNPLRDLGGK  SER +L F+AG
Sbjct: 484  ADVTQGFKIGRDISLPETYVRSARNPLRDLGGKRASERQVLTFYAG 529


>ref|XP_006353481.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
            [Solanum tuberosum] gi|565373856|ref|XP_006353482.1|
            PREDICTED: probable glycosyltransferase At5g03795-like
            isoform X2 [Solanum tuberosum]
          Length = 674

 Score =  528 bits (1361), Expect = e-147
 Identities = 278/519 (53%), Positives = 363/519 (69%), Gaps = 13/519 (2%)
 Frame = +1

Query: 463  FQRLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFRE-----GNRFP 627
            FQ + QI+ RK++ VV +VA+THL CQ+L+LPYG AL SL++E+ +   E          
Sbjct: 7    FQSVCQIDKRKWILVVVLVAVTHLFCQTLMLPYGNALHSLLSESNIQLPEKVSLSSKESS 66

Query: 628  VRPSQQFGNSIPMNASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDS-RQSDFAPE 804
            V  S + G S     S F +  +    +K  ++ +  E+  I+      D  +       
Sbjct: 67   VVESTKVGESFSGTLSSFDDVHMLAHRLKTVDNGDVSEDGEIDESVNEKDEVKPHSNHSV 126

Query: 805  VRNMDNAFQLVENRNGVNGMQSEKIEDVDEKQDNSFILERASDARHSLSLEQTVKPNHEI 984
            V+ M+N    VE+    N    +++ D+DE+       ++ +++R  LSLEQ VK N E+
Sbjct: 127  VKTMENDSDFVEDAILENDNLFDEVVDMDEETTT----QKNNESRRDLSLEQVVKTNGEL 182

Query: 985  SADNHLEEDVSRVSSELESRDSAFQSSLLGSSPPSY-------NITYLKNSTSDASSAVN 1143
            SAD+ L+ + + V ++ ++      SS++ S+            I +++ +++++S+   
Sbjct: 183  SADSELDANRNSVLNDTKAASVTNSSSVVASNQLDNLPLVTIGEINFIRTTSNNSSTGDL 242

Query: 1144 SAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWS 1323
            + +L +                +KKMR  LPPK+VTSI  M  +LVRHRA SR++RPRWS
Sbjct: 243  TQLLPNH-----GNHSLVQSTVKKKMRCMLPPKTVTSISQMERLLVRHRARSRAMRPRWS 297

Query: 1324 SIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPI 1503
            S RDKEILAA+ QI NAP+++ND+ELYAP FRN+SMFKRSYELMER+LK+Y+YKEGEKPI
Sbjct: 298  SERDKEILAARLQIENAPLLRNDRELYAPAFRNMSMFKRSYELMERILKVYVYKEGEKPI 357

Query: 1504 FHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTN 1683
            FHQPI+KGLYASEGWFMKLMEGN R+VVKDPRKAHLFY+PFSSRMLE+SLYV NSHNRTN
Sbjct: 358  FHQPIMKGLYASEGWFMKLMEGNNRFVVKDPRKAHLFYLPFSSRMLEHSLYVHNSHNRTN 417

Query: 1684 LRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGF 1863
            LRQ+LK+Y+EKI+AKY F+NRT GADHFLVACHDWAPYETRHHME CIKALCNADVT GF
Sbjct: 418  LRQYLKDYSEKIAAKYRFWNRTGGADHFLVACHDWAPYETRHHMEHCIKALCNADVTLGF 477

Query: 1864 KIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            KIGRDVSLPET VRSARNPLRDLGGKPPS+R +LAF+AG
Sbjct: 478  KIGRDVSLPETYVRSARNPLRDLGGKPPSQRKVLAFYAG 516


>gb|EXB59796.1| putative glycosyltransferase [Morus notabilis]
          Length = 669

 Score =  525 bits (1353), Expect = e-146
 Identities = 292/522 (55%), Positives = 366/522 (70%), Gaps = 16/522 (3%)
 Frame = +1

Query: 463  FQRLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRFP--VRP 636
            F +L ++  R ++ VV +VA+THLL QSL+LPYG AL+SL+ E + P R+ N      R 
Sbjct: 7    FHKLGRVRAR-WVLVVLLVAVTHLLFQSLLLPYGKALRSLLPEKDDP-RDVNYAARTARI 64

Query: 637  SQQFG---NSIPMNASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDSRQSDFA--- 798
            S ++    N + +NAS+  +T+   +L       + G ++G +   +G D R  +F    
Sbjct: 65   STKYAVVRNPLTVNASELIDTSTSDDL-------DDGGDLGSDTGGEG-DDRFEEFGFTL 116

Query: 799  PEVRNMDNAFQLVENRNGVNGMQS-EKIEDVD----EKQDNSFILERASDARHSLSLEQT 963
             E + +    Q + +R   + + S +K E +     + ++N F+L +AS  R    L+QT
Sbjct: 117  DEEKGLHRTSQDLVDRYVDDTLNSADKPESLALISMKNEENDFVLSKASKDRRGFPLDQT 176

Query: 964  -VKPNHEISADNHLEEDVS-RVSSELESRDSAFQSSLLGSSPPSY-NITYLKNSTSDASS 1134
             V+PN E+S +N   E++  R+       DS FQ S L SS  +  N ++   STS  S 
Sbjct: 177  AVEPNIEMSTENIRTENIDLRLKKSDGGLDSPFQPSPLASSADALVNASFSTTSTSSVSE 236

Query: 1135 AVNSAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRP 1314
                   +S L               KKMR  +PPKS+T+  +MN ILVRHRA SRS+RP
Sbjct: 237  -------QSGLLITNNHSAIATTPGVKKMRCNMPPKSITTFQEMNQILVRHRAKSRSLRP 289

Query: 1315 RWSSIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGE 1494
            RWSS+RDKEILA K QI NAP+  NDQELYAPLFRNVSMFKRSYELMER LK+Y+YK+G+
Sbjct: 290  RWSSVRDKEILAMKPQIENAPLAMNDQELYAPLFRNVSMFKRSYELMERTLKVYVYKDGD 349

Query: 1495 KPIFHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHN 1674
            KPIFHQPI+KGLYASEGWFMKLME N+RYVVKDPR+AHLFYMPFSSRMLE+ LYVRNSHN
Sbjct: 350  KPIFHQPIMKGLYASEGWFMKLMERNRRYVVKDPRRAHLFYMPFSSRMLEHVLYVRNSHN 409

Query: 1675 RTNLRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVT 1854
            RTNLRQ+LKEY+EK++AKYP++NRT GADHFLVACHDWAPYETRHHMERC+KALCNADVT
Sbjct: 410  RTNLRQYLKEYSEKLAAKYPYWNRTGGADHFLVACHDWAPYETRHHMERCMKALCNADVT 469

Query: 1855 SGFKIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            SGFKIGRDVS PET VRSARNPLRDLGGKPPS RH+LAF+AG
Sbjct: 470  SGFKIGRDVSFPETYVRSARNPLRDLGGKPPSRRHVLAFYAG 511


>ref|XP_007013073.1| Exostosin family protein [Theobroma cacao]
            gi|508783436|gb|EOY30692.1| Exostosin family protein
            [Theobroma cacao]
          Length = 736

 Score =  525 bits (1353), Expect = e-146
 Identities = 301/573 (52%), Positives = 363/573 (63%), Gaps = 67/573 (11%)
 Frame = +1

Query: 463  FQRLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNR--FPVRP 636
            F++LF  E ++++ +VG+VAITHLL QS +LPYG AL+SL+  +E          F +  
Sbjct: 7    FKKLFHSENKRWVLLVGVVAITHLLFQSFLLPYGNALRSLLPGDEGSIANDKDVIFGILS 66

Query: 637  SQQFG---NSIPMNASDFTNTTLFVE-LVKVANSSNGGEEIGIEFRPKGMDSRQSD--FA 798
            S       N + +NASD +   + +  ++K  NSSN G   G      G D R+ +  FA
Sbjct: 67   SVNSAMVRNPLTINASDTSTRNVVINGVLKDGNSSNVGGSAGNGGGLMG-DRREMENGFA 125

Query: 799  PEVRNMDNAFQLVENRNGVNGMQSEKIEDVDE---------KQDNS-------------- 909
             E    D   ++  +RN  +   SE  ED++E          QDNS              
Sbjct: 126  SEGMESDTRIKIAIDRNIDDDYASENAEDLNEISVLDDIIRDQDNSPLEEVVEPGQLVSA 185

Query: 910  ----------------------------------FILERASDARHSLSLEQTVKPNHEIS 987
                                                +E   +A H  +LE  VK   E+S
Sbjct: 186  DKLLENDASQTPKEFGHVNTSSQTPTLASPVVSSLAMESTDEAGHGFTLETVVKHAQEVS 245

Query: 988  ADNHLEEDVSRVSSELESRDSAFQSSLLGSSPPS--YNITYLKNSTSDASSAVNSAVLRS 1161
                LE   S+   EL   + A  S  L S   S   N TYL+NST +A S   S  L S
Sbjct: 246  TSKLLETRTSQSPKELGHVNIASPSPTLASPVVSSLVNKTYLRNSTKNADSLGFSTSLLS 305

Query: 1162 DLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWSSIRDKE 1341
            +               RKK+R E+PPKSVT+I++MN ILV HR SSR++RPR SS+RD+E
Sbjct: 306  NHLTSKNNSAMIAKPGRKKVRCEMPPKSVTTIEEMNRILVWHRRSSRAMRPRRSSVRDQE 365

Query: 1342 ILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPIFHQPIL 1521
              AA+SQI +AP++ NDQELYAPLFRNVSMFKRSYELMER LK+Y+YK G+KPIFH PIL
Sbjct: 366  TFAARSQIESAPVIVNDQELYAPLFRNVSMFKRSYELMERTLKVYVYKNGKKPIFHLPIL 425

Query: 1522 KGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLRQHLK 1701
            KGLYASEGWFMKLM+GNKR+VVKDPR+AHLFYMPFSSRMLEY+LYVRNSHNRTNLRQ LK
Sbjct: 426  KGLYASEGWFMKLMQGNKRFVVKDPRRAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLK 485

Query: 1702 EYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGFKIGRDV 1881
            +YTE I+AKYP+FNRT GADHFLVACHDWAPYETRHHME CIKALCNADVT GFKIGRDV
Sbjct: 486  DYTENIAAKYPYFNRTGGADHFLVACHDWAPYETRHHMEHCIKALCNADVTVGFKIGRDV 545

Query: 1882 SLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            SLPET VRSARNPLRDLGGKPPS+RHILAF+AG
Sbjct: 546  SLPETYVRSARNPLRDLGGKPPSQRHILAFYAG 578


>ref|XP_002514248.1| catalytic, putative [Ricinus communis] gi|223546704|gb|EEF48202.1|
            catalytic, putative [Ricinus communis]
          Length = 676

 Score =  518 bits (1334), Expect = e-144
 Identities = 284/538 (52%), Positives = 362/538 (67%), Gaps = 28/538 (5%)
 Frame = +1

Query: 451  MEISFQ--RLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRF 624
            ME+ FQ  +L QIE RK+L VVG VA+TH+L Q L+LPYG AL+SL+  +  P  + + F
Sbjct: 1    MELRFQFHKLCQIETRKWLLVVGAVAVTHILFQFLLLPYGNALRSLLPNSSDPIYDKSSF 60

Query: 625  PVRPSQ----QFGNSIPMNASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDSRQSD 792
            P+  S        N + ++ S  +  ++   LVK A    G  ++      +  +   + 
Sbjct: 61   PIIQSSTKSVMVRNPLTVDTSSLSKDSM---LVKDAGLVGGSGDL-----KRNREDTVNG 112

Query: 793  FAPEVRNMDNAFQLVENRNGVNGMQSEKIEDVDEKQDNSFILERASDA----RHSLSLEQ 960
            F  +   +DN  +L  + +G    +    ED+D   +  F+++R  D      +  S  Q
Sbjct: 113  FVSDDEELDNPIELAVDNDGFVSDE----EDLDNTIE--FVVDRNVDDDFPDSNGTSTLQ 166

Query: 961  TVKPNHEISAD----NHLEEDVSRVSSELESRDSAFQSSLLG------SSPPSY------ 1092
             +K    IS+        E D   + S + S D+      LG       SPP+       
Sbjct: 167  IIKIQESISSSLESITEAERDNEILISNIVSGDTTLPQKELGHANISFKSPPAVAQALAL 226

Query: 1093 --NITYLKNSTSDASSAVNSAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDM 1266
              N+T L++S    +S++ SA+L++                +KKMR ++PPKS+T I +M
Sbjct: 227  PINVTNLRSS---GNSSLGSAILKNSFATSKNVSAKPV---KKKMRCDMPPKSITLIHEM 280

Query: 1267 NHILVRHRASSRSVRPRWSSIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSY 1446
            N ILVRHR SSR+ RPRWSS RD+EILAA+ QI NAP   NDQ+LYAPLFRN+S FKRSY
Sbjct: 281  NQILVRHRRSSRATRPRWSSQRDREILAARMQIENAPHAVNDQDLYAPLFRNISKFKRSY 340

Query: 1447 ELMERMLKIYIYKEGEKPIFHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPF 1626
            ELMER LK+YIYK+G+KPIFH PI+KGLYASEGWFMKLM+GNK ++VKDPR+AHLFYMPF
Sbjct: 341  ELMERTLKVYIYKDGKKPIFHLPIMKGLYASEGWFMKLMQGNKHFLVKDPRRAHLFYMPF 400

Query: 1627 SSRMLEYSLYVRNSHNRTNLRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETR 1806
            SSRMLEY+LYVRNSHNRTNLRQ+LK+Y+EKI+AKYPF+NRT GADHFLVACHDWAPYETR
Sbjct: 401  SSRMLEYTLYVRNSHNRTNLRQYLKDYSEKIAAKYPFWNRTDGADHFLVACHDWAPYETR 460

Query: 1807 HHMERCIKALCNADVTSGFKIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            HHME CIKALCNADVT+GFKIGRD+SLPET VRSARNPLRDLGGKPPS+RHILAF+AG
Sbjct: 461  HHMEHCIKALCNADVTAGFKIGRDISLPETYVRSARNPLRDLGGKPPSQRHILAFYAG 518


>ref|XP_004251626.1| PREDICTED: probable glycosyltransferase At5g03795-like [Solanum
            lycopersicum]
          Length = 674

 Score =  514 bits (1324), Expect = e-143
 Identities = 270/519 (52%), Positives = 361/519 (69%), Gaps = 13/519 (2%)
 Frame = +1

Query: 463  FQRLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFRE-----GNRFP 627
            FQ + QI+ RK++ VV +VA+THL CQ+L+LPYG AL SL++E+     E          
Sbjct: 7    FQSVCQIDKRKWILVVVLVAVTHLFCQTLMLPYGNALHSLLSESNTQLSEKVSLLSKESS 66

Query: 628  VRPSQQFGNSIPMNASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDS-RQSDFAPE 804
            V  S + G       S F +  +    +K  ++S+  E+  I+      D  +       
Sbjct: 67   VVESTKVGEGFSGTLSSFDDVHMLAHRLKTVDNSDVSEDGEIDESVNEKDEVKPHSNHSV 126

Query: 805  VRNMDNAFQLVENRNGVNGMQSEKIEDVDEKQDNSFILERASDARHSLSLEQTVKPNHEI 984
            V+ M+N    VE+    N    +++ D+DE+      +++ ++++  LS+EQ VK   E+
Sbjct: 127  VKTMENDSDFVEDATIENDNLFDEMVDMDEETT----MQKNNESKWDLSIEQVVKTTDEL 182

Query: 985  SADNHLEEDVSRVSSELESRDSAFQSSLLGSSPPSY-------NITYLKNSTSDASSAVN 1143
            SAD+ L+ + + V ++ ++ +    SS+  S+            I +++ + +++S+   
Sbjct: 183  SADSDLDANRNTVLNDTKAANVTNSSSVEASNHLDNLPLVAIGEINFIRTTGNNSSTGNL 242

Query: 1144 SAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWS 1323
            + +L ++               +KKMR  LPPK+VT+I  M  +LVRHRA SR++RPRWS
Sbjct: 243  TQLLPNN-----GNHSLVLSTVKKKMRCMLPPKTVTTISQMERLLVRHRARSRAMRPRWS 297

Query: 1324 SIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPI 1503
            S RDKEILAA+ QI NAP+++ND+E+YAP FRN+SMFKRSYELMER+L++Y+YKEGEKPI
Sbjct: 298  SERDKEILAARLQIENAPLIRNDREIYAPAFRNMSMFKRSYELMERILRVYVYKEGEKPI 357

Query: 1504 FHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTN 1683
            FHQPI+KGLYASEGWFMKLMEGN ++VVKDPRKAHLFY+PFSSRMLE+SLYVRNSHNRTN
Sbjct: 358  FHQPIMKGLYASEGWFMKLMEGNNKFVVKDPRKAHLFYLPFSSRMLEHSLYVRNSHNRTN 417

Query: 1684 LRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGF 1863
            LRQ+LK+Y+EKI+AKY F+NRT GADHFLVACHDWAPYETRHHME CIKALCNADVT GF
Sbjct: 418  LRQYLKDYSEKIAAKYRFWNRTGGADHFLVACHDWAPYETRHHMEHCIKALCNADVTLGF 477

Query: 1864 KIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            KIGRDVSL ET VRSARNPLRDLGGKP S+R +LAF+AG
Sbjct: 478  KIGRDVSLAETYVRSARNPLRDLGGKPASQRKVLAFYAG 516


>emb|CAN76867.1| hypothetical protein VITISV_012309 [Vitis vinifera]
          Length = 1908

 Score =  514 bits (1323), Expect = e-143
 Identities = 287/554 (51%), Positives = 358/554 (64%), Gaps = 46/554 (8%)
 Frame = +1

Query: 457  ISFQRLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRFPVRP 636
            + FQ+   +E R+++F+VG+VAIT+LLCQSL+LPYG AL SL+ + +VP  +    P R 
Sbjct: 714  LKFQKFCLVETRRWIFMVGLVAITYLLCQSLLLPYGNALLSLLPDRDVPIYDNFSSPTRQ 773

Query: 637  SQ----QFGNSIPMNASDFTNTTLFVELVKVANSSNGGEEIG------------------ 750
            S         S+  NASD T+T+LFVE+V+    SN   E G                  
Sbjct: 774  SSVRPFMVNKSLLSNASDLTDTSLFVEVVEDVEKSNVTVEFGDDNGTEGTDEDIEDGLAL 833

Query: 751  --------IEFR-----PKGMDSRQSDFAPEVRNMDNAFQLVENRNGVNGMQSEKIEDVD 891
                    +EF      PK       +FA E + MD+  +  ++ N   G+  +K+ D+D
Sbjct: 834  EREDLENIVEFNEDDNGPKEKGGDTENFASESKGMDHVVEFTKDNNISKGLPFKKVVDMD 893

Query: 892  --------EKQDNSFILERASDARHSLSLEQTVKPNHE-ISADNHLEEDVSRVSSELESR 1044
                      Q+NS  L++ S+ RH  S    VKP +E IS DN ++ D S   S   S 
Sbjct: 894  GISALEYVNNQENSSDLKKDSEMRHIGSAVHIVKPPNEGISTDNIVKADASLTPSTPGSL 953

Query: 1045 DSAFQSSLLGSSPPS--YNITYLKNSTSDASSAVNSAVLRSDLXXXXXXXXXXXXLRRKK 1218
             + F+S LL S      +N TY++   S+ +++  + +  +D+                 
Sbjct: 954  GTTFKSHLLASPGVDSLFNTTYVEKMASNGNAS--NHLTATDISSVGK------------ 999

Query: 1219 MRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWSSIRDKEILAAKSQIANAPIVKNDQE 1398
                 P K + S D+              +RPRW+S RD+E+LAAK QI NAP VKND E
Sbjct: 1000 -----PEKEILSKDE------------NLLRPRWASPRDQEMLAAKLQIQNAPRVKNDPE 1042

Query: 1399 LYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPIFHQPILKGLYASEGWFMKLMEGNKR 1578
            L+APLFRNVSMFKRSYELMER+LK+Y+YK+GEKPIFHQPILKGLYASEGWFMKLME NK 
Sbjct: 1043 LHAPLFRNVSMFKRSYELMERILKVYVYKDGEKPIFHQPILKGLYASEGWFMKLMERNKX 1102

Query: 1579 YVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLRQHLKEYTEKISAKYPFFNRTSGA 1758
            +VVKDPR+A LFYMPFSSRMLEY LYVRNSHNRTNLRQ+LK+Y+EKI+AKY F+NRT G 
Sbjct: 1103 FVVKDPRQAQLFYMPFSSRMLEYKLYVRNSHNRTNLRQYLKQYSEKIAAKYRFWNRTGGX 1162

Query: 1759 DHFLVACHDWAPYETRHHMERCIKALCNADVTSGFKIGRDVSLPETLVRSARNPLRDLGG 1938
            DHFLVACHDWAPYETRHHME+CIKALCNADVT+GFKIGRDVSLPET VRSARNPLRDLGG
Sbjct: 1163 DHFLVACHDWAPYETRHHMEQCIKALCNADVTAGFKIGRDVSLPETYVRSARNPLRDLGG 1222

Query: 1939 KPPSERHILAFFAG 1980
            KPPSERHILAF+AG
Sbjct: 1223 KPPSERHILAFYAG 1236



 Score =  323 bits (829), Expect = 1e-85
 Identities = 213/574 (37%), Positives = 301/574 (52%), Gaps = 64/574 (11%)
 Frame = +1

Query: 451  MEISFQRLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRFPV 630
            M   F +L  +E R+ LF+VG+V  + ++ Q   LP      S+      P  +G+    
Sbjct: 3    MTALFMKLCHVESRRLLFIVGLVVASVIVFQVFELP------SMNTLTLSPTVKGS---- 52

Query: 631  RPSQQFGNSIPMNASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDSRQSDFAPEV- 807
              S   G++  +  S   N+ +   +V  +++S+  +E  +++     D    D++ E+ 
Sbjct: 53   -VSMMVGDATILKNSISANSYVIRTVVNNSDASDLEDEADMDYHLASDDDGDLDYSVEMH 111

Query: 808  --RNMDNAFQLVENRNGVNGMQSEKIEDVDEKQDNSFILERASDARHSLSLEQTVKPNHE 981
              +N DN F L     GV   +S  + +V    DNS   E+A + RH   LE     ++ 
Sbjct: 112  KEKNSDNEFIL---EKGVGLDKSMTVRNV-RHTDNS-PKEKAIEFRHG-PLEHLKISDNN 165

Query: 982  ISADNHLEEDVSRVSSELESRDSAFQSSLLGSSPPSYNITYLK--NSTSDASSAVN---- 1143
               D+  +   S    E  +RD      L+     S     L   + TSD S+  N    
Sbjct: 166  FKIDDDRKASTSLTIGEGSNRDGLVSLPLVSPGISSKGTRNLDADSRTSDLSTVSNVKHV 225

Query: 1144 -------SAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSR 1302
                   +  L   +            +   + R   P    T+I  MN +L++   SS 
Sbjct: 226  MEAEKDKNTNLLQTVSVPLDNNYTIADISITRRRGMKP----TTISKMNLLLLQSAVSSY 281

Query: 1303 SVRPRWSSIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIY 1482
            S+RPRWSS RD+E+L+A+S+I NAP+++N   LYA ++RNVSMFKRSYELMER+LKIYIY
Sbjct: 282  SMRPRWSSPRDRELLSARSEIQNAPVIRNTPGLYASVYRNVSMFKRSYELMERVLKIYIY 341

Query: 1483 KEGEKPIFHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVR 1662
            +EGEKPIFHQP L+G+YASEGWFMKL+EGNKR+VV+DPRKAHLFY+PFSS+ML    Y +
Sbjct: 342  REGEKPIFHQPRLRGIYASEGWFMKLIEGNKRFVVRDPRKAHLFYVPFSSKMLRTVFYEQ 401

Query: 1663 NSHNRTNLRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDW------------------ 1788
            NS    +L ++ K Y   I+ KY F+NRT GADH +VACHDW                  
Sbjct: 402  NSSTPRDLEKYFKNYVGLIAGKYRFWNRTGGADHLIVACHDWNPIYRTISTNTIRIKSQA 461

Query: 1789 ------------------------------APYETRHHMERCIKALCNADVTSGFKIGRD 1878
                                          AP  TR      I+ALCN+++ SGFKIG+D
Sbjct: 462  ITMPPFIFVGGESTYDLVSGTFSNKGFNSQAPRITRQCSWNSIRALCNSNIASGFKIGKD 521

Query: 1879 VSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
             +LP T +R + +PL+ LGGKPPS+R ILAFFAG
Sbjct: 522  TTLPVTYIRKSEDPLKYLGGKPPSQRPILAFFAG 555


>ref|XP_004148727.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
            sativus] gi|449501299|ref|XP_004161331.1| PREDICTED:
            probable glycosyltransferase At5g03795-like [Cucumis
            sativus]
          Length = 664

 Score =  495 bits (1274), Expect = e-137
 Identities = 276/517 (53%), Positives = 352/517 (68%), Gaps = 14/517 (2%)
 Frame = +1

Query: 472  LFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRFPVRPSQQFG 651
            L  I+ R+ L +VG+VA T+L+ QSL+LPYG AL+SL+ E+ +   + + + +    QFG
Sbjct: 10   LCHIQTRRCLLLVGVVAFTYLIFQSLLLPYGDALRSLLPEDAI--HKYDHYNI----QFG 63

Query: 652  NSIPMNASDFTNTTLFVELVKVANSSNGGEEIGI----------EFRPKGMDSRQSDFAP 801
             + P  A+   N    ++L  V+ +  G  + G           E+  +    R+ DF  
Sbjct: 64   PNSPKLAT-VRNPLTVLDLANVSTTPIGKIDKGFQRDNLLNSKGEYVKEEEIPREVDFGS 122

Query: 802  EVRNMDNAFQLVENRNGVNGMQSEKIEDVDEKQDNSFILERASDARHSLSLEQTVKPN-- 975
            E  N  +A   +E+ +G     ++ I  VD +    F L+           +Q VKP+  
Sbjct: 123  ESGNNVDANGNLES-DGTKNRANDSILPVDGETSFGFPLK-----------QQVVKPSDT 170

Query: 976  HEISADNHLEE--DVSRVSSELESRDSAFQSSLLGSSPPSYNITYLKNSTSDASSAVNSA 1149
            + I+ +N LE+   +     ELE   ++    L  +  P  + T++  +++   + ++S 
Sbjct: 171  NTITLENELEDFGQMDLDFGELEEFKNSSLQKLEDTDMPFNSSTFMLQTSTSTVNTIHSH 230

Query: 1150 VLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWSSI 1329
             L S+L             +RKKM+SELPPK+VT++++MN IL RHR SSR++RPR SS+
Sbjct: 231  QLLSNLSSSASETNSTSIGKRKKMKSELPPKTVTTLEEMNRILFRHRRSSRAMRPRRSSL 290

Query: 1330 RDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPIFH 1509
            RD+EI +AKS I  A  V ND ELYAPLFRNVSMFKRSYELMER LKIY+Y++G+KPIFH
Sbjct: 291  RDQEIFSAKSLIVQASAV-NDPELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFH 349

Query: 1510 QPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLR 1689
            QPILKGLYASEGWFMKLMEGNKR+VVKDPRKAHLFYMPFSSRMLEY+LYVRNSHNRTNLR
Sbjct: 350  QPILKGLYASEGWFMKLMEGNKRFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLR 409

Query: 1690 QHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGFKI 1869
            Q LKEY E I+AKYP++NRT GADHFL  CHDWAPYETRHHME CIKALCNADVT GFKI
Sbjct: 410  QFLKEYAENIAAKYPYWNRTGGADHFLAGCHDWAPYETRHHMEHCIKALCNADVTVGFKI 469

Query: 1870 GRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            GRDVSLPET VRSARNPLRDLGGKP S+RHILAF+AG
Sbjct: 470  GRDVSLPETYVRSARNPLRDLGGKPASQRHILAFYAG 506


>gb|EYU27286.1| hypothetical protein MIMGU_mgv1a002540mg [Mimulus guttatus]
          Length = 661

 Score =  491 bits (1265), Expect = e-136
 Identities = 277/525 (52%), Positives = 355/525 (67%), Gaps = 17/525 (3%)
 Frame = +1

Query: 457  ISFQRLFQIEVRKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENE---VPFREGNRFP 627
            +  ++L Q E RK++F+VG+V +THL CQSL+LPYG AL SL+ +++   V   E +   
Sbjct: 5    VKIKKLVQFEKRKWVFLVGLVGLTHLFCQSLMLPYGNALLSLLPDDKSSVVVTAEDDDSS 64

Query: 628  VRPSQQFGNSIPMNASDFTNTTLFVELVKVANSSNGGEEIGIEFRPK--GMDSRQSDFAP 801
            V+ S    N   + AS+  + +L V  V    +S  G +IG +      G D+++     
Sbjct: 65   VKISI-VENLGTLAASNLDSQSLLVRRV----TSTVGRDIGNDDDKGSVGTDNQEKMNPD 119

Query: 802  EVRNMDNAFQLVENRNGVNG-----MQSE----KIEDVDEKQDNSFILERASDARHSLSL 954
               + D+ F  VE+   VN      M  E    +IE   + +  S I E+  +   ++S+
Sbjct: 120  PDMDDDDDFDFVEDETLVNNSNNVDMDKEGSVMQIEISQQHESLSQIGEQGDNIMKNISV 179

Query: 955  EQTVKPNHEISADNHLEE--DVSRVSSELESRDSAFQSSLLGSSPPSYNITYLKNSTSDA 1128
             Q  K +  +  D+   E  D +     + S     +S +  +S    +I  + N  SD+
Sbjct: 180  IQLAKESPGVVLDSETSEMKDKNVKGGSVTSSPLLIESQVSTTSSAEGHILMVNNKLSDS 239

Query: 1129 SSAVNSAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSV 1308
            ++   S+V                   +KKMR ++PPK+VT +++M  ILVR+RA SR++
Sbjct: 240  TNG--SSV-------------------KKKMRCDMPPKTVTPVNEMERILVRNRARSRAM 278

Query: 1309 RPRWSSIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKE 1488
            RPRWSS RD+EIL AK +I + PI+ ND ELYAPLFRN+SMFKRSYELMER+LK+Y+YKE
Sbjct: 279  RPRWSSERDQEILTAKLKIESPPILNNDPELYAPLFRNISMFKRSYELMERVLKVYVYKE 338

Query: 1489 GEKPIFHQPILKGLYASEGWFMKLMEG-NKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRN 1665
            GEKPIFHQPILKGLYASEGWFMKLMEG NKR++VKDPRKAHLFYMPFSSRMLEY+LYVRN
Sbjct: 339  GEKPIFHQPILKGLYASEGWFMKLMEGGNKRFLVKDPRKAHLFYMPFSSRMLEYTLYVRN 398

Query: 1666 SHNRTNLRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNA 1845
            SHNRTNLR +LK+Y+EKI++KY F+NRT GADHFLVACHDWAPYETRHHME CIKALCNA
Sbjct: 399  SHNRTNLRHYLKDYSEKIASKYRFWNRTGGADHFLVACHDWAPYETRHHMEHCIKALCNA 458

Query: 1846 DVTSGFKIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            DVT GFKIGRDVSLPET VRSARNPLRDLGGKPPS+R  LAFFAG
Sbjct: 459  DVTGGFKIGRDVSLPETYVRSARNPLRDLGGKPPSQRSTLAFFAG 503


>ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]
            gi|332005353|gb|AED92736.1| Exostosin family protein
            [Arabidopsis thaliana] gi|591401784|gb|AHL38619.1|
            glycosyltransferase, partial [Arabidopsis thaliana]
          Length = 610

 Score =  469 bits (1208), Expect = e-129
 Identities = 267/500 (53%), Positives = 333/500 (66%), Gaps = 3/500 (0%)
 Frame = +1

Query: 490  RKFLFVVGIVAITHLLCQSLILPYGTALKSLMAEN---EVPFREGNRFPVRPSQQFGNSI 660
            RK+  +VGIVA+TH+L   L+L YG AL+ L+ +    ++P  E N   + PS+   N++
Sbjct: 16   RKWAILVGIVALTHIL---LLLSYGDALRYLLPDGRRLKLP-NENNALLMTPSR---NTL 68

Query: 661  PMNASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDSRQSDFAPEVRNMDNAFQLVE 840
             +N S+ +  +    L K       G   G   R +  D             D  F    
Sbjct: 69   AVNVSEDSAVSGIHVLEK------NGYVSGFGLRNESED-------------DEGF---- 105

Query: 841  NRNGVNGMQSEKIEDVDEKQDNSFILERASDARHSLSLEQTVKPNHEISADNHLEEDVSR 1020
                V  +  E  EDV   +D+  I E A  + +    E TV     +S  N+  + V  
Sbjct: 106  ----VGNVDFESFEDV---KDSIIIKEVAGSSDNLFPSETTVMQKESVSTSNNGYQ-VQN 157

Query: 1021 VSSELESRDSAFQSSLLGSSPPSYNITYLKNSTSDASSAVNSAVLRSDLXXXXXXXXXXX 1200
            V+  ++S+ +   S L G S          +  S AS   NS++L S             
Sbjct: 158  VT--VQSQKNVKSSILSGGS----------SIASPASG--NSSLLVSKKVS--------- 194

Query: 1201 XLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWSSIRDKEILAAKSQIANAPI 1380
              ++KKMR +LPPKSVT+ID+MN IL RHR +SR++RPRWSS RD+EIL A+ +I NAP+
Sbjct: 195  --KKKKMRCDLPPKSVTTIDEMNRILARHRRTSRAMRPRWSSRRDEEILTARKEIENAPV 252

Query: 1381 VKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPIFHQPILKGLYASEGWFMKL 1560
             K ++ELY P+FRNVS+FKRSYELMER+LK+Y+YKEG +PIFH PILKGLYASEGWFMKL
Sbjct: 253  AKLERELYPPIFRNVSLFKRSYELMERILKVYVYKEGNRPIFHTPILKGLYASEGWFMKL 312

Query: 1561 MEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLRQHLKEYTEKISAKYPFF 1740
            MEGNK+Y VKDPRKAHL+YMPFS+RMLEY+LYVRNSHNRTNLRQ LKEYTE IS+KYPFF
Sbjct: 313  MEGNKQYTVKDPRKAHLYYMPFSARMLEYTLYVRNSHNRTNLRQFLKEYTEHISSKYPFF 372

Query: 1741 NRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGFKIGRDVSLPETLVRSARNP 1920
            NRT GADHFLVACHDWAPYETRHHME CIKALCNADVT+GFKIGRD+SLPET VR+A+NP
Sbjct: 373  NRTDGADHFLVACHDWAPYETRHHMEHCIKALCNADVTAGFKIGRDISLPETYVRAAKNP 432

Query: 1921 LRDLGGKPPSERHILAFFAG 1980
            LRDLGGKPPS+R  LAF+AG
Sbjct: 433  LRDLGGKPPSQRRTLAFYAG 452


>ref|XP_006400529.1| hypothetical protein EUTSA_v10013011mg [Eutrema salsugineum]
            gi|557101619|gb|ESQ41982.1| hypothetical protein
            EUTSA_v10013011mg [Eutrema salsugineum]
          Length = 606

 Score =  460 bits (1184), Expect = e-127
 Identities = 259/499 (51%), Positives = 322/499 (64%), Gaps = 2/499 (0%)
 Frame = +1

Query: 490  RKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRFPVRPSQQFGNSIPMN 669
            RK+  +VGI+A+TH+   SL+L YG  L  L     +     N   + PSQ   N++  N
Sbjct: 16   RKWAILVGILALTHI---SLLLSYGRYL--LPDGRRLKLPNENNTLMNPSQ---NALLRN 67

Query: 670  ASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDSRQSDFAPEVRNMDNAFQLVENRN 849
                   TL V + +V               P G++          RN        ++ +
Sbjct: 68   -------TLAVNVSEV---------------PAGLEKNGYVTGSGPRN--------DSED 97

Query: 850  GVNGMQSEKIEDVDEKQDNSFILERASDARHSLSLEQTVKPNHEISADNH--LEEDVSRV 1023
                + S   E  ++ +D+  I E A ++      E+ V  N  +   NH   E++VS  
Sbjct: 98   DEGFVDSVDFEGFEDAKDSVIIKEVAVNSDSLFPSEKVVMKNEGLLTSNHGHQEQNVS-- 155

Query: 1024 SSELESRDSAFQSSLLGSSPPSYNITYLKNSTSDASSAVNSAVLRSDLXXXXXXXXXXXX 1203
               L+S+ +   S L   +  S     L NS+   S  V                     
Sbjct: 156  ---LQSQKNVKSSKLNAGN--SIAAPVLGNSSLPVSKQVG-------------------- 190

Query: 1204 LRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWSSIRDKEILAAKSQIANAPIV 1383
             ++KKMR +LPPK+VT+ID+MN IL RHR SSR++RPRWSS RD+EILAA+ +I NAP+V
Sbjct: 191  -KKKKMRCDLPPKTVTTIDEMNRILARHRRSSRAMRPRWSSRRDEEILAARKEIENAPVV 249

Query: 1384 KNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPIFHQPILKGLYASEGWFMKLM 1563
              D+ELY P+FRNVSMFKRSYELMERMLK+Y+YKEG +PIFH PILKGLYASEGWFMKLM
Sbjct: 250  TIDRELYPPIFRNVSMFKRSYELMERMLKVYVYKEGNRPIFHTPILKGLYASEGWFMKLM 309

Query: 1564 EGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLRQHLKEYTEKISAKYPFFN 1743
            E NK Y VKDPR+AHL+YMPFS+RMLEY+LYVRNSHNRTNLRQ LKEYTE+I +KYPFFN
Sbjct: 310  EANKHYTVKDPRRAHLYYMPFSARMLEYTLYVRNSHNRTNLRQFLKEYTEQIGSKYPFFN 369

Query: 1744 RTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGFKIGRDVSLPETLVRSARNPL 1923
            RT GADHFLVACHDWAPYETRHHME CIKALCNAD+T+GFKIGRD+SLPET VR+A+NPL
Sbjct: 370  RTGGADHFLVACHDWAPYETRHHMEHCIKALCNADITAGFKIGRDISLPETYVRAAKNPL 429

Query: 1924 RDLGGKPPSERHILAFFAG 1980
            RDLGGKPPS+R  LAF+AG
Sbjct: 430  RDLGGKPPSQRRTLAFYAG 448


>ref|XP_003524401.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
            [Glycine max] gi|571456766|ref|XP_006580477.1| PREDICTED:
            probable glycosyltransferase At5g03795-like isoform X2
            [Glycine max] gi|571456768|ref|XP_006580478.1| PREDICTED:
            probable glycosyltransferase At5g03795-like isoform X3
            [Glycine max]
          Length = 643

 Score =  460 bits (1184), Expect = e-127
 Identities = 263/515 (51%), Positives = 338/515 (65%), Gaps = 18/515 (3%)
 Frame = +1

Query: 490  RKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGN-RFPVRPSQQF---GNS 657
            R+ LF++G++A+  LL QS+++PYG         + VP +  N R  +  + ++    N 
Sbjct: 4    RRLLFLLGVLAVNFLLFQSILVPYGNGNAPW---SSVPQKYDNVRLSLHSTPKYFTVRNP 60

Query: 658  IPMNASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDSRQSDFAPEVRNM--DNAFQ 831
                 S F+N++ F+  V+  +     +E+G   + KGM +         RN   DN F+
Sbjct: 61   PTGTVSGFSNSSAFIATVQKVHIPIVVDEVG-HGKKKGMHNNVKGGLVSERNGSDDNVFE 119

Query: 832  LVENRNGVNGMQSEK---------IEDVDEKQDNSFILERASDARHSLSLEQTVKPNHEI 984
               +RN V  +  +K         +E V  K   +FI + A  ++   S++Q ++     
Sbjct: 120  HGADRNDVRSLSEKKDVGKGDRLELESVGSK---NFIADSAKGSKVDFSVKQFLETKRGA 176

Query: 985  SA---DNHLEEDVSRVSSELESRDSAFQSSLLGSSPPSYNITYLKNSTSDASSAVNSAVL 1155
            S    DN+++         + + DS+  S+ L +SP       +  S SD S+AV+    
Sbjct: 177  SRLVKDNNMDSR-EHDGVGVHTSDSSTFSTNLENSPQK-----IVFSASDNSTAVS---- 226

Query: 1156 RSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWSSIRD 1335
                            + R+KMR  +PPKS T I +MN ILVR RAS+R++RPRWSS RD
Sbjct: 227  ----------------IPRRKMRCMMPPKSRTLIGEMNRILVRKRASARAMRPRWSSKRD 270

Query: 1336 KEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPIFHQP 1515
             EILAA+S+I +AP V +D+ELYAPLFRN+SMFKRSYELMER LK+YIYK+G KPIFHQP
Sbjct: 271  LEILAARSEIEHAPTVTHDKELYAPLFRNLSMFKRSYELMERTLKVYIYKDGNKPIFHQP 330

Query: 1516 ILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLRQH 1695
            I+KGLYASEGWFMKLME NK +V+KDP KAHLFYMPFSSRMLE++LYVRNSHNRTNLRQ 
Sbjct: 331  IMKGLYASEGWFMKLMEENKHFVLKDPAKAHLFYMPFSSRMLEHALYVRNSHNRTNLRQF 390

Query: 1696 LKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGFKIGR 1875
            LK+YT+KISAKY +FNRT GADHFLVACHDWAPYETRHHME CIKALCNADVT GFKIGR
Sbjct: 391  LKDYTDKISAKYRYFNRTGGADHFLVACHDWAPYETRHHMEYCIKALCNADVTQGFKIGR 450

Query: 1876 DVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            DVSLPE  VRS R+P RDLGGKPP +R ILAF+AG
Sbjct: 451  DVSLPEAYVRSVRDPQRDLGGKPPHQRPILAFYAG 485


>ref|XP_007160303.1| hypothetical protein PHAVU_002G310300g [Phaseolus vulgaris]
            gi|593794531|ref|XP_007160304.1| hypothetical protein
            PHAVU_002G310300g [Phaseolus vulgaris]
            gi|561033718|gb|ESW32297.1| hypothetical protein
            PHAVU_002G310300g [Phaseolus vulgaris]
            gi|561033719|gb|ESW32298.1| hypothetical protein
            PHAVU_002G310300g [Phaseolus vulgaris]
          Length = 648

 Score =  458 bits (1178), Expect = e-126
 Identities = 267/520 (51%), Positives = 337/520 (64%), Gaps = 23/520 (4%)
 Frame = +1

Query: 490  RKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGN-RFPV---RPSQQFGNS 657
            R+ LF++G++A+ +LL QS+++PYG+        + VP +    RFP     P      S
Sbjct: 4    RRLLFLLGVLAVNYLLFQSILIPYGSGNAPW---SSVPQKYDKVRFPSLHSTPKYFTVWS 60

Query: 658  IPMNA-SDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDSRQSDFAPEVRNM--DNAF 828
             PM + S F+N++ F+  V+   +     E+G   +    +    D   E RN+  D+ F
Sbjct: 61   PPMGSVSGFSNSSAFIATVEKMPNPIVQFEVGDGKKMGRHNDENGDLVSE-RNLSNDDVF 119

Query: 829  QLVENRNGVNGMQSEK----------IEDVDEKQDNSFILERASDARHS-LSLEQTVKPN 975
            +   ++N    +  +K          +E V+ K   + IL + SD   S     +T +  
Sbjct: 120  EHGTDKNDARSLSEKKDVGRKGDGLDLESVESKNFYA-ILGKGSDVNFSGKQFSKTKRRA 178

Query: 976  HEISADNHL---EEDVSRVSSELESRDSAFQSSLLGSSPPSYNITYLKNSTSDA--SSAV 1140
              +  DN++   E D  RV +   S  SA             N+T L+NS      S++ 
Sbjct: 179  SRLVNDNNVDSREYDGVRVHTSHSSTSSA-------------NVTSLENSAQKVVFSASN 225

Query: 1141 NSAVLRSDLXXXXXXXXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRW 1320
            NS  + +                R+KMR  +PPK+ T I +MNHILVR RAS+R++RPRW
Sbjct: 226  NSTAMITP---------------RRKMRCMMPPKTRTLIQEMNHILVRRRASARAMRPRW 270

Query: 1321 SSIRDKEILAAKSQIANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKP 1500
            SS RD EILAA+ +I +AP V  D+ELYAPLFRN+SMFKRSYELMERMLK+YIYK+G+KP
Sbjct: 271  SSKRDLEILAARLEIEHAPTVTEDKELYAPLFRNISMFKRSYELMERMLKVYIYKDGDKP 330

Query: 1501 IFHQPILKGLYASEGWFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRT 1680
            IFHQPILKGLYASEGWFMKLME NK +VVKDP KAHLFY+PFS+RMLE+SLYVRNSHNRT
Sbjct: 331  IFHQPILKGLYASEGWFMKLMEENKHFVVKDPSKAHLFYLPFSARMLEHSLYVRNSHNRT 390

Query: 1681 NLRQHLKEYTEKISAKYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSG 1860
            NLRQ LK+YT+KISAKY  FNRT GADHFLVACHDWAPYETRHHME CIKALCNADVT G
Sbjct: 391  NLRQFLKDYTDKISAKYRHFNRTGGADHFLVACHDWAPYETRHHMEYCIKALCNADVTQG 450

Query: 1861 FKIGRDVSLPETLVRSARNPLRDLGGKPPSERHILAFFAG 1980
            FKIGRDVSLPE  VRS R+P RDLGGKPP +R ILAF+AG
Sbjct: 451  FKIGRDVSLPEAYVRSVRDPQRDLGGKPPHQRPILAFYAG 490


>ref|XP_006287301.1| hypothetical protein CARUB_v10000494mg [Capsella rubella]
            gi|482556007|gb|EOA20199.1| hypothetical protein
            CARUB_v10000494mg [Capsella rubella]
          Length = 613

 Score =  451 bits (1161), Expect = e-124
 Identities = 252/505 (49%), Positives = 326/505 (64%), Gaps = 8/505 (1%)
 Frame = +1

Query: 490  RKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENEVPFREGNRFPVRPSQQFGNSIPMN 669
            RK+  +VGIVA+TH+L   L+L YG  L+ L+ +       G R  + P++         
Sbjct: 16   RKWAILVGIVALTHIL---LLLSYGDVLRYLVPD-------GRRLKL-PNES------KK 58

Query: 670  ASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDSRQSDFAPEVRNMDNAFQLVENRN 849
                +  TL V +          E+ GI    K           E  + D  F       
Sbjct: 59   LMTLSRNTLAVSV---------SEDSGIHVLEKNASISGFGLRNETED-DEGFD------ 102

Query: 850  GVNGMQSEKIEDVDEKQDNSFILERASDARHSL-SLEQTVKPNHEISADNH--LEEDVS- 1017
                 ++   E  ++ +D+  ++++  ++  +L  LE  VK + E+S   +    +DVS 
Sbjct: 103  -----ETADFESFEDAKDSVVVIKQVVESSDTLYPLEMNVKQSAEMSTSKYGYQVQDVSV 157

Query: 1018 ----RVSSELESRDSAFQSSLLGSSPPSYNITYLKNSTSDASSAVNSAVLRSDLXXXXXX 1185
                +V + + S  S+  +S +G  P S N + L       S  V+              
Sbjct: 158  ESQKKVKTSMLSASSSLAASSVGKLPVSGNSSLL------VSKQVS-------------- 197

Query: 1186 XXXXXXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWSSIRDKEILAAKSQI 1365
                   ++KKMR  LPPK+VT+I++MN IL RHR +SR++RPRWSS RD+EILAA+ +I
Sbjct: 198  -------KKKKMRCNLPPKTVTTIEEMNRILARHRRTSRAMRPRWSSRRDEEILAARKEI 250

Query: 1366 ANAPIVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPIFHQPILKGLYASEG 1545
             NAP+ K ++ELY P++RNVSMFKRSYELMER LK+Y+YKEG +PIFH PILKGLYASEG
Sbjct: 251  ENAPVAKLERELYPPIYRNVSMFKRSYELMERTLKVYVYKEGNRPIFHTPILKGLYASEG 310

Query: 1546 WFMKLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLRQHLKEYTEKISA 1725
            WFMKLME +K+Y VKDPR+AHL+YMPFS+RMLE++LYVRNSHNRTNLRQ LKEYTE IS+
Sbjct: 311  WFMKLMEESKQYTVKDPRRAHLYYMPFSARMLEFTLYVRNSHNRTNLRQFLKEYTEHISS 370

Query: 1726 KYPFFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGFKIGRDVSLPETLVR 1905
            KYPFFNRT GADHFLVACHDWAPYETRHHME CIKALCNADVT+GFKIGRD+SLPET VR
Sbjct: 371  KYPFFNRTDGADHFLVACHDWAPYETRHHMEHCIKALCNADVTAGFKIGRDISLPETYVR 430

Query: 1906 SARNPLRDLGGKPPSERHILAFFAG 1980
            +A+NP RDLGGKPPS+R  LAF+AG
Sbjct: 431  AAKNPQRDLGGKPPSQRRTLAFYAG 455


>ref|XP_002871898.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297317735|gb|EFH48157.1| exostosin family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 610

 Score =  443 bits (1139), Expect = e-121
 Identities = 256/502 (50%), Positives = 326/502 (64%), Gaps = 5/502 (0%)
 Frame = +1

Query: 490  RKFLFVVGIVAITHLLCQSLILPYGTALKSLMAENE-VPFREGNRFPVRPSQQFGNSIPM 666
            RK+  +VGI+A T++L   L+L YG AL+ L+ +   +     N   + PS+   N++ +
Sbjct: 17   RKWAILVGIMAFTYIL---LLLSYGDALRYLLPDGRRLKLPNENNALMTPSR---NTLAV 70

Query: 667  NASDFTNTTLFVELVKVANSSNGGEEIGIEFRPKGMDSRQSDFAPEVRNMDNAFQLVENR 846
            N S+ +  +    L K  N S+ G     E   +G           V N+D         
Sbjct: 71   NFSEDSAGSGIHVLEKNGNVSDFGLRNESEDDEEGF----------VGNVD--------- 111

Query: 847  NGVNGMQSEKIEDVDEKQDNSFILERASDARHSLSLEQTVKPNHEISADN--HLEEDVSR 1020
                       E  ++ +D+  I E A  +      E+TV  N  +S  N  H  ++VS 
Sbjct: 112  ----------FESFEDAKDSIIIKEVAGSSDSLFPTEKTVMQNEIVSTSNNGHQVQNVSV 161

Query: 1021 VSSE-LESRDSAFQSSLLGSSPPSYNITYLKNSTSDASSAVNSAVLRSDLXXXXXXXXXX 1197
             S + L+S  S+  SS+ GS+          NS+   S  V+                  
Sbjct: 162  QSQKNLKSSMSSAGSSIAGSA--------FGNSSLLVSRKVS------------------ 195

Query: 1198 XXLRRKKMRSELPPKSVTSIDDMNHILVRHRASSRSVRPRWSSIRDKEILAAKSQIANAP 1377
               ++KKMR +LPPKSVT+ID+MN IL RHR +SR++      +RD+EIL A+ +I NAP
Sbjct: 196  ---KKKKMRCDLPPKSVTTIDEMNRILARHRRTSRAMV--CVQLRDEEILTARKEIENAP 250

Query: 1378 -IVKNDQELYAPLFRNVSMFKRSYELMERMLKIYIYKEGEKPIFHQPILKGLYASEGWFM 1554
             +  ++++LY P+FRNVSMFKRSYELMER+LK+Y+YKEG +PIFH PILKGLYASEGWFM
Sbjct: 251  PVATSERQLYPPIFRNVSMFKRSYELMERILKVYVYKEGNRPIFHTPILKGLYASEGWFM 310

Query: 1555 KLMEGNKRYVVKDPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLRQHLKEYTEKISAKYP 1734
            KLMEGNK+Y VKDPRKAHL+YMPFS+RMLEY+LYVRNSHNRTNLRQ LKEYTE IS+KYP
Sbjct: 311  KLMEGNKQYTVKDPRKAHLYYMPFSARMLEYTLYVRNSHNRTNLRQFLKEYTEHISSKYP 370

Query: 1735 FFNRTSGADHFLVACHDWAPYETRHHMERCIKALCNADVTSGFKIGRDVSLPETLVRSAR 1914
            FFNRT GADHFLVACHDWAPYETRHHME CIKALCNADVT+GFKIGRD+SLPET VR+A+
Sbjct: 371  FFNRTDGADHFLVACHDWAPYETRHHMEHCIKALCNADVTAGFKIGRDISLPETYVRAAK 430

Query: 1915 NPLRDLGGKPPSERHILAFFAG 1980
            NPLRDLGGKPPS+R  LAF+AG
Sbjct: 431  NPLRDLGGKPPSQRRTLAFYAG 452

