BLASTX nr result

ID: Cocculus23_contig00015049 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00015049
         (2274 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268...   642   0.0  
ref|XP_007204617.1| hypothetical protein PRUPE_ppa002387mg [Prun...   638   e-180
ref|XP_007012125.1| Exostosin family protein, putative isoform 2...   638   e-180
ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g...   638   e-180
ref|XP_006476045.1| PREDICTED: probable glycosyltransferase At5g...   635   e-179
ref|XP_006451253.1| hypothetical protein CICLE_v10007651mg [Citr...   632   e-178
ref|XP_006450684.1| hypothetical protein CICLE_v10007698mg [Citr...   630   e-177
ref|XP_006476044.1| PREDICTED: probable glycosyltransferase At5g...   629   e-177
ref|XP_007012124.1| Exostosin family protein, putative isoform 1...   628   e-177
ref|XP_006353481.1| PREDICTED: probable glycosyltransferase At5g...   627   e-177
ref|XP_006476046.1| PREDICTED: probable glycosyltransferase At5g...   626   e-176
ref|XP_002309547.2| hypothetical protein POPTR_0006s25540g [Popu...   626   e-176
ref|XP_004287457.1| PREDICTED: probable glycosyltransferase At3g...   625   e-176
ref|XP_007225154.1| hypothetical protein PRUPE_ppa002395mg [Prun...   623   e-175
ref|XP_002324801.2| hypothetical protein POPTR_0018s00290g [Popu...   622   e-175
ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]...   619   e-174
gb|EXB59796.1| putative glycosyltransferase [Morus notabilis]         619   e-174
ref|XP_004251626.1| PREDICTED: probable glycosyltransferase At5g...   619   e-174
ref|XP_002324438.2| hypothetical protein POPTR_0018s09250g [Popu...   617   e-174
ref|XP_006287301.1| hypothetical protein CARUB_v10000494mg [Caps...   614   e-173

>ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268163 [Vitis vinifera]
          Length = 738

 Score =  642 bits (1655), Expect = 0.0
 Identities = 345/663 (52%), Positives = 431/663 (65%), Gaps = 20/663 (3%)
 Frame = +3

Query: 60   STAEDSVSKSAIIGISSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSP 239
            S A D    S  + +  D++ +++T            +E+ +    ++    E    F+ 
Sbjct: 78   SNASDLTDTSLFVEVVEDVEKSNVTVEFGDDNGTEGTDEDIEDGLALEREDLENIVEFNE 137

Query: 240  ESRSPV--------------GASEQANLMKPAN--KYSP-EKVVDLGKNFRIENVEDPHN 368
            +   P               G        K  N  K  P +KVVD+     +E V +  N
Sbjct: 138  DDNGPKEKGGDTENFASESKGMDHVVEFTKDNNISKGLPFKKVVDMDGISALEYVNNQEN 197

Query: 369  GSVSEKTREPENANVQEQFHKTEN-GSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXX 545
             S  +K  E  +        K  N G S D I K D +L      S              
Sbjct: 198  SSDLKKDSEMRHIGSAVHIVKPPNEGISTDNIVKADASLTPSTPGSLGTTFKSHLLASPG 257

Query: 546  XXXXTDPAMPIKLDGNSSTSLKFVDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSI 725
                 +     K+  N + S     ++ISSV K  ++ +SKDE    L+   A  N +S 
Sbjct: 258  VDSLFNTTYIEKMASNGNASNHLTATDISSVGKPEKEILSKDENLLVLQSDLADLNNNSA 317

Query: 726  ITSNPNKDQWMK--PPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQN 899
            +TSNP + +     PP SV SI +MN  L+R+ ASS + +PRW++PRD+E+L+AK QIQN
Sbjct: 318  MTSNPGRKKMQSEMPPKSVTSIYDMNRRLVRHRASSRAMRPRWASPRDQEMLAAKLQIQN 377

Query: 900  AAITEKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGW 1079
            A    K D EL  PLFRNVS+FKRSYELMER LKVYVYK+G KPIFH+PI KG+YASEGW
Sbjct: 378  APRV-KNDPELHAPLFRNVSMFKRSYELMERILKVYVYKDGEKPIFHQPILKGLYASEGW 436

Query: 1080 FMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAK 1259
            FMK ME +K FVVK PR+A LFY+PFSSRMLE  LYV NSH+  NL QYLK Y + I+AK
Sbjct: 437  FMKLMERNKHFVVKDPRQAQLFYMPFSSRMLEYKLYVRNSHNRTNLRQYLKQYSEKIAAK 496

Query: 1260 YPFWNRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVI 1439
            Y FWNRTGGADHFLVACHDWAP ETRH HM   I+ALCNADV   F IG+DVSLPET V 
Sbjct: 497  YRFWNRTGGADHFLVACHDWAPYETRH-HMEQCIKALCNADVTAGFKIGRDVSLPETYVR 555

Query: 1440 SPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKM 1619
            S + P +D GGKPPS+R ILAF+AGNMHGYLRPILL++W++KDPDMKI+G M  G  SKM
Sbjct: 556  SARNPLRDLGGKPPSERHILAFYAGNMHGYLRPILLKYWKDKDPDMKIYGPMPPGVASKM 615

Query: 1620 NYIQHMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVA 1799
            NYIQHMKSSK+CIC KG+EVNSPRVVE+I YECVPVIISDN+VPPFF+VLDW AF++ +A
Sbjct: 616  NYIQHMKSSKFCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILA 675

Query: 1800 EKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQI 1979
            EKDIPNLK++LLSIP  +YL M L ++K+Q+HF+WH+KP+KYD+FHM LHSIWYNRVFQ+
Sbjct: 676  EKDIPNLKDVLLSIPNDKYLQMQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNRVFQV 735

Query: 1980 KAK 1988
            K +
Sbjct: 736  KPR 738


>ref|XP_007204617.1| hypothetical protein PRUPE_ppa002387mg [Prunus persica]
            gi|462400148|gb|EMJ05816.1| hypothetical protein
            PRUPE_ppa002387mg [Prunus persica]
          Length = 678

 Score =  638 bits (1646), Expect = e-180
 Identities = 330/649 (50%), Positives = 431/649 (66%), Gaps = 5/649 (0%)
 Frame = +3

Query: 57   SSTAEDSVSKSAIIGISSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFS 236
            SS     V     +  SSD  + S+  G+ +    S +  E  H    K     +     
Sbjct: 66   SSAKSVMVRNPLTVHSSSDFIDVSMFSGVEKAAGNSGLGGEIGHDRGRKGKDVHKEIDLI 125

Query: 237  PESRSPVGASEQANLMKPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQ 416
             E +               + +  E VVD   +  + ++E+  NGSV +K    +     
Sbjct: 126  LEEKGIDNTFANTIHRNVDHNFPSENVVDTNGSLALVSIENQENGSVQDKANVAKYGFPL 185

Query: 417  EQFHKTENGSSLDIIRKEDTNL---RLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLD 587
            E+       +S +   KE++NL   + DG ++G                   P+ P+ L 
Sbjct: 186  ERIVLPNYETSTENTLKENSNLTAKKSDGVKTGF------------------PSSPLILP 227

Query: 588  GNSSTSLKFVDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMK-- 761
              +S +    ++++ S                S K     S   S++ +NP + +     
Sbjct: 228  AAASLA-NATNASVGST---------------SFKSDVVTSKNGSVVMTNPGRKKMKSEL 271

Query: 762  PPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPP 941
            PP S+ SI EMN +L+R+ ASS S +PRWS+ RD++IL+ K QI++  +    D+EL+ P
Sbjct: 272  PPKSITSIYEMNHILVRHRASSRSLRPRWSSVRDQDILAVKSQIEHPPVAIN-DRELYAP 330

Query: 942  LFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVK 1121
            LFRNVS+FKRSYELMERTLK+Y+YK+G KPIFH+PI KG+YASEGWFMK M+G+K+FVVK
Sbjct: 331  LFRNVSMFKRSYELMERTLKIYIYKDGNKPIFHQPILKGLYASEGWFMKLMQGYKRFVVK 390

Query: 1122 SPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFL 1301
             PR+AHLFY+PFSSRMLE +LYV NSH+  NL Q+LK+Y + I+AKYP+WNRTGGADHFL
Sbjct: 391  DPRKAHLFYMPFSSRMLEYSLYVRNSHNRTNLRQFLKEYSEKIAAKYPYWNRTGGADHFL 450

Query: 1302 VACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPP 1481
            VACHDWAP ETRH HM   ++ALCNADV   F IG+DVSLPET V S + P +D GGKPP
Sbjct: 451  VACHDWAPYETRH-HMERCMKALCNADVTGGFKIGRDVSLPETYVRSARNPLRDLGGKPP 509

Query: 1482 SQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCIC 1661
            SQRQILAF+AGNMHGYLRPILL++W+++DPDMKIFG M  G  SKMNYIQHMKSSKYCIC
Sbjct: 510  SQRQILAFYAGNMHGYLRPILLEYWKDRDPDMKIFGPMPPGVASKMNYIQHMKSSKYCIC 569

Query: 1662 AKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSI 1841
             KG+EVNSPRVVE+I YECVPVIISDN+VPPFFEVL+W AF+V +AE+DIPNLK ILLSI
Sbjct: 570  PKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWGAFSVILAERDIPNLKEILLSI 629

Query: 1842 PQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            P+++YL M   ++K+Q+HF+WH++P+KYD+FHM LHSIWYNRVFQIK +
Sbjct: 630  PEEKYLQMQRGVRKVQKHFLWHARPLKYDLFHMTLHSIWYNRVFQIKIR 678


>ref|XP_007012125.1| Exostosin family protein, putative isoform 2 [Theobroma cacao]
            gi|508782488|gb|EOY29744.1| Exostosin family protein,
            putative isoform 2 [Theobroma cacao]
          Length = 788

 Score =  638 bits (1645), Expect = e-180
 Identities = 318/562 (56%), Positives = 410/562 (72%)
 Frame = +3

Query: 303  SPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSLDIIRKEDTNL 482
            S E+ VDL KN  ++  E   N +V+E+  + E +   +      N S+ +I     T+ 
Sbjct: 232  STEQFVDLNKNSTVDYAES-FNKTVAEEASKTEESFSLKNDTIDVNTSNNNIGNGNFTS- 289

Query: 483  RLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSNISSVAKQLQDKI 662
              + T S                  T+  +   ++ N  T +  V+S+ SS+ + +    
Sbjct: 290  SAESTGSSDTGLGSPLPALTPTNSSTNKTLENDVETNIQTPVVSVNSSTSSLEQHVTPSF 349

Query: 663  SKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVLLRNHASSGSEKP 842
             K+EK E +K     S+ +S  T+ P   +  + P ++ +I++MN +  ++  S  S+ P
Sbjct: 350  DKNEKVEEIKNNFTTSSDNSSPTNTPKVGKKPEMPPALTTIADMNNLFYQSRVSYYSKTP 409

Query: 843  RWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEG 1022
            RWS+  D+ +L+A+ QI+NA I  K D  L+ PLFRNVS+FKRSYELME TLKVYVY+EG
Sbjct: 410  RWSSGADQVLLNARSQIENAPIV-KNDPRLYAPLFRNVSMFKRSYELMESTLKVYVYQEG 468

Query: 1023 AKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSH 1202
             +PI H PI KGIYASEGWFMK +E +K+FV K+PR AHLFYLPFSSRMLE TLYVP+SH
Sbjct: 469  KRPIVHTPILKGIYASEGWFMKQLEANKKFVTKNPREAHLFYLPFSSRMLEETLYVPDSH 528

Query: 1203 SHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNAD 1382
            +HKNL++YLK+Y+ +I+AKYPFWNRT GADHFLVACHDWAP+ETR  HM+N IRALCN+D
Sbjct: 529  NHKNLIEYLKNYVGIIAAKYPFWNRTEGADHFLVACHDWAPSETR-KHMANCIRALCNSD 587

Query: 1383 VHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWEN 1562
            + E ++ GKDVSLPET V +P++P +D GGKPPS+R ILAFFAG+MHGYLRPILL+ W N
Sbjct: 588  IREGYIFGKDVSLPETYVRNPQKPLRDLGGKPPSKRSILAFFAGSMHGYLRPILLEQWGN 647

Query: 1563 KDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDN 1742
            KDPDMKIFG+M    + KMNYIQHMKSSKYC+C +G+EVNSPRVVE+I Y CVPVIISDN
Sbjct: 648  KDPDMKIFGKM-PNVKGKMNYIQHMKSSKYCLCPRGYEVNSPRVVEAIFYGCVPVIISDN 706

Query: 1743 YVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVK 1922
            +VPPFFEVL+WE+FAVFV EKDIPNLK ILLSIP+KR+  M LR+KK+QQHF+WH +P K
Sbjct: 707  FVPPFFEVLNWESFAVFVLEKDIPNLKKILLSIPEKRFRQMQLRVKKIQQHFLWHPRPEK 766

Query: 1923 YDIFHMILHSIWYNRVFQIKAK 1988
            YDIFHMILHS+WYNRVFQ+K +
Sbjct: 767  YDIFHMILHSVWYNRVFQMKPR 788


>ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
          Length = 675

 Score =  638 bits (1645), Expect = e-180
 Identities = 341/627 (54%), Positives = 439/627 (70%), Gaps = 8/627 (1%)
 Frame = +3

Query: 132  TDGIVRTTSASNMEE-ETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLMKPANKYSP 308
            +D + +  +  NM   +  +++DV +     SN  + E  +    ++ A++M  A   S 
Sbjct: 61   SDSLSKLGTMGNMTTAQGLNSSDVHAMHGIDSNAETMEGNNEGPKNDFASVMNGALDKSF 120

Query: 309  EKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSLDIIRKEDTNLRL 488
                D  KN  +E V +  N S  +   + E++   E      N SSL  I+++D  L  
Sbjct: 121  GLDED-NKNVTVEKVNNSGNRSALKNASKHESSLYLENITADSN-SSLGKIQEDDMALLS 178

Query: 489  DGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSNI-------SSVAKQ 647
              +E   +                 PA+P  +  +++TSL  +D +        SSV + 
Sbjct: 179  QRSERSGVGLISPL-----------PALPQIISSSNTTSLTNLDPHPITLPPERSSVEED 227

Query: 648  LQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVLLRNHASS 827
                ++KDEK E+ ++    SN SSI  S P  +   + P+ V +ISEMN +L+++ ASS
Sbjct: 228  AAHTLNKDEKAETSQKDLTLSNRSSI--SVPALETRPELPA-VTTISEMNDLLVQSRASS 284

Query: 828  GSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMERTLKVY 1007
             S KPRWS+  D+E+L AK QI+NA I  K D  L   L+RNVSVFKRSYELME TLKVY
Sbjct: 285  RSMKPRWSSAVDKELLYAKSQIENAPII-KNDPGLHASLYRNVSVFKRSYELMENTLKVY 343

Query: 1008 VYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEATLY 1187
             Y+EG +P+FH+P  KGIYASEGWFMK M+ +K+FV K+ R+AHLFYLPFSS MLE  LY
Sbjct: 344  TYREGERPVFHQPPIKGIYASEGWFMKLMQANKKFVTKNGRKAHLFYLPFSSLMLEEALY 403

Query: 1188 VPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMSNTIRA 1367
            VPNSHS KNL QYLK+Y+DMI AKYPFWNRTGGADHFLVACHDWAP+ET    M+N+IRA
Sbjct: 404  VPNSHSRKNLEQYLKNYLDMIGAKYPFWNRTGGADHFLVACHDWAPSETLKL-MANSIRA 462

Query: 1368 LCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRPILL 1547
            LCN+D+ E F +GKDVSLPET V  P+ P +  GGKPPSQR+ILAFFAG+MHGY+RPILL
Sbjct: 463  LCNSDIREGFKLGKDVSLPETCVRIPQNPLRQLGGKPPSQRRILAFFAGSMHGYVRPILL 522

Query: 1548 QHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESILYECVPV 1727
            ++WENKDPDMKI+GRM +  +  MNYIQHMKSSKYCICAKG+EVNSPRVVE+I YECVPV
Sbjct: 523  KYWENKDPDMKIYGRMPKAKKGTMNYIQHMKSSKYCICAKGYEVNSPRVVEAIFYECVPV 582

Query: 1728 IISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHFMWH 1907
            IISDN+VPPFF VL+WE+FAVF+ EKDIPNLK+ILLSIP+K YL + +R+K++QQHF+WH
Sbjct: 583  IISDNFVPPFFGVLNWESFAVFILEKDIPNLKSILLSIPEKSYLEIQMRVKQVQQHFLWH 642

Query: 1908 SKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            +KPVKYD+FHMILHS+WYNRV QI+ +
Sbjct: 643  AKPVKYDVFHMILHSVWYNRVLQIRVR 669


>ref|XP_006476045.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X2
            [Citrus sinensis]
          Length = 663

 Score =  635 bits (1639), Expect = e-179
 Identities = 346/649 (53%), Positives = 429/649 (66%), Gaps = 24/649 (3%)
 Frame = +3

Query: 114  IKNTSLTDGIVRTTSASNMEEETKHATDVKS--NAAEQSNGFSPESRSPVGASEQANLM- 284
            ++N SL  G         +E +++ A+D  +  N+    N     + +    +E ANL  
Sbjct: 55   VENNSLVTG--------GLESKSEIASDAVNGLNSTGTHNVHEMANDTRTSKAEDANLQA 106

Query: 285  ----------KPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKT 434
                      +P N    EK+  L KN  ++ V++  N    EK RE E + +Q      
Sbjct: 107  DFDDGEDIHEEPTN----EKLEGLNKNSTVDTVQNAGNVPGPEKGRESEQSFIQRN---- 158

Query: 435  ENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKF 614
                  DI+          G +SG +                D +  I L G + ++   
Sbjct: 159  ------DIM----------GGDSGGVGLSPIPVSPVM-----DLSSNITLQGANISTPIT 197

Query: 615  VDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEM 794
            + SN SS  K     + K EKP          N S +     NK   +  P+ V++I+EM
Sbjct: 198  IHSNSSSTDKDATPALDKIEKPAQSSLNTLGENSSGVDVPKENKKPEIPTPA-VITIAEM 256

Query: 795  NLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRS 974
              +LL+N AS  S +PRWS+  D+E+L A+ QI+NA +  K D EL+ PL+RNVS FKRS
Sbjct: 257  KNMLLQNRASYRSMRPRWSSAVDQEMLYARSQIENAPLL-KNDHELYAPLYRNVSRFKRS 315

Query: 975  YELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLP 1154
            YELME TLKVYVYKEG +PI H P+ KGIYASEGWFMK +E +KQFV K  R+AHLFYLP
Sbjct: 316  YELMEETLKVYVYKEGQRPILHEPVLKGIYASEGWFMKQLEANKQFVTKDSRKAHLFYLP 375

Query: 1155 FSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTET 1334
            FSSRMLE TLYV NSH+HKNL+QYL++Y+++ISAK+ FWNRT GADHFLVACHDWAP ET
Sbjct: 376  FSSRMLEETLYVQNSHNHKNLIQYLRNYVNLISAKHNFWNRTEGADHFLVACHDWAPAET 435

Query: 1335 RHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAG 1514
            R   M+N IRALCN+DV E FV GKDV+LPET V+SP+ P +  GGKP SQR ILAFFAG
Sbjct: 436  R-IIMANCIRALCNSDVKEGFVFGKDVALPETYVLSPQNPLRAIGGKPASQRSILAFFAG 494

Query: 1515 NMHGYLRPILLQHWENKDPDMKIFGRM----------GRGTR-SKMNYIQHMKSSKYCIC 1661
             MHGYLRPILL HWENKDPDMKIFG+M          G+G R  KM+YIQHMKSSKYCIC
Sbjct: 495  RMHGYLRPILLHHWENKDPDMKIFGQMPMVKGKGKGKGKGKRKGKMDYIQHMKSSKYCIC 554

Query: 1662 AKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSI 1841
            AKG+EVNSPRVVE+I YECVPVIISDN+VPPFFE+L+WE+FAVFV EKDIPNLKNILLSI
Sbjct: 555  AKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEILNWESFAVFVLEKDIPNLKNILLSI 614

Query: 1842 PQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
             +KRY  M +R+KK+QQHF+WH +PVKYDIFHM+LHSIWYNRVF  +A+
Sbjct: 615  SEKRYRRMQMRVKKVQQHFLWHPQPVKYDIFHMLLHSIWYNRVFLARAR 663


>ref|XP_006451253.1| hypothetical protein CICLE_v10007651mg [Citrus clementina]
            gi|568883066|ref|XP_006494321.1| PREDICTED: probable
            glycosyltransferase At5g03795-like [Citrus sinensis]
            gi|557554479|gb|ESR64493.1| hypothetical protein
            CICLE_v10007651mg [Citrus clementina]
          Length = 677

 Score =  632 bits (1631), Expect = e-178
 Identities = 334/629 (53%), Positives = 427/629 (67%), Gaps = 1/629 (0%)
 Frame = +3

Query: 105  SSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLM 284
            +SD+ + S+  G +     S    +T   + ++    + +NG   E +       Q N  
Sbjct: 80   ASDLMSDSVFKGSLEDDEDSKFGSDTGDDSGLREVDGDTNNGIVSEGKG------QDN-- 131

Query: 285  KPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSLDIIR 464
             P    +  +V D   +   ENV+D ++ S  E  R  EN+   E   + +    L  I 
Sbjct: 132  -PIELVTDREVDD---DSVAENVKDLNDLSELEIERIGENSATVEPAGEAKQSLPLKQIV 187

Query: 465  KEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSNISSVAK 644
            + +  +  DG    H                  P   I       T LK  +SN SS A+
Sbjct: 188  QPNLEIVSDGVPEQHTSQSIANIGGEKTLSIVSPLTNI-------THLKTEESNASSAAR 240

Query: 645  QLQDKISKDEKPESLKRFPAHSNVSSIITS-NPNKDQWMKPPSSVMSISEMNLVLLRNHA 821
                 + K +   S+       N+S++I S    K +   PP +V SI EMN +L+R+H 
Sbjct: 241  SA---VPKSDIATSV-------NISALIGSPGKKKMRCNMPPKTVTSIFEMNDILMRHHR 290

Query: 822  SSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMERTLK 1001
            SS + +PRWS+ RD+E+L+AK +I+ A+++   DQEL  PLFRNVS+FKRSYELM+RTLK
Sbjct: 291  SSRAMRPRWSSVRDKEVLAAKTEIEKASVSVS-DQELHAPLFRNVSMFKRSYELMDRTLK 349

Query: 1002 VYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEAT 1181
            VYVY++G KPIFH+PI KG+YASEGWFMK MEG+K F VK PR+AHLFY+PFSSRMLE  
Sbjct: 350  VYVYRDGKKPIFHQPILKGLYASEGWFMKLMEGNKHFAVKDPRKAHLFYMPFSSRMLEYA 409

Query: 1182 LYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMSNTI 1361
            LYV NSH+  NL QYLK+Y + I+AKY +WNRTGGADHFLVACHDWAP ETRH HM + I
Sbjct: 410  LYVRNSHNRTNLRQYLKEYAESIAAKYRYWNRTGGADHFLVACHDWAPYETRH-HMEHCI 468

Query: 1362 RALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRPI 1541
            +ALCNADV   F +G+DVSLPET V S + P +D GGKPPSQR ILAF+AGN+HGYLRPI
Sbjct: 469  KALCNADVTAGFKLGRDVSLPETYVRSARNPLRDLGGKPPSQRHILAFYAGNLHGYLRPI 528

Query: 1542 LLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESILYECV 1721
            LL++W++KDPDMKIFG M  G  SKMNYIQHMKSSKYCIC KG+EVNSPRVVESI YECV
Sbjct: 529  LLKYWKDKDPDMKIFGPMPPGVASKMNYIQHMKSSKYCICPKGYEVNSPRVVESIFYECV 588

Query: 1722 PVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHFM 1901
            PVIISDN+VPPF+EVL+WEAF+V +AE++IPNLK+ILLSIP+K+Y  M   ++KLQ+HF+
Sbjct: 589  PVIISDNFVPPFYEVLNWEAFSVIIAEENIPNLKDILLSIPEKKYFEMQFAVRKLQRHFL 648

Query: 1902 WHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            WH+KP KYD+FHM LHSIWYNRV+QIK +
Sbjct: 649  WHAKPEKYDLFHMTLHSIWYNRVYQIKPR 677


>ref|XP_006450684.1| hypothetical protein CICLE_v10007698mg [Citrus clementina]
            gi|557553910|gb|ESR63924.1| hypothetical protein
            CICLE_v10007698mg [Citrus clementina]
          Length = 652

 Score =  630 bits (1624), Expect = e-177
 Identities = 345/637 (54%), Positives = 423/637 (66%), Gaps = 12/637 (1%)
 Frame = +3

Query: 114  IKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLMKPA 293
            ++N SL  G     S S +  +T  A  + S      +  + ++R+    +E ANL    
Sbjct: 55   VENNSLVTG--GPESKSEIASDT--ANGLNSTGTHNVHEMANDTRT--SKAEDANLQDDF 108

Query: 294  -----NKYSP--EKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSL 452
                 N   P  EK+ +L KN  ++ V++  NG   EK RE E + +Q        G+ L
Sbjct: 109  YDGEDNHEEPMTEKLEELNKNSTVDTVQNAGNGPGPEKGRESEQSFIQRN---DSGGAGL 165

Query: 453  DIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSNIS 632
              I                                 D +  I L G + ++   + SN S
Sbjct: 166  SPIPVSPV---------------------------MDLSSNITLQGANISTPITIHSNSS 198

Query: 633  SVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVLLR 812
            S  K     + K EKP          N S +     NK   +  P+ V++I+EM  +LL+
Sbjct: 199  STDKDATPALDKIEKPAQSSLNTLGENSSGVDVPKENKKPEIPTPA-VITIAEMKNMLLQ 257

Query: 813  NHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMER 992
            N AS  S  PR S+  D+E+L A+ QI+NA +  K D EL+ PL+RNVS FKRSYELME 
Sbjct: 258  NRASYRSMSPRLSSAVDQEMLYARSQIENAPLL-KNDHELYAPLYRNVSRFKRSYELMEE 316

Query: 993  TLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRML 1172
            TLKVYVYKEG +PI H P+ KGIYASEGWFMK +E +KQFV K  R+AHLFYLPFSSRML
Sbjct: 317  TLKVYVYKEGQRPILHEPVLKGIYASEGWFMKQLEANKQFVTKDSRKAHLFYLPFSSRML 376

Query: 1173 EATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMS 1352
            E TLYV NSH+HKNL+QYL++Y+++ISAK+ FWNRT GADHFLVACHDWAP ETR   M+
Sbjct: 377  EETLYVQNSHNHKNLIQYLRNYVNLISAKHNFWNRTEGADHFLVACHDWAPAETR-IIMA 435

Query: 1353 NTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYL 1532
            N IRALCN+DV + FV GKDVSLPET V+SP+ P    GGKP SQR ILAFFAG+MHGYL
Sbjct: 436  NCIRALCNSDVKQGFVFGKDVSLPETNVLSPQNPLWAIGGKPASQRSILAFFAGSMHGYL 495

Query: 1533 RPILLQHWENKDPDMKIFGRM----GRGTR-SKMNYIQHMKSSKYCICAKGFEVNSPRVV 1697
            RPILL HWENKDPDMKIFG+M    GRG R  KM+YIQHMKSSKYCICAKG+EV+SPRVV
Sbjct: 496  RPILLHHWENKDPDMKIFGQMPKAKGRGKRKGKMDYIQHMKSSKYCICAKGYEVHSPRVV 555

Query: 1698 ESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRI 1877
            E+I YECVPVIISDN+VPPFFE+L+WE+FAVFV EKDIPNLKNILLSI +KRY  M + +
Sbjct: 556  EAIFYECVPVIISDNFVPPFFEILNWESFAVFVLEKDIPNLKNILLSISEKRYRKMQMMV 615

Query: 1878 KKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            KK+QQHF+WH +PVKYDIFHMILHSIWYNRVF  +A+
Sbjct: 616  KKVQQHFLWHPRPVKYDIFHMILHSIWYNRVFLARAR 652


>ref|XP_006476044.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
            [Citrus sinensis]
          Length = 670

 Score =  629 bits (1621), Expect = e-177
 Identities = 346/656 (52%), Positives = 429/656 (65%), Gaps = 31/656 (4%)
 Frame = +3

Query: 114  IKNTSLTDGIVRTTSASNMEEETKHATDVKS--NAAEQSNGFSPESRSPVGASEQANLM- 284
            ++N SL  G         +E +++ A+D  +  N+    N     + +    +E ANL  
Sbjct: 55   VENNSLVTG--------GLESKSEIASDAVNGLNSTGTHNVHEMANDTRTSKAEDANLQA 106

Query: 285  ----------KPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKT 434
                      +P N    EK+  L KN  ++ V++  N    EK RE E + +Q      
Sbjct: 107  DFDDGEDIHEEPTN----EKLEGLNKNSTVDTVQNAGNVPGPEKGRESEQSFIQRN---- 158

Query: 435  ENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKF 614
                  DI+          G +SG +                D +  I L G + ++   
Sbjct: 159  ------DIM----------GGDSGGVGLSPIPVSPVM-----DLSSNITLQGANISTPIT 197

Query: 615  VDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEM 794
            + SN SS  K     + K EKP          N S +     NK   +  P+ V++I+EM
Sbjct: 198  IHSNSSSTDKDATPALDKIEKPAQSSLNTLGENSSGVDVPKENKKPEIPTPA-VITIAEM 256

Query: 795  NLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKR- 971
              +LL+N AS  S +PRWS+  D+E+L A+ QI+NA +  K D EL+ PL+RNVS FKR 
Sbjct: 257  KNMLLQNRASYRSMRPRWSSAVDQEMLYARSQIENAPLL-KNDHELYAPLYRNVSRFKRF 315

Query: 972  ------SYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRR 1133
                  SYELME TLKVYVYKEG +PI H P+ KGIYASEGWFMK +E +KQFV K  R+
Sbjct: 316  YNAICRSYELMEETLKVYVYKEGQRPILHEPVLKGIYASEGWFMKQLEANKQFVTKDSRK 375

Query: 1134 AHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACH 1313
            AHLFYLPFSSRMLE TLYV NSH+HKNL+QYL++Y+++ISAK+ FWNRT GADHFLVACH
Sbjct: 376  AHLFYLPFSSRMLEETLYVQNSHNHKNLIQYLRNYVNLISAKHNFWNRTEGADHFLVACH 435

Query: 1314 DWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQ 1493
            DWAP ETR   M+N IRALCN+DV E FV GKDV+LPET V+SP+ P +  GGKP SQR 
Sbjct: 436  DWAPAETRII-MANCIRALCNSDVKEGFVFGKDVALPETYVLSPQNPLRAIGGKPASQRS 494

Query: 1494 ILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRM----------GRGTRS-KMNYIQHMK 1640
            ILAFFAG MHGYLRPILL HWENKDPDMKIFG+M          G+G R  KM+YIQHMK
Sbjct: 495  ILAFFAGRMHGYLRPILLHHWENKDPDMKIFGQMPMVKGKGKGKGKGKRKGKMDYIQHMK 554

Query: 1641 SSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNL 1820
            SSKYCICAKG+EVNSPRVVE+I YECVPVIISDN+VPPFFE+L+WE+FAVFV EKDIPNL
Sbjct: 555  SSKYCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEILNWESFAVFVLEKDIPNL 614

Query: 1821 KNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            KNILLSI +KRY  M +R+KK+QQHF+WH +PVKYDIFHM+LHSIWYNRVF  +A+
Sbjct: 615  KNILLSISEKRYRRMQMRVKKVQQHFLWHPQPVKYDIFHMLLHSIWYNRVFLARAR 670


>ref|XP_007012124.1| Exostosin family protein, putative isoform 1 [Theobroma cacao]
            gi|508782487|gb|EOY29743.1| Exostosin family protein,
            putative isoform 1 [Theobroma cacao]
          Length = 802

 Score =  628 bits (1620), Expect = e-177
 Identities = 318/576 (55%), Positives = 410/576 (71%), Gaps = 14/576 (2%)
 Frame = +3

Query: 303  SPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSLDIIRKEDTNL 482
            S E+ VDL KN  ++  E   N +V+E+  + E +   +      N S+ +I     T+ 
Sbjct: 232  STEQFVDLNKNSTVDYAES-FNKTVAEEASKTEESFSLKNDTIDVNTSNNNIGNGNFTS- 289

Query: 483  RLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSNISSVAKQLQDKI 662
              + T S                  T+  +   ++ N  T +  V+S+ SS+ + +    
Sbjct: 290  SAESTGSSDTGLGSPLPALTPTNSSTNKTLENDVETNIQTPVVSVNSSTSSLEQHVTPSF 349

Query: 663  SKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVLLRNHASSGSEKP 842
             K+EK E +K     S+ +S  T+ P   +  + P ++ +I++MN +  ++  S  S+ P
Sbjct: 350  DKNEKVEEIKNNFTTSSDNSSPTNTPKVGKKPEMPPALTTIADMNNLFYQSRVSYYSKTP 409

Query: 843  RWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFK--------------RSYE 980
            RWS+  D+ +L+A+ QI+NA I  K D  L+ PLFRNVS+FK              RSYE
Sbjct: 410  RWSSGADQVLLNARSQIENAPIV-KNDPRLYAPLFRNVSMFKSQVHNVYTICIINFRSYE 468

Query: 981  LMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFS 1160
            LME TLKVYVY+EG +PI H PI KGIYASEGWFMK +E +K+FV K+PR AHLFYLPFS
Sbjct: 469  LMESTLKVYVYQEGKRPIVHTPILKGIYASEGWFMKQLEANKKFVTKNPREAHLFYLPFS 528

Query: 1161 SRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRH 1340
            SRMLE TLYVP+SH+HKNL++YLK+Y+ +I+AKYPFWNRT GADHFLVACHDWAP+ETR 
Sbjct: 529  SRMLEETLYVPDSHNHKNLIEYLKNYVGIIAAKYPFWNRTEGADHFLVACHDWAPSETRK 588

Query: 1341 AHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNM 1520
             HM+N IRALCN+D+ E ++ GKDVSLPET V +P++P +D GGKPPS+R ILAFFAG+M
Sbjct: 589  -HMANCIRALCNSDIREGYIFGKDVSLPETYVRNPQKPLRDLGGKPPSKRSILAFFAGSM 647

Query: 1521 HGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVE 1700
            HGYLRPILL+ W NKDPDMKIFG+M    + KMNYIQHMKSSKYC+C +G+EVNSPRVVE
Sbjct: 648  HGYLRPILLEQWGNKDPDMKIFGKMPN-VKGKMNYIQHMKSSKYCLCPRGYEVNSPRVVE 706

Query: 1701 SILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIK 1880
            +I Y CVPVIISDN+VPPFFEVL+WE+FAVFV EKDIPNLK ILLSIP+KR+  M LR+K
Sbjct: 707  AIFYGCVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPNLKKILLSIPEKRFRQMQLRVK 766

Query: 1881 KLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            K+QQHF+WH +P KYDIFHMILHS+WYNRVFQ+K +
Sbjct: 767  KIQQHFLWHPRPEKYDIFHMILHSVWYNRVFQMKPR 802


>ref|XP_006353481.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
            [Solanum tuberosum] gi|565373856|ref|XP_006353482.1|
            PREDICTED: probable glycosyltransferase At5g03795-like
            isoform X2 [Solanum tuberosum]
          Length = 674

 Score =  627 bits (1617), Expect = e-177
 Identities = 330/638 (51%), Positives = 428/638 (67%), Gaps = 10/638 (1%)
 Frame = +3

Query: 105  SSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLM 284
            SS +++T + +    T S+ +      H      N     +G   ES +      + + +
Sbjct: 65   SSVVESTKVGESFSGTLSSFDDVHMLAHRLKTVDNGDVSEDGEIDESVN------EKDEV 118

Query: 285  KPANKYSPEKVVDLGKNF----------RIENVEDPHNGSVSEKTREPENANVQEQFHKT 434
            KP + +S  K ++   +F            + V D    + ++K  E       EQ  KT
Sbjct: 119  KPHSNHSVVKTMENDSDFVEDAILENDNLFDEVVDMDEETTTQKNNESRRDLSLEQVVKT 178

Query: 435  ENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKF 614
                S D     + N  L+ T++  +               T+ +  +    N   +L  
Sbjct: 179  NGELSADSELDANRNSVLNDTKAASV---------------TNSSSVVA--SNQLDNLPL 221

Query: 615  VDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEM 794
            V     +  +   +  S  +  + L   P H N S + ++   K + M PP +V SIS+M
Sbjct: 222  VTIGEINFIRTTSNNSSTGDLTQLL---PNHGNHSLVQSTVKKKMRCMLPPKTVTSISQM 278

Query: 795  NLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRS 974
              +L+R+ A S + +PRWS+ RD+EIL+A+ QI+NA +  + D+EL+ P FRN+S+FKRS
Sbjct: 279  ERLLVRHRARSRAMRPRWSSERDKEILAARLQIENAPLL-RNDRELYAPAFRNMSMFKRS 337

Query: 975  YELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLP 1154
            YELMER LKVYVYKEG KPIFH+PI KG+YASEGWFMK MEG+ +FVVK PR+AHLFYLP
Sbjct: 338  YELMERILKVYVYKEGEKPIFHQPIMKGLYASEGWFMKLMEGNNRFVVKDPRKAHLFYLP 397

Query: 1155 FSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTET 1334
            FSSRMLE +LYV NSH+  NL QYLKDY + I+AKY FWNRTGGADHFLVACHDWAP ET
Sbjct: 398  FSSRMLEHSLYVHNSHNRTNLRQYLKDYSEKIAAKYRFWNRTGGADHFLVACHDWAPYET 457

Query: 1335 RHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAG 1514
            RH HM + I+ALCNADV   F IG+DVSLPET V S + P +D GGKPPSQR++LAF+AG
Sbjct: 458  RH-HMEHCIKALCNADVTLGFKIGRDVSLPETYVRSARNPLRDLGGKPPSQRKVLAFYAG 516

Query: 1515 NMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRV 1694
            NMHGYLRPILL+HW++KDPDM+IFG M  G  SKMNYIQHMKSSK+CIC KG+EVNSPRV
Sbjct: 517  NMHGYLRPILLEHWKDKDPDMEIFGPMPSGVASKMNYIQHMKSSKFCICPKGYEVNSPRV 576

Query: 1695 VESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLR 1874
            VE+I YECVPVIISDN+VPPFF VL+W+ F++ +AEKDIPNLK+ILLSIP+ +YL M L 
Sbjct: 577  VEAIFYECVPVIISDNFVPPFFGVLNWDTFSLILAEKDIPNLKSILLSIPENKYLEMQLA 636

Query: 1875 IKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            ++K+Q+HF+WH+KPVKYD+FHM LHSIWYNRVFQ KA+
Sbjct: 637  VRKVQRHFLWHAKPVKYDLFHMTLHSIWYNRVFQTKAR 674


>ref|XP_006476046.1| PREDICTED: probable glycosyltransferase At5g03795-like [Citrus
            sinensis]
          Length = 653

 Score =  626 bits (1615), Expect = e-176
 Identities = 343/639 (53%), Positives = 425/639 (66%), Gaps = 14/639 (2%)
 Frame = +3

Query: 114  IKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLMKPA 293
            ++N SL  G     S S +  +T  A  + S      +  + ++R+    +E ANL    
Sbjct: 55   VENNSLVTG--GPESKSEIASDT--ANGLNSTGTHNVHEMANDTRT--SKAEDANLQDDF 108

Query: 294  -----NKYSP--EKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSL 452
                 N   P  EK+ +L KN  ++ V++  NG   EK RE E + +Q         S +
Sbjct: 109  YDGEDNHEEPMTEKLEELNKNSTVDTVQNAGNGPGPEKGRESEQSFIQRNDSGGAGLSPI 168

Query: 453  DIIRKED--TNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSN 626
             +    D  +N+ L G                                N ST    +DSN
Sbjct: 169  PVSPVMDLSSNITLQGA-------------------------------NISTPPITIDSN 197

Query: 627  ISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVL 806
             SS+       + K EKP          N S +     NK   +  P+ V++I+EM  +L
Sbjct: 198  TSSMDMDATPALVKIEKPAQSSLNTLGENSSGVDVPKENKKPEIPTPA-VITIAEMKNML 256

Query: 807  LRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELM 986
            L+N AS  S +PR S+  D+E+L A+ QI+NA +  K D EL+ PL+R+VS FKRSYELM
Sbjct: 257  LQNRASYRSMRPRLSSAVDQEMLYARSQIENAPLL-KNDHELYAPLYRSVSRFKRSYELM 315

Query: 987  ERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSR 1166
            E TLKVYVYKEG +PI H P+ KGIYASEGWFMK +E +KQFV +  R+AHLFYLPFSSR
Sbjct: 316  EETLKVYVYKEGQRPILHEPVLKGIYASEGWFMKQLEANKQFVTRDSRKAHLFYLPFSSR 375

Query: 1167 MLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAH 1346
            MLE TLYV NSH+HK+L+QYL++Y++MISAK+ FWNRT GADHFLVACHDWAP ETR   
Sbjct: 376  MLEETLYVQNSHNHKDLIQYLRNYVNMISAKHNFWNRTEGADHFLVACHDWAPAETR-II 434

Query: 1347 MSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHG 1526
            M+N IRALCN+DV + FV GKDVSLPET V+SP+ P    GGKP SQR ILAFFAG+MHG
Sbjct: 435  MANCIRALCNSDVKQGFVFGKDVSLPETNVLSPQNPLWAIGGKPASQRSILAFFAGSMHG 494

Query: 1527 YLRPILLQHWENKDPDMKIFGRM----GRGTR-SKMNYIQHMKSSKYCICAKGFEVNSPR 1691
            YLRPILL HWENKDPDMKIFG+M    GRG R  K +YIQHMKSSKYCICAKG+EV+SPR
Sbjct: 495  YLRPILLHHWENKDPDMKIFGQMPKAKGRGKRKGKTDYIQHMKSSKYCICAKGYEVHSPR 554

Query: 1692 VVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHL 1871
            VVE+I YECVPVIISDN+VPPFFE+L+WE+FAVFV E+DIPNLKNILLSI +KRYL M +
Sbjct: 555  VVEAIFYECVPVIISDNFVPPFFEILNWESFAVFVLERDIPNLKNILLSISEKRYLKMQM 614

Query: 1872 RIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
             +KK+QQHF+WH +PVKYDIFHMILHSIWYNRVF  +A+
Sbjct: 615  MVKKVQQHFLWHPRPVKYDIFHMILHSIWYNRVFLARAR 653


>ref|XP_002309547.2| hypothetical protein POPTR_0006s25540g [Populus trichocarpa]
            gi|550337072|gb|EEE93070.2| hypothetical protein
            POPTR_0006s25540g [Populus trichocarpa]
          Length = 705

 Score =  626 bits (1615), Expect = e-176
 Identities = 336/651 (51%), Positives = 430/651 (66%), Gaps = 26/651 (3%)
 Frame = +3

Query: 114  IKNTSLTDGIVRTTSASNMEEET--KHATDVKSNAAEQSNGFSPESRSPVGASEQANLMK 287
            + NT+ ++G+  T  + +  +ET   H T+  +N    +N   PE     G +E + +  
Sbjct: 70   VSNTTQSNGLNTTAISPDRAQETDNSHGTETPANV---NNDVVPERSR--GLNESSLIDS 124

Query: 288  PANKYSPEKVVDLGKNFRIENVEDPHNGSVSE---------------KTREPENANVQEQ 422
               + SPE++VD   N    +    HNG VSE               K  +PE   V E 
Sbjct: 125  RGKESSPEQLVDTNTN----STSYVHNGVVSEGISGLNKSSGIDNHGKESKPEQL-VMEP 179

Query: 423  FHKTENGSS-----LDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLD 587
             +   NGS+       + R++ T++       G                 + P + ++++
Sbjct: 180  VNSLGNGSAPQETERSLSREDVTSI---SENIGASDARIAPIAPELLPVDSPPNITLQMN 236

Query: 588  GNSSTSLKFV--DSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPN-KDQWM 758
               ST    V  +SN S V K     +  D K    K+     + +  +TS P  K +  
Sbjct: 237  AEPSTIAHIVPIESNTSKVDKDAAPSLENDGKTGDQKKDLTLLHNNPSVTSFPEVKKEPQ 296

Query: 759  KPPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFP 938
             P   V+SISEM  + L+  +S  S +PRW +  D+E+L+AK QIQNA I E  D  L+ 
Sbjct: 297  TPSLEVVSISEMKNLQLQRWSSPNSRRPRWPSVVDQELLNAKSQIQNAPIVEN-DPVLYA 355

Query: 939  PLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVV 1118
            PL+ N+S+FK+SYELME  LKVY+YKEG  PIFH+P+  GIYASEGWFMK +EG+K+FV 
Sbjct: 356  PLYWNISMFKKSYELMEDILKVYIYKEGEMPIFHQPLLNGIYASEGWFMKLLEGNKKFVT 415

Query: 1119 KSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHF 1298
            K  ++AHLFYLPFSSR LE  LYVPNSHSHKNL++YLK Y+DMIS KYPFWNRT GADHF
Sbjct: 416  KDSKKAHLFYLPFSSRYLEIRLYVPNSHSHKNLIEYLKKYLDMISEKYPFWNRTQGADHF 475

Query: 1299 LVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKP 1478
            L ACHDWAP+ETR  HM+N IRALCN+D  E FV GKD SLPET V++ + P +D GG  
Sbjct: 476  LAACHDWAPSETRQ-HMANCIRALCNSDAKEDFVYGKDASLPETYVLTQENPLRDLGGNR 534

Query: 1479 PSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGR-GTRSKMNYIQHMKSSKYC 1655
             S+R ILAFFAG+MHGYLRPILLQHWENKDPDMKIFGR+ +   R KMNY ++MKSSKYC
Sbjct: 535  ASKRSILAFFAGSMHGYLRPILLQHWENKDPDMKIFGRLPKVKGRGKMNYARYMKSSKYC 594

Query: 1656 ICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILL 1835
            ICAKG+EVNSPRVVE+I YECVPVIISDN+VPPF EVL+WE+FAVFV EKDIPNLK ILL
Sbjct: 595  ICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVFVLEKDIPNLKKILL 654

Query: 1836 SIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            SIP K+Y  M +R+K++QQHF+WH++PVKYD+FHMILHSIWYNRVFQ++ +
Sbjct: 655  SIPAKKYRRMQMRVKRVQQHFLWHARPVKYDVFHMILHSIWYNRVFQMQPR 705


>ref|XP_004287457.1| PREDICTED: probable glycosyltransferase At3g07620-like [Fragaria
            vesca subsp. vesca]
          Length = 686

 Score =  625 bits (1613), Expect = e-176
 Identities = 336/657 (51%), Positives = 436/657 (66%), Gaps = 16/657 (2%)
 Frame = +3

Query: 60   STAEDSVSKSAIIGISSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSP 239
            S+A+  + ++ +   SSD+ +     G+ +    S++  ET     V  +   +  GF  
Sbjct: 65   SSAKSVMVRNPLTVNSSDLIDAPRFGGVEKYADNSSLGGET-----VDKSEPNEKEGFK- 118

Query: 240  ESRSPVGASEQANLMKPA------NKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTRE-- 395
            E  S +   E  N  + A        +     VD   +  + ++    NGS   KT E  
Sbjct: 119  EIDSVLEEKEMDNTFEHAADRNVDENFPSGNGVDTDASLTLVSISKEENGSNLVKTNEAS 178

Query: 396  ---PENANVQEQFHKTENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDP 566
               PE   + +    TEN   +++      +   +G ++                  TD 
Sbjct: 179  YDFPEPTVLSKDEVSTENTLEVNMTMAAKHS---EGVKTIFPSSPLILPATASFTHQTDV 235

Query: 567  AMPIKLDGNSSTSL--KFVDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNP 740
                 L  N+S+S+   F++S+I ++            K +SL R            ++P
Sbjct: 236  TYVSYLVSNASSSVGSAFLESDIVTI------------KNDSLTR------------TSP 271

Query: 741  NKDQWMK---PPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAIT 911
             K + MK   PP S+ SI EMNL L+R+HA   + +PRWS+ RD++IL+ K QIQ+  + 
Sbjct: 272  GK-KMMKCNMPPKSITSIDEMNLTLVRHHAKPRALRPRWSSVRDQDILAVKSQIQHPPVA 330

Query: 912  EKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKH 1091
             K D+EL+ PL+RNVS+FKRSYELMERTLKVY+YKEG KPIFH+PI KG+YASEGWFMK 
Sbjct: 331  -KNDRELYAPLYRNVSMFKRSYELMERTLKVYIYKEGNKPIFHQPIMKGLYASEGWFMKL 389

Query: 1092 MEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFW 1271
            MEG K+FVVK PR+AHLFY+PFSSRMLE TLYV NSH+   L QYLK+Y + I+AKYPFW
Sbjct: 390  MEGDKRFVVKDPRKAHLFYMPFSSRMLEFTLYVRNSHNRTKLRQYLKEYSETIAAKYPFW 449

Query: 1272 NRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKE 1451
            NRTGGADHFLVACHDWAP ETRH HM   I+ALCNADV + F IG+D+SLPET V S + 
Sbjct: 450  NRTGGADHFLVACHDWAPYETRH-HMERCIKALCNADVTQGFKIGRDISLPETYVRSARN 508

Query: 1452 PAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQ 1631
            P +D GGK  S+RQ+L F+AGNMHGYLRPILL++W++KDPDMKIFG M  G  SKMNYI+
Sbjct: 509  PLRDLGGKRASERQVLTFYAGNMHGYLRPILLKYWKDKDPDMKIFGPMPPGVASKMNYIE 568

Query: 1632 HMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDI 1811
            HMKSSKYC+C KG+EVNSPRVVE+I YEC+PVIISDN+VPPFFEVL+WEAF++ +AEKDI
Sbjct: 569  HMKSSKYCLCPKGYEVNSPRVVEAIFYECIPVIISDNFVPPFFEVLNWEAFSLILAEKDI 628

Query: 1812 PNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIK 1982
            PNLKNILLSIP ++YL M L +K++Q+HF+WH KP+KYD+FHM LHSIWYNR+FQIK
Sbjct: 629  PNLKNILLSIPDEKYLQMQLAVKRVQKHFLWHPKPLKYDLFHMTLHSIWYNRLFQIK 685


>ref|XP_007225154.1| hypothetical protein PRUPE_ppa002395mg [Prunus persica]
            gi|462422090|gb|EMJ26353.1| hypothetical protein
            PRUPE_ppa002395mg [Prunus persica]
          Length = 678

 Score =  623 bits (1606), Expect = e-175
 Identities = 334/640 (52%), Positives = 431/640 (67%), Gaps = 4/640 (0%)
 Frame = +3

Query: 75   SVSKSAIIG---ISSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPES 245
            S S S I+G   +S+D+ NT         T A + +     ++D      E SN      
Sbjct: 63   SPSNSEIVGNLSLSNDLNNTG--------TYAIHEKASNTRSSDSVLEGHEGSN------ 108

Query: 246  RSPVGASEQANLMKPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQF 425
               +  +E  +  K A   S   +V   +   +EN++        E  REPE ++V+++ 
Sbjct: 109  -RALEINEDEDDGKDA---SSGNLVKQNRTIIVENIKPLETNFAQEGGREPEVSSVEKK- 163

Query: 426  HKTENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTS 605
            + T+N      I  E+  + +  + +G +               T PA+    + N    
Sbjct: 164  NTTDNTYLEGRIGNENNTVDVVNSTAG-LPVSSPAPPMMNSSPSTAPAI---FETNVGAP 219

Query: 606  LKFVDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPN-KDQWMKPPSSVMS 782
            +K VDSN++SV K       K E  E L      +  +S +T  P  K +   P   V S
Sbjct: 220  IKSVDSNVTSVEKDRTTPSEKTENSEQLHSDLNQTEHNSSMTRVPEVKIEPEVPILDVYS 279

Query: 783  ISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSV 962
            IS+MN +LL++ AS  S   +WS+P D+E+     QI+NA I  K D  L+  L+RN+SV
Sbjct: 280  ISDMNNLLLQSRASYNSMLAQWSSPADQELQYVASQIENAPII-KSDPTLYALLYRNLSV 338

Query: 963  FKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHL 1142
            FKRSYELME TLKVYVY+EG +PI H P  KGIYASEGWFMK +E  K+FV K+P++AHL
Sbjct: 339  FKRSYELMEDTLKVYVYREGERPILHSPFLKGIYASEGWFMKQLEADKKFVTKNPQKAHL 398

Query: 1143 FYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWA 1322
            +YLPFSSR LE  LYVPNSHSHKNL+QYLKDY+DMI+ K+PFWNRTGGADHFLVACHDWA
Sbjct: 399  YYLPFSSRTLEERLYVPNSHSHKNLIQYLKDYVDMIAVKHPFWNRTGGADHFLVACHDWA 458

Query: 1323 PTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILA 1502
            P+ET+  +M+  IRALCN+D+ E FV GKDVSLPET + + K P +D GG  PS+R ILA
Sbjct: 459  PSETK-KYMATCIRALCNSDIKEGFVFGKDVSLPETYIKNDKNPLRDLGGNRPSKRSILA 517

Query: 1503 FFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVN 1682
            FFAG+MHGYLRPILLQHWE+KDPDMKIFG++ +  +   NY+++M+SSKYCICAKG+EVN
Sbjct: 518  FFAGSMHGYLRPILLQHWEDKDPDMKIFGKLPK-VKGNKNYVRYMQSSKYCICAKGYEVN 576

Query: 1683 SPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLM 1862
            SPRVVE+I YECVPVIISDN+VPPFFEVL+WE+FAVFV EKDIPNLKNILLSIP+K+YL 
Sbjct: 577  SPRVVEAIFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPNLKNILLSIPKKKYLQ 636

Query: 1863 MHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIK 1982
            M +R+KK+Q+HF+WH+KP KYDIFHMILHSIWYNR+ Q+K
Sbjct: 637  MQMRVKKVQKHFLWHAKPEKYDIFHMILHSIWYNRLHQLK 676


>ref|XP_002324801.2| hypothetical protein POPTR_0018s00290g [Populus trichocarpa]
            gi|550317697|gb|EEF03366.2| hypothetical protein
            POPTR_0018s00290g [Populus trichocarpa]
          Length = 707

 Score =  622 bits (1603), Expect = e-175
 Identities = 329/643 (51%), Positives = 426/643 (66%), Gaps = 21/643 (3%)
 Frame = +3

Query: 114  IKNTSLTDGIVRTTSASNMEEETKHATDVKSNA-----AEQSNGFSPESRSPVGASEQA- 275
            + N + ++G+    +A   E    H T+  +N      +E S G +  S       E + 
Sbjct: 70   LSNVTQSNGL--NYAAGGQETGDNHGTETPANVNNGVVSEGSRGMNESSLVDSRGEESSL 127

Query: 276  ---------NLMKPANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFH 428
                     + +   N    E +  L K+  I+N    H    S +    +N N   + +
Sbjct: 128  DELVDTNTNSTLYVNNDVGSEGIKGLNKSLGIDN----HGRESSPEQLLDQNENSTLELN 183

Query: 429  KTENGS-SLDIIR---KEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNS 596
             + NGS S++  R   +E+     + T +                  T+ A+P   + ++
Sbjct: 184  HSGNGSASIETDRSLFRENITSTSENTGTSQAGITPIAPALPPVDSPTNIAIPRNAEPST 243

Query: 597  STSLKFVDSNISSVAKQLQDKISKDEKP-ESLKRFPAHSNVSSIITSNPNKDQWMKPPSS 773
               +  V+SN S   K     +  D K  E L    +  N +S+ +    K +   P  +
Sbjct: 244  LAPVVPVESNTSKTDKDASHGLENDGKAGEQLNNSTSLQNNTSVTSVREVKKEPHTPSPA 303

Query: 774  VMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRN 953
            V+SISEMN + L++ +S  S +PRW +  D+E+L+AK QIQ A + E  D  L+ PL+RN
Sbjct: 304  VISISEMNNLQLQSWSSPISRRPRWPSAVDQELLNAKSQIQKAPLVES-DSMLYAPLYRN 362

Query: 954  VSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRR 1133
            +S+FK+SYELME  LKVY+YKEG +PI H+   KGIYASEGWFMK +E +K+FV K P++
Sbjct: 363  ISMFKKSYELMEDILKVYIYKEGERPILHQAPLKGIYASEGWFMKLLETNKKFVTKDPKK 422

Query: 1134 AHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACH 1313
            +HLFYLPFSSR LE  LYVPNSHSHKNL+QYLK+Y+DMISAKYPFWNRT GADHFLVACH
Sbjct: 423  SHLFYLPFSSRNLEVNLYVPNSHSHKNLIQYLKNYLDMISAKYPFWNRTRGADHFLVACH 482

Query: 1314 DWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQ 1493
            DWAPTETR  HM+N IRALCN+D    FV GKD +LPET V +P+   +D GGKP S+R 
Sbjct: 483  DWAPTETRQ-HMANCIRALCNSDAKGGFVFGKDAALPETTVRTPQNLLRDLGGKPASKRS 541

Query: 1494 ILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGR-GTRSKMNYIQHMKSSKYCICAKG 1670
            ILAFFAG+MHGYLRPILLQHW NKDPD+K+FG++ +   R KMNY Q+MKSSKYCICAKG
Sbjct: 542  ILAFFAGSMHGYLRPILLQHWGNKDPDVKVFGKLPKVKGRGKMNYPQYMKSSKYCICAKG 601

Query: 1671 FEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQK 1850
            FEVNSPRVVE+I YECVPVIISDN+VPPFFEVL+WE+FAVFV EKDIPNLKNILLSIP+ 
Sbjct: 602  FEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPNLKNILLSIPEN 661

Query: 1851 RYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQI 1979
            +Y  M +R+KK+QQHF+WH++PVKYDIFHMILHS+WYNRVFQ+
Sbjct: 662  KYREMQMRVKKVQQHFLWHARPVKYDIFHMILHSVWYNRVFQV 704


>ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]
            gi|332005353|gb|AED92736.1| Exostosin family protein
            [Arabidopsis thaliana] gi|591401784|gb|AHL38619.1|
            glycosyltransferase, partial [Arabidopsis thaliana]
          Length = 610

 Score =  619 bits (1597), Expect = e-174
 Identities = 292/433 (67%), Positives = 358/433 (82%), Gaps = 3/433 (0%)
 Frame = +3

Query: 699  PAHSNVSSIITSNPNKDQWMK---PPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEE 869
            PA  N S +++   +K + M+   PP SV +I EMN +L R+  +S + +PRWS+ RDEE
Sbjct: 180  PASGNSSLLVSKKVSKKKKMRCDLPPKSVTTIDEMNRILARHRRTSRAMRPRWSSRRDEE 239

Query: 870  ILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPI 1049
            IL+A+++I+NA +  K ++EL+PP+FRNVS+FKRSYELMER LKVYVYKEG +PIFH PI
Sbjct: 240  ILTARKEIENAPVA-KLERELYPPIFRNVSLFKRSYELMERILKVYVYKEGNRPIFHTPI 298

Query: 1050 TKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYL 1229
             KG+YASEGWFMK MEG+KQ+ VK PR+AHL+Y+PFS+RMLE TLYV NSH+  NL Q+L
Sbjct: 299  LKGLYASEGWFMKLMEGNKQYTVKDPRKAHLYYMPFSARMLEYTLYVRNSHNRTNLRQFL 358

Query: 1230 KDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGK 1409
            K+Y + IS+KYPF+NRT GADHFLVACHDWAP ETRH HM + I+ALCNADV   F IG+
Sbjct: 359  KEYTEHISSKYPFFNRTDGADHFLVACHDWAPYETRH-HMEHCIKALCNADVTAGFKIGR 417

Query: 1410 DVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFG 1589
            D+SLPET V + K P +D GGKPPSQR+ LAF+AG+MHGYLR ILLQHW++KDPDMKIFG
Sbjct: 418  DISLPETYVRAAKNPLRDLGGKPPSQRRTLAFYAGSMHGYLRQILLQHWKDKDPDMKIFG 477

Query: 1590 RMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVL 1769
            RM  G  SKMNYI+ MKSSKYCIC KG+EVNSPRVVESI YECVPVIISDN+VPPFFEVL
Sbjct: 478  RMPFGVASKMNYIEQMKSSKYCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVL 537

Query: 1770 DWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILH 1949
            DW AF+V VAEKDIP LK+ILLSIP+ +Y+ M + ++K Q+HF+WH+KP KYD+FHM+LH
Sbjct: 538  DWSAFSVIVAEKDIPRLKDILLSIPEDKYVKMQMAVRKAQRHFLWHAKPEKYDLFHMVLH 597

Query: 1950 SIWYNRVFQIKAK 1988
            SIWYNRVFQ K +
Sbjct: 598  SIWYNRVFQAKRR 610


>gb|EXB59796.1| putative glycosyltransferase [Morus notabilis]
          Length = 669

 Score =  619 bits (1596), Expect = e-174
 Identities = 307/514 (59%), Positives = 385/514 (74%)
 Frame = +3

Query: 447  SLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDGNSSTSLKFVDSN 626
            S + IR E+ +LRL  ++ G                   P+ P+    ++  +  F  ++
Sbjct: 185  STENIRTENIDLRLKKSDGG-------------LDSPFQPS-PLASSADALVNASFSTTS 230

Query: 627  ISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWMKPPSSVMSISEMNLVL 806
             SSV++Q    I+ +           HS +++  T    K +   PP S+ +  EMN +L
Sbjct: 231  TSSVSEQSGLLITNN-----------HSAIAT--TPGVKKMRCNMPPKSITTFQEMNQIL 277

Query: 807  LRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELM 986
            +R+ A S S +PRWS+ RD+EIL+ K QI+NA +    DQEL+ PLFRNVS+FKRSYELM
Sbjct: 278  VRHRAKSRSLRPRWSSVRDKEILAMKPQIENAPLA-MNDQELYAPLFRNVSMFKRSYELM 336

Query: 987  ERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSR 1166
            ERTLKVYVYK+G KPIFH+PI KG+YASEGWFMK ME ++++VVK PRRAHLFY+PFSSR
Sbjct: 337  ERTLKVYVYKDGDKPIFHQPIMKGLYASEGWFMKLMERNRRYVVKDPRRAHLFYMPFSSR 396

Query: 1167 MLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAH 1346
            MLE  LYV NSH+  NL QYLK+Y + ++AKYP+WNRTGGADHFLVACHDWAP ETRH H
Sbjct: 397  MLEHVLYVRNSHNRTNLRQYLKEYSEKLAAKYPYWNRTGGADHFLVACHDWAPYETRH-H 455

Query: 1347 MSNTIRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHG 1526
            M   ++ALCNADV   F IG+DVS PET V S + P +D GGKPPS+R +LAF+AGN+HG
Sbjct: 456  MERCMKALCNADVTSGFKIGRDVSFPETYVRSARNPLRDLGGKPPSRRHVLAFYAGNIHG 515

Query: 1527 YLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESI 1706
            YLRPILL++W++KDPDMKIFG M  G  +KMNYIQHMKSSKYCIC KG+EVNSPRVVESI
Sbjct: 516  YLRPILLKYWKDKDPDMKIFGPMPPGVANKMNYIQHMKSSKYCICPKGYEVNSPRVVESI 575

Query: 1707 LYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKL 1886
             YECVPVIISDN+VPPFFEVL+WEAF++ +AEKDIP LK ILLSIP+++YL M L ++K 
Sbjct: 576  FYECVPVIISDNFVPPFFEVLNWEAFSIVLAEKDIPKLKEILLSIPKEKYLEMQLAVRKA 635

Query: 1887 QQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            Q+HF+WH+KP+KYD+FHM LHSIWYNRVFQIK +
Sbjct: 636  QKHFLWHAKPMKYDLFHMTLHSIWYNRVFQIKPR 669


>ref|XP_004251626.1| PREDICTED: probable glycosyltransferase At5g03795-like [Solanum
            lycopersicum]
          Length = 674

 Score =  619 bits (1596), Expect = e-174
 Identities = 332/659 (50%), Positives = 439/659 (66%), Gaps = 14/659 (2%)
 Frame = +3

Query: 54   ASSTAEDSVSKSAIIGISSDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGF 233
            + S  + S   S +   SS +++T + +G   T S+ +      H      N+    +G 
Sbjct: 48   SESNTQLSEKVSLLSKESSVVESTKVGEGFSGTLSSFDDVHMLAHRLKTVDNSDVSEDGE 107

Query: 234  SPESRSPVGASEQANLMKPANKYSPEKVVDLGKNF----RIEN------VEDPHNGSVSE 383
              ES +      + + +KP + +S  K ++   +F     IEN      + D    +  +
Sbjct: 108  IDESVN------EKDEVKPHSNHSVVKTMENDSDFVEDATIENDNLFDEMVDMDEETTMQ 161

Query: 384  KTREPENANVQEQFHKTENGSSLDIIRKEDTNLRLDGTESGHIXXXXXXXXXXXXXXXTD 563
            K  E +     EQ  KT +  S D     + N  L+ T++ ++                 
Sbjct: 162  KNNESKWDLSIEQVVKTTDELSADSDLDANRNTVLNDTKAANVTNSSSVEASNHLDNLPL 221

Query: 564  PAMP----IKLDGNSSTSLKFVDSNISSVAKQLQDKISKDEKPESLKRFPAHSNVSSIIT 731
             A+     I+  GN+S++      N++ +                    P + N S +++
Sbjct: 222  VAIGEINFIRTTGNNSST-----GNLTQL-------------------LPNNGNHSLVLS 257

Query: 732  SNPNKDQWMKPPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAAIT 911
            +   K + M PP +V +IS+M  +L+R+ A S + +PRWS+ RD+EIL+A+ QI+NA + 
Sbjct: 258  TVKKKMRCMLPPKTVTTISQMERLLVRHRARSRAMRPRWSSERDKEILAARLQIENAPLI 317

Query: 912  EKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFMKH 1091
             + D+E++ P FRN+S+FKRSYELMER L+VYVYKEG KPIFH+PI KG+YASEGWFMK 
Sbjct: 318  -RNDREIYAPAFRNMSMFKRSYELMERILRVYVYKEGEKPIFHQPIMKGLYASEGWFMKL 376

Query: 1092 MEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYPFW 1271
            MEG+ +FVVK PR+AHLFYLPFSSRMLE +LYV NSH+  NL QYLKDY + I+AKY FW
Sbjct: 377  MEGNNKFVVKDPRKAHLFYLPFSSRMLEHSLYVRNSHNRTNLRQYLKDYSEKIAAKYRFW 436

Query: 1272 NRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISPKE 1451
            NRTGGADHFLVACHDWAP ETRH HM + I+ALCNADV   F IG+DVSL ET V S + 
Sbjct: 437  NRTGGADHFLVACHDWAPYETRH-HMEHCIKALCNADVTLGFKIGRDVSLAETYVRSARN 495

Query: 1452 PAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNYIQ 1631
            P +D GGKP SQR++LAF+AGNMHGYLRPILL+HW++KDPDM+IFG M  G  SKMNYIQ
Sbjct: 496  PLRDLGGKPASQRKVLAFYAGNMHGYLRPILLEHWKDKDPDMEIFGPMPSGVASKMNYIQ 555

Query: 1632 HMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEKDI 1811
            HMKSSK+CIC KG+EVNSPRVVE+I YECVPVIISDN+VPPFF VL+W+ F++ +AEKDI
Sbjct: 556  HMKSSKFCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFGVLNWDTFSLILAEKDI 615

Query: 1812 PNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            PNLK+ILLSIP+K+YL M L I+K+Q+HF+WH+KPVKYD+FHM LHSIWYNRVFQ KA+
Sbjct: 616  PNLKSILLSIPEKKYLDMQLAIRKVQRHFLWHAKPVKYDLFHMTLHSIWYNRVFQTKAR 674


>ref|XP_002324438.2| hypothetical protein POPTR_0018s09250g [Populus trichocarpa]
            gi|550318376|gb|EEF03003.2| hypothetical protein
            POPTR_0018s09250g [Populus trichocarpa]
          Length = 682

 Score =  617 bits (1590), Expect = e-174
 Identities = 326/630 (51%), Positives = 414/630 (65%), Gaps = 3/630 (0%)
 Frame = +3

Query: 108  SDIKNTSLTDGIVRTTSASNMEEETKHATDVKSNAAEQSNGFSPESRSPVGASEQANLMK 287
            S + N    DG++     SN   E  H    K N  +  + FS         SE+ ++  
Sbjct: 81   SSLNNYFKFDGVLENADDSNGGVEEGHDDGTKKNTEDTDHDFS---------SEEGDMEV 131

Query: 288  PANKYSPEKVVDLGKNFRIENVEDPHNGSVSEKTREPENANVQEQFHKTENGSSLDIIRK 467
              +    E   DL  +F  E+V+D H    S   +  E+  V +  ++      L+   K
Sbjct: 132  LDDVIQLEVDRDLEDDFPSEDVKDRHETFASGGVKTEESNPVLKLANEARFNLPLERNVK 191

Query: 468  EDTNLRLDGTESGHIXXXXXXXXXXXXXXXTDPAMPIKLDG-NSSTSLKFVDSNISSVAK 644
             D ++  D     +                 +  +P+      SST   ++ SN SS   
Sbjct: 192  SDHDIPTDNVLQQN------KSQAHKEFEHVNSTLPVDSQAVASSTKATYLKSNGSSSIG 245

Query: 645  QLQDKISKDEKPESLKRFPAHSNVSSIITSNPNKDQWM--KPPSSVMSISEMNLVLLRNH 818
                       P +LK   A +   S++ + P K +     PP SV  I EMN +L+R+ 
Sbjct: 246  -----------PAALKSDSAAAKNYSVVLAKPGKKKMRCEMPPKSVTLIDEMNSILVRHR 294

Query: 819  ASSGSEKPRWSTPRDEEILSAKRQIQNAAITEKGDQELFPPLFRNVSVFKRSYELMERTL 998
             SS S +PRWS+ RD+EIL+A+ QI++A      D++L+ PLFRNVS FKRSYELMERTL
Sbjct: 295  RSSRSMRPRWSSARDQEILAARSQIESAPAVVH-DRDLYAPLFRNVSKFKRSYELMERTL 353

Query: 999  KVYVYKEGAKPIFHRPITKGIYASEGWFMKHMEGHKQFVVKSPRRAHLFYLPFSSRMLEA 1178
            K+Y+YK+G KPIFH PI KG+YASEGWFMK M+G+K FVVK PR+AHLFY+PFSSRMLE 
Sbjct: 354  KIYIYKDGKKPIFHLPILKGLYASEGWFMKLMQGNKHFVVKDPRKAHLFYMPFSSRMLEY 413

Query: 1179 TLYVPNSHSHKNLVQYLKDYIDMISAKYPFWNRTGGADHFLVACHDWAPTETRHAHMSNT 1358
            TLYV NSH+  NL  Y+K Y + I+AKY FWNRTGGADHFLVACHDWAP ETRH HM + 
Sbjct: 414  TLYVRNSHNRTNLRLYMKRYAESIAAKYSFWNRTGGADHFLVACHDWAPYETRH-HMEHC 472

Query: 1359 IRALCNADVHESFVIGKDVSLPETLVISPKEPAKDPGGKPPSQRQILAFFAGNMHGYLRP 1538
            I+ALCNADV   F IG+DVS PET V S + P +D GGKPPSQR ILAF+AGNMHGYLRP
Sbjct: 473  IKALCNADVTAGFKIGRDVSFPETYVRSARNPLRDLGGKPPSQRNILAFYAGNMHGYLRP 532

Query: 1539 ILLQHWENKDPDMKIFGRMGRGTRSKMNYIQHMKSSKYCICAKGFEVNSPRVVESILYEC 1718
            ILL++W++KDPDMKIFG M  G  SKMNYI HM+ SKYCIC KG+EVNSPRVVE+I YEC
Sbjct: 533  ILLKYWKDKDPDMKIFGPMPPGVASKMNYIHHMQRSKYCICPKGYEVNSPRVVEAIFYEC 592

Query: 1719 VPVIISDNYVPPFFEVLDWEAFAVFVAEKDIPNLKNILLSIPQKRYLMMHLRIKKLQQHF 1898
            VPVIISDN+VPPFF+VLDW AF++ +AEKDI NLK ILLSIP+++YL M L ++K Q+HF
Sbjct: 593  VPVIISDNFVPPFFDVLDWGAFSLILAEKDISNLKEILLSIPKEKYLQMQLGVRKAQRHF 652

Query: 1899 MWHSKPVKYDIFHMILHSIWYNRVFQIKAK 1988
            +WH+ P+KYD+F+M LHSIWYNRV+QIK +
Sbjct: 653  LWHASPMKYDLFYMTLHSIWYNRVYQIKPR 682


>ref|XP_006287301.1| hypothetical protein CARUB_v10000494mg [Capsella rubella]
            gi|482556007|gb|EOA20199.1| hypothetical protein
            CARUB_v10000494mg [Capsella rubella]
          Length = 613

 Score =  614 bits (1583), Expect = e-173
 Identities = 298/481 (61%), Positives = 371/481 (77%), Gaps = 8/481 (1%)
 Frame = +3

Query: 570  MPIKLDGNSSTSLKFVDSNISSVAKQLQDKISKDEKPESLK-----RFPAHSNVSSIITS 734
            M +K     STS         SV  Q + K S      SL      + P   N S +++ 
Sbjct: 135  MNVKQSAEMSTSKYGYQVQDVSVESQKKVKTSMLSASSSLAASSVGKLPVSGNSSLLVSK 194

Query: 735  NPNKDQWMK---PPSSVMSISEMNLVLLRNHASSGSEKPRWSTPRDEEILSAKRQIQNAA 905
              +K + M+   PP +V +I EMN +L R+  +S + +PRWS+ RDEEIL+A+++I+NA 
Sbjct: 195  QVSKKKKMRCNLPPKTVTTIEEMNRILARHRRTSRAMRPRWSSRRDEEILAARKEIENAP 254

Query: 906  ITEKGDQELFPPLFRNVSVFKRSYELMERTLKVYVYKEGAKPIFHRPITKGIYASEGWFM 1085
            +  K ++EL+PP++RNVS+FKRSYELMERTLKVYVYKEG +PIFH PI KG+YASEGWFM
Sbjct: 255  VA-KLERELYPPIYRNVSMFKRSYELMERTLKVYVYKEGNRPIFHTPILKGLYASEGWFM 313

Query: 1086 KHMEGHKQFVVKSPRRAHLFYLPFSSRMLEATLYVPNSHSHKNLVQYLKDYIDMISAKYP 1265
            K ME  KQ+ VK PRRAHL+Y+PFS+RMLE TLYV NSH+  NL Q+LK+Y + IS+KYP
Sbjct: 314  KLMEESKQYTVKDPRRAHLYYMPFSARMLEFTLYVRNSHNRTNLRQFLKEYTEHISSKYP 373

Query: 1266 FWNRTGGADHFLVACHDWAPTETRHAHMSNTIRALCNADVHESFVIGKDVSLPETLVISP 1445
            F+NRT GADHFLVACHDWAP ETRH HM + I+ALCNADV   F IG+D+SLPET V + 
Sbjct: 374  FFNRTDGADHFLVACHDWAPYETRH-HMEHCIKALCNADVTAGFKIGRDISLPETYVRAA 432

Query: 1446 KEPAKDPGGKPPSQRQILAFFAGNMHGYLRPILLQHWENKDPDMKIFGRMGRGTRSKMNY 1625
            K P +D GGKPPSQR+ LAF+AG+MHGYLR ILLQHW++KDP+MKIFGRM  G  SKMNY
Sbjct: 433  KNPQRDLGGKPPSQRRTLAFYAGSMHGYLRAILLQHWKDKDPEMKIFGRMPLGVASKMNY 492

Query: 1626 IQHMKSSKYCICAKGFEVNSPRVVESILYECVPVIISDNYVPPFFEVLDWEAFAVFVAEK 1805
            I+ MKSSKYCIC KG+EVNSPRVVESI YECVPVIISDN+VPPFFEVLDW AF+V +AEK
Sbjct: 493  IEQMKSSKYCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIIAEK 552

Query: 1806 DIPNLKNILLSIPQKRYLMMHLRIKKLQQHFMWHSKPVKYDIFHMILHSIWYNRVFQIKA 1985
            DIP LK+IL SIP+++Y+ M + ++K Q+HF+WH+KP +YD+FHM+LHSIWYNRVFQ K 
Sbjct: 553  DIPRLKDILSSIPEEKYVKMQMAVRKAQRHFLWHAKPQRYDLFHMVLHSIWYNRVFQAKR 612

Query: 1986 K 1988
            +
Sbjct: 613  R 613

