BLASTX nr result

ID: Rauwolfia21_contig00007142

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00007142
         (2288 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AED99886.1| glycosyltransferase [Panax notoginseng]                720   0.0  
ref|XP_004308086.1| PREDICTED: O-glucosyltransferase rumi homolo...   714   0.0  
ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolo...   704   0.0  
gb|EOY23193.1| Glycosyltransferase isoform 1 [Theobroma cacao]        703   0.0  
ref|XP_004234394.1| PREDICTED: O-glucosyltransferase rumi homolo...   701   0.0  
ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, p...   697   0.0  
ref|XP_002321919.2| hypothetical protein POPTR_0015s13090g [Popu...   695   0.0  
gb|EXB29382.1| hypothetical protein L484_001025 [Morus notabilis]     693   0.0  
ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-l...   681   0.0  
gb|EOY23194.1| Glycosyltransferase isoform 2 [Theobroma cacao]        675   0.0  
ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-l...   675   0.0  
gb|EMJ21936.1| hypothetical protein PRUPE_ppa023179mg [Prunus pe...   672   0.0  
ref|XP_004309206.1| PREDICTED: O-glucosyltransferase rumi homolo...   669   0.0  
ref|XP_006421921.1| hypothetical protein CICLE_v10004696mg [Citr...   667   0.0  
ref|XP_006490390.1| PREDICTED: KDEL motif-containing protein 2-l...   664   0.0  
ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, p...   656   0.0  
gb|EMJ21654.1| hypothetical protein PRUPE_ppa005169mg [Prunus pe...   655   0.0  
ref|XP_006287442.1| hypothetical protein CARUB_v10000648mg [Caps...   641   0.0  
ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana] ...   639   e-180
ref|XP_006490389.1| PREDICTED: protein O-glucosyltransferase 1-l...   637   e-180

>gb|AED99886.1| glycosyltransferase [Panax notoginseng]
          Length = 546

 Score =  720 bits (1858), Expect = 0.0
 Identities = 347/541 (64%), Positives = 405/541 (74%), Gaps = 6/541 (1%)
 Frame = +2

Query: 506  MREQEQGAVLNGSGINKHFAEKIWFPFLPKKAPARSSLAIFFL----VVLCIGAFVYTRL 673
            +R+  Q  +L GSG    + +++  P L  K  + +    F L     +L +GAF+ TRL
Sbjct: 6    IRQGFQSYLLYGSGKLYRYLKEMVTPLLTIKLSSATFSYYFRLSTVITLLFLGAFISTRL 65

Query: 674  LDSSVP-SIAEYLPQKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSNA-RACPP 847
            LDS+V  SI     Q SI    +    P   P I+      ++++PLNCS  N  R CP 
Sbjct: 66   LDSTVTTSITGNSSQSSILVTKTTHIYPEITP-IIRKKPPRKVEIPLNCSTGNLIRTCPA 124

Query: 848  NYYPSKFSTQNPDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVIL 1027
            NYYP  F+ Q+ D SS P  +CP+YFRWI+EDLRPWRETGI+REMVE ARRTANFRLVIL
Sbjct: 125  NYYPRTFNIQDQDHSSIPPVSCPEYFRWIYEDLRPWRETGITREMVERARRTANFRLVIL 184

Query: 1028 DGKVYMERYSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANA 1207
            +G+ Y+E + KSFQSRD FTLWGILQLLR YPG+VPDLDLMFDCVDWPVI    Y G NA
Sbjct: 185  NGRAYVETHQKSFQSRDVFTLWGILQLLRMYPGKVPDLDLMFDCVDWPVIISRFYHGPNA 244

Query: 1208 TAPLPLFRYCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAY 1387
            TAP PLFRYC DD+TLDIVFPDW+FWGWPEINIKPW  L KDLK+GN   +W+DREPYAY
Sbjct: 245  TAPPPLFRYCADDSTLDIVFPDWTFWGWPEINIKPWGSLLKDLKEGNTGTQWMDREPYAY 304

Query: 1388 WKGNPVVAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIE 1567
            WKGNP+VAKTRMDLLKCNVSDKQDWNARVYA DW +E + GYKQSDLASQCIHRYKIYIE
Sbjct: 305  WKGNPIVAKTRMDLLKCNVSDKQDWNARVYAXDWARESQLGYKQSDLASQCIHRYKIYIE 364

Query: 1568 GSAWSVSEKYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNG 1747
            GSAWSVSEKYILACDS+ L VKPRYYDF TR LMP+ HYWP++DDDKCRSIK+AVDWGN 
Sbjct: 365  GSAWSVSEKYILACDSVTLXVKPRYYDFFTRGLMPVHHYWPIRDDDKCRSIKFAVDWGNN 424

Query: 1748 HIEEAQAIGKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMAC 1927
            H ++A +IGK AS FIQ++L M+YVYDYMFH            PT+PP+AVELCSE MAC
Sbjct: 425  HKQKAHSIGKEASNFIQEDLKMDYVYDYMFHLLNEYAKLLRYKPTVPPKAVELCSETMAC 484

Query: 1928 PAKGLEKTFMMDSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQ 2107
            PA+G  K FMM+S  +GP+  +PC M P YD  TLHS+L RKEN IKQVE  EK YWD+ 
Sbjct: 485  PAEGFTKKFMMESIVKGPTDKSPCVMQPPYDPPTLHSVLRRKENSIKQVENWEKLYWDNH 544

Query: 2108 N 2110
            N
Sbjct: 545  N 545


>ref|XP_004308086.1| PREDICTED: O-glucosyltransferase rumi homolog [Fragaria vesca subsp.
            vesca]
          Length = 508

 Score =  714 bits (1844), Expect = 0.0
 Identities = 332/508 (65%), Positives = 401/508 (78%), Gaps = 2/508 (0%)
 Frame = +2

Query: 599  APARSSLAIF-FLVVLCIGAFVYTRLLDSSVPSIAEYLPQKSIFNAISFRNNPRDAPKIV 775
            +PARSS A+  FL++  +GAFV TRLL+S+  ++     Q SI N  + ++ P D P ++
Sbjct: 3    SPARSSSAVLVFLLLFFVGAFVCTRLLNSTTHTLGGTSAQDSILNTKASQSYPHDTP-VL 61

Query: 776  ENSSRHEIDVPLNCSVSN-ARACPPNYYPSKFSTQNPDPSSRPQPTCPDYFRWIHEDLRP 952
              +    +++PLNC+  +  R CP NY  +  S+ + DP   P PTCP+YFRWIHEDLRP
Sbjct: 62   PKTPPKILEIPLNCTAFDLTRTCPSNYPTT--SSPDHDPERPPAPTCPEYFRWIHEDLRP 119

Query: 953  WRETGISREMVEMARRTANFRLVILDGKVYMERYSKSFQSRDTFTLWGILQLLRRYPGQV 1132
            W  TGIS+   + ARRTANF+LVI++GK YMERY KSFQSRDTFTLWGILQLLRRYPG+V
Sbjct: 120  WAHTGISKATFQKARRTANFKLVIVNGKAYMERYGKSFQSRDTFTLWGILQLLRRYPGKV 179

Query: 1133 PDLDLMFDCVDWPVIKKEAYSGANATAPLPLFRYCGDDTTLDIVFPDWSFWGWPEINIKP 1312
            PDL+LMFDCVDWPVI  + Y+G N++AP PLFRYCGDD++LDIVFPDWSFWGWPEINI P
Sbjct: 180  PDLELMFDCVDWPVILSKFYTGDNSSAPPPLFRYCGDDSSLDIVFPDWSFWGWPEINIAP 239

Query: 1313 WVPLSKDLKDGNERIKWVDREPYAYWKGNPVVAKTRMDLLKCNVSDKQDWNARVYAQDWV 1492
            W  L K L++GN+R +W+DREPYAYWKGNP VA+TR DLLKCNVS++QDWNARVYAQDW 
Sbjct: 240  WENLLKQLEEGNKRSRWIDREPYAYWKGNPAVAETRQDLLKCNVSEEQDWNARVYAQDWS 299

Query: 1493 KEQKQGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILACDSLALLVKPRYYDFSTRSLMP 1672
            +E K+G+KQSDLASQCIHRYKIYIEGSAWSVS KYILACDS+ L+VKPRYYDF TR LMP
Sbjct: 300  RESKEGFKQSDLASQCIHRYKIYIEGSAWSVSNKYILACDSVTLIVKPRYYDFFTRELMP 359

Query: 1673 LQHYWPVKDDDKCRSIKYAVDWGNGHIEEAQAIGKAASRFIQDELAMNYVYDYMFHXXXX 1852
            + HYWP+KDDDKCRSIKYAVDWGN H ++AQAIGKAAS  IQ++L M+YVYDYMFH    
Sbjct: 360  VHHYWPIKDDDKCRSIKYAVDWGNSHKQKAQAIGKAASNLIQEDLKMDYVYDYMFHLLSE 419

Query: 1853 XXXXXXXXPTIPPRAVELCSELMACPAKGLEKTFMMDSTARGPSIATPCAMPPAYDSATL 2032
                    PTIP +A+ELCSE MAC A+GLEK FMM+S  +GP++ +PC MPP YD   L
Sbjct: 420  YAKLLQFKPTIPRKAIELCSEAMACQAQGLEKKFMMESMVKGPAVTSPCTMPPPYDPPAL 479

Query: 2033 HSILERKENLIKQVETREKQYWDSQNKQ 2116
             S+L R+ N IKQVET EK YW++QNKQ
Sbjct: 480  FSVLRRQSNSIKQVETWEKSYWENQNKQ 507


>ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
            gi|302143884|emb|CBI22745.3| unnamed protein product
            [Vitis vinifera]
          Length = 525

 Score =  704 bits (1817), Expect = 0.0
 Identities = 336/527 (63%), Positives = 403/527 (76%), Gaps = 1/527 (0%)
 Frame = +2

Query: 533  LNGSGINKHFAEKIWFPFLPKKAPARSSLAIFFLVVLCIGAFVYTRLLDSSVPSIAEYLP 712
            L+GSG  +HF++ IW PF+  KAPARSS  +FF + L IGAF+ TRLLDS     A  LP
Sbjct: 9    LHGSGYFRHFSDSIWRPFM--KAPARSSAILFFFLFLFIGAFLSTRLLDS-----ATSLP 61

Query: 713  QKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSN-ARACPPNYYPSKFSTQNPDP 889
              S+   I         P  +      +I+ PLNCS  N  R CP NY P+ FS ++PD 
Sbjct: 62   TTSVEKPI-LPTGTAHKPFKIPKKPPVKIEYPLNCSAGNLTRTCPRNY-PTAFSPEDPDR 119

Query: 890  SSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGKVYMERYSKSFQ 1069
             S P+  CP YFRWI+ DLRPW ++GI+REMVE A+RTA F+LVIL+G+ Y+E+Y ++FQ
Sbjct: 120  PSPPE--CPHYFRWIYGDLRPWMKSGITREMVERAKRTATFKLVILNGRAYVEKYQRAFQ 177

Query: 1070 SRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAPLPLFRYCGDDT 1249
            +RD FTLWGILQLLRRYPG+VPDL+LMFDCVDWPVI+   Y G NATAP PLFRYCGDD 
Sbjct: 178  TRDVFTLWGILQLLRRYPGKVPDLELMFDCVDWPVIQSNEYRGPNATAPPPLFRYCGDDA 237

Query: 1250 TLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKGNPVVAKTRMDL 1429
            TLDIVFPDWSFWGWPEINIKPW  L KDLK+GN+R +W++REPYAYWKGNP VA TR+DL
Sbjct: 238  TLDIVFPDWSFWGWPEINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDL 297

Query: 1430 LKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILAC 1609
            LKCNVSDKQDWNARVY QDW+ E ++GYKQSDLASQCIHRYKIYIEGSAWSVS+KYILAC
Sbjct: 298  LKCNVSDKQDWNARVYTQDWILESQEGYKQSDLASQCIHRYKIYIEGSAWSVSQKYILAC 357

Query: 1610 DSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIEEAQAIGKAASR 1789
            DS+ LLVKP YYDF TRSLMP+ HYWP+++DDKCRSIK+AVDWGN H ++AQ+IGKAAS 
Sbjct: 358  DSVTLLVKPHYYDFFTRSLMPVHHYWPIREDDKCRSIKFAVDWGNRHKQKAQSIGKAASD 417

Query: 1790 FIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAKGLEKTFMMDST 1969
            FIQ++L M+ VYDYMFH            PT+P +AVELCSE M C A+GL+K FMM+S 
Sbjct: 418  FIQEDLKMDNVYDYMFHLLNEYAKLLKFKPTVPEKAVELCSERMGCGAEGLKKKFMMESM 477

Query: 1970 ARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQN 2110
             + P  A+PC MPP +    L + L RK N IKQVE  EK++W++QN
Sbjct: 478  VKYPMDASPCTMPPPFSPLELQTFLNRKVNSIKQVEAWEKKFWENQN 524


>gb|EOY23193.1| Glycosyltransferase isoform 1 [Theobroma cacao]
          Length = 522

 Score =  703 bits (1814), Expect = 0.0
 Identities = 342/537 (63%), Positives = 404/537 (75%), Gaps = 3/537 (0%)
 Frame = +2

Query: 506  MREQ--EQGAVLNGSGINKHFAEKIWFPFLPKKAPARSSLAIFFLVVLCIGAFVYTRLLD 679
            MRE   +QG   NGSG+   F E IW PF   K+ ARSS      +VL +GAF  T LLD
Sbjct: 5    MRENNMQQG---NGSGLFSQFTETIWRPFA--KSSARSSAIFVVFIVLLVGAFS-THLLD 58

Query: 680  SSVPSIAEYLPQKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSN-ARACPPNYY 856
            ++  +    L QK + +  + R NP+          R + D+PLNC+  N  RACP N  
Sbjct: 59   TT--TFLGSLAQKPMLSTRTSRGNPK--------KPRQQRDIPLNCTARNLTRACPTND- 107

Query: 857  PSKFSTQNPDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGK 1036
            P+    +   P S     CPDYFRWIHEDLRPW  TGIS +M++ A +TANFRLV+++G+
Sbjct: 108  PTAIEEE---PDSSLNAMCPDYFRWIHEDLRPWAYTGISMDMLKRAEKTANFRLVVVNGR 164

Query: 1037 VYMERYSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAP 1216
             Y++RY +SFQ+RD FTLWGILQLLRRYPG+VPDLDLMFDCVDWPVIK   Y G NAT P
Sbjct: 165  AYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVDWPVIKTSDYGGPNATTP 224

Query: 1217 LPLFRYCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKG 1396
             PLFRYC DD TLDIVFPDWSFWGWPEINIKPWVPL  DL +GN+R+ W  REP+AYWKG
Sbjct: 225  PPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEGNKRMGWEGREPHAYWKG 284

Query: 1397 NPVVAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSA 1576
            NP VA TR DLLKCNVSDKQDW ARVYAQDW +E +QGYKQSDLA+QCIHR+KIYIEGSA
Sbjct: 285  NPNVATTRQDLLKCNVSDKQDWGARVYAQDWARESQQGYKQSDLANQCIHRFKIYIEGSA 344

Query: 1577 WSVSEKYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIE 1756
            WSVSEKYILACDSL LLVKPRYYDF TRSL P++HYWP+KDDDKCRSIK+AVDWGNGH +
Sbjct: 345  WSVSEKYILACDSLTLLVKPRYYDFFTRSLEPMRHYWPIKDDDKCRSIKHAVDWGNGHQQ 404

Query: 1757 EAQAIGKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAK 1936
            EAQAIGKAAS FI++ L M+YVYDYMFH            PT+P +AVELCSE MACPA+
Sbjct: 405  EAQAIGKAASEFIKEGLKMDYVYDYMFHLLNEYAKLLRYKPTVPRKAVELCSETMACPAE 464

Query: 1937 GLEKTFMMDSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQ 2107
            GL+K FMM+S  +GPS+ +PC MPP YD A+L+++L +KEN IKQVE  EK++W+ Q
Sbjct: 465  GLQKKFMMESMVKGPSVTSPCTMPPPYDPASLYALLSKKENSIKQVEEWEKKFWEMQ 521


>ref|XP_004234394.1| PREDICTED: O-glucosyltransferase rumi homolog [Solanum lycopersicum]
          Length = 514

 Score =  701 bits (1809), Expect = 0.0
 Identities = 332/505 (65%), Positives = 396/505 (78%), Gaps = 3/505 (0%)
 Frame = +2

Query: 611  SSLAIFFLVVLCIGAFVYTRLLDSSVP-SIAEYLPQKSIFNAISFRNNPRDAPKIVENSS 787
            SSL +F  ++L IGA   T  L S    S   Y P+K+I   +   N+    P + +   
Sbjct: 7    SSLTLFVSLLLFIGAIFSTHFLYSPFNNSTTGYSPRKTIVTRVIRYNHTYATPSVSKQPL 66

Query: 788  RHEIDVPLNCSVSN-ARACPPNYYPSKFSTQNPDP-SSRPQPTCPDYFRWIHEDLRPWRE 961
            + ++++ LNC++ N  R CP +YYP KF+ QN    SS P PTCPDYFRWI++DL  WRE
Sbjct: 67   K-KLEIQLNCTLGNLTRTCPASYYPLKFTEQNESSTSSSPPPTCPDYFRWIYDDLWHWRE 125

Query: 962  TGISREMVEMARRTANFRLVILDGKVYMERYSKSFQSRDTFTLWGILQLLRRYPGQVPDL 1141
            TGI++EMV  A+RTA+FRLVI++G+ Y+E Y K+FQSRDTFTLWGILQ+LRRYPG+VPDL
Sbjct: 126  TGITKEMVMRAKRTADFRLVIVNGRAYVETYHKAFQSRDTFTLWGILQMLRRYPGKVPDL 185

Query: 1142 DLMFDCVDWPVIKKEAYSGANATAPLPLFRYCGDDTTLDIVFPDWSFWGWPEINIKPWVP 1321
            DLMFDCVDWPV+K E Y    A  P PLFRYCG+D++LDIVFPDWSFWGWPEINIKPW  
Sbjct: 186  DLMFDCVDWPVLKTEFYRHPKAPVPPPLFRYCGNDSSLDIVFPDWSFWGWPEINIKPWET 245

Query: 1322 LSKDLKDGNERIKWVDREPYAYWKGNPVVAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQ 1501
            LSKDLK GNE++KW +REPYAYWKGNPVVA+TR DLLKCN S+KQDWNARVYAQDW + +
Sbjct: 246  LSKDLKKGNEKMKWTEREPYAYWKGNPVVAETRRDLLKCNASEKQDWNARVYAQDWAQAE 305

Query: 1502 KQGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILACDSLALLVKPRYYDFSTRSLMPLQH 1681
            KQGYKQSDLA+QCIHRYKIY+EGSAWSVSEKYILACDS+ LL+KP+YYDF TR LMPLQH
Sbjct: 306  KQGYKQSDLANQCIHRYKIYVEGSAWSVSEKYILACDSVTLLIKPQYYDFYTRGLMPLQH 365

Query: 1682 YWPVKDDDKCRSIKYAVDWGNGHIEEAQAIGKAASRFIQDELAMNYVYDYMFHXXXXXXX 1861
            YWPVKD DKCRSIK+AVDWGN H +EAQAIGKAAS FIQ++L M+YVYDYMFH       
Sbjct: 366  YWPVKDKDKCRSIKHAVDWGNTHEQEAQAIGKAASDFIQEQLKMDYVYDYMFHLLSEYAK 425

Query: 1862 XXXXXPTIPPRAVELCSELMACPAKGLEKTFMMDSTARGPSIATPCAMPPAYDSATLHSI 2041
                 PT+P +AVELCSE MAC A+GL K FM++S   GPS ATPC MPP Y  A LHSI
Sbjct: 426  LLKYKPTVPRKAVELCSEAMACSAEGLTKKFMLESMVEGPSDATPCNMPPPYGPAGLHSI 485

Query: 2042 LERKENLIKQVETREKQYWDSQNKQ 2116
            L+RKEN IKQV++ E+QYW +++KQ
Sbjct: 486  LDRKENSIKQVDSWEQQYWKNKSKQ 510


>ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
            communis] gi|223549903|gb|EEF51390.1| KDEL
            motif-containing protein 1 precursor, putative [Ricinus
            communis]
          Length = 528

 Score =  697 bits (1799), Expect = 0.0
 Identities = 341/540 (63%), Positives = 408/540 (75%), Gaps = 3/540 (0%)
 Frame = +2

Query: 506  MREQE--QGAVLNGSGINKHFAEKIWFPFLPKKAPARSSLAIFFLVVLCIGAFVYTRLLD 679
            MR Q+  Q ++  GSG   HF +KI  P L  K P+R S+ +F L+ L   AF+ TR LD
Sbjct: 1    MRVQQTLQRSLQYGSGFYSHFIDKI-SPSL--KLPSRISIFLFLLICLA-SAFLTTRFLD 56

Query: 680  SSVPSIAEYLPQKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSN-ARACPPNYY 856
            SS  +      QK +    S   NP     ++  ++ ++I++PLNC+  N  R CP NY 
Sbjct: 57   SS-SAFTGSSAQKPLITTKSAPTNPT----LISKNALNKINIPLNCAAFNLTRTCPSNY- 110

Query: 857  PSKFSTQNPDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGK 1036
            P+ F T+NPD  S     CP+Y+RWI+EDLRPW  TGISR+MVE A+ TANFRLVI++GK
Sbjct: 111  PTTF-TENPDRPS--VSACPEYYRWIYEDLRPWARTGISRDMVERAKTTANFRLVIVNGK 167

Query: 1037 VYMERYSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAP 1216
             Y+E+Y ++FQ+RD FTLWGILQLLRRYPG+VPDL+LMFDCVDWPVIK   YSG NA AP
Sbjct: 168  AYVEKYRRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDCVDWPVIKSSNYSGPNAMAP 227

Query: 1217 LPLFRYCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKG 1396
             PLFRYCGDD TLD+VFPDWSFWGW EINIKPW  L ++LK+GNE+ +W++REPYAYWKG
Sbjct: 228  PPLFRYCGDDDTLDVVFPDWSFWGWSEINIKPWERLLRELKEGNEKRRWMEREPYAYWKG 287

Query: 1397 NPVVAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSA 1576
            NP VA+TR DL+KCNVS++QDWNARVYAQDW+KE +QGYKQS+LASQC+HRYKIYIEGSA
Sbjct: 288  NPAVAETRQDLMKCNVSEQQDWNARVYAQDWIKELQQGYKQSNLASQCMHRYKIYIEGSA 347

Query: 1577 WSVSEKYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIE 1756
            WSVSEKYILACDS+ LLVKP YYDF TRSL P+ HYWP+KD DKCRSIK+AVDWGN H +
Sbjct: 348  WSVSEKYILACDSVTLLVKPHYYDFFTRSLRPIHHYWPIKDYDKCRSIKFAVDWGNNHKQ 407

Query: 1757 EAQAIGKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAK 1936
            +AQAIGKAAS FIQ+EL M+YVYDYMFH            P IP +AVELCSE MACPA 
Sbjct: 408  KAQAIGKAASEFIQEELKMDYVYDYMFHLLNEYAKLLTFKPVIPRKAVELCSESMACPAN 467

Query: 1937 GLEKTFMMDSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQNKQ 2116
            G+EK FMM+S  +GP+   PC M P YD + LHSI  RKEN I+QVE  EK YWD Q KQ
Sbjct: 468  GIEKEFMMESMVQGPAETNPCIMLPPYDPSALHSIFRRKENSIRQVELWEKMYWDKQKKQ 527


>ref|XP_002321919.2| hypothetical protein POPTR_0015s13090g [Populus trichocarpa]
            gi|550322617|gb|EEF06046.2| hypothetical protein
            POPTR_0015s13090g [Populus trichocarpa]
          Length = 506

 Score =  695 bits (1793), Expect = 0.0
 Identities = 330/521 (63%), Positives = 394/521 (75%), Gaps = 6/521 (1%)
 Frame = +2

Query: 572  IWFPFLPKKAPARSSLAIFFLVVLCIGAFVYTRLLDSSVPS----IAEYLPQKSIFNAIS 739
            IW PF+  K PARSS+ IF L+ L +GA V TRLLDS+V      +  +L  K       
Sbjct: 10   IWRPFM--KLPARSSVVIFLLLFLIVGALVCTRLLDSTVTGGSSVVKTFLTDK------- 60

Query: 740  FRNNPRDAPKIVENSSRHEIDVPLNCSVSN-ARACPPNYYPSKFSTQNPDPSSRPQ-PTC 913
                    PKI  N + +    P+NC+  N  R CP NY      T   +   RP   TC
Sbjct: 61   -------IPKITRNKTEY----PVNCTAFNPTRKCPLNY-----PTNTQEGPDRPSVSTC 104

Query: 914  PDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGKVYMERYSKSFQSRDTFTLW 1093
            P++FRWIHEDLRPW  TGISR+MVE A+RTANFRLVI++GK YMERY KSFQ+RDTFT+W
Sbjct: 105  PEHFRWIHEDLRPWAHTGISRDMVERAKRTANFRLVIVNGKAYMERYRKSFQTRDTFTVW 164

Query: 1094 GILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAPLPLFRYCGDDTTLDIVFPD 1273
            GI+QLLR+YPG++PDLD+MFDCVDWPVI+   YSG NAT+P  LFRYCGDD +LD+VFPD
Sbjct: 165  GIIQLLRKYPGKLPDLDMMFDCVDWPVIRSSDYSGPNATSPPALFRYCGDDDSLDVVFPD 224

Query: 1274 WSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKGNPVVAKTRMDLLKCNVSDK 1453
            WSFWGWPEINIKPW  LS DLK+GN+  KW++REPYAYWKGNP VA TR DL+KC+ S+ 
Sbjct: 225  WSFWGWPEINIKPWESLSNDLKEGNKITKWMEREPYAYWKGNPSVAATRQDLMKCHASET 284

Query: 1454 QDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILACDSLALLVK 1633
            QDWNARVYAQDW+KE +QGY+QS+LA+QC+H+YKIYIEGSAWSVSEKYILACDS+ LLVK
Sbjct: 285  QDWNARVYAQDWIKESQQGYQQSNLANQCVHKYKIYIEGSAWSVSEKYILACDSVTLLVK 344

Query: 1634 PRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIEEAQAIGKAASRFIQDELAM 1813
            P YYDF TRSL+P +HYWP+K+DDKCRSIK+AV+WGN H EEAQA+GKAAS FIQ++L M
Sbjct: 345  PHYYDFFTRSLVPNRHYWPIKEDDKCRSIKFAVEWGNNHSEEAQAMGKAASEFIQEDLKM 404

Query: 1814 NYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAKGLEKTFMMDSTARGPSIAT 1993
            +YVYDYMFH            PTIP RA+ELC+E MACPA GLEK FMMDS    P+  +
Sbjct: 405  DYVYDYMFHLLNEYAKLLTFKPTIPGRAIELCAEAMACPANGLEKKFMMDSMVMSPADTS 464

Query: 1994 PCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQNKQ 2116
            PC MPP YD  +LHS+ +R  N IKQVE+ EK+YWD+Q KQ
Sbjct: 465  PCTMPPPYDPLSLHSVFQRNGNSIKQVESWEKEYWDNQIKQ 505


>gb|EXB29382.1| hypothetical protein L484_001025 [Morus notabilis]
          Length = 515

 Score =  693 bits (1789), Expect = 0.0
 Identities = 330/526 (62%), Positives = 407/526 (77%), Gaps = 6/526 (1%)
 Frame = +2

Query: 557  HFAEKIWFPFLPKKAPARSSLAIF-FLVVLCIGAFVYTRLLDSSV---PSIAEYLPQKSI 724
            +F + IW PFL  K+ A+S   +F FL  L +GAFV TRLL+++    P+IA+       
Sbjct: 17   NFTDTIWRPFL--KSSAKSPAVLFVFLFFLFVGAFVSTRLLNTANLAGPTIAK------- 67

Query: 725  FNAISFRNNPRDAPKIVENSSRHEIDVPLNCSV-SNARACPPNYYPSKFSTQNPDPSSRP 901
                            +   SR  I +PLNCS  S  R CP NY P+ ++ Q  D   RP
Sbjct: 68   ----------------ISEKSRQRIGIPLNCSAYSPTRTCPANY-PTTYNKQ--DDLDRP 108

Query: 902  Q-PTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGKVYMERYSKSFQSRD 1078
              PTCPDYFRWI+EDLRPW  TGISR+MVE A+RTANFRLVI++GK Y+E + K+FQ+RD
Sbjct: 109  LLPTCPDYFRWIYEDLRPWAYTGISRDMVERAKRTANFRLVIVNGKAYVETFQKAFQTRD 168

Query: 1079 TFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAPLPLFRYCGDDTTLD 1258
             FTLWGILQLLR+YPG+VPDL+LMFDCVDWPV+  +AYSG +AT P PLFRYCGDD+TLD
Sbjct: 169  VFTLWGILQLLRKYPGRVPDLELMFDCVDWPVVLSKAYSGPDATTPPPLFRYCGDDSTLD 228

Query: 1259 IVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKGNPVVAKTRMDLLKC 1438
            IVFPDWSFWGWPE NIKPW  L K+L++GN++ KWV+RE YAYWKGNPVVA TR DLLKC
Sbjct: 229  IVFPDWSFWGWPETNIKPWEALLKELEEGNKKSKWVEREAYAYWKGNPVVAATRQDLLKC 288

Query: 1439 NVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILACDSL 1618
            NVSDKQDWNAR+YAQDW+KE K+GYKQSDLA+QCIHRYKIYIEGSAWSVSEKYILACDS+
Sbjct: 289  NVSDKQDWNARLYAQDWLKESKEGYKQSDLANQCIHRYKIYIEGSAWSVSEKYILACDSV 348

Query: 1619 ALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIEEAQAIGKAASRFIQ 1798
             L+VKP YYDF TR L+P+QHYWP+KDDDKCRSIK+AVDWGN H ++A++IGKAASRFIQ
Sbjct: 349  TLIVKPHYYDFFTRGLVPMQHYWPIKDDDKCRSIKFAVDWGNSHKKKAKSIGKAASRFIQ 408

Query: 1799 DELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAKGLEKTFMMDSTARG 1978
            D+L M YVYDYMFH            P+IP +AVE CSE MAC A+G+ K FMM+S  +G
Sbjct: 409  DDLKMEYVYDYMFHLLNEYAKLLKFKPSIPEKAVEFCSESMACTAEGIGKKFMMESMVKG 468

Query: 1979 PSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQNKQ 2116
            P+ ++PC MPP+Y+ ++L+S++++K +LI+QVE  + +YW++QNKQ
Sbjct: 469  PADSSPCTMPPSYNPSSLYSLIQKKTSLIEQVEMWQNKYWENQNKQ 514


>ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 538

 Score =  681 bits (1758), Expect = 0.0
 Identities = 334/553 (60%), Positives = 399/553 (72%), Gaps = 16/553 (2%)
 Frame = +2

Query: 506  MREQEQGAVLNGSGINKHFAEKIWFPFLPKKAPARSSLA-IFFLVVLCIGAFVYTRLLDS 682
            MRE   G+  N       F + I+ PF+  K+PA  SL  +FF + L  G F+ TRLL S
Sbjct: 1    MREGSGGSFRNRFSHYAFFPDHIFKPFI--KSPATFSLLFLFFSLFLLAGVFLSTRLLHS 58

Query: 683  SVPSI---------AEYLPQKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSN-- 829
            S  +          ++Y P     N     +NP   P+      R +++  L+C+  N  
Sbjct: 59   STTAYNLTIKGSGKSQYYPT----NTSQVPHNPNHQPR------RPQVEFTLHCASFNNI 108

Query: 830  -ARACPPNYYPSKFST---QNPDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMAR 997
               ACP  +YP+ ++T   QNP  SS     CPDYFRWIHEDLRPW  TGI+R  +E  +
Sbjct: 109  TPGACPA-HYPTNWTTDEDQNPPSSSS---ACPDYFRWIHEDLRPWARTGITRATLEAGQ 164

Query: 998  RTANFRLVILDGKVYMERYSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVI 1177
            RTANFRL+IL+GK Y+E Y KSFQ+RDTFT+WGILQLLRRYPG+VPDLDLMFDCVDWPVI
Sbjct: 165  RTANFRLLILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVI 224

Query: 1178 KKEAYSGANATAPLPLFRYCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERI 1357
                +SG N   P PLFRYCGDD T DIVFPDWSFWGWPEINIKPW PL KD+K+GN+RI
Sbjct: 225  LTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWSFWGWPEINIKPWEPLLKDIKEGNKRI 284

Query: 1358 KWVDREPYAYWKGNPVVAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQ 1537
             W  REPYAYWKGNP VA TR DL+KCNVSD+QDWNARV+AQDW KE ++GYKQSDL++Q
Sbjct: 285  PWKSREPYAYWKGNPEVADTRKDLIKCNVSDQQDWNARVFAQDWTKESQEGYKQSDLSNQ 344

Query: 1538 CIHRYKIYIEGSAWSVSEKYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRS 1717
            C+HRYKIYIEGSAWSVSEKYILACDS+ L+VKP YYDF TR LMP+ HYWPVKDDDKC+S
Sbjct: 345  CLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKS 404

Query: 1718 IKYAVDWGNGHIEEAQAIGKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRA 1897
            IK+AVDWGN H ++AQAIGKAAS FIQ+EL M+YVYDYMFH            PT+PP A
Sbjct: 405  IKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPTLPPNA 464

Query: 1898 VELCSELMACPAKGLEKTFMMDSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVE 2077
            +ELCSE MACPA+GL K FM +S  + P+ + PC MPP YD A+LH +L RKEN IKQVE
Sbjct: 465  IELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPPPYDPASLHFVLSRKENSIKQVE 524

Query: 2078 TREKQYWDSQNKQ 2116
              E  +W++Q+KQ
Sbjct: 525  KWETSFWNTQSKQ 537


>gb|EOY23194.1| Glycosyltransferase isoform 2 [Theobroma cacao]
          Length = 498

 Score =  675 bits (1742), Expect = 0.0
 Identities = 329/514 (64%), Positives = 386/514 (75%), Gaps = 3/514 (0%)
 Frame = +2

Query: 506  MREQ--EQGAVLNGSGINKHFAEKIWFPFLPKKAPARSSLAIFFLVVLCIGAFVYTRLLD 679
            MRE   +QG   NGSG+   F E IW PF   K+ ARSS      +VL +GAF  T LLD
Sbjct: 5    MRENNMQQG---NGSGLFSQFTETIWRPFA--KSSARSSAIFVVFIVLLVGAFS-THLLD 58

Query: 680  SSVPSIAEYLPQKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSN-ARACPPNYY 856
            ++  +    L QK + +  + R NP+          R + D+PLNC+  N  RACP N  
Sbjct: 59   TT--TFLGSLAQKPMLSTRTSRGNPK--------KPRQQRDIPLNCTARNLTRACPTND- 107

Query: 857  PSKFSTQNPDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGK 1036
            P+    +   P S     CPDYFRWIHEDLRPW  TGIS +M++ A +TANFRLV+++G+
Sbjct: 108  PTAIEEE---PDSSLNAMCPDYFRWIHEDLRPWAYTGISMDMLKRAEKTANFRLVVVNGR 164

Query: 1037 VYMERYSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAP 1216
             Y++RY +SFQ+RD FTLWGILQLLRRYPG+VPDLDLMFDCVDWPVIK   Y G NAT P
Sbjct: 165  AYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVDWPVIKTSDYGGPNATTP 224

Query: 1217 LPLFRYCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKG 1396
             PLFRYC DD TLDIVFPDWSFWGWPEINIKPWVPL  DL +GN+R+ W  REP+AYWKG
Sbjct: 225  PPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEGNKRMGWEGREPHAYWKG 284

Query: 1397 NPVVAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSA 1576
            NP VA TR DLLKCNVSDKQDW ARVYAQDW +E +QGYKQSDLA+QCIHR+KIYIEGSA
Sbjct: 285  NPNVATTRQDLLKCNVSDKQDWGARVYAQDWARESQQGYKQSDLANQCIHRFKIYIEGSA 344

Query: 1577 WSVSEKYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIE 1756
            WSVSEKYILACDSL LLVKPRYYDF TRSL P++HYWP+KDDDKCRSIK+AVDWGNGH +
Sbjct: 345  WSVSEKYILACDSLTLLVKPRYYDFFTRSLEPMRHYWPIKDDDKCRSIKHAVDWGNGHQQ 404

Query: 1757 EAQAIGKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAK 1936
            EAQAIGKAAS FI++ L M+YVYDYMFH            PT+P +AVELCSE MACPA+
Sbjct: 405  EAQAIGKAASEFIKEGLKMDYVYDYMFHLLNEYAKLLRYKPTVPRKAVELCSETMACPAE 464

Query: 1937 GLEKTFMMDSTARGPSIATPCAMPPAYDSATLHS 2038
            GL+K FMM+S  +GPS+ +PC MPP YD A+L++
Sbjct: 465  GLQKKFMMESMVKGPSVTSPCTMPPPYDPASLYA 498


>ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 538

 Score =  675 bits (1742), Expect = 0.0
 Identities = 331/553 (59%), Positives = 398/553 (71%), Gaps = 16/553 (2%)
 Frame = +2

Query: 506  MREQEQGAVLNGSGINKHFAEKIWFPFLPKKAPARSSLA-IFFLVVLCIGAFVYTRLLDS 682
            MRE   G+  N       F + I+ PF+  K+PA  SL  +FF + L  G F+ TRLL S
Sbjct: 1    MREGSGGSFRNRFSHYAFFPDHIFKPFI--KSPATFSLLFLFFSLFLLAGVFLSTRLLHS 58

Query: 683  SVPSI---------AEYLPQKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSN-- 829
            S  +          ++Y P     N     +NP   P+      R +++  L+C+  N  
Sbjct: 59   STTAYNLTIKGSGKSQYYPT----NTSQVPHNPNHQPR------RPQVEFTLHCASFNNI 108

Query: 830  -ARACPPNYYPSKFST---QNPDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMAR 997
               ACP  +YP+ ++T   QNP  SS     CPDYFRWIHEDLRPW  TGI+R  +E  +
Sbjct: 109  TPGACPA-HYPTNWTTDEDQNPPSSSS---ACPDYFRWIHEDLRPWARTGITRATLEAGQ 164

Query: 998  RTANFRLVILDGKVYMERYSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVI 1177
            RTANFRL+IL+GK Y+E Y KSFQ+RDTFT+WGILQLLRRYPG+VPDLDLMFDCVDWPVI
Sbjct: 165  RTANFRLLILNGKAYVETYKKSFQTRDTFTVWGILQLLRRYPGKVPDLDLMFDCVDWPVI 224

Query: 1178 KKEAYSGANATAPLPLFRYCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERI 1357
                +SG N   P PLFRYCGDD T DIVFPDWSFWGWPEINIKPW PL KD+K+GN+RI
Sbjct: 225  LTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWSFWGWPEINIKPWEPLLKDIKEGNKRI 284

Query: 1358 KWVDREPYAYWKGNPVVAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQ 1537
             W  R+PYAYWKGNP VA TR DL+KCNVSD+QDWNARV+AQDW KE ++GYKQS+L++Q
Sbjct: 285  PWKSRQPYAYWKGNPEVADTRKDLIKCNVSDQQDWNARVFAQDWTKESQEGYKQSNLSNQ 344

Query: 1538 CIHRYKIYIEGSAWSVSEKYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRS 1717
            C+HRYKIYIEGSAWSVSEKYILACDS+ L+VKP YYDF TR LMP+ HYWPVKDDDKC+S
Sbjct: 345  CLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMPVHHYWPVKDDDKCKS 404

Query: 1718 IKYAVDWGNGHIEEAQAIGKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRA 1897
            IK+AVDWGN H ++AQAIGKAAS FIQ+EL M+YVYDYMFH            PT+PP A
Sbjct: 405  IKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFHLLSEYSKLLTFKPTLPPNA 464

Query: 1898 VELCSELMACPAKGLEKTFMMDSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVE 2077
            +ELCSE MACPA+GL K FM +S  + P+ + PC MP  YD A+LH +L RKEN IKQVE
Sbjct: 465  IELCSEAMACPAEGLTKKFMTESLVKRPAESNPCTMPSPYDPASLHFVLSRKENSIKQVE 524

Query: 2078 TREKQYWDSQNKQ 2116
              E  +W++Q+KQ
Sbjct: 525  KWETSFWNTQSKQ 537


>gb|EMJ21936.1| hypothetical protein PRUPE_ppa023179mg [Prunus persica]
          Length = 502

 Score =  672 bits (1735), Expect = 0.0
 Identities = 310/500 (62%), Positives = 387/500 (77%), Gaps = 1/500 (0%)
 Frame = +2

Query: 620  AIFFLVVLCIGAFVYTRLLDSSVPSIAEYLPQKSIFNAISFRNNPRDAPKIVENSSRHEI 799
            AIF ++ + +GA + TRLL+ +  ++   +  ++  +  S+ +   + PK      R ++
Sbjct: 9    AIFVVLFVLVGALICTRLLNYNTETLLGAISGQARTSQ-SYPHKTGEIPK----KPRGKL 63

Query: 800  DVPLNCSVSNARACPPNYYPSKFST-QNPDPSSRPQPTCPDYFRWIHEDLRPWRETGISR 976
            ++PLNC   + R   P+ YP+ F   QNP+  S   PTCP+YFRWIHEDLRPW  TGI+R
Sbjct: 64   EIPLNCPAYDLRGTCPSNYPTTFHPEQNPERPS--PPTCPEYFRWIHEDLRPWARTGITR 121

Query: 977  EMVEMARRTANFRLVILDGKVYMERYSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFD 1156
            EMVE A RTANF+ VI++GK Y+E+Y K+FQ+RD FT+WG LQLLRRYPGQVPDL+LMFD
Sbjct: 122  EMVERANRTANFKFVIVNGKAYVEQYEKAFQTRDVFTVWGFLQLLRRYPGQVPDLELMFD 181

Query: 1157 CVDWPVIKKEAYSGANATAPLPLFRYCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDL 1336
            CVDWPVI    YSG NATAP PLFRYC DD TLDIVFPDWSFWGW EINI+PW  L ++L
Sbjct: 182  CVDWPVIPSHEYSGPNATAPPPLFRYCADDNTLDIVFPDWSFWGWAEINIRPWEVLFEEL 241

Query: 1337 KDGNERIKWVDREPYAYWKGNPVVAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYK 1516
            K+GN+R  W++REPYAYWKGNP +A+TR DL+KCNVS++ DWNAR+YAQDW +E K+GY 
Sbjct: 242  KEGNKRKTWLEREPYAYWKGNPDIAETRQDLIKCNVSEEHDWNARLYAQDWDRESKEGYN 301

Query: 1517 QSDLASQCIHRYKIYIEGSAWSVSEKYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVK 1696
            +SDLASQCIHRYKIYIEGSAWSVSEKYILACDS+ L+VKPRYYDF TR LMP++HYWP+K
Sbjct: 302  KSDLASQCIHRYKIYIEGSAWSVSEKYILACDSVTLIVKPRYYDFFTRRLMPVEHYWPIK 361

Query: 1697 DDDKCRSIKYAVDWGNGHIEEAQAIGKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXX 1876
            DDDKCRSIK++VDWGN H  +AQAIGKA+S  IQ+EL M YVYDYMFH            
Sbjct: 362  DDDKCRSIKFSVDWGNTHRRKAQAIGKASSNLIQEELKMEYVYDYMFHLLNEYAKLLQFK 421

Query: 1877 PTIPPRAVELCSELMACPAKGLEKTFMMDSTARGPSIATPCAMPPAYDSATLHSILERKE 2056
            PT+P +AVELCSE MAC A+G EK FM+ S  +GP+++ PCAMPP YD ++L ++L RKE
Sbjct: 422  PTVPKKAVELCSEAMACQAEGTEKKFMLQSLVKGPAVSEPCAMPPPYDPSSLFAVLRRKE 481

Query: 2057 NLIKQVETREKQYWDSQNKQ 2116
            N IKQVET E+ YW+SQ+K+
Sbjct: 482  NSIKQVETWERNYWESQSKK 501


>ref|XP_004309206.1| PREDICTED: O-glucosyltransferase rumi homolog [Fragaria vesca subsp.
            vesca]
          Length = 508

 Score =  669 bits (1726), Expect = 0.0
 Identities = 313/511 (61%), Positives = 392/511 (76%), Gaps = 4/511 (0%)
 Frame = +2

Query: 596  KAPARSS-LAIFFLVVLCIGAFVYTRLL-DSSVPSIAEYLPQKSIFNAISFRNNPRDAPK 769
            + P RSS  A+  L++  +GAFV+TRLL +SS  ++     Q +I    + + +P+  P 
Sbjct: 2    ECPTRSSSAALVSLLLFFVGAFVFTRLLLNSSTHTLVGKSAQDAIVTIDASQLHPQQTP- 60

Query: 770  IVENSSRHEIDVPLNCSVSNARACPPNYYPSKFSTQNPDPS-SRP-QPTCPDYFRWIHED 943
            ++  +  + + +PL+C   N     P+ YP+   T +PD   +RP QPTCPD+FRWIHED
Sbjct: 61   VLPKTPPNTLKIPLDCPAYNLTGTCPSNYPT---TSSPDQDHNRPSQPTCPDFFRWIHED 117

Query: 944  LRPWRETGISREMVEMARRTANFRLVILDGKVYMERYSKSFQSRDTFTLWGILQLLRRYP 1123
            L+PW  TGI+R+  E A RTA F+LVI++GK Y ++Y K+FQSRDTFTLWGILQLLRRYP
Sbjct: 118  LKPWAYTGITRDTFEAANRTAAFKLVIVNGKAYYQKYVKAFQSRDTFTLWGILQLLRRYP 177

Query: 1124 GQVPDLDLMFDCVDWPVIKKEAYSGANATAPLPLFRYCGDDTTLDIVFPDWSFWGWPEIN 1303
            G+VPDL+LMFDCVDWPVI   +++G N+TAP PLFRYCGD+ TLDIVFPDWSFWGWPE N
Sbjct: 178  GKVPDLELMFDCVDWPVILSSSFTGPNSTAPPPLFRYCGDNNTLDIVFPDWSFWGWPETN 237

Query: 1304 IKPWVPLSKDLKDGNERIKWVDREPYAYWKGNPVVAKTRMDLLKCNVSDKQDWNARVYAQ 1483
            I PW  L + L +GN R +WVDREPYAYWKGNP VA+TR DLLKCNVS++ +WNARVYAQ
Sbjct: 238  IAPWENLLEQLVEGNRRSRWVDREPYAYWKGNPKVAETRQDLLKCNVSEEHEWNARVYAQ 297

Query: 1484 DWVKEQKQGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILACDSLALLVKPRYYDFSTRS 1663
            +W  E+K G+K+SDLASQC+HRYKIYIEGSAWSVS KYILACDS+ LLV+PRY DF  R 
Sbjct: 298  NWTLEEKAGFKKSDLASQCVHRYKIYIEGSAWSVSNKYILACDSVTLLVRPRYNDFFMRG 357

Query: 1664 LMPLQHYWPVKDDDKCRSIKYAVDWGNGHIEEAQAIGKAASRFIQDELAMNYVYDYMFHX 1843
            LMP+ HYWPV+DDDKCRSIKYAVDWGN H ++AQAIGKAAS +I+++L M+YVYDYMFH 
Sbjct: 358  LMPVHHYWPVRDDDKCRSIKYAVDWGNSHQKKAQAIGKAASNYIKEDLKMDYVYDYMFHL 417

Query: 1844 XXXXXXXXXXXPTIPPRAVELCSELMACPAKGLEKTFMMDSTARGPSIATPCAMPPAYDS 2023
                       PT+PP A+ELCSE MAC A+GLEK FMM+S  +GP++ +PC MPP YD 
Sbjct: 418  LSEYAKLLRFKPTVPPEAIELCSETMACQAEGLEKKFMMESMVKGPAVTSPCTMPPPYDP 477

Query: 2024 ATLHSILERKENLIKQVETREKQYWDSQNKQ 2116
            A+L S+L R+ N+IK+VET EK YW+ QNKQ
Sbjct: 478  ASLFSVLRRRSNIIKRVETLEKNYWEHQNKQ 508


>ref|XP_006421921.1| hypothetical protein CICLE_v10004696mg [Citrus clementina]
            gi|557523794|gb|ESR35161.1| hypothetical protein
            CICLE_v10004696mg [Citrus clementina]
          Length = 536

 Score =  667 bits (1720), Expect = 0.0
 Identities = 310/532 (58%), Positives = 400/532 (75%), Gaps = 4/532 (0%)
 Frame = +2

Query: 533  LNGSGINKHFAEKIWFPFLPKKAPARSSLAIFFLVVLCIGAFVYTRLLDSSVPSIAEYLP 712
            ++GSG + HF + IW  F+   +PA+S +   F+VVL +GA V TRLLDS+     +   
Sbjct: 16   VHGSGHSGHFTDTIWRQFV--MSPAKSYVLFSFIVVLLLGALVSTRLLDSAA---LDGGA 70

Query: 713  QKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSNA----RACPPNYYPSKFSTQN 880
             + + +  S   +PR     +    R++I+ PLNC+ + +    ++CP  Y P+ ++ + 
Sbjct: 71   NRVVTDRKSLTFDPR-----ITKKPRNKIEYPLNCTAAGSHTHTKSCPGTY-PTSYAPEE 124

Query: 881  PDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGKVYMERYSK 1060
             + ++ P  TCP+YFRWIHEDLRPW  TGI+REMVE AR+TANFRLVI+ GK Y+E Y+K
Sbjct: 125  DNDATSPS-TCPEYFRWIHEDLRPWARTGITREMVERARKTANFRLVIVKGKAYVETYTK 183

Query: 1061 SFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAPLPLFRYCG 1240
            +FQSRDTFTLWGILQLLRRYPG++PDLDLMFDCVDWPV+ + AY   +A AP PLFRYC 
Sbjct: 184  AFQSRDTFTLWGILQLLRRYPGRIPDLDLMFDCVDWPVVLRNAYCAPDAPAPPPLFRYCA 243

Query: 1241 DDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKGNPVVAKTR 1420
            +D T DIVFPDWSFWGWPE+NIK W P  KDL++GN RIKW DREPYAYWKGNP VA TR
Sbjct: 244  NDQTYDIVFPDWSFWGWPEVNIKSWEPQLKDLEEGNGRIKWSDREPYAYWKGNPTVAPTR 303

Query: 1421 MDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSAWSVSEKYI 1600
             DL+KCNVS+ Q+WNARV+AQDW+KEQ++GYKQSDLASQC  R+KIYIEGSAWSVSEKYI
Sbjct: 304  QDLMKCNVSEGQEWNARVFAQDWIKEQQEGYKQSDLASQCRDRFKIYIEGSAWSVSEKYI 363

Query: 1601 LACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIEEAQAIGKA 1780
            LACDS+ L+V P+YYDF TR LMPL HYWP+ D DKCRSIK+AVDWGN H ++A+A+G+A
Sbjct: 364  LACDSVTLIVTPKYYDFYTRGLMPLHHYWPINDHDKCRSIKFAVDWGNSHKKKAKAMGRA 423

Query: 1781 ASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAKGLEKTFMM 1960
            AS+FIQDEL ++YVYDYMFH            PT+PP AVE C+E +AC  +G  + FM 
Sbjct: 424  ASKFIQDELKLDYVYDYMFHLLNQYSKLLRYQPTVPPEAVEYCAERLACAEEGPARKFME 483

Query: 1961 DSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQNKQ 2116
            +S  + P   +PC +PP+YD ++L+ +L++KEN I QVE+ ++ YW++Q KQ
Sbjct: 484  ESLVQSPKETSPCTLPPSYDPSSLNDVLQKKENSILQVESWQRAYWENQTKQ 535


>ref|XP_006490390.1| PREDICTED: KDEL motif-containing protein 2-like [Citrus sinensis]
          Length = 536

 Score =  664 bits (1712), Expect = 0.0
 Identities = 308/532 (57%), Positives = 399/532 (75%), Gaps = 4/532 (0%)
 Frame = +2

Query: 533  LNGSGINKHFAEKIWFPFLPKKAPARSSLAIFFLVVLCIGAFVYTRLLDSSVPSIAEYLP 712
            ++GSG + HF + IW  F+   +PA+S +   F+VVL +GA V TRLLDS+     +   
Sbjct: 16   VHGSGHSGHFTDTIWRQFV--MSPAKSYVLFSFIVVLFLGALVSTRLLDSAA---LDGGA 70

Query: 713  QKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSNA----RACPPNYYPSKFSTQN 880
             + + +  S   +PR     +    R++++ PLNC+ + +    ++CP  Y P+ ++ + 
Sbjct: 71   NRVVTDRKSLTFDPR-----ITKKPRNKVEYPLNCTAAGSHTHTKSCPGTY-PTSYAPEE 124

Query: 881  PDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGKVYMERYSK 1060
             + ++ P  TCP+YFRWIHEDLRPW  TGI+REMVE AR+TANFRLVI+ GK Y+E Y+K
Sbjct: 125  DNDATSPS-TCPEYFRWIHEDLRPWARTGITREMVERARKTANFRLVIVKGKAYVETYTK 183

Query: 1061 SFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAPLPLFRYCG 1240
            +FQSRDTFTLWGILQLLRRYPG++PDLDLMFDCVDWPV+ + AY   +A AP PLFRYC 
Sbjct: 184  AFQSRDTFTLWGILQLLRRYPGRIPDLDLMFDCVDWPVVLRNAYCAPDAPAPPPLFRYCA 243

Query: 1241 DDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKGNPVVAKTR 1420
            +D T DIVFPDWSFWGWPE+NIK W P  KDL++GN RIKW DREPYAYWKGNP VA TR
Sbjct: 244  NDQTYDIVFPDWSFWGWPEVNIKSWEPQLKDLEEGNRRIKWSDREPYAYWKGNPTVAPTR 303

Query: 1421 MDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSAWSVSEKYI 1600
             DL+KCNVS+ Q+WNARV+AQDW+KEQ++GYKQSDLASQC  R+KIYIEGSAWSVSEKYI
Sbjct: 304  QDLMKCNVSEGQEWNARVFAQDWIKEQQEGYKQSDLASQCRDRFKIYIEGSAWSVSEKYI 363

Query: 1601 LACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIEEAQAIGKA 1780
            LACDS+ L+V P+YYDF TR LMPL HYWP+ D DKCRSIK+AVDWGN H ++A+A+G+A
Sbjct: 364  LACDSVTLIVTPKYYDFYTRGLMPLHHYWPINDHDKCRSIKFAVDWGNSHKKKAKAMGRA 423

Query: 1781 ASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAKGLEKTFMM 1960
            AS+FIQDEL ++YVYDYMFH            PT+ P AVE C+E +AC  +G  + FM 
Sbjct: 424  ASKFIQDELKLDYVYDYMFHLLNQYSKLLRYQPTVSPEAVEYCAERLACAEEGPARKFME 483

Query: 1961 DSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQNKQ 2116
            +S  + P   +PC +PP+YD ++L+ +L++KEN I QVE+ ++ YW++Q KQ
Sbjct: 484  ESLVQSPKETSPCTLPPSYDPSSLNDVLQKKENSILQVESWQRAYWENQTKQ 535


>ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
            communis] gi|223549902|gb|EEF51389.1| KDEL
            motif-containing protein 1 precursor, putative [Ricinus
            communis]
          Length = 506

 Score =  656 bits (1692), Expect = 0.0
 Identities = 310/529 (58%), Positives = 382/529 (72%), Gaps = 1/529 (0%)
 Frame = +2

Query: 533  LNGSGINKHFAEKIWFPFLPKKAPARSSLAIFFLVVLCIGAFVYTRLLDSSVPSIAEYLP 712
            + GSG+  H  E I  P L    P +SS A   LV L +G  + TR              
Sbjct: 1    MQGSGVVGHLTEPIMRPLL--LLPGKSSAAFLLLVFLLVGMLLSTRFQ------------ 46

Query: 713  QKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSN-ARACPPNYYPSKFSTQNPDP 889
                FNAI+  + P+    + +  +R  + +PLNC   N  R CP +Y     ST + DP
Sbjct: 47   ----FNAITGYSAPKSTVPLEKPDNR--LVIPLNCHALNLTRTCPTDYP----STSSQDP 96

Query: 890  SSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGKVYMERYSKSFQ 1069
            +    PTCP+YFRWIHEDLRPW  TGI+RE +E A+ TANFRLVIL+G  Y+E Y KSFQ
Sbjct: 97   NRSSPPTCPEYFRWIHEDLRPWVRTGITRETMERAKATANFRLVILNGTAYLEMYEKSFQ 156

Query: 1070 SRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAPLPLFRYCGDDT 1249
            +RD FTLWGILQLLR+YPG+VPDL++MFDCVDWPV+K   YSG++A +P PLFRYCG+D 
Sbjct: 157  TRDVFTLWGILQLLRKYPGRVPDLEMMFDCVDWPVVKSVDYSGSSAISPPPLFRYCGNDE 216

Query: 1250 TLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKGNPVVAKTRMDL 1429
            TLDIVFPDWS+WGW E NIKPW  + KDLK+GN+R KW +REPYAYWKGNP VA+TR+DL
Sbjct: 217  TLDIVFPDWSYWGWVETNIKPWEKIVKDLKEGNQRSKWKEREPYAYWKGNPNVAETRLDL 276

Query: 1430 LKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILAC 1609
            +KCNVS + DWNAR+Y QDWV+E +QGYKQSDLA+QC HRYKIYIEGSAWSVSEKYILAC
Sbjct: 277  MKCNVSQEHDWNARLYTQDWVRESQQGYKQSDLANQCNHRYKIYIEGSAWSVSEKYILAC 336

Query: 1610 DSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIEEAQAIGKAASR 1789
            DS+ L+VKP YYDF TR LMP  HYWP+K+DDKC+SIK+AVDWGN H ++AQAIGKAAS 
Sbjct: 337  DSVTLIVKPHYYDFFTRGLMPNHHYWPIKEDDKCKSIKFAVDWGNSHKQKAQAIGKAASD 396

Query: 1790 FIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAKGLEKTFMMDST 1969
            FIQ++L M+YVYDYMFH            PTIP  A +LC+E MACPA GL K  MMDS 
Sbjct: 397  FIQEDLKMDYVYDYMFHLLNEYARLLTFKPTIPQNATKLCAETMACPADGLAKKLMMDSM 456

Query: 1970 ARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQNKQ 2116
              GP+  +PC MP +YD ++L+++   K N IKQ+E  E ++W++Q+KQ
Sbjct: 457  VEGPADTSPCTMPSSYDPSSLYNVTREKVNAIKQIELWENKHWENQSKQ 505


>gb|EMJ21654.1| hypothetical protein PRUPE_ppa005169mg [Prunus persica]
          Length = 474

 Score =  655 bits (1691), Expect = 0.0
 Identities = 296/417 (70%), Positives = 348/417 (83%), Gaps = 1/417 (0%)
 Frame = +2

Query: 869  STQNPDPSSRP-QPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGKVYM 1045
            S Q+PD   RP  PTCP+YFRWIHEDLRPW  TGI+R+M++ A+RTANF+LVI++GK Y+
Sbjct: 60   SRQDPD---RPLPPTCPEYFRWIHEDLRPWAHTGITRDMIQRAKRTANFKLVIVNGKAYV 116

Query: 1046 ERYSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAPLPL 1225
            E+Y KSFQ+RD FT+WGILQLLRRYPGQVPDL+LMFDCVDWPVI    YSG NATAP PL
Sbjct: 117  EKYQKSFQTRDVFTMWGILQLLRRYPGQVPDLELMFDCVDWPVISSNDYSGPNATAPPPL 176

Query: 1226 FRYCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKGNPV 1405
            FRYCGDD +LDIVFPDWSFWGW EINI PW  L KDL++GN+R +W+DR PYAYWKGNP 
Sbjct: 177  FRYCGDDNSLDIVFPDWSFWGWAEINIMPWEVLLKDLEEGNKRRRWIDRAPYAYWKGNPS 236

Query: 1406 VAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSAWSV 1585
            VA TR DLLKCNVSD+QDWNARVYAQDW++E  +GYKQSDLASQC+ RYKIYIEGSAWSV
Sbjct: 237  VAATRQDLLKCNVSDQQDWNARVYAQDWLRESSEGYKQSDLASQCVDRYKIYIEGSAWSV 296

Query: 1586 SEKYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIEEAQ 1765
            S+KYILACDS+ L+VKPRYYDF TRSLMP+ HYWP+KDDDKCRSIK+AVDWGN H ++AQ
Sbjct: 297  SDKYILACDSVTLIVKPRYYDFFTRSLMPVHHYWPIKDDDKCRSIKFAVDWGNSHKQKAQ 356

Query: 1766 AIGKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAKGLE 1945
            AIGKAAS+ IQ+EL M+YVYDYMFH            PTIP +A+ELCSE MAC A+G E
Sbjct: 357  AIGKAASKLIQEELKMDYVYDYMFHLLNEYAKLLQFKPTIPRKAIELCSEAMACQAQGTE 416

Query: 1946 KTFMMDSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQNKQ 2116
            K FMM+S  +GP+++ PC MPP Y  A+L ++L R  N IKQVET EK+YW++Q+KQ
Sbjct: 417  KKFMMESMVKGPAVSNPCTMPPPYGPASLFAVLRRNANSIKQVETWEKKYWENQSKQ 473


>ref|XP_006287442.1| hypothetical protein CARUB_v10000648mg [Capsella rubella]
            gi|482556148|gb|EOA20340.1| hypothetical protein
            CARUB_v10000648mg [Capsella rubella]
          Length = 544

 Score =  641 bits (1654), Expect = 0.0
 Identities = 301/538 (55%), Positives = 388/538 (72%), Gaps = 12/538 (2%)
 Frame = +2

Query: 536  NGS--GINKHFAEKIWFPFLPK---KAPARSSLAIFFLVVLCIGAFVYTRLL-DSSV--- 688
            NGS  G  ++F + +W PF+      +P RS   +  +++L +GAFV TRLL D +V   
Sbjct: 8    NGSSGGHCRYFIDAVWSPFVKSGFGSSPNRSYALVSLIILLVVGAFVSTRLLLDPTVLIE 67

Query: 689  -PSIAEYLPQKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSNA--RACPPNYYP 859
              ++A     K+  N IS +  PR A  I +N         L+CS +      CP N  P
Sbjct: 68   KEAVAATPKTKTQTNTISPKY-PRPATVITQNPKPQ---FTLHCSANETTGNTCPKNKDP 123

Query: 860  SKFSTQNPDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGKV 1039
            +  S  + D +  P  TCPDYFRWIHEDLRPW  TGI+RE +E A +TANFRL I+ GKV
Sbjct: 124  TTASFNDDDTNHPPTATCPDYFRWIHEDLRPWARTGITREALERANKTANFRLAIVGGKV 183

Query: 1040 YMERYSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAPL 1219
            Y+E++  +FQ+RD FT+WG LQLLR+YPG++PDL+LMFDCVDWPV++   ++G +A +P 
Sbjct: 184  YVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRAAEFAGVDAPSPP 243

Query: 1220 PLFRYCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKGN 1399
            PLFRYCG++ TLDIVFPDWSFWGW E+NIKPW  L K+L++GNE+I W++REPYAYWKGN
Sbjct: 244  PLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNEKINWINREPYAYWKGN 303

Query: 1400 PVVAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSAW 1579
            PVVA+TR DL+KCNVS++ +WNAR+YAQDW+KE K+GYKQSDLA+QC HRYKIYIEGSAW
Sbjct: 304  PVVAETRQDLMKCNVSEEHEWNARLYAQDWIKESKEGYKQSDLANQCHHRYKIYIEGSAW 363

Query: 1580 SVSEKYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIEE 1759
            SVSEKYILACDS+ LLVKP YYDF TR L+P  HYWPV++ DKCRSIK+AVDWGN HI++
Sbjct: 364  SVSEKYILACDSMTLLVKPHYYDFFTRGLLPAHHYWPVREKDKCRSIKFAVDWGNSHIQK 423

Query: 1760 AQAIGKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAKG 1939
            AQ IGKAAS FIQ EL M+YVYDYM+H            P +PP AVE+CSE MAC   G
Sbjct: 424  AQDIGKAASEFIQQELKMDYVYDYMYHLLNEYSKLLQFKPEVPPNAVEICSETMACTRSG 483

Query: 1940 LEKTFMMDSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQNK 2113
             E+ FM +S  + P+ + PCA+PP YD  +L+S+ +RK++   ++   E +YW  QN+
Sbjct: 484  NERKFMTESLVKHPAESGPCALPPPYDPVSLYSVAKRKQSTTARILHMEMKYWSKQNQ 541


>ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana]
            gi|10176852|dbj|BAB10058.1| unnamed protein product
            [Arabidopsis thaliana] gi|48310551|gb|AAT41837.1|
            At5g23850 [Arabidopsis thaliana]
            gi|62320258|dbj|BAD94534.1| putative protein [Arabidopsis
            thaliana] gi|332005839|gb|AED93222.1| uncharacterized
            protein AT5G23850 [Arabidopsis thaliana]
          Length = 542

 Score =  639 bits (1649), Expect = e-180
 Identities = 300/539 (55%), Positives = 389/539 (72%), Gaps = 13/539 (2%)
 Frame = +2

Query: 536  NGS--GINKHFAEKIWFPFLPKK---APARSSLAIFFLVVLCIGAFVYTRLL-DSSVPSI 697
            NGS  G ++ + + IW PF+      +P RS   +  L++L +GAF+ TRLL D++V   
Sbjct: 8    NGSAGGHSRTYTDTIWSPFVKSGLGISPNRSYALVSLLILLIVGAFISTRLLLDTTV--- 64

Query: 698  AEYLPQKSIFNAISFRNNPRDAPK------IVENSSRHEIDVPLNCSVSNARA-CPPNYY 856
               L +K+     +        PK      ++  S + E    L+CS +   A CP N Y
Sbjct: 65   --LLEKKAATTTTTKTQTQTITPKYPRPTTVITQSPKPEFT--LHCSANETTASCPSNKY 120

Query: 857  PSKFSTQNPDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGK 1036
            P+  S ++ D +  P  TCPDYFRWIHEDLRPW  TGI+RE +E A++TA FRL I+ GK
Sbjct: 121  PTTTSFEDDDTNHPPTATCPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGK 180

Query: 1037 VYMERYSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAP 1216
            +Y+E++  +FQ+RD FT+WG LQLLR+YPG++PDL+LMFDCVDWPV++   ++GANA +P
Sbjct: 181  IYVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSP 240

Query: 1217 LPLFRYCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKG 1396
             PLFRYCG++ TLDIVFPDWSFWGW E+NIKPW  L K+L++GNER KW++REPYAYWKG
Sbjct: 241  PPLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKG 300

Query: 1397 NPVVAKTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSA 1576
            NP+VA+TR DL+KCNVS++ +WNAR+YAQDW+KE K+GYKQSDLASQC HRYKIYIEGSA
Sbjct: 301  NPMVAETRQDLMKCNVSEEHEWNARLYAQDWIKESKEGYKQSDLASQCHHRYKIYIEGSA 360

Query: 1577 WSVSEKYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIE 1756
            WSVSEKYILACDS+ LLVKP YYDF TR L+P  HYWPV++ DKCRSIK+AVDWGN HI+
Sbjct: 361  WSVSEKYILACDSVTLLVKPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWGNSHIQ 420

Query: 1757 EAQAIGKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAK 1936
            +AQ IGKAAS FIQ +L M+YVYDYM+H            P IP  AVE+CSE MAC   
Sbjct: 421  KAQDIGKAASDFIQQDLKMDYVYDYMYHLLTEYSKLLQFKPEIPRNAVEICSETMACLRS 480

Query: 1937 GLEKTFMMDSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQNK 2113
            G E+ FM +S  + P+ + PCAMPP YD AT + +++RK++   ++   E +YW  QN+
Sbjct: 481  GNERKFMTESLVKQPADSGPCAMPPPYDPATYYEVVKRKQSTNMRILQWEMKYWSKQNQ 539


>ref|XP_006490389.1| PREDICTED: protein O-glucosyltransferase 1-like [Citrus sinensis]
          Length = 526

 Score =  637 bits (1642), Expect = e-180
 Identities = 302/535 (56%), Positives = 374/535 (69%)
 Frame = +2

Query: 512  EQEQGAVLNGSGINKHFAEKIWFPFLPKKAPARSSLAIFFLVVLCIGAFVYTRLLDSSVP 691
            +Q Q + ++G G + HF + IW  F+  ++PA+S     F+ +L +GA + TRLLDS+  
Sbjct: 2    QQRQSSNVHGPGHSGHFTDTIWRQFI--QSPAKSYALFAFIFLLLVGALISTRLLDSTAL 59

Query: 692  SIAEYLPQKSIFNAISFRNNPRDAPKIVENSSRHEIDVPLNCSVSNARACPPNYYPSKFS 871
                          +  R    DAP I +    ++ + PL C+  N     P  YP+ + 
Sbjct: 60   G-------GGTNKKLRDRKGQTDAPDITKKHY-NKTEYPLKCTDGNNTKTCPGTYPTSY- 110

Query: 872  TQNPDPSSRPQPTCPDYFRWIHEDLRPWRETGISREMVEMARRTANFRLVILDGKVYMER 1051
            T   D  S   PTCPDYFRWIHEDLRPW  TGI+REMVE A  TANFRLVI+ G+ Y++R
Sbjct: 111  TPEEDHDSPLAPTCPDYFRWIHEDLRPWARTGITREMVERANETANFRLVIVKGRAYVKR 170

Query: 1052 YSKSFQSRDTFTLWGILQLLRRYPGQVPDLDLMFDCVDWPVIKKEAYSGANATAPLPLFR 1231
              K+FQSRDTFTLWGILQLLRRYPG++PDLDLMFDCVDWP++ K  YS   A AP PLFR
Sbjct: 171  NIKAFQSRDTFTLWGILQLLRRYPGKIPDLDLMFDCVDWPILLKSNYSVPGAPAPPPLFR 230

Query: 1232 YCGDDTTLDIVFPDWSFWGWPEINIKPWVPLSKDLKDGNERIKWVDREPYAYWKGNPVVA 1411
            YC +D T DIVFPDWSFWGWPE+NIK W  + KDL++GN R+ W DREPYAYWKGNPVVA
Sbjct: 231  YCANDQTFDIVFPDWSFWGWPEVNIKSWGKILKDLEEGNRRMNWTDREPYAYWKGNPVVA 290

Query: 1412 KTRMDLLKCNVSDKQDWNARVYAQDWVKEQKQGYKQSDLASQCIHRYKIYIEGSAWSVSE 1591
             +R DL+KCNVS+ Q+WNAR+Y QDW KE+++GYKQSDLASQC HR+KIYIEGSAWSVSE
Sbjct: 291  SSRQDLMKCNVSEGQEWNARLYVQDWKKEKQKGYKQSDLASQCKHRFKIYIEGSAWSVSE 350

Query: 1592 KYILACDSLALLVKPRYYDFSTRSLMPLQHYWPVKDDDKCRSIKYAVDWGNGHIEEAQAI 1771
            KYILACDS+ L V P Y DF TR L+P+ H+WP+   DKCRSIK+AVDWGN H  +AQ I
Sbjct: 351  KYILACDSVTLYVTPNYTDFFTRGLIPMHHFWPINVYDKCRSIKFAVDWGNNHTGKAQEI 410

Query: 1772 GKAASRFIQDELAMNYVYDYMFHXXXXXXXXXXXXPTIPPRAVELCSELMACPAKGLEKT 1951
            G+AASRFIQ+EL M+YVYDYMFH            PTIP  AVE C+E MACP +G+ + 
Sbjct: 411  GRAASRFIQEELKMDYVYDYMFHLLNQYSKLFRYQPTIPTGAVEYCAETMACPEEGMARK 470

Query: 1952 FMMDSTARGPSIATPCAMPPAYDSATLHSILERKENLIKQVETREKQYWDSQNKQ 2116
             M +S    P   +PC +PP YD ++L+ +L  KEN I QVE+  K YW++Q  Q
Sbjct: 471  LMEESLETSPKETSPCTLPPPYDPSSLYDVLREKENSILQVESWVKAYWENQTNQ 525