BLASTX nr result

ID: Ephedra27_contig00007303 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00007303
         (2245 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY23193.1| Glycosyltransferase isoform 1 [Theobroma cacao]        506   e-140
gb|EMJ21936.1| hypothetical protein PRUPE_ppa023179mg [Prunus pe...   504   e-140
ref|XP_006287442.1| hypothetical protein CARUB_v10000648mg [Caps...   504   e-140
ref|XP_006842991.1| hypothetical protein AMTR_s00076p00109920 [A...   503   e-139
ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, p...   503   e-139
ref|XP_002321919.2| hypothetical protein POPTR_0015s13090g [Popu...   501   e-139
ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-l...   499   e-138
ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-l...   498   e-138
gb|AED99886.1| glycosyltransferase [Panax notoginseng]                497   e-138
ref|XP_006290867.1| hypothetical protein CARUB_v10016976mg [Caps...   497   e-137
ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolo...   496   e-137
gb|EXB29382.1| hypothetical protein L484_001025 [Morus notabilis]     496   e-137
ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arab...   496   e-137
gb|EOY23194.1| Glycosyltransferase isoform 2 [Theobroma cacao]        495   e-137
ref|XP_006394641.1| hypothetical protein EUTSA_v10003948mg [Eutr...   494   e-137
ref|XP_004234394.1| PREDICTED: O-glucosyltransferase rumi homolo...   493   e-136
ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, p...   493   e-136
ref|XP_006404195.1| hypothetical protein EUTSA_v10010269mg [Eutr...   492   e-136
ref|XP_006491072.1| PREDICTED: O-glucosyltransferase rumi homolo...   491   e-136
ref|XP_006445081.1| hypothetical protein CICLE_v10019760mg [Citr...   491   e-136

>gb|EOY23193.1| Glycosyltransferase isoform 1 [Theobroma cacao]
          Length = 522

 Score =  506 bits (1303), Expect = e-140
 Identities = 232/437 (53%), Positives = 311/437 (71%), Gaps = 6/437 (1%)
 Frame = +2

Query: 581  RVVNFNCTSREPCKARKNLSKKTLEPNPNK-----CPDFFMFIHEDLRPWKETGITLEMV 745
            R +  NCT+R   +A        +E  P+      CPD+F +IHEDLRPW  TGI+++M+
Sbjct: 88   RDIPLNCTARNLTRACPTNDPTAIEEEPDSSLNAMCPDYFRWIHEDLRPWAYTGISMDML 147

Query: 746  EMANRTANFRLTIVDGRMYVLINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVD 925
            + A +TANFRL +V+GR YV   ++SFQTRDVFT+WG +QL+  YPG +PD+DLMFDCVD
Sbjct: 148  KRAEKTANFRLVVVNGRAYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVD 207

Query: 926  WPVIDKKYYEXXXXXXXXX-FRYCSDNKHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKG 1102
            WPVI    Y           FRYC D++ LDI  PDWSFWGW E+N KPW  L+ D+ +G
Sbjct: 208  WPVIKTSDYGGPNATTPPPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEG 267

Query: 1103 NKRINWEKRVPAAYWKGNPSVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSN 1282
            NKR+ WE R P AYWKGNP+VA  R++L++CN S   DW AR+Y QDW RES+QGYKQS+
Sbjct: 268  NKRMGWEGREPHAYWKGNPNVATTRQDLLKCNVSDKQDWGARVYAQDWARESQQGYKQSD 327

Query: 1283 LANQCDHRYKIYVEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSRALMPGRHYWPIRLDK 1462
            LANQC HR+KIY+EGSAWSVS K I+ACDS TL+V P+YYDFF+R+L P RHYWPI+ D 
Sbjct: 328  LANQCIHRFKIYIEGSAWSVSEKYILACDSLTLLVKPRYYDFFTRSLEPMRHYWPIKDDD 387

Query: 1463 KCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSV 1642
            KC SIK AVDWGN H ++A+A+G+A S F+KE LK+  VYDYMFH+L++Y+KL++YKP+V
Sbjct: 388  KCRSIKHAVDWGNGHQQEAQAIGKAASEFIKEGLKMDYVYDYMFHLLNEYAKLLRYKPTV 447

Query: 1643 PEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLNQWNKMREK 1822
            P  A E+CS++  C  E   ++K+FM +S  +GP+  +PC + P  D   L      +E 
Sbjct: 448  PRKAVELCSETMACPAE--GLQKKFMMESMVKGPSVTSPCTMPPPYDPASLYALLSKKEN 505

Query: 1823 ALRKVAKMEEESWNKEK 1873
            ++++V + E++ W  +K
Sbjct: 506  SIKQVEEWEKKFWEMQK 522


>gb|EMJ21936.1| hypothetical protein PRUPE_ppa023179mg [Prunus persica]
          Length = 502

 Score =  504 bits (1298), Expect = e-140
 Identities = 226/406 (55%), Positives = 300/406 (73%), Gaps = 1/406 (0%)
 Frame = +2

Query: 656  PNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTR 835
            P+P  CP++F +IHEDLRPW  TGIT EMVE ANRTANF+  IV+G+ YV   +K+FQTR
Sbjct: 95   PSPPTCPEYFRWIHEDLRPWARTGITREMVERANRTANFKFVIVNGKAYVEQYEKAFQTR 154

Query: 836  DVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIDKKYYEXXXXXXXXX-FRYCSDNKHL 1012
            DVFT+WGF+QL+  YPG +PD++LMFDCVDWPVI    Y           FRYC+D+  L
Sbjct: 155  DVFTVWGFLQLLRRYPGQVPDLELMFDCVDWPVIPSHEYSGPNATAPPPLFRYCADDNTL 214

Query: 1013 DIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQ 1192
            DI  PDWSFWGWAE+N +PWE L +++ +GNKR  W +R P AYWKGNP +A+ R++L++
Sbjct: 215  DIVFPDWSFWGWAEINIRPWEVLFEELKEGNKRKTWLEREPYAYWKGNPDIAETRQDLIK 274

Query: 1193 CNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDS 1372
            CN S  HDWNARLY QDW RES++GY +S+LA+QC HRYKIY+EGSAWSVS K I+ACDS
Sbjct: 275  CNVSEEHDWNARLYAQDWDRESKEGYNKSDLASQCIHRYKIYIEGSAWSVSEKYILACDS 334

Query: 1373 PTLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFM 1552
             TLIV P+YYDFF+R LMP  HYWPI+ D KC SIKF+VDWGN H ++A+A+G+A S+ +
Sbjct: 335  VTLIVKPRYYDFFTRRLMPVEHYWPIKDDDKCRSIKFSVDWGNTHRRKAQAIGKASSNLI 394

Query: 1553 KEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSA 1732
            +E+LK+  VYDYMFH+L++Y+KL+++KP+VP+ A E+CS++  C  E +  EK+FM QS 
Sbjct: 395  QEELKMEYVYDYMFHLLNEYAKLLQFKPTVPKKAVELCSEAMACQAEGT--EKKFMLQSL 452

Query: 1733 TEGPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVAKMEEESWNKE 1870
             +GP    PC + P  D   L    + +E ++++V   E   W  +
Sbjct: 453  VKGPAVSEPCAMPPPYDPSSLFAVLRRKENSIKQVETWERNYWESQ 498


>ref|XP_006287442.1| hypothetical protein CARUB_v10000648mg [Capsella rubella]
            gi|482556148|gb|EOA20340.1| hypothetical protein
            CARUB_v10000648mg [Capsella rubella]
          Length = 544

 Score =  504 bits (1297), Expect = e-140
 Identities = 231/423 (54%), Positives = 304/423 (71%), Gaps = 1/423 (0%)
 Frame = +2

Query: 605  SREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTI 784
            +++P  A  N    T  P    CPD+F +IHEDLRPW  TGIT E +E AN+TANFRL I
Sbjct: 120  NKDPTTASFN-DDDTNHPPTATCPDYFRWIHEDLRPWARTGITREALERANKTANFRLAI 178

Query: 785  VDGRMYVLINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIDK-KYYEXX 961
            V G++YV   Q +FQTRDVFTIWGF+QL+  YPG +PD++LMFDCVDWPV+   ++    
Sbjct: 179  VGGKVYVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRAAEFAGVD 238

Query: 962  XXXXXXXFRYCSDNKHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAA 1141
                   FRYC + + LDI  PDWSFWGWAEVN KPWE L+K++ +GN++INW  R P A
Sbjct: 239  APSPPPLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNEKINWINREPYA 298

Query: 1142 YWKGNPSVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYV 1321
            YWKGNP VA+ R++LM+CN S  H+WNARLY QDWI+ES++GYKQS+LANQC HRYKIY+
Sbjct: 299  YWKGNPVVAETRQDLMKCNVSEEHEWNARLYAQDWIKESKEGYKQSDLANQCHHRYKIYI 358

Query: 1322 EGSAWSVSLKNIMACDSPTLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGN 1501
            EGSAWSVS K I+ACDS TL+V P YYDFF+R L+P  HYWP+R   KC SIKFAVDWGN
Sbjct: 359  EGSAWSVSEKYILACDSMTLLVKPHYYDFFTRGLLPAHHYWPVREKDKCRSIKFAVDWGN 418

Query: 1502 QHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEY 1681
             H ++A+ +G+A S F++++LK+  VYDYM+H+L++YSKL+++KP VP  A E+CS++  
Sbjct: 419  SHIQKAQDIGKAASEFIQQELKMDYVYDYMYHLLNEYSKLLQFKPEVPPNAVEICSETMA 478

Query: 1682 CSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVAKMEEESW 1861
            C+  RS  E++FM +S  + P    PC L P  D   L    K ++    ++  ME + W
Sbjct: 479  CT--RSGNERKFMTESLVKHPAESGPCALPPPYDPVSLYSVAKRKQSTTARILHMEMKYW 536

Query: 1862 NKE 1870
            +K+
Sbjct: 537  SKQ 539


>ref|XP_006842991.1| hypothetical protein AMTR_s00076p00109920 [Amborella trichopoda]
            gi|548845188|gb|ERN04666.1| hypothetical protein
            AMTR_s00076p00109920 [Amborella trichopoda]
          Length = 496

 Score =  503 bits (1294), Expect = e-139
 Identities = 235/431 (54%), Positives = 306/431 (70%), Gaps = 2/431 (0%)
 Frame = +2

Query: 587  VNFNCTSRE--PCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANR 760
            +N NC+S    P  + +NL     +P  + CPD+F +IHEDL+PWK TGIT EMVE A R
Sbjct: 73   INTNCSSLPWPPFPSIQNL-----DPPTSTCPDYFRWIHEDLKPWKGTGITQEMVERARR 127

Query: 761  TANFRLTIVDGRMYVLINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVID 940
            TA FRL ++DG++YV    K++Q RD FTIWG +QL   Y G +PD+DLMFDCVDWPV+ 
Sbjct: 128  TATFRLLVIDGKVYVERYAKAYQCRDDFTIWGMLQLFRRYSGRVPDLDLMFDCVDWPVV- 186

Query: 941  KKYYEXXXXXXXXXFRYCSDNKHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINW 1120
            K++           FRYC D   LDI  PDWSFWGW E+N +PWE L+KD+  GNK+I W
Sbjct: 187  KRWDYRGRVVPPPLFRYCGDKDSLDIVFPDWSFWGWPEINIEPWEALLKDLDDGNKKIKW 246

Query: 1121 EKRVPAAYWKGNPSVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCD 1300
              R P AYWKGNP VAD RK+L++CN +   DWNAR+Y+QDWI+ES+QGYK+SNLANQC 
Sbjct: 247  MNRDPTAYWKGNPYVADTRKDLLKCNVTETQDWNARVYVQDWIKESQQGYKESNLANQCT 306

Query: 1301 HRYKIYVEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIK 1480
            HRYKIY+EGSAWSVS K I+ACDSPTL+VTP YYDF +RALMP  HYWPI+ D KC SIK
Sbjct: 307  HRYKIYIEGSAWSVSEKYILACDSPTLLVTPHYYDFVTRALMPTHHYWPIKGDDKCRSIK 366

Query: 1481 FAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKE 1660
            +AVDWGN H ++A+A+G+  SSF+ ED+K++ VYDYMFH+L +YSKL++YKP+VPE A +
Sbjct: 367  YAVDWGNSHKQKAQAIGKTASSFILEDVKMAYVYDYMFHLLSEYSKLLRYKPTVPEKAVQ 426

Query: 1661 VCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVA 1840
             CS+S  C  + +   ++FM +S  + P+   PC L P  +   L    + +  A+++V 
Sbjct: 427  YCSESMACPAKGN--YEKFMKESFVKVPSDSEPCILPPPFEPPALQLLLRRKANAIKQVE 484

Query: 1841 KMEEESWNKEK 1873
              E+ S  K K
Sbjct: 485  TWEQNSRKKTK 495


>ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
            communis] gi|223549902|gb|EEF51389.1| KDEL
            motif-containing protein 1 precursor, putative [Ricinus
            communis]
          Length = 506

 Score =  503 bits (1294), Expect = e-139
 Identities = 228/447 (51%), Positives = 311/447 (69%), Gaps = 9/447 (2%)
 Frame = +2

Query: 557  SSNPIHKPR---VVNFNCTSREPCKARKNLSKKTLEPNPNK-----CPDFFMFIHEDLRP 712
            S+ P+ KP    V+  NC +    +        T   +PN+     CP++F +IHEDLRP
Sbjct: 58   STVPLEKPDNRLVIPLNCHALNLTRTCPTDYPSTSSQDPNRSSPPTCPEYFRWIHEDLRP 117

Query: 713  WKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRDVFTIWGFIQLMELYPGIL 892
            W  TGIT E +E A  TANFRL I++G  Y+ + +KSFQTRDVFT+WG +QL+  YPG +
Sbjct: 118  WVRTGITRETMERAKATANFRLVILNGTAYLEMYEKSFQTRDVFTLWGILQLLRKYPGRV 177

Query: 893  PDVDLMFDCVDWPVIDKKYYEXXXXXXXXX-FRYCSDNKHLDIPLPDWSFWGWAEVNTKP 1069
            PD+++MFDCVDWPV+    Y           FRYC +++ LDI  PDWS+WGW E N KP
Sbjct: 178  PDLEMMFDCVDWPVVKSVDYSGSSAISPPPLFRYCGNDETLDIVFPDWSYWGWVETNIKP 237

Query: 1070 WEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCNASHGHDWNARLYIQDWI 1249
            WE +VKD+ +GN+R  W++R P AYWKGNP+VA+ R +LM+CN S  HDWNARLY QDW+
Sbjct: 238  WEKIVKDLKEGNQRSKWKEREPYAYWKGNPNVAETRLDLMKCNVSQEHDWNARLYTQDWV 297

Query: 1250 RESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSRALMP 1429
            RES+QGYKQS+LANQC+HRYKIY+EGSAWSVS K I+ACDS TLIV P YYDFF+R LMP
Sbjct: 298  RESQQGYKQSDLANQCNHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMP 357

Query: 1430 GRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQ 1609
              HYWPI+ D KC+SIKFAVDWGN H ++A+A+G+A S F++EDLK+  VYDYMFH+L++
Sbjct: 358  NHHYWPIKEDDKCKSIKFAVDWGNSHKQKAQAIGKAASDFIQEDLKMDYVYDYMFHLLNE 417

Query: 1610 YSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQ 1789
            Y++L+ +KP++P+ A ++C+++  C  +   + K+ M  S  EGP   +PC +  S D  
Sbjct: 418  YARLLTFKPTIPQNATKLCAETMACPAD--GLAKKLMMDSMVEGPADTSPCTMPSSYDPS 475

Query: 1790 RLNQWNKMREKALRKVAKMEEESWNKE 1870
             L    + +  A++++   E + W  +
Sbjct: 476  SLYNVTREKVNAIKQIELWENKHWENQ 502


>ref|XP_002321919.2| hypothetical protein POPTR_0015s13090g [Populus trichocarpa]
            gi|550322617|gb|EEF06046.2| hypothetical protein
            POPTR_0015s13090g [Populus trichocarpa]
          Length = 506

 Score =  501 bits (1290), Expect = e-139
 Identities = 229/427 (53%), Positives = 302/427 (70%), Gaps = 1/427 (0%)
 Frame = +2

Query: 593  FNCTSREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANF 772
            FN T + P     N  +    P+ + CP+ F +IHEDLRPW  TGI+ +MVE A RTANF
Sbjct: 78   FNPTRKCPLNYPTNTQEGPDRPSVSTCPEHFRWIHEDLRPWAHTGISRDMVERAKRTANF 137

Query: 773  RLTIVDGRMYVLINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIDKKYY 952
            RL IV+G+ Y+   +KSFQTRD FT+WG IQL+  YPG LPD+D+MFDCVDWPVI    Y
Sbjct: 138  RLVIVNGKAYMERYRKSFQTRDTFTVWGIIQLLRKYPGKLPDLDMMFDCVDWPVIRSSDY 197

Query: 953  EXXXXXXXXX-FRYCSDNKHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKR 1129
                       FRYC D+  LD+  PDWSFWGW E+N KPWE L  D+ +GNK   W +R
Sbjct: 198  SGPNATSPPALFRYCGDDDSLDVVFPDWSFWGWPEINIKPWESLSNDLKEGNKITKWMER 257

Query: 1130 VPAAYWKGNPSVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRY 1309
             P AYWKGNPSVA  R++LM+C+AS   DWNAR+Y QDWI+ES+QGY+QSNLANQC H+Y
Sbjct: 258  EPYAYWKGNPSVAATRQDLMKCHASETQDWNARVYAQDWIKESQQGYQQSNLANQCVHKY 317

Query: 1310 KIYVEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAV 1489
            KIY+EGSAWSVS K I+ACDS TL+V P YYDFF+R+L+P RHYWPI+ D KC SIKFAV
Sbjct: 318  KIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRSLVPNRHYWPIKEDDKCRSIKFAV 377

Query: 1490 DWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCS 1669
            +WGN H+++A+AMG+A S F++EDLK+  VYDYMFH+L++Y+KL+ +KP++P  A E+C+
Sbjct: 378  EWGNNHSEEAQAMGKAASEFIQEDLKMDYVYDYMFHLLNEYAKLLTFKPTIPGRAIELCA 437

Query: 1670 DSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVAKME 1849
            ++  C    + +EK+FM  S    P   +PC + P  D   L+   +    ++++V   E
Sbjct: 438  EAMACPA--NGLEKKFMMDSMVMSPADTSPCTMPPPYDPLSLHSVFQRNGNSIKQVESWE 495

Query: 1850 EESWNKE 1870
            +E W+ +
Sbjct: 496  KEYWDNQ 502


>ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 538

 Score =  499 bits (1285), Expect = e-138
 Identities = 238/451 (52%), Positives = 308/451 (68%), Gaps = 15/451 (3%)
 Frame = +2

Query: 563  NPIHKPR---------VVNFNCTSREPCKARKNLSKKTLEP-NP----NKCPDFFMFIHE 700
            NP H+PR           +FN  +   C A    +  T E  NP    + CPD+F +IHE
Sbjct: 86   NPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQNPPSSSSACPDYFRWIHE 145

Query: 701  DLRPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRDVFTIWGFIQLMELY 880
            DLRPW  TGIT   +E   RTANFRL I++G+ YV   +KSFQTRD FT+WG +QL+  Y
Sbjct: 146  DLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGILQLLRRY 205

Query: 881  PGILPDVDLMFDCVDWPVIDKKYYEXXXXXXXXX-FRYCSDNKHLDIPLPDWSFWGWAEV 1057
            PG +PD+DLMFDCVDWPVI   ++           FRYC D+   DI  PDWSFWGW E+
Sbjct: 206  PGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWSFWGWPEI 265

Query: 1058 NTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCNASHGHDWNARLYI 1237
            N KPWE L+KDI +GNKRI W+ R P AYWKGNP VAD RK+L++CN S   DWNAR++ 
Sbjct: 266  NIKPWEPLLKDIKEGNKRIPWKSREPYAYWKGNPEVADTRKDLIKCNVSDQQDWNARVFA 325

Query: 1238 QDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSR 1417
            QDW +ES++GYKQS+L+NQC HRYKIY+EGSAWSVS K I+ACDS TLIV P YYDFF+R
Sbjct: 326  QDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTR 385

Query: 1418 ALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFH 1597
             LMP  HYWP++ D KC+SIKFAVDWGN H ++A+A+G+A SSF++E+LK+  VYDYMFH
Sbjct: 386  GLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFH 445

Query: 1598 MLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPS 1777
            +L +YSKL+ +KP++P  A E+CS++  C  E   + K+FM +S  + P    PC + P 
Sbjct: 446  LLSEYSKLLTFKPTLPPNAIELCSEAMACPAE--GLTKKFMTESLVKRPAESNPCTMPPP 503

Query: 1778 PDGQRLNQWNKMREKALRKVAKMEEESWNKE 1870
             D   L+     +E ++++V K E   WN +
Sbjct: 504  YDPASLHFVLSRKENSIKQVEKWETSFWNTQ 534


>ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 538

 Score =  498 bits (1282), Expect = e-138
 Identities = 238/451 (52%), Positives = 307/451 (68%), Gaps = 15/451 (3%)
 Frame = +2

Query: 563  NPIHKPR---------VVNFNCTSREPCKARKNLSKKTLEP-NP----NKCPDFFMFIHE 700
            NP H+PR           +FN  +   C A    +  T E  NP    + CPD+F +IHE
Sbjct: 86   NPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQNPPSSSSACPDYFRWIHE 145

Query: 701  DLRPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRDVFTIWGFIQLMELY 880
            DLRPW  TGIT   +E   RTANFRL I++G+ YV   +KSFQTRD FT+WG +QL+  Y
Sbjct: 146  DLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGILQLLRRY 205

Query: 881  PGILPDVDLMFDCVDWPVIDKKYYEXXXXXXXXX-FRYCSDNKHLDIPLPDWSFWGWAEV 1057
            PG +PD+DLMFDCVDWPVI   ++           FRYC D+   DI  PDWSFWGW E+
Sbjct: 206  PGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWSFWGWPEI 265

Query: 1058 NTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCNASHGHDWNARLYI 1237
            N KPWE L+KDI +GNKRI W+ R P AYWKGNP VAD RK+L++CN S   DWNAR++ 
Sbjct: 266  NIKPWEPLLKDIKEGNKRIPWKSRQPYAYWKGNPEVADTRKDLIKCNVSDQQDWNARVFA 325

Query: 1238 QDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSR 1417
            QDW +ES++GYKQSNL+NQC HRYKIY+EGSAWSVS K I+ACDS TLIV P YYDFF+R
Sbjct: 326  QDWTKESQEGYKQSNLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTR 385

Query: 1418 ALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFH 1597
             LMP  HYWP++ D KC+SIKFAVDWGN H ++A+A+G+A SSF++E+LK+  VYDYMFH
Sbjct: 386  GLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFH 445

Query: 1598 MLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPS 1777
            +L +YSKL+ +KP++P  A E+CS++  C  E   + K+FM +S  + P    PC +   
Sbjct: 446  LLSEYSKLLTFKPTLPPNAIELCSEAMACPAE--GLTKKFMTESLVKRPAESNPCTMPSP 503

Query: 1778 PDGQRLNQWNKMREKALRKVAKMEEESWNKE 1870
             D   L+     +E ++++V K E   WN +
Sbjct: 504  YDPASLHFVLSRKENSIKQVEKWETSFWNTQ 534


>gb|AED99886.1| glycosyltransferase [Panax notoginseng]
          Length = 546

 Score =  497 bits (1280), Expect = e-138
 Identities = 225/402 (55%), Positives = 293/402 (72%), Gaps = 1/402 (0%)
 Frame = +2

Query: 662  PNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRDV 841
            P  CP++F +I+EDLRPW+ETGIT EMVE A RTANFRL I++GR YV  +QKSFQ+RDV
Sbjct: 143  PVSCPEYFRWIYEDLRPWRETGITREMVERARRTANFRLVILNGRAYVETHQKSFQSRDV 202

Query: 842  FTIWGFIQLMELYPGILPDVDLMFDCVDWPVIDKKYYEXXXXXXXXX-FRYCSDNKHLDI 1018
            FT+WG +QL+ +YPG +PD+DLMFDCVDWPVI  ++Y           FRYC+D+  LDI
Sbjct: 203  FTLWGILQLLRMYPGKVPDLDLMFDCVDWPVIISRFYHGPNATAPPPLFRYCADDSTLDI 262

Query: 1019 PLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCN 1198
              PDW+FWGW E+N KPW  L+KD+ +GN    W  R P AYWKGNP VA  R +L++CN
Sbjct: 263  VFPDWTFWGWPEINIKPWGSLLKDLKEGNTGTQWMDREPYAYWKGNPIVAKTRMDLLKCN 322

Query: 1199 ASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPT 1378
             S   DWNAR+Y  DW RES+ GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS T
Sbjct: 323  VSDKQDWNARVYAXDWARESQLGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILACDSVT 382

Query: 1379 LIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKE 1558
            L V P+YYDFF+R LMP  HYWPIR D KC SIKFAVDWGN H ++A ++G+  S+F++E
Sbjct: 383  LXVKPRYYDFFTRGLMPVHHYWPIRDDDKCRSIKFAVDWGNNHKQKAHSIGKEASNFIQE 442

Query: 1559 DLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATE 1738
            DLK+  VYDYMFH+L++Y+KL++YKP+VP  A E+CS++  C  E     K+FM +S  +
Sbjct: 443  DLKMDYVYDYMFHLLNEYAKLLRYKPTVPPKAVELCSETMACPAE--GFTKKFMMESIVK 500

Query: 1739 GPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVAKMEEESWN 1864
            GP   +PC + P  D   L+   + +E ++++V   E+  W+
Sbjct: 501  GPTDKSPCVMQPPYDPPTLHSVLRRKENSIKQVENWEKLYWD 542


>ref|XP_006290867.1| hypothetical protein CARUB_v10016976mg [Capsella rubella]
            gi|482559574|gb|EOA23765.1| hypothetical protein
            CARUB_v10016976mg [Capsella rubella]
          Length = 539

 Score =  497 bits (1279), Expect = e-137
 Identities = 221/404 (54%), Positives = 295/404 (73%), Gaps = 1/404 (0%)
 Frame = +2

Query: 662  PNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRDV 841
            P  CPD+F +IHEDLRPW++TGIT E +E AN TA FRL I+DGR+YV   +++FQTRDV
Sbjct: 133  PATCPDYFRWIHEDLRPWEKTGITREALERANATAIFRLAIIDGRIYVENFREAFQTRDV 192

Query: 842  FTIWGFIQLMELYPGILPDVDLMFDCVDWPVIDKKYYEXXXXXXXXX-FRYCSDNKHLDI 1018
            FTIWGF+QL+  YPG +PD++LMFDCVDWPV+  + Y           FRYC++++ LDI
Sbjct: 193  FTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAEEYSGVDKPSPPPLFRYCANDETLDI 252

Query: 1019 PLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCN 1198
              PDWS+WGWAEVN KPWE L+KD+S+GN+R  W  R P AYWKGNP+VA+ R +LM+CN
Sbjct: 253  VFPDWSYWGWAEVNIKPWESLLKDLSEGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCN 312

Query: 1199 ASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPT 1378
             S  +DW ARLY QDW++ES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS T
Sbjct: 313  LSEEYDWKARLYKQDWLKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVT 372

Query: 1379 LIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKE 1558
            L+V P YYDFF+R + PG HYWP++ D KC SIKFAVDWGN H ++A+ +G+  S F+++
Sbjct: 373  LMVKPHYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASEFVQQ 432

Query: 1559 DLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATE 1738
            +LK+  VYDYMFH+L QYSKL+++KP +P+ + EVCS++  C   R   E++FM +S  +
Sbjct: 433  ELKMDYVYDYMFHLLTQYSKLLRFKPEIPQNSTEVCSETMAC--PRDGNERKFMMESLVK 490

Query: 1739 GPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVAKMEEESWNKE 1870
             P    PC + P  D        K R+    ++ + E + W K+
Sbjct: 491  RPAETGPCAMPPPYDPASFYSVLKRRQSTTSRIEQWESKYWRKQ 534


>ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
            gi|302143884|emb|CBI22745.3| unnamed protein product
            [Vitis vinifera]
          Length = 525

 Score =  496 bits (1278), Expect = e-137
 Identities = 224/406 (55%), Positives = 297/406 (73%), Gaps = 1/406 (0%)
 Frame = +2

Query: 656  PNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTR 835
            P+P +CP +F +I+ DLRPW ++GIT EMVE A RTA F+L I++GR YV   Q++FQTR
Sbjct: 120  PSPPECPHYFRWIYGDLRPWMKSGITREMVERAKRTATFKLVILNGRAYVEKYQRAFQTR 179

Query: 836  DVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIDKKYYEXXXXXXXXX-FRYCSDNKHL 1012
            DVFT+WG +QL+  YPG +PD++LMFDCVDWPVI    Y           FRYC D+  L
Sbjct: 180  DVFTLWGILQLLRRYPGKVPDLELMFDCVDWPVIQSNEYRGPNATAPPPLFRYCGDDATL 239

Query: 1013 DIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQ 1192
            DI  PDWSFWGW E+N KPWE L+KD+ +GNKR  W +R P AYWKGNP+VA  R +L++
Sbjct: 240  DIVFPDWSFWGWPEINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDLLK 299

Query: 1193 CNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDS 1372
            CN S   DWNAR+Y QDWI ES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS
Sbjct: 300  CNVSDKQDWNARVYTQDWILESQEGYKQSDLASQCIHRYKIYIEGSAWSVSQKYILACDS 359

Query: 1373 PTLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFM 1552
             TL+V P YYDFF+R+LMP  HYWPIR D KC SIKFAVDWGN+H ++A+++G+A S F+
Sbjct: 360  VTLLVKPHYYDFFTRSLMPVHHYWPIREDDKCRSIKFAVDWGNRHKQKAQSIGKAASDFI 419

Query: 1553 KEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSA 1732
            +EDLK+ NVYDYMFH+L++Y+KL+K+KP+VPE A E+CS+   C  E   ++K+FM +S 
Sbjct: 420  QEDLKMDNVYDYMFHLLNEYAKLLKFKPTVPEKAVELCSERMGCGAE--GLKKKFMMESM 477

Query: 1733 TEGPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVAKMEEESWNKE 1870
             + P   +PC + P      L  +   +  ++++V   E++ W  +
Sbjct: 478  VKYPMDASPCTMPPPFSPLELQTFLNRKVNSIKQVEAWEKKFWENQ 523


>gb|EXB29382.1| hypothetical protein L484_001025 [Morus notabilis]
          Length = 515

 Score =  496 bits (1277), Expect = e-137
 Identities = 225/401 (56%), Positives = 293/401 (73%), Gaps = 1/401 (0%)
 Frame = +2

Query: 671  CPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRDVFTI 850
            CPD+F +I+EDLRPW  TGI+ +MVE A RTANFRL IV+G+ YV   QK+FQTRDVFT+
Sbjct: 113  CPDYFRWIYEDLRPWAYTGISRDMVERAKRTANFRLVIVNGKAYVETFQKAFQTRDVFTL 172

Query: 851  WGFIQLMELYPGILPDVDLMFDCVDWPVI-DKKYYEXXXXXXXXXFRYCSDNKHLDIPLP 1027
            WG +QL+  YPG +PD++LMFDCVDWPV+  K Y           FRYC D+  LDI  P
Sbjct: 173  WGILQLLRKYPGRVPDLELMFDCVDWPVVLSKAYSGPDATTPPPLFRYCGDDSTLDIVFP 232

Query: 1028 DWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCNASH 1207
            DWSFWGW E N KPWE L+K++ +GNK+  W +R   AYWKGNP VA  R++L++CN S 
Sbjct: 233  DWSFWGWPETNIKPWEALLKELEEGNKKSKWVEREAYAYWKGNPVVAATRQDLLKCNVSD 292

Query: 1208 GHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPTLIV 1387
              DWNARLY QDW++ES++GYKQS+LANQC HRYKIY+EGSAWSVS K I+ACDS TLIV
Sbjct: 293  KQDWNARLYAQDWLKESKEGYKQSDLANQCIHRYKIYIEGSAWSVSEKYILACDSVTLIV 352

Query: 1388 TPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLK 1567
             P YYDFF+R L+P +HYWPI+ D KC SIKFAVDWGN H K+AK++G+A S F+++DLK
Sbjct: 353  KPHYYDFFTRGLVPMQHYWPIKDDDKCRSIKFAVDWGNSHKKKAKSIGKAASRFIQDDLK 412

Query: 1568 ISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPN 1747
            +  VYDYMFH+L++Y+KL+K+KPS+PE A E CS+S  C+ E   + K+FM +S  +GP 
Sbjct: 413  MEYVYDYMFHLLNEYAKLLKFKPSIPEKAVEFCSESMACTAE--GIGKKFMMESMVKGPA 470

Query: 1748 SIAPCQLDPSPDGQRLNQWNKMREKALRKVAKMEEESWNKE 1870
              +PC + PS +   L    + +   + +V   + + W  +
Sbjct: 471  DSSPCTMPPSYNPSSLYSLIQKKTSLIEQVEMWQNKYWENQ 511


>ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
            lyrata] gi|297317912|gb|EFH48334.1| hypothetical protein
            ARALYDRAFT_910396 [Arabidopsis lyrata subsp. lyrata]
          Length = 543

 Score =  496 bits (1276), Expect = e-137
 Identities = 224/424 (52%), Positives = 299/424 (70%), Gaps = 1/424 (0%)
 Frame = +2

Query: 602  TSREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLT 781
            +++ P  A       T  P    CPD+F +IHEDLRPW  TGIT E +E A +TANFRL 
Sbjct: 117  SNKYPTTASFGEDDDTNHPPNATCPDYFRWIHEDLRPWSSTGITREALERAKKTANFRLA 176

Query: 782  IVDGRMYVLINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVID-KKYYEX 958
            I+DG++YV   Q +FQTRDVFTIWGF+QL+  YPG +PD++LMFDCVDWPV+   ++   
Sbjct: 177  IIDGKIYVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVKASEFTGA 236

Query: 959  XXXXXXXXFRYCSDNKHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPA 1138
                    FRYC + + LDI  PDWSFWGWAEVN KPWE L+K++ +GN+R  W  R P 
Sbjct: 237  NAPSPPPLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNQRTKWINREPY 296

Query: 1139 AYWKGNPSVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIY 1318
            AYWKGNP VA+ R++LM+CN S  H+WNARLY+QDWI+ES +GYKQS+LA+QC HRYKIY
Sbjct: 297  AYWKGNPMVAETRQDLMKCNVSEEHEWNARLYVQDWIKESNEGYKQSDLASQCHHRYKIY 356

Query: 1319 VEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWG 1498
            +EGSAWSVS K I+ACDS TL+V P YYDFF+R L+P  HYWP+R   KC SIKFAVDWG
Sbjct: 357  IEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWG 416

Query: 1499 NQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSE 1678
            N H ++A+ +G+A S F++ +LK+  VYDYM+H+L +YSKL+++KP +P+ A E+CS++ 
Sbjct: 417  NSHIQKAQDIGKAASDFIQHELKMDYVYDYMYHLLTEYSKLLRFKPEIPQNAAEICSETM 476

Query: 1679 YCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVAKMEEES 1858
             C   RS  E++FM +S  + P    PC + P  D   L    K ++    ++ + E + 
Sbjct: 477  AC--PRSGNERKFMTESFVKHPAESGPCAMPPPYDPALLYGVVKRKQSTNMRILQWEMKY 534

Query: 1859 WNKE 1870
            W+K+
Sbjct: 535  WSKQ 538


>gb|EOY23194.1| Glycosyltransferase isoform 2 [Theobroma cacao]
          Length = 498

 Score =  495 bits (1274), Expect = e-137
 Identities = 226/407 (55%), Positives = 296/407 (72%), Gaps = 6/407 (1%)
 Frame = +2

Query: 581  RVVNFNCTSREPCKARKNLSKKTLEPNPNK-----CPDFFMFIHEDLRPWKETGITLEMV 745
            R +  NCT+R   +A        +E  P+      CPD+F +IHEDLRPW  TGI+++M+
Sbjct: 88   RDIPLNCTARNLTRACPTNDPTAIEEEPDSSLNAMCPDYFRWIHEDLRPWAYTGISMDML 147

Query: 746  EMANRTANFRLTIVDGRMYVLINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVD 925
            + A +TANFRL +V+GR YV   ++SFQTRDVFT+WG +QL+  YPG +PD+DLMFDCVD
Sbjct: 148  KRAEKTANFRLVVVNGRAYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVD 207

Query: 926  WPVIDKKYYEXXXXXXXXX-FRYCSDNKHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKG 1102
            WPVI    Y           FRYC D++ LDI  PDWSFWGW E+N KPW  L+ D+ +G
Sbjct: 208  WPVIKTSDYGGPNATTPPPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEG 267

Query: 1103 NKRINWEKRVPAAYWKGNPSVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSN 1282
            NKR+ WE R P AYWKGNP+VA  R++L++CN S   DW AR+Y QDW RES+QGYKQS+
Sbjct: 268  NKRMGWEGREPHAYWKGNPNVATTRQDLLKCNVSDKQDWGARVYAQDWARESQQGYKQSD 327

Query: 1283 LANQCDHRYKIYVEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSRALMPGRHYWPIRLDK 1462
            LANQC HR+KIY+EGSAWSVS K I+ACDS TL+V P+YYDFF+R+L P RHYWPI+ D 
Sbjct: 328  LANQCIHRFKIYIEGSAWSVSEKYILACDSLTLLVKPRYYDFFTRSLEPMRHYWPIKDDD 387

Query: 1463 KCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSV 1642
            KC SIK AVDWGN H ++A+A+G+A S F+KE LK+  VYDYMFH+L++Y+KL++YKP+V
Sbjct: 388  KCRSIKHAVDWGNGHQQEAQAIGKAASEFIKEGLKMDYVYDYMFHLLNEYAKLLRYKPTV 447

Query: 1643 PEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPD 1783
            P  A E+CS++  C  E   ++K+FM +S  +GP+  +PC + P  D
Sbjct: 448  PRKAVELCSETMACPAE--GLQKKFMMESMVKGPSVTSPCTMPPPYD 492


>ref|XP_006394641.1| hypothetical protein EUTSA_v10003948mg [Eutrema salsugineum]
            gi|557091280|gb|ESQ31927.1| hypothetical protein
            EUTSA_v10003948mg [Eutrema salsugineum]
          Length = 545

 Score =  494 bits (1271), Expect = e-137
 Identities = 220/401 (54%), Positives = 294/401 (73%), Gaps = 1/401 (0%)
 Frame = +2

Query: 671  CPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRDVFTI 850
            CPD+F +IHEDLRPW++TGIT E +E A +TANFRL IV G++YV   Q +FQTRDVFTI
Sbjct: 142  CPDYFRWIHEDLRPWEKTGITREALERAKKTANFRLAIVGGKLYVEKFQDAFQTRDVFTI 201

Query: 851  WGFIQLMELYPGILPDVDLMFDCVDWPVIDKKYYEXXXXXXXXX-FRYCSDNKHLDIPLP 1027
            WGF+QL+  YPG +PD++LMFDCVDWPV+    +           FRYC + + LDI  P
Sbjct: 202  WGFLQLLRRYPGKIPDLELMFDCVDWPVVKAANFAGANSPSPPPLFRYCGNEETLDIVFP 261

Query: 1028 DWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCNASH 1207
            DWSFWGW+EVN KPWE L+K++ +GN++ NW  R P AYWKGNP VA+ R++LM+CN S 
Sbjct: 262  DWSFWGWSEVNIKPWESLLKELREGNEKTNWINREPYAYWKGNPLVAETRQDLMKCNVSE 321

Query: 1208 GHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPTLIV 1387
             H+WNARLY QDWIRES++GYKQS+LA+QC HR+KIY+EGSAWSVS K I+ACDS TL+V
Sbjct: 322  EHEWNARLYAQDWIRESKEGYKQSDLASQCHHRFKIYIEGSAWSVSEKYILACDSVTLLV 381

Query: 1388 TPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLK 1567
             P YYDFF+R L+P  HYWP+R   KC SIKFAV WGN H ++A+ +G+A S F++++LK
Sbjct: 382  KPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVHWGNSHIQKAQDIGKAASEFIQQELK 441

Query: 1568 ISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPN 1747
            +  VYDYMFH+L +YSKL+++KP +P+ AKE+CS++  C   RS  E++FM +S  + P 
Sbjct: 442  MDYVYDYMFHLLTEYSKLLQFKPEIPQNAKEICSETMAC--PRSGNERKFMTESLVKHPA 499

Query: 1748 SIAPCQLDPSPDGQRLNQWNKMREKALRKVAKMEEESWNKE 1870
               PC + P  D        K ++ A  ++ + E + W+K+
Sbjct: 500  QTGPCAMPPPYDPASFYAVVKRKQSAATRILQWEMKYWSKQ 540


>ref|XP_004234394.1| PREDICTED: O-glucosyltransferase rumi homolog [Solanum lycopersicum]
          Length = 514

 Score =  493 bits (1270), Expect = e-136
 Identities = 231/451 (51%), Positives = 307/451 (68%), Gaps = 12/451 (2%)
 Frame = +2

Query: 557  SSNPIHKPRVVNFNCT----------SREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDL 706
            S  P+ K  +   NCT          S  P K  +     T    P  CPD+F +I++DL
Sbjct: 62   SKQPLKKLEI-QLNCTLGNLTRTCPASYYPLKFTEQNESSTSSSPPPTCPDYFRWIYDDL 120

Query: 707  RPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRDVFTIWGFIQLMELYPG 886
              W+ETGIT EMV  A RTA+FRL IV+GR YV    K+FQ+RD FT+WG +Q++  YPG
Sbjct: 121  WHWRETGITKEMVMRAKRTADFRLVIVNGRAYVETYHKAFQSRDTFTLWGILQMLRRYPG 180

Query: 887  ILPDVDLMFDCVDWPVIDKKYYEXXXXXXXXX-FRYCSDNKHLDIPLPDWSFWGWAEVNT 1063
             +PD+DLMFDCVDWPV+  ++Y           FRYC ++  LDI  PDWSFWGW E+N 
Sbjct: 181  KVPDLDLMFDCVDWPVLKTEFYRHPKAPVPPPLFRYCGNDSSLDIVFPDWSFWGWPEINI 240

Query: 1064 KPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCNASHGHDWNARLYIQD 1243
            KPWE L KD+ KGN+++ W +R P AYWKGNP VA+ R++L++CNAS   DWNAR+Y QD
Sbjct: 241  KPWETLSKDLKKGNEKMKWTEREPYAYWKGNPVVAETRRDLLKCNASEKQDWNARVYAQD 300

Query: 1244 WIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSRAL 1423
            W +  +QGYKQS+LANQC HRYKIYVEGSAWSVS K I+ACDS TL++ P+YYDF++R L
Sbjct: 301  WAQAEKQGYKQSDLANQCIHRYKIYVEGSAWSVSEKYILACDSVTLLIKPQYYDFYTRGL 360

Query: 1424 MPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHML 1603
            MP +HYWP++   KC SIK AVDWGN H ++A+A+G+A S F++E LK+  VYDYMFH+L
Sbjct: 361  MPLQHYWPVKDKDKCRSIKHAVDWGNTHEQEAQAIGKAASDFIQEQLKMDYVYDYMFHLL 420

Query: 1604 DQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPD 1783
             +Y+KL+KYKP+VP  A E+CS++  CS E   + K+FM +S  EGP+   PC + P   
Sbjct: 421  SEYAKLLKYKPTVPRKAVELCSEAMACSAE--GLTKKFMLESMVEGPSDATPCNMPPPYG 478

Query: 1784 GQRLNQWNKMREKALRKVAKMEEESW-NKEK 1873
               L+     +E ++++V   E++ W NK K
Sbjct: 479  PAGLHSILDRKENSIKQVDSWEQQYWKNKSK 509


>ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
            communis] gi|223549903|gb|EEF51390.1| KDEL
            motif-containing protein 1 precursor, putative [Ricinus
            communis]
          Length = 528

 Score =  493 bits (1269), Expect = e-136
 Identities = 223/428 (52%), Positives = 304/428 (71%), Gaps = 1/428 (0%)
 Frame = +2

Query: 593  FNCTSREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANF 772
            FN T   P       ++    P+ + CP+++ +I+EDLRPW  TGI+ +MVE A  TANF
Sbjct: 100  FNLTRTCPSNYPTTFTENPDRPSVSACPEYYRWIYEDLRPWARTGISRDMVERAKTTANF 159

Query: 773  RLTIVDGRMYVLINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIDKKYY 952
            RL IV+G+ YV   +++FQTRDVFT+WG +QL+  YPG +PD++LMFDCVDWPVI    Y
Sbjct: 160  RLVIVNGKAYVEKYRRAFQTRDVFTLWGILQLLRRYPGKVPDLELMFDCVDWPVIKSSNY 219

Query: 953  EXXXXXXXXX-FRYCSDNKHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKR 1129
                       FRYC D+  LD+  PDWSFWGW+E+N KPWE L++++ +GN++  W +R
Sbjct: 220  SGPNAMAPPPLFRYCGDDDTLDVVFPDWSFWGWSEINIKPWERLLRELKEGNEKRRWMER 279

Query: 1130 VPAAYWKGNPSVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRY 1309
             P AYWKGNP+VA+ R++LM+CN S   DWNAR+Y QDWI+E +QGYKQSNLA+QC HRY
Sbjct: 280  EPYAYWKGNPAVAETRQDLMKCNVSEQQDWNARVYAQDWIKELQQGYKQSNLASQCMHRY 339

Query: 1310 KIYVEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAV 1489
            KIY+EGSAWSVS K I+ACDS TL+V P YYDFF+R+L P  HYWPI+   KC SIKFAV
Sbjct: 340  KIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRSLRPIHHYWPIKDYDKCRSIKFAV 399

Query: 1490 DWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCS 1669
            DWGN H ++A+A+G+A S F++E+LK+  VYDYMFH+L++Y+KL+ +KP +P  A E+CS
Sbjct: 400  DWGNNHKQKAQAIGKAASEFIQEELKMDYVYDYMFHLLNEYAKLLTFKPVIPRKAVELCS 459

Query: 1670 DSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVAKME 1849
            +S  C    + +EK FM +S  +GP    PC + P  D   L+   + +E ++R+V   E
Sbjct: 460  ESMACPA--NGIEKEFMMESMVQGPAETNPCIMLPPYDPSALHSIFRRKENSIRQVELWE 517

Query: 1850 EESWNKEK 1873
            +  W+K+K
Sbjct: 518  KMYWDKQK 525


>ref|XP_006404195.1| hypothetical protein EUTSA_v10010269mg [Eutrema salsugineum]
            gi|557105314|gb|ESQ45648.1| hypothetical protein
            EUTSA_v10010269mg [Eutrema salsugineum]
          Length = 543

 Score =  492 bits (1267), Expect = e-136
 Identities = 226/447 (50%), Positives = 306/447 (68%), Gaps = 15/447 (3%)
 Frame = +2

Query: 575  KPRVVNFNCTS-------------REPCKARKNLSKKTLEPNPNK-CPDFFMFIHEDLRP 712
            KP+    NC +             R P   R    +   E +P   CPD+F +IHEDLRP
Sbjct: 94   KPKEFTLNCAAFSGNETVITCPRNRYPTSLRSGAREDDPERSPPATCPDYFRWIHEDLRP 153

Query: 713  WKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRDVFTIWGFIQLMELYPGIL 892
            W++TGIT E +E AN TANFRL I++GR+YV   +++FQTRDVFTIWGF+QL+  YPG +
Sbjct: 154  WEKTGITREALERANATANFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKI 213

Query: 893  PDVDLMFDCVDWPVIDK-KYYEXXXXXXXXXFRYCSDNKHLDIPLPDWSFWGWAEVNTKP 1069
            PD++LMFDCVDWPV+   ++           FRYC +N+ LDI  PDWS+WGWAEVN KP
Sbjct: 214  PDLELMFDCVDWPVVKAAEFAGVDQLTPPPLFRYCGNNETLDIVFPDWSYWGWAEVNIKP 273

Query: 1070 WEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCNASHGHDWNARLYIQDWI 1249
            WE L+K++ +GN+R  W  R P AYWKGNP+VA+ R++LM+CN S  +DW ARLY QDW+
Sbjct: 274  WESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRQDLMKCNVSEDYDWKARLYPQDWV 333

Query: 1250 RESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPTLIVTPKYYDFFSRALMP 1429
            RES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS TL+V P YYDFF+R + P
Sbjct: 334  RESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGMFP 393

Query: 1430 GRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQ 1609
            G HYWP++ D KC SIKFAVD+GN H  +A+ +G+  S F++++LK+  VYDYM+H+L Q
Sbjct: 394  GHHYWPVKEDDKCRSIKFAVDFGNLHMLKAQDIGKKASEFVQQELKMDYVYDYMYHLLTQ 453

Query: 1610 YSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQ 1789
            YSKL+++KP +P+ A E+CS++  C   R   E++FM +S  + P    PC + P  D  
Sbjct: 454  YSKLLRFKPKIPQNATELCSEAMAC--PRDGNERKFMMESLVKRPAETGPCAMPPPYDPA 511

Query: 1790 RLNQWNKMREKALRKVAKMEEESWNKE 1870
                  K R+    ++ + E + W K+
Sbjct: 512  SFYSVLKRRQSTTSRIEQWESKYWRKQ 538


>ref|XP_006491072.1| PREDICTED: O-glucosyltransferase rumi homolog [Citrus sinensis]
          Length = 531

 Score =  491 bits (1265), Expect = e-136
 Identities = 220/403 (54%), Positives = 296/403 (73%)
 Frame = +2

Query: 659  NPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRD 838
            N + CP +F +IHEDLR W+++GIT +M+E A +TA+FRL IV+G+ YV   ++S QTRD
Sbjct: 129  NLSTCPSYFRWIHEDLRHWRDSGITKDMIERARKTAHFRLVIVNGKAYVEKYKQSIQTRD 188

Query: 839  VFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIDKKYYEXXXXXXXXXFRYCSDNKHLDI 1018
             FT+WG +QL+ LYPG LPD++LMFDC D PV+  + +          FRYCSD   LDI
Sbjct: 189  KFTLWGILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNSGPPPLFRYCSDGSSLDI 248

Query: 1019 PLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCN 1198
              PDWSFWGWAE N +PW +++KDI +GNKR  W++RVP AYW+GNP+V+ +RKELM CN
Sbjct: 249  VFPDWSFWGWAETNIRPWSNVLKDIEEGNKRTKWKERVPYAYWRGNPNVSPIRKELMTCN 308

Query: 1199 ASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPT 1378
            AS  +DWNARLY+QDW +ES+Q +KQSNL +QC HRYKIY+EG AWSVS K I+ACDS T
Sbjct: 309  ASDKNDWNARLYVQDWGQESKQNFKQSNLGDQCSHRYKIYIEGWAWSVSEKYILACDSMT 368

Query: 1379 LIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKE 1558
            LIV P+YYDFFSR ++P +HYWPIR + KC S+KFAVDWGN HT++A+A+G A S F++E
Sbjct: 369  LIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAIGEAASRFIRE 428

Query: 1559 DLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATE 1738
            DLK+  VYDYMFH+L++Y++L+++KPS+P GA E+CS++  CS + +   ++FM +S  +
Sbjct: 429  DLKMGYVYDYMFHLLNEYARLLRFKPSIPAGALELCSETMACSAKGT--WRKFMEESMVK 486

Query: 1739 GPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVAKMEEESWNK 1867
             P+   PC L P      L  +   + K  R+V   E E W K
Sbjct: 487  SPSDSIPCSLPPPYHPSALKNFTDTKVKLTRQVEAWENEYWKK 529


>ref|XP_006445081.1| hypothetical protein CICLE_v10019760mg [Citrus clementina]
            gi|557547343|gb|ESR58321.1| hypothetical protein
            CICLE_v10019760mg [Citrus clementina]
          Length = 512

 Score =  491 bits (1265), Expect = e-136
 Identities = 220/403 (54%), Positives = 296/403 (73%)
 Frame = +2

Query: 659  NPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYVLINQKSFQTRD 838
            N + CP +F +IHEDLR W+++GIT +M+E A +TA+FRL IV+G+ YV   ++S QTRD
Sbjct: 110  NLSTCPSYFRWIHEDLRHWRDSGITKDMIERARKTAHFRLVIVNGKAYVEKYKQSIQTRD 169

Query: 839  VFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIDKKYYEXXXXXXXXXFRYCSDNKHLDI 1018
             FT+WG +QL+ LYPG LPD++LMFDC D PV+  + +          FRYCSD   LDI
Sbjct: 170  KFTLWGILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNSGPPPLFRYCSDGSSLDI 229

Query: 1019 PLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPSVADVRKELMQCN 1198
              PDWSFWGWAE N +PW +++KDI +GNKR  W++RVP AYW+GNP+V+ +RKELM CN
Sbjct: 230  VFPDWSFWGWAETNIRPWSNVLKDIEEGNKRTKWKERVPYAYWRGNPNVSPIRKELMTCN 289

Query: 1199 ASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSPT 1378
            AS  +DWNARLY+QDW +ES+Q +KQSNL +QC HRYKIY+EG AWSVS K I+ACDS T
Sbjct: 290  ASDKNDWNARLYVQDWGQESKQNFKQSNLGDQCSHRYKIYIEGWAWSVSEKYILACDSMT 349

Query: 1379 LIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKE 1558
            LIV P+YYDFFSR ++P +HYWPIR + KC S+KFAVDWGN HT++A+A+G A S F++E
Sbjct: 350  LIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAIGEAASRFIRE 409

Query: 1559 DLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATE 1738
            DLK+  VYDYMFH+L++Y++L+++KPS+P GA E+CS++  CS + +   ++FM +S  +
Sbjct: 410  DLKMGYVYDYMFHLLNEYARLLRFKPSIPAGALELCSETMACSAKGT--WRKFMEESMVK 467

Query: 1739 GPNSIAPCQLDPSPDGQRLNQWNKMREKALRKVAKMEEESWNK 1867
             P+   PC L P      L  +   + K  R+V   E E W K
Sbjct: 468  SPSDSIPCSLPPPYHPSALKNFTDTKVKLTRQVEAWENEYWKK 510


Top