BLASTX nr result

ID: Ephedra25_contig00009764 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00009764
         (2364 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, p...   520   e-144
ref|XP_006842991.1| hypothetical protein AMTR_s00076p00109920 [A...   518   e-144
ref|XP_006287442.1| hypothetical protein CARUB_v10000648mg [Caps...   518   e-144
gb|EOY23193.1| Glycosyltransferase isoform 1 [Theobroma cacao]        518   e-144
gb|EMJ21936.1| hypothetical protein PRUPE_ppa023179mg [Prunus pe...   515   e-143
ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arab...   512   e-142
ref|XP_006394641.1| hypothetical protein EUTSA_v10003948mg [Eutr...   511   e-142
ref|XP_002321919.2| hypothetical protein POPTR_0015s13090g [Popu...   511   e-142
gb|EXB29382.1| hypothetical protein L484_001025 [Morus notabilis]     511   e-142
ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolo...   509   e-141
gb|EOY23194.1| Glycosyltransferase isoform 2 [Theobroma cacao]        509   e-141
ref|XP_006290867.1| hypothetical protein CARUB_v10016976mg [Caps...   508   e-141
ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-l...   507   e-140
gb|AED99886.1| glycosyltransferase [Panax notoginseng]                506   e-140
ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-l...   506   e-140
ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana] ...   506   e-140
gb|EMJ21654.1| hypothetical protein PRUPE_ppa005169mg [Prunus pe...   505   e-140
ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, p...   504   e-140
ref|XP_002875936.1| hypothetical protein ARALYDRAFT_485256 [Arab...   504   e-140
ref|XP_006404195.1| hypothetical protein EUTSA_v10010269mg [Eutr...   503   e-139

>ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
            communis] gi|223549902|gb|EEF51389.1| KDEL
            motif-containing protein 1 precursor, putative [Ricinus
            communis]
          Length = 506

 Score =  520 bits (1339), Expect = e-144
 Identities = 235/447 (52%), Positives = 317/447 (70%), Gaps = 9/447 (2%)
 Frame = -3

Query: 1780 SSNPIHKPR---VVNFNCTSREPCKARKNLSKKTLEPNPNK-----CPDFFMFIHEDLRP 1625
            S+ P+ KP    V+  NC +    +        T   +PN+     CP++F +IHEDLRP
Sbjct: 58   STVPLEKPDNRLVIPLNCHALNLTRTCPTDYPSTSSQDPNRSSPPTCPEYFRWIHEDLRP 117

Query: 1624 WKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGIL 1445
            W  TGIT E +E A  TANFRL I++G  Y+ + +KSFQTRDVFT+WG +QL+  YPG +
Sbjct: 118  WVRTGITRETMERAKATANFRLVILNGTAYLEMYEKSFQTRDVFTLWGILQLLRKYPGRV 177

Query: 1444 PDVDLMFDCVDWPVIGK-KYDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKP 1268
            PD+++MFDCVDWPV+    Y  SS+  PPPLFRYC ++E LDI  PDWS+WGW E N KP
Sbjct: 178  PDLEMMFDCVDWPVVKSVDYSGSSAISPPPLFRYCGNDETLDIVFPDWSYWGWVETNIKP 237

Query: 1267 WEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWI 1088
            WE +VKD+ +GN+R  W++R P AYWKGNP VA+ R +LM+CN S  HDWNARLY QDW+
Sbjct: 238  WEKIVKDLKEGNQRSKWKEREPYAYWKGNPNVAETRLDLMKCNVSQEHDWNARLYTQDWV 297

Query: 1087 RESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMP 908
            RES+QGYKQS+LANQC+HRYKIY+EGSAWSVS K I+ACDS TLIV P YYDFF+R LMP
Sbjct: 298  RESQQGYKQSDLANQCNHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMP 357

Query: 907  GRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQ 728
              HYWPI+ D KC+SIKFAVDWGN H ++A+A+G+A S F++EDLK+  VYDYMFH+L++
Sbjct: 358  NHHYWPIKEDDKCKSIKFAVDWGNSHKQKAQAIGKAASDFIQEDLKMDYVYDYMFHLLNE 417

Query: 727  YSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQ 548
            Y++L+ +KP++P+ A ++C+++  C  +   + K+ M  S  EGP   +PC +  S D  
Sbjct: 418  YARLLTFKPTIPQNATKLCAETMACPAD--GLAKKLMMDSMVEGPADTSPCTMPSSYDPS 475

Query: 547  RLDQWNKMRAKALRKVAKMEEESWSKE 467
             L    + +  A++++   E + W  +
Sbjct: 476  SLYNVTREKVNAIKQIELWENKHWENQ 502


>ref|XP_006842991.1| hypothetical protein AMTR_s00076p00109920 [Amborella trichopoda]
            gi|548845188|gb|ERN04666.1| hypothetical protein
            AMTR_s00076p00109920 [Amborella trichopoda]
          Length = 496

 Score =  518 bits (1335), Expect = e-144
 Identities = 239/431 (55%), Positives = 313/431 (72%), Gaps = 2/431 (0%)
 Frame = -3

Query: 1750 VNFNCTSRE--PCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANR 1577
            +N NC+S    P  + +NL     +P  + CPD+F +IHEDL+PWK TGIT EMVE A R
Sbjct: 73   INTNCSSLPWPPFPSIQNL-----DPPTSTCPDYFRWIHEDLKPWKGTGITQEMVERARR 127

Query: 1576 TANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIG 1397
            TA FRL ++DG++Y+    K++Q RD FTIWG +QL   Y G +PD+DLMFDCVDWPV+ 
Sbjct: 128  TATFRLLVIDGKVYVERYAKAYQCRDDFTIWGMLQLFRRYSGRVPDLDLMFDCVDWPVV- 186

Query: 1396 KKYDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINW 1217
            K++D      PPPLFRYC D + LDI  PDWSFWGW E+N +PWE L+KD+  GNK+I W
Sbjct: 187  KRWDYRGRVVPPPLFRYCGDKDSLDIVFPDWSFWGWPEINIEPWEALLKDLDDGNKKIKW 246

Query: 1216 EKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCD 1037
              R P AYWKGNP+VAD RK+L++CN +   DWNAR+Y+QDWI+ES+QGYK+SNLANQC 
Sbjct: 247  MNRDPTAYWKGNPYVADTRKDLLKCNVTETQDWNARVYVQDWIKESQQGYKESNLANQCT 306

Query: 1036 HRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIK 857
            HRYKIY+EGSAWSVS K I+ACDS TL+VTP YYDF +RALMP  HYWPI+ D KC SIK
Sbjct: 307  HRYKIYIEGSAWSVSEKYILACDSPTLLVTPHYYDFVTRALMPTHHYWPIKGDDKCRSIK 366

Query: 856  FAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKE 677
            +AVDWGN H ++A+A+G+  SSF+ ED+K++ VYDYMFH+L +YSKL++YKP+VPE A +
Sbjct: 367  YAVDWGNSHKQKAQAIGKTASSFILEDVKMAYVYDYMFHLLSEYSKLLRYKPTVPEKAVQ 426

Query: 676  VCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVA 497
             CS+S  C  + +   ++FM +S  + P+   PC L P  +   L    + +A A+++V 
Sbjct: 427  YCSESMACPAKGN--YEKFMKESFVKVPSDSEPCILPPPFEPPALQLLLRRKANAIKQVE 484

Query: 496  KMEEESWSKEK 464
              E+ S  K K
Sbjct: 485  TWEQNSRKKTK 495


>ref|XP_006287442.1| hypothetical protein CARUB_v10000648mg [Capsella rubella]
            gi|482556148|gb|EOA20340.1| hypothetical protein
            CARUB_v10000648mg [Capsella rubella]
          Length = 544

 Score =  518 bits (1334), Expect = e-144
 Identities = 237/423 (56%), Positives = 307/423 (72%), Gaps = 1/423 (0%)
 Frame = -3

Query: 1732 SREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTI 1553
            +++P  A  N    T  P    CPD+F +IHEDLRPW  TGIT E +E AN+TANFRL I
Sbjct: 120  NKDPTTASFN-DDDTNHPPTATCPDYFRWIHEDLRPWARTGITREALERANKTANFRLAI 178

Query: 1552 VDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIGKKYDESSS 1373
            V G++Y+   Q +FQTRDVFTIWGF+QL+  YPG +PD++LMFDCVDWPV+         
Sbjct: 179  VGGKVYVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRAAEFAGVD 238

Query: 1372 SP-PPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAA 1196
            +P PPPLFRYC + E LDI  PDWSFWGWAEVN KPWE L+K++ +GN++INW  R P A
Sbjct: 239  APSPPPLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNEKINWINREPYA 298

Query: 1195 YWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYV 1016
            YWKGNP VA+ R++LM+CN S  H+WNARLY QDWI+ES++GYKQS+LANQC HRYKIY+
Sbjct: 299  YWKGNPVVAETRQDLMKCNVSEEHEWNARLYAQDWIKESKEGYKQSDLANQCHHRYKIYI 358

Query: 1015 EGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGN 836
            EGSAWSVS K I+ACDS TL+V P YYDFF+R L+P  HYWP+R   KC SIKFAVDWGN
Sbjct: 359  EGSAWSVSEKYILACDSMTLLVKPHYYDFFTRGLLPAHHYWPVREKDKCRSIKFAVDWGN 418

Query: 835  QHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEY 656
             H ++A+ +G+A S F++++LK+  VYDYM+H+L++YSKL+++KP VP  A E+CS++  
Sbjct: 419  SHIQKAQDIGKAASEFIQQELKMDYVYDYMYHLLNEYSKLLQFKPEVPPNAVEICSETMA 478

Query: 655  CSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESW 476
            C+  RS  E++FM +S  + P    PC L P  D   L    K +     ++  ME + W
Sbjct: 479  CT--RSGNERKFMTESLVKHPAESGPCALPPPYDPVSLYSVAKRKQSTTARILHMEMKYW 536

Query: 475  SKE 467
            SK+
Sbjct: 537  SKQ 539


>gb|EOY23193.1| Glycosyltransferase isoform 1 [Theobroma cacao]
          Length = 522

 Score =  518 bits (1333), Expect = e-144
 Identities = 235/437 (53%), Positives = 316/437 (72%), Gaps = 6/437 (1%)
 Frame = -3

Query: 1756 RVVNFNCTSREPCKARKNLSKKTLEPNPNK-----CPDFFMFIHEDLRPWKETGITLEMV 1592
            R +  NCT+R   +A        +E  P+      CPD+F +IHEDLRPW  TGI+++M+
Sbjct: 88   RDIPLNCTARNLTRACPTNDPTAIEEEPDSSLNAMCPDYFRWIHEDLRPWAYTGISMDML 147

Query: 1591 EMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVD 1412
            + A +TANFRL +V+GR Y+   ++SFQTRDVFT+WG +QL+  YPG +PD+DLMFDCVD
Sbjct: 148  KRAEKTANFRLVVVNGRAYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVD 207

Query: 1411 WPVIGKK-YDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKG 1235
            WPVI    Y   +++ PPPLFRYC D+E LDI  PDWSFWGW E+N KPW  L+ D+ +G
Sbjct: 208  WPVIKTSDYGGPNATTPPPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEG 267

Query: 1234 NKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSN 1055
            NKR+ WE R P AYWKGNP VA  R++L++CN S   DW AR+Y QDW RES+QGYKQS+
Sbjct: 268  NKRMGWEGREPHAYWKGNPNVATTRQDLLKCNVSDKQDWGARVYAQDWARESQQGYKQSD 327

Query: 1054 LANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDK 875
            LANQC HR+KIY+EGSAWSVS K I+ACDS TL+V P+YYDFF+R+L P RHYWPI+ D 
Sbjct: 328  LANQCIHRFKIYIEGSAWSVSEKYILACDSLTLLVKPRYYDFFTRSLEPMRHYWPIKDDD 387

Query: 874  KCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSV 695
            KC SIK AVDWGN H ++A+A+G+A S F+KE LK+  VYDYMFH+L++Y+KL++YKP+V
Sbjct: 388  KCRSIKHAVDWGNGHQQEAQAIGKAASEFIKEGLKMDYVYDYMFHLLNEYAKLLRYKPTV 447

Query: 694  PEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAK 515
            P  A E+CS++  C  E   ++K+FM +S  +GP+  +PC + P  D   L      +  
Sbjct: 448  PRKAVELCSETMACPAE--GLQKKFMMESMVKGPSVTSPCTMPPPYDPASLYALLSKKEN 505

Query: 514  ALRKVAKMEEESWSKEK 464
            ++++V + E++ W  +K
Sbjct: 506  SIKQVEEWEKKFWEMQK 522


>gb|EMJ21936.1| hypothetical protein PRUPE_ppa023179mg [Prunus persica]
          Length = 502

 Score =  515 bits (1327), Expect = e-143
 Identities = 228/406 (56%), Positives = 307/406 (75%), Gaps = 1/406 (0%)
 Frame = -3

Query: 1681 PNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTR 1502
            P+P  CP++F +IHEDLRPW  TGIT EMVE ANRTANF+  IV+G+ Y+   +K+FQTR
Sbjct: 95   PSPPTCPEYFRWIHEDLRPWARTGITREMVERANRTANFKFVIVNGKAYVEQYEKAFQTR 154

Query: 1501 DVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHL 1325
            DVFT+WGF+QL+  YPG +PD++LMFDCVDWPVI   +Y   +++ PPPLFRYC+D+  L
Sbjct: 155  DVFTVWGFLQLLRRYPGQVPDLELMFDCVDWPVIPSHEYSGPNATAPPPLFRYCADDNTL 214

Query: 1324 DIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQ 1145
            DI  PDWSFWGWAE+N +PWE L +++ +GNKR  W +R P AYWKGNP +A+ R++L++
Sbjct: 215  DIVFPDWSFWGWAEINIRPWEVLFEELKEGNKRKTWLEREPYAYWKGNPDIAETRQDLIK 274

Query: 1144 CNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDS 965
            CN S  HDWNARLY QDW RES++GY +S+LA+QC HRYKIY+EGSAWSVS K I+ACDS
Sbjct: 275  CNVSEEHDWNARLYAQDWDRESKEGYNKSDLASQCIHRYKIYIEGSAWSVSEKYILACDS 334

Query: 964  ATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFM 785
             TLIV P+YYDFF+R LMP  HYWPI+ D KC SIKF+VDWGN H ++A+A+G+A S+ +
Sbjct: 335  VTLIVKPRYYDFFTRRLMPVEHYWPIKDDDKCRSIKFSVDWGNTHRRKAQAIGKASSNLI 394

Query: 784  KEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSA 605
            +E+LK+  VYDYMFH+L++Y+KL+++KP+VP+ A E+CS++  C  E +  EK+FM QS 
Sbjct: 395  QEELKMEYVYDYMFHLLNEYAKLLQFKPTVPKKAVELCSEAMACQAEGT--EKKFMLQSL 452

Query: 604  TEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467
             +GP    PC + P  D   L    + +  ++++V   E   W  +
Sbjct: 453  VKGPAVSEPCAMPPPYDPSSLFAVLRRKENSIKQVETWERNYWESQ 498


>ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
            lyrata] gi|297317912|gb|EFH48334.1| hypothetical protein
            ARALYDRAFT_910396 [Arabidopsis lyrata subsp. lyrata]
          Length = 543

 Score =  512 bits (1318), Expect = e-142
 Identities = 229/424 (54%), Positives = 305/424 (71%), Gaps = 1/424 (0%)
 Frame = -3

Query: 1735 TSREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLT 1556
            +++ P  A       T  P    CPD+F +IHEDLRPW  TGIT E +E A +TANFRL 
Sbjct: 117  SNKYPTTASFGEDDDTNHPPNATCPDYFRWIHEDLRPWSSTGITREALERAKKTANFRLA 176

Query: 1555 IVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDES 1379
            I+DG++Y+   Q +FQTRDVFTIWGF+QL+  YPG +PD++LMFDCVDWPV+   ++  +
Sbjct: 177  IIDGKIYVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVKASEFTGA 236

Query: 1378 SSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPA 1199
            ++  PPPLFRYC + E LDI  PDWSFWGWAEVN KPWE L+K++ +GN+R  W  R P 
Sbjct: 237  NAPSPPPLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNQRTKWINREPY 296

Query: 1198 AYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIY 1019
            AYWKGNP VA+ R++LM+CN S  H+WNARLY+QDWI+ES +GYKQS+LA+QC HRYKIY
Sbjct: 297  AYWKGNPMVAETRQDLMKCNVSEEHEWNARLYVQDWIKESNEGYKQSDLASQCHHRYKIY 356

Query: 1018 VEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWG 839
            +EGSAWSVS K I+ACDS TL+V P YYDFF+R L+P  HYWP+R   KC SIKFAVDWG
Sbjct: 357  IEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWG 416

Query: 838  NQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSE 659
            N H ++A+ +G+A S F++ +LK+  VYDYM+H+L +YSKL+++KP +P+ A E+CS++ 
Sbjct: 417  NSHIQKAQDIGKAASDFIQHELKMDYVYDYMYHLLTEYSKLLRFKPEIPQNAAEICSETM 476

Query: 658  YCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEES 479
             C   RS  E++FM +S  + P    PC + P  D   L    K +     ++ + E + 
Sbjct: 477  AC--PRSGNERKFMTESFVKHPAESGPCAMPPPYDPALLYGVVKRKQSTNMRILQWEMKY 534

Query: 478  WSKE 467
            WSK+
Sbjct: 535  WSKQ 538


>ref|XP_006394641.1| hypothetical protein EUTSA_v10003948mg [Eutrema salsugineum]
            gi|557091280|gb|ESQ31927.1| hypothetical protein
            EUTSA_v10003948mg [Eutrema salsugineum]
          Length = 545

 Score =  511 bits (1317), Expect = e-142
 Identities = 227/401 (56%), Positives = 300/401 (74%), Gaps = 1/401 (0%)
 Frame = -3

Query: 1666 CPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTI 1487
            CPD+F +IHEDLRPW++TGIT E +E A +TANFRL IV G++Y+   Q +FQTRDVFTI
Sbjct: 142  CPDYFRWIHEDLRPWEKTGITREALERAKKTANFRLAIVGGKLYVEKFQDAFQTRDVFTI 201

Query: 1486 WGFIQLMELYPGILPDVDLMFDCVDWPVIGKKYDESSSSP-PPPLFRYCSDNEHLDIPLP 1310
            WGF+QL+  YPG +PD++LMFDCVDWPV+       ++SP PPPLFRYC + E LDI  P
Sbjct: 202  WGFLQLLRRYPGKIPDLELMFDCVDWPVVKAANFAGANSPSPPPLFRYCGNEETLDIVFP 261

Query: 1309 DWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASH 1130
            DWSFWGW+EVN KPWE L+K++ +GN++ NW  R P AYWKGNP VA+ R++LM+CN S 
Sbjct: 262  DWSFWGWSEVNIKPWESLLKELREGNEKTNWINREPYAYWKGNPLVAETRQDLMKCNVSE 321

Query: 1129 GHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIV 950
             H+WNARLY QDWIRES++GYKQS+LA+QC HR+KIY+EGSAWSVS K I+ACDS TL+V
Sbjct: 322  EHEWNARLYAQDWIRESKEGYKQSDLASQCHHRFKIYIEGSAWSVSEKYILACDSVTLLV 381

Query: 949  TPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLK 770
             P YYDFF+R L+P  HYWP+R   KC SIKFAV WGN H ++A+ +G+A S F++++LK
Sbjct: 382  KPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVHWGNSHIQKAQDIGKAASEFIQQELK 441

Query: 769  ISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPN 590
            +  VYDYMFH+L +YSKL+++KP +P+ AKE+CS++  C   RS  E++FM +S  + P 
Sbjct: 442  MDYVYDYMFHLLTEYSKLLQFKPEIPQNAKEICSETMAC--PRSGNERKFMTESLVKHPA 499

Query: 589  SIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467
               PC + P  D        K +  A  ++ + E + WSK+
Sbjct: 500  QTGPCAMPPPYDPASFYAVVKRKQSAATRILQWEMKYWSKQ 540


>ref|XP_002321919.2| hypothetical protein POPTR_0015s13090g [Populus trichocarpa]
            gi|550322617|gb|EEF06046.2| hypothetical protein
            POPTR_0015s13090g [Populus trichocarpa]
          Length = 506

 Score =  511 bits (1317), Expect = e-142
 Identities = 231/427 (54%), Positives = 306/427 (71%), Gaps = 1/427 (0%)
 Frame = -3

Query: 1744 FNCTSREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANF 1565
            FN T + P     N  +    P+ + CP+ F +IHEDLRPW  TGI+ +MVE A RTANF
Sbjct: 78   FNPTRKCPLNYPTNTQEGPDRPSVSTCPEHFRWIHEDLRPWAHTGISRDMVERAKRTANF 137

Query: 1564 RLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKY 1388
            RL IV+G+ Y+   +KSFQTRD FT+WG IQL+  YPG LPD+D+MFDCVDWPVI    Y
Sbjct: 138  RLVIVNGKAYMERYRKSFQTRDTFTVWGIIQLLRKYPGKLPDLDMMFDCVDWPVIRSSDY 197

Query: 1387 DESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKR 1208
               +++ PP LFRYC D++ LD+  PDWSFWGW E+N KPWE L  D+ +GNK   W +R
Sbjct: 198  SGPNATSPPALFRYCGDDDSLDVVFPDWSFWGWPEINIKPWESLSNDLKEGNKITKWMER 257

Query: 1207 VPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRY 1028
             P AYWKGNP VA  R++LM+C+AS   DWNAR+Y QDWI+ES+QGY+QSNLANQC H+Y
Sbjct: 258  EPYAYWKGNPSVAATRQDLMKCHASETQDWNARVYAQDWIKESQQGYQQSNLANQCVHKY 317

Query: 1027 KIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAV 848
            KIY+EGSAWSVS K I+ACDS TL+V P YYDFF+R+L+P RHYWPI+ D KC SIKFAV
Sbjct: 318  KIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRSLVPNRHYWPIKEDDKCRSIKFAV 377

Query: 847  DWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCS 668
            +WGN H+++A+AMG+A S F++EDLK+  VYDYMFH+L++Y+KL+ +KP++P  A E+C+
Sbjct: 378  EWGNNHSEEAQAMGKAASEFIQEDLKMDYVYDYMFHLLNEYAKLLTFKPTIPGRAIELCA 437

Query: 667  DSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKME 488
            ++  C    + +EK+FM  S    P   +PC + P  D   L    +    ++++V   E
Sbjct: 438  EAMACPA--NGLEKKFMMDSMVMSPADTSPCTMPPPYDPLSLHSVFQRNGNSIKQVESWE 495

Query: 487  EESWSKE 467
            +E W  +
Sbjct: 496  KEYWDNQ 502


>gb|EXB29382.1| hypothetical protein L484_001025 [Morus notabilis]
          Length = 515

 Score =  511 bits (1315), Expect = e-142
 Identities = 228/401 (56%), Positives = 299/401 (74%), Gaps = 1/401 (0%)
 Frame = -3

Query: 1666 CPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTI 1487
            CPD+F +I+EDLRPW  TGI+ +MVE A RTANFRL IV+G+ Y+   QK+FQTRDVFT+
Sbjct: 113  CPDYFRWIYEDLRPWAYTGISRDMVERAKRTANFRLVIVNGKAYVETFQKAFQTRDVFTL 172

Query: 1486 WGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDIPLP 1310
            WG +QL+  YPG +PD++LMFDCVDWPV+  K Y    ++ PPPLFRYC D+  LDI  P
Sbjct: 173  WGILQLLRKYPGRVPDLELMFDCVDWPVVLSKAYSGPDATTPPPLFRYCGDDSTLDIVFP 232

Query: 1309 DWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASH 1130
            DWSFWGW E N KPWE L+K++ +GNK+  W +R   AYWKGNP VA  R++L++CN S 
Sbjct: 233  DWSFWGWPETNIKPWEALLKELEEGNKKSKWVEREAYAYWKGNPVVAATRQDLLKCNVSD 292

Query: 1129 GHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIV 950
              DWNARLY QDW++ES++GYKQS+LANQC HRYKIY+EGSAWSVS K I+ACDS TLIV
Sbjct: 293  KQDWNARLYAQDWLKESKEGYKQSDLANQCIHRYKIYIEGSAWSVSEKYILACDSVTLIV 352

Query: 949  TPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLK 770
             P YYDFF+R L+P +HYWPI+ D KC SIKFAVDWGN H K+AK++G+A S F+++DLK
Sbjct: 353  KPHYYDFFTRGLVPMQHYWPIKDDDKCRSIKFAVDWGNSHKKKAKSIGKAASRFIQDDLK 412

Query: 769  ISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPN 590
            +  VYDYMFH+L++Y+KL+K+KPS+PE A E CS+S  C+ E   + K+FM +S  +GP 
Sbjct: 413  MEYVYDYMFHLLNEYAKLLKFKPSIPEKAVEFCSESMACTAE--GIGKKFMMESMVKGPA 470

Query: 589  SIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467
              +PC + PS +   L    + +   + +V   + + W  +
Sbjct: 471  DSSPCTMPPSYNPSSLYSLIQKKTSLIEQVEMWQNKYWENQ 511


>ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
            gi|302143884|emb|CBI22745.3| unnamed protein product
            [Vitis vinifera]
          Length = 525

 Score =  509 bits (1312), Expect = e-141
 Identities = 227/406 (55%), Positives = 304/406 (74%), Gaps = 1/406 (0%)
 Frame = -3

Query: 1681 PNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTR 1502
            P+P +CP +F +I+ DLRPW ++GIT EMVE A RTA F+L I++GR Y+   Q++FQTR
Sbjct: 120  PSPPECPHYFRWIYGDLRPWMKSGITREMVERAKRTATFKLVILNGRAYVEKYQRAFQTR 179

Query: 1501 DVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHL 1325
            DVFT+WG +QL+  YPG +PD++LMFDCVDWPVI   +Y   +++ PPPLFRYC D+  L
Sbjct: 180  DVFTLWGILQLLRRYPGKVPDLELMFDCVDWPVIQSNEYRGPNATAPPPLFRYCGDDATL 239

Query: 1324 DIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQ 1145
            DI  PDWSFWGW E+N KPWE L+KD+ +GNKR  W +R P AYWKGNP VA  R +L++
Sbjct: 240  DIVFPDWSFWGWPEINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDLLK 299

Query: 1144 CNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDS 965
            CN S   DWNAR+Y QDWI ES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS
Sbjct: 300  CNVSDKQDWNARVYTQDWILESQEGYKQSDLASQCIHRYKIYIEGSAWSVSQKYILACDS 359

Query: 964  ATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFM 785
             TL+V P YYDFF+R+LMP  HYWPIR D KC SIKFAVDWGN+H ++A+++G+A S F+
Sbjct: 360  VTLLVKPHYYDFFTRSLMPVHHYWPIREDDKCRSIKFAVDWGNRHKQKAQSIGKAASDFI 419

Query: 784  KEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSA 605
            +EDLK+ NVYDYMFH+L++Y+KL+K+KP+VPE A E+CS+   C  E   ++K+FM +S 
Sbjct: 420  QEDLKMDNVYDYMFHLLNEYAKLLKFKPTVPEKAVELCSERMGCGAE--GLKKKFMMESM 477

Query: 604  TEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467
             + P   +PC + P      L  +   +  ++++V   E++ W  +
Sbjct: 478  VKYPMDASPCTMPPPFSPLELQTFLNRKVNSIKQVEAWEKKFWENQ 523


>gb|EOY23194.1| Glycosyltransferase isoform 2 [Theobroma cacao]
          Length = 498

 Score =  509 bits (1311), Expect = e-141
 Identities = 230/407 (56%), Positives = 302/407 (74%), Gaps = 6/407 (1%)
 Frame = -3

Query: 1756 RVVNFNCTSREPCKARKNLSKKTLEPNPNK-----CPDFFMFIHEDLRPWKETGITLEMV 1592
            R +  NCT+R   +A        +E  P+      CPD+F +IHEDLRPW  TGI+++M+
Sbjct: 88   RDIPLNCTARNLTRACPTNDPTAIEEEPDSSLNAMCPDYFRWIHEDLRPWAYTGISMDML 147

Query: 1591 EMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVD 1412
            + A +TANFRL +V+GR Y+   ++SFQTRDVFT+WG +QL+  YPG +PD+DLMFDCVD
Sbjct: 148  KRAEKTANFRLVVVNGRAYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVD 207

Query: 1411 WPVIGKK-YDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKG 1235
            WPVI    Y   +++ PPPLFRYC D+E LDI  PDWSFWGW E+N KPW  L+ D+ +G
Sbjct: 208  WPVIKTSDYGGPNATTPPPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEG 267

Query: 1234 NKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSN 1055
            NKR+ WE R P AYWKGNP VA  R++L++CN S   DW AR+Y QDW RES+QGYKQS+
Sbjct: 268  NKRMGWEGREPHAYWKGNPNVATTRQDLLKCNVSDKQDWGARVYAQDWARESQQGYKQSD 327

Query: 1054 LANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDK 875
            LANQC HR+KIY+EGSAWSVS K I+ACDS TL+V P+YYDFF+R+L P RHYWPI+ D 
Sbjct: 328  LANQCIHRFKIYIEGSAWSVSEKYILACDSLTLLVKPRYYDFFTRSLEPMRHYWPIKDDD 387

Query: 874  KCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSV 695
            KC SIK AVDWGN H ++A+A+G+A S F+KE LK+  VYDYMFH+L++Y+KL++YKP+V
Sbjct: 388  KCRSIKHAVDWGNGHQQEAQAIGKAASEFIKEGLKMDYVYDYMFHLLNEYAKLLRYKPTV 447

Query: 694  PEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPD 554
            P  A E+CS++  C  E   ++K+FM +S  +GP+  +PC + P  D
Sbjct: 448  PRKAVELCSETMACPAE--GLQKKFMMESMVKGPSVTSPCTMPPPYD 492


>ref|XP_006290867.1| hypothetical protein CARUB_v10016976mg [Capsella rubella]
            gi|482559574|gb|EOA23765.1| hypothetical protein
            CARUB_v10016976mg [Capsella rubella]
          Length = 539

 Score =  508 bits (1309), Expect = e-141
 Identities = 225/404 (55%), Positives = 298/404 (73%), Gaps = 1/404 (0%)
 Frame = -3

Query: 1675 PNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDV 1496
            P  CPD+F +IHEDLRPW++TGIT E +E AN TA FRL I+DGR+Y+   +++FQTRDV
Sbjct: 133  PATCPDYFRWIHEDLRPWEKTGITREALERANATAIFRLAIIDGRIYVENFREAFQTRDV 192

Query: 1495 FTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDI 1319
            FTIWGF+QL+  YPG +PD++LMFDCVDWPV+  ++Y       PPPLFRYC+++E LDI
Sbjct: 193  FTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAEEYSGVDKPSPPPLFRYCANDETLDI 252

Query: 1318 PLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCN 1139
              PDWS+WGWAEVN KPWE L+KD+S+GN+R  W  R P AYWKGNP VA+ R +LM+CN
Sbjct: 253  VFPDWSYWGWAEVNIKPWESLLKDLSEGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCN 312

Query: 1138 ASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSAT 959
             S  +DW ARLY QDW++ES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS T
Sbjct: 313  LSEEYDWKARLYKQDWLKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVT 372

Query: 958  LIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKE 779
            L+V P YYDFF+R + PG HYWP++ D KC SIKFAVDWGN H ++A+ +G+  S F+++
Sbjct: 373  LMVKPHYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASEFVQQ 432

Query: 778  DLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATE 599
            +LK+  VYDYMFH+L QYSKL+++KP +P+ + EVCS++  C   R   E++FM +S  +
Sbjct: 433  ELKMDYVYDYMFHLLTQYSKLLRFKPEIPQNSTEVCSETMAC--PRDGNERKFMMESLVK 490

Query: 598  GPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467
             P    PC + P  D        K R     ++ + E + W K+
Sbjct: 491  RPAETGPCAMPPPYDPASFYSVLKRRQSTTSRIEQWESKYWRKQ 534


>ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 538

 Score =  507 bits (1305), Expect = e-140
 Identities = 240/451 (53%), Positives = 311/451 (68%), Gaps = 15/451 (3%)
 Frame = -3

Query: 1774 NPIHKPR---------VVNFNCTSREPCKARKNLSKKTLEP-NP----NKCPDFFMFIHE 1637
            NP H+PR           +FN  +   C A    +  T E  NP    + CPD+F +IHE
Sbjct: 86   NPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQNPPSSSSACPDYFRWIHE 145

Query: 1636 DLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELY 1457
            DLRPW  TGIT   +E   RTANFRL I++G+ Y+   +KSFQTRD FT+WG +QL+  Y
Sbjct: 146  DLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGILQLLRRY 205

Query: 1456 PGILPDVDLMFDCVDWPVIGKKYDESSSSP-PPPLFRYCSDNEHLDIPLPDWSFWGWAEV 1280
            PG +PD+DLMFDCVDWPVI   +    + P PPPLFRYC D+   DI  PDWSFWGW E+
Sbjct: 206  PGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWSFWGWPEI 265

Query: 1279 NTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYI 1100
            N KPWE L+KDI +GNKRI W+ R P AYWKGNP VAD RK+L++CN S   DWNAR++ 
Sbjct: 266  NIKPWEPLLKDIKEGNKRIPWKSREPYAYWKGNPEVADTRKDLIKCNVSDQQDWNARVFA 325

Query: 1099 QDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSR 920
            QDW +ES++GYKQS+L+NQC HRYKIY+EGSAWSVS K I+ACDS TLIV P YYDFF+R
Sbjct: 326  QDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTR 385

Query: 919  ALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFH 740
             LMP  HYWP++ D KC+SIKFAVDWGN H ++A+A+G+A SSF++E+LK+  VYDYMFH
Sbjct: 386  GLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFH 445

Query: 739  MLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPS 560
            +L +YSKL+ +KP++P  A E+CS++  C  E   + K+FM +S  + P    PC + P 
Sbjct: 446  LLSEYSKLLTFKPTLPPNAIELCSEAMACPAE--GLTKKFMTESLVKRPAESNPCTMPPP 503

Query: 559  PDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467
             D   L      +  ++++V K E   W+ +
Sbjct: 504  YDPASLHFVLSRKENSIKQVEKWETSFWNTQ 534


>gb|AED99886.1| glycosyltransferase [Panax notoginseng]
          Length = 546

 Score =  506 bits (1303), Expect = e-140
 Identities = 227/401 (56%), Positives = 296/401 (73%), Gaps = 1/401 (0%)
 Frame = -3

Query: 1675 PNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDV 1496
            P  CP++F +I+EDLRPW+ETGIT EMVE A RTANFRL I++GR Y+  +QKSFQ+RDV
Sbjct: 143  PVSCPEYFRWIYEDLRPWRETGITREMVERARRTANFRLVILNGRAYVETHQKSFQSRDV 202

Query: 1495 FTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDI 1319
            FT+WG +QL+ +YPG +PD+DLMFDCVDWPVI  + Y   +++ PPPLFRYC+D+  LDI
Sbjct: 203  FTLWGILQLLRMYPGKVPDLDLMFDCVDWPVIISRFYHGPNATAPPPLFRYCADDSTLDI 262

Query: 1318 PLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCN 1139
              PDW+FWGW E+N KPW  L+KD+ +GN    W  R P AYWKGNP VA  R +L++CN
Sbjct: 263  VFPDWTFWGWPEINIKPWGSLLKDLKEGNTGTQWMDREPYAYWKGNPIVAKTRMDLLKCN 322

Query: 1138 ASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSAT 959
             S   DWNAR+Y  DW RES+ GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS T
Sbjct: 323  VSDKQDWNARVYAXDWARESQLGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILACDSVT 382

Query: 958  LIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKE 779
            L V P+YYDFF+R LMP  HYWPIR D KC SIKFAVDWGN H ++A ++G+  S+F++E
Sbjct: 383  LXVKPRYYDFFTRGLMPVHHYWPIRDDDKCRSIKFAVDWGNNHKQKAHSIGKEASNFIQE 442

Query: 778  DLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATE 599
            DLK+  VYDYMFH+L++Y+KL++YKP+VP  A E+CS++  C  E     K+FM +S  +
Sbjct: 443  DLKMDYVYDYMFHLLNEYAKLLRYKPTVPPKAVELCSETMACPAE--GFTKKFMMESIVK 500

Query: 598  GPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESW 476
            GP   +PC + P  D   L    + +  ++++V   E+  W
Sbjct: 501  GPTDKSPCVMQPPYDPPTLHSVLRRKENSIKQVENWEKLYW 541


>ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus]
          Length = 538

 Score =  506 bits (1302), Expect = e-140
 Identities = 240/451 (53%), Positives = 310/451 (68%), Gaps = 15/451 (3%)
 Frame = -3

Query: 1774 NPIHKPR---------VVNFNCTSREPCKARKNLSKKTLEP-NP----NKCPDFFMFIHE 1637
            NP H+PR           +FN  +   C A    +  T E  NP    + CPD+F +IHE
Sbjct: 86   NPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQNPPSSSSACPDYFRWIHE 145

Query: 1636 DLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELY 1457
            DLRPW  TGIT   +E   RTANFRL I++G+ Y+   +KSFQTRD FT+WG +QL+  Y
Sbjct: 146  DLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGILQLLRRY 205

Query: 1456 PGILPDVDLMFDCVDWPVIGKKYDESSSSP-PPPLFRYCSDNEHLDIPLPDWSFWGWAEV 1280
            PG +PD+DLMFDCVDWPVI   +    + P PPPLFRYC D+   DI  PDWSFWGW E+
Sbjct: 206  PGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWSFWGWPEI 265

Query: 1279 NTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYI 1100
            N KPWE L+KDI +GNKRI W+ R P AYWKGNP VAD RK+L++CN S   DWNAR++ 
Sbjct: 266  NIKPWEPLLKDIKEGNKRIPWKSRQPYAYWKGNPEVADTRKDLIKCNVSDQQDWNARVFA 325

Query: 1099 QDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSR 920
            QDW +ES++GYKQSNL+NQC HRYKIY+EGSAWSVS K I+ACDS TLIV P YYDFF+R
Sbjct: 326  QDWTKESQEGYKQSNLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTR 385

Query: 919  ALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFH 740
             LMP  HYWP++ D KC+SIKFAVDWGN H ++A+A+G+A SSF++E+LK+  VYDYMFH
Sbjct: 386  GLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFH 445

Query: 739  MLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPS 560
            +L +YSKL+ +KP++P  A E+CS++  C  E   + K+FM +S  + P    PC +   
Sbjct: 446  LLSEYSKLLTFKPTLPPNAIELCSEAMACPAE--GLTKKFMTESLVKRPAESNPCTMPSP 503

Query: 559  PDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467
             D   L      +  ++++V K E   W+ +
Sbjct: 504  YDPASLHFVLSRKENSIKQVEKWETSFWNTQ 534


>ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana]
            gi|10176852|dbj|BAB10058.1| unnamed protein product
            [Arabidopsis thaliana] gi|48310551|gb|AAT41837.1|
            At5g23850 [Arabidopsis thaliana]
            gi|62320258|dbj|BAD94534.1| putative protein [Arabidopsis
            thaliana] gi|332005839|gb|AED93222.1| uncharacterized
            protein AT5G23850 [Arabidopsis thaliana]
          Length = 542

 Score =  506 bits (1302), Expect = e-140
 Identities = 227/409 (55%), Positives = 297/409 (72%), Gaps = 1/409 (0%)
 Frame = -3

Query: 1690 TLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSF 1511
            T  P    CPD+F +IHEDLRPW  TGIT E +E A +TA FRL IV G++Y+   Q +F
Sbjct: 131  TNHPPTATCPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGKIYVEKFQDAF 190

Query: 1510 QTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIGKKYDESSSSP-PPPLFRYCSDN 1334
            QTRDVFTIWGF+QL+  YPG +PD++LMFDCVDWPV+       +++P PPPLFRYC + 
Sbjct: 191  QTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSPPPLFRYCGNE 250

Query: 1333 EHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKE 1154
            E LDI  PDWSFWGWAEVN KPWE L+K++ +GN+R  W  R P AYWKGNP VA+ R++
Sbjct: 251  ETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKGNPMVAETRQD 310

Query: 1153 LMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMA 974
            LM+CN S  H+WNARLY QDWI+ES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+A
Sbjct: 311  LMKCNVSEEHEWNARLYAQDWIKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILA 370

Query: 973  CDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGS 794
            CDS TL+V P YYDFF+R L+P  HYWP+R   KC SIKFAVDWGN H ++A+ +G+A S
Sbjct: 371  CDSVTLLVKPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWGNSHIQKAQDIGKAAS 430

Query: 793  SFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMF 614
             F+++DLK+  VYDYM+H+L +YSKL+++KP +P  A E+CS++  C   RS  E++FM 
Sbjct: 431  DFIQQDLKMDYVYDYMYHLLTEYSKLLQFKPEIPRNAVEICSETMACL--RSGNERKFMT 488

Query: 613  QSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467
            +S  + P    PC + P  D     +  K +     ++ + E + WSK+
Sbjct: 489  ESLVKQPADSGPCAMPPPYDPATYYEVVKRKQSTNMRILQWEMKYWSKQ 537


>gb|EMJ21654.1| hypothetical protein PRUPE_ppa005169mg [Prunus persica]
          Length = 474

 Score =  505 bits (1301), Expect = e-140
 Identities = 228/419 (54%), Positives = 303/419 (72%), Gaps = 1/419 (0%)
 Frame = -3

Query: 1720 CKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGR 1541
            C    N  +    P P  CP++F +IHEDLRPW  TGIT +M++ A RTANF+L IV+G+
Sbjct: 54   CTRLLNSRQDPDRPLPPTCPEYFRWIHEDLRPWAHTGITRDMIQRAKRTANFKLVIVNGK 113

Query: 1540 MYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIGKK-YDESSSSPP 1364
             Y+   QKSFQTRDVFT+WG +QL+  YPG +PD++LMFDCVDWPVI    Y   +++ P
Sbjct: 114  AYVEKYQKSFQTRDVFTMWGILQLLRRYPGQVPDLELMFDCVDWPVISSNDYSGPNATAP 173

Query: 1363 PPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKG 1184
            PPLFRYC D+  LDI  PDWSFWGWAE+N  PWE L+KD+ +GNKR  W  R P AYWKG
Sbjct: 174  PPLFRYCGDDNSLDIVFPDWSFWGWAEINIMPWEVLLKDLEEGNKRRRWIDRAPYAYWKG 233

Query: 1183 NPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSA 1004
            NP VA  R++L++CN S   DWNAR+Y QDW+RES +GYKQS+LA+QC  RYKIY+EGSA
Sbjct: 234  NPSVAATRQDLLKCNVSDQQDWNARVYAQDWLRESSEGYKQSDLASQCVDRYKIYIEGSA 293

Query: 1003 WSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTK 824
            WSVS K I+ACDS TLIV P+YYDFF+R+LMP  HYWPI+ D KC SIKFAVDWGN H +
Sbjct: 294  WSVSDKYILACDSVTLIVKPRYYDFFTRSLMPVHHYWPIKDDDKCRSIKFAVDWGNSHKQ 353

Query: 823  QAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGE 644
            +A+A+G+A S  ++E+LK+  VYDYMFH+L++Y+KL+++KP++P  A E+CS++  C  +
Sbjct: 354  KAQAIGKAASKLIQEELKMDYVYDYMFHLLNEYAKLLQFKPTIPRKAIELCSEAMACQAQ 413

Query: 643  RSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467
             +  EK+FM +S  +GP    PC + P      L    +  A ++++V   E++ W  +
Sbjct: 414  GT--EKKFMMESMVKGPAVSNPCTMPPPYGPASLFAVLRRNANSIKQVETWEKKYWENQ 470


>ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
            communis] gi|223549903|gb|EEF51390.1| KDEL
            motif-containing protein 1 precursor, putative [Ricinus
            communis]
          Length = 528

 Score =  504 bits (1299), Expect = e-140
 Identities = 229/442 (51%), Positives = 313/442 (70%), Gaps = 2/442 (0%)
 Frame = -3

Query: 1783 NSSNPIHKP-RVVNFNCTSREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGI 1607
            N+ N I+ P     FN T   P       ++    P+ + CP+++ +I+EDLRPW  TGI
Sbjct: 86   NALNKINIPLNCAAFNLTRTCPSNYPTTFTENPDRPSVSACPEYYRWIYEDLRPWARTGI 145

Query: 1606 TLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLM 1427
            + +MVE A  TANFRL IV+G+ Y+   +++FQTRDVFT+WG +QL+  YPG +PD++LM
Sbjct: 146  SRDMVERAKTTANFRLVIVNGKAYVEKYRRAFQTRDVFTLWGILQLLRRYPGKVPDLELM 205

Query: 1426 FDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVK 1250
            FDCVDWPVI    Y   ++  PPPLFRYC D++ LD+  PDWSFWGW+E+N KPWE L++
Sbjct: 206  FDCVDWPVIKSSNYSGPNAMAPPPLFRYCGDDDTLDVVFPDWSFWGWSEINIKPWERLLR 265

Query: 1249 DISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQG 1070
            ++ +GN++  W +R P AYWKGNP VA+ R++LM+CN S   DWNAR+Y QDWI+E +QG
Sbjct: 266  ELKEGNEKRRWMEREPYAYWKGNPAVAETRQDLMKCNVSEQQDWNARVYAQDWIKELQQG 325

Query: 1069 YKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWP 890
            YKQSNLA+QC HRYKIY+EGSAWSVS K I+ACDS TL+V P YYDFF+R+L P  HYWP
Sbjct: 326  YKQSNLASQCMHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRSLRPIHHYWP 385

Query: 889  IRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMK 710
            I+   KC SIKFAVDWGN H ++A+A+G+A S F++E+LK+  VYDYMFH+L++Y+KL+ 
Sbjct: 386  IKDYDKCRSIKFAVDWGNNHKQKAQAIGKAASEFIQEELKMDYVYDYMFHLLNEYAKLLT 445

Query: 709  YKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWN 530
            +KP +P  A E+CS+S  C    + +EK FM +S  +GP    PC + P  D   L    
Sbjct: 446  FKPVIPRKAVELCSESMACPA--NGIEKEFMMESMVQGPAETNPCIMLPPYDPSALHSIF 503

Query: 529  KMRAKALRKVAKMEEESWSKEK 464
            + +  ++R+V   E+  W K+K
Sbjct: 504  RRKENSIRQVELWEKMYWDKQK 525


>ref|XP_002875936.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp.
            lyrata] gi|297321774|gb|EFH52195.1| hypothetical protein
            ARALYDRAFT_485256 [Arabidopsis lyrata subsp. lyrata]
          Length = 539

 Score =  504 bits (1297), Expect = e-140
 Identities = 221/401 (55%), Positives = 297/401 (74%), Gaps = 1/401 (0%)
 Frame = -3

Query: 1666 CPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTI 1487
            CPD+F +IHEDLRPW++TGIT E +E AN TANFRL I++GR+Y+   +++FQTRDVFTI
Sbjct: 136  CPDYFRWIHEDLRPWEKTGITREALERANATANFRLAIINGRIYVEKFREAFQTRDVFTI 195

Query: 1486 WGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDIPLP 1310
            WGF+QL+  YPG +PD++LMFDCVDWPV+   ++      PPPPLFRYC+++E LDI  P
Sbjct: 196  WGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFP 255

Query: 1309 DWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASH 1130
            DWS+WGWAEVN KPWE L+K++ +GN+R  W  R P AYWKGNP VA+ R +LM+CN S 
Sbjct: 256  DWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLSE 315

Query: 1129 GHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIV 950
             +DW ARLY QDW++ES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS TL+V
Sbjct: 316  EYDWKARLYKQDWVKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLLV 375

Query: 949  TPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLK 770
             P YYDFF+R + PG HYWP++ D KC SIKFAVDWGN H ++A+ +G+  S F++++LK
Sbjct: 376  KPHYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASEFVQQELK 435

Query: 769  ISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPN 590
            +  VYDYMFH+L QYSKL+++KP +P+ + E+CS++  C   R   E++FM +S  + P 
Sbjct: 436  MDYVYDYMFHLLIQYSKLLRFKPEIPQNSTELCSEAMAC--PRDGNERKFMMESLVKHPA 493

Query: 589  SIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467
               PC + P  D        K R     ++ + E + W K+
Sbjct: 494  ETGPCAMPPPYDPASFYSVLKRRQSTTSRIEQWESKYWRKQ 534


>ref|XP_006404195.1| hypothetical protein EUTSA_v10010269mg [Eutrema salsugineum]
            gi|557105314|gb|ESQ45648.1| hypothetical protein
            EUTSA_v10010269mg [Eutrema salsugineum]
          Length = 543

 Score =  503 bits (1294), Expect = e-139
 Identities = 230/447 (51%), Positives = 308/447 (68%), Gaps = 15/447 (3%)
 Frame = -3

Query: 1762 KPRVVNFNCTS-------------REPCKARKNLSKKTLEPNPNK-CPDFFMFIHEDLRP 1625
            KP+    NC +             R P   R    +   E +P   CPD+F +IHEDLRP
Sbjct: 94   KPKEFTLNCAAFSGNETVITCPRNRYPTSLRSGAREDDPERSPPATCPDYFRWIHEDLRP 153

Query: 1624 WKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGIL 1445
            W++TGIT E +E AN TANFRL I++GR+Y+   +++FQTRDVFTIWGF+QL+  YPG +
Sbjct: 154  WEKTGITREALERANATANFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKI 213

Query: 1444 PDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKP 1268
            PD++LMFDCVDWPV+   ++       PPPLFRYC +NE LDI  PDWS+WGWAEVN KP
Sbjct: 214  PDLELMFDCVDWPVVKAAEFAGVDQLTPPPLFRYCGNNETLDIVFPDWSYWGWAEVNIKP 273

Query: 1267 WEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWI 1088
            WE L+K++ +GN+R  W  R P AYWKGNP VA+ R++LM+CN S  +DW ARLY QDW+
Sbjct: 274  WESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRQDLMKCNVSEDYDWKARLYPQDWV 333

Query: 1087 RESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMP 908
            RES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS TL+V P YYDFF+R + P
Sbjct: 334  RESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGMFP 393

Query: 907  GRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQ 728
            G HYWP++ D KC SIKFAVD+GN H  +A+ +G+  S F++++LK+  VYDYM+H+L Q
Sbjct: 394  GHHYWPVKEDDKCRSIKFAVDFGNLHMLKAQDIGKKASEFVQQELKMDYVYDYMYHLLTQ 453

Query: 727  YSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQ 548
            YSKL+++KP +P+ A E+CS++  C   R   E++FM +S  + P    PC + P  D  
Sbjct: 454  YSKLLRFKPKIPQNATELCSEAMAC--PRDGNERKFMMESLVKRPAETGPCAMPPPYDPA 511

Query: 547  RLDQWNKMRAKALRKVAKMEEESWSKE 467
                  K R     ++ + E + W K+
Sbjct: 512  SFYSVLKRRQSTTSRIEQWESKYWRKQ 538

