BLASTX nr result

ID: Paeonia25_contig00020260 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00020260
         (1564 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276292.2| PREDICTED: uncharacterized protein LOC100262...   506   e-141
emb|CBI40456.3| unnamed protein product [Vitis vinifera]              506   e-141
emb|CAN69310.1| hypothetical protein VITISV_003086 [Vitis vinifera]   497   e-138
ref|XP_007051668.1| Glycosyl transferase family 1 protein isofor...   495   e-137
ref|XP_007051667.1| Glycosyl transferase family 1 protein isofor...   495   e-137
ref|XP_002301386.2| glycosyltransferase family protein [Populus ...   494   e-137
ref|XP_002320170.1| glycosyltransferase family protein [Populus ...   494   e-137
ref|XP_007220285.1| hypothetical protein PRUPE_ppa000692mg [Prun...   489   e-135
ref|XP_002511940.1| transferase, transferring glycosyl groups, p...   474   e-133
gb|EXB52710.1| hypothetical protein L484_022487 [Morus notabilis]     477   e-132
ref|XP_004159777.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   470   e-130
ref|XP_004138457.1| PREDICTED: uncharacterized protein LOC101212...   470   e-130
ref|XP_004306713.1| PREDICTED: uncharacterized protein LOC101302...   469   e-129
ref|XP_006444916.1| hypothetical protein CICLE_v10018649mg [Citr...   461   e-127
ref|XP_006339650.1| PREDICTED: uncharacterized protein LOC102591...   454   e-125
ref|XP_004229962.1| PREDICTED: uncharacterized protein LOC101246...   452   e-124
ref|XP_006396290.1| hypothetical protein EUTSA_v10028385mg [Eutr...   424   e-116
ref|XP_002872903.1| glycosyltransferase family protein 1 [Arabid...   416   e-113
ref|XP_007135157.1| hypothetical protein PHAVU_010G105900g [Phas...   414   e-113
ref|NP_192030.4| glycosyl transferase family 1 protein [Arabidop...   402   e-109

>ref|XP_002276292.2| PREDICTED: uncharacterized protein LOC100262009 [Vitis vinifera]
          Length = 1026

 Score =  506 bits (1304), Expect = e-141
 Identities = 269/434 (61%), Positives = 313/434 (72%), Gaps = 3/434 (0%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449
            MGSLE G+  KRD  L                               DYLQW+CT     
Sbjct: 1    MGSLENGVPVKRDPLL-----RSSSNKGSAFQRPIVRFSRFLFFGKLDYLQWVCTVAVFC 55

Query: 450  XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623
                     LPG +MEKSG +L   E   GDL+F K I GLDF E IRF+P K+L +F+K
Sbjct: 56   FFVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKNIGGLDFGEGIRFEPSKLLQKFQK 115

Query: 624  EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803
            EA E NLSS  +R R R+GYRKPQLALVF DLLVDP Q+LMVTVA+AL E+GY IQVYSL
Sbjct: 116  EADEVNLSSA-SRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQVYSL 174

Query: 804  EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983
            EDGPV A+W N+  PVT+I++NA     VDWLNYDGI+VNSLEAR + SCF+QEPFKSLP
Sbjct: 175  EDGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNYDGIIVNSLEARGVVSCFVQEPFKSLP 234

Query: 984  LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163
            LIWTI E TLATRLR YN +G+  L+NDWKKVFNRAT VVFPNYVLPMIYS  D+GNYFV
Sbjct: 235  LIWTIPEGTLATRLRQYNLTGKIELVNDWKKVFNRATAVVFPNYVLPMIYSTFDSGNYFV 294

Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343
            IPGSP +AWE DN  A ++D+ R  +GYG DDFVIA+V SQF Y+ LWLEHALIL+A+ P
Sbjct: 295  IPGSPAQAWEVDNFMASHRDSPRVKMGYGPDDFVIALVRSQFLYKGLWLEHALILQALLP 354

Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIG-GDADNTL 1520
            L   FP  NN NSHLKI+I SG+S +NY VAVEAI+L L+YP+GVVKHIAI  G+ADN L
Sbjct: 355  LVAEFPVDNNSNSHLKILITSGNSANNYSVAVEAIALKLRYPKGVVKHIAIDVGEADNVL 414

Query: 1521 SAADLVIYGSFLEE 1562
            +AAD+VIYGSFLEE
Sbjct: 415  AAADIVIYGSFLEE 428


>emb|CBI40456.3| unnamed protein product [Vitis vinifera]
          Length = 1026

 Score =  506 bits (1304), Expect = e-141
 Identities = 269/434 (61%), Positives = 313/434 (72%), Gaps = 3/434 (0%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449
            MGSLE G+  KRD  L                               DYLQW+CT     
Sbjct: 1    MGSLENGVPVKRDPLL-----RSSSNKGSAFQRPIVRFSRFLFFGKLDYLQWVCTVAVFC 55

Query: 450  XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623
                     LPG +MEKSG +L   E   GDL+F K I GLDF E IRF+P K+L +F+K
Sbjct: 56   FFVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKNIGGLDFGEGIRFEPSKLLQKFQK 115

Query: 624  EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803
            EA E NLSS  +R R R+GYRKPQLALVF DLLVDP Q+LMVTVA+AL E+GY IQVYSL
Sbjct: 116  EADEVNLSSA-SRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQVYSL 174

Query: 804  EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983
            EDGPV A+W N+  PVT+I++NA     VDWLNYDGI+VNSLEAR + SCF+QEPFKSLP
Sbjct: 175  EDGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNYDGIIVNSLEARGVVSCFVQEPFKSLP 234

Query: 984  LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163
            LIWTI E TLATRLR YN +G+  L+NDWKKVFNRAT VVFPNYVLPMIYS  D+GNYFV
Sbjct: 235  LIWTIPEGTLATRLRQYNLTGKIELVNDWKKVFNRATAVVFPNYVLPMIYSTFDSGNYFV 294

Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343
            IPGSP +AWE DN  A ++D+ R  +GYG DDFVIA+V SQF Y+ LWLEHALIL+A+ P
Sbjct: 295  IPGSPAQAWEVDNFMASHRDSPRVKMGYGPDDFVIALVRSQFLYKGLWLEHALILQALLP 354

Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIG-GDADNTL 1520
            L   FP  NN NSHLKI+I SG+S +NY VAVEAI+L L+YP+GVVKHIAI  G+ADN L
Sbjct: 355  LVAEFPVDNNSNSHLKILITSGNSANNYSVAVEAIALKLRYPKGVVKHIAIDVGEADNVL 414

Query: 1521 SAADLVIYGSFLEE 1562
            +AAD+VIYGSFLEE
Sbjct: 415  AAADIVIYGSFLEE 428


>emb|CAN69310.1| hypothetical protein VITISV_003086 [Vitis vinifera]
          Length = 1040

 Score =  497 bits (1280), Expect = e-138
 Identities = 269/448 (60%), Positives = 314/448 (70%), Gaps = 17/448 (3%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449
            MGSLE G+  KRD  L                               DYLQW+CT     
Sbjct: 1    MGSLENGVPVKRDPLL-----RSSSNKGSAFQRPIVRFSRFLFFGKLDYLQWVCTVAVFC 55

Query: 450  XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623
                     LPG +MEKSG +L   E   GDL+F K+I GLDF E IRF+P K+L +F+K
Sbjct: 56   FFVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKKIGGLDFGEGIRFEPSKLLQKFQK 115

Query: 624  EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQ---- 791
            EA E NLSS  +R R R+GYRKPQLALVF DLLVDP Q+LMVTVA+AL E+GY IQ    
Sbjct: 116  EADEVNLSSA-SRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQALPY 174

Query: 792  ----------VYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARD 941
                      VYSLEDGPV A+W N+  PVT+I++NA     VDWLNYDGI+VNSLEAR 
Sbjct: 175  LVSIYVAWIQVYSLEDGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNYDGIIVNSLEARG 234

Query: 942  IFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVL 1121
            + SCF+QEPFKSLPLIWTI E TLATRLR YN +G+  L+NDWKKVFNRAT VVFPNYVL
Sbjct: 235  VVSCFVQEPFKSLPLIWTIPEGTLATRLRQYNLTGKIELVNDWKKVFNRATAVVFPNYVL 294

Query: 1122 PMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRD 1301
            PMIYS  D+GNYFVIPGSP +AWE DN  A ++D+ R  +GYG DDFVIA+V SQF Y+ 
Sbjct: 295  PMIYSTFDSGNYFVIPGSPAQAWEVDNFMASHRDSPRVKMGYGPDDFVIALVRSQFLYKG 354

Query: 1302 LWLEHALILKAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVV 1481
            LWLEHALIL+A+ PL   FP  NN NSHLKI+I SG+S +NY VAVEAI+L L+YP+GVV
Sbjct: 355  LWLEHALILQALLPLVAEFPVDNNSNSHLKILITSGNSANNYSVAVEAIALKLRYPKGVV 414

Query: 1482 KHIAIG-GDADNTLSAADLVIYGSFLEE 1562
            KHIAI  G+ADN L+AAD+VIYGSFLEE
Sbjct: 415  KHIAIDVGEADNVLAAADIVIYGSFLEE 442


>ref|XP_007051668.1| Glycosyl transferase family 1 protein isoform 2 [Theobroma cacao]
            gi|508703929|gb|EOX95825.1| Glycosyl transferase family 1
            protein isoform 2 [Theobroma cacao]
          Length = 686

 Score =  495 bits (1274), Expect = e-137
 Identities = 259/436 (59%), Positives = 313/436 (71%), Gaps = 5/436 (1%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449
            MGSLE+G+S KR                                   DYLQWICT     
Sbjct: 1    MGSLESGISLKR--------AGSRNERNPFLNRPRSRFSRFLLFKKLDYLQWICTVVVFL 52

Query: 450  XXXXXXXXXLPGSVMEKS--GLALTKEIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFE 620
                     LPGSVM+KS       K++V G+L + KE+ GLDF EDIR +P K+L +F+
Sbjct: 53   FFVVFFQMYLPGSVMDKSQDSFLEDKDLVYGELRYLKEMGGLDFGEDIRLEPRKLLEKFQ 112

Query: 621  KEAREANL--SSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQV 794
            +E +  NL  SS FNR++ R+ YRKPQLALVFADLLVDP Q+LMVT+A AL+EIGYAIQV
Sbjct: 113  RENKVLNLESSSGFNRSQHRFQYRKPQLALVFADLLVDPQQLLMVTIATALREIGYAIQV 172

Query: 795  YSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFK 974
            YSLEDGPV  VW ++ +PV+++Q N++  I VDWLNYDGILV+SLEA+ +FS FMQEPFK
Sbjct: 173  YSLEDGPVHNVWQSIGVPVSVLQVNSNE-IGVDWLNYDGILVSSLEAKGVFSSFMQEPFK 231

Query: 975  SLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGN 1154
            S+PLIWTIHERTLA R R + SSGQ  L+N+WKKVF+RATVVVFPNY LPMIYS  D GN
Sbjct: 232  SIPLIWTIHERTLAVRSRQFTSSGQIELVNNWKKVFSRATVVVFPNYALPMIYSAFDTGN 291

Query: 1155 YFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKA 1334
            Y+VIPGSP EAW+ +NA  +YKDN R  +GYG D+ +IAIVGSQF YR LWLEHA++L+A
Sbjct: 292  YYVIPGSPAEAWKGENAMNLYKDNQRVKMGYGPDEVLIAIVGSQFMYRGLWLEHAIVLQA 351

Query: 1335 IYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADN 1514
            + PLFT F    N NSH KIIILSGDSTSNY +AVE I+ NL+YP GVVKH+A+ GD D+
Sbjct: 352  LLPLFTDFSSDTNSNSHPKIIILSGDSTSNYSMAVERITHNLKYPSGVVKHVAVDGDVDS 411

Query: 1515 TLSAADLVIYGSFLEE 1562
             LS  D+VIYGSFLEE
Sbjct: 412  VLSMTDIVIYGSFLEE 427


>ref|XP_007051667.1| Glycosyl transferase family 1 protein isoform 1 [Theobroma cacao]
            gi|508703928|gb|EOX95824.1| Glycosyl transferase family 1
            protein isoform 1 [Theobroma cacao]
          Length = 1026

 Score =  495 bits (1274), Expect = e-137
 Identities = 259/436 (59%), Positives = 313/436 (71%), Gaps = 5/436 (1%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449
            MGSLE+G+S KR                                   DYLQWICT     
Sbjct: 1    MGSLESGISLKR--------AGSRNERNPFLNRPRSRFSRFLLFKKLDYLQWICTVVVFL 52

Query: 450  XXXXXXXXXLPGSVMEKS--GLALTKEIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFE 620
                     LPGSVM+KS       K++V G+L + KE+ GLDF EDIR +P K+L +F+
Sbjct: 53   FFVVFFQMYLPGSVMDKSQDSFLEDKDLVYGELRYLKEMGGLDFGEDIRLEPRKLLEKFQ 112

Query: 621  KEAREANL--SSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQV 794
            +E +  NL  SS FNR++ R+ YRKPQLALVFADLLVDP Q+LMVT+A AL+EIGYAIQV
Sbjct: 113  RENKVLNLESSSGFNRSQHRFQYRKPQLALVFADLLVDPQQLLMVTIATALREIGYAIQV 172

Query: 795  YSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFK 974
            YSLEDGPV  VW ++ +PV+++Q N++  I VDWLNYDGILV+SLEA+ +FS FMQEPFK
Sbjct: 173  YSLEDGPVHNVWQSIGVPVSVLQVNSNE-IGVDWLNYDGILVSSLEAKGVFSSFMQEPFK 231

Query: 975  SLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGN 1154
            S+PLIWTIHERTLA R R + SSGQ  L+N+WKKVF+RATVVVFPNY LPMIYS  D GN
Sbjct: 232  SIPLIWTIHERTLAVRSRQFTSSGQIELVNNWKKVFSRATVVVFPNYALPMIYSAFDTGN 291

Query: 1155 YFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKA 1334
            Y+VIPGSP EAW+ +NA  +YKDN R  +GYG D+ +IAIVGSQF YR LWLEHA++L+A
Sbjct: 292  YYVIPGSPAEAWKGENAMNLYKDNQRVKMGYGPDEVLIAIVGSQFMYRGLWLEHAIVLQA 351

Query: 1335 IYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADN 1514
            + PLFT F    N NSH KIIILSGDSTSNY +AVE I+ NL+YP GVVKH+A+ GD D+
Sbjct: 352  LLPLFTDFSSDTNSNSHPKIIILSGDSTSNYSMAVERITHNLKYPSGVVKHVAVDGDVDS 411

Query: 1515 TLSAADLVIYGSFLEE 1562
             LS  D+VIYGSFLEE
Sbjct: 412  VLSMTDIVIYGSFLEE 427


>ref|XP_002301386.2| glycosyltransferase family protein [Populus trichocarpa]
            gi|550345174|gb|EEE80659.2| glycosyltransferase family
            protein [Populus trichocarpa]
          Length = 984

 Score =  494 bits (1273), Expect = e-137
 Identities = 262/438 (59%), Positives = 305/438 (69%), Gaps = 7/438 (1%)
 Frame = +3

Query: 270  MGSLETG-LSFKRD-HFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXX 443
            MGSLE+G +SFKRD + L+                              DY+QWICT   
Sbjct: 1    MGSLESGGISFKRDSNNLIRSHSAGRTERNPFLYRPRSRLSRFLLFKKLDYIQWICTVAV 60

Query: 444  XXXXXXXXXXXLPGSVMEKSGLALTK----EIVSGDLTFPKEIAGLDFKEDIRFKP-KIL 608
                       LPGSV+EKS L  +     E+V+ DL + KEI GLDF EDI+F+P KIL
Sbjct: 61   FLFFVVLFQMFLPGSVVEKSELGSSPWRGMELVNKDLLYLKEIGGLDFGEDIKFEPSKIL 120

Query: 609  ARFEKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAI 788
             +F KE RE N+  T N T  R+ YRKPQLALVFADLLVDP Q+LMVTVA ALQEIGY I
Sbjct: 121  QKFRKENREMNMPFT-NGTLSRFPYRKPQLALVFADLLVDPQQLLMVTVATALQEIGYTI 179

Query: 789  QVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEP 968
             VY+L DGPV+ +W ++  PVT+IQ +  + I VDWLNYDGILVNSLE R + SCFMQEP
Sbjct: 180  HVYTLRDGPVQNIWKSMGYPVTIIQMSHKLEIAVDWLNYDGILVNSLETRSVISCFMQEP 239

Query: 969  FKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDA 1148
            FKS+PLIWTIHER LA R R Y SS Q  LLNDW+K FNRATVVVFPN+VLPM+YS  DA
Sbjct: 240  FKSVPLIWTIHERALAIRSRQYTSSWQIELLNDWRKAFNRATVVVFPNHVLPMMYSAFDA 299

Query: 1149 GNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALIL 1328
            GNY+VIPGSP E WEAD   A+Y D++R  +GY   D VIA+VGSQF YR LWLEHAL+L
Sbjct: 300  GNYYVIPGSPAEVWEADTTMALYNDDIRVKMGYEPTDIVIAVVGSQFLYRGLWLEHALVL 359

Query: 1329 KAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDA 1508
            KA+ PL   FP  +N  SHLKII+LSGDST NY  AVEAI++NL YPRG VKH A+ GD 
Sbjct: 360  KALLPLLQDFPLDSNSISHLKIIVLSGDSTGNYSAAVEAIAVNLSYPRGTVKHFAVDGDV 419

Query: 1509 DNTLSAADLVIYGSFLEE 1562
             + LSA DLVIYGSFLEE
Sbjct: 420  SSALSAVDLVIYGSFLEE 437


>ref|XP_002320170.1| glycosyltransferase family protein [Populus trichocarpa]
            gi|222860943|gb|EEE98485.1| glycosyltransferase family
            protein [Populus trichocarpa]
          Length = 990

 Score =  494 bits (1271), Expect = e-137
 Identities = 263/438 (60%), Positives = 302/438 (68%), Gaps = 7/438 (1%)
 Frame = +3

Query: 270  MGSLETG-LSFKRD-HFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXX 443
            MGSLETG +SFKRD + L+                              DY+QWICT   
Sbjct: 1    MGSLETGGISFKRDKNTLIRSYSAGRTERHPFLYRPRSSFSRFLRFKKLDYIQWICTVAV 60

Query: 444  XXXXXXXXXXXLPGSVMEKSGLALTK----EIVSGDLTFPKEIAGLDFKEDIRFKP-KIL 608
                       LPGSV+EKS L  +     E+V  DL + KEI GLDF EDI+F+P KIL
Sbjct: 61   FLFFVVLFQMFLPGSVVEKSELGSSPWRGMELVDKDLWYLKEIGGLDFGEDIKFQPSKIL 120

Query: 609  ARFEKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAI 788
              F KE RE N+S + NRT  R+ YRKPQLALVFADLLVDPHQ+LMVTVA ALQEIGY I
Sbjct: 121  QHFRKENREMNMSFS-NRTLSRFPYRKPQLALVFADLLVDPHQLLMVTVATALQEIGYTI 179

Query: 789  QVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEP 968
             VYSL DGP +++W ++R PV +IQ +  M I VDWLNYDGILVNSLE + +FSCFMQEP
Sbjct: 180  HVYSLGDGPAQSIWKSMRSPVNIIQISHKMEIAVDWLNYDGILVNSLETKSVFSCFMQEP 239

Query: 969  FKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDA 1148
            FKS+PLIWTI+ERTLAT  R Y SS Q  LL DW+K FNRATVVVFPN+VLPM+YS  D 
Sbjct: 240  FKSVPLIWTINERTLATHSRQYTSSWQIELLYDWRKAFNRATVVVFPNHVLPMMYSAFDT 299

Query: 1149 GNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALIL 1328
            GNY+VIPGSP + WE +   A+Y D +   +GY  DD VIAIVGSQF YR LWLEHAL+L
Sbjct: 300  GNYYVIPGSPADIWETETTMALYNDEIHVKMGYEPDDIVIAIVGSQFLYRGLWLEHALVL 359

Query: 1329 KAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDA 1508
            KA+ PLF  F   NN  SHLKIIILSGD T NY VAVEAI+ NL YPRG VKH A+  D 
Sbjct: 360  KALLPLFAEFSLDNNSKSHLKIIILSGDPTGNYSVAVEAIAANLSYPRGTVKHFAVDDDV 419

Query: 1509 DNTLSAADLVIYGSFLEE 1562
             + L AADLVIYGSFLEE
Sbjct: 420  GSPLGAADLVIYGSFLEE 437


>ref|XP_007220285.1| hypothetical protein PRUPE_ppa000692mg [Prunus persica]
            gi|462416747|gb|EMJ21484.1| hypothetical protein
            PRUPE_ppa000692mg [Prunus persica]
          Length = 1034

 Score =  489 bits (1260), Expect = e-135
 Identities = 256/433 (59%), Positives = 309/433 (71%), Gaps = 2/433 (0%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449
            MGSLE+G+  KRD  LL                              DYLQWICT     
Sbjct: 1    MGSLESGVPLKRDP-LLRSSSTGRTERHPFLQRPRSKFSRFLLIKKLDYLQWICTVAVFL 59

Query: 450  XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623
                     LPGSV+EKS + +   E+ S DL F KE+  LDF EDIRF+P K+L +F+K
Sbjct: 60   FFVVLFQMFLPGSVVEKSRVLMKNVELNSEDLRFLKELGLLDFGEDIRFEPSKLLEKFQK 119

Query: 624  EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803
            EAREA+L+S  NRTRQ +GYRKPQLALVFADL V   Q+LMVTVAAALQEIGYA  VYSL
Sbjct: 120  EAREASLTSAMNRTRQHFGYRKPQLALVFADLSVASQQLLMVTVAAALQEIGYAFSVYSL 179

Query: 804  EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983
            EDGPV  VW +L +PVT+IQT     +N+DWLNYDGILVNSLEA+ IFSCF+QEPFKSLP
Sbjct: 180  EDGPVHDVWRSLGVPVTIIQTYDQSELNIDWLNYDGILVNSLEAKGIFSCFVQEPFKSLP 239

Query: 984  LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163
            ++WTIHE+ LATR R Y+S+ Q  L NDWK++F+R+TVVVFPNY LPM YS  DAGN+FV
Sbjct: 240  ILWTIHEQALATRSRKYSSNRQIELFNDWKRLFSRSTVVVFPNYFLPMAYSVFDAGNFFV 299

Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343
            IPGSP EA +AD+   + K++L   +GYG +D VI IVGSQF YR LWLEH+++L+A+ P
Sbjct: 300  IPGSPAEACKADSIMVLDKNHLLAKMGYGSEDVVITIVGSQFLYRGLWLEHSIVLRAVLP 359

Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLS 1523
            L   FP  NN  SHLKII+LSGDSTSNY   VEAI+ NL+YP G+VKH+A+   AD+ LS
Sbjct: 360  LLEDFPLDNNSYSHLKIIVLSGDSTSNYSSVVEAIAYNLKYPSGIVKHVAVDMAADSVLS 419

Query: 1524 AADLVIYGSFLEE 1562
             +D+VIYGSFLEE
Sbjct: 420  ISDVVIYGSFLEE 432


>ref|XP_002511940.1| transferase, transferring glycosyl groups, putative [Ricinus
            communis] gi|223549120|gb|EEF50609.1| transferase,
            transferring glycosyl groups, putative [Ricinus communis]
          Length = 935

 Score =  474 bits (1220), Expect(2) = e-133
 Identities = 237/362 (65%), Positives = 289/362 (79%), Gaps = 3/362 (0%)
 Frame = +3

Query: 477  LPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEKEAREANL-S 647
            LPGS+++KS ++L K EIV GDL + K +  LDF ED++F+P K+L +F+KE RE NL S
Sbjct: 18   LPGSMIDKSEVSLKKLEIVPGDLLYLKAMGTLDFGEDVQFQPLKLLEKFQKENREVNLTS 77

Query: 648  STFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSLEDGPVRAV 827
            S FNRT  R+GYRKPQLALVFADLL DP Q+LMVTVA ALQEIGYAIQV+S+ DGPV  +
Sbjct: 78   SAFNRTLLRFGYRKPQLALVFADLLADPQQLLMVTVATALQEIGYAIQVFSVNDGPVHDI 137

Query: 828  WGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLPLIWTIHER 1007
            W  + +PVT+ QTN  M I VDWL +D I+VNSLEA+ +F CFMQEPFKS+PLIWTIHE+
Sbjct: 138  WKRIGVPVTIFQTNHKMEIAVDWLIFDSIIVNSLEAKVVFPCFMQEPFKSIPLIWTIHEK 197

Query: 1008 TLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFVIPGSPTEA 1187
            TL  R R Y S+GQ  L++DWK+VFNRATVVVFPN+VLPM+YS  DA NY+VIPGSP E 
Sbjct: 198  TLGIRSRQYISNGQIELVSDWKRVFNRATVVVFPNHVLPMMYSAFDAENYYVIPGSPAEV 257

Query: 1188 WEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYPLFTMFPFY 1367
            WEA+   A+YKD++R  +GY  DD +IAIVGSQF YR LWLEHALIL+A+ PLF+ F F 
Sbjct: 258  WEAEAMAAVYKDSIRMKMGYRPDDIIIAIVGSQFLYRGLWLEHALILQALSPLFSDFSFD 317

Query: 1368 NNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLSAADLVIYG 1547
            +N N HLKII+LSG+STSNY VA+EAI++NL YP G VKHIAI GD  + L+AAD+V YG
Sbjct: 318  DNSNPHLKIIVLSGNSTSNYSVAIEAIAINLHYPIGAVKHIAIDGDVGSFLTAADIVTYG 377

Query: 1548 SF 1553
            SF
Sbjct: 378  SF 379



 Score = 29.3 bits (64), Expect(2) = e-133
 Identities = 10/15 (66%), Positives = 13/15 (86%)
 Frame = +1

Query: 349 DTHFYRDPDRDSHVF 393
           DTHF +DPD+D H+F
Sbjct: 3   DTHFCKDPDQDFHMF 17


>gb|EXB52710.1| hypothetical protein L484_022487 [Morus notabilis]
          Length = 1040

 Score =  477 bits (1227), Expect = e-132
 Identities = 250/435 (57%), Positives = 300/435 (68%), Gaps = 4/435 (0%)
 Frame = +3

Query: 270  MGSLETGLS--FKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXX 443
            MGSLE G +  FKRD FL                               DYLQWICT   
Sbjct: 1    MGSLEGGSATPFKRDPFLRSASFTGRSDRNPFLQRQRSRFSRFFLFKKLDYLQWICTVAV 60

Query: 444  XXXXXXXXXXXLPGSVMEKS-GLALTKEIVSGDLTFPKEIAGLDFKEDIRFKP-KILARF 617
                       LPGSV+EKS      +E  SGDL F KE   LDF EDIRF+P K+L +F
Sbjct: 61   FLFFVVLFQMFLPGSVVEKSIKTHRDEEFSSGDLFFLKEYGILDFGEDIRFEPSKVLEKF 120

Query: 618  EKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVY 797
             +E +E NLS  FNR+R RY ++KPQLALVFADLLVD  Q+LMVTVAAALQEIGY IQVY
Sbjct: 121  RRENKEVNLSHAFNRSRLRYPHKKPQLALVFADLLVDSQQLLMVTVAAALQEIGYEIQVY 180

Query: 798  SLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKS 977
            SLE GPV  +W NL +PV++IQ      + VDWL YDGILVNS EA+D+FSCF+QEPFKS
Sbjct: 181  SLEGGPVHGIWRNLGVPVSIIQACDPADVTVDWLIYDGILVNSFEAKDMFSCFVQEPFKS 240

Query: 978  LPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNY 1157
            LPL+WTIH+R LATR R+Y S+ Q  LLNDWK+ FNR+TVVVFPNYVLPMIYS  D+GN+
Sbjct: 241  LPLVWTIHDRALATRSRNYTSNKQIELLNDWKRAFNRSTVVVFPNYVLPMIYSTFDSGNF 300

Query: 1158 FVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAI 1337
            FVIPGSP EAW+ +      KD LR  +GYG +D VI IVGS+  YR LWLEH+++L+A+
Sbjct: 301  FVIPGSPAEAWKIETLMESEKDYLRAKMGYGHEDIVITIVGSELLYRGLWLEHSIVLQAL 360

Query: 1338 YPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNT 1517
            +PL   F    N  SHLKII+LSGD TSNY  AVEAI+LNL+YP G+V H+ +  +ADN 
Sbjct: 361  FPLLEDFSSDENSFSHLKIIVLSGDPTSNYSSAVEAIALNLKYPNGIVNHVPMDAEADNV 420

Query: 1518 LSAADLVIYGSFLEE 1562
            L+A+D+VIYGS +EE
Sbjct: 421  LTASDVVIYGSSVEE 435


>ref|XP_004159777.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101212216 [Cucumis
            sativus]
          Length = 1037

 Score =  470 bits (1209), Expect = e-130
 Identities = 250/433 (57%), Positives = 296/433 (68%), Gaps = 2/433 (0%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449
            MGSLE G   KRD  LL                              DYLQWICT     
Sbjct: 1    MGSLENGFPLKRDP-LLRSSSSVRGERYPFLQRPRSRFSRFLFFRKIDYLQWICTVAVFF 59

Query: 450  XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623
                     LPGSV+EKS +AL   E   GDL F KE+  LDF EDIRF+P K+L +F+K
Sbjct: 60   FFVVLFQMFLPGSVVEKSEVALKDVEKSLGDLKFLKELGMLDFGEDIRFEPSKLLGKFKK 119

Query: 624  EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803
            EAREA+ SS FNRTR R+GYRKPQLALVF+DLLVD +Q+LMVT+A+ALQEIGY  QVYSL
Sbjct: 120  EAREADFSS-FNRTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSL 178

Query: 804  EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983
            + GP   VW  + +PVTLIQ+     + VDWLNYDGILV+SL  +D+FSC++QEPFKSLP
Sbjct: 179  QGGPANDVWRQMGVPVTLIQSCDETEVMVDWLNYDGILVHSLGVKDVFSCYLQEPFKSLP 238

Query: 984  LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163
            LIWTIHE  LA R ++Y S G   +LNDWK+VFN +TVVVFPNYV+PMIYS  D+GN+FV
Sbjct: 239  LIWTIHEEALAIRSQNYASDGLLDILNDWKRVFNHSTVVVFPNYVMPMIYSAYDSGNFFV 298

Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343
            IP  P EA EA+       DNLR  +GY  DD VIAIVGSQF YR +WLEHA++L+A+ P
Sbjct: 299  IPSFPAEALEAEIDVTSDADNLRAKMGYANDDLVIAIVGSQFLYRGMWLEHAMVLQAMLP 358

Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLS 1523
            L   F FY + NS LKI +LSGDS SNY +AVEAI+  L+YPR VVKH  +  D+D  LS
Sbjct: 359  LLHEFSFYEHSNSRLKIFVLSGDSNSNYTMAVEAIAQRLEYPRSVVKHFPVAADSDKALS 418

Query: 1524 AADLVIYGSFLEE 1562
             ADLVIYGS LEE
Sbjct: 419  MADLVIYGSCLEE 431


>ref|XP_004138457.1| PREDICTED: uncharacterized protein LOC101212216 [Cucumis sativus]
          Length = 1037

 Score =  470 bits (1209), Expect = e-130
 Identities = 250/433 (57%), Positives = 296/433 (68%), Gaps = 2/433 (0%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449
            MGSLE G   KRD  LL                              DYLQWICT     
Sbjct: 1    MGSLENGFPLKRDP-LLRSSSSVRGERYPFLQRPRSRFSRFLFFRKIDYLQWICTVAVFF 59

Query: 450  XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623
                     LPGSV+EKS +AL   E   GDL F KE+  LDF EDIRF+P K+L +F+K
Sbjct: 60   FFVVLFQMFLPGSVVEKSEVALKDVEKSLGDLKFLKELGMLDFGEDIRFEPSKLLGKFKK 119

Query: 624  EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803
            EAREA+ SS FNRTR R+GYRKPQLALVF+DLLVD +Q+LMVT+A+ALQEIGY  QVYSL
Sbjct: 120  EAREADFSS-FNRTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSL 178

Query: 804  EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983
            + GP   VW  + +PVTLIQ+     + VDWLNYDGILV+SL  +D+FSC++QEPFKSLP
Sbjct: 179  QGGPANDVWRQMGVPVTLIQSCDETEVMVDWLNYDGILVHSLGVKDVFSCYLQEPFKSLP 238

Query: 984  LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163
            LIWTIHE  LA R ++Y S G   +LNDWK+VFN +TVVVFPNYV+PMIYS  D+GN+FV
Sbjct: 239  LIWTIHEEALAIRSQNYASDGLLDILNDWKRVFNHSTVVVFPNYVMPMIYSAYDSGNFFV 298

Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343
            IP  P EA EA+       DNLR  +GY  DD VIAIVGSQF YR +WLEHA++L+A+ P
Sbjct: 299  IPSFPAEALEAEIDVTSDADNLRAKMGYANDDLVIAIVGSQFLYRGMWLEHAMVLQAMLP 358

Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLS 1523
            L   F FY + NS LKI +LSGDS SNY +AVEAI+  L+YPR VVKH  +  D+D  LS
Sbjct: 359  LLHEFSFYEHSNSRLKIFVLSGDSNSNYTMAVEAIAQRLEYPRSVVKHFPVAADSDKALS 418

Query: 1524 AADLVIYGSFLEE 1562
             ADLVIYGS LEE
Sbjct: 419  MADLVIYGSCLEE 431


>ref|XP_004306713.1| PREDICTED: uncharacterized protein LOC101302584 [Fragaria vesca
            subsp. vesca]
          Length = 1039

 Score =  469 bits (1208), Expect = e-129
 Identities = 259/438 (59%), Positives = 302/438 (68%), Gaps = 7/438 (1%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-DYLQWICTXXXX 446
            MGSLE+G+  KRD  L                                DYL WICT    
Sbjct: 1    MGSLESGVPLKRDPLLRSSSNGGRSSDRHLFLQRPRSRFSRFLILKKLDYLLWICTVAVF 60

Query: 447  XXXXXXXXXXLPGSVMEKSGLALTKEIVS---GDLTFPKEIAGLDFKEDIRFKP-KILAR 614
                      LPGSV+EKSG  L K+ V    GDL F KE+  LDF EDIRF+P K+L +
Sbjct: 61   LFFVVLFQMFLPGSVVEKSGSLLQKKNVELDYGDLRFVKELGLLDFGEDIRFEPSKLLEK 120

Query: 615  FEKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQV 794
            F KE REA+LSS FNRT Q +G RKPQLALVFADLL D HQ+ MVTVAAALQEIGY + V
Sbjct: 121  FRKEGREASLSSGFNRTLQHFGLRKPQLALVFADLLFDSHQLQMVTVAAALQEIGYELWV 180

Query: 795  YSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFK 974
            YSLEDGP R  W +L +PVT+IQT     I VDWLNY+GILV+SLEA+ IFSCF+QEPFK
Sbjct: 181  YSLEDGPARGAWKSLGVPVTIIQTCDQPKIVVDWLNYNGILVSSLEAKGIFSCFVQEPFK 240

Query: 975  SLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGN 1154
            SLP+IWTIHE  LATR R Y+SS Q  LLNDWK+VFNR+TVVVFPNY LPMIYS  DAGN
Sbjct: 241  SLPVIWTIHEEALATRSRKYSSSSQIELLNDWKRVFNRSTVVVFPNYFLPMIYSTLDAGN 300

Query: 1155 YFVIPGSPTEAWEADNAN--AIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALIL 1328
            +FVIPGSP EA + D+ +  A+  DNL+   G   ++ VI IVGS+F YR LWLEH+++L
Sbjct: 301  FFVIPGSPAEACKTDSDSIVALDIDNLQGSAGNEPENVVITIVGSKFLYRGLWLEHSIVL 360

Query: 1329 KAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDA 1508
            +A+ PL   F   NN +SHLKII+LSGDSTSNY   VEAI+ NL+YP G+VKH AI  DA
Sbjct: 361  RALLPLLEDFLLDNN-SSHLKIIVLSGDSTSNYSSVVEAIAYNLKYPSGIVKHAAIDVDA 419

Query: 1509 DNTLSAADLVIYGSFLEE 1562
            DN LS + LVIYGSFLEE
Sbjct: 420  DNVLSTSHLVIYGSFLEE 437


>ref|XP_006444916.1| hypothetical protein CICLE_v10018649mg [Citrus clementina]
            gi|568876282|ref|XP_006491210.1| PREDICTED:
            uncharacterized protein LOC102628793 [Citrus sinensis]
            gi|557547178|gb|ESR58156.1| hypothetical protein
            CICLE_v10018649mg [Citrus clementina]
          Length = 1038

 Score =  461 bits (1186), Expect = e-127
 Identities = 235/388 (60%), Positives = 281/388 (72%), Gaps = 4/388 (1%)
 Frame = +3

Query: 411  DYLQWICTXXXXXXXXXXXXXXLPGSVM---EKSGLALTKEIVSGDLTFPKEIAGLDFKE 581
            DYL WICT              LPGSV    E  G     + V  DL F KE+  LDF E
Sbjct: 48   DYLLWICTVAVFLFFVVIFQLFLPGSVTVMDESQGSLRDFDKVPADLMFLKEMGLLDFGE 107

Query: 582  DIRFKP-KILARFEKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVA 758
            ++ F P K++ +F+ E ++ NL+S F+R   R+GYRKPQLALVF DLL+DP Q+ MVT+A
Sbjct: 108  EVTFLPLKLMEKFQSEDKDVNLTSVFHRKLHRFGYRKPQLALVFPDLLIDPQQLQMVTIA 167

Query: 759  AALQEIGYAIQVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEAR 938
             AL+EIGYAIQVYSLEDG    VW N+ +PV ++QT       V+WLNYDGILVNSLEA+
Sbjct: 168  IALREIGYAIQVYSLEDGRAHEVWRNIGVPVAILQTGREKASFVNWLNYDGILVNSLEAK 227

Query: 939  DIFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYV 1118
             + S  MQEPFKSLPL+WTIHE TLATR R+Y SSGQ  LLNDWKKVFNRATVVVFP+YV
Sbjct: 228  VVISNIMQEPFKSLPLVWTIHEGTLATRARNYASSGQLELLNDWKKVFNRATVVVFPDYV 287

Query: 1119 LPMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYR 1298
            LPM+YS  DAGNY+VIPGSP +AWEAD    +Y D +R  +G+  DD VIAIVG+QF YR
Sbjct: 288  LPMMYSAFDAGNYYVIPGSPAKAWEADTNMDLYNDTVRVKMGFKPDDLVIAIVGTQFMYR 347

Query: 1299 DLWLEHALILKAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGV 1478
             LWLEHALIL+A+ PLF+     N  NS +K++ILSGDSTSNY V +EAI+ NL YP GV
Sbjct: 348  GLWLEHALILRALLPLFSEVSVENESNSPIKVMILSGDSTSNYSVVIEAIAHNLHYPLGV 407

Query: 1479 VKHIAIGGDADNTLSAADLVIYGSFLEE 1562
            VKHIA  GD D+ L+ AD+VIYGSFLEE
Sbjct: 408  VKHIAAEGDVDSVLNTADVVIYGSFLEE 435


>ref|XP_006339650.1| PREDICTED: uncharacterized protein LOC102591393 [Solanum tuberosum]
          Length = 1038

 Score =  454 bits (1168), Expect = e-125
 Identities = 237/433 (54%), Positives = 299/433 (69%), Gaps = 2/433 (0%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449
            MGSLE G+S K+D  LL                              +YLQWICT     
Sbjct: 1    MGSLENGVSLKKDQNLLRSSSATGRNVFGQRQVRSRFARFLFVKKI-NYLQWICTVAVFF 59

Query: 450  XXXXXXXXXLPGSVMEKSG-LALTKEIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623
                     LPGSVMEKSG L    E+  GDL   KE+ GLDF EDI+F+P K+LA+F  
Sbjct: 60   FFVVLFQMLLPGSVMEKSGNLTQDSEVGYGDLALLKELGGLDFGEDIKFEPLKLLAKFHD 119

Query: 624  EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803
            EA EAN  +  +RT  R+GYRKP+LALVFA+LLVDP+QI+MV VAAAL+EIGY I+V SL
Sbjct: 120  EAVEAN-GTVASRTVVRFGYRKPKLALVFANLLVDPYQIMMVNVAAALREIGYEIEVLSL 178

Query: 804  EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983
            EDGPVR++W ++ +PV ++ T+    I++DWLNYDG+LVNSLEA ++ SC MQEPFK++P
Sbjct: 179  EDGPVRSIWKDVGVPVIIMNTDGHTKISLDWLNYDGLLVNSLEAVNVLSCVMQEPFKNVP 238

Query: 984  LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163
            L+WTI+E TLA+RL+ Y SSGQ   +++W+KVF+RA VVVFPNY+LP+ YS CDAGNYFV
Sbjct: 239  LVWTINELTLASRLKQYISSGQNDFVDNWRKVFSRANVVVFPNYILPIGYSVCDAGNYFV 298

Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343
            IPGSP EAWE D+  A+  DNLR  + Y  +DFVI +VGS   Y+ LWLE AL+L+A+ P
Sbjct: 299  IPGSPKEAWEVDSFMAVSNDNLRAKMDYAPEDFVIVVVGSHLLYKGLWLEQALVLQALLP 358

Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLS 1523
            +F       N NSH KI++L+  S +NY VAVEAI+ NL+YP G+VKHIA   D + TLS
Sbjct: 359  VFPELTNDGNSNSHFKIVVLTEGSNTNYSVAVEAIARNLRYPEGMVKHIAPAEDTERTLS 418

Query: 1524 AADLVIYGSFLEE 1562
             ADLVIY SF EE
Sbjct: 419  VADLVIYASFREE 431


>ref|XP_004229962.1| PREDICTED: uncharacterized protein LOC101246380 [Solanum
            lycopersicum]
          Length = 1038

 Score =  452 bits (1163), Expect = e-124
 Identities = 235/433 (54%), Positives = 299/433 (69%), Gaps = 2/433 (0%)
 Frame = +3

Query: 270  MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449
            MGSLE G+S K+D  LL                              +YLQWICT     
Sbjct: 1    MGSLENGVSLKKDQNLLRSSSATGRNAFGQRQVRSRFARFLFVKKI-NYLQWICTVAVFF 59

Query: 450  XXXXXXXXXLPGSVMEKSG-LALTKEIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623
                     LPGSVMEKSG L L  E+  GDL   KE+ GLDF EDI+F+P K+LA+F +
Sbjct: 60   FFVVLFQMLLPGSVMEKSGNLTLDSEVGYGDLALLKELGGLDFGEDIKFEPLKLLAKFRE 119

Query: 624  EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803
            EA EAN  +  +R   R+GYRKP+LALVF++L VDP+QI+MV VAAAL+EIGY I+V SL
Sbjct: 120  EAVEAN-GTVASRIVVRFGYRKPKLALVFSNLSVDPYQIMMVNVAAALREIGYEIEVLSL 178

Query: 804  EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983
            EDGPVR++W ++ +PV ++ T+    I++DWLNYDG+LVNSLEA ++ SC MQEPFK++P
Sbjct: 179  EDGPVRSIWKDIGVPVIIMNTDGHTKISLDWLNYDGLLVNSLEAVNVLSCVMQEPFKNVP 238

Query: 984  LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163
            L+WTI+E TLA+RL+ Y SSGQ   +++W+KVF+RA VVVFPNY+LP+ YS CDAGNYFV
Sbjct: 239  LVWTINELTLASRLKQYMSSGQNDFVDNWRKVFSRANVVVFPNYILPIGYSVCDAGNYFV 298

Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343
            IPGSP EAWE D   A+  D+LR  + Y  +DFVI +VGSQ  Y+ LWLE AL+L+A+ P
Sbjct: 299  IPGSPKEAWEVDTFMAVSNDDLRAKMDYAAEDFVIVVVGSQLLYKGLWLEQALVLQALLP 358

Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLS 1523
            +F       N NSH KI++L+  S +NY VAVEAI+ NL+YP G+VKHIA   D + TLS
Sbjct: 359  VFPELMNDGNSNSHFKIVVLTEGSNTNYSVAVEAIARNLRYPEGMVKHIAPAEDTERTLS 418

Query: 1524 AADLVIYGSFLEE 1562
             ADLVIY SF EE
Sbjct: 419  VADLVIYASFREE 431


>ref|XP_006396290.1| hypothetical protein EUTSA_v10028385mg [Eutrema salsugineum]
            gi|557097307|gb|ESQ37743.1| hypothetical protein
            EUTSA_v10028385mg [Eutrema salsugineum]
          Length = 1022

 Score =  424 bits (1089), Expect = e-116
 Identities = 217/389 (55%), Positives = 278/389 (71%), Gaps = 5/389 (1%)
 Frame = +3

Query: 411  DYLQWICTXXXXXXXXXXXXXXLPGSVMEKSGLALT-KEIVSGDLTFPKEIAGLDFKEDI 587
            DYLQWICT              LPG V++KS    + KE +  DL   KE    DF ED+
Sbjct: 44   DYLQWICTMGVFFFFVVLFQMFLPGLVIDKSDKPWSNKEFLPPDLVVFKERGFFDFGEDV 103

Query: 588  RFKP-KILARFEKEAREANL-SSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAA 761
            R +P K+L +F++E    N  SS+ N T QR+G+RKP+LALVFADLL DP Q+LMVTV+ 
Sbjct: 104  RLEPTKLLMKFQRETNALNFTSSSLNTTLQRFGFRKPKLALVFADLLADPEQLLMVTVSK 163

Query: 762  ALQEIGYAIQVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARD 941
            AL EIGYA++VYSLEDGPV  +W N+ + VT+++TN +    +DWL+YDG++VNSLEAR 
Sbjct: 164  ALLEIGYAVEVYSLEDGPVHGIWQNMGVSVTILETNHASSCVIDWLSYDGVIVNSLEARS 223

Query: 942  IFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVL 1121
            +F+CFMQEPFKSLPL+W I+E TLA R R YNS+GQT LL DWKK+F+RA+VVVF NY+L
Sbjct: 224  MFTCFMQEPFKSLPLVWVINEETLAVRSRQYNSTGQTELLTDWKKIFSRASVVVFHNYLL 283

Query: 1122 PMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRD 1301
            P++YS  DAGN++VIPGSP EAW+A N +   K           DD VI+IVGSQF Y+ 
Sbjct: 284  PILYSEFDAGNFYVIPGSPEEAWKAKNLDIPRK-----------DDMVISIVGSQFLYKG 332

Query: 1302 LWLEHALILKAIYPLFTMFPFYNNP--NSHLKIIILSGDSTSNYDVAVEAISLNLQYPRG 1475
             WLEHAL+L+A+ PLF+    YN+   NS LKII+L G+S SNY VA+E IS NL YP+ 
Sbjct: 333  QWLEHALLLQALRPLFS---GYNSERYNSRLKIIVLGGESASNYSVAIETISQNLTYPKE 389

Query: 1476 VVKHIAIGGDADNTLSAADLVIYGSFLEE 1562
             VKH++I G+ D  L ++DLV+YGSFLEE
Sbjct: 390  AVKHVSIAGNVDKILESSDLVLYGSFLEE 418


>ref|XP_002872903.1| glycosyltransferase family protein 1 [Arabidopsis lyrata subsp.
            lyrata] gi|297318740|gb|EFH49162.1| glycosyltransferase
            family protein 1 [Arabidopsis lyrata subsp. lyrata]
          Length = 1018

 Score =  416 bits (1070), Expect = e-113
 Identities = 213/387 (55%), Positives = 278/387 (71%), Gaps = 3/387 (0%)
 Frame = +3

Query: 411  DYLQWICTXXXXXXXXXXXXXXLPGSVMEKSGLALT-KEIVSGDLTFPKEIAGLDFKEDI 587
            +YLQWI +              LPG V++KS    T KEI+  DL   +E   LDF +D+
Sbjct: 50   NYLQWISSICVFFFFVVLFQMFLPGLVIDKSDKPWTSKEILPPDLLGFREKGFLDFGDDV 109

Query: 588  RFKP-KILARFEKEAREANL-SSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAA 761
            RF+P K+L +F++EA   N  SS+ N T QR+G+RKP+LALVFADLL DP Q+LMV+++ 
Sbjct: 110  RFEPTKLLMKFQREANGLNFTSSSLNTTLQRFGFRKPKLALVFADLLADPEQVLMVSLSK 169

Query: 762  ALQEIGYAIQVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARD 941
            ALQEIGYAI+VYSLEDGPV ++W  + +PVT+++TN +    +DWL+YDGI+VNSL A+ 
Sbjct: 170  ALQEIGYAIEVYSLEDGPVNSIWRKMGVPVTILKTNHASSCVIDWLSYDGIIVNSLRAKS 229

Query: 942  IFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVL 1121
            +F+CFMQEPFKSLPLIW I+E TLA R R YNS GQT LLNDWKK+F+RA+VVVF NY+L
Sbjct: 230  MFTCFMQEPFKSLPLIWVINEETLAVRSRQYNSIGQTELLNDWKKIFSRASVVVFHNYLL 289

Query: 1122 PMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRD 1301
            P++Y+  DAGN++VIPGSP + W+A N     +           DD VI+IVGSQF Y+ 
Sbjct: 290  PILYTEFDAGNFYVIPGSPEDVWKAKNLEFPPQK----------DDVVISIVGSQFLYKG 339

Query: 1302 LWLEHALILKAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVV 1481
             WLEHAL+L+A+ PLF    +  +  SHLKII+L G+S SNY VA+E IS NL YP+  V
Sbjct: 340  QWLEHALLLQALRPLFP-GNYLESDTSHLKIIVLGGESASNYSVAIETISQNLTYPKDAV 398

Query: 1482 KHIAIGGDADNTLSAADLVIYGSFLEE 1562
            KH++I G+ D  L ++DLVIYGSFLEE
Sbjct: 399  KHVSIAGNVDKILESSDLVIYGSFLEE 425


>ref|XP_007135157.1| hypothetical protein PHAVU_010G105900g [Phaseolus vulgaris]
            gi|561008202|gb|ESW07151.1| hypothetical protein
            PHAVU_010G105900g [Phaseolus vulgaris]
          Length = 1034

 Score =  414 bits (1064), Expect = e-113
 Identities = 211/388 (54%), Positives = 272/388 (70%), Gaps = 4/388 (1%)
 Frame = +3

Query: 411  DYLQWICTXXXXXXXXXXXXXXLPGSVMEKSGLALTKEIVSGDLTFPK-EIAGL--DFKE 581
            DY+QWICT              LPGSV+E S  +L    +  D  F   EI  +  D  E
Sbjct: 45   DYVQWICTVVVFLCLVVVFQMFLPGSVVENSEESLKAVKMRSDNLFHYGEIQKVVSDIGE 104

Query: 582  DIRFKPKILARFEKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAA 761
            D  F P IL +F +       +  FN T Q +GYRKPQLA+VF +LLVD HQ+LMVTVA 
Sbjct: 105  DAVFLPMILEKFRRRGGGGMDAGLFNHTVQHFGYRKPQLAMVFGELLVDSHQLLMVTVAT 164

Query: 762  ALQEIGYAIQVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARD 941
            ALQEIGY IQV+SLEDGP   VW NL +P+T+ +T       VDWLNYDGI+++SLEA+ 
Sbjct: 165  ALQEIGYEIQVFSLEDGPGHNVWSNLGVPITIFRTCDKRNNTVDWLNYDGIIMSSLEAKG 224

Query: 942  IFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVL 1121
             FSCF+QEPFKS+PLIW +HE  LA R R Y ++GQ  +LNDW +VFNR+TVVVFPNY L
Sbjct: 225  AFSCFLQEPFKSIPLIWIVHENALAYRSRQYTTNGQIEILNDWGRVFNRSTVVVFPNYAL 284

Query: 1122 PMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRD 1301
            PMIYS  DAGN+FVIPGSP EA EA+   A+ KDNLR  +GYG +D ++AIVGSQF Y+ 
Sbjct: 285  PMIYSTFDAGNFFVIPGSPAEALEAEAFMALQKDNLRVNMGYGPEDVIVAIVGSQFLYKG 344

Query: 1302 LWLEHALILKAIYPLFTMFPF-YNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGV 1478
            +WL HA++L+A+ PL T FP   +N ++ L+II+ SG+ T+NY VA+E ++ +L+YPRG+
Sbjct: 345  MWLGHAIVLRALEPLVTNFPSNKDNSSAQLRIIVHSGELTNNYSVALETMAHSLKYPRGI 404

Query: 1479 VKHIAIGGDADNTLSAADLVIYGSFLEE 1562
            ++HIA   +AD+ L  AD+V+YGSFLEE
Sbjct: 405  IEHIAGDLNADSILGTADVVVYGSFLEE 432


>ref|NP_192030.4| glycosyl transferase family 1 protein [Arabidopsis thaliana]
            gi|332656594|gb|AEE81994.1| glycosyl transferase family 1
            protein [Arabidopsis thaliana]
            gi|591401974|gb|AHL38714.1| glycosyltransferase, partial
            [Arabidopsis thaliana]
          Length = 1031

 Score =  402 bits (1032), Expect = e-109
 Identities = 205/387 (52%), Positives = 273/387 (70%), Gaps = 3/387 (0%)
 Frame = +3

Query: 411  DYLQWICTXXXXXXXXXXXXXXLPGSVMEKSGLA-LTKEIVSGDLTFPKEIAGLDFKEDI 587
            +YL WI                LPG V++KS    ++KEI+  DL   +E   LDF +D+
Sbjct: 51   NYLLWISIICVFFFFAVLFQMFLPGLVIDKSDKPWISKEILPPDLVGFREKGFLDFGDDV 110

Query: 588  RFKP-KILARFEKEAREANL-SSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAA 761
            R +P K+L +F+++A   N  SS+ N T QR+G+RKP+LALVF DLL DP Q+LMV+++ 
Sbjct: 111  RIEPTKLLMKFQRDAHGFNFTSSSLNTTLQRFGFRKPKLALVFGDLLADPEQVLMVSLSK 170

Query: 762  ALQEIGYAIQVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARD 941
            ALQE+GYAI+VYSLEDGPV ++W  + +PVT+++ N      +DWL+YDGI+VNSL AR 
Sbjct: 171  ALQEVGYAIEVYSLEDGPVNSIWQKMGVPVTILKPNQESSCVIDWLSYDGIIVNSLRARS 230

Query: 942  IFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVL 1121
            +F+CFMQEPFKSLPLIW I+E TLA R R YNS+GQT LL DWKK+F+RA+VVVF NY+L
Sbjct: 231  MFTCFMQEPFKSLPLIWVINEETLAVRSRQYNSTGQTELLTDWKKIFSRASVVVFHNYLL 290

Query: 1122 PMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRD 1301
            P++Y+  DAGN++VIPGSP E  +A N     +           DD VI+IVGSQF Y+ 
Sbjct: 291  PILYTEFDAGNFYVIPGSPEEVCKAKNLEFPPQK----------DDVVISIVGSQFLYKG 340

Query: 1302 LWLEHALILKAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVV 1481
             WLEHAL+L+A+ PLF+   +  + NSHLKII+L G++ SNY VA+E IS NL YP+  V
Sbjct: 341  QWLEHALLLQALRPLFS-GNYLESDNSHLKIIVLGGETASNYSVAIETISQNLTYPKEAV 399

Query: 1482 KHIAIGGDADNTLSAADLVIYGSFLEE 1562
            KH+ + G+ D  L ++DLVIYGSFLEE
Sbjct: 400  KHVRVAGNVDKILESSDLVIYGSFLEE 426

