BLASTX nr result
ID: Paeonia25_contig00020260
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Paeonia25_contig00020260 (1564 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002276292.2| PREDICTED: uncharacterized protein LOC100262... 506 e-141 emb|CBI40456.3| unnamed protein product [Vitis vinifera] 506 e-141 emb|CAN69310.1| hypothetical protein VITISV_003086 [Vitis vinifera] 497 e-138 ref|XP_007051668.1| Glycosyl transferase family 1 protein isofor... 495 e-137 ref|XP_007051667.1| Glycosyl transferase family 1 protein isofor... 495 e-137 ref|XP_002301386.2| glycosyltransferase family protein [Populus ... 494 e-137 ref|XP_002320170.1| glycosyltransferase family protein [Populus ... 494 e-137 ref|XP_007220285.1| hypothetical protein PRUPE_ppa000692mg [Prun... 489 e-135 ref|XP_002511940.1| transferase, transferring glycosyl groups, p... 474 e-133 gb|EXB52710.1| hypothetical protein L484_022487 [Morus notabilis] 477 e-132 ref|XP_004159777.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 470 e-130 ref|XP_004138457.1| PREDICTED: uncharacterized protein LOC101212... 470 e-130 ref|XP_004306713.1| PREDICTED: uncharacterized protein LOC101302... 469 e-129 ref|XP_006444916.1| hypothetical protein CICLE_v10018649mg [Citr... 461 e-127 ref|XP_006339650.1| PREDICTED: uncharacterized protein LOC102591... 454 e-125 ref|XP_004229962.1| PREDICTED: uncharacterized protein LOC101246... 452 e-124 ref|XP_006396290.1| hypothetical protein EUTSA_v10028385mg [Eutr... 424 e-116 ref|XP_002872903.1| glycosyltransferase family protein 1 [Arabid... 416 e-113 ref|XP_007135157.1| hypothetical protein PHAVU_010G105900g [Phas... 414 e-113 ref|NP_192030.4| glycosyl transferase family 1 protein [Arabidop... 402 e-109 >ref|XP_002276292.2| PREDICTED: uncharacterized protein LOC100262009 [Vitis vinifera] Length = 1026 Score = 506 bits (1304), Expect = e-141 Identities = 269/434 (61%), Positives = 313/434 (72%), Gaps = 3/434 (0%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449 MGSLE G+ KRD L DYLQW+CT Sbjct: 1 MGSLENGVPVKRDPLL-----RSSSNKGSAFQRPIVRFSRFLFFGKLDYLQWVCTVAVFC 55 Query: 450 XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623 LPG +MEKSG +L E GDL+F K I GLDF E IRF+P K+L +F+K Sbjct: 56 FFVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKNIGGLDFGEGIRFEPSKLLQKFQK 115 Query: 624 EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803 EA E NLSS +R R R+GYRKPQLALVF DLLVDP Q+LMVTVA+AL E+GY IQVYSL Sbjct: 116 EADEVNLSSA-SRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQVYSL 174 Query: 804 EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983 EDGPV A+W N+ PVT+I++NA VDWLNYDGI+VNSLEAR + SCF+QEPFKSLP Sbjct: 175 EDGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNYDGIIVNSLEARGVVSCFVQEPFKSLP 234 Query: 984 LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163 LIWTI E TLATRLR YN +G+ L+NDWKKVFNRAT VVFPNYVLPMIYS D+GNYFV Sbjct: 235 LIWTIPEGTLATRLRQYNLTGKIELVNDWKKVFNRATAVVFPNYVLPMIYSTFDSGNYFV 294 Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343 IPGSP +AWE DN A ++D+ R +GYG DDFVIA+V SQF Y+ LWLEHALIL+A+ P Sbjct: 295 IPGSPAQAWEVDNFMASHRDSPRVKMGYGPDDFVIALVRSQFLYKGLWLEHALILQALLP 354 Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIG-GDADNTL 1520 L FP NN NSHLKI+I SG+S +NY VAVEAI+L L+YP+GVVKHIAI G+ADN L Sbjct: 355 LVAEFPVDNNSNSHLKILITSGNSANNYSVAVEAIALKLRYPKGVVKHIAIDVGEADNVL 414 Query: 1521 SAADLVIYGSFLEE 1562 +AAD+VIYGSFLEE Sbjct: 415 AAADIVIYGSFLEE 428 >emb|CBI40456.3| unnamed protein product [Vitis vinifera] Length = 1026 Score = 506 bits (1304), Expect = e-141 Identities = 269/434 (61%), Positives = 313/434 (72%), Gaps = 3/434 (0%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449 MGSLE G+ KRD L DYLQW+CT Sbjct: 1 MGSLENGVPVKRDPLL-----RSSSNKGSAFQRPIVRFSRFLFFGKLDYLQWVCTVAVFC 55 Query: 450 XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623 LPG +MEKSG +L E GDL+F K I GLDF E IRF+P K+L +F+K Sbjct: 56 FFVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKNIGGLDFGEGIRFEPSKLLQKFQK 115 Query: 624 EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803 EA E NLSS +R R R+GYRKPQLALVF DLLVDP Q+LMVTVA+AL E+GY IQVYSL Sbjct: 116 EADEVNLSSA-SRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQVYSL 174 Query: 804 EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983 EDGPV A+W N+ PVT+I++NA VDWLNYDGI+VNSLEAR + SCF+QEPFKSLP Sbjct: 175 EDGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNYDGIIVNSLEARGVVSCFVQEPFKSLP 234 Query: 984 LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163 LIWTI E TLATRLR YN +G+ L+NDWKKVFNRAT VVFPNYVLPMIYS D+GNYFV Sbjct: 235 LIWTIPEGTLATRLRQYNLTGKIELVNDWKKVFNRATAVVFPNYVLPMIYSTFDSGNYFV 294 Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343 IPGSP +AWE DN A ++D+ R +GYG DDFVIA+V SQF Y+ LWLEHALIL+A+ P Sbjct: 295 IPGSPAQAWEVDNFMASHRDSPRVKMGYGPDDFVIALVRSQFLYKGLWLEHALILQALLP 354 Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIG-GDADNTL 1520 L FP NN NSHLKI+I SG+S +NY VAVEAI+L L+YP+GVVKHIAI G+ADN L Sbjct: 355 LVAEFPVDNNSNSHLKILITSGNSANNYSVAVEAIALKLRYPKGVVKHIAIDVGEADNVL 414 Query: 1521 SAADLVIYGSFLEE 1562 +AAD+VIYGSFLEE Sbjct: 415 AAADIVIYGSFLEE 428 >emb|CAN69310.1| hypothetical protein VITISV_003086 [Vitis vinifera] Length = 1040 Score = 497 bits (1280), Expect = e-138 Identities = 269/448 (60%), Positives = 314/448 (70%), Gaps = 17/448 (3%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449 MGSLE G+ KRD L DYLQW+CT Sbjct: 1 MGSLENGVPVKRDPLL-----RSSSNKGSAFQRPIVRFSRFLFFGKLDYLQWVCTVAVFC 55 Query: 450 XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623 LPG +MEKSG +L E GDL+F K+I GLDF E IRF+P K+L +F+K Sbjct: 56 FFVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKKIGGLDFGEGIRFEPSKLLQKFQK 115 Query: 624 EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQ---- 791 EA E NLSS +R R R+GYRKPQLALVF DLLVDP Q+LMVTVA+AL E+GY IQ Sbjct: 116 EADEVNLSSA-SRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQALPY 174 Query: 792 ----------VYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARD 941 VYSLEDGPV A+W N+ PVT+I++NA VDWLNYDGI+VNSLEAR Sbjct: 175 LVSIYVAWIQVYSLEDGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNYDGIIVNSLEARG 234 Query: 942 IFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVL 1121 + SCF+QEPFKSLPLIWTI E TLATRLR YN +G+ L+NDWKKVFNRAT VVFPNYVL Sbjct: 235 VVSCFVQEPFKSLPLIWTIPEGTLATRLRQYNLTGKIELVNDWKKVFNRATAVVFPNYVL 294 Query: 1122 PMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRD 1301 PMIYS D+GNYFVIPGSP +AWE DN A ++D+ R +GYG DDFVIA+V SQF Y+ Sbjct: 295 PMIYSTFDSGNYFVIPGSPAQAWEVDNFMASHRDSPRVKMGYGPDDFVIALVRSQFLYKG 354 Query: 1302 LWLEHALILKAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVV 1481 LWLEHALIL+A+ PL FP NN NSHLKI+I SG+S +NY VAVEAI+L L+YP+GVV Sbjct: 355 LWLEHALILQALLPLVAEFPVDNNSNSHLKILITSGNSANNYSVAVEAIALKLRYPKGVV 414 Query: 1482 KHIAIG-GDADNTLSAADLVIYGSFLEE 1562 KHIAI G+ADN L+AAD+VIYGSFLEE Sbjct: 415 KHIAIDVGEADNVLAAADIVIYGSFLEE 442 >ref|XP_007051668.1| Glycosyl transferase family 1 protein isoform 2 [Theobroma cacao] gi|508703929|gb|EOX95825.1| Glycosyl transferase family 1 protein isoform 2 [Theobroma cacao] Length = 686 Score = 495 bits (1274), Expect = e-137 Identities = 259/436 (59%), Positives = 313/436 (71%), Gaps = 5/436 (1%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449 MGSLE+G+S KR DYLQWICT Sbjct: 1 MGSLESGISLKR--------AGSRNERNPFLNRPRSRFSRFLLFKKLDYLQWICTVVVFL 52 Query: 450 XXXXXXXXXLPGSVMEKS--GLALTKEIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFE 620 LPGSVM+KS K++V G+L + KE+ GLDF EDIR +P K+L +F+ Sbjct: 53 FFVVFFQMYLPGSVMDKSQDSFLEDKDLVYGELRYLKEMGGLDFGEDIRLEPRKLLEKFQ 112 Query: 621 KEAREANL--SSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQV 794 +E + NL SS FNR++ R+ YRKPQLALVFADLLVDP Q+LMVT+A AL+EIGYAIQV Sbjct: 113 RENKVLNLESSSGFNRSQHRFQYRKPQLALVFADLLVDPQQLLMVTIATALREIGYAIQV 172 Query: 795 YSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFK 974 YSLEDGPV VW ++ +PV+++Q N++ I VDWLNYDGILV+SLEA+ +FS FMQEPFK Sbjct: 173 YSLEDGPVHNVWQSIGVPVSVLQVNSNE-IGVDWLNYDGILVSSLEAKGVFSSFMQEPFK 231 Query: 975 SLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGN 1154 S+PLIWTIHERTLA R R + SSGQ L+N+WKKVF+RATVVVFPNY LPMIYS D GN Sbjct: 232 SIPLIWTIHERTLAVRSRQFTSSGQIELVNNWKKVFSRATVVVFPNYALPMIYSAFDTGN 291 Query: 1155 YFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKA 1334 Y+VIPGSP EAW+ +NA +YKDN R +GYG D+ +IAIVGSQF YR LWLEHA++L+A Sbjct: 292 YYVIPGSPAEAWKGENAMNLYKDNQRVKMGYGPDEVLIAIVGSQFMYRGLWLEHAIVLQA 351 Query: 1335 IYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADN 1514 + PLFT F N NSH KIIILSGDSTSNY +AVE I+ NL+YP GVVKH+A+ GD D+ Sbjct: 352 LLPLFTDFSSDTNSNSHPKIIILSGDSTSNYSMAVERITHNLKYPSGVVKHVAVDGDVDS 411 Query: 1515 TLSAADLVIYGSFLEE 1562 LS D+VIYGSFLEE Sbjct: 412 VLSMTDIVIYGSFLEE 427 >ref|XP_007051667.1| Glycosyl transferase family 1 protein isoform 1 [Theobroma cacao] gi|508703928|gb|EOX95824.1| Glycosyl transferase family 1 protein isoform 1 [Theobroma cacao] Length = 1026 Score = 495 bits (1274), Expect = e-137 Identities = 259/436 (59%), Positives = 313/436 (71%), Gaps = 5/436 (1%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449 MGSLE+G+S KR DYLQWICT Sbjct: 1 MGSLESGISLKR--------AGSRNERNPFLNRPRSRFSRFLLFKKLDYLQWICTVVVFL 52 Query: 450 XXXXXXXXXLPGSVMEKS--GLALTKEIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFE 620 LPGSVM+KS K++V G+L + KE+ GLDF EDIR +P K+L +F+ Sbjct: 53 FFVVFFQMYLPGSVMDKSQDSFLEDKDLVYGELRYLKEMGGLDFGEDIRLEPRKLLEKFQ 112 Query: 621 KEAREANL--SSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQV 794 +E + NL SS FNR++ R+ YRKPQLALVFADLLVDP Q+LMVT+A AL+EIGYAIQV Sbjct: 113 RENKVLNLESSSGFNRSQHRFQYRKPQLALVFADLLVDPQQLLMVTIATALREIGYAIQV 172 Query: 795 YSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFK 974 YSLEDGPV VW ++ +PV+++Q N++ I VDWLNYDGILV+SLEA+ +FS FMQEPFK Sbjct: 173 YSLEDGPVHNVWQSIGVPVSVLQVNSNE-IGVDWLNYDGILVSSLEAKGVFSSFMQEPFK 231 Query: 975 SLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGN 1154 S+PLIWTIHERTLA R R + SSGQ L+N+WKKVF+RATVVVFPNY LPMIYS D GN Sbjct: 232 SIPLIWTIHERTLAVRSRQFTSSGQIELVNNWKKVFSRATVVVFPNYALPMIYSAFDTGN 291 Query: 1155 YFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKA 1334 Y+VIPGSP EAW+ +NA +YKDN R +GYG D+ +IAIVGSQF YR LWLEHA++L+A Sbjct: 292 YYVIPGSPAEAWKGENAMNLYKDNQRVKMGYGPDEVLIAIVGSQFMYRGLWLEHAIVLQA 351 Query: 1335 IYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADN 1514 + PLFT F N NSH KIIILSGDSTSNY +AVE I+ NL+YP GVVKH+A+ GD D+ Sbjct: 352 LLPLFTDFSSDTNSNSHPKIIILSGDSTSNYSMAVERITHNLKYPSGVVKHVAVDGDVDS 411 Query: 1515 TLSAADLVIYGSFLEE 1562 LS D+VIYGSFLEE Sbjct: 412 VLSMTDIVIYGSFLEE 427 >ref|XP_002301386.2| glycosyltransferase family protein [Populus trichocarpa] gi|550345174|gb|EEE80659.2| glycosyltransferase family protein [Populus trichocarpa] Length = 984 Score = 494 bits (1273), Expect = e-137 Identities = 262/438 (59%), Positives = 305/438 (69%), Gaps = 7/438 (1%) Frame = +3 Query: 270 MGSLETG-LSFKRD-HFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXX 443 MGSLE+G +SFKRD + L+ DY+QWICT Sbjct: 1 MGSLESGGISFKRDSNNLIRSHSAGRTERNPFLYRPRSRLSRFLLFKKLDYIQWICTVAV 60 Query: 444 XXXXXXXXXXXLPGSVMEKSGLALTK----EIVSGDLTFPKEIAGLDFKEDIRFKP-KIL 608 LPGSV+EKS L + E+V+ DL + KEI GLDF EDI+F+P KIL Sbjct: 61 FLFFVVLFQMFLPGSVVEKSELGSSPWRGMELVNKDLLYLKEIGGLDFGEDIKFEPSKIL 120 Query: 609 ARFEKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAI 788 +F KE RE N+ T N T R+ YRKPQLALVFADLLVDP Q+LMVTVA ALQEIGY I Sbjct: 121 QKFRKENREMNMPFT-NGTLSRFPYRKPQLALVFADLLVDPQQLLMVTVATALQEIGYTI 179 Query: 789 QVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEP 968 VY+L DGPV+ +W ++ PVT+IQ + + I VDWLNYDGILVNSLE R + SCFMQEP Sbjct: 180 HVYTLRDGPVQNIWKSMGYPVTIIQMSHKLEIAVDWLNYDGILVNSLETRSVISCFMQEP 239 Query: 969 FKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDA 1148 FKS+PLIWTIHER LA R R Y SS Q LLNDW+K FNRATVVVFPN+VLPM+YS DA Sbjct: 240 FKSVPLIWTIHERALAIRSRQYTSSWQIELLNDWRKAFNRATVVVFPNHVLPMMYSAFDA 299 Query: 1149 GNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALIL 1328 GNY+VIPGSP E WEAD A+Y D++R +GY D VIA+VGSQF YR LWLEHAL+L Sbjct: 300 GNYYVIPGSPAEVWEADTTMALYNDDIRVKMGYEPTDIVIAVVGSQFLYRGLWLEHALVL 359 Query: 1329 KAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDA 1508 KA+ PL FP +N SHLKII+LSGDST NY AVEAI++NL YPRG VKH A+ GD Sbjct: 360 KALLPLLQDFPLDSNSISHLKIIVLSGDSTGNYSAAVEAIAVNLSYPRGTVKHFAVDGDV 419 Query: 1509 DNTLSAADLVIYGSFLEE 1562 + LSA DLVIYGSFLEE Sbjct: 420 SSALSAVDLVIYGSFLEE 437 >ref|XP_002320170.1| glycosyltransferase family protein [Populus trichocarpa] gi|222860943|gb|EEE98485.1| glycosyltransferase family protein [Populus trichocarpa] Length = 990 Score = 494 bits (1271), Expect = e-137 Identities = 263/438 (60%), Positives = 302/438 (68%), Gaps = 7/438 (1%) Frame = +3 Query: 270 MGSLETG-LSFKRD-HFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXX 443 MGSLETG +SFKRD + L+ DY+QWICT Sbjct: 1 MGSLETGGISFKRDKNTLIRSYSAGRTERHPFLYRPRSSFSRFLRFKKLDYIQWICTVAV 60 Query: 444 XXXXXXXXXXXLPGSVMEKSGLALTK----EIVSGDLTFPKEIAGLDFKEDIRFKP-KIL 608 LPGSV+EKS L + E+V DL + KEI GLDF EDI+F+P KIL Sbjct: 61 FLFFVVLFQMFLPGSVVEKSELGSSPWRGMELVDKDLWYLKEIGGLDFGEDIKFQPSKIL 120 Query: 609 ARFEKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAI 788 F KE RE N+S + NRT R+ YRKPQLALVFADLLVDPHQ+LMVTVA ALQEIGY I Sbjct: 121 QHFRKENREMNMSFS-NRTLSRFPYRKPQLALVFADLLVDPHQLLMVTVATALQEIGYTI 179 Query: 789 QVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEP 968 VYSL DGP +++W ++R PV +IQ + M I VDWLNYDGILVNSLE + +FSCFMQEP Sbjct: 180 HVYSLGDGPAQSIWKSMRSPVNIIQISHKMEIAVDWLNYDGILVNSLETKSVFSCFMQEP 239 Query: 969 FKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDA 1148 FKS+PLIWTI+ERTLAT R Y SS Q LL DW+K FNRATVVVFPN+VLPM+YS D Sbjct: 240 FKSVPLIWTINERTLATHSRQYTSSWQIELLYDWRKAFNRATVVVFPNHVLPMMYSAFDT 299 Query: 1149 GNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALIL 1328 GNY+VIPGSP + WE + A+Y D + +GY DD VIAIVGSQF YR LWLEHAL+L Sbjct: 300 GNYYVIPGSPADIWETETTMALYNDEIHVKMGYEPDDIVIAIVGSQFLYRGLWLEHALVL 359 Query: 1329 KAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDA 1508 KA+ PLF F NN SHLKIIILSGD T NY VAVEAI+ NL YPRG VKH A+ D Sbjct: 360 KALLPLFAEFSLDNNSKSHLKIIILSGDPTGNYSVAVEAIAANLSYPRGTVKHFAVDDDV 419 Query: 1509 DNTLSAADLVIYGSFLEE 1562 + L AADLVIYGSFLEE Sbjct: 420 GSPLGAADLVIYGSFLEE 437 >ref|XP_007220285.1| hypothetical protein PRUPE_ppa000692mg [Prunus persica] gi|462416747|gb|EMJ21484.1| hypothetical protein PRUPE_ppa000692mg [Prunus persica] Length = 1034 Score = 489 bits (1260), Expect = e-135 Identities = 256/433 (59%), Positives = 309/433 (71%), Gaps = 2/433 (0%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449 MGSLE+G+ KRD LL DYLQWICT Sbjct: 1 MGSLESGVPLKRDP-LLRSSSTGRTERHPFLQRPRSKFSRFLLIKKLDYLQWICTVAVFL 59 Query: 450 XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623 LPGSV+EKS + + E+ S DL F KE+ LDF EDIRF+P K+L +F+K Sbjct: 60 FFVVLFQMFLPGSVVEKSRVLMKNVELNSEDLRFLKELGLLDFGEDIRFEPSKLLEKFQK 119 Query: 624 EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803 EAREA+L+S NRTRQ +GYRKPQLALVFADL V Q+LMVTVAAALQEIGYA VYSL Sbjct: 120 EAREASLTSAMNRTRQHFGYRKPQLALVFADLSVASQQLLMVTVAAALQEIGYAFSVYSL 179 Query: 804 EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983 EDGPV VW +L +PVT+IQT +N+DWLNYDGILVNSLEA+ IFSCF+QEPFKSLP Sbjct: 180 EDGPVHDVWRSLGVPVTIIQTYDQSELNIDWLNYDGILVNSLEAKGIFSCFVQEPFKSLP 239 Query: 984 LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163 ++WTIHE+ LATR R Y+S+ Q L NDWK++F+R+TVVVFPNY LPM YS DAGN+FV Sbjct: 240 ILWTIHEQALATRSRKYSSNRQIELFNDWKRLFSRSTVVVFPNYFLPMAYSVFDAGNFFV 299 Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343 IPGSP EA +AD+ + K++L +GYG +D VI IVGSQF YR LWLEH+++L+A+ P Sbjct: 300 IPGSPAEACKADSIMVLDKNHLLAKMGYGSEDVVITIVGSQFLYRGLWLEHSIVLRAVLP 359 Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLS 1523 L FP NN SHLKII+LSGDSTSNY VEAI+ NL+YP G+VKH+A+ AD+ LS Sbjct: 360 LLEDFPLDNNSYSHLKIIVLSGDSTSNYSSVVEAIAYNLKYPSGIVKHVAVDMAADSVLS 419 Query: 1524 AADLVIYGSFLEE 1562 +D+VIYGSFLEE Sbjct: 420 ISDVVIYGSFLEE 432 >ref|XP_002511940.1| transferase, transferring glycosyl groups, putative [Ricinus communis] gi|223549120|gb|EEF50609.1| transferase, transferring glycosyl groups, putative [Ricinus communis] Length = 935 Score = 474 bits (1220), Expect(2) = e-133 Identities = 237/362 (65%), Positives = 289/362 (79%), Gaps = 3/362 (0%) Frame = +3 Query: 477 LPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEKEAREANL-S 647 LPGS+++KS ++L K EIV GDL + K + LDF ED++F+P K+L +F+KE RE NL S Sbjct: 18 LPGSMIDKSEVSLKKLEIVPGDLLYLKAMGTLDFGEDVQFQPLKLLEKFQKENREVNLTS 77 Query: 648 STFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSLEDGPVRAV 827 S FNRT R+GYRKPQLALVFADLL DP Q+LMVTVA ALQEIGYAIQV+S+ DGPV + Sbjct: 78 SAFNRTLLRFGYRKPQLALVFADLLADPQQLLMVTVATALQEIGYAIQVFSVNDGPVHDI 137 Query: 828 WGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLPLIWTIHER 1007 W + +PVT+ QTN M I VDWL +D I+VNSLEA+ +F CFMQEPFKS+PLIWTIHE+ Sbjct: 138 WKRIGVPVTIFQTNHKMEIAVDWLIFDSIIVNSLEAKVVFPCFMQEPFKSIPLIWTIHEK 197 Query: 1008 TLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFVIPGSPTEA 1187 TL R R Y S+GQ L++DWK+VFNRATVVVFPN+VLPM+YS DA NY+VIPGSP E Sbjct: 198 TLGIRSRQYISNGQIELVSDWKRVFNRATVVVFPNHVLPMMYSAFDAENYYVIPGSPAEV 257 Query: 1188 WEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYPLFTMFPFY 1367 WEA+ A+YKD++R +GY DD +IAIVGSQF YR LWLEHALIL+A+ PLF+ F F Sbjct: 258 WEAEAMAAVYKDSIRMKMGYRPDDIIIAIVGSQFLYRGLWLEHALILQALSPLFSDFSFD 317 Query: 1368 NNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLSAADLVIYG 1547 +N N HLKII+LSG+STSNY VA+EAI++NL YP G VKHIAI GD + L+AAD+V YG Sbjct: 318 DNSNPHLKIIVLSGNSTSNYSVAIEAIAINLHYPIGAVKHIAIDGDVGSFLTAADIVTYG 377 Query: 1548 SF 1553 SF Sbjct: 378 SF 379 Score = 29.3 bits (64), Expect(2) = e-133 Identities = 10/15 (66%), Positives = 13/15 (86%) Frame = +1 Query: 349 DTHFYRDPDRDSHVF 393 DTHF +DPD+D H+F Sbjct: 3 DTHFCKDPDQDFHMF 17 >gb|EXB52710.1| hypothetical protein L484_022487 [Morus notabilis] Length = 1040 Score = 477 bits (1227), Expect = e-132 Identities = 250/435 (57%), Positives = 300/435 (68%), Gaps = 4/435 (0%) Frame = +3 Query: 270 MGSLETGLS--FKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXX 443 MGSLE G + FKRD FL DYLQWICT Sbjct: 1 MGSLEGGSATPFKRDPFLRSASFTGRSDRNPFLQRQRSRFSRFFLFKKLDYLQWICTVAV 60 Query: 444 XXXXXXXXXXXLPGSVMEKS-GLALTKEIVSGDLTFPKEIAGLDFKEDIRFKP-KILARF 617 LPGSV+EKS +E SGDL F KE LDF EDIRF+P K+L +F Sbjct: 61 FLFFVVLFQMFLPGSVVEKSIKTHRDEEFSSGDLFFLKEYGILDFGEDIRFEPSKVLEKF 120 Query: 618 EKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVY 797 +E +E NLS FNR+R RY ++KPQLALVFADLLVD Q+LMVTVAAALQEIGY IQVY Sbjct: 121 RRENKEVNLSHAFNRSRLRYPHKKPQLALVFADLLVDSQQLLMVTVAAALQEIGYEIQVY 180 Query: 798 SLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKS 977 SLE GPV +W NL +PV++IQ + VDWL YDGILVNS EA+D+FSCF+QEPFKS Sbjct: 181 SLEGGPVHGIWRNLGVPVSIIQACDPADVTVDWLIYDGILVNSFEAKDMFSCFVQEPFKS 240 Query: 978 LPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNY 1157 LPL+WTIH+R LATR R+Y S+ Q LLNDWK+ FNR+TVVVFPNYVLPMIYS D+GN+ Sbjct: 241 LPLVWTIHDRALATRSRNYTSNKQIELLNDWKRAFNRSTVVVFPNYVLPMIYSTFDSGNF 300 Query: 1158 FVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAI 1337 FVIPGSP EAW+ + KD LR +GYG +D VI IVGS+ YR LWLEH+++L+A+ Sbjct: 301 FVIPGSPAEAWKIETLMESEKDYLRAKMGYGHEDIVITIVGSELLYRGLWLEHSIVLQAL 360 Query: 1338 YPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNT 1517 +PL F N SHLKII+LSGD TSNY AVEAI+LNL+YP G+V H+ + +ADN Sbjct: 361 FPLLEDFSSDENSFSHLKIIVLSGDPTSNYSSAVEAIALNLKYPNGIVNHVPMDAEADNV 420 Query: 1518 LSAADLVIYGSFLEE 1562 L+A+D+VIYGS +EE Sbjct: 421 LTASDVVIYGSSVEE 435 >ref|XP_004159777.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101212216 [Cucumis sativus] Length = 1037 Score = 470 bits (1209), Expect = e-130 Identities = 250/433 (57%), Positives = 296/433 (68%), Gaps = 2/433 (0%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449 MGSLE G KRD LL DYLQWICT Sbjct: 1 MGSLENGFPLKRDP-LLRSSSSVRGERYPFLQRPRSRFSRFLFFRKIDYLQWICTVAVFF 59 Query: 450 XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623 LPGSV+EKS +AL E GDL F KE+ LDF EDIRF+P K+L +F+K Sbjct: 60 FFVVLFQMFLPGSVVEKSEVALKDVEKSLGDLKFLKELGMLDFGEDIRFEPSKLLGKFKK 119 Query: 624 EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803 EAREA+ SS FNRTR R+GYRKPQLALVF+DLLVD +Q+LMVT+A+ALQEIGY QVYSL Sbjct: 120 EAREADFSS-FNRTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSL 178 Query: 804 EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983 + GP VW + +PVTLIQ+ + VDWLNYDGILV+SL +D+FSC++QEPFKSLP Sbjct: 179 QGGPANDVWRQMGVPVTLIQSCDETEVMVDWLNYDGILVHSLGVKDVFSCYLQEPFKSLP 238 Query: 984 LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163 LIWTIHE LA R ++Y S G +LNDWK+VFN +TVVVFPNYV+PMIYS D+GN+FV Sbjct: 239 LIWTIHEEALAIRSQNYASDGLLDILNDWKRVFNHSTVVVFPNYVMPMIYSAYDSGNFFV 298 Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343 IP P EA EA+ DNLR +GY DD VIAIVGSQF YR +WLEHA++L+A+ P Sbjct: 299 IPSFPAEALEAEIDVTSDADNLRAKMGYANDDLVIAIVGSQFLYRGMWLEHAMVLQAMLP 358 Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLS 1523 L F FY + NS LKI +LSGDS SNY +AVEAI+ L+YPR VVKH + D+D LS Sbjct: 359 LLHEFSFYEHSNSRLKIFVLSGDSNSNYTMAVEAIAQRLEYPRSVVKHFPVAADSDKALS 418 Query: 1524 AADLVIYGSFLEE 1562 ADLVIYGS LEE Sbjct: 419 MADLVIYGSCLEE 431 >ref|XP_004138457.1| PREDICTED: uncharacterized protein LOC101212216 [Cucumis sativus] Length = 1037 Score = 470 bits (1209), Expect = e-130 Identities = 250/433 (57%), Positives = 296/433 (68%), Gaps = 2/433 (0%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449 MGSLE G KRD LL DYLQWICT Sbjct: 1 MGSLENGFPLKRDP-LLRSSSSVRGERYPFLQRPRSRFSRFLFFRKIDYLQWICTVAVFF 59 Query: 450 XXXXXXXXXLPGSVMEKSGLALTK-EIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623 LPGSV+EKS +AL E GDL F KE+ LDF EDIRF+P K+L +F+K Sbjct: 60 FFVVLFQMFLPGSVVEKSEVALKDVEKSLGDLKFLKELGMLDFGEDIRFEPSKLLGKFKK 119 Query: 624 EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803 EAREA+ SS FNRTR R+GYRKPQLALVF+DLLVD +Q+LMVT+A+ALQEIGY QVYSL Sbjct: 120 EAREADFSS-FNRTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSL 178 Query: 804 EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983 + GP VW + +PVTLIQ+ + VDWLNYDGILV+SL +D+FSC++QEPFKSLP Sbjct: 179 QGGPANDVWRQMGVPVTLIQSCDETEVMVDWLNYDGILVHSLGVKDVFSCYLQEPFKSLP 238 Query: 984 LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163 LIWTIHE LA R ++Y S G +LNDWK+VFN +TVVVFPNYV+PMIYS D+GN+FV Sbjct: 239 LIWTIHEEALAIRSQNYASDGLLDILNDWKRVFNHSTVVVFPNYVMPMIYSAYDSGNFFV 298 Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343 IP P EA EA+ DNLR +GY DD VIAIVGSQF YR +WLEHA++L+A+ P Sbjct: 299 IPSFPAEALEAEIDVTSDADNLRAKMGYANDDLVIAIVGSQFLYRGMWLEHAMVLQAMLP 358 Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLS 1523 L F FY + NS LKI +LSGDS SNY +AVEAI+ L+YPR VVKH + D+D LS Sbjct: 359 LLHEFSFYEHSNSRLKIFVLSGDSNSNYTMAVEAIAQRLEYPRSVVKHFPVAADSDKALS 418 Query: 1524 AADLVIYGSFLEE 1562 ADLVIYGS LEE Sbjct: 419 MADLVIYGSCLEE 431 >ref|XP_004306713.1| PREDICTED: uncharacterized protein LOC101302584 [Fragaria vesca subsp. vesca] Length = 1039 Score = 469 bits (1208), Expect = e-129 Identities = 259/438 (59%), Positives = 302/438 (68%), Gaps = 7/438 (1%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-DYLQWICTXXXX 446 MGSLE+G+ KRD L DYL WICT Sbjct: 1 MGSLESGVPLKRDPLLRSSSNGGRSSDRHLFLQRPRSRFSRFLILKKLDYLLWICTVAVF 60 Query: 447 XXXXXXXXXXLPGSVMEKSGLALTKEIVS---GDLTFPKEIAGLDFKEDIRFKP-KILAR 614 LPGSV+EKSG L K+ V GDL F KE+ LDF EDIRF+P K+L + Sbjct: 61 LFFVVLFQMFLPGSVVEKSGSLLQKKNVELDYGDLRFVKELGLLDFGEDIRFEPSKLLEK 120 Query: 615 FEKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQV 794 F KE REA+LSS FNRT Q +G RKPQLALVFADLL D HQ+ MVTVAAALQEIGY + V Sbjct: 121 FRKEGREASLSSGFNRTLQHFGLRKPQLALVFADLLFDSHQLQMVTVAAALQEIGYELWV 180 Query: 795 YSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFK 974 YSLEDGP R W +L +PVT+IQT I VDWLNY+GILV+SLEA+ IFSCF+QEPFK Sbjct: 181 YSLEDGPARGAWKSLGVPVTIIQTCDQPKIVVDWLNYNGILVSSLEAKGIFSCFVQEPFK 240 Query: 975 SLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGN 1154 SLP+IWTIHE LATR R Y+SS Q LLNDWK+VFNR+TVVVFPNY LPMIYS DAGN Sbjct: 241 SLPVIWTIHEEALATRSRKYSSSSQIELLNDWKRVFNRSTVVVFPNYFLPMIYSTLDAGN 300 Query: 1155 YFVIPGSPTEAWEADNAN--AIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALIL 1328 +FVIPGSP EA + D+ + A+ DNL+ G ++ VI IVGS+F YR LWLEH+++L Sbjct: 301 FFVIPGSPAEACKTDSDSIVALDIDNLQGSAGNEPENVVITIVGSKFLYRGLWLEHSIVL 360 Query: 1329 KAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDA 1508 +A+ PL F NN +SHLKII+LSGDSTSNY VEAI+ NL+YP G+VKH AI DA Sbjct: 361 RALLPLLEDFLLDNN-SSHLKIIVLSGDSTSNYSSVVEAIAYNLKYPSGIVKHAAIDVDA 419 Query: 1509 DNTLSAADLVIYGSFLEE 1562 DN LS + LVIYGSFLEE Sbjct: 420 DNVLSTSHLVIYGSFLEE 437 >ref|XP_006444916.1| hypothetical protein CICLE_v10018649mg [Citrus clementina] gi|568876282|ref|XP_006491210.1| PREDICTED: uncharacterized protein LOC102628793 [Citrus sinensis] gi|557547178|gb|ESR58156.1| hypothetical protein CICLE_v10018649mg [Citrus clementina] Length = 1038 Score = 461 bits (1186), Expect = e-127 Identities = 235/388 (60%), Positives = 281/388 (72%), Gaps = 4/388 (1%) Frame = +3 Query: 411 DYLQWICTXXXXXXXXXXXXXXLPGSVM---EKSGLALTKEIVSGDLTFPKEIAGLDFKE 581 DYL WICT LPGSV E G + V DL F KE+ LDF E Sbjct: 48 DYLLWICTVAVFLFFVVIFQLFLPGSVTVMDESQGSLRDFDKVPADLMFLKEMGLLDFGE 107 Query: 582 DIRFKP-KILARFEKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVA 758 ++ F P K++ +F+ E ++ NL+S F+R R+GYRKPQLALVF DLL+DP Q+ MVT+A Sbjct: 108 EVTFLPLKLMEKFQSEDKDVNLTSVFHRKLHRFGYRKPQLALVFPDLLIDPQQLQMVTIA 167 Query: 759 AALQEIGYAIQVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEAR 938 AL+EIGYAIQVYSLEDG VW N+ +PV ++QT V+WLNYDGILVNSLEA+ Sbjct: 168 IALREIGYAIQVYSLEDGRAHEVWRNIGVPVAILQTGREKASFVNWLNYDGILVNSLEAK 227 Query: 939 DIFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYV 1118 + S MQEPFKSLPL+WTIHE TLATR R+Y SSGQ LLNDWKKVFNRATVVVFP+YV Sbjct: 228 VVISNIMQEPFKSLPLVWTIHEGTLATRARNYASSGQLELLNDWKKVFNRATVVVFPDYV 287 Query: 1119 LPMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYR 1298 LPM+YS DAGNY+VIPGSP +AWEAD +Y D +R +G+ DD VIAIVG+QF YR Sbjct: 288 LPMMYSAFDAGNYYVIPGSPAKAWEADTNMDLYNDTVRVKMGFKPDDLVIAIVGTQFMYR 347 Query: 1299 DLWLEHALILKAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGV 1478 LWLEHALIL+A+ PLF+ N NS +K++ILSGDSTSNY V +EAI+ NL YP GV Sbjct: 348 GLWLEHALILRALLPLFSEVSVENESNSPIKVMILSGDSTSNYSVVIEAIAHNLHYPLGV 407 Query: 1479 VKHIAIGGDADNTLSAADLVIYGSFLEE 1562 VKHIA GD D+ L+ AD+VIYGSFLEE Sbjct: 408 VKHIAAEGDVDSVLNTADVVIYGSFLEE 435 >ref|XP_006339650.1| PREDICTED: uncharacterized protein LOC102591393 [Solanum tuberosum] Length = 1038 Score = 454 bits (1168), Expect = e-125 Identities = 237/433 (54%), Positives = 299/433 (69%), Gaps = 2/433 (0%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449 MGSLE G+S K+D LL +YLQWICT Sbjct: 1 MGSLENGVSLKKDQNLLRSSSATGRNVFGQRQVRSRFARFLFVKKI-NYLQWICTVAVFF 59 Query: 450 XXXXXXXXXLPGSVMEKSG-LALTKEIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623 LPGSVMEKSG L E+ GDL KE+ GLDF EDI+F+P K+LA+F Sbjct: 60 FFVVLFQMLLPGSVMEKSGNLTQDSEVGYGDLALLKELGGLDFGEDIKFEPLKLLAKFHD 119 Query: 624 EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803 EA EAN + +RT R+GYRKP+LALVFA+LLVDP+QI+MV VAAAL+EIGY I+V SL Sbjct: 120 EAVEAN-GTVASRTVVRFGYRKPKLALVFANLLVDPYQIMMVNVAAALREIGYEIEVLSL 178 Query: 804 EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983 EDGPVR++W ++ +PV ++ T+ I++DWLNYDG+LVNSLEA ++ SC MQEPFK++P Sbjct: 179 EDGPVRSIWKDVGVPVIIMNTDGHTKISLDWLNYDGLLVNSLEAVNVLSCVMQEPFKNVP 238 Query: 984 LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163 L+WTI+E TLA+RL+ Y SSGQ +++W+KVF+RA VVVFPNY+LP+ YS CDAGNYFV Sbjct: 239 LVWTINELTLASRLKQYISSGQNDFVDNWRKVFSRANVVVFPNYILPIGYSVCDAGNYFV 298 Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343 IPGSP EAWE D+ A+ DNLR + Y +DFVI +VGS Y+ LWLE AL+L+A+ P Sbjct: 299 IPGSPKEAWEVDSFMAVSNDNLRAKMDYAPEDFVIVVVGSHLLYKGLWLEQALVLQALLP 358 Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLS 1523 +F N NSH KI++L+ S +NY VAVEAI+ NL+YP G+VKHIA D + TLS Sbjct: 359 VFPELTNDGNSNSHFKIVVLTEGSNTNYSVAVEAIARNLRYPEGMVKHIAPAEDTERTLS 418 Query: 1524 AADLVIYGSFLEE 1562 ADLVIY SF EE Sbjct: 419 VADLVIYASFREE 431 >ref|XP_004229962.1| PREDICTED: uncharacterized protein LOC101246380 [Solanum lycopersicum] Length = 1038 Score = 452 bits (1163), Expect = e-124 Identities = 235/433 (54%), Positives = 299/433 (69%), Gaps = 2/433 (0%) Frame = +3 Query: 270 MGSLETGLSFKRDHFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDYLQWICTXXXXX 449 MGSLE G+S K+D LL +YLQWICT Sbjct: 1 MGSLENGVSLKKDQNLLRSSSATGRNAFGQRQVRSRFARFLFVKKI-NYLQWICTVAVFF 59 Query: 450 XXXXXXXXXLPGSVMEKSG-LALTKEIVSGDLTFPKEIAGLDFKEDIRFKP-KILARFEK 623 LPGSVMEKSG L L E+ GDL KE+ GLDF EDI+F+P K+LA+F + Sbjct: 60 FFVVLFQMLLPGSVMEKSGNLTLDSEVGYGDLALLKELGGLDFGEDIKFEPLKLLAKFRE 119 Query: 624 EAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAAALQEIGYAIQVYSL 803 EA EAN + +R R+GYRKP+LALVF++L VDP+QI+MV VAAAL+EIGY I+V SL Sbjct: 120 EAVEAN-GTVASRIVVRFGYRKPKLALVFSNLSVDPYQIMMVNVAAALREIGYEIEVLSL 178 Query: 804 EDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARDIFSCFMQEPFKSLP 983 EDGPVR++W ++ +PV ++ T+ I++DWLNYDG+LVNSLEA ++ SC MQEPFK++P Sbjct: 179 EDGPVRSIWKDIGVPVIIMNTDGHTKISLDWLNYDGLLVNSLEAVNVLSCVMQEPFKNVP 238 Query: 984 LIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVLPMIYSPCDAGNYFV 1163 L+WTI+E TLA+RL+ Y SSGQ +++W+KVF+RA VVVFPNY+LP+ YS CDAGNYFV Sbjct: 239 LVWTINELTLASRLKQYMSSGQNDFVDNWRKVFSRANVVVFPNYILPIGYSVCDAGNYFV 298 Query: 1164 IPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRDLWLEHALILKAIYP 1343 IPGSP EAWE D A+ D+LR + Y +DFVI +VGSQ Y+ LWLE AL+L+A+ P Sbjct: 299 IPGSPKEAWEVDTFMAVSNDDLRAKMDYAAEDFVIVVVGSQLLYKGLWLEQALVLQALLP 358 Query: 1344 LFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVVKHIAIGGDADNTLS 1523 +F N NSH KI++L+ S +NY VAVEAI+ NL+YP G+VKHIA D + TLS Sbjct: 359 VFPELMNDGNSNSHFKIVVLTEGSNTNYSVAVEAIARNLRYPEGMVKHIAPAEDTERTLS 418 Query: 1524 AADLVIYGSFLEE 1562 ADLVIY SF EE Sbjct: 419 VADLVIYASFREE 431 >ref|XP_006396290.1| hypothetical protein EUTSA_v10028385mg [Eutrema salsugineum] gi|557097307|gb|ESQ37743.1| hypothetical protein EUTSA_v10028385mg [Eutrema salsugineum] Length = 1022 Score = 424 bits (1089), Expect = e-116 Identities = 217/389 (55%), Positives = 278/389 (71%), Gaps = 5/389 (1%) Frame = +3 Query: 411 DYLQWICTXXXXXXXXXXXXXXLPGSVMEKSGLALT-KEIVSGDLTFPKEIAGLDFKEDI 587 DYLQWICT LPG V++KS + KE + DL KE DF ED+ Sbjct: 44 DYLQWICTMGVFFFFVVLFQMFLPGLVIDKSDKPWSNKEFLPPDLVVFKERGFFDFGEDV 103 Query: 588 RFKP-KILARFEKEAREANL-SSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAA 761 R +P K+L +F++E N SS+ N T QR+G+RKP+LALVFADLL DP Q+LMVTV+ Sbjct: 104 RLEPTKLLMKFQRETNALNFTSSSLNTTLQRFGFRKPKLALVFADLLADPEQLLMVTVSK 163 Query: 762 ALQEIGYAIQVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARD 941 AL EIGYA++VYSLEDGPV +W N+ + VT+++TN + +DWL+YDG++VNSLEAR Sbjct: 164 ALLEIGYAVEVYSLEDGPVHGIWQNMGVSVTILETNHASSCVIDWLSYDGVIVNSLEARS 223 Query: 942 IFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVL 1121 +F+CFMQEPFKSLPL+W I+E TLA R R YNS+GQT LL DWKK+F+RA+VVVF NY+L Sbjct: 224 MFTCFMQEPFKSLPLVWVINEETLAVRSRQYNSTGQTELLTDWKKIFSRASVVVFHNYLL 283 Query: 1122 PMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRD 1301 P++YS DAGN++VIPGSP EAW+A N + K DD VI+IVGSQF Y+ Sbjct: 284 PILYSEFDAGNFYVIPGSPEEAWKAKNLDIPRK-----------DDMVISIVGSQFLYKG 332 Query: 1302 LWLEHALILKAIYPLFTMFPFYNNP--NSHLKIIILSGDSTSNYDVAVEAISLNLQYPRG 1475 WLEHAL+L+A+ PLF+ YN+ NS LKII+L G+S SNY VA+E IS NL YP+ Sbjct: 333 QWLEHALLLQALRPLFS---GYNSERYNSRLKIIVLGGESASNYSVAIETISQNLTYPKE 389 Query: 1476 VVKHIAIGGDADNTLSAADLVIYGSFLEE 1562 VKH++I G+ D L ++DLV+YGSFLEE Sbjct: 390 AVKHVSIAGNVDKILESSDLVLYGSFLEE 418 >ref|XP_002872903.1| glycosyltransferase family protein 1 [Arabidopsis lyrata subsp. lyrata] gi|297318740|gb|EFH49162.1| glycosyltransferase family protein 1 [Arabidopsis lyrata subsp. lyrata] Length = 1018 Score = 416 bits (1070), Expect = e-113 Identities = 213/387 (55%), Positives = 278/387 (71%), Gaps = 3/387 (0%) Frame = +3 Query: 411 DYLQWICTXXXXXXXXXXXXXXLPGSVMEKSGLALT-KEIVSGDLTFPKEIAGLDFKEDI 587 +YLQWI + LPG V++KS T KEI+ DL +E LDF +D+ Sbjct: 50 NYLQWISSICVFFFFVVLFQMFLPGLVIDKSDKPWTSKEILPPDLLGFREKGFLDFGDDV 109 Query: 588 RFKP-KILARFEKEAREANL-SSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAA 761 RF+P K+L +F++EA N SS+ N T QR+G+RKP+LALVFADLL DP Q+LMV+++ Sbjct: 110 RFEPTKLLMKFQREANGLNFTSSSLNTTLQRFGFRKPKLALVFADLLADPEQVLMVSLSK 169 Query: 762 ALQEIGYAIQVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARD 941 ALQEIGYAI+VYSLEDGPV ++W + +PVT+++TN + +DWL+YDGI+VNSL A+ Sbjct: 170 ALQEIGYAIEVYSLEDGPVNSIWRKMGVPVTILKTNHASSCVIDWLSYDGIIVNSLRAKS 229 Query: 942 IFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVL 1121 +F+CFMQEPFKSLPLIW I+E TLA R R YNS GQT LLNDWKK+F+RA+VVVF NY+L Sbjct: 230 MFTCFMQEPFKSLPLIWVINEETLAVRSRQYNSIGQTELLNDWKKIFSRASVVVFHNYLL 289 Query: 1122 PMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRD 1301 P++Y+ DAGN++VIPGSP + W+A N + DD VI+IVGSQF Y+ Sbjct: 290 PILYTEFDAGNFYVIPGSPEDVWKAKNLEFPPQK----------DDVVISIVGSQFLYKG 339 Query: 1302 LWLEHALILKAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVV 1481 WLEHAL+L+A+ PLF + + SHLKII+L G+S SNY VA+E IS NL YP+ V Sbjct: 340 QWLEHALLLQALRPLFP-GNYLESDTSHLKIIVLGGESASNYSVAIETISQNLTYPKDAV 398 Query: 1482 KHIAIGGDADNTLSAADLVIYGSFLEE 1562 KH++I G+ D L ++DLVIYGSFLEE Sbjct: 399 KHVSIAGNVDKILESSDLVIYGSFLEE 425 >ref|XP_007135157.1| hypothetical protein PHAVU_010G105900g [Phaseolus vulgaris] gi|561008202|gb|ESW07151.1| hypothetical protein PHAVU_010G105900g [Phaseolus vulgaris] Length = 1034 Score = 414 bits (1064), Expect = e-113 Identities = 211/388 (54%), Positives = 272/388 (70%), Gaps = 4/388 (1%) Frame = +3 Query: 411 DYLQWICTXXXXXXXXXXXXXXLPGSVMEKSGLALTKEIVSGDLTFPK-EIAGL--DFKE 581 DY+QWICT LPGSV+E S +L + D F EI + D E Sbjct: 45 DYVQWICTVVVFLCLVVVFQMFLPGSVVENSEESLKAVKMRSDNLFHYGEIQKVVSDIGE 104 Query: 582 DIRFKPKILARFEKEAREANLSSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAA 761 D F P IL +F + + FN T Q +GYRKPQLA+VF +LLVD HQ+LMVTVA Sbjct: 105 DAVFLPMILEKFRRRGGGGMDAGLFNHTVQHFGYRKPQLAMVFGELLVDSHQLLMVTVAT 164 Query: 762 ALQEIGYAIQVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARD 941 ALQEIGY IQV+SLEDGP VW NL +P+T+ +T VDWLNYDGI+++SLEA+ Sbjct: 165 ALQEIGYEIQVFSLEDGPGHNVWSNLGVPITIFRTCDKRNNTVDWLNYDGIIMSSLEAKG 224 Query: 942 IFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVL 1121 FSCF+QEPFKS+PLIW +HE LA R R Y ++GQ +LNDW +VFNR+TVVVFPNY L Sbjct: 225 AFSCFLQEPFKSIPLIWIVHENALAYRSRQYTTNGQIEILNDWGRVFNRSTVVVFPNYAL 284 Query: 1122 PMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRD 1301 PMIYS DAGN+FVIPGSP EA EA+ A+ KDNLR +GYG +D ++AIVGSQF Y+ Sbjct: 285 PMIYSTFDAGNFFVIPGSPAEALEAEAFMALQKDNLRVNMGYGPEDVIVAIVGSQFLYKG 344 Query: 1302 LWLEHALILKAIYPLFTMFPF-YNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGV 1478 +WL HA++L+A+ PL T FP +N ++ L+II+ SG+ T+NY VA+E ++ +L+YPRG+ Sbjct: 345 MWLGHAIVLRALEPLVTNFPSNKDNSSAQLRIIVHSGELTNNYSVALETMAHSLKYPRGI 404 Query: 1479 VKHIAIGGDADNTLSAADLVIYGSFLEE 1562 ++HIA +AD+ L AD+V+YGSFLEE Sbjct: 405 IEHIAGDLNADSILGTADVVVYGSFLEE 432 >ref|NP_192030.4| glycosyl transferase family 1 protein [Arabidopsis thaliana] gi|332656594|gb|AEE81994.1| glycosyl transferase family 1 protein [Arabidopsis thaliana] gi|591401974|gb|AHL38714.1| glycosyltransferase, partial [Arabidopsis thaliana] Length = 1031 Score = 402 bits (1032), Expect = e-109 Identities = 205/387 (52%), Positives = 273/387 (70%), Gaps = 3/387 (0%) Frame = +3 Query: 411 DYLQWICTXXXXXXXXXXXXXXLPGSVMEKSGLA-LTKEIVSGDLTFPKEIAGLDFKEDI 587 +YL WI LPG V++KS ++KEI+ DL +E LDF +D+ Sbjct: 51 NYLLWISIICVFFFFAVLFQMFLPGLVIDKSDKPWISKEILPPDLVGFREKGFLDFGDDV 110 Query: 588 RFKP-KILARFEKEAREANL-SSTFNRTRQRYGYRKPQLALVFADLLVDPHQILMVTVAA 761 R +P K+L +F+++A N SS+ N T QR+G+RKP+LALVF DLL DP Q+LMV+++ Sbjct: 111 RIEPTKLLMKFQRDAHGFNFTSSSLNTTLQRFGFRKPKLALVFGDLLADPEQVLMVSLSK 170 Query: 762 ALQEIGYAIQVYSLEDGPVRAVWGNLRIPVTLIQTNASMGINVDWLNYDGILVNSLEARD 941 ALQE+GYAI+VYSLEDGPV ++W + +PVT+++ N +DWL+YDGI+VNSL AR Sbjct: 171 ALQEVGYAIEVYSLEDGPVNSIWQKMGVPVTILKPNQESSCVIDWLSYDGIIVNSLRARS 230 Query: 942 IFSCFMQEPFKSLPLIWTIHERTLATRLRHYNSSGQTGLLNDWKKVFNRATVVVFPNYVL 1121 +F+CFMQEPFKSLPLIW I+E TLA R R YNS+GQT LL DWKK+F+RA+VVVF NY+L Sbjct: 231 MFTCFMQEPFKSLPLIWVINEETLAVRSRQYNSTGQTELLTDWKKIFSRASVVVFHNYLL 290 Query: 1122 PMIYSPCDAGNYFVIPGSPTEAWEADNANAIYKDNLREVLGYGLDDFVIAIVGSQFFYRD 1301 P++Y+ DAGN++VIPGSP E +A N + DD VI+IVGSQF Y+ Sbjct: 291 PILYTEFDAGNFYVIPGSPEEVCKAKNLEFPPQK----------DDVVISIVGSQFLYKG 340 Query: 1302 LWLEHALILKAIYPLFTMFPFYNNPNSHLKIIILSGDSTSNYDVAVEAISLNLQYPRGVV 1481 WLEHAL+L+A+ PLF+ + + NSHLKII+L G++ SNY VA+E IS NL YP+ V Sbjct: 341 QWLEHALLLQALRPLFS-GNYLESDNSHLKIIVLGGETASNYSVAIETISQNLTYPKEAV 399 Query: 1482 KHIAIGGDADNTLSAADLVIYGSFLEE 1562 KH+ + G+ D L ++DLVIYGSFLEE Sbjct: 400 KHVRVAGNVDKILESSDLVIYGSFLEE 426