BLASTX nr result
ID: Akebia27_contig00016162
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00016162
         (1649 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_007220285.1| hypothetical protein PRUPE_ppa000692mg [Prun...   229   2e-57
ref|XP_002276292.2| PREDICTED: uncharacterized protein LOC100262...   227   1e-56
emb|CBI40456.3| unnamed protein product [Vitis vinifera]              227   1e-56
ref|XP_007051668.1| Glycosyl transferase family 1 protein isofor...   219   3e-54
ref|XP_007051667.1| Glycosyl transferase family 1 protein isofor...   219   3e-54
ref|XP_004159777.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   218   6e-54
ref|XP_004138457.1| PREDICTED: uncharacterized protein LOC101212...   218   6e-54
emb|CAN69310.1| hypothetical protein VITISV_003086 [Vitis vinifera]   218   6e-54
gb|EXB52710.1| hypothetical protein L484_022487 [Morus notabilis]     213   2e-52
ref|XP_004306713.1| PREDICTED: uncharacterized protein LOC101302...   213   2e-52
ref|XP_002301386.2| glycosyltransferase family protein [Populus ...   202   4e-49
ref|XP_006339650.1| PREDICTED: uncharacterized protein LOC102591...   201   6e-49
ref|XP_004229962.1| PREDICTED: uncharacterized protein LOC101246...   196   2e-47
ref|XP_006444916.1| hypothetical protein CICLE_v10018649mg [Citr...   195   6e-47
ref|XP_002320170.1| glycosyltransferase family protein [Populus ...   193   2e-46
ref|XP_006396290.1| hypothetical protein EUTSA_v10028385mg [Eutr...   191   6e-46
ref|XP_002872903.1| glycosyltransferase family protein 1 [Arabid...   186   2e-44
ref|XP_002511940.1| transferase, transferring glycosyl groups, p...   177   9e-42
ref|NP_192030.4| glycosyl transferase family 1 protein [Arabidop...   174   1e-40
ref|XP_006286696.1| hypothetical protein CARUB_v10002775mg [Caps...   167   2e-38

>ref|XP_007220285.1| hypothetical protein PRUPE_ppa000692mg [Prunus persica]
 gi|462416747|gb|EMJ21484.1| hypothetical protein PRUPE_ppa000692mg [Prunus persica]
          Length = 1034

 Score = 229 bits (585), Expect = 2e-57
 Identities = 122/213 (57%), Positives = 146/213 (68%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE GVPLKR P PRS+F+RFLL +K+DYLQWICT+A
Sbjct: 1   MGSLESGVPLKRDPLLRSSSTGRTERHPFLQRPRSKFSRFLLIKKLDYLQWICTVAVFLF 60

Query: 527 XXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDRE 706
           QMFLPGSV+EKS L S D LK +G LDFGE IRFEPSKLLEKF +E
Sbjct: 61  FVVLFQMFLPGSVVEKSRVLMKNVELNSEDLRFLKELGLLDFGEDIRFEPSKLLEKFQKE 120

Query: 707 AREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVYSLE 886
           ARE +++SA++R R G RKPQLALVFADL V QLLM++VA +L+EIGY VYSLE
Sbjct: 121 AREASLTSAMNRTRQHFGYRKPQLALVFADLSVASQQLLMVTVAAALQEIGYAFSVYSLE 180

Query: 887 DGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           DGPVH VW ++GVPVTI+Q ++ E+ +DWLNY
Sbjct: 181 DGPVHDVWRSLGVPVTIIQTYDQSELNIDWLNY 213

>ref|XP_002276292.2| PREDICTED: uncharacterized protein LOC100262009 [Vitis vinifera]
          Length = 1026

 Score = 227 bits (579), Expect = 1e-56
 Identities = 124/213 (58%), Positives = 147/213 (69%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE GVP+KR P P RF+RFL F K+DYLQW+CT+A
Sbjct: 1   MGSLENGVPVKRDPLLRSSSNKGSAFQR----PIVRFSRFLFFGKLDYLQWVCTVAVFCF 56

Query: 527 XXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDRE 706
           QMFLPG +MEKSG S GD +K +G LDFGEGIRFEPSKLL+KF +E
Sbjct: 57  FVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKNIGGLDFGEGIRFEPSKLLQKFQKE 116

Query: 707 AREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVYSLE 886
           A EVN+SSA SR R R G RKPQLALVF DLLV+P QLLM++VA++L E+GYTIQVYSLE
Sbjct: 117 ADEVNLSSA-SRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQVYSLE 175

Query: 887 DGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           DGPV+ +W N+G PVTI+++N K VDWLNY
Sbjct: 176 DGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNY 208

>emb|CBI40456.3| unnamed protein product [Vitis vinifera]
          Length = 1026

 Score = 227 bits (579), Expect = 1e-56
 Identities = 124/213 (58%), Positives = 147/213 (69%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE GVP+KR P P RF+RFL F K+DYLQW+CT+A
Sbjct: 1   MGSLENGVPVKRDPLLRSSSNKGSAFQR----PIVRFSRFLFFGKLDYLQWVCTVAVFCF 56

Query: 527 XXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDRE 706
           QMFLPG +MEKSG S GD +K +G LDFGEGIRFEPSKLL+KF +E
Sbjct: 57  FVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKNIGGLDFGEGIRFEPSKLLQKFQKE 116

Query: 707 AREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVYSLE 886
           A EVN+SSA SR R R G RKPQLALVF DLLV+P QLLM++VA++L E+GYTIQVYSLE
Sbjct: 117 ADEVNLSSA-SRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQVYSLE 175

Query: 887 DGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           DGPV+ +W N+G PVTI+++N K VDWLNY
Sbjct: 176 DGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNY 208

>ref|XP_007051668.1| Glycosyl transferase family 1 protein isoform 2 [Theobroma cacao]
 gi|508703929|gb|EOX95825.1| Glycosyl transferase family 1 protein isoform 2 [Theobroma cacao]
          Length = 686

 Score = 219 bits (558), Expect = 3e-54
 Identities = 123/216 (56%), Positives = 148/216 (68%), Gaps = 3/216 (1%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE G+ LKR PRSRF+RFLLF+K+DYLQWICT+
Sbjct: 1   MGSLESGISLKRA-------GSRNERNPFLNRPRSRFSRFLLFKKLDYLQWICTVVVFLF 53

Query: 527 XXXXXQMFLPGSVMEKSGGSG-DEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDR 703
           QM+LPGSVM+KS S ++ L+ G+ LK MG LDFGE IR EP KLLEKF R
Sbjct: 54  FVVFFQMYLPGSVMDKSQDSFLEDKDLVYGELRYLKEMGGLDFGEDIRLEPRKLLEKFQR 113

Query: 704 EAREVNI--SSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVY 877
           E + +N+ SS +R + R RKPQLALVFADLLV+P QLLM+++A +LREIGY IQVY
Sbjct: 114 ENKVLNLESSSGFNRSQHRFQYRKPQLALVFADLLVDPQQLLMVTIATALREIGYAIQVY 173

Query: 878 SLEDGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           SLEDGPVH VW +IGVPV++LQ N+ EI VDWLNY
Sbjct: 174 SLEDGPVHNVWQSIGVPVSVLQVNSN-EIGVDWLNY 208
>ref|XP_007051667.1| Glycosyl transferase family 1 protein isoform 1 [Theobroma cacao]
 gi|508703928|gb|EOX95824.1| Glycosyl transferase family 1 protein isoform 1 [Theobroma cacao]
          Length = 1026

 Score = 219 bits (558), Expect = 3e-54
 Identities = 123/216 (56%), Positives = 148/216 (68%), Gaps = 3/216 (1%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE G+ LKR PRSRF+RFLLF+K+DYLQWICT+
Sbjct: 1   MGSLESGISLKRA-------GSRNERNPFLNRPRSRFSRFLLFKKLDYLQWICTVVVFLF 53

Query: 527 XXXXXQMFLPGSVMEKSGGSG-DEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDR 703
           QM+LPGSVM+KS S ++ L+ G+ LK MG LDFGE IR EP KLLEKF R
Sbjct: 54  FVVFFQMYLPGSVMDKSQDSFLEDKDLVYGELRYLKEMGGLDFGEDIRLEPRKLLEKFQR 113

Query: 704 EAREVNI--SSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVY 877
           E + +N+ SS +R + R RKPQLALVFADLLV+P QLLM+++A +LREIGY IQVY
Sbjct: 114 ENKVLNLESSSGFNRSQHRFQYRKPQLALVFADLLVDPQQLLMVTIATALREIGYAIQVY 173

Query: 878 SLEDGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           SLEDGPVH VW +IGVPV++LQ N+ EI VDWLNY
Sbjct: 174 SLEDGPVHNVWQSIGVPVSVLQVNSN-EIGVDWLNY 208

>ref|XP_004159777.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101212216 [Cucumis sativus]
          Length = 1037

 Score = 218 bits (555), Expect = 6e-54
 Identities = 116/213 (54%), Positives = 144/213 (67%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE G PLKR P PRSRF+RFL F K+DYLQWICT+A
Sbjct: 1   MGSLENGFPLKRDPLLRSSSSVRGERYPFLQRPRSRFSRFLFFRKIDYLQWICTVAVFFF 60

Query: 527 XXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDRE 706
           QMFLPGSV+EKS + + GD LK +G LDFGE IRFEPSKLL KF +E
Sbjct: 61  FVVLFQMFLPGSVVEKSEVALKDVEKSLGDLKFLKELGMLDFGEDIRFEPSKLLGKFKKE 120

Query: 707 AREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVYSLE 886
           ARE + SS +R R R G RKPQLALVF+DLLV+ Q+LM+++A++L+EIGY QVYSL+
Sbjct: 121 AREADFSS-FNRTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSLQ 179

Query: 887 DGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           GP + VW +GVPVT++Q+ ++ E+ VDWLNY
Sbjct: 180 GGPANDVWRQMGVPVTLIQSCDETEVMVDWLNY 212

>ref|XP_004138457.1| PREDICTED: uncharacterized protein LOC101212216 [Cucumis sativus]
          Length = 1037

 Score = 218 bits (555), Expect = 6e-54
 Identities = 116/213 (54%), Positives = 144/213 (67%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE G PLKR P PRSRF+RFL F K+DYLQWICT+A
Sbjct: 1   MGSLENGFPLKRDPLLRSSSSVRGERYPFLQRPRSRFSRFLFFRKIDYLQWICTVAVFFF 60

Query: 527 XXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDRE 706
           QMFLPGSV+EKS + + GD LK +G LDFGE IRFEPSKLL KF +E
Sbjct: 61  FVVLFQMFLPGSVVEKSEVALKDVEKSLGDLKFLKELGMLDFGEDIRFEPSKLLGKFKKE 120

Query: 707 AREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVYSLE 886
           ARE + SS +R R R G RKPQLALVF+DLLV+ Q+LM+++A++L+EIGY QVYSL+
Sbjct: 121 AREADFSS-FNRTRSRFGYRKPQLALVFSDLLVDSYQVLMVTIASALQEIGYVFQVYSLQ 179

Query: 887 DGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           GP + VW +GVPVT++Q+ ++ E+ VDWLNY
Sbjct: 180 GGPANDVWRQMGVPVTLIQSCDETEVMVDWLNY 212

>emb|CAN69310.1| hypothetical protein VITISV_003086 [Vitis vinifera]
          Length = 1040

 Score = 218 bits (555), Expect = 6e-54
 Identities = 124/227 (54%), Positives = 147/227 (64%), Gaps = 14/227 (6%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE GVP+KR P P RF+RFL F K+DYLQW+CT+A
Sbjct: 1   MGSLENGVPVKRDPLLRSSSNKGSAFQR----PIVRFSRFLFFGKLDYLQWVCTVAVFCF 56

Query: 527 XXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDRE 706
           QMFLPG +MEKSG S GD +K +G LDFGEGIRFEPSKLL+KF +E
Sbjct: 57  FVVLFQMFLPGLIMEKSGESLKNMENGYGDLSFIKKIGGLDFGEGIRFEPSKLLQKFQKE 116

Query: 707 AREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYT------- 865
           A EVN+SSA SR R R G RKPQLALVF DLLV+P QLLM++VA++L E+GYT
Sbjct: 117 ADEVNLSSA-SRLRHRFGYRKPQLALVFPDLLVDPQQLLMVTVASALLEMGYTIQALPYL 175

Query: 866 -------IQVYSLEDGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           IQVYSLEDGPV+ +W N+G PVTI+++N K VDWLNY
Sbjct: 176 VSIYVAWIQVYSLEDGPVNAIWRNVGFPVTIIRSNAKSAAVVDWLNY 222

>gb|EXB52710.1| hypothetical protein L484_022487 [Morus notabilis]
          Length = 1040

 Score = 213 bits (543), Expect = 2e-52
 Identities = 119/216 (55%), Positives = 144/216 (66%), Gaps = 3/216 (1%)
 Frame = +2

Query: 347 MGSLEVG--VPLKRGPXXXXXXXXXXXXXXXXXX-PRSRFARFLLFEKVDYLQWICTIAX 517
           MGSLE G P KR P RSRF+RF LF+K+DYLQWICT+A
Sbjct: 1   MGSLEGGSATPFKRDPFLRSASFTGRSDRNPFLQRQRSRFSRFFLFKKLDYLQWICTVAV 60

Query: 518 XXXXXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKF 697
           QMFLPGSV+EKS + + SGD LK G LDFGE IRFEPSK+LEKF
Sbjct: 61  FLFFVVLFQMFLPGSVVEKSIKTHRDEEFSSGDLFFLKEYGILDFGEDIRFEPSKVLEKF 120

Query: 698 DREAREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVY 877
           RE +EVN+S A +R R+R +KPQLALVFADLLV+ QLLM++VA +L+EIGY IQVY
Sbjct: 121 RRENKEVNLSHAFNRSRLRYPHKKPQLALVFADLLVDSQQLLMVTVAAALQEIGYEIQVY 180

Query: 878 SLEDGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           SLE GPVH +W N+GVPV+I+Q + ++ VDWL Y
Sbjct: 181 SLEGGPVHGIWRNLGVPVSIIQACDPADVTVDWLIY 216

>ref|XP_004306713.1| PREDICTED: uncharacterized protein LOC101302584 [Fragaria vesca subsp. vesca]
          Length = 1039

 Score = 213 bits (542), Expect = 2e-52
 Identities = 117/217 (53%), Positives = 143/217 (65%), Gaps = 4/217 (1%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXX--PRSRFARFLLFEKVDYLQWICTIAXX 520
           MGSLE GVPLKR P PRSRF+RFL+ +K+DYL WICT+A
Sbjct: 1   MGSLESGVPLKRDPLLRSSSNGGRSSDRHLFLQRPRSRFSRFLILKKLDYLLWICTVAVF 60

Query: 521 XXXXXXXQMFLPGSVMEKSGGSGDEPG--LISGDWVGLKWMGELDFGEGIRFEPSKLLEK 694
           QMFLPGSV+EKSG + L GD +K +G LDFGE IRFEPSKLLEK
Sbjct: 61  LFFVVLFQMFLPGSVVEKSGSLLQKKNVELDYGDLRFVKELGLLDFGEDIRFEPSKLLEK 120

Query: 695 FDREAREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQV 874
           F +E RE ++SS +R G+RKPQLALVFADLL + QL M++VA +L+EIGY + V
Sbjct: 121 FRKEGREASLSSGFNRTLQHFGLRKPQLALVFADLLFDSHQLQMVTVAAALQEIGYELWV 180

Query: 875 YSLEDGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           YSLEDGP W ++GVPVTI+Q ++P+I VDWLNY
Sbjct: 181 YSLEDGPARGAWKSLGVPVTIIQTCDQPKIVVDWLNY 217

>ref|XP_002301386.2| glycosyltransferase family protein [Populus trichocarpa]
 gi|550345174|gb|EEE80659.2| glycosyltransferase family protein [Populus trichocarpa]
          Length = 984

 Score = 202 bits (514), Expect = 4e-49
 Identities = 110/187 (58%), Positives = 139/187 (74%), Gaps = 6/187 (3%)
 Frame = +2

Query: 443 PRSRFARFLLFEKVDYLQWICTIAXXXXXXXXXQMFLPGSVMEKSG-GSGDEPG--LISG 613
           PRSR +RFLLF+K+DY+QWICT+A QMFLPGSV+EKS GS G L++
Sbjct: 36  PRSRLSRFLLFKKLDYIQWICTVAVFLFFVVLFQMFLPGSVVEKSELGSSPWRGMELVNK 95

Query: 614 DWVGLKWMGELDFGEGIRFEPSKLLEKFDREAREVNI---SSALSRPRVRSGVRKPQLAL 784
           D + LK +G LDFGE I+FEPSK+L+KF +E RE+N+ + LSR RKPQLAL
Sbjct: 96  DLLYLKEIGGLDFGEDIKFEPSKILQKFRKENREMNMPFTNGTLSR----FPYRKPQLAL 151

Query: 785 VFADLLVEPGQLLMISVANSLREIGYTIQVYSLEDGPVHVVWTNIGVPVTILQNNNKPEI 964
           VFADLLV+P QLLM++VA +L+EIGYTI VY+L DGPV +W ++G PVTI+Q ++K EI
Sbjct: 152 VFADLLVDPQQLLMVTVATALQEIGYTIHVYTLRDGPVQNIWKSMGYPVTIIQMSHKLEI 211

Query: 965 AVDWLNY 985
           AVDWLNY
Sbjct: 212 AVDWLNY 218

>ref|XP_006339650.1| PREDICTED: uncharacterized protein LOC102591393 [Solanum tuberosum]
          Length = 1038

 Score = 201 bits (512), Expect = 6e-49
 Identities = 110/213 (51%), Positives = 137/213 (64%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE GV LK+ RSRFARFL +K++YLQWICT+A
Sbjct: 1   MGSLENGVSLKKDQNLLRSSSATGRNVFGQRQVRSRFARFLFVKKINYLQWICTVAVFFF 60

Query: 527 XXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDRE 706
           QM LPGSVMEKSG + + GD LK +G LDFGE I+FEP KLL KF E
Sbjct: 61  FVVLFQMLLPGSVMEKSGNLTQDSEVGYGDLALLKELGGLDFGEDIKFEPLKLLAKFHDE 120

Query: 707 AREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVYSLE 886
           A E N + SR VR G RKP+LALVFA+LLV+P Q++M++VA +LREIGY I+V SLE
Sbjct: 121 AVEAN-GTVASRTVVRFGYRKPKLALVFANLLVDPYQIMMVNVAAALREIGYEIEVLSLE 179

Query: 887 DGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           DGPV +W ++GVPV I+ + +I++DWLNY
Sbjct: 180 DGPVRSIWKDVGVPVIIMNTDGHTKISLDWLNY 212

>ref|XP_004229962.1| PREDICTED: uncharacterized protein LOC101246380 [Solanum lycopersicum]
          Length = 1038

 Score = 196 bits (499), Expect = 2e-47
 Identities = 109/213 (51%), Positives = 136/213 (63%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE GV LK+ RSRFARFL +K++YLQWICT+A
Sbjct: 1   MGSLENGVSLKKDQNLLRSSSATGRNAFGQRQVRSRFARFLFVKKINYLQWICTVAVFFF 60

Query: 527 XXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDRE 706
           QM LPGSVMEKSG + + GD LK +G LDFGE I+FEP KLL KF E
Sbjct: 61  FVVLFQMLLPGSVMEKSGNLTLDSEVGYGDLALLKELGGLDFGEDIKFEPLKLLAKFREE 120

Query: 707 AREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVYSLE 886
           A E N + SR VR G RKP+LALVF++L V+P Q++M++VA +LREIGY I+V SLE
Sbjct: 121 AVEAN-GTVASRIVVRFGYRKPKLALVFSNLSVDPYQIMMVNVAAALREIGYEIEVLSLE 179

Query: 887 DGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           DGPV +W +IGVPV I+ + +I++DWLNY
Sbjct: 180 DGPVRSIWKDIGVPVIIMNTDGHTKISLDWLNY 212

>ref|XP_006444916.1| hypothetical protein CICLE_v10018649mg [Citrus clementina]
 gi|568876282|ref|XP_006491210.1| PREDICTED: uncharacterized protein LOC102628793 [Citrus sinensis]
 gi|557547178|gb|ESR58156.1| hypothetical protein CICLE_v10018649mg [Citrus clementina]
          Length = 1038

 Score = 195 bits (495), Expect = 6e-47
 Identities = 112/217 (51%), Positives = 136/217 (62%), Gaps = 4/217 (1%)
 Frame = +2

Query: 347 MGSLEVG--VPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXX 520
           MGSLE G VPLKR RSRF+RFL F+K+DYL WICT+A
Sbjct: 1   MGSLESGLVVPLKRDNLGRSSSRTERQHSFLQRN-RSRFSRFLFFKKLDYLLWICTVAVF 59

Query: 521 XXXXXXXQMFLPGSV--MEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEK 694
           Q+FLPGSV M++S GS + + D + LK MG LDFGE + F P KL+EK
Sbjct: 60  LFFVVIFQLFLPGSVTVMDESQGSLRDFDKVPADLMFLKEMGLLDFGEEVTFLPLKLMEK 119

Query: 695 FDREAREVNISSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQV 874
           F E ++VN++S R R G RKPQLALVF DLL++P QL M+++A +LREIGY IQV
Sbjct: 120 FQSEDKDVNLTSVFHRKLHRFGYRKPQLALVFPDLLIDPQQLQMVTIAIALREIGYAIQV 179

Query: 875 YSLEDGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           YSLEDG H VW NIGVPV ILQ + V+WLNY
Sbjct: 180 YSLEDGRAHEVWRNIGVPVAILQTGREKASFVNWLNY 216

>ref|XP_002320170.1| glycosyltransferase family protein [Populus trichocarpa]
 gi|222860943|gb|EEE98485.1| glycosyltransferase family protein [Populus trichocarpa]
          Length = 990

 Score = 193 bits (490), Expect = 2e-46
 Identities = 105/184 (57%), Positives = 133/184 (72%), Gaps = 3/184 (1%)
 Frame = +2

Query: 443 PRSRFARFLLFEKVDYLQWICTIAXXXXXXXXXQMFLPGSVMEKSG-GSGDEPG--LISG 613
           PRS F+RFL F+K+DY+QWICT+A QMFLPGSV+EKS GS G L+
Sbjct: 36  PRSSFSRFLRFKKLDYIQWICTVAVFLFFVVLFQMFLPGSVVEKSELGSSPWRGMELVDK 95

Query: 614 DWVGLKWMGELDFGEGIRFEPSKLLEKFDREAREVNISSALSRPRVRSGVRKPQLALVFA 793
           D LK +G LDFGE I+F+PSK+L+ F +E RE+N+S + +R R RKPQLALVFA
Sbjct: 96  DLWYLKEIGGLDFGEDIKFQPSKILQHFRKENREMNMSFS-NRTLSRFPYRKPQLALVFA 154

Query: 794 DLLVEPGQLLMISVANSLREIGYTIQVYSLEDGPVHVVWTNIGVPVTILQNNNKPEIAVD 973
           DLLV+P QLLM++VA +L+EIGYTI VYSL DGP +W ++ PV I+Q ++K EIAVD
Sbjct: 155 DLLVDPHQLLMVTVATALQEIGYTIHVYSLGDGPAQSIWKSMRSPVNIIQISHKMEIAVD 214

Query: 974 WLNY 985
           WLNY
Sbjct: 215 WLNY 218

>ref|XP_006396290.1| hypothetical protein EUTSA_v10028385mg [Eutrema salsugineum]
 gi|557097307|gb|ESQ37743.1| hypothetical protein EUTSA_v10028385mg [Eutrema salsugineum]
          Length = 1022

 Score = 191 bits (486), Expect = 6e-46
 Identities = 103/214 (48%), Positives = 131/214 (61%), Gaps = 1/214 (0%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE G+P KR RSR +RF LF+++DYLQWICT+
Sbjct: 1   MGSLESGIPAKRESGVRAARQQQHPFLQRN---RSRLSRFFLFKRLDYLQWICTMGVFFF 57

Query: 527 XXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDRE 706
           QMFLPG V++KS + D V K G DFGE +R EP+KLL KF RE
Sbjct: 58  FVVLFQMFLPGLVIDKSDKPWSNKEFLPPDLVVFKERGFFDFGEDVRLEPTKLLMKFQRE 117

Query: 707 AREVNI-SSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVYSL 883
           +N SS+L+ R G RKP+LALVFADLL +P QLLM++V+ +L EIGY ++VYSL
Sbjct: 118 TNALNFTSSSLNTTLQRFGFRKPKLALVFADLLADPEQLLMVTVSKALLEIGYAVEVYSL 177

Query: 884 EDGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           EDGPVH +W N+GV VTIL+ N+ +DWL+Y
Sbjct: 178 EDGPVHGIWQNMGVSVTILETNHASSCVIDWLSY 211

>ref|XP_002872903.1| glycosyltransferase family protein 1 [Arabidopsis lyrata subsp. lyrata]
 gi|297318740|gb|EFH49162.1| glycosyltransferase family protein 1 [Arabidopsis lyrata subsp. lyrata]
          Length = 1018

 Score = 186 bits (473), Expect = 2e-44
 Identities = 101/217 (46%), Positives = 134/217 (61%), Gaps = 4/217 (1%)
 Frame = +2

Query: 347 MGSLEVGVPLKR---GPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAX 517
           MGSLE G+P KR G RSR +RF L + +YLQWI +I
Sbjct: 1   MGSLESGIPTKRDNGGGRTGRQQQLQQQQQFFLQRNRSRLSRFFLLKSFNYLQWISSICV 60

Query: 518 XXXXXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKF 697
           QMFLPG V++KS ++ D +G + G LDFG+ +RFEP+KLL KF
Sbjct: 61  FFFFVVLFQMFLPGLVIDKSDKPWTSKEILPPDLLGFREKGFLDFGDDVRFEPTKLLMKF 120

Query: 698 DREAREVNI-SSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQV 874
           REA +N SS+L+ R G RKP+LALVFADLL +P Q+LM+S++ +L+EIGY I+V
Sbjct: 121 QREANGLNFTSSSLNTTLQRFGFRKPKLALVFADLLADPEQVLMVSLSKALQEIGYAIEV 180

Query: 875 YSLEDGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           YSLEDGPV+ +W +GVPVTIL+ N+ +DWL+Y
Sbjct: 181 YSLEDGPVNSIWRKMGVPVTILKTNHASSCVIDWLSY 217

>ref|XP_002511940.1| transferase, transferring glycosyl groups, putative [Ricinus communis]
 gi|223549120|gb|EEF50609.1| transferase, transferring glycosyl groups, putative [Ricinus communis]
          Length = 935

 Score = 177 bits (450), Expect = 9e-42
 Identities = 91/146 (62%), Positives = 114/146 (78%), Gaps = 1/146 (0%)
 Frame = +2

Query: 545 MFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEKFDREAREVNI 724
           MFLPGS+++KS S + ++ GD + LK MG LDFGE ++F+P KLLEKF +E REVN+
Sbjct: 16  MFLPGSMIDKSEVSLKKLEIVPGDLLYLKAMGTLDFGEDVQFQPLKLLEKFQKENREVNL 75

Query: 725 -SSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQVYSLEDGPVH 901
           SSA +R +R G RKPQLALVFADLL +P QLLM++VA +L+EIGY IQV+S+ DGPVH
Sbjct: 76  TSSAFNRTLLRFGYRKPQLALVFADLLADPQQLLMVTVATALQEIGYAIQVFSVNDGPVH 135

Query: 902 VVWTNIGVPVTILQNNNKPEIAVDWL 979
           +W IGVPVTI Q N+K EIAVDWL
Sbjct: 136 DIWKRIGVPVTIFQTNHKMEIAVDWL 161

>ref|NP_192030.4| glycosyl transferase family 1 protein [Arabidopsis thaliana]
 gi|332656594|gb|AEE81994.1| glycosyl transferase family 1 protein [Arabidopsis thaliana]
 gi|591401974|gb|AHL38714.1| glycosyltransferase, partial [Arabidopsis thaliana]
          Length = 1031

 Score = 174 bits (440), Expect = 1e-40
 Identities = 96/218 (44%), Positives = 128/218 (58%), Gaps = 5/218 (2%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXX----PRSRFARFLLFEKVDYLQWICTIA 514
           MGSLE G+P KR RSR +RF L + +YL WI I
Sbjct: 1   MGSLESGIPTKRDNGGVRGGRQQQQQQQQQQFFLQRNRSRLSRFFLLKSFNYLLWISIIC 60

Query: 515 XXXXXXXXXQMFLPGSVMEKSGGSGDEPGLISGDWVGLKWMGELDFGEGIRFEPSKLLEK 694
           QMFLPG V++KS ++ D VG + G LDFG+ +R EP+KLL K
Sbjct: 61  VFFFFAVLFQMFLPGLVIDKSDKPWISKEILPPDLVGFREKGFLDFGDDVRIEPTKLLMK 120

Query: 695 FDREAREVNI-SSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYTIQ 871
           F R+A N SS+L+ R G RKP+LALVF DLL +P Q+LM+S++ +L+E+GY I+
Sbjct: 121 FQRDAHGFNFTSSSLNTTLQRFGFRKPKLALVFGDLLADPEQVLMVSLSKALQEVGYAIE 180

Query: 872 VYSLEDGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           VYSLEDGPV+ +W +GVPVTIL+ N + +DWL+Y
Sbjct: 181 VYSLEDGPVNSIWQKMGVPVTILKPNQESSCVIDWLSY 218

>ref|XP_006286696.1| hypothetical protein CARUB_v10002775mg [Capsella rubella]
 gi|482555402|gb|EOA19594.1| hypothetical protein CARUB_v10002775mg [Capsella rubella]
          Length = 1027

 Score = 167 bits (422), Expect = 2e-38
 Identities = 94/220 (42%), Positives = 131/220 (59%), Gaps = 7/220 (3%)
 Frame = +2

Query: 347 MGSLEVGVPLKRGPXXXXXXXXXXXXXXXXXXPRSRFARFLLFEKVDYLQWICTIAXXXX 526
           MGSLE G+P KR RSR +RF+L ++++YLQ + +I
Sbjct: 1   MGSLESGIPAKRDNGGGRGGRQQLLQHQFSQRNRSRLSRFILLKRLNYLQLVSSICIFFF 60

Query: 527 XXXXXQMFLPGSVMEKSGGSGDEPGL------ISGDWVGLKWMGELDFGEGIRFEPSKLL 688
           QMFLPG V++KS D+P + + D + G DFG +R EP+KLL
Sbjct: 61  FAVLFQMFLPGLVIDKS----DKPWIRIIKDNLPPDLAVFRDKGFFDFGNEVRIEPTKLL 116

Query: 689 EKFDREAREVNI-SSALSRPRVRSGVRKPQLALVFADLLVEPGQLLMISVANSLREIGYT 865
           KF REA +N SS+L+ R RKP+LALVF DLL +P Q+LM+S++ L EIGY+
Sbjct: 117 MKFQREANALNFTSSSLNTTLQRFNFRKPKLALVFGDLLADPEQVLMVSLSRVLLEIGYS 176

Query: 866 IQVYSLEDGPVHVVWTNIGVPVTILQNNNKPEIAVDWLNY 985
           I+VYSL+DGPV+ +W +GVPVTIL+ N++ +DWL+Y
Sbjct: 177 IEVYSLKDGPVNGIWQTMGVPVTILETNHESSCVIDWLSY 216