BLASTX nr result
ID: Catharanthus23_contig00008498
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00008498
         (1831 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                      Score  E
Sequences producing significant alignments:                           (bits) Value

ref|XP_002512333.1| catalytic, putative [Ricinus communis] gi|22...   560    e-157
ref|XP_002311068.2| exostosin family protein [Populus trichocarp...   557    e-156
ref|XP_006438045.1| hypothetical protein CICLE_v10031600mg [Citr...   555    e-155
ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable gly...   545    e-152
ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g...   543    e-152
gb|EXC06151.1| putative glycosyltransferase [Morus notabilis]         543    e-151
gb|EOX99880.1| Exostosin family protein isoform 1 [Theobroma cac...   542    e-151
ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable gly...   541    e-151
gb|EMJ28657.1| hypothetical protein PRUPE_ppa005995mg [Prunus pe...   541    e-151
ref|XP_004310070.1| PREDICTED: probable glycosyltransferase At3g...   536    e-149
ref|XP_006391283.1| hypothetical protein EUTSA_v10018590mg [Eutr...   533    e-148
ref|XP_003531191.2| PREDICTED: probable glycosyltransferase At3g...   531    e-148
ref|XP_004504444.1| PREDICTED: probable glycosyltransferase At3g...   528    e-147
ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata...   528    e-147
ref|XP_003524893.1| PREDICTED: probable glucuronosyltransferase ...   527    e-147
ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana] g...   527    e-147
ref|XP_006302174.1| hypothetical protein CARUB_v10020184mg [Caps...   525    e-146
ref|XP_006848395.1| hypothetical protein AMTR_s00013p00218260 [A...
524 e-146 ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [S... 524 e-146 ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group] g... 522 e-145 >ref|XP_002512333.1| catalytic, putative [Ricinus communis] gi|223548294|gb|EEF49785.1| catalytic, putative [Ricinus communis] Length = 434 Score = 560 bits (1443), Expect = e-157 Identities = 267/392 (68%), Positives = 315/392 (80%) Frame = +1 Query: 388 CRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSVE 567 C + P PLKVYMY+LPRRF+VGMM+ ND VT ENLP WP +GL++QHSVE Sbjct: 50 CATGP--PLKVYMYDLPRRFHVGMMDHGGDAKND--TPVTGENLPTWPKNSGLRKQHSVE 105 Query: 568 YWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDRQ 747 YWLMASLLY G + +EAVRVLDPE AD NTHG MTDPETE DRQ Sbjct: 106 YWLMASLLYEGAD----EREAVRVLDPEKADAFFVPFFSSLSFNTHGHTMTDPETEIDRQ 161 Query: 748 LQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLANL 927 LQ D++ +L +S YWQ+S GRDHVIPM HPNAFRFLR +NASILIVADF RY +S++ L Sbjct: 162 LQVDVIDMLYKSKYWQKSGGRDHVIPMTHPNAFRFLRQQLNASILIVADFGRYPKSMSTL 221 Query: 928 RKDVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDVT 1107 KDVVAPYVHVVDSF DDE+ +PF +R TLLFFRG T+RKDEGK+RAKL +L GYDD+ Sbjct: 222 SKDVVAPYVHVVDSFTDDEVSNPFESRTTLLFFRGNTIRKDEGKVRAKLAKILTGYDDIH 281 Query: 1108 YAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFED 1287 + +S T E + AS +GMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IELP+ED Sbjct: 282 FERSSATAETIKASTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYED 341 Query: 1288 ELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDAV 1467 E+DYS+FS+FFS+ EA+Q YMV +LR++ KERWLEMWR+LKSISHHFE+QYPP+KEDAV Sbjct: 342 EIDYSQFSVFFSVNEAIQPGYMVDQLRQLPKERWLEMWRKLKSISHHFEFQYPPEKEDAV 401 Query: 1468 NMIWRQVKHKLPAAKLSVHRSRRLKVPDWWRR 1563 +M+WR+VKHKLP A+L+VHRSRRLK+ DWW+R Sbjct: 402 DMLWREVKHKLPGAQLAVHRSRRLKIQDWWQR 433 >ref|XP_002311068.2| exostosin family protein [Populus trichocarpa] gi|550332343|gb|EEE88435.2| exostosin family protein [Populus trichocarpa] Length = 379 Score = 557 bits (1435), Expect = e-156 Identities 
= 267/380 (70%), Positives = 310/380 (81%) Frame = +1 Query: 424 MYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSVEYWLMASLLYNGN 603 MY+LPRRFN+GMM DD TAE LP WP G+++QHSVEYWLMASLL +G Sbjct: 1 MYDLPRRFNIGMMQWKKGG-GDDTPVRTAEELPRWPVNVGVRKQHSVEYWLMASLLGSGG 59 Query: 604 EWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDRQLQADILQILRES 783 E + +EAVRVLDPE+A+ NTHGRNMTDPETEKDRQLQ D++ L++S Sbjct: 60 EGEE--REAVRVLDPEIAEAYFVPFFSSLSFNTHGRNMTDPETEKDRQLQVDLIDFLQKS 117 Query: 784 PYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLANLRKDVVAPYVHVV 963 YWQRS GRDHVIPM HPNAFRFLR VNASILIVADF RY +SL+ L KDVV+PYVH V Sbjct: 118 KYWQRSGGRDHVIPMTHPNAFRFLRQLVNASILIVADFGRYPKSLSTLSKDVVSPYVHNV 177 Query: 964 DSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDVTYAKSDPTGEGVN 1143 DSF DD+L DPF +RKTLLFFRG T+RKD+GK+RAKLE +L GYDDV Y +S PT E + Sbjct: 178 DSFKDDDLLDPFESRKTLLFFRGNTVRKDKGKVRAKLEKILAGYDDVRYERSSPTAEAIQ 237 Query: 1144 ASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFEDELDYSKFSIFFS 1323 AS QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IELP+EDE+DYS+FSIFFS Sbjct: 238 ASTQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDLIELPYEDEIDYSQFSIFFS 297 Query: 1324 IKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDAVNMIWRQVKHKLP 1503 I EA+Q DY+V++LRK K+RW+EMWR+LK ISHHFE+QYPP KEDAVN++WRQVK+KLP Sbjct: 298 INEAIQPDYLVNQLRKFPKDRWIEMWRQLKKISHHFEFQYPPVKEDAVNLLWRQVKNKLP 357 Query: 1504 AAKLSVHRSRRLKVPDWWRR 1563 A+L+VHR+ RLKVPDWW+R Sbjct: 358 GAQLAVHRNHRLKVPDWWQR 377 >ref|XP_006438045.1| hypothetical protein CICLE_v10031600mg [Citrus clementina] gi|567891051|ref|XP_006438046.1| hypothetical protein CICLE_v10031600mg [Citrus clementina] gi|568861185|ref|XP_006484086.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X1 [Citrus sinensis] gi|568861187|ref|XP_006484087.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X2 [Citrus sinensis] gi|568861189|ref|XP_006484088.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X3 [Citrus sinensis] gi|557540241|gb|ESR51285.1| hypothetical protein CICLE_v10031600mg [Citrus 
clementina] gi|557540242|gb|ESR51286.1| hypothetical protein CICLE_v10031600mg [Citrus clementina] Length = 431 Score = 555 bits (1431), Expect = e-155 Identities = 264/385 (68%), Positives = 312/385 (81%) Frame = +1 Query: 403 SSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSVEYWLMA 582 S+PL+VYMY+LPRRF+VGM++ ++ D VT+ENLP WP +G+KRQHSVEYWLMA Sbjct: 52 SAPLRVYMYDLPRRFHVGMLDHSS----PDGLPVTSENLPRWPRSSGIKRQHSVEYWLMA 107 Query: 583 SLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDRQLQADI 762 SLLY+G +EAVRV DP+ A NTHG NMTDP+TE DRQLQ +I Sbjct: 108 SLLYDGES---EEREAVRVSDPDTAQAFFVPFFSSLSFNTHGHNMTDPDTEFDRQLQIEI 164 Query: 763 LQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLANLRKDVV 942 L+ LR S YWQ+S GRDHVIPM HPNAFRFLR +NASILIVADF RY RS++NL KDVV Sbjct: 165 LEFLRNSKYWQKSGGRDHVIPMTHPNAFRFLRQQLNASILIVADFGRYPRSMSNLSKDVV 224 Query: 943 APYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDVTYAKSD 1122 APYVHVV+SF DD PDPF ARKTLLFF+G T+RKDEGK+RAKL +L GYDDV Y +S Sbjct: 225 APYVHVVESFTDDNPPDPFVARKTLLFFQGNTIRKDEGKVRAKLAKILTGYDDVHYERSA 284 Query: 1123 PTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFEDELDYS 1302 PT + + S +GMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IELPFEDE+DYS Sbjct: 285 PTTKSIKESTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPFEDEIDYS 344 Query: 1303 KFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDAVNMIWR 1482 +FS+FFSIKEA Q YM+ +LR++ K RW+EMW+RLKSISH++E+QYPPKKEDAVNM+WR Sbjct: 345 EFSVFFSIKEAGQPGYMIDQLRQIPKARWIEMWQRLKSISHYYEFQYPPKKEDAVNMVWR 404 Query: 1483 QVKHKLPAAKLSVHRSRRLKVPDWW 1557 QVK+K+P +L+VHR RRLK+PDWW Sbjct: 405 QVKNKIPGVQLAVHRHRRLKIPDWW 429 >ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase At5g25310-like [Vitis vinifera] Length = 437 Score = 545 bits (1405), Expect = e-152 Identities = 268/394 (68%), Positives = 308/394 (78%), Gaps = 3/394 (0%) Frame = +1 Query: 385 PCRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSV 564 PC S PL VYMY+LPRRF+VGM+ R + D + VTAENLP WP +GLK+QHSV Sbjct: 45 
PC-STGGGPLMVYMYDLPRRFHVGMLRRRSPA---DESPVTAENLPPWPSNSGLKKQHSV 100 Query: 565 EYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDR 744 EYW+MASLLY+G N+T +EAVRV DPE+AD NTHG NMTDP+TE DR Sbjct: 101 EYWMMASLLYDGGGGNET-REAVRVWDPEMADAFFVPFFSSLSFNTHGHNMTDPDTEFDR 159 Query: 745 QLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLAN 924 QLQ DIL+ILRES YWQRS GRDHVIPMHHPNAFRF R VN SILIVADF RY + ++N Sbjct: 160 QLQIDILKILRESKYWQRSGGRDHVIPMHHPNAFRFFREQVNTSILIVADFGRYPKEISN 219 Query: 925 LRKDVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDD- 1101 LRKDVVAPYVHVVDSF DD PDP+ +R TLLFFRG+T+RKDEG +R KL +L G DD Sbjct: 220 LRKDVVAPYVHVVDSFTDDNSPDPYESRTTLLFFRGRTIRKDEGIVRDKLVKLLAGXDDY 279 Query: 1102 --VTYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIEL 1275 + + V S QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IEL Sbjct: 280 LQLHFHHRSYLSFLVXQSTQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIEL 339 Query: 1276 PFEDELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKK 1455 P+EDE+DY++FSIFFS KEAL+ YM+ +LR++ KERW+EMWR LK ISHH+E+QYPPKK Sbjct: 340 PYEDEIDYTQFSIFFSDKEALEPGYMIEQLRQIPKERWVEMWRHLKYISHHYEFQYPPKK 399 Query: 1456 EDAVNMIWRQVKHKLPAAKLSVHRSRRLKVPDWW 1557 DA++M+WRQVKHKLP A L VHRSRRLKVPDWW Sbjct: 400 GDAIDMLWRQVKHKLPRANLDVHRSRRLKVPDWW 433 >ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis sativus] Length = 429 Score = 543 bits (1400), Expect = e-152 Identities = 263/393 (66%), Positives = 308/393 (78%) Frame = +1 Query: 385 PCRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSV 564 PC ++P PL+VYMY+LPRRFNVG++NR N D VTA P WP +GLKRQHSV Sbjct: 45 PCTTDP--PLRVYMYDLPRRFNVGILNRRNL----DQTPVTASTWPPWPRNSGLKRQHSV 98 Query: 565 EYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDR 744 EYW+M SLL+ E ++AVRV+DPE AD N+HGRNMTDP TE D Sbjct: 99 EYWMMGSLLH---EATGDGRDAVRVMDPENADAFFVPFFSSLSFNSHGRNMTDPATEVDH 155 Query: 745 QLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLAN 924 QLQ D+++ L ES YWQRS GRDHVIPM HPNAFRFLR+ VNASI IV DF RY ++++N Sbjct: 156 
QLQIDLMKFLSESKYWQRSKGRDHVIPMTHPNAFRFLRNQVNASIQIVVDFGRYPKTMSN 215 Query: 925 LRKDVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDV 1104 L KDVVAPYVHVV SF+DD PDPF +R TLLFF+GKT RKD+G IR KL +L GYDDV Sbjct: 216 LGKDVVAPYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKDDGIIRVKLAKILDGYDDV 275 Query: 1105 TYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFE 1284 Y +S T + + S QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IELP+E Sbjct: 276 HYERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYE 335 Query: 1285 DELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDA 1464 DE+DYS+F++FFS +EALQ YMV +LR+ KERW+EMW++LK IS H+E+QYPPKKEDA Sbjct: 336 DEIDYSQFTLFFSFEEALQPGYMVEKLREFPKERWIEMWKQLKEISRHYEFQYPPKKEDA 395 Query: 1465 VNMIWRQVKHKLPAAKLSVHRSRRLKVPDWWRR 1563 VNM+WRQVKHKLPA KL+VHRSRRLKVPDWW+R Sbjct: 396 VNMLWRQVKHKLPAVKLAVHRSRRLKVPDWWQR 428 >gb|EXC06151.1| putative glycosyltransferase [Morus notabilis] Length = 469 Score = 543 bits (1398), Expect = e-151 Identities = 260/385 (67%), Positives = 304/385 (78%) Frame = +1 Query: 409 PLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSVEYWLMASL 588 PL+V+MY+LPRRFNVGM+NR + D A VTA+ P WP +GLKRQHSVEYW+M SL Sbjct: 92 PLRVFMYDLPRRFNVGMLNRRS----SDQAPVTAQTWPPWPKNSGLKRQHSVEYWMMGSL 147 Query: 589 LYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDRQLQADILQ 768 LY+G+ +E VRV DPE+A+ NTHG NMTDP+T D QLQ D+L+ Sbjct: 148 LYDGDG-----REVVRVSDPEMAEAFFVPFFSSLSFNTHGHNMTDPKTRIDHQLQIDLLE 202 Query: 769 ILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLANLRKDVVAP 948 L ES YW+R GRDHVIPM HPNAFRFLR+ +NASI IV DF R+ R+++NL KDVVAP Sbjct: 203 FLGESKYWKRYGGRDHVIPMTHPNAFRFLRAELNASIQIVVDFGRHPRTMSNLGKDVVAP 262 Query: 949 YVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDVTYAKSDPT 1128 YVHVVDSF DD+L DP+ +R TLLFFRG+T RKDEG +R KL VL GYDDV Y +S T Sbjct: 263 YVHVVDSFTDDDLSDPYESRTTLLFFRGRTFRKDEGIVRVKLAKVLAGYDDVHYERSVAT 322 Query: 1129 GEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFEDELDYSKF 1308 GE + AS GMR SKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IELPFEDE+DYS+F Sbjct: 323 
GENIKASSLGMRLSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDEIDYSQF 382 Query: 1309 SIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDAVNMIWRQV 1488 S+FFS KEAL+ YMV +LRK KE+W+EMWRRLK+ISHHFE+QYPP KEDAV+M+WRQV Sbjct: 383 SLFFSFKEALEPGYMVEQLRKFPKEKWVEMWRRLKNISHHFEFQYPPNKEDAVDMLWRQV 442 Query: 1489 KHKLPAAKLSVHRSRRLKVPDWWRR 1563 KHK+P L+VHRSRRLKVPDWW+R Sbjct: 443 KHKVPGVNLAVHRSRRLKVPDWWKR 467 >gb|EOX99880.1| Exostosin family protein isoform 1 [Theobroma cacao] gi|508707985|gb|EOX99881.1| Exostosin family protein isoform 1 [Theobroma cacao] gi|508707986|gb|EOX99882.1| Exostosin family protein isoform 1 [Theobroma cacao] Length = 432 Score = 542 bits (1396), Expect = e-151 Identities = 258/385 (67%), Positives = 312/385 (81%) Frame = +1 Query: 409 PLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSVEYWLMASL 588 PL+VYMY+LPR+F+VGM++R + +++ A VT ENLP WP +G+KRQHSVEYWLMASL Sbjct: 51 PLRVYMYDLPRKFHVGMLDRRS---SEEAAPVTMENLPPWPSNSGIKRQHSVEYWLMASL 107 Query: 589 LYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDRQLQADILQ 768 LY+G + + +EAVRVLDPE AD NTHG NMTDPETE DR LQ ++L+ Sbjct: 108 LYDGQD--EDGREAVRVLDPEKADAFFVPFFSSLSFNTHGHNMTDPETEIDRHLQVELLE 165 Query: 769 ILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLANLRKDVVAP 948 L++S Y+QRS GRDHVIPM HPNAFRFLR +NASILIV DF RY +++++L KDVVAP Sbjct: 166 FLQQSKYYQRSGGRDHVIPMTHPNAFRFLREQLNASILIVVDFGRYPKTMSSLSKDVVAP 225 Query: 949 YVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDVTYAKSDPT 1128 YVHVVDSF DD+ DP+ +R TLLFFRG T+RKDEGKIR KL +L G DDV Y KS T Sbjct: 226 YVHVVDSFTDDDPLDPYESRTTLLFFRGNTVRKDEGKIRVKLAKILAGSDDVHYEKSVAT 285 Query: 1129 GEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFEDELDYSKF 1308 + + S +GMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IELP+EDE+DY++F Sbjct: 286 PKNIKMSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPYEDEIDYTEF 345 Query: 1309 SIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDAVNMIWRQV 1488 SIFFS+KEAL+ Y+V+ LR+ K RW++MW+ LK+IS H+E+QYPPKKEDAVNM+WRQV Sbjct: 346 SIFFSMKEALEPGYLVNHLRQFPKNRWVQMWKLLKNISRHYEFQYPPKKEDAVNMLWRQV 405 Query: 
1489 KHKLPAAKLSVHRSRRLKVPDWWRR 1563 KHKLP +L+VHRSRRLKVPDWWRR Sbjct: 406 KHKLPGVQLAVHRSRRLKVPDWWRR 430 >ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase At3g07620-like [Cucumis sativus] Length = 429 Score = 541 bits (1395), Expect = e-151 Identities = 262/393 (66%), Positives = 307/393 (78%) Frame = +1 Query: 385 PCRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSV 564 PC ++P PL+VYMY+LPRRFNVG++NR N D VTA P WP +GLKRQHSV Sbjct: 45 PCTTDP--PLRVYMYDLPRRFNVGILNRRNL----DQTPVTASTWPPWPRNSGLKRQHSV 98 Query: 565 EYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDR 744 EYW+M SLL+ E ++AVRV+DPE AD N+HGRNMTDP TE D Sbjct: 99 EYWMMGSLLH---EATGDGRDAVRVMDPENADAFFVPFFSSLSFNSHGRNMTDPATEVDH 155 Query: 745 QLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLAN 924 QLQ D+++ L ES YWQRS GRDHVIPM HPNAFRFLR+ VNASI IV DF RY ++++N Sbjct: 156 QLQIDLMKFLSESKYWQRSKGRDHVIPMTHPNAFRFLRNQVNASIQIVVDFGRYPKTMSN 215 Query: 925 LRKDVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDV 1104 L KDVVAPYVHVV SF+DD PDPF +R TLLFF+GKT RKD+G IR KL +L GYDDV Sbjct: 216 LGKDVVAPYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKDDGIIRVKLAKILDGYDDV 275 Query: 1105 TYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFE 1284 Y +S T + + S QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IELP+E Sbjct: 276 HYERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYE 335 Query: 1285 DELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDA 1464 DE+DYS+F++FF +EALQ YMV +LR+ KERW+EMW++LK IS H+E+QYPPKKEDA Sbjct: 336 DEIDYSQFTLFFXFEEALQPGYMVEKLREFPKERWIEMWKQLKEISRHYEFQYPPKKEDA 395 Query: 1465 VNMIWRQVKHKLPAAKLSVHRSRRLKVPDWWRR 1563 VNM+WRQVKHKLPA KL+VHRSRRLKVPDWW+R Sbjct: 396 VNMLWRQVKHKLPAVKLAVHRSRRLKVPDWWQR 428 >gb|EMJ28657.1| hypothetical protein PRUPE_ppa005995mg [Prunus persica] Length = 433 Score = 541 bits (1393), Expect = e-151 Identities = 259/385 (67%), Positives = 303/385 (78%) Frame = +1 Query: 409 PLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSVEYWLMASL 588 PLKVYMY+LPRRFNVGM+NR 
+ + A VTA P WP +GLKRQHSVEYW+M SL Sbjct: 53 PLKVYMYDLPRRFNVGMLNRKSTEQ----APVTARTWPTWPRNSGLKRQHSVEYWMMGSL 108 Query: 589 LYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDRQLQADILQ 768 L++G+ + + AVRV DPELAD NTHG +MTDP TE D QLQ D+L+ Sbjct: 109 LFDGDGGDG--RAAVRVSDPELADAFFVPFFSSLSFNTHGHHMTDPATEIDHQLQIDVLK 166 Query: 769 ILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLANLRKDVVAP 948 IL ES YWQRS GRDHVIP+ HPNAFRFLR +NASI IV DF RY ++NL KDVV+P Sbjct: 167 ILGESKYWQRSGGRDHVIPLTHPNAFRFLRPQINASIQIVVDFGRYPHVMSNLSKDVVSP 226 Query: 949 YVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDVTYAKSDPT 1128 YVHVVDSF DD +P+ +R TLLFF+G+T RKDEG +R KL +L GYDDV Y +S T Sbjct: 227 YVHVVDSFTDDNHSNPYESRTTLLFFQGRTFRKDEGIVRVKLAKILAGYDDVHYERSVAT 286 Query: 1129 GEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFEDELDYSKF 1308 G+ + AS Q MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IELPFEDE+DY+KF Sbjct: 287 GDNIKASSQRMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDEIELPFEDEIDYTKF 346 Query: 1309 SIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDAVNMIWRQV 1488 S+FFS KEAL+ YMV +LRK K+RW+EMWR+L SISHHFE+ YPP+KEDAVNM+WRQV Sbjct: 347 SLFFSFKEALEPGYMVDQLRKFPKDRWIEMWRQLNSISHHFEFHYPPEKEDAVNMLWRQV 406 Query: 1489 KHKLPAAKLSVHRSRRLKVPDWWRR 1563 KHKLPA KL++HR+RRLK+PDWWRR Sbjct: 407 KHKLPAVKLAIHRNRRLKIPDWWRR 431 >ref|XP_004310070.1| PREDICTED: probable glycosyltransferase At3g07620-like [Fragaria vesca subsp. 
vesca] Length = 446 Score = 536 bits (1380), Expect = e-149 Identities = 259/392 (66%), Positives = 304/392 (77%) Frame = +1 Query: 388 CRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSVE 567 C + P PLKV+MY+LPRRFNVGM+NR + + A VTA P WP +GLK+QHSVE Sbjct: 61 CATGP--PLKVFMYDLPRRFNVGMLNRKSA----EEAPVTAREWPPWPRNSGLKKQHSVE 114 Query: 568 YWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDRQ 747 YW+M S+L+ GN + E VRV DPE+AD NTHG NM DPETE D Q Sbjct: 115 YWMMGSVLWEGNGGEGS--EVVRVSDPEVADAFFVPFFSSLSFNTHGHNMNDPETEVDHQ 172 Query: 748 LQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLANL 927 LQ D++++L ES YW RS GRDHVIPM HPNAFRFLR +NASI IV DF RY ++NL Sbjct: 173 LQIDLVKLLHESKYWNRSGGRDHVIPMTHPNAFRFLRPQINASIQIVVDFGRYPHVMSNL 232 Query: 928 RKDVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDVT 1107 KDVV PYVHVV+SF DD DP+ +R TLLFF+G+T RKDEG +RAKL VL GYDDV Sbjct: 233 SKDVVTPYVHVVESFTDDNSSDPYESRTTLLFFQGRTHRKDEGIVRAKLAKVLAGYDDVH 292 Query: 1108 YAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFED 1287 Y +S TGE + S Q MR+SKFCLHPAGDTPSSCRLFDAIVSHC+PVIVSD IELPFED Sbjct: 293 YERSVATGENIKLSTQRMRASKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDEIELPFED 352 Query: 1288 ELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDAV 1467 ELDY++FS+FFS KEALQ YMV+ELRK+SKE+W+EM+R LKSISHHFE+ YPP+KEDAV Sbjct: 353 ELDYNQFSVFFSFKEALQPGYMVNELRKLSKEKWMEMYRHLKSISHHFEFHYPPEKEDAV 412 Query: 1468 NMIWRQVKHKLPAAKLSVHRSRRLKVPDWWRR 1563 NM+WRQVK K+PA KL+VHRS+RLK+PDWWRR Sbjct: 413 NMLWRQVKRKVPAVKLAVHRSQRLKIPDWWRR 444 >ref|XP_006391283.1| hypothetical protein EUTSA_v10018590mg [Eutrema salsugineum] gi|557087717|gb|ESQ28569.1| hypothetical protein EUTSA_v10018590mg [Eutrema salsugineum] Length = 432 Score = 533 bits (1373), Expect = e-148 Identities = 254/400 (63%), Positives = 310/400 (77%), Gaps = 7/400 (1%) Frame = +1 Query: 379 STPCRSEPSS----PLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGL 546 S P R+ P S PL+V+MY+LPR+FNV MM+ + D+ +T +NLP+WP +G+ Sbjct: 37 
SQPRRASPCSITGRPLRVFMYDLPRKFNVAMMDPQS----SDVEPLTGKNLPSWPQTSGI 92 Query: 547 KRQHSVEYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDP 726 KRQHSVEYWLMASLL+ G + +EA RV DPELAD NTHG+NMTDP Sbjct: 93 KRQHSVEYWLMASLLHGGGG-GEEEKEAFRVFDPELADAFYVPFFSSLSFNTHGKNMTDP 151 Query: 727 ETEKDRQLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARY 906 +TE DRQLQ ++++ L S YWQRS GRDHVIPM HPNAFRFLR VNASIL+V DF RY Sbjct: 152 DTEFDRQLQVELMEYLENSKYWQRSGGRDHVIPMTHPNAFRFLRQQVNASILVVVDFGRY 211 Query: 907 ERSLANLRKDVVAPYVHVVDSFLDD---ELPDPFTARKTLLFFRGKTLRKDEGKIRAKLE 1077 R +A L KDVV+PYVHVV+SF +D + PDPF AR TLL+FRG T+RK EGKIR +LE Sbjct: 212 PREMARLGKDVVSPYVHVVESFTEDGGVDTPDPFEARTTLLYFRGNTVRKAEGKIRLRLE 271 Query: 1078 NVLVGYDDVTYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIV 1257 +L G DV Y KS T + + S +GMRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+ Sbjct: 272 KLLAGNSDVHYEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVII 331 Query: 1258 SDHIELPFEDELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEY 1437 SD IELPFEDE+DYS+FS+FFSIKEAL+ Y+++ LR+ KE+WL+MW LK++SHHFE+ Sbjct: 332 SDRIELPFEDEIDYSEFSVFFSIKEALEPGYILNNLRQFPKEKWLQMWENLKNVSHHFEF 391 Query: 1438 QYPPKKEDAVNMIWRQVKHKLPAAKLSVHRSRRLKVPDWW 1557 QYPPK+EDAVNM+WRQVKHK+P+ KL+VHR+RRLKVPDWW Sbjct: 392 QYPPKREDAVNMLWRQVKHKIPSVKLAVHRNRRLKVPDWW 431 >ref|XP_003531191.2| PREDICTED: probable glycosyltransferase At3g07620-like isoformX1 [Glycine max] Length = 472 Score = 531 bits (1369), Expect = e-148 Identities = 254/393 (64%), Positives = 301/393 (76%) Frame = +1 Query: 385 PCRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSV 564 PC EP PL+V+MY+LPRRFNVGM++R + VT E+ PAWP GLK+QHSV Sbjct: 90 PCAPEP--PLRVFMYDLPRRFNVGMIDRRSASETP----VTVEDWPAWPVNWGLKKQHSV 143 Query: 565 EYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDR 744 EYW+M SLL G +EAVRV DPELA NTHG M DP T+ DR Sbjct: 144 EYWMMGSLLNAGEG-----REAVRVSDPELAQAFFVPFFSSLSFNTHGHTMKDPATQIDR 198 Query: 745 QLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLAN 924 QLQ D++++L++S YWQRS GRDHV PM HPNAFRFLR +N SI +V DF 
RY R ++N Sbjct: 199 QLQVDLMELLKKSKYWQRSGGRDHVFPMTHPNAFRFLRGQLNESIQVVVDFGRYPRGMSN 258 Query: 925 LRKDVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDV 1104 L KDVV+PYVHVVDSF DDE DP+ +R TLLFFRG+T RKDEG +R KL +L GYDDV Sbjct: 259 LNKDVVSPYVHVVDSFTDDEPQDPYESRSTLLFFRGRTYRKDEGIVRVKLAKILAGYDDV 318 Query: 1105 TYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFE 1284 Y +S T E + AS +GMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD IELPFE Sbjct: 319 HYERSVATEENIKASSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFE 378 Query: 1285 DELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDA 1464 D++DYS+FS+FFS KEALQ YM+ +LRK KE+W EMWR+LKSISHH+E++YPPK+EDA Sbjct: 379 DDIDYSQFSVFFSFKEALQPGYMIDQLRKFPKEKWTEMWRQLKSISHHYEFEYPPKREDA 438 Query: 1465 VNMIWRQVKHKLPAAKLSVHRSRRLKVPDWWRR 1563 V+M+WRQ KHKLP KLSVHR+RRLK+PDWW+R Sbjct: 439 VDMLWRQAKHKLPGVKLSVHRNRRLKIPDWWQR 471 >ref|XP_004504444.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cicer arietinum] Length = 430 Score = 528 bits (1361), Expect = e-147 Identities = 252/414 (60%), Positives = 302/414 (72%) Frame = +1 Query: 322 YMNDFDFDYTRFATARSFYSTPCRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAA 501 +M D F +S P P PL+VYMY+LPRRFNV M+ + Sbjct: 22 FMGTLDIRSYFFPHLKSPTLEPAPCSPDPPLRVYMYDLPRRFNVEMITHRTASESP---- 77 Query: 502 VTAENLPAWPDRTGLKRQHSVEYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXX 681 VT ++ P WPD GLK+QHSVEYW+M SLL+ G + ++EAVRV DPE AD Sbjct: 78 VTVKDWPPWPDNWGLKKQHSVEYWMMGSLLHEGEDGE--SREAVRVFDPEFADAFFVPFF 135 Query: 682 XXXXXNTHGRNMTDPETEKDRQLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRS 861 N+HG MTDP TE DRQLQ D+++ L +S YWQRS GRDH+ PM HPNAFRFLR+ Sbjct: 136 SSLSFNSHGHTMTDPATEIDRQLQVDVMEFLTKSKYWQRSRGRDHIFPMTHPNAFRFLRN 195 Query: 862 GVNASILIVADFARYERSLANLRKDVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTL 1041 VN +I +V DF RY + ++NL KDVV+PYVHVVDSF DDE DP+ AR TLLFFRG+T Sbjct: 196 QVNDTIQVVVDFGRYPKGMSNLNKDVVSPYVHVVDSFTDDEPEDPYEARSTLLFFRGRTF 255 Query: 1042 RKDEGKIRAKLENVLVGYDDVTYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLF 1221 RKDEG +RAKL +L GY DV Y +S TGE + AS 
+GMRSSKFCLHPAGDTPSSCRLF Sbjct: 256 RKDEGIVRAKLTKILSGYSDVHYERSVATGENIKASSKGMRSSKFCLHPAGDTPSSCRLF 315 Query: 1222 DAIVSHCVPVIVSDHIELPFEDELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMW 1401 DAIVSHCVPVIVSD IELPFED++DYS+FS+FFS KEALQ YM+ LRK K++W EMW Sbjct: 316 DAIVSHCVPVIVSDQIELPFEDQIDYSQFSLFFSFKEALQPGYMIDHLRKFPKQKWTEMW 375 Query: 1402 RRLKSISHHFEYQYPPKKEDAVNMIWRQVKHKLPAAKLSVHRSRRLKVPDWWRR 1563 R+LK+ SHH+E+QYPPK+ DAVNM+WRQ+KHKLP LS+HRSRRLK+PDWW R Sbjct: 376 RQLKNNSHHYEFQYPPKRGDAVNMLWRQIKHKLPEVTLSIHRSRRLKIPDWWHR 429 >ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata] gi|297334437|gb|EFH64855.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata] Length = 429 Score = 528 bits (1361), Expect = e-147 Identities = 251/396 (63%), Positives = 311/396 (78%), Gaps = 3/396 (0%) Frame = +1 Query: 379 STPCRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQH 558 ++PC S PL+V+MY+LPR+FNV MM+ ++ D+ +T +NLP+WP +G+KRQH Sbjct: 42 ASPC-SSTGKPLRVFMYDLPRKFNVAMMDPHS----SDVEPLTGKNLPSWPQTSGIKRQH 96 Query: 559 SVEYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEK 738 SVEYWLMASLL G++ N EA+RV DP+LAD NTHG+NMTDP+TE Sbjct: 97 SVEYWLMASLLNGGDDDN----EAIRVFDPDLADAFYVPFFSSLSFNTHGKNMTDPDTEF 152 Query: 739 DRQLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSL 918 DRQLQ ++++ L S YW RS G+DHVIPM HPNAFRFLR VNASILIV DF RY + + Sbjct: 153 DRQLQVELMEFLEGSEYWNRSGGKDHVIPMTHPNAFRFLRQQVNASILIVVDFGRYAKDM 212 Query: 919 ANLRKDVVAPYVHVVDSFL---DDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLV 1089 A L KDVV+PYVHVV+S DD L DPF AR TLL+FRG T+RKDEGKIR +LE +L Sbjct: 213 ARLSKDVVSPYVHVVESLNEEDDDGLTDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLA 272 Query: 1090 GYDDVTYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHI 1269 G DV + KS T + + S +GMRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+SD I Sbjct: 273 GNSDVHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKI 332 Query: 1270 ELPFEDELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPP 1449 ELPFEDE+DYS+FS+FFSIKE+L+ Y++++LR+ KE+WLEMW+RLK++SHHFE+QYPP Sbjct: 333 
ELPFEDEIDYSEFSLFFSIKESLEPGYILNKLRQFPKEKWLEMWKRLKNVSHHFEFQYPP 392 Query: 1450 KKEDAVNMIWRQVKHKLPAAKLSVHRSRRLKVPDWW 1557 K+EDAVNM+WRQVKHK+P KL+VHR+RRLKVPDWW Sbjct: 393 KREDAVNMLWRQVKHKIPNVKLAVHRNRRLKVPDWW 428 >ref|XP_003524893.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like isoform X1 [Glycine max] Length = 427 Score = 527 bits (1358), Expect = e-147 Identities = 252/392 (64%), Positives = 300/392 (76%) Frame = +1 Query: 385 PCRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQHSV 564 PC +P PL+V+MY+LPRRFNVGM++R + VT E+ PAWP GLK+QHSV Sbjct: 45 PCAPDP--PLRVFMYDLPRRFNVGMIDRRSAAE----MPVTVEDWPAWPVNWGLKKQHSV 98 Query: 565 EYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDR 744 EYW+M SLL G +E VRV DPELA NTHG M DP T+ DR Sbjct: 99 EYWMMGSLLNVGGG-----REVVRVSDPELAQAFFVPFFSSLSFNTHGHTMKDPATQIDR 153 Query: 745 QLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLAN 924 QLQ D++++L++S YWQRS GRDHV PM HPNAFRFLR +N SI +V DF RY R ++N Sbjct: 154 QLQVDLMELLKKSNYWQRSGGRDHVFPMTHPNAFRFLRDQLNESIQVVVDFGRYPRGMSN 213 Query: 925 LRKDVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDV 1104 L KDVV+PYVHVVDSF DDE DP+ +R TLLFFRG+T RKDEG +R KL +L GYDDV Sbjct: 214 LNKDVVSPYVHVVDSFTDDEPQDPYESRSTLLFFRGRTYRKDEGIVRVKLAKILAGYDDV 273 Query: 1105 TYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFE 1284 Y +S T E + AS +GMRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVIVSD IELPFE Sbjct: 274 HYERSVATEENIKASSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDQIELPFE 333 Query: 1285 DELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDA 1464 DE+DYS+FS+FFS KEALQ YM+ +LRK KE+W EMWR+LKSISHH+E++YPPK+EDA Sbjct: 334 DEIDYSQFSVFFSFKEALQPGYMIDQLRKFPKEKWTEMWRQLKSISHHYEFRYPPKREDA 393 Query: 1465 VNMIWRQVKHKLPAAKLSVHRSRRLKVPDWWR 1560 V+M+WRQVKHKLP KLSVHR+RRLK+PDWW+ Sbjct: 394 VDMLWRQVKHKLPGVKLSVHRNRRLKIPDWWQ 425 >ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana] gi|115311405|gb|ABI93883.1| At1g67410 [Arabidopsis thaliana] gi|332196520|gb|AEE34641.1| exostosin-like protein [Arabidopsis 
thaliana] Length = 430 Score = 527 bits (1357), Expect = e-147 Identities = 249/396 (62%), Positives = 308/396 (77%), Gaps = 3/396 (0%) Frame = +1 Query: 379 STPCRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQH 558 S+PC S PL+V+MY+LPR+FN+ MM+ ++ D+ +T +NLP+WP +G+KRQH Sbjct: 43 SSPCSSS-GKPLRVFMYDLPRKFNIAMMDPHS----SDVEPITGKNLPSWPQTSGIKRQH 97 Query: 559 SVEYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEK 738 SVEYWLMASLL G + N EA+RV DP+LAD NTHG+NMTDP+TE Sbjct: 98 SVEYWLMASLLNGGEDEN----EAIRVFDPDLADVFYVPFFSSLSFNTHGKNMTDPDTEF 153 Query: 739 DRQLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSL 918 DR LQ ++++ L S YW RS G+DHVIPM HPNAFRFLR VNASILIV DF RY + + Sbjct: 154 DRLLQVELMEFLENSKYWNRSGGKDHVIPMTHPNAFRFLRQQVNASILIVVDFGRYSKDM 213 Query: 919 ANLRKDVVAPYVHVVDSFL---DDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLV 1089 A L KDVV+PYVHVV+S DD + DPF AR TLL+FRG T+RKDEGKIR +LE +L Sbjct: 214 ARLSKDVVSPYVHVVESLNEEGDDGMGDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLA 273 Query: 1090 GYDDVTYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHI 1269 G DV + KS T + + S +GMRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+SD I Sbjct: 274 GNSDVHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKI 333 Query: 1270 ELPFEDELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPP 1449 ELPFEDE+DYS+FS+FFSIKE+L+ Y+++ LR+ KE+WLEMW+RLK++SHHFE+QYPP Sbjct: 334 ELPFEDEIDYSEFSLFFSIKESLEPGYILNNLRQFPKEKWLEMWKRLKNVSHHFEFQYPP 393 Query: 1450 KKEDAVNMIWRQVKHKLPAAKLSVHRSRRLKVPDWW 1557 K+EDAVNM+WRQVKHK+P KL+VHR+RRLKVPDWW Sbjct: 394 KREDAVNMLWRQVKHKIPYVKLAVHRNRRLKVPDWW 429 >ref|XP_006302174.1| hypothetical protein CARUB_v10020184mg [Capsella rubella] gi|482570884|gb|EOA35072.1| hypothetical protein CARUB_v10020184mg [Capsella rubella] Length = 494 Score = 525 bits (1351), Expect = e-146 Identities = 247/397 (62%), Positives = 307/397 (77%), Gaps = 4/397 (1%) Frame = +1 Query: 379 STPCRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTGLKRQH 558 ++PC S PL+V+MY+LPR+FNV MM+ + D+ +T +NLP+WP +G+KRQH Sbjct: 104 
ASPCSSN-GRPLRVFMYDLPRKFNVAMMDPRS----SDVEPLTGKNLPSWPQTSGIKRQH 158 Query: 559 SVEYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEK 738 SVEYWLMASLL G + D EA+RV DP+LAD NTHG+NMTDP+TE Sbjct: 159 SVEYWLMASLLQRGGDGGD--DEAIRVFDPDLADAFYVPFFSSLSFNTHGKNMTDPDTEF 216 Query: 739 DRQLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSL 918 DR+LQ ++++ L S YW+RS G+DHVIPM HPNAFRFLR VNASILIV DF RY + + Sbjct: 217 DRKLQVELMEFLENSEYWKRSGGKDHVIPMTHPNAFRFLRQQVNASILIVVDFGRYPKDM 276 Query: 919 ANLRKDVVAPYVHVVDSFL----DDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVL 1086 A L KDVV+PYVHVV++ DD + DPF AR TLL+FRG T RKDEGKIR +LE +L Sbjct: 277 ARLSKDVVSPYVHVVETLTEDGDDDGMTDPFEARTTLLYFRGNTARKDEGKIRLRLEKLL 336 Query: 1087 VGYDDVTYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDH 1266 DV Y KS T + + S +GMRSSKFCLHPAGDTPSSCRLFDAIVSHC+PVI+SD Sbjct: 337 ANNSDVHYEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDK 396 Query: 1267 IELPFEDELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYP 1446 IELPFEDE+DYS+FS+FFSIKE+L+ Y+++ LR+ K++WLEMW+RLK++SHHFE+QYP Sbjct: 397 IELPFEDEIDYSEFSVFFSIKESLEPGYILNNLRQFPKDKWLEMWKRLKNVSHHFEFQYP 456 Query: 1447 PKKEDAVNMIWRQVKHKLPAAKLSVHRSRRLKVPDWW 1557 PK+EDAVNM+WRQVKHK+P KL+VHR+RRLKVPDWW Sbjct: 457 PKREDAVNMLWRQVKHKIPNVKLAVHRNRRLKVPDWW 493 >ref|XP_006848395.1| hypothetical protein AMTR_s00013p00218260 [Amborella trichopoda] gi|548851701|gb|ERN09976.1| hypothetical protein AMTR_s00013p00218260 [Amborella trichopoda] Length = 422 Score = 524 bits (1350), Expect = e-146 Identities = 250/400 (62%), Positives = 309/400 (77%), Gaps = 1/400 (0%) Frame = +1 Query: 367 RSFYSTPCRSEPS-SPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTG 543 RS + P PS SPLK+YMY LPR FN+GM+ R++ + +P WP +G Sbjct: 28 RSQFFAPTIIAPSNSPLKIYMYNLPRHFNIGMLRRSDPHQDLPFTG----QIPPWPQNSG 83 Query: 544 LKRQHSVEYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTD 723 LK+QHSVEYW+MASLLY E D EA+RV DPE AD NTHG NMTD Sbjct: 84 LKKQHSVEYWMMASLLYEDGEGRD--MEAIRVSDPEEADAFFVPFFSSLSFNTHGHNMTD 141 Query: 724 
PETEKDRQLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFAR 903 PETE DRQLQ ++L+ LR S +W++S GRDHVIPMHHPNAFRFLR VNASIL+VADF R Sbjct: 142 PETEVDRQLQIELLEFLRISKFWEQSGGRDHVIPMHHPNAFRFLREKVNASILVVADFGR 201 Query: 904 YERSLANLRKDVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENV 1083 +++++L KDVVAPYVHV DSF+DD+ DPF +R TLLFFRG+T+RK EG +R+KL + Sbjct: 202 CPKNISSLSKDVVAPYVHVGDSFIDDDSSDPFESRTTLLFFRGRTVRKAEGIVRSKLAKI 261 Query: 1084 LVGYDDVTYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSD 1263 L G + V + +S TGE + AS GMRSSKFCL+PAGDTPSSCRLFDAIVSHC+PVIVSD Sbjct: 262 LRGQEGVHFEESVATGESIKASSLGMRSSKFCLNPAGDTPSSCRLFDAIVSHCIPVIVSD 321 Query: 1264 HIELPFEDELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQY 1443 IELP+EDE+DY FS+FFS++EAL+ YM+ ELR++ +E+W+EMWRRLK ISHHFE+Q+ Sbjct: 322 RIELPYEDEIDYRTFSLFFSVEEALRPGYMLKELRQIKREKWVEMWRRLKEISHHFEFQF 381 Query: 1444 PPKKEDAVNMIWRQVKHKLPAAKLSVHRSRRLKVPDWWRR 1563 PPK++DAVNMIW+QV+HKLPAAKL+VHRSRRLK+PDWW + Sbjct: 382 PPKRDDAVNMIWKQVRHKLPAAKLAVHRSRRLKIPDWWEK 421 >ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor] gi|241928830|gb|EES01975.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor] Length = 432 Score = 524 bits (1350), Expect = e-146 Identities = 250/395 (63%), Positives = 309/395 (78%), Gaps = 1/395 (0%) Frame = +1 Query: 376 YSTPCRSEPSSPLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTG-LKR 552 +S C ++PL+V+MY+LP RF+V MM ++ PAWP G ++R Sbjct: 49 FSARCAPAAAAPLRVFMYDLPARFHVAMMGADD-----------GAGFPAWPPSAGGIRR 97 Query: 553 QHSVEYWLMASLLYNGNEWNDTTQEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPET 732 QHSVEYW+MASL +G D +EAVRV DP+ AD N HGRNMTDP+T Sbjct: 98 QHSVEYWMMASL-QDGAAGPDGGREAVRVRDPDAADAFFVPFFSSLSFNVHGRNMTDPDT 156 Query: 733 EKDRQLQADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYER 912 E DR LQ +I+ IL +S YWQRSAGRDHVIPMHHPNAFRFLR+ VNASILIV+DF RY + Sbjct: 157 EADRLLQVEIVDILWKSKYWQRSAGRDHVIPMHHPNAFRFLRAMVNASILIVSDFGRYTK 216 Query: 913 SLANLRKDVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVG 1092 LA+LRKDVVAPYVHVVDSFLDD+ 
PDPF AR TLLFFRG+T+RKDEGKIRAKL VL G Sbjct: 217 ELASLRKDVVAPYVHVVDSFLDDDPPDPFEARHTLLFFRGRTVRKDEGKIRAKLGKVLKG 276 Query: 1093 YDDVTYAKSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIE 1272 + V + S TG+G+ S +GMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVS IE Sbjct: 277 KEGVRFEDSIATGDGIKISTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIE 336 Query: 1273 LPFEDELDYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPK 1452 LPFEDE+DYS+FS+FFS++EAL+ DY++++LR++ K++W++MW +LK++SHH+E+QYPP+ Sbjct: 337 LPFEDEIDYSEFSLFFSVEEALRPDYLLNQLRQIPKKKWVDMWSKLKNVSHHYEFQYPPR 396 Query: 1453 KEDAVNMIWRQVKHKLPAAKLSVHRSRRLKVPDWW 1557 K DAVNMIWRQV+HK+PA L++HR+RRLK+PDWW Sbjct: 397 KGDAVNMIWRQVRHKIPAVNLAIHRNRRLKIPDWW 431 >ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group] gi|19386797|dbj|BAB86176.1| OJ1485_B09.5 [Oryza sativa Japonica Group] gi|57899432|dbj|BAD88370.1| exostosin-like [Oryza sativa Japonica Group] gi|113534757|dbj|BAF07140.1| Os01g0921300 [Oryza sativa Japonica Group] gi|125573139|gb|EAZ14654.1| hypothetical protein OsJ_04578 [Oryza sativa Japonica Group] gi|215741014|dbj|BAG97509.1| unnamed protein product [Oryza sativa Japonica Group] gi|215767487|dbj|BAG99715.1| unnamed protein product [Oryza sativa Japonica Group] Length = 437 Score = 522 bits (1345), Expect = e-145 Identities = 250/388 (64%), Positives = 305/388 (78%), Gaps = 5/388 (1%) Frame = +1 Query: 409 PLKVYMYELPRRFNVGMMNRNNRKLNDDIAAVTAENLPAWPDRTG-LKRQHSVEYWLMAS 585 PL+V+MY+LPRRF+VGMM+ +A PAWP G ++RQHSVEYW+MAS Sbjct: 61 PLRVFMYDLPRRFHVGMMD------------ASASGFPAWPPSAGGIRRQHSVEYWMMAS 108 Query: 586 LLYNGNEWNDTT----QEAVRVLDPELADXXXXXXXXXXXXNTHGRNMTDPETEKDRQLQ 753 L G N ++ +EAVRV DP+ A+ N HGRNMTDPETE DR LQ Sbjct: 109 LQGGGGGGNGSSSEEGREAVRVTDPDAAEAFFVPFFSSLSFNVHGRNMTDPETEADRLLQ 168 Query: 754 ADILQILRESPYWQRSAGRDHVIPMHHPNAFRFLRSGVNASILIVADFARYERSLANLRK 933 ++++IL +S YWQRSAGRDHVIPMHHPNAFRFLR VNASILIVADF RY + LA+LRK Sbjct: 169 VELMEILWKSKYWQRSAGRDHVIPMHHPNAFRFLRDMVNASILIVADFGRYTKELASLRK 228 Query: 934 
DVVAPYVHVVDSFLDDELPDPFTARKTLLFFRGKTLRKDEGKIRAKLENVLVGYDDVTYA 1113 DVVAPYVHVVDSFL+D+ PDPF R TLLFFRG+T+RKDEGKIRAKL +L G D V + Sbjct: 229 DVVAPYVHVVDSFLNDDPPDPFDDRPTLLFFRGRTVRKDEGKIRAKLAKILKGKDGVRFE 288 Query: 1114 KSDPTGEGVNASIQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDHIELPFEDEL 1293 S TGEG+ S +GMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVS IELPFEDE+ Sbjct: 289 DSLATGEGIKTSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEI 348 Query: 1294 DYSKFSIFFSIKEALQQDYMVSELRKVSKERWLEMWRRLKSISHHFEYQYPPKKEDAVNM 1473 DYS+FS+FFS++EAL+ DY++++LR++ K +W+E+W +LK++SHH+E+Q PP+K DAVNM Sbjct: 349 DYSEFSLFFSVEEALRPDYLLNQLRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGDAVNM 408 Query: 1474 IWRQVKHKLPAAKLSVHRSRRLKVPDWW 1557 IWRQVKHK+PA L++HR+RRLK+PDWW Sbjct: 409 IWRQVKHKVPAVNLAIHRNRRLKIPDWW 436