BLASTX nr result
ID: Akebia26_contig00013800
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia26_contig00013800 (776 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003555466.1| PREDICTED: uncharacterized protein LOC100790... 358 1e-96 ref|XP_004496161.1| PREDICTED: uncharacterized protein LOC101507... 355 1e-95 ref|XP_007144298.1| hypothetical protein PHAVU_007G144400g [Phas... 353 4e-95 ref|XP_006379496.1| hypothetical protein POPTR_0008s02900g [Popu... 352 8e-95 ref|XP_006838728.1| hypothetical protein AMTR_s00002p00253220 [A... 349 6e-94 gb|EXB58473.1| hypothetical protein L484_005207 [Morus notabilis] 348 2e-93 ref|XP_003536299.1| PREDICTED: uncharacterized protein LOC100789... 347 3e-93 ref|XP_002262646.1| PREDICTED: uncharacterized protein LOC100242... 342 9e-92 ref|XP_004307152.1| PREDICTED: uncharacterized protein LOC101295... 340 3e-91 gb|EXB58476.1| hypothetical protein L484_005210 [Morus notabilis] 333 5e-89 ref|XP_007010125.1| Glycosyltransferase family protein 47 [Theob... 333 5e-89 ref|XP_006436587.1| hypothetical protein CICLE_v10030719mg [Citr... 322 9e-86 ref|XP_004250015.1| PREDICTED: uncharacterized protein LOC101257... 319 8e-85 ref|XP_006360502.1| PREDICTED: uncharacterized protein LOC102585... 318 1e-84 ref|XP_002532924.1| transferase, transferring glycosyl groups, p... 318 1e-84 ref|XP_004142449.1| PREDICTED: uncharacterized protein LOC101212... 316 7e-84 ref|XP_002871098.1| glycosyltransferase family protein 47 [Arabi... 315 1e-83 ref|NP_196070.2| glycosyltransferase family protein 47 [Arabidop... 313 3e-83 emb|CAB85556.1| putative protein [Arabidopsis thaliana] 313 3e-83 ref|XP_006289714.1| hypothetical protein CARUB_v10003280mg [Caps... 311 1e-82 >ref|XP_003555466.1| PREDICTED: uncharacterized protein LOC100790409 [Glycine max] Length = 761 Score = 358 bits (919), Expect = 1e-96 Identities = 170/259 (65%), Positives = 197/259 (76%), Gaps = 13/259 (5%) Frame = -2 Query: 739 SCCDMSLKCWCRWRWDHC-----FLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLS 575 SCCDMS+KC CRWR ++ S TLY WL FSP +SLS Sbjct: 14 SCCDMSVKCSCRWRLENQQYYKRLFSSGFVFFFGCFVLFGSIATLYGWLAFSPTVHTSLS 73 Query: 574 SLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPS 395 S GC+DD+EGSWS+GVFYGDSPFSLKPIE+ NV DES AWPVANPVVTCAS +DAG+PS Sbjct: 74 SFGCRDDNEGSWSVGVFYGDSPFSLKPIEAANVSNDESAAWPVANPVVTCASVSDAGYPS 133 Query: 394 NFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPY 239 NFVADPFL+IQ GDIGV++S D+GATWQQLGIAL+E+WHLSYPY Sbjct: 134 NFVADPFLFIQGNTFYLFYETKSSITMQGDIGVSKSTDKGATWQQLGIALNEDWHLSYPY 193 Query: 238 VFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGS 59 VF++DGQIYMMPE SQKG+LRLYRA+ FPLQW LEKV++K+PLVDSF++N+ G YWLFGS Sbjct: 194 VFEHDGQIYMMPEGSQKGDLRLYRAVNFPLQWRLEKVVMKKPLVDSFVINHGGRYWLFGS 253 Query: 58 DHSGFGAKKNGQLEIWYSN 2 DHSGFG +KNGQLEIWYSN Sbjct: 254 DHSGFGTQKNGQLEIWYSN 272 >ref|XP_004496161.1| PREDICTED: uncharacterized protein LOC101507497 [Cicer arietinum] Length = 773 Score = 355 bits (910), Expect = 1e-95 Identities = 164/259 (63%), Positives = 195/259 (75%), Gaps = 13/259 (5%) Frame = -2 Query: 739 SCCDMSLKCWCRWRWDHC-----FLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLS 575 SCCDMS+KCWCRWR ++ S + Y WL FSP +++S Sbjct: 27 SCCDMSMKCWCRWRMENQHYYNRIFSSGFVFFFGCFVLFGSIASFYGWLAFSPSVHTAIS 86 Query: 574 SLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPS 395 GCQDD+EGSWSIG+FYG SPFSLKPIES NV D+S +WPVANPVVTCAS +DAGFPS Sbjct: 87 PFGCQDDNEGSWSIGIFYGHSPFSLKPIESSNVSNDDSASWPVANPVVTCASVSDAGFPS 146 Query: 394 NFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPY 239 NFVADPFL+IQ GDIGV++S D+GATWQQLGIAL+E+WHLSYPY Sbjct: 147 NFVADPFLFIQGDTLYLFYETKNSITMQGDIGVSKSTDKGATWQQLGIALNEDWHLSYPY 206 Query: 238 VFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGS 59 VF++DGQIYMMPE S++G+LRLY+A+ FPLQW LEKV++K+PL+DSFI++Y G YWLFGS Sbjct: 207 VFEHDGQIYMMPEGSKRGDLRLYKAVNFPLQWKLEKVLIKKPLIDSFIVDYGGKYWLFGS 266 Query: 58 DHSGFGAKKNGQLEIWYSN 2 DHSGFG KKNGQLEIWYSN Sbjct: 267 DHSGFGTKKNGQLEIWYSN 285 >ref|XP_007144298.1| hypothetical protein PHAVU_007G144400g [Phaseolus vulgaris] gi|561017488|gb|ESW16292.1| hypothetical protein PHAVU_007G144400g [Phaseolus vulgaris] Length = 768 Score = 353 bits (906), Expect = 4e-95 Identities = 172/259 (66%), Positives = 196/259 (75%), Gaps = 13/259 (5%) Frame = -2 Query: 739 SCCDMSLKCWCRWRWDHC-----FLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLS 575 SCCDMS+KC CRWR ++ LS TLY W+ F P RSSL+ Sbjct: 23 SCCDMSVKCSCRWRLENQQYYKRLLSSGFVFFFGCFVLFGSIATLYGWVAFPPTVRSSLN 82 Query: 574 SLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPS 395 GC+DD+EGSWSIG+FYGDSPFSLKPIE+ NV DES AWPVANPVVTCAS +DAGFPS Sbjct: 83 --GCRDDNEGSWSIGIFYGDSPFSLKPIEAANVSHDESAAWPVANPVVTCASVSDAGFPS 140 Query: 394 NFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPY 239 NFVADPFL+IQ GDIGV++S D+GATWQQLGIAL+E+WHLSYPY Sbjct: 141 NFVADPFLFIQGNTFYLFYETKNSITYQGDIGVSKSTDKGATWQQLGIALNEDWHLSYPY 200 Query: 238 VFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGS 59 VF++DGQIYMMPE S+KG+LRLYRA+ FPLQW L KVI+K PLVDSFI+NY G YWLFGS Sbjct: 201 VFEHDGQIYMMPEGSKKGDLRLYRAVNFPLQWRLAKVIIKRPLVDSFIINYGGRYWLFGS 260 Query: 58 DHSGFGAKKNGQLEIWYSN 2 DHSGFG+KKNGQLEIWYSN Sbjct: 261 DHSGFGSKKNGQLEIWYSN 279 >ref|XP_006379496.1| hypothetical protein POPTR_0008s02900g [Populus trichocarpa] gi|550332290|gb|ERP57293.1| hypothetical protein POPTR_0008s02900g [Populus trichocarpa] Length = 789 Score = 352 bits (903), Expect = 8e-95 Identities = 172/281 (61%), Positives = 198/281 (70%), Gaps = 35/281 (12%) Frame = -2 Query: 739 SCCDMSLKCWCRWRWDH---------------------CFLSXXXXXXXXXXXXXXXXXT 623 +CCDMSL+CWCRW+W + S Sbjct: 21 NCCDMSLRCWCRWKWGNHQQQQQPQQNHHNLLHQRLVSLVFSSGFMFFLGCLVLYGSIGM 80 Query: 622 LYAWLTFS-PYERSS-----LSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDES 461 Y WL FS PY RS+ L+SLGCQ+D+EGSWSIGVFYGDSPFSLKPIE+ N W+DE Sbjct: 81 FYGWLVFSKPYSRSTNVGVGLNSLGCQEDNEGSWSIGVFYGDSPFSLKPIEAMNEWRDEG 140 Query: 460 GAWPVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSID 305 AWPVANPVVTCAS +DA FPSNFVADPFLY+Q GDI VA+S+D Sbjct: 141 VAWPVANPVVTCASLSDANFPSNFVADPFLYVQGDTLFLFYETKNSITMQGDIAVAKSMD 200 Query: 304 EGATWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVI 125 +GATWQQLGIALDE+WHLSYPYVF+Y GQIYMMPESSQKGELRLYRA+ FPLQW LEKV+ Sbjct: 201 KGATWQQLGIALDEDWHLSYPYVFNYLGQIYMMPESSQKGELRLYRALNFPLQWTLEKVL 260 Query: 124 LKEPLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2 +K+PLVDSFI+N+ G YWLFGSDHSGFG ++NGQLEIWYS+ Sbjct: 261 IKKPLVDSFIINHAGIYWLFGSDHSGFGTRRNGQLEIWYSS 301 >ref|XP_006838728.1| hypothetical protein AMTR_s00002p00253220 [Amborella trichopoda] gi|548841234|gb|ERN01297.1| hypothetical protein AMTR_s00002p00253220 [Amborella trichopoda] Length = 762 Score = 349 bits (896), Expect = 6e-94 Identities = 165/255 (64%), Positives = 191/255 (74%), Gaps = 11/255 (4%) Frame = -2 Query: 736 CCDMSLKCWCRWR--WDHCFL-SXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLSSLG 566 CCDM LKCWCRW +DH FL S +AWLTFSPYER LSS G Sbjct: 28 CCDMRLKCWCRWHPTFDHSFLISSAFSFFLISSLLFGSLALAFAWLTFSPYERPRLSSYG 87 Query: 565 CQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPSNFV 386 CQDD+EGSWSIGV+YGD+PFSLKP+E RNVW D+ AWPVANPV+TCA A+DAG+PSNFV Sbjct: 88 CQDDNEGSWSIGVYYGDNPFSLKPLELRNVWSDKGLAWPVANPVMTCALASDAGYPSNFV 147 Query: 385 ADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPYVFD 230 ADPFLY+Q G+IGVARS+D ATW+ LGIALDEEWHLS+PYVF Sbjct: 148 ADPFLYVQDDILYMFFETKNSVTLKGEIGVARSLDNSATWEHLGIALDEEWHLSFPYVFS 207 Query: 229 YDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGSDHS 50 Y+G+IYM+PE SQKG+LRLYRA+ FPLQW LEKVILK P+VDSFI+ + ++WLFGSD S Sbjct: 208 YNGEIYMLPEGSQKGDLRLYRALKFPLQWTLEKVILKRPMVDSFIIQRDRSFWLFGSDIS 267 Query: 49 GFGAKKNGQLEIWYS 5 GF KKNG+LEIWYS Sbjct: 268 GFSTKKNGELEIWYS 282 >gb|EXB58473.1| hypothetical protein L484_005207 [Morus notabilis] Length = 775 Score = 348 bits (892), Expect = 2e-93 Identities = 170/260 (65%), Positives = 193/260 (74%), Gaps = 14/260 (5%) Frame = -2 Query: 739 SCCDMSLKCWCRWRWDH----CFLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSS--L 578 SCC +S+KCWCRWR H C LS TLYA F+P R++ L Sbjct: 28 SCCHVSMKCWCRWRCHHRLQRCLLSSGFVFSVACLALFGSLATLYARFAFAPGVRTTTGL 87 Query: 577 SSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFP 398 SS G DD+EGSWS+GVF+GDSPFSLKPIE+ NVW DES AWPVANPV+TCAS ++AGFP Sbjct: 88 SSFGRHDDNEGSWSVGVFFGDSPFSLKPIEAENVWNDESAAWPVANPVMTCASVSEAGFP 147 Query: 397 SNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYP 242 SNFVADPFLY+Q GDIGV +S D GATWQQLGIALDEEWHLSYP Sbjct: 148 SNFVADPFLYVQGDAFYLFYETKNSITMQGDIGVVKSTDGGATWQQLGIALDEEWHLSYP 207 Query: 241 YVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFG 62 YVF+ GQIYMMPESS+KGELRLY+A+ FPLQW L+KVI+K+PLVDSFI+ YN YWLFG Sbjct: 208 YVFEDLGQIYMMPESSKKGELRLYQAVKFPLQWRLKKVIMKKPLVDSFIIKYNNIYWLFG 267 Query: 61 SDHSGFGAKKNGQLEIWYSN 2 SDHSGFG KKNGQLEIWYS+ Sbjct: 268 SDHSGFGTKKNGQLEIWYSS 287 >ref|XP_003536299.1| PREDICTED: uncharacterized protein LOC100789310 [Glycine max] Length = 768 Score = 347 bits (890), Expect = 3e-93 Identities = 166/260 (63%), Positives = 195/260 (75%), Gaps = 14/260 (5%) Frame = -2 Query: 739 SCCDMSLKCWCRWRWDHC-----FLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLS 575 SCCDMS+KC CRWR ++ S TLY W FSP ++LS Sbjct: 20 SCCDMSVKCSCRWRLENQQYYKRLFSSGFIFFFGCFVLFGSIATLYGWFAFSPTVHTALS 79 Query: 574 S-LGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFP 398 S GC++D+EGSWSIGVFYGDSPFSLKPIE+ NV DE+ AWPVANPVVTCAS +D G+P Sbjct: 80 SSFGCREDNEGSWSIGVFYGDSPFSLKPIEAANVSNDETAAWPVANPVVTCASVSDVGYP 139 Query: 397 SNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYP 242 SNFVADPFL+IQ GDIGV++S D+GATWQQLGIAL+E+WHLSYP Sbjct: 140 SNFVADPFLFIQGNTFYLFYETKNSITMQGDIGVSKSTDKGATWQQLGIALNEDWHLSYP 199 Query: 241 YVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFG 62 YVF++DGQIYMMPE SQKG+LRLYRA+ FPLQW LEKV++K+PLVDSF++N+ G YWLFG Sbjct: 200 YVFEHDGQIYMMPEGSQKGDLRLYRAVNFPLQWRLEKVVMKKPLVDSFVINHGGRYWLFG 259 Query: 61 SDHSGFGAKKNGQLEIWYSN 2 SDHSGFG +KNGQLEIWYSN Sbjct: 260 SDHSGFGTQKNGQLEIWYSN 279 >ref|XP_002262646.1| PREDICTED: uncharacterized protein LOC100242107 [Vitis vinifera] gi|296090371|emb|CBI40190.3| unnamed protein product [Vitis vinifera] Length = 756 Score = 342 bits (877), Expect = 9e-92 Identities = 162/255 (63%), Positives = 193/255 (75%), Gaps = 9/255 (3%) Frame = -2 Query: 739 SCCDMSLKCWCRWRWDH-CFLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLSSLGC 563 SCC M+ RWRWDH CFLS +YAWL +P+ L+SLGC Sbjct: 19 SCCHMA-----RWRWDHHCFLSSTFVFFIASFVVYGFIAGVYAWLFVNPHAPLELASLGC 73 Query: 562 QDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPSNFVA 383 + DSEGSW+IGVFYGDSPFSL+PIE+ NVW++ES AWPVANPVVTCASA+DA FPSNFVA Sbjct: 74 RPDSEGSWAIGVFYGDSPFSLRPIEAMNVWRNESAAWPVANPVVTCASASDAVFPSNFVA 133 Query: 382 DPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPYVFDY 227 DPFLY+Q GDIGV++S D+GATWQ LG+ALDEEWHLSYPYVF+Y Sbjct: 134 DPFLYVQGDTLFLFYETKNSITMQGDIGVSKSDDKGATWQHLGVALDEEWHLSYPYVFEY 193 Query: 226 DGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGSDHSG 47 G+IYMMPE S KGELR+YRA+ FPLQW LEK+I+K+ LVDS I+N++G YW+FGSDH+G Sbjct: 194 LGKIYMMPECSGKGELRIYRALNFPLQWTLEKIIIKKHLVDSVIINHDGKYWIFGSDHTG 253 Query: 46 FGAKKNGQLEIWYSN 2 FGAKKNGQ+EIWYS+ Sbjct: 254 FGAKKNGQMEIWYSS 268 >ref|XP_004307152.1| PREDICTED: uncharacterized protein LOC101295367 [Fragaria vesca subsp. vesca] Length = 778 Score = 340 bits (872), Expect = 3e-91 Identities = 162/257 (63%), Positives = 194/257 (75%), Gaps = 12/257 (4%) Frame = -2 Query: 736 CCDMSLKC-WCRWRWDHCFLSXXXXXXXXXXXXXXXXXTLYAWLTFSPY---ERSSLSSL 569 CC+MSLKC C+WR C +S T Y W F+PY ++ S+L Sbjct: 37 CCNMSLKCRLCKWR---CLMSSGFVFFFGCCVLFGSVATFYVWFAFTPYYYARGTTPSAL 93 Query: 568 GCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPSNF 389 GCQ+D+EGSWS+GVF+GDSPF LKPIE+ NVW+++S AWPVANPVVTCAS +++GFPSNF Sbjct: 94 GCQEDNEGSWSVGVFFGDSPFHLKPIEAMNVWRNKSAAWPVANPVVTCASLSESGFPSNF 153 Query: 388 VADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPYVF 233 VADPFLY+Q GDIGV++S D+GATWQQLGIALDEEWHLSYPYVF Sbjct: 154 VADPFLYVQGDTLYMFYETKNSITMQGDIGVSKSSDKGATWQQLGIALDEEWHLSYPYVF 213 Query: 232 DYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGSDH 53 Y GQIYMMPESS GE+RLY+A++FP+QW LEKVILK+PLVDSF++NY+G YWLFGSDH Sbjct: 214 PYLGQIYMMPESSMNGEVRLYQALSFPMQWTLEKVILKKPLVDSFLINYDGAYWLFGSDH 273 Query: 52 SGFGAKKNGQLEIWYSN 2 SGFG KNGQLEIWYS+ Sbjct: 274 SGFGTTKNGQLEIWYSS 290 >gb|EXB58476.1| hypothetical protein L484_005210 [Morus notabilis] Length = 770 Score = 333 bits (853), Expect = 5e-89 Identities = 164/260 (63%), Positives = 189/260 (72%), Gaps = 14/260 (5%) Frame = -2 Query: 739 SCCDMSLKCWCRWRWDH----CFLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSS--L 578 SC +S+KCWC+ H LS LYA F+P R++ L Sbjct: 23 SCSHVSMKCWCQQHLHHQLQRFLLSSGFVFFVACLALFGSLAMLYARFAFAPGIRTTTGL 82 Query: 577 SSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFP 398 SS GC+DD+EGSWS+GVF+GDSPFSL+PIE+ NVW DES AWPVANPVVTCAS ++AGFP Sbjct: 83 SSFGCRDDNEGSWSVGVFFGDSPFSLQPIEAENVWSDESAAWPVANPVVTCASVSEAGFP 142 Query: 397 SNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYP 242 SNFVADPFLY+Q GDIGVA+S D GATWQQLGIALDEEWHLSYP Sbjct: 143 SNFVADPFLYVQSDALYLFYETKNSITMQGDIGVAKSTDGGATWQQLGIALDEEWHLSYP 202 Query: 241 YVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFG 62 YVF+ GQIYMMPE S KGELRLY+A+ FPLQW L+KVI+K+PLVDSFI+ YN YWLFG Sbjct: 203 YVFEDLGQIYMMPEGSVKGELRLYQAVKFPLQWRLKKVIMKKPLVDSFIIKYNDMYWLFG 262 Query: 61 SDHSGFGAKKNGQLEIWYSN 2 SDHSGFG +KNGQLEIWYS+ Sbjct: 263 SDHSGFGTQKNGQLEIWYSS 282 >ref|XP_007010125.1| Glycosyltransferase family protein 47 [Theobroma cacao] gi|508727038|gb|EOY18935.1| Glycosyltransferase family protein 47 [Theobroma cacao] Length = 779 Score = 333 bits (853), Expect = 5e-89 Identities = 154/219 (70%), Positives = 182/219 (83%), Gaps = 12/219 (5%) Frame = -2 Query: 622 LYAWLTFSP----YERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGA 455 LY W+ +P YER L LGCQ+D+EGSWSIG+F+G SPFSLKPIE+ +VW++ES A Sbjct: 73 LYGWVILTPSFFTYERRGLPWLGCQEDNEGSWSIGLFFGHSPFSLKPIETADVWRNESAA 132 Query: 454 WPVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEG 299 WPVANPV+TCASA+D+GFPSNFVADPFLY+Q GDIGVA+SID+G Sbjct: 133 WPVANPVITCASASDSGFPSNFVADPFLYVQGDVFYLFYETKNSFTMQGDIGVAKSIDKG 192 Query: 298 ATWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILK 119 ATWQQLGIALDE+WHLSYPYVF+Y GQIYMMPESSQKGELRLYRAI FPLQW L+++I+K Sbjct: 193 ATWQQLGIALDEDWHLSYPYVFNYLGQIYMMPESSQKGELRLYRAINFPLQWELDRIIIK 252 Query: 118 EPLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2 +PL+DSFI+N++G YWLFGSDHS FG KKNGQLEIWYS+ Sbjct: 253 KPLIDSFIINHDGEYWLFGSDHSSFGTKKNGQLEIWYSD 291 >ref|XP_006436587.1| hypothetical protein CICLE_v10030719mg [Citrus clementina] gi|568863642|ref|XP_006485243.1| PREDICTED: uncharacterized protein LOC102631491 [Citrus sinensis] gi|557538783|gb|ESR49827.1| hypothetical protein CICLE_v10030719mg [Citrus clementina] Length = 814 Score = 322 bits (825), Expect = 9e-86 Identities = 148/217 (68%), Positives = 176/217 (81%), Gaps = 10/217 (4%) Frame = -2 Query: 622 LYAWLTFS-PYE-RSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWP 449 LY WL PY + LSS GCQ+DSEGSWSIGVF+G+SPFSLKPIE+ NVW+D+S AWP Sbjct: 110 LYGWLALKKPYTVAAGLSSFGCQEDSEGSWSIGVFFGNSPFSLKPIETANVWRDDSAAWP 169 Query: 448 VANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGAT 293 VANP++TCAS + AGFPSNFVADPF Y+Q GDIGVA+S+D+GAT Sbjct: 170 VANPIMTCASVSSAGFPSNFVADPFFYLQGNDLYLFYETKNSITMQGDIGVAKSVDKGAT 229 Query: 292 WQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEP 113 WQQLGIALDE+WHLS+PYVFDY GQIYMMPES KGE+RLYRA+ FPL+W LEK+I+K+P Sbjct: 230 WQQLGIALDEDWHLSFPYVFDYHGQIYMMPESRAKGEVRLYRAVNFPLEWKLEKIIMKKP 289 Query: 112 LVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2 LVD F++N++G YWLFGSDHSGFG +NGQLEIWYS+ Sbjct: 290 LVDPFMINHDGQYWLFGSDHSGFGTTQNGQLEIWYSS 326 >ref|XP_004250015.1| PREDICTED: uncharacterized protein LOC101257919 [Solanum lycopersicum] Length = 768 Score = 319 bits (817), Expect = 8e-85 Identities = 145/215 (67%), Positives = 174/215 (80%), Gaps = 8/215 (3%) Frame = -2 Query: 622 LYAWLTFSPYERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVA 443 LY + P ++LSSLGC +D+EGSWSIGV+YGDSPF+LKPIE NVW++++ AWPVA Sbjct: 66 LYCRILLPPNVHTTLSSLGCNEDNEGSWSIGVYYGDSPFNLKPIEEANVWRNKTAAWPVA 125 Query: 442 NPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQ 287 NP+VTCASA+ A FPSNFVADPFLY+Q GDIGVARS D+GATW+ Sbjct: 126 NPIVTCASASGASFPSNFVADPFLYVQGDILYLFFEAKNSITMQGDIGVARSTDKGATWE 185 Query: 286 QLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLV 107 QLG+ALDE+WHLSYPYVFDY+G IYMMPE S KG+LRLYRA+ FP +W LEKVI+K+PLV Sbjct: 186 QLGVALDEDWHLSYPYVFDYNGNIYMMPEGSAKGDLRLYRAVKFPTEWELEKVIMKKPLV 245 Query: 106 DSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2 DSF++ ++G YWLFGSDHSG GAKKNGQLEIWYS+ Sbjct: 246 DSFLIQHDGKYWLFGSDHSGIGAKKNGQLEIWYSS 280 >ref|XP_006360502.1| PREDICTED: uncharacterized protein LOC102585335 [Solanum tuberosum] Length = 768 Score = 318 bits (816), Expect = 1e-84 Identities = 147/215 (68%), Positives = 173/215 (80%), Gaps = 8/215 (3%) Frame = -2 Query: 622 LYAWLTFSPYERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVA 443 LY + P + LSSLGC +D+EGSWSIGV+YGDSPF+LKPIE NVW++++ AWPVA Sbjct: 66 LYCRVLLPPNVHTILSSLGCNEDNEGSWSIGVYYGDSPFNLKPIEEANVWRNKTAAWPVA 125 Query: 442 NPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQ 287 NPVVTCASA+ A FPSNFVADPFLY+Q GDIGVARS D+GATW+ Sbjct: 126 NPVVTCASASGASFPSNFVADPFLYVQGDILYLFFEAKNSITMQGDIGVARSTDKGATWE 185 Query: 286 QLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLV 107 QLG+ALDE+WHLSYPYVFDY+G IYMMPE S KG+LRLYRA+ FP +W LEKVI+K+PLV Sbjct: 186 QLGVALDEDWHLSYPYVFDYNGNIYMMPEGSAKGDLRLYRAVKFPTEWKLEKVIMKKPLV 245 Query: 106 DSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2 DSFI+ ++G YWLFGSDHSG GAKKNGQLEIWYS+ Sbjct: 246 DSFIIQHDGKYWLFGSDHSGIGAKKNGQLEIWYSS 280 >ref|XP_002532924.1| transferase, transferring glycosyl groups, putative [Ricinus communis] gi|223527317|gb|EEF29466.1| transferase, transferring glycosyl groups, putative [Ricinus communis] Length = 704 Score = 318 bits (816), Expect = 1e-84 Identities = 146/201 (72%), Positives = 169/201 (84%), Gaps = 8/201 (3%) Frame = -2 Query: 580 LSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGF 401 L+S+GC+ D+EGSWSIGVFYG SPFSLKPIE+ NVWKD+S AWPVANPV+TCAS +DAGF Sbjct: 16 LNSVGCRQDNEGSWSIGVFYGHSPFSLKPIETMNVWKDDSAAWPVANPVITCASVSDAGF 75 Query: 400 PSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSY 245 PSNFVADPFLYIQ GDIGVA+S D+GATWQQLGIALDE+WHLSY Sbjct: 76 PSNFVADPFLYIQGDIIYIFYETKNSITMQGDIGVAKSTDKGATWQQLGIALDEDWHLSY 135 Query: 244 PYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLF 65 PYVFDY G+IYMMPE S KGELRLYRAI FPLQW LEK+++K+PLVDSF++ ++G +WLF Sbjct: 136 PYVFDYLGEIYMMPEGSAKGELRLYRAINFPLQWTLEKILIKKPLVDSFVIKHDGEFWLF 195 Query: 64 GSDHSGFGAKKNGQLEIWYSN 2 GSDHS FG KKNGQLEIW+S+ Sbjct: 196 GSDHSSFGTKKNGQLEIWHSS 216 >ref|XP_004142449.1| PREDICTED: uncharacterized protein LOC101212638 [Cucumis sativus] gi|449513220|ref|XP_004164265.1| PREDICTED: uncharacterized LOC101212638 [Cucumis sativus] Length = 783 Score = 316 bits (809), Expect = 7e-84 Identities = 150/250 (60%), Positives = 186/250 (74%), Gaps = 14/250 (5%) Frame = -2 Query: 709 CRWRWDHCFL---SXXXXXXXXXXXXXXXXXTLYAWLTFSP-YERS--SLSSLGCQDDSE 548 C W+W + S TLYAWL F+P Y R+ +SSLGCQ+D+E Sbjct: 46 CGWKWQQRHIRLVSSGFVFFFGCFVLFGSIATLYAWLAFTPQYVRTIGGVSSLGCQEDNE 105 Query: 547 GSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPSNFVADPFLY 368 GSWSIGVFYGDSPFSLKPIE NVW++ES AWPVANPV+ CAS ++AGFPSNFVADPFL+ Sbjct: 106 GSWSIGVFYGDSPFSLKPIEDANVWRNESAAWPVANPVINCASVSNAGFPSNFVADPFLF 165 Query: 367 IQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPYVFDYDGQIY 212 +Q GDIGVA+S+D GATWQQLG+AL+E+WHLS+P+VF++ G+IY Sbjct: 166 VQGDTIYLFYETKNSVSLQGDIGVAKSVDNGATWQQLGVALNEKWHLSFPFVFEHLGEIY 225 Query: 211 MMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGSDHSGFGAKK 32 MMPESS+KGE+RLYRA+ FPL+W L+++ILK+PLVDS I+N+NG YWLFGSDH G G K+ Sbjct: 226 MMPESSKKGEVRLYRAVNFPLKWELDRIILKKPLVDSVIINHNGMYWLFGSDHRGLGTKR 285 Query: 31 NGQLEIWYSN 2 NG L IWYS+ Sbjct: 286 NGHLAIWYSS 295 >ref|XP_002871098.1| glycosyltransferase family protein 47 [Arabidopsis lyrata subsp. lyrata] gi|297316935|gb|EFH47357.1| glycosyltransferase family protein 47 [Arabidopsis lyrata subsp. lyrata] Length = 765 Score = 315 bits (807), Expect = 1e-83 Identities = 147/218 (67%), Positives = 177/218 (81%), Gaps = 12/218 (5%) Frame = -2 Query: 619 YAWLTFSPY----ERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAW 452 YAW F P+ + S SSLGC++D+EGSWSIGVFYGDSPFSLKPIE+ NVW++ESGAW Sbjct: 58 YAWFVFPPHIGRTDHVSSSSLGCREDNEGSWSIGVFYGDSPFSLKPIETINVWRNESGAW 117 Query: 451 PVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGA 296 PVANPV+TCAS T+AG PSNFVADPFLY+Q GDIGVA+SID+GA Sbjct: 118 PVANPVITCASFTNAGLPSNFVADPFLYVQGDTLYLFFETKSPITMQGDIGVAKSIDKGA 177 Query: 295 TWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKE 116 TW+ LGIALDE WHLS+P+VF+Y+G+I+MMPES++ G+L LYRA+ FPL W LEKVILK+ Sbjct: 178 TWEPLGIALDEAWHLSFPFVFNYNGEIFMMPESNEIGQLNLYRAVNFPLSWKLEKVILKK 237 Query: 115 PLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2 PLVDS ++++ G YWLFGSDHS FGAKKNGQLEIWYS+ Sbjct: 238 PLVDSTLIHHEGIYWLFGSDHSSFGAKKNGQLEIWYSS 275 >ref|NP_196070.2| glycosyltransferase family protein 47 [Arabidopsis thaliana] gi|28393253|gb|AAO42055.1| unknown protein [Arabidopsis thaliana] gi|332003370|gb|AED90753.1| glycosyltransferase family protein 47 [Arabidopsis thaliana] gi|591401836|gb|AHL38645.1| glycosyltransferase, partial [Arabidopsis thaliana] Length = 765 Score = 313 bits (803), Expect = 3e-83 Identities = 145/218 (66%), Positives = 176/218 (80%), Gaps = 12/218 (5%) Frame = -2 Query: 619 YAWLTFSPY----ERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAW 452 YAW F P+ + S SSLGC++D+EGSWSIGVFYGDSPFSLKPIE+RNVW++ESGAW Sbjct: 58 YAWFVFPPHIGRTDHVSSSSLGCREDNEGSWSIGVFYGDSPFSLKPIETRNVWRNESGAW 117 Query: 451 PVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGA 296 PV NPV+TCAS T++G PSNF+ADPFLY+Q GDIG A+SID+GA Sbjct: 118 PVTNPVITCASFTNSGLPSNFLADPFLYVQGDTLYLFFETKSPITMQGDIGAAKSIDKGA 177 Query: 295 TWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKE 116 TW+ LGIALDE WHLS+P+VF+Y+G+IYMMPES++ G+L LYRA+ FPL W LEKVILK+ Sbjct: 178 TWEPLGIALDEAWHLSFPFVFNYNGEIYMMPESNEIGQLNLYRAVNFPLSWKLEKVILKK 237 Query: 115 PLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2 PLVDS I+++ G YWL GSDH+GFGAKKNGQLEIWYS+ Sbjct: 238 PLVDSTIVHHEGIYWLIGSDHTGFGAKKNGQLEIWYSS 275 >emb|CAB85556.1| putative protein [Arabidopsis thaliana] Length = 764 Score = 313 bits (803), Expect = 3e-83 Identities = 145/218 (66%), Positives = 176/218 (80%), Gaps = 12/218 (5%) Frame = -2 Query: 619 YAWLTFSPY----ERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAW 452 YAW F P+ + S SSLGC++D+EGSWSIGVFYGDSPFSLKPIE+RNVW++ESGAW Sbjct: 57 YAWFVFPPHIGRTDHVSSSSLGCREDNEGSWSIGVFYGDSPFSLKPIETRNVWRNESGAW 116 Query: 451 PVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGA 296 PV NPV+TCAS T++G PSNF+ADPFLY+Q GDIG A+SID+GA Sbjct: 117 PVTNPVITCASFTNSGLPSNFLADPFLYVQGDTLYLFFETKSPITMQGDIGAAKSIDKGA 176 Query: 295 TWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKE 116 TW+ LGIALDE WHLS+P+VF+Y+G+IYMMPES++ G+L LYRA+ FPL W LEKVILK+ Sbjct: 177 TWEPLGIALDEAWHLSFPFVFNYNGEIYMMPESNEIGQLNLYRAVNFPLSWKLEKVILKK 236 Query: 115 PLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2 PLVDS I+++ G YWL GSDH+GFGAKKNGQLEIWYS+ Sbjct: 237 PLVDSTIVHHEGIYWLIGSDHTGFGAKKNGQLEIWYSS 274 >ref|XP_006289714.1| hypothetical protein CARUB_v10003280mg [Capsella rubella] gi|482558420|gb|EOA22612.1| hypothetical protein CARUB_v10003280mg [Capsella rubella] Length = 762 Score = 311 bits (798), Expect = 1e-82 Identities = 147/218 (67%), Positives = 175/218 (80%), Gaps = 12/218 (5%) Frame = -2 Query: 619 YAWLTFSPY----ERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAW 452 YAWL F P+ + S SSLGC++D+EGSWSIGVFYGDSPFSLKPIES VW++ESGAW Sbjct: 55 YAWLAFPPHIGRTDHVSWSSLGCREDNEGSWSIGVFYGDSPFSLKPIESMKVWRNESGAW 114 Query: 451 PVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGA 296 PV+NPV+TCAS T++G PSNFVADPFLY+Q GDIGVA+SID+GA Sbjct: 115 PVSNPVLTCASLTNSGLPSNFVADPFLYVQGDTLYLFFETKNPITMQGDIGVAKSIDKGA 174 Query: 295 TWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKE 116 TW LGIALDE WHLS+P+VF+Y+G+IYMMPES++ G+L LYRA+ FPL W LEKVI+K+ Sbjct: 175 TWIPLGIALDEAWHLSFPFVFNYNGEIYMMPESNELGQLNLYRALNFPLSWKLEKVIMKK 234 Query: 115 PLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2 LVDS I+++ G YWLFGSDHS FGAKKNGQLEIWYSN Sbjct: 235 RLVDSTIIHHEGIYWLFGSDHSSFGAKKNGQLEIWYSN 272