BLASTX nr result

ID: Akebia26_contig00013800 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00013800
         (776 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003555466.1| PREDICTED: uncharacterized protein LOC100790...   358   1e-96
ref|XP_004496161.1| PREDICTED: uncharacterized protein LOC101507...   355   1e-95
ref|XP_007144298.1| hypothetical protein PHAVU_007G144400g [Phas...   353   4e-95
ref|XP_006379496.1| hypothetical protein POPTR_0008s02900g [Popu...   352   8e-95
ref|XP_006838728.1| hypothetical protein AMTR_s00002p00253220 [A...   349   6e-94
gb|EXB58473.1| hypothetical protein L484_005207 [Morus notabilis]     348   2e-93
ref|XP_003536299.1| PREDICTED: uncharacterized protein LOC100789...   347   3e-93
ref|XP_002262646.1| PREDICTED: uncharacterized protein LOC100242...   342   9e-92
ref|XP_004307152.1| PREDICTED: uncharacterized protein LOC101295...   340   3e-91
gb|EXB58476.1| hypothetical protein L484_005210 [Morus notabilis]     333   5e-89
ref|XP_007010125.1| Glycosyltransferase family protein 47 [Theob...   333   5e-89
ref|XP_006436587.1| hypothetical protein CICLE_v10030719mg [Citr...   322   9e-86
ref|XP_004250015.1| PREDICTED: uncharacterized protein LOC101257...   319   8e-85
ref|XP_006360502.1| PREDICTED: uncharacterized protein LOC102585...   318   1e-84
ref|XP_002532924.1| transferase, transferring glycosyl groups, p...   318   1e-84
ref|XP_004142449.1| PREDICTED: uncharacterized protein LOC101212...   316   7e-84
ref|XP_002871098.1| glycosyltransferase family protein 47 [Arabi...   315   1e-83
ref|NP_196070.2| glycosyltransferase family protein 47 [Arabidop...   313   3e-83
emb|CAB85556.1| putative protein [Arabidopsis thaliana]               313   3e-83
ref|XP_006289714.1| hypothetical protein CARUB_v10003280mg [Caps...   311   1e-82

>ref|XP_003555466.1| PREDICTED: uncharacterized protein LOC100790409 [Glycine max]
          Length = 761

 Score =  358 bits (919), Expect = 1e-96
 Identities = 170/259 (65%), Positives = 197/259 (76%), Gaps = 13/259 (5%)
 Frame = -2

Query: 739 SCCDMSLKCWCRWRWDHC-----FLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLS 575
           SCCDMS+KC CRWR ++        S                 TLY WL FSP   +SLS
Sbjct: 14  SCCDMSVKCSCRWRLENQQYYKRLFSSGFVFFFGCFVLFGSIATLYGWLAFSPTVHTSLS 73

Query: 574 SLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPS 395
           S GC+DD+EGSWS+GVFYGDSPFSLKPIE+ NV  DES AWPVANPVVTCAS +DAG+PS
Sbjct: 74  SFGCRDDNEGSWSVGVFYGDSPFSLKPIEAANVSNDESAAWPVANPVVTCASVSDAGYPS 133

Query: 394 NFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPY 239
           NFVADPFL+IQ                 GDIGV++S D+GATWQQLGIAL+E+WHLSYPY
Sbjct: 134 NFVADPFLFIQGNTFYLFYETKSSITMQGDIGVSKSTDKGATWQQLGIALNEDWHLSYPY 193

Query: 238 VFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGS 59
           VF++DGQIYMMPE SQKG+LRLYRA+ FPLQW LEKV++K+PLVDSF++N+ G YWLFGS
Sbjct: 194 VFEHDGQIYMMPEGSQKGDLRLYRAVNFPLQWRLEKVVMKKPLVDSFVINHGGRYWLFGS 253

Query: 58  DHSGFGAKKNGQLEIWYSN 2
           DHSGFG +KNGQLEIWYSN
Sbjct: 254 DHSGFGTQKNGQLEIWYSN 272


>ref|XP_004496161.1| PREDICTED: uncharacterized protein LOC101507497 [Cicer arietinum]
          Length = 773

 Score =  355 bits (910), Expect = 1e-95
 Identities = 164/259 (63%), Positives = 195/259 (75%), Gaps = 13/259 (5%)
 Frame = -2

Query: 739 SCCDMSLKCWCRWRWDHC-----FLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLS 575
           SCCDMS+KCWCRWR ++        S                 + Y WL FSP   +++S
Sbjct: 27  SCCDMSMKCWCRWRMENQHYYNRIFSSGFVFFFGCFVLFGSIASFYGWLAFSPSVHTAIS 86

Query: 574 SLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPS 395
             GCQDD+EGSWSIG+FYG SPFSLKPIES NV  D+S +WPVANPVVTCAS +DAGFPS
Sbjct: 87  PFGCQDDNEGSWSIGIFYGHSPFSLKPIESSNVSNDDSASWPVANPVVTCASVSDAGFPS 146

Query: 394 NFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPY 239
           NFVADPFL+IQ                 GDIGV++S D+GATWQQLGIAL+E+WHLSYPY
Sbjct: 147 NFVADPFLFIQGDTLYLFYETKNSITMQGDIGVSKSTDKGATWQQLGIALNEDWHLSYPY 206

Query: 238 VFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGS 59
           VF++DGQIYMMPE S++G+LRLY+A+ FPLQW LEKV++K+PL+DSFI++Y G YWLFGS
Sbjct: 207 VFEHDGQIYMMPEGSKRGDLRLYKAVNFPLQWKLEKVLIKKPLIDSFIVDYGGKYWLFGS 266

Query: 58  DHSGFGAKKNGQLEIWYSN 2
           DHSGFG KKNGQLEIWYSN
Sbjct: 267 DHSGFGTKKNGQLEIWYSN 285


>ref|XP_007144298.1| hypothetical protein PHAVU_007G144400g [Phaseolus vulgaris]
           gi|561017488|gb|ESW16292.1| hypothetical protein
           PHAVU_007G144400g [Phaseolus vulgaris]
          Length = 768

 Score =  353 bits (906), Expect = 4e-95
 Identities = 172/259 (66%), Positives = 196/259 (75%), Gaps = 13/259 (5%)
 Frame = -2

Query: 739 SCCDMSLKCWCRWRWDHC-----FLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLS 575
           SCCDMS+KC CRWR ++       LS                 TLY W+ F P  RSSL+
Sbjct: 23  SCCDMSVKCSCRWRLENQQYYKRLLSSGFVFFFGCFVLFGSIATLYGWVAFPPTVRSSLN 82

Query: 574 SLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPS 395
             GC+DD+EGSWSIG+FYGDSPFSLKPIE+ NV  DES AWPVANPVVTCAS +DAGFPS
Sbjct: 83  --GCRDDNEGSWSIGIFYGDSPFSLKPIEAANVSHDESAAWPVANPVVTCASVSDAGFPS 140

Query: 394 NFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPY 239
           NFVADPFL+IQ                 GDIGV++S D+GATWQQLGIAL+E+WHLSYPY
Sbjct: 141 NFVADPFLFIQGNTFYLFYETKNSITYQGDIGVSKSTDKGATWQQLGIALNEDWHLSYPY 200

Query: 238 VFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGS 59
           VF++DGQIYMMPE S+KG+LRLYRA+ FPLQW L KVI+K PLVDSFI+NY G YWLFGS
Sbjct: 201 VFEHDGQIYMMPEGSKKGDLRLYRAVNFPLQWRLAKVIIKRPLVDSFIINYGGRYWLFGS 260

Query: 58  DHSGFGAKKNGQLEIWYSN 2
           DHSGFG+KKNGQLEIWYSN
Sbjct: 261 DHSGFGSKKNGQLEIWYSN 279


>ref|XP_006379496.1| hypothetical protein POPTR_0008s02900g [Populus trichocarpa]
           gi|550332290|gb|ERP57293.1| hypothetical protein
           POPTR_0008s02900g [Populus trichocarpa]
          Length = 789

 Score =  352 bits (903), Expect = 8e-95
 Identities = 172/281 (61%), Positives = 198/281 (70%), Gaps = 35/281 (12%)
 Frame = -2

Query: 739 SCCDMSLKCWCRWRWDH---------------------CFLSXXXXXXXXXXXXXXXXXT 623
           +CCDMSL+CWCRW+W +                        S                  
Sbjct: 21  NCCDMSLRCWCRWKWGNHQQQQQPQQNHHNLLHQRLVSLVFSSGFMFFLGCLVLYGSIGM 80

Query: 622 LYAWLTFS-PYERSS-----LSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDES 461
            Y WL FS PY RS+     L+SLGCQ+D+EGSWSIGVFYGDSPFSLKPIE+ N W+DE 
Sbjct: 81  FYGWLVFSKPYSRSTNVGVGLNSLGCQEDNEGSWSIGVFYGDSPFSLKPIEAMNEWRDEG 140

Query: 460 GAWPVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSID 305
            AWPVANPVVTCAS +DA FPSNFVADPFLY+Q                 GDI VA+S+D
Sbjct: 141 VAWPVANPVVTCASLSDANFPSNFVADPFLYVQGDTLFLFYETKNSITMQGDIAVAKSMD 200

Query: 304 EGATWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVI 125
           +GATWQQLGIALDE+WHLSYPYVF+Y GQIYMMPESSQKGELRLYRA+ FPLQW LEKV+
Sbjct: 201 KGATWQQLGIALDEDWHLSYPYVFNYLGQIYMMPESSQKGELRLYRALNFPLQWTLEKVL 260

Query: 124 LKEPLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2
           +K+PLVDSFI+N+ G YWLFGSDHSGFG ++NGQLEIWYS+
Sbjct: 261 IKKPLVDSFIINHAGIYWLFGSDHSGFGTRRNGQLEIWYSS 301


>ref|XP_006838728.1| hypothetical protein AMTR_s00002p00253220 [Amborella trichopoda]
           gi|548841234|gb|ERN01297.1| hypothetical protein
           AMTR_s00002p00253220 [Amborella trichopoda]
          Length = 762

 Score =  349 bits (896), Expect = 6e-94
 Identities = 165/255 (64%), Positives = 191/255 (74%), Gaps = 11/255 (4%)
 Frame = -2

Query: 736 CCDMSLKCWCRWR--WDHCFL-SXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLSSLG 566
           CCDM LKCWCRW   +DH FL S                   +AWLTFSPYER  LSS G
Sbjct: 28  CCDMRLKCWCRWHPTFDHSFLISSAFSFFLISSLLFGSLALAFAWLTFSPYERPRLSSYG 87

Query: 565 CQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPSNFV 386
           CQDD+EGSWSIGV+YGD+PFSLKP+E RNVW D+  AWPVANPV+TCA A+DAG+PSNFV
Sbjct: 88  CQDDNEGSWSIGVYYGDNPFSLKPLELRNVWSDKGLAWPVANPVMTCALASDAGYPSNFV 147

Query: 385 ADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPYVFD 230
           ADPFLY+Q                 G+IGVARS+D  ATW+ LGIALDEEWHLS+PYVF 
Sbjct: 148 ADPFLYVQDDILYMFFETKNSVTLKGEIGVARSLDNSATWEHLGIALDEEWHLSFPYVFS 207

Query: 229 YDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGSDHS 50
           Y+G+IYM+PE SQKG+LRLYRA+ FPLQW LEKVILK P+VDSFI+  + ++WLFGSD S
Sbjct: 208 YNGEIYMLPEGSQKGDLRLYRALKFPLQWTLEKVILKRPMVDSFIIQRDRSFWLFGSDIS 267

Query: 49  GFGAKKNGQLEIWYS 5
           GF  KKNG+LEIWYS
Sbjct: 268 GFSTKKNGELEIWYS 282


>gb|EXB58473.1| hypothetical protein L484_005207 [Morus notabilis]
          Length = 775

 Score =  348 bits (892), Expect = 2e-93
 Identities = 170/260 (65%), Positives = 193/260 (74%), Gaps = 14/260 (5%)
 Frame = -2

Query: 739 SCCDMSLKCWCRWRWDH----CFLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSS--L 578
           SCC +S+KCWCRWR  H    C LS                 TLYA   F+P  R++  L
Sbjct: 28  SCCHVSMKCWCRWRCHHRLQRCLLSSGFVFSVACLALFGSLATLYARFAFAPGVRTTTGL 87

Query: 577 SSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFP 398
           SS G  DD+EGSWS+GVF+GDSPFSLKPIE+ NVW DES AWPVANPV+TCAS ++AGFP
Sbjct: 88  SSFGRHDDNEGSWSVGVFFGDSPFSLKPIEAENVWNDESAAWPVANPVMTCASVSEAGFP 147

Query: 397 SNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYP 242
           SNFVADPFLY+Q                 GDIGV +S D GATWQQLGIALDEEWHLSYP
Sbjct: 148 SNFVADPFLYVQGDAFYLFYETKNSITMQGDIGVVKSTDGGATWQQLGIALDEEWHLSYP 207

Query: 241 YVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFG 62
           YVF+  GQIYMMPESS+KGELRLY+A+ FPLQW L+KVI+K+PLVDSFI+ YN  YWLFG
Sbjct: 208 YVFEDLGQIYMMPESSKKGELRLYQAVKFPLQWRLKKVIMKKPLVDSFIIKYNNIYWLFG 267

Query: 61  SDHSGFGAKKNGQLEIWYSN 2
           SDHSGFG KKNGQLEIWYS+
Sbjct: 268 SDHSGFGTKKNGQLEIWYSS 287


>ref|XP_003536299.1| PREDICTED: uncharacterized protein LOC100789310 [Glycine max]
          Length = 768

 Score =  347 bits (890), Expect = 3e-93
 Identities = 166/260 (63%), Positives = 195/260 (75%), Gaps = 14/260 (5%)
 Frame = -2

Query: 739 SCCDMSLKCWCRWRWDHC-----FLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLS 575
           SCCDMS+KC CRWR ++        S                 TLY W  FSP   ++LS
Sbjct: 20  SCCDMSVKCSCRWRLENQQYYKRLFSSGFIFFFGCFVLFGSIATLYGWFAFSPTVHTALS 79

Query: 574 S-LGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFP 398
           S  GC++D+EGSWSIGVFYGDSPFSLKPIE+ NV  DE+ AWPVANPVVTCAS +D G+P
Sbjct: 80  SSFGCREDNEGSWSIGVFYGDSPFSLKPIEAANVSNDETAAWPVANPVVTCASVSDVGYP 139

Query: 397 SNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYP 242
           SNFVADPFL+IQ                 GDIGV++S D+GATWQQLGIAL+E+WHLSYP
Sbjct: 140 SNFVADPFLFIQGNTFYLFYETKNSITMQGDIGVSKSTDKGATWQQLGIALNEDWHLSYP 199

Query: 241 YVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFG 62
           YVF++DGQIYMMPE SQKG+LRLYRA+ FPLQW LEKV++K+PLVDSF++N+ G YWLFG
Sbjct: 200 YVFEHDGQIYMMPEGSQKGDLRLYRAVNFPLQWRLEKVVMKKPLVDSFVINHGGRYWLFG 259

Query: 61  SDHSGFGAKKNGQLEIWYSN 2
           SDHSGFG +KNGQLEIWYSN
Sbjct: 260 SDHSGFGTQKNGQLEIWYSN 279


>ref|XP_002262646.1| PREDICTED: uncharacterized protein LOC100242107 [Vitis vinifera]
           gi|296090371|emb|CBI40190.3| unnamed protein product
           [Vitis vinifera]
          Length = 756

 Score =  342 bits (877), Expect = 9e-92
 Identities = 162/255 (63%), Positives = 193/255 (75%), Gaps = 9/255 (3%)
 Frame = -2

Query: 739 SCCDMSLKCWCRWRWDH-CFLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSSLSSLGC 563
           SCC M+     RWRWDH CFLS                  +YAWL  +P+    L+SLGC
Sbjct: 19  SCCHMA-----RWRWDHHCFLSSTFVFFIASFVVYGFIAGVYAWLFVNPHAPLELASLGC 73

Query: 562 QDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPSNFVA 383
           + DSEGSW+IGVFYGDSPFSL+PIE+ NVW++ES AWPVANPVVTCASA+DA FPSNFVA
Sbjct: 74  RPDSEGSWAIGVFYGDSPFSLRPIEAMNVWRNESAAWPVANPVVTCASASDAVFPSNFVA 133

Query: 382 DPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPYVFDY 227
           DPFLY+Q                 GDIGV++S D+GATWQ LG+ALDEEWHLSYPYVF+Y
Sbjct: 134 DPFLYVQGDTLFLFYETKNSITMQGDIGVSKSDDKGATWQHLGVALDEEWHLSYPYVFEY 193

Query: 226 DGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGSDHSG 47
            G+IYMMPE S KGELR+YRA+ FPLQW LEK+I+K+ LVDS I+N++G YW+FGSDH+G
Sbjct: 194 LGKIYMMPECSGKGELRIYRALNFPLQWTLEKIIIKKHLVDSVIINHDGKYWIFGSDHTG 253

Query: 46  FGAKKNGQLEIWYSN 2
           FGAKKNGQ+EIWYS+
Sbjct: 254 FGAKKNGQMEIWYSS 268


>ref|XP_004307152.1| PREDICTED: uncharacterized protein LOC101295367 [Fragaria vesca
           subsp. vesca]
          Length = 778

 Score =  340 bits (872), Expect = 3e-91
 Identities = 162/257 (63%), Positives = 194/257 (75%), Gaps = 12/257 (4%)
 Frame = -2

Query: 736 CCDMSLKC-WCRWRWDHCFLSXXXXXXXXXXXXXXXXXTLYAWLTFSPY---ERSSLSSL 569
           CC+MSLKC  C+WR   C +S                 T Y W  F+PY     ++ S+L
Sbjct: 37  CCNMSLKCRLCKWR---CLMSSGFVFFFGCCVLFGSVATFYVWFAFTPYYYARGTTPSAL 93

Query: 568 GCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPSNF 389
           GCQ+D+EGSWS+GVF+GDSPF LKPIE+ NVW+++S AWPVANPVVTCAS +++GFPSNF
Sbjct: 94  GCQEDNEGSWSVGVFFGDSPFHLKPIEAMNVWRNKSAAWPVANPVVTCASLSESGFPSNF 153

Query: 388 VADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPYVF 233
           VADPFLY+Q                 GDIGV++S D+GATWQQLGIALDEEWHLSYPYVF
Sbjct: 154 VADPFLYVQGDTLYMFYETKNSITMQGDIGVSKSSDKGATWQQLGIALDEEWHLSYPYVF 213

Query: 232 DYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGSDH 53
            Y GQIYMMPESS  GE+RLY+A++FP+QW LEKVILK+PLVDSF++NY+G YWLFGSDH
Sbjct: 214 PYLGQIYMMPESSMNGEVRLYQALSFPMQWTLEKVILKKPLVDSFLINYDGAYWLFGSDH 273

Query: 52  SGFGAKKNGQLEIWYSN 2
           SGFG  KNGQLEIWYS+
Sbjct: 274 SGFGTTKNGQLEIWYSS 290


>gb|EXB58476.1| hypothetical protein L484_005210 [Morus notabilis]
          Length = 770

 Score =  333 bits (853), Expect = 5e-89
 Identities = 164/260 (63%), Positives = 189/260 (72%), Gaps = 14/260 (5%)
 Frame = -2

Query: 739 SCCDMSLKCWCRWRWDH----CFLSXXXXXXXXXXXXXXXXXTLYAWLTFSPYERSS--L 578
           SC  +S+KCWC+    H      LS                  LYA   F+P  R++  L
Sbjct: 23  SCSHVSMKCWCQQHLHHQLQRFLLSSGFVFFVACLALFGSLAMLYARFAFAPGIRTTTGL 82

Query: 577 SSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFP 398
           SS GC+DD+EGSWS+GVF+GDSPFSL+PIE+ NVW DES AWPVANPVVTCAS ++AGFP
Sbjct: 83  SSFGCRDDNEGSWSVGVFFGDSPFSLQPIEAENVWSDESAAWPVANPVVTCASVSEAGFP 142

Query: 397 SNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYP 242
           SNFVADPFLY+Q                 GDIGVA+S D GATWQQLGIALDEEWHLSYP
Sbjct: 143 SNFVADPFLYVQSDALYLFYETKNSITMQGDIGVAKSTDGGATWQQLGIALDEEWHLSYP 202

Query: 241 YVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFG 62
           YVF+  GQIYMMPE S KGELRLY+A+ FPLQW L+KVI+K+PLVDSFI+ YN  YWLFG
Sbjct: 203 YVFEDLGQIYMMPEGSVKGELRLYQAVKFPLQWRLKKVIMKKPLVDSFIIKYNDMYWLFG 262

Query: 61  SDHSGFGAKKNGQLEIWYSN 2
           SDHSGFG +KNGQLEIWYS+
Sbjct: 263 SDHSGFGTQKNGQLEIWYSS 282


>ref|XP_007010125.1| Glycosyltransferase family protein 47 [Theobroma cacao]
           gi|508727038|gb|EOY18935.1| Glycosyltransferase family
           protein 47 [Theobroma cacao]
          Length = 779

 Score =  333 bits (853), Expect = 5e-89
 Identities = 154/219 (70%), Positives = 182/219 (83%), Gaps = 12/219 (5%)
 Frame = -2

Query: 622 LYAWLTFSP----YERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGA 455
           LY W+  +P    YER  L  LGCQ+D+EGSWSIG+F+G SPFSLKPIE+ +VW++ES A
Sbjct: 73  LYGWVILTPSFFTYERRGLPWLGCQEDNEGSWSIGLFFGHSPFSLKPIETADVWRNESAA 132

Query: 454 WPVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEG 299
           WPVANPV+TCASA+D+GFPSNFVADPFLY+Q                 GDIGVA+SID+G
Sbjct: 133 WPVANPVITCASASDSGFPSNFVADPFLYVQGDVFYLFYETKNSFTMQGDIGVAKSIDKG 192

Query: 298 ATWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILK 119
           ATWQQLGIALDE+WHLSYPYVF+Y GQIYMMPESSQKGELRLYRAI FPLQW L+++I+K
Sbjct: 193 ATWQQLGIALDEDWHLSYPYVFNYLGQIYMMPESSQKGELRLYRAINFPLQWELDRIIIK 252

Query: 118 EPLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2
           +PL+DSFI+N++G YWLFGSDHS FG KKNGQLEIWYS+
Sbjct: 253 KPLIDSFIINHDGEYWLFGSDHSSFGTKKNGQLEIWYSD 291


>ref|XP_006436587.1| hypothetical protein CICLE_v10030719mg [Citrus clementina]
           gi|568863642|ref|XP_006485243.1| PREDICTED:
           uncharacterized protein LOC102631491 [Citrus sinensis]
           gi|557538783|gb|ESR49827.1| hypothetical protein
           CICLE_v10030719mg [Citrus clementina]
          Length = 814

 Score =  322 bits (825), Expect = 9e-86
 Identities = 148/217 (68%), Positives = 176/217 (81%), Gaps = 10/217 (4%)
 Frame = -2

Query: 622 LYAWLTFS-PYE-RSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWP 449
           LY WL    PY   + LSS GCQ+DSEGSWSIGVF+G+SPFSLKPIE+ NVW+D+S AWP
Sbjct: 110 LYGWLALKKPYTVAAGLSSFGCQEDSEGSWSIGVFFGNSPFSLKPIETANVWRDDSAAWP 169

Query: 448 VANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGAT 293
           VANP++TCAS + AGFPSNFVADPF Y+Q                 GDIGVA+S+D+GAT
Sbjct: 170 VANPIMTCASVSSAGFPSNFVADPFFYLQGNDLYLFYETKNSITMQGDIGVAKSVDKGAT 229

Query: 292 WQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEP 113
           WQQLGIALDE+WHLS+PYVFDY GQIYMMPES  KGE+RLYRA+ FPL+W LEK+I+K+P
Sbjct: 230 WQQLGIALDEDWHLSFPYVFDYHGQIYMMPESRAKGEVRLYRAVNFPLEWKLEKIIMKKP 289

Query: 112 LVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2
           LVD F++N++G YWLFGSDHSGFG  +NGQLEIWYS+
Sbjct: 290 LVDPFMINHDGQYWLFGSDHSGFGTTQNGQLEIWYSS 326


>ref|XP_004250015.1| PREDICTED: uncharacterized protein LOC101257919 [Solanum
           lycopersicum]
          Length = 768

 Score =  319 bits (817), Expect = 8e-85
 Identities = 145/215 (67%), Positives = 174/215 (80%), Gaps = 8/215 (3%)
 Frame = -2

Query: 622 LYAWLTFSPYERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVA 443
           LY  +   P   ++LSSLGC +D+EGSWSIGV+YGDSPF+LKPIE  NVW++++ AWPVA
Sbjct: 66  LYCRILLPPNVHTTLSSLGCNEDNEGSWSIGVYYGDSPFNLKPIEEANVWRNKTAAWPVA 125

Query: 442 NPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQ 287
           NP+VTCASA+ A FPSNFVADPFLY+Q                 GDIGVARS D+GATW+
Sbjct: 126 NPIVTCASASGASFPSNFVADPFLYVQGDILYLFFEAKNSITMQGDIGVARSTDKGATWE 185

Query: 286 QLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLV 107
           QLG+ALDE+WHLSYPYVFDY+G IYMMPE S KG+LRLYRA+ FP +W LEKVI+K+PLV
Sbjct: 186 QLGVALDEDWHLSYPYVFDYNGNIYMMPEGSAKGDLRLYRAVKFPTEWELEKVIMKKPLV 245

Query: 106 DSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2
           DSF++ ++G YWLFGSDHSG GAKKNGQLEIWYS+
Sbjct: 246 DSFLIQHDGKYWLFGSDHSGIGAKKNGQLEIWYSS 280


>ref|XP_006360502.1| PREDICTED: uncharacterized protein LOC102585335 [Solanum tuberosum]
          Length = 768

 Score =  318 bits (816), Expect = 1e-84
 Identities = 147/215 (68%), Positives = 173/215 (80%), Gaps = 8/215 (3%)
 Frame = -2

Query: 622 LYAWLTFSPYERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVA 443
           LY  +   P   + LSSLGC +D+EGSWSIGV+YGDSPF+LKPIE  NVW++++ AWPVA
Sbjct: 66  LYCRVLLPPNVHTILSSLGCNEDNEGSWSIGVYYGDSPFNLKPIEEANVWRNKTAAWPVA 125

Query: 442 NPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQ 287
           NPVVTCASA+ A FPSNFVADPFLY+Q                 GDIGVARS D+GATW+
Sbjct: 126 NPVVTCASASGASFPSNFVADPFLYVQGDILYLFFEAKNSITMQGDIGVARSTDKGATWE 185

Query: 286 QLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLV 107
           QLG+ALDE+WHLSYPYVFDY+G IYMMPE S KG+LRLYRA+ FP +W LEKVI+K+PLV
Sbjct: 186 QLGVALDEDWHLSYPYVFDYNGNIYMMPEGSAKGDLRLYRAVKFPTEWKLEKVIMKKPLV 245

Query: 106 DSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2
           DSFI+ ++G YWLFGSDHSG GAKKNGQLEIWYS+
Sbjct: 246 DSFIIQHDGKYWLFGSDHSGIGAKKNGQLEIWYSS 280


>ref|XP_002532924.1| transferase, transferring glycosyl groups, putative [Ricinus
           communis] gi|223527317|gb|EEF29466.1| transferase,
           transferring glycosyl groups, putative [Ricinus
           communis]
          Length = 704

 Score =  318 bits (816), Expect = 1e-84
 Identities = 146/201 (72%), Positives = 169/201 (84%), Gaps = 8/201 (3%)
 Frame = -2

Query: 580 LSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGF 401
           L+S+GC+ D+EGSWSIGVFYG SPFSLKPIE+ NVWKD+S AWPVANPV+TCAS +DAGF
Sbjct: 16  LNSVGCRQDNEGSWSIGVFYGHSPFSLKPIETMNVWKDDSAAWPVANPVITCASVSDAGF 75

Query: 400 PSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSY 245
           PSNFVADPFLYIQ                 GDIGVA+S D+GATWQQLGIALDE+WHLSY
Sbjct: 76  PSNFVADPFLYIQGDIIYIFYETKNSITMQGDIGVAKSTDKGATWQQLGIALDEDWHLSY 135

Query: 244 PYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLF 65
           PYVFDY G+IYMMPE S KGELRLYRAI FPLQW LEK+++K+PLVDSF++ ++G +WLF
Sbjct: 136 PYVFDYLGEIYMMPEGSAKGELRLYRAINFPLQWTLEKILIKKPLVDSFVIKHDGEFWLF 195

Query: 64  GSDHSGFGAKKNGQLEIWYSN 2
           GSDHS FG KKNGQLEIW+S+
Sbjct: 196 GSDHSSFGTKKNGQLEIWHSS 216


>ref|XP_004142449.1| PREDICTED: uncharacterized protein LOC101212638 [Cucumis sativus]
           gi|449513220|ref|XP_004164265.1| PREDICTED:
           uncharacterized LOC101212638 [Cucumis sativus]
          Length = 783

 Score =  316 bits (809), Expect = 7e-84
 Identities = 150/250 (60%), Positives = 186/250 (74%), Gaps = 14/250 (5%)
 Frame = -2

Query: 709 CRWRWDHCFL---SXXXXXXXXXXXXXXXXXTLYAWLTFSP-YERS--SLSSLGCQDDSE 548
           C W+W    +   S                 TLYAWL F+P Y R+   +SSLGCQ+D+E
Sbjct: 46  CGWKWQQRHIRLVSSGFVFFFGCFVLFGSIATLYAWLAFTPQYVRTIGGVSSLGCQEDNE 105

Query: 547 GSWSIGVFYGDSPFSLKPIESRNVWKDESGAWPVANPVVTCASATDAGFPSNFVADPFLY 368
           GSWSIGVFYGDSPFSLKPIE  NVW++ES AWPVANPV+ CAS ++AGFPSNFVADPFL+
Sbjct: 106 GSWSIGVFYGDSPFSLKPIEDANVWRNESAAWPVANPVINCASVSNAGFPSNFVADPFLF 165

Query: 367 IQXXXXXXXXX--------GDIGVARSIDEGATWQQLGIALDEEWHLSYPYVFDYDGQIY 212
           +Q                 GDIGVA+S+D GATWQQLG+AL+E+WHLS+P+VF++ G+IY
Sbjct: 166 VQGDTIYLFYETKNSVSLQGDIGVAKSVDNGATWQQLGVALNEKWHLSFPFVFEHLGEIY 225

Query: 211 MMPESSQKGELRLYRAITFPLQWALEKVILKEPLVDSFILNYNGNYWLFGSDHSGFGAKK 32
           MMPESS+KGE+RLYRA+ FPL+W L+++ILK+PLVDS I+N+NG YWLFGSDH G G K+
Sbjct: 226 MMPESSKKGEVRLYRAVNFPLKWELDRIILKKPLVDSVIINHNGMYWLFGSDHRGLGTKR 285

Query: 31  NGQLEIWYSN 2
           NG L IWYS+
Sbjct: 286 NGHLAIWYSS 295


>ref|XP_002871098.1| glycosyltransferase family protein 47 [Arabidopsis lyrata subsp.
           lyrata] gi|297316935|gb|EFH47357.1| glycosyltransferase
           family protein 47 [Arabidopsis lyrata subsp. lyrata]
          Length = 765

 Score =  315 bits (807), Expect = 1e-83
 Identities = 147/218 (67%), Positives = 177/218 (81%), Gaps = 12/218 (5%)
 Frame = -2

Query: 619 YAWLTFSPY----ERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAW 452
           YAW  F P+    +  S SSLGC++D+EGSWSIGVFYGDSPFSLKPIE+ NVW++ESGAW
Sbjct: 58  YAWFVFPPHIGRTDHVSSSSLGCREDNEGSWSIGVFYGDSPFSLKPIETINVWRNESGAW 117

Query: 451 PVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGA 296
           PVANPV+TCAS T+AG PSNFVADPFLY+Q                 GDIGVA+SID+GA
Sbjct: 118 PVANPVITCASFTNAGLPSNFVADPFLYVQGDTLYLFFETKSPITMQGDIGVAKSIDKGA 177

Query: 295 TWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKE 116
           TW+ LGIALDE WHLS+P+VF+Y+G+I+MMPES++ G+L LYRA+ FPL W LEKVILK+
Sbjct: 178 TWEPLGIALDEAWHLSFPFVFNYNGEIFMMPESNEIGQLNLYRAVNFPLSWKLEKVILKK 237

Query: 115 PLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2
           PLVDS ++++ G YWLFGSDHS FGAKKNGQLEIWYS+
Sbjct: 238 PLVDSTLIHHEGIYWLFGSDHSSFGAKKNGQLEIWYSS 275


>ref|NP_196070.2| glycosyltransferase family protein 47 [Arabidopsis thaliana]
           gi|28393253|gb|AAO42055.1| unknown protein [Arabidopsis
           thaliana] gi|332003370|gb|AED90753.1|
           glycosyltransferase family protein 47 [Arabidopsis
           thaliana] gi|591401836|gb|AHL38645.1|
           glycosyltransferase, partial [Arabidopsis thaliana]
          Length = 765

 Score =  313 bits (803), Expect = 3e-83
 Identities = 145/218 (66%), Positives = 176/218 (80%), Gaps = 12/218 (5%)
 Frame = -2

Query: 619 YAWLTFSPY----ERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAW 452
           YAW  F P+    +  S SSLGC++D+EGSWSIGVFYGDSPFSLKPIE+RNVW++ESGAW
Sbjct: 58  YAWFVFPPHIGRTDHVSSSSLGCREDNEGSWSIGVFYGDSPFSLKPIETRNVWRNESGAW 117

Query: 451 PVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGA 296
           PV NPV+TCAS T++G PSNF+ADPFLY+Q                 GDIG A+SID+GA
Sbjct: 118 PVTNPVITCASFTNSGLPSNFLADPFLYVQGDTLYLFFETKSPITMQGDIGAAKSIDKGA 177

Query: 295 TWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKE 116
           TW+ LGIALDE WHLS+P+VF+Y+G+IYMMPES++ G+L LYRA+ FPL W LEKVILK+
Sbjct: 178 TWEPLGIALDEAWHLSFPFVFNYNGEIYMMPESNEIGQLNLYRAVNFPLSWKLEKVILKK 237

Query: 115 PLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2
           PLVDS I+++ G YWL GSDH+GFGAKKNGQLEIWYS+
Sbjct: 238 PLVDSTIVHHEGIYWLIGSDHTGFGAKKNGQLEIWYSS 275


>emb|CAB85556.1| putative protein [Arabidopsis thaliana]
          Length = 764

 Score =  313 bits (803), Expect = 3e-83
 Identities = 145/218 (66%), Positives = 176/218 (80%), Gaps = 12/218 (5%)
 Frame = -2

Query: 619 YAWLTFSPY----ERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAW 452
           YAW  F P+    +  S SSLGC++D+EGSWSIGVFYGDSPFSLKPIE+RNVW++ESGAW
Sbjct: 57  YAWFVFPPHIGRTDHVSSSSLGCREDNEGSWSIGVFYGDSPFSLKPIETRNVWRNESGAW 116

Query: 451 PVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGA 296
           PV NPV+TCAS T++G PSNF+ADPFLY+Q                 GDIG A+SID+GA
Sbjct: 117 PVTNPVITCASFTNSGLPSNFLADPFLYVQGDTLYLFFETKSPITMQGDIGAAKSIDKGA 176

Query: 295 TWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKE 116
           TW+ LGIALDE WHLS+P+VF+Y+G+IYMMPES++ G+L LYRA+ FPL W LEKVILK+
Sbjct: 177 TWEPLGIALDEAWHLSFPFVFNYNGEIYMMPESNEIGQLNLYRAVNFPLSWKLEKVILKK 236

Query: 115 PLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2
           PLVDS I+++ G YWL GSDH+GFGAKKNGQLEIWYS+
Sbjct: 237 PLVDSTIVHHEGIYWLIGSDHTGFGAKKNGQLEIWYSS 274


>ref|XP_006289714.1| hypothetical protein CARUB_v10003280mg [Capsella rubella]
           gi|482558420|gb|EOA22612.1| hypothetical protein
           CARUB_v10003280mg [Capsella rubella]
          Length = 762

 Score =  311 bits (798), Expect = 1e-82
 Identities = 147/218 (67%), Positives = 175/218 (80%), Gaps = 12/218 (5%)
 Frame = -2

Query: 619 YAWLTFSPY----ERSSLSSLGCQDDSEGSWSIGVFYGDSPFSLKPIESRNVWKDESGAW 452
           YAWL F P+    +  S SSLGC++D+EGSWSIGVFYGDSPFSLKPIES  VW++ESGAW
Sbjct: 55  YAWLAFPPHIGRTDHVSWSSLGCREDNEGSWSIGVFYGDSPFSLKPIESMKVWRNESGAW 114

Query: 451 PVANPVVTCASATDAGFPSNFVADPFLYIQXXXXXXXXX--------GDIGVARSIDEGA 296
           PV+NPV+TCAS T++G PSNFVADPFLY+Q                 GDIGVA+SID+GA
Sbjct: 115 PVSNPVLTCASLTNSGLPSNFVADPFLYVQGDTLYLFFETKNPITMQGDIGVAKSIDKGA 174

Query: 295 TWQQLGIALDEEWHLSYPYVFDYDGQIYMMPESSQKGELRLYRAITFPLQWALEKVILKE 116
           TW  LGIALDE WHLS+P+VF+Y+G+IYMMPES++ G+L LYRA+ FPL W LEKVI+K+
Sbjct: 175 TWIPLGIALDEAWHLSFPFVFNYNGEIYMMPESNELGQLNLYRALNFPLSWKLEKVIMKK 234

Query: 115 PLVDSFILNYNGNYWLFGSDHSGFGAKKNGQLEIWYSN 2
            LVDS I+++ G YWLFGSDHS FGAKKNGQLEIWYSN
Sbjct: 235 RLVDSTIIHHEGIYWLFGSDHSSFGAKKNGQLEIWYSN 272

