BLASTX nr result

ID: Akebia26_contig00025343 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00025343
         (675 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279041.1| PREDICTED: DUF246 domain-containing protein ...   195   1e-47
ref|XP_007204604.1| hypothetical protein PRUPE_ppa002708mg [Prun...   181   1e-43
ref|XP_004243713.1| PREDICTED: uncharacterized protein At1g04910...   174   2e-41
gb|EXB38940.1| hypothetical protein L484_027375 [Morus notabilis]     173   4e-41
ref|XP_006342369.1| PREDICTED: uncharacterized protein At1g04910...   171   2e-40
ref|XP_006342370.1| PREDICTED: uncharacterized protein At1g04910...   169   6e-40
ref|XP_004157938.1| PREDICTED: DUF246 domain-containing protein ...   164   3e-38
ref|XP_003550617.1| PREDICTED: uncharacterized protein At1g04910...   158   2e-36
ref|XP_006395968.1| hypothetical protein EUTSA_v10003786mg [Eutr...   157   3e-36
ref|XP_006395967.1| hypothetical protein EUTSA_v10003786mg [Eutr...   157   3e-36
ref|XP_007012658.1| O-fucosyltransferase family protein isoform ...   156   7e-36
ref|XP_007012657.1| O-fucosyltransferase family protein isoform ...   156   7e-36
ref|XP_007012656.1| O-fucosyltransferase family protein isoform ...   156   7e-36
emb|CAN67382.1| hypothetical protein VITISV_017920 [Vitis vinifera]   153   4e-35
ref|XP_003542359.1| PREDICTED: uncharacterized protein At1g04910...   153   6e-35
ref|XP_002870435.1| hypothetical protein ARALYDRAFT_493618 [Arab...   152   9e-35
ref|XP_006283281.1| hypothetical protein CARUB_v10004322mg [Caps...   151   2e-34
ref|XP_006381630.1| hypothetical protein POPTR_0006s14490g [Popu...   148   2e-33
ref|XP_007154587.1| hypothetical protein PHAVU_003G131300g [Phas...   147   2e-33
ref|NP_568528.2| O-fucosyltransferase family protein [Arabidopsi...   146   5e-33

>ref|XP_002279041.1| PREDICTED: DUF246 domain-containing protein At1g04910 [Vitis
           vinifera] gi|297738571|emb|CBI27816.3| unnamed protein
           product [Vitis vinifera]
          Length = 634

 Score =  195 bits (495), Expect = 1e-47
 Identities = 108/217 (49%), Positives = 134/217 (61%), Gaps = 24/217 (11%)
 Frame = +2

Query: 95  HSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNSNSNQNT------------HHEIDLQLN 238
           H N  DG+SQR+NSPRFSGPMTRR+ S KR NS+ N +             H+EID+ LN
Sbjct: 4   HHNASDGVSQRVNSPRFSGPMTRRAHSFKRGNSSGNAHNNGSSKGGGGFDPHYEIDVHLN 63

Query: 239 SPGPETPNNTVLIDGFELISEKKQTNLPNQRVH----------HVGSVAVPLFGKNIREK 388
           SP  E   + V  DGF+++ E+KQT+  NQRVH          HVGS  + L    +RE+
Sbjct: 64  SPRSEICGSPVSGDGFDVVLERKQTHHVNQRVHGGVLKNQPKKHVGSAVLDL---GLRER 120

Query: 389 KILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYIT--DPTVTSSHEN 562
           K LG W+F VFCG CLFLGVLKICA GWFGSA++R   +QD S    T  +    SSH+ 
Sbjct: 121 KKLGHWMFFVFCGVCLFLGVLKICATGWFGSAIDRIGSHQDFSDPLNTHLNEMDKSSHDY 180

Query: 563 GRIESGSDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
              E GSDVERTL MV  G ++ Q ++ E S IWSKP
Sbjct: 181 VYREGGSDVERTLMMVASGVVNRQKSMAENSDIWSKP 217


>ref|XP_007204604.1| hypothetical protein PRUPE_ppa002708mg [Prunus persica]
           gi|462400135|gb|EMJ05803.1| hypothetical protein
           PRUPE_ppa002708mg [Prunus persica]
          Length = 642

 Score =  181 bits (460), Expect = 1e-43
 Identities = 107/228 (46%), Positives = 135/228 (59%), Gaps = 29/228 (12%)
 Frame = +2

Query: 77  MGLQQQHSNNCDGLSQRINSPRFSGPMTRRSQSLKRN------------NSNSNQNT--- 211
           MG      N  DG+SQR+NSPRFSGPMTRR+ S KRN            NSNSN ++   
Sbjct: 1   MGHHLHLHNTSDGVSQRVNSPRFSGPMTRRAHSFKRNPNTSANNGSSHGNSNSNNSSGSV 60

Query: 212 -----HHEIDLQLNSPGPETPNNTVLIDGFELISEKKQTNLPNQRV-------HHVGSVA 355
                 +EIDL LNSP  E   N+V  DGF+ + E+KQT+  +QRV         +GSV 
Sbjct: 61  GFGSGEYEIDLPLNSPRSEIGGNSVPGDGFDSVLERKQTHHVSQRVAVRGFLRKPIGSVV 120

Query: 356 VPLFGKNIREKKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTY-IT 532
           V L    +REKK LG W+F  FCG CLFLG+LKICA GWFGSA+E +   QD S    + 
Sbjct: 121 VDL---GLREKKQLGHWMFFAFCGVCLFLGILKICATGWFGSAIESSRSNQDGSDPITLM 177

Query: 533 DPTVTSSHENGRIESGSDVERTLKMVE-LGTISSQNNVIEYSGIWSKP 673
           +    SSH+ G  + GSDVERTL M   +  +  + N +EY+GIWS+P
Sbjct: 178 NRMDQSSHDYGHRDGGSDVERTLMMASGVNRVVGEENSVEYTGIWSRP 225


>ref|XP_004243713.1| PREDICTED: uncharacterized protein At1g04910-like [Solanum
           lycopersicum]
          Length = 646

 Score =  174 bits (441), Expect = 2e-41
 Identities = 102/220 (46%), Positives = 133/220 (60%), Gaps = 27/220 (12%)
 Frame = +2

Query: 95  HSNNCDGLSQRINSPRFSGPMTRRSQSLKR-NNSNSNQ--------------NTHHEIDL 229
           HS   DG+ QR+NSPRFSGPMTRR+ S KR NN+N N               NTHHEID+
Sbjct: 13  HSTATDGVPQRVNSPRFSGPMTRRAHSFKRTNNTNQNAQNTGGGSSNSTATLNTHHEIDV 72

Query: 230 QLNSPGPETPNNTVLIDGFELISEKKQTNLPN--QRVH---HVGSVAVPL-FGKNIREKK 391
            LNSP  ET  N  + D +E++ EKK T+L N  QRVH    + S+ V   FG  ++ +K
Sbjct: 73  PLNSPRSET--NANIADEYEILGEKKHTHLSNVIQRVHLRKKLESLTVDFGFGLELKGRK 130

Query: 392 ILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYIT---DPTVTSSHEN 562
            LG W+FLVFCG CLF+GVLK CA GWFGSA+ER    QD   + ++     T T  H +
Sbjct: 131 KLGHWMFLVFCGFCLFMGVLKFCAYGWFGSAIERVAYSQDSYDSLVSLRDQSTHTYRHMD 190

Query: 563 GRIESGSD---VERTLKMVELGTISSQNNVIEYSGIWSKP 673
           G  +   +   +E+TL MV  G + +QNN+++YS IW  P
Sbjct: 191 GDTKHSGERNHLEQTLSMVASGVVGNQNNMLDYSEIWLHP 230


>gb|EXB38940.1| hypothetical protein L484_027375 [Morus notabilis]
          Length = 641

 Score =  173 bits (439), Expect = 4e-41
 Identities = 104/219 (47%), Positives = 134/219 (61%), Gaps = 25/219 (11%)
 Frame = +2

Query: 92  QHSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNSNSNQN--------------------T 211
           QHS + DG+SQR+NSPRFSGPMTRR+ S KRN ++S+Q+                     
Sbjct: 12  QHSPS-DGVSQRVNSPRFSGPMTRRAHSFKRNANSSSQSGTNTGNNGGGGGGNNGSGLSP 70

Query: 212 HHEIDLQLNSPGPETPNNTVLIDGFELISEKKQTNLPNQRVHHVGSVAVPLFGKNIREKK 391
           HHEI+LQLNSP  E   N   +DGF+ + E++      +++   GSV V L    +REKK
Sbjct: 71  HHEIELQLNSPRSEIGGNLSSVDGFDSVLERRHRFALRKKI---GSVVVDL---GLREKK 124

Query: 392 ILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQD----LSKTYITDPTVTSSHE 559
            LG W+FLVFCG CLFLGVLKICA GWFGSA+ERA   +D    +S   + D   +S   
Sbjct: 125 KLGHWMFLVFCGLCLFLGVLKICATGWFGSAIERASSDRDSTDPMSGLLVMDQ--SSKDY 182

Query: 560 NGRIESGSDVERTLKMVELGT-ISSQNNVIEYSGIWSKP 673
             R + G+DVERTL MV  G  + +Q +  EYSGIWS+P
Sbjct: 183 VYREKKGTDVERTLMMVSTGVRVDNQKSKDEYSGIWSRP 221


>ref|XP_006342369.1| PREDICTED: uncharacterized protein At1g04910-like isoform X1
           [Solanum tuberosum]
          Length = 648

 Score =  171 bits (433), Expect = 2e-40
 Identities = 101/222 (45%), Positives = 135/222 (60%), Gaps = 29/222 (13%)
 Frame = +2

Query: 95  HSNNCDGLSQRINSPRFSGPMTRRSQSLKR-NNSNSNQ-------------NTHHEIDLQ 232
           HS   DG+ QR+NSPRFSGPMTRR+ S KR NN+N N              NTHHEID+ 
Sbjct: 13  HSTATDGVPQRVNSPRFSGPMTRRAHSFKRTNNTNQNAQNTGSSSSSTASLNTHHEIDVP 72

Query: 233 LNSPGPETPNNTVLIDGFELISEKKQTNLPN--QRVH---HVGSVAVPL-FGKNIREKKI 394
           LNSP  ET  N  + D +E++ EKK T+L N  QRVH    + S+ V   FG  ++ +K 
Sbjct: 73  LNSPRSET--NANIADEYEILGEKKHTHLSNVIQRVHLRKKLESLTVDFGFGLELKGRKK 130

Query: 395 LGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYITDPTV--TSSHENGR 568
           LG W+FLVFCG CLF+GVLK CA GWFGSA+ER    QD   + I+  ++   S+H    
Sbjct: 131 LGHWMFLVFCGFCLFIGVLKFCAYGWFGSAIERVAYSQDSYDSLISQLSLRDQSTHAYRH 190

Query: 569 IESG-------SDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           +E         + +E+TL MV  G + +QN+++++S IW KP
Sbjct: 191 MEGDTKHSGERNHLEQTLSMVASGVVGNQNSMLDFSEIWLKP 232


>ref|XP_006342370.1| PREDICTED: uncharacterized protein At1g04910-like isoform X2
           [Solanum tuberosum]
          Length = 643

 Score =  169 bits (429), Expect = 6e-40
 Identities = 100/217 (46%), Positives = 132/217 (60%), Gaps = 24/217 (11%)
 Frame = +2

Query: 95  HSNNCDGLSQRINSPRFSGPMTRRSQSLKR-NNSNSNQ-------------NTHHEIDLQ 232
           HS   DG+ QR+NSPRFSGPMTRR+ S KR NN+N N              NTHHEID+ 
Sbjct: 13  HSTATDGVPQRVNSPRFSGPMTRRAHSFKRTNNTNQNAQNTGSSSSSTASLNTHHEIDVP 72

Query: 233 LNSPGPETPNNTVLIDGFELISEKKQTNLPN--QRVH---HVGSVAVPL-FGKNIREKKI 394
           LNSP  ET  N  + D +E++ EKK T+L N  QRVH    + S+ V   FG  ++ +K 
Sbjct: 73  LNSPRSET--NANIADEYEILGEKKHTHLSNVIQRVHLRKKLESLTVDFGFGLELKGRKK 130

Query: 395 LGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYITD-PTVTSSHENGRI 571
           LG W+FLVFCG CLF+GVLK CA GWFGSA+ER      +S+  + D  T    H  G  
Sbjct: 131 LGHWMFLVFCGFCLFIGVLKFCAYGWFGSAIERDSYDSLISQLSLRDQSTHAYRHMEGDT 190

Query: 572 ESGSD---VERTLKMVELGTISSQNNVIEYSGIWSKP 673
           +   +   +E+TL MV  G + +QN+++++S IW KP
Sbjct: 191 KHSGERNHLEQTLSMVASGVVGNQNSMLDFSEIWLKP 227


>ref|XP_004157938.1| PREDICTED: DUF246 domain-containing protein At1g04910-like [Cucumis
           sativus]
          Length = 638

 Score =  164 bits (414), Expect = 3e-38
 Identities = 98/222 (44%), Positives = 128/222 (57%), Gaps = 27/222 (12%)
 Frame = +2

Query: 89  QQHSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNSNSNQNT------------------H 214
           Q+H N  DG+SQR+NSPRFSGP+TRR+ S KRNN+N+N N+                  H
Sbjct: 4   QRHHNGNDGVSQRVNSPRFSGPITRRAHSFKRNNNNNNNNSDTHSNTNSNILNNNGLSSH 63

Query: 215 HEIDLQLNSPGPETPNNTVLIDGFELISEKKQTNLPNQRVHHVGSVAV-----PLFGK-- 373
           HEIDL  NSP  E   +TV +DGFE   E+K     +QR+H  G VA      P F    
Sbjct: 64  HEIDLPANSPRSEAFRSTVQVDGFESALERKTAPHVSQRIH--GGVAAKSSLNPGFVSLD 121

Query: 374 -NIREKKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYITDPTVTS 550
             +REK+ LG  +F+VFCG CLFLG+LKIC NGWFGS +E    + D   +  +   V  
Sbjct: 122 FRLREKRKLGHLMFMVFCGLCLFLGILKICMNGWFGSVIETNESHHDTPDSITSRNQVDH 181

Query: 551 SHENGRIESG-SDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           + +N +   G +  ERTL M+E   + SQN  +E+S IW KP
Sbjct: 182 NSDNIKHREGETSFERTL-MMESSVVGSQNG-MEHSEIWMKP 221


>ref|XP_003550617.1| PREDICTED: uncharacterized protein At1g04910-like [Glycine max]
          Length = 628

 Score =  158 bits (399), Expect = 2e-36
 Identities = 98/213 (46%), Positives = 120/213 (56%), Gaps = 20/213 (9%)
 Frame = +2

Query: 95  HSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNSNSNQN-----THH---------EIDLQ 232
           H N  DG+SQR+NSPRFSGPMTRR+ S KRNNS++N N     T H         EI+LQ
Sbjct: 10  HHNTSDGVSQRVNSPRFSGPMTRRAHSFKRNNSSNNSNNTATTTSHGGGGGSGGVEIELQ 69

Query: 233 LNSPGPETPNNTVLIDGFELISEKKQTNLPNQRVHHVGSVAVPLFG----KNIREKKILG 400
           +NSP  E  +  V +        K   +   QRVH  G +  PL        +RE+K +G
Sbjct: 70  INSPRSEEASEGVPVG-------KHSHHHVTQRVHVRGLLKKPLASIVEDLGLRERKKIG 122

Query: 401 RWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYITDPTVTSSHENGRIESG 580
            W+FLVFCG CLF+GVLKICA GW GSA+E     ++LS + I   T+      G    G
Sbjct: 123 HWMFLVFCGVCLFMGVLKICATGWLGSAIEITQSNKELSDS-IPSLTLMDKSSLGYAYRG 181

Query: 581 --SDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
             SDVERTLK V  G   S   + E SGIWSKP
Sbjct: 182 GASDVERTLKTVATGVDGSHTAMTEDSGIWSKP 214


>ref|XP_006395968.1| hypothetical protein EUTSA_v10003786mg [Eutrema salsugineum]
           gi|557092607|gb|ESQ33254.1| hypothetical protein
           EUTSA_v10003786mg [Eutrema salsugineum]
          Length = 654

 Score =  157 bits (397), Expect = 3e-36
 Identities = 99/237 (41%), Positives = 126/237 (53%), Gaps = 38/237 (16%)
 Frame = +2

Query: 77  MGLQQQHSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNS--NSNQNTH------------ 214
           MG    H +  DG+ Q +NSPRFSGPMTRR+QS KR  S  +S+ NTH            
Sbjct: 1   MGHHLHHQDGGDGVPQHVNSPRFSGPMTRRAQSFKRGGSGGSSSNNTHAGGSISAGDNST 60

Query: 215 ----------HEIDLQLNSPGPETPNNTVL--IDGFELISEKKQTNLPNQRVHHV----- 343
                     HEIDLQLNSP  E  + + L     FE    +K       R   V     
Sbjct: 61  GTNHSTLRVHHEIDLQLNSPRSEIASGSGLDPSSAFESAINRKHQTYGQLRERVVKGLLR 120

Query: 344 ---GSVAVPLFGKNIREKKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDL 514
              GSV   L   ++RE+K LG W+F  FCG CLF+GVLKICA GW GSA++ A   QDL
Sbjct: 121 KPMGSVVSEL---SLRERKKLGHWMFFAFCGVCLFMGVLKICATGWLGSAIDGAASDQDL 177

Query: 515 SKTYITDPTVT----SSHENGRIESGSDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           S +    P V     SSH+    + G+ ++ TL MV  G +  QN+V+EYSG+W+KP
Sbjct: 178 SDSI---PRVNLLDHSSHDYIYKDGGNGIDPTLAMVASGVVGDQNSVVEYSGVWAKP 231


>ref|XP_006395967.1| hypothetical protein EUTSA_v10003786mg [Eutrema salsugineum]
           gi|557092606|gb|ESQ33253.1| hypothetical protein
           EUTSA_v10003786mg [Eutrema salsugineum]
          Length = 460

 Score =  157 bits (397), Expect = 3e-36
 Identities = 99/237 (41%), Positives = 126/237 (53%), Gaps = 38/237 (16%)
 Frame = +2

Query: 77  MGLQQQHSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNS--NSNQNTH------------ 214
           MG    H +  DG+ Q +NSPRFSGPMTRR+QS KR  S  +S+ NTH            
Sbjct: 1   MGHHLHHQDGGDGVPQHVNSPRFSGPMTRRAQSFKRGGSGGSSSNNTHAGGSISAGDNST 60

Query: 215 ----------HEIDLQLNSPGPETPNNTVL--IDGFELISEKKQTNLPNQRVHHV----- 343
                     HEIDLQLNSP  E  + + L     FE    +K       R   V     
Sbjct: 61  GTNHSTLRVHHEIDLQLNSPRSEIASGSGLDPSSAFESAINRKHQTYGQLRERVVKGLLR 120

Query: 344 ---GSVAVPLFGKNIREKKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDL 514
              GSV   L   ++RE+K LG W+F  FCG CLF+GVLKICA GW GSA++ A   QDL
Sbjct: 121 KPMGSVVSEL---SLRERKKLGHWMFFAFCGVCLFMGVLKICATGWLGSAIDGAASDQDL 177

Query: 515 SKTYITDPTVT----SSHENGRIESGSDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           S +    P V     SSH+    + G+ ++ TL MV  G +  QN+V+EYSG+W+KP
Sbjct: 178 SDSI---PRVNLLDHSSHDYIYKDGGNGIDPTLAMVASGVVGDQNSVVEYSGVWAKP 231


>ref|XP_007012658.1| O-fucosyltransferase family protein isoform 3 [Theobroma cacao]
           gi|508783021|gb|EOY30277.1| O-fucosyltransferase family
           protein isoform 3 [Theobroma cacao]
          Length = 677

 Score =  156 bits (394), Expect = 7e-36
 Identities = 101/221 (45%), Positives = 122/221 (55%), Gaps = 25/221 (11%)
 Frame = +2

Query: 86  QQQHSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNSNS---------------------- 199
           Q  H N  DG+SQR+NSPRFSGPMTRR+ S KR N NS                      
Sbjct: 6   QHHHHNTSDGVSQRVNSPRFSGPMTRRASSFKRGNGNSQTTNSNNALGSGNGNNNGSNGN 65

Query: 200 NQNTHHEIDLQLNSPGPET-PNNTVLIDGFELISEKKQTNLPNQRVHHVGSVAVPLFGKN 376
           N + HHEIDL +NSP  ET    +V IDG   +S+++       R   VGS+ +  FG  
Sbjct: 66  NLSVHHEIDLPINSPRSETGAAGSVSIDG---LSQRRGF----LRKPSVGSMVLD-FG-- 115

Query: 377 IREKKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYITDP--TVTS 550
           ++E+K LG W+FLVFCG CLFLGV KICA GWFGSA+E     Q LS   I  P      
Sbjct: 116 LKERKKLGHWMFLVFCGVCLFLGVFKICATGWFGSAIETVTSNQGLSDISINRPKRIDQG 175

Query: 551 SHENGRIESGSDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           SH+ G  E GSD +RTL  V        ++V E SGIWS P
Sbjct: 176 SHDYGYREEGSDSDRTLMTV-------PSDVTEDSGIWSLP 209


>ref|XP_007012657.1| O-fucosyltransferase family protein isoform 2 [Theobroma cacao]
           gi|508783020|gb|EOY30276.1| O-fucosyltransferase family
           protein isoform 2 [Theobroma cacao]
          Length = 564

 Score =  156 bits (394), Expect = 7e-36
 Identities = 101/221 (45%), Positives = 122/221 (55%), Gaps = 25/221 (11%)
 Frame = +2

Query: 86  QQQHSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNSNS---------------------- 199
           Q  H N  DG+SQR+NSPRFSGPMTRR+ S KR N NS                      
Sbjct: 6   QHHHHNTSDGVSQRVNSPRFSGPMTRRASSFKRGNGNSQTTNSNNALGSGNGNNNGSNGN 65

Query: 200 NQNTHHEIDLQLNSPGPET-PNNTVLIDGFELISEKKQTNLPNQRVHHVGSVAVPLFGKN 376
           N + HHEIDL +NSP  ET    +V IDG   +S+++       R   VGS+ +  FG  
Sbjct: 66  NLSVHHEIDLPINSPRSETGAAGSVSIDG---LSQRRGF----LRKPSVGSMVLD-FG-- 115

Query: 377 IREKKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYITDP--TVTS 550
           ++E+K LG W+FLVFCG CLFLGV KICA GWFGSA+E     Q LS   I  P      
Sbjct: 116 LKERKKLGHWMFLVFCGVCLFLGVFKICATGWFGSAIETVTSNQGLSDISINRPKRIDQG 175

Query: 551 SHENGRIESGSDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           SH+ G  E GSD +RTL  V        ++V E SGIWS P
Sbjct: 176 SHDYGYREEGSDSDRTLMTV-------PSDVTEDSGIWSLP 209


>ref|XP_007012656.1| O-fucosyltransferase family protein isoform 1 [Theobroma cacao]
           gi|508783019|gb|EOY30275.1| O-fucosyltransferase family
           protein isoform 1 [Theobroma cacao]
          Length = 626

 Score =  156 bits (394), Expect = 7e-36
 Identities = 101/221 (45%), Positives = 122/221 (55%), Gaps = 25/221 (11%)
 Frame = +2

Query: 86  QQQHSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNSNS---------------------- 199
           Q  H N  DG+SQR+NSPRFSGPMTRR+ S KR N NS                      
Sbjct: 6   QHHHHNTSDGVSQRVNSPRFSGPMTRRASSFKRGNGNSQTTNSNNALGSGNGNNNGSNGN 65

Query: 200 NQNTHHEIDLQLNSPGPET-PNNTVLIDGFELISEKKQTNLPNQRVHHVGSVAVPLFGKN 376
           N + HHEIDL +NSP  ET    +V IDG   +S+++       R   VGS+ +  FG  
Sbjct: 66  NLSVHHEIDLPINSPRSETGAAGSVSIDG---LSQRRGF----LRKPSVGSMVLD-FG-- 115

Query: 377 IREKKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYITDP--TVTS 550
           ++E+K LG W+FLVFCG CLFLGV KICA GWFGSA+E     Q LS   I  P      
Sbjct: 116 LKERKKLGHWMFLVFCGVCLFLGVFKICATGWFGSAIETVTSNQGLSDISINRPKRIDQG 175

Query: 551 SHENGRIESGSDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           SH+ G  E GSD +RTL  V        ++V E SGIWS P
Sbjct: 176 SHDYGYREEGSDSDRTLMTV-------PSDVTEDSGIWSLP 209


>emb|CAN67382.1| hypothetical protein VITISV_017920 [Vitis vinifera]
          Length = 514

 Score =  153 bits (387), Expect = 4e-35
 Identities = 89/192 (46%), Positives = 113/192 (58%), Gaps = 24/192 (12%)
 Frame = +2

Query: 155 MTRRSQSLKRNNSNSNQNT------------HHEIDLQLNSPGPETPNNTVLIDGFELIS 298
           MTRR+ S KR NS+ N +             H+EID+ LNSP  E   + V  DGF+++ 
Sbjct: 1   MTRRAHSFKRGNSSGNAHNNGSSKGGGGFDPHYEIDVHLNSPRSEICGSPVSGDGFDVVL 60

Query: 299 EKKQTNLPNQRVH----------HVGSVAVPLFGKNIREKKILGRWLFLVFCGACLFLGV 448
           E+KQT+  NQRVH          HVGS  + L    +RE+K LG W+F VFCG CLFLGV
Sbjct: 61  ERKQTHHVNQRVHGGVLKNQPKKHVGSAVLDL---GLRERKKLGHWMFFVFCGVCLFLGV 117

Query: 449 LKICANGWFGSAVERAWPYQDLSKTYIT--DPTVTSSHENGRIESGSDVERTLKMVELGT 622
           LKICA GWFGSA++R   +QD S    T  +    SSH+    E GSDVERTL MV  G 
Sbjct: 118 LKICATGWFGSAIDRIGSHQDFSDPLNTHLNEMDKSSHDYVYREGGSDVERTLMMVASGV 177

Query: 623 ISSQNNVIEYSG 658
           ++ Q ++ E SG
Sbjct: 178 VNRQKSMAEISG 189


>ref|XP_003542359.1| PREDICTED: uncharacterized protein At1g04910-like [Glycine max]
          Length = 626

 Score =  153 bits (386), Expect = 6e-35
 Identities = 94/211 (44%), Positives = 117/211 (55%), Gaps = 18/211 (8%)
 Frame = +2

Query: 95  HSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNSNSNQNTHH-------------EIDLQL 235
           H N  DG+SQR+NSPRFSGPMTRR+ S KRNN+N   NT               E++LQ+
Sbjct: 10  HHNTSDGVSQRVNSPRFSGPMTRRAHSFKRNNNNIAANTAATTSHGGAGGSGAGEVELQI 69

Query: 236 NSPGPETPNNTVLIDGFELISEKKQTNLPNQRVHHVGSVAVPLFG----KNIREKKILGR 403
           NSP  E  +  V +        K   +   QRVH  G +  PL        +RE+K +G 
Sbjct: 70  NSPRSEEASEGVPVG-------KHSHHHVTQRVHVRGLLKKPLASIVEDLGLRERKKIGH 122

Query: 404 WLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYITDPTVTSSHENGRIESG- 580
           W+FLVFCG CLF+GVLKICA GW GSA+ER    ++LS +  +   +  S        G 
Sbjct: 123 WMFLVFCGVCLFMGVLKICATGWLGSAIERTQSNKELSDSIASLNLMDKSSLGYAYRGGA 182

Query: 581 SDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           SDVERTLK V  G   S   + E SGIWSKP
Sbjct: 183 SDVERTLKTVATGD-GSHTAMTEDSGIWSKP 212


>ref|XP_002870435.1| hypothetical protein ARALYDRAFT_493618 [Arabidopsis lyrata subsp.
           lyrata] gi|297316271|gb|EFH46694.1| hypothetical protein
           ARALYDRAFT_493618 [Arabidopsis lyrata subsp. lyrata]
          Length = 653

 Score =  152 bits (384), Expect = 9e-35
 Identities = 94/220 (42%), Positives = 121/220 (55%), Gaps = 27/220 (12%)
 Frame = +2

Query: 95  HSNNCDGLSQR-INSPRFSGPMTRRSQSLKRNNSN-SNQNTH-------------HEIDL 229
           H +  DG+ Q  +NSPRFSGPMTRR+QS KR  S  S+ NTH             HEIDL
Sbjct: 4   HHDGGDGVPQHHVNSPRFSGPMTRRAQSFKRGGSGGSSSNTHVGDGNNTSTLRVHHEIDL 63

Query: 230 QLNSPGPETPNNTVLID---GFELISEKKQTNLPNQRVHHVGSVAVPLFGK-----NIRE 385
            LNSP  E  + +   D   GF+    +K       R   V  +     G      ++RE
Sbjct: 64  PLNSPRSEIVSGSSGSDPSGGFDSALNRKHQTYGQLRERVVKGLLRKPMGSVVSDFSLRE 123

Query: 386 KKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTYITDPTVT----SS 553
           +K LG W+F  FCG CLFLGV KICA GW GSA++ A  +QDLS +    P V     SS
Sbjct: 124 RKKLGHWMFFAFCGVCLFLGVFKICATGWLGSAIDGAASHQDLSNSI---PRVNLLDHSS 180

Query: 554 HENGRIESGSDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           H+    + G+DV+ TL MV    +  QN+V+EYSG+W+KP
Sbjct: 181 HDYIYKDGGNDVDPTLVMVASDVVGDQNSVVEYSGVWAKP 220


>ref|XP_006283281.1| hypothetical protein CARUB_v10004322mg [Capsella rubella]
           gi|482551986|gb|EOA16179.1| hypothetical protein
           CARUB_v10004322mg [Capsella rubella]
          Length = 659

 Score =  151 bits (381), Expect = 2e-34
 Identities = 94/239 (39%), Positives = 122/239 (51%), Gaps = 40/239 (16%)
 Frame = +2

Query: 77  MGLQQQHSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNS-------------------NS 199
           MG    H +  DG+ Q +NSPRFSGPMTRR+QS KR  S                   N+
Sbjct: 1   MGHHLHHHDGGDGVPQHVNSPRFSGPMTRRAQSFKRGGSGGGGTSSNSHVGVSDNIGINN 60

Query: 200 NQNT---------HHEIDLQLNSPGPETPNNTVLID---GFELISEKKQTNLPNQRVHHV 343
           N NT         HHEIDL LNSP  E  +     D   GF+    +K       R   V
Sbjct: 61  NNNTSSSSSTLRVHHEIDLPLNSPRSEIVSGGSGSDPSGGFDSAVNRKHQTYGQLRERVV 120

Query: 344 GSVAVPLFGK-----NIREKKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQ 508
             +     G      +++E+K LG W+F  FCG CLF+GV KICA GW GSA++ A   Q
Sbjct: 121 KGLLRKPMGSVVSDFSLKERKKLGHWMFFAFCGVCLFMGVFKICATGWLGSAIDSAASDQ 180

Query: 509 DLSKTYITDPTVT----SSHENGRIESGSDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           DLS +    P V     SSH+    + G+DV+ TL MV    +  QN+V+EY+G+W+KP
Sbjct: 181 DLSNSI---PRVNLLDHSSHDYIYKDGGNDVDPTLVMVASDVVGDQNSVVEYTGVWAKP 236


>ref|XP_006381630.1| hypothetical protein POPTR_0006s14490g [Populus trichocarpa]
           gi|550336338|gb|ERP59427.1| hypothetical protein
           POPTR_0006s14490g [Populus trichocarpa]
          Length = 648

 Score =  148 bits (373), Expect = 2e-33
 Identities = 94/233 (40%), Positives = 126/233 (54%), Gaps = 40/233 (17%)
 Frame = +2

Query: 95  HSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNSNSNQNT--------------------- 211
           H++  DG+SQR+NSPRFSGPMTRR+ S KRNN++SN N+                     
Sbjct: 10  HNSASDGVSQRVNSPRFSGPMTRRAHSFKRNNTSSNNNSNAGNANSSNNGSNNVSNGNSN 69

Query: 212 ------HHEIDLQLNSPGPETPNNTVLIDGFELISEKKQTNLPNQRVH---------HVG 346
                 H EIDL LNSP  ET      +DGFE  S  +Q NL +QRVH           G
Sbjct: 70  NSILSPHLEIDLPLNSPRSET------VDGFERESHSRQ-NL-SQRVHGGVVRILTNKKG 121

Query: 347 SVAVPLFGKNIREKKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLSKTY 526
           S+   +     +E+K LG W+F  FCG CLFLGV KIC  GWFGS +ERA   Q    T+
Sbjct: 122 SIGSVILDFGFKERKKLGHWMFFFFCGLCLFLGVFKICLYGWFGSTLERAASNQ---VTH 178

Query: 527 ITD--PTVTSSHENGRIESGSDVERTLKMVELGT--ISSQNNVIEYSGIWSKP 673
           + D   ++T   ++     GS+ ++   ++E+G+  +   N   E+SGIWSKP
Sbjct: 179 LIDVFGSITRQEQDSYRYMGSENDQKRMIIEVGSDVVDRLNKKAEFSGIWSKP 231


>ref|XP_007154587.1| hypothetical protein PHAVU_003G131300g [Phaseolus vulgaris]
           gi|561027941|gb|ESW26581.1| hypothetical protein
           PHAVU_003G131300g [Phaseolus vulgaris]
          Length = 617

 Score =  147 bits (372), Expect = 2e-33
 Identities = 87/199 (43%), Positives = 111/199 (55%), Gaps = 6/199 (3%)
 Frame = +2

Query: 95  HSNNCDGLSQRINSPRFSGPMTRRSQSLKRNNSNSNQNTHH-EIDLQLNSPGPETPNNTV 271
           H N  DG+SQR+NSPRFSGPMTRR+ S KRN   +N N    E++LQ+NSP  E      
Sbjct: 10  HHNTSDGVSQRVNSPRFSGPMTRRAHSFKRNTDGTNSNGGSGEVELQINSPRSEEA---- 65

Query: 272 LIDGFELISEKKQTNLPNQRVHHVGSVAVPLFG----KNIREKKILGRWLFLVFCGACLF 439
            ++G  +       N   QRVH    +  PL         RE+K +G  +FLVFCG C+F
Sbjct: 66  -LEGIPVGRHSHNHNHVTQRVHVRSLLKKPLASIVEDLGFRERKKIGHLMFLVFCGVCIF 124

Query: 440 LGVLKICANGWFGSAVERAWPYQDLSKTYITDPTVTSSHENGRIESG-SDVERTLKMVEL 616
           +GVLKICA GW GSA+ERA   ++L  +  +   +  S        G SDVERTLK +  
Sbjct: 125 IGVLKICATGWLGSAIERAQSDKELPDSIASLNLMDKSSLGYAYRGGASDVERTLKTLAT 184

Query: 617 GTISSQNNVIEYSGIWSKP 673
           G   S   + E SG WSKP
Sbjct: 185 GVGDSHTAMAEDSGTWSKP 203


>ref|NP_568528.2| O-fucosyltransferase family protein [Arabidopsis thaliana]
           gi|14517444|gb|AAK62612.1| AT5g35570/K2K18_1
           [Arabidopsis thaliana] gi|21360449|gb|AAM47340.1|
           AT5g35570/K2K18_1 [Arabidopsis thaliana]
           gi|332006599|gb|AED93982.1| O-fucosyltransferase family
           protein [Arabidopsis thaliana]
          Length = 652

 Score =  146 bits (369), Expect = 5e-33
 Identities = 93/229 (40%), Positives = 121/229 (52%), Gaps = 36/229 (15%)
 Frame = +2

Query: 95  HSNNCDGLSQR-INSPRFSGPMTRRSQSLKRNNS-------------------NSNQNT- 211
           H +  DG+ Q  +NSPRFSGPMTRR+QS KR  S                   N+N NT 
Sbjct: 4   HHDGGDGVPQHHVNSPRFSGPMTRRAQSFKRGGSAGSSSNNNNTHVGVSGGDGNNNNNTS 63

Query: 212 -----HHEIDLQLNSPGPETPNNTVLID---GFELISEKKQTNLPNQRVHHVGSVAVPLF 367
                HHEIDL LNSP  E  + +   D   GF+    +K       R   V  +     
Sbjct: 64  STLRVHHEIDLPLNSPRSEIVSGSSGSDPSGGFDSALNRKHQTYGQLRERVVKGLLRKPM 123

Query: 368 GK-----NIREKKILGRWLFLVFCGACLFLGVLKICANGWFGSAVERAWPYQDLS--KTY 526
           G      ++RE+K LG W+F  FCG CLFLGV KICA GW GSA++ A   QDLS  +  
Sbjct: 124 GSVVSDFSLRERKKLGHWMFFAFCGVCLFLGVFKICATGWLGSAIDGAASDQDLSIPRVN 183

Query: 527 ITDPTVTSSHENGRIESGSDVERTLKMVELGTISSQNNVIEYSGIWSKP 673
           + D    SSH+    + G+DV+ TL MV    +  QN+V+E+SG+W+KP
Sbjct: 184 LLD---HSSHDYIYKDGGNDVDPTLVMVASDVVGDQNSVVEFSGVWAKP 229

