BLASTX nr result

ID: Mentha25_contig00035758 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00035758
         (2088 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21259.1| hypothetical protein MIMGU_mgv1a003863mg [Mimulus...   827   0.0  
ref|XP_006342883.1| PREDICTED: uncharacterized protein LOC102584...   733   0.0  
ref|XP_004235519.1| PREDICTED: uncharacterized protein LOC101268...   723   0.0  
gb|EPS60947.1| hypothetical protein M569_13853, partial [Genlise...   715   0.0  
ref|XP_006357428.1| PREDICTED: uncharacterized protein LOC102602...   703   0.0  
ref|XP_004242264.1| PREDICTED: uncharacterized protein LOC101262...   690   0.0  
ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254...   687   0.0  
gb|EXB64649.1| hypothetical protein L484_017982 [Morus notabilis]     672   0.0  
ref|XP_004303772.1| PREDICTED: uncharacterized protein LOC101299...   660   0.0  
ref|XP_003529861.1| PREDICTED: uncharacterized protein LOC100776...   660   0.0  
ref|XP_002303337.1| protein-O-fucosyltransferase 2 [Populus tric...   651   0.0  
ref|NP_199853.1| O-fucosyltransferase family protein [Arabidopsi...   646   0.0  
gb|AAM66093.1| unknown [Arabidopsis thaliana]                         645   0.0  
ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arab...   645   0.0  
ref|XP_003627474.1| GDP-fucose protein-O-fucosyltransferase [Med...   645   0.0  
ref|XP_007024790.1| O-fucosyltransferase family protein isoform ...   643   0.0  
ref|XP_003547949.1| PREDICTED: uncharacterized protein LOC548046...   643   0.0  
ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208...   641   0.0  
ref|XP_002533327.1| conserved hypothetical protein [Ricinus comm...   640   0.0  
ref|XP_007024791.1| O-fucosyltransferase family protein isoform ...   639   e-180

>gb|EYU21259.1| hypothetical protein MIMGU_mgv1a003863mg [Mimulus guttatus]
          Length = 559

 Score =  827 bits (2137), Expect = 0.0
 Identities = 420/561 (74%), Positives = 457/561 (81%)
 Frame = +1

Query: 49   NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKRXXXXXXXXXXXX 228
            NLISQNARPNDVVKSP +H  R   SA +ID     R+SGAAR FNKR            
Sbjct: 15   NLISQNARPNDVVKSPTNHTRR---SALRIDGG--GRLSGAARGFNKRYLLAILLPMVIL 69

Query: 229  XXXXTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLV 408
                TTDLK++F+MR+P+I++ GGN PL NRMRESELRALYLLKQQE++L+KMWNYTTL 
Sbjct: 70   ILYFTTDLKSLFQMRIPTIKDIGGNSPL-NRMRESELRALYLLKQQELQLLKMWNYTTLQ 128

Query: 409  ERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSAG 588
             +                       +DLKSRVF QIS+NKQIQGILLS+HESEG  D   
Sbjct: 129  NQSNSSSVNNSNSFD----------EDLKSRVFSQISLNKQIQGILLSSHESEGFPDLNE 178

Query: 589  NYTDSILSDWTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHMFFAA 768
            N TD+ LS W  C KVD +L++R+TIEW P+SNKYL AICVSGQMSNHLICLEKHMFFAA
Sbjct: 179  NNTDASLSGWNMCGKVDQKLSERRTIEWKPRSNKYLLAICVSGQMSNHLICLEKHMFFAA 238

Query: 769  LLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDKFMCYFSM 948
            LLNRVLVIPSSKVD+ FHRVLDI+ INKCLGRKVVVTFEEFAE KKNHLHIDKFMCYFS+
Sbjct: 239  LLNRVLVIPSSKVDFPFHRVLDIETINKCLGRKVVVTFEEFAEIKKNHLHIDKFMCYFSL 298

Query: 949  PQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDSDVIAIGDVF 1128
            PQPCFMDD+               E VWKEDVK P  + V+DV AKFSSD DVIA+GDVF
Sbjct: 299  PQPCFMDDDHLKKLKGLGLSLGKIETVWKEDVKKPNQRKVDDVTAKFSSDDDVIAVGDVF 358

Query: 1129 FADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFC 1308
            FADVER+WVMQPGGPIAHKCKTLIEPSRLILLTA RFIQTFLGKDF+ALHFRRHGFLKFC
Sbjct: 359  FADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAHRFIQTFLGKDFIALHFRRHGFLKFC 418

Query: 1309 NAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQR 1488
            NAKQPSCF+PVPQAAECINRVVERA+TPV+YLSTDAA SETGLLQSLVV NGKTVPLVQR
Sbjct: 419  NAKQPSCFFPVPQAAECINRVVERANTPVVYLSTDAAASETGLLQSLVVWNGKTVPLVQR 478

Query: 1489 PTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGS 1668
            P RN AEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFT+DILR+RKDWGS
Sbjct: 479  PARNLAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTEDILRIRKDWGS 538

Query: 1669 ASQCDEYLCQGEHPNFIAEDE 1731
            AS CDEYLCQGE PNFIAEDE
Sbjct: 539  ASVCDEYLCQGELPNFIAEDE 559


>ref|XP_006342883.1| PREDICTED: uncharacterized protein LOC102584575 [Solanum tuberosum]
          Length = 568

 Score =  733 bits (1891), Expect = 0.0
 Identities = 367/566 (64%), Positives = 442/566 (78%), Gaps = 5/566 (0%)
 Frame = +1

Query: 49   NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFN----KRXXXXXXXX 216
            NLI QN R ND+ KSP        RS FQI+D  ++R +   R+FN    KR        
Sbjct: 16   NLIHQNERVNDLSKSP-------RRSTFQIED-VKDRFA-LCRRFNFTSGKRYLLAIILP 66

Query: 217  XXXXXXXXTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNY 396
                     TD+K++F+  V +I+  G      N MR+SELRALYLL+QQ++ L K+WN+
Sbjct: 67   VLVLVLYFATDIKSLFQTTVTTIKYDGS----VNSMRDSELRALYLLRQQQLGLFKLWNH 122

Query: 397  TTLVERXXXXXXXXXXXXXXXXXXXXA-MLQDLKSRVFGQISMNKQIQGILLSAHESEGS 573
            T + +                     + +++DLK+ +  QIS+NKQIQ +LLS+H+   S
Sbjct: 123  TLVNDTSTTHTGSSLESTPGFASVSRSSIVEDLKADLLRQISLNKQIQQVLLSSHQLGNS 182

Query: 574  VDSAGNYTDSILSDWTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKH 753
            + ++ N TD  L   +RC+KVD  L+ R+T+EW P+SNKYLFAICVSGQMSNHLICLEKH
Sbjct: 183  LITSDNSTDPTLGGLSRCRKVDHNLSQRRTVEWKPRSNKYLFAICVSGQMSNHLICLEKH 242

Query: 754  MFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDKFM 933
            MFFAALLNR+LVIPSSKVDYEF RVLD+DHINKCLGR+V+VT++EFAE +K+HLHIDKF+
Sbjct: 243  MFFAALLNRILVIPSSKVDYEFRRVLDVDHINKCLGREVIVTYDEFAERRKSHLHIDKFL 302

Query: 934  CYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDSDVIA 1113
            CYFS PQPCF+D+ER              EA W EDVK+PK +TV+D++AKFS+D DV+A
Sbjct: 303  CYFSQPQPCFLDEERVKKLKSLGISMNKLEAAWNEDVKNPKKRTVQDIMAKFSTDDDVLA 362

Query: 1114 IGDVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHG 1293
            IGDVFFADVE+ WVMQPGGPI+HKCKTLIEPSRLI+LTAQRFIQTFLG +F+ALHFRRHG
Sbjct: 363  IGDVFFADVEKDWVMQPGGPISHKCKTLIEPSRLIMLTAQRFIQTFLGDNFIALHFRRHG 422

Query: 1294 FLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTV 1473
            FLKFCNAK+PSCFYPVPQAA+CINRV+ERA++PVIYLSTDAA+SETGLLQSLVV+NGKTV
Sbjct: 423  FLKFCNAKKPSCFYPVPQAADCINRVLERANSPVIYLSTDAAESETGLLQSLVVVNGKTV 482

Query: 1474 PLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLR 1653
            PLVQRP RN+AEKWDALLYRHGLEGD QV+AMLDKTICA+SSVFIGSSGSTFTDDILRLR
Sbjct: 483  PLVQRPARNSAEKWDALLYRHGLEGDPQVDAMLDKTICAMSSVFIGSSGSTFTDDILRLR 542

Query: 1654 KDWGSASQCDEYLCQGEHPNFIAEDE 1731
            KDWGSAS CDEYLCQGE PN++A+DE
Sbjct: 543  KDWGSASLCDEYLCQGELPNYVADDE 568


>ref|XP_004235519.1| PREDICTED: uncharacterized protein LOC101268664 [Solanum
            lycopersicum]
          Length = 565

 Score =  723 bits (1866), Expect = 0.0
 Identities = 366/565 (64%), Positives = 434/565 (76%), Gaps = 4/565 (0%)
 Frame = +1

Query: 49   NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFN----KRXXXXXXXX 216
            NLI QN R N + KSP         S FQI+D  ++R +   R+FN    K         
Sbjct: 16   NLIHQNERVNHLSKSP-------RPSTFQIED-VKDRFA-LCRRFNFTSGKTYLLAIILP 66

Query: 217  XXXXXXXXTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNY 396
                     TD+K +F+  V +I+  G      N MRESELRALYLLKQQ++ L K+WN+
Sbjct: 67   LLVLILYFATDIKALFQTTVTTIKYDGS----VNSMRESELRALYLLKQQQLGLFKLWNH 122

Query: 397  TTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSV 576
            T + +                     ++++DLK  +  QIS+NKQIQ +LLS+H+   S+
Sbjct: 123  TLVNDTSTTHSLESAPGFTLVSRS--SIVEDLKDDLLRQISLNKQIQQVLLSSHQLGNSL 180

Query: 577  DSAGNYTDSILSDWTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHM 756
             ++ N TD  L    RC+KVD  L++R+T+EW P+SNKYLFAICVSGQMSNHLICLEKHM
Sbjct: 181  ITSDNSTDPSLGGLGRCRKVDHNLSERRTVEWKPRSNKYLFAICVSGQMSNHLICLEKHM 240

Query: 757  FFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDKFMC 936
            FFAALLNRVLVIPSSKVDYEF RVLD+DHINKCLGR+V+VT++EFAE +K+HLHIDKF+C
Sbjct: 241  FFAALLNRVLVIPSSKVDYEFRRVLDVDHINKCLGREVIVTYDEFAERRKSHLHIDKFLC 300

Query: 937  YFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDSDVIAI 1116
            YFS PQPCF+D+ER              EA W EDVK+PK +T +D++AKFS D DV+AI
Sbjct: 301  YFSQPQPCFLDEERVKKLKSLGISMNKLEAAWDEDVKNPKKRTAQDIVAKFSMDDDVLAI 360

Query: 1117 GDVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGF 1296
            GDVFFADVE+ WVMQPGGPI+HKCKTLIEPSRLI+LTAQRF+QTFLG +F+ALHFRRHGF
Sbjct: 361  GDVFFADVEKDWVMQPGGPISHKCKTLIEPSRLIMLTAQRFVQTFLGDNFIALHFRRHGF 420

Query: 1297 LKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVP 1476
            LKFCNAK+PSCFYPVPQAA+CINRV+ERA++PV+YLSTDAA+SETGLLQSLVV NGKTVP
Sbjct: 421  LKFCNAKKPSCFYPVPQAADCINRVLERANSPVMYLSTDAAESETGLLQSLVVFNGKTVP 480

Query: 1477 LVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRK 1656
            LVQRP RN+AEKWDALLYRHGLEGD QVEAMLDKTICA+SSVFIGSSGSTFTDDILRLRK
Sbjct: 481  LVQRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTICAMSSVFIGSSGSTFTDDILRLRK 540

Query: 1657 DWGSASQCDEYLCQGEHPNFIAEDE 1731
            DWGSAS CDEYLCQGE PNF+A+DE
Sbjct: 541  DWGSASLCDEYLCQGELPNFVADDE 565


>gb|EPS60947.1| hypothetical protein M569_13853, partial [Genlisea aurea]
          Length = 568

 Score =  715 bits (1845), Expect = 0.0
 Identities = 365/562 (64%), Positives = 422/562 (75%), Gaps = 1/562 (0%)
 Frame = +1

Query: 49   NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKRXXXXXXXXXXXX 228
            NLISQNAR +D VKS    NH +HRS+  ++ D R R S AA  + KR            
Sbjct: 15   NLISQNARSDDAVKSS---NHSHHRSSLHVERDLRRRFSAAAGGYKKRYFLAIVLPALIL 71

Query: 229  XXXXTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLV 408
                TTDLKNVF M +P I   GG+  L++RMRESEL+AL LL+QQE EL K+WNYT+  
Sbjct: 72   VLYFTTDLKNVFAMSIPKIGYHGGDA-LSDRMRESELQALNLLRQQEAELFKLWNYTSSA 130

Query: 409  ERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAH-ESEGSVDSA 585
             +                     +  DLKS+VF Q+S+NK+IQ +LLS+H   E   DS 
Sbjct: 131  NKLNYSHDPVNVNSSAIHNLD--LFLDLKSQVFSQLSLNKRIQTLLLSSHGNGEAFHDSN 188

Query: 586  GNYTDSILSDWTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHMFFA 765
             ++TD  L+  TRC   +  L  R+ +EW+P  NK+L AIC+SGQMSNHLICLEKHMFFA
Sbjct: 189  YSFTDDGLT--TRCPTANRNLLGRRKMEWDPLPNKFLLAICISGQMSNHLICLEKHMFFA 246

Query: 766  ALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDKFMCYFS 945
            ALL R+LVIPSSKVDY FHRVLDIDHIN CLG+K VVTFEEF+  +KNHLHID+F+CYFS
Sbjct: 247  ALLKRILVIPSSKVDYAFHRVLDIDHINTCLGKKAVVTFEEFSVMQKNHLHIDRFLCYFS 306

Query: 946  MPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDSDVIAIGDV 1125
             PQPC+MDDE               E+VWKEDVKSP+   VEDV++KFSS+  V+A+GD+
Sbjct: 307  SPQPCYMDDEYVKKLKGVGLSLSKVESVWKEDVKSPRKTKVEDVVSKFSSNEAVVAVGDL 366

Query: 1126 FFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKF 1305
            FFA VE  WVMQPGGPI HKCKTLIEPSRLI LTAQRF+QTFLGKDF+ALHFRRHGFLKF
Sbjct: 367  FFAQVEEDWVMQPGGPIEHKCKTLIEPSRLIRLTAQRFVQTFLGKDFIALHFRRHGFLKF 426

Query: 1306 CNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQ 1485
            CNAKQPSCFYPVPQAAECINRV+ERA+ PVIYLSTDAA+SETGLLQSLV   G TVPLV+
Sbjct: 427  CNAKQPSCFYPVPQAAECINRVIERANAPVIYLSTDAAESETGLLQSLVTRYGNTVPLVK 486

Query: 1486 RPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWG 1665
            RP RN+AEKWDALLYRHGLEGDSQVEAMLDK ICALSSVFIGSSGSTFT+DILRLR+ W 
Sbjct: 487  RPARNSAEKWDALLYRHGLEGDSQVEAMLDKAICALSSVFIGSSGSTFTEDILRLRRVWE 546

Query: 1666 SASQCDEYLCQGEHPNFIAEDE 1731
            S S CDEYLC+G  PN+IAEDE
Sbjct: 547  SESVCDEYLCEGRLPNYIAEDE 568


>ref|XP_006357428.1| PREDICTED: uncharacterized protein LOC102602087 [Solanum tuberosum]
          Length = 565

 Score =  703 bits (1815), Expect = 0.0
 Identities = 357/570 (62%), Positives = 428/570 (75%), Gaps = 9/570 (1%)
 Frame = +1

Query: 49   NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKRXXXXXXXXXXXX 228
            NLI+Q  R N++ +SP        R+AFQIDD+  +      R FN              
Sbjct: 16   NLIAQRERGNNLSESPV-------RTAFQIDDEIAD-----TRPFNSSCSKCCYFLTIIV 63

Query: 229  XXXX------TTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMW 390
                      TTD+ NV +  V        N    N MRESELRALYLL+QQ++ L K+W
Sbjct: 64   VTVFIFIRFYTTDVDNVSKTGVM-------NNDSVNLMRESELRALYLLRQQQLGLFKLW 116

Query: 391  NYTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEG 570
            N  TL++                     A+ ++LK  +  QIS+NKQIQ  LLS+H+   
Sbjct: 117  N-NTLIDNSLNATAANNSNFVSTSLFSSALSEELKLELISQISLNKQIQQALLSSHQLGN 175

Query: 571  SVDSAGNYTDSILSDW---TRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLIC 741
             ++++ N TD  L D+    RC+K+D +L+DR+TIEW P+S+KYLFAIC SGQMSNHLIC
Sbjct: 176  LLNASDNATDPSLDDYGGLDRCRKMDYKLSDRRTIEWEPRSDKYLFAICASGQMSNHLIC 235

Query: 742  LEKHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHI 921
            LEKHMFFAALLNR+L+IPSS+VDYEF RVLDIDHINKCLGRKVVVTFEEFA+S+K H+HI
Sbjct: 236  LEKHMFFAALLNRILIIPSSRVDYEFRRVLDIDHINKCLGRKVVVTFEEFAKSQKGHMHI 295

Query: 922  DKFMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDS 1101
            DKF+CYFS PQPCF+DDE               EA W ED+K+PK +TV+D++ KFS D 
Sbjct: 296  DKFICYFSQPQPCFLDDEHVKKLKSLGVSMNKLEAAWDEDIKNPKPRTVQDIMTKFSLDD 355

Query: 1102 DVIAIGDVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHF 1281
            DVIAIGDVFFA+VE++WVMQPGGPI+HKCKTL+EPSRLILLTAQRFIQTFLGK+F+ALHF
Sbjct: 356  DVIAIGDVFFANVEKKWVMQPGGPISHKCKTLVEPSRLILLTAQRFIQTFLGKNFIALHF 415

Query: 1282 RRHGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLN 1461
            RRHGFLKFCNAK+PSCFYPVPQAA+CINRVVERA+ PVIYLSTDAA+SETG+LQSLV +N
Sbjct: 416  RRHGFLKFCNAKKPSCFYPVPQAADCINRVVERATAPVIYLSTDAAESETGILQSLVAVN 475

Query: 1462 GKTVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDI 1641
            GKTVPLV+RP +N+AEKWDALLYRHGLEGD QVEAMLDKTICA+S VFIGS GSTFT+DI
Sbjct: 476  GKTVPLVRRPAQNSAEKWDALLYRHGLEGDRQVEAMLDKTICAMSEVFIGSMGSTFTEDI 535

Query: 1642 LRLRKDWGSASQCDEYLCQGEHPNFIAEDE 1731
            LRLRKDWG++S CDEYLC+GE P+FIA+DE
Sbjct: 536  LRLRKDWGTSSLCDEYLCRGEVPSFIADDE 565


>ref|XP_004242264.1| PREDICTED: uncharacterized protein LOC101262928 [Solanum
            lycopersicum]
          Length = 562

 Score =  690 bits (1781), Expect = 0.0
 Identities = 348/564 (61%), Positives = 422/564 (74%), Gaps = 3/564 (0%)
 Frame = +1

Query: 49   NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKRXXXXXXXXXXXX 228
            NLI+Q  R N++ + P+       R+AFQIDD+  N                        
Sbjct: 14   NLIAQRQRGNNLSEFPE-------RTAFQIDDEIANTRPSDPSCSKCCCFSTIIFAVFVI 66

Query: 229  XXXXTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLV 408
                +T + NV +  V        N    N M ESELRAL LL+QQ++ L K+WN  TL+
Sbjct: 67   ILCFSTGVNNVSKTGVM-------NNDSVNLMLESELRALSLLRQQQLGLFKLWN-NTLI 118

Query: 409  ERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSAG 588
            +                      + ++LK  +  QIS+NKQIQ  LLS+H+    ++++ 
Sbjct: 119  DNSLNATAANNSNIVSTSLFSSVLSEELKLDLISQISLNKQIQQALLSSHQLSNLLNASD 178

Query: 589  NYTDSILSDWT---RCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHMF 759
            N TD  L D++   RC+K+D +L+DR+TIEW P+S+KYLFAIC SGQMSNHLICLEKHMF
Sbjct: 179  NATDPSLDDYSGLHRCRKMDYKLSDRRTIEWKPRSDKYLFAICASGQMSNHLICLEKHMF 238

Query: 760  FAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDKFMCY 939
            FAALLNR+++IPSS+VDYEF RVLDIDHINKCLGRKVVVTFEEFA+S+K H+HIDKF+CY
Sbjct: 239  FAALLNRIMIIPSSRVDYEFRRVLDIDHINKCLGRKVVVTFEEFAKSQKGHMHIDKFVCY 298

Query: 940  FSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDSDVIAIG 1119
            FS PQPCF+DDE               EA W ED+K+PK +TV+D+++KFS D  VIAIG
Sbjct: 299  FSQPQPCFLDDEHLKKLKSLGVSTNKLEAAWDEDIKNPKPRTVQDIMSKFSLDDAVIAIG 358

Query: 1120 DVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFL 1299
            DVFFA+VE++WVMQPGGPI+HKCKTL+EPSRLILLTAQRFIQTFLGK+F+ALHFRRHGFL
Sbjct: 359  DVFFANVEKKWVMQPGGPISHKCKTLVEPSRLILLTAQRFIQTFLGKNFIALHFRRHGFL 418

Query: 1300 KFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPL 1479
            KFCNAK+PSCFYPVPQAA+CINRVVERA+ PVIYLSTDAA+SETG+LQSLVV+NGKTVPL
Sbjct: 419  KFCNAKKPSCFYPVPQAADCINRVVERATAPVIYLSTDAAESETGILQSLVVVNGKTVPL 478

Query: 1480 VQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKD 1659
            V+RP +N+AEKWDALLYRHGLEGD QVEAMLDKTICA+S VFIGS GSTFT+DILRLRK 
Sbjct: 479  VRRPAQNSAEKWDALLYRHGLEGDRQVEAMLDKTICAISEVFIGSMGSTFTEDILRLRKA 538

Query: 1660 WGSASQCDEYLCQGEHPNFIAEDE 1731
            WG++S CDEYLC+GE PNFIA+DE
Sbjct: 539  WGTSSLCDEYLCRGEVPNFIADDE 562


>ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254979 isoform 1 [Vitis
            vinifera]
          Length = 559

 Score =  687 bits (1774), Expect = 0.0
 Identities = 347/564 (61%), Positives = 418/564 (74%), Gaps = 3/564 (0%)
 Frame = +1

Query: 49   NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKRXXXXXXXXXXXX 228
            NLI +N R     K P       HRS FQI+D F++R+S     FNKR            
Sbjct: 14   NLIDENER-----KLP-------HRSGFQIED-FKSRLSAHRFSFNKRYLFAIFPPLFIL 60

Query: 229  XXXXTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLV 408
                TTD++N+F   +  ++    + P  +RMRESELRALYLL+QQ++ L  +WN+T   
Sbjct: 61   LIYFTTDVRNLFTTSISIVK---ADSP-TDRMRESELRALYLLRQQQLSLFSLWNHTAFA 116

Query: 409  ERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGS---VD 579
            +                         D KS +  QIS+NK+IQ +LLS+H S      VD
Sbjct: 117  DSAPIPSNSSNSTLDFSTRQVLLSSADFKSALLKQISLNKEIQQVLLSSHPSGNLSELVD 176

Query: 580  SAGNYTDSILSDWTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHMF 759
              G+      S + RC KV+  ++ R TIEW P+S+KYLFAIC+SGQMSNHLICLEKHMF
Sbjct: 177  DNGDLNFGAYS-FNRCPKVNQNMSQRPTIEWKPRSDKYLFAICLSGQMSNHLICLEKHMF 235

Query: 760  FAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDKFMCY 939
            FAALLNR+LVIPSSK DY+++RVLDI+HIN CLGRKVVVTFEEF ESKKNHLHID+ +CY
Sbjct: 236  FAALLNRILVIPSSKFDYQYNRVLDIEHINNCLGRKVVVTFEEFTESKKNHLHIDRVICY 295

Query: 940  FSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDSDVIAIG 1119
            FS+P PC++DD+               E  W ED+K PK +T +DV AKFSS+ DVIAIG
Sbjct: 296  FSLPLPCYVDDDHVKKLKSLGISMGKLEPAWAEDIKKPKKRTAQDVQAKFSSNDDVIAIG 355

Query: 1120 DVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFL 1299
            DVF+A+VE +WVMQPGGP+AHKC+TLIEPSRLI+LTAQRF+QTFLGK F ALHFRRHGFL
Sbjct: 356  DVFYANVEEEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRFVQTFLGKSFTALHFRRHGFL 415

Query: 1300 KFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPL 1479
            KFCNAK+PSCF+P+PQAA+CI+RVVERA TPVIYLSTDAA+SETGLLQSLVVLNGK VPL
Sbjct: 416  KFCNAKEPSCFFPIPQAADCISRVVERADTPVIYLSTDAAESETGLLQSLVVLNGKLVPL 475

Query: 1480 VQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKD 1659
            ++RPTRN+AEKWDALLYRHGL+GDSQVEAMLDKTICA++SVFIG+ GSTFT+DILRLR+ 
Sbjct: 476  IKRPTRNSAEKWDALLYRHGLDGDSQVEAMLDKTICAMASVFIGAPGSTFTEDILRLRRG 535

Query: 1660 WGSASQCDEYLCQGEHPNFIAEDE 1731
            WGSAS CDEYLCQGE PNFIA++E
Sbjct: 536  WGSASHCDEYLCQGEQPNFIADNE 559


>gb|EXB64649.1| hypothetical protein L484_017982 [Morus notabilis]
          Length = 578

 Score =  672 bits (1734), Expect = 0.0
 Identities = 345/579 (59%), Positives = 423/579 (73%), Gaps = 18/579 (3%)
 Frame = +1

Query: 49   NLISQNARPNDVVKSPDHHNHRNH-RSAFQIDD------DFRNRVS---GAARKFNKRXX 198
            NLI QN R             +NH RS F IDD      +FR+R+     +    NK+  
Sbjct: 16   NLIEQNER-----------KLQNHPRSTFHIDDVDGGNREFRSRIRRRLSSLGLLNKKFM 64

Query: 199  XXXXXXXXXXXXXXTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIEL 378
                          +TD++ +F   +  +R        ++R+RESELRAL+LL+QQ++ L
Sbjct: 65   FAIFLPLFIVVLFLSTDVRGLFSADLSGVRF----DSFSDRLRESELRALFLLRQQQLGL 120

Query: 379  VKMWNYT------TLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQG 540
              +WN T                               +++ DLK  V  Q+S+NK+IQ 
Sbjct: 121  FALWNQTFHDSPPISSNSTNNSSSSSSINSSASGTEQNSVIDDLKFAVLRQLSLNKEIQQ 180

Query: 541  ILLSAHESEGSVDSAGNYTDSIL--SDWTRCKKVDPRLADRKTIEWNPKSNKYLFAICVS 714
            +LLS H S G+  S  +  D  L  SD+  C+KVD + + R+TIEW P SNK+LFAIC+S
Sbjct: 181  VLLSPHRS-GNSSSITDAGDPNLGGSDFDTCRKVDQKFSQRRTIEWKPNSNKFLFAICLS 239

Query: 715  GQMSNHLICLEKHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFA 894
            GQMSN LICLEKHMFFAALLNRVLVIPSSKVDY+++RVLDIDHINKCLGRKVV++FE+FA
Sbjct: 240  GQMSNRLICLEKHMFFAALLNRVLVIPSSKVDYQYNRVLDIDHINKCLGRKVVISFEDFA 299

Query: 895  ESKKNHLHIDKFMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVED 1074
            E+KKNH+HI++F+CYFS PQPC++DDE               E+ W ED+K P  +TV+D
Sbjct: 300  ETKKNHMHINRFICYFSQPQPCYVDDEHIKKLKGLGLTMGKLESAWTEDIKGPNKRTVQD 359

Query: 1075 VLAKFSSDSDVIAIGDVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFL 1254
            V +KFS++ DVIAIGDVF+ADVE++WVMQPGGP+AHKC+TLIEPSRLI+LTAQRFIQTFL
Sbjct: 360  VQSKFSTNDDVIAIGDVFYADVEQEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRFIQTFL 419

Query: 1255 GKDFVALHFRRHGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETG 1434
            GK+FVALHFRRHGFLKFCNAKQPSCF+P+PQAA+CI  VVERA+ PVIYLSTDAA+SETG
Sbjct: 420  GKNFVALHFRRHGFLKFCNAKQPSCFFPIPQAADCITSVVERANAPVIYLSTDAAESETG 479

Query: 1435 LLQSLVVLNGKTVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGS 1614
            LLQSL+VLNGK VPLV+RP RN+AEKWDALLYRHGLEGDSQVEAMLDKTICA+SSVFIG+
Sbjct: 480  LLQSLIVLNGKPVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGA 539

Query: 1615 SGSTFTDDILRLRKDWGSASQCDEYLCQGEHPNFIAEDE 1731
             GSTFT+DILRLRKDWGSAS CD+YLCQGE PNF+A++E
Sbjct: 540  PGSTFTEDILRLRKDWGSASSCDKYLCQGEEPNFVADNE 578


>ref|XP_004303772.1| PREDICTED: uncharacterized protein LOC101299396 [Fragaria vesca
            subsp. vesca]
          Length = 556

 Score =  660 bits (1702), Expect = 0.0
 Identities = 348/574 (60%), Positives = 408/574 (71%), Gaps = 13/574 (2%)
 Frame = +1

Query: 49   NLISQNAR-----PNDV----VKSPDHHNHRNHRSAFQIDDDFRNRVSGAARK--FNKRX 195
            NLI QN R     P       +   D   HR+HR       + R R +    +  FNKR 
Sbjct: 19   NLIEQNDRKQLPSPRSATTFHIDDGDVDRHRHHR-------EIRRRFASLNLRDLFNKRS 71

Query: 196  XXXXXXXXXXXXXXX--TTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQE 369
                             +TD+K++F            +  ++ ++RESELRALYLL+QQ+
Sbjct: 72   FLVFFIFIPLFVLVLFFSTDIKSLF------FSHLSVSDSVSGKLRESELRALYLLRQQQ 125

Query: 370  IELVKMWNYTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILL 549
            + L  +WN T+                          L DLKS V  QIS+NK+IQ +LL
Sbjct: 126  LGLFGLWNSTSNHSNPD--------------------LDDLKSSVLRQISLNKEIQQVLL 165

Query: 550  SAHESEGSVDSAGNYTDSILSDWTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSN 729
            S H S  S +S  ++ D  L D  RC+ VD R ++R+TIEW P S+KYL AICVSGQMSN
Sbjct: 166  SPHSSGNSSESE-DFRDPSLGD--RCRVVDQRFSERRTIEWKPNSDKYLLAICVSGQMSN 222

Query: 730  HLICLEKHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKN 909
            HLICLEKHMFFAALLNR+LVIPSSKVDY++  VLDI+HINKC+GRKVVVTFEE AE KKN
Sbjct: 223  HLICLEKHMFFAALLNRILVIPSSKVDYQYSTVLDIEHINKCIGRKVVVTFEELAEEKKN 282

Query: 910  HLHIDKFMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKF 1089
            H+HID+F+CYFS P  C++DDE               E  W EDVK P  KTV+DV +KF
Sbjct: 283  HIHIDRFICYFSKPTLCYVDDEHLKKLKALGISYKSREPAWGEDVKKPSKKTVQDVQSKF 342

Query: 1090 SSDSDVIAIGDVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFV 1269
            SS  +VIAIGDVFFAD E+ WVMQPGGP+AHKCKTLIEPSRLILLTAQRFIQTFLGK+FV
Sbjct: 343  SSGDEVIAIGDVFFADAEQDWVMQPGGPLAHKCKTLIEPSRLILLTAQRFIQTFLGKNFV 402

Query: 1270 ALHFRRHGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSL 1449
            ALHFRRHGFLKFCN KQPSCFYP+PQAA+CI R+ ERA+ PV+YLSTDAA+SETGLLQSL
Sbjct: 403  ALHFRRHGFLKFCNNKQPSCFYPIPQAADCITRIAERANAPVVYLSTDAAESETGLLQSL 462

Query: 1450 VVLNGKTVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTF 1629
            VV+NGKTVPLV+RP RN+AEKWDALLYRHG+EGD QVEAMLDKTI A+SSVFIG+SGSTF
Sbjct: 463  VVVNGKTVPLVKRPARNSAEKWDALLYRHGIEGDPQVEAMLDKTISAMSSVFIGASGSTF 522

Query: 1630 TDDILRLRKDWGSASQCDEYLCQGEHPNFIAEDE 1731
            T+DILRLRK WGSAS CDEYLCQGE PNFIAE+E
Sbjct: 523  TEDILRLRKGWGSASVCDEYLCQGEEPNFIAENE 556


>ref|XP_003529861.1| PREDICTED: uncharacterized protein LOC100776069 [Glycine max]
          Length = 543

 Score =  660 bits (1702), Expect = 0.0
 Identities = 340/557 (61%), Positives = 404/557 (72%), Gaps = 12/557 (2%)
 Frame = +1

Query: 97   DHHN--HRNHR--------SAFQIDDDFRNRVSGAARKFNKRXXXXXXXXXXXXXXXXTT 246
            DH N    NHR        +AF ++D   +R    +    K+                 T
Sbjct: 10   DHRNLVDNNHRKPPSPPPSAAFHVED-LSSRFRRVSFALQKKYIIAILALLFLLLFFSIT 68

Query: 247  DLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXX 426
            D   +F    PS  +F     + +RM+ESELRA+ LL QQ+  L+  WN+T         
Sbjct: 69   DFHQLFS--TPSSFKFDS---ITDRMKESELRAINLLYQQQQSLLTAWNHTLRTNASDPN 123

Query: 427  XXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSAGNYTDSI 606
                             +L+DLKS +F QIS+N++IQ ILL+ H + G+        ++ 
Sbjct: 124  -----------------LLEDLKSSLFKQISLNREIQQILLNPHSTGGNAIEPELDLNAT 166

Query: 607  LSD--WTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHMFFAALLNR 780
            L+   + RC+ VD  L+ RKTIEWNP+  K+L AICVSGQMSNHLICLEKHMFFAALLNR
Sbjct: 167  LNGVVYDRCRTVDQNLSQRKTIEWNPRDGKFLLAICVSGQMSNHLICLEKHMFFAALLNR 226

Query: 781  VLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDKFMCYFSMPQPC 960
            VLVIPSSKVDY++ RV+DIDHINKCLG+KVVV+FEEF+  KK HLHIDKF+CYFS PQPC
Sbjct: 227  VLVIPSSKVDYQYDRVVDIDHINKCLGKKVVVSFEEFSNLKKGHLHIDKFLCYFSHPQPC 286

Query: 961  FMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDSDVIAIGDVFFADV 1140
            ++DDER              EAVW ED + PK KTV+DVL KFS D DV+AIGDVF+A+V
Sbjct: 287  YLDDERLKKLGALGLTMSKPEAVWDEDTRKPKKKTVQDVLGKFSFDDDVMAIGDVFYAEV 346

Query: 1141 ERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCNAKQ 1320
            ER+WVMQPGGPIAHKCKTLIEP+RLILLTAQRFIQTFLG++F+ALHFRRHGFLKFCNAK+
Sbjct: 347  EREWVMQPGGPIAHKCKTLIEPNRLILLTAQRFIQTFLGRNFIALHFRRHGFLKFCNAKK 406

Query: 1321 PSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRPTRN 1500
            PSCFYP+PQAA+CI RVVE A  P+IYLSTDAA+SETGLLQSLVVLNG+ VPLV RP RN
Sbjct: 407  PSCFYPIPQAADCILRVVEMADAPIIYLSTDAAESETGLLQSLVVLNGRPVPLVIRPARN 466

Query: 1501 AAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSASQC 1680
            +AEKWDALLYRH ++GDSQVEAMLDKTICA+SSVFIG+ GSTFT+DILRLRKDWGSAS C
Sbjct: 467  SAEKWDALLYRHNMDGDSQVEAMLDKTICAMSSVFIGAPGSTFTEDILRLRKDWGSASMC 526

Query: 1681 DEYLCQGEHPNFIAEDE 1731
            DEYLCQGE PN IAE+E
Sbjct: 527  DEYLCQGEEPNIIAENE 543


>ref|XP_002303337.1| protein-O-fucosyltransferase 2 [Populus trichocarpa]
            gi|222840769|gb|EEE78316.1| protein-O-fucosyltransferase
            2 [Populus trichocarpa]
          Length = 527

 Score =  651 bits (1679), Expect = 0.0
 Identities = 327/502 (65%), Positives = 390/502 (77%), Gaps = 5/502 (0%)
 Frame = +1

Query: 241  TTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYT---TLVE 411
            +TD++N+F   +           L+ RMRESELRALYLLK+Q++ L  +WN T   TL+E
Sbjct: 49   STDIRNLFSTHLKV------GDSLSIRMRESELRALYLLKKQQLSLFSLWNSTGNSTLLE 102

Query: 412  RXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSAGN 591
            +                       +DLKS +  QIS+NK+IQ +LL+ HES G+V S+ +
Sbjct: 103  KDLNS----------------VSFEDLKSALLKQISLNKEIQQVLLAPHES-GNVSSSSS 145

Query: 592  YTDSILSDW--TRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHMFFA 765
              D   +     RC+KVD R ADRKTIEW PK NK+LFA+C+SGQMSNHLICLEKHMFFA
Sbjct: 146  DLDFSNAGGFVQRCEKVDQRFADRKTIEWKPKPNKFLFALCLSGQMSNHLICLEKHMFFA 205

Query: 766  ALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDKFMCYFS 945
            ALLNRVLVIPSS+ DY+++RVLDI+H+N CLGRKVVVTFEEF E  KN  HID+F CYFS
Sbjct: 206  ALLNRVLVIPSSRFDYQYNRVLDIEHVNDCLGRKVVVTFEEFVEIMKNKPHIDRFFCYFS 265

Query: 946  MPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDSDVIAIGDV 1125
             P PC++D+E               E+ WKED+K P   TV+DV  KF SD +VIA+GDV
Sbjct: 266  DPTPCYVDEEHVKKLKGLGVSMGKLESPWKEDIKKPSKLTVKDVEGKFVSDDNVIAVGDV 325

Query: 1126 FFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKF 1305
            FFADVE +W+MQPGGPIAHKCKTLIEP+R+I+LTAQRFIQTFLG +F+ALHFRRHGFLKF
Sbjct: 326  FFADVEEEWIMQPGGPIAHKCKTLIEPTRIIMLTAQRFIQTFLGSNFIALHFRRHGFLKF 385

Query: 1306 CNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQ 1485
            CNAK+PSCFYPVPQAA+CI RVVERA+ PV+YLSTDAA+SETGLLQSLVV+NG+TVPLV 
Sbjct: 386  CNAKKPSCFYPVPQAADCIARVVERANAPVVYLSTDAAESETGLLQSLVVVNGRTVPLVT 445

Query: 1486 RPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWG 1665
            RP+RNAAEKWDALLYRHGL+ D+QVEAMLDKTICA+SSVFIG+SGSTFT+DI RLRK W 
Sbjct: 446  RPSRNAAEKWDALLYRHGLQEDAQVEAMLDKTICAMSSVFIGASGSTFTEDIFRLRKGWE 505

Query: 1666 SASQCDEYLCQGEHPNFIAEDE 1731
            SAS CDEYLCQGE PN+IAE+E
Sbjct: 506  SASSCDEYLCQGELPNYIAENE 527


>ref|NP_199853.1| O-fucosyltransferase family protein [Arabidopsis thaliana]
            gi|9758924|dbj|BAB09461.1| unnamed protein product
            [Arabidopsis thaliana] gi|133778858|gb|ABO38769.1|
            At5g50420 [Arabidopsis thaliana]
            gi|332008558|gb|AED95941.1| O-fucosyltransferase family
            protein [Arabidopsis thaliana]
            gi|591401714|gb|AHL38584.1| glycosyltransferase, partial
            [Arabidopsis thaliana]
          Length = 566

 Score =  646 bits (1667), Expect = 0.0
 Identities = 330/542 (60%), Positives = 397/542 (73%), Gaps = 3/542 (0%)
 Frame = +1

Query: 115  NHRSAFQIDDDFRNRVSGAARKFNKRXXXXXXXXXXXXXXXXT-TDLKNVFRMRVPSIRE 291
            N RSAFQIDD             NKR                  TD + +F     S + 
Sbjct: 40   NQRSAFQIDDILHRVQHRGKISLNKRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKL 99

Query: 292  FGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXXXXXXXXXXXXXXXXX 471
                 PL+NR++ESELRALYLL+QQ++ L+ +WN T +                      
Sbjct: 100  ----DPLSNRVKESELRALYLLRQQQLALLSLWNGTLV---------NPSLNQSENALGS 146

Query: 472  XAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSAGNYTDSILSDWTRCKKVDPRLA 651
              + +D+KS V  QIS+NK+IQ +LLS H S     S G   DS+   + RC+KVD +L+
Sbjct: 147  SVLFEDVKSAVSKQISLNKEIQEVLLSPHRSSNY--SGGTDVDSVNFSYNRCRKVDQKLS 204

Query: 652  DRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHMFFAALLNRVLVIPSSKVDYEFHRVL 831
            DRKT+EW P+S+K+LFAIC+SGQMSNHLICLEKHMFFAALL+RVLVIPSSK DY++ RV+
Sbjct: 205  DRKTVEWKPRSDKFLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVI 264

Query: 832  DIDHINKCLGRKVVVTFEEFAE-SKKNHLHIDKFMCYFSMPQPCFMDDERXXXXXXXXXX 1008
            DI+ IN CLGR VVV F++F E +KKNH  ID+F+CYFS PQ C++D+E           
Sbjct: 265  DIERINTCLGRNVVVAFDQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGIS 324

Query: 1009 XXXX-EAVWKEDVKSPKHKTVEDVLAKFSSDSDVIAIGDVFFADVERQWVMQPGGPIAHK 1185
                 EA W ED+K P  +TV+DV  KF SD DVIAIGDVF+AD+E+ WVMQPGGPI HK
Sbjct: 325  IDGKLEAPWSEDIKKPSKRTVQDVQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHK 384

Query: 1186 CKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCNAKQPSCFYPVPQAAECIN 1365
            CKTLIEPS+LILLTAQRFIQTFLGK+F+ALHFRRHGFLKFCNAK PSCFYP+PQAAECI 
Sbjct: 385  CKTLIEPSKLILLTAQRFIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIA 444

Query: 1366 RVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRPTRNAAEKWDALLYRHGLE 1545
            R+VER++  VIYLSTDAA+SET LLQSLVV++GK VPLV+RP RN+AEKWDALLYRHG+E
Sbjct: 445  RIVERSNGAVIYLSTDAAESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIE 504

Query: 1546 GDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSASQCDEYLCQGEHPNFIAE 1725
             DSQV+AMLDKTICA+SSVFIG+SGSTFT+DILRLRKDWG++S CDEYLC+GE PNFIAE
Sbjct: 505  DDSQVDAMLDKTICAMSSVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAE 564

Query: 1726 DE 1731
            DE
Sbjct: 565  DE 566


>gb|AAM66093.1| unknown [Arabidopsis thaliana]
          Length = 566

 Score =  645 bits (1665), Expect = 0.0
 Identities = 329/542 (60%), Positives = 397/542 (73%), Gaps = 3/542 (0%)
 Frame = +1

Query: 115  NHRSAFQIDDDFRNRVSGAARKFNKRXXXXXXXXXXXXXXXXT-TDLKNVFRMRVPSIRE 291
            N RSAFQIDD             NKR                  TD + +F     S + 
Sbjct: 40   NQRSAFQIDDILHRVQHRGKISLNKRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKL 99

Query: 292  FGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXXXXXXXXXXXXXXXXX 471
                 PL+NR++ESELRALYLL+QQ++ L+ +WN T +                      
Sbjct: 100  ----DPLSNRVKESELRALYLLRQQQLALLSLWNGTLV---------NPSLNQSENALGS 146

Query: 472  XAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSAGNYTDSILSDWTRCKKVDPRLA 651
              + +D+KS V  QIS+NK+IQ +LLS H S     S G   DS+   + RC+KVD +L+
Sbjct: 147  SVLFEDVKSAVSKQISLNKEIQEVLLSPHRSSNY--SGGTDVDSVNFSYNRCRKVDQKLS 204

Query: 652  DRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHMFFAALLNRVLVIPSSKVDYEFHRVL 831
            DRKT+EW P+S+K+LFAIC+SGQMSNHL+CLEKHMFFAALL+RVLVIPSSK DY++ RV+
Sbjct: 205  DRKTVEWKPRSDKFLFAICLSGQMSNHLLCLEKHMFFAALLDRVLVIPSSKFDYQYDRVI 264

Query: 832  DIDHINKCLGRKVVVTFEEFAE-SKKNHLHIDKFMCYFSMPQPCFMDDERXXXXXXXXXX 1008
            DI+ IN CLGR VVV F++F E +KKNH  ID+F+CYFS PQ C++D+E           
Sbjct: 265  DIERINTCLGRNVVVAFDQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGIS 324

Query: 1009 XXXX-EAVWKEDVKSPKHKTVEDVLAKFSSDSDVIAIGDVFFADVERQWVMQPGGPIAHK 1185
                 EA W ED+K P  +TV+DV  KF SD DVIAIGDVF+AD+E+ WVMQPGGPI HK
Sbjct: 325  IDGKLEAPWSEDIKKPSKRTVQDVQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHK 384

Query: 1186 CKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCNAKQPSCFYPVPQAAECIN 1365
            CKTLIEPS+LILLTAQRFIQTFLGK+F+ALHFRRHGFLKFCNAK PSCFYP+PQAAECI 
Sbjct: 385  CKTLIEPSKLILLTAQRFIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIA 444

Query: 1366 RVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRPTRNAAEKWDALLYRHGLE 1545
            R+VER++  VIYLSTDAA+SET LLQSLVV++GK VPLV+RP RN+AEKWDALLYRHG+E
Sbjct: 445  RIVERSNGAVIYLSTDAAESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIE 504

Query: 1546 GDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSASQCDEYLCQGEHPNFIAE 1725
             DSQV+AMLDKTICA+SSVFIG+SGSTFT+DILRLRKDWG++S CDEYLC+GE PNFIAE
Sbjct: 505  DDSQVDAMLDKTICAMSSVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAE 564

Query: 1726 DE 1731
            DE
Sbjct: 565  DE 566


>ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arabidopsis lyrata subsp.
            lyrata] gi|297311638|gb|EFH42062.1| hypothetical protein
            ARALYDRAFT_918074 [Arabidopsis lyrata subsp. lyrata]
          Length = 566

 Score =  645 bits (1664), Expect = 0.0
 Identities = 335/568 (58%), Positives = 407/568 (71%), Gaps = 7/568 (1%)
 Frame = +1

Query: 49   NLISQN----ARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKRXXXXXXXX 216
            +LI QN        D + S       N RSAFQI+D  +          NKR        
Sbjct: 14   HLIPQNDTRIRHREDPISSTATTTGGNQRSAFQIEDILQRVQRRWKISLNKRYVIVFVSL 73

Query: 217  XXXXXXXXT-TDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWN 393
                      TD + +F     S +      PL+NR++ESELRALYLL+QQ++ L+ +WN
Sbjct: 74   IISIGLLFLLTDPRELFSANFSSFKL----DPLSNRVKESELRALYLLRQQQLALLSLWN 129

Query: 394  YTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGS 573
             T +                        + +D+KS V  QIS+NK+IQ +LLS H S   
Sbjct: 130  GTLV---------NPSLNQSENDLRSSVLFEDVKSAVSKQISLNKEIQNVLLSPHRSSNY 180

Query: 574  VDSAGNYTDSILSDWTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKH 753
              S G   DS+   + RC+KVD +L+DRKT+EW P+S+K+LFAIC+SGQMSNHLICLEKH
Sbjct: 181  --SGGTEVDSVNFSYDRCRKVDQKLSDRKTVEWKPRSDKFLFAICLSGQMSNHLICLEKH 238

Query: 754  MFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAE-SKKNHLHIDKF 930
            MFFAALL+RVLVIPSSK DY++ RV+DI+ IN CLGR VVV+F++F E +KKNH  ID+F
Sbjct: 239  MFFAALLDRVLVIPSSKFDYQYDRVIDIEGINTCLGRNVVVSFDQFKEKAKKNHFRIDRF 298

Query: 931  MCYFSMPQPCFMDDERXXXXXXXXXXXXXX-EAVWKEDVKSPKHKTVEDVLAKFSSDSDV 1107
            +CYFS PQ C++D+E                EA W ED+K P  +TV+DV  KF SD DV
Sbjct: 299  ICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKKPSKRTVQDVQTKFKSDDDV 358

Query: 1108 IAIGDVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRR 1287
            IAIGDVF+AD+E+ WVMQPGGPI HKCKTLIEPS+LILLTAQRFIQTFLGK+F+ALHFRR
Sbjct: 359  IAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQRFIQTFLGKNFIALHFRR 418

Query: 1288 HGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGK 1467
            HGFLKFCNAK PSCFYP+PQAAECI R+VER++  VIYLSTDAA+SET LLQSLVV++GK
Sbjct: 419  HGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDAAESETSLLQSLVVVDGK 478

Query: 1468 TVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILR 1647
             VPLV+RP RN+AEKWDALLYRHG+E DSQV+AMLDKTICA+SSVFIG+SGSTFT+DILR
Sbjct: 479  IVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMSSVFIGASGSTFTEDILR 538

Query: 1648 LRKDWGSASQCDEYLCQGEHPNFIAEDE 1731
            LRKDWG++S CDEYLC+GE PNFIAEDE
Sbjct: 539  LRKDWGTSSTCDEYLCRGEEPNFIAEDE 566


>ref|XP_003627474.1| GDP-fucose protein-O-fucosyltransferase [Medicago truncatula]
            gi|355521496|gb|AET01950.1| GDP-fucose
            protein-O-fucosyltransferase [Medicago truncatula]
          Length = 542

 Score =  645 bits (1664), Expect = 0.0
 Identities = 316/493 (64%), Positives = 391/493 (79%), Gaps = 7/493 (1%)
 Frame = +1

Query: 274  VPSIREFGGNG----PLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXXXXXXX 441
            VP++R +         + +RM+ESELRA+YLL+QQ+  L  ++N +   +          
Sbjct: 67   VPNLRRYFTTSFTSDSITDRMKESELRAIYLLRQQQQRLSTVFNSSDQNQNPNPK----- 121

Query: 442  XXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSAGNYTDSILS--D 615
                        +++DLKS +F QIS+N +IQ ILL+ H +   +D   N+ +S  +  +
Sbjct: 122  ------------LIEDLKSALFKQISINNEIQQILLNPHRTGNVIDPEFNFGNSNFNVGN 169

Query: 616  WTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHMFFAALLNRVLVIP 795
            + RC+ VD  L+ RKTIEWNPK +K+L AICVSGQMSNHLICLEKHMFFAA+LNRVLVIP
Sbjct: 170  YDRCRTVDQSLSKRKTIEWNPKKDKFLVAICVSGQMSNHLICLEKHMFFAAILNRVLVIP 229

Query: 796  SSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDKFMCYFSMPQPCFMDDE 975
            SSKVDY++ RV+DIDHINKCLG+KVV++F+EF+  KK HLHIDKF+CYF++PQPC++DDE
Sbjct: 230  SSKVDYQYDRVVDIDHINKCLGKKVVMSFDEFSNVKKGHLHIDKFLCYFALPQPCYLDDE 289

Query: 976  RXXXXXXXXXXXXXXEAVWK-EDVKSPKHKTVEDVLAKFSSDSDVIAIGDVFFADVERQW 1152
            R              +AVW+ ED ++PK KTV+DV+ KFS D DV+AIGDVF+A VE +W
Sbjct: 290  RLKKLDGLGLGMSKPKAVWEDEDTRNPKKKTVQDVMDKFSYDDDVMAIGDVFYAKVEHEW 349

Query: 1153 VMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCNAKQPSCF 1332
            VMQPGGPIAH+CKTLIEP+RLILLTAQRFIQTFLG++F+ALHFRRHGFLKFCNAK+PSCF
Sbjct: 350  VMQPGGPIAHQCKTLIEPNRLILLTAQRFIQTFLGRNFIALHFRRHGFLKFCNAKKPSCF 409

Query: 1333 YPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRPTRNAAEK 1512
            +P+PQAA+CI RV+ERA  P+IYLSTDAA+SETGLLQSL+VLNGK+VPLV RP RN+AEK
Sbjct: 410  FPIPQAADCILRVIERADAPIIYLSTDAAESETGLLQSLIVLNGKSVPLVIRPARNSAEK 469

Query: 1513 WDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSASQCDEYL 1692
            WDALLYRH +EGDSQVEAMLDKTICA+SSVFIG+ GSTFT+DILRLRKDWGSAS CDEYL
Sbjct: 470  WDALLYRHHIEGDSQVEAMLDKTICAMSSVFIGAPGSTFTEDILRLRKDWGSASLCDEYL 529

Query: 1693 CQGEHPNFIAEDE 1731
            C GE PN +AE+E
Sbjct: 530  CHGEEPNIVAENE 542


>ref|XP_007024790.1| O-fucosyltransferase family protein isoform 1 [Theobroma cacao]
            gi|508780156|gb|EOY27412.1| O-fucosyltransferase family
            protein isoform 1 [Theobroma cacao]
          Length = 558

 Score =  643 bits (1659), Expect = 0.0
 Identities = 330/569 (57%), Positives = 407/569 (71%), Gaps = 9/569 (1%)
 Frame = +1

Query: 52   LISQNAR---PNDVVKSPDHHNHRNHRSAFQIDD---DFRNRVSGAARKFNKRXXXXXXX 213
            LI QN     P+ +  SP      + RS+F I++     R R       FNKR       
Sbjct: 15   LIHQNDTKNLPHQIPASP--RPSTSPRSSFHIEELESQIRRRFK---LTFNKRYLFAIFL 69

Query: 214  XXXXXXXXXTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWN 393
                     +TD++++F   + S++       +++R+RES+L+ALYLL QQ+  L+ +WN
Sbjct: 70   PLLIIPIYFSTDIRSLFSSNISSLKF----NTVSDRIRESQLQALYLLNQQQNSLLSLWN 125

Query: 394  YTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGS 573
            +T +                           D+K+ +  QI++NK IQ ILLS H++ G+
Sbjct: 126  HTFVNSNNNITA---------------VQFDDIKASLLTQITLNKHIQQILLSPHKT-GN 169

Query: 574  VDSAGNYTDSILSDWT--RCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLE 747
                G   D   + ++  RC+KVD + A+RKT EW PK NK+LFAIC+SGQMSNHLICLE
Sbjct: 170  SPQNGTLLDPNFAGYSFDRCRKVDQKFAERKTFEWKPKPNKFLFAICLSGQMSNHLICLE 229

Query: 748  KHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDK 927
            KHMFFAA+LNR LVIPSS+ DY+++RVLDI+HIN C+G+K V+ FEEF E KKNH HIDK
Sbjct: 230  KHMFFAAVLNRALVIPSSRFDYQYNRVLDIEHINGCIGKKAVIPFEEFMEIKKNHAHIDK 289

Query: 928  FMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWK-EDVKSPKHKTVEDVLAKFSSDSD 1104
            F+CYFS PQPC++D+E               E  WK ED+K P  KT++DV  KF SD D
Sbjct: 290  FICYFSSPQPCYVDEEHLKKLKSLGISTGKLETAWKNEDIKKPSQKTIKDVEEKFGSDDD 349

Query: 1105 VIAIGDVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFR 1284
            VIAIGDVF+ADVER WV+QPGGPIAHKCKTLIEPS+LILLTA+RFIQTFLG +F+ALHFR
Sbjct: 350  VIAIGDVFYADVERDWVLQPGGPIAHKCKTLIEPSKLILLTAERFIQTFLGSNFIALHFR 409

Query: 1285 RHGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNG 1464
            RHGFLKFCNAK+PSCFYP+PQAA+CI R+VERA+TPVIYLSTDAA+SET LLQS+VVLNG
Sbjct: 410  RHGFLKFCNAKKPSCFYPIPQAADCITRMVERANTPVIYLSTDAAESETSLLQSMVVLNG 469

Query: 1465 KTVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDIL 1644
            KT+PLV+RP RN+AEKWDALLYRHGL  D QVEAMLDKTICA+SSVFIG+ GSTFT DIL
Sbjct: 470  KTIPLVKRPPRNSAEKWDALLYRHGLAEDPQVEAMLDKTICAMSSVFIGAPGSTFTGDIL 529

Query: 1645 RLRKDWGSASQCDEYLCQGEHPNFIAEDE 1731
            RLRKDWG+AS CDEYLCQGE PNF A +E
Sbjct: 530  RLRKDWGTASLCDEYLCQGEDPNFTAGEE 558


>ref|XP_003547949.1| PREDICTED: uncharacterized protein LOC548046 [Glycine max]
          Length = 543

 Score =  643 bits (1659), Expect = 0.0
 Identities = 317/476 (66%), Positives = 375/476 (78%), Gaps = 2/476 (0%)
 Frame = +1

Query: 310  LANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXXXXXXXXXXXXXXXXXXAMLQD 489
            L +RM+ESELRA+ LL QQ+  L+  WN+T                          +L+D
Sbjct: 85   LTDRMKESELRAINLLNQQQQALLTAWNHTLRTNASDPN-----------------LLED 127

Query: 490  LKSRVFGQISMNKQIQGILLSAHESEGSVDSAGNYTDSILSD--WTRCKKVDPRLADRKT 663
            LKS +F QIS+N++IQ ILL+ H +  +        ++ L+   + RC+ VD  L+ RKT
Sbjct: 128  LKSSIFKQISLNREIQQILLNPHSTGNNAIEPEFDLNATLNGVVYDRCRTVDQNLSQRKT 187

Query: 664  IEWNPKSNKYLFAICVSGQMSNHLICLEKHMFFAALLNRVLVIPSSKVDYEFHRVLDIDH 843
            IEWNP+  K+L AICVSGQMSNHLICLEKH+FFAALLNRVLVIPSSKVDY++ RV+DIDH
Sbjct: 188  IEWNPRDGKFLLAICVSGQMSNHLICLEKHIFFAALLNRVLVIPSSKVDYQYDRVVDIDH 247

Query: 844  INKCLGRKVVVTFEEFAESKKNHLHIDKFMCYFSMPQPCFMDDERXXXXXXXXXXXXXXE 1023
            INKCLG+KVVV+FE F+  KK HLHIDKF+CYFS PQPC++DDER               
Sbjct: 248  INKCLGKKVVVSFEVFSNLKKGHLHIDKFLCYFSQPQPCYLDDERLKKLGALGLTMSKPV 307

Query: 1024 AVWKEDVKSPKHKTVEDVLAKFSSDSDVIAIGDVFFADVERQWVMQPGGPIAHKCKTLIE 1203
            AVW ED ++PK KTV+DVL KFS D DV+AIGDVF+A+VER+WVMQPGGPIAHKC TLIE
Sbjct: 308  AVWDEDTRNPKKKTVQDVLGKFSFDDDVMAIGDVFYAEVEREWVMQPGGPIAHKCTTLIE 367

Query: 1204 PSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCNAKQPSCFYPVPQAAECINRVVERA 1383
            P+RLILLTAQRFIQTFLG++FVALHFRRHGFLKFCNAK+PSCFY + QAA+CI RVVERA
Sbjct: 368  PNRLILLTAQRFIQTFLGRNFVALHFRRHGFLKFCNAKKPSCFYSITQAADCILRVVERA 427

Query: 1384 STPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRPTRNAAEKWDALLYRHGLEGDSQVE 1563
              P+IYLSTDAA+SETGLLQSLVVLNG+ VPLV RP RN+AEKWDALLYRH ++GDSQVE
Sbjct: 428  DAPIIYLSTDAAESETGLLQSLVVLNGRPVPLVIRPARNSAEKWDALLYRHRMDGDSQVE 487

Query: 1564 AMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSASQCDEYLCQGEHPNFIAEDE 1731
            AMLDK+ICA+SSVFIG+ GSTFT+DILRLRKDWGSAS CDEYLCQGE PN +AE+E
Sbjct: 488  AMLDKSICAMSSVFIGAPGSTFTEDILRLRKDWGSASMCDEYLCQGEEPNIVAENE 543


>ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208722 [Cucumis sativus]
            gi|449517914|ref|XP_004165989.1| PREDICTED:
            uncharacterized protein LOC101230373 [Cucumis sativus]
          Length = 573

 Score =  641 bits (1654), Expect = 0.0
 Identities = 326/554 (58%), Positives = 402/554 (72%), Gaps = 10/554 (1%)
 Frame = +1

Query: 100  HHNHRNHRSAFQIDDD--FRNRV-----SGAARKFNKRXXXXXXXXXXXXXXXX--TTDL 252
            H +   H + F IDDD  FR  +     S     F+KR                  + D+
Sbjct: 26   HPSPPTHSTTFDIDDDPHFRPPIPRFPFSIPKFAFDKRYYYLLAAALPLCILVLFFSVDI 85

Query: 253  KNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXXXX 432
             ++F   + S  +   +  L +RMRESEL ALYLL+QQ++    +WN++  ++       
Sbjct: 86   TSLFSTTLSSTLKTSDS--LTDRMRESELTALYLLRQQQLGFFHLWNHSLFLQSNSSFNS 143

Query: 433  XXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSAGNYTDSILS 612
                          A+ + +KS +  QI++NK+IQ +LLS H S    +  G+       
Sbjct: 144  TPSNNLSSNS----ALTEYIKSALLKQITLNKEIQNVLLSPHRSGNLSEEVGDALPMDTF 199

Query: 613  DWTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLEKHMFFAALLNRVLVI 792
               RC+K+D +L+DR+TIEW PKSNK+LFAIC SGQMSNHLICLEKHMFFAA+LNRVLVI
Sbjct: 200  ALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMFFAAILNRVLVI 259

Query: 793  PSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDKFMCYFSMPQPCFMDD 972
            PS KVDY+F RV+DID +N CLGRKVV++FEEF+E KK+HLHID+F+CYFS P PC++DD
Sbjct: 260  PSHKVDYQFSRVIDIDRMNMCLGRKVVISFEEFSEIKKHHLHIDRFICYFSKPNPCYVDD 319

Query: 973  ERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDSD-VIAIGDVFFADVERQ 1149
            E               E+ W ED K P  KTV DV +KFSS++D VIA+GD+FFA+VE++
Sbjct: 320  EHISKLKNLGISMGKLESAWNEDTKHPNRKTVSDVESKFSSNNDDVIAVGDIFFANVEQE 379

Query: 1150 WVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCNAKQPSC 1329
            WV QPGGPIAHKC+TLIEPS LI LTAQRFIQTFLGK+++ALHFRRHGFLKFCNAKQPSC
Sbjct: 380  WVNQPGGPIAHKCQTLIEPSHLIKLTAQRFIQTFLGKNYIALHFRRHGFLKFCNAKQPSC 439

Query: 1330 FYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRPTRNAAE 1509
            FYP+PQAA+CI R+VERA+ PVIYLSTDAA+SE GLLQSL+VLNGK +PLV+RP RN+AE
Sbjct: 440  FYPIPQAADCIIRMVERANVPVIYLSTDAAESEHGLLQSLLVLNGKPIPLVKRPPRNSAE 499

Query: 1510 KWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSASQCDEY 1689
            KWDALLYRHGLE DSQVEAMLDKTICA+SS FIG+ GSTFT+DILRLRKDWG+AS CDEY
Sbjct: 500  KWDALLYRHGLEEDSQVEAMLDKTICAMSSTFIGAPGSTFTEDILRLRKDWGTASMCDEY 559

Query: 1690 LCQGEHPNFIAEDE 1731
            LCQGE PNFI+E+E
Sbjct: 560  LCQGEEPNFISENE 573


>ref|XP_002533327.1| conserved hypothetical protein [Ricinus communis]
            gi|223526849|gb|EEF29063.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 565

 Score =  640 bits (1652), Expect = 0.0
 Identities = 331/568 (58%), Positives = 412/568 (72%), Gaps = 7/568 (1%)
 Frame = +1

Query: 49   NLISQNARP--NDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARK--FNKRXXXXXXXX 216
            NLI QN R   N     P    HR   S F I++       G  R+  FNKR        
Sbjct: 14   NLIEQNDRKHHNHQQTVPTSSPHRRSFSTFHIEE-----YGGVIRRRLFNKRYYYYLLAI 68

Query: 217  XXXXXXXX---TTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKM 387
                       + DL+++F   + S+         ++RMRE+EL+ALYLL+QQ++ L+ +
Sbjct: 69   FLPLLIIIVYFSADLRSLFSANISSLNF----NSASDRMREAELQALYLLEQQQLSLLSI 124

Query: 388  WNYTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESE 567
            +N     +                       +++ +S +  Q++ NKQIQ ILLS H+S 
Sbjct: 125  FN-----QSFPSRNKNFSSNSSFINSFDNVKIENFRSALLKQMTFNKQIQQILLSPHKS- 178

Query: 568  GSVDSAGNYTDSILSDWTRCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLE 747
            G+ + +G+++ S    + RCKKV+ R  DRKTIEW P+S+K+LF IC+SGQMSNHLICLE
Sbjct: 179  GNENVSGSFSGSGFG-FDRCKKVESRFLDRKTIEWKPRSDKFLFPICLSGQMSNHLICLE 237

Query: 748  KHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDK 927
            KHMFFAALLNRVLV+PSSK DY+++RVLDI+HIN C+GRKVVVTFEEF + +KNH+HID+
Sbjct: 238  KHMFFAALLNRVLVMPSSKFDYQYNRVLDIEHINLCVGRKVVVTFEEFVQMRKNHVHIDR 297

Query: 928  FMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKSPKHKTVEDVLAKFSSDSDV 1107
            F+CYFS P  C++D+E               E+ WKEDVK P  KTV+DVLAKF+S+ DV
Sbjct: 298  FICYFSSPTACYVDEEHVKKLKGLGILMGKPESPWKEDVKKPSQKTVQDVLAKFTSNDDV 357

Query: 1108 IAIGDVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRR 1287
            IAIGDVF+AD+E+ WVMQPGGP+AHKCKTLIEPSRLIL+TAQRFIQTFLGK+F+ALHFRR
Sbjct: 358  IAIGDVFYADMEQDWVMQPGGPLAHKCKTLIEPSRLILVTAQRFIQTFLGKNFIALHFRR 417

Query: 1288 HGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGK 1467
            HGFLKFCNAK PSCFYP+PQAA+CI RV ERA+ PVIYLSTDAA+SET LLQSL+++NGK
Sbjct: 418  HGFLKFCNAKNPSCFYPIPQAADCIARVAERANAPVIYLSTDAAESETDLLQSLIIVNGK 477

Query: 1468 TVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILR 1647
            TVPLV+RP+  + EKWD+LL RHG+E DSQVEAMLDKTI A+S+VFIG+SGSTFT+DILR
Sbjct: 478  TVPLVKRPSHTSVEKWDSLLSRHGIEDDSQVEAMLDKTISAMSNVFIGASGSTFTEDILR 537

Query: 1648 LRKDWGSASQCDEYLCQGEHPNFIAEDE 1731
            LRKDW SAS CDEYLCQGE PNFIAEDE
Sbjct: 538  LRKDWESASLCDEYLCQGELPNFIAEDE 565


>ref|XP_007024791.1| O-fucosyltransferase family protein isoform 2 [Theobroma cacao]
            gi|508780157|gb|EOY27413.1| O-fucosyltransferase family
            protein isoform 2 [Theobroma cacao]
          Length = 559

 Score =  639 bits (1647), Expect = e-180
 Identities = 330/570 (57%), Positives = 407/570 (71%), Gaps = 10/570 (1%)
 Frame = +1

Query: 52   LISQNAR---PNDVVKSPDHHNHRNHRSAFQIDD---DFRNRVSGAARKFNKRXXXXXXX 213
            LI QN     P+ +  SP      + RS+F I++     R R       FNKR       
Sbjct: 15   LIHQNDTKNLPHQIPASP--RPSTSPRSSFHIEELESQIRRRFK---LTFNKRYLFAIFL 69

Query: 214  XXXXXXXXXTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWN 393
                     +TD++++F   + S++       +++R+RES+L+ALYLL QQ+  L+ +WN
Sbjct: 70   PLLIIPIYFSTDIRSLFSSNISSLKF----NTVSDRIRESQLQALYLLNQQQNSLLSLWN 125

Query: 394  YTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGS 573
            +T +                           D+K+ +  QI++NK IQ ILLS H++ G+
Sbjct: 126  HTFVNSNNNITA---------------VQFDDIKASLLTQITLNKHIQQILLSPHKT-GN 169

Query: 574  VDSAGNYTDSILSDWT--RCKKVDPRLADRKTIEWNPKSNKYLFAICVSGQMSNHLICLE 747
                G   D   + ++  RC+KVD + A+RKT EW PK NK+LFAIC+SGQMSNHLICLE
Sbjct: 170  SPQNGTLLDPNFAGYSFDRCRKVDQKFAERKTFEWKPKPNKFLFAICLSGQMSNHLICLE 229

Query: 748  KHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAESKKNHLHIDK 927
            KHMFFAA+LNR LVIPSS+ DY+++RVLDI+HIN C+G+K V+ FEEF E KKNH HIDK
Sbjct: 230  KHMFFAAVLNRALVIPSSRFDYQYNRVLDIEHINGCIGKKAVIPFEEFMEIKKNHAHIDK 289

Query: 928  FMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWK-EDVKSPKHKTVEDVLAKFSSDSD 1104
            F+CYFS PQPC++D+E               E  WK ED+K P  KT++DV  KF SD D
Sbjct: 290  FICYFSSPQPCYVDEEHLKKLKSLGISTGKLETAWKNEDIKKPSQKTIKDVEEKFGSDDD 349

Query: 1105 VIAIGDVFFADVERQWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFR 1284
            VIAIGDVF+ADVER WV+QPGGPIAHKCKTLIEPS+LILLTA+RFIQTFLG +F+ALHFR
Sbjct: 350  VIAIGDVFYADVERDWVLQPGGPIAHKCKTLIEPSKLILLTAERFIQTFLGSNFIALHFR 409

Query: 1285 RHGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNG 1464
            RHGFLKFCNAK+PSCFYP+PQAA+CI R+VERA+TPVIYLSTDAA+SET LLQS+VVLNG
Sbjct: 410  RHGFLKFCNAKKPSCFYPIPQAADCITRMVERANTPVIYLSTDAAESETSLLQSMVVLNG 469

Query: 1465 KTVPLVQRPTRNAAEKWDALLYRHGLEGDSQ-VEAMLDKTICALSSVFIGSSGSTFTDDI 1641
            KT+PLV+RP RN+AEKWDALLYRHGL  D Q VEAMLDKTICA+SSVFIG+ GSTFT DI
Sbjct: 470  KTIPLVKRPPRNSAEKWDALLYRHGLAEDPQVVEAMLDKTICAMSSVFIGAPGSTFTGDI 529

Query: 1642 LRLRKDWGSASQCDEYLCQGEHPNFIAEDE 1731
            LRLRKDWG+AS CDEYLCQGE PNF A +E
Sbjct: 530  LRLRKDWGTASLCDEYLCQGEDPNFTAGEE 559

