BLASTX nr result

ID: Achyranthes23_contig00014344 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00014344
         (2224 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006342883.1| PREDICTED: uncharacterized protein LOC102584...   706   0.0  
ref|XP_004235519.1| PREDICTED: uncharacterized protein LOC101268...   700   0.0  
gb|EXB64649.1| hypothetical protein L484_017982 [Morus notabilis]     692   0.0  
ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254...   690   0.0  
ref|NP_199853.1| O-fucosyltransferase family protein [Arabidopsi...   672   0.0  
ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arab...   672   0.0  
gb|AAM66093.1| unknown [Arabidopsis thaliana]                         671   0.0  
ref|XP_006307109.1| hypothetical protein CARUB_v10008696mg [Caps...   671   0.0  
ref|XP_002303337.1| protein-O-fucosyltransferase 2 [Populus tric...   667   0.0  
ref|XP_004303772.1| PREDICTED: uncharacterized protein LOC101299...   667   0.0  
ref|XP_002892943.1| hypothetical protein ARALYDRAFT_889130 [Arab...   666   0.0  
ref|XP_006280247.1| hypothetical protein CARUB_v10026161mg [Caps...   665   0.0  
ref|XP_002533327.1| conserved hypothetical protein [Ricinus comm...   663   0.0  
gb|EOY27412.1| O-fucosyltransferase family protein isoform 1 [Th...   659   0.0  
ref|NP_173170.2| O-fucosyltransferase family protein [Arabidopsi...   658   0.0  
ref|XP_006416723.1| hypothetical protein EUTSA_v10007186mg [Eutr...   657   0.0  
ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208...   656   0.0  
gb|EOY27413.1| O-fucosyltransferase family protein isoform 2 [Th...   655   0.0  
ref|XP_006465793.1| PREDICTED: uncharacterized protein LOC102617...   653   0.0  
ref|XP_006357428.1| PREDICTED: uncharacterized protein LOC102602...   653   0.0  

>ref|XP_006342883.1| PREDICTED: uncharacterized protein LOC102584575 [Solanum tuberosum]
          Length = 568

 Score =  706 bits (1822), Expect = 0.0
 Identities = 357/571 (62%), Positives = 441/571 (77%), Gaps = 3/571 (0%)
 Frame = +3

Query: 321  TSSDDDEEDCRSLIHQNDTVKPTSNSHSFSPFDIDNNGFGASLRRRFKMNK-KRYLLTIL 497
            T S D+E+D  +LIHQN+ V   S S   S F I++     +L RRF     KRYLL I+
Sbjct: 5    TESSDEEDDRENLIHQNERVNDLSKSPRRSTFQIEDVKDRFALCRRFNFTSGKRYLLAII 64

Query: 498  IPLAIVFLFFTLGFHG--NSRVWDIKLLQSPSDRMRESELKALYLLKEQQLALITLLNRT 671
            +P+ ++ L+F         + V  IK   S +  MR+SEL+ALYLL++QQL L  L N T
Sbjct: 65   LPVLVLVLYFATDIKSLFQTTVTTIKYDGSVNS-MRDSELRALYLLRQQQLGLFKLWNHT 123

Query: 672  LSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLNKEIQKVLLSTH 851
            L    +              G+ ++S      ++ EDLK+ +   I LNK+IQ+VLLS+H
Sbjct: 124  LVNDTST--THTGSSLESTPGFASVSR----SSIVEDLKADLLRQISLNKQIQQVLLSSH 177

Query: 852  KNVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFAICLSGQMSNHLI 1031
            +  N+      + D    G+S C KVD  L++R+T+EWKPRS+KYLFAIC+SGQMSNHLI
Sbjct: 178  QLGNSLITSDNSTDPTLGGLSRCRKVDHNLSQRRTVEWKPRSNKYLFAICVSGQMSNHLI 237

Query: 1032 CLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYEEFVEAKKKDLR 1211
            CLEKHMFFAALL+R+LVIPS KVDY+F RVLD++HINKCLGR+V+VTY+EF E +K  L 
Sbjct: 238  CLEKHMFFAALLNRILVIPSSKVDYEFRRVLDVDHINKCLGREVIVTYDEFAERRKSHLH 297

Query: 1212 IDRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWEEDVKKPKIRTVDDVKAKFSSD 1391
            ID+F+CYFSQPQ C++DE+ VKKLKSLG+S+ KLE+ W EDVK PK RTV D+ AKFS+D
Sbjct: 298  IDKFLCYFSQPQPCFLDEERVKKLKSLGISMNKLEAAWNEDVKNPKKRTVQDIMAKFSTD 357

Query: 1392 EGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRFVQTFLGKDFVALH 1571
            + V+AIGDVFFADVE D VMQPGGPI+HKCKTLIEPSRLIM+TAQRF+QTFLG +F+ALH
Sbjct: 358  DDVLAIGDVFFADVEKDWVMQPGGPISHKCKTLIEPSRLIMLTAQRFIQTFLGDNFIALH 417

Query: 1572 FRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAAGSETGLLQSLIVI 1751
            FRRHG+LKFCNAK PSCF+ +PQAA CI RV+ RAN+PVIYLSTDAA SETGLLQSL+V+
Sbjct: 418  FRRHGFLKFCNAKKPSCFYPVPQAADCINRVLERANSPVIYLSTDAAESETGLLQSLVVV 477

Query: 1752 DGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSSGSTFTED 1931
            +GK VPLVQRPARN+AEKWDALLYRHGLEGD QV+AMLDKTICAMSSVFIGSSGSTFT+D
Sbjct: 478  NGKTVPLVQRPARNSAEKWDALLYRHGLEGDPQVDAMLDKTICAMSSVFIGSSGSTFTDD 537

Query: 1932 ILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            ILRLRK WG+ S+CDEY+CQGE+PN +A++E
Sbjct: 538  ILRLRKDWGSASLCDEYLCQGELPNYVADDE 568


>ref|XP_004235519.1| PREDICTED: uncharacterized protein LOC101268664 [Solanum
            lycopersicum]
          Length = 565

 Score =  700 bits (1807), Expect = 0.0
 Identities = 362/571 (63%), Positives = 436/571 (76%), Gaps = 5/571 (0%)
 Frame = +3

Query: 327  SDDDEEDCRSLIHQNDTVKPTSNSHSFSPFDIDNNGFGASLRRRFKMNK-KRYLLTILIP 503
            S D+E+D  +LIHQN+ V   S S   S F I++     +L RRF     K YLL I++P
Sbjct: 7    SSDEEDDRENLIHQNERVNHLSKSPRPSTFQIEDVKDRFALCRRFNFTSGKTYLLAIILP 66

Query: 504  LAIVFLFFTLGFHG--NSRVWDIKLLQSPSDRMRESELKALYLLKEQQLALITLLNRTL- 674
            L ++ L+F         + V  IK   S +  MRESEL+ALYLLK+QQL L  L N TL 
Sbjct: 67   LLVLILYFATDIKALFQTTVTTIKYDGSVNS-MRESELRALYLLKQQQLGLFKLWNHTLV 125

Query: 675  -STSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLNKEIQKVLLSTH 851
              TS    L           G+  +S      ++ EDLK  +   I LNK+IQ+VLLS+H
Sbjct: 126  NDTSTTHSLESAP-------GFTLVSR----SSIVEDLKDDLLRQISLNKQIQQVLLSSH 174

Query: 852  KNVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFAICLSGQMSNHLI 1031
            +  N+      + D    G+  C KVD  L+ER+T+EWKPRS+KYLFAIC+SGQMSNHLI
Sbjct: 175  QLGNSLITSDNSTDPSLGGLGRCRKVDHNLSERRTVEWKPRSNKYLFAICVSGQMSNHLI 234

Query: 1032 CLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYEEFVEAKKKDLR 1211
            CLEKHMFFAALL+RVLVIPS KVDY+F RVLD++HINKCLGR+V+VTY+EF E +K  L 
Sbjct: 235  CLEKHMFFAALLNRVLVIPSSKVDYEFRRVLDVDHINKCLGREVIVTYDEFAERRKSHLH 294

Query: 1212 IDRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWEEDVKKPKIRTVDDVKAKFSSD 1391
            ID+F+CYFSQPQ C++DE+ VKKLKSLG+S+ KLE+ W+EDVK PK RT  D+ AKFS D
Sbjct: 295  IDKFLCYFSQPQPCFLDEERVKKLKSLGISMNKLEAAWDEDVKNPKKRTAQDIVAKFSMD 354

Query: 1392 EGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRFVQTFLGKDFVALH 1571
            + V+AIGDVFFADVE D VMQPGGPI+HKCKTLIEPSRLIM+TAQRFVQTFLG +F+ALH
Sbjct: 355  DDVLAIGDVFFADVEKDWVMQPGGPISHKCKTLIEPSRLIMLTAQRFVQTFLGDNFIALH 414

Query: 1572 FRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAAGSETGLLQSLIVI 1751
            FRRHG+LKFCNAK PSCF+ +PQAA CI RV+ RAN+PV+YLSTDAA SETGLLQSL+V 
Sbjct: 415  FRRHGFLKFCNAKKPSCFYPVPQAADCINRVLERANSPVMYLSTDAAESETGLLQSLVVF 474

Query: 1752 DGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSSGSTFTED 1931
            +GK VPLVQRPARN+AEKWDALLYRHGLEGD QVEAMLDKTICAMSSVFIGSSGSTFT+D
Sbjct: 475  NGKTVPLVQRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTICAMSSVFIGSSGSTFTDD 534

Query: 1932 ILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            ILRLRK WG+ S+CDEY+CQGE+PN +A++E
Sbjct: 535  ILRLRKDWGSASLCDEYLCQGELPNFVADDE 565


>gb|EXB64649.1| hypothetical protein L484_017982 [Morus notabilis]
          Length = 578

 Score =  692 bits (1786), Expect = 0.0
 Identities = 356/576 (61%), Positives = 440/576 (76%), Gaps = 9/576 (1%)
 Frame = +3

Query: 324  SSDDDEEDCRSLIHQNDTVKPTSNSHSFSPFDID--NNGFGASLRRRFK---MNKKRYLL 488
            SS D+++D  +LI QN+         +F   D+D  N  F + +RRR     +  K+++ 
Sbjct: 6    SSSDEDDDRENLIEQNERKLQNHPRSTFHIDDVDGGNREFRSRIRRRLSSLGLLNKKFMF 65

Query: 489  TILIPLAIVFLFFTLGFHG--NSRVWDIKLLQSPSDRMRESELKALYLLKEQQLALITLL 662
             I +PL IV LF +    G  ++ +  ++   S SDR+RESEL+AL+LL++QQL L  L 
Sbjct: 66   AIFLPLFIVVLFLSTDVRGLFSADLSGVRF-DSFSDRLRESELRALFLLRQQQLGLFALW 124

Query: 663  NRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEK-TVFEDLKSRIFDGIMLNKEIQKVL 839
            N+T   S  I               +N S    E+ +V +DLK  +   + LNKEIQ+VL
Sbjct: 125  NQTFHDSPPIS--SNSTNNSSSSSSINSSASGTEQNSVIDDLKFAVLRQLSLNKEIQQVL 182

Query: 840  LSTHKNVNNSEL-GVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFAICLSGQM 1016
            LS H++ N+S +   G+ ++       C KVDQK ++R+TIEWKP S+K+LFAICLSGQM
Sbjct: 183  LSPHRSGNSSSITDAGDPNLGGSDFDTCRKVDQKFSQRRTIEWKPNSNKFLFAICLSGQM 242

Query: 1017 SNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYEEFVEAK 1196
            SN LICLEKHMFFAALL+RVLVIPS KVDY ++RVLDI+HINKCLGRKVV+++E+F E K
Sbjct: 243  SNRLICLEKHMFFAALLNRVLVIPSSKVDYQYNRVLDIDHINKCLGRKVVISFEDFAETK 302

Query: 1197 KKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWEEDVKKPKIRTVDDVKA 1376
            K  + I+RFICYFSQPQ CYVD++H+KKLK LGL++ KLES W ED+K P  RTV DV++
Sbjct: 303  KNHMHINRFICYFSQPQPCYVDDEHIKKLKGLGLTMGKLESAWTEDIKGPNKRTVQDVQS 362

Query: 1377 KFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRFVQTFLGKD 1556
            KFS+++ VIAIGDVF+ADVE + VMQPGGP+AHKC+TLIEPSRLIM+TAQRF+QTFLGK+
Sbjct: 363  KFSTNDDVIAIGDVFYADVEQEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRFIQTFLGKN 422

Query: 1557 FVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAAGSETGLLQ 1736
            FVALHFRRHG+LKFCNAK PSCFF IPQAA CI  VV RAN PVIYLSTDAA SETGLLQ
Sbjct: 423  FVALHFRRHGFLKFCNAKQPSCFFPIPQAADCITSVVERANAPVIYLSTDAAESETGLLQ 482

Query: 1737 SLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSSGS 1916
            SLIV++GKPVPLV+RPARN+AEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIG+ GS
Sbjct: 483  SLIVLNGKPVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGAPGS 542

Query: 1917 TFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            TFTEDILRLRK WG+ S CD+Y+CQGE PN +A+NE
Sbjct: 543  TFTEDILRLRKDWGSASSCDKYLCQGEEPNFVADNE 578


>ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254979 isoform 1 [Vitis
            vinifera]
          Length = 559

 Score =  690 bits (1780), Expect = 0.0
 Identities = 365/570 (64%), Positives = 428/570 (75%), Gaps = 4/570 (0%)
 Frame = +3

Query: 327  SDDDEEDCRSLIHQNDTVKPTSNSHSFSPFDIDNNGFGASLRRRFKMNKKRYLLTILIPL 506
            S DDEED ++LI +N+   P  +      F    +        RF  NK RYL  I  PL
Sbjct: 5    SSDDEEDRQNLIDENERKLPHRSGFQIEDFKSRLSA------HRFSFNK-RYLFAIFPPL 57

Query: 507  AIVFLFFTLGFHGNSRVWDIKLLQ--SPSDRMRESELKALYLLKEQQLALITLLNRTLST 680
             I+ ++FT     N     I +++  SP+DRMRESEL+ALYLL++QQL+L +L N T   
Sbjct: 58   FILLIYFTTDVR-NLFTTSISIVKADSPTDRMRESELRALYLLRQQQLSLFSLWNHTAFA 116

Query: 681  SLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLNKEIQKVLLSTHKNV 860
              A                L+ S   V  +   D KS +   I LNKEIQ+VLLS+H + 
Sbjct: 117  DSA------PIPSNSSNSTLDFSTRQVLLSS-ADFKSALLKQISLNKEIQQVLLSSHPSG 169

Query: 861  NNSELGVGNDDMVEEGISI--CGKVDQKLAERKTIEWKPRSDKYLFAICLSGQMSNHLIC 1034
            N SEL   N D+     S   C KV+Q +++R TIEWKPRSDKYLFAICLSGQMSNHLIC
Sbjct: 170  NLSELVDDNGDLNFGAYSFNRCPKVNQNMSQRPTIEWKPRSDKYLFAICLSGQMSNHLIC 229

Query: 1035 LEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYEEFVEAKKKDLRI 1214
            LEKHMFFAALL+R+LVIPS K DY ++RVLDIEHIN CLGRKVVVT+EEF E+KK  L I
Sbjct: 230  LEKHMFFAALLNRILVIPSSKFDYQYNRVLDIEHINNCLGRKVVVTFEEFTESKKNHLHI 289

Query: 1215 DRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWEEDVKKPKIRTVDDVKAKFSSDE 1394
            DR ICYFS P  CYVD+DHVKKLKSLG+S+ KLE  W ED+KKPK RT  DV+AKFSS++
Sbjct: 290  DRVICYFSLPLPCYVDDDHVKKLKSLGISMGKLEPAWAEDIKKPKKRTAQDVQAKFSSND 349

Query: 1395 GVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRFVQTFLGKDFVALHF 1574
             VIAIGDVF+A+VE + VMQPGGP+AHKC+TLIEPSRLIM+TAQRFVQTFLGK F ALHF
Sbjct: 350  DVIAIGDVFYANVEEEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRFVQTFLGKSFTALHF 409

Query: 1575 RRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAAGSETGLLQSLIVID 1754
            RRHG+LKFCNAK PSCFF IPQAA CI RVV RA+TPVIYLSTDAA SETGLLQSL+V++
Sbjct: 410  RRHGFLKFCNAKEPSCFFPIPQAADCISRVVERADTPVIYLSTDAAESETGLLQSLVVLN 469

Query: 1755 GKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSSGSTFTEDI 1934
            GK VPL++RP RN+AEKWDALLYRHGL+GDSQVEAMLDKTICAM+SVFIG+ GSTFTEDI
Sbjct: 470  GKLVPLIKRPTRNSAEKWDALLYRHGLDGDSQVEAMLDKTICAMASVFIGAPGSTFTEDI 529

Query: 1935 LRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            LRLR+GWG+ S CDEY+CQGE PN IA+NE
Sbjct: 530  LRLRRGWGSASHCDEYLCQGEQPNFIADNE 559


>ref|NP_199853.1| O-fucosyltransferase family protein [Arabidopsis thaliana]
            gi|9758924|dbj|BAB09461.1| unnamed protein product
            [Arabidopsis thaliana] gi|133778858|gb|ABO38769.1|
            At5g50420 [Arabidopsis thaliana]
            gi|332008558|gb|AED95941.1| O-fucosyltransferase family
            protein [Arabidopsis thaliana]
          Length = 566

 Score =  672 bits (1734), Expect = 0.0
 Identities = 359/585 (61%), Positives = 431/585 (73%), Gaps = 18/585 (3%)
 Frame = +3

Query: 324  SSDDDEEDCRSLIHQNDTV-----------KPTSNSHSFSPFDIDNNGFGASLRRRFKMN 470
            +S DDEED + LI QNDT              T   +  S F ID+       R +  +N
Sbjct: 4    NSSDDEEDHQHLIPQNDTRIRHREDSVSSNATTIGGNQRSAFQIDDILHRVQHRGKISLN 63

Query: 471  KKRYLLTILIPLAIVFLFFTLG----FHGNSRVWDIKLLQSPSDRMRESELKALYLLKEQ 638
            K+  ++ + + ++I  LF        F  N   + +  L   S+R++ESEL+ALYLL++Q
Sbjct: 64   KRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKLDPL---SNRVKESELRALYLLRQQ 120

Query: 639  QLALITLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTV-FEDLKSRIFDGIML 815
            QLAL++L N TL                     LN SE  +  +V FED+KS +   I L
Sbjct: 121  QLALLSLWNGTLVNPS-----------------LNQSENALGSSVLFEDVKSAVSKQISL 163

Query: 816  NKEIQKVLLSTHKNVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFA 995
            NKEIQ+VLLS H++ N S  G  + D V    + C KVDQKL++RKT+EWKPRSDK+LFA
Sbjct: 164  NKEIQEVLLSPHRSSNYS--GGTDVDSVNFSYNRCRKVDQKLSDRKTVEWKPRSDKFLFA 221

Query: 996  ICLSGQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTY 1175
            ICLSGQMSNHLICLEKHMFFAALLDRVLVIPS K DY + RV+DIE IN CLGR VVV +
Sbjct: 222  ICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGRNVVVAF 281

Query: 1176 EEFVE-AKKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVP-KLESPWEEDVKKPK 1349
            ++F E AKK   RIDRFICYFS PQ CYVDE+H+KKLK LG+S+  KLE+PW ED+KKP 
Sbjct: 282  DQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKKPS 341

Query: 1350 IRTVDDVKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQR 1529
             RTV DV+ KF SD+ VIAIGDVF+AD+E D VMQPGGPI HKCKTLIEPS+LI++TAQR
Sbjct: 342  KRTVQDVQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQR 401

Query: 1530 FVQTFLGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDA 1709
            F+QTFLGK+F+ALHFRRHG+LKFCNAK+PSCF+ IPQAA CI R+V R+N  VIYLSTDA
Sbjct: 402  FIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDA 461

Query: 1710 AGSETGLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMS 1889
            A SET LLQSL+V+DGK VPLV+RP RN+AEKWDALLYRHG+E DSQV+AMLDKTICAMS
Sbjct: 462  AESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMS 521

Query: 1890 SVFIGSSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            SVFIG+SGSTFTEDILRLRK WGT S CDEY+C+GE PN IAE+E
Sbjct: 522  SVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566


>ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arabidopsis lyrata subsp.
            lyrata] gi|297311638|gb|EFH42062.1| hypothetical protein
            ARALYDRAFT_918074 [Arabidopsis lyrata subsp. lyrata]
          Length = 566

 Score =  672 bits (1734), Expect = 0.0
 Identities = 358/585 (61%), Positives = 430/585 (73%), Gaps = 18/585 (3%)
 Frame = +3

Query: 324  SSDDDEEDCRSLIHQNDT-----------VKPTSNSHSFSPFDIDNNGFGASLRRRFKMN 470
            +S DDEED + LI QNDT              T+  +  S F I++       R +  +N
Sbjct: 4    NSSDDEEDHQHLIPQNDTRIRHREDPISSTATTTGGNQRSAFQIEDILQRVQRRWKISLN 63

Query: 471  KKRYLLTILIPLAIVFLFFTLG----FHGNSRVWDIKLLQSPSDRMRESELKALYLLKEQ 638
            K+  ++ + + ++I  LF        F  N   + +  L   S+R++ESEL+ALYLL++Q
Sbjct: 64   KRYVIVFVSLIISIGLLFLLTDPRELFSANFSSFKLDPL---SNRVKESELRALYLLRQQ 120

Query: 639  QLALITLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTV-FEDLKSRIFDGIML 815
            QLAL++L N TL                     LN SE  +  +V FED+KS +   I L
Sbjct: 121  QLALLSLWNGTLVNPS-----------------LNQSENDLRSSVLFEDVKSAVSKQISL 163

Query: 816  NKEIQKVLLSTHKNVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFA 995
            NKEIQ VLLS H++ N S  G    D V      C KVDQKL++RKT+EWKPRSDK+LFA
Sbjct: 164  NKEIQNVLLSPHRSSNYS--GGTEVDSVNFSYDRCRKVDQKLSDRKTVEWKPRSDKFLFA 221

Query: 996  ICLSGQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTY 1175
            ICLSGQMSNHLICLEKHMFFAALLDRVLVIPS K DY + RV+DIE IN CLGR VVV++
Sbjct: 222  ICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIEGINTCLGRNVVVSF 281

Query: 1176 EEFVE-AKKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVP-KLESPWEEDVKKPK 1349
            ++F E AKK   RIDRFICYFS PQ CYVDE+H+KKLK LG+S+  KLE+PW ED+KKP 
Sbjct: 282  DQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKKPS 341

Query: 1350 IRTVDDVKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQR 1529
             RTV DV+ KF SD+ VIAIGDVF+AD+E D VMQPGGPI HKCKTLIEPS+LI++TAQR
Sbjct: 342  KRTVQDVQTKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQR 401

Query: 1530 FVQTFLGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDA 1709
            F+QTFLGK+F+ALHFRRHG+LKFCNAK+PSCF+ IPQAA CI R+V R+N  VIYLSTDA
Sbjct: 402  FIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDA 461

Query: 1710 AGSETGLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMS 1889
            A SET LLQSL+V+DGK VPLV+RP RN+AEKWDALLYRHG+E DSQV+AMLDKTICAMS
Sbjct: 462  AESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMS 521

Query: 1890 SVFIGSSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            SVFIG+SGSTFTEDILRLRK WGT S CDEY+C+GE PN IAE+E
Sbjct: 522  SVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566


>gb|AAM66093.1| unknown [Arabidopsis thaliana]
          Length = 566

 Score =  671 bits (1732), Expect = 0.0
 Identities = 358/585 (61%), Positives = 431/585 (73%), Gaps = 18/585 (3%)
 Frame = +3

Query: 324  SSDDDEEDCRSLIHQNDTV-----------KPTSNSHSFSPFDIDNNGFGASLRRRFKMN 470
            +S DDEED + LI QNDT              T   +  S F ID+       R +  +N
Sbjct: 4    NSSDDEEDHQHLIPQNDTRIRHREDSVSSNATTIGGNQRSAFQIDDILHRVQHRGKISLN 63

Query: 471  KKRYLLTILIPLAIVFLFFTLG----FHGNSRVWDIKLLQSPSDRMRESELKALYLLKEQ 638
            K+  ++ + + ++I  LF        F  N   + +  L   S+R++ESEL+ALYLL++Q
Sbjct: 64   KRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKLDPL---SNRVKESELRALYLLRQQ 120

Query: 639  QLALITLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTV-FEDLKSRIFDGIML 815
            QLAL++L N TL                     LN SE  +  +V FED+KS +   I L
Sbjct: 121  QLALLSLWNGTLVNPS-----------------LNQSENALGSSVLFEDVKSAVSKQISL 163

Query: 816  NKEIQKVLLSTHKNVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFA 995
            NKEIQ+VLLS H++ N S  G  + D V    + C KVDQKL++RKT+EWKPRSDK+LFA
Sbjct: 164  NKEIQEVLLSPHRSSNYS--GGTDVDSVNFSYNRCRKVDQKLSDRKTVEWKPRSDKFLFA 221

Query: 996  ICLSGQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTY 1175
            ICLSGQMSNHL+CLEKHMFFAALLDRVLVIPS K DY + RV+DIE IN CLGR VVV +
Sbjct: 222  ICLSGQMSNHLLCLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGRNVVVAF 281

Query: 1176 EEFVE-AKKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVP-KLESPWEEDVKKPK 1349
            ++F E AKK   RIDRFICYFS PQ CYVDE+H+KKLK LG+S+  KLE+PW ED+KKP 
Sbjct: 282  DQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKKPS 341

Query: 1350 IRTVDDVKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQR 1529
             RTV DV+ KF SD+ VIAIGDVF+AD+E D VMQPGGPI HKCKTLIEPS+LI++TAQR
Sbjct: 342  KRTVQDVQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQR 401

Query: 1530 FVQTFLGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDA 1709
            F+QTFLGK+F+ALHFRRHG+LKFCNAK+PSCF+ IPQAA CI R+V R+N  VIYLSTDA
Sbjct: 402  FIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDA 461

Query: 1710 AGSETGLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMS 1889
            A SET LLQSL+V+DGK VPLV+RP RN+AEKWDALLYRHG+E DSQV+AMLDKTICAMS
Sbjct: 462  AESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMS 521

Query: 1890 SVFIGSSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            SVFIG+SGSTFTEDILRLRK WGT S CDEY+C+GE PN IAE+E
Sbjct: 522  SVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566


>ref|XP_006307109.1| hypothetical protein CARUB_v10008696mg [Capsella rubella]
            gi|482575820|gb|EOA40007.1| hypothetical protein
            CARUB_v10008696mg [Capsella rubella]
          Length = 576

 Score =  671 bits (1730), Expect = 0.0
 Identities = 356/591 (60%), Positives = 429/591 (72%), Gaps = 24/591 (4%)
 Frame = +3

Query: 324  SSDDDEEDCRSLIHQNDTVKPT-----SNSHSF-----------SPFDIDNNGFGASLRR 455
            +S D+EED R+LI QNDT          N H             S F ID     A  R 
Sbjct: 4    NSSDEEEDHRNLIPQNDTRDNAINLRRENEHQSVRANGGGRSPRSAFQIDEFASRAGNRW 63

Query: 456  RFKMNKKRYLLTILIPLAIVFLFFTLGFHGNSRVWDIKL----LQSPSDRMRESELKALY 623
            +  +NK+  +  + + L +  LF    F    R + + L    L   S R++ESEL+ALY
Sbjct: 64   KISLNKRYVVGAVSLTLFLGVLFL---FTDTRRFFSVDLSTFQLDPLSSRVKESELRALY 120

Query: 624  LLKEQQLALITLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFD 803
            LL++QQLAL++LLNRTL    A                 N S       V +++K+ + +
Sbjct: 121  LLRQQQLALVSLLNRTLVDQSA---------------NFNSSNAIGTSLVIDNVKAALVN 165

Query: 804  GIMLNKEIQKVLLSTHKNVNNSELGVGNDDMVEEGI--SICGKVDQKLAERKTIEWKPRS 977
             I +NKEI++VLLS H+  N S  G G D +       + C KVDQKL +RKTIEWKPRS
Sbjct: 166  QISINKEIEEVLLSPHRTGNYSSTGSGLDSISGSYYDDARCRKVDQKLLDRKTIEWKPRS 225

Query: 978  DKYLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGR 1157
            DK+LFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPS K DY + RV+DI+ IN CLGR
Sbjct: 226  DKFLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSPKFDYQYDRVIDIDRINTCLGR 285

Query: 1158 KVVVTYEEFVEA-KKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVP-KLESPWEE 1331
             VVV++++F E  KK +  IDRFICYFS PQ CYVDE+H+KKLK LG+S+  KLE+PW E
Sbjct: 286  TVVVSFDQFKEIDKKNNAHIDRFICYFSSPQPCYVDEEHIKKLKGLGISIGGKLEAPWSE 345

Query: 1332 DVKKPKIRTVDDVKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLI 1511
            D+KKP  RT  +V  KF SD+GVIAIGD+F+AD+E DLVMQPGGPI HKCKTLIEPSRLI
Sbjct: 346  DIKKPTKRTSQEVVEKFKSDDGVIAIGDLFYADMEQDLVMQPGGPIKHKCKTLIEPSRLI 405

Query: 1512 MITAQRFVQTFLGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVI 1691
            ++TAQRF+QTFLGK+F++LH RRHG+LKFCNAK+PSCF+ IPQAA CI R+V RAN PVI
Sbjct: 406  LVTAQRFIQTFLGKNFISLHLRRHGFLKFCNAKSPSCFYPIPQAADCISRMVERANAPVI 465

Query: 1692 YLSTDAAGSETGLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDK 1871
            YLSTDAA SETGLLQSL+V+DGK VPLV+RP RN+AEKWD+LLYRHG+E DSQV+AMLDK
Sbjct: 466  YLSTDAAESETGLLQSLVVVDGKVVPLVKRPPRNSAEKWDSLLYRHGIEDDSQVDAMLDK 525

Query: 1872 TICAMSSVFIGSSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            TICAMSSVFIG+SGSTFTEDILRLRK WGT S+CDEY+C+GE PN IAENE
Sbjct: 526  TICAMSSVFIGASGSTFTEDILRLRKDWGTSSMCDEYLCRGEEPNFIAENE 576


>ref|XP_002303337.1| protein-O-fucosyltransferase 2 [Populus trichocarpa]
            gi|222840769|gb|EEE78316.1| protein-O-fucosyltransferase
            2 [Populus trichocarpa]
          Length = 527

 Score =  667 bits (1722), Expect = 0.0
 Identities = 349/567 (61%), Positives = 414/567 (73%), Gaps = 1/567 (0%)
 Frame = +3

Query: 327  SDDDEEDCRSLIHQNDTVKPTSNSHSFSPFDIDNNGFGASLRRRFKMNKKRYLLTILIPL 506
            S D+E+D   LI QND     +  +S          F A++              I +PL
Sbjct: 5    SSDEEDDREHLIEQNDRKHHQNGRYSL---------FAAAI--------------IFLPL 41

Query: 507  AIVFLFFTLGFHGNSRVWDIKLLQSPSDRMRESELKALYLLKEQQLALITLLNRTLSTSL 686
             I+FL F+     N     +K+  S S RMRESEL+ALYLLK+QQL+L +L N T     
Sbjct: 42   FILFLSFSTDIR-NLFSTHLKVGDSLSIRMRESELRALYLLKKQQLSLFSLWNST----- 95

Query: 687  AIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLNKEIQKVLLSTHKNVNN 866
                           G   L E  +    FEDLKS +   I LNKEIQ+VLL+ H++ N 
Sbjct: 96   ---------------GNSTLLEKDLNSVSFEDLKSALLKQISLNKEIQQVLLAPHESGNV 140

Query: 867  SELGVGNDDMVEEG-ISICGKVDQKLAERKTIEWKPRSDKYLFAICLSGQMSNHLICLEK 1043
            S      D     G +  C KVDQ+ A+RKTIEWKP+ +K+LFA+CLSGQMSNHLICLEK
Sbjct: 141  SSSSSDLDFSNAGGFVQRCEKVDQRFADRKTIEWKPKPNKFLFALCLSGQMSNHLICLEK 200

Query: 1044 HMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYEEFVEAKKKDLRIDRF 1223
            HMFFAALL+RVLVIPS + DY ++RVLDIEH+N CLGRKVVVT+EEFVE  K    IDRF
Sbjct: 201  HMFFAALLNRVLVIPSSRFDYQYNRVLDIEHVNDCLGRKVVVTFEEFVEIMKNKPHIDRF 260

Query: 1224 ICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWEEDVKKPKIRTVDDVKAKFSSDEGVI 1403
             CYFS P  CYVDE+HVKKLK LG+S+ KLESPW+ED+KKP   TV DV+ KF SD+ VI
Sbjct: 261  FCYFSDPTPCYVDEEHVKKLKGLGVSMGKLESPWKEDIKKPSKLTVKDVEGKFVSDDNVI 320

Query: 1404 AIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRFVQTFLGKDFVALHFRRH 1583
            A+GDVFFADVE + +MQPGGPIAHKCKTLIEP+R+IM+TAQRF+QTFLG +F+ALHFRRH
Sbjct: 321  AVGDVFFADVEEEWIMQPGGPIAHKCKTLIEPTRIIMLTAQRFIQTFLGSNFIALHFRRH 380

Query: 1584 GWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAAGSETGLLQSLIVIDGKP 1763
            G+LKFCNAK PSCF+ +PQAA CI RVV RAN PV+YLSTDAA SETGLLQSL+V++G+ 
Sbjct: 381  GFLKFCNAKKPSCFYPVPQAADCIARVVERANAPVVYLSTDAAESETGLLQSLVVVNGRT 440

Query: 1764 VPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSSGSTFTEDILRL 1943
            VPLV RP+RNAAEKWDALLYRHGL+ D+QVEAMLDKTICAMSSVFIG+SGSTFTEDI RL
Sbjct: 441  VPLVTRPSRNAAEKWDALLYRHGLQEDAQVEAMLDKTICAMSSVFIGASGSTFTEDIFRL 500

Query: 1944 RKGWGTLSICDEYICQGEVPNLIAENE 2024
            RKGW + S CDEY+CQGE+PN IAENE
Sbjct: 501  RKGWESASSCDEYLCQGELPNYIAENE 527


>ref|XP_004303772.1| PREDICTED: uncharacterized protein LOC101299396 [Fragaria vesca
            subsp. vesca]
          Length = 556

 Score =  667 bits (1721), Expect = 0.0
 Identities = 355/584 (60%), Positives = 424/584 (72%), Gaps = 14/584 (2%)
 Frame = +3

Query: 315  EGTSSDDDEEDCR-SLIHQNDTVKPTSNSHSFSPF-----DIDNNGFGASLRRRFK---- 464
            +  SSDD+ ED R +LI QND  K   +  S + F     D+D +     +RRRF     
Sbjct: 5    DSLSSDDEVEDDRQNLIEQNDR-KQLPSPRSATTFHIDDGDVDRHRHHREIRRRFASLNL 63

Query: 465  ---MNKKRYLLT-ILIPLAIVFLFFTLGFHGNSRVWDIKLLQSPSDRMRESELKALYLLK 632
                NK+ +L+  I IPL ++ LFF+     +     + +  S S ++RESEL+ALYLL+
Sbjct: 64   RDLFNKRSFLVFFIFIPLFVLVLFFSTDIK-SLFFSHLSVSDSVSGKLRESELRALYLLR 122

Query: 633  EQQLALITLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIM 812
            +QQL L  L N T + S                               +DLKS +   I 
Sbjct: 123  QQQLGLFGLWNSTSNHS---------------------------NPDLDDLKSSVLRQIS 155

Query: 813  LNKEIQKVLLSTHKNVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLF 992
            LNKEIQ+VLLS H + N+SE     D  + +    C  VDQ+ +ER+TIEWKP SDKYL 
Sbjct: 156  LNKEIQQVLLSPHSSGNSSESEDFRDPSLGDR---CRVVDQRFSERRTIEWKPNSDKYLL 212

Query: 993  AICLSGQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVT 1172
            AIC+SGQMSNHLICLEKHMFFAALL+R+LVIPS KVDY +S VLDIEHINKC+GRKVVVT
Sbjct: 213  AICVSGQMSNHLICLEKHMFFAALLNRILVIPSSKVDYQYSTVLDIEHINKCIGRKVVVT 272

Query: 1173 YEEFVEAKKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWEEDVKKPKI 1352
            +EE  E KK  + IDRFICYFS+P  CYVD++H+KKLK+LG+S    E  W EDVKKP  
Sbjct: 273  FEELAEEKKNHIHIDRFICYFSKPTLCYVDDEHLKKLKALGISYKSREPAWGEDVKKPSK 332

Query: 1353 RTVDDVKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRF 1532
            +TV DV++KFSS + VIAIGDVFFAD E D VMQPGGP+AHKCKTLIEPSRLI++TAQRF
Sbjct: 333  KTVQDVQSKFSSGDEVIAIGDVFFADAEQDWVMQPGGPLAHKCKTLIEPSRLILLTAQRF 392

Query: 1533 VQTFLGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAA 1712
            +QTFLGK+FVALHFRRHG+LKFCN K PSCF+ IPQAA CI R+  RAN PV+YLSTDAA
Sbjct: 393  IQTFLGKNFVALHFRRHGFLKFCNNKQPSCFYPIPQAADCITRIAERANAPVVYLSTDAA 452

Query: 1713 GSETGLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSS 1892
             SETGLLQSL+V++GK VPLV+RPARN+AEKWDALLYRHG+EGD QVEAMLDKTI AMSS
Sbjct: 453  ESETGLLQSLVVVNGKTVPLVKRPARNSAEKWDALLYRHGIEGDPQVEAMLDKTISAMSS 512

Query: 1893 VFIGSSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            VFIG+SGSTFTEDILRLRKGWG+ S+CDEY+CQGE PN IAENE
Sbjct: 513  VFIGASGSTFTEDILRLRKGWGSASVCDEYLCQGEEPNFIAENE 556


>ref|XP_002892943.1| hypothetical protein ARALYDRAFT_889130 [Arabidopsis lyrata subsp.
            lyrata] gi|297338785|gb|EFH69202.1| hypothetical protein
            ARALYDRAFT_889130 [Arabidopsis lyrata subsp. lyrata]
          Length = 583

 Score =  666 bits (1719), Expect = 0.0
 Identities = 358/598 (59%), Positives = 437/598 (73%), Gaps = 31/598 (5%)
 Frame = +3

Query: 324  SSDDDEEDCRSLIHQNDT------VKPTSNSHSFSPFDIDN--NGFGASLRRRFKMNK-- 473
            +S DDEED R+LI QNDT      ++     HS +     N  NG G S R  F++++  
Sbjct: 4    NSSDDEEDHRNLIPQNDTRDNDLDLRREDELHSVTTARAINRANGGGRSPRSAFQIDEIV 63

Query: 474  ------------KRYLLTIL-IPLAIVFLFFTLGFHGNSRVWDIKL----LQSPSDRMRE 602
                        KRY++ ++ + L + FLF    F    R + + L    L   S R++E
Sbjct: 64   SRARNRWKISVNKRYVVAVVSLTLFVGFLFL---FTDTRRFFSVDLSSFKLDPMSSRVKE 120

Query: 603  SELKALYLLKEQQLALITLLNRTL-STSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFE 779
            SEL+AL LL++QQLAL++LLNR + ++S AIG                         + +
Sbjct: 121  SELQALNLLRQQQLALVSLLNRAIFNSSNAIG----------------------SSVLID 158

Query: 780  DLKSRIFDGIMLNKEIQKVLLSTHKNVNNSELGVGNDDMVEEGISI-CGKVDQKLAERKT 956
            ++K+ +   I +NKEI++VLLS HK  N S  G G+D +        C KVDQKL ERKT
Sbjct: 159  NVKAALLKQISVNKEIEEVLLSPHKTGNYSVTGSGSDSITGSYYDDRCKKVDQKLLERKT 218

Query: 957  IEWKPRSDKYLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEH 1136
            IEWKPR +K+LFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPS K DY + RV+DIE 
Sbjct: 219  IEWKPRPEKFLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIER 278

Query: 1137 INKCLGRKVVVTYEEFVEA-KKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVP-K 1310
            IN CLGR VV+++++F E  KK +  IDRFICYFS PQ CYVDEDH+KKLK LG+S+  K
Sbjct: 279  INTCLGRTVVISFDQFKEIDKKNNAHIDRFICYFSSPQPCYVDEDHIKKLKGLGVSIGGK 338

Query: 1311 LESPWEEDVKKPKIRTVDDVKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTL 1490
            LE+PW ED+KKP  RT  +V  KF SD+GVIAIGDVF+AD+E DLVMQPGGPI HKCKTL
Sbjct: 339  LEAPWIEDIKKPTKRTSKEVVEKFKSDDGVIAIGDVFYADMEQDLVMQPGGPINHKCKTL 398

Query: 1491 IEPSRLIMITAQRFVQTFLGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVA 1670
            IEPSRLI++TAQRF+QTFLGK+F++LH RRHG+LKFCNAK+PSCF+ IPQAA CI R+V 
Sbjct: 399  IEPSRLILVTAQRFIQTFLGKNFISLHLRRHGFLKFCNAKSPSCFYPIPQAADCISRMVE 458

Query: 1671 RANTPVIYLSTDAAGSETGLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQ 1850
            RAN PVIYLSTDAA SETGLLQSL+V++GK VPLV+RP RN+AEKWD+LLYRHG+E DSQ
Sbjct: 459  RANAPVIYLSTDAAESETGLLQSLVVVNGKVVPLVKRPPRNSAEKWDSLLYRHGIEDDSQ 518

Query: 1851 VEAMLDKTICAMSSVFIGSSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            V+AMLDKTICAMSSVFIG+SGSTFTEDILRLRK WGT S+CDEY+C+GE PN IAENE
Sbjct: 519  VDAMLDKTICAMSSVFIGASGSTFTEDILRLRKDWGTSSMCDEYLCRGEEPNFIAENE 576


>ref|XP_006280247.1| hypothetical protein CARUB_v10026161mg [Capsella rubella]
            gi|482548951|gb|EOA13145.1| hypothetical protein
            CARUB_v10026161mg [Capsella rubella]
          Length = 568

 Score =  665 bits (1716), Expect = 0.0
 Identities = 354/584 (60%), Positives = 428/584 (73%), Gaps = 17/584 (2%)
 Frame = +3

Query: 324  SSDDDEEDCRSLIHQNDT--------VKPTSNSHSFSP---FDIDNNGFGASLRRRFKMN 470
            +S DDEED + LI QNDT        +  T+ +   SP   F I++       R +  +N
Sbjct: 4    NSSDDEEDHQHLIPQNDTRHRHREDPISSTATTTGGSPRSAFQIEDIVQRVQHRWKISLN 63

Query: 471  KKRYLLTILIPLAIVFLFFTLG----FHGNSRVWDIKLLQSPSDRMRESELKALYLLKEQ 638
            K+  ++ + + ++I  LF        F  N   +    L   S+R++ESEL+ALYLL++Q
Sbjct: 64   KRYVIVAVSLIISIGLLFILTDPRELFSANLSSFKRDPL---SNRVKESELRALYLLRQQ 120

Query: 639  QLALITLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLN 818
            QLAL++L N TL                      N S +     +FED+KS +   I LN
Sbjct: 121  QLALLSLWNGTLVNP-------------SLNQSANASSLE-SSVLFEDVKSAVSKQISLN 166

Query: 819  KEIQKVLLSTHKNVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFAI 998
            KEIQ+VLLS H+  N S  G    D V      C KVDQ L++R+T+EWKPRSDK+LFAI
Sbjct: 167  KEIQEVLLSPHRTANYS--GGTEVDSVNLAYDRCRKVDQNLSDRRTVEWKPRSDKFLFAI 224

Query: 999  CLSGQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYE 1178
            CLSGQMSNHLICLEKHMFFAALLDRVLVIPS K DY + RV+DIE IN CLGR VVV+++
Sbjct: 225  CLSGQMSNHLICLEKHMFFAALLDRVLVIPSPKFDYQYDRVIDIERINTCLGRNVVVSFD 284

Query: 1179 EFVE-AKKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVP-KLESPWEEDVKKPKI 1352
            +F E AKK   RIDRFICYFS PQ CYVDE+H+KKLK LG+S+  KLE+PW ED+KKP  
Sbjct: 285  QFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKKPSK 344

Query: 1353 RTVDDVKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRF 1532
            RTV DV+ KF SD+ VIAIGDVF+AD+E D VMQPGGPI HKCKTLIEPS+LI++TAQRF
Sbjct: 345  RTVQDVQTKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQRF 404

Query: 1533 VQTFLGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAA 1712
            +QTFLGK+F+ALHFRRHG+LKFCNAK+PSCF+ IPQAA CI R+V R+N  VIYLSTDAA
Sbjct: 405  IQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDAA 464

Query: 1713 GSETGLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSS 1892
             SET LLQSL+V+DGK VPLV+RP RN+AEKWDALLYRHG+E DSQV+AMLDKTICAMSS
Sbjct: 465  ESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMSS 524

Query: 1893 VFIGSSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            VFIG+SGSTFTEDILRLRK WGT S+CDEY+C+GE PN IAE+E
Sbjct: 525  VFIGASGSTFTEDILRLRKDWGTSSMCDEYLCRGEEPNFIAEDE 568


>ref|XP_002533327.1| conserved hypothetical protein [Ricinus communis]
            gi|223526849|gb|EEF29063.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 565

 Score =  663 bits (1711), Expect = 0.0
 Identities = 342/577 (59%), Positives = 426/577 (73%), Gaps = 11/577 (1%)
 Frame = +3

Query: 327  SDDDEEDCRSLIHQNDT-------VKPTSNSH--SFSPFDIDNNGFGASLRRRFKMNKKR 479
            S D+E+D  +LI QND          PTS+ H  SFS F I+  G G   RR F      
Sbjct: 5    SSDEEDDRENLIEQNDRKHHNHQQTVPTSSPHRRSFSTFHIEEYG-GVIRRRLFNKRYYY 63

Query: 480  YLLTILIPLAIVFLFFTLGFHG--NSRVWDIKLLQSPSDRMRESELKALYLLKEQQLALI 653
            YLL I +PL I+ ++F+       ++ +  +    S SDRMRE+EL+ALYLL++QQL+L+
Sbjct: 64   YLLAIFLPLLIIIVYFSADLRSLFSANISSLNF-NSASDRMREAELQALYLLEQQQLSLL 122

Query: 654  TLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLNKEIQK 833
            ++ N++  +                  ++N      +    E+ +S +   +  NK+IQ+
Sbjct: 123  SIFNQSFPSR--------NKNFSSNSSFIN----SFDNVKIENFRSALLKQMTFNKQIQQ 170

Query: 834  VLLSTHKNVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFAICLSGQ 1013
            +LLS HK+ N +  G  +      G   C KV+ +  +RKTIEWKPRSDK+LF ICLSGQ
Sbjct: 171  ILLSPHKSGNENVSGSFSGSGF--GFDRCKKVESRFLDRKTIEWKPRSDKFLFPICLSGQ 228

Query: 1014 MSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYEEFVEA 1193
            MSNHLICLEKHMFFAALL+RVLV+PS K DY ++RVLDIEHIN C+GRKVVVT+EEFV+ 
Sbjct: 229  MSNHLICLEKHMFFAALLNRVLVMPSSKFDYQYNRVLDIEHINLCVGRKVVVTFEEFVQM 288

Query: 1194 KKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWEEDVKKPKIRTVDDVK 1373
            +K  + IDRFICYFS P  CYVDE+HVKKLK LG+ + K ESPW+EDVKKP  +TV DV 
Sbjct: 289  RKNHVHIDRFICYFSSPTACYVDEEHVKKLKGLGILMGKPESPWKEDVKKPSQKTVQDVL 348

Query: 1374 AKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRFVQTFLGK 1553
            AKF+S++ VIAIGDVF+AD+E D VMQPGGP+AHKCKTLIEPSRLI++TAQRF+QTFLGK
Sbjct: 349  AKFTSNDDVIAIGDVFYADMEQDWVMQPGGPLAHKCKTLIEPSRLILVTAQRFIQTFLGK 408

Query: 1554 DFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAAGSETGLL 1733
            +F+ALHFRRHG+LKFCNAKNPSCF+ IPQAA CI RV  RAN PVIYLSTDAA SET LL
Sbjct: 409  NFIALHFRRHGFLKFCNAKNPSCFYPIPQAADCIARVAERANAPVIYLSTDAAESETDLL 468

Query: 1734 QSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSSG 1913
            QSLI+++GK VPLV+RP+  + EKWD+LL RHG+E DSQVEAMLDKTI AMS+VFIG+SG
Sbjct: 469  QSLIIVNGKTVPLVKRPSHTSVEKWDSLLSRHGIEDDSQVEAMLDKTISAMSNVFIGASG 528

Query: 1914 STFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            STFTEDILRLRK W + S+CDEY+CQGE+PN IAE+E
Sbjct: 529  STFTEDILRLRKDWESASLCDEYLCQGELPNFIAEDE 565


>gb|EOY27412.1| O-fucosyltransferase family protein isoform 1 [Theobroma cacao]
          Length = 558

 Score =  659 bits (1701), Expect = 0.0
 Identities = 342/579 (59%), Positives = 425/579 (73%), Gaps = 13/579 (2%)
 Frame = +3

Query: 327  SDDDEEDCRSLIHQNDTVK-----PTSNSHSFSP---FDIDNNGFGASLRRRFKMN-KKR 479
            S D+++D ++LIHQNDT       P S   S SP   F I+     + +RRRFK+   KR
Sbjct: 5    SSDEDDDRQTLIHQNDTKNLPHQIPASPRPSTSPRSSFHIEE--LESQIRRRFKLTFNKR 62

Query: 480  YLLTILIPLAIVFLFFTLGFHG--NSRVWDIKLLQSPSDRMRESELKALYLLKEQQLALI 653
            YL  I +PL I+ ++F+       +S +  +K   + SDR+RES+L+ALYLL +QQ +L+
Sbjct: 63   YLFAIFLPLLIIPIYFSTDIRSLFSSNISSLKF-NTVSDRIRESQLQALYLLNQQQNSLL 121

Query: 654  TLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLNKEIQK 833
            +L N T                     ++N S   +    F+D+K+ +   I LNK IQ+
Sbjct: 122  SLWNHT---------------------FVN-SNNNITAVQFDDIKASLLTQITLNKHIQQ 159

Query: 834  VLLSTHKNVNNSELGVGND-DMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFAICLSG 1010
            +LLS HK  N+ + G   D +        C KVDQK AERKT EWKP+ +K+LFAICLSG
Sbjct: 160  ILLSPHKTGNSPQNGTLLDPNFAGYSFDRCRKVDQKFAERKTFEWKPKPNKFLFAICLSG 219

Query: 1011 QMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYEEFVE 1190
            QMSNHLICLEKHMFFAA+L+R LVIPS + DY ++RVLDIEHIN C+G+K V+ +EEF+E
Sbjct: 220  QMSNHLICLEKHMFFAAVLNRALVIPSSRFDYQYNRVLDIEHINGCIGKKAVIPFEEFME 279

Query: 1191 AKKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWE-EDVKKPKIRTVDD 1367
             KK    ID+FICYFS PQ CYVDE+H+KKLKSLG+S  KLE+ W+ ED+KKP  +T+ D
Sbjct: 280  IKKNHAHIDKFICYFSSPQPCYVDEEHLKKLKSLGISTGKLETAWKNEDIKKPSQKTIKD 339

Query: 1368 VKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRFVQTFL 1547
            V+ KF SD+ VIAIGDVF+ADVE D V+QPGGPIAHKCKTLIEPS+LI++TA+RF+QTFL
Sbjct: 340  VEEKFGSDDDVIAIGDVFYADVERDWVLQPGGPIAHKCKTLIEPSKLILLTAERFIQTFL 399

Query: 1548 GKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAAGSETG 1727
            G +F+ALHFRRHG+LKFCNAK PSCF+ IPQAA CI R+V RANTPVIYLSTDAA SET 
Sbjct: 400  GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRMVERANTPVIYLSTDAAESETS 459

Query: 1728 LLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGS 1907
            LLQS++V++GK +PLV+RP RN+AEKWDALLYRHGL  D QVEAMLDKTICAMSSVFIG+
Sbjct: 460  LLQSMVVLNGKTIPLVKRPPRNSAEKWDALLYRHGLAEDPQVEAMLDKTICAMSSVFIGA 519

Query: 1908 SGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
             GSTFT DILRLRK WGT S+CDEY+CQGE PN  A  E
Sbjct: 520  PGSTFTGDILRLRKDWGTASLCDEYLCQGEDPNFTAGEE 558


>ref|NP_173170.2| O-fucosyltransferase family protein [Arabidopsis thaliana]
            gi|27754290|gb|AAO22598.1| unknown protein [Arabidopsis
            thaliana] gi|332191445|gb|AEE29566.1|
            O-fucosyltransferase family protein [Arabidopsis
            thaliana]
          Length = 564

 Score =  658 bits (1697), Expect = 0.0
 Identities = 347/585 (59%), Positives = 421/585 (71%), Gaps = 18/585 (3%)
 Frame = +3

Query: 324  SSDDDEEDCRSLIHQNDT------VKPTSNSHSF---------SPFDIDNNGFGASLRRR 458
            +S D+EED R+LI QNDT      ++P + + +          S   ID     A  R +
Sbjct: 4    NSSDEEEDHRNLIPQNDTRDNDLNLRPDARTVNMANGGGRSPRSALQIDEILSRARNRWK 63

Query: 459  FKMNKKRYLLTILIPLAIVFLFFTLGFHGNSRVWDIKLLQSPSDRMRESELKALYLLKEQ 638
              +NK+  +  + + L +  LF    F      +    L   S R++ESEL+AL LL++Q
Sbjct: 64   ISVNKRYVVAAVSLTLFVGLLFL---FTDTRTFFSSFKLDPMSSRVKESELQALNLLRQQ 120

Query: 639  QLALITLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLN 818
            QLAL++LLNRT                       N S       V +++K+ +   I +N
Sbjct: 121  QLALVSLLNRT---------------------NFNSSNAISSSVVIDNVKAALLKQISVN 159

Query: 819  KEIQKVLLSTHKNVNNSELGVGNDDMVEE-GISICGKVDQKLAERKTIEWKPRSDKYLFA 995
            KEI++VLLS H+  N S    G+D         IC KVDQKL +RKTIEWKPR DK+LFA
Sbjct: 160  KEIEEVLLSPHRTGNYSITASGSDSFTGSYNADICRKVDQKLLDRKTIEWKPRPDKFLFA 219

Query: 996  ICLSGQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTY 1175
            ICLSGQMSNHLICLEKHMFFAALLDRVLVIPS K DY + +V+DIE IN CLGR VV+++
Sbjct: 220  ICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDKVIDIERINTCLGRTVVISF 279

Query: 1176 EEFVEA-KKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVP-KLESPWEEDVKKPK 1349
            ++F E  KK +  IDRFICY S PQ CYVDEDH+KKLK LG+S+  KLE+PW ED+KKP 
Sbjct: 280  DQFKEIDKKNNAHIDRFICYVSSPQPCYVDEDHIKKLKGLGVSIGGKLEAPWSEDIKKPT 339

Query: 1350 IRTVDDVKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQR 1529
             RT  +V  KF SD+GVIAIGDVF+AD+E DLVMQPGGPI HKCKTLIEPSRLI++TAQR
Sbjct: 340  KRTSQEVVEKFKSDDGVIAIGDVFYADMEQDLVMQPGGPINHKCKTLIEPSRLILVTAQR 399

Query: 1530 FVQTFLGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDA 1709
            F+QTFLGK+F++LH RRHG+LKFCNAK+PSCF+ IPQAA CI R+V RAN PVIYLSTDA
Sbjct: 400  FIQTFLGKNFISLHLRRHGFLKFCNAKSPSCFYPIPQAADCISRMVERANAPVIYLSTDA 459

Query: 1710 AGSETGLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMS 1889
            A SETGLLQSL+V+DGK VPLV+RP +N+AEKWD+LLYRHG+E DSQV AMLDKTICAMS
Sbjct: 460  AESETGLLQSLVVVDGKVVPLVKRPPQNSAEKWDSLLYRHGIEDDSQVYAMLDKTICAMS 519

Query: 1890 SVFIGSSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            SVFIG+SGSTFTEDILRLRK WGT S+CDEY+C+GE PN IAENE
Sbjct: 520  SVFIGASGSTFTEDILRLRKDWGTSSMCDEYLCRGEEPNFIAENE 564


>ref|XP_006416723.1| hypothetical protein EUTSA_v10007186mg [Eutrema salsugineum]
            gi|557094494|gb|ESQ35076.1| hypothetical protein
            EUTSA_v10007186mg [Eutrema salsugineum]
          Length = 579

 Score =  657 bits (1694), Expect = 0.0
 Identities = 348/591 (58%), Positives = 422/591 (71%), Gaps = 27/591 (4%)
 Frame = +3

Query: 333  DDEEDCRSLIHQND------------TVKPTSNSHSF-----------SPFDIDNNGFGA 443
            D++ED RSLI  ND             ++  + + +            S F ID      
Sbjct: 7    DEDEDHRSLIPHNDIRDNDLNRRREDNIQSVTTARAINMANGDDRSPRSAFQIDETVTRT 66

Query: 444  SLRRRFKMNKKRYLLTILIPLAIVFLFFTLGFHGNSRVWDIKLLQSP-SDRMRESELKAL 620
              R    ++K+  +  + + L +VF F     + N R +       P S R+RESEL+AL
Sbjct: 67   RSRWNISLDKRYVVAAVSLTLLVVFFFL---LYTNPRRFSSSFKLDPLSTRVRESELRAL 123

Query: 621  YLLKEQQLALITLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIF 800
            YLL++QQLAL++LLNRTL    A                LN S       + +++K+ + 
Sbjct: 124  YLLRQQQLALVSLLNRTLVDQTA---------------NLNSSNSIGSSLLVDNVKAALA 168

Query: 801  DGIMLNKEIQKVLLSTHKNVNNSELGVGNDDMVEE-GISICGKVDQKLAERKTIEWKPRS 977
              I L+K+I+ VLLS H+  N+S    G+D +        C KVDQKL ERKTIEWKPR 
Sbjct: 169  KQISLSKQIEDVLLSPHRTGNHSVTDPGSDSITGSYNYERCRKVDQKLLERKTIEWKPRP 228

Query: 978  DKYLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGR 1157
             K+LFAICLSGQMSNHLICLEKHMFFAALLDR LVIPS K DY + RV+DIE IN CLGR
Sbjct: 229  GKFLFAICLSGQMSNHLICLEKHMFFAALLDRALVIPSSKFDYQYDRVIDIERINTCLGR 288

Query: 1158 KVVVTYEEFVEA-KKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVP-KLESPWEE 1331
             VV+++++F E  KKK+  IDRFICYFS PQ CYVDEDHVKKLK LG+S+  KLE+PW E
Sbjct: 289  TVVISFDQFKEIDKKKNAHIDRFICYFSSPQPCYVDEDHVKKLKGLGISIGGKLEAPWSE 348

Query: 1332 DVKKPKIRTVDDVKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLI 1511
            D+KKP  R+  +V+ KF SD+GVIAIGDVF+AD+E DLVMQPGGPI HKCKTLIEPSRLI
Sbjct: 349  DIKKPTKRSFQEVQEKFKSDDGVIAIGDVFYADMEQDLVMQPGGPIKHKCKTLIEPSRLI 408

Query: 1512 MITAQRFVQTFLGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVI 1691
            ++TAQRF+QTFLGK+F ALH RRHG+LKFCNAK+PSCF+ IPQAA CI R+V RAN PVI
Sbjct: 409  LLTAQRFIQTFLGKNFTALHLRRHGFLKFCNAKSPSCFYPIPQAADCISRIVERANAPVI 468

Query: 1692 YLSTDAAGSETGLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDK 1871
            YLSTDAA SETGLLQSL+V+DGK VPLV+RP R++AEKWDALLYRHG+E DSQV+AMLDK
Sbjct: 469  YLSTDAAESETGLLQSLVVVDGKVVPLVKRPPRDSAEKWDALLYRHGIEDDSQVDAMLDK 528

Query: 1872 TICAMSSVFIGSSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            TI AMSSVFIG+SGSTFTEDILRLRK WGT S+CDEY+C+GE PN IAE+E
Sbjct: 529  TISAMSSVFIGASGSTFTEDILRLRKDWGTSSMCDEYLCRGEEPNFIAEDE 579


>ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208722 [Cucumis sativus]
            gi|449517914|ref|XP_004165989.1| PREDICTED:
            uncharacterized protein LOC101230373 [Cucumis sativus]
          Length = 573

 Score =  656 bits (1693), Expect = 0.0
 Identities = 348/580 (60%), Positives = 427/580 (73%), Gaps = 13/580 (2%)
 Frame = +3

Query: 324  SSDDDEEDCRSLIHQNDTVK-PTSNSHSFSPFDIDNNG-FGASLRR------RFKMNKKR 479
            SS D+E+D +SL+  ND    P+  +HS + FDID++  F   + R      +F  +K+ 
Sbjct: 6    SSSDEEDDRQSLVEHNDIKPHPSPPTHS-TTFDIDDDPHFRPPIPRFPFSIPKFAFDKRY 64

Query: 480  Y-LLTILIPLAIVFLFFTL---GFHGNSRVWDIKLLQSPSDRMRESELKALYLLKEQQLA 647
            Y LL   +PL I+ LFF++        +    +K   S +DRMRESEL ALYLL++QQL 
Sbjct: 65   YYLLAAALPLCILVLFFSVDITSLFSTTLSSTLKTSDSLTDRMRESELTALYLLRQQQLG 124

Query: 648  LITLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLNKEI 827
               L N +L                      NLS       + E +KS +   I LNKEI
Sbjct: 125  FFHLWNHSLFLQSNSSFNSTPSN--------NLSS---NSALTEYIKSALLKQITLNKEI 173

Query: 828  QKVLLSTHKNVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFAICLS 1007
            Q VLLS H++ N SE       M    +  C K+DQKL++R+TIEWKP+S+K+LFAIC S
Sbjct: 174  QNVLLSPHRSGNLSEEVGDALPMDTFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTS 233

Query: 1008 GQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYEEFV 1187
            GQMSNHLICLEKHMFFAA+L+RVLVIPSHKVDY FSRV+DI+ +N CLGRKVV+++EEF 
Sbjct: 234  GQMSNHLICLEKHMFFAAILNRVLVIPSHKVDYQFSRVIDIDRMNMCLGRKVVISFEEFS 293

Query: 1188 EAKKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWEEDVKKPKIRTVDD 1367
            E KK  L IDRFICYFS+P  CYVD++H+ KLK+LG+S+ KLES W ED K P  +TV D
Sbjct: 294  EIKKHHLHIDRFICYFSKPNPCYVDDEHISKLKNLGISMGKLESAWNEDTKHPNRKTVSD 353

Query: 1368 VKAKFSSD-EGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRFVQTF 1544
            V++KFSS+ + VIA+GD+FFA+VE + V QPGGPIAHKC+TLIEPS LI +TAQRF+QTF
Sbjct: 354  VESKFSSNNDDVIAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSHLIKLTAQRFIQTF 413

Query: 1545 LGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAAGSET 1724
            LGK+++ALHFRRHG+LKFCNAK PSCF+ IPQAA CI R+V RAN PVIYLSTDAA SE 
Sbjct: 414  LGKNYIALHFRRHGFLKFCNAKQPSCFYPIPQAADCIIRMVERANVPVIYLSTDAAESEH 473

Query: 1725 GLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIG 1904
            GLLQSL+V++GKP+PLV+RP RN+AEKWDALLYRHGLE DSQVEAMLDKTICAMSS FIG
Sbjct: 474  GLLQSLLVLNGKPIPLVKRPPRNSAEKWDALLYRHGLEEDSQVEAMLDKTICAMSSTFIG 533

Query: 1905 SSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            + GSTFTEDILRLRK WGT S+CDEY+CQGE PN I+ENE
Sbjct: 534  APGSTFTEDILRLRKDWGTASMCDEYLCQGEEPNFISENE 573


>gb|EOY27413.1| O-fucosyltransferase family protein isoform 2 [Theobroma cacao]
          Length = 559

 Score =  655 bits (1689), Expect = 0.0
 Identities = 342/580 (58%), Positives = 425/580 (73%), Gaps = 14/580 (2%)
 Frame = +3

Query: 327  SDDDEEDCRSLIHQNDTVK-----PTSNSHSFSP---FDIDNNGFGASLRRRFKMN-KKR 479
            S D+++D ++LIHQNDT       P S   S SP   F I+     + +RRRFK+   KR
Sbjct: 5    SSDEDDDRQTLIHQNDTKNLPHQIPASPRPSTSPRSSFHIEE--LESQIRRRFKLTFNKR 62

Query: 480  YLLTILIPLAIVFLFFTLGFHG--NSRVWDIKLLQSPSDRMRESELKALYLLKEQQLALI 653
            YL  I +PL I+ ++F+       +S +  +K   + SDR+RES+L+ALYLL +QQ +L+
Sbjct: 63   YLFAIFLPLLIIPIYFSTDIRSLFSSNISSLKF-NTVSDRIRESQLQALYLLNQQQNSLL 121

Query: 654  TLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLNKEIQK 833
            +L N T                     ++N S   +    F+D+K+ +   I LNK IQ+
Sbjct: 122  SLWNHT---------------------FVN-SNNNITAVQFDDIKASLLTQITLNKHIQQ 159

Query: 834  VLLSTHKNVNNSELGVGND-DMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFAICLSG 1010
            +LLS HK  N+ + G   D +        C KVDQK AERKT EWKP+ +K+LFAICLSG
Sbjct: 160  ILLSPHKTGNSPQNGTLLDPNFAGYSFDRCRKVDQKFAERKTFEWKPKPNKFLFAICLSG 219

Query: 1011 QMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYEEFVE 1190
            QMSNHLICLEKHMFFAA+L+R LVIPS + DY ++RVLDIEHIN C+G+K V+ +EEF+E
Sbjct: 220  QMSNHLICLEKHMFFAAVLNRALVIPSSRFDYQYNRVLDIEHINGCIGKKAVIPFEEFME 279

Query: 1191 AKKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWE-EDVKKPKIRTVDD 1367
             KK    ID+FICYFS PQ CYVDE+H+KKLKSLG+S  KLE+ W+ ED+KKP  +T+ D
Sbjct: 280  IKKNHAHIDKFICYFSSPQPCYVDEEHLKKLKSLGISTGKLETAWKNEDIKKPSQKTIKD 339

Query: 1368 VKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRFVQTFL 1547
            V+ KF SD+ VIAIGDVF+ADVE D V+QPGGPIAHKCKTLIEPS+LI++TA+RF+QTFL
Sbjct: 340  VEEKFGSDDDVIAIGDVFYADVERDWVLQPGGPIAHKCKTLIEPSKLILLTAERFIQTFL 399

Query: 1548 GKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAAGSETG 1727
            G +F+ALHFRRHG+LKFCNAK PSCF+ IPQAA CI R+V RANTPVIYLSTDAA SET 
Sbjct: 400  GSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRMVERANTPVIYLSTDAAESETS 459

Query: 1728 LLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQ-VEAMLDKTICAMSSVFIG 1904
            LLQS++V++GK +PLV+RP RN+AEKWDALLYRHGL  D Q VEAMLDKTICAMSSVFIG
Sbjct: 460  LLQSMVVLNGKTIPLVKRPPRNSAEKWDALLYRHGLAEDPQVVEAMLDKTICAMSSVFIG 519

Query: 1905 SSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            + GSTFT DILRLRK WGT S+CDEY+CQGE PN  A  E
Sbjct: 520  APGSTFTGDILRLRKDWGTASLCDEYLCQGEDPNFTAGEE 559


>ref|XP_006465793.1| PREDICTED: uncharacterized protein LOC102617227 [Citrus sinensis]
          Length = 563

 Score =  653 bits (1684), Expect = 0.0
 Identities = 343/588 (58%), Positives = 428/588 (72%), Gaps = 22/588 (3%)
 Frame = +3

Query: 327  SDDDEEDCRSLIHQNDTVK-----PTSNSHS-------FSPFDIDNNGFGASLRRRF--- 461
            S DD++D  +LIHQNDT       PTSN++         S F ID+    + +RRRF   
Sbjct: 5    SSDDDDDRETLIHQNDTKHGNHRLPTSNNNEDEEHNRRHSTFHIDDLPNASPIRRRFTFD 64

Query: 462  --KMNKKRYLLTILIPLAIVFLFFTLG----FHGNSRVWDIKLLQSPSDRMRESELKALY 623
              K+N KRYL  + +PL I+ L+F++     F GN   +    L   +DRMRESEL+AL 
Sbjct: 65   FKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSL---ADRMRESELRALS 121

Query: 624  LLKEQQLALITLLNRTLSTSLAIGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFD 803
            LLK+QQ  L++L N++   +                 Y N +  P     F+D KS + +
Sbjct: 122  LLKQQQSHLLSLWNQSFVNN----------------SYGNNTNNPF----FQDAKSALLN 161

Query: 804  GIMLNKEIQKVLLSTHKNVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDK 983
             I LNK+I+++LLS HK  N +     ND +   G   C KVD  +  ++T+EWKP+SDK
Sbjct: 162  QISLNKQIEQILLSPHKVSNFTP----NDAVW--GFEGCRKVDSIIPNKRTVEWKPKSDK 215

Query: 984  YLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKV 1163
            +LFAICLSGQMSNHLICLEKHMF AALL+RVLVIPS K DY +SRVLDIEHIN CLGRKV
Sbjct: 216  FLFAICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKV 275

Query: 1164 VVTYEEFVEAKKKDLRIDRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWE-EDVK 1340
            VV++E F+E +K    IDRF+CYF  P+ C+VD++H+KKLK LG+S+ K E+ W+ ED +
Sbjct: 276  VVSFENFMEMEKNHAHIDRFLCYFGLPEPCFVDDEHIKKLKQLGISMGKTETVWKNEDTR 335

Query: 1341 KPKIRTVDDVKAKFSSDEGVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMIT 1520
            KP  RTV D++ KF +D+ VIA+GD+F+ADVE D VMQPGGPI H+CKTLIEPSRLIM+T
Sbjct: 336  KPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVT 395

Query: 1521 AQRFVQTFLGKDFVALHFRRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLS 1700
            AQRFVQTFLG +F+ALHFRRHG+LKFCNAK PSCF+ IPQAA CI R+  RAN PVIYLS
Sbjct: 396  AQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERANAPVIYLS 455

Query: 1701 TDAAGSETGLLQSLIVIDGKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTIC 1880
            TDAA SET LLQSL+V++GK + LV+RP RN+AEKWD+LLYRH LE DSQVEAMLDKTIC
Sbjct: 456  TDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTIC 515

Query: 1881 AMSSVFIGSSGSTFTEDILRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            AMS+VFIG+SGSTFTEDI+RLRK WG+ S+CDEY+CQGE PN IAE+E
Sbjct: 516  AMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563


>ref|XP_006357428.1| PREDICTED: uncharacterized protein LOC102602087 [Solanum tuberosum]
          Length = 565

 Score =  653 bits (1684), Expect = 0.0
 Identities = 336/570 (58%), Positives = 420/570 (73%), Gaps = 6/570 (1%)
 Frame = +3

Query: 333  DDEEDCRSLIHQNDTVKPTSNSHSFSPFDIDNNGFGASLRRRFKMNKKRYLLTILIPLAI 512
            ++EED  +LI Q +     S S   + F ID+     +       +K  Y LTI++    
Sbjct: 9    NEEEDQENLIAQRERGNNLSESPVRTAFQIDDE-IADTRPFNSSCSKCCYFLTIIVVTVF 67

Query: 513  VFL-FFTLGFHGNSRVWDIKLLQSPSDRMRESELKALYLLKEQQLALITLLNRTLSTSLA 689
            +F+ F+T      S+   +       + MRESEL+ALYLL++QQL L  L N TL  +  
Sbjct: 68   IFIRFYTTDVDNVSKTGVMN--NDSVNLMRESELRALYLLRQQQLGLFKLWNNTLIDNSL 125

Query: 690  IGLXXXXXXXXXXXGYLNLSEMPVEKTVFEDLKSRIFDGIMLNKEIQKVLLSTHK----- 854
                              +S       + E+LK  +   I LNK+IQ+ LLS+H+     
Sbjct: 126  NATAANNSNF--------VSTSLFSSALSEELKLELISQISLNKQIQQALLSSHQLGNLL 177

Query: 855  NVNNSELGVGNDDMVEEGISICGKVDQKLAERKTIEWKPRSDKYLFAICLSGQMSNHLIC 1034
            N +++      DD    G+  C K+D KL++R+TIEW+PRSDKYLFAIC SGQMSNHLIC
Sbjct: 178  NASDNATDPSLDDY--GGLDRCRKMDYKLSDRRTIEWEPRSDKYLFAICASGQMSNHLIC 235

Query: 1035 LEKHMFFAALLDRVLVIPSHKVDYDFSRVLDIEHINKCLGRKVVVTYEEFVEAKKKDLRI 1214
            LEKHMFFAALL+R+L+IPS +VDY+F RVLDI+HINKCLGRKVVVT+EEF +++K  + I
Sbjct: 236  LEKHMFFAALLNRILIIPSSRVDYEFRRVLDIDHINKCLGRKVVVTFEEFAKSQKGHMHI 295

Query: 1215 DRFICYFSQPQKCYVDEDHVKKLKSLGLSVPKLESPWEEDVKKPKIRTVDDVKAKFSSDE 1394
            D+FICYFSQPQ C++D++HVKKLKSLG+S+ KLE+ W+ED+K PK RTV D+  KFS D+
Sbjct: 296  DKFICYFSQPQPCFLDDEHVKKLKSLGVSMNKLEAAWDEDIKNPKPRTVQDIMTKFSLDD 355

Query: 1395 GVIAIGDVFFADVENDLVMQPGGPIAHKCKTLIEPSRLIMITAQRFVQTFLGKDFVALHF 1574
             VIAIGDVFFA+VE   VMQPGGPI+HKCKTL+EPSRLI++TAQRF+QTFLGK+F+ALHF
Sbjct: 356  DVIAIGDVFFANVEKKWVMQPGGPISHKCKTLVEPSRLILLTAQRFIQTFLGKNFIALHF 415

Query: 1575 RRHGWLKFCNAKNPSCFFSIPQAARCIERVVARANTPVIYLSTDAAGSETGLLQSLIVID 1754
            RRHG+LKFCNAK PSCF+ +PQAA CI RVV RA  PVIYLSTDAA SETG+LQSL+ ++
Sbjct: 416  RRHGFLKFCNAKKPSCFYPVPQAADCINRVVERATAPVIYLSTDAAESETGILQSLVAVN 475

Query: 1755 GKPVPLVQRPARNAAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSSGSTFTEDI 1934
            GK VPLV+RPA+N+AEKWDALLYRHGLEGD QVEAMLDKTICAMS VFIGS GSTFTEDI
Sbjct: 476  GKTVPLVRRPAQNSAEKWDALLYRHGLEGDRQVEAMLDKTICAMSEVFIGSMGSTFTEDI 535

Query: 1935 LRLRKGWGTLSICDEYICQGEVPNLIAENE 2024
            LRLRK WGT S+CDEY+C+GEVP+ IA++E
Sbjct: 536  LRLRKDWGTSSLCDEYLCRGEVPSFIADDE 565

