BLASTX nr result

ID: Sinomenium22_contig00019026 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00019026
         (2211 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254...   781   0.0  
gb|EXB64649.1| hypothetical protein L484_017982 [Morus notabilis]     764   0.0  
ref|XP_006342883.1| PREDICTED: uncharacterized protein LOC102584...   732   0.0  
ref|XP_004235519.1| PREDICTED: uncharacterized protein LOC101268...   728   0.0  
ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208...   714   0.0  
ref|XP_004303772.1| PREDICTED: uncharacterized protein LOC101299...   714   0.0  
ref|XP_002533327.1| conserved hypothetical protein [Ricinus comm...   714   0.0  
ref|XP_007024790.1| O-fucosyltransferase family protein isoform ...   712   0.0  
ref|XP_007024791.1| O-fucosyltransferase family protein isoform ...   707   0.0  
ref|XP_002303337.1| protein-O-fucosyltransferase 2 [Populus tric...   705   0.0  
gb|EYU21259.1| hypothetical protein MIMGU_mgv1a003863mg [Mimulus...   702   0.0  
ref|XP_006357428.1| PREDICTED: uncharacterized protein LOC102602...   698   0.0  
ref|XP_006465793.1| PREDICTED: uncharacterized protein LOC102617...   684   0.0  
ref|XP_006426814.1| hypothetical protein CICLE_v10025289mg [Citr...   684   0.0  
ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arab...   681   0.0  
ref|XP_004510588.1| PREDICTED: uncharacterized protein LOC101496...   680   0.0  
ref|XP_006280247.1| hypothetical protein CARUB_v10026161mg [Caps...   679   0.0  
ref|XP_003529861.1| PREDICTED: uncharacterized protein LOC100776...   677   0.0  
ref|NP_199853.1| O-fucosyltransferase family protein [Arabidopsi...   677   0.0  
gb|AAM66093.1| unknown [Arabidopsis thaliana]                         676   0.0  

>ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254979 isoform 1 [Vitis
            vinifera]
          Length = 559

 Score =  781 bits (2018), Expect = 0.0
 Identities = 391/580 (67%), Positives = 475/580 (81%), Gaps = 10/580 (1%)
 Frame = -2

Query: 1910 RATSDEEQEGEDRERLMEPNERKIAG-SSFEIGEFKNKITRF-YNSNKRYLFAICLPIFL 1737
            R +SD+E   EDR+ L++ NERK+   S F+I +FK++++   ++ NKRYLFAI  P+F+
Sbjct: 3    RESSDDE---EDRQNLIDENERKLPHRSGFQIEDFKSRLSAHRFSFNKRYLFAIFPPLFI 59

Query: 1736 IVVYFSVDIGNLYR-SVSSVRIGYPSDKMREAELKALYLLREQHVGLLNLWNLT----SS 1572
            +++YF+ D+ NL+  S+S V+   P+D+MRE+EL+ALYLLR+Q + L +LWN T    S+
Sbjct: 60   LLIYFTTDVRNLFTTSISIVKADSPTDRMRESELRALYLLRQQQLSLFSLWNHTAFADSA 119

Query: 1571 DLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFD--DFRSAVLKQIKLNKEVQ 1398
             +  NS  +                     +D S+  +L    DF+SA+LKQI LNKE+Q
Sbjct: 120  PIPSNSSNS--------------------TLDFSTRQVLLSSADFKSALLKQISLNKEIQ 159

Query: 1397 QVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKSNKFLFVICV 1221
            QVLLSSH  GNL+EL +DN D  FG Y  +RC KV++ +S R T+EW P+S+K+LF IC+
Sbjct: 160  QVLLSSHPSGNLSELVDDNGDLNFGAYSFNRCPKVNQNMSQRPTIEWKPRSDKYLFAICL 219

Query: 1220 SGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGRKVVVTFEEF 1041
            SGQMSNHLICLEKHMFFAALLNRILVIPS K DYQY+RVLDI+HIN C GRKVVVTFEEF
Sbjct: 220  SGQMSNHLICLEKHMFFAALLNRILVIPSSKFDYQYNRVLDIEHINNCLGRKVVVTFEEF 279

Query: 1040 SEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWAEDVKEPKKRSVE 861
            +E KKNH+HIDR ICY S P  C+VD+DHVKKLKSLGISMGKLEPAWAED+K+PKKR+ +
Sbjct: 280  TESKKNHLHIDRVICYFSLPLPCYVDDDHVKKLKSLGISMGKLEPAWAEDIKKPKKRTAQ 339

Query: 860  DVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIMLTAQRFIQTF 681
            DV++KFSSNDDV+AIGDVFYA+VE+EWVMQ GGPLAHKC+TL+EPSRLIMLTAQRF+QTF
Sbjct: 340  DVQAKFSSNDDVIAIGDVFYANVEEEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRFVQTF 399

Query: 680  LGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIYLSTDAAESET 501
            LG++F ALHFRRHGF+KFCNAK+PSCFFPIPQAADCI+RVVERA+ PVIYLSTDAAESET
Sbjct: 400  LGKSFTALHFRRHGFLKFCNAKEPSCFFPIPQAADCISRVVERADTPVIYLSTDAAESET 459

Query: 500  DLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIG 321
             LLQSL+V NGK VPL+KRP RNSAEKWDALLYRHGL+GDSQVEAMLDKTICAM+SVFIG
Sbjct: 460  GLLQSLVVLNGKLVPLIKRPTRNSAEKWDALLYRHGLDGDSQVEAMLDKTICAMASVFIG 519

Query: 320  SSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            + GSTFTEDILRLR+ W SAS CDEYLCQ EQP+FI++NE
Sbjct: 520  APGSTFTEDILRLRRGWGSASHCDEYLCQGEQPNFIADNE 559


>gb|EXB64649.1| hypothetical protein L484_017982 [Morus notabilis]
          Length = 578

 Score =  764 bits (1973), Expect = 0.0
 Identities = 389/592 (65%), Positives = 464/592 (78%), Gaps = 19/592 (3%)
 Frame = -2

Query: 1919 MGRRATSDEEQEGEDRERLMEPNERKIAG---SSFEIG-------EFKNKITRFYNS--- 1779
            M R+ +S +E +  DRE L+E NERK+     S+F I        EF+++I R  +S   
Sbjct: 1    MERKDSSSDEDD--DRENLIEQNERKLQNHPRSTFHIDDVDGGNREFRSRIRRRLSSLGL 58

Query: 1778 -NKRYLFAICLPIFLIVVYFSVDIGNLYRS-VSSVRIGYPSDKMREAELKALYLLREQHV 1605
             NK+++FAI LP+F++V++ S D+  L+ + +S VR    SD++RE+EL+AL+LLR+Q +
Sbjct: 59   LNKKFMFAIFLPLFIVVLFLSTDVRGLFSADLSGVRFDSFSDRLRESELRALFLLRQQQL 118

Query: 1604 GLLNLWNLTSSD---LKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSA 1434
            GL  LWN T  D   +  NS  N                 ++            DD + A
Sbjct: 119  GLFALWNQTFHDSPPISSNSTNNSSSSSSINSSASGTEQNSV-----------IDDLKFA 167

Query: 1433 VLKQIKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWN 1257
            VL+Q+ LNKE+QQVLLS HR GN + +  D  D   G    D CRKVD+  S R+T+EW 
Sbjct: 168  VLRQLSLNKEIQQVLLSPHRSGNSSSI-TDAGDPNLGGSDFDTCRKVDQKFSQRRTIEWK 226

Query: 1256 PKSNKFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKC 1077
            P SNKFLF IC+SGQMSN LICLEKHMFFAALLNR+LVIPS KVDYQY+RVLDIDHINKC
Sbjct: 227  PNSNKFLFAICLSGQMSNRLICLEKHMFFAALLNRVLVIPSSKVDYQYNRVLDIDHINKC 286

Query: 1076 FGRKVVVTFEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWA 897
             GRKVV++FE+F+E KKNHMHI+RFICY S PQ C+VD++H+KKLK LG++MGKLE AW 
Sbjct: 287  LGRKVVISFEDFAETKKNHMHINRFICYFSQPQPCYVDDEHIKKLKGLGLTMGKLESAWT 346

Query: 896  EDVKEPKKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRL 717
            ED+K P KR+V+DV+SKFS+NDDV+AIGDVFYADVEQEWVMQ GGPLAHKC+TL+EPSRL
Sbjct: 347  EDIKGPNKRTVQDVQSKFSTNDDVIAIGDVFYADVEQEWVMQPGGPLAHKCQTLIEPSRL 406

Query: 716  IMLTAQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPV 537
            IMLTAQRFIQTFLG+NFVALHFRRHGF+KFCNAK+PSCFFPIPQAADCIT VVERANAPV
Sbjct: 407  IMLTAQRFIQTFLGKNFVALHFRRHGFLKFCNAKQPSCFFPIPQAADCITSVVERANAPV 466

Query: 536  IYLSTDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLD 357
            IYLSTDAAESET LLQSLIV NGK VPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLD
Sbjct: 467  IYLSTDAAESETGLLQSLIVLNGKPVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLD 526

Query: 356  KTICAMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            KTICAMSSVFIG+ GSTFTEDILRLRKDW SAS+CD+YLCQ E+P+F+++NE
Sbjct: 527  KTICAMSSVFIGAPGSTFTEDILRLRKDWGSASSCDKYLCQGEEPNFVADNE 578


>ref|XP_006342883.1| PREDICTED: uncharacterized protein LOC102584575 [Solanum tuberosum]
          Length = 568

 Score =  732 bits (1889), Expect = 0.0
 Identities = 370/584 (63%), Positives = 460/584 (78%), Gaps = 16/584 (2%)
 Frame = -2

Query: 1904 TSDEEQEGEDRERLMEPNER------KIAGSSFEIGEFKNKIT---RF-YNSNKRYLFAI 1755
            +SDEE   +DRE L+  NER          S+F+I + K++     RF + S KRYL AI
Sbjct: 7    SSDEE---DDRENLIHQNERVNDLSKSPRRSTFQIEDVKDRFALCRRFNFTSGKRYLLAI 63

Query: 1754 CLPIFLIVVYFSVDIGNLYRS-VSSVRIGYPSDKMREAELKALYLLREQHVGLLNLWNLT 1578
             LP+ ++V+YF+ DI +L+++ V++++     + MR++EL+ALYLLR+Q +GL  LWN T
Sbjct: 64   ILPVLVLVLYFATDIKSLFQTTVTTIKYDGSVNSMRDSELRALYLLRQQQLGLFKLWNHT 123

Query: 1577 ----SSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLKQIKLN 1410
                +S     S +                      V  SS   + +D ++ +L+QI LN
Sbjct: 124  LVNDTSTTHTGSSLESTPG--------------FASVSRSS---IVEDLKADLLRQISLN 166

Query: 1409 KEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKSNKFLF 1233
            K++QQVLLSSH++GN     +++ D   G  GL RCRKVD  +S R+TVEW P+SNK+LF
Sbjct: 167  KQIQQVLLSSHQLGNSLITSDNSTDPTLG--GLSRCRKVDHNLSQRRTVEWKPRSNKYLF 224

Query: 1232 VICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGRKVVVT 1053
             ICVSGQMSNHLICLEKHMFFAALLNRILVIPS KVDY++ RVLD+DHINKC GR+V+VT
Sbjct: 225  AICVSGQMSNHLICLEKHMFFAALLNRILVIPSSKVDYEFRRVLDVDHINKCLGREVIVT 284

Query: 1052 FEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWAEDVKEPKK 873
            ++EF+E +K+H+HID+F+CY S PQ CF+DE+ VKKLKSLGISM KLE AW EDVK PKK
Sbjct: 285  YDEFAERRKSHLHIDKFLCYFSQPQPCFLDEERVKKLKSLGISMNKLEAAWNEDVKNPKK 344

Query: 872  RSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIMLTAQRF 693
            R+V+D+ +KFS++DDVLAIGDVF+ADVE++WVMQ GGP++HKCKTL+EPSRLIMLTAQRF
Sbjct: 345  RTVQDIMAKFSTDDDVLAIGDVFFADVEKDWVMQPGGPISHKCKTLIEPSRLIMLTAQRF 404

Query: 692  IQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIYLSTDAA 513
            IQTFLG NF+ALHFRRHGF+KFCNAKKPSCF+P+PQAADCI RV+ERAN+PVIYLSTDAA
Sbjct: 405  IQTFLGDNFIALHFRRHGFLKFCNAKKPSCFYPVPQAADCINRVLERANSPVIYLSTDAA 464

Query: 512  ESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMSS 333
            ESET LLQSL+V NGK VPLV+RPARNSAEKWDALLYRHGLEGD QV+AMLDKTICAMSS
Sbjct: 465  ESETGLLQSLVVVNGKTVPLVQRPARNSAEKWDALLYRHGLEGDPQVDAMLDKTICAMSS 524

Query: 332  VFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            VFIGSSGSTFT+DILRLRKDW SAS CDEYLCQ E P++++++E
Sbjct: 525  VFIGSSGSTFTDDILRLRKDWGSASLCDEYLCQGELPNYVADDE 568


>ref|XP_004235519.1| PREDICTED: uncharacterized protein LOC101268664 [Solanum
            lycopersicum]
          Length = 565

 Score =  728 bits (1879), Expect = 0.0
 Identities = 366/586 (62%), Positives = 456/586 (77%), Gaps = 13/586 (2%)
 Frame = -2

Query: 1919 MGRRATSDEEQEGEDRERLMEPNER------KIAGSSFEIGEFKNKIT---RF-YNSNKR 1770
            M  R +SDEE   +DRE L+  NER          S+F+I + K++     RF + S K 
Sbjct: 2    MRDRESSDEE---DDRENLIHQNERVNHLSKSPRPSTFQIEDVKDRFALCRRFNFTSGKT 58

Query: 1769 YLFAICLPIFLIVVYFSVDIGNLYRS-VSSVRIGYPSDKMREAELKALYLLREQHVGLLN 1593
            YL AI LP+ ++++YF+ DI  L+++ V++++     + MRE+EL+ALYLL++Q +GL  
Sbjct: 59   YLLAIILPLLVLILYFATDIKALFQTTVTTIKYDGSVNSMRESELRALYLLKQQQLGLFK 118

Query: 1592 LWNLTS-SDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLKQIK 1416
            LWN T  +D      +                        L S   + +D +  +L+QI 
Sbjct: 119  LWNHTLVNDTSTTHSLESAPGFT-----------------LVSRSSIVEDLKDDLLRQIS 161

Query: 1415 LNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKSNKF 1239
            LNK++QQVLLSSH++GN     +++ D   G  GL RCRKVD  +S+R+TVEW P+SNK+
Sbjct: 162  LNKQIQQVLLSSHQLGNSLITSDNSTDPSLG--GLGRCRKVDHNLSERRTVEWKPRSNKY 219

Query: 1238 LFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGRKVV 1059
            LF ICVSGQMSNHLICLEKHMFFAALLNR+LVIPS KVDY++ RVLD+DHINKC GR+V+
Sbjct: 220  LFAICVSGQMSNHLICLEKHMFFAALLNRVLVIPSSKVDYEFRRVLDVDHINKCLGREVI 279

Query: 1058 VTFEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWAEDVKEP 879
            VT++EF+E +K+H+HID+F+CY S PQ CF+DE+ VKKLKSLGISM KLE AW EDVK P
Sbjct: 280  VTYDEFAERRKSHLHIDKFLCYFSQPQPCFLDEERVKKLKSLGISMNKLEAAWDEDVKNP 339

Query: 878  KKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIMLTAQ 699
            KKR+ +D+ +KFS +DDVLAIGDVF+ADVE++WVMQ GGP++HKCKTL+EPSRLIMLTAQ
Sbjct: 340  KKRTAQDIVAKFSMDDDVLAIGDVFFADVEKDWVMQPGGPISHKCKTLIEPSRLIMLTAQ 399

Query: 698  RFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIYLSTD 519
            RF+QTFLG NF+ALHFRRHGF+KFCNAKKPSCF+P+PQAADCI RV+ERAN+PV+YLSTD
Sbjct: 400  RFVQTFLGDNFIALHFRRHGFLKFCNAKKPSCFYPVPQAADCINRVLERANSPVMYLSTD 459

Query: 518  AAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAM 339
            AAESET LLQSL+V NGK VPLV+RPARNSAEKWDALLYRHGLEGD QVEAMLDKTICAM
Sbjct: 460  AAESETGLLQSLVVFNGKTVPLVQRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTICAM 519

Query: 338  SSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            SSVFIGSSGSTFT+DILRLRKDW SAS CDEYLCQ E P+F++++E
Sbjct: 520  SSVFIGSSGSTFTDDILRLRKDWGSASLCDEYLCQGELPNFVADDE 565


>ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208722 [Cucumis sativus]
            gi|449517914|ref|XP_004165989.1| PREDICTED:
            uncharacterized protein LOC101230373 [Cucumis sativus]
          Length = 573

 Score =  714 bits (1843), Expect = 0.0
 Identities = 369/591 (62%), Positives = 449/591 (75%), Gaps = 22/591 (3%)
 Frame = -2

Query: 1907 ATSDEEQEGEDRERLMEPNERK------IAGSSFEIGE---FKNKITRF--------YNS 1779
            ++SDEE   +DR+ L+E N+ K         ++F+I +   F+  I RF        ++ 
Sbjct: 6    SSSDEE---DDRQSLVEHNDIKPHPSPPTHSTTFDIDDDPHFRPPIPRFPFSIPKFAFDK 62

Query: 1778 NKRYLFAICLPIFLIVVYFSVDIGNLYRSVSSVRIGYP---SDKMREAELKALYLLREQH 1608
               YL A  LP+ ++V++FSVDI +L+ +  S  +      +D+MRE+EL ALYLLR+Q 
Sbjct: 63   RYYYLLAAALPLCILVLFFSVDITSLFSTTLSSTLKTSDSLTDRMRESELTALYLLRQQQ 122

Query: 1607 VGLLNLWNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVL 1428
            +G  +LWN  S  L+ NS  N                      +LSS   L +  +SA+L
Sbjct: 123  LGFFHLWN-HSLFLQSNSSFNSTPSN-----------------NLSSNSALTEYIKSALL 164

Query: 1427 KQIKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPK 1251
            KQI LNKE+Q VLLS HR GNL+E   D        + LDRCRK+D+ +SDR+T+EW PK
Sbjct: 165  KQITLNKEIQNVLLSPHRSGNLSEEVGDALP--MDTFALDRCRKMDQKLSDRRTIEWKPK 222

Query: 1250 SNKFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFG 1071
            SNKFLF IC SGQMSNHLICLEKHMFFAA+LNR+LVIPS KVDYQ+SRV+DID +N C G
Sbjct: 223  SNKFLFAICTSGQMSNHLICLEKHMFFAAILNRVLVIPSHKVDYQFSRVIDIDRMNMCLG 282

Query: 1070 RKVVVTFEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWAED 891
            RKVV++FEEFSEIKK+H+HIDRFICY S P  C+VD++H+ KLK+LGISMGKLE AW ED
Sbjct: 283  RKVVISFEEFSEIKKHHLHIDRFICYFSKPNPCYVDDEHISKLKNLGISMGKLESAWNED 342

Query: 890  VKEPKKRSVEDVKSKFSSN-DDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLI 714
             K P +++V DV+SKFSSN DDV+A+GD+F+A+VEQEWV Q GGP+AHKC+TL+EPS LI
Sbjct: 343  TKHPNRKTVSDVESKFSSNNDDVIAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSHLI 402

Query: 713  MLTAQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVI 534
             LTAQRFIQTFLG+N++ALHFRRHGF+KFCNAK+PSCF+PIPQAADCI R+VERAN PVI
Sbjct: 403  KLTAQRFIQTFLGKNYIALHFRRHGFLKFCNAKQPSCFYPIPQAADCIIRMVERANVPVI 462

Query: 533  YLSTDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDK 354
            YLSTDAAESE  LLQSL+V NGK +PLVKRP RNSAEKWDALLYRHGLE DSQVEAMLDK
Sbjct: 463  YLSTDAAESEHGLLQSLLVLNGKPIPLVKRPPRNSAEKWDALLYRHGLEEDSQVEAMLDK 522

Query: 353  TICAMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            TICAMSS FIG+ GSTFTEDILRLRKDW +AS CDEYLCQ E+P+FISENE
Sbjct: 523  TICAMSSTFIGAPGSTFTEDILRLRKDWGTASMCDEYLCQGEEPNFISENE 573


>ref|XP_004303772.1| PREDICTED: uncharacterized protein LOC101299396 [Fragaria vesca
            subsp. vesca]
          Length = 556

 Score =  714 bits (1842), Expect = 0.0
 Identities = 377/590 (63%), Positives = 446/590 (75%), Gaps = 22/590 (3%)
 Frame = -2

Query: 1904 TSDEEQEGEDRERLMEPNERKI-----AGSSFEIGE-------FKNKITRFYNS------ 1779
            +SD+E E +DR+ L+E N+RK      + ++F I +          +I R + S      
Sbjct: 8    SSDDEVE-DDRQNLIEQNDRKQLPSPRSATTFHIDDGDVDRHRHHREIRRRFASLNLRDL 66

Query: 1778 -NKR--YLFAICLPIFLIVVYFSVDIGNLYRSVSSVRIGYPSDKMREAELKALYLLREQH 1608
             NKR   +F I +P+F++V++FS DI +L+ S  SV     S K+RE+EL+ALYLLR+Q 
Sbjct: 67   FNKRSFLVFFIFIPLFVLVLFFSTDIKSLFFSHLSVSDSV-SGKLRESELRALYLLRQQQ 125

Query: 1607 VGLLNLWNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVL 1428
            +GL  LWN TS+                               DL       DD +S+VL
Sbjct: 126  LGLFGLWNSTSNH---------------------------SNPDL-------DDLKSSVL 151

Query: 1427 KQIKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPK 1251
            +QI LNKE+QQVLLS H  GN +E E D  D   G    DRCR VD+  S+R+T+EW P 
Sbjct: 152  RQISLNKEIQQVLLSPHSSGNSSESE-DFRDPSLG----DRCRVVDQRFSERRTIEWKPN 206

Query: 1250 SNKFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFG 1071
            S+K+L  ICVSGQMSNHLICLEKHMFFAALLNRILVIPS KVDYQYS VLDI+HINKC G
Sbjct: 207  SDKYLLAICVSGQMSNHLICLEKHMFFAALLNRILVIPSSKVDYQYSTVLDIEHINKCIG 266

Query: 1070 RKVVVTFEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWAED 891
            RKVVVTFEE +E KKNH+HIDRFICY S P  C+VD++H+KKLK+LGIS    EPAW ED
Sbjct: 267  RKVVVTFEELAEEKKNHIHIDRFICYFSKPTLCYVDDEHLKKLKALGISYKSREPAWGED 326

Query: 890  VKEPKKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIM 711
            VK+P K++V+DV+SKFSS D+V+AIGDVF+AD EQ+WVMQ GGPLAHKCKTL+EPSRLI+
Sbjct: 327  VKKPSKKTVQDVQSKFSSGDEVIAIGDVFFADAEQDWVMQPGGPLAHKCKTLIEPSRLIL 386

Query: 710  LTAQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIY 531
            LTAQRFIQTFLG+NFVALHFRRHGF+KFCN K+PSCF+PIPQAADCITR+ ERANAPV+Y
Sbjct: 387  LTAQRFIQTFLGKNFVALHFRRHGFLKFCNNKQPSCFYPIPQAADCITRIAERANAPVVY 446

Query: 530  LSTDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKT 351
            LSTDAAESET LLQSL+V NGK VPLVKRPARNSAEKWDALLYRHG+EGD QVEAMLDKT
Sbjct: 447  LSTDAAESETGLLQSLVVVNGKTVPLVKRPARNSAEKWDALLYRHGIEGDPQVEAMLDKT 506

Query: 350  ICAMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            I AMSSVFIG+SGSTFTEDILRLRK W SAS CDEYLCQ E+P+FI+ENE
Sbjct: 507  ISAMSSVFIGASGSTFTEDILRLRKGWGSASVCDEYLCQGEEPNFIAENE 556


>ref|XP_002533327.1| conserved hypothetical protein [Ricinus communis]
            gi|223526849|gb|EEF29063.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 565

 Score =  714 bits (1842), Expect = 0.0
 Identities = 368/590 (62%), Positives = 444/590 (75%), Gaps = 20/590 (3%)
 Frame = -2

Query: 1910 RATSDEEQEGEDRERLMEPNERKIAG---------------SSFEIGEFKNKITRFYNSN 1776
            R +SDEE   +DRE L+E N+RK                  S+F I E+   I R    N
Sbjct: 3    RDSSDEE---DDRENLIEQNDRKHHNHQQTVPTSSPHRRSFSTFHIEEYGGVIRRRL-FN 58

Query: 1775 KRY---LFAICLPIFLIVVYFSVDIGNLYRS-VSSVRIGYPSDKMREAELKALYLLREQH 1608
            KRY   L AI LP+ +I+VYFS D+ +L+ + +SS+     SD+MREAEL+ALYLL +Q 
Sbjct: 59   KRYYYYLLAIFLPLLIIIVYFSADLRSLFSANISSLNFNSASDRMREAELQALYLLEQQQ 118

Query: 1607 VGLLNLWNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVL 1428
            + LL+++N +      N   N                        S   +  ++FRSA+L
Sbjct: 119  LSLLSIFNQSFPSRNKNFSSNSSFIN-------------------SFDNVKIENFRSALL 159

Query: 1427 KQIKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPK 1251
            KQ+  NK++QQ+LLS H+ GN    EN +       +G DRC+KV+    DRKT+EW P+
Sbjct: 160  KQMTFNKQIQQILLSPHKSGN----ENVSGSFSGSGFGFDRCKKVESRFLDRKTIEWKPR 215

Query: 1250 SNKFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFG 1071
            S+KFLF IC+SGQMSNHLICLEKHMFFAALLNR+LV+PS K DYQY+RVLDI+HIN C G
Sbjct: 216  SDKFLFPICLSGQMSNHLICLEKHMFFAALLNRVLVMPSSKFDYQYNRVLDIEHINLCVG 275

Query: 1070 RKVVVTFEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWAED 891
            RKVVVTFEEF +++KNH+HIDRFICY SSP  C+VDE+HVKKLK LGI MGK E  W ED
Sbjct: 276  RKVVVTFEEFVQMRKNHVHIDRFICYFSSPTACYVDEEHVKKLKGLGILMGKPESPWKED 335

Query: 890  VKEPKKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIM 711
            VK+P +++V+DV +KF+SNDDV+AIGDVFYAD+EQ+WVMQ GGPLAHKCKTL+EPSRLI+
Sbjct: 336  VKKPSQKTVQDVLAKFTSNDDVIAIGDVFYADMEQDWVMQPGGPLAHKCKTLIEPSRLIL 395

Query: 710  LTAQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIY 531
            +TAQRFIQTFLG+NF+ALHFRRHGF+KFCNAK PSCF+PIPQAADCI RV ERANAPVIY
Sbjct: 396  VTAQRFIQTFLGKNFIALHFRRHGFLKFCNAKNPSCFYPIPQAADCIARVAERANAPVIY 455

Query: 530  LSTDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKT 351
            LSTDAAESETDLLQSLI+ NGK VPLVKRP+  S EKWD+LL RHG+E DSQVEAMLDKT
Sbjct: 456  LSTDAAESETDLLQSLIIVNGKTVPLVKRPSHTSVEKWDSLLSRHGIEDDSQVEAMLDKT 515

Query: 350  ICAMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            I AMS+VFIG+SGSTFTEDILRLRKDW SAS CDEYLCQ E P+FI+E+E
Sbjct: 516  ISAMSNVFIGASGSTFTEDILRLRKDWESASLCDEYLCQGELPNFIAEDE 565


>ref|XP_007024790.1| O-fucosyltransferase family protein isoform 1 [Theobroma cacao]
            gi|508780156|gb|EOY27412.1| O-fucosyltransferase family
            protein isoform 1 [Theobroma cacao]
          Length = 558

 Score =  712 bits (1838), Expect = 0.0
 Identities = 361/585 (61%), Positives = 437/585 (74%), Gaps = 19/585 (3%)
 Frame = -2

Query: 1898 DEEQEGEDRERLMEPNERK--------------IAGSSFEIGEFKNKITRFYNS--NKRY 1767
            D   E +DR+ L+  N+ K                 SSF I E +++I R +    NKRY
Sbjct: 4    DSSDEDDDRQTLIHQNDTKNLPHQIPASPRPSTSPRSSFHIEELESQIRRRFKLTFNKRY 63

Query: 1766 LFAICLPIFLIVVYFSVDIGNLYRS-VSSVRIGYPSDKMREAELKALYLLREQHVGLLNL 1590
            LFAI LP+ +I +YFS DI +L+ S +SS++    SD++RE++L+ALYLL +Q   LL+L
Sbjct: 64   LFAIFLPLLIIPIYFSTDIRSLFSSNISSLKFNTVSDRIRESQLQALYLLNQQQNSLLSL 123

Query: 1589 WNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLKQIKLN 1410
            WN T     +NS  N                          T + FDD ++++L QI LN
Sbjct: 124  WNHTF----VNSNNN-------------------------ITAVQFDDIKASLLTQITLN 154

Query: 1409 KEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKSNKFLF 1233
            K +QQ+LLS H+ GN  +      D  F  Y  DRCRKVD+  ++RKT EW PK NKFLF
Sbjct: 155  KHIQQILLSPHKTGNSPQ-NGTLLDPNFAGYSFDRCRKVDQKFAERKTFEWKPKPNKFLF 213

Query: 1232 VICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGRKVVVT 1053
             IC+SGQMSNHLICLEKHMFFAA+LNR LVIPS + DYQY+RVLDI+HIN C G+K V+ 
Sbjct: 214  AICLSGQMSNHLICLEKHMFFAAVLNRALVIPSSRFDYQYNRVLDIEHINGCIGKKAVIP 273

Query: 1052 FEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWA-EDVKEPK 876
            FEEF EIKKNH HID+FICY SSPQ C+VDE+H+KKLKSLGIS GKLE AW  ED+K+P 
Sbjct: 274  FEEFMEIKKNHAHIDKFICYFSSPQPCYVDEEHLKKLKSLGISTGKLETAWKNEDIKKPS 333

Query: 875  KRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIMLTAQR 696
            +++++DV+ KF S+DDV+AIGDVFYADVE++WV+Q GGP+AHKCKTL+EPS+LI+LTA+R
Sbjct: 334  QKTIKDVEEKFGSDDDVIAIGDVFYADVERDWVLQPGGPIAHKCKTLIEPSKLILLTAER 393

Query: 695  FIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIYLSTDA 516
            FIQTFLG NF+ALHFRRHGF+KFCNAKKPSCF+PIPQAADCITR+VERAN PVIYLSTDA
Sbjct: 394  FIQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRMVERANTPVIYLSTDA 453

Query: 515  AESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMS 336
            AESET LLQS++V NGK +PLVKRP RNSAEKWDALLYRHGL  D QVEAMLDKTICAMS
Sbjct: 454  AESETSLLQSMVVLNGKTIPLVKRPPRNSAEKWDALLYRHGLAEDPQVEAMLDKTICAMS 513

Query: 335  SVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            SVFIG+ GSTFT DILRLRKDW +AS CDEYLCQ E P+F +  E
Sbjct: 514  SVFIGAPGSTFTGDILRLRKDWGTASLCDEYLCQGEDPNFTAGEE 558


>ref|XP_007024791.1| O-fucosyltransferase family protein isoform 2 [Theobroma cacao]
            gi|508780157|gb|EOY27413.1| O-fucosyltransferase family
            protein isoform 2 [Theobroma cacao]
          Length = 559

 Score =  707 bits (1826), Expect = 0.0
 Identities = 361/586 (61%), Positives = 437/586 (74%), Gaps = 20/586 (3%)
 Frame = -2

Query: 1898 DEEQEGEDRERLMEPNERK--------------IAGSSFEIGEFKNKITRFYNS--NKRY 1767
            D   E +DR+ L+  N+ K                 SSF I E +++I R +    NKRY
Sbjct: 4    DSSDEDDDRQTLIHQNDTKNLPHQIPASPRPSTSPRSSFHIEELESQIRRRFKLTFNKRY 63

Query: 1766 LFAICLPIFLIVVYFSVDIGNLYRS-VSSVRIGYPSDKMREAELKALYLLREQHVGLLNL 1590
            LFAI LP+ +I +YFS DI +L+ S +SS++    SD++RE++L+ALYLL +Q   LL+L
Sbjct: 64   LFAIFLPLLIIPIYFSTDIRSLFSSNISSLKFNTVSDRIRESQLQALYLLNQQQNSLLSL 123

Query: 1589 WNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLKQIKLN 1410
            WN T     +NS  N                          T + FDD ++++L QI LN
Sbjct: 124  WNHTF----VNSNNN-------------------------ITAVQFDDIKASLLTQITLN 154

Query: 1409 KEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKSNKFLF 1233
            K +QQ+LLS H+ GN  +      D  F  Y  DRCRKVD+  ++RKT EW PK NKFLF
Sbjct: 155  KHIQQILLSPHKTGNSPQ-NGTLLDPNFAGYSFDRCRKVDQKFAERKTFEWKPKPNKFLF 213

Query: 1232 VICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGRKVVVT 1053
             IC+SGQMSNHLICLEKHMFFAA+LNR LVIPS + DYQY+RVLDI+HIN C G+K V+ 
Sbjct: 214  AICLSGQMSNHLICLEKHMFFAAVLNRALVIPSSRFDYQYNRVLDIEHINGCIGKKAVIP 273

Query: 1052 FEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWA-EDVKEPK 876
            FEEF EIKKNH HID+FICY SSPQ C+VDE+H+KKLKSLGIS GKLE AW  ED+K+P 
Sbjct: 274  FEEFMEIKKNHAHIDKFICYFSSPQPCYVDEEHLKKLKSLGISTGKLETAWKNEDIKKPS 333

Query: 875  KRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIMLTAQR 696
            +++++DV+ KF S+DDV+AIGDVFYADVE++WV+Q GGP+AHKCKTL+EPS+LI+LTA+R
Sbjct: 334  QKTIKDVEEKFGSDDDVIAIGDVFYADVERDWVLQPGGPIAHKCKTLIEPSKLILLTAER 393

Query: 695  FIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIYLSTDA 516
            FIQTFLG NF+ALHFRRHGF+KFCNAKKPSCF+PIPQAADCITR+VERAN PVIYLSTDA
Sbjct: 394  FIQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRMVERANTPVIYLSTDA 453

Query: 515  AESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQ-VEAMLDKTICAM 339
            AESET LLQS++V NGK +PLVKRP RNSAEKWDALLYRHGL  D Q VEAMLDKTICAM
Sbjct: 454  AESETSLLQSMVVLNGKTIPLVKRPPRNSAEKWDALLYRHGLAEDPQVVEAMLDKTICAM 513

Query: 338  SSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            SSVFIG+ GSTFT DILRLRKDW +AS CDEYLCQ E P+F +  E
Sbjct: 514  SSVFIGAPGSTFTGDILRLRKDWGTASLCDEYLCQGEDPNFTAGEE 559


>ref|XP_002303337.1| protein-O-fucosyltransferase 2 [Populus trichocarpa]
            gi|222840769|gb|EEE78316.1| protein-O-fucosyltransferase
            2 [Populus trichocarpa]
          Length = 527

 Score =  705 bits (1820), Expect = 0.0
 Identities = 357/576 (61%), Positives = 437/576 (75%), Gaps = 6/576 (1%)
 Frame = -2

Query: 1910 RATSDEEQEGEDRERLMEPNERKIAGSSFEIGEFKNKITRFYNSNKRY-LFA---ICLPI 1743
            R +SDEE   +DRE L+E N+RK                  ++ N RY LFA   I LP+
Sbjct: 3    RDSSDEE---DDREHLIEQNDRK------------------HHQNGRYSLFAAAIIFLPL 41

Query: 1742 FLIVVYFSVDIGNLYRSVSSVRIGYP-SDKMREAELKALYLLREQHVGLLNLWNLTSSDL 1566
            F++ + FS DI NL+ +   +++G   S +MRE+EL+ALYLL++Q + L +LWN T +  
Sbjct: 42   FILFLSFSTDIRNLFST--HLKVGDSLSIRMRESELRALYLLKKQQLSLFSLWNSTGNST 99

Query: 1565 KLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLKQIKLNKEVQQVLL 1386
             L   +N                            + F+D +SA+LKQI LNKE+QQVLL
Sbjct: 100  LLEKDLNS---------------------------VSFEDLKSALLKQISLNKEIQQVLL 132

Query: 1385 SSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKSNKFLFVICVSGQM 1209
            + H  GN++   +D   S  G + + RC KVD+  +DRKT+EW PK NKFLF +C+SGQM
Sbjct: 133  APHESGNVSSSSSDLDFSNAGGF-VQRCEKVDQRFADRKTIEWKPKPNKFLFALCLSGQM 191

Query: 1208 SNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGRKVVVTFEEFSEIK 1029
            SNHLICLEKHMFFAALLNR+LVIPS + DYQY+RVLDI+H+N C GRKVVVTFEEF EI 
Sbjct: 192  SNHLICLEKHMFFAALLNRVLVIPSSRFDYQYNRVLDIEHVNDCLGRKVVVTFEEFVEIM 251

Query: 1028 KNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWAEDVKEPKKRSVEDVKS 849
            KN  HIDRF CY S P  C+VDE+HVKKLK LG+SMGKLE  W ED+K+P K +V+DV+ 
Sbjct: 252  KNKPHIDRFFCYFSDPTPCYVDEEHVKKLKGLGVSMGKLESPWKEDIKKPSKLTVKDVEG 311

Query: 848  KFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIMLTAQRFIQTFLGRN 669
            KF S+D+V+A+GDVF+ADVE+EW+MQ GGP+AHKCKTL+EP+R+IMLTAQRFIQTFLG N
Sbjct: 312  KFVSDDNVIAVGDVFFADVEEEWIMQPGGPIAHKCKTLIEPTRIIMLTAQRFIQTFLGSN 371

Query: 668  FVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIYLSTDAAESETDLLQ 489
            F+ALHFRRHGF+KFCNAKKPSCF+P+PQAADCI RVVERANAPV+YLSTDAAESET LLQ
Sbjct: 372  FIALHFRRHGFLKFCNAKKPSCFYPVPQAADCIARVVERANAPVVYLSTDAAESETGLLQ 431

Query: 488  SLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSSGS 309
            SL+V NG+ VPLV RP+RN+AEKWDALLYRHGL+ D+QVEAMLDKTICAMSSVFIG+SGS
Sbjct: 432  SLVVVNGRTVPLVTRPSRNAAEKWDALLYRHGLQEDAQVEAMLDKTICAMSSVFIGASGS 491

Query: 308  TFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            TFTEDI RLRK W SAS+CDEYLCQ E P++I+ENE
Sbjct: 492  TFTEDIFRLRKGWESASSCDEYLCQGELPNYIAENE 527


>gb|EYU21259.1| hypothetical protein MIMGU_mgv1a003863mg [Mimulus guttatus]
          Length = 559

 Score =  702 bits (1813), Expect = 0.0
 Identities = 350/588 (59%), Positives = 438/588 (74%), Gaps = 15/588 (2%)
 Frame = -2

Query: 1919 MGRRATSDEEQEGEDRERLMEPNERKIAGSSFEIGEFKNKITRFYNS----------NKR 1770
            M  R +SDEE   ED E L+  N R            +    R              NKR
Sbjct: 1    MMERDSSDEE---EDHENLISQNARPNDVVKSPTNHTRRSALRIDGGGRLSGAARGFNKR 57

Query: 1769 YLFAICLPIFLIVVYFSVDIGNLYR----SVSSVRIGYPSDKMREAELKALYLLREQHVG 1602
            YL AI LP+ ++++YF+ D+ +L++    ++  +    P ++MRE+EL+ALYLL++Q + 
Sbjct: 58   YLLAILLPMVILILYFTTDLKSLFQMRIPTIKDIGGNSPLNRMRESELRALYLLKQQELQ 117

Query: 1601 LLNLWNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLKQ 1422
            LL +WN T+   + NS                          ++++    +D +S V  Q
Sbjct: 118  LLKMWNYTTLQNQSNSS------------------------SVNNSNSFDEDLKSRVFSQ 153

Query: 1421 IKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKSN 1245
            I LNK++Q +LLSSH      +L  +N D+    + +  C KVD+ +S+R+T+EW P+SN
Sbjct: 154  ISLNKQIQGILLSSHESEGFPDLNENNTDASLSGWNM--CGKVDQKLSERRTIEWKPRSN 211

Query: 1244 KFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGRK 1065
            K+L  ICVSGQMSNHLICLEKHMFFAALLNR+LVIPS KVD+ + RVLDI+ INKC GRK
Sbjct: 212  KYLLAICVSGQMSNHLICLEKHMFFAALLNRVLVIPSSKVDFPFHRVLDIETINKCLGRK 271

Query: 1064 VVVTFEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWAEDVK 885
            VVVTFEEF+EIKKNH+HID+F+CY S PQ CF+D+DH+KKLK LG+S+GK+E  W EDVK
Sbjct: 272  VVVTFEEFAEIKKNHLHIDKFMCYFSLPQPCFMDDDHLKKLKGLGLSLGKIETVWKEDVK 331

Query: 884  EPKKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIMLT 705
            +P +R V+DV +KFSS+DDV+A+GDVF+ADVE+EWVMQ GGP+AHKCKTL+EPSRLI+LT
Sbjct: 332  KPNQRKVDDVTAKFSSDDDVIAVGDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLT 391

Query: 704  AQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIYLS 525
            A RFIQTFLG++F+ALHFRRHGF+KFCNAK+PSCFFP+PQAA+CI RVVERAN PV+YLS
Sbjct: 392  AHRFIQTFLGKDFIALHFRRHGFLKFCNAKQPSCFFPVPQAAECINRVVERANTPVVYLS 451

Query: 524  TDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTIC 345
            TDAA SET LLQSL+V NGK VPLV+RPARN AEKWDALLYRHGLEGDSQVEAMLDKTIC
Sbjct: 452  TDAAASETGLLQSLVVWNGKTVPLVQRPARNLAEKWDALLYRHGLEGDSQVEAMLDKTIC 511

Query: 344  AMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            A+SSVFIGSSGSTFTEDILR+RKDW SAS CDEYLCQ E P+FI+E+E
Sbjct: 512  ALSSVFIGSSGSTFTEDILRIRKDWGSASVCDEYLCQGELPNFIAEDE 559


>ref|XP_006357428.1| PREDICTED: uncharacterized protein LOC102602087 [Solanum tuberosum]
          Length = 565

 Score =  698 bits (1802), Expect = 0.0
 Identities = 352/578 (60%), Positives = 440/578 (76%), Gaps = 12/578 (2%)
 Frame = -2

Query: 1898 DEEQEGEDRERLMEPNER------KIAGSSFEIGEFKNKITRFYNSNKR----YLFAICL 1749
            D   E ED+E L+   ER          ++F+I + +   TR +NS+      +L  I +
Sbjct: 6    DPSNEEEDQENLIAQRERGNNLSESPVRTAFQIDD-EIADTRPFNSSCSKCCYFLTIIVV 64

Query: 1748 PIFLIVVYFSVDIGNLYRSVSSVRIGYPSDKMREAELKALYLLREQHVGLLNLWNLTSSD 1569
             +F+ + +++ D+ N+  S + V      + MRE+EL+ALYLLR+Q +GL  LWN T  D
Sbjct: 65   TVFIFIRFYTTDVDNV--SKTGVMNNDSVNLMRESELRALYLLRQQQLGLFKLWNNTLID 122

Query: 1568 LKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLKQIKLNKEVQQVL 1389
              LN+                          L S+  L ++ +  ++ QI LNK++QQ L
Sbjct: 123  NSLNA--------------TAANNSNFVSTSLFSSA-LSEELKLELISQISLNKQIQQAL 167

Query: 1388 LSSHRVGNLTELENDNADSGFGNYG-LDRCRKVD-EISDRKTVEWNPKSNKFLFVICVSG 1215
            LSSH++GNL    ++  D    +YG LDRCRK+D ++SDR+T+EW P+S+K+LF IC SG
Sbjct: 168  LSSHQLGNLLNASDNATDPSLDDYGGLDRCRKMDYKLSDRRTIEWEPRSDKYLFAICASG 227

Query: 1214 QMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGRKVVVTFEEFSE 1035
            QMSNHLICLEKHMFFAALLNRIL+IPS +VDY++ RVLDIDHINKC GRKVVVTFEEF++
Sbjct: 228  QMSNHLICLEKHMFFAALLNRILIIPSSRVDYEFRRVLDIDHINKCLGRKVVVTFEEFAK 287

Query: 1034 IKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWAEDVKEPKKRSVEDV 855
             +K HMHID+FICY S PQ CF+D++HVKKLKSLG+SM KLE AW ED+K PK R+V+D+
Sbjct: 288  SQKGHMHIDKFICYFSQPQPCFLDDEHVKKLKSLGVSMNKLEAAWDEDIKNPKPRTVQDI 347

Query: 854  KSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIMLTAQRFIQTFLG 675
             +KFS +DDV+AIGDVF+A+VE++WVMQ GGP++HKCKTLVEPSRLI+LTAQRFIQTFLG
Sbjct: 348  MTKFSLDDDVIAIGDVFFANVEKKWVMQPGGPISHKCKTLVEPSRLILLTAQRFIQTFLG 407

Query: 674  RNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIYLSTDAAESETDL 495
            +NF+ALHFRRHGF+KFCNAKKPSCF+P+PQAADCI RVVERA APVIYLSTDAAESET +
Sbjct: 408  KNFIALHFRRHGFLKFCNAKKPSCFYPVPQAADCINRVVERATAPVIYLSTDAAESETGI 467

Query: 494  LQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSS 315
            LQSL+  NGK VPLV+RPA+NSAEKWDALLYRHGLEGD QVEAMLDKTICAMS VFIGS 
Sbjct: 468  LQSLVAVNGKTVPLVRRPAQNSAEKWDALLYRHGLEGDRQVEAMLDKTICAMSEVFIGSM 527

Query: 314  GSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            GSTFTEDILRLRKDW ++S CDEYLC+ E P FI+++E
Sbjct: 528  GSTFTEDILRLRKDWGTSSLCDEYLCRGEVPSFIADDE 565


>ref|XP_006465793.1| PREDICTED: uncharacterized protein LOC102617227 [Citrus sinensis]
          Length = 563

 Score =  684 bits (1765), Expect = 0.0
 Identities = 348/595 (58%), Positives = 433/595 (72%), Gaps = 29/595 (4%)
 Frame = -2

Query: 1898 DEEQEGEDRERLMEPNERKIAG------------------SSFEIGEFKNK--ITRFYN- 1782
            D   + +DRE L+  N+ K                     S+F I +  N   I R +  
Sbjct: 4    DSSDDDDDRETLIHQNDTKHGNHRLPTSNNNEDEEHNRRHSTFHIDDLPNASPIRRRFTF 63

Query: 1781 -----SNKRYLFAICLPIFLIVVYFSVDIGNLYR-SVSSVRIGYPSDKMREAELKALYLL 1620
                 +NKRYLFA+ LP+ +I++YFSV++ +L+  +  + R    +D+MRE+EL+AL LL
Sbjct: 64   DFKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRALSLL 123

Query: 1619 REQHVGLLNLWNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFR 1440
            ++Q   LL+LWN +  +    +  N                              F D +
Sbjct: 124  KQQQSHLLSLWNQSFVNNSYGNNTNNP---------------------------FFQDAK 156

Query: 1439 SAVLKQIKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDEI-SDRKTVE 1263
            SA+L QI LNK+++Q+LLS H+V N T   ND        +G + CRKVD I  +++TVE
Sbjct: 157  SALLNQISLNKQIEQILLSPHKVSNFTP--NDAV------WGFEGCRKVDSIIPNKRTVE 208

Query: 1262 WNPKSNKFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHIN 1083
            W PKS+KFLF IC+SGQMSNHLICLEKHMF AALLNR+LVIPS K DYQYSRVLDI+HIN
Sbjct: 209  WKPKSDKFLFAICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHIN 268

Query: 1082 KCFGRKVVVTFEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPA 903
             C GRKVVV+FE F E++KNH HIDRF+CY   P+ CFVD++H+KKLK LGISMGK E  
Sbjct: 269  DCLGRKVVVSFENFMEMEKNHAHIDRFLCYFGLPEPCFVDDEHIKKLKQLGISMGKTETV 328

Query: 902  WA-EDVKEPKKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEP 726
            W  ED ++P KR+V+D++ KF ++DDV+A+GD+FYADVE++WVMQ GGP+ H+CKTL+EP
Sbjct: 329  WKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEP 388

Query: 725  SRLIMLTAQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERAN 546
            SRLIM+TAQRF+QTFLG NF+ALHFRRHGF+KFCNAKKPSCF+PIPQAADCITR+ ERAN
Sbjct: 389  SRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAN 448

Query: 545  APVIYLSTDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEA 366
            APVIYLSTDAAESET LLQSL+V NGK + LVKRP RNSAEKWD+LLYRH LE DSQVEA
Sbjct: 449  APVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEA 508

Query: 365  MLDKTICAMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            MLDKTICAMS+VFIG+SGSTFTEDI+RLRKDW S S CDEYLCQ E+P+FI+E+E
Sbjct: 509  MLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563


>ref|XP_006426814.1| hypothetical protein CICLE_v10025289mg [Citrus clementina]
            gi|557528804|gb|ESR40054.1| hypothetical protein
            CICLE_v10025289mg [Citrus clementina]
          Length = 563

 Score =  684 bits (1765), Expect = 0.0
 Identities = 347/595 (58%), Positives = 433/595 (72%), Gaps = 29/595 (4%)
 Frame = -2

Query: 1898 DEEQEGEDRERLMEPNERKIAG------------------SSFEIGEFKNK--ITRFYN- 1782
            D   + +DRE L+  N+ K                     S+F I +F N   I R +  
Sbjct: 4    DSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRFTF 63

Query: 1781 -----SNKRYLFAICLPIFLIVVYFSVDIGNLYR-SVSSVRIGYPSDKMREAELKALYLL 1620
                 +NKRYLFA+ LP+ +I++YFSV++ +L+  +  + R    +D+MRE+EL+AL LL
Sbjct: 64   DFKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRALSLL 123

Query: 1619 REQHVGLLNLWNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFR 1440
            ++Q   LL+LWN +  +    +  N                              F + +
Sbjct: 124  KQQQSHLLSLWNQSFVNNSYGNNTNNP---------------------------FFQEAK 156

Query: 1439 SAVLKQIKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDEI-SDRKTVE 1263
            S +L QI LN++++Q+LLS H+V N T   ND        +GL+ CRK+D I  +++TVE
Sbjct: 157  SVLLNQISLNRQIEQILLSPHKVSNFTP--NDAV------WGLESCRKIDSIIPNKRTVE 208

Query: 1262 WNPKSNKFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHIN 1083
            W PKS+KFLF IC+SGQMSNHLICLEKHMF AALLNR+LVIPS K DYQYSRVLDI+HIN
Sbjct: 209  WKPKSDKFLFAICLSGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHIN 268

Query: 1082 KCFGRKVVVTFEEFSEIKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPA 903
             C GRKVVV+FE F E+KKNH HIDRF+CY   PQ CFVD++H+KKLK LGISMGK E  
Sbjct: 269  DCLGRKVVVSFENFMEMKKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETV 328

Query: 902  WA-EDVKEPKKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEP 726
            W  ED ++P KR+V+D++ KF ++DDV+A+GD+FYADVE++WVMQ GGP+ H+CKTL+EP
Sbjct: 329  WKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEP 388

Query: 725  SRLIMLTAQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERAN 546
            SRLIM+TAQRF+QTFLG NF+ALHFRRHGF+KFCNAKKPSCF+PIPQAADCITR+ ERA 
Sbjct: 389  SRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAK 448

Query: 545  APVIYLSTDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEA 366
            APVIYLSTDAAESET LLQSL+V NGK + LVKRP RNSAEKWD+LLYRH LE DSQVEA
Sbjct: 449  APVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEA 508

Query: 365  MLDKTICAMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            MLDKTICAMS+VFIG+SGSTFTEDI+RLRKDW S S CDEYLCQ E+P+FI+E+E
Sbjct: 509  MLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563


>ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arabidopsis lyrata subsp.
            lyrata] gi|297311638|gb|EFH42062.1| hypothetical protein
            ARALYDRAFT_918074 [Arabidopsis lyrata subsp. lyrata]
          Length = 566

 Score =  681 bits (1756), Expect = 0.0
 Identities = 356/594 (59%), Positives = 446/594 (75%), Gaps = 24/594 (4%)
 Frame = -2

Query: 1910 RATSDEEQEGEDRERLMEPNERKIAG-----------------SSFEIGEFKNKITRFY- 1785
            R +SD+E   ED + L+  N+ +I                   S+F+I +   ++ R + 
Sbjct: 3    RNSSDDE---EDHQHLIPQNDTRIRHREDPISSTATTTGGNQRSAFQIEDILQRVQRRWK 59

Query: 1784 -NSNKRYLFA-ICLPIFLIVVYFSVDIGNLYRS-VSSVRIGYPSDKMREAELKALYLLRE 1614
             + NKRY+   + L I + +++   D   L+ +  SS ++   S++++E+EL+ALYLLR+
Sbjct: 60   ISLNKRYVIVFVSLIISIGLLFLLTDPRELFSANFSSFKLDPLSNRVKESELRALYLLRQ 119

Query: 1613 QHVGLLNLWNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSA 1434
            Q + LL+LWN T  +  LN   N                      DL S++ LF+D +SA
Sbjct: 120  QQLALLSLWNGTLVNPSLNQSEN----------------------DLRSSV-LFEDVKSA 156

Query: 1433 VLKQIKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWN 1257
            V KQI LNKE+Q VLLS HR  N +       DS   N+  DRCRKVD+ +SDRKTVEW 
Sbjct: 157  VSKQISLNKEIQNVLLSPHRSSNYSG--GTEVDSV--NFSYDRCRKVDQKLSDRKTVEWK 212

Query: 1256 PKSNKFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKC 1077
            P+S+KFLF IC+SGQMSNHLICLEKHMFFAALL+R+LVIPS K DYQY RV+DI+ IN C
Sbjct: 213  PRSDKFLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIEGINTC 272

Query: 1076 FGRKVVVTFEEFSE-IKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISM-GKLEPA 903
             GR VVV+F++F E  KKNH  IDRFICY SSPQ C+VDE+H+KKLK LGIS+ GKLE  
Sbjct: 273  LGRNVVVSFDQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAP 332

Query: 902  WAEDVKEPKKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPS 723
            W+ED+K+P KR+V+DV++KF S+DDV+AIGDVFYAD+EQ+WVMQ GGP+ HKCKTL+EPS
Sbjct: 333  WSEDIKKPSKRTVQDVQTKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPS 392

Query: 722  RLIMLTAQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANA 543
            +LI+LTAQRFIQTFLG+NF+ALHFRRHGF+KFCNAK PSCF+PIPQAA+CI R+VER+N 
Sbjct: 393  KLILLTAQRFIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNG 452

Query: 542  PVIYLSTDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAM 363
             VIYLSTDAAESET LLQSL+V +GK VPLVKRP RNSAEKWDALLYRHG+E DSQV+AM
Sbjct: 453  AVIYLSTDAAESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAM 512

Query: 362  LDKTICAMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            LDKTICAMSSVFIG+SGSTFTEDILRLRKDW ++S CDEYLC+ E+P+FI+E+E
Sbjct: 513  LDKTICAMSSVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566


>ref|XP_004510588.1| PREDICTED: uncharacterized protein LOC101496484 [Cicer arietinum]
          Length = 549

 Score =  680 bits (1754), Expect = 0.0
 Identities = 341/578 (58%), Positives = 428/578 (74%), Gaps = 9/578 (1%)
 Frame = -2

Query: 1907 ATSDEEQEGEDRERLMEPNERK------IAGSSFEIGEFKNKITRF-YNSNKRYLFAICL 1749
            ++SDEE   +D   L+  N  K      I  ++F + +  ++  R  +   K+Y+ AI +
Sbjct: 3    SSSDEE---DDHHNLIHQNSTKPRTPPSITAATFHVDDLNSRFRRANFKFQKKYIIAIIV 59

Query: 1748 PIFLIVVYFSVDIGNLYRSVSSVRIGYPSDKMREAELKALYLLREQHVGLLNLWNLTSSD 1569
             + ++++ FS+     + S +S      SD+M+E+EL+A+YLLR+Q + LL ++N  S  
Sbjct: 60   -LLIVILLFSIPNLRRHFSTASFISDSVSDRMKESELRAIYLLRQQQLSLLTVFNRNSQS 118

Query: 1568 LKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLKQIKLNKEVQQVL 1389
                                          D + T  L +D +SA+ KQI +N E+QQ+L
Sbjct: 119  ---------------------------NTSDPNQTPNLIEDLKSALSKQISINSEIQQIL 151

Query: 1388 LSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKSNKFLFVICVSGQ 1212
            L+ HR+GN+ + E D  +    N   D CR +D+ +S RKTVEWNPK  KFL  ICVSGQ
Sbjct: 152  LNPHRIGNVFDPEFDFGNVNVSNGNYDTCRTIDQNLSKRKTVEWNPKEGKFLLAICVSGQ 211

Query: 1211 MSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGRKVVVTFEEFSEI 1032
            MSNHLICLEKHMFFAALLNR+LVIPS K DYQY RV++IDHINKC G+KVV++F+EFS +
Sbjct: 212  MSNHLICLEKHMFFAALLNRVLVIPSSKFDYQYDRVVNIDHINKCLGKKVVISFDEFSNV 271

Query: 1031 KKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWA-EDVKEPKKRSVEDV 855
            KK+H+HID+F+CY S PQ C++D++ +KKL  LG+SM K +  W  ED + PKK+SVEDV
Sbjct: 272  KKDHLHIDKFLCYFSLPQPCYLDDEKLKKLSGLGLSMSKPKAVWDDEDTRNPKKKSVEDV 331

Query: 854  KSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIMLTAQRFIQTFLG 675
             SKFS +DDV+AIGDVFYA+VE EWVMQ GGP+AHKCKTL+EP+RLI LTAQRFIQTFLG
Sbjct: 332  MSKFSYDDDVMAIGDVFYAEVEHEWVMQPGGPIAHKCKTLIEPNRLITLTAQRFIQTFLG 391

Query: 674  RNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIYLSTDAAESETDL 495
            RNF+ALHFRRHGF+KFCNAKKPSCF+PIPQAADCI RVVERA+AP+IYLSTDAA+SET L
Sbjct: 392  RNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCILRVVERADAPIIYLSTDAAQSETGL 451

Query: 494  LQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSS 315
            LQSL+V NGK VPLV RPARNSAEKWDALLYRHG+EGD+QVEAMLDKTICAMSSVFIG+ 
Sbjct: 452  LQSLVVLNGKPVPLVIRPARNSAEKWDALLYRHGIEGDAQVEAMLDKTICAMSSVFIGAP 511

Query: 314  GSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            GSTFTEDI RLRKDW S S CDEYLCQ E+P+ ++ENE
Sbjct: 512  GSTFTEDIRRLRKDWGSLSMCDEYLCQGEEPNIVAENE 549


>ref|XP_006280247.1| hypothetical protein CARUB_v10026161mg [Capsella rubella]
            gi|482548951|gb|EOA13145.1| hypothetical protein
            CARUB_v10026161mg [Capsella rubella]
          Length = 568

 Score =  679 bits (1751), Expect = 0.0
 Identities = 346/551 (62%), Positives = 431/551 (78%), Gaps = 7/551 (1%)
 Frame = -2

Query: 1832 SSFEIGEFKNKITRFY--NSNKRYLF-AICLPIFLIVVYFSVDIGNLYRS-VSSVRIGYP 1665
            S+F+I +   ++   +  + NKRY+  A+ L I + +++   D   L+ + +SS +    
Sbjct: 43   SAFQIEDIVQRVQHRWKISLNKRYVIVAVSLIISIGLLFILTDPRELFSANLSSFKRDPL 102

Query: 1664 SDKMREAELKALYLLREQHVGLLNLWNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIE 1485
            S++++E+EL+ALYLLR+Q + LL+LWN T  +  LN   N                    
Sbjct: 103  SNRVKESELRALYLLRQQQLALLSLWNGTLVNPSLNQSANAS------------------ 144

Query: 1484 KVDLSSTLMLFDDFRSAVLKQIKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDR 1305
               L S++ LF+D +SAV KQI LNKE+Q+VLLS HR  N +       DS   N   DR
Sbjct: 145  --SLESSV-LFEDVKSAVSKQISLNKEIQEVLLSPHRTANYSG--GTEVDSV--NLAYDR 197

Query: 1304 CRKVDE-ISDRKTVEWNPKSNKFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPK 1128
            CRKVD+ +SDR+TVEW P+S+KFLF IC+SGQMSNHLICLEKHMFFAALL+R+LVIPSPK
Sbjct: 198  CRKVDQNLSDRRTVEWKPRSDKFLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSPK 257

Query: 1127 VDYQYSRVLDIDHINKCFGRKVVVTFEEFSE-IKKNHMHIDRFICYVSSPQTCFVDEDHV 951
             DYQY RV+DI+ IN C GR VVV+F++F E  KKNH  IDRFICY SSPQ C+VDE+H+
Sbjct: 258  FDYQYDRVIDIERINTCLGRNVVVSFDQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHI 317

Query: 950  KKLKSLGISM-GKLEPAWAEDVKEPKKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVM 774
            KKLK LGIS+ GKLE  W+ED+K+P KR+V+DV++KF S+DDV+AIGDVFYAD+EQ+WVM
Sbjct: 318  KKLKGLGISIDGKLEAPWSEDIKKPSKRTVQDVQTKFKSDDDVIAIGDVFYADMEQDWVM 377

Query: 773  QQGGPLAHKCKTLVEPSRLIMLTAQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFP 594
            Q GGP+ HKCKTL+EPS+LI+LTAQRFIQTFLG+NF+ALHFRRHGF+KFCNAK PSCF+P
Sbjct: 378  QPGGPINHKCKTLIEPSKLILLTAQRFIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYP 437

Query: 593  IPQAADCITRVVERANAPVIYLSTDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWD 414
            IPQAA+CI R+VER+N  VIYLSTDAAESET LLQSL+V +GK VPLVKRP RNSAEKWD
Sbjct: 438  IPQAAECIARIVERSNGAVIYLSTDAAESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWD 497

Query: 413  ALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQ 234
            ALLYRHG+E DSQV+AMLDKTICAMSSVFIG+SGSTFTEDILRLRKDW ++S CDEYLC+
Sbjct: 498  ALLYRHGIEDDSQVDAMLDKTICAMSSVFIGASGSTFTEDILRLRKDWGTSSMCDEYLCR 557

Query: 233  SEQPDFISENE 201
             E+P+FI+E+E
Sbjct: 558  GEEPNFIAEDE 568


>ref|XP_003529861.1| PREDICTED: uncharacterized protein LOC100776069 [Glycine max]
          Length = 543

 Score =  677 bits (1748), Expect = 0.0
 Identities = 343/576 (59%), Positives = 429/576 (74%), Gaps = 7/576 (1%)
 Frame = -2

Query: 1907 ATSDEEQEGEDRERLMEPNERKIAG----SSFEIGEFKNKITRF-YNSNKRYLFAICLPI 1743
            ++SDEE   +D   L++ N RK       ++F + +  ++  R  +   K+Y+ AI   +
Sbjct: 3    SSSDEE---DDHRNLVDNNHRKPPSPPPSAAFHVEDLSSRFRRVSFALQKKYIIAILALL 59

Query: 1742 FLIVVYFSVDIGNLYRSVSSVRIGYPSDKMREAELKALYLLREQHVGLLNLWNLTSSDLK 1563
            FL++ +   D   L+ + SS +    +D+M+E+EL+A+ LL +Q   LL  WN T   L+
Sbjct: 60   FLLLFFSITDFHQLFSTPSSFKFDSITDRMKESELRAINLLYQQQQSLLTAWNHT---LR 116

Query: 1562 LNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLKQIKLNKEVQQVLLS 1383
             N+                            S   L +D +S++ KQI LN+E+QQ+LL+
Sbjct: 117  TNA----------------------------SDPNLLEDLKSSLFKQISLNREIQQILLN 148

Query: 1382 SHRVG-NLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKSNKFLFVICVSGQM 1209
             H  G N  E E D  ++       DRCR VD+ +S RKT+EWNP+  KFL  ICVSGQM
Sbjct: 149  PHSTGGNAIEPELD-LNATLNGVVYDRCRTVDQNLSQRKTIEWNPRDGKFLLAICVSGQM 207

Query: 1208 SNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGRKVVVTFEEFSEIK 1029
            SNHLICLEKHMFFAALLNR+LVIPS KVDYQY RV+DIDHINKC G+KVVV+FEEFS +K
Sbjct: 208  SNHLICLEKHMFFAALLNRVLVIPSSKVDYQYDRVVDIDHINKCLGKKVVVSFEEFSNLK 267

Query: 1028 KNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISMGKLEPAWAEDVKEPKKRSVEDVKS 849
            K H+HID+F+CY S PQ C++D++ +KKL +LG++M K E  W ED ++PKK++V+DV  
Sbjct: 268  KGHLHIDKFLCYFSHPQPCYLDDERLKKLGALGLTMSKPEAVWDEDTRKPKKKTVQDVLG 327

Query: 848  KFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLIMLTAQRFIQTFLGRN 669
            KFS +DDV+AIGDVFYA+VE+EWVMQ GGP+AHKCKTL+EP+RLI+LTAQRFIQTFLGRN
Sbjct: 328  KFSFDDDVMAIGDVFYAEVEREWVMQPGGPIAHKCKTLIEPNRLILLTAQRFIQTFLGRN 387

Query: 668  FVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVIYLSTDAAESETDLLQ 489
            F+ALHFRRHGF+KFCNAKKPSCF+PIPQAADCI RVVE A+AP+IYLSTDAAESET LLQ
Sbjct: 388  FIALHFRRHGFLKFCNAKKPSCFYPIPQAADCILRVVEMADAPIIYLSTDAAESETGLLQ 447

Query: 488  SLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGSSGS 309
            SL+V NG+ VPLV RPARNSAEKWDALLYRH ++GDSQVEAMLDKTICAMSSVFIG+ GS
Sbjct: 448  SLVVLNGRPVPLVIRPARNSAEKWDALLYRHNMDGDSQVEAMLDKTICAMSSVFIGAPGS 507

Query: 308  TFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            TFTEDILRLRKDW SAS CDEYLCQ E+P+ I+ENE
Sbjct: 508  TFTEDILRLRKDWGSASMCDEYLCQGEEPNIIAENE 543


>ref|NP_199853.1| O-fucosyltransferase family protein [Arabidopsis thaliana]
            gi|9758924|dbj|BAB09461.1| unnamed protein product
            [Arabidopsis thaliana] gi|133778858|gb|ABO38769.1|
            At5g50420 [Arabidopsis thaliana]
            gi|332008558|gb|AED95941.1| O-fucosyltransferase family
            protein [Arabidopsis thaliana]
            gi|591401714|gb|AHL38584.1| glycosyltransferase, partial
            [Arabidopsis thaliana]
          Length = 566

 Score =  677 bits (1747), Expect = 0.0
 Identities = 350/591 (59%), Positives = 440/591 (74%), Gaps = 21/591 (3%)
 Frame = -2

Query: 1910 RATSDEEQEGED-----------RERLMEPNERKIAG---SSFEIGEFKNKITRF--YNS 1779
            R +SD+E++ +            RE  +  N   I G   S+F+I +  +++      + 
Sbjct: 3    RNSSDDEEDHQHLIPQNDTRIRHREDSVSSNATTIGGNQRSAFQIDDILHRVQHRGKISL 62

Query: 1778 NKRYLFA-ICLPIFLIVVYFSVDIGNLYRS-VSSVRIGYPSDKMREAELKALYLLREQHV 1605
            NKRY+   + L I + +++   D   L+ +  SS ++   S++++E+EL+ALYLLR+Q +
Sbjct: 63   NKRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKLDPLSNRVKESELRALYLLRQQQL 122

Query: 1604 GLLNLWNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLK 1425
             LL+LWN T  +  LN   N                          + +LF+D +SAV K
Sbjct: 123  ALLSLWNGTLVNPSLNQSENAL-----------------------GSSVLFEDVKSAVSK 159

Query: 1424 QIKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKS 1248
            QI LNKE+Q+VLLS HR  N +       D    N+  +RCRKVD+ +SDRKTVEW P+S
Sbjct: 160  QISLNKEIQEVLLSPHRSSNYS----GGTDVDSVNFSYNRCRKVDQKLSDRKTVEWKPRS 215

Query: 1247 NKFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGR 1068
            +KFLF IC+SGQMSNHLICLEKHMFFAALL+R+LVIPS K DYQY RV+DI+ IN C GR
Sbjct: 216  DKFLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGR 275

Query: 1067 KVVVTFEEFSE-IKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISM-GKLEPAWAE 894
             VVV F++F E  KKNH  IDRFICY SSPQ C+VDE+H+KKLK LGIS+ GKLE  W+E
Sbjct: 276  NVVVAFDQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSE 335

Query: 893  DVKEPKKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLI 714
            D+K+P KR+V+DV+ KF S+DDV+AIGDVFYAD+EQ+WVMQ GGP+ HKCKTL+EPS+LI
Sbjct: 336  DIKKPSKRTVQDVQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLI 395

Query: 713  MLTAQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVI 534
            +LTAQRFIQTFLG+NF+ALHFRRHGF+KFCNAK PSCF+PIPQAA+CI R+VER+N  VI
Sbjct: 396  LLTAQRFIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVI 455

Query: 533  YLSTDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDK 354
            YLSTDAAESET LLQSL+V +GK VPLVKRP RNSAEKWDALLYRHG+E DSQV+AMLDK
Sbjct: 456  YLSTDAAESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDK 515

Query: 353  TICAMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            TICAMSSVFIG+SGSTFTEDILRLRKDW ++S CDEYLC+ E+P+FI+E+E
Sbjct: 516  TICAMSSVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566


>gb|AAM66093.1| unknown [Arabidopsis thaliana]
          Length = 566

 Score =  676 bits (1745), Expect = 0.0
 Identities = 349/591 (59%), Positives = 440/591 (74%), Gaps = 21/591 (3%)
 Frame = -2

Query: 1910 RATSDEEQEGED-----------RERLMEPNERKIAG---SSFEIGEFKNKITRF--YNS 1779
            R +SD+E++ +            RE  +  N   I G   S+F+I +  +++      + 
Sbjct: 3    RNSSDDEEDHQHLIPQNDTRIRHREDSVSSNATTIGGNQRSAFQIDDILHRVQHRGKISL 62

Query: 1778 NKRYLFA-ICLPIFLIVVYFSVDIGNLYRS-VSSVRIGYPSDKMREAELKALYLLREQHV 1605
            NKRY+   + L I + +++   D   L+ +  SS ++   S++++E+EL+ALYLLR+Q +
Sbjct: 63   NKRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKLDPLSNRVKESELRALYLLRQQQL 122

Query: 1604 GLLNLWNLTSSDLKLNSGINGXXXXXXXXXXXXXXXXTIEKVDLSSTLMLFDDFRSAVLK 1425
             LL+LWN T  +  LN   N                          + +LF+D +SAV K
Sbjct: 123  ALLSLWNGTLVNPSLNQSENAL-----------------------GSSVLFEDVKSAVSK 159

Query: 1424 QIKLNKEVQQVLLSSHRVGNLTELENDNADSGFGNYGLDRCRKVDE-ISDRKTVEWNPKS 1248
            QI LNKE+Q+VLLS HR  N +       D    N+  +RCRKVD+ +SDRKTVEW P+S
Sbjct: 160  QISLNKEIQEVLLSPHRSSNYS----GGTDVDSVNFSYNRCRKVDQKLSDRKTVEWKPRS 215

Query: 1247 NKFLFVICVSGQMSNHLICLEKHMFFAALLNRILVIPSPKVDYQYSRVLDIDHINKCFGR 1068
            +KFLF IC+SGQMSNHL+CLEKHMFFAALL+R+LVIPS K DYQY RV+DI+ IN C GR
Sbjct: 216  DKFLFAICLSGQMSNHLLCLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGR 275

Query: 1067 KVVVTFEEFSE-IKKNHMHIDRFICYVSSPQTCFVDEDHVKKLKSLGISM-GKLEPAWAE 894
             VVV F++F E  KKNH  IDRFICY SSPQ C+VDE+H+KKLK LGIS+ GKLE  W+E
Sbjct: 276  NVVVAFDQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSE 335

Query: 893  DVKEPKKRSVEDVKSKFSSNDDVLAIGDVFYADVEQEWVMQQGGPLAHKCKTLVEPSRLI 714
            D+K+P KR+V+DV+ KF S+DDV+AIGDVFYAD+EQ+WVMQ GGP+ HKCKTL+EPS+LI
Sbjct: 336  DIKKPSKRTVQDVQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLI 395

Query: 713  MLTAQRFIQTFLGRNFVALHFRRHGFMKFCNAKKPSCFFPIPQAADCITRVVERANAPVI 534
            +LTAQRFIQTFLG+NF+ALHFRRHGF+KFCNAK PSCF+PIPQAA+CI R+VER+N  VI
Sbjct: 396  LLTAQRFIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVI 455

Query: 533  YLSTDAAESETDLLQSLIVPNGKAVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDK 354
            YLSTDAAESET LLQSL+V +GK VPLVKRP RNSAEKWDALLYRHG+E DSQV+AMLDK
Sbjct: 456  YLSTDAAESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDK 515

Query: 353  TICAMSSVFIGSSGSTFTEDILRLRKDWRSASACDEYLCQSEQPDFISENE 201
            TICAMSSVFIG+SGSTFTEDILRLRKDW ++S CDEYLC+ E+P+FI+E+E
Sbjct: 516  TICAMSSVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566

