BLASTX nr result

ID: Mentha29_contig00020844 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00020844
         (2407 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21259.1| hypothetical protein MIMGU_mgv1a003863mg [Mimulus...   854   0.0  
ref|XP_006342883.1| PREDICTED: uncharacterized protein LOC102584...   753   0.0  
ref|XP_004235519.1| PREDICTED: uncharacterized protein LOC101268...   743   0.0  
gb|EPS60947.1| hypothetical protein M569_13853, partial [Genlise...   736   0.0  
ref|XP_006357428.1| PREDICTED: uncharacterized protein LOC102602...   712   0.0  
ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254...   708   0.0  
ref|XP_004242264.1| PREDICTED: uncharacterized protein LOC101262...   696   0.0  
gb|EXB64649.1| hypothetical protein L484_017982 [Morus notabilis]     691   0.0  
ref|XP_004303772.1| PREDICTED: uncharacterized protein LOC101299...   674   0.0  
ref|XP_003529861.1| PREDICTED: uncharacterized protein LOC100776...   670   0.0  
ref|XP_002533327.1| conserved hypothetical protein [Ricinus comm...   667   0.0  
ref|XP_002303337.1| protein-O-fucosyltransferase 2 [Populus tric...   666   0.0  
ref|XP_007024790.1| O-fucosyltransferase family protein isoform ...   666   0.0  
ref|XP_007024791.1| O-fucosyltransferase family protein isoform ...   661   0.0  
ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208...   660   0.0  
ref|NP_199853.1| O-fucosyltransferase family protein [Arabidopsi...   660   0.0  
gb|AAM66093.1| unknown [Arabidopsis thaliana]                         659   0.0  
ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arab...   659   0.0  
ref|XP_003547949.1| PREDICTED: uncharacterized protein LOC548046...   658   0.0  
ref|XP_004510588.1| PREDICTED: uncharacterized protein LOC101496...   657   0.0  

>gb|EYU21259.1| hypothetical protein MIMGU_mgv1a003863mg [Mimulus guttatus]
          Length = 559

 Score =  854 bits (2207), Expect = 0.0
 Identities = 428/561 (76%), Positives = 471/561 (83%)
 Frame = +2

Query: 272  NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKRYIFLIFLPVVIM 451
            NLISQNARPNDVVKSP +H  R   SA +ID     R+SGAAR FNKRY+  I LP+VI+
Sbjct: 15   NLISQNARPNDVVKSPTNHTRR---SALRIDGG--GRLSGAARGFNKRYLLAILLPMVIL 69

Query: 452  LVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLV 631
            ++Y TTDLK++F+MR+P+I++ GGN PL NRMRESELRALYLLKQQE++L+KMWNYTTL 
Sbjct: 70   ILYFTTDLKSLFQMRIPTIKDIGGNSPL-NRMRESELRALYLLKQQELQLLKMWNYTTLQ 128

Query: 632  ERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSTG 811
             +                       +DLKSRVF QIS+NKQIQGILLS+HESEG  D   
Sbjct: 129  NQSNSSSVNNSNSFD----------EDLKSRVFSQISLNKQIQGILLSSHESEGFPDLNE 178

Query: 812  NYTDSISSDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHMFFAA 991
            N TD+  S W  C KVD +L++R+TIEW P+S+KYL AICVSGQMSNHLICLEKHMFFAA
Sbjct: 179  NNTDASLSGWNMCGKVDQKLSERRTIEWKPRSNKYLLAICVSGQMSNHLICLEKHMFFAA 238

Query: 992  LLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFMCYFSM 1171
            LLNRVLVIPSSKVD+ FHRVLDI+ INKCLGRKVVVTFEEFAEIKKNHLHIDKFMCYFS+
Sbjct: 239  LLNRVLVIPSSKVDFPFHRVLDIETINKCLGRKVVVTFEEFAEIKKNHLHIDKFMCYFSL 298

Query: 1172 PQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSDVIAIGDVF 1351
            PQPCFMDD+               E VWKEDVK P  ++V+DV AKFSSD DVIA+GDVF
Sbjct: 299  PQPCFMDDDHLKKLKGLGLSLGKIETVWKEDVKKPNQRKVDDVTAKFSSDDDVIAVGDVF 358

Query: 1352 FADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFC 1531
            FADVEREWVMQPGGPIAHKCKTLIEPSRLILLTA RFIQTFLGKDF+ALHFRRHGFLKFC
Sbjct: 359  FADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAHRFIQTFLGKDFIALHFRRHGFLKFC 418

Query: 1532 NAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQR 1711
            NAKQPSCF+PVPQAAECINRVVERA+TPV+YLSTDAA SETGLLQSLVV NGKTVPLVQR
Sbjct: 419  NAKQPSCFFPVPQAAECINRVVERANTPVVYLSTDAAASETGLLQSLVVWNGKTVPLVQR 478

Query: 1712 PTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGS 1891
            P RN AEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFT+DILR+RKDWGS
Sbjct: 479  PARNLAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTEDILRIRKDWGS 538

Query: 1892 ASQCDEYLCQGELPNFIAEDE 1954
            AS CDEYLCQGELPNFIAEDE
Sbjct: 539  ASVCDEYLCQGELPNFIAEDE 559


>ref|XP_006342883.1| PREDICTED: uncharacterized protein LOC102584575 [Solanum tuberosum]
          Length = 568

 Score =  753 bits (1944), Expect = 0.0
 Identities = 372/566 (65%), Positives = 454/566 (80%), Gaps = 5/566 (0%)
 Frame = +2

Query: 272  NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFN----KRYIFLIFLP 439
            NLI QN R ND+ KSP        RS FQI+D  ++R +   R+FN    KRY+  I LP
Sbjct: 16   NLIHQNERVNDLSKSP-------RRSTFQIED-VKDRFA-LCRRFNFTSGKRYLLAIILP 66

Query: 440  VVIMLVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNY 619
            V+++++Y  TD+K++F+  V +I+  G      N MR+SELRALYLL+QQ++ L K+WN+
Sbjct: 67   VLVLVLYFATDIKSLFQTTVTTIKYDGS----VNSMRDSELRALYLLRQQQLGLFKLWNH 122

Query: 620  TTLVERXXXXXXXXXXXXXXXXXXXXA-MLQDLKSRVFGQISMNKQIQGILLSAHESEGS 796
            T + +                     + +++DLK+ +  QIS+NKQIQ +LLS+H+   S
Sbjct: 123  TLVNDTSTTHTGSSLESTPGFASVSRSSIVEDLKADLLRQISLNKQIQQVLLSSHQLGNS 182

Query: 797  VDSTGNYTDSISSDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKH 976
            + ++ N TD      +RC+KVD  L+ R+T+EW P+S+KYLFAICVSGQMSNHLICLEKH
Sbjct: 183  LITSDNSTDPTLGGLSRCRKVDHNLSQRRTVEWKPRSNKYLFAICVSGQMSNHLICLEKH 242

Query: 977  MFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFM 1156
            MFFAALLNR+LVIPSSKVDYEF RVLD+DHINKCLGR+V+VT++EFAE +K+HLHIDKF+
Sbjct: 243  MFFAALLNRILVIPSSKVDYEFRRVLDVDHINKCLGREVIVTYDEFAERRKSHLHIDKFL 302

Query: 1157 CYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSDVIA 1336
            CYFS PQPCF+D+ER              EA W EDVKNPK + V+D++AKFS+D DV+A
Sbjct: 303  CYFSQPQPCFLDEERVKKLKSLGISMNKLEAAWNEDVKNPKKRTVQDIMAKFSTDDDVLA 362

Query: 1337 IGDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHG 1516
            IGDVFFADVE++WVMQPGGPI+HKCKTLIEPSRLI+LTAQRFIQTFLG +F+ALHFRRHG
Sbjct: 363  IGDVFFADVEKDWVMQPGGPISHKCKTLIEPSRLIMLTAQRFIQTFLGDNFIALHFRRHG 422

Query: 1517 FLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTV 1696
            FLKFCNAK+PSCFYPVPQAA+CINRV+ERA++PVIYLSTDAA+SETGLLQSLVV+NGKTV
Sbjct: 423  FLKFCNAKKPSCFYPVPQAADCINRVLERANSPVIYLSTDAAESETGLLQSLVVVNGKTV 482

Query: 1697 PLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLR 1876
            PLVQRP RN+AEKWDALLYRHGLEGD QV+AMLDKTICA+SSVFIGSSGSTFTDDILRLR
Sbjct: 483  PLVQRPARNSAEKWDALLYRHGLEGDPQVDAMLDKTICAMSSVFIGSSGSTFTDDILRLR 542

Query: 1877 KDWGSASQCDEYLCQGELPNFIAEDE 1954
            KDWGSAS CDEYLCQGELPN++A+DE
Sbjct: 543  KDWGSASLCDEYLCQGELPNYVADDE 568


>ref|XP_004235519.1| PREDICTED: uncharacterized protein LOC101268664 [Solanum
            lycopersicum]
          Length = 565

 Score =  743 bits (1917), Expect = 0.0
 Identities = 370/565 (65%), Positives = 446/565 (78%), Gaps = 4/565 (0%)
 Frame = +2

Query: 272  NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFN----KRYIFLIFLP 439
            NLI QN R N + KSP         S FQI+D  ++R +   R+FN    K Y+  I LP
Sbjct: 16   NLIHQNERVNHLSKSP-------RPSTFQIED-VKDRFA-LCRRFNFTSGKTYLLAIILP 66

Query: 440  VVIMLVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNY 619
            ++++++Y  TD+K +F+  V +I+  G      N MRESELRALYLLKQQ++ L K+WN+
Sbjct: 67   LLVLILYFATDIKALFQTTVTTIKYDGS----VNSMRESELRALYLLKQQQLGLFKLWNH 122

Query: 620  TTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSV 799
            T + +                     ++++DLK  +  QIS+NKQIQ +LLS+H+   S+
Sbjct: 123  TLVNDTSTTHSLESAPGFTLVSRS--SIVEDLKDDLLRQISLNKQIQQVLLSSHQLGNSL 180

Query: 800  DSTGNYTDSISSDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHM 979
             ++ N TD       RC+KVD  L++R+T+EW P+S+KYLFAICVSGQMSNHLICLEKHM
Sbjct: 181  ITSDNSTDPSLGGLGRCRKVDHNLSERRTVEWKPRSNKYLFAICVSGQMSNHLICLEKHM 240

Query: 980  FFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFMC 1159
            FFAALLNRVLVIPSSKVDYEF RVLD+DHINKCLGR+V+VT++EFAE +K+HLHIDKF+C
Sbjct: 241  FFAALLNRVLVIPSSKVDYEFRRVLDVDHINKCLGREVIVTYDEFAERRKSHLHIDKFLC 300

Query: 1160 YFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSDVIAI 1339
            YFS PQPCF+D+ER              EA W EDVKNPK +  +D++AKFS D DV+AI
Sbjct: 301  YFSQPQPCFLDEERVKKLKSLGISMNKLEAAWDEDVKNPKKRTAQDIVAKFSMDDDVLAI 360

Query: 1340 GDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGF 1519
            GDVFFADVE++WVMQPGGPI+HKCKTLIEPSRLI+LTAQRF+QTFLG +F+ALHFRRHGF
Sbjct: 361  GDVFFADVEKDWVMQPGGPISHKCKTLIEPSRLIMLTAQRFVQTFLGDNFIALHFRRHGF 420

Query: 1520 LKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVP 1699
            LKFCNAK+PSCFYPVPQAA+CINRV+ERA++PV+YLSTDAA+SETGLLQSLVV NGKTVP
Sbjct: 421  LKFCNAKKPSCFYPVPQAADCINRVLERANSPVMYLSTDAAESETGLLQSLVVFNGKTVP 480

Query: 1700 LVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRK 1879
            LVQRP RN+AEKWDALLYRHGLEGD QVEAMLDKTICA+SSVFIGSSGSTFTDDILRLRK
Sbjct: 481  LVQRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTICAMSSVFIGSSGSTFTDDILRLRK 540

Query: 1880 DWGSASQCDEYLCQGELPNFIAEDE 1954
            DWGSAS CDEYLCQGELPNF+A+DE
Sbjct: 541  DWGSASLCDEYLCQGELPNFVADDE 565


>gb|EPS60947.1| hypothetical protein M569_13853, partial [Genlisea aurea]
          Length = 568

 Score =  736 bits (1899), Expect = 0.0
 Identities = 367/561 (65%), Positives = 432/561 (77%)
 Frame = +2

Query: 272  NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKRYIFLIFLPVVIM 451
            NLISQNAR +D VKS    NH +HRS+  ++ D R R S AA  + KRY   I LP +I+
Sbjct: 15   NLISQNARSDDAVKSS---NHSHHRSSLHVERDLRRRFSAAAGGYKKRYFLAIVLPALIL 71

Query: 452  LVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLV 631
            ++Y TTDLKNVF M +P I   GG+  L++RMRESEL+AL LL+QQE EL K+WNYT+  
Sbjct: 72   VLYFTTDLKNVFAMSIPKIGYHGGDA-LSDRMRESELQALNLLRQQEAELFKLWNYTSSA 130

Query: 632  ERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSTG 811
             +                     +  DLKS+VF Q+S+NK+IQ +LLS+H   G      
Sbjct: 131  NKLNYSHDPVNVNSSAIHNLD--LFLDLKSQVFSQLSLNKRIQTLLLSSH-GNGEAFHDS 187

Query: 812  NYTDSISSDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHMFFAA 991
            NY+ +     TRC   +  L  R+ +EW+P  +K+L AIC+SGQMSNHLICLEKHMFFAA
Sbjct: 188  NYSFTDDGLTTRCPTANRNLLGRRKMEWDPLPNKFLLAICISGQMSNHLICLEKHMFFAA 247

Query: 992  LLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFMCYFSM 1171
            LL R+LVIPSSKVDY FHRVLDIDHIN CLG+K VVTFEEF+ ++KNHLHID+F+CYFS 
Sbjct: 248  LLKRILVIPSSKVDYAFHRVLDIDHINTCLGKKAVVTFEEFSVMQKNHLHIDRFLCYFSS 307

Query: 1172 PQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSDVIAIGDVF 1351
            PQPC+MDDE               E+VWKEDVK+P+  +VEDV++KFSS+  V+A+GD+F
Sbjct: 308  PQPCYMDDEYVKKLKGVGLSLSKVESVWKEDVKSPRKTKVEDVVSKFSSNEAVVAVGDLF 367

Query: 1352 FADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFC 1531
            FA VE +WVMQPGGPI HKCKTLIEPSRLI LTAQRF+QTFLGKDF+ALHFRRHGFLKFC
Sbjct: 368  FAQVEEDWVMQPGGPIEHKCKTLIEPSRLIRLTAQRFVQTFLGKDFIALHFRRHGFLKFC 427

Query: 1532 NAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQR 1711
            NAKQPSCFYPVPQAAECINRV+ERA+ PVIYLSTDAA+SETGLLQSLV   G TVPLV+R
Sbjct: 428  NAKQPSCFYPVPQAAECINRVIERANAPVIYLSTDAAESETGLLQSLVTRYGNTVPLVKR 487

Query: 1712 PTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGS 1891
            P RN+AEKWDALLYRHGLEGDSQVEAMLDK ICALSSVFIGSSGSTFT+DILRLR+ W S
Sbjct: 488  PARNSAEKWDALLYRHGLEGDSQVEAMLDKAICALSSVFIGSSGSTFTEDILRLRRVWES 547

Query: 1892 ASQCDEYLCQGELPNFIAEDE 1954
             S CDEYLC+G LPN+IAEDE
Sbjct: 548  ESVCDEYLCEGRLPNYIAEDE 568


>ref|XP_006357428.1| PREDICTED: uncharacterized protein LOC102602087 [Solanum tuberosum]
          Length = 565

 Score =  712 bits (1837), Expect = 0.0
 Identities = 358/570 (62%), Positives = 434/570 (76%), Gaps = 9/570 (1%)
 Frame = +2

Query: 272  NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKR------YIFLIF 433
            NLI+Q  R N++ +SP        R+AFQIDD+  +      R FN        ++ +I 
Sbjct: 16   NLIAQRERGNNLSESPV-------RTAFQIDDEIAD-----TRPFNSSCSKCCYFLTIIV 63

Query: 434  LPVVIMLVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMW 613
            + V I + + TTD+ NV +  V        N    N MRESELRALYLL+QQ++ L K+W
Sbjct: 64   VTVFIFIRFYTTDVDNVSKTGVM-------NNDSVNLMRESELRALYLLRQQQLGLFKLW 116

Query: 614  NYTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEG 793
            N  TL++                     A+ ++LK  +  QIS+NKQIQ  LLS+H+   
Sbjct: 117  N-NTLIDNSLNATAANNSNFVSTSLFSSALSEELKLELISQISLNKQIQQALLSSHQLGN 175

Query: 794  SVDSTGNYTDSISSDW---TRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLIC 964
             ++++ N TD    D+    RC+K+D +L+DR+TIEW P+S KYLFAIC SGQMSNHLIC
Sbjct: 176  LLNASDNATDPSLDDYGGLDRCRKMDYKLSDRRTIEWEPRSDKYLFAICASGQMSNHLIC 235

Query: 965  LEKHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHI 1144
            LEKHMFFAALLNR+L+IPSS+VDYEF RVLDIDHINKCLGRKVVVTFEEFA+ +K H+HI
Sbjct: 236  LEKHMFFAALLNRILIIPSSRVDYEFRRVLDIDHINKCLGRKVVVTFEEFAKSQKGHMHI 295

Query: 1145 DKFMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDS 1324
            DKF+CYFS PQPCF+DDE               EA W ED+KNPK + V+D++ KFS D 
Sbjct: 296  DKFICYFSQPQPCFLDDEHVKKLKSLGVSMNKLEAAWDEDIKNPKPRTVQDIMTKFSLDD 355

Query: 1325 DVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHF 1504
            DVIAIGDVFFA+VE++WVMQPGGPI+HKCKTL+EPSRLILLTAQRFIQTFLGK+F+ALHF
Sbjct: 356  DVIAIGDVFFANVEKKWVMQPGGPISHKCKTLVEPSRLILLTAQRFIQTFLGKNFIALHF 415

Query: 1505 RRHGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLN 1684
            RRHGFLKFCNAK+PSCFYPVPQAA+CINRVVERA+ PVIYLSTDAA+SETG+LQSLV +N
Sbjct: 416  RRHGFLKFCNAKKPSCFYPVPQAADCINRVVERATAPVIYLSTDAAESETGILQSLVAVN 475

Query: 1685 GKTVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDI 1864
            GKTVPLV+RP +N+AEKWDALLYRHGLEGD QVEAMLDKTICA+S VFIGS GSTFT+DI
Sbjct: 476  GKTVPLVRRPAQNSAEKWDALLYRHGLEGDRQVEAMLDKTICAMSEVFIGSMGSTFTEDI 535

Query: 1865 LRLRKDWGSASQCDEYLCQGELPNFIAEDE 1954
            LRLRKDWG++S CDEYLC+GE+P+FIA+DE
Sbjct: 536  LRLRKDWGTSSLCDEYLCRGEVPSFIADDE 565


>ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254979 isoform 1 [Vitis
            vinifera]
          Length = 559

 Score =  708 bits (1827), Expect = 0.0
 Identities = 354/564 (62%), Positives = 427/564 (75%), Gaps = 3/564 (0%)
 Frame = +2

Query: 272  NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKRYIFLIFLPVVIM 451
            NLI +N R     K P       HRS FQI+D F++R+S     FNKRY+F IF P+ I+
Sbjct: 14   NLIDENER-----KLP-------HRSGFQIED-FKSRLSAHRFSFNKRYLFAIFPPLFIL 60

Query: 452  LVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLV 631
            L+Y TTD++N+F   +  ++    + P  +RMRESELRALYLL+QQ++ L  +WN+T   
Sbjct: 61   LIYFTTDVRNLFTTSISIVK---ADSP-TDRMRESELRALYLLRQQQLSLFSLWNHTAFA 116

Query: 632  ERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGS---VD 802
            +                         D KS +  QIS+NK+IQ +LLS+H S      VD
Sbjct: 117  DSAPIPSNSSNSTLDFSTRQVLLSSADFKSALLKQISLNKEIQQVLLSSHPSGNLSELVD 176

Query: 803  STGNYTDSISSDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHMF 982
              G+      S + RC KV+  ++ R TIEW P+S KYLFAIC+SGQMSNHLICLEKHMF
Sbjct: 177  DNGDLNFGAYS-FNRCPKVNQNMSQRPTIEWKPRSDKYLFAICLSGQMSNHLICLEKHMF 235

Query: 983  FAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFMCY 1162
            FAALLNR+LVIPSSK DY+++RVLDI+HIN CLGRKVVVTFEEF E KKNHLHID+ +CY
Sbjct: 236  FAALLNRILVIPSSKFDYQYNRVLDIEHINNCLGRKVVVTFEEFTESKKNHLHIDRVICY 295

Query: 1163 FSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSDVIAIG 1342
            FS+P PC++DD+               E  W ED+K PK +  +DV AKFSS+ DVIAIG
Sbjct: 296  FSLPLPCYVDDDHVKKLKSLGISMGKLEPAWAEDIKKPKKRTAQDVQAKFSSNDDVIAIG 355

Query: 1343 DVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFL 1522
            DVF+A+VE EWVMQPGGP+AHKC+TLIEPSRLI+LTAQRF+QTFLGK F ALHFRRHGFL
Sbjct: 356  DVFYANVEEEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRFVQTFLGKSFTALHFRRHGFL 415

Query: 1523 KFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPL 1702
            KFCNAK+PSCF+P+PQAA+CI+RVVERA TPVIYLSTDAA+SETGLLQSLVVLNGK VPL
Sbjct: 416  KFCNAKEPSCFFPIPQAADCISRVVERADTPVIYLSTDAAESETGLLQSLVVLNGKLVPL 475

Query: 1703 VQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKD 1882
            ++RPTRN+AEKWDALLYRHGL+GDSQVEAMLDKTICA++SVFIG+ GSTFT+DILRLR+ 
Sbjct: 476  IKRPTRNSAEKWDALLYRHGLDGDSQVEAMLDKTICAMASVFIGAPGSTFTEDILRLRRG 535

Query: 1883 WGSASQCDEYLCQGELPNFIAEDE 1954
            WGSAS CDEYLCQGE PNFIA++E
Sbjct: 536  WGSASHCDEYLCQGEQPNFIADNE 559


>ref|XP_004242264.1| PREDICTED: uncharacterized protein LOC101262928 [Solanum
            lycopersicum]
          Length = 562

 Score =  696 bits (1796), Expect = 0.0
 Identities = 353/565 (62%), Positives = 431/565 (76%), Gaps = 4/565 (0%)
 Frame = +2

Query: 272  NLISQNARPNDVVKSPDHHNHRNHRSAFQIDDDFRN-RVSGAARKFNKRYIFLIFLPVVI 448
            NLI+Q  R N++ + P+       R+AFQIDD+  N R S  +      +  +IF   VI
Sbjct: 14   NLIAQRQRGNNLSEFPE-------RTAFQIDDEIANTRPSDPSCSKCCCFSTIIFAVFVI 66

Query: 449  MLVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTL 628
            +L + +T + NV +  V        N    N M ESELRAL LL+QQ++ L K+WN  TL
Sbjct: 67   ILCF-STGVNNVSKTGVM-------NNDSVNLMLESELRALSLLRQQQLGLFKLWN-NTL 117

Query: 629  VERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDST 808
            ++                      + ++LK  +  QIS+NKQIQ  LLS+H+    ++++
Sbjct: 118  IDNSLNATAANNSNIVSTSLFSSVLSEELKLDLISQISLNKQIQQALLSSHQLSNLLNAS 177

Query: 809  GNYTDSISSDWT---RCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHM 979
             N TD    D++   RC+K+D +L+DR+TIEW P+S KYLFAIC SGQMSNHLICLEKHM
Sbjct: 178  DNATDPSLDDYSGLHRCRKMDYKLSDRRTIEWKPRSDKYLFAICASGQMSNHLICLEKHM 237

Query: 980  FFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFMC 1159
            FFAALLNR+++IPSS+VDYEF RVLDIDHINKCLGRKVVVTFEEFA+ +K H+HIDKF+C
Sbjct: 238  FFAALLNRIMIIPSSRVDYEFRRVLDIDHINKCLGRKVVVTFEEFAKSQKGHMHIDKFVC 297

Query: 1160 YFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSDVIAI 1339
            YFS PQPCF+DDE               EA W ED+KNPK + V+D+++KFS D  VIAI
Sbjct: 298  YFSQPQPCFLDDEHLKKLKSLGVSTNKLEAAWDEDIKNPKPRTVQDIMSKFSLDDAVIAI 357

Query: 1340 GDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGF 1519
            GDVFFA+VE++WVMQPGGPI+HKCKTL+EPSRLILLTAQRFIQTFLGK+F+ALHFRRHGF
Sbjct: 358  GDVFFANVEKKWVMQPGGPISHKCKTLVEPSRLILLTAQRFIQTFLGKNFIALHFRRHGF 417

Query: 1520 LKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVP 1699
            LKFCNAK+PSCFYPVPQAA+CINRVVERA+ PVIYLSTDAA+SETG+LQSLVV+NGKTVP
Sbjct: 418  LKFCNAKKPSCFYPVPQAADCINRVVERATAPVIYLSTDAAESETGILQSLVVVNGKTVP 477

Query: 1700 LVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRK 1879
            LV+RP +N+AEKWDALLYRHGLEGD QVEAMLDKTICA+S VFIGS GSTFT+DILRLRK
Sbjct: 478  LVRRPAQNSAEKWDALLYRHGLEGDRQVEAMLDKTICAISEVFIGSMGSTFTEDILRLRK 537

Query: 1880 DWGSASQCDEYLCQGELPNFIAEDE 1954
             WG++S CDEYLC+GE+PNFIA+DE
Sbjct: 538  AWGTSSLCDEYLCRGEVPNFIADDE 562


>gb|EXB64649.1| hypothetical protein L484_017982 [Morus notabilis]
          Length = 578

 Score =  691 bits (1783), Expect = 0.0
 Identities = 349/578 (60%), Positives = 433/578 (74%), Gaps = 17/578 (2%)
 Frame = +2

Query: 272  NLISQNARPNDVVKSPDHHNHRNH-RSAFQIDD------DFRNRVS---GAARKFNKRYI 421
            NLI QN R             +NH RS F IDD      +FR+R+     +    NK+++
Sbjct: 16   NLIEQNER-----------KLQNHPRSTFHIDDVDGGNREFRSRIRRRLSSLGLLNKKFM 64

Query: 422  FLIFLPVVIMLVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIEL 601
            F IFLP+ I++++L+TD++ +F   +  +R        ++R+RESELRAL+LL+QQ++ L
Sbjct: 65   FAIFLPLFIVVLFLSTDVRGLFSADLSGVRF----DSFSDRLRESELRALFLLRQQQLGL 120

Query: 602  VKMWNYT------TLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQG 763
              +WN T                               +++ DLK  V  Q+S+NK+IQ 
Sbjct: 121  FALWNQTFHDSPPISSNSTNNSSSSSSINSSASGTEQNSVIDDLKFAVLRQLSLNKEIQQ 180

Query: 764  ILLSAHESEGSVDSTGNYTDSIS-SDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSG 940
            +LLS H S  S   T     ++  SD+  C+KVD + + R+TIEW P S+K+LFAIC+SG
Sbjct: 181  VLLSPHRSGNSSSITDAGDPNLGGSDFDTCRKVDQKFSQRRTIEWKPNSNKFLFAICLSG 240

Query: 941  QMSNHLICLEKHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAE 1120
            QMSN LICLEKHMFFAALLNRVLVIPSSKVDY+++RVLDIDHINKCLGRKVV++FE+FAE
Sbjct: 241  QMSNRLICLEKHMFFAALLNRVLVIPSSKVDYQYNRVLDIDHINKCLGRKVVISFEDFAE 300

Query: 1121 IKKNHLHIDKFMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDV 1300
             KKNH+HI++F+CYFS PQPC++DDE               E+ W ED+K P  + V+DV
Sbjct: 301  TKKNHMHINRFICYFSQPQPCYVDDEHIKKLKGLGLTMGKLESAWTEDIKGPNKRTVQDV 360

Query: 1301 LAKFSSDSDVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLG 1480
             +KFS++ DVIAIGDVF+ADVE+EWVMQPGGP+AHKC+TLIEPSRLI+LTAQRFIQTFLG
Sbjct: 361  QSKFSTNDDVIAIGDVFYADVEQEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRFIQTFLG 420

Query: 1481 KDFVALHFRRHGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGL 1660
            K+FVALHFRRHGFLKFCNAKQPSCF+P+PQAA+CI  VVERA+ PVIYLSTDAA+SETGL
Sbjct: 421  KNFVALHFRRHGFLKFCNAKQPSCFFPIPQAADCITSVVERANAPVIYLSTDAAESETGL 480

Query: 1661 LQSLVVLNGKTVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSS 1840
            LQSL+VLNGK VPLV+RP RN+AEKWDALLYRHGLEGDSQVEAMLDKTICA+SSVFIG+ 
Sbjct: 481  LQSLIVLNGKPVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGAP 540

Query: 1841 GSTFTDDILRLRKDWGSASQCDEYLCQGELPNFIAEDE 1954
            GSTFT+DILRLRKDWGSAS CD+YLCQGE PNF+A++E
Sbjct: 541  GSTFTEDILRLRKDWGSASSCDKYLCQGEEPNFVADNE 578


>ref|XP_004303772.1| PREDICTED: uncharacterized protein LOC101299396 [Fragaria vesca
            subsp. vesca]
          Length = 556

 Score =  674 bits (1738), Expect = 0.0
 Identities = 350/574 (60%), Positives = 418/574 (72%), Gaps = 13/574 (2%)
 Frame = +2

Query: 272  NLISQNAR-----PNDV----VKSPDHHNHRNHRSAFQIDDDFRNRVSGAARK--FNKR- 415
            NLI QN R     P       +   D   HR+HR       + R R +    +  FNKR 
Sbjct: 19   NLIEQNDRKQLPSPRSATTFHIDDGDVDRHRHHR-------EIRRRFASLNLRDLFNKRS 71

Query: 416  -YIFLIFLPVVIMLVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQE 592
              +F IF+P+ +++++ +TD+K++F            +  ++ ++RESELRALYLL+QQ+
Sbjct: 72   FLVFFIFIPLFVLVLFFSTDIKSLF------FSHLSVSDSVSGKLRESELRALYLLRQQQ 125

Query: 593  IELVKMWNYTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILL 772
            + L  +WN T+                          L DLKS V  QIS+NK+IQ +LL
Sbjct: 126  LGLFGLWNSTSNHSNPD--------------------LDDLKSSVLRQISLNKEIQQVLL 165

Query: 773  SAHESEGSVDSTGNYTDSISSDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSN 952
            S H S  S +S  ++ D    D  RC+ VD R ++R+TIEW P S KYL AICVSGQMSN
Sbjct: 166  SPHSSGNSSESE-DFRDPSLGD--RCRVVDQRFSERRTIEWKPNSDKYLLAICVSGQMSN 222

Query: 953  HLICLEKHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKN 1132
            HLICLEKHMFFAALLNR+LVIPSSKVDY++  VLDI+HINKC+GRKVVVTFEE AE KKN
Sbjct: 223  HLICLEKHMFFAALLNRILVIPSSKVDYQYSTVLDIEHINKCIGRKVVVTFEELAEEKKN 282

Query: 1133 HLHIDKFMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKF 1312
            H+HID+F+CYFS P  C++DDE               E  W EDVK P  K V+DV +KF
Sbjct: 283  HIHIDRFICYFSKPTLCYVDDEHLKKLKALGISYKSREPAWGEDVKKPSKKTVQDVQSKF 342

Query: 1313 SSDSDVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFV 1492
            SS  +VIAIGDVFFAD E++WVMQPGGP+AHKCKTLIEPSRLILLTAQRFIQTFLGK+FV
Sbjct: 343  SSGDEVIAIGDVFFADAEQDWVMQPGGPLAHKCKTLIEPSRLILLTAQRFIQTFLGKNFV 402

Query: 1493 ALHFRRHGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSL 1672
            ALHFRRHGFLKFCN KQPSCFYP+PQAA+CI R+ ERA+ PV+YLSTDAA+SETGLLQSL
Sbjct: 403  ALHFRRHGFLKFCNNKQPSCFYPIPQAADCITRIAERANAPVVYLSTDAAESETGLLQSL 462

Query: 1673 VVLNGKTVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTF 1852
            VV+NGKTVPLV+RP RN+AEKWDALLYRHG+EGD QVEAMLDKTI A+SSVFIG+SGSTF
Sbjct: 463  VVVNGKTVPLVKRPARNSAEKWDALLYRHGIEGDPQVEAMLDKTISAMSSVFIGASGSTF 522

Query: 1853 TDDILRLRKDWGSASQCDEYLCQGELPNFIAEDE 1954
            T+DILRLRK WGSAS CDEYLCQGE PNFIAE+E
Sbjct: 523  TEDILRLRKGWGSASVCDEYLCQGEEPNFIAENE 556


>ref|XP_003529861.1| PREDICTED: uncharacterized protein LOC100776069 [Glycine max]
          Length = 543

 Score =  670 bits (1729), Expect = 0.0
 Identities = 344/560 (61%), Positives = 411/560 (73%), Gaps = 15/560 (2%)
 Frame = +2

Query: 320  DHHN--HRNHR--------SAFQIDDDFRNRVSGAARKFNKRYIFLIFLPVVIMLVYLTT 469
            DH N    NHR        +AF ++D   +R    +    K+YI  I   + ++L +  T
Sbjct: 10   DHRNLVDNNHRKPPSPPPSAAFHVED-LSSRFRRVSFALQKKYIIAILALLFLLLFFSIT 68

Query: 470  DLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXX 649
            D   +F    PS  +F     + +RM+ESELRA+ LL QQ+  L+  WN+T         
Sbjct: 69   DFHQLFS--TPSSFKFDS---ITDRMKESELRAINLLYQQQQSLLTAWNHTLRTNASDPN 123

Query: 650  XXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGS-----VDSTGN 814
                             +L+DLKS +F QIS+N++IQ ILL+ H + G+     +D    
Sbjct: 124  -----------------LLEDLKSSLFKQISLNREIQQILLNPHSTGGNAIEPELDLNAT 166

Query: 815  YTDSISSDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHMFFAAL 994
                +   + RC+ VD  L+ RKTIEWNP+  K+L AICVSGQMSNHLICLEKHMFFAAL
Sbjct: 167  LNGVV---YDRCRTVDQNLSQRKTIEWNPRDGKFLLAICVSGQMSNHLICLEKHMFFAAL 223

Query: 995  LNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFMCYFSMP 1174
            LNRVLVIPSSKVDY++ RV+DIDHINKCLG+KVVV+FEEF+ +KK HLHIDKF+CYFS P
Sbjct: 224  LNRVLVIPSSKVDYQYDRVVDIDHINKCLGKKVVVSFEEFSNLKKGHLHIDKFLCYFSHP 283

Query: 1175 QPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSDVIAIGDVFF 1354
            QPC++DDER              EAVW ED + PK K V+DVL KFS D DV+AIGDVF+
Sbjct: 284  QPCYLDDERLKKLGALGLTMSKPEAVWDEDTRKPKKKTVQDVLGKFSFDDDVMAIGDVFY 343

Query: 1355 ADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCN 1534
            A+VEREWVMQPGGPIAHKCKTLIEP+RLILLTAQRFIQTFLG++F+ALHFRRHGFLKFCN
Sbjct: 344  AEVEREWVMQPGGPIAHKCKTLIEPNRLILLTAQRFIQTFLGRNFIALHFRRHGFLKFCN 403

Query: 1535 AKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRP 1714
            AK+PSCFYP+PQAA+CI RVVE A  P+IYLSTDAA+SETGLLQSLVVLNG+ VPLV RP
Sbjct: 404  AKKPSCFYPIPQAADCILRVVEMADAPIIYLSTDAAESETGLLQSLVVLNGRPVPLVIRP 463

Query: 1715 TRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSA 1894
             RN+AEKWDALLYRH ++GDSQVEAMLDKTICA+SSVFIG+ GSTFT+DILRLRKDWGSA
Sbjct: 464  ARNSAEKWDALLYRHNMDGDSQVEAMLDKTICAMSSVFIGAPGSTFTEDILRLRKDWGSA 523

Query: 1895 SQCDEYLCQGELPNFIAEDE 1954
            S CDEYLCQGE PN IAE+E
Sbjct: 524  SMCDEYLCQGEEPNIIAENE 543


>ref|XP_002533327.1| conserved hypothetical protein [Ricinus communis]
            gi|223526849|gb|EEF29063.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 565

 Score =  667 bits (1722), Expect = 0.0
 Identities = 339/568 (59%), Positives = 426/568 (75%), Gaps = 7/568 (1%)
 Frame = +2

Query: 272  NLISQNARP--NDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARK--FNKRYIFL---I 430
            NLI QN R   N     P    HR   S F I++       G  R+  FNKRY +    I
Sbjct: 14   NLIEQNDRKHHNHQQTVPTSSPHRRSFSTFHIEE-----YGGVIRRRLFNKRYYYYLLAI 68

Query: 431  FLPVVIMLVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKM 610
            FLP++I++VY + DL+++F   + S+         ++RMRE+EL+ALYLL+QQ++ L+ +
Sbjct: 69   FLPLLIIIVYFSADLRSLFSANISSLNF----NSASDRMREAELQALYLLEQQQLSLLSI 124

Query: 611  WNYTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESE 790
            +N     +                       +++ +S +  Q++ NKQIQ ILLS H+S 
Sbjct: 125  FN-----QSFPSRNKNFSSNSSFINSFDNVKIENFRSALLKQMTFNKQIQQILLSPHKS- 178

Query: 791  GSVDSTGNYTDSISSDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLE 970
            G+ + +G+++ S    + RCKKV+ R  DRKTIEW P+S K+LF IC+SGQMSNHLICLE
Sbjct: 179  GNENVSGSFSGS-GFGFDRCKKVESRFLDRKTIEWKPRSDKFLFPICLSGQMSNHLICLE 237

Query: 971  KHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDK 1150
            KHMFFAALLNRVLV+PSSK DY+++RVLDI+HIN C+GRKVVVTFEEF +++KNH+HID+
Sbjct: 238  KHMFFAALLNRVLVMPSSKFDYQYNRVLDIEHINLCVGRKVVVTFEEFVQMRKNHVHIDR 297

Query: 1151 FMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSDV 1330
            F+CYFS P  C++D+E               E+ WKEDVK P  K V+DVLAKF+S+ DV
Sbjct: 298  FICYFSSPTACYVDEEHVKKLKGLGILMGKPESPWKEDVKKPSQKTVQDVLAKFTSNDDV 357

Query: 1331 IAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRR 1510
            IAIGDVF+AD+E++WVMQPGGP+AHKCKTLIEPSRLIL+TAQRFIQTFLGK+F+ALHFRR
Sbjct: 358  IAIGDVFYADMEQDWVMQPGGPLAHKCKTLIEPSRLILVTAQRFIQTFLGKNFIALHFRR 417

Query: 1511 HGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGK 1690
            HGFLKFCNAK PSCFYP+PQAA+CI RV ERA+ PVIYLSTDAA+SET LLQSL+++NGK
Sbjct: 418  HGFLKFCNAKNPSCFYPIPQAADCIARVAERANAPVIYLSTDAAESETDLLQSLIIVNGK 477

Query: 1691 TVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILR 1870
            TVPLV+RP+  + EKWD+LL RHG+E DSQVEAMLDKTI A+S+VFIG+SGSTFT+DILR
Sbjct: 478  TVPLVKRPSHTSVEKWDSLLSRHGIEDDSQVEAMLDKTISAMSNVFIGASGSTFTEDILR 537

Query: 1871 LRKDWGSASQCDEYLCQGELPNFIAEDE 1954
            LRKDW SAS CDEYLCQGELPNFIAEDE
Sbjct: 538  LRKDWESASLCDEYLCQGELPNFIAEDE 565


>ref|XP_002303337.1| protein-O-fucosyltransferase 2 [Populus trichocarpa]
            gi|222840769|gb|EEE78316.1| protein-O-fucosyltransferase
            2 [Populus trichocarpa]
          Length = 527

 Score =  666 bits (1719), Expect = 0.0
 Identities = 341/548 (62%), Positives = 413/548 (75%), Gaps = 11/548 (2%)
 Frame = +2

Query: 344  RSAFQIDDDFRNRVSGAARKF--NKRYIF----LIFLPVVIMLVYLTTDLKNVFRMRVPS 505
            R +   +DD  + +    RK   N RY      +IFLP+ I+ +  +TD++N+F   +  
Sbjct: 3    RDSSDEEDDREHLIEQNDRKHHQNGRYSLFAAAIIFLPLFILFLSFSTDIRNLFSTHLKV 62

Query: 506  IREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYT---TLVERXXXXXXXXXXXXX 676
                     L+ RMRESELRALYLLK+Q++ L  +WN T   TL+E+             
Sbjct: 63   ------GDSLSIRMRESELRALYLLKKQQLSLFSLWNSTGNSTLLEKDLNS--------- 107

Query: 677  XXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSTGNYTDSISSDW--TRC 850
                      +DLKS +  QIS+NK+IQ +LL+ HES G+V S+ +  D  ++     RC
Sbjct: 108  -------VSFEDLKSALLKQISLNKEIQQVLLAPHES-GNVSSSSSDLDFSNAGGFVQRC 159

Query: 851  KKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHMFFAALLNRVLVIPSSKV 1030
            +KVD R ADRKTIEW PK +K+LFA+C+SGQMSNHLICLEKHMFFAALLNRVLVIPSS+ 
Sbjct: 160  EKVDQRFADRKTIEWKPKPNKFLFALCLSGQMSNHLICLEKHMFFAALLNRVLVIPSSRF 219

Query: 1031 DYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFMCYFSMPQPCFMDDERXXX 1210
            DY+++RVLDI+H+N CLGRKVVVTFEEF EI KN  HID+F CYFS P PC++D+E    
Sbjct: 220  DYQYNRVLDIEHVNDCLGRKVVVTFEEFVEIMKNKPHIDRFFCYFSDPTPCYVDEEHVKK 279

Query: 1211 XXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSDVIAIGDVFFADVEREWVMQPG 1390
                       E+ WKED+K P    V+DV  KF SD +VIA+GDVFFADVE EW+MQPG
Sbjct: 280  LKGLGVSMGKLESPWKEDIKKPSKLTVKDVEGKFVSDDNVIAVGDVFFADVEEEWIMQPG 339

Query: 1391 GPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCNAKQPSCFYPVPQ 1570
            GPIAHKCKTLIEP+R+I+LTAQRFIQTFLG +F+ALHFRRHGFLKFCNAK+PSCFYPVPQ
Sbjct: 340  GPIAHKCKTLIEPTRIIMLTAQRFIQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPVPQ 399

Query: 1571 AAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRPTRNAAEKWDALL 1750
            AA+CI RVVERA+ PV+YLSTDAA+SETGLLQSLVV+NG+TVPLV RP+RNAAEKWDALL
Sbjct: 400  AADCIARVVERANAPVVYLSTDAAESETGLLQSLVVVNGRTVPLVTRPSRNAAEKWDALL 459

Query: 1751 YRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSASQCDEYLCQGEL 1930
            YRHGL+ D+QVEAMLDKTICA+SSVFIG+SGSTFT+DI RLRK W SAS CDEYLCQGEL
Sbjct: 460  YRHGLQEDAQVEAMLDKTICAMSSVFIGASGSTFTEDIFRLRKGWESASSCDEYLCQGEL 519

Query: 1931 PNFIAEDE 1954
            PN+IAE+E
Sbjct: 520  PNYIAENE 527


>ref|XP_007024790.1| O-fucosyltransferase family protein isoform 1 [Theobroma cacao]
            gi|508780156|gb|EOY27412.1| O-fucosyltransferase family
            protein isoform 1 [Theobroma cacao]
          Length = 558

 Score =  666 bits (1718), Expect = 0.0
 Identities = 337/569 (59%), Positives = 421/569 (73%), Gaps = 9/569 (1%)
 Frame = +2

Query: 275  LISQNAR---PNDVVKSPDHHNHRNHRSAFQIDD---DFRNRVSGAARKFNKRYIFLIFL 436
            LI QN     P+ +  SP      + RS+F I++     R R       FNKRY+F IFL
Sbjct: 15   LIHQNDTKNLPHQIPASP--RPSTSPRSSFHIEELESQIRRRFK---LTFNKRYLFAIFL 69

Query: 437  PVVIMLVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWN 616
            P++I+ +Y +TD++++F   + S++       +++R+RES+L+ALYLL QQ+  L+ +WN
Sbjct: 70   PLLIIPIYFSTDIRSLFSSNISSLKF----NTVSDRIRESQLQALYLLNQQQNSLLSLWN 125

Query: 617  YTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGS 796
            +T +                           D+K+ +  QI++NK IQ ILLS H++ G+
Sbjct: 126  HTFVNSNNNITA---------------VQFDDIKASLLTQITLNKHIQQILLSPHKT-GN 169

Query: 797  VDSTGNYTDSISSDWT--RCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLE 970
                G   D   + ++  RC+KVD + A+RKT EW PK +K+LFAIC+SGQMSNHLICLE
Sbjct: 170  SPQNGTLLDPNFAGYSFDRCRKVDQKFAERKTFEWKPKPNKFLFAICLSGQMSNHLICLE 229

Query: 971  KHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDK 1150
            KHMFFAA+LNR LVIPSS+ DY+++RVLDI+HIN C+G+K V+ FEEF EIKKNH HIDK
Sbjct: 230  KHMFFAAVLNRALVIPSSRFDYQYNRVLDIEHINGCIGKKAVIPFEEFMEIKKNHAHIDK 289

Query: 1151 FMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWK-EDVKNPKHKRVEDVLAKFSSDSD 1327
            F+CYFS PQPC++D+E               E  WK ED+K P  K ++DV  KF SD D
Sbjct: 290  FICYFSSPQPCYVDEEHLKKLKSLGISTGKLETAWKNEDIKKPSQKTIKDVEEKFGSDDD 349

Query: 1328 VIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFR 1507
            VIAIGDVF+ADVER+WV+QPGGPIAHKCKTLIEPS+LILLTA+RFIQTFLG +F+ALHFR
Sbjct: 350  VIAIGDVFYADVERDWVLQPGGPIAHKCKTLIEPSKLILLTAERFIQTFLGSNFIALHFR 409

Query: 1508 RHGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNG 1687
            RHGFLKFCNAK+PSCFYP+PQAA+CI R+VERA+TPVIYLSTDAA+SET LLQS+VVLNG
Sbjct: 410  RHGFLKFCNAKKPSCFYPIPQAADCITRMVERANTPVIYLSTDAAESETSLLQSMVVLNG 469

Query: 1688 KTVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDIL 1867
            KT+PLV+RP RN+AEKWDALLYRHGL  D QVEAMLDKTICA+SSVFIG+ GSTFT DIL
Sbjct: 470  KTIPLVKRPPRNSAEKWDALLYRHGLAEDPQVEAMLDKTICAMSSVFIGAPGSTFTGDIL 529

Query: 1868 RLRKDWGSASQCDEYLCQGELPNFIAEDE 1954
            RLRKDWG+AS CDEYLCQGE PNF A +E
Sbjct: 530  RLRKDWGTASLCDEYLCQGEDPNFTAGEE 558


>ref|XP_007024791.1| O-fucosyltransferase family protein isoform 2 [Theobroma cacao]
            gi|508780157|gb|EOY27413.1| O-fucosyltransferase family
            protein isoform 2 [Theobroma cacao]
          Length = 559

 Score =  661 bits (1706), Expect = 0.0
 Identities = 337/570 (59%), Positives = 421/570 (73%), Gaps = 10/570 (1%)
 Frame = +2

Query: 275  LISQNAR---PNDVVKSPDHHNHRNHRSAFQIDD---DFRNRVSGAARKFNKRYIFLIFL 436
            LI QN     P+ +  SP      + RS+F I++     R R       FNKRY+F IFL
Sbjct: 15   LIHQNDTKNLPHQIPASP--RPSTSPRSSFHIEELESQIRRRFK---LTFNKRYLFAIFL 69

Query: 437  PVVIMLVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWN 616
            P++I+ +Y +TD++++F   + S++       +++R+RES+L+ALYLL QQ+  L+ +WN
Sbjct: 70   PLLIIPIYFSTDIRSLFSSNISSLKF----NTVSDRIRESQLQALYLLNQQQNSLLSLWN 125

Query: 617  YTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGS 796
            +T +                           D+K+ +  QI++NK IQ ILLS H++ G+
Sbjct: 126  HTFVNSNNNITA---------------VQFDDIKASLLTQITLNKHIQQILLSPHKT-GN 169

Query: 797  VDSTGNYTDSISSDWT--RCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLE 970
                G   D   + ++  RC+KVD + A+RKT EW PK +K+LFAIC+SGQMSNHLICLE
Sbjct: 170  SPQNGTLLDPNFAGYSFDRCRKVDQKFAERKTFEWKPKPNKFLFAICLSGQMSNHLICLE 229

Query: 971  KHMFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDK 1150
            KHMFFAA+LNR LVIPSS+ DY+++RVLDI+HIN C+G+K V+ FEEF EIKKNH HIDK
Sbjct: 230  KHMFFAAVLNRALVIPSSRFDYQYNRVLDIEHINGCIGKKAVIPFEEFMEIKKNHAHIDK 289

Query: 1151 FMCYFSMPQPCFMDDERXXXXXXXXXXXXXXEAVWK-EDVKNPKHKRVEDVLAKFSSDSD 1327
            F+CYFS PQPC++D+E               E  WK ED+K P  K ++DV  KF SD D
Sbjct: 290  FICYFSSPQPCYVDEEHLKKLKSLGISTGKLETAWKNEDIKKPSQKTIKDVEEKFGSDDD 349

Query: 1328 VIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFR 1507
            VIAIGDVF+ADVER+WV+QPGGPIAHKCKTLIEPS+LILLTA+RFIQTFLG +F+ALHFR
Sbjct: 350  VIAIGDVFYADVERDWVLQPGGPIAHKCKTLIEPSKLILLTAERFIQTFLGSNFIALHFR 409

Query: 1508 RHGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNG 1687
            RHGFLKFCNAK+PSCFYP+PQAA+CI R+VERA+TPVIYLSTDAA+SET LLQS+VVLNG
Sbjct: 410  RHGFLKFCNAKKPSCFYPIPQAADCITRMVERANTPVIYLSTDAAESETSLLQSMVVLNG 469

Query: 1688 KTVPLVQRPTRNAAEKWDALLYRHGLEGDSQ-VEAMLDKTICALSSVFIGSSGSTFTDDI 1864
            KT+PLV+RP RN+AEKWDALLYRHGL  D Q VEAMLDKTICA+SSVFIG+ GSTFT DI
Sbjct: 470  KTIPLVKRPPRNSAEKWDALLYRHGLAEDPQVVEAMLDKTICAMSSVFIGAPGSTFTGDI 529

Query: 1865 LRLRKDWGSASQCDEYLCQGELPNFIAEDE 1954
            LRLRKDWG+AS CDEYLCQGE PNF A +E
Sbjct: 530  LRLRKDWGTASLCDEYLCQGEDPNFTAGEE 559


>ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208722 [Cucumis sativus]
            gi|449517914|ref|XP_004165989.1| PREDICTED:
            uncharacterized protein LOC101230373 [Cucumis sativus]
          Length = 573

 Score =  660 bits (1704), Expect = 0.0
 Identities = 331/554 (59%), Positives = 416/554 (75%), Gaps = 10/554 (1%)
 Frame = +2

Query: 323  HHNHRNHRSAFQIDDD--FRNRV-----SGAARKFNKRYIFLIF--LPVVIMLVYLTTDL 475
            H +   H + F IDDD  FR  +     S     F+KRY +L+   LP+ I++++ + D+
Sbjct: 26   HPSPPTHSTTFDIDDDPHFRPPIPRFPFSIPKFAFDKRYYYLLAAALPLCILVLFFSVDI 85

Query: 476  KNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXXXX 655
             ++F   + S  +   +  L +RMRESEL ALYLL+QQ++    +WN++  ++       
Sbjct: 86   TSLFSTTLSSTLKTSDS--LTDRMRESELTALYLLRQQQLGFFHLWNHSLFLQSNSSFNS 143

Query: 656  XXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSTGNYTDSISS 835
                          A+ + +KS +  QI++NK+IQ +LLS H S    +  G+     + 
Sbjct: 144  TPSNNLSSNS----ALTEYIKSALLKQITLNKEIQNVLLSPHRSGNLSEEVGDALPMDTF 199

Query: 836  DWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHMFFAALLNRVLVI 1015
               RC+K+D +L+DR+TIEW PKS+K+LFAIC SGQMSNHLICLEKHMFFAA+LNRVLVI
Sbjct: 200  ALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMSNHLICLEKHMFFAAILNRVLVI 259

Query: 1016 PSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFMCYFSMPQPCFMDD 1195
            PS KVDY+F RV+DID +N CLGRKVV++FEEF+EIKK+HLHID+F+CYFS P PC++DD
Sbjct: 260  PSHKVDYQFSRVIDIDRMNMCLGRKVVISFEEFSEIKKHHLHIDRFICYFSKPNPCYVDD 319

Query: 1196 ERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSD-VIAIGDVFFADVERE 1372
            E               E+ W ED K+P  K V DV +KFSS++D VIA+GD+FFA+VE+E
Sbjct: 320  EHISKLKNLGISMGKLESAWNEDTKHPNRKTVSDVESKFSSNNDDVIAVGDIFFANVEQE 379

Query: 1373 WVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCNAKQPSC 1552
            WV QPGGPIAHKC+TLIEPS LI LTAQRFIQTFLGK+++ALHFRRHGFLKFCNAKQPSC
Sbjct: 380  WVNQPGGPIAHKCQTLIEPSHLIKLTAQRFIQTFLGKNYIALHFRRHGFLKFCNAKQPSC 439

Query: 1553 FYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRPTRNAAE 1732
            FYP+PQAA+CI R+VERA+ PVIYLSTDAA+SE GLLQSL+VLNGK +PLV+RP RN+AE
Sbjct: 440  FYPIPQAADCIIRMVERANVPVIYLSTDAAESEHGLLQSLLVLNGKPIPLVKRPPRNSAE 499

Query: 1733 KWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSASQCDEY 1912
            KWDALLYRHGLE DSQVEAMLDKTICA+SS FIG+ GSTFT+DILRLRKDWG+AS CDEY
Sbjct: 500  KWDALLYRHGLEEDSQVEAMLDKTICAMSSTFIGAPGSTFTEDILRLRKDWGTASMCDEY 559

Query: 1913 LCQGELPNFIAEDE 1954
            LCQGE PNFI+E+E
Sbjct: 560  LCQGEEPNFISENE 573


>ref|NP_199853.1| O-fucosyltransferase family protein [Arabidopsis thaliana]
            gi|9758924|dbj|BAB09461.1| unnamed protein product
            [Arabidopsis thaliana] gi|133778858|gb|ABO38769.1|
            At5g50420 [Arabidopsis thaliana]
            gi|332008558|gb|AED95941.1| O-fucosyltransferase family
            protein [Arabidopsis thaliana]
            gi|591401714|gb|AHL38584.1| glycosyltransferase, partial
            [Arabidopsis thaliana]
          Length = 566

 Score =  660 bits (1702), Expect = 0.0
 Identities = 334/542 (61%), Positives = 407/542 (75%), Gaps = 3/542 (0%)
 Frame = +2

Query: 338  NHRSAFQIDDDFRNRVSGAARKFNKRYIFL-IFLPVVIMLVYLTTDLKNVFRMRVPSIRE 514
            N RSAFQIDD             NKRY+ + + L + I L++L TD + +F     S + 
Sbjct: 40   NQRSAFQIDDILHRVQHRGKISLNKRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKL 99

Query: 515  FGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXXXXXXXXXXXXXXXXX 694
                 PL+NR++ESELRALYLL+QQ++ L+ +WN T +                      
Sbjct: 100  ----DPLSNRVKESELRALYLLRQQQLALLSLWNGTLV---------NPSLNQSENALGS 146

Query: 695  XAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSTGNYTDSISSDWTRCKKVDPRLA 874
              + +D+KS V  QIS+NK+IQ +LLS H S     S G   DS++  + RC+KVD +L+
Sbjct: 147  SVLFEDVKSAVSKQISLNKEIQEVLLSPHRSSNY--SGGTDVDSVNFSYNRCRKVDQKLS 204

Query: 875  DRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHMFFAALLNRVLVIPSSKVDYEFHRVL 1054
            DRKT+EW P+S K+LFAIC+SGQMSNHLICLEKHMFFAALL+RVLVIPSSK DY++ RV+
Sbjct: 205  DRKTVEWKPRSDKFLFAICLSGQMSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVI 264

Query: 1055 DIDHINKCLGRKVVVTFEEFAE-IKKNHLHIDKFMCYFSMPQPCFMDDERXXXXXXXXXX 1231
            DI+ IN CLGR VVV F++F E  KKNH  ID+F+CYFS PQ C++D+E           
Sbjct: 265  DIERINTCLGRNVVVAFDQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGIS 324

Query: 1232 XXXX-EAVWKEDVKNPKHKRVEDVLAKFSSDSDVIAIGDVFFADVEREWVMQPGGPIAHK 1408
                 EA W ED+K P  + V+DV  KF SD DVIAIGDVF+AD+E++WVMQPGGPI HK
Sbjct: 325  IDGKLEAPWSEDIKKPSKRTVQDVQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHK 384

Query: 1409 CKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCNAKQPSCFYPVPQAAECIN 1588
            CKTLIEPS+LILLTAQRFIQTFLGK+F+ALHFRRHGFLKFCNAK PSCFYP+PQAAECI 
Sbjct: 385  CKTLIEPSKLILLTAQRFIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIA 444

Query: 1589 RVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRPTRNAAEKWDALLYRHGLE 1768
            R+VER++  VIYLSTDAA+SET LLQSLVV++GK VPLV+RP RN+AEKWDALLYRHG+E
Sbjct: 445  RIVERSNGAVIYLSTDAAESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIE 504

Query: 1769 GDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSASQCDEYLCQGELPNFIAE 1948
             DSQV+AMLDKTICA+SSVFIG+SGSTFT+DILRLRKDWG++S CDEYLC+GE PNFIAE
Sbjct: 505  DDSQVDAMLDKTICAMSSVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAE 564

Query: 1949 DE 1954
            DE
Sbjct: 565  DE 566


>gb|AAM66093.1| unknown [Arabidopsis thaliana]
          Length = 566

 Score =  659 bits (1700), Expect = 0.0
 Identities = 333/542 (61%), Positives = 407/542 (75%), Gaps = 3/542 (0%)
 Frame = +2

Query: 338  NHRSAFQIDDDFRNRVSGAARKFNKRYIFL-IFLPVVIMLVYLTTDLKNVFRMRVPSIRE 514
            N RSAFQIDD             NKRY+ + + L + I L++L TD + +F     S + 
Sbjct: 40   NQRSAFQIDDILHRVQHRGKISLNKRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKL 99

Query: 515  FGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXXXXXXXXXXXXXXXXX 694
                 PL+NR++ESELRALYLL+QQ++ L+ +WN T +                      
Sbjct: 100  ----DPLSNRVKESELRALYLLRQQQLALLSLWNGTLV---------NPSLNQSENALGS 146

Query: 695  XAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSTGNYTDSISSDWTRCKKVDPRLA 874
              + +D+KS V  QIS+NK+IQ +LLS H S     S G   DS++  + RC+KVD +L+
Sbjct: 147  SVLFEDVKSAVSKQISLNKEIQEVLLSPHRSSNY--SGGTDVDSVNFSYNRCRKVDQKLS 204

Query: 875  DRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHMFFAALLNRVLVIPSSKVDYEFHRVL 1054
            DRKT+EW P+S K+LFAIC+SGQMSNHL+CLEKHMFFAALL+RVLVIPSSK DY++ RV+
Sbjct: 205  DRKTVEWKPRSDKFLFAICLSGQMSNHLLCLEKHMFFAALLDRVLVIPSSKFDYQYDRVI 264

Query: 1055 DIDHINKCLGRKVVVTFEEFAE-IKKNHLHIDKFMCYFSMPQPCFMDDERXXXXXXXXXX 1231
            DI+ IN CLGR VVV F++F E  KKNH  ID+F+CYFS PQ C++D+E           
Sbjct: 265  DIERINTCLGRNVVVAFDQFKEKAKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGIS 324

Query: 1232 XXXX-EAVWKEDVKNPKHKRVEDVLAKFSSDSDVIAIGDVFFADVEREWVMQPGGPIAHK 1408
                 EA W ED+K P  + V+DV  KF SD DVIAIGDVF+AD+E++WVMQPGGPI HK
Sbjct: 325  IDGKLEAPWSEDIKKPSKRTVQDVQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHK 384

Query: 1409 CKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFCNAKQPSCFYPVPQAAECIN 1588
            CKTLIEPS+LILLTAQRFIQTFLGK+F+ALHFRRHGFLKFCNAK PSCFYP+PQAAECI 
Sbjct: 385  CKTLIEPSKLILLTAQRFIQTFLGKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIA 444

Query: 1589 RVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQRPTRNAAEKWDALLYRHGLE 1768
            R+VER++  VIYLSTDAA+SET LLQSLVV++GK VPLV+RP RN+AEKWDALLYRHG+E
Sbjct: 445  RIVERSNGAVIYLSTDAAESETSLLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIE 504

Query: 1769 GDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGSASQCDEYLCQGELPNFIAE 1948
             DSQV+AMLDKTICA+SSVFIG+SGSTFT+DILRLRKDWG++S CDEYLC+GE PNFIAE
Sbjct: 505  DDSQVDAMLDKTICAMSSVFIGASGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAE 564

Query: 1949 DE 1954
            DE
Sbjct: 565  DE 566


>ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arabidopsis lyrata subsp.
            lyrata] gi|297311638|gb|EFH42062.1| hypothetical protein
            ARALYDRAFT_918074 [Arabidopsis lyrata subsp. lyrata]
          Length = 566

 Score =  659 bits (1699), Expect = 0.0
 Identities = 339/568 (59%), Positives = 417/568 (73%), Gaps = 7/568 (1%)
 Frame = +2

Query: 272  NLISQN----ARPNDVVKSPDHHNHRNHRSAFQIDDDFRNRVSGAARKFNKRYIFL-IFL 436
            +LI QN        D + S       N RSAFQI+D  +          NKRY+ + + L
Sbjct: 14   HLIPQNDTRIRHREDPISSTATTTGGNQRSAFQIEDILQRVQRRWKISLNKRYVIVFVSL 73

Query: 437  PVVIMLVYLTTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWN 616
             + I L++L TD + +F     S +      PL+NR++ESELRALYLL+QQ++ L+ +WN
Sbjct: 74   IISIGLLFLLTDPRELFSANFSSFKL----DPLSNRVKESELRALYLLRQQQLALLSLWN 129

Query: 617  YTTLVERXXXXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGS 796
             T +                        + +D+KS V  QIS+NK+IQ +LLS H S   
Sbjct: 130  GTLV---------NPSLNQSENDLRSSVLFEDVKSAVSKQISLNKEIQNVLLSPHRSSNY 180

Query: 797  VDSTGNYTDSISSDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKH 976
              S G   DS++  + RC+KVD +L+DRKT+EW P+S K+LFAIC+SGQMSNHLICLEKH
Sbjct: 181  --SGGTEVDSVNFSYDRCRKVDQKLSDRKTVEWKPRSDKFLFAICLSGQMSNHLICLEKH 238

Query: 977  MFFAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAE-IKKNHLHIDKF 1153
            MFFAALL+RVLVIPSSK DY++ RV+DI+ IN CLGR VVV+F++F E  KKNH  ID+F
Sbjct: 239  MFFAALLDRVLVIPSSKFDYQYDRVIDIEGINTCLGRNVVVSFDQFKEKAKKNHFRIDRF 298

Query: 1154 MCYFSMPQPCFMDDERXXXXXXXXXXXXXX-EAVWKEDVKNPKHKRVEDVLAKFSSDSDV 1330
            +CYFS PQ C++D+E                EA W ED+K P  + V+DV  KF SD DV
Sbjct: 299  ICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKKPSKRTVQDVQTKFKSDDDV 358

Query: 1331 IAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRR 1510
            IAIGDVF+AD+E++WVMQPGGPI HKCKTLIEPS+LILLTAQRFIQTFLGK+F+ALHFRR
Sbjct: 359  IAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQRFIQTFLGKNFIALHFRR 418

Query: 1511 HGFLKFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGK 1690
            HGFLKFCNAK PSCFYP+PQAAECI R+VER++  VIYLSTDAA+SET LLQSLVV++GK
Sbjct: 419  HGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDAAESETSLLQSLVVVDGK 478

Query: 1691 TVPLVQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILR 1870
             VPLV+RP RN+AEKWDALLYRHG+E DSQV+AMLDKTICA+SSVFIG+SGSTFT+DILR
Sbjct: 479  IVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMSSVFIGASGSTFTEDILR 538

Query: 1871 LRKDWGSASQCDEYLCQGELPNFIAEDE 1954
            LRKDWG++S CDEYLC+GE PNFIAEDE
Sbjct: 539  LRKDWGTSSTCDEYLCRGEEPNFIAEDE 566


>ref|XP_003547949.1| PREDICTED: uncharacterized protein LOC548046 [Glycine max]
          Length = 543

 Score =  658 bits (1698), Expect = 0.0
 Identities = 345/564 (61%), Positives = 405/564 (71%), Gaps = 19/564 (3%)
 Frame = +2

Query: 320  DHHN--HRNHRS--------AFQIDDDFRNRVSGAARKFNKRYIFLIFLPVVIMLVYLTT 469
            DH N    NHR         AF ++D    R   A     K+YIF I   + ++L +  T
Sbjct: 10   DHRNLVDNNHRKPPSSPAAVAFHVEDP-SPRFRRANFTLQKKYIFAILAILFLLLFFSIT 68

Query: 470  DLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXXXX 649
            DL  +F     S   F     L +RM+ESELRA+ LL QQ+  L+  WN+T         
Sbjct: 69   DLHKLFS--TTSSFRFDS---LTDRMKESELRAINLLNQQQQALLTAWNHTLRTNASDPN 123

Query: 650  XXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDSTGNYTDSI 829
                             +L+DLKS +F QIS+N++IQ ILL+ H       STGN     
Sbjct: 124  -----------------LLEDLKSSIFKQISLNREIQQILLNPH-------STGNNAIEP 159

Query: 830  SSD---------WTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHMF 982
              D         + RC+ VD  L+ RKTIEWNP+  K+L AICVSGQMSNHLICLEKH+F
Sbjct: 160  EFDLNATLNGVVYDRCRTVDQNLSQRKTIEWNPRDGKFLLAICVSGQMSNHLICLEKHIF 219

Query: 983  FAALLNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFMCY 1162
            FAALLNRVLVIPSSKVDY++ RV+DIDHINKCLG+KVVV+FE F+ +KK HLHIDKF+CY
Sbjct: 220  FAALLNRVLVIPSSKVDYQYDRVVDIDHINKCLGKKVVVSFEVFSNLKKGHLHIDKFLCY 279

Query: 1163 FSMPQPCFMDDERXXXXXXXXXXXXXXEAVWKEDVKNPKHKRVEDVLAKFSSDSDVIAIG 1342
            FS PQPC++DDER               AVW ED +NPK K V+DVL KFS D DV+AIG
Sbjct: 280  FSQPQPCYLDDERLKKLGALGLTMSKPVAVWDEDTRNPKKKTVQDVLGKFSFDDDVMAIG 339

Query: 1343 DVFFADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFL 1522
            DVF+A+VEREWVMQPGGPIAHKC TLIEP+RLILLTAQRFIQTFLG++FVALHFRRHGFL
Sbjct: 340  DVFYAEVEREWVMQPGGPIAHKCTTLIEPNRLILLTAQRFIQTFLGRNFVALHFRRHGFL 399

Query: 1523 KFCNAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPL 1702
            KFCNAK+PSCFY + QAA+CI RVVERA  P+IYLSTDAA+SETGLLQSLVVLNG+ VPL
Sbjct: 400  KFCNAKKPSCFYSITQAADCILRVVERADAPIIYLSTDAAESETGLLQSLVVLNGRPVPL 459

Query: 1703 VQRPTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKD 1882
            V RP RN+AEKWDALLYRH ++GDSQVEAMLDK+ICA+SSVFIG+ GSTFT+DILRLRKD
Sbjct: 460  VIRPARNSAEKWDALLYRHRMDGDSQVEAMLDKSICAMSSVFIGAPGSTFTEDILRLRKD 519

Query: 1883 WGSASQCDEYLCQGELPNFIAEDE 1954
            WGSAS CDEYLCQGE PN +AE+E
Sbjct: 520  WGSASMCDEYLCQGEEPNIVAENE 543


>ref|XP_004510588.1| PREDICTED: uncharacterized protein LOC101496484 [Cicer arietinum]
          Length = 549

 Score =  657 bits (1696), Expect = 0.0
 Identities = 335/561 (59%), Positives = 416/561 (74%), Gaps = 16/561 (2%)
 Frame = +2

Query: 320  DHHN--HRNHR----------SAFQIDDDFRNRVSGAARKFNKRYIFLIFLPVVIMLVYL 463
            DHHN  H+N            + F +DD   +R   A  KF K+YI  I + ++++L++ 
Sbjct: 10   DHHNLIHQNSTKPRTPPSITAATFHVDD-LNSRFRRANFKFQKKYIIAIIVLLIVILLFS 68

Query: 464  TTDLKNVFRMRVPSIREFGGNGPLANRMRESELRALYLLKQQEIELVKMWNYTTLVERXX 643
              +L+  F     S   F  +  +++RM+ESELRA+YLL+QQ++ L+ ++N  +      
Sbjct: 69   IPNLRRHF-----STASFISDS-VSDRMKESELRAIYLLRQQQLSLLTVFNRNSQSNTSD 122

Query: 644  XXXXXXXXXXXXXXXXXXAMLQDLKSRVFGQISMNKQIQGILLSAHESEGSVDST---GN 814
                               +++DLKS +  QIS+N +IQ ILL+ H      D     GN
Sbjct: 123  PNQTPN-------------LIEDLKSALSKQISINSEIQQILLNPHRIGNVFDPEFDFGN 169

Query: 815  YTDSISSDWTRCKKVDPRLADRKTIEWNPKSSKYLFAICVSGQMSNHLICLEKHMFFAAL 994
               S + ++  C+ +D  L+ RKT+EWNPK  K+L AICVSGQMSNHLICLEKHMFFAAL
Sbjct: 170  VNVS-NGNYDTCRTIDQNLSKRKTVEWNPKEGKFLLAICVSGQMSNHLICLEKHMFFAAL 228

Query: 995  LNRVLVIPSSKVDYEFHRVLDIDHINKCLGRKVVVTFEEFAEIKKNHLHIDKFMCYFSMP 1174
            LNRVLVIPSSK DY++ RV++IDHINKCLG+KVV++F+EF+ +KK+HLHIDKF+CYFS+P
Sbjct: 229  LNRVLVIPSSKFDYQYDRVVNIDHINKCLGKKVVISFDEFSNVKKDHLHIDKFLCYFSLP 288

Query: 1175 QPCFMDDERXXXXXXXXXXXXXXEAVWK-EDVKNPKHKRVEDVLAKFSSDSDVIAIGDVF 1351
            QPC++DDE+              +AVW  ED +NPK K VEDV++KFS D DV+AIGDVF
Sbjct: 289  QPCYLDDEKLKKLSGLGLSMSKPKAVWDDEDTRNPKKKSVEDVMSKFSYDDDVMAIGDVF 348

Query: 1352 FADVEREWVMQPGGPIAHKCKTLIEPSRLILLTAQRFIQTFLGKDFVALHFRRHGFLKFC 1531
            +A+VE EWVMQPGGPIAHKCKTLIEP+RLI LTAQRFIQTFLG++F+ALHFRRHGFLKFC
Sbjct: 349  YAEVEHEWVMQPGGPIAHKCKTLIEPNRLITLTAQRFIQTFLGRNFIALHFRRHGFLKFC 408

Query: 1532 NAKQPSCFYPVPQAAECINRVVERASTPVIYLSTDAADSETGLLQSLVVLNGKTVPLVQR 1711
            NAK+PSCFYP+PQAA+CI RVVERA  P+IYLSTDAA SETGLLQSLVVLNGK VPLV R
Sbjct: 409  NAKKPSCFYPIPQAADCILRVVERADAPIIYLSTDAAQSETGLLQSLVVLNGKPVPLVIR 468

Query: 1712 PTRNAAEKWDALLYRHGLEGDSQVEAMLDKTICALSSVFIGSSGSTFTDDILRLRKDWGS 1891
            P RN+AEKWDALLYRHG+EGD+QVEAMLDKTICA+SSVFIG+ GSTFT+DI RLRKDWGS
Sbjct: 469  PARNSAEKWDALLYRHGIEGDAQVEAMLDKTICAMSSVFIGAPGSTFTEDIRRLRKDWGS 528

Query: 1892 ASQCDEYLCQGELPNFIAEDE 1954
             S CDEYLCQGE PN +AE+E
Sbjct: 529  LSMCDEYLCQGEEPNIVAENE 549

