BLASTX nr result
ID: Rauwolfia21_contig00000800
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00000800 (2653 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_004235519.1| PREDICTED: uncharacterized protein LOC101268...   819   0.0
ref|XP_006342883.1| PREDICTED: uncharacterized protein LOC102584...   817   0.0
ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254...   732   0.0
gb|EXB64649.1| hypothetical protein L484_017982 [Morus notabilis]     729   0.0
ref|XP_006357428.1| PREDICTED: uncharacterized protein LOC102602...   729   0.0
ref|XP_004242264.1| PREDICTED: uncharacterized protein LOC101262...   713   0.0
gb|EOY27412.1| O-fucosyltransferase family protein isoform 1 [Th...   707   0.0
gb|EOY27413.1| O-fucosyltransferase family protein isoform 2 [Th...   703   0.0
ref|XP_004303772.1| PREDICTED: uncharacterized protein LOC101299...   693   0.0
gb|EPS60947.1| hypothetical protein M569_13853, partial [Genlise...   689   0.0
ref|XP_006465793.1| PREDICTED: uncharacterized protein LOC102617...   682   0.0
ref|XP_002533327.1| conserved hypothetical protein [Ricinus comm...   682   0.0
ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208...   681   0.0
ref|XP_003529861.1| PREDICTED: uncharacterized protein LOC100776...   679   0.0
ref|XP_006426814.1| hypothetical protein CICLE_v10025289mg [Citr...   679   0.0
ref|XP_002303337.1| protein-O-fucosyltransferase 2 [Populus tric...   676   0.0
ref|XP_003547949.1| PREDICTED: uncharacterized protein LOC548046...   664   0.0
ref|NP_199853.1| O-fucosyltransferase family protein [Arabidopsi...   663   0.0
ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arab...
663 0.0 gb|AAM66093.1| unknown [Arabidopsis thaliana] 662 0.0 >ref|XP_004235519.1| PREDICTED: uncharacterized protein LOC101268664 [Solanum lycopersicum] Length = 565 Score = 819 bits (2116), Expect = 0.0 Identities = 413/565 (73%), Positives = 465/565 (82%) Frame = -2 Query: 2394 ESSDEEDDRHHLIEQNERRNDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFSLNKRYLLA 2215 ESSDEEDDR +LI QNER N KSPR STFQI+D K R RR NF+ K YLLA Sbjct: 6 ESSDEEDDRENLIHQNERVNHLSKSPR--PSTFQIED-VKDRFALCRRFNFTSGKTYLLA 62 Query: 2214 IVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXXXXXLWNH 2035 I+LPL +L+LYF TDIK LFQT+++ +KYD S N MRESELRA LWNH Sbjct: 63 IILPLLVLILYFATDIKALFQTTVTTIKYDGSVNSMRESELRALYLLKQQQLGLFKLWNH 122 Query: 2034 TLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSSHRLGDPLG 1855 TLVN ++ VE+LK DLL QISLNKQIQQVLLSSH+LG+ L Sbjct: 123 TLVNDTSTTHSLESAPGFTLVSRSSI-VEDLKDDLLRQISLNKQIQQVLLSSHQLGNSLI 181 Query: 1854 SLSVNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMSNHLICLEKHM 1675 + S N TDPS+G RC KVD LSER+T+EWKP+SNKYLFAICVSGQMSNHLICLEKHM Sbjct: 182 T-SDNSTDPSLGGLGRCRKVDHNLSERRTVEWKPRSNKYLFAICVSGQMSNHLICLEKHM 240 Query: 1674 VFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKKNHLHIDKFIC 1495 FAA+LNRVLVIPSSKVDYEF RVLDVDHIN+CLGR+V+VT++EFAE++K+HLHIDKF+C Sbjct: 241 FFAALLNRVLVIPSSKVDYEFRRVLDVDHINKCLGREVIVTYDEFAERRKSHLHIDKFLC 300 Query: 1494 YFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXXXXXXXXVIAI 1315 YFS PQPCF+D+E VKKLKSLG+SM+KLEAAW EDVK P+KRT V+AI Sbjct: 301 YFSQPQPCFLDEERVKKLKSLGISMNKLEAAWDEDVKNPKKRTAQDIVAKFSMDDDVLAI 360 Query: 1314 GDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDFIALHFRRHGF 1135 GDVFFADVE++WVMQPGGPI+HKCKTLIEP RLIMLTAQRFVQTFLG +FIALHFRRHGF Sbjct: 361 GDVFFADVEKDWVMQPGGPISHKCKTLIEPSRLIMLTAQRFVQTFLGDNFIALHFRRHGF 420 Query: 1134 LKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQSLLVFNGKTVP 955 LKFCNAK SCFYPVPQ+A+CINRV+ERANSPV+YLSTDAA SET LLQSL+VFNGKTVP Sbjct: 421 LKFCNAKKPSCFYPVPQAADCINRVLERANSPVMYLSTDAAESETGLLQSLVVFNGKTVP 480 Query: 954 LIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGSTFTEDILRLRK 
775 L++RPARNSAEKWDALLYRHGLEGDPQVEAMLDKT+CA+S+VFIGS GSTFT+DILRLRK Sbjct: 481 LVQRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTICAMSSVFIGSSGSTFTDDILRLRK 540 Query: 774 DWGSASLCDEYLCQGELPNFIADDE 700 DWGSASLCDEYLCQGELPNF+ADDE Sbjct: 541 DWGSASLCDEYLCQGELPNFVADDE 565 >ref|XP_006342883.1| PREDICTED: uncharacterized protein LOC102584575 [Solanum tuberosum] Length = 568 Score = 817 bits (2111), Expect = 0.0 Identities = 409/567 (72%), Positives = 471/567 (83%), Gaps = 2/567 (0%) Frame = -2 Query: 2394 ESSDEEDDRHHLIEQNERRNDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFSLNKRYLLA 2215 ESSDEEDDR +LI QNER ND KSPR +STFQI+D K R RR NF+ KRYLLA Sbjct: 6 ESSDEEDDRENLIHQNERVNDLSKSPR--RSTFQIED-VKDRFALCRRFNFTSGKRYLLA 62 Query: 2214 IVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXXXXXLWNH 2035 I+LP+ +LVLYF TDIK+LFQT+++ +KYD S N MR+SELRA LWNH Sbjct: 63 IILPVLVLVLYFATDIKSLFQTTVTTIKYDGSVNSMRDSELRALYLLRQQQLGLFKLWNH 122 Query: 2034 TLVN--KXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSSHRLGDP 1861 TLVN +V + VE+LK+DLL QISLNKQIQQVLLSSH+LG+ Sbjct: 123 TLVNDTSTTHTGSSLESTPGFASVSRSSIVEDLKADLLRQISLNKQIQQVLLSSHQLGNS 182 Query: 1860 LGSLSVNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMSNHLICLEK 1681 L + S N TDP++G +RC KVD LS+R+T+EWKP+SNKYLFAICVSGQMSNHLICLEK Sbjct: 183 LIT-SDNSTDPTLGGLSRCRKVDHNLSQRRTVEWKPRSNKYLFAICVSGQMSNHLICLEK 241 Query: 1680 HMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKKNHLHIDKF 1501 HM FAA+LNR+LVIPSSKVDYEF RVLDVDHIN+CLGR+V+VT++EFAE++K+HLHIDKF Sbjct: 242 HMFFAALLNRILVIPSSKVDYEFRRVLDVDHINKCLGREVIVTYDEFAERRKSHLHIDKF 301 Query: 1500 ICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXXXXXXXXVI 1321 +CYFS PQPCF+D+E VKKLKSLG+SM+KLEAAW EDVK P+KRT V+ Sbjct: 302 LCYFSQPQPCFLDEERVKKLKSLGISMNKLEAAWNEDVKNPKKRTVQDIMAKFSTDDDVL 361 Query: 1320 AIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDFIALHFRRH 1141 AIGDVFFADVE++WVMQPGGPI+HKCKTLIEP RLIMLTAQRF+QTFLG +FIALHFRRH Sbjct: 362 AIGDVFFADVEKDWVMQPGGPISHKCKTLIEPSRLIMLTAQRFIQTFLGDNFIALHFRRH 421 Query: 1140 GFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQSLLVFNGKT 961 
GFLKFCNAK SCFYPVPQ+A+CINRV+ERANSPVIYLSTDAA SET LLQSL+V NGKT Sbjct: 422 GFLKFCNAKKPSCFYPVPQAADCINRVLERANSPVIYLSTDAAESETGLLQSLVVVNGKT 481 Query: 960 VPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGSTFTEDILRL 781 VPL++RPARNSAEKWDALLYRHGLEGDPQV+AMLDKT+CA+S+VFIGS GSTFT+DILRL Sbjct: 482 VPLVQRPARNSAEKWDALLYRHGLEGDPQVDAMLDKTICAMSSVFIGSSGSTFTDDILRL 541 Query: 780 RKDWGSASLCDEYLCQGELPNFIADDE 700 RKDWGSASLCDEYLCQGELPN++ADDE Sbjct: 542 RKDWGSASLCDEYLCQGELPNYVADDE 568 >ref|XP_002264087.1| PREDICTED: uncharacterized protein LOC100254979 isoform 1 [Vitis vinifera] Length = 559 Score = 732 bits (1890), Expect = 0.0 Identities = 381/568 (67%), Positives = 444/568 (78%), Gaps = 3/568 (0%) Frame = -2 Query: 2394 ESSDEEDDRHHLIEQNERRNDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFSLNKRYLLA 2215 ESSD+E+DR +LI++NER K P H+S FQI+D FKSR R FS NKRYL A Sbjct: 4 ESSDDEEDRQNLIDENER-----KLP--HRSGFQIED-FKSRLSAHR---FSFNKRYLFA 52 Query: 2214 IVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXXXXXLWNH 2035 I PLFIL++YFTTD++NLF TS+S VK D+ T+ MRESELRA LWNH Sbjct: 53 IFPPLFILLIYFTTDVRNLFTTSISIVKADSPTDRMRESELRALYLLRQQQLSLFSLWNH 112 Query: 2034 T-LVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSSHRLGDPL 1858 T + T + + KS LL QISLNK+IQQVLLSSH G+ L Sbjct: 113 TAFADSAPIPSNSSNSTLDFSTRQVLLSSADFKSALLKQISLNKEIQQVLLSSHPSGN-L 171 Query: 1857 GSLSVNFTDPSIG--SFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMSNHLICLE 1684 L + D + G SFNRC KV+Q +S+R TIEWKP+S+KYLFAIC+SGQMSNHLICLE Sbjct: 172 SELVDDNGDLNFGAYSFNRCPKVNQNMSQRPTIEWKPRSDKYLFAICLSGQMSNHLICLE 231 Query: 1683 KHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKKNHLHIDK 1504 KHM FAA+LNR+LVIPSSK DY+++RVLD++HIN CLGRKVVVTFEEF E KKNHLHID+ Sbjct: 232 KHMFFAALLNRILVIPSSKFDYQYNRVLDIEHINNCLGRKVVVTFEEFTESKKNHLHIDR 291 Query: 1503 FICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXXXXXXXXV 1324 ICYFS P PC++DD+HVKKLKSLG+SM KLE AW ED+KKP+KRT V Sbjct: 292 VICYFSLPLPCYVDDDHVKKLKSLGISMGKLEPAWAEDIKKPKKRTAQDVQAKFSSNDDV 351 Query: 1323 IAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDFIALHFRR 1144 
IAIGDVF+A+VE EWVMQPGGP+AHKC+TLIEP RLIMLTAQRFVQTFLG+ F ALHFRR Sbjct: 352 IAIGDVFYANVEEEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRFVQTFLGKSFTALHFRR 411 Query: 1143 HGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQSLLVFNGK 964 HGFLKFCNAK+ SCF+P+PQ+A+CI+RVVERA++PVIYLSTDAA SET LLQSL+V NGK Sbjct: 412 HGFLKFCNAKEPSCFFPIPQAADCISRVVERADTPVIYLSTDAAESETGLLQSLVVLNGK 471 Query: 963 TVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGSTFTEDILR 784 VPLIKRP RNSAEKWDALLYRHGL+GD QVEAMLDKT+CA+++VFIG+ GSTFTEDILR Sbjct: 472 LVPLIKRPTRNSAEKWDALLYRHGLDGDSQVEAMLDKTICAMASVFIGAPGSTFTEDILR 531 Query: 783 LRKDWGSASLCDEYLCQGELPNFIADDE 700 LR+ WGSAS CDEYLCQGE PNFIAD+E Sbjct: 532 LRRGWGSASHCDEYLCQGEQPNFIADNE 559 >gb|EXB64649.1| hypothetical protein L484_017982 [Morus notabilis] Length = 578 Score = 729 bits (1883), Expect = 0.0 Identities = 368/579 (63%), Positives = 443/579 (76%), Gaps = 15/579 (2%) Frame = -2 Query: 2391 SSDEEDDRHHLIEQNERRNDAPKSPRHHQSTFQIDD------DFKSRSPNARRLNFSLNK 2230 SSDE+DDR +LIEQNER K H +STF IDD +F+SR LNK Sbjct: 7 SSDEDDDRENLIEQNER-----KLQNHPRSTFHIDDVDGGNREFRSRIRRRLSSLGLLNK 61 Query: 2229 RYLLAIVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXXXX 2050 +++ AI LPLFI+VL+ +TD++ LF LS V++D+ ++ +RESELRA Sbjct: 62 KFMFAIFLPLFIVVLFLSTDVRGLFSADLSGVRFDSFSDRLRESELRALFLLRQQQLGLF 121 Query: 2049 XLWNHTL-------VNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQV 1891 LWN T N N +++LK +L Q+SLNK+IQQV Sbjct: 122 ALWNQTFHDSPPISSNSTNNSSSSSSINSSASGTEQNSVIDDLKFAVLRQLSLNKEIQQV 181 Query: 1890 LLSSHRLGDPLGSLSVNFTDPSIGS--FNRCGKVDQKLSERKTIEWKPKSNKYLFAICVS 1717 LLS HR G+ S + DP++G F+ C KVDQK S+R+TIEWKP SNK+LFAIC+S Sbjct: 182 LLSPHRSGN--SSSITDAGDPNLGGSDFDTCRKVDQKFSQRRTIEWKPNSNKFLFAICLS 239 Query: 1716 GQMSNHLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFA 1537 GQMSN LICLEKHM FAA+LNRVLVIPSSKVDY+++RVLD+DHIN+CLGRKVV++FE+FA Sbjct: 240 GQMSNRLICLEKHMFFAALLNRVLVIPSSKVDYQYNRVLDIDHINKCLGRKVVISFEDFA 299 Query: 1536 EKKKNHLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXX 1357 E KKNH+HI++FICYFS PQPC++DDEH+KKLK 
LG++M KLE+AWTED+K P KRT Sbjct: 300 ETKKNHMHINRFICYFSQPQPCYVDDEHIKKLKGLGLTMGKLESAWTEDIKGPNKRTVQD 359 Query: 1356 XXXXXXXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFL 1177 VIAIGDVF+ADVE+EWVMQPGGP+AHKC+TLIEP RLIMLTAQRF+QTFL Sbjct: 360 VQSKFSTNDDVIAIGDVFYADVEQEWVMQPGGPLAHKCQTLIEPSRLIMLTAQRFIQTFL 419 Query: 1176 GRDFIALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETD 997 G++F+ALHFRRHGFLKFCNAK SCF+P+PQ+A+CI VVERAN+PVIYLSTDAA SET Sbjct: 420 GKNFVALHFRRHGFLKFCNAKQPSCFFPIPQAADCITSVVERANAPVIYLSTDAAESETG 479 Query: 996 LLQSLLVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGS 817 LLQSL+V NGK VPL+KRPARNSAEKWDALLYRHGLEGD QVEAMLDKT+CA+S+VFIG+ Sbjct: 480 LLQSLIVLNGKPVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKTICAMSSVFIGA 539 Query: 816 FGSTFTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 GSTFTEDILRLRKDWGSAS CD+YLCQGE PNF+AD+E Sbjct: 540 PGSTFTEDILRLRKDWGSASSCDKYLCQGEEPNFVADNE 578 >ref|XP_006357428.1| PREDICTED: uncharacterized protein LOC102602087 [Solanum tuberosum] Length = 565 Score = 729 bits (1883), Expect = 0.0 Identities = 371/575 (64%), Positives = 449/575 (78%), Gaps = 7/575 (1%) Frame = -2 Query: 2403 IMMES--SDEEDDRHHLIEQNERRNDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFSLNK 2230 +MME S+EE+D+ +LI Q ER N+ +SP ++ FQIDD+ P N S +K Sbjct: 1 MMMERDPSNEEEDQENLIAQRERGNNLSESPV--RTAFQIDDEIADTRP----FNSSCSK 54 Query: 2229 --RYLLAIVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXX 2056 +L IV+ +FI + ++TTD+ N+ +T + N + S N MRESELRA Sbjct: 55 CCYFLTIIVVTVFIFIRFYTTDVDNVSKTGVMN---NDSVNLMRESELRALYLLRQQQLG 111 Query: 2055 XXXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSSH 1876 LWN+TL++ ++ + E LK +L+ QISLNKQIQQ LLSSH Sbjct: 112 LFKLWNNTLIDNSLNATAANNSNFVSTSLFSSALSEELKLELISQISLNKQIQQALLSSH 171 Query: 1875 RLGDPLGSLSVNFTDPSI---GSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMS 1705 +LG+ L + S N TDPS+ G +RC K+D KLS+R+TIEW+P+S+KYLFAIC SGQMS Sbjct: 172 QLGNLLNA-SDNATDPSLDDYGGLDRCRKMDYKLSDRRTIEWEPRSDKYLFAICASGQMS 230 Query: 1704 NHLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKK 1525 NHLICLEKHM 
FAA+LNR+L+IPSS+VDYEF RVLD+DHIN+CLGRKVVVTFEEFA+ +K Sbjct: 231 NHLICLEKHMFFAALLNRILIIPSSRVDYEFRRVLDIDHINKCLGRKVVVTFEEFAKSQK 290 Query: 1524 NHLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXX 1345 H+HIDKFICYFS PQPCF+DDEHVKKLKSLGVSM+KLEAAW ED+K P+ RT Sbjct: 291 GHMHIDKFICYFSQPQPCFLDDEHVKKLKSLGVSMNKLEAAWDEDIKNPKPRTVQDIMTK 350 Query: 1344 XXXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDF 1165 VIAIGDVFFA+VE++WVMQPGGPI+HKCKTL+EP RLI+LTAQRF+QTFLG++F Sbjct: 351 FSLDDDVIAIGDVFFANVEKKWVMQPGGPISHKCKTLVEPSRLILLTAQRFIQTFLGKNF 410 Query: 1164 IALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQS 985 IALHFRRHGFLKFCNAK SCFYPVPQ+A+CINRVVERA +PVIYLSTDAA SET +LQS Sbjct: 411 IALHFRRHGFLKFCNAKKPSCFYPVPQAADCINRVVERATAPVIYLSTDAAESETGILQS 470 Query: 984 LLVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGST 805 L+ NGKTVPL++RPA+NSAEKWDALLYRHGLEGD QVEAMLDKT+CA+S VFIGS GST Sbjct: 471 LVAVNGKTVPLVRRPAQNSAEKWDALLYRHGLEGDRQVEAMLDKTICAMSEVFIGSMGST 530 Query: 804 FTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 FTEDILRLRKDWG++SLCDEYLC+GE+P+FIADDE Sbjct: 531 FTEDILRLRKDWGTSSLCDEYLCRGEVPSFIADDE 565 >ref|XP_004242264.1| PREDICTED: uncharacterized protein LOC101262928 [Solanum lycopersicum] Length = 562 Score = 713 bits (1841), Expect = 0.0 Identities = 359/569 (63%), Positives = 443/569 (77%), Gaps = 4/569 (0%) Frame = -2 Query: 2394 ESSDEEDDRHHLIEQNERRNDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFSLNKRYLLA 2215 + S+EE+D+ +LI Q +R N+ + P ++ FQIDD+ + P+ + S +K + Sbjct: 4 DPSNEEEDQENLIAQRQRGNNLSEFPE--RTAFQIDDEIANTRPS----DPSCSKCCCFS 57 Query: 2214 -IVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXXXXXLWN 2038 I+ +F+++L F+T + N+ +T + N + S N M ESELRA LWN Sbjct: 58 TIIFAVFVIILCFSTGVNNVSKTGVMN---NDSVNLMLESELRALSLLRQQQLGLFKLWN 114 Query: 2037 HTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSSHRLGDPL 1858 +TL++ ++ +V E LK DL+ QISLNKQIQQ LLSSH+L + L Sbjct: 115 NTLIDNSLNATAANNSNIVSTSLFSSVLSEELKLDLISQISLNKQIQQALLSSHQLSNLL 174 Query: 1857 
GSLSVNFTDPSIGSFN---RCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMSNHLICL 1687 + S N TDPS+ ++ RC K+D KLS+R+TIEWKP+S+KYLFAIC SGQMSNHLICL Sbjct: 175 NA-SDNATDPSLDDYSGLHRCRKMDYKLSDRRTIEWKPRSDKYLFAICASGQMSNHLICL 233 Query: 1686 EKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKKNHLHID 1507 EKHM FAA+LNR+++IPSS+VDYEF RVLD+DHIN+CLGRKVVVTFEEFA+ +K H+HID Sbjct: 234 EKHMFFAALLNRIMIIPSSRVDYEFRRVLDIDHINKCLGRKVVVTFEEFAKSQKGHMHID 293 Query: 1506 KFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXXXXXXXX 1327 KF+CYFS PQPCF+DDEH+KKLKSLGVS +KLEAAW ED+K P+ RT Sbjct: 294 KFVCYFSQPQPCFLDDEHLKKLKSLGVSTNKLEAAWDEDIKNPKPRTVQDIMSKFSLDDA 353 Query: 1326 VIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDFIALHFR 1147 VIAIGDVFFA+VE++WVMQPGGPI+HKCKTL+EP RLI+LTAQRF+QTFLG++FIALHFR Sbjct: 354 VIAIGDVFFANVEKKWVMQPGGPISHKCKTLVEPSRLILLTAQRFIQTFLGKNFIALHFR 413 Query: 1146 RHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQSLLVFNG 967 RHGFLKFCNAK SCFYPVPQ+A+CINRVVERA +PVIYLSTDAA SET +LQSL+V NG Sbjct: 414 RHGFLKFCNAKKPSCFYPVPQAADCINRVVERATAPVIYLSTDAAESETGILQSLVVVNG 473 Query: 966 KTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGSTFTEDIL 787 KTVPL++RPA+NSAEKWDALLYRHGLEGD QVEAMLDKT+CA+S VFIGS GSTFTEDIL Sbjct: 474 KTVPLVRRPAQNSAEKWDALLYRHGLEGDRQVEAMLDKTICAISEVFIGSMGSTFTEDIL 533 Query: 786 RLRKDWGSASLCDEYLCQGELPNFIADDE 700 RLRK WG++SLCDEYLC+GE+PNFIADDE Sbjct: 534 RLRKAWGTSSLCDEYLCRGEVPNFIADDE 562 >gb|EOY27412.1| O-fucosyltransferase family protein isoform 1 [Theobroma cacao] Length = 558 Score = 707 bits (1826), Expect = 0.0 Identities = 355/574 (61%), Positives = 436/574 (75%), Gaps = 9/574 (1%) Frame = -2 Query: 2394 ESSDEEDDRHHLIEQNERRN---DAPKSPRHH---QSTFQIDDDFKSRSPNARRLNFSLN 2233 +SSDE+DDR LI QN+ +N P SPR +S+F I++ S RR + N Sbjct: 4 DSSDEDDDRQTLIHQNDTKNLPHQIPASPRPSTSPRSSFHIEE---LESQIRRRFKLTFN 60 Query: 2232 KRYLLAIVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXXX 2053 KRYL AI LPL I+ +YF+TDI++LF +++S++K++ ++ +RES+L+A Sbjct: 61 KRYLFAIFLPLLIIPIYFSTDIRSLFSSNISSLKFNTVSDRIRESQLQALYLLNQQQNSL 120 Query: 
2052 XXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSSHR 1873 LWNHT VN I V +++K+ LL QI+LNK IQQ+LLS H+ Sbjct: 121 LSLWNHTFVNSNNN--------------ITAVQFDDIKASLLTQITLNKHIQQILLSPHK 166 Query: 1872 LGDPLGSLSVNFTDPSIG--SFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMSNH 1699 G+ + DP+ SF+RC KVDQK +ERKT EWKPK NK+LFAIC+SGQMSNH Sbjct: 167 TGN--SPQNGTLLDPNFAGYSFDRCRKVDQKFAERKTFEWKPKPNKFLFAICLSGQMSNH 224 Query: 1698 LICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKKNH 1519 LICLEKHM FAAVLNR LVIPSS+ DY+++RVLD++HIN C+G+K V+ FEEF E KKNH Sbjct: 225 LICLEKHMFFAAVLNRALVIPSSRFDYQYNRVLDIEHINGCIGKKAVIPFEEFMEIKKNH 284 Query: 1518 LHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAW-TEDVKKPRKRTXXXXXXXX 1342 HIDKFICYFS+PQPC++D+EH+KKLKSLG+S KLE AW ED+KKP ++T Sbjct: 285 AHIDKFICYFSSPQPCYVDEEHLKKLKSLGISTGKLETAWKNEDIKKPSQKTIKDVEEKF 344 Query: 1341 XXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDFI 1162 VIAIGDVF+ADVER+WV+QPGGPIAHKCKTLIEP +LI+LTA+RF+QTFLG +FI Sbjct: 345 GSDDDVIAIGDVFYADVERDWVLQPGGPIAHKCKTLIEPSKLILLTAERFIQTFLGSNFI 404 Query: 1161 ALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQSL 982 ALHFRRHGFLKFCNAK SCFYP+PQ+A+CI R+VERAN+PVIYLSTDAA SET LLQS+ Sbjct: 405 ALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRMVERANTPVIYLSTDAAESETSLLQSM 464 Query: 981 LVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGSTF 802 +V NGKT+PL+KRP RNSAEKWDALLYRHGL DPQVEAMLDKT+CA+S+VFIG+ GSTF Sbjct: 465 VVLNGKTIPLVKRPPRNSAEKWDALLYRHGLAEDPQVEAMLDKTICAMSSVFIGAPGSTF 524 Query: 801 TEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 T DILRLRKDWG+ASLCDEYLCQGE PNF A +E Sbjct: 525 TGDILRLRKDWGTASLCDEYLCQGEDPNFTAGEE 558 >gb|EOY27413.1| O-fucosyltransferase family protein isoform 2 [Theobroma cacao] Length = 559 Score = 703 bits (1814), Expect = 0.0 Identities = 355/575 (61%), Positives = 436/575 (75%), Gaps = 10/575 (1%) Frame = -2 Query: 2394 ESSDEEDDRHHLIEQNERRN---DAPKSPRHH---QSTFQIDDDFKSRSPNARRLNFSLN 2233 +SSDE+DDR LI QN+ +N P SPR +S+F I++ S RR + N Sbjct: 4 DSSDEDDDRQTLIHQNDTKNLPHQIPASPRPSTSPRSSFHIEE---LESQIRRRFKLTFN 60 Query: 
2232 KRYLLAIVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXXX 2053 KRYL AI LPL I+ +YF+TDI++LF +++S++K++ ++ +RES+L+A Sbjct: 61 KRYLFAIFLPLLIIPIYFSTDIRSLFSSNISSLKFNTVSDRIRESQLQALYLLNQQQNSL 120 Query: 2052 XXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSSHR 1873 LWNHT VN I V +++K+ LL QI+LNK IQQ+LLS H+ Sbjct: 121 LSLWNHTFVNSNNN--------------ITAVQFDDIKASLLTQITLNKHIQQILLSPHK 166 Query: 1872 LGDPLGSLSVNFTDPSIG--SFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMSNH 1699 G+ + DP+ SF+RC KVDQK +ERKT EWKPK NK+LFAIC+SGQMSNH Sbjct: 167 TGN--SPQNGTLLDPNFAGYSFDRCRKVDQKFAERKTFEWKPKPNKFLFAICLSGQMSNH 224 Query: 1698 LICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKKNH 1519 LICLEKHM FAAVLNR LVIPSS+ DY+++RVLD++HIN C+G+K V+ FEEF E KKNH Sbjct: 225 LICLEKHMFFAAVLNRALVIPSSRFDYQYNRVLDIEHINGCIGKKAVIPFEEFMEIKKNH 284 Query: 1518 LHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAW-TEDVKKPRKRTXXXXXXXX 1342 HIDKFICYFS+PQPC++D+EH+KKLKSLG+S KLE AW ED+KKP ++T Sbjct: 285 AHIDKFICYFSSPQPCYVDEEHLKKLKSLGISTGKLETAWKNEDIKKPSQKTIKDVEEKF 344 Query: 1341 XXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDFI 1162 VIAIGDVF+ADVER+WV+QPGGPIAHKCKTLIEP +LI+LTA+RF+QTFLG +FI Sbjct: 345 GSDDDVIAIGDVFYADVERDWVLQPGGPIAHKCKTLIEPSKLILLTAERFIQTFLGSNFI 404 Query: 1161 ALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQSL 982 ALHFRRHGFLKFCNAK SCFYP+PQ+A+CI R+VERAN+PVIYLSTDAA SET LLQS+ Sbjct: 405 ALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRMVERANTPVIYLSTDAAESETSLLQSM 464 Query: 981 LVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQ-VEAMLDKTVCALSTVFIGSFGST 805 +V NGKT+PL+KRP RNSAEKWDALLYRHGL DPQ VEAMLDKT+CA+S+VFIG+ GST Sbjct: 465 VVLNGKTIPLVKRPPRNSAEKWDALLYRHGLAEDPQVVEAMLDKTICAMSSVFIGAPGST 524 Query: 804 FTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 FT DILRLRKDWG+ASLCDEYLCQGE PNF A +E Sbjct: 525 FTGDILRLRKDWGTASLCDEYLCQGEDPNFTAGEE 559 >ref|XP_004303772.1| PREDICTED: uncharacterized protein LOC101299396 [Fragaria vesca subsp. 
vesca] Length = 556 Score = 693 bits (1789), Expect = 0.0 Identities = 364/578 (62%), Positives = 433/578 (74%), Gaps = 12/578 (2%) Frame = -2 Query: 2397 MESSDE-EDDRHHLIEQNERRNDAPKSPRHHQSTFQIDDDFKSRSPNARR---------L 2248 + S DE EDDR +LIEQN+R+ SPR +TF IDD R + R L Sbjct: 7 LSSDDEVEDDRQNLIEQNDRKQ--LPSPRS-ATTFHIDDGDVDRHRHHREIRRRFASLNL 63 Query: 2247 NFSLNKRYLLA--IVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXX 2074 NKR L I +PLF+LVL+F+TDIK+LF + LS D+ + +RESELRA Sbjct: 64 RDLFNKRSFLVFFIFIPLFVLVLFFSTDIKSLFFSHLS--VSDSVSGKLRESELRALYLL 121 Query: 2073 XXXXXXXXXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQ 1894 LWN T + N D+++LKS +L QISLNK+IQQ Sbjct: 122 RQQQLGLFGLWNSTSNHS-------------------NPDLDDLKSSVLRQISLNKEIQQ 162 Query: 1893 VLLSSHRLGDPLGSLSVNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSG 1714 VLLS H G+ S S +F DPS+G +RC VDQ+ SER+TIEWKP S+KYL AICVSG Sbjct: 163 VLLSPHSSGN--SSESEDFRDPSLG--DRCRVVDQRFSERRTIEWKPNSDKYLLAICVSG 218 Query: 1713 QMSNHLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAE 1534 QMSNHLICLEKHM FAA+LNR+LVIPSSKVDY++ VLD++HIN+C+GRKVVVTFEE AE Sbjct: 219 QMSNHLICLEKHMFFAALLNRILVIPSSKVDYQYSTVLDIEHINKCIGRKVVVTFEELAE 278 Query: 1533 KKKNHLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXX 1354 +KKNH+HID+FICYFS P C++DDEH+KKLK+LG+S E AW EDVKKP K+T Sbjct: 279 EKKNHIHIDRFICYFSKPTLCYVDDEHLKKLKALGISYKSREPAWGEDVKKPSKKTVQDV 338 Query: 1353 XXXXXXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLG 1174 VIAIGDVFFAD E++WVMQPGGP+AHKCKTLIEP RLI+LTAQRF+QTFLG Sbjct: 339 QSKFSSGDEVIAIGDVFFADAEQDWVMQPGGPLAHKCKTLIEPSRLILLTAQRFIQTFLG 398 Query: 1173 RDFIALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDL 994 ++F+ALHFRRHGFLKFCN K SCFYP+PQ+A+CI R+ ERAN+PV+YLSTDAA SET L Sbjct: 399 KNFVALHFRRHGFLKFCNNKQPSCFYPIPQAADCITRIAERANAPVVYLSTDAAESETGL 458 Query: 993 LQSLLVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSF 814 LQSL+V NGKTVPL+KRPARNSAEKWDALLYRHG+EGDPQVEAMLDKT+ A+S+VFIG+ Sbjct: 459 LQSLVVVNGKTVPLVKRPARNSAEKWDALLYRHGIEGDPQVEAMLDKTISAMSSVFIGAS 518 Query: 813 
GSTFTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 GSTFTEDILRLRK WGSAS+CDEYLCQGE PNFIA++E Sbjct: 519 GSTFTEDILRLRKGWGSASVCDEYLCQGEEPNFIAENE 556 >gb|EPS60947.1| hypothetical protein M569_13853, partial [Genlisea aurea] Length = 568 Score = 689 bits (1778), Expect = 0.0 Identities = 353/571 (61%), Positives = 421/571 (73%), Gaps = 6/571 (1%) Frame = -2 Query: 2394 ESSDEEDDRHHLIEQNERRNDAPKSPRH--HQSTFQIDDDFKSRSPNARRLNFSLNKRYL 2221 ESSDE+ D+ +LI QN R +DA KS H H+S+ ++ D + R A KRY Sbjct: 5 ESSDEDADQENLISQNARSDDAVKSSNHSHHRSSLHVERDLRRRFSAAAG---GYKKRYF 61 Query: 2220 LAIVLPLFILVLYFTTDIKNLFQTSLSNVKY---DASTNHMRESELRAXXXXXXXXXXXX 2050 LAIVLP ILVLYFTTD+KN+F S+ + Y DA ++ MRESEL+A Sbjct: 62 LAIVLPALILVLYFTTDLKNVFAMSIPKIGYHGGDALSDRMRESELQALNLLRQQEAELF 121 Query: 2049 XLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVE-NLKSDLLGQISLNKQIQQVLLSSHR 1873 LWN+T + + I N+D+ +LKS + Q+SLNK+IQ +LLSSH Sbjct: 122 KLWNYT--SSANKLNYSHDPVNVNSSAIHNLDLFLDLKSQVFSQLSLNKRIQTLLLSSHG 179 Query: 1872 LGDPLGSLSVNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMSNHLI 1693 G+ + +FTD G RC ++ L R+ +EW P NK+L AIC+SGQMSNHLI Sbjct: 180 NGEAFHDSNYSFTDD--GLTTRCPTANRNLLGRRKMEWDPLPNKFLLAICISGQMSNHLI 237 Query: 1692 CLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKKNHLH 1513 CLEKHM FAA+L R+LVIPSSKVDY FHRVLD+DHIN CLG+K VVTFEEF+ +KNHLH Sbjct: 238 CLEKHMFFAALLKRILVIPSSKVDYAFHRVLDIDHINTCLGKKAVVTFEEFSVMQKNHLH 297 Query: 1512 IDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXXXXXX 1333 ID+F+CYFS+PQPC+MDDE+VKKLK +G+S+SK+E+ W EDVK PRK Sbjct: 298 IDRFLCYFSSPQPCYMDDEYVKKLKGVGLSLSKVESVWKEDVKSPRKTKVEDVVSKFSSN 357 Query: 1332 XXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDFIALH 1153 V+A+GD+FFA VE +WVMQPGGPI HKCKTLIEP RLI LTAQRFVQTFLG+DFIALH Sbjct: 358 EAVVAVGDLFFAQVEEDWVMQPGGPIEHKCKTLIEPSRLIRLTAQRFVQTFLGKDFIALH 417 Query: 1152 FRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQSLLVF 973 FRRHGFLKFCNAK SCFYPVPQ+A CINRV+ERAN+PVIYLSTDAA SET LLQSL+ Sbjct: 418 FRRHGFLKFCNAKQPSCFYPVPQAAECINRVIERANAPVIYLSTDAAESETGLLQSLVTR 477 Query: 972 
NGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGSTFTED 793 G TVPL+KRPARNSAEKWDALLYRHGLEGD QVEAMLDK +CALS+VFIGS GSTFTED Sbjct: 478 YGNTVPLVKRPARNSAEKWDALLYRHGLEGDSQVEAMLDKAICALSSVFIGSSGSTFTED 537 Query: 792 ILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 ILRLR+ W S S+CDEYLC+G LPN+IA+DE Sbjct: 538 ILRLRRVWESESVCDEYLCEGRLPNYIAEDE 568 >ref|XP_006465793.1| PREDICTED: uncharacterized protein LOC102617227 [Citrus sinensis] Length = 563 Score = 682 bits (1760), Expect = 0.0 Identities = 345/581 (59%), Positives = 420/581 (72%), Gaps = 16/581 (2%) Frame = -2 Query: 2394 ESSDEEDDRHHLIEQNERR----------NDAPKSPRHHQSTFQIDDDFKSRSPNARRLN 2245 +SSD++DDR LI QN+ + N+ + STF IDD + SP RR Sbjct: 4 DSSDDDDDRETLIHQNDTKHGNHRLPTSNNNEDEEHNRRHSTFHIDD-LPNASPIRRRFT 62 Query: 2244 FSL----NKRYLLAIVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXX 2077 F NKRYL A+ LPL I++LYF+ ++++LF + N ++D+ + MRESELRA Sbjct: 63 FDFKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRALSL 122 Query: 2076 XXXXXXXXXXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQ 1897 LWN + VN +N ++ KS LL QISLNKQI+ Sbjct: 123 LKQQQSHLLSLWNQSFVNNSYGNNT------------NNPFFQDAKSALLNQISLNKQIE 170 Query: 1896 QVLLSSHRLGDPLGSLSVNFT-DPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICV 1720 Q+LLS H++ NFT + ++ F C KVD + ++T+EWKPKS+K+LFAIC+ Sbjct: 171 QILLSPHKVS--------NFTPNDAVWGFEGCRKVDSIIPNKRTVEWKPKSDKFLFAICL 222 Query: 1719 SGQMSNHLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEF 1540 SGQMSNHLICLEKHM AA+LNRVLVIPSSK DY++ RVLD++HIN+CLGRKVVV+FE F Sbjct: 223 SGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENF 282 Query: 1539 AEKKKNHLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAW-TEDVKKPRKRTX 1363 E +KNH HID+F+CYF P+PCF+DDEH+KKLK LG+SM K E W ED +KP KRT Sbjct: 283 MEMEKNHAHIDRFLCYFGLPEPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTV 342 Query: 1362 XXXXXXXXXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQT 1183 VIA+GD+F+ADVER+WVMQPGGPI H+CKTLIEP RLIM+TAQRFVQT Sbjct: 343 QDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQT 402 Query: 1182 
FLGRDFIALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASE 1003 FLG +FIALHFRRHGFLKFCNAK SCFYP+PQ+A+CI R+ ERAN+PVIYLSTDAA SE Sbjct: 403 FLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERANAPVIYLSTDAAESE 462 Query: 1002 TDLLQSLLVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFI 823 T LLQSL+V NGKT+ L+KRP RNSAEKWD+LLYRH LE D QVEAMLDKT+CA+S VFI Sbjct: 463 TSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFI 522 Query: 822 GSFGSTFTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 G+ GSTFTEDI+RLRKDWGS SLCDEYLCQGE PNFIA+DE Sbjct: 523 GASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563 >ref|XP_002533327.1| conserved hypothetical protein [Ricinus communis] gi|223526849|gb|EEF29063.1| conserved hypothetical protein [Ricinus communis] Length = 565 Score = 682 bits (1760), Expect = 0.0 Identities = 343/575 (59%), Positives = 439/575 (76%), Gaps = 10/575 (1%) Frame = -2 Query: 2394 ESSDEEDDRHHLIEQNERRND-----APKSPRHHQS--TFQIDDDFKSRSPNARRLNFSL 2236 +SSDEEDDR +LIEQN+R++ P S H +S TF I++ RRL Sbjct: 4 DSSDEEDDRENLIEQNDRKHHNHQQTVPTSSPHRRSFSTFHIEE---YGGVIRRRL---F 57 Query: 2235 NKRY---LLAIVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXX 2065 NKRY LLAI LPL I+++YF+ D+++LF ++S++ ++++++ MRE+EL+A Sbjct: 58 NKRYYYYLLAIFLPLLIIIVYFSADLRSLFSANISSLNFNSASDRMREAELQALYLLEQQ 117 Query: 2064 XXXXXXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLL 1885 ++N + ++ DNV +EN +S LL Q++ NKQIQQ+LL Sbjct: 118 QLSLLSIFNQSFPSRNKNFSSNSSFINS----FDNVKIENFRSALLKQMTFNKQIQQILL 173 Query: 1884 SSHRLGDPLGSLSVNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMS 1705 S H+ G+ ++S +F+ G F+RC KV+ + +RKTIEWKP+S+K+LF IC+SGQMS Sbjct: 174 SPHKSGNE--NVSGSFSGSGFG-FDRCKKVESRFLDRKTIEWKPRSDKFLFPICLSGQMS 230 Query: 1704 NHLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKK 1525 NHLICLEKHM FAA+LNRVLV+PSSK DY+++RVLD++HIN C+GRKVVVTFEEF + +K Sbjct: 231 NHLICLEKHMFFAALLNRVLVMPSSKFDYQYNRVLDIEHINLCVGRKVVVTFEEFVQMRK 290 Query: 1524 NHLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXX 1345 NH+HID+FICYFS+P C++D+EHVKKLK LG+ M K E+ W EDVKKP ++T 
Sbjct: 291 NHVHIDRFICYFSSPTACYVDEEHVKKLKGLGILMGKPESPWKEDVKKPSQKTVQDVLAK 350 Query: 1344 XXXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDF 1165 VIAIGDVF+AD+E++WVMQPGGP+AHKCKTLIEP RLI++TAQRF+QTFLG++F Sbjct: 351 FTSNDDVIAIGDVFYADMEQDWVMQPGGPLAHKCKTLIEPSRLILVTAQRFIQTFLGKNF 410 Query: 1164 IALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQS 985 IALHFRRHGFLKFCNAK+ SCFYP+PQ+A+CI RV ERAN+PVIYLSTDAA SETDLLQS Sbjct: 411 IALHFRRHGFLKFCNAKNPSCFYPIPQAADCIARVAERANAPVIYLSTDAAESETDLLQS 470 Query: 984 LLVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGST 805 L++ NGKTVPL+KRP+ S EKWD+LL RHG+E D QVEAMLDKT+ A+S VFIG+ GST Sbjct: 471 LIIVNGKTVPLVKRPSHTSVEKWDSLLSRHGIEDDSQVEAMLDKTISAMSNVFIGASGST 530 Query: 804 FTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 FTEDILRLRKDW SASLCDEYLCQGELPNFIA+DE Sbjct: 531 FTEDILRLRKDWESASLCDEYLCQGELPNFIAEDE 565 >ref|XP_004144450.1| PREDICTED: uncharacterized protein LOC101208722 [Cucumis sativus] gi|449517914|ref|XP_004165989.1| PREDICTED: uncharacterized protein LOC101230373 [Cucumis sativus] Length = 573 Score = 681 bits (1757), Expect = 0.0 Identities = 356/576 (61%), Positives = 425/576 (73%), Gaps = 12/576 (2%) Frame = -2 Query: 2391 SSDEEDDRHHLIEQNERRNDAPKSPRHHQSTFQIDDDFKSRSPNARRL----NFSLNKRY 2224 SSDEEDDR L+E N+ + SP H +TF IDDD R P R F+ +KRY Sbjct: 7 SSDEEDDRQSLVEHNDIKPHP--SPPTHSTTFDIDDDPHFRPPIPRFPFSIPKFAFDKRY 64 Query: 2223 --LLAIVLPLFILVLYFTTDIKNLFQTSLSNV--KYDASTNHMRESELRAXXXXXXXXXX 2056 LLA LPL ILVL+F+ DI +LF T+LS+ D+ T+ MRESEL A Sbjct: 65 YYLLAAALPLCILVLFFSVDITSLFSTTLSSTLKTSDSLTDRMRESELTALYLLRQQQLG 124 Query: 2055 XXXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSSH 1876 LWNH+L + ++ E +KS LL QI+LNK+IQ VLLS H Sbjct: 125 FFHLWNHSLFLQSNSSFNSTPSNNLSS---NSALTEYIKSALLKQITLNKEIQNVLLSPH 181 Query: 1875 RLGDPLGSLSVNFTDP---SIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMS 1705 R G+ LS D + +RC K+DQKLS+R+TIEWKPKSNK+LFAIC SGQMS Sbjct: 182 RSGN----LSEEVGDALPMDTFALDRCRKMDQKLSDRRTIEWKPKSNKFLFAICTSGQMS 237 Query: 1704 
NHLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKK 1525 NHLICLEKHM FAA+LNRVLVIPS KVDY+F RV+D+D +N CLGRKVV++FEEF+E KK Sbjct: 238 NHLICLEKHMFFAAILNRVLVIPSHKVDYQFSRVIDIDRMNMCLGRKVVISFEEFSEIKK 297 Query: 1524 NHLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXX 1345 +HLHID+FICYFS P PC++DDEH+ KLK+LG+SM KLE+AW ED K P ++T Sbjct: 298 HHLHIDRFICYFSKPNPCYVDDEHISKLKNLGISMGKLESAWNEDTKHPNRKTVSDVESK 357 Query: 1344 XXXXXXV-IAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRD 1168 IA+GD+FFA+VE+EWV QPGGPIAHKC+TLIEP LI LTAQRF+QTFLG++ Sbjct: 358 FSSNNDDVIAVGDIFFANVEQEWVNQPGGPIAHKCQTLIEPSHLIKLTAQRFIQTFLGKN 417 Query: 1167 FIALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQ 988 +IALHFRRHGFLKFCNAK SCFYP+PQ+A+CI R+VERAN PVIYLSTDAA SE LLQ Sbjct: 418 YIALHFRRHGFLKFCNAKQPSCFYPIPQAADCIIRMVERANVPVIYLSTDAAESEHGLLQ 477 Query: 987 SLLVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGS 808 SLLV NGK +PL+KRP RNSAEKWDALLYRHGLE D QVEAMLDKT+CA+S+ FIG+ GS Sbjct: 478 SLLVLNGKPIPLVKRPPRNSAEKWDALLYRHGLEEDSQVEAMLDKTICAMSSTFIGAPGS 537 Query: 807 TFTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 TFTEDILRLRKDWG+AS+CDEYLCQGE PNFI+++E Sbjct: 538 TFTEDILRLRKDWGTASMCDEYLCQGEEPNFISENE 573 >ref|XP_003529861.1| PREDICTED: uncharacterized protein LOC100776069 [Glycine max] Length = 543 Score = 679 bits (1753), Expect = 0.0 Identities = 345/569 (60%), Positives = 424/569 (74%), Gaps = 2/569 (0%) Frame = -2 Query: 2400 MMESSDEEDDRHHLIEQNERRNDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFSLNKRYL 2221 M SSDEEDD +L++ N R+ +P + F ++D S RR++F+L K+Y+ Sbjct: 1 MDSSSDEEDDHRNLVDNNHRKPPSPPP----SAAFHVED----LSSRFRRVSFALQKKYI 52 Query: 2220 LAIVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXXXXXLW 2041 +AI+ LF+L+ + TD LF T S+ K+D+ T+ M+ESELRA W Sbjct: 53 IAILALLFLLLFFSITDFHQLFSTP-SSFKFDSITDRMKESELRAINLLYQQQQSLLTAW 111 Query: 2040 NHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSSHRLGDP 1861 NHTL D +E+LKS L QISLN++IQQ+LL+ H G Sbjct: 112 NHTLRTNAS----------------DPNLLEDLKSSLFKQISLNREIQQILLNPHSTGGN 155 Query: 1860 
L--GSLSVNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMSNHLICL 1687 L +N T + ++RC VDQ LS+RKTIEW P+ K+L AICVSGQMSNHLICL Sbjct: 156 AIEPELDLNATLNGV-VYDRCRTVDQNLSQRKTIEWNPRDGKFLLAICVSGQMSNHLICL 214 Query: 1686 EKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKKNHLHID 1507 EKHM FAA+LNRVLVIPSSKVDY++ RV+D+DHIN+CLG+KVVV+FEEF+ KK HLHID Sbjct: 215 EKHMFFAALLNRVLVIPSSKVDYQYDRVVDIDHINKCLGKKVVVSFEEFSNLKKGHLHID 274 Query: 1506 KFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXXXXXXXX 1327 KF+CYFS PQPC++DDE +KKL +LG++MSK EA W ED +KP+K+T Sbjct: 275 KFLCYFSHPQPCYLDDERLKKLGALGLTMSKPEAVWDEDTRKPKKKTVQDVLGKFSFDDD 334 Query: 1326 VIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDFIALHFR 1147 V+AIGDVF+A+VEREWVMQPGGPIAHKCKTLIEP RLI+LTAQRF+QTFLGR+FIALHFR Sbjct: 335 VMAIGDVFYAEVEREWVMQPGGPIAHKCKTLIEPNRLILLTAQRFIQTFLGRNFIALHFR 394 Query: 1146 RHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQSLLVFNG 967 RHGFLKFCNAK SCFYP+PQ+A+CI RVVE A++P+IYLSTDAA SET LLQSL+V NG Sbjct: 395 RHGFLKFCNAKKPSCFYPIPQAADCILRVVEMADAPIIYLSTDAAESETGLLQSLVVLNG 454 Query: 966 KTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGSTFTEDIL 787 + VPL+ RPARNSAEKWDALLYRH ++GD QVEAMLDKT+CA+S+VFIG+ GSTFTEDIL Sbjct: 455 RPVPLVIRPARNSAEKWDALLYRHNMDGDSQVEAMLDKTICAMSSVFIGAPGSTFTEDIL 514 Query: 786 RLRKDWGSASLCDEYLCQGELPNFIADDE 700 RLRKDWGSAS+CDEYLCQGE PN IA++E Sbjct: 515 RLRKDWGSASMCDEYLCQGEEPNIIAENE 543 >ref|XP_006426814.1| hypothetical protein CICLE_v10025289mg [Citrus clementina] gi|557528804|gb|ESR40054.1| hypothetical protein CICLE_v10025289mg [Citrus clementina] Length = 563 Score = 679 bits (1751), Expect = 0.0 Identities = 343/581 (59%), Positives = 417/581 (71%), Gaps = 16/581 (2%) Frame = -2 Query: 2394 ESSDEEDDRHHLIEQNERR----------NDAPKSPRHHQSTFQIDDDFKSRSPNARRLN 2245 +SSD++DDR LI QN+ + N+ + STF IDD F + P RR Sbjct: 4 DSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDD-FPNAPPIRRRFT 62 Query: 2244 FSL----NKRYLLAIVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXX 2077 F NKRYL A+ LPL I++LYF+ ++++LF + N ++D+ + MRESELRA Sbjct: 
63 FDFKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRALSL 122 Query: 2076 XXXXXXXXXXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQ 1897 LWN + VN +N + KS LL QISLN+QI+ Sbjct: 123 LKQQQSHLLSLWNQSFVNNSYGNNT------------NNPFFQEAKSVLLNQISLNRQIE 170 Query: 1896 QVLLSSHRLGDPLGSLSVNFT-DPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICV 1720 Q+LLS H++ NFT + ++ C K+D + ++T+EWKPKS+K+LFAIC+ Sbjct: 171 QILLSPHKVS--------NFTPNDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICL 222 Query: 1719 SGQMSNHLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEF 1540 SGQMSNHLICLEKHM AA+LNRVLVIPSSK DY++ RVLD++HIN+CLGRKVVV+FE F Sbjct: 223 SGQMSNHLICLEKHMFLAALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENF 282 Query: 1539 AEKKKNHLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAW-TEDVKKPRKRTX 1363 E KKNH HID+F+CYF PQPCF+DDEH+KKLK LG+SM K E W ED +KP KRT Sbjct: 283 MEMKKNHAHIDRFLCYFGLPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTV 342 Query: 1362 XXXXXXXXXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQT 1183 VIA+GD+F+ADVER+WVMQPGGPI H+CKTLIEP RLIM+TAQRFVQT Sbjct: 343 QDIEGKFKTDDDVIAVGDLFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQT 402 Query: 1182 FLGRDFIALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASE 1003 FLG +FIALHFRRHGFLKFCNAK SCFYP+PQ+A+CI R+ ERA +PVIYLSTDAA SE Sbjct: 403 FLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESE 462 Query: 1002 TDLLQSLLVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFI 823 T LLQSL+V NGKT+ L+KRP RNSAEKWD+LLYRH LE D QVEAMLDKT+CA+S VFI Sbjct: 463 TSLLQSLVVLNGKTIALVKRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFI 522 Query: 822 GSFGSTFTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 G+ GSTFTEDI+RLRKDWGS SLCDEYLCQGE PNFIA+DE Sbjct: 523 GASGSTFTEDIMRLRKDWGSTSLCDEYLCQGEEPNFIAEDE 563 >ref|XP_002303337.1| protein-O-fucosyltransferase 2 [Populus trichocarpa] gi|222840769|gb|EEE78316.1| protein-O-fucosyltransferase 2 [Populus trichocarpa] Length = 527 Score = 676 bits (1743), Expect = 0.0 Identities = 357/574 (62%), Positives = 419/574 (72%), Gaps = 9/574 (1%) Frame = -2 Query: 2394 
ESSDEEDDRHHLIEQNERRNDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFSLNKRYLL- 2218 +SSDEEDDR HLIEQN+R+ HHQ N RY L Sbjct: 4 DSSDEEDDREHLIEQNDRK--------HHQ-----------------------NGRYSLF 32 Query: 2217 ---AIVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXXXXX 2047 I LPLFIL L F+TDI+NLF T L D+ + MRESELRA Sbjct: 33 AAAIIFLPLFILFLSFSTDIRNLFSTHLK--VGDSLSIRMRESELRALYLLKKQQLSLFS 90 Query: 2046 LWNHT----LVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSS 1879 LWN T L+ K +++V E+LKS LL QISLNK+IQQVLL+ Sbjct: 91 LWNSTGNSTLLEKD----------------LNSVSFEDLKSALLKQISLNKEIQQVLLAP 134 Query: 1878 HRLGDPLGSLS-VNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMSN 1702 H G+ S S ++F++ G RC KVDQ+ ++RKTIEWKPK NK+LFA+C+SGQMSN Sbjct: 135 HESGNVSSSSSDLDFSNAG-GFVQRCEKVDQRFADRKTIEWKPKPNKFLFALCLSGQMSN 193 Query: 1701 HLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKKN 1522 HLICLEKHM FAA+LNRVLVIPSS+ DY+++RVLD++H+N+CLGRKVVVTFEEF E KN Sbjct: 194 HLICLEKHMFFAALLNRVLVIPSSRFDYQYNRVLDIEHVNDCLGRKVVVTFEEFVEIMKN 253 Query: 1521 HLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXXX 1342 HID+F CYFS P PC++D+EHVKKLK LGVSM KLE+ W ED+KKP K T Sbjct: 254 KPHIDRFFCYFSDPTPCYVDEEHVKKLKGLGVSMGKLESPWKEDIKKPSKLTVKDVEGKF 313 Query: 1341 XXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDFI 1162 VIA+GDVFFADVE EW+MQPGGPIAHKCKTLIEP R+IMLTAQRF+QTFLG +FI Sbjct: 314 VSDDNVIAVGDVFFADVEEEWIMQPGGPIAHKCKTLIEPTRIIMLTAQRFIQTFLGSNFI 373 Query: 1161 ALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQSL 982 ALHFRRHGFLKFCNAK SCFYPVPQ+A+CI RVVERAN+PV+YLSTDAA SET LLQSL Sbjct: 374 ALHFRRHGFLKFCNAKKPSCFYPVPQAADCIARVVERANAPVVYLSTDAAESETGLLQSL 433 Query: 981 LVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGSTF 802 +V NG+TVPL+ RP+RN+AEKWDALLYRHGL+ D QVEAMLDKT+CA+S+VFIG+ GSTF Sbjct: 434 VVVNGRTVPLVTRPSRNAAEKWDALLYRHGLQEDAQVEAMLDKTICAMSSVFIGASGSTF 493 Query: 801 TEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 TEDI RLRK W SAS CDEYLCQGELPN+IA++E Sbjct: 494 TEDIFRLRKGWESASSCDEYLCQGELPNYIAENE 527 >ref|XP_003547949.1| PREDICTED: uncharacterized 
protein LOC548046 [Glycine max] Length = 543 Score = 664 bits (1712), Expect = 0.0 Identities = 337/569 (59%), Positives = 420/569 (73%), Gaps = 2/569 (0%) Frame = -2 Query: 2400 MMESSDEEDDRHHLIEQNERRNDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFSLNKRYL 2221 M SSDEEDD +L++ N R+ P SP F ++D SP RR NF+L K+Y+ Sbjct: 1 MDSSSDEEDDHRNLVDNNHRK--PPSSPA--AVAFHVEDP----SPRFRRANFTLQKKYI 52 Query: 2220 LAIVLPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXXXXXXXLW 2041 AI+ LF+L+ + TD+ LF T+ S+ ++D+ T+ M+ESELRA W Sbjct: 53 FAILAILFLLLFFSITDLHKLFSTT-SSFRFDSLTDRMKESELRAINLLNQQQQALLTAW 111 Query: 2040 NHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLSSHRLGDP 1861 NHTL D +E+LKS + QISLN++IQQ+LL+ H G+ Sbjct: 112 NHTLRTNAS----------------DPNLLEDLKSSIFKQISLNREIQQILLNPHSTGNN 155 Query: 1860 L--GSLSVNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQMSNHLICL 1687 +N T + ++RC VDQ LS+RKTIEW P+ K+L AICVSGQMSNHLICL Sbjct: 156 AIEPEFDLNATLNGV-VYDRCRTVDQNLSQRKTIEWNPRDGKFLLAICVSGQMSNHLICL 214 Query: 1686 EKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEKKKNHLHID 1507 EKH+ FAA+LNRVLVIPSSKVDY++ RV+D+DHIN+CLG+KVVV+FE F+ KK HLHID Sbjct: 215 EKHIFFAALLNRVLVIPSSKVDYQYDRVVDIDHINKCLGKKVVVSFEVFSNLKKGHLHID 274 Query: 1506 KFICYFSAPQPCFMDDEHVKKLKSLGVSMSKLEAAWTEDVKKPRKRTXXXXXXXXXXXXX 1327 KF+CYFS PQPC++DDE +KKL +LG++MSK A W ED + P+K+T Sbjct: 275 KFLCYFSQPQPCYLDDERLKKLGALGLTMSKPVAVWDEDTRNPKKKTVQDVLGKFSFDDD 334 Query: 1326 VIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFLGRDFIALHFR 1147 V+AIGDVF+A+VEREWVMQPGGPIAHKC TLIEP RLI+LTAQRF+QTFLGR+F+ALHFR Sbjct: 335 VMAIGDVFYAEVEREWVMQPGGPIAHKCTTLIEPNRLILLTAQRFIQTFLGRNFVALHFR 394 Query: 1146 RHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETDLLQSLLVFNG 967 RHGFLKFCNAK SCFY + Q+A+CI RVVERA++P+IYLSTDAA SET LLQSL+V NG Sbjct: 395 RHGFLKFCNAKKPSCFYSITQAADCILRVVERADAPIIYLSTDAAESETGLLQSLVVLNG 454 Query: 966 KTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGSFGSTFTEDIL 787 + VPL+ RPARNSAEKWDALLYRH ++GD QVEAMLDK++CA+S+VFIG+ GSTFTEDIL Sbjct: 455 
RPVPLVIRPARNSAEKWDALLYRHRMDGDSQVEAMLDKSICAMSSVFIGAPGSTFTEDIL 514 Query: 786 RLRKDWGSASLCDEYLCQGELPNFIADDE 700 RLRKDWGSAS+CDEYLCQGE PN +A++E Sbjct: 515 RLRKDWGSASMCDEYLCQGEEPNIVAENE 543 >ref|NP_199853.1| O-fucosyltransferase family protein [Arabidopsis thaliana] gi|9758924|dbj|BAB09461.1| unnamed protein product [Arabidopsis thaliana] gi|133778858|gb|ABO38769.1| At5g50420 [Arabidopsis thaliana] gi|332008558|gb|AED95941.1| O-fucosyltransferase family protein [Arabidopsis thaliana] Length = 566 Score = 663 bits (1711), Expect = 0.0 Identities = 347/579 (59%), Positives = 424/579 (73%), Gaps = 15/579 (2%) Frame = -2 Query: 2391 SSDEEDDRHHLIEQNERR---------NDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFS 2239 SSD+E+D HLI QN+ R ++A + +S FQIDD R S Sbjct: 5 SSDDEEDHQHLIPQNDTRIRHREDSVSSNATTIGGNQRSAFQIDDILHRVQ---HRGKIS 61 Query: 2238 LNKRYLLAIV-LPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXX 2062 LNKRY++ V L + I +L+ TD + LF + S+ K D +N ++ESELRA Sbjct: 62 LNKRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKLDPLSNRVKESELRALYLLRQQQ 121 Query: 2061 XXXXXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLS 1882 LWN TLVN +V E++KS + QISLNK+IQ+VLLS Sbjct: 122 LALLSLWNGTLVNPSLNQSENALG--------SSVLFEDVKSAVSKQISLNKEIQEVLLS 173 Query: 1881 SHRLGDPLGSL---SVNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQ 1711 HR + G SVNF S+NRC KVDQKLS+RKT+EWKP+S+K+LFAIC+SGQ Sbjct: 174 PHRSSNYSGGTDVDSVNF------SYNRCRKVDQKLSDRKTVEWKPRSDKFLFAICLSGQ 227 Query: 1710 MSNHLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEK 1531 MSNHLICLEKHM FAA+L+RVLVIPSSK DY++ RV+D++ IN CLGR VVV F++F EK Sbjct: 228 MSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGRNVVVAFDQFKEK 287 Query: 1530 -KKNHLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMS-KLEAAWTEDVKKPRKRTXXX 1357 KKNH ID+FICYFS+PQ C++D+EH+KKLK LG+S+ KLEA W+ED+KKP KRT Sbjct: 288 AKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKKPSKRTVQD 347 Query: 1356 XXXXXXXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFL 1177 VIAIGDVF+AD+E++WVMQPGGPI HKCKTLIEP +LI+LTAQRF+QTFL Sbjct: 348 
VQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQRFIQTFL 407 Query: 1176 GRDFIALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETD 997 G++FIALHFRRHGFLKFCNAK SCFYP+PQ+A CI R+VER+N VIYLSTDAA SET Sbjct: 408 GKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDAAESETS 467 Query: 996 LLQSLLVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGS 817 LLQSL+V +GK VPL+KRP RNSAEKWDALLYRHG+E D QV+AMLDKT+CA+S+VFIG+ Sbjct: 468 LLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMSSVFIGA 527 Query: 816 FGSTFTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 GSTFTEDILRLRKDWG++S CDEYLC+GE PNFIA+DE Sbjct: 528 SGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566 >ref|XP_002865803.1| hypothetical protein ARALYDRAFT_918074 [Arabidopsis lyrata subsp. lyrata] gi|297311638|gb|EFH42062.1| hypothetical protein ARALYDRAFT_918074 [Arabidopsis lyrata subsp. lyrata] Length = 566 Score = 663 bits (1710), Expect = 0.0 Identities = 346/579 (59%), Positives = 426/579 (73%), Gaps = 15/579 (2%) Frame = -2 Query: 2391 SSDEEDDRHHLIEQNERR---------NDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFS 2239 SSD+E+D HLI QN+ R + A + + +S FQI+D + RR S Sbjct: 5 SSDDEEDHQHLIPQNDTRIRHREDPISSTATTTGGNQRSAFQIEDILQRVQ---RRWKIS 61 Query: 2238 LNKRYLLAIV-LPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXX 2062 LNKRY++ V L + I +L+ TD + LF + S+ K D +N ++ESELRA Sbjct: 62 LNKRYVIVFVSLIISIGLLFLLTDPRELFSANFSSFKLDPLSNRVKESELRALYLLRQQQ 121 Query: 2061 XXXXXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLS 1882 LWN TLVN +V E++KS + QISLNK+IQ VLLS Sbjct: 122 LALLSLWNGTLVNPSLNQSENDLR--------SSVLFEDVKSAVSKQISLNKEIQNVLLS 173 Query: 1881 SHRLGDPLGSL---SVNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQ 1711 HR + G SVNF S++RC KVDQKLS+RKT+EWKP+S+K+LFAIC+SGQ Sbjct: 174 PHRSSNYSGGTEVDSVNF------SYDRCRKVDQKLSDRKTVEWKPRSDKFLFAICLSGQ 227 Query: 1710 MSNHLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEK 1531 MSNHLICLEKHM FAA+L+RVLVIPSSK DY++ RV+D++ IN CLGR VVV+F++F EK Sbjct: 228 MSNHLICLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIEGINTCLGRNVVVSFDQFKEK 287 Query: 1530 
-KKNHLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMS-KLEAAWTEDVKKPRKRTXXX 1357 KKNH ID+FICYFS+PQ C++D+EH+KKLK LG+S+ KLEA W+ED+KKP KRT Sbjct: 288 AKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKKPSKRTVQD 347 Query: 1356 XXXXXXXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFL 1177 VIAIGDVF+AD+E++WVMQPGGPI HKCKTLIEP +LI+LTAQRF+QTFL Sbjct: 348 VQTKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQRFIQTFL 407 Query: 1176 GRDFIALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETD 997 G++FIALHFRRHGFLKFCNAK SCFYP+PQ+A CI R+VER+N VIYLSTDAA SET Sbjct: 408 GKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDAAESETS 467 Query: 996 LLQSLLVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGS 817 LLQSL+V +GK VPL+KRP RNSAEKWDALLYRHG+E D QV+AMLDKT+CA+S+VFIG+ Sbjct: 468 LLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMSSVFIGA 527 Query: 816 FGSTFTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 GSTFTEDILRLRKDWG++S CDEYLC+GE PNFIA+DE Sbjct: 528 SGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566 >gb|AAM66093.1| unknown [Arabidopsis thaliana] Length = 566 Score = 662 bits (1709), Expect = 0.0 Identities = 346/579 (59%), Positives = 424/579 (73%), Gaps = 15/579 (2%) Frame = -2 Query: 2391 SSDEEDDRHHLIEQNERR---------NDAPKSPRHHQSTFQIDDDFKSRSPNARRLNFS 2239 SSD+E+D HLI QN+ R ++A + +S FQIDD R S Sbjct: 5 SSDDEEDHQHLIPQNDTRIRHREDSVSSNATTIGGNQRSAFQIDDILHRVQ---HRGKIS 61 Query: 2238 LNKRYLLAIV-LPLFILVLYFTTDIKNLFQTSLSNVKYDASTNHMRESELRAXXXXXXXX 2062 LNKRY++ V L + I +L+ TD + LF + S+ K D +N ++ESELRA Sbjct: 62 LNKRYVIVFVSLIISIGLLFLLTDPRELFAANFSSFKLDPLSNRVKESELRALYLLRQQQ 121 Query: 2061 XXXXXLWNHTLVNKXXXXXXXXXXXXXXXTVIDNVDVENLKSDLLGQISLNKQIQQVLLS 1882 LWN TLVN +V E++KS + QISLNK+IQ+VLLS Sbjct: 122 LALLSLWNGTLVNPSLNQSENALG--------SSVLFEDVKSAVSKQISLNKEIQEVLLS 173 Query: 1881 SHRLGDPLGSL---SVNFTDPSIGSFNRCGKVDQKLSERKTIEWKPKSNKYLFAICVSGQ 1711 HR + G SVNF S+NRC KVDQKLS+RKT+EWKP+S+K+LFAIC+SGQ Sbjct: 174 PHRSSNYSGGTDVDSVNF------SYNRCRKVDQKLSDRKTVEWKPRSDKFLFAICLSGQ 227 Query: 1710 
MSNHLICLEKHMVFAAVLNRVLVIPSSKVDYEFHRVLDVDHINECLGRKVVVTFEEFAEK 1531 MSNHL+CLEKHM FAA+L+RVLVIPSSK DY++ RV+D++ IN CLGR VVV F++F EK Sbjct: 228 MSNHLLCLEKHMFFAALLDRVLVIPSSKFDYQYDRVIDIERINTCLGRNVVVAFDQFKEK 287 Query: 1530 -KKNHLHIDKFICYFSAPQPCFMDDEHVKKLKSLGVSMS-KLEAAWTEDVKKPRKRTXXX 1357 KKNH ID+FICYFS+PQ C++D+EH+KKLK LG+S+ KLEA W+ED+KKP KRT Sbjct: 288 AKKNHFRIDRFICYFSSPQLCYVDEEHIKKLKGLGISIDGKLEAPWSEDIKKPSKRTVQD 347 Query: 1356 XXXXXXXXXXVIAIGDVFFADVEREWVMQPGGPIAHKCKTLIEPGRLIMLTAQRFVQTFL 1177 VIAIGDVF+AD+E++WVMQPGGPI HKCKTLIEP +LI+LTAQRF+QTFL Sbjct: 348 VQMKFKSDDDVIAIGDVFYADMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQRFIQTFL 407 Query: 1176 GRDFIALHFRRHGFLKFCNAKDTSCFYPVPQSANCINRVVERANSPVIYLSTDAAASETD 997 G++FIALHFRRHGFLKFCNAK SCFYP+PQ+A CI R+VER+N VIYLSTDAA SET Sbjct: 408 GKNFIALHFRRHGFLKFCNAKSPSCFYPIPQAAECIARIVERSNGAVIYLSTDAAESETS 467 Query: 996 LLQSLLVFNGKTVPLIKRPARNSAEKWDALLYRHGLEGDPQVEAMLDKTVCALSTVFIGS 817 LLQSL+V +GK VPL+KRP RNSAEKWDALLYRHG+E D QV+AMLDKT+CA+S+VFIG+ Sbjct: 468 LLQSLVVVDGKIVPLVKRPPRNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMSSVFIGA 527 Query: 816 FGSTFTEDILRLRKDWGSASLCDEYLCQGELPNFIADDE 700 GSTFTEDILRLRKDWG++S CDEYLC+GE PNFIA+DE Sbjct: 528 SGSTFTEDILRLRKDWGTSSTCDEYLCRGEEPNFIAEDE 566