BLASTX nr result
ID: Achyranthes22_contig00002031
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Achyranthes22_contig00002031 (2448 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255... 529 e-147 ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citr... 515 e-143 ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608... 493 e-136 emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera] 493 e-136 gb|EOX99406.1| Uncharacterized protein isoform 1 [Theobroma cacao] 491 e-136 ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204... 484 e-134 ref|XP_002517932.1| conserved hypothetical protein [Ricinus comm... 482 e-133 ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310... 478 e-132 ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Popu... 464 e-128 ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600... 463 e-127 ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261... 454 e-124 gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis] 447 e-122 ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arab... 442 e-121 ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] ... 432 e-118 gb|EOX99407.1| Uncharacterized protein isoform 2, partial [Theob... 429 e-117 ref|XP_006283541.1| hypothetical protein CARUB_v10004592mg [Caps... 429 e-117 ref|XP_006412978.1| hypothetical protein EUTSA_v10024944mg [Eutr... 428 e-117 ref|XP_003540357.1| PREDICTED: uncharacterized protein LOC100779... 428 e-117 ref|XP_003541913.1| PREDICTED: uncharacterized protein LOC100807... 428 e-117 gb|EOX99409.1| Uncharacterized protein isoform 4 [Theobroma cacao] 427 e-116 >ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255678 [Vitis vinifera] Length = 520 Score = 529 bits (1362), Expect = e-147 Identities = 285/545 (52%), Positives = 360/545 (66%), Gaps = 10/545 (1%) Frame = +3 Query: 522 MAER-ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSD 698 MA R ELGFLK S +++E+ AR TLRNVR+QGH YV+LREDGKRF+FFCTLCLAPCYS+ Sbjct: 1 MARRTELGFLKTSASSLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSE 60 Query: 699 TVLFDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVPKVN-----LL 863 +VL+DHL GNLH ER AAK TL+ +PWPFNDGVLFF +NS+E +K + N LL Sbjct: 61 SVLYDHLKGNLHSERYAAAKVTLLKSHPWPFNDGVLFF-DNSSENDKHLSIANGNPTRLL 119 Query: 864 DDREPVDSLAIVVHNGVSRTGGNGHVNGDGDRLYDC---YNGDNTDYGERVTDLVIPGVI 1034 + ++LAIV H N HV ++ DC + ++ + G R D++IPGV+ Sbjct: 120 GTHKNDNNLAIVCHGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVM 179 Query: 1035 GKDEISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFA 1214 KDE+++L VR +GFG+I+AR E+DG G+ +IWCEW GK GD + VP H FA Sbjct: 180 IKDEVTELEVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGK-EEPGDGETVMVPDHDFA 238 Query: 1215 IVTFGYYYDLGKQGLFDEIKTLLLTGGEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXX 1394 +VTF Y+Y+LG++GLFD++ ++L S+ GS ++RKKSFSD ED L+ Q Sbjct: 239 VVTFNYHYNLGRKGLFDDVISML-----SSSPTEGSGRKRKKSFSDPEDISESLSNQ-YD 292 Query: 1395 XXXXXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKML 1574 +R ISSKT+RRELRRQQRVA+E+MCDICQHKML Sbjct: 293 SSGEDSLISNSPSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKML 352 Query: 1575 PGKDVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXX 1754 PGKDVATL+N+KTG+L CSSRNV+GAFHVFHTSCLIHWILLCE E+F Sbjct: 353 PGKDVATLMNMKTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSS 412 Query: 1755 XXXXXAKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDEL 1934 +K + GK+ GV + T QICSVFCP+CQGTG+ ++ DEL Sbjct: 413 RRKSGSKCNGKGKD------GVIKPTTL----------QICSVFCPECQGTGI-MIEDEL 455 Query: 1935 EKPTVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFP-VPGDNAEGKVSNLKLLHF 2111 E P +PLSEMFKYKIK DAHR WMK PE+L++CSTGF FP G+ + KVS+LKLLHF Sbjct: 456 EIPNIPLSEMFKYKIKVSDAHRAWMKNPEELKHCSTGFNFPSQSGETVQEKVSSLKLLHF 515 Query: 2112 YRADE 2126 Y ADE Sbjct: 516 YSADE 520 >ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|567910083|ref|XP_006447355.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|567910085|ref|XP_006447356.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|567910087|ref|XP_006447357.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|568831767|ref|XP_006470130.1| PREDICTED: uncharacterized protein LOC102608093 isoform X1 [Citrus sinensis] gi|568831769|ref|XP_006470131.1| PREDICTED: uncharacterized protein LOC102608093 isoform X2 [Citrus sinensis] gi|568831771|ref|XP_006470132.1| PREDICTED: uncharacterized protein LOC102608093 isoform X3 [Citrus sinensis] gi|568831773|ref|XP_006470133.1| PREDICTED: uncharacterized protein LOC102608093 isoform X4 [Citrus sinensis] gi|557549965|gb|ESR60594.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|557549966|gb|ESR60595.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|557549967|gb|ESR60596.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|557549968|gb|ESR60597.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] Length = 523 Score = 515 bits (1326), Expect = e-143 Identities = 286/539 (53%), Positives = 345/539 (64%), Gaps = 8/539 (1%) Frame = +3 Query: 531 RELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDTVLF 710 RELGF K S +++E+ AR TL NVR QGHTYV+LREDGKRF+FFCTLCLAPCYSD VLF Sbjct: 5 RELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSDLVLF 64 Query: 711 DHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVPKVN-----LLDDRE 875 DHL GNLH ERL AAK TL+G NPWPFNDGVLFF +NSNE+EK N LD Sbjct: 65 DHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFF-DNSNEKEKQTTVSNDKLGRSLDYHN 123 Query: 876 PVDSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNT-DYGERVTDLVIPGVIGKDEIS 1052 +LAIV + + GN H +G + +DC NG D D VIPGV KDEI Sbjct: 124 NDSNLAIVKYGEDMKVNGNEH-SGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDEIV 182 Query: 1053 DLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTFGY 1232 DL VR +G G+I+AR++++D + RIWCEWLGK ++I +P H FAIVTF Y Sbjct: 183 DLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPEDEDIVE-IPDHDFAIVTFVY 241 Query: 1233 YYDLGKQGLFDEIKTLLLTG-GEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXXXX 1409 YDLG++GLFD++K LL + E+S +G G+ ++RKKSFSD ED L+ Q Sbjct: 242 NYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRKRKKSFSDPEDVSESLSKQ-YDSCGED 300 Query: 1410 XXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGKDV 1589 +R ISSK RRE+RRQQR+A+E+MCDICQ K+LP KDV Sbjct: 301 SSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPDKDV 360 Query: 1590 ATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXXXX 1769 A LLNLKTG LACSSRN++G FHVFH SCLIHWILLCE E+ P Sbjct: 361 AALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQP-------------- 406 Query: 1770 AKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDELEKPTV 1949 ++ K G KR Q RK E I +QI S+FCP+CQGTG+ I GDELEKPT+ Sbjct: 407 --VTPKVKRRSRRKNGSKRVQARKDGEY-IFTNQISSLFCPECQGTGVNIEGDELEKPTI 463 Query: 1950 PLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGDNA-EGKVSNLKLLHFYRAD 2123 LS+MFKYKIK DA + WMK PE L+NCSTGF FP + + KVS LKLLHFY A+ Sbjct: 464 SLSQMFKYKIKVSDARKAWMKNPEALQNCSTGFYFPSRSEEKFQEKVSPLKLLHFYSAE 522 >ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608093 isoform X5 [Citrus sinensis] Length = 508 Score = 493 bits (1270), Expect = e-136 Identities = 272/512 (53%), Positives = 328/512 (64%), Gaps = 7/512 (1%) Frame = +3 Query: 531 RELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDTVLF 710 RELGF K S +++E+ AR TL NVR QGHTYV+LREDGKRF+FFCTLCLAPCYSD VLF Sbjct: 5 RELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSDLVLF 64 Query: 711 DHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVPKVN-----LLDDRE 875 DHL GNLH ERL AAK TL+G NPWPFNDGVLFF +NSNE+EK N LD Sbjct: 65 DHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFF-DNSNEKEKQTTVSNDKLGRSLDYHN 123 Query: 876 PVDSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNT-DYGERVTDLVIPGVIGKDEIS 1052 +LAIV + + GN H +G + +DC NG D D VIPGV KDEI Sbjct: 124 NDSNLAIVKYGEDMKVNGNEH-SGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDEIV 182 Query: 1053 DLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTFGY 1232 DL VR +G G+I+AR++++D + RIWCEWLGK ++I +P H FAIVTF Y Sbjct: 183 DLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPEDEDIVE-IPDHDFAIVTFVY 241 Query: 1233 YYDLGKQGLFDEIKTLLLTG-GEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXXXX 1409 YDLG++GLFD++K LL + E+S +G G+ ++RKKSFSD ED L+ Q Sbjct: 242 NYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRKRKKSFSDPEDVSESLSKQ-YDSCGED 300 Query: 1410 XXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGKDV 1589 +R ISSK RRE+RRQQR+A+E+MCDICQ K+LP KDV Sbjct: 301 SSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPDKDV 360 Query: 1590 ATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXXXX 1769 A LLNLKTG LACSSRN++G FHVFH SCLIHWILLCE E+ P Sbjct: 361 AALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQP-------------- 406 Query: 1770 AKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDELEKPTV 1949 ++ K G KR Q RK E I +QI S+FCP+CQGTG+ I GDELEKPT+ Sbjct: 407 --VTPKVKRRSRRKNGSKRVQARKDGEY-IFTNQISSLFCPECQGTGVNIEGDELEKPTI 463 Query: 1950 PLSEMFKYKIKAIDAHRTWMKCPEKLENCSTG 2045 LS+MFKYKIK DA + WMK PE L+NCSTG Sbjct: 464 SLSQMFKYKIKVSDARKAWMKNPEALQNCSTG 495 >emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera] Length = 896 Score = 493 bits (1269), Expect = e-136 Identities = 262/509 (51%), Positives = 333/509 (65%), Gaps = 8/509 (1%) Frame = +3 Query: 555 SVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDTVLFDHLSGNLH 734 S +++E+ AR TLRNVR+QGH YV+LREDGKRF+FFCTLCLAPCYS++VL+DHL GNLH Sbjct: 349 SASSLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSESVLYDHLKGNLH 408 Query: 735 KERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVPKVN-----LLDDREPVDSLAIV 899 ER AAK TL+ +PWPFNDGVLFF +NS+E +K + N LL + ++LAIV Sbjct: 409 SERYAAAKVTLLKSHPWPFNDGVLFF-DNSSENDKHLSIANGNPTRLLGTHKNDNNLAIV 467 Query: 900 VHNGVSRTGGNGHVNGDGDRLYDC---YNGDNTDYGERVTDLVIPGVIGKDEISDLHVRL 1070 H N HV ++ DC + ++ + G R D++IPGV+ KDE+++L VR Sbjct: 468 CHGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVMIKDEVTELEVRF 527 Query: 1071 MGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTFGYYYDLGK 1250 +GFG+I+AR E+DG G+ +IWCEW GK GD + VP H FA+VTF Y+Y+LG+ Sbjct: 528 LGFGQIAARFFEKDGVSKGISKIWCEWFGK-EEPGDGETVMVPDHDFAVVTFNYHYNLGR 586 Query: 1251 QGLFDEIKTLLLTGGEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXXXXXXXXXXX 1430 +GLFD++ ++L S+ GS ++RKKSFSD ED L+ Q Sbjct: 587 KGLFDDVISML-----SSSPTEGSGRKRKKSFSDPEDISESLSNQ-YDSSGEDSLISNSP 640 Query: 1431 XXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGKDVATLLNLK 1610 +R ISSKT+RRELRRQQRVA+E+MCDICQHKMLPGKDVATL N+K Sbjct: 641 SPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLPGKDVATLXNMK 700 Query: 1611 TGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXXXXAKLSQLG 1790 TG+L CSSRNV+GAFHVFHTSCLIHWILLCE E+F +K + G Sbjct: 701 TGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSRRKSGSKCNGKG 760 Query: 1791 KECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDELEKPTVPLSEMFK 1970 K+ GV + T QICSVFCP+CQGTG+ ++ DELE P +PLSEMFK Sbjct: 761 KD------GVIKPTTL----------QICSVFCPECQGTGI-MIEDELEIPNIPLSEMFK 803 Query: 1971 YKIKAIDAHRTWMKCPEKLENCSTGFTFP 2057 YKIK DAHR WMK PE+L++CSTGF FP Sbjct: 804 YKIKVSDAHRAWMKNPEELKHCSTGFNFP 832 >gb|EOX99406.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 517 Score = 491 bits (1263), Expect = e-136 Identities = 273/542 (50%), Positives = 336/542 (61%), Gaps = 7/542 (1%) Frame = +3 Query: 522 MAER-ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSD 698 MAER ELG + S ++KE+ AR TL NVR QGHTY++LREDGKRF+FFCTLCLAPCYSD Sbjct: 1 MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60 Query: 699 TVLFDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVV----PKVNLLD 866 +VL DHL G+LH RL AAK TL+G NPWPFNDGVLFF + +E+++ + LL+ Sbjct: 61 SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLE 120 Query: 867 DREPVDSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGKDE 1046 D+LAIV + G + +VN C GD +DL+IPGV+ KDE Sbjct: 121 FHNNDDNLAIVEYVGSEVSSYRKNVN--------CRAGD--------SDLLIPGVLIKDE 164 Query: 1047 ISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTF 1226 ISDL VR +GFGKI+AR E+DG + RIWCEWLGK D+ P HGFA+VTF Sbjct: 165 ISDLKVRFIGFGKIAARFCEKDGVLNEISRIWCEWLGKEVPRNDD-KLKAPKHGFAVVTF 223 Query: 1227 GYYYDLGKQGLFDEIKTLLLTGGEES-NDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXX 1403 Y DLG++GL D++K+LL +G +G+ ++++RKKSFSD ED L+ Q Sbjct: 224 VYNCDLGRKGLLDDVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQ-YDSSG 282 Query: 1404 XXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGK 1583 +R ISSK +RRELRRQQR+A+E+MCDICQ KMLP K Sbjct: 283 EDSSASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEK 342 Query: 1584 DVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXX 1763 DVATL+NL TG+L CSSRNV+GAFHVFHTSCLIHWILLCE E Sbjct: 343 DVATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRK 402 Query: 1764 XXAKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDELEKP 1943 AK + +GK+ G I SV CP+CQGTG+ + GDELEKP Sbjct: 403 NGAKSNDMGKDGETKATGT----------------LISSVLCPECQGTGIDVEGDELEKP 446 Query: 1944 TVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTF-PVPGDNAEGKVSNLKLLHFYRA 2120 V LS+MF+YKIK DA R WMK PE LENCSTGF F G+ + K+ LKLLHFY A Sbjct: 447 DVSLSQMFRYKIKVSDARRAWMKSPEMLENCSTGFHFRSQSGEMVQEKILPLKLLHFYSA 506 Query: 2121 DE 2126 D+ Sbjct: 507 DK 508 >ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204451 [Cucumis sativus] gi|449475785|ref|XP_004154550.1| PREDICTED: uncharacterized LOC101204451 [Cucumis sativus] Length = 525 Score = 484 bits (1246), Expect = e-134 Identities = 279/549 (50%), Positives = 349/549 (63%), Gaps = 16/549 (2%) Frame = +3 Query: 522 MAER-ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSD 698 MA R ELGF K + +++E+ AR LRNVR QGHTYV+LRE+GK+F+FFCTLCLAPCYSD Sbjct: 1 MARRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELRENGKKFIFFCTLCLAPCYSD 60 Query: 699 TVLFDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKV-VPKVN---LLD 866 +VLF HL G LH ERL AAK TL+G NPWPF+DGVLFF+ + +V + N LL+ Sbjct: 61 SVLFSHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPIEGDNQVGISNDNHERLLE 120 Query: 867 DREPVDSLAIVVHNGVSRTGGN--GHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGK 1040 ++LAIV + G S+ GN NG+ + DC + +N + G LVIPGV+ K Sbjct: 121 YNNNDNNLAIVKYVGNSKGNGNRQEEFNGNMRNVEDC-SFENLNDGGESCPLVIPGVLIK 179 Query: 1041 DEISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIV 1220 +EISD+ VR +G+G+I+AR E+DG +G+ RIWCEWLGK+ +N+ VP H +AI+ Sbjct: 180 EEISDIKVRELGYGQIAARFTEKDGIFSGVSRIWCEWLGKVNDGIENMVK-VPEHNYAII 238 Query: 1221 TFGYYYDLGKQGLFDEIKTLLLT--GGEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXX 1394 TF Y DLG++GL D++K LL + G E ND N K RKKSFSD ED G L Sbjct: 239 TFTYNVDLGRKGLLDDVKLLLSSSPGAESQNDENRQVK-RKKSFSDPED-GSLSMSPQYD 296 Query: 1395 XXXXXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKML 1574 + + ++ +K +RRELRRQQR+A+E+MCDICQ K+L Sbjct: 297 SSGEDSSASNCVMSSLSLDGYDDQILSTTVMLNKAVRRELRRQQRLAAERMCDICQQKIL 356 Query: 1575 PGKDVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXX 1754 KDVATLLN+KTGRLACSSRNV+G FHVFHTSCLIHWILLCE E+ +K Sbjct: 357 THKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEISVK----------- 405 Query: 1755 XXXXXAKLSQLGKECRPDTYGVKRKQTRKGS------EEGIVCDQICSVFCPDCQGTGLQ 1916 LG Y +RK+ KG+ E + QI SVFCP CQGTG+ Sbjct: 406 ---------DLGGSKVRRRY--RRKKKTKGNKHIKDGETRQIKTQIDSVFCPACQGTGIT 454 Query: 1917 IVGDELEKPTVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPV-PGDNAEGKVSN 2093 I GD+LEKPTVPLSE+FKYKIK DA R WMK PE L+NCSTGF FP P + + V Sbjct: 455 IDGDDLEKPTVPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFQFPYQPDETIQENVKP 514 Query: 2094 LKLLHFYRA 2120 LKLLHFY A Sbjct: 515 LKLLHFYGA 523 >ref|XP_002517932.1| conserved hypothetical protein [Ricinus communis] gi|223542914|gb|EEF44450.1| conserved hypothetical protein [Ricinus communis] Length = 509 Score = 482 bits (1241), Expect = e-133 Identities = 273/534 (51%), Positives = 329/534 (61%), Gaps = 3/534 (0%) Frame = +3 Query: 534 ELGFLKPSVPN-VKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDTVLF 710 ELGF K N +KE+ AR TL NVR +GH YV+LREDGKRF+FFCTLCLAPCYSD VLF Sbjct: 6 ELGFTKTGGANSLKEQLARTTLNNVRSKGHPYVELREDGKRFIFFCTLCLAPCYSDAVLF 65 Query: 711 DHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVPKVNLLDDREPVDSL 890 DHL GNLH ERL A TL+ ENPWPF+DGV FF+ +S E+++V K + SL Sbjct: 66 DHLKGNLHTERLSTATLTLLKENPWPFSDGVHFFDTSSENEKQLVIKNDNESRGNGNSSL 125 Query: 891 AIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGKDEISDLHVRL 1070 AIV + GG+ GD D C N D D G R++DL+I GV+ KD+ISDL R Sbjct: 126 AIV------KYGGSLKPTGDEDT--GC-NKDANDNG-RISDLLIQGVLVKDDISDLQARF 175 Query: 1071 MGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTFGYYYDLGK 1250 MG+G+I ARL+E+DG + RIWCEWLGK + D + V H FA+VTF Y YDLG+ Sbjct: 176 MGYGRIGARLIEKDGNSNDISRIWCEWLGK-NTPCDLDKAKVLDHEFAVVTFAYNYDLGR 234 Query: 1251 QGLFDEIKTLLLTGGEESNDGNGSTKR-RKKSFSDSEDAGGLLTYQCXXXXXXXXXXXXX 1427 +GL D++K LL + + +D G T R RKKSFSD ED + Q Sbjct: 235 KGLLDDVKLLLSSSPVQESDNQGGTNRKRKKSFSDPEDVSESFSNQYDSSGEESLTSIGG 294 Query: 1428 XXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGKDVATLLNL 1607 S++ISSKTLRRELRRQ +A+E+MCDICQ K+LP KDVATL+N+ Sbjct: 295 PPTRLLLDRHDDQFLHSKVISSKTLRRELRRQHHIAAERMCDICQQKILPEKDVATLVNM 354 Query: 1608 KTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXXXXAKLSQL 1787 TG+LACSSRN +G +HVFHTSCLIHWILL E EM +S Sbjct: 355 NTGKLACSSRNTYGQYHVFHTSCLIHWILLSEYEM----------------ARNQSVSPK 398 Query: 1788 GKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDELEKPTVPLSEMF 1967 G+ G K K + +QI SVFCP+CQGTG + DE E PT+PLSEMF Sbjct: 399 GRRKSRRKNGTKSSHVEKVK---ALNNQISSVFCPECQGTGAILEKDERELPTIPLSEMF 455 Query: 1968 KYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGDNA-EGKVSNLKLLHFYRADE 2126 KYKIK D R WMK PE LENCS GF FP + A + KV LKLLHFYRADE Sbjct: 456 KYKIKVGDGRRAWMKSPEVLENCSIGFHFPSQSEGAVQAKVLPLKLLHFYRADE 509 >ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310040 [Fragaria vesca subsp. vesca] Length = 525 Score = 478 bits (1230), Expect = e-132 Identities = 263/545 (48%), Positives = 342/545 (62%), Gaps = 14/545 (2%) Frame = +3 Query: 534 ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDTVLFD 713 ++G K + +++E+ R LRNVR QGH+YV++REDGK+F+FFCTLCLAPCYSD VLFD Sbjct: 6 DVGVPKTNACSLREQATRTILRNVRSQGHSYVEVREDGKKFIFFCTLCLAPCYSDKVLFD 65 Query: 714 HLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVP----KVNLLDDREPV 881 HL GNLH ERL AAK TL+ NPWPFNDGV+FFNN+ ++ VV K +L+ + Sbjct: 66 HLKGNLHNERLAAAKVTLLRPNPWPFNDGVVFFNNSYETDKGVVTPDDNKCRMLESHDNE 125 Query: 882 DSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYN--------GDNTDYGERVTDLVIPGVIG 1037 ++LAIV + G +T G H DG + + GD+T G + + +VIPG++ Sbjct: 126 NNLAIVKYGGNLKTNGYDHCGVDGLECNEYIDLQGLQSNVGDSTADGAK-SSVVIPGIVV 184 Query: 1038 KDEISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAI 1217 +DEI+DL VR +G G+I+AR L +D G+ RIWCEWLG +++ + VP H FA+ Sbjct: 185 RDEITDLEVREVGLGEIAARFLGKD----GIGRIWCEWLGVKSIDSEDLCN-VPEHDFAV 239 Query: 1218 VTFGYYYDLGKQGLFDEIKTLLLTGGE-ESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXX 1394 VTF Y DLG++GL D+++ LL + ES +G G+ +RKKSFSD ED L+ Q Sbjct: 240 VTFSYNIDLGRKGLLDDVRMLLSSSPTIESGNGEGTGCKRKKSFSDPEDISDSLSNQ-YE 298 Query: 1395 XXXXXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKML 1574 +R I +K++RRELRRQQR+AS +MCDICQ +ML Sbjct: 299 SFGEDSSASSGTASRLLLDHYDDQLLNTRFILNKSIRRELRRQQRLASGRMCDICQQRML 358 Query: 1575 PGKDVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXX 1754 PGKDVATL+NLKTG+LACSSRNV+GAFHVFHTSCLIHWILLCE E+ Sbjct: 359 PGKDVATLMNLKTGKLACSSRNVNGAFHVFHTSCLIHWILLCEVEVI------------- 405 Query: 1755 XXXXXAKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDEL 1934 + K R K K ++ + QI SVFCP+CQGTG+ + GD+L Sbjct: 406 -----TNQNTGSKARRRSRRKTAAKCNGKDAQLKSLSPQIYSVFCPECQGTGIVVDGDDL 460 Query: 1935 EKPTVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFP-VPGDNAEGKVSNLKLLHF 2111 EKP +PLS+MF+YKIK DA R WMK PE L+NCSTGF FP + + KV LKLL F Sbjct: 461 EKPNLPLSQMFRYKIKVSDARRAWMKSPEMLQNCSTGFHFPSLNAAGIQEKVKTLKLLRF 520 Query: 2112 YRADE 2126 YRA E Sbjct: 521 YRAHE 525 >ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa] gi|550325787|gb|EEE95821.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa] Length = 513 Score = 464 bits (1195), Expect = e-128 Identities = 263/540 (48%), Positives = 324/540 (60%), Gaps = 8/540 (1%) Frame = +3 Query: 531 RELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDTVLF 710 RE+GF K + +++E+ AR TL VR +GH Y++LREDGKRF+FFCTLCL+PCYSDT+L Sbjct: 5 REVGFPKTTASSLREQLARTTLSRVRARGHPYLELREDGKRFIFFCTLCLSPCYSDTILL 64 Query: 711 DHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVPK-----VNLLDDRE 875 DHL GNLH ERL AAKATL+ NPWPF+DG+ FF+ +S EE++ K L E Sbjct: 65 DHLRGNLHTERLSAAKATLLKPNPWPFSDGIHFFDASSGNEEQLAIKDGKESSRFLKFEE 124 Query: 876 PVDSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGKDEISD 1055 D+LAIV + + G D +N + +DLVIP V K+E+SD Sbjct: 125 NSDNLAIVKYVENLKPG------------CDTVVDENLSGSDEGSDLVIPSVRLKEEVSD 172 Query: 1056 LHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTFGYY 1235 L L+G G+I+AR+ E+ + RIWCEWLGK S+ D V H F +VTF Y Sbjct: 173 LKATLVGSGQIAARMYEKKDGSNEISRIWCEWLGKK-SSNDEDKVKVLDHDFGVVTFAYD 231 Query: 1236 YDLGKQGLFDEIKTLLLTGGE--ESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXXXX 1409 Y+LGK GLFD++K LL + ND G+ K RK+S S+ ED LT Q Sbjct: 232 YELGKSGLFDDVKLLLSSSAPALTENDERGNWK-RKRSVSEPEDVSRSLTNQ-YGLCEEE 289 Query: 1410 XXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGKDV 1589 +R IS+KT+RRE+R+QQR+A+EKMCDICQ KMLP KDV Sbjct: 290 SSKTTCASSNLVLDRYDDQLMHTRFISNKTVRREVRKQQRIAAEKMCDICQQKMLPEKDV 349 Query: 1590 ATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXXXX 1769 ATL N KTG+LACSSRNV+GAFHVFHTSCLIHWIL CE E+ Sbjct: 350 ATLWNRKTGKLACSSRNVYGAFHVFHTSCLIHWILYCEFEIVRN---------------- 393 Query: 1770 AKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDELEKPTV 1949 +S G G K T K ++ + I SVFCPDCQGTG+ I GDE EKP Sbjct: 394 QTVSTKGGRRSRKKNGTKSNTTGKDGTVNVLPNPIVSVFCPDCQGTGVNIEGDEFEKPLT 453 Query: 1950 PLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFP-VPGDNAEGKVSNLKLLHFYRADE 2126 PLSEMFKYKIK + HR WMK PE LENCSTGF FP G+ + KV LKLLHFYR +E Sbjct: 454 PLSEMFKYKIKVSEGHRGWMKNPEILENCSTGFHFPSQSGEPVQEKVLPLKLLHFYRPEE 513 >ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600129 [Solanum tuberosum] Length = 521 Score = 463 bits (1192), Expect = e-127 Identities = 271/541 (50%), Positives = 337/541 (62%), Gaps = 7/541 (1%) Frame = +3 Query: 522 MAERELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDT 701 MA R+L F + S N+KE+ R TL+NVR QGH YV+LREDGKR VFFCTLC +PCYSD+ Sbjct: 1 MAGRQLDFPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLVFFCTLCHSPCYSDS 60 Query: 702 VLFDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVPKVNLLDDR--- 872 VLF+HL GNLH E L AAKATL+ NPWPFNDGVLFFN+ E++K P VN+ R Sbjct: 61 VLFNHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLFFND--PEQDKHSPNVNVGKSRLVD 118 Query: 873 ---EPVDSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGKD 1043 E SLAIV + R G+ +V + Y + + T GE LVIPGV+ KD Sbjct: 119 TCLEDESSLAIVECDDNLRHNGDTYVT---EYEYCLLDSELTGNGES-EYLVIPGVLCKD 174 Query: 1044 EISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVT 1223 E+SDL V+ +G GKI+AR+ R ++RIWCEWL K S D +S VP H FA+VT Sbjct: 175 ELSDLEVKHIGIGKIAARISVRGIDSKKIRRIWCEWLVKKDS-DDMDTSVVPDHDFAVVT 233 Query: 1224 FGYYYDLGKQGLFDEIKTLLLTGGEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXX 1403 F Y Y+LG++ L D+ L + ES + +G+ KR++KSFSD ED L+ C Sbjct: 234 FPYNYNLGRKPLLDDRFLLPSSPYSESEETSGTRKRKRKSFSDPEDFSESLSNHC-DSSG 292 Query: 1404 XXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGK 1583 SRIISSKT+RRELR+QQRVASE+MCDICQ KMLPGK Sbjct: 293 EESQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGK 352 Query: 1584 DVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXX 1763 DVATLL+ K+G+L CSSRN+ GAFH+FH SCLIHWIL CE + ++KP Sbjct: 353 DVATLLSWKSGKLMCSSRNMTGAFHLFHVSCLIHWILQCELQTYVKP------------V 400 Query: 1764 XXAKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDELEKP 1943 K+ K G K K +E +I SVFCP+CQGTG+ I GDELEKP Sbjct: 401 DEPKMETKAKRRSKRKTGTKHNAKEK-EDEIKSARRINSVFCPECQGTGIIIEGDELEKP 459 Query: 1944 TVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGDN-AEGKVSNLKLLHFYRA 2120 V LSE++++KIK DA + WMK PE L+NCSTGF P D+ + VS LKLLHFYRA Sbjct: 460 PVSLSEVYRHKIKLSDARKAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRA 519 Query: 2121 D 2123 + Sbjct: 520 N 520 >ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261554 [Solanum lycopersicum] Length = 526 Score = 454 bits (1167), Expect = e-124 Identities = 264/544 (48%), Positives = 333/544 (61%), Gaps = 10/544 (1%) Frame = +3 Query: 522 MAERELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDT 701 MA ++L + S N+KE+ R TL+NVR QGH YV+LREDGKR +FFCTLC +PCYSD+ Sbjct: 1 MAGKQLDVPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLIFFCTLCHSPCYSDS 60 Query: 702 VLFDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNN-NSNEEEKVVPKVNLLDDR-- 872 VLF+HL GNLH E L AAKATL+ NPWPFNDGVLFFN+ ++++K P VN+ R Sbjct: 61 VLFNHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLFFNDPEQDKQDKQSPNVNVGKSRLV 120 Query: 873 ----EPVDSLAIVVHNGVSRTGGNGHVNGDGDRLYD--CYNGDNTDYGERVTDLVIPGVI 1034 E S+AIV ++ R + +V+ L D + +DY LVIPGV+ Sbjct: 121 DTCLEDESSVAIVEYDDNLRHNEDTYVSEYEYGLLDSELIGNEESDY------LVIPGVL 174 Query: 1035 GKDEISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFA 1214 KDE+SDL V+ +G GKI+AR+ R ++RIWCEWL K S D +S VP H FA Sbjct: 175 CKDELSDLEVKHIGIGKIAARISVRGIDSKSIRRIWCEWLAKKDS-DDMDTSVVPDHDFA 233 Query: 1215 IVTFGYYYDLGKQGLFDEIKTLLLTGGEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXX 1394 +VTF Y Y+LG+ L D+ L + ES + + + KR++KSFSD ED L+ C Sbjct: 234 VVTFPYNYNLGRSPLLDDRFLLPSSPYSESEETSVTGKRKRKSFSDPEDFSESLSNHC-D 292 Query: 1395 XXXXXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKML 1574 SRIISSKT+RRELR+QQRVASE+MCDICQ KML Sbjct: 293 SSGEESQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKML 352 Query: 1575 PGKDVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXX 1754 PGKDVATLL+ K+G+L CSSRN+ GAFH+FH SCLIHWIL CE + +KP Sbjct: 353 PGKDVATLLSWKSGKLMCSSRNMSGAFHLFHVSCLIHWILQCELQTSVKP---------- 402 Query: 1755 XXXXXAKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDEL 1934 K+ K G K K +E +I SVFCP+CQGTG+ I GDEL Sbjct: 403 --VDEPKMEPKAKRRSKKKTGTKHNAKEK-EDETKSARRINSVFCPECQGTGICIEGDEL 459 Query: 1935 EKPTVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGDN-AEGKVSNLKLLHF 2111 EKP V LSE+++ KIK DA + WMK PE L+NCSTGF P D+ + VS LKLLHF Sbjct: 460 EKPPVSLSEVYRLKIKLSDARKAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHF 519 Query: 2112 YRAD 2123 YRA+ Sbjct: 520 YRAN 523 >gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis] Length = 638 Score = 447 bits (1150), Expect = e-122 Identities = 249/533 (46%), Positives = 325/533 (60%), Gaps = 21/533 (3%) Frame = +3 Query: 534 ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDTVLFD 713 EL K + ++K++ R LRNVR QGHTYV+LREDGK+ +FFCTLCLAPCYSD VLFD Sbjct: 14 ELAVSKTTSCSLKDQAKRTILRNVRSQGHTYVELREDGKKSIFFCTLCLAPCYSDCVLFD 73 Query: 714 HLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNN-NSNEEEKVVPKVN---LLDDREPV 881 HL GNLH +RL AK TL+G NPWPFNDGV+FFNN N+++ V+ N LL+ ++ Sbjct: 74 HLKGNLHNQRLSTAKVTLLGPNPWPFNDGVVFFNNPTENDDDTVISNGNQSRLLESQDSE 133 Query: 882 DSLAIVVHNGVSRTGGNGHVNGD--GDRLYDCYNGDNTDYGERVTDLVIPGVIGKDEISD 1055 ++LAIV + + NGH+ D G + + + N ++IPGV DEI++ Sbjct: 134 NNLAIVTYGENLESCANGHIMVDELGHQNENPDSAGNLAGSGENCAVLIPGVRAGDEIAN 193 Query: 1056 LHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTFGY- 1232 + VR +G+G IS R E+DG + RIWCEWLGK ++ VP H FAIVTF Y Sbjct: 194 VEVREVGYGLISVRFREKDGVSNDISRIWCEWLGKKTIEDEDFLK-VPEHDFAIVTFSYN 252 Query: 1233 YYDLGKQGLFDEIKTLLLTG-GEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXXXX 1409 + LG+ GL D++K LL + E +G+ S+++R+KSFSD ED+ L+ Q Sbjct: 253 NFSLGRMGLHDDVKALLCSSPAAEMQNGDVSSRKRRKSFSDPEDSSENLSNQ---YDSCG 309 Query: 1410 XXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGKDV 1589 + +R IS+K +RRELRRQQR+A+E+MCDICQHKMLPGKDV Sbjct: 310 EDSSASAVTSLMLDQYDDQLLQTRFISNKAIRRELRRQQRIAAERMCDICQHKMLPGKDV 369 Query: 1590 ATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXXXX 1769 ATL+N+KTGRLACSSRN +GAFH+FHTSCLIHW+LLCE E Sbjct: 370 ATLMNVKTGRLACSSRNTNGAFHLFHTSCLIHWVLLCEVEKCTN---------------- 413 Query: 1770 AKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICS-------------VFCPDCQGTG 1910 + + VKR+ RK + + C+++ + V CP+CQGTG Sbjct: 414 ----------QSEAPKVKRRSRRKAASK---CNEVLNDSEVKAFRTPINRVICPECQGTG 460 Query: 1911 LQIVGDELEKPTVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGD 2069 I G++ EKPTVPLS+MFKYKIK DA R WMK PE L NCSTGF FP P + Sbjct: 461 TMIDGED-EKPTVPLSKMFKYKIKVSDARRAWMKSPEVLGNCSTGFHFPSPAE 512 >ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arabidopsis lyrata subsp. lyrata] gi|297315349|gb|EFH45772.1| hypothetical protein ARALYDRAFT_491947 [Arabidopsis lyrata subsp. lyrata] Length = 517 Score = 442 bits (1137), Expect = e-121 Identities = 240/554 (43%), Positives = 328/554 (59%), Gaps = 20/554 (3%) Frame = +3 Query: 522 MAER-ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSD 698 MAE+ ELG K S+ N+KE+ AR TL+N+RLQGHTY++LREDGKRFVFFCTLCLAPCYSD Sbjct: 1 MAEKKELGLPKSSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59 Query: 699 TVLFDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVPKVNLLDDREP 878 T+L HL+GNLHKERL A+ TL+G NPWPF+DGVLFF++++ EEE+ P Sbjct: 60 TILLGHLNGNLHKERLACARLTLLGTNPWPFSDGVLFFDSSTGEEEEKTP---------- 109 Query: 879 VDSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGER------------VTDLVI 1022 V G S G GH + D Y+ + + G + DL+I Sbjct: 110 -------VSGGASVPGTLGHCSDDDRFAIVKYDNNKANGGNQPAAVTDDEPSHSTDDLLI 162 Query: 1023 PGVIGKDEISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPS 1202 GV+ K+ D+ + +GFG+I+ARL E G + ++WCEWLG G + D + +P Sbjct: 163 SGVLIKERTLDVEAKFIGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPS-DEEKATIPE 221 Query: 1203 HGFAIVTFGYYYDLGKQGLFDEIKTLLLTGGEESNDGNGSTKRRKKSFSDSEDAGGLLTY 1382 H FAIVTF Y+Y+LG+ GL D+ LL T ES +G S ++RKKSFSD ED L Sbjct: 222 HDFAIVTFSYFYNLGRLGLLDDPSRLLTTSQSESGNGEDSGRKRKKSFSDPEDTSESLCN 281 Query: 1383 QCXXXXXXXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQ 1562 Q A R++ +KT+RRELRRQQR+ SE++C++C+ Sbjct: 282 QYDSSEEVSSGHNSNSSRA-LIADYDDSLMSKRVVKNKTVRRELRRQQRIFSERICEVCK 340 Query: 1563 HKMLPGKDVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXX 1742 KMLPGKD A +LN+KTG LAC SRN+ GAFH+FH SC++HW L CE+E+ Sbjct: 341 QKMLPGKDAAAILNMKTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEIL--------- 391 Query: 1743 XXXXXXXXXAKLSQLGKE-CRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQI 1919 +S GK+ C + G + + + + V QI SVFCP+CQGTG+ I Sbjct: 392 -------GNKMVSGKGKKRCTKHSSGQTGVKWNELAND--VSWQIFSVFCPECQGTGINI 442 Query: 1920 VGDELEKPTVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGDNA------EG 2081 G +E+ T PLS+ +++++K + + W+K PEKL+NCSTGF FP D + E Sbjct: 443 EGGVIERDTFPLSQTWRFQVKVSEGRKAWVKNPEKLKNCSTGFHFPQQADESGQIPVQEE 502 Query: 2082 KVSNLKLLHFYRAD 2123 +V +KL+ FYR + Sbjct: 503 RVQMMKLVRFYRVE 516 >ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] gi|145334149|ref|NP_001078455.1| uncharacterized protein [Arabidopsis thaliana] gi|7269680|emb|CAB79628.1| putative protein [Arabidopsis thaliana] gi|110742700|dbj|BAE99261.1| hypothetical protein [Arabidopsis thaliana] gi|332660060|gb|AEE85460.1| uncharacterized protein AT4G28260 [Arabidopsis thaliana] gi|332660061|gb|AEE85461.1| uncharacterized protein AT4G28260 [Arabidopsis thaliana] Length = 516 Score = 432 bits (1112), Expect = e-118 Identities = 236/555 (42%), Positives = 328/555 (59%), Gaps = 21/555 (3%) Frame = +3 Query: 522 MAER-ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSD 698 MAE+ ELG KPS+ N+KE+ AR TL+N+RLQGHTY++LREDGKRFVFFCTLCLAPCYSD Sbjct: 1 MAEKKELGLPKPSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59 Query: 699 TVLFDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVP------KVNL 860 T+L HL+GNLHKERL A+ TL+G NPWPF+DGVLFF++++ EEE+ P + Sbjct: 60 TILLGHLNGNLHKERLACARITLLGTNPWPFSDGVLFFDSSTGEEEEKSPVSGGEGVPDT 119 Query: 861 LDDREPVDSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGK 1040 L+ + AIV ++ N N GD + D + DL+I GV+ K Sbjct: 120 LEHCSDDERFAIVKYD-------NNKTN--GDNVPAAVTDDEPSHA--ADDLLISGVLIK 168 Query: 1041 DEISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIV 1220 + D+ + +GFG+I+ARL E G + ++WCEWLG G + D + +P H FAIV Sbjct: 169 ERTLDVEAKFIGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPS-DEEKATIPEHDFAIV 227 Query: 1221 TFGYYYDLGKQGLFDEIKTLLLTGGEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXX 1400 TF Y+Y+LG+ GL D+ LL + ES +G S ++RKKSFSD ED L Q Sbjct: 228 TFSYFYNLGRLGLLDDPGRLLTSSQSESGNGEDSGRKRKKSFSDPEDTSESLCNQ-YDSS 286 Query: 1401 XXXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPG 1580 R++ ++T+RRELRRQQR+ SE++C++C+ KMLPG Sbjct: 287 EEVSSGHNSNSSRDLIADYDDSLMSKRVVKNRTVRRELRRQQRIFSERICEVCKQKMLPG 346 Query: 1581 KDVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXX 1760 KD A +LN+KTG LAC SRN+ GAFH+FH SC++HW L CE+E+ Sbjct: 347 KDAAAILNMKTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEIL--------------- 391 Query: 1761 XXXAKLSQLGKECRPDTYGVKRKQTRKGSEEGI--------VCDQICSVFCPDCQGTGLQ 1916 +S G K++ T+ + G+ V QI SVFCP+CQGTG+ Sbjct: 392 -GNKMVSGKG----------KKRCTKHSGQTGVKWNELANDVSWQIFSVFCPECQGTGIN 440 Query: 1917 IVGDELEKPTVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGDNA------E 2078 I G +E+ T PLS+ +++++K + + W+K PE+L+NCSTGF FP + E Sbjct: 441 IEGAVIERDTFPLSQTWRFQVKVSEGRKAWVKNPERLKNCSTGFHFPQQAEETEQIPVQE 500 Query: 2079 GKVSNLKLLHFYRAD 2123 +V +KL+ FYR + Sbjct: 501 ERVQMMKLVRFYRVE 515 >gb|EOX99407.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 481 Score = 429 bits (1102), Expect = e-117 Identities = 246/515 (47%), Positives = 305/515 (59%), Gaps = 25/515 (4%) Frame = +3 Query: 522 MAER-ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSD 698 MAER ELG + S ++KE+ AR TL NVR QGHTY++LREDGKRF+FFCTLCLAPCYSD Sbjct: 1 MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60 Query: 699 TVLFDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVV----PKVNLLD 866 +VL DHL G+LH RL AAK TL+G NPWPFNDGVLFF + +E+++ + LL+ Sbjct: 61 SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLE 120 Query: 867 DREPVDSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGKDE 1046 D+LAIV + G + +VN C GD +DL+IPGV+ KDE Sbjct: 121 FHNNDDNLAIVEYVGSEVSSYRKNVN--------CRAGD--------SDLLIPGVLIKDE 164 Query: 1047 ISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTF 1226 ISDL VR +GFGKI+AR E+DG + RIWCEWLGK D+ P HGFA+VTF Sbjct: 165 ISDLKVRFIGFGKIAARFCEKDGVLNEISRIWCEWLGKEVPRNDD-KLKAPKHGFAVVTF 223 Query: 1227 GYYYDLGKQGLFDEIKTLLLTGGEES-NDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXX 1403 Y DLG++GL D++K+LL +G +G+ ++++RKKSFSD ED L+ Q Sbjct: 224 VYNCDLGRKGLLDDVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQ-YDSSG 282 Query: 1404 XXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGK 1583 +R ISSK +RRELRRQQR+A+E+MCDICQ KMLP K Sbjct: 283 EDSSASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEK 342 Query: 1584 DVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXX 1763 DVATL+NL TG+L CSSRNV+GAFHVFHTSCLIHWILLCE E Sbjct: 343 DVATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRK 402 Query: 1764 XXAKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDELEKP 1943 AK + +GK+ G I SV CP+CQGTG+ + GDELEKP Sbjct: 403 NGAKSNDMGKDGETKATGT----------------LISSVLCPECQGTGIDVEGDELEKP 446 Query: 1944 TVPLSE-------------------MFKYKIKAID 1991 V LS+ MF+YKIK D Sbjct: 447 DVSLSQVCISDLKTIRCCCTRKLAGMFRYKIKVSD 481 >ref|XP_006283541.1| hypothetical protein CARUB_v10004592mg [Capsella rubella] gi|482552246|gb|EOA16439.1| hypothetical protein CARUB_v10004592mg [Capsella rubella] Length = 519 Score = 429 bits (1102), Expect = e-117 Identities = 239/550 (43%), Positives = 327/550 (59%), Gaps = 18/550 (3%) Frame = +3 Query: 528 ERELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDTVL 707 ++ELG KP+V N+KE+ AR TL+N+RLQGHTY++LREDGKRFVFFCTLCLAPCYSDT+L Sbjct: 4 KKELGLPKPTV-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSDTIL 62 Query: 708 FDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEE-EKVVPKVN-------LL 863 HL+GNLHKERL A+ TL+G NPWPF+DGVLFF++++ ++ E+ P ++ L Sbjct: 63 LGHLNGNLHKERLACARITLLGTNPWPFSDGVLFFDSSTGDDVEEEKPPISGGEDVPGQL 122 Query: 864 DDREPVDSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGKD 1043 + AIV ++ G N D C DN L+I GV+ K+ Sbjct: 123 QHCGEDERFAIVKYDNNRTNGDNQPAAVPDDEPNHC--ADN---------LLISGVLIKE 171 Query: 1044 EISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVT 1223 + D+ + +GFG+I+ARL E G + ++WCEWLG G + D + VP H FAIVT Sbjct: 172 KTLDVEAKFIGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPSDDE-KAVVPEHDFAIVT 230 Query: 1224 FGYYYDLGKQGLFDEIKTLL-LTGGEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXX 1400 F Y+Y+LG+ GL D+ LL + ES +G S ++RKKSFSD ED L Q Sbjct: 231 FSYFYNLGRLGLLDDPSRLLTCSQSVESGNGEDSCRKRKKSFSDPEDTSESLCNQYDSSE 290 Query: 1401 XXXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPG 1580 A R++ +KT+RRELRRQQR+ SE++C++C+ KMLPG Sbjct: 291 EVSSGLNSNSSRA-LVADYDDSFMSKRVVKNKTVRRELRRQQRIFSERICEVCKQKMLPG 349 Query: 1581 KDVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXX 1760 KD AT+LNLKTG L C SRN+ GAFH+FH SC++HW L CE+E+ Sbjct: 350 KDAATILNLKTGNLVCGSRNLLGAFHLFHVSCVVHWFLFCESEIL--------------- 394 Query: 1761 XXXAKLSQLGKE-CRPDTY--GVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDE 1931 +S GK+ C + GVK Q V QI SVFCP+CQGTG+ I G Sbjct: 395 -GGKMMSGKGKKRCTKHSVQTGVKWNQLAND-----VSWQIFSVFCPECQGTGINIEGGV 448 Query: 1932 LEKPTVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGDNA------EGKVSN 2093 +E+ T PLS+ +++++K + + W+K PE+LENCSTGF FP D + E +V Sbjct: 449 IERDTFPLSQTWRFQVKVSEGRKAWVKNPERLENCSTGFHFPQQPDESGQVRVQEERVQM 508 Query: 2094 LKLLHFYRAD 2123 +K++ FYR + Sbjct: 509 MKMVRFYRVE 518 >ref|XP_006412978.1| hypothetical protein EUTSA_v10024944mg [Eutrema salsugineum] gi|557114148|gb|ESQ54431.1| hypothetical protein EUTSA_v10024944mg [Eutrema salsugineum] Length = 514 Score = 428 bits (1101), Expect = e-117 Identities = 243/554 (43%), Positives = 329/554 (59%), Gaps = 20/554 (3%) Frame = +3 Query: 522 MAE-RELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSD 698 MAE +ELG K ++ ++KE+ AR TLRN+R QGHTY++LREDGKRFVFFCTLCLAPCYSD Sbjct: 1 MAESKELGLPKTAI-SLKEQLARTTLRNLRSQGHTYIELREDGKRFVFFCTLCLAPCYSD 59 Query: 699 TVLFDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVPKVNLLDDREP 878 +L HL+GNLHKERL A+ TL+GENPWPFNDGVLFF++++ EEEK L+ D E Sbjct: 60 AILLGHLNGNLHKERLSCARITLLGENPWPFNDGVLFFDSSTGEEEK-----TLISDGEG 114 Query: 879 V----------DSLAIVVHNGVSRT----GGNGHVNGDGDRLYDCYNGDNTDYGERVTDL 1016 V + AIV ++ +RT G N G D C +L Sbjct: 115 VTGPLHHCSDNERFAIVTYD-ENRTCESQGDNQPAAGIDDEPNHC-----------AENL 162 Query: 1017 VIPGVIGKDEISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCV 1196 VI ++ K++ D+ + +GFG+I+ARL E G + ++WCEWLG+ S D + V Sbjct: 163 VISNLLIKEKTLDVEAKFIGFGRIAARLFETKGRTTWIDKLWCEWLGE-ESPPDEEKATV 221 Query: 1197 PSHGFAIVTFGYYYDLGKQGLF-DEIKTLLLTGGEESNDGNGSTKRRKKSFSDSEDAGGL 1373 P H FAIVTF Y+Y+LG+ GL D + L L+ ES +G + ++RKKSFSD ED Sbjct: 222 PEHDFAIVTFSYFYNLGRLGLLADPSRLLTLSQSAESGNGEDNGRKRKKSFSDPEDTSES 281 Query: 1374 LTYQCXXXXXXXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCD 1553 L Q A R+I +K++RRELR+QQR+ S+++C+ Sbjct: 282 LCNQYDSSEEVSSARNSNSSRA-LIADYDDHLVNKRVIKNKSVRRELRKQQRIFSDRICE 340 Query: 1554 ICQHKMLPGKDVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXX 1733 +C+ KMLPGKD A +LN+KTG+LACSSRN GAFH+FH SC++HW L CE E+ Sbjct: 341 VCKQKMLPGKDAAAILNMKTGKLACSSRNRLGAFHLFHVSCVVHWFLFCETEIL------ 394 Query: 1734 XXXXXXXXXXXXAKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGL 1913 +S GK+ GVK + G V QI SVFCP+CQGTG+ Sbjct: 395 ----------GSKMVSGKGKKRCTKQSGVKWNEL-----VGDVSWQIFSVFCPECQGTGI 439 Query: 1914 QIVGDELEKPTVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGD----NAEG 2081 I GD +E+ T PLS+ +++ +K + + W+K PEKLENCSTGF FP + E Sbjct: 440 NIEGDVIERDTFPLSQTWRFGVKVSEGRKAWVKNPEKLENCSTGFHFPQQDEELVKGQED 499 Query: 2082 KVSNLKLLHFYRAD 2123 +V ++KL+ FYR + Sbjct: 500 RVQSMKLVRFYRVE 513 >ref|XP_003540357.1| PREDICTED: uncharacterized protein LOC100779572 isoform X1 [Glycine max] gi|571494415|ref|XP_006592839.1| PREDICTED: uncharacterized protein LOC100779572 isoform X2 [Glycine max] Length = 501 Score = 428 bits (1101), Expect = e-117 Identities = 258/545 (47%), Positives = 320/545 (58%), Gaps = 11/545 (2%) Frame = +3 Query: 534 ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDTVLFD 713 ELG K V N KE+ AR L+ VR QGH YV+LRE+GK+F++FCTLCLAPCYSD VLFD Sbjct: 6 ELGPPKSDVSNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSDDVLFD 65 Query: 714 HLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNN--SNEEEKVVPKVN--LLDDREPV 881 HL GNLHKERL AAK TL+G PWPFNDG++FF+ + S++E +V LL + Sbjct: 66 HLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSTESHKELEVADSYQNRLLKFNDND 125 Query: 882 DSLAIVVH-NGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGKDEISDL 1058 SLAIV +GV ++G D Y LVIP ++ DEI D+ Sbjct: 126 VSLAIVKFGDGVQSNAKPRSIDGMQDDEYA---------------LVIPNLLIGDEIFDV 170 Query: 1059 HVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTFGYYY 1238 VR +G GKI+AR LE+ A G++RIWCEWLGK S G+ V H FA+V F Y Y Sbjct: 171 KVREVGLGKIAARFLEKCHALNGIKRIWCEWLGKE-SNGERDGVEVLEHDFAVVIFAYNY 229 Query: 1239 DLGKQGLFDEIKTLL--LTGGEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXXXXX 1412 DLG+ GL D++ TLL +GG++ K S SD +D + Q Sbjct: 230 DLGRSGLLDDVNTLLPSASGGQKG----------KSSLSDFDDVSDSVCNQ--YDSSAEE 277 Query: 1413 XXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGKDVA 1592 + +R ISSK LR+ELRR+QR+A+EK+C+ICQ KMLPGKDVA Sbjct: 278 SSDSNNSSSRLTLDQFNNHLCTRFISSKALRKELRRKQRLAAEKVCNICQQKMLPGKDVA 337 Query: 1593 TLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXXXXA 1772 LLNLKT R+ACSSRN GAFHVFHTSCLIHWI+LCE E+ Sbjct: 338 ALLNLKTRRVACSSRNRTGAFHVFHTSCLIHWIILCEFEIITN----------------- 380 Query: 1773 KLSQLGKECRPDTYGVKRKQTRKGSEEGIVCD---QICSVFCPDCQGTGLQIVGDELEKP 1943 C VKRK G++ G D I +VFCP+CQGTG+ I GD +E+P Sbjct: 381 -----HLVCPNVRRVVKRKVASDGNKIGKEKDIGKHIRTVFCPECQGTGMIIDGDGVEQP 435 Query: 1944 TVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGDNA-EGKVSNLKLLHFYRA 2120 LS+MFK+KIKA DA R W+K PE L+NCSTGF FP + E KV + LLHFYRA Sbjct: 436 EFSLSQMFKFKIKACDARRDWIKSPEVLKNCSTGFHFPSQSEEIFEEKVEPINLLHFYRA 495 Query: 2121 DEDSW 2135 D+ SW Sbjct: 496 DDQSW 500 >ref|XP_003541913.1| PREDICTED: uncharacterized protein LOC100807746 [Glycine max] Length = 500 Score = 428 bits (1100), Expect = e-117 Identities = 255/544 (46%), Positives = 322/544 (59%), Gaps = 11/544 (2%) Frame = +3 Query: 534 ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSDTVLFD 713 ELG K + N KE+ AR L+ VR QGH YV+LRE+GK+F++FCTLCLAPCYSD VLFD Sbjct: 6 ELGPPKSDISNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSDDVLFD 65 Query: 714 HLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVVP----KVNLLDDREPV 881 HL GNLH+ERL AAK TL+G PWPFNDG++FF+ ++ ++++ + LL + Sbjct: 66 HLKGNLHRERLSAAKVTLLGPKPWPFNDGLVFFDTSTESDKELEVADSYRNRLLKFNDDD 125 Query: 882 DSLAIV-VHNGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGKDEISDL 1058 SLAIV GV + G D +C LVIP ++ DEI DL Sbjct: 126 SSLAIVKFGEGVQSNAKPCSIEGMQDD--EC-------------ALVIPNLLIGDEIFDL 170 Query: 1059 HVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTFGYYY 1238 V+ +G GKI+AR LE+ A G++RIWCEWLGK S G+ V H FA+V F Y Y Sbjct: 171 KVKEVGLGKIAARFLEKCHALNGIKRIWCEWLGK-ESNGERDGVEVLEHDFAVVIFAYNY 229 Query: 1239 DLGKQGLFDEIKTLL-LTGGEESNDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXXXXXX 1415 DLG+ GL D++KTLL ++ G++ K S SDS+D L Q Sbjct: 230 DLGRSGLLDDVKTLLPVSAGQKG----------KTSLSDSDDVSDFLCNQ--YDSSAEES 277 Query: 1416 XXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGKDVAT 1595 + +R ISSK LR+ELRR+QR+A+EK+C+ICQ KMLPGKDVA Sbjct: 278 SDSNNSSSRLTLDQFNNHLCTRFISSKALRKELRRKQRLAAEKVCNICQQKMLPGKDVAA 337 Query: 1596 LLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXXXXAK 1775 LLNLKT R+ACSSRN GAFHVFHTSCLIHWI+LCE E+ Sbjct: 338 LLNLKTRRVACSSRNRTGAFHVFHTSCLIHWIILCEFEII-------------------- 377 Query: 1776 LSQLGKECRPDTYG-VKRKQTRKGSEEGIVCD---QICSVFCPDCQGTGLQIVGDELEKP 1943 + RP+ VKRK G + G D I +VFCP+CQGTG+ I GD +E+P Sbjct: 378 ---INHLVRPNIRRVVKRKVASDGDKMGKEKDIGKHIRTVFCPECQGTGMIIDGDGVEQP 434 Query: 1944 TVPLSEMFKYKIKAIDAHRTWMKCPEKLENCSTGFTFPVPGDNA-EGKVSNLKLLHFYRA 2120 LS+MFK+KIKA DA R W+K PE L+NCSTGF FP + E KV + LLHFYRA Sbjct: 435 EFSLSQMFKFKIKACDARRDWIKSPEVLQNCSTGFHFPSQSEEIFEEKVEPINLLHFYRA 494 Query: 2121 DEDS 2132 D+ S Sbjct: 495 DDQS 498 >gb|EOX99409.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 478 Score = 427 bits (1098), Expect = e-116 Identities = 241/495 (48%), Positives = 301/495 (60%), Gaps = 6/495 (1%) Frame = +3 Query: 522 MAER-ELGFLKPSVPNVKEKFARMTLRNVRLQGHTYVDLREDGKRFVFFCTLCLAPCYSD 698 MAER ELG + S ++KE+ AR TL NVR QGHTY++LREDGKRF+FFCTLCLAPCYSD Sbjct: 1 MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60 Query: 699 TVLFDHLSGNLHKERLFAAKATLIGENPWPFNDGVLFFNNNSNEEEKVV----PKVNLLD 866 +VL DHL G+LH RL AAK TL+G NPWPFNDGVLFF + +E+++ + LL+ Sbjct: 61 SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLE 120 Query: 867 DREPVDSLAIVVHNGVSRTGGNGHVNGDGDRLYDCYNGDNTDYGERVTDLVIPGVIGKDE 1046 D+LAIV + G + +VN C GD +DL+IPGV+ KDE Sbjct: 121 FHNNDDNLAIVEYVGSEVSSYRKNVN--------CRAGD--------SDLLIPGVLIKDE 164 Query: 1047 ISDLHVRLMGFGKISARLLERDGACAGMQRIWCEWLGKMGSAGDNISSCVPSHGFAIVTF 1226 ISDL VR +GFGKI+AR E+DG + RIWCEWLGK D+ P HGFA+VTF Sbjct: 165 ISDLKVRFIGFGKIAARFCEKDGVLNEISRIWCEWLGKEVPRNDD-KLKAPKHGFAVVTF 223 Query: 1227 GYYYDLGKQGLFDEIKTLLLTGGEES-NDGNGSTKRRKKSFSDSEDAGGLLTYQCXXXXX 1403 Y DLG++GL D++K+LL +G +G+ ++++RKKSFSD ED L+ Q Sbjct: 224 VYNCDLGRKGLLDDVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQ-YDSSG 282 Query: 1404 XXXXXXXXXXXAXXXXXXXXXXXXSRIISSKTLRRELRRQQRVASEKMCDICQHKMLPGK 1583 +R ISSK +RRELRRQQR+A+E+MCDICQ KMLP K Sbjct: 283 EDSSASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEK 342 Query: 1584 DVATLLNLKTGRLACSSRNVHGAFHVFHTSCLIHWILLCENEMFMKPPXXXXXXXXXXXX 1763 DVATL+NL TG+L CSSRNV+GAFHVFHTSCLIHWILLCE E Sbjct: 343 DVATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRK 402 Query: 1764 XXAKLSQLGKECRPDTYGVKRKQTRKGSEEGIVCDQICSVFCPDCQGTGLQIVGDELEKP 1943 AK + +GK+ G I SV CP+CQGTG+ + GDELEKP Sbjct: 403 NGAKSNDMGKDGETKATGT----------------LISSVLCPECQGTGIDVEGDELEKP 446 Query: 1944 TVPLSEMFKYKIKAI 1988 V LS++ +K I Sbjct: 447 DVSLSQVCISDLKTI 461