BLASTX nr result
ID: Ephedra25_contig00007401
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00007401 (2424 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score  E
Sequences producing significant alignments:                        (bits) Value

gb|EOY14987.1| Uncharacterized protein isoform 8, partial [Theob...  265  5e-68
gb|EOY14986.1| Uncharacterized protein isoform 7 [Theobroma cacao]   265  5e-68
gb|EOY14984.1| Uncharacterized protein isoform 5 [Theobroma cacao]   265  5e-68
gb|EOY14982.1| Uncharacterized protein isoform 3 [Theobroma cacao]   265  5e-68
gb|EOY14981.1| Uncharacterized protein isoform 2, partial [Theob...  265  5e-68
gb|EOY14980.1| Uncharacterized protein isoform 1 [Theobroma cacao]   265  5e-68
ref|XP_004291383.1| PREDICTED: filament-like plant protein 6-lik...  262  6e-67
ref|XP_002306918.2| hypothetical protein POPTR_0005s25830g [Popu...  259  5e-66
gb|EMJ26698.1| hypothetical protein PRUPE_ppa000819mg [Prunus pe...  258  6e-66
emb|CAN60525.1| hypothetical protein VITISV_000522 [Vitis vinifera]  256  4e-65
gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis]    254  1e-64
ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citr...  253  2e-64
ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-lik...  253  3e-64
ref|XP_006577974.1| PREDICTED: filament-like plant protein 4-lik...  253  4e-64
ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Popu...  249  3e-63
ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Popu...  248  7e-63
ref|XP_006601345.1| PREDICTED: filament-like plant protein 6-lik...  247  2e-62
ref|XP_006581178.1| PREDICTED: filament-like plant protein 4-lik...  247  2e-62
ref|XP_006581177.1| PREDICTED: filament-like plant protein 4-lik...
247 2e-62 gb|ESW08071.1| hypothetical protein PHAVU_009G015700g [Phaseolus... 247 2e-62 >gb|EOY14987.1| Uncharacterized protein isoform 8, partial [Theobroma cacao] Length = 951 Score = 265 bits (678), Expect = 5e-68 Identities = 214/698 (30%), Positives = 354/698 (50%), Gaps = 51/698 (7%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K L+EK S A S+ K+ L+KQH K AEEA++GWE ++ EA+ALK + + KL Sbjct: 81 EIKDLNEKLSAADSEISTKEDLVKQHTKVAEEAVSGWEKAEAEALALKNHLESVTLLKLT 140 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALKEC RQ+R++KEEHE+K+ + V K + +K++ EL+++I L++ +L Sbjct: 141 AEDRASHLDGALKECMRQIRNLKEEHEQKLQDVVISKNKQCEKIRLELEAKIANLDQELL 200 Query: 1771 ESKVNQSADLDAMQESSKLV--NSKVRQQPESD-RHVQDKVGLLESENAALKQKVNSLSK 1601 +S+ +A ++QE + ++ S+ + Q E++ H++ + E E +LK +++ +SK Sbjct: 201 KSEAENAAITRSLQERANMLIKISEEKAQAEAEIEHLKGNIESCEREINSLKYELHVVSK 260 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ + +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 261 ELEIRNEEKNMSMRSAEVANKQHMEGVKKITKLEAECQRLRGLVRKKLPGPAALAQMKLE 320 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGD---MQKEIETLKETLSAMEDXXXXX 1271 E D G+ + ++S + S D L + QKE E L E L AME+ Sbjct: 321 VESLGRDYGDTRLRRSPVRPSTPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKML 380 Query: 1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSN 1103 ASR CA+T++KL ++E QL I ++ + + E S++ SN Sbjct: 381 KEALAKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSN 440 Query: 1102 LQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 S+ SV DG+ D CA+SWA+AL++EL QFKK K K + + ++DLMDDFLEM Sbjct: 441 PPSVTSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEM 500 Query: 928 ERLASMPSSKMVESKIKEWSETN--LDHSL-GDLEEVLIQKEQQLSEANRNCSDLSRDLA 758 E+LA + I TN + S+ GD + KE Q + + LS + Sbjct: 501 EKLACSSNDSTANGTITISDSTNNKISESVNGDASGEISCKELQSEKQH----VLSPSVN 556 Query: 757 FVSET--LEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR---------- 614 VS L + + ++L + ++ 
++L++ + D+ +L+DI+ Sbjct: 557 QVSSNMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTL 616 Query: 613 ---SARAVTEETDSDTSLCI-KPH-----MITESKFSSSQNK------VNIIDIELATAI 479 S V+EE CI + H + E + + S V + ELA AI Sbjct: 617 CEHSVNGVSEEVHGSDGTCIGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAI 676 Query: 478 SSVVNFVQYMIQQSRHKVTD--SQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFL 305 S + +FV + +++R V D S ++++ KI F+ ++V+ +T + +L++ L Sbjct: 677 SQIHDFVLSLGKEAR-AVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTIL 735 Query: 304 AVVRTLSSQDTNTAASADYNGGLNVKSDKDIVNIPEKK 191 A L N D +N D V +PE K Sbjct: 736 AKASDLR---VNVLGYKDNEEEINSPDCIDKVVLPENK 770 >gb|EOY14986.1| Uncharacterized protein isoform 7 [Theobroma cacao] Length = 1107 Score = 265 bits (678), Expect = 5e-68 Identities = 214/698 (30%), Positives = 354/698 (50%), Gaps = 51/698 (7%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K L+EK S A S+ K+ L+KQH K AEEA++GWE ++ EA+ALK + + KL Sbjct: 85 EIKDLNEKLSAADSEISTKEDLVKQHTKVAEEAVSGWEKAEAEALALKNHLESVTLLKLT 144 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALKEC RQ+R++KEEHE+K+ + V K + +K++ EL+++I L++ +L Sbjct: 145 AEDRASHLDGALKECMRQIRNLKEEHEQKLQDVVISKNKQCEKIRLELEAKIANLDQELL 204 Query: 1771 ESKVNQSADLDAMQESSKLV--NSKVRQQPESD-RHVQDKVGLLESENAALKQKVNSLSK 1601 +S+ +A ++QE + ++ S+ + Q E++ H++ + E E +LK +++ +SK Sbjct: 205 KSEAENAAITRSLQERANMLIKISEEKAQAEAEIEHLKGNIESCEREINSLKYELHVVSK 264 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ + +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 265 ELEIRNEEKNMSMRSAEVANKQHMEGVKKITKLEAECQRLRGLVRKKLPGPAALAQMKLE 324 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGD---MQKEIETLKETLSAMEDXXXXX 1271 E D G+ + ++S + S D L + QKE E L E L AME+ Sbjct: 325 VESLGRDYGDTRLRRSPVRPSTPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKML 384 Query: 1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSN 1103 ASR CA+T++KL ++E QL I ++ + + E S++ SN Sbjct: 385 
KEALAKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSN 444 Query: 1102 LQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 S+ SV DG+ D CA+SWA+AL++EL QFKK K K + + ++DLMDDFLEM Sbjct: 445 PPSVTSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEM 504 Query: 928 ERLASMPSSKMVESKIKEWSETN--LDHSL-GDLEEVLIQKEQQLSEANRNCSDLSRDLA 758 E+LA + I TN + S+ GD + KE Q + + LS + Sbjct: 505 EKLACSSNDSTANGTITISDSTNNKISESVNGDASGEISCKELQSEKQH----VLSPSVN 560 Query: 757 FVSET--LEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR---------- 614 VS L + + ++L + ++ ++L++ + D+ +L+DI+ Sbjct: 561 QVSSNMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTL 620 Query: 613 ---SARAVTEETDSDTSLCI-KPH-----MITESKFSSSQNK------VNIIDIELATAI 479 S V+EE CI + H + E + + S V + ELA AI Sbjct: 621 CEHSVNGVSEEVHGSDGTCIGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAI 680 Query: 478 SSVVNFVQYMIQQSRHKVTD--SQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFL 305 S + +FV + +++R V D S ++++ KI F+ ++V+ +T + +L++ L Sbjct: 681 SQIHDFVLSLGKEAR-AVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTIL 739 Query: 304 AVVRTLSSQDTNTAASADYNGGLNVKSDKDIVNIPEKK 191 A L N D +N D V +PE K Sbjct: 740 AKASDLR---VNVLGYKDNEEEINSPDCIDKVVLPENK 774 >gb|EOY14984.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 992 Score = 265 bits (678), Expect = 5e-68 Identities = 214/698 (30%), Positives = 354/698 (50%), Gaps = 51/698 (7%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K L+EK S A S+ K+ L+KQH K AEEA++GWE ++ EA+ALK + + KL Sbjct: 81 EIKDLNEKLSAADSEISTKEDLVKQHTKVAEEAVSGWEKAEAEALALKNHLESVTLLKLT 140 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALKEC RQ+R++KEEHE+K+ + V K + +K++ EL+++I L++ +L Sbjct: 141 AEDRASHLDGALKECMRQIRNLKEEHEQKLQDVVISKNKQCEKIRLELEAKIANLDQELL 200 Query: 1771 ESKVNQSADLDAMQESSKLV--NSKVRQQPESD-RHVQDKVGLLESENAALKQKVNSLSK 1601 +S+ +A ++QE + ++ S+ + Q E++ H++ + E E +LK +++ +SK Sbjct: 201 KSEAENAAITRSLQERANMLIKISEEKAQAEAEIEHLKGNIESCEREINSLKYELHVVSK 260 Query: 1600 
EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ + +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 261 ELEIRNEEKNMSMRSAEVANKQHMEGVKKITKLEAECQRLRGLVRKKLPGPAALAQMKLE 320 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGD---MQKEIETLKETLSAMEDXXXXX 1271 E D G+ + ++S + S D L + QKE E L E L AME+ Sbjct: 321 VESLGRDYGDTRLRRSPVRPSTPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKML 380 Query: 1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSN 1103 ASR CA+T++KL ++E QL I ++ + + E S++ SN Sbjct: 381 KEALAKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSN 440 Query: 1102 LQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 S+ SV DG+ D CA+SWA+AL++EL QFKK K K + + ++DLMDDFLEM Sbjct: 441 PPSVTSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEM 500 Query: 928 ERLASMPSSKMVESKIKEWSETN--LDHSL-GDLEEVLIQKEQQLSEANRNCSDLSRDLA 758 E+LA + I TN + S+ GD + KE Q + + LS + Sbjct: 501 EKLACSSNDSTANGTITISDSTNNKISESVNGDASGEISCKELQSEKQH----VLSPSVN 556 Query: 757 FVSET--LEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR---------- 614 VS L + + ++L + ++ ++L++ + D+ +L+DI+ Sbjct: 557 QVSSNMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTL 616 Query: 613 ---SARAVTEETDSDTSLCI-KPH-----MITESKFSSSQNK------VNIIDIELATAI 479 S V+EE CI + H + E + + S V + ELA AI Sbjct: 617 CEHSVNGVSEEVHGSDGTCIGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAI 676 Query: 478 SSVVNFVQYMIQQSRHKVTD--SQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFL 305 S + +FV + +++R V D S ++++ KI F+ ++V+ +T + +L++ L Sbjct: 677 SQIHDFVLSLGKEAR-AVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTIL 735 Query: 304 AVVRTLSSQDTNTAASADYNGGLNVKSDKDIVNIPEKK 191 A L N D +N D V +PE K Sbjct: 736 AKASDLR---VNVLGYKDNEEEINSPDCIDKVVLPENK 770 >gb|EOY14982.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1106 Score = 265 bits (678), Expect = 5e-68 Identities = 214/698 (30%), Positives = 354/698 (50%), Gaps = 51/698 (7%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K L+EK S A S+ K+ L+KQH K AEEA++GWE ++ 
EA+ALK + + KL Sbjct: 85 EIKDLNEKLSAADSEISTKEDLVKQHTKVAEEAVSGWEKAEAEALALKNHLESVTLLKLT 144 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALKEC RQ+R++KEEHE+K+ + V K + +K++ EL+++I L++ +L Sbjct: 145 AEDRASHLDGALKECMRQIRNLKEEHEQKLQDVVISKNKQCEKIRLELEAKIANLDQELL 204 Query: 1771 ESKVNQSADLDAMQESSKLV--NSKVRQQPESD-RHVQDKVGLLESENAALKQKVNSLSK 1601 +S+ +A ++QE + ++ S+ + Q E++ H++ + E E +LK +++ +SK Sbjct: 205 KSEAENAAITRSLQERANMLIKISEEKAQAEAEIEHLKGNIESCEREINSLKYELHVVSK 264 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ + +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 265 ELEIRNEEKNMSMRSAEVANKQHMEGVKKITKLEAECQRLRGLVRKKLPGPAALAQMKLE 324 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGD---MQKEIETLKETLSAMEDXXXXX 1271 E D G+ + ++S + S D L + QKE E L E L AME+ Sbjct: 325 VESLGRDYGDTRLRRSPVRPSTPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKML 384 Query: 1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSN 1103 ASR CA+T++KL ++E QL I ++ + + E S++ SN Sbjct: 385 KEALAKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSN 444 Query: 1102 LQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 S+ SV DG+ D CA+SWA+AL++EL QFKK K K + + ++DLMDDFLEM Sbjct: 445 PPSVTSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEM 504 Query: 928 ERLASMPSSKMVESKIKEWSETN--LDHSL-GDLEEVLIQKEQQLSEANRNCSDLSRDLA 758 E+LA + I TN + S+ GD + KE Q + + LS + Sbjct: 505 EKLACSSNDSTANGTITISDSTNNKISESVNGDASGEISCKELQSEKQH----VLSPSVN 560 Query: 757 FVSET--LEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR---------- 614 VS L + + ++L + ++ ++L++ + D+ +L+DI+ Sbjct: 561 QVSSNMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTL 620 Query: 613 ---SARAVTEETDSDTSLCI-KPH-----MITESKFSSSQNK------VNIIDIELATAI 479 S V+EE CI + H + E + + S V + ELA AI Sbjct: 621 CEHSVNGVSEEVHGSDGTCIGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAI 680 Query: 478 SSVVNFVQYMIQQSRHKVTD--SQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFL 305 S + +FV + +++R V D S ++++ KI F+ ++V+ +T + +L++ L Sbjct: 681 
SQIHDFVLSLGKEAR-AVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTIL 739 Query: 304 AVVRTLSSQDTNTAASADYNGGLNVKSDKDIVNIPEKK 191 A L N D +N D V +PE K Sbjct: 740 AKASDLR---VNVLGYKDNEEEINSPDCIDKVVLPENK 774 >gb|EOY14981.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 992 Score = 265 bits (678), Expect = 5e-68 Identities = 214/698 (30%), Positives = 354/698 (50%), Gaps = 51/698 (7%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K L+EK S A S+ K+ L+KQH K AEEA++GWE ++ EA+ALK + + KL Sbjct: 81 EIKDLNEKLSAADSEISTKEDLVKQHTKVAEEAVSGWEKAEAEALALKNHLESVTLLKLT 140 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALKEC RQ+R++KEEHE+K+ + V K + +K++ EL+++I L++ +L Sbjct: 141 AEDRASHLDGALKECMRQIRNLKEEHEQKLQDVVISKNKQCEKIRLELEAKIANLDQELL 200 Query: 1771 ESKVNQSADLDAMQESSKLV--NSKVRQQPESD-RHVQDKVGLLESENAALKQKVNSLSK 1601 +S+ +A ++QE + ++ S+ + Q E++ H++ + E E +LK +++ +SK Sbjct: 201 KSEAENAAITRSLQERANMLIKISEEKAQAEAEIEHLKGNIESCEREINSLKYELHVVSK 260 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ + +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 261 ELEIRNEEKNMSMRSAEVANKQHMEGVKKITKLEAECQRLRGLVRKKLPGPAALAQMKLE 320 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGD---MQKEIETLKETLSAMEDXXXXX 1271 E D G+ + ++S + S D L + QKE E L E L AME+ Sbjct: 321 VESLGRDYGDTRLRRSPVRPSTPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKML 380 Query: 1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSN 1103 ASR CA+T++KL ++E QL I ++ + + E S++ SN Sbjct: 381 KEALAKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSN 440 Query: 1102 LQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 S+ SV DG+ D CA+SWA+AL++EL QFKK K K + + ++DLMDDFLEM Sbjct: 441 PPSVTSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEM 500 Query: 928 ERLASMPSSKMVESKIKEWSETN--LDHSL-GDLEEVLIQKEQQLSEANRNCSDLSRDLA 758 E+LA + I TN + S+ GD + KE Q + + LS + Sbjct: 501 
EKLACSSNDSTANGTITISDSTNNKISESVNGDASGEISCKELQSEKQH----VLSPSVN 556 Query: 757 FVSET--LEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR---------- 614 VS L + + ++L + ++ ++L++ + D+ +L+DI+ Sbjct: 557 QVSSNMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTL 616 Query: 613 ---SARAVTEETDSDTSLCI-KPH-----MITESKFSSSQNK------VNIIDIELATAI 479 S V+EE CI + H + E + + S V + ELA AI Sbjct: 617 CEHSVNGVSEEVHGSDGTCIGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAI 676 Query: 478 SSVVNFVQYMIQQSRHKVTD--SQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFL 305 S + +FV + +++R V D S ++++ KI F+ ++V+ +T + +L++ L Sbjct: 677 SQIHDFVLSLGKEAR-AVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTIL 735 Query: 304 AVVRTLSSQDTNTAASADYNGGLNVKSDKDIVNIPEKK 191 A L N D +N D V +PE K Sbjct: 736 AKASDLR---VNVLGYKDNEEEINSPDCIDKVVLPENK 770 >gb|EOY14980.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1102 Score = 265 bits (678), Expect = 5e-68 Identities = 214/698 (30%), Positives = 354/698 (50%), Gaps = 51/698 (7%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K L+EK S A S+ K+ L+KQH K AEEA++GWE ++ EA+ALK + + KL Sbjct: 81 EIKDLNEKLSAADSEISTKEDLVKQHTKVAEEAVSGWEKAEAEALALKNHLESVTLLKLT 140 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALKEC RQ+R++KEEHE+K+ + V K + +K++ EL+++I L++ +L Sbjct: 141 AEDRASHLDGALKECMRQIRNLKEEHEQKLQDVVISKNKQCEKIRLELEAKIANLDQELL 200 Query: 1771 ESKVNQSADLDAMQESSKLV--NSKVRQQPESD-RHVQDKVGLLESENAALKQKVNSLSK 1601 +S+ +A ++QE + ++ S+ + Q E++ H++ + E E +LK +++ +SK Sbjct: 201 KSEAENAAITRSLQERANMLIKISEEKAQAEAEIEHLKGNIESCEREINSLKYELHVVSK 260 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ + +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 261 ELEIRNEEKNMSMRSAEVANKQHMEGVKKITKLEAECQRLRGLVRKKLPGPAALAQMKLE 320 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGD---MQKEIETLKETLSAMEDXXXXX 1271 E D G+ + ++S + S D L + QKE E L E L AME+ Sbjct: 321 VESLGRDYGDTRLRRSPVRPSTPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKML 380 Query: 
1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSN 1103 ASR CA+T++KL ++E QL I ++ + + E S++ SN Sbjct: 381 KEALAKRNSELLASRNLCAKTSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSN 440 Query: 1102 LQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 S+ SV DG+ D CA+SWA+AL++EL QFKK K K + + ++DLMDDFLEM Sbjct: 441 PPSVTSVSEDGNDDDRSCAESWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEM 500 Query: 928 ERLASMPSSKMVESKIKEWSETN--LDHSL-GDLEEVLIQKEQQLSEANRNCSDLSRDLA 758 E+LA + I TN + S+ GD + KE Q + + LS + Sbjct: 501 EKLACSSNDSTANGTITISDSTNNKISESVNGDASGEISCKELQSEKQH----VLSPSVN 556 Query: 757 FVSET--LEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR---------- 614 VS L + + ++L + ++ ++L++ + D+ +L+DI+ Sbjct: 557 QVSSNMDLSVVYPESDADQLPVMKLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTL 616 Query: 613 ---SARAVTEETDSDTSLCI-KPH-----MITESKFSSSQNK------VNIIDIELATAI 479 S V+EE CI + H + E + + S V + ELA AI Sbjct: 617 CEHSVNGVSEEVHGSDGTCIGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAI 676 Query: 478 SSVVNFVQYMIQQSRHKVTD--SQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFL 305 S + +FV + +++R V D S ++++ KI F+ ++V+ +T + +L++ L Sbjct: 677 SQIHDFVLSLGKEAR-AVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTIL 735 Query: 304 AVVRTLSSQDTNTAASADYNGGLNVKSDKDIVNIPEKK 191 A L N D +N D V +PE K Sbjct: 736 AKASDLR---VNVLGYKDNEEEINSPDCIDKVVLPENK 770 >ref|XP_004291383.1| PREDICTED: filament-like plant protein 6-like [Fragaria vesca subsp. 
vesca] Length = 1091 Score = 262 bits (669), Expect = 6e-67 Identities = 210/687 (30%), Positives = 353/687 (51%), Gaps = 40/687 (5%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 ++ L+E+ S A S+ ++GL+KQHAK AEEA++GWE ++ EA+ALK + + KL Sbjct: 96 QITDLNEQLSTAQSEISTQEGLVKQHAKVAEEAVSGWEKAEAEALALKTHLESVTLLKLT 155 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALKEC RQ+R++KE+HE+K+ E V KT + DK+K EL++RI L++ +L Sbjct: 156 AEDRASHLDGALKECMRQIRNLKEDHEQKLQEVVITKTKQCDKIKHELETRIANLDQELL 215 Query: 1771 ESKVNQSADLDAMQESSKLVNSKVRQQPESDRHVQDKVGLLES---ENAALKQKVNSLSK 1601 S +A ++QE S ++ ++ +++ ++ LES E +LK +++ +K Sbjct: 216 RSAAENAAISRSLQERSNMLYKINEEKSQAEAEIERFKSNLESCEREINSLKYELHIAAK 275 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 276 ELEIRTEEKNMSVRSADAANKQHMEGVKKITKLEAECQRLRGLVRKKLPGPAALAQMKLE 335 Query: 1441 KEVGENDAGELKQKKSFGKGSISS----TELTQDKYLGDMQKEIETLKETLSAMEDXXXX 1274 E D GE + K+S K S TE + D + QKE E L E L AME+ Sbjct: 336 VESLGRDYGETRLKRSPVKPSSPQMSQVTEFSLDN-VQKFQKENEFLTERLLAMEEETKM 394 Query: 1273 XXXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPS 1106 +ASR+ CA+T +KL ++E QL+I ++ S ++++ E S S + S Sbjct: 395 LKEALSKRNSELQASRSICAKTVSKLQTLEAQLQITGQQKGSPKSVVHISTEGSLSRNAS 454 Query: 1105 NLQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLE 932 S S+ DG+ D CA+SW + L ++L KK K K + + +++LMDDFLE Sbjct: 455 IPPSFASMSEDGNDDDRSCAESWGTTLNSDLSHSKKEKNNEKSSKAENQNHLNLMDDFLE 514 Query: 931 MERLASMPSSKMVESKIKEWSETNLDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDLAFV 752 ME+LA +P+ +S + SE ++ + G++ +Q EA+ N DL+ + Sbjct: 515 MEKLACLPN----DSNGVKTSEIEINEASGEVTATKDIHSEQQHEASFN-----GDLSVL 565 Query: 751 SETLEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR-------------S 611 S + N+L L ++ ++L+ + D +++DI+ + Sbjct: 566 SPGANE-------NKLPLVKLRSRISVLLELLSKDTDFVKVIEDIKHVVQEAQDALQPHT 618 Query: 610 
ARAVTEETDSDTSLCIKPHMITESKFS-----SSQNKVNIIDIELATAISSVVNFVQYMI 446 +V+EE S ++C +S FS +++ ++ I ELA+AIS + +FV ++ Sbjct: 619 VNSVSEEIHSADAICDTQAHPEDSVFSTEKETTAKETMSAISEELASAISLIHDFVVFLG 678 Query: 445 QQ--SRHKVTDSQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFLAVVRTLSSQDT 272 ++ H T ++++ KI F+ +V+HG + L+ +L+ LA S Sbjct: 679 KEVVGVHD-TFPDSNELSQKIEEFSGTFSKVIHGNLSLVDLVLDLSHVLA---NASELKF 734 Query: 271 NTAASADYNGGLNVKSDKDIVNIPEKK 191 N G N D V +PE K Sbjct: 735 NVIGFPGVEAGRNSPDCIDKVALPENK 761 >ref|XP_002306918.2| hypothetical protein POPTR_0005s25830g [Populus trichocarpa] gi|550339754|gb|EEE93914.2| hypothetical protein POPTR_0005s25830g [Populus trichocarpa] Length = 1077 Score = 259 bits (661), Expect = 5e-66 Identities = 204/661 (30%), Positives = 338/661 (51%), Gaps = 46/661 (6%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K L+EK S S+ K+ L+KQHAK AEEA++GWE ++ EA+ALK + + KL Sbjct: 71 EIKDLNEKLSATHSEMTTKENLVKQHAKVAEEAVSGWEKAEAEALALKNHLESVTLSKLT 130 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALKEC RQ+R++KEEHE+++ E V K + DK+K + +++I L++ +L Sbjct: 131 AEDRASHLDGALKECMRQIRNLKEEHEQRVQEIVLNKNKQLDKIKMDFEAKIATLDQELL 190 Query: 1771 ESKVNQSADLDAMQESSKLV--NSKVRQQPESD-RHVQDKVGLLESENAALKQKVNSLSK 1601 S +A ++QE S ++ S+ + Q E++ H++ + E E + K +++ +SK Sbjct: 191 RSAAENAALSRSLQEHSNMLIKISEEKSQAEAEIEHLKSNIESCEREINSHKYELHVISK 250 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ +KQ+ E KK +K ++ RGL K+ A LA +K + Sbjct: 251 ELEIRNEEKNMSIRSAEAANKQHMEGVKKVAKLESECQRLRGLVRKKLPGPAALAQMKLE 310 Query: 1441 KEVGENDAGELKQKKSFGK----GSISSTELTQDKYLGDMQKEIETLKETLSAMEDXXXX 1274 E D G+ + ++S K S S TE + D + KE E L E L AME+ Sbjct: 311 VESLGRDYGDSRLRRSPVKPPSPHSSSVTEFSLDN-VQKFHKENEFLTERLFAMEEETKM 369 Query: 1273 XXXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPS 1106 +ASR CA+TA+KL S+E Q I + S + + E S++ S Sbjct: 370 LKEALAKRNSELQASRNLCAKTASKLQSLEAQFHISNQVKSSPKSIIQVPAEGYSSQNIS 429 Query: 
1105 NLQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLE 932 N S+ +V DG+ D+ CADSWA+ I+E FKK + K+ + + +++ MDDFLE Sbjct: 430 NPPSLTNVSEDGNDDTQSCADSWATISISEFSNFKKYNHSEKLNKAENAKHLEFMDDFLE 489 Query: 931 MERLASMPSSKMVESKIKEWSETNLDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDLAFV 752 ME+LA + + + ++T+ + E+ +QKE LSE N L+ Sbjct: 490 MEKLACLNADSAATTSNSPNNKTSEVANRDASGEISLQKENTLSEEKHNLDPPVNHLSCN 549 Query: 751 SETLEQLKARNSWNELSLASFHE---KMELILKAEDEGGDLHGLLKDIR--------SAR 605 ++ A S ++ L+SF + ++ ++L + + DL +L+DI+ A Sbjct: 550 KDS----SAIESGSDADLSSFMKLQLRISMLLDSGSKKADLGKILEDIKQVVQDAETGAS 605 Query: 604 AVTEE------TDSDTSLCIKPHMITESK----FSSSQNKVNI---IDIELATAISSVVN 464 V++E T D C + I K F S+ I + EL AIS + + Sbjct: 606 CVSKEAHCSDATTHDRQTCPEDAGIMGEKEIELFQESKTAAQIMHTVSQELLPAISQIHD 665 Query: 463 FVQYMIQQSRHKVTDSQLHKVNL--KISGFAELVDQVMHGTAKVTQLLAELASFLAVVRT 290 FV ++ + V D+ + L KI F+ ++V++ + +++LA LA+ Sbjct: 666 FV-LLLGKEAMTVHDTSCDSIGLSQKIKEFSITFNKVLYSDRSLVDFVSDLAHILALASG 724 Query: 289 L 287 L Sbjct: 725 L 725 >gb|EMJ26698.1| hypothetical protein PRUPE_ppa000819mg [Prunus persica] Length = 993 Score = 258 bits (660), Expect = 6e-66 Identities = 200/658 (30%), Positives = 336/658 (51%), Gaps = 47/658 (7%) Frame = -3 Query: 2119 LDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLALENE 1940 L+EK S A ++ K+ L+KQH K AEEA++GWE ++ EA+ALK + + KL E+ Sbjct: 3 LNEKLSAANTEMTNKESLVKQHTKVAEEAVSGWEKAEAEALALKTHLESVTLLKLTAEDR 62 Query: 1939 VSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVLESKV 1760 SHLDGALKEC RQ+R++KE+HE+K+ E V KT + +K+K EL+++I L++ +L S Sbjct: 63 ASHLDGALKECMRQIRNLKEDHEQKLQEVVFSKTKQCEKIKLELEAKISNLDQELLRSAA 122 Query: 1759 NQSADLDAMQESSKL---VNSKVRQQPESDRHVQDKVGLLESENAALKQKVNSLSKEMDR 1589 +A ++QE S + +N + Q + + E E +LK +++ SKE++ Sbjct: 123 ENAAISRSLQERSNMLFKINEEKSQAEAEIELFKSNIESCEREINSLKYELHLASKELEI 182 Query: 1588 ILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQDKEVG 1430 EK+ S +S+ +KQ+ E KK +K + RGL K+ A LA +K + E Sbjct: 183 
RNEEKDMSMRSAEAANKQHMEGVKKIAKLEAECQRLRGLVRKKLPGPAALAQMKLEVESL 242 Query: 1429 ENDAGELKQKKSFGKGSISSTELTQDKYLGDMQ---KEIETLKETLSAMEDXXXXXXXXX 1259 D GE + ++S K S + L ++Q KE E L E L AME+ Sbjct: 243 GRDYGETRLRRSPVKPSSPHMSPVTEFSLDNVQKFHKENEFLTERLLAMEEETKMLKEAL 302 Query: 1258 XXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSNLQSM 1091 + SR CA+T +KL ++E QL+I ++ S + +T E S S++ SN S+ Sbjct: 303 TKRNSELQTSRGMCAQTVSKLQTLEAQLQINNQQKGSPKSVVQITTEGSSSQNASNPPSL 362 Query: 1090 KSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEMERLA 917 S+ DG+ D CA+SWA+ L ++L +K K K + + +++LMDDFLEME+LA Sbjct: 363 TSLSEDGNDDDRSCAESWATTLGSDLSHIRKEKSNQKSNKAENQNHLNLMDDFLEMEKLA 422 Query: 916 SMPSSKMVESKI-----KEWSETNLDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDLAFV 752 +P+ I + SE + GD V +K+ Q SE ++ S L D A Sbjct: 423 CLPNDSNGAVSISSGPNNKTSERENHDASGD---VTAEKDIQ-SEQQQDLSPLEGDQASS 478 Query: 751 SETLEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR-------------S 611 + L L + N+L L K+ ++L+ + D +++DI+ + Sbjct: 479 NVKLSGLSPESDENQLPLVKLRSKISMLLELLSKDTDFGKVIEDIKHVVQEAQDTLHPHT 538 Query: 610 ARAVTEETDSDTSLCIK------PHMITESKFSSSQ---NKVNIIDIELATAISSVVNFV 458 ++EE S ++C + + TE + + SQ + ++ +LA+AIS + +FV Sbjct: 539 VNCISEEVHSSDAICDRQANPEDSRLTTEKEITLSQPARGTMELMSEDLASAISLINDFV 598 Query: 457 QYMIQQSRH-KVTDSQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFLAVVRTL 287 ++ ++ T ++++ KI F+ ++ +HG + + L+ LA V L Sbjct: 599 LFLGKEVMGVHDTFPDGNELSHKIEEFSGAFNKAIHGNLSLADFVLGLSHVLANVGEL 656 >emb|CAN60525.1| hypothetical protein VITISV_000522 [Vitis vinifera] Length = 1085 Score = 256 bits (653), Expect = 4e-65 Identities = 201/692 (29%), Positives = 347/692 (50%), Gaps = 45/692 (6%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 ++ L+EK S+A S+ KD L+KQHAK AEEA++GWE ++ EA+ALK + KL Sbjct: 77 QITELNEKLSEAHSEMTTKDNLVKQHAKVAEEAVSGWEKAEAEALALKNHLESATLAKLT 136 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALKEC RQ+R++KEEHE+ + + V KT +W+K+K EL++++ +LE+ +L Sbjct: 137 
AEDRASHLDGALKECMRQIRNLKEEHEQNLHDVVLAKTKQWEKIKLELEAKMGDLEQELL 196 Query: 1771 ESKVNQSADLDAMQESSKLVNSKVRQQPESDRHVQ---DKVGLLESENAALKQKVNSLSK 1601 S + +QE S ++ ++ +++ ++ + E E +LK +++ +SK Sbjct: 197 RSAAENATLSRTLQERSNMLFKMSEEKSQAEAEIELLKSNIESCEREINSLKYELHLVSK 256 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ + +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 257 ELEIRNEEKNMSIRSAEVANKQHLEGVKKIAKLEAECQRLRGLVRKKLPGPAALAQMKLE 316 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGDMQ---KEIETLKETLSAMEDXXXXX 1271 E D GE +Q++S K + + ++Q K+ E L E L ME+ Sbjct: 317 VESLGRDYGETRQRRSPVKPPSPHLSPLPEFSIDNVQQCHKDNEFLTERLLGMEEETKML 376 Query: 1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRK----NHSLNLTGEASESESPSN 1103 +ASR CA+TA+KL ++E QL++ ++ +L + + S S++ SN Sbjct: 377 KEALAKRNSELQASRNICAKTASKLQNLEAQLQMNNQQKSPPKSNLQIPNDGSLSQNASN 436 Query: 1102 LQSMKSV-PDGHSDSF-CADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 SM S+ DG+ D+ CA+SWA+ L + L QFKK + +++LMDDFLEM Sbjct: 437 PPSMTSMSEDGNDDAVSCAESWATGLXSGLSQFKK----------ENANHLELMDDFLEM 486 Query: 928 ERLASMPSSKMVESKIKEWSETNLDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDLAFVS 749 E+LA + ++ + +DH G + EV K+ QL E + L+ ++ + Sbjct: 487 EKLACLSNNSNGAFSVNNKRSEAVDH--GAIAEVTSSKDLQL-EQKHDLDSLANQVSSNA 543 Query: 748 ETLEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR-------------SA 608 E L ++ ++ + L L ++ ++ ++ E D +L++I+ S Sbjct: 544 E-LSEVNPQSDKDLLPLTKLRSRISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQHSV 602 Query: 607 RAVTEETDSDTSLCIK------PHMITESKFSSSQ------NKVNIIDIELATAISSVVN 464 V EE + C + + E + S SQ + ++II ELA AIS + Sbjct: 603 SCVVEEIHCSDATCDRQACPEDAGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHE 662 Query: 463 FVQYMIQQSRH-KVTDSQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFLAVVRTL 287 FV ++ +++ + + + KI F+ V++V+ V + +L++ LA Sbjct: 663 FVLFLGKEAMAIQGASPDGNGWSRKIEDFSATVNKVLCXKMSVIDFIFDLSNVLA---KA 719 Query: 286 SSQDTNTAASADYNGGLNVKSDKDIVNIPEKK 191 S + N +N D V +PE K Sbjct: 720 SELNFNILGYKGAGEEINSSDCIDKVALPENK 751 >gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis] 
Length = 1087 Score = 254 bits (649), Expect = 1e-64 Identities = 191/655 (29%), Positives = 341/655 (52%), Gaps = 45/655 (6%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+ L+EK S A S+ KD L+KQHAK AEEA++GWE ++ EA+ALK + + KL Sbjct: 79 EISYLNEKLSAAQSEMTNKDNLVKQHAKVAEEAVSGWEKAEAEAVALKNHLETVTLSKLT 138 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALK C RQ+R++KEEHE+K+ E K + +K+K +L+ ++ LE+ + Sbjct: 139 AEDRASHLDGALKGCMRQIRNLKEEHEQKLQELALTKNKQCEKIKLDLEGKLANLEQDLR 198 Query: 1771 ESKVNQSADLDAMQESSKLVNSKVRQQPESDRHVQDKVGLLES---ENAALKQKVNSLSK 1601 S +A ++Q+ S ++ ++ +++ ++ G +ES E +LK +++ SK Sbjct: 199 RSAAENAAISRSLQDRSNMLIKISEEKAQAEAEIELLKGNIESCEREINSLKYELHVASK 258 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ + +KQ++E KK +K + RGL K+ A LA +K + Sbjct: 259 ELEIRNEEKNMSMRSAEVANKQHTEGVKKIAKLEAECQRLRGLVRKKLPGPAALAQMKLE 318 Query: 1441 KEVGENDAGELKQKKSFGKGS----ISSTELTQDKYLGDMQKEIETLKETLSAMEDXXXX 1274 E D G+ + ++S K S +TE T D + QKE E L E L A+E+ Sbjct: 319 VESLGRDYGDTRVRRSPVKPSSPHLSPATEFTPDN-VQKYQKENEFLTERLLAVEEETKM 377 Query: 1273 XXXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPS 1106 + SR+ CA+T++KL S+E Q++ + + + ++ E S S++ S Sbjct: 378 LKEALAKRNSELQVSRSMCAKTSSKLQSLEAQIQSNNQHKTTPKSIVQISAEGSFSQNAS 437 Query: 1105 NLQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLE 932 N S+ S+ DG+ D CA+SW + LI+E+ Q KK K K R +++LMDDFLE Sbjct: 438 NPPSLTSMSEDGNDDDRSCAESWTTTLISEVSQVKKEKSNEKTNRAEKPNHLNLMDDFLE 497 Query: 931 MERLASMPSSKMVESKIKEWSETNLDHSLG-DLEEVLIQKEQQLSEANRNCSDLSRDLAF 755 ME+LA + + + + + + ++ D EV+++KE+Q + + L+ Sbjct: 498 MEKLACLSNESNGAISVSDSMSSKISETVNHDASEVVMRKEEQC-----DSNSLANQQLT 552 Query: 754 VSETLEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIRSARAVTEET-DSD 578 + +L+ ++ +L L ++ ++L++ + D+ +L+DI+ A T +T Sbjct: 553 SNGKSPELRPGSNSEQLPLMKLQSRISVLLESVSKDSDVGTILEDIKHAIQETHDTLHQH 612 Query: 577 
TSLCIKPH-------------------MITESKFSSSQ---NKVNIIDIELATAISSVVN 464 T CI + +E + + SQ II +LA AIS + + Sbjct: 613 TVSCISEDVHCSDAGCDDRQANPEDAGLTSEKEIALSQPAREARQIIRDDLAAAISQIHD 672 Query: 463 FVQYMIQQSRH-KVTDSQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFLA 302 FV ++ +++ T ++ + + +I F+ +++V+H + + +L+S LA Sbjct: 673 FVLFLGKEAMGVHDTSTEGSEFSQRIEEFSVTLNKVIHSDLSLIDFVLDLSSVLA 727 >ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|567885183|ref|XP_006435150.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|557537271|gb|ESR48389.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|557537272|gb|ESR48390.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] Length = 1091 Score = 253 bits (647), Expect = 2e-64 Identities = 201/691 (29%), Positives = 339/691 (49%), Gaps = 44/691 (6%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 ++K L+EK S A S+ AK+ L+KQH K AEEA++GWE ++ EA+ALK + + KL Sbjct: 80 QIKELNEKLSAANSEISAKEDLVKQHTKVAEEAVSGWEKAEAEALALKNHLESVTLSKLT 139 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ +HLDGALKEC RQ+R++KE+HE+K+ + V KT +WDK++ E +++I E+ +L Sbjct: 140 AEDRAAHLDGALKECMRQIRNLKEDHEQKLQDFVLTKTKQWDKIRLEFEAKIANFEQELL 199 Query: 1771 ESKVNQSADLDAMQESSKLVNSKVRQQPESDRHVQDKVGLLES---ENAALKQKVNSLSK 1601 S + ++QE S ++ ++ +++ ++ G +E E + K +++ +SK Sbjct: 200 RSAAENATLSRSLQERSNMLIKISEEKSQAEAEIELLKGNIEQCEREINSAKYELHIVSK 259 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 260 ELEIRNEEKNMSMRSAEAANKQHMEGVKKIAKLEAECQRLRGLVRKKLPGPAALAQMKME 319 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGDM---QKEIETLKETLSAMEDXXXXX 1271 E D G+ + K+S K + + L ++ QKE E L E L AME+ Sbjct: 320 VESLGKDYGDSRLKRSPVKPTSPHLSPVSEFSLDNVQKFQKENEFLTERLLAMEEETKML 379 Query: 1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSN 1103 +ASR CA+TA+KL S+E Q++ ++ + + E S++ SN Sbjct: 380 
KEALAKRNSELQASRNLCAKTASKLQSLEAQMQTSTQQKSPTKSVVQIAAEGYTSQNASN 439 Query: 1102 LQSMKSVPDGHSDS--FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 S+ S+ + +D CADSWA+ALI+EL Q KK K K + +++LMDDFLEM Sbjct: 440 PPSLTSMSEDDNDDKVSCADSWATALISELSQIKKEKNVEKSNKAETPKHLELMDDFLEM 499 Query: 928 ERLASMPSSKMVESKIKEWSETN------LDHSLGDLEEVLIQKEQQLSEANRNCSDLSR 767 E+LA + + I + N L+H D + E LSE R+ + Sbjct: 500 EKLACLSNDTNSNGTITASNGPNNKTSDILNH---DASGAVTSGEDLLSEQQRDMNPSVD 556 Query: 766 DLAFVSETLEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR--------- 614 L+ +E+ + + L ++ ++L+ + D+ +++DI+ Sbjct: 557 KLSSNTES-STVNPEADAGQPQLMKLRSRISMLLETISKDADMGKIVEDIKRVVEDEHVT 615 Query: 613 ----SARAVTEETD-SDTSLCIKPH-----MITESKFSSSQNKVNIIDIELATAISSVVN 464 SA ++EE SD S + + + TE K + V +I EL AIS + + Sbjct: 616 LHQHSANCISEEVKCSDVSCSAEAYPGDASLNTERKIDLT---VQVISQELVAAISQIHD 672 Query: 463 FVQYMIQQSRHKVTDSQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFLAVVRTLS 284 FV ++ +++R + + + KI F ++V+ + + L++ LA L Sbjct: 673 FVLFLGKEARAVHDTTNENGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELR 732 Query: 283 SQDTNTAASADYNGGLNVKSDKDIVNIPEKK 191 N D N D V +PE K Sbjct: 733 ---INVMGYKDTEIEPNSPDCIDKVALPENK 760 >ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-like isoform X1 [Citrus sinensis] gi|568839322|ref|XP_006473633.1| PREDICTED: filament-like plant protein 4-like isoform X2 [Citrus sinensis] Length = 1091 Score = 253 bits (646), Expect = 3e-64 Identities = 195/685 (28%), Positives = 334/685 (48%), Gaps = 38/685 (5%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 ++K L+EK S A S+ AK+ L+KQH K AEEA++GWE ++ EA+ALK + + KL Sbjct: 80 QIKELNEKLSAANSEISAKEDLVKQHTKVAEEAVSGWEKAEAEALALKNHLESVTLSKLT 139 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ +HLDGALKEC RQ+R++KEEHE+K+ + V KT +WDK++ E +++I E+ +L Sbjct: 140 AEDRAAHLDGALKECMRQIRNLKEEHEQKLQDFVLTKTKQWDKIRLEFEAKIANFEQELL 199 Query: 1771 ESKVNQSADLDAMQESSKLVNSKVRQQPESDRHVQDKVGLLES---ENAALKQKVNSLSK 1601 S + ++QE S ++ ++ +++ ++ G +E 
E + K +++ +SK Sbjct: 200 RSAAENATLSRSLQERSNMLIKISEEKSQAEAEIELLKGNIEQCEREINSAKYELHIVSK 259 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 260 ELEIRNEEKNMSMRSAEAANKQHMEGVKKIAKLEAECQRLRGLVRKKLPGPAALAQMKME 319 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGDM---QKEIETLKETLSAMEDXXXXX 1271 E D G+ + K+S K + + L ++ QKE E L E L AME+ Sbjct: 320 VESLGRDYGDSRLKRSPVKPTSPHLSPVSEFSLDNVQKFQKENEFLTERLLAMEEETKML 379 Query: 1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSN 1103 +ASR CA+TA+KL S+E Q++ ++ + + E S++ SN Sbjct: 380 KEALAKRNSELQASRNLCAKTASKLQSLEAQMQTSTQQKSPTKSVVQIAAEGYTSQNASN 439 Query: 1102 LQSMKSVPDGHSDS--FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 S+ S+ + +D CADSWA+ALI+EL Q KK K K + +++LMDDFLEM Sbjct: 440 PPSLTSMSEDDNDDKVSCADSWATALISELSQIKKEKNVEKSNKAETPKHLELMDDFLEM 499 Query: 928 ERLASMPSSKMVESKIKEWSETN---LDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDLA 758 E+LA + + I + N D D + E LSE R+ + L+ Sbjct: 500 EKLACLSNDTNSNGTITASNGPNNKTSDIVNHDASGAVTSGEDLLSEQQRDMNPSVDKLS 559 Query: 757 FVSETLEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIR------------ 614 +E+ + + L ++ ++L+ + D+ +++DI+ Sbjct: 560 SNTES-STVNPEADAGQPQLMKLRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLHQ 618 Query: 613 -SARAVTEETDSDTSLCIKPHMITESKFSSSQN---KVNIIDIELATAISSVVNFVQYMI 446 SA ++EE C +++ ++ + V +I EL AI+ + +FV ++ Sbjct: 619 HSANCISEEVKCSDVSCSAEAYPGDARLNTERKIDLTVQVISQELVAAITQIHDFVLFLG 678 Query: 445 QQSRHKVTDSQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFLAVVRTLSSQDTNT 266 +++R + + + KI F ++V+ + + L++ LA L N Sbjct: 679 KEARAVHDTTNENGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELR---INV 735 Query: 265 AASADYNGGLNVKSDKDIVNIPEKK 191 D N D V +PE K Sbjct: 736 MGYKDTEIEPNSPDCIDKVALPENK 760 >ref|XP_006577974.1| PREDICTED: filament-like plant protein 4-like isoform X1 [Glycine max] gi|571448851|ref|XP_006577975.1| PREDICTED: filament-like plant protein 4-like isoform X2 [Glycine max] Length = 1078 Score = 253 bits (645), Expect = 4e-64 Identities = 200/653 (30%), 
Positives = 337/653 (51%), Gaps = 44/653 (6%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K ++EK S A S+ K+ ++KQHAK AEEA++GWE ++ EA+ALK + + KL Sbjct: 72 EIKEMNEKLSAANSEINTKESMVKQHAKVAEEAVSGWEKAEAEALALKNHLESVTLLKLT 131 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ +HLDGALKEC RQ+R++KEEHE+KI E KT + DK+K EL+++I E+ +L Sbjct: 132 AEDRATHLDGALKECMRQIRNLKEEHEQKIQEVALSKTKQLDKIKGELEAKIVNFEQELL 191 Query: 1771 ESKVNQSADLDAMQESSKLVNSKVRQQPESDRHVQDKVGLLESENAALKQKVNSL----- 1607 S A ++QE S + + + E H + ++ LL+ A ++++NSL Sbjct: 192 RSAAENGALSRSLQECSNM----LIKLSEEKAHAEAEIELLKGNIEACEKEINSLKYELH 247 Query: 1606 --SKEMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLAS 1454 SKE++ EK S +S+ +KQ+ E KK +K + RGL K+ A LA Sbjct: 248 VVSKELEIRNEEKNMSMRSAEAANKQHMEGVKKIAKLEAECQRLRGLVRKKLPGPAALAQ 307 Query: 1453 VKQDKEVGENDAGELKQKKSFGKGSISSTELTQDKYLGDM---QKEIETLKETLSAMEDX 1283 +K + E D GE + +KS K + + D L ++ QK+ E L E L AME+ Sbjct: 308 MKLEVESLGRDFGESRLRKSPVKPATPNLSPLPDFSLENVQKFQKDNEFLTERLLAMEEE 367 Query: 1282 XXXXXXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESE 1115 +ASR+ CA+T +KL S+E Q + + S + LT E+ ++ Sbjct: 368 TKMLKEALAKRNSELQASRSMCAKTLSKLQSLEAQSQTSNQLKLSPKSIVQLTHESIYNQ 427 Query: 1114 SPSNLQSMKSV-PDGHSD-SFCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDD 941 + S+ S+ S+ DG+ D + CA+SW++A+++ L QF + K + + ++LMDD Sbjct: 428 NASSAPSLVSMSEDGNDDAASCAESWSTAIVSGLSQFPREKCNEESNKSEVTNKLELMDD 487 Query: 940 FLEMERLASMPSSKMVESKIKEWSETNLDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDL 761 FLE+E+LA + + V++ + T D GD+ EV KE LSE N N L + Sbjct: 488 FLEVEKLARLSNDSNVDATVSNNKTT--DIVTGDVSEVCTGKE-GLSEKNGNSDPLPNQV 544 Query: 760 AFVSETLEQLKARNSWNELS---LASFHEKMELILKAEDEGGDLHGLLKDIRSARAVTEE 590 + S+ L + A + ++LS L ++ L+ ++ + D+ +++DI+ + + Sbjct: 545 S--SDPL--MSAPDFQSDLSGLLLTELRSRILLVFESLAKDADIGKIVEDIKHVLEDSHD 600 Query: 589 TDSDTSLCIKPHMIT--------------ESKFSSSQNKVNIIDI--ELATAISSVVNFV 458 T S+ P T E + SSQ + I +L AIS + +FV Sbjct: 601 
TTIHHSVDAHPSDATCDRKDNPEDAGLNLEKEVISSQQPKGYVQITSDLEAAISQIHDFV 660 Query: 457 QYMIQQSR--HKVTDSQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFL 305 ++ +++ H S +++ KI F+ ++V+ A + Q + +L+ L Sbjct: 661 LFLGKEAMTFHDDVSSDGNEMRQKIEEFSITFNKVLCNNASLLQFVLDLSYVL 713 >ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] gi|550344134|gb|EEE81259.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] Length = 1063 Score = 249 bits (637), Expect = 3e-63 Identities = 195/671 (29%), Positives = 337/671 (50%), Gaps = 24/671 (3%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 ++ L+EK S A S+ K+ L+KQHAK AEEA++GWE ++ EA+ALK + + KL Sbjct: 71 QIMDLNEKLSAAHSEMTTKENLVKQHAKVAEEAVSGWEKAEAEALALKNHLETVTLSKLT 130 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ SHLDGALKEC RQ+R++KEEHE+K+ + V K + DK+K + +++I L++ +L Sbjct: 131 AEDRASHLDGALKECMRQIRNLKEEHEQKVQDVVLNKKKQLDKIKMDFEAKIGNLDQELL 190 Query: 1771 ESKVNQSADLDAMQESSKLV--NSKVRQQPESD-RHVQDKVGLLESENAALKQKVNSLSK 1601 S +A ++QE S ++ S+ R Q E+D ++ + E E +LK +++ SK Sbjct: 191 RSAAENAALSRSLQERSNMLIKISEERSQAEADIELLKSNIESCEREINSLKYELHVTSK 250 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK +S+ +KQ++E KK +K + RGL K+ A LA +K + Sbjct: 251 ELEIRNEEKNMIMRSAEAANKQHTEGVKKIAKLEAECQRLRGLVRKKLPGPAALAQMKLE 310 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGDMQ---KEIETLKETLSAMEDXXXXX 1271 E D G+ + ++S K + L ++Q KE E L E L A+E+ Sbjct: 311 VESLGRDYGDSRLRRSPVKPPSPHLSSVPEFSLDNVQKFNKENEFLTERLFAVEEETKML 370 Query: 1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSN 1103 +ASR CA+TA+KL S+E Q +I + S + E S++ SN Sbjct: 371 KEALAKRNSELQASRNLCAKTASKLQSLEAQFQINNHQKSSPKSITQVPAEGYSSQNISN 430 Query: 1102 LQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 S+ SV DG+ D+ CADSWA+ ++++ FKK+ K + + +++LMDDFLEM Sbjct: 431 PPSLTSVSEDGNDDTQSCADSWATTSVSDVSHFKKDNHIEKSNKAENAKHLELMDDFLEM 490 Query: 928 
ERLASMPSSKMVESKIKEWSETNLDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDLAFVS 749 E+LA + + ++ + + L EV +QKE LSE R+ L+ ++ Sbjct: 491 EKLACLNADSATTISSSPNNKASETANTDALAEVSLQKEDALSEEKRDLDPLANHVSCNK 550 Query: 748 ETLEQLKARNSWNELSLASF---HEKMELILKAEDEGGDLHGLLKDIRSARAVTEETDSD 578 ++ A NS ++ L+SF ++ ++L++ + D+ +L++I+ E S Sbjct: 551 DS----SAINSGSDADLSSFGKLQSRISMLLESVSKEVDVDKILEEIKQVVHDAETAASC 606 Query: 577 TSLCIKPHMITESKFSSSQNKVNIIDIELATAISSVVNFVQYMIQQSRHKVTDSQLHKVN 398 S + T + + ++ V + + E+ S+++ ++ + V D+ + Sbjct: 607 GSKEVHHSDATCDRQTCPEDAVIMGEKEITLLQESIIHDFVLLLGKEAMAVHDTSCDSIG 666 Query: 397 L--KISGFAELVDQVMHGTAKVTQLLAELASFLAVVRTLSSQDTNTAASADYNGGLNVKS 224 L KI F+ +V+ + + +L+ LA+ S N +N Sbjct: 667 LSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALA---SGLRFNVLGYKCNEAEINSPD 723 Query: 223 DKDIVNIPEKK 191 D V +PE K Sbjct: 724 CIDKVALPENK 734 >ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] gi|550344133|gb|ERP63976.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] Length = 991 Score = 248 bits (634), Expect = 7e-63 Identities = 195/667 (29%), Positives = 335/667 (50%), Gaps = 24/667 (3%) Frame = -3 Query: 2119 LDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLALENE 1940 L+EK S A S+ K+ L+KQHAK AEEA++GWE ++ EA+ALK + + KL E+ Sbjct: 3 LNEKLSAAHSEMTTKENLVKQHAKVAEEAVSGWEKAEAEALALKNHLETVTLSKLTAEDR 62 Query: 1939 VSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVLESKV 1760 SHLDGALKEC RQ+R++KEEHE+K+ + V K + DK+K + +++I L++ +L S Sbjct: 63 ASHLDGALKECMRQIRNLKEEHEQKVQDVVLNKKKQLDKIKMDFEAKIGNLDQELLRSAA 122 Query: 1759 NQSADLDAMQESSKLV--NSKVRQQPESD-RHVQDKVGLLESENAALKQKVNSLSKEMDR 1589 +A ++QE S ++ S+ R Q E+D ++ + E E +LK +++ SKE++ Sbjct: 123 ENAALSRSLQERSNMLIKISEERSQAEADIELLKSNIESCEREINSLKYELHVTSKELEI 182 Query: 1588 ILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQDKEVG 1430 EK +S+ +KQ++E KK +K + RGL K+ A LA +K + E Sbjct: 183 RNEEKNMIMRSAEAANKQHTEGVKKIAKLEAECQRLRGLVRKKLPGPAALAQMKLEVESL 242 Query: 1429 ENDAGELKQKKSFGKGSISSTELTQDKYLGDMQ---KEIETLKETLSAMEDXXXXXXXXX 
1259 D G+ + ++S K + L ++Q KE E L E L A+E+ Sbjct: 243 GRDYGDSRLRRSPVKPPSPHLSSVPEFSLDNVQKFNKENEFLTERLFAVEEETKMLKEAL 302 Query: 1258 XXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSNLQSM 1091 +ASR CA+TA+KL S+E Q +I + S + E S++ SN S+ Sbjct: 303 AKRNSELQASRNLCAKTASKLQSLEAQFQINNHQKSSPKSITQVPAEGYSSQNISNPPSL 362 Query: 1090 KSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEMERLA 917 SV DG+ D+ CADSWA+ ++++ FKK+ K + + +++LMDDFLEME+LA Sbjct: 363 TSVSEDGNDDTQSCADSWATTSVSDVSHFKKDNHIEKSNKAENAKHLELMDDFLEMEKLA 422 Query: 916 SMPSSKMVESKIKEWSETNLDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDLAFVSETLE 737 + + ++ + + L EV +QKE LSE R+ L+ ++ ++ Sbjct: 423 CLNADSATTISSSPNNKASETANTDALAEVSLQKEDALSEEKRDLDPLANHVSCNKDS-- 480 Query: 736 QLKARNSWNELSLASF---HEKMELILKAEDEGGDLHGLLKDIRSARAVTEETDSDTSLC 566 A NS ++ L+SF ++ ++L++ + D+ +L++I+ E S S Sbjct: 481 --SAINSGSDADLSSFGKLQSRISMLLESVSKEVDVDKILEEIKQVVHDAETAASCGSKE 538 Query: 565 IKPHMITESKFSSSQNKVNIIDIELATAISSVVNFVQYMIQQSRHKVTDSQLHKVNL--K 392 + T + + ++ V + + E+ S+++ ++ + V D+ + L K Sbjct: 539 VHHSDATCDRQTCPEDAVIMGEKEITLLQESIIHDFVLLLGKEAMAVHDTSCDSIGLSQK 598 Query: 391 ISGFAELVDQVMHGTAKVTQLLAELASFLAVVRTLSSQDTNTAASADYNGGLNVKSDKDI 212 I F+ +V+ + + +L+ LA+ S N +N D Sbjct: 599 IEEFSITFKKVLCSDRSLIDFMFDLSRVLALA---SGLRFNVLGYKCNEAEINSPDCIDK 655 Query: 211 VNIPEKK 191 V +PE K Sbjct: 656 VALPENK 662 >ref|XP_006601345.1| PREDICTED: filament-like plant protein 6-like [Glycine max] Length = 1070 Score = 247 bits (630), Expect = 2e-62 Identities = 199/662 (30%), Positives = 340/662 (51%), Gaps = 52/662 (7%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K L+EK S A S+ K+ L+KQHAK AEEA++GWE ++ EA+ALK + + KL Sbjct: 69 EIKELNEKLSAANSEINTKESLVKQHAKVAEEAVSGWEKAEAEALALKNHLETVTLAKLT 128 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E++ S LDGALKEC RQ+R++KEEHE+KI E KT + DK+K E +++I E+ +L Sbjct: 129 AEDQASQLDGALKECMRQIRNLKEEHEQKIQEVTLTKTKQLDKIKGEFEAKIANFEQELL 188 Query: 1771 
ESKVNQSADLDAMQESSKLVNSKVRQQPESDRHVQDKVGLLES---ENAALKQKVNSLSK 1601 S + +A ++QE S ++ + ++ ++ ++ G +ES E +LK +++ +SK Sbjct: 189 RSAADNAALSRSLQERSNMIINLSEEKAHAEAEIELLKGNIESCEREINSLKYELHVISK 248 Query: 1600 EMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQD 1442 E++ EK S +S+ +KQ+ E KK +K + RGL K+ A LA +K + Sbjct: 249 ELEIRNEEKNMSMRSAEAANKQHMEGVKKIAKLEAECQRLRGLVRKKLPGPAALAQMKLE 308 Query: 1441 KEVGENDAGELKQKKSFGKGSISSTELTQDKYLGDMQ---KEIETLKETLSAMEDXXXXX 1271 E + GE + +KS K + S L + Q K+ E L E L AME+ Sbjct: 309 VESLGREYGETRLRKSPVKPASSHMSTLAGFSLDNAQKFHKDNEFLTERLLAMEEETKML 368 Query: 1270 XXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPSN 1103 +ASR++ A+T +KL +E Q++ ++ S +++ E+ S++ SN Sbjct: 369 KEALAKRNSELQASRSSFAKTLSKLQILEAQVQTNNQQKGSPQSIIHINHESIYSQNASN 428 Query: 1102 LQSMKSV-PDGHSD-SFCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLEM 929 S S+ DG+ D CA+SW++A ++EL QF K K ++ + ++LMDDFLE+ Sbjct: 429 APSFVSLSEDGNDDVGSCAESWSTAFLSELSQFPKEKNTEELSKSDATKKLELMDDFLEV 488 Query: 928 ERLASMPSSKMVESKIKEWSETNLDHSL--GDLEEVLIQKE-----QQLSEANRNCSDLS 770 E+LA + + ES + N+ + + DL EV K+ Q+ SE N S++S Sbjct: 489 EKLAWLSN----ESSGVSVTSNNITNEIVVNDLSEVSAGKDVPSNTQENSEPNPLPSEVS 544 Query: 769 RDLAFVSETLEQLKARNSWNE----LSLASFHEKMELILKAEDEGGDLHGLLKDIRSARA 602 + E+L A + ++ LSLA ++ + ++ + D+ +LKDI+ A Sbjct: 545 --------SAEELSAPDPQSDVPAGLSLAELQSRISSVFESLAKDADMEKILKDIKHALE 596 Query: 601 VTEETDSDTSLCIKPHMITES------------------KFSSSQNKVNIIDI--ELATA 482 T S+ PH + S K SSQ + + +L A Sbjct: 597 EACGTSIQDSVSAIPHDVKPSDTTCDELGNAEDAGSNAEKEISSQKPTEFVQMTSDLEAA 656 Query: 481 ISSVVNFVQYMIQQ--SRHKVTDSQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASF 308 S + +FV ++ ++ + H ++ S ++ K+ F+ ++V A + Q + +L++ Sbjct: 657 TSQIHDFVLFLAKEAMTAHDIS-SDGDGISQKMKEFSVTFNKVTCNEASLLQFVLDLSNV 715 Query: 307 LA 302 LA Sbjct: 716 LA 717 >ref|XP_006581178.1| PREDICTED: filament-like plant protein 4-like isoform X2 [Glycine max] gi|571458619|ref|XP_006581179.1| PREDICTED: filament-like plant protein 4-like isoform X3 [Glycine max] Length = 1080 
Score = 247 bits (630), Expect = 2e-62 Identities = 198/652 (30%), Positives = 332/652 (50%), Gaps = 43/652 (6%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K ++EK S A S+ K+ ++KQHAK AEEA++GWE ++ EA+ALK + + KL Sbjct: 72 EIKEMNEKMSAANSEINTKESMVKQHAKVAEEAVSGWEKAEAEALALKNHLESVTLLKLT 131 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ +HLDGALKEC RQ+R++KEEHE KI E KT + DK+K EL+++I E+ +L Sbjct: 132 AEDRATHLDGALKECMRQIRNLKEEHEHKIQEVALSKTMQLDKIKGELEAKIVNFEQELL 191 Query: 1771 ESKVNQSADLDAMQESSKLVNSKVRQQPESDRHVQDKVGLLESENAALKQKVNSL----- 1607 S ++QE S + + + E H + ++ LL+ A ++++NSL Sbjct: 192 RSAAENGTLSRSLQERSNM----LIKLSEEKGHAEGEIELLKGNIEACEREINSLKYELH 247 Query: 1606 --SKEMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLAS 1454 SKE++ EK S +S+ +KQ+ E KK +K + RGL K+ A LA Sbjct: 248 VVSKELEIRNEEKNMSMRSAEAANKQHMEGVKKITKLEAECQRLRGLVRKKLPGPAALAQ 307 Query: 1453 VKQDKEVGENDAGELKQKKSFGKGSISSTELTQDKYLGDM---QKEIETLKETLSAMEDX 1283 +K + E D GE + +KS K + + D L ++ QK+ E L E L AME+ Sbjct: 308 MKLEVESLGRDFGESRLRKSPVKPATPNLSPLPDFSLENVQKFQKDNEFLTERLLAMEEE 367 Query: 1282 XXXXXXXXXXXXXXXKASRATCARTANKLSSVE--EQLEIMKRKNHSLNLTGEASESESP 1109 +ASR+ CA+T +KL S+E Q ++ + LT E +++ Sbjct: 368 TKMLKEALAKRNSELQASRSMCAKTLSKLQSLEAQSQNQLKGSPKSIVQLTHERIYNQNS 427 Query: 1108 SNLQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFL 935 S+ S+ S+ DG+ D+ CA+SWA+A+++ L QF + K + + ++LMDDFL Sbjct: 428 SSAPSLISMSEDGNDDAESCAESWATAIVSGLSQFPREKCNEESNKSEVTNKLELMDDFL 487 Query: 934 EMERLASMPSSKMVESKIKEWSET-NLDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDLA 758 E+E+LA + + V++ I S D DL EV KE LSE N + L +++ Sbjct: 488 EVEKLARLSNDSNVDATISVSSNNKTTDFVADDLSEVCTGKE-GLSEKNGDSDQLPNEVS 546 Query: 757 FVSETLEQLKARNSWNELS---LASFHEKMELILKAEDEGGDLHGLLKDIRSARAVTEET 587 S+ L + A +S ++S L ++ L+ ++ + D+ ++ DI+ + +T Sbjct: 547 --SDAL--MSAPDSQTDVSGLLLTELRSRILLVFESLAKDADIGKIVDDIKHVLEDSHDT 602 Query: 586 DSDTSLCIKPHMIT--------------ESKFSSSQNKVNIIDI--ELATAISSVVNFVQ 455 S+ P T E + 
SSQ + I +L A+S + +FV Sbjct: 603 TIHHSVDAHPSDTTCDRKDNPEDAGLNLEKEVISSQQPKEYVQITTDLEAAVSQIHDFVL 662 Query: 454 YMIQQSRHKVTD--SQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFL 305 ++ +++ D S +++ KI F+ ++V+ A + Q + +L+ L Sbjct: 663 FLGKEAMTSFHDVSSDGNEMRQKIEEFSVTFNKVLCNNASLLQFVLDLSYVL 714 >ref|XP_006581177.1| PREDICTED: filament-like plant protein 4-like isoform X1 [Glycine max] Length = 1120 Score = 247 bits (630), Expect = 2e-62 Identities = 198/652 (30%), Positives = 332/652 (50%), Gaps = 43/652 (6%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E+K ++EK S A S+ K+ ++KQHAK AEEA++GWE ++ EA+ALK + + KL Sbjct: 112 EIKEMNEKMSAANSEINTKESMVKQHAKVAEEAVSGWEKAEAEALALKNHLESVTLLKLT 171 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ +HLDGALKEC RQ+R++KEEHE KI E KT + DK+K EL+++I E+ +L Sbjct: 172 AEDRATHLDGALKECMRQIRNLKEEHEHKIQEVALSKTMQLDKIKGELEAKIVNFEQELL 231 Query: 1771 ESKVNQSADLDAMQESSKLVNSKVRQQPESDRHVQDKVGLLESENAALKQKVNSL----- 1607 S ++QE S + + + E H + ++ LL+ A ++++NSL Sbjct: 232 RSAAENGTLSRSLQERSNM----LIKLSEEKGHAEGEIELLKGNIEACEREINSLKYELH 287 Query: 1606 --SKEMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLAS 1454 SKE++ EK S +S+ +KQ+ E KK +K + RGL K+ A LA Sbjct: 288 VVSKELEIRNEEKNMSMRSAEAANKQHMEGVKKITKLEAECQRLRGLVRKKLPGPAALAQ 347 Query: 1453 VKQDKEVGENDAGELKQKKSFGKGSISSTELTQDKYLGDM---QKEIETLKETLSAMEDX 1283 +K + E D GE + +KS K + + D L ++ QK+ E L E L AME+ Sbjct: 348 MKLEVESLGRDFGESRLRKSPVKPATPNLSPLPDFSLENVQKFQKDNEFLTERLLAMEEE 407 Query: 1282 XXXXXXXXXXXXXXXKASRATCARTANKLSSVE--EQLEIMKRKNHSLNLTGEASESESP 1109 +ASR+ CA+T +KL S+E Q ++ + LT E +++ Sbjct: 408 TKMLKEALAKRNSELQASRSMCAKTLSKLQSLEAQSQNQLKGSPKSIVQLTHERIYNQNS 467 Query: 1108 SNLQSMKSV-PDGHSDS-FCADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFL 935 S+ S+ S+ DG+ D+ CA+SWA+A+++ L QF + K + + ++LMDDFL Sbjct: 468 SSAPSLISMSEDGNDDAESCAESWATAIVSGLSQFPREKCNEESNKSEVTNKLELMDDFL 527 Query: 934 EMERLASMPSSKMVESKIKEWSET-NLDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDLA 758 E+E+LA + + V++ I S D 
DL EV KE LSE N + L +++ Sbjct: 528 EVEKLARLSNDSNVDATISVSSNNKTTDFVADDLSEVCTGKE-GLSEKNGDSDQLPNEVS 586 Query: 757 FVSETLEQLKARNSWNELS---LASFHEKMELILKAEDEGGDLHGLLKDIRSARAVTEET 587 S+ L + A +S ++S L ++ L+ ++ + D+ ++ DI+ + +T Sbjct: 587 --SDAL--MSAPDSQTDVSGLLLTELRSRILLVFESLAKDADIGKIVDDIKHVLEDSHDT 642 Query: 586 DSDTSLCIKPHMIT--------------ESKFSSSQNKVNIIDI--ELATAISSVVNFVQ 455 S+ P T E + SSQ + I +L A+S + +FV Sbjct: 643 TIHHSVDAHPSDTTCDRKDNPEDAGLNLEKEVISSQQPKEYVQITTDLEAAVSQIHDFVL 702 Query: 454 YMIQQSRHKVTD--SQLHKVNLKISGFAELVDQVMHGTAKVTQLLAELASFL 305 ++ +++ D S +++ KI F+ ++V+ A + Q + +L+ L Sbjct: 703 FLGKEAMTSFHDVSSDGNEMRQKIEEFSVTFNKVLCNNASLLQFVLDLSYVL 754 >gb|ESW08071.1| hypothetical protein PHAVU_009G015700g [Phaseolus vulgaris] gi|561009165|gb|ESW08072.1| hypothetical protein PHAVU_009G015700g [Phaseolus vulgaris] Length = 1080 Score = 247 bits (630), Expect = 2e-62 Identities = 200/653 (30%), Positives = 329/653 (50%), Gaps = 44/653 (6%) Frame = -3 Query: 2131 ELKSLDEKFSQALSDDIAKDGLLKQHAKAAEEAIAGWENSDKEAMALKKENDKLAQQKLA 1952 E K ++EK S A S+ K+ ++KQHAK AEEA++GWE ++ EA+ALK + + KL Sbjct: 71 EFKEINEKLSAANSEINTKESMVKQHAKVAEEAVSGWEKAEAEALALKNHLESVTLLKLT 130 Query: 1951 LENEVSHLDGALKECTRQLRHVKEEHEKKISEAVAKKTSEWDKVKFELDSRIFELEELVL 1772 E+ +HLDGALKEC RQ+R++KEEHE KI + KT + D++K EL+++I E+ +L Sbjct: 131 AEDRATHLDGALKECMRQIRNLKEEHELKIQDVALSKTKQLDQIKGELEAKIVNFEQELL 190 Query: 1771 ESKVNQSADLDAMQESS----KLVNSKVRQQPESDRHVQDKVGLLESENAALKQKVNSLS 1604 S A ++QE S KL K R + E + ++ + E EN +LK +++ +S Sbjct: 191 RSAAENGALSRSLQERSNMLIKLSEDKARAEAEIEL-LKGNIEACERENNSLKYELHVVS 249 Query: 1603 KEMDRILSEKEESRKSSTMTSKQNSEVHKKGSKSDT-----RGLPHKR--SVATLASVKQ 1445 KE++ EK S +S+ +KQ E KK +K + RGL K+ A LA +K Sbjct: 250 KELEIRNEEKNMSMRSAEAANKQQMEGVKKIAKLEAECQRLRGLVRKKLPGPAALAQMKL 309 Query: 1444 DKEVGENDAGELKQKKSFGKGSISSTELTQDKYLGDM---QKEIETLKETLSAMEDXXXX 1274 + E D GE + +KS K + + D L ++ QK+ E L E L AME+ Sbjct: 310 EVESLGRDFGESRLRKSPVKAASPNLSPLPDFSLDNVQKFQKDNEFLTERLLAMEEETKM 369 Query: 1273 
XXXXXXXXXXXXKASRATCARTANKLSSVEEQLEIMKRKNHS----LNLTGEASESESPS 1106 +ASR+ CA+T +KL S+E Q + + S + +T E+ +++ S Sbjct: 370 LKEALAKRNSELQASRSMCAKTLSKLQSLEAQPQTSNQLKGSPKSIVQITHESIYNQNAS 429 Query: 1105 NLQSMKSV-PDGHSDSF-CADSWASALIAELDQFKKNKIAGKIERLSDIPNVDLMDDFLE 932 + S+ S+ DG+ D+ CA+SW++A++ L QF K K + + ++LMDDFLE Sbjct: 430 SAPSLVSMSEDGNDDAVSCAESWSTAIVPGLSQFPKEKCTEESSKSEVSNKLELMDDFLE 489 Query: 931 MERLASMPSSKMVESKIKEWSET-NLDHSLGDLEEVLIQKEQQLSEANRNCSDLSRDL-- 761 +E+LA + + +V++ + S D GD+ EV I E LSE N + LS + Sbjct: 490 VEKLARLSNDSIVDATVSVSSNNKTTDIVNGDVSEVSIGNE-GLSEKIGNSNPLSNQVSS 548 Query: 760 -AFVSETLEQLKARNSWNELSLASFHEKMELILKAEDEGGDLHGLLKDIRSARAVTEETD 584 A +S Q A + L L ++ L+ ++ GD+ +++DI+ + + Sbjct: 549 DALMSAPYPQSDA----SGLILTELRSRILLVFESLANDGDIGKIVEDIKHVLEDSHDIT 604 Query: 583 SDTSLCIKPHMIT--------------ESKFSSSQNKVNIIDI--ELATAISSVVNFVQY 452 S+ P T E SSQ + I +L AIS + +FV Sbjct: 605 IRHSVDAHPSDATCDRKDDPEDAGLNLEKDIISSQQPREYVRITSDLEAAISQIHDFVLL 664 Query: 451 MIQQSRHKVTDSQL----HKVNLKISGFAELVDQVMHGTAKVTQLLAELASFL 305 + + VT + +++ KI F+ D++++ A + Q + +L+ L Sbjct: 665 L---GKEAVTFHDISCDGNEMRQKIEEFSITFDKILNNNASLLQFVLDLSYVL 714