BLASTX nr result
ID: Sinomenium21_contig00016679
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Sinomenium21_contig00016679 (1067 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002269821.2| PREDICTED: uncharacterized protein LOC100248... 194 4e-47 ref|XP_004307129.1| PREDICTED: uncharacterized protein LOC101312... 153 1e-34 ref|XP_007011120.1| Agglutinin-like protein ALA1, putative isofo... 153 1e-34 ref|XP_007011119.1| Agglutinin-like protein ALA1, putative isofo... 153 1e-34 ref|XP_007011118.1| Agglutinin-like protein ALA1, putative isofo... 153 1e-34 ref|XP_002303916.2| hypothetical protein POPTR_0003s19960g [Popu... 152 2e-34 ref|XP_006420755.1| hypothetical protein CICLE_v10004446mg [Citr... 147 7e-33 ref|XP_006578483.1| PREDICTED: uncharacterized protein LOC100812... 146 2e-32 ref|XP_006494919.1| PREDICTED: uncharacterized protein LOC102616... 144 5e-32 ref|XP_006403566.1| hypothetical protein EUTSA_v10010203mg [Eutr... 142 2e-31 ref|XP_006403565.1| hypothetical protein EUTSA_v10010203mg [Eutr... 142 2e-31 ref|XP_006578482.1| PREDICTED: uncharacterized protein LOC100812... 141 4e-31 ref|XP_006578476.1| PREDICTED: uncharacterized protein LOC100812... 140 7e-31 ref|XP_002513079.1| conserved hypothetical protein [Ricinus comm... 139 2e-30 dbj|BAC41810.1| unknown protein [Arabidopsis thaliana] 137 1e-29 ref|NP_191014.2| uncharacterized protein [Arabidopsis thaliana] ... 137 1e-29 ref|XP_002877982.1| hypothetical protein ARALYDRAFT_485852 [Arab... 137 1e-29 emb|CAB77570.1| putative protein [Arabidopsis thaliana] 137 1e-29 gb|EYU38268.1| hypothetical protein MIMGU_mgv1a003852mg [Mimulus... 134 8e-29 ref|XP_006365448.1| PREDICTED: uncharacterized protein LOC102601... 134 8e-29 >ref|XP_002269821.2| PREDICTED: uncharacterized protein LOC100248068 [Vitis vinifera] gi|297742697|emb|CBI35150.3| unnamed protein product [Vitis vinifera] Length = 704 Score = 194 bits (494), Expect = 4e-47 Identities = 118/301 (39%), Positives = 166/301 (55%), Gaps = 7/301 (2%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + IWG A ESDDH VPYP NE+K FG K + +QEV VK EQ Sbjct: 1 MFDWNDEELANIIWGDAGESDDHTVPYPNENEKKPPATFGVNNK-DWNQEVTDVKPTEQT 59 Query: 709 ISEAKIESLGTKLESSSRLNTH-GLPIPEFVMNSWVETEISGLDSAK------GDHSQLD 551 S AKI+ G K E S+ L+ + GLP F M SW +++S ++AK + +QLD Sbjct: 60 ASGAKIQFHGNKQEHSTNLDINEGLPGTGFSMGSW--SDLSSSNAAKTNQDSMAETTQLD 117 Query: 550 NGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVFRCGSLDNADELWSSSAD 371 +F N +++ + G F+D WANIG+FDD DRIF N+ VF SL NADELWSSS Sbjct: 118 KDPEIFRNQHDENEQGDFVDYGWANIGSFDDLDRIFSNDAPVFGNASLGNADELWSSST- 176 Query: 370 VFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDPSATCGRRVKSHPGSHVLM 191 +SP KS +S DSP+ G+ A + S+ ++ E + +D S+T + +HP SH Sbjct: 177 --NSPVKSFPLSVDSPSLGLGALRNTSEHFEIKTEHVEHEDQSSTPAYGIMNHPSSH--- 231 Query: 190 DVNLGKDKAAGKESCNFADPIIEHAAGKSMPIVDEKTDAKFVEKTGAFNSHSQSKRKVDK 11 + +C D +E+ GKS PI+ ++ V KT NS ++ Sbjct: 232 ---------GQQNTCATMDQ-VEYGGGKSKPIMKDQIAFDIVGKTTTLNSQYAAENAATP 281 Query: 10 N 8 N Sbjct: 282 N 282 >ref|XP_004307129.1| PREDICTED: uncharacterized protein LOC101312766 [Fragaria vesca subsp. vesca] Length = 687 Score = 153 bits (387), Expect = 1e-34 Identities = 101/309 (32%), Positives = 159/309 (51%), Gaps = 15/309 (4%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW+++E+ + IWG+A ESDDHIVPYP+ ++ + K E +QE ++K EQ Sbjct: 1 MFDWNDQELANIIWGEACESDDHIVPYPEATDDY-------QDKKERNQESDTIKPTEQK 53 Query: 709 ISEAKIESLGTKLESSSRLNT-HGLPIPEFVMNSWVETEISGLDSAK----GDHSQLDNG 545 AK++ G + ESSS T G+ F ++W ++SG + ++ + +QL G Sbjct: 54 APGAKVDLHGREPESSSNFETGEGISTSGFRTDTW--PDLSGFEKSEQHCMAEANQLVKG 111 Query: 544 LGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVFRCGSLDNADELWSSSADVF 365 L N E K+ F+D WA IG+FDD D+IF N+D +F GSL N DELW SS D Sbjct: 112 AELIQNSNEAKEQ--FVDYGWATIGSFDDLDQIFSNDDTIFSHGSLGNVDELW-SSRDAT 168 Query: 364 SSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDPSATCGRRVKSHPGSHVL--- 194 +SP K+ ++ DSP+ G A +VS+ +V E + Q + + + + SH L Sbjct: 169 NSPIKAFPLTSDSPSCGSGALRNVSEHSEVKTEYVQQDEQAISPVSGTTDYSASHGLQNA 228 Query: 193 ------MDVNLGKDKAAGKESCNFADPIIEHAAGKSMPIVDEKTDAKFVEKTGAFNSHSQ 32 ++ +GK + KE +D P V+ + + +K + Sbjct: 229 PATVDHVEYAVGKSRTTEKEQ-TVSDLGNSTVTNSYQPAVNSSSSTEIPDKVSRQRKFLR 287 Query: 31 SKRKV-DKN 8 S++K+ DKN Sbjct: 288 SRKKLEDKN 296 >ref|XP_007011120.1| Agglutinin-like protein ALA1, putative isoform 3 [Theobroma cacao] gi|508728033|gb|EOY19930.1| Agglutinin-like protein ALA1, putative isoform 3 [Theobroma cacao] Length = 679 Score = 153 bits (386), Expect = 1e-34 Identities = 107/319 (33%), Positives = 153/319 (47%), Gaps = 25/319 (7%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + IWG+ ESDDHIVPY +G+E K E SQE ++K +Q Sbjct: 1 MFDWNDEELTNIIWGEDGESDDHIVPYQEGSENC-------HSKKEWSQETATIKSTDQK 53 Query: 709 ISEAKIESLGTKLESSSRLNTHG-LPIPEFVMNSWVETEISGLDSAKGDH----SQLDNG 545 K++ G K+E SS N +G + F M SW E +S ++AK D S++ N Sbjct: 54 TPGDKVDLHGRKVEGSSNFNANGGIATSGFGMVSWPELSLS--NAAKTDQDSMGSEVSNH 111 Query: 544 LG--------------------LFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLV 425 L +F N E K+ G +D SWANIG+FDD DRIF N+D + Sbjct: 112 LAEVNKYSSTNAGTTELTKDSQIFQNPNEGKEQGDLVDYSWANIGSFDDLDRIFSNDDPI 171 Query: 424 FRCGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDP 245 F SL +AD+LWSSS +V +S KS + DSP+ G+ A S+ +V E Q + Sbjct: 172 FGNVSLGSADDLWSSSKEVTNSAAKSFPTTVDSPSLGLGALRSTSENLEVKREYEQQDNQ 231 Query: 244 SATCGRRVKSHPGSHVLMDVNLGKDKAAGKESCNFADPIIEHAAGKSMPIVDEKTDAKFV 65 T SH L V E A +S I++E+ + + Sbjct: 232 PFTLSYEKLDGSTSHGLHHV--------------------EFAGDESKSIIEEQMNVETR 271 Query: 64 EKTGAFNSHSQSKRKVDKN 8 KT A SH +++ + N Sbjct: 272 GKTSASKSHMVAEKVMAPN 290 >ref|XP_007011119.1| Agglutinin-like protein ALA1, putative isoform 2 [Theobroma cacao] gi|508728032|gb|EOY19929.1| Agglutinin-like protein ALA1, putative isoform 2 [Theobroma cacao] Length = 606 Score = 153 bits (386), Expect = 1e-34 Identities = 107/319 (33%), Positives = 153/319 (47%), Gaps = 25/319 (7%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + IWG+ ESDDHIVPY +G+E K E SQE ++K +Q Sbjct: 1 MFDWNDEELTNIIWGEDGESDDHIVPYQEGSENC-------HSKKEWSQETATIKSTDQK 53 Query: 709 ISEAKIESLGTKLESSSRLNTHG-LPIPEFVMNSWVETEISGLDSAKGDH----SQLDNG 545 K++ G K+E SS N +G + F M SW E +S ++AK D S++ N Sbjct: 54 TPGDKVDLHGRKVEGSSNFNANGGIATSGFGMVSWPELSLS--NAAKTDQDSMGSEVSNH 111 Query: 544 LG--------------------LFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLV 425 L +F N E K+ G +D SWANIG+FDD DRIF N+D + Sbjct: 112 LAEVNKYSSTNAGTTELTKDSQIFQNPNEGKEQGDLVDYSWANIGSFDDLDRIFSNDDPI 171 Query: 424 FRCGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDP 245 F SL +AD+LWSSS +V +S KS + DSP+ G+ A S+ +V E Q + Sbjct: 172 FGNVSLGSADDLWSSSKEVTNSAAKSFPTTVDSPSLGLGALRSTSENLEVKREYEQQDNQ 231 Query: 244 SATCGRRVKSHPGSHVLMDVNLGKDKAAGKESCNFADPIIEHAAGKSMPIVDEKTDAKFV 65 T SH L V E A +S I++E+ + + Sbjct: 232 PFTLSYEKLDGSTSHGLHHV--------------------EFAGDESKSIIEEQMNVETR 271 Query: 64 EKTGAFNSHSQSKRKVDKN 8 KT A SH +++ + N Sbjct: 272 GKTSASKSHMVAEKVMAPN 290 >ref|XP_007011118.1| Agglutinin-like protein ALA1, putative isoform 1 [Theobroma cacao] gi|590569623|ref|XP_007011121.1| Agglutinin-like protein ALA1, putative isoform 1 [Theobroma cacao] gi|508728031|gb|EOY19928.1| Agglutinin-like protein ALA1, putative isoform 1 [Theobroma cacao] gi|508728034|gb|EOY19931.1| Agglutinin-like protein ALA1, putative isoform 1 [Theobroma cacao] Length = 705 Score = 153 bits (386), Expect = 1e-34 Identities = 107/319 (33%), Positives = 153/319 (47%), Gaps = 25/319 (7%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + IWG+ ESDDHIVPY +G+E K E SQE ++K +Q Sbjct: 1 MFDWNDEELTNIIWGEDGESDDHIVPYQEGSENC-------HSKKEWSQETATIKSTDQK 53 Query: 709 ISEAKIESLGTKLESSSRLNTHG-LPIPEFVMNSWVETEISGLDSAKGDH----SQLDNG 545 K++ G K+E SS N +G + F M SW E +S ++AK D S++ N Sbjct: 54 TPGDKVDLHGRKVEGSSNFNANGGIATSGFGMVSWPELSLS--NAAKTDQDSMGSEVSNH 111 Query: 544 LG--------------------LFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLV 425 L +F N E K+ G +D SWANIG+FDD DRIF N+D + Sbjct: 112 LAEVNKYSSTNAGTTELTKDSQIFQNPNEGKEQGDLVDYSWANIGSFDDLDRIFSNDDPI 171 Query: 424 FRCGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDP 245 F SL +AD+LWSSS +V +S KS + DSP+ G+ A S+ +V E Q + Sbjct: 172 FGNVSLGSADDLWSSSKEVTNSAAKSFPTTVDSPSLGLGALRSTSENLEVKREYEQQDNQ 231 Query: 244 SATCGRRVKSHPGSHVLMDVNLGKDKAAGKESCNFADPIIEHAAGKSMPIVDEKTDAKFV 65 T SH L V E A +S I++E+ + + Sbjct: 232 PFTLSYEKLDGSTSHGLHHV--------------------EFAGDESKSIIEEQMNVETR 271 Query: 64 EKTGAFNSHSQSKRKVDKN 8 KT A SH +++ + N Sbjct: 272 GKTSASKSHMVAEKVMAPN 290 >ref|XP_002303916.2| hypothetical protein POPTR_0003s19960g [Populus trichocarpa] gi|550343586|gb|EEE78895.2| hypothetical protein POPTR_0003s19960g [Populus trichocarpa] Length = 453 Score = 152 bits (385), Expect = 2e-34 Identities = 105/303 (34%), Positives = 148/303 (48%), Gaps = 20/303 (6%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + IWG+A +SDDHIVPYP+ +E+ K E ++E ++K +EQ Sbjct: 1 MFDWNDEELTNIIWGEADDSDDHIVPYPEASEDYCK-------KKESNEEASTIKSSEQK 53 Query: 709 ISEAKIESLGTKLESSSRLNT-HGLPIPEFVMNSW---------------VETEISG--L 584 AK+++ G KLES S ++T G M+ W +ET IS Sbjct: 54 APGAKVDTDGRKLESISNVDTSEGTSSLGLDMDRWPNLSSSNAAKTEQDSLETSISNNLT 113 Query: 583 DSAKGDHS--QLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVFRCGS 410 D K D S LD +F N +E K+ G F+D WA+IG+FDD DRIF N+D +F + Sbjct: 114 DITKLDSSADHLDKDTEIFQNSHEGKEQGDFVDYGWASIGSFDDLDRIFSNDDPIFGNVN 173 Query: 409 LDNADELWSSSADVFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDPSATCG 230 L NADELWSSS D+ +SP K +S S E ++D T G Sbjct: 174 LGNADELWSSSKDITNSPVKPFPISVAS-----------------REEYAQEEDRLFTLG 216 Query: 229 RRVKSHPGSHVLMDVNLGKDKAAGKESCNFADPIIEHAAGKSMPIVDEKTDAKFVEKTGA 50 + P SH L + D +E+ ++ PI+ E+TD V K A Sbjct: 217 YGKMNDPASHGLQNTQASLDH-------------VEYDEAENKPILKEQTDLAVVGKNTA 263 Query: 49 FNS 41 NS Sbjct: 264 ANS 266 >ref|XP_006420755.1| hypothetical protein CICLE_v10004446mg [Citrus clementina] gi|567855273|ref|XP_006420756.1| hypothetical protein CICLE_v10004446mg [Citrus clementina] gi|567855275|ref|XP_006420757.1| hypothetical protein CICLE_v10004446mg [Citrus clementina] gi|557522628|gb|ESR33995.1| hypothetical protein CICLE_v10004446mg [Citrus clementina] gi|557522629|gb|ESR33996.1| hypothetical protein CICLE_v10004446mg [Citrus clementina] gi|557522630|gb|ESR33997.1| hypothetical protein CICLE_v10004446mg [Citrus clementina] Length = 715 Score = 147 bits (371), Expect = 7e-33 Identities = 107/308 (34%), Positives = 150/308 (48%), Gaps = 26/308 (8%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + IWG+ +SDDHIVPY + NE+ +G K E SQE ++K AE+ Sbjct: 1 MFDWNDEELTNIIWGENGKSDDHIVPYQEKNED-----YGN--KKEWSQEATAIKPAERK 53 Query: 709 ISEAKIESLGTKLESSSRLNT-HGLPIPEFVMNSWVETEISGLDSAKGDH---------- 563 + E KI+ G KLE SS N+ + M SW +++S ++AK D Sbjct: 54 MPEVKIDFNGRKLECSSNFNSCERNSVSGIGMGSW--SDLSSSNAAKTDEEPMGTNVSNR 111 Query: 562 ---------------SQLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDL 428 + LD +F N + K G F+D SWANIG+FDD D+IF N+D Sbjct: 112 IAEIAKSNSTSNAVKADLDKDSEIFENPQDGKDQGDFVDYSWANIGSFDDLDQIFSNDDP 171 Query: 427 VFRCGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKD 248 +F SL A ELWSSS DV +SP KS +S D P + A + S+ V +E Q + Sbjct: 172 IFGNVSLGTA-ELWSSSNDVTNSPIKSFPLSEDHPNLELGALNNTSEHVVVKSEFEQQGN 230 Query: 247 PSATCGRRVKSHPGSHVLMDVNLGKDKAAGKESCNFADPIIEHAAGKSMPIVDEKTDAKF 68 S T + P +H L + D +E+A KSM ++TD Sbjct: 231 KSLTLSNGKLNDP-AHDLQSTHSTLDH-------------VEYAGVKSMSFEKDQTDLCT 276 Query: 67 VEKTGAFN 44 T A N Sbjct: 277 RRNTTASN 284 >ref|XP_006578483.1| PREDICTED: uncharacterized protein LOC100812174 isoform X8 [Glycine max] Length = 638 Score = 146 bits (368), Expect = 2e-32 Identities = 105/292 (35%), Positives = 150/292 (51%), Gaps = 16/292 (5%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + IWG+ ESDDHIVPYP+ NE+ + K E +QE + K E Sbjct: 1 MFDWNDEELANIIWGEGGESDDHIVPYPEVNEDVSN-------KKEWNQEAAATKLTELK 53 Query: 709 ISEAKIESLGTKLESSSRLNTHG-LPIPEFVMNSWVETEISGLDSAKGDHSQLDNGLG-- 539 EAK + KL SSS L+ G LP + N+W + +S SAK DH L+ Sbjct: 54 RPEAKTDFHERKLGSSSNLDNSGELPTSGYGTNAWPDLALSS--SAKIDHGSLEETTQHE 111 Query: 538 ----LFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVFRCGSLDNADELWSSSAD 371 +F N +E K+ G F+D WANIG+FDD DRIF N+D +F SLD+++ELWSS D Sbjct: 112 KHAEIFQNAHEGKEQGHFVDYGWANIGSFDDLDRIFSNDDPIFGHASLDSSNELWSSK-D 170 Query: 370 VFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDPSATCGRRVKSHPGSHVLM 191 V S+ L + D+P+S A + ++ ++ E + D S PGS V+ Sbjct: 171 V-SNNVAPLPL--DTPSSS-GALRNRTESLEIKEEYVQCSDESLDLSNEKIGGPGSQVIE 226 Query: 190 D-----VNLG----KDKAAGKESCNFADPIIEHAAGKSMPIVDEKTDAKFVE 62 + N+G + K GKE F + KS+ +E T F + Sbjct: 227 NSCTTTANVGNGGVRSKPTGKEQQVFRQKNLLKTWKKSLVKQEENTLQDFYD 278 >ref|XP_006494919.1| PREDICTED: uncharacterized protein LOC102616093 isoform X1 [Citrus sinensis] gi|568884497|ref|XP_006494920.1| PREDICTED: uncharacterized protein LOC102616093 isoform X2 [Citrus sinensis] gi|568884499|ref|XP_006494921.1| PREDICTED: uncharacterized protein LOC102616093 isoform X3 [Citrus sinensis] Length = 694 Score = 144 bits (364), Expect = 5e-32 Identities = 108/308 (35%), Positives = 156/308 (50%), Gaps = 13/308 (4%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + IWG+ +SDDHIVPY + NE+ +G K E SQE ++K AE+ Sbjct: 1 MFDWNDEELTNIIWGENGKSDDHIVPYQEENED-----YGN--KKEWSQEATAIKPAERK 53 Query: 709 ISEAKIESLGTKLESSSRLNT----HGLPIPEFVMNSWVETEISGLDSAKGDHSQLDNGL 542 + E KI+ G KLE SS N+ I V N E S ++ + LD Sbjct: 54 MPEVKIDFNGGKLECSSNFNSCERNSVSGIGTNVSNRIAEIAKSN-STSNAVKADLDKDS 112 Query: 541 GLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVFRCGSLDNADELWSSSADVFS 362 +F N + K G F+D SWANIG+FDD D+IF N+D +F SL A ELWSSS DV + Sbjct: 113 EIFENPQDGKDQGDFVDYSWANIGSFDDLDQIFSNDDPIFGNVSLGTA-ELWSSSNDVTN 171 Query: 361 SPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDPSATCGRRVKSHPGSHVLMDVN 182 SP KS +S D P + A + S+ V +E Q + S T + P +H L + Sbjct: 172 SPIKSFPLSEDHPNLELGALNNTSEHVVVKSEFEQQGNKSLTLSNGKLNDP-AHDLQSTH 230 Query: 181 LGKD--KAAGKESCNFADPIIE------HAAGKSMPIVDE-KTDAKFVEKTGAFNSHSQS 29 D + AG +S +F + A P+ + T KF K + + Sbjct: 231 STLDHVEYAGVKSTSFEKDQTDLCTRRNTTASNCGPVAEHVVTSEKFPNKGYKQKNPLRG 290 Query: 28 KRKVDKNA 5 +RK+++N+ Sbjct: 291 QRKLEENS 298 >ref|XP_006403566.1| hypothetical protein EUTSA_v10010203mg [Eutrema salsugineum] gi|557104685|gb|ESQ45019.1| hypothetical protein EUTSA_v10010203mg [Eutrema salsugineum] Length = 620 Score = 142 bits (359), Expect = 2e-31 Identities = 86/217 (39%), Positives = 121/217 (55%), Gaps = 23/217 (10%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDWDEEE+ + IWG E+DDHIVP+ +E+ K E S++ ++VK AEQ Sbjct: 1 MFDWDEEELTNMIWGDDGETDDHIVPFKLRSEQLN--------KKERSEDSKTVKPAEQE 52 Query: 709 ISEAKIESLGTKLESSSRLNT-HGLPIPEFVMNSWVETEISGL----------------- 584 I+ K + +KL SSS N +P P+F ++SW ++ +S Sbjct: 53 ITGTKNDLHESKLGSSSGHNVDEKIPQPDFCISSWPDSSLSNAREADPDSSATELSKCLP 112 Query: 583 -----DSAKGDHSQLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVFR 419 +SA+ S+L G +F + E K+ GGF + WANIG+FDD DR+F N+ +F Sbjct: 113 EPARYNSAREKTSELGKGPDIFHSTDESKEQGGFDEYGWANIGSFDDLDRMFSNDVPIFG 172 Query: 418 CGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGIN 308 GSL ADELWSSS DV +SP+KSL DS G++ Sbjct: 173 DGSLSGADELWSSSKDVLNSPSKSLPSILDSQDIGLD 209 >ref|XP_006403565.1| hypothetical protein EUTSA_v10010203mg [Eutrema salsugineum] gi|567186597|ref|XP_006403567.1| hypothetical protein EUTSA_v10010203mg [Eutrema salsugineum] gi|567186601|ref|XP_006403568.1| hypothetical protein EUTSA_v10010203mg [Eutrema salsugineum] gi|557104684|gb|ESQ45018.1| hypothetical protein EUTSA_v10010203mg [Eutrema salsugineum] gi|557104686|gb|ESQ45020.1| hypothetical protein EUTSA_v10010203mg [Eutrema salsugineum] gi|557104687|gb|ESQ45021.1| hypothetical protein EUTSA_v10010203mg [Eutrema salsugineum] Length = 621 Score = 142 bits (359), Expect = 2e-31 Identities = 87/218 (39%), Positives = 123/218 (56%), Gaps = 24/218 (11%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDWDEEE+ + IWG E+DDHIVP+ +E+ K E S++ ++VK AEQ Sbjct: 1 MFDWDEEELTNMIWGDDGETDDHIVPFKLRSEQLN--------KKERSEDSKTVKPAEQE 52 Query: 709 ISEAKIESLGTKLESSSRLNT-HGLPIPEFVMNSWVETEISGL----------------- 584 I+ K + +KL SSS N +P P+F ++SW ++ +S Sbjct: 53 ITGTKNDLHESKLGSSSGHNVDEKIPQPDFCISSWPDSSLSNAREADPDSSATELSKCLP 112 Query: 583 -----DSAKGDH-SQLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVF 422 +SA+G+ S+L G +F + E K+ GGF + WANIG+FDD DR+F N+ +F Sbjct: 113 EPARYNSARGEKTSELGKGPDIFHSTDESKEQGGFDEYGWANIGSFDDLDRMFSNDVPIF 172 Query: 421 RCGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGIN 308 GSL ADELWSSS DV +SP+KSL DS G++ Sbjct: 173 GDGSLSGADELWSSSKDVLNSPSKSLPSILDSQDIGLD 210 >ref|XP_006578482.1| PREDICTED: uncharacterized protein LOC100812174 isoform X7 [Glycine max] Length = 656 Score = 141 bits (356), Expect = 4e-31 Identities = 103/308 (33%), Positives = 153/308 (49%), Gaps = 32/308 (10%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + IWG+ ESDDHIVPYP+ NE+ + K E +QE + K E Sbjct: 1 MFDWNDEELANIIWGEGGESDDHIVPYPEVNEDVSN-------KKEWNQEAAATKLTELK 53 Query: 709 ISEAKIESLGTKLESSSRLNTHG-LPIPEFVMNSWVE----------------------T 599 EAK + KL SSS L+ G LP + N+W + + Sbjct: 54 RPEAKTDFHERKLGSSSNLDNSGELPTSGYGTNAWPDLALSSSAKIDHGSLGTEVSNNLS 113 Query: 598 EISGLDSAKGDHSQLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVFR 419 E+S L S++ + +Q + +F N +E K+ G F+D WANIG+FDD DRIF N+D +F Sbjct: 114 ELSKLSSSREETTQHEKHAEIFQNAHEGKEQGHFVDYGWANIGSFDDLDRIFSNDDPIFG 173 Query: 418 CGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDPSA 239 SLD+++ELWSS DV S+ L + D+P+S A + ++ ++ E + D S Sbjct: 174 HASLDSSNELWSSK-DV-SNNVAPLPL--DTPSSS-GALRNRTESLEIKEEYVQCSDESL 228 Query: 238 TCGRRVKSHPGSHVLMD-----VNLG----KDKAAGKESCNFADPIIEHAAGKSMPIVDE 86 PGS V+ + N+G + K GKE F + KS+ +E Sbjct: 229 DLSNEKIGGPGSQVIENSCTTTANVGNGGVRSKPTGKEQQVFRQKNLLKTWKKSLVKQEE 288 Query: 85 KTDAKFVE 62 T F + Sbjct: 289 NTLQDFYD 296 >ref|XP_006578476.1| PREDICTED: uncharacterized protein LOC100812174 isoform X1 [Glycine max] gi|571450621|ref|XP_006578477.1| PREDICTED: uncharacterized protein LOC100812174 isoform X2 [Glycine max] gi|571450623|ref|XP_006578478.1| PREDICTED: uncharacterized protein LOC100812174 isoform X3 [Glycine max] gi|571450625|ref|XP_006578479.1| PREDICTED: uncharacterized protein LOC100812174 isoform X4 [Glycine max] gi|571450628|ref|XP_006578480.1| PREDICTED: uncharacterized protein LOC100812174 isoform X5 [Glycine max] gi|571450630|ref|XP_006578481.1| PREDICTED: uncharacterized protein LOC100812174 isoform X6 [Glycine max] Length = 657 Score = 140 bits (354), Expect = 7e-31 Identities = 104/309 (33%), Positives = 154/309 (49%), Gaps = 33/309 (10%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + IWG+ ESDDHIVPYP+ NE+ + K E +QE + K E Sbjct: 1 MFDWNDEELANIIWGEGGESDDHIVPYPEVNEDVSN-------KKEWNQEAAATKLTELK 53 Query: 709 ISEAKIESLGTKLESSSRLNTHG-LPIPEFVMNSWVE----------------------T 599 EAK + KL SSS L+ G LP + N+W + + Sbjct: 54 RPEAKTDFHERKLGSSSNLDNSGELPTSGYGTNAWPDLALSSSAKIDHGSLGTEVSNNLS 113 Query: 598 EISGLDSAKGDHS-QLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVF 422 E+S L S++G+ + Q + +F N +E K+ G F+D WANIG+FDD DRIF N+D +F Sbjct: 114 ELSKLSSSRGEETTQHEKHAEIFQNAHEGKEQGHFVDYGWANIGSFDDLDRIFSNDDPIF 173 Query: 421 RCGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDPS 242 SLD+++ELWSS DV S+ L + D+P+S A + ++ ++ E + D S Sbjct: 174 GHASLDSSNELWSSK-DV-SNNVAPLPL--DTPSSS-GALRNRTESLEIKEEYVQCSDES 228 Query: 241 ATCGRRVKSHPGSHVLMD-----VNLG----KDKAAGKESCNFADPIIEHAAGKSMPIVD 89 PGS V+ + N+G + K GKE F + KS+ + Sbjct: 229 LDLSNEKIGGPGSQVIENSCTTTANVGNGGVRSKPTGKEQQVFRQKNLLKTWKKSLVKQE 288 Query: 88 EKTDAKFVE 62 E T F + Sbjct: 289 ENTLQDFYD 297 >ref|XP_002513079.1| conserved hypothetical protein [Ricinus communis] gi|223548090|gb|EEF49582.1| conserved hypothetical protein [Ricinus communis] Length = 735 Score = 139 bits (350), Expect = 2e-30 Identities = 97/267 (36%), Positives = 133/267 (49%), Gaps = 33/267 (12%) Frame = -2 Query: 853 IWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQNISEAKIESLGTK 674 IW +A ESDDHIVPYP E+ + + E SQE ++K EQ K++ G K Sbjct: 66 IWDEAGESDDHIVPYPGAVEDHSK-------EKEWSQETNNIKSEEQKAPGPKVDIHGRK 118 Query: 673 LESSSRLNT-HGLPIPEFVMNSWVE----------------------TEISGLDSAKGDH 563 LESSS N+ G F ++SW TEI+ L+S+ G Sbjct: 119 LESSSNFNSSEGASASGFGIDSWPNLSLSTAAKTDQDSLDASVSNNLTEITKLESSGGAE 178 Query: 562 S-QLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVFRCGSLDNADELW 386 + QLD +F + K+ G F+D WA+IG+FDD DR+F N+D +F SL N DELW Sbjct: 179 TVQLDKDSEIF---QKGKEQGDFVDYGWASIGSFDDLDRMFSNDDPIFGTVSLSNPDELW 235 Query: 385 SSSADVFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDPSATCGRRVKSHPG 206 SSS DV +SP S + DSPT G+ + S++ ++ E + + T G + P Sbjct: 236 SSSKDVTNSPGNSFRIYSDSPTLGLGPLRNTSERFEIKTEYVHDDNHPFTLGYGKVNDPA 295 Query: 205 SH-------VLMDVNL--GKDKAAGKE 152 SH VL V+ GK KA KE Sbjct: 296 SHGMQNASPVLNQVDFAGGKSKATLKE 322 >dbj|BAC41810.1| unknown protein [Arabidopsis thaliana] Length = 271 Score = 137 bits (344), Expect = 1e-29 Identities = 85/218 (38%), Positives = 117/218 (53%), Gaps = 24/218 (11%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW+EEE+ + IWG +E+ DHIVP+ +E+ + +++ K AEQ Sbjct: 1 MFDWEEEELTNMIWGDDAETGDHIVPFKVRSEQ-----------LNKKEQIEESKTAEQK 49 Query: 709 ISEAKIESLGTKLESSSRLNT-HGLPIPEFVMNSWVETEISGL----------------- 584 I+ KI+ L SSS N GLP P+F M+SW +T ++ Sbjct: 50 ITGTKIDLHDKNLGSSSSHNVDEGLPQPDFCMSSWPDTSLTNATKVDQDLSATELSKCLA 109 Query: 583 -----DSAKGDH-SQLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVF 422 DS +G+ S+L G +F + E K+ G F D SWANIG+FDD DR+F N+ +F Sbjct: 110 EPVRYDSTRGEKTSELGKGPDIFHSSDESKEQGDFDDYSWANIGSFDDLDRMFSNDVPIF 169 Query: 421 RCGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGIN 308 GSL DELWSSS DV +SP KSLS DS G++ Sbjct: 170 GDGSLSGGDELWSSSKDVSNSP-KSLSSMLDSQDLGLD 206 >ref|NP_191014.2| uncharacterized protein [Arabidopsis thaliana] gi|332645718|gb|AEE79239.1| uncharacterized protein AT3G54500 [Arabidopsis thaliana] Length = 648 Score = 137 bits (344), Expect = 1e-29 Identities = 84/217 (38%), Positives = 115/217 (52%), Gaps = 23/217 (10%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW+EEE+ + IWG +E+ DHIVP+ +E+ + +++ K AEQ Sbjct: 1 MFDWEEEELTNMIWGDDAETGDHIVPFKVRSEQ-----------LNKKEQIEESKTAEQK 49 Query: 709 ISEAKIESLGTKLESSSRLNT-HGLPIPEFVMNSWVETEISGL----------------- 584 I+ KI+ L SSS N GLP P+F M+SW +T ++ Sbjct: 50 ITGTKIDLHDKNLGSSSSHNVDEGLPQPDFCMSSWPDTSLTNATKVDQDLSATELSKCLA 109 Query: 583 -----DSAKGDHSQLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVFR 419 DS + S+L G +F + E K+ G F D SWANIG+FDD DR+F N+ +F Sbjct: 110 EPVRYDSTREKTSELGKGPDIFHSSDESKEQGDFDDYSWANIGSFDDLDRMFSNDVPIFG 169 Query: 418 CGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGIN 308 GSL DELWSSS DV +SP KSLS DS G++ Sbjct: 170 DGSLSGGDELWSSSKDVSNSP-KSLSSMLDSQDLGLD 205 >ref|XP_002877982.1| hypothetical protein ARALYDRAFT_485852 [Arabidopsis lyrata subsp. lyrata] gi|297323820|gb|EFH54241.1| hypothetical protein ARALYDRAFT_485852 [Arabidopsis lyrata subsp. lyrata] Length = 642 Score = 137 bits (344), Expect = 1e-29 Identities = 86/212 (40%), Positives = 115/212 (54%), Gaps = 18/212 (8%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW+EEE+ + IWG E+ DHIVP+ + N K E S+E ++ K AEQ Sbjct: 1 MFDWEEEELTNMIWGDDGETGDHIVPFKQLN------------KKEQSEETKTAKPAEQK 48 Query: 709 ISEAKIESLGTKLESSSRLNTH-GLPIPEFVMNSWVET-----------------EISGL 584 I+ K + KL S+S N G P P+F M+SW ++ E + Sbjct: 49 ITGTKTDLHDDKLGSTSGHNVDDGTPQPDFCMSSWSDSTKDDPDLSATQLSKCLAEPARY 108 Query: 583 DSAKGDHSQLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVFRCGSLD 404 DS + S+L G +F + E K+ G F D SWANIG+FDD DR+F N+ +F GSL Sbjct: 109 DSTREKTSELGKGPDIFHSSDESKEQGDFDDYSWANIGSFDDLDRMFSNDVPIFGDGSLS 168 Query: 403 NADELWSSSADVFSSPTKSLSMSGDSPTSGIN 308 DELWSSS DV +SP KSLS DS G++ Sbjct: 169 GGDELWSSSKDVSNSP-KSLSSMLDSQDLGLD 199 >emb|CAB77570.1| putative protein [Arabidopsis thaliana] Length = 649 Score = 137 bits (344), Expect = 1e-29 Identities = 85/218 (38%), Positives = 117/218 (53%), Gaps = 24/218 (11%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW+EEE+ + IWG +E+ DHIVP+ +E+ + +++ K AEQ Sbjct: 1 MFDWEEEELTNMIWGDDAETGDHIVPFKVRSEQ-----------LNKKEQIEESKTAEQK 49 Query: 709 ISEAKIESLGTKLESSSRLNT-HGLPIPEFVMNSWVETEISGL----------------- 584 I+ KI+ L SSS N GLP P+F M+SW +T ++ Sbjct: 50 ITGTKIDLHDKNLGSSSSHNVDEGLPQPDFCMSSWPDTSLTNATKVDQDLSATELSKCLA 109 Query: 583 -----DSAKGDH-SQLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVF 422 DS +G+ S+L G +F + E K+ G F D SWANIG+FDD DR+F N+ +F Sbjct: 110 EPVRYDSTRGEKTSELGKGPDIFHSSDESKEQGDFDDYSWANIGSFDDLDRMFSNDVPIF 169 Query: 421 RCGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGIN 308 GSL DELWSSS DV +SP KSLS DS G++ Sbjct: 170 GDGSLSGGDELWSSSKDVSNSP-KSLSSMLDSQDLGLD 206 >gb|EYU38268.1| hypothetical protein MIMGU_mgv1a003852mg [Mimulus guttatus] Length = 560 Score = 134 bits (336), Expect = 8e-29 Identities = 95/293 (32%), Positives = 132/293 (45%), Gaps = 24/293 (8%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MFDW++EE+ + +WG+A ESDDHIVPYP EEK +FG+ K E +Q+ ++ EQ Sbjct: 1 MFDWNDEELTNIVWGEARESDDHIVPYPGQIEEKPAVLFGDPSKKEINQQTTNISSVEQK 60 Query: 709 ISEAKIESLGTKLESSSRLNTHGLPIPEFVMNSWVETEISGLDSAKGDHSQLDNGLG--- 539 K E +L+SS + +T E V +E S ++AK D +D Sbjct: 61 KPTVKSE-YAVELDSSPKYDTC-----EPVTGVGPHSETSSSNAAKADPESIDVAASSNI 114 Query: 538 ---------------------LFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDLVF 422 +F N ED ++ F+D SWANIG+FDD DRIF N D +F Sbjct: 115 ANSSKNVSLRDETSQFAKESDIFENTPEDGEHSDFVDYSWANIGSFDDLDRIFSNNDPIF 174 Query: 421 RCGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKDPS 242 S N DELW SS DV SSP + GDS + A + ++ + + S Sbjct: 175 GDSSAGNVDELWPSSKDVTSSPLGPDPLCGDSFDLQLGALRTSIDESEIKDQHMLDPSQS 234 Query: 241 ATCGRRVKSHPGSHVLMDVNLGKDKAAGKESCNFADPIIEHAAGKSMPIVDEK 83 G + S DV D E++ GKS +V EK Sbjct: 235 FISGYEKLNVIASQAPEDVQASLDTN-------------EYSGGKSNLLVKEK 274 >ref|XP_006365448.1| PREDICTED: uncharacterized protein LOC102601920 isoform X3 [Solanum tuberosum] Length = 656 Score = 134 bits (336), Expect = 8e-29 Identities = 94/295 (31%), Positives = 141/295 (47%), Gaps = 26/295 (8%) Frame = -2 Query: 889 MFDWDEEEIGDSIWGQASESDDHIVPYPKGNEEKALGIFGEECKMECSQEVRSVKFAEQN 710 MF+W++EE+ D IWG+ ESDDHIVP+P G+E KA +G+ K + E ++K +Q Sbjct: 1 MFNWNDEELNDIIWGETGESDDHIVPHPDGSEGKAPA-YGDNIKKKWDVEASNIKPTDQK 59 Query: 709 ISEAKIESLGTKLESSSRLNTHGLPIPEFVMNSWVE------------------------ 602 K++ KL+ SS+ +T G P +E Sbjct: 60 KPTTKMDLSNIKLDGSSKHDTDG---PTMKAGCRIELGTDLSLTNATKSNQDSSGAEASN 116 Query: 601 --TEISGLDSAKGDHSQLDNGLGLFGNDYEDKQNGGFLDDSWANIGNFDDFDRIFRNEDL 428 TE+ + + + + L + +F + E+ + F+D WANIG+FDD D+IF N D Sbjct: 117 NLTEVPKYECLRDERNWLGDDSRVFHHQNEELEQSDFIDYGWANIGSFDDLDKIFSNNDP 176 Query: 427 VFRCGSLDNADELWSSSADVFSSPTKSLSMSGDSPTSGINAFGHVSQKCQVMAEALPQKD 248 +F SL N +LWSS DV SS KS+ +S DSP+ + + S++ +V AE ++ Sbjct: 177 LFGDTSLPNTLDLWSSCKDVTSSQDKSVPLSIDSPSLALGSLRIPSKRLKVRAEYRLDQE 236 Query: 247 PSATCGRRVKSHPGSHVLMDVNLGKDKAAGKESCNFADPIIEHAAGKSMPIVDEK 83 S T +VN D + C A EHA GKSM + EK Sbjct: 237 KSFTGDEE-----------NVN---DITSNVHLCTDAR---EHAGGKSMLLPKEK 274