BLASTX nr result
ID: Cinnamomum25_contig00012055
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum25_contig00012055 (1816 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_010263610.1| PREDICTED: uncharacterized protein LOC104601...   338   8e-90
ref|XP_010254775.1| PREDICTED: uncharacterized protein LOC104595...   336   3e-89
ref|XP_010113417.1| hypothetical protein L484_026751 [Morus nota...   326   4e-86
ref|XP_010940103.1| PREDICTED: uncharacterized protein LOC105058...   324   2e-85
ref|XP_007046876.1| Uncharacterized protein TCM_000342 [Theobrom...   321   1e-84
ref|XP_008784926.1| PREDICTED: uncharacterized protein LOC103703...   320   2e-84
ref|XP_002521549.1| conserved hypothetical protein [Ricinus comm...   320   2e-84
ref|XP_010906332.1| PREDICTED: uncharacterized protein LOC105033...   319   4e-84
ref|XP_002310256.1| hypothetical protein POPTR_0007s13180g [Popu...   314   2e-82
gb|KHG23299.1| Bacteriophage N4 adsorption B [Gossypium arboreum]     311   1e-81
ref|XP_008440744.1| PREDICTED: uncharacterized protein LOC103485...   309   5e-81
ref|XP_008795948.1| PREDICTED: uncharacterized protein LOC103711...   306   3e-80
ref|XP_011025832.1| PREDICTED: uncharacterized protein LOC105126...   305   1e-79
ref|XP_010029768.1| PREDICTED: uncharacterized protein LOC104419...   305   1e-79
gb|KHN00526.1| hypothetical protein glysoja_000194 [Glycine soja]     303   4e-79
ref|XP_003520299.1| PREDICTED: uncharacterized protein LOC100798...   302   7e-79
ref|XP_003517215.1| PREDICTED: uncharacterized protein LOC100792...   300   3e-78
gb|KHN36085.1| hypothetical protein glysoja_003208 [Glycine soja]     299   4e-78
gb|KHN11126.1| hypothetical protein glysoja_033680 [Glycine soja]     297   2e-77
gb|KEH20419.1| plant/F18B3-190 protein, putative [Medicago trunc...   296   3e-77

>ref|XP_010263610.1| PREDICTED: uncharacterized protein LOC104601826 [Nelumbo nucifera]
Length = 417
Score = 338 bits (867), Expect = 8e-90
Identities = 201/415 (48%), Positives = 258/415 (62%), Gaps = 4/415 (0%)
Frame = -1

Query: 1576 PTFTAIALDRLLEPRAQNSISRPLMPKPDMQNSSSEKKTRRTYVSPALYTTPEATPLPDS 1397
PTFTAI LDRLLEP S+ + L K + S+EK +R SP+LY TPEATPLPDS
Sbjct: 3 PTFTAITLDRLLEPGTPKSVPKLLNSKLQSRKPSTEKTIQRP--SPSLYATPEATPLPDS 60

Query: 1396 PVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKGDGKVMNLVSEVSS 1217
P SF PSPYIV+HKRRGPRLLK+ + D +V+ +V+
Sbjct: 61 PSSFAPSPYIVNHKRRGPRLLKTGYQDDASVKQATEEVKVDANERNVDIEVVGSKEDVTV 120

Query: 1216 PVMHSTSCEEVHVNGYNNRKPEDNILGDGVV-IDDSTKSFPAXXXXXXXXXXXXXXXDVM 1040
P M S C++ VNG+++ KP DG +DS+K A + M
Sbjct: 121 PPMVSNPCDDAVVNGFHDDKPGSCNSNDGFDGAEDSSKVDAAVDLEIEEGEDFFDPQESM 180

Query: 1039 SASSNTEVDDNTMTERSWKPVASPFGEYFDAYEELSIEGTPQSR-RNVDFELCEMRTNLL 863
S +SNT++++++ +R K + +P GE++DA+EELS E QS R+++ EL E+R NLL
Sbjct: 181 SFTSNTDLEESSGPKRPMK-LCTPMGEFYDAWEELSSEVGQQSGIRDIEAELREIRLNLL 239

Query: 862 MEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPVAP-AAENDGHTEIDPAEELCQQVF 686
EIERRKQ EEAL +MQ QWQR+ QQLS+VGL LP A AA D + D E+LCQQ++
Sbjct: 240 TEIERRKQAEEALSNMQRQWQRIGQQLSLVGLRLPTASIAAAEDEDLDFDLGEDLCQQMY 299

Query: 685 IAQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCDRLHYYETVNREMSQRNQETVEX 506
IA+ V+NSVGRG+ARAE+E EMES IE KN EI RL DRLHYYE VN EMSQRNQE +E
Sbjct: 300 IARFVSNSVGRGSARAEIEEEMESQIELKNFEIARLWDRLHYYEAVNHEMSQRNQEAIEM 359

Query: 505 XXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPTAKESIST-RSDAPAGGAA 344
K +W +G AI +G+AALAWSYLP ++ S +T SDAP G A
Sbjct: 360 ARQRRQRRKRRRKLVWGLVGTAIIVGAAALAWSYLPASRGSSTTNHSDAPRGDDA 414

>ref|XP_010254775.1| PREDICTED: uncharacterized protein LOC104595646 [Nelumbo nucifera]
Length = 405
Score = 336 bits (862), Expect = 3e-89
Identities = 194/415 (46%), Positives = 263/415 (63%), Gaps = 3/415 (0%) Frame = -1 Query: 1576 PTFTAIALDRLLEPRAQNSISRPLMPKPDMQNSSSEKKTRRTYVSPALYTTPEATPLPDS 1397 PT TA+ALDRLLE A SI +PL KPD S +EK+T +SP LY TP TPLPDS Sbjct: 3 PTLTAVALDRLLEHGAPKSIPKPLNTKPD---SRTEKRTHLPQISPTLYATPVPTPLPDS 59 Query: 1396 PVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKGDGKVMNLVSEVSS 1217 P SFPPSPYIV+HKRRGPRLLKS + +G + + KV + + Sbjct: 60 PSSFPPSPYIVNHKRRGPRLLKSFVQDDVSLQQ---------QGTE-EMKVDTNGNNMDI 109 Query: 1216 PVMHSTSCEEVHVNGYNNRKPEDNILGDGVVIDDSTKSFPAXXXXXXXXXXXXXXXDVMS 1037 V+ +TS +EV VNG+ + +P + L DG+ + + + A + MS Sbjct: 110 EVVETTSAKEVVVNGFRDGEPGNRNLNDGLGVTEDSSKVGATVDGRDECEDFFDPQESMS 169 Query: 1036 ASSNTEVDDNTMTERSWKPVASPFGEYFDAYEELSIEGTPQSR-RNVDFELCEMRTNLLM 860 +SN + +D TER K + +P GE++DA++ELS EG QS +++ EL ++R NLL Sbjct: 170 FTSNIDEEDTRGTERPLK-LTTPMGEFYDAWDELSSEGGRQSSLSDIEAELRDIRLNLLS 228 Query: 859 EIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPVAPAAENDG-HTEIDPAEELCQQVFI 683 EIE+RKQ EEAL +MQ QWQR+ QQLS+VGL+LP A ++ +G +++ D E+LCQQ+ + Sbjct: 229 EIEKRKQAEEALSNMQRQWQRIGQQLSLVGLTLPTASSSAVEGENSDFDLGEDLCQQICV 288 Query: 682 AQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCDRLHYYETVNREMSQRNQETVEXX 503 A+ V+ S+GRG+A+AE E +MES IE KN EI RL DRLHYYE VN EMSQRNQE +E Sbjct: 289 ARFVSQSIGRGSAKAEAEEKMESQIELKNFEIARLWDRLHYYEAVNHEMSQRNQEAIEMV 348 Query: 502 XXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPTAK-ESISTRSDAPAGGAAA 341 +WIWSS+G+ I++G+AALAWS + ++ SI+ RSDAP AA Sbjct: 349 RQRRQRRKRRWRWIWSSVGITISVGAAALAWSCVAASRGSSIANRSDAPGCDDAA 403 >ref|XP_010113417.1| hypothetical protein L484_026751 [Morus notabilis] gi|587949256|gb|EXC35444.1| hypothetical protein L484_026751 [Morus notabilis] Length = 462 Score = 326 bits (835), Expect = 4e-86 Identities = 203/466 (43%), Positives = 263/466 (56%), Gaps = 53/466 (11%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSISRPL---MPKP--------DMQNSSS--EKKTRRTYVSP 1439 MPTFTAIALD LLEP A S+ + + +PKP + +NS+S E+KT R ++P Sbjct: 1 MPTFTAIALDTLLEPGASKSVDKSVPRPVPKPRPGPNSKLERRNSTSVAERKTNRPQITP 60 Query: 1438 
ALYTTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXE-RGV 1262 ALY TPEATPLPDSP SFPPSPYI++HKRRGPRLLKSSS G Sbjct: 61 ALYATPEATPLPDSPTSFPPSPYIINHKRRGPRLLKSSSESNVLARQKVQDEQKVNVDGK 120 Query: 1261 KGDGKVMNLVSEVSSPVMHSTSCE----EVHVNGYNNRKPEDNILGDG------------ 1130 + K NL+ S + ST+ E E +NG ++ + +L +G Sbjct: 121 DAETKATNLMENDS---VTSTNAELLIKEQLMNGCHDCGSSNGVLENGRAETESRNGKLG 177 Query: 1129 ----------------------VVIDDSTKSFPAXXXXXXXXXXXXXXXDVMSASSNTEV 1016 V++ DS+K P MS +SNT+ Sbjct: 178 TGNEELGNGKVEHGSSNFSNAVVIVHDSSKLAPTPERESEREDFYDPQES-MSVTSNTDA 236 Query: 1015 DDNTMTERSWKPVASPFGEYFDAYEELSIEGTPQSR-RNVDFELCEMRTNLLMEIERRKQ 839 +DN ERS + +P GE+FDA+EELS EG QS +V+ EL EMR +LLMEIE+RKQ Sbjct: 237 EDNAEGERSAQ-FTTPMGEFFDAWEELSSEGGQQSALHDVEAELREMRLSLLMEIEKRKQ 295 Query: 838 VEEALYSMQSQWQRLAQQLSVVGLSLPVAPAAENDGHTEIDPAEELCQQVFIAQVVANSV 659 EEAL +M+ QW+ + QQLS+VGL+LP AE + DP E+LC+QV++A+ VANS+ Sbjct: 296 AEEALSNMRKQWESIRQQLSLVGLTLPAEVPAEGREQPDSDPGEKLCRQVYLARFVANSI 355 Query: 658 GRGAARAEVELEMESHIEAKNIEINRLCDRLHYYETVNREMSQRNQETVEXXXXXXXXXX 479 GRG ARAE+E EMES IEAKN EI RLCD+L YE +N+EM QRNQ+ +E Sbjct: 356 GRGLARAELEAEMESQIEAKNFEITRLCDKLRNYEAMNQEMVQRNQDVLEMARRERVRKE 415 Query: 478 XXXKWIWSSIGVAIALGSAALAWSYLPTAKESISTRSDAPAGGAAA 341 +WIW SI A+ LG+A LAWSYLP+ S S+ P A Sbjct: 416 RRQRWIWGSIAAALTLGAAGLAWSYLPSGTGSPKCDSEVPQSNDGA 461 >ref|XP_010940103.1| PREDICTED: uncharacterized protein LOC105058764 isoform X1 [Elaeis guineensis] Length = 421 Score = 324 bits (830), Expect = 2e-85 Identities = 204/421 (48%), Positives = 250/421 (59%), Gaps = 21/421 (4%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPR-AQNSISRPLMPKPDMQNS----SSEKKTRRTYVSPALYTTPEA 1415 MPTFTAI LDRLLEP ++N RP + ++ + S +K R SPALY TPE Sbjct: 1 MPTFTAIVLDRLLEPSPSRNPALRPPLAPVKVEKAPPSPSGKKNIRCPIASPALYATPEN 60 Query: 1414 TPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKGDGKVMNL 1235 TPLPDSP SFPPSPYI++HKRRGPRLLKS S+ ++ +GK + Sbjct: 61 TPLPDSPSSFPPSPYIINHKRRGPRLLKSFSQNDVATSQPPPPAEVEKKVDMVNGKGV-- 118 Query: 1234 
VSEVSSPVMHSTSCEEVH----VNGY--------NNRKPEDNILGDG-VVIDDSTKSFPA 1094 E ++ H E H NG +++K +D L DG V + K Sbjct: 119 --EGTANGFHDKKLEREHKAVDANGTQGASLSIEHHKKFQDAWLSDGPVAATEVAKPVVF 176 Query: 1093 XXXXXXXXXXXXXXXDVMSASSNTEVDDNTMTERSWKPVASPFGEYFDAYEELSIEGTPQ 914 D +S +SN E+ + WKP ++P GEYFDA+EE+S EG Q Sbjct: 177 DPEKDGENDDFFDLQDSLSTTSNMELCER------WKP-STPLGEYFDAFEEISSEGASQ 229 Query: 913 SR-RNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPVAPAA-- 743 S +NV+ EL EMR NLLMEIERRKQ EEAL ++QSQWQ L+Q LS+ GLSLP PA Sbjct: 230 SSYQNVENELREMRLNLLMEIERRKQAEEALENLQSQWQMLSQHLSLAGLSLPSPPAMTD 289 Query: 742 ENDGHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCDRLH 563 E D + IDPAEELC+Q+ IA VA SVGRG +RAEVELE+E IEAKN EI RL DRLH Sbjct: 290 EKDEKSCIDPAEELCRQIVIAHFVAASVGRGCSRAEVELELEPQIEAKNFEIARLWDRLH 349 Query: 562 YYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPTAKES 383 YYE NREMSQRNQE VE KWIW SIG+A+ L +AA+AWSYLP + S Sbjct: 350 YYEAANREMSQRNQEAVEMARQQRHKQKSRQKWIWGSIGLAVTLSAAAIAWSYLPVSNPS 409 Query: 382 I 380 + Sbjct: 410 L 410 >ref|XP_007046876.1| Uncharacterized protein TCM_000342 [Theobroma cacao] gi|508699137|gb|EOX91033.1| Uncharacterized protein TCM_000342 [Theobroma cacao] Length = 475 Score = 321 bits (822), Expect = 1e-84 Identities = 207/467 (44%), Positives = 263/467 (56%), Gaps = 63/467 (13%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSISR------PLMPKP--------DMQNSSS--EKKTRRTY 1448 MPTF+AIALDR LEP S+ + P +P P + +NS+S E+K R Sbjct: 1 MPTFSAIALDRFLEPGTSKSVDKSGPNLKPPIPTPKPITNSKLERRNSTSVTERKVNRPQ 60 Query: 1447 VSPALYTTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKS------SSRXXXXXXXXXXX 1286 +SPALY TPEATPLPDSP SFPPSPYI++HKRRGPRLLKS SSR Sbjct: 61 ISPALYATPEATPLPDSPSSFPPSPYIINHKRRGPRLLKSFSEDNVSSRKKALEENEVNG 120 Query: 1285 XXXXER--------------------------GVKGDGKVMNLVSEVSSPV-------MH 1205 G+ G K+ + P+ +H Sbjct: 121 IAKLAETKSVDSLKDAVTFSIPEPNEEEHGNDGLNGSMKMEQANGVTNGPIKLEQANGLH 180 Query: 1204 STSCEEVHVNGYN-------NRKPEDNILGDGVVIDDSTKSFPAXXXXXXXXXXXXXXXD 1046 S ++ H+NG + NR+ + + +G+ DS P + Sbjct: 181 
GGSIQDEHMNGAHAGEFGSSNREVGSSQMSNGLA-RDSAVLVPLDLDRCGDSEDFFDPNE 239 Query: 1045 VMSASSNTEVDDNTMTERSWKPVASPFGEYFDAYEELSIEGTPQSR-RNVDFELCEMRTN 869 MS +SNTE DD+T E + + +A+P E+FDAY+ELS E PQS R++D EL E+R Sbjct: 240 SMSVTSNTEGDDDTGAESAAR-LATPRVEFFDAYDELSSESGPQSLLRDIDAELREIRLT 298 Query: 868 LLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPVAPAAENDGHTEIDPAEELCQQV 689 LLMEIE+RKQ EEAL M+ +WQR++Q+L+V GLSLPV P + I PAEEL QQV Sbjct: 299 LLMEIEKRKQAEEALNKMRCKWQRISQELAVEGLSLPVDPIDVTEDELMI-PAEELRQQV 357 Query: 688 FIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCDRLHYYETVNREMSQRNQETVE 509 +A+ V+ S+GRG ARAE+E+EME+ IE+KN EI RL DRLHYYE VNREMSQRNQE VE Sbjct: 358 GVARFVSLSLGRGIARAEMEMEMEAQIESKNFEIARLWDRLHYYEAVNREMSQRNQEAVE 417 Query: 508 XXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPTAKESISTRS 368 +W+W SI AI LG+AALAWSYLPT K S ST S Sbjct: 418 MARRDRQRKNKRQRWVWGSIAAAITLGTAALAWSYLPTGKGSSSTSS 464 >ref|XP_008784926.1| PREDICTED: uncharacterized protein LOC103703745 [Phoenix dactylifera] gi|672123150|ref|XP_008784927.1| PREDICTED: uncharacterized protein LOC103703745 [Phoenix dactylifera] Length = 417 Score = 320 bits (820), Expect = 2e-84 Identities = 203/417 (48%), Positives = 251/417 (60%), Gaps = 18/417 (4%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRA-QNSISRPLMPKPDMQN---SSSEKKT-RRTYVSPALYTTPEA 1415 MPTFTAIALDRLLEP A +N RP + ++ S SEKK+ R VSPALY TPE Sbjct: 1 MPTFTAIALDRLLEPGASRNPTMRPPLAPGKVEKAPPSPSEKKSIPRPNVSPALYATPET 60 Query: 1414 TPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKGDGKVMNL 1235 TPLPDSP SFPPSPYI++HKRRGPRLLKS S+ ++ +GK + Sbjct: 61 TPLPDSPSSFPPSPYIINHKRRGPRLLKSFSQNDVAGSQPLPPPEVEKKIAMVNGKGV-- 118 Query: 1234 VSEVSSPVMHSTSC--EEVHVNGYNNRKPEDNI-LGDGVVIDDST-------KSFPAXXX 1085 E ++ H E+ V+G R +I L G + DD + + Sbjct: 119 --EETANGFHDEKLKGEQKDVDGNGTRGESVSIELHQGKLQDDGSIAAKEVARPVAVDLE 176 Query: 1084 XXXXXXXXXXXXDVMSASSNTEVDDNTMTERSWKPVASPFGEYFDAYEELSIEGTPQSR- 908 D +S +SNTE+D+ WKP ++P GEYFDA+EE+S EG QS Sbjct: 177 KDGESEDFFDLQDSLSTTSNTELDER------WKP-STPLGEYFDAFEEISSEGASQSAC 229 Query: 907 
RNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPVAPAA--END 734 NV+ EL EMR NLL EIERRKQ EE L S+Q+QWQ L+ LS+VGL LP P+ E D Sbjct: 230 LNVEDELHEMRLNLLSEIERRKQAEETLKSLQNQWQMLSHHLSLVGLRLPDPPSMTEEKD 289 Query: 733 GHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCDRLHYYE 554 + DPAEELCQQ+ IA+ VA +GRG +RAEVE ++E IEAKN EI RLCDRLHYYE Sbjct: 290 EQSCADPAEELCQQIVIARFVAACLGRGCSRAEVE-QLEPQIEAKNFEIARLCDRLHYYE 348 Query: 553 TVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPTAKES 383 NREMSQRNQE +E KWIW SIG+A++LG+AA+AWSY P +K S Sbjct: 349 AANREMSQRNQEAIEMARQQRHRRKKRQKWIWGSIGLAVSLGAAAIAWSYFPVSKPS 405 >ref|XP_002521549.1| conserved hypothetical protein [Ricinus communis] gi|223539227|gb|EEF40820.1| conserved hypothetical protein [Ricinus communis] Length = 475 Score = 320 bits (820), Expect = 2e-84 Identities = 205/474 (43%), Positives = 264/474 (55%), Gaps = 61/474 (12%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPR----------AQNSISRPLMP---KP------DMQNS--SSEKK 1463 MPTFTAIALDRLLEP + N +++P +P KP + +NS S+E+K Sbjct: 1 MPTFTAIALDRLLEPGTSKSADKSVPSSNPVTKPKLPPKSKPVPKSNLERRNSIASTERK 60 Query: 1462 TRRTYVSPALYTTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXX 1283 R +SPALY TPEATPLPDSP SFPPSPYI++HKRRGPRLLKS S Sbjct: 61 VSRPQISPALYATPEATPLPDSPSSFPPSPYIINHKRRGPRLLKSFSEDDVASRRKNLDE 120 Query: 1282 XXXE-RGVKGDGKVMNLVSEVSSPVMHSTSCEEVHVNGYNNRK-----PEDNILGDGVV- 1124 R + +V+N S + S EE NG + P+D+ V Sbjct: 121 EKINGRATNAENEVVNSTEGHSVTFSIANSVEERQSNGVRDSPQKQEFPDDSFEASSVKE 180 Query: 1123 ---------IDDSTKSFPAXXXXXXXXXXXXXXXDV-------------------MSASS 1028 + DS F + V MS +S Sbjct: 181 HMNGLCCSELGDSNGEFESRIARKGWANENDVTKLVSLNSERDGESEDFFDPQESMSYTS 240 Query: 1027 NTEVDDNTMTERSWKPVAS-PFGEYFDAYEELSIEGTPQSR-RNVDFELCEMRTNLLMEI 854 NT+ +DN E S K A+ P GE++DA+EELS E QS R+++ EL EMR +LL+EI Sbjct: 241 NTDGEDNCGVESSIKLAATTPVGEFYDAWEELSSESGQQSSFRDIEAELREMRLSLLVEI 300 Query: 853 ERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPVAPAAENDGHTEID--PAEELCQQVFIA 680 E+RKQ EE L + Q+ WQR+ +QL++VGL+LP P A+ +G +D PAEELCQQV++A Sbjct: 301 
EKRKQAEETLNNAQNHWQRMREQLALVGLTLPAFPFADPEGELSLDTDPAEELCQQVYLA 360 Query: 679 QVVANSVGRGAARAEVELEMESHIEAKNIEINRLCDRLHYYETVNREMSQRNQETVEXXX 500 + V++S+GRG A+AE E+E E+ IEAKN EI RL DRLHYYE +NREMSQRNQE VE Sbjct: 361 RFVSDSIGRGMAKAEAEMEKEAQIEAKNFEIARLVDRLHYYEAMNREMSQRNQEAVEMAR 420 Query: 499 XXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPTAK-ESISTRSDAPAGGAAA 341 +W+W SI + LG+AALAWSYLP K S S+ S AP G A Sbjct: 421 RNRQVRKGRQRWVWGSIATVVTLGTAALAWSYLPATKGSSSSSDSLAPEHGDGA 474 >ref|XP_010906332.1| PREDICTED: uncharacterized protein LOC105033294 [Elaeis guineensis] gi|743871548|ref|XP_010906333.1| PREDICTED: uncharacterized protein LOC105033294 [Elaeis guineensis] Length = 421 Score = 319 bits (818), Expect = 4e-84 Identities = 196/413 (47%), Positives = 241/413 (58%), Gaps = 14/413 (3%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRA-QNSISRPLMPKPDMQNS----SSEKKTRRTYVSPALYTTPEA 1415 MPTFTAIALDRLLEP A +N+ +P + ++ + S +K RR VSPALY TPE Sbjct: 1 MPTFTAIALDRLLEPGASRNTTMKPPLAPVKLEKAPPSPSGKKSNRRPNVSPALYATPET 60 Query: 1414 TPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKGDGKVMNL 1235 TPLPDSP S+PPSPYI++HKRRGPRLLKS S+ ++ +GK Sbjct: 61 TPLPDSPSSYPPSPYIINHKRRGPRLLKSFSQNDVAGSLPAPPPEVEKKIEMVNGKG--- 117 Query: 1234 VSEVSSPVMHSTSCEEVHVNGYNNRKPEDNI------LGDGVVIDDSTKSFPAXXXXXXX 1073 V E S E+ V+G +R +I L D D S S Sbjct: 118 VEETSGFHDEKLEEEQKDVDGNGSRGESVSIELHQGKLQDVGFSDGSIASKEVAKPVAVD 177 Query: 1072 XXXXXXXXDVMSASSNTEVDDNTMTERSWKPVASPFGEYFDAYEELSIEGTPQSR-RNVD 896 D + NT E WKP ++P GEYFDA+E++S EG QS N + Sbjct: 178 PEKDGENEDFFDPQDSLSTSSNTDLEERWKP-STPLGEYFDAFEDISSEGASQSACLNDE 236 Query: 895 FELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPVAPAA--ENDGHTE 722 EL EMR NLL+EIERRKQ EEAL ++Q+QWQ L+ LS+VGL LP P+ E D + Sbjct: 237 DELHEMRLNLLLEIERRKQAEEALKNLQNQWQMLSHHLSLVGLRLPDPPSMTEEKDEQSC 296 Query: 721 IDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCDRLHYYETVNR 542 +DPAEELCQQ+ IA+ VA +GRG +RAE E E+E IEAKN EI RL DRLHYYE NR Sbjct: 297 VDPAEELCQQIVIARFVAACLGRGCSRAEAEQELEPQIEAKNFEIARLWDRLHYYEAANR 356 Query: 541 
EMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPTAKES 383 EMSQRNQE VE KWIW SIG+A+ LG+AA+AWSY P +K S Sbjct: 357 EMSQRNQEAVEIARQQRNRRKKRQKWIWGSIGLAVTLGAAAIAWSYFPVSKPS 409 >ref|XP_002310256.1| hypothetical protein POPTR_0007s13180g [Populus trichocarpa] gi|222853159|gb|EEE90706.1| hypothetical protein POPTR_0007s13180g [Populus trichocarpa] Length = 413 Score = 314 bits (804), Expect = 2e-82 Identities = 198/427 (46%), Positives = 250/427 (58%), Gaps = 20/427 (4%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSISRPLMP-KPDMQNSSSEK---------KTRRTYVSPALY 1430 MP FTA+ALDRLLEP A S+ P+ KP + NS+ E+ K R +SP LY Sbjct: 1 MPHFTALALDRLLEPGASKSVDMPVPKLKPPLPNSNLERRNSTSVIERKGNRPQISPGLY 60 Query: 1429 TTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVK-GD 1253 TPE+TPLPDSP SFPPSPYI++HKRRGPRL KS S V G+ Sbjct: 61 ATPESTPLPDSPTSFPPSPYIINHKRRGPRLSKSFSDDDVASRKKKLEKLEVNGNVNNGE 120 Query: 1252 GKVMNLVSEVSSPVMHSTSCEEVHVNGYNNR-KP-EDNILGDGVVIDDSTKSFPAXXXXX 1079 KV V SS V T ++ + KP E N+ +G DS F Sbjct: 121 NKV---VDSRSSSVQLGTGDTRKDLSLEKDMLKPIEQNVERNG----DSDDFFDPQDS-- 171 Query: 1078 XXXXXXXXXXDVMSASSNTEVDDNTMTERSWKPVAS-PFGEYFDAYEELSIEGTPQ---S 911 MS +SNT+V+D T E S K A+ P GE++DA+EELS E Q S Sbjct: 172 ------------MSYTSNTDVEDTTAVESSMKLTAALPVGEFYDAWEELSSESGQQPSPS 219 Query: 910 RRNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPVAPA--AEN 737 + EL EMR +LLMEIE+RKQ EEAL +MQSQWQR+ Q+L++VGLSLP P E+ Sbjct: 220 PHHNGAELREMRLSLLMEIEKRKQAEEALDNMQSQWQRIRQELALVGLSLPACPVDVPES 279 Query: 736 DGHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCDRLHYY 557 D ++++P EE+CQQ+++A+ V+ S+GRG A+AE E+EME+ +EAKN EI RL DRLHYY Sbjct: 280 DQPSDVNPVEEICQQIYLARFVSESIGRGIAKAEAEIEMEAQVEAKNFEIARLLDRLHYY 339 Query: 556 ETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPT-AKESI 380 E VNRE+SQ NQE +E KW+W SI AI LG LAWSYLP + S Sbjct: 340 EAVNRELSQWNQEVIETARRNRQIRKRRQKWVWGSIAAAITLGMTTLAWSYLPAMSGSSS 399 Query: 379 STRSDAP 359 S+ S AP Sbjct: 400 SSDSHAP 406 >gb|KHG23299.1| Bacteriophage N4 adsorption B [Gossypium arboreum] Length = 459 
Score = 311 bits (796), Expect = 1e-81 Identities = 196/446 (43%), Positives = 255/446 (57%), Gaps = 45/446 (10%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSI------SRPLMPKPD------MQNSSSEK---KTRRTYV 1445 MPTFTAIALDRL+EP S+ S+P +P P M+ SSS K R + Sbjct: 1 MPTFTAIALDRLIEPGPSRSVNNSGPNSKPPIPNPKPIPSTKMKRSSSTSVTSKVNRPQI 60 Query: 1444 SPALYTTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERG 1265 SPALY TPEATPLPDSP SFPPSPYI++HKRRGPRLLKS S G Sbjct: 61 SPALYATPEATPLPDSPSSFPPSPYIINHKRRGPRLLKSFSEDNVSSCEKKAHEEDEVNG 120 Query: 1264 VK--GDGKVMNLVSEVS-------------------SPV-------MHSTSCEEVHVNGY 1169 +G ++L+ + S P+ + S +E H+NG+ Sbjct: 121 NAKLAEGNSVDLLKDCSVTFSIHEPNEEEHENGAHNGPINVERPNSVRGGSIKEEHMNGF 180 Query: 1168 NNRKPEDNILGDGVVIDDST-KSFPAXXXXXXXXXXXXXXXDVMSASSNTEVDDNTMTER 992 ++ + + + +G+ ID S K + MS +SNTE D+ E Sbjct: 181 HDGEVGSSQMNNGLAIDASVLKPGALNLEKGGDSEDFFDPNESMSVASNTEGGDDAAAES 240 Query: 991 SWKPVASPFGEYFDAYEELSIEGTPQSR-RNVDFELCEMRTNLLMEIERRKQVEEALYSM 815 + + A+ E+FDA++ELS E PQS +++ EL E+R +LL EIE+RKQ EEAL M Sbjct: 241 AAR-FATQGVEFFDAWDELSSESLPQSGPHDIEAELREIRLSLLTEIEKRKQAEEALNKM 299 Query: 814 QSQWQRLAQQLSVVGLSLPVAPAAENDGHTEIDPAEELCQQVFIAQVVANSVGRGAARAE 635 QS+W+R+ Q+ VGLSLPV P + ++PAEEL QQ+ IA+ V+ S+GRG A+AE Sbjct: 300 QSKWRRIGQEFGDVGLSLPVDPFVVTEDEL-VNPAEELRQQMGIARFVSLSMGRGIAKAE 358 Query: 634 VELEMESHIEAKNIEINRLCDRLHYYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWS 455 +E EME+ IE+KN EI RL DRLHYYE VNREMSQRNQE VE +W+W Sbjct: 359 LETEMEAQIESKNFEIARLLDRLHYYEAVNREMSQRNQEAVEMARRERQRKKRKQRWVWG 418 Query: 454 SIGVAIALGSAALAWSYLPTAKESIS 377 S+ AI LG+AALAWSYLPT KES S Sbjct: 419 SVATAITLGAAALAWSYLPTGKESSS 444 >ref|XP_008440744.1| PREDICTED: uncharacterized protein LOC103485065 [Cucumis melo] Length = 455 Score = 309 bits (791), Expect = 5e-81 Identities = 196/450 (43%), Positives = 251/450 (55%), Gaps = 46/450 (10%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSISRPL-MPKP------------DMQNSSS--EKKTRRTYV 1445 MPTFT IALDRLLEP SI + L PKP + +NS+S ++K +R + Sbjct: 1 
MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60 Query: 1444 SPALYTTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERG 1265 PALYTTPEATPLPDSP SFPPSPYIV+HKRRGPRLLKS S Sbjct: 61 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDVSHKKMNDKDVGNGSV 120 Query: 1264 VKGDGKVMNLVS----EVSSPV---------MHSTSCEEVHVNG---------------- 1172 + DG + L V++P+ + S + NG Sbjct: 121 ERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVSS 180 Query: 1171 YNNRKPEDNIL-GDGVVIDDSTKSFPAXXXXXXXXXXXXXXXDVMSASSNTEVDDNTMTE 995 +NN E +IL G+ + + + D +S +SNT+ +DN E Sbjct: 181 HNNH--ESSILTSSGIAQEKDSLKVVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGF-E 237 Query: 994 RSWKPVASPFGEYFDAYEELSIEGTPQSRRNVDFELCEMRTNLLMEIERRKQVEEALYSM 815 RS K +P GE++DA+EELS EG PQ + D E + LLMEIE++KQ EEAL + Sbjct: 238 RSAK-FGTPMGEFYDAWEELSSEGVPQPSIS-DIEPDQREMRLLMEIEKQKQAEEALNKL 295 Query: 814 QSQWQRLAQQLSVVGLSLPVAPAAENDG-HTEIDPAEELCQQVFIAQVVANSVGRGAARA 638 Q QWQRL +QL +VGL+LP P +G + DPAEELCQQV +A+ V+ S+G+G ARA Sbjct: 296 QCQWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARA 355 Query: 637 EVELEMESHIEAKNIEINRLCDRLHYYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIW 458 EVE EME+ +E KN EI RL DRLHYYE VN EMSQRNQE V+ +WIW Sbjct: 356 EVEAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIW 415 Query: 457 SSIGVAIALGSAALAWSYLPTAKESISTRS 368 S+ AI LG+A LAWSYLP+ K+ S+ + Sbjct: 416 GSVATAITLGTAVLAWSYLPSGKDLPSSNN 445 >ref|XP_008795948.1| PREDICTED: uncharacterized protein LOC103711546 [Phoenix dactylifera] Length = 421 Score = 306 bits (785), Expect = 3e-80 Identities = 197/420 (46%), Positives = 242/420 (57%), Gaps = 21/420 (5%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRA-QNSISRPLMPKPDMQNS----SSEKKTRRTYVSPALYTTPEA 1415 MPTFTAIALD LLEP A +N RP ++ + S +K V PALY TPE Sbjct: 1 MPTFTAIALDGLLEPSASRNPTLRPPPVPVKVEKAPPIPSGKKSIPCPNVLPALYATPET 60 Query: 1414 TPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKGDGKVMNL 1235 T LPD P SFPPSPYI++HKRRGP LLKS S+ ++ +GK Sbjct: 61 TLLPDMPSSFPPSPYIINHKRRGPGLLKSLSQNDVAGSQMPPPEEVEKKAEMVNGKG--- 117 Query: 1234 
VSEVSSPVMHSTSCEEVH--VNGY----------NNRKPEDNILGDG-VVIDDSTKSFPA 1094 +E ++ H E H VNG +++K +D L DG V + Sbjct: 118 -AEETANGFHEKKLEGEHKAVNGNGSQGESVSIEHHKKFQDAWLSDGPVAATEVANPVAL 176 Query: 1093 XXXXXXXXXXXXXXXDVMSASSNTEVDDNTMTERSWKPVASPFGEYFDAYEELSIEGTPQ 914 + + +SNTE+ D WKP ++P GEYFDA+EE+S EG Q Sbjct: 177 DPEKDGENEDFFDPQNSLGTTSNTELGDG------WKP-STPLGEYFDAFEEISSEGASQ 229 Query: 913 SR-RNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPVAPAA-- 743 S RN++ EL EMR NLL+EIERRKQ EEAL ++Q+QWQ L+Q LS+ GLSLP PA Sbjct: 230 SSYRNMENELREMRLNLLLEIERRKQAEEALENLQNQWQMLSQHLSLAGLSLPSPPAVTD 289 Query: 742 ENDGHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCDRLH 563 E D + IDPAEELC+Q+ IA VA SVGR +RAEVELE+E IEAKN EI RL DRLH Sbjct: 290 EKDEQSCIDPAEELCRQIVIAHFVAASVGRVFSRAEVELEVEPWIEAKNFEIARLWDRLH 349 Query: 562 YYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPTAKES 383 YYE NREMSQRNQE VE KWIW S+G+A LG+A + WSYLP +K S Sbjct: 350 YYEAANREMSQRNQEAVEMARQQQHRQKRRQKWIWGSVGLAATLGAAVIVWSYLPESKPS 409 >ref|XP_011025832.1| PREDICTED: uncharacterized protein LOC105126612 [Populus euphratica] gi|743838973|ref|XP_011025833.1| PREDICTED: uncharacterized protein LOC105126612 [Populus euphratica] Length = 486 Score = 305 bits (780), Expect = 1e-79 Identities = 200/479 (41%), Positives = 257/479 (53%), Gaps = 72/479 (15%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSISRPL--------MPKP--------------------DMQ 1484 MP FTA+ALDRLLEP A S+ P+ +PKP + + Sbjct: 1 MPHFTALALDRLLEPGASQSVDMPVPSSNNKYPVPKPQPKPKPPPPELKPPLPNSNLERR 60 Query: 1483 NSSS--EKKTRRTYVSPALYTTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXX 1310 NS+S E+K R +SP LY TPE+TPLPDSP SFPPSPYI++HKRRGPRL KS S Sbjct: 61 NSTSVIERKGNRPQISPGLYATPESTPLPDSPTSFPPSPYIINHKRRGPRLSKSFSEDDV 120 Query: 1309 XXXXXXXXXXXXERGVK-GDGKVMNLVSEVSSPVMHSTSCEEVHVNGYNNRKPEDNI--- 1142 V G KV++ + S + +S E VN N ++++ Sbjct: 121 ASRKKKLEKVEANGNVNNGVNKVVDSSNGHSVTLFIPSSVEGEFVNDVNRCPGKEDVVNG 180 Query: 1141 -------------------------LGDG------VVIDDSTKSFPAXXXXXXXXXXXXX 1055 LG G + D K Sbjct: 181 
VHDCPIEVGHVNGSHGGEIGSSRVQLGTGDTRKDLSMEKDMLKPIEQNVERNGDSDDFFD 240 Query: 1054 XXDVMSASSNTEVDDNTMTERSWKPVAS-PFGEYFDAYEELSIEGTPQ---SRRNVDFEL 887 D MS +SNT+V+D T S K A+ P GE++DA+EELS E Q S N EL Sbjct: 241 PQDSMSYTSNTDVEDTTAVGSSMKLTAALPVGEFYDAWEELSSESGQQPSPSPHNNGAEL 300 Query: 886 CEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPVAPA--AENDGHTEIDP 713 EMR +LLMEIE+RKQ EEAL +MQSQWQR+ Q+L++VGLSLP P E+D ++ +P Sbjct: 301 REMRLSLLMEIEKRKQAEEALDNMQSQWQRIRQELALVGLSLPACPVDVPESDQPSDANP 360 Query: 712 AEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCDRLHYYETVNREMS 533 AEE+C+Q+++A+ V+ S+GRG A+AEVE+EME+ +EAKN EI RL DRLHYYE VNRE+S Sbjct: 361 AEEICKQIYLARFVSESIGRGIAKAEVEIEMEAQVEAKNFEIARLLDRLHYYEAVNRELS 420 Query: 532 QRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPT-AKESISTRSDAP 359 Q NQE +E KW+W SI AI LG LAWSYLP + S S+ S AP Sbjct: 421 QWNQEVIETARRNREIRKRRQKWVWGSIAAAITLGMTTLAWSYLPAMSGSSSSSDSHAP 479 >ref|XP_010029768.1| PREDICTED: uncharacterized protein LOC104419718 [Eucalyptus grandis] gi|629090474|gb|KCW56727.1| hypothetical protein EUGRSUZ_I02415 [Eucalyptus grandis] gi|629090475|gb|KCW56728.1| hypothetical protein EUGRSUZ_I02415 [Eucalyptus grandis] Length = 449 Score = 305 bits (780), Expect = 1e-79 Identities = 192/444 (43%), Positives = 256/444 (57%), Gaps = 38/444 (8%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPR----AQNSISRPLM------PKPD----------MQNSSSEKKT 1460 MPTFTAIALDRLLEPR A S++ P+ P+P+ S E+K Sbjct: 1 MPTFTAIALDRLLEPRTSRTADKSVNSPMPVPKLKPPRPEPVPSAKLERRRSTSVMERKV 60 Query: 1459 RRTYVSPALYTTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXX 1280 +R ++PALY TPE+TP+PDSP SFPPSPYI++HKRRGP L+KS S Sbjct: 61 QRPQMTPALYATPESTPVPDSPSSFPPSPYIINHKRRGPHLVKSLSEDDVSARKKSMDEA 120 Query: 1279 XXERGVKGDGKVMNLVSEVSSPVMH--STSCEEVHVNGYNN--------RKPEDNI---- 1142 V + K + S PV S + E+ HVNG ++ R + Sbjct: 121 NTNSTVT-EVKSEEIASVGDLPVTFTLSNTVEDEHVNGIDDVCEVGSSDRSASSALEVGT 179 Query: 1141 --LGDGVVID-DSTKSFPAXXXXXXXXXXXXXXXDVMSASSNTEVDDNTMTERSWKPVAS 971 L +G+V + D+ P + MS +SNTE +DN ERS K + Sbjct: 180 
SNLNNGLVGETDTLVPVPMTPEREVDSEDFYDPQEAMSCTSNTEGEDNGTAERSVK-FTT 238 Query: 970 PFGEYFDAYEELSIEGTPQSR-RNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRL 794 P GE+FDA+EELS +G QS R+++ EL +R +LLMEIE+RKQ EE L ++QS WQ++ Sbjct: 239 PMGEFFDAWEELSSDGGAQSSLRDLEEELRGIRLSLLMEIEKRKQAEETLSNVQSNWQKI 298 Query: 793 AQQLSVVGLSLPVAPAAENDGHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMES 614 QQLS+ GL+LP E+D ++ AE+L QQV +A+ VA ++GRG A+AE E EME+ Sbjct: 299 RQQLSLAGLTLPADLTLESD-QLSVEAAEQLNQQVQLARFVAEAIGRGMAKAEAETEMEA 357 Query: 613 HIEAKNIEINRLCDRLHYYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIA 434 +E KN EI+RL DRLHYYE VN EMSQRNQE VE +W+W SI VA++ Sbjct: 358 QLEVKNFEISRLWDRLHYYEAVNHEMSQRNQEAVETARRLRQQRKRRQRWVWGSIAVALS 417 Query: 433 LGSAALAWSYLPTAKESISTRSDA 362 LG++ALAWSYLP+ S S + A Sbjct: 418 LGASALAWSYLPSGNGSRSDDNQA 441 >gb|KHN00526.1| hypothetical protein glysoja_000194 [Glycine soja] Length = 438 Score = 303 bits (775), Expect = 4e-79 Identities = 190/431 (44%), Positives = 253/431 (58%), Gaps = 26/431 (6%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSI--SRPL-MPKPD-MQNSSSEKKT------RRTYVSPALY 1430 MPTFTAIA DRL+EP A S P+ MP P ++ SSE KT R + PALY Sbjct: 1 MPTFTAIAFDRLIEPGASKPAYKSAPVPMPVPKKLERRSSEPKTVRKKPPPRPQLKPALY 60 Query: 1429 TTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKGDG 1250 TPE TPL D+P SFPPSPYI++HKRRGPRLLKS S G+ D Sbjct: 61 ATPEVTPLLDAPSSFPPSPYIINHKRRGPRLLKSFSEANVQSKQENLDNEIP-NGMSNDA 119 Query: 1249 KVMNLVSEVSSPVMHSTSCEEVHVNGYNNRKPEDN-----ILGDGVVIDDST-------- 1109 + ++ ++ +E VNG ++ + LG+G +S+ Sbjct: 120 VAASSDGDLQVNSTNTEPVKEEQVNGIHDTNLSSSGNNGGDLGEGHRESESSGILNGSSH 179 Query: 1108 --KSFPAXXXXXXXXXXXXXXXDVMSASSNTEVDDNTMTERSWKPVASPFGEYFDAYEEL 935 K D MS S T+ +DNT +++ K A+ GE+FDA+EEL Sbjct: 180 LDKVVAFNLEREGESEDFFDPHDSMSLKSCTDAEDNTGADQAGKFSAAG-GEFFDAWEEL 238 Query: 934 SIEG-TPQSRRNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLP 758 S +G T S R+++ EL E+R +LLMEIE+RKQVEE+L SMQSQW+RL Q+LS++G++LP Sbjct: 239 SSDGGTQNSHRDIEAELREIRLSLLMEIEKRKQVEESLNSMQSQWERLRQRLSLIGIALP 298 Query: 757 
VAPAAENDGHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRL 578 AE G DP E++CQQ++IA+ ++N++GRG ARAE E+EME+ +E+KN EI RL Sbjct: 299 SDLTAEG-GQLSSDPMEDVCQQLYIARFISNTIGRGIARAEAEIEMEAQLESKNFEIARL 357 Query: 577 CDRLHYYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLP 398 +RLH YET+NREMSQRNQE VE +WIW SI AIA+G+AA+AWSYLP Sbjct: 358 LERLHCYETMNREMSQRNQEAVEMARRERQRRSRRQRWIWGSITTAIAVGTAAIAWSYLP 417 Query: 397 TAKESISTRSD 365 + S S D Sbjct: 418 VGRGSTSAVHD 428 >ref|XP_003520299.1| PREDICTED: uncharacterized protein LOC100798468 isoform X1 [Glycine max] gi|571438869|ref|XP_006574697.1| PREDICTED: uncharacterized protein LOC100798468 isoform X2 [Glycine max] Length = 438 Score = 302 bits (773), Expect = 7e-79 Identities = 190/431 (44%), Positives = 253/431 (58%), Gaps = 26/431 (6%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSI--SRPL-MPKPD-MQNSSSEKKT------RRTYVSPALY 1430 MPTFTAIA DRL+EP A S P+ MP P ++ SSE KT R + PALY Sbjct: 1 MPTFTAIAFDRLIEPGASKPAYKSAPVPMPVPKKLERRSSEPKTVRKKPPPRPQLKPALY 60 Query: 1429 TTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKGDG 1250 TPE TPL D+P SFPPSPYI++HKRRGPRLLKS S G+ D Sbjct: 61 ATPEVTPLLDAPSSFPPSPYIINHKRRGPRLLKSFSEANVQSKQENLDNEIP-NGMSNDA 119 Query: 1249 KVMNLVSEVSSPVMHSTSCEEVHVNGYNNRKPEDN-----ILGDGVVIDDST-------- 1109 + ++ ++ +E VNG ++ + LG+G +S+ Sbjct: 120 VAASSDGDLQVNSTNTEPVKEEQVNGIHDTNLSSSGNNGGDLGEGHRESESSGILNGSSH 179 Query: 1108 --KSFPAXXXXXXXXXXXXXXXDVMSASSNTEVDDNTMTERSWKPVASPFGEYFDAYEEL 935 K D MS S T+ +DNT +++ K A+ GE+FDA+EEL Sbjct: 180 LDKVVAFNLEREGESEDFFDPHDSMSLKSCTDAEDNTGADQAGKFSAAG-GEFFDAWEEL 238 Query: 934 SIEG-TPQSRRNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLP 758 S +G T S R+++ EL E+R +LLMEIE+RKQVEE+L SMQSQW+RL Q+LS++G++LP Sbjct: 239 SSDGGTQNSHRDIEAELREIRLSLLMEIEKRKQVEESLNSMQSQWERLRQRLSLMGIALP 298 Query: 757 VAPAAENDGHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRL 578 AE G DP E++CQQ++IA+ ++N++GRG ARAE E+EME+ +E+KN EI RL Sbjct: 299 SDLTAEG-GQLSSDPMEDVCQQLYIARFISNTIGRGIARAEAEIEMEAQLESKNFEIARL 357 Query: 577 
CDRLHYYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLP 398 +RLH YET+NREMSQRNQE VE +WIW SI AIA+G+AA+AWSYLP Sbjct: 358 LERLHCYETMNREMSQRNQEAVEMARRERQRRSRRQRWIWGSITTAIAVGTAAIAWSYLP 417 Query: 397 TAKESISTRSD 365 + S S D Sbjct: 418 VGRGSTSAVHD 428 >ref|XP_003517215.1| PREDICTED: uncharacterized protein LOC100792599 [Glycine max] Length = 436 Score = 300 bits (767), Expect = 3e-78 Identities = 186/429 (43%), Positives = 249/429 (58%), Gaps = 24/429 (5%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSISRPL---MPKPDMQN-----SSSEKKTR--RTYVSPALY 1430 MPTFTA+ALDRL+EP A + + MP P+ Q S+ KK++ + + PALY Sbjct: 1 MPTFTAMALDRLIEPGASKPVDKSAPTSMPVPNSQKLERSTSAPAKKSKVPQPPLKPALY 60 Query: 1429 TTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKGDG 1250 TTPE TPLPD+P SFPPSPYI++HKRRGPRLLKSSS ++ V D Sbjct: 61 TTPEVTPLPDAPSSFPPSPYIINHKRRGPRLLKSSSEASALSEVNIRCDDDNDKSV--DA 118 Query: 1249 KVMNLVSEVSSPVMHSTSCEEVHVNG-YNNRKPEDNIL---------GDGVVIDDSTKSF 1100 V + ++ +E VNG Y+ + N + G G + + K Sbjct: 119 VVTSSAGDLQVTSTKPELVKEEKVNGVYDGQLDRSNDVDHANGHRETGSGSLTNGLLKEK 178 Query: 1099 PAXXXXXXXXXXXXXXXDV--MSASSNTEVDDNTMTERSWKPVASPFGEYFDAYEELSIE 926 P + MS SSNT+ ++N TE S K ++SP E++DA+EELS E Sbjct: 179 PPALNLDRVSEVEDFFYPLDSMSFSSNTDGEENAGTELSMK-LSSPSTEFYDAWEELSSE 237 Query: 925 GTPQ-SRRNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPV-A 752 G Q S +++ EL E+R +LL+EIE+RKQ EE++ +M+SQW+ + Q L G+ LP Sbjct: 238 GMSQNSTYDIEAELREVRLSLLVEIEKRKQAEESINNMRSQWESIRQGLYQAGIILPAYL 297 Query: 751 PAAENDGHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCD 572 A D DP E+LCQQV+IA+ ++N++G+G ARAE+E EME+ +EAKN EI RL D Sbjct: 298 NATAEDEQLTSDPVEDLCQQVYIARFISNAIGKGIARAELETEMEAQLEAKNFEIARLLD 357 Query: 571 RLHYYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPTA 392 RLH YET+NREMSQRNQE VE +WIW I IAL +AA+AWSYLPT+ Sbjct: 358 RLHCYETMNREMSQRNQEAVEMARCERQRSSRRQRWIWGCITTVIALSTAAIAWSYLPTS 417 Query: 391 KESISTRSD 365 K S S D Sbjct: 418 KGSSSADHD 426 >gb|KHN36085.1| hypothetical protein glysoja_003208 [Glycine soja] Length = 436 
Score = 299 bits (766), Expect = 4e-78 Identities = 185/429 (43%), Positives = 249/429 (58%), Gaps = 24/429 (5%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSISRPL---MPKPDMQN-----SSSEKKTR--RTYVSPALY 1430 MPTFTA+ALDRL+EP A + + MP P+ Q S+ KK++ + + PALY Sbjct: 1 MPTFTAMALDRLIEPGASKPVDKSAPTSMPVPNSQKLERSTSAPAKKSKVPQPPLKPALY 60 Query: 1429 TTPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKGDG 1250 TTPE TPLPD+P SFPPSPYI++HKRRGPRLLKSSS ++ V D Sbjct: 61 TTPEVTPLPDAPSSFPPSPYIINHKRRGPRLLKSSSEASALSEVNIRCDDDNDKSV--DA 118 Query: 1249 KVMNLVSEVSSPVMHSTSCEEVHVNG-YNNRKPEDNIL---------GDGVVIDDSTKSF 1100 + + ++ +E VNG Y+ + N + G G + + K Sbjct: 119 VITSSAGDLQVTSTKPELVKEEKVNGVYDGQLDRSNDVDHANGHRETGSGSLTNGLLKEK 178 Query: 1099 PAXXXXXXXXXXXXXXXDV--MSASSNTEVDDNTMTERSWKPVASPFGEYFDAYEELSIE 926 P + MS SSNT+ ++N TE S K ++SP E++DA+EELS E Sbjct: 179 PPALNLDRVSEVEDFFYPLDSMSFSSNTDGEENAGTELSMK-LSSPSTEFYDAWEELSSE 237 Query: 925 GTPQ-SRRNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLSLPV-A 752 G Q S +++ EL E+R +LL+EIE+RKQ EE++ +M+SQW+ + Q L G+ LP Sbjct: 238 GMSQNSTYDIEAELREVRLSLLVEIEKRKQAEESINNMRSQWESIRQGLYQAGIILPAYL 297 Query: 751 PAAENDGHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEINRLCD 572 A D DP E+LCQQV+IA+ ++N++G+G ARAE+E EME+ +EAKN EI RL D Sbjct: 298 NATAEDEQLTSDPVEDLCQQVYIARFISNAIGKGIARAELETEMEAQLEAKNFEIARLLD 357 Query: 571 RLHYYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSYLPTA 392 RLH YET+NREMSQRNQE VE +WIW I IAL +AA+AWSYLPT+ Sbjct: 358 RLHCYETMNREMSQRNQEAVEMARCERQRSSRRQRWIWGCITTVIALSTAAIAWSYLPTS 417 Query: 391 KESISTRSD 365 K S S D Sbjct: 418 KGSSSADHD 426 >gb|KHN11126.1| hypothetical protein glysoja_033680 [Glycine soja] Length = 436 Score = 297 bits (760), Expect = 2e-77 Identities = 193/442 (43%), Positives = 256/442 (57%), Gaps = 29/442 (6%) Frame = -1 Query: 1579 MPTFTAIALDRLLEPRAQNSI--SRPL-MPKPD-MQNSSSE----KKTR-RTYVSPALYT 1427 MPTFTAIA DRL+EP A S P+ MP P ++ +SE KK R R + PALY Sbjct: 1 MPTFTAIAFDRLIEPGASKPAYKSAPVPMPVPKKIERRTSEPTIRKKPRPRPQLKPALYA 60 Query: 1426 
TPEATPLPDSPVSFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXXERGVKG-DG 1250 TPE TPLPD+P SFPPSPYI++HKRRGPRLLKS S VK DG Sbjct: 61 TPEVTPLPDAPSSFPPSPYIINHKRRGPRLLKSYSEANVQAKQENLENENA--NVKSNDG 118 Query: 1249 KVMNLVSEVSSPVMHSTSCEEVHVNGYNN-----------------RKPEDNILGDGVVI 1121 + +L ++ + +E VNG ++ R+ E + + +G + Sbjct: 119 VITSLDGDLQVTFTNIEPVKEEQVNGVHDTDLSSSSNNKGDLGEAHRESESSGILNGSHL 178 Query: 1120 DDSTKSFPAXXXXXXXXXXXXXXXDVMSASSNTEVDDNTMTERSWKPVASPFGEYFDAYE 941 D K D MS S T+ +DNT T+++ K A+ GE+FDA+E Sbjct: 179 D---KVVALNLEREGESEDFFDPHDSMSLKSCTDGEDNTGTDQALKFSAAG-GEFFDAWE 234 Query: 940 ELSIEG-TPQSRRNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQQLSVVGLS 764 ELS +G T S R+++ EL E+R +LLMEIE+RKQVEE L SMQS W+RL Q+LS +G+ Sbjct: 235 ELSSDGGTQNSHRDIEAELREIRLSLLMEIEKRKQVEETLNSMQSHWERLGQRLSHIGIV 294 Query: 763 LPVAPAAENDGHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESHIEAKNIEIN 584 LP AE G DP E++CQQ++I + ++N++GRG ARAE E+EME+ +++KN EI Sbjct: 295 LPSDLTAEG-GQLSSDPMEDVCQQLYITRFISNTIGRGIARAEAEIEMEAQLQSKNFEIA 353 Query: 583 RLCDRLHYYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIALGSAALAWSY 404 RL +RLH YET+NREMSQRNQE VE +WIW S+ AIA+G+AA+AWSY Sbjct: 354 RLLERLHCYETMNREMSQRNQEAVEMARRERQRRSRRQRWIWGSVTTAIAVGTAAIAWSY 413 Query: 403 LPTAKES-ISTRSDAPAGGAAA 341 LP + S S P AA Sbjct: 414 LPVGRGSTFSVHDQVPEHDDAA 435 >gb|KEH20419.1| plant/F18B3-190 protein, putative [Medicago truncatula] Length = 449 Score = 296 bits (759), Expect = 3e-77 Identities = 188/442 (42%), Positives = 247/442 (55%), Gaps = 37/442 (8%) Frame = -1 Query: 1579 MPTFTAIALDRLLEP---RAQNSISRPLMPKPD---MQNSSSE-------KKTRRTYVSP 1439 MPTFTAIA DRL+EP RA + +P P+ ++ SSE K R + P Sbjct: 1 MPTFTAIAFDRLIEPGGSRAGQKSASTSVPVPNSKRLERRSSEPTATPRKKPPTRPQLKP 60 Query: 1438 ALYTTPEATPLPDSPV-----SFPPSPYIVDHKRRGPRLLKSSSRXXXXXXXXXXXXXXX 1274 +LY TPE TPLPDSP+ SFPPSPYI++HKRRGPRLLKS S Sbjct: 61 SLYATPEVTPLPDSPLQDSTSSFPPSPYIINHKRRGPRLLKSFSEANVQAKQEVCEEGNV 120 Query: 1273 ERGVKGDGKVMNLVSEVSSPVMHSTSCEEVHVNGYNNRKPEDNILGD------------- 1133 G D V + ++ ++ +E NG + K + GD Sbjct: 121 
S-GKSSDTVVSSSAGDLQVTCVNPEPLKEEQDNGVQDTKLSTSNGGDVGHENRENKSSND 179 Query: 1132 --GVVIDDSTKSFPAXXXXXXXXXXXXXXXDVMSASSNTEVDDNTMTERSWKPVASPFGE 959 G ++ MS +S T+ +DNT TERS K S E Sbjct: 180 PNGKHVEKLVALNLERDGESEDFYDPRDTMSSMSFTSYTDGEDNTGTERSAK--YSTAAE 237 Query: 958 YFDAYEELSIEGTPQSR---RNVDFELCEMRTNLLMEIERRKQVEEALYSMQSQWQRLAQ 788 +FDA+EELS +G Q R+VD EL E+R +LLMEIE+RKQ+EE++ SMQSQW+R+ + Sbjct: 238 FFDAWEELSSDGGTQGSLRLRDVDAELREIRLSLLMEIEKRKQIEESMKSMQSQWERIRE 297 Query: 787 QLSVVGLSLPV-APAAENDGHTEIDPAEELCQQVFIAQVVANSVGRGAARAEVELEMESH 611 LS VG+ LP A G + DP ++LCQQV+IA+ ++N++GRG ARAE E EM++ Sbjct: 298 GLSSVGIVLPADLTAIAEGGQLDSDPVDDLCQQVYIARFISNAIGRGTARAEAETEMKTQ 357 Query: 610 IEAKNIEINRLCDRLHYYETVNREMSQRNQETVEXXXXXXXXXXXXXKWIWSSIGVAIAL 431 +++KN EI+RL +RLHYYET+NREMSQRNQE VE KWIW SI AI L Sbjct: 358 LDSKNFEISRLLERLHYYETMNREMSQRNQEAVETARRERQRRSRRQKWIWGSITTAIVL 417 Query: 430 GSAALAWSYLPTAKESISTRSD 365 G+ A+AWSYLP++K S ST D Sbjct: 418 GTTAIAWSYLPSSKGSTSTDHD 439