BLASTX nr result
ID: Mentha22_contig00050779
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00050779 (824 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits) Value

gb|EYU37030.1| hypothetical protein MIMGU_mgv1a023242mg [Mimulus...   296   8e-78
emb|CBI17132.3| unnamed protein product [Vitis vinifera]              260   4e-67
ref|XP_002272611.1| PREDICTED: uncharacterized protein LOC100267...   260   4e-67
ref|XP_006347204.1| PREDICTED: uncharacterized protein LOC102592...   258   2e-66
ref|XP_004242103.1| PREDICTED: uncharacterized protein LOC101261...   258   2e-66
ref|XP_007208132.1| hypothetical protein PRUPE_ppa000392mg [Prun...   254   4e-65
ref|XP_006488001.1| PREDICTED: uncharacterized protein LOC102626...   247   4e-63
ref|XP_007016066.1| Uncharacterized protein isoform 1 [Theobroma...   247   4e-63
ref|XP_006424443.1| hypothetical protein CICLE_v10027698mg [Citr...   245   1e-62
gb|EPS67617.1| hypothetical protein M569_07165, partial [Genlise...   244   4e-62
ref|XP_007150144.1| hypothetical protein PHAVU_005G130400g [Phas...   239   9e-61
ref|XP_006594958.1| PREDICTED: uncharacterized protein LOC100795...   238   2e-60
ref|XP_004288928.1| PREDICTED: uncharacterized protein LOC101291...   238   3e-60
ref|XP_004487559.1| PREDICTED: uncharacterized protein LOC101497...   237   3e-60
ref|XP_002879111.1| predicted protein [Arabidopsis lyrata subsp....   237   4e-60
ref|XP_006592715.1| PREDICTED: uncharacterized protein LOC100788...   236   6e-60
ref|XP_002314306.2| hypothetical protein POPTR_0009s01060g [Popu...   236   6e-60
ref|XP_002523727.1| conserved hypothetical protein [Ricinus comm...
236 1e-59 gb|EXC24915.1| hypothetical protein L484_011781 [Morus notabilis] 200 5e-49 ref|XP_007016068.1| Uncharacterized protein isoform 3 [Theobroma... 166 1e-38 >gb|EYU37030.1| hypothetical protein MIMGU_mgv1a023242mg [Mimulus guttatus] Length = 1117 Score = 296 bits (757), Expect = 8e-78 Identities = 165/276 (59%), Positives = 193/276 (69%), Gaps = 2/276 (0%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 DVAHL+NKI DS FGSG+LDTPFR E +KI+ +M +W +SR LSFYHD DQGILYLQFS Sbjct: 60 DVAHLMNKIIDSRVFGSGNLDTPFRFEPDKINPDMGKWLQSRKLSFYHDVDQGILYLQFS 119 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 C VA SE R GFESV +++E GDLKGLIFMFSVCH+I+LIQEGSRFDT++LKKF Sbjct: 120 SAGCPVAGEGPSETRFGFESVFDDQEFGDLKGLIFMFSVCHIILLIQEGSRFDTQILKKF 179 Query: 361 RVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRN-ASA 537 R+LQ+AKH + PF RSQN +TSRP SS S+ K + I NRN AS+ Sbjct: 180 RILQSAKHAMSPFTRSQNPPPVTSRPPSSAHSQ-----TSHNNPSPGKSRAILNRNTASS 234 Query: 538 NTLMSGLG-SYTSLLPGQCTPVVLFVFVDDFSETFLGGNMEQXXXXXXXXXXXXXXXXXX 714 MSG+G SYTSLLPGQCTPVVLFVF+DDF+E + + E Sbjct: 235 IKTMSGVGSSYTSLLPGQCTPVVLFVFLDDFTEIKMEDSTE-----------------AS 277 Query: 715 GMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 ++TKGS SVVVLARPVNK E RKKLQSSLEAQI Sbjct: 278 SLNTKGSGSVVVLARPVNKPETSPRKKLQSSLEAQI 313 >emb|CBI17132.3| unnamed protein product [Vitis vinifera] Length = 935 Score = 260 bits (665), Expect = 4e-67 Identities = 147/277 (53%), Positives = 182/277 (65%), Gaps = 3/277 (1%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 DV+HL+N+I D AFGSG+L+ +E E E+ WFESR +S+YHDE++GIL+LQ+ Sbjct: 72 DVSHLMNRILDLNAFGSGNLEKGLCIEKE----EVKGWFESRRISYYHDEEKGILFLQYC 127 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 C E L + GF+S LEERE GDL+G++FMF+VCHVII IQEGSRFDT++LKKF Sbjct: 128 STGCPAMEGFLQTD-WGFDSALEEREFGDLQGMLFMFAVCHVIIYIQEGSRFDTQVLKKF 186 Query: 361 RVLQAAKHTIVPFIRSQN--ISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNAS 534 
RVLQAAKH++ PF+RS+ S TSRP S SR + G NRN S Sbjct: 187 RVLQAAKHSLAPFVRSRTTPTSISTSRPPS---SRPSLSATSSNNPSPGRGGGSSNRNTS 243 Query: 535 ANTLMSGLGSYTSLLPGQCTPVVLFVFVDDFSETF-LGGNMEQXXXXXXXXXXXXXXXXX 711 + +LMSGLGSY SL PGQC PV LFVF+DDFS+ N+++ Sbjct: 244 SISLMSGLGSYASLFPGQCNPVTLFVFLDDFSDVLNPTSNVDESTDNSFNQSSSLSNLAR 303 Query: 712 XGMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + TKGS SVVVLARP +KSEGG RKKLQSSLEAQI Sbjct: 304 PSLPTKGSGSVVVLARPGSKSEGGFRKKLQSSLEAQI 340 >ref|XP_002272611.1| PREDICTED: uncharacterized protein LOC100267175 [Vitis vinifera] Length = 1226 Score = 260 bits (665), Expect = 4e-67 Identities = 147/277 (53%), Positives = 182/277 (65%), Gaps = 3/277 (1%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 DV+HL+N+I D AFGSG+L+ +E E E+ WFESR +S+YHDE++GIL+LQ+ Sbjct: 63 DVSHLMNRILDLNAFGSGNLEKGLCIEKE----EVKGWFESRRISYYHDEEKGILFLQYC 118 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 C E L + GF+S LEERE GDL+G++FMF+VCHVII IQEGSRFDT++LKKF Sbjct: 119 STGCPAMEGFLQTD-WGFDSALEEREFGDLQGMLFMFAVCHVIIYIQEGSRFDTQVLKKF 177 Query: 361 RVLQAAKHTIVPFIRSQN--ISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNAS 534 RVLQAAKH++ PF+RS+ S TSRP S SR + G NRN S Sbjct: 178 RVLQAAKHSLAPFVRSRTTPTSISTSRPPS---SRPSLSATSSNNPSPGRGGGSSNRNTS 234 Query: 535 ANTLMSGLGSYTSLLPGQCTPVVLFVFVDDFSETF-LGGNMEQXXXXXXXXXXXXXXXXX 711 + +LMSGLGSY SL PGQC PV LFVF+DDFS+ N+++ Sbjct: 235 SISLMSGLGSYASLFPGQCNPVTLFVFLDDFSDVLNPTSNVDESTDNSFNQSSSLSNLAR 294 Query: 712 XGMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + TKGS SVVVLARP +KSEGG RKKLQSSLEAQI Sbjct: 295 PSLPTKGSGSVVVLARPGSKSEGGFRKKLQSSLEAQI 331 >ref|XP_006347204.1| PREDICTED: uncharacterized protein LOC102592220 isoform X1 [Solanum tuberosum] gi|565360907|ref|XP_006347205.1| PREDICTED: uncharacterized protein LOC102592220 isoform X2 [Solanum tuberosum] Length = 1237 Score = 258 bits (659), Expect = 2e-66 Identities = 143/281 (50%), Positives = 184/281 (65%), Gaps = 7/281 (2%) Frame = +1 Query: 1 
DVAHLINKISDSYAFGSGSLDTPFRLEA--EKIDL----EMSRWFESRNLSFYHDEDQGI 162 DVA+L+N+I DS FGSG LD P + EK D +M WFE RN+S++HDE++GI Sbjct: 77 DVAYLMNRIIDSNVFGSGGLDKPIFVNEPDEKTDFAVTDDMKSWFEFRNISYHHDEEKGI 136 Query: 163 LYLQFSMVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDT 342 L+LQFS C + E L E ++GF+S+LE+ E GDL+ ++FMFSVCHV++ IQEG RFDT Sbjct: 137 LFLQFSSTRCPLMEGNL-ESKMGFDSLLEDYEYGDLQAMLFMFSVCHVVVFIQEGPRFDT 195 Query: 343 RLLKKFRVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQN 522 ++LKK RVLQAAK + PF++SQ++ S + SR K GI N Sbjct: 196 QILKKLRVLQAAKQAMTPFVKSQSLPLSVSGSPFASPSRRAASGRSSDNPSPVKSHGIFN 255 Query: 523 RNASANTLMSGLGSYTSLLPGQCTPVVLFVFVDDFSETFLGGNMEQ-XXXXXXXXXXXXX 699 RN SA TLMSGLGSYTSLLPGQCTPV LFVF+DDF++ + ++E+ Sbjct: 256 RNNSAITLMSGLGSYTSLLPGQCTPVTLFVFLDDFADDYPSSSVEEPADISSANQSSSVG 315 Query: 700 XXXXXGMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + K + SVVVLARP++KSEGG RKKLQSSLEAQI Sbjct: 316 ASARPSVAPKVAGSVVVLARPMSKSEGGFRKKLQSSLEAQI 356 >ref|XP_004242103.1| PREDICTED: uncharacterized protein LOC101261038 [Solanum lycopersicum] Length = 1221 Score = 258 bits (659), Expect = 2e-66 Identities = 143/281 (50%), Positives = 185/281 (65%), Gaps = 7/281 (2%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEA--EKIDL----EMSRWFESRNLSFYHDEDQGI 162 DVA+L+N+I DS FGSG LD P + EK + +M WFE RN+S++HDE++GI Sbjct: 77 DVAYLMNRIIDSNVFGSGGLDKPIFVNKPDEKTNFAVTDDMKSWFEFRNISYHHDEEKGI 136 Query: 163 LYLQFSMVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDT 342 L+LQ S C + E L E ++GF+S+LE+ E GDL+ ++FMFSVCHV++ IQEG RFDT Sbjct: 137 LFLQLSSTRCPLMEGNL-ESKMGFDSLLEDYEYGDLQAMLFMFSVCHVVVFIQEGPRFDT 195 Query: 343 RLLKKFRVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQN 522 ++LKK RVLQAAK + PF++SQ++S S + SR K +GI N Sbjct: 196 QILKKLRVLQAAKQAMAPFVKSQSLSPSVSGSPFASPSRRATSGRSSDNPSPVKSRGIFN 255 Query: 523 RNASANTLMSGLGSYTSLLPGQCTPVVLFVFVDDFSETFLGGNMEQ-XXXXXXXXXXXXX 699 RN SA TLMSGLGSYTSLLPGQCTPV LFVF+DDF++ + ++E+ Sbjct: 256 RNNSAITLMSGLGSYTSLLPGQCTPVTLFVFLDDFADDYPSSSVEEPGDISSANQSSSVG 315 Query: 700 
XXXXXGMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + K S SVVVLARP++KSEGG RKKLQSSLEAQI Sbjct: 316 ASARPSLAPKVSGSVVVLARPMSKSEGGFRKKLQSSLEAQI 356 >ref|XP_007208132.1| hypothetical protein PRUPE_ppa000392mg [Prunus persica] gi|462403774|gb|EMJ09331.1| hypothetical protein PRUPE_ppa000392mg [Prunus persica] Length = 1213 Score = 254 bits (648), Expect = 4e-65 Identities = 144/277 (51%), Positives = 175/277 (63%), Gaps = 3/277 (1%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 D A LIN+I D FGSG+LD LE E E+ WF R +S++H++ +GIL+LQF Sbjct: 65 DSAQLINRILDFNVFGSGNLDKSLCLEKE----ELRDWFRWRRISYFHEQQKGILFLQFC 120 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 C + SE GF+S +EE + GDL+GL+FMFSVCHVII IQEGSRF++ LLK F Sbjct: 121 STRCPAMDDGFSESGSGFDSPVEEHDFGDLQGLLFMFSVCHVIIYIQEGSRFESELLKNF 180 Query: 361 RVLQAAKHTIVPFIRSQNISSLTSRPRSSVQS-RIXXXXXXXXXXXXXKIQGIQNRNASA 537 RVLQAAKH + PF+RSQ + SRP SS+ S R + I NRNAS+ Sbjct: 181 RVLQAAKHALAPFVRSQTLQPTPSRPPSSLSSARPTTSTTSTNSSSQGRSGSILNRNASS 240 Query: 538 NTLMSGLGSYTSLLPGQCTPVVLFVFVDDFSET-FLGGNMEQ-XXXXXXXXXXXXXXXXX 711 +LMSGLGSYTSL PGQCTPV LFVF+DDFS+ N+E+ Sbjct: 241 ISLMSGLGSYTSLFPGQCTPVTLFVFIDDFSDVPNPSSNVEESSDTSSHNQSSSLGSLAR 300 Query: 712 XGMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + KGS SVVVLARPV+KSEG RKKLQSSLEAQI Sbjct: 301 PSLPVKGSGSVVVLARPVSKSEGSFRKKLQSSLEAQI 337 >ref|XP_006488001.1| PREDICTED: uncharacterized protein LOC102626935 isoform X1 [Citrus sinensis] gi|568869587|ref|XP_006488002.1| PREDICTED: uncharacterized protein LOC102626935 isoform X2 [Citrus sinensis] Length = 1207 Score = 247 bits (630), Expect = 4e-63 Identities = 137/274 (50%), Positives = 175/274 (63%), Gaps = 2/274 (0%) Frame = +1 Query: 7 AHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFSMV 186 + LIN++ DS FGSG LD +E E E+ RWFESR +S+YH+E++GIL+LQF Sbjct: 63 SQLINRVLDSNTFGSGRLDKGLDVEKE----EVKRWFESRRISYYHEEEKGILFLQFCST 118 Query: 187 NCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKFRV 366 + ++S F+SV+ E+E 
GDL+GL+FMFSVCHVI+ IQEGSRFDT +LKKFRV Sbjct: 119 RSSESDS-------DFDSVITEQEFGDLQGLLFMFSVCHVIVYIQEGSRFDTEILKKFRV 171 Query: 367 LQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASANTL 546 LQAAKH + P++++++ L SRP SS SR + GI RNASA + Sbjct: 172 LQAAKHALTPYVKARSTPPLPSRPHSSSLSRPSVLVTTPNSSSSSRSGGISGRNASAISF 231 Query: 547 MSGLGSYTSLLPGQCTPVVLFVFVDDFSETFLGGNM--EQXXXXXXXXXXXXXXXXXXGM 720 MSGLGS+TSL PGQCTPV LFVF+DDF++T + E + Sbjct: 232 MSGLGSHTSLFPGQCTPVALFVFIDDFADTPNPSSNVDESTDTSLLSQPSSSSSLTRPTL 291 Query: 721 HTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 KGS SVVVLARP +K EG RKKLQSSL+AQI Sbjct: 292 PVKGSGSVVVLARPSSKLEGSFRKKLQSSLDAQI 325 >ref|XP_007016066.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590587827|ref|XP_007016067.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508786429|gb|EOY33685.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508786430|gb|EOY33686.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1219 Score = 247 bits (630), Expect = 4e-63 Identities = 142/276 (51%), Positives = 177/276 (64%), Gaps = 2/276 (0%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 D + LIN++ DS FGSG ++ + + E+ WF+ R +S+YH+ED+GIL+LQF Sbjct: 57 DSSQLINRVVDSNVFGSGKMNRVLSPDKD----ELKDWFKYRRISYYHEEDKGILFLQFC 112 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 C V L+ F+ VLEERE GDL+GL+FMFSVCH+II IQEGSRFDT+ LKKF Sbjct: 113 SNGCPVFNGSLASGS-DFDGVLEEREFGDLQGLLFMFSVCHIIIYIQEGSRFDTQNLKKF 171 Query: 361 RVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASAN 540 RVLQAAKH + P+++S+ L SRP SS SR + G+ RNASA Sbjct: 172 RVLQAAKHALTPYVKSRTTPPLPSRPHSSSTSR-PSTIATTASTSPGRSGGMLGRNASAI 230 Query: 541 TLMSGLGSYTSLLPGQCTPVVLFVFVDDFSETFLG-GNMEQ-XXXXXXXXXXXXXXXXXX 714 +LMSGLGSYTSL PGQCTPV LFVF+DDFS+ N+E+ Sbjct: 231 SLMSGLGSYTSLFPGQCTPVTLFVFIDDFSDVLNSTPNIEESVETSSINHASNSSSLARP 290 Query: 715 GMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + KGS+SVVVLARPV+KSEG RKKLQSSLEAQI Sbjct: 291 
TLPMKGSASVVVLARPVSKSEGVFRKKLQSSLEAQI 326 >ref|XP_006424443.1| hypothetical protein CICLE_v10027698mg [Citrus clementina] gi|567863580|ref|XP_006424444.1| hypothetical protein CICLE_v10027698mg [Citrus clementina] gi|557526377|gb|ESR37683.1| hypothetical protein CICLE_v10027698mg [Citrus clementina] gi|557526378|gb|ESR37684.1| hypothetical protein CICLE_v10027698mg [Citrus clementina] Length = 1207 Score = 245 bits (626), Expect = 1e-62 Identities = 136/274 (49%), Positives = 174/274 (63%), Gaps = 2/274 (0%) Frame = +1 Query: 7 AHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFSMV 186 + LIN++ DS FGSG LD +E E E+ RWFESR +S+YH+E++GIL+LQF Sbjct: 63 SQLINRVLDSNTFGSGRLDKGLDVEKE----EVKRWFESRRISYYHEEEKGILFLQFCST 118 Query: 187 NCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKFRV 366 + ++S F+S + E+E GDL+GL+FMFSVCHVI+ IQEGSRFDT +LKKFRV Sbjct: 119 RSSESDS-------DFDSAITEQEFGDLQGLLFMFSVCHVIVYIQEGSRFDTEILKKFRV 171 Query: 367 LQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASANTL 546 LQAAKH + P++++++ L SRP SS SR + GI RNASA + Sbjct: 172 LQAAKHALTPYVKARSTPPLPSRPHSSSLSRPSVLVTTPNSSSSSRSGGISGRNASAISF 231 Query: 547 MSGLGSYTSLLPGQCTPVVLFVFVDDFSETFLGGNM--EQXXXXXXXXXXXXXXXXXXGM 720 MSGLGS+TSL PGQCTPV LFVF+DDF++T + E + Sbjct: 232 MSGLGSHTSLFPGQCTPVALFVFIDDFADTPNPSSNADESTDTSLLSQPSSSSSLTRPTL 291 Query: 721 HTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 KGS SVVVLARP +K EG RKKLQSSL+AQI Sbjct: 292 PVKGSGSVVVLARPSSKLEGSFRKKLQSSLDAQI 325 >gb|EPS67617.1| hypothetical protein M569_07165, partial [Genlisea aurea] Length = 660 Score = 244 bits (622), Expect = 4e-62 Identities = 141/279 (50%), Positives = 178/279 (63%), Gaps = 5/279 (1%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 DVAH INK+ DS+ FGSG LD PF +AEK+ EM RWFE RNLSFYHD +G +YLQFS Sbjct: 49 DVAHFINKLIDSHVFGSGKLDEPFPFDAEKLSPEMKRWFEGRNLSFYHDAVRGFVYLQFS 108 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 + C E+V+SEE +GFE + +E+EL DL+GL+FMFSVCH+II IQEG RFD +L+KF Sbjct: 
109 PLFCPTVENVVSEETVGFEPIFDEQELADLQGLLFMFSVCHIIIFIQEGYRFDLLMLRKF 168 Query: 361 RVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQG---IQNRNA 531 RVLQAAK+ + + +I + TSRP SS + ++G + NA Sbjct: 169 RVLQAAKNRL-----ATSIGTRTSRPNSSSSDK------------HSPVRGRVILNRNNA 211 Query: 532 SANTLMSGLGSYTSLLPGQCTPVVLFVFVDDFSETFL-GGNMEQXXXXXXXXXXXXXXXX 708 +A TL+SGL S+T+LLPGQ TPV+LFVFVDDF+E GNM Sbjct: 212 AAVTLLSGLSSHTALLPGQFTPVLLFVFVDDFTEIQQPSGNMGDAA-------------- 257 Query: 709 XXGMHTKGSS-SVVVLARPVNKSEGGLRKKLQSSLEAQI 822 +KGSS S V+ +RP GGL+KKLQSSLE QI Sbjct: 258 -----SKGSSDSGVMFSRPAANPSGGLKKKLQSSLEKQI 291 >ref|XP_007150144.1| hypothetical protein PHAVU_005G130400g [Phaseolus vulgaris] gi|561023408|gb|ESW22138.1| hypothetical protein PHAVU_005G130400g [Phaseolus vulgaris] Length = 1211 Score = 239 bits (610), Expect = 9e-61 Identities = 133/276 (48%), Positives = 166/276 (60%), Gaps = 2/276 (0%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 D A L++++ DS F SG+LD P +E E E WFE R +S++HD ++GIL+LQFS Sbjct: 59 DSAQLLDRVIDSNVFASGNLDAPLLVEDE----EAREWFERRRISYFHDHERGILFLQFS 114 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 C + GF+S LEE E GDL+G++FMFSVCHVII IQEGS F +R+L+ F Sbjct: 115 STRCPAIHTATDVAPPGFDSALEEHEFGDLQGMLFMFSVCHVIIYIQEGSHFGSRILRNF 174 Query: 361 RVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASAN 540 RVLQ+AKH + PF+RSQ + L +R S SR + G +RN SA Sbjct: 175 RVLQSAKHAMAPFVRSQTMPPLPARLHPSSSSR---PASAANNSSPGRGGGNLSRNVSAI 231 Query: 541 TLMSGLGSYTSLLPGQCTPVVLFVFVDDFS--ETFLGGNMEQXXXXXXXXXXXXXXXXXX 714 +LMSGLGSY SL PGQC PV LFVF+DDFS + E Sbjct: 232 SLMSGLGSYASLFPGQCIPVTLFVFIDDFSSLSSSSANGDESSDSTSLSHSSSLSGTAKG 291 Query: 715 GMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + KGS SVVVLARP ++SEGG RKKLQSSLEAQI Sbjct: 292 NLSAKGSGSVVVLARPASRSEGGFRKKLQSSLEAQI 327 >ref|XP_006594958.1| PREDICTED: uncharacterized protein LOC100795370 isoform X1 [Glycine max] gi|571502415|ref|XP_006594959.1| PREDICTED: uncharacterized protein LOC100795370 
isoform X2 [Glycine max] gi|571502418|ref|XP_006594960.1| PREDICTED: uncharacterized protein LOC100795370 isoform X3 [Glycine max] gi|571502422|ref|XP_006594961.1| PREDICTED: uncharacterized protein LOC100795370 isoform X4 [Glycine max] Length = 1213 Score = 238 bits (607), Expect = 2e-60 Identities = 137/277 (49%), Positives = 168/277 (60%), Gaps = 3/277 (1%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 D A L+N++ DS AF SG+LD P ++ E E WFE R +S++HD D+GIL+LQFS Sbjct: 61 DSAQLLNRVIDSNAFASGNLDAPLLVDDE----EAKEWFERRRISYFHDHDKGILFLQFS 116 Query: 181 MVNCTVAESVLSEERL-GFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKK 357 C + GF+S +EE E GDL+G++FMFSVCHVII IQ+ S F TR+L+ Sbjct: 117 STRCPAIHAAADGTAPPGFDSAVEEHEFGDLQGMLFMFSVCHVIIYIQDRSHFGTRILRN 176 Query: 358 FRVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASA 537 FRVLQAAKH + PF+RSQ + L SR S SR + G RN SA Sbjct: 177 FRVLQAAKHAMAPFVRSQTMPPLPSRSHPSPSSR---PVSSANNSSPVRGGGNLGRNVSA 233 Query: 538 NTLMSGLGSYTSLLPGQCTPVVLFVFVDDFSE-TFLGGNMEQXXXXXXXXXXXXXXXXXX 714 +LMSGLGSY SL PGQC PV LFVF+DDFS + N E+ Sbjct: 234 ISLMSGLGSYASLFPGQCIPVTLFVFIDDFSSLSNSSANGEESSDGSLINQSSSFSGAAK 293 Query: 715 G-MHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 G + KGS SVVVLARP ++SEGG RKKLQSSLEAQI Sbjct: 294 GNLPAKGSGSVVVLARPASRSEGGYRKKLQSSLEAQI 330 >ref|XP_004288928.1| PREDICTED: uncharacterized protein LOC101291573 [Fragaria vesca subsp. 
vesca] Length = 1173 Score = 238 bits (606), Expect = 3e-60 Identities = 136/276 (49%), Positives = 172/276 (62%), Gaps = 2/276 (0%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 D A LIN+I DS FGSG+ +E ++ E+ WF+ R +S++HDE +GIL+LQF Sbjct: 58 DSAQLINRILDSNVFGSGNRAKTLGVEKQE---ELRDWFKWRGISYFHDEQKGILFLQFC 114 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 C+ +S LS+ GF+S EE + GDL+G++FMF VCHVII + EGSRFDT+LLKKF Sbjct: 115 SSLCSAVDSGLSDSGSGFDSAFEEHDSGDLQGMLFMFYVCHVIIYVLEGSRFDTQLLKKF 174 Query: 361 RVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASAN 540 RVLQA KH + P +R +N+ S+P SS SR + + RNAS+ Sbjct: 175 RVLQAGKHALAPLVRPRNMQPTPSKPYSS-SSRPTTSAASSKNSSPGRGGSMLTRNASSI 233 Query: 541 TLMSGLGSYTSLLPGQCTPVVLFVFVDDFSET-FLGGNMEQ-XXXXXXXXXXXXXXXXXX 714 ++MSGLGSYTSL PGQCTPV LFVFVDDF + N+E Sbjct: 234 SVMSGLGSYTSLFPGQCTPVTLFVFVDDFYDVPNPSSNVEDLVDTSSLNQPSSLGTSARP 293 Query: 715 GMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + KGS SVVVLARPV+KSEG RKKLQSSLEAQI Sbjct: 294 SLPVKGSGSVVVLARPVSKSEGSFRKKLQSSLEAQI 329 >ref|XP_004487559.1| PREDICTED: uncharacterized protein LOC101497558 isoform X1 [Cicer arietinum] gi|502083773|ref|XP_004487560.1| PREDICTED: uncharacterized protein LOC101497558 isoform X2 [Cicer arietinum] gi|502083776|ref|XP_004487561.1| PREDICTED: uncharacterized protein LOC101497558 isoform X3 [Cicer arietinum] gi|502083779|ref|XP_004487562.1| PREDICTED: uncharacterized protein LOC101497558 isoform X4 [Cicer arietinum] Length = 1219 Score = 237 bits (605), Expect = 3e-60 Identities = 137/278 (49%), Positives = 168/278 (60%), Gaps = 4/278 (1%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 D HL+N++ DS F SG++D P ++ E E WF R +S++ D D+GIL+L F+ Sbjct: 59 DSTHLLNRVIDSNVFASGNIDIPLLVDDE----EAKEWFMRRRISYFRDRDKGILFLHFA 114 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 + +E LGF+SV EE E GDL+G++FMFSVCHVII IQEGSRFDTR+L+ F Sbjct: 115 
STRFFPSVHDFTEPSLGFDSVREEHEFGDLQGMLFMFSVCHVIIYIQEGSRFDTRVLRNF 174 Query: 361 RVLQAAKHTIVPFIRSQNI-SSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASA 537 RVLQAAKH + PF+R + +L SR S G NRNASA Sbjct: 175 RVLQAAKHAMAPFVRLKGAPPTLPSRVHSPAPVSSRAVSSGNNSSPGRGGGGKLNRNASA 234 Query: 538 NTLMSGLGSYTSLLPGQCTPVVLFVFVDDFS---ETFLGGNMEQXXXXXXXXXXXXXXXX 708 +LMSGLGSYTSL PGQC PV+LFVFVDDFS + G+ Sbjct: 235 VSLMSGLGSYTSLFPGQCIPVMLFVFVDDFSNLLNSCTNGDESSDVSSLNQSSNLSSVGK 294 Query: 709 XXGMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 TKGS SVVVLARP ++SEGGLRKKLQSSLEAQI Sbjct: 295 TNLPATKGSGSVVVLARPASRSEGGLRKKLQSSLEAQI 332 >ref|XP_002879111.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297324950|gb|EFH55370.1| predicted protein [Arabidopsis lyrata subsp. lyrata] Length = 1189 Score = 237 bits (604), Expect = 4e-60 Identities = 140/277 (50%), Positives = 177/277 (63%), Gaps = 3/277 (1%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 D +HLIN++ D+ FGSG L+ L +K D + WF R + +YH+ED+GI+++QFS Sbjct: 58 DSSHLINQVLDNNVFGSGKLNKI--LTVDKPDFQ--DWFRFRKICYYHEEDKGIVFVQFS 113 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 + C ++ S GF+SVLEERE GDL+GL+FMFSVCHVII IQEGSRFDTRLLKKF Sbjct: 114 PIICP---ALSSSSDSGFDSVLEEREFGDLQGLLFMFSVCHVIINIQEGSRFDTRLLKKF 170 Query: 361 RVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASAN 540 RVLQA+K + PF+RSQ + LTSR SS + + GI +R+ S+ Sbjct: 171 RVLQASKQALAPFVRSQTVLPLTSRLHSSSNN------FSQLHSASSRGGGIVSRSGSSV 224 Query: 541 TLMSGLGSYTSLLPGQCTPVVLFVFVDDFSETF-LGGNMEQ--XXXXXXXXXXXXXXXXX 711 +L SG GSYTSL PGQC PV LFVF+DDFS+ N+E Sbjct: 225 SLKSGGGSYTSLFPGQCNPVTLFVFLDDFSDMLKSSSNVEDSTTTSSANDQSVNTGKLTR 284 Query: 712 XGMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + TK S SVVVL+RP +KSEGGLRKKLQSSLEAQ+ Sbjct: 285 SELPTKNSGSVVVLSRPGSKSEGGLRKKLQSSLEAQV 321 >ref|XP_006592715.1| PREDICTED: uncharacterized protein LOC100788114 isoform X2 [Glycine max] gi|571494000|ref|XP_006592716.1| PREDICTED: uncharacterized protein LOC100788114 isoform X3 [Glycine 
max] gi|571494002|ref|XP_006592717.1| PREDICTED: uncharacterized protein LOC100788114 isoform X4 [Glycine max] gi|571494004|ref|XP_006592718.1| PREDICTED: uncharacterized protein LOC100788114 isoform X5 [Glycine max] gi|571494006|ref|XP_003540204.2| PREDICTED: uncharacterized protein LOC100788114 isoform X1 [Glycine max] gi|571494008|ref|XP_006592719.1| PREDICTED: uncharacterized protein LOC100788114 isoform X6 [Glycine max] Length = 791 Score = 236 bits (603), Expect = 6e-60 Identities = 134/276 (48%), Positives = 164/276 (59%), Gaps = 2/276 (0%) Frame = +1 Query: 1 DVAHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFS 180 D A L+N++ DS F SG+LDTP ++ E E WFE R +S++HD D+GIL+LQFS Sbjct: 61 DSAQLLNRVIDSNVFASGNLDTPLLVDDE----EAREWFERRRISYFHDHDKGILFLQFS 116 Query: 181 MVNCTVAESVLSEERLGFESVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKF 360 C V + + GF+S +EE E GDL+G++FMFSVCHVII IQEGS F T +L+ F Sbjct: 117 STRCPVNHAAAAPS--GFDSAVEEHEFGDLQGMLFMFSVCHVIIYIQEGSHFGTGILRNF 174 Query: 361 RVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASAN 540 RVLQAAKH + PF+R Q + L SR S S+ + G RN SA Sbjct: 175 RVLQAAKHAMAPFVRYQTMGPLPSRSHPSPSSQ---PVSSVNNSSPGRGGGNLGRNMSAI 231 Query: 541 TLMSGLGSYTSLLPGQCTPVVLFVFVDDFS--ETFLGGNMEQXXXXXXXXXXXXXXXXXX 714 +LMSGLGSY SL PGQC PV LFVF+DDFS E Sbjct: 232 SLMSGLGSYASLFPGQCIPVTLFVFIDDFSSLSNSSANGEESLDGSSLNQSSSLSSAAKE 291 Query: 715 GMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + KGS SVVVLARP ++SEGG RKKLQ SLEAQI Sbjct: 292 NLPAKGSGSVVVLARPASRSEGGFRKKLQLSLEAQI 327 >ref|XP_002314306.2| hypothetical protein POPTR_0009s01060g [Populus trichocarpa] gi|550330780|gb|EEE88261.2| hypothetical protein POPTR_0009s01060g [Populus trichocarpa] Length = 1015 Score = 236 bits (603), Expect = 6e-60 Identities = 134/274 (48%), Positives = 171/274 (62%), Gaps = 3/274 (1%) Frame = +1 Query: 10 HLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFSMVN 189 HLIN+ DS AFGSG LD ++ E E+ WF+ R +S+YH+E++G+L+LQF + Sbjct: 65 HLINRTLDSNAFGSGHLDKTLFVDKE----EVKDWFKKRKISYYHEEEKGLLFLQFCSIR 120 
Query: 190 CTVAESVLSEERLGFE-SVLEERELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKFRV 366 C + GF S LEE E +L+GL+FMFSVCHVI+ IQEGSRFDT +L+KFR+ Sbjct: 121 CPIIH--------GFSNSGLEELEFEELQGLLFMFSVCHVILYIQEGSRFDTHVLQKFRL 172 Query: 367 LQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASANTL 546 LQA+KH + P++RS+ I L+SRP SS+ S + +RN+SA ++ Sbjct: 173 LQASKHALTPYVRSRTIPPLSSRPHSSLSS---SRLASSTGSSPVRSGSFTSRNSSAVSI 229 Query: 547 MSGLGSYTSLLPGQCTPVVLFVFVDDFSETFLGGNM--EQXXXXXXXXXXXXXXXXXXGM 720 MSGLGSY SL PG CTPV+LFVFVDDF + G+ E Sbjct: 230 MSGLGSYVSLFPGYCTPVMLFVFVDDFLDVLNSGSSVEESTDSSSFNQSSGLSSVARSNA 289 Query: 721 HTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 KGS SVVVLARPV+KSEGG RKKLQSSLEAQI Sbjct: 290 PAKGSGSVVVLARPVSKSEGGFRKKLQSSLEAQI 323 >ref|XP_002523727.1| conserved hypothetical protein [Ricinus communis] gi|223537031|gb|EEF38667.1| conserved hypothetical protein [Ricinus communis] Length = 1233 Score = 236 bits (601), Expect = 1e-59 Identities = 143/286 (50%), Positives = 171/286 (59%), Gaps = 14/286 (4%) Frame = +1 Query: 7 AHLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFSMV 186 + LIN++ DS FGSG LD ++ E E+ WF+ R +S+YHDE++G L+LQF + Sbjct: 59 SQLINRVLDSNVFGSGHLDKLLSIDKE----ELKDWFKWRRISYYHDEEKGFLFLQFCSI 114 Query: 187 NCTVAESVLSEERL-GFESVLEERELGDLKGLIFMFS-----------VCHVIILIQEGS 330 C V L +SVLEE E DL+GL+FMFS VCHVII IQEG Sbjct: 115 RCPVVHGSSRSGLLQDLDSVLEENEFEDLQGLLFMFSIFQRTAQLAMQVCHVIIYIQEGL 174 Query: 331 RFDTRLLKKFRVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQ 510 RFD LKKFRVLQAAKH + P++RS++ L SRP SS S + Sbjct: 175 RFDPHSLKKFRVLQAAKHALAPYVRSRSTPPLPSRPHSSSAS---SKPSPSTSSSPGRGG 231 Query: 511 GIQNRNASANTLMSGLGSYTSLLPGQCTPVVLFVFVDD-FSETFLGGNMEQ-XXXXXXXX 684 GI +RNASA +LMSGLGSYTSL PG CTPV+LFVFVDD F N+E+ Sbjct: 232 GIMSRNASAISLMSGLGSYTSLFPGNCTPVILFVFVDDLFDMPNPNSNVEESKDVPSLNQ 291 Query: 685 XXXXXXXXXXGMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 + TKGS SVVVLARPVNKSEGG RKKLQSSLEAQI Sbjct: 292 SSSMSSVARPNLPTKGSGSVVVLARPVNKSEGGFRKKLQSSLEAQI 337 >gb|EXC24915.1| hypothetical protein L484_011781 [Morus 
notabilis] Length = 1321 Score = 200 bits (509), Expect = 5e-49 Identities = 125/274 (45%), Positives = 156/274 (56%), Gaps = 3/274 (1%) Frame = +1 Query: 10 HLINKISDSYAFGSGSLDTPFRLEAEKIDLEMSRWFESRNLSFYHDEDQGILYLQFSMVN 189 HLIN+I DS+ FG+ L+ + I + WF+ R +S++H GIL+L FS V Sbjct: 78 HLINRILDSHVFGNN-------LDTKLISDKQEDWFKWRRISYFHQRQMGILFLHFSSVL 130 Query: 190 CTVAESVLSEERLGFESVLEE-RELGDLKGLIFMFSVCHVIILIQEGSRFDTRLLKKFRV 366 C + GF S +E+ + GDL+GL+FMFS EGSRFDT+LLKKFRV Sbjct: 131 CPGFDD-------GFGSAMEDDHDFGDLQGLLFMFS---------EGSRFDTQLLKKFRV 174 Query: 367 LQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXXXXXXXXXXXKIQGIQNRNASANTL 546 LQAAKH + PF+RSQ S L SRP SS SR + + I RN S +L Sbjct: 175 LQAAKHALAPFVRSQATSGLPSRPPSSSSSRSTKLTPASKSSSPGRGRNILTRNVSVVSL 234 Query: 547 MSGLGSYTSLLPGQCTPVVLFVFVDDFSET-FLGGNMEQ-XXXXXXXXXXXXXXXXXXGM 720 M GLGSYTSL PGQCTPV+LFVF+DDF + N+E+ + Sbjct: 235 MPGLGSYTSLFPGQCTPVMLFVFIDDFCDVPNPSCNVEESTNASLHSQSSSLSGLTRPNL 294 Query: 721 HTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 K S VVVLAR +KSEGG RKKLQSSLEAQ+ Sbjct: 295 PVKVSGPVVVLARSTSKSEGGFRKKLQSSLEAQV 328 >ref|XP_007016068.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508786431|gb|EOY33687.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1072 Score = 166 bits (420), Expect = 1e-38 Identities = 98/178 (55%), Positives = 115/178 (64%), Gaps = 2/178 (1%) Frame = +1 Query: 295 VCHVIILIQEGSRFDTRLLKKFRVLQAAKHTIVPFIRSQNISSLTSRPRSSVQSRIXXXX 474 VCH+II IQEGSRFDT+ LKKFRVLQAAKH + P+++S+ L SRP SS SR Sbjct: 3 VCHIIIYIQEGSRFDTQNLKKFRVLQAAKHALTPYVKSRTTPPLPSRPHSSSTSR-PSTI 61 Query: 475 XXXXXXXXXKIQGIQNRNASANTLMSGLGSYTSLLPGQCTPVVLFVFVDDFSETFLG-GN 651 + G+ RNASA +LMSGLGSYTSL PGQCTPV LFVF+DDFS+ N Sbjct: 62 ATTASTSPGRSGGMLGRNASAISLMSGLGSYTSLFPGQCTPVTLFVFIDDFSDVLNSTPN 121 Query: 652 MEQ-XXXXXXXXXXXXXXXXXXGMHTKGSSSVVVLARPVNKSEGGLRKKLQSSLEAQI 822 +E+ + KGS+SVVVLARPV+KSEG RKKLQSSLEAQI Sbjct: 122 IEESVETSSINHASNSSSLARPTLPMKGSASVVVLARPVSKSEGVFRKKLQSSLEAQI 179