BLASTX nr result
ID: Mentha24_contig00045583
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha24_contig00045583 (710 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU38127.1| hypothetical protein MIMGU_mgv1a0000291mg, partia... 260 4e-67 gb|EYU28553.1| hypothetical protein MIMGU_mgv1a024288mg, partial... 259 6e-67 ref|XP_003632363.1| PREDICTED: uncharacterized protein LOC100254... 247 2e-63 emb|CBI32314.3| unnamed protein product [Vitis vinifera] 247 2e-63 emb|CAN77825.1| hypothetical protein VITISV_015458 [Vitis vinifera] 238 1e-60 ref|XP_006397997.1| hypothetical protein EUTSA_v10001278mg [Eutr... 234 3e-59 ref|XP_007050709.1| Uncharacterized protein isoform 2, partial [... 233 4e-59 ref|XP_007050708.1| Uncharacterized protein isoform 1 [Theobroma... 233 4e-59 ref|XP_006293550.1| hypothetical protein CARUB_v10022493mg [Caps... 232 8e-59 ref|XP_006588615.1| PREDICTED: uncharacterized protein LOC100801... 231 1e-58 ref|XP_004290692.1| PREDICTED: uncharacterized protein LOC101301... 231 2e-58 ref|XP_007200947.1| hypothetical protein PRUPE_ppa000028mg [Prun... 230 3e-58 ref|XP_002882144.1| hypothetical protein ARALYDRAFT_322444 [Arab... 230 3e-58 ref|XP_004247483.1| PREDICTED: uncharacterized protein LOC101266... 229 5e-58 ref|XP_006358438.1| PREDICTED: uncharacterized protein LOC102605... 227 3e-57 ref|NP_182327.6| uncharacterized protein [Arabidopsis thaliana] ... 227 3e-57 gb|AAD13709.1| hypothetical protein [Arabidopsis thaliana] 227 3e-57 ref|XP_002524795.1| conserved hypothetical protein [Ricinus comm... 226 4e-57 ref|XP_006575095.1| PREDICTED: uncharacterized protein LOC100792... 226 8e-57 ref|XP_006575094.1| PREDICTED: uncharacterized protein LOC100792... 226 8e-57 >gb|EYU38127.1| hypothetical protein MIMGU_mgv1a0000291mg, partial [Mimulus guttatus] Length = 2016 Score = 260 bits (664), Expect = 4e-67 Identities = 138/239 (57%), Positives = 171/239 (71%), Gaps = 3/239 (1%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLRAKVLVIAACTLQYNVFHWLE+MP+S L G+SEEPCPLF+S +D T AS+ N Sbjct: 764 GVEAGLRAKVLVIAACTLQYNVFHWLEKMPASLLNAGKSEEPCPLFISEEDVST-ASTSN 822 Query: 530 GENQPLPENEMPEQRVGEGYSWPSPSHSPNFST-LRSTLSGSYRKHSLDYIWET--ENHN 360 G+ E+ Q+ SWP + ST + S+ S + RK+S YIW + E+H Sbjct: 823 GDR------EVSSQKTRSN-SWPFLTPDNYRSTEVSSSSSNTSRKYSFSYIWGSMKESHK 875 Query: 359 WKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXLNSISMLY 180 W KKRI+ALRQERF+MQKT LKVYL+FW+ENMFNLFGLEINMI LN+ISM Y Sbjct: 876 WNKKRIIALRQERFEMQKTMLKVYLKFWMENMFNLFGLEINMIALLLASFALLNAISMFY 935 Query: 179 IACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPTTHCHEC 3 IAC+ATC+LL R I KLWP+FV++ A ILL EYLAMWK+V+P + E + HCH+C Sbjct: 936 IACLATCVLLGRTIIRKLWPVFVVVFAAILLVEYLAMWKSVMPTT-----ETSAHCHDC 989 >gb|EYU28553.1| hypothetical protein MIMGU_mgv1a024288mg, partial [Mimulus guttatus] Length = 1430 Score = 259 bits (662), Expect = 6e-67 Identities = 136/239 (56%), Positives = 169/239 (70%), Gaps = 3/239 (1%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLRAKVLVIAACTLQYNVFHWLE+MP+S L G+SEEPCPLF+S +D T AS+ N Sbjct: 850 GVEAGLRAKVLVIAACTLQYNVFHWLEKMPASLLNAGKSEEPCPLFISEEDVST-ASTSN 908 Query: 530 GENQPLPENEMPEQRVGEGYSWPSPSHSPNFST-LRSTLSGSYRKHSLDYIWET--ENHN 360 G+ + + SWP + ST + S+ S + RK+S YIW + E+H Sbjct: 909 GDREESSQKTRSN-------SWPFLTPDNYRSTEVSSSSSNTSRKYSFSYIWGSMKESHK 961 Query: 359 WKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXLNSISMLY 180 W KKRI+ALRQERF+MQKT LKVYL+FW+ENMFNLFGLEINMI LN+ISM Y Sbjct: 962 WNKKRIIALRQERFEMQKTMLKVYLKFWMENMFNLFGLEINMIALLLASFALLNAISMFY 1021 Query: 179 IACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPTTHCHEC 3 IAC+ATC+LL R I KLWP+FV++ A ILL EYLAMWK+V+P + E + HCH+C Sbjct: 1022 IACLATCVLLGRTIIRKLWPVFVVVFAAILLVEYLAMWKSVMPTT-----ETSAHCHDC 1075 >ref|XP_003632363.1| PREDICTED: uncharacterized protein LOC100254568 [Vitis vinifera] Length = 2489 Score = 247 bits (631), Expect = 2e-63 Identities = 128/247 (51%), Positives = 168/247 (68%), Gaps = 11/247 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLR KVLVIAACTLQYNVFHWL++MPS+ L +G+ EEPCPLF+S +++ V S + Sbjct: 899 GIESGLRGKVLVIAACTLQYNVFHWLDKMPSTLLSMGKWEEPCPLFISEEETLPVVSVSS 958 Query: 530 GENQPLPENEM--PEQRVGEGYSWPS-------PSHSPNFSTLRSTLSGSYRKHSLDYIW 378 ++P ++ ++R YSWPS SH + T S SGS RK S + IW Sbjct: 959 EVSKPSSDSSSLSVKKRGVTSYSWPSFNFGLSQESHPVSSETAESGGSGS-RKFSFENIW 1017 Query: 377 ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204 + E+H W KKRI+AL++ERF+ QKTTLK+Y +FW+ENMFNLFGLEINMI Sbjct: 1018 GSTKESHKWNKKRILALKKERFETQKTTLKIYFKFWVENMFNLFGLEINMIALLLASFAL 1077 Query: 203 LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEP 24 N+ISMLYIA +A C+LL R I KLWP+F+ L A+IL+ EYLA+WKN+V LS + Sbjct: 1078 SNAISMLYIAALAACVLLNRHIIWKLWPVFIFLFASILILEYLALWKNMVSLSPDNPSDT 1137 Query: 23 TTHCHEC 3 HCH+C Sbjct: 1138 NLHCHDC 1144 >emb|CBI32314.3| unnamed protein product [Vitis vinifera] Length = 2409 Score = 247 bits (631), Expect = 2e-63 Identities = 128/247 (51%), Positives = 168/247 (68%), Gaps = 11/247 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLR KVLVIAACTLQYNVFHWL++MPS+ L +G+ EEPCPLF+S +++ V S + Sbjct: 881 GIESGLRGKVLVIAACTLQYNVFHWLDKMPSTLLSMGKWEEPCPLFISEEETLPVVSVSS 940 Query: 530 GENQPLPENEM--PEQRVGEGYSWPS-------PSHSPNFSTLRSTLSGSYRKHSLDYIW 378 ++P ++ ++R YSWPS SH + T S SGS RK S + IW Sbjct: 941 EVSKPSSDSSSLSVKKRGVTSYSWPSFNFGLSQESHPVSSETAESGGSGS-RKFSFENIW 999 Query: 377 ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204 + E+H W KKRI+AL++ERF+ QKTTLK+Y +FW+ENMFNLFGLEINMI Sbjct: 1000 GSTKESHKWNKKRILALKKERFETQKTTLKIYFKFWVENMFNLFGLEINMIALLLASFAL 1059 Query: 203 LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEP 24 N+ISMLYIA +A C+LL R I KLWP+F+ L A+IL+ EYLA+WKN+V LS + Sbjct: 1060 SNAISMLYIAALAACVLLNRHIIWKLWPVFIFLFASILILEYLALWKNMVSLSPDNPSDT 1119 Query: 23 TTHCHEC 3 HCH+C Sbjct: 1120 NLHCHDC 1126 >emb|CAN77825.1| hypothetical protein VITISV_015458 [Vitis vinifera] Length = 2393 Score = 238 bits (608), Expect = 1e-60 Identities = 124/233 (53%), Positives = 162/233 (69%), Gaps = 11/233 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLR KVLVIAACTLQYNVFHWL++MPS+ L +G+ EEPCPLF+S +++ V S + Sbjct: 811 GIESGLRGKVLVIAACTLQYNVFHWLDKMPSTLLSMGKWEEPCPLFISEEETLPVVSVSS 870 Query: 530 GENQPLPENEMP--EQRVGEGYSWPS-------PSHSPNFSTLRSTLSGSYRKHSLDYIW 378 ++P ++ ++R YSWPS SH + T S SGS RK S + IW Sbjct: 871 EVSKPSSDSSSXSVKKRGVTSYSWPSFNFGLSQESHPVSSETAESGGSGS-RKFSFENIW 929 Query: 377 ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204 + E+H W KKRI+AL++ERF+ QKTTLK+Y +FW+ENMFNLFGLEINMI Sbjct: 930 GSTKESHKWNKKRILALKKERFETQKTTLKIYFKFWVENMFNLFGLEINMIALLLASFAL 989 Query: 203 LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLS 45 N+ISMLYIA +A C+LL R I KLWP+F+ L A+IL+ EYLA+WKN+V LS Sbjct: 990 SNAISMLYIAALAACVLLNRHIIWKLWPVFIFLFASILILEYLALWKNMVSLS 1042 >ref|XP_006397997.1| hypothetical protein EUTSA_v10001278mg [Eutrema salsugineum] gi|557099070|gb|ESQ39450.1| hypothetical protein EUTSA_v10001278mg [Eutrema salsugineum] Length = 2511 Score = 234 bits (596), Expect = 3e-59 Identities = 122/246 (49%), Positives = 157/246 (63%), Gaps = 10/246 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLR KVLV+AACTLQYNVF WLER P + G+ EEPCPLFVSA+D+ SS N Sbjct: 923 GIESGLRGKVLVVAACTLQYNVFRWLERTPGLTIIKGKYEEPCPLFVSAEDTTASVSSSN 982 Query: 530 GENQPLPENEMPEQRVGEGYSWPSPSHSPNFSTLRSTL--------SGSYRKHSLDYIWE 375 GEN E+ + GE S P SP + +L SGS RK S + W Sbjct: 983 GENPSSTEHASISMKQGEATSNSWPFFSPRDNQAAGSLHPKTGGSESGSSRKFSFGHFWG 1042 Query: 374 T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201 + E+H W ++RI+AL++ERF+ QK LK+YL+FWIENMFNL+GLEINMI L Sbjct: 1043 SIKESHRWNRRRILALKKERFETQKNLLKIYLKFWIENMFNLYGLEINMIALLLASFALL 1102 Query: 200 NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21 N+IS++YIA +A C+LL R I KLWP+ V L A+IL EY+A W N +P S + E + Sbjct: 1103 NAISLVYIALLAACVLLRRRLIQKLWPVVVFLFASILAIEYVATWNNSLP-SDQAPSETS 1161 Query: 20 THCHEC 3 HCH+C Sbjct: 1162 VHCHDC 1167 >ref|XP_007050709.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] gi|508702970|gb|EOX94866.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 1777 Score = 233 bits (595), Expect = 4e-59 Identities = 121/246 (49%), Positives = 160/246 (65%), Gaps = 10/246 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLR KVLVIAAC QYN+F WL+ MPS G+ EEPCPLF+SA+D+FT N Sbjct: 668 GIESGLRGKVLVIAACIFQYNIFRWLDNMPSGISNKGKWEEPCPLFLSAEDTFTNGFMSN 727 Query: 530 GENQPLPE-NEMP---EQRVGEGYSWPSPSHSPNFSTLRSTLSGS----YRKHSLDYIWE 375 GE +P +P ++ V + +S SP+ S + S GS +RK S Y W Sbjct: 728 GEEKPSSSFGAVPIRQDRAVSDSWSSLSPAFSQAPHPVSSKAGGSEVSSFRKFSFGYFWG 787 Query: 374 T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201 + E+H W KKRI+ALR+ERF+ QK LK+YL+FW+ENMFNL+GLEINMI L Sbjct: 788 STKESHKWNKKRILALRKERFETQKALLKIYLKFWMENMFNLYGLEINMIALLLASFALL 847 Query: 200 NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21 N+ISMLYI+ +A C+LL R I KLWP+ V L A+IL+ EY A+WKN+ PL+ + + Sbjct: 848 NAISMLYISLLAVCVLLNRRIIRKLWPVLVFLFASILILEYFAIWKNMFPLNQKKPSQAE 907 Query: 20 THCHEC 3 HCH+C Sbjct: 908 IHCHDC 913 >ref|XP_007050708.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508702969|gb|EOX94865.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 2501 Score = 233 bits (595), Expect = 4e-59 Identities = 121/246 (49%), Positives = 160/246 (65%), Gaps = 10/246 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLR KVLVIAAC QYN+F WL+ MPS G+ EEPCPLF+SA+D+FT N Sbjct: 915 GIESGLRGKVLVIAACIFQYNIFRWLDNMPSGISNKGKWEEPCPLFLSAEDTFTNGFMSN 974 Query: 530 GENQPLPE-NEMP---EQRVGEGYSWPSPSHSPNFSTLRSTLSGS----YRKHSLDYIWE 375 GE +P +P ++ V + +S SP+ S + S GS +RK S Y W Sbjct: 975 GEEKPSSSFGAVPIRQDRAVSDSWSSLSPAFSQAPHPVSSKAGGSEVSSFRKFSFGYFWG 1034 Query: 374 T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201 + E+H W KKRI+ALR+ERF+ QK LK+YL+FW+ENMFNL+GLEINMI L Sbjct: 1035 STKESHKWNKKRILALRKERFETQKALLKIYLKFWMENMFNLYGLEINMIALLLASFALL 1094 Query: 200 NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21 N+ISMLYI+ +A C+LL R I KLWP+ V L A+IL+ EY A+WKN+ PL+ + + Sbjct: 1095 NAISMLYISLLAVCVLLNRRIIRKLWPVLVFLFASILILEYFAIWKNMFPLNQKKPSQAE 1154 Query: 20 THCHEC 3 HCH+C Sbjct: 1155 IHCHDC 1160 >ref|XP_006293550.1| hypothetical protein CARUB_v10022493mg [Capsella rubella] gi|482562258|gb|EOA26448.1| hypothetical protein CARUB_v10022493mg [Capsella rubella] Length = 2485 Score = 232 bits (592), Expect = 8e-59 Identities = 124/246 (50%), Positives = 159/246 (64%), Gaps = 10/246 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLR KVLV+AACTLQYNVF WLER P N+ G+ EEPCPLFVSA+D+ SS N Sbjct: 897 GIESGLRGKVLVVAACTLQYNVFRWLERTPGLNIIKGKYEEPCPLFVSAEDTTASVSSSN 956 Query: 530 GENQPLPENEMPEQRVGEGYS--WP--SPSHSPNFSTLR----STLSGSYRKHSLDYIWE 375 GEN + + GEG S WP S S LR + SGS R+ S + W Sbjct: 957 GENSSSTPHASISTKQGEGTSNSWPFLSTRDSQAAGFLRPKTGGSESGSSRRFSFGHFWG 1016 Query: 374 T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201 + E+H W ++RI+AL++ERF+ QK LK+YL+FWIENMFNL+GLEINMI L Sbjct: 1017 SIKESHRWNRRRILALKKERFETQKNLLKIYLKFWIENMFNLYGLEINMIALLLASFALL 1076 Query: 200 NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21 N+ISM+YIA +A C+LL R I KLWP+ V L A+IL EY+A W + +P S + E + Sbjct: 1077 NAISMVYIALLAACVLLRRRLIQKLWPVVVFLFASILAIEYVATWNSFLP-SDQAPSETS 1135 Query: 20 THCHEC 3 HCH+C Sbjct: 1136 VHCHDC 1141 >ref|XP_006588615.1| PREDICTED: uncharacterized protein LOC100801841 [Glycine max] Length = 2483 Score = 231 bits (590), Expect = 1e-58 Identities = 126/247 (51%), Positives = 163/247 (65%), Gaps = 11/247 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 GL+ GLR KVLVI ACTLQYNVFHWLERMP++ L GQ EEPCPLFV +D+F + N Sbjct: 897 GLESGLRGKVLVIVACTLQYNVFHWLERMPNTVLSKGQWEEPCPLFVPTEDAFIDDAKCN 956 Query: 530 GENQPLPENEMPEQRVGEGYSWPSP-------SHSPNF--STLRSTLSGSYRKHSLDYIW 378 E++ +++P + EG S S S +P+ S + S +K+S +IW Sbjct: 957 EESKSSYNSQLPSA-IKEGVSGNSLQIITSGLSQAPDTPSSKTEGSSDSSSKKYSFGFIW 1015 Query: 377 ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204 + E+H W KKRIVALR+ERF+ QKT LKVYL+FW+EN FNLFGLEINMI Sbjct: 1016 GSSKESHKWNKKRIVALRKERFETQKTVLKVYLKFWMENTFNLFGLEINMISLLLVSFAL 1075 Query: 203 LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEP 24 LN++SMLYIA +A C+LL R I K+WPIFV L A+IL+ EYLA+WK+++PL+ E Sbjct: 1076 LNALSMLYIALLAACVLLNRHIIRKVWPIFVFLFASILILEYLAIWKDMLPLNSHASSE- 1134 Query: 23 TTHCHEC 3 C +C Sbjct: 1135 -IRCRDC 1140 >ref|XP_004290692.1| PREDICTED: uncharacterized protein LOC101301158 [Fragaria vesca subsp. vesca] Length = 2451 Score = 231 bits (588), Expect = 2e-58 Identities = 125/244 (51%), Positives = 161/244 (65%), Gaps = 8/244 (3%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 GL+ GLR KVLVIAACTLQYNVFHWLERMPS+ L G E PCPLF+SA+D+ A+ + Sbjct: 875 GLESGLRGKVLVIAACTLQYNVFHWLERMPSTILSKGMGE-PCPLFLSAEDTNISATIPS 933 Query: 530 GENQPLPENEMPEQRVGEGYSWP--SPS----HSPNFSTLRSTLSGSYRKHSLDYIWET- 372 +N+P + +Q +SWP SPS H+P+ ++ S K+S YIW + Sbjct: 934 EDNRPSTSFSV-KQEGARSHSWPFFSPSLLHSHNPSSPKAGTSKGSSSGKYSFGYIWGST 992 Query: 371 -ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXLNS 195 E+H W KKRI+AL++ERF+ QK K+Y++FW+ENMFNLFGLEINMI LN+ Sbjct: 993 KESHKWNKKRILALQKERFETQKLISKIYIKFWLENMFNLFGLEINMIALLLASFALLNA 1052 Query: 194 ISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPTTH 15 ISMLYIA +A CI+L R I KLWP FV L A+IL+ EY A+WK+ P +H P Sbjct: 1053 ISMLYIALLAACIILNRQIIRKLWPTFVFLFASILILEYFAIWKSTWPPNHPDATNPC-- 1110 Query: 14 CHEC 3 CH+C Sbjct: 1111 CHDC 1114 >ref|XP_007200947.1| hypothetical protein PRUPE_ppa000028mg [Prunus persica] gi|462396347|gb|EMJ02146.1| hypothetical protein PRUPE_ppa000028mg [Prunus persica] Length = 2388 Score = 230 bits (587), Expect = 3e-58 Identities = 125/246 (50%), Positives = 163/246 (66%), Gaps = 10/246 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 GL+ GLR KVLVIAACTLQYNVF WLE+MPS+ L G+ EEPCPLFVSA+D+ +S + Sbjct: 801 GLEFGLRGKVLVIAACTLQYNVFRWLEKMPSTILNKGKWEEPCPLFVSAEDANINSSIPS 860 Query: 530 GENQPLPENE-MPEQRVG-EGYSWP------SPSHSPNFSTLRSTLSGSYRKHSLDYIWE 375 EN+ ++E + +R G +SWP S SH+P + S K+S YIW Sbjct: 861 EENKQSTDSEALSVKREGARSHSWPFFSPGLSESHNPMSPRAGGSEGSSSNKYSFGYIWG 920 Query: 374 T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201 + E+H W KKRI+ LR+ERF+ QK K+YL+FW+ENMFNLFGLEINMI L Sbjct: 921 STKESHKWNKKRILTLRKERFETQKLISKIYLKFWMENMFNLFGLEINMIALLLASFALL 980 Query: 200 NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21 N+IS++YIA +ATCI+L R I K+WPI V L A+IL+ EY A+WK++ P +H E Sbjct: 981 NAISLVYIALLATCIILNRHIIRKIWPILVFLFASILILEYFAIWKSMWPSNHP--DETN 1038 Query: 20 THCHEC 3 CH+C Sbjct: 1039 ARCHDC 1044 >ref|XP_002882144.1| hypothetical protein ARALYDRAFT_322444 [Arabidopsis lyrata subsp. lyrata] gi|297327983|gb|EFH58403.1| hypothetical protein ARALYDRAFT_322444 [Arabidopsis lyrata subsp. lyrata] Length = 1473 Score = 230 bits (587), Expect = 3e-58 Identities = 122/246 (49%), Positives = 158/246 (64%), Gaps = 10/246 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLR KVLV+AACTLQYNVF WLER P + G+ EEPCPLFVSA+D+ SS N Sbjct: 239 GIESGLRGKVLVVAACTLQYNVFRWLERTPGLTVIKGKYEEPCPLFVSAEDTTASVSSSN 298 Query: 530 GENQPLPENEMPEQRVGEGYS--WP--SPSHSPNFSTLR----STLSGSYRKHSLDYIWE 375 GEN ++ + GE S WP SP + L + SGS RK S + W Sbjct: 299 GENPSSTDHASISMKQGEATSNSWPFFSPRDNQGAGFLHPKTGGSESGSSRKFSFGHFWG 358 Query: 374 T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201 + E+H W ++RI+AL++ERF+ QK LK+YL+FWIENMFNL+GLEINMI L Sbjct: 359 SIKESHRWNRRRILALKKERFETQKNLLKIYLKFWIENMFNLYGLEINMIALLLASFALL 418 Query: 200 NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21 N+ISM+YIA +A C+LL R I KLWP+ V L A+IL EY+A W + +P S + E + Sbjct: 419 NAISMVYIALLAACVLLRRRLIQKLWPVVVFLFASILAIEYVATWNSFLP-SDQAPSETS 477 Query: 20 THCHEC 3 HCH+C Sbjct: 478 VHCHDC 483 >ref|XP_004247483.1| PREDICTED: uncharacterized protein LOC101266159 [Solanum lycopersicum] Length = 2450 Score = 229 bits (585), Expect = 5e-58 Identities = 128/248 (51%), Positives = 165/248 (66%), Gaps = 12/248 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 GL+ GLRAKVLV+AACTLQYNVFHWLE+MP+S L +SEEPCPLFVS +D + + Sbjct: 873 GLEAGLRAKVLVVAACTLQYNVFHWLEKMPASLLNDNRSEEPCPLFVSEEDVMPLVP--D 930 Query: 530 GENQPLPE-NEMPEQRVGEGYSWPSPSHSPNFSTLRSTLSGS-----YR---KHSLDYIW 378 GEN+P+ + NE Q + S P + +S S YR K+S IW Sbjct: 931 GENKPVADSNEFSTQGMRTS-SKSCPYFDQSLYQSSDGVSSSRGVSEYRSRSKYSFGSIW 989 Query: 377 ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204 + E+H W KK +V+LR+ER MQKTTLK+YL+FW+ENMFNLFGLEINM+ Sbjct: 990 GSRKESHKWNKKLVVSLRKERLVMQKTTLKIYLKFWVENMFNLFGLEINMLALLLTSFAL 1049 Query: 203 LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLS-HRVLVE 27 LN++S++YIA +A+C+LL R I K+WPIFVLL ILL EY AMWK+++PL+ HR Sbjct: 1050 LNAVSLIYIALLASCVLLERRIIRKVWPIFVLLFTLILLLEYFAMWKSLMPLNQHR--PN 1107 Query: 26 PTTHCHEC 3 T HCH+C Sbjct: 1108 QTVHCHDC 1115 >ref|XP_006358438.1| PREDICTED: uncharacterized protein LOC102605335 [Solanum tuberosum] Length = 2473 Score = 227 bits (579), Expect = 3e-57 Identities = 127/248 (51%), Positives = 164/248 (66%), Gaps = 12/248 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 GL+ GLRAKVLV+AACTLQYNVFHWLE+MP+S L +SEEPCPLFVS +D + + Sbjct: 896 GLEAGLRAKVLVVAACTLQYNVFHWLEKMPTSLLNGNKSEEPCPLFVSEEDVMPLVP--D 953 Query: 530 GENQPLPE-NEMPEQRVGEGYSWPSPSHSPNFSTLRSTLSGS-----YR---KHSLDYIW 378 EN+P+ + NE Q + S P + +S S YR K+S IW Sbjct: 954 EENKPVADSNEFSTQGMRTS-SKSCPYFDQSLYQSSDGVSSSRGVSEYRSRSKYSFGSIW 1012 Query: 377 ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204 + E+H W KK +V+LR+ER +MQKTTLK+YL+FW+ENMFNLFGLEINM+ Sbjct: 1013 GSRKESHKWNKKLVVSLRKERLEMQKTTLKIYLKFWVENMFNLFGLEINMLALLLTSFAL 1072 Query: 203 LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLS-HRVLVE 27 LN++S+LYIA +A+C+LL R I K+WPIFVLL ILL EY AMWK+++PL+ HR Sbjct: 1073 LNAVSLLYIALLASCVLLERRIIRKVWPIFVLLFTLILLLEYFAMWKSLMPLNQHR--PN 1130 Query: 26 PTTHCHEC 3 HCH+C Sbjct: 1131 QAVHCHDC 1138 >ref|NP_182327.6| uncharacterized protein [Arabidopsis thaliana] gi|330255833|gb|AEC10927.1| uncharacterized protein AT2G48060 [Arabidopsis thaliana] Length = 2462 Score = 227 bits (579), Expect = 3e-57 Identities = 121/246 (49%), Positives = 157/246 (63%), Gaps = 10/246 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLR KVLV+AACTLQYNVF WLER + G+ EEPCPLFVSA+D+ SS N Sbjct: 874 GIESGLRGKVLVVAACTLQYNVFRWLERTSGLTVIKGKYEEPCPLFVSAEDTTASVSSSN 933 Query: 530 GENQPLPENEMPEQRVGEGYS--WP--SPSHSPNFSTLR----STLSGSYRKHSLDYIWE 375 GEN ++ + GE S WP SP + L + SGS RK S + W Sbjct: 934 GENPSSTDHASISMKQGEATSNSWPFFSPRGNQGAGFLHPKTGGSESGSSRKFSFGHFWG 993 Query: 374 T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201 + E+H W ++RI+AL++ERF+ QK LK+YL+FWIENMFNL+GLEINMI L Sbjct: 994 SIKESHRWNRRRILALKKERFETQKNLLKIYLKFWIENMFNLYGLEINMIALLLASFALL 1053 Query: 200 NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21 N+ISM+YIA +A C+LL R I KLWP+ V L A+IL EY+A W + +P S + E + Sbjct: 1054 NAISMVYIALLAACVLLRRRVIQKLWPVVVFLFASILAIEYVATWNSFLP-SDQAPSETS 1112 Query: 20 THCHEC 3 HCH+C Sbjct: 1113 VHCHDC 1118 >gb|AAD13709.1| hypothetical protein [Arabidopsis thaliana] Length = 1500 Score = 227 bits (579), Expect = 3e-57 Identities = 121/246 (49%), Positives = 157/246 (63%), Gaps = 10/246 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 G++ GLR KVLV+AACTLQYNVF WLER + G+ EEPCPLFVSA+D+ SS N Sbjct: 266 GIESGLRGKVLVVAACTLQYNVFRWLERTSGLTVIKGKYEEPCPLFVSAEDTTASVSSSN 325 Query: 530 GENQPLPENEMPEQRVGEGYS--WP--SPSHSPNFSTLR----STLSGSYRKHSLDYIWE 375 GEN ++ + GE S WP SP + L + SGS RK S + W Sbjct: 326 GENPSSTDHASISMKQGEATSNSWPFFSPRGNQGAGFLHPKTGGSESGSSRKFSFGHFWG 385 Query: 374 T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201 + E+H W ++RI+AL++ERF+ QK LK+YL+FWIENMFNL+GLEINMI L Sbjct: 386 SIKESHRWNRRRILALKKERFETQKNLLKIYLKFWIENMFNLYGLEINMIALLLASFALL 445 Query: 200 NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21 N+ISM+YIA +A C+LL R I KLWP+ V L A+IL EY+A W + +P S + E + Sbjct: 446 NAISMVYIALLAACVLLRRRVIQKLWPVVVFLFASILAIEYVATWNSFLP-SDQAPSETS 504 Query: 20 THCHEC 3 HCH+C Sbjct: 505 VHCHDC 510 >ref|XP_002524795.1| conserved hypothetical protein [Ricinus communis] gi|223535979|gb|EEF37638.1| conserved hypothetical protein [Ricinus communis] Length = 2254 Score = 226 bits (577), Expect = 4e-57 Identities = 123/247 (49%), Positives = 158/247 (63%), Gaps = 11/247 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 GL+ GLR KVLVIAACTLQYNVF WL +MP++ G+ EEPCPLFVS +++F S +N Sbjct: 722 GLESGLRGKVLVIAACTLQYNVFRWLGKMPNTFPDKGKWEEPCPLFVSDENAFANGSIIN 781 Query: 530 GENQPLPENEMPEQR---------VGEGYSWPSPSHSPNFSTLRSTLSGSYRKHSLDYIW 378 EN+ E +P + S+ P H+ + T S SG+ R S YIW Sbjct: 782 DENKAPSEYNVPSVKKETVTATSTFSFTSSFTQPPHTFSNKTGSSVGSGT-RIFSFGYIW 840 Query: 377 ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204 + E+H W +KRI+ALR+ERF+ QK LK+YL+FWIENMFNLFGLEINMI Sbjct: 841 GSTKESHKWNRKRILALRKERFETQKALLKIYLKFWIENMFNLFGLEINMIALLLASFTL 900 Query: 203 LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEP 24 LN+I+MLYIA +A CIL+ R I KLWPI V L A+IL+ EY A+WK++ PL+ E Sbjct: 901 LNAIAMLYIALLAACILVSRHIIRKLWPIVVTLFASILILEYFAIWKSIFPLNQHAPSET 960 Query: 23 TTHCHEC 3 +CH C Sbjct: 961 DIYCHNC 967 >ref|XP_006575095.1| PREDICTED: uncharacterized protein LOC100792646 isoform X4 [Glycine max] Length = 2173 Score = 226 bits (575), Expect = 8e-57 Identities = 126/248 (50%), Positives = 161/248 (64%), Gaps = 12/248 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 GL+ GLR KVLVI ACTLQYNVF WLERMP++ L GQ EEPCPLFV +D F + N Sbjct: 897 GLESGLRGKVLVIVACTLQYNVFRWLERMPNTVLSKGQWEEPCPLFVPTEDVFIDDAMCN 956 Query: 530 GENQPLPENEMPEQRVGEGYSWPSPS----------HSPNFSTLRSTLSGSYRKHSLDYI 381 E++ + +P + EG S S +P+ T S+ S S +K+S +I Sbjct: 957 EESKSSYNSNLPSA-IKEGVSGKSLQIITSGLSQALDTPSSKTGDSSDSSS-KKYSFGFI 1014 Query: 380 WET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXX 207 W + E+ W KKRIVALR+ERF+ QKT LKVYL+FW+EN FNLFGLEINMI Sbjct: 1015 WGSSKESQKWNKKRIVALRKERFETQKTVLKVYLKFWMENTFNLFGLEINMISLLLVSFA 1074 Query: 206 XLNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVE 27 LN+ISM+YIA +A C+LL R I K+WPIFV L A+IL+ EYLA+WK+++PL+ E Sbjct: 1075 LLNAISMMYIALLAACVLLNRHIICKVWPIFVFLFASILILEYLAIWKDMLPLNSHASSE 1134 Query: 26 PTTHCHEC 3 CH+C Sbjct: 1135 --IRCHDC 1140 >ref|XP_006575094.1| PREDICTED: uncharacterized protein LOC100792646 isoform X3 [Glycine max] Length = 2220 Score = 226 bits (575), Expect = 8e-57 Identities = 126/248 (50%), Positives = 161/248 (64%), Gaps = 12/248 (4%) Frame = -1 Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531 GL+ GLR KVLVI ACTLQYNVF WLERMP++ L GQ EEPCPLFV +D F + N Sbjct: 635 GLESGLRGKVLVIVACTLQYNVFRWLERMPNTVLSKGQWEEPCPLFVPTEDVFIDDAMCN 694 Query: 530 GENQPLPENEMPEQRVGEGYSWPSPS----------HSPNFSTLRSTLSGSYRKHSLDYI 381 E++ + +P + EG S S +P+ T S+ S S +K+S +I Sbjct: 695 EESKSSYNSNLPSA-IKEGVSGKSLQIITSGLSQALDTPSSKTGDSSDSSS-KKYSFGFI 752 Query: 380 WET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXX 207 W + E+ W KKRIVALR+ERF+ QKT LKVYL+FW+EN FNLFGLEINMI Sbjct: 753 WGSSKESQKWNKKRIVALRKERFETQKTVLKVYLKFWMENTFNLFGLEINMISLLLVSFA 812 Query: 206 XLNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVE 27 LN+ISM+YIA +A C+LL R I K+WPIFV L A+IL+ EYLA+WK+++PL+ E Sbjct: 813 LLNAISMMYIALLAACVLLNRHIICKVWPIFVFLFASILILEYLAIWKDMLPLNSHASSE 872 Query: 26 PTTHCHEC 3 CH+C Sbjct: 873 --IRCHDC 878