BLASTX nr result

ID: Mentha24_contig00045583 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00045583
         (710 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38127.1| hypothetical protein MIMGU_mgv1a0000291mg, partia...   260   4e-67
gb|EYU28553.1| hypothetical protein MIMGU_mgv1a024288mg, partial...   259   6e-67
ref|XP_003632363.1| PREDICTED: uncharacterized protein LOC100254...   247   2e-63
emb|CBI32314.3| unnamed protein product [Vitis vinifera]              247   2e-63
emb|CAN77825.1| hypothetical protein VITISV_015458 [Vitis vinifera]   238   1e-60
ref|XP_006397997.1| hypothetical protein EUTSA_v10001278mg [Eutr...   234   3e-59
ref|XP_007050709.1| Uncharacterized protein isoform 2, partial [...   233   4e-59
ref|XP_007050708.1| Uncharacterized protein isoform 1 [Theobroma...   233   4e-59
ref|XP_006293550.1| hypothetical protein CARUB_v10022493mg [Caps...   232   8e-59
ref|XP_006588615.1| PREDICTED: uncharacterized protein LOC100801...   231   1e-58
ref|XP_004290692.1| PREDICTED: uncharacterized protein LOC101301...   231   2e-58
ref|XP_007200947.1| hypothetical protein PRUPE_ppa000028mg [Prun...   230   3e-58
ref|XP_002882144.1| hypothetical protein ARALYDRAFT_322444 [Arab...   230   3e-58
ref|XP_004247483.1| PREDICTED: uncharacterized protein LOC101266...   229   5e-58
ref|XP_006358438.1| PREDICTED: uncharacterized protein LOC102605...   227   3e-57
ref|NP_182327.6| uncharacterized protein [Arabidopsis thaliana] ...   227   3e-57
gb|AAD13709.1| hypothetical protein [Arabidopsis thaliana]            227   3e-57
ref|XP_002524795.1| conserved hypothetical protein [Ricinus comm...   226   4e-57
ref|XP_006575095.1| PREDICTED: uncharacterized protein LOC100792...   226   8e-57
ref|XP_006575094.1| PREDICTED: uncharacterized protein LOC100792...   226   8e-57

>gb|EYU38127.1| hypothetical protein MIMGU_mgv1a0000291mg, partial [Mimulus guttatus]
          Length = 2016

 Score =  260 bits (664), Expect = 4e-67
 Identities = 138/239 (57%), Positives = 171/239 (71%), Gaps = 3/239 (1%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLRAKVLVIAACTLQYNVFHWLE+MP+S L  G+SEEPCPLF+S +D  T AS+ N
Sbjct: 764  GVEAGLRAKVLVIAACTLQYNVFHWLEKMPASLLNAGKSEEPCPLFISEEDVST-ASTSN 822

Query: 530  GENQPLPENEMPEQRVGEGYSWPSPSHSPNFST-LRSTLSGSYRKHSLDYIWET--ENHN 360
            G+       E+  Q+     SWP  +     ST + S+ S + RK+S  YIW +  E+H 
Sbjct: 823  GDR------EVSSQKTRSN-SWPFLTPDNYRSTEVSSSSSNTSRKYSFSYIWGSMKESHK 875

Query: 359  WKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXLNSISMLY 180
            W KKRI+ALRQERF+MQKT LKVYL+FW+ENMFNLFGLEINMI         LN+ISM Y
Sbjct: 876  WNKKRIIALRQERFEMQKTMLKVYLKFWMENMFNLFGLEINMIALLLASFALLNAISMFY 935

Query: 179  IACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPTTHCHEC 3
            IAC+ATC+LL R  I KLWP+FV++ A ILL EYLAMWK+V+P +     E + HCH+C
Sbjct: 936  IACLATCVLLGRTIIRKLWPVFVVVFAAILLVEYLAMWKSVMPTT-----ETSAHCHDC 989


>gb|EYU28553.1| hypothetical protein MIMGU_mgv1a024288mg, partial [Mimulus guttatus]
          Length = 1430

 Score =  259 bits (662), Expect = 6e-67
 Identities = 136/239 (56%), Positives = 169/239 (70%), Gaps = 3/239 (1%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLRAKVLVIAACTLQYNVFHWLE+MP+S L  G+SEEPCPLF+S +D  T AS+ N
Sbjct: 850  GVEAGLRAKVLVIAACTLQYNVFHWLEKMPASLLNAGKSEEPCPLFISEEDVST-ASTSN 908

Query: 530  GENQPLPENEMPEQRVGEGYSWPSPSHSPNFST-LRSTLSGSYRKHSLDYIWET--ENHN 360
            G+ +   +            SWP  +     ST + S+ S + RK+S  YIW +  E+H 
Sbjct: 909  GDREESSQKTRSN-------SWPFLTPDNYRSTEVSSSSSNTSRKYSFSYIWGSMKESHK 961

Query: 359  WKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXLNSISMLY 180
            W KKRI+ALRQERF+MQKT LKVYL+FW+ENMFNLFGLEINMI         LN+ISM Y
Sbjct: 962  WNKKRIIALRQERFEMQKTMLKVYLKFWMENMFNLFGLEINMIALLLASFALLNAISMFY 1021

Query: 179  IACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPTTHCHEC 3
            IAC+ATC+LL R  I KLWP+FV++ A ILL EYLAMWK+V+P +     E + HCH+C
Sbjct: 1022 IACLATCVLLGRTIIRKLWPVFVVVFAAILLVEYLAMWKSVMPTT-----ETSAHCHDC 1075


>ref|XP_003632363.1| PREDICTED: uncharacterized protein LOC100254568 [Vitis vinifera]
          Length = 2489

 Score =  247 bits (631), Expect = 2e-63
 Identities = 128/247 (51%), Positives = 168/247 (68%), Gaps = 11/247 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLR KVLVIAACTLQYNVFHWL++MPS+ L +G+ EEPCPLF+S +++  V S  +
Sbjct: 899  GIESGLRGKVLVIAACTLQYNVFHWLDKMPSTLLSMGKWEEPCPLFISEEETLPVVSVSS 958

Query: 530  GENQPLPENEM--PEQRVGEGYSWPS-------PSHSPNFSTLRSTLSGSYRKHSLDYIW 378
              ++P  ++     ++R    YSWPS        SH  +  T  S  SGS RK S + IW
Sbjct: 959  EVSKPSSDSSSLSVKKRGVTSYSWPSFNFGLSQESHPVSSETAESGGSGS-RKFSFENIW 1017

Query: 377  ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204
             +  E+H W KKRI+AL++ERF+ QKTTLK+Y +FW+ENMFNLFGLEINMI         
Sbjct: 1018 GSTKESHKWNKKRILALKKERFETQKTTLKIYFKFWVENMFNLFGLEINMIALLLASFAL 1077

Query: 203  LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEP 24
             N+ISMLYIA +A C+LL R  I KLWP+F+ L A+IL+ EYLA+WKN+V LS     + 
Sbjct: 1078 SNAISMLYIAALAACVLLNRHIIWKLWPVFIFLFASILILEYLALWKNMVSLSPDNPSDT 1137

Query: 23   TTHCHEC 3
              HCH+C
Sbjct: 1138 NLHCHDC 1144


>emb|CBI32314.3| unnamed protein product [Vitis vinifera]
          Length = 2409

 Score =  247 bits (631), Expect = 2e-63
 Identities = 128/247 (51%), Positives = 168/247 (68%), Gaps = 11/247 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLR KVLVIAACTLQYNVFHWL++MPS+ L +G+ EEPCPLF+S +++  V S  +
Sbjct: 881  GIESGLRGKVLVIAACTLQYNVFHWLDKMPSTLLSMGKWEEPCPLFISEEETLPVVSVSS 940

Query: 530  GENQPLPENEM--PEQRVGEGYSWPS-------PSHSPNFSTLRSTLSGSYRKHSLDYIW 378
              ++P  ++     ++R    YSWPS        SH  +  T  S  SGS RK S + IW
Sbjct: 941  EVSKPSSDSSSLSVKKRGVTSYSWPSFNFGLSQESHPVSSETAESGGSGS-RKFSFENIW 999

Query: 377  ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204
             +  E+H W KKRI+AL++ERF+ QKTTLK+Y +FW+ENMFNLFGLEINMI         
Sbjct: 1000 GSTKESHKWNKKRILALKKERFETQKTTLKIYFKFWVENMFNLFGLEINMIALLLASFAL 1059

Query: 203  LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEP 24
             N+ISMLYIA +A C+LL R  I KLWP+F+ L A+IL+ EYLA+WKN+V LS     + 
Sbjct: 1060 SNAISMLYIAALAACVLLNRHIIWKLWPVFIFLFASILILEYLALWKNMVSLSPDNPSDT 1119

Query: 23   TTHCHEC 3
              HCH+C
Sbjct: 1120 NLHCHDC 1126


>emb|CAN77825.1| hypothetical protein VITISV_015458 [Vitis vinifera]
          Length = 2393

 Score =  238 bits (608), Expect = 1e-60
 Identities = 124/233 (53%), Positives = 162/233 (69%), Gaps = 11/233 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLR KVLVIAACTLQYNVFHWL++MPS+ L +G+ EEPCPLF+S +++  V S  +
Sbjct: 811  GIESGLRGKVLVIAACTLQYNVFHWLDKMPSTLLSMGKWEEPCPLFISEEETLPVVSVSS 870

Query: 530  GENQPLPENEMP--EQRVGEGYSWPS-------PSHSPNFSTLRSTLSGSYRKHSLDYIW 378
              ++P  ++     ++R    YSWPS        SH  +  T  S  SGS RK S + IW
Sbjct: 871  EVSKPSSDSSSXSVKKRGVTSYSWPSFNFGLSQESHPVSSETAESGGSGS-RKFSFENIW 929

Query: 377  ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204
             +  E+H W KKRI+AL++ERF+ QKTTLK+Y +FW+ENMFNLFGLEINMI         
Sbjct: 930  GSTKESHKWNKKRILALKKERFETQKTTLKIYFKFWVENMFNLFGLEINMIALLLASFAL 989

Query: 203  LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLS 45
             N+ISMLYIA +A C+LL R  I KLWP+F+ L A+IL+ EYLA+WKN+V LS
Sbjct: 990  SNAISMLYIAALAACVLLNRHIIWKLWPVFIFLFASILILEYLALWKNMVSLS 1042


>ref|XP_006397997.1| hypothetical protein EUTSA_v10001278mg [Eutrema salsugineum]
            gi|557099070|gb|ESQ39450.1| hypothetical protein
            EUTSA_v10001278mg [Eutrema salsugineum]
          Length = 2511

 Score =  234 bits (596), Expect = 3e-59
 Identities = 122/246 (49%), Positives = 157/246 (63%), Gaps = 10/246 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLR KVLV+AACTLQYNVF WLER P   +  G+ EEPCPLFVSA+D+    SS N
Sbjct: 923  GIESGLRGKVLVVAACTLQYNVFRWLERTPGLTIIKGKYEEPCPLFVSAEDTTASVSSSN 982

Query: 530  GENQPLPENEMPEQRVGEGYSWPSPSHSPNFSTLRSTL--------SGSYRKHSLDYIWE 375
            GEN    E+     + GE  S   P  SP  +    +L        SGS RK S  + W 
Sbjct: 983  GENPSSTEHASISMKQGEATSNSWPFFSPRDNQAAGSLHPKTGGSESGSSRKFSFGHFWG 1042

Query: 374  T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201
            +  E+H W ++RI+AL++ERF+ QK  LK+YL+FWIENMFNL+GLEINMI         L
Sbjct: 1043 SIKESHRWNRRRILALKKERFETQKNLLKIYLKFWIENMFNLYGLEINMIALLLASFALL 1102

Query: 200  NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21
            N+IS++YIA +A C+LL R  I KLWP+ V L A+IL  EY+A W N +P S +   E +
Sbjct: 1103 NAISLVYIALLAACVLLRRRLIQKLWPVVVFLFASILAIEYVATWNNSLP-SDQAPSETS 1161

Query: 20   THCHEC 3
             HCH+C
Sbjct: 1162 VHCHDC 1167


>ref|XP_007050709.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508702970|gb|EOX94866.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 1777

 Score =  233 bits (595), Expect = 4e-59
 Identities = 121/246 (49%), Positives = 160/246 (65%), Gaps = 10/246 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLR KVLVIAAC  QYN+F WL+ MPS     G+ EEPCPLF+SA+D+FT     N
Sbjct: 668  GIESGLRGKVLVIAACIFQYNIFRWLDNMPSGISNKGKWEEPCPLFLSAEDTFTNGFMSN 727

Query: 530  GENQPLPE-NEMP---EQRVGEGYSWPSPSHSPNFSTLRSTLSGS----YRKHSLDYIWE 375
            GE +P      +P   ++ V + +S  SP+ S     + S   GS    +RK S  Y W 
Sbjct: 728  GEEKPSSSFGAVPIRQDRAVSDSWSSLSPAFSQAPHPVSSKAGGSEVSSFRKFSFGYFWG 787

Query: 374  T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201
            +  E+H W KKRI+ALR+ERF+ QK  LK+YL+FW+ENMFNL+GLEINMI         L
Sbjct: 788  STKESHKWNKKRILALRKERFETQKALLKIYLKFWMENMFNLYGLEINMIALLLASFALL 847

Query: 200  NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21
            N+ISMLYI+ +A C+LL R  I KLWP+ V L A+IL+ EY A+WKN+ PL+ +   +  
Sbjct: 848  NAISMLYISLLAVCVLLNRRIIRKLWPVLVFLFASILILEYFAIWKNMFPLNQKKPSQAE 907

Query: 20   THCHEC 3
             HCH+C
Sbjct: 908  IHCHDC 913


>ref|XP_007050708.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508702969|gb|EOX94865.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 2501

 Score =  233 bits (595), Expect = 4e-59
 Identities = 121/246 (49%), Positives = 160/246 (65%), Gaps = 10/246 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLR KVLVIAAC  QYN+F WL+ MPS     G+ EEPCPLF+SA+D+FT     N
Sbjct: 915  GIESGLRGKVLVIAACIFQYNIFRWLDNMPSGISNKGKWEEPCPLFLSAEDTFTNGFMSN 974

Query: 530  GENQPLPE-NEMP---EQRVGEGYSWPSPSHSPNFSTLRSTLSGS----YRKHSLDYIWE 375
            GE +P      +P   ++ V + +S  SP+ S     + S   GS    +RK S  Y W 
Sbjct: 975  GEEKPSSSFGAVPIRQDRAVSDSWSSLSPAFSQAPHPVSSKAGGSEVSSFRKFSFGYFWG 1034

Query: 374  T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201
            +  E+H W KKRI+ALR+ERF+ QK  LK+YL+FW+ENMFNL+GLEINMI         L
Sbjct: 1035 STKESHKWNKKRILALRKERFETQKALLKIYLKFWMENMFNLYGLEINMIALLLASFALL 1094

Query: 200  NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21
            N+ISMLYI+ +A C+LL R  I KLWP+ V L A+IL+ EY A+WKN+ PL+ +   +  
Sbjct: 1095 NAISMLYISLLAVCVLLNRRIIRKLWPVLVFLFASILILEYFAIWKNMFPLNQKKPSQAE 1154

Query: 20   THCHEC 3
             HCH+C
Sbjct: 1155 IHCHDC 1160


>ref|XP_006293550.1| hypothetical protein CARUB_v10022493mg [Capsella rubella]
            gi|482562258|gb|EOA26448.1| hypothetical protein
            CARUB_v10022493mg [Capsella rubella]
          Length = 2485

 Score =  232 bits (592), Expect = 8e-59
 Identities = 124/246 (50%), Positives = 159/246 (64%), Gaps = 10/246 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLR KVLV+AACTLQYNVF WLER P  N+  G+ EEPCPLFVSA+D+    SS N
Sbjct: 897  GIESGLRGKVLVVAACTLQYNVFRWLERTPGLNIIKGKYEEPCPLFVSAEDTTASVSSSN 956

Query: 530  GENQPLPENEMPEQRVGEGYS--WP--SPSHSPNFSTLR----STLSGSYRKHSLDYIWE 375
            GEN     +     + GEG S  WP  S   S     LR     + SGS R+ S  + W 
Sbjct: 957  GENSSSTPHASISTKQGEGTSNSWPFLSTRDSQAAGFLRPKTGGSESGSSRRFSFGHFWG 1016

Query: 374  T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201
            +  E+H W ++RI+AL++ERF+ QK  LK+YL+FWIENMFNL+GLEINMI         L
Sbjct: 1017 SIKESHRWNRRRILALKKERFETQKNLLKIYLKFWIENMFNLYGLEINMIALLLASFALL 1076

Query: 200  NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21
            N+ISM+YIA +A C+LL R  I KLWP+ V L A+IL  EY+A W + +P S +   E +
Sbjct: 1077 NAISMVYIALLAACVLLRRRLIQKLWPVVVFLFASILAIEYVATWNSFLP-SDQAPSETS 1135

Query: 20   THCHEC 3
             HCH+C
Sbjct: 1136 VHCHDC 1141


>ref|XP_006588615.1| PREDICTED: uncharacterized protein LOC100801841 [Glycine max]
          Length = 2483

 Score =  231 bits (590), Expect = 1e-58
 Identities = 126/247 (51%), Positives = 163/247 (65%), Gaps = 11/247 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            GL+ GLR KVLVI ACTLQYNVFHWLERMP++ L  GQ EEPCPLFV  +D+F   +  N
Sbjct: 897  GLESGLRGKVLVIVACTLQYNVFHWLERMPNTVLSKGQWEEPCPLFVPTEDAFIDDAKCN 956

Query: 530  GENQPLPENEMPEQRVGEGYSWPSP-------SHSPNF--STLRSTLSGSYRKHSLDYIW 378
             E++    +++P   + EG S  S        S +P+   S    +   S +K+S  +IW
Sbjct: 957  EESKSSYNSQLPSA-IKEGVSGNSLQIITSGLSQAPDTPSSKTEGSSDSSSKKYSFGFIW 1015

Query: 377  ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204
             +  E+H W KKRIVALR+ERF+ QKT LKVYL+FW+EN FNLFGLEINMI         
Sbjct: 1016 GSSKESHKWNKKRIVALRKERFETQKTVLKVYLKFWMENTFNLFGLEINMISLLLVSFAL 1075

Query: 203  LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEP 24
            LN++SMLYIA +A C+LL R  I K+WPIFV L A+IL+ EYLA+WK+++PL+     E 
Sbjct: 1076 LNALSMLYIALLAACVLLNRHIIRKVWPIFVFLFASILILEYLAIWKDMLPLNSHASSE- 1134

Query: 23   TTHCHEC 3
               C +C
Sbjct: 1135 -IRCRDC 1140


>ref|XP_004290692.1| PREDICTED: uncharacterized protein LOC101301158 [Fragaria vesca
            subsp. vesca]
          Length = 2451

 Score =  231 bits (588), Expect = 2e-58
 Identities = 125/244 (51%), Positives = 161/244 (65%), Gaps = 8/244 (3%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            GL+ GLR KVLVIAACTLQYNVFHWLERMPS+ L  G  E PCPLF+SA+D+   A+  +
Sbjct: 875  GLESGLRGKVLVIAACTLQYNVFHWLERMPSTILSKGMGE-PCPLFLSAEDTNISATIPS 933

Query: 530  GENQPLPENEMPEQRVGEGYSWP--SPS----HSPNFSTLRSTLSGSYRKHSLDYIWET- 372
             +N+P     + +Q     +SWP  SPS    H+P+     ++   S  K+S  YIW + 
Sbjct: 934  EDNRPSTSFSV-KQEGARSHSWPFFSPSLLHSHNPSSPKAGTSKGSSSGKYSFGYIWGST 992

Query: 371  -ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXLNS 195
             E+H W KKRI+AL++ERF+ QK   K+Y++FW+ENMFNLFGLEINMI         LN+
Sbjct: 993  KESHKWNKKRILALQKERFETQKLISKIYIKFWLENMFNLFGLEINMIALLLASFALLNA 1052

Query: 194  ISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPTTH 15
            ISMLYIA +A CI+L R  I KLWP FV L A+IL+ EY A+WK+  P +H     P   
Sbjct: 1053 ISMLYIALLAACIILNRQIIRKLWPTFVFLFASILILEYFAIWKSTWPPNHPDATNPC-- 1110

Query: 14   CHEC 3
            CH+C
Sbjct: 1111 CHDC 1114


>ref|XP_007200947.1| hypothetical protein PRUPE_ppa000028mg [Prunus persica]
            gi|462396347|gb|EMJ02146.1| hypothetical protein
            PRUPE_ppa000028mg [Prunus persica]
          Length = 2388

 Score =  230 bits (587), Expect = 3e-58
 Identities = 125/246 (50%), Positives = 163/246 (66%), Gaps = 10/246 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            GL+ GLR KVLVIAACTLQYNVF WLE+MPS+ L  G+ EEPCPLFVSA+D+   +S  +
Sbjct: 801  GLEFGLRGKVLVIAACTLQYNVFRWLEKMPSTILNKGKWEEPCPLFVSAEDANINSSIPS 860

Query: 530  GENQPLPENE-MPEQRVG-EGYSWP------SPSHSPNFSTLRSTLSGSYRKHSLDYIWE 375
             EN+   ++E +  +R G   +SWP      S SH+P       +   S  K+S  YIW 
Sbjct: 861  EENKQSTDSEALSVKREGARSHSWPFFSPGLSESHNPMSPRAGGSEGSSSNKYSFGYIWG 920

Query: 374  T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201
            +  E+H W KKRI+ LR+ERF+ QK   K+YL+FW+ENMFNLFGLEINMI         L
Sbjct: 921  STKESHKWNKKRILTLRKERFETQKLISKIYLKFWMENMFNLFGLEINMIALLLASFALL 980

Query: 200  NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21
            N+IS++YIA +ATCI+L R  I K+WPI V L A+IL+ EY A+WK++ P +H    E  
Sbjct: 981  NAISLVYIALLATCIILNRHIIRKIWPILVFLFASILILEYFAIWKSMWPSNHP--DETN 1038

Query: 20   THCHEC 3
              CH+C
Sbjct: 1039 ARCHDC 1044


>ref|XP_002882144.1| hypothetical protein ARALYDRAFT_322444 [Arabidopsis lyrata subsp.
           lyrata] gi|297327983|gb|EFH58403.1| hypothetical protein
           ARALYDRAFT_322444 [Arabidopsis lyrata subsp. lyrata]
          Length = 1473

 Score =  230 bits (587), Expect = 3e-58
 Identities = 122/246 (49%), Positives = 158/246 (64%), Gaps = 10/246 (4%)
 Frame = -1

Query: 710 GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
           G++ GLR KVLV+AACTLQYNVF WLER P   +  G+ EEPCPLFVSA+D+    SS N
Sbjct: 239 GIESGLRGKVLVVAACTLQYNVFRWLERTPGLTVIKGKYEEPCPLFVSAEDTTASVSSSN 298

Query: 530 GENQPLPENEMPEQRVGEGYS--WP--SPSHSPNFSTLR----STLSGSYRKHSLDYIWE 375
           GEN    ++     + GE  S  WP  SP  +     L      + SGS RK S  + W 
Sbjct: 299 GENPSSTDHASISMKQGEATSNSWPFFSPRDNQGAGFLHPKTGGSESGSSRKFSFGHFWG 358

Query: 374 T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201
           +  E+H W ++RI+AL++ERF+ QK  LK+YL+FWIENMFNL+GLEINMI         L
Sbjct: 359 SIKESHRWNRRRILALKKERFETQKNLLKIYLKFWIENMFNLYGLEINMIALLLASFALL 418

Query: 200 NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21
           N+ISM+YIA +A C+LL R  I KLWP+ V L A+IL  EY+A W + +P S +   E +
Sbjct: 419 NAISMVYIALLAACVLLRRRLIQKLWPVVVFLFASILAIEYVATWNSFLP-SDQAPSETS 477

Query: 20  THCHEC 3
            HCH+C
Sbjct: 478 VHCHDC 483


>ref|XP_004247483.1| PREDICTED: uncharacterized protein LOC101266159 [Solanum
            lycopersicum]
          Length = 2450

 Score =  229 bits (585), Expect = 5e-58
 Identities = 128/248 (51%), Positives = 165/248 (66%), Gaps = 12/248 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            GL+ GLRAKVLV+AACTLQYNVFHWLE+MP+S L   +SEEPCPLFVS +D   +    +
Sbjct: 873  GLEAGLRAKVLVVAACTLQYNVFHWLEKMPASLLNDNRSEEPCPLFVSEEDVMPLVP--D 930

Query: 530  GENQPLPE-NEMPEQRVGEGYSWPSPSHSPNFSTLRSTLSGS-----YR---KHSLDYIW 378
            GEN+P+ + NE   Q +    S   P    +       +S S     YR   K+S   IW
Sbjct: 931  GENKPVADSNEFSTQGMRTS-SKSCPYFDQSLYQSSDGVSSSRGVSEYRSRSKYSFGSIW 989

Query: 377  ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204
             +  E+H W KK +V+LR+ER  MQKTTLK+YL+FW+ENMFNLFGLEINM+         
Sbjct: 990  GSRKESHKWNKKLVVSLRKERLVMQKTTLKIYLKFWVENMFNLFGLEINMLALLLTSFAL 1049

Query: 203  LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLS-HRVLVE 27
            LN++S++YIA +A+C+LL R  I K+WPIFVLL   ILL EY AMWK+++PL+ HR    
Sbjct: 1050 LNAVSLIYIALLASCVLLERRIIRKVWPIFVLLFTLILLLEYFAMWKSLMPLNQHR--PN 1107

Query: 26   PTTHCHEC 3
             T HCH+C
Sbjct: 1108 QTVHCHDC 1115


>ref|XP_006358438.1| PREDICTED: uncharacterized protein LOC102605335 [Solanum tuberosum]
          Length = 2473

 Score =  227 bits (579), Expect = 3e-57
 Identities = 127/248 (51%), Positives = 164/248 (66%), Gaps = 12/248 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            GL+ GLRAKVLV+AACTLQYNVFHWLE+MP+S L   +SEEPCPLFVS +D   +    +
Sbjct: 896  GLEAGLRAKVLVVAACTLQYNVFHWLEKMPTSLLNGNKSEEPCPLFVSEEDVMPLVP--D 953

Query: 530  GENQPLPE-NEMPEQRVGEGYSWPSPSHSPNFSTLRSTLSGS-----YR---KHSLDYIW 378
             EN+P+ + NE   Q +    S   P    +       +S S     YR   K+S   IW
Sbjct: 954  EENKPVADSNEFSTQGMRTS-SKSCPYFDQSLYQSSDGVSSSRGVSEYRSRSKYSFGSIW 1012

Query: 377  ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204
             +  E+H W KK +V+LR+ER +MQKTTLK+YL+FW+ENMFNLFGLEINM+         
Sbjct: 1013 GSRKESHKWNKKLVVSLRKERLEMQKTTLKIYLKFWVENMFNLFGLEINMLALLLTSFAL 1072

Query: 203  LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLS-HRVLVE 27
            LN++S+LYIA +A+C+LL R  I K+WPIFVLL   ILL EY AMWK+++PL+ HR    
Sbjct: 1073 LNAVSLLYIALLASCVLLERRIIRKVWPIFVLLFTLILLLEYFAMWKSLMPLNQHR--PN 1130

Query: 26   PTTHCHEC 3
               HCH+C
Sbjct: 1131 QAVHCHDC 1138


>ref|NP_182327.6| uncharacterized protein [Arabidopsis thaliana]
            gi|330255833|gb|AEC10927.1| uncharacterized protein
            AT2G48060 [Arabidopsis thaliana]
          Length = 2462

 Score =  227 bits (579), Expect = 3e-57
 Identities = 121/246 (49%), Positives = 157/246 (63%), Gaps = 10/246 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLR KVLV+AACTLQYNVF WLER     +  G+ EEPCPLFVSA+D+    SS N
Sbjct: 874  GIESGLRGKVLVVAACTLQYNVFRWLERTSGLTVIKGKYEEPCPLFVSAEDTTASVSSSN 933

Query: 530  GENQPLPENEMPEQRVGEGYS--WP--SPSHSPNFSTLR----STLSGSYRKHSLDYIWE 375
            GEN    ++     + GE  S  WP  SP  +     L      + SGS RK S  + W 
Sbjct: 934  GENPSSTDHASISMKQGEATSNSWPFFSPRGNQGAGFLHPKTGGSESGSSRKFSFGHFWG 993

Query: 374  T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201
            +  E+H W ++RI+AL++ERF+ QK  LK+YL+FWIENMFNL+GLEINMI         L
Sbjct: 994  SIKESHRWNRRRILALKKERFETQKNLLKIYLKFWIENMFNLYGLEINMIALLLASFALL 1053

Query: 200  NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21
            N+ISM+YIA +A C+LL R  I KLWP+ V L A+IL  EY+A W + +P S +   E +
Sbjct: 1054 NAISMVYIALLAACVLLRRRVIQKLWPVVVFLFASILAIEYVATWNSFLP-SDQAPSETS 1112

Query: 20   THCHEC 3
             HCH+C
Sbjct: 1113 VHCHDC 1118


>gb|AAD13709.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1500

 Score =  227 bits (579), Expect = 3e-57
 Identities = 121/246 (49%), Positives = 157/246 (63%), Gaps = 10/246 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            G++ GLR KVLV+AACTLQYNVF WLER     +  G+ EEPCPLFVSA+D+    SS N
Sbjct: 266  GIESGLRGKVLVVAACTLQYNVFRWLERTSGLTVIKGKYEEPCPLFVSAEDTTASVSSSN 325

Query: 530  GENQPLPENEMPEQRVGEGYS--WP--SPSHSPNFSTLR----STLSGSYRKHSLDYIWE 375
            GEN    ++     + GE  S  WP  SP  +     L      + SGS RK S  + W 
Sbjct: 326  GENPSSTDHASISMKQGEATSNSWPFFSPRGNQGAGFLHPKTGGSESGSSRKFSFGHFWG 385

Query: 374  T--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXXL 201
            +  E+H W ++RI+AL++ERF+ QK  LK+YL+FWIENMFNL+GLEINMI         L
Sbjct: 386  SIKESHRWNRRRILALKKERFETQKNLLKIYLKFWIENMFNLYGLEINMIALLLASFALL 445

Query: 200  NSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEPT 21
            N+ISM+YIA +A C+LL R  I KLWP+ V L A+IL  EY+A W + +P S +   E +
Sbjct: 446  NAISMVYIALLAACVLLRRRVIQKLWPVVVFLFASILAIEYVATWNSFLP-SDQAPSETS 504

Query: 20   THCHEC 3
             HCH+C
Sbjct: 505  VHCHDC 510


>ref|XP_002524795.1| conserved hypothetical protein [Ricinus communis]
            gi|223535979|gb|EEF37638.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 2254

 Score =  226 bits (577), Expect = 4e-57
 Identities = 123/247 (49%), Positives = 158/247 (63%), Gaps = 11/247 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            GL+ GLR KVLVIAACTLQYNVF WL +MP++    G+ EEPCPLFVS +++F   S +N
Sbjct: 722  GLESGLRGKVLVIAACTLQYNVFRWLGKMPNTFPDKGKWEEPCPLFVSDENAFANGSIIN 781

Query: 530  GENQPLPENEMPEQR---------VGEGYSWPSPSHSPNFSTLRSTLSGSYRKHSLDYIW 378
             EN+   E  +P  +              S+  P H+ +  T  S  SG+ R  S  YIW
Sbjct: 782  DENKAPSEYNVPSVKKETVTATSTFSFTSSFTQPPHTFSNKTGSSVGSGT-RIFSFGYIW 840

Query: 377  ET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXXX 204
             +  E+H W +KRI+ALR+ERF+ QK  LK+YL+FWIENMFNLFGLEINMI         
Sbjct: 841  GSTKESHKWNRKRILALRKERFETQKALLKIYLKFWIENMFNLFGLEINMIALLLASFTL 900

Query: 203  LNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVEP 24
            LN+I+MLYIA +A CIL+ R  I KLWPI V L A+IL+ EY A+WK++ PL+     E 
Sbjct: 901  LNAIAMLYIALLAACILVSRHIIRKLWPIVVTLFASILILEYFAIWKSIFPLNQHAPSET 960

Query: 23   TTHCHEC 3
              +CH C
Sbjct: 961  DIYCHNC 967


>ref|XP_006575095.1| PREDICTED: uncharacterized protein LOC100792646 isoform X4 [Glycine
            max]
          Length = 2173

 Score =  226 bits (575), Expect = 8e-57
 Identities = 126/248 (50%), Positives = 161/248 (64%), Gaps = 12/248 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            GL+ GLR KVLVI ACTLQYNVF WLERMP++ L  GQ EEPCPLFV  +D F   +  N
Sbjct: 897  GLESGLRGKVLVIVACTLQYNVFRWLERMPNTVLSKGQWEEPCPLFVPTEDVFIDDAMCN 956

Query: 530  GENQPLPENEMPEQRVGEGYSWPSPS----------HSPNFSTLRSTLSGSYRKHSLDYI 381
             E++    + +P   + EG S  S             +P+  T  S+ S S +K+S  +I
Sbjct: 957  EESKSSYNSNLPSA-IKEGVSGKSLQIITSGLSQALDTPSSKTGDSSDSSS-KKYSFGFI 1014

Query: 380  WET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXX 207
            W +  E+  W KKRIVALR+ERF+ QKT LKVYL+FW+EN FNLFGLEINMI        
Sbjct: 1015 WGSSKESQKWNKKRIVALRKERFETQKTVLKVYLKFWMENTFNLFGLEINMISLLLVSFA 1074

Query: 206  XLNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVE 27
             LN+ISM+YIA +A C+LL R  I K+WPIFV L A+IL+ EYLA+WK+++PL+     E
Sbjct: 1075 LLNAISMMYIALLAACVLLNRHIICKVWPIFVFLFASILILEYLAIWKDMLPLNSHASSE 1134

Query: 26   PTTHCHEC 3
                CH+C
Sbjct: 1135 --IRCHDC 1140


>ref|XP_006575094.1| PREDICTED: uncharacterized protein LOC100792646 isoform X3 [Glycine
            max]
          Length = 2220

 Score =  226 bits (575), Expect = 8e-57
 Identities = 126/248 (50%), Positives = 161/248 (64%), Gaps = 12/248 (4%)
 Frame = -1

Query: 710  GLQEGLRAKVLVIAACTLQYNVFHWLERMPSSNLYVGQSEEPCPLFVSAKDSFTVASSLN 531
            GL+ GLR KVLVI ACTLQYNVF WLERMP++ L  GQ EEPCPLFV  +D F   +  N
Sbjct: 635  GLESGLRGKVLVIVACTLQYNVFRWLERMPNTVLSKGQWEEPCPLFVPTEDVFIDDAMCN 694

Query: 530  GENQPLPENEMPEQRVGEGYSWPSPS----------HSPNFSTLRSTLSGSYRKHSLDYI 381
             E++    + +P   + EG S  S             +P+  T  S+ S S +K+S  +I
Sbjct: 695  EESKSSYNSNLPSA-IKEGVSGKSLQIITSGLSQALDTPSSKTGDSSDSSS-KKYSFGFI 752

Query: 380  WET--ENHNWKKKRIVALRQERFDMQKTTLKVYLRFWIENMFNLFGLEINMIXXXXXXXX 207
            W +  E+  W KKRIVALR+ERF+ QKT LKVYL+FW+EN FNLFGLEINMI        
Sbjct: 753  WGSSKESQKWNKKRIVALRKERFETQKTVLKVYLKFWMENTFNLFGLEINMISLLLVSFA 812

Query: 206  XLNSISMLYIACIATCILLPRPTISKLWPIFVLLSATILLAEYLAMWKNVVPLSHRVLVE 27
             LN+ISM+YIA +A C+LL R  I K+WPIFV L A+IL+ EYLA+WK+++PL+     E
Sbjct: 813  LLNAISMMYIALLAACVLLNRHIICKVWPIFVFLFASILILEYLAIWKDMLPLNSHASSE 872

Query: 26   PTTHCHEC 3
                CH+C
Sbjct: 873  --IRCHDC 878


Top