BLASTX nr result

ID: Ephedra28_contig00009319 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00009319
         (2105 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006857334.1| hypothetical protein AMTR_s00067p00089960 [A...   269   3e-69
emb|CBI23663.3| unnamed protein product [Vitis vinifera]              258   7e-66
gb|EOY06084.1| COP1-interacting protein-related, putative isofor...   256   2e-65
gb|EOY06082.1| COP1-interacting protein-related, putative isofor...   256   2e-65
gb|EOY06081.1| COP1-interacting protein-related, putative isofor...   256   2e-65
gb|EOY06080.1| COP1-interacting protein-related, putative isofor...   256   2e-65
gb|EOY06079.1| COP1-interacting protein-related, putative isofor...   256   2e-65
ref|XP_002281562.2| PREDICTED: uncharacterized protein LOC100251...   255   6e-65
gb|EXB93730.1| hypothetical protein L484_011725 [Morus notabilis]     253   2e-64
gb|EMJ26655.1| hypothetical protein PRUPE_ppa000302mg [Prunus pe...   249   3e-63
ref|XP_006403265.1| hypothetical protein EUTSA_v10003134mg [Eutr...   246   4e-62
ref|XP_006403264.1| hypothetical protein EUTSA_v10003134mg [Eutr...   246   4e-62
ref|XP_002528832.1| conserved hypothetical protein [Ricinus comm...   244   1e-61
ref|XP_006420261.1| hypothetical protein CICLE_v10004168mg [Citr...   243   2e-61
ref|XP_006606379.1| PREDICTED: dentin sialophosphoprotein-like i...   242   5e-61
ref|XP_006606378.1| PREDICTED: dentin sialophosphoprotein-like i...   242   5e-61
ref|XP_006606377.1| PREDICTED: dentin sialophosphoprotein-like i...   242   5e-61
gb|ESW16027.1| hypothetical protein PHAVU_007G123500g [Phaseolus...   240   2e-60
ref|XP_004980952.1| PREDICTED: uncharacterized protein LOC101783...   238   6e-60
ref|XP_004296379.1| PREDICTED: uncharacterized protein LOC101304...   238   6e-60

>ref|XP_006857334.1| hypothetical protein AMTR_s00067p00089960 [Amborella trichopoda]
            gi|548861427|gb|ERN18801.1| hypothetical protein
            AMTR_s00067p00089960 [Amborella trichopoda]
          Length = 1357

 Score =  269 bits (688), Expect = 3e-69
 Identities = 198/551 (35%), Positives = 267/551 (48%), Gaps = 32/551 (5%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK ET+LD  VFQLTPTRTRC+LVI ANG +EK+ SGL  PF+TH++TA  QIAKGGYSI
Sbjct: 1    MKAETKLDSAVFQLTPTRTRCDLVIFANGTSEKIVSGLLDPFLTHMRTAQHQIAKGGYSI 60

Query: 1459 RLEPPPG-SQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIG- 1286
            +LEP PG +Q V WFTKGT+ERFVRFVSTPE++ER+NT              Q  E IG 
Sbjct: 61   QLEPGPGNNQGVAWFTKGTVERFVRFVSTPEVLERVNTIESEITQIEEAIAIQGNENIGF 120

Query: 1285 --AEEHSSKSGHDDESETGRSTVVDNSESALVLYEP----KQSNGKVEAAHAENSKIRLL 1124
               E+H++KS   + ++ GRS +  ++E A+VLY+P     +SNG       ENSK++LL
Sbjct: 121  STVEDHATKS--TESNDGGRSIMDSDAEKAIVLYKPGAQSAESNG--STTQEENSKVQLL 176

Query: 1123 KLLETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKK 944
            ++LETRR +L KEQGM       AGFD++++  L+ F+ECFGA RL+EAC +FMELWK K
Sbjct: 177  RVLETRRTMLQKEQGMAFARAVAAGFDMDHLVHLISFAECFGASRLKEACIRFMELWKVK 236

Query: 943  QXXXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGD 764
                           E  S RSE   +N +G  L+S    LK      +         GD
Sbjct: 237  HETSQWLEGMEFEAAEEMSSRSEFSSMNGSGFMLSSETSKLKEFRESWS------DFHGD 290

Query: 763  LKSWKSVNSEDFQAKHQNGVSTDD----TQVHYQNPALFP---WQGQQP-------PYGQ 626
            +   +S    + +A    G S       + +  Q P + P   +QGQ P       P   
Sbjct: 291  IGE-RSHGKTNIEAGSDTGASDPSRDKRSSMESQVPPVVPPEYYQGQYPQPMVHAWPLHA 349

Query: 625  NFHPPGFP-YAMQGVPYYPGYAMNPSFYQGMQHPMSDDHAHIPLHPHY-----SPSNAHP 464
                P FP Y MQG+PYY GY    +++Q    PM D   ++     +     S    + 
Sbjct: 350  PQGAPVFPAYPMQGMPYYQGYPGAGAYFQPPYPPMEDPRFNMASRMDFKRQPMSGKEGNL 409

Query: 463  SEESRRGSNKRTQSRRVATSKAKYPSINGDRXXXXXXXXXXXXXXXXXXXXXEHMENLQT 284
              E+  G++  T                                         H +N+Q 
Sbjct: 410  VPETWEGASNTTS----------------------------------------HDQNMQL 429

Query: 283  RXXXXXXXXXXXXXXXXGNKQSGR-VFIRNINYITS---DKHGKGTGLEDSDPEVDVEMD 116
                               K   R V IRNINYI S   D  G  +G E  + E+  E++
Sbjct: 430  EVEREGSSRQSNKRRGRMGKSRSRMVVIRNINYIASKGDDNSGSESGSEVDEEELQQEVE 489

Query: 115  EEHKDQASDVH 83
            E   +     H
Sbjct: 490  ESQLNHEKRAH 500


>emb|CBI23663.3| unnamed protein product [Vitis vinifera]
          Length = 1216

 Score =  258 bits (659), Expect = 7e-66
 Identities = 194/526 (36%), Positives = 246/526 (46%), Gaps = 12/526 (2%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD  VFQLTPTRTRC+L+ITANG+TEK+ASGL  PF+ HLKTA DQIAKGGYSI
Sbjct: 1    MKSSTLLDSAVFQLTPTRTRCDLIITANGKTEKIASGLLNPFLAHLKTAQDQIAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIGAE 1280
             LEP PGS D  WF KGT+ERFVRFVSTPE++ER+ T              Q    +G  
Sbjct: 61   ILEPKPGS-DATWFAKGTVERFVRFVSTPEVLERVYTIESEIIQIGEAIAIQSNNDLGLS 119

Query: 1279 EHSSKSGHDDESETGRSTVVDNS-ESALVLYE----PKQSNGKVEAAHAENSKIRLLKLL 1115
                      ES  G   V+D S E A+VLY+    P ++NG        NSK++LLK+L
Sbjct: 120  AVVDHQAKPVESIEGSKPVLDTSEEKAIVLYKPGAHPPEANG--STTQEGNSKVQLLKVL 177

Query: 1114 ETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQXX 935
            ETR+ VL KEQGM       AGFDI++M  L+ F+ECFGA RL +AC +F++LWK K   
Sbjct: 178  ETRKTVLQKEQGMAFARAVAAGFDIDHMTPLLSFAECFGASRLMDACLRFLDLWKSKH-- 235

Query: 934  XXXXXXXXXXXXETNSMRSESCF--VNETGITLNSSVYALKP-RVAQDAQPLVARSESGD 764
                           +M S+S F  +N +GITL++ V   K  R A         SE+  
Sbjct: 236  ---ETGQWLEIEAAEAMSSQSDFSSMNPSGITLSNMVNKQKEFREAWPESLSELASENNG 292

Query: 763  LKSWKSVNSEDFQAKHQNGVSTDD-TQVHYQNPALFPWQGQQPPYGQNFHPPGFPYAMQG 587
                 +   E     HQ  +   +  Q  + +    PW    PP      P   PY MQG
Sbjct: 293  KARIDASADEKPPMDHQVPLGHQEYFQGQFPHHMFPPWPIHSPP---GAVPVFQPYPMQG 349

Query: 586  VPYYPGYAMNPSFYQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGSNKRTQSRRVAT 407
            +PYY  Y  N SF Q    PM D        P Y       S +SR   +  T+S     
Sbjct: 350  MPYYQNYPGNGSFVQPPYPPMEDSR----FSPGYRMGQKRHSMDSR---DSNTESETWDA 402

Query: 406  SKAKYPSINGDRXXXXXXXXXXXXXXXXXXXXXEHMENLQTRXXXXXXXXXXXXXXXXGN 227
              +K  S  G +                                                
Sbjct: 403  DASKTRSSYGKK------------------------------------------------ 414

Query: 226  KQSGRVFIRNINYITSDKH---GKGTGLEDSDPEVDVEMDEEHKDQ 98
             +SG V IRNINYITS +    G  +  E S   +D     + +D+
Sbjct: 415  -KSGVVVIRNINYITSKRQNSSGSESQKESSTKSMDASKSSDKEDR 459


>gb|EOY06084.1| COP1-interacting protein-related, putative isoform 6 [Theobroma
            cacao]
          Length = 1142

 Score =  256 bits (655), Expect = 2e-65
 Identities = 171/425 (40%), Positives = 223/425 (52%), Gaps = 20/425 (4%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD VVFQLTPTRTRC+LVI+ANG+TEK+ASGL  PF+ HLKTA +Q+AKGGYSI
Sbjct: 1    MKSSTRLDSVVFQLTPTRTRCDLVISANGKTEKIASGLLNPFLAHLKTAQEQVAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIG-- 1286
             L+P P S D  WFTKGT+ERFVRFVSTPEI+ER+ T              Q    IG  
Sbjct: 61   ILQPEP-SIDATWFTKGTVERFVRFVSTPEILERVYTVESEILQIEEAIAIQSNNNIGLS 119

Query: 1285 -AEEHSSKSGHDDESETGRSTVVDNSESALVLY----EPKQSNGKVEAAHAENSKIRLLK 1121
              E+H  K    +  E  R T   N E A+VLY    +P ++NG   A    NSK++LLK
Sbjct: 120  AVEDHQVKP--LESIEGSRVTPDSNEEKAIVLYTPGAQPSEANG--SAVQEGNSKVQLLK 175

Query: 1120 LLETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQ 941
            +LETR+ VL KEQGM       AGFDI++M  L+ F+E FGA RLR+AC KF ELWK+K 
Sbjct: 176  VLETRKTVLQKEQGMAFARAVAAGFDIDHMAPLMSFAESFGASRLRDACVKFTELWKRKH 235

Query: 940  XXXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDL 761
                          E  S RS+   +N +GI L++ +                  + G  
Sbjct: 236  ---ETGQWLEIEAAEAMSSRSDFSAMNASGIVLSNMI----------------NKQKGLK 276

Query: 760  KSWKSVNSEDFQAKHQNGV--------STDDTQVHYQN--PALFPWQGQQPPYGQNFHPP 611
            ++W  ++  + +A  ++           T   Q +YQ   P   PW    PP G    P 
Sbjct: 277  EAWLEISENNGKAGVESSTDERPPMDQQTPGRQEYYQAQFPMFPPWPIHSPPGGM---PT 333

Query: 610  GFPYAMQGVPYYPGYAMNPSF---YQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGS 440
               Y MQG+PYYP Y  +P F   Y  M+ P  +    I         ++H   E+    
Sbjct: 334  FQGYPMQGMPYYPSYPGSPFFQQPYPSMEDPRLNAGQRIQKRHSMESRDSHTGSETWEME 393

Query: 439  NKRTQ 425
              ++Q
Sbjct: 394  RAKSQ 398


>gb|EOY06082.1| COP1-interacting protein-related, putative isoform 4 [Theobroma
            cacao]
          Length = 1318

 Score =  256 bits (655), Expect = 2e-65
 Identities = 171/425 (40%), Positives = 223/425 (52%), Gaps = 20/425 (4%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD VVFQLTPTRTRC+LVI+ANG+TEK+ASGL  PF+ HLKTA +Q+AKGGYSI
Sbjct: 1    MKSSTRLDSVVFQLTPTRTRCDLVISANGKTEKIASGLLNPFLAHLKTAQEQVAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIG-- 1286
             L+P P S D  WFTKGT+ERFVRFVSTPEI+ER+ T              Q    IG  
Sbjct: 61   ILQPEP-SIDATWFTKGTVERFVRFVSTPEILERVYTVESEILQIEEAIAIQSNNNIGLS 119

Query: 1285 -AEEHSSKSGHDDESETGRSTVVDNSESALVLY----EPKQSNGKVEAAHAENSKIRLLK 1121
              E+H  K    +  E  R T   N E A+VLY    +P ++NG   A    NSK++LLK
Sbjct: 120  AVEDHQVKP--LESIEGSRVTPDSNEEKAIVLYTPGAQPSEANG--SAVQEGNSKVQLLK 175

Query: 1120 LLETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQ 941
            +LETR+ VL KEQGM       AGFDI++M  L+ F+E FGA RLR+AC KF ELWK+K 
Sbjct: 176  VLETRKTVLQKEQGMAFARAVAAGFDIDHMAPLMSFAESFGASRLRDACVKFTELWKRKH 235

Query: 940  XXXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDL 761
                          E  S RS+   +N +GI L++ +                  + G  
Sbjct: 236  ---ETGQWLEIEAAEAMSSRSDFSAMNASGIVLSNMI----------------NKQKGLK 276

Query: 760  KSWKSVNSEDFQAKHQNGV--------STDDTQVHYQN--PALFPWQGQQPPYGQNFHPP 611
            ++W  ++  + +A  ++           T   Q +YQ   P   PW    PP G    P 
Sbjct: 277  EAWLEISENNGKAGVESSTDERPPMDQQTPGRQEYYQAQFPMFPPWPIHSPPGGM---PT 333

Query: 610  GFPYAMQGVPYYPGYAMNPSF---YQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGS 440
               Y MQG+PYYP Y  +P F   Y  M+ P  +    I         ++H   E+    
Sbjct: 334  FQGYPMQGMPYYPSYPGSPFFQQPYPSMEDPRLNAGQRIQKRHSMESRDSHTGSETWEME 393

Query: 439  NKRTQ 425
              ++Q
Sbjct: 394  RAKSQ 398


>gb|EOY06081.1| COP1-interacting protein-related, putative isoform 3 [Theobroma
            cacao] gi|508714186|gb|EOY06083.1| COP1-interacting
            protein-related, putative isoform 3 [Theobroma cacao]
          Length = 1180

 Score =  256 bits (655), Expect = 2e-65
 Identities = 171/425 (40%), Positives = 223/425 (52%), Gaps = 20/425 (4%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD VVFQLTPTRTRC+LVI+ANG+TEK+ASGL  PF+ HLKTA +Q+AKGGYSI
Sbjct: 1    MKSSTRLDSVVFQLTPTRTRCDLVISANGKTEKIASGLLNPFLAHLKTAQEQVAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIG-- 1286
             L+P P S D  WFTKGT+ERFVRFVSTPEI+ER+ T              Q    IG  
Sbjct: 61   ILQPEP-SIDATWFTKGTVERFVRFVSTPEILERVYTVESEILQIEEAIAIQSNNNIGLS 119

Query: 1285 -AEEHSSKSGHDDESETGRSTVVDNSESALVLY----EPKQSNGKVEAAHAENSKIRLLK 1121
              E+H  K    +  E  R T   N E A+VLY    +P ++NG   A    NSK++LLK
Sbjct: 120  AVEDHQVKP--LESIEGSRVTPDSNEEKAIVLYTPGAQPSEANG--SAVQEGNSKVQLLK 175

Query: 1120 LLETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQ 941
            +LETR+ VL KEQGM       AGFDI++M  L+ F+E FGA RLR+AC KF ELWK+K 
Sbjct: 176  VLETRKTVLQKEQGMAFARAVAAGFDIDHMAPLMSFAESFGASRLRDACVKFTELWKRKH 235

Query: 940  XXXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDL 761
                          E  S RS+   +N +GI L++ +                  + G  
Sbjct: 236  ---ETGQWLEIEAAEAMSSRSDFSAMNASGIVLSNMI----------------NKQKGLK 276

Query: 760  KSWKSVNSEDFQAKHQNGV--------STDDTQVHYQN--PALFPWQGQQPPYGQNFHPP 611
            ++W  ++  + +A  ++           T   Q +YQ   P   PW    PP G    P 
Sbjct: 277  EAWLEISENNGKAGVESSTDERPPMDQQTPGRQEYYQAQFPMFPPWPIHSPPGGM---PT 333

Query: 610  GFPYAMQGVPYYPGYAMNPSF---YQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGS 440
               Y MQG+PYYP Y  +P F   Y  M+ P  +    I         ++H   E+    
Sbjct: 334  FQGYPMQGMPYYPSYPGSPFFQQPYPSMEDPRLNAGQRIQKRHSMESRDSHTGSETWEME 393

Query: 439  NKRTQ 425
              ++Q
Sbjct: 394  RAKSQ 398


>gb|EOY06080.1| COP1-interacting protein-related, putative isoform 2 [Theobroma
            cacao]
          Length = 1145

 Score =  256 bits (655), Expect = 2e-65
 Identities = 171/425 (40%), Positives = 223/425 (52%), Gaps = 20/425 (4%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD VVFQLTPTRTRC+LVI+ANG+TEK+ASGL  PF+ HLKTA +Q+AKGGYSI
Sbjct: 1    MKSSTRLDSVVFQLTPTRTRCDLVISANGKTEKIASGLLNPFLAHLKTAQEQVAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIG-- 1286
             L+P P S D  WFTKGT+ERFVRFVSTPEI+ER+ T              Q    IG  
Sbjct: 61   ILQPEP-SIDATWFTKGTVERFVRFVSTPEILERVYTVESEILQIEEAIAIQSNNNIGLS 119

Query: 1285 -AEEHSSKSGHDDESETGRSTVVDNSESALVLY----EPKQSNGKVEAAHAENSKIRLLK 1121
              E+H  K    +  E  R T   N E A+VLY    +P ++NG   A    NSK++LLK
Sbjct: 120  AVEDHQVKP--LESIEGSRVTPDSNEEKAIVLYTPGAQPSEANG--SAVQEGNSKVQLLK 175

Query: 1120 LLETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQ 941
            +LETR+ VL KEQGM       AGFDI++M  L+ F+E FGA RLR+AC KF ELWK+K 
Sbjct: 176  VLETRKTVLQKEQGMAFARAVAAGFDIDHMAPLMSFAESFGASRLRDACVKFTELWKRKH 235

Query: 940  XXXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDL 761
                          E  S RS+   +N +GI L++ +                  + G  
Sbjct: 236  ---ETGQWLEIEAAEAMSSRSDFSAMNASGIVLSNMI----------------NKQKGLK 276

Query: 760  KSWKSVNSEDFQAKHQNGV--------STDDTQVHYQN--PALFPWQGQQPPYGQNFHPP 611
            ++W  ++  + +A  ++           T   Q +YQ   P   PW    PP G    P 
Sbjct: 277  EAWLEISENNGKAGVESSTDERPPMDQQTPGRQEYYQAQFPMFPPWPIHSPPGGM---PT 333

Query: 610  GFPYAMQGVPYYPGYAMNPSF---YQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGS 440
               Y MQG+PYYP Y  +P F   Y  M+ P  +    I         ++H   E+    
Sbjct: 334  FQGYPMQGMPYYPSYPGSPFFQQPYPSMEDPRLNAGQRIQKRHSMESRDSHTGSETWEME 393

Query: 439  NKRTQ 425
              ++Q
Sbjct: 394  RAKSQ 398


>gb|EOY06079.1| COP1-interacting protein-related, putative isoform 1 [Theobroma
            cacao]
          Length = 1297

 Score =  256 bits (655), Expect = 2e-65
 Identities = 171/425 (40%), Positives = 223/425 (52%), Gaps = 20/425 (4%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD VVFQLTPTRTRC+LVI+ANG+TEK+ASGL  PF+ HLKTA +Q+AKGGYSI
Sbjct: 1    MKSSTRLDSVVFQLTPTRTRCDLVISANGKTEKIASGLLNPFLAHLKTAQEQVAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIG-- 1286
             L+P P S D  WFTKGT+ERFVRFVSTPEI+ER+ T              Q    IG  
Sbjct: 61   ILQPEP-SIDATWFTKGTVERFVRFVSTPEILERVYTVESEILQIEEAIAIQSNNNIGLS 119

Query: 1285 -AEEHSSKSGHDDESETGRSTVVDNSESALVLY----EPKQSNGKVEAAHAENSKIRLLK 1121
              E+H  K    +  E  R T   N E A+VLY    +P ++NG   A    NSK++LLK
Sbjct: 120  AVEDHQVKP--LESIEGSRVTPDSNEEKAIVLYTPGAQPSEANG--SAVQEGNSKVQLLK 175

Query: 1120 LLETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQ 941
            +LETR+ VL KEQGM       AGFDI++M  L+ F+E FGA RLR+AC KF ELWK+K 
Sbjct: 176  VLETRKTVLQKEQGMAFARAVAAGFDIDHMAPLMSFAESFGASRLRDACVKFTELWKRKH 235

Query: 940  XXXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDL 761
                          E  S RS+   +N +GI L++ +                  + G  
Sbjct: 236  ---ETGQWLEIEAAEAMSSRSDFSAMNASGIVLSNMI----------------NKQKGLK 276

Query: 760  KSWKSVNSEDFQAKHQNGV--------STDDTQVHYQN--PALFPWQGQQPPYGQNFHPP 611
            ++W  ++  + +A  ++           T   Q +YQ   P   PW    PP G    P 
Sbjct: 277  EAWLEISENNGKAGVESSTDERPPMDQQTPGRQEYYQAQFPMFPPWPIHSPPGGM---PT 333

Query: 610  GFPYAMQGVPYYPGYAMNPSF---YQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGS 440
               Y MQG+PYYP Y  +P F   Y  M+ P  +    I         ++H   E+    
Sbjct: 334  FQGYPMQGMPYYPSYPGSPFFQQPYPSMEDPRLNAGQRIQKRHSMESRDSHTGSETWEME 393

Query: 439  NKRTQ 425
              ++Q
Sbjct: 394  RAKSQ 398


>ref|XP_002281562.2| PREDICTED: uncharacterized protein LOC100251059 [Vitis vinifera]
          Length = 1292

 Score =  255 bits (651), Expect = 6e-65
 Identities = 172/422 (40%), Positives = 217/422 (51%), Gaps = 8/422 (1%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD  VFQLTPTRTRC+L+ITANG+TEK+ASGL  PF+ HLKTA DQIAKGGYSI
Sbjct: 1    MKSSTLLDSAVFQLTPTRTRCDLIITANGKTEKIASGLLNPFLAHLKTAQDQIAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIGAE 1280
             LEP PGS D  WF KGT+ERFVRFVSTPE++ER+ T              Q    +G  
Sbjct: 61   ILEPKPGS-DATWFAKGTVERFVRFVSTPEVLERVYTIESEIIQIGEAIAIQSNNDLGLS 119

Query: 1279 EHSSKSGHDDESETGRSTVVDNS-ESALVLYE----PKQSNGKVEAAHAENSKIRLLKLL 1115
                      ES  G   V+D S E A+VLY+    P ++NG        NSK++LLK+L
Sbjct: 120  AVVDHQAKPVESIEGSKPVLDTSEEKAIVLYKPGAHPPEANG--STTQEGNSKVQLLKVL 177

Query: 1114 ETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQXX 935
            ETR+ VL KEQGM       AGFDI++M  L+ F+ECFGA RL +AC +F++LWK K   
Sbjct: 178  ETRKTVLQKEQGMAFARAVAAGFDIDHMTPLLSFAECFGASRLMDACLRFLDLWKSKH-- 235

Query: 934  XXXXXXXXXXXXETNSMRSESCF--VNETGITLNSSVYALKPRVAQDAQPLVARSESGDL 761
                           +M S+S F  +N +GITL++ V                  +    
Sbjct: 236  ---ETGQWLEIEAAEAMSSQSDFSSMNPSGITLSNMV----------------NKQKEFR 276

Query: 760  KSWKSVNSEDFQAKHQNGVSTDD-TQVHYQNPALFPWQGQQPPYGQNFHPPGFPYAMQGV 584
            ++W    +E     HQ  +   +  Q  + +    PW    PP      P   PY MQG+
Sbjct: 277  EAWPESLNEKPPMDHQVPLGHQEYFQGQFPHHMFPPWPIHSPP---GAVPVFQPYPMQGM 333

Query: 583  PYYPGYAMNPSFYQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGSNKRTQSRRVATS 404
            PYY  Y  N SF Q    PM D        P Y       S +SR   +  T+S      
Sbjct: 334  PYYQNYPGNGSFVQPPYPPMEDSR----FSPGYRMGQKRHSMDSR---DSNTESETWDAD 386

Query: 403  KA 398
            KA
Sbjct: 387  KA 388


>gb|EXB93730.1| hypothetical protein L484_011725 [Morus notabilis]
          Length = 1278

 Score =  253 bits (646), Expect = 2e-64
 Identities = 193/536 (36%), Positives = 249/536 (46%), Gaps = 18/536 (3%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD  VFQLTPTRTRC+LVI+ANG+TEK+ASGL  PF+ HLKTA +Q+AKGGYSI
Sbjct: 1    MKSSTRLDSAVFQLTPTRTRCDLVISANGKTEKIASGLLNPFLAHLKTAQEQMAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIGAE 1280
             LEP PGS DV WFTKGT+ERFVRFVSTPE++ER+ T              Q        
Sbjct: 61   ILEPEPGS-DVSWFTKGTVERFVRFVSTPEVLERVYTLESEILQIEEAIAIQGNNETAPS 119

Query: 1279 EHSSKSGHDDESETGRSTVVDN-SESALVLYEP--KQSNGKVEAAHAENSKIRLLKLLET 1109
                      ES  G  +++D+  E A+VLY+P          AA   NSK++LLK+LET
Sbjct: 120  TVEESPAKPTESIEGNRSLLDSGDEKAIVLYKPGVHPPESNESAAQEGNSKVQLLKVLET 179

Query: 1108 RRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQXXXX 929
            R+ VL KEQGM       AGFDI+N+  L+ FS CFGA RL +AC +F ELWKKK     
Sbjct: 180  RKTVLQKEQGMAFARAVAAGFDIDNISPLMSFSVCFGASRLMDACKRFKELWKKKH---E 236

Query: 928  XXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDLKSWK 749
                      E  S RS+   +N +GI L+S                         KSW 
Sbjct: 237  SGQWLEIEAAEAMSSRSDFSAMNASGIMLSSVA-----------------------KSWP 273

Query: 748  SVNSE---DFQAKHQNGVSTDDTQVHYQNPALFP---WQGQQP----PYGQNFHPPGF-- 605
              ++E   +   K  + +STD+       P   P   +QGQ P    P      PPG   
Sbjct: 274  ESHAEFALESNGKSSSLISTDEKPALEHQPPPGPQEYFQGQFPHQMFPPWPIHSPPGTVP 333

Query: 604  ---PYAMQGVPYYPGYAMNPSFYQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGSNK 434
                Y MQG+PYY  Y     FYQ             P +P       +P +  R G  +
Sbjct: 334  VFQAYPMQGMPYYQNYPGAGPFYQ-------------PPYPAVEDPRLNPGQ--RMGQKR 378

Query: 433  RTQSRRVATSKAKYPSINGDRXXXXXXXXXXXXXXXXXXXXXEHMENLQTRXXXXXXXXX 254
             +        +++   I+  R                           ++          
Sbjct: 379  HSMDSTNGNVESETWEIDAHR--------------------------TRSSDDAELEKEP 412

Query: 253  XXXXXXXGNKQSGRVFIRNINYITSDKHGKGTGLEDSDPEVDVEMDEEHKDQASDV 86
                   G KQSG V IRNINYI S   G+    ++S    D E+DEE +   S++
Sbjct: 413  RKRGSRSGKKQSGVVVIRNINYIAS--KGQNDSEDESRSGSDAEIDEEDRAGGSEM 466


>gb|EMJ26655.1| hypothetical protein PRUPE_ppa000302mg [Prunus persica]
          Length = 1312

 Score =  249 bits (636), Expect = 3e-63
 Identities = 186/514 (36%), Positives = 247/514 (48%), Gaps = 4/514 (0%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD  +FQLTPTRTR +LVI+ANG+TEK+ASGL  PF++HLKTA +Q+AKGGYSI
Sbjct: 1    MKSSTRLDSALFQLTPTRTRYDLVISANGKTEKIASGLLNPFLSHLKTAQEQMAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIGAE 1280
             LEP  GS D  WFTK T+ERFVRFVSTPE++ER+ T              Q    +   
Sbjct: 61   ILEPESGS-DATWFTKSTVERFVRFVSTPEVLERVYTLESEILQIEEAIAIQGNNDMALN 119

Query: 1279 EHSSKSGHDDESETGRSTVVD-NSESALVLYEP--KQSNGKVEAAHAENSKIRLLKLLET 1109
                  G   +S  G   ++D N E A+VLY+P   Q       A  ENSK++LLK+LET
Sbjct: 120  PVKENHGKPVDSIEGNRPMLDGNEEKAIVLYQPDASQPEANGSTAQGENSKVQLLKVLET 179

Query: 1108 RRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQXXXX 929
            R+ +L KEQGM       AGFDI+++  L+ F+ECFGA RL +AC ++ ELWK+K     
Sbjct: 180  RKTMLQKEQGMAFARAVAAGFDIDHLPPLISFAECFGASRLMDACRRYKELWKRKH---E 236

Query: 928  XXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDLKSWK 749
                      ET + RSE   +N +GI L+S                V   ++  L ++ 
Sbjct: 237  TGQWLEIEAAETVATRSEFSAMNASGIMLSS----------------VTNKQNEILSAY- 279

Query: 748  SVNSEDFQAKHQNGVSTDD-TQVHYQNPALFPWQGQQPPYGQNFHPPGFPYAMQGVPYYP 572
             ++ E     HQ  +S  +     + +    PW     P     +P   PY MQG+PYY 
Sbjct: 280  -LSEEKLPVDHQQPLSHQEYFPGQFPHQMFPPWPVHSSPGALPVYP---PYPMQGMPYYQ 335

Query: 571  GYAMNPSFYQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGSNKRTQSRRVATSKAKY 392
             Y  N  F+Q                P Y      P+ E  R      Q +R+   +   
Sbjct: 336  NYPGNSPFFQ----------------PPY------PTVEDPR----LNQGQRMKQKRHSM 369

Query: 391  PSINGDRXXXXXXXXXXXXXXXXXXXXXEHMENLQTRXXXXXXXXXXXXXXXXGNKQSGR 212
             S NG+                         E+L++R                G KQSG 
Sbjct: 370  DSANGN--LESETLETDGLRTRSSDDAELENESLKSR-------ESRKKGSRSGKKQSGT 420

Query: 211  VFIRNINYITSDKHGKGTGLEDSDPEVDVEMDEE 110
            V IRNINYITS   GK +   +S    D + DEE
Sbjct: 421  VVIRNINYITS--KGKNSSDSESQSTSDSQTDEE 452


>ref|XP_006403265.1| hypothetical protein EUTSA_v10003134mg [Eutrema salsugineum]
            gi|557104378|gb|ESQ44718.1| hypothetical protein
            EUTSA_v10003134mg [Eutrema salsugineum]
          Length = 1189

 Score =  246 bits (627), Expect = 4e-62
 Identities = 164/429 (38%), Positives = 220/429 (51%), Gaps = 9/429 (2%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD VVFQLTPTRTRC+L++TANG+TEK+A+GL  PF+ HLKTA DQ+AKGGYSI
Sbjct: 1    MKSSTRLDSVVFQLTPTRTRCDLLVTANGKTEKIATGLLDPFLAHLKTAQDQVAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINT--XXXXXXXXXXXXXXQCGEAIG 1286
             L+ P  S +  WFTKGT+ERFVRFVS P+++ER+ T                 C  A+ 
Sbjct: 61   ILK-PEDSDNAAWFTKGTIERFVRFVSNPDVLERVYTLETEIIQMKEAIGIQKNCEMALT 119

Query: 1285 AEEHSSKSGHDDESETGRSTVVDNSESALVLYE----PKQSNGKVEAAHAENSKIRLLKL 1118
              E   +    + +E  R  +  N E A+VLYE    PKQ+N    AA  ENSK+++LK+
Sbjct: 120  VVEEDLRGKRVESTEGSRPLLQLNEEKAIVLYEPDSHPKQANR--SAASDENSKVQVLKV 177

Query: 1117 LETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQX 938
            LETR++VL KEQGM       AGFD+++M  L+ F + FGA RL +AC K+ +LWKKK  
Sbjct: 178  LETRKIVLQKEQGMAFARAVAAGFDVDDMLPLISFGKSFGASRLMDACLKYKDLWKKKHE 237

Query: 937  XXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDLK 758
                          T    S    +N++GI     ++A    + +D+ P     E+ D+K
Sbjct: 238  TGQWVEIEATEVMATQPNISA---MNDSGI-----MFANAANMRRDSWP--GTPENSDVK 287

Query: 757  SWKSVNSEDFQAKHQNGVSTDDTQVHYQNPALFPWQGQQPPYGQNFHPPGFPYAMQGVPY 578
            S            ++  V+ +  Q  +  P   PW    PP      P    Y MQG+PY
Sbjct: 288  S---------PTDNKPNVNQEHVQGQHPQPKYAPWPVHSPP---GTFPIFQGYTMQGMPY 335

Query: 577  YPGY---AMNPSFYQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGSNKRTQSRRVAT 407
            YP Y   +  PS Y                  H S S    SE+  R   K    RR  +
Sbjct: 336  YPSYPGASPYPSPYPSTDDSRRSSCQRKAKKHHSSSSEDSGSEDQEREKGK--SGRRRKS 393

Query: 406  SKAKYPSIN 380
             K    +IN
Sbjct: 394  GKVVIRNIN 402


>ref|XP_006403264.1| hypothetical protein EUTSA_v10003134mg [Eutrema salsugineum]
            gi|557104377|gb|ESQ44717.1| hypothetical protein
            EUTSA_v10003134mg [Eutrema salsugineum]
          Length = 1019

 Score =  246 bits (627), Expect = 4e-62
 Identities = 164/429 (38%), Positives = 220/429 (51%), Gaps = 9/429 (2%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD VVFQLTPTRTRC+L++TANG+TEK+A+GL  PF+ HLKTA DQ+AKGGYSI
Sbjct: 1    MKSSTRLDSVVFQLTPTRTRCDLLVTANGKTEKIATGLLDPFLAHLKTAQDQVAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINT--XXXXXXXXXXXXXXQCGEAIG 1286
             L+ P  S +  WFTKGT+ERFVRFVS P+++ER+ T                 C  A+ 
Sbjct: 61   ILK-PEDSDNAAWFTKGTIERFVRFVSNPDVLERVYTLETEIIQMKEAIGIQKNCEMALT 119

Query: 1285 AEEHSSKSGHDDESETGRSTVVDNSESALVLYE----PKQSNGKVEAAHAENSKIRLLKL 1118
              E   +    + +E  R  +  N E A+VLYE    PKQ+N    AA  ENSK+++LK+
Sbjct: 120  VVEEDLRGKRVESTEGSRPLLQLNEEKAIVLYEPDSHPKQANR--SAASDENSKVQVLKV 177

Query: 1117 LETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQX 938
            LETR++VL KEQGM       AGFD+++M  L+ F + FGA RL +AC K+ +LWKKK  
Sbjct: 178  LETRKIVLQKEQGMAFARAVAAGFDVDDMLPLISFGKSFGASRLMDACLKYKDLWKKKHE 237

Query: 937  XXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDLK 758
                          T    S    +N++GI     ++A    + +D+ P     E+ D+K
Sbjct: 238  TGQWVEIEATEVMATQPNISA---MNDSGI-----MFANAANMRRDSWP--GTPENSDVK 287

Query: 757  SWKSVNSEDFQAKHQNGVSTDDTQVHYQNPALFPWQGQQPPYGQNFHPPGFPYAMQGVPY 578
            S            ++  V+ +  Q  +  P   PW    PP      P    Y MQG+PY
Sbjct: 288  S---------PTDNKPNVNQEHVQGQHPQPKYAPWPVHSPP---GTFPIFQGYTMQGMPY 335

Query: 577  YPGY---AMNPSFYQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGSNKRTQSRRVAT 407
            YP Y   +  PS Y                  H S S    SE+  R   K    RR  +
Sbjct: 336  YPSYPGASPYPSPYPSTDDSRRSSCQRKAKKHHSSSSEDSGSEDQEREKGK--SGRRRKS 393

Query: 406  SKAKYPSIN 380
             K    +IN
Sbjct: 394  GKVVIRNIN 402


>ref|XP_002528832.1| conserved hypothetical protein [Ricinus communis]
            gi|223531744|gb|EEF33566.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1280

 Score =  244 bits (623), Expect = 1e-61
 Identities = 158/382 (41%), Positives = 210/382 (54%), Gaps = 7/382 (1%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T LD  VFQLTPTRTRCELVI+ANG+TEK+ASGL  PF+ HLKTA DQ+AKGGYSI
Sbjct: 1    MKYSTRLDSAVFQLTPTRTRCELVISANGKTEKIASGLVNPFLAHLKTAQDQMAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIG-- 1286
             LEP PG+    WFTK T+ERFVRFVSTPEI+ER++T              Q    IG  
Sbjct: 61   ILEPEPGT-GATWFTKETVERFVRFVSTPEILERVHTLESEILQIEEAIAIQSNNDIGLN 119

Query: 1285 -AEEHSSKSGHDDESETGRSTVVDNSESALVLYE----PKQSNGKVEAAHAENSKIRLLK 1121
              E H +K       E  ++ +  N E A+VLY+    P ++NG   AAH  NSK++L+K
Sbjct: 120  MVENHQAKP--VARIEGSKALLDSNEEKAIVLYKPGSHPLEANG--SAAHEGNSKVQLMK 175

Query: 1120 LLETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQ 941
            +LETR+ VL KEQGM       AG+DI++M  L+ F+E FGA RL +AC +FM+LWK+K 
Sbjct: 176  VLETRKTVLQKEQGMAFARAVAAGYDIDHMAPLMSFAESFGATRLMDACVRFMDLWKRKH 235

Query: 940  XXXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDL 761
                          E  S RS+   +N +GI L+S+     P   +        ++   +
Sbjct: 236  ---ETGQWVEIEAAEAMSSRSDFAVMNASGIVLSSATNKQWPGTPESN----GEADVHPM 288

Query: 760  KSWKSVNSEDFQAKHQNGVSTDDTQVHYQNPALFPWQGQQPPYGQNFHPPGFPYAMQGVP 581
                S + +++            +Q H+ +P    W    PP      P    Y MQG+P
Sbjct: 289  DQQPSPSQQEY------------SQGHFPHPMYPHWPMHSPP---GALPVFQGYPMQGIP 333

Query: 580  YYPGYAMNPSFYQGMQHPMSDD 515
            YY  Y  N  +YQ   +P  +D
Sbjct: 334  YYQNYPGNGPYYQ-PPYPSGED 354


>ref|XP_006420261.1| hypothetical protein CICLE_v10004168mg [Citrus clementina]
            gi|557522134|gb|ESR33501.1| hypothetical protein
            CICLE_v10004168mg [Citrus clementina]
          Length = 1310

 Score =  243 bits (620), Expect = 2e-61
 Identities = 182/519 (35%), Positives = 255/519 (49%), Gaps = 6/519 (1%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            MK  T L+  VFQLTPTRTRC+L+I+A G+TEK+ASGL  PF+ HLKTA +Q+AKGGYSI
Sbjct: 1    MKSSTRLNSAVFQLTPTRTRCDLLISAYGKTEKMASGLLNPFLAHLKTAQEQMAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIG-- 1286
             LEP PGS D  WFTKGTLERFVRFVSTPE++ER+ T              Q    +G  
Sbjct: 61   ILEPAPGS-DASWFTKGTLERFVRFVSTPEVLERVYTIESEILQIEEAIAIQSNNEMGLS 119

Query: 1285 -AEEHSSKSGHDDESETGRSTVVDNSESALVLYEPKQSNGKVEAAHAE--NSKIRLLKLL 1115
              EE+ +K  H    E GR  +  N E A+VLY P+  + +   +  +  N K++LLK+L
Sbjct: 120  TTEENPAK--HVQSIEGGRPLLESNEEKAIVLYTPEAHSPEANGSTVQEGNPKVQLLKVL 177

Query: 1114 ETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQXX 935
            ETR++VL KEQGM       AGFD++++  L+ F+E FG+ RL++AC +F ELWK+K   
Sbjct: 178  ETRKIVLQKEQGMAFARAVAAGFDVDHIPSLMSFAESFGSSRLKDACVRFRELWKRKH-- 235

Query: 934  XXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDLKS 755
                        E  S +S+   +N +GI L+S V   K             SE+G  K+
Sbjct: 236  --ESGQWLEIEAEAMSNQSDFSALNASGIILSSMVNKQK-----------EFSENG--KA 280

Query: 754  WKSVNSEDFQAKHQNGVSTDD-TQVHYQNPALFPWQGQQPPYGQNFHPPGFPYAMQGVPY 578
                N+++    +Q      +  Q  + +    PW    PP      P    Y MQG+ Y
Sbjct: 281  GIDANADEKPTINQQPAGNQEYLQGQFPHSIFPPWPIHSPP---GALPVFQGYPMQGMAY 337

Query: 577  YPGYAMNPSFYQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGSNKRTQSRRVATSKA 398
            YP    N  ++                HP Y      P E+ R+ + +R + RR      
Sbjct: 338  YPA---NSGYF----------------HPPYP-----PMEDPRQNAGQRMRQRR------ 367

Query: 397  KYPSINGDRXXXXXXXXXXXXXXXXXXXXXEHMENLQTRXXXXXXXXXXXXXXXXGNKQS 218
             +   +GD                         E+ + +                G KQS
Sbjct: 368  -HSMDSGDSNTELQTWEMDASKVKSQDDAELDRESSRKK------------ASRSGKKQS 414

Query: 217  GRVFIRNINYITSDKHGKGTGLEDSDPEVDVEMDEEHKD 101
            G+V IRNINYIT+++  + +   +S    + E DEE  D
Sbjct: 415  GKVVIRNINYITANR--QNSSGSESQSASNSETDEEDGD 451


>ref|XP_006606379.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Glycine max]
          Length = 1116

 Score =  242 bits (617), Expect = 5e-61
 Identities = 154/379 (40%), Positives = 201/379 (53%), Gaps = 5/379 (1%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            M   T LD  VFQLTPTRTR +L+IT NG+ EK+ASGL  PF++HLK A +Q+ KGGYSI
Sbjct: 1    MNTSTRLDLAVFQLTPTRTRFDLIITVNGKKEKIASGLLNPFLSHLKAAQNQMDKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIGAE 1280
             LEPP G+ D  WFTKGT+ERFVRFVSTPEI+ER+ T              Q   ++G  
Sbjct: 61   VLEPPEGNTDTSWFTKGTVERFVRFVSTPEILERVYTVESEILQIEEAIAIQGNSSLGTN 120

Query: 1279 EHSSKSGHDDESETGRSTVVD-NSESALVLYEPK----QSNGKVEAAHAENSKIRLLKLL 1115
                      ES  GR T  D N E A+VLY+P+    Q+NG       E+SK+ LLK+L
Sbjct: 121  TVEENQVKHVESTEGRKTQQDTNEERAIVLYKPEAQPPQANGSTSL--EESSKVHLLKVL 178

Query: 1114 ETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQXX 935
            +TR+  L KEQGM       AGFDI+ +  L+ F+ECFGA R+++AC KF +LW++K   
Sbjct: 179  DTRKSALQKEQGMAFARAVAAGFDIDYIPPLMSFAECFGASRMKDACTKFRDLWRRKH-- 236

Query: 934  XXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDLKS 755
                        ET S RS+   +N +GI L        P +A  +   +    +G    
Sbjct: 237  -ETGQWLEIEAAETMSNRSDFSSLNVSGIIL--------PNMASASHTELDSESNGKA-- 285

Query: 754  WKSVNSEDFQAKHQNGVSTDDTQVHYQNPALFPWQGQQPPYGQNFHPPGFPYAMQGVPYY 575
              S N ++ Q +            H+  P   PW    PP      P   PY +QG+PYY
Sbjct: 286  -SSDNQDNIQGQFP----------HHMFP---PWPVHSPPGSVPVLP---PYPVQGIPYY 328

Query: 574  PGYAMNPSFYQGMQHPMSD 518
            P Y  +  F Q    PM D
Sbjct: 329  PAYPGSSPFMQPNYSPMED 347


>ref|XP_006606378.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
          Length = 1240

 Score =  242 bits (617), Expect = 5e-61
 Identities = 154/379 (40%), Positives = 201/379 (53%), Gaps = 5/379 (1%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            M   T LD  VFQLTPTRTR +L+IT NG+ EK+ASGL  PF++HLK A +Q+ KGGYSI
Sbjct: 1    MNTSTRLDLAVFQLTPTRTRFDLIITVNGKKEKIASGLLNPFLSHLKAAQNQMDKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIGAE 1280
             LEPP G+ D  WFTKGT+ERFVRFVSTPEI+ER+ T              Q   ++G  
Sbjct: 61   VLEPPEGNTDTSWFTKGTVERFVRFVSTPEILERVYTVESEILQIEEAIAIQGNSSLGTN 120

Query: 1279 EHSSKSGHDDESETGRSTVVD-NSESALVLYEPK----QSNGKVEAAHAENSKIRLLKLL 1115
                      ES  GR T  D N E A+VLY+P+    Q+NG       E+SK+ LLK+L
Sbjct: 121  TVEENQVKHVESTEGRKTQQDTNEERAIVLYKPEAQPPQANGSTSL--EESSKVHLLKVL 178

Query: 1114 ETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQXX 935
            +TR+  L KEQGM       AGFDI+ +  L+ F+ECFGA R+++AC KF +LW++K   
Sbjct: 179  DTRKSALQKEQGMAFARAVAAGFDIDYIPPLMSFAECFGASRMKDACTKFRDLWRRKH-- 236

Query: 934  XXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDLKS 755
                        ET S RS+   +N +GI L        P +A  +   +    +G    
Sbjct: 237  -ETGQWLEIEAAETMSNRSDFSSLNVSGIIL--------PNMASASHTELDSESNGKA-- 285

Query: 754  WKSVNSEDFQAKHQNGVSTDDTQVHYQNPALFPWQGQQPPYGQNFHPPGFPYAMQGVPYY 575
              S N ++ Q +            H+  P   PW    PP      P   PY +QG+PYY
Sbjct: 286  -SSDNQDNIQGQFP----------HHMFP---PWPVHSPPGSVPVLP---PYPVQGIPYY 328

Query: 574  PGYAMNPSFYQGMQHPMSD 518
            P Y  +  F Q    PM D
Sbjct: 329  PAYPGSSPFMQPNYSPMED 347


>ref|XP_006606377.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 1282

 Score =  242 bits (617), Expect = 5e-61
 Identities = 154/379 (40%), Positives = 201/379 (53%), Gaps = 5/379 (1%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            M   T LD  VFQLTPTRTR +L+IT NG+ EK+ASGL  PF++HLK A +Q+ KGGYSI
Sbjct: 1    MNTSTRLDLAVFQLTPTRTRFDLIITVNGKKEKIASGLLNPFLSHLKAAQNQMDKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIGAE 1280
             LEPP G+ D  WFTKGT+ERFVRFVSTPEI+ER+ T              Q   ++G  
Sbjct: 61   VLEPPEGNTDTSWFTKGTVERFVRFVSTPEILERVYTVESEILQIEEAIAIQGNSSLGTN 120

Query: 1279 EHSSKSGHDDESETGRSTVVD-NSESALVLYEPK----QSNGKVEAAHAENSKIRLLKLL 1115
                      ES  GR T  D N E A+VLY+P+    Q+NG       E+SK+ LLK+L
Sbjct: 121  TVEENQVKHVESTEGRKTQQDTNEERAIVLYKPEAQPPQANGSTSL--EESSKVHLLKVL 178

Query: 1114 ETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQXX 935
            +TR+  L KEQGM       AGFDI+ +  L+ F+ECFGA R+++AC KF +LW++K   
Sbjct: 179  DTRKSALQKEQGMAFARAVAAGFDIDYIPPLMSFAECFGASRMKDACTKFRDLWRRKH-- 236

Query: 934  XXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDLKS 755
                        ET S RS+   +N +GI L        P +A  +   +    +G    
Sbjct: 237  -ETGQWLEIEAAETMSNRSDFSSLNVSGIIL--------PNMASASHTELDSESNGKA-- 285

Query: 754  WKSVNSEDFQAKHQNGVSTDDTQVHYQNPALFPWQGQQPPYGQNFHPPGFPYAMQGVPYY 575
              S N ++ Q +            H+  P   PW    PP      P   PY +QG+PYY
Sbjct: 286  -SSDNQDNIQGQFP----------HHMFP---PWPVHSPPGSVPVLP---PYPVQGIPYY 328

Query: 574  PGYAMNPSFYQGMQHPMSD 518
            P Y  +  F Q    PM D
Sbjct: 329  PAYPGSSPFMQPNYSPMED 347


>gb|ESW16027.1| hypothetical protein PHAVU_007G123500g [Phaseolus vulgaris]
            gi|561017224|gb|ESW16028.1| hypothetical protein
            PHAVU_007G123500g [Phaseolus vulgaris]
          Length = 1290

 Score =  240 bits (612), Expect = 2e-60
 Identities = 153/380 (40%), Positives = 205/380 (53%), Gaps = 6/380 (1%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            M   T LD  VFQLTPTRTR +LVITANG+ EK+ASGL  PF++HLK A +Q+ KGGYSI
Sbjct: 1    MNASTRLDSAVFQLTPTRTRFDLVITANGKKEKIASGLLNPFLSHLKAAQNQMEKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIGAE 1280
             LEPP G+ D  WFTKGT+ERFVRFVSTPEI+ER++T              Q   ++G  
Sbjct: 61   VLEPPEGNSDTSWFTKGTVERFVRFVSTPEILERVHTAESEILQIEEAIVIQGNNSLGIS 120

Query: 1279 EHSSKSGHDDESETGRSTVVDNS-ESALVLY----EPKQSNGKVEAAHAENSKIRLLKLL 1115
                      ES  GR T  DN+ E A+VLY    +P Q+ G   ++   NSK+ LLK+L
Sbjct: 121  TVEENQMKHVESTEGRKTQQDNNEEKAIVLYKPDAQPPQAKGTTTSSEV-NSKVHLLKVL 179

Query: 1114 ETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQXX 935
            E R+  L KEQGM       AGFD++ +  L+ F+ECFGA R+++AC KF++LW++K   
Sbjct: 180  ELRKSALQKEQGMAFARAVAAGFDVDYIPPLMSFAECFGASRMKDACTKFIDLWRRKH-- 237

Query: 934  XXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGDLKS 755
                        ET S RS+   +N +GI   + V A    +  ++    A S+   +  
Sbjct: 238  -ETGQWLEIEAAETMSNRSDFSALNVSGIIPPNMVSASHTELDSESNG-KASSDVPPMDR 295

Query: 754  WKSVNSEDFQAKHQNGVSTDDTQVHYQNPALF-PWQGQQPPYGQNFHPPGFPYAMQGVPY 578
              S+ ++D+              +  Q P +F PW    PP      P   P  +QG+PY
Sbjct: 296  QPSIGNQDY--------------IQGQFPHMFSPWPIHSPP---GALPVFQPCPVQGIPY 338

Query: 577  YPGYAMNPSFYQGMQHPMSD 518
            Y  Y  N  F Q    PM D
Sbjct: 339  YQAYPGNSPFVQPNYSPMED 358


>ref|XP_004980952.1| PREDICTED: uncharacterized protein LOC101783885 [Setaria italica]
          Length = 1255

 Score =  238 bits (608), Expect = 6e-60
 Identities = 167/417 (40%), Positives = 216/417 (51%), Gaps = 11/417 (2%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            M+PET L+  VFQLTPTRTRC+LV+ ANG  EK+ASGL  PFV HLK A +QIAKGGYSI
Sbjct: 1    MRPETRLESAVFQLTPTRTRCDLVVVANGWKEKIASGLLNPFVAHLKVAQEQIAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIGAE 1280
             LEP P   D  WFT+GT+ERFVRFVSTPE++ER+ T              Q  +++G  
Sbjct: 61   TLEPDP-EIDAPWFTRGTVERFVRFVSTPEVLERVTTIESEILQIEDAIAVQGNDSLGLR 119

Query: 1279 EHSSKSGHDDESETGRSTVVD-NSESALVLYE-------PKQSNGKVEAAHAENSKIRLL 1124
                 +G   +   G  T+ D +++ ALV Y+       P Q+NG   A   ENSK +LL
Sbjct: 120  SVEDHNGKSVDCMEGSKTIFDPDADMALVPYKAGTQPTLPVQNNG---ATQEENSKAQLL 176

Query: 1123 KLLETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKK 944
            ++LETR+ VL KEQ M       AGFDI+N+  L+ F+E FGA RL +AC  F+ LWK+K
Sbjct: 177  RVLETRKTVLRKEQAMAFARAVAAGFDIDNLVYLITFAERFGASRLMKACTHFIGLWKQK 236

Query: 943  QXXXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKPRVAQDAQPLVARSESGD 764
                           E  S RSE    N +GI        +   + Q  + +     +GD
Sbjct: 237  H----ETGQWIEVEPEAMSARSEFAPFNPSGIMF------MGDNMKQTMETMSV--SNGD 284

Query: 763  LKSWKSVNSEDFQAKHQNGVSTDDTQVHYQNPALF---PWQGQQPPYGQNFHPPGFPYAM 593
                   N ED     Q       T  H   P  F   P+Q   PP+    HP   PY+M
Sbjct: 285  ------ANGEDASKADQR------TSQHSGAPHEFFHGPYQSAYPPWA--MHP---PYSM 327

Query: 592  QGVPYYPGYAMNPSFYQGMQHPMSDDHAHIPLHPHYSPSNAHPSEESRRGSNKRTQS 422
            QG+PYYPG  MNP  Y    +P  DD  H       S  ++  S++S    +   QS
Sbjct: 328  QGMPYYPG--MNP--YYPSPYPSMDDTRHHHSERRASKKHSSDSKDSETSDDGSDQS 380


>ref|XP_004296379.1| PREDICTED: uncharacterized protein LOC101304269 [Fragaria vesca
            subsp. vesca]
          Length = 1291

 Score =  238 bits (608), Expect = 6e-60
 Identities = 168/438 (38%), Positives = 227/438 (51%), Gaps = 24/438 (5%)
 Frame = -2

Query: 1639 MKPETELDFVVFQLTPTRTRCELVITANGETEKLASGLFQPFVTHLKTAADQIAKGGYSI 1460
            M+  T LD  +FQLTPTRTRC+LVI+ANG+TEK+ASGL  PF++HLKTA +Q+AKGGYSI
Sbjct: 1    MRSSTRLDSALFQLTPTRTRCDLVISANGKTEKIASGLLNPFLSHLKTAQEQMAKGGYSI 60

Query: 1459 RLEPPPGSQDVCWFTKGTLERFVRFVSTPEIVERINTXXXXXXXXXXXXXXQCGEAIG-- 1286
             LEP  GS D  WFTK T+ERFVRFVSTPE++ER+ +              Q     G  
Sbjct: 61   ILEPESGS-DAAWFTKSTVERFVRFVSTPEVLERVYSLESEILQIEEAITIQGNHDTGYN 119

Query: 1285 -AEEHSSKSGHDDESETGRSTVVDNSESALVLYE----PKQSNGKVEAAHAENSKIRLLK 1121
              EE+  K    D  E  R  +  N E A+VLYE      ++NG   AA  ENSK++LLK
Sbjct: 120  PVEENHEKP--LDIIEGNRPILDSNEEKAIVLYEAGARKPETNG--SAAQGENSKVQLLK 175

Query: 1120 LLETRRMVLHKEQGMXXXXXXXAGFDIENMRDLVLFSECFGAFRLREACFKFMELWKKKQ 941
            +LETR+ +L KEQGM       AGFD++++  L+ F+ECFGA RL +AC ++ ELWK+K 
Sbjct: 176  VLETRKKMLQKEQGMAFARAVAAGFDVDHLPPLISFAECFGASRLMDACRRYKELWKRKH 235

Query: 940  XXXXXXXXXXXXXXETNSMRSESCFVNETGITLNSSVYALKP-RVAQDAQPLVARSESGD 764
                          E  S R +    N +GI L+S     KP  +A++   + +  E   
Sbjct: 236  ---ETGQWLEIEAAEAMSNRGDFSTTNASGIVLSSMTN--KPNEMAENNGKVTSADEKPP 290

Query: 763  LKSWKSVNSEDFQAKHQNGVSTDDTQVHYQNPALFPWQGQQPPYGQNFHPPGF-----PY 599
            L+   S+  +++                   P  FP Q   PP+    H PG      PY
Sbjct: 291  LEHQPSLGHQEY------------------FPGQFPHQ-MFPPW--PVHSPGALPGYPPY 329

Query: 598  AMQGVPYYPGYAMNPSFYQGMQHPMSDDHAH-----------IPLHPHYSPSNAHPSEES 452
             MQG+PYY  Y  N  F+Q     + D   +           +   PH   S A   + S
Sbjct: 330  PMQGMPYYQNYPGNGPFFQPPYTTVEDPRLNQSQKRKQKRHSMDGSPHNDESEAWELDAS 389

Query: 451  RRGSNKRTQSRRVATSKA 398
            R  S+  T+  R +  K+
Sbjct: 390  RTRSSDDTELERESRKKS 407

