BLASTX nr result

ID: Paeonia22_contig00016355 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00016355
         (2471 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251...   705   0.0  
ref|XP_007027105.1| Uncharacterized protein isoform 1 [Theobroma...   693   0.0  
ref|XP_007027106.1| Uncharacterized protein isoform 2 [Theobroma...   655   0.0  
ref|XP_007027107.1| Uncharacterized protein isoform 3 [Theobroma...   644   0.0  
ref|XP_007218931.1| hypothetical protein PRUPE_ppa003346mg [Prun...   629   e-177
ref|XP_006341066.1| PREDICTED: uncharacterized protein LOC102588...   617   e-174
ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206...   613   e-172
ref|XP_004246472.1| PREDICTED: uncharacterized protein LOC101267...   613   e-172
ref|XP_006341068.1| PREDICTED: uncharacterized protein LOC102588...   610   e-172
ref|XP_007208469.1| hypothetical protein PRUPE_ppa004081mg [Prun...   603   e-169
ref|XP_007009432.1| Uncharacterized protein isoform 1 [Theobroma...   595   e-167
gb|EXB65298.1| hypothetical protein L484_025377 [Morus notabilis]     590   e-165
ref|XP_007009434.1| Uncharacterized protein isoform 3 [Theobroma...   587   e-165
ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cuc...   580   e-162
ref|XP_002309044.2| hypothetical protein POPTR_0006s08280g [Popu...   572   e-160
ref|XP_007009436.1| Uncharacterized protein isoform 5 [Theobroma...   564   e-158
ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787...   561   e-157
ref|XP_007139258.1| hypothetical protein PHAVU_008G014500g [Phas...   554   e-155
ref|XP_007009437.1| Uncharacterized protein isoform 6 [Theobroma...   551   e-154
ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782...   549   e-153

>ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera]
          Length = 599

 Score =  705 bits (1819), Expect = 0.0
 Identities = 380/621 (61%), Positives = 435/621 (70%), Gaps = 4/621 (0%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MSFQNKGFWMAK  GC+TDGE+AYDN SR+EPKR HQWFMDG+E ELFPNKKQAVEV N+
Sbjct: 62   MSFQNKGFWMAKGVGCVTDGEMAYDNPSRIEPKRSHQWFMDGTE-ELFPNKKQAVEVPNS 120

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
              F G+S+PN+SPW NAS F SVSGHFTERLFD EAART+NFDDRN I SV A NMNM R
Sbjct: 121  NLFPGLSNPNVSPWANASGFHSVSGHFTERLFDPEAARTVNFDDRN-IPSVGAGNMNMAR 179

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGD 1539
            KVIEDPF N+S FGL                                             
Sbjct: 180  KVIEDPFGNESLFGL--------------------------------------------- 194

Query: 1538 NSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDN 1359
                SM+HS       L        N G    + +    D E+  + +SMG  YT+ D+N
Sbjct: 195  ----SMSHSLEDPRSGL--------NYGGIRKVKVSQVKDSENIMS-VSMGHTYTRADNN 241

Query: 1358 ISLSHAYKGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDK 1179
                        TMSM H Y K D + ISMG  YN          KGD NI+S+  +Y +
Sbjct: 242  ------------TMSMAHAYNKGDGNSISMGLTYN----------KGDDNILSISDSYGR 279

Query: 1178 ENDSSISMGHPFSKGDSNIISMGQSYKENDSTITMSHSFSKGDGNIISMGQSYNKGSENV 999
            E+++ ISMG  ++KGD NI +M  +YK  D+TI+M H+FSKGD NIISMGQ+YNKG +N 
Sbjct: 280  EDNNFISMGQAYNKGDENI-AMSHTYKGGDNTISMGHTFSKGDNNIISMGQTYNKGDDNT 338

Query: 998  ISMGHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGY---DDDTNPSGR 828
            ISMGHIYNK +EN I   H+Y K DN+NLS+GH Y+KG+S IISFGG+   DDDTNPSGR
Sbjct: 339  ISMGHIYNKGDENTISMGHTY-KGDNSNLSIGHSYNKGESNIISFGGFHDDDDDTNPSGR 397

Query: 827  LISSYDLLMGQPSVQSSEASNEKELVES-AHSLVGTGQVFSSGTETVXXXXXXXXXXXXX 651
            L+ SYDLLMGQPSVQ SEA NEK+LVES A +L+ T Q+ +SG+ETV             
Sbjct: 398  LVCSYDLLMGQPSVQRSEALNEKKLVESNADALISTAQITASGSETVSKKKEEQKLSKKV 457

Query: 650  XPNSFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYE 471
             PN+FPSNVRSLLSTGMLDGVPVKYIAWSRE ELRG+IKGSGYLCGCQSCNFSK INAYE
Sbjct: 458  PPNNFPSNVRSLLSTGMLDGVPVKYIAWSRE-ELRGIIKGSGYLCGCQSCNFSKVINAYE 516

Query: 470  FERHAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWK 291
            FERHAGCKTKHPNNHIYFENGKTIYGIVQEL+STPQN LFDVIQTITGSPINQKSFR WK
Sbjct: 517  FERHAGCKTKHPNNHIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPINQKSFRLWK 576

Query: 290  DSFLAATRELQRIYGKDEGKR 228
            +SFLAATRELQRIYGK+EGK+
Sbjct: 577  ESFLAATRELQRIYGKEEGKQ 597


>ref|XP_007027105.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508715710|gb|EOY07607.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 539

 Score =  693 bits (1788), Expect = 0.0
 Identities = 366/618 (59%), Positives = 437/618 (70%), Gaps = 1/618 (0%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MSFQN+GFWM+K +GC+ DGE+AYDNSSR+EPKR HQWFMDG E + FPNKKQAV V   
Sbjct: 1    MSFQNQGFWMSKGAGCINDGEMAYDNSSRIEPKRSHQWFMDGPETDSFPNKKQAVGVPTT 60

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
              F G+ + ++S WGN+SSF S+SGHF ERLFD+E AR +NFDD+ +I S S E ++MGR
Sbjct: 61   NLFSGVLNSHVSQWGNSSSFHSISGHFAERLFDTETARAVNFDDQ-SIPSGSTEKVDMGR 119

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGD 1539
            KV ED F NDS+FGLS+SHT+EDPR GLNYGG RKVKV QVKDSENVMS+ M H +DR D
Sbjct: 120  KVNEDLFANDSSFGLSMSHTMEDPRSGLNYGGFRKVKVCQVKDSENVMSVSMAHAYDRVD 179

Query: 1538 NSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDN 1359
             +S+S  H YNK  D  ISMGLAY NKGD++++SIGD+Y+RE+ N FISMGQ Y K +D+
Sbjct: 180  KNSVSTDHGYNKVEDGNISMGLAY-NKGDENLMSIGDSYEREN-NVFISMGQSYNKSEDS 237

Query: 1358 ISLSHAYKGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDK 1179
            I++   YK +   ++M +T+ K DN+ +SMGQ +NR +D          N I++G TY K
Sbjct: 238  ITVGQTYKESTSAIAMSNTFDKGDNNFMSMGQTFNRTDD----------NSITVGHTYGK 287

Query: 1178 ENDSSISMGHPFSKGDSNIISMGQSYKENDSTITMSHSFSKGDGNIISMGQSYNKGSENV 999
             +DS+IS+ H +++GD+N +S+G SY             SKG+  IIS G          
Sbjct: 288  GDDSAISISHSYNRGDNNNLSIGPSY-------------SKGESTIISFGG--------- 325

Query: 998  ISMGHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDDDTNPSGRLIS 819
                  Y+ DE+                                       TN +GRLIS
Sbjct: 326  ------YDDDED---------------------------------------TNQTGRLIS 340

Query: 818  SYDLLMGQPSVQSSEASNEKELVES-AHSLVGTGQVFSSGTETVXXXXXXXXXXXXXXPN 642
            SYDLLMGQPSVQ S+A NEKE+V+S A +LV TG + +SG E V               N
Sbjct: 341  SYDLLMGQPSVQRSDAPNEKEMVKSNADALVPTGNITASGME-VSRKKEDPKTAKKVSSN 399

Query: 641  SFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYEFER 462
            +FPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGY CGCQ+CNFSK INAYEFER
Sbjct: 400  NFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYQCGCQTCNFSKVINAYEFER 459

Query: 461  HAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWKDSF 282
            HAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQ MLFDVIQTITGSPINQKSFR WK+SF
Sbjct: 460  HAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQTMLFDVIQTITGSPINQKSFRLWKESF 519

Query: 281  LAATRELQRIYGKDEGKR 228
            LAATRELQRIYGKDEGK+
Sbjct: 520  LAATRELQRIYGKDEGKK 537


>ref|XP_007027106.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508715711|gb|EOY07608.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 523

 Score =  655 bits (1690), Expect = 0.0
 Identities = 348/600 (58%), Positives = 417/600 (69%), Gaps = 1/600 (0%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MSFQN+GFWM+K +GC+ DGE+AYDNSSR+EPKR HQWFMDG E + FPNKKQAV V   
Sbjct: 1    MSFQNQGFWMSKGAGCINDGEMAYDNSSRIEPKRSHQWFMDGPETDSFPNKKQAVGVPTT 60

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
              F G+ + ++S WGN+SSF S+SGHF ERLFD+E AR +NFDD+ +I S S E ++MGR
Sbjct: 61   NLFSGVLNSHVSQWGNSSSFHSISGHFAERLFDTETARAVNFDDQ-SIPSGSTEKVDMGR 119

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGD 1539
            KV ED F NDS+FGLS+SHT+EDPR GLNYGG RKVKV QVKDSENVMS+ M H +DR D
Sbjct: 120  KVNEDLFANDSSFGLSMSHTMEDPRSGLNYGGFRKVKVCQVKDSENVMSVSMAHAYDRVD 179

Query: 1538 NSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDN 1359
             +S+S  H YNK  D  ISMGLAY NKGD++++SIGD+Y+RE+ N FISMGQ Y K +D+
Sbjct: 180  KNSVSTDHGYNKVEDGNISMGLAY-NKGDENLMSIGDSYEREN-NVFISMGQSYNKSEDS 237

Query: 1358 ISLSHAYKGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDK 1179
            I++   YK +   ++M +T+ K DN+ +SMGQ +NR +D          N I++G TY K
Sbjct: 238  ITVGQTYKESTSAIAMSNTFDKGDNNFMSMGQTFNRTDD----------NSITVGHTYGK 287

Query: 1178 ENDSSISMGHPFSKGDSNIISMGQSYKENDSTITMSHSFSKGDGNIISMGQSYNKGSENV 999
             +DS+IS+ H +++GD+N +S+G SY             SKG+  IIS G          
Sbjct: 288  GDDSAISISHSYNRGDNNNLSIGPSY-------------SKGESTIISFGG--------- 325

Query: 998  ISMGHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDDDTNPSGRLIS 819
                  Y+ DE+                                       TN +GRLIS
Sbjct: 326  ------YDDDED---------------------------------------TNQTGRLIS 340

Query: 818  SYDLLMGQPSVQSSEASNEKELVES-AHSLVGTGQVFSSGTETVXXXXXXXXXXXXXXPN 642
            SYDLLMGQPSVQ S+A NEKE+V+S A +LV TG + +SG E V               N
Sbjct: 341  SYDLLMGQPSVQRSDAPNEKEMVKSNADALVPTGNITASGME-VSRKKEDPKTAKKVSSN 399

Query: 641  SFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYEFER 462
            +FPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGY CGCQ+CNFSK INAYEFER
Sbjct: 400  NFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYQCGCQTCNFSKVINAYEFER 459

Query: 461  HAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWKDSF 282
            HAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQ MLFDVIQTITGSPINQKSFR WK  F
Sbjct: 460  HAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQTMLFDVIQTITGSPINQKSFRLWKVLF 519


>ref|XP_007027107.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508715712|gb|EOY07609.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 510

 Score =  644 bits (1660), Expect = 0.0
 Identities = 351/618 (56%), Positives = 416/618 (67%), Gaps = 1/618 (0%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MSFQN+GFWM+K +GC+ DGE+AYDNSSR+EPKR HQWFMDG E + FPNKKQAV V   
Sbjct: 1    MSFQNQGFWMSKGAGCINDGEMAYDNSSRIEPKRSHQWFMDGPETDSFPNKKQAVGVP-- 58

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
                                       T  LFD+E AR +NFDD++ I S S E ++MGR
Sbjct: 59   ---------------------------TTNLFDTETARAVNFDDQS-IPSGSTEKVDMGR 90

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGD 1539
            KV ED F NDS+FGLS+SHT+EDPR GLNYGG RKVKV QVKDSENVMS+ M H +DR D
Sbjct: 91   KVNEDLFANDSSFGLSMSHTMEDPRSGLNYGGFRKVKVCQVKDSENVMSVSMAHAYDRVD 150

Query: 1538 NSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDN 1359
             +S+S  H YNK  D  ISMGLAY NKGD++++SIGD+Y+RE+ N FISMGQ Y K +D+
Sbjct: 151  KNSVSTDHGYNKVEDGNISMGLAY-NKGDENLMSIGDSYEREN-NVFISMGQSYNKSEDS 208

Query: 1358 ISLSHAYKGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDK 1179
            I++   YK +   ++M +T+ K DN+ +SMGQ +NR +D          N I++G TY K
Sbjct: 209  ITVGQTYKESTSAIAMSNTFDKGDNNFMSMGQTFNRTDD----------NSITVGHTYGK 258

Query: 1178 ENDSSISMGHPFSKGDSNIISMGQSYKENDSTITMSHSFSKGDGNIISMGQSYNKGSENV 999
             +DS+IS+ H +++GD+N +S+G SY             SKG+  IIS G          
Sbjct: 259  GDDSAISISHSYNRGDNNNLSIGPSY-------------SKGESTIISFGG--------- 296

Query: 998  ISMGHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDDDTNPSGRLIS 819
                  Y+ DE+                                       TN +GRLIS
Sbjct: 297  ------YDDDED---------------------------------------TNQTGRLIS 311

Query: 818  SYDLLMGQPSVQSSEASNEKELVES-AHSLVGTGQVFSSGTETVXXXXXXXXXXXXXXPN 642
            SYDLLMGQPSVQ S+A NEKE+V+S A +LV TG + +SG E V               N
Sbjct: 312  SYDLLMGQPSVQRSDAPNEKEMVKSNADALVPTGNITASGME-VSRKKEDPKTAKKVSSN 370

Query: 641  SFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYEFER 462
            +FPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGY CGCQ+CNFSK INAYEFER
Sbjct: 371  NFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYQCGCQTCNFSKVINAYEFER 430

Query: 461  HAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWKDSF 282
            HAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQ MLFDVIQTITGSPINQKSFR WK+SF
Sbjct: 431  HAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQTMLFDVIQTITGSPINQKSFRLWKESF 490

Query: 281  LAATRELQRIYGKDEGKR 228
            LAATRELQRIYGKDEGK+
Sbjct: 491  LAATRELQRIYGKDEGKK 508


>ref|XP_007218931.1| hypothetical protein PRUPE_ppa003346mg [Prunus persica]
            gi|462415393|gb|EMJ20130.1| hypothetical protein
            PRUPE_ppa003346mg [Prunus persica]
          Length = 583

 Score =  629 bits (1621), Expect = e-177
 Identities = 330/619 (53%), Positives = 428/619 (69%), Gaps = 5/619 (0%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MSFQ K FW+ + + CLTDGE+ YDNSSR+E KR ++WFMD + +E F NKKQA+E  N 
Sbjct: 1    MSFQPKSFWIPRDASCLTDGEMGYDNSSRIESKRGNRWFMDSNGLEFFNNKKQAMEAVNG 60

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
             P  G+    ISPW N S FQSV G FT+RLF SE  RT+N  DR NI SV +ENMN+GR
Sbjct: 61   RPVSGVPHLAISPWDNTSGFQSVPGQFTDRLFGSEPVRTVNLGDR-NIQSVGSENMNLGR 119

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGD 1539
            K  ED + ND + GLS+SHT+EDP   LN+GGIRKVKV++V+DS++V+S  MGH + +GD
Sbjct: 120  KGFEDQYGNDPSVGLSMSHTIEDPSSCLNFGGIRKVKVNEVRDSDDVVSASMGHSYCKGD 179

Query: 1538 NSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDN 1359
            ++++SMA++YNK+ DN IS+G AYN  G+++ ISIG ++++ D +NFISMG  ++K + N
Sbjct: 180  SNTMSMANTYNKSDDNAISLGSAYNT-GEENAISIGPSFNKAD-DNFISMGHTFSKANSN 237

Query: 1358 ISLSHAYKGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDK 1179
                         +SM H Y K DN ++SMGQ          PF K D N ISMGQ+Y+K
Sbjct: 238  F------------ISMAHNYNKGDNSILSMGQ----------PFDKEDGNFISMGQSYEK 275

Query: 1178 ENDSSISMGHPFSKGDSNIISMGQSY-KENDSTITMSHSFSKGDGNIISMGQSYNKGSEN 1002
             + S IS+G+ + KG  N ISMG +Y K N++ I+M+ ++ K   N++SMG +Y+K   N
Sbjct: 276  GDSSFISLGNSYHKGHENFISMGATYGKANENFISMAPTYDKQTDNMMSMGPNYDKADSN 335

Query: 1001 VISMGHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGY--DDDTNPSGR 828
            V+ +G  Y+K E               +N+SM H Y+K +ST ISFG +  + DTNPSG 
Sbjct: 336  VVPIGPPYHKGE---------------SNVSMSHNYNKNESTTISFGSFHHETDTNPSGG 380

Query: 827  LISSYDLLM-GQPSVQSSEASNEKELVES-AHSLVGTGQVFSSGTETVXXXXXXXXXXXX 654
            +ISSYDLLM  Q + + SE S  K+ ++S     V       S T+TV            
Sbjct: 381  IISSYDLLMNNQNTAEQSEESGLKDPIQSNMDPNVDDALKLDSKTDTV-SKIKEPKTARK 439

Query: 653  XXPNSFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAY 474
              PN+FPSNV+SLLSTGM DGVPVKY++WSREK L+G+IKG+GYLC C  CN SK++NAY
Sbjct: 440  APPNNFPSNVKSLLSTGMFDGVPVKYVSWSREKNLKGIIKGTGYLCSCDDCNHSKSLNAY 499

Query: 473  EFERHAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSW 294
            EFERHAG KTKHPNNHIYFENGKTIY +VQEL++TPQ MLFD IQT+TGSPINQK+FR W
Sbjct: 500  EFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQTVTGSPINQKNFRIW 559

Query: 293  KDSFLAATRELQRIYGKDE 237
            K S+ AATRELQRIYGKDE
Sbjct: 560  KASYQAATRELQRIYGKDE 578


>ref|XP_006341066.1| PREDICTED: uncharacterized protein LOC102588634 isoform X1 [Solanum
            tuberosum] gi|565348123|ref|XP_006341067.1| PREDICTED:
            uncharacterized protein LOC102588634 isoform X2 [Solanum
            tuberosum]
          Length = 558

 Score =  617 bits (1591), Expect = e-174
 Identities = 338/612 (55%), Positives = 409/612 (66%), Gaps = 16/612 (2%)
 Frame = -1

Query: 2015 LAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNNAPFLGISSPNISPWGNASSFQ 1836
            +AYDNSS +EPKR HQWFMDG E EL PNKKQA+EV N++ F G+ S NI+PW N   F 
Sbjct: 1    MAYDNSSTLEPKRSHQWFMDGIEPELLPNKKQAIEVPNHSSFSGLLSSNIAPWMNTPGFH 60

Query: 1835 SVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGRKVIEDPFTNDSAFGLSISHTL 1656
            SV G + ER FD+++AR+++FDD N++ SV   NMNM RKV+EDPF +DS+FGLSISHTL
Sbjct: 61   SVPGQYAERQFDNDSARSLSFDD-NSVPSVGIGNMNMSRKVMEDPFGSDSSFGLSISHTL 119

Query: 1655 EDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGDNSSISMAHSYNKAADNLISMG 1476
            ED RLGLNY GIRKVKVSQVK++EN M               +SM   Y +   N++   
Sbjct: 120  EDHRLGLNYSGIRKVKVSQVKEAENFMP--------------VSMGDIYTRGISNVMPTD 165

Query: 1475 LAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDNISLSHAYKGNDDTMSMGHTYG 1296
             A++   D                N I+MG  +  GD+++            MS+G T+ 
Sbjct: 166  HAFSKAED----------------NCIAMGLSFNGGDEHL------------MSLGDTFN 197

Query: 1295 KDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDKENDSSISMGHPFSKGDSNIIS 1116
            ++DN  ISMGQ          PF K DSN IS+G ++  +  S +SM HPF K +SNI  
Sbjct: 198  REDNSFISMGQ----------PFNKVDSNEISVGHSF--KESSLLSMSHPFCKDESNITM 245

Query: 1115 MGQSY-KENDSTITMSHSF-------------SKGDGNIISMGQSYNKGSENVISMGHIY 978
            + QS+ +E+DS I++SHSF             S  D NI S+GQ+ NK ++    M H Y
Sbjct: 246  LNQSFSREDDSAISVSHSFNDNNTAISMGQQFSNDDSNITSVGQTINKMADTNPPMSHCY 305

Query: 977  NKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGY--DDDTNPSGRLISSYDLL 804
            +K ++NAI  S +Y+K +NNNLSM   +  G+S IISFGG+  DDD N SGRLI SYDLL
Sbjct: 306  SKVDDNAISVSQTYSKVENNNLSMSQSFGNGESNIISFGGFNDDDDINSSGRLICSYDLL 365

Query: 803  MGQPSVQSSEASNEKELVESAHSLVGTGQVFSSGTETVXXXXXXXXXXXXXXPNSFPSNV 624
            M Q S Q S+    K LVES    V T     +G +                 NSFPSNV
Sbjct: 366  MSQSSGQQSDIVTGKRLVESNADTV-TSAAQMAGNKEFISKKEEQKATKKPPSNSFPSNV 424

Query: 623  RSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYEFERHAGCKT 444
            RSLLSTGMLDGVPVKYIAWSREKELRG+IKGSGYLCGCQSCNFSKAINAYEFERHAGCKT
Sbjct: 425  RSLLSTGMLDGVPVKYIAWSREKELRGIIKGSGYLCGCQSCNFSKAINAYEFERHAGCKT 484

Query: 443  KHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWKDSFLAATRE 264
            KHPNNHIYFENGKTIYGIVQELR+TPQ++LF+VIQTITGS INQKSFR WK+SFLAATRE
Sbjct: 485  KHPNNHIYFENGKTIYGIVQELRNTPQDLLFEVIQTITGSSINQKSFRIWKESFLAATRE 544

Query: 263  LQRIYGKDEGKR 228
            LQRIYGKDE +R
Sbjct: 545  LQRIYGKDEVRR 556


>ref|XP_004147718.1| PREDICTED: uncharacterized protein LOC101206313 [Cucumis sativus]
          Length = 582

 Score =  613 bits (1581), Expect = e-172
 Identities = 323/618 (52%), Positives = 413/618 (66%), Gaps = 4/618 (0%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MSFQ+K FW+ + +GCLTDGE+ YD+SSR+E KR HQWFMDGS  ELF +KKQA+E  N+
Sbjct: 1    MSFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRGHQWFMDGSAPELFSSKKQAIEAVNS 60

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
             P  G+   N+SPW N SSFQSV GHFT+RLF SE  RT+N  DR    SV   NM+MGR
Sbjct: 61   RPVPGVPHMNVSPWEN-SSFQSVPGHFTDRLFGSEPIRTVNLVDRG--ISVGNANMDMGR 117

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGD 1539
            K  E+ FTN+ + GLS+S ++EDP   LN+GGIRKVKV+QV+D +  M   +GH + RGD
Sbjct: 118  KEFENHFTNNPSVGLSMSQSIEDPSSCLNFGGIRKVKVNQVRDPDVGMPASLGHAYTRGD 177

Query: 1538 NSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDN 1359
            N +ISM   +NK  +N IS+G  YN++ D++ IS+G  Y + D +NFISMG  ++KGD +
Sbjct: 178  NCTISMGTGFNKNHENTISLGQTYNSR-DENAISVGPAYHKTD-DNFISMGHAFSKGDGS 235

Query: 1358 ISLSHAYKGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDK 1179
                         +++GH Y K DN ++SM Q          PF KGD + ISMGQ+Y+K
Sbjct: 236  F------------ITIGHNYSKGDNSILSMNQ----------PFDKGDDSFISMGQSYEK 273

Query: 1178 ENDSSISMGHPFSKGDSNIISMGQSYKENDSTITMSHSFSKGDGNIISMGQSYNKGSENV 999
               + IS    ++KG  N ISMG +Y             SK     ISM  S+NKG+++ 
Sbjct: 274  AEGNIISFA-SYNKGQENFISMGPAY-------------SKAGDTFISMASSFNKGNDDN 319

Query: 998  ISMGHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDDDT---NPSGR 828
            +SM   Y+K   + +     ++K D+  +SM H Y KG+S  ISFGG+DD+    NPSG 
Sbjct: 320  LSMAPTYDKVNSDIVHVGPKFDKADSGAVSMAHNYHKGESNTISFGGFDDENGTDNPSGG 379

Query: 827  LISSYDLLM-GQPSVQSSEASNEKELVESAHSLVGTGQVFSSGTETVXXXXXXXXXXXXX 651
            +ISSYDLLM  Q S Q+SE S  ++ V+    +   G +   G                 
Sbjct: 380  IISSYDLLMANQASAQASEVSTLRDSVDPNVEVNINGAIKVDGKIDTNSKSKEPRMSKKV 439

Query: 650  XPNSFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYE 471
             PNSFPSNV+SLLSTGMLDGVPVKY++WSREK L+G+IKG+GYLC C++CN SKA+NAYE
Sbjct: 440  PPNSFPSNVKSLLSTGMLDGVPVKYVSWSREKNLKGIIKGTGYLCSCENCNHSKALNAYE 499

Query: 470  FERHAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWK 291
            FERHAGCKTKHPNNHIYFENGKTIY +VQEL++TPQ MLFD IQ +TGSPINQK+FR WK
Sbjct: 500  FERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWK 559

Query: 290  DSFLAATRELQRIYGKDE 237
             S+ AAT ELQRIYGKDE
Sbjct: 560  ASYQAATLELQRIYGKDE 577


>ref|XP_004246472.1| PREDICTED: uncharacterized protein LOC101267439 [Solanum
            lycopersicum]
          Length = 558

 Score =  613 bits (1580), Expect = e-172
 Identities = 336/613 (54%), Positives = 410/613 (66%), Gaps = 17/613 (2%)
 Frame = -1

Query: 2015 LAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNNAPFLGISSPNISPWGNASSFQ 1836
            +AYDNSS +EPKR HQWFMDG E EL PNKKQA+EV N++ F G+ S NI+PW N   F 
Sbjct: 1    MAYDNSSTLEPKRSHQWFMDGIEPELLPNKKQAIEVPNHSSFSGLLSSNIAPWMNTPGFH 60

Query: 1835 SVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGRKVIEDPFTNDSAFGLSISHTL 1656
            SVSG + ER FD+++AR+++FDD N++ SV   NMNM RKV+EDPF +DS+FGLSISHTL
Sbjct: 61   SVSGQYAERQFDNDSARSLSFDD-NSVPSVGIGNMNMSRKVMEDPFGSDSSFGLSISHTL 119

Query: 1655 EDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGDNSSISMAHSYNKAADNLISMG 1476
            ED + GLNY GIRKVKVSQVK++EN               + +SM   Y +   N +   
Sbjct: 120  EDHKSGLNYSGIRKVKVSQVKEAENF--------------TPVSMGDIYTRGISNAMPTD 165

Query: 1475 LAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDNISLSHAYKGNDDTMSMGHTYG 1296
             A++   D                N I+MG  +  GD+++            MS+G T+ 
Sbjct: 166  HAFSKAED----------------NCIAMGLSFNGGDEHL------------MSLGDTFN 197

Query: 1295 KDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDKENDSSISMGHPFSKGDSNIIS 1116
            +++N  ISMGQ          PF K DSN IS+G ++++   SS+SM HPF K +SNII 
Sbjct: 198  REENSFISMGQ----------PFNKVDSNEISLGHSFNES--SSLSMSHPFCKDESNIIM 245

Query: 1115 MGQSY-KENDSTITMSHSFSKGDGNIISMGQSYNKGSENVISMG--------------HI 981
            + QS+ +E+DSTI++SHSF+  +   ISMGQ +     N+ S+G              H 
Sbjct: 246  LNQSFSREDDSTISVSHSFNDNN-TAISMGQQFGNDDSNITSVGQTINTMADTNPPISHC 304

Query: 980  YNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGY--DDDTNPSGRLISSYDL 807
            Y+K  +NAI  S +Y+K +NNNLSM   +  G+S IISFGG+  DDD N SGRLI SYDL
Sbjct: 305  YSKVNDNAISVSQTYSKVENNNLSMSQSFGNGESNIISFGGFNDDDDINSSGRLICSYDL 364

Query: 806  LMGQPSVQSSEASNEKELVESAHSLVGTGQVFSSGTETVXXXXXXXXXXXXXXPNSFPSN 627
            LM Q S Q S+    K LVES    V T    +   E +               NSFPSN
Sbjct: 365  LMSQSSGQKSDIVTGKRLVESNADTVTTVAQMAGSKEFISKKEEQKATKKPPS-NSFPSN 423

Query: 626  VRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYEFERHAGCK 447
            VRSLLSTGMLDGVPVKYIAWSREKELRG+IKGSGYLCGCQSCNFSKAINAYEFERHAGCK
Sbjct: 424  VRSLLSTGMLDGVPVKYIAWSREKELRGIIKGSGYLCGCQSCNFSKAINAYEFERHAGCK 483

Query: 446  TKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWKDSFLAATR 267
            TKHPNNHIYFENGKTIYGIVQELR+TPQ++LF+VIQTITGS INQKSFR WK+SFLAATR
Sbjct: 484  TKHPNNHIYFENGKTIYGIVQELRNTPQDLLFEVIQTITGSSINQKSFRIWKESFLAATR 543

Query: 266  ELQRIYGKDEGKR 228
            ELQRIYGKDE +R
Sbjct: 544  ELQRIYGKDEVRR 556


>ref|XP_006341068.1| PREDICTED: uncharacterized protein LOC102588634 isoform X3 [Solanum
            tuberosum]
          Length = 557

 Score =  610 bits (1574), Expect = e-172
 Identities = 337/612 (55%), Positives = 408/612 (66%), Gaps = 16/612 (2%)
 Frame = -1

Query: 2015 LAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNNAPFLGISSPNISPWGNASSFQ 1836
            +AYDNSS +EPKR HQWFMDG E EL PNKKQA+EV N++ F G+ S NI+PW N   F 
Sbjct: 1    MAYDNSSTLEPKRSHQWFMDGIEPELLPNKKQAIEVPNHSSFSGLLSSNIAPWMNTPGFH 60

Query: 1835 SVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGRKVIEDPFTNDSAFGLSISHTL 1656
            SV G + ER FD+++AR+++FDD N++ SV   NMNM RKV+EDPF +DS+FGLSISHTL
Sbjct: 61   SVPGQYAERQFDNDSARSLSFDD-NSVPSVGIGNMNMSRKVMEDPFGSDSSFGLSISHTL 119

Query: 1655 EDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGDNSSISMAHSYNKAADNLISMG 1476
            ED RLGLNY GIRKVKVSQVK++EN M               +SM   Y +   N++   
Sbjct: 120  EDHRLGLNYSGIRKVKVSQVKEAENFMP--------------VSMGDIYTRGISNVMPTD 165

Query: 1475 LAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDNISLSHAYKGNDDTMSMGHTYG 1296
             A++   D                N I+MG  +  GD+++            MS+G T+ 
Sbjct: 166  HAFSKAED----------------NCIAMGLSFNGGDEHL------------MSLGDTFN 197

Query: 1295 KDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDKENDSSISMGHPFSKGDSNIIS 1116
            ++DN  ISMGQ          PF K DSN IS+G ++  +  S +SM HPF K +SNI  
Sbjct: 198  REDNSFISMGQ----------PFNKVDSNEISVGHSF--KESSLLSMSHPFCKDESNITM 245

Query: 1115 MGQSY-KENDSTITMSHSF-------------SKGDGNIISMGQSYNKGSENVISMGHIY 978
            + QS+ +E+DS I++SHSF             S  D NI S+GQ+ NK ++    M H Y
Sbjct: 246  LNQSFSREDDSAISVSHSFNDNNTAISMGQQFSNDDSNITSVGQTINKMADTNPPMSHCY 305

Query: 977  NKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGY--DDDTNPSGRLISSYDLL 804
            +K ++NAI  S +Y+K +NNNLSM   +  G+S IISFGG+  DDD N SGRLI SYDLL
Sbjct: 306  SKVDDNAISVSQTYSKVENNNLSMSQSFGNGESNIISFGGFNDDDDINSSGRLICSYDLL 365

Query: 803  MGQPSVQSSEASNEKELVESAHSLVGTGQVFSSGTETVXXXXXXXXXXXXXXPNSFPSNV 624
            M Q S Q S+    K LVES    V T     +G +                 NSFPSNV
Sbjct: 366  MSQSSGQQSDIVTGKRLVESNADTV-TSAAQMAGNKEFISKKEEQKATKKPPSNSFPSNV 424

Query: 623  RSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYEFERHAGCKT 444
            RSLLSTGMLDGVPVKYIAWSRE ELRG+IKGSGYLCGCQSCNFSKAINAYEFERHAGCKT
Sbjct: 425  RSLLSTGMLDGVPVKYIAWSRE-ELRGIIKGSGYLCGCQSCNFSKAINAYEFERHAGCKT 483

Query: 443  KHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWKDSFLAATRE 264
            KHPNNHIYFENGKTIYGIVQELR+TPQ++LF+VIQTITGS INQKSFR WK+SFLAATRE
Sbjct: 484  KHPNNHIYFENGKTIYGIVQELRNTPQDLLFEVIQTITGSSINQKSFRIWKESFLAATRE 543

Query: 263  LQRIYGKDEGKR 228
            LQRIYGKDE +R
Sbjct: 544  LQRIYGKDEVRR 555


>ref|XP_007208469.1| hypothetical protein PRUPE_ppa004081mg [Prunus persica]
            gi|462404111|gb|EMJ09668.1| hypothetical protein
            PRUPE_ppa004081mg [Prunus persica]
          Length = 531

 Score =  603 bits (1554), Expect = e-169
 Identities = 330/615 (53%), Positives = 399/615 (64%), Gaps = 2/615 (0%)
 Frame = -1

Query: 2066 NKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNNAPFL 1887
            N+GFWM K +GCL +GE  YDNS R+EPKR HQWFMDG EVELFPNKKQAVEV NN  F 
Sbjct: 2    NQGFWMPKGTGCLNEGEALYDNSPRIEPKRSHQWFMDGPEVELFPNKKQAVEVPNNNLFS 61

Query: 1886 GISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGRKVIE 1707
            G+ + N+SPWGN  SF S SGHFTERLFDSE  R +NFDDRN I +   E MN+ RK   
Sbjct: 62   GMLNANVSPWGNVPSFHSFSGHFTERLFDSETDRAVNFDDRN-IPAAETEKMNLARK--- 117

Query: 1706 DPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGDNSSI 1527
                N+  FG                             +++   L M H  +    S  
Sbjct: 118  ---GNEDLFG-----------------------------NDSSFGLSMSHTLEDPRTSP- 144

Query: 1526 SMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDNISLS 1347
                                 N G    + + +  D E+    +S+G  Y +GD+   L+
Sbjct: 145  ---------------------NYGGFRKVKVSEVKDSENVMP-VSIGHAYNQGDNGAMLA 182

Query: 1346 -HAYKGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDKEND 1170
             H YK +D+T SMG  Y                        +KGD + ISM   Y++ ++
Sbjct: 183  AHVYKADDNTASMGLAY------------------------KKGDDSFISMSDNYNRADN 218

Query: 1169 SSISMGHPFSKGDSNIISMGQSYKENDSTITMSHSFSKGDGNIISMGQSYNKGSENVISM 990
            + ISMG PF+KGD NI S+GQ+YKE+++T++M  +F+KGD NIIS+GQ+YNK  E+ IS 
Sbjct: 219  NFISMGQPFNKGDENI-SIGQTYKESNNTLSMGQTFNKGDNNIISIGQTYNKVEESTISA 277

Query: 989  GHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDDDTNPSGRLISSYD 810
            GHIYNK E++ I   H+Y+K D+N LS+GH Y+  +STIISFGGYDDD   +   IS Y+
Sbjct: 278  GHIYNKGEDSTISMGHAYSKGDSNMLSIGHSYNNRESTIISFGGYDDDDAHTSA-ISGYE 336

Query: 809  LLMGQPSVQSSEASNEKELVES-AHSLVGTGQVFSSGTETVXXXXXXXXXXXXXXPNSFP 633
            LLMGQP    +EA NEKEL +S A +LV    + ++G E +              PN+FP
Sbjct: 337  LLMGQP-FPKTEAMNEKELGKSNADALVNLPHI-TAGNENISKKKVEQKMSKKVPPNNFP 394

Query: 632  SNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYEFERHAG 453
            SNVRSLLSTGMLDGVPVKY AWSREKEL+GVIKGSGYLCGCQSC+FSK INAYEFERHAG
Sbjct: 395  SNVRSLLSTGMLDGVPVKYTAWSREKELQGVIKGSGYLCGCQSCDFSKVINAYEFERHAG 454

Query: 452  CKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWKDSFLAA 273
            CKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLF+VIQTITGSPINQKSFR WK+SFLAA
Sbjct: 455  CKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFEVIQTITGSPINQKSFRLWKESFLAA 514

Query: 272  TRELQRIYGKDEGKR 228
            TRELQRIYGKDEGK+
Sbjct: 515  TRELQRIYGKDEGKQ 529


>ref|XP_007009432.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590563660|ref|XP_007009433.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508726345|gb|EOY18242.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508726346|gb|EOY18243.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 584

 Score =  595 bits (1534), Expect = e-167
 Identities = 322/633 (50%), Positives = 410/633 (64%), Gaps = 19/633 (3%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MSFQ+K FW+ +  GCLT+GE+ YDNSSR EPKR HQWFMD +  ELF NKKQA+E  N+
Sbjct: 1    MSFQHKSFWLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNS 60

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
             P  GI+  N+SPW NASSFQSVS   ++RLF SE  RT+N  DRN ++SV + NMNMGR
Sbjct: 61   RPVSGIADVNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRN-MSSVDSGNMNMGR 119

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGD 1539
            K  +D + N S+ GLS+SHT+EDP    ++GGIRKVKV+QV+DS N M   MGH + RG 
Sbjct: 120  KDFDDQYVNSSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGV 179

Query: 1538 NSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDN 1359
            NS++                             S+   Y + D NN IS+G  Y  GD+N
Sbjct: 180  NSTV-----------------------------SMSTVYSKSD-NNAISLGPTYGSGDEN 209

Query: 1358 ISLSHAYKGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDK 1179
                        T+S+G T+ K D + ISMG          H F K D + IS+G  Y+K
Sbjct: 210  ------------TISIGPTFTKADGNFISMG----------HTFNKRDGDFISVGHNYNK 247

Query: 1178 ENDSSISMGHPFSKGDSNIISMGQSYKENDSTI-TMSHSFSKGDGNIISMGQSYNKGSEN 1002
             N+S +S+G  F K D + ISMGQSY++ D+ + ++S S+ KG  N ISM  +Y K +E+
Sbjct: 248  GNESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNES 307

Query: 1001 VISMGHIYNKDEENAIPTSHSYNKRDNNN--------------LSMGHIYSKGDSTIISF 864
            +ISM   ++K+E+  IP   SY+K D N               LSMG  Y KG+S  ISF
Sbjct: 308  LISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISF 367

Query: 863  GGYDDD--TNPSGRLISSYDLLMG-QPSVQSSEASNEKELVE-SAHSLVGTGQVFSSGTE 696
            GG+ D+  TNPSG +IS YDLLM  Q S Q+SE  ++KELVE + +S V      +S T+
Sbjct: 368  GGFHDESETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTD 427

Query: 695  TVXXXXXXXXXXXXXXPNSFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLC 516
                            PN+FPSNV+SLLSTGMLDGV VKY++WSREK L+G I+G+GY+C
Sbjct: 428  A-NPKHKEPKTAKKVPPNNFPSNVKSLLSTGMLDGVAVKYVSWSREKSLKGYIQGTGYMC 486

Query: 515  GCQSCNFSKAINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQT 336
            GC+ C F KA+NAYEFERHA CKTKHPNNHIYFENGKTIY +VQEL++TPQ +LFDVIQ 
Sbjct: 487  GCKDCKFEKALNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQELLFDVIQN 546

Query: 335  ITGSPINQKSFRSWKDSFLAATRELQRIYGKDE 237
            +TGS INQK+FR WK S+ AATRELQRIYGKD+
Sbjct: 547  VTGSQINQKNFRIWKASYQAATRELQRIYGKDD 579


>gb|EXB65298.1| hypothetical protein L484_025377 [Morus notabilis]
          Length = 574

 Score =  590 bits (1520), Expect = e-165
 Identities = 317/610 (51%), Positives = 408/610 (66%), Gaps = 5/610 (0%)
 Frame = -1

Query: 2051 MAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNNAPFLGISSP 1872
            M K +GCL DGE+ YDNSSRME KR  QWFMD +  +LF NKKQAVE  N  P  G+   
Sbjct: 1    MPKDAGCLADGEMGYDNSSRMEQKRG-QWFMDANGPQLF-NKKQAVEAVNGRPISGVPHM 58

Query: 1871 NISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGRKVIEDPFTN 1692
            N+S W N S FQSV G FT+RLF SE  R  N  DRN + S+ + NMNMGRK  E  + N
Sbjct: 59   NVSQWDNTSGFQSVPGQFTDRLFGSEPVRNSNLVDRN-VQSIGSGNMNMGRKGFESQYGN 117

Query: 1691 DSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGDNSSISMAHS 1512
              + GLS+SHT+EDP   LN+GGIRKVKV+QV+DS+N+++  MG+ + R +N++ISM +S
Sbjct: 118  TPSVGLSMSHTIEDPSSCLNFGGIRKVKVNQVRDSDNILNPSMGNSYGRVENNTISMGNS 177

Query: 1511 YNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDNISLSHAYKG 1332
            YNK+ +N IS+  AYNN G+++ IS+G T+ + D  +FIS+G  + KGD N         
Sbjct: 178  YNKSDNNSISLAPAYNN-GEENTISMGPTFTKAD-ESFISIGHTFNKGDGNF-------- 227

Query: 1331 NDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDKENDSSISMG 1152
                +SMGH YGK DN ++SM Q          P+ KGD N ISMGQ+Y+K +   IS+G
Sbjct: 228  ----ISMGHNYGKGDNGLLSMSQ----------PYDKGDGNFISMGQSYEKGDGGVISLG 273

Query: 1151 HPFSKGDSNIISMGQSYKENDSTITMSHSFSKGDGNIISMGQSYNKGSENVISMGHIYN- 975
              ++KG    IS+G +Y              K + N I M  SY KG++++ISMG     
Sbjct: 274  TSYNKGHEEFISVGTTY-------------GKANNNFIQMAPSYIKGNDSIISMGPTPTY 320

Query: 974  KDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDD--DTNPSGRLISSYDLLM 801
            K + N +P   +Y+K D++NLSMG  Y+K +ST ISFGG+ D  +TNPSG +ISSYDLLM
Sbjct: 321  KADSNVVPMGPNYDKGDSSNLSMGQTYNKAESTTISFGGFHDEPETNPSGGIISSYDLLM 380

Query: 800  -GQPSVQSSEASNEKELVE-SAHSLVGTGQVFSSGTETVXXXXXXXXXXXXXXPNSFPSN 627
              Q S Q+ E S +K   + + +  V +       ++ +              PN+FPSN
Sbjct: 381  SNQNSAQTLEVSEQKNSADFNVNPSVNSIPQADLKSDNI-PKNKEPKTVKKAPPNNFPSN 439

Query: 626  VRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYEFERHAGCK 447
            V+SLLSTGM DGVPVKY++WSREK L+G+IKG+GYLC C  CN SK++NAYEFERHAGCK
Sbjct: 440  VKSLLSTGMFDGVPVKYVSWSREKNLKGIIKGTGYLCSCTDCNQSKSLNAYEFERHAGCK 499

Query: 446  TKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWKDSFLAATR 267
            TKHPNNHIYFENGKTIY +VQEL++TPQ MLFD IQ +TGSPIN K+FR WK S+ AATR
Sbjct: 500  TKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSPINHKNFRIWKASYQAATR 559

Query: 266  ELQRIYGKDE 237
            ELQRIYGKDE
Sbjct: 560  ELQRIYGKDE 569


>ref|XP_007009434.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508726347|gb|EOY18244.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 581

 Score =  587 bits (1514), Expect = e-165
 Identities = 318/629 (50%), Positives = 406/629 (64%), Gaps = 19/629 (3%)
 Frame = -1

Query: 2066 NKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNNAPFL 1887
            +K FW+ +  GCLT+GE+ YDNSSR EPKR HQWFMD +  ELF NKKQA+E  N+ P  
Sbjct: 2    HKSFWLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVS 61

Query: 1886 GISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGRKVIE 1707
            GI+  N+SPW NASSFQSVS   ++RLF SE  RT+N  DRN ++SV + NMNMGRK  +
Sbjct: 62   GIADVNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDRN-MSSVDSGNMNMGRKDFD 120

Query: 1706 DPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGDNSSI 1527
            D + N S+ GLS+SHT+EDP    ++GGIRKVKV+QV+DS N M   MGH + RG NS++
Sbjct: 121  DQYVNSSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTV 180

Query: 1526 SMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDNISLS 1347
                                         S+   Y + D NN IS+G  Y  GD+N    
Sbjct: 181  -----------------------------SMSTVYSKSD-NNAISLGPTYGSGDEN---- 206

Query: 1346 HAYKGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDKENDS 1167
                    T+S+G T+ K D + ISMG          H F K D + IS+G  Y+K N+S
Sbjct: 207  --------TISIGPTFTKADGNFISMG----------HTFNKRDGDFISVGHNYNKGNES 248

Query: 1166 SISMGHPFSKGDSNIISMGQSYKENDSTI-TMSHSFSKGDGNIISMGQSYNKGSENVISM 990
             +S+G  F K D + ISMGQSY++ D+ + ++S S+ KG  N ISM  +Y K +E++ISM
Sbjct: 249  ILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESLISM 308

Query: 989  GHIYNKDEENAIPTSHSYNKRDNNN--------------LSMGHIYSKGDSTIISFGGYD 852
               ++K+E+  IP   SY+K D N               LSMG  Y KG+S  ISFGG+ 
Sbjct: 309  APTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFGGFH 368

Query: 851  DD--TNPSGRLISSYDLLMG-QPSVQSSEASNEKELVE-SAHSLVGTGQVFSSGTETVXX 684
            D+  TNPSG +IS YDLLM  Q S Q+SE  ++KELVE + +S V      +S T+    
Sbjct: 369  DESETNPSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTDA-NP 427

Query: 683  XXXXXXXXXXXXPNSFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQS 504
                        PN+FPSNV+SLLSTGMLDGV VKY++WSREK L+G I+G+GY+CGC+ 
Sbjct: 428  KHKEPKTAKKVPPNNFPSNVKSLLSTGMLDGVAVKYVSWSREKSLKGYIQGTGYMCGCKD 487

Query: 503  CNFSKAINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGS 324
            C F KA+NAYEFERHA CKTKHPNNHIYFENGKTIY +VQEL++TPQ +LFDVIQ +TGS
Sbjct: 488  CKFEKALNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQELLFDVIQNVTGS 547

Query: 323  PINQKSFRSWKDSFLAATRELQRIYGKDE 237
             INQK+FR WK S+ AATRELQRIYGKD+
Sbjct: 548  QINQKNFRIWKASYQAATRELQRIYGKDD 576


>ref|XP_004167779.1| PREDICTED: uncharacterized LOC101206313 [Cucumis sativus]
          Length = 561

 Score =  580 bits (1495), Expect = e-162
 Identities = 309/595 (51%), Positives = 394/595 (66%), Gaps = 4/595 (0%)
 Frame = -1

Query: 2009 YDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNNAPFLGISSPNISPWGNASSFQSV 1830
            YD+SSR+E KR HQWFMDGS  ELF +KKQA+E  N+ P  G+   N+SPW N SSFQSV
Sbjct: 3    YDSSSRIETKRGHQWFMDGSAPELFSSKKQAIEAVNSRPVPGVPHMNVSPWEN-SSFQSV 61

Query: 1829 SGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGRKVIEDPFTNDSAFGLSISHTLED 1650
             GHFT+RLF SE  RT+N  DR    SV   NM+MGRK  E+ FTN+ + GLS+S ++ED
Sbjct: 62   PGHFTDRLFGSEPIRTVNLVDRG--ISVGNANMDMGRKEFENHFTNNPSVGLSMSQSIED 119

Query: 1649 PRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGDNSSISMAHSYNKAADNLISMGLA 1470
            P   LN+GGIRKVKV+QV+D +  M   +GH + RGDN +ISM   +NK  +N IS+G  
Sbjct: 120  PSSCLNFGGIRKVKVNQVRDPDVGMPASLGHAYTRGDNCTISMGTGFNKNHENTISLGQT 179

Query: 1469 YNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDNISLSHAYKGNDDTMSMGHTYGKD 1290
            YN++ D++ IS+G  Y + D +NFISMG  ++KGD +             +++GH Y K 
Sbjct: 180  YNSR-DENAISVGPAYHKTD-DNFISMGHAFSKGDGSF------------ITIGHNYSKG 225

Query: 1289 DNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDKENDSSISMGHPFSKGDSNIISMG 1110
            DN ++SM Q          PF KGD + ISMGQ+Y+K   + IS    ++KG  N ISMG
Sbjct: 226  DNSILSMNQ----------PFDKGDDSFISMGQSYEKAEGNIISFA-SYNKGQENFISMG 274

Query: 1109 QSYKENDSTITMSHSFSKGDGNIISMGQSYNKGSENVISMGHIYNKDEENAIPTSHSYNK 930
             +Y             SK     ISM  S+NKG+++ +SM   Y+K   + +     ++K
Sbjct: 275  PAY-------------SKAGDTFISMASSFNKGNDDNLSMAPTYDKVNSDIVHVGPKFDK 321

Query: 929  RDNNNLSMGHIYSKGDSTIISFGGYDDDT---NPSGRLISSYDLLM-GQPSVQSSEASNE 762
             D+  +SM H Y KG+S  ISFGG+DD+    NPSG +ISSYDLLM  Q S Q+SE S  
Sbjct: 322  ADSGAVSMAHNYHKGESNTISFGGFDDENGTDNPSGGIISSYDLLMANQASAQASEVSTL 381

Query: 761  KELVESAHSLVGTGQVFSSGTETVXXXXXXXXXXXXXXPNSFPSNVRSLLSTGMLDGVPV 582
            ++ V+    +   G +   G                  PNSFPSNV+SLLSTGMLDGVPV
Sbjct: 382  RDSVDPNVEVNINGAIKVDGKIDTNSKSKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPV 441

Query: 581  KYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYEFERHAGCKTKHPNNHIYFENGKT 402
            KY++WSREK L+G+IKG+GYLC C++CN SKA+NAYEFERHAGCKTKHPNNHIYFENGKT
Sbjct: 442  KYVSWSREKNLKGIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKT 501

Query: 401  IYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWKDSFLAATRELQRIYGKDE 237
            IY +VQEL++TPQ MLFD IQ +TGSPINQK+FR WK S+ AAT ELQRIYGKDE
Sbjct: 502  IYAVVQELKNTPQEMLFDAIQNVTGSPINQKNFRIWKASYQAATLELQRIYGKDE 556


>ref|XP_002309044.2| hypothetical protein POPTR_0006s08280g [Populus trichocarpa]
            gi|550335772|gb|EEE92567.2| hypothetical protein
            POPTR_0006s08280g [Populus trichocarpa]
          Length = 542

 Score =  572 bits (1475), Expect = e-160
 Identities = 329/623 (52%), Positives = 394/623 (63%), Gaps = 7/623 (1%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSS-RMEPKRPHQWFMDGSEVELFPNKKQAVEVQN 1902
            MSFQN+G WM K + C+ DGE+ YDNSS R+E KR HQW MDG E ELFPNKKQA+ V  
Sbjct: 3    MSFQNQGLWMVKGAECINDGEINYDNSSSRIESKRSHQWLMDG-EAELFPNKKQAIGVPT 61

Query: 1901 NAPFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMG 1722
            N  F G+ S N +PWG+ASSFQSVSGHFTER  DSE  R ++FDDR+ I SVS+  +N  
Sbjct: 62   NNLFTGMLSTNATPWGSASSFQSVSGHFTERFLDSETNRAVDFDDRS-IASVSSGKIN-- 118

Query: 1721 RKVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRG 1542
                            SI   L++                              H+F   
Sbjct: 119  ----------------SIGRKLDE------------------------------HLFGND 132

Query: 1541 DNSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDD 1362
                +SM H        L        N G    + +     +E  N  ++          
Sbjct: 133  SPFGLSMPHMLEDPRSGL--------NYGGIRKVKVSQV--KESENAMLA---------- 172

Query: 1361 NISLSHAYKGND-DTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTY 1185
              SL HA+   D +TMS+  +Y KD++ +ISMG  YN++          D N +S G TY
Sbjct: 173  --SLEHAFSRVDRNTMSVAQSYDKDES-IISMGLAYNKQ----------DENGMSTG-TY 218

Query: 1184 DKENDSSISMGHPFSKGDSNIISMGQSYKENDSTITMSHSFSKGDGNIISMGQSYNKGSE 1005
            D+EN+  ISM  P +KGD +I SM Q+YKEN + I M H+FS G+ N ISMGQ+Y+K  E
Sbjct: 219  DRENNIFISMRKPCNKGDEHI-SMSQTYKENGNAIPMGHTFSNGENNTISMGQTYSKVDE 277

Query: 1004 NVISMGH---IYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDDD-TNP 837
            N+ISMGH   IYNK     +    +Y+K  NN+LS+G   +KG+STIISFGGYDDD TN 
Sbjct: 278  NIISMGHMGHIYNKGNSGMVSVDQTYDKDGNNSLSIGQSRNKGESTIISFGGYDDDDTNC 337

Query: 836  SGRLISSYDLLMGQPSVQSSEASNEKELVES-AHSLVGTGQVFSSGTETVXXXXXXXXXX 660
            SG+L SSY+LLM QPS Q SE  N+ ELV+S   + V    V +S T+ V          
Sbjct: 338  SGKLTSSYELLMAQPSFQRSEVRNDNELVKSNVDTRVSALHVATSRTDNVSKKKDDIKTA 397

Query: 659  XXXXPNSFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAIN 480
                 N+FPSNVRSLLSTGMLDGVPVKY+AWS+EKELRGVIKGSGYLCGCQ+CNFSK +N
Sbjct: 398  KKLPSNNFPSNVRSLLSTGMLDGVPVKYVAWSQEKELRGVIKGSGYLCGCQTCNFSKVVN 457

Query: 479  AYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFR 300
            AYEFERHA CKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLF+VIQTITGSPINQKSFR
Sbjct: 458  AYEFERHANCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFEVIQTITGSPINQKSFR 517

Query: 299  SWKDSFLAATRELQRIYGKDEGK 231
             WK+SFLAATRELQRIYGKDEGK
Sbjct: 518  LWKESFLAATRELQRIYGKDEGK 540


>ref|XP_007009436.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508726349|gb|EOY18246.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 563

 Score =  564 bits (1453), Expect = e-158
 Identities = 309/612 (50%), Positives = 393/612 (64%), Gaps = 19/612 (3%)
 Frame = -1

Query: 2015 LAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNNAPFLGISSPNISPWGNASSFQ 1836
            + YDNSSR EPKR HQWFMD +  ELF NKKQA+E  N+ P  GI+  N+SPW NASSFQ
Sbjct: 1    MGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNSRPVSGIADVNVSPWHNASSFQ 60

Query: 1835 SVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGRKVIEDPFTNDSAFGLSISHTL 1656
            SVS   ++RLF SE  RT+N  DRN ++SV + NMNMGRK  +D + N S+ GLS+SHT+
Sbjct: 61   SVSSQLSDRLFGSEPLRTVNLVDRN-MSSVDSGNMNMGRKDFDDQYVNSSSAGLSMSHTI 119

Query: 1655 EDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGDNSSISMAHSYNKAADNLISMG 1476
            EDP    ++GGIRKVKV+QV+DS N M   MGH + RG NS++                 
Sbjct: 120  EDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGVNSTV----------------- 162

Query: 1475 LAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDNISLSHAYKGNDDTMSMGHTYG 1296
                        S+   Y + D NN IS+G  Y  GD+N            T+S+G T+ 
Sbjct: 163  ------------SMSTVYSKSD-NNAISLGPTYGSGDEN------------TISIGPTFT 197

Query: 1295 KDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDKENDSSISMGHPFSKGDSNIIS 1116
            K D + ISMG          H F K D + IS+G  Y+K N+S +S+G  F K D + IS
Sbjct: 198  KADGNFISMG----------HTFNKRDGDFISVGHNYNKGNESILSVGQAFEKEDGSFIS 247

Query: 1115 MGQSYKENDSTI-TMSHSFSKGDGNIISMGQSYNKGSENVISMGHIYNKDEENAIPTSHS 939
            MGQSY++ D+ + ++S S+ KG  N ISM  +Y K +E++ISM   ++K+E+  IP   S
Sbjct: 248  MGQSYEKGDANLMSLSSSYGKGQENFISMAPAYGKPNESLISMAPTFDKEEDTIIPMGSS 307

Query: 938  YNKRDNNN--------------LSMGHIYSKGDSTIISFGGYDDD--TNPSGRLISSYDL 807
            Y+K D N               LSMG  Y KG+S  ISFGG+ D+  TNPSG +IS YDL
Sbjct: 308  YHKADCNITAMAPTQGKGESSILSMGQNYKKGESNTISFGGFHDESETNPSGSIISGYDL 367

Query: 806  LMG-QPSVQSSEASNEKELVE-SAHSLVGTGQVFSSGTETVXXXXXXXXXXXXXXPNSFP 633
            LM  Q S Q+SE  ++KELVE + +S V      +S T+                PN+FP
Sbjct: 368  LMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTDA-NPKHKEPKTAKKVPPNNFP 426

Query: 632  SNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAYEFERHAG 453
            SNV+SLLSTGMLDGV VKY++WSREK L+G I+G+GY+CGC+ C F KA+NAYEFERHA 
Sbjct: 427  SNVKSLLSTGMLDGVAVKYVSWSREKSLKGYIQGTGYMCGCKDCKFEKALNAYEFERHAN 486

Query: 452  CKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSWKDSFLAA 273
            CKTKHPNNHIYFENGKTIY +VQEL++TPQ +LFDVIQ +TGS INQK+FR WK S+ AA
Sbjct: 487  CKTKHPNNHIYFENGKTIYAVVQELKNTPQELLFDVIQNVTGSQINQKNFRIWKASYQAA 546

Query: 272  TRELQRIYGKDE 237
            TRELQRIYGKD+
Sbjct: 547  TRELQRIYGKDD 558


>ref|XP_003518656.1| PREDICTED: uncharacterized protein LOC100787520 [Glycine max]
          Length = 581

 Score =  561 bits (1445), Expect = e-157
 Identities = 299/622 (48%), Positives = 407/622 (65%), Gaps = 8/622 (1%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MS+Q+K FWM + +GC+ +  + Y+NSSR+E KR H+WFMD  E E+F NKKQAVE  + 
Sbjct: 1    MSYQHKSFWMPRDAGCMAEENVGYENSSRVESKRSHKWFMDAGEPEIFSNKKQAVEAVSG 60

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
             P  G+S  N+S W N S F SV+  F++RLF S+ ART+N  D+ N+ S+ + N+NMGR
Sbjct: 61   RPVSGVSHANVSQWDNNSGFHSVTSQFSDRLFGSDLARTVNLVDK-NVPSIVSGNLNMGR 119

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVM-SLPMGHVFDRG 1542
            K  E  + ND + GLS+SH++ D    LN+GGIRKVKV+QV+DS+N M +  MGH + R 
Sbjct: 120  KDFEHQYGNDPSVGLSMSHSIADTSSCLNFGGIRKVKVNQVRDSDNCMPAASMGHSYSRE 179

Query: 1541 DNSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDD 1362
            DNS+IS+   YNK     IS+G  YNN  D                N I+MG   +K DD
Sbjct: 180  DNSTISVGAGYNKNDGGNISLGPTYNNVND----------------NTIAMGSRMSKTDD 223

Query: 1361 N-ISLSHAY-KGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQT 1188
            N +S++H + KG+   M +GH YGK D  ++SMGQ          PF KGD N ISMGQ+
Sbjct: 224  NLLSMAHTFNKGDGGFMLLGHNYGKGDESILSMGQ----------PFDKGDGNFISMGQS 273

Query: 1187 YDKENDSSISMGHPFSKGDSNIISMGQSY-KENDSTITMSHSFSKGDGNIISMGQSYNKG 1011
            Y+KE+ + IS+G  ++KG  N I +G +Y K  ++ IT++                Y+KG
Sbjct: 274  YEKEDGNLISLGTSYTKGHENFIPVGPTYGKSGENFITVA---------------PYDKG 318

Query: 1010 SENVISMGHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDDDTNPS- 834
            ++++IS+G  Y+K + N   T  S+++ D+++L +G  + KG ++ ISFGG+ DD  P+ 
Sbjct: 319  TDHIISLGPTYDKVDSNIASTIPSFDRGDSSSLPVGQNHHKGQNSSISFGGFHDDPGPNI 378

Query: 833  -GRLISSYDLLMG-QPSVQSSEASNEKELVE-SAHSLVGTGQVFSSGTETVXXXXXXXXX 663
               +IS YDLL+G Q S Q  ++ N  +L E +  SLV +  +    T+           
Sbjct: 379  PSGIISGYDLLIGSQNSAQGMDSQN--DLTETNTESLVNS--IPKPNTKNDIVKNKEPKT 434

Query: 662  XXXXXPNSFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAI 483
                  N+FPSNV+SLLSTG+ DGV VKY++WSREK L+G+IKG+GYLC C +CN SKA+
Sbjct: 435  TKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDNCNQSKAL 494

Query: 482  NAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSF 303
            NAYEFERHAG KTKHPNNHIYFENGKTIY +VQEL++TPQ+MLFD IQ +TGS INQK+F
Sbjct: 495  NAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQDMLFDAIQNVTGSTINQKNF 554

Query: 302  RSWKDSFLAATRELQRIYGKDE 237
            R WK S+ AATRELQRIYGKD+
Sbjct: 555  RIWKASYQAATRELQRIYGKDD 576


>ref|XP_007139258.1| hypothetical protein PHAVU_008G014500g [Phaseolus vulgaris]
            gi|593331666|ref|XP_007139259.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
            gi|593331672|ref|XP_007139262.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
            gi|561012391|gb|ESW11252.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
            gi|561012392|gb|ESW11253.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
            gi|561012395|gb|ESW11256.1| hypothetical protein
            PHAVU_008G014500g [Phaseolus vulgaris]
          Length = 583

 Score =  554 bits (1428), Expect = e-155
 Identities = 301/624 (48%), Positives = 402/624 (64%), Gaps = 10/624 (1%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MS+Q+K FWM + +GC+ +  + Y+NSSR+EPKR HQWFMD  E E+  NKKQAVE  + 
Sbjct: 1    MSYQHKSFWMPRDAGCMAEENVGYENSSRIEPKRSHQWFMDTGEPEIVSNKKQAVEDVSG 60

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
             P  G+S  N+S W  +S F SV G F++RLF S+ ART+N  D+N + S+ + NMNMGR
Sbjct: 61   RPISGVSHVNVSQWDTSSGFHSVMGQFSDRLFGSDLARTVNLVDKN-VPSIVSGNMNMGR 119

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGD 1539
            K  E  + ND + GLSISH++ DP   LN+GGIRKVKV+QV+DS+N M            
Sbjct: 120  KDFEHQYGNDPSVGLSISHSIADPSSCLNFGGIRKVKVNQVRDSDNCMP----------- 168

Query: 1538 NSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDN 1359
              S +M HSY+                             RED N+ IS+G  Y K D N
Sbjct: 169  --SAAMGHSYS-----------------------------RED-NSTISVGAGYNKNDGN 196

Query: 1358 ISLSHAYKG-NDDTMSMGHTYG-KDDNDVISMGQIYNREND----MGHPFRKGDSNIISM 1197
            ISL   Y   ND+T+ MG     K D++++S+   +N+ +     MGH + KGD +I+SM
Sbjct: 197  ISLGPTYNHRNDNTIGMGSRISSKTDDNLLSVAHNFNKGDGGFMLMGHNYGKGDESILSM 256

Query: 1196 GQTYDKENDSSISMGHPFSKGDSNIISMGQSY-KENDSTITMSHSFSKGDGNIISMGQSY 1020
            GQ +DK + + ISMG  + K D N+IS+G SY K ++S I++  +F K   N I++   Y
Sbjct: 257  GQPFDKGDGNFISMGQSYEKEDGNLISLGTSYSKGHESFISIGPTFGKSGENFITVAP-Y 315

Query: 1019 NKGSENVISMGHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDDD-- 846
            +KG++++ISMG  Y+K + N   T  SY++ D+++L +G  + KG S+ ISFGG+ DD  
Sbjct: 316  DKGTDHLISMGPTYDKVDSNIASTVPSYDRGDSSSLPVGQNHHKGQSSTISFGGFHDDPE 375

Query: 845  TNPSGRLISSYDLLMG-QPSVQSSEASNEKELVESAHSLVGTGQVFSSGTETVXXXXXXX 669
             NPSG +IS YDLL+G Q S Q  ++ N+     +  SLV +    ++  +TV       
Sbjct: 376  ANPSGGIISGYDLLIGNQNSAQGLDSQNDLSETNT-ESLVNSIPKLNTKNDTVVKNKEPK 434

Query: 668  XXXXXXXPNSFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSK 489
                    N+FPSNV+SLLSTG+ DGV VKY++WSREK L+G+IKG+GYLC C  C  SK
Sbjct: 435  TTTKKAPTNNFPSNVKSLLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDDCKQSK 494

Query: 488  AINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQK 309
            A+NAYEFERHAG KTKHPNNHIYFENGKTIY +VQEL++TPQ MLFD IQ +TGS INQK
Sbjct: 495  ALNAYEFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIQNVTGSTINQK 554

Query: 308  SFRSWKDSFLAATRELQRIYGKDE 237
            +FR WK S+ AATRELQRIYGKDE
Sbjct: 555  NFRIWKASYQAATRELQRIYGKDE 578


>ref|XP_007009437.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508726350|gb|EOY18247.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 561

 Score =  551 bits (1421), Expect = e-154
 Identities = 309/623 (49%), Positives = 391/623 (62%), Gaps = 9/623 (1%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MSFQ+K FW+ +  GCLT+GE+ YDNSSR EPKR HQWFMD +  ELF NKKQA+E  N+
Sbjct: 1    MSFQHKSFWLPRDGGCLTNGEMGYDNSSRTEPKRGHQWFMDAAAPELFSNKKQAIESVNS 60

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
             P  GI+  N+SPW NASSFQSVS   ++RLF SE  RT+N  DR N++SV + NMNMGR
Sbjct: 61   RPVSGIADVNVSPWHNASSFQSVSSQLSDRLFGSEPLRTVNLVDR-NMSSVDSGNMNMGR 119

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGD 1539
            K  +D + N S+ GLS+SHT+EDP    ++GGIRKVKV+QV+DS N M   MGH + RG 
Sbjct: 120  KDFDDQYVNSSSAGLSMSHTIEDPSSCFSFGGIRKVKVNQVRDSSNGMPASMGHTYSRGV 179

Query: 1538 NSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGD-D 1362
            NS++SM+  Y+K+ +N IS+G  Y + GD++ ISIG T+ + D  NFISMG  + K D D
Sbjct: 180  NSTVSMSTVYSKSDNNAISLGPTYGS-GDENTISIGPTFTKAD-GNFISMGHTFNKRDGD 237

Query: 1361 NISLSHAY-KGNDDTMSMGHTYGKDDNDVISMGQIYNREN----DMGHPFRKGDSNIISM 1197
             IS+ H Y KGN+  +S+G  + K+D   ISMGQ Y + +     +   + KG  N ISM
Sbjct: 238  FISVGHNYNKGNESILSVGQAFEKEDGSFISMGQSYEKGDANLMSLSSSYGKGQENFISM 297

Query: 1196 GQTYDKENDSSISMGHPFSKGDSNIISMGQSYKENDSTIT-MSHSFSKGDGNIISMGQSY 1020
               Y K N+S ISM   F K +  II MG SY + D  IT M+ +  KG+ +I+SMGQ+Y
Sbjct: 298  APAYGKPNESLISMAPTFDKEEDTIIPMGSSYHKADCNITAMAPTQGKGESSILSMGQNY 357

Query: 1019 NKGSENVISMGHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDDDTN 840
             KG  N IS G  +++ E                                        TN
Sbjct: 358  KKGESNTISFGGFHDESE----------------------------------------TN 377

Query: 839  PSGRLISSYDLLM-GQPSVQSSEASNEKELVE-SAHSLVGTGQVFSSGTETVXXXXXXXX 666
            PSG +IS YDLLM  Q S Q+SE  ++KELVE + +S V      +S T+          
Sbjct: 378  PSGSIISGYDLLMNNQNSAQASEVLSQKELVEVNPNSNVNNAPKHNSRTD-ANPKHKEPK 436

Query: 665  XXXXXXPNSFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKA 486
                  PN+FPSNV+SLLSTGMLDGV VKY++WSRE                       A
Sbjct: 437  TAKKVPPNNFPSNVKSLLSTGMLDGVAVKYVSWSRE-----------------------A 473

Query: 485  INAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKS 306
            +NAYEFERHA CKTKHPNNHIYFENGKTIY +VQEL++TPQ +LFDVIQ +TGS INQK+
Sbjct: 474  LNAYEFERHANCKTKHPNNHIYFENGKTIYAVVQELKNTPQELLFDVIQNVTGSQINQKN 533

Query: 305  FRSWKDSFLAATRELQRIYGKDE 237
            FR WK S+ AATRELQRIYGKD+
Sbjct: 534  FRIWKASYQAATRELQRIYGKDD 556


>ref|XP_003552682.1| PREDICTED: uncharacterized protein LOC100782217 [Glycine max]
          Length = 582

 Score =  549 bits (1415), Expect = e-153
 Identities = 299/619 (48%), Positives = 403/619 (65%), Gaps = 5/619 (0%)
 Frame = -1

Query: 2078 MSFQNKGFWMAKPSGCLTDGELAYDNSSRMEPKRPHQWFMDGSEVELFPNKKQAVEVQNN 1899
            MS+Q+K FWM + +GC+ +    Y+NSSR+EPKR HQWFMD  E E+F NKKQAVE  + 
Sbjct: 1    MSYQHKSFWMPRDAGCMAEENAGYENSSRIEPKRSHQWFMDTGEPEIFSNKKQAVEAVSG 60

Query: 1898 APFLGISSPNISPWGNASSFQSVSGHFTERLFDSEAARTINFDDRNNITSVSAENMNMGR 1719
             P  G+S  N+S W   S F SV+  F++RLF S+ ART+N  D+N + S+ + N+NMGR
Sbjct: 61   RPISGVSHANVSQWDTNSGFHSVTSQFSDRLFGSDLARTVNLVDKN-VPSIVSGNLNMGR 119

Query: 1718 KVIEDPFTNDSAFGLSISHTLEDPRLGLNYGGIRKVKVSQVKDSENVMSLPMGHVFDRGD 1539
            K  E  + ND + GLSISH++ DP   LN+GGIRKVKV+QV+DS+N M            
Sbjct: 120  KDFEHQYGNDPSVGLSISHSIADPSSCLNFGGIRKVKVNQVRDSDNCMP----------- 168

Query: 1538 NSSISMAHSYNKAADNLISMGLAYNNKGDDSIISIGDTYDREDTNNFISMGQPYTKGDDN 1359
              + SM  SY++  ++ IS+G  YN K D   IS+G TY+    +N I+MG   +K DDN
Sbjct: 169  --AASMGPSYSREDNSTISVGAGYN-KNDGDNISLGPTYNN-GYDNTIAMGSRISKTDDN 224

Query: 1358 ISLSHAYKGNDDTMSMGHTYGKDDNDVISMGQIYNRENDMGHPFRKGDSNIISMGQTYDK 1179
            +            +SM HT+ K D   + MG          H + KGD +I+SMGQ +DK
Sbjct: 225  L------------LSMAHTFSKGDGGFMLMG----------HNYGKGDESIVSMGQPFDK 262

Query: 1178 ENDSSISMGHPFSKGDSNIISMGQSY-KENDSTITMSHSFSKGDGNIISMGQSYNKGSEN 1002
             + + ISMG  + K D N+IS+G SY K ++S I +  ++ K   N I++   Y+KG+ +
Sbjct: 263  GDGNFISMGQSYEKEDGNLISLGTSYTKVHESFIPVGPTYGKSGENFITVAP-YDKGTNH 321

Query: 1001 VISMGHIYNKDEENAIPTSHSYNKRDNNNLSMGHIYSKGDSTIISFGGYDDDTNPS--GR 828
            +ISMG  Y+K + N   T  SY++ D+++L +G  + KG S+ ISFGG+ DD  P+  G 
Sbjct: 322  IISMGPTYDKVDSNIASTVPSYDRGDSSSLPVGQNHHKGQSSSISFGGFHDDPEPNTPGG 381

Query: 827  LISSYDLLMG-QPSVQSSEASNEKELVES-AHSLVGTGQVFSSGTETVXXXXXXXXXXXX 654
            +IS YDLL+G Q S Q  ++ N+  L E+   SLV +    ++  + V            
Sbjct: 382  IISGYDLLIGGQNSAQGLDSQND--LTETNTESLVNSIPKPNTKNDIVVKNKEPKTTKKA 439

Query: 653  XXPNSFPSNVRSLLSTGMLDGVPVKYIAWSREKELRGVIKGSGYLCGCQSCNFSKAINAY 474
               N+FPSNV+SLLSTG+ DGV VKY++WSREK L+G+IKG+GYLC C +CN SKA+NAY
Sbjct: 440  PT-NNFPSNVKSLLSTGIFDGVQVKYVSWSREKSLKGIIKGTGYLCSCDNCNQSKALNAY 498

Query: 473  EFERHAGCKTKHPNNHIYFENGKTIYGIVQELRSTPQNMLFDVIQTITGSPINQKSFRSW 294
            EFERHAG KTKHPNNHIYFENGKTIY +VQEL++T Q+MLFD IQ +TGS INQK+FR W
Sbjct: 499  EFERHAGAKTKHPNNHIYFENGKTIYAVVQELKNTNQDMLFDAIQNVTGSTINQKNFRIW 558

Query: 293  KDSFLAATRELQRIYGKDE 237
            K S+ AATRELQRIYGKDE
Sbjct: 559  KASYQAATRELQRIYGKDE 577

