BLASTX nr result

ID: Sinomenium22_contig00022395 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00022395
         (1428 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255...   407   e-111
ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608...   403   e-110
ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citr...   403   e-110
emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]   402   e-109
ref|XP_007043579.1| Uncharacterized protein isoform 5 [Theobroma...   387   e-105
ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma...   387   e-105
ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma...   387   e-105
ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [...   387   e-105
ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma...   387   e-105
ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prun...   384   e-104
ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310...   376   e-101
gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]     371   e-100
ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204...   370   e-100
ref|XP_002517932.1| conserved hypothetical protein [Ricinus comm...   357   6e-96
ref|XP_003540357.1| PREDICTED: uncharacterized protein LOC100779...   355   2e-95
ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600...   353   1e-94
ref|XP_003541913.1| PREDICTED: uncharacterized protein LOC100807...   353   1e-94
ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261...   351   4e-94
ref|XP_007149858.1| hypothetical protein PHAVU_005G104500g [Phas...   344   6e-92
ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Popu...   342   2e-91

>ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255678 [Vitis vinifera]
          Length = 520

 Score =  407 bits (1047), Expect = e-111
 Identities = 221/428 (51%), Positives = 274/428 (64%), Gaps = 27/428 (6%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MA +  LG    SASSLRE+AA+ T++  R +GH YVELREDGK   RFIFFCTLCL+PC
Sbjct: 1    MARRTELGFLKTSASSLREQAARTTLRNVRMQGHPYVELREDGK---RFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YS+  L+DHL+GN H ERYAAAKVTLL S PWPFNDGVLFF +S E +  L +   +  R
Sbjct: 58   YSESVLYDHLKGNLHSERYAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTR 117

Query: 848  LLDTNHATNIGNAKISNAKDDSYDGNNFCVD--------------------GGKD-DMLV 732
            LL T+   N  N  I    DD    NN  V+                    GG++ DM++
Sbjct: 118  LLGTHKNDN--NLAIVCHGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMI 175

Query: 731  PGVLCNDEITRLELKLIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKD-ESMAPEEH 555
            PGV+  DE+T LE++ +GFG+I AR  E +   K I ++WC W GK    D E++   +H
Sbjct: 176  PGVMIKDEVTELEVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGKEEPGDGETVMVPDH 235

Query: 554  DFGIVVFTYNYNLGRRNLVDDSNPLLVGSPCLNMENIEGKKRKKSFSDPEDVSESLSTQY 375
            DF +V F Y+YNLGR+ L DD   +L  SP        G+KRKKSFSDPED+SESLS QY
Sbjct: 236  DFAVVTFNYHYNLGRKGLFDDVISMLSSSPTEG----SGRKRKKSFSDPEDISESLSNQY 291

Query: 374  DXXXXXXXXXXXSPTSEWASSNRQKLT-----SSKSLRRELRQKQRLAAERMCDICQHRM 210
            D                       +L      SSK++RRELR++QR+AAERMCDICQH+M
Sbjct: 292  DSSGEDSLISNSPSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKM 351

Query: 209  LPGKDVAALLNMKTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISD 30
            LPGKDVA L+NMKTG+LVCSSRNV GAFHVFH SCLIHWILLCE EI+ N+L  PK+   
Sbjct: 352  LPGKDVATLMNMKTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRS 411

Query: 29   CKRRSRAK 6
             +R+S +K
Sbjct: 412  SRRKSGSK 419


>ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608093 isoform X5 [Citrus
            sinensis]
          Length = 508

 Score =  403 bits (1036), Expect = e-110
 Identities = 222/425 (52%), Positives = 274/425 (64%), Gaps = 24/425 (5%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MAG+  LG    SA SLRE+ A+ T+   RA+GHTYVELREDGK   RFIFFCTLCL+PC
Sbjct: 1    MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGK---RFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  LFDHL+GN H ER +AAKVTLLG  PWPFNDGVLFF +S E+  +  V     GR
Sbjct: 58   YSDLVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGR 117

Query: 848  LLDT-NHATNIG------NAKISNAKDDSYDGNNFCVDGGKD---------DMLVPGVLC 717
             LD  N+ +N+       + K++  +    D  +F  + G           D ++PGV  
Sbjct: 118  SLDYHNNDSNLAIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFL 177

Query: 716  NDEITRLELKLIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKDESMAP-EEHDFGIV 540
             DEI  L ++ IG G+I AR+++ +E   +I R+WC WLGK + +DE +    +HDF IV
Sbjct: 178  KDEIVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPEDEDIVEIPDHDFAIV 237

Query: 539  VFTYNYNLGRRNLVDDSNPLLVGSPCLNMENIEG--KKRKKSFSDPEDVSESLSTQYDXX 366
             F YNY+LGR+ L DD   LL  SP  + EN EG  +KRKKSFSDPEDVSESLS QYD  
Sbjct: 238  TFVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRKRKKSFSDPEDVSESLSKQYDSC 297

Query: 365  XXXXXXXXXSPTSEWASSNRQKLT-----SSKSLRRELRQKQRLAAERMCDICQHRMLPG 201
                     S +         +L      SSK+ RRE+R++QR+AAERMCDICQ ++LP 
Sbjct: 298  GEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPD 357

Query: 200  KDVAALLNMKTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKR 21
            KDVAALLN+KTG L CSSRN+NG FHVFHISCLIHWILLCE E+  N+   PKV    KR
Sbjct: 358  KDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKV----KR 413

Query: 20   RSRAK 6
            RSR K
Sbjct: 414  RSRRK 418


>ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citrus clementina]
            gi|567910083|ref|XP_006447355.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910085|ref|XP_006447356.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910087|ref|XP_006447357.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|568831767|ref|XP_006470130.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X1 [Citrus
            sinensis] gi|568831769|ref|XP_006470131.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X2 [Citrus
            sinensis] gi|568831771|ref|XP_006470132.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X3 [Citrus
            sinensis] gi|568831773|ref|XP_006470133.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X4 [Citrus
            sinensis] gi|557549965|gb|ESR60594.1| hypothetical
            protein CICLE_v10014904mg [Citrus clementina]
            gi|557549966|gb|ESR60595.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549967|gb|ESR60596.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549968|gb|ESR60597.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
          Length = 523

 Score =  403 bits (1036), Expect = e-110
 Identities = 222/425 (52%), Positives = 274/425 (64%), Gaps = 24/425 (5%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MAG+  LG    SA SLRE+ A+ T+   RA+GHTYVELREDGK   RFIFFCTLCL+PC
Sbjct: 1    MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGK---RFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  LFDHL+GN H ER +AAKVTLLG  PWPFNDGVLFF +S E+  +  V     GR
Sbjct: 58   YSDLVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGR 117

Query: 848  LLDT-NHATNIG------NAKISNAKDDSYDGNNFCVDGGKD---------DMLVPGVLC 717
             LD  N+ +N+       + K++  +    D  +F  + G           D ++PGV  
Sbjct: 118  SLDYHNNDSNLAIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFL 177

Query: 716  NDEITRLELKLIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKDESMAP-EEHDFGIV 540
             DEI  L ++ IG G+I AR+++ +E   +I R+WC WLGK + +DE +    +HDF IV
Sbjct: 178  KDEIVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPEDEDIVEIPDHDFAIV 237

Query: 539  VFTYNYNLGRRNLVDDSNPLLVGSPCLNMENIEG--KKRKKSFSDPEDVSESLSTQYDXX 366
             F YNY+LGR+ L DD   LL  SP  + EN EG  +KRKKSFSDPEDVSESLS QYD  
Sbjct: 238  TFVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRKRKKSFSDPEDVSESLSKQYDSC 297

Query: 365  XXXXXXXXXSPTSEWASSNRQKLT-----SSKSLRRELRQKQRLAAERMCDICQHRMLPG 201
                     S +         +L      SSK+ RRE+R++QR+AAERMCDICQ ++LP 
Sbjct: 298  GEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPD 357

Query: 200  KDVAALLNMKTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKR 21
            KDVAALLN+KTG L CSSRN+NG FHVFHISCLIHWILLCE E+  N+   PKV    KR
Sbjct: 358  KDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKV----KR 413

Query: 20   RSRAK 6
            RSR K
Sbjct: 414  RSRRK 418


>emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]
          Length = 896

 Score =  402 bits (1032), Expect = e-109
 Identities = 217/416 (52%), Positives = 268/416 (64%), Gaps = 27/416 (6%)
 Frame = -2

Query: 1172 SASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPCYSDYTLFDHLRG 993
            SASSLRE+AA+ T++  R +GH YVELREDGK   RFIFFCTLCL+PCYS+  L+DHL+G
Sbjct: 349  SASSLREQAARTTLRNVRMQGHPYVELREDGK---RFIFFCTLCLAPCYSESVLYDHLKG 405

Query: 992  NFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGRLLDTNHATNIGN 813
            N H ERYAAAKVTLL S PWPFNDGVLFF +S E +  L +   +  RLL T+   N  N
Sbjct: 406  NLHSERYAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLGTHKNDN--N 463

Query: 812  AKISNAKDDSYDGNNFCVD--------------------GGKD-DMLVPGVLCNDEITRL 696
              I    DD    NN  V+                    GG++ DM++PGV+  DE+T L
Sbjct: 464  LAIVCHGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVMIKDEVTEL 523

Query: 695  ELKLIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKD-ESMAPEEHDFGIVVFTYNYN 519
            E++ +GFG+I AR  E +   K I ++WC W GK    D E++   +HDF +V F Y+YN
Sbjct: 524  EVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGKEEPGDGETVMVPDHDFAVVTFNYHYN 583

Query: 518  LGRRNLVDDSNPLLVGSPCLNMENIEGKKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXX 339
            LGR+ L DD   +L  SP        G+KRKKSFSDPED+SESLS QYD           
Sbjct: 584  LGRKGLFDDVISMLSSSPTEG----SGRKRKKSFSDPEDISESLSNQYDSSGEDSLISNS 639

Query: 338  SPTSEWASSNRQKLT-----SSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNM 174
                        +L      SSK++RRELR++QR+AAERMCDICQH+MLPGKDVA L NM
Sbjct: 640  PSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLPGKDVATLXNM 699

Query: 173  KTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRRSRAK 6
            KTG+LVCSSRNV GAFHVFH SCLIHWILLCE EI+ N+L  PK+    +R+S +K
Sbjct: 700  KTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSRRKSGSK 755


>ref|XP_007043579.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508707514|gb|EOX99410.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 416

 Score =  387 bits (993), Expect = e-105
 Identities = 211/412 (51%), Positives = 268/412 (65%), Gaps = 11/412 (2%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MA +  LG+   SA SL+E+ A+ T+   R++GHTY+ELREDGK   RFIFFCTLCL+PC
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGK---RFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  L DHL+G+ H  R AAAKVTLLG+ PWPFNDGVLFF    E+  RL     +Q R
Sbjct: 58   YSDSVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNR 117

Query: 848  LLDT-NHATNIGNAKISNAKDDSYDGNNFCVDGGKDDMLVPGVLCNDEITRLELKLIGFG 672
            LL+  N+  N+   +   ++  SY  N  C   G  D+L+PGVL  DEI+ L+++ IGFG
Sbjct: 118  LLEFHNNDDNLAIVEYVGSEVSSYRKNVNC-RAGDSDLLIPGVLIKDEISDLKVRFIGFG 176

Query: 671  EIGARIVENNETQKKIKRMWCAWLGK-MNSKDESMAPEEHDFGIVVFTYNYNLGRRNLVD 495
            +I AR  E +    +I R+WC WLGK +   D+ +   +H F +V F YN +LGR+ L+D
Sbjct: 177  KIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVYNCDLGRKGLLD 236

Query: 494  DSNPLLVGSPCLNMENIEG--KKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXSPTSEW 321
            D   LL       +EN +   +KRKKSFSDPED+SESLS QYD             TS  
Sbjct: 237  DVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDSSASNI--TSSR 294

Query: 320  ASSNRQ-------KLTSSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKTGR 162
             + +R        +  SSK++RRELR++QR+AAERMCDICQ +MLP KDVA L+N+ TG+
Sbjct: 295  LALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVATLMNLNTGK 354

Query: 161  LVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRRSRAK 6
            LVCSSRNVNGAFHVFH SCLIHWILLCE+E   N    PK     +RRSR K
Sbjct: 355  LVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPK----ARRRSRRK 402


>ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508707513|gb|EOX99409.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 478

 Score =  387 bits (993), Expect = e-105
 Identities = 211/412 (51%), Positives = 268/412 (65%), Gaps = 11/412 (2%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MA +  LG+   SA SL+E+ A+ T+   R++GHTY+ELREDGK   RFIFFCTLCL+PC
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGK---RFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  L DHL+G+ H  R AAAKVTLLG+ PWPFNDGVLFF    E+  RL     +Q R
Sbjct: 58   YSDSVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNR 117

Query: 848  LLDT-NHATNIGNAKISNAKDDSYDGNNFCVDGGKDDMLVPGVLCNDEITRLELKLIGFG 672
            LL+  N+  N+   +   ++  SY  N  C   G  D+L+PGVL  DEI+ L+++ IGFG
Sbjct: 118  LLEFHNNDDNLAIVEYVGSEVSSYRKNVNC-RAGDSDLLIPGVLIKDEISDLKVRFIGFG 176

Query: 671  EIGARIVENNETQKKIKRMWCAWLGK-MNSKDESMAPEEHDFGIVVFTYNYNLGRRNLVD 495
            +I AR  E +    +I R+WC WLGK +   D+ +   +H F +V F YN +LGR+ L+D
Sbjct: 177  KIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVYNCDLGRKGLLD 236

Query: 494  DSNPLLVGSPCLNMENIEG--KKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXSPTSEW 321
            D   LL       +EN +   +KRKKSFSDPED+SESLS QYD             TS  
Sbjct: 237  DVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDSSASNI--TSSR 294

Query: 320  ASSNRQ-------KLTSSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKTGR 162
             + +R        +  SSK++RRELR++QR+AAERMCDICQ +MLP KDVA L+N+ TG+
Sbjct: 295  LALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVATLMNLNTGK 354

Query: 161  LVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRRSRAK 6
            LVCSSRNVNGAFHVFH SCLIHWILLCE+E   N    PK     +RRSR K
Sbjct: 355  LVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPK----ARRRSRRK 402


>ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508707512|gb|EOX99408.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 470

 Score =  387 bits (993), Expect = e-105
 Identities = 211/412 (51%), Positives = 268/412 (65%), Gaps = 11/412 (2%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MA +  LG+   SA SL+E+ A+ T+   R++GHTY+ELREDGK   RFIFFCTLCL+PC
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGK---RFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  L DHL+G+ H  R AAAKVTLLG+ PWPFNDGVLFF    E+  RL     +Q R
Sbjct: 58   YSDSVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNR 117

Query: 848  LLDT-NHATNIGNAKISNAKDDSYDGNNFCVDGGKDDMLVPGVLCNDEITRLELKLIGFG 672
            LL+  N+  N+   +   ++  SY  N  C   G  D+L+PGVL  DEI+ L+++ IGFG
Sbjct: 118  LLEFHNNDDNLAIVEYVGSEVSSYRKNVNC-RAGDSDLLIPGVLIKDEISDLKVRFIGFG 176

Query: 671  EIGARIVENNETQKKIKRMWCAWLGK-MNSKDESMAPEEHDFGIVVFTYNYNLGRRNLVD 495
            +I AR  E +    +I R+WC WLGK +   D+ +   +H F +V F YN +LGR+ L+D
Sbjct: 177  KIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVYNCDLGRKGLLD 236

Query: 494  DSNPLLVGSPCLNMENIEG--KKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXSPTSEW 321
            D   LL       +EN +   +KRKKSFSDPED+SESLS QYD             TS  
Sbjct: 237  DVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDSSASNI--TSSR 294

Query: 320  ASSNRQ-------KLTSSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKTGR 162
             + +R        +  SSK++RRELR++QR+AAERMCDICQ +MLP KDVA L+N+ TG+
Sbjct: 295  LALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVATLMNLNTGK 354

Query: 161  LVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRRSRAK 6
            LVCSSRNVNGAFHVFH SCLIHWILLCE+E   N    PK     +RRSR K
Sbjct: 355  LVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPK----ARRRSRRK 402


>ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508707511|gb|EOX99407.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 481

 Score =  387 bits (993), Expect = e-105
 Identities = 211/412 (51%), Positives = 268/412 (65%), Gaps = 11/412 (2%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MA +  LG+   SA SL+E+ A+ T+   R++GHTY+ELREDGK   RFIFFCTLCL+PC
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGK---RFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  L DHL+G+ H  R AAAKVTLLG+ PWPFNDGVLFF    E+  RL     +Q R
Sbjct: 58   YSDSVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNR 117

Query: 848  LLDT-NHATNIGNAKISNAKDDSYDGNNFCVDGGKDDMLVPGVLCNDEITRLELKLIGFG 672
            LL+  N+  N+   +   ++  SY  N  C   G  D+L+PGVL  DEI+ L+++ IGFG
Sbjct: 118  LLEFHNNDDNLAIVEYVGSEVSSYRKNVNC-RAGDSDLLIPGVLIKDEISDLKVRFIGFG 176

Query: 671  EIGARIVENNETQKKIKRMWCAWLGK-MNSKDESMAPEEHDFGIVVFTYNYNLGRRNLVD 495
            +I AR  E +    +I R+WC WLGK +   D+ +   +H F +V F YN +LGR+ L+D
Sbjct: 177  KIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVYNCDLGRKGLLD 236

Query: 494  DSNPLLVGSPCLNMENIEG--KKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXSPTSEW 321
            D   LL       +EN +   +KRKKSFSDPED+SESLS QYD             TS  
Sbjct: 237  DVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDSSASNI--TSSR 294

Query: 320  ASSNRQ-------KLTSSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKTGR 162
             + +R        +  SSK++RRELR++QR+AAERMCDICQ +MLP KDVA L+N+ TG+
Sbjct: 295  LALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVATLMNLNTGK 354

Query: 161  LVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRRSRAK 6
            LVCSSRNVNGAFHVFH SCLIHWILLCE+E   N    PK     +RRSR K
Sbjct: 355  LVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPK----ARRRSRRK 402


>ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508707510|gb|EOX99406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 517

 Score =  387 bits (993), Expect = e-105
 Identities = 211/412 (51%), Positives = 268/412 (65%), Gaps = 11/412 (2%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MA +  LG+   SA SL+E+ A+ T+   R++GHTY+ELREDGK   RFIFFCTLCL+PC
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGK---RFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  L DHL+G+ H  R AAAKVTLLG+ PWPFNDGVLFF    E+  RL     +Q R
Sbjct: 58   YSDSVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNR 117

Query: 848  LLDT-NHATNIGNAKISNAKDDSYDGNNFCVDGGKDDMLVPGVLCNDEITRLELKLIGFG 672
            LL+  N+  N+   +   ++  SY  N  C   G  D+L+PGVL  DEI+ L+++ IGFG
Sbjct: 118  LLEFHNNDDNLAIVEYVGSEVSSYRKNVNC-RAGDSDLLIPGVLIKDEISDLKVRFIGFG 176

Query: 671  EIGARIVENNETQKKIKRMWCAWLGK-MNSKDESMAPEEHDFGIVVFTYNYNLGRRNLVD 495
            +I AR  E +    +I R+WC WLGK +   D+ +   +H F +V F YN +LGR+ L+D
Sbjct: 177  KIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVYNCDLGRKGLLD 236

Query: 494  DSNPLLVGSPCLNMENIEG--KKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXSPTSEW 321
            D   LL       +EN +   +KRKKSFSDPED+SESLS QYD             TS  
Sbjct: 237  DVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDSSASNI--TSSR 294

Query: 320  ASSNRQ-------KLTSSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKTGR 162
             + +R        +  SSK++RRELR++QR+AAERMCDICQ +MLP KDVA L+N+ TG+
Sbjct: 295  LALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVATLMNLNTGK 354

Query: 161  LVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRRSRAK 6
            LVCSSRNVNGAFHVFH SCLIHWILLCE+E   N    PK     +RRSR K
Sbjct: 355  LVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPK----ARRRSRRK 402


>ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prunus persica]
            gi|462394196|gb|EMJ00100.1| hypothetical protein
            PRUPE_ppa004741mg [Prunus persica]
          Length = 493

 Score =  384 bits (986), Expect = e-104
 Identities = 216/444 (48%), Positives = 269/444 (60%), Gaps = 43/444 (9%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MAG+  LG    SASSLRE+A +  ++  R++GHTYVELREDGK   +FIFFCTLCL+PC
Sbjct: 1    MAGRWELGFPKTSASSLREQATRTILRNVRSQGHTYVELREDGK---KFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  LFDHL+GN H++R AAAKVTLL   PWPFNDGV FFH+  E +  LV+   ++ R
Sbjct: 58   YSDKVLFDHLKGNLHKDRLAAAKVTLLRPNPWPFNDGVAFFHNPDETDKHLVITDGNKFR 117

Query: 848  LLDTNHATN------IGNAKISNAKDD-----------------------SYDGNNFCVD 756
            +L++    N       G   ISN  +                        S    N   +
Sbjct: 118  MLESPDDENNLAIVKYGENLISNGNEHVGTDGLECNGSLDFPRVRSNFKFSCSNENSTAN 177

Query: 755  GGKDDMLVPGVLCNDEITRLELKLIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKDE 576
                 +++P VL  D++T +E K +G G+I AR +E ++  K I R+WC WLGK    +E
Sbjct: 178  EVNSSVVIPSVLVRDDVTDIEAKKVGLGQIAARFLEKDKVSKGIGRIWCEWLGKKAIGNE 237

Query: 575  -SMAPEEHDFGIVVFTYNYNLGRRNLVDDSNPLLVGSPCLNMENIEGK--KRKKSFSDPE 405
              +   EHDF +V F+YN +LGRR L+DD   LL  SP +  EN EG   KRKKSFSDPE
Sbjct: 238  YHLKVPEHDFAVVTFSYNIDLGRRGLLDDVKMLLSSSPSVETENGEGSGSKRKKSFSDPE 297

Query: 404  DVSESLSTQYDXXXXXXXXXXXSPTSEWASSN-----------RQKLTSSKSLRRELRQK 258
            D+SESLS QYD              S  ASS              +   +KS+RRELR++
Sbjct: 298  DISESLSNQYDSCGEDSS------ASSGASSKLLLDRYDDQLLHTRFILNKSIRRELRRQ 351

Query: 257  QRLAAERMCDICQHRMLPGKDVAALLNMKTGRLVCSSRNVNGAFHVFHISCLIHWILLCE 78
            QRLA  RMCDICQ RM+PGKDV+AL+N+KTGRL CSSRNVNGAFHVFH SCLIHWILLCE
Sbjct: 352  QRLALGRMCDICQQRMIPGKDVSALINLKTGRLACSSRNVNGAFHVFHTSCLIHWILLCE 411

Query: 77   LEIWMNKLAMPKVISDCKRRSRAK 6
            +EI     A     S  +RRSR K
Sbjct: 412  VEI-----ANQSTNSKVRRRSRRK 430


>ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310040 [Fragaria vesca
            subsp. vesca]
          Length = 525

 Score =  376 bits (966), Expect = e-101
 Identities = 208/433 (48%), Positives = 274/433 (63%), Gaps = 32/433 (7%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MAG+  +GV   +A SLRE+A +  ++  R++GH+YVE+REDGK   +FIFFCTLCL+PC
Sbjct: 1    MAGRWDVGVPKTNACSLREQATRTILRNVRSQGHSYVEVREDGK---KFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  LFDHL+GN H ER AAAKVTLL   PWPFNDGV+FF++S E +  +V P  ++ R
Sbjct: 58   YSDKVLFDHLKGNLHNERLAAAKVTLLRPNPWPFNDGVVFFNNSYETDKGVVTPDDNKCR 117

Query: 848  LLDTNHATNI-------GNAKISNAKDDSYDG----------------NNFCVDGGKDDM 738
            +L+++   N        GN K +       DG                 +   DG K  +
Sbjct: 118  MLESHDNENNLAIVKYGGNLKTNGYDHCGVDGLECNEYIDLQGLQSNVGDSTADGAKSSV 177

Query: 737  LVPGVLCNDEITRLELKLIGFGEIGARIVENNETQKKIKRMWCAWLG--KMNSKDESMAP 564
            ++PG++  DEIT LE++ +G GEI AR +  +     I R+WC WLG   ++S+D    P
Sbjct: 178  VIPGIVVRDEITDLEVREVGLGEIAARFLGKDG----IGRIWCEWLGVKSIDSEDLCNVP 233

Query: 563  EEHDFGIVVFTYNYNLGRRNLVDDSNPLLVGSPCLNMENIEGK--KRKKSFSDPEDVSES 390
            E HDF +V F+YN +LGR+ L+DD   LL  SP +   N EG   KRKKSFSDPED+S+S
Sbjct: 234  E-HDFAVVTFSYNIDLGRKGLLDDVRMLLSSSPTIESGNGEGTGCKRKKSFSDPEDISDS 292

Query: 389  LSTQYDXXXXXXXXXXXSPTSEWASSNRQKLTSS-----KSLRRELRQKQRLAAERMCDI 225
            LS QY+           + +         +L ++     KS+RRELR++QRLA+ RMCDI
Sbjct: 293  LSNQYESFGEDSSASSGTASRLLLDHYDDQLLNTRFILNKSIRRELRRQQRLASGRMCDI 352

Query: 224  CQHRMLPGKDVAALLNMKTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMP 45
            CQ RMLPGKDVA L+N+KTG+L CSSRNVNGAFHVFH SCLIHWILLCE+E+  N+    
Sbjct: 353  CQQRMLPGKDVATLMNLKTGKLACSSRNVNGAFHVFHTSCLIHWILLCEVEVITNQ---- 408

Query: 44   KVISDCKRRSRAK 6
               S  +RRSR K
Sbjct: 409  NTGSKARRRSRRK 421


>gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]
          Length = 638

 Score =  371 bits (953), Expect = e-100
 Identities = 198/420 (47%), Positives = 263/420 (62%), Gaps = 25/420 (5%)
 Frame = -2

Query: 1190 LGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPCYSDYTL 1011
            L VS  ++ SL+++A +  ++  R++GHTYVELREDGK   + IFFCTLCL+PCYSD  L
Sbjct: 15   LAVSKTTSCSLKDQAKRTILRNVRSQGHTYVELREDGK---KSIFFCTLCLAPCYSDCVL 71

Query: 1010 FDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGRLLDTNH 831
            FDHL+GN H +R + AKVTLLG  PWPFNDGV+FF++  E +   V+   +Q RLL++  
Sbjct: 72   FDHLKGNLHNQRLSTAKVTLLGPNPWPFNDGVVFFNNPTENDDDTVISNGNQSRLLESQD 131

Query: 830  ATN------------------IGNAKISNAKDDSYDGNNFCVDGGKDDMLVPGVLCNDEI 705
            + N                  I   ++ +  ++     N    G    +L+PGV   DEI
Sbjct: 132  SENNLAIVTYGENLESCANGHIMVDELGHQNENPDSAGNLAGSGENCAVLIPGVRAGDEI 191

Query: 704  TRLELKLIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKDES-MAPEEHDFGIVVFTY 528
              +E++ +G+G I  R  E +     I R+WC WLGK   +DE  +   EHDF IV F+Y
Sbjct: 192  ANVEVREVGYGLISVRFREKDGVSNDISRIWCEWLGKKTIEDEDFLKVPEHDFAIVTFSY 251

Query: 527  N-YNLGRRNLVDDSNPLLVGSPCLNMEN--IEGKKRKKSFSDPEDVSESLSTQYDXXXXX 357
            N ++LGR  L DD   LL  SP   M+N  +  +KR+KSFSDPED SE+LS QYD     
Sbjct: 252  NNFSLGRMGLHDDVKALLCSSPAAEMQNGDVSSRKRRKSFSDPEDSSENLSNQYDSCGED 311

Query: 356  XXXXXXSPT--SEWASSNRQ-KLTSSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAA 186
                  +     ++     Q +  S+K++RRELR++QR+AAERMCDICQH+MLPGKDVA 
Sbjct: 312  SSASAVTSLMLDQYDDQLLQTRFISNKAIRRELRRQQRIAAERMCDICQHKMLPGKDVAT 371

Query: 185  LLNMKTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRRSRAK 6
            L+N+KTGRL CSSRN NGAFH+FH SCLIHW+LLCE+E   N+   PKV    KRRSR K
Sbjct: 372  LMNVKTGRLACSSRNTNGAFHLFHTSCLIHWVLLCEVEKCTNQSEAPKV----KRRSRRK 427


>ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204451 [Cucumis sativus]
            gi|449475785|ref|XP_004154550.1| PREDICTED:
            uncharacterized LOC101204451 [Cucumis sativus]
          Length = 525

 Score =  370 bits (951), Expect = e-100
 Identities = 201/426 (47%), Positives = 273/426 (64%), Gaps = 25/426 (5%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MA +  LG    ++ SLRE+AA+  ++  R++GHTYVELRE+GK   +FIFFCTLCL+PC
Sbjct: 1    MARRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELRENGK---KFIFFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  LF HL+G  H ER +AAK+TLLG  PWPF+DGVLFFH   E ++++ +   +  R
Sbjct: 58   YSDSVLFSHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPIEGDNQVGISNDNHER 117

Query: 848  LLDTNHATN-------IGNAKISNAKDDSYDGNNFCV---------DGGKD-DMLVPGVL 720
            LL+ N+  N       +GN+K +  + + ++GN   V         DGG+   +++PGVL
Sbjct: 118  LLEYNNNDNNLAIVKYVGNSKGNGNRQEEFNGNMRNVEDCSFENLNDGGESCPLVIPGVL 177

Query: 719  CNDEITRLELKLIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKDESMAP-EEHDFGI 543
              +EI+ ++++ +G+G+I AR  E +     + R+WC WLGK+N   E+M    EH++ I
Sbjct: 178  IKEEISDIKVRELGYGQIAARFTEKDGIFSGVSRIWCEWLGKVNDGIENMVKVPEHNYAI 237

Query: 542  VVFTYNYNLGRRNLVDDSNPLLVGSPCLNMENIEGK--KRKKSFSDPEDVSESLSTQYDX 369
            + FTYN +LGR+ L+DD   LL  SP    +N E +  KRKKSFSDPED S S+S QYD 
Sbjct: 238  ITFTYNVDLGRKGLLDDVKLLLSSSPGAESQNDENRQVKRKKSFSDPEDGSLSMSPQYDS 297

Query: 368  XXXXXXXXXXSPTSEWASSNRQKLTSS-----KSLRRELRQKQRLAAERMCDICQHRMLP 204
                        +S        ++ S+     K++RRELR++QRLAAERMCDICQ ++L 
Sbjct: 298  SGEDSSASNCVMSSLSLDGYDDQILSTTVMLNKAVRRELRRQQRLAAERMCDICQQKILT 357

Query: 203  GKDVAALLNMKTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCK 24
             KDVA LLNMKTGRL CSSRNVNG FHVFH SCLIHWILLCE EI +  L   KV    +
Sbjct: 358  HKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEISVKDLGGSKVRRRYR 417

Query: 23   RRSRAK 6
            R+ + K
Sbjct: 418  RKKKTK 423


>ref|XP_002517932.1| conserved hypothetical protein [Ricinus communis]
            gi|223542914|gb|EEF44450.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 509

 Score =  357 bits (917), Expect = 6e-96
 Identities = 196/416 (47%), Positives = 255/416 (61%), Gaps = 15/416 (3%)
 Frame = -2

Query: 1208 MAGQAGLGVS-TVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSP 1032
            MAG+  LG + T  A+SL+E+ A+ T+   R++GH YVELREDGK   RFIFFCTLCL+P
Sbjct: 1    MAGRWELGFTKTGGANSLKEQLARTTLNNVRSKGHPYVELREDGK---RFIFFCTLCLAP 57

Query: 1031 CYSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQG 852
            CYSD  LFDHL+GN H ER + A +TLL   PWPF+DGV FF  S E   +LV+   ++ 
Sbjct: 58   CYSDAVLFDHLKGNLHTERLSTATLTLLKENPWPFSDGVHFFDTSSENEKQLVIKNDNES 117

Query: 851  RLLDTNHATNIGNAKISNAKDDSYDGNNFCVDGGKD-----DMLVPGVLCNDEITRLELK 687
            R    N  +++   K   +   + D +  C     D     D+L+ GVL  D+I+ L+ +
Sbjct: 118  R---GNGNSSLAIVKYGGSLKPTGDEDTGCNKDANDNGRISDLLIQGVLVKDDISDLQAR 174

Query: 686  LIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKDESMAPE-EHDFGIVVFTYNYNLGR 510
             +G+G IGAR++E +     I R+WC WLGK    D   A   +H+F +V F YNY+LGR
Sbjct: 175  FMGYGRIGARLIEKDGNSNDISRIWCEWLGKNTPCDLDKAKVLDHEFAVVTFAYNYDLGR 234

Query: 509  RNLVDDSNPLLVGSPCLNMENIEG--KKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXS 336
            + L+DD   LL  SP    +N  G  +KRKKSFSDPEDVSES S QYD            
Sbjct: 235  KGLLDDVKLLLSSSPVQESDNQGGTNRKRKKSFSDPEDVSESFSNQYDSSGEESLTSIGG 294

Query: 335  PTSEWASSNRQ------KLTSSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNM 174
            P +              K+ SSK+LRRELR++  +AAERMCDICQ ++LP KDVA L+NM
Sbjct: 295  PPTRLLLDRHDDQFLHSKVISSKTLRRELRRQHHIAAERMCDICQQKILPEKDVATLVNM 354

Query: 173  KTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRRSRAK 6
             TG+L CSSRN  G +HVFH SCLIHWILL E E+  N+   PK     +R+SR K
Sbjct: 355  NTGKLACSSRNTYGQYHVFHTSCLIHWILLSEYEMARNQSVSPK----GRRKSRRK 406


>ref|XP_003540357.1| PREDICTED: uncharacterized protein LOC100779572 isoform X1 [Glycine
            max] gi|571494415|ref|XP_006592839.1| PREDICTED:
            uncharacterized protein LOC100779572 isoform X2 [Glycine
            max]
          Length = 501

 Score =  355 bits (912), Expect = 2e-95
 Identities = 197/411 (47%), Positives = 259/411 (63%), Gaps = 14/411 (3%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MAG+  LG      S+ +E+AA+  ++  R++GH YVELRE+GK   +FI+FCTLCL+PC
Sbjct: 1    MAGKLELGPPKSDVSNPKEQAARKILKIVRSQGHPYVELRENGK---KFIYFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  LFDHL+GN H+ER +AAKVTLLG KPWPFNDG++FF  S E +  L V    Q R
Sbjct: 58   YSDDVLFDHLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSTESHKELEVADSYQNR 117

Query: 848  LLDTNH------ATNIGNAKISNAKDDSYDGNNFCVDGGKDD---MLVPGVLCNDEITRL 696
            LL  N           G+   SNAK  S       +DG +DD   +++P +L  DEI  +
Sbjct: 118  LLKFNDNDVSLAIVKFGDGVQSNAKPRS-------IDGMQDDEYALVIPNLLIGDEIFDV 170

Query: 695  ELKLIGFGEIGARIVENNETQKKIKRMWCAWLGK-MNSKDESMAPEEHDFGIVVFTYNYN 519
            +++ +G G+I AR +E       IKR+WC WLGK  N + + +   EHDF +V+F YNY+
Sbjct: 171  KVREVGLGKIAARFLEKCHALNGIKRIWCEWLGKESNGERDGVEVLEHDFAVVIFAYNYD 230

Query: 518  LGRRNLVDDSNPLLVGSPCLNMENIEGKKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXX 339
            LGR  L+DD N LL  +         G+K K S SD +DVS+S+  QYD           
Sbjct: 231  LGRSGLLDDVNTLLPSAS-------GGQKGKSSLSDFDDVSDSVCNQYDSSAEESSDSNN 283

Query: 338  SPT----SEWASSNRQKLTSSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMK 171
            S +     ++ +    +  SSK+LR+ELR+KQRLAAE++C+ICQ +MLPGKDVAALLN+K
Sbjct: 284  SSSRLTLDQFNNHLCTRFISSKALRKELRRKQRLAAEKVCNICQQKMLPGKDVAALLNLK 343

Query: 170  TGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRR 18
            T R+ CSSRN  GAFHVFH SCLIHWI+LCE EI  N L  P V    KR+
Sbjct: 344  TRRVACSSRNRTGAFHVFHTSCLIHWIILCEFEIITNHLVCPNVRRVVKRK 394


>ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600129 [Solanum tuberosum]
          Length = 521

 Score =  353 bits (905), Expect = 1e-94
 Identities = 198/431 (45%), Positives = 265/431 (61%), Gaps = 30/431 (6%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MAG+  L     S  +L+E+  + T+Q  R++GH YVELREDGK   R +FFCTLC SPC
Sbjct: 1    MAGRQ-LDFPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGK---RLVFFCTLCHSPC 56

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  LF+HL+GN H E  AAAK TLL   PWPFNDGVLFF+D  EQ+         + R
Sbjct: 57   YSDSVLFNHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLFFNDP-EQDKHSPNVNVGKSR 115

Query: 848  LLDTNHATNIGNAKISNAKDDSYDGNNFCVD-------------GGKDDMLVPGVLCNDE 708
            L+DT        A +    +  ++G+ +  +             G  + +++PGVLC DE
Sbjct: 116  LVDTCLEDESSLAIVECDDNLRHNGDTYVTEYEYCLLDSELTGNGESEYLVIPGVLCKDE 175

Query: 707  ITRLELKLIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKD-ESMAPEEHDFGIVVFT 531
            ++ LE+K IG G+I ARI       KKI+R+WC WL K +S D ++    +HDF +V F 
Sbjct: 176  LSDLEVKHIGIGKIAARISVRGIDSKKIRRIWCEWLVKKDSDDMDTSVVPDHDFAVVTFP 235

Query: 530  YNYNLGRRNLVDDSNPLLVGSPCLNMENIEG--KKRKKSFSDPEDVSESLSTQYDXXXXX 357
            YNYNLGR+ L+DD   LL  SP    E   G  K+++KSFSDPED SESLS   D     
Sbjct: 236  YNYNLGRKPLLDDRF-LLPSSPYSESEETSGTRKRKRKSFSDPEDFSESLSNHCDSSGEE 294

Query: 356  XXXXXXSPTSEWASSNRQKLT--------------SSKSLRRELRQKQRLAAERMCDICQ 219
                     S+  +++  KL               SSK++RRELR++QR+A+ERMCDICQ
Sbjct: 295  ---------SQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQ 345

Query: 218  HRMLPGKDVAALLNMKTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKV 39
             +MLPGKDVA LL+ K+G+L+CSSRN+ GAFH+FH+SCLIHWIL CEL+ ++  +  PK+
Sbjct: 346  QKMLPGKDVATLLSWKSGKLMCSSRNMTGAFHLFHVSCLIHWILQCELQTYVKPVDEPKM 405

Query: 38   ISDCKRRSRAK 6
             +  KRRS+ K
Sbjct: 406  ETKAKRRSKRK 416


>ref|XP_003541913.1| PREDICTED: uncharacterized protein LOC100807746 [Glycine max]
          Length = 500

 Score =  353 bits (905), Expect = 1e-94
 Identities = 197/411 (47%), Positives = 257/411 (62%), Gaps = 14/411 (3%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MAG+  LG      S+ +E+AA+  ++  R++GH YVELRE+GK   +FI+FCTLCL+PC
Sbjct: 1    MAGKLELGPPKSDISNPKEQAARKILKIVRSQGHPYVELRENGK---KFIYFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  LFDHL+GN HRER +AAKVTLLG KPWPFNDG++FF  S E +  L V    + R
Sbjct: 58   YSDDVLFDHLKGNLHRERLSAAKVTLLGPKPWPFNDGLVFFDTSTESDKELEVADSYRNR 117

Query: 848  LLDTNH------ATNIGNAKISNAKDDSYDGNNFCVDGGKDD---MLVPGVLCNDEITRL 696
            LL  N           G    SNAK  S       ++G +DD   +++P +L  DEI  L
Sbjct: 118  LLKFNDDDSSLAIVKFGEGVQSNAKPCS-------IEGMQDDECALVIPNLLIGDEIFDL 170

Query: 695  ELKLIGFGEIGARIVENNETQKKIKRMWCAWLGK-MNSKDESMAPEEHDFGIVVFTYNYN 519
            ++K +G G+I AR +E       IKR+WC WLGK  N + + +   EHDF +V+F YNY+
Sbjct: 171  KVKEVGLGKIAARFLEKCHALNGIKRIWCEWLGKESNGERDGVEVLEHDFAVVIFAYNYD 230

Query: 518  LGRRNLVDDSNPLLVGSPCLNMENIEGKKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXX 339
            LGR  L+DD   LL  S         G+K K S SD +DVS+ L  QYD           
Sbjct: 231  LGRSGLLDDVKTLLPVSA--------GQKGKTSLSDSDDVSDFLCNQYDSSAEESSDSNN 282

Query: 338  SPT----SEWASSNRQKLTSSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMK 171
            S +     ++ +    +  SSK+LR+ELR+KQRLAAE++C+ICQ +MLPGKDVAALLN+K
Sbjct: 283  SSSRLTLDQFNNHLCTRFISSKALRKELRRKQRLAAEKVCNICQQKMLPGKDVAALLNLK 342

Query: 170  TGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRR 18
            T R+ CSSRN  GAFHVFH SCLIHWI+LCE EI +N L  P +    KR+
Sbjct: 343  TRRVACSSRNRTGAFHVFHTSCLIHWIILCEFEIIINHLVRPNIRRVVKRK 393


>ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261554 [Solanum
            lycopersicum]
          Length = 526

 Score =  351 bits (901), Expect = 4e-94
 Identities = 200/433 (46%), Positives = 267/433 (61%), Gaps = 32/433 (7%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MAG+  L V   S  +L+E+  + T+Q  R++GH YVELREDGK   R IFFCTLC SPC
Sbjct: 1    MAGKQ-LDVPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGK---RLIFFCTLCHSPC 56

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQN------------ 885
            YSD  LF+HL+GN H E  AAAK TLL   PWPFNDGVLFF+D  +              
Sbjct: 57   YSDSVLFNHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLFFNDPEQDKQDKQSPNVNVGK 116

Query: 884  SRLVVPC-PDQGRLLDTNHATNIGNAKISNAKDDSYDGNNFCVDGGK--DDMLVPGVLCN 714
            SRLV  C  D+  +    +  N+ + + +   +  Y   +  + G +  D +++PGVLC 
Sbjct: 117  SRLVDTCLEDESSVAIVEYDDNLRHNEDTYVSEYEYGLLDSELIGNEESDYLVIPGVLCK 176

Query: 713  DEITRLELKLIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKD-ESMAPEEHDFGIVV 537
            DE++ LE+K IG G+I ARI       K I+R+WC WL K +S D ++    +HDF +V 
Sbjct: 177  DELSDLEVKHIGIGKIAARISVRGIDSKSIRRIWCEWLAKKDSDDMDTSVVPDHDFAVVT 236

Query: 536  FTYNYNLGRRNLVDDSNPLLVGSPCLNME--NIEGKKRKKSFSDPEDVSESLSTQYDXXX 363
            F YNYNLGR  L+DD   LL  SP    E  ++ GK+++KSFSDPED SESLS   D   
Sbjct: 237  FPYNYNLGRSPLLDDRF-LLPSSPYSESEETSVTGKRKRKSFSDPEDFSESLSNHCDSSG 295

Query: 362  XXXXXXXXSPTSEWASSNRQKLT--------------SSKSLRRELRQKQRLAAERMCDI 225
                       S+  +++  KL               SSK++RRELR++QR+A+ERMCDI
Sbjct: 296  EE---------SQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDI 346

Query: 224  CQHRMLPGKDVAALLNMKTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMP 45
            CQ +MLPGKDVA LL+ K+G+L+CSSRN++GAFH+FH+SCLIHWIL CEL+  +  +  P
Sbjct: 347  CQQKMLPGKDVATLLSWKSGKLMCSSRNMSGAFHLFHVSCLIHWILQCELQTSVKPVDEP 406

Query: 44   KVISDCKRRSRAK 6
            K+    KRRS+ K
Sbjct: 407  KMEPKAKRRSKKK 419


>ref|XP_007149858.1| hypothetical protein PHAVU_005G104500g [Phaseolus vulgaris]
            gi|561023122|gb|ESW21852.1| hypothetical protein
            PHAVU_005G104500g [Phaseolus vulgaris]
          Length = 498

 Score =  344 bits (882), Expect = 6e-92
 Identities = 196/411 (47%), Positives = 255/411 (62%), Gaps = 14/411 (3%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MAG+  LG      S+ +E+AA+  ++  R++GH YVELRE+GK   +FI+FCTLCL+PC
Sbjct: 1    MAGKLELGPLKSDVSNPKEQAARKILKIVRSQGHPYVELRENGK---KFIYFCTLCLAPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPCPDQGR 849
            YSD  LFDHL+GN H+ER +AAKVTLLG KPWPFNDG++FF  S E +  L V    + R
Sbjct: 58   YSDDVLFDHLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSIESDRDLEVADSYRNR 117

Query: 848  LLDTNHATN------IGNAKISNAKDDSYDG--NNFCVDGGKDDMLVPGVLCNDEITRLE 693
            LL  N+  N            SNA+  S DG  N+ C       +++P +L  DEI  ++
Sbjct: 118  LLKFNNNDNSLAIVKFDEGVQSNAEPCSTDGMPNDEC------GLVIPHLLIRDEIFDVK 171

Query: 692  LKLIGFGEIGARIVENNETQKKIKRMWCAWLGKM-NSKDESMAPEEHDFGIVVFTYNYNL 516
            +  +G G+I AR +E       IKR+WC WLGK  N + + +   EHDF IV F YNY+L
Sbjct: 172  VSEVGLGKIAARFLEKCSALSGIKRIWCEWLGKKGNDQQDGVEILEHDFAIVNFAYNYDL 231

Query: 515  GRRNLVDDSNPLLVGSPCLNMENIEGKKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXS 336
            GR  L+DD   LL  +         G+K K+S SD +D+S+SL  QYD           S
Sbjct: 232  GRSGLLDDVKSLLPSAS-------GGRKGKRSLSDSDDISDSLCNQYDSSAEESSDSNNS 284

Query: 335  --PTSEWASSNRQKLT---SSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMK 171
              P +    +N    T   SSK++R+ELR+KQRLAAE++C+ICQ +MLPGKDVAALLN+ 
Sbjct: 285  SAPLTLDQFNNHHVCTRFISSKAVRKELRRKQRLAAEKVCNICQQKMLPGKDVAALLNLN 344

Query: 170  TGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRR 18
            T R+ CSSRN  GAFHVFH SCLIHWI+LCE EI  N L  P V    KR+
Sbjct: 345  TRRVACSSRNKTGAFHVFHTSCLIHWIILCEFEIITNHLVRPNVRRIVKRK 395


>ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa]
            gi|550325787|gb|EEE95821.2| hypothetical protein
            POPTR_0013s13670g [Populus trichocarpa]
          Length = 513

 Score =  342 bits (877), Expect = 2e-91
 Identities = 200/421 (47%), Positives = 251/421 (59%), Gaps = 20/421 (4%)
 Frame = -2

Query: 1208 MAGQAGLGVSTVSASSLREKAAKITIQRARAEGHTYVELREDGKDGKRFIFFCTLCLSPC 1029
            MAG   +G    +ASSLRE+ A+ T+ R RA GH Y+ELREDGK   RFIFFCTLCLSPC
Sbjct: 1    MAGNREVGFPKTTASSLREQLARTTLSRVRARGHPYLELREDGK---RFIFFCTLCLSPC 57

Query: 1028 YSDYTLFDHLRGNFHRERYAAAKVTLLGSKPWPFNDGVLFFHDSCEQNSRLVVPC-PDQG 852
            YSD  L DHLRGN H ER +AAK TLL   PWPF+DG+ FF  S     +L +    +  
Sbjct: 58   YSDTILLDHLRGNLHTERLSAAKATLLKPNPWPFSDGIHFFDASSGNEEQLAIKDGKESS 117

Query: 851  RLLD-TNHATNIGNAK-ISNAK---DDSYDGNNFCVDGGKDDMLVPGVLCNDEITRLELK 687
            R L    ++ N+   K + N K   D   D N    D G D +++P V   +E++ L+  
Sbjct: 118  RFLKFEENSDNLAIVKYVENLKPGCDTVVDENLSGSDEGSD-LVIPSVRLKEEVSDLKAT 176

Query: 686  LIGFGEIGARIVENNETQKKIKRMWCAWLGKMNSKDESMAPE-EHDFGIVVFTYNYNLGR 510
            L+G G+I AR+ E  +   +I R+WC WLGK +S DE      +HDFG+V F Y+Y LG+
Sbjct: 177  LVGSGQIAARMYEKKDGSNEISRIWCEWLGKKSSNDEDKVKVLDHDFGVVTFAYDYELGK 236

Query: 509  RNLVDDSNPLLVGS-PCLNMENIEGK-KRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXS 336
              L DD   LL  S P L   +  G  KRK+S S+PEDVS SL+ QY             
Sbjct: 237  SGLFDDVKLLLSSSAPALTENDERGNWKRKRSVSEPEDVSRSLTNQYGLCEEESSK---- 292

Query: 335  PTSEWASSN-----------RQKLTSSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVA 189
              +  ASSN             +  S+K++RRE+R++QR+AAE+MCDICQ +MLP KDVA
Sbjct: 293  --TTCASSNLVLDRYDDQLMHTRFISNKTVRREVRKQQRIAAEKMCDICQQKMLPEKDVA 350

Query: 188  ALLNMKTGRLVCSSRNVNGAFHVFHISCLIHWILLCELEIWMNKLAMPKVISDCKRRSRA 9
             L N KTG+L CSSRNV GAFHVFH SCLIHWIL CE EI  N+    K      RRSR 
Sbjct: 351  TLWNRKTGKLACSSRNVYGAFHVFHTSCLIHWILYCEFEIVRNQTVSTK----GGRRSRK 406

Query: 8    K 6
            K
Sbjct: 407  K 407

