BLASTX nr result

ID: Sinomenium22_contig00023222 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00023222
         (1181 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255...   392   e-106
emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]   391   e-106
ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608...   376   e-101
ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citr...   376   e-101
ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310...   363   1e-97
ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prun...   362   1e-97
ref|XP_007043579.1| Uncharacterized protein isoform 5 [Theobroma...   359   1e-96
ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma...   359   1e-96
ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma...   359   1e-96
ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [...   359   1e-96
ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma...   359   1e-96
gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]     359   1e-96
ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204...   347   4e-93
ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600...   344   4e-92
ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261...   344   4e-92
ref|XP_003540357.1| PREDICTED: uncharacterized protein LOC100779...   327   6e-87
ref|XP_002517932.1| conserved hypothetical protein [Ricinus comm...   326   1e-86
ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] ...   324   4e-86
ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Popu...   322   2e-85
ref|XP_003541913.1| PREDICTED: uncharacterized protein LOC100807...   320   6e-85

>ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255678 [Vitis vinifera]
          Length = 520

 Score =  392 bits (1007), Expect = e-106
 Identities = 212/416 (50%), Positives = 266/416 (63%), Gaps = 28/416 (6%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            SASSLRE+AA+ T++    +GH YVELREDGK   RFIFFCTLCL+PCYSE  L+DHL+G
Sbjct: 13   SASSLREQAARTTLRNVRMQGHPYVELREDGK---RFIFFCTLCLAPCYSESVLYDHLKG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            N H E YAAAKVTLL  +PWPFNDGV+FF +S +++ HL++   N   LL T H +D   
Sbjct: 70   NLHSERYAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLGT-HKNDNNL 128

Query: 365  AKISNAKDHSYDGNNLYVD--------------------GGKD-DMLVPGVLCNDEITHL 481
            A + +  D S   NN +V+                    GG++ DM++PGV+  DE+T L
Sbjct: 129  AIVCHGDDLS-QSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVMIKDEVTEL 187

Query: 482  ELKLIGFGEIGARXXXXXXXXXXXXRMWCAWLGKVNSKDES--MAPEHDFGIVVFTYNYN 655
            E++ +GFG+I AR            ++WC W GK    D    M P+HDF +V F Y+YN
Sbjct: 188  EVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGKEEPGDGETVMVPDHDFAVVTFNYHYN 247

Query: 656  LGRRKLVDDSNPLLVGSPCLNLENIQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXX 835
            LGR+ L DD   +L  SP        GRKRKKSFSDPED+SESLS QYD           
Sbjct: 248  LGRKGLFDDVISMLSSSPTEG----SGRKRKKSFSDPEDISESLSNQYDSSGEDSLISNS 303

Query: 836  XXXXXXXXXDRPKL-----IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNM 1000
                        +L     I SK++RRELR++QR+AAERMCDICQH+MLPGKDVA L+NM
Sbjct: 304  PSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLPGKDVATLMNM 363

Query: 1001 KSGRLVCSSRNVNGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAK 1168
            K+G+LVCSSRNV GAFHVFH SCLI WILLCE EI TN+L  PK+    +R+S +K
Sbjct: 364  KTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSRRKSGSK 419


>emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]
          Length = 896

 Score =  391 bits (1004), Expect = e-106
 Identities = 212/416 (50%), Positives = 265/416 (63%), Gaps = 28/416 (6%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            SASSLRE+AA+ T++    +GH YVELREDGK   RFIFFCTLCL+PCYSE  L+DHL+G
Sbjct: 349  SASSLREQAARTTLRNVRMQGHPYVELREDGK---RFIFFCTLCLAPCYSESVLYDHLKG 405

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            N H E YAAAKVTLL  +PWPFNDGV+FF +S +++ HL++   N   LL T H +D   
Sbjct: 406  NLHSERYAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLGT-HKNDNNL 464

Query: 365  AKISNAKDHSYDGNNLYVD--------------------GGKD-DMLVPGVLCNDEITHL 481
            A + +  D S   NN +V+                    GG++ DM++PGV+  DE+T L
Sbjct: 465  AIVCHGDDLS-QSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVMIKDEVTEL 523

Query: 482  ELKLIGFGEIGARXXXXXXXXXXXXRMWCAWLGKVNSKDES--MAPEHDFGIVVFTYNYN 655
            E++ +GFG+I AR            ++WC W GK    D    M P+HDF +V F Y+YN
Sbjct: 524  EVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGKEEPGDGETVMVPDHDFAVVTFNYHYN 583

Query: 656  LGRRKLVDDSNPLLVGSPCLNLENIQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXX 835
            LGR+ L DD   +L  SP        GRKRKKSFSDPED+SESLS QYD           
Sbjct: 584  LGRKGLFDDVISMLSSSPTEG----SGRKRKKSFSDPEDISESLSNQYDSSGEDSLISNS 639

Query: 836  XXXXXXXXXDRPKL-----IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNM 1000
                        +L     I SK++RRELR++QR+AAERMCDICQH+MLPGKDVA L NM
Sbjct: 640  PSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLPGKDVATLXNM 699

Query: 1001 KSGRLVCSSRNVNGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAK 1168
            K+G+LVCSSRNV GAFHVFH SCLI WILLCE EI TN+L  PK+    +R+S +K
Sbjct: 700  KTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSRRKSGSK 755


>ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608093 isoform X5 [Citrus
            sinensis]
          Length = 508

 Score =  376 bits (965), Expect = e-101
 Identities = 210/418 (50%), Positives = 256/418 (61%), Gaps = 26/418 (6%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            SA SLRE+ A+ T+    A+GH YVELREDGK   RFIFFCTLCL+PCYS+  LFDHL+G
Sbjct: 13   SAFSLREQLARTTLSNVRAQGHTYVELREDGK---RFIFFCTLCLAPCYSDLVLFDHLKG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            N H E  +AAKVTLLGPNPWPFNDGV+FF +S +      V        LD  H +D+  
Sbjct: 70   NLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLDY-HNNDSNL 128

Query: 365  AKISNAKDHSYDGN--------NLYVDGGKD---------DMLVPGVLCNDEITHLELKL 493
            A +   +D   +GN        +   + G           D ++PGV   DEI  L ++ 
Sbjct: 129  AIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDEIVDLRVRF 188

Query: 494  IGFGEIGARXXXXXXXXXXXXRMWCAWLGKVNSKDESMA--PEHDFGIVVFTYNYNLGRR 667
            IG G+I AR            R+WC WLGK + +DE +   P+HDF IV F YNY+LGR+
Sbjct: 189  IGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPEDEDIVEIPDHDFAIVTFVYNYDLGRK 248

Query: 668  KLVDDSNPLLVGSPCLNLENIQG--RKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXX 841
             L DD   LL  SP  + EN +G  RKRKKSFSDPEDVSESLS QYD             
Sbjct: 249  GLFDDVKLLLSSSPAEDSENGEGTGRKRKKSFSDPEDVSESLSKQYDSCGEDSSASNSST 308

Query: 842  XXXXXXXDRPKL-----IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKS 1006
                      +L     I SK+ RRE+R++QR+AAERMCDICQ ++LP KDVAALLN+K+
Sbjct: 309  SRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPDKDVAALLNLKT 368

Query: 1007 GRLVCSSRNVNGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAKQRKK 1180
            G L CSSRN+NG FHVFHISCLI WILLCE E++TN+   PKV    KRRSR K   K
Sbjct: 369  GNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKV----KRRSRRKNGSK 422


>ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citrus clementina]
            gi|567910083|ref|XP_006447355.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910085|ref|XP_006447356.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910087|ref|XP_006447357.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|568831767|ref|XP_006470130.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X1 [Citrus
            sinensis] gi|568831769|ref|XP_006470131.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X2 [Citrus
            sinensis] gi|568831771|ref|XP_006470132.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X3 [Citrus
            sinensis] gi|568831773|ref|XP_006470133.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X4 [Citrus
            sinensis] gi|557549965|gb|ESR60594.1| hypothetical
            protein CICLE_v10014904mg [Citrus clementina]
            gi|557549966|gb|ESR60595.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549967|gb|ESR60596.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549968|gb|ESR60597.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
          Length = 523

 Score =  376 bits (965), Expect = e-101
 Identities = 210/418 (50%), Positives = 256/418 (61%), Gaps = 26/418 (6%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            SA SLRE+ A+ T+    A+GH YVELREDGK   RFIFFCTLCL+PCYS+  LFDHL+G
Sbjct: 13   SAFSLREQLARTTLSNVRAQGHTYVELREDGK---RFIFFCTLCLAPCYSDLVLFDHLKG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            N H E  +AAKVTLLGPNPWPFNDGV+FF +S +      V        LD  H +D+  
Sbjct: 70   NLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLDY-HNNDSNL 128

Query: 365  AKISNAKDHSYDGN--------NLYVDGGKD---------DMLVPGVLCNDEITHLELKL 493
            A +   +D   +GN        +   + G           D ++PGV   DEI  L ++ 
Sbjct: 129  AIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDEIVDLRVRF 188

Query: 494  IGFGEIGARXXXXXXXXXXXXRMWCAWLGKVNSKDESMA--PEHDFGIVVFTYNYNLGRR 667
            IG G+I AR            R+WC WLGK + +DE +   P+HDF IV F YNY+LGR+
Sbjct: 189  IGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPEDEDIVEIPDHDFAIVTFVYNYDLGRK 248

Query: 668  KLVDDSNPLLVGSPCLNLENIQG--RKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXX 841
             L DD   LL  SP  + EN +G  RKRKKSFSDPEDVSESLS QYD             
Sbjct: 249  GLFDDVKLLLSSSPAEDSENGEGTGRKRKKSFSDPEDVSESLSKQYDSCGEDSSASNSST 308

Query: 842  XXXXXXXDRPKL-----IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKS 1006
                      +L     I SK+ RRE+R++QR+AAERMCDICQ ++LP KDVAALLN+K+
Sbjct: 309  SRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPDKDVAALLNLKT 368

Query: 1007 GRLVCSSRNVNGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAKQRKK 1180
            G L CSSRN+NG FHVFHISCLI WILLCE E++TN+   PKV    KRRSR K   K
Sbjct: 369  GNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKV----KRRSRRKNGSK 422


>ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310040 [Fragaria vesca
            subsp. vesca]
          Length = 525

 Score =  363 bits (931), Expect = 1e-97
 Identities = 201/424 (47%), Positives = 263/424 (62%), Gaps = 32/424 (7%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            +A SLRE+A +  ++   ++GH YVE+REDGK   +FIFFCTLCL+PCYS+  LFDHL+G
Sbjct: 13   NACSLREQATRTILRNVRSQGHSYVEVREDGK---KFIFFCTLCLAPCYSDKVLFDHLKG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDT-DHFSDTV 361
            N H E  AAAKVTLL PNPWPFNDGV+FF++S + +  +  P  N+  +L++ D+ ++  
Sbjct: 70   NLHNERLAAAKVTLLRPNPWPFNDGVVFFNNSYETDKGVVTPDDNKCRMLESHDNENNLA 129

Query: 362  HAKIS-NAKDHSYDGN-------NLYVD--------------GGKDDMLVPGVLCNDEIT 475
              K   N K + YD         N Y+D              G K  +++PG++  DEIT
Sbjct: 130  IVKYGGNLKTNGYDHCGVDGLECNEYIDLQGLQSNVGDSTADGAKSSVVIPGIVVRDEIT 189

Query: 476  HLELKLIGFGEIGARXXXXXXXXXXXXRMWCAWLG--KVNSKDESMAPEHDFGIVVFTYN 649
             LE++ +G GEI AR            R+WC WLG   ++S+D    PEHDF +V F+YN
Sbjct: 190  DLEVREVGLGEIAARFLGKDGIG----RIWCEWLGVKSIDSEDLCNVPEHDFAVVTFSYN 245

Query: 650  YNLGRRKLVDDSNPLLVGSPCLNLENIQGR--KRKKSFSDPEDVSESLSTQY-----DXX 808
             +LGR+ L+DD   LL  SP +   N +G   KRKKSFSDPED+S+SLS QY     D  
Sbjct: 246  IDLGRKGLLDDVRMLLSSSPTIESGNGEGTGCKRKKSFSDPEDISDSLSNQYESFGEDSS 305

Query: 809  XXXXXXXXXXXXXXXXXXDRPKLIPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAA 988
                                 + I +KS+RRELR++QRLA+ RMCDICQ RMLPGKDVA 
Sbjct: 306  ASSGTASRLLLDHYDDQLLNTRFILNKSIRRELRRQQRLASGRMCDICQQRMLPGKDVAT 365

Query: 989  LLNMKSGRLVCSSRNVNGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAK 1168
            L+N+K+G+L CSSRNVNGAFHVFH SCLI WILLCE+E+ TN+       S  +RRSR K
Sbjct: 366  LMNLKTGKLACSSRNVNGAFHVFHTSCLIHWILLCEVEVITNQ----NTGSKARRRSRRK 421

Query: 1169 QRKK 1180
               K
Sbjct: 422  TAAK 425


>ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prunus persica]
            gi|462394196|gb|EMJ00100.1| hypothetical protein
            PRUPE_ppa004741mg [Prunus persica]
          Length = 493

 Score =  362 bits (930), Expect = 1e-97
 Identities = 202/430 (46%), Positives = 257/430 (59%), Gaps = 38/430 (8%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            SASSLRE+A +  ++   ++GH YVELREDGK   +FIFFCTLCL+PCYS+  LFDHL+G
Sbjct: 13   SASSLREQATRTILRNVRSQGHTYVELREDGK---KFIFFCTLCLAPCYSDKVLFDHLKG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDT-DHFSDTV 361
            N H++  AAAKVTLL PNPWPFNDGV FFH+  + + HL +   N+  +L++ D  ++  
Sbjct: 70   NLHKDRLAAAKVTLLRPNPWPFNDGVAFFHNPDETDKHLVITDGNKFRMLESPDDENNLA 129

Query: 362  HAK-----ISNAKDH-----------------------SYDGNNLYVDGGKDDMLVPGVL 457
              K     ISN  +H                       S    N   +     +++P VL
Sbjct: 130  IVKYGENLISNGNEHVGTDGLECNGSLDFPRVRSNFKFSCSNENSTANEVNSSVVIPSVL 189

Query: 458  CNDEITHLELKLIGFGEIGARXXXXXXXXXXXXRMWCAWLGK--VNSKDESMAPEHDFGI 631
              D++T +E K +G G+I AR            R+WC WLGK  + ++     PEHDF +
Sbjct: 190  VRDDVTDIEAKKVGLGQIAARFLEKDKVSKGIGRIWCEWLGKKAIGNEYHLKVPEHDFAV 249

Query: 632  VVFTYNYNLGRRKLVDDSNPLLVGSPCLNLENIQGR--KRKKSFSDPEDVSESLSTQYDX 805
            V F+YN +LGRR L+DD   LL  SP +  EN +G   KRKKSFSDPED+SESLS QYD 
Sbjct: 250  VTFSYNIDLGRRGLLDDVKMLLSSSPSVETENGEGSGSKRKKSFSDPEDISESLSNQYDS 309

Query: 806  XXXXXXXXXXXXXXXXXXXDRPKL-----IPSKSLRRELRQKQRLAAERMCDICQHRMLP 970
                                  +L     I +KS+RRELR++QRLA  RMCDICQ RM+P
Sbjct: 310  CGEDSSASSGASSKLLLDRYDDQLLHTRFILNKSIRRELRRQQRLALGRMCDICQQRMIP 369

Query: 971  GKDVAALLNMKSGRLVCSSRNVNGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCK 1150
            GKDV+AL+N+K+GRL CSSRNVNGAFHVFH SCLI WILLCE+EI     A     S  +
Sbjct: 370  GKDVSALINLKTGRLACSSRNVNGAFHVFHTSCLIHWILLCEVEI-----ANQSTNSKVR 424

Query: 1151 RRSRAKQRKK 1180
            RRSR K   K
Sbjct: 425  RRSRRKNAAK 434


>ref|XP_007043579.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508707514|gb|EOX99410.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 416

 Score =  359 bits (922), Expect = 1e-96
 Identities = 195/397 (49%), Positives = 245/397 (61%), Gaps = 9/397 (2%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            SA SL+E+ A+ T+    ++GH Y+ELREDGK   RFIFFCTLCL+PCYS+  L DHL+G
Sbjct: 13   SACSLKEQLARTTLNNVRSQGHTYIELREDGK---RFIFFCTLCLAPCYSDSVLLDHLKG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            + H    AAAKVTLLG NPWPFNDGV+FF    +    LA    NQ+ LL+  +  D + 
Sbjct: 70   SLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLEFHNNDDNLA 129

Query: 365  AKISNAKDHSYDGNNLYVDGGKDDMLVPGVLCNDEITHLELKLIGFGEIGARXXXXXXXX 544
                   + S    N+    G  D+L+PGVL  DEI+ L+++ IGFG+I AR        
Sbjct: 130  IVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLKVRFIGFGKIAARFCEKDGVL 189

Query: 545  XXXXRMWCAWLGKV--NSKDESMAPEHDFGIVVFTYNYNLGRRKLVDDSNPLLVGSPCLN 718
                R+WC WLGK    + D+  AP+H F +V F YN +LGR+ L+DD   LL       
Sbjct: 190  NEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVYNCDLGRKGLLDDVKSLLTSGSPTG 249

Query: 719  LEN--IQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXXXXXXDRPKL----- 877
            LEN     RKRKKSFSDPED+SESLS QYD                       +L     
Sbjct: 250  LENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDSSASNITSSRLALDRYDDQLLLTRF 309

Query: 878  IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRLVCSSRNVNGAFHVF 1057
            I SK++RRELR++QR+AAERMCDICQ +MLP KDVA L+N+ +G+LVCSSRNVNGAFHVF
Sbjct: 310  ISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVATLMNLNTGKLVCSSRNVNGAFHVF 369

Query: 1058 HISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAK 1168
            H SCLI WILLCE+E   N    PK     +R++ AK
Sbjct: 370  HTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGAK 406


>ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508707513|gb|EOX99409.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 478

 Score =  359 bits (922), Expect = 1e-96
 Identities = 195/397 (49%), Positives = 245/397 (61%), Gaps = 9/397 (2%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            SA SL+E+ A+ T+    ++GH Y+ELREDGK   RFIFFCTLCL+PCYS+  L DHL+G
Sbjct: 13   SACSLKEQLARTTLNNVRSQGHTYIELREDGK---RFIFFCTLCLAPCYSDSVLLDHLKG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            + H    AAAKVTLLG NPWPFNDGV+FF    +    LA    NQ+ LL+  +  D + 
Sbjct: 70   SLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLEFHNNDDNLA 129

Query: 365  AKISNAKDHSYDGNNLYVDGGKDDMLVPGVLCNDEITHLELKLIGFGEIGARXXXXXXXX 544
                   + S    N+    G  D+L+PGVL  DEI+ L+++ IGFG+I AR        
Sbjct: 130  IVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLKVRFIGFGKIAARFCEKDGVL 189

Query: 545  XXXXRMWCAWLGKV--NSKDESMAPEHDFGIVVFTYNYNLGRRKLVDDSNPLLVGSPCLN 718
                R+WC WLGK    + D+  AP+H F +V F YN +LGR+ L+DD   LL       
Sbjct: 190  NEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVYNCDLGRKGLLDDVKSLLTSGSPTG 249

Query: 719  LEN--IQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXXXXXXDRPKL----- 877
            LEN     RKRKKSFSDPED+SESLS QYD                       +L     
Sbjct: 250  LENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDSSASNITSSRLALDRYDDQLLLTRF 309

Query: 878  IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRLVCSSRNVNGAFHVF 1057
            I SK++RRELR++QR+AAERMCDICQ +MLP KDVA L+N+ +G+LVCSSRNVNGAFHVF
Sbjct: 310  ISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVATLMNLNTGKLVCSSRNVNGAFHVF 369

Query: 1058 HISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAK 1168
            H SCLI WILLCE+E   N    PK     +R++ AK
Sbjct: 370  HTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGAK 406


>ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508707512|gb|EOX99408.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 470

 Score =  359 bits (922), Expect = 1e-96
 Identities = 195/397 (49%), Positives = 245/397 (61%), Gaps = 9/397 (2%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            SA SL+E+ A+ T+    ++GH Y+ELREDGK   RFIFFCTLCL+PCYS+  L DHL+G
Sbjct: 13   SACSLKEQLARTTLNNVRSQGHTYIELREDGK---RFIFFCTLCLAPCYSDSVLLDHLKG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            + H    AAAKVTLLG NPWPFNDGV+FF    +    LA    NQ+ LL+  +  D + 
Sbjct: 70   SLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLEFHNNDDNLA 129

Query: 365  AKISNAKDHSYDGNNLYVDGGKDDMLVPGVLCNDEITHLELKLIGFGEIGARXXXXXXXX 544
                   + S    N+    G  D+L+PGVL  DEI+ L+++ IGFG+I AR        
Sbjct: 130  IVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLKVRFIGFGKIAARFCEKDGVL 189

Query: 545  XXXXRMWCAWLGKV--NSKDESMAPEHDFGIVVFTYNYNLGRRKLVDDSNPLLVGSPCLN 718
                R+WC WLGK    + D+  AP+H F +V F YN +LGR+ L+DD   LL       
Sbjct: 190  NEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVYNCDLGRKGLLDDVKSLLTSGSPTG 249

Query: 719  LEN--IQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXXXXXXDRPKL----- 877
            LEN     RKRKKSFSDPED+SESLS QYD                       +L     
Sbjct: 250  LENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDSSASNITSSRLALDRYDDQLLLTRF 309

Query: 878  IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRLVCSSRNVNGAFHVF 1057
            I SK++RRELR++QR+AAERMCDICQ +MLP KDVA L+N+ +G+LVCSSRNVNGAFHVF
Sbjct: 310  ISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVATLMNLNTGKLVCSSRNVNGAFHVF 369

Query: 1058 HISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAK 1168
            H SCLI WILLCE+E   N    PK     +R++ AK
Sbjct: 370  HTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGAK 406


>ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508707511|gb|EOX99407.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 481

 Score =  359 bits (922), Expect = 1e-96
 Identities = 195/397 (49%), Positives = 245/397 (61%), Gaps = 9/397 (2%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            SA SL+E+ A+ T+    ++GH Y+ELREDGK   RFIFFCTLCL+PCYS+  L DHL+G
Sbjct: 13   SACSLKEQLARTTLNNVRSQGHTYIELREDGK---RFIFFCTLCLAPCYSDSVLLDHLKG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            + H    AAAKVTLLG NPWPFNDGV+FF    +    LA    NQ+ LL+  +  D + 
Sbjct: 70   SLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLEFHNNDDNLA 129

Query: 365  AKISNAKDHSYDGNNLYVDGGKDDMLVPGVLCNDEITHLELKLIGFGEIGARXXXXXXXX 544
                   + S    N+    G  D+L+PGVL  DEI+ L+++ IGFG+I AR        
Sbjct: 130  IVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLKVRFIGFGKIAARFCEKDGVL 189

Query: 545  XXXXRMWCAWLGKV--NSKDESMAPEHDFGIVVFTYNYNLGRRKLVDDSNPLLVGSPCLN 718
                R+WC WLGK    + D+  AP+H F +V F YN +LGR+ L+DD   LL       
Sbjct: 190  NEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVYNCDLGRKGLLDDVKSLLTSGSPTG 249

Query: 719  LEN--IQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXXXXXXDRPKL----- 877
            LEN     RKRKKSFSDPED+SESLS QYD                       +L     
Sbjct: 250  LENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDSSASNITSSRLALDRYDDQLLLTRF 309

Query: 878  IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRLVCSSRNVNGAFHVF 1057
            I SK++RRELR++QR+AAERMCDICQ +MLP KDVA L+N+ +G+LVCSSRNVNGAFHVF
Sbjct: 310  ISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVATLMNLNTGKLVCSSRNVNGAFHVF 369

Query: 1058 HISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAK 1168
            H SCLI WILLCE+E   N    PK     +R++ AK
Sbjct: 370  HTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGAK 406


>ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508707510|gb|EOX99406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 517

 Score =  359 bits (922), Expect = 1e-96
 Identities = 195/397 (49%), Positives = 245/397 (61%), Gaps = 9/397 (2%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            SA SL+E+ A+ T+    ++GH Y+ELREDGK   RFIFFCTLCL+PCYS+  L DHL+G
Sbjct: 13   SACSLKEQLARTTLNNVRSQGHTYIELREDGK---RFIFFCTLCLAPCYSDSVLLDHLKG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            + H    AAAKVTLLG NPWPFNDGV+FF    +    LA    NQ+ LL+  +  D + 
Sbjct: 70   SLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLEFHNNDDNLA 129

Query: 365  AKISNAKDHSYDGNNLYVDGGKDDMLVPGVLCNDEITHLELKLIGFGEIGARXXXXXXXX 544
                   + S    N+    G  D+L+PGVL  DEI+ L+++ IGFG+I AR        
Sbjct: 130  IVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLKVRFIGFGKIAARFCEKDGVL 189

Query: 545  XXXXRMWCAWLGKV--NSKDESMAPEHDFGIVVFTYNYNLGRRKLVDDSNPLLVGSPCLN 718
                R+WC WLGK    + D+  AP+H F +V F YN +LGR+ L+DD   LL       
Sbjct: 190  NEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVYNCDLGRKGLLDDVKSLLTSGSPTG 249

Query: 719  LEN--IQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXXXXXXDRPKL----- 877
            LEN     RKRKKSFSDPED+SESLS QYD                       +L     
Sbjct: 250  LENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDSSASNITSSRLALDRYDDQLLLTRF 309

Query: 878  IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRLVCSSRNVNGAFHVF 1057
            I SK++RRELR++QR+AAERMCDICQ +MLP KDVA L+N+ +G+LVCSSRNVNGAFHVF
Sbjct: 310  ISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVATLMNLNTGKLVCSSRNVNGAFHVF 369

Query: 1058 HISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAK 1168
            H SCLI WILLCE+E   N    PK     +R++ AK
Sbjct: 370  HTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGAK 406


>gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]
          Length = 638

 Score =  359 bits (921), Expect = 1e-96
 Identities = 189/418 (45%), Positives = 255/418 (61%), Gaps = 26/418 (6%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            ++ SL+++A +  ++   ++GH YVELREDGK+    IFFCTLCL+PCYS+  LFDHL+G
Sbjct: 21   TSCSLKDQAKRTILRNVRSQGHTYVELREDGKKS---IFFCTLCLAPCYSDCVLFDHLKG 77

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            N H +  + AKVTLLGPNPWPFNDGV+FF++  +++    +   NQ  LL++    + + 
Sbjct: 78   NLHNQRLSTAKVTLLGPNPWPFNDGVVFFNNPTENDDDTVISNGNQSRLLESQDSENNLA 137

Query: 365  A------------------KISNAKDHSYDGNNLYVDGGKDDMLVPGVLCNDEITHLELK 490
                               ++ +  ++     NL   G    +L+PGV   DEI ++E++
Sbjct: 138  IVTYGENLESCANGHIMVDELGHQNENPDSAGNLAGSGENCAVLIPGVRAGDEIANVEVR 197

Query: 491  LIGFGEIGARXXXXXXXXXXXXRMWCAWLGKVNSKDESM--APEHDFGIVVFTYN-YNLG 661
             +G+G I  R            R+WC WLGK   +DE     PEHDF IV F+YN ++LG
Sbjct: 198  EVGYGLISVRFREKDGVSNDISRIWCEWLGKKTIEDEDFLKVPEHDFAIVTFSYNNFSLG 257

Query: 662  RRKLVDDSNPLLVGSPCLNLEN--IQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXX 835
            R  L DD   LL  SP   ++N  +  RKR+KSFSDPED SE+LS QYD           
Sbjct: 258  RMGLHDDVKALLCSSPAAEMQNGDVSSRKRRKSFSDPEDSSENLSNQYDSCGEDSSASAV 317

Query: 836  XXXXXXXXXDR---PKLIPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKS 1006
                     D+    + I +K++RRELR++QR+AAERMCDICQH+MLPGKDVA L+N+K+
Sbjct: 318  TSLMLDQYDDQLLQTRFISNKAIRRELRRQQRIAAERMCDICQHKMLPGKDVATLMNVKT 377

Query: 1007 GRLVCSSRNVNGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAKQRKK 1180
            GRL CSSRN NGAFH+FH SCLI W+LLCE+E  TN+   PKV    KRRSR K   K
Sbjct: 378  GRLACSSRNTNGAFHLFHTSCLIHWVLLCEVEKCTNQSEAPKV----KRRSRRKAASK 431


>ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204451 [Cucumis sativus]
            gi|449475785|ref|XP_004154550.1| PREDICTED:
            uncharacterized LOC101204451 [Cucumis sativus]
          Length = 525

 Score =  347 bits (891), Expect = 4e-93
 Identities = 187/414 (45%), Positives = 257/414 (62%), Gaps = 26/414 (6%)
 Frame = +2

Query: 14   SLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRGNFH 193
            SLRE+AA+  ++   ++GH YVELRE+GK   +FIFFCTLCL+PCYS+  LF HL+G  H
Sbjct: 16   SLREQAARTILRNVRSQGHTYVELRENGK---KFIFFCTLCLAPCYSDSVLFSHLKGTLH 72

Query: 194  REMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTV---- 361
             E  +AAK+TLLGPNPWPF+DGV+FFH   + ++ + +   N + LL+ ++  + +    
Sbjct: 73   TERLSAAKLTLLGPNPWPFDDGVLFFHKPIEGDNQVGISNDNHERLLEYNNNDNNLAIVK 132

Query: 362  ---HAKISNAKDHSYDGNNLYV---------DGGKD-DMLVPGVLCNDEITHLELKLIGF 502
               ++K +  +   ++GN   V         DGG+   +++PGVL  +EI+ ++++ +G+
Sbjct: 133  YVGNSKGNGNRQEEFNGNMRNVEDCSFENLNDGGESCPLVIPGVLIKEEISDIKVRELGY 192

Query: 503  GEIGARXXXXXXXXXXXXRMWCAWLGKVNSKDESMA--PEHDFGIVVFTYNYNLGRRKLV 676
            G+I AR            R+WC WLGKVN   E+M   PEH++ I+ FTYN +LGR+ L+
Sbjct: 193  GQIAARFTEKDGIFSGVSRIWCEWLGKVNDGIENMVKVPEHNYAIITFTYNVDLGRKGLL 252

Query: 677  DDSNPLLVGSPCLNLENIQGR--KRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXX 850
            DD   LL  SP    +N + R  KRKKSFSDPED S S+S QYD                
Sbjct: 253  DDVKLLLSSSPGAESQNDENRQVKRKKSFSDPEDGSLSMSPQYDSSGEDSSASNCVMSSL 312

Query: 851  XXXXDRPKLIPS-----KSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRL 1015
                   +++ +     K++RRELR++QRLAAERMCDICQ ++L  KDVA LLNMK+GRL
Sbjct: 313  SLDGYDDQILSTTVMLNKAVRRELRRQQRLAAERMCDICQQKILTHKDVATLLNMKTGRL 372

Query: 1016 VCSSRNVNGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAKQRK 1177
             CSSRNVNG FHVFH SCLI WILLCE EI    L   KV    +R+ + K  K
Sbjct: 373  ACSSRNVNGVFHVFHTSCLIHWILLCEYEISVKDLGGSKVRRRYRRKKKTKGNK 426


>ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600129 [Solanum tuberosum]
          Length = 521

 Score =  344 bits (883), Expect = 4e-92
 Identities = 195/415 (46%), Positives = 249/415 (60%), Gaps = 23/415 (5%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            S  +L+E+  + T+Q   ++GH YVELREDGK   R +FFCTLC SPCYS+  LF+HL+G
Sbjct: 12   SGGNLKEQLVRRTLQNVRSQGHIYVELREDGK---RLVFFCTLCHSPCYSDSVLFNHLKG 68

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVH 364
            N H EM AAAK TLL PNPWPFNDGV+FF+D  + + H       +  L+DT    D   
Sbjct: 69   NLHTEMLAAAKATLLKPNPWPFNDGVLFFNDP-EQDKHSPNVNVGKSRLVDTC-LEDESS 126

Query: 365  AKISNAKDHSYDGNNLYV--------------DGGKDDMLVPGVLCNDEITHLELKLIGF 502
              I    D+     + YV              +G  + +++PGVLC DE++ LE+K IG 
Sbjct: 127  LAIVECDDNLRHNGDTYVTEYEYCLLDSELTGNGESEYLVIPGVLCKDELSDLEVKHIGI 186

Query: 503  GEIGARXXXXXXXXXXXXRMWCAWLGKVNSKD--ESMAPEHDFGIVVFTYNYNLGRRKLV 676
            G+I AR            R+WC WL K +S D   S+ P+HDF +V F YNYNLGR+ L+
Sbjct: 187  GKIAARISVRGIDSKKIRRIWCEWLVKKDSDDMDTSVVPDHDFAVVTFPYNYNLGRKPLL 246

Query: 677  DDSNPLLVGSPCLNLENIQG-RKRK-KSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXX 850
            DD   LL  SP    E   G RKRK KSFSDPED SESLS   D                
Sbjct: 247  DDRF-LLPSSPYSESEETSGTRKRKRKSFSDPEDFSESLSNHCDSSGEESQSTNNSNMKL 305

Query: 851  XXXXDRPKL-----IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRL 1015
                   +L     I SK++RRELR++QR+A+ERMCDICQ +MLPGKDVA LL+ KSG+L
Sbjct: 306  ILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDVATLLSWKSGKL 365

Query: 1016 VCSSRNVNGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAKQRKK 1180
            +CSSRN+ GAFH+FH+SCLI WIL CEL+     +  PK+ +  KRRS+ K   K
Sbjct: 366  MCSSRNMTGAFHLFHVSCLIHWILQCELQTYVKPVDEPKMETKAKRRSKRKTGTK 420


>ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261554 [Solanum
            lycopersicum]
          Length = 526

 Score =  344 bits (883), Expect = 4e-92
 Identities = 191/416 (45%), Positives = 252/416 (60%), Gaps = 24/416 (5%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            S  +L+E+  + T+Q   ++GH YVELREDGK   R IFFCTLC SPCYS+  LF+HL+G
Sbjct: 12   SGGNLKEQLVRRTLQNVRSQGHIYVELREDGK---RLIFFCTLCHSPCYSDSVLFNHLKG 68

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPN--QDGLLDT------ 340
            N H EM AAAK TLL PNPWPFNDGV+FF+D          P  N  +  L+DT      
Sbjct: 69   NLHTEMLAAAKATLLKPNPWPFNDGVLFFNDPEQDKQDKQSPNVNVGKSRLVDTCLEDES 128

Query: 341  -----DHFSDTVHAKISNAKDHSYD--GNNLYVDGGKDDMLVPGVLCNDEITHLELKLIG 499
                 ++  +  H + +   ++ Y    + L  +   D +++PGVLC DE++ LE+K IG
Sbjct: 129  SVAIVEYDDNLRHNEDTYVSEYEYGLLDSELIGNEESDYLVIPGVLCKDELSDLEVKHIG 188

Query: 500  FGEIGARXXXXXXXXXXXXRMWCAWLGKVNSKD--ESMAPEHDFGIVVFTYNYNLGRRKL 673
             G+I AR            R+WC WL K +S D   S+ P+HDF +V F YNYNLGR  L
Sbjct: 189  IGKIAARISVRGIDSKSIRRIWCEWLAKKDSDDMDTSVVPDHDFAVVTFPYNYNLGRSPL 248

Query: 674  VDDSNPLLVGSPCLNLE--NIQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXX 847
            +DD   LL  SP    E  ++ G++++KSFSDPED SESLS   D               
Sbjct: 249  LDDRF-LLPSSPYSESEETSVTGKRKRKSFSDPEDFSESLSNHCDSSGEESQSTNNSNMK 307

Query: 848  XXXXXDRPKL-----IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGR 1012
                    +L     I SK++RRELR++QR+A+ERMCDICQ +MLPGKDVA LL+ KSG+
Sbjct: 308  LILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDVATLLSWKSGK 367

Query: 1013 LVCSSRNVNGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAKQRKK 1180
            L+CSSRN++GAFH+FH+SCLI WIL CEL+     +  PK+    KRRS+ K   K
Sbjct: 368  LMCSSRNMSGAFHLFHVSCLIHWILQCELQTSVKPVDEPKMEPKAKRRSKKKTGTK 423


>ref|XP_003540357.1| PREDICTED: uncharacterized protein LOC100779572 isoform X1 [Glycine
            max] gi|571494415|ref|XP_006592839.1| PREDICTED:
            uncharacterized protein LOC100779572 isoform X2 [Glycine
            max]
          Length = 501

 Score =  327 bits (838), Expect = 6e-87
 Identities = 186/400 (46%), Positives = 244/400 (61%), Gaps = 18/400 (4%)
 Frame = +2

Query: 11   SSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRGNF 190
            S+ +E+AA+  ++   ++GH YVELRE+GK   +FI+FCTLCL+PCYS+  LFDHL+GN 
Sbjct: 15   SNPKEQAARKILKIVRSQGHPYVELRENGK---KFIYFCTLCLAPCYSDDVLFDHLKGNL 71

Query: 191  HREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLL---DTD------ 343
            H+E  +AAKVTLLGP PWPFNDG++FF  S + +  L V    Q+ LL   D D      
Sbjct: 72   HKERLSAAKVTLLGPKPWPFNDGLVFFDTSTESHKELEVADSYQNRLLKFNDNDVSLAIV 131

Query: 344  HFSDTVHAKISNAKDHSYDGNNLYVDGGKDD---MLVPGVLCNDEITHLELKLIGFGEIG 514
             F D V    SNAK  S       +DG +DD   +++P +L  DEI  ++++ +G G+I 
Sbjct: 132  KFGDGVQ---SNAKPRS-------IDGMQDDEYALVIPNLLIGDEIFDVKVREVGLGKIA 181

Query: 515  ARXXXXXXXXXXXXRMWCAWLGKVNS--KDESMAPEHDFGIVVFTYNYNLGRRKLVDDSN 688
            AR            R+WC WLGK ++  +D     EHDF +V+F YNY+LGR  L+DD N
Sbjct: 182  ARFLEKCHALNGIKRIWCEWLGKESNGERDGVEVLEHDFAVVIFAYNYDLGRSGLLDDVN 241

Query: 689  PLLVGSPCLNLENIQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXXXXXXDR 868
             LL  +         G+K K S SD +DVS+S+  QYD                      
Sbjct: 242  TLLPSAS-------GGQKGKSSLSDFDDVSDSVCNQYDSSAEESSDSNNSSSRLTLDQFN 294

Query: 869  PKL----IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRLVCSSRNV 1036
              L    I SK+LR+ELR+KQRLAAE++C+ICQ +MLPGKDVAALLN+K+ R+ CSSRN 
Sbjct: 295  NHLCTRFISSKALRKELRRKQRLAAEKVCNICQQKMLPGKDVAALLNLKTRRVACSSRNR 354

Query: 1037 NGAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRR 1156
             GAFHVFH SCLI WI+LCE EI TN L  P V    KR+
Sbjct: 355  TGAFHVFHTSCLIHWIILCEFEIITNHLVCPNVRRVVKRK 394


>ref|XP_002517932.1| conserved hypothetical protein [Ricinus communis]
            gi|223542914|gb|EEF44450.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 509

 Score =  326 bits (836), Expect = 1e-86
 Identities = 181/400 (45%), Positives = 236/400 (59%), Gaps = 13/400 (3%)
 Frame = +2

Query: 8    ASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRGN 187
            A+SL+E+ A+ T+    ++GH YVELREDGK   RFIFFCTLCL+PCYS+  LFDHL+GN
Sbjct: 15   ANSLKEQLARTTLNNVRSKGHPYVELREDGK---RFIFFCTLCLAPCYSDAVLFDHLKGN 71

Query: 188  FHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQD-GLLDTDHFSDTVH 364
             H E  + A +TLL  NPWPF+DGV FF  S ++   L +   N+  G  ++        
Sbjct: 72   LHTERLSTATLTLLKENPWPFSDGVHFFDTSSENEKQLVIKNDNESRGNGNSSLAIVKYG 131

Query: 365  AKISNAKDHSYDGNNLYVDGGK-DDMLVPGVLCNDEITHLELKLIGFGEIGARXXXXXXX 541
              +    D     N    D G+  D+L+ GVL  D+I+ L+ + +G+G IGAR       
Sbjct: 132  GSLKPTGDEDTGCNKDANDNGRISDLLIQGVLVKDDISDLQARFMGYGRIGARLIEKDGN 191

Query: 542  XXXXXRMWCAWLGKVN--SKDESMAPEHDFGIVVFTYNYNLGRRKLVDDSNPLLVGSPCL 715
                 R+WC WLGK      D++   +H+F +V F YNY+LGR+ L+DD   LL  SP  
Sbjct: 192  SNDISRIWCEWLGKNTPCDLDKAKVLDHEFAVVTFAYNYDLGRKGLLDDVKLLLSSSPVQ 251

Query: 716  NLENIQG--RKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXXXXXXDR------- 868
              +N  G  RKRKKSFSDPEDVSES S QYD                    DR       
Sbjct: 252  ESDNQGGTNRKRKKSFSDPEDVSESFSNQYD-SSGEESLTSIGGPPTRLLLDRHDDQFLH 310

Query: 869  PKLIPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRLVCSSRNVNGAF 1048
             K+I SK+LRRELR++  +AAERMCDICQ ++LP KDVA L+NM +G+L CSSRN  G +
Sbjct: 311  SKVISSKTLRRELRRQHHIAAERMCDICQQKILPEKDVATLVNMNTGKLACSSRNTYGQY 370

Query: 1049 HVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAK 1168
            HVFH SCLI WILL E E+  N+   PK     +R++  K
Sbjct: 371  HVFHTSCLIHWILLSEYEMARNQSVSPKGRRKSRRKNGTK 410


>ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana]
            gi|145334149|ref|NP_001078455.1| uncharacterized protein
            [Arabidopsis thaliana] gi|7269680|emb|CAB79628.1|
            putative protein [Arabidopsis thaliana]
            gi|110742700|dbj|BAE99261.1| hypothetical protein
            [Arabidopsis thaliana] gi|332660060|gb|AEE85460.1|
            uncharacterized protein AT4G28260 [Arabidopsis thaliana]
            gi|332660061|gb|AEE85461.1| uncharacterized protein
            AT4G28260 [Arabidopsis thaliana]
          Length = 516

 Score =  324 bits (831), Expect = 4e-86
 Identities = 175/399 (43%), Positives = 232/399 (58%), Gaps = 17/399 (4%)
 Frame = +2

Query: 14   SLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRGNFH 193
            +L+E+ A+ T++    +GH Y+ELREDGK   RF+FFCTLCL+PCYS+  L  HL GN H
Sbjct: 15   NLKEQLARTTLKNLRLQGHTYIELREDGK---RFVFFCTLCLAPCYSDTILLGHLNGNLH 71

Query: 194  REMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDT-DHFSDTVHAK 370
            +E  A A++TLLG NPWPF+DGV+FF  S       + P    +G+ DT +H SD     
Sbjct: 72   KERLACARITLLGTNPWPFSDGVLFFDSSTGEEEEKS-PVSGGEGVPDTLEHCSDDERFA 130

Query: 371  ISNAKDHSYDGNNLYV-------DGGKDDMLVPGVLCNDEITHLELKLIGFGEIGARXXX 529
            I    ++  +G+N+             DD+L+ GVL  +    +E K IGFG I AR   
Sbjct: 131  IVKYDNNKTNGDNVPAAVTDDEPSHAADDLLISGVLIKERTLDVEAKFIGFGRIAARLFE 190

Query: 530  XXXXXXXXXRMWCAWLGKVNSKDESMA--PEHDFGIVVFTYNYNLGRRKLVDDSNPLLVG 703
                     ++WC WLG     DE  A  PEHDF IV F+Y YNLGR  L+DD   LL  
Sbjct: 191  TKGRTTWIDKLWCEWLGDEGPSDEEKATIPEHDFAIVTFSYFYNLGRLGLLDDPGRLLTS 250

Query: 704  SPCL--NLENIQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXXXXXXDRPKL 877
            S     N E+  GRKRKKSFSDPED SESL  QYD                        L
Sbjct: 251  SQSESGNGED-SGRKRKKSFSDPEDTSESLCNQYDSSEEVSSGHNSNSSRDLIADYDDSL 309

Query: 878  -----IPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRLVCSSRNVNG 1042
                 + ++++RRELR++QR+ +ER+C++C+ +MLPGKD AA+LNMK+G L C SRN+ G
Sbjct: 310  MSKRVVKNRTVRRELRRQQRIFSERICEVCKQKMLPGKDAAAILNMKTGNLACGSRNLLG 369

Query: 1043 AFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRS 1159
            AFH+FH+SC++ W L CE EI  NK+   K    C + S
Sbjct: 370  AFHLFHVSCVVHWFLFCESEILGNKMVSGKGKKRCTKHS 408


>ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa]
            gi|550325787|gb|EEE95821.2| hypothetical protein
            POPTR_0013s13670g [Populus trichocarpa]
          Length = 513

 Score =  322 bits (826), Expect = 2e-85
 Identities = 188/407 (46%), Positives = 239/407 (58%), Gaps = 15/407 (3%)
 Frame = +2

Query: 5    SASSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRG 184
            +ASSLRE+ A+ T+ R  A GH Y+ELREDGK   RFIFFCTLCLSPCYS+  L DHLRG
Sbjct: 13   TASSLREQLARTTLSRVRARGHPYLELREDGK---RFIFFCTLCLSPCYSDTILLDHLRG 69

Query: 185  NFHREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDG-LLDTDHFSDT- 358
            N H E  +AAK TLL PNPWPF+DG+ FF  S  +   LA+    +    L  +  SD  
Sbjct: 70   NLHTERLSAAKATLLKPNPWPFSDGIHFFDASSGNEEQLAIKDGKESSRFLKFEENSDNL 129

Query: 359  -VHAKISNAK---DHSYDGNNLYVDGGKDDMLVPGVLCNDEITHLELKLIGFGEIGARXX 526
             +   + N K   D   D N    D G D +++P V   +E++ L+  L+G G+I AR  
Sbjct: 130  AIVKYVENLKPGCDTVVDENLSGSDEGSD-LVIPSVRLKEEVSDLKATLVGSGQIAARMY 188

Query: 527  XXXXXXXXXXRMWCAWLGKVNSKDESMAP--EHDFGIVVFTYNYNLGRRKLVDDSNPLLV 700
                      R+WC WLGK +S DE      +HDFG+V F Y+Y LG+  L DD   LL 
Sbjct: 189  EKKDGSNEISRIWCEWLGKKSSNDEDKVKVLDHDFGVVTFAYDYELGKSGLFDDVKLLLS 248

Query: 701  GS-PCLNLENIQGR-KRKKSFSDPEDVSESLSTQY-----DXXXXXXXXXXXXXXXXXXX 859
             S P L   + +G  KRK+S S+PEDVS SL+ QY     +                   
Sbjct: 249  SSAPALTENDERGNWKRKRSVSEPEDVSRSLTNQYGLCEEESSKTTCASSNLVLDRYDDQ 308

Query: 860  XDRPKLIPSKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRLVCSSRNVN 1039
                + I +K++RRE+R++QR+AAE+MCDICQ +MLP KDVA L N K+G+L CSSRNV 
Sbjct: 309  LMHTRFISNKTVRREVRKQQRIAAEKMCDICQQKMLPEKDVATLWNRKTGKLACSSRNVY 368

Query: 1040 GAFHVFHISCLIQWILLCELEIRTNKLAMPKVTSDCKRRSRAKQRKK 1180
            GAFHVFH SCLI WIL CE EI  N+     V++   RRSR K   K
Sbjct: 369  GAFHVFHTSCLIHWILYCEFEIVRNQ----TVSTKGGRRSRKKNGTK 411


>ref|XP_003541913.1| PREDICTED: uncharacterized protein LOC100807746 [Glycine max]
          Length = 500

 Score =  320 bits (821), Expect = 6e-85
 Identities = 176/391 (45%), Positives = 238/391 (60%), Gaps = 9/391 (2%)
 Frame = +2

Query: 11   SSLREKAAKITVQRAHAEGHKYVELREDGKERKRFIFFCTLCLSPCYSEYTLFDHLRGNF 190
            S+ +E+AA+  ++   ++GH YVELRE+GK   +FI+FCTLCL+PCYS+  LFDHL+GN 
Sbjct: 15   SNPKEQAARKILKIVRSQGHPYVELRENGK---KFIYFCTLCLAPCYSDDVLFDHLKGNL 71

Query: 191  HREMYAAAKVTLLGPNPWPFNDGVIFFHDSCDHNSHLAVPCPNQDGLLDTDHFSDTVHAK 370
            HRE  +AAKVTLLGP PWPFNDG++FF  S + +  L V    ++ LL  +   D+  A 
Sbjct: 72   HRERLSAAKVTLLGPKPWPFNDGLVFFDTSTESDKELEVADSYRNRLLKFND-DDSSLAI 130

Query: 371  ISNAKDHSYDGNNLYVDGGKDD---MLVPGVLCNDEITHLELKLIGFGEIGARXXXXXXX 541
            +   +    +     ++G +DD   +++P +L  DEI  L++K +G G+I AR       
Sbjct: 131  VKFGEGVQSNAKPCSIEGMQDDECALVIPNLLIGDEIFDLKVKEVGLGKIAARFLEKCHA 190

Query: 542  XXXXXRMWCAWLGKVNS--KDESMAPEHDFGIVVFTYNYNLGRRKLVDDSNPLLVGSPCL 715
                 R+WC WLGK ++  +D     EHDF +V+F YNY+LGR  L+DD   LL  S   
Sbjct: 191  LNGIKRIWCEWLGKESNGERDGVEVLEHDFAVVIFAYNYDLGRSGLLDDVKTLLPVS--- 247

Query: 716  NLENIQGRKRKKSFSDPEDVSESLSTQYDXXXXXXXXXXXXXXXXXXXXDR----PKLIP 883
                  G+K K S SD +DVS+ L  QYD                           + I 
Sbjct: 248  -----AGQKGKTSLSDSDDVSDFLCNQYDSSAEESSDSNNSSSRLTLDQFNNHLCTRFIS 302

Query: 884  SKSLRRELRQKQRLAAERMCDICQHRMLPGKDVAALLNMKSGRLVCSSRNVNGAFHVFHI 1063
            SK+LR+ELR+KQRLAAE++C+ICQ +MLPGKDVAALLN+K+ R+ CSSRN  GAFHVFH 
Sbjct: 303  SKALRKELRRKQRLAAEKVCNICQQKMLPGKDVAALLNLKTRRVACSSRNRTGAFHVFHT 362

Query: 1064 SCLIQWILLCELEIRTNKLAMPKVTSDCKRR 1156
            SCLI WI+LCE EI  N L  P +    KR+
Sbjct: 363  SCLIHWIILCEFEIIINHLVRPNIRRVVKRK 393

