BLASTX nr result

ID: Akebia27_contig00004332 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00004332
         (2932 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006470988.1| PREDICTED: uncharacterized protein LOC102624...   576   e-161
ref|XP_007041142.1| Uncharacterized protein isoform 1 [Theobroma...   575   e-161
ref|XP_006431494.1| hypothetical protein CICLE_v10001347mg [Citr...   572   e-160
ref|XP_007041143.1| Uncharacterized protein isoform 2 [Theobroma...   567   e-158
ref|XP_002272495.2| PREDICTED: uncharacterized protein LOC100244...   547   e-153
emb|CBI17649.3| unnamed protein product [Vitis vinifera]              547   e-153
ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255...   544   e-152
ref|XP_002512624.1| conserved hypothetical protein [Ricinus comm...   538   e-150
ref|XP_004142563.1| PREDICTED: uncharacterized protein LOC101221...   536   e-149
ref|XP_007029398.1| Uncharacterized protein isoform 1 [Theobroma...   531   e-148
ref|XP_007222756.1| hypothetical protein PRUPE_ppa006529mg [Prun...   530   e-147
ref|XP_006431495.1| hypothetical protein CICLE_v10001347mg [Citr...   528   e-147
ref|XP_004301559.1| PREDICTED: uncharacterized protein LOC101306...   526   e-146
ref|XP_002325467.2| hypothetical protein POPTR_0019s08010g [Popu...   522   e-145
gb|EYU46099.1| hypothetical protein MIMGU_mgv1a007624mg [Mimulus...   515   e-143
ref|XP_007041144.1| Uncharacterized protein isoform 3, partial [...   513   e-142
ref|XP_006385071.1| hypothetical protein POPTR_0004s23630g [Popu...   512   e-142
ref|XP_006602841.1| PREDICTED: uncharacterized protein LOC100798...   511   e-142
ref|XP_007041145.1| Uncharacterized protein isoform 4 [Theobroma...   511   e-142
ref|XP_007049550.1| Uncharacterized protein TCM_002621 [Theobrom...   511   e-142

>ref|XP_006470988.1| PREDICTED: uncharacterized protein LOC102624954 [Citrus sinensis]
          Length = 407

 Score =  576 bits (1485), Expect = e-161
 Identities = 280/405 (69%), Positives = 322/405 (79%), Gaps = 5/405 (1%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            S+L+DP  R SCLCSL    +L+C V+FIGS+F+A + K+RL RWG+  S+   K  TCK
Sbjct: 9    SVLSDPPSR-SCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSAKPETCK 67

Query: 791  NE-CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEM 967
            N+ CR  G+E LP+GIVSKTSNLEMRPLW              LLAIA GIKQK+ V+++
Sbjct: 68   NQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQI 127

Query: 968  VKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNY 1147
            V+KFPS DF VMLFHYDGVVDEW+DL W+DR+IHVSA NQTKWWFAKRFLHPDIVAEYNY
Sbjct: 128  VRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNY 187

Query: 1148 IFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYK 1327
            IFLWDED+GVENF+PRRYLSIVKDEGL+ISQPALDP KSEVHH ITAR R SK HRR+YK
Sbjct: 188  IFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKAHRRMYK 247

Query: 1328 LSGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGD 1507
              G+ RCD  STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLD+QLGYCAQGD
Sbjct: 248  YKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGD 307

Query: 1508 RTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSL----NIGALAPSRSRV 1675
            RT+NVGVVDSEYIVH GLPTLG   + + N+       QA D L    N  ALAPS+SR 
Sbjct: 308  RTKNVGVVDSEYIVHLGLPTLGVTTEPELNT-----VGQASDDLEQIANPVALAPSQSRR 362

Query: 1676 HSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1810
            + NRP VRRQSYIE++IF+NRWK AV++D+CW+DPY Q   Q+SH
Sbjct: 363  YDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPYGQSTNQTSH 407


>ref|XP_007041142.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508705077|gb|EOX96973.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 405

 Score =  575 bits (1483), Expect = e-161
 Identities = 272/400 (68%), Positives = 319/400 (79%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            S+++DPK R SCLC L    SL+C  +FI  AFIA +YK RLSRW + + +Q +K+N CK
Sbjct: 8    SVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQNSKSNICK 66

Query: 791  NECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMV 970
              CRP GSE LP+GIV KTSNLEMRPLW              LLAIAVGIKQKE VN+++
Sbjct: 67   IRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQKEIVNQII 126

Query: 971  KKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYI 1150
            KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPDIVA+Y Y+
Sbjct: 127  KKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDIVADYKYL 186

Query: 1151 FLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKL 1330
            FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQITAR R S+VHRR+YK 
Sbjct: 187  FLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRVHRRMYKF 246

Query: 1331 SGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 1510
             G+ RCDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR
Sbjct: 247  KGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 306

Query: 1511 TQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRSRVHSNRP 1690
             +NVGVVD+EYIVH GL TLG   +N+ NS   + T + + S +   LAPS S    NRP
Sbjct: 307  MKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNIT-RRQPSSDSETLAPSESHKVDNRP 365

Query: 1691 AVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1810
             VRRQS+IE+++F+ RW+ AV +D+CW+DPYQQ   +S+H
Sbjct: 366  EVRRQSFIEMQMFRKRWENAVNQDKCWVDPYQQSVNKSTH 405


>ref|XP_006431494.1| hypothetical protein CICLE_v10001347mg [Citrus clementina]
            gi|557533616|gb|ESR44734.1| hypothetical protein
            CICLE_v10001347mg [Citrus clementina]
          Length = 407

 Score =  572 bits (1475), Expect = e-160
 Identities = 278/405 (68%), Positives = 320/405 (79%), Gaps = 5/405 (1%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            S+L+DP  R SCLCSL    +L+C V+FIGS+F+A + K+RL RWG+  S+   K  TCK
Sbjct: 9    SVLSDPPSR-SCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSAKPETCK 67

Query: 791  NE-CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEM 967
            N+ CR  G+E LP+GIVSKTSNLEMRPLW              LLAIA GIKQK+ V+++
Sbjct: 68   NQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQI 127

Query: 968  VKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNY 1147
            V+KFPS DF VMLFHYD VVDEW+DL W+DR+IHVSA NQTKWWFAKRFLHPDIVAEYNY
Sbjct: 128  VRKFPSKDFVVMLFHYDSVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNY 187

Query: 1148 IFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYK 1327
            IFLWDED+GVENF+PRRYLSIVKDEG +ISQPALDP KSEVHH ITAR R SK HRR+YK
Sbjct: 188  IFLWDEDIGVENFNPRRYLSIVKDEGFEISQPALDPVKSEVHHPITARRRNSKAHRRMYK 247

Query: 1328 LSGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGD 1507
              G+ RCD  STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLD+QLGYCAQGD
Sbjct: 248  YKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGD 307

Query: 1508 RTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSL----NIGALAPSRSRV 1675
            RT+NVGVVDSEYIVH GLPTLG   + + N+       QA D L    N  ALAPS+SR 
Sbjct: 308  RTKNVGVVDSEYIVHLGLPTLGVTTEPELNA-----VGQASDDLEQIANPVALAPSQSRR 362

Query: 1676 HSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1810
            + NRP VRRQSYIE++IF+NRWK AV++D+CW+DPY Q   Q+SH
Sbjct: 363  YDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPYGQSTNQTSH 407


>ref|XP_007041143.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508705078|gb|EOX96974.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 416

 Score =  567 bits (1461), Expect = e-158
 Identities = 272/411 (66%), Positives = 319/411 (77%), Gaps = 11/411 (2%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            S+++DPK R SCLC L    SL+C  +FI  AFIA +YK RLSRW + + +Q +K+N CK
Sbjct: 8    SVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQNSKSNICK 66

Query: 791  NECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMV 970
              CRP GSE LP+GIV KTSNLEMRPLW              LLAIAVGIKQKE VN+++
Sbjct: 67   IRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQKEIVNQII 126

Query: 971  KKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYI 1150
            KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPDIVA+Y Y+
Sbjct: 127  KKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDIVADYKYL 186

Query: 1151 FLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVH------ 1312
            FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQITAR R S+VH      
Sbjct: 187  FLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRVHSYDTIN 246

Query: 1313 -----RRIYKLSGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLD 1477
                 RR+YK  G+ RCDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLD
Sbjct: 247  PSRLNRRMYKFKGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLD 306

Query: 1478 MQLGYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALA 1657
            MQLGYCAQGDR +NVGVVD+EYIVH GL TLG   +N+ NS   + T + + S +   LA
Sbjct: 307  MQLGYCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNIT-RRQPSSDSETLA 365

Query: 1658 PSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1810
            PS S    NRP VRRQS+IE+++F+ RW+ AV +D+CW+DPYQQ   +S+H
Sbjct: 366  PSESHKVDNRPEVRRQSFIEMQMFRKRWENAVNQDKCWVDPYQQSVNKSTH 416


>ref|XP_002272495.2| PREDICTED: uncharacterized protein LOC100244499 [Vitis vinifera]
          Length = 466

 Score =  547 bits (1410), Expect = e-153
 Identities = 264/408 (64%), Positives = 315/408 (77%), Gaps = 8/408 (1%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            S LAD K RRSC+CS+ PT S+LCL+FFIGS  I  DY ++LSRWGM+  +  + +N C+
Sbjct: 61   SQLADSKSRRSCVCSIFPTASVLCLIFFIGSVLIGQDYSEKLSRWGMSTGMLNSVSNKCE 120

Query: 791  NECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMV 970
            N+CR  GSE LPKGIV  +S+L+MRPLWG             LLA+AVG+KQK+ VN+MV
Sbjct: 121  NQCRANGSEALPKGIVVTSSDLDMRPLWGFPKKRKDLKRN--LLAVAVGVKQKDLVNKMV 178

Query: 971  KKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYI 1150
            +KF S  F VMLFHYDGVVDEW+D +W DR +HV+AINQTKWWFAKRFLHP+IVAEYNYI
Sbjct: 179  EKFLSYGFVVMLFHYDGVVDEWKDFKWCDRVLHVAAINQTKWWFAKRFLHPEIVAEYNYI 238

Query: 1151 FLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKL 1330
            FLWDEDLGV +F+PRRY++ V+ EGL+ISQPALD +KSEVHHQIT RGRRS VHRRI+K 
Sbjct: 239  FLWDEDLGVTDFNPRRYVATVQREGLEISQPALDGSKSEVHHQITLRGRRSDVHRRIFKS 298

Query: 1331 SGA-KRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGD 1507
            SG+ K CD NSTAPPCTGW+E+MAPVFSREAWRC WYMIQNDLIHAWGLDMQLGYCAQGD
Sbjct: 299  SGSGKICDENSTAPPCTGWIEVMAPVFSREAWRCVWYMIQNDLIHAWGLDMQLGYCAQGD 358

Query: 1508 RTQNVGVVDSEYIVHQGLPTLGGFDDNK-------PNSQASSHTSQAKDSLNIGALAPSR 1666
            RT+NVGVVDS+YIVH GLPTLG  D +K        +S+    T+       I  L  S 
Sbjct: 359  RTKNVGVVDSDYIVHYGLPTLGANDPDKTTPPVQDDDSEPEKITTTTTAETPISKLPASS 418

Query: 1667 SRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1810
            +   + R  VRRQSYIE  IFK RW++AVKED+CW DPYQQ  ++++H
Sbjct: 419  TSPINFRVEVRRQSYIEYNIFKKRWRQAVKEDKCWKDPYQQFGEKNTH 466


>emb|CBI17649.3| unnamed protein product [Vitis vinifera]
          Length = 413

 Score =  547 bits (1410), Expect = e-153
 Identities = 264/408 (64%), Positives = 315/408 (77%), Gaps = 8/408 (1%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            S LAD K RRSC+CS+ PT S+LCL+FFIGS  I  DY ++LSRWGM+  +  + +N C+
Sbjct: 8    SQLADSKSRRSCVCSIFPTASVLCLIFFIGSVLIGQDYSEKLSRWGMSTGMLNSVSNKCE 67

Query: 791  NECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMV 970
            N+CR  GSE LPKGIV  +S+L+MRPLWG             LLA+AVG+KQK+ VN+MV
Sbjct: 68   NQCRANGSEALPKGIVVTSSDLDMRPLWGFPKKRKDLKRN--LLAVAVGVKQKDLVNKMV 125

Query: 971  KKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYI 1150
            +KF S  F VMLFHYDGVVDEW+D +W DR +HV+AINQTKWWFAKRFLHP+IVAEYNYI
Sbjct: 126  EKFLSYGFVVMLFHYDGVVDEWKDFKWCDRVLHVAAINQTKWWFAKRFLHPEIVAEYNYI 185

Query: 1151 FLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKL 1330
            FLWDEDLGV +F+PRRY++ V+ EGL+ISQPALD +KSEVHHQIT RGRRS VHRRI+K 
Sbjct: 186  FLWDEDLGVTDFNPRRYVATVQREGLEISQPALDGSKSEVHHQITLRGRRSDVHRRIFKS 245

Query: 1331 SGA-KRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGD 1507
            SG+ K CD NSTAPPCTGW+E+MAPVFSREAWRC WYMIQNDLIHAWGLDMQLGYCAQGD
Sbjct: 246  SGSGKICDENSTAPPCTGWIEVMAPVFSREAWRCVWYMIQNDLIHAWGLDMQLGYCAQGD 305

Query: 1508 RTQNVGVVDSEYIVHQGLPTLGGFDDNK-------PNSQASSHTSQAKDSLNIGALAPSR 1666
            RT+NVGVVDS+YIVH GLPTLG  D +K        +S+    T+       I  L  S 
Sbjct: 306  RTKNVGVVDSDYIVHYGLPTLGANDPDKTTPPVQDDDSEPEKITTTTTAETPISKLPASS 365

Query: 1667 SRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1810
            +   + R  VRRQSYIE  IFK RW++AVKED+CW DPYQQ  ++++H
Sbjct: 366  TSPINFRVEVRRQSYIEYNIFKKRWRQAVKEDKCWKDPYQQFGEKNTH 413


>ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255698 [Vitis vinifera]
            gi|297739491|emb|CBI29673.3| unnamed protein product
            [Vitis vinifera]
          Length = 413

 Score =  544 bits (1402), Expect = e-152
 Identities = 263/403 (65%), Positives = 308/403 (76%), Gaps = 7/403 (1%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMA-------DSIQK 769
            SL +DPK R S LCSL     L C V+FI S F   DYK R SRW ++       +SIQ 
Sbjct: 8    SLPSDPKSR-SYLCSLFIGACLFCGVYFIASEFTVKDYKDRSSRWQISVFQNAHSNSIQN 66

Query: 770  TKTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 949
            T+++ CKN+CRP GSE LP+GIV KTSNLE++PLWG             LLA+AVGIKQK
Sbjct: 67   TQSSKCKNQCRPSGSEALPEGIVVKTSNLEVQPLWGATLNGEKSSPSKSLLAMAVGIKQK 126

Query: 950  ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 1129
            E VN++V+KF  S+F VMLFHYDGVVDEWR+  WSD +IHV+ +NQTKWWFAKRFLHPDI
Sbjct: 127  EIVNQIVEKFILSNFVVMLFHYDGVVDEWREFAWSDHAIHVTVVNQTKWWFAKRFLHPDI 186

Query: 1130 VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKV 1309
            VAEYNYIFLWDEDLGVENFHP RY+SIV+DEGL+ISQPALDP KS VHHQITAR R S+V
Sbjct: 187  VAEYNYIFLWDEDLGVENFHPGRYVSIVEDEGLEISQPALDPKKSRVHHQITARVRNSRV 246

Query: 1310 HRRIYKLSGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 1489
            HRR YK  G+ RCD  STAPPC GWVEMMAPVFS+ AWRC W+MIQN+LIHAWG+DMQLG
Sbjct: 247  HRRTYKHRGSGRCDDQSTAPPCVGWVEMMAPVFSKAAWRCVWHMIQNELIHAWGVDMQLG 306

Query: 1490 YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRS 1669
            YCAQGDRT+NVGVVDSEY+VH  LPTLG  D+N+   +   H+S  +      ALA S  
Sbjct: 307  YCAQGDRTKNVGVVDSEYVVHLALPTLGVLDENELRGEGHDHSSLREKLPKSVALAQSEF 366

Query: 1670 RVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAK 1798
                NR AVRRQS+IE++IF++RW  AVKED+CW+DPY QPA+
Sbjct: 367  HKVDNRSAVRRQSFIEMQIFRSRWANAVKEDKCWIDPYAQPAE 409


>ref|XP_002512624.1| conserved hypothetical protein [Ricinus communis]
            gi|223548585|gb|EEF50076.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 389

 Score =  538 bits (1387), Expect = e-150
 Identities = 261/396 (65%), Positives = 299/396 (75%)
 Frame = +2

Query: 617  LADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCKNE 796
            L D K RRSCLCS+LPT SLL LVFFIGS F+  DYK+++SRW + DS Q  K  TCKN 
Sbjct: 9    LPDSKSRRSCLCSILPTASLLFLVFFIGSTFVIPDYKEKISRWKIVDSFQSLKFATCKNR 68

Query: 797  CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMVKK 976
            C+P+GSE LP+GIVSKTSNL+MRPLWG             L  +AVGIKQ++ V++MVKK
Sbjct: 69   CKPHGSEALPEGIVSKTSNLQMRPLWGFPENDETSSIN--LFTLAVGIKQRDIVDKMVKK 126

Query: 977  FPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYIFL 1156
            F SS F+VMLFHYDGVVDEW D EW D+ IH+SA NQTKWWFAKRFLHPDIVAEY+YIFL
Sbjct: 127  FLSSKFSVMLFHYDGVVDEWNDYEWKDQVIHISAHNQTKWWFAKRFLHPDIVAEYSYIFL 186

Query: 1157 WDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKLSG 1336
            WDEDLGVENF P++YLSIVK +GL+ISQPALDP KS +H QITAR RRS VH R +K   
Sbjct: 187  WDEDLGVENFDPQQYLSIVKSKGLEISQPALDPGKSAIHQQITARLRRSIVHSRTFK--- 243

Query: 1337 AKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTQ 1516
               CDGNSTAPPCTGWVEMMAPVFSR AWRC WYMIQNDLIHAWGLD QLGYCAQGDR +
Sbjct: 244  PGTCDGNSTAPPCTGWVEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDYQLGYCAQGDRVK 303

Query: 1517 NVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRSRVHSNRPAV 1696
            N+GVVD+EYIVH G PTLGG  ++K                      PSRS     R  V
Sbjct: 304  NIGVVDAEYIVHYGRPTLGGTGESK---------------------EPSRSNKKDPRLEV 342

Query: 1697 RRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQS 1804
            RRQS++E +IF+ RW+KA KED+CW+DPY+Q  KQS
Sbjct: 343  RRQSFVEFKIFQKRWEKAAKEDKCWIDPYEQAEKQS 378


>ref|XP_004142563.1| PREDICTED: uncharacterized protein LOC101221459 [Cucumis sativus]
          Length = 388

 Score =  536 bits (1380), Expect = e-149
 Identities = 253/395 (64%), Positives = 302/395 (76%), Gaps = 3/395 (0%)
 Frame = +2

Query: 614  LLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCKN 793
            LLA+ K R SCLCS LPT SLLCL  F+GS ++A DY++++SRWG+ D +  +K N C+ 
Sbjct: 9    LLAEQKSRNSCLCSFLPTASLLCLALFVGSVYVAPDYREKISRWGI-DGLVGSKFNKCEK 67

Query: 794  ECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXX--LLAIAVGIKQKENVNEM 967
            +CRP GSE LPK IV   SNLEMRPLWG               + A+AVGIKQK+ VN+M
Sbjct: 68   QCRPNGSEPLPKDIVVTASNLEMRPLWGASKRSYQNPVNSSSNIFAMAVGIKQKDLVNKM 127

Query: 968  VKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNY 1147
            V KF SSDFAVMLFHYDG+VDEW+   WS+R IHV+A+NQTKWWFAKRFLHPDIV EYNY
Sbjct: 128  VTKFLSSDFAVMLFHYDGIVDEWKGFNWSNRVIHVTAVNQTKWWFAKRFLHPDIVEEYNY 187

Query: 1148 IFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYK 1327
            +FLWDEDLGV+NF+P+ Y+ I++ EGL+ISQPALDP KSEVHHQITARGRRS VHRR ++
Sbjct: 188  VFLWDEDLGVDNFNPKLYVDIIQSEGLEISQPALDPYKSEVHHQITARGRRSTVHRRTFR 247

Query: 1328 LS-GAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQG 1504
             S G K CD NSTAPPCTGW+EMMAPVFSR AWRC WYMIQNDLIHAWGLDMQLGYCAQG
Sbjct: 248  PSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLGYCAQG 307

Query: 1505 DRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRSRVHSN 1684
            DRT+NVGVVDSEY++H G PTLGG ++N+ +                     S+S V  +
Sbjct: 308  DRTKNVGVVDSEYVIHYGRPTLGGPEENETS---------------------SKSHVKDH 346

Query: 1685 RPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQ 1789
            R  VRRQSYIEL++F+ RW+KA ++DECW DPY +
Sbjct: 347  RADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPE 381


>ref|XP_007029398.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508718003|gb|EOY09900.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 389

 Score =  531 bits (1369), Expect = e-148
 Identities = 248/392 (63%), Positives = 303/392 (77%), Gaps = 1/392 (0%)
 Frame = +2

Query: 614  LLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCKN 793
            L+ + K   SCLC L+P  +LLC+++FIGS+F+A + K++   WG+AD +Q +K   CKN
Sbjct: 11   LVTERKSWSSCLCRLIPATALLCVIYFIGSSFVAPENKEKAFTWGVADILQTSKVENCKN 70

Query: 794  ECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMVK 973
            +CRP GSE LP+GI++KTSNL++RPLWG             L A+AVGIKQK+ V+EMVK
Sbjct: 71   QCRPPGSEPLPEGIITKTSNLQLRPLWGFPKKDDTSSS---LFAVAVGIKQKDLVHEMVK 127

Query: 974  KFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYIF 1153
            KF SS FAVMLFHYDG+VDEW+  EW+D+ IHVSA NQTKWWFAKRFLHPD+V+EY+YIF
Sbjct: 128  KFLSSGFAVMLFHYDGIVDEWKSFEWNDQVIHVSARNQTKWWFAKRFLHPDVVSEYSYIF 187

Query: 1154 LWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYK-L 1330
            LWDEDLGVE+FHP++Y+SIV+ E L+ISQPALDPAKSEVHHQITARGR+S VHRR +K  
Sbjct: 188  LWDEDLGVEDFHPKKYVSIVESERLEISQPALDPAKSEVHHQITARGRKSMVHRRTFKHR 247

Query: 1331 SGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 1510
            +  + CDG S APPCTGW+EMMAPVFSR AWRC WYMIQNDLIHAWGLDMQLGYCAQGDR
Sbjct: 248  ANGRSCDGQSKAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLGYCAQGDR 307

Query: 1511 TQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRSRVHSNRP 1690
            T+N+GVVD+EYIVH   PTLGG  +   ++    H ++            S S     R 
Sbjct: 308  TKNIGVVDAEYIVHYNRPTLGGTAEKNHSTVEGGHRNKKS----------SHSHWKDPRV 357

Query: 1691 AVRRQSYIELEIFKNRWKKAVKEDECWMDPYQ 1786
             VRRQSYIEL+IF+ RW+KAVK D+CW+DPYQ
Sbjct: 358  EVRRQSYIELDIFRKRWEKAVKNDKCWVDPYQ 389


>ref|XP_007222756.1| hypothetical protein PRUPE_ppa006529mg [Prunus persica]
            gi|462419692|gb|EMJ23955.1| hypothetical protein
            PRUPE_ppa006529mg [Prunus persica]
          Length = 407

 Score =  530 bits (1366), Expect = e-147
 Identities = 257/399 (64%), Positives = 304/399 (76%), Gaps = 4/399 (1%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            S L DPK R S  CSL    SL+C  +FIG A IA +YK+RL+RW +  + Q TK +TCK
Sbjct: 9    SALPDPKNR-SFYCSLFIVASLICGAYFIGGASIAKEYKERLTRWKVIYTRQNTKFDTCK 67

Query: 791  NECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMV 970
            N C+P GSE LP+GIV+KTS+LE+RPLWG             LLAIAVGIKQKE V+ +V
Sbjct: 68   NRCQPLGSEALPEGIVAKTSDLEVRPLWGSSVNNENSKPSMSLLAIAVGIKQKEIVDRIV 127

Query: 971  KKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYI 1150
            KKF SSDF VMLFHYDG VD+WRDL WSDR+IHVS +NQTKWWFAKRFLHPDIV+EY YI
Sbjct: 128  KKFLSSDFVVMLFHYDGAVDKWRDLNWSDRAIHVSVMNQTKWWFAKRFLHPDIVSEYEYI 187

Query: 1151 FLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKL 1330
            FLWDEDLGVENF P+RYLSIV++EGL+ISQPALDP KS+V+H ITAR ++ KVHRR YK 
Sbjct: 188  FLWDEDLGVENFDPKRYLSIVREEGLEISQPALDPDKSDVYHPITARVKKLKVHRRFYKF 247

Query: 1331 SGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 1510
             G+ RCD +S+APPC GWVEMMAPVFS+ AW+C WYMIQNDLIHAWGLD+QLGYCAQGDR
Sbjct: 248  KGSGRCDNHSSAPPCAGWVEMMAPVFSKAAWQCVWYMIQNDLIHAWGLDVQLGYCAQGDR 307

Query: 1511 TQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGAL----APSRSRVH 1678
            T+NVGVVDSEYIVH GLPTLG  D NK     +         +++       APS S   
Sbjct: 308  TKNVGVVDSEYIVHLGLPTLGVSDGNKAIMLKTRLDFYCLSPIHLSLCNIISAPSASDKV 367

Query: 1679 SNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPA 1795
            ++R  VR QS+I+++IFK RW  AVKED+CW+DP+Q  A
Sbjct: 368  NDRAKVRMQSFIDMQIFKERWSNAVKEDKCWVDPFQLSA 406


>ref|XP_006431495.1| hypothetical protein CICLE_v10001347mg [Citrus clementina]
            gi|557533617|gb|ESR44735.1| hypothetical protein
            CICLE_v10001347mg [Citrus clementina]
          Length = 358

 Score =  528 bits (1361), Expect = e-147
 Identities = 255/362 (70%), Positives = 288/362 (79%), Gaps = 5/362 (1%)
 Frame = +2

Query: 740  RWGMADSIQKTKTNTCKNE-CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXX 916
            RWG+  S+   K  TCKN+ CR  G+E LP+GIVSKTSNLEMRPLW              
Sbjct: 2    RWGLVHSMYSAKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMN 61

Query: 917  LLAIAVGIKQKENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKW 1096
            LLAIA GIKQK+ V+++V+KFPS DF VMLFHYD VVDEW+DL W+DR+IHVSA NQTKW
Sbjct: 62   LLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDSVVDEWKDLVWADRAIHVSAANQTKW 121

Query: 1097 WFAKRFLHPDIVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHH 1276
            WFAKRFLHPDIVAEYNYIFLWDED+GVENF+PRRYLSIVKDEG +ISQPALDP KSEVHH
Sbjct: 122  WFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGFEISQPALDPVKSEVHH 181

Query: 1277 QITARGRRSKVHRRIYKLSGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDL 1456
             ITAR R SK HRR+YK  G+ RCD  STAPPC GWVEMMAPVFSR AWRCAWYMIQNDL
Sbjct: 182  PITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDL 241

Query: 1457 IHAWGLDMQLGYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDS 1636
            IHAWGLD+QLGYCAQGDRT+NVGVVDSEYIVH GLPTLG   + + N+       QA D 
Sbjct: 242  IHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNA-----VGQASDD 296

Query: 1637 L----NIGALAPSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQS 1804
            L    N  ALAPS+SR + NRP VRRQSYIE++IF+NRWK AV++D+CW+DPY Q   Q+
Sbjct: 297  LEQIANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPYGQSTNQT 356

Query: 1805 SH 1810
            SH
Sbjct: 357  SH 358


>ref|XP_004301559.1| PREDICTED: uncharacterized protein LOC101306243 [Fragaria vesca
            subsp. vesca]
          Length = 397

 Score =  526 bits (1354), Expect = e-146
 Identities = 253/392 (64%), Positives = 299/392 (76%), Gaps = 1/392 (0%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            S+L+DPK R S  CSL   VSL+   +FIG A IA +YK++L+RW +  ++Q T  +TCK
Sbjct: 13   SVLSDPKNR-SFYCSLFIVVSLVTGAYFIGGASIAKEYKEKLTRWKVTYTMQNTNLDTCK 71

Query: 791  NECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMV 970
              C+P G+E LP+GIV+KTS+ ++RPLWG             LLAIAVGIKQKE V+++V
Sbjct: 72   KRCQPSGTEALPEGIVAKTSDFKIRPLWGTSKKDKNSTPSKSLLAIAVGIKQKEIVDKIV 131

Query: 971  KKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYI 1150
            +KF SSDF VMLFHYDG VD+WRDL WSD +IHVS +NQTKWWFAKRFLHPDIV EY +I
Sbjct: 132  RKFLSSDFVVMLFHYDGAVDKWRDLHWSDTAIHVSVMNQTKWWFAKRFLHPDIVTEYKHI 191

Query: 1151 FLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKL 1330
            FLWDEDLGVENF P RYLS++ DEGL+ISQPALDP KSEV+H ITAR ++SKVHRR YK 
Sbjct: 192  FLWDEDLGVENFDPERYLSVIWDEGLEISQPALDPVKSEVYHPITARVKKSKVHRRFYKF 251

Query: 1331 SGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 1510
             G+ RCD  S+ PPC GWVEMMAPVFSR AWRC WYMIQNDL+HAWGLD QLGYCAQGDR
Sbjct: 252  KGSGRCDDQSSGPPCIGWVEMMAPVFSRAAWRCVWYMIQNDLVHAWGLDEQLGYCAQGDR 311

Query: 1511 TQNVGVVDSEYIVHQGLPTLGGFDDNKP-NSQASSHTSQAKDSLNIGALAPSRSRVHSNR 1687
             +NVGVVDSEYIVH GLPTLG  DDNK  N+   S    +K      ALAPS   + S+R
Sbjct: 312  MKNVGVVDSEYIVHLGLPTLGVTDDNKGINNMVHSQKEDSK------ALAPSGPPIPSDR 365

Query: 1688 PAVRRQSYIELEIFKNRWKKAVKEDECWMDPY 1783
              VR QS+I++ IFK RW+ AVKED CW+DPY
Sbjct: 366  AKVRMQSFIDMRIFKERWRSAVKEDNCWVDPY 397


>ref|XP_002325467.2| hypothetical protein POPTR_0019s08010g [Populus trichocarpa]
            gi|550316990|gb|EEE99848.2| hypothetical protein
            POPTR_0019s08010g [Populus trichocarpa]
          Length = 381

 Score =  522 bits (1345), Expect = e-145
 Identities = 260/399 (65%), Positives = 300/399 (75%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            SL +D K RRS  CS+ P  S L L+FF   AFIA DYK+RLSRWG+AD+ Q  K + CK
Sbjct: 10   SLPSDSKSRRSHWCSVFPAASFLFLIFFAVYAFIAPDYKERLSRWGIADTFQNFKFSNCK 69

Query: 791  NECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMV 970
            N+CRP GSE+LP+GIVSKTSN +MRPLWG             LLA+AVGI Q++ VN+MV
Sbjct: 70   NQCRPPGSESLPEGIVSKTSNFQMRPLWGFPKNDENSSIN--LLAVAVGITQRDLVNKMV 127

Query: 971  KKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYI 1150
            KKF SS+F+VMLFHYDG+VDEWRD EW+DR IHVSA NQTKWWFAKRFLHPDIVA  NYI
Sbjct: 128  KKFLSSNFSVMLFHYDGIVDEWRDFEWNDRVIHVSARNQTKWWFAKRFLHPDIVAACNYI 187

Query: 1151 FLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKL 1330
            FLWDEDLGVENF+P++Y+SIVK EGL ISQPALD  KS VH QIT R  +S VHRR YK 
Sbjct: 188  FLWDEDLGVENFNPKQYVSIVKSEGLHISQPALD-YKSLVHQQITVRASKSGVHRRTYK- 245

Query: 1331 SGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 1510
                 CDGNSTAPPCTGWVEMMAPVFSR AWRC WYMIQNDLIHAWGLD QLGYC+QGDR
Sbjct: 246  --PGICDGNSTAPPCTGWVEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDYQLGYCSQGDR 303

Query: 1511 TQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRSRVHSNRP 1690
            T+N+G+VD+EYIVH G PTLGG  +N+                      PSRS+    R 
Sbjct: 304  TKNIGIVDAEYIVHYGHPTLGGVVENE---------------------EPSRSQKTDPRL 342

Query: 1691 AVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSS 1807
             VRRQS IEL IF+ RWK+AV+ED+CW+DPY++  K+SS
Sbjct: 343  EVRRQSLIELRIFQKRWKEAVEEDQCWIDPYKEAVKESS 381


>gb|EYU46099.1| hypothetical protein MIMGU_mgv1a007624mg [Mimulus guttatus]
          Length = 401

 Score =  515 bits (1327), Expect = e-143
 Identities = 252/397 (63%), Positives = 301/397 (75%), Gaps = 1/397 (0%)
 Frame = +2

Query: 617  LADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCKNE 796
            + +PKR+RS + S LP+  LL  VFFIGSAF+ TDYK+R         I+ TK+ TC+ E
Sbjct: 6    MPEPKRKRSFMWSCLPSAILLSAVFFIGSAFLVTDYKERFLGACNLYPIKATKSKTCEYE 65

Query: 797  CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMVKK 976
            CRP G+ETLP+GIVS+T+++EMRPL G             LL IAVGIKQK+NVNE+VKK
Sbjct: 66   CRPNGTETLPRGIVSRTTDMEMRPLSGPPKKKKLKSPMN-LLGIAVGIKQKQNVNEIVKK 124

Query: 977  FPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYIFL 1156
            FP +DFAVMLFHYDG V+ WRDLEWS+  +HVSAINQTKWWFAKRFLHPD+VA+Y+YIFL
Sbjct: 125  FPLTDFAVMLFHYDGNVNGWRDLEWSNSVVHVSAINQTKWWFAKRFLHPDVVAQYDYIFL 184

Query: 1157 WDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKLSG 1336
            WDEDLGVENFH  RYLSIVK+EGLQISQPA+D  KSEVH+++T R   SKVHRR   L G
Sbjct: 185  WDEDLGVENFHAGRYLSIVKEEGLQISQPAIDAEKSEVHYKLTEREISSKVHRRAINLHG 244

Query: 1337 -AKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRT 1513
              +RC  NS  PPCTG+VEMMAPVFSR +WRCAW+MIQNDL+HAWGLD QLGYCAQG+RT
Sbjct: 245  PGRRCYENSMEPPCTGFVEMMAPVFSRVSWRCAWHMIQNDLVHAWGLDFQLGYCAQGNRT 304

Query: 1514 QNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRSRVHSNRPA 1693
             N+G+VDSEY++H GLPTLGG    K N +    +S  K   N G    S       R A
Sbjct: 305  TNIGIVDSEYLIHLGLPTLGGSSGTKINDEVEKQSSPDKILPNAGKTEISAVEPSDERNA 364

Query: 1694 VRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQS 1804
            VRR+S+IEL+ FKNRWKKAV+EDECW+DP Q P +Q+
Sbjct: 365  VRRESFIELDDFKNRWKKAVREDECWVDPLQTPPQQN 401


>ref|XP_007041144.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
            gi|508705079|gb|EOX96975.1| Uncharacterized protein
            isoform 3, partial [Theobroma cacao]
          Length = 438

 Score =  513 bits (1320), Expect = e-142
 Identities = 244/350 (69%), Positives = 281/350 (80%), Gaps = 2/350 (0%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            S+++DPK R SCLC L    SL+C  +FI  AFIA +YK RLSRW + + +Q +K+N CK
Sbjct: 90   SVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQNSKSNICK 148

Query: 791  NECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMV 970
              CRP GSE LP+GIV KTSNLEMRPLW              LLAIAVGIKQKE VN+++
Sbjct: 149  IRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQKEIVNQII 208

Query: 971  KKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYI 1150
            KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPDIVA+Y Y+
Sbjct: 209  KKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDIVADYKYL 268

Query: 1151 FLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKL 1330
            FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQITAR R S+VHRR+YK 
Sbjct: 269  FLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRVHRRMYKF 328

Query: 1331 SGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 1510
             G+ RCDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR
Sbjct: 329  KGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 388

Query: 1511 TQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQ--AKDSLNIGAL 1654
             +NVGVVD+EYIVH GL TLG   +N+ NS   + T +  + DS  +G +
Sbjct: 389  MKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNITRRQPSSDSETLGTI 438


>ref|XP_006385071.1| hypothetical protein POPTR_0004s23630g [Populus trichocarpa]
            gi|550341839|gb|ERP62868.1| hypothetical protein
            POPTR_0004s23630g [Populus trichocarpa]
          Length = 383

 Score =  512 bits (1318), Expect = e-142
 Identities = 249/397 (62%), Positives = 298/397 (75%)
 Frame = +2

Query: 620  ADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCKNEC 799
            +DPKR  S LCSLL  +SL+C V+F+GSAF    YK+R++ WG+ +++Q +  + CK+ C
Sbjct: 11   SDPKRG-SYLCSLLIALSLICSVYFVGSAFFGKQYKERITAWGVIEAMQTS--DICKDRC 67

Query: 800  RPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMVKKF 979
            RP GSE LP+GIV+K SN +MRPLWG             LLAIAVGIKQK  VN++V+KF
Sbjct: 68   RPSGSEALPQGIVTKKSNYKMRPLWGSSLKNDNPPPSMSLLAIAVGIKQKAIVNQIVEKF 127

Query: 980  PSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYIFLW 1159
            P SDF VMLFHYDGVVDEWRDL WS+ +IHVSA+NQTKWWFAKRFLHPDIV+EYNYIFLW
Sbjct: 128  PLSDFVVMLFHYDGVVDEWRDLSWSNSAIHVSAVNQTKWWFAKRFLHPDIVSEYNYIFLW 187

Query: 1160 DEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKLSGA 1339
            DEDLGVENF+PRRYLSIVKDEGL++SQPALDP++S VHHQITAR R S VHR+I K  G 
Sbjct: 188  DEDLGVENFNPRRYLSIVKDEGLEVSQPALDPSRSTVHHQITARIRNSIVHRKILKFRGN 247

Query: 1340 KRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRTQN 1519
             +C GNST+PPCTGWVEMMAPVFS+ AW+C WYMIQNDLIHAWGLD +LGYCAQGD T+N
Sbjct: 248  TKCYGNSTSPPCTGWVEMMAPVFSKAAWQCTWYMIQNDLIHAWGLDRKLGYCAQGDWTKN 307

Query: 1520 VGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRSRVHSNRPAVR 1699
            VGVVD+EYIVH GL TLG F     N   +S +    D + +                VR
Sbjct: 308  VGVVDAEYIVHLGLSTLGVF-----NGSEASISYVPYDRIIV----------------VR 346

Query: 1700 RQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1810
             QS +E+ IF  RW+ A+KED CW+DPYQ  + Q+ H
Sbjct: 347  TQSSVEMNIFHERWEAAIKEDRCWVDPYQLISNQTRH 383


>ref|XP_006602841.1| PREDICTED: uncharacterized protein LOC100798633 [Glycine max]
          Length = 383

 Score =  511 bits (1317), Expect = e-142
 Identities = 248/399 (62%), Positives = 294/399 (73%), Gaps = 4/399 (1%)
 Frame = +2

Query: 602  FQFSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTN 781
            +Q +L +D K +R C C ++P VSLLC+V    + F A  Y ++L RW M  +I+K K +
Sbjct: 10   YQDNLDSDSKSKRLCFCGIVPAVSLLCVVLLFSTLFFAQRYNEKLLRWKM--NIKKLKND 67

Query: 782  TCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXX---LLAIAVGIKQKE 952
             CKN+CR  GS  LP+GI+S TS+LEMR LW                 L A+AVGIKQK+
Sbjct: 68   NCKNQCRTGGSHALPEGIISNTSDLEMRHLWDLPMTKTIENKENASTNLFAMAVGIKQKD 127

Query: 953  NVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIV 1132
             VN++VKKF SS+F VMLFHYDG+VDEW D EW++  IHV+  NQ+KWWFAKRFLHPDIV
Sbjct: 128  LVNKLVKKFLSSNFVVMLFHYDGIVDEWNDFEWNNHVIHVAVANQSKWWFAKRFLHPDIV 187

Query: 1133 AEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVH 1312
            AEY YIFLWDEDLGVE+FHP RY+SI+K EGL+ISQPALD  KSEVHHQITARGRRS VH
Sbjct: 188  AEYGYIFLWDEDLGVEHFHPDRYVSIIKSEGLEISQPALDSNKSEVHHQITARGRRSNVH 247

Query: 1313 RRIYKLSGA-KRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 1489
            RRIYK  G+ KRCD +STAPPCTGWVEMMAPVFSR AWRC WYMIQNDLIHAWGLDMQLG
Sbjct: 248  RRIYKTGGSGKRCDESSTAPPCTGWVEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQLG 307

Query: 1490 YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRS 1669
            YCAQGDRT+NVGVVD+EYIVH G PTLGG D N+ +S+   H                  
Sbjct: 308  YCAQGDRTKNVGVVDAEYIVHYGHPTLGGLDVNEVSSRTKDH------------------ 349

Query: 1670 RVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQ 1786
                 R  VRR SY EL++F+ RW+KAV+ED+CW+DP+Q
Sbjct: 350  -----RVDVRRLSYRELQVFRKRWQKAVEEDDCWVDPFQ 383


>ref|XP_007041145.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508705080|gb|EOX96976.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 374

 Score =  511 bits (1317), Expect = e-142
 Identities = 242/342 (70%), Positives = 276/342 (80%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            S+++DPK R SCLC L    SL+C  +FI  AFIA +YK RLSRW + + +Q +K+N CK
Sbjct: 8    SVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQNSKSNICK 66

Query: 791  NECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMV 970
              CRP GSE LP+GIV KTSNLEMRPLW              LLAIAVGIKQKE VN+++
Sbjct: 67   IRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQKEIVNQII 126

Query: 971  KKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYI 1150
            KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPDIVA+Y Y+
Sbjct: 127  KKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDIVADYKYL 186

Query: 1151 FLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKL 1330
            FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQITAR R S+VHRR+YK 
Sbjct: 187  FLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRVHRRMYKF 246

Query: 1331 SGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 1510
             G+ RCDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR
Sbjct: 247  KGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 306

Query: 1511 TQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDS 1636
             +NVGVVD+EYIVH GL TLG   +N+ NS   + T +   S
Sbjct: 307  MKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNITRRQPSS 348


>ref|XP_007049550.1| Uncharacterized protein TCM_002621 [Theobroma cacao]
            gi|508701811|gb|EOX93707.1| Uncharacterized protein
            TCM_002621 [Theobroma cacao]
          Length = 385

 Score =  511 bits (1317), Expect = e-142
 Identities = 243/397 (61%), Positives = 297/397 (74%)
 Frame = +2

Query: 611  SLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFIATDYKQRLSRWGMADSIQKTKTNTCK 790
            SL A+P+R+R      LP + LL   FFIGSAFI TDYK+R+  W     +Q  +   C+
Sbjct: 10   SLKAEPRRQRLFTHRFLPMILLLSAAFFIGSAFIITDYKERILGWRSVIVLQYKRPKICE 69

Query: 791  NECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMV 970
             +CR YGSE LPKGI+S+TS+LEMRPLWG             LLAIAVGIKQKE+VN++V
Sbjct: 70   TQCRAYGSEALPKGIISETSDLEMRPLWGLQNKKKPKLSMN-LLAIAVGIKQKESVNKIV 128

Query: 971  KKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYI 1150
            KKFP+SDF VMLFHYDG+VD+W+DLEW+D +IHVSA+NQTKWWFAKRFLHPDIV+EY+YI
Sbjct: 129  KKFPASDFVVMLFHYDGIVDQWKDLEWNDLAIHVSAVNQTKWWFAKRFLHPDIVSEYSYI 188

Query: 1151 FLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITARGRRSKVHRRIYKL 1330
            FLWDEDLGV++F+  RYLSI+K EGL+ISQPALD  KSE+HH ITAR ++S VHRR Y++
Sbjct: 189  FLWDEDLGVDHFNAARYLSIIKKEGLEISQPALDVEKSELHHPITARDKKSTVHRRTYEV 248

Query: 1331 SGAKRCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDR 1510
             G  RC+ NST PPCTG+VEMMAPVFSR +WRCAW+MIQ+DL++ WG+D QLGYCAQGDR
Sbjct: 249  IGRTRCNENSTGPPCTGFVEMMAPVFSRASWRCAWHMIQSDLVYGWGVDFQLGYCAQGDR 308

Query: 1511 TQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRSRVHSNRP 1690
            TQ +G+VDSEY+VH  LPTLGG   N+                      PS S     R 
Sbjct: 309  TQKIGIVDSEYLVHNALPTLGGVAANE---------------------VPSPSSEPGGRS 347

Query: 1691 AVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQ 1801
             VR+QS+IELEIFKNRWK+AVK+D+CW DPY+   K+
Sbjct: 348  EVRKQSFIELEIFKNRWKRAVKQDKCWFDPYEPSTKK 384

