BLASTX nr result

ID: Akebia24_contig00005903 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00005903
         (1283 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006470988.1| PREDICTED: uncharacterized protein LOC102624...   562   e-157
ref|XP_006431494.1| hypothetical protein CICLE_v10001347mg [Citr...   559   e-156
ref|XP_007041142.1| Uncharacterized protein isoform 1 [Theobroma...   558   e-156
ref|XP_007041143.1| Uncharacterized protein isoform 2 [Theobroma...   550   e-154
emb|CBI17649.3| unnamed protein product [Vitis vinifera]              535   e-149
ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255...   529   e-147
ref|XP_002272495.2| PREDICTED: uncharacterized protein LOC100244...   528   e-147
ref|XP_002512624.1| conserved hypothetical protein [Ricinus comm...   526   e-147
ref|XP_004142563.1| PREDICTED: uncharacterized protein LOC101221...   520   e-145
ref|XP_007029398.1| Uncharacterized protein isoform 1 [Theobroma...   516   e-144
ref|XP_007222756.1| hypothetical protein PRUPE_ppa006529mg [Prun...   511   e-142
ref|XP_006431495.1| hypothetical protein CICLE_v10001347mg [Citr...   509   e-142
ref|XP_002325467.2| hypothetical protein POPTR_0019s08010g [Popu...   506   e-141
ref|XP_004301559.1| PREDICTED: uncharacterized protein LOC101306...   504   e-140
ref|XP_006385071.1| hypothetical protein POPTR_0004s23630g [Popu...   503   e-140
ref|XP_007049550.1| Uncharacterized protein TCM_002621 [Theobrom...   499   e-138
ref|XP_007041144.1| Uncharacterized protein isoform 3, partial [...   497   e-138
gb|EYU46099.1| hypothetical protein MIMGU_mgv1a007624mg [Mimulus...   496   e-137
ref|XP_007041145.1| Uncharacterized protein isoform 4 [Theobroma...   494   e-137
gb|EXC29927.1| hypothetical protein L484_015120 [Morus notabilis]     493   e-137

>ref|XP_006470988.1| PREDICTED: uncharacterized protein LOC102624954 [Citrus sinensis]
          Length = 407

 Score =  562 bits (1448), Expect = e-157
 Identities = 274/412 (66%), Positives = 321/412 (77%), Gaps = 5/412 (1%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MK +N +S+L+DP  R SCLCSL    +L+C V+FIGS+F++ + KER+ RWGLV S+ +
Sbjct: 2    MKATNSISVLSDPPSR-SCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYS 60

Query: 192  RQTNTCKNE-CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQ 368
             +  TCKN+ CR  G+E LP+GIVSKTSNLEMRPLW              LLAIA GIKQ
Sbjct: 61   AKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQ 120

Query: 369  KENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPD 548
            K+ V+++V+KFPS DF VMLFHYDGVVDEW+DL W+DR+IHVSA NQTKWWFAKRFLHPD
Sbjct: 121  KKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPD 180

Query: 549  IVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXX 728
            IVAEYNYIFLWDED+GVENF+PRRYLSIVKDEGL+ISQPALDP KSEVHH IT+      
Sbjct: 181  IVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSK 240

Query: 729  XXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 908
                +YK  G  +CD  STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLD+QL
Sbjct: 241  AHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQL 300

Query: 909  GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSL----NIGAL 1076
            GYCAQGDRT+NVGVVDSEYIVH GLPTLG   + + N+       QA D L    N  AL
Sbjct: 301  GYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNT-----VGQASDDLEQIANPVAL 355

Query: 1077 APSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1232
            APS+SR + NRP VRRQSYIE++IF+NRWK AV++D+CW+DPY Q   Q+SH
Sbjct: 356  APSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPYGQSTNQTSH 407


>ref|XP_006431494.1| hypothetical protein CICLE_v10001347mg [Citrus clementina]
            gi|557533616|gb|ESR44734.1| hypothetical protein
            CICLE_v10001347mg [Citrus clementina]
          Length = 407

 Score =  559 bits (1440), Expect = e-156
 Identities = 271/409 (66%), Positives = 318/409 (77%), Gaps = 2/409 (0%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MK +N +S+L+DP  R SCLCSL    +L+C V+FIGS+F++ + KER+ RWGLV S+ +
Sbjct: 2    MKTTNSISVLSDPPSR-SCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYS 60

Query: 192  RQTNTCKNE-CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQ 368
             +  TCKN+ CR  G+E LP+GIVSKTSNLEMRPLW              LLAIA GIKQ
Sbjct: 61   AKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQ 120

Query: 369  KENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPD 548
            K+ V+++V+KFPS DF VMLFHYD VVDEW+DL W+DR+IHVSA NQTKWWFAKRFLHPD
Sbjct: 121  KKIVDQIVRKFPSKDFVVMLFHYDSVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPD 180

Query: 549  IVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXX 728
            IVAEYNYIFLWDED+GVENF+PRRYLSIVKDEG +ISQPALDP KSEVHH IT+      
Sbjct: 181  IVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGFEISQPALDPVKSEVHHPITARRRNSK 240

Query: 729  XXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 908
                +YK  G  +CD  STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLD+QL
Sbjct: 241  AHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQL 300

Query: 909  GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSL-NIGALAPS 1085
            GYCAQGDRT+NVGVVDSEYIVH GLPTLG     +P   A G  S   + + N  ALAPS
Sbjct: 301  GYCAQGDRTKNVGVVDSEYIVHLGLPTLG--VTTEPELNAVGQASDDLEQIANPVALAPS 358

Query: 1086 RSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1232
            +SR + NRP VRRQSYIE++IF+NRWK AV++D+CW+DPY Q   Q+SH
Sbjct: 359  QSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPYGQSTNQTSH 407


>ref|XP_007041142.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508705077|gb|EOX96973.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 405

 Score =  558 bits (1438), Expect = e-156
 Identities = 264/407 (64%), Positives = 314/407 (77%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MK  N  S+++DPK R SCLC L    SL+C  +FI  AFI+ +YK+R+SRW +++ +QN
Sbjct: 1    MKAFNCASVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQN 59

Query: 192  RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 371
             ++N CK  CRP GSE LP+GIV KTSNLEMRPLW              LLAIAVGIKQK
Sbjct: 60   SKSNICKIRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQK 119

Query: 372  ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 551
            E VN+++KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPDI
Sbjct: 120  EIVNQIIKKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDI 179

Query: 552  VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 731
            VA+Y Y+FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQIT+       
Sbjct: 180  VADYKYLFLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRV 239

Query: 732  XXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 911
               +YK  G  +CDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLDMQLG
Sbjct: 240  HRRMYKFKGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLG 299

Query: 912  YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGALAPSRS 1091
            YCAQGDR +NVGVVD+EYIVH GL TLG   +N+ NS     T + + S +   LAPS S
Sbjct: 300  YCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNIT-RRQPSSDSETLAPSES 358

Query: 1092 RVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1232
                NRP VRRQS+IE+++F+ RW+ AV +D+CW+DPYQQ   +S+H
Sbjct: 359  HKVDNRPEVRRQSFIEMQMFRKRWENAVNQDKCWVDPYQQSVNKSTH 405


>ref|XP_007041143.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508705078|gb|EOX96974.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 416

 Score =  550 bits (1416), Expect = e-154
 Identities = 264/418 (63%), Positives = 314/418 (75%), Gaps = 11/418 (2%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MK  N  S+++DPK R SCLC L    SL+C  +FI  AFI+ +YK+R+SRW +++ +QN
Sbjct: 1    MKAFNCASVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQN 59

Query: 192  RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 371
             ++N CK  CRP GSE LP+GIV KTSNLEMRPLW              LLAIAVGIKQK
Sbjct: 60   SKSNICKIRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQK 119

Query: 372  ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 551
            E VN+++KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPDI
Sbjct: 120  EIVNQIIKKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDI 179

Query: 552  VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITS------- 710
            VA+Y Y+FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQIT+       
Sbjct: 180  VADYKYLFLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRV 239

Query: 711  ----XXXXXXXXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDL 878
                          +YK  G  +CDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDL
Sbjct: 240  HSYDTINPSRLNRRMYKFKGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDL 299

Query: 879  IHAWGLDMQLGYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDS 1058
            IHAWGLDMQLGYCAQGDR +NVGVVD+EYIVH GL TLG   +N+ NS     T + + S
Sbjct: 300  IHAWGLDMQLGYCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNIT-RRQPS 358

Query: 1059 LNIGALAPSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1232
             +   LAPS S    NRP VRRQS+IE+++F+ RW+ AV +D+CW+DPYQQ   +S+H
Sbjct: 359  SDSETLAPSESHKVDNRPEVRRQSFIEMQMFRKRWENAVNQDKCWVDPYQQSVNKSTH 416


>emb|CBI17649.3| unnamed protein product [Vitis vinifera]
          Length = 413

 Score =  535 bits (1378), Expect = e-149
 Identities = 261/415 (62%), Positives = 311/415 (74%), Gaps = 8/415 (1%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MKL N VS LAD K RRSC+CS+ PT S+LCL+FFIGS  I  DY E++SRWG+   + N
Sbjct: 1    MKLPNCVSQLADSKSRRSCVCSIFPTASVLCLIFFIGSVLIGQDYSEKLSRWGMSTGMLN 60

Query: 192  RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 371
              +N C+N+CR  GSE LPKGIV  +S+L+MRPLWG             LLA+AVG+KQK
Sbjct: 61   SVSNKCENQCRANGSEALPKGIVVTSSDLDMRPLWGFPKKRKDLKRN--LLAVAVGVKQK 118

Query: 372  ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 551
            + VN+MV+KF S  F VMLFHYDGVVDEW+D +W DR +HV+AINQTKWWFAKRFLHP+I
Sbjct: 119  DLVNKMVEKFLSYGFVVMLFHYDGVVDEWKDFKWCDRVLHVAAINQTKWWFAKRFLHPEI 178

Query: 552  VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 731
            VAEYNYIFLWDEDLGV +F+PRRY++ V+ EGL+ISQPALD +KSEVHHQIT        
Sbjct: 179  VAEYNYIFLWDEDLGVTDFNPRRYVATVQREGLEISQPALDGSKSEVHHQITLRGRRSDV 238

Query: 732  XXXIYKSSG-GRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 908
               I+KSSG G+ CD NSTAPPCTGW+E+MAPVFSREAWRC WYMIQNDLIHAWGLDMQL
Sbjct: 239  HRRIFKSSGSGKICDENSTAPPCTGWIEVMAPVFSREAWRCVWYMIQNDLIHAWGLDMQL 298

Query: 909  GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNK-------PNSQASGHTSQAKDSLNI 1067
            GYCAQGDRT+NVGVVDS+YIVH GLPTLG  D +K        +S+    T+       I
Sbjct: 299  GYCAQGDRTKNVGVVDSDYIVHYGLPTLGANDPDKTTPPVQDDDSEPEKITTTTTAETPI 358

Query: 1068 GALAPSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1232
              L  S +   + R  VRRQSYIE  IFK RW++AVKED+CW DPYQQ  ++++H
Sbjct: 359  SKLPASSTSPINFRVEVRRQSYIEYNIFKKRWRQAVKEDKCWKDPYQQFGEKNTH 413


>ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255698 [Vitis vinifera]
            gi|297739491|emb|CBI29673.3| unnamed protein product
            [Vitis vinifera]
          Length = 413

 Score =  529 bits (1362), Expect = e-147
 Identities = 257/410 (62%), Positives = 303/410 (73%), Gaps = 7/410 (1%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLV----- 176
            MK  + +SL +DPK R S LCSL     L C V+FI S F   DYK+R SRW +      
Sbjct: 1    MKTLSCISLPSDPKSR-SYLCSLFIGACLFCGVYFIASEFTVKDYKDRSSRWQISVFQNA 59

Query: 177  --DSIQNRQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAI 350
              +SIQN Q++ CKN+CRP GSE LP+GIV KTSNLE++PLWG             LLA+
Sbjct: 60   HSNSIQNTQSSKCKNQCRPSGSEALPEGIVVKTSNLEVQPLWGATLNGEKSSPSKSLLAM 119

Query: 351  AVGIKQKENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAK 530
            AVGIKQKE VN++V+KF  S+F VMLFHYDGVVDEWR+  WSD +IHV+ +NQTKWWFAK
Sbjct: 120  AVGIKQKEIVNQIVEKFILSNFVVMLFHYDGVVDEWREFAWSDHAIHVTVVNQTKWWFAK 179

Query: 531  RFLHPDIVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITS 710
            RFLHPDIVAEYNYIFLWDEDLGVENFHP RY+SIV+DEGL+ISQPALDP KS VHHQIT+
Sbjct: 180  RFLHPDIVAEYNYIFLWDEDLGVENFHPGRYVSIVEDEGLEISQPALDPKKSRVHHQITA 239

Query: 711  XXXXXXXXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAW 890
                       YK  G  +CD  STAPPC GWVEMMAPVFS+ AWRC W+MIQN+LIHAW
Sbjct: 240  RVRNSRVHRRTYKHRGSGRCDDQSTAPPCVGWVEMMAPVFSKAAWRCVWHMIQNELIHAW 299

Query: 891  GLDMQLGYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIG 1070
            G+DMQLGYCAQGDRT+NVGVVDSEY+VH  LPTLG  D+N+   +   H+S  +      
Sbjct: 300  GVDMQLGYCAQGDRTKNVGVVDSEYVVHLALPTLGVLDENELRGEGHDHSSLREKLPKSV 359

Query: 1071 ALAPSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAK 1220
            ALA S      NR AVRRQS+IE++IF++RW  AVKED+CW+DPY QPA+
Sbjct: 360  ALAQSEFHKVDNRSAVRRQSFIEMQIFRSRWANAVKEDKCWIDPYAQPAE 409


>ref|XP_002272495.2| PREDICTED: uncharacterized protein LOC100244499 [Vitis vinifera]
          Length = 466

 Score =  528 bits (1361), Expect = e-147
 Identities = 257/409 (62%), Positives = 307/409 (75%), Gaps = 8/409 (1%)
 Frame = +3

Query: 30   VSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQNRQTNTC 209
            VS LAD K RRSC+CS+ PT S+LCL+FFIGS  I  DY E++SRWG+   + N  +N C
Sbjct: 60   VSQLADSKSRRSCVCSIFPTASVLCLIFFIGSVLIGQDYSEKLSRWGMSTGMLNSVSNKC 119

Query: 210  KNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEM 389
            +N+CR  GSE LPKGIV  +S+L+MRPLWG             LLA+AVG+KQK+ VN+M
Sbjct: 120  ENQCRANGSEALPKGIVVTSSDLDMRPLWGFPKKRKDLKRN--LLAVAVGVKQKDLVNKM 177

Query: 390  VKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNY 569
            V+KF S  F VMLFHYDGVVDEW+D +W DR +HV+AINQTKWWFAKRFLHP+IVAEYNY
Sbjct: 178  VEKFLSYGFVVMLFHYDGVVDEWKDFKWCDRVLHVAAINQTKWWFAKRFLHPEIVAEYNY 237

Query: 570  IFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXXXXXIYK 749
            IFLWDEDLGV +F+PRRY++ V+ EGL+ISQPALD +KSEVHHQIT           I+K
Sbjct: 238  IFLWDEDLGVTDFNPRRYVATVQREGLEISQPALDGSKSEVHHQITLRGRRSDVHRRIFK 297

Query: 750  SSG-GRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQG 926
            SSG G+ CD NSTAPPCTGW+E+MAPVFSREAWRC WYMIQNDLIHAWGLDMQLGYCAQG
Sbjct: 298  SSGSGKICDENSTAPPCTGWIEVMAPVFSREAWRCVWYMIQNDLIHAWGLDMQLGYCAQG 357

Query: 927  DRTQNVGVVDSEYIVHQGLPTLGGFDDNK-------PNSQASGHTSQAKDSLNIGALAPS 1085
            DRT+NVGVVDS+YIVH GLPTLG  D +K        +S+    T+       I  L  S
Sbjct: 358  DRTKNVGVVDSDYIVHYGLPTLGANDPDKTTPPVQDDDSEPEKITTTTTAETPISKLPAS 417

Query: 1086 RSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1232
             +   + R  VRRQSYIE  IFK RW++AVKED+CW DPYQQ  ++++H
Sbjct: 418  STSPINFRVEVRRQSYIEYNIFKKRWRQAVKEDKCWKDPYQQFGEKNTH 466


>ref|XP_002512624.1| conserved hypothetical protein [Ricinus communis]
            gi|223548585|gb|EEF50076.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 389

 Score =  526 bits (1356), Expect = e-147
 Identities = 259/405 (63%), Positives = 299/405 (73%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MK  N VSL  D K RRSCLCS+LPT SLL LVFFIGS F+  DYKE++SRW +VDS Q+
Sbjct: 1    MKSLNPVSL-PDSKSRRSCLCSILPTASLLFLVFFIGSTFVIPDYKEKISRWKIVDSFQS 59

Query: 192  RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 371
             +  TCKN C+P+GSE LP+GIVSKTSNL+MRPLWG             L  +AVGIKQ+
Sbjct: 60   LKFATCKNRCKPHGSEALPEGIVSKTSNLQMRPLWGFPENDETSSIN--LFTLAVGIKQR 117

Query: 372  ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 551
            + V++MVKKF SS F+VMLFHYDGVVDEW D EW D+ IH+SA NQTKWWFAKRFLHPDI
Sbjct: 118  DIVDKMVKKFLSSKFSVMLFHYDGVVDEWNDYEWKDQVIHISAHNQTKWWFAKRFLHPDI 177

Query: 552  VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 731
            VAEY+YIFLWDEDLGVENF P++YLSIVK +GL+ISQPALDP KS +H QIT+       
Sbjct: 178  VAEYSYIFLWDEDLGVENFDPQQYLSIVKSKGLEISQPALDPGKSAIHQQITARLRRSIV 237

Query: 732  XXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 911
                +K      CDGNSTAPPCTGWVEMMAPVFSR AWRC WYMIQNDLIHAWGLD QLG
Sbjct: 238  HSRTFKPG---TCDGNSTAPPCTGWVEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDYQLG 294

Query: 912  YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGALAPSRS 1091
            YCAQGDR +N+GVVD+EYIVH G PTLGG  ++K                      PSRS
Sbjct: 295  YCAQGDRVKNIGVVDAEYIVHYGRPTLGGTGESK---------------------EPSRS 333

Query: 1092 RVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQS 1226
                 R  VRRQS++E +IF+ RW+KA KED+CW+DPY+Q  KQS
Sbjct: 334  NKKDPRLEVRRQSFVEFKIFQKRWEKAAKEDKCWIDPYEQAEKQS 378


>ref|XP_004142563.1| PREDICTED: uncharacterized protein LOC101221459 [Cucumis sativus]
          Length = 388

 Score =  520 bits (1339), Expect = e-145
 Identities = 245/403 (60%), Positives = 298/403 (73%), Gaps = 3/403 (0%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MK S  + LLA+ K R SCLCS LPT SLLCL  F+GS +++ DY+E++SRWG +D +  
Sbjct: 1    MKFSGCLPLLAEQKSRNSCLCSFLPTASLLCLALFVGSVYVAPDYREKISRWG-IDGLVG 59

Query: 192  RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXX--LLAIAVGIK 365
             + N C+ +CRP GSE LPK IV   SNLEMRPLWG               + A+AVGIK
Sbjct: 60   SKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRSYQNPVNSSSNIFAMAVGIK 119

Query: 366  QKENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHP 545
            QK+ VN+MV KF SSDFAVMLFHYDG+VDEW+   WS+R IHV+A+NQTKWWFAKRFLHP
Sbjct: 120  QKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKGFNWSNRVIHVTAVNQTKWWFAKRFLHP 179

Query: 546  DIVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXX 725
            DIV EYNY+FLWDEDLGV+NF+P+ Y+ I++ EGL+ISQPALDP KSEVHHQIT+     
Sbjct: 180  DIVEEYNYVFLWDEDLGVDNFNPKLYVDIIQSEGLEISQPALDPYKSEVHHQITARGRRS 239

Query: 726  XXXXXIYK-SSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDM 902
                  ++ S+GG+ CD NSTAPPCTGW+EMMAPVFSR AWRC WYMIQNDLIHAWGLDM
Sbjct: 240  TVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDM 299

Query: 903  QLGYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGALAP 1082
            QLGYCAQGDRT+NVGVVDSEY++H G PTLGG ++N+ +                     
Sbjct: 300  QLGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETS--------------------- 338

Query: 1083 SRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQ 1211
            S+S V  +R  VRRQSYIEL++F+ RW+KA ++DECW DPY +
Sbjct: 339  SKSHVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPE 381


>ref|XP_007029398.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508718003|gb|EOY09900.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 389

 Score =  516 bits (1329), Expect = e-144
 Identities = 242/400 (60%), Positives = 298/400 (74%), Gaps = 1/400 (0%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MK  + + L+ + K   SCLC L+P  +LLC+++FIGS+F++ + KE+   WG+ D +Q 
Sbjct: 3    MKSIDCIPLVTERKSWSSCLCRLIPATALLCVIYFIGSSFVAPENKEKAFTWGVADILQT 62

Query: 192  RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 371
             +   CKN+CRP GSE LP+GI++KTSNL++RPLWG             L A+AVGIKQK
Sbjct: 63   SKVENCKNQCRPPGSEPLPEGIITKTSNLQLRPLWGFPKKDDTSSS---LFAVAVGIKQK 119

Query: 372  ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 551
            + V+EMVKKF SS FAVMLFHYDG+VDEW+  EW+D+ IHVSA NQTKWWFAKRFLHPD+
Sbjct: 120  DLVHEMVKKFLSSGFAVMLFHYDGIVDEWKSFEWNDQVIHVSARNQTKWWFAKRFLHPDV 179

Query: 552  VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 731
            V+EY+YIFLWDEDLGVE+FHP++Y+SIV+ E L+ISQPALDPAKSEVHHQIT+       
Sbjct: 180  VSEYSYIFLWDEDLGVEDFHPKKYVSIVESERLEISQPALDPAKSEVHHQITARGRKSMV 239

Query: 732  XXXIYK-SSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 908
                +K  + GR CDG S APPCTGW+EMMAPVFSR AWRC WYMIQNDLIHAWGLDMQL
Sbjct: 240  HRRTFKHRANGRSCDGQSKAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQL 299

Query: 909  GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGALAPSR 1088
            GYCAQGDRT+N+GVVD+EYIVH   PTLGG  +   ++   GH ++            S 
Sbjct: 300  GYCAQGDRTKNIGVVDAEYIVHYNRPTLGGTAEKNHSTVEGGHRNKKS----------SH 349

Query: 1089 SRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQ 1208
            S     R  VRRQSYIEL+IF+ RW+KAVK D+CW+DPYQ
Sbjct: 350  SHWKDPRVEVRRQSYIELDIFRKRWEKAVKNDKCWVDPYQ 389


>ref|XP_007222756.1| hypothetical protein PRUPE_ppa006529mg [Prunus persica]
            gi|462419692|gb|EMJ23955.1| hypothetical protein
            PRUPE_ppa006529mg [Prunus persica]
          Length = 407

 Score =  511 bits (1317), Expect = e-142
 Identities = 249/406 (61%), Positives = 299/406 (73%), Gaps = 4/406 (0%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            + L N  S L DPK R S  CSL    SL+C  +FIG A I+ +YKER++RW ++ + QN
Sbjct: 2    INLFNPASALPDPKNR-SFYCSLFIVASLICGAYFIGGASIAKEYKERLTRWKVIYTRQN 60

Query: 192  RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 371
             + +TCKN C+P GSE LP+GIV+KTS+LE+RPLWG             LLAIAVGIKQK
Sbjct: 61   TKFDTCKNRCQPLGSEALPEGIVAKTSDLEVRPLWGSSVNNENSKPSMSLLAIAVGIKQK 120

Query: 372  ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 551
            E V+ +VKKF SSDF VMLFHYDG VD+WRDL WSDR+IHVS +NQTKWWFAKRFLHPDI
Sbjct: 121  EIVDRIVKKFLSSDFVVMLFHYDGAVDKWRDLNWSDRAIHVSVMNQTKWWFAKRFLHPDI 180

Query: 552  VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 731
            V+EY YIFLWDEDLGVENF P+RYLSIV++EGL+ISQPALDP KS+V+H IT+       
Sbjct: 181  VSEYEYIFLWDEDLGVENFDPKRYLSIVREEGLEISQPALDPDKSDVYHPITARVKKLKV 240

Query: 732  XXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 911
                YK  G  +CD +S+APPC GWVEMMAPVFS+ AW+C WYMIQNDLIHAWGLD+QLG
Sbjct: 241  HRRFYKFKGSGRCDNHSSAPPCAGWVEMMAPVFSKAAWQCVWYMIQNDLIHAWGLDVQLG 300

Query: 912  YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGAL----A 1079
            YCAQGDRT+NVGVVDSEYIVH GLPTLG  D NK     +         +++       A
Sbjct: 301  YCAQGDRTKNVGVVDSEYIVHLGLPTLGVSDGNKAIMLKTRLDFYCLSPIHLSLCNIISA 360

Query: 1080 PSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPA 1217
            PS S   ++R  VR QS+I+++IFK RW  AVKED+CW+DP+Q  A
Sbjct: 361  PSASDKVNDRAKVRMQSFIDMQIFKERWSNAVKEDKCWVDPFQLSA 406


>ref|XP_006431495.1| hypothetical protein CICLE_v10001347mg [Citrus clementina]
            gi|557533617|gb|ESR44735.1| hypothetical protein
            CICLE_v10001347mg [Citrus clementina]
          Length = 358

 Score =  509 bits (1312), Expect = e-142
 Identities = 246/359 (68%), Positives = 281/359 (78%), Gaps = 2/359 (0%)
 Frame = +3

Query: 162  RWGLVDSIQNRQTNTCKNE-CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXX 338
            RWGLV S+ + +  TCKN+ CR  G+E LP+GIVSKTSNLEMRPLW              
Sbjct: 2    RWGLVHSMYSAKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMN 61

Query: 339  LLAIAVGIKQKENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKW 518
            LLAIA GIKQK+ V+++V+KFPS DF VMLFHYD VVDEW+DL W+DR+IHVSA NQTKW
Sbjct: 62   LLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDSVVDEWKDLVWADRAIHVSAANQTKW 121

Query: 519  WFAKRFLHPDIVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHH 698
            WFAKRFLHPDIVAEYNYIFLWDED+GVENF+PRRYLSIVKDEG +ISQPALDP KSEVHH
Sbjct: 122  WFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGFEISQPALDPVKSEVHH 181

Query: 699  QITSXXXXXXXXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDL 878
             IT+          +YK  G  +CD  STAPPC GWVEMMAPVFSR AWRCAWYMIQNDL
Sbjct: 182  PITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDL 241

Query: 879  IHAWGLDMQLGYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDS 1058
            IHAWGLD+QLGYCAQGDRT+NVGVVDSEYIVH GLPTLG     +P   A G  S   + 
Sbjct: 242  IHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLG--VTTEPELNAVGQASDDLEQ 299

Query: 1059 L-NIGALAPSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1232
            + N  ALAPS+SR + NRP VRRQSYIE++IF+NRWK AV++D+CW+DPY Q   Q+SH
Sbjct: 300  IANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPYGQSTNQTSH 358


>ref|XP_002325467.2| hypothetical protein POPTR_0019s08010g [Populus trichocarpa]
            gi|550316990|gb|EEE99848.2| hypothetical protein
            POPTR_0019s08010g [Populus trichocarpa]
          Length = 381

 Score =  506 bits (1304), Expect = e-141
 Identities = 253/400 (63%), Positives = 294/400 (73%)
 Frame = +3

Query: 30   VSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQNRQTNTC 209
            VSL +D K RRS  CS+ P  S L L+FF   AFI+ DYKER+SRWG+ D+ QN + + C
Sbjct: 9    VSLPSDSKSRRSHWCSVFPAASFLFLIFFAVYAFIAPDYKERLSRWGIADTFQNFKFSNC 68

Query: 210  KNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEM 389
            KN+CRP GSE+LP+GIVSKTSN +MRPLWG             LLA+AVGI Q++ VN+M
Sbjct: 69   KNQCRPPGSESLPEGIVSKTSNFQMRPLWGFPKNDENSSIN--LLAVAVGITQRDLVNKM 126

Query: 390  VKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNY 569
            VKKF SS+F+VMLFHYDG+VDEWRD EW+DR IHVSA NQTKWWFAKRFLHPDIVA  NY
Sbjct: 127  VKKFLSSNFSVMLFHYDGIVDEWRDFEWNDRVIHVSARNQTKWWFAKRFLHPDIVAACNY 186

Query: 570  IFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXXXXXIYK 749
            IFLWDEDLGVENF+P++Y+SIVK EGL ISQPALD  KS VH QIT            YK
Sbjct: 187  IFLWDEDLGVENFNPKQYVSIVKSEGLHISQPALD-YKSLVHQQITVRASKSGVHRRTYK 245

Query: 750  SSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGD 929
                  CDGNSTAPPCTGWVEMMAPVFSR AWRC WYMIQNDLIHAWGLD QLGYC+QGD
Sbjct: 246  PG---ICDGNSTAPPCTGWVEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDYQLGYCSQGD 302

Query: 930  RTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGALAPSRSRVHSNR 1109
            RT+N+G+VD+EYIVH G PTLGG  +N+                      PSRS+    R
Sbjct: 303  RTKNIGIVDAEYIVHYGHPTLGGVVENE---------------------EPSRSQKTDPR 341

Query: 1110 PAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSS 1229
              VRRQS IEL IF+ RWK+AV+ED+CW+DPY++  K+SS
Sbjct: 342  LEVRRQSLIELRIFQKRWKEAVEEDQCWIDPYKEAVKESS 381


>ref|XP_004301559.1| PREDICTED: uncharacterized protein LOC101306243 [Fragaria vesca
            subsp. vesca]
          Length = 397

 Score =  504 bits (1298), Expect = e-140
 Identities = 246/394 (62%), Positives = 292/394 (74%)
 Frame = +3

Query: 24   NFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQNRQTN 203
            N VS+L+DPK R S  CSL   VSL+   +FIG A I+ +YKE+++RW +  ++QN   +
Sbjct: 10   NPVSVLSDPKNR-SFYCSLFIVVSLVTGAYFIGGASIAKEYKEKLTRWKVTYTMQNTNLD 68

Query: 204  TCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVN 383
            TCK  C+P G+E LP+GIV+KTS+ ++RPLWG             LLAIAVGIKQKE V+
Sbjct: 69   TCKKRCQPSGTEALPEGIVAKTSDFKIRPLWGTSKKDKNSTPSKSLLAIAVGIKQKEIVD 128

Query: 384  EMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEY 563
            ++V+KF SSDF VMLFHYDG VD+WRDL WSD +IHVS +NQTKWWFAKRFLHPDIV EY
Sbjct: 129  KIVRKFLSSDFVVMLFHYDGAVDKWRDLHWSDTAIHVSVMNQTKWWFAKRFLHPDIVTEY 188

Query: 564  NYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXXXXXI 743
             +IFLWDEDLGVENF P RYLS++ DEGL+ISQPALDP KSEV+H IT+           
Sbjct: 189  KHIFLWDEDLGVENFDPERYLSVIWDEGLEISQPALDPVKSEVYHPITARVKKSKVHRRF 248

Query: 744  YKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQ 923
            YK  G  +CD  S+ PPC GWVEMMAPVFSR AWRC WYMIQNDL+HAWGLD QLGYCAQ
Sbjct: 249  YKFKGSGRCDDQSSGPPCIGWVEMMAPVFSRAAWRCVWYMIQNDLVHAWGLDEQLGYCAQ 308

Query: 924  GDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGALAPSRSRVHS 1103
            GDR +NVGVVDSEYIVH GLPTLG  DDNK  +      SQ +DS    ALAPS   + S
Sbjct: 309  GDRMKNVGVVDSEYIVHLGLPTLGVTDDNKGINNMV--HSQKEDS---KALAPSGPPIPS 363

Query: 1104 NRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPY 1205
            +R  VR QS+I++ IFK RW+ AVKED CW+DPY
Sbjct: 364  DRAKVRMQSFIDMRIFKERWRSAVKEDNCWVDPY 397


>ref|XP_006385071.1| hypothetical protein POPTR_0004s23630g [Populus trichocarpa]
            gi|550341839|gb|ERP62868.1| hypothetical protein
            POPTR_0004s23630g [Populus trichocarpa]
          Length = 383

 Score =  503 bits (1296), Expect = e-140
 Identities = 250/407 (61%), Positives = 298/407 (73%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MK  +  S  +DPKR  S LCSLL  +SL+C V+F+GSAF    YKER++ WG+++++Q 
Sbjct: 1    MKTLSCASAPSDPKRG-SYLCSLLIALSLICSVYFVGSAFFGKQYKERITAWGVIEAMQT 59

Query: 192  RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 371
              ++ CK+ CRP GSE LP+GIV+K SN +MRPLWG             LLAIAVGIKQK
Sbjct: 60   --SDICKDRCRPSGSEALPQGIVTKKSNYKMRPLWGSSLKNDNPPPSMSLLAIAVGIKQK 117

Query: 372  ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 551
              VN++V+KFP SDF VMLFHYDGVVDEWRDL WS+ +IHVSA+NQTKWWFAKRFLHPDI
Sbjct: 118  AIVNQIVEKFPLSDFVVMLFHYDGVVDEWRDLSWSNSAIHVSAVNQTKWWFAKRFLHPDI 177

Query: 552  VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 731
            V+EYNYIFLWDEDLGVENF+PRRYLSIVKDEGL++SQPALDP++S VHHQIT+       
Sbjct: 178  VSEYNYIFLWDEDLGVENFNPRRYLSIVKDEGLEVSQPALDPSRSTVHHQITARIRNSIV 237

Query: 732  XXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 911
               I K  G  KC GNST+PPCTGWVEMMAPVFS+ AW+C WYMIQNDLIHAWGLD +LG
Sbjct: 238  HRKILKFRGNTKCYGNSTSPPCTGWVEMMAPVFSKAAWQCTWYMIQNDLIHAWGLDRKLG 297

Query: 912  YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGALAPSRS 1091
            YCAQGD T+NVGVVD+EYIVH GL TLG F+     S+AS           I  +   R 
Sbjct: 298  YCAQGDWTKNVGVVDAEYIVHLGLSTLGVFN----GSEAS-----------ISYVPYDRI 342

Query: 1092 RVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1232
             V      VR QS +E+ IF  RW+ A+KED CW+DPYQ  + Q+ H
Sbjct: 343  IV------VRTQSSVEMNIFHERWEAAIKEDRCWVDPYQLISNQTRH 383


>ref|XP_007049550.1| Uncharacterized protein TCM_002621 [Theobroma cacao]
            gi|508701811|gb|EOX93707.1| Uncharacterized protein
            TCM_002621 [Theobroma cacao]
          Length = 385

 Score =  499 bits (1285), Expect = e-138
 Identities = 239/405 (59%), Positives = 295/405 (72%)
 Frame = +3

Query: 9    KMKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQ 188
            K ++S  +SL A+P+R+R      LP + LL   FFIGSAFI TDYKER+  W  V  +Q
Sbjct: 2    KKRMSTSISLKAEPRRQRLFTHRFLPMILLLSAAFFIGSAFIITDYKERILGWRSVIVLQ 61

Query: 189  NRQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQ 368
             ++   C+ +CR YGSE LPKGI+S+TS+LEMRPLWG             LLAIAVGIKQ
Sbjct: 62   YKRPKICETQCRAYGSEALPKGIISETSDLEMRPLWGLQNKKKPKLSMN-LLAIAVGIKQ 120

Query: 369  KENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPD 548
            KE+VN++VKKFP+SDF VMLFHYDG+VD+W+DLEW+D +IHVSA+NQTKWWFAKRFLHPD
Sbjct: 121  KESVNKIVKKFPASDFVVMLFHYDGIVDQWKDLEWNDLAIHVSAVNQTKWWFAKRFLHPD 180

Query: 549  IVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXX 728
            IV+EY+YIFLWDEDLGV++F+  RYLSI+K EGL+ISQPALD  KSE+HH IT+      
Sbjct: 181  IVSEYSYIFLWDEDLGVDHFNAARYLSIIKKEGLEISQPALDVEKSELHHPITARDKKST 240

Query: 729  XXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 908
                 Y+  G  +C+ NST PPCTG+VEMMAPVFSR +WRCAW+MIQ+DL++ WG+D QL
Sbjct: 241  VHRRTYEVIGRTRCNENSTGPPCTGFVEMMAPVFSRASWRCAWHMIQSDLVYGWGVDFQL 300

Query: 909  GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGALAPSR 1088
            GYCAQGDRTQ +G+VDSEY+VH  LPTLGG   N+                      PS 
Sbjct: 301  GYCAQGDRTQKIGIVDSEYLVHNALPTLGGVAANE---------------------VPSP 339

Query: 1089 SRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQ 1223
            S     R  VR+QS+IELEIFKNRWK+AVK+D+CW DPY+   K+
Sbjct: 340  SSEPGGRSEVRKQSFIELEIFKNRWKRAVKQDKCWFDPYEPSTKK 384


>ref|XP_007041144.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
            gi|508705079|gb|EOX96975.1| Uncharacterized protein
            isoform 3, partial [Theobroma cacao]
          Length = 438

 Score =  497 bits (1280), Expect = e-138
 Identities = 237/358 (66%), Positives = 277/358 (77%), Gaps = 2/358 (0%)
 Frame = +3

Query: 9    KMKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQ 188
            KMK  N  S+++DPK R SCLC L    SL+C  +FI  AFI+ +YK+R+SRW +++ +Q
Sbjct: 82   KMKAFNCASVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQ 140

Query: 189  NRQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQ 368
            N ++N CK  CRP GSE LP+GIV KTSNLEMRPLW              LLAIAVGIKQ
Sbjct: 141  NSKSNICKIRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQ 200

Query: 369  KENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPD 548
            KE VN+++KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPD
Sbjct: 201  KEIVNQIIKKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPD 260

Query: 549  IVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXX 728
            IVA+Y Y+FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQIT+      
Sbjct: 261  IVADYKYLFLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSR 320

Query: 729  XXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 908
                +YK  G  +CDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLDMQL
Sbjct: 321  VHRRMYKFKGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQL 380

Query: 909  GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQ--AKDSLNIGAL 1076
            GYCAQGDR +NVGVVD+EYIVH GL TLG   +N+ NS     T +  + DS  +G +
Sbjct: 381  GYCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNITRRQPSSDSETLGTI 438


>gb|EYU46099.1| hypothetical protein MIMGU_mgv1a007624mg [Mimulus guttatus]
          Length = 401

 Score =  496 bits (1276), Expect = e-137
 Identities = 244/397 (61%), Positives = 294/397 (74%), Gaps = 1/397 (0%)
 Frame = +3

Query: 39   LADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQNRQTNTCKNE 218
            + +PKR+RS + S LP+  LL  VFFIGSAF+ TDYKER      +  I+  ++ TC+ E
Sbjct: 6    MPEPKRKRSFMWSCLPSAILLSAVFFIGSAFLVTDYKERFLGACNLYPIKATKSKTCEYE 65

Query: 219  CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMVKK 398
            CRP G+ETLP+GIVS+T+++EMRPL G             LL IAVGIKQK+NVNE+VKK
Sbjct: 66   CRPNGTETLPRGIVSRTTDMEMRPLSGPPKKKKLKSPMN-LLGIAVGIKQKQNVNEIVKK 124

Query: 399  FPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYIFL 578
            FP +DFAVMLFHYDG V+ WRDLEWS+  +HVSAINQTKWWFAKRFLHPD+VA+Y+YIFL
Sbjct: 125  FPLTDFAVMLFHYDGNVNGWRDLEWSNSVVHVSAINQTKWWFAKRFLHPDVVAQYDYIFL 184

Query: 579  WDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXXXXXIYKSSG 758
            WDEDLGVENFH  RYLSIVK+EGLQISQPA+D  KSEVH+++T                G
Sbjct: 185  WDEDLGVENFHAGRYLSIVKEEGLQISQPAIDAEKSEVHYKLTEREISSKVHRRAINLHG 244

Query: 759  -GRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRT 935
             GR+C  NS  PPCTG+VEMMAPVFSR +WRCAW+MIQNDL+HAWGLD QLGYCAQG+RT
Sbjct: 245  PGRRCYENSMEPPCTGFVEMMAPVFSRVSWRCAWHMIQNDLVHAWGLDFQLGYCAQGNRT 304

Query: 936  QNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGALAPSRSRVHSNRPA 1115
             N+G+VDSEY++H GLPTLGG    K N +    +S  K   N G    S       R A
Sbjct: 305  TNIGIVDSEYLIHLGLPTLGGSSGTKINDEVEKQSSPDKILPNAGKTEISAVEPSDERNA 364

Query: 1116 VRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQS 1226
            VRR+S+IEL+ FKNRWKKAV+EDECW+DP Q P +Q+
Sbjct: 365  VRRESFIELDDFKNRWKKAVREDECWVDPLQTPPQQN 401


>ref|XP_007041145.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508705080|gb|EOX96976.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 374

 Score =  494 bits (1272), Expect = e-137
 Identities = 232/337 (68%), Positives = 268/337 (79%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MK  N  S+++DPK R SCLC L    SL+C  +FI  AFI+ +YK+R+SRW +++ +QN
Sbjct: 1    MKAFNCASVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQN 59

Query: 192  RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 371
             ++N CK  CRP GSE LP+GIV KTSNLEMRPLW              LLAIAVGIKQK
Sbjct: 60   SKSNICKIRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQK 119

Query: 372  ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 551
            E VN+++KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPDI
Sbjct: 120  EIVNQIIKKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDI 179

Query: 552  VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 731
            VA+Y Y+FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQIT+       
Sbjct: 180  VADYKYLFLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRV 239

Query: 732  XXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 911
               +YK  G  +CDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLDMQLG
Sbjct: 240  HRRMYKFKGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLG 299

Query: 912  YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNS 1022
            YCAQGDR +NVGVVD+EYIVH GL TLG   +N+ NS
Sbjct: 300  YCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAENELNS 336


>gb|EXC29927.1| hypothetical protein L484_015120 [Morus notabilis]
          Length = 382

 Score =  493 bits (1269), Expect = e-137
 Identities = 247/404 (61%), Positives = 291/404 (72%), Gaps = 5/404 (1%)
 Frame = +3

Query: 12   MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 191
            MK S F+SLL D   RRSC CSL+P  SLLCLV+FIGSAFI+ DYKE++S WG+ D++QN
Sbjct: 1    MKPSYFLSLLVDSHSRRSCFCSLIPAASLLCLVYFIGSAFIAPDYKEKLSLWGVTDTLQN 60

Query: 192  RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 371
             + N CKN+CRP GSE LP+GIV KTSNLE RPLWG             L A+AVGIKQK
Sbjct: 61   FKLNKCKNQCRPSGSEALPEGIVCKTSNLEFRPLWGSPKKIESSSVN--LFAVAVGIKQK 118

Query: 372  ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 551
            + VN+MV+KF SS+F VMLFHYDG VD+W+  EWSDR IHVSA+NQTKWWFAKRFLHPDI
Sbjct: 119  DLVNKMVRKFLSSNFVVMLFHYDGNVDKWKTFEWSDRVIHVSAVNQTKWWFAKRFLHPDI 178

Query: 552  VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKS-EVHHQITSXXXXXX 728
            VAEY+YIFLWDEDLGV++F P+ Y+SIV+ EGL+ISQPALDP KS E+HHQIT+      
Sbjct: 179  VAEYDYIFLWDEDLGVDSFDPKLYISIVQSEGLEISQPALDPVKSVELHHQITARGRRST 238

Query: 729  XXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 908
                 YK   G+ CD NS APPCTGW+EMMAPVFSR AWRCAW+MI              
Sbjct: 239  VHRRTYKH--GKGCDENSKAPPCTGWIEMMAPVFSRAAWRCAWFMI-------------- 282

Query: 909  GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASGHTSQAKDSLNIGALAPSR 1088
                QGDRT++VGVVD+EY+VH G  TLGG D NK  S A       K+  ++    P  
Sbjct: 283  ----QGDRTKSVGVVDAEYVVHHGRSTLGGGDGNKTKSSAKNRIYGRKNITSMEISPPLH 338

Query: 1089 SRVHS----NRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQ 1208
            S  HS    +R AVRRQSYIEL+IFK RW KAV+ED+CW+DPYQ
Sbjct: 339  SHSHSHPKDHRAAVRRQSYIELDIFKKRWVKAVQEDKCWVDPYQ 382


Top