BLASTX nr result
ID: Akebia22_contig00003110
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia22_contig00003110 (1727 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006470988.1| PREDICTED: uncharacterized protein LOC102624... 562 e-157 ref|XP_007041142.1| Uncharacterized protein isoform 1 [Theobroma... 558 e-156 ref|XP_006431494.1| hypothetical protein CICLE_v10001347mg [Citr... 558 e-156 ref|XP_007041143.1| Uncharacterized protein isoform 2 [Theobroma... 550 e-154 emb|CBI17649.3| unnamed protein product [Vitis vinifera] 536 e-149 ref|XP_002272495.2| PREDICTED: uncharacterized protein LOC100244... 529 e-147 ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255... 529 e-147 ref|XP_002512624.1| conserved hypothetical protein [Ricinus comm... 526 e-147 ref|XP_004142563.1| PREDICTED: uncharacterized protein LOC101221... 520 e-145 ref|XP_007029398.1| Uncharacterized protein isoform 1 [Theobroma... 514 e-143 ref|XP_007222756.1| hypothetical protein PRUPE_ppa006529mg [Prun... 512 e-142 ref|XP_006431495.1| hypothetical protein CICLE_v10001347mg [Citr... 509 e-141 ref|XP_002325467.2| hypothetical protein POPTR_0019s08010g [Popu... 506 e-141 ref|XP_004301559.1| PREDICTED: uncharacterized protein LOC101306... 505 e-140 ref|XP_006385071.1| hypothetical protein POPTR_0004s23630g [Popu... 504 e-140 ref|XP_007049550.1| Uncharacterized protein TCM_002621 [Theobrom... 499 e-138 ref|XP_007041144.1| Uncharacterized protein isoform 3, partial [... 498 e-138 gb|EYU46099.1| hypothetical protein MIMGU_mgv1a007624mg [Mimulus... 496 e-137 ref|XP_007041145.1| Uncharacterized protein isoform 4 [Theobroma... 494 e-137 gb|EXC29927.1| hypothetical protein L484_015120 [Morus notabilis] 493 e-137 >ref|XP_006470988.1| PREDICTED: uncharacterized protein LOC102624954 [Citrus sinensis] Length = 407 Score = 562 bits (1448), Expect = e-157 Identities = 274/412 (66%), Positives = 321/412 (77%), Gaps = 5/412 (1%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MK +N +S+L+DP R SCLCSL +L+C V+FIGS+F++ + KER+ RWGLV S+ + Sbjct: 2 MKATNSISVLSDPPSR-SCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYS 60 Query: 204 RQTNTCKNE-CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQ 380 + TCKN+ CR G+E LP+GIVSKTSNLEMRPLW LLAIA GIKQ Sbjct: 61 AKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQ 120 Query: 381 KENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPD 560 K+ V+++V+KFPS DF VMLFHYDGVVDEW+DL W+DR+IHVSA NQTKWWFAKRFLHPD Sbjct: 121 KKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPD 180 Query: 561 IVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXX 740 IVAEYNYIFLWDED+GVENF+PRRYLSIVKDEGL+ISQPALDP KSEVHH IT+ Sbjct: 181 IVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSK 240 Query: 741 XXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 920 +YK G +CD STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLD+QL Sbjct: 241 AHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQL 300 Query: 921 GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSL----NIGAL 1088 GYCAQGDRT+NVGVVDSEYIVH GLPTLG + + N+ QA D L N AL Sbjct: 301 GYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNT-----VGQASDDLEQIANPVAL 355 Query: 1089 APSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1244 APS+SR + NRP VRRQSYIE++IF+NRWK AV++D+CW+DPY Q Q+SH Sbjct: 356 APSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPYGQSTNQTSH 407 >ref|XP_007041142.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508705077|gb|EOX96973.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 405 Score = 558 bits (1439), Expect = e-156 Identities = 264/407 (64%), Positives = 315/407 (77%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MK N S+++DPK R SCLC L SL+C +FI AFI+ +YK+R+SRW +++ +QN Sbjct: 1 MKAFNCASVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQN 59 Query: 204 RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 383 ++N CK CRP GSE LP+GIV KTSNLEMRPLW LLAIAVGIKQK Sbjct: 60 SKSNICKIRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQK 119 Query: 384 ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 563 E VN+++KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPDI Sbjct: 120 EIVNQIIKKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDI 179 Query: 564 VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 743 VA+Y Y+FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQIT+ Sbjct: 180 VADYKYLFLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRV 239 Query: 744 XXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 923 +YK G +CDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLDMQLG Sbjct: 240 HRRMYKFKGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLG 299 Query: 924 YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRS 1103 YCAQGDR +NVGVVD+EYIVH GL TLG +N+ NS + T + + S + LAPS S Sbjct: 300 YCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNIT-RRQPSSDSETLAPSES 358 Query: 1104 RVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1244 NRP VRRQS+IE+++F+ RW+ AV +D+CW+DPYQQ +S+H Sbjct: 359 HKVDNRPEVRRQSFIEMQMFRKRWENAVNQDKCWVDPYQQSVNKSTH 405 >ref|XP_006431494.1| hypothetical protein CICLE_v10001347mg [Citrus clementina] gi|557533616|gb|ESR44734.1| hypothetical protein CICLE_v10001347mg [Citrus clementina] Length = 407 Score = 558 bits (1438), Expect = e-156 Identities = 272/412 (66%), Positives = 319/412 (77%), Gaps = 5/412 (1%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MK +N +S+L+DP R SCLCSL +L+C V+FIGS+F++ + KER+ RWGLV S+ + Sbjct: 2 MKTTNSISVLSDPPSR-SCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYS 60 Query: 204 RQTNTCKNE-CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQ 380 + TCKN+ CR G+E LP+GIVSKTSNLEMRPLW LLAIA GIKQ Sbjct: 61 AKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQ 120 Query: 381 KENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPD 560 K+ V+++V+KFPS DF VMLFHYD VVDEW+DL W+DR+IHVSA NQTKWWFAKRFLHPD Sbjct: 121 KKIVDQIVRKFPSKDFVVMLFHYDSVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPD 180 Query: 561 IVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXX 740 IVAEYNYIFLWDED+GVENF+PRRYLSIVKDEG +ISQPALDP KSEVHH IT+ Sbjct: 181 IVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGFEISQPALDPVKSEVHHPITARRRNSK 240 Query: 741 XXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 920 +YK G +CD STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLD+QL Sbjct: 241 AHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQL 300 Query: 921 GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSL----NIGAL 1088 GYCAQGDRT+NVGVVDSEYIVH GLPTLG + + N+ QA D L N AL Sbjct: 301 GYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNA-----VGQASDDLEQIANPVAL 355 Query: 1089 APSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1244 APS+SR + NRP VRRQSYIE++IF+NRWK AV++D+CW+DPY Q Q+SH Sbjct: 356 APSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPYGQSTNQTSH 407 >ref|XP_007041143.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508705078|gb|EOX96974.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 416 Score = 550 bits (1417), Expect = e-154 Identities = 264/418 (63%), Positives = 315/418 (75%), Gaps = 11/418 (2%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MK N S+++DPK R SCLC L SL+C +FI AFI+ +YK+R+SRW +++ +QN Sbjct: 1 MKAFNCASVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQN 59 Query: 204 RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 383 ++N CK CRP GSE LP+GIV KTSNLEMRPLW LLAIAVGIKQK Sbjct: 60 SKSNICKIRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQK 119 Query: 384 ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 563 E VN+++KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPDI Sbjct: 120 EIVNQIIKKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDI 179 Query: 564 VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITS------- 722 VA+Y Y+FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQIT+ Sbjct: 180 VADYKYLFLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRV 239 Query: 723 ----XXXXXXXXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDL 890 +YK G +CDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDL Sbjct: 240 HSYDTINPSRLNRRMYKFKGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDL 299 Query: 891 IHAWGLDMQLGYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDS 1070 IHAWGLDMQLGYCAQGDR +NVGVVD+EYIVH GL TLG +N+ NS + T + + S Sbjct: 300 IHAWGLDMQLGYCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNIT-RRQPS 358 Query: 1071 LNIGALAPSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1244 + LAPS S NRP VRRQS+IE+++F+ RW+ AV +D+CW+DPYQQ +S+H Sbjct: 359 SDSETLAPSESHKVDNRPEVRRQSFIEMQMFRKRWENAVNQDKCWVDPYQQSVNKSTH 416 >emb|CBI17649.3| unnamed protein product [Vitis vinifera] Length = 413 Score = 536 bits (1380), Expect = e-149 Identities = 261/415 (62%), Positives = 311/415 (74%), Gaps = 8/415 (1%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MKL N VS LAD K RRSC+CS+ PT S+LCL+FFIGS I DY E++SRWG+ + N Sbjct: 1 MKLPNCVSQLADSKSRRSCVCSIFPTASVLCLIFFIGSVLIGQDYSEKLSRWGMSTGMLN 60 Query: 204 RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 383 +N C+N+CR GSE LPKGIV +S+L+MRPLWG LLA+AVG+KQK Sbjct: 61 SVSNKCENQCRANGSEALPKGIVVTSSDLDMRPLWGFPKKRKDLKRN--LLAVAVGVKQK 118 Query: 384 ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 563 + VN+MV+KF S F VMLFHYDGVVDEW+D +W DR +HV+AINQTKWWFAKRFLHP+I Sbjct: 119 DLVNKMVEKFLSYGFVVMLFHYDGVVDEWKDFKWCDRVLHVAAINQTKWWFAKRFLHPEI 178 Query: 564 VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 743 VAEYNYIFLWDEDLGV +F+PRRY++ V+ EGL+ISQPALD +KSEVHHQIT Sbjct: 179 VAEYNYIFLWDEDLGVTDFNPRRYVATVQREGLEISQPALDGSKSEVHHQITLRGRRSDV 238 Query: 744 XXXIYKSSG-GRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 920 I+KSSG G+ CD NSTAPPCTGW+E+MAPVFSREAWRC WYMIQNDLIHAWGLDMQL Sbjct: 239 HRRIFKSSGSGKICDENSTAPPCTGWIEVMAPVFSREAWRCVWYMIQNDLIHAWGLDMQL 298 Query: 921 GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNK-------PNSQASSHTSQAKDSLNI 1079 GYCAQGDRT+NVGVVDS+YIVH GLPTLG D +K +S+ T+ I Sbjct: 299 GYCAQGDRTKNVGVVDSDYIVHYGLPTLGANDPDKTTPPVQDDDSEPEKITTTTTAETPI 358 Query: 1080 GALAPSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1244 L S + + R VRRQSYIE IFK RW++AVKED+CW DPYQQ ++++H Sbjct: 359 SKLPASSTSPINFRVEVRRQSYIEYNIFKKRWRQAVKEDKCWKDPYQQFGEKNTH 413 >ref|XP_002272495.2| PREDICTED: uncharacterized protein LOC100244499 [Vitis vinifera] Length = 466 Score = 529 bits (1363), Expect = e-147 Identities = 257/409 (62%), Positives = 307/409 (75%), Gaps = 8/409 (1%) Frame = +3 Query: 42 VSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQNRQTNTC 221 VS LAD K RRSC+CS+ PT S+LCL+FFIGS I DY E++SRWG+ + N +N C Sbjct: 60 VSQLADSKSRRSCVCSIFPTASVLCLIFFIGSVLIGQDYSEKLSRWGMSTGMLNSVSNKC 119 Query: 222 KNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEM 401 +N+CR GSE LPKGIV +S+L+MRPLWG LLA+AVG+KQK+ VN+M Sbjct: 120 ENQCRANGSEALPKGIVVTSSDLDMRPLWGFPKKRKDLKRN--LLAVAVGVKQKDLVNKM 177 Query: 402 VKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNY 581 V+KF S F VMLFHYDGVVDEW+D +W DR +HV+AINQTKWWFAKRFLHP+IVAEYNY Sbjct: 178 VEKFLSYGFVVMLFHYDGVVDEWKDFKWCDRVLHVAAINQTKWWFAKRFLHPEIVAEYNY 237 Query: 582 IFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXXXXXIYK 761 IFLWDEDLGV +F+PRRY++ V+ EGL+ISQPALD +KSEVHHQIT I+K Sbjct: 238 IFLWDEDLGVTDFNPRRYVATVQREGLEISQPALDGSKSEVHHQITLRGRRSDVHRRIFK 297 Query: 762 SSG-GRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQG 938 SSG G+ CD NSTAPPCTGW+E+MAPVFSREAWRC WYMIQNDLIHAWGLDMQLGYCAQG Sbjct: 298 SSGSGKICDENSTAPPCTGWIEVMAPVFSREAWRCVWYMIQNDLIHAWGLDMQLGYCAQG 357 Query: 939 DRTQNVGVVDSEYIVHQGLPTLGGFDDNK-------PNSQASSHTSQAKDSLNIGALAPS 1097 DRT+NVGVVDS+YIVH GLPTLG D +K +S+ T+ I L S Sbjct: 358 DRTKNVGVVDSDYIVHYGLPTLGANDPDKTTPPVQDDDSEPEKITTTTTAETPISKLPAS 417 Query: 1098 RSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1244 + + R VRRQSYIE IFK RW++AVKED+CW DPYQQ ++++H Sbjct: 418 STSPINFRVEVRRQSYIEYNIFKKRWRQAVKEDKCWKDPYQQFGEKNTH 466 >ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255698 [Vitis vinifera] gi|297739491|emb|CBI29673.3| unnamed protein product [Vitis vinifera] Length = 413 Score = 529 bits (1363), Expect = e-147 Identities = 257/410 (62%), Positives = 303/410 (73%), Gaps = 7/410 (1%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLV----- 188 MK + +SL +DPK R S LCSL L C V+FI S F DYK+R SRW + Sbjct: 1 MKTLSCISLPSDPKSR-SYLCSLFIGACLFCGVYFIASEFTVKDYKDRSSRWQISVFQNA 59 Query: 189 --DSIQNRQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAI 362 +SIQN Q++ CKN+CRP GSE LP+GIV KTSNLE++PLWG LLA+ Sbjct: 60 HSNSIQNTQSSKCKNQCRPSGSEALPEGIVVKTSNLEVQPLWGATLNGEKSSPSKSLLAM 119 Query: 363 AVGIKQKENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAK 542 AVGIKQKE VN++V+KF S+F VMLFHYDGVVDEWR+ WSD +IHV+ +NQTKWWFAK Sbjct: 120 AVGIKQKEIVNQIVEKFILSNFVVMLFHYDGVVDEWREFAWSDHAIHVTVVNQTKWWFAK 179 Query: 543 RFLHPDIVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITS 722 RFLHPDIVAEYNYIFLWDEDLGVENFHP RY+SIV+DEGL+ISQPALDP KS VHHQIT+ Sbjct: 180 RFLHPDIVAEYNYIFLWDEDLGVENFHPGRYVSIVEDEGLEISQPALDPKKSRVHHQITA 239 Query: 723 XXXXXXXXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAW 902 YK G +CD STAPPC GWVEMMAPVFS+ AWRC W+MIQN+LIHAW Sbjct: 240 RVRNSRVHRRTYKHRGSGRCDDQSTAPPCVGWVEMMAPVFSKAAWRCVWHMIQNELIHAW 299 Query: 903 GLDMQLGYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIG 1082 G+DMQLGYCAQGDRT+NVGVVDSEY+VH LPTLG D+N+ + H+S + Sbjct: 300 GVDMQLGYCAQGDRTKNVGVVDSEYVVHLALPTLGVLDENELRGEGHDHSSLREKLPKSV 359 Query: 1083 ALAPSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAK 1232 ALA S NR AVRRQS+IE++IF++RW AVKED+CW+DPY QPA+ Sbjct: 360 ALAQSEFHKVDNRSAVRRQSFIEMQIFRSRWANAVKEDKCWIDPYAQPAE 409 >ref|XP_002512624.1| conserved hypothetical protein [Ricinus communis] gi|223548585|gb|EEF50076.1| conserved hypothetical protein [Ricinus communis] Length = 389 Score = 526 bits (1356), Expect = e-147 Identities = 259/405 (63%), Positives = 299/405 (73%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MK N VSL D K RRSCLCS+LPT SLL LVFFIGS F+ DYKE++SRW +VDS Q+ Sbjct: 1 MKSLNPVSL-PDSKSRRSCLCSILPTASLLFLVFFIGSTFVIPDYKEKISRWKIVDSFQS 59 Query: 204 RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 383 + TCKN C+P+GSE LP+GIVSKTSNL+MRPLWG L +AVGIKQ+ Sbjct: 60 LKFATCKNRCKPHGSEALPEGIVSKTSNLQMRPLWGFPENDETSSIN--LFTLAVGIKQR 117 Query: 384 ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 563 + V++MVKKF SS F+VMLFHYDGVVDEW D EW D+ IH+SA NQTKWWFAKRFLHPDI Sbjct: 118 DIVDKMVKKFLSSKFSVMLFHYDGVVDEWNDYEWKDQVIHISAHNQTKWWFAKRFLHPDI 177 Query: 564 VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 743 VAEY+YIFLWDEDLGVENF P++YLSIVK +GL+ISQPALDP KS +H QIT+ Sbjct: 178 VAEYSYIFLWDEDLGVENFDPQQYLSIVKSKGLEISQPALDPGKSAIHQQITARLRRSIV 237 Query: 744 XXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 923 +K CDGNSTAPPCTGWVEMMAPVFSR AWRC WYMIQNDLIHAWGLD QLG Sbjct: 238 HSRTFKPG---TCDGNSTAPPCTGWVEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDYQLG 294 Query: 924 YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRS 1103 YCAQGDR +N+GVVD+EYIVH G PTLGG ++K PSRS Sbjct: 295 YCAQGDRVKNIGVVDAEYIVHYGRPTLGGTGESK---------------------EPSRS 333 Query: 1104 RVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQS 1238 R VRRQS++E +IF+ RW+KA KED+CW+DPY+Q KQS Sbjct: 334 NKKDPRLEVRRQSFVEFKIFQKRWEKAAKEDKCWIDPYEQAEKQS 378 >ref|XP_004142563.1| PREDICTED: uncharacterized protein LOC101221459 [Cucumis sativus] Length = 388 Score = 520 bits (1339), Expect = e-145 Identities = 245/403 (60%), Positives = 298/403 (73%), Gaps = 3/403 (0%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MK S + LLA+ K R SCLCS LPT SLLCL F+GS +++ DY+E++SRWG +D + Sbjct: 1 MKFSGCLPLLAEQKSRNSCLCSFLPTASLLCLALFVGSVYVAPDYREKISRWG-IDGLVG 59 Query: 204 RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXX--LLAIAVGIK 377 + N C+ +CRP GSE LPK IV SNLEMRPLWG + A+AVGIK Sbjct: 60 SKFNKCEKQCRPNGSEPLPKDIVVTASNLEMRPLWGASKRSYQNPVNSSSNIFAMAVGIK 119 Query: 378 QKENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHP 557 QK+ VN+MV KF SSDFAVMLFHYDG+VDEW+ WS+R IHV+A+NQTKWWFAKRFLHP Sbjct: 120 QKDLVNKMVTKFLSSDFAVMLFHYDGIVDEWKGFNWSNRVIHVTAVNQTKWWFAKRFLHP 179 Query: 558 DIVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXX 737 DIV EYNY+FLWDEDLGV+NF+P+ Y+ I++ EGL+ISQPALDP KSEVHHQIT+ Sbjct: 180 DIVEEYNYVFLWDEDLGVDNFNPKLYVDIIQSEGLEISQPALDPYKSEVHHQITARGRRS 239 Query: 738 XXXXXIYK-SSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDM 914 ++ S+GG+ CD NSTAPPCTGW+EMMAPVFSR AWRC WYMIQNDLIHAWGLDM Sbjct: 240 TVHRRTFRPSNGGKGCDVNSTAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDM 299 Query: 915 QLGYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAP 1094 QLGYCAQGDRT+NVGVVDSEY++H G PTLGG ++N+ + Sbjct: 300 QLGYCAQGDRTKNVGVVDSEYVIHYGRPTLGGPEENETS--------------------- 338 Query: 1095 SRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQ 1223 S+S V +R VRRQSYIEL++F+ RW+KA ++DECW DPY + Sbjct: 339 SKSHVKDHRADVRRQSYIELDVFRKRWQKAAEQDECWQDPYPE 381 >ref|XP_007029398.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508718003|gb|EOY09900.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 389 Score = 514 bits (1323), Expect = e-143 Identities = 241/400 (60%), Positives = 297/400 (74%), Gaps = 1/400 (0%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MK + + L+ + K SCLC L+P +LLC+++FIGS+F++ + KE+ WG+ D +Q Sbjct: 3 MKSIDCIPLVTERKSWSSCLCRLIPATALLCVIYFIGSSFVAPENKEKAFTWGVADILQT 62 Query: 204 RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 383 + CKN+CRP GSE LP+GI++KTSNL++RPLWG L A+AVGIKQK Sbjct: 63 SKVENCKNQCRPPGSEPLPEGIITKTSNLQLRPLWGFPKKDDTSSS---LFAVAVGIKQK 119 Query: 384 ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 563 + V+EMVKKF SS FAVMLFHYDG+VDEW+ EW+D+ IHVSA NQTKWWFAKRFLHPD+ Sbjct: 120 DLVHEMVKKFLSSGFAVMLFHYDGIVDEWKSFEWNDQVIHVSARNQTKWWFAKRFLHPDV 179 Query: 564 VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 743 V+EY+YIFLWDEDLGVE+FHP++Y+SIV+ E L+ISQPALDPAKSEVHHQIT+ Sbjct: 180 VSEYSYIFLWDEDLGVEDFHPKKYVSIVESERLEISQPALDPAKSEVHHQITARGRKSMV 239 Query: 744 XXXIYK-SSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 920 +K + GR CDG S APPCTGW+EMMAPVFSR AWRC WYMIQNDLIHAWGLDMQL Sbjct: 240 HRRTFKHRANGRSCDGQSKAPPCTGWIEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDMQL 299 Query: 921 GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSR 1100 GYCAQGDRT+N+GVVD+EYIVH PTLGG + ++ H ++ S Sbjct: 300 GYCAQGDRTKNIGVVDAEYIVHYNRPTLGGTAEKNHSTVEGGHRNKKS----------SH 349 Query: 1101 SRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQ 1220 S R VRRQSYIEL+IF+ RW+KAVK D+CW+DPYQ Sbjct: 350 SHWKDPRVEVRRQSYIELDIFRKRWEKAVKNDKCWVDPYQ 389 >ref|XP_007222756.1| hypothetical protein PRUPE_ppa006529mg [Prunus persica] gi|462419692|gb|EMJ23955.1| hypothetical protein PRUPE_ppa006529mg [Prunus persica] Length = 407 Score = 512 bits (1318), Expect = e-142 Identities = 249/406 (61%), Positives = 299/406 (73%), Gaps = 4/406 (0%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 + L N S L DPK R S CSL SL+C +FIG A I+ +YKER++RW ++ + QN Sbjct: 2 INLFNPASALPDPKNR-SFYCSLFIVASLICGAYFIGGASIAKEYKERLTRWKVIYTRQN 60 Query: 204 RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 383 + +TCKN C+P GSE LP+GIV+KTS+LE+RPLWG LLAIAVGIKQK Sbjct: 61 TKFDTCKNRCQPLGSEALPEGIVAKTSDLEVRPLWGSSVNNENSKPSMSLLAIAVGIKQK 120 Query: 384 ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 563 E V+ +VKKF SSDF VMLFHYDG VD+WRDL WSDR+IHVS +NQTKWWFAKRFLHPDI Sbjct: 121 EIVDRIVKKFLSSDFVVMLFHYDGAVDKWRDLNWSDRAIHVSVMNQTKWWFAKRFLHPDI 180 Query: 564 VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 743 V+EY YIFLWDEDLGVENF P+RYLSIV++EGL+ISQPALDP KS+V+H IT+ Sbjct: 181 VSEYEYIFLWDEDLGVENFDPKRYLSIVREEGLEISQPALDPDKSDVYHPITARVKKLKV 240 Query: 744 XXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 923 YK G +CD +S+APPC GWVEMMAPVFS+ AW+C WYMIQNDLIHAWGLD+QLG Sbjct: 241 HRRFYKFKGSGRCDNHSSAPPCAGWVEMMAPVFSKAAWQCVWYMIQNDLIHAWGLDVQLG 300 Query: 924 YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGAL----A 1091 YCAQGDRT+NVGVVDSEYIVH GLPTLG D NK + +++ A Sbjct: 301 YCAQGDRTKNVGVVDSEYIVHLGLPTLGVSDGNKAIMLKTRLDFYCLSPIHLSLCNIISA 360 Query: 1092 PSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPA 1229 PS S ++R VR QS+I+++IFK RW AVKED+CW+DP+Q A Sbjct: 361 PSASDKVNDRAKVRMQSFIDMQIFKERWSNAVKEDKCWVDPFQLSA 406 >ref|XP_006431495.1| hypothetical protein CICLE_v10001347mg [Citrus clementina] gi|557533617|gb|ESR44735.1| hypothetical protein CICLE_v10001347mg [Citrus clementina] Length = 358 Score = 509 bits (1310), Expect = e-141 Identities = 247/362 (68%), Positives = 282/362 (77%), Gaps = 5/362 (1%) Frame = +3 Query: 174 RWGLVDSIQNRQTNTCKNE-CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXX 350 RWGLV S+ + + TCKN+ CR G+E LP+GIVSKTSNLEMRPLW Sbjct: 2 RWGLVHSMYSAKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMN 61 Query: 351 LLAIAVGIKQKENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKW 530 LLAIA GIKQK+ V+++V+KFPS DF VMLFHYD VVDEW+DL W+DR+IHVSA NQTKW Sbjct: 62 LLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDSVVDEWKDLVWADRAIHVSAANQTKW 121 Query: 531 WFAKRFLHPDIVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHH 710 WFAKRFLHPDIVAEYNYIFLWDED+GVENF+PRRYLSIVKDEG +ISQPALDP KSEVHH Sbjct: 122 WFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGFEISQPALDPVKSEVHH 181 Query: 711 QITSXXXXXXXXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDL 890 IT+ +YK G +CD STAPPC GWVEMMAPVFSR AWRCAWYMIQNDL Sbjct: 182 PITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDL 241 Query: 891 IHAWGLDMQLGYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDS 1070 IHAWGLD+QLGYCAQGDRT+NVGVVDSEYIVH GLPTLG + + N+ QA D Sbjct: 242 IHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNA-----VGQASDD 296 Query: 1071 L----NIGALAPSRSRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQS 1238 L N ALAPS+SR + NRP VRRQSYIE++IF+NRWK AV++D+CW+DPY Q Q+ Sbjct: 297 LEQIANPVALAPSQSRRYDNRPEVRRQSYIEMQIFRNRWKHAVEDDKCWVDPYGQSTNQT 356 Query: 1239 SH 1244 SH Sbjct: 357 SH 358 >ref|XP_002325467.2| hypothetical protein POPTR_0019s08010g [Populus trichocarpa] gi|550316990|gb|EEE99848.2| hypothetical protein POPTR_0019s08010g [Populus trichocarpa] Length = 381 Score = 506 bits (1304), Expect = e-141 Identities = 253/400 (63%), Positives = 294/400 (73%) Frame = +3 Query: 42 VSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQNRQTNTC 221 VSL +D K RRS CS+ P S L L+FF AFI+ DYKER+SRWG+ D+ QN + + C Sbjct: 9 VSLPSDSKSRRSHWCSVFPAASFLFLIFFAVYAFIAPDYKERLSRWGIADTFQNFKFSNC 68 Query: 222 KNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEM 401 KN+CRP GSE+LP+GIVSKTSN +MRPLWG LLA+AVGI Q++ VN+M Sbjct: 69 KNQCRPPGSESLPEGIVSKTSNFQMRPLWGFPKNDENSSIN--LLAVAVGITQRDLVNKM 126 Query: 402 VKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNY 581 VKKF SS+F+VMLFHYDG+VDEWRD EW+DR IHVSA NQTKWWFAKRFLHPDIVA NY Sbjct: 127 VKKFLSSNFSVMLFHYDGIVDEWRDFEWNDRVIHVSARNQTKWWFAKRFLHPDIVAACNY 186 Query: 582 IFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXXXXXIYK 761 IFLWDEDLGVENF+P++Y+SIVK EGL ISQPALD KS VH QIT YK Sbjct: 187 IFLWDEDLGVENFNPKQYVSIVKSEGLHISQPALD-YKSLVHQQITVRASKSGVHRRTYK 245 Query: 762 SSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGD 941 CDGNSTAPPCTGWVEMMAPVFSR AWRC WYMIQNDLIHAWGLD QLGYC+QGD Sbjct: 246 PG---ICDGNSTAPPCTGWVEMMAPVFSRAAWRCVWYMIQNDLIHAWGLDYQLGYCSQGD 302 Query: 942 RTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRSRVHSNR 1121 RT+N+G+VD+EYIVH G PTLGG +N+ PSRS+ R Sbjct: 303 RTKNIGIVDAEYIVHYGHPTLGGVVENE---------------------EPSRSQKTDPR 341 Query: 1122 PAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSS 1241 VRRQS IEL IF+ RWK+AV+ED+CW+DPY++ K+SS Sbjct: 342 LEVRRQSLIELRIFQKRWKEAVEEDQCWIDPYKEAVKESS 381 >ref|XP_004301559.1| PREDICTED: uncharacterized protein LOC101306243 [Fragaria vesca subsp. vesca] Length = 397 Score = 505 bits (1301), Expect = e-140 Identities = 245/395 (62%), Positives = 291/395 (73%), Gaps = 1/395 (0%) Frame = +3 Query: 36 NFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQNRQTN 215 N VS+L+DPK R S CSL VSL+ +FIG A I+ +YKE+++RW + ++QN + Sbjct: 10 NPVSVLSDPKNR-SFYCSLFIVVSLVTGAYFIGGASIAKEYKEKLTRWKVTYTMQNTNLD 68 Query: 216 TCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVN 395 TCK C+P G+E LP+GIV+KTS+ ++RPLWG LLAIAVGIKQKE V+ Sbjct: 69 TCKKRCQPSGTEALPEGIVAKTSDFKIRPLWGTSKKDKNSTPSKSLLAIAVGIKQKEIVD 128 Query: 396 EMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEY 575 ++V+KF SSDF VMLFHYDG VD+WRDL WSD +IHVS +NQTKWWFAKRFLHPDIV EY Sbjct: 129 KIVRKFLSSDFVVMLFHYDGAVDKWRDLHWSDTAIHVSVMNQTKWWFAKRFLHPDIVTEY 188 Query: 576 NYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXXXXXI 755 +IFLWDEDLGVENF P RYLS++ DEGL+ISQPALDP KSEV+H IT+ Sbjct: 189 KHIFLWDEDLGVENFDPERYLSVIWDEGLEISQPALDPVKSEVYHPITARVKKSKVHRRF 248 Query: 756 YKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQ 935 YK G +CD S+ PPC GWVEMMAPVFSR AWRC WYMIQNDL+HAWGLD QLGYCAQ Sbjct: 249 YKFKGSGRCDDQSSGPPCIGWVEMMAPVFSRAAWRCVWYMIQNDLVHAWGLDEQLGYCAQ 308 Query: 936 GDRTQNVGVVDSEYIVHQGLPTLGGFDDNKP-NSQASSHTSQAKDSLNIGALAPSRSRVH 1112 GDR +NVGVVDSEYIVH GLPTLG DDNK N+ S +K ALAPS + Sbjct: 309 GDRMKNVGVVDSEYIVHLGLPTLGVTDDNKGINNMVHSQKEDSK------ALAPSGPPIP 362 Query: 1113 SNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPY 1217 S+R VR QS+I++ IFK RW+ AVKED CW+DPY Sbjct: 363 SDRAKVRMQSFIDMRIFKERWRSAVKEDNCWVDPY 397 >ref|XP_006385071.1| hypothetical protein POPTR_0004s23630g [Populus trichocarpa] gi|550341839|gb|ERP62868.1| hypothetical protein POPTR_0004s23630g [Populus trichocarpa] Length = 383 Score = 504 bits (1297), Expect = e-140 Identities = 247/407 (60%), Positives = 296/407 (72%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MK + S +DPKR S LCSLL +SL+C V+F+GSAF YKER++ WG+++++Q Sbjct: 1 MKTLSCASAPSDPKRG-SYLCSLLIALSLICSVYFVGSAFFGKQYKERITAWGVIEAMQT 59 Query: 204 RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 383 ++ CK+ CRP GSE LP+GIV+K SN +MRPLWG LLAIAVGIKQK Sbjct: 60 --SDICKDRCRPSGSEALPQGIVTKKSNYKMRPLWGSSLKNDNPPPSMSLLAIAVGIKQK 117 Query: 384 ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 563 VN++V+KFP SDF VMLFHYDGVVDEWRDL WS+ +IHVSA+NQTKWWFAKRFLHPDI Sbjct: 118 AIVNQIVEKFPLSDFVVMLFHYDGVVDEWRDLSWSNSAIHVSAVNQTKWWFAKRFLHPDI 177 Query: 564 VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 743 V+EYNYIFLWDEDLGVENF+PRRYLSIVKDEGL++SQPALDP++S VHHQIT+ Sbjct: 178 VSEYNYIFLWDEDLGVENFNPRRYLSIVKDEGLEVSQPALDPSRSTVHHQITARIRNSIV 237 Query: 744 XXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 923 I K G KC GNST+PPCTGWVEMMAPVFS+ AW+C WYMIQNDLIHAWGLD +LG Sbjct: 238 HRKILKFRGNTKCYGNSTSPPCTGWVEMMAPVFSKAAWQCTWYMIQNDLIHAWGLDRKLG 297 Query: 924 YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRS 1103 YCAQGD T+NVGVVD+EYIVH GL TLG F N +S + D + + Sbjct: 298 YCAQGDWTKNVGVVDAEYIVHLGLSTLGVF-----NGSEASISYVPYDRIIV-------- 344 Query: 1104 RVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQSSH 1244 VR QS +E+ IF RW+ A+KED CW+DPYQ + Q+ H Sbjct: 345 --------VRTQSSVEMNIFHERWEAAIKEDRCWVDPYQLISNQTRH 383 >ref|XP_007049550.1| Uncharacterized protein TCM_002621 [Theobroma cacao] gi|508701811|gb|EOX93707.1| Uncharacterized protein TCM_002621 [Theobroma cacao] Length = 385 Score = 499 bits (1285), Expect = e-138 Identities = 239/405 (59%), Positives = 295/405 (72%) Frame = +3 Query: 21 KMKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQ 200 K ++S +SL A+P+R+R LP + LL FFIGSAFI TDYKER+ W V +Q Sbjct: 2 KKRMSTSISLKAEPRRQRLFTHRFLPMILLLSAAFFIGSAFIITDYKERILGWRSVIVLQ 61 Query: 201 NRQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQ 380 ++ C+ +CR YGSE LPKGI+S+TS+LEMRPLWG LLAIAVGIKQ Sbjct: 62 YKRPKICETQCRAYGSEALPKGIISETSDLEMRPLWGLQNKKKPKLSMN-LLAIAVGIKQ 120 Query: 381 KENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPD 560 KE+VN++VKKFP+SDF VMLFHYDG+VD+W+DLEW+D +IHVSA+NQTKWWFAKRFLHPD Sbjct: 121 KESVNKIVKKFPASDFVVMLFHYDGIVDQWKDLEWNDLAIHVSAVNQTKWWFAKRFLHPD 180 Query: 561 IVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXX 740 IV+EY+YIFLWDEDLGV++F+ RYLSI+K EGL+ISQPALD KSE+HH IT+ Sbjct: 181 IVSEYSYIFLWDEDLGVDHFNAARYLSIIKKEGLEISQPALDVEKSELHHPITARDKKST 240 Query: 741 XXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 920 Y+ G +C+ NST PPCTG+VEMMAPVFSR +WRCAW+MIQ+DL++ WG+D QL Sbjct: 241 VHRRTYEVIGRTRCNENSTGPPCTGFVEMMAPVFSRASWRCAWHMIQSDLVYGWGVDFQL 300 Query: 921 GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSR 1100 GYCAQGDRTQ +G+VDSEY+VH LPTLGG N+ PS Sbjct: 301 GYCAQGDRTQKIGIVDSEYLVHNALPTLGGVAANE---------------------VPSP 339 Query: 1101 SRVHSNRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQ 1235 S R VR+QS+IELEIFKNRWK+AVK+D+CW DPY+ K+ Sbjct: 340 SSEPGGRSEVRKQSFIELEIFKNRWKRAVKQDKCWFDPYEPSTKK 384 >ref|XP_007041144.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] gi|508705079|gb|EOX96975.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] Length = 438 Score = 498 bits (1281), Expect = e-138 Identities = 237/358 (66%), Positives = 278/358 (77%), Gaps = 2/358 (0%) Frame = +3 Query: 21 KMKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQ 200 KMK N S+++DPK R SCLC L SL+C +FI AFI+ +YK+R+SRW +++ +Q Sbjct: 82 KMKAFNCASVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQ 140 Query: 201 NRQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQ 380 N ++N CK CRP GSE LP+GIV KTSNLEMRPLW LLAIAVGIKQ Sbjct: 141 NSKSNICKIRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQ 200 Query: 381 KENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPD 560 KE VN+++KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPD Sbjct: 201 KEIVNQIIKKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPD 260 Query: 561 IVAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXX 740 IVA+Y Y+FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQIT+ Sbjct: 261 IVADYKYLFLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSR 320 Query: 741 XXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 920 +YK G +CDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLDMQL Sbjct: 321 VHRRMYKFKGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQL 380 Query: 921 GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQ--AKDSLNIGAL 1088 GYCAQGDR +NVGVVD+EYIVH GL TLG +N+ NS + T + + DS +G + Sbjct: 381 GYCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNITRRQPSSDSETLGTI 438 >gb|EYU46099.1| hypothetical protein MIMGU_mgv1a007624mg [Mimulus guttatus] Length = 401 Score = 496 bits (1278), Expect = e-137 Identities = 244/397 (61%), Positives = 294/397 (74%), Gaps = 1/397 (0%) Frame = +3 Query: 51 LADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQNRQTNTCKNE 230 + +PKR+RS + S LP+ LL VFFIGSAF+ TDYKER + I+ ++ TC+ E Sbjct: 6 MPEPKRKRSFMWSCLPSAILLSAVFFIGSAFLVTDYKERFLGACNLYPIKATKSKTCEYE 65 Query: 231 CRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQKENVNEMVKK 410 CRP G+ETLP+GIVS+T+++EMRPL G LL IAVGIKQK+NVNE+VKK Sbjct: 66 CRPNGTETLPRGIVSRTTDMEMRPLSGPPKKKKLKSPMN-LLGIAVGIKQKQNVNEIVKK 124 Query: 411 FPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDIVAEYNYIFL 590 FP +DFAVMLFHYDG V+ WRDLEWS+ +HVSAINQTKWWFAKRFLHPD+VA+Y+YIFL Sbjct: 125 FPLTDFAVMLFHYDGNVNGWRDLEWSNSVVHVSAINQTKWWFAKRFLHPDVVAQYDYIFL 184 Query: 591 WDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXXXXXIYKSSG 770 WDEDLGVENFH RYLSIVK+EGLQISQPA+D KSEVH+++T G Sbjct: 185 WDEDLGVENFHAGRYLSIVKEEGLQISQPAIDAEKSEVHYKLTEREISSKVHRRAINLHG 244 Query: 771 -GRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRT 947 GR+C NS PPCTG+VEMMAPVFSR +WRCAW+MIQNDL+HAWGLD QLGYCAQG+RT Sbjct: 245 PGRRCYENSMEPPCTGFVEMMAPVFSRVSWRCAWHMIQNDLVHAWGLDFQLGYCAQGNRT 304 Query: 948 QNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSRSRVHSNRPA 1127 N+G+VDSEY++H GLPTLGG K N + +S K N G S R A Sbjct: 305 TNIGIVDSEYLIHLGLPTLGGSSGTKINDEVEKQSSPDKILPNAGKTEISAVEPSDERNA 364 Query: 1128 VRRQSYIELEIFKNRWKKAVKEDECWMDPYQQPAKQS 1238 VRR+S+IEL+ FKNRWKKAV+EDECW+DP Q P +Q+ Sbjct: 365 VRRESFIELDDFKNRWKKAVREDECWVDPLQTPPQQN 401 >ref|XP_007041145.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508705080|gb|EOX96976.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 374 Score = 494 bits (1273), Expect = e-137 Identities = 234/349 (67%), Positives = 272/349 (77%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MK N S+++DPK R SCLC L SL+C +FI AFI+ +YK+R+SRW +++ +QN Sbjct: 1 MKAFNCASVVSDPKTR-SCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQN 59 Query: 204 RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 383 ++N CK CRP GSE LP+GIV KTSNLEMRPLW LLAIAVGIKQK Sbjct: 60 SKSNICKIRCRPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQK 119 Query: 384 ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 563 E VN+++KKFPSSDF VMLFHYDG+VDEWRDLEWSD +IHVSA+NQTKWWFAKRFLHPDI Sbjct: 120 EIVNQIIKKFPSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDI 179 Query: 564 VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKSEVHHQITSXXXXXXX 743 VA+Y Y+FLWDEDLGV+NF P++YLSIV+DEGL+ISQPALDP KSEVHHQIT+ Sbjct: 180 VADYKYLFLWDEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRV 239 Query: 744 XXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQLG 923 +YK G +CDG STAPPC GWVEMMAPVFSR AWRCAWYMIQNDLIHAWGLDMQLG Sbjct: 240 HRRMYKFKGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLG 299 Query: 924 YCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDS 1070 YCAQGDR +NVGVVD+EYIVH GL TLG +N+ NS + T + S Sbjct: 300 YCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNITRRQPSS 348 >gb|EXC29927.1| hypothetical protein L484_015120 [Morus notabilis] Length = 382 Score = 493 bits (1270), Expect = e-137 Identities = 247/404 (61%), Positives = 292/404 (72%), Gaps = 5/404 (1%) Frame = +3 Query: 24 MKLSNFVSLLADPKRRRSCLCSLLPTVSLLCLVFFIGSAFISTDYKERVSRWGLVDSIQN 203 MK S F+SLL D RRSC CSL+P SLLCLV+FIGSAFI+ DYKE++S WG+ D++QN Sbjct: 1 MKPSYFLSLLVDSHSRRSCFCSLIPAASLLCLVYFIGSAFIAPDYKEKLSLWGVTDTLQN 60 Query: 204 RQTNTCKNECRPYGSETLPKGIVSKTSNLEMRPLWGXXXXXXXXXXXXXLLAIAVGIKQK 383 + N CKN+CRP GSE LP+GIV KTSNLE RPLWG L A+AVGIKQK Sbjct: 61 FKLNKCKNQCRPSGSEALPEGIVCKTSNLEFRPLWGSPKKIESSSVN--LFAVAVGIKQK 118 Query: 384 ENVNEMVKKFPSSDFAVMLFHYDGVVDEWRDLEWSDRSIHVSAINQTKWWFAKRFLHPDI 563 + VN+MV+KF SS+F VMLFHYDG VD+W+ EWSDR IHVSA+NQTKWWFAKRFLHPDI Sbjct: 119 DLVNKMVRKFLSSNFVVMLFHYDGNVDKWKTFEWSDRVIHVSAVNQTKWWFAKRFLHPDI 178 Query: 564 VAEYNYIFLWDEDLGVENFHPRRYLSIVKDEGLQISQPALDPAKS-EVHHQITSXXXXXX 740 VAEY+YIFLWDEDLGV++F P+ Y+SIV+ EGL+ISQPALDP KS E+HHQIT+ Sbjct: 179 VAEYDYIFLWDEDLGVDSFDPKLYISIVQSEGLEISQPALDPVKSVELHHQITARGRRST 238 Query: 741 XXXXIYKSSGGRKCDGNSTAPPCTGWVEMMAPVFSREAWRCAWYMIQNDLIHAWGLDMQL 920 YK G+ CD NS APPCTGW+EMMAPVFSR AWRCAW+MI Sbjct: 239 VHRRTYKH--GKGCDENSKAPPCTGWIEMMAPVFSRAAWRCAWFMI-------------- 282 Query: 921 GYCAQGDRTQNVGVVDSEYIVHQGLPTLGGFDDNKPNSQASSHTSQAKDSLNIGALAPSR 1100 QGDRT++VGVVD+EY+VH G TLGG D NK S A + K+ ++ P Sbjct: 283 ----QGDRTKSVGVVDAEYVVHHGRSTLGGGDGNKTKSSAKNRIYGRKNITSMEISPPLH 338 Query: 1101 SRVHS----NRPAVRRQSYIELEIFKNRWKKAVKEDECWMDPYQ 1220 S HS +R AVRRQSYIEL+IFK RW KAV+ED+CW+DPYQ Sbjct: 339 SHSHSHPKDHRAAVRRQSYIELDIFKKRWVKAVQEDKCWVDPYQ 382