BLASTX nr result
ID: Zanthoxylum22_contig00013259
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00013259 (1694 letters)

Database: ./nr
77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|KDO84801.1| hypothetical protein CISIN_1g0013741mg [Citrus si...   664   0.0
gb|KDO84799.1| hypothetical protein CISIN_1g0013741mg, partial [...   664   0.0
ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citr...   663   0.0
ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-lik...   660   0.0
ref|XP_007017758.1| Uncharacterized protein isoform 4 [Theobroma...   528   e-147
ref|XP_007017757.1| Uncharacterized protein isoform 3 [Theobroma...   528   e-147
ref|XP_007017755.1| Uncharacterized protein isoform 1 [Theobroma...   528   e-147
ref|XP_012073826.1| PREDICTED: filament-like plant protein 4 [Ja...   517   e-143
ref|XP_002510512.1| Myosin heavy chain, striated muscle, putativ...   514   e-143
ref|XP_007017761.1| Uncharacterized protein isoform 7 [Theobroma...   498   e-138
ref|XP_002306918.2| hypothetical protein POPTR_0005s25830g [Popu...   496   e-137
ref|XP_011028982.1| PREDICTED: filament-like plant protein 4 [Po...   495   e-137
gb|KHG12402.1| Filament-like plant protein 4 [Gossypium arboreum]     494   e-137
ref|XP_012465872.1| PREDICTED: filament-like plant protein 4 iso...   492   e-136
ref|XP_012465864.1| PREDICTED: filament-like plant protein 4 iso...   492   e-136
ref|XP_012465881.1| PREDICTED: filament-like plant protein 4 iso...   492   e-136
ref|XP_011045118.1| PREDICTED: filament-like plant protein 4 [Po...   491   e-136
ref|XP_010104432.1| hypothetical protein L484_016031 [Morus nota...   487   e-134
ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Popu...
484 e-133 ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Popu... 484 e-133 >gb|KDO84801.1| hypothetical protein CISIN_1g0013741mg [Citrus sinensis] Length = 1015 Score = 664 bits (1714), Expect = 0.0 Identities = 343/440 (77%), Positives = 381/440 (86%), Gaps = 2/440 (0%) Frame = +2 Query: 2 ASCESEEVKCSDVSCNTEAYPGDAGLNTEKKIDLTVLNITQELVAAITQIHGFVSSLRKD 181 A+C S+EVKCSDVSC+ EAYPGDA LNTE+KIDLTV I+QELVAAITQIH FV L K+ Sbjct: 545 ANCISDEVKCSDVSCSAEAYPGDASLNTERKIDLTVQVISQELVAAITQIHDFVLFLGKE 604 Query: 182 ARAVHEATNDNGFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRIDVIGYKD 361 ARAVH+ TN+NGF QK EEF VSFNKVIDSNT LVDFVFALS+VLAKASELRI+V+GYKD Sbjct: 605 ARAVHDTTNENGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRINVMGYKD 664 Query: 362 TEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIVTGYESK 541 TEIEP+SPDCIDKVALPENKVI++DT+G+ YPNGCA+ISNPTSDPEVPDDG+IV YES+ Sbjct: 665 TEIEPNSPDCIDKVALPENKVIKKDTSGERYPNGCAHISNPTSDPEVPDDGSIVAAYESE 724 Query: 542 TTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLASAQNSN 721 TTACKFS E+ EELKLEKDN+ATDLARCTE LEMTKSQL+ TEQLLAEVK+QLASAQ SN Sbjct: 725 TTACKFSLEEFEELKLEKDNLATDLARCTENLEMTKSQLYETEQLLAEVKAQLASAQKSN 784 Query: 722 SLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALARCKELE 901 SLAETQLKCMAESYRSLET AQELE EVNLL+AKIESL ELQDEKMSHH+A+A+CKELE Sbjct: 785 SLAETQLKCMAESYRSLETHAQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELE 844 Query: 902 EQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRPQSEVIGS 1081 EQLQRNENCAVCSS+AD++KI TI LLGKQLK+LRPQSEVIGS Sbjct: 845 EQLQRNENCAVCSSEADENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQSEVIGS 904 Query: 1082 PYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANAT--RAGAESPLDSYTSPCSP 1255 PYSERSQKGE P EPAT S LQ+FD+AEMD+ SANA R GAESPLD YTSPCSP Sbjct: 905 PYSERSQKGEFLPGEPATAS---LQEFDHAEMDSVTSANAQPHRVGAESPLDLYTSPCSP 961 Query: 1256 SDNEASINKSPIHSRHPQHR 1315 S+NEASINKSPI+S+HP+HR Sbjct: 962 SENEASINKSPINSKHPKHR 981 >gb|KDO84799.1| hypothetical protein CISIN_1g0013741mg, partial [Citrus sinensis] gi|641866115|gb|KDO84800.1| hypothetical protein 
CISIN_1g0013741mg, partial [Citrus sinensis] Length = 1050 Score = 664 bits (1714), Expect = 0.0 Identities = 343/440 (77%), Positives = 381/440 (86%), Gaps = 2/440 (0%) Frame = +2 Query: 2 ASCESEEVKCSDVSCNTEAYPGDAGLNTEKKIDLTVLNITQELVAAITQIHGFVSSLRKD 181 A+C S+EVKCSDVSC+ EAYPGDA LNTE+KIDLTV I+QELVAAITQIH FV L K+ Sbjct: 580 ANCISDEVKCSDVSCSAEAYPGDASLNTERKIDLTVQVISQELVAAITQIHDFVLFLGKE 639 Query: 182 ARAVHEATNDNGFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRIDVIGYKD 361 ARAVH+ TN+NGF QK EEF VSFNKVIDSNT LVDFVFALS+VLAKASELRI+V+GYKD Sbjct: 640 ARAVHDTTNENGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRINVMGYKD 699 Query: 362 TEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIVTGYESK 541 TEIEP+SPDCIDKVALPENKVI++DT+G+ YPNGCA+ISNPTSDPEVPDDG+IV YES+ Sbjct: 700 TEIEPNSPDCIDKVALPENKVIKKDTSGERYPNGCAHISNPTSDPEVPDDGSIVAAYESE 759 Query: 542 TTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLASAQNSN 721 TTACKFS E+ EELKLEKDN+ATDLARCTE LEMTKSQL+ TEQLLAEVK+QLASAQ SN Sbjct: 760 TTACKFSLEEFEELKLEKDNLATDLARCTENLEMTKSQLYETEQLLAEVKAQLASAQKSN 819 Query: 722 SLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALARCKELE 901 SLAETQLKCMAESYRSLET AQELE EVNLL+AKIESL ELQDEKMSHH+A+A+CKELE Sbjct: 820 SLAETQLKCMAESYRSLETHAQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELE 879 Query: 902 EQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRPQSEVIGS 1081 EQLQRNENCAVCSS+AD++KI TI LLGKQLK+LRPQSEVIGS Sbjct: 880 EQLQRNENCAVCSSEADENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQSEVIGS 939 Query: 1082 PYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANAT--RAGAESPLDSYTSPCSP 1255 PYSERSQKGE P EPAT S LQ+FD+AEMD+ SANA R GAESPLD YTSPCSP Sbjct: 940 PYSERSQKGEFLPGEPATAS---LQEFDHAEMDSVTSANAQPHRVGAESPLDLYTSPCSP 996 Query: 1256 SDNEASINKSPIHSRHPQHR 1315 S+NEASINKSPI+S+HP+HR Sbjct: 997 SENEASINKSPINSKHPKHR 1016 >ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|567885183|ref|XP_006435150.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|557537271|gb|ESR48389.1| hypothetical protein CICLE_v10000102mg [Citrus 
clementina] gi|557537272|gb|ESR48390.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] Length = 1091 Score = 663 bits (1710), Expect = 0.0 Identities = 342/440 (77%), Positives = 381/440 (86%), Gaps = 2/440 (0%) Frame = +2 Query: 2 ASCESEEVKCSDVSCNTEAYPGDAGLNTEKKIDLTVLNITQELVAAITQIHGFVSSLRKD 181 A+C SEEVKCSDVSC+ EAYPGDA LNTE+KIDLTV I+QELVAAI+QIH FV L K+ Sbjct: 621 ANCISEEVKCSDVSCSAEAYPGDASLNTERKIDLTVQVISQELVAAISQIHDFVLFLGKE 680 Query: 182 ARAVHEATNDNGFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRIDVIGYKD 361 ARAVH+ TN+NGF QK EEF VSFNKVIDSNT LVDFVFALS+VLAKASELRI+V+GYKD Sbjct: 681 ARAVHDTTNENGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRINVMGYKD 740 Query: 362 TEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIVTGYESK 541 TEIEP+SPDCIDKVALPENKVI++DT+G+ YPNGCA+ISNPTSDPEVPDDG+IV YES+ Sbjct: 741 TEIEPNSPDCIDKVALPENKVIKKDTSGERYPNGCAHISNPTSDPEVPDDGSIVAAYESE 800 Query: 542 TTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLASAQNSN 721 TTACKF+ E+ EELKLEKDN+ATDLARCTE LEMTKSQL+ TEQLLAEVK+QLASAQ SN Sbjct: 801 TTACKFTLEEFEELKLEKDNLATDLARCTENLEMTKSQLYETEQLLAEVKAQLASAQKSN 860 Query: 722 SLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALARCKELE 901 SLAETQLKCMAESYRSLET AQELE EVNLL+AKIESL ELQDEKMSHH+A+A+CKELE Sbjct: 861 SLAETQLKCMAESYRSLETHAQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELE 920 Query: 902 EQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRPQSEVIGS 1081 EQLQRNENCAVCSS+AD++KI TI LLGKQLK+LRPQSEVIGS Sbjct: 921 EQLQRNENCAVCSSEADENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQSEVIGS 980 Query: 1082 PYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANAT--RAGAESPLDSYTSPCSP 1255 PYSERSQKGE P EPAT S LQ+FD+AEMD+ SANA R GAESPLD YTSPCSP Sbjct: 981 PYSERSQKGEFLPGEPATAS---LQEFDHAEMDSVTSANAQPHRVGAESPLDLYTSPCSP 1037 Query: 1256 SDNEASINKSPIHSRHPQHR 1315 S+NEASINKSPI+S+HP+HR Sbjct: 1038 SENEASINKSPINSKHPKHR 1057 >ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-like isoform X1 [Citrus sinensis] gi|568839322|ref|XP_006473633.1| PREDICTED: filament-like plant protein 4-like isoform X2 [Citrus 
sinensis] Length = 1091 Score = 660 bits (1703), Expect = 0.0 Identities = 342/440 (77%), Positives = 379/440 (86%), Gaps = 2/440 (0%) Frame = +2 Query: 2 ASCESEEVKCSDVSCNTEAYPGDAGLNTEKKIDLTVLNITQELVAAITQIHGFVSSLRKD 181 A+C SEEVKCSDVSC+ EAYPGDA LNTE+KIDLTV I+QELVAAITQIH FV L K+ Sbjct: 621 ANCISEEVKCSDVSCSAEAYPGDARLNTERKIDLTVQVISQELVAAITQIHDFVLFLGKE 680 Query: 182 ARAVHEATNDNGFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRIDVIGYKD 361 ARAVH+ TN+NGF QK EEF VSFNKVIDSNT LVDFVFALS+VLAKASELRI+V+GYKD Sbjct: 681 ARAVHDTTNENGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRINVMGYKD 740 Query: 362 TEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIVTGYESK 541 TEIEP+SPDCIDKVALPENKVI++DT+G+ YPNGCA+ISNPTSDPEVPDDG+IV YES+ Sbjct: 741 TEIEPNSPDCIDKVALPENKVIKKDTSGERYPNGCAHISNPTSDPEVPDDGSIVAAYESE 800 Query: 542 TTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLASAQNSN 721 TTACKFS E+ EELKLEKDN+ATDLARCTE LEMTKSQL+ TEQLLAEVK+QLASAQ SN Sbjct: 801 TTACKFSLEEFEELKLEKDNLATDLARCTENLEMTKSQLYETEQLLAEVKAQLASAQKSN 860 Query: 722 SLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALARCKELE 901 SLAETQLKCMAESYRSLET AQELE EVNLL+AKIESL ELQDEKMSHH+A+A+CKELE Sbjct: 861 SLAETQLKCMAESYRSLETHAQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELE 920 Query: 902 EQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRPQSEVIGS 1081 EQLQRNENCAVCSS+AD++KI TI LLGKQLK+LRPQSEVIGS Sbjct: 921 EQLQRNENCAVCSSEADENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQSEVIGS 980 Query: 1082 PYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANAT--RAGAESPLDSYTSPCSP 1255 PYSERS KGE P EPAT S LQ+FD+AE D+ SANA R GAESPLD YTSPCSP Sbjct: 981 PYSERSPKGEFLPGEPATAS---LQEFDHAETDSVTSANAQPHRVGAESPLDLYTSPCSP 1037 Query: 1256 SDNEASINKSPIHSRHPQHR 1315 S+NEASINKSPI+S+HP+HR Sbjct: 1038 SENEASINKSPINSKHPKHR 1057 >ref|XP_007017758.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508723086|gb|EOY14983.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 947 Score = 528 bits (1361), Expect = e-147 Identities = 281/445 (63%), Positives = 339/445 (76%), Gaps = 11/445 (2%) Frame = +2 
Query: 14 SEEVKCSDVSCNTEAYPGDAGLNTEKKIDLT---------VLNITQELVAAITQIHGFVS 166 SEEV SD +C +A+ G L EK+I ++ V ++QEL AAI+QIH FV Sbjct: 470 SEEVHGSDGTCIGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVL 529 Query: 167 SLRKDARAVHEATND-NGFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRID 343 SL K+ARAV + +D N K EEFSV++NKV+ SN +L DF+F LS +LAKAS+LR++ Sbjct: 530 SLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVN 589 Query: 344 VIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIV 523 V+GYKD E E +SPDCIDKV LPENKVIQ+D++G Y NGCA+ISNPTS+PEVPDDGN+V Sbjct: 590 VLGYKDNEEEINSPDCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLV 649 Query: 524 TGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLA 703 + YESK + KFS E+ EELKLEK+NMA DLARCTE LEMTKSQLH TEQLLAE KSQLA Sbjct: 650 SDYESKQSR-KFSSEEFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLA 708 Query: 704 SAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALA 883 SAQ SNSLAETQLKCMAESYRSLETRA ELETEVNLL+ KIE+L E QDEK SHHD LA Sbjct: 709 SAQKSNSLAETQLKCMAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLA 768 Query: 884 RCKELEEQLQRNENCAVCSSKADDD-KIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRP 1060 RCKELEEQLQRNENC+ C++ AD+D K TIFLLGKQLK+LRP Sbjct: 769 RCKELEEQLQRNENCSACAAAADNDLKNKQEKELAAAAEKLAECQETIFLLGKQLKSLRP 828 Query: 1061 QSEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSYT 1240 Q++++GSPY+ERSQKGEG E+ TTS +NLQD D E+DTA S NA+R GAESP++ Sbjct: 829 QTDMMGSPYNERSQKGEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLI 888 Query: 1241 SPCSPSDNEASINKSPIHSRHPQHR 1315 SP SPSD +A++ +SPI+S HP+H+ Sbjct: 889 SPSSPSDTDANLLRSPINSNHPKHK 913 >ref|XP_007017757.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508723085|gb|EOY14982.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1106 Score = 528 bits (1361), Expect = e-147 Identities = 281/445 (63%), Positives = 339/445 (76%), Gaps = 11/445 (2%) Frame = +2 Query: 14 SEEVKCSDVSCNTEAYPGDAGLNTEKKIDLT---------VLNITQELVAAITQIHGFVS 166 SEEV SD +C +A+ G L EK+I ++ V ++QEL AAI+QIH FV Sbjct: 629 
SEEVHGSDGTCIGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVL 688 Query: 167 SLRKDARAVHEATND-NGFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRID 343 SL K+ARAV + +D N K EEFSV++NKV+ SN +L DF+F LS +LAKAS+LR++ Sbjct: 689 SLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVN 748 Query: 344 VIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIV 523 V+GYKD E E +SPDCIDKV LPENKVIQ+D++G Y NGCA+ISNPTS+PEVPDDGN+V Sbjct: 749 VLGYKDNEEEINSPDCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLV 808 Query: 524 TGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLA 703 + YESK + KFS E+ EELKLEK+NMA DLARCTE LEMTKSQLH TEQLLAE KSQLA Sbjct: 809 SDYESKQSR-KFSSEEFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLA 867 Query: 704 SAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALA 883 SAQ SNSLAETQLKCMAESYRSLETRA ELETEVNLL+ KIE+L E QDEK SHHD LA Sbjct: 868 SAQKSNSLAETQLKCMAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLA 927 Query: 884 RCKELEEQLQRNENCAVCSSKADDD-KIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRP 1060 RCKELEEQLQRNENC+ C++ AD+D K TIFLLGKQLK+LRP Sbjct: 928 RCKELEEQLQRNENCSACAAAADNDLKNKQEKELAAAAEKLAECQETIFLLGKQLKSLRP 987 Query: 1061 QSEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSYT 1240 Q++++GSPY+ERSQKGEG E+ TTS +NLQD D E+DTA S NA+R GAESP++ Sbjct: 988 QTDMMGSPYNERSQKGEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLI 1047 Query: 1241 SPCSPSDNEASINKSPIHSRHPQHR 1315 SP SPSD +A++ +SPI+S HP+H+ Sbjct: 1048 SPSSPSDTDANLLRSPINSNHPKHK 1072 >ref|XP_007017755.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508723083|gb|EOY14980.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1102 Score = 528 bits (1361), Expect = e-147 Identities = 281/445 (63%), Positives = 339/445 (76%), Gaps = 11/445 (2%) Frame = +2 Query: 14 SEEVKCSDVSCNTEAYPGDAGLNTEKKIDLT---------VLNITQELVAAITQIHGFVS 166 SEEV SD +C +A+ G L EK+I ++ V ++QEL AAI+QIH FV Sbjct: 625 SEEVHGSDGTCIGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVL 684 Query: 167 SLRKDARAVHEATND-NGFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRID 343 SL 
K+ARAV + +D N K EEFSV++NKV+ SN +L DF+F LS +LAKAS+LR++ Sbjct: 685 SLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVN 744 Query: 344 VIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIV 523 V+GYKD E E +SPDCIDKV LPENKVIQ+D++G Y NGCA+ISNPTS+PEVPDDGN+V Sbjct: 745 VLGYKDNEEEINSPDCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLV 804 Query: 524 TGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLA 703 + YESK + KFS E+ EELKLEK+NMA DLARCTE LEMTKSQLH TEQLLAE KSQLA Sbjct: 805 SDYESKQSR-KFSSEEFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLA 863 Query: 704 SAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALA 883 SAQ SNSLAETQLKCMAESYRSLETRA ELETEVNLL+ KIE+L E QDEK SHHD LA Sbjct: 864 SAQKSNSLAETQLKCMAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLA 923 Query: 884 RCKELEEQLQRNENCAVCSSKADDD-KIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRP 1060 RCKELEEQLQRNENC+ C++ AD+D K TIFLLGKQLK+LRP Sbjct: 924 RCKELEEQLQRNENCSACAAAADNDLKNKQEKELAAAAEKLAECQETIFLLGKQLKSLRP 983 Query: 1061 QSEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSYT 1240 Q++++GSPY+ERSQKGEG E+ TTS +NLQD D E+DTA S NA+R GAESP++ Sbjct: 984 QTDMMGSPYNERSQKGEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLI 1043 Query: 1241 SPCSPSDNEASINKSPIHSRHPQHR 1315 SP SPSD +A++ +SPI+S HP+H+ Sbjct: 1044 SPSSPSDTDANLLRSPINSNHPKHK 1068 >ref|XP_012073826.1| PREDICTED: filament-like plant protein 4 [Jatropha curcas] gi|802607480|ref|XP_012073827.1| PREDICTED: filament-like plant protein 4 [Jatropha curcas] gi|802607482|ref|XP_012073828.1| PREDICTED: filament-like plant protein 4 [Jatropha curcas] gi|643729007|gb|KDP36944.1| hypothetical protein JCGZ_08235 [Jatropha curcas] Length = 1074 Score = 517 bits (1331), Expect = e-143 Identities = 275/447 (61%), Positives = 328/447 (73%), Gaps = 10/447 (2%) Frame = +2 Query: 5 SCESEEVKCSDVSCNTEAYPGDAGLNTEKKIDLT---------VLNITQELVAAITQIHG 157 SC SEEV D + N + P DA L EK+I L+ V +++QEL AAI+ IH Sbjct: 598 SCVSEEVVTVDATSNGQTCPKDASLTGEKEITLSQDIKASTEAVHSVSQELAAAISSIHD 657 Query: 158 
FVSSLRKDARAVHEATNDNGFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELR 337 FV L K+A VH+ ++D G QK EEFSV+ NKV++ NT+LVDF+F LSHVLAKASELR Sbjct: 658 FVLFLGKEAMVVHDTSSDGGLSQKIEEFSVTSNKVLNGNTSLVDFIFDLSHVLAKASELR 717 Query: 338 IDVIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGN 517 +V+GYK +E E +SPDCIDKVALPENKV+QRD +G+ Y NGCA+IS+PTS+PEVPDDGN Sbjct: 718 FNVLGYKCSEGEINSPDCIDKVALPENKVLQRDCSGERYQNGCAHISSPTSNPEVPDDGN 777 Query: 518 IVTGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQ 697 +V+GY S TT CK S E+ EELK EKDNMA DLARCTE LEMTKSQLH TEQLLAE K+Q Sbjct: 778 LVSGYGSNTTLCKVSLEEFEELKTEKDNMAMDLARCTENLEMTKSQLHETEQLLAEAKAQ 837 Query: 698 LASAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDA 877 L SAQ SNSL+ETQLKCMAESYRSLE RA+ELETEVN+L+AK +L ELQ+EK H DA Sbjct: 838 LTSAQKSNSLSETQLKCMAESYRSLEARAEELETEVNILRAKAGTLENELQEEKRCHWDA 897 Query: 878 LARCKELEEQLQRNENCAVCSSKADDD-KIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTL 1054 L R KELEEQLQ E+C+VCS+ AD D K TIFLLGKQLK L Sbjct: 898 LTRSKELEEQLQTKESCSVCSAAADADLKAKQERELTAAAEKLAECQETIFLLGKQLKAL 957 Query: 1055 RPQSEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDS 1234 RPQ+E++GSPYSERSQ+GEGF ++ TTS +NLQDFD AEMD S N + G ESP D Sbjct: 958 RPQTEIMGSPYSERSQRGEGFGDDEPTTSGMNLQDFDQAEMDATVSTNLPKTGGESPTDF 1017 Query: 1235 YTSPCSPSDNEASINKSPIHSRHPQHR 1315 Y + SD E S+++SPI S+ PQHR Sbjct: 1018 Y----NQSDAETSLSRSPISSKQPQHR 1040 >ref|XP_002510512.1| Myosin heavy chain, striated muscle, putative [Ricinus communis] gi|223551213|gb|EEF52699.1| Myosin heavy chain, striated muscle, putative [Ricinus communis] Length = 1041 Score = 514 bits (1325), Expect = e-143 Identities = 271/448 (60%), Positives = 335/448 (74%), Gaps = 11/448 (2%) Frame = +2 Query: 5 SCESEEVKCSDVSCNTEAYPGDAGLNTEKKIDL---------TVLNITQELVAAITQIHG 157 S SE+V+ +D +C P A + +K+I L TV ++ QEL A++ IH Sbjct: 565 SSVSEDVRATDATC-----PEYASITGDKEITLFQDTNAATDTVRSVNQELATAVSSIHD 619 Query: 158 FVSSLRKDARAVHEATNDNGFI-QKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASEL 334 FV L K+A AVH+ ++D + QK E FSV+FNKV++ NT+L+DF+F LS 
VLAKASEL Sbjct: 620 FVLFLGKEAMAVHDTSSDGSDLSQKIEHFSVTFNKVLNGNTSLIDFIFYLSCVLAKASEL 679 Query: 335 RIDVIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDG 514 R +V+GYK +E E +S DCIDKVALPENKV+QRD++G+ Y N CA+IS+PTS+PEVPDDG Sbjct: 680 RFNVLGYKGSEAEINSSDCIDKVALPENKVLQRDSSGESYQNSCAHISSPTSNPEVPDDG 739 Query: 515 NIVTGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKS 694 ++V+GY S TT CK S E+ EELK EK+N+A DLARCTE LEMTKSQLH TEQLLAE KS Sbjct: 740 SLVSGYGSNTTLCKVSLEEFEELKSEKNNVALDLARCTENLEMTKSQLHETEQLLAEAKS 799 Query: 695 QLASAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHD 874 QLASAQ SNSLAETQLKCMAESYRSLE RA+ELETEVNLLQAK E+L ELQDEK H D Sbjct: 800 QLASAQKSNSLAETQLKCMAESYRSLEARAEELETEVNLLQAKAETLENELQDEKQCHWD 859 Query: 875 ALARCKELEEQLQRNENCAVCSSKAD-DDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKT 1051 AL+R KELEEQLQ E+C+VCS+ AD ++K TIFLLGKQLK Sbjct: 860 ALSRSKELEEQLQTKESCSVCSAAADAENKANQDRELAAAAEKLAECQETIFLLGKQLKA 919 Query: 1052 LRPQSEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLD 1231 LRPQ+E++GS YSERS+KG+GF E+ TTS +NLQDFD AEMD S N RAGAESP+D Sbjct: 920 LRPQTELMGSAYSERSRKGDGFAEDEPTTSGMNLQDFDQAEMDAIVSTNHHRAGAESPMD 979 Query: 1232 SYTSPCSPSDNEASINKSPIHSRHPQHR 1315 Y PCSPSD E+++++SP++S+ P+HR Sbjct: 980 LYNQPCSPSDTESNLSRSPLNSKQPKHR 1007 >ref|XP_007017761.1| Uncharacterized protein isoform 7 [Theobroma cacao] gi|508723089|gb|EOY14986.1| Uncharacterized protein isoform 7 [Theobroma cacao] Length = 1107 Score = 498 bits (1283), Expect = e-138 Identities = 268/446 (60%), Positives = 328/446 (73%), Gaps = 12/446 (2%) Frame = +2 Query: 14 SEEVKCSDVSCNTEAYPGDAGLNTEKKIDLT---------VLNITQELVAAITQIHGFVS 166 SEEV SD +C +A+ G L EK+I ++ V ++QEL AAI+QIH FV Sbjct: 629 SEEVHGSDGTCIGQAHNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVL 688 Query: 167 SLRKDARAVHEATND-NGFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRID 343 SL K+ARAV + +D N K EEFSV++NKV+ SN +L DF+F LS +LAKAS+LR++ Sbjct: 689 SLGKEARAVDDICSDGNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVN 748 Query: 344 
VIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIV 523 V+GYKD E E +SPDCIDKV LPENKVIQ+D++G Y NGCA+ISNPTS+PEVPDDGN+V Sbjct: 749 VLGYKDNEEEINSPDCIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLV 808 Query: 524 TGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLA 703 + YESK + KFS E+ EELKLEK+NMA DLARCTE LEMTKSQLH TEQLLAE KSQLA Sbjct: 809 SDYESKQSR-KFSSEEFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLA 867 Query: 704 SAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALA 883 SAQ SNSLAETQLKCMAESYRSLETRA ELETEVNLL+ KIE+L E QDEK SHHD LA Sbjct: 868 SAQKSNSLAETQLKCMAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLA 927 Query: 884 RCKELEEQLQRNENCAVCSSKADDD--KIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLR 1057 RCKELEEQLQRNENC+ C++ AD+D I+L+ + Sbjct: 928 RCKELEEQLQRNENCSACAAAADNDLKNKQVSVYFNLCILRWILPNPLIYLILLPRNIIY 987 Query: 1058 PQSEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSY 1237 ++++GSPY+ERSQKGEG E+ TTS +NLQD D E+DTA S NA+R GAESP++ Sbjct: 988 SCTDMMGSPYNERSQKGEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPL 1047 Query: 1238 TSPCSPSDNEASINKSPIHSRHPQHR 1315 SP SPSD +A++ +SPI+S HP+H+ Sbjct: 1048 ISPSSPSDTDANLLRSPINSNHPKHK 1073 >ref|XP_002306918.2| hypothetical protein POPTR_0005s25830g [Populus trichocarpa] gi|550339754|gb|EEE93914.2| hypothetical protein POPTR_0005s25830g [Populus trichocarpa] Length = 1077 Score = 496 bits (1277), Expect = e-137 Identities = 266/448 (59%), Positives = 326/448 (72%), Gaps = 11/448 (2%) Frame = +2 Query: 2 ASCESEEVKCSDVSCNT-EAYPGDAGLNTEKKIDL---------TVLNITQELVAAITQI 151 ASC S+E CSD + + + P DAG+ EK+I+L + ++QEL+ AI+QI Sbjct: 604 ASCVSKEAHCSDATTHDRQTCPEDAGIMGEKEIELFQESKTAAQIMHTVSQELLPAISQI 663 Query: 152 HGFVSSLRKDARAVHEATNDN-GFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKAS 328 H FV L K+A VH+ + D+ G QK +EFS++FNKV+ S+ +LVDFV L+H+LA AS Sbjct: 664 HDFVLLLGKEAMTVHDTSCDSIGLSQKIKEFSITFNKVLYSDRSLVDFVSDLAHILALAS 723 Query: 329 ELRIDVIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPD 508 LR +V+GYK E E SSPDCIDK+ALPENKV+Q++++ + Y NGCA IS+PTS+PEVPD Sbjct: 724 
GLRFNVLGYKGNEAEISSPDCIDKIALPENKVVQKNSSVETYQNGCANISSPTSNPEVPD 783 Query: 509 DGNIVTGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEV 688 DGN+V GY S TT+CK S E+ EELK EKDNMA DLARCTE EMTKSQLH TEQLLAEV Sbjct: 784 DGNLVLGYGSNTTSCKVSLEEFEELKSEKDNMAMDLARCTENFEMTKSQLHETEQLLAEV 843 Query: 689 KSQLASAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSH 868 KSQLASAQ SNSLAETQLKCM ESYRSLETRAQELETEVNLL+ K E+L LQ+EK SH Sbjct: 844 KSQLASAQKSNSLAETQLKCMTESYRSLETRAQELETEVNLLRLKTETLENVLQEEKKSH 903 Query: 869 HDALARCKELEEQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLK 1048 AL RCKELEEQLQ NE+ V + +K TIFLLGKQL Sbjct: 904 QGALTRCKELEEQLQTNESSTVTDIECKQEK-----EIAAAAEKLAECQETIFLLGKQLN 958 Query: 1049 TLRPQSEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPL 1228 +L PQ+E++GSPYSERSQ G+ F E+ TTS +NLQDFD AEMDT AN +AGAESP+ Sbjct: 959 SLCPQTEIMGSPYSERSQIGDVFAEDEPTTSGMNLQDFDQAEMDTGGLANIHKAGAESPI 1018 Query: 1229 DSYTSPCSPSDNEASINKSPIHSRHPQH 1312 +SY PCSPSD E+S+ +SP+ S+ P+H Sbjct: 1019 NSYNHPCSPSDTESSLLRSPVASKPPKH 1046 >ref|XP_011028982.1| PREDICTED: filament-like plant protein 4 [Populus euphratica] gi|743851394|ref|XP_011028983.1| PREDICTED: filament-like plant protein 4 [Populus euphratica] Length = 1081 Score = 495 bits (1274), Expect = e-137 Identities = 271/450 (60%), Positives = 322/450 (71%), Gaps = 12/450 (2%) Frame = +2 Query: 2 ASCESEEVKCSDVSCNTEAYPGDAGLNTEKKIDL---------TVLNITQELVAAITQIH 154 ASC S+EV SD +C + P DA + EK+I L T+ +++EL+AAI+QIH Sbjct: 603 ASCGSKEVHHSDATCERQTCPEDAVIMGEKEITLLQESKAATHTMHTVSEELLAAISQIH 662 Query: 155 GFVSSLRKDARAVHEATNDN-GFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASE 331 FV L K+A AVH+ + D+ G QK EEFSV+F KV+ S+ +L+DF+F LS VLA AS Sbjct: 663 DFVLLLGKEAMAVHDTSCDSIGLSQKIEEFSVTFKKVLCSDRSLIDFMFDLSRVLALASG 722 Query: 332 LRIDVIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDD 511 LR +V+GYK E E SSPDCIDKVALPENKVIQ D+ G+ + NGCA IS+PTS+PEVPD Sbjct: 723 LRFNVLGYKCNEAEISSPDCIDKVALPENKVIQNDSLGETFQNGCANISSPTSNPEVPDY 782 Query: 512 
GNIVTGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVK 691 GN+V GY S TT+CK S E+ EELK EKDNMA DLARCTE EMTKSQLH TEQLLAEVK Sbjct: 783 GNLVPGYGSNTTSCKVSLEEFEELKSEKDNMAMDLARCTENFEMTKSQLHETEQLLAEVK 842 Query: 692 SQLASAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHH 871 SQL SA+ SNSLAETQLKCMAESYRSLETRAQELETEVNLL+ K E+L ELQ EK SH Sbjct: 843 SQLVSAKKSNSLAETQLKCMAESYRSLETRAQELETEVNLLRVKTETLESELQGEKTSHQ 902 Query: 872 DALARCKELEEQLQRNENCAVCSSKADDD--KIXXXXXXXXXXXXXXXXXXTIFLLGKQL 1045 DAL RCKELEEQLQ E S ADD K TIFLLGKQL Sbjct: 903 DALTRCKELEEQLQTKER-----SSADDIDLKSKQEKEITAAAEKLAECQETIFLLGKQL 957 Query: 1046 KTLRPQSEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESP 1225 K LRPQ+E +GSPYSERSQ G+G ++ T S +NLQD D AEMDT S N +AG+ESP Sbjct: 958 KYLRPQTEFMGSPYSERSQSGDGIAKDEPTVSGINLQDSDQAEMDTGASVNFLKAGSESP 1017 Query: 1226 LDSYTSPCSPSDNEASINKSPIHSRHPQHR 1315 DS+ +PC PSD E+++ +SP+ +HP+HR Sbjct: 1018 SDSHNNPCCPSDTESNLLRSPVGLKHPKHR 1047 >gb|KHG12402.1| Filament-like plant protein 4 [Gossypium arboreum] Length = 1078 Score = 494 bits (1272), Expect = e-137 Identities = 264/440 (60%), Positives = 318/440 (72%), Gaps = 10/440 (2%) Frame = +2 Query: 14 SEEVKCSDVSCNTEAYPGDAGLNTEKKIDL---------TVLNITQELVAAITQIHGFVS 166 SEEV S+ CN + + + L K I + T+ I+QEL AAI+QIH FV Sbjct: 605 SEEVDGSEGKCNGQGHLENGSLTEGKDISVPPGDKVTTETLQTISQELAAAISQIHDFVM 664 Query: 167 SLRKDARAVHEATNDN-GFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRID 343 SL K+ARAV ++D G K ++FSV++NKV+ SN NL DF+F LS VLAKASELR + Sbjct: 665 SLGKEARAVDNISSDAYGLSHKIDDFSVTYNKVLCSNVNLDDFIFGLSTVLAKASELRFN 724 Query: 344 VIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIV 523 V+G+K +E E + PDCIDKVALPENK Q D++G Y NGCA+ISNPTS+PE PDDGN+V Sbjct: 725 VLGFKSSEAEMNGPDCIDKVALPENKGNQNDSSGGRYQNGCAHISNPTSNPEDPDDGNLV 784 Query: 524 TGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLA 703 + YESK T+ S E+ EELKLEK+NMA DL+RCTE LEMT+SQLH T QLLAE KSQLA Sbjct: 785 SEYESKQTS-NISSEEFEELKLEKENMAMDLSRCTENLEMTRSQLHETGQLLAEAKSQLA 843 Query: 704 
SAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALA 883 +AQ SNSLAETQLKCM ESYRSLETRA ELETEV LL AKI +L ELQDEK SHHDA A Sbjct: 844 AAQKSNSLAETQLKCMVESYRSLETRAGELETEVTLLSAKINTLENELQDEKRSHHDAFA 903 Query: 884 RCKELEEQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRPQ 1063 RCKELEEQLQRNE C+VCS+ +D K TIFLLGKQLK RPQ Sbjct: 904 RCKELEEQLQRNEKCSVCSAADNDLKNNQERELAAAAEKLVECQETIFLLGKQLKAFRPQ 963 Query: 1064 SEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSYTS 1243 ++ IGSPY+ERSQKGEGF E+ TTSS+NLQD D A++DTA S N +R G ESP++S+ + Sbjct: 964 TDKIGSPYNERSQKGEGFREDEPTTSSMNLQDLDQADIDTAASGNRSRTGVESPMESFNT 1023 Query: 1244 PCSPSDNEASINKSPIHSRH 1303 PCSP E + +SP+ S+H Sbjct: 1024 PCSPPHTEGDVLRSPVSSKH 1043 >ref|XP_012465872.1| PREDICTED: filament-like plant protein 4 isoform X2 [Gossypium raimondii] Length = 1091 Score = 492 bits (1267), Expect = e-136 Identities = 261/440 (59%), Positives = 318/440 (72%), Gaps = 10/440 (2%) Frame = +2 Query: 14 SEEVKCSDVSCNTEAYPGDAGLNTEKKIDL---------TVLNITQELVAAITQIHGFVS 166 SEEV S+ CN + +P + L K I + T+ ++QEL AI+QIH FV Sbjct: 618 SEEVDGSEGKCNRQGHPENGSLTEGKDIAVPPGDKVTTETLQTMSQELAVAISQIHDFVM 677 Query: 167 SLRKDARAVHEATNDN-GFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRID 343 SL K+ARAV ++D G K ++FSV++NKV+ SN NL DF+F LS VLAKASELR + Sbjct: 678 SLGKEARAVDNISSDAYGLSLKIDDFSVTYNKVLCSNVNLDDFIFGLSTVLAKASELRFN 737 Query: 344 VIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIV 523 V+G+K E E + PDCIDKVALPENKV Q D++G Y NGCA+ISNPTS+PE PDDGN+V Sbjct: 738 VLGFKSNEAEMNGPDCIDKVALPENKVNQNDSSGGRYQNGCAHISNPTSNPEDPDDGNLV 797 Query: 524 TGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLA 703 + YESK A S E+ EELKLEK+NMA DL+RCTE LEMTKSQLH TEQLLAE KSQLA Sbjct: 798 SEYESKQ-ASNISSEEFEELKLEKENMAMDLSRCTENLEMTKSQLHETEQLLAEAKSQLA 856 Query: 704 SAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALA 883 +AQ SNSLAETQLKCM ESYRSLE RA ELET+VNLL KI +L ELQDEK SHHDA + Sbjct: 857 AAQKSNSLAETQLKCMVESYRSLERRAGELETDVNLLSTKINTLENELQDEKRSHHDAFS 916 Query: 884 
RCKELEEQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRPQ 1063 RCKELEEQLQRNE C+VCS+ +D K TIFLLGK+LK L PQ Sbjct: 917 RCKELEEQLQRNEKCSVCSAADNDLKNNQERELAAAAEKLAECQETIFLLGKKLKALHPQ 976 Query: 1064 SEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSYTS 1243 ++ IGSPY+ERSQKGEGF E+ TTS +NLQD D A++DTA S N ++ GAESP++S+ Sbjct: 977 TDKIGSPYNERSQKGEGFREDEPTTSGMNLQDLDQADIDTAASGNGSQTGAESPMESFNI 1036 Query: 1244 PCSPSDNEASINKSPIHSRH 1303 PCSP + E ++ +SP+ S+H Sbjct: 1037 PCSPPNTEGNVLRSPVSSKH 1056 >ref|XP_012465864.1| PREDICTED: filament-like plant protein 4 isoform X1 [Gossypium raimondii] Length = 1117 Score = 492 bits (1267), Expect = e-136 Identities = 261/440 (59%), Positives = 318/440 (72%), Gaps = 10/440 (2%) Frame = +2 Query: 14 SEEVKCSDVSCNTEAYPGDAGLNTEKKIDL---------TVLNITQELVAAITQIHGFVS 166 SEEV S+ CN + +P + L K I + T+ ++QEL AI+QIH FV Sbjct: 644 SEEVDGSEGKCNRQGHPENGSLTEGKDIAVPPGDKVTTETLQTMSQELAVAISQIHDFVM 703 Query: 167 SLRKDARAVHEATNDN-GFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRID 343 SL K+ARAV ++D G K ++FSV++NKV+ SN NL DF+F LS VLAKASELR + Sbjct: 704 SLGKEARAVDNISSDAYGLSLKIDDFSVTYNKVLCSNVNLDDFIFGLSTVLAKASELRFN 763 Query: 344 VIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIV 523 V+G+K E E + PDCIDKVALPENKV Q D++G Y NGCA+ISNPTS+PE PDDGN+V Sbjct: 764 VLGFKSNEAEMNGPDCIDKVALPENKVNQNDSSGGRYQNGCAHISNPTSNPEDPDDGNLV 823 Query: 524 TGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLA 703 + YESK A S E+ EELKLEK+NMA DL+RCTE LEMTKSQLH TEQLLAE KSQLA Sbjct: 824 SEYESKQ-ASNISSEEFEELKLEKENMAMDLSRCTENLEMTKSQLHETEQLLAEAKSQLA 882 Query: 704 SAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALA 883 +AQ SNSLAETQLKCM ESYRSLE RA ELET+VNLL KI +L ELQDEK SHHDA + Sbjct: 883 AAQKSNSLAETQLKCMVESYRSLERRAGELETDVNLLSTKINTLENELQDEKRSHHDAFS 942 Query: 884 RCKELEEQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRPQ 1063 RCKELEEQLQRNE C+VCS+ +D K TIFLLGK+LK L PQ Sbjct: 943 RCKELEEQLQRNEKCSVCSAADNDLKNNQERELAAAAEKLAECQETIFLLGKKLKALHPQ 1002 Query: 1064 
SEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSYTS 1243 ++ IGSPY+ERSQKGEGF E+ TTS +NLQD D A++DTA S N ++ GAESP++S+ Sbjct: 1003 TDKIGSPYNERSQKGEGFREDEPTTSGMNLQDLDQADIDTAASGNGSQTGAESPMESFNI 1062 Query: 1244 PCSPSDNEASINKSPIHSRH 1303 PCSP + E ++ +SP+ S+H Sbjct: 1063 PCSPPNTEGNVLRSPVSSKH 1082 >ref|XP_012465881.1| PREDICTED: filament-like plant protein 4 isoform X3 [Gossypium raimondii] gi|823133136|ref|XP_012465885.1| PREDICTED: filament-like plant protein 4 isoform X3 [Gossypium raimondii] gi|823133138|ref|XP_012465893.1| PREDICTED: filament-like plant protein 4 isoform X3 [Gossypium raimondii] gi|763746975|gb|KJB14414.1| hypothetical protein B456_002G124100 [Gossypium raimondii] gi|763746976|gb|KJB14415.1| hypothetical protein B456_002G124100 [Gossypium raimondii] Length = 1078 Score = 492 bits (1267), Expect = e-136 Identities = 261/440 (59%), Positives = 318/440 (72%), Gaps = 10/440 (2%) Frame = +2 Query: 14 SEEVKCSDVSCNTEAYPGDAGLNTEKKIDL---------TVLNITQELVAAITQIHGFVS 166 SEEV S+ CN + +P + L K I + T+ ++QEL AI+QIH FV Sbjct: 605 SEEVDGSEGKCNRQGHPENGSLTEGKDIAVPPGDKVTTETLQTMSQELAVAISQIHDFVM 664 Query: 167 SLRKDARAVHEATNDN-GFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRID 343 SL K+ARAV ++D G K ++FSV++NKV+ SN NL DF+F LS VLAKASELR + Sbjct: 665 SLGKEARAVDNISSDAYGLSLKIDDFSVTYNKVLCSNVNLDDFIFGLSTVLAKASELRFN 724 Query: 344 VIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIV 523 V+G+K E E + PDCIDKVALPENKV Q D++G Y NGCA+ISNPTS+PE PDDGN+V Sbjct: 725 VLGFKSNEAEMNGPDCIDKVALPENKVNQNDSSGGRYQNGCAHISNPTSNPEDPDDGNLV 784 Query: 524 TGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLA 703 + YESK A S E+ EELKLEK+NMA DL+RCTE LEMTKSQLH TEQLLAE KSQLA Sbjct: 785 SEYESKQ-ASNISSEEFEELKLEKENMAMDLSRCTENLEMTKSQLHETEQLLAEAKSQLA 843 Query: 704 SAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALA 883 +AQ SNSLAETQLKCM ESYRSLE RA ELET+VNLL KI +L ELQDEK SHHDA + Sbjct: 844 AAQKSNSLAETQLKCMVESYRSLERRAGELETDVNLLSTKINTLENELQDEKRSHHDAFS 903 Query: 884 
RCKELEEQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRPQ 1063 RCKELEEQLQRNE C+VCS+ +D K TIFLLGK+LK L PQ Sbjct: 904 RCKELEEQLQRNEKCSVCSAADNDLKNNQERELAAAAEKLAECQETIFLLGKKLKALHPQ 963 Query: 1064 SEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSYTS 1243 ++ IGSPY+ERSQKGEGF E+ TTS +NLQD D A++DTA S N ++ GAESP++S+ Sbjct: 964 TDKIGSPYNERSQKGEGFREDEPTTSGMNLQDLDQADIDTAASGNGSQTGAESPMESFNI 1023 Query: 1244 PCSPSDNEASINKSPIHSRH 1303 PCSP + E ++ +SP+ S+H Sbjct: 1024 PCSPPNTEGNVLRSPVSSKH 1043 >ref|XP_011045118.1| PREDICTED: filament-like plant protein 4 [Populus euphratica] gi|743903540|ref|XP_011045119.1| PREDICTED: filament-like plant protein 4 [Populus euphratica] Length = 1077 Score = 491 bits (1264), Expect = e-136 Identities = 264/449 (58%), Positives = 325/449 (72%), Gaps = 11/449 (2%) Frame = +2 Query: 2 ASCESEEVKCSDVSC-NTEAYPGDAGLNTEKKIDLT---------VLNITQELVAAITQI 151 ASC S+E CSD + + + P DAG+ EK+ +L+ + ++QEL+ AI+QI Sbjct: 604 ASCVSKEAHCSDATTPDRQTCPEDAGIMGEKEFELSQESKTAAQIMHTVSQELLPAISQI 663 Query: 152 HGFVSSLRKDARAVHEATNDN-GFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKAS 328 H FV L K+A AVH+ + D+ G QK +EFS++FNKV+ S+ +LVDFV L+H+LA A Sbjct: 664 HDFVLLLGKEAMAVHDTSCDSIGLSQKIKEFSITFNKVLHSDKSLVDFVSDLAHILALAC 723 Query: 329 ELRIDVIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPD 508 LR +V+GYK E E SSPDCIDK+ALPENKV+Q++++ + Y NGCA IS+PTS+PEVPD Sbjct: 724 GLRFNVLGYKGNEAEISSPDCIDKIALPENKVVQKNSSVETYQNGCANISSPTSNPEVPD 783 Query: 509 DGNIVTGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEV 688 DGN+V GY S TT+CK S E+ EELK EKDNMA DLARCTE EMTKSQLH TEQLLAEV Sbjct: 784 DGNLVLGYGSNTTSCKVSLEEFEELKSEKDNMAMDLARCTENFEMTKSQLHETEQLLAEV 843 Query: 689 KSQLASAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSH 868 KSQLASAQ SNSLAETQLKCM ESYRSLETRAQELETEVNLL+ K +L ELQ+EK SH Sbjct: 844 KSQLASAQKSNSLAETQLKCMTESYRSLETRAQELETEVNLLRLKTGTLENELQEEKKSH 903 Query: 869 HDALARCKELEEQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLK 1048 AL RCKELEEQLQ NE+ V + +K TIFLLGKQL Sbjct: 904 
QGALTRCKELEEQLQTNESSTVTDIECKQEK-----EIAAAAEKLAECQETIFLLGKQLN 958 Query: 1049 TLRPQSEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPL 1228 +L PQ+E++GSPYSERSQ G+ E+ TTS +NLQDFD AEMDT AN +AGAESP+ Sbjct: 959 SLCPQTEIMGSPYSERSQIGDVLAEDEPTTSGMNLQDFDQAEMDTGGLANIHKAGAESPI 1018 Query: 1229 DSYTSPCSPSDNEASINKSPIHSRHPQHR 1315 +SY P SPSD E+S+ +SP+ S+ P+HR Sbjct: 1019 NSYNHPYSPSDTESSLLRSPVGSKPPKHR 1047 >ref|XP_010104432.1| hypothetical protein L484_016031 [Morus notabilis] gi|587913144|gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis] Length = 1087 Score = 487 bits (1254), Expect = e-134 Identities = 262/445 (58%), Positives = 323/445 (72%), Gaps = 8/445 (1%) Frame = +2 Query: 5 SCESEEVKCSDVSCNT-EAYPGDAGLNTEKKIDLTVLN------ITQELVAAITQIHGFV 163 SC SE+V CSD C+ +A P DAGL +EK+I L+ I +L AAI+QIH FV Sbjct: 615 SCISEDVHCSDAGCDDRQANPEDAGLTSEKEIALSQPAREARQIIRDDLAAAISQIHDFV 674 Query: 164 SSLRKDARAVHE-ATNDNGFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRI 340 L K+A VH+ +T + F Q+ EEFSV+ NKVI S+ +L+DFV LS VLAKASELR Sbjct: 675 LFLGKEAMGVHDTSTEGSEFSQRIEEFSVTLNKVIHSDLSLIDFVLDLSSVLAKASELRF 734 Query: 341 DVIGYKDTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNI 520 V+G+K E E +SPDCIDKV LPENK IQ+D++ +IY NGCA++ N TS+PEVPDDGNI Sbjct: 735 SVLGFKGNEAETNSPDCIDKVVLPENKAIQKDSS-EIYQNGCAHMPNSTSNPEVPDDGNI 793 Query: 521 VTGYESKTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQL 700 V+ YES +CK S E+ ++LK EKDN+A D ARCTE LEMTKSQL TEQLLAE KSQL Sbjct: 794 VSSYESNAKSCKISLEEYDQLKSEKDNLALDFARCTENLEMTKSQLQETEQLLAEAKSQL 853 Query: 701 ASAQNSNSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDAL 880 +S Q SNSL+ETQLKCMAESYRSLETRAQ+LETE+NLL+ K ES+ ELQ+EK +H DAL Sbjct: 854 SSVQKSNSLSETQLKCMAESYRSLETRAQDLETELNLLRTKTESIEAELQEEKRNHQDAL 913 Query: 881 ARCKELEEQLQRNENCAVCSSKADDDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRP 1060 RCKEL+EQLQRNEN K + +K TIFLLGK+LK LRP Sbjct: 914 TRCKELQEQLQRNENNCENEIKPNQEK-----EFAAAAEKLAECQETIFLLGKKLKNLRP 968 Query: 1061 QSEVIGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSYT 1240 
QSE++GSPYSERSQ GEG E+ TTS +NL + D AE+++ SAN R GAESP+D Y+ Sbjct: 969 QSEIMGSPYSERSQNGEGLNEDEPTTSGMNLPESDQAELESVTSANLNRVGAESPIDVYS 1028 Query: 1241 SPCSPSDNEASINKSPIHSRHPQHR 1315 +P SPSD E SI KSPI+S++P+H+ Sbjct: 1029 APLSPSDAEPSILKSPINSKNPRHK 1053 >ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] gi|550344134|gb|EEE81259.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] Length = 1063 Score = 484 bits (1245), Expect = e-133 Identities = 266/441 (60%), Positives = 314/441 (71%), Gaps = 3/441 (0%) Frame = +2 Query: 2 ASCESEEVKCSDVSCNTEAYPGDAGLNTEKKIDLTVLNITQELVAAITQIHGFVSSLRKD 181 ASC S+EV SD +C+ + P DA + EK+I L +I IH FV L K+ Sbjct: 604 ASCGSKEVHHSDATCDRQTCPEDAVIMGEKEITLLQESI----------IHDFVLLLGKE 653 Query: 182 ARAVHEATNDN-GFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRIDVIGYK 358 A AVH+ + D+ G QK EEFS++F KV+ S+ +L+DF+F LS VLA AS LR +V+GYK Sbjct: 654 AMAVHDTSCDSIGLSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALASGLRFNVLGYK 713 Query: 359 DTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIVTGYES 538 E E +SPDCIDKVALPENKVIQ D+ G+ + NGCA IS+PTS+PEVPD GN+V GY S Sbjct: 714 CNEAEINSPDCIDKVALPENKVIQNDSPGETFQNGCANISSPTSNPEVPDYGNLVPGYGS 773 Query: 539 KTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLASAQNS 718 TT+CK S E+ EELK EKD MA DLARCTE LEMTKSQLH TEQLLAEVKSQL SAQ S Sbjct: 774 NTTSCKVSLEEFEELKSEKDTMAMDLARCTENLEMTKSQLHETEQLLAEVKSQLVSAQKS 833 Query: 719 NSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALARCKEL 898 NSLAETQLKCMAESYRSLETRAQELETEVNLL+ K E+L ELQ+EK SH DAL RCKEL Sbjct: 834 NSLAETQLKCMAESYRSLETRAQELETEVNLLRVKTETLESELQEEKTSHQDALTRCKEL 893 Query: 899 EEQLQRNENCAVCSSKAD--DDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRPQSEV 1072 EEQLQ E SS AD D K TIFLLGKQLK LRPQ+E+ Sbjct: 894 EEQLQTKE-----SSSADGIDLKSKQEKEITAAAEKLAECQETIFLLGKQLKYLRPQTEI 948 Query: 1073 IGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSYTSPCS 1252 +GSPYSERSQ G+G ++ T S +NLQD D AEMDT S N +AG+ESP DSY PC Sbjct: 949 MGSPYSERSQSGDGIAKDEPTISGINLQDSDQAEMDTGASVNFLKAGSESPSDSYNHPCY 1008 
Query: 1253 PSDNEASINKSPIHSRHPQHR 1315 PSD E+++ +SP+ +HP+HR Sbjct: 1009 PSDTESNLLRSPVGLKHPKHR 1029 >ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] gi|550344133|gb|ERP63976.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] Length = 991 Score = 484 bits (1245), Expect = e-133 Identities = 266/441 (60%), Positives = 314/441 (71%), Gaps = 3/441 (0%) Frame = +2 Query: 2 ASCESEEVKCSDVSCNTEAYPGDAGLNTEKKIDLTVLNITQELVAAITQIHGFVSSLRKD 181 ASC S+EV SD +C+ + P DA + EK+I L +I IH FV L K+ Sbjct: 532 ASCGSKEVHHSDATCDRQTCPEDAVIMGEKEITLLQESI----------IHDFVLLLGKE 581 Query: 182 ARAVHEATNDN-GFIQKFEEFSVSFNKVIDSNTNLVDFVFALSHVLAKASELRIDVIGYK 358 A AVH+ + D+ G QK EEFS++F KV+ S+ +L+DF+F LS VLA AS LR +V+GYK Sbjct: 582 AMAVHDTSCDSIGLSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALASGLRFNVLGYK 641 Query: 359 DTEIEPSSPDCIDKVALPENKVIQRDTAGKIYPNGCAYISNPTSDPEVPDDGNIVTGYES 538 E E +SPDCIDKVALPENKVIQ D+ G+ + NGCA IS+PTS+PEVPD GN+V GY S Sbjct: 642 CNEAEINSPDCIDKVALPENKVIQNDSPGETFQNGCANISSPTSNPEVPDYGNLVPGYGS 701 Query: 539 KTTACKFSWEQLEELKLEKDNMATDLARCTEYLEMTKSQLHGTEQLLAEVKSQLASAQNS 718 TT+CK S E+ EELK EKD MA DLARCTE LEMTKSQLH TEQLLAEVKSQL SAQ S Sbjct: 702 NTTSCKVSLEEFEELKSEKDTMAMDLARCTENLEMTKSQLHETEQLLAEVKSQLVSAQKS 761 Query: 719 NSLAETQLKCMAESYRSLETRAQELETEVNLLQAKIESLVKELQDEKMSHHDALARCKEL 898 NSLAETQLKCMAESYRSLETRAQELETEVNLL+ K E+L ELQ+EK SH DAL RCKEL Sbjct: 762 NSLAETQLKCMAESYRSLETRAQELETEVNLLRVKTETLESELQEEKTSHQDALTRCKEL 821 Query: 899 EEQLQRNENCAVCSSKAD--DDKIXXXXXXXXXXXXXXXXXXTIFLLGKQLKTLRPQSEV 1072 EEQLQ E SS AD D K TIFLLGKQLK LRPQ+E+ Sbjct: 822 EEQLQTKE-----SSSADGIDLKSKQEKEITAAAEKLAECQETIFLLGKQLKYLRPQTEI 876 Query: 1073 IGSPYSERSQKGEGFPEEPATTSSLNLQDFDNAEMDTANSANATRAGAESPLDSYTSPCS 1252 +GSPYSERSQ G+G ++ T S +NLQD D AEMDT S N +AG+ESP DSY PC Sbjct: 877 MGSPYSERSQSGDGIAKDEPTISGINLQDSDQAEMDTGASVNFLKAGSESPSDSYNHPCY 936 Query: 1253 PSDNEASINKSPIHSRHPQHR 1315 PSD E+++ +SP+ +HP+HR Sbjct: 937 PSDTESNLLRSPVGLKHPKHR 957