BLASTX nr result
ID: Paeonia24_contig00018578
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00018578
         (2467 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

emb|CAN60525.1| hypothetical protein VITISV_000522 [Vitis vinifera]     801   0.0
ref|XP_002280306.2| PREDICTED: filament-like plant protein 4-lik...     748   0.0
emb|CBI19835.3| unnamed protein product [Vitis vinifera]                699   0.0
ref|XP_007225499.1| hypothetical protein PRUPE_ppa000819mg [Prun...     689   0.0
ref|XP_007017758.1| Uncharacterized protein isoform 4 [Theobroma...     687   0.0
ref|XP_007017757.1| Uncharacterized protein isoform 3 [Theobroma...     687   0.0
ref|XP_007017755.1| Uncharacterized protein isoform 1 [Theobroma...     687   0.0
gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis]       680   0.0
ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citr...     667   0.0
ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-lik...     663   0.0
ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Popu...     658   0.0
ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Popu...     658   0.0
ref|XP_007017761.1| Uncharacterized protein isoform 7 [Theobroma...     657   0.0
ref|XP_002306918.2| hypothetical protein POPTR_0005s25830g [Popu...     641   0.0
ref|XP_006342030.1| PREDICTED: filament-like plant protein 6-lik...     618   e-174
ref|XP_004168855.1| PREDICTED: LOW QUALITY PROTEIN: filament-lik...     616   e-173
ref|XP_004136392.1| PREDICTED: filament-like plant protein 4-lik...     616   e-173
ref|XP_002510512.1| Myosin heavy chain, striated muscle, putativ...     610   e-171
ref|XP_007017762.1| Uncharacterized protein isoform 8, partial [... 
604 e-170 ref|XP_007017760.1| Uncharacterized protein isoform 6 [Theobroma... 604 e-170 >emb|CAN60525.1| hypothetical protein VITISV_000522 [Vitis vinifera] Length = 1085 Score = 801 bits (2069), Expect = 0.0 Identities = 444/762 (58%), Positives = 533/762 (69%), Gaps = 35/762 (4%) Frame = +3 Query: 6 TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 +PHL LPEFS+DN+Q+ HK+NEF RNSELQASRN+CAKT Sbjct: 338 SPHLSPLPEFSIDNVQQCHKDNEFLTERLLGMEEETKMLKEALAKRNSELQASRNICAKT 397 Query: 186 ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329 A+KLQ LE+QLQ+++Q K P KS Q+AS+P S+TSMSEDG+DD SCAES Sbjct: 398 ASKLQNLEAQLQMNNQQKSPPKSNLQIPNDGSLSQNASNPPSMTSMSEDGNDDAVSCAES 457 Query: 330 WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509 WAT S + F A +LELMDDFLEMEKLAC SN SNGA S+ N+ Sbjct: 458 WATGLXSGLSQFKKEN---------ANHLELMDDFLEMEKLACLSNNSNGAFSV----NN 504 Query: 510 KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689 K SE+V+ A+ E T KDL EQ+ D++ +ANQVSSN ELS D D LPLTKL+S Sbjct: 505 KRSEAVDHGAIAEVTSSKDLQLEQKHDLDSLANQVSSNAELSEVNPQSDKDLLPLTKLRS 564 Query: 690 RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACPED 869 RIS+VFES+S++SD KIL +IKRVLQD H+ LHQHS S EEIHCSD+TCDRQACPED Sbjct: 565 RISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQHSVSCVVEEIHCSDATCDRQACPED 624 Query: 870 AGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGNGL 1049 AGVTAE+EISLSQD P +T+ IISQEL+AAIS IH+FVLFLGKEA+A Q SP GNG Sbjct: 625 AGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGASPDGNGW 684 Query: 1050 NEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCIDK 1229 + KIE+FSAT +KVL +S+ DF+ LS+V++KA+EL+FNILGYK EINSSDCIDK Sbjct: 685 SRKIEDFSATVNKVLCXKMSVIDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDCIDK 744 Query: 1230 VALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEYEQ 1409 VALPENKVVQ+D+SG RYPNGC+HISDSTSDPEVPHD NLVP F+ N +SC CSLEE+EQ Sbjct: 745 VALPENKVVQKDTSGERYPNGCAHISDSTSDPEVPHDGNLVPGFKSNAASCNCSLEEFEQ 804 Query: 1410 
LKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMAES 1589 LKS KD L + L RCTENLESTKSQLQETEQ LAE KSQLTSAQ+ NSL++TQLKCMAES Sbjct: 805 LKSEKDTLEMHLARCTENLESTKSQLQETEQLLAEAKSQLTSAQKLNSLADTQLKCMAES 864 Query: 1590 YRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCSA-- 1763 YRSLETRAEELE++VNLL+ + SH++ L RC DLQE+L+RNE CS Sbjct: 865 YRSLETRAEELETEVNLLRGKTETLESEFQEEKRSHENALIRCKDLQEQLERNEGCSVCA 924 Query: 1764 -SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940 S AADI CQETIFLLG+Q AMRP T+L+GSP SE +Q+ E Sbjct: 925 MSSAADIDVKTKQERELASAADKLAECQETIFLLGKQLXAMRPQTDLLGSPQSERSQRVE 984 Query: 1941 GYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNILLKSP 2063 + E+EP D+ + +S + L LYN+P +PS+TESN+LL+SP Sbjct: 985 VFHEDEPTTSGMNLQDIDQVDTESTASINVHRIGGESPLELYNTPRSPSETESNLLLRSP 1044 Query: 2064 LSTKHSKHRXXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186 + +KH KHR EKQ RGFSRFFS+KGK +GH Sbjct: 1045 VGSKHPKHRPTKSNSSSSAPTPEKQSRGFSRFFSSKGK-NGH 1085 >ref|XP_002280306.2| PREDICTED: filament-like plant protein 4-like [Vitis vinifera] Length = 1040 Score = 748 bits (1930), Expect = 0.0 Identities = 424/762 (55%), Positives = 508/762 (66%), Gaps = 35/762 (4%) Frame = +3 Query: 6 TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 +PHL LPEFS+DN+Q+ HK+NEF RNSELQASRN+CAKT Sbjct: 338 SPHLSPLPEFSIDNVQQCHKDNEFLTERLLGMEEETKMLKEALAKRNSELQASRNICAKT 397 Query: 186 ANKLQTLESQLQVSDQHKHPLK-----------SQHASS-PSVTSMSEDGHDDVGSCAES 329 A+KLQ LE+QLQ+++Q K P K SQ+AS+ PS+TSMSEDG+DD SCAES Sbjct: 398 ASKLQNLEAQLQMNNQQKSPPKSNLQIPNDGSLSQNASNPPSMTSMSEDGNDDAVSCAES 457 Query: 330 WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509 WAT +S + F A +LELMDDFLEMEKLAC SN SNGA S+ N+ Sbjct: 458 WATGLVSGLSQF---------KKENANHLELMDDFLEMEKLACLSNNSNGAFSV----NN 504 Query: 510 KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689 K SE+ D LPLTKL+S Sbjct: 505 KRSEA---------------------------------------------DLLPLTKLRS 519 Query: 690 RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACPED 869 
RIS+VFES+S++SD KIL +IKRVLQD H+ LHQHS S EEIHCSD+TCDRQACPED Sbjct: 520 RISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQHSVSCVVEEIHCSDATCDRQACPED 579 Query: 870 AGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGNGL 1049 AGVTAE+EISLSQD P +T+ IISQEL+AAIS IH+FVLFLGKEA+A Q SP GNG Sbjct: 580 AGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGASPDGNGW 639 Query: 1050 NEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCIDK 1229 + KIE+FSAT +KVL +S+ DF+ LS+V++KA+EL+FNILGYK EINSSDCIDK Sbjct: 640 SRKIEDFSATVNKVLCRKMSVIDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDCIDK 699 Query: 1230 VALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEYEQ 1409 VALPENKVVQ+D+SG RYPNGC+HISDSTSDPEVPHD NLVP F+ N +SC CSLEE+EQ Sbjct: 700 VALPENKVVQKDTSGERYPNGCAHISDSTSDPEVPHDGNLVPGFKSNAASCNCSLEEFEQ 759 Query: 1410 LKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMAES 1589 LKS KD L + L RCTENLESTKSQLQETEQ LAE KSQLTSAQ+ NSL++TQLKCMAES Sbjct: 760 LKSEKDTLEMHLARCTENLESTKSQLQETEQLLAEAKSQLTSAQKLNSLADTQLKCMAES 819 Query: 1590 YRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCSA-- 1763 YRSLETRAEELE++VNLL+ + SH++ L RC DLQE+L+RNE CS Sbjct: 820 YRSLETRAEELETEVNLLRGKTETLESELQEEKRSHENALIRCKDLQEQLERNEGCSVCA 879 Query: 1764 -SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940 S AADI CQETIFLLG+Q NAMRP T+L+GSP SE +Q+ E Sbjct: 880 MSSAADIDVKTKQERELASAADKLAECQETIFLLGKQLNAMRPQTDLLGSPQSERSQRVE 939 Query: 1941 GYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNILLKSP 2063 + E+EP D+ + +S + L LYN+P +PS+TESN+LL+SP Sbjct: 940 VFHEDEPTTSGMNLQDIDQVDTESTASINVHRIGGESPLELYNTPRSPSETESNLLLRSP 999 Query: 2064 LSTKHSKHRXXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186 + +KH KHR EKQ RGFSRFFS+KGK +GH Sbjct: 1000 VGSKHPKHRPTKSNSSSSAPTPEKQSRGFSRFFSSKGK-NGH 1040 >emb|CBI19835.3| unnamed protein product [Vitis vinifera] Length = 993 Score = 699 bits (1805), Expect = 0.0 Identities = 404/743 (54%), Positives = 483/743 (65%), Gaps = 16/743 (2%) Frame = +3 Query: 6 TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 +PHL 
LPEFS+DN+Q+ HK+NEF RNSELQASRN+CAKT Sbjct: 338 SPHLSPLPEFSIDNVQQCHKDNEFLTERLLGMEEETKMLKEALAKRNSELQASRNICAKT 397 Query: 186 ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329 A+KLQ LE+QLQ+++Q K P KS Q+AS+P S+TSMSEDG+DD SCAES Sbjct: 398 ASKLQNLEAQLQMNNQQKSPPKSNLQIPNDGSLSQNASNPPSMTSMSEDGNDDAVSCAES 457 Query: 330 WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509 WAT +S + F A +LELMDDFLEMEKLAC SN SNGA S H Sbjct: 458 WATGLVSGLSQFKKEN---------ANHLELMDDFLEMEKLACLSNNSNGA-----FSKH 503 Query: 510 KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689 D++ +ANQ L+S Sbjct: 504 --------------------------DLDSLANQ-----------------------LRS 514 Query: 690 RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACPED 869 RIS+VFES+S++SD KIL +IKRVLQD H+ LHQHS ACPED Sbjct: 515 RISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQHS------------------ACPED 556 Query: 870 AGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGNGL 1049 AGVTAE+EISLSQD P +T+ IISQEL+AAIS IH+FVLFLGKEA+A Q SP GNG Sbjct: 557 AGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGASPDGNGW 616 Query: 1050 NEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCIDK 1229 + KIE+FSAT +KVL +S+ DF+ LS+V++KA+EL+FNILGYK EINSSDCIDK Sbjct: 617 SRKIEDFSATVNKVLCRKMSVIDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDCIDK 676 Query: 1230 VALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEYEQ 1409 VALPENKVVQ+D+SG RYPNGC+HISDSTSDPEVPHD NLVP F+ N +SC CSLEE+EQ Sbjct: 677 VALPENKVVQKDTSGERYPNGCAHISDSTSDPEVPHDGNLVPGFKSNAASCNCSLEEFEQ 736 Query: 1410 LKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMAES 1589 LKS KD L + L RCTENLESTKSQLQETEQ LAE KSQLTSAQ+ NSL++TQLKCMAES Sbjct: 737 LKSEKDTLEMHLARCTENLESTKSQLQETEQLLAEAKSQLTSAQKLNSLADTQLKCMAES 796 Query: 1590 YRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCSA-- 1763 YRSLETRAEELE++VNLL+ + SH++ L RC DLQE+L+RNE CS Sbjct: 797 YRSLETRAEELETEVNLLRGKTETLESELQEEKRSHENALIRCKDLQEQLERNEGCSVCA 856 Query: 1764 -SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940 S 
AADI CQETIFLLG+Q NAMRP T+L+GSP SE +Q+ E Sbjct: 857 MSSAADIDVKTKQERELASAADKLAECQETIFLLGKQLNAMRPQTDLLGSPQSERSQRVE 916 Query: 1941 GYSENEPKDDEAEMDSAGALVLYNSPFTPSDTESNILLKSPLSTKHSKHRXXXXXXXXXX 2120 + E+EP L LYN+P +PS+TESN+LL+SP+ +KH KHR Sbjct: 917 VFHEDEP-----TTSGESPLELYNTPRSPSETESNLLLRSPVGSKHPKHRPTKSNSSSSA 971 Query: 2121 XXXEKQ-RGFSRFFSTKGKVSGH 2186 EKQ RGFSRFFS+KGK +GH Sbjct: 972 PTPEKQSRGFSRFFSSKGK-NGH 993 >ref|XP_007225499.1| hypothetical protein PRUPE_ppa000819mg [Prunus persica] gi|462422435|gb|EMJ26698.1| hypothetical protein PRUPE_ppa000819mg [Prunus persica] Length = 993 Score = 689 bits (1777), Expect = 0.0 Identities = 388/757 (51%), Positives = 495/757 (65%), Gaps = 33/757 (4%) Frame = +3 Query: 3 ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182 ++PH+ + EFSLDN+QKFHKENEF RNSELQ SR +CA+ Sbjct: 259 SSPHMSPVTEFSLDNVQKFHKENEFLTERLLAMEEETKMLKEALTKRNSELQTSRGMCAQ 318 Query: 183 TANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAE 326 T +KLQTLE+QLQ+++Q K KS Q+AS+P S+TS+SEDG+DD SCAE Sbjct: 319 TVSKLQTLEAQLQINNQQKGSPKSVVQITTEGSSSQNASNPPSLTSLSEDGNDDDRSCAE 378 Query: 327 SWATTSISKFTHFXXXXXXXXXXXXXACN-LELMDDFLEMEKLACSSNESNGAPSISESS 503 SWATT S +H N L LMDDFLEMEKLAC N+SNGA SIS Sbjct: 379 SWATTLGSDLSHIRKEKSNQKSNKAENQNHLNLMDDFLEMEKLACLPNDSNGAVSISSGP 438 Query: 504 NHKASESVNQDALVETTMDKDLHPEQQCDMNPM-ANQVSSNVELSSHKSDPDTDQLPLTK 680 N+K SE N DA + T +KD+ EQQ D++P+ +Q SSNV+LS + D +QLPL K Sbjct: 439 NNKTSERENHDASGDVTAEKDIQSEQQQDLSPLEGDQASSNVKLSGLSPESDENQLPLVK 498 Query: 681 LQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQAC 860 L+S+IS++ E LSK++D K++ DIK V+Q+A + LH H+ + EE+H SD+ CDRQA Sbjct: 499 LRSKISMLLELLSKDTDFGKVIEDIKHVVQEAQDTLHPHTVNCISEEVHSSDAICDRQAN 558 Query: 861 PEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGG 1040 PED+ +T EKEI+LSQ P + T+ ++S++L++AIS I+DFVLFLGKE + D P G Sbjct: 559 PEDSRLTTEKEITLSQ---PARGTMELMSEDLASAISLINDFVLFLGKEVMGVHDTFPDG 615 Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220 N L+ 
KIEEFS F+K +H N+SL DFVL LSHV++ EL FN+LGYK + E NS DC Sbjct: 616 NELSHKIEEFSGAFNKAIHGNLSLADFVLGLSHVLANVGELKFNVLGYKGVETETNSPDC 675 Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400 IDKVALPENKVV++DSS RY N C HIS+ S+PEVP D NLV +E N + CK SLEE Sbjct: 676 IDKVALPENKVVEKDSSE-RYQNVCVHISNH-SNPEVPDDGNLVSGYESNAAPCKISLEE 733 Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580 +EQ+KS KDNLA+DL RC E LE TKSQLQETEQ LAE KSQ SAQ SNSL+ETQL+CM Sbjct: 734 FEQIKSQKDNLAMDLERCNETLEMTKSQLQETEQLLAEAKSQFASAQNSNSLAETQLRCM 793 Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760 AESYRSLE RAEELE+++ LLQ R +HQD LARC +LQE+L+R + + Sbjct: 794 AESYRSLEARAEELEAELKLLQVRTETLESELQEEKRNHQDALARCTELQEQLKRELADA 853 Query: 1761 ASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940 A A+ CQETIFLLG+Q ++ P TE +GSP+SE +QKGE Sbjct: 854 AEKLAE--------------------CQETIFLLGKQLKSLHPQTEHMGSPFSERSQKGE 893 Query: 1941 GYSENEP-----KDDEAEMD-----------SAGALVLYNSPFTPSDTESNILLKSPLST 2072 GY+E+ P D+AEM+ S + LYN+P +PSDTE+N LLKSP+++ Sbjct: 894 GYTEDVPTTTVRDSDQAEMEGTAFANVNRVGSESPVNLYNTPCSPSDTEANTLLKSPVNS 953 Query: 2073 KHSKHR---XXXXXXXXXXXXXEKQRGFSRFFSTKGK 2174 K+ KHR + QRGFSRFFS+K K Sbjct: 954 KYPKHRPTKSTSSSASSTPTPEKHQRGFSRFFSSKAK 990 >ref|XP_007017758.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508723086|gb|EOY14983.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 947 Score = 687 bits (1773), Expect = 0.0 Identities = 398/766 (51%), Positives = 509/766 (66%), Gaps = 38/766 (4%) Frame = +3 Query: 3 ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182 +TPHL T +FSLDN QK KENEF RNSEL ASRNLCAK Sbjct: 186 STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 245 Query: 183 TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326 T++KLQTLE+QL +S Q + P K SQ+ S+P SVTS+SEDG+DD SCAE Sbjct: 246 TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 305 Query: 327 
SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497 SWAT +S+ + F A +L+LMDDFLEMEKLACSSN+S NG +IS+ Sbjct: 306 SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 365 Query: 498 SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677 S+N+K SESVN DA E + K+L E+Q ++P NQVSSN++LS + D DQLP+ Sbjct: 366 STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 424 Query: 678 KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857 KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS + EE+H SD TC QA Sbjct: 425 KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 484 Query: 858 CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037 +TAEKEI++S E V +SQEL+AAIS IHDFVL LGKEA A D Sbjct: 485 HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 544 Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217 GN L+ KIEEFS T++KVL +N+SL DF+ LS +++KA++L N+LGYK + EINS D Sbjct: 545 GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 604 Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397 CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV +E + S K S E Sbjct: 605 CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 663 Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577 E+E+LK K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC Sbjct: 664 EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 723 Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757 MAESYRSLETRA+ELE++VNLL+ + SH DTLARC +L+E+LQRNE+C Sbjct: 724 MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 783 Query: 1758 SA-SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934 SA + AAD CQETIFLLG+Q ++RP T+++GSPY+E +QK Sbjct: 784 SACAAAADNDLKNKQEKELAAAAEKLAECQETIFLLGKQLKSLRPQTDMMGSPYNERSQK 843 Query: 1935 GEGYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNILLK 2057 GEG E+EP D+ E+D+A + + SP +PSDT++N LL+ Sbjct: 844 
GEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDAN-LLR 902 Query: 2058 SPLSTKHSKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186 SP+++ H KH+ EKQ RGFSRFFS+KGK +GH Sbjct: 903 SPINSNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGK-TGH 947 >ref|XP_007017757.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508723085|gb|EOY14982.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1106 Score = 687 bits (1773), Expect = 0.0 Identities = 398/766 (51%), Positives = 509/766 (66%), Gaps = 38/766 (4%) Frame = +3 Query: 3 ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182 +TPHL T +FSLDN QK KENEF RNSEL ASRNLCAK Sbjct: 345 STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 404 Query: 183 TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326 T++KLQTLE+QL +S Q + P K SQ+ S+P SVTS+SEDG+DD SCAE Sbjct: 405 TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 464 Query: 327 SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497 SWAT +S+ + F A +L+LMDDFLEMEKLACSSN+S NG +IS+ Sbjct: 465 SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 524 Query: 498 SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677 S+N+K SESVN DA E + K+L E+Q ++P NQVSSN++LS + D DQLP+ Sbjct: 525 STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 583 Query: 678 KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857 KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS + EE+H SD TC QA Sbjct: 584 KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 643 Query: 858 CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037 +TAEKEI++S E V +SQEL+AAIS IHDFVL LGKEA A D Sbjct: 644 HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 703 Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217 GN L+ KIEEFS T++KVL +N+SL DF+ LS +++KA++L N+LGYK + EINS D Sbjct: 704 GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 763 Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 
1397 CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV +E + S K S E Sbjct: 764 CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 822 Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577 E+E+LK K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC Sbjct: 823 EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 882 Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757 MAESYRSLETRA+ELE++VNLL+ + SH DTLARC +L+E+LQRNE+C Sbjct: 883 MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 942 Query: 1758 SA-SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934 SA + AAD CQETIFLLG+Q ++RP T+++GSPY+E +QK Sbjct: 943 SACAAAADNDLKNKQEKELAAAAEKLAECQETIFLLGKQLKSLRPQTDMMGSPYNERSQK 1002 Query: 1935 GEGYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNILLK 2057 GEG E+EP D+ E+D+A + + SP +PSDT++N LL+ Sbjct: 1003 GEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDAN-LLR 1061 Query: 2058 SPLSTKHSKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186 SP+++ H KH+ EKQ RGFSRFFS+KGK +GH Sbjct: 1062 SPINSNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGK-TGH 1106 >ref|XP_007017755.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508723083|gb|EOY14980.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1102 Score = 687 bits (1773), Expect = 0.0 Identities = 398/766 (51%), Positives = 509/766 (66%), Gaps = 38/766 (4%) Frame = +3 Query: 3 ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182 +TPHL T +FSLDN QK KENEF RNSEL ASRNLCAK Sbjct: 341 STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 400 Query: 183 TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326 T++KLQTLE+QL +S Q + P K SQ+ S+P SVTS+SEDG+DD SCAE Sbjct: 401 TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 460 Query: 327 SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497 SWAT +S+ + F A +L+LMDDFLEMEKLACSSN+S NG +IS+ Sbjct: 461 SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 520 Query: 498 
SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677 S+N+K SESVN DA E + K+L E+Q ++P NQVSSN++LS + D DQLP+ Sbjct: 521 STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 579 Query: 678 KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857 KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS + EE+H SD TC QA Sbjct: 580 KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 639 Query: 858 CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037 +TAEKEI++S E V +SQEL+AAIS IHDFVL LGKEA A D Sbjct: 640 HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 699 Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217 GN L+ KIEEFS T++KVL +N+SL DF+ LS +++KA++L N+LGYK + EINS D Sbjct: 700 GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 759 Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397 CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV +E + S K S E Sbjct: 760 CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 818 Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577 E+E+LK K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC Sbjct: 819 EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 878 Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757 MAESYRSLETRA+ELE++VNLL+ + SH DTLARC +L+E+LQRNE+C Sbjct: 879 MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 938 Query: 1758 SA-SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934 SA + AAD CQETIFLLG+Q ++RP T+++GSPY+E +QK Sbjct: 939 SACAAAADNDLKNKQEKELAAAAEKLAECQETIFLLGKQLKSLRPQTDMMGSPYNERSQK 998 Query: 1935 GEGYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNILLK 2057 GEG E+EP D+ E+D+A + + SP +PSDT++N LL+ Sbjct: 999 GEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDAN-LLR 1057 Query: 2058 SPLSTKHSKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186 SP+++ H KH+ EKQ RGFSRFFS+KGK +GH Sbjct: 1058 SPINSNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGK-TGH 1102 
>gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis] Length = 1087 Score = 680 bits (1754), Expect = 0.0 Identities = 389/766 (50%), Positives = 509/766 (66%), Gaps = 38/766 (4%) Frame = +3 Query: 3 ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182 ++PHL EF+ DN+QK+ KENEF RNSELQ SR++CAK Sbjct: 339 SSPHLSPATEFTPDNVQKYQKENEFLTERLLAVEEETKMLKEALAKRNSELQVSRSMCAK 398 Query: 183 TANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAE 326 T++KLQ+LE+Q+Q ++QHK KS Q+AS+P S+TSMSEDG+DD SCAE Sbjct: 399 TSSKLQSLEAQIQSNNQHKTTPKSIVQISAEGSFSQNASNPPSLTSMSEDGNDDDRSCAE 458 Query: 327 SWATTSISKFTHFXXXXXXXXXXXXXACN-LELMDDFLEMEKLACSSNESNGAPSISESS 503 SW TT IS+ + N L LMDDFLEMEKLAC SNESNGA S+S+S Sbjct: 459 SWTTTLISEVSQVKKEKSNEKTNRAEKPNHLNLMDDFLEMEKLACLSNESNGAISVSDSM 518 Query: 504 NHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQ-VSSNVELSSHKSDPDTDQLPLTK 680 + K SE+VN DA E M K E+QCD N +ANQ ++SN + + +++QLPL K Sbjct: 519 SSKISETVNHDAS-EVVMRK----EEQCDSNSLANQQLTSNGKSPELRPGSNSEQLPLMK 573 Query: 681 LQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCD-RQA 857 LQSRIS++ ES+SK+SD+ IL DIK +Q+ H+ LHQH+ S E++HCSD+ CD RQA Sbjct: 574 LQSRISVLLESVSKDSDVGTILEDIKHAIQETHDTLHQHTVSCISEDVHCSDAGCDDRQA 633 Query: 858 CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037 PEDAG+T+EKEI+LSQ P +E II +L+AAIS IHDFVLFLGKEA+ D S Sbjct: 634 NPEDAGLTSEKEIALSQ---PAREARQIIRDDLAAAISQIHDFVLFLGKEAMGVHDTSTE 690 Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217 G+ +++IEEFS T +KV+H+++SL DFVL LS V++KA+EL F++LG+K +AE NS D Sbjct: 691 GSEFSQRIEEFSVTLNKVIHSDLSLIDFVLDLSSVLAKASELRFSVLGFKGNEAETNSPD 750 Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397 CIDKV LPENK +Q+DSS Y NGC+H+ +STS+PEVP D N+V S+E N SCK SLE Sbjct: 751 CIDKVVLPENKAIQKDSSE-IYQNGCAHMPNSTSNPEVPDDGNIVSSYESNAKSCKISLE 809 Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577 EY+QLKS KDNLALD RCTENLE TKSQLQETEQ LAE KSQL+S Q+SNSLSETQLKC Sbjct: 810 
EYDQLKSEKDNLALDFARCTENLEMTKSQLQETEQLLAEAKSQLSSVQKSNSLSETQLKC 869 Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNE-S 1754 MAESYRSLETRA++LE+++NLL+ + +HQD L RC +LQE+LQRNE + Sbjct: 870 MAESYRSLETRAQDLETELNLLRTKTESIEAELQEEKRNHQDALTRCKELQEQLQRNENN 929 Query: 1755 CSASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934 C + + CQETIFLLG++ +RP +E++GSPYSE +Q Sbjct: 930 CENEIKPN------QEKEFAAAAEKLAECQETIFLLGKKLKNLRPQSEIMGSPYSERSQN 983 Query: 1935 GEGYSENE--------PKDDEAEMDSA-----------GALVLYNSPFTPSDTESNILLK 2057 GEG +E+E P+ D+AE++S + +Y++P +PSD E +I LK Sbjct: 984 GEGLNEDEPTTSGMNLPESDQAELESVTSANLNRVGAESPIDVYSAPLSPSDAEPSI-LK 1042 Query: 2058 SPLSTKHSKH---RXXXXXXXXXXXXXEKQRGFSRFFSTKGKVSGH 2186 SP+++K+ +H + + RGFSRFFS+KGK +GH Sbjct: 1043 SPINSKNPRHKSPKSGSLSSSSAPTPEKHSRGFSRFFSSKGK-NGH 1087 >ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|567885183|ref|XP_006435150.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|557537271|gb|ESR48389.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] gi|557537272|gb|ESR48390.1| hypothetical protein CICLE_v10000102mg [Citrus clementina] Length = 1091 Score = 667 bits (1721), Expect = 0.0 Identities = 383/764 (50%), Positives = 494/764 (64%), Gaps = 37/764 (4%) Frame = +3 Query: 6 TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 +PHL + EFSLDN+QKF KENEF RNSELQASRNLCAKT Sbjct: 341 SPHLSPVSEFSLDNVQKFQKENEFLTERLLAMEEETKMLKEALAKRNSELQASRNLCAKT 400 Query: 186 ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329 A+KLQ+LE+Q+Q S Q K P KS Q+AS+P S+TSMSED +DD SCA+S Sbjct: 401 ASKLQSLEAQMQTSTQQKSPTKSVVQIAAEGYTSQNASNPPSLTSMSEDDNDDKVSCADS 460 Query: 330 WATTSISKFTHFXXXXXXXXXXXXXAC-NLELMDDFLEMEKLACSSNE--SNGAPSISES 500 WAT IS+ + +LELMDDFLEMEKLAC SN+ SNG + S Sbjct: 461 WATALISELSQIKKEKNVEKSNKAETPKHLELMDDFLEMEKLACLSNDTNSNGTITASNG 520 Query: 501 SNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTK 680 N+K S+ +N DA T +DL EQQ DMNP +++SSN E S+ + D 
Q L K Sbjct: 521 PNNKTSDILNHDASGAVTSGEDLLSEQQRDMNPSVDKLSSNTESSTVNPEADAGQPQLMK 580 Query: 681 LQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQAC 860 L+SRIS++ E++SK++DM KI+ DIKRV++D H LHQHS + EE+ CSD +C +A Sbjct: 581 LRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLHQHSANCISEEVKCSDVSCSAEAY 640 Query: 861 PEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGG 1040 P DA + E++I L TV +ISQEL AAIS IHDFVLFLGKEA A D + Sbjct: 641 PGDASLNTERKIDL---------TVQVISQELVAAISQIHDFVLFLGKEARAVHDTT-NE 690 Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220 NG ++KIEEF +F+KV+ +N L DFV +LS+V++KA+EL N++GYK T+ E NS DC Sbjct: 691 NGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRINVMGYKDTEIEPNSPDC 750 Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400 IDKVALPENKV+++D+SG RYPNGC+HIS+ TSDPEVP D ++V ++E T++CK +LEE Sbjct: 751 IDKVALPENKVIKKDTSGERYPNGCAHISNPTSDPEVPDDGSIVAAYESETTACKFTLEE 810 Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580 +E+LK KDNLA DL RCTENLE TKSQL ETEQ LAEVK+QL SAQ+SNSL+ETQLKCM Sbjct: 811 FEELKLEKDNLATDLARCTENLEMTKSQLYETEQLLAEVKAQLASAQKSNSLAETQLKCM 870 Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760 AESYRSLET A+ELE++VNLL+A+ SH + +A+C +L+E+LQRNE+C+ Sbjct: 871 AESYRSLETHAQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELEEQLQRNENCA 930 Query: 1761 ASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940 + CQETI LLG+Q ++RP +E++GSPYSE +QKGE Sbjct: 931 VCSSEADENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQSEVIGSPYSERSQKGE 990 Query: 1941 GYSENEP------KDDEAEMDSA-------------GALVLYNSPFTPSDTESNILLKSP 2063 + EP + D AEMDS L LY SP +PS+ E++I KSP Sbjct: 991 -FLPGEPATASLQEFDHAEMDSVTSANAQPHRVGAESPLDLYTSPCSPSENEASI-NKSP 1048 Query: 2064 LSTKHSKHR---XXXXXXXXXXXXXEKQRGFSRFFSTKGKVSGH 2186 +++KH KHR + RGFSRFFS+KG+ +GH Sbjct: 1049 INSKHPKHRPTKSTSSSSTSAPTPEKSSRGFSRFFSSKGR-NGH 1091 >ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-like isoform X1 [Citrus sinensis] gi|568839322|ref|XP_006473633.1| 
PREDICTED: filament-like plant protein 4-like isoform X2 [Citrus sinensis] Length = 1091 Score = 663 bits (1710), Expect = 0.0 Identities = 382/764 (50%), Positives = 492/764 (64%), Gaps = 37/764 (4%) Frame = +3 Query: 6 TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 +PHL + EFSLDN+QKF KENEF RNSELQASRNLCAKT Sbjct: 341 SPHLSPVSEFSLDNVQKFQKENEFLTERLLAMEEETKMLKEALAKRNSELQASRNLCAKT 400 Query: 186 ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329 A+KLQ+LE+Q+Q S Q K P KS Q+AS+P S+TSMSED +DD SCA+S Sbjct: 401 ASKLQSLEAQMQTSTQQKSPTKSVVQIAAEGYTSQNASNPPSLTSMSEDDNDDKVSCADS 460 Query: 330 WATTSISKFTHFXXXXXXXXXXXXXAC-NLELMDDFLEMEKLACSSNE--SNGAPSISES 500 WAT IS+ + +LELMDDFLEMEKLAC SN+ SNG + S Sbjct: 461 WATALISELSQIKKEKNVEKSNKAETPKHLELMDDFLEMEKLACLSNDTNSNGTITASNG 520 Query: 501 SNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTK 680 N+K S+ VN DA T +DL EQQ DMNP +++SSN E S+ + D Q L K Sbjct: 521 PNNKTSDIVNHDASGAVTSGEDLLSEQQRDMNPSVDKLSSNTESSTVNPEADAGQPQLMK 580 Query: 681 LQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQAC 860 L+SRIS++ E++SK++DM KI+ DIKRV++D H LHQHS + EE+ CSD +C +A Sbjct: 581 LRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLHQHSANCISEEVKCSDVSCSAEAY 640 Query: 861 PEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGG 1040 P DA + E++I L TV +ISQEL AAI+ IHDFVLFLGKEA A D + Sbjct: 641 PGDARLNTERKIDL---------TVQVISQELVAAITQIHDFVLFLGKEARAVHDTT-NE 690 Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220 NG ++KIEEF +F+KV+ +N L DFV +LS+V++KA+EL N++GYK T+ E NS DC Sbjct: 691 NGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRINVMGYKDTEIEPNSPDC 750 Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400 IDKVALPENKV+++D+SG RYPNGC+HIS+ TSDPEVP D ++V ++E T++CK SLEE Sbjct: 751 IDKVALPENKVIKKDTSGERYPNGCAHISNPTSDPEVPDDGSIVAAYESETTACKFSLEE 810 Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580 +E+LK KDNLA DL RCTENLE TKSQL ETEQ LAEVK+QL SAQ+SNSL+ETQLKCM Sbjct: 811 
FEELKLEKDNLATDLARCTENLEMTKSQLYETEQLLAEVKAQLASAQKSNSLAETQLKCM 870 Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760 AESYRSLET A+ELE++VNLL+A+ SH + +A+C +L+E+LQRNE+C+ Sbjct: 871 AESYRSLETHAQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELEEQLQRNENCA 930 Query: 1761 ASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940 + CQETI LLG+Q ++RP +E++GSPYSE + KGE Sbjct: 931 VCSSEADENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQSEVIGSPYSERSPKGE 990 Query: 1941 GYSENEP------KDDEAEMDSA-------------GALVLYNSPFTPSDTESNILLKSP 2063 + EP + D AE DS L LY SP +PS+ E++I KSP Sbjct: 991 -FLPGEPATASLQEFDHAETDSVTSANAQPHRVGAESPLDLYTSPCSPSENEASI-NKSP 1048 Query: 2064 LSTKHSKHR---XXXXXXXXXXXXXEKQRGFSRFFSTKGKVSGH 2186 +++KH KHR + RGFSRFFS+KG+ +GH Sbjct: 1049 INSKHPKHRPTKSTSSSSTSAPTPEKSSRGFSRFFSSKGR-NGH 1091 >ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] gi|550344134|gb|EEE81259.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] Length = 1063 Score = 658 bits (1697), Expect = 0.0 Identities = 385/759 (50%), Positives = 482/759 (63%), Gaps = 36/759 (4%) Frame = +3 Query: 6 TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 +PHL ++PEFSLDN+QKF+KENEF RNSELQASRNLCAKT Sbjct: 332 SPHLSSVPEFSLDNVQKFNKENEFLTERLFAVEEETKMLKEALAKRNSELQASRNLCAKT 391 Query: 186 ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329 A+KLQ+LE+Q Q+++ K KS Q+ S+P S+TS+SEDG+DD SCA+S Sbjct: 392 ASKLQSLEAQFQINNHQKSSPKSITQVPAEGYSSQNISNPPSLTSVSEDGNDDTQSCADS 451 Query: 330 WATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNESNGAPSISESSN 506 WATTS+S +HF A +LELMDDFLEMEKLAC + +S A +IS S N Sbjct: 452 WATTSVSDVSHFKKDNHIEKSNKAENAKHLELMDDFLEMEKLACLNADS--ATTISSSPN 509 Query: 507 HKASESVNQDALVETTMDK-DLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKL 683 +KASE+ N DAL E ++ K D E++ D++P+AN VS N + S+ S D D KL Sbjct: 510 NKASETANTDALAEVSLQKEDALSEEKRDLDPLANHVSCNKDSSAINSGSDADLSSFGKL 569 Query: 684 QSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACP 863 QSRIS++ ES+SKE D+ 
KIL +IK+V+ DA + S +E+H SD+TCDRQ CP Sbjct: 570 QSRISMLLESVSKEVDVDKILEEIKQVVHDAET-----AASCGSKEVHHSDATCDRQTCP 624 Query: 864 EDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGN 1043 EDA + EKEI+L Q+S IHDFVL LGKEA+A D S Sbjct: 625 EDAVIMGEKEITLLQESI-------------------IHDFVLLLGKEAMAVHDTSCDSI 665 Query: 1044 GLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCI 1223 GL++KIEEFS TF KVL ++ SL DF+ LS V++ A+ L FN+LGYK +AEINS DCI Sbjct: 666 GLSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALASGLRFNVLGYKCNEAEINSPDCI 725 Query: 1224 DKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEY 1403 DKVALPENKV+Q DS G + NGC++IS TS+PEVP NLVP + NT+SCK SLEE+ Sbjct: 726 DKVALPENKVIQNDSPGETFQNGCANISSPTSNPEVPDYGNLVPGYGSNTTSCKVSLEEF 785 Query: 1404 EQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMA 1583 E+LKS KD +A+DL RCTENLE TKSQL ETEQ LAEVKSQL SAQ+SNSL+ETQLKCMA Sbjct: 786 EELKSEKDTMAMDLARCTENLEMTKSQLHETEQLLAEVKSQLVSAQKSNSLAETQLKCMA 845 Query: 1584 ESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCSA 1763 ESYRSLETRA+ELE++VNLL+ + SHQD L RC +L+E+LQ ES SA Sbjct: 846 ESYRSLETRAQELETEVNLLRVKTETLESELQEEKTSHQDALTRCKELEEQLQTKESSSA 905 Query: 1764 SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGEG 1943 I CQETIFLLG+Q +RP TE++GSPYSE +Q G+G Sbjct: 906 D---GIDLKSKQEKEITAAAEKLAECQETIFLLGKQLKYLRPQTEIMGSPYSERSQSGDG 962 Query: 1944 YSENEP--------KDDEAEMDSAGALVL-----------YNSPFTPSDTESNILLKSPL 2066 +++EP D+AEMD+ ++ YN P PSDTESN LL+SP+ Sbjct: 963 IAKDEPTISGINLQDSDQAEMDTGASVNFLKAGSESPSDSYNHPCYPSDTESN-LLRSPV 1021 Query: 2067 STKHSKHRXXXXXXXXXXXXXEKQ---RGFSRFFSTKGK 2174 KH KHR + RGFSRFFS+KGK Sbjct: 1022 GLKHPKHRPTKSTSSSSSSTPTPEKHPRGFSRFFSSKGK 1060 >ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] gi|550344133|gb|ERP63976.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa] Length = 991 Score = 658 bits (1697), Expect = 0.0 Identities = 385/759 (50%), Positives = 482/759 (63%), Gaps = 36/759 (4%) Frame = +3 Query: 6 
TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 +PHL ++PEFSLDN+QKF+KENEF RNSELQASRNLCAKT Sbjct: 260 SPHLSSVPEFSLDNVQKFNKENEFLTERLFAVEEETKMLKEALAKRNSELQASRNLCAKT 319 Query: 186 ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329 A+KLQ+LE+Q Q+++ K KS Q+ S+P S+TS+SEDG+DD SCA+S Sbjct: 320 ASKLQSLEAQFQINNHQKSSPKSITQVPAEGYSSQNISNPPSLTSVSEDGNDDTQSCADS 379 Query: 330 WATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNESNGAPSISESSN 506 WATTS+S +HF A +LELMDDFLEMEKLAC + +S A +IS S N Sbjct: 380 WATTSVSDVSHFKKDNHIEKSNKAENAKHLELMDDFLEMEKLACLNADS--ATTISSSPN 437 Query: 507 HKASESVNQDALVETTMDK-DLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKL 683 +KASE+ N DAL E ++ K D E++ D++P+AN VS N + S+ S D D KL Sbjct: 438 NKASETANTDALAEVSLQKEDALSEEKRDLDPLANHVSCNKDSSAINSGSDADLSSFGKL 497 Query: 684 QSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACP 863 QSRIS++ ES+SKE D+ KIL +IK+V+ DA + S +E+H SD+TCDRQ CP Sbjct: 498 QSRISMLLESVSKEVDVDKILEEIKQVVHDAET-----AASCGSKEVHHSDATCDRQTCP 552 Query: 864 EDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGN 1043 EDA + EKEI+L Q+S IHDFVL LGKEA+A D S Sbjct: 553 EDAVIMGEKEITLLQESI-------------------IHDFVLLLGKEAMAVHDTSCDSI 593 Query: 1044 GLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCI 1223 GL++KIEEFS TF KVL ++ SL DF+ LS V++ A+ L FN+LGYK +AEINS DCI Sbjct: 594 GLSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALASGLRFNVLGYKCNEAEINSPDCI 653 Query: 1224 DKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEY 1403 DKVALPENKV+Q DS G + NGC++IS TS+PEVP NLVP + NT+SCK SLEE+ Sbjct: 654 DKVALPENKVIQNDSPGETFQNGCANISSPTSNPEVPDYGNLVPGYGSNTTSCKVSLEEF 713 Query: 1404 EQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMA 1583 E+LKS KD +A+DL RCTENLE TKSQL ETEQ LAEVKSQL SAQ+SNSL+ETQLKCMA Sbjct: 714 EELKSEKDTMAMDLARCTENLEMTKSQLHETEQLLAEVKSQLVSAQKSNSLAETQLKCMA 773 Query: 1584 ESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCSA 1763 ESYRSLETRA+ELE++VNLL+ + SHQD L RC +L+E+LQ ES SA Sbjct: 774 
ESYRSLETRAQELETEVNLLRVKTETLESELQEEKTSHQDALTRCKELEEQLQTKESSSA 833 Query: 1764 SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGEG 1943 I CQETIFLLG+Q +RP TE++GSPYSE +Q G+G Sbjct: 834 D---GIDLKSKQEKEITAAAEKLAECQETIFLLGKQLKYLRPQTEIMGSPYSERSQSGDG 890 Query: 1944 YSENEP--------KDDEAEMDSAGALVL-----------YNSPFTPSDTESNILLKSPL 2066 +++EP D+AEMD+ ++ YN P PSDTESN LL+SP+ Sbjct: 891 IAKDEPTISGINLQDSDQAEMDTGASVNFLKAGSESPSDSYNHPCYPSDTESN-LLRSPV 949 Query: 2067 STKHSKHRXXXXXXXXXXXXXEKQ---RGFSRFFSTKGK 2174 KH KHR + RGFSRFFS+KGK Sbjct: 950 GLKHPKHRPTKSTSSSSSSTPTPEKHPRGFSRFFSSKGK 988 >ref|XP_007017761.1| Uncharacterized protein isoform 7 [Theobroma cacao] gi|508723089|gb|EOY14986.1| Uncharacterized protein isoform 7 [Theobroma cacao] Length = 1107 Score = 657 bits (1694), Expect = 0.0 Identities = 389/768 (50%), Positives = 500/768 (65%), Gaps = 40/768 (5%) Frame = +3 Query: 3 ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182 +TPHL T +FSLDN QK KENEF RNSEL ASRNLCAK Sbjct: 345 STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 404 Query: 183 TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326 T++KLQTLE+QL +S Q + P K SQ+ S+P SVTS+SEDG+DD SCAE Sbjct: 405 TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 464 Query: 327 SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497 SWAT +S+ + F A +L+LMDDFLEMEKLACSSN+S NG +IS+ Sbjct: 465 SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 524 Query: 498 SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677 S+N+K SESVN DA E + K+L E+Q ++P NQVSSN++LS + D DQLP+ Sbjct: 525 STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 583 Query: 678 KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857 KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS + EE+H SD TC QA Sbjct: 584 KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 643 Query: 858 CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037 +TAEKEI++S E V +SQEL+AAIS IHDFVL LGKEA A D Sbjct: 
644 HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 703 Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217 GN L+ KIEEFS T++KVL +N+SL DF+ LS +++KA++L N+LGYK + EINS D Sbjct: 704 GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 763 Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397 CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV +E + S K S E Sbjct: 764 CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 822 Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577 E+E+LK K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC Sbjct: 823 EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 882 Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757 MAESYRSLETRA+ELE++VNLL+ + SH DTLARC +L+E+LQRNE+C Sbjct: 883 MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 942 Query: 1758 SASLAA---DIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMA 1928 SA AA D+ I+L+ N + T+++GSPY+E + Sbjct: 943 SACAAAADNDLKNKQVSVYFNLCILRWILP-NPLIYLILLPRNIIYSCTDMMGSPYNERS 1001 Query: 1929 QKGEGYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNIL 2051 QKGEG E+EP D+ E+D+A + + SP +PSDT++N L Sbjct: 1002 QKGEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDAN-L 1060 Query: 2052 LKSPLSTKHSKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186 L+SP+++ H KH+ EKQ RGFSRFFS+KGK +GH Sbjct: 1061 LRSPINSNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGK-TGH 1107 >ref|XP_002306918.2| hypothetical protein POPTR_0005s25830g [Populus trichocarpa] gi|550339754|gb|EEE93914.2| hypothetical protein POPTR_0005s25830g [Populus trichocarpa] Length = 1077 Score = 641 bits (1653), Expect = 0.0 Identities = 377/757 (49%), Positives = 488/757 (64%), Gaps = 34/757 (4%) Frame = +3 Query: 6 TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 +PH ++ EFSLDN+QKFHKENEF RNSELQASRNLCAKT Sbjct: 332 SPHSSSVTEFSLDNVQKFHKENEFLTERLFAMEEETKMLKEALAKRNSELQASRNLCAKT 391 Query: 186 
ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329 A+KLQ+LE+Q +S+Q K KS Q+ S+P S+T++SEDG+DD SCA+S Sbjct: 392 ASKLQSLEAQFHISNQVKSSPKSIIQVPAEGYSSQNISNPPSLTNVSEDGNDDTQSCADS 451 Query: 330 WATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNESNGAPSISESSN 506 WAT SIS+F++F A +LE MDDFLEMEKLAC + +S A + S S N Sbjct: 452 WATISISEFSNFKKYNHSEKLNKAENAKHLEFMDDFLEMEKLACLNADS--AATTSNSPN 509 Query: 507 HKASESVNQDALVETTMDKD-LHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKL 683 +K SE N+DA E ++ K+ E++ +++P N +S N + S+ +S D D KL Sbjct: 510 NKTSEVANRDASGEISLQKENTLSEEKHNLDPPVNHLSCNKDSSAIESGSDADLSSFMKL 569 Query: 684 QSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTC-DRQAC 860 Q RIS++ +S SK++D+ KIL DIK+V+QDA S +E HCSD+T DRQ C Sbjct: 570 QLRISMLLDSGSKKADLGKILEDIKQVVQDAETGA-----SCVSKEAHCSDATTHDRQTC 624 Query: 861 PEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGG 1040 PEDAG+ EKEI L Q+S + + +SQEL AIS IHDFVL LGKEA+ D S Sbjct: 625 PEDAGIMGEKEIELFQESKTAAQIMHTVSQELLPAISQIHDFVLLLGKEAMTVHDTSCDS 684 Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220 GL++KI+EFS TF+KVL+++ SL DFV L+H+++ A+ L FN+LGYK +AEI+S DC Sbjct: 685 IGLSQKIKEFSITFNKVLYSDRSLVDFVSDLAHILALASGLRFNVLGYKGNEAEISSPDC 744 Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400 IDK+ALPENKVVQ++SS Y NGC++IS TS+PEVP D NLV + NT+SCK SLEE Sbjct: 745 IDKIALPENKVVQKNSSVETYQNGCANISSPTSNPEVPDDGNLVLGYGSNTTSCKVSLEE 804 Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580 +E+LKS KDN+A+DL RCTEN E TKSQL ETEQ LAEVKSQL SAQ+SNSL+ETQLKCM Sbjct: 805 FEELKSEKDNMAMDLARCTENFEMTKSQLHETEQLLAEVKSQLASAQKSNSLAETQLKCM 864 Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760 ESYRSLETRA+ELE++VNLL+ + SHQ L RC +L+E+LQ NES Sbjct: 865 TESYRSLETRAQELETEVNLLRLKTETLENVLQEEKKSHQGALTRCKELEEQLQTNES-- 922 Query: 1761 ASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940 S DI CQETIFLLG+Q N++ P TE++GSPYSE +Q G+ Sbjct: 923 
-STVTDI--ECKQEKEIAAAAEKLAECQETIFLLGKQLNSLCPQTEIMGSPYSERSQIGD 979 Query: 1941 GYSENEPKD--------DEAEMDSAGALVL-----------YNSPFTPSDTESNILLKSP 2063 ++E+EP D+AEMD+ G + YN P +PSDTES+ LL+SP Sbjct: 980 VFAEDEPTTSGMNLQDFDQAEMDTGGLANIHKAGAESPINSYNHPCSPSDTESS-LLRSP 1038 Query: 2064 LSTKHSKHRXXXXXXXXXXXXXEKQRGFSRFFSTKGK 2174 +++K KH + RGFSRFFS+KGK Sbjct: 1039 VASKPPKH-GPTKSSSSAPMLEKHSRGFSRFFSSKGK 1074 >ref|XP_006342030.1| PREDICTED: filament-like plant protein 6-like [Solanum tuberosum] Length = 1093 Score = 618 bits (1594), Expect = e-174 Identities = 359/755 (47%), Positives = 471/755 (62%), Gaps = 31/755 (4%) Frame = +3 Query: 3 ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182 ++P +LP+FS D++QKFHKENE RNSELQASR++CAK Sbjct: 343 SSPQFSSLPDFSFDSVQKFHKENEQLTERLLAMEEETKMLKEALAHRNSELQASRSICAK 402 Query: 183 TANKLQTLESQLQVSDQHKHPLKSQHASSPS-------------VTSMSEDGHDDVGSCA 323 T++KLQ+LE+QLQ + + K P KS PS + SMSEDG+DD SCA Sbjct: 403 TSSKLQSLEAQLQANVEQKSPQKSTIRRQPSEGSLSHEANHLPRLASMSEDGNDDNVSCA 462 Query: 324 ESWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNESNGAPSISES 500 SW T +S TH A +L+LMDDFLEMEKLA S+++NGA S + Sbjct: 463 SSWTTALMSDLTHVKKEKNFDSPHKSESASHLDLMDDFLEMEKLAYQSSDTNGAVSSPDI 522 Query: 501 SNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTK 680 N+ E+ D + T D ++ + + +Q S N E+SS P +D K Sbjct: 523 PNNARPETTKVDTSMHVTTSPDSQLKEHNETSVSGDQASRNEEVSSQSHQPLSDTSISMK 582 Query: 681 LQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQAC 860 LQSRIS V ESLSK++D+++I D++ ++Q+ N L S S E S++ + Q Sbjct: 583 LQSRISTVLESLSKDADIQRIQEDLREIVQEMRNALIPQSTKSIVEITLSSNTATESQPS 642 Query: 861 PEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGG 1040 +D EKEI +S+DS E++ IS+EL+ A+S IHDFVLFLGKEA A Q +P G Sbjct: 643 LDDGEANLEKEIPVSEDSKSCNESIHGISKELADAMSQIHDFVLFLGKEAKAIQGTAPDG 702 Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220 +G+NEK+++FSAT+ +V+ N +S+ +FVL LSHV+S A++L FNILGYK ++ EI++SDC Sbjct: 703 
SGINEKLDDFSATYVEVISNKLSMVNFVLDLSHVLSNASQLHFNILGYKNSETEISTSDC 762 Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400 IDKVALPENK +Q SG Y NGC+H SDSTSDP++PH+ +LVP+ E ++S KCSLEE Sbjct: 763 IDKVALPENKDLQH--SGEVYANGCAHFSDSTSDPDIPHEGSLVPTSESTSTSLKCSLEE 820 Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580 EQLK K+N+ALDL R +ENLESTKSQL ETEQ LAEVKSQL SAQ++NSL+ETQLKCM Sbjct: 821 VEQLKLEKENMALDLARYSENLESTKSQLTETEQLLAEVKSQLVSAQKANSLAETQLKCM 880 Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760 AESY SLETR EEL+++VN LQA+ +HQDTLA C DL+E+LQR ES Sbjct: 881 AESYNSLETRTEELQTEVNRLQAKIENLDNELQEEKKNHQDTLASCKDLEEQLQRMES-- 938 Query: 1761 ASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940 AAD+ CQETIFLLG+Q N++RP TE +GSPY + + KGE Sbjct: 939 ---AADLDAKTNQEKDLTAAAEKLAECQETIFLLGKQLNSLRPQTEFMGSPYIDRSSKGE 995 Query: 1941 GYSE-------NEPKDDEAEMDSAGALV--------LYNSPFTPSDTESNILLKSPLSTK 2075 G+ E N +D AEMDSA ++ +YN ++PSDTE N L+SP+S K Sbjct: 996 GFREESTTTSMNIHDNDLAEMDSASSVKATCESPVDIYNVSYSPSDTEVNNPLRSPISLK 1055 Query: 2076 HSKHR-XXXXXXXXXXXXXEKQ-RGFSRFFSTKGK 2174 KHR EKQ RGFSRFFS+KGK Sbjct: 1056 SPKHRSTKSGSSSSAGPTPEKQSRGFSRFFSSKGK 1090 >ref|XP_004168855.1| PREDICTED: LOW QUALITY PROTEIN: filament-like plant protein 4-like [Cucumis sativus] Length = 1084 Score = 616 bits (1588), Expect = e-173 Identities = 366/759 (48%), Positives = 476/759 (62%), Gaps = 32/759 (4%) Frame = +3 Query: 6 TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 TPH+ ++P+FSLDN KF KEN+F RNSELQ SR++CAKT Sbjct: 338 TPHMLSVPDFSLDNALKFQKENDFLTERMLAMEEETKMLKEALAKRNSELQTSRSMCAKT 397 Query: 186 ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329 A KLQ LE+QLQ + + KS Q+ S P S+TSMSEDG++D SCA++ Sbjct: 398 ATKLQNLEAQLQNGNHQRSSPKSVVQYTADGFSCQNTSHPPSLTSMSEDGNEDGQSCADT 457 Query: 330 WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509 + + S +HF +L LMDDFLEMEKLAC SN+SN A S S+N+ Sbjct: 458 
LSIAATSDISHFREKKNEKLSKTESGSHLGLMDDFLEMEKLACQSNDSNEAILASNSTNN 517 Query: 510 KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689 K SE V + EQ D +P VSS+V+LS+ +D ++ LPL KL+S Sbjct: 518 KDSEVVVHQE------SNGIQSEQHLDSSPSTEVVSSSVDLSTECAD--SNGLPLLKLRS 569 Query: 690 RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPS--SDFEEIHCSDSTCDRQACP 863 RIS++FES+SK++D KIL DIK ++QDAH+ L Q + + S E+ D+TCDRQA P Sbjct: 570 RISMIFESISKDADTGKILEDIKCIVQDAHDALQQPTINCVSCVSEVQSPDTTCDRQANP 629 Query: 864 EDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDP-SPGG 1040 +DAG+ E+EI+ SQ PV P+ SQEL AAIS IH+FVLFLGKEA D SP G Sbjct: 630 DDAGLGVEREIAFSQ---PVAHNQPM-SQELEAAISQIHEFVLFLGKEASRVHDTISPDG 685 Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220 +GL +K+EEFS+TF+K++H N SL DFV+ LSHV+S+A+EL F+ +G K TD + NS DC Sbjct: 686 HGLGQKVEEFSSTFNKIVHANTSLVDFVVILSHVLSEASELRFSFIGCKDTDGDTNSPDC 745 Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400 IDKVALPE+KVVQ DS RY NGCSHIS TSD EVP+D NLV S+E N+ K S E+ Sbjct: 746 IDKVALPEHKVVQNDSIDERYTNGCSHISSPTSDLEVPYDGNLVSSYESNSRLPKFSSED 805 Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580 E+LK K+NL+ DL RCTE+LE+ K +LQETEQ LAE +SQL AQ+SNSLSETQLKCM Sbjct: 806 IEELKLAKENLSKDLARCTEDLEAAKRKLQETEQLLAESRSQLAFAQKSNSLSETQLKCM 865 Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760 AESYRSLE RAE+LE+++NLL+A++ +H + L++C +LQE+LQRNE C Sbjct: 866 AESYRSLEARAEDLETELNLLRAKSETLENDLQDEKRNHHEALSKCQELQEQLQRNEVCC 925 Query: 1761 A--SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934 A S A D CQETIFLL +Q ++RP + GSP+SE + + Sbjct: 926 AICSSAIDGDPQKSQEIELTAAAEKLAECQETIFLLSKQLKSLRPQPDFSGSPFSERSHR 985 Query: 1935 GEGYSENEPKD--------DEAEMDSAGA----LVLYNSPFTPSDTESNILLKSPLSTKH 2078 GE + E+EP D +EMD+A + +V SP + SD E L+SP+++KH Sbjct: 986 GEEFIEDEPSKSGTNLLDLDRSEMDTATSTMTQIVGAESPCSASDGEGGSFLRSPINSKH 1045 Query: 2079 SKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186 KHR EKQ RGFSRFFS+KGK + H Sbjct: 1046 
PKHRPTKSSSSSSSSAPTPEKQTRGFSRFFSSKGKNNSH 1084 >ref|XP_004136392.1| PREDICTED: filament-like plant protein 4-like [Cucumis sativus] Length = 1078 Score = 616 bits (1588), Expect = e-173 Identities = 366/759 (48%), Positives = 476/759 (62%), Gaps = 32/759 (4%) Frame = +3 Query: 6 TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 TPH+ ++P+FSLDN KF KEN+F RNSELQ SR++CAKT Sbjct: 332 TPHMLSVPDFSLDNALKFQKENDFLTERMLAMEEETKMLKEALAKRNSELQTSRSMCAKT 391 Query: 186 ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329 A KLQ LE+QLQ + + KS Q+ S P S+TSMSEDG++D SCA++ Sbjct: 392 ATKLQNLEAQLQNGNHQRSSPKSVVQYTADGFSCQNTSHPPSLTSMSEDGNEDGQSCADT 451 Query: 330 WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509 + + S +HF +L LMDDFLEMEKLAC SN+SN A S S+N+ Sbjct: 452 LSIAATSDISHFREKKNEKLSKTESGSHLGLMDDFLEMEKLACQSNDSNEAILASNSTNN 511 Query: 510 KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689 K SE V + EQ D +P VSS+V+LS+ +D ++ LPL KL+S Sbjct: 512 KDSEVVVHQE------SNGIQSEQHLDSSPSTEVVSSSVDLSTECAD--SNGLPLLKLRS 563 Query: 690 RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPS--SDFEEIHCSDSTCDRQACP 863 RIS++FES+SK++D KIL DIK ++QDAH+ L Q + + S E+ D+TCDRQA P Sbjct: 564 RISMIFESISKDADTGKILEDIKCIVQDAHDALQQPTINCVSCVSEVQSPDTTCDRQANP 623 Query: 864 EDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDP-SPGG 1040 +DAG+ E+EI+ SQ PV P+ SQEL AAIS IH+FVLFLGKEA D SP G Sbjct: 624 DDAGLGVEREIAFSQ---PVAHNQPM-SQELEAAISQIHEFVLFLGKEASRVHDTISPDG 679 Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220 +GL +K+EEFS+TF+K++H N SL DFV+ LSHV+S+A+EL F+ +G K TD + NS DC Sbjct: 680 HGLGQKVEEFSSTFNKIVHANTSLVDFVVILSHVLSEASELRFSFIGCKDTDGDTNSPDC 739 Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400 IDKVALPE+KVVQ DS RY NGCSHIS TSD EVP+D NLV S+E N+ K S E+ Sbjct: 740 IDKVALPEHKVVQNDSIDERYTNGCSHISSPTSDLEVPYDGNLVSSYESNSRLPKFSSED 799 Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580 E+LK K+NL+ DL RCTE+LE+ K +LQETEQ LAE 
+SQL AQ+SNSLSETQLKCM Sbjct: 800 IEELKLAKENLSKDLARCTEDLEAAKRKLQETEQLLAESRSQLAFAQKSNSLSETQLKCM 859 Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760 AESYRSLE RAE+LE+++NLL+A++ +H + L++C +LQE+LQRNE C Sbjct: 860 AESYRSLEARAEDLETELNLLRAKSETLENDLQDEKRNHHEALSKCQELQEQLQRNEVCC 919 Query: 1761 A--SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934 A S A D CQETIFLL +Q ++RP + GSP+SE + + Sbjct: 920 AICSSAIDGDPQKSQEIELTAAAEKLAECQETIFLLSKQLKSLRPQPDFSGSPFSERSHR 979 Query: 1935 GEGYSENEPKD--------DEAEMDSAGA----LVLYNSPFTPSDTESNILLKSPLSTKH 2078 GE + E+EP D +EMD+A + +V SP + SD E L+SP+++KH Sbjct: 980 GEEFIEDEPSKSGTNLLDLDRSEMDTATSTMTQIVGAESPCSASDGEGGSFLRSPINSKH 1039 Query: 2079 SKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186 KHR EKQ RGFSRFFS+KGK + H Sbjct: 1040 PKHRPTKSSSSSSSSAPTPEKQTRGFSRFFSSKGKNNSH 1078 >ref|XP_002510512.1| Myosin heavy chain, striated muscle, putative [Ricinus communis] gi|223551213|gb|EEF52699.1| Myosin heavy chain, striated muscle, putative [Ricinus communis] Length = 1041 Score = 610 bits (1572), Expect = e-171 Identities = 370/758 (48%), Positives = 469/758 (61%), Gaps = 35/758 (4%) Frame = +3 Query: 6 TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185 +PHL +PEFSLDN QKFHKENEF RNSELQASRNLCAKT Sbjct: 340 SPHLSAVPEFSLDNAQKFHKENEFLTERLLAMEEETKMLKEALAKRNSELQASRNLCAKT 399 Query: 186 ANKLQTLESQLQVSDQHKH--------PLK---SQHASSP-SVTSMSEDGHDDVGSCAES 329 A++LQ+LE+Q VS+Q K P++ SQ+ S+P S+TSMSEDG+DD SCA+S Sbjct: 400 ASRLQSLEAQ--VSNQQKSSPTSVVQVPIEGYSSQNMSNPPSLTSMSEDGNDDDRSCADS 457 Query: 330 WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509 WAT+ IS+ + E S Sbjct: 458 WATSLISELSQLK-----------------------------------------KEKSTE 476 Query: 510 KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689 K +++ N L MD L E+ +N N VSS +S+ S + DQ L KL+S Sbjct: 477 KLNKTKNTQHL--ELMDDFLEMEKLACLNANVNLVSS---MSAANSGSEADQPCLVKLRS 531 Query: 690 RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACPED 869 RIS++ ES+S+++DM KIL 
D++R++QD H + SS E++ +D+TC PE Sbjct: 532 RISMLLESISQDADMGKILEDVQRIVQDTHGAV-----SSVSEDVRATDATC-----PEY 581 Query: 870 AGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGNGL 1049 A +T +KEI+L QD+ +TV ++QEL+ A+S IHDFVLFLGKEA+A D S G+ L Sbjct: 582 ASITGDKEITLFQDTNAATDTVRSVNQELATAVSSIHDFVLFLGKEAMAVHDTSSDGSDL 641 Query: 1050 NEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCIDK 1229 ++KIE FS TF+KVL+ N SL DF+ LS V++KA+EL FN+LGYK ++AEINSSDCIDK Sbjct: 642 SQKIEHFSVTFNKVLNGNTSLIDFIFYLSCVLAKASELRFNVLGYKGSEAEINSSDCIDK 701 Query: 1230 VALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEYEQ 1409 VALPENKV+Q DSSG Y N C+HIS TS+PEVP D +LV + NT+ CK SLEE+E+ Sbjct: 702 VALPENKVLQRDSSGESYQNSCAHISSPTSNPEVPDDGSLVSGYGSNTTLCKVSLEEFEE 761 Query: 1410 LKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMAES 1589 LKS K+N+ALDL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKCMAES Sbjct: 762 LKSEKNNVALDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKCMAES 821 Query: 1590 YRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS-AS 1766 YRSLE RAEELE++VNLLQA+A H D L+R +L+E+LQ ESCS S Sbjct: 822 YRSLEARAEELETEVNLLQAKAETLENELQDEKQCHWDALSRSKELEEQLQTKESCSVCS 881 Query: 1767 LAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGEGY 1946 AAD CQETIFLLG+Q A+RP TEL+GS YSE ++KG+G+ Sbjct: 882 AAADAENKANQDRELAAAAEKLAECQETIFLLGKQLKALRPQTELMGSAYSERSRKGDGF 941 Query: 1947 SENEPKD--------DEAEMDS--------AGA---LVLYNSPFTPSDTESNILLKSPLS 2069 +E+EP D+AEMD+ AGA + LYN P +PSDTESN L +SPL+ Sbjct: 942 AEDEPTTSGMNLQDFDQAEMDAIVSTNHHRAGAESPMDLYNQPCSPSDTESN-LSRSPLN 1000 Query: 2070 TKHSKHR---XXXXXXXXXXXXXEKQRGFSRFFSTKGK 2174 +K KHR + RGFSRFFS KGK Sbjct: 1001 SKQPKHRSTKSTSSSSSHMATPEKHSRGFSRFFSAKGK 1038 >ref|XP_007017762.1| Uncharacterized protein isoform 8, partial [Theobroma cacao] gi|508723090|gb|EOY14987.1| Uncharacterized protein isoform 8, partial [Theobroma cacao] Length = 951 Score = 604 bits (1557), Expect = e-170 Identities = 337/606 (55%), Positives = 423/606 (69%), Gaps = 15/606 (2%) Frame = +3 Query: 3 
ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182 +TPHL T +FSLDN QK KENEF RNSEL ASRNLCAK Sbjct: 341 STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 400 Query: 183 TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326 T++KLQTLE+QL +S Q + P K SQ+ S+P SVTS+SEDG+DD SCAE Sbjct: 401 TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 460 Query: 327 SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497 SWAT +S+ + F A +L+LMDDFLEMEKLACSSN+S NG +IS+ Sbjct: 461 SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 520 Query: 498 SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677 S+N+K SESVN DA E + K+L E+Q ++P NQVSSN++LS + D DQLP+ Sbjct: 521 STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 579 Query: 678 KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857 KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS + EE+H SD TC QA Sbjct: 580 KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 639 Query: 858 CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037 +TAEKEI++S E V +SQEL+AAIS IHDFVL LGKEA A D Sbjct: 640 HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 699 Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217 GN L+ KIEEFS T++KVL +N+SL DF+ LS +++KA++L N+LGYK + EINS D Sbjct: 700 GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 759 Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397 CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV +E + S K S E Sbjct: 760 CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 818 Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577 E+E+LK K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC Sbjct: 819 EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 878 Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757 MAESYRSLETRA+ELE++VNLL+ + SH DTLARC +L+E+LQRNE+C Sbjct: 879 
MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 938 Query: 1758 SASLAA 1775 SA AA Sbjct: 939 SACAAA 944 >ref|XP_007017760.1| Uncharacterized protein isoform 6 [Theobroma cacao] gi|508723088|gb|EOY14985.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 837 Score = 604 bits (1557), Expect = e-170 Identities = 337/606 (55%), Positives = 423/606 (69%), Gaps = 15/606 (2%) Frame = +3 Query: 3 ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182 +TPHL T +FSLDN QK KENEF RNSEL ASRNLCAK Sbjct: 186 STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 245 Query: 183 TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326 T++KLQTLE+QL +S Q + P K SQ+ S+P SVTS+SEDG+DD SCAE Sbjct: 246 TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 305 Query: 327 SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497 SWAT +S+ + F A +L+LMDDFLEMEKLACSSN+S NG +IS+ Sbjct: 306 SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 365 Query: 498 SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677 S+N+K SESVN DA E + K+L E+Q ++P NQVSSN++LS + D DQLP+ Sbjct: 366 STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 424 Query: 678 KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857 KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS + EE+H SD TC QA Sbjct: 425 KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 484 Query: 858 CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037 +TAEKEI++S E V +SQEL+AAIS IHDFVL LGKEA A D Sbjct: 485 HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 544 Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217 GN L+ KIEEFS T++KVL +N+SL DF+ LS +++KA++L N+LGYK + EINS D Sbjct: 545 GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 604 Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397 CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV +E + S K S E Sbjct: 605 
CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 663 Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577 E+E+LK K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC Sbjct: 664 EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 723 Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757 MAESYRSLETRA+ELE++VNLL+ + SH DTLARC +L+E+LQRNE+C Sbjct: 724 MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 783 Query: 1758 SASLAA 1775 SA AA Sbjct: 784 SACAAA 789