BLASTX nr result

ID: Paeonia24_contig00018578 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00018578
         (2467 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN60525.1| hypothetical protein VITISV_000522 [Vitis vinifera]   801   0.0  
ref|XP_002280306.2| PREDICTED: filament-like plant protein 4-lik...   748   0.0  
emb|CBI19835.3| unnamed protein product [Vitis vinifera]              699   0.0  
ref|XP_007225499.1| hypothetical protein PRUPE_ppa000819mg [Prun...   689   0.0  
ref|XP_007017758.1| Uncharacterized protein isoform 4 [Theobroma...   687   0.0  
ref|XP_007017757.1| Uncharacterized protein isoform 3 [Theobroma...   687   0.0  
ref|XP_007017755.1| Uncharacterized protein isoform 1 [Theobroma...   687   0.0  
gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis]     680   0.0  
ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citr...   667   0.0  
ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-lik...   663   0.0  
ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Popu...   658   0.0  
ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Popu...   658   0.0  
ref|XP_007017761.1| Uncharacterized protein isoform 7 [Theobroma...   657   0.0  
ref|XP_002306918.2| hypothetical protein POPTR_0005s25830g [Popu...   641   0.0  
ref|XP_006342030.1| PREDICTED: filament-like plant protein 6-lik...   618   e-174
ref|XP_004168855.1| PREDICTED: LOW QUALITY PROTEIN: filament-lik...   616   e-173
ref|XP_004136392.1| PREDICTED: filament-like plant protein 4-lik...   616   e-173
ref|XP_002510512.1| Myosin heavy chain, striated muscle, putativ...   610   e-171
ref|XP_007017762.1| Uncharacterized protein isoform 8, partial [...   604   e-170
ref|XP_007017760.1| Uncharacterized protein isoform 6 [Theobroma...   604   e-170

>emb|CAN60525.1| hypothetical protein VITISV_000522 [Vitis vinifera]
          Length = 1085

 Score =  801 bits (2069), Expect = 0.0
 Identities = 444/762 (58%), Positives = 533/762 (69%), Gaps = 35/762 (4%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            +PHL  LPEFS+DN+Q+ HK+NEF                     RNSELQASRN+CAKT
Sbjct: 338  SPHLSPLPEFSIDNVQQCHKDNEFLTERLLGMEEETKMLKEALAKRNSELQASRNICAKT 397

Query: 186  ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329
            A+KLQ LE+QLQ+++Q K P KS           Q+AS+P S+TSMSEDG+DD  SCAES
Sbjct: 398  ASKLQNLEAQLQMNNQQKSPPKSNLQIPNDGSLSQNASNPPSMTSMSEDGNDDAVSCAES 457

Query: 330  WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509
            WAT   S  + F             A +LELMDDFLEMEKLAC SN SNGA S+    N+
Sbjct: 458  WATGLXSGLSQFKKEN---------ANHLELMDDFLEMEKLACLSNNSNGAFSV----NN 504

Query: 510  KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689
            K SE+V+  A+ E T  KDL  EQ+ D++ +ANQVSSN ELS      D D LPLTKL+S
Sbjct: 505  KRSEAVDHGAIAEVTSSKDLQLEQKHDLDSLANQVSSNAELSEVNPQSDKDLLPLTKLRS 564

Query: 690  RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACPED 869
            RIS+VFES+S++SD  KIL +IKRVLQD H+ LHQHS S   EEIHCSD+TCDRQACPED
Sbjct: 565  RISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQHSVSCVVEEIHCSDATCDRQACPED 624

Query: 870  AGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGNGL 1049
            AGVTAE+EISLSQD  P  +T+ IISQEL+AAIS IH+FVLFLGKEA+A Q  SP GNG 
Sbjct: 625  AGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGASPDGNGW 684

Query: 1050 NEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCIDK 1229
            + KIE+FSAT +KVL   +S+ DF+  LS+V++KA+EL+FNILGYK    EINSSDCIDK
Sbjct: 685  SRKIEDFSATVNKVLCXKMSVIDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDCIDK 744

Query: 1230 VALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEYEQ 1409
            VALPENKVVQ+D+SG RYPNGC+HISDSTSDPEVPHD NLVP F+ N +SC CSLEE+EQ
Sbjct: 745  VALPENKVVQKDTSGERYPNGCAHISDSTSDPEVPHDGNLVPGFKSNAASCNCSLEEFEQ 804

Query: 1410 LKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMAES 1589
            LKS KD L + L RCTENLESTKSQLQETEQ LAE KSQLTSAQ+ NSL++TQLKCMAES
Sbjct: 805  LKSEKDTLEMHLARCTENLESTKSQLQETEQLLAEAKSQLTSAQKLNSLADTQLKCMAES 864

Query: 1590 YRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCSA-- 1763
            YRSLETRAEELE++VNLL+ +             SH++ L RC DLQE+L+RNE CS   
Sbjct: 865  YRSLETRAEELETEVNLLRGKTETLESEFQEEKRSHENALIRCKDLQEQLERNEGCSVCA 924

Query: 1764 -SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940
             S AADI                   CQETIFLLG+Q  AMRP T+L+GSP SE +Q+ E
Sbjct: 925  MSSAADIDVKTKQERELASAADKLAECQETIFLLGKQLXAMRPQTDLLGSPQSERSQRVE 984

Query: 1941 GYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNILLKSP 2063
             + E+EP          D+ + +S  +           L LYN+P +PS+TESN+LL+SP
Sbjct: 985  VFHEDEPTTSGMNLQDIDQVDTESTASINVHRIGGESPLELYNTPRSPSETESNLLLRSP 1044

Query: 2064 LSTKHSKHRXXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186
            + +KH KHR             EKQ RGFSRFFS+KGK +GH
Sbjct: 1045 VGSKHPKHRPTKSNSSSSAPTPEKQSRGFSRFFSSKGK-NGH 1085


>ref|XP_002280306.2| PREDICTED: filament-like plant protein 4-like [Vitis vinifera]
          Length = 1040

 Score =  748 bits (1930), Expect = 0.0
 Identities = 424/762 (55%), Positives = 508/762 (66%), Gaps = 35/762 (4%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            +PHL  LPEFS+DN+Q+ HK+NEF                     RNSELQASRN+CAKT
Sbjct: 338  SPHLSPLPEFSIDNVQQCHKDNEFLTERLLGMEEETKMLKEALAKRNSELQASRNICAKT 397

Query: 186  ANKLQTLESQLQVSDQHKHPLK-----------SQHASS-PSVTSMSEDGHDDVGSCAES 329
            A+KLQ LE+QLQ+++Q K P K           SQ+AS+ PS+TSMSEDG+DD  SCAES
Sbjct: 398  ASKLQNLEAQLQMNNQQKSPPKSNLQIPNDGSLSQNASNPPSMTSMSEDGNDDAVSCAES 457

Query: 330  WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509
            WAT  +S  + F             A +LELMDDFLEMEKLAC SN SNGA S+    N+
Sbjct: 458  WATGLVSGLSQF---------KKENANHLELMDDFLEMEKLACLSNNSNGAFSV----NN 504

Query: 510  KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689
            K SE+                                             D LPLTKL+S
Sbjct: 505  KRSEA---------------------------------------------DLLPLTKLRS 519

Query: 690  RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACPED 869
            RIS+VFES+S++SD  KIL +IKRVLQD H+ LHQHS S   EEIHCSD+TCDRQACPED
Sbjct: 520  RISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQHSVSCVVEEIHCSDATCDRQACPED 579

Query: 870  AGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGNGL 1049
            AGVTAE+EISLSQD  P  +T+ IISQEL+AAIS IH+FVLFLGKEA+A Q  SP GNG 
Sbjct: 580  AGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGASPDGNGW 639

Query: 1050 NEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCIDK 1229
            + KIE+FSAT +KVL   +S+ DF+  LS+V++KA+EL+FNILGYK    EINSSDCIDK
Sbjct: 640  SRKIEDFSATVNKVLCRKMSVIDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDCIDK 699

Query: 1230 VALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEYEQ 1409
            VALPENKVVQ+D+SG RYPNGC+HISDSTSDPEVPHD NLVP F+ N +SC CSLEE+EQ
Sbjct: 700  VALPENKVVQKDTSGERYPNGCAHISDSTSDPEVPHDGNLVPGFKSNAASCNCSLEEFEQ 759

Query: 1410 LKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMAES 1589
            LKS KD L + L RCTENLESTKSQLQETEQ LAE KSQLTSAQ+ NSL++TQLKCMAES
Sbjct: 760  LKSEKDTLEMHLARCTENLESTKSQLQETEQLLAEAKSQLTSAQKLNSLADTQLKCMAES 819

Query: 1590 YRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCSA-- 1763
            YRSLETRAEELE++VNLL+ +             SH++ L RC DLQE+L+RNE CS   
Sbjct: 820  YRSLETRAEELETEVNLLRGKTETLESELQEEKRSHENALIRCKDLQEQLERNEGCSVCA 879

Query: 1764 -SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940
             S AADI                   CQETIFLLG+Q NAMRP T+L+GSP SE +Q+ E
Sbjct: 880  MSSAADIDVKTKQERELASAADKLAECQETIFLLGKQLNAMRPQTDLLGSPQSERSQRVE 939

Query: 1941 GYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNILLKSP 2063
             + E+EP          D+ + +S  +           L LYN+P +PS+TESN+LL+SP
Sbjct: 940  VFHEDEPTTSGMNLQDIDQVDTESTASINVHRIGGESPLELYNTPRSPSETESNLLLRSP 999

Query: 2064 LSTKHSKHRXXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186
            + +KH KHR             EKQ RGFSRFFS+KGK +GH
Sbjct: 1000 VGSKHPKHRPTKSNSSSSAPTPEKQSRGFSRFFSSKGK-NGH 1040


>emb|CBI19835.3| unnamed protein product [Vitis vinifera]
          Length = 993

 Score =  699 bits (1805), Expect = 0.0
 Identities = 404/743 (54%), Positives = 483/743 (65%), Gaps = 16/743 (2%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            +PHL  LPEFS+DN+Q+ HK+NEF                     RNSELQASRN+CAKT
Sbjct: 338  SPHLSPLPEFSIDNVQQCHKDNEFLTERLLGMEEETKMLKEALAKRNSELQASRNICAKT 397

Query: 186  ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329
            A+KLQ LE+QLQ+++Q K P KS           Q+AS+P S+TSMSEDG+DD  SCAES
Sbjct: 398  ASKLQNLEAQLQMNNQQKSPPKSNLQIPNDGSLSQNASNPPSMTSMSEDGNDDAVSCAES 457

Query: 330  WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509
            WAT  +S  + F             A +LELMDDFLEMEKLAC SN SNGA      S H
Sbjct: 458  WATGLVSGLSQFKKEN---------ANHLELMDDFLEMEKLACLSNNSNGA-----FSKH 503

Query: 510  KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689
                                      D++ +ANQ                       L+S
Sbjct: 504  --------------------------DLDSLANQ-----------------------LRS 514

Query: 690  RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACPED 869
            RIS+VFES+S++SD  KIL +IKRVLQD H+ LHQHS                  ACPED
Sbjct: 515  RISMVFESVSEDSDTGKILEEIKRVLQDTHDTLHQHS------------------ACPED 556

Query: 870  AGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGNGL 1049
            AGVTAE+EISLSQD  P  +T+ IISQEL+AAIS IH+FVLFLGKEA+A Q  SP GNG 
Sbjct: 557  AGVTAEREISLSQDCKPGTDTLHIISQELAAAISQIHEFVLFLGKEAMAIQGASPDGNGW 616

Query: 1050 NEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCIDK 1229
            + KIE+FSAT +KVL   +S+ DF+  LS+V++KA+EL+FNILGYK    EINSSDCIDK
Sbjct: 617  SRKIEDFSATVNKVLCRKMSVIDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDCIDK 676

Query: 1230 VALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEYEQ 1409
            VALPENKVVQ+D+SG RYPNGC+HISDSTSDPEVPHD NLVP F+ N +SC CSLEE+EQ
Sbjct: 677  VALPENKVVQKDTSGERYPNGCAHISDSTSDPEVPHDGNLVPGFKSNAASCNCSLEEFEQ 736

Query: 1410 LKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMAES 1589
            LKS KD L + L RCTENLESTKSQLQETEQ LAE KSQLTSAQ+ NSL++TQLKCMAES
Sbjct: 737  LKSEKDTLEMHLARCTENLESTKSQLQETEQLLAEAKSQLTSAQKLNSLADTQLKCMAES 796

Query: 1590 YRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCSA-- 1763
            YRSLETRAEELE++VNLL+ +             SH++ L RC DLQE+L+RNE CS   
Sbjct: 797  YRSLETRAEELETEVNLLRGKTETLESELQEEKRSHENALIRCKDLQEQLERNEGCSVCA 856

Query: 1764 -SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940
             S AADI                   CQETIFLLG+Q NAMRP T+L+GSP SE +Q+ E
Sbjct: 857  MSSAADIDVKTKQERELASAADKLAECQETIFLLGKQLNAMRPQTDLLGSPQSERSQRVE 916

Query: 1941 GYSENEPKDDEAEMDSAGALVLYNSPFTPSDTESNILLKSPLSTKHSKHRXXXXXXXXXX 2120
             + E+EP            L LYN+P +PS+TESN+LL+SP+ +KH KHR          
Sbjct: 917  VFHEDEP-----TTSGESPLELYNTPRSPSETESNLLLRSPVGSKHPKHRPTKSNSSSSA 971

Query: 2121 XXXEKQ-RGFSRFFSTKGKVSGH 2186
               EKQ RGFSRFFS+KGK +GH
Sbjct: 972  PTPEKQSRGFSRFFSSKGK-NGH 993


>ref|XP_007225499.1| hypothetical protein PRUPE_ppa000819mg [Prunus persica]
            gi|462422435|gb|EMJ26698.1| hypothetical protein
            PRUPE_ppa000819mg [Prunus persica]
          Length = 993

 Score =  689 bits (1777), Expect = 0.0
 Identities = 388/757 (51%), Positives = 495/757 (65%), Gaps = 33/757 (4%)
 Frame = +3

Query: 3    ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182
            ++PH+  + EFSLDN+QKFHKENEF                     RNSELQ SR +CA+
Sbjct: 259  SSPHMSPVTEFSLDNVQKFHKENEFLTERLLAMEEETKMLKEALTKRNSELQTSRGMCAQ 318

Query: 183  TANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAE 326
            T +KLQTLE+QLQ+++Q K   KS           Q+AS+P S+TS+SEDG+DD  SCAE
Sbjct: 319  TVSKLQTLEAQLQINNQQKGSPKSVVQITTEGSSSQNASNPPSLTSLSEDGNDDDRSCAE 378

Query: 327  SWATTSISKFTHFXXXXXXXXXXXXXACN-LELMDDFLEMEKLACSSNESNGAPSISESS 503
            SWATT  S  +H                N L LMDDFLEMEKLAC  N+SNGA SIS   
Sbjct: 379  SWATTLGSDLSHIRKEKSNQKSNKAENQNHLNLMDDFLEMEKLACLPNDSNGAVSISSGP 438

Query: 504  NHKASESVNQDALVETTMDKDLHPEQQCDMNPM-ANQVSSNVELSSHKSDPDTDQLPLTK 680
            N+K SE  N DA  + T +KD+  EQQ D++P+  +Q SSNV+LS    + D +QLPL K
Sbjct: 439  NNKTSERENHDASGDVTAEKDIQSEQQQDLSPLEGDQASSNVKLSGLSPESDENQLPLVK 498

Query: 681  LQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQAC 860
            L+S+IS++ E LSK++D  K++ DIK V+Q+A + LH H+ +   EE+H SD+ CDRQA 
Sbjct: 499  LRSKISMLLELLSKDTDFGKVIEDIKHVVQEAQDTLHPHTVNCISEEVHSSDAICDRQAN 558

Query: 861  PEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGG 1040
            PED+ +T EKEI+LSQ   P + T+ ++S++L++AIS I+DFVLFLGKE +   D  P G
Sbjct: 559  PEDSRLTTEKEITLSQ---PARGTMELMSEDLASAISLINDFVLFLGKEVMGVHDTFPDG 615

Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220
            N L+ KIEEFS  F+K +H N+SL DFVL LSHV++   EL FN+LGYK  + E NS DC
Sbjct: 616  NELSHKIEEFSGAFNKAIHGNLSLADFVLGLSHVLANVGELKFNVLGYKGVETETNSPDC 675

Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400
            IDKVALPENKVV++DSS  RY N C HIS+  S+PEVP D NLV  +E N + CK SLEE
Sbjct: 676  IDKVALPENKVVEKDSSE-RYQNVCVHISNH-SNPEVPDDGNLVSGYESNAAPCKISLEE 733

Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580
            +EQ+KS KDNLA+DL RC E LE TKSQLQETEQ LAE KSQ  SAQ SNSL+ETQL+CM
Sbjct: 734  FEQIKSQKDNLAMDLERCNETLEMTKSQLQETEQLLAEAKSQFASAQNSNSLAETQLRCM 793

Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760
            AESYRSLE RAEELE+++ LLQ R             +HQD LARC +LQE+L+R  + +
Sbjct: 794  AESYRSLEARAEELEAELKLLQVRTETLESELQEEKRNHQDALARCTELQEQLKRELADA 853

Query: 1761 ASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940
            A   A+                    CQETIFLLG+Q  ++ P TE +GSP+SE +QKGE
Sbjct: 854  AEKLAE--------------------CQETIFLLGKQLKSLHPQTEHMGSPFSERSQKGE 893

Query: 1941 GYSENEP-----KDDEAEMD-----------SAGALVLYNSPFTPSDTESNILLKSPLST 2072
            GY+E+ P       D+AEM+           S   + LYN+P +PSDTE+N LLKSP+++
Sbjct: 894  GYTEDVPTTTVRDSDQAEMEGTAFANVNRVGSESPVNLYNTPCSPSDTEANTLLKSPVNS 953

Query: 2073 KHSKHR---XXXXXXXXXXXXXEKQRGFSRFFSTKGK 2174
            K+ KHR                + QRGFSRFFS+K K
Sbjct: 954  KYPKHRPTKSTSSSASSTPTPEKHQRGFSRFFSSKAK 990


>ref|XP_007017758.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508723086|gb|EOY14983.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 947

 Score =  687 bits (1773), Expect = 0.0
 Identities = 398/766 (51%), Positives = 509/766 (66%), Gaps = 38/766 (4%)
 Frame = +3

Query: 3    ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182
            +TPHL T  +FSLDN QK  KENEF                     RNSEL ASRNLCAK
Sbjct: 186  STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 245

Query: 183  TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326
            T++KLQTLE+QL +S Q + P K           SQ+ S+P SVTS+SEDG+DD  SCAE
Sbjct: 246  TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 305

Query: 327  SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497
            SWAT  +S+ + F              A +L+LMDDFLEMEKLACSSN+S  NG  +IS+
Sbjct: 306  SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 365

Query: 498  SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677
            S+N+K SESVN DA  E +  K+L  E+Q  ++P  NQVSSN++LS    + D DQLP+ 
Sbjct: 366  STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 424

Query: 678  KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857
            KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS +   EE+H SD TC  QA
Sbjct: 425  KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 484

Query: 858  CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037
                  +TAEKEI++S       E V  +SQEL+AAIS IHDFVL LGKEA A  D    
Sbjct: 485  HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 544

Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217
            GN L+ KIEEFS T++KVL +N+SL DF+  LS +++KA++L  N+LGYK  + EINS D
Sbjct: 545  GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 604

Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397
            CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV  +E +  S K S E
Sbjct: 605  CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 663

Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577
            E+E+LK  K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC
Sbjct: 664  EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 723

Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757
            MAESYRSLETRA+ELE++VNLL+ +             SH DTLARC +L+E+LQRNE+C
Sbjct: 724  MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 783

Query: 1758 SA-SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934
            SA + AAD                    CQETIFLLG+Q  ++RP T+++GSPY+E +QK
Sbjct: 784  SACAAAADNDLKNKQEKELAAAAEKLAECQETIFLLGKQLKSLRPQTDMMGSPYNERSQK 843

Query: 1935 GEGYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNILLK 2057
            GEG  E+EP          D+ E+D+A +           +    SP +PSDT++N LL+
Sbjct: 844  GEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDAN-LLR 902

Query: 2058 SPLSTKHSKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186
            SP+++ H KH+               EKQ RGFSRFFS+KGK +GH
Sbjct: 903  SPINSNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGK-TGH 947


>ref|XP_007017757.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508723085|gb|EOY14982.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 1106

 Score =  687 bits (1773), Expect = 0.0
 Identities = 398/766 (51%), Positives = 509/766 (66%), Gaps = 38/766 (4%)
 Frame = +3

Query: 3    ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182
            +TPHL T  +FSLDN QK  KENEF                     RNSEL ASRNLCAK
Sbjct: 345  STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 404

Query: 183  TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326
            T++KLQTLE+QL +S Q + P K           SQ+ S+P SVTS+SEDG+DD  SCAE
Sbjct: 405  TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 464

Query: 327  SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497
            SWAT  +S+ + F              A +L+LMDDFLEMEKLACSSN+S  NG  +IS+
Sbjct: 465  SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 524

Query: 498  SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677
            S+N+K SESVN DA  E +  K+L  E+Q  ++P  NQVSSN++LS    + D DQLP+ 
Sbjct: 525  STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 583

Query: 678  KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857
            KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS +   EE+H SD TC  QA
Sbjct: 584  KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 643

Query: 858  CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037
                  +TAEKEI++S       E V  +SQEL+AAIS IHDFVL LGKEA A  D    
Sbjct: 644  HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 703

Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217
            GN L+ KIEEFS T++KVL +N+SL DF+  LS +++KA++L  N+LGYK  + EINS D
Sbjct: 704  GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 763

Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397
            CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV  +E +  S K S E
Sbjct: 764  CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 822

Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577
            E+E+LK  K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC
Sbjct: 823  EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 882

Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757
            MAESYRSLETRA+ELE++VNLL+ +             SH DTLARC +L+E+LQRNE+C
Sbjct: 883  MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 942

Query: 1758 SA-SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934
            SA + AAD                    CQETIFLLG+Q  ++RP T+++GSPY+E +QK
Sbjct: 943  SACAAAADNDLKNKQEKELAAAAEKLAECQETIFLLGKQLKSLRPQTDMMGSPYNERSQK 1002

Query: 1935 GEGYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNILLK 2057
            GEG  E+EP          D+ E+D+A +           +    SP +PSDT++N LL+
Sbjct: 1003 GEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDAN-LLR 1061

Query: 2058 SPLSTKHSKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186
            SP+++ H KH+               EKQ RGFSRFFS+KGK +GH
Sbjct: 1062 SPINSNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGK-TGH 1106


>ref|XP_007017755.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508723083|gb|EOY14980.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1102

 Score =  687 bits (1773), Expect = 0.0
 Identities = 398/766 (51%), Positives = 509/766 (66%), Gaps = 38/766 (4%)
 Frame = +3

Query: 3    ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182
            +TPHL T  +FSLDN QK  KENEF                     RNSEL ASRNLCAK
Sbjct: 341  STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 400

Query: 183  TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326
            T++KLQTLE+QL +S Q + P K           SQ+ S+P SVTS+SEDG+DD  SCAE
Sbjct: 401  TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 460

Query: 327  SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497
            SWAT  +S+ + F              A +L+LMDDFLEMEKLACSSN+S  NG  +IS+
Sbjct: 461  SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 520

Query: 498  SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677
            S+N+K SESVN DA  E +  K+L  E+Q  ++P  NQVSSN++LS    + D DQLP+ 
Sbjct: 521  STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 579

Query: 678  KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857
            KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS +   EE+H SD TC  QA
Sbjct: 580  KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 639

Query: 858  CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037
                  +TAEKEI++S       E V  +SQEL+AAIS IHDFVL LGKEA A  D    
Sbjct: 640  HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 699

Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217
            GN L+ KIEEFS T++KVL +N+SL DF+  LS +++KA++L  N+LGYK  + EINS D
Sbjct: 700  GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 759

Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397
            CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV  +E +  S K S E
Sbjct: 760  CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 818

Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577
            E+E+LK  K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC
Sbjct: 819  EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 878

Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757
            MAESYRSLETRA+ELE++VNLL+ +             SH DTLARC +L+E+LQRNE+C
Sbjct: 879  MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 938

Query: 1758 SA-SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934
            SA + AAD                    CQETIFLLG+Q  ++RP T+++GSPY+E +QK
Sbjct: 939  SACAAAADNDLKNKQEKELAAAAEKLAECQETIFLLGKQLKSLRPQTDMMGSPYNERSQK 998

Query: 1935 GEGYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNILLK 2057
            GEG  E+EP          D+ E+D+A +           +    SP +PSDT++N LL+
Sbjct: 999  GEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDAN-LLR 1057

Query: 2058 SPLSTKHSKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186
            SP+++ H KH+               EKQ RGFSRFFS+KGK +GH
Sbjct: 1058 SPINSNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGK-TGH 1102


>gb|EXC00965.1| hypothetical protein L484_016031 [Morus notabilis]
          Length = 1087

 Score =  680 bits (1754), Expect = 0.0
 Identities = 389/766 (50%), Positives = 509/766 (66%), Gaps = 38/766 (4%)
 Frame = +3

Query: 3    ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182
            ++PHL    EF+ DN+QK+ KENEF                     RNSELQ SR++CAK
Sbjct: 339  SSPHLSPATEFTPDNVQKYQKENEFLTERLLAVEEETKMLKEALAKRNSELQVSRSMCAK 398

Query: 183  TANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAE 326
            T++KLQ+LE+Q+Q ++QHK   KS           Q+AS+P S+TSMSEDG+DD  SCAE
Sbjct: 399  TSSKLQSLEAQIQSNNQHKTTPKSIVQISAEGSFSQNASNPPSLTSMSEDGNDDDRSCAE 458

Query: 327  SWATTSISKFTHFXXXXXXXXXXXXXACN-LELMDDFLEMEKLACSSNESNGAPSISESS 503
            SW TT IS+ +                 N L LMDDFLEMEKLAC SNESNGA S+S+S 
Sbjct: 459  SWTTTLISEVSQVKKEKSNEKTNRAEKPNHLNLMDDFLEMEKLACLSNESNGAISVSDSM 518

Query: 504  NHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQ-VSSNVELSSHKSDPDTDQLPLTK 680
            + K SE+VN DA  E  M K    E+QCD N +ANQ ++SN +    +   +++QLPL K
Sbjct: 519  SSKISETVNHDAS-EVVMRK----EEQCDSNSLANQQLTSNGKSPELRPGSNSEQLPLMK 573

Query: 681  LQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCD-RQA 857
            LQSRIS++ ES+SK+SD+  IL DIK  +Q+ H+ LHQH+ S   E++HCSD+ CD RQA
Sbjct: 574  LQSRISVLLESVSKDSDVGTILEDIKHAIQETHDTLHQHTVSCISEDVHCSDAGCDDRQA 633

Query: 858  CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037
             PEDAG+T+EKEI+LSQ   P +E   II  +L+AAIS IHDFVLFLGKEA+   D S  
Sbjct: 634  NPEDAGLTSEKEIALSQ---PAREARQIIRDDLAAAISQIHDFVLFLGKEAMGVHDTSTE 690

Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217
            G+  +++IEEFS T +KV+H+++SL DFVL LS V++KA+EL F++LG+K  +AE NS D
Sbjct: 691  GSEFSQRIEEFSVTLNKVIHSDLSLIDFVLDLSSVLAKASELRFSVLGFKGNEAETNSPD 750

Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397
            CIDKV LPENK +Q+DSS   Y NGC+H+ +STS+PEVP D N+V S+E N  SCK SLE
Sbjct: 751  CIDKVVLPENKAIQKDSSE-IYQNGCAHMPNSTSNPEVPDDGNIVSSYESNAKSCKISLE 809

Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577
            EY+QLKS KDNLALD  RCTENLE TKSQLQETEQ LAE KSQL+S Q+SNSLSETQLKC
Sbjct: 810  EYDQLKSEKDNLALDFARCTENLEMTKSQLQETEQLLAEAKSQLSSVQKSNSLSETQLKC 869

Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNE-S 1754
            MAESYRSLETRA++LE+++NLL+ +             +HQD L RC +LQE+LQRNE +
Sbjct: 870  MAESYRSLETRAQDLETELNLLRTKTESIEAELQEEKRNHQDALTRCKELQEQLQRNENN 929

Query: 1755 CSASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934
            C   +  +                    CQETIFLLG++   +RP +E++GSPYSE +Q 
Sbjct: 930  CENEIKPN------QEKEFAAAAEKLAECQETIFLLGKKLKNLRPQSEIMGSPYSERSQN 983

Query: 1935 GEGYSENE--------PKDDEAEMDSA-----------GALVLYNSPFTPSDTESNILLK 2057
            GEG +E+E        P+ D+AE++S              + +Y++P +PSD E +I LK
Sbjct: 984  GEGLNEDEPTTSGMNLPESDQAELESVTSANLNRVGAESPIDVYSAPLSPSDAEPSI-LK 1042

Query: 2058 SPLSTKHSKH---RXXXXXXXXXXXXXEKQRGFSRFFSTKGKVSGH 2186
            SP+++K+ +H   +             +  RGFSRFFS+KGK +GH
Sbjct: 1043 SPINSKNPRHKSPKSGSLSSSSAPTPEKHSRGFSRFFSSKGK-NGH 1087


>ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citrus clementina]
            gi|567885183|ref|XP_006435150.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
            gi|557537271|gb|ESR48389.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
            gi|557537272|gb|ESR48390.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
          Length = 1091

 Score =  667 bits (1721), Expect = 0.0
 Identities = 383/764 (50%), Positives = 494/764 (64%), Gaps = 37/764 (4%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            +PHL  + EFSLDN+QKF KENEF                     RNSELQASRNLCAKT
Sbjct: 341  SPHLSPVSEFSLDNVQKFQKENEFLTERLLAMEEETKMLKEALAKRNSELQASRNLCAKT 400

Query: 186  ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329
            A+KLQ+LE+Q+Q S Q K P KS           Q+AS+P S+TSMSED +DD  SCA+S
Sbjct: 401  ASKLQSLEAQMQTSTQQKSPTKSVVQIAAEGYTSQNASNPPSLTSMSEDDNDDKVSCADS 460

Query: 330  WATTSISKFTHFXXXXXXXXXXXXXAC-NLELMDDFLEMEKLACSSNE--SNGAPSISES 500
            WAT  IS+ +                  +LELMDDFLEMEKLAC SN+  SNG  + S  
Sbjct: 461  WATALISELSQIKKEKNVEKSNKAETPKHLELMDDFLEMEKLACLSNDTNSNGTITASNG 520

Query: 501  SNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTK 680
             N+K S+ +N DA    T  +DL  EQQ DMNP  +++SSN E S+   + D  Q  L K
Sbjct: 521  PNNKTSDILNHDASGAVTSGEDLLSEQQRDMNPSVDKLSSNTESSTVNPEADAGQPQLMK 580

Query: 681  LQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQAC 860
            L+SRIS++ E++SK++DM KI+ DIKRV++D H  LHQHS +   EE+ CSD +C  +A 
Sbjct: 581  LRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLHQHSANCISEEVKCSDVSCSAEAY 640

Query: 861  PEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGG 1040
            P DA +  E++I L         TV +ISQEL AAIS IHDFVLFLGKEA A  D +   
Sbjct: 641  PGDASLNTERKIDL---------TVQVISQELVAAISQIHDFVLFLGKEARAVHDTT-NE 690

Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220
            NG ++KIEEF  +F+KV+ +N  L DFV +LS+V++KA+EL  N++GYK T+ E NS DC
Sbjct: 691  NGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRINVMGYKDTEIEPNSPDC 750

Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400
            IDKVALPENKV+++D+SG RYPNGC+HIS+ TSDPEVP D ++V ++E  T++CK +LEE
Sbjct: 751  IDKVALPENKVIKKDTSGERYPNGCAHISNPTSDPEVPDDGSIVAAYESETTACKFTLEE 810

Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580
            +E+LK  KDNLA DL RCTENLE TKSQL ETEQ LAEVK+QL SAQ+SNSL+ETQLKCM
Sbjct: 811  FEELKLEKDNLATDLARCTENLEMTKSQLYETEQLLAEVKAQLASAQKSNSLAETQLKCM 870

Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760
            AESYRSLET A+ELE++VNLL+A+             SH + +A+C +L+E+LQRNE+C+
Sbjct: 871  AESYRSLETHAQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELEEQLQRNENCA 930

Query: 1761 ASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940
               +                      CQETI LLG+Q  ++RP +E++GSPYSE +QKGE
Sbjct: 931  VCSSEADENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQSEVIGSPYSERSQKGE 990

Query: 1941 GYSENEP------KDDEAEMDSA-------------GALVLYNSPFTPSDTESNILLKSP 2063
             +   EP      + D AEMDS                L LY SP +PS+ E++I  KSP
Sbjct: 991  -FLPGEPATASLQEFDHAEMDSVTSANAQPHRVGAESPLDLYTSPCSPSENEASI-NKSP 1048

Query: 2064 LSTKHSKHR---XXXXXXXXXXXXXEKQRGFSRFFSTKGKVSGH 2186
            +++KH KHR                +  RGFSRFFS+KG+ +GH
Sbjct: 1049 INSKHPKHRPTKSTSSSSTSAPTPEKSSRGFSRFFSSKGR-NGH 1091


>ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-like isoform X1 [Citrus
            sinensis] gi|568839322|ref|XP_006473633.1| PREDICTED:
            filament-like plant protein 4-like isoform X2 [Citrus
            sinensis]
          Length = 1091

 Score =  663 bits (1710), Expect = 0.0
 Identities = 382/764 (50%), Positives = 492/764 (64%), Gaps = 37/764 (4%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            +PHL  + EFSLDN+QKF KENEF                     RNSELQASRNLCAKT
Sbjct: 341  SPHLSPVSEFSLDNVQKFQKENEFLTERLLAMEEETKMLKEALAKRNSELQASRNLCAKT 400

Query: 186  ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329
            A+KLQ+LE+Q+Q S Q K P KS           Q+AS+P S+TSMSED +DD  SCA+S
Sbjct: 401  ASKLQSLEAQMQTSTQQKSPTKSVVQIAAEGYTSQNASNPPSLTSMSEDDNDDKVSCADS 460

Query: 330  WATTSISKFTHFXXXXXXXXXXXXXAC-NLELMDDFLEMEKLACSSNE--SNGAPSISES 500
            WAT  IS+ +                  +LELMDDFLEMEKLAC SN+  SNG  + S  
Sbjct: 461  WATALISELSQIKKEKNVEKSNKAETPKHLELMDDFLEMEKLACLSNDTNSNGTITASNG 520

Query: 501  SNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTK 680
             N+K S+ VN DA    T  +DL  EQQ DMNP  +++SSN E S+   + D  Q  L K
Sbjct: 521  PNNKTSDIVNHDASGAVTSGEDLLSEQQRDMNPSVDKLSSNTESSTVNPEADAGQPQLMK 580

Query: 681  LQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQAC 860
            L+SRIS++ E++SK++DM KI+ DIKRV++D H  LHQHS +   EE+ CSD +C  +A 
Sbjct: 581  LRSRISMLLETISKDADMGKIVEDIKRVVEDEHVTLHQHSANCISEEVKCSDVSCSAEAY 640

Query: 861  PEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGG 1040
            P DA +  E++I L         TV +ISQEL AAI+ IHDFVLFLGKEA A  D +   
Sbjct: 641  PGDARLNTERKIDL---------TVQVISQELVAAITQIHDFVLFLGKEARAVHDTT-NE 690

Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220
            NG ++KIEEF  +F+KV+ +N  L DFV +LS+V++KA+EL  N++GYK T+ E NS DC
Sbjct: 691  NGFSQKIEEFYVSFNKVIDSNTYLVDFVFALSNVLAKASELRINVMGYKDTEIEPNSPDC 750

Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400
            IDKVALPENKV+++D+SG RYPNGC+HIS+ TSDPEVP D ++V ++E  T++CK SLEE
Sbjct: 751  IDKVALPENKVIKKDTSGERYPNGCAHISNPTSDPEVPDDGSIVAAYESETTACKFSLEE 810

Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580
            +E+LK  KDNLA DL RCTENLE TKSQL ETEQ LAEVK+QL SAQ+SNSL+ETQLKCM
Sbjct: 811  FEELKLEKDNLATDLARCTENLEMTKSQLYETEQLLAEVKAQLASAQKSNSLAETQLKCM 870

Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760
            AESYRSLET A+ELE++VNLL+A+             SH + +A+C +L+E+LQRNE+C+
Sbjct: 871  AESYRSLETHAQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELEEQLQRNENCA 930

Query: 1761 ASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940
               +                      CQETI LLG+Q  ++RP +E++GSPYSE + KGE
Sbjct: 931  VCSSEADENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQSEVIGSPYSERSPKGE 990

Query: 1941 GYSENEP------KDDEAEMDSA-------------GALVLYNSPFTPSDTESNILLKSP 2063
             +   EP      + D AE DS                L LY SP +PS+ E++I  KSP
Sbjct: 991  -FLPGEPATASLQEFDHAETDSVTSANAQPHRVGAESPLDLYTSPCSPSENEASI-NKSP 1048

Query: 2064 LSTKHSKHR---XXXXXXXXXXXXXEKQRGFSRFFSTKGKVSGH 2186
            +++KH KHR                +  RGFSRFFS+KG+ +GH
Sbjct: 1049 INSKHPKHRPTKSTSSSSTSAPTPEKSSRGFSRFFSSKGR-NGH 1091


>ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa]
            gi|550344134|gb|EEE81259.2| hypothetical protein
            POPTR_0002s02600g [Populus trichocarpa]
          Length = 1063

 Score =  658 bits (1697), Expect = 0.0
 Identities = 385/759 (50%), Positives = 482/759 (63%), Gaps = 36/759 (4%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            +PHL ++PEFSLDN+QKF+KENEF                     RNSELQASRNLCAKT
Sbjct: 332  SPHLSSVPEFSLDNVQKFNKENEFLTERLFAVEEETKMLKEALAKRNSELQASRNLCAKT 391

Query: 186  ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329
            A+KLQ+LE+Q Q+++  K   KS           Q+ S+P S+TS+SEDG+DD  SCA+S
Sbjct: 392  ASKLQSLEAQFQINNHQKSSPKSITQVPAEGYSSQNISNPPSLTSVSEDGNDDTQSCADS 451

Query: 330  WATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNESNGAPSISESSN 506
            WATTS+S  +HF              A +LELMDDFLEMEKLAC + +S  A +IS S N
Sbjct: 452  WATTSVSDVSHFKKDNHIEKSNKAENAKHLELMDDFLEMEKLACLNADS--ATTISSSPN 509

Query: 507  HKASESVNQDALVETTMDK-DLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKL 683
            +KASE+ N DAL E ++ K D   E++ D++P+AN VS N + S+  S  D D     KL
Sbjct: 510  NKASETANTDALAEVSLQKEDALSEEKRDLDPLANHVSCNKDSSAINSGSDADLSSFGKL 569

Query: 684  QSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACP 863
            QSRIS++ ES+SKE D+ KIL +IK+V+ DA       + S   +E+H SD+TCDRQ CP
Sbjct: 570  QSRISMLLESVSKEVDVDKILEEIKQVVHDAET-----AASCGSKEVHHSDATCDRQTCP 624

Query: 864  EDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGN 1043
            EDA +  EKEI+L Q+S                    IHDFVL LGKEA+A  D S    
Sbjct: 625  EDAVIMGEKEITLLQESI-------------------IHDFVLLLGKEAMAVHDTSCDSI 665

Query: 1044 GLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCI 1223
            GL++KIEEFS TF KVL ++ SL DF+  LS V++ A+ L FN+LGYK  +AEINS DCI
Sbjct: 666  GLSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALASGLRFNVLGYKCNEAEINSPDCI 725

Query: 1224 DKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEY 1403
            DKVALPENKV+Q DS G  + NGC++IS  TS+PEVP   NLVP +  NT+SCK SLEE+
Sbjct: 726  DKVALPENKVIQNDSPGETFQNGCANISSPTSNPEVPDYGNLVPGYGSNTTSCKVSLEEF 785

Query: 1404 EQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMA 1583
            E+LKS KD +A+DL RCTENLE TKSQL ETEQ LAEVKSQL SAQ+SNSL+ETQLKCMA
Sbjct: 786  EELKSEKDTMAMDLARCTENLEMTKSQLHETEQLLAEVKSQLVSAQKSNSLAETQLKCMA 845

Query: 1584 ESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCSA 1763
            ESYRSLETRA+ELE++VNLL+ +             SHQD L RC +L+E+LQ  ES SA
Sbjct: 846  ESYRSLETRAQELETEVNLLRVKTETLESELQEEKTSHQDALTRCKELEEQLQTKESSSA 905

Query: 1764 SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGEG 1943
                 I                   CQETIFLLG+Q   +RP TE++GSPYSE +Q G+G
Sbjct: 906  D---GIDLKSKQEKEITAAAEKLAECQETIFLLGKQLKYLRPQTEIMGSPYSERSQSGDG 962

Query: 1944 YSENEP--------KDDEAEMDSAGALVL-----------YNSPFTPSDTESNILLKSPL 2066
             +++EP          D+AEMD+  ++             YN P  PSDTESN LL+SP+
Sbjct: 963  IAKDEPTISGINLQDSDQAEMDTGASVNFLKAGSESPSDSYNHPCYPSDTESN-LLRSPV 1021

Query: 2067 STKHSKHRXXXXXXXXXXXXXEKQ---RGFSRFFSTKGK 2174
              KH KHR               +   RGFSRFFS+KGK
Sbjct: 1022 GLKHPKHRPTKSTSSSSSSTPTPEKHPRGFSRFFSSKGK 1060


>ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa]
            gi|550344133|gb|ERP63976.1| hypothetical protein
            POPTR_0002s02600g [Populus trichocarpa]
          Length = 991

 Score =  658 bits (1697), Expect = 0.0
 Identities = 385/759 (50%), Positives = 482/759 (63%), Gaps = 36/759 (4%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            +PHL ++PEFSLDN+QKF+KENEF                     RNSELQASRNLCAKT
Sbjct: 260  SPHLSSVPEFSLDNVQKFNKENEFLTERLFAVEEETKMLKEALAKRNSELQASRNLCAKT 319

Query: 186  ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329
            A+KLQ+LE+Q Q+++  K   KS           Q+ S+P S+TS+SEDG+DD  SCA+S
Sbjct: 320  ASKLQSLEAQFQINNHQKSSPKSITQVPAEGYSSQNISNPPSLTSVSEDGNDDTQSCADS 379

Query: 330  WATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNESNGAPSISESSN 506
            WATTS+S  +HF              A +LELMDDFLEMEKLAC + +S  A +IS S N
Sbjct: 380  WATTSVSDVSHFKKDNHIEKSNKAENAKHLELMDDFLEMEKLACLNADS--ATTISSSPN 437

Query: 507  HKASESVNQDALVETTMDK-DLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKL 683
            +KASE+ N DAL E ++ K D   E++ D++P+AN VS N + S+  S  D D     KL
Sbjct: 438  NKASETANTDALAEVSLQKEDALSEEKRDLDPLANHVSCNKDSSAINSGSDADLSSFGKL 497

Query: 684  QSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACP 863
            QSRIS++ ES+SKE D+ KIL +IK+V+ DA       + S   +E+H SD+TCDRQ CP
Sbjct: 498  QSRISMLLESVSKEVDVDKILEEIKQVVHDAET-----AASCGSKEVHHSDATCDRQTCP 552

Query: 864  EDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGN 1043
            EDA +  EKEI+L Q+S                    IHDFVL LGKEA+A  D S    
Sbjct: 553  EDAVIMGEKEITLLQESI-------------------IHDFVLLLGKEAMAVHDTSCDSI 593

Query: 1044 GLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCI 1223
            GL++KIEEFS TF KVL ++ SL DF+  LS V++ A+ L FN+LGYK  +AEINS DCI
Sbjct: 594  GLSQKIEEFSITFKKVLCSDRSLIDFMFDLSRVLALASGLRFNVLGYKCNEAEINSPDCI 653

Query: 1224 DKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEY 1403
            DKVALPENKV+Q DS G  + NGC++IS  TS+PEVP   NLVP +  NT+SCK SLEE+
Sbjct: 654  DKVALPENKVIQNDSPGETFQNGCANISSPTSNPEVPDYGNLVPGYGSNTTSCKVSLEEF 713

Query: 1404 EQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMA 1583
            E+LKS KD +A+DL RCTENLE TKSQL ETEQ LAEVKSQL SAQ+SNSL+ETQLKCMA
Sbjct: 714  EELKSEKDTMAMDLARCTENLEMTKSQLHETEQLLAEVKSQLVSAQKSNSLAETQLKCMA 773

Query: 1584 ESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCSA 1763
            ESYRSLETRA+ELE++VNLL+ +             SHQD L RC +L+E+LQ  ES SA
Sbjct: 774  ESYRSLETRAQELETEVNLLRVKTETLESELQEEKTSHQDALTRCKELEEQLQTKESSSA 833

Query: 1764 SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGEG 1943
                 I                   CQETIFLLG+Q   +RP TE++GSPYSE +Q G+G
Sbjct: 834  D---GIDLKSKQEKEITAAAEKLAECQETIFLLGKQLKYLRPQTEIMGSPYSERSQSGDG 890

Query: 1944 YSENEP--------KDDEAEMDSAGALVL-----------YNSPFTPSDTESNILLKSPL 2066
             +++EP          D+AEMD+  ++             YN P  PSDTESN LL+SP+
Sbjct: 891  IAKDEPTISGINLQDSDQAEMDTGASVNFLKAGSESPSDSYNHPCYPSDTESN-LLRSPV 949

Query: 2067 STKHSKHRXXXXXXXXXXXXXEKQ---RGFSRFFSTKGK 2174
              KH KHR               +   RGFSRFFS+KGK
Sbjct: 950  GLKHPKHRPTKSTSSSSSSTPTPEKHPRGFSRFFSSKGK 988


>ref|XP_007017761.1| Uncharacterized protein isoform 7 [Theobroma cacao]
            gi|508723089|gb|EOY14986.1| Uncharacterized protein
            isoform 7 [Theobroma cacao]
          Length = 1107

 Score =  657 bits (1694), Expect = 0.0
 Identities = 389/768 (50%), Positives = 500/768 (65%), Gaps = 40/768 (5%)
 Frame = +3

Query: 3    ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182
            +TPHL T  +FSLDN QK  KENEF                     RNSEL ASRNLCAK
Sbjct: 345  STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 404

Query: 183  TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326
            T++KLQTLE+QL +S Q + P K           SQ+ S+P SVTS+SEDG+DD  SCAE
Sbjct: 405  TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 464

Query: 327  SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497
            SWAT  +S+ + F              A +L+LMDDFLEMEKLACSSN+S  NG  +IS+
Sbjct: 465  SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 524

Query: 498  SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677
            S+N+K SESVN DA  E +  K+L  E+Q  ++P  NQVSSN++LS    + D DQLP+ 
Sbjct: 525  STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 583

Query: 678  KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857
            KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS +   EE+H SD TC  QA
Sbjct: 584  KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 643

Query: 858  CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037
                  +TAEKEI++S       E V  +SQEL+AAIS IHDFVL LGKEA A  D    
Sbjct: 644  HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 703

Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217
            GN L+ KIEEFS T++KVL +N+SL DF+  LS +++KA++L  N+LGYK  + EINS D
Sbjct: 704  GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 763

Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397
            CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV  +E +  S K S E
Sbjct: 764  CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 822

Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577
            E+E+LK  K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC
Sbjct: 823  EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 882

Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757
            MAESYRSLETRA+ELE++VNLL+ +             SH DTLARC +L+E+LQRNE+C
Sbjct: 883  MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 942

Query: 1758 SASLAA---DIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMA 1928
            SA  AA   D+                       I+L+    N +   T+++GSPY+E +
Sbjct: 943  SACAAAADNDLKNKQVSVYFNLCILRWILP-NPLIYLILLPRNIIYSCTDMMGSPYNERS 1001

Query: 1929 QKGEGYSENEPKD--------DEAEMDSAGA-----------LVLYNSPFTPSDTESNIL 2051
            QKGEG  E+EP          D+ E+D+A +           +    SP +PSDT++N L
Sbjct: 1002 QKGEGLLEDEPTTSGMNLQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDAN-L 1060

Query: 2052 LKSPLSTKHSKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186
            L+SP+++ H KH+               EKQ RGFSRFFS+KGK +GH
Sbjct: 1061 LRSPINSNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGK-TGH 1107


>ref|XP_002306918.2| hypothetical protein POPTR_0005s25830g [Populus trichocarpa]
            gi|550339754|gb|EEE93914.2| hypothetical protein
            POPTR_0005s25830g [Populus trichocarpa]
          Length = 1077

 Score =  641 bits (1653), Expect = 0.0
 Identities = 377/757 (49%), Positives = 488/757 (64%), Gaps = 34/757 (4%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            +PH  ++ EFSLDN+QKFHKENEF                     RNSELQASRNLCAKT
Sbjct: 332  SPHSSSVTEFSLDNVQKFHKENEFLTERLFAMEEETKMLKEALAKRNSELQASRNLCAKT 391

Query: 186  ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329
            A+KLQ+LE+Q  +S+Q K   KS           Q+ S+P S+T++SEDG+DD  SCA+S
Sbjct: 392  ASKLQSLEAQFHISNQVKSSPKSIIQVPAEGYSSQNISNPPSLTNVSEDGNDDTQSCADS 451

Query: 330  WATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNESNGAPSISESSN 506
            WAT SIS+F++F              A +LE MDDFLEMEKLAC + +S  A + S S N
Sbjct: 452  WATISISEFSNFKKYNHSEKLNKAENAKHLEFMDDFLEMEKLACLNADS--AATTSNSPN 509

Query: 507  HKASESVNQDALVETTMDKD-LHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKL 683
            +K SE  N+DA  E ++ K+    E++ +++P  N +S N + S+ +S  D D     KL
Sbjct: 510  NKTSEVANRDASGEISLQKENTLSEEKHNLDPPVNHLSCNKDSSAIESGSDADLSSFMKL 569

Query: 684  QSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTC-DRQAC 860
            Q RIS++ +S SK++D+ KIL DIK+V+QDA         S   +E HCSD+T  DRQ C
Sbjct: 570  QLRISMLLDSGSKKADLGKILEDIKQVVQDAETGA-----SCVSKEAHCSDATTHDRQTC 624

Query: 861  PEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGG 1040
            PEDAG+  EKEI L Q+S    + +  +SQEL  AIS IHDFVL LGKEA+   D S   
Sbjct: 625  PEDAGIMGEKEIELFQESKTAAQIMHTVSQELLPAISQIHDFVLLLGKEAMTVHDTSCDS 684

Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220
             GL++KI+EFS TF+KVL+++ SL DFV  L+H+++ A+ L FN+LGYK  +AEI+S DC
Sbjct: 685  IGLSQKIKEFSITFNKVLYSDRSLVDFVSDLAHILALASGLRFNVLGYKGNEAEISSPDC 744

Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400
            IDK+ALPENKVVQ++SS   Y NGC++IS  TS+PEVP D NLV  +  NT+SCK SLEE
Sbjct: 745  IDKIALPENKVVQKNSSVETYQNGCANISSPTSNPEVPDDGNLVLGYGSNTTSCKVSLEE 804

Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580
            +E+LKS KDN+A+DL RCTEN E TKSQL ETEQ LAEVKSQL SAQ+SNSL+ETQLKCM
Sbjct: 805  FEELKSEKDNMAMDLARCTENFEMTKSQLHETEQLLAEVKSQLASAQKSNSLAETQLKCM 864

Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760
             ESYRSLETRA+ELE++VNLL+ +             SHQ  L RC +L+E+LQ NES  
Sbjct: 865  TESYRSLETRAQELETEVNLLRLKTETLENVLQEEKKSHQGALTRCKELEEQLQTNES-- 922

Query: 1761 ASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940
             S   DI                   CQETIFLLG+Q N++ P TE++GSPYSE +Q G+
Sbjct: 923  -STVTDI--ECKQEKEIAAAAEKLAECQETIFLLGKQLNSLCPQTEIMGSPYSERSQIGD 979

Query: 1941 GYSENEPKD--------DEAEMDSAGALVL-----------YNSPFTPSDTESNILLKSP 2063
             ++E+EP          D+AEMD+ G   +           YN P +PSDTES+ LL+SP
Sbjct: 980  VFAEDEPTTSGMNLQDFDQAEMDTGGLANIHKAGAESPINSYNHPCSPSDTESS-LLRSP 1038

Query: 2064 LSTKHSKHRXXXXXXXXXXXXXEKQRGFSRFFSTKGK 2174
            +++K  KH              +  RGFSRFFS+KGK
Sbjct: 1039 VASKPPKH-GPTKSSSSAPMLEKHSRGFSRFFSSKGK 1074


>ref|XP_006342030.1| PREDICTED: filament-like plant protein 6-like [Solanum tuberosum]
          Length = 1093

 Score =  618 bits (1594), Expect = e-174
 Identities = 359/755 (47%), Positives = 471/755 (62%), Gaps = 31/755 (4%)
 Frame = +3

Query: 3    ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182
            ++P   +LP+FS D++QKFHKENE                      RNSELQASR++CAK
Sbjct: 343  SSPQFSSLPDFSFDSVQKFHKENEQLTERLLAMEEETKMLKEALAHRNSELQASRSICAK 402

Query: 183  TANKLQTLESQLQVSDQHKHPLKSQHASSPS-------------VTSMSEDGHDDVGSCA 323
            T++KLQ+LE+QLQ + + K P KS     PS             + SMSEDG+DD  SCA
Sbjct: 403  TSSKLQSLEAQLQANVEQKSPQKSTIRRQPSEGSLSHEANHLPRLASMSEDGNDDNVSCA 462

Query: 324  ESWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNESNGAPSISES 500
             SW T  +S  TH               A +L+LMDDFLEMEKLA  S+++NGA S  + 
Sbjct: 463  SSWTTALMSDLTHVKKEKNFDSPHKSESASHLDLMDDFLEMEKLAYQSSDTNGAVSSPDI 522

Query: 501  SNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTK 680
             N+   E+   D  +  T   D   ++  + +   +Q S N E+SS    P +D     K
Sbjct: 523  PNNARPETTKVDTSMHVTTSPDSQLKEHNETSVSGDQASRNEEVSSQSHQPLSDTSISMK 582

Query: 681  LQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQAC 860
            LQSRIS V ESLSK++D+++I  D++ ++Q+  N L   S  S  E    S++  + Q  
Sbjct: 583  LQSRISTVLESLSKDADIQRIQEDLREIVQEMRNALIPQSTKSIVEITLSSNTATESQPS 642

Query: 861  PEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGG 1040
             +D     EKEI +S+DS    E++  IS+EL+ A+S IHDFVLFLGKEA A Q  +P G
Sbjct: 643  LDDGEANLEKEIPVSEDSKSCNESIHGISKELADAMSQIHDFVLFLGKEAKAIQGTAPDG 702

Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220
            +G+NEK+++FSAT+ +V+ N +S+ +FVL LSHV+S A++L FNILGYK ++ EI++SDC
Sbjct: 703  SGINEKLDDFSATYVEVISNKLSMVNFVLDLSHVLSNASQLHFNILGYKNSETEISTSDC 762

Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400
            IDKVALPENK +Q   SG  Y NGC+H SDSTSDP++PH+ +LVP+ E  ++S KCSLEE
Sbjct: 763  IDKVALPENKDLQH--SGEVYANGCAHFSDSTSDPDIPHEGSLVPTSESTSTSLKCSLEE 820

Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580
             EQLK  K+N+ALDL R +ENLESTKSQL ETEQ LAEVKSQL SAQ++NSL+ETQLKCM
Sbjct: 821  VEQLKLEKENMALDLARYSENLESTKSQLTETEQLLAEVKSQLVSAQKANSLAETQLKCM 880

Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760
            AESY SLETR EEL+++VN LQA+             +HQDTLA C DL+E+LQR ES  
Sbjct: 881  AESYNSLETRTEELQTEVNRLQAKIENLDNELQEEKKNHQDTLASCKDLEEQLQRMES-- 938

Query: 1761 ASLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGE 1940
               AAD+                   CQETIFLLG+Q N++RP TE +GSPY + + KGE
Sbjct: 939  ---AADLDAKTNQEKDLTAAAEKLAECQETIFLLGKQLNSLRPQTEFMGSPYIDRSSKGE 995

Query: 1941 GYSE-------NEPKDDEAEMDSAGALV--------LYNSPFTPSDTESNILLKSPLSTK 2075
            G+ E       N   +D AEMDSA ++         +YN  ++PSDTE N  L+SP+S K
Sbjct: 996  GFREESTTTSMNIHDNDLAEMDSASSVKATCESPVDIYNVSYSPSDTEVNNPLRSPISLK 1055

Query: 2076 HSKHR-XXXXXXXXXXXXXEKQ-RGFSRFFSTKGK 2174
              KHR              EKQ RGFSRFFS+KGK
Sbjct: 1056 SPKHRSTKSGSSSSAGPTPEKQSRGFSRFFSSKGK 1090


>ref|XP_004168855.1| PREDICTED: LOW QUALITY PROTEIN: filament-like plant protein 4-like
            [Cucumis sativus]
          Length = 1084

 Score =  616 bits (1588), Expect = e-173
 Identities = 366/759 (48%), Positives = 476/759 (62%), Gaps = 32/759 (4%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            TPH+ ++P+FSLDN  KF KEN+F                     RNSELQ SR++CAKT
Sbjct: 338  TPHMLSVPDFSLDNALKFQKENDFLTERMLAMEEETKMLKEALAKRNSELQTSRSMCAKT 397

Query: 186  ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329
            A KLQ LE+QLQ  +  +   KS           Q+ S P S+TSMSEDG++D  SCA++
Sbjct: 398  ATKLQNLEAQLQNGNHQRSSPKSVVQYTADGFSCQNTSHPPSLTSMSEDGNEDGQSCADT 457

Query: 330  WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509
             +  + S  +HF               +L LMDDFLEMEKLAC SN+SN A   S S+N+
Sbjct: 458  LSIAATSDISHFREKKNEKLSKTESGSHLGLMDDFLEMEKLACQSNDSNEAILASNSTNN 517

Query: 510  KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689
            K SE V             +  EQ  D +P    VSS+V+LS+  +D  ++ LPL KL+S
Sbjct: 518  KDSEVVVHQE------SNGIQSEQHLDSSPSTEVVSSSVDLSTECAD--SNGLPLLKLRS 569

Query: 690  RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPS--SDFEEIHCSDSTCDRQACP 863
            RIS++FES+SK++D  KIL DIK ++QDAH+ L Q + +  S   E+   D+TCDRQA P
Sbjct: 570  RISMIFESISKDADTGKILEDIKCIVQDAHDALQQPTINCVSCVSEVQSPDTTCDRQANP 629

Query: 864  EDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDP-SPGG 1040
            +DAG+  E+EI+ SQ   PV    P+ SQEL AAIS IH+FVLFLGKEA    D  SP G
Sbjct: 630  DDAGLGVEREIAFSQ---PVAHNQPM-SQELEAAISQIHEFVLFLGKEASRVHDTISPDG 685

Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220
            +GL +K+EEFS+TF+K++H N SL DFV+ LSHV+S+A+EL F+ +G K TD + NS DC
Sbjct: 686  HGLGQKVEEFSSTFNKIVHANTSLVDFVVILSHVLSEASELRFSFIGCKDTDGDTNSPDC 745

Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400
            IDKVALPE+KVVQ DS   RY NGCSHIS  TSD EVP+D NLV S+E N+   K S E+
Sbjct: 746  IDKVALPEHKVVQNDSIDERYTNGCSHISSPTSDLEVPYDGNLVSSYESNSRLPKFSSED 805

Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580
             E+LK  K+NL+ DL RCTE+LE+ K +LQETEQ LAE +SQL  AQ+SNSLSETQLKCM
Sbjct: 806  IEELKLAKENLSKDLARCTEDLEAAKRKLQETEQLLAESRSQLAFAQKSNSLSETQLKCM 865

Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760
            AESYRSLE RAE+LE+++NLL+A++            +H + L++C +LQE+LQRNE C 
Sbjct: 866  AESYRSLEARAEDLETELNLLRAKSETLENDLQDEKRNHHEALSKCQELQEQLQRNEVCC 925

Query: 1761 A--SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934
            A  S A D                    CQETIFLL +Q  ++RP  +  GSP+SE + +
Sbjct: 926  AICSSAIDGDPQKSQEIELTAAAEKLAECQETIFLLSKQLKSLRPQPDFSGSPFSERSHR 985

Query: 1935 GEGYSENEPKD--------DEAEMDSAGA----LVLYNSPFTPSDTESNILLKSPLSTKH 2078
            GE + E+EP          D +EMD+A +    +V   SP + SD E    L+SP+++KH
Sbjct: 986  GEEFIEDEPSKSGTNLLDLDRSEMDTATSTMTQIVGAESPCSASDGEGGSFLRSPINSKH 1045

Query: 2079 SKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186
             KHR               EKQ RGFSRFFS+KGK + H
Sbjct: 1046 PKHRPTKSSSSSSSSAPTPEKQTRGFSRFFSSKGKNNSH 1084


>ref|XP_004136392.1| PREDICTED: filament-like plant protein 4-like [Cucumis sativus]
          Length = 1078

 Score =  616 bits (1588), Expect = e-173
 Identities = 366/759 (48%), Positives = 476/759 (62%), Gaps = 32/759 (4%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            TPH+ ++P+FSLDN  KF KEN+F                     RNSELQ SR++CAKT
Sbjct: 332  TPHMLSVPDFSLDNALKFQKENDFLTERMLAMEEETKMLKEALAKRNSELQTSRSMCAKT 391

Query: 186  ANKLQTLESQLQVSDQHKHPLKS-----------QHASSP-SVTSMSEDGHDDVGSCAES 329
            A KLQ LE+QLQ  +  +   KS           Q+ S P S+TSMSEDG++D  SCA++
Sbjct: 392  ATKLQNLEAQLQNGNHQRSSPKSVVQYTADGFSCQNTSHPPSLTSMSEDGNEDGQSCADT 451

Query: 330  WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509
             +  + S  +HF               +L LMDDFLEMEKLAC SN+SN A   S S+N+
Sbjct: 452  LSIAATSDISHFREKKNEKLSKTESGSHLGLMDDFLEMEKLACQSNDSNEAILASNSTNN 511

Query: 510  KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689
            K SE V             +  EQ  D +P    VSS+V+LS+  +D  ++ LPL KL+S
Sbjct: 512  KDSEVVVHQE------SNGIQSEQHLDSSPSTEVVSSSVDLSTECAD--SNGLPLLKLRS 563

Query: 690  RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPS--SDFEEIHCSDSTCDRQACP 863
            RIS++FES+SK++D  KIL DIK ++QDAH+ L Q + +  S   E+   D+TCDRQA P
Sbjct: 564  RISMIFESISKDADTGKILEDIKCIVQDAHDALQQPTINCVSCVSEVQSPDTTCDRQANP 623

Query: 864  EDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDP-SPGG 1040
            +DAG+  E+EI+ SQ   PV    P+ SQEL AAIS IH+FVLFLGKEA    D  SP G
Sbjct: 624  DDAGLGVEREIAFSQ---PVAHNQPM-SQELEAAISQIHEFVLFLGKEASRVHDTISPDG 679

Query: 1041 NGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDC 1220
            +GL +K+EEFS+TF+K++H N SL DFV+ LSHV+S+A+EL F+ +G K TD + NS DC
Sbjct: 680  HGLGQKVEEFSSTFNKIVHANTSLVDFVVILSHVLSEASELRFSFIGCKDTDGDTNSPDC 739

Query: 1221 IDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEE 1400
            IDKVALPE+KVVQ DS   RY NGCSHIS  TSD EVP+D NLV S+E N+   K S E+
Sbjct: 740  IDKVALPEHKVVQNDSIDERYTNGCSHISSPTSDLEVPYDGNLVSSYESNSRLPKFSSED 799

Query: 1401 YEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCM 1580
             E+LK  K+NL+ DL RCTE+LE+ K +LQETEQ LAE +SQL  AQ+SNSLSETQLKCM
Sbjct: 800  IEELKLAKENLSKDLARCTEDLEAAKRKLQETEQLLAESRSQLAFAQKSNSLSETQLKCM 859

Query: 1581 AESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS 1760
            AESYRSLE RAE+LE+++NLL+A++            +H + L++C +LQE+LQRNE C 
Sbjct: 860  AESYRSLEARAEDLETELNLLRAKSETLENDLQDEKRNHHEALSKCQELQEQLQRNEVCC 919

Query: 1761 A--SLAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQK 1934
            A  S A D                    CQETIFLL +Q  ++RP  +  GSP+SE + +
Sbjct: 920  AICSSAIDGDPQKSQEIELTAAAEKLAECQETIFLLSKQLKSLRPQPDFSGSPFSERSHR 979

Query: 1935 GEGYSENEPKD--------DEAEMDSAGA----LVLYNSPFTPSDTESNILLKSPLSTKH 2078
            GE + E+EP          D +EMD+A +    +V   SP + SD E    L+SP+++KH
Sbjct: 980  GEEFIEDEPSKSGTNLLDLDRSEMDTATSTMTQIVGAESPCSASDGEGGSFLRSPINSKH 1039

Query: 2079 SKHR--XXXXXXXXXXXXXEKQ-RGFSRFFSTKGKVSGH 2186
             KHR               EKQ RGFSRFFS+KGK + H
Sbjct: 1040 PKHRPTKSSSSSSSSAPTPEKQTRGFSRFFSSKGKNNSH 1078


>ref|XP_002510512.1| Myosin heavy chain, striated muscle, putative [Ricinus communis]
            gi|223551213|gb|EEF52699.1| Myosin heavy chain, striated
            muscle, putative [Ricinus communis]
          Length = 1041

 Score =  610 bits (1572), Expect = e-171
 Identities = 370/758 (48%), Positives = 469/758 (61%), Gaps = 35/758 (4%)
 Frame = +3

Query: 6    TPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAKT 185
            +PHL  +PEFSLDN QKFHKENEF                     RNSELQASRNLCAKT
Sbjct: 340  SPHLSAVPEFSLDNAQKFHKENEFLTERLLAMEEETKMLKEALAKRNSELQASRNLCAKT 399

Query: 186  ANKLQTLESQLQVSDQHKH--------PLK---SQHASSP-SVTSMSEDGHDDVGSCAES 329
            A++LQ+LE+Q  VS+Q K         P++   SQ+ S+P S+TSMSEDG+DD  SCA+S
Sbjct: 400  ASRLQSLEAQ--VSNQQKSSPTSVVQVPIEGYSSQNMSNPPSLTSMSEDGNDDDRSCADS 457

Query: 330  WATTSISKFTHFXXXXXXXXXXXXXACNLELMDDFLEMEKLACSSNESNGAPSISESSNH 509
            WAT+ IS+ +                                             E S  
Sbjct: 458  WATSLISELSQLK-----------------------------------------KEKSTE 476

Query: 510  KASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLTKLQS 689
            K +++ N   L    MD  L  E+   +N   N VSS   +S+  S  + DQ  L KL+S
Sbjct: 477  KLNKTKNTQHL--ELMDDFLEMEKLACLNANVNLVSS---MSAANSGSEADQPCLVKLRS 531

Query: 690  RISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQACPED 869
            RIS++ ES+S+++DM KIL D++R++QD H  +     SS  E++  +D+TC     PE 
Sbjct: 532  RISMLLESISQDADMGKILEDVQRIVQDTHGAV-----SSVSEDVRATDATC-----PEY 581

Query: 870  AGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPGGNGL 1049
            A +T +KEI+L QD+    +TV  ++QEL+ A+S IHDFVLFLGKEA+A  D S  G+ L
Sbjct: 582  ASITGDKEITLFQDTNAATDTVRSVNQELATAVSSIHDFVLFLGKEAMAVHDTSSDGSDL 641

Query: 1050 NEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSDCIDK 1229
            ++KIE FS TF+KVL+ N SL DF+  LS V++KA+EL FN+LGYK ++AEINSSDCIDK
Sbjct: 642  SQKIEHFSVTFNKVLNGNTSLIDFIFYLSCVLAKASELRFNVLGYKGSEAEINSSDCIDK 701

Query: 1230 VALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLEEYEQ 1409
            VALPENKV+Q DSSG  Y N C+HIS  TS+PEVP D +LV  +  NT+ CK SLEE+E+
Sbjct: 702  VALPENKVLQRDSSGESYQNSCAHISSPTSNPEVPDDGSLVSGYGSNTTLCKVSLEEFEE 761

Query: 1410 LKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKCMAES 1589
            LKS K+N+ALDL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKCMAES
Sbjct: 762  LKSEKNNVALDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKCMAES 821

Query: 1590 YRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESCS-AS 1766
            YRSLE RAEELE++VNLLQA+A             H D L+R  +L+E+LQ  ESCS  S
Sbjct: 822  YRSLEARAEELETEVNLLQAKAETLENELQDEKQCHWDALSRSKELEEQLQTKESCSVCS 881

Query: 1767 LAADIXXXXXXXXXXXXXXXXXXXCQETIFLLGRQFNAMRPPTELVGSPYSEMAQKGEGY 1946
             AAD                    CQETIFLLG+Q  A+RP TEL+GS YSE ++KG+G+
Sbjct: 882  AAADAENKANQDRELAAAAEKLAECQETIFLLGKQLKALRPQTELMGSAYSERSRKGDGF 941

Query: 1947 SENEPKD--------DEAEMDS--------AGA---LVLYNSPFTPSDTESNILLKSPLS 2069
            +E+EP          D+AEMD+        AGA   + LYN P +PSDTESN L +SPL+
Sbjct: 942  AEDEPTTSGMNLQDFDQAEMDAIVSTNHHRAGAESPMDLYNQPCSPSDTESN-LSRSPLN 1000

Query: 2070 TKHSKHR---XXXXXXXXXXXXXEKQRGFSRFFSTKGK 2174
            +K  KHR                +  RGFSRFFS KGK
Sbjct: 1001 SKQPKHRSTKSTSSSSSHMATPEKHSRGFSRFFSAKGK 1038


>ref|XP_007017762.1| Uncharacterized protein isoform 8, partial [Theobroma cacao]
            gi|508723090|gb|EOY14987.1| Uncharacterized protein
            isoform 8, partial [Theobroma cacao]
          Length = 951

 Score =  604 bits (1557), Expect = e-170
 Identities = 337/606 (55%), Positives = 423/606 (69%), Gaps = 15/606 (2%)
 Frame = +3

Query: 3    ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182
            +TPHL T  +FSLDN QK  KENEF                     RNSEL ASRNLCAK
Sbjct: 341  STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 400

Query: 183  TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326
            T++KLQTLE+QL +S Q + P K           SQ+ S+P SVTS+SEDG+DD  SCAE
Sbjct: 401  TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 460

Query: 327  SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497
            SWAT  +S+ + F              A +L+LMDDFLEMEKLACSSN+S  NG  +IS+
Sbjct: 461  SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 520

Query: 498  SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677
            S+N+K SESVN DA  E +  K+L  E+Q  ++P  NQVSSN++LS    + D DQLP+ 
Sbjct: 521  STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 579

Query: 678  KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857
            KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS +   EE+H SD TC  QA
Sbjct: 580  KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 639

Query: 858  CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037
                  +TAEKEI++S       E V  +SQEL+AAIS IHDFVL LGKEA A  D    
Sbjct: 640  HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 699

Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217
            GN L+ KIEEFS T++KVL +N+SL DF+  LS +++KA++L  N+LGYK  + EINS D
Sbjct: 700  GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 759

Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397
            CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV  +E +  S K S E
Sbjct: 760  CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 818

Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577
            E+E+LK  K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC
Sbjct: 819  EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 878

Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757
            MAESYRSLETRA+ELE++VNLL+ +             SH DTLARC +L+E+LQRNE+C
Sbjct: 879  MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 938

Query: 1758 SASLAA 1775
            SA  AA
Sbjct: 939  SACAAA 944


>ref|XP_007017760.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508723088|gb|EOY14985.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 837

 Score =  604 bits (1557), Expect = e-170
 Identities = 337/606 (55%), Positives = 423/606 (69%), Gaps = 15/606 (2%)
 Frame = +3

Query: 3    ATPHLPTLPEFSLDNLQKFHKENEFXXXXXXXXXXXXXXXXXXXXXRNSELQASRNLCAK 182
            +TPHL T  +FSLDN QK  KENEF                     RNSEL ASRNLCAK
Sbjct: 186  STPHLSTATDFSLDNAQKSQKENEFLTERLLAMEEETKMLKEALAKRNSELLASRNLCAK 245

Query: 183  TANKLQTLESQLQVSDQHKHPLK-----------SQHASSP-SVTSMSEDGHDDVGSCAE 326
            T++KLQTLE+QL +S Q + P K           SQ+ S+P SVTS+SEDG+DD  SCAE
Sbjct: 246  TSSKLQTLEAQLVISSQQRSPSKAIVPIPAEVYSSQNVSNPPSVTSVSEDGNDDDRSCAE 305

Query: 327  SWATTSISKFTHFXXXXXXXXXXXXX-ACNLELMDDFLEMEKLACSSNES--NGAPSISE 497
            SWAT  +S+ + F              A +L+LMDDFLEMEKLACSSN+S  NG  +IS+
Sbjct: 306  SWATALMSELSQFKKEKNVEKPNKTENAKHLDLMDDFLEMEKLACSSNDSTANGTITISD 365

Query: 498  SSNHKASESVNQDALVETTMDKDLHPEQQCDMNPMANQVSSNVELSSHKSDPDTDQLPLT 677
            S+N+K SESVN DA  E +  K+L  E+Q  ++P  NQVSSN++LS    + D DQLP+ 
Sbjct: 366  STNNKISESVNGDASGEISC-KELQSEKQHVLSPSVNQVSSNMDLSVVYPESDADQLPVM 424

Query: 678  KLQSRISLVFESLSKESDMKKILVDIKRVLQDAHNNLHQHSPSSDFEEIHCSDSTCDRQA 857
            KL++R+S+V +S+SK++D++KIL DIKR +QDA + L +HS +   EE+H SD TC  QA
Sbjct: 425  KLRTRLSIVLQSMSKDADVQKILEDIKRAVQDARDTLCEHSVNGVSEEVHGSDGTCIGQA 484

Query: 858  CPEDAGVTAEKEISLSQDSTPVKETVPIISQELSAAISHIHDFVLFLGKEAIAFQDPSPG 1037
                  +TAEKEI++S       E V  +SQEL+AAIS IHDFVL LGKEA A  D    
Sbjct: 485  HNGVGSLTAEKEIAISPGDKVASEIVQTVSQELAAAISQIHDFVLSLGKEARAVDDICSD 544

Query: 1038 GNGLNEKIEEFSATFSKVLHNNISLDDFVLSLSHVMSKANELSFNILGYKATDAEINSSD 1217
            GN L+ KIEEFS T++KVL +N+SL DF+  LS +++KA++L  N+LGYK  + EINS D
Sbjct: 545  GNRLSHKIEEFSVTYNKVLCSNVSLTDFIFDLSTILAKASDLRVNVLGYKDNEEEINSPD 604

Query: 1218 CIDKVALPENKVVQEDSSGGRYPNGCSHISDSTSDPEVPHDENLVPSFELNTSSCKCSLE 1397
            CIDKV LPENKV+Q+DSSGGRY NGC+HIS+ TS+PEVP D NLV  +E +  S K S E
Sbjct: 605  CIDKVVLPENKVIQQDSSGGRYQNGCAHISNPTSNPEVPDDGNLVSDYE-SKQSRKFSSE 663

Query: 1398 EYEQLKSLKDNLALDLVRCTENLESTKSQLQETEQHLAEVKSQLTSAQRSNSLSETQLKC 1577
            E+E+LK  K+N+A+DL RCTENLE TKSQL ETEQ LAE KSQL SAQ+SNSL+ETQLKC
Sbjct: 664  EFEELKLEKENMAMDLARCTENLEMTKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKC 723

Query: 1578 MAESYRSLETRAEELESKVNLLQARAXXXXXXXXXXXXSHQDTLARCNDLQEELQRNESC 1757
            MAESYRSLETRA+ELE++VNLL+ +             SH DTLARC +L+E+LQRNE+C
Sbjct: 724  MAESYRSLETRADELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENC 783

Query: 1758 SASLAA 1775
            SA  AA
Sbjct: 784  SACAAA 789

