BLASTX nr result

ID: Cinnamomum23_contig00027891 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00027891
         (1594 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010271408.1| PREDICTED: filament-like plant protein 4 [Ne...   415   e-113
ref|XP_010246408.1| PREDICTED: filament-like plant protein 4 [Ne...   402   e-109
ref|XP_010664792.1| PREDICTED: filament-like plant protein 4 iso...   374   e-100
ref|XP_010664790.1| PREDICTED: filament-like plant protein 4 iso...   374   e-100
emb|CAN60525.1| hypothetical protein VITISV_000522 [Vitis vinifera]   372   e-100
ref|XP_007017758.1| Uncharacterized protein isoform 4 [Theobroma...   359   3e-96
ref|XP_007017757.1| Uncharacterized protein isoform 3 [Theobroma...   359   3e-96
ref|XP_007017755.1| Uncharacterized protein isoform 1 [Theobroma...   359   3e-96
gb|KDO84801.1| hypothetical protein CISIN_1g0013741mg [Citrus si...   356   3e-95
gb|KDO84799.1| hypothetical protein CISIN_1g0013741mg, partial [...   356   3e-95
ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citr...   356   3e-95
ref|XP_010917980.1| PREDICTED: filament-like plant protein 4 [El...   355   7e-95
ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-lik...   351   8e-94
ref|XP_010908836.1| PREDICTED: filament-like plant protein 4 [El...   346   3e-92
ref|XP_010104432.1| hypothetical protein L484_016031 [Morus nota...   346   3e-92
ref|XP_008811426.1| PREDICTED: LOW QUALITY PROTEIN: filament-lik...   343   2e-91
ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Popu...   343   2e-91
ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Popu...   343   2e-91
ref|XP_002510512.1| Myosin heavy chain, striated muscle, putativ...   342   6e-91
ref|XP_008776485.1| PREDICTED: filament-like plant protein 4 iso...   340   1e-90

>ref|XP_010271408.1| PREDICTED: filament-like plant protein 4 [Nelumbo nucifera]
            gi|720049328|ref|XP_010271409.1| PREDICTED: filament-like
            plant protein 4 [Nelumbo nucifera]
          Length = 1082

 Score =  415 bits (1067), Expect = e-113
 Identities = 228/395 (57%), Positives = 277/395 (70%), Gaps = 1/395 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            V+F+L LSHVL++AS LS S+L  K +EGESN+SDCIDKVTLLEN VVQ D  +ER  + 
Sbjct: 697  VNFILDLSHVLAKASELSFSVLGYKGNEGESNNSDCIDKVTLLENKVVQDDTVRERLPNG 756

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C+ I  STSDPE+LQEG     F  +STS K   +E EQLK EKD + MDL  CTE+LEH
Sbjct: 757  CSDIPHSTSDPEVLQEGSFIPGFGLRSTSCKCSFEELEQLKSEKDSMRMDLQRCTENLEH 816

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +LQETEQLL ELKS+LASS+  N LADTQLKCMAESY+SLE  A+ELE E       
Sbjct: 817  TKFQLQETEQLLAELKSQLASSQKMNSLADTQLKCMAESYKSLETRAEELEAE------- 869

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
                   +N LH KAE L+ EL EEK N QDALA+CKDLE++++R E C+ C  +S+ D+
Sbjct: 870  -------VNLLHAKAETLENELQEEKMNHQDALAKCKDLEEQLKRNETCSKCSSNSAVDI 922

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            D KTKQE EIAAAAEKLA+CQETIFLLGRQLK++RP S +  G+PYNE HQ +E   E+ 
Sbjct: 923  DIKTKQEREIAAAAEKLAECQETIFLLGRQLKSMRP-SVEFAGSPYNEMHQRDEGFIEDG 981

Query: 900  PSPSRLNLPGMCSLHDLDPKDMENAAAVMHRMGGESPLDGYMPPLSPSDTEGSMITRSPI 1079
               S LN  GM S  D D  +ME + + + R+GGESP D Y    SPSDTE +M+ RSPI
Sbjct: 982  SISSGLNRRGMHSSQDFDHTEMETSVSNISRLGGESPSDAYNSIFSPSDTEANMLMRSPI 1041

Query: 1080 RSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFFS 1184
             S+ PKH  TR        A TPE+++R FSRFFS
Sbjct: 1042 SSRRPKHRPTR-SASSSSSALTPERHSRGFSRFFS 1075


>ref|XP_010246408.1| PREDICTED: filament-like plant protein 4 [Nelumbo nucifera]
            gi|720094580|ref|XP_010246409.1| PREDICTED: filament-like
            plant protein 4 [Nelumbo nucifera]
            gi|720094583|ref|XP_010246410.1| PREDICTED: filament-like
            plant protein 4 [Nelumbo nucifera]
          Length = 1096

 Score =  402 bits (1033), Expect = e-109
 Identities = 223/402 (55%), Positives = 270/402 (67%), Gaps = 2/402 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            V+FVL LSHVL++AS LS ++L  K +EGE+NSSDCIDKVTLLEN V+Q D  KER  S 
Sbjct: 710  VNFVLHLSHVLAKASELSFNVLGYKGNEGENNSSDCIDKVTLLENKVIQDDTVKERILSG 769

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  I  STSDPE+LQE      F   STS K+  +E EQLKLE D +  DL  CTE+LEH
Sbjct: 770  CTHIPHSTSDPEVLQEESFGPGFGLSSTSCKFSFEELEQLKLENDNMRRDLQRCTENLEH 829

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +LQETEQLL ELKS+LASS+  N LADTQLKCMAESY+SLE  A +LE E       
Sbjct: 830  TKFQLQETEQLLAELKSQLASSQKMNSLADTQLKCMAESYKSLETRAGDLEAE------- 882

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
                   +  L  KAE LD EL +EK N QDAL +CKDLE+++QR + C+ C  +S+ D+
Sbjct: 883  -------VIFLRAKAENLDNELQQEKRNHQDALVKCKDLEEQLQRNDNCSKCSSTSAVDI 935

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            D KTKQE EIAAAAEKLA+CQETIFLLGRQLKALRP   +  G+PYNE HQ +E   E+ 
Sbjct: 936  DLKTKQEREIAAAAEKLAECQETIFLLGRQLKALRP-PVEFAGSPYNEMHQMDEGFMEDE 994

Query: 900  PSPSRLNLPGMCSLHDLDPKDMENAAAVMHRMGGESPLDGYMPPLSPSDTEGSMITRSPI 1079
            P  S  N  GM    DLD  +M  + + M+RMGGESP + Y   L  SDTE +++ RSP+
Sbjct: 995  PRSSFSNPQGMGISQDLDQAEMGTSVSNMNRMGGESPSETYNSILGSSDTEVNLLLRSPV 1054

Query: 1080 RSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
             SK PKH+            PTPEK++R FSRFF S+ K  H
Sbjct: 1055 NSKHPKHSHNSSVSSSSSSTPTPEKHSRGFSRFFSSKQKNTH 1096


>ref|XP_010664792.1| PREDICTED: filament-like plant protein 4 isoform X2 [Vitis vinifera]
          Length = 934

 Score =  374 bits (959), Expect = e-100
 Identities = 210/403 (52%), Positives = 273/403 (67%), Gaps = 3/403 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            +DF+  LS+VL++AS L+ ++L  K    E NSSDCIDKV L EN VVQ D   ER+ + 
Sbjct: 555  IDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDCIDKVALPENKVVQKDTSGERYPNG 614

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  IS STSDPE+  +G L   F+  + S    L+EFEQLK EKD L M LA CTE+LE 
Sbjct: 615  CAHISDSTSDPEVPHDGNLVPGFKSNAASCNCSLEEFEQLKSEKDTLEMHLARCTENLES 674

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +LQETEQLL E KS+L S++  N LADTQLKCMAESYRSLE  A+ELE E       
Sbjct: 675  TKSQLQETEQLLAEAKSQLTSAQKLNSLADTQLKCMAESYRSLETRAEELETE------- 727

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
                   +N L  K E L++EL EEK + ++AL RCKDL+++++R E C+ C  SS+AD+
Sbjct: 728  -------VNLLRGKTETLESELQEEKRSHENALIRCKDLQEQLERNEGCSVCAMSSAADI 780

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            D KTKQE E+A+AA+KLA+CQETIFLLG+QL A+RP + DL+G+P +ER Q  E   E+ 
Sbjct: 781  DVKTKQERELASAADKLAECQETIFLLGKQLNAMRPQT-DLLGSPQSERSQRVEVFHEDE 839

Query: 900  PSPSRLNLPGMCSLHDLDPKDMENAAAV-MHRMGGESPLDGYMPPLSPSDTEGSMITRSP 1076
            P+ S +N      L D+D  D E+ A++ +HR+GGESPL+ Y  P SPS+TE +++ RSP
Sbjct: 840  PTTSGMN------LQDIDQVDTESTASINVHRIGGESPLELYNTPRSPSETESNLLLRSP 893

Query: 1077 IRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            + SK PKH  T+        APTPEK +R FSRFF S+GK  H
Sbjct: 894  VGSKHPKHRPTK--SNSSSSAPTPEKQSRGFSRFFSSKGKNGH 934


>ref|XP_010664790.1| PREDICTED: filament-like plant protein 4 isoform X1 [Vitis vinifera]
            gi|731429849|ref|XP_010664791.1| PREDICTED: filament-like
            plant protein 4 isoform X1 [Vitis vinifera]
          Length = 1085

 Score =  374 bits (959), Expect = e-100
 Identities = 210/403 (52%), Positives = 273/403 (67%), Gaps = 3/403 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            +DF+  LS+VL++AS L+ ++L  K    E NSSDCIDKV L EN VVQ D   ER+ + 
Sbjct: 706  IDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDCIDKVALPENKVVQKDTSGERYPNG 765

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  IS STSDPE+  +G L   F+  + S    L+EFEQLK EKD L M LA CTE+LE 
Sbjct: 766  CAHISDSTSDPEVPHDGNLVPGFKSNAASCNCSLEEFEQLKSEKDTLEMHLARCTENLES 825

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +LQETEQLL E KS+L S++  N LADTQLKCMAESYRSLE  A+ELE E       
Sbjct: 826  TKSQLQETEQLLAEAKSQLTSAQKLNSLADTQLKCMAESYRSLETRAEELETE------- 878

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
                   +N L  K E L++EL EEK + ++AL RCKDL+++++R E C+ C  SS+AD+
Sbjct: 879  -------VNLLRGKTETLESELQEEKRSHENALIRCKDLQEQLERNEGCSVCAMSSAADI 931

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            D KTKQE E+A+AA+KLA+CQETIFLLG+QL A+RP + DL+G+P +ER Q  E   E+ 
Sbjct: 932  DVKTKQERELASAADKLAECQETIFLLGKQLNAMRPQT-DLLGSPQSERSQRVEVFHEDE 990

Query: 900  PSPSRLNLPGMCSLHDLDPKDMENAAAV-MHRMGGESPLDGYMPPLSPSDTEGSMITRSP 1076
            P+ S +N      L D+D  D E+ A++ +HR+GGESPL+ Y  P SPS+TE +++ RSP
Sbjct: 991  PTTSGMN------LQDIDQVDTESTASINVHRIGGESPLELYNTPRSPSETESNLLLRSP 1044

Query: 1077 IRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            + SK PKH  T+        APTPEK +R FSRFF S+GK  H
Sbjct: 1045 VGSKHPKHRPTK--SNSSSSAPTPEKQSRGFSRFFSSKGKNGH 1085


>emb|CAN60525.1| hypothetical protein VITISV_000522 [Vitis vinifera]
          Length = 1085

 Score =  372 bits (954), Expect = e-100
 Identities = 209/403 (51%), Positives = 272/403 (67%), Gaps = 3/403 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            +DF+  LS+VL++AS L+ ++L  K    E NSSDCIDKV L EN VVQ D   ER+ + 
Sbjct: 706  IDFIFDLSNVLAKASELNFNILGYKGAGEEINSSDCIDKVALPENKVVQKDTSGERYPNG 765

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  IS STSDPE+  +G L   F+  + S    L+EFEQLK EKD L M LA CTE+LE 
Sbjct: 766  CAHISDSTSDPEVPHDGNLVPGFKSNAASCNCSLEEFEQLKSEKDTLEMHLARCTENLES 825

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +LQETEQLL E KS+L S++  N LADTQLKCMAESYRSLE  A+ELE E       
Sbjct: 826  TKSQLQETEQLLAEAKSQLTSAQKLNSLADTQLKCMAESYRSLETRAEELETE------- 878

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
                   +N L  K E L++E  EEK + ++AL RCKDL+++++R E C+ C  SS+AD+
Sbjct: 879  -------VNLLRGKTETLESEFQEEKRSHENALIRCKDLQEQLERNEGCSVCAMSSAADI 931

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            D KTKQE E+A+AA+KLA+CQETIFLLG+QL A+RP + DL+G+P +ER Q  E   E+ 
Sbjct: 932  DVKTKQERELASAADKLAECQETIFLLGKQLXAMRPQT-DLLGSPQSERSQRVEVFHEDE 990

Query: 900  PSPSRLNLPGMCSLHDLDPKDMENAAAV-MHRMGGESPLDGYMPPLSPSDTEGSMITRSP 1076
            P+ S +N      L D+D  D E+ A++ +HR+GGESPL+ Y  P SPS+TE +++ RSP
Sbjct: 991  PTTSGMN------LQDIDQVDTESTASINVHRIGGESPLELYNTPRSPSETESNLLLRSP 1044

Query: 1077 IRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            + SK PKH  T+        APTPEK +R FSRFF S+GK  H
Sbjct: 1045 VGSKHPKHRPTK--SNSSSSAPTPEKQSRGFSRFFSSKGKNGH 1085


>ref|XP_007017758.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508723086|gb|EOY14983.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 947

 Score =  359 bits (922), Expect = 3e-96
 Identities = 204/401 (50%), Positives = 267/401 (66%), Gaps = 2/401 (0%)
 Frame = +3

Query: 6    DFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSEC 185
            DF+  LS +L++AS L +++L  K +E E NS DCIDKV L EN V+Q D    R+ + C
Sbjct: 571  DFIFDLSTILAKASDLRVNVLGYKDNEEEINSPDCIDKVVLPENKVIQQDSSGGRYQNGC 630

Query: 186  NLISRSTSDPEILQEGRLDANFEQKSTSFKYLQEFEQLKLEKDKLAMDLATCTEDLEHTK 365
              IS  TS+PE+  +G L +++E K +     +EFE+LKLEK+ +AMDLA CTE+LE TK
Sbjct: 631  AHISNPTSNPEVPDDGNLVSDYESKQSRKFSSEEFEELKLEKENMAMDLARCTENLEMTK 690

Query: 366  IKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERHAQ 545
             +L ETEQLL E KS+LAS++ SN LA+TQLKCMAESYRSL              E  A 
Sbjct: 691  SQLHETEQLLAEAKSQLASAQKSNSLAETQLKCMAESYRSL--------------ETRAD 736

Query: 546  ELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADVDT 725
            ELE E+N L VK E L+ E  +EK +  D LARCK+LE+++QR E C+ C  +++AD D 
Sbjct: 737  ELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENCSAC--AAAADNDL 794

Query: 726  KTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENAPS 905
            K KQE E+AAAAEKLA+CQETIFLLG+QLK+LRP + D+MG+PYNER Q  E   E+ P+
Sbjct: 795  KNKQEKELAAAAEKLAECQETIFLLGKQLKSLRPQT-DMMGSPYNERSQKGEGLLEDEPT 853

Query: 906  PSRLNLPGMCSLHDLDPKDMENAAA-VMHRMGGESPLDGYMPPLSPSDTEGSMITRSPIR 1082
             S +N      L DLD  +++ AA+    R G ESP++  + P SPSDT+ +++ RSPI 
Sbjct: 854  TSGMN------LQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDANLL-RSPIN 906

Query: 1083 SKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            S  PKH ST          PTPEK +R FSRFF S+GKT H
Sbjct: 907  SNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGKTGH 947


>ref|XP_007017757.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508723085|gb|EOY14982.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 1106

 Score =  359 bits (922), Expect = 3e-96
 Identities = 204/401 (50%), Positives = 267/401 (66%), Gaps = 2/401 (0%)
 Frame = +3

Query: 6    DFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSEC 185
            DF+  LS +L++AS L +++L  K +E E NS DCIDKV L EN V+Q D    R+ + C
Sbjct: 730  DFIFDLSTILAKASDLRVNVLGYKDNEEEINSPDCIDKVVLPENKVIQQDSSGGRYQNGC 789

Query: 186  NLISRSTSDPEILQEGRLDANFEQKSTSFKYLQEFEQLKLEKDKLAMDLATCTEDLEHTK 365
              IS  TS+PE+  +G L +++E K +     +EFE+LKLEK+ +AMDLA CTE+LE TK
Sbjct: 790  AHISNPTSNPEVPDDGNLVSDYESKQSRKFSSEEFEELKLEKENMAMDLARCTENLEMTK 849

Query: 366  IKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERHAQ 545
             +L ETEQLL E KS+LAS++ SN LA+TQLKCMAESYRSL              E  A 
Sbjct: 850  SQLHETEQLLAEAKSQLASAQKSNSLAETQLKCMAESYRSL--------------ETRAD 895

Query: 546  ELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADVDT 725
            ELE E+N L VK E L+ E  +EK +  D LARCK+LE+++QR E C+ C  +++AD D 
Sbjct: 896  ELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENCSAC--AAAADNDL 953

Query: 726  KTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENAPS 905
            K KQE E+AAAAEKLA+CQETIFLLG+QLK+LRP + D+MG+PYNER Q  E   E+ P+
Sbjct: 954  KNKQEKELAAAAEKLAECQETIFLLGKQLKSLRPQT-DMMGSPYNERSQKGEGLLEDEPT 1012

Query: 906  PSRLNLPGMCSLHDLDPKDMENAAA-VMHRMGGESPLDGYMPPLSPSDTEGSMITRSPIR 1082
             S +N      L DLD  +++ AA+    R G ESP++  + P SPSDT+ +++ RSPI 
Sbjct: 1013 TSGMN------LQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDANLL-RSPIN 1065

Query: 1083 SKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            S  PKH ST          PTPEK +R FSRFF S+GKT H
Sbjct: 1066 SNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGKTGH 1106


>ref|XP_007017755.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508723083|gb|EOY14980.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1102

 Score =  359 bits (922), Expect = 3e-96
 Identities = 204/401 (50%), Positives = 267/401 (66%), Gaps = 2/401 (0%)
 Frame = +3

Query: 6    DFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSEC 185
            DF+  LS +L++AS L +++L  K +E E NS DCIDKV L EN V+Q D    R+ + C
Sbjct: 726  DFIFDLSTILAKASDLRVNVLGYKDNEEEINSPDCIDKVVLPENKVIQQDSSGGRYQNGC 785

Query: 186  NLISRSTSDPEILQEGRLDANFEQKSTSFKYLQEFEQLKLEKDKLAMDLATCTEDLEHTK 365
              IS  TS+PE+  +G L +++E K +     +EFE+LKLEK+ +AMDLA CTE+LE TK
Sbjct: 786  AHISNPTSNPEVPDDGNLVSDYESKQSRKFSSEEFEELKLEKENMAMDLARCTENLEMTK 845

Query: 366  IKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERHAQ 545
             +L ETEQLL E KS+LAS++ SN LA+TQLKCMAESYRSL              E  A 
Sbjct: 846  SQLHETEQLLAEAKSQLASAQKSNSLAETQLKCMAESYRSL--------------ETRAD 891

Query: 546  ELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADVDT 725
            ELE E+N L VK E L+ E  +EK +  D LARCK+LE+++QR E C+ C  +++AD D 
Sbjct: 892  ELETEVNLLRVKIETLENEHQDEKRSHHDTLARCKELEEQLQRNENCSAC--AAAADNDL 949

Query: 726  KTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENAPS 905
            K KQE E+AAAAEKLA+CQETIFLLG+QLK+LRP + D+MG+PYNER Q  E   E+ P+
Sbjct: 950  KNKQEKELAAAAEKLAECQETIFLLGKQLKSLRPQT-DMMGSPYNERSQKGEGLLEDEPT 1008

Query: 906  PSRLNLPGMCSLHDLDPKDMENAAA-VMHRMGGESPLDGYMPPLSPSDTEGSMITRSPIR 1082
             S +N      L DLD  +++ AA+    R G ESP++  + P SPSDT+ +++ RSPI 
Sbjct: 1009 TSGMN------LQDLDQTEIDTAASGNASRGGAESPMEPLISPSSPSDTDANLL-RSPIN 1061

Query: 1083 SKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            S  PKH ST          PTPEK +R FSRFF S+GKT H
Sbjct: 1062 SNHPKHKSTLSSSSSSSSTPTPEKQSRGFSRFFSSKGKTGH 1102


>gb|KDO84801.1| hypothetical protein CISIN_1g0013741mg [Citrus sinensis]
          Length = 1015

 Score =  356 bits (913), Expect = 3e-95
 Identities = 205/405 (50%), Positives = 272/405 (67%), Gaps = 5/405 (1%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            VDFV  LS+VL++AS L ++++  K  E E NS DCIDKV L EN V++ D   ER+ + 
Sbjct: 639  VDFVFALSNVLAKASELRINVMGYKDTEIEPNSPDCIDKVALPENKVIKKDTSGERYPNG 698

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  IS  TSDPE+  +G + A +E ++T+ K+ L+EFE+LKLEKD LA DLA CTE+LE 
Sbjct: 699  CAHISNPTSDPEVPDDGSIVAAYESETTACKFSLEEFEELKLEKDNLATDLARCTENLEM 758

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +L ETEQLL E+K++LAS++ SN LA+TQLKCMAESYRSLE               H
Sbjct: 759  TKSQLYETEQLLAEVKAQLASAQKSNSLAETQLKCMAESYRSLE--------------TH 804

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
            AQELE E+N L  K E+L+ EL +EK +  +A+A+CK+LE+++QR E C  C  SS AD 
Sbjct: 805  AQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELEEQLQRNENCAVC--SSEAD- 861

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            + K KQ+ ++AAAAE+LA+CQETI LLG+QLK+LRP S +++G+PY+ER Q  E+     
Sbjct: 862  ENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQS-EVIGSPYSERSQKGEF----L 916

Query: 900  PSPSRLNLPGMCSLHDLDPKDME---NAAAVMHRMGGESPLDGYMPPLSPSDTEGSMITR 1070
            P       P   SL + D  +M+   +A A  HR+G ESPLD Y  P SPS+ E S I +
Sbjct: 917  PGE-----PATASLQEFDHAEMDSVTSANAQPHRVGAESPLDLYTSPCSPSENEAS-INK 970

Query: 1071 SPIRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            SPI SK PKH  T+        APTPEK++R FSRFF S+G+  H
Sbjct: 971  SPINSKHPKHRPTKSTSSSSTSAPTPEKSSRGFSRFFSSKGRNGH 1015


>gb|KDO84799.1| hypothetical protein CISIN_1g0013741mg, partial [Citrus sinensis]
            gi|641866115|gb|KDO84800.1| hypothetical protein
            CISIN_1g0013741mg, partial [Citrus sinensis]
          Length = 1050

 Score =  356 bits (913), Expect = 3e-95
 Identities = 205/405 (50%), Positives = 272/405 (67%), Gaps = 5/405 (1%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            VDFV  LS+VL++AS L ++++  K  E E NS DCIDKV L EN V++ D   ER+ + 
Sbjct: 674  VDFVFALSNVLAKASELRINVMGYKDTEIEPNSPDCIDKVALPENKVIKKDTSGERYPNG 733

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  IS  TSDPE+  +G + A +E ++T+ K+ L+EFE+LKLEKD LA DLA CTE+LE 
Sbjct: 734  CAHISNPTSDPEVPDDGSIVAAYESETTACKFSLEEFEELKLEKDNLATDLARCTENLEM 793

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +L ETEQLL E+K++LAS++ SN LA+TQLKCMAESYRSLE               H
Sbjct: 794  TKSQLYETEQLLAEVKAQLASAQKSNSLAETQLKCMAESYRSLE--------------TH 839

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
            AQELE E+N L  K E+L+ EL +EK +  +A+A+CK+LE+++QR E C  C  SS AD 
Sbjct: 840  AQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELEEQLQRNENCAVC--SSEAD- 896

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            + K KQ+ ++AAAAE+LA+CQETI LLG+QLK+LRP S +++G+PY+ER Q  E+     
Sbjct: 897  ENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQS-EVIGSPYSERSQKGEF----L 951

Query: 900  PSPSRLNLPGMCSLHDLDPKDME---NAAAVMHRMGGESPLDGYMPPLSPSDTEGSMITR 1070
            P       P   SL + D  +M+   +A A  HR+G ESPLD Y  P SPS+ E S I +
Sbjct: 952  PGE-----PATASLQEFDHAEMDSVTSANAQPHRVGAESPLDLYTSPCSPSENEAS-INK 1005

Query: 1071 SPIRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            SPI SK PKH  T+        APTPEK++R FSRFF S+G+  H
Sbjct: 1006 SPINSKHPKHRPTKSTSSSSTSAPTPEKSSRGFSRFFSSKGRNGH 1050


>ref|XP_006435149.1| hypothetical protein CICLE_v10000102mg [Citrus clementina]
            gi|567885183|ref|XP_006435150.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
            gi|557537271|gb|ESR48389.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
            gi|557537272|gb|ESR48390.1| hypothetical protein
            CICLE_v10000102mg [Citrus clementina]
          Length = 1091

 Score =  356 bits (913), Expect = 3e-95
 Identities = 205/405 (50%), Positives = 272/405 (67%), Gaps = 5/405 (1%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            VDFV  LS+VL++AS L ++++  K  E E NS DCIDKV L EN V++ D   ER+ + 
Sbjct: 715  VDFVFALSNVLAKASELRINVMGYKDTEIEPNSPDCIDKVALPENKVIKKDTSGERYPNG 774

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  IS  TSDPE+  +G + A +E ++T+ K+ L+EFE+LKLEKD LA DLA CTE+LE 
Sbjct: 775  CAHISNPTSDPEVPDDGSIVAAYESETTACKFTLEEFEELKLEKDNLATDLARCTENLEM 834

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +L ETEQLL E+K++LAS++ SN LA+TQLKCMAESYRSLE               H
Sbjct: 835  TKSQLYETEQLLAEVKAQLASAQKSNSLAETQLKCMAESYRSLE--------------TH 880

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
            AQELE E+N L  K E+L+ EL +EK +  +A+A+CK+LE+++QR E C  C  SS AD 
Sbjct: 881  AQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELEEQLQRNENCAVC--SSEAD- 937

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            + K KQ+ ++AAAAE+LA+CQETI LLG+QLK+LRP S +++G+PY+ER Q  E+     
Sbjct: 938  ENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQS-EVIGSPYSERSQKGEF----L 992

Query: 900  PSPSRLNLPGMCSLHDLDPKDME---NAAAVMHRMGGESPLDGYMPPLSPSDTEGSMITR 1070
            P       P   SL + D  +M+   +A A  HR+G ESPLD Y  P SPS+ E S I +
Sbjct: 993  PGE-----PATASLQEFDHAEMDSVTSANAQPHRVGAESPLDLYTSPCSPSENEAS-INK 1046

Query: 1071 SPIRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            SPI SK PKH  T+        APTPEK++R FSRFF S+G+  H
Sbjct: 1047 SPINSKHPKHRPTKSTSSSSTSAPTPEKSSRGFSRFFSSKGRNGH 1091


>ref|XP_010917980.1| PREDICTED: filament-like plant protein 4 [Elaeis guineensis]
            gi|743775208|ref|XP_010917981.1| PREDICTED: filament-like
            plant protein 4 [Elaeis guineensis]
            gi|743775210|ref|XP_010917982.1| PREDICTED: filament-like
            plant protein 4 [Elaeis guineensis]
          Length = 1078

 Score =  355 bits (910), Expect = 7e-95
 Identities = 211/402 (52%), Positives = 262/402 (65%), Gaps = 2/402 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            +DF+L LS +LSE S    +M SDKR+EGESNSSDCIDKVTLLEN  V+ +  KE FS  
Sbjct: 703  IDFILALSKILSETS---FNMSSDKRNEGESNSSDCIDKVTLLENKEVEHESAKENFSGV 759

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
              L+  S+SDPEI  EG +  +FE K+T  K+ L+EFE LKLEK+ + M+LA C E LE+
Sbjct: 760  RLLVPHSSSDPEI--EGPVGHDFEVKATLQKFSLEEFEHLKLEKENMEMELARCNEMLEY 817

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +L ETEQ L ELKS+LA+S+ SN L++TQLKCMAESY++L              E  
Sbjct: 818  TKSQLVETEQNLAELKSQLAASQKSNSLSETQLKCMAESYKTL--------------ESR 863

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
             +ELE EI  L  KAE+LD EL EE+ + QD LA+ KDL+++I+R E+   C   S AD 
Sbjct: 864  TKELEAEIVLLLTKAESLDNELQEERRSHQDDLAKYKDLQEQIERNEKSLMC---SDADN 920

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            D KTKQE EIAAAAEKLA+CQETI LLGRQL+ +RP +     +P N RH+ ++Y  EN 
Sbjct: 921  DIKTKQEKEIAAAAEKLAECQETIRLLGRQLQTMRPPAESSTSSP-NNRHRMSDYLLENE 979

Query: 900  PSPSRLNLPGMCSLHDLDPKDMENAAAVM-HRMGGESPLDGYMPPLSPSDTEGSMITRSP 1076
            P PS  N     +L  L   +MENAA  M H  G ESPLDGY   +SP DTE S   RSP
Sbjct: 980  PGPSGFNRQ---TLPHLSHSEMENAAVPMTHTTGSESPLDGYNSHMSPPDTEASSFPRSP 1036

Query: 1077 IRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFFSRGKTVH 1202
            I SK  KH S+R           PEK  R FSRFFS+G++ H
Sbjct: 1037 ISSKRQKHRSSRASSSTSFPNTMPEKQGRGFSRFFSKGRSDH 1078


>ref|XP_006473632.1| PREDICTED: filament-like plant protein 4-like isoform X1 [Citrus
            sinensis] gi|568839322|ref|XP_006473633.1| PREDICTED:
            filament-like plant protein 4-like isoform X2 [Citrus
            sinensis]
          Length = 1091

 Score =  351 bits (901), Expect = 8e-94
 Identities = 203/405 (50%), Positives = 270/405 (66%), Gaps = 5/405 (1%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            VDFV  LS+VL++AS L ++++  K  E E NS DCIDKV L EN V++ D   ER+ + 
Sbjct: 715  VDFVFALSNVLAKASELRINVMGYKDTEIEPNSPDCIDKVALPENKVIKKDTSGERYPNG 774

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  IS  TSDPE+  +G + A +E ++T+ K+ L+EFE+LKLEKD LA DLA CTE+LE 
Sbjct: 775  CAHISNPTSDPEVPDDGSIVAAYESETTACKFSLEEFEELKLEKDNLATDLARCTENLEM 834

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +L ETEQLL E+K++LAS++ SN LA+TQLKCMAESYRSLE               H
Sbjct: 835  TKSQLYETEQLLAEVKAQLASAQKSNSLAETQLKCMAESYRSLE--------------TH 880

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
            AQELE E+N L  K E+L+ EL +EK +  +A+A+CK+LE+++QR E C  C  SS AD 
Sbjct: 881  AQELEAEVNLLRAKIESLENELQDEKMSHHNAMAKCKELEEQLQRNENCAVC--SSEAD- 937

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            + K KQ+ ++AAAAE+LA+CQETI LLG+QLK+LRP S +++G+PY+ER    E+     
Sbjct: 938  ENKIKQDRDLAAAAERLAECQETILLLGKQLKSLRPQS-EVIGSPYSERSPKGEF----L 992

Query: 900  PSPSRLNLPGMCSLHDLDPKDME---NAAAVMHRMGGESPLDGYMPPLSPSDTEGSMITR 1070
            P       P   SL + D  + +   +A A  HR+G ESPLD Y  P SPS+ E S I +
Sbjct: 993  PGE-----PATASLQEFDHAETDSVTSANAQPHRVGAESPLDLYTSPCSPSENEAS-INK 1046

Query: 1071 SPIRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            SPI SK PKH  T+        APTPEK++R FSRFF S+G+  H
Sbjct: 1047 SPINSKHPKHRPTKSTSSSSTSAPTPEKSSRGFSRFFSSKGRNGH 1091


>ref|XP_010908836.1| PREDICTED: filament-like plant protein 4 [Elaeis guineensis]
          Length = 1076

 Score =  346 bits (888), Expect = 3e-92
 Identities = 206/401 (51%), Positives = 258/401 (64%), Gaps = 2/401 (0%)
 Frame = +3

Query: 6    DFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSEC 185
            +F+LGLSH+L E S +S +M   + +EGESNSSDCIDKVTLLEN VVQ    KE  S  C
Sbjct: 706  NFILGLSHILCETSEMSFNMSGKQCNEGESNSSDCIDKVTLLENKVVQHASTKENLSRVC 765

Query: 186  NLISRSTSDPEILQEGRLDANFEQKST-SFKYLQEFEQLKLEKDKLAMDLATCTEDLEHT 362
            +L+  S SDPEI  EG +  +FE K+T     L+EF+ LKLEK+K+ M+LA C E LE T
Sbjct: 766  SLVPHSLSDPEI--EGPISHDFEVKATLKMCSLEEFKCLKLEKEKMEMELARCNEMLERT 823

Query: 363  KIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERHA 542
            K +L E E+ L ELKS L +S+ SN L++TQLKCMAESY++LE   QELE E+       
Sbjct: 824  KHRLVEMEENLAELKSLLTASQKSNSLSETQLKCMAESYKTLESRTQELEAEVVL----- 878

Query: 543  QELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADVD 722
                     LH KAE LD EL EE+ + QD LA+ KDL+++I+R E+ + C   S AD D
Sbjct: 879  ---------LHTKAEILDNELQEERCSHQDDLAKYKDLQEQIERIEKSSMC---SGADTD 926

Query: 723  TKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENAP 902
             K+KQE EIAAAAEKLA+CQETI LLGRQL+A+RP +  L   P N R+  +++  EN P
Sbjct: 927  IKSKQE-EIAAAAEKLAECQETILLLGRQLQAMRPPAESLSSYP-NNRYPMSDFFLENEP 984

Query: 903  SPSRLNLPGMCSLHDLDPKDMENAAAVM-HRMGGESPLDGYMPPLSPSDTEGSMITRSPI 1079
             P   N PG          +MENA+  M HR G ESPLDGY   +SPSDTE S   RSP+
Sbjct: 985  GPIGFN-PG--------HSEMENASVYMTHRTGSESPLDGYNSHMSPSDTEASSFPRSPV 1035

Query: 1080 RSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFFSRGKTVH 1202
             SK  KH S+R           PEK+ R FSRFFS+GK+ H
Sbjct: 1036 SSKRQKHRSSRSSSSISLPNTMPEKHGRGFSRFFSKGKSDH 1076


>ref|XP_010104432.1| hypothetical protein L484_016031 [Morus notabilis]
            gi|587913144|gb|EXC00965.1| hypothetical protein
            L484_016031 [Morus notabilis]
          Length = 1087

 Score =  346 bits (887), Expect = 3e-92
 Identities = 200/403 (49%), Positives = 268/403 (66%), Gaps = 3/403 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            +DFVL LS VL++AS L  S+L  K +E E+NS DCIDKV L EN  +Q D   E + + 
Sbjct: 716  IDFVLDLSSVLAKASELRFSVLGFKGNEAETNSPDCIDKVVLPENKAIQKDS-SEIYQNG 774

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  +  STS+PE+  +G + +++E  + S K  L+E++QLK EKD LA+D A CTE+LE 
Sbjct: 775  CAHMPNSTSNPEVPDDGNIVSSYESNAKSCKISLEEYDQLKSEKDNLALDFARCTENLEM 834

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +LQETEQLL E KS+L+S + SN L++TQLKCMAESYRSL              E  
Sbjct: 835  TKSQLQETEQLLAEAKSQLSSVQKSNSLSETQLKCMAESYRSL--------------ETR 880

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
            AQ+LE E+N L  K E+++AEL EEK N QDAL RCK+L++++QR E        ++ + 
Sbjct: 881  AQDLETELNLLRTKTESIEAELQEEKRNHQDALTRCKELQEQLQRNE--------NNCEN 932

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            + K  QE E AAAAEKLA+CQETIFLLG++LK LRP S ++MG+PY+ER Q  E   E+ 
Sbjct: 933  EIKPNQEKEFAAAAEKLAECQETIFLLGKKLKNLRPQS-EIMGSPYSERSQNGEGLNEDE 991

Query: 900  PSPSRLNLPGMCSLHDLDPKDMEN-AAAVMHRMGGESPLDGYMPPLSPSDTEGSMITRSP 1076
            P+ S +NLP      + D  ++E+  +A ++R+G ESP+D Y  PLSPSD E S I +SP
Sbjct: 992  PTTSGMNLP------ESDQAELESVTSANLNRVGAESPIDVYSAPLSPSDAEPS-ILKSP 1044

Query: 1077 IRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGKTVH 1202
            I SK P+H S +        APTPEK++R FSRFF S+GK  H
Sbjct: 1045 INSKNPRHKSPKSGSLSSSSAPTPEKHSRGFSRFFSSKGKNGH 1087


>ref|XP_008811426.1| PREDICTED: LOW QUALITY PROTEIN: filament-like plant protein 4
            [Phoenix dactylifera]
          Length = 1081

 Score =  343 bits (881), Expect = 2e-91
 Identities = 204/402 (50%), Positives = 257/402 (63%), Gaps = 2/402 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            +DF+L LS +LSE S    +M SDK + GESN SDCIDKVT LEN V++    K  FS  
Sbjct: 703  IDFILALSQILSETS---FNMPSDKGNGGESNGSDCIDKVTSLENKVLEHKSTKGNFSGV 759

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKYL-QEFEQLKLEKDKLAMDLATCTEDLEH 359
            C+L+  S+SDPEI  EG    +FE K+T   +  +EF+ LKLEK+ + M+LA C E LE 
Sbjct: 760  CSLVPHSSSDPEI--EGPNGRDFEVKATFQMFSPEEFKHLKLEKENMEMELARCNEMLER 817

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +L E EQ L ELKS+LA+S+ SN L++TQLKCMAESY++L              E  
Sbjct: 818  TKSQLVEMEQNLAELKSQLAASQKSNSLSETQLKCMAESYKTL--------------ESR 863

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
             +ELE EI  L  KAE+LD EL EE+ + QD LA+ K+L+++ +R E+    L SS AD 
Sbjct: 864  TKELEAEIVLLQTKAESLDNELQEERRSHQDDLAKYKELQEQTERNEK---SLMSSDADT 920

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            D KTKQE EIAAAAEKL +CQETI +LGRQL+A+RP +  L  +P N RH+ ++Y  EN 
Sbjct: 921  DIKTKQEREIAAAAEKLVECQETIRVLGRQLQAMRPPAESLSSSP-NNRHRMSDYLLENE 979

Query: 900  PSPSRLNLPGMCSLHDLDPKDMENAAAVM-HRMGGESPLDGYMPPLSPSDTEGSMITRSP 1076
            P PS +N   M +       +MENAA  M  R GGESPLDGY   +SPSDTE S   RSP
Sbjct: 980  PGPSGINPQVMRASPHSSHSEMENAAVPMTQRTGGESPLDGYNSHMSPSDTEASSFPRSP 1039

Query: 1077 IRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFFSRGKTVH 1202
            I SK  KH S+R           PEK  R FSRFFS+GK+ H
Sbjct: 1040 ISSKRQKHRSSRPSSSTSFPNTMPEKQGRGFSRFFSKGKSDH 1081


>ref|XP_002301986.2| hypothetical protein POPTR_0002s02600g [Populus trichocarpa]
            gi|550344134|gb|EEE81259.2| hypothetical protein
            POPTR_0002s02600g [Populus trichocarpa]
          Length = 1063

 Score =  343 bits (881), Expect = 2e-91
 Identities = 205/401 (51%), Positives = 261/401 (65%), Gaps = 4/401 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            +DF+  LS VL+ AS L  ++L  K +E E NS DCIDKV L EN V+Q D P E F + 
Sbjct: 689  IDFMFDLSRVLALASGLRFNVLGYKCNEAEINSPDCIDKVALPENKVIQNDSPGETFQNG 748

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  IS  TS+PE+   G L   +   +TS K  L+EFE+LK EKD +AMDLA CTE+LE 
Sbjct: 749  CANISSPTSNPEVPDYGNLVPGYGSNTTSCKVSLEEFEELKSEKDTMAMDLARCTENLEM 808

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +L ETEQLL E+KS+L S++ SN LA+TQLKCMAESYRSL              E  
Sbjct: 809  TKSQLHETEQLLAEVKSQLVSAQKSNSLAETQLKCMAESYRSL--------------ETR 854

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSAD- 716
            AQELE E+N L VK E L++EL EEK + QDAL RCK+LE+++Q  E       SSSAD 
Sbjct: 855  AQELETEVNLLRVKTETLESELQEEKTSHQDALTRCKELEEQLQTKE-------SSSADG 907

Query: 717  VDTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFEN 896
            +D K+KQE EI AAAEKLA+CQETIFLLG+QLK LRP + ++MG+PY+ER Q+ +   ++
Sbjct: 908  IDLKSKQEKEITAAAEKLAECQETIFLLGKQLKYLRPQT-EIMGSPYSERSQSGDGIAKD 966

Query: 897  APSPSRLNLPGMCSLHDLDPKDMENAAAV-MHRMGGESPLDGYMPPLSPSDTEGSMITRS 1073
             P+ S +N      L D D  +M+  A+V   + G ESP D Y  P  PSDTE +++ RS
Sbjct: 967  EPTISGIN------LQDSDQAEMDTGASVNFLKAGSESPSDSYNHPCYPSDTESNLL-RS 1019

Query: 1074 PIRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGK 1193
            P+  K PKH  T+         PTPEK+ R FSRFF S+GK
Sbjct: 1020 PVGLKHPKHRPTKSTSSSSSSTPTPEKHPRGFSRFFSSKGK 1060


>ref|XP_006386179.1| hypothetical protein POPTR_0002s02600g [Populus trichocarpa]
            gi|550344133|gb|ERP63976.1| hypothetical protein
            POPTR_0002s02600g [Populus trichocarpa]
          Length = 991

 Score =  343 bits (881), Expect = 2e-91
 Identities = 205/401 (51%), Positives = 261/401 (65%), Gaps = 4/401 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            +DF+  LS VL+ AS L  ++L  K +E E NS DCIDKV L EN V+Q D P E F + 
Sbjct: 617  IDFMFDLSRVLALASGLRFNVLGYKCNEAEINSPDCIDKVALPENKVIQNDSPGETFQNG 676

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  IS  TS+PE+   G L   +   +TS K  L+EFE+LK EKD +AMDLA CTE+LE 
Sbjct: 677  CANISSPTSNPEVPDYGNLVPGYGSNTTSCKVSLEEFEELKSEKDTMAMDLARCTENLEM 736

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +L ETEQLL E+KS+L S++ SN LA+TQLKCMAESYRSL              E  
Sbjct: 737  TKSQLHETEQLLAEVKSQLVSAQKSNSLAETQLKCMAESYRSL--------------ETR 782

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSAD- 716
            AQELE E+N L VK E L++EL EEK + QDAL RCK+LE+++Q  E       SSSAD 
Sbjct: 783  AQELETEVNLLRVKTETLESELQEEKTSHQDALTRCKELEEQLQTKE-------SSSADG 835

Query: 717  VDTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFEN 896
            +D K+KQE EI AAAEKLA+CQETIFLLG+QLK LRP + ++MG+PY+ER Q+ +   ++
Sbjct: 836  IDLKSKQEKEITAAAEKLAECQETIFLLGKQLKYLRPQT-EIMGSPYSERSQSGDGIAKD 894

Query: 897  APSPSRLNLPGMCSLHDLDPKDMENAAAV-MHRMGGESPLDGYMPPLSPSDTEGSMITRS 1073
             P+ S +N      L D D  +M+  A+V   + G ESP D Y  P  PSDTE +++ RS
Sbjct: 895  EPTISGIN------LQDSDQAEMDTGASVNFLKAGSESPSDSYNHPCYPSDTESNLL-RS 947

Query: 1074 PIRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFF-SRGK 1193
            P+  K PKH  T+         PTPEK+ R FSRFF S+GK
Sbjct: 948  PVGLKHPKHRPTKSTSSSSSSTPTPEKHPRGFSRFFSSKGK 988


>ref|XP_002510512.1| Myosin heavy chain, striated muscle, putative [Ricinus communis]
            gi|223551213|gb|EEF52699.1| Myosin heavy chain, striated
            muscle, putative [Ricinus communis]
          Length = 1041

 Score =  342 bits (876), Expect = 6e-91
 Identities = 199/400 (49%), Positives = 262/400 (65%), Gaps = 3/400 (0%)
 Frame = +3

Query: 3    VDFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSE 182
            +DF+  LS VL++AS L  ++L  K  E E NSSDCIDKV L EN V+Q D   E + + 
Sbjct: 663  IDFIFYLSCVLAKASELRFNVLGYKGSEAEINSSDCIDKVALPENKVLQRDSSGESYQNS 722

Query: 183  CNLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEH 359
            C  IS  TS+PE+  +G L + +   +T  K  L+EFE+LK EK+ +A+DLA CTE+LE 
Sbjct: 723  CAHISSPTSNPEVPDDGSLVSGYGSNTTLCKVSLEEFEELKSEKNNVALDLARCTENLEM 782

Query: 360  TKIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERH 539
            TK +L ETEQLL E KS+LAS++ SN LA+TQLKCMAESYRSL              E  
Sbjct: 783  TKSQLHETEQLLAEAKSQLASAQKSNSLAETQLKCMAESYRSL--------------EAR 828

Query: 540  AQELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADV 719
            A+ELE E+N L  KAE L+ EL +EK    DAL+R K+LE+++Q  E C+ C  S++AD 
Sbjct: 829  AEELETEVNLLQAKAETLENELQDEKQCHWDALSRSKELEEQLQTKESCSVC--SAAADA 886

Query: 720  DTKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENA 899
            + K  Q+ E+AAAAEKLA+CQETIFLLG+QLKALRP + +LMG+ Y+ER +  +   E+ 
Sbjct: 887  ENKANQDRELAAAAEKLAECQETIFLLGKQLKALRPQT-ELMGSAYSERSRKGDGFAEDE 945

Query: 900  PSPSRLNLPGMCSLHDLDPKDMENAAAV-MHRMGGESPLDGYMPPLSPSDTEGSMITRSP 1076
            P+ S +N      L D D  +M+   +   HR G ESP+D Y  P SPSDTE S ++RSP
Sbjct: 946  PTTSGMN------LQDFDQAEMDAIVSTNHHRAGAESPMDLYNQPCSPSDTE-SNLSRSP 998

Query: 1077 IRSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFFS-RGK 1193
            + SK PKH ST+          TPEK++R FSRFFS +GK
Sbjct: 999  LNSKQPKHRSTKSTSSSSSHMATPEKHSRGFSRFFSAKGK 1038


>ref|XP_008776485.1| PREDICTED: filament-like plant protein 4 isoform X2 [Phoenix
            dactylifera]
          Length = 1059

 Score =  340 bits (873), Expect = 1e-90
 Identities = 204/401 (50%), Positives = 260/401 (64%), Gaps = 2/401 (0%)
 Frame = +3

Query: 6    DFVLGLSHVLSEASRLSLSMLSDKRHEGESNSSDCIDKVTLLENNVVQLDPPKERFSSEC 185
            DF+L LSH+L E S +S ++   K  EGESN SDC+DKVTLLEN V++    KE FS  C
Sbjct: 689  DFILALSHILCETSEMSFNISGKKCSEGESNISDCVDKVTLLENKVIRHASIKENFSGVC 748

Query: 186  NLISRSTSDPEILQEGRLDANFEQKSTSFKY-LQEFEQLKLEKDKLAMDLATCTEDLEHT 362
            +L+  S+SDPEI  E  +  +FE K+T  K  L+EF+ LKLEK+ + M+LA C E LEHT
Sbjct: 749  SLVPYSSSDPEI--ERPISHDFEVKATLKKCSLEEFKCLKLEKENMEMELARCNEMLEHT 806

Query: 363  KIKLQETEQLLVELKSELASSKNSNGLADTQLKCMAESYRSLEMHAQELEKEMHAQERHA 542
            K +L ETE+ L ELKS+LA+S+ SN L++TQLKCMAESY++LE   QELE E+       
Sbjct: 807  KHQLVETEENLAELKSQLAASQKSNSLSETQLKCMAESYKALESRTQELEAEVVL----- 861

Query: 543  QELEKEINGLHVKAEALDAELHEEKHNLQDALARCKDLEDEIQRTERCTTCLHSSSADVD 722
                     LH KAE LD EL EE+ + QD LA+ KDL+++I+R E+ + C   S AD D
Sbjct: 862  ---------LHTKAETLDNELQEERCSHQDDLAKYKDLQEQIERNEKSSMC---SGADTD 909

Query: 723  TKTKQENEIAAAAEKLAQCQETIFLLGRQLKALRPSSADLMGAPYNERHQTNEYSFENAP 902
             K+KQE EIAAAAEKLA+CQETI LLGRQL+A+RP +  L   P N R+  ++Y  EN P
Sbjct: 910  IKSKQE-EIAAAAEKLAECQETILLLGRQLQAMRPPAESLSSYP-NNRYPMSDYFLENEP 967

Query: 903  SPSRLNLPGMCSLHDLDPKDMENAAAVMHRM-GGESPLDGYMPPLSPSDTEGSMITRSPI 1079
             PS  N      +H     +ME A+  M ++ GG SPLDGY   +SPSDTE S   RSPI
Sbjct: 968  GPSGFN-----PVH----SEMEIASVHMTQITGGGSPLDGYNFDMSPSDTEASSFPRSPI 1018

Query: 1080 RSKPPKHTSTRXXXXXXXXAPTPEKNARNFSRFFSRGKTVH 1202
             SK  KH S+R           PEK+ R FSRFFS+GK+ H
Sbjct: 1019 SSKRQKHRSSRSSSSTSLPNVMPEKHGRGFSRFFSKGKSDH 1059

