BLASTX nr result

ID: Chrysanthemum21_contig00020582 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00020582
         (1488 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVI09611.1| Armadillo-like helical [Cynara cardunculus var. s...   469   e-154
ref|XP_021993993.1| proline-, glutamic acid- and leucine-rich pr...   467   e-153
ref|XP_023767643.1| proline-, glutamic acid- and leucine-rich pr...   411   e-134
ref|XP_023767642.1| proline-, glutamic acid- and leucine-rich pr...   411   e-132
ref|XP_018842583.1| PREDICTED: proline-, glutamic acid- and leuc...   300   3e-89
ref|XP_018842582.1| PREDICTED: proline-, glutamic acid- and leuc...   300   1e-88
ref|XP_017646309.1| PREDICTED: proline-, glutamic acid- and leuc...   297   7e-88
gb|KJB67750.1| hypothetical protein B456_010G208000 [Gossypium r...   295   2e-87
ref|XP_012451559.1| PREDICTED: proline-, glutamic acid- and leuc...   295   4e-87
ref|XP_016728366.1| PREDICTED: proline-, glutamic acid- and leuc...   290   5e-87
ref|XP_021296585.1| proline-, glutamic acid- and leucine-rich pr...   289   6e-87
ref|XP_016728365.1| PREDICTED: proline-, glutamic acid- and leuc...   290   1e-85
ref|XP_016728363.1| PREDICTED: proline-, glutamic acid- and leuc...   290   2e-85
gb|EOY19217.1| Uncharacterized protein TCM_044175 isoform 2 [The...   288   3e-85
ref|XP_016649496.1| PREDICTED: uncharacterized protein LOC103327...   287   4e-85
gb|EOY19216.1| Uncharacterized protein TCM_044175 isoform 1 [The...   288   4e-85
ref|XP_017235696.1| PREDICTED: uncharacterized protein LOC108209...   288   1e-84
ref|XP_017984990.1| PREDICTED: proline-, glutamic acid- and leuc...   286   1e-84
ref|XP_021812691.1| uncharacterized protein LOC110755738 isoform...   285   3e-84
ref|XP_010242434.1| PREDICTED: proline-, glutamic acid- and leuc...   287   4e-84

>gb|KVI09611.1| Armadillo-like helical [Cynara cardunculus var. scolymus]
          Length = 828

 Score =  469 bits (1206), Expect = e-154
 Identities = 240/393 (61%), Positives = 275/393 (69%), Gaps = 9/393 (2%)
 Frame = +3

Query: 6    DISNKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHT 185
            DISNKAT  ER+ MPSISALMLCCSTML TSYP + K                DGSLPHT
Sbjct: 300  DISNKATRPERLFMPSISALMLCCSTMLRTSYPVQIKVPVRSLLMLAGRVLMVDGSLPHT 359

Query: 186  LYPTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQP 365
            LYP +TAMQQE +C ELPVQHSYSL+ILCGIVK A SQL PHAAH++R+VTEY RRC  P
Sbjct: 360  LYPIMTAMQQECICTELPVQHSYSLEILCGIVKEARSQLFPHAAHIVRLVTEYFRRCALP 419

Query: 366  ELRVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARSNS--------- 518
            ELR+K+YALIKLML+SMGVG+T+YLAEDVVSNAS+DLDSVG  GGEA SN          
Sbjct: 420  ELRIKVYALIKLMLMSMGVGITMYLAEDVVSNASVDLDSVGHHGGEACSNPVLKTSEAVP 479

Query: 519  EPVQKKRKHEMAVTSHENQSETIHTRKNPVPVSLKIXXXXXXXXXXXXXXXXRSEGWRSN 698
            +P+QKKRKH+M +TS  NQ +T    KN  P+S+KI                RSE WRSN
Sbjct: 480  QPMQKKRKHDMTITSFGNQPQTSSLHKNHTPISVKIAALEALETLLTVGGALRSERWRSN 539

Query: 699  VDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVRPPYL 878
            VD+L+ TV+TDAC GGWTK              WADFQ             PGR+RPPYL
Sbjct: 540  VDVLLITVSTDACKGGWTKQANNVYTLHDSSSSWADFQLASLRALLASLLSPGRIRPPYL 599

Query: 879  AHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKNSFMD 1058
            A GLELF +G Q  G+K+AEFCAHAL+TLEVLIHPRA+PLID  SSI+Y  N  K+SFMD
Sbjct: 600  AQGLELFHRGMQETGTKVAEFCAHALLTLEVLIHPRALPLIDIASSIDYPVNGVKDSFMD 659

Query: 1059 NNLYFGSQKHNPISAGTSKNGFENPESEEDDLY 1157
            NN Y G+QKH+  S GTS+NG E PESEEDDLY
Sbjct: 660  NNTYSGAQKHHLYSGGTSRNGLEYPESEEDDLY 692


>ref|XP_021993993.1| proline-, glutamic acid- and leucine-rich protein 1 [Helianthus
            annuus]
 gb|OTG08476.1| hypothetical protein HannXRQ_Chr11g0342181 [Helianthus annuus]
          Length = 808

 Score =  467 bits (1201), Expect = e-153
 Identities = 240/385 (62%), Positives = 275/385 (71%)
 Frame = +3

Query: 3    FDISNKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPH 182
            FD+SNK+T  ERVSM  IS+LMLCCSTMLT+SYPA+AK                DGSL  
Sbjct: 296  FDLSNKSTKPERVSMGCISSLMLCCSTMLTSSYPAQAKVPVQLLLMLVERVLMVDGSLTQ 355

Query: 183  TLYPTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQ 362
            TLYPTITAMQQEYVCLELPV H++ LDILCGIVK AHSQLLPHAA +IRIVTEYLR+CE 
Sbjct: 356  TLYPTITAMQQEYVCLELPVHHAHCLDILCGIVKEAHSQLLPHAARIIRIVTEYLRKCEL 415

Query: 363  PELRVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARSNSEPVQKKRK 542
             ELRVK+YAL+KLMLLSMGVGL++YLAED+V NAS DLDS+GDRG EARSNSE  QKKRK
Sbjct: 416  SELRVKVYALLKLMLLSMGVGLSLYLAEDIVHNASTDLDSIGDRGDEARSNSESTQKKRK 475

Query: 543  HEMAVTSHENQSETIHTRKNPVPVSLKIXXXXXXXXXXXXXXXXRSEGWRSNVDLLIKTV 722
             EM V    +Q++T++  KNP+P++LKI                 S+ WRS+VD L+  V
Sbjct: 476  REMTVMPFGDQAQTVYLPKNPIPIALKIAALEVLETLLTMGGASGSDSWRSSVDSLVINV 535

Query: 723  ATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVRPPYLAHGLELFR 902
            ATDAC GGW KP             WADFQ             PGR+RPPYLAHGLEL+R
Sbjct: 536  ATDACKGGWIKPANDFHNSHNSSSNWADFQLASLRALLASLLSPGRIRPPYLAHGLELYR 595

Query: 903  KGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKNSFMDNNLYFGSQ 1082
            KGKQ  G+K+AEFCAHALM LE+LIHPRAIPLIDFGSSIEY  N  K+ F +N  Y G+Q
Sbjct: 596  KGKQETGTKIAEFCAHALMALELLIHPRAIPLIDFGSSIEYPVNGVKSRFTENYTYSGAQ 655

Query: 1083 KHNPISAGTSKNGFENPESEEDDLY 1157
             HN  S G S+N  ENPES EDDLY
Sbjct: 656  NHNVFSVGASRNEPENPES-EDDLY 679



 Score = 98.6 bits (244), Expect = 5e-18
 Identities = 55/96 (57%), Positives = 65/96 (67%), Gaps = 6/96 (6%)
 Frame = +2

Query: 1169 NEVTPPLSEKLVSIEGPSSAKVTHEDKGKGIMVETQSLEIISKQSEVLSQTVDFQASDKM 1348
            NE TP L+EK+VS+E  S A+V  EDKGKGI+VET S+E + KQ+EV SQTVDF+  D M
Sbjct: 699  NETTPQLAEKVVSVEVSSGAEVVPEDKGKGILVETPSVEEVKKQTEVQSQTVDFKLVDNM 758

Query: 1349 ETGDMGSKGNTLVSDFMSG------LGDNDDPMDEI 1438
             TGD      T VS  + G      LGDNDDPMDEI
Sbjct: 759  VTGDPEPTDKTPVSGPVGGMESKFDLGDNDDPMDEI 794


>ref|XP_023767643.1| proline-, glutamic acid- and leucine-rich protein 1 isoform X2
            [Lactuca sativa]
          Length = 641

 Score =  411 bits (1056), Expect = e-134
 Identities = 223/390 (57%), Positives = 264/390 (67%), Gaps = 5/390 (1%)
 Frame = +3

Query: 3    FDISNKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPH 182
            FD+SNK+T  ER+S+ +IS+LMLCCSTMLTTSY  + K                DGSLP 
Sbjct: 142  FDLSNKST--ERLSITNISSLMLCCSTMLTTSYAVRVKVPIKALLMVVERVLNVDGSLPQ 199

Query: 183  TLYPTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQ 362
            T+YPT+TAMQQEY+C +LPV HSYSL+ILC I+K A SQLLPH AH IR+VTEY+RRCE 
Sbjct: 200  TMYPTLTAMQQEYICTQLPVLHSYSLEILCDIIKEARSQLLPHTAHTIRLVTEYIRRCEL 259

Query: 363  PELRVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARSNSEPVQKKRK 542
            PELRVKLY  IK+ML+SMGVG+TIYLAEDV+SNAS+DLDS    GGEA    +P+ KKRK
Sbjct: 260  PELRVKLYGFIKMMLMSMGVGMTIYLAEDVISNASVDLDSTHVHGGEA----QPMNKKRK 315

Query: 543  HEMAVTSHENQSETIHTRKNPVPVSLKIXXXXXXXXXXXXXXXXRSEGWRSNVDLLIKTV 722
            HE A++  ENQS+ I++ KN VP+S+KI                RSE WRSNVD L+ TV
Sbjct: 316  HENAISFLENQSQIIYSPKNHVPISVKIAALGALETLLTVGGALRSESWRSNVDQLLITV 375

Query: 723  ATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVRPPYLAHGLELFR 902
            ATDAC  GW K              W DFQ             PGR RP YLA GLELFR
Sbjct: 376  ATDACKNGWAK-----EANDVYTPSWGDFQLASLRALLASLLSPGRFRPAYLAQGLELFR 430

Query: 903  KGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEY--SPNDGKNSFMDNNLY-F 1073
             G Q  G+KLAEFCAHAL++LEVLIHPRA+PLID GSS+EY  + N  K+ FM NN+Y  
Sbjct: 431  GGMQETGTKLAEFCAHALLSLEVLIHPRALPLIDIGSSVEYPVNVNGMKDKFMKNNIYSS 490

Query: 1074 GSQKHN--PISAGTSKNGFENPESEEDDLY 1157
            G QK N   ++ GTS+N     ESEEDDLY
Sbjct: 491  GVQKSNLVTVTGGTSRN---EAESEEDDLY 517


>ref|XP_023767642.1| proline-, glutamic acid- and leucine-rich protein 1 isoform X1
            [Lactuca sativa]
 gb|PLY82584.1| hypothetical protein LSAT_2X104580 [Lactuca sativa]
          Length = 798

 Score =  411 bits (1056), Expect = e-132
 Identities = 223/390 (57%), Positives = 264/390 (67%), Gaps = 5/390 (1%)
 Frame = +3

Query: 3    FDISNKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPH 182
            FD+SNK+T  ER+S+ +IS+LMLCCSTMLTTSY  + K                DGSLP 
Sbjct: 299  FDLSNKST--ERLSITNISSLMLCCSTMLTTSYAVRVKVPIKALLMVVERVLNVDGSLPQ 356

Query: 183  TLYPTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQ 362
            T+YPT+TAMQQEY+C +LPV HSYSL+ILC I+K A SQLLPH AH IR+VTEY+RRCE 
Sbjct: 357  TMYPTLTAMQQEYICTQLPVLHSYSLEILCDIIKEARSQLLPHTAHTIRLVTEYIRRCEL 416

Query: 363  PELRVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARSNSEPVQKKRK 542
            PELRVKLY  IK+ML+SMGVG+TIYLAEDV+SNAS+DLDS    GGEA    +P+ KKRK
Sbjct: 417  PELRVKLYGFIKMMLMSMGVGMTIYLAEDVISNASVDLDSTHVHGGEA----QPMNKKRK 472

Query: 543  HEMAVTSHENQSETIHTRKNPVPVSLKIXXXXXXXXXXXXXXXXRSEGWRSNVDLLIKTV 722
            HE A++  ENQS+ I++ KN VP+S+KI                RSE WRSNVD L+ TV
Sbjct: 473  HENAISFLENQSQIIYSPKNHVPISVKIAALGALETLLTVGGALRSESWRSNVDQLLITV 532

Query: 723  ATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVRPPYLAHGLELFR 902
            ATDAC  GW K              W DFQ             PGR RP YLA GLELFR
Sbjct: 533  ATDACKNGWAK-----EANDVYTPSWGDFQLASLRALLASLLSPGRFRPAYLAQGLELFR 587

Query: 903  KGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEY--SPNDGKNSFMDNNLY-F 1073
             G Q  G+KLAEFCAHAL++LEVLIHPRA+PLID GSS+EY  + N  K+ FM NN+Y  
Sbjct: 588  GGMQETGTKLAEFCAHALLSLEVLIHPRALPLIDIGSSVEYPVNVNGMKDKFMKNNIYSS 647

Query: 1074 GSQKHN--PISAGTSKNGFENPESEEDDLY 1157
            G QK N   ++ GTS+N     ESEEDDLY
Sbjct: 648  GVQKSNLVTVTGGTSRN---EAESEEDDLY 674


>ref|XP_018842583.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            isoform X2 [Juglans regia]
          Length = 815

 Score =  300 bits (767), Expect = 3e-89
 Identities = 163/395 (41%), Positives = 222/395 (56%), Gaps = 14/395 (3%)
 Frame = +3

Query: 15   NKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLYP 194
            N     ER+ + S+S LMLCC TMLT+SYP +                  DGSLPH L P
Sbjct: 235  NPTRRSERLLISSVSTLMLCCCTMLTSSYPVQVNVPIQSLLVLVKRVLMVDGSLPHALLP 294

Query: 195  TITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPELR 374
             +T+MQQE +C ELPV HSYSL++L  ++KG  SQLLPHAA ++R++T Y +RC  PELR
Sbjct: 295  FMTSMQQELICSELPVLHSYSLELLSAVIKGTRSQLLPHAASIVRLITSYFKRCALPELR 354

Query: 375  VKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSV-GDRGGEARSNS------EPVQK 533
            +K+Y++ +++L+SMGVG+ IYLA++V++NA +DL  V G +     S +      +P  +
Sbjct: 355  IKVYSITRILLISMGVGVAIYLAQEVINNAFVDLHQVSGCKTSSVNSKAFSEALPQPSHR 414

Query: 534  KRKHEMAV----TSHENQSETIHTRKNP--VPVSLKIXXXXXXXXXXXXXXXXRSEGWRS 695
            KRKH          H+     +   KN    P+S+++                RSE WRS
Sbjct: 415  KRKHVTTTGYLEEQHDRGGLQVEAPKNQSISPISVRVAALEALEALFTVGGALRSESWRS 474

Query: 696  NVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVRPPY 875
            NVDLL+   AT +  G W                W DFQ                VRPPY
Sbjct: 475  NVDLLLINTATSSLEGKWASEEKHSFQPNEPTSIWVDFQLAALRALLASLLSSVHVRPPY 534

Query: 876  LAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKNSFM 1055
            LA GLELF++GKQ  G+KLAEFCAHAL+ LEVLIHPRA+PL+   S +  +  +G N   
Sbjct: 535  LAQGLELFQRGKQETGTKLAEFCAHALLALEVLIHPRALPLMG-SSPVTCNSFEGVNHKF 593

Query: 1056 DNNLYFGSQKHN-PISAGTSKNGFENPESEEDDLY 1157
              N+Y GS KH+ P ++G        P+S++DDLY
Sbjct: 594  SENMYSGSLKHSGPFASGIQGIKDNIPDSDDDDLY 628


>ref|XP_018842582.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            isoform X1 [Juglans regia]
          Length = 893

 Score =  300 bits (767), Expect = 1e-88
 Identities = 163/395 (41%), Positives = 222/395 (56%), Gaps = 14/395 (3%)
 Frame = +3

Query: 15   NKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLYP 194
            N     ER+ + S+S LMLCC TMLT+SYP +                  DGSLPH L P
Sbjct: 313  NPTRRSERLLISSVSTLMLCCCTMLTSSYPVQVNVPIQSLLVLVKRVLMVDGSLPHALLP 372

Query: 195  TITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPELR 374
             +T+MQQE +C ELPV HSYSL++L  ++KG  SQLLPHAA ++R++T Y +RC  PELR
Sbjct: 373  FMTSMQQELICSELPVLHSYSLELLSAVIKGTRSQLLPHAASIVRLITSYFKRCALPELR 432

Query: 375  VKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSV-GDRGGEARSNS------EPVQK 533
            +K+Y++ +++L+SMGVG+ IYLA++V++NA +DL  V G +     S +      +P  +
Sbjct: 433  IKVYSITRILLISMGVGVAIYLAQEVINNAFVDLHQVSGCKTSSVNSKAFSEALPQPSHR 492

Query: 534  KRKHEMAV----TSHENQSETIHTRKNP--VPVSLKIXXXXXXXXXXXXXXXXRSEGWRS 695
            KRKH          H+     +   KN    P+S+++                RSE WRS
Sbjct: 493  KRKHVTTTGYLEEQHDRGGLQVEAPKNQSISPISVRVAALEALEALFTVGGALRSESWRS 552

Query: 696  NVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVRPPY 875
            NVDLL+   AT +  G W                W DFQ                VRPPY
Sbjct: 553  NVDLLLINTATSSLEGKWASEEKHSFQPNEPTSIWVDFQLAALRALLASLLSSVHVRPPY 612

Query: 876  LAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKNSFM 1055
            LA GLELF++GKQ  G+KLAEFCAHAL+ LEVLIHPRA+PL+   S +  +  +G N   
Sbjct: 613  LAQGLELFQRGKQETGTKLAEFCAHALLALEVLIHPRALPLMG-SSPVTCNSFEGVNHKF 671

Query: 1056 DNNLYFGSQKHN-PISAGTSKNGFENPESEEDDLY 1157
              N+Y GS KH+ P ++G        P+S++DDLY
Sbjct: 672  SENMYSGSLKHSGPFASGIQGIKDNIPDSDDDDLY 706


>ref|XP_017646309.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            [Gossypium arboreum]
          Length = 871

 Score =  297 bits (760), Expect = 7e-88
 Identities = 163/398 (40%), Positives = 223/398 (56%), Gaps = 16/398 (4%)
 Frame = +3

Query: 12   SNKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLY 191
            S K T  ER+   +IS LM CC  MLT+SYP +                  DGSLPHT+ 
Sbjct: 301  SFKGTSSERLPTATISTLMFCCCKMLTSSYPVQVTVPVRSMLALVERLLRVDGSLPHTML 360

Query: 192  PTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPEL 371
            P +T++QQE +C ELPV H+YSL++L  ++KG   QLLPH+A+++R+VT Y +RC  PEL
Sbjct: 361  PFMTSVQQELICSELPVLHAYSLELLIALIKGMRRQLLPHSAYIVRVVTRYFKRCSLPEL 420

Query: 372  RVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSV---------GDRGGEARSNSEP 524
            R+KLY++I+++LLSMGVG+ IYLA DVV NAS DL+S+          +RG    +  + 
Sbjct: 421  RIKLYSIIRMLLLSMGVGIAIYLAPDVVENASNDLNSLDGEDIETSPANRGPATGALPQL 480

Query: 525  VQKKRKHEMAVTSHENQSETIHTR------KNPVPVSLKIXXXXXXXXXXXXXXXXRSEG 686
              +KRKH     S E + +    +         +P+++KI                +SE 
Sbjct: 481  SNRKRKHGAKTGSLEEKQDAPSPKVGESNTHQTIPITVKIAALDTLEVLLTVGGASKSES 540

Query: 687  WRSNVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVR 866
            WRS++D L+   A ++C  GW                WADFQ             P RVR
Sbjct: 541  WRSSIDGLLMKTAINSCKRGWGNLESNIFLPHESASVWADFQLSSLRALLTSFLAPARVR 600

Query: 867  PPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKN 1046
            PPYL+ GLELFR+GKQ  G KLA+FCA+AL  LEVLIHPRA+PL DF S+   S +   N
Sbjct: 601  PPYLSQGLELFRRGKQEAGMKLAQFCAYALFALEVLIHPRALPLDDFYSACHNSTDGASN 660

Query: 1047 SFMDNNLYFGSQKHN-PISAGTSKNGFENPESEEDDLY 1157
             F++ N+Y GS+K N    +   +      ES +DDLY
Sbjct: 661  RFLE-NIYSGSRKQNTSFLSAMQRTEQGGVESHDDDLY 697


>gb|KJB67750.1| hypothetical protein B456_010G208000 [Gossypium raimondii]
 gb|KJB67751.1| hypothetical protein B456_010G208000 [Gossypium raimondii]
          Length = 844

 Score =  295 bits (755), Expect = 2e-87
 Identities = 162/398 (40%), Positives = 222/398 (55%), Gaps = 16/398 (4%)
 Frame = +3

Query: 12   SNKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLY 191
            S K T  ER+   +IS LM CC  MLT+SYP +                  DGSLPHT+ 
Sbjct: 301  SFKGTSSERLPTATISTLMFCCCKMLTSSYPVQVTVPVRSMLALVERLLRVDGSLPHTML 360

Query: 192  PTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPEL 371
            P +T++QQE +C ELPV H+Y L++L  I+KG   QLLPH+A+++R+VT Y +RC  PEL
Sbjct: 361  PFMTSVQQELICSELPVLHAYCLELLIAIIKGMRRQLLPHSAYIVRVVTRYFKRCSLPEL 420

Query: 372  RVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGE-ARSNSEPV------- 527
            R+KLY++I+++L+SMGVG+ IYLA DV+ NAS DL+S+G    E + +N+ P        
Sbjct: 421  RIKLYSIIRMLLVSMGVGIAIYLAPDVIENASYDLNSLGGEDIETSPANTGPATGALPQL 480

Query: 528  -QKKRKHEMAVTSHENQSETIHTR------KNPVPVSLKIXXXXXXXXXXXXXXXXRSEG 686
              +KRKH     S E + +    +          P+++K+                +SE 
Sbjct: 481  SNRKRKHGAKTGSLEEKQDAPSPKVGESNTHQMTPITVKMAALDTLEVLLTVGAASKSES 540

Query: 687  WRSNVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVR 866
            WRS++D L+   A ++C  GW                WADFQ             P R R
Sbjct: 541  WRSSIDSLLMKTAINSCKRGWGNLESNIFLPHESASVWADFQFSSLRALLTSFLAPARTR 600

Query: 867  PPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKN 1046
            PPYL+ GLELFR+GKQ  G KLA+FCA+AL  LEVLIHPRA+PL DF S+   S +   N
Sbjct: 601  PPYLSQGLELFRRGKQEAGMKLAQFCAYALFALEVLIHPRALPLDDFYSACHNSTDGASN 660

Query: 1047 SFMDNNLYFGSQKHNPISAGTSKNGFE-NPESEEDDLY 1157
             F++ N+Y GSQK N       +   +   ES +DDLY
Sbjct: 661  RFLE-NIYSGSQKQNTSFLSAMRRTEQGGVESHDDDLY 697


>ref|XP_012451559.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            [Gossypium raimondii]
 ref|XP_012451560.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            [Gossypium raimondii]
 ref|XP_012451561.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            [Gossypium raimondii]
          Length = 873

 Score =  295 bits (755), Expect = 4e-87
 Identities = 162/398 (40%), Positives = 222/398 (55%), Gaps = 16/398 (4%)
 Frame = +3

Query: 12   SNKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLY 191
            S K T  ER+   +IS LM CC  MLT+SYP +                  DGSLPHT+ 
Sbjct: 301  SFKGTSSERLPTATISTLMFCCCKMLTSSYPVQVTVPVRSMLALVERLLRVDGSLPHTML 360

Query: 192  PTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPEL 371
            P +T++QQE +C ELPV H+Y L++L  I+KG   QLLPH+A+++R+VT Y +RC  PEL
Sbjct: 361  PFMTSVQQELICSELPVLHAYCLELLIAIIKGMRRQLLPHSAYIVRVVTRYFKRCSLPEL 420

Query: 372  RVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGE-ARSNSEPV------- 527
            R+KLY++I+++L+SMGVG+ IYLA DV+ NAS DL+S+G    E + +N+ P        
Sbjct: 421  RIKLYSIIRMLLVSMGVGIAIYLAPDVIENASYDLNSLGGEDIETSPANTGPATGALPQL 480

Query: 528  -QKKRKHEMAVTSHENQSETIHTR------KNPVPVSLKIXXXXXXXXXXXXXXXXRSEG 686
              +KRKH     S E + +    +          P+++K+                +SE 
Sbjct: 481  SNRKRKHGAKTGSLEEKQDAPSPKVGESNTHQMTPITVKMAALDTLEVLLTVGAASKSES 540

Query: 687  WRSNVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVR 866
            WRS++D L+   A ++C  GW                WADFQ             P R R
Sbjct: 541  WRSSIDSLLMKTAINSCKRGWGNLESNIFLPHESASVWADFQFSSLRALLTSFLAPARTR 600

Query: 867  PPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKN 1046
            PPYL+ GLELFR+GKQ  G KLA+FCA+AL  LEVLIHPRA+PL DF S+   S +   N
Sbjct: 601  PPYLSQGLELFRRGKQEAGMKLAQFCAYALFALEVLIHPRALPLDDFYSACHNSTDGASN 660

Query: 1047 SFMDNNLYFGSQKHNPISAGTSKNGFE-NPESEEDDLY 1157
             F++ N+Y GSQK N       +   +   ES +DDLY
Sbjct: 661  RFLE-NIYSGSQKQNTSFLSAMRRTEQGGVESHDDDLY 697


>ref|XP_016728366.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            isoform X3 [Gossypium hirsutum]
          Length = 679

 Score =  290 bits (743), Expect = 5e-87
 Identities = 161/398 (40%), Positives = 222/398 (55%), Gaps = 16/398 (4%)
 Frame = +3

Query: 12   SNKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLY 191
            S K T  ER+   +IS LM CC  MLT+SYP +                  DGSLPHT+ 
Sbjct: 109  SFKGTSSERLPTATISTLMFCCCKMLTSSYPVQVTVPVRSMLALVERLLRVDGSLPHTML 168

Query: 192  PTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPEL 371
            P +T++QQE +C ELPV H+YSL++L  I+KG   QLLPH+A+++R+VT Y +RC  PEL
Sbjct: 169  PFMTSVQQELICSELPVLHAYSLELLIAIIKGMRRQLLPHSAYIVRVVTRYFKRCSLPEL 228

Query: 372  RVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGE-ARSNSEPV------- 527
            R+KLY++I+++L+SMGVG+ IYLA DV+ NAS DL+S+G    E + +N+ P        
Sbjct: 229  RIKLYSIIRMLLVSMGVGIAIYLAPDVIENASNDLNSLGGEDIETSPANTGPATGALPQL 288

Query: 528  -QKKRKHEMAVTSHENQSETIHTR------KNPVPVSLKIXXXXXXXXXXXXXXXXRSEG 686
              +KRKH     S E + +    +          P+++K+                +SE 
Sbjct: 289  SNRKRKHGAETGSLEEKRDAPSPKVGESNTHQMTPITVKMAALDTLEVLLTVGAASKSES 348

Query: 687  WRSNVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVR 866
            WRS++D L+   A ++C  GW                WADFQ             P R R
Sbjct: 349  WRSSIDSLLMKTAINSCKRGWGNLESNIFLPHESASVWADFQFSSLRALLTSLLAPARTR 408

Query: 867  PPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKN 1046
            PPYL+ GLELFR+GKQ  G KLA+FCA+AL  LEVLIHPRA+PL DF S+   S +   N
Sbjct: 409  PPYLSQGLELFRRGKQEAGMKLAQFCAYALFALEVLIHPRALPLDDFYSACHNSTDGASN 468

Query: 1047 SFMDNNLYFGSQKHNPISAGTSKNGFE-NPESEEDDLY 1157
             F++ N+Y GS+K N       +   +   ES + DLY
Sbjct: 469  RFLE-NIYSGSRKQNTSFLSDMRRTEQGGVESHDVDLY 505


>ref|XP_021296585.1| proline-, glutamic acid- and leucine-rich protein 1-like [Herrania
            umbratica]
          Length = 631

 Score =  289 bits (739), Expect = 6e-87
 Identities = 158/398 (39%), Positives = 219/398 (55%), Gaps = 16/398 (4%)
 Frame = +3

Query: 12   SNKATLQ-ERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTL 188
            S++AT   ER+   ++S L+ CC  MLT+SYP +                  DGSLPHT+
Sbjct: 53   SHEATRSSERLPASTVSTLIFCCCKMLTSSYPIQVTAPIRAMLALVERLLMVDGSLPHTM 112

Query: 189  YPTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPE 368
             P +TAMQ E +C ELPV H+Y+L++L  I+KG   QLLPHAA+++R+VT Y RRC  PE
Sbjct: 113  LPFMTAMQHELICSELPVFHAYALELLIAIIKGMRRQLLPHAAYVVRLVTRYFRRCALPE 172

Query: 369  LRVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEAR---------SNSE 521
            LR+KLY++ +++L+SMGVG+ IYLA DV+ NA  DL+S GD   E           ++ +
Sbjct: 173  LRIKLYSITRMLLISMGVGMAIYLAPDVIDNAFNDLNSFGDEDAETSPTNIGPSTGASPQ 232

Query: 522  PVQKKRKHEMAVTSHENQSETIHTR------KNPVPVSLKIXXXXXXXXXXXXXXXXRSE 683
            P  +KRKH     S E + +TI +           P+++KI                +SE
Sbjct: 233  PSNRKRKHGTKTGSVE-EKQTISSEVEAPNTHQTTPITVKIAALDTLEVLLTVGGASKSE 291

Query: 684  GWRSNVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRV 863
             W S +D L+   AT++C GGW                W DFQ             P R+
Sbjct: 292  SWCSRIDSLLIKTATNSCKGGWGNEENNIFLPHESTSIWVDFQLSTLRALLASFLAPARI 351

Query: 864  RPPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGK 1043
            RPP+L+ GLELFRKGKQ  G+KLA FCA AL+ LEVLIHPRA+PL DF SS +   +   
Sbjct: 352  RPPFLSQGLELFRKGKQEAGTKLAGFCASALLALEVLIHPRALPLDDFRSSYQTFTDGAS 411

Query: 1044 NSFMDNNLYFGSQKHNPISAGTSKNGFENPESEEDDLY 1157
            + F +N  ++G +     S           +S++DDLY
Sbjct: 412  HRFPENMPFYGQKGDTMFSKSMQGTEQSALKSDDDDLY 449


>ref|XP_016728365.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            isoform X2 [Gossypium hirsutum]
          Length = 842

 Score =  290 bits (743), Expect = 1e-85
 Identities = 161/398 (40%), Positives = 222/398 (55%), Gaps = 16/398 (4%)
 Frame = +3

Query: 12   SNKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLY 191
            S K T  ER+   +IS LM CC  MLT+SYP +                  DGSLPHT+ 
Sbjct: 301  SFKGTSSERLPTATISTLMFCCCKMLTSSYPVQVTVPVRSMLALVERLLRVDGSLPHTML 360

Query: 192  PTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPEL 371
            P +T++QQE +C ELPV H+YSL++L  I+KG   QLLPH+A+++R+VT Y +RC  PEL
Sbjct: 361  PFMTSVQQELICSELPVLHAYSLELLIAIIKGMRRQLLPHSAYIVRVVTRYFKRCSLPEL 420

Query: 372  RVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGE-ARSNSEPV------- 527
            R+KLY++I+++L+SMGVG+ IYLA DV+ NAS DL+S+G    E + +N+ P        
Sbjct: 421  RIKLYSIIRMLLVSMGVGIAIYLAPDVIENASNDLNSLGGEDIETSPANTGPATGALPQL 480

Query: 528  -QKKRKHEMAVTSHENQSETIHTR------KNPVPVSLKIXXXXXXXXXXXXXXXXRSEG 686
              +KRKH     S E + +    +          P+++K+                +SE 
Sbjct: 481  SNRKRKHGAETGSLEEKRDAPSPKVGESNTHQMTPITVKMAALDTLEVLLTVGAASKSES 540

Query: 687  WRSNVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVR 866
            WRS++D L+   A ++C  GW                WADFQ             P R R
Sbjct: 541  WRSSIDSLLMKTAINSCKRGWGNLESNIFLPHESASVWADFQFSSLRALLTSLLAPARTR 600

Query: 867  PPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKN 1046
            PPYL+ GLELFR+GKQ  G KLA+FCA+AL  LEVLIHPRA+PL DF S+   S +   N
Sbjct: 601  PPYLSQGLELFRRGKQEAGMKLAQFCAYALFALEVLIHPRALPLDDFYSACHNSTDGASN 660

Query: 1047 SFMDNNLYFGSQKHNPISAGTSKNGFE-NPESEEDDLY 1157
             F++ N+Y GS+K N       +   +   ES + DLY
Sbjct: 661  RFLE-NIYSGSRKQNTSFLSDMRRTEQGGVESHDVDLY 697


>ref|XP_016728363.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            isoform X1 [Gossypium hirsutum]
 ref|XP_016728364.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            isoform X1 [Gossypium hirsutum]
          Length = 871

 Score =  290 bits (743), Expect = 2e-85
 Identities = 161/398 (40%), Positives = 222/398 (55%), Gaps = 16/398 (4%)
 Frame = +3

Query: 12   SNKATLQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLY 191
            S K T  ER+   +IS LM CC  MLT+SYP +                  DGSLPHT+ 
Sbjct: 301  SFKGTSSERLPTATISTLMFCCCKMLTSSYPVQVTVPVRSMLALVERLLRVDGSLPHTML 360

Query: 192  PTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPEL 371
            P +T++QQE +C ELPV H+YSL++L  I+KG   QLLPH+A+++R+VT Y +RC  PEL
Sbjct: 361  PFMTSVQQELICSELPVLHAYSLELLIAIIKGMRRQLLPHSAYIVRVVTRYFKRCSLPEL 420

Query: 372  RVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGE-ARSNSEPV------- 527
            R+KLY++I+++L+SMGVG+ IYLA DV+ NAS DL+S+G    E + +N+ P        
Sbjct: 421  RIKLYSIIRMLLVSMGVGIAIYLAPDVIENASNDLNSLGGEDIETSPANTGPATGALPQL 480

Query: 528  -QKKRKHEMAVTSHENQSETIHTR------KNPVPVSLKIXXXXXXXXXXXXXXXXRSEG 686
              +KRKH     S E + +    +          P+++K+                +SE 
Sbjct: 481  SNRKRKHGAETGSLEEKRDAPSPKVGESNTHQMTPITVKMAALDTLEVLLTVGAASKSES 540

Query: 687  WRSNVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVR 866
            WRS++D L+   A ++C  GW                WADFQ             P R R
Sbjct: 541  WRSSIDSLLMKTAINSCKRGWGNLESNIFLPHESASVWADFQFSSLRALLTSLLAPARTR 600

Query: 867  PPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKN 1046
            PPYL+ GLELFR+GKQ  G KLA+FCA+AL  LEVLIHPRA+PL DF S+   S +   N
Sbjct: 601  PPYLSQGLELFRRGKQEAGMKLAQFCAYALFALEVLIHPRALPLDDFYSACHNSTDGASN 660

Query: 1047 SFMDNNLYFGSQKHNPISAGTSKNGFE-NPESEEDDLY 1157
             F++ N+Y GS+K N       +   +   ES + DLY
Sbjct: 661  RFLE-NIYSGSRKQNTSFLSDMRRTEQGGVESHDVDLY 697


>gb|EOY19217.1| Uncharacterized protein TCM_044175 isoform 2 [Theobroma cacao]
          Length = 803

 Score =  288 bits (738), Expect = 3e-85
 Identities = 158/398 (39%), Positives = 220/398 (55%), Gaps = 16/398 (4%)
 Frame = +3

Query: 12   SNKATLQ-ERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTL 188
            S++AT   ER+   ++S L+ CC  MLT+SYP +                  DGSLPHT+
Sbjct: 229  SHEATRSSERLPASTVSTLIFCCCKMLTSSYPIQVTAPIRAMLALVERLLMVDGSLPHTM 288

Query: 189  YPTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPE 368
             P +TAMQ E +C ELPV H+++L++L  I+KG   QLLPHAA+++R+VT Y RRC  PE
Sbjct: 289  LPFMTAMQHELICSELPVLHAHALELLIAIIKGMRRQLLPHAAYVVRLVTRYFRRCALPE 348

Query: 369  LRVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARSNS---------E 521
            LR+KLY++ +++L+SMGVG+ IYLA DV+ NA  DL+S GD   E    +         +
Sbjct: 349  LRIKLYSITRMLLISMGVGMAIYLAPDVIDNAINDLNSFGDEDVETSPTNIGPSTGALPQ 408

Query: 522  PVQKKRKHEMAVTSHENQSETIHTRKNPV------PVSLKIXXXXXXXXXXXXXXXXRSE 683
            P  +KRKH     S E + +TI +   P+      P+++KI                +SE
Sbjct: 409  PSNRKRKHGTKTGSPE-EKQTISSEVEPLNPHQTTPITVKIAALDTLEVLLTVGGASKSE 467

Query: 684  GWRSNVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRV 863
             WRS +D L+   AT++C  GW                W DFQ             P R+
Sbjct: 468  SWRSRIDSLLIKTATNSCKRGWGNEENNNFLPHESTSIWVDFQLSSLRALLASFLAPARI 527

Query: 864  RPPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGK 1043
            RPP+L+ GLELFRKGKQ  G+KLA FCA AL+ LEVLIHPRA+PL DF SS +   +   
Sbjct: 528  RPPFLSQGLELFRKGKQEAGTKLAGFCASALLALEVLIHPRALPLDDFPSSYQTFTDGAS 587

Query: 1044 NSFMDNNLYFGSQKHNPISAGTSKNGFENPESEEDDLY 1157
            + F +N  ++G +     S           +S++DDLY
Sbjct: 588  HRFPENMPFYGQKGDTMFSKSMQGAEQSALKSDDDDLY 625


>ref|XP_016649496.1| PREDICTED: uncharacterized protein LOC103327264 isoform X2 [Prunus
            mume]
          Length = 741

 Score =  287 bits (734), Expect = 4e-85
 Identities = 163/394 (41%), Positives = 226/394 (57%), Gaps = 19/394 (4%)
 Frame = +3

Query: 33   ERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLYPTITAMQ 212
            ER+ MPS+SALM+CCSTMLTTSYP +                  DGSLPH+L   +TAMQ
Sbjct: 173  ERLPMPSVSALMVCCSTMLTTSYPVQVTVPIRSFLALIERVLIVDGSLPHSLLAFMTAMQ 232

Query: 213  QEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPELRVKLYAL 392
            QE++C ELP+ HSYSL++L  I++G  SQLLPHAA+++R+++ YL+RC  PELR+K+Y++
Sbjct: 233  QEFICSELPLLHSYSLELLTAIIEGVRSQLLPHAAYLVRLLSVYLKRCALPELRIKVYSI 292

Query: 393  IKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARS--NSEP-----------VQK 533
             +++L+SMGVG+ + LA++VV++A IDL+ + +  G A S  NS+P             +
Sbjct: 293  TRILLISMGVGMAVCLAQEVVNSAFIDLNPIANESGGASSSGNSKPSTEALVQTPQHSHR 352

Query: 534  KRKHEMAVTS---HENQSETIHTRKNPV--PVSLKIXXXXXXXXXXXXXXXXRSEGWRSN 698
            KRKH  +  S   H        T KN    P+++KI                +SEGWRS+
Sbjct: 353  KRKHGASSGSLEWHNTSRLEGGTPKNHTTSPIAVKIAALEALEALLTVGGALKSEGWRSD 412

Query: 699  VDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVRPPYL 878
            VDLL+  +AT++  G W                    Q                VRPPYL
Sbjct: 413  VDLLLINIATNSLKGAWGGENGNIYQLNEPGDIGGGMQLAALRALLASFLSSSCVRPPYL 472

Query: 879  AHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKNSFMD 1058
            A GL+LFR+GKQ  G+KLAEFCAHAL+ LEVLIHPRA+PL DF  +    P+D  +  + 
Sbjct: 473  AEGLDLFRRGKQETGTKLAEFCAHALLALEVLIHPRALPLADFTDAT--LPSDRVHYKLP 530

Query: 1059 NNLYFGS-QKHNPISAGTSKNGFENPESEEDDLY 1157
             N+Y GS +   P S        +  +S+ DDLY
Sbjct: 531  ENMYSGSLRPRTPFSGDIQGMMHDAADSDHDDLY 564


>gb|EOY19216.1| Uncharacterized protein TCM_044175 isoform 1 [Theobroma cacao]
          Length = 813

 Score =  288 bits (738), Expect = 4e-85
 Identities = 158/398 (39%), Positives = 220/398 (55%), Gaps = 16/398 (4%)
 Frame = +3

Query: 12   SNKATLQ-ERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTL 188
            S++AT   ER+   ++S L+ CC  MLT+SYP +                  DGSLPHT+
Sbjct: 239  SHEATRSSERLPASTVSTLIFCCCKMLTSSYPIQVTAPIRAMLALVERLLMVDGSLPHTM 298

Query: 189  YPTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPE 368
             P +TAMQ E +C ELPV H+++L++L  I+KG   QLLPHAA+++R+VT Y RRC  PE
Sbjct: 299  LPFMTAMQHELICSELPVLHAHALELLIAIIKGMRRQLLPHAAYVVRLVTRYFRRCALPE 358

Query: 369  LRVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARSNS---------E 521
            LR+KLY++ +++L+SMGVG+ IYLA DV+ NA  DL+S GD   E    +         +
Sbjct: 359  LRIKLYSITRMLLISMGVGMAIYLAPDVIDNAINDLNSFGDEDVETSPTNIGPSTGALPQ 418

Query: 522  PVQKKRKHEMAVTSHENQSETIHTRKNPV------PVSLKIXXXXXXXXXXXXXXXXRSE 683
            P  +KRKH     S E + +TI +   P+      P+++KI                +SE
Sbjct: 419  PSNRKRKHGTKTGSPE-EKQTISSEVEPLNPHQTTPITVKIAALDTLEVLLTVGGASKSE 477

Query: 684  GWRSNVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRV 863
             WRS +D L+   AT++C  GW                W DFQ             P R+
Sbjct: 478  SWRSRIDSLLIKTATNSCKRGWGNEENNNFLPHESTSIWVDFQLSSLRALLASFLAPARI 537

Query: 864  RPPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGK 1043
            RPP+L+ GLELFRKGKQ  G+KLA FCA AL+ LEVLIHPRA+PL DF SS +   +   
Sbjct: 538  RPPFLSQGLELFRKGKQEAGTKLAGFCASALLALEVLIHPRALPLDDFPSSYQTFTDGAS 597

Query: 1044 NSFMDNNLYFGSQKHNPISAGTSKNGFENPESEEDDLY 1157
            + F +N  ++G +     S           +S++DDLY
Sbjct: 598  HRFPENMPFYGQKGDTMFSKSMQGAEQSALKSDDDDLY 635


>ref|XP_017235696.1| PREDICTED: uncharacterized protein LOC108209354 [Daucus carota subsp.
            sativus]
 gb|KZN05502.1| hypothetical protein DCAR_006339 [Daucus carota subsp. sativus]
          Length = 875

 Score =  288 bits (738), Expect = 1e-84
 Identities = 161/388 (41%), Positives = 211/388 (54%), Gaps = 13/388 (3%)
 Frame = +3

Query: 33   ERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLYPTITAMQ 212
            E + M + S LM CCSTML+ SYP +                  DGS+   +YP +T M+
Sbjct: 309  ELLLMCNSSTLMNCCSTMLSCSYPVQVSVPIQSLVMLIRRVLMLDGSVSQKMYPLMTTMK 368

Query: 213  QEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPELRVKLYAL 392
            QE +C ELP  H  SL++L GIVKG  SQLLPH A +  ++ EY R+C  PELR+ +Y++
Sbjct: 369  QELICSELPGLHLRSLELLAGIVKGVRSQLLPHVADIAVLIAEYFRKCALPELRIMVYSI 428

Query: 393  IKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARSNS--------EPVQKKRKHE 548
            I+++L SMGVG+++YL ++V+SN  +DLD      G    NS        +P QKKRKH 
Sbjct: 429  IRILLRSMGVGMSLYLTQEVISNTFVDLDYSSSASGNLNLNSNVFNEPVQQPPQKKRKHA 488

Query: 549  MAVTSHENQSETIHTR----KNPVPVSLKIXXXXXXXXXXXXXXXXRSEGWRSNVDLLIK 716
                S++ QS+ + T     K   P+S+KI                RS+ WR +VD L+ 
Sbjct: 489  STTGSNDEQSDRMGTEMTAPKTKTPISVKIAALHTLEALLTVGGAIRSDSWRPDVDRLLV 548

Query: 717  TVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVRPPYLAHGLEL 896
              A DAC GGW K              WA+FQ             P R RPP L+ GLEL
Sbjct: 549  ATALDACKGGWAKEEKNIFLQTGRTDPWAEFQLAALKAFLASLISPSRFRPPSLSQGLEL 608

Query: 897  FRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKNSFMDNNLYFG 1076
            FRKG Q  G+KL+EFC+HALM LEVLIHPRA+ LID  S      +  + S     LY G
Sbjct: 609  FRKGAQETGTKLSEFCSHALMVLEVLIHPRALSLIDSASDANVVGSMPRTS---GRLYSG 665

Query: 1077 SQKHN-PISAGTSKNGFENPESEEDDLY 1157
             Q  N   S GT   G ++PES+EDDLY
Sbjct: 666  KQGVNTSYSGGTFGKGDDDPESDEDDLY 693


>ref|XP_017984990.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1
            isoform X2 [Theobroma cacao]
          Length = 754

 Score =  286 bits (731), Expect = 1e-84
 Identities = 157/398 (39%), Positives = 219/398 (55%), Gaps = 16/398 (4%)
 Frame = +3

Query: 12   SNKATLQ-ERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTL 188
            S++AT   E +   ++S L+ CC  MLT+SYP +                  DGSLPHT+
Sbjct: 180  SHEATRSSESLPASTVSTLIFCCCKMLTSSYPIQVTAPIRAMLALVERLLMVDGSLPHTM 239

Query: 189  YPTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPE 368
             P +TAMQ E +C ELPV H+++L++L  I+KG   QLLPHAA+++R+VT Y RRC  PE
Sbjct: 240  LPFMTAMQHELICSELPVLHAHALELLIAIIKGMRRQLLPHAAYVVRLVTRYFRRCALPE 299

Query: 369  LRVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARSNS---------E 521
            LR+KLY++ +++L+SMGVG+ IYLA DV+ NA  DL+S GD   E    +         +
Sbjct: 300  LRIKLYSIPRMLLISMGVGMAIYLAPDVIDNAINDLNSFGDEDVETSPTNIGPSTGALPQ 359

Query: 522  PVQKKRKHEMAVTSHE------NQSETIHTRKNPVPVSLKIXXXXXXXXXXXXXXXXRSE 683
            P  +KRKH     S E      ++ E ++T +   P++LKI                +SE
Sbjct: 360  PSNRKRKHGTKTGSPEEKQTISSEVEALNTHQT-TPITLKIAALDTLEVLLTVGGASKSE 418

Query: 684  GWRSNVDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRV 863
             WRS +D L+   AT++C  GW                W DFQ             P R+
Sbjct: 419  SWRSRIDSLLIKTATNSCKRGWGNEENNNFLPHESTSIWVDFQLSSLRALLASLLAPARI 478

Query: 864  RPPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGK 1043
            RPP+L+ GLELFRKGKQ  G+KLA FCA AL+ LEVLIHPRA+PL DF SS +   +   
Sbjct: 479  RPPFLSQGLELFRKGKQEAGTKLAGFCASALLALEVLIHPRALPLDDFPSSYQTFTDGAS 538

Query: 1044 NSFMDNNLYFGSQKHNPISAGTSKNGFENPESEEDDLY 1157
            + F +N  ++G +     S           +S++DDLY
Sbjct: 539  HRFPENMPFYGQKGDTMFSKSMQGTEQSALKSDDDDLY 576


>ref|XP_021812691.1| uncharacterized protein LOC110755738 isoform X2 [Prunus avium]
          Length = 738

 Score =  285 bits (728), Expect = 3e-84
 Identities = 164/394 (41%), Positives = 223/394 (56%), Gaps = 19/394 (4%)
 Frame = +3

Query: 33   ERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPHTLYPTITAMQ 212
            ER+ MPS+SALM+CCSTMLTTSYP +                  DGSLPH+L   +TAMQ
Sbjct: 173  ERLPMPSVSALMVCCSTMLTTSYPVQVTVPIRSFLALIERVLIVDGSLPHSLLAFMTAMQ 232

Query: 213  QEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQPELRVKLYAL 392
            QE++C ELP+ HSYSL++L  I++G  SQLLPHAA+++R+++ YL+RC  PELR+K+Y++
Sbjct: 233  QEFICSELPLLHSYSLELLTAIIEGVRSQLLPHAAYLVRLLSVYLKRCALPELRIKVYSI 292

Query: 393  IKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARS--NSEP-----------VQK 533
             +++L+SMGVG+ + LA++VV++A IDL+ +    G A S  NS+P             +
Sbjct: 293  TRILLISMGVGMAVCLAQEVVNSAFIDLNPIAKESGGASSGGNSKPSAEALLQTPHHSHR 352

Query: 534  KRKHEMAVTS---HENQSETIHTRKNPV--PVSLKIXXXXXXXXXXXXXXXXRSEGWRSN 698
            KRKH  +  S   H        T KN    P++LKI                +SEGWRS+
Sbjct: 353  KRKHGASSGSLEWHNTSRLEEGTPKNHTTSPIALKIAALEALEALLTVGGALKSEGWRSD 412

Query: 699  VDLLIKTVATDACNGGWTKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXXPGRVRPPYL 878
            VDLL+  +AT++  G W                    Q                VRPPYL
Sbjct: 413  VDLLLINIATNSLKGAWGGENGNIYQLNEPGDIGGGMQLAALRALLASFLSSSCVRPPYL 472

Query: 879  AHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSPNDGKNSFMD 1058
            A GL+LFR+GKQ  G+KLAEFCAHAL+ LEVLIHPRA+PL DF  +   S  D     + 
Sbjct: 473  AEGLDLFRRGKQETGTKLAEFCAHALLALEVLIHPRALPLADFTDTTLLS--DRVRYKLP 530

Query: 1059 NNLYFGS-QKHNPISAGTSKNGFENPESEEDDLY 1157
             N+Y GS +   P S        +  +S+ DDLY
Sbjct: 531  ENMYSGSLRPSTPFSGDIQGMMHDAADSDHDDLY 564


>ref|XP_010242434.1| PREDICTED: proline-, glutamic acid- and leucine-rich protein 1-like
            isoform X5 [Nelumbo nucifera]
          Length = 879

 Score =  287 bits (735), Expect = 4e-84
 Identities = 168/403 (41%), Positives = 223/403 (55%), Gaps = 19/403 (4%)
 Frame = +3

Query: 6    DISNKAT-LQERVSMPSISALMLCCSTMLTTSYPAKAKXXXXXXXXXXXXXXXXDGSLPH 182
            + SN+AT + E++ +  IS LMLCC  MLT  YPA+                  DGSL  
Sbjct: 282  ETSNQATEMSEQLILHRISMLMLCCCRMLTNPYPAQVIVPVRPLLVLVGRVLMVDGSLSQ 341

Query: 183  TLYPTITAMQQEYVCLELPVQHSYSLDILCGIVKGAHSQLLPHAAHMIRIVTEYLRRCEQ 362
            +L P +T MQ+E++C ELP+ H   LD+L GI+K   SQLLPHAA ++R++TEY RRC  
Sbjct: 342  SLLPFLTVMQREFICSELPLLHLCGLDLLTGIIKRVRSQLLPHAADVVRLLTEYFRRCAL 401

Query: 363  PELRVKLYALIKLMLLSMGVGLTIYLAEDVVSNASIDLDSVGDRGGEARSN------SE- 521
            P LRVK+Y++++++L+SMGVG+  YLA++VVSNA +DLDS+    GEA S       SE 
Sbjct: 402  PALRVKVYSILRILLISMGVGMAQYLAQEVVSNALVDLDSIAHGCGEASSTPCSKAASEG 461

Query: 522  ---PVQKKRKHEMAVTSHENQSETIHTRKNPV------PVSLKIXXXXXXXXXXXXXXXX 674
               P  +KRKH       E Q   + T    V      P++++                 
Sbjct: 462  LLLPSYRKRKHGTITGFSEEQQGGVGTEMEAVKGKPITPIAVQTAALQALEALLTVGGAL 521

Query: 675  RSEGWRSNVDLLIKTVATDACNGGW-TKPXXXXXXXXXXXXXWADFQXXXXXXXXXXXXX 851
            RSE WR NVDLL+ TVAT+A NGGW  +                DFQ             
Sbjct: 522  RSECWRQNVDLLLITVATNASNGGWANEEKDIFLLSDEPTSTRTDFQLAALRALLASLLS 581

Query: 852  PGRVRPPYLAHGLELFRKGKQGVGSKLAEFCAHALMTLEVLIHPRAIPLIDFGSSIEYSP 1031
            P RVRPPYL+ GLELFR+GKQ  G+K+AEFCAHAL+ LEVL+HPRA+PL++F S      
Sbjct: 582  PARVRPPYLSQGLELFRRGKQETGTKVAEFCAHALLALEVLMHPRALPLVNFPSGDHPDF 641

Query: 1032 NDGKNSFMDNNLYFGSQKHN-PISAGTSKNGFENPESEEDDLY 1157
              G N     N++    K+N P   G        PES +D+LY
Sbjct: 642  GQGFNCKFPKNIFSSGLKNNSPFPRGILGKDEIEPESNDDELY 684


Top