BLASTX nr result

ID: Chrysanthemum21_contig00009157 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00009157
         (737 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022030440.1| uncharacterized protein LOC110931349 [Helian...   102   6e-23
gb|PLY83630.1| hypothetical protein LSAT_4X27020 [Lactuca sativa]     105   3e-22
gb|PLY68418.1| hypothetical protein LSAT_8X18440 [Lactuca sativa]     105   3e-22
gb|PLY80470.1| hypothetical protein LSAT_2X65260 [Lactuca sativa]     105   3e-22
gb|PLY92339.1| hypothetical protein LSAT_9X109960 [Lactuca sativa]    105   3e-22
ref|XP_021995915.1| uncharacterized protein LOC110893102 [Helian...   104   1e-21
gb|PLY78695.1| hypothetical protein LSAT_9X46640 [Lactuca sativa]     103   1e-21
gb|PLY87860.1| hypothetical protein LSAT_3X34060 [Lactuca sativa]     103   2e-21
gb|OTG28593.1| putative ulp1 protease family, C-terminal catalyt...   102   3e-21
gb|OTG09329.1| hypothetical protein HannXRQ_Chr11g0351751 [Helia...   102   3e-21
ref|XP_021721306.1| uncharacterized protein LOC110688857 [Chenop...   102   4e-21
gb|PLY79665.1| hypothetical protein LSAT_5X126901 [Lactuca sativa]    101   7e-21
ref|XP_022021127.1| uncharacterized protein LOC110921160 [Helian...    97   1e-20
ref|XP_022014209.1| uncharacterized protein LOC110913694 [Helian...    95   3e-20
ref|XP_021987267.1| uncharacterized protein LOC110883914 [Helian...    94   6e-20
ref|XP_021989724.1| uncharacterized protein LOC110886255 [Helian...    93   4e-18
gb|OTG18478.1| putative ulp1 protease family, C-terminal catalyt...    94   5e-18
ref|XP_021977388.1| uncharacterized protein LOC110872817 isoform...    94   5e-18
ref|XP_021977387.1| uncharacterized protein LOC110872817 isoform...    94   5e-18
ref|XP_021977386.1| uncharacterized protein LOC110872817 isoform...    94   5e-18

>ref|XP_022030440.1| uncharacterized protein LOC110931349 [Helianthus annuus]
          Length = 211

 Score =  102 bits (254), Expect = 6e-23
 Identities = 58/136 (42%), Positives = 73/136 (53%), Gaps = 6/136 (4%)
 Frame = +1

Query: 241 DNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILKAKPRIPKLKWRTKKNFRDC 420
           DNS    T  +KY    E  +   + +LK  GH+K  +I    P   ++KWRT+ N  DC
Sbjct: 47  DNSAKKETNKAKYGDVPENTRNALAAYLKSVGHEKADDIKGITPVRMQMKWRTQYNGIDC 106

Query: 421 GIFTMVHMECYNGEPAMKWDCGLVAESGLPT------DKLRQCDMLRRLRFKFATKILLH 582
           GIFTM HMECY+GEP  KWDCG   E   P          +Q   L  LR KF  KILLH
Sbjct: 107 GIFTMRHMECYHGEPVEKWDCGFNVEYEEPARGKNKKPVNKQTAQLEHLRRKFIAKILLH 166

Query: 583 EINKHAGKLFDLAKEF 630
           EINK    + + +K+F
Sbjct: 167 EINKQRDYVIEDSKKF 182


>gb|PLY83630.1| hypothetical protein LSAT_4X27020 [Lactuca sativa]
          Length = 799

 Score =  105 bits (263), Expect = 3e-22
 Identities = 75/238 (31%), Positives = 115/238 (48%), Gaps = 6/238 (2%)
 Frame = +1

Query: 1    SPSRHFFPTGCIIKSMFDGTLASEDKKFESFENEILAQFADNVAG----LALDGIDLAFF 168
            SP R F   G  + + +  +  S+++K+E F+      F D+  G    L +  ID+ FF
Sbjct: 571  SPYRLFLKVG--VSTAYLTSTLSDERKYEKFKEN----FHDSTNGYKKILNIKDIDMVFF 624

Query: 169  PICNAGHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKH 348
            P+  + H +               DNS     Y+ KY    + LK LF R+ K   H + 
Sbjct: 625  PVVKSAHIFVIVFNLKKPSIEIL-DNSAVEGDYEGKYGVIMKPLKNLFVRYFKEINHPRA 683

Query: 349  ANILKA--KPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKL 522
              I K   KP+  ++ WRT KN  DCG+F M HME Y G+P  KW  GL  ES +     
Sbjct: 684  NAISKESIKPQRLEMSWRTVKNKVDCGVFAMRHMETYMGQPLSKWKPGLHKESAV----- 738

Query: 523  RQCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEKMNIIVEAIRNRETR 696
             Q   L +LR ++A  +L  EIN    K+ DLA+++ KV  + + +   +A++  + R
Sbjct: 739  -QQTTLEKLRQRYAHIMLTSEINMLKAKVLDLAEKYQKVEFKVRTDHAYKAMQTIQKR 795


>gb|PLY68418.1| hypothetical protein LSAT_8X18440 [Lactuca sativa]
          Length = 964

 Score =  105 bits (263), Expect = 3e-22
 Identities = 75/238 (31%), Positives = 115/238 (48%), Gaps = 6/238 (2%)
 Frame = +1

Query: 1    SPSRHFFPTGCIIKSMFDGTLASEDKKFESFENEILAQFADNVAG----LALDGIDLAFF 168
            SP R F   G  + + +  +  S+++K+E F+      F D+  G    L +  ID+ FF
Sbjct: 736  SPYRLFLKVG--VSTAYLTSTLSDERKYEKFKEN----FHDSTNGYKKILNIKDIDMVFF 789

Query: 169  PICNAGHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKH 348
            P+  + H +               DNS     Y+ KY    + LK LF R+ K   H + 
Sbjct: 790  PVVKSAHIFVIVFNLKKPSIEIL-DNSAVEGDYEGKYGVIMKPLKNLFVRYFKEINHPRA 848

Query: 349  ANILKA--KPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKL 522
              I K   KP+  ++ WRT KN  DCG+F M HME Y G+P  KW  GL  ES +     
Sbjct: 849  NAISKESIKPQRLEMSWRTVKNKVDCGVFAMRHMETYMGQPLSKWKPGLHKESAV----- 903

Query: 523  RQCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEKMNIIVEAIRNRETR 696
             Q   L +LR ++A  +L  EIN    K+ DLA+++ KV  + + +   +A++  + R
Sbjct: 904  -QQTTLEKLRQRYAHIMLTSEINMLKAKVLDLAEKYQKVEFKVRTDHAYKAMQTIQKR 960


>gb|PLY80470.1| hypothetical protein LSAT_2X65260 [Lactuca sativa]
          Length = 995

 Score =  105 bits (263), Expect = 3e-22
 Identities = 75/238 (31%), Positives = 115/238 (48%), Gaps = 6/238 (2%)
 Frame = +1

Query: 1    SPSRHFFPTGCIIKSMFDGTLASEDKKFESFENEILAQFADNVAG----LALDGIDLAFF 168
            SP R F   G  + + +  +  S+++K+E F+      F D+  G    L +  ID+ FF
Sbjct: 767  SPYRLFLKVG--VSTAYLTSTLSDERKYEKFKEN----FHDSTNGYKKILNIKDIDMVFF 820

Query: 169  PICNAGHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKH 348
            P+  + H +               DNS     Y+ KY    + LK LF R+ K   H + 
Sbjct: 821  PVVRSAHIFVIVFNLKKPSIEIL-DNSAVEGDYEGKYGVIMKPLKNLFVRYFKEINHPRA 879

Query: 349  ANILKA--KPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKL 522
              I K   KP+  ++ WRT KN  DCG+F M HME Y G+P  KW  GL  ES +     
Sbjct: 880  NAISKESIKPQRLEMSWRTVKNKVDCGVFAMRHMETYMGQPLSKWKPGLHKESAV----- 934

Query: 523  RQCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEKMNIIVEAIRNRETR 696
             Q   L +LR ++A  +L  EIN    K+ DLA+++ KV  + + +   +A++  + R
Sbjct: 935  -QQTTLEKLRQRYAHIMLTSEINMLKAKVLDLAEKYQKVEFKVRTDHAYKAMQTIQKR 991


>gb|PLY92339.1| hypothetical protein LSAT_9X109960 [Lactuca sativa]
          Length = 1076

 Score =  105 bits (263), Expect = 3e-22
 Identities = 75/238 (31%), Positives = 115/238 (48%), Gaps = 6/238 (2%)
 Frame = +1

Query: 1    SPSRHFFPTGCIIKSMFDGTLASEDKKFESFENEILAQFADNVAG----LALDGIDLAFF 168
            SP R F   G  + + +  +  S+++K+E F+      F D+  G    L +  ID+ FF
Sbjct: 848  SPYRLFLKVG--VSTAYLTSTLSDERKYEKFKEN----FHDSTNGYKKILNIKDIDMVFF 901

Query: 169  PICNAGHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKH 348
            P+  + H +               DNS     Y+ KY    + LK LF R+ K   H + 
Sbjct: 902  PVVKSAHIFVIVFNLKKPSIEIL-DNSAVEGDYEGKYGVIMKPLKNLFVRYFKEINHPRA 960

Query: 349  ANILKA--KPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKL 522
              I K   KP+  ++ WRT KN  DCG+F M HME Y G+P  KW  GL  ES +     
Sbjct: 961  NAISKESIKPQRLEMSWRTVKNKVDCGVFAMRHMETYMGQPLSKWKPGLHKESAV----- 1015

Query: 523  RQCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEKMNIIVEAIRNRETR 696
             Q   L +LR ++A  +L  EIN    K+ DLA+++ KV  + + +   +A++  + R
Sbjct: 1016 -QQTTLEKLRQRYAHIMLTSEINMLKAKVLDLAEKYQKVEFKVRTDHAYKAMQTIQKR 1072


>ref|XP_021995915.1| uncharacterized protein LOC110893102 [Helianthus annuus]
          Length = 1181

 Score =  104 bits (259), Expect = 1e-21
 Identities = 67/194 (34%), Positives = 92/194 (47%), Gaps = 6/194 (3%)
 Frame = +1

Query: 67   SEDKKFESFENEILAQFADNVAGLALDGIDLAFFPICNAGHFYXXXXXXXXXXXXXXXDN 246
            ++DK  E+F   I  +        ++    +   PI +  HF+               DN
Sbjct: 961  NKDKMIEAFTMNI-EKILQGAKLKSIKDFKIILVPILHIQHFFVISFNLEEKQIFII-DN 1018

Query: 247  SDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILKAKPRIPKLKWRTKKNFRDCGI 426
            S    T ++KY    E  +   + +LK  GH K  +I K  P   ++KWRT+ N  DCGI
Sbjct: 1019 SAKEETNEAKYGDVPENTRNALAAYLKSVGHVKADDIEKITPVRMQMKWRTQYNGIDCGI 1078

Query: 427  FTMVHMECYNGEPAMKWDCGLVAESGLPT------DKLRQCDMLRRLRFKFATKILLHEI 588
            FTM HMECY+GEP  KWDCG   E   P          +Q   L  LR KF  KILLHEI
Sbjct: 1079 FTMRHMECYHGEPVEKWDCGFNVEYEEPARGKNKKQVNKQTAQLEDLRRKFIAKILLHEI 1138

Query: 589  NKHAGKLFDLAKEF 630
            N+    + + +K+F
Sbjct: 1139 NEQRDYVIEDSKKF 1152


>gb|PLY78695.1| hypothetical protein LSAT_9X46640 [Lactuca sativa]
          Length = 1028

 Score =  103 bits (258), Expect = 1e-21
 Identities = 74/237 (31%), Positives = 114/237 (48%), Gaps = 6/237 (2%)
 Frame = +1

Query: 4    PSRHFFPTGCIIKSMFDGTLASEDKKFESFENEILAQFADNVAG----LALDGIDLAFFP 171
            P R F   G  + + +  +  S+++K+E F+      F D+  G    L +  ID+ FFP
Sbjct: 801  PYRLFLKVG--VSTAYLTSTLSDERKYEKFKEN----FHDSTNGYKKILNIKDIDMVFFP 854

Query: 172  ICNAGHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHA 351
            +  + H +               DNS     Y+ KY    + LK LF R+ K   H +  
Sbjct: 855  VVRSAHIFVIVFNLKKPSIEIL-DNSAVEGDYEGKYGVIMKPLKNLFVRYFKEINHPRAN 913

Query: 352  NILKA--KPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKLR 525
             I K   KP+  ++ WRT KN  DCG+F M HME Y G+P  KW  GL  ES +      
Sbjct: 914  AISKENIKPQRLEMSWRTIKNKVDCGVFAMRHMETYMGQPLSKWKPGLHKESAV------ 967

Query: 526  QCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEKMNIIVEAIRNRETR 696
            Q   L +LR ++A  +L  EIN    K+ DLA+++ KV  + + +   +A++  + R
Sbjct: 968  QQTTLEKLRQRYAHIMLTSEINMLKAKVLDLAEKYQKVEFKVRTDHAYKAMQTIQKR 1024


>gb|PLY87860.1| hypothetical protein LSAT_3X34060 [Lactuca sativa]
          Length = 829

 Score =  103 bits (256), Expect = 2e-21
 Identities = 71/225 (31%), Positives = 110/225 (48%), Gaps = 6/225 (2%)
 Frame = +1

Query: 40   KSMFDGTLASEDKKFESFENEILAQFADNVAG----LALDGIDLAFFPICNAGHFYXXXX 207
            K+ +  +  S+++K+E F+      F D+  G    L +  ID+ FFP+  + H +    
Sbjct: 612  KTAYLTSTLSDERKYEKFKEN----FHDSTNGYKKILNIKDIDMVFFPVVKSAHIFVIVF 667

Query: 208  XXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILKA--KPRIP 381
                       DNS     Y+ KY    + LK LF R+ K   H +   I K   KP+  
Sbjct: 668  NLKKPSIEIL-DNSAVEGDYEGKYGVIMKPLKNLFVRYFKEINHPRANAISKESIKPQRL 726

Query: 382  KLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKLRQCDMLRRLRFKF 561
            ++ WRT KN  DCG+F M HME Y G+P  KW  GL  ES +      Q   L +LR ++
Sbjct: 727  EMSWRTVKNKVDCGVFAMRHMETYMGQPLSKWKPGLHKESAV------QQTTLEKLRQRY 780

Query: 562  ATKILLHEINKHAGKLFDLAKEFDKVPPQEKMNIIVEAIRNRETR 696
            A  +L  EIN    K+ DLA+++ KV  + + +   +A++  + R
Sbjct: 781  AHIMLTSEINMLKAKVLDLAEKYQKVEFKVRTDHAYKAMQTIQKR 825


>gb|OTG28593.1| putative ulp1 protease family, C-terminal catalytic domain-containing
            protein [Helianthus annuus]
          Length = 1090

 Score =  102 bits (255), Expect = 3e-21
 Identities = 69/216 (31%), Positives = 100/216 (46%), Gaps = 16/216 (7%)
 Frame = +1

Query: 31   CIIKSMFDGTLASEDKKFESFENEILAQFADNVAGL-------ALDGIDLAFFPICNAGH 189
            C I+ +    L + DK+    + +++  F  N+  +       ++    +   PI +  H
Sbjct: 850  CTIRML---ALPNCDKEESENKGKMIKAFTMNIEKILKGAKLKSIKDFKIILVPILHIQH 906

Query: 190  FYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILKAK 369
            F+               DNS    T +SKY    E  +   + +LK  GH+K  +I    
Sbjct: 907  FFVISFNLEEKQIFII-DNSAKEETNESKYGNVPENTRDALAAYLKSVGHEKADDIKGII 965

Query: 370  PRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTD---------KL 522
            P   ++KWRT+ N  DCGIFTM HMECY GEP  KWDCG   E  +P +           
Sbjct: 966  PVRMQMKWRTQHNGIDCGIFTMRHMECYKGEPVEKWDCGFNVEYEVPANIKKVKNKKPVN 1025

Query: 523  RQCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEF 630
            +Q   L  LR KF  KILLHEIN+    + + +K F
Sbjct: 1026 KQTAQLEDLRRKFIAKILLHEINEQRDYVIEDSKTF 1061


>gb|OTG09329.1| hypothetical protein HannXRQ_Chr11g0351751 [Helianthus annuus]
          Length = 1180

 Score =  102 bits (255), Expect = 3e-21
 Identities = 69/216 (31%), Positives = 100/216 (46%), Gaps = 16/216 (7%)
 Frame = +1

Query: 31   CIIKSMFDGTLASEDKKFESFENEILAQFADNVAGL-------ALDGIDLAFFPICNAGH 189
            C I+ +    L + DK+    + +++  F  N+  +       ++    +   PI +  H
Sbjct: 940  CTIRML---ALPNCDKEESENKGKMIKAFTMNIEKILKGAKLKSIKDFKIILVPILHIQH 996

Query: 190  FYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILKAK 369
            F+               DNS    T +SKY    E  +   + +LK  GH+K  +I    
Sbjct: 997  FFVISFNLEEKQIFII-DNSAKEETNESKYGNVPENTRDALAAYLKSVGHEKADDIKGII 1055

Query: 370  PRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTD---------KL 522
            P   ++KWRT+ N  DCGIFTM HMECY GEP  KWDCG   E  +P +           
Sbjct: 1056 PVRMQMKWRTQHNGIDCGIFTMRHMECYKGEPVEKWDCGFNVEYEVPANIKKVKNKKPVN 1115

Query: 523  RQCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEF 630
            +Q   L  LR KF  KILLHEIN+    + + +K F
Sbjct: 1116 KQTAQLEDLRRKFIAKILLHEINEQRDYVIEDSKTF 1151


>ref|XP_021721306.1| uncharacterized protein LOC110688857 [Chenopodium quinoa]
 ref|XP_021721308.1| uncharacterized protein LOC110688857 [Chenopodium quinoa]
 ref|XP_021721309.1| uncharacterized protein LOC110688857 [Chenopodium quinoa]
          Length = 543

 Score =  102 bits (253), Expect = 4e-21
 Identities = 65/227 (28%), Positives = 109/227 (48%), Gaps = 9/227 (3%)
 Frame = +1

Query: 1   SPSRHFFPTGCIIKSMFDGTLASEDKKFESF----ENEILAQFADNVAGLA--LDGIDLA 162
           +P R FF T   +  +F      + K+ ++F    +NE++ Q   +   L   +  + + 
Sbjct: 323 APGRFFFSTYPFV-ILFGNAYTEKQKEIDAFLDRLQNELIDQSNTDDTDLLCKIKKLSMV 381

Query: 163 FFPICNAGHFYXXXXXXXXXXXXXXXDNS-DSGATYDSKYKKTCELLKKLFSRHLKLYGH 339
           FFP+    H+Y               +   + G ++ +KY    + L ++F++ L+  G+
Sbjct: 382 FFPVVRGDHYYLLCIDMQKSAFELLDNRILEEGISFKNKYDNHRKQLLRMFTKFLRNIGY 441

Query: 340 K--KHANILKAKPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPT 513
              K  N+ K   +   +KWR+ KN+ DCGIF M HME YNGE   +WDCGL  +  +  
Sbjct: 442 DQAKADNLEKKSTKTLNMKWRSNKNYDDCGIFLMKHMETYNGEKDTEWDCGLQTKYEV-- 499

Query: 514 DKLRQCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEK 654
               Q   L++LR K+  KILLH+ N    ++   A+   K   +EK
Sbjct: 500 ----QLKQLKKLRVKYCAKILLHKENTLHNQVLKYARVHSKEKVEEK 542


>gb|PLY79665.1| hypothetical protein LSAT_5X126901 [Lactuca sativa]
          Length = 659

 Score =  101 bits (252), Expect = 7e-21
 Identities = 73/238 (30%), Positives = 115/238 (48%), Gaps = 6/238 (2%)
 Frame = +1

Query: 1    SPSRHFFPTGCIIKSMFDGTLASEDKKFESFENEILAQFADNVAG----LALDGIDLAFF 168
            SP R F   G  + + +  +  S+ +K+E+F+      F D+  G    L +  ID+ FF
Sbjct: 431  SPYRLFLKVG--VSTAYLTSTLSDKRKYENFKEN----FHDSTNGYKKILNIKDIDMVFF 484

Query: 169  PICNAGHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKH 348
            P+  + + +               DN      Y+ KY    + LK LF R+ K   H + 
Sbjct: 485  PVVISAYIFVIVFNLKKPSIEIL-DNGAVEGDYEGKYGVIMKPLKNLFVRYFKEINHPRA 543

Query: 349  ANILKA--KPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKL 522
              I K   KP+  ++ WRT KN  DCG+F M HME Y G+P  KW  GL  ES +     
Sbjct: 544  NAISKESIKPQRLEMSWRTVKNKVDCGVFAMRHMETYMGQPLSKWKPGLHKESAV----- 598

Query: 523  RQCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEKMNIIVEAIRNRETR 696
             Q   L +LR ++A  +L  EIN    K+ DLA+++ KV  + + + + +A++  + R
Sbjct: 599  -QQTTLEKLRQRYAHIMLTSEINMLKAKVLDLAEKYQKVEFKVRTDHVYKAMQTIQKR 655


>ref|XP_022021127.1| uncharacterized protein LOC110921160 [Helianthus annuus]
          Length = 215

 Score = 96.7 bits (239), Expect = 1e-20
 Identities = 63/216 (29%), Positives = 99/216 (45%)
 Frame = +1

Query: 55  GTLASEDKKFESFENEILAQFADNVAGLALDGIDLAFFPICNAGHFYXXXXXXXXXXXXX 234
           G     +K +E F+  +          L+   IDL   P+ ++  FY             
Sbjct: 7   GNAEEFEKIYELFKRRLEVLMEKEKHLLSFQDIDLVCIPVLSSKDFYLLVLNLNDVSLIL 66

Query: 235 XXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILKAKPRIPKLKWRTKKNFR 414
                D+       Y+K   ++ KLF ++L  + HK    I K +P+    KW T+KN+ 
Sbjct: 67  M----DNKCKNIDDYEKVPYIIGKLFGKYLNSFKHKNAKEIYKVEPKKADFKWSTEKNYT 122

Query: 415 DCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKLRQCDMLRRLRFKFATKILLHEINK 594
           D G+F M HM+ Y GE   ++ CG+ A+         Q + L+ LR K+AT+ILL E+N 
Sbjct: 123 DSGVFLMCHMDNYMGENIKEYHCGIFAKCS------GQVNQLKVLRRKYATRILLSELNL 176

Query: 595 HAGKLFDLAKEFDKVPPQEKMNIIVEAIRNRETREL 702
              ++   A  F  +  +EK NI+  AI+N   R L
Sbjct: 177 KKEEILVEANHFHCLRKEEKNNILDYAIKNWNNRGL 212


>ref|XP_022014209.1| uncharacterized protein LOC110913694 [Helianthus annuus]
          Length = 211

 Score = 95.1 bits (235), Expect = 3e-20
 Identities = 62/214 (28%), Positives = 97/214 (45%)
 Frame = +1

Query: 55  GTLASEDKKFESFENEILAQFADNVAGLALDGIDLAFFPICNAGHFYXXXXXXXXXXXXX 234
           G     +K +E F+  +          L+   IDL   P+ ++  FY             
Sbjct: 7   GNAEEFEKIYELFKRRLEVLMEKEKHLLSFQDIDLVCIPVLSSKDFYLLVLNLNDVSLIL 66

Query: 235 XXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILKAKPRIPKLKWRTKKNFR 414
                D+       Y+K   ++ KLF ++L  + HK    I K +P+    KW T+KN  
Sbjct: 67  M----DNKCKNIDDYEKVPYIIGKLFGKYLNSFKHKNAKEIYKVEPKKADFKWSTEKNHT 122

Query: 415 DCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKLRQCDMLRRLRFKFATKILLHEINK 594
           D G+F M HM+ Y GE   ++ CG+ A+         Q + L+ LR K+AT+ILL E+N 
Sbjct: 123 DSGVFLMCHMDTYMGENIKEYHCGIFAKCS------GQVNQLKVLRRKYATRILLSELNL 176

Query: 595 HAGKLFDLAKEFDKVPPQEKMNIIVEAIRNRETR 696
              ++   A  F  +  +EK NI+  AI+N   R
Sbjct: 177 KKEEILVEANHFHCLRKEEKNNILDYAIKNWNNR 210


>ref|XP_021987267.1| uncharacterized protein LOC110883914 [Helianthus annuus]
          Length = 189

 Score = 94.0 bits (232), Expect = 6e-20
 Identities = 59/189 (31%), Positives = 90/189 (47%)
 Frame = +1

Query: 136 LALDGIDLAFFPICNAGHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFS 315
           L+   IDL   P+ ++  FY                  D+       Y+K   ++ KLF 
Sbjct: 8   LSFQDIDLVCIPVLSSKDFYLLVLNLNDVSLILM----DNKCKNIDDYEKVPYIIGKLFG 63

Query: 316 RHLKLYGHKKHANILKAKPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVA 495
           ++L  + HK    I K +P+    KW T+KN  D G+F M HM+ Y GE   ++ CG+ A
Sbjct: 64  KYLNSFKHKNAKEIYKVEPKKADFKWSTEKNHTDSGVFLMCHMDTYMGENIKEYHCGIFA 123

Query: 496 ESGLPTDKLRQCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEKMNIIVEA 675
           +         Q + L+ LR K+AT+ILL E+N    ++   A  F  +  +EK NI+  A
Sbjct: 124 KCS------GQVNQLKVLRRKYATRILLSELNLKKEEILVEANHFHCLRKEEKNNILDYA 177

Query: 676 IRNRETREL 702
           I+N   R L
Sbjct: 178 IKNWNNRGL 186


>ref|XP_021989724.1| uncharacterized protein LOC110886255 [Helianthus annuus]
 gb|OTG12461.1| putative ulp1 protease family, C-terminal catalytic
           domain-containing protein [Helianthus annuus]
          Length = 398

 Score = 92.8 bits (229), Expect = 4e-18
 Identities = 57/173 (32%), Positives = 81/173 (46%), Gaps = 2/173 (1%)
 Frame = +1

Query: 142 LDGIDLAFFPICNAGHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRH 321
           L GIDL F PI ++ H+Y               DN      +DSKY    ++++     +
Sbjct: 214 LIGIDLVFIPILHSKHYYLICYNLKKALVDVI-DNLGRNVEFDSKYAFRPQIMQNTLCNY 272

Query: 322 LKLYGHKKHANILKAKPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKW--DCGLVA 495
           L++  H   + + K +P+I ++ WRT  N  DCGIF M HME Y       W   CGL  
Sbjct: 273 LEMTSHPIASKLRKCEPKILEMPWRTVNNSVDCGIFVMRHMETYKCTTIKDWKPKCGLAG 332

Query: 496 ESGLPTDKLRQCDMLRRLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEK 654
           ES        Q   L  LR K+  KILL +IN   G +    +E+ K P +E+
Sbjct: 333 ESEY------QKTQLVDLRMKYLAKILLSDINIRKGVVISEVREYSKRPTEER 379


>gb|OTG18478.1| putative ulp1 protease family, C-terminal catalytic domain-containing
            protein [Helianthus annuus]
          Length = 949

 Score = 93.6 bits (231), Expect = 5e-18
 Identities = 65/217 (29%), Positives = 101/217 (46%)
 Frame = +1

Query: 4    PSRHFFPTGCIIKSMFDGTLASEDKKFESFENEILAQFADNVAGLALDGIDLAFFPICNA 183
            P R F     +    FD T+ +E+++ ++F N  + +   +     + G DL F PI  +
Sbjct: 723  PKRMFCYCNMLGDKDFDITV-NEEERIKTF-NTNMDRILRDTEKKTIIGFDLLFIPILYS 780

Query: 184  GHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILK 363
             H+Y               DN  S A +D+KY +  ++++K    +L++ GH   + +  
Sbjct: 781  KHYYVICFNLTQPQVHVI-DNLASKADFDAKYGQRPQIMQKTLCSYLEIVGHPLASELKM 839

Query: 364  AKPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKLRQCDMLR 543
             KP   ++ WRT  N  DCGIF M HME Y   P   W CGL+ E         Q   L 
Sbjct: 840  CKPVRLEMPWRTHYNSVDCGIFVMRHMETYKCTPIKVWLCGLLKEVE------GQRGQLN 893

Query: 544  RLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEK 654
             LR K+  KIL+ +IN     +    ++F KV  +EK
Sbjct: 894  DLRVKYLVKILMSDINIQKDAIVAEVRQFAKVRDEEK 930


>ref|XP_021977388.1| uncharacterized protein LOC110872817 isoform X4 [Helianthus annuus]
 ref|XP_021993803.1| uncharacterized protein LOC110890496 isoform X4 [Helianthus annuus]
          Length = 995

 Score = 93.6 bits (231), Expect = 5e-18
 Identities = 65/217 (29%), Positives = 101/217 (46%)
 Frame = +1

Query: 4    PSRHFFPTGCIIKSMFDGTLASEDKKFESFENEILAQFADNVAGLALDGIDLAFFPICNA 183
            P R F     +    FD T+ +E+++ ++F N  + +   +     + G DL F PI  +
Sbjct: 769  PKRMFCYCNMLGDKDFDITV-NEEERIKTF-NTNMDRILRDTEKKTIIGFDLLFIPILYS 826

Query: 184  GHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILK 363
             H+Y               DN  S A +D+KY +  ++++K    +L++ GH   + +  
Sbjct: 827  KHYYVICFNLTQPQVHVI-DNLASKADFDAKYGQRPQIMQKTLCSYLEIVGHPLASELKM 885

Query: 364  AKPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKLRQCDMLR 543
             KP   ++ WRT  N  DCGIF M HME Y   P   W CGL+ E         Q   L 
Sbjct: 886  CKPVRLEMPWRTHYNSVDCGIFVMRHMETYKCTPIKVWLCGLLKEVE------GQRGQLN 939

Query: 544  RLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEK 654
             LR K+  KIL+ +IN     +    ++F KV  +EK
Sbjct: 940  DLRVKYLVKILMSDINIQKDAIVAEVRQFAKVRDEEK 976


>ref|XP_021977387.1| uncharacterized protein LOC110872817 isoform X3 [Helianthus annuus]
 ref|XP_021993802.1| uncharacterized protein LOC110890496 isoform X3 [Helianthus annuus]
          Length = 996

 Score = 93.6 bits (231), Expect = 5e-18
 Identities = 65/217 (29%), Positives = 101/217 (46%)
 Frame = +1

Query: 4    PSRHFFPTGCIIKSMFDGTLASEDKKFESFENEILAQFADNVAGLALDGIDLAFFPICNA 183
            P R F     +    FD T+ +E+++ ++F N  + +   +     + G DL F PI  +
Sbjct: 770  PKRMFCYCNMLGDKDFDITV-NEEERIKTF-NTNMDRILRDTEKKTIIGFDLLFIPILYS 827

Query: 184  GHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILK 363
             H+Y               DN  S A +D+KY +  ++++K    +L++ GH   + +  
Sbjct: 828  KHYYVICFNLTQPQVHVI-DNLASKADFDAKYGQRPQIMQKTLCSYLEIVGHPLASELKM 886

Query: 364  AKPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKLRQCDMLR 543
             KP   ++ WRT  N  DCGIF M HME Y   P   W CGL+ E         Q   L 
Sbjct: 887  CKPVRLEMPWRTHYNSVDCGIFVMRHMETYKCTPIKVWLCGLLKEVE------GQRGQLN 940

Query: 544  RLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEK 654
             LR K+  KIL+ +IN     +    ++F KV  +EK
Sbjct: 941  DLRVKYLVKILMSDINIQKDAIVAEVRQFAKVRDEEK 977


>ref|XP_021977386.1| uncharacterized protein LOC110872817 isoform X2 [Helianthus annuus]
 ref|XP_021993801.1| uncharacterized protein LOC110890496 isoform X2 [Helianthus annuus]
          Length = 997

 Score = 93.6 bits (231), Expect = 5e-18
 Identities = 65/217 (29%), Positives = 101/217 (46%)
 Frame = +1

Query: 4    PSRHFFPTGCIIKSMFDGTLASEDKKFESFENEILAQFADNVAGLALDGIDLAFFPICNA 183
            P R F     +    FD T+ +E+++ ++F N  + +   +     + G DL F PI  +
Sbjct: 771  PKRMFCYCNMLGDKDFDITV-NEEERIKTF-NTNMDRILRDTEKKTIIGFDLLFIPILYS 828

Query: 184  GHFYXXXXXXXXXXXXXXXDNSDSGATYDSKYKKTCELLKKLFSRHLKLYGHKKHANILK 363
             H+Y               DN  S A +D+KY +  ++++K    +L++ GH   + +  
Sbjct: 829  KHYYVICFNLTQPQVHVI-DNLASKADFDAKYGQRPQIMQKTLCSYLEIVGHPLASELKM 887

Query: 364  AKPRIPKLKWRTKKNFRDCGIFTMVHMECYNGEPAMKWDCGLVAESGLPTDKLRQCDMLR 543
             KP   ++ WRT  N  DCGIF M HME Y   P   W CGL+ E         Q   L 
Sbjct: 888  CKPVRLEMPWRTHYNSVDCGIFVMRHMETYKCTPIKVWLCGLLKEVE------GQRGQLN 941

Query: 544  RLRFKFATKILLHEINKHAGKLFDLAKEFDKVPPQEK 654
             LR K+  KIL+ +IN     +    ++F KV  +EK
Sbjct: 942  DLRVKYLVKILMSDINIQKDAIVAEVRQFAKVRDEEK 978

