BLASTX nr result

ID: Ophiopogon22_contig00032144 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon22_contig00032144
         (1049 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AQK45863.1| hypothetical protein ZEAMMB73_Zm00001d026215 [Zea...    74   5e-11
ref|XP_019173524.1| PREDICTED: hornerin-like, partial [Ipomoea nil]    71   2e-09
emb|SNY30437.1| hypothetical protein SAMN05421748_103406 [Actino...    69   3e-09
ref|WP_082741036.1| hypothetical protein [Sphingomonas sp. CCH9-F2]    68   9e-09
ref|XP_019189377.1| PREDICTED: spore wall protein 2-like, partia...    68   1e-08
ref|XP_022026106.1| hornerin-like [Helianthus annuus]                  67   2e-08
ref|XP_019194910.1| PREDICTED: spidroin-1-like, partial [Ipomoea...    67   2e-08
gb|KJQ34422.1| hypothetical protein VE19_24265, partial [Enterob...    65   2e-08
ref|XP_019197146.1| PREDICTED: spidroin-1-like [Ipomoea nil]           67   3e-08
ref|XP_005774312.1| hypothetical protein EMIHUDRAFT_444379, part...    65   5e-08
ref|XP_022026206.1| uncharacterized protein LOC110926869, partia...    65   1e-07
ref|XP_017900950.1| PREDICTED: spidroin-1-like, partial [Capra h...    64   2e-07
ref|XP_019173246.1| PREDICTED: hornerin-like, partial [Ipomoea nil]    64   2e-07
ref|XP_022006793.1| fibroin heavy chain-like [Helianthus annuus]       64   2e-07
gb|PEN20488.1| hypothetical protein CRM93_14350, partial [Acetob...    63   2e-07
gb|AHM06838.1| Pe-pgrs family protein [Mycobacterium bovis BCG s...    64   3e-07
gb|OCI25394.1| hypothetical protein BBP15_24240, partial [Salmon...    62   3e-07
ref|XP_021996405.1| hornerin-like [Helianthus annuus]                  64   3e-07
ref|XP_022026231.1| filaggrin-like, partial [Helianthus annuus]        64   4e-07
emb|CLM27435.1| PE-PGRS family protein [Mycobacterium tuberculosis]    63   4e-07

>gb|AQK45863.1| hypothetical protein ZEAMMB73_Zm00001d026215 [Zea mays]
          Length = 386

 Score = 74.3 bits (181), Expect = 5e-11
 Identities = 90/308 (29%), Positives = 112/308 (36%), Gaps = 17/308 (5%)
 Frame = -2

Query: 1021 RGSGELDLAKARATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGDGETGGQGVP 842
            RG  EL   +ARA       G  P      G A   E G+PEG ++     G T  +G P
Sbjct: 119  RGGAELAGRRARA-------GGSPERARGRGRAGPGEGGSPEGGKV-----GPTRARGSP 166

Query: 841  GAGRSEANRTGEGAVRGRLVGKGASS-RRPGAAELRRGSSGYGGAQGRVVTEGILLVGAL 665
            GA +  A + G G       G+G  S  RPGA    R  +G G  +GR         G L
Sbjct: 167  GARQPRAAKPGRGRGARPSAGRGGGSPERPGAGSPERPGAGKGRGRGR--------AGRL 218

Query: 664  RYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAATGVEGAERGTGADRVD 485
                +GG                R   R+  AR S +  R GA TG      G G D  +
Sbjct: 219  AGGGRGGE-------------AGRAGARRGRARRSGRAERSGAGTGRAAGVAGAGVDEGE 265

Query: 484  G----DCGARRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVG----MEQGCTAVA 329
            G    +    R +G + G E    G R+R G     R      RG G     E G  A A
Sbjct: 266  GGRAREAAGNRGRGGAGGAEPEEVGARARRGP----RQLAGAGRGRGGGRLTEAGAGAAA 321

Query: 328  RCSGWNWSDGTPARATGGEIKGVAGL----GSCGDGAGKRPARV----AQQRRGACRSEK 173
                    DG  A   G      AG     G  G+G G R  +        RRGA R  +
Sbjct: 322  PA-----EDGAGAAGGGRTRDAGAGAAEPGGRAGEGGGGRRGKERGGGGPHRRGAARRRR 376

Query: 172  GSGCRCQR 149
              G R +R
Sbjct: 377  RGGERRRR 384


>ref|XP_019173524.1| PREDICTED: hornerin-like, partial [Ipomoea nil]
          Length = 3805

 Score = 70.9 bits (172), Expect = 2e-09
 Identities = 81/277 (29%), Positives = 108/277 (38%), Gaps = 11/277 (3%)
 Frame = -2

Query: 964  EGNRPAEQG*GGNARVLENGTPEGLRL---EEVGDGETGGQGVPGAGRSEANRTGEGAV- 797
            EG+R A  G    +    NG+  G      +  GDGE GG+G  G  R    R G+    
Sbjct: 1721 EGDRKARGGRSERSEGARNGSARGKGRRGRKRRGDGEKGGRGEAGGRRKREGRKGQEGEE 1780

Query: 796  -RGRLVGKGASSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGALRYSCKGGNGYWWRRR 620
             RG   G+G   +RPG     RG  G GG +GR   +G             G G    RR
Sbjct: 1781 GRGEGQGEGTGEQRPGKG---RGEKGQGGREGRRGEKG------------RGKGEAGGRR 1825

Query: 619  GLPAMVVKRDRGRQRAARHSEQRLRRGAATGVEGAE-RGTGADRVDGDCGARRDKGCSAG 443
            G            Q A +  +Q   +    G +G + RG GA+R     G R  +G   G
Sbjct: 1826 GAGGGKGTNGGQEQGARKRRDQGEGKREGRGAKGGKGRGGGAER-----GRRGREGREKG 1880

Query: 442  PEARRDGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSGWNWSDGTPARATGGE--I 269
             +AR++G R  +  +         RR  G E+G    A+  G     G  A+  GGE   
Sbjct: 1881 EKARKEGGRGEKRGK---------RRKRGRERG----AKEKGGRGGKGGRAKGEGGEGGR 1927

Query: 268  KGVAGLGSCGDGAGKRPARVAQQRR---GACRSEKGS 167
             G  G G  G   G R     ++ R   G  R EK S
Sbjct: 1928 PGEGGRGGGGRKEGGRKGATNERGRGGEGGTRGEKES 1964



 Score = 59.7 bits (143), Expect = 7e-06
 Identities = 87/308 (28%), Positives = 123/308 (39%), Gaps = 17/308 (5%)
 Frame = -2

Query: 1021 RGSGELDLAKARATAHMVAEGN-----RPAEQG*GGNARVLENGTPEGLRLEEVGDG-ET 860
            +G  E    K  A A    +G      R  EQG  G AR       EG   +  G+  + 
Sbjct: 3378 KGGAEGRERKGGARARKQGKGGARAEAREGEQGRKGRARQKRGAKGEGKGEQGRGEARQR 3437

Query: 859  GGQGVPGAGRSEANRTGEGAVRGRLVGKGASSRRPGAAE--LRRGSSGYGGAQGRVVTEG 686
            G QG  G G+    R GE   +G+    G  +++ G A+   R+G +G GG + R   + 
Sbjct: 3438 GSQGSKGEGKGSKAREGEEGRQGK---GGTEAKKTGRAKGRERKGKTGEGG-EPRAKAKA 3493

Query: 685  ILLVGALRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLR--RGAATGVEGA- 515
                G+     KG +    + RG P     R R RQ  A+   +R R  +G A G  GA 
Sbjct: 3494 TRQRGS-----KGKH----QERGEP-----RARQRQGGAKDKGERQRGGQGQAKGKGGAR 3539

Query: 514  --ERGTGADRVDGDCGARRDKGCSAGPEARRDGERSREGD-QG*RRPATRCRRGVGMEQG 344
              E+G G  R       R  +      EA R G+R  +G  +G R    R R G   E  
Sbjct: 3540 TKEKGKGGGRAKPGGRRRTPRREKEEEEAERTGKRGAQGKRRGERGGEERKRGGKPKETR 3599

Query: 343  CTAVARCSGWNWSDGTPARATGGEIKGVAGLGSCGDGAGKRPARVAQQR---RGACRSEK 173
             TA          +G P RAT  E +  +       G G+  A   ++    +GA R EK
Sbjct: 3600 TTAERSREARRHQNGDPERATQRENQRKSRGRGGAQGQGEEDASNGERSAGGQGARREEK 3659

Query: 172  GSGCRCQR 149
             +  R +R
Sbjct: 3660 SADKRDKR 3667


>emb|SNY30437.1| hypothetical protein SAMN05421748_103406 [Actinoplanes
           atraurantiacus]
          Length = 418

 Score = 68.9 bits (167), Expect = 3e-09
 Identities = 93/294 (31%), Positives = 115/294 (39%), Gaps = 40/294 (13%)
 Frame = -2

Query: 907 GTPEGLRLEEV----GDGETGGQGVPGAGRSEANRTGEGAVRGRLVGKGASSRRPGAAEL 740
           GT  G  LE      G G  GG    G G +   R  +G    R   +  ++RR GAA  
Sbjct: 54  GTARGTGLEAARRGRGRGAIGGSARDGLGGAARRRARDGCGAARSAAQRGAARR-GAA-- 110

Query: 739 RRGSSGYGGAQGRVVTEGILLVGALRYSCKGGNGYWWRR--RGLPAMVVKRDRGRQRAAR 566
           RRG++    AQ      G    GA R + + G G   RR  RG       R   R+   R
Sbjct: 111 RRGAAR-SAAQRGAARRGAARRGAARSAARRGAGRGARRGVRGGAECGAARSAARRGVRR 169

Query: 565 HSE-----QRLRRGAATGVEGAERGT--GADRVDGDCGARRDKGCSAGPEARRDGERS-- 413
            +E        RRG      G  RG   GA R     G RR   C A     R G R   
Sbjct: 170 GAECGAARSAARRGXXXXRRGVRRGAECGAARSAARRGVRRGAECGAARRGVRRGRRGVR 229

Query: 412 ---------REGDQG*RRPAT-RC---RRGVGMEQGCTAVARCSGWNWSDG--------- 299
                    R G +G RR A  RC   RRG G E      AR  G   +DG         
Sbjct: 230 RGRRGVRRGRCGVRGARREARGRCGTARRGAGCE--VARGARRRGTEPADGFETRRGRRA 287

Query: 298 TPARATGGEIKGVAGLGSCGDGAGKRPARVAQQRRG---ACRSEKGSGCRCQRL 146
           T A  +GGE +G  G    G+ AG+R  R  ++ RG     R E+G+G R  R+
Sbjct: 288 TRAEGSGGETRGGTG---AGEAAGERRMRRRREARGKGLGRREERGTGRREARV 338


>ref|WP_082741036.1| hypothetical protein [Sphingomonas sp. CCH9-F2]
          Length = 442

 Score = 67.8 bits (164), Expect = 9e-09
 Identities = 91/296 (30%), Positives = 105/296 (35%), Gaps = 7/296 (2%)
 Frame = -2

Query: 1012 GELDLAKARATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGDGETGGQGVPGAG 833
            G  DL    A A MVA GNR           + ENG    L     G+ E     VP   
Sbjct: 183  GAHDLVTPGAVA-MVAAGNRM-------RMTIEENGRTRVLESSGAGESEPAPAAVPVVA 234

Query: 832  RSEANRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGALRYSC 653
               A+R GEG       G+    R P     R    G    +G     G L        C
Sbjct: 235  EQTASREGEGG------GRERGRRGPSRGSARGRGRGLVAGRGARADRGGL--------C 280

Query: 652  KGG-NGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAATGVEGAERGTGADRVDGDC 476
             GG  G   RR G+     +R R   R  R  E R R G      G +   G DR     
Sbjct: 281  PGGVAGDHHRRAGI--RHDRRGRRHCRCRRRDEDRDRGG------GIDDAHGRDRKSDRR 332

Query: 475  GA-RRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSGWNWSDG 299
            GA RR +GC   PE   D  R R   QG        RRG G +QGC     C        
Sbjct: 333  GARRRGRGCRHRPEV-GDRNRHRGAGQGGGGTPVGRRRG-GRDQGCRG---CRPGEGGGR 387

Query: 298  TPARATGGEIKGVAGLGSCGDGAGKRPAR-----VAQQRRGACRSEKGSGCRCQRL 146
                  GG  +G  G G  G GAG R +R       +Q  GA    +G  CR  RL
Sbjct: 388  AAQHREGGGRRGGGGSGQ-GRGAGGRGSRPRRGGAGRQGAGAGTGRRGRRCREARL 442


>ref|XP_019189377.1| PREDICTED: spore wall protein 2-like, partial [Ipomoea nil]
          Length = 830

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 91/304 (29%), Positives = 122/304 (40%), Gaps = 9/304 (2%)
 Frame = -2

Query: 1048 RSEQRT-AR*RGSGELDLAKA---RATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRLE 881
            R E++T A+ +G G  +  +    R   H    G RP  QG GG  R  + G+  G    
Sbjct: 350  RKEEKTPAKTKGKGRREGRRGSQKRRADHARTRG-RPKNQGSGGKRRK-KRGSERGR--- 404

Query: 880  EVGDGETGGQGVPGAGRSEANRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGYGGAQGR 701
               D   GGQG PG G  E N   +G  RGR    GA   R G  E  RG+ G    +  
Sbjct: 405  ---DERQGGQGKPGNGEGEGNEREKGEERGR--KGGAKGGRKGGTENGRGAKGESEERQG 459

Query: 700  VVTEGILLVGALRYSCKG---GNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAAT 530
               E     G  R   +G   G G   RR+       +R   R++ A+  E R R+ A  
Sbjct: 460  NTEERRGAQGRKRQKKRGTENGPGAKGRRK-----ESQRKGRRKQGAKEGEARDRKRAGA 514

Query: 529  GVEGAERGTGADRVDGDCGARRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVGME 350
            G E   +   A R       ++ K   AG EAR  G+R RE  +G ++   + RR  G  
Sbjct: 515  GGERGGKERQATR-----RTKKRKRAGAGAEAR--GQRKREDRKGQQKRKRQERREGGRR 567

Query: 349  QGCTAVARCSGWNWSDGTPARATGGEIKGVAGLGSCGDGA--GKRPARVAQQRRGACRSE 176
            +   A     G N  D T     G   KG A      +G+   +R  R  +  RG  R  
Sbjct: 568  KE-KAGGEQHGEN--DETTGETEGPGAKGGAEGRRAAEGSRQRERKGRGRKASRGQTRKH 624

Query: 175  KGSG 164
            KG G
Sbjct: 625  KGEG 628


>ref|XP_022026106.1| hornerin-like [Helianthus annuus]
          Length = 749

 Score = 67.0 bits (162), Expect = 2e-08
 Identities = 85/300 (28%), Positives = 116/300 (38%), Gaps = 14/300 (4%)
 Frame = -2

Query: 1048 RSEQRTA-R*RGSGELDLAKARATAHMVAEGNRPA------EQG*GGNARVLENGTPEGL 890
            ++E  TA R RG GE +  +  A+     E   P       E+  GG  R  + GT  G 
Sbjct: 328  KAESGTAKRERGEGERERGQGEASTRKAEERGAPRKETGGPEEEKGGKKRRGQRGTGGGG 387

Query: 889  RLEEVG-DGETGGQGVPGAGRSEANRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGY-- 719
              E+ G +G  G +G     ++E  R   GA RG    KG      G  E  RG  G   
Sbjct: 388  --EKGGTEGTRGKRGQQRTAKAERGRRRGGAGRGERKEKGREEG--GGREEGRGKEGKRE 443

Query: 718  GGAQGRVVTEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRG 539
             G QGR                K G     ++RG  A   + ++GR R     +++ R  
Sbjct: 444  AGGQGR----------------KDGG----QKRGEEATAKQPEKGRPRRENTQQRKDREE 483

Query: 538  AATGVEGAERGTGAD--RVDGDCGARRDKGCSAGPEARRDGERSREGDQG*RRPATRCRR 365
                 EG  +GTG    R  G  G + +     G   RR+  + RE D+   R       
Sbjct: 484  K----EGTRKGTGQKERRGRGPRGEQAEGRQGQGGRERRERAKRRERDRDTGRARAAAAG 539

Query: 364  GVGMEQGCTAVARCSGWNWSDGTPAR--ATGGEIKGVAGLGSCGDGAGKRPARVAQQRRG 191
            G G   G  A       N + G+PA   A GGE  G  G    G G G+     A++RRG
Sbjct: 540  GGGERPGSRAAHGAGTENGAKGSPAHREAGGGEYTGDTGRAGRGPGPGRGQEGDARRRRG 599


>ref|XP_019194910.1| PREDICTED: spidroin-1-like, partial [Ipomoea nil]
          Length = 928

 Score = 67.0 bits (162), Expect = 2e-08
 Identities = 89/312 (28%), Positives = 123/312 (39%), Gaps = 15/312 (4%)
 Frame = -2

Query: 1048 RSEQRTAR*RGSGELDLAKAR-ATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRL---- 884
            + + RT R  G GE   AK R A       G     +  GG       G+ EG R     
Sbjct: 577  KRDPRTRRAGGRGEAQAAKGRGARGQRRRRGEGKQGRRKGGKRTGGGGGSGEGRRAGRRA 636

Query: 883  --EEVGDGETGGQGVPG---AGRSEANRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGY 719
                 GDG  GG G  G    GR   +R G G  RGR   +G +  R   +  R   +  
Sbjct: 637  EGRATGDGSGGGDGAAGRREGGRQRDHRPGRGHKRGR-SSRGKNQGRTD-SNRRNERANQ 694

Query: 718  GGAQGRVVTEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRG 539
            GG Q R  ++      +     +G NG+  R+R   A      +G +        R RR 
Sbjct: 695  GGNQRRRDSKR----RSRGRRNQGKNGHQGRKREGTA----AGQGEEGPTGRRATRRRRA 746

Query: 538  AATGVEGAER---GTGADRVDGDCGARRDKGCSA--GPEARRDGERSREGDQG*RRPATR 374
            A    EGAER     GA++ + + GA+  KG  A  G EAR++  ++R+   G  R    
Sbjct: 747  AGPREEGAERQEADRGAEK-EPETGAQATKGAQADGGGEARKEERKARKDKGGGERGRAE 805

Query: 373  CRRGVGMEQGCTAVARCSGWNWSDGTPARATGGEIKGVAGLGSCGDGAGKRPARVAQQRR 194
              +G G E+G             D    R   GE +G AG    G G   +  +  Q  R
Sbjct: 806  GGQGQGRERG-----------REDEKKRRRQQGERQGPAG----GQGRDAKRRKRGQGDR 850

Query: 193  GACRSEKGSGCR 158
            GA   E+G+  R
Sbjct: 851  GAGSKERGTAGR 862



 Score = 64.3 bits (155), Expect = 2e-07
 Identities = 84/307 (27%), Positives = 107/307 (34%), Gaps = 14/307 (4%)
 Frame = -2

Query: 1027 R*RGSGELDLAKARATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGDGE----- 863
            R  G+GE D   A    H   EG R      GG A   E G+  G    E G G+     
Sbjct: 393  RTEGAGE-DKTGAERGQHR--EGGRRKRGRGGGTAA--ETGSRGGRESRESGRGKPREQE 447

Query: 862  -TGGQGVPGAGR-----SEANRTGEGAVRGRLVGKGASSRRPGAAEL---RRGSSGYGGA 710
             T  +G  G GR      E   TG GA   +  G+GA   RPGA +    R+  +G GG 
Sbjct: 448  KTQREGGEGGGRRGERDGEGTSTGAGAKGRQEQGRGAQDTRPGAPDQKGERKEETGPGGR 507

Query: 709  QGRVVTEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAAT 530
             G                  GG G                 GR+R  R   +R RR    
Sbjct: 508  GG------------------GGGG-------------ATRSGRRRGGRKHGRRQRREETG 536

Query: 529  GVEGAERGTGADRVDGDCGARRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVGME 350
              +G    TG  R  G+     + G     E RR+    ++ D   RR   R        
Sbjct: 537  ANQGGPEDTGGGRQRGETEGATEAGEGGPAEGRRENPHEKKRDPRTRRAGGRGEAQAAKG 596

Query: 349  QGCTAVARCSGWNWSDGTPARATGGEIKGVAGLGSCGDGAGKRPARVAQQRRGACRSEKG 170
            +G     R  G    +G   R  GG+  G    G  G G G+R  R A+ R     S  G
Sbjct: 597  RGARGQRRRRG----EGKQGRRKGGKRTG----GGGGSGEGRRAGRRAEGRATGDGSGGG 648

Query: 169  SGCRCQR 149
             G   +R
Sbjct: 649  DGAAGRR 655


>gb|KJQ34422.1| hypothetical protein VE19_24265, partial [Enterobacter cloacae]
          Length = 288

 Score = 65.5 bits (158), Expect = 2e-08
 Identities = 77/264 (29%), Positives = 96/264 (36%), Gaps = 5/264 (1%)
 Frame = -2

Query: 940 G*GGNARVLENGTPEGLRLEEVGDGETGGQGVPGAGRSEANRTGEGAVRGRLVGKGASSR 761
           G GG     E G   G R  E G+G  GG+G  G GR E    GEG       G G    
Sbjct: 17  GGGGGGGGGEEGEEGGERGGEGGEGGGGGEG--GGGRGEGGGGGEG-------GGGGGGE 67

Query: 760 RPGAAELRRGSSGYGGAQGRVVTEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRDRGR 581
           + G     RG  G GG +     EG            GG G   R R        R+RG 
Sbjct: 68  KEGGGGEGRGGGGEGGEE-----EG------------GGGGEGERGR--------RERGG 102

Query: 580 QRAARHSEQRLRRGA-----ATGVEGAERGTGADRVDGDCGARRDKGCSAGPEARRDGER 416
           +R      +R RRG        G +G   G G ++ + + G RR +G   G   R     
Sbjct: 103 ERGGGGGGKRRRRGGEGERRGRGRKGRRGGGGGEKGEREKGRRRGRGGGKGGGGR----- 157

Query: 415 SREGDQG*RRPATRCRRGVGMEQGCTAVARCSGWNWSDGTPARATGGEIKGVAGLGSCGD 236
              G +G RR   R ++G G E+              +G   R  GG  +   G G  G 
Sbjct: 158 ---GGKGGRRGRRRGKKGGGGEE--RGGGEGGEGKEGEGGGGRGGGGREEKGGGEGGRGG 212

Query: 235 GAGKRPARVAQQRRGACRSEKGSG 164
           G  K   R    R G  R E G G
Sbjct: 213 GERKGGGRGRGGRGGEKRGEGGGG 236


>ref|XP_019197146.1| PREDICTED: spidroin-1-like [Ipomoea nil]
          Length = 1251

 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 80/296 (27%), Positives = 111/296 (37%), Gaps = 25/296 (8%)
 Frame = -2

Query: 961  GNRPAEQG*GGNARVLENGTPEGLRLEEVGDGETGGQGVPGAGRSEANRTGEGAVRGR-- 788
            G +    G G  AR   +G  EG    E    E  G      G+    R GEGA +G+  
Sbjct: 346  GKKSQSSGKGRKARKGASGGKEGRATAEAATEEEEGGSTERGGKK--GRRGEGAEKGKGG 403

Query: 787  ----LVGKGASSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGALRYSCKGGNGYWWRRR 620
                  G G   R  G  E R+G +  GG +G    +G     A     + G     R  
Sbjct: 404  GGATAAGSGEKERGSGGGEHRKGGTEQGGGRGGKRRKGSPQARAGEGEKEAGGKTGRRAE 463

Query: 619  GLPAMVVKRDRGRQRAARHSEQRLRRGAATGVEGAERGTGADRVDGDCGARRDKGCSAGP 440
            G P    +R  GR R +R   ++ RR    G E    G G    +G+ G  R +G   GP
Sbjct: 464  GSPQARRERKGGR-RGSREERKKARRDGGEGKE----GQGEQEGEGEEGNGRAEG-GRGP 517

Query: 439  EARRDGERSREGDQ-------G*RR----------PATRCRRGVGMEQGCTAVARCSGWN 311
            +  R G  +R+G+Q       G +R           AT+ RRG   E+G       +G  
Sbjct: 518  QGGRKGTGARKGEQKESKEAEGRKREPTGKGRAGEAATQGRRGQQKERGQGERREENGSP 577

Query: 310  WSDGTPARATGGEIKGVAGLGSCGDGAGKRPARVAQQRRG--ACRSEKGSGCRCQR 149
             + G   R    E  G  G    G G  +R     +Q  G  A   E+  G + +R
Sbjct: 578  QARGGHERRRAREEGGKRGAEGGGRGGKRRKTGAHRQGEGKKAATQERRRGAQGRR 633



 Score = 59.7 bits (143), Expect = 6e-06
 Identities = 81/287 (28%), Positives = 104/287 (36%), Gaps = 2/287 (0%)
 Frame = -2

Query: 1018 GSGELDLA-KARATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGDGETGGQGVP 842
            G GE +   K    A    +  R  + G  G+    +    +G   +E G GE  G+G  
Sbjct: 448  GEGEKEAGGKTGRRAEGSPQARRERKGGRRGSREERKKARRDGGEGKE-GQGEQEGEGEE 506

Query: 841  GAGRSEANRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGALR 662
            G GR+E  R  +G  +G    KG       A   +R  +G G A G   T+G       R
Sbjct: 507  GNGRAEGGRGPQGGRKGTGARKGEQKESKEAEGRKREPTGKGRA-GEAATQG-------R 558

Query: 661  YSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLR-RGAATGVEGAERGTGADRVD 485
               +   G   RR        + +   Q    H  +R R  G   G EG  RG       
Sbjct: 559  RGQQKERGQGERR--------EENGSPQARGGHERRRAREEGGKRGAEGGGRG------- 603

Query: 484  GDCGARRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSGWNWS 305
               G RR  G      A R GE  +   Q  RR   + RRG    QG    A   G N  
Sbjct: 604  ---GKRRKTG------AHRQGEGKKAATQE-RRRGAQGRRGRKAGQGEGEGAGGQGQNRE 653

Query: 304  DGTPARATGGEIKGVAGLGSCGDGAGKRPARVAQQRRGACRSEKGSG 164
             GT  +    E    AG G   +G   R  R  Q R G    EKG+G
Sbjct: 654  AGTAQKGAREEGSHEAGAGRRTNG---RKGREGQTRGGGQEEEKGAG 697


>ref|XP_005774312.1| hypothetical protein EMIHUDRAFT_444379, partial [Emiliania huxleyi
           CCMP1516]
 gb|EOD21883.1| hypothetical protein EMIHUDRAFT_444379, partial [Emiliania huxleyi
           CCMP1516]
          Length = 468

 Score = 65.5 bits (158), Expect = 5e-08
 Identities = 81/238 (34%), Positives = 94/238 (39%), Gaps = 17/238 (7%)
 Frame = -2

Query: 889 RLEEVGDGETGGQGVPGAGRSEANRTGEGAVRGRLVGKGASSRRPGAAE------LRRGS 728
           R  E G G  GG GVP A   +      GA RGR  G    +RR GA         RRG 
Sbjct: 1   RRGEQGAGGRGG-GVPRAAARD-----RGAPRGRAGGGAGGARRGGAPRAVRARGARRGG 54

Query: 727 SGYGGAQGRVVTEGILLVGALRYSCKGG---NGYWWRRRGLPAMVVKRDRGRQRAARHSE 557
            G GG   R    G    G       GG   +G   R RG    V +  RG +RA R + 
Sbjct: 55  RGRGGGCRRAARGG----GPCGVGLGGGEARHGPSARARG--GAVCRAGRGSRRARRRAR 108

Query: 556 QRLRRGAATGVEG---AERGTGADRVDGDCGARRDKGCS-AGPEARRDGERSREGDQG*R 389
           + LR  AA G  G   A  G    R  G     R + C  A P  RR G  +     G R
Sbjct: 109 RGLRGEAAGGPGGGGCACGGAAGRRARGAAQELRGQPCQRAPPRGRRAGVAAAREAAGAR 168

Query: 388 RPATRCRRGVGMEQGCTAVARCSGWNWSDGTP----ARATGGEIKGVAGLGSCGDGAG 227
           + A    RG G  +G  A ARC      D +     AR  G    G AG G+ G GAG
Sbjct: 169 QAA----RG-GAAKGSRAAARCGARGAGDASQGVGRARPGGAGAGGGAGGGAAGGGAG 221


>ref|XP_022026206.1| uncharacterized protein LOC110926869, partial [Helianthus annuus]
          Length = 2254

 Score = 65.1 bits (157), Expect = 1e-07
 Identities = 81/274 (29%), Positives = 102/274 (37%), Gaps = 14/274 (5%)
 Frame = -2

Query: 943  QG*GGNARVLENGTPEG--LRLEEVGDGETGGQGVPGAGRSEANRTGEGAVRGRLVGKGA 770
            +G  GN R     T EG   R  +   G   G+   G G++ A R GEG  R R   + A
Sbjct: 1226 RGKAGNERDRSRRTSEGEGARQGQRRRGTRAGEATEGGGKNRAGRPGEGRGRTRATRREA 1285

Query: 769  SSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRD 590
            + R PG      G  G G  +GR   +G           KGG     R+RG P       
Sbjct: 1286 TRREPGPT--GPGKRGKGSGRGR--EQG-----------KGG-----RKRGRPRKREAAA 1325

Query: 589  RGRQRAAR-HSEQRLRRGAATGVEGAERGTGAD------RVDGDCGA-----RRDKGCSA 446
            RG    AR H+ +R  RG   G +G ERG G        R     GA     RR+KG   
Sbjct: 1326 RGDDGPARGHTARRADRGKGPG-KGEERGDGGGARPATAREGHGTGAGRGRRRRNKGAGG 1384

Query: 445  GPEARRDGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSGWNWSDGTPARATGGEIK 266
             P        +++ D     P TR   G G ++   A         + G P R  G    
Sbjct: 1385 EPAEGAGANPAQQND-----PGTREHSGPGRDRSRGA---------ARGRPGRRAGRRGP 1430

Query: 265  GVAGLGSCGDGAGKRPARVAQQRRGACRSEKGSG 164
               G G  G    K PAR  +  R   R E G G
Sbjct: 1431 DRGGAGRAGGQKNKPPAR--EGPRKTKRKEHGRG 1462



 Score = 60.1 bits (144), Expect = 5e-06
 Identities = 82/287 (28%), Positives = 104/287 (36%), Gaps = 20/287 (6%)
 Frame = -2

Query: 958  NRPAEQG*GGNARVLENGTPEGLRLEEVGDGETGGQGVPGAGRSEANRTGEGAVRGRLVG 779
            +R ++ G GG     + G   G R +  G GE  GQG P   R    R G G       G
Sbjct: 595  SRASKNGGGGREEGNKGGRESGQRAQRPGGGEARGQGGPAGPR--GRRGGHGP------G 646

Query: 778  KGASSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGALRYSCKGGNGY-WWRRRGLPAMV 602
             G +S R GA     G  G GGA+G                    NG+   R RG  A  
Sbjct: 647  GGGNSHRKGAG---GGGGGRGGARG-----------------GRNNGHPTDRGRGAEAGR 686

Query: 601  VKRDRGRQRAARH-----SEQRLRRGAATGVEGAERGTGADRVDGDCGARRDKGCSAGPE 437
             + + G + A R      SEQR  R  A G +   +  GA    G  G         GP 
Sbjct: 687  AEAEGGGRGADRRHGRGPSEQRGGRRGAAGGQKRNQEAGAGHRSGKAGR------EGGPR 740

Query: 436  ARRDG--ERSREGDQG*RRP-ATRCRRGVGMEQ-----------GCTAVARCSGWNWSDG 299
             +  G   R+R G  G R P A R  R  G E+           G  A  + +G +   G
Sbjct: 741  EQGGGPRRRNRRGAGGRREPGAARGERARGAEKDGPAGRGSQAGGAAAKRKATGGSRPGG 800

Query: 298  TPARATGGEIKGVAGLGSCGDGAGKRPARVAQQRRGACRSEKGSGCR 158
                  G +  G    G  G+G G         RR A R  +GSG R
Sbjct: 801  RGGAQRGSQAGGRRNKGRRGNGEG---------RRKAGRGGEGSGRR 838


>ref|XP_017900950.1| PREDICTED: spidroin-1-like, partial [Capra hircus]
          Length = 1095

 Score = 64.3 bits (155), Expect = 2e-07
 Identities = 87/308 (28%), Positives = 111/308 (36%), Gaps = 24/308 (7%)
 Frame = -2

Query: 1042 EQRTAR*RGSGELDLAKARATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGDGE 863
            E+R  R  GSG    A    +    AE  R    G  G A     G   G +    G G 
Sbjct: 382  ERRAGRAEGSG----AAEPESRDRGAEKKRGGRGGGSGGA-----GGGPGRQARARGPGR 432

Query: 862  TGGQGV----PGAGRSEANRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGYGGAQGRVV 695
             GG G       AGR E  R G GA  G   G G             G++G GG +GR  
Sbjct: 433  AGGGGAGRTGRAAGRGEPGRRGRGAAHGSGAGAGEQG----------GAAGPGGPRGR-- 480

Query: 694  TEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKR------------DRGRQRAARHSEQR 551
                   G  R +     G    RRG  +   +R            ++G + AAR  + R
Sbjct: 481  ------AGRARAAGTARAGAPGPRRGAASEPERRAGDHREGNADAAEQGTRAAARGGQAR 534

Query: 550  LRR------GAATGVEGAERGTGADRVDGDCG-ARRDKGCSAGPEARRDGERSREGDQG* 392
             RR        A G EG  R  GAD  D   G + R +G   G +A R  +  R G    
Sbjct: 535  SRRKGRAQKSRAGGAEGGPRHPGADSADPRAGPSSRGEG---GRQAGRGEQAGRPGGARQ 591

Query: 391  RRPATRCRRGVGMEQGCTAVARCSGWNWSDGTPARATGGE-IKGVAGLGSCGDGAGKRPA 215
            R+   R RRG    +G     R +G      T  +  G E   G    G    G G++P 
Sbjct: 592  RKETARGRRGGARGRGRRRPKRAAGGGRQRETKGKEGGAEKAGGQDNAGRHRKGTGRKPG 651

Query: 214  RVAQQRRG 191
               +Q RG
Sbjct: 652  --TKQGRG 657


>ref|XP_019173246.1| PREDICTED: hornerin-like, partial [Ipomoea nil]
          Length = 1307

 Score = 64.3 bits (155), Expect = 2e-07
 Identities = 89/314 (28%), Positives = 110/314 (35%), Gaps = 20/314 (6%)
 Frame = -2

Query: 1045 SEQRTAR*RGSGELDLAKARATAHMVAEGNRPAEQG*G--------GNARVLENGTPEGL 890
            S     R  G GE D  + +A       G + AE+  G           R    G     
Sbjct: 283  SRGERGRRSGPGERDARRGKA-----GSGGQRAERSGGKEGRKEKRAGGRAAAPGKGGAP 337

Query: 889  RLEEVGDGETGGQGVPGAGRSEANRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGYG-- 716
            + +  G G  GG    GAG  E  R   G  RG   GK A  +     E   G  G+G  
Sbjct: 338  KRQRRGRGGDGGGEERGAGGGEGRRGERGRGRGAGEGKRAQGKPRRGNEEGPGRRGHGKG 397

Query: 715  -GAQGRVVTEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRG 539
             G+ GR    G       R    GGNG+  +  G      K   G  R  R  + R R G
Sbjct: 398  SGSGGRRTAGG-------RRRQTGGNGHPRKGAGEGNTKGKGREGAPREGRQGKGRPRDG 450

Query: 538  AATGVEGAERGTGADRVDGDCGARRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGV 359
               G +GA  G G  R  G       +G + G   RR  E  R+     R    R  +G 
Sbjct: 451  DGQGRQGAGGGKGGPREGG-------RGTAKGAPRRRRREGQRKARDAARATTARGAQGQ 503

Query: 358  GMEQGCTAVARCSGWNWSDGTPARATGGEIKGV------AGLGSCGDGAGKRPARVAQQR 197
            G EQG     R +G       P   T G  KG       A  G+ G G  KR  R   ++
Sbjct: 504  G-EQG-----RPNGRAPKGKAPRATTQGAPKGKATGAQGAKTGAQGQGRRKRAPRARARK 557

Query: 196  RGACRSE---KGSG 164
             GA   E   KG G
Sbjct: 558  AGAKAGEGAPKGKG 571


>ref|XP_022006793.1| fibroin heavy chain-like [Helianthus annuus]
          Length = 6292

 Score = 64.3 bits (155), Expect = 2e-07
 Identities = 83/274 (30%), Positives = 103/274 (37%), Gaps = 5/274 (1%)
 Frame = -2

Query: 997  AKARATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGDGETGGQGVPGAGRSEAN 818
            A AR  A     G+RPA             GTP G         + GGQG    GR  A 
Sbjct: 368  APARRAAAPAGAGHRPA-------------GTPGG---PAGNTDQGGGQGARQRGRKPAR 411

Query: 817  RTGEGAVRGRLVGKGASSRRPGAAELRRGSSG----YGGAQGRVVTEGILLVGALRYSCK 650
            R       GR   +G ++ RPGAAE  RG         G +GR   +     G  R    
Sbjct: 412  RK---EAEGREPPRGGTADRPGAAEKGRGEQAGRDPKEGERGRSGAKPEETRGEARSETD 468

Query: 649  GGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAATGVEGAERGTGADRVDGDCGA 470
            G +G     RG       R   R+RA R +  R R  A    E  +RG  A   D     
Sbjct: 469  GQSGRRTGERGERRSEPARSGARRRARRRAGARGRGRAGKAKEERQRGRNAR--DRGANG 526

Query: 469  RRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSGWNWSDGTPA 290
            +R+ G +AG  A R  +R R+G QG            G + G     R  G    DG   
Sbjct: 527  KREDGAAAGGAAPR--KRERQGGQG------------GKQNGRGGRNRKPG----DGAKR 568

Query: 289  RATGGEIKGVAGLGSCGDGAGK-RPARVAQQRRG 191
            RA     KG  G      GAGK + A+  +QR G
Sbjct: 569  RANREPTKG-GGEEDSRTGAGKAKAAKEGEQRTG 601



 Score = 64.3 bits (155), Expect = 2e-07
 Identities = 84/289 (29%), Positives = 104/289 (35%), Gaps = 36/289 (12%)
 Frame = -2

Query: 961  GNRPAEQG*GGNARVLENGTPEGLRLEEVGDGETG------GQGVPGAGRSEAN-RTGEG 803
            G R   +G GG A+          R EE G+G+ G      GQ     GR     R G+G
Sbjct: 5865 GRRRGGEGEGGQAKA---------RNEEPGEGKAGASSRGEGQRQERRGRGAGKKRGGDG 5915

Query: 802  AVRGRLVGKGASSRRPGAAELRRGSSGYGGA------------------------QGRVV 695
               GR   +GA   RPG    RRG+    GA                        QGR  
Sbjct: 5916 GGGGRAASRGAPGTRPGG---RRGAPTRNGAGRAAGGGRGRKERAQSKPTRGRQEQGRTA 5972

Query: 694  TEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAATGVEGA 515
             EG    GA      GG+G   R R        R RG +R  R  EQR ++G   G E  
Sbjct: 5973 EEGGPRTGA------GGSG-GRRNRDRRGQAEGRGRGERRGGRKDEQRRKQGPRRGQEAR 6025

Query: 514  ERGTGADRVDGDCGARRDKGCSAGPEARRDGERSREGDQG-----*RRPATRCRRGVGME 350
             +G GA          RD+        RR+ +R R G  G      R P  + + G   E
Sbjct: 6026 TKGGGAK-------TSRDR-------PRRNQKRGRPGSSGGGGRTPRAPEEKAKGGGRGE 6071

Query: 349  QGCTAVARCSGWNWSDGTPARATGGEIKGVAGLGSCGDGAGKRPARVAQ 203
            +G  A     G     GT  R  GG      G G  G   G+R  R A+
Sbjct: 6072 EGRKAETERKG---RKGTTRRGAGGGRDATRGNGP-GPDRGRRDRRRAR 6116



 Score = 63.5 bits (153), Expect = 4e-07
 Identities = 89/284 (31%), Positives = 108/284 (38%), Gaps = 5/284 (1%)
 Frame = -2

Query: 994  KARATAHMVAEGNRPAEQG*GGNARVLENGTPEG--LRLEEVGDGETGGQGVPGAGRSEA 821
            +AR      AEG     +G  GN R     T EG   R  +   G   G+   G G++ A
Sbjct: 2895 EARGGGQGRAEGGSEG-RGKAGNERDRSRRTSEGEGARQGQRRRGTRAGEATEGGGKNRA 2953

Query: 820  NRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGALRYSCKGGN 641
             R GEG  R R   + A+ R PG      G  G G  +GR   +G           KGG 
Sbjct: 2954 GRPGEGRGRTRATRREATRREPGPT--GPGKRGKGSGRGR--EQG-----------KGG- 2997

Query: 640  GYWWRRRGLPAMVVKRDRGRQRAAR-HSEQRLRRGAATGVEGAERGTGADRVDGDCGARR 464
                R+RG P       RG    AR H+ +R  RG   G +G ERG G         AR 
Sbjct: 2998 ----RKRGRPRKREAAARGDDGPARGHTARRADRGKGPG-KGEERGDGGGA--RPATARE 3050

Query: 463  DKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSGWNWSDGTPARA 284
              G  AG    R   R  +G +G RR           EQG T  +R +      G  A  
Sbjct: 3051 GHGTGAG----RGRRRRNKGPEGNRR----------KEQGRTRHSRTTR---EQGNTAGR 3093

Query: 283  TGGEIKGVAGLGSCGDG--AGKRPARVAQQRRGACRSEKGSGCR 158
             G E  G  G G  GDG   G RP      R G    E+  G R
Sbjct: 3094 AGTEAGGRPG-GGRGDGRAGGTRP---GGGREGRGTKEQTPGTR 3133



 Score = 61.6 bits (148), Expect = 2e-06
 Identities = 88/291 (30%), Positives = 106/291 (36%), Gaps = 5/291 (1%)
 Frame = -2

Query: 1021 RGSGELDLAKARATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGDGETGGQGVP 842
            RG    +  +AR  A     G R   +G GG  R  +            G+G   G G  
Sbjct: 3797 RGRRAEETGEARQRA-----GARERRKGSGGKGRNRDGAAD--------GNGREAGDG-- 3841

Query: 841  GAGRSEANRTGEGAVR-GRLVGKGASSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGAL 665
            G GR +  R GE     G   G G   RR    E  R + G  GA G     G       
Sbjct: 3842 GGGRGKRERGGEQPAHPGNDAGGGRGQRR----EEHRTARGGRGAPGGPGKAGGPRAAHA 3897

Query: 664  RYSCKGGNGYWWRRRGLPAMVVKRDRGRQ-RAARHSEQR-LRRGAATGVEGAERGTGADR 491
            R   +   G     RG       R++GRQ +AA+ S  R  RRGA  G  GA   T    
Sbjct: 3898 RADAEPQQGAKGNSRGA------REQGRQGKAAKGSRNRGKRRGAEGGARGARARTRRRA 3951

Query: 490  VDGDCGARRDKGCSAGPEARR--DGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSG 317
             DG  G R + G    P A R  DGER R G        T+   G G   G    +R  G
Sbjct: 3952 GDGASGGRGESGA---PRAGRGTDGERPRRGPAPGGEQPTQNGDGQGEADGEGKQSRARG 4008

Query: 316  WNWSDGTPARATGGEIKGVAGLGSCGDGAGKRPARVAQQRRGACRSEKGSG 164
                    A A GG  +   G G  G+ A     R    R+GA    KG G
Sbjct: 4009 -----PAEANARGGRPRARKGKGKKGNQARGNGGRNEDARKGA----KGRG 4050



 Score = 60.1 bits (144), Expect = 5e-06
 Identities = 95/318 (29%), Positives = 123/318 (38%), Gaps = 35/318 (11%)
 Frame = -2

Query: 997  AKARATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRL--EEVGDGE-TGGQGVPGAGRS 827
            A AR  A     G+RPA    GG A   + G  +G     EE    + +GG G P  G  
Sbjct: 3483 APARRAAAPAGAGHRPAGTP-GGPAGNTDQGGGQGGEATGEETRKAQGSGGAGAPAGGAP 3541

Query: 826  EANRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGALRYSCKG 647
              +R      RG   G G +    G     RG +  G A G     G    G  + +  G
Sbjct: 3542 PTDRERRRRGRG---GAGRAGPERGGTRPERGEAR-GNAGG-----GPQRDGRAKRAADG 3592

Query: 646  GNGYWWRRRGL---PAMVVKRD---RGRQRAARHSEQRLR------RGAATGVEGAERGT 503
            G G   +R G    PA   +R    RGR RA +  E+R R      RGA  G EG  RG 
Sbjct: 3593 GEGRKTKRTGEERGPAEGARRRAGARGRGRAGKAKEERQRGRNARDRGA-NGKEGGRRGG 3651

Query: 502  GADRVD------GDCG----ARRDKGCSAG-------PEARRDGERSRE--GDQG*RRPA 380
            G  R        G  G    ARR++  + G       P   + G R R+  G QG R+P 
Sbjct: 3652 GGSRATEARAPRGPGGEAERARREEPEAGGRGQTAREPRTHKGGGRGRQQDGGQGRRKPR 3711

Query: 379  TRCRRGVGME-QGCTAVARCSGWNWSDGTPARATGGEIKGVAGLGSCGDGAGKRPARVAQ 203
             R  +  G   +     AR      +D  PAR   G  +     G+   GAG R  R A 
Sbjct: 3712 RRGEQRTGRSNEPRKGRARKRA---TDTRPARQGPGPEEEEGAAGAAKPGAGARAERPAG 3768

Query: 202  QRRGACRSEKGSGCRCQR 149
            +  G  +  KG   R +R
Sbjct: 3769 RSGGEEQRGKGERGRPKR 3786



 Score = 59.7 bits (143), Expect = 7e-06
 Identities = 63/226 (27%), Positives = 81/226 (35%), Gaps = 6/226 (2%)
 Frame = -2

Query: 874  GDGETGGQGVPGAGRSEANRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGYGGAQGRVV 695
            G G  G +      R    + GE A RG   G G    +  AA  RR   G GG  G   
Sbjct: 2274 GGGREGNRRAAQPARKGEGKRGEAAERGGRAGGGGGHAQERAATQRRAQQGGGGRGGTEA 2333

Query: 694  TEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAATG---V 524
              G    G      +G      R+ G       + R +    R  E+R       G    
Sbjct: 2334 ARG----GGAGRKGRGQEAGEPRKGGGSGEREPKGRHQTGERRQGEKRAGNPKPRGRPEE 2389

Query: 523  EGAERGTGADRVDGDCGAR--RDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVGME 350
            +G  +G G  + + + G +  R  G +AG +A R G  SR GD+G      R   G G  
Sbjct: 2390 DGGAQGAGRAKTEEEAGKKGTRGAGRAAGRQAGRGGPGSRAGDRGDGTGGGRGPEGAGAA 2449

Query: 349  QGCTAVARCSGWNWSDGTPARATGGEIKGVAGLGS-CGDGAGKRPA 215
            +G                  RA G E  G AG GS  G  A KR A
Sbjct: 2450 RG-----------------ERARGAEKDGPAGRGSQAGGAAAKRKA 2478


>gb|PEN20488.1| hypothetical protein CRM93_14350, partial [Acetobacter fabarum]
          Length = 306

 Score = 62.8 bits (151), Expect = 2e-07
 Identities = 77/272 (28%), Positives = 106/272 (38%), Gaps = 34/272 (12%)
 Frame = -2

Query: 877 VGDGETGGQGVPGAGRSEANRTGE-----GAVRGRLVGKGASSRRPGAAELRRGSSGYGG 713
           VG G+ GG G  G  +    RTG+     G   GR    G   R+ G    ++     GG
Sbjct: 7   VGGGKVGGGGEGGGKKGRNKRTGKKKRKRGKKEGREEKGGGRRRKEGEKRKKKKREKKGG 66

Query: 712 AQGRVVTEGILLV-----GALRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRL 548
            +G+   E          G      KG      RRR       KR  G++   R  E+R 
Sbjct: 67  KEGKGRRERKKKKKGEEGGRKGEKEKGKRKKKKRRREREKKEKKRKGGKKEEGRGIERRR 126

Query: 547 RRGAATGVEGAERGTGADRV-DGDCGARRD-------------KGCSAGPEARRDGER-- 416
           RR    G EG ERG G  +   G+ G RR+             K    G + +R+GE+  
Sbjct: 127 RR-KGKGKEGKERGKGRRKERKGEEGGRREGKKGKKRKKEKRKKKERKGRKKKREGEKRG 185

Query: 415 ----SREGDQG*RRPATRCRRGVGM----EQGCTAVARCSGWNWSDGTPARATGGEIKGV 260
                +EG++   R   R R G G     E+G     +  G    +G   R  G + KG 
Sbjct: 186 RKEGEKEGEREKERKKRRGREGRGRGGRREEGGR---KKKGGGGKEGKGRRGGGKKEKGK 242

Query: 259 AGLGSCGDGAGKRPARVAQQRRGACRSEKGSG 164
            G G  G+  G+R  R  ++RRG     KG G
Sbjct: 243 GGEGR-GEKRGRRKERKKKERRGKEEGRKGRG 273


>gb|AHM06838.1| Pe-pgrs family protein [Mycobacterium bovis BCG str. ATCC 35743]
          Length = 577

 Score = 63.5 bits (153), Expect = 3e-07
 Identities = 74/258 (28%), Positives = 88/258 (34%), Gaps = 16/258 (6%)
 Frame = -2

Query: 874  GDGETGGQG-------VPGAGRSEAN--RTGEGAVRGRLVGKGASSRRPGAAELRRGSSG 722
            GDG  GG G       + GAG +  N    G G   G   G G +    G         G
Sbjct: 281  GDGGAGGAGGSGGHALLWGAGGAGGNGGSGGTGGAGGSTAGAGGNGGAGGGGGTGGLLFG 340

Query: 721  YGGAQGRVVTEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRR 542
             GGA G     G  L      S  GG G  W RRG    V         A   +      
Sbjct: 341  NGGAGGHGAAAGNGLAAGNGVSSSGGGGARWDRRGPVGTVAP-------AGPEATPGCGA 393

Query: 541  GAATGVEGAERGTGADRVDGDCGARRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRG 362
              A G  G + G G     G  G   +    AG ++ R G     G+ G    A     G
Sbjct: 394  SVAPGGAGGDGGAGGAGGKGGSGLSGNANGGAGGDSGRGGTGGAGGEGG----AAGLLVG 449

Query: 361  VGMEQGCTAVARCSGWNWSDGTPARATG-------GEIKGVAGLGSCGDGAGKRPARVAQ 203
             G   G    A  +     DG  A  TG       G   G  G G  G G G+RP RVA 
Sbjct: 450  TGGHGG-DGGAGGAAVKGGDGGAAAGTGIAGAGGRGGAGGSGGSGGDGGGRGRRPRRVAV 508

Query: 202  QRRGACRSEKGSGCRCQR 149
            +R    R  +G G R +R
Sbjct: 509  RRWRGWRERRGRGRRRRR 526


>gb|OCI25394.1| hypothetical protein BBP15_24240, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis]
          Length = 258

 Score = 62.0 bits (149), Expect = 3e-07
 Identities = 77/250 (30%), Positives = 96/250 (38%), Gaps = 11/250 (4%)
 Frame = -2

Query: 874 GDGETGGQGVPGAGRSEANRTGEGAVRGRLVGKGASSRRPGAAELRRGSSGYGGAQGRVV 695
           G G  GG G  G G  E  R GEG       G+G   +R G  E R G  G GG  G   
Sbjct: 11  GGGGGGGGGRGGGGEGEGER-GEGGEGEEEEGRGGGRKRKGGGE-RGGGGGEGGGGGGGG 68

Query: 694 TEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAATGVEGA 515
            EG       R   KG  G   +RR       K++ G ++ +       R+G   G EG 
Sbjct: 69  KEG-------REERKGRGGGGKKRR------EKKEGGGKKGSGGEGGGERKGEKEGREGG 115

Query: 514 ERGTGADRVDGDCGARR-DKGCSAGPEARRDGE------RSREGDQG*RRPATR---CRR 365
           E G G     G  G RR  +G   G E    GE      R R G +G  R  +R     R
Sbjct: 116 EGGGGR----GKKGERRGGEGRGKGGEKEMKGEGKRKKKRERRGKRGKERKDSREEGRER 171

Query: 364 GVGMEQGCTA-VARCSGWNWSDGTPARATGGEIKGVAGLGSCGDGAGKRPARVAQQRRGA 188
           G G E+G      R  G     G  +R  G E +G  G    G   GK      ++  G 
Sbjct: 172 GRGEERGREGKEGRREGREKGGGKDSRREGREGRGKEG----GRKEGKEGGEGRRRGGGG 227

Query: 187 CRSEKGSGCR 158
            R ++G G R
Sbjct: 228 RRKKRGGGGR 237


>ref|XP_021996405.1| hornerin-like [Helianthus annuus]
          Length = 3695

 Score = 63.9 bits (154), Expect = 3e-07
 Identities = 100/330 (30%), Positives = 120/330 (36%), Gaps = 35/330 (10%)
 Frame = -2

Query: 1048 RSEQRTAR*----------RGSGELDLAKARATAHMVAEGNRPA--EQG*GGNA------ 923
            R E RTAR            G      A+A A     A+GN     EQG  G A      
Sbjct: 1438 REEHRTARGGRGAPGGPGKAGGPRAAHARADAEPQQGAKGNSRGAREQGRQGKAAKGSRN 1497

Query: 922  RVLENGTPEGLR------LEEVGDGETGGQGVPGAGRSEANRTGEGAVRGRLVGKGASSR 761
            R    G   G R          GDG +GG+G  GA R+     GE   RG          
Sbjct: 1498 RGKRRGAEGGARGARARTRRRAGDGASGGRGESGAPRAGRGTDGERPRRGPA-------- 1549

Query: 760  RPGAAELRRGSSGYGGAQG-RVVTEGILLVGALRYSCKGGNGYWWRRRGLPAMVVKRD-R 587
             PG  +  +   G G A G R   +G    G  R           R    P+   +R+ R
Sbjct: 1550 -PGGEQPTQNGDGQGEADGGRKTKQGEGPGGGERKGRAAQGAEGQREERQPSAGKRREER 1608

Query: 586  GRQRAARHSEQRLRRGAATGVEGAERGTGAD------RVDGDCGARRDKGCSAGPEARRD 425
            GR   ++   QR RR A  G      G G +      R       R  K   +G   RR 
Sbjct: 1609 GRAEGSQRPRQR-RRDAQEGRNERPTGPGDEPAKPQPRERAGQKQRGKKTRGSGTRGRR- 1666

Query: 424  GERSREGDQG*RRPA---TRCRRGVGMEQGCTAVARCSGWNWSDGTPARATGGEIKGVAG 254
            GER+ EG++    PA    R RRG G   G  A  R  G N   G P  A  G      G
Sbjct: 1667 GERTGEGEEK-GEPAGERERPRRGTG---GTEAGKRKRG-NAPRGDPRPAAAGRAGRKTG 1721

Query: 253  LGSCGDGAGKRPARVAQQRRGACRSEKGSG 164
             G  G GA +R    A  RRGA R +K  G
Sbjct: 1722 AGGEGAGAAQRGKENAGGRRGAQREQKARG 1751



 Score = 60.5 bits (145), Expect = 4e-06
 Identities = 87/308 (28%), Positives = 112/308 (36%), Gaps = 12/308 (3%)
 Frame = -2

Query: 1039 QRTAR*RGSGELDLAKARATAHMVAEGNRPAEQG*GGNAR-----VLENGTPEGLRLEEV 875
            +R +R R  G+    +AR        GN PA +  GG  R        +G P G R  E 
Sbjct: 867  RRGSRARSGGKETDEEARRNGERT--GNSPAGKAGGGGGRRGGGRSGRSGGP-GPRPREG 923

Query: 874  GDGETGGQGVPGAGRS--EANRTGEGAVRGRLVGK-GASSRRPGAAELRRGSSGYGGAQG 704
            G GE    G PG GR+     R G G    +  GK G   RR G  E  R +   G  +G
Sbjct: 924  GAGEGESPGGPGPGRTTRRGRRGGGGEGSPKRAGKAGQGERRAGDREQTRTARERGKGRG 983

Query: 703  RVVTEGILLVGA-LRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAATG 527
            +    G    G   +   +GG G      G P     R  GR+  A  ++   R G   G
Sbjct: 984  KERQRGRERGGREAKGGRRGGPGGGGTGEGGPPSGAGRGPGRRGTAAKAQAGERPGERPG 1043

Query: 526  VEGAERGTGADRVDGDCGARRDKGCSAGPEARRDGERSREGDQG*R---RPATRCRRGVG 356
               AERG  A R  G    R        P+A R   R++E D+G R   R   R +   G
Sbjct: 1044 ---AERGGSARRHGGRRHRRAPGTGRRAPQAARRETRTKEADRGARQRGRKPARRKEAEG 1100

Query: 355  MEQGCTAVARCSGWNWSDGTPARATGGEIKGVAGLGSCGDGAGKRPARVAQQRRGACRSE 176
             E                  P R       G  G G  G+ AG+ P    + R GA   E
Sbjct: 1101 RE------------------PPRGGHRRPTGSGGEGGEGEQAGRDPKEGERGRSGAKPEE 1142

Query: 175  KGSGCRCQ 152
                 R +
Sbjct: 1143 TRGEARSE 1150



 Score = 59.7 bits (143), Expect = 7e-06
 Identities = 78/283 (27%), Positives = 100/283 (35%), Gaps = 4/283 (1%)
 Frame = -2

Query: 985  ATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGDGETGGQGVPGAGRSEANRTGE 806
            AT    A G R  E    G A    +GT          +G    +G  G GR E  R   
Sbjct: 3169 ATRRTAAGGPRRGEPKQRGGAGGPTDGT----------EGARASKGAGGGGRREGRRGTR 3218

Query: 805  GAVRGRLVGKGASSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGALRYSCKGGNGYWWR 626
               RG   G+ A    PG+    RG  G GG QG          G  R    G      R
Sbjct: 3219 KRARGTAAGRQAGRGGPGSRAGDRG-DGTGGGQG---------AGGSRGRPAGSGRAARR 3268

Query: 625  RRGLPAMVVKRDRGRQRAARHSEQRLRRGAATGVEGAERGTGADRVDGDCGARRDKGCSA 446
            R   P    +R+  RQR  R      +   A   E A +G G  + DG+      K    
Sbjct: 3269 RTAQPEGEARREGQRQRGRRPGGAG-QEAEAGRKEEARQGEGGTKADGETAKGAGKQGEG 3327

Query: 445  GPEARRDGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSGWNWSDGTPARATGGEIK 266
            G  + R GE      +  ++  T  R     E+G    AR                GE  
Sbjct: 3328 GEGSGRRGEPEEGESEDGQQTPTAGREAGRREKGDAGGAR-----------GGEEAGERD 3376

Query: 265  GVAGLGSCGDGAGK-RPARVAQQRRGACRSEKGS---GCRCQR 149
            G  G G+ GD AGK RP + A+   G  R  +G+   G R +R
Sbjct: 3377 GAHGPGA-GDAAGKGRPEQRARGGEGPTRGGRGTEKEGARPER 3418


>ref|XP_022026231.1| filaggrin-like, partial [Helianthus annuus]
          Length = 6549

 Score = 63.5 bits (153), Expect = 4e-07
 Identities = 96/355 (27%), Positives = 121/355 (34%), Gaps = 55/355 (15%)
 Frame = -2

Query: 1048 RSEQRTAR*RGSGELDLAKARATAHMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGD 869
            +   R  + RG  +      + TA     G RP E+G GG  +    G PEG      G 
Sbjct: 2412 KRRDRNRQERGRKQKSAKPQQTTAEGQERGGRPRERGKGGRKKKAGAG-PEGPGPGAGGP 2470

Query: 868  GETGGQGVP-------------GAGRSEANRTGEGAVRGRLVGKGA-------------- 770
            G  GG+G P             G GR +  R G G       G+GA              
Sbjct: 2471 GR-GGRGGPEPQERTAEQPETDGRGRPKEGRAGHGPRERGKTGRGAGGPEGRGEPEPEQT 2529

Query: 769  -----------SSRRPGAAELRRGSSGYG----GAQGRVVTEGILLVGALRYSCKGGNG- 638
                          R GA E R G   +G      +G    E             GG G 
Sbjct: 2530 RAERHETAGSQQDNRTGAGERRSGRRAHGPRARDGRGTGAAEHHRTTPERHERDTGGTGR 2589

Query: 637  YWWRRRGLPAMVVKRDRGRQRAARHSEQR------LRRGAATGVEGAERGTGADRVDGDC 476
                 +G       + RGR R AR   +R       RRGA+T    A  G GA       
Sbjct: 2590 EGGAGQGGTGGATGQGRGRARKARSDTERGQGARDRRRGASTTAR-ARGGGGAGAGPRGG 2648

Query: 475  GARRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSGWNWSDGT 296
            G  R +G  A P    +G+R R G++G    A   RRG G  +G  A     G     G 
Sbjct: 2649 GRERPRGRGATP----NGDRRRTGERGTTARARAGRRGPGRARGAGA-----GAGPGAGA 2699

Query: 295  PARA-TGGEIKGVAGLGSC-----GDGAGKRPARVAQQRRGACRSEKGSGCRCQR 149
            P R    G  +G  G         G G G      A Q +G+  S  GSG R +R
Sbjct: 2700 PRRGRRDGHAQGPGGRPGARQRDRGRGTGGTTGAGAGQGQGSGPSAPGSGARARR 2754



 Score = 59.3 bits (142), Expect = 9e-06
 Identities = 84/296 (28%), Positives = 99/296 (33%), Gaps = 27/296 (9%)
 Frame = -2

Query: 976  HMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGDGETGGQGVPGAGRSEANRTGEGAV 797
            H    G+ P   G G  AR    G     +     +   GG G    GR+  +  G G  
Sbjct: 5707 HRQGRGSGPGRHGAGQRAR----GGGRARKARSDTERGQGGTGQADGGRARRHGAGRGEG 5762

Query: 796  RG---------RLVGKGAS----SRRPGAAELRR----GSSGYGGAQGRVVTEGILLVGA 668
            RG         R  G+GA+     RR G+   R     G  G GG +GR    G    GA
Sbjct: 5763 RGGPEGRAAGERPRGRGATPNGDRRRTGSGARRHGRGAGGGGRGGPEGRGRGAG---PGA 5819

Query: 667  LRYSCKGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAATGVE---GAERGTGA 497
                   G G    R G P        GR+R         RRGA  G +   G      A
Sbjct: 5820 GAPRRGAGTGTRRGRGGRPGGQTAGTGGRERGE-------RRGAGAGQDKAAGRAHRAAA 5872

Query: 496  DRVDGDCGARRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSG 317
                   G R   G  A    R  G R   GD+  R PA R R G G  +G       S 
Sbjct: 5873 PGTTRTAGGRTRTGNKAARGTRGAGARHGRGDKA-RGPAGRERGGSGKARGQRGRDTASR 5931

Query: 316  WNWSDGTPARATG-------GEIKGVAGLGSCGDGAGKRPARVAQQRRGACRSEKG 170
                  TPAR  G        E  G  G G    G GK   R  Q + G   + KG
Sbjct: 5932 RGRGRATPARTPGEGAPRPPRETGGGKGKGQEQTGRGKGAKRPDQGKAGRANTGKG 5987


>emb|CLM27435.1| PE-PGRS family protein [Mycobacterium tuberculosis]
          Length = 796

 Score = 63.2 bits (152), Expect = 4e-07
 Identities = 76/277 (27%), Positives = 94/277 (33%), Gaps = 11/277 (3%)
 Frame = -2

Query: 979  AHMVAEGNRPAEQG*GGNARVLENGTPEGLRLEEVGDGETGGQGVPGAGRSEANRTGEGA 800
            AH  + G      G GG   ++ NG          G G  GG G PGA  S  +  G G 
Sbjct: 465  AHFSSGGKAGGNGGAGGAGGLVGNG----------GAGGAGGNGAPGAPPSGGDPNGGGG 514

Query: 799  VRGRLVGKGASSRRPGAAELRRGSSGYGGAQGRVVTEGILLVGALRYS-----------C 653
              G   GKG      G    + G  G GGA G+    G    GA   +            
Sbjct: 515  GAGGAGGKG------GDGGAQAGDGGAGGAGGKGGNGGNGATGATGLNGLGAGADGTDGG 568

Query: 652  KGGNGYWWRRRGLPAMVVKRDRGRQRAARHSEQRLRRGAATGVEGAERGTGADRVDGDCG 473
            KGGNG      G          G+  AA H +  +  G A G  G   G G D  +G  G
Sbjct: 569  KGGNGGAGGGGGAGG-----QGGKALAATHQDGSMGAGGAGG-NGGAGGMGGDGGNGAKG 622

Query: 472  ARRDKGCSAGPEARRDGERSREGDQG*RRPATRCRRGVGMEQGCTAVARCSGWNWSDGTP 293
               + G   G      G R   G  G     +    G    +G T  +  +G    +G  
Sbjct: 623  TFDNGGDGVGGNGGNGGSRGIGGAGGIGGAGSTA--GADGARGATPTSGGNGGTGGNGAN 680

Query: 292  ARATGGEIKGVAGLGSCGDGAGKRPARVAQQRRGACR 182
            A   GG   G  G G  G   G    R  + R G CR
Sbjct: 681  ATVAGG-AGGAGGKGGNGGLVGNGGGRQRRGRHGRCR 716


Top