BLASTX nr result

ID: Mentha25_contig00015764 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00015764
         (761 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU32418.1| hypothetical protein MIMGU_mgv1a024723mg, partial...   100   6e-19
gb|ETW56319.1| hypothetical protein PFUGPA_01684 [Plasmodium fal...    67   5e-09
ref|XP_004224924.1| hypothetical protein PCYB_144050, partial [P...    65   3e-08
gb|ETB58266.1| hypothetical protein YYC_03920 [Plasmodium yoelii...    63   1e-07
ref|WP_016085163.1| LPXTG-domain-containing protein cell wall an...    63   1e-07
ref|WP_016084301.1| LPXTG-domain-containing protein cell wall an...    63   1e-07
ref|XP_003704003.1| PREDICTED: uncharacterized protein LOC100877...    63   1e-07
gb|ETW48435.1| hypothetical protein PFMALIP_03520 [Plasmodium fa...    63   1e-07
ref|XP_003636096.1| hypothetical protein MTR_027s0011 [Medicago ...    62   2e-07
emb|CCA37660.1| Cell surface glycoprotein 1 [Komagataella pastor...    62   2e-07
ref|XP_002490879.1| Mucin-like protein [Komagataella pastoris GS...    62   2e-07
ref|XP_001911208.1| hypothetical protein [Podospora anserina S m...    62   2e-07
ref|WP_001948897.1| hypothetical protein [Helicobacter pylori] g...    62   2e-07
ref|WP_008389087.1| hypothetical protein [Halosarcina pallida] g...    62   2e-07
ref|WP_002205908.1| hypothetical protein [Helicobacter pylori] g...    62   2e-07
ref|XP_002094035.1| GE20417 [Drosophila yakuba] gi|194180136|gb|...    62   2e-07
ref|XP_001311646.1| hypothetical protein [Trichomonas vaginalis ...    62   2e-07
ref|XP_001308786.1| hypothetical protein [Trichomonas vaginalis ...    62   2e-07
gb|EYU41205.1| hypothetical protein MIMGU_mgv1a002907mg [Mimulus...    62   3e-07
ref|XP_744668.1| hypothetical protein [Plasmodium chabaudi chaba...    62   3e-07

>gb|EYU32418.1| hypothetical protein MIMGU_mgv1a024723mg, partial [Mimulus
           guttatus]
          Length = 577

 Score =  100 bits (249), Expect = 6e-19
 Identities = 75/235 (31%), Positives = 121/235 (51%), Gaps = 3/235 (1%)
 Frame = +2

Query: 50  NKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITESDNKEPINTETPAEAQTV 229
           +K  +E  D+  V  E++DQ    +EA +P +        + SDN+E I+TET AE QTV
Sbjct: 60  DKCREEDIDETNVLAELKDQANSNSEAYEPTEGPVSEKINSGSDNEELISTETGAEEQTV 119

Query: 230 DTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGKDVEERPNENVGEK-PEEI 406
             SAVE   DVK EV E +  +  +KE++ T+N + + +  K+VEE+PNE+  E+ P++I
Sbjct: 120 SRSAVEKQTDVKNEVFEQNVTNGQLKEEKTTENGLDSGDEVKNVEEKPNEDAAEESPKDI 179

Query: 407 AKD-TCNTEYVA-SESMVENADFGNVSTENGLLPTEISATKDPQGNASSENTLLPTEISA 580
             + T   E+VA  E  +E AD GN  T +  LPT++SA + PQ   + ++     E+  
Sbjct: 180 IMEGTDYVEHVAGGEKTMEKADAGNNFTGDVQLPTDLSAIEAPQIQNAEQD-----EVQQ 234

Query: 581 TKDPQGNASSENSLLPTEISATKDPQEQEAKLGDVRPSEDEEMEDAVHFDNKQGA 745
            KD    A+++     T I   ++   +           D  +E A   D +  +
Sbjct: 235 GKDEYMKAAADLDKSKTSIDDIQEHGAETVLASRGIVENDNSVETAAKLDGQDSS 289


>gb|ETW56319.1| hypothetical protein PFUGPA_01684 [Plasmodium falciparum Palo
            Alto/Uganda]
          Length = 1722

 Score = 67.4 bits (163), Expect = 5e-09
 Identities = 66/262 (25%), Positives = 123/262 (46%), Gaps = 24/262 (9%)
 Frame = +2

Query: 2    DEVEEMNKSIDEVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGI------ 163
            +E+ E++KS+ E E + K     ++K++      ++ + +   ++ I E D  +      
Sbjct: 263  EEIAEVDKSVIE-EAVEKQGSVTEEKVQEGVSAIEEIEEIESVTEEIAEEDKSVIEEAVE 321

Query: 164  ---TITESDNKEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDS-GVKEDEPTKND 331
               ++TE +  E +  E   E ++V   AVE    V +E+VE ++ D+  V ED+    D
Sbjct: 322  KQGSVTEDEEIESVTEEIAEEDKSVIEEAVEKQGSVTEEIVEEEELDTEEVLEDKSVTGD 381

Query: 332  VV-NENHGKDVEERPNENVGEKPEE---IAKDTCNTEYVA----SESMVENADFGNVSTE 487
            VV  E  GKD E    E+  E+ +E   + ++   TEY++     ES  E      +S  
Sbjct: 382  VVEQEGSGKD-ESEAKESFTEEVDELKSVKEEDQETEYISREIEEESATEQHSEQELSIN 440

Query: 488  NGLLPTEISATKDPQGNASSENTLLPTEISATKDPQGNASSENSLLPTEISATKDPQEQE 667
              ++ TE S TKD +   S+   +L    S  ++        + +L  ++S +++  E++
Sbjct: 441  KEVVETE-SLTKDIEEEKSTTQEILEETQSVNEEIVEEERDTDEVLKEKVSPSEEVIEEQ 499

Query: 668  A----KLGDVRPSEDE--EMED 715
            A    +  + R S DE  E+ED
Sbjct: 500  ASTTEEFVEERSSTDEIVEVED 521


>ref|XP_004224924.1| hypothetical protein PCYB_144050, partial [Plasmodium cynomolgi
            strain B] gi|389586248|dbj|GAB68977.1| hypothetical
            protein PCYB_144050, partial [Plasmodium cynomolgi strain
            B]
          Length = 1031

 Score = 65.1 bits (157), Expect = 3e-08
 Identities = 66/245 (26%), Positives = 109/245 (44%), Gaps = 4/245 (1%)
 Frame = +2

Query: 2    DEVEEMNKSIDEVEEMNKSIDEVKDKI-EVKTEVEDQTKPVNEASDPIKESDCGITITES 178
            +E EE+N+   EV E+ + ++EV +++ EV  EV +  + VNE S+ +KE D  +     
Sbjct: 573  EEKEEVNEEKGEVNEVAEGVNEVAEEVNEVAEEVNEVAEEVNEVSEEVKEVDEDV----K 628

Query: 179  DNKEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGK- 355
            + KE +N E   + Q  +    E + + K+E  E D+ +  +  D   + + VNE  G+ 
Sbjct: 629  EEKEEVNEEKEEKEQVKEVD--EEVKEEKEEKKEMDEEEKEMDADVKEEKEEVNEEKGEV 686

Query: 356  -DVEERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEISATKDPQ 532
             +V E  NE   EK E         + VA E+  E  +   V+ E      E +   + +
Sbjct: 687  NEVAEEVNEEKDEKEE--------VKEVAEEANAEKEESNEVAEEANAEKEEANEVVE-E 737

Query: 533  GNASSENTLLPTEISATKDPQGNASSENSLLPTEIS-ATKDPQEQEAKLGDVRPSEDEEM 709
             NA  E      E+S     + NA  E +    E +    D +E   +  DV   E  E 
Sbjct: 738  ANAEKEEA---NEVSE----EANAEKEEANEVAEANEIATDAKEVVDENEDVAAEEPRE- 789

Query: 710  EDAVH 724
             D +H
Sbjct: 790  HDMLH 794


>gb|ETB58266.1| hypothetical protein YYC_03920 [Plasmodium yoelii 17X]
          Length = 834

 Score = 63.2 bits (152), Expect = 1e-07
 Identities = 45/165 (27%), Positives = 82/165 (49%), Gaps = 14/165 (8%)
 Frame = +2

Query: 2   DEVEEMNKSIDEVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITESD 181
           +EVEE+ +  +EVEE  + ++E  +++E + EVE++ + + E  + I+E +    I E +
Sbjct: 218 EEVEEVEEEYEEVEE-EEEVEEEYEEVEEEEEVEEEHEEIEEDDEVIEEEEDAEVIEEEE 276

Query: 182 NKEPINTETPAEA-QTVDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGKD 358
           + E I  E   E  +  D   +E   +   EV+E ++ D  ++E+E   ++++ E   +D
Sbjct: 277 DDEVIEEEEDDEVIEEEDDEVIEE--EEDDEVIEEEEDDEVIEEEEDDDDEIIGEEDEED 334

Query: 359 ---VEERPNENVGEKPEEIAKD-----TC-----NTEYVASESMV 454
               +E        K EE+ KD      C     N EY+  ES V
Sbjct: 335 EMGEKESTKSVTNNKNEELMKDKENEIKCNIVMDNKEYIKKESEV 379


>ref|WP_016085163.1| LPXTG-domain-containing protein cell wall anchor domain, partial
           [Bacillus cereus] gi|500344560|gb|EOO62997.1|
           LPXTG-domain-containing protein cell wall anchor domain,
           partial [Bacillus cereus BAG1X2-3]
          Length = 198

 Score = 63.2 bits (152), Expect = 1e-07
 Identities = 49/150 (32%), Positives = 76/150 (50%), Gaps = 6/150 (4%)
 Frame = +2

Query: 11  EEMNKSIDEVEEMNKSIDE-VKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITESDNK 187
           EE+ + + EVEE  + + E VK+  E K EV++ TK V EA + +KE    I   + + K
Sbjct: 24  EEVKEPVKEVEETKEEVKEPVKEVEETKEEVKEPTKEVEEAKEEVKEPTKEIEEAKEEIK 83

Query: 188 EPINTETPAEAQTVDTS-----AVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHG 352
           EPI     A+ +  + +     A E + +  KEV E  +  +G+ + EP +N+ V EN G
Sbjct: 84  EPIKEVEEAKEEVKEPTKEIEEAKEEVKEPTKEVEEVKESATGL-DQEPKENNQVVENEG 142

Query: 353 KDVEERPNENVGEKPEEIAKDTCNTEYVAS 442
           +      N+    KPEE  K   +T   AS
Sbjct: 143 RKANTL-NKQHANKPEEGKKSLPSTGGEAS 171


>ref|WP_016084301.1| LPXTG-domain-containing protein cell wall anchor domain [Bacillus
            cereus] gi|500329102|gb|EOO48306.1|
            LPXTG-domain-containing protein cell wall anchor domain
            [Bacillus cereus BAG1X2-2] gi|500334256|gb|EOO53374.1|
            LPXTG-domain-containing protein cell wall anchor domain
            [Bacillus cereus BAG1X2-1] gi|500393826|gb|EOP09830.1|
            LPXTG-domain-containing protein cell wall anchor domain
            [Bacillus cereus BAG2O-1]
          Length = 993

 Score = 63.2 bits (152), Expect = 1e-07
 Identities = 49/150 (32%), Positives = 76/150 (50%), Gaps = 6/150 (4%)
 Frame = +2

Query: 11   EEMNKSIDEVEEMNKSIDE-VKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITESDNK 187
            EE+ + + EVEE  + + E VK+  E K EV++ TK V EA + +KE    I   + + K
Sbjct: 819  EEVKEPVKEVEETKEEVKEPVKEVEETKEEVKEPTKEVEEAKEEVKEPTKEIEEAKEEIK 878

Query: 188  EPINTETPAEAQTVDTS-----AVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHG 352
            EPI     A+ +  + +     A E + +  KEV E  +  +G+ + EP +N+ V EN G
Sbjct: 879  EPIKEVEEAKEEVKEPTKEIEEAKEEVKEPTKEVEEVKESATGL-DQEPKENNQVVENEG 937

Query: 353  KDVEERPNENVGEKPEEIAKDTCNTEYVAS 442
            +      N+    KPEE  K   +T   AS
Sbjct: 938  RKANTL-NKQHANKPEEGKKSLPSTGGEAS 966



 Score = 57.0 bits (136), Expect = 7e-06
 Identities = 43/153 (28%), Positives = 68/153 (44%), Gaps = 11/153 (7%)
 Frame = +2

Query: 11   EEMNKSIDEVEEMNKSIDE-VKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITESDNK 187
            EE+ +   EVEE  + + E  K+  E K EV++ TK V EA + +KE    +  T+ + K
Sbjct: 763  EEVTEPTKEVEEAKEEVKEPTKEVEEAKEEVKEPTKEVEEAKEEVKEPTKEVEETKEEVK 822

Query: 188  EPIN--TETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKE--------DEPTKNDVV 337
            EP+    ET  E +       E   +VK+   E ++    VKE         E  K  + 
Sbjct: 823  EPVKEVEETKEEVKEPVKEVEETKEEVKEPTKEVEEAKEEVKEPTKEIEEAKEEIKEPIK 882

Query: 338  NENHGKDVEERPNENVGEKPEEIAKDTCNTEYV 436
                 K+  + P + + E  EE+ + T   E V
Sbjct: 883  EVEEAKEEVKEPTKEIEEAKEEVKEPTKEVEEV 915


>ref|XP_003704003.1| PREDICTED: uncharacterized protein LOC100877061 [Megachile rotundata]
          Length = 3812

 Score = 63.2 bits (152), Expect = 1e-07
 Identities = 64/259 (24%), Positives = 102/259 (39%), Gaps = 19/259 (7%)
 Frame = +2

Query: 2    DEVEEMNKSIDEVEEMNKSIDEVKDKIEV-KTEVEDQTKPVNEASDPIKESDCGITITES 178
            +E  E+  +   VEE    +   +  +E  K EV+    PV E    ++ ++  +   E 
Sbjct: 1384 EEKPEVQPTEAPVEEEKPEVQPTEAPVEEEKPEVQPTEAPVEEEKPEVQPTEAPV---EE 1440

Query: 179  DNKEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGKD 358
            +  E   TE P E +  +    E   + +K   E    +  V+E++P +   V       
Sbjct: 1441 EKPEVQPTEAPVEEEKPEVQPTEAPVEEEKPTEEVQPTEGPVEEEKPQEK--VKPTEAPV 1498

Query: 359  VEERPNENVG-------EKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEISA 517
             EE+P E V        EKPEE AK T   E  A E   E         E      E+  
Sbjct: 1499 EEEKPEEEVKSTEAPAEEKPEEEAKAT---EAPAEEKPEEEVKSTEAPAEEMKPEEEVKP 1555

Query: 518  TKDPQGNASSENTLLPTE-----------ISATKDPQGNASSENSLLPTEISATKDPQEQ 664
            T+ P      +  + PTE           +  T+ P      E  + PTE  A ++  E+
Sbjct: 1556 TEVPVEEEKPQEEVKPTEMPVEEEKPQEEVKPTEAPAEEEKPEVEVQPTEAPAEEEKPEE 1615

Query: 665  EAKLGDVRPSEDEEMEDAV 721
            EAK  +  P E+E+ ++ V
Sbjct: 1616 EAKATEA-PVEEEKPQEEV 1633


>gb|ETW48435.1| hypothetical protein PFMALIP_03520 [Plasmodium falciparum
            MaliPS096_E11]
          Length = 1806

 Score = 62.8 bits (151), Expect = 1e-07
 Identities = 67/253 (26%), Positives = 122/253 (48%), Gaps = 16/253 (6%)
 Frame = +2

Query: 5    EVEEMNKSIDEVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITES-D 181
            +VE      +E+ E++KS+  +++ +E +  V ++   V E    I+E +   ++TE  +
Sbjct: 366  DVENTESVTEEIAEVDKSV--IEEAVEKQGSVTEEK--VQEGVSAIEEIEEIESVTEEIE 421

Query: 182  NKEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDS-GVKEDEPTKNDVV-NENHGK 355
              E +  E   E ++V   AVE    V +E+VE ++ D+  V ED+    DVV  E  GK
Sbjct: 422  EIESVTEEIAEEDKSVIEEAVEKQGSVTEEIVEEEELDTEEVLEDKSVTGDVVEQEGSGK 481

Query: 356  DVEERPNENVGEKPEE---IAKDTCNTEYVA----SESMVENADFGNVSTENGLLPTEIS 514
            D E    E+  E+ +E   + ++   TEY++     ES  E      +S    ++ TE S
Sbjct: 482  D-ESEAKESFTEEVDELKSVKEEDQETEYISREIEEESATEQHSEQELSINKEVVETE-S 539

Query: 515  ATKDPQGNASSENTLLPTEISATKDPQGNASSENSLLPTEISATKDPQEQEA----KLGD 682
             TKD +   S+   +L    S  ++        + +L  ++S +++  E++A    +  +
Sbjct: 540  LTKDIEEEKSTTQEILEETQSVNEEIVEEERDTDEVLKEKVSPSEEVIEEQASTTEEFVE 599

Query: 683  VRPSEDE--EMED 715
             R S DE  E+ED
Sbjct: 600  ERSSTDEIVEVED 612


>ref|XP_003636096.1| hypothetical protein MTR_027s0011 [Medicago truncatula]
            gi|355502031|gb|AES83234.1| hypothetical protein
            MTR_027s0011 [Medicago truncatula]
          Length = 629

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 50/246 (20%), Positives = 107/246 (43%), Gaps = 1/246 (0%)
 Frame = +2

Query: 5    EVEEMNKSID-EVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITESD 181
            EVEE  + ++ EVEE  +  +EV++++  + E +++ + V E    + E +    + E +
Sbjct: 288  EVEEEEEEMEVEVEEEEEEEEEVEEEVVEEEEEQEEEEEVEEVEVEVGEEEEEEEVKEEE 347

Query: 182  NKEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGKDV 361
             +E +  E   E + V+    E + + ++E  E ++ +   +E+E        E   ++V
Sbjct: 348  VEEEVEVEE--EEEEVEEVEEEEVVEEEEEQEEEEEVEEEEEEEEVEDEVEEIEVEEEEV 405

Query: 362  EERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEISATKDPQGNA 541
            EE   E V  + EE  ++    E    E +V   +   V  E  ++  E    +D +   
Sbjct: 406  EEEEEEEVEVEEEEEEEEVVEVEEEEEEDVVVEVEEEEVEVEEEVVEVEEEEEEDEEEEE 465

Query: 542  SSENTLLPTEISATKDPQGNASSENSLLPTEISATKDPQEQEAKLGDVRPSEDEEMEDAV 721
              E       +   ++ +     E      E+    + +E+E ++ +V   E EE E+ V
Sbjct: 466  EVEEVEEEEVVEEEEEQEEEEEVEEEEEEEEVEDEVEEEEEEEEVEEVEEIEVEEEEEEV 525

Query: 722  HFDNKQ 739
              + ++
Sbjct: 526  EEEEEE 531



 Score = 58.9 bits (141), Expect = 2e-06
 Identities = 53/249 (21%), Positives = 108/249 (43%), Gaps = 4/249 (1%)
 Frame = +2

Query: 5    EVEEMNKSIDEVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITESDN 184
            E EE  +  +EVEE+   + E +++ EVK E  ++   V E  + ++E +    + E + 
Sbjct: 316  EEEEEQEEEEEVEEVEVEVGEEEEEEEVKEEEVEEEVEVEEEEEEVEEVEEEEVVEEEEE 375

Query: 185  KEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGKDVE 364
            +E        E +      VE + +V++E VE ++ +    E+E  + +VV      +VE
Sbjct: 376  QEEEEEVEEEEEEEEVEDEVEEI-EVEEEEVEEEEEEEVEVEEEEEEEEVV------EVE 428

Query: 365  ERPNEN----VGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEISATKDPQ 532
            E   E+    V E+  E+ ++    E    E   E  +   V  E  ++  E    ++ +
Sbjct: 429  EEEEEDVVVEVEEEEVEVEEEVVEVEEEEEEDEEEEEEVEEVEEEE-VVEEEEEQEEEEE 487

Query: 533  GNASSENTLLPTEISATKDPQGNASSENSLLPTEISATKDPQEQEAKLGDVRPSEDEEME 712
                 E   +  E+   ++ +     E   +  E    ++ +E+E    +V   E+EE E
Sbjct: 488  VEEEEEEEEVEDEVEEEEEEEEVEEVEEIEVEEEEEEVEEEEEEEV---EVEEEEEEEEE 544

Query: 713  DAVHFDNKQ 739
            + V  + ++
Sbjct: 545  EVVEVEEEE 553



 Score = 56.6 bits (135), Expect = 9e-06
 Identities = 52/242 (21%), Positives = 104/242 (42%), Gaps = 8/242 (3%)
 Frame = +2

Query: 5    EVEEMNKSIDEVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIK--------ESDCG 160
            EVEE  + ++EVEE     +E + + E + E E++ + V +  + I+        E +  
Sbjct: 353  EVEEEEEEVEEVEEEEVVEEEEEQEEEEEVEEEEEEEEVEDEVEEIEVEEEEVEEEEEEE 412

Query: 161  ITITESDNKEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVN 340
            + + E + +E +      E + V     E   +V++EVVE ++ +   +E+E    +V  
Sbjct: 413  VEVEEEEEEEEVVEVEEEEEEDVVVEVEEEEVEVEEEVVEVEEEEEEDEEEEEEVEEVEE 472

Query: 341  ENHGKDVEERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEISAT 520
            E   ++ EE+  E   E+ EE  +     E    E  VE  +   V  E   +  E    
Sbjct: 473  EEVVEEEEEQEEEEEVEEEEEEEEVEDEVEEEEEEEEVEEVEEIEVEEEEEEVEEEEEEE 532

Query: 521  KDPQGNASSENTLLPTEISATKDPQGNASSENSLLPTEISATKDPQEQEAKLGDVRPSED 700
             + +     E   +  E+   ++ +     E+ ++  E    ++  E E ++ +V   E+
Sbjct: 533  VEVEEEEEEEEEEV-VEVEEEEEEEEEEEEEDVVVEVE----EEEVEVEEEVVEVEEEEE 587

Query: 701  EE 706
            EE
Sbjct: 588  EE 589


>emb|CCA37660.1| Cell surface glycoprotein 1 [Komagataella pastoris CBS 7435]
          Length = 1618

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 69/266 (25%), Positives = 110/266 (41%), Gaps = 16/266 (6%)
 Frame = +2

Query: 2    DEVEEMNKSIDEVEEMNKSIDEVKDKIEVKT---EVEDQTKPVNEASDPIKESDCGITIT 172
            +E  E + S DEVEE + S +EV++  E  T   EVE+ T    E  +  +ES     + 
Sbjct: 767  EESTEESTSTDEVEE-STSTEEVEESTEESTSTDEVEESTS-TEEVEESTEESTSTDEVE 824

Query: 173  ESDNKEPINTETPAEAQTVD------TSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDV 334
            ES + E +   T     T D      T   E   +      E D+  S  + +E T+   
Sbjct: 825  ESTSTEEVEESTEESTSTEDAEESTSTEEAEESTEESTSTDEVDESTSTEEAEESTEEST 884

Query: 335  VNENHGKDVEERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEI- 511
              E    D EE  +    E+  E +  T  TE    E  + + +    STE      E+ 
Sbjct: 885  STE----DAEESTSTEEAEESTEESTSTEETEESTEE--LTSTEEAEESTEEPTSTDEVD 938

Query: 512  --SATKDPQGNASSENTLLPTEISATKDPQGNASSENSLLPTEISATKDPQE----QEAK 673
              ++T D   + S+E T    + S+T  PQG    EN     E S+T++ +E     E  
Sbjct: 939  ESTSTDDVDESTSTEGT---EQFSSTDVPQGRPGFENPTEEVESSSTEEFEEPTSTDETD 995

Query: 674  LGDVRPSEDEEMEDAVHFDNKQGAKS 751
                  +  EE E+++  D+ + + S
Sbjct: 996  ESTEEATSTEEAEESISTDDVEQSTS 1021



 Score = 58.5 bits (140), Expect = 2e-06
 Identities = 64/257 (24%), Positives = 109/257 (42%), Gaps = 23/257 (8%)
 Frame = +2

Query: 2    DEVEEMNKSIDEVEEMNK------------SIDEVKDKIEVKTEVE--DQTKPVNEASDP 139
            DEVEE + S +EVEE  +            S +E ++  E  T  +  D++    EA + 
Sbjct: 553  DEVEE-STSTEEVEESTEESTSTEDAEESTSTEEAEESTEESTSTDEVDESTSTEEAEES 611

Query: 140  IKESDCGITITESDNKEPI--------NTETPAEAQTVDTSAVENLCDVKKEVVEHDDPD 295
             +ES     + ES + E +        +TE P E+    TS  E      +E    D+ D
Sbjct: 612  TEESTSTDEVEESTSTEEVEESIEESTSTEEPEESTEELTSTEE-----AEESTSTDEVD 666

Query: 296  SGVKEDEPTKNDVVNENHGKDVEERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGN 475
               +E   T+    +       +E       E+ EE  +++ +TE     +  E A+   
Sbjct: 667  ESTEESTSTEEAEESTEESTSTDEVEESTSTEEVEESTEESTSTEDAEESTSTEEAE--- 723

Query: 476  VSTENGLLPTEISATKDPQGNASSENTLLPTEIS-ATKDPQGNASSENSLLPTEISATKD 652
             STE      E ++T + + + S+E     TE S +T+D + + S+E +   TE S + D
Sbjct: 724  ESTE------ESTSTDEVEESTSTEEVEESTEESTSTEDAEESTSTEEAEESTEESTSTD 777

Query: 653  PQEQEAKLGDVRPSEDE 703
              E+     +V  S +E
Sbjct: 778  EVEESTSTEEVEESTEE 794


>ref|XP_002490879.1| Mucin-like protein [Komagataella pastoris GS115]
            gi|238030675|emb|CAY68599.1| Mucin-like protein
            [Komagataella pastoris GS115]
          Length = 1416

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 69/266 (25%), Positives = 110/266 (41%), Gaps = 16/266 (6%)
 Frame = +2

Query: 2    DEVEEMNKSIDEVEEMNKSIDEVKDKIEVKT---EVEDQTKPVNEASDPIKESDCGITIT 172
            +E  E + S DEVEE + S +EV++  E  T   EVE+ T    E  +  +ES     + 
Sbjct: 565  EESTEESTSTDEVEE-STSTEEVEESTEESTSTDEVEESTS-TEEVEESTEESTSTDEVE 622

Query: 173  ESDNKEPINTETPAEAQTVD------TSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDV 334
            ES + E +   T     T D      T   E   +      E D+  S  + +E T+   
Sbjct: 623  ESTSTEEVEESTEESTSTEDAEESTSTEEAEESTEESTSTDEVDESTSTEEAEESTEEST 682

Query: 335  VNENHGKDVEERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEI- 511
              E    D EE  +    E+  E +  T  TE    E  + + +    STE      E+ 
Sbjct: 683  STE----DAEESTSTEEAEESTEESTSTEETEESTEE--LTSTEEAEESTEEPTSTDEVD 736

Query: 512  --SATKDPQGNASSENTLLPTEISATKDPQGNASSENSLLPTEISATKDPQE----QEAK 673
              ++T D   + S+E T    + S+T  PQG    EN     E S+T++ +E     E  
Sbjct: 737  ESTSTDDVDESTSTEGT---EQFSSTDVPQGRPGFENPTEEVESSSTEEFEEPTSTDETD 793

Query: 674  LGDVRPSEDEEMEDAVHFDNKQGAKS 751
                  +  EE E+++  D+ + + S
Sbjct: 794  ESTEEATSTEEAEESISTDDVEQSTS 819


>ref|XP_001911208.1| hypothetical protein [Podospora anserina S mat+]
            gi|170946232|emb|CAP73033.1| unnamed protein product
            [Podospora anserina S mat+]
          Length = 1605

 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 57/237 (24%), Positives = 103/237 (43%), Gaps = 10/237 (4%)
 Frame = +2

Query: 2    DEVEEMNKSIDEVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITESD 181
            ++ E+  +   E +  +++ +E +D+ E +TEV+D+ +  +E+ +  ++     +  ES 
Sbjct: 861  EKTEKKTEEETEEDTEDETEEETEDETEEETEVDDEDESEDESGEETEDESGDESEDESG 920

Query: 182  NKEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKED------EPTKNDVVNE 343
            ++    TE+  E  + + S  + L DVKKE    +D   G KED       P        
Sbjct: 921  DESEDKTESE-EKSSGELSGHKRLSDVKKESTGDNDDADGFKEDGDSDFSNPNAKRKEQS 979

Query: 344  NHGKDVEERPNENVGEKPEEIAK----DTCNTEYVASESMVENADFGNVSTENGLLPTEI 511
               KD E+ PN + G   E+ +K    DT     V  +    N      ST +G   T++
Sbjct: 980  ADPKDPEDPPNASKGSNEEQSSKADNLDTTKKNAVDDDDAAAN----KTSTADGPSDTQL 1035

Query: 512  SATKDPQGNASSENTLLPTEISATKDPQGNASSENSLLPTEISATKDPQEQEAKLGD 682
            S+    Q   +S++    TE  +    +  ASSE     +  S ++D +    K  D
Sbjct: 1036 SSITKDQAEENSKSQESSTENGSCDAKE--ASSEEEAATSTASKSEDTRSTTVKPQD 1090


>ref|WP_001948897.1| hypothetical protein [Helicobacter pylori]
           gi|459517477|gb|EMH09605.1| hypothetical protein
           HMPREF1410_00858 [Helicobacter pylori GAM249T]
          Length = 561

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 64/251 (25%), Positives = 102/251 (40%), Gaps = 16/251 (6%)
 Frame = +2

Query: 11  EEMNKSIDEVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGITIT---ESD 181
           EE+ K  +EV+EM + I E K+K EV    +D+ KP ++ +    E+     ++   E+ 
Sbjct: 179 EEVKK--EEVKEMQEEIKE-KEKQEVAESPQDEEKPKDDETQGSVETPKDEEVSKELETQ 235

Query: 182 NKEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGKDV 361
            +EPI  ET  E              VK+E+ E       +KE EP K         + +
Sbjct: 236 EQEPIKEETQKE--------------VKEEIKEETQEQEPIKEQEPIKEQ-------EPI 274

Query: 362 EERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEISATKDPQGNA 541
           +E   E   EK E+        E  A + +V+     +   EN     E   T++ Q N 
Sbjct: 275 KEETQEIKEEKQEKTQDSPSTQELEAMQELVKEIQENSNDQEN---KKETQETQETQENT 331

Query: 542 SSENTLLPTEISATKD-----------PQGNASSENSLLP--TEISATKDPQEQEAKLGD 682
            +   +   E+   K+            QG    E +  P   EI  T+D   QE +  D
Sbjct: 332 ETPQDIETQELEIPKEEETQEVAEKTQAQGLEKEEIAETPQEKEIQETQDETPQELEAQD 391

Query: 683 VRPSEDEEMED 715
            +P E+E  +D
Sbjct: 392 EKPQENETPKD 402


>ref|WP_008389087.1| hypothetical protein [Halosarcina pallida]
           gi|445674629|gb|ELZ27166.1| hypothetical protein
           C474_17499 [Halosarcina pallida JCM 14848]
          Length = 310

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 61/220 (27%), Positives = 90/220 (40%), Gaps = 27/220 (12%)
 Frame = +2

Query: 164 TITESDNKEPINTETPAEAQT---VDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDV 334
           T +ES  KEP++ ET A A T   VD  A     +  + V   +DP++ ++E EP++ D 
Sbjct: 44  TESESAVKEPVSEETQASASTDTLVDEEAGREPAEAVESVKNDEDPETVIEEAEPSETDE 103

Query: 335 VNENHGKDV-----------EERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVS 481
            +E    D            EER      E  E +   T     V + S  E  D G  +
Sbjct: 104 ADEAAAADADGAASTGSMVDEERDPAEAAEPAEAVGATTTEDAGVDAGSDAETGD-GTET 162

Query: 482 TENGLLPTEISATKDPQGNASS------ENTLLPTEISATKDPQGNASSENSLLPTEISA 643
            ++    T+ SAT D    AS+      EN   PTE +   +  G +S E   + T++ A
Sbjct: 163 PDDEPAGTDDSATSDAASAASTDSLLDDENADEPTEAAEPAEAVGPSSEE---MDTDVEA 219

Query: 644 TKDP-------QEQEAKLGDVRPSEDEEMEDAVHFDNKQG 742
             DP        ++    GD  P E     DA    N +G
Sbjct: 220 DTDPGDDVEEDVDEPDAAGDADPGEAMASGDAEPVKNVKG 259


>ref|WP_002205908.1| hypothetical protein [Helicobacter pylori]
           gi|393080848|gb|EJB81573.1| poly E-rich protein
           [Helicobacter pylori Hp H-4]
          Length = 473

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 62/246 (25%), Positives = 103/246 (41%), Gaps = 11/246 (4%)
 Frame = +2

Query: 11  EEMNKSIDEVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGITIT---ESD 181
           EE+ K  +EV+EM + I E K+K EV    +D+ KP ++ +    E+     ++   E+ 
Sbjct: 179 EEVKK--EEVKEMQEEIKE-KEKQEVAESPQDEEKPKDDETQGSVETPKDEEVSKELETQ 235

Query: 182 NKEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKN-DVVNENHGKD 358
            +EPI  ET  E              VK+E+ E       +KE EP K  + + E   + 
Sbjct: 236 EQEPIKEETQKE--------------VKEEIKEETQEQEPIKEQEPIKEQEPIKEQ--EP 279

Query: 359 VEERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEISATKDPQGN 538
           ++E   E   EK E+        E  A + +V+     +   EN     E     +   +
Sbjct: 280 IKEETQEIKEEKQEKTQDSPSTQELEAMQELVKEIQENSNDQENKKETQETQENTETPQD 339

Query: 539 ASSENTLLPTE-----ISATKDPQGNASSENSLLPTE--ISATKDPQEQEAKLGDVRPSE 697
             ++   +P E     ++     QG    E +  P E  I  T+D   QE +  D +P E
Sbjct: 340 IETQELEIPKEEETQEVAEKTQAQGLEKEEIAETPQEKEIQETQDETPQELEAQDEKPQE 399

Query: 698 DEEMED 715
           +E  +D
Sbjct: 400 NETPKD 405


>ref|XP_002094035.1| GE20417 [Drosophila yakuba] gi|194180136|gb|EDW93747.1| GE20417
           [Drosophila yakuba]
          Length = 473

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 61/250 (24%), Positives = 102/250 (40%), Gaps = 15/250 (6%)
 Frame = +2

Query: 11  EEMNKSIDEVEEMNKSIDEVKDKIEVKTE--VEDQTKPVNEASDPIKESDCGITITESDN 184
           +E+    ++V + +K  +  KD  + ++E   +D+     +  D + E++   T  + D 
Sbjct: 231 DEVVNGDEDVADDDKQSENDKDDKDAESEDAKDDKDAKSEDVKDKLAENESTETSEDVDT 290

Query: 185 KEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGKDVE 364
            E    E   E ++ D  A E      K   E    D   K +E   + V  ++  +DV+
Sbjct: 291 TEKSEVEATNEEKSEDVDAAE------KSEAESTKDDDDEKAEETESSKVNEDDKVEDVQ 344

Query: 365 ERPNENVGEKPEEI----------AKDTCNTEYVASESMVENADFGNVSTE-NGLLPTEI 511
            +  E V EKPEE            +D    + V  E  VE  D    S E      TE+
Sbjct: 345 GKSEETVTEKPEEADSAPKSEVDSTEDEKAEKSVEVEDKVEETDNTKESEEVESTGVTEV 404

Query: 512 SATKDPQGNASSENTL--LPTEISATKDPQGNASSENSLLPTEISATKDPQEQEAKLGDV 685
           + TKD +   + EN     P E+SA +  +    +E      E  A   P+ +E+K  + 
Sbjct: 405 ATTKDEEEAKAEENQTQEKPEEVSAAE--KSADETEEKEASAEEKAEDTPENEESKPAEE 462

Query: 686 RPSEDEEMED 715
             S +E  ED
Sbjct: 463 SSSSEESEED 472


>ref|XP_001311646.1| hypothetical protein [Trichomonas vaginalis G3]
            gi|121893464|gb|EAX98716.1| hypothetical protein
            TVAG_480920 [Trichomonas vaginalis G3]
          Length = 1996

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 64/248 (25%), Positives = 102/248 (41%), Gaps = 2/248 (0%)
 Frame = +2

Query: 2    DEVEEMNKSIDEVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITESD 181
            DE +E NK  D+ +E  K  +  +D+ + +   ED+ K   E  +  KE +      E+ 
Sbjct: 876  DETKEENK--DKNKEETKEEEPFEDETKEEEPFEDENK--EETKEETKEENKDENKEETK 931

Query: 182  NKEPINTETPAEAQTVDTSAV-ENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGKD 358
             +EP   ET  E +  +     E   D  KE  + +  D   KE+EP   +   EN  ++
Sbjct: 932  EEEPFEDETKEENKDENKEETKEETKDKSKEETKEETKDE-TKEEEPFDGETKEENKDEN 990

Query: 359  VEERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEISATKDPQGN 538
             EE  +EN  E  EE  ++T   E    E+  E  D     T+      + +  ++P  +
Sbjct: 991  KEENKDENKEETKEENKEETKEEEPFEDETKEETKDETKEETK------DETKEEEPFED 1044

Query: 539  ASSENTLLPTEISATKDPQGNASSENSLLPT-EISATKDPQEQEAKLGDVRPSEDEEMED 715
             S E T         KD     + E +   T E    +D  ++E K       EDE  ED
Sbjct: 1045 ESKEETK-----EENKDENKEETKEETKEETKEEEPFEDENKEETK---EETKEDEPFED 1096

Query: 716  AVHFDNKQ 739
                +NK+
Sbjct: 1097 ETKEENKE 1104


>ref|XP_001308786.1| hypothetical protein [Trichomonas vaginalis G3]
            gi|121890483|gb|EAX95856.1| hypothetical protein
            TVAG_053710 [Trichomonas vaginalis G3]
          Length = 681

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 56/253 (22%), Positives = 117/253 (46%), Gaps = 19/253 (7%)
 Frame = +2

Query: 5    EVEEMNKS-IDEVEEMNKSIDEVKDKIEV-KTEVEDQTKPVNEASDPIKESDCGITITES 178
            E+EE +K+ +DE  E+ K + E+KD+ ++ K +    ++ V++    I E D  I   + 
Sbjct: 429  EIEERHKAAVDEQAEVKKEVHELKDECKLLKRKHGVLSRAVSDYKKKIAEEDAKIAKLKE 488

Query: 179  DNKEPINTETPAEAQTVDTSAVENLCDVK----------KEVVEHDDPDSGVKEDEPTKN 328
            +  E ++ +   E ++   S ++   ++K          K+  + + P+   KEDE   +
Sbjct: 489  NQNEKLHVQV-FEPESSKNSTMQQTDELKNSGKATNASEKDNEDSNPPEKDTKEDEKESS 547

Query: 329  DVVNEN-HGKDVEERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPT 505
               ++    K+ +   +E + EK  + AK++   +  + +    + +    S +    P 
Sbjct: 548  SSKDKKPEAKNKKSSDSEKLNEKESKKAKESEKEQKTSDKEKSSDKEEKEKSNKEEEKPK 607

Query: 506  EISATKDPQGNASSENTLLPTE----ISATK--DPQGNASSENSLLPTEISATKDPQEQE 667
               +     G   S+   + ++    ISAT+  DP GNAS+ NS +P EI+ ++D   Q 
Sbjct: 608  NTESQLRKSGTIHSDFESVDSDENIVISATQTLDPPGNASNSNSPVPDEIADSEDEVMQS 667

Query: 668  AKLGDVRPSEDEE 706
             ++ +   S D +
Sbjct: 668  EEIKEKSDSSDND 680


>gb|EYU41205.1| hypothetical protein MIMGU_mgv1a002907mg [Mimulus guttatus]
          Length = 626

 Score = 61.6 bits (148), Expect = 3e-07
 Identities = 67/233 (28%), Positives = 107/233 (45%), Gaps = 6/233 (2%)
 Frame = +2

Query: 62  DEVKDKIEVKTEVEDQTKPVNEASDPIKESDC--GITITESDNKEPINTETPAEAQTVDT 235
           +E      VK+E+EDQ+   +EA+D I   +     T T S+N E  +T+   E QT   
Sbjct: 53  NEANSNATVKSEIEDQSNHASEANDRINLEEIMGEQTNTNSNNDERSSTKPELEEQTDAI 112

Query: 236 SAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGKD-VEERP-NENVGEKPEEIA 409
           S V+     + EVVE +  +  VKED+  +N V   N  K+ +EE P  +   E PE I 
Sbjct: 113 SLVDKHTVSENEVVEQNTTNGVVKEDKSAENGVEGGNPVKNMIEENPIGDGAAEMPETIG 172

Query: 410 -KDTCNTEYVASESMVE-NADFGNVSTENGLLPTEISATKDPQGNASSENTLLPTEISAT 583
            +D    E+V  + +   NA+  +   EN    TE+S     Q    S+N  + TE+   
Sbjct: 173 MEDGGCVEHVPRKELTNGNAESVSDHRENRNSQTELSLDAALQ----SQNAAI-TEVPMG 227

Query: 584 KDPQGNASSENSLLPTEISATKDPQEQEAKLGDVRPSEDEEMEDAVHFDNKQG 742
            D +   +S+ S    + +   + +E   ++    P  DE  ++ V    KQG
Sbjct: 228 GDVEMEVASKES---EDKTVNNEAKEHGKEIILASPVSDENAKN-VETATKQG 276


>ref|XP_744668.1| hypothetical protein [Plasmodium chabaudi chabaudi]
           gi|56524714|emb|CAH78120.1| conserved hypothetical
           protein [Plasmodium chabaudi chabaudi]
          Length = 1546

 Score = 61.6 bits (148), Expect = 3e-07
 Identities = 52/235 (22%), Positives = 100/235 (42%), Gaps = 4/235 (1%)
 Frame = +2

Query: 2   DEVEEMNKSIDEVEEMNKSIDEVKDKIEVKTEVEDQTKPVNEASDPIKESDCGITITESD 181
           ++  E N+ +  +EE  +  +E  +K+   + +E+  +  NE ++ +   +        D
Sbjct: 41  EQKNEDNEKLSNIEENKEQKNEDNEKL---SNIEENKEQKNEGNEKLSNIEENKEQKNED 97

Query: 182 NKEPINTETPAEAQTVDTSAVENLCDVKKEVVEHDDPDSGVKEDEPTKNDVVNENHGKDV 361
           N++  N E   E +  D   + N+ + K++  E ++  + ++E++  KN    E++ K+ 
Sbjct: 98  NEKQNNIEENKEQKNEDNEKLSNIEENKEQKNEDNEKQNNIEENKEQKN----EDNEKNT 153

Query: 362 EERPNENVGEKPEEIAKDTCNTEYVASESMVENADFGNVSTENGLLPTEISATKDPQGNA 541
            E  N N           T NT  V   + VEN D  N+          +++T   + N 
Sbjct: 154 IEEKNTNT----------TINTSSVQVTNDVENNDCKNI---------HLNSTDKNRSNI 194

Query: 542 SSENTLLPTE----ISATKDPQGNASSENSLLPTEISATKDPQEQEAKLGDVRPS 694
                L P E    IS +   + N + EN  +P EI   K+  +       V+PS
Sbjct: 195 EDSANLTPIEKKQEISNSSKREDNLNKENKSIPLEIEKEKNNPQNYENSSMVKPS 249

