BLASTX nr result

ID: Mentha24_contig00038340 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00038340
         (931 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU33150.1| hypothetical protein MIMGU_mgv1a003975mg [Mimulus...   123   9e-26
gb|EYU19859.1| hypothetical protein MIMGU_mgv1a001139mg [Mimulus...   120   6e-25
dbj|BAO49705.1| nuclear pore complex protein Nup136a [Nicotiana ...    84   8e-14
dbj|BAO49706.1| nuclear pore complex protein Nup136b [Nicotiana ...    83   2e-13
ref|XP_006351027.1| PREDICTED: putative GPI-anchored protein PB1...    75   3e-11
ref|XP_004250461.1| PREDICTED: uncharacterized protein LOC101267...    71   5e-10
ref|XP_004195701.1| Piso0_005105 [Millerozyma farinosa CBS 7064]...    71   5e-10
ref|XP_004194602.1| Piso0_005105 [Millerozyma farinosa CBS 7064]...    67   8e-09
ref|XP_003757474.1| PREDICTED: nuclear pore complex protein Nup2...    67   1e-08
ref|YP_335617.1| hemagglutinin-like protein [Burkholderia pseudo...    66   2e-08
ref|WP_006396280.1| hemagglutinin [Burkholderia multivorans] gi|...    66   2e-08
dbj|BAH14734.1| unnamed protein product [Homo sapiens]                 66   2e-08
dbj|BAG63124.1| unnamed protein product [Homo sapiens]                 66   2e-08
ref|NP_001010909.2| mucin-21 precursor [Homo sapiens] gi|2964392...    66   2e-08
gb|AAI05738.1| Mucin 21, cell surface associated [Homo sapiens] ...    66   2e-08
gb|ESX03074.1| hypothetical protein HPODL_02382 [Ogataea parapol...    65   3e-08
ref|YP_001583453.1| hemagluttinin domain-containing protein [Bur...    64   7e-08
ref|WP_006414054.1| hemagglutinin [Burkholderia multivorans] gi|...    64   9e-08
ref|XP_003866329.1| adhesin-like protein [Candida orthopsilosis ...    64   9e-08
ref|XP_006671739.1| class III chitinase ChiA1 [Cordyceps militar...    64   9e-08

>gb|EYU33150.1| hypothetical protein MIMGU_mgv1a003975mg [Mimulus guttatus]
          Length = 551

 Score =  123 bits (309), Expect = 9e-26
 Identities = 88/223 (39%), Positives = 123/223 (55%), Gaps = 3/223 (1%)
 Frame = +2

Query: 200 DVCATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNN 379
           +V   + + M   + EKG  +     +D N  ++TV SAASNG L S    GS T A   
Sbjct: 3   NVPVPSGAPMKSPDIEKGSQLNPPKTDDRN--AETVPSAASNGPLHSSPLAGSFTTASFK 60

Query: 380 NTHQISTSP-PVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVS 556
            T++ ST+  P++FTPS S+ +F+P+G G+ +S  SS   FG++    V+G   FKFG S
Sbjct: 61  ETNEKSTAAAPLLFTPSTSIDNFVPSGTGTSSSTASSGSIFGLAAPSNVTGS-VFKFGAS 119

Query: 557 PNASAKVSEAATTIIP-AGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLK 733
              S  VS A TTI+P  GD K N  T  S+G S+S+ +N V FG   S + + GLSS  
Sbjct: 120 TEPSTAVSAATTTIVPLGGDSKTNADTDPSTGSSSSALSNVVPFGSTSSVHGMFGLSSSV 179

Query: 734 SNPSEIIS-QGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPP 859
           S+ +   + QGSLF + SK  V    T S G + QS + A  P
Sbjct: 180 SHSTANNNLQGSLFGNASKPFVSGLETASQGTSIQSVAPASSP 222


>gb|EYU19859.1| hypothetical protein MIMGU_mgv1a001139mg [Mimulus guttatus]
          Length = 879

 Score =  120 bits (302), Expect = 6e-25
 Identities = 106/292 (36%), Positives = 138/292 (47%), Gaps = 9/292 (3%)
 Frame = +2

Query: 74   DKSKEINNSQSLFSFSPKVSDKFPSL---MXXXXXXXXXXXXXLQDVCATTSSSMTVSEP 244
            D+ KE N+S  LFSFS KV DK PS+                 L +V A+T S   V   
Sbjct: 339  DRPKETNDSPPLFSFSSKVVDKLPSMPFESVKTPEAKLDNSSSLVNVNASTVSQTEVLAS 398

Query: 245  EKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTHQISTSPPVIFTP 424
            +KG ++    A D NGKSD   SAASNG LVS  P  S TAA +N+              
Sbjct: 399  DKGFHVNPSKAGDINGKSDFAPSAASNGPLVSNPPPVSTTAASHND-------------- 444

Query: 425  SVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVS---PNASAKVSEA-AT 592
               VS+F+P    S TS  SS   FG S  P     P FKFG S   PNA++  S A  T
Sbjct: 445  ---VSNFVP-AVASTTSAISSGSIFGFSAKPTDLSGPVFKFGASVDLPNAASLASTANVT 500

Query: 593  TIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNPSEIISQGSLF 772
             ++         A Y   G S+S  TN   FG   SG           + + +    S+F
Sbjct: 501  DVVDPNTKPKIDAGY---GSSSSPHTNLAPFGAASSG-----------SSTFVFGSSSIF 546

Query: 773  SSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTAS--FVSSLASAQVSS 922
            ++TSKS V    T S G + Q  S AP PL + + ++S  F SSL+++QVS+
Sbjct: 547  ANTSKSPVSGAETASQGASVQLGSSAPAPLPSFSMSSSTPFGSSLSNSQVSN 598


>dbj|BAO49705.1| nuclear pore complex protein Nup136a [Nicotiana benthamiana]
          Length = 1308

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 90/298 (30%), Positives = 120/298 (40%), Gaps = 30/298 (10%)
 Frame = +2

Query: 92   NNSQSLFSFSPKVS----DKFPSLMXXXXXXXXXXXXXLQDVCATTSSSMTVSEPEKGGY 259
            + SQSL + S K      DK P  +                  +  +SS T + P     
Sbjct: 635  SGSQSLATASNKSKETNVDKVPPFLFSPSTPVTESKPGSSSSLSNLASSPTDARPNPFQL 694

Query: 260  MKSQAAEDTNGKSDTV---------------------TSAASNGLLVSVAPFGSLTAAYN 376
              SQ A D+NGK + V                     TS+ SNGL        + +A  +
Sbjct: 695  DNSQKAVDSNGKLEAVSSGPSSSISTSGIFSLVAPSRTSSFSNGLFTPSPAISTTSALVS 754

Query: 377  NN-THQISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGV 553
             N T+ IST    IF P  S+   I    GS T+  SS   FG S    VS EP  KFG 
Sbjct: 755  GNFTNGISTGSSNIFAPLTSIVSMIGATTGSSTASVSSL--FGSSAASSVSKEPPIKFGF 812

Query: 554  SPNASAKVSEAATT-IIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSL 730
            S   S  VS  +TT      D+K+   T    G   SSP    +F   GSGN ISG SS 
Sbjct: 813  SGVPSETVSAPSTTSTAETTDVKSKFETGTIFGNMKSSPFVVASFAATGSGNSISGFSSS 872

Query: 731  KSN---PSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVS 895
              +      I SQGS+F +  +S V A  +V+   T+  S   P    ++  + S  +
Sbjct: 873  VMSAVTTGSIQSQGSVFGTGGESLVSAQTSVAGSGTSVVSGSMPAYFGSSASSPSIAN 930


>dbj|BAO49706.1| nuclear pore complex protein Nup136b [Nicotiana benthamiana]
          Length = 1304

 Score = 82.8 bits (203), Expect = 2e-13
 Identities = 90/294 (30%), Positives = 120/294 (40%), Gaps = 28/294 (9%)
 Frame = +2

Query: 98   SQSLFSFSPKVS--DKFPSLMXXXXXXXXXXXXXLQDVCATTSSSMTVSEPEKGGYMKSQ 271
            SQSL + S K +  DK P  +                  +  +SS T + P       SQ
Sbjct: 635  SQSLATVSNKETNVDKVPPFVFSSSTHSTGSKPGSLSSVSNLASSPTDARPNPFHLGNSQ 694

Query: 272  AAEDTNGKSDTV---------------------TSAASNGLLVSVAPFGSLTAAYNNN-T 385
             A D+NGK + +                     TS  SNGL        + +A+ + N T
Sbjct: 695  KAVDSNGKLEALSSGPSNSISTSGIFSLGAPSSTSGLSNGLFAPSPAISTTSASLSGNFT 754

Query: 386  HQISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNA 565
            + IST    IF P  S+   I    GS T+  SS   FG S    VS EP  KFG     
Sbjct: 755  NGISTGSSNIFAPLTSIVSMIGATTGSSTTSASSL--FGSSAASSVSKEPPIKFGFFGVP 812

Query: 566  SAKVSEAATT-IIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNP 742
            S  VS  +TT    A D+K    T  + G   SSP    +F   GSGN ISG SS   + 
Sbjct: 813  SETVSAPSTTSAAEATDVKAKSDTGTTFGNLKSSPFVVASFAATGSGNSISGFSSAVMSA 872

Query: 743  SEI---ISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVS 895
              I    SQGS+FS+  +S V A  +V    T+  S   P    ++  + S  +
Sbjct: 873  VAIGSAQSQGSVFSTGGESLVIAQTSVVGSGTSVVSGSMPAHFDSSASSPSIAN 926


>ref|XP_006351027.1| PREDICTED: putative GPI-anchored protein PB15E9.01c-like [Solanum
            tuberosum]
          Length = 1319

 Score = 75.5 bits (184), Expect = 3e-11
 Identities = 75/237 (31%), Positives = 102/237 (43%), Gaps = 16/237 (6%)
 Frame = +2

Query: 209  ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVT-----------SAASNGLLVSVAPFG 355
            ++ +SS T   P    +  SQ A D+NGK + V+           S +SNGL  +   F 
Sbjct: 683  SSLASSPTDGRPNPFQWKSSQKAVDSNGKLEAVSTSGIFSFGAPSSTSSNGLFATSPVFS 742

Query: 356  SLTAAYNNN-THQISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGE 532
            + +A  + N T+++ST       P  SVS  I    GS  +   S   FG S    VS E
Sbjct: 743  ATSALTSGNFTNEVSTGSSNNVVPLTSVSSTIGATAGSCNASAGSL--FGSSAALLVSKE 800

Query: 533  PFFKFGVSPNASAKVSEAATT-IIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNM 709
            P  KFG        VS  ATT      D+K    T  + G   SSP    +F   GSGN 
Sbjct: 801  PPTKFGFPTIPPKAVSAPATTSTAETTDVKAKSETGPTLGNLKSSPFGGASFAATGSGNS 860

Query: 710  ISGLSS---LKSNPSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNT 871
            I G SS     +      SQGS+ S+  +S V A  +V     +  S   P P S++
Sbjct: 861  IFGFSSSVMSTATTGTSQSQGSVSSTGGESLVSAQTSVGGSGISAFSGSMPAPFSSS 917


>ref|XP_004250461.1| PREDICTED: uncharacterized protein LOC101267283 [Solanum
            lycopersicum]
          Length = 1301

 Score = 71.2 bits (173), Expect = 5e-10
 Identities = 74/252 (29%), Positives = 109/252 (43%), Gaps = 14/252 (5%)
 Frame = +2

Query: 209  ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVT-----------SAASNGLLVSVAPFG 355
            ++ +SS T   P    +  SQ A D+NGK + V+           S +SNGL  +   F 
Sbjct: 679  SSLASSPTDGRPNPFQWNSSQKAVDSNGKLEAVSTSGIFSFGAPPSTSSNGLFATSPAFS 738

Query: 356  SLTA-AYNNNTHQISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVS-FGVSETPKVSG 529
            + +A    N T+ +STS   I     S S  I     +  S  +S +S FG S T  V  
Sbjct: 739  ATSALTLGNFTNDVSTSSSNIAVSLTSASSTIGATAATAGSSNASAISLFGSSATSLVPK 798

Query: 530  EPFFKFGVSPNASAKVSEAATT-IIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGN 706
            EP  KFG        VS  ATT      D+K    T  + G   SSP    +    GSGN
Sbjct: 799  EPPTKFGFPTIPPKAVSAPATTSAAETTDVKAKSETGPTFGNLKSSPFGGASLSATGSGN 858

Query: 707  MISGLSSLKSNPSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTAS 886
             I G S      S ++S  +  S+ S+ SV + G     L +  +S     +S  +G+  
Sbjct: 859  SIFGFS------SSVMSTATTSSTQSQGSVSSTG--GESLASAETSVGGSGISAFSGSMP 910

Query: 887  FVSSLASAQVSS 922
             + SL+++  S+
Sbjct: 911  ALFSLSASSPST 922


>ref|XP_004195701.1| Piso0_005105 [Millerozyma farinosa CBS 7064]
            gi|359377123|emb|CCE85506.1| Piso0_005105 [Millerozyma
            farinosa CBS 7064]
          Length = 2066

 Score = 71.2 bits (173), Expect = 5e-10
 Identities = 67/246 (27%), Positives = 110/246 (44%), Gaps = 8/246 (3%)
 Frame = +2

Query: 209  ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
            A +SS+ + S P         A+       D  +S  S+G+  S  P   + ++   ++ 
Sbjct: 1652 APSSSAASSSAPSSNAKSTGIASSSAPSSGDVSSSTPSSGVPSSSVPSSGVASSGVVSSG 1711

Query: 389  QISTSPPVIFTPSVSV-SDFIPTGNGSVTSDPSSRV-SFGVSETP-KVSGEPFFKFGVSP 559
             +S+S P     S  V S   P+ + S + DP S V S GVS +  + +G+P        
Sbjct: 1712 VVSSSTPSTGVASSGVPSSSAPSSSASTSIDPLSSVASSGVSSSSAQATGDP-------- 1763

Query: 560  NASAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSN 739
            ++S   S A+T+I P   +  + A   S   S++  +  V+ G   S    SG+SS  + 
Sbjct: 1764 SSSVPSSSASTSIAPVFSVAPSSAPSSSDPSSSAPSSGVVSSGVASSSTPSSGVSSSSAP 1823

Query: 740  PSEIISQGSLFSSTSK-----SSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLA 904
             S ++S G   SST       SSVP+ G VS+GL +   + +  P S+   +    S L 
Sbjct: 1824 SSGVVSSGVTSSSTPSSGVPSSSVPSSGLVSSGLVSSGVASSGVPSSSVPSSGLVSSGLV 1883

Query: 905  SAQVSS 922
            S+   S
Sbjct: 1884 SSSTQS 1889



 Score = 58.2 bits (139), Expect = 5e-06
 Identities = 65/251 (25%), Positives = 109/251 (43%), Gaps = 12/251 (4%)
 Frame = +2

Query: 209  ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAA------SNGLLVSVAPFGSLTAA 370
            +  SSS   S     G   S A   ++  S   +S A      S+G+  S AP  S  ++
Sbjct: 1521 SVVSSSAPSSNAASTGVASSSAPSSSDPSSSAPSSGAPSTGAPSSGVSSSSAPLSSAQSS 1580

Query: 371  YNNNTHQISTSPPVIFTPSVSV------SDFIPTGNGSVTSDPSSRVSFGVSETPKVSGE 532
               ++   S+  P     S  V      S  +P+ +   + DPSS V+     +  V+  
Sbjct: 1581 SAPSSGISSSGVPSSNAQSTDVASSSVASSSVPSSSAPSSIDPSSSVASSSVPSSSVASS 1640

Query: 533  PFFKFGVSPNASAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMI 712
                  VS ++ A  S AA++  P+ + K+ G    S+  S    ++  + G P S    
Sbjct: 1641 SVPSSSVS-SSIAPSSSAASSSAPSSNAKSTGIASSSAPSSGDVSSSTPSSGVPSSSVPS 1699

Query: 713  SGLSSLKSNPSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFV 892
            SG++S     S ++S G + SST  + V + G  S+   + S+S +  PLS+    AS  
Sbjct: 1700 SGVAS-----SGVVSSGVVSSSTPSTGVASSGVPSSSAPSSSASTSIDPLSS---VASSG 1751

Query: 893  SSLASAQVSSE 925
             S +SAQ + +
Sbjct: 1752 VSSSSAQATGD 1762


>ref|XP_004194602.1| Piso0_005105 [Millerozyma farinosa CBS 7064]
            gi|359376024|emb|CCE86606.1| Piso0_005105 [Millerozyma
            farinosa CBS 7064]
          Length = 879

 Score = 67.4 bits (163), Expect = 8e-09
 Identities = 58/246 (23%), Positives = 114/246 (46%), Gaps = 8/246 (3%)
 Frame = +2

Query: 209  ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
            + +SSS++ S       + S+ A  +   S   +S+  +    S  P  S  ++  +++H
Sbjct: 296  SASSSSISSSSAPWSRKVSSRVASSSGPSSSGPSSSGPS----SSVPLSSKISSSVSSSH 351

Query: 389  QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRV-SFGVSETPKVSGEPFFKFGVS--- 556
            Q S+        S   S  +P+     +  PSSR+ S+ V+ +   S  P     +S   
Sbjct: 352  QSSSRVSSKVASSSGPSSSVPSSKAHSSGIPSSRIPSYNVTSSAPSSSAPISSIPISSAS 411

Query: 557  ----PNASAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLS 724
                P+++A  S A+++   +     + A+  S+ IS++S ++  +   P S   IS  S
Sbjct: 412  SSSAPSSNAPSSSASSSSASSSSAPISSASSSSAPISSASSSSAPSSSAPSSSAPISSAS 471

Query: 725  SLKSNPSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLA 904
            S  +  S   S  +  SS S SS P+    S+  ++ S+S +  P+S+ + +++ +SS +
Sbjct: 472  SSSAPSSSAPSSSAPISSASSSSAPSSNAPSSSASSSSASSSSAPISSASSSSAPISSAS 531

Query: 905  SAQVSS 922
            S+   S
Sbjct: 532  SSSAPS 537


>ref|XP_003757474.1| PREDICTED: nuclear pore complex protein Nup214 [Sarcophilus harrisii]
          Length = 2139

 Score = 66.6 bits (161), Expect = 1e-08
 Identities = 61/218 (27%), Positives = 95/218 (43%), Gaps = 10/218 (4%)
 Frame = +2

Query: 305  VTSAASNGLLVSVAPFGSLTAAYNNNTHQISTSPPVIFTPSVSVSDFIPTGNGSVTSDPS 484
            V + AS+   +SV      TAA  ++  Q +T+PP +  PS SV+   PT +G   +  +
Sbjct: 1610 VKAEASSLSTLSVPEQNEATAASASSVTQGTTAPPSVPPPSNSVASLTPTPSGFGAAAAA 1669

Query: 485  SRVSFGVSETPKVSG----EPFFKFGVSPNASAKVSEAATTIIPA--GDLKNNGATYLSS 646
                F  + T   +G    +P  +   +P+A++   + A T  P+  G      AT  ++
Sbjct: 1670 GTSVFSPTPTTSSTGSAFSQPASEAAPAPSATSVFGQLAATSAPSLFGQQTGTTATTTTA 1729

Query: 647  GISTSSPT-NFVAFGFP---GSGNMISGLSSLKSNPSEIISQGSLFSSTSKSSVPAFGTV 814
                SSP     AFG P   G G    G +     P+   + G  F+ T   SVPAFG  
Sbjct: 1730 TPQVSSPGFGSPAFGTPPTGGFGQAAFGQAPAFGQPASSSTSGFSFNQTGFGSVPAFGQP 1789

Query: 815  SNGLTTQSSSFAPPPLSNTNGTASFVSSLASAQVSSEM 928
            ++  T  SS       S+TN  +SF    +SA     +
Sbjct: 1790 ASSTTAVSSGNVFGASSSTNSASSFSFGQSSASTGGSL 1827


>ref|YP_335617.1| hemagglutinin-like protein [Burkholderia pseudomallei 1710b]
            gi|499645378|ref|WP_011326112.1| hemagglutinin
            [Burkholderia pseudomallei] gi|76582377|gb|ABA51851.1|
            haemagluttinin family protein [Burkholderia pseudomallei
            1710b]
          Length = 2757

 Score = 66.2 bits (160), Expect = 2e-08
 Identities = 65/247 (26%), Positives = 113/247 (45%), Gaps = 10/247 (4%)
 Frame = +2

Query: 212  TTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTHQ 391
            +TS+S  +S  + G    S     TN    +++++ S GL  + +   SL+ + +     
Sbjct: 469  STSASSGISTAQSGVNSLSTGLSTTNSTVASLSTSTSTGLSSATSSIASLSTSTSTGIGS 528

Query: 392  ISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSP---- 559
            +ST      +   S+S  + + N S+TS  S+  S G+S     SG      G+S     
Sbjct: 529  LSTGLSTTNSNVASLSSGLSSTNSSLTS-LSTSASSGISTAQ--SGVNSLSTGLSTTNST 585

Query: 560  ------NASAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGL 721
                  + S  +S A ++I       + G   LSSG+ST++             ++ SGL
Sbjct: 586  VASLSTSTSTGLSSATSSIASLSTSTSTGIGSLSSGLSTTN---------SNVASLSSGL 636

Query: 722  SSLKSNPSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSL 901
            SS  S+ + + +  S   ST++S V    ++S GL+T +S+ A    S + G +S  SS+
Sbjct: 637  SSTNSSLTSLSTSASSGISTAQSGV---NSLSTGLSTANSTVASLSTSTSTGLSSATSSI 693

Query: 902  ASAQVSS 922
            AS   S+
Sbjct: 694  ASLSTST 700



 Score = 58.9 bits (141), Expect = 3e-06
 Identities = 66/248 (26%), Positives = 113/248 (45%), Gaps = 11/248 (4%)
 Frame = +2

Query: 212  TTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTHQ 391
            +TS+S  +S  + G    S     TN    +++++ S GL  + +   SL+ + +     
Sbjct: 558  STSASSGISTAQSGVNSLSTGLSTTNSTVASLSTSTSTGLSSATSSIASLSTSTSTGIGS 617

Query: 392  ISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNASA 571
            +S+      +   S+S  + + N S+TS  S+  S G+S             GV+ + S 
Sbjct: 618  LSSGLSTTNSNVASLSSGLSSTNSSLTS-LSTSASSGISTAQS---------GVN-SLST 666

Query: 572  KVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSN---- 739
             +S A +T+     L  + +T LSS  +TSS  +       G G++ +GLS+  SN    
Sbjct: 667  GLSTANSTV---ASLSTSTSTGLSS--ATSSIASLSTSTSTGIGSLSTGLSTTNSNVASL 721

Query: 740  PSEIISQGSLFSSTSKSSVPAFGT-------VSNGLTTQSSSFAPPPLSNTNGTASFVSS 898
             S + S  S  +S S S+     T       +S GL+T +S+ A    S + G +S  SS
Sbjct: 722  SSGLSSTNSSLTSLSTSASSGISTAQSGVNSLSTGLSTTNSTVASLSTSTSTGLSSATSS 781

Query: 899  LASAQVSS 922
            +AS   S+
Sbjct: 782  IASLSTST 789



 Score = 58.9 bits (141), Expect = 3e-06
 Identities = 65/248 (26%), Positives = 109/248 (43%), Gaps = 11/248 (4%)
 Frame = +2

Query: 212  TTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTHQ 391
            +TS+S  +S  + G    S      N    +++++ S GL  + +   SL+ + +     
Sbjct: 647  STSASSGISTAQSGVNSLSTGLSTANSTVASLSTSTSTGLSSATSSIASLSTSTSTGIGS 706

Query: 392  ISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNASA 571
            +ST      +   S+S  + + N S+TS  S+  S G+S     SG      G+S   S 
Sbjct: 707  LSTGLSTTNSNVASLSSGLSSTNSSLTS-LSTSASSGISTAQ--SGVNSLSTGLSTTNST 763

Query: 572  KVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSN---- 739
              S + +T         + AT   + +STS+ T        G G++ +GLS+  SN    
Sbjct: 764  VASLSTSTSTGL-----SSATSSIASLSTSTST--------GIGSLSTGLSTTNSNVASL 810

Query: 740  PSEIISQGSLFSSTSKSSVPAFGT-------VSNGLTTQSSSFAPPPLSNTNGTASFVSS 898
             S + S  S  +S S S+     T       +S GL+T +S+ A    S + G +S  SS
Sbjct: 811  SSGLSSTNSSLTSLSTSASSGISTAQSGVNSLSTGLSTANSTVASLSTSTSTGLSSATSS 870

Query: 899  LASAQVSS 922
            +AS   S+
Sbjct: 871  IASLSTST 878



 Score = 58.5 bits (140), Expect = 4e-06
 Identities = 59/247 (23%), Positives = 110/247 (44%), Gaps = 14/247 (5%)
 Frame = +2

Query: 209  ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
            AT+S +   +    G    S     TN    +++++ S GL  + +   SL+ + + +  
Sbjct: 1965 ATSSIASLSTSTSTGIGSLSTGLSTTNSTVASLSTSTSTGLSSATSSIASLSTSTSTSIG 2024

Query: 389  QIST----SPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVS 556
             +ST    +   + + S S S  + + N S+ S  +S  +   S +  +S        +S
Sbjct: 2025 SLSTGLSTTDSTVASLSTSTSTGLSSANSSIASLSTSTSTGIGSLSTGLSTTDSTVASLS 2084

Query: 557  PNASAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPT-----NFVAFGFPGSGNMISGL 721
             + S  +S A ++I       +NG   LS+G+ST++ T        + G   + + I+ L
Sbjct: 2085 TSTSTGLSSATSSIASLSTSTSNGIGSLSTGLSTTNSTVASLSTSTSTGLSSATSSIASL 2144

Query: 722  SSLKSNPSEIISQG-----SLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTAS 886
            S+  S     +S G     S  +S S S+    G++S GL+T +S+ A    S + G  S
Sbjct: 2145 STSTSTGIGSLSTGLSTTNSTVASLSTSTSTGIGSLSTGLSTTNSTVASLSTSTSTGIGS 2204

Query: 887  FVSSLAS 907
              + L+S
Sbjct: 2205 LSTGLSS 2211



 Score = 57.8 bits (138), Expect = 6e-06
 Identities = 62/257 (24%), Positives = 108/257 (42%), Gaps = 38/257 (14%)
 Frame = +2

Query: 266  SQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTHQISTSPPVIFTPSVSVSDF 445
            S     TN    +++++ S GL ++ +   SL+ + +     +ST      +   S+S  
Sbjct: 1769 STGLSTTNSNVASLSTSTSTGLNLANSSIASLSTSTSTGIGSLSTGLSTTNSNVASLSTS 1828

Query: 446  IPTGNGSVTSDPSS----------RVSFGVSETPKVSGEPFFKFGVSPNASAKVSEAATT 595
              TG  S TS  +S           +S G+S T            +S + S  +S A ++
Sbjct: 1829 TSTGLSSATSSIASLSTSTSTGIGSLSTGLSTTDSTVAS------LSTSTSTGLSSANSS 1882

Query: 596  IIPAGDLKNNGATYLSSGIST--------------------SSPTNFVAFGFPGSGNMIS 715
            I       + G   LS+G+ST                    SS T+       G G++ +
Sbjct: 1883 IASLSTSTSTGIGSLSTGLSTTDSTVASLSTSTSTGLSSATSSITSLSTSTSTGIGSLST 1942

Query: 716  GLSSLKSNPSEIISQGSLFSSTSKSSVPA--------FGTVSNGLTTQSSSFAPPPLSNT 871
            GLS+  SN + + +  S   S++ SS+ +         G++S GL+T +S+ A    S +
Sbjct: 1943 GLSTTNSNVASLSTSTSTGLSSATSSIASLSTSTSTGIGSLSTGLSTTNSTVASLSTSTS 2002

Query: 872  NGTASFVSSLASAQVSS 922
             G +S  SS+AS   S+
Sbjct: 2003 TGLSSATSSIASLSTST 2019


>ref|WP_006396280.1| hemagglutinin [Burkholderia multivorans] gi|221169983|gb|EEE02449.1|
            hemagglutinin family protein [Burkholderia multivorans
            CGD1]
          Length = 2201

 Score = 65.9 bits (159), Expect = 2e-08
 Identities = 63/238 (26%), Positives = 109/238 (45%), Gaps = 5/238 (2%)
 Frame = +2

Query: 212  TTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTHQ 391
            +TS+S  +S         S      N    +++++AS G+  + +  GSL+ A +++   
Sbjct: 666  STSASTGISSANSSIGSLSTGLSSANSSVTSLSTSASTGISSANSSIGSLSTATSSSISS 725

Query: 392  ISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNASA 571
            +STS     + + S    + T   S  S  S+  S G+S      G      G+S   S+
Sbjct: 726  LSTSASTGISSANSSIGSLSTATSSSISSLSTSASTGISSANSSIGS--LSTGLSSTNSS 783

Query: 572  KV---SEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGS--GNMISGLSSLKS 736
                 S A+T I  A     + +T  SS IS+ S +         S  G++ +GLSS  S
Sbjct: 784  VTSLSSSASTGISSANSSIGSLSTATSSSISSLSTSASTGISSANSSIGSLSTGLSSTNS 843

Query: 737  NPSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASA 910
            + + + S  S   S++ SS+   G++S GL++ +SS      S + G AS  +S ++A
Sbjct: 844  SVTSLSSSASTGISSANSSI---GSLSTGLSSTNSSVTSLSTSTSTGIASLSTSTSTA 898



 Score = 60.8 bits (146), Expect = 7e-07
 Identities = 65/245 (26%), Positives = 120/245 (48%), Gaps = 2/245 (0%)
 Frame = +2

Query: 194  LQDVCATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAY 373
            +Q+V A   SS T ++   G  + + A+  T+  S   TS AS G+  + +  GSL+ A 
Sbjct: 523  VQNVAAGLVSS-TSTDAINGSQLYAVASTTTSSISSLSTS-ASTGISSANSSIGSLSTAT 580

Query: 374  NNNTHQISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGV 553
            +++   +STS     + + S    + TG  S  S  +S +S G+S T            +
Sbjct: 581  SSSISSLSTSASTGISSANSSIKSLSTGLSSTNSSVTS-LSTGLSSTNS------SVTSL 633

Query: 554  SPNASAKVSEAATTIIPAGDLKNNGATYLSS--GISTSSPTNFVAFGFPGSGNMISGLSS 727
            S +AS  +S A ++I   G L    ++  SS   +STS+ T  ++      G++ +GLSS
Sbjct: 634  SSSASTGISSANSSI---GSLSTGLSSTNSSVTSLSTSASTG-ISSANSSIGSLSTGLSS 689

Query: 728  LKSNPSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLAS 907
              S+ + + +  S   S++ SS+ +  T ++   +  S+ A   +S+ N +   +S+  S
Sbjct: 690  ANSSVTSLSTSASTGISSANSSIGSLSTATSSSISSLSTSASTGISSANSSIGSLSTATS 749

Query: 908  AQVSS 922
            + +SS
Sbjct: 750  SSISS 754


>dbj|BAH14734.1| unnamed protein product [Homo sapiens]
          Length = 566

 Score = 65.9 bits (159), Expect = 2e-08
 Identities = 67/239 (28%), Positives = 106/239 (44%), Gaps = 1/239 (0%)
 Frame = +2

Query: 209 ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
           +TTSS  + +         S+A+  TN +S T +S AS       +   S  +   N+  
Sbjct: 141 STTSSGASTATNSDSSTTSSEASTATNSESSTTSSGASTATNSESSTVSSRASTATNS-- 198

Query: 389 QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSE-TPKVSGEPFFKFGVSPNA 565
           + ST+     T + S S     G G+ T+  SS  S G S  T   S  P    G + N 
Sbjct: 199 ESSTTSSGASTATNSESRTTSNGAGTATNSESSTTSSGASTATNSESSTPSSGAGTATN- 257

Query: 566 SAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNPS 745
               SE++TT   AG   N+ ++ +SSGIST + +         S    SG ++  ++ S
Sbjct: 258 ----SESSTTSSGAGTATNSESSTVSSGISTVTNSE--------SSTPSSGANTATNSES 305

Query: 746 EIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASAQVSS 922
              S G+  ++ S SS  + G  S    ++SS+ +    + TN  +S  SS AS   +S
Sbjct: 306 STTSSGANTATNSDSSTTSSG-ASTATNSESSTTSSGASTATNSESSTTSSGASTATNS 363



 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 65/234 (27%), Positives = 103/234 (44%)
 Frame = +2

Query: 209 ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
           +TTSS  + +   +     S A   TN +S T +S A        +   S  +   N+  
Sbjct: 231 STTSSGASTATNSESSTPSSGAGTATNSESSTTSSGAGTATNSESSTVSSGISTVTNS-- 288

Query: 389 QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNAS 568
           + ST      T + S S    +G  + T+  SS  S G S T   S       G S   +
Sbjct: 289 ESSTPSSGANTATNSESSTTSSGANTATNSDSSTTSSGAS-TATNSESSTTSSGAS---T 344

Query: 569 AKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNPSE 748
           A  SE++TT   A    N+G++  SSG ST++ +         S  + SG S+  ++ S 
Sbjct: 345 ATNSESSTTSSGASTATNSGSSTTSSGTSTATNSE--------SSTVSSGASTATTSESS 396

Query: 749 IISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASA 910
             S G+  ++ S+SS     TVS+G +T ++S +    S  N   +  SS+ SA
Sbjct: 397 TTSSGASTATNSESS-----TVSSGASTATNSESSTTSSGANTATNSGSSVTSA 445


>dbj|BAG63124.1| unnamed protein product [Homo sapiens]
          Length = 550

 Score = 65.9 bits (159), Expect = 2e-08
 Identities = 67/239 (28%), Positives = 106/239 (44%), Gaps = 1/239 (0%)
 Frame = +2

Query: 209 ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
           +TTSS  + +         S+A+  TN +S T +S AS       +   S  +   N+  
Sbjct: 125 STTSSGASTATNSDSSTTSSEASTATNSESSTTSSGASTATNSESSTVSSRASTATNS-- 182

Query: 389 QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSE-TPKVSGEPFFKFGVSPNA 565
           + ST+     T + S S     G G+ T+  SS  S G S  T   S  P    G + N 
Sbjct: 183 ESSTTSSGASTATNSESRTTSNGAGTATNSESSTTSSGASTATNSESSTPSSGAGTATN- 241

Query: 566 SAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNPS 745
               SE++TT   AG   N+ ++ +SSGIST + +         S    SG ++  ++ S
Sbjct: 242 ----SESSTTSSGAGTATNSESSTVSSGISTVTNSE--------SSTPSSGANTATNSES 289

Query: 746 EIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASAQVSS 922
              S G+  ++ S SS  + G  S    ++SS+ +    + TN  +S  SS AS   +S
Sbjct: 290 STTSSGANTATNSDSSTTSSG-ASTATNSESSTTSSGASTATNSESSTTSSGASTATNS 347



 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 65/234 (27%), Positives = 103/234 (44%)
 Frame = +2

Query: 209 ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
           +TTSS  + +   +     S A   TN +S T +S A        +   S  +   N+  
Sbjct: 215 STTSSGASTATNSESSTPSSGAGTATNSESSTTSSGAGTATNSESSTVSSGISTVTNS-- 272

Query: 389 QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNAS 568
           + ST      T + S S    +G  + T+  SS  S G S T   S       G S   +
Sbjct: 273 ESSTPSSGANTATNSESSTTSSGANTATNSDSSTTSSGAS-TATNSESSTTSSGAS---T 328

Query: 569 AKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNPSE 748
           A  SE++TT   A    N+G++  SSG ST++ +         S  + SG S+  ++ S 
Sbjct: 329 ATNSESSTTSSGASTATNSGSSTTSSGTSTATNSE--------SSTVSSGASTATTSESS 380

Query: 749 IISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASA 910
             S G+  ++ S+SS     TVS+G +T ++S +    S  N   +  SS+ SA
Sbjct: 381 TTSSGASTATNSESS-----TVSSGASTATNSESSTTSSGANTATNSGSSVTSA 429


>ref|NP_001010909.2| mucin-21 precursor [Homo sapiens]
           gi|296439229|sp|Q5SSG8.2|MUC21_HUMAN RecName:
           Full=Mucin-21; Short=MUC-21; AltName: Full=Epiglycanin;
           Flags: Precursor
          Length = 566

 Score = 65.9 bits (159), Expect = 2e-08
 Identities = 67/239 (28%), Positives = 106/239 (44%), Gaps = 1/239 (0%)
 Frame = +2

Query: 209 ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
           +TTSS  + +         S+A+  TN +S T +S AS       +   S  +   N+  
Sbjct: 141 STTSSGASTATNSDSSTTSSEASTATNSESSTTSSGASTATNSESSTVSSRASTATNS-- 198

Query: 389 QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSE-TPKVSGEPFFKFGVSPNA 565
           + ST+     T + S S     G G+ T+  SS  S G S  T   S  P    G + N 
Sbjct: 199 ESSTTSSGASTATNSESRTTSNGAGTATNSESSTTSSGASTATNSESSTPSSGAGTATN- 257

Query: 566 SAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNPS 745
               SE++TT   AG   N+ ++ +SSGIST + +         S    SG ++  ++ S
Sbjct: 258 ----SESSTTSSGAGTATNSESSTVSSGISTVTNSE--------SSTPSSGANTATNSES 305

Query: 746 EIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASAQVSS 922
              S G+  ++ S SS  + G  S    ++SS+ +    + TN  +S  SS AS   +S
Sbjct: 306 STTSSGANTATNSDSSTTSSG-ASTATNSESSTTSSGASTATNSESSTTSSGASTATNS 363



 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 65/234 (27%), Positives = 103/234 (44%)
 Frame = +2

Query: 209 ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
           +TTSS  + +   +     S A   TN +S T +S A        +   S  +   N+  
Sbjct: 231 STTSSGASTATNSESSTPSSGAGTATNSESSTTSSGAGTATNSESSTVSSGISTVTNS-- 288

Query: 389 QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNAS 568
           + ST      T + S S    +G  + T+  SS  S G S T   S       G S   +
Sbjct: 289 ESSTPSSGANTATNSESSTTSSGANTATNSDSSTTSSGAS-TATNSESSTTSSGAS---T 344

Query: 569 AKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNPSE 748
           A  SE++TT   A    N+G++  SSG ST++ +         S  + SG S+  ++ S 
Sbjct: 345 ATNSESSTTSSGASTATNSGSSTTSSGTSTATNSE--------SSTVSSGASTATTSESS 396

Query: 749 IISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASA 910
             S G+  ++ S+SS     TVS+G +T ++S +    S  N   +  SS+ SA
Sbjct: 397 TTSSGASTATNSESS-----TVSSGASTATNSESSTTSSGANTATNSGSSVTSA 445


>gb|AAI05738.1| Mucin 21, cell surface associated [Homo sapiens]
           gi|111494067|gb|AAI05736.1| Mucin 21, cell surface
           associated [Homo sapiens] gi|159576609|dbj|BAF92842.1|
           mucin 21 [Homo sapiens] gi|194391398|dbj|BAG60817.1|
           unnamed protein product [Homo sapiens]
           gi|300295307|gb|ADJ96647.1| epiglycanin [Homo sapiens]
          Length = 566

 Score = 65.9 bits (159), Expect = 2e-08
 Identities = 67/239 (28%), Positives = 106/239 (44%), Gaps = 1/239 (0%)
 Frame = +2

Query: 209 ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
           +TTSS  + +         S+A+  TN +S T +S AS       +   S  +   N+  
Sbjct: 141 STTSSGASTATNSDSSTTSSEASTATNSESSTTSSGASTATNSESSTVSSRASTATNS-- 198

Query: 389 QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSE-TPKVSGEPFFKFGVSPNA 565
           + ST+     T + S S     G G+ T+  SS  S G S  T   S  P    G + N 
Sbjct: 199 ESSTTSSGASTATNSESRTTSNGAGTATNSESSTTSSGASTATNSESSTPSSGAGTATN- 257

Query: 566 SAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNPS 745
               SE++TT   AG   N+ ++ +SSGIST + +         S    SG ++  ++ S
Sbjct: 258 ----SESSTTSSGAGTATNSESSTVSSGISTVTNSE--------SSTPSSGANTATNSES 305

Query: 746 EIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASAQVSS 922
              S G+  ++ S SS  + G  S    ++SS+ +    + TN  +S  SS AS   +S
Sbjct: 306 STTSSGANTATNSDSSTTSSG-ASTATNSESSTTSSGASTATNSESSTTSSGASTATNS 363



 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 65/234 (27%), Positives = 103/234 (44%)
 Frame = +2

Query: 209 ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
           +TTSS  + +   +     S A   TN +S T +S A        +   S  +   N+  
Sbjct: 231 STTSSGASTATNSESSTPSSGAGTATNSESSTTSSGAGTATNSESSTVSSGISTVTNS-- 288

Query: 389 QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNAS 568
           + ST      T + S S    +G  + T+  SS  S G S T   S       G S   +
Sbjct: 289 ESSTPSSGANTATNSESSTTSSGANTATNSDSSTTSSGAS-TATNSESSTTSSGAS---T 344

Query: 569 AKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNPSE 748
           A  SE++TT   A    N+G++  SSG ST++ +         S  + SG S+  ++ S 
Sbjct: 345 ATNSESSTTSSGASTATNSGSSTTSSGTSTATNSE--------SSTVSSGASTATTSESS 396

Query: 749 IISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASA 910
             S G+  ++ S+SS     TVS+G +T ++S +    S  N   +  SS+ SA
Sbjct: 397 TTSSGASTATNSESS-----TVSSGASTATNSESSTTSSGANTATNSGSSVTSA 445


>gb|ESX03074.1| hypothetical protein HPODL_02382 [Ogataea parapolymorpha DL-1]
          Length = 2172

 Score = 65.5 bits (158), Expect = 3e-08
 Identities = 61/245 (24%), Positives = 102/245 (41%), Gaps = 7/245 (2%)
 Frame = +2

Query: 209  ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
            ++ + S + S     GY  S A  + N  S    SA S     S       T+A + +++
Sbjct: 359  SSATGSGSSSATSGSGYASSSATSNLNSSS----SATSGSSYASSTVSSGSTSATSGSSY 414

Query: 389  QIST--SPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPN 562
              ST  S     TP+ S          S +S  SS  S+G S +   SG  +        
Sbjct: 415  GSSTASSGSSSATPASSYGSSTAGSGSSSSSSASSNSSYGSSSSAATSGSSYASSSTGSG 474

Query: 563  ASAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMI-----SGLSS 727
            +S+  S ++     A    + G++   SG S+++P +       GSG+       S  SS
Sbjct: 475  SSSATSGSSYASSTASSGSSYGSSSTGSGSSSATPASSYGSSTAGSGSSSSSSAGSNSSS 534

Query: 728  LKSNPSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLAS 907
               + S  ++ GS + S+S SS  +    S+  ++  SS A   LS+    +S+ SS   
Sbjct: 535  SSGSSSSAVTSGSSYGSSSSSSATSSSNSSSSSSSYGSSTASSGLSSATSGSSYGSSSTG 594

Query: 908  AQVSS 922
            + +SS
Sbjct: 595  SGLSS 599



 Score = 60.8 bits (146), Expect = 7e-07
 Identities = 79/259 (30%), Positives = 120/259 (46%), Gaps = 22/259 (8%)
 Frame = +2

Query: 209  ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
            + + SS   +      Y  S A+  T+G + + +SA S     +     S ++A +N + 
Sbjct: 1205 SASGSSQITATSGASTYGSSSASTGTSGYNSSGSSATSGASTATSNQNSSSSSASSNGSS 1264

Query: 389  QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPK--VSGEPFFKF-GVSP 559
             ++ S     T S SVS    +G  SV++  SS VS GVS T    +S     +  GVS 
Sbjct: 1265 SVAVS-----TASGSVSSG-SSGASSVSTGSSSGVSSGVSSTASSAISNSASSQVTGVST 1318

Query: 560  NA---SAKVSEAATTIIP--AGDLKNNGATYLSSGI-STSSPTNFVAFGFPGSGNMISGL 721
            ++   SA  + +AT+  P  A    +N  T  SSG  S+SS TN    G  GS N  S  
Sbjct: 1319 SSAASSASSASSATSSDPNTASSGFSNSLTATSSGAGSSSSSTN----GSSGSSNTASSG 1374

Query: 722  SSLKSNPSEIISQGSLFSSTSK-----------SSVPAFGTVSNGLTTQS--SSFAPPPL 862
            SS  S  + + S GS +SS S            S++ +  T S+G T  S  S+ +    
Sbjct: 1375 SSYASGQTSVTSSGSTYSSASSGDPSGGSSQSTSALSSSATNSSGQTATSGQSTRSGSSS 1434

Query: 863  SNTNGTASFVSSLASAQVS 919
            ++++GT+S  SS  SA  S
Sbjct: 1435 ASSSGTSSGTSSGTSAASS 1453


>ref|YP_001583453.1| hemagluttinin domain-containing protein [Burkholderia multivorans
            ATCC 17616] gi|189353792|ref|YP_001949419.1|
            membrane-anchored cell surface protein [Burkholderia
            multivorans ATCC 17616] gi|501172632|ref|WP_012216420.1|
            hemagglutinin [Burkholderia multivorans]
            gi|160344076|gb|ABX17161.1| Haemagluttinin domain protein
            [Burkholderia multivorans ATCC 17616]
            gi|189337814|dbj|BAG46883.1| putative membrane-anchored
            cell surface protein [Burkholderia multivorans ATCC
            17616]
          Length = 2505

 Score = 64.3 bits (155), Expect = 7e-08
 Identities = 64/250 (25%), Positives = 118/250 (47%), Gaps = 13/250 (5%)
 Frame = +2

Query: 212  TTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTHQ 391
            +TS+S  +S         S     TN    +++++AS G+  + +  GSL+ A +++   
Sbjct: 620  STSASTGISSANSSIGSLSTGLSSTNSSVTSLSTSASTGISSANSSIGSLSTATSSSISS 679

Query: 392  ISTSPPVIFTPSVSVSDFIPTG----NGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSP 559
            +STS     + + S    + TG    N SVTS  SS  S G+S      G        + 
Sbjct: 680  LSTSASTGISSANSSIKSLSTGLSSTNSSVTS-LSSSASTGISSANSSIGSLSTGLSSTN 738

Query: 560  NASAKVSEAATTIIPAGDLKNNGATYLSSGIS---------TSSPTNFVAFGFPGSGNMI 712
            ++   +S +A+T I +    N+    LS+G+S         +SS +  ++      G++ 
Sbjct: 739  SSVTSLSTSASTGISSA---NSSIGSLSTGLSSTNSSVTSLSSSASTGISSANSSIGSLS 795

Query: 713  SGLSSLKSNPSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFV 892
            +GLSS  S+ + + +  S   S++ SS+   G++S GL++ +SS      S + G +S  
Sbjct: 796  TGLSSTNSSVTSLSTSASTGISSANSSI---GSLSTGLSSTNSSVTSLSTSASTGISSAN 852

Query: 893  SSLASAQVSS 922
            SS+ S   ++
Sbjct: 853  SSIGSLSTAT 862



 Score = 57.4 bits (137), Expect = 8e-06
 Identities = 67/237 (28%), Positives = 111/237 (46%), Gaps = 9/237 (3%)
 Frame = +2

Query: 224  SMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTHQISTS 403
            S T ++   G  + + A+  T+  +   TSA S G+  + +  GSL+ A +++   +STS
Sbjct: 532  SATSTDAINGSQLYAVASTTTSSIASLSTSA-STGISSANSSIGSLSTATSSSIASLSTS 590

Query: 404  PPVIFTPSVSVSDFIPTG----NGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNASA 571
                 + + S    + TG    N SVTS  S+  S G+S      G      G+S   S+
Sbjct: 591  TSTGISSANSSIGSLSTGLSSTNSSVTS-LSTSASTGISSANSSIGS--LSTGLSSTNSS 647

Query: 572  KVS---EAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSG--NMISGLSSLKS 736
              S    A+T I  A     + +T  SS IS+ S +         S   ++ +GLSS  S
Sbjct: 648  VTSLSTSASTGISSANSSIGSLSTATSSSISSLSTSASTGISSANSSIKSLSTGLSSTNS 707

Query: 737  NPSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLAS 907
            + + + S  S   S++ SS+   G++S GL++ +SS      S + G +S  SS+ S
Sbjct: 708  SVTSLSSSASTGISSANSSI---GSLSTGLSSTNSSVTSLSTSASTGISSANSSIGS 761


>ref|WP_006414054.1| hemagglutinin [Burkholderia multivorans] gi|400228636|gb|EJO58552.1|
            hemagglutinin [Burkholderia multivorans CF2]
          Length = 2560

 Score = 63.9 bits (154), Expect = 9e-08
 Identities = 68/254 (26%), Positives = 117/254 (46%), Gaps = 17/254 (6%)
 Frame = +2

Query: 212  TTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTHQ 391
            +TS+S  +S         S     TN    +++S+AS G+  + +  GSL+ A +++   
Sbjct: 591  STSASTGISSANSSIGSLSTGLSSTNSSVTSLSSSASTGISSTNSSIGSLSTATSSSISS 650

Query: 392  ISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNASA 571
            +STS     + + S    + TG  S  S  +S +S G+S T            +S +AS 
Sbjct: 651  LSTSASTGISSANSSIKSLSTGLSSTNSSVTS-LSTGLSSTNS------SVTSLSSSAST 703

Query: 572  KVSEAATTI--IPAG-DLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNP 742
             +S A ++I  +  G    N+  T LSS  ST      ++      G++ +GLSS  S+ 
Sbjct: 704  GISSANSSIGSLSTGLSSTNSSVTSLSSSASTG-----ISSANSSIGSLSTGLSSTNSSV 758

Query: 743  SEIISQGSLFSSTSKSSVPAFGT------------VSNGLTTQSSSFA--PPPLSNTNGT 880
            + + S  S   S++ SS+ +  T             S G+++ +SS       LS+TN +
Sbjct: 759  TSLSSSASTGISSANSSISSLSTATSSSISSLSTSASTGISSANSSIGSLSTGLSSTNSS 818

Query: 881  ASFVSSLASAQVSS 922
             + +SS AS  +SS
Sbjct: 819  VTSLSSSASTGISS 832


>ref|XP_003866329.1| adhesin-like protein [Candida orthopsilosis Co 90-125]
           gi|380350667|emb|CCG20889.1| adhesin-like protein
           [Candida orthopsilosis Co 90-125]
          Length = 939

 Score = 63.9 bits (154), Expect = 9e-08
 Identities = 60/242 (24%), Positives = 106/242 (43%), Gaps = 4/242 (1%)
 Frame = +2

Query: 209 ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNNNTH 388
           +++++  T S+P       +    DT+  S T + ++S+    S +   S  ++ + +T 
Sbjct: 56  SSSTAQPTSSQPTSNSDTDTDTDTDTDTGSTTTSLSSSSSSSSSSSTENSFPSSPSQSTS 115

Query: 389 QISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPNAS 568
           + S+      TPS +VS  IPT     T+ P+   S     TP  + E    F  SP++S
Sbjct: 116 ETSSFSSNESTPS-TVSSSIPTTLSDPTTSPTEGSSEPAPSTPSTTSESTSSFVSSPSSS 174

Query: 569 AKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNPSE 748
           + VS ++++I        + +   SS +S S P    +   P S +  S  SS   + S 
Sbjct: 175 SSVSSSSSSI------PTSSSDTSSSSMSNSIPVETTSSTIPSSSSSSSSSSSSSESIST 228

Query: 749 IISQGSLF----SSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASAQV 916
             S+ S      SST  SS P   +  +  ++ S +F P   S  + ++   SS  S+  
Sbjct: 229 TSSESSTSLVEPSSTDISSDPPSSSSDSSSSSSSDTFIPSSSSEFSSSSESSSSFPSSSS 288

Query: 917 SS 922
            S
Sbjct: 289 ES 290


>ref|XP_006671739.1| class III chitinase ChiA1 [Cordyceps militaris CM01]
            gi|346320516|gb|EGX90116.1| class III chitinase ChiA1
            [Cordyceps militaris CM01]
          Length = 897

 Score = 63.9 bits (154), Expect = 9e-08
 Identities = 66/243 (27%), Positives = 112/243 (46%)
 Frame = +2

Query: 200  DVCATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDTVTSAASNGLLVSVAPFGSLTAAYNN 379
            D   TT+++ T +         S     T   S T T+ +S+    S+    S + + ++
Sbjct: 349  DHTTTTTTTTTTTTTSTTSSTTSTTTTTTTSSSSTTTTTSSSS---SIPTSTSTSTSTSS 405

Query: 380  NTHQISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSP 559
            +T   STS     + S S S  IPTG    +S  SS V  G S +   S  P    G S 
Sbjct: 406  STSSQSTSSSSTSSVSSSSSSAIPTG----SSSSSSVVPTGSSSSSSSSVVP---TGSSS 458

Query: 560  NASAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSN 739
            +A    S ++++++P G   ++ ++ + +G S+SS    V  G   S + +    S  S+
Sbjct: 459  SALPTGSSSSSSVVPTGSSSSSSSSVVPTGSSSSS---VVPTGSSSSSSSVVPTGSSSSS 515

Query: 740  PSEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPPPLSNTNGTASFVSSLASAQVS 919
             S +   GS  SS+S S VP  G+ S+  + Q+SS        T  ++S  S++ ++Q S
Sbjct: 516  GS-VAPTGSSSSSSSSSVVPTGGSSSSSSSVQTSSSTSSSAVPTGSSSS--SAVPTSQSS 572

Query: 920  SEM 928
            S +
Sbjct: 573  SSI 575



 Score = 60.8 bits (146), Expect = 7e-07
 Identities = 66/231 (28%), Positives = 108/231 (46%), Gaps = 6/231 (2%)
 Frame = +2

Query: 209  ATTSSSMTVSEPEKGGYMKSQAAEDTNGKSDT--VTSAASNGLLVSVAPFGSLTAAYNNN 382
            ++TSS  T S         S +A  T   S +  V + +S+    SV P GS ++A    
Sbjct: 405  SSTSSQSTSSSSTSSVSSSSSSAIPTGSSSSSSVVPTGSSSSSSSSVVPTGSSSSALPTG 464

Query: 383  THQISTSPPVIFTPSVSVSDFIPTGNGSVTSDPSSRVSFGVSETPKVSGEPFFKFGVSPN 562
            +   S+  P   + S S S  +PTG+ S +  P+   S   S  P  +G       V+P 
Sbjct: 465  SSSSSSVVPT-GSSSSSSSSVVPTGSSSSSVVPTGSSSSSSSVVP--TGSSSSSGSVAPT 521

Query: 563  ASAKVSEAATTIIPAGDLKNNGATYLSSGISTSSPTNFVAFGFPGSGNMISGLSSLKSNP 742
             S+  S ++++++P G     G++  SS + TSS T+  A   P   +  S + + +S+ 
Sbjct: 522  GSSS-SSSSSSVVPTG-----GSSSSSSSVQTSSSTSSSAV--PTGSSSSSAVPTSQSSS 573

Query: 743  SEIISQGSLFSSTSKSSVPAFGTVSNGLTTQSSSFAPP----PLSNTNGTA 883
            S  IS G+ F +++ SS       S  LTT  ++ APP    P +NT  T+
Sbjct: 574  SISISTGTGFPTSASSS-------SVILTTGPATTAPPVTSAPWTNTTTTS 617


Top