BLASTX nr result

ID: Sinomenium21_contig00007522 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00007522
         (928 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|WP_000782068.1| hypothetical protein [Helicobacter pylori] g...    67   1e-08
gb|EGI64209.1| Sarcalumenin [Acromyrmex echinatior]                    66   2e-08
ref|XP_001305920.1| hypothetical protein [Trichomonas vaginalis ...    64   1e-07
ref|XP_006404744.1| hypothetical protein EUTSA_v10000072mg [Eutr...    63   2e-07
ref|WP_000782079.1| hypothetical protein [Helicobacter pylori] g...    62   3e-07
gb|EFA06237.1| hypothetical protein TcasGA2_TC009087 [Tribolium ...    62   4e-07
ref|XP_001807607.1| PREDICTED: similar to CG6599 CG6599-PA, part...    62   4e-07
ref|WP_021308298.1| hypothetical protein [Helicobacter pylori] g...    61   5e-07
gb|ETP71391.1| hypothetical protein UYO_2646 [Lachnospiraceae ba...    61   7e-07
ref|YP_005783309.1| poly E-rich protein [Helicobacter pylori 201...    61   7e-07
ref|WP_022371394.1| hypothetical protein [Firmicutes bacterium C...    60   9e-07
gb|ACN43225.1| neurofilament medium protein [Pan troglodytes]          60   1e-06
ref|YP_005791151.1| poly E-rich protein [Helicobacter pylori 201...    60   2e-06
ref|WP_000782080.1| hypothetical protein [Helicobacter pylori]         59   2e-06
ref|XP_001351647.1| conserved Plasmodium protein, unknown functi...    59   2e-06
ref|XP_001301665.1| Beige/BEACH domain containing protein [Trich...    59   3e-06
ref|XP_001324167.1| hypothetical protein [Trichomonas vaginalis ...    59   4e-06
gb|ETW48435.1| hypothetical protein PFMALIP_03520 [Plasmodium fa...    58   5e-06
ref|WP_000782036.1| hypothetical protein [Helicobacter pylori] g...    58   5e-06
ref|XP_002046645.1| GJ12366 [Drosophila virilis] gi|194153803|gb...    58   5e-06

>ref|WP_000782068.1| hypothetical protein [Helicobacter pylori]
           gi|407221389|gb|EKE91194.1| putative poly E-rich protein
           [Helicobacter pylori R046Wa]
          Length = 473

 Score = 67.0 bits (162), Expect = 1e-08
 Identities = 69/290 (23%), Positives = 134/290 (46%), Gaps = 17/290 (5%)
 Frame = +3

Query: 48  KFALQEICQASEDSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDGLKDGDKVDAFGK 227
           K  ++E  Q  E    KE P  EE      + + QE +T + +EV+  L+  ++V    +
Sbjct: 171 KEEIKETPQKEEKEEIKETPQEEEK---PKDDETQESETPKDEEVSKELETQEEVKEETQ 227

Query: 228 QAPVEKKGENIKPE--DEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQELET 401
           +   E+  E +K E  +EQ+ +   +  E + ++E  + E   S        P+ QELE 
Sbjct: 228 EEVKEETQEEVKEETQEEQVKEQEPIKEETQEIKEEKQEETQDS--------PSAQELEA 279

Query: 402 LGRHDEVIEQMKQNAAEFSENKKATIEENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQ 581
           +    E+++++++N +   E+KK T +EN +     P  +   ++ + D+ I E    +Q
Sbjct: 280 M---QELVKEIQEN-SNGQEDKKET-QENAETPQETPQDVENQAQEIQDKEIQE----TQ 330

Query: 582 SIEEEKEINTLQRDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHDQ------EPLSATA 743
            I+E  EI    ++ET+ +          +++ +TPQ  ET   H +      EP+ A A
Sbjct: 331 EIQENTEI---PQEETQEE---------IQENAETPQEKETQEDHYESIEDIPEPVMAKA 378

Query: 744 ---------DYVEQSLKTDVATGGEYENIEDIKDKSNEQESCNKEESTDK 866
                    + V ++  ++ AT    E++ +     N  E+  ++E +DK
Sbjct: 379 MGEELPFLNEAVAKTPNSENATETPKESVTETSKNENNTETPQEKEQSDK 428


>gb|EGI64209.1| Sarcalumenin [Acromyrmex echinatior]
          Length = 899

 Score = 66.2 bits (160), Expect = 2e-08
 Identities = 59/268 (22%), Positives = 114/268 (42%), Gaps = 4/268 (1%)
 Frame = +3

Query: 45  AKFALQEICQASEDSGNK----EKPSHEENYTLKNETKLQEDQTDESDEVTDGLKDGDKV 212
           A+ A +   +A ED G +    EK +  E    + + KL E+++DE  +V D   + ++V
Sbjct: 156 AEKAEEASTEAIEDEGEQIDSSEKVTESEEARTEEDEKLDEEKSDEETKVKDEKFEEEEV 215

Query: 213 DAFGKQAPVEKKGENIKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQE 392
           D   K    +++ +  + EDE+  +S E  TEE  VE +   E+  +++ ++  +    +
Sbjct: 216 DDEAKSEEEKEEADESQAEDEEQKESKEA-TEEVEVESNEEIEIQQTSMEASEEVEEESD 274

Query: 393 LETLGRHDEVIEQMKQNAAEFSENKKATIEENLQRGNVMPPRLLKSSEAVDDEPILEMIC 572
            E +    +  E  +   +  +E++K+ IEE  Q  +       K  E+ +D        
Sbjct: 275 KEEVDSKTDEAESTEDQVS--AEDEKSDIEEKTQDESAEAESEEKDKESAEDTETAS--- 329

Query: 573 VSQSIEEEKEINTLQRDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHDQEPLSATADYV 752
            S+ +EEE +    +    E ++   +      ++    +  ET    D+          
Sbjct: 330 -SEDVEEETKSEEAEAKFAEEEEKEDEDTTASTEEEMKSEKEETKSEEDESKSEEAKSTE 388

Query: 753 EQSLKTDVATGGEYENIEDIKDKSNEQE 836
           E+  KT+   G   E  ED K +  E E
Sbjct: 389 EEDTKTEEEDGKSEE--EDAKSEGEENE 414


>ref|XP_001305920.1| hypothetical protein [Trichomonas vaginalis G3]
            gi|121887466|gb|EAX92990.1| hypothetical protein
            TVAG_261690 [Trichomonas vaginalis G3]
          Length = 952

 Score = 63.5 bits (153), Expect = 1e-07
 Identities = 62/263 (23%), Positives = 119/263 (45%), Gaps = 1/263 (0%)
 Frame = +3

Query: 81   EDSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDGLKDGDKVDAFGKQAPVEKKGENI 260
            E +  +E+   EEN   K + +++++   E +E  + L + +K++        E K ENI
Sbjct: 475  EQNVEEEEEKIEENQEDKEQIEIEQENVQEEEENKEELNEEEKIE--------ESKEENI 526

Query: 261  KPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQELETLGRHDEVIEQMKQ 440
                EQ+ + ++   E+K  EE+ + E               +E++      E I + ++
Sbjct: 527  CENKEQIEEENDTLIEQK--EENVQEE---------------EEIKEELNEKENICEKEE 569

Query: 441  NAAEFSENKKATIEENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQSIEEEKEINTLQR 620
               E +ENK    EENLQ+      +  K+ E  ++E I E +C  Q+  +E+EIN  Q+
Sbjct: 570  KVDELNENK----EENLQQ------KEEKTEEQKNEENIEEDVCEKQN--DEEEINEQQK 617

Query: 621  DE-TEGKKSCIDLNLGPRDDTQTPQTAETTMGHDQEPLSATADYVEQSLKTDVATGGEYE 797
            +E  + +K  I+LN+            E     + E + +   + E  L+ +  T  E +
Sbjct: 618  EEIKDDEKENINLNV------------EEEENKENEEVKSREIHFEVELEKEQLTPEEEK 665

Query: 798  NIEDIKDKSNEQESCNKEESTDK 866
             IE  K++    E  N++   ++
Sbjct: 666  EIESNKEEEEIDEEINQQNEEEE 688


>ref|XP_006404744.1| hypothetical protein EUTSA_v10000072mg [Eutrema salsugineum]
            gi|557105872|gb|ESQ46197.1| hypothetical protein
            EUTSA_v10000072mg [Eutrema salsugineum]
          Length = 666

 Score = 62.8 bits (151), Expect = 2e-07
 Identities = 61/266 (22%), Positives = 116/266 (43%), Gaps = 3/266 (1%)
 Frame = +3

Query: 78   SEDSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDGLKDGDKVDAFGKQAPVEKKGEN 257
            SE+   +E  S EEN   + ETK +E+ + + +      +  DK ++  ++   EK+ E 
Sbjct: 372  SEEKEKEESSSQEENKEKETETKDKEESSSQEENKEKETETKDKEESSSQEERKEKETEK 431

Query: 258  IKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQELETLGRHDEVIEQMK 437
            I+ E+    +  E+   EK  +E        S+    +   + +++E  G      E+ K
Sbjct: 432  IEKEESSSKEEKEVKETEKLEKEE------ESSSQEKNEDKDTEKIEKEGESSSQ-EESK 484

Query: 438  QNAAEFSENKKATIEENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQSIEEEKEINTLQ 617
                E  E ++++ +E  +          K +E  + E  L     SQ   + KE  T +
Sbjct: 485  DKETETKEKEESSSQEETKD---------KGTETKEKEESL-----SQEESKHKETETKE 530

Query: 618  RDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHDQEPLSATADYVEQS-LKTDVATGG-- 788
            ++E+  ++   D       +T+T +  E++    Q   S   + VEQ+  KTD  T G  
Sbjct: 531  KEESSSQEESRD------KETETKEKEESSSNDSQGNESEKKEQVEQNEKKTDDDTNGST 584

Query: 789  EYENIEDIKDKSNEQESCNKEESTDK 866
            +  ++ DI+ K +E  S   +  T+K
Sbjct: 585  KENDVTDIEQKQSEDTSETSQTETEK 610


>ref|WP_000782079.1| hypothetical protein [Helicobacter pylori]
           gi|393029268|gb|EJB30349.1| poly E-rich protein
           [Helicobacter pylori NQ4099]
          Length = 521

 Score = 62.0 bits (149), Expect = 3e-07
 Identities = 66/277 (23%), Positives = 129/277 (46%), Gaps = 8/277 (2%)
 Frame = +3

Query: 12  EGESFLESPSLAKFALQEICQASEDSGNKEKPSHEENYTLKNETKLQEDQ----TDESDE 179
           EGE+ L+  +  +   +E+ +  E+   KEK    EN   + + K  E Q    T + +E
Sbjct: 169 EGET-LKEETQEEIKKEEVKEMQEEIKEKEKQEVAENPQDEEKPKDDETQGSVETPKDEE 227

Query: 180 VTDGLKDGDKVDAFGKQAPVEKKGENIKPEDEQLAKSSELFTEEKNVEEHARNELFSSTI 359
           V+  L+  ++V+      P E+K E    ++++  K  E   E++ ++E  + E+     
Sbjct: 228 VSKELETQEQVET-----PKEEKQEQEPIKEQEPIKEQEPIKEQEPIKEQTQ-EIKEEKQ 281

Query: 360 PSASAIPNLQELETLGRHDEVIEQMKQNAAEFSENKKATIEE-NLQRGNVMPPRLLKSSE 536
                 P+ QELE +    E+++++++N+ +  ENKK T E   + +   +   + K ++
Sbjct: 282 EKTQDSPSTQELEAM---QELVKEIQENSND-QENKKETQENAEIPQDKEIQEVVTKKTQ 337

Query: 537 AVDDEPILEMICVSQSIEEEKEINTLQRDETEGKKSCIDLNLGPRDDTQTPQ--TAETTM 710
           A + E   E    S    +E + + L++ E       +++      +TQ  Q   AE T 
Sbjct: 338 AQELEIPKEKTQESAEALQETQAHELEKQEIAETPQDVEVPQSQEKETQETQEVVAEKTQ 397

Query: 711 GHDQE-PLSATADYVEQSLKTDVATGGEYENIEDIKD 818
             ++E P   T +  +++ + D      YENIEDI +
Sbjct: 398 SQEKETPQEETQEAQDETPQED-----HYENIEDIPE 429


>gb|EFA06237.1| hypothetical protein TcasGA2_TC009087 [Tribolium castaneum]
          Length = 1411

 Score = 61.6 bits (148), Expect = 4e-07
 Identities = 75/287 (26%), Positives = 131/287 (45%), Gaps = 29/287 (10%)
 Frame = +3

Query: 84   DSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDGLKDGDKVDAFGKQAPVEKKGENI- 260
            D   KE+ S EE+   K+E  L+E +T +  E+ D  + G KV    K   VE+K +++ 
Sbjct: 765  DQTTKEEKSVEES---KDENDLKEVETTQEFEIVDENEKGGKVVTEQKLEIVEEKEKSLE 821

Query: 261  KPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQELETLGRHDEVIEQM-- 434
            KPEDE+     E  +EEKNVE  ++N            +   +E ET+    E  E+   
Sbjct: 822  KPEDEEKQVIEEKSSEEKNVEGESKN------AEETEHVTQKKEDETIVSKKEEFEEKTD 875

Query: 435  ---KQNAAEFSENK----------KATIEENL----QRGNVMPPRLLKSSE---AVDDEP 554
               + NAAE  ++K          K+  E+NL    +   +   ++ K SE     + E 
Sbjct: 876  KEERSNAAETLDDKPPGNGEDVSAKSETEKNLDEKAKEKKLKKQKMKKRSERKKRQNGEQ 935

Query: 555  ILEMICVSQSIEEEKEINTLQR--DETEGKKSCIDLNLGPRDDTQTPQTA--ETTMGHDQ 722
              E+I  + + +E  ++ TL+R  D     K+     L P+      +++  E +   + 
Sbjct: 936  AYEIIYKTMAADESDDL-TLERLSDRDNKTKTAKREKLRPQKTISGSKSSQEEYSSASET 994

Query: 723  EPLSATADYVEQSLKTDVATGGEYENIEDIKDKSNE--QESCNKEES 857
            E  ++T+  V++     +    E E IE++K K  +  Q++ NK+ S
Sbjct: 995  ESSTSTSATVKEHKSFRILNDKEAEEIENLKPKPRKISQKTQNKQRS 1041


>ref|XP_001807607.1| PREDICTED: similar to CG6599 CG6599-PA, partial [Tribolium castaneum]
          Length = 1334

 Score = 61.6 bits (148), Expect = 4e-07
 Identities = 75/287 (26%), Positives = 131/287 (45%), Gaps = 29/287 (10%)
 Frame = +3

Query: 84   DSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDGLKDGDKVDAFGKQAPVEKKGENI- 260
            D   KE+ S EE+   K+E  L+E +T +  E+ D  + G KV    K   VE+K +++ 
Sbjct: 688  DQTTKEEKSVEES---KDENDLKEVETTQEFEIVDENEKGGKVVTEQKLEIVEEKEKSLE 744

Query: 261  KPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQELETLGRHDEVIEQM-- 434
            KPEDE+     E  +EEKNVE  ++N            +   +E ET+    E  E+   
Sbjct: 745  KPEDEEKQVIEEKSSEEKNVEGESKN------AEETEHVTQKKEDETIVSKKEEFEEKTD 798

Query: 435  ---KQNAAEFSENK----------KATIEENL----QRGNVMPPRLLKSSE---AVDDEP 554
               + NAAE  ++K          K+  E+NL    +   +   ++ K SE     + E 
Sbjct: 799  KEERSNAAETLDDKPPGNGEDVSAKSETEKNLDEKAKEKKLKKQKMKKRSERKKRQNGEQ 858

Query: 555  ILEMICVSQSIEEEKEINTLQR--DETEGKKSCIDLNLGPRDDTQTPQTA--ETTMGHDQ 722
              E+I  + + +E  ++ TL+R  D     K+     L P+      +++  E +   + 
Sbjct: 859  AYEIIYKTMAADESDDL-TLERLSDRDNKTKTAKREKLRPQKTISGSKSSQEEYSSASET 917

Query: 723  EPLSATADYVEQSLKTDVATGGEYENIEDIKDKSNE--QESCNKEES 857
            E  ++T+  V++     +    E E IE++K K  +  Q++ NK+ S
Sbjct: 918  ESSTSTSATVKEHKSFRILNDKEAEEIENLKPKPRKISQKTQNKQRS 964


>ref|WP_021308298.1| hypothetical protein [Helicobacter pylori]
           gi|529186345|gb|EPZ74929.1| hypothetical protein
           N206_00115 [Helicobacter pylori UM111]
          Length = 508

 Score = 61.2 bits (147), Expect = 5e-07
 Identities = 64/258 (24%), Positives = 113/258 (43%), Gaps = 9/258 (3%)
 Frame = +3

Query: 72  QASEDSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDGLKDGDKVDAFGKQAPVEKKG 251
           +  E+   KE P  EE      + ++QE +T +++EV+  L+   +++   K+   E K 
Sbjct: 181 EVKEEEEVKETPQEEEK---PKDNEIQEGETQKNEEVSKELETQKELE-IPKEETQEIKE 236

Query: 252 ENIKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQEL------ETLGRH 413
           E  +   EQ AK  E   EE    +  + E    + PSA  +  +QEL       + G+ 
Sbjct: 237 EKQEKTQEQ-AKEQEPIKEETQENKEEKQEKTQDS-PSAQELEAMQELVKEIQENSNGQE 294

Query: 414 DEVIEQMKQNAAEFSENKKATIE---ENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQS 584
           D+  ++ ++N     E +K  +E   E   + N   P+ L+  E    E   E     Q 
Sbjct: 295 DK--KETQENTETPQEKEKQDVETPQEKETQENTETPQELEKQELETQEKTQESAETPQE 352

Query: 585 IEEEKEINTLQRDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHDQEPLSATADYVEQSL 764
            +E++++ T Q  ET+             ++T+TPQ  E      QE    +A+  ++  
Sbjct: 353 -KEKQDVETPQEKETQ-------------ENTETPQELEKQELETQEKTQESAETPQEKT 398

Query: 765 KTDVATGGEYENIEDIKD 818
           +   A    YE+IEDI +
Sbjct: 399 QKLEAQKDHYESIEDIPE 416


>gb|ETP71391.1| hypothetical protein UYO_2646 [Lachnospiraceae bacterium JC7]
          Length = 1022

 Score = 60.8 bits (146), Expect = 7e-07
 Identities = 65/281 (23%), Positives = 116/281 (41%), Gaps = 3/281 (1%)
 Frame = +3

Query: 30   ESPSLAKFALQEICQASEDSGNKEKPSHEENYTLKNETK---LQEDQTDESDEVTDGLKD 200
            ES + ++  L+E  +   +  N EK +  E  T+  E++     ++ + E DEV+DG+K+
Sbjct: 317  ESETKSETELEETEEKESEEENSEKVAKSEKDTVAEESENDSKDDEVSAEPDEVSDGVKE 376

Query: 201  GDKVDAFGKQAPVEKKGENIKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIP 380
              + +A  ++A  E +  +      ++   +   TEEK  EE    E+  S   + +   
Sbjct: 377  SAESEAAEEKAETESENSDESETKSEVGSGTGPETEEKESEEENSEEIAESEKDTIT--- 433

Query: 381  NLQELETLGRHDEVIEQMKQNAAEFSENKKATIEENLQRGNVMPPRLLKSSEAVDDEPIL 560
              +E E + + DEV  +  + + E  E                      S+E+ D E   
Sbjct: 434  --EESEKISKDDEVSAESDEASDEVEE----------------------SAESEDTESTK 469

Query: 561  EMICVSQSIEEEKEINTLQRDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHDQEPLSAT 740
            E    ++S  E  E   +  DE E +   +D +     DT+ P+T ET +G + E LS  
Sbjct: 470  E----TESDNESTESEDIAEDEEESEAEKMDAS---EKDTE-PET-ETVVGKESEDLSEE 520

Query: 741  ADYVEQSLKTDVATGGEYENIEDIKDKSNEQESCNKEESTD 863
                  S  T+  +G   E  E + +   E+     E   D
Sbjct: 521  QP-ESDSNDTEEVSGDTVEEQETVSEPEEEKSEAAAETEID 560


>ref|YP_005783309.1| poly E-rich protein [Helicobacter pylori 2017]
           gi|504349178|ref|WP_014536280.1| hypothetical protein
           [Helicobacter pylori] gi|325997205|gb|ADZ49413.1| poly
           E-rich protein [Helicobacter pylori 2017]
          Length = 499

 Score = 60.8 bits (146), Expect = 7e-07
 Identities = 63/252 (25%), Positives = 114/252 (45%), Gaps = 5/252 (1%)
 Frame = +3

Query: 78  SEDSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDGLKDGDKVDAFGKQAPV--EKKG 251
           +E  G   K   +E    +   ++QE+  +  +EV +  K     +   ++ P   E +G
Sbjct: 166 NEQEGETPKEEAQEEVKKEEVKEMQEEVKEMQEEVKEKQKQEVAENPQDEEKPKDDETQG 225

Query: 252 ENIKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQELETLGRHDEVIEQ 431
               P+DE+++K  EL T+E+   +    E+           PN+QELE +    E++++
Sbjct: 226 SVEPPKDEEVSK--ELETQEQEPIKEETQEIKEEKQEKTQDSPNVQELEAM---QELVKE 280

Query: 432 MKQNAAEFSENKKATIE--ENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQSIEEEKEI 605
           +++N+ +  ENKK T E  EN +    +  + L+  +  + + I E    +Q +E+E+  
Sbjct: 281 IQENSND-QENKKETQETQENTETPQDIETQELEIPKEEETQEIAEK-TQAQGLEKEEIA 338

Query: 606 NTLQRDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHD-QEPLSATADYVEQSLKTDVAT 782
            T Q  E +  +      L  +D+    Q  ET    + QE      +   Q L+T    
Sbjct: 339 ETPQEKEIQETQDETPQELEVQDEKL--QENETPKDENMQESAQNLQEKETQELETPQTQ 396

Query: 783 GGEYENIEDIKD 818
              YENIEDI +
Sbjct: 397 EDHYENIEDIPE 408


>ref|WP_022371394.1| hypothetical protein [Firmicutes bacterium CAG:475]
           gi|524658354|emb|CDD68903.1| unknown [Firmicutes
           bacterium CAG:475]
          Length = 594

 Score = 60.5 bits (145), Expect = 9e-07
 Identities = 55/264 (20%), Positives = 116/264 (43%), Gaps = 3/264 (1%)
 Frame = +3

Query: 81  EDSGNKEKPSHEENYTLK-NETKLQEDQTDESDEVTDGLKDGDKVDAFGKQAPVEKKGEN 257
           E SG +E      +  L   E + +E   +ESD+    +++GD+     + A  +   E 
Sbjct: 253 EQSGTEEDEQESIDEILNAQEEEAEEIAQEESDQTKTEIEEGDQ-----EVAEQQHIDEL 307

Query: 258 IKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQELETLGRHDEVIEQMK 437
           ++ ++E++ + +E   E++  E+    +L    +    A    +E+      +++ EQ +
Sbjct: 308 LQEQEEEVEQKAE--EEDEQTEQVLDEQLEQEVVEPQEAAEEQEEIAPETEEEQIDEQEE 365

Query: 438 QNAAEFSENKKATIEENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQSIEEEKEINTLQ 617
             A E SE  +   +E ++             E V++EP+ E     + +EE+ +  T+ 
Sbjct: 366 TTAVEESEQSEPAEQEEIEE-----------QEPVEEEPVQE----DEQVEEQAQEETVD 410

Query: 618 RDETEGKKSCIDLNLGPRDDTQTPQTA--ETTMGHDQEPLSATADYVEQSLKTDVATGGE 791
               E ++  I+       D +   TA  ET    D + +    +  E++++   A   E
Sbjct: 411 EQSEEAEEPQIEEADEDNSDVEEENTASEETEASEDVDDIE-DDEQTEEAVEESTADSAE 469

Query: 792 YENIEDIKDKSNEQESCNKEESTD 863
            EN+ED    S E++  ++EE ++
Sbjct: 470 EENVED----SAEEDVASEEEQSE 489


>gb|ACN43225.1| neurofilament medium protein [Pan troglodytes]
          Length = 499

 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 67/286 (23%), Positives = 124/286 (43%), Gaps = 5/286 (1%)
 Frame = +3

Query: 12  EGESFLESPSLAKFALQEICQASEDSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDG 191
           EG S  E  S  +   QE  +   ++  +E  + EE    K E K +E  T E       
Sbjct: 148 EGGSEKEGSSEKEEGEQEEGETEAEAEGEEAEAKEEK---KVEEKSEEVATKEELVADAK 204

Query: 192 LKDGDKVDAFGKQAPVEKKGENIKPED--EQLAKSSELFTEEKNVEEHARNELFSSTIPS 365
           ++  +K  +   ++PVE+KG++  P+   E+  KS      +  VEE  ++ +  S +  
Sbjct: 205 VEKPEKAKSPVPKSPVEEKGKSPVPKSPVEEKGKSP---VSKSPVEEKGKSPVSKSPVEE 261

Query: 366 ASAIPNLQELETLGRHDEVIEQMKQNAAEFSENKKATIEENLQRGNVMP---PRLLKSSE 536
            +  P  +      +    + + +Q   E  E K+A  EE +++    P   P   K+  
Sbjct: 262 KAKSPAPKSPVEEAKSKAEVGKGEQKEEEEKEVKEAPKEEKVEKKEEKPKDVPEKKKAES 321

Query: 537 AVDDEPILEMICVSQSIEEEKEINTLQRDETEGKKSCIDLNLGPRDDTQTPQTAETTMGH 716
            V +E + E++ +++S++   E  T    + EGK         P    +  + A    G 
Sbjct: 322 PVKEEAVAEVVTITKSVKVHLEKET----KEEGK---------PLQQEKEKEKAGGEGGS 368

Query: 717 DQEPLSATADYVEQSLKTDVATGGEYENIEDIKDKSNEQESCNKEE 854
           ++E     A   + S K D+A  GE E  E+++ ++ E+ S  +EE
Sbjct: 369 EEEGSDKGA---KGSRKEDIAVNGEVEGKEEVEQETKEKGSGREEE 411


>ref|YP_005791151.1| poly E-rich protein [Helicobacter pylori 2018]
           gi|446704703|ref|WP_000782049.1| hypothetical protein
           [Helicobacter pylori] gi|325995609|gb|ADZ51014.1| poly
           E-rich protein [Helicobacter pylori 2018]
          Length = 492

 Score = 59.7 bits (143), Expect = 2e-06
 Identities = 64/251 (25%), Positives = 113/251 (45%), Gaps = 4/251 (1%)
 Frame = +3

Query: 78  SEDSGNKEKPSHEENYTLKNETKLQEDQTD-ESDEVTDGLKDGDKVDAFGKQAPVEKKGE 254
           +E  G   K   +E    +   ++QE+  + +  EV +  +D +K      Q  VE    
Sbjct: 166 NEQEGETPKEEAQEEVKKEEVKEMQEEVKEKQKQEVAENPQDEEKPKDDETQGSVEP--- 222

Query: 255 NIKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQELETLGRHDEVIEQM 434
              P+DE+++K  EL T+E+   +    E+           PN+QELE +    E+++++
Sbjct: 223 ---PKDEEVSK--ELETQEQEPIKEETQEIKEEKQEKTQDSPNVQELEAM---QELVKEI 274

Query: 435 KQNAAEFSENKKATIE--ENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQSIEEEKEIN 608
           ++N+ +  ENKK T E  EN +    +  + L+  +  + + I E    +Q +E+E+   
Sbjct: 275 QENSND-QENKKETQETQENTETPQDIETQELEIPKEEETQEIAEK-TQAQGLEKEEIAE 332

Query: 609 TLQRDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHD-QEPLSATADYVEQSLKTDVATG 785
           T Q  E +  +      L  +D+    Q  ET    + QE      +   Q L+T     
Sbjct: 333 TPQEKEIQETQDETPQELEVQDEKL--QENETPKDENMQESAQNLQEKETQELETPQTQE 390

Query: 786 GEYENIEDIKD 818
             YENIEDI +
Sbjct: 391 DHYENIEDIPE 401


>ref|WP_000782080.1| hypothetical protein [Helicobacter pylori]
          Length = 515

 Score = 59.3 bits (142), Expect = 2e-06
 Identities = 63/275 (22%), Positives = 124/275 (45%), Gaps = 6/275 (2%)
 Frame = +3

Query: 12  EGESFLESPSLAKFALQEICQASEDSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDG 191
           EGE+ L+  +  +   +E+ +  E+   KEK    EN          +D+    D+ T G
Sbjct: 169 EGET-LKEETQEEVKKEEVKEMQEEIKEKEKQEVAEN---------PQDEEKPKDDETQG 218

Query: 192 LKDGDKVDAFGKQAPVEKKGENIKPE--DEQLAKSSELFTEEKNVEEHARNELFSSTIPS 365
             +  K +   K+   +++ E  K E  +++  K  E   E++ ++E  + E+       
Sbjct: 219 SVETPKDEEVSKELETQEQVETPKEEKQEQEPIKEQEPIKEQEPIKEQTQ-EIKEEKQEK 277

Query: 366 ASAIPNLQELETLGRHDEVIEQMKQNAAEFSENKKATIEE-NLQRGNVMPPRLLKSSEAV 542
               P+ QELE +    E+++++++N+ +  ENKK T E   + +   +   + K ++A 
Sbjct: 278 TQDSPSTQELEAM---QELVKEIQENSND-QENKKETQENAEIPQDKEIQEVVTKKTQAQ 333

Query: 543 DDEPILEMICVSQSIEEEKEINTLQRDETEGKKSCIDLNLGPRDDTQTPQ--TAETTMGH 716
           + E   E    S    +E + + L++ E       +++      +TQ  Q   AE T   
Sbjct: 334 ELEIPKEKTQESAEALQETQAHELEKQEIAETPQDVEVPQSQEKETQETQEVVAEKTQSQ 393

Query: 717 DQE-PLSATADYVEQSLKTDVATGGEYENIEDIKD 818
           ++E P   T +  +++ + D      YENIEDI +
Sbjct: 394 EKETPQEETQEAQDETPQED-----HYENIEDIPE 423


>ref|XP_001351647.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
            3D7] gi|23504575|emb|CAD51454.1| conserved Plasmodium
            protein, unknown function [Plasmodium falciparum 3D7]
          Length = 3134

 Score = 59.3 bits (142), Expect = 2e-06
 Identities = 54/266 (20%), Positives = 121/266 (45%), Gaps = 8/266 (3%)
 Frame = +3

Query: 93   NKEKPSHEEN---YTLKNETKLQEDQTDE-SDEVTDGLKDGDKVDAFGKQAPVEKKGENI 260
            NKEK S ++N     ++ + +L++++ +E  +EV + ++D ++ D   +    E++ E +
Sbjct: 2466 NKEKESKKKNGKKIFMRGDEELKDNEEEEVEEEVEEEIEDDEEEDEEIEDDEEEEEDEEV 2525

Query: 261  KPEDEQLAKSSELFTEEKNVEEHAR----NELFSSTIPSASAIPNLQELETLGRHDEVIE 428
            + +DE+  +  E+  EE+ VEE       +E           + + +E E +   DE +E
Sbjct: 2526 E-DDEEEEEDEEVDDEEEEVEESDEEVECDEEVDDDEEEEEEVDDEEEEEEVEESDEEVE 2584

Query: 429  QMKQNAAEFSENKKATIEENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQSIEEEKEIN 608
              ++   E  + ++   EE L+          +  E VDDE   E +   +  +EE E++
Sbjct: 2585 DDEEEEEELEDGEEE--EEELEDNE-------EGDEEVDDEEEEEEV---EESDEEVEVD 2632

Query: 609  TLQRDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHDQEPLSATADYVEQSLKTDVATGG 788
              + +E +G +       G  ++       E     ++E +  + + VE     +    G
Sbjct: 2633 EEEEEELDGDEE------GDEEEDNEEDDEEVGDEEEEEEVEESDEEVEDDEGEEEELEG 2686

Query: 789  EYENIEDIKDKSNEQESCNKEESTDK 866
            + E  E++++   E+E  + +E  ++
Sbjct: 2687 DEEEEEEVEESDEEEEELDSDEEDNE 2712


>ref|XP_001301665.1| Beige/BEACH domain containing protein [Trichomonas vaginalis G3]
            gi|121882874|gb|EAX88735.1| Beige/BEACH domain containing
            protein [Trichomonas vaginalis G3]
          Length = 3187

 Score = 58.9 bits (141), Expect = 3e-06
 Identities = 68/277 (24%), Positives = 123/277 (44%), Gaps = 14/277 (5%)
 Frame = +3

Query: 84   DSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDGLKDGDKVDAFGKQAPVEKKGENIK 263
            D GN EKP    +   K E K+  +  ++S E  D   + +K++    +  +EK  EN+ 
Sbjct: 1038 DQGNNEKPQENVDENEKLE-KIYNENIEKSQENVD---ENEKLEKIYNEN-IEKSQENVD 1092

Query: 264  PEDEQLAK--SSELFTEEKNVEEHARNE-LFSSTIPSASAIPNLQELETLGR-HDEVIEQ 431
             E+E+L K  +  +    +NV+E+ + E +++  I  +    N+ E E L + ++E IE+
Sbjct: 1093 -ENEKLEKIYNENIEKSRENVDENEKLEKIYNENIEKSRE--NVDENEKLEKIYNENIEK 1149

Query: 432  MKQNAAE-------FSEN---KKATIEENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQ 581
             ++N  E       ++EN    +  ++EN +   +    + KS E VD+   LE I    
Sbjct: 1150 SRENVDENEKLEKIYNENIEKSRENVDENEKLEKIYNENIEKSRENVDENEKLEKIYNEN 1209

Query: 582  SIEEEKEINTLQRDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHDQEPLSATADYVEQS 761
              +  +E + + +++T+ K    D   G           E +    +E  +   D + Q 
Sbjct: 1210 IEKSREEKSEILQEKTDKKLEKFD---GKEKRENDENNQEKSFEEKKEISNENQDEISQQ 1266

Query: 762  LKTDVATGGEYENIEDIKDKSNEQESCNKEESTDKIL 872
               +  T    EN E  KD   E+ S  K E +  IL
Sbjct: 1267 NSQEEIT----ENYEKTKDVDQEKSSEGKVEKSKAIL 1299


>ref|XP_001324167.1| hypothetical protein [Trichomonas vaginalis G3]
            gi|121907045|gb|EAY11944.1| hypothetical protein
            TVAG_399490 [Trichomonas vaginalis G3]
          Length = 4873

 Score = 58.5 bits (140), Expect = 4e-06
 Identities = 64/273 (23%), Positives = 118/273 (43%), Gaps = 4/273 (1%)
 Frame = +3

Query: 60   QEICQASEDSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDGLKDGDKVDAFGKQAPV 239
            ++I Q+ E+  N++KP  E+N    N+   +E++T E+ ++    +  D  D   ++   
Sbjct: 3444 EKIPQSMEEDTNEQKPDEEKNKVTLNK---EEEETPETKDLEVPEEKPDLQDT--EKKAE 3498

Query: 240  EKKGENIKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAIPNLQELETLGRHDE 419
            E+K E  K E+E+    SE   E  N EE  +       IP+    P+ +E+E    +DE
Sbjct: 3499 EEKQELNKEEEEKHKPVSEDKKEIPNEEEEKKE------IPTEQEQPHKEEVEI---NDE 3549

Query: 420  VIEQMKQNAAEFSENKKATIEENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQSIEEEK 599
               ++         N+K   EE  +  NV P +   + E V ++  ++     Q IEE+K
Sbjct: 3550 KSTEIPVITNVEQPNEKHQEEEEKKEDNVQPNQEEANEENVKEKDFVQPQQNVQPIEEKK 3609

Query: 600  EINTLQRDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHDQEPLSATADYVE----QSLK 767
            E    + +  +  +      +  ++ T  P  +      D+E  + T +  E    +   
Sbjct: 3610 ETPIEKAEVPQEVQKDEVKEIQNQETTDIPPESTKQKDDDEEEKNETNEAEETNKDEEKP 3669

Query: 768  TDVATGGEYENIEDIKDKSNEQESCNKEESTDK 866
             D     E E  +D  ++  ++E   KEES  K
Sbjct: 3670 KDNLVKDEEEQKQDTTEEEEKKEETPKEESNQK 3702


>gb|ETW48435.1| hypothetical protein PFMALIP_03520 [Plasmodium falciparum
            MaliPS096_E11]
          Length = 1806

 Score = 58.2 bits (139), Expect = 5e-06
 Identities = 69/300 (23%), Positives = 134/300 (44%), Gaps = 13/300 (4%)
 Frame = +3

Query: 12   EGESFLESPSLAKFALQEICQASEDSGNKEKPSHEENYTLKNETKLQEDQTDESDEVTDG 191
            + E   E  S  +  ++E+   +++   +E    E    L+ E    E+  D+   VTD 
Sbjct: 300  DDEILPEELSATEDVIEEVRSVTDEIVQEESVCEE---ILEQEVSASEEYVDDKS-VTD- 354

Query: 192  LKDGDKVDAFGKQAPVEKKGENIKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSAS 371
                   D  G +  V +  EN +   E++A+  +   EE   ++ +  E         S
Sbjct: 355  -------DFVGHERSVIQDVENTESVTEEIAEVDKSVIEEAVEKQGSVTE--EKVQEGVS 405

Query: 372  AIPNLQELETLGRHDEVIEQMKQNAAEFSENKKATIEENLQRGNVMPPRLLKSSEAVDDE 551
            AI  ++E+E++    E IE+++    E +E  K+ IEE +++   +   +++  E +D E
Sbjct: 406  AIEEIEEIESV---TEEIEEIESVTEEIAEEDKSVIEEAVEKQGSVTEEIVEEEE-LDTE 461

Query: 552  PILEMICVSQSIEEEKEINTLQRDETEGKKSCI----DLNLGPRDDTQTP------QTAE 701
             +LE   V+  + E++      +DE+E K+S      +L     +D +T       +   
Sbjct: 462  EVLEDKSVTGDVVEQEGSG---KDESEAKESFTEEVDELKSVKEEDQETEYISREIEEES 518

Query: 702  TTMGHDQEPLSATADYVE-QSLKTDVATGGEYENIEDI--KDKSNEQESCNKEESTDKIL 872
             T  H ++ LS   + VE +SL  D+    E    ++I  + +S  +E   +E  TD++L
Sbjct: 519  ATEQHSEQELSINKEVVETESLTKDIEE--EKSTTQEILEETQSVNEEIVEEERDTDEVL 576


>ref|WP_000782036.1| hypothetical protein [Helicobacter pylori]
           gi|393109130|gb|EJC09662.1| poly E-rich protein
           [Helicobacter pylori Hp P-11]
           gi|393129445|gb|EJC29879.1| poly E-rich protein
           [Helicobacter pylori Hp P-11b]
          Length = 517

 Score = 58.2 bits (139), Expect = 5e-06
 Identities = 61/260 (23%), Positives = 110/260 (42%), Gaps = 13/260 (5%)
 Frame = +3

Query: 78  SEDSGNKEKPSHEENYTLKNETKLQED-QTDESDEVTDGLKDGDKVDAFGKQAPVEKKGE 254
           +E  G   K   +E    +   ++QE+ +  E  EV +  +D +K      Q  VE   +
Sbjct: 166 NEQEGETPKEEAQEEVKKEEVKEMQEEIKEKEKQEVAESPQDEEKPKDDETQGSVETPKD 225

Query: 255 NIKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAI---PNLQELETLGRHDEVI 425
             KP+D++   S E   EE   +E  + E   +           P+ QELE +    E++
Sbjct: 226 EEKPKDDETQGSVETPKEETQEQEPIKEETQENKEEKQEKTQDSPSAQELEAM---QELV 282

Query: 426 EQMKQNAAEFSENKKATIE--ENLQRGNVMPPRLLKSSEAVDDEPILEMICVSQSIEEEK 599
           +++++N+ +  ENKK T E  EN +    +  + L+  +  + + + E    +Q +E+E+
Sbjct: 283 KEIQENSND-QENKKETQETQENTEAPQDIETQELEIPKEEETQEVAEKT-QAQGLEKEE 340

Query: 600 EINTLQ-------RDETEGKKSCIDLNLGPRDDTQTPQTAETTMGHDQEPLSATADYVEQ 758
              T Q       +DET  +    D  L   +  +     E+     +     T D   Q
Sbjct: 341 IAETPQEKEIQETQDETPQELEVQDEKLQENETPKDENMQESAQNLQELETQETQDETLQ 400

Query: 759 SLKTDVATGGEYENIEDIKD 818
             +T       YE+IEDI +
Sbjct: 401 EKETPQTQEDHYESIEDIPE 420


>ref|XP_002046645.1| GJ12366 [Drosophila virilis] gi|194153803|gb|EDW68987.1| GJ12366
           [Drosophila virilis]
          Length = 354

 Score = 58.2 bits (139), Expect = 5e-06
 Identities = 68/297 (22%), Positives = 132/297 (44%), Gaps = 32/297 (10%)
 Frame = +3

Query: 60  QEICQASEDSGN--KEKPSHEENYTLKNETKLQ------------EDQTDESDEVTDGLK 197
           +E  + +E+SG+  KEK   E++   ++ETK +            ED+++E D+ T+   
Sbjct: 70  KETKETAENSGDEPKEKSKEEKSANAEDETKEKPKDGEENKNDKTEDKSEEVDKETE--V 127

Query: 198 DGDKVDAFGKQAPVEKKGENIKPEDEQLAKSSELFTEEKNVEEHARNELFSSTIPSASAI 377
            G++     +++  +   E  KPED    K      E++ VEE A  E    T  SA+A 
Sbjct: 128 KGEEQKETEEKSTADSTPEEKKPEDSSAEK------EKEKVEESAAAEEPKETSESATA- 180

Query: 378 PNLQELETLGRHDEVIEQMKQN-------AAEFSENKK----ATIEENLQRGNVMPPRLL 524
              +E+E   + +E +E+  +        AAE +E K     +T EE  Q        + 
Sbjct: 181 ---EEVEESEKEEEEVEESDKEKTEDRVKAAEATEEKSKTDVSTTEEQQQEVEEEKKEVE 237

Query: 525 KSSEAVDDEPILEMICVSQSIEEEKEINTLQRDETEGKKSCID-LNLGPRDDTQTPQTAE 701
           +S ++ DD+  +        +EEE +  T ++ E E K+   + +    ++ T+     E
Sbjct: 238 ESQDSSDDKVEVSKDNADDKVEEESKEKTEEKVEEESKEKTEEKVQETSKEKTEEKVEEE 297

Query: 702 TTMGHDQEPLSATADYVEQSLKTDVATGGEYENIEDIKD------KSNEQESCNKEE 854
           +    +++ +    + V+    T  A   + + +E+ K+      ++ E+   NKEE
Sbjct: 298 SKEKTEEDKVEEETEEVKVEETTKDAEKTKEDKVEESKEGKVEESENPEETDSNKEE 354


Top