BLASTX nr result

ID: Magnolia22_contig00021523 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Magnolia22_contig00021523
         (1403 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_845344.1 hypothetical protein [Trypanosoma brucei brucei TREU...    73   6e-10
SFF68027.1 hypothetical protein SAMN05421678_101394 [Actinopolym...    69   6e-09
WP_012861764.1 translation initiation factor IF-2 [Sebaldella te...    69   1e-08
EMG49704.1 hypothetical protein G210_5463 [Candida maltosa Xu316]      67   1e-08
XP_017092433.1 PREDICTED: collagen alpha-1(IV) chain isoform X2 ...    69   2e-08
XP_017092432.1 PREDICTED: collagen alpha-2(IV) chain isoform X1 ...    69   2e-08
XP_011774008.1 hypothetical protein, conserved [Trypanosoma bruc...    67   3e-08
XP_015915535.1 PREDICTED: glycine-rich cell wall structural prot...    67   3e-08
XP_016966112.1 PREDICTED: collagen alpha-2(IV) chain isoform X4 ...    67   3e-08
XP_016966110.1 PREDICTED: uncharacterized PE-PGRS family protein...    67   4e-08
XP_016966109.1 PREDICTED: uncharacterized PE-PGRS family protein...    67   4e-08
SEG80453.1 Uncharacterized membrane protein YckC, RDD family [No...    66   6e-08
WP_073279165.1 translation initiation factor IF-2 [Anaerocolumna...    66   8e-08
CDM34005.1 unnamed protein product [Penicillium roqueforti FM164]      65   9e-08
KJL27477.1 hypothetical protein RS83_02523 [Microbacterium oxydans]    65   1e-07
WP_036777757.1 hypothetical protein [Pontibacter actiniarum]           65   1e-07
DAA23419.1 TPA: predicted protein-like [Bos taurus]                    65   1e-07
XP_005058511.1 PREDICTED: putative per-hexamer repeat protein 5 ...    65   2e-07
XP_015329689.1 PREDICTED: uncharacterized protein DDB_G0290685-l...    64   2e-07
CAB09569.2 hypothetical protein [Trypanosoma brucei brucei]            64   3e-07

>XP_845344.1 hypothetical protein [Trypanosoma brucei brucei TREU927] AAX80856.1
            hypothetical protein, conserved [Trypanosoma brucei]
            AAZ11785.1 hypothetical protein, conserved [Trypanosoma
            brucei brucei TREU927]
          Length = 775

 Score = 72.8 bits (177), Expect = 6e-10
 Identities = 75/210 (35%), Positives = 96/210 (45%), Gaps = 11/210 (5%)
 Frame = +3

Query: 558  GGQNIVGGQREYDNRRS-GEYGRRDITPGRDTLGTSALPGQTGHGLNTVGNQREYDSRRS 734
            G +   G QR YDNR   GE+G R               GQ G      G+QR YD+R  
Sbjct: 15   GNEGSRGDQRNYDNRGGRGEWGDRG--------------GQRGDNQRDYGDQRNYDNRGG 60

Query: 735  -GEYGQGAVPGQTGYGQNVVGNQRDYDNRRS-GEYGRRDITPG---RDTTLGRDRDITPG 899
             GE+G     GQ G  Q   G+QR+YDNR   GE+G R    G   RD    R+ D   G
Sbjct: 61   RGEWGDRG--GQRGDNQRDYGDQRNYDNRGGRGEWGDRGGQRGDNQRDYGDQRNYDNRGG 118

Query: 900  RVR--DITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGRDRDITPGRDGYTRNTIAN 1073
            R    D    R  N+    + R+YDN   G+ ++G       DR    G+ G  +    +
Sbjct: 119  RGEWGDRGGQRGDNQRDYGDQRNYDNRG-GRGEWG-------DRG---GQRGDNQRDYGD 167

Query: 1074 QRDYDNT-GRGQTGYNTNQ-GYND-NFDNR 1154
            QR+YDN  GRG+ G    Q G N  N+DNR
Sbjct: 168  QRNYDNRGGRGEWGDRGGQRGDNQRNYDNR 197



 Score = 68.9 bits (167), Expect = 1e-08
 Identities = 81/251 (32%), Positives = 104/251 (41%), Gaps = 10/251 (3%)
 Frame = +3

Query: 477  NQSAPFGQSGYSQNIAAGQPG-YGQNVAGGQNIVGGQREYDNRRS-GEYGRRDITPGRDT 650
            N+ +   Q  Y      G+ G  G      Q   G QR YDNR   GE+G R        
Sbjct: 16   NEGSRGDQRNYDNRGGRGEWGDRGGQRGDNQRDYGDQRNYDNRGGRGEWGDRG------- 68

Query: 651  LGTSALPGQTGHGLNTVGNQREYDSRRS-GEYGQGAVPGQTGYGQNVVGNQRDYDNRRS- 824
                   GQ G      G+QR YD+R   GE+G     GQ G  Q   G+QR+YDNR   
Sbjct: 69   -------GQRGDNQRDYGDQRNYDNRGGRGEWGDRG--GQRGDNQRDYGDQRNYDNRGGR 119

Query: 825  GEYGRRDITPG---RDTTLGRDRDITPGRVR--DITPGRTSNRNTITNSRDYDNTNLGQT 989
            GE+G R    G   RD    R+ D   GR    D    R  N+    + R+YDN   G+ 
Sbjct: 120  GEWGDRGGQRGDNQRDYGDQRNYDNRGGRGEWGDRGGQRGDNQRDYGDQRNYDNRG-GRG 178

Query: 990  DYGRDNTLGRDRDITPGRDGYTRNTIANQRDYDNT-GRGQTGYNTNQGYNDNFDNRDPSY 1166
            ++G       DR    G+ G       NQR+YDN  GRG+ G    Q   +  DN +   
Sbjct: 179  EWG-------DRG---GQRG------DNQRNYDNRGGRGEWGDRGGQRGGNQRDNSNQFG 222

Query: 1167 NRTPSAGRVRR 1199
             R    G + R
Sbjct: 223  YRNERDGGIGR 233


>SFF68027.1 hypothetical protein SAMN05421678_101394 [Actinopolymorpha
            cephalotaxi]
          Length = 544

 Score = 69.3 bits (168), Expect = 6e-09
 Identities = 79/272 (29%), Positives = 102/272 (37%), Gaps = 23/272 (8%)
 Frame = +3

Query: 348  YAQEDGRGNTHVPGFNQP---MSQHTPYHEITTGLDRSREEENIIRNQSAPFGQ---SGY 509
            YAQ+   G     G++Q      Q+        G D+    +     Q A  GQ    GY
Sbjct: 203  YAQQSYAGGYDQSGYDQQGYDQGQYAQQGYGHQGYDQGHYAQQYDEGQYAQQGQYDQQGY 262

Query: 510  SQNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTLGTSALPGQTGHG 689
             Q     Q GY Q   G       Q+ YD    G+YG+    PG D  G     GQ GH 
Sbjct: 263  DQGHYGQQQGYDQGQYGQGY---AQQGYDQ---GQYGQ----PGYDQYGQQGQHGQPGHD 312

Query: 690  LNTVGNQRE---YDSRRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRR-------SGEYGR 839
                  Q E   Y     G+Y Q A  GQ GYGQ+    Q  YD  R       S +YGR
Sbjct: 313  QGQYAQQYEQQQYGRPDQGQYAQPAQYGQQGYGQDQYAQQGQYDQGRQEYGQGDSAQYGR 372

Query: 840  RDITPGRDTTLGRDRDITPGRVRDITP-----GRTSNRNTITNSRDYDNTNLGQTDYG-R 1001
             D     D   G       G+     P     GR S  ++  +   +  TN  + +YG R
Sbjct: 373  ND----HDQRRGFAESGEAGQHGSYAPFGQDSGRDSGPDSGHDLGQHSGTNQSRDEYGDR 428

Query: 1002 DNTLGRDRDITPGRDGY-TRNTIANQRDYDNT 1094
            D     D+    GR  Y   +  A +RD D+T
Sbjct: 429  DARDAHDQQPEQGRPDYDPPSGRAARRDPDST 460


>WP_012861764.1 translation initiation factor IF-2 [Sebaldella termitidis] ACZ09170.1
            translation initiation factor IF-2 [Sebaldella termitidis
            ATCC 33386]
          Length = 1116

 Score = 68.9 bits (167), Expect = 1e-08
 Identities = 71/262 (27%), Positives = 102/262 (38%), Gaps = 14/262 (5%)
 Frame = +3

Query: 408  QHTPYHEITTGLDRSREEENIIRN--------QSAPFGQSGYSQNIAAGQPGYGQNVAGG 563
            Q+  Y++     +R  +  N  +N        Q+  + Q+  +QN       Y QN    
Sbjct: 211  QNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQN-RDN 269

Query: 564  QNIVGGQREYDNRRSGEYG---RRDITPGRDTL---GTSALPGQTGHGLNTVGNQREYDS 725
            QN  G  R Y+  R  +      R+    RD     G +   GQ     N  G  R Y+ 
Sbjct: 270  QNRDGQNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYGQNRDNQNRDGQNRNYNQ 329

Query: 726  RRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRSGEYGRRDITPGRDTTLGRDRDITPGRV 905
             R G+           YGQN     RD  NR  G+        G++   G++RD    + 
Sbjct: 330  NRDGQ--------NRNYGQNRDNQNRDGQNRNYGQNRDNQNRDGQNRNYGQNRD---NQN 378

Query: 906  RDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGRDRDITPGRDGYTRNTIANQRDY 1085
            RD    R   +N    +RD  N N  Q   G++ + G++RD    RDG  RN   N+   
Sbjct: 379  RD-GQNRNYGQNRDNQNRDGQNRNYNQNRDGQNRSYGQNRD-NQNRDGQNRNYGQNR--- 433

Query: 1086 DNTGRGQTGYNTNQGYNDNFDN 1151
            DN  R   G N +  Y  N DN
Sbjct: 434  DNQNR--DGQNRSNDYRGNKDN 453



 Score = 64.7 bits (156), Expect = 2e-07
 Identities = 70/256 (27%), Positives = 97/256 (37%), Gaps = 21/256 (8%)
 Frame = +3

Query: 453  REEENIIRN----QSAPFGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYG 620
            R++EN +      Q+  + Q+  +QN       Y QN    QN  G  R Y+  R  +  
Sbjct: 134  RDDENKLNQNRDGQNRNYNQNRDNQNRDGQNRNYNQN-RDNQNRDGQNRNYNQNRDNQNR 192

Query: 621  ---RRDITPGRDTL---GTSALPGQTGHGLNTVGNQREYDSRRSGEYGQGAVPGQTGYGQ 782
                R+    RD     G +    Q     N  G  R Y+  R  +   G       Y Q
Sbjct: 193  DGQNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQ---NRNYNQ 249

Query: 783  NVVGNQRDYDNRRSGEYGRRDITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNSRD 962
            N     RD  NR   +        G++    ++RD    + RD    R  N+N    +RD
Sbjct: 250  NRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQNRD---NQNRD-GQNRNYNQNRDNQNRD 305

Query: 963  YDNTNLGQTDYGRDNTLGRDRDITPGRDGYTRNTIANQ----RDYDNTGRGQT------- 1109
              N N GQ +    N  G++R+    RDG  RN   N+    RD  N   GQ        
Sbjct: 306  GQNRNYGQ-NRDNQNRDGQNRNYNQNRDGQNRNYGQNRDNQNRDGQNRNYGQNRDNQNRD 364

Query: 1110 GYNTNQGYNDNFDNRD 1157
            G N N G N +  NRD
Sbjct: 365  GQNRNYGQNRDNQNRD 380



 Score = 64.3 bits (155), Expect = 3e-07
 Identities = 71/278 (25%), Positives = 105/278 (37%), Gaps = 23/278 (8%)
 Frame = +3

Query: 408  QHTPYHEITTGLDRSREEENIIRN--------QSAPFGQSGYSQNIAAGQPGYGQNVAGG 563
            Q+  Y++     +R  +  N  +N        Q+  + Q+  +QN       Y QN    
Sbjct: 147  QNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQN-RDN 205

Query: 564  QNIVGGQREYDNRRSGEYG---RRDITPGRDTL---GTSALPGQTGHGLNTVGNQREYDS 725
            QN  G  R Y+  R  +      R+    RD     G +    Q     N  G  R Y+ 
Sbjct: 206  QNRDGQNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQ 265

Query: 726  RRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRSGEYGRRDITPGRDTTLGRDRDITPGRV 905
             R  +   G       Y QN     RD  NR   +        G++   G++RD    + 
Sbjct: 266  NRDNQNRDGQ---NRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYGQNRD---NQN 319

Query: 906  RDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGRDRDITPGRDGYTRNTIANQ--- 1076
            RD    R  N+N    +R+Y      Q   G++   G++RD    RDG  RN   N+   
Sbjct: 320  RD-GQNRNYNQNRDGQNRNYGQNRDNQNRDGQNRNYGQNRD-NQNRDGQNRNYGQNRDNQ 377

Query: 1077 -RDYDNTGRGQTGYNTNQ-----GYNDNFDNRDPSYNR 1172
             RD  N   GQ   N N+      YN N D ++ SY +
Sbjct: 378  NRDGQNRNYGQNRDNQNRDGQNRNYNQNRDGQNRSYGQ 415



 Score = 64.3 bits (155), Expect = 3e-07
 Identities = 69/271 (25%), Positives = 107/271 (39%), Gaps = 21/271 (7%)
 Frame = +3

Query: 408  QHTPYHEITTGLDRSREEENIIRN--------QSAPFGQSGYSQNIAAGQPGYGQNVAGG 563
            Q+  Y++     +R  +  N  +N        Q+  + Q+  +QN       Y QN    
Sbjct: 179  QNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQN-RDN 237

Query: 564  QNIVGGQREYDNRRSGEYG---RRDITPGRDTL---GTSALPGQTGHGLNTVGNQREYDS 725
            QN  G  R Y+  R  +      R+    RD     G +    Q     N  G  R Y+ 
Sbjct: 238  QNRDGQNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQNRDNQNRDGQNRNYNQ 297

Query: 726  RRSGEY--GQGAVPGQTGYGQNVVGNQRDYDNRRSGE---YGRRDITPGRDTTLGRDRDI 890
             R  +   GQ    GQ    QN  G  R+Y+  R G+   YG+      RD   G++R+ 
Sbjct: 298  NRDNQNRDGQNRNYGQNRDNQNRDGQNRNYNQNRDGQNRNYGQNRDNQNRD---GQNRNY 354

Query: 891  TPGRVRDITPGRTSN--RNTITNSRDYDNTNLGQTDYGRDNTLGRDRDITPGRDGYTRNT 1064
               R      G+  N  +N    +RD  N N GQ +    N  G++R+    RDG  R+ 
Sbjct: 355  GQNRDNQNRDGQNRNYGQNRDNQNRDGQNRNYGQ-NRDNQNRDGQNRNYNQNRDGQNRSY 413

Query: 1065 IANQRDYDNTGRGQTGYNTNQGYNDNFDNRD 1157
              N+ + +  G+       N+ Y  N DN++
Sbjct: 414  GQNRDNQNRDGQ-------NRNYGQNRDNQN 437


>EMG49704.1 hypothetical protein G210_5463 [Candida maltosa Xu316]
          Length = 348

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 79/301 (26%), Positives = 117/301 (38%), Gaps = 32/301 (10%)
 Frame = +3

Query: 381  VPGFNQPMSQHTPYHEITTGLDRSREEENIIRN----QSAPFGQSGYS----QNIAAGQP 536
            V   N    ++    E  TG D ++E+    +     QS    ++GY     Q+  +G  
Sbjct: 15   VDNINTQAGKYAGGKEGQTGTDLAQEKYKSYQAGKAAQSGKESETGYGSTTGQSGQSGTT 74

Query: 537  GYGQNVAGGQNIVGGQREYDNRRSGE-YGRRDITPGRDTLGTSALPGQTG-------HGL 692
            GYG    G      GQ  Y +  SG+ YG    + G D  G+S   GQ G       +G 
Sbjct: 75   GYGSGSTGQHGSTTGQTGYGSSTSGDQYGS---STGGDQYGSSTTGGQYGSSTTGGQYGS 131

Query: 693  NTVGNQREYDSRRSGEYGQG-----------AVPGQTGYGQNVVGNQRDYDNRRSGEYGR 839
            +T G Q    + +SG+YG             A  GQTGYG +  G+Q       SG+ G 
Sbjct: 132  STTGGQYGSSTGQSGQYGSSTGQSGYSSSTTAPTGQTGYGSSTTGSQ-------SGQTGY 184

Query: 840  RDITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGR 1019
               T G+ T  G     T G+      G +SN +   +     +     + YG D +   
Sbjct: 185  GSSTTGQ-TGYGSS---TTGQTGYGESGHSSNHHGSHHHGSNHHGTSSTSGYGNDKSTTG 240

Query: 1020 DRDITPGRDGYTRNTIANQRDYDNTGRGQTGYNTNQ--GY---NDNFDNRDPSYNRTPSA 1184
                  G  GY  NT        +T    TG+NT+   GY   + N+     + N+  S+
Sbjct: 241  TTGGAYGSSGYDNNTTTGSHGTSST----TGHNTSSSTGYGADSSNYGQGSTTGNQHSSS 296

Query: 1185 G 1187
            G
Sbjct: 297  G 297


>XP_017092433.1 PREDICTED: collagen alpha-1(IV) chain isoform X2 [Drosophila
            bipectinata]
          Length = 1472

 Score = 68.6 bits (166), Expect = 2e-08
 Identities = 80/296 (27%), Positives = 100/296 (33%), Gaps = 11/296 (3%)
 Frame = +3

Query: 336  GVPGYAQEDGRGNTHVPGFNQPMSQHTPYHEITTGLDRSREEENIIRNQSAPFGQSGYSQ 515
            G PGY Q  G+  T  PG+        P +   TG                  GQ GY  
Sbjct: 506  GQPGYGQTGGQPGTGQPGYGGQTGTGQPGYGGQTGGQPGTGTGQPGFGGQTGTGQPGYGG 565

Query: 516  NIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTLGTSALPGQTGHGLN 695
                GQPGYG     GQ+  GGQ        G+ G +    G+   G     GQTG G  
Sbjct: 566  QTGTGQPGYGGQTGTGQHGFGGQTGTGQPGYGQTGGQSGYGGQTGTGQPGYGGQTGTGQP 625

Query: 696  TVGNQREYDSRRSGEYGQ-------GAVPGQTGY-GQNVVGNQRDYDNRRSGEYGRRDIT 851
              G Q    + + G  GQ       GA  GQ GY GQ  +G         +G YG +  T
Sbjct: 626  GFGGQT--GTGQPGFGGQPGFGGQTGAGSGQPGYGGQPGIGQPSFGGQPGTGSYGGQTGT 683

Query: 852  --PGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGRDR 1025
              P      G  + +  G+     PG      T      +     GQT  G     G   
Sbjct: 684  GQPVYGGQTGTGQPVFGGQTGTGQPGFGGQTGTGRGQPGFG----GQTGAGSGQP-GYGG 738

Query: 1026 DITPGRDGYTRNTIANQRDYDN-TGRGQTGYNTNQGYNDNFDNRDPSYNRTPSAGR 1190
                G+ GY   T   Q  +    G GQ GY    G      +  P Y     AG+
Sbjct: 739  QPGTGQPGYEGQTGTGQPGFGGPIGTGQPGYGGQTGGQPGTGSGQPVYGGQSGAGQ 794



 Score = 64.3 bits (155), Expect = 3e-07
 Identities = 81/303 (26%), Positives = 98/303 (32%), Gaps = 18/303 (5%)
 Frame = +3

Query: 336  GVPGYAQEDGRGNTHVPGFNQPMSQHTPYHEITTGLDRSREEENIIRNQ-----SAPFGQ 500
            G PGY  + G G    PGF        P +   TG  +          Q      A  GQ
Sbjct: 333  GQPGYGGQTGTGEGQ-PGFGGQPGTGQPGYGGQTGAGQPGYGGQTGSGQPGYGGQAGTGQ 391

Query: 501  SGYSQNIAAGQPGYG------QNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTLGTS 662
             GY      GQPGYG      Q   GGQ   G         +G+ G    T G+   G  
Sbjct: 392  PGYGGQTGTGQPGYGGQTGTGQPGYGGQTGTGQPGFGGQTGTGQPGYGGQTGGQPGTGQP 451

Query: 663  ALPGQTGHGLNTVGNQREYDSRRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRSGEYGRR 842
               GQTG G    G Q       +G+ G G   GQ G GQ   G Q        G+ G  
Sbjct: 452  GYGGQTGAGQPGYGGQ-----TGTGQPGYGQTGGQPGTGQPGYGGQTGTGQPGYGQTGTG 506

Query: 843  DITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGR- 1019
               PG   T G+     PG       G+         +     T  GQ  +G     G+ 
Sbjct: 507  Q--PGYGQTGGQPGTGQPGYGGQTGTGQPGYGG---QTGGQPGTGTGQPGFGGQTGTGQP 561

Query: 1020 --DRDITPGRDGYTRNTIANQRDY-DNTGRGQTGYNT---NQGYNDNFDNRDPSYNRTPS 1181
                    G+ GY   T   Q  +   TG GQ GY       GY        P Y     
Sbjct: 562  GYGGQTGTGQPGYGGQTGTGQHGFGGQTGTGQPGYGQTGGQSGYGGQTGTGQPGYGGQTG 621

Query: 1182 AGR 1190
             G+
Sbjct: 622  TGQ 624


>XP_017092432.1 PREDICTED: collagen alpha-2(IV) chain isoform X1 [Drosophila
            bipectinata]
          Length = 1473

 Score = 68.6 bits (166), Expect = 2e-08
 Identities = 80/296 (27%), Positives = 100/296 (33%), Gaps = 11/296 (3%)
 Frame = +3

Query: 336  GVPGYAQEDGRGNTHVPGFNQPMSQHTPYHEITTGLDRSREEENIIRNQSAPFGQSGYSQ 515
            G PGY Q  G+  T  PG+        P +   TG                  GQ GY  
Sbjct: 507  GQPGYGQTGGQPGTGQPGYGGQTGTGQPGYGGQTGGQPGTGTGQPGFGGQTGTGQPGYGG 566

Query: 516  NIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTLGTSALPGQTGHGLN 695
                GQPGYG     GQ+  GGQ        G+ G +    G+   G     GQTG G  
Sbjct: 567  QTGTGQPGYGGQTGTGQHGFGGQTGTGQPGYGQTGGQSGYGGQTGTGQPGYGGQTGTGQP 626

Query: 696  TVGNQREYDSRRSGEYGQ-------GAVPGQTGY-GQNVVGNQRDYDNRRSGEYGRRDIT 851
              G Q    + + G  GQ       GA  GQ GY GQ  +G         +G YG +  T
Sbjct: 627  GFGGQT--GTGQPGFGGQPGFGGQTGAGSGQPGYGGQPGIGQPSFGGQPGTGSYGGQTGT 684

Query: 852  --PGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGRDR 1025
              P      G  + +  G+     PG      T      +     GQT  G     G   
Sbjct: 685  GQPVYGGQTGTGQPVFGGQTGTGQPGFGGQTGTGRGQPGFG----GQTGAGSGQP-GYGG 739

Query: 1026 DITPGRDGYTRNTIANQRDYDN-TGRGQTGYNTNQGYNDNFDNRDPSYNRTPSAGR 1190
                G+ GY   T   Q  +    G GQ GY    G      +  P Y     AG+
Sbjct: 740  QPGTGQPGYEGQTGTGQPGFGGPIGTGQPGYGGQTGGQPGTGSGQPVYGGQSGAGQ 795



 Score = 64.3 bits (155), Expect = 3e-07
 Identities = 81/303 (26%), Positives = 98/303 (32%), Gaps = 18/303 (5%)
 Frame = +3

Query: 336  GVPGYAQEDGRGNTHVPGFNQPMSQHTPYHEITTGLDRSREEENIIRNQ-----SAPFGQ 500
            G PGY  + G G    PGF        P +   TG  +          Q      A  GQ
Sbjct: 334  GQPGYGGQTGTGEGQ-PGFGGQPGTGQPGYGGQTGAGQPGYGGQTGSGQPGYGGQAGTGQ 392

Query: 501  SGYSQNIAAGQPGYG------QNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTLGTS 662
             GY      GQPGYG      Q   GGQ   G         +G+ G    T G+   G  
Sbjct: 393  PGYGGQTGTGQPGYGGQTGTGQPGYGGQTGTGQPGFGGQTGTGQPGYGGQTGGQPGTGQP 452

Query: 663  ALPGQTGHGLNTVGNQREYDSRRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRSGEYGRR 842
               GQTG G    G Q       +G+ G G   GQ G GQ   G Q        G+ G  
Sbjct: 453  GYGGQTGAGQPGYGGQ-----TGTGQPGYGQTGGQPGTGQPGYGGQTGTGQPGYGQTGTG 507

Query: 843  DITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGR- 1019
               PG   T G+     PG       G+         +     T  GQ  +G     G+ 
Sbjct: 508  Q--PGYGQTGGQPGTGQPGYGGQTGTGQPGYGG---QTGGQPGTGTGQPGFGGQTGTGQP 562

Query: 1020 --DRDITPGRDGYTRNTIANQRDY-DNTGRGQTGYNT---NQGYNDNFDNRDPSYNRTPS 1181
                    G+ GY   T   Q  +   TG GQ GY       GY        P Y     
Sbjct: 563  GYGGQTGTGQPGYGGQTGTGQHGFGGQTGTGQPGYGQTGGQSGYGGQTGTGQPGYGGQTG 622

Query: 1182 AGR 1190
             G+
Sbjct: 623  TGQ 625


>XP_011774008.1 hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972]
            CBH11723.1 hypothetical protein, conserved [Trypanosoma
            brucei gambiense DAL972]
          Length = 742

 Score = 67.4 bits (163), Expect = 3e-08
 Identities = 78/248 (31%), Positives = 105/248 (42%), Gaps = 13/248 (5%)
 Frame = +3

Query: 489  PFGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYDNRRS-GEYGRRDITPGRDTLGTSA 665
            P G  G+ + + +    + +   G +   G QR YDNR   GE+G R             
Sbjct: 11   PLGGHGHRRMVCSTICLFQRGQWGNEGSRGDQRNYDNRGGRGEWGDRG------------ 58

Query: 666  LPGQTGHGLNTVGNQREYDSRRS-GEYGQGAVPGQTGYGQNVVGNQRDYDNRRS-GEYGR 839
              GQ G      G+QR YD+R   GE+G     GQ G  Q   G+QR+YDNR   GE+G 
Sbjct: 59   --GQRGGNQRDYGDQRNYDNRGGRGEWGDRG--GQRGDNQRDYGDQRNYDNRGGRGEWGD 114

Query: 840  RDITPG---RDTTLGRDRDITPGRVR--DITPGRTSNRNTITNSRDYDNTNLGQTDYGRD 1004
            R    G   RD    R+ D   GR    D    R  N+    + R+YDN   G+ ++G  
Sbjct: 115  RGGQRGGNQRDYGDQRNYDNRGGRGEWGDRGGQRGDNQRDYGDQRNYDNRG-GRGEWG-- 171

Query: 1005 NTLGRDRDITPGRDGYTRNTIANQRDYDNTGRGQTGYNTNQGYNDNF---DNR--DPSYN 1169
                 DR    G+ G  +   +NQ  Y N   G  G    +   DNF   DNR  D   +
Sbjct: 172  -----DRG---GQRGGNQRDNSNQFGYRNERDG--GIGRQRSARDNFGAHDNRASDERGS 221

Query: 1170 RTPSAGRV 1193
              P+ G V
Sbjct: 222  TDPAVGEV 229


>XP_015915535.1 PREDICTED: glycine-rich cell wall structural protein 1.8-like
            [Parasteatoda tepidariorum]
          Length = 429

 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 65/223 (29%), Positives = 91/223 (40%), Gaps = 8/223 (3%)
 Frame = +3

Query: 492  FGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTL-GTSAL 668
            +GQSGY Q+    Q G+GQ   GGQ I GGQ  Y  +    YG +    G+  L G S  
Sbjct: 166  YGQSGYGQSGYLQQGGFGQGAYGGQGIYGGQSGYGGQSG--YGVQSGYGGQGILSGLSGY 223

Query: 669  PGQTGH-GLNTVGNQREYDSRRSGEYGQGAVPGQTGY-GQNVVGNQRDYDNR-----RSG 827
             GQ+G+ G   +G Q       SG  GQG + GQ+G+ GQ ++G Q  Y  +     +SG
Sbjct: 224  GGQSGYGGQGILGGQ-------SGYGGQGILVGQSGFGGQGILGGQSGYGGQGGYGGQSG 276

Query: 828  EYGRRDITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDN 1007
             YG   I+ G             G  R        +   + N + Y + N G     +  
Sbjct: 277  WYGGNQISSGGYNWPSGGGG--GGGYRGDKGSYGGSYGNLGNQQGYGSINFGSGYPSQGL 334

Query: 1008 TLGRDRDITPGRDGYTRNTIANQRDYDNTGRGQTGYNTNQGYN 1136
            + G    I     G    +   Q+ Y  +  GQ GY      N
Sbjct: 335  SGGLGNQIYGSGYG---GSYMGQKGYGGSSGGQKGYGNYNNIN 374


>XP_016966112.1 PREDICTED: collagen alpha-2(IV) chain isoform X4 [Drosophila
            biarmipes]
          Length = 1057

 Score = 67.4 bits (163), Expect = 3e-08
 Identities = 105/380 (27%), Positives = 131/380 (34%), Gaps = 10/380 (2%)
 Frame = +3

Query: 78   SGHDRPGHTPITGRGTNTTRPGE-------DVTYLARGDGTYLARDDGTHTVRNGPNDPI 236
            SG    G  P  G  T  T+PG        D T ++ G   Y ++         G   P 
Sbjct: 155  SGQAGYGGQPGVGGQTGATQPGYVGQPGVGDQTGISGGQPGYASQPGVGGQTGVGTGQPG 214

Query: 237  HXXXXXXXXXXXXXXXXXXXQPVVARDSSRGRRGVPGYAQEDGRGNTHVPGFNQPMSQHT 416
            +                   QP V   +  G+   PGYA + G G     G  QP     
Sbjct: 215  YGGQPGVGGQTGAGQPGYGSQPGVGGQTGGGQ---PGYAGQPGVGGQTGAGIGQPGYIGQ 271

Query: 417  PYHEITTGLDRSREEENIIRNQSAPFGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYD 596
            P     TG             Q     Q G+     AGQPGY     GGQ  VGGQ   +
Sbjct: 272  PGVGGQTG-----------AAQPGYGSQPGFGGQTEAGQPGY-----GGQPGVGGQTGAE 315

Query: 597  NRRSGEYGRRDITPGRDTLGTSALPGQTGHGLNTVGNQREYDSRRSGEYGQGAVPGQTGY 776
              + G YGR+   PG        + GQTG G    G Q    S ++G  GQ  V GQTG 
Sbjct: 316  TGQPG-YGRQ---PG--------VGGQTGVGQPGYGGQPGV-SGQAGYGGQPGVDGQTGA 362

Query: 777  GQNVVGNQRDYDNRRSGEYGRRDITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNS 956
            GQ   G Q      +SG  G+  +  G  T  G+     PG       G TS   T    
Sbjct: 363  GQPGYGGQPGVGG-QSGYGGQPGV--GGQTGAGQ-----PGYGGQPGVGGTSGAGT---- 410

Query: 957  RDYDNTNLGQTDYGRDNTLGRDRDITPGRDGY-TRNTIANQRDYDNTGRGQTGYNTNQGY 1133
                    GQ  YG    +G    I+ G+ GY ++  +  Q      G GQ GY    G 
Sbjct: 411  --------GQPGYGSQPGVGDQTGISGGQLGYGSQPGVGGQ-----AGAGQPGYGGQPGV 457

Query: 1134 --NDNFDNRDPSYNRTPSAG 1187
                      P Y   P  G
Sbjct: 458  GGQSGISGGQPGYGSQPGVG 477



 Score = 61.2 bits (147), Expect = 3e-06
 Identities = 83/294 (28%), Positives = 102/294 (34%), Gaps = 10/294 (3%)
 Frame = +3

Query: 336  GVPGYAQEDGRGNTHVPGFNQPMSQHTPYHEITTGLDRSREEENIIRNQSAPFGQSGY-S 512
            G PGY  + G G T   G  QP     P     TG+                 GQ GY S
Sbjct: 392  GQPGYGGQPGVGGTSGAGTGQPGYGSQPGVGDQTGISG---------------GQLGYGS 436

Query: 513  QNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTLGTS--ALPGQTGH 686
            Q    GQ G GQ   GGQ  VGGQ      + G YG +    G+  +GT      GQ G 
Sbjct: 437  QPGVGGQAGAGQPGYGGQPGVGGQSGISGGQPG-YGSQPGVGGQTGVGTGQPGYGGQPGV 495

Query: 687  GLNTVGNQREYDSRRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRS-GEYGRRDITPGRD 863
            G  T   Q+ Y   + G  GQ A  GQ G G      Q  Y  +   G+ G     PG  
Sbjct: 496  GGQTGVGQQGYGG-QPGVSGQAAYGGQPGVGGQTGAGQPGYGGQPGVGQSGISGGQPGYG 554

Query: 864  TTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGRDRDITPGR 1043
            +  G       G+       R   +   T          GQ  YG    +G    I+  +
Sbjct: 555  SQPGVGGQTGAGQPSYGGQPRVGGQTGAT---------AGQPGYGGQPGVGGQTGISGVQ 605

Query: 1044 DGY-TRNTIANQRDYDNTGRG---QTGYNTNQGY--NDNFDNRDPSYNRTPSAG 1187
             GY T+  +   +    T  G   Q GY T QG             Y   P AG
Sbjct: 606  PGYGTQPGVGGSQPGYGTQLGYGTQPGYGTQQGVGGQSGIGGSQQGYGTQPGAG 659


>XP_016966110.1 PREDICTED: uncharacterized PE-PGRS family protein PE_PGRS54 isoform
            X2 [Drosophila biarmipes]
          Length = 1501

 Score = 67.4 bits (163), Expect = 4e-08
 Identities = 105/380 (27%), Positives = 131/380 (34%), Gaps = 10/380 (2%)
 Frame = +3

Query: 78   SGHDRPGHTPITGRGTNTTRPGE-------DVTYLARGDGTYLARDDGTHTVRNGPNDPI 236
            SG    G  P  G  T  T+PG        D T ++ G   Y ++         G   P 
Sbjct: 155  SGQAGYGGQPGVGGQTGATQPGYVGQPGVGDQTGISGGQPGYASQPGVGGQTGVGTGQPG 214

Query: 237  HXXXXXXXXXXXXXXXXXXXQPVVARDSSRGRRGVPGYAQEDGRGNTHVPGFNQPMSQHT 416
            +                   QP V   +  G+   PGYA + G G     G  QP     
Sbjct: 215  YGGQPGVGGQTGAGQPGYGSQPGVGGQTGGGQ---PGYAGQPGVGGQTGAGIGQPGYIGQ 271

Query: 417  PYHEITTGLDRSREEENIIRNQSAPFGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYD 596
            P     TG             Q     Q G+     AGQPGY     GGQ  VGGQ   +
Sbjct: 272  PGVGGQTG-----------AAQPGYGSQPGFGGQTEAGQPGY-----GGQPGVGGQTGAE 315

Query: 597  NRRSGEYGRRDITPGRDTLGTSALPGQTGHGLNTVGNQREYDSRRSGEYGQGAVPGQTGY 776
              + G YGR+   PG        + GQTG G    G Q    S ++G  GQ  V GQTG 
Sbjct: 316  TGQPG-YGRQ---PG--------VGGQTGVGQPGYGGQPGV-SGQAGYGGQPGVDGQTGA 362

Query: 777  GQNVVGNQRDYDNRRSGEYGRRDITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNS 956
            GQ   G Q      +SG  G+  +  G  T  G+     PG       G TS   T    
Sbjct: 363  GQPGYGGQPGVGG-QSGYGGQPGV--GGQTGAGQ-----PGYGGQPGVGGTSGAGT---- 410

Query: 957  RDYDNTNLGQTDYGRDNTLGRDRDITPGRDGY-TRNTIANQRDYDNTGRGQTGYNTNQGY 1133
                    GQ  YG    +G    I+ G+ GY ++  +  Q      G GQ GY    G 
Sbjct: 411  --------GQPGYGSQPGVGDQTGISGGQLGYGSQPGVGGQ-----AGAGQPGYGGQPGV 457

Query: 1134 --NDNFDNRDPSYNRTPSAG 1187
                      P Y   P  G
Sbjct: 458  GGQSGISGGQPGYGSQPGVG 477



 Score = 61.2 bits (147), Expect = 3e-06
 Identities = 83/294 (28%), Positives = 102/294 (34%), Gaps = 10/294 (3%)
 Frame = +3

Query: 336  GVPGYAQEDGRGNTHVPGFNQPMSQHTPYHEITTGLDRSREEENIIRNQSAPFGQSGY-S 512
            G PGY  + G G T   G  QP     P     TG+                 GQ GY S
Sbjct: 392  GQPGYGGQPGVGGTSGAGTGQPGYGSQPGVGDQTGISG---------------GQLGYGS 436

Query: 513  QNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTLGTS--ALPGQTGH 686
            Q    GQ G GQ   GGQ  VGGQ      + G YG +    G+  +GT      GQ G 
Sbjct: 437  QPGVGGQAGAGQPGYGGQPGVGGQSGISGGQPG-YGSQPGVGGQTGVGTGQPGYGGQPGV 495

Query: 687  GLNTVGNQREYDSRRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRS-GEYGRRDITPGRD 863
            G  T   Q+ Y   + G  GQ A  GQ G G      Q  Y  +   G+ G     PG  
Sbjct: 496  GGQTGVGQQGYGG-QPGVSGQAAYGGQPGVGGQTGAGQPGYGGQPGVGQSGISGGQPGYG 554

Query: 864  TTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGRDRDITPGR 1043
            +  G       G+       R   +   T          GQ  YG    +G    I+  +
Sbjct: 555  SQPGVGGQTGAGQPSYGGQPRVGGQTGAT---------AGQPGYGGQPGVGGQTGISGVQ 605

Query: 1044 DGY-TRNTIANQRDYDNTGRG---QTGYNTNQGY--NDNFDNRDPSYNRTPSAG 1187
             GY T+  +   +    T  G   Q GY T QG             Y   P AG
Sbjct: 606  PGYGTQPGVGGSQPGYGTQLGYGTQPGYGTQQGVGGQSGIGGSQQGYGTQPGAG 659


>XP_016966109.1 PREDICTED: uncharacterized PE-PGRS family protein PE_PGRS54 isoform
            X1 [Drosophila biarmipes]
          Length = 1504

 Score = 67.4 bits (163), Expect = 4e-08
 Identities = 105/380 (27%), Positives = 131/380 (34%), Gaps = 10/380 (2%)
 Frame = +3

Query: 78   SGHDRPGHTPITGRGTNTTRPGE-------DVTYLARGDGTYLARDDGTHTVRNGPNDPI 236
            SG    G  P  G  T  T+PG        D T ++ G   Y ++         G   P 
Sbjct: 155  SGQAGYGGQPGVGGQTGATQPGYVGQPGVGDQTGISGGQPGYASQPGVGGQTGVGTGQPG 214

Query: 237  HXXXXXXXXXXXXXXXXXXXQPVVARDSSRGRRGVPGYAQEDGRGNTHVPGFNQPMSQHT 416
            +                   QP V   +  G+   PGYA + G G     G  QP     
Sbjct: 215  YGGQPGVGGQTGAGQPGYGSQPGVGGQTGGGQ---PGYAGQPGVGGQTGAGIGQPGYIGQ 271

Query: 417  PYHEITTGLDRSREEENIIRNQSAPFGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYD 596
            P     TG             Q     Q G+     AGQPGY     GGQ  VGGQ   +
Sbjct: 272  PGVGGQTG-----------AAQPGYGSQPGFGGQTEAGQPGY-----GGQPGVGGQTGAE 315

Query: 597  NRRSGEYGRRDITPGRDTLGTSALPGQTGHGLNTVGNQREYDSRRSGEYGQGAVPGQTGY 776
              + G YGR+   PG        + GQTG G    G Q    S ++G  GQ  V GQTG 
Sbjct: 316  TGQPG-YGRQ---PG--------VGGQTGVGQPGYGGQPGV-SGQAGYGGQPGVDGQTGA 362

Query: 777  GQNVVGNQRDYDNRRSGEYGRRDITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNS 956
            GQ   G Q      +SG  G+  +  G  T  G+     PG       G TS   T    
Sbjct: 363  GQPGYGGQPGVGG-QSGYGGQPGV--GGQTGAGQ-----PGYGGQPGVGGTSGAGT---- 410

Query: 957  RDYDNTNLGQTDYGRDNTLGRDRDITPGRDGY-TRNTIANQRDYDNTGRGQTGYNTNQGY 1133
                    GQ  YG    +G    I+ G+ GY ++  +  Q      G GQ GY    G 
Sbjct: 411  --------GQPGYGSQPGVGDQTGISGGQLGYGSQPGVGGQ-----AGAGQPGYGGQPGV 457

Query: 1134 --NDNFDNRDPSYNRTPSAG 1187
                      P Y   P  G
Sbjct: 458  GGQSGISGGQPGYGSQPGVG 477



 Score = 61.2 bits (147), Expect = 3e-06
 Identities = 83/294 (28%), Positives = 102/294 (34%), Gaps = 10/294 (3%)
 Frame = +3

Query: 336  GVPGYAQEDGRGNTHVPGFNQPMSQHTPYHEITTGLDRSREEENIIRNQSAPFGQSGY-S 512
            G PGY  + G G T   G  QP     P     TG+                 GQ GY S
Sbjct: 392  GQPGYGGQPGVGGTSGAGTGQPGYGSQPGVGDQTGISG---------------GQLGYGS 436

Query: 513  QNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTLGTS--ALPGQTGH 686
            Q    GQ G GQ   GGQ  VGGQ      + G YG +    G+  +GT      GQ G 
Sbjct: 437  QPGVGGQAGAGQPGYGGQPGVGGQSGISGGQPG-YGSQPGVGGQTGVGTGQPGYGGQPGV 495

Query: 687  GLNTVGNQREYDSRRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRS-GEYGRRDITPGRD 863
            G  T   Q+ Y   + G  GQ A  GQ G G      Q  Y  +   G+ G     PG  
Sbjct: 496  GGQTGVGQQGYGG-QPGVSGQAAYGGQPGVGGQTGAGQPGYGGQPGVGQSGISGGQPGYG 554

Query: 864  TTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGRDRDITPGR 1043
            +  G       G+       R   +   T          GQ  YG    +G    I+  +
Sbjct: 555  SQPGVGGQTGAGQPSYGGQPRVGGQTGAT---------AGQPGYGGQPGVGGQTGISGVQ 605

Query: 1044 DGY-TRNTIANQRDYDNTGRG---QTGYNTNQGY--NDNFDNRDPSYNRTPSAG 1187
             GY T+  +   +    T  G   Q GY T QG             Y   P AG
Sbjct: 606  PGYGTQPGVGGSQPGYGTQLGYGTQPGYGTQQGVGGQSGIGGSQQGYGTQPGAG 659


>SEG80453.1 Uncharacterized membrane protein YckC, RDD family [Nonomuraea solani]
          Length = 470

 Score = 65.9 bits (159), Expect = 6e-08
 Identities = 75/309 (24%), Positives = 113/309 (36%), Gaps = 26/309 (8%)
 Frame = +3

Query: 297  QPVVARDSS--RGRRGVPGYAQEDGRGNTHVP-------GF--NQPMSQHTPYHEITTGL 443
            QP   +D S      G P Y Q++   +   P       G+  +    Q  PY +   G 
Sbjct: 5    QPPYPQDESGENSAPGTPHYGQQNAGRHQGEPDPDVTVVGYRADDAYGQQPPYGQQGYGQ 64

Query: 444  DRSREEENIIRNQSAPFGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGE--Y 617
             +    +     Q   +GQ GY+Q     QP YGQ    GQ   G Q +Y  ++  +  Y
Sbjct: 65   QQQGYGQQGYGQQQQGYGQQGYNQQ----QPQYGQQPQSGQQGYGQQPQYGQQQQSQPDY 120

Query: 618  GRRDI---TPGRDTLGTS-ALPGQTGHGLNTVGNQREYDSRRSGEYGQGAVPGQTGYGQN 785
            G++        +   G      GQ G+G      Q++   ++ G YGQ    GQ GYGQ 
Sbjct: 121  GQQQYGQQPQSQPDYGQQYGAAGQQGYGQQPQSGQQQGYGQQQG-YGQQPQSGQQGYGQQ 179

Query: 786  V--VGNQRDYDNRRSGEYGRRDITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNSR 959
                G Q+ Y  ++  +YG++          G+ +   PG                    
Sbjct: 180  QPDYGQQQGYGQQQQPDYGQQQ-------GYGQQQQSQPG-------------------- 212

Query: 960  DYDNTNLGQTDYGRDNTLGRDRDITPGRDGYTRNT-----IANQRDYDNTGRG--QTGYN 1118
                   GQ DYG+    G+      G+ GY +          Q  YD   +G  Q GY 
Sbjct: 213  ------YGQQDYGQQQGYGQQPQ--SGQQGYGQQQPSQPGYGQQHGYDQQQQGYAQQGYG 264

Query: 1119 TNQGYNDNF 1145
              QGY+  +
Sbjct: 265  QQQGYDQQY 273


>WP_073279165.1 translation initiation factor IF-2 [Anaerocolumna jejuensis]
            SHL21163.1 translation initiation factor IF-2
            [Anaerocolumna jejuensis DSM 15929]
          Length = 1123

 Score = 66.2 bits (160), Expect = 8e-08
 Identities = 81/311 (26%), Positives = 116/311 (37%), Gaps = 34/311 (10%)
 Frame = +3

Query: 360  DGRGNTHVPG-FNQPMSQHTPYHEITTGLDRSREEENIIRNQSAPFGQSGYSQNIAAGQ- 533
            D  G T   G  NQ   Q+TPY +           EN  + Q    G  G  QN   G  
Sbjct: 217  DREGFTRAQGAHNQGQGQNTPYRQGNQNNQNRPYGENRSQGQYGQQGNGGRPQNDRGGSN 276

Query: 534  PGYGQNVAGGQNIVGGQREYDNRRSGEYGRRD---ITPGRDTLGTSALPGQTGHGLNTVG 704
             GY    AGG N   G   Y N   G+Y R +      G+     +   GQ  +  N  G
Sbjct: 277  QGYS---AGGANRPYGNNSYGNNNQGQYNRNNNGGTGQGQYNRNNNGGTGQGQYNRNNNG 333

Query: 705  --NQREYDSRRSGEYGQGAV-------PGQTGYGQNVVGNQRDYDNRR---SGE-YGRRD 845
               Q +Y+   +G  GQG          GQ  Y +N  G+     NR    SG+ Y R +
Sbjct: 334  GTGQGQYNRNNNGGTGQGQYNRNNNGGTGQGQYNRNSYGSTGQGYNRNNDGSGQGYNRNN 393

Query: 846  ITPGRDTTLGRDRDIT-------PGRVRDITP--GRTSNRNT--ITNSRDYD-----NTN 977
             +  R +  G+ R  +        G      P  G +SNRN    +++R YD     N +
Sbjct: 394  DSQNRPSGNGQGRPYSGGGRPAGNGNGNGYRPYNGNSSNRNNGQSSDNRGYDRFGGMNKD 453

Query: 978  LGQTDYGRDNTLGRDRDITPGRDGYTRNTIANQRDYDNTGRGQTGYNTNQGYNDNFDNRD 1157
               T+  R  + G   D T   DG    + ++     N+   +   NT   Y  +F N +
Sbjct: 454  KDDTNDNRRRSTGNRTDSTKKFDGPIGMSRSDSVSVRNSRVNKNAKNTKNSYEKDFHNEN 513

Query: 1158 PSYNRTPSAGR 1190
                + P  G+
Sbjct: 514  NDEIKLPKKGQ 524


>CDM34005.1 unnamed protein product [Penicillium roqueforti FM164]
          Length = 522

 Score = 65.5 bits (158), Expect = 9e-08
 Identities = 108/459 (23%), Positives = 151/459 (32%), Gaps = 90/459 (19%)
 Frame = +3

Query: 96   GHTPITGRGTNTTR----------PGEDVTYLARGDGTYLARDDGTHTVRNGPNDPIHXX 245
            GH   TG GT T            PG    Y +   G Y + + G+  +  GP+D     
Sbjct: 76   GHGSGTGTGTGTIHDSRKGPIDVGPGTSENYGSNTAGGYGSSNTGSSKINAGPHDS---- 131

Query: 246  XXXXXXXXXXXXXXXXXQPVVARDSSRGRRG-VPGYAQEDGRGNTHVPGFNQPMSQHTPY 422
                                    S  GR   V  Y   D  G+T      Q     +  
Sbjct: 132  KLANKADPRVDSDLDNRGSTNTFGSGTGRSSEVTDYGSNDKYGSTTGHSAKQTGYGSSNP 191

Query: 423  HEITTGLDRSREEENIIRNQSAPFGQSGY------SQNIAAGQPGYGQNVAGGQNIVGGQ 584
               TTG         +  N +   GQ+GY      S N   GQ GYG +     N   GQ
Sbjct: 192  FNTTTGQTGYGSSNPLSSNTTT--GQTGYGSSNPLSSNTTTGQTGYGSSNPLSSNTTTGQ 249

Query: 585  REY-------DNRRSGEYGRRDITPGRDTLGTSALPGQTGHGL------NTVGNQREYDS 725
              Y        N  +G+ G     P    L ++   GQTG+G       NT   Q  Y S
Sbjct: 250  TGYGSSNPLSSNTTTGQTGYGSSNP----LSSNTTTGQTGYGSSNPLSSNTTTGQTGYGS 305

Query: 726  RRSGEYGQGAVPGQTGYGQ------NVVGNQRDY-------DNRRSGEYGRRDITP-GRD 863
              S         GQTGYG       N    Q  Y        N  +G+ G     P   +
Sbjct: 306  --SNPLSSNTTTGQTGYGSSNPLSSNTTTGQTGYGSSNPLSSNTTTGQTGYGSSNPLSSN 363

Query: 864  TTLGR---------DRDITPGRV---------RDITPGRTS-------NRNTITNSRDY- 965
            TT G+           + T G+           + T G+T        + NT T    Y 
Sbjct: 364  TTTGQTGYGSSNPLSSNTTTGQTGYGSSNPLSSNTTTGQTGYGSSNPLSSNTTTGQTGYG 423

Query: 966  ------DNTNLGQTDYGRDNTLGRD---RDITPGRDGY-TRNTIANQRDYDNTGR----- 1100
                   NT  GQT YG  N L  +      T G+ G+ + N +++ ++Y  TG+     
Sbjct: 424  SSNPLSSNTTTGQTGYGSSNPLSSNDNYSSTTTGQTGHSSNNPLSSSQNYSTTGQSGGGN 483

Query: 1101 -----GQTGYNTNQGYNDNFDNRDPSYNRTPSAGRVRRF 1202
                  +TG  +    +D  +  DP  N   +    R++
Sbjct: 484  SYTETSKTGKTSGPHSSDLLNKLDPRVNADETVYSERKY 522


>KJL27477.1 hypothetical protein RS83_02523 [Microbacterium oxydans]
          Length = 600

 Score = 65.5 bits (158), Expect = 1e-07
 Identities = 65/207 (31%), Positives = 85/207 (41%), Gaps = 8/207 (3%)
 Frame = +3

Query: 576  GGQREYDNRRSGEYGR---RDITPGRDTLGTS--ALPGQTGHGLNTVGNQREYDSRRSGE 740
            G  R+   RR G Y R   RD  P R+  G S  + P + G G N   N+     R  G 
Sbjct: 45   GYSRDSAPRREGGYNRDNNRDSAPRREGGGYSRDSAPRREGGGYNRDNNRDSAPRREGGG 104

Query: 741  YGQGAVPGQTGYGQNVVGNQRDYDNRRSGEYGRRDITPGRDTTLGRDRDITPGRVRDITP 920
            Y + + P + G G N   N RD   RR G Y R D  P R+   G +RD  P R      
Sbjct: 105  YSRDSAPRREGGGYNR-DNNRDSAPRREGGYDR-DSAPRREG--GYNRDSAPRR-----E 155

Query: 921  GRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGRDRDITPGRDGYTRNTIANQRD--YDNT 1094
            G   NR         D+    +  Y RDN    +RD  P R+G      A +R+  Y+  
Sbjct: 156  GGGYNR---------DSAPRREGGYNRDNNRDNNRDSAPRREGGYNRDSAPRREGGYNRD 206

Query: 1095 GRGQTGYNTNQGYN-DNFDNRDPSYNR 1172
                +      GYN D+   R+  YNR
Sbjct: 207  NNRDSAPRREGGYNRDSAPRREGGYNR 233


>WP_036777757.1 hypothetical protein [Pontibacter actiniarum]
          Length = 451

 Score = 65.1 bits (157), Expect = 1e-07
 Identities = 75/315 (23%), Positives = 122/315 (38%), Gaps = 18/315 (5%)
 Frame = +3

Query: 306  VARDSSRGRRGVPGYAQEDGRGNTHVPGFNQPMSQHTPYHEITTGLDRSREEENIIRNQS 485
            + R S +G +G  GY  + G   +    ++Q       Y +   G     E+ N      
Sbjct: 164  MGRYSQQGGQG--GYGSQGGNYGSASQSYSQGNYGQGDYGQRGGGYQGGYEQGN------ 215

Query: 486  APFGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTLGTSA 665
              +GQ GYS     GQ G G  ++G     G  R   + R G YG RD      ++G  +
Sbjct: 216  --YGQGGYS-----GQQGQGYGMSGS----GSGR---SNRQGGYGDRD---RYGSMGYGS 258

Query: 666  LPGQTGHGLNTVGNQREYDSRRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRSGEYGRRD 845
              G + +  +  G+QR +DS+ S + G G+   Q GYG        +Y + RS  YG + 
Sbjct: 259  SRGMSEYDRDNRGSQRGWDSQSSMQGGYGS---QGGYGSRGGSYNDEYSSGRSARYGSQG 315

Query: 846  ITPGRDTTLGRDRDITPGRVRDITPGRTSNRNT---ITNSRDYDNTNLGQTDYGRDNTLG 1016
               G+D + G       GR      G   + N+   ++    Y + +  Q  YG  ++  
Sbjct: 316  SDYGQDYSAGSSYG-NSGRSSSNRGGGYGDSNSGRGMSGGYGYGSGSSSQGGYGSSSSNQ 374

Query: 1017 RDRDITPGRDGYTRN---------------TIANQRDYDNTGRGQTGYNTNQGYNDNFDN 1151
             D     G+ GY                  ++ +   Y +  RG +     +GY+DN  +
Sbjct: 375  GDYSSRYGQSGYGSGGYNQGGYGGDQSRYGSMDSGTSYYSDNRGSSSRENYRGYSDNDSD 434

Query: 1152 RDPSYNRTPSAGRVR 1196
             D     T  + R R
Sbjct: 435  NDQDRYNTRRSDRYR 449


>DAA23419.1 TPA: predicted protein-like [Bos taurus]
          Length = 637

 Score = 65.5 bits (158), Expect = 1e-07
 Identities = 83/308 (26%), Positives = 118/308 (38%), Gaps = 15/308 (4%)
 Frame = +3

Query: 312  RDSSRGRRGVPGYAQEDGRGNTHVPGFNQPMSQHTPYHEITTGLDRSREEENIIRNQSAP 491
            RD +RGR    G  +  GR         +   ++    E     +    +E   R+++  
Sbjct: 246  RDETRGRDETRGSDETRGRDENRGSDETRGSDENRGSDENKGSDENRHSDETRGRDENRG 305

Query: 492  FGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRD-------T 650
              ++  S          G++   G++   G+ E   R   E   RD T GRD       T
Sbjct: 306  SDETRGSDENRHRDENRGRDETRGRDETRGRDE--TRGRDETRGRDETRGRDENRGSDET 363

Query: 651  LGTSALPGQTGHGLNTVGNQREYDSRRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRSGE 830
             G+    G+  +  +     R+ + RR    G+    G+           RD +NRR  E
Sbjct: 364  RGSDENRGRDENRGSDENRSRDENRRRDETRGRDENRGRDETRGRDETRGRD-ENRRRDE 422

Query: 831  YGRRDITPGRDTTLGRD----RDITPG----RVRDITPGRTSNRNTITNSRDYDNTNLGQ 986
               RD T GRD T GRD    RD T G    R RD T G   N+ +  N R   N     
Sbjct: 423  TRGRDETRGRDETRGRDENRRRDETRGKDETRGRDETRGSDENKGSDENRRRDGNRR--- 479

Query: 987  TDYGRDNTLGRDRDITPGRDGYTRNTIANQRDYDNTGRGQTGYNTNQGYNDNFDNRDPSY 1166
                RD T GRD +   GRD  TR+   N+R  +  GR +T         D    RD + 
Sbjct: 480  ----RDETSGRDENRDRGRD-ETRHRDENRRRDETRGRDETRGRDETRGRDETRGRDENR 534

Query: 1167 NRTPSAGR 1190
             R  + GR
Sbjct: 535  GRDETRGR 542



 Score = 65.5 bits (158), Expect = 1e-07
 Identities = 91/302 (30%), Positives = 126/302 (41%), Gaps = 7/302 (2%)
 Frame = +3

Query: 312  RDSSRGRRGVPGYAQEDGRGNTHVPGFNQPMSQHTPYHEITTGLDRSR-EEENIIRNQSA 488
            RD +RGR    G  +  GR  T     N+  S  T   +   G D +R  +EN  R+++ 
Sbjct: 330  RDETRGRDETRGRDETRGRDETRGRDENRG-SDETRGSDENRGRDENRGSDENRSRDENR 388

Query: 489  PFGQS-GYSQNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRDTLGTSA 665
               ++ G  +N        G++   G++   G+ E  NRR  E   RD T GRD      
Sbjct: 389  RRDETRGRDEN-------RGRDETRGRDETRGRDE--NRRRDETRGRDETRGRD------ 433

Query: 666  LPGQTGHGLNTVGNQREYDSRRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRSGEYGRRD 845
                      T G  R+ + RR    G+    G+          +   +N+ S E  RRD
Sbjct: 434  ---------ETRG--RDENRRRDETRGKDETRGRDE-------TRGSDENKGSDENRRRD 475

Query: 846  ITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLGRD- 1022
                RD T GRD +   GR  D T  R  NR     +R  D T       GRD T GRD 
Sbjct: 476  GNRRRDETSGRDENRDRGR--DETRHRDENRRR-DETRGRDETRGRDETRGRDETRGRDE 532

Query: 1023 ---RDITPGRDGYTRNTIANQRDYDNTGRGQT-GYNTNQGYNDNFDNRDPSYNRTPSAGR 1190
               RD T GRD  TR +   +   +N GR +T G + N+G +   +N+    NRT    R
Sbjct: 533  NRGRDETRGRD-ETRGSDETRGRDENRGRDETRGRDENRGSD---ENKTSDENRTGDENR 588

Query: 1191 VR 1196
             R
Sbjct: 589  GR 590


>XP_005058511.1 PREDICTED: putative per-hexamer repeat protein 5 [Ficedula
            albicollis]
          Length = 645

 Score = 64.7 bits (156), Expect = 2e-07
 Identities = 94/357 (26%), Positives = 118/357 (33%), Gaps = 81/357 (22%)
 Frame = +3

Query: 312  RDSSRGRRGVPGYAQEDGRG---------NTHVPGFNQPMSQHTPYHEITTGLDRSREEE 464
            RD  +GR   PG  QE G G         +    G  Q     T     T   D+ RE  
Sbjct: 109  RDWDKGRETGPGLGQEHGTGTRAGKRDRDSGRNTGLGQGQGNGTGTRAGTRDWDKGRETG 168

Query: 465  NIIRNQSAPFGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYDNRR------------S 608
              +  +     ++G     +    G GQ    G     G R++D  R             
Sbjct: 169  PGLGQEHGTGTRAGKRDRDSGRNTGLGQGQGNGTGTRAGTRDWDKGRETGPGLGQEHGTG 228

Query: 609  GEYGRRDITPGRDT-------LGTSALPG--------QTG------HGLNTVGNQREYDS 725
               G+RD   GR+T        GT    G        +TG      HG  T   +R+ DS
Sbjct: 229  TRAGKRDRDSGRNTGLGQGQGNGTGTRAGTRDWDKGRETGPGLGQEHGTGTRAGKRDRDS 288

Query: 726  RRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRR------------SGEYGRRDITPGRDTT 869
             R+   GQG      G G       RD+D  R                G+RD   GR+T 
Sbjct: 289  GRNTGLGQG-----QGNGTGTRAGTRDWDKGRETGPGLGQEHGTGTRAGKRDRDSGRNTG 343

Query: 870  LGR-----------DRDITPGRVRDITPGRTSNRNTITNSRDYD---NTNLGQTDYGRDN 1007
            LG+            RD   GR      G+     T    RD D   NT LGQ       
Sbjct: 344  LGQGQGNGTGTRAGTRDWDKGRETGPGLGQEHGTGTRAGKRDRDSGRNTGLGQGQGNGTG 403

Query: 1008 TLGRDRDITPGRD-------GYTRNTIANQRDYD---NTGRGQ---TGYNTNQGYND 1139
            T    RD   GR+        +   T A +RD D   NTG GQ    G  T  G  D
Sbjct: 404  TRAGTRDWDKGRETGPGLGQEHGTGTRAGKRDRDSGRNTGLGQGQGNGTGTRAGTRD 460


>XP_015329689.1 PREDICTED: uncharacterized protein DDB_G0290685-like [Bos taurus]
            XP_015321535.1 PREDICTED: uncharacterized protein
            DDB_G0290685-like [Bos taurus]
          Length = 467

 Score = 64.3 bits (155), Expect = 2e-07
 Identities = 87/299 (29%), Positives = 124/299 (41%), Gaps = 6/299 (2%)
 Frame = +3

Query: 312  RDSSRGRRGVPGYAQEDGRGNTHVPGFNQPM-SQHTPYHEITTGLDRSR-EEENIIRNQS 485
            RD +RGR    G  +++ RG     G ++   S  T   +   G D +R  +EN   +++
Sbjct: 202  RDGNRGRDETRG--RDETRGRDETRGRDETRGSDETRGRDENRGSDETRGSDENRGSDEN 259

Query: 486  APFGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYDNRRSGEYGRRDITPGRD-TLGTS 662
                ++ +S          G +   G +        +NR   E   RD T GRD T G  
Sbjct: 260  KGSDENRHSDETRGRDENRGSDETRGSD--------ENRHRDENRGRDETRGRDETRGRD 311

Query: 663  ALPGQTGHGLNTVGNQ--REYDSRRSGEYGQGAVPGQTGYGQNVVGNQRDYDNRRSGEYG 836
               G+      T G    R  D  R  +  +G+   +   G++   N+   +NR   E  
Sbjct: 312  ETRGRD----ETRGRDETRGRDENRGSDETRGSDENR---GRDE--NRGSDENRSRDENR 362

Query: 837  RRDITPGRDTTLGRDRDITPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRDNTLG 1016
            RRD T GRD   GRD      R RD T GR  NR         D T       GRD T G
Sbjct: 363  RRDETRGRDENRGRDET----RGRDETRGRDENRRR-------DETR------GRDETRG 405

Query: 1017 RDRDITPGRDGYTRNTIANQRDYDNTGRGQT-GYNTNQGYNDNFDNRDPSYNRTPSAGR 1190
            RD   T GRD   R      +D +  GR +T G + N+G ++N   RD +  R  ++GR
Sbjct: 406  RDE--TRGRDENRRRDETRGKD-ETRGRDETRGSDENKGSDEN-RRRDGNRRRDETSGR 460


>CAB09569.2 hypothetical protein [Trypanosoma brucei brucei]
          Length = 679

 Score = 64.3 bits (155), Expect = 3e-07
 Identities = 77/248 (31%), Positives = 105/248 (42%), Gaps = 13/248 (5%)
 Frame = +3

Query: 489  PFGQSGYSQNIAAGQPGYGQNVAGGQNIVGGQREYDNRRS-GEYGRRDITPGRDTLGTSA 665
            P G  G+ + + +    + +   G +   G QR YDNR   GE+G R             
Sbjct: 11   PLGGHGHRRMVCSTICLFQRGQWGNEGSRGDQRNYDNRGGRGEWGDRG------------ 58

Query: 666  LPGQTGHGLNTVGNQREYDSRRS-GEYGQGAVPGQTGYGQNVVGNQRDYDNRRS-GEYGR 839
              GQ G       +QR YD+R   GE+G     GQ G  Q   G+QR+YDNR S GE+G 
Sbjct: 59   --GQRGDNQRDYRDQRNYDNRGGRGEWGDRG--GQRGDNQRDYGDQRNYDNRGSRGEWGD 114

Query: 840  RDITPG---RDTTLGRDRDI--TPGRVRDITPGRTSNRNTITNSRDYDNTNLGQTDYGRD 1004
            R    G   RD    R+ D   + G   D    R  N+    + R+YDN   G+ ++G  
Sbjct: 115  RGGQRGGNQRDYGDQRNYDNRGSRGEWGDRGGQRGGNQRDYGDQRNYDNRG-GRGEWG-- 171

Query: 1005 NTLGRDRDITPGRDGYTRNTIANQRDYDNTGRGQTGYNTNQGYNDNF---DNR--DPSYN 1169
                 DR    G+ G  +   +NQ  Y N   G  G    +   DNF   DNR  D   +
Sbjct: 172  -----DRG---GQRGGNQRDNSNQFGYRNERDG--GIGRQRSARDNFGAHDNRASDERGS 221

Query: 1170 RTPSAGRV 1193
              P+ G V
Sbjct: 222  TDPAVGEV 229


Top