BLASTX nr result

ID: Catharanthus23_contig00025575 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00025575
         (1268 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY12265.1| Glycosyltransferase family 61 protein [Theobroma ...    79   4e-12
ref|XP_004305644.1| PREDICTED: uncharacterized protein LOC101296...    72   5e-11
ref|XP_004170305.1| PREDICTED: glycosyltransferase-like domain-c...    68   7e-09
ref|XP_004161896.1| PREDICTED: glycosyltransferase-like domain-c...    68   7e-09
ref|XP_004157036.1| PREDICTED: glycosyltransferase-like domain-c...    68   7e-09
ref|XP_004147554.1| PREDICTED: glycosyltransferase-like domain-c...    68   7e-09
ref|XP_006452434.1| hypothetical protein CICLE_v10010510mg [Citr...    67   1e-08
ref|XP_001436905.1| hypothetical protein [Paramecium tetraurelia...    67   1e-08
ref|XP_004295843.1| PREDICTED: uncharacterized protein LOC101307...    66   3e-08
ref|NP_850032.1| uncharacterized protein [Arabidopsis thaliana] ...    64   1e-07
emb|CBH16810.1| hypothetical protein, conserved, (fragment) [Try...    64   1e-07
gb|EPS71116.1| hypothetical protein M569_03640 [Genlisea aurea]        64   2e-07
emb|CDI83792.1| hypothetical protein EAH_00043820 [Eimeria acerv...    63   3e-07
ref|WP_021435003.1| hypothetical protein [[Clostridium] difficil...    62   5e-07
emb|CBH16815.1| hypothetical protein, conserved, (fragment) [Try...    62   5e-07
gb|ABC59321.2| Jacob 6 [Entamoeba invadens]                            61   9e-07
ref|XP_828040.1| hypothetical protein [Trypanosoma brucei brucei...    60   2e-06
ref|XP_006295418.1| hypothetical protein CARUB_v10024517mg [Caps...    60   2e-06
ref|XP_004257423.1| Cyst wall-specific glycoprotein Jacob family...    60   2e-06
ref|XP_006404744.1| hypothetical protein EUTSA_v10000072mg [Eutr...    59   3e-06

>gb|EOY12265.1| Glycosyltransferase family 61 protein [Theobroma cacao]
          Length = 459

 Score = 79.0 bits (193), Expect = 4e-12
 Identities = 37/80 (46%), Positives = 52/80 (65%)
 Frame = -1

Query: 245 KLEATGFACDTAFHSVVCVTNKPVTIDVATMTVHISADQTVEPNHTIVRPYARQEDQDVL 66
           +L++ GF C T  HS VC+ + PV ID   +TV+  +DQ       +V+PYAR+ED+  +
Sbjct: 79  QLDSNGFFCHTDVHSEVCLVDNPVRIDNKALTVYAPSDQPQVKR--MVQPYARKEDETAM 136

Query: 65  KNIRPVQFLYGNTTPPACQY 6
           K + PVQ LYGNT PPAC +
Sbjct: 137 KLVTPVQILYGNTNPPACGF 156


>ref|XP_004305644.1| PREDICTED: uncharacterized protein LOC101296887 [Fragaria vesca
           subsp. vesca]
          Length = 452

 Score = 72.4 bits (176), Expect(2) = 5e-11
 Identities = 32/81 (39%), Positives = 50/81 (61%)
 Frame = -1

Query: 248 FKLEATGFACDTAFHSVVCVTNKPVTIDVATMTVHISADQTVEPNHTIVRPYARQEDQDV 69
           F+L ++G +C +  H   C+  KPV ID    TV+I +D      + I +PYAR+ED+  
Sbjct: 69  FQLHSSGLSCHSDLHFEQCLARKPVIIDKNASTVYIPSDNEANSEYKI-KPYARKEDETA 127

Query: 68  LKNIRPVQFLYGNTTPPACQY 6
           +K + PV+ ++GN TPPAC +
Sbjct: 128 MKVVTPVRIVHGNITPPACDF 148



 Score = 23.1 bits (48), Expect(2) = 5e-11
 Identities = 10/17 (58%), Positives = 12/17 (70%)
 Frame = -3

Query: 327 IEGHDSRESLFTRLVRG 277
           +EG +S   LF RLVRG
Sbjct: 49  VEGKESLRLLFRRLVRG 65


>ref|XP_004170305.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like [Cucumis sativus]
          Length = 335

 Score = 68.2 bits (165), Expect = 7e-09
 Identities = 33/85 (38%), Positives = 51/85 (60%), Gaps = 5/85 (5%)
 Frame = -1

Query: 245 KLEATGFACDTAFHSVVCVTNKPVTIDVATMTVHISADQTVEPNH---TIVRPYARQEDQ 75
           +LE TGFAC T  HS VC+TN P  I+   +  +IS +   + N+    ++ PYARQED+
Sbjct: 19  QLERTGFACHTDLHSKVCLTNNPTRINNTNLEFYISTNNDSQQNNFSPILIHPYARQEDK 78

Query: 74  DVLKNIRPVQFLY--GNTTPPACQY 6
             L+++ P+Q ++    T  P CQ+
Sbjct: 79  ITLRDVTPLQIIFQPNKTLLPLCQF 103


>ref|XP_004161896.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like [Cucumis sativus]
          Length = 407

 Score = 68.2 bits (165), Expect = 7e-09
 Identities = 33/85 (38%), Positives = 51/85 (60%), Gaps = 5/85 (5%)
 Frame = -1

Query: 245 KLEATGFACDTAFHSVVCVTNKPVTIDVATMTVHISADQTVEPNH---TIVRPYARQEDQ 75
           +LE TGFAC T  HS VC+TN P  I+   +  +IS +   + N+    ++ PYARQED+
Sbjct: 19  QLERTGFACHTDLHSKVCLTNNPTRINNTNLEFYISTNNDSQQNNFSPILIHPYARQEDK 78

Query: 74  DVLKNIRPVQFLY--GNTTPPACQY 6
             L+++ P+Q ++    T  P CQ+
Sbjct: 79  ITLRDVTPLQIIFQPNKTLLPLCQF 103


>ref|XP_004157036.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like [Cucumis sativus]
          Length = 372

 Score = 68.2 bits (165), Expect = 7e-09
 Identities = 33/85 (38%), Positives = 51/85 (60%), Gaps = 5/85 (5%)
 Frame = -1

Query: 245 KLEATGFACDTAFHSVVCVTNKPVTIDVATMTVHISADQTVEPNH---TIVRPYARQEDQ 75
           +LE TGFAC T  HS VC+TN P  I+   +  +IS +   + N+    ++ PYARQED+
Sbjct: 19  QLERTGFACHTDLHSKVCLTNNPTRINNTNLEFYISTNNDSQQNNFSPILIHPYARQEDK 78

Query: 74  DVLKNIRPVQFLY--GNTTPPACQY 6
             L+++ P+Q ++    T  P CQ+
Sbjct: 79  ITLRDVTPLQIIFQPNKTLLPLCQF 103


>ref|XP_004147554.1| PREDICTED: glycosyltransferase-like domain-containing protein
           2-like [Cucumis sativus]
          Length = 407

 Score = 68.2 bits (165), Expect = 7e-09
 Identities = 33/85 (38%), Positives = 51/85 (60%), Gaps = 5/85 (5%)
 Frame = -1

Query: 245 KLEATGFACDTAFHSVVCVTNKPVTIDVATMTVHISADQTVEPNH---TIVRPYARQEDQ 75
           +LE TGFAC T  HS VC+TN P  I+   +  +IS +   + N+    ++ PYARQED+
Sbjct: 19  QLERTGFACHTDLHSKVCLTNNPTRINNTNLEFYISTNNDSQQNNFSPILIHPYARQEDK 78

Query: 74  DVLKNIRPVQFLY--GNTTPPACQY 6
             L+++ P+Q ++    T  P CQ+
Sbjct: 79  ITLRDVTPLQIIFQPNKTLLPLCQF 103


>ref|XP_006452434.1| hypothetical protein CICLE_v10010510mg [Citrus clementina]
           gi|557555660|gb|ESR65674.1| hypothetical protein
           CICLE_v10010510mg [Citrus clementina]
          Length = 432

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 30/79 (37%), Positives = 51/79 (64%)
 Frame = -1

Query: 245 KLEATGFACDTAFHSVVCVTNKPVTIDVATMTVHISADQTVEPNHTIVRPYARQEDQDVL 66
           KL+ TGF+C T  HS +C+ NKPV ID + +T+++ + Q+   N T+ +PYA ++D   +
Sbjct: 50  KLDTTGFSCHTDLHSELCLVNKPVRIDNSGLTIYVPSSQSY-VNRTL-KPYANRDDGTAM 107

Query: 65  KNIRPVQFLYGNTTPPACQ 9
             + PV+ + G+   PAC+
Sbjct: 108 SRVSPVKIVNGDVNAPACR 126


>ref|XP_001436905.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
           gi|124404050|emb|CAK69508.1| unnamed protein product
           [Paramecium tetraurelia]
          Length = 426

 Score = 67.4 bits (163), Expect = 1e-08
 Identities = 56/218 (25%), Positives = 95/218 (43%), Gaps = 7/218 (3%)
 Frame = -3

Query: 936 AESSEETLIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNLQEHESREANQTGRLENSR 757
           A+SS E    EE +S E   +S +  E  E    +S  +   +E E  +A  +   E+  
Sbjct: 138 AQSSSEEEESEEEES-EAQSSSEEEEEEEEESDAQSSSEEESEEEEESDAQSSSEEESEE 196

Query: 756 PESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGNSKDIRKPEDKQDHEYGQV 577
            E + ++   E   E+E +   QSSS      E +ES AQ +S++  + E++ D +    
Sbjct: 197 EEESDAQSSSEEESEEEEESDAQSSSEEESEEEEEESDAQSSSEEESEEEEESDAQSSSE 256

Query: 576 YQSSR-------ESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFSTGRVNGTIELQES 418
            +S         +S+     E E  S   +SS+E +E  EEE  + S+       E +E 
Sbjct: 257 EESEESEEESDAQSSSEEESEEEEESDAQSSSEEESEESEEESEAQSSSEEESE-ESEEE 315

Query: 417 SRQLIRQVHEAIQGEESTQSSSIYREANDMIEGHDSRE 304
           S        E+ + EE +++ S   E ++  E   S E
Sbjct: 316 SEAQSSSEEESEESEEESEAQSSSEEESEESEAQSSSE 353



 Score = 65.5 bits (158), Expect = 5e-08
 Identities = 55/203 (27%), Positives = 89/203 (43%)
 Frame = -3

Query: 939 DAESSEETLIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNLQEHESREANQTGRLENS 760
           DA+SS E   +EE +S   S +  +  E  E     S ++ + +E ES   + +      
Sbjct: 169 DAQSSSEEESEEEEESDAQSSSEEESEEEEESDAQSSSEEESEEEEESDAQSSSEEESEE 228

Query: 759 RPESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGNSKDIRKPEDKQDHEYGQ 580
             E + ++   E   E+E +   QSSS        +ES AQ +S+     E+ ++ E   
Sbjct: 229 EEEESDAQSSSEEESEEEEESDAQSSSEEESEESEEESDAQSSSE-----EESEEEEESD 283

Query: 579 VYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFSTGRVNGTIELQESSRQLIR 400
              SS E +E    E E  S+  +SS+E +E  EEE    S  + +   E +ES  +   
Sbjct: 284 AQSSSEEESE----ESEEESEAQSSSEEESEESEEE----SEAQSSSEEESEESEEESEA 335

Query: 399 QVHEAIQGEESTQSSSIYREAND 331
           Q     + EES   SS   E+ +
Sbjct: 336 QSSSEEESEESEAQSSSEEESEE 358



 Score = 61.6 bits (148), Expect = 7e-07
 Identities = 57/213 (26%), Positives = 98/213 (46%), Gaps = 1/213 (0%)
 Frame = -3

Query: 936 AESSEETLIKEEPDSGEVSRNSSKINETIELQ-QPRSHDQTNLQEHESREANQTGRLENS 760
           + S EE+    E  S  ++++SS+  E+ E + + +S  +   +E E  +A  +   E+ 
Sbjct: 120 SSSEEESEESSEESSDLLAQSSSEEEESEEEESEAQSSSEEEEEEEEESDAQSSSEEESE 179

Query: 759 RPESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGNSKDIRKPEDKQDHEYGQ 580
             E + ++   E   E+E +   QSSS   E  E +ES AQ +S++    E +++ E   
Sbjct: 180 EEEESDAQSSSEEESEEEEESDAQSSSEE-ESEEEEESDAQSSSEE----ESEEEEEESD 234

Query: 579 VYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFSTGRVNGTIELQESSRQLIR 400
              SS E +     E E  S   +SS+E +E  EEE  + S+       E +ES  Q   
Sbjct: 235 AQSSSEEES-----EEEEESDAQSSSEEESEESEEESDAQSSSEEESE-EEEESDAQSSS 288

Query: 399 QVHEAIQGEESTQSSSIYREANDMIEGHDSRES 301
           +       EES   SS   E+ +  E  +++ S
Sbjct: 289 EEESEESEEESEAQSSSEEESEESEEESEAQSS 321


>ref|XP_004295843.1| PREDICTED: uncharacterized protein LOC101307291 [Fragaria vesca
           subsp. vesca]
          Length = 453

 Score = 65.9 bits (159), Expect = 3e-08
 Identities = 30/80 (37%), Positives = 49/80 (61%)
 Frame = -1

Query: 245 KLEATGFACDTAFHSVVCVTNKPVTIDVATMTVHISADQTVEPNHTIVRPYARQEDQDVL 66
           +L+ TG +C    H   C+ NKPV ID    TV+I + +    +   ++PYAR+ED+  +
Sbjct: 70  QLDTTGLSCHFDLHFEQCLANKPVIIDKNASTVYIPSYEA--KSEYKLKPYARKEDETAM 127

Query: 65  KNIRPVQFLYGNTTPPACQY 6
           K + PV+ L+GN +PP+C +
Sbjct: 128 KLVTPVRILHGNISPPSCDF 147


>ref|NP_850032.1| uncharacterized protein [Arabidopsis thaliana]
            gi|330252261|gb|AEC07355.1| uncharacterized protein
            AT2G22795 [Arabidopsis thaliana]
          Length = 734

 Score = 64.3 bits (155), Expect = 1e-07
 Identities = 62/228 (27%), Positives = 106/228 (46%), Gaps = 14/228 (6%)
 Frame = -3

Query: 981  GARQVSSRQLRRIEDAESSEETLIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNLQEH 802
            G  Q +S    + E      ET  KEE  S E S++     ET E ++  S ++T  +E 
Sbjct: 414  GGSQETSEVSSQEESKGKESETKDKEESSSQEESKDRE--TETKEKEESSSQEETMDKET 471

Query: 801  ESREA-----------NQTGRLENSRPESAGSKMIDETLQEDESKQVNQSSSRAFERIEV 655
            E++E             +T ++E+S  E    K  DET +++ES    ++  +  E  + 
Sbjct: 472  EAKEKVESSSQEKNEDKETEKIESSFLEETKEKE-DETKEKEESSSQEKTEEKETETKDN 530

Query: 654  QESTAQGNSKDIRKPEDKQDHEYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEE 475
            +ES++Q  +KD  K  +K + E     + S+E NET   E E  S    + ++ NE +E+
Sbjct: 531  EESSSQEETKD--KENEKIEKEEASSQEESKE-NETETKEKEESSSQEETKEKENEKIEK 587

Query: 474  EHLS---FSTGRVNGTIELQESSRQLIRQVHEAIQGEESTQSSSIYRE 340
            E  +    +  + N  IE +ES+ Q   +  E    E+   SS+  +E
Sbjct: 588  EESAPQEETKEKENEKIEKEESASQEETKEKETETKEKEESSSNESQE 635


>emb|CBH16810.1| hypothetical protein, conserved, (fragment) [Trypanosoma brucei
            gambiense DAL972]
          Length = 2849

 Score = 63.9 bits (154), Expect = 1e-07
 Identities = 60/244 (24%), Positives = 99/244 (40%), Gaps = 19/244 (7%)
 Frame = -3

Query: 975  RQVSSRQLRRIEDAESSEETLIKEEPDSGEVS-----RNSSKINETIELQQPRSHDQTNL 811
            +Q  SR      D  S E T   E+ +    S      +S K ++  E Q+       + 
Sbjct: 2083 KQEESRVSSESPDESSQEPTKESEKQEESRASSATGDESSQKPSKESEKQEESRVYSESP 2142

Query: 810  QEHESREANQTGRLENSRPESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGN 631
             E     + ++ + E+SRP SA     DE+ QE   +   Q  SR +   E  + ++Q  
Sbjct: 2143 DESSQEPSKESEKQEDSRPSSATR---DESSQEPSKESEKQEESRVYS--ESPDESSQEP 2197

Query: 630  SKDIRKPEDKQ-----DHEYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHL 466
            +K+  K E+ +       E  Q      E  E  R+  E+P + +    + +E  EE   
Sbjct: 2198 TKESEKQEESRASSATGDESSQEPSKESEKQEESRVYSESPDESSQEPTKESEKQEESRA 2257

Query: 465  SFSTGRVNGTIELQESSRQLIRQVHEAIQGEESTQSSS---------IYREANDMIEGHD 313
            S +TG  +  +  +ES +Q   +   A + E S  SS          +Y E+ D I    
Sbjct: 2258 SSATGDESSQMSTKESEKQEESRPSSATRDESSQMSSKESEKQEESRVYSESPDEISQEP 2317

Query: 312  SRES 301
            S+ES
Sbjct: 2318 SKES 2321



 Score = 60.5 bits (145), Expect = 1e-06
 Identities = 61/246 (24%), Positives = 102/246 (41%), Gaps = 18/246 (7%)
 Frame = -3

Query: 972  QVSSRQLRRIEDAESSEET------LIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNL 811
            Q  S++  + E++ +S  T      +  +E +  E SR SS   + I  Q+P    +   
Sbjct: 1931 QEPSKESEKQEESRASSATGDESSQMSTKESEKQEESRPSSATRDEIS-QEPTKESE--- 1986

Query: 810  QEHESREANQTGRLENSRPESAGSKMIDE---TLQEDESKQVNQSSSRAFERIEVQESTA 640
            ++ ESR ++ TG   +  P     K  +    +   DES QV+   S   E   V   + 
Sbjct: 1987 KQEESRASSATGDESSQEPTKESEKQEESRPSSATRDESSQVSSKESEKQEESRVYSESP 2046

Query: 639  QGNSKDIRKPEDKQDH---------EYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNE 487
              +S++  K  +KQ+          E  Q      E  E  R+  E+P + +    + +E
Sbjct: 2047 DESSQEPSKESEKQEESRASSATRDESSQEPSKESEKQEESRVSSESPDESSQEPTKESE 2106

Query: 486  TVEEEHLSFSTGRVNGTIELQESSRQLIRQVHEAIQGEESTQSSSIYREANDMIEGHDSR 307
              EE   S +TG  +     +ES +Q   +V+     +ES+Q  S   E  +     DSR
Sbjct: 2107 KQEESRASSATGDESSQKPSKESEKQEESRVYSE-SPDESSQEPSKESEKQE-----DSR 2160

Query: 306  ESLFTR 289
             S  TR
Sbjct: 2161 PSSATR 2166


>gb|EPS71116.1| hypothetical protein M569_03640 [Genlisea aurea]
          Length = 492

 Score = 63.5 bits (153), Expect = 2e-07
 Identities = 36/85 (42%), Positives = 55/85 (64%), Gaps = 4/85 (4%)
 Frame = -1

Query: 245 KLEATGFACD-TAFHSVVCVTNKPVTIDVATMTVHIS--ADQTVEPNHTIVRPYARQEDQ 75
           KL  TGF+CD +   S  CV ++ + ID  TMTV ++  A++TV     +VRPYARQED+
Sbjct: 112 KLLETGFSCDGSGISSKHCVVDRDMRIDTTTMTVTVASTAEETV-----VVRPYARQEDK 166

Query: 74  DVLKNIRPVQFLYGNTTPPA-CQYN 3
            +L+ + PV+ + G + P + CQ+N
Sbjct: 167 PLLQRVSPVKIIAGKSLPASPCQHN 191


>emb|CDI83792.1| hypothetical protein EAH_00043820 [Eimeria acervulina]
          Length = 1109

 Score = 62.8 bits (151), Expect = 3e-07
 Identities = 42/197 (21%), Positives = 92/197 (46%), Gaps = 2/197 (1%)
 Frame = -3

Query: 942  EDAESSEETLIKEEPDSGEVSRNSSKINETIELQQPRSHDQT--NLQEHESREANQTGRL 769
            ++   +EET  KEE    E     S+ +ET + ++    ++T    +  + +EA+Q    
Sbjct: 916  QEVSQTEETSQKEETPQAE---GISQRDETTQTEETPGQEETPQTQETSQQQEASQQQEE 972

Query: 768  ENSRPESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGNSKDIRKPEDKQDHE 589
             + + E A  +  +   Q++E+ Q  ++S +  E  +V+E++ Q       K   +Q+  
Sbjct: 973  ASQQQEEASQQQEEAPQQQEEASQQEETSQQQQETSQVEEASQQQGETPEVKETSRQEET 1032

Query: 588  YGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFSTGRVNGTIELQESSRQ 409
              Q  ++S++  ET + + E P Q   ++++  ET +++     T +   T + + +  +
Sbjct: 1033 AQQQQETSQQQEETAQQQEETPQQQEETAQQQEETPQQQE---ETAQQQETSQAETTQEE 1089

Query: 408  LIRQVHEAIQGEESTQS 358
               Q  EA+   E+  S
Sbjct: 1090 KTTQTEEALSEAETAPS 1106


>ref|WP_021435003.1| hypothetical protein [[Clostridium] difficile]
           gi|531783843|gb|EQK59905.1| hypothetical protein
           C676_1734 [Clostridium difficile F548]
          Length = 764

 Score = 62.0 bits (149), Expect = 5e-07
 Identities = 51/230 (22%), Positives = 109/230 (47%), Gaps = 16/230 (6%)
 Frame = -3

Query: 921 ETLIKEEPD---SGEVSRNSSKINETIELQQPRSHDQTNLQEHESREANQTGRL--ENSR 757
           E LI+E+ +   + E +R ++++      ++ + ++ T      +RE+N+  R   E SR
Sbjct: 164 ENLIEEQEEIRSTQETTRETNELTRETNEKERKINELTRQTNETTRESNEEERKTSETSR 223

Query: 756 PESAGSKMIDETLQEDES--KQVNQSSSRAFERIEVQESTAQGNSKDIRKPED---KQDH 592
            ES   +   ET +E+    +Q N+   +  E       T + ++++ RK  +   K++ 
Sbjct: 224 KESEDIRSTQETTREENESIRQSNEEERKTNELTRQTNETTRESNEEERKTSEIARKENE 283

Query: 591 EYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETV----EEEHLSFSTGRVNGTIELQ 424
           +     +++RE NE++R   E   + N  +++ NET     EEE  +    R     E  
Sbjct: 284 DIRNTQETTREENESIRQSNEEERKTNELTRQTNETTRESNEEERKTSEIARKEN--EDI 341

Query: 423 ESSRQLIRQVHEAIQ--GEESTQSSSIYREANDMIEGHDSRESLFTRLVR 280
            S+++  R+ +E+I+   EE  +++ + R+ N+     +  E   + + R
Sbjct: 342 RSTQETTREENESIRQSNEEERKTNELTRQTNETTRESNEEERKTSEIAR 391



 Score = 59.7 bits (143), Expect = 2e-06
 Identities = 51/232 (21%), Positives = 101/232 (43%), Gaps = 4/232 (1%)
 Frame = -3

Query: 975 RQVSSRQLRRIEDAESSEETLIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNLQEHES 796
           R+ S    +  ED  +++ET  +E     + +    K NE       ++++ T     E 
Sbjct: 272 RKTSEIARKENEDIRNTQETTREENESIRQSNEEERKTNELTR----QTNETTRESNEEE 327

Query: 795 REANQTGRLENSRPESAGSKMIDETLQEDES-KQVNQSSSRAFERIEVQESTAQGNSKDI 619
           R+ ++  R EN    S      + T +E+ES +Q N+   +  E       T + ++++ 
Sbjct: 328 RKTSEIARKENEDIRSTQ----ETTREENESIRQSNEEERKTNELTRQTNETTRESNEEE 383

Query: 618 RKPED---KQDHEYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFSTGR 448
           RK  +   K++ +     +++RE NE++R   E   + N  +++ NET  E     S   
Sbjct: 384 RKTSEIARKENEDIRSTQETTREENESIRQSNEEERKTNELTRQTNETTRE-----SNEE 438

Query: 447 VNGTIELQESSRQLIRQVHEAIQGEESTQSSSIYREANDMIEGHDSRESLFT 292
              T E+     +LIRQ  E+I   +      +    N ++E + ++  + T
Sbjct: 439 ERKTSEIARKENELIRQ--ESISNMQKKIDDKVDEVDNKIVEVNTAKTDMTT 488


>emb|CBH16815.1| hypothetical protein, conserved, (fragment) [Trypanosoma brucei
            gambiense DAL972]
          Length = 2166

 Score = 62.0 bits (149), Expect = 5e-07
 Identities = 59/229 (25%), Positives = 100/229 (43%), Gaps = 5/229 (2%)
 Frame = -3

Query: 972  QVSSRQLRRIEDAESSEETLIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNLQEHESR 793
            Q+S+++  + E++  S  T         E S+ SSK +E  E  +  S     + +  S+
Sbjct: 713  QMSTKESEKQEESRPSSAT-------RDESSQMSSKESEKQEESRVYSESPDEISQEPSK 765

Query: 792  EANQTGRLENSRPESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGNSKDIRK 613
            E+ +    E+SRP SA     DE+ QE   +   Q  SR +   E  + ++Q  +K+  K
Sbjct: 766  ESEKQ---EDSRPSSATR---DESSQEPSKESEKQEESRVYS--ESPDESSQEPTKESEK 817

Query: 612  PEDKQ-----DHEYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFSTGR 448
             E+ +       E  Q      E  E  R+  E+P + +    + +E  EE   S +TG 
Sbjct: 818  QEESRASSATGDESSQEPSKESEKQEESRVYSESPDESSQEPTKESEKQEESRASSATGD 877

Query: 447  VNGTIELQESSRQLIRQVHEAIQGEESTQSSSIYREANDMIEGHDSRES 301
             +  +  +ES +Q   +   A + E S  SS   +E+    E   S ES
Sbjct: 878  ESSQMSTKESEKQEESRPSSATRDESSQMSS---KESEKQEESRVSSES 923



 Score = 60.8 bits (146), Expect = 1e-06
 Identities = 56/228 (24%), Positives = 99/228 (43%)
 Frame = -3

Query: 972  QVSSRQLRRIEDAESSEETLIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNLQEHESR 793
            Q+S+++  + E++ +S  T         E S+ S+K +E  E  +P S  +    +  S+
Sbjct: 1073 QMSTKESEKQEESRASSAT-------GDESSQMSTKESEKQEESRPSSATRDESSQMSSK 1125

Query: 792  EANQTGRLENSRPESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGNSKDIRK 613
            E+ +    E SR  SA      +   ++  KQ    +S A      Q ST +   ++  +
Sbjct: 1126 ESEKQ---EESRASSATRDESSQMSTKESEKQEESRASSATGDESSQMSTKESEKQEESR 1182

Query: 612  PEDKQDHEYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFSTGRVNGTI 433
            P      E  Q+     E  E  R+  E+P + +    + +E  EE   S +TG  +  +
Sbjct: 1183 PSSATRDESSQMSSKESEKQEESRVYSESPDESSQEPTKESEKQEESRASSATGDESSQM 1242

Query: 432  ELQESSRQLIRQVHEAIQGEESTQSSSIYREANDMIEGHDSRESLFTR 289
              +ES +Q   +   A + +ES+Q SS   E  +     +SR S  TR
Sbjct: 1243 STKESEKQEESRPSSATR-DESSQMSSKESEKQE-----ESRPSSATR 1284


>gb|ABC59321.2| Jacob 6 [Entamoeba invadens]
          Length = 917

 Score = 61.2 bits (147), Expect = 9e-07
 Identities = 53/227 (23%), Positives = 99/227 (43%), Gaps = 14/227 (6%)
 Frame = -3

Query: 942  EDAESSEETLIKE--EPDSGEVSRNSSKINETIELQQPRSHDQTNLQEHESREANQTGRL 769
            ++  S+E++  KE  E  S E S + SK   + E  Q + H ++  +EH   ++ +    
Sbjct: 617  KEESSTEKSQSKEHSESKSKEHSESKSKEESSTEKSQSKEHSESKSKEHSESKSKEESST 676

Query: 768  ENSRP----ESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGNSKDIRKPEDK 601
            E S+     ES   +  +   +E+ S + +QS   +  + + + ST +  SK+  + + K
Sbjct: 677  EKSQSKEHSESKSKEHSESKSKEESSTEKSQSKEHSESKSKEESSTEKSQSKEHSESKSK 736

Query: 600  QDHEYGQVYQSSRESNETVRIEP----ENPSQLNNSSKEHNETVEEEHLSFSTGRVNGTI 433
            +  E     +SS E +++         E  S   + SKEH+E+  +EH    +   + T 
Sbjct: 737  EHSESKSKEESSTEKSQSKEHSESKSKEESSTEKSQSKEHSESKSKEHSESKSKEESSTE 796

Query: 432  ELQ----ESSRQLIRQVHEAIQGEESTQSSSIYREANDMIEGHDSRE 304
            + Q      S+       E  Q +E ++S S     +   E  DS E
Sbjct: 797  KSQSKEHSESKSKEESSTEKSQSKEHSESKSKEHSESKSKEHSDSDE 843



 Score = 58.5 bits (140), Expect = 6e-06
 Identities = 54/225 (24%), Positives = 90/225 (40%), Gaps = 4/225 (1%)
 Frame = -3

Query: 963  SRQLRRIEDAESSEETLIKE----EPDSGEVSRNSSKINETIELQQPRSHDQTNLQEHES 796
            S++    E ++S E +  K     E  S E S + SK   + E  Q + H ++  +EH  
Sbjct: 581  SKEESSTEKSQSKEHSESKSKEHSESKSKEHSESKSKEESSTEKSQSKEHSESKSKEHSE 640

Query: 795  REANQTGRLENSRPESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGNSKDIR 616
             +         S+ ES+  K   +   E +SK+ ++S S+     E  +S     SK   
Sbjct: 641  SK---------SKEESSTEKSQSKEHSESKSKEHSESKSKEESSTEKSQSKEHSESKSKE 691

Query: 615  KPEDKQDHEYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFSTGRVNGT 436
              E K   E       S+E +E+     E  S   + SKEH+E+  +EH S S  +   +
Sbjct: 692  HSESKSKEESSTEKSQSKEHSESK--SKEESSTEKSQSKEHSESKSKEH-SESKSKEESS 748

Query: 435  IELQESSRQLIRQVHEAIQGEESTQSSSIYREANDMIEGHDSRES 301
             E  +S      +  E    E+S        ++ +  E     ES
Sbjct: 749  TEKSQSKEHSESKSKEESSTEKSQSKEHSESKSKEHSESKSKEES 793


>ref|XP_828040.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
            GUTat10.1] gi|70833424|gb|EAN78928.1| hypothetical
            protein, conserved [Trypanosoma brucei brucei strain
            927/4 GUTat10.1]
          Length = 3452

 Score = 60.1 bits (144), Expect = 2e-06
 Identities = 56/231 (24%), Positives = 93/231 (40%), Gaps = 17/231 (7%)
 Frame = -3

Query: 942  EDAESSEETLIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNLQEHESREANQTGRLEN 763
            E  +  E  +  E PD  E+S+  +K +E  E  +  S     + +  ++E+ +    E 
Sbjct: 2108 ESEKQEESRVYSESPD--EISQEPTKESEKQEESRVYSESPDEISQEPTKESEKQ---EE 2162

Query: 762  SRPESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGNSKDIRKPEDKQDH--- 592
            SRP SA           DES Q++   S   E      +T   +S+   K  +KQ+    
Sbjct: 2163 SRPSSA---------TRDESSQISTKESEKQEESRPSSATRDESSQISTKESEKQEESRP 2213

Query: 591  ------EYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFST----GRVN 442
                  E  Q+     E  E  R+  E+P +++    + +E  EE   S +T     +++
Sbjct: 2214 SSATRDESSQISTKESEKQEESRVYSESPDEISQEPTKESEKQEESRPSSATRDESSQIS 2273

Query: 441  GTIELQESSRQLIRQVHEAIQ----GEESTQSSSIYREANDMIEGHDSRES 301
               E QE SR       E  Q      E  + S +Y E+ D I    ++ES
Sbjct: 2274 KESEKQEESRVYSESPDEISQEPTKESEKQEESRVYSESPDEISQMSTKES 2324


>ref|XP_006295418.1| hypothetical protein CARUB_v10024517mg [Capsella rubella]
            gi|482564126|gb|EOA28316.1| hypothetical protein
            CARUB_v10024517mg [Capsella rubella]
          Length = 740

 Score = 60.1 bits (144), Expect = 2e-06
 Identities = 47/216 (21%), Positives = 96/216 (44%), Gaps = 11/216 (5%)
 Frame = -3

Query: 942  EDAESSEETLIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNLQEHESREANQTGRLEN 763
            ++    +ET +KE  +S     N  K NE IE ++  S D+T  +E E++   ++   E 
Sbjct: 504  QEKTEEKETEVKENKESSSQEENKDKDNEKIEKEESSSQDETKDKETEAKVKEESPSKEK 563

Query: 762  SRPESAGSKMIDETLQEDESK------QVNQSSSRAFERIEVQESTAQGNSKDIRKPEDK 601
            +  +   +K  +E+  ++E+K      + N+ SS+   + +  E+  +  S    + +DK
Sbjct: 564  TEEKETETKENEESSSQEETKDKENETKENEVSSQEETKDKESETKEKEESLPQEETKDK 623

Query: 600  QDHEYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFSTG-----RVNGT 436
            +     +   SS  S E    E E   Q+  + K+  E   E +   S       +   T
Sbjct: 624  ETETKEKEESSSNNSQENENTESEKKEQVEENEKKTEEDTSESNKESSNSDTEQKQSEET 683

Query: 435  IELQESSRQLIRQVHEAIQGEESTQSSSIYREANDM 328
             E +ES++    +V      + S+ ++++ +E  D+
Sbjct: 684  SEKEESNKNGETEVTRE-HSDSSSDTTNLPQEVKDV 718


>ref|XP_004257423.1| Cyst wall-specific glycoprotein Jacob family protein [Entamoeba
            invadens IP1] gi|440298011|gb|ELP90652.1| Cyst
            wall-specific glycoprotein Jacob family protein
            [Entamoeba invadens IP1]
          Length = 1021

 Score = 60.1 bits (144), Expect = 2e-06
 Identities = 48/210 (22%), Positives = 98/210 (46%), Gaps = 13/210 (6%)
 Frame = -3

Query: 978  ARQVSSRQLRRIEDAESSEETLIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNLQEH- 802
            +++ S  + +   +++S EE+   E+  S E S + SK   + E  Q + H ++  +EH 
Sbjct: 519  SKEHSESKSKEHSESKSKEESST-EKSQSKEHSESKSKEESSTEKSQSKEHSESKSKEHS 577

Query: 801  --ESREANQTGRLENSRPESAGSKMIDETLQED--ESKQVNQSSSRAFERIEVQESTAQG 634
              +S+E + T + ++     + SK   E+  ++  ESK   +SS+   +  E  ES ++ 
Sbjct: 578  ESKSKEESSTEKSQSKEHSESKSKEHSESKSKEHSESKSKEESSTEKSQSKEHSESKSKE 637

Query: 633  NSKDIRKPEDKQDHEYGQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEH----- 469
            +S+   K E   +    + +  S+    +     E  S   + SKEH+E+  +EH     
Sbjct: 638  HSESKSKEESSTEKSQSKEHSESKSKEHSESKSKEESSTEKSQSKEHSESKSKEHSEANQ 697

Query: 468  -LSFSTGRVN--GTIELQESSRQLIRQVHE 388
              S    RVN   T++  +     +++V++
Sbjct: 698  KKSHQLKRVNQKNTLKANQKKSHQLKRVNQ 727


>ref|XP_006404744.1| hypothetical protein EUTSA_v10000072mg [Eutrema salsugineum]
           gi|557105872|gb|ESQ46197.1| hypothetical protein
           EUTSA_v10000072mg [Eutrema salsugineum]
          Length = 666

 Score = 59.3 bits (142), Expect = 3e-06
 Identities = 49/226 (21%), Positives = 105/226 (46%), Gaps = 14/226 (6%)
 Frame = -3

Query: 942 EDAESSEET-LIKEEPDSGEVSRNSSKINETIELQQPRSHDQTNLQEHESREANQTGRLE 766
           +++ESS++T  I+E+ +S    ++  K  E +E ++  S +++  +E E +E  ++   E
Sbjct: 306 DESESSQKTDSIEEKEESSSQEKSEDKGTEKVEKEEASSQEESKDKESEEKEKEESSSQE 365

Query: 765 NSRPESAGSKMIDETLQEDESKQVNQSSSRAFERIEVQESTAQGNSKDIRKPEDKQDHEY 586
            ++ + +  K  +E+  ++E+K+      +  E  + +ES++Q  +K+ ++ E K   E 
Sbjct: 366 ETKDKESEEKEKEESSSQEENKE------KETETKDKEESSSQEENKE-KETETKDKEES 418

Query: 585 GQVYQSSRESNETVRIEPENPSQLNNSSKEHNETVEEEHLSFSTGR----------VNGT 436
               Q  R+  ET +IE E  S       +  E +E+E  S S  +            G 
Sbjct: 419 SS--QEERKEKETEKIEKEESSSKEEKEVKETEKLEKEEESSSQEKNEDKDTEKIEKEGE 476

Query: 435 IELQESSRQL---IRQVHEAIQGEESTQSSSIYREANDMIEGHDSR 307
              QE S+      ++  E+   EE+    +  +E  + +   +S+
Sbjct: 477 SSSQEESKDKETETKEKEESSSQEETKDKGTETKEKEESLSQEESK 522

