BLASTX nr result

ID: Ephedra29_contig00004602 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra29_contig00004602
         (2027 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ABR16533.1 unknown [Picea sitchensis]                                 595   0.0  
XP_002983469.1 hypothetical protein SELMODRAFT_445559 [Selaginel...   390   e-125
XP_002974022.1 hypothetical protein SELMODRAFT_442248 [Selaginel...   388   e-124
XP_001758612.1 predicted protein [Physcomitrella patens] EDQ7659...   342   e-107
EWM29418.1 hypothetical protein Naga_100042g35 [Nannochloropsis ...   174   4e-43
XP_003724058.1 PREDICTED: uncharacterized protein LOC100889358 [...   130   1e-28
XP_019627227.1 PREDICTED: uncharacterized protein LOC109472096 [...   129   3e-28
XP_009051377.1 hypothetical protein LOTGIDRAFT_152602 [Lottia gi...   126   3e-27
XP_019633246.1 PREDICTED: uncharacterized protein LOC109476680 [...   118   2e-24
XP_002601781.1 hypothetical protein BRAFLDRAFT_75999 [Branchiost...   117   4e-24
XP_005646321.1 hypothetical protein COCSUDRAFT_42826 [Coccomyxa ...   114   4e-23
XP_013062240.1 PREDICTED: uncharacterized protein LOC106051586 i...   110   4e-22
XP_009051376.1 hypothetical protein LOTGIDRAFT_228186 [Lottia gi...   103   1e-19
XP_011683993.1 PREDICTED: uncharacterized protein LOC100890127 [...    97   6e-19
XP_011440327.1 PREDICTED: uncharacterized protein LOC105337342 [...   100   1e-18
XP_018017320.1 PREDICTED: uncharacterized protein LOC108673941 [...    97   1e-17
XP_013062242.1 PREDICTED: uncharacterized protein LOC106051586 i...    94   9e-17
XP_013408206.1 PREDICTED: uncharacterized protein LOC106172138 [...    86   4e-14
XP_005110624.1 PREDICTED: uncharacterized protein LOC101861622 [...    84   2e-13
XP_001691009.1 hypothetical protein CHLREDRAFT_205590 [Chlamydom...    75   2e-10

>ABR16533.1 unknown [Picea sitchensis]
          Length = 530

 Score =  595 bits (1534), Expect = 0.0
 Identities = 297/475 (62%), Positives = 376/475 (79%), Gaps = 3/475 (0%)
 Frame = +2

Query: 302  AKIAHSSSMEHPLPVSSSEDVSNDAIIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELV 481
            +K   S S+  PL  S  EDVS+ AII EP++E+GI  L+LFFPGA T P +YFPLI+LV
Sbjct: 58   SKSMASPSLAPPLD-SPGEDVSSQAIIFEPVREQGIDVLLLFFPGALTSPQAYFPLIKLV 116

Query: 482  QNASSCRLWAVVLDPREKVISAGMIEASIGGMITRAKERGFVPGVLSISNVYFAGHSVGS 661
            QN S+ RLWA +LDP E+ +S+ MIEASI GMITRAKERGF+PG LSIS ++ AGHS+G+
Sbjct: 117  QNISAFRLWASILDPGERSLSSSMIEASIDGMITRAKERGFIPGELSISKIFIAGHSIGA 176

Query: 662  WLGRSIAKKQTAGFIQMGSCFFSTVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIY 841
            W  R+IAKK+  GFI+MG CFF T  DNL QYPKPVL+LGG LDGQLKLAGM N A EIY
Sbjct: 177  WCARNIAKKRAEGFIEMG-CFFDTNSDNLAQYPKPVLSLGGGLDGQLKLAGMANLACEIY 235

Query: 842  KVESEFGDFNTNAVKPVIIIPGMNHAQFSHGIPNKERGDLDGIISLEEAQSIAAKYIASF 1021
             +E E GDFNT+AVKPVIIIPGMNHAQFSHGIPNKERGDLD  IS+E+A+S AA++I+SF
Sbjct: 236  IIEPEMGDFNTHAVKPVIIIPGMNHAQFSHGIPNKERGDLDADISIEQARSQAAEFISSF 295

Query: 1022 ITLHLEGKNVAADQALEILKCGVGQTQKLYRTFWEALDNQEMQAKHWQLKVAGLEELTEE 1201
            +T+H++G+    + A E L+ GV +T  LY+TFWEA+ NQEMQAK WQL++AGL+ELTEE
Sbjct: 296  LTVHIKGQAETRELAFENLRKGVKKTHDLYKTFWEAIQNQEMQAKFWQLQIAGLQELTEE 355

Query: 1202 NIHVTKHDYLENFIYSKPWIDMKLKRIFVQIYCSPSDKYGINNNIWVKMKSSDAIKQSFN 1381
            N+ V KHDYL+NF+YSKPWID+K+KRIFVQ+Y S +DK+GI  NIW+KMKS +AI+  F 
Sbjct: 356  NVVVIKHDYLDNFVYSKPWIDIKMKRIFVQVYLSSADKFGIIKNIWIKMKSYEAIQSIFQ 415

Query: 1382 VGSSQTKIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDDEMPS--LRWIPSD 1555
               SQ+K VSA +LNA+TF +A+SL PEVF+++F E GK LRF++D  + S    WI SD
Sbjct: 416  KSESQSKNVSAADLNADTFHRALSLTPEVFRRKFYECGKKLRFIEDLVVKSSGQDWIDSD 475

Query: 1556 VDLKPSKDDSNIVDVKSPVLFTG-NDMPLRFAGMHYMKPLSLARCMEWILLDAFR 1717
            V +KP++D  + VDV+S V+ T  + +  RFAGMHYMK LS+A+CMEWI+LD+F+
Sbjct: 476  VIMKPAEDGLDFVDVQSTVIITPVSGVASRFAGMHYMKILSVAKCMEWIMLDSFQ 530


>XP_002983469.1 hypothetical protein SELMODRAFT_445559 [Selaginella moellendorffii]
            EFJ15370.1 hypothetical protein SELMODRAFT_445559
            [Selaginella moellendorffii]
          Length = 504

 Score =  390 bits (1003), Expect = e-125
 Identities = 208/468 (44%), Positives = 296/468 (63%), Gaps = 15/468 (3%)
 Frame = +2

Query: 359  DVSNDAIIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDPREKV 538
            D S +AII+EP++E G    ++F  GA TP   Y PL + +Q +   R+W  +++P   V
Sbjct: 43   DSSANAIILEPVREGGQDVAIVFASGALTPASDYIPLCQEIQRSCELRMWIAIVNPPNNV 102

Query: 539  ISAGMIEASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAKKQTAGFIQMGS 718
            IS  +IE S+ G++ R K+RGF PG L++ N++ AGHS G+W GR++A  +  GFIQ+GS
Sbjct: 103  ISQEIIEQSLDGILERMKDRGFRPGQLAMDNIFIAGHSWGAWTGRAVAVGRAQGFIQIGS 162

Query: 719  CFFSTVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNAVKPVII 898
            CF  T PDNL QYPKPVLTL G LDGQ+ L  +  HAGEI+ VE E G FNT   KPV++
Sbjct: 163  CFH-TNPDNLSQYPKPVLTLSGELDGQITLGAIAKHAGEIFDVEEEMGSFNTYGRKPVVV 221

Query: 899  IPGMNHAQFSHGIPNKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAADQALEIL 1078
            IPGMNHAQ SHG+PNK RGDLD  I +E+A++   + IA+F+T+H   ++ +A   LE+ 
Sbjct: 222  IPGMNHAQVSHGVPNKARGDLDAEIPIEQARTDVGRLIAAFVTVHAAPESASA--LLELE 279

Query: 1079 KCGVGQTQKLYRTFWEALDNQ-EMQAKHWQLKVAGL--EELTEENIHVTKHDYLENFIYS 1249
            K  V  T +  R++WEA+  Q     K++QL +A +  + L+E  + V +HDY +NF+YS
Sbjct: 280  K-AVKNTHETCRSYWEAIKEQGSSGVKNYQLDLAKVSPQVLSEGQVSVIQHDYEDNFVYS 338

Query: 1250 KPWIDMK-LKRIFVQIYCSPSDKYGINNNIWVKMKSSDAIKQSFNVGSSQTKIVS----- 1411
            KPWI+ K  +++FV  Y    D + +  ++W+KMKS +AI  +F     Q    +     
Sbjct: 339  KPWIEHKPARKVFVNTYLKSQDNFQVVRSLWIKMKSREAITTAFKAADDQKDDAASSPSS 398

Query: 1412 --AKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDD----DEMPSLRWIPSDVDLKPS 1573
              A   N  TFQ+A+ LVPE  + +F + G+  RF+DD    D  P  +WI SDV L  +
Sbjct: 399  RIAAGFNERTFQEALKLVPERARAKFLDRGRKPRFVDDLVISDSAP--KWIKSDVTLVAA 456

Query: 1574 KDDSNIVDVKSPVLFTGNDMPLRFAGMHYMKPLSLARCMEWILLDAFR 1717
            +D     DV+SPVL +  +MP RFAGMHYMK L++A  M+WI  D +R
Sbjct: 457  ED--GFADVQSPVLISPMEMPPRFAGMHYMKLLTIAGAMKWIFTDCYR 502


>XP_002974022.1 hypothetical protein SELMODRAFT_442248 [Selaginella moellendorffii]
            EFJ24977.1 hypothetical protein SELMODRAFT_442248
            [Selaginella moellendorffii]
          Length = 505

 Score =  388 bits (997), Expect = e-124
 Identities = 208/469 (44%), Positives = 296/469 (63%), Gaps = 16/469 (3%)
 Frame = +2

Query: 359  DVSNDAIIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDPREKV 538
            D S +AII+EP++E G    ++F  GA TP   Y PL + +Q +   R+W  +++P   V
Sbjct: 43   DSSANAIILEPVREGGQDVAIVFASGALTPASDYIPLCQEIQRSCELRMWIAIVNPPNNV 102

Query: 539  ISAGMIEASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAKKQTAGFIQMGS 718
            IS  +IE S+ G++ R K+RGF PG L++ N++ AGHS G+W GR++A  +  GFIQ+GS
Sbjct: 103  ISQEIIEQSLDGILERMKDRGFRPGQLAMDNIFIAGHSWGAWTGRAVAVGRAQGFIQIGS 162

Query: 719  CFFSTVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNAVKPVII 898
            CF  T PDNL QYPKPVLTL G LDGQ+ L  +  HAGEI+ VE E G FNT   KPV+ 
Sbjct: 163  CFH-TNPDNLSQYPKPVLTLSGELDGQITLGAIAKHAGEIFDVEEEMGSFNTYGRKPVVA 221

Query: 899  IPGMNHAQFSHGIPNKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAADQALEIL 1078
            IPGMNHAQ SHG+PNK RGDLD  I +E+A++   + IA+F+T+H   ++ +A   LE+ 
Sbjct: 222  IPGMNHAQVSHGVPNKARGDLDAEIPIEQARADVGRLIAAFVTVHAAPESASA--LLELE 279

Query: 1079 KCGVGQTQKLYRTFWEALDNQ-EMQAKHWQLKVAGL--EELTEENIHVTKHDYLENFIYS 1249
            K  V  T ++ R++WEA+  Q     K++QL +A +  + L+E  + V +HDY +NF+YS
Sbjct: 280  K-AVKNTHEMCRSYWEAIKEQGSSGVKNYQLDLAKVSPQVLSEGQVSVIQHDYEDNFVYS 338

Query: 1250 KPWIDMK-LKRIFVQIYCSPSDKYGINNNIWVKMKSSDAIKQSFNVGSSQTKIVS----- 1411
            KPWI+ K  +++FV  Y    D + +  ++W+KMKS +AI  +F     Q    +     
Sbjct: 339  KPWIEHKPARKVFVNTYLKSQDNFQVVRSLWIKMKSREAIITAFKAADDQKDDEAPSPPS 398

Query: 1412 ---AKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDD----DEMPSLRWIPSDVDLKP 1570
               A   N  TFQ+A+ LVPE  + +F + G+  RF+DD    D  P  +WI SDV L  
Sbjct: 399  SRIAAGFNERTFQEALKLVPERARAKFLDRGRKPRFVDDLVISDSAP--KWIKSDVTLVA 456

Query: 1571 SKDDSNIVDVKSPVLFTGNDMPLRFAGMHYMKPLSLARCMEWILLDAFR 1717
            ++D     DV+SPVL +  +MP RFAGMHYMK L++A  M+WI  D +R
Sbjct: 457  AED--GFADVQSPVLISPMEMPPRFAGMHYMKLLTIAGAMKWIFTDCYR 503


>XP_001758612.1 predicted protein [Physcomitrella patens] EDQ76590.1 predicted
            protein [Physcomitrella patens]
          Length = 483

 Score =  342 bits (877), Expect = e-107
 Identities = 185/433 (42%), Positives = 266/433 (61%), Gaps = 10/433 (2%)
 Frame = +2

Query: 383  IEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDPREK-VISAGMIE 559
            +EP+ E     L++F PG F    +YFPL+  +Q+    RLW VVL   ++ V+S   ++
Sbjct: 1    MEPIHEGDTDLLLVFCPGTFQTSENYFPLMHTIQSQLKLRLWIVVLHNTDQDVLSTSKVD 60

Query: 560  ASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAKKQTAGFIQMGSCFFSTVP 739
            AS+ G++   KERG+ PG   I N++ AGHS G+W+ R++A ++   FIQ+G C+F +  
Sbjct: 61   ASLTGVLALLKERGYRPGSNEIENIFVAGHSFGAWVSRAVAVRRAQAFIQIG-CYFDSEN 119

Query: 740  DNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNAVKPVIIIPGMNHA 919
            DNL Q+PKPVLTL GALDGQ+ LA +  HAGE+   E   G +NT AVKPVI I GMNHA
Sbjct: 120  DNLAQHPKPVLTLCGALDGQVTLAAIAKHAGEVAATEQYLGRYNTYAVKPVIFISGMNHA 179

Query: 920  QFSHGIPNKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVA-ADQALEILKCGVGQ 1096
              S+G  N ERGDL   IS+++A+   A+ + +F+ +  +G N A   + LEIL  GV  
Sbjct: 180  HASNGRLNLERGDLQATISIDDARHRVAELVTAFLAVQAKGPNEAEGARGLEILTRGVDD 239

Query: 1097 TQKLYRTFWEALDNQEMQAKHWQLKVAGLEELTEENIHVTKHDYLENFIYSKPWIDMKLK 1276
            T   YR  WE++ NQE  A   QL +A L  L  ENI    HD+ +NF+ SKPWID  + 
Sbjct: 240  THARYRALWESIANQEGDAVAHQLHIASLPSLCPENITSIHHDFRDNFVISKPWIDTGMN 299

Query: 1277 RIFVQIYCSPSDKYGINNNIWVKMKSSDAIKQSFNVGSS-------QTKIVSAKELNAET 1435
            R+F+  Y SP++K GI  N+WVKMKS +A+   F  G            +   KE+N +T
Sbjct: 300  RVFITTYLSPAEKQGI-CNLWVKMKSREALLPHFGAGDDAGARYDPAAILTLGKEINTKT 358

Query: 1436 FQKAISLVPEVFQKRFQEAGKMLRFLDDDEMPS-LRWIPSDVDLKPSKDDSNIVDVKSPV 1612
            F  A+SLV E  +++F + GK L+F+DD  M S + WI SD+   P+  +S  V+V++P+
Sbjct: 359  FDAALSLVSEDAREKFMKMGKKLQFVDDSLMQSAVSWIESDLSFTPT--ESGDVEVRTPI 416

Query: 1613 LFTGNDMPLRFAG 1651
            L++  ++  RFAG
Sbjct: 417  LYSPANINPRFAG 429


>EWM29418.1 hypothetical protein Naga_100042g35 [Nannochloropsis gaditana]
          Length = 618

 Score =  174 bits (442), Expect = 4e-43
 Identities = 142/493 (28%), Positives = 237/493 (48%), Gaps = 37/493 (7%)
 Frame = +2

Query: 347  SSSEDVSNDAIIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDP 526
            S  ED S+     +P        +++ FPG    P +Y      +Q+A +        D 
Sbjct: 137  SPQEDTSS-----QPPTSVEADMVLVLFPGIGMGPAAYRETALAIQDALAANY-----DV 186

Query: 527  REKVISAGMI----------EASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRS 676
            +  V+ A             E  +  ++T    RG    V + + V  AGHS G++L   
Sbjct: 187  KAYVVVAKFFNNLGYLPQEPERRLASILTEVSLRG----VSARAPVAVAGHSAGAFLAYE 242

Query: 677  IAKKQTAGFIQMGSCFFST-----VPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIY 841
             A  ++  F+ +GS   S       P +++ +PKP+L L G +DG ++  G      E+ 
Sbjct: 243  AALTRSQAFVHLGSTLNSRGVLPWKPRSVLAFPKPILQLLGEMDGYIRFTGGALEYAEVE 302

Query: 842  KVESEFGDFNTNAVKPVIIIPGMNHAQFSHGIPNKE-----RGDLDGIISLEEAQSIAAK 1006
             + ++ G  +    KPV+++PG++H QF  G  +K      R DL   ++L+EA  +  K
Sbjct: 303  SLMAKKGFEDALLDKPVVLLPGVSHQQFGDGSQSKAARMSGRRDLPPYVALQEAHRMTGK 362

Query: 1007 YIASFITLHLEGKNVAADQA-LEILKCGVGQTQKLYRTFWEALDNQEMQAK----HW-QL 1168
             +ASF+  HL   N  A  A   +L+     T ++ R +   LD    QA+     W Q 
Sbjct: 363  IVASFLAYHLFPLNGRARVAGASVLRQAFEATGRMVRPY---LDESTPQAEDDFIRWAQA 419

Query: 1169 KVAGLEELTEENIHVTKHDYLENFIYSKPWIDMKLK--RIFVQIYCSPSDKYGINNNIW- 1339
            +VA +E +   ++    ++    F+YSKP++D + +  ++ V+     + + G    I  
Sbjct: 420  EVASVEGVGRNSVRALLYESEGEFVYSKPFLDTESEYLQVCVRKVQEAAIRIGFTKQISP 479

Query: 1340 ---VKMKSSDAIKQSFNVGSSQTKIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRF 1510
                KMK  +A+ Q+     S+ K  +A ELN  TFQKA+  V E  ++R+   G+ L F
Sbjct: 480  ALDFKMKRQEAVVQALRRYPSK-KAPTAAELNHRTFQKALEKVSEEARRRYDRYGRKLEF 538

Query: 1511 LDDDEMPSLR-----WIPSDVDLKPSKDDSNIVDVKSPVLFTGNDMPLRFAGMHYMKPLS 1675
            + DDE+ + R     W+ + +++K   +D   V V+SPVL+T  D   RFAGM YMK L+
Sbjct: 539  IADDEITAERGGGPAWVATPLEVKARVEDPMRVTVRSPVLYTPIDTLPRFAGMCYMKLLT 598

Query: 1676 LARCMEWILLDAF 1714
             A+ +EWI  DA+
Sbjct: 599  PAQAVEWICHDAY 611


>XP_003724058.1 PREDICTED: uncharacterized protein LOC100889358 [Strongylocentrotus
            purpuratus]
          Length = 486

 Score =  130 bits (328), Expect = 1e-28
 Identities = 128/470 (27%), Positives = 214/470 (45%), Gaps = 23/470 (4%)
 Frame = +2

Query: 377  IIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDPREKVISAGMI 556
            ++++P+K   + A ++  PGA     +Y PL   +Q AS  +L   +       ++    
Sbjct: 29   VLLDPIKNGPMEAGLIVVPGAELRGETYAPLAAQIQEASPLKLHVALTTD---YLNDTPN 85

Query: 557  EASIGGMITRAKERGFVPGVLSISNVYFAGHSVG-----SWLGRSIAKKQTAGFIQMGSC 721
               +G  I RA        +   + ++ AGHS+G     +W+  +    Q AG +  GS 
Sbjct: 86   PVQVGNAIERAITELRNANLPDDAPIFVAGHSLGGTFLQTWVDNN--PTQVAGMMLWGS- 142

Query: 722  FFSTVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEF--GDFNTNAVKPVI 895
            + +    +L  YP PV+ L G LDGQ+++     +A    ++ES    G  +  A +PV+
Sbjct: 143  YLTGATGDLGAYPTPVMHLCGDLDGQVRIT---RNARTFRELESLLVNGPSSLIATRPVV 199

Query: 896  IIPGMNHAQFSHGI--PNKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAADQAL 1069
            +I G+NH QFS G   P  E+ DL   ++ EEA  + A  I SF+  +++ +      ++
Sbjct: 200  LIEGVNHFQFSSGEKPPLVEKEDLPADVTAEEAYVLLADPINSFMHYNMDYET---SNSM 256

Query: 1070 EILKCGVGQTQKLYRTFWEALDNQ-EMQAKHW----QLKVAGLEELTEENIHVTKHDYLE 1234
            E L      T+          D + +     W    Q  VAG +      +  T   Y++
Sbjct: 257  ENLNSHYIATRNKLAPLTAMKDLEWDGATSPWLITAQEMVAGFDPSQALPVVTTNVGYVD 316

Query: 1235 N--FIYSKPWIDMKLKRIFVQIYC--SPSDKYGIN---NNIWVKMKSSDAIKQSFNVGSS 1393
               F  SKP +D  +      +Y   + +D  GI    N I  KMK+  AI+  F     
Sbjct: 317  QTEFESSKPSVDTSVIETTAMVYFPRNVADLSGIKESANEIAGKMKNQGAIESVFP-SDG 375

Query: 1394 QTKIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDDE-MPSLRWIPSDVDLKP 1570
             T   + K++N   F  A S+ P V ++R+   G  + F+DD+E +    W+ S V+   
Sbjct: 376  YTTPATCKQINEAAFDYAFSMAPTVVKQRYNTRGYSMEFMDDNELLTGQDWVDSTVEFTT 435

Query: 1571 SKDDSNIVDVKSPVLFTGNDMPLR-FAGMHYMKPLSLARCMEWILLDAFR 1717
              D +  + V S  L+T  + P   +AGM Y K +S  R +EWI +D+ R
Sbjct: 436  LPDGT--LQVASGSLYTHPNYPNEIYAGMQYCKLMSPHRALEWIYVDSLR 483


>XP_019627227.1 PREDICTED: uncharacterized protein LOC109472096 [Branchiostoma
            belcheri]
          Length = 486

 Score =  129 bits (324), Expect = 3e-28
 Identities = 132/479 (27%), Positives = 214/479 (44%), Gaps = 33/479 (6%)
 Frame = +2

Query: 380  IIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDPREKVISAGMIE 559
            ++ P   +G    ++  PGA+    +Y PL + +Q+ S  +LW  + D     +   +  
Sbjct: 25   LLRPTNTDGAEMGLIIVPGAYIKGTAYQPLAQTIQDLSPHKLWVGLTDGYVTDLPNPL-- 82

Query: 560  ASIGGMITRAKERGFVPGVLSISNVYFAG-HSVGS---WLGRSIAKKQTAGFIQMGSCFF 727
              +   I   K+     G+   ++V+F G HS+G     +  S   +Q  G +  GS   
Sbjct: 83   -ELSSAIQACKQAIVQDGMK--TDVFFIGAHSLGGTFLQMYLSDNPRQAKGMLLWGSYLT 139

Query: 728  STVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEF----GDFNTNAV--KP 889
            S+ P  +  +P PVLTL G LDG ++L       G  +K   EF    G+F+T  V  KP
Sbjct: 140  SSYP--MSTFPVPVLTLNGDLDGLVRL-------GYSWKKYREFVAMDGNFSTAYVYQKP 190

Query: 890  VIIIPGMNHAQFSHG-IP-NKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAADQ 1063
            V+++PG+NH   + G +P N    DL   +++E+A  + A    +F+  +    N AA Q
Sbjct: 191  VVVVPGLNHGHIASGPMPSNVLNMDLPAEMTMEQAHRLIANSSVNFMVAN-SPNNTAAMQ 249

Query: 1064 ALEI--LKCGVGQTQKLYRTF--WEALDNQEMQAKHW----QLKVAGLEELTEENIHVTK 1219
            A  +  L+  +  T ++   F    ALD  + +   W    Q  + G     +  + V K
Sbjct: 250  AAAVKHLRTQMNVTGRILAPFDIVSALD-YDGKTSPWVTTAQQSIIGAPAALQSKLTV-K 307

Query: 1220 HDYLENFIY---SKPWIDMKLKRIFVQIY---------CSPSDKYGINNNIWVKMKSSDA 1363
               ++N +    +KP ++ +   + VQ Y            S+ Y   N +  KMK   A
Sbjct: 308  TKVVDNILQLGDNKPKVEKEGDMVTVQTYTKLDYPLNPIDNSEPYVSTNMLSTKMKRQSA 367

Query: 1364 IKQSFNVGSSQTKIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDD-EMPSLR 1540
            + Q    G   + I + K+LN   FQ A +        R+Q+ G  L F DD+ +     
Sbjct: 368  VVQELGPGDYNSPI-TCKDLNQMAFQIASTAASNTAMTRYQQKGHHLTFADDEMKSTGSG 426

Query: 1541 WIPSDVDLKPSKDDSNIVDVKSPVLFTGNDMPLRFAGMHYMKPLSLARCMEWILLDAFR 1717
            W+   +  +   D +  V V SP L TG D    F GMHY K LS  R +E+I  D+ R
Sbjct: 427  WLSGGLTFEDQGDGT--VKVTSPALVTGLDAWFGFDGMHYCKLLSPFRALEYIYTDSLR 483


>XP_009051377.1 hypothetical protein LOTGIDRAFT_152602 [Lottia gigantea] ESO97511.1
            hypothetical protein LOTGIDRAFT_152602 [Lottia gigantea]
          Length = 490

 Score =  126 bits (317), Expect = 3e-27
 Identities = 125/473 (26%), Positives = 203/473 (42%), Gaps = 27/473 (5%)
 Frame = +2

Query: 380  IIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVL-DPREKVISAGMI 556
            I+ PLK  G+ A ++  PGA     +Y PL   +Q  S  +LW  +L D    + +   +
Sbjct: 23   ILSPLKTSGVDAALIIVPGADIKGGAYRPLARHIQETSDLKLWVALLEDFPFNLPNPLQL 82

Query: 557  EASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAK---KQTAGFIQMGSCFF 727
              +I   ++R K  G     +  +NV+ AGHS+G     +  K   K   G I   S  +
Sbjct: 83   NGAISQAVSRIKAAG-----MKTNNVFVAGHSLGGVFVGNYGKSNSKLVKGIILFAS--Y 135

Query: 728  STVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNAVKPVIIIPG 907
             T  + L  YP PVLT+ G LDG  +L  + +   E+    ++  D       PVI++ G
Sbjct: 136  LTKGNKLADYPIPVLTVSGDLDGLTRLTRIADTFEELKGDVAKRSDAKYRT--PVIMMTG 193

Query: 908  MNHAQFSHGI--PNKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAADQALEILK 1081
            +NH QF+ G    N    D+   IS   A  + +K  A F+   +     A   A   L 
Sbjct: 194  VNHGQFASGTMPSNVLNYDIKSEISSTAAHLLISKNTADFMVTSIGTAGSALKSAKHRLD 253

Query: 1082 CGVGQTQKLYRTFWEALDN--QEMQAKHW----QLKVAGLEELT-------EENI--HVT 1216
                +T   ++   +   N     ++  W    Q  ++GL++         E N+   V+
Sbjct: 254  QAYARTDSFFKPILDMKRNDVNPQRSSSWTISAQYLISGLDKSVLKVTNKEESNLAFPVS 313

Query: 1217 KHDYLENFIYSKPWIDMKLKRIFVQIYCSPSDKYGINNNIWVKMKSSDAI----KQSFNV 1384
            K +    F Y    I+      +       S      +++  K+KS +A+     +S+  
Sbjct: 314  KPEVKTAFSYES--INTHSYLSYANNLMDVSTVMSAPDSLDAKLKSKEAVYKALPKSYKP 371

Query: 1385 GSSQTKIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDDEMPS-LRWIPSDVD 1561
             S+ +K  +  ++N E +  A+        KRF   G+ ++F+ D    + + WI S   
Sbjct: 372  HSTASK--TCMDINKEAWTLALKQSSRTAVKRFHSMGRSMKFIKDHVFSTGIDWISSSAS 429

Query: 1562 LKPSKDDSNIVDVKSPVLFTGND-MPLRFAGMHYMKPLSLARCMEWILLDAFR 1717
             K +  D   V  +S  L +  D  P  FAGMHY K LS  R MEWI +D+ R
Sbjct: 430  WKETTSD---VTFQSTALVSKVDAFPAAFAGMHYCKLLSPYRAMEWIYVDSLR 479


>XP_019633246.1 PREDICTED: uncharacterized protein LOC109476680 [Branchiostoma
            belcheri]
          Length = 512

 Score =  118 bits (296), Expect = 2e-24
 Identities = 132/502 (26%), Positives = 214/502 (42%), Gaps = 56/502 (11%)
 Frame = +2

Query: 380  IIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDPREKVISAGMIE 559
            ++ P   +G    ++  PGA+    +Y PL + +Q+ S  +LW  + D     +   +  
Sbjct: 25   LLRPTNTDGAEMGLIIVPGAYIKGTAYQPLAQTIQDLSPHKLWVGLTDGYVTDLPNPL-- 82

Query: 560  ASIGGMITRAKERGFVPGVLSISNVYFAG-HSVGS---WLGRSIAKKQTAGFIQMGSCFF 727
              +   I   K+     G+   ++V+F G HS+G     +  S   +Q  G +  GS   
Sbjct: 83   -ELSSAIQACKQAIVQDGMK--TDVFFIGAHSLGGTFLQMYLSDNPRQAKGMLLWGSYLT 139

Query: 728  STVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEF----GDFNTNAV--KP 889
            S+ P  +  +P PVLTL G LDG ++L       G  +K   EF    G+F+T  V  KP
Sbjct: 140  SSYP--MSTFPVPVLTLNGDLDGLVRL-------GYSWKKYREFVAMDGNFSTAYVYQKP 190

Query: 890  VIIIPGMNHAQFSHG-IP-NKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAADQ 1063
            V+++PG+NH   + G +P N    DL   +++E+A  + A    +F+  +    N AA Q
Sbjct: 191  VVVVPGLNHGHIASGPMPSNVLNMDLPAEMTMEQAHRLIANSSVNFMVAN-SPNNTAAMQ 249

Query: 1064 ALEI--LKCGVGQTQKLYRTF--WEALDNQEMQAKHW----QLKVAGLEELTEENIHVTK 1219
            A  +  L+  +  T ++   F    ALD  + +   W    Q  + G     +  + V K
Sbjct: 250  AAAVKHLRTQMNVTGRILAPFDIISALD-YDGKTSPWVTTAQQSIIGAPAALQSKLTV-K 307

Query: 1220 HDYLENFIY---SKPWIDMKLKRIFVQIY---------CSPSDKYGINNNIWVKMKSSDA 1363
               ++N +    +KP ++ +   + VQ Y            S+ Y   N +  KMK   A
Sbjct: 308  TKVVDNILELGDNKPKVEKEGDMVTVQTYTKLDYPLNPIDNSEPYVSTNMLSTKMKRQSA 367

Query: 1364 IKQSFNVGSSQTKIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDD-EMPSLR 1540
            + Q    G   + I + K+LN   FQ A +        R+Q+ G  L F DD+ +     
Sbjct: 368  VVQELGPGDYNSPI-TCKDLNQMAFQIASTAASSTAMTRYQQKGHHLTFADDEMKSTGSG 426

Query: 1541 WIPSDVDLKPSKDDSNIV-----------------------DVKSPVLFTGNDMPLRFAG 1651
            W+   +  +   D +  V                        V SP L TG D    F G
Sbjct: 427  WLSGALTFEDQGDGTVKVTSPALVTGLDAWFGFDGMHYYSLQVTSPALVTGLDAWFGFDG 486

Query: 1652 MHYMKPLSLARCMEWILLDAFR 1717
            MHY K LS  R +E+I  D+ R
Sbjct: 487  MHYCKLLSPFRALEYIYTDSLR 508


>XP_002601781.1 hypothetical protein BRAFLDRAFT_75999 [Branchiostoma floridae]
            EEN57793.1 hypothetical protein BRAFLDRAFT_75999
            [Branchiostoma floridae]
          Length = 505

 Score =  117 bits (293), Expect = 4e-24
 Identities = 119/496 (23%), Positives = 208/496 (41%), Gaps = 50/496 (10%)
 Frame = +2

Query: 380  IIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLD------PREKVI 541
            ++ P   +G    ++  PGA+    +Y PL + +Q+ S  +LW  + D      P    +
Sbjct: 25   LLRPTNTDGTEVGLIIVPGAYIKGTAYQPLAQTIQDLSPHKLWVGLTDGYVTDLPNPLEL 84

Query: 542  SAGMIEASIGGMITRAKERGFVPGVLSISNVYFAGHSVGS---WLGRSIAKKQTAGFIQM 712
            S+ +          ++ ++  V G +     +   HS+G     +  S   +Q  G +  
Sbjct: 85   SSAI----------QSCKQAMVQGGMKTDVFFIGAHSLGGTFLQMYLSDNPRQAKGMLLW 134

Query: 713  GSCFFSTVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNAV--K 886
            GS   ++ P  +  +  PVLTL G LDG ++L        E   +++   +F++ AV  K
Sbjct: 135  GSYLTNSYP--MATFSVPVLTLNGDLDGLVRLGYSWKKYREFVAIDA---NFSSPAVYQK 189

Query: 887  PVIIIPGMNHAQFSHGI--PNKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAAD 1060
            PV+++PG+NH   + G    N    DL   +++E+A  + A    +F+  +         
Sbjct: 190  PVVVVPGLNHGHIASGAMPSNVLNMDLPAEMTMEQAHRLIANSSVNFMVANSPNNTAFRQ 249

Query: 1061 -QALEILKCGVGQTQKLYRTF--WEALDNQEMQAKHW----QLKVAGLEELTEENIHVTK 1219
             +A++ L+  +  T ++   F    ALD  + Q   W    Q  + G     +  + V K
Sbjct: 250  MEAVKQLRTQMNVTGRILAPFDIVSALD-FDGQTSPWVTTAQESIIGAPAELQNKLTV-K 307

Query: 1220 HDYLENFI---YSKPWIDMKLKRIFVQIY---------CSPSDKYGINNNIWVKMKSSDA 1363
             + ++N +     KP+++     + VQ Y            S+ Y   N +  KMK   A
Sbjct: 308  TEVMDNILDLGDHKPFVEKDGDMVTVQTYTKFDYPLNPIDNSEPYVSTNMLSTKMKRQSA 367

Query: 1364 IKQSFNVGSSQTKIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDD-EMPSLR 1540
            + +    G   + I + K+LN   FQ A +    V   R+Q+ G  L F DD+ +     
Sbjct: 368  VTKELGPGHYNSPI-TCKDLNQMAFQIASTAASTVAMARYQQKGHQLTFADDEMKSTGSG 426

Query: 1541 WIPSDVDLKPSKDD-----------------SNIVDVKSPVLFTGNDMPLRFAGMHYMKP 1669
            W+   +  +   D                  +N + V SP L T  D    F GMHY K 
Sbjct: 427  WLSGALTFEDQGDGTVKIQCIYHKARAITYRTNALQVTSPALVTSLDAWFGFDGMHYCKL 486

Query: 1670 LSLARCMEWILLDAFR 1717
            LS  R +E+I  D+ R
Sbjct: 487  LSPFRALEYIYTDSLR 502


>XP_005646321.1 hypothetical protein COCSUDRAFT_42826 [Coccomyxa subellipsoidea
            C-169] EIE21777.1 hypothetical protein COCSUDRAFT_42826
            [Coccomyxa subellipsoidea C-169]
          Length = 508

 Score =  114 bits (285), Expect = 4e-23
 Identities = 116/483 (24%), Positives = 203/483 (42%), Gaps = 49/483 (10%)
 Frame = +2

Query: 416  LMLFFPGAFTPPVSYFPLIELVQNA--SSCRLWAVVLDPREKVI----------SAGMIE 559
            L++  PGA+  P  Y   I  ++        LW     P  + +          +   + 
Sbjct: 35   LLVLLPGAYMKPDDYKGFIAGLRGCLKGKVALWVAAAHPIWQEVDVKAPDAMQQATERVA 94

Query: 560  ASIGGMITRAKERGFVPGVLS---ISNVYFAGHSVGSWLGRSIAKKQTAGFIQMGSCFF- 727
            A+I  +I RA + GF    L    ++N+    HS  +     +A +     I +GS  F 
Sbjct: 95   AAIDVLIARAHQEGFPAAKLPSGRVTNMVILAHSSAALFAAPLAARLAGSLILLGSYLFP 154

Query: 728  -STVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNAVKPVIIIP 904
             S    +L ++ +PVL LGG LDGQ + + +   A E      + G     A KPVI++P
Sbjct: 155  SSDYHASLREFSRPVLHLGGMLDGQARFSKVAIAALEAAHFAYQAGPMCAAAQKPVILLP 214

Query: 905  GMNHAQFSHGIPNKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAADQALEILKC 1084
            G+NHA  S+G    E  DL   +S E+A    A  IA F+ +H+           E+++ 
Sbjct: 215  GVNHACLSNGHMRPEANDLAAEVSAEDANLQVAGIIADFVMVHMSTNERFVAHPGELVQA 274

Query: 1085 GVGQTQKLYRTFWEALDNQEMQAKHWQLKVAGLEELTEENIHVTKHDYLENFIYSKPWID 1264
                           L ++++Q +  ++    ++      +  + H  +E FI S+P + 
Sbjct: 275  --------------KLASEDVQRRFVRIASRNVKPKPVTRVLSSVHTDIEAFIRSQPTLQ 320

Query: 1265 MKLKRI----FVQIYC---SPS-----DKYGINNNIWVKMKSSDA--------IKQSFNV 1384
            +          + ++C    P+      ++ +     +K+KS++A        I      
Sbjct: 321  LYKGESEPYWLLHVHCYLHRPNLVPFGHRFPVAPQYILKLKSAEALALVYTGQIPDGRED 380

Query: 1385 GSSQTKIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDDEMPS-----LRWIP 1549
            GS   + ++A  +N   +++A+ +V    ++ F   GK L F  D ++       ++WI 
Sbjct: 381  GSLGDQPITAATVNEGMYKEALRIVTRDSRELFLRRGKQLSFPPDRDVSKEIQTPVQWI- 439

Query: 1550 SDVDLKPSKDDSNIVDVKSPVLFT------GNDMP-LRFAGMHYMKPLSLARCMEWILLD 1708
             D+ L+          V SPV+ T      G   P   F G +YMK +SLA  +EWI+ D
Sbjct: 440  KDMPLEFVDVGPKATQVCSPVVMTPAVQTSGKPGPEAAFQGNYYMKIMSLAGAVEWIMCD 499

Query: 1709 AFR 1717
              R
Sbjct: 500  GLR 502


>XP_013062240.1 PREDICTED: uncharacterized protein LOC106051586 isoform X1
            [Biomphalaria glabrata] XP_013062241.1 PREDICTED:
            uncharacterized protein LOC106051586 isoform X1
            [Biomphalaria glabrata]
          Length = 481

 Score =  110 bits (276), Expect = 4e-22
 Identities = 131/487 (26%), Positives = 213/487 (43%), Gaps = 29/487 (5%)
 Frame = +2

Query: 350  SSEDVSNDAIIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDPR 529
            SS   +  ++I+ P++  G  A ++F PGA     +Y      +QNAS  RLW V L   
Sbjct: 14   SSPGDAVSSLIVPPIRPSGEEAAVIFIPGANIKGEAYLKTAAAIQNASPLRLW-VALTGN 72

Query: 530  EKVISAGMIE--ASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAKK-QTAG 700
              + +   +E   ++   I +  + G     +   N     HS+G     + AKK Q   
Sbjct: 73   YSLETPNPVELPKAVENAIKQLSKAG-----MKGDNYTGIAHSLGGVFLSTYAKKSQLKA 127

Query: 701  FIQMGSCFFSTVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNA 880
             + +GS  + +   +   YP PVLTL G LDGQ ++  +   A E  K++      +   
Sbjct: 128  VVLLGS--YLSRETSFKDYPLPVLTLSGELDGQARITRI---AVEYKKLQDIIKSPDAVF 182

Query: 881  VKPVIIIPGMNHAQFSHGI--PNKERGDLDGIISLEEAQSIAAKYIASFITLH------- 1033
              PVI IP +NHAQF+ G+  P   + DL   ++ + A  +  K +++F+T+        
Sbjct: 183  RYPVINIPKINHAQFASGVMPPAVTKYDLTPEVTEDVAHVLIGKQVSNFLTVTFDGPSAM 242

Query: 1034 --LEGKNVAADQALE--------ILKCGVGQTQKLYRTFWEALDNQEMQAK-HWQLKVAG 1180
              LE K    D  ++        +    + +   L  + W  L  + M  +   ++KV  
Sbjct: 243  DVLEAKEAIVDSFVDSGKRFEPLLFVNSMDEVPILLTSPWSVLCQEVMAGQLAPKIKVDN 302

Query: 1181 LEELTEENIHVTKHDYLENFIYSKPWIDMKLK-RIFVQIYCSPSDKYGINNN---IWVKM 1348
            L   TE    V+      N        D+ +K + F+Q   +P D      +   + VK 
Sbjct: 303  LVAPTETIFVVSFPSIARNS------TDLVVKTKSFIQYDSNPLDISTTPESPQEVDVKC 356

Query: 1349 KSSDAIKQSFNVGSSQTKI-VSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDDE 1525
            KS +AI+ + NV +S T    + ++LN      A        Q+R++  G+ L F DD  
Sbjct: 357  KSYEAIQSALNVSASLTAANTTCRDLNELALNIAYLNSRSEAQQRYKSKGRPLTFQDDVT 416

Query: 1526 MPS-LRWIPSDVDLKPSKDDSNIVDVKSPVLFTGNDMPLRFAGMHYMKPLSLARCMEWIL 1702
              S   W  ++  LK  +DDS +  V+S  L      P+ F G  Y K +S  R MEWI 
Sbjct: 417  YKSGFEW--AENPLKLVEDDSGL-HVQSVALRVSLHSPV-FPGDFYCKVISPYRAMEWIN 472

Query: 1703 LDAFR*H 1723
            +D+ R H
Sbjct: 473  VDSLRAH 479


>XP_009051376.1 hypothetical protein LOTGIDRAFT_228186 [Lottia gigantea] ESO97510.1
            hypothetical protein LOTGIDRAFT_228186 [Lottia gigantea]
          Length = 525

 Score =  103 bits (258), Expect = 1e-19
 Identities = 130/506 (25%), Positives = 204/506 (40%), Gaps = 60/506 (11%)
 Frame = +2

Query: 380  IIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVL-DPREKVISAGMI 556
            I+ PLK  G+ A ++  PGA     +Y PL + +Q  S  +LW  +L D    + +   +
Sbjct: 23   ILSPLKTSGVDAALIIVPGADIKGGAYRPLAKHIQETSDLKLWVALLEDFPLNLPNPLQL 82

Query: 557  EASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAK---KQTAGFIQMGSCFF 727
              +I   +++ K  G     +  +NV+ AGHS+G     +  K   K   G I   S  +
Sbjct: 83   NGAISQAVSKIKAAG-----MKTNNVFVAGHSLGGVFVGNYGKSNSKLVKGIILFAS--Y 135

Query: 728  STVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNAVKPVIIIPG 907
             T  + L  YP PVLT+ G LDG  +   + +   E+    ++  D       PVII+ G
Sbjct: 136  LTKGNKLADYPVPVLTVSGDLDGLTRCTRIADTFEELKGDVAKRNDAKYRT--PVIIMTG 193

Query: 908  MNHAQFSHGI--PNKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAADQALEILK 1081
            +NH QF+ G    N    DL   IS   A  + +K  A F+   +     A   A   L 
Sbjct: 194  VNHGQFASGTMPSNVLNYDLASEISATAAFLLISKNTADFMVTSVGTPGSALTSAKSRLD 253

Query: 1082 CGVGQTQKLYRTFWEALDN--QEMQAKHWQLKVAGL-EELTEENIHVTKHDYLE-NFIYS 1249
                +T    +   +   N      + +W +    L   L +  + VT  +  +  F+ S
Sbjct: 254  QAYARTDDFLKPLLDMKRNDVNPQGSSNWTISAQYLISGLDKSVLKVTNKEASQLPFVES 313

Query: 1250 KPWIDMKLKRIFVQIYCSPSDKYGIN-----------NNIWVKMKSSDAIKQSFNVGSSQ 1396
            KP  ++K    +  I       Y  N           +++  K+KS  A+ ++   G   
Sbjct: 314  KP--EVKTASSYESINTHAHLSYANNLMDVSTVMSAPDSLDAKLKSKAAVYKALPKGYKP 371

Query: 1397 -----------TKIVSAKE----------LNAET-----------FQKAISLV----PEV 1468
                        K V  K           L  +T           FQ A SL      ++
Sbjct: 372  PSSANVTCLDINKAVPRKSFVFGCYILLCLQEDTFRAFQHTLLLKFQAAWSLALRNSSKL 431

Query: 1469 FQKRFQE-AGKMLRFLDDDEMPS-LRWIPSDVDLKPSKDDSNIVDVKSPVLFTGND-MPL 1639
              KRFQ+   + ++F  D    + ++W+ S    K +  D   V  +S  L +  D  P 
Sbjct: 432  AVKRFQDRKARAMKFAKDQVFSTGIQWVLSSASWKETTSD---VTFQSTALVSKVDAFPA 488

Query: 1640 RFAGMHYMKPLSLARCMEWILLDAFR 1717
             FAGMHY K LS  R MEWI +D+ R
Sbjct: 489  AFAGMHYCKLLSPYRAMEWIYVDSLR 514


>XP_011683993.1 PREDICTED: uncharacterized protein LOC100890127 [Strongylocentrotus
            purpuratus]
          Length = 259

 Score = 97.4 bits (241), Expect = 6e-19
 Identities = 62/227 (27%), Positives = 115/227 (50%), Gaps = 5/227 (2%)
 Frame = +2

Query: 374  AIIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDPREKVISAGM 553
            A+++EP++++G+ A ++  PGA     +Y PL E +Q  S  +LW  +       I    
Sbjct: 22   AVVVEPVRDQGVEAALIVIPGAEIRGEAYLPLAESIQRESMLKLWVAL---TTDYIGDTP 78

Query: 554  IEASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAKKQ---TAGFIQMGSCF 724
                +   I  A +     G+   + ++FAGHS+G    +S   +    T G + + + +
Sbjct: 79   FPPQLTRAINIALDDLVSTGMPENTPIFFAGHSLGGTFLQSYVSRSPSVTKG-VMLWASY 137

Query: 725  FSTVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNAVKPVIIIP 904
             +   D+L  YP P++ L G LDGQ+K+  +     ++  ++ +  D    A KPVI++ 
Sbjct: 138  LTGRLDDLAAYPTPIMHLSGDLDGQVKITRIAKPFRDLEALQLK--DETALATKPVIVVD 195

Query: 905  GMNHAQFSHGIPNKE--RGDLDGIISLEEAQSIAAKYIASFITLHLE 1039
            G+NH QF+ G P     RGD+    +  EA ++ A+ +  F++ +L+
Sbjct: 196  GVNHFQFASGDPPPAVVRGDISPDATATEAWTLLARVMRDFMSYNLD 242


>XP_011440327.1 PREDICTED: uncharacterized protein LOC105337342 [Crassostrea gigas]
          Length = 492

 Score =  100 bits (249), Expect = 1e-18
 Identities = 112/468 (23%), Positives = 197/468 (42%), Gaps = 21/468 (4%)
 Frame = +2

Query: 377  IIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVL-DPREKVISAGM 553
            ++  P  ++G+   +L  PGA+    +Y  L   +    + RLW V+L D  + +++   
Sbjct: 28   VLKPPSTKQGVEGALLIAPGAYIKGEAYESLGLQIGETCNFRLWVVLLLDFFDDIVNPPQ 87

Query: 554  IEASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAKKQTAGFIQMGSCFFST 733
            ++ ++       KE GF     +   V+ AGHS+G  +     +K       +    + T
Sbjct: 88   LQEAVTKARNSLKEEGFQ----NDGPVFLAGHSLGGTMVSMYGQKSHGLSGVLLYAAYLT 143

Query: 734  VPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNAVKPVIIIPGMN 913
                L  YP PV+TL G LDG  ++  ++    E+Y   +   D       PVI++ G+N
Sbjct: 144  KGHKLKDYPVPVMTLSGDLDGLTRITRVMITFNELYNDVAT--DPRAKYHTPVILMEGVN 201

Query: 914  HAQFSHG-IP-NKERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAADQALEILKCG 1087
            H QF+ G +P N    DL   ++   A    A Y  SF++  L  +N  + Q   +L  G
Sbjct: 202  HGQFASGRMPSNVATHDLPPDVTNTTAYQRIANYTCSFVSYTL--RNNTSSQ--WVLDEG 257

Query: 1088 VGQTQKLYRTF--WEALDNQEMQAKHW----QLKVAGLEELTEENIHVTKHD-YLENFIY 1246
              +T +L +     + LD       +W    Q  V  L+  ++  +   + D  +  F  
Sbjct: 258  FNKTYQLIQPLRRMKELDTNRFSTSNWTQTAQKSVIALQNASQILVKGVEIDGKVIKFSS 317

Query: 1247 SKPWIDMKLKRIFVQIYCS---PSDKYGINNN------IWVKMKSSDAIKQSFNVGSSQT 1399
              P  ++    ++V  Y     P D + +  N      I  KM S +  K  F  G  + 
Sbjct: 318  LSPKTEVLNSTLYVTTYSEVTYPLDPFDVTLNPLSAVQIQAKMISQERAKGLFPQGLYRR 377

Query: 1400 KIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDDEMPS--LRWIPSDVDLKPS 1573
               S K +N  +  +A      V ++R+   G+ +  L++D + S  L W+ S + L   
Sbjct: 378  GNFSCKYVNELSVLEAFYSSSSVARERYARKGRQM-ILEEDMLTSNQLAWLVSQLQLVEF 436

Query: 1574 KDDSNIVDVKSPVLFTGNDMPLRFAGMHYMKPLSLARCMEWILLDAFR 1717
             D  ++   +    +  +      +G  Y   LS  R MEWI +D+ R
Sbjct: 437  SDGLHVQSQRYETTYKPSQKD--SSGFFYCSLLSPFRAMEWIYVDSLR 482


>XP_018017320.1 PREDICTED: uncharacterized protein LOC108673941 [Hyalella azteca]
          Length = 502

 Score = 97.4 bits (241), Expect = 1e-17
 Identities = 116/497 (23%), Positives = 218/497 (43%), Gaps = 28/497 (5%)
 Frame = +2

Query: 311  AHSSSMEHPLPVSSSEDVSNDAIIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNA 490
            A ++  +H    S+ ++ + + II+ PLK+ GI A++L  PGA+     Y PL   +Q  
Sbjct: 19   ASAACQKHQAGSSNIQNKAEEPIILAPLKD-GIEAVLLLVPGAYINAAFYEPLGVAIQTT 77

Query: 491  SSCRLWAVVLDP-REKVISAGMIEASIGGMITRAKERGFVPGVLSISNVYFAGHSVG--- 658
            SS +LW  ++ P    + +       I   ++  +++G V  ++++     AGHS+G   
Sbjct: 78   SSLKLWVGLVRPFVSDLPNPVQCSDDIEKTLSMMRDQGMVTSLIAV-----AGHSLGGVV 132

Query: 659  --SWLGRSIAKKQTAGFIQMGSCFFSTVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAG 832
               W+  ++A       I M S   +     +       L + G LDG   L  +     
Sbjct: 133  MQDWISANLA--NVTSMILMASQLNNA---EVANSSLATLHISGDLDG---LEEITELEP 184

Query: 833  EIYKVESEFGDFNTNAV--KPVIIIPGMNHAQFSHGIPNK--ERGDLDGIISLEEAQSIA 1000
               ++ES     +  AV  KP +++  +NH  F+ G P    +  D+   +S EEA ++ 
Sbjct: 185  TFRRLESSVSQ-DPEAVFRKPTVVLRDVNHMHFASGAPPPLVQSDDILSPLSEEEAHALL 243

Query: 1001 AKYIASFITLHLEGKNVAADQALEILKCGVGQTQKLYRTFWE-----ALDNQEMQAKHWQ 1165
            A ++++F+T  ++        A  +L+     TQ + R   E     +       A   Q
Sbjct: 244  AVHMSAFLTSAMQAPPDEVADARALLQQDFYDTQDIMRPLAEMRELTSTGTLSPFATIGQ 303

Query: 1166 LKVAGLEELTEENIHV--TKHDYLENFIYSKPWIDMKLKRIFVQIY-----CSPSDKYGI 1324
              ++ L+ L   ++ V  T ++ L  F   +P   +      V  Y       P   Y  
Sbjct: 304  QILSNLDPLFYSSLFVNDTSYEDLAPFESHQPVSTLVGDVAQVNTYSLVTDAGPLSSYSS 363

Query: 1325 NNNIWVKMKSSDAIKQSFNVGSSQTKI-----VSAKELNAETFQKAISLVPEVFQKRFQE 1489
               + +K K+SD +K+   V +S   +     V+  ++N E    A+S    V   R++ 
Sbjct: 364  AEELAIKFKNSDDLKK---VLASTDAVFLDSNVTCLDVNQEAINAALSSGGVVVTGRYES 420

Query: 1490 AGK-MLRFLDDDEMPSLRWIPSDVDLKPSKDDSNIVDVKSPVLFTGNDMPLRFAGMHYMK 1666
             G+ +L   D  +     W+ + V+   +  D+ + +V+S  L T   +P  F GM+Y K
Sbjct: 421  RGRPILLRPDSAQQTGPEWLNAPVNY--TLTDAGL-EVQSASLVTAVSVPFGFDGMYYCK 477

Query: 1667 PLSLARCMEWILLDAFR 1717
             +  +R +E+I++D+ R
Sbjct: 478  LMPPSRALEYIMIDSLR 494


>XP_013062242.1 PREDICTED: uncharacterized protein LOC106051586 isoform X2
            [Biomphalaria glabrata]
          Length = 462

 Score = 94.4 bits (233), Expect = 9e-17
 Identities = 122/468 (26%), Positives = 201/468 (42%), Gaps = 29/468 (6%)
 Frame = +2

Query: 350  SSEDVSNDAIIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDPR 529
            SS   +  ++I+ P++  G  A ++F PGA     +Y      +QNAS  RLW V L   
Sbjct: 14   SSPGDAVSSLIVPPIRPSGEEAAVIFIPGANIKGEAYLKTAAAIQNASPLRLW-VALTGN 72

Query: 530  EKVISAGMIE--ASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAKK-QTAG 700
              + +   +E   ++   I +  + G     +   N     HS+G     + AKK Q   
Sbjct: 73   YSLETPNPVELPKAVENAIKQLSKAG-----MKGDNYTGIAHSLGGVFLSTYAKKSQLKA 127

Query: 701  FIQMGSCFFSTVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNA 880
             + +GS  + +   +   YP PVLTL G LDGQ ++  +   A E  K++      +   
Sbjct: 128  VVLLGS--YLSRETSFKDYPLPVLTLSGELDGQARITRI---AVEYKKLQDIIKSPDAVF 182

Query: 881  VKPVIIIPGMNHAQFSHGI--PNKERGDLDGIISLEEAQSIAAKYIASFITLH------- 1033
              PVI IP +NHAQF+ G+  P   + DL   ++ + A  +  K +++F+T+        
Sbjct: 183  RYPVINIPKINHAQFASGVMPPAVTKYDLTPEVTEDVAHVLIGKQVSNFLTVTFDGPSAM 242

Query: 1034 --LEGKNVAADQALE--------ILKCGVGQTQKLYRTFWEALDNQEMQAK-HWQLKVAG 1180
              LE K    D  ++        +    + +   L  + W  L  + M  +   ++KV  
Sbjct: 243  DVLEAKEAIVDSFVDSGKRFEPLLFVNSMDEVPILLTSPWSVLCQEVMAGQLAPKIKVDN 302

Query: 1181 LEELTEENIHVTKHDYLENFIYSKPWIDMKLK-RIFVQIYCSPSDKYGINNN---IWVKM 1348
            L   TE    V+      N        D+ +K + F+Q   +P D      +   + VK 
Sbjct: 303  LVAPTETIFVVSFPSIARNS------TDLVVKTKSFIQYDSNPLDISTTPESPQEVDVKC 356

Query: 1349 KSSDAIKQSFNVGSSQTKI-VSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDDE 1525
            KS +AI+ + NV +S T    + ++LN      A        Q+R++  G+ L F DD  
Sbjct: 357  KSYEAIQSALNVSASLTAANTTCRDLNELALNIAYLNSRSEAQQRYKSKGRPLTFQDDVT 416

Query: 1526 MPS-LRWIPSDVDLKPSKDDSNIVDVKSPVLFTGNDMPLRFAGMHYMK 1666
              S   W  ++  LK  +DDS +  V+S  L      P+ F G  Y K
Sbjct: 417  YKSGFEW--AENPLKLVEDDSGL-HVQSVALRVSLHSPV-FPGDFYCK 460


>XP_013408206.1 PREDICTED: uncharacterized protein LOC106172138 [Lingula anatina]
          Length = 477

 Score = 86.3 bits (212), Expect = 4e-14
 Identities = 110/480 (22%), Positives = 206/480 (42%), Gaps = 32/480 (6%)
 Frame = +2

Query: 380  IIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLD------PREKVI 541
            ++EP   +G    ++  PGA     +Y PL E +Q  +  +LW  ++       P    +
Sbjct: 23   VLEPRYTDGPEFGLVIIPGAEIKGYTYRPLAEKLQVQAPFKLWVGLVGGFFSNTPNPLEL 82

Query: 542  SAGMIEASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAK---KQTAGFIQM 712
              G IE+++G M           G+ + S +Y A HS+G     + A    K  +G +  
Sbjct: 83   PGG-IESALGAMRKA--------GMTAASKIYLAAHSLGGTFLSAYANTNHKNISGILLY 133

Query: 713  GSCFFSTVPDNLVQYPKPVLTLGGALDGQLKLAGMVNHAGEIYKVESEFGDFNTNAVKPV 892
            GS  + T   N+  YP PVLTL G +DG  ++  +      +  + ++  D       PV
Sbjct: 134  GS--YLTRSYNMSAYPVPVLTLSGDMDGLNRITRIQETFQNLQDLYAK--DQAVIYRSPV 189

Query: 893  IIIPGMNHAQFSHGIPNKE--RGDLDGIISLEEAQSIAAKYIASFITLHLEGKN----VA 1054
            I +PG+NH QF+ G+  K   + D+   +S + A  + A     F+    +  +    VA
Sbjct: 190  ITLPGVNHGQFASGVMPKMVLQNDIPAEVSNDYAYEMIANASKYFMIATAKTPSDLVVVA 249

Query: 1055 ADQALEILKCGVGQTQKLYRTFWEALDNQEMQAKHWQLKVAGLEELTE----ENIHVTKH 1222
             +Q  +  +    + Q +Y     ++D+   Q   W   V G + ++     + +HV  +
Sbjct: 250  ENQLKQYFQDNQQRMQPIYAV--RSMDSLG-QTSPW--SVVGQQIISRLIDTQKLHV--Y 302

Query: 1223 DYLENFI---YSKPWIDMKLKRIFVQIYCS---PSDKYGIN------NNIWVKMKSSDAI 1366
            + L N I     +P I  K   + +  Y     P +   ++      + I  KM SS+A+
Sbjct: 303  NQLVNEISLQVDEPNIVAKDGELSITTYTEITYPFNPLDVSFYQQSPSQIQAKMSSSEAV 362

Query: 1367 KQSFNVGSSQTKIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDDEMPSLR-W 1543
             Q +    + ++ +S KE+N      A+SL   + + RF ++G+ +    DD + +   W
Sbjct: 363  -QRYLPNGNFSEPLSCKEVNQAAIYHALSLAAAIPRTRFMKSGRNITITFDDIVSNEEVW 421

Query: 1544 IPSDVDLKPSKDDSNIVDVKSPVLFTGNDMPLRFAGMHYMKPLSLARCMEWILLDAFR*H 1723
            +   + +  +     +  +       G        G ++ K LS  R +E+I +D+ + H
Sbjct: 422  LAEPLRITQTHHGLQVTSI-------GYHPATESNGFYHCKLLSPFRVLEYIYVDSLKPH 474


>XP_005110624.1 PREDICTED: uncharacterized protein LOC101861622 [Aplysia californica]
          Length = 484

 Score = 84.0 bits (206), Expect = 2e-13
 Identities = 119/471 (25%), Positives = 196/471 (41%), Gaps = 25/471 (5%)
 Frame = +2

Query: 380  IIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQNASSCRLWAVVLDPREKVISAGMIE 559
            II P++  G  A ++F PGA     +Y      +Q A++ RLW V L     + +   ++
Sbjct: 26   IIPPIRSSGPEAAVIFVPGASIDGKAYEETGRAIQAATNIRLW-VALTGNYTLNTPNPLQ 84

Query: 560  --ASIGGMITRAKERGFVPGVLSISNVYFAGHSVGSWLGRSIAKKQTA-GFIQMGSCFFS 730
              A++ G   + ++ G     +   N     HS+G     S AK       + +GS  + 
Sbjct: 85   LPAAMRGAFDKLQKAG-----MKGENYVGVAHSLGGVFLPSYAKDSPLKAVVLLGS--YL 137

Query: 731  TVPDNLVQYPKPVLTLGGALDGQLKLAGMV----NHAGEIYKVESEFGDFNTNAVKPVII 898
            T  ++L  YPKP+LTL G LDGQ ++  +         +I K  S    + T    PV+ 
Sbjct: 138  TSGNSLSGYPKPILTLSGELDGQTRITRIALTYKELVNDISKTPSAL--YRT----PVLN 191

Query: 899  IPGMNHAQFSHG-IPN-KERGDLDGIISLEEAQSIAAKYIASFITLHLEGKNVAADQALE 1072
            I G NHAQF+ G +P    R DL   IS  EA     +++++FI++      V  DQA  
Sbjct: 192  IKGSNHAQFASGPMPGIVARYDLKPEISDAEAHQEIGEHVSNFISVTFNLCKV--DQAKA 249

Query: 1073 ILKCGVGQTQKLYRTFW--EALDNQEMQAKHWQLKVAGLEELTEENIHVTKHDYLEN--F 1240
             +          ++     +A+D +   +  W ++         +     K+    N  F
Sbjct: 250  AISGAFTDAGTRFQPLLSVKAMD-KTGDSSPWSVRAQKEVAADLQEQLTVKNKVASNPAF 308

Query: 1241 IYSKPWIDMKLKRIFVQIYC------SPSDKYGINNN---IWVKMKSSDAIKQSFNVGSS 1393
              +KP       ++ V+ Y       +P D   I  +   + VK+KS DAI   F  G  
Sbjct: 309  TINKPEFSRSGDKVSVKTYTLIDYARNPIDVSTIPESPTELSVKLKSYDAI-HLFLTGDR 367

Query: 1394 QT--KIVSAKELNAETFQKAISLVPEVFQKRFQEAGKMLRFLDDDEMPS-LRWIPSDVDL 1564
             T     + K+LN      A        ++R+Q + + +   DD    +   W    + L
Sbjct: 368  STGQDDSTCKQLNELAVSIAFDASTPDAKRRYQASNRPIILADDICTDNGFSWSTQPLKL 427

Query: 1565 KPSKDDSNIVDVKSPVLFTGNDMPLRFAGMHYMKPLSLARCMEWILLDAFR 1717
                D  ++  V   V    +   + F G+ Y K LS  R MEW+ +D+ R
Sbjct: 428  VEKDDGLHVQSVAMKV----STRSILFPGVFYCKLLSPYRAMEWMNVDSLR 474


>XP_001691009.1 hypothetical protein CHLREDRAFT_205590 [Chlamydomonas reinhardtii]
            EDP05455.1 predicted protein [Chlamydomonas reinhardtii]
          Length = 504

 Score = 74.7 bits (182), Expect = 2e-10
 Identities = 67/273 (24%), Positives = 115/273 (42%), Gaps = 43/273 (15%)
 Frame = +2

Query: 377  IIIEPLKEEGIHALMLFFPGAFTPPVSYFPLIELVQ-------------NASSCRLWAVV 517
            II  P        L+   PGAF PP ++  L   +Q             +  +  L  +V
Sbjct: 21   IIPPPNGSADEEVLIAVAPGAFLPPDAFKSLAAEIQACTPHLRTYVGILSIDTLSLMQLV 80

Query: 518  LDPRE-------KVISAGMIEASIGG-----MITRAKERGFVP------GVLSISNVYFA 643
            +DP         +  + GM +  + G     ++ +A   GF P      G + + N    
Sbjct: 81   VDPALLNTPAAFEAFALGMKQGDVYGILLQQLLDQAVAAGFKPRKEGRGGHVRVCNQLLL 140

Query: 644  GHSVGSWLGRSIAKKQTAGFIQMGSCFFSTVPDNLVQYPK-----------PVLTLGGAL 790
              S G  +G   A  + AG    G+   ++ P N  +YPK           P++T+ G L
Sbjct: 141  AQSAGG-MGFPEAAMKLAG----GTVLLASTP-NAEEYPKRRVVSLETCPGPLMTISGEL 194

Query: 791  DGQLKLAGMVNHAGEIYKVESEFGDFNTNAVKPVIIIPGMNHAQFSHGIPNKERGDLD-G 967
            DGQ++    V +  E   + ++FG+       P++++P +NH   S+GI    RGD+  G
Sbjct: 195  DGQMRWPWHVPYIAETAAMATKFGERYVARNAPILVLPNINHGSTSNGIARPVRGDITAG 254

Query: 968  IISLEEAQSIAAKYIASFITLHLEGKNVAADQA 1066
            +   EE   +  ++I +F+T H+     A  +A
Sbjct: 255  VAPYEECIQVLGRHIGAFVTAHMSHSAAARTEA 287

