BLASTX nr result

ID: Ephedra29_contig00013946 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra29_contig00013946
         (1668 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ABR16537.1 unknown [Picea sitchensis]                                 672   0.0  
OAE28622.1 hypothetical protein AXG93_1335s1140 [Marchantia poly...   500   e-169
XP_002984867.1 hypothetical protein SELMODRAFT_423898 [Selaginel...   494   e-166
XP_002985924.1 hypothetical protein SELMODRAFT_446417 [Selaginel...   494   e-166
CEG00779.1 DNA methylase, N-6 adenine-specific, conserved site [...   340   e-108
XP_001422156.1 predicted protein, partial [Ostreococcus lucimari...   333   e-106
XP_002984759.1 hypothetical protein SELMODRAFT_446035 [Selaginel...   333   e-105
XP_001760655.1 predicted protein [Physcomitrella patens] EDQ7439...   338   e-104
XP_002985821.1 hypothetical protein SELMODRAFT_446418 [Selaginel...   331   e-104
XP_007508132.1 predicted protein [Bathycoccus prasinos] CCO20623...   297   1e-89
XP_002500751.1 predicted protein [Micromonas commoda] ACO62009.1...   282   7e-81
XP_005845297.1 hypothetical protein CHLNCDRAFT_137023 [Chlorella...   271   7e-80
KXZ42082.1 hypothetical protein GPECTOR_209g410 [Gonium pectorale]    255   8e-74
XP_002946067.1 hypothetical protein VOLCADRAFT_115653 [Volvox ca...   239   4e-69
XP_005646122.1 putative RNA methylase [Coccomyxa subellipsoidea ...   228   2e-67
GAQ80938.1 hypothetical protein KFL_000660330 [Klebsormidium fla...   238   2e-66
OAI46250.1 hypothetical protein AYO44_11755 [Planctomycetaceae b...   227   1e-65
XP_001697211.1 hypothetical protein CHLREDRAFT_175931 [Chlamydom...   228   7e-65
OFW33426.1 RNA methyltransferase [Actinobacteria bacterium GWC2_...   221   3e-63
WP_041969657.1 RNA methyltransferase [Geobacter sp. OR-1]             218   5e-62

>ABR16537.1 unknown [Picea sitchensis]
          Length = 532

 Score =  672 bits (1733), Expect = 0.0
 Identities = 320/491 (65%), Positives = 395/491 (80%), Gaps = 10/491 (2%)
 Frame = +3

Query: 57   LLSHTSHHNSPTQKPKRKNLSSTFVVQNILPSLTTNNSQSAT------IFPSQKQEAPTT 218
            L S  + ++S TQ    KNL    +   I P+  T  +Q  T      +  S        
Sbjct: 31   LSSPQNSYSSTTQT--HKNL---LIKSQIAPAPVTGPAQPKTLEAPANVLKSHSHLHSDK 85

Query: 219  TSNTNNPCKFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLR 398
             S+    CKFFATC+PGLEQVVAAEL SP+IGA G++EG GGVYFQGT++TGY ANLWLR
Sbjct: 86   ISSKTEQCKFFATCAPGLEQVVAAELGSPLIGATGIKEGSGGVYFQGTLSTGYSANLWLR 145

Query: 399  SAIRVLHELSYGELSRKDRDCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQS 578
              IRVL ELSY E+ +KDRDCVY F+R +VDWP Y+AS T    ++G+K+W FR+FAVQS
Sbjct: 146  CGIRVLLELSYAEMPKKDRDCVYNFVRNAVDWPQYLASSTL--KKSGYKRWKFRTFAVQS 203

Query: 579  RVWDCIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAII 758
            RVWDC QV+NSMSAS+R KDAICD+IRDACNN++P PPEEGGA +DVPLFLSLY+++A++
Sbjct: 204  RVWDCTQVTNSMSASIRTKDAICDSIRDACNNKRPDPPEEGGAKADVPLFLSLYRDKAVL 263

Query: 759  YRDMSGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNG----MDDVV 926
            Y+DMSG+SLHRRGYR  MHRASL+EA+AAG+LT+AG+N+RV G GV NKNG    ++++V
Sbjct: 264  YKDMSGVSLHRRGYRDVMHRASLSEAVAAGILTLAGWNTRVQGLGVANKNGGLESLENMV 323

Query: 927  LLDPMCGSGTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSATSAATEVPN 1106
            LLDPMCGSGTFLIEAALMA+NRAPGL R  WPFKSWHD+D+VSW +C ++A+SAAT +P 
Sbjct: 324  LLDPMCGSGTFLIEAALMAANRAPGLTRKTWPFKSWHDFDSVSWKECWQNASSAATPIPP 383

Query: 1107 HLKILGNDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPPWGFRLG 1286
             L++LGNDKHEGALSLC RDAEAAGVK+ LEL+CKDCRDY PS  P+LV+VNPPWGFRLG
Sbjct: 384  GLRLLGNDKHEGALSLCARDAEAAGVKEVLELSCKDCRDYSPSVTPTLVVVNPPWGFRLG 443

Query: 1287 NSRENENEDVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPISMGGLEC 1466
            N  EN++E V STW+ALGQF+KQ+CNNADMY+ SG+ + +RELR+KAD+KWPI+MGG+EC
Sbjct: 444  NEGENDHEGVESTWEALGQFSKQHCNNADMYILSGKSTATRELRMKADKKWPITMGGVEC 503

Query: 1467 RLLHYFVLPPK 1499
            RLLHY+VLP K
Sbjct: 504  RLLHYYVLPSK 514


>OAE28622.1 hypothetical protein AXG93_1335s1140 [Marchantia polymorpha subsp.
            polymorpha]
          Length = 559

 Score =  500 bits (1288), Expect = e-169
 Identities = 240/430 (55%), Positives = 314/430 (73%), Gaps = 11/430 (2%)
 Frame = +3

Query: 243  KFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAIRVLHE 422
            +FFATC+PGLE++V AEL SPMIGA  V+ G  GV F GTM TG  ANLW RSAIR+L E
Sbjct: 114  RFFATCAPGLEEIVFAELCSPMIGARSVQIGSAGVAFCGTMATGVRANLWSRSAIRILVE 173

Query: 423  LSYGELSRKDR-DCVYTFIRESVDWPTYIASCTT---------IDNRTGFKKWNFRSFAV 572
            L+ G L R+ R D VY F+R++ DWPT +   ++         +  + G     FR+F+V
Sbjct: 174  LATGPLPRRRRADPVYEFVRDAADWPTLLVDESSDSPSSPPQALSRKRGLVP-KFRTFSV 232

Query: 573  QSRVWDCIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRA 752
             SRV+DC  VSNSM AS RAKDAICDA++DAC   +P PP +G A +DVPLF SLY++R 
Sbjct: 233  NSRVYDCEGVSNSMFASTRAKDAICDAVKDACGGSRPDPPADGVASADVPLFFSLYRDRG 292

Query: 753  IIYRDMSGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKN-GMDDVVL 929
            I+YRDMSG+SLH+RGYR  MHRA LNEA+AA +LT+AG+NS +PGFG+ N+N G +  VL
Sbjct: 293  ILYRDMSGVSLHKRGYRDVMHRAGLNEAIAAAMLTIAGWNSHLPGFGLANRNAGSNSKVL 352

Query: 930  LDPMCGSGTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSATSAATEVPNH 1109
            LDPMCGSGT LIEAALMA N APGLMR  WPF++WHD+D      C + A +A    P  
Sbjct: 353  LDPMCGSGTLLIEAALMAINSAPGLMRNLWPFETWHDFDPSILEACRQEAIAAEVRAPEG 412

Query: 1110 LKILGNDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPPWGFRLGN 1289
            L++LGND HEGALSLC RDA+AA V   LEL+CKDC+DY P   PSLV+VNPPWG RL  
Sbjct: 413  LRLLGNDIHEGALSLCERDAKAANVLSMLELSCKDCKDYSPRLSPSLVVVNPPWGARLEP 472

Query: 1290 SRENENEDVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPISMGGLECR 1469
            +   +++ +++TW+ LG+F K +CN  D+YV SG  +++  + +KAD++W +++GG++CR
Sbjct: 473  NSGGDDDTLLTTWRNLGRFLKSSCNETDVYVLSGNSNVTHAMHMKADKRWAVTVGGVDCR 532

Query: 1470 LLHYFVLPPK 1499
            ++HY+VLPPK
Sbjct: 533  IMHYYVLPPK 542


>XP_002984867.1 hypothetical protein SELMODRAFT_423898 [Selaginella moellendorffii]
            EFJ14117.1 hypothetical protein SELMODRAFT_423898
            [Selaginella moellendorffii]
          Length = 588

 Score =  494 bits (1272), Expect = e-166
 Identities = 243/430 (56%), Positives = 310/430 (72%), Gaps = 6/430 (1%)
 Frame = +3

Query: 243  KFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAIRVLHE 422
            +FFATC+PGLE VVAAEL +  IGA  V+ G  GVYF G+  TG++ANLW R A+RVL  
Sbjct: 73   RFFATCAPGLEDVVAAELRASSIGAHSVKTGSSGVYFSGSWHTGFLANLWSRCAVRVLQL 132

Query: 423  LSYGELSRKDR-----DCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRVW 587
            ++  +L R  R     D VY F++++VDW T + +           +   RSF++ SRVW
Sbjct: 133  IAAADL-RSSRYGQRSDPVYNFVKDAVDWKTLVVAGN---------RGKLRSFSIHSRVW 182

Query: 588  DCIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRD 767
            DC +VSN+M A  RAKDAICDA+RD C  ++P PP+   A +D+PLFLSLY+++A++YRD
Sbjct: 183  DCSEVSNTMVACTRAKDAICDALRDCCGGKRPDPPDAYDA-ADLPLFLSLYRDKALLYRD 241

Query: 768  MSGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSR-VPGFGVVNKNGMDDVVLLDPMC 944
            MSG SLH RGYR AMH+ASLNEA+AAG+LTMAG+N + VPGFG  NKN     VL+DPMC
Sbjct: 242  MSGTSLHMRGYRDAMHKASLNEAIAAGILTMAGWNDKFVPGFGTFNKNVGGGRVLMDPMC 301

Query: 945  GSGTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSATSAATEVPNHLKILG 1124
            GSGTFLIEAALMA N APGL+R  WPF  WHDYD + W  CCK AT A  + P  L++LG
Sbjct: 302  GSGTFLIEAALMALNIAPGLLRPRWPFMKWHDYDKMEWKLCCKEATEAQVKAPRDLQLLG 361

Query: 1125 NDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPPWGFRLGNSRENE 1304
            ND HEG+L LC RDA+ AGV+  L L+C+DCR YVP   PSLV VNPPWG RL      E
Sbjct: 362  NDLHEGSLLLCTRDAKRAGVEHLLRLSCEDCRRYVPPVCPSLVTVNPPWGNRL----NEE 417

Query: 1305 NEDVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPISMGGLECRLLHYF 1484
               +I+T+ ALG+F KQ  + AD+YV SG  +L+R+L++KADRKWPI++GG +CRLLHY+
Sbjct: 418  EPSLIATYMALGRFLKQYSSRADVYVLSGNSTLTRQLQMKADRKWPITVGGFDCRLLHYY 477

Query: 1485 VLPPKPDSLK 1514
            +LPPK  S++
Sbjct: 478  ILPPKTSSVE 487


>XP_002985924.1 hypothetical protein SELMODRAFT_446417 [Selaginella moellendorffii]
            EFJ13101.1 hypothetical protein SELMODRAFT_446417
            [Selaginella moellendorffii]
          Length = 588

 Score =  494 bits (1271), Expect = e-166
 Identities = 243/429 (56%), Positives = 309/429 (72%), Gaps = 6/429 (1%)
 Frame = +3

Query: 243  KFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAIRVLHE 422
            +FFATC+PGLE VVAAEL +  IGA  V+ G  GVYF G+  TG++ANLW R A+RVL  
Sbjct: 73   RFFATCAPGLEDVVAAELRASSIGAHSVKTGSSGVYFSGSWHTGFLANLWSRCAVRVLQL 132

Query: 423  LSYGELSRKDR-----DCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRVW 587
            ++  +L R  R     D VY F++++VDW T + +           +   RSF++ SRVW
Sbjct: 133  IAAADL-RSSRYGQRSDPVYNFVKDAVDWKTLVVAGN---------RGKLRSFSIHSRVW 182

Query: 588  DCIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRD 767
            DC +VSN+M A  RAKDAICDA+RD C  ++P PP+   A +D+PLFLSLY+++A++YRD
Sbjct: 183  DCSEVSNTMVACTRAKDAICDALRDCCGGKRPDPPDAYDA-ADLPLFLSLYRDKALLYRD 241

Query: 768  MSGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSR-VPGFGVVNKNGMDDVVLLDPMC 944
            MSG SLH RGYR AMH+ASLNEA+AAG+LTMAG+N + VPGFG  NKN     VL+DPMC
Sbjct: 242  MSGTSLHMRGYRDAMHKASLNEAIAAGILTMAGWNDKFVPGFGTFNKNVGGGRVLMDPMC 301

Query: 945  GSGTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSATSAATEVPNHLKILG 1124
            GSGTFLIEAALMA N APGL+R  WPF  WHDYD + W  CCK AT A  + P  L++LG
Sbjct: 302  GSGTFLIEAALMALNIAPGLLRPRWPFMKWHDYDKMEWKLCCKEATEAQVKAPRDLQLLG 361

Query: 1125 NDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPPWGFRLGNSRENE 1304
            ND HEG+L LC RDA+ AGV+  L L+C+DCR YVP   PSLV VNPPWG RL      E
Sbjct: 362  NDLHEGSLLLCTRDAKRAGVEHLLRLSCEDCRRYVPPVCPSLVTVNPPWGNRL----NEE 417

Query: 1305 NEDVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPISMGGLECRLLHYF 1484
               +I+T+ ALG+F KQ  + AD+YV SG  +L+R+L++KADRKWPI++GG +CRLLHY+
Sbjct: 418  EPSLIATYMALGRFLKQYSSRADVYVLSGNSTLTRQLQMKADRKWPITVGGFDCRLLHYY 477

Query: 1485 VLPPKPDSL 1511
            +LPPK  S+
Sbjct: 478  ILPPKTSSV 486


>CEG00779.1 DNA methylase, N-6 adenine-specific, conserved site [Ostreococcus
            tauri]
          Length = 466

 Score =  340 bits (873), Expect = e-108
 Identities = 197/458 (43%), Positives = 276/458 (60%), Gaps = 38/458 (8%)
 Frame = +3

Query: 243  KFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAIRVLHE 422
            +F+ATC PGLE+VVA EL+SPMI A  VR GK GV F GT   GY ANLWLRSA+RVL E
Sbjct: 3    RFYATCHPGLEEVVARELASPMIDAREVRVGKSGVSFVGTQRVGYDANLWLRSAVRVLVE 62

Query: 423  LSYGELSR--KDRDCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRVWDCI 596
            L  G L       + VY FI+ +  W   +       +R G       +FAV++RVWDC 
Sbjct: 63   LKRGYLDPGVSGTESVYEFIKRAAPWEEVVPGV----SRAG----EALTFAVETRVWDCS 114

Query: 597  QVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRDMSG 776
            Q+++S +A +R KDA+CDA+ DA   R P PP    A +DVPL+++LY++  I+YRDMSG
Sbjct: 115  QITSSHAAKIRVKDAVCDALVDATGTR-PQPPINYAA-ADVPLYVTLYRDEVIVYRDMSG 172

Query: 777  ISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMDDVVLLDPMCGSGT 956
             SLHRRGYR+AMHRASL+E+ AAG+L++AG+   +  +   + +G    VL+DPMCGSGT
Sbjct: 173  ESLHRRGYRNAMHRASLSESAAAGMLSLAGWPDMLEQWR-RDPDGTPAPVLIDPMCGSGT 231

Query: 957  FLIEAALMASNRAPGLMRTH---WPFKSWHDYDAVSWNDCCKSATSAATEV----PNHLK 1115
            FLIEAALMA+N APGL+R     + F+ W D+D   ++ C + A    +E      +   
Sbjct: 232  FLIEAALMAANVAPGLVRAETIGYAFERWPDHDQRMFDACLEDARRIGSETRAACASPPV 291

Query: 1116 ILGNDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDY-VPS-FHPS---LVIVNPPWGFR 1280
            I+GND H GA++L  + AE A V   +E+   +C  + +PS   P    +++ NPPWG R
Sbjct: 292  IIGNDIHPGAVTLAGQGAETARVSGMVEIFRNNCETFRLPSKIDPEARRVIVTNPPWGKR 351

Query: 1281 LGNSRENENED------------------------VISTWQALGQFAKQNCNNADMYVFS 1388
            +G   + E +D                        +   W +LG F K+ C +   +V S
Sbjct: 352  IGAGGDAEGDDGYAGDGEFDRDTTVNSNGVSAEEALEQCWNSLGVFLKRECPDTSAFVLS 411

Query: 1389 GEQSLSRELRLKADRKWPISMGGLECRLLHYFVLPPKP 1502
            G  ++SR +R++A RK  + +GG++CRLL Y +LPPKP
Sbjct: 412  GNPAVSRAIRMRASRKHVVGIGGVDCRLLRYDILPPKP 449


>XP_001422156.1 predicted protein, partial [Ostreococcus lucimarinus CCE9901]
            ABP00473.1 predicted protein, partial [Ostreococcus
            lucimarinus CCE9901]
          Length = 419

 Score =  333 bits (853), Expect = e-106
 Identities = 186/432 (43%), Positives = 264/432 (61%), Gaps = 19/432 (4%)
 Frame = +3

Query: 243  KFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAIRVLHE 422
            +F+ATC PGLE VVA EL+SPMI A  +  GK GV F GT   GY AN+WLRSA+RVL E
Sbjct: 1    RFYATCHPGLEDVVARELASPMINASDIVIGKSGVSFTGTQRVGYDANVWLRSAVRVLVE 60

Query: 423  LSYGELSR--KDRDCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRVWDCI 596
            L  G L         VY F++ +V W   I          G ++ +   F V++R+WDC 
Sbjct: 61   LKRGYLDPYVSGTQSVYEFVKHAVPWEEVIP---------GGERGDGLKFGVETRLWDCS 111

Query: 597  QVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRDMSG 776
            Q+++S +A +R KDA+CDA+ DA   R PSPP+   A +DVPL+++LY++  I+YRDMSG
Sbjct: 112  QITSSHAAKIRVKDAVCDALVDATGTR-PSPPDNYAA-ADVPLYVTLYRDEIIMYRDMSG 169

Query: 777  ISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMDDVVLLDPMCGSGT 956
             SLHRRGYR+AMHRASL+E+ AAG+L++AG+   +  +   +  G    VL+DPMCGSGT
Sbjct: 170  ESLHRRGYRNAMHRASLSESAAAGMLSLAGWPDMLEEW-QRDPAGSPPPVLIDPMCGSGT 228

Query: 957  FLIEAALMASNRAPGLMRTH---WPFKSWHDYDAVSWNDCCKSA----TSAATEVPNHLK 1115
            FLIE ALMA+N APGL+R+    + F+ W D++  ++ +C ++A        T VP    
Sbjct: 229  FLIEGALMAANVAPGLIRSETIGYAFERWPDHNPRTFEECLENARRIGAETRTAVPTPPV 288

Query: 1116 ILGNDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDY----------VPSFHPSLVIVNP 1265
            I+GND H GA++L  + +E A V   +++   DC  +          +    P LV+ NP
Sbjct: 289  IIGNDIHPGAVTLAGQGSETARVAGMIDVFRNDCEMFNITASSAGAKIAPNAPKLVVTNP 348

Query: 1266 PWGFRLGNSRENENEDVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPI 1445
            PWG R+G         V   W +LG F K+ C ++  +V SG   +SR +R++A RK  +
Sbjct: 349  PWGKRIGAGEPAGRRRV---WNSLGVFLKRECPDSSAFVLSGNPEVSRAIRMRASRKHVV 405

Query: 1446 SMGGLECRLLHY 1481
             +GG++CRLL Y
Sbjct: 406  GIGGVDCRLLRY 417


>XP_002984759.1 hypothetical protein SELMODRAFT_446035 [Selaginella moellendorffii]
            EFJ14009.1 hypothetical protein SELMODRAFT_446035
            [Selaginella moellendorffii]
          Length = 458

 Score =  333 bits (855), Expect = e-105
 Identities = 195/459 (42%), Positives = 262/459 (57%), Gaps = 8/459 (1%)
 Frame = +3

Query: 147  PSLTTNNSQSATIFPSQKQEAPTTTSNTNNPCK---FFATCSPGLEQVVAAELSSPMIGA 317
            PS    N   A +    K  A   +     P K   F ATC+PGLE VVAAEL +P I A
Sbjct: 8    PSYWKENGAQARVVLEGKPRASRLSDPLAPPRKILSFLATCAPGLEAVVAAELQAPAIRA 67

Query: 318  --LGVREGKGGVYFQGTMTTGYMANLWLRSAIRVLHELSYGELSRKDR---DCVYTFIRE 482
              + + E  G V+F GT    + A LWLR   RV+H ++   L   D    D VY F+R 
Sbjct: 68   ANVAIAESGGAVFFSGTWIVAFNAILWLRCGSRVMHLIASANLPWADAIGIDPVYQFVRY 127

Query: 483  SVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRVWDCIQVSNSMSASVRAKDAICDAIRD 662
            +V W  ++AS     N  G K+  FRSF+V+ RV DC + S S+ A  +A  AI DA+  
Sbjct: 128  AVHWKRFLAS-----NDDGHKR--FRSFSVECRVRDCTRRSVSLYAPRKANAAINDALDL 180

Query: 663  ACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRDMSGISLHRRGYRSAMHRASLNEALA 842
              N     P  +  A  +VPLFL ++K++A +YRDMS   L  R Y+  + + SL   +A
Sbjct: 181  EFNPDVLEPHLKEPA--EVPLFLLIHKDKARLYRDMSS-DLGERSYKEVLDKTSLPGEIA 237

Query: 843  AGLLTMAGFNSRVPGFGVVNKNGMDDVVLLDPMCGSGTFLIEAALMASNRAPGLMRTHWP 1022
            AG+LT+AG+N  VPGFG +N+N     VLLDPMCG GT L+EAALM+SN APGL+   WP
Sbjct: 238  AGVLTLAGWNYAVPGFGEINRNA--PTVLLDPMCGCGTLLVEAALMSSNTAPGLLCKSWP 295

Query: 1023 FKSWHDYDAVSWNDCCKSATSAATEVPNHLKILGNDKHEGALSLCIRDAEAAGVKDSLEL 1202
            F  WHD+D   W +C   A++A   VP+  K LGND+ +  +S C R AE AGV   L+L
Sbjct: 296  FTDWHDFDEELWQECRARASAAMLAVPSSHKFLGNDRDQKVISACARAAERAGVAHLLQL 355

Query: 1203 TCKDCRDYVPSFHPSLVIVNPPWGFRLGNSRENENEDVISTWQALGQFAKQNCNNADMYV 1382
            + +D   Y P   PSLV+VNPPW        +     ++ T+  LG+F +++C   D YV
Sbjct: 356  SSQDFVQYEPPTRPSLVVVNPPW------ESKERTTTLVPTFSELGRFLRRHCRETDAYV 409

Query: 1383 FSGEQSLSRELRLKADRKWPISMGGLECRLLHYFVLPPK 1499
             SG + L+  +RLKAD  WPI + GL  RL HY++LP K
Sbjct: 410  LSGSRLLASNMRLKADVNWPIKIQGLHFRLCHYYILPKK 448


>XP_001760655.1 predicted protein [Physcomitrella patens] EDQ74394.1 predicted
            protein [Physcomitrella patens]
          Length = 698

 Score =  338 bits (868), Expect = e-104
 Identities = 194/451 (43%), Positives = 264/451 (58%), Gaps = 31/451 (6%)
 Frame = +3

Query: 228  TNNPCKFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAI 407
            + N   F+ATC  GLEQ++AAELSSP+I A  V    GGV+F+GT +TGY ANLWLR+  
Sbjct: 87   SGNLQSFYATCEKGLEQILAAELSSPLINASQVETDSGGVFFRGTQSTGYNANLWLRTGD 146

Query: 408  RVLHELSYGELSR-KDR-DCVYTFIRESVDWPTYIAS-----CTTIDNRTGFK---KWNF 557
            RVL EL+   L + K R D +Y F+RE+ DWP  +         T+ + T  +   ++ F
Sbjct: 147  RVLCELARCLLPQGKSRFDLLYEFVREAADWPLLLVDDSAPLARTLQSETAERLPNRYKF 206

Query: 558  RSFAVQSRV--WDCIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFL 731
            + F VQ R+  W   +  N +SASV   +AI DA+RD+C  + P+P E    ++ VPLFL
Sbjct: 207  KKFIVQIRLSNWKSTKDQNYVSASV--SNAIWDALRDSCVGQWPAPAERED-LTAVPLFL 263

Query: 732  SLYKNRAIIYRDMSGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNG 911
             + ++ A +YRDMSG+SLHRRGYR  + +   NE LAA +LT+AG+N  V GFG  NKN 
Sbjct: 264  DVNEDTAFLYRDMSGVSLHRRGYRDVIDKTKPNEGLAAAILTLAGWNHNVHGFGAANKND 323

Query: 912  MD-DVVLLDPMCGSGTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSATSA 1088
               D VLLDP CG+GT LIEAALMA   APGL R HWPF++WHDYD  +W +C  +A S 
Sbjct: 324  SGKDRVLLDPFCGTGTILIEAALMAYEIAPGLFRPHWPFQTWHDYDPRAWTECRDAAASV 383

Query: 1089 ATEVPNHLKILGNDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPP 1268
                 +  +++GND ++GA++ C R A AAGV   LEL+ +  R Y P   PSLV+ NP 
Sbjct: 384  QALPVSGARLIGNDMNDGAITRCKRAARAAGVLHLLELSSETSRHYKPPVIPSLVVTNPS 443

Query: 1269 WGFRLG-NSRENENEDVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPI 1445
                L  +S   E+E +++ W   GQ  K  C  AD+YV S    L+  + +  D KW +
Sbjct: 444  CSAPLTYSSLRGEDERLLTLWLGFGQLLKSQCRGADVYVLSENDWLAHAMHMVPDSKWDL 503

Query: 1446 S-----------------MGGLECRLLHYFV 1487
                               G  E +LLH+ V
Sbjct: 504  QPNKQWRGNHTGRKTDKIFGSQERKLLHFHV 534


>XP_002985821.1 hypothetical protein SELMODRAFT_446418 [Selaginella moellendorffii]
            EFJ12998.1 hypothetical protein SELMODRAFT_446418
            [Selaginella moellendorffii]
          Length = 513

 Score =  331 bits (848), Expect = e-104
 Identities = 187/423 (44%), Positives = 253/423 (59%), Gaps = 5/423 (1%)
 Frame = +3

Query: 246  FFATCSPGLEQVVAAELSSPMIGA--LGVREGKGGVYFQGTMTTGYMANLWLRSAIRVLH 419
            F ATC+PGLE VVAAEL +P I A  + + E  G V+F GT    + A LWLR   RV+H
Sbjct: 99   FLATCAPGLEAVVAAELQAPAIRAANVAIAESGGAVFFSGTWIVAFNAILWLRCGSRVMH 158

Query: 420  ELSYGELSRKDR---DCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRVWD 590
             ++   L   D    D VY F+R +V W  ++A+     N  G K+  FRSF+V+ RV D
Sbjct: 159  LIASANLPWADAIGIDPVYQFVRYAVHWKRFLAN-----NDDGHKR--FRSFSVECRVRD 211

Query: 591  CIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRDM 770
            C + S S+ A  +A  AI DA+    N     P  +  A  +VPLFL ++K++A +YRDM
Sbjct: 212  CTRRSVSLYAPRKANAAINDALDLEFNPDVLEPHLKEPA--EVPLFLLIHKDKARLYRDM 269

Query: 771  SGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMDDVVLLDPMCGS 950
            S  +L  R Y+  + + SL   +AAG+LT+AG+N  VPGFG +N+N     VLLDPMCG 
Sbjct: 270  SS-NLGERSYKEVLDKTSLPGEIAAGVLTLAGWNYAVPGFGEINRNA--PTVLLDPMCGC 326

Query: 951  GTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSATSAATEVPNHLKILGND 1130
            GT LIEAALM+SN APGL+R  WPF  WHD+D   W +C   A++A   VP+  K LGND
Sbjct: 327  GTLLIEAALMSSNTAPGLLRKSWPFTDWHDFDEELWQECRARASAAMLAVPSSHKFLGND 386

Query: 1131 KHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPPWGFRLGNSRENENE 1310
            + +  +S C R AE AGV   L+L+ +D   Y P   PSLV+VNPPW        +    
Sbjct: 387  RDQKVISACARAAERAGVAHLLQLSSQDFVQYEPPTRPSLVVVNPPW------ESKERTT 440

Query: 1311 DVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPISMGGLECRLLHYFVL 1490
             ++ T+  LG+F +++C   D YV SG + L+  +RLKAD  WPI + GL  RL  Y++L
Sbjct: 441  TLVPTFSELGRFLRRHCRETDAYVLSGSRLLASNMRLKADVNWPIKIQGLHFRLCRYYIL 500

Query: 1491 PPK 1499
            P K
Sbjct: 501  PKK 503


>XP_007508132.1 predicted protein [Bathycoccus prasinos] CCO20623.1 predicted protein
            [Bathycoccus prasinos]
          Length = 603

 Score =  297 bits (760), Expect = 1e-89
 Identities = 207/565 (36%), Positives = 294/565 (52%), Gaps = 85/565 (15%)
 Frame = +3

Query: 63   SHTSHHNSPTQKPKRKNL-SSTFVVQNILPSLTTNNSQSATIFPS---QKQEA-PTTTSN 227
            S ++   +P Q+  R+ L SS+   +N L +  ++NS +         Q QEA P     
Sbjct: 48   STSTTQRAPRQREGRERLFSSSETSRNPLNAQKSSNSSNRRRRGEDTYQDQEAAPERAPV 107

Query: 228  TNNPCKFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAI 407
            +    +FF TC PGLE+VVA ELSS  I A  V  G  GV F GT+ T Y AN+WLRS  
Sbjct: 108  SKGMSRFFVTCHPGLEEVVAKELSSKEIRAQHVEIGASGVSFVGTLETAYNANIWLRSGT 167

Query: 408  RVLHELSYGELSRKDR--DCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSR 581
            RVL EL+ G+L   +   D VY F++  V W   + +                +F++++R
Sbjct: 168  RVLCELASGDLDPLESGFDSVYDFVKHCVPWQEVLINAEL-------------TFSIEAR 214

Query: 582  VWDCIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEE---GGAMSDVPLFLSLYKNRA 752
            VW   Q+S++  A  RAKDAICD I DAC   +P  P +       +DVPLF++LYK+RA
Sbjct: 215  VWSNSQISSTKLACTRAKDAICDYISDACGGVRPRDPRDFRGNKVKADVPLFMTLYKDRA 274

Query: 753  IIYRDMSGISLHRRGYRS--AMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMDDV- 923
             +YRD SG SLHRRGYRS  ++HRA+LNEA AAGLL +AG+   +  +    +   DD+ 
Sbjct: 275  TLYRDTSGDSLHRRGYRSNLSVHRAALNEAAAAGLLHIAGWPEALDEWRSQREED-DDIS 333

Query: 924  --VLLDPMCGSGTFLIEAALMASNRAPGLMRTH---WPFKSWHDYDAVSWNDCCKSATSA 1088
              V +DPMCGSGT ++EAALMA N APGL+R     + F++W D++   + DC  +A   
Sbjct: 334  PPVFIDPMCGSGTMVVEAALMAMNVAPGLVRYKNGGYAFQNWPDFNEDVFLDCIDNAEKK 393

Query: 1089 A---------TEVPNHLKILGNDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDYV--PS 1235
                      ++  N +  LGND H G++SL  R A AAGV   +++      ++V  PS
Sbjct: 394  RIAEEEFHQFSDQKNKVTCLGNDIHPGSVSLAQRSALAAGVPHVVKVYQSSVDEWVVPPS 453

Query: 1236 FHPS---LVIVNPPWGFRLG----------------------------NSRENENE---- 1310
               S    +  NPPWG R+                             N+ + +N+    
Sbjct: 454  LLESKRRSICTNPPWGKRISLDSRMSDVNRNRSGSSNRDSSYGGENGYNNEDQDNDEERW 513

Query: 1311 ----------------DVIST-----WQALGQFAKQNCNNADMYVFSGEQSLSRELRLKA 1427
                            DV ST     W+ LG F K+   N   +V SG+ S+SRE+ ++A
Sbjct: 514  GGGDGGGDDYNEILPGDVNSTEAGDAWRKLGIFLKREMPNQSAFVLSGDPSISREIYMRA 573

Query: 1428 DRKWPISMGGLECRLLHYFVLPPKP 1502
             RK  + +GG++ RLL Y +LPPKP
Sbjct: 574  SRKHVLGIGGVDTRLLRYDILPPKP 598


>XP_002500751.1 predicted protein [Micromonas commoda] ACO62009.1 predicted protein
            [Micromonas commoda]
          Length = 1009

 Score =  282 bits (722), Expect = 7e-81
 Identities = 191/544 (35%), Positives = 266/544 (48%), Gaps = 112/544 (20%)
 Frame = +3

Query: 204  EAPTTTSNTNNPCKFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMA 383
            E P   S++     ++ATC PGLE VVA EL S +IGA  VR G  GV F+G    GY A
Sbjct: 468  ERPRRASSSTALGDYYATCHPGLEDVVAKELESELIGASDVRVGASGVSFRGDARVGYRA 527

Query: 384  NLWLRSAIRVLHELSYGELSRK--DRDCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNF 557
            N+WLR AIRVL EL  G +         +Y F+R++  W   I +   +           
Sbjct: 528  NVWLRCAIRVLCELDRGYIDPNVPGGAAIYDFVRDAAPWHEVIPADDGL----------- 576

Query: 558  RSFAVQSRVWDCIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSL 737
             +F+V+SRV  C  V+++  AS RAKDAICDA+ D  N  +P PP+ G + +DVPL+LSL
Sbjct: 577  -TFSVESRVRSCTDVTSTRLASTRAKDAICDALVDV-NGWRPPPPQFGHSSADVPLYLSL 634

Query: 738  YKNRAIIYRDMSGISLHRRGYR-SAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGM 914
            +++ A +YRDMSG SLHRRGYR +A+HRA+LNEA AAG+L++AG+++         + G+
Sbjct: 635  FRDEAKLYRDMSGESLHRRGYRDAAIHRAALNEAAAAGVLSLAGWSAACDR---ARERGL 691

Query: 915  DDVVLLDPMCGSGTFLIEAALMASNRAPGLMRTH-----------------------WPF 1025
                L+DPMCGSGT LIE A+MA   APGL+R                         + F
Sbjct: 692  VLPALVDPMCGSGTLLIEGAMMAGRVAPGLIRVDAAGGFGKGGDGGRDPSPGVQRPAFAF 751

Query: 1026 KSWHDYDAVSWNDCCKSA----TSAATEVPNHLKILGNDKHEGALSLCIRDAEAAGVKDS 1193
            + W D+D     +  + A     +A  ++     I+GND H GALSL  R A AAGV   
Sbjct: 752  ERWPDHDPTLLEEVLEEAAEIGAAARKKMGGAPVIIGNDVHAGALSLARRAAMAAGVDGV 811

Query: 1194 LELTCKDCRDYVPSFHPS---------------------------------------LVI 1256
            ++    D  D     HP                                        LV+
Sbjct: 812  IDFVQGDAADLT---HPKLTEFTAAALERDGLQRDVDEPVDLDDLGPTALRAEGGGVLVV 868

Query: 1257 VNPPWGFRLG----------------------------------NSRENEN--------- 1307
             NPPWG R+G                                  + R  E+         
Sbjct: 869  SNPPWGMRIGARDDGYDGDDAGDGYGDGNWDDAASDAGSVRGGDSVRGGESVPSAGTLRA 928

Query: 1308 EDVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPISMGGLECRLLHYFV 1487
             DV   WQ+LG F ++ C  A  ++ SG+ + +R LR++A RK  + +GG++CRLL Y +
Sbjct: 929  PDVEEAWQSLGAFFRRECGGATAHLLSGDANATRPLRMRARRKRVLGIGGVDCRLLEYRI 988

Query: 1488 LPPK 1499
            LPP+
Sbjct: 989  LPPR 992


>XP_005845297.1 hypothetical protein CHLNCDRAFT_137023 [Chlorella variabilis]
            EFN53195.1 hypothetical protein CHLNCDRAFT_137023
            [Chlorella variabilis]
          Length = 602

 Score =  271 bits (693), Expect = 7e-80
 Identities = 166/366 (45%), Positives = 206/366 (56%), Gaps = 10/366 (2%)
 Frame = +3

Query: 246  FFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAIRVLH-- 419
            FFAT  PGLE VVAAEL  P IG       K GV F G + TGY ANLWLRSAIRVL   
Sbjct: 103  FFATSHPGLEAVVAAELLGPAIG-------KAGVSFTGDVATGYRANLWLRSAIRVLMLL 155

Query: 420  ELSYGELSRKDRDCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRVWDCIQ 599
            E +  E +R   + VY   R+  DW   +    T              F+V+SRVW C  
Sbjct: 156  EETLLEGTRPAGEEVYDAFRDVTDWSALLEPGQT--------------FSVESRVWSCSN 201

Query: 600  VSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRDMSGI 779
            +S+S    VR KDA+CDAIRD     KP PPE G  ++D+PLF + Y +R  IYRDMSG 
Sbjct: 202  LSSSQLLLVRGKDAVCDAIRDR-RGSKPLPPEPG-RVADMPLFCTAYHDRLSIYRDMSGA 259

Query: 780  SLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMD-------DVVLLDP 938
                RGYR AMHRASLNE+ AAG+L M+G++      G  +              VL DP
Sbjct: 260  ----RGYRQAMHRASLNESAAAGILHMSGWHQLCKQEGAADGAAARCSAPCPLPAVLADP 315

Query: 939  MCGSGTFLIEAALMASNRAPGLMRTHWPFKSWHD-YDAVSWNDCCKSATSAATEVPNHLK 1115
            MCGSGTFLIEAALMA+N APG  R  WPF  WHD +D  +W    + A +     P  ++
Sbjct: 316  MCGSGTFLIEAALMATNTAPGSFRRWWPFTQWHDSFDRDAWAAAVEQAAAGRHAPPAGVE 375

Query: 1116 ILGNDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPPWGFRLGNSR 1295
              GND H GALSL +RD +AAGV+  + L   +C  +     P++++ NPPWG RL    
Sbjct: 376  AWGNDVHRGALSLALRDVQAAGVQGMVRLHHGECAGWELPRRPAVLVSNPPWGQRLRGRG 435

Query: 1296 ENENED 1313
              E+ D
Sbjct: 436  AAESFD 441


>KXZ42082.1 hypothetical protein GPECTOR_209g410 [Gonium pectorale]
          Length = 607

 Score =  255 bits (652), Expect = 8e-74
 Identities = 176/504 (34%), Positives = 238/504 (47%), Gaps = 92/504 (18%)
 Frame = +3

Query: 246  FFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGT-MTTGYMANLWLRSAIRVLHE 422
            FFATC PGLE+VVAAEL    +G  G+   K GV F+G  ++ GY ANLWLRSAIRVL  
Sbjct: 12   FFATCHPGLEEVVAAELRQ--LGYRGIEPSKAGVAFRGRRVSDGYAANLWLRSAIRVLVL 69

Query: 423  LSYGELS------RKDRDCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRV 584
            L+ G L       R+    +Y F+ E+  W   +   +T              F+V  R+
Sbjct: 70   LAEGPLGSGPTDRRRGGQALYDFVYEAAPWHELVPPGST--------------FSVAPRL 115

Query: 585  WDCIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYR 764
            + C  + ++     R KDA+CD+IR    + KP PPE G  ++DVPL+++ + +   ++R
Sbjct: 116  YGCTDLFSTQLVWSRVKDAVCDSIRQHRPD-KPDPPERG-CVADVPLYITCHLDWVKVFR 173

Query: 765  DMSGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMDDVVLLDPMC 944
            DMSG SLHRRGYR  MHRA+LNEA AAG+L +AG+   V   G       +D+VL DPMC
Sbjct: 174  DMSGESLHRRGYRDVMHRAALNEAAAAGVLALAGWPQLVEEAG-----DGEDLVLADPMC 228

Query: 945  GSGTFLIEAALMASNRAPGLMRT--------------------------------HWPFK 1028
            GSGT LIEAALMA   APGLMR+                                 WPF+
Sbjct: 229  GSGTLLIEAALMARGIAPGLMRSLKLEPDPAAAAAAGGGGRPGGGGRLRAPMATAAWPFQ 288

Query: 1029 SWHDYDAVSWNDCCKSATSAATEVPNHLKILGNDKHEGALSLCIRDAEAAGVKDSLELTC 1208
             W DYDA +W +    A +A                 GA SL +R A  AGV   +ELT 
Sbjct: 289  RWGDYDAGAWWEAVDGARAA-----------------GAHSLAVRQARKAGVDSMIELTQ 331

Query: 1209 KDCRDYVPSFHPSLVIVNPPWGFRL----------------------GNSRENENED--- 1313
             DC    P   P+LV+ NPPWG RL                      G  R++ + D   
Sbjct: 332  GDCGALAPPVTPNLVVCNPPWGGRLAAEAGDERRRRGGAALHDEGYEGQDRDDHDHDGGG 391

Query: 1314 ----------------------------VISTWQALGQFAKQNCNNADMYVFSGEQSLSR 1409
                                        + + W++L  F  ++C  A  +V SG     R
Sbjct: 392  DPGEEREGGGGSGEGWGEEGGGPETEAYLEAAWRSLDSFLYRHCPGASAFVLSGNDDAFR 451

Query: 1410 ELRLKADRKWPISMGGLECRLLHY 1481
             L+LK   K  + +GG+  ++  Y
Sbjct: 452  YLKLKPSSKQRLVLGGVGVQVAGY 475


>XP_002946067.1 hypothetical protein VOLCADRAFT_115653 [Volvox carteri f.
            nagariensis] EFJ53062.1 hypothetical protein
            VOLCADRAFT_115653, partial [Volvox carteri f.
            nagariensis]
          Length = 475

 Score =  239 bits (610), Expect = 4e-69
 Identities = 162/449 (36%), Positives = 224/449 (49%), Gaps = 46/449 (10%)
 Frame = +3

Query: 273  EQVVAAELSSPMIGALGVREGKGGVYFQGT-MTTGYMANLWLRSAIRVLHELSYGELSRK 449
            +QVVA EL    +G   V   K GV F+G  ++ GY ANLWLRSAIRVL  L+ G+L   
Sbjct: 38   QQVVARELVE--LGYRDVVPSKAGVEFRGRRVSDGYAANLWLRSAIRVLVLLAEGQLGTD 95

Query: 450  DR------DCVYTFIRESVDWPTYIA-SCTTIDNRTGFKKWNFRSFAVQSRVWDCIQVSN 608
             R        +Y  + ++  W   +   CT               F+V+ R+W C  + +
Sbjct: 96   PRGGVRGGQALYDMVYDAAPWHEIVPPGCT---------------FSVEPRLWSCTDIFS 140

Query: 609  SMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRDMSGISLH 788
            +     R KDA+CD IR      KP PP  G  ++DVP+++S Y++   ++RDMSG SLH
Sbjct: 141  TRLVWSRVKDAVCDNIR-RYGREKPLPPARG-QVADVPVYVSCYRDHVRVFRDMSGTSLH 198

Query: 789  RRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMDDVVLLDPMCGSGTFLIE 968
            RRGYR  MHRA+LNEA AAG+LT++G+   V   G    NG + ++L DPMCGSGT LIE
Sbjct: 199  RRGYRDVMHRAALNEAAAAGVLTLSGWKEAVDDAG---GNG-EGLILADPMCGSGTILIE 254

Query: 969  AALMASNRAPGLMRT--------------------------------------HWPFKSW 1034
            AALMA + APG MR+                                       WPF+ W
Sbjct: 255  AALMARDIAPGFMRSLLLDDAPPTPSAAAVGVGLGGGGPGRGVGRRHAALAPAAWPFQRW 314

Query: 1035 HDYDAVSWNDCCKSATSAATEVPNHLKILGNDKHEGALSLCIRDAEAAGVKDSLELTCKD 1214
             DYD+  W++  ++A       P   ++LG D HEGALSL +R A+ AGV + LELT  D
Sbjct: 315  GDYDSRVWSEAVETARD-RVRPPWRGRLLGVDVHEGALSLAVRQAKKAGVYNMLELTHGD 373

Query: 1215 CRDYVPSFHPSLVIVNPPWGFRLGNSRENENEDVISTWQALGQFAKQNCNNADMYVFSGE 1394
            C   VP+  P L                     + + W++L  F  + C  A   V SG 
Sbjct: 374  CGTVVPAATPHL------------------EAFLAAAWRSLDGFLYRQCPGASATVLSGN 415

Query: 1395 QSLSRELRLKADRKWPISMGGLECRLLHY 1481
             S  R L+L+   K  + + G+E ++  Y
Sbjct: 416  ASTYRYLKLRPQSKNRLVLSGVEVQVASY 444


>XP_005646122.1 putative RNA methylase [Coccomyxa subellipsoidea C-169] EIE21578.1
            putative RNA methylase [Coccomyxa subellipsoidea C-169]
          Length = 268

 Score =  228 bits (581), Expect = 2e-67
 Identities = 123/258 (47%), Positives = 163/258 (63%), Gaps = 12/258 (4%)
 Frame = +3

Query: 768  MSGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMDDVVLLDPMCG 947
            MSG SLHRRGYRSAMH+ASLNEA AAG L +AG+    P       +     VL DPMCG
Sbjct: 1    MSGDSLHRRGYRSAMHKASLNEAAAAGCLALAGW----PQAAAAGMHHCPWKVLADPMCG 56

Query: 948  SGTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSATSAATEVPNHLKILGN 1127
            SGTFLIEAALMA++ APGL R  WPF+ W D+DA +W      A  A    P    +LGN
Sbjct: 57   SGTFLIEAALMATHSAPGLYRRRWPFERWPDFDAAAWRRVVADAKGACR--PWKGTLLGN 114

Query: 1128 DKHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPPWGFRL-------- 1283
            D H GALSL  +D   AG+   ++LT   C ++ P+  P++VI NPPWG RL        
Sbjct: 115  DIHSGALSLAAKDLGNAGLSKLVQLTHGPCSEWQPTQRPAMVITNPPWGNRLMSPSGEGG 174

Query: 1284 GNSRENE----NEDVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPISM 1451
             + R++E    + D+ + W+ LG F K  C  AD+ V SG + ++  LR+KADR++P+++
Sbjct: 175  PDGRQSEEASIDPDLEAAWRDLGFFLKGQCPEADVAVLSGNKHITSMLRMKADRRFPMTI 234

Query: 1452 GGLECRLLHYFVLPPKPD 1505
            GG++CRL+ Y VLPPKP+
Sbjct: 235  GGVDCRLIKYKVLPPKPE 252


>GAQ80938.1 hypothetical protein KFL_000660330 [Klebsormidium flaccidum]
          Length = 716

 Score =  238 bits (608), Expect = 2e-66
 Identities = 139/320 (43%), Positives = 186/320 (58%), Gaps = 14/320 (4%)
 Frame = +3

Query: 36  LTTCKNPLLSHTSHHNSPTQKPK-------RKNLSSTF--VVQNILPSLTTNNSQSATIF 188
           L++CK   LS  ++  +P   P          N S TF   +       T++  +++T  
Sbjct: 45  LSSCKLRALSEETNWKTPLTSPNAPPFSILHSNHSCTFHTALGGRRQPRTSHGGENSTTR 104

Query: 189 PSQKQEAPTTTSNTNNPCKFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMT 368
             + + A        +  ++FATC+PGLE++VA EL+SP+I A  V   + GV F G + 
Sbjct: 105 NVRTEAANFELDAPTSEFRYFATCTPGLEKIVAEELASPLIDAQNVSPARAGVAFMGDLA 164

Query: 369 TGYMANLWLRSAIRVLHELSYGELS--RKDRDCVYTFIRESVDWPTYIASCTTIDNRTG- 539
            GY ANLWLR A+RVL  ++ GEL   R   D VY FIR +VDW   +     +  R G 
Sbjct: 165 VGYRANLWLRCAVRVLIHMAEGELVPWRPAGDEVYDFIRGAVDWQALLL----VGGRNGA 220

Query: 540 --FKKWNFRSFAVQSRVWDCIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMS 713
             FK+   R+F V +RVWDC  V++S   SVRAKDAICDAIRD C   KP PPE G   +
Sbjct: 221 RRFKESQLRTFWVDARVWDCSNVTHSGMVSVRAKDAICDAIRDTCQGLKPPPPENGANDA 280

Query: 714 DVPLFLSLYKNRAIIYRDMSGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFG 893
           D+PLFL+LY+++A +YRDMSG+SLH+RGYR AMHRASLNE +AAG+L +AGF +   GF 
Sbjct: 281 DLPLFLTLYRDKATLYRDMSGVSLHKRGYRDAMHRASLNEGVAAGMLKLAGFGA--GGFR 338

Query: 894 VVNKNGMDDVVLLDPMCGSG 953
              K          P  GSG
Sbjct: 339 AKRKGAESGGAGASPERGSG 358



 Score =  223 bits (568), Expect = 7e-61
 Identities = 119/257 (46%), Positives = 157/257 (61%), Gaps = 21/257 (8%)
 Frame = +3

Query: 789  RRGYRSAMHRASLNEALAAGLLTMA------GFNSRVPGFGVVNKNGMDDV--------- 923
            R+G   A H  S + A A G LT A      G   R+      + +G  D          
Sbjct: 372  RQGVTLADHNMS-SAATADGGLTSAALEVPDGNGRRLESHAEASASGRGDADEAQGGADV 430

Query: 924  --VLLDPMCGSGTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSATSAATE 1097
              VLLDPMCGSGT LIEAALMA+N APGL+R  WPF+SW D+D   W  C + A S  T+
Sbjct: 431  AQVLLDPMCGSGTLLIEAALMAANTAPGLLRRRWPFESWPDFDEQLWRQCVRDARSHQTD 490

Query: 1098 VPNHLKILGNDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPPWGF 1277
             P  L++LGND H GAL LC +DA AA + D++ L+ +DC  YVP   P+ V+VNPPWG 
Sbjct: 491  WPQGLRLLGNDIHPGALELCRKDATAARMIDAITLSQEDCVRYVPPAKPTHVVVNPPWGL 550

Query: 1278 RL----GNSRENENEDVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPI 1445
            RL      ++E+E+  + S +Q LG F K  C  AD+++ SG    +R +RL+ADRKWP+
Sbjct: 551  RLETLDREAQEDEHYMLASAYQKLGAFLKDQCTGADVFLLSGNADATRNMRLRADRKWPL 610

Query: 1446 SMGGLECRLLHYFVLPP 1496
            S+GG++CR+LHY VLPP
Sbjct: 611  SVGGIDCRVLHYKVLPP 627


>OAI46250.1 hypothetical protein AYO44_11755 [Planctomycetaceae bacterium SCGC
            AG-212-F19]
          Length = 374

 Score =  227 bits (579), Expect = 1e-65
 Identities = 149/416 (35%), Positives = 219/416 (52%), Gaps = 2/416 (0%)
 Frame = +3

Query: 246  FFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAIRVLHEL 425
            FFATC+ GLE V+A EL +  +GA  V  G+GGV F G     Y ANLWLR+A+RVL  +
Sbjct: 4    FFATCARGLEPVLADELRA--LGAADVAPGRGGVGFAGDKALLYRANLWLRTAVRVLQPI 61

Query: 426  SYGELSRKDRDCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRVWDCIQVS 605
                ++  D   +Y  +R ++DW  Y+    T+              AV S V D  +++
Sbjct: 62   LQARVASPDE--LYEAVR-TIDWAKYLTPEHTL--------------AVDSNVRDS-RIT 103

Query: 606  NSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRDMSGISL 785
            +S  A++R KDAICD   + C  R+PS   E      V L L +Y++ A++  D SG SL
Sbjct: 104  HSKYAALRVKDAICDQFVERCG-RRPSVDVETPL---VGLNLHIYRDEAVLSLDSSGESL 159

Query: 786  HRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMDDVVLLDPMCGSGTFLI 965
            H+RGYR  + RA LNEALAA L+ + G+    P                DP+CGSGT  I
Sbjct: 160  HKRGYRPILTRAPLNEALAAALILLTGWRGETP--------------FADPLCGSGTLPI 205

Query: 966  EAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSA-TSAATEVPNHLKILGNDKHEG 1142
            EAA +A  R PGL R  + F  W D+D   W D    A      E+P    I+G+D    
Sbjct: 206  EAAWIALRRPPGLTRKRFGFMGWMDFDVALWTDIRDDARRGVRKELP--APIVGSDIRGD 263

Query: 1143 ALSLCIRDAEAAGVKDSLELTCKDCRDY-VPSFHPSLVIVNPPWGFRLGNSRENENEDVI 1319
            A++ C+  A AAGV   L    KD RD+  P   P +++ NPP+G R+G     E  ++ 
Sbjct: 264  AIAFCVSRASAAGVGHLLRFEGKDVRDFRPPDGPPGVIVCNPPYGERIG-----EEHELR 318

Query: 1320 STWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPISMGGLECRLLHYFV 1487
              +++LG+   + C    ++VF+G  +L+R+++L    +  +  G + C+LL + V
Sbjct: 319  PLYRSLGEVFSERCPGWRVFVFTGNGALARQIKLPVAERLHLFNGKIPCQLLRFEV 374


>XP_001697211.1 hypothetical protein CHLREDRAFT_175931 [Chlamydomonas reinhardtii]
            EDP00466.1 predicted protein, partial [Chlamydomonas
            reinhardtii]
          Length = 472

 Score =  228 bits (581), Expect = 7e-65
 Identities = 154/438 (35%), Positives = 220/438 (50%), Gaps = 7/438 (1%)
 Frame = +3

Query: 189  PSQKQEAPTTTSNTNNPCKFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGT-M 365
            P  ++ AP+ T +      FFATC PGLEQVVA EL S  +G  GV  G+ GV F    +
Sbjct: 45   PRGEESAPSATPS--GWVSFFATCHPGLEQVVANELLS--LGFRGVEPGRAGVSFVARRL 100

Query: 366  TTGYMANLWLRSAIRVLHELSYGELS------RKDRDCVYTFIRESVDWPTYIASCTTID 527
            + GY ANL LR+AIRV+  L+ GEL       ++    +Y  + E+  W   I       
Sbjct: 101  SDGYAANLHLRAAIRVMALLAEGELGADPQAGKRGGQALYDMVYEAAPWHDIIPRGA--- 157

Query: 528  NRTGFKKWNFRSFAVQSRVWDCIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGA 707
                       SF+V+ R+W C  +S +     R KDA+CD IR   ++ KP+PPE+G  
Sbjct: 158  -----------SFSVEPRLWSCTDISTTQLVWSRVKDAVCDNIRQHRSD-KPAPPEKG-K 204

Query: 708  MSDVPLFLSLYKNRAIIYRDMSGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPG 887
            ++DVPL+++ YK+   +YRDMSG SLHRRGYR  MHRA+LNEA AAG+L M+G+   +  
Sbjct: 205  VADVPLYVTCYKDHIKVYRDMSGESLHRRGYRDVMHRAALNEAAAAGVLLMSGWRQALEE 264

Query: 888  FGVVNKNGMDDVVLLDPMCGSGTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDC 1067
             G       + +VL DPM                R   L    WPF+ W DYD+ +W + 
Sbjct: 265  AG-----DGEGLVLADPM----------------REAPLAEGAWPFQHWGDYDSAAWTEQ 303

Query: 1068 CKSATSAATEVPNHLKILGNDKHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPS 1247
             ++A  A    P   +++G D HEGAL L  R A  AGV + LEL+  DC          
Sbjct: 304  VEAA-RARVRPPWRGRLVGIDVHEGALGLAERQARKAGVYNMLELSLADC---------- 352

Query: 1248 LVIVNPPWGFRLGNSRENENEDVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKA 1427
                            + E+  + + W++L  F  + C  A   V SG     + L+LK 
Sbjct: 353  ---------------GQGEDAFLAAAWKSLDGFLYRQCPGASASVISGNPDPFKYLKLKP 397

Query: 1428 DRKWPISMGGLECRLLHY 1481
              K  +++ G+E ++  Y
Sbjct: 398  QSKHRLTLSGMEVQVAGY 415


>OFW33426.1 RNA methyltransferase [Actinobacteria bacterium GWC2_53_9]
          Length = 375

 Score =  221 bits (562), Expect = 3e-63
 Identities = 146/417 (35%), Positives = 212/417 (50%), Gaps = 5/417 (1%)
 Frame = +3

Query: 246  FFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAIRVLHEL 425
            FFAT + G+E V+A E+ S  +   G  E  GGV F GTM   Y ANLWLR+A R+L  L
Sbjct: 5    FFATTAKGIEAVLAREIESLGLAVAG--EQTGGVTFTGTMEAAYRANLWLRTANRILMPL 62

Query: 426  SYGELSRKDRDC-----VYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRVWD 590
                   K+ DC     +Y+ +R+ VDWP Y+    T+              AV + V D
Sbjct: 63   -------KEFDCFSEQDLYSRVRK-VDWPRYLTPSMTL--------------AVDANVRD 100

Query: 591  CIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRDM 770
               +++S  A+++ KDAI D +R    +R    P +     D+ + + +++NR  +  D 
Sbjct: 101  S-HITHSKYAALKTKDAIVDHLRTKLGSRPDVNPND----PDLRVNVHIHRNRCTLSLDT 155

Query: 771  SGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMDDVVLLDPMCGS 950
            SG SLHRRGYR +   A L E LAA L+ +  ++   P               +DPMCGS
Sbjct: 156  SGESLHRRGYRRSQVEAPLKETLAAALVELTDWDGTTP--------------FIDPMCGS 201

Query: 951  GTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSATSAATEVPNHLKILGND 1130
            GT +IEAAL A+N APGL+R  + F+ W D+D   W D       A  +      I G D
Sbjct: 202  GTIIIEAALKAANIAPGLIRERFGFQRWLDFDQALW-DRLTGEARALRKSKLGAVIAGYD 260

Query: 1131 KHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPPWGFRLGNSRENENE 1310
                A+     +  AAG+ + + L  KD  D+ P   P ++IVNPP+G RLG+  E E  
Sbjct: 261  SSSKAIKAAAANIRAAGLDELITLGTKDISDFTPPPGPGVIIVNPPYGERLGDKEELE-- 318

Query: 1311 DVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPISMGGLECRLLHY 1481
                 ++ +G   KQ CN    YVF+G   L++ + LK  R++ +  G +E RLL +
Sbjct: 319  ---PLYKLIGDIFKQRCNGYTGYVFTGNLDLAKRIGLKTSRRFVLYNGPIESRLLKF 372


>WP_041969657.1 RNA methyltransferase [Geobacter sp. OR-1]
          Length = 384

 Score =  218 bits (555), Expect = 5e-62
 Identities = 148/417 (35%), Positives = 215/417 (51%)
 Frame = +3

Query: 231  NNPCKFFATCSPGLEQVVAAELSSPMIGALGVREGKGGVYFQGTMTTGYMANLWLRSAIR 410
            N   +FFATC+ G+E+++A EL++  IG  G+   +GGV F G +     ANLWLR+A R
Sbjct: 10   NRSMEFFATCARGVEEILAGELAT--IGVAGIAAERGGVRFSGELADCRKANLWLRTANR 67

Query: 411  VLHELSYGELSRKDRDCVYTFIRESVDWPTYIASCTTIDNRTGFKKWNFRSFAVQSRVWD 590
            V+  L+           +Y  +R  + W  Y+    T+              AV   + D
Sbjct: 68   VMVPLA--GFPCDSPQSLYDGVR-GIPWNDYLTPEMTL--------------AVDCSLRD 110

Query: 591  CIQVSNSMSASVRAKDAICDAIRDACNNRKPSPPEEGGAMSDVPLFLSLYKNRAIIYRDM 770
               +++S + +++AKDAI D IRD   +R     +E G   ++ L     KNR  +  D 
Sbjct: 111  SA-MTHSHNTALKAKDAIVDTIRDRTGSRPSIDTKEPGLRVNIHLL----KNRCTVSLDS 165

Query: 771  SGISLHRRGYRSAMHRASLNEALAAGLLTMAGFNSRVPGFGVVNKNGMDDVVLLDPMCGS 950
            SG  L RRGYR+  + A L E LAA ++   G++  VP              L DP+CGS
Sbjct: 166  SGAPLDRRGYRTERNEAPLRETLAAAIVLSTGWDGTVP--------------LSDPLCGS 211

Query: 951  GTFLIEAALMASNRAPGLMRTHWPFKSWHDYDAVSWNDCCKSATSAATEVPNHLKILGND 1130
            GT LIEAA+MAS RAPGL R+ + F+ W  +DA  W      A S A +      I G+D
Sbjct: 212  GTILIEAAMMASKRAPGLGRS-FGFEGWPGFDATLWKKELAEARSRALD-KLAAPIFGSD 269

Query: 1131 KHEGALSLCIRDAEAAGVKDSLELTCKDCRDYVPSFHPSLVIVNPPWGFRLGNSRENENE 1310
                 ++   R+AE AGV+D + LT  D  D+ P     ++I NPP+G R+G     E E
Sbjct: 270  SDGRTIATARRNAERAGVRDLIALTRHDMADFNPPPGNGVIICNPPYGERMG-----EME 324

Query: 1311 DVISTWQALGQFAKQNCNNADMYVFSGEQSLSRELRLKADRKWPISMGGLECRLLHY 1481
             +   ++ +G   KQ C     ++F+G Q LS+ + LKA R+ P+  G LECRLL Y
Sbjct: 325  ALKPFYRQIGDLFKQRCKGNTAWIFTGSQELSKNVGLKATRRIPLWNGPLECRLLKY 381

