BLASTX nr result

ID: Ephedra27_contig00016326 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00016326
         (1461 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006846498.1| hypothetical protein AMTR_s00018p00151280 [A...   226   2e-56
ref|XP_002505906.1| predicted protein [Micromonas sp. RCC299] gi...   158   6e-36
ref|XP_003062694.1| predicted protein [Micromonas pusilla CCMP15...   150   1e-33
ref|XP_002966729.1| hypothetical protein SELMODRAFT_85312 [Selag...   139   4e-30
ref|XP_002978017.1| hypothetical protein SELMODRAFT_176683 [Sela...   137   9e-30
ref|XP_005647237.1| hypothetical protein COCSUDRAFT_66346 [Cocco...   136   3e-29
ref|XP_001691256.1| hypothetical protein CHLREDRAFT_144949 [Chla...   133   2e-28
emb|CCO66478.1| predicted protein [Bathycoccus prasinos]              133   2e-28
ref|XP_003078104.1| G-patch nucleic acid binding protein (ISS) [...   129   2e-27
ref|XP_004246061.1| PREDICTED: protein MOS2-like isoform 1 [Sola...   126   2e-26
ref|XP_002947360.1| hypothetical protein VOLCADRAFT_87660 [Volvo...   124   1e-25
ref|XP_004335226.1| KOW motif domain containing protein [Acantha...   123   2e-25
ref|XP_635990.1| hypothetical protein DDB_G0289933 [Dictyosteliu...   122   3e-25
ref|XP_006355237.1| PREDICTED: protein MOS2-like [Solanum tubero...   118   6e-24
gb|EFA83563.1| hypothetical protein PPL_02629 [Polysphondylium p...   115   4e-23
gb|EXB45122.1| Protein MOS2 [Morus notabilis]                         114   1e-22
gb|EXC18489.1| Protein MOS2 [Morus notabilis]                         112   3e-22
ref|XP_004350567.1| hypothetical protein DFA_11620 [Dictyosteliu...   109   3e-21
ref|XP_003589709.1| Pre-mRNA-splicing factor spp2 [Medicago trun...   108   4e-21
ref|XP_004169661.1| PREDICTED: protein MOS2-like [Cucumis sativus]    108   6e-21

>ref|XP_006846498.1| hypothetical protein AMTR_s00018p00151280 [Amborella trichopoda]
            gi|548849308|gb|ERN08173.1| hypothetical protein
            AMTR_s00018p00151280 [Amborella trichopoda]
          Length = 540

 Score =  226 bits (576), Expect = 2e-56
 Identities = 152/451 (33%), Positives = 230/451 (50%), Gaps = 6/451 (1%)
 Frame = -3

Query: 1336 KLSFSMASSSKTRITRPAPIIEE---EDDQKPEYVSEFDXXXXXXXXXXKVIAKLDNTWR 1166
            KLSFS++S   +R  RP    E    E++ K E+V+EFD           VI + +++WR
Sbjct: 2    KLSFSLSSKRSSR-PRPTDFGERNTNEEEPKAEFVTEFDSSKTPSEKSRLVIPRQESSWR 60

Query: 1165 PELKMKNIHPQSTDTGLQFVTEEEQSQTDKKVEYGLNLRNPQHNNIPSDDKSKPRVSENQ 986
             E  MKNI P+ T    + +T E   ++D  V YGLNLRN          KS    S+ +
Sbjct: 61   AEKNMKNIKPEETHLEFEIITHETSIESD--VGYGLNLRN----------KSNGGDSKRE 108

Query: 985  NPENGNKIDSENLKEIPRVSRDEI-SENVKNSGNYDHNSGKIERVSENGKNSGRFDHLSK 809
            N + GN      L  +  V   E+ ++  K+ GN    S K                  K
Sbjct: 109  NEDMGNS----GLSCMEPVEATEVDAKRKKDMGNSSFPSVK-----------------PK 147

Query: 808  NFDNSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRGM 629
            N D+                              L +  G DE+ ++P+E FG A+L G 
Sbjct: 148  NLDSE-----------------------------LEEDGGLDEFSDMPIEGFGAAVLAGY 178

Query: 628  GWNESEGVG-KCRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVREKRDLVA 452
            GW E +G+G K +K ++ + Y+RR G  GLG  P+  PEK+ K+  +PGE    + +L+A
Sbjct: 179  GWTEGQGIGRKAKKDIQVVQYIRRAGMGGLGFTPSSVPEKKQKKYVKPGESRESRPELIA 238

Query: 451  AAGENGRVRHVVGIGEKLVERAKKGPAVGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVL 272
              G NGR+RH VGI EKLV R  KG  VGK++R++ G H GLKG+++++   +G   K+ 
Sbjct: 239  PKGSNGRIRHAVGIDEKLVPREIKGFFVGKILRVIGGPHLGLKGQLIEIFGDDGSSQKIG 298

Query: 271  LRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKELKYDKMRDRGEDY 92
            L+L++SEE V+VD  +LA+LGS EE++ LKR  E K +  D    K L+ D+      ++
Sbjct: 299  LKLLKSEEMVVVDREELAELGSLEEDKCLKRMRELKLE-GDGNRLKHLRRDERESHNGEF 357

Query: 91   KKGKEYDDIHRRDSR-DKKPDNSSGKYDSRD 2
             K ++ + +H   SR D++ + SS K +  D
Sbjct: 358  GKERKAEPLHGDVSRHDRERERSSSKREKED 388


>ref|XP_002505906.1| predicted protein [Micromonas sp. RCC299] gi|226521177|gb|ACO67164.1|
            predicted protein [Micromonas sp. RCC299]
          Length = 493

 Score =  158 bits (399), Expect = 6e-36
 Identities = 105/337 (31%), Positives = 179/337 (53%), Gaps = 8/337 (2%)
 Frame = -3

Query: 1018 DKSKPRVSENQNPENGNKIDSENLKEIPRVSRDEISENVKNSGNYDHNSGKIER--VSEN 845
            DK +  V      EN  ++ S   ++ P    DE + +  N   ++ N+  + +   +++
Sbjct: 48   DKVEKVVKSIPVQENTFEVGSGRRRKAPSFIPDESAIDADNKERFE-NADLLAKGETAQH 106

Query: 844  GKNSGRFDHLSKNFDNSSGRFDNKSVSESVHGDNDRDR---VFKEEIQSLPDAAGFDEYE 674
                G      K+ D    + + K   ES  G +  ++    FKE+++ LP+ A  D+YE
Sbjct: 107  DVVYGLTKMGPKDGDTPKAKAEEKPAQESFIGKSLAEKELQAFKEDVEDLPEQATLDDYE 166

Query: 673  NVPVEEFGMALLRGMGWNESEGVGKCRK-VVEPIAYVRRDGKVGLGAVPAPKPEKESKRI 497
             +P+E+FG A+LRGMGW E + VG+    +V  + +V R G++GLGA PAP  ++ +K+ 
Sbjct: 167  QMPIEDFGAAMLRGMGWEEGKPVGRNNNGMVAAVEFVPRSGRLGLGADPAPSKQENTKKY 226

Query: 496  KRPGEEVREKRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAVGKVVRIVDGKHSGLKGK 317
             +PGE       +V A G  G+ R+V  + EKLV+  + G   GK + +V+G+H GL G+
Sbjct: 227  IKPGESREAPATMVLAKGPEGQSRNVKTLDEKLVKLEEPGAREGKRMCVVEGRHRGLTGR 286

Query: 316  VLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHK 137
            VLK+ ++EGR  +  L L  S E V +   +LADLG+ + ++ ++    +K DR     K
Sbjct: 287  VLKVVKQEGRSDRAQLELDTSGEVVTIRTGELADLGTRDADRAMR----KKDDR--AGGK 340

Query: 136  KELKYDKMRDRGEDYKKGK--EYDDIHRRDSRDKKPD 32
             + +    RD GE  KK +  + DD+     R+K+ +
Sbjct: 341  GDARATGKRDDGEGGKKRERSDRDDVGGAQKREKREE 377


>ref|XP_003062694.1| predicted protein [Micromonas pusilla CCMP1545]
           gi|226456211|gb|EEH53513.1| predicted protein
           [Micromonas pusilla CCMP1545]
          Length = 496

 Score =  150 bits (379), Expect = 1e-33
 Identities = 79/178 (44%), Positives = 117/178 (65%), Gaps = 2/178 (1%)
 Frame = -3

Query: 733 RVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESEGVGKCRK-VVEPIAYVRRD 557
           + F+E++Q LP+ A  D+YE++P+E+FG A+LRGMGW E + VG+  K +V  + +V R 
Sbjct: 158 QAFREDVQDLPEQASLDDYESMPIEDFGAAMLRGMGWEEGKPVGRNSKGLVAAVEFVPRA 217

Query: 556 GKVGLGAVPAPKPEKESKRIKRPGEEVREKRD-LVAAAGENGRVRHVVGIGEKLVERAKK 380
           G++GLGA PAPKPE    R  +PG E RE+ D +V   G  G+ R+V  + EKLV+R + 
Sbjct: 218 GRLGLGAEPAPKPEVSDARRIKPG-ETRERADVMVLPEGPEGKSRNVKSLDEKLVKREEP 276

Query: 379 GPAVGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGS 206
           GP  GK++ +V+G+H GL+ +VL + + EGR  +  + L  S E V V   +LAD GS
Sbjct: 277 GPREGKMMCVVEGRHRGLECRVLTLTKSEGRSERAQVELKTSGEVVTVRTSELADFGS 334


>ref|XP_002966729.1| hypothetical protein SELMODRAFT_85312 [Selaginella moellendorffii]
            gi|300166149|gb|EFJ32756.1| hypothetical protein
            SELMODRAFT_85312 [Selaginella moellendorffii]
          Length = 377

 Score =  139 bits (349), Expect = 4e-30
 Identities = 100/289 (34%), Positives = 147/289 (50%), Gaps = 3/289 (1%)
 Frame = -3

Query: 1006 PRVSENQNPENGNKIDSENLKEIPRVSRDEISENVKNSGNYDH-NSGKIERVSENGKNSG 830
            PR+  +  PE   K    NL   P  S  E    V  +G  D    G   R S+ G  S 
Sbjct: 4    PRLENSWKPEKRMK----NLMSAPEDSVQEFVGEVLPAGPVDGVQYGLSVRSSKAGGGSV 59

Query: 829  RFDHLSKNFDNSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFG 650
            + D +                +  +    D++ +  + ++ LP++A    Y+ +PVE+FG
Sbjct: 60   KTDIME---------------ARELRAKLDKEALV-DHLEFLPESASLAAYDAMPVEKFG 103

Query: 649  MALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVR 473
             ALL GMGW +  G+G+   + V+   +VRR  ++GLGA                     
Sbjct: 104  QALLLGMGWRDGRGIGRRATEDVKATEFVRRPERLGLGAA-------------------- 143

Query: 472  EKRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAVGKVVRIVDGKHSGLKGKVLKMEERE 293
             K DLVA  G +G+VRH V IGEKLVERAK+G   GKV+ IV G+H+GL+G+VL  +++ 
Sbjct: 144  -KPDLVAPVGPDGKVRHRVEIGEKLVERAKRGVYGGKVMTIVSGRHAGLRGEVLGRKDKV 202

Query: 292  GRDAKVL-LRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRND 149
                +V+ ++L  S E V VD ++LAD+GS EEE  +KR  E +  R D
Sbjct: 203  ANPGEVVGVKLAGSGETVEVDAKNLADVGSVEEENAMKRLKELRIQRGD 251


>ref|XP_002978017.1| hypothetical protein SELMODRAFT_176683 [Selaginella moellendorffii]
           gi|300154038|gb|EFJ20674.1| hypothetical protein
           SELMODRAFT_176683 [Selaginella moellendorffii]
          Length = 378

 Score =  137 bits (346), Expect = 9e-30
 Identities = 80/193 (41%), Positives = 116/193 (60%), Gaps = 2/193 (1%)
 Frame = -3

Query: 721 EEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGKVG 545
           + ++ LP++A    Y+ +PVE+FG ALL GMGW +  G+G+   + V+   +VRR  ++G
Sbjct: 81  DHLEFLPESASLAAYDAMPVEKFGQALLLGMGWRDGRGIGRRATEDVKATEFVRRPERLG 140

Query: 544 LGAVPAPKPEKESKRIKRPGEEVREKRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAVG 365
           LGA                      K DLVA  G +G+VRH V IGEKLVERAK+G   G
Sbjct: 141 LGAA---------------------KPDLVAPVGPDGKVRHRVEIGEKLVERAKRGVYGG 179

Query: 364 KVVRIVDGKHSGLKGKVLKMEEREGRDAKVL-LRLVQSEEDVLVDGRDLADLGSFEEEQF 188
           KV+ IV G+H+GL+G+VL  +++     +V+ ++L  S E V VD ++LAD+GS EEE  
Sbjct: 180 KVMTIVSGRHAGLRGEVLGRKDKVANPGEVVGVKLAGSGETVEVDAKNLADVGSVEEENA 239

Query: 187 LKRYWERKQDRND 149
           +KR  E +  R D
Sbjct: 240 MKRLKELRIQRGD 252


>ref|XP_005647237.1| hypothetical protein COCSUDRAFT_66346 [Coccomyxa subellipsoidea
           C-169] gi|384249211|gb|EIE22693.1| hypothetical protein
           COCSUDRAFT_66346 [Coccomyxa subellipsoidea C-169]
          Length = 505

 Score =  136 bits (342), Expect = 3e-29
 Identities = 88/222 (39%), Positives = 127/222 (57%), Gaps = 5/222 (2%)
 Frame = -3

Query: 727 FKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGK 551
           FK++++SLP+ AG D YE +PVE FG A+LRGMGW E + VGK  +++V    YVRR  K
Sbjct: 164 FKQDVESLPEVAGLDAYEAMPVEAFGEAMLRGMGWQEGKSVGKNAKEIVTAKEYVRRPEK 223

Query: 550 VGLGAVPAPKPEKESKRIKRPGEEVREKRDLVAAAGENGRVRHVVGIGEKLVERAKKGPA 371
           +GLGA PA    K  K IK+ GE    K+D++    E G  +HV     KLVER K+G  
Sbjct: 224 LGLGATPAAMLPKPKKYIKQ-GESREAKKDMIYMDAE-GHQKHVKDENAKLVEREKRGVH 281

Query: 370 VGKVVRIVDGKHSGLKGKVLKMEEREG-RDAKVLLRLVQSEEDVLVDGRDLADLGSFEEE 194
           VGK +R + G H+GL   VL +E + G R  +  +RL  S E V V  ++L +    E  
Sbjct: 282 VGKTMRCIAGTHAGLLCDVLALEPKVGDRSQRATVRLQPSAETVSVRVKELGE--KHESA 339

Query: 193 QFLKRYWERKQDRNDDAHKKELKYD--KMRDRGE-DYKKGKE 77
              +    +++ +     KK+ K+D  K++DR   D + G+E
Sbjct: 340 PEPENGSSKRKHKEHKHEKKQHKHDRSKVKDRSSADEEDGRE 381


>ref|XP_001691256.1| hypothetical protein CHLREDRAFT_144949 [Chlamydomonas reinhardtii]
           gi|158279228|gb|EDP04989.1| predicted protein
           [Chlamydomonas reinhardtii]
          Length = 597

 Score =  133 bits (335), Expect = 2e-28
 Identities = 79/177 (44%), Positives = 110/177 (62%), Gaps = 3/177 (1%)
 Frame = -3

Query: 736 DRVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESEGVGKCRKVVEPIAYVRRD 557
           ++ +K+ ++ LP+ A  D YE +P+EEFG A+LRGMGW E  GVG+ RK V+ I YVRR 
Sbjct: 141 EKAYKDSVEDLPEVADLDAYEAMPIEEFGRAMLRGMGWEEGMGVGRNRKQVDAIEYVRRP 200

Query: 556 GKVGLGAVPAPKPEKESKRIKRPGEEVREKRDLVAAAGENGRVRHVVGIGEKLVERAK-- 383
            ++GLGA P    E +SK +K  G++ R K+DLV A   +GR R+V  + EKLV RA   
Sbjct: 201 ERLGLGAQPVKVSEDKSKVVKM-GDKPR-KQDLVLAPDADGRQRNVRTLDEKLVSRATVL 258

Query: 382 KGPAVGKVVRIVDGKHSGLKGKVLK-MEEREGRDAKVLLRLVQSEEDVLVDGRDLAD 215
            GP   K +RI+ G HSGL    L+ + + EGR  +  +RL  S EDV V   +L +
Sbjct: 259 PGPQPNKPMRIMTGAHSGLLCTALEALPKPEGRPERWRVRLAASNEDVEVLASELGE 315


>emb|CCO66478.1| predicted protein [Bathycoccus prasinos]
          Length = 510

 Score =  133 bits (334), Expect = 2e-28
 Identities = 99/320 (30%), Positives = 166/320 (51%), Gaps = 15/320 (4%)
 Frame = -3

Query: 982 PENGNKIDSENLKEIPRVSRDEISENVKNSGNYDHNSGKIERVSENGKNSG-RFDHLSKN 806
           P    KID +   E+      E ++  K +         + R++  G  +  R+++  K 
Sbjct: 84  PTESEKIDDDKRFEVSETIGGETAQGNKTTYG-------LTRMAPKGDEAKVRYENKKKE 136

Query: 805 FD-NSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRGM 629
            D N +  F  KS++E      + D  FKE+++ LP+ A  + YE++P+E+FG A+LRGM
Sbjct: 137 QDKNKNESFIGKSLAEK-----ELD-AFKEDVEDLPEQASIEAYESMPIEDFGAAMLRGM 190

Query: 628 GWNESEGVGKCRKV--VEPIAYVRRDGKVGLGA----VPAPKPEKES----KRIKRPGEE 479
           GW E EGVG+  K    +P+ +V R G++GLGA    VP     K +    K+I++PGE 
Sbjct: 191 GWKEGEGVGRNAKSGRADPVEFVPRMGRLGLGADTMDVPGATLRKNNNNNDKKIRKPGET 250

Query: 478 VREKRDLVAAAGENGR---VRHVVGIGEKLVERAKKGPAVGKVVRIVDGKHSGLKGKVLK 308
             EK +++       R   +R+V  I EKLV++ + G   GK + +  GKH GL G+VLK
Sbjct: 251 REEKMEILVRDPNASRAPGMRNVKSIDEKLVKKEELGVKEGKRMYVAKGKHEGLTGRVLK 310

Query: 307 MEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKEL 128
           +++ +GR       L  S E + +   +L ++ +  + Q      ++K+  +DDA  K+ 
Sbjct: 311 IQKADGR---AQFELDSSGEVITLRCSELDEMVNAPDSQ-----RDKKRKSSDDAGSKKS 362

Query: 127 KYDKMRDRGEDYKKGKEYDD 68
           K  K R+  E  ++  + DD
Sbjct: 363 K--KAREEEESEEEDDDDDD 380


>ref|XP_003078104.1| G-patch nucleic acid binding protein (ISS) [Ostreococcus tauri]
            gi|116056555|emb|CAL52844.1| G-patch nucleic acid binding
            protein (ISS) [Ostreococcus tauri]
          Length = 511

 Score =  129 bits (325), Expect = 2e-27
 Identities = 89/286 (31%), Positives = 148/286 (51%), Gaps = 13/286 (4%)
 Frame = -3

Query: 1123 TGLQFVTEEEQSQTDKKVEYGLNLRNPQHNNIPSDDKSKPRVSENQNPENGNKIDSENLK 944
            TG+     E  S  D+ V   +  R      + +  + K R  +++  +N +  D +   
Sbjct: 102  TGVAGNEIEGTSGDDEPVRDKVIARIENTFEVGTGRRRKVRGRKSRERDNASTDDDDFRA 161

Query: 943  EIPRV--SRDEISENVKN--------SGNYDHNSGKIERVSENGKNSGRFDHLSKNFDNS 794
            +IP    ++DEI  + K         +G      G    +++ G   G       + D +
Sbjct: 162  QIPSFIPTKDEIERDNKRFHVAETIAAGGELAQQGVAYGLTKMGPKHGAVAAEKSDRDKA 221

Query: 793  SGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNES 614
               F  KS+ E         + FKE+++ LP+ A  +EYE++P+E+FG A+LRGMGW E 
Sbjct: 222  GESFIGKSLHEK------ELQAFKEDMEDLPEQASIEEYEDMPIEDFGKAMLRGMGWEEG 275

Query: 613  EGVGKC-RKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVREKRDLV--AAAG 443
            + VG+  R +VE + ++ R  ++GLGA PA K   + K IK PGE   +K D+V    A 
Sbjct: 276  KAVGRMHRGMVEAVEFIPRAARLGLGAQPAEKDAPQKKYIK-PGETREKKADMVRDQRAV 334

Query: 442  ENGRVRHVVGIGEKLVERAKKGPAVGKVVRIVDGKHSGLKGKVLKM 305
             +  +R+V  + EKLV+R + GP  GK++ I DG+H+G+ G++L+M
Sbjct: 335  ADAGMRNVKTLDEKLVKRKEVGPREGKMMYIADGQHAGISGRILRM 380


>ref|XP_004246061.1| PREDICTED: protein MOS2-like isoform 1 [Solanum lycopersicum]
           gi|460401091|ref|XP_004246062.1| PREDICTED: protein
           MOS2-like isoform 2 [Solanum lycopersicum]
          Length = 485

 Score =  126 bits (317), Expect = 2e-26
 Identities = 91/245 (37%), Positives = 135/245 (55%), Gaps = 10/245 (4%)
 Frame = -3

Query: 727 FKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGK 551
           FKE+++ LP+  G DEY ++PVE FG ALL+G GW E  G+G+  ++ V+ + Y R   K
Sbjct: 145 FKEDLKRLPEHNGIDEYTDMPVEGFGAALLKGYGWVEGRGIGRNAKEDVKVVEYKRWTAK 204

Query: 550 VGLGAVP-APKPEKES----KRIKRPGEEVREKRDLVAAAGENGRVRHVVGIGEKL-VER 389
            G+G +P  PKP  ++    K IK+ GE             E  +V H  G  EK+  E+
Sbjct: 205 EGIGFIPEVPKPSSKAEGGVKPIKKKGE-------------EGIKVDHSDGYIEKIDREK 251

Query: 388 AKKGPAVGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLG 209
             KG  VGK VR+V GK  G+KG+VL   E   R   V+L+L  ++++V +  RDLA+LG
Sbjct: 252 GGKGLYVGKKVRVVRGKEMGMKGEVL---EVNSRGELVILKL--ADKEVKLQARDLAELG 306

Query: 208 SFEEEQFLKRYWE---RKQDRNDDAHKKELKYDKMRDRGEDYKKGKEYDDIHRRDSRDKK 38
           S EEE+ LK+  E   R++  + D  +K+    + RD     +K +       R SRD++
Sbjct: 307 SVEEERCLKKLLELKIREEKSHLDGVRKQSSGSRSRDEATTERKKES------RRSRDER 360

Query: 37  PDNSS 23
            D  S
Sbjct: 361 SDKVS 365


>ref|XP_002947360.1| hypothetical protein VOLCADRAFT_87660 [Volvox carteri f.
           nagariensis] gi|300267224|gb|EFJ51408.1| hypothetical
           protein VOLCADRAFT_87660 [Volvox carteri f. nagariensis]
          Length = 733

 Score =  124 bits (310), Expect = 1e-25
 Identities = 75/173 (43%), Positives = 104/173 (60%), Gaps = 7/173 (4%)
 Frame = -3

Query: 733 RVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESEGVGKCRKVVEPIAYVRRDG 554
           R F+E +  LPDA   + YE +PVEEFG A+LRGMGW E  GVG+ R+ V+ I YVRR  
Sbjct: 145 RAFRESVVELPDAMDVEAYEAMPVEEFGKAMLRGMGWEEGMGVGRNRQKVDAIEYVRRPE 204

Query: 553 KVGLGAVP---APKPEKESKRIKRPGEEVREKRDLVAAAGENGRVRHVVGIGEKLVERAK 383
           ++GLGA P   AP P K  K  ++P     +++DLV A   +GR R++  + E+LV R+ 
Sbjct: 205 RLGLGAQPVALAPDPSKPVKMGEKP-----QRQDLVLAPDADGRQRNIRKLDEQLVARST 259

Query: 382 --KGPAVGKVVRIVDGKHSGLKGKVLKMEER--EGRDAKVLLRLVQSEEDVLV 236
              GP  GK +RI  G H+GL    L+   R  EG+  +  +RL  S+E+V V
Sbjct: 260 VLPGPQPGKDMRITGGPHAGLACTALEALPRAPEGKPERWRVRLTASQEEVEV 312


>ref|XP_004335226.1| KOW motif domain containing protein [Acanthamoeba castellanii str.
           Neff] gi|440791981|gb|ELR13213.1| KOW motif domain
           containing protein [Acanthamoeba castellanii str. Neff]
          Length = 491

 Score =  123 bits (309), Expect = 2e-25
 Identities = 85/258 (32%), Positives = 138/258 (53%), Gaps = 11/258 (4%)
 Frame = -3

Query: 742 DRDRVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESEGVGKCRK-VVEPIAYV 566
           D D  F+ +++S P+      Y+ VPVE+FG A+LRGM W   + +G   K VV+P+ ++
Sbjct: 91  DEDEKFRFDVESRPEETTASAYDRVPVEKFGEAMLRGMLWKPGDPIGNTNKAVVKPVEFI 150

Query: 565 RRDGKVGLGAVP-APKPEKESKRIKRPGEEVREKRDLVAAAGENGRVRHVVGIGEKLVER 389
            R  ++GLGA P AP+P K  K+  +PGE    K+ +V    ++GR RHV  + +KLV  
Sbjct: 151 ARHHRLGLGAAPKAPEPTK--KKFIKPGESREPKQMMVLPRDKDGRQRHVRDLDQKLVAF 208

Query: 388 AKKGPAVGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLG 209
             +G   G +V +V G+H G+  +++++   +    KV +R  +SEE+V V+  DLA L 
Sbjct: 209 RPEGIHPGNLVGVVSGEHEGMYARIVQLLAGD----KVRIRF-ESEEEVTVNRIDLAPLN 263

Query: 208 SFEEEQFLKRYWERKQDRNDDAHKKELKYDKMRDRGED---------YKKGKEYDDIHRR 56
           S + +Q       R    + D  ++  + D   D GED          +KG E++     
Sbjct: 264 STQLKQGHPALGIRDGGDDGDVEEEGAEDDDDDDDGEDDDDHHKAAGGEKGVEWEQ-ETT 322

Query: 55  DSRDKKPDNSSGKYDSRD 2
           ++ D K   S+GK  +RD
Sbjct: 323 NTIDGKSGGSAGKSSARD 340


>ref|XP_635990.1| hypothetical protein DDB_G0289933 [Dictyostelium discoideum AX4]
            gi|60464330|gb|EAL62479.1| hypothetical protein
            DDB_G0289933 [Dictyostelium discoideum AX4]
          Length = 542

 Score =  122 bits (307), Expect = 3e-25
 Identities = 99/335 (29%), Positives = 164/335 (48%), Gaps = 31/335 (9%)
 Frame = -3

Query: 1132 STDTGLQFVTEEEQSQTDKKVEYGLNLRNPQHNNIPSDDKSKPRVSE----------NQN 983
            + ++ L  +++ ++ +  + + Y   L   +  ++ +D+ S    S+          N+N
Sbjct: 44   NNNSKLSTLSKRKEPEKKEPINYITTLEGTKVTSLYNDETSTLGSSKLKVIPLTEQLNEN 103

Query: 982  PENGNKIDSENLKEIPRVSRDEISENVKNSGNYDHNSGKIERVSENGK------------ 839
             EN +KI ++  KEI +    E  EN KN  N D +S K  R  E  +            
Sbjct: 104  YENSDKIVAQVAKEIKK----EEKENEKNENNNDDSSNKKLRTEEKLQPFYKASQTGLQL 159

Query: 838  NSGR-------FDHLSKNFDNSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDE 680
            N  R        D  ++N  +SSG      + +    +N+ D+ FK +++S PD +  D+
Sbjct: 160  NPNRKQEIKSIVDGKNENSGSSSGIPLIMKLKDIDKYENETDK-FKHDVESRPDESNLDD 218

Query: 679  YENVPVEEFGMALLRGMGWNESEGVGKCRK-VVEPIAYVRRDG-KVGLGAVPAPKPEKES 506
            YE  PV   G ALLRGMGW   + +G   K +VEPI Y++R G ++GLGA    KP  + 
Sbjct: 219  YEETPVSIIGEALLRGMGWVPGKSIGSTNKGLVEPIEYIKRPGFRLGLGA----KPMDDD 274

Query: 505  KRIKRPGEEVREKRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAVGKVVRIVDGKHSGL 326
              I++ G  + +          +G+VRHV GI EK+V   KK    G +V ++ G H G+
Sbjct: 275  DEIRKMGGPIED---------ADGKVRHVKGINEKIVSNVKK-MEEGSLVTVIGGPHKGM 324

Query: 325  KGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDL 221
              K++ M + +    KV + + +S+E V+VD  DL
Sbjct: 325  NAKIVSMLKND----KVQI-IFKSDEKVIVDKFDL 354


>ref|XP_006355237.1| PREDICTED: protein MOS2-like [Solanum tuberosum]
          Length = 484

 Score =  118 bits (296), Expect = 6e-24
 Identities = 85/241 (35%), Positives = 132/241 (54%), Gaps = 6/241 (2%)
 Frame = -3

Query: 727 FKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGK 551
           FKE+++ LP+  G DEY ++PVE FG ALL+G GW E  G+G+  ++ V+ + Y +   K
Sbjct: 145 FKEDLKRLPEHNGIDEYTDMPVEGFGAALLKGYGWVEGRGIGRNAKEDVKVVEYKKWTAK 204

Query: 550 VGLGAVP-APKPEKESKRIKRPGEEVREKRDLVAAAGENGRVRHVVGIGEKL-VERAKKG 377
            G+G +P  PKP  + +   +    +++  D V       +V H  G  EK+  E+A  G
Sbjct: 205 EGIGFIPEVPKPSSKGEGAVK---SIKKSEDGV-------KVDHSDGNIEKIDREKAGNG 254

Query: 376 PAVGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEE 197
             VGK VR+V GK  G+KG++L   E       V+L+L  ++++V +  RDLA+LGS EE
Sbjct: 255 LYVGKKVRVVRGKEMGMKGEIL---EVNSSGDLVILKL--ADKEVKLQARDLAELGSVEE 309

Query: 196 EQFLKRYWE---RKQDRNDDAHKKELKYDKMRDRGEDYKKGKEYDDIHRRDSRDKKPDNS 26
           E+ LK+  E   R++  N D  +K+    + RD      K +       R SRD++ D  
Sbjct: 310 ERCLKKLLELKIREEKSNLDGVRKQSSGGRSRDEATTESKKES------RRSRDERSDKV 363

Query: 25  S 23
           S
Sbjct: 364 S 364


>gb|EFA83563.1| hypothetical protein PPL_02629 [Polysphondylium pallidum PN500]
          Length = 584

 Score =  115 bits (289), Expect = 4e-23
 Identities = 95/344 (27%), Positives = 153/344 (44%), Gaps = 11/344 (3%)
 Frame = -3

Query: 1312 SSKTRITRPAPIIEEEDDQKPEYVSEFDXXXXXXXXXXKVIAKLDNTWRPELKMKNIHP- 1136
            S +  + +P  +I   + +    ++E +             +K  NT+   +K +     
Sbjct: 89   SDEKPLPKPPKVIPLSEQRHINSITEIETSISTTISTTTSTSKSTNTFENIIKTEREEKD 148

Query: 1135 -----QSTDTGLQFVTEEEQSQTDKKVEYGLNLRNPQHNNIPSDDKSKPRVSENQNPENG 971
                 +S +       E++ S T K  ++GL +       I   DK+K +   N +  N 
Sbjct: 149  KFKRLKSDNNSSSDKNEDDNSNTFKPKQHGLQI-------IKQKDKNKQKDDNNSSGSN- 200

Query: 970  NKIDSENLKEIPRVSRDEISENVKNSGNYDHNSGKIERVSENGKNSGRFDHLSKNFDNSS 791
                           R EI   + N  N DH   ++ +V +      + D L        
Sbjct: 201  ---------------RQEIKSLIGNKKNDDHKDVEMNQVVKVRPLIEKLDGLD------- 238

Query: 790  GRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESE 611
             RFDN         D+D+   FK +++S P+ A  ++YE  P+E FG A+LRGMGW   +
Sbjct: 239  -RFDN---------DDDK---FKFDVESRPEEADNEDYEETPIEVFGEAMLRGMGWQPGQ 285

Query: 610  GVGKCRK-VVEPIAYVRRDG-KVGLGAVPAPKPEKESKRIKRPGEEVREKRDLVAAAGEN 437
             +G   K + EPI +V+R G ++GLGA P    +K             +KR LVA  GE+
Sbjct: 286  AIGLTNKGLNEPIQFVKRPGYRLGLGAQPKDVDDK-------------DKRYLVAQKGED 332

Query: 436  GRVRHVVGIGEKLVERAKKGPA---VGKVVRIVDGKHSGLKGKV 314
            G+VRH+VGI EKLV   K G     VG+ V ++ G+H G+  ++
Sbjct: 333  GKVRHMVGISEKLVPMNKSGSQSYNVGERVLVISGQHEGMYAEI 376


>gb|EXB45122.1| Protein MOS2 [Morus notabilis]
          Length = 423

 Score =  114 bits (284), Expect = 1e-22
 Identities = 83/256 (32%), Positives = 128/256 (50%), Gaps = 7/256 (2%)
 Frame = -3

Query: 802 DNSSGRFDNKSVSESVHGDNDRD---RVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRG 632
           D  +G+ + K  +E +      D   +  K ++Q LPD  G  E+E+VPVE FG ALL G
Sbjct: 130 DEINGQDEAKDKNERLRFTPTEDVLLQKLKFDLQRLPDDRGMAEFEDVPVEGFGAALLSG 189

Query: 631 MGWNESEGVGK-CRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVREKRDLV 455
            GW+E  G+GK  ++ V+ + Y +R GK GLG V    P   +         + + +D  
Sbjct: 190 YGWHEGRGIGKNAKEDVKVVEYTKRTGKQGLGFVSNVLPPLPNSNRDSLNNSILKPKDNN 249

Query: 454 AAAGENGRVRHVVGIGEKLVERAKKGPAVGKVVRIVDGKHSGLKGKVLKMEEREGRDAKV 275
                N                + K   +GK VRIV G+  GLKG+VL   E+   D ++
Sbjct: 250 TNNNNN--------------SSSNKESLIGKEVRIVHGRELGLKGRVL---EKLSDDNRL 292

Query: 274 LLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKELKYDKMRDR--- 104
           ++RL +S+E V V+ RD+ +LGS E+E  LKR  E +    ++  +K+ K  + + R   
Sbjct: 293 VVRLSRSQETVKVNIRDVTELGSEEDEACLKRLKELRIRGEEEKKEKKSKRRENKSRDSD 352

Query: 103 GEDYKKGKEYDDIHRR 56
           GE  + GK +   H R
Sbjct: 353 GEKQQPGKSWLRSHIR 368


>gb|EXC18489.1| Protein MOS2 [Morus notabilis]
          Length = 476

 Score =  112 bits (281), Expect = 3e-22
 Identities = 100/358 (27%), Positives = 161/358 (44%), Gaps = 45/358 (12%)
 Frame = -3

Query: 1024 SDDKSKP-RVSENQNPENGNKIDSENLKEIPRVSRDEISENVKNSGNYDHNSGKIERVSE 848
            S  K KP + S+N   +N NK    +      V     SE +  +GN   N+  I  +  
Sbjct: 11   SSSKLKPIKPSQNFEDDNDNKSTENDANSRKYVIEFNASETL--TGNATQNAVVIPPIQN 68

Query: 847  NGKNSGRFDHL----SKNFDNSSG-RFDNKSVSESVH--------------GDNDRD--- 734
              +   R  +L    +   D S G +F+ +S+S++ +              GD+D +   
Sbjct: 69   EWRPHKRMKNLDLPIAAQSDGSGGLQFEVESLSDATNSSMSYGLNLRQTAKGDHDDEING 128

Query: 733  ---------------------RVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNE 617
                                 +  K ++Q LP+  G  E+E+VPVE FG ALL G GW+E
Sbjct: 129  QDEAKDKNERLRFTPTEDVLLQKLKFDLQRLPEDRGMAEFEDVPVEGFGAALLSGYGWHE 188

Query: 616  SEGVGK-CRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVREKRDLVAAAGE 440
              G+GK  ++ V+ + Y +R GK GLG V    P   +         + + +D       
Sbjct: 189  GRGIGKNAKEDVKVVEYTKRTGKQGLGFVMTDLPPLPNSNRDSLNNSIPKPKDNNNNNNN 248

Query: 439  NGRVRHVVGIGEKLVERAKKGPAVGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLV 260
            N                + K   +GK VRIV G+  GLKG+VL   E+   D ++++RL 
Sbjct: 249  N--------------SSSNKESLIGKEVRIVRGRELGLKGRVL---EKLSDDNRLVVRLS 291

Query: 259  QSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKELKYDKMRDRGEDYKK 86
            +S+E V V+ +D+A+LGS E+E  LKR  E +    ++  +K+ K  + + R  D +K
Sbjct: 292  RSQETVKVNIQDVAELGSEEDEACLKRLKELRIREEEEKKEKKSKRRENKSRDSDGEK 349


>ref|XP_004350567.1| hypothetical protein DFA_11620 [Dictyostelium fasciculatum]
           gi|328865473|gb|EGG13859.1| hypothetical protein
           DFA_11620 [Dictyostelium fasciculatum]
          Length = 531

 Score =  109 bits (272), Expect = 3e-21
 Identities = 66/177 (37%), Positives = 100/177 (56%), Gaps = 1/177 (0%)
 Frame = -3

Query: 748 DNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESEGVGKCRK-VVEPIA 572
           +ND D+ FK ++ + PD A  D+YE  P++ FG A+L GMGW   +G+G   K VVEP+ 
Sbjct: 200 NNDEDK-FKFDLSTRPDEANQDDYEETPIDIFGKAMLMGMGWKPGQGIGLTNKGVVEPVQ 258

Query: 571 YVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVREKRDLVAAAGENGRVRHVVGIGEKLVE 392
           +++R G++GLGA P+    K             EK+ + A  GE+G+VRH VG+ EKLV 
Sbjct: 259 FLKRAGRLGLGAQPSDVANK-------------EKKYMTAPKGEDGKVRHTVGLSEKLVP 305

Query: 391 RAKKGPAVGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDL 221
             K G   G  V ++ G H GL   V  + + +    ++++R  +S+E   VD  DL
Sbjct: 306 -LKFGLQPGDRVLVISGPHEGLNATVESLAQSD----RIVIRF-KSDELAAVDKCDL 356


>ref|XP_003589709.1| Pre-mRNA-splicing factor spp2 [Medicago truncatula]
           gi|355478757|gb|AES59960.1| Pre-mRNA-splicing factor
           spp2 [Medicago truncatula]
          Length = 385

 Score =  108 bits (271), Expect = 4e-21
 Identities = 97/264 (36%), Positives = 131/264 (49%), Gaps = 10/264 (3%)
 Frame = -3

Query: 781 DNKSVSESVHGDNDRDRV---------FKEEIQSLPDAAGFDEYENVPVEEFGMALLRGM 629
           D K  S+ V  D  R +          FK++++ LPD  GFDEY++VPVE FG ALL G 
Sbjct: 42  DKKPQSDDVVVDAPRPKASVEVSMLQKFKDDMERLPDDMGFDEYKDVPVEGFGAALLGGY 101

Query: 628 GWNESEGVGK-CRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVREKRDLVA 452
           GW E  G+GK  ++ V+ +   RR GK GLG V A  P   SK+ +R             
Sbjct: 102 GWKEGMGIGKNAKEDVKVVEVKRRTGKEGLGFV-ADLPPPSSKKGER------------- 147

Query: 451 AAGENGRVRHVVGIGEKLVERAKKGPAVGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVL 272
               NGR       GE   ER KK     +VVRIV G+  GLK  V+    R+G D  V+
Sbjct: 148 ----NGR-------GE--TERKKKEE---RVVRIVRGRDVGLKASVVG---RDGEDV-VV 187

Query: 271 LRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKELKYDKMRDRGEDY 92
           LR++ S E+V V   D+A+LGS EEE+ L++              K+LK      RG D 
Sbjct: 188 LRVLGSGEEVKVKVEDVAELGSVEEERCLRKL-------------KDLKI-----RGRDE 229

Query: 91  KKGKEYDDIHRRDSRDKKPDNSSG 20
           +KG +      RD  D++  N +G
Sbjct: 230 EKGSK--SKRGRDGVDERRVNGNG 251


>ref|XP_004169661.1| PREDICTED: protein MOS2-like [Cucumis sativus]
          Length = 478

 Score =  108 bits (270), Expect = 6e-21
 Identities = 74/221 (33%), Positives = 121/221 (54%), Gaps = 4/221 (1%)
 Frame = -3

Query: 727 FKEEIQSLPDAAGFDEYENVPVEEFGMALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGK 551
           FK +++ LP+  GF+++E VPVE F  AL+ G GW + +G+G+  ++ V+   Y RR  K
Sbjct: 148 FKADLERLPEDRGFEDFEEVPVESFAAALMNGYGWRQGKGIGRNAKEDVKVREYSRRTDK 207

Query: 550 VGLGAV-PAPKPEKESKRIKRPGEEVREKRDLVAAAGENGRVRHVVGIGEKLVERAKKGP 374
            GLG V   P    + +  K  G E   KRD        GRV+      E     +    
Sbjct: 208 QGLGFVSDVPVGISKKEEEKDGGRERERKRD-------EGRVK------ENRDRESDGLA 254

Query: 373 AVGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGR--DLADLGSFE 200
           ++GK VRIV G+ +GLKG+VL+  + +     ++L+L + +E V +  R  D+A+LGS E
Sbjct: 255 SIGKHVRIVRGRDAGLKGRVLEKLDSDW----LVLKLSKRDEHVKLKVRATDIAELGSKE 310

Query: 199 EEQFLKRYWERKQDRNDDAHKKELKYDKMRDRGEDYKKGKE 77
           EE+FLK+  E K    +   K+  + +++ ++ E+  + KE
Sbjct: 311 EEKFLKKLEELKVKNENTGQKRRREVEQVVEKRENGSRDKE 351

