BLASTX nr result

ID: Ephedra25_contig00011082

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00011082
         (1677 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006846498.1| hypothetical protein AMTR_s00018p00151280 [A...   214   7e-53
ref|XP_002505906.1| predicted protein [Micromonas sp. RCC299] gi...   150   2e-33
ref|XP_003062694.1| predicted protein [Micromonas pusilla CCMP15...   149   3e-33
ref|XP_002966729.1| hypothetical protein SELMODRAFT_85312 [Selag...   137   1e-29
ref|XP_002978017.1| hypothetical protein SELMODRAFT_176683 [Sela...   137   2e-29
ref|XP_001691256.1| hypothetical protein CHLREDRAFT_144949 [Chla...   132   3e-28
ref|XP_005647237.1| hypothetical protein COCSUDRAFT_66346 [Cocco...   131   8e-28
ref|XP_003078104.1| G-patch nucleic acid binding protein (ISS) [...   129   3e-27
emb|CCO66478.1| predicted protein [Bathycoccus prasinos]              126   2e-26
ref|XP_002947360.1| hypothetical protein VOLCADRAFT_87660 [Volvo...   125   7e-26
ref|XP_635990.1| hypothetical protein DDB_G0289933 [Dictyosteliu...   122   6e-25
ref|XP_004246061.1| PREDICTED: protein MOS2-like isoform 1 [Sola...   119   4e-24
ref|XP_004335226.1| KOW motif domain containing protein [Acantha...   113   2e-22
ref|XP_006355237.1| PREDICTED: protein MOS2-like [Solanum tubero...   112   6e-22
ref|XP_004350567.1| hypothetical protein DFA_11620 [Dictyosteliu...   111   1e-21
gb|EXC18489.1| Protein MOS2 [Morus notabilis]                         110   1e-21
gb|EFA83563.1| hypothetical protein PPL_02629 [Polysphondylium p...   110   1e-21
gb|EXB45122.1| Protein MOS2 [Morus notabilis]                         109   3e-21
gb|EOY06252.1| MOS2, putative isoform 1 [Theobroma cacao] gi|508...   107   2e-20
ref|XP_003589709.1| Pre-mRNA-splicing factor spp2 [Medicago trun...   106   3e-20

>ref|XP_006846498.1| hypothetical protein AMTR_s00018p00151280 [Amborella trichopoda]
            gi|548849308|gb|ERN08173.1| hypothetical protein
            AMTR_s00018p00151280 [Amborella trichopoda]
          Length = 540

 Score =  214 bits (546), Expect = 7e-53
 Identities = 150/451 (33%), Positives = 221/451 (49%), Gaps = 6/451 (1%)
 Frame = +3

Query: 342  KLSFSMASSSKTRITRPAPIIEE---EDDQKPEYVSEFDXXXXXXXXXXXVIAKLDNTWR 512
            KLSFS++S   +R  RP    E    E++ K E+V+EFD           VI + +++WR
Sbjct: 2    KLSFSLSSKRSSR-PRPTDFGERNTNEEEPKAEFVTEFDSSKTPSEKSRLVIPRQESSWR 60

Query: 513  PELKMKNIHPHSTDTGLQFVTEEEQSQTDKKVEYGLNLRNPKHNNIPSDDKSKPRVSENQ 692
             E  MKNI P  T    + +T E   ++D  V YGLNLRN          KS    S+ +
Sbjct: 61   AEKNMKNIKPEETHLEFEIITHETSIESD--VGYGLNLRN----------KSNGGDSKRE 108

Query: 693  NPENGNKIDSENLKEIPRVSRDEI-SENVKISGNYDHNSGKIERISENGKNSGRFDHLSK 869
            N + GN      L  +  V   E+ ++  K  GN    S K                  K
Sbjct: 109  NEDMGNS----GLSCMEPVEATEVDAKRKKDMGNSSFPSVK-----------------PK 147

Query: 870  NFDNSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGM 1049
            N D+                              L +  G DE+ ++P+E FG A+L G 
Sbjct: 148  NLDSE-----------------------------LEEDGGLDEFSDMPIEGFGAAVLAGY 178

Query: 1050 GWNESEGVG-KCRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVRENRDLVA 1226
            GW E +G+G K +K ++ + Y+RR G  GLG  P+  PEK+ K+  +PGE      +L+A
Sbjct: 179  GWTEGQGIGRKAKKDIQVVQYIRRAGMGGLGFTPSSVPEKKQKKYVKPGESRESRPELIA 238

Query: 1227 AAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVL 1406
              G NGR+RH VGI EKLV R  KG  +GK++R++ G H GLKG+++++   +G   K+ 
Sbjct: 239  PKGSNGRIRHAVGIDEKLVPREIKGFFVGKILRVIGGPHLGLKGQLIEIFGDDGSSQKIG 298

Query: 1407 LRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKELKYDKMXXXXXXX 1586
            L+L++SEE V+VD  +LA+LGS EE++ LKR  E K +  D    K L+ D+        
Sbjct: 299  LKLLKSEEMVVVDREELAELGSLEEDKCLKRMRELKLE-GDGNRLKHLRRDERESHNGEF 357

Query: 1587 XXXXXXXXIHRRDSR-DKKPDNSSGKYDSRD 1676
                    +H   SR D++ + SS K +  D
Sbjct: 358  GKERKAEPLHGDVSRHDRERERSSSKREKED 388


>ref|XP_002505906.1| predicted protein [Micromonas sp. RCC299] gi|226521177|gb|ACO67164.1|
            predicted protein [Micromonas sp. RCC299]
          Length = 493

 Score =  150 bits (379), Expect = 2e-33
 Identities = 81/225 (36%), Positives = 134/225 (59%), Gaps = 4/225 (1%)
 Frame = +3

Query: 867  KNFDNSSGRFDNKSVSESVHGDNDRDR---VFKEEIQSLPDAAGFDEYENVPVEEFGKAL 1037
            K+ D    + + K   ES  G +  ++    FKE+++ LP+ A  D+YE +P+E+FG A+
Sbjct: 118  KDGDTPKAKAEEKPAQESFIGKSLAEKELQAFKEDVEDLPEQATLDDYEQMPIEDFGAAM 177

Query: 1038 LRGMGWNESEGVGKCRK-VVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVRENR 1214
            LRGMGW E + VG+    +V  + +V R G++GLGA PAP  ++ +K+  +PGE      
Sbjct: 178  LRGMGWEEGKPVGRNNNGMVAAVEFVPRSGRLGLGADPAPSKQENTKKYIKPGESREAPA 237

Query: 1215 DLVAAAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRD 1394
             +V A G  G+ R+V  + EKLV+  + G   GK + +V+G+H GL G+VLK+ ++EGR 
Sbjct: 238  TMVLAKGPEGQSRNVKTLDEKLVKLEEPGAREGKRMCVVEGRHRGLTGRVLKVVKQEGRS 297

Query: 1395 AKVLLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRND 1529
             +  L L  S E V +   +LADLG+ + ++ +++  +R   + D
Sbjct: 298  DRAQLELDTSGEVVTIRTGELADLGTRDADRAMRKKDDRAGGKGD 342


>ref|XP_003062694.1| predicted protein [Micromonas pusilla CCMP1545]
            gi|226456211|gb|EEH53513.1| predicted protein [Micromonas
            pusilla CCMP1545]
          Length = 496

 Score =  149 bits (377), Expect = 3e-33
 Identities = 89/233 (38%), Positives = 135/233 (57%), Gaps = 4/233 (1%)
 Frame = +3

Query: 786  GNYDHNSGKIERISENGKNSG--RFDHLSKNFDNSSGRFDNKSVSESVHGDNDRDRVFKE 959
            G     SG +  +++ G   G  + +   K    ++  F  KS+ E         + F+E
Sbjct: 109  GETAQGSGTVYGLTKMGPKDGDVKLEDDEKPPPPANESFIGKSLQEK------ELQAFRE 162

Query: 960  EIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGKCRK-VVEPIAYVRRDGKVGL 1136
            ++Q LP+ A  D+YE++P+E+FG A+LRGMGW E + VG+  K +V  + +V R G++GL
Sbjct: 163  DVQDLPEQASLDDYESMPIEDFGAAMLRGMGWEEGKPVGRNSKGLVAAVEFVPRAGRLGL 222

Query: 1137 GAVPAPKPEKESKRIKRPGEEVRENRD-LVAAAGENGRVRHVVGIGEKLVERAKKGPAIG 1313
            GA PAPKPE    R  +PG E RE  D +V   G  G+ R+V  + EKLV+R + GP  G
Sbjct: 223  GAEPAPKPEVSDARRIKPG-ETRERADVMVLPEGPEGKSRNVKSLDEKLVKREEPGPREG 281

Query: 1314 KVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGS 1472
            K++ +V+G+H GL+ +VL + + EGR  +  + L  S E V V   +LAD GS
Sbjct: 282  KMMCVVEGRHRGLECRVLTLTKSEGRSERAQVELKTSGEVVTVRTSELADFGS 334


>ref|XP_002966729.1| hypothetical protein SELMODRAFT_85312 [Selaginella moellendorffii]
            gi|300166149|gb|EFJ32756.1| hypothetical protein
            SELMODRAFT_85312 [Selaginella moellendorffii]
          Length = 377

 Score =  137 bits (346), Expect = 1e-29
 Identities = 101/289 (34%), Positives = 149/289 (51%), Gaps = 3/289 (1%)
 Frame = +3

Query: 672  PRVSENQNPENGNKIDSENLKEIPRVSRDEISENVKISGNYDH-NSGKIERISENGKNSG 848
            PR+  +  PE   K    NL   P  S  E    V  +G  D    G   R S+ G  S 
Sbjct: 4    PRLENSWKPEKRMK----NLMSAPEDSVQEFVGEVLPAGPVDGVQYGLSVRSSKAGGGSV 59

Query: 849  RFDHLSKNFDNSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFG 1028
            + D +                +  +    D++ +  + ++ LP++A    Y+ +PVE+FG
Sbjct: 60   KTDIME---------------ARELRAKLDKEALV-DHLEFLPESASLAAYDAMPVEKFG 103

Query: 1029 KALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVR 1205
            +ALL GMGW +  G+G+   + V+   +VRR  ++GLGA    KP               
Sbjct: 104  QALLLGMGWRDGRGIGRRATEDVKATEFVRRPERLGLGAA---KP--------------- 145

Query: 1206 ENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEERE 1385
               DLVA  G +G+VRH V IGEKLVERAK+G   GKV+ IV G+H+GL+G+VL  +++ 
Sbjct: 146  ---DLVAPVGPDGKVRHRVEIGEKLVERAKRGVYGGKVMTIVSGRHAGLRGEVLGRKDKV 202

Query: 1386 GRDAKVL-LRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRND 1529
                +V+ ++L  S E V VD ++LAD+GS EEE  +KR  E +  R D
Sbjct: 203  ANPGEVVGVKLAGSGETVEVDAKNLADVGSVEEENAMKRLKELRIQRGD 251


>ref|XP_002978017.1| hypothetical protein SELMODRAFT_176683 [Selaginella moellendorffii]
            gi|300154038|gb|EFJ20674.1| hypothetical protein
            SELMODRAFT_176683 [Selaginella moellendorffii]
          Length = 378

 Score =  137 bits (344), Expect = 2e-29
 Identities = 81/193 (41%), Positives = 118/193 (61%), Gaps = 2/193 (1%)
 Frame = +3

Query: 957  EEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGKVG 1133
            + ++ LP++A    Y+ +PVE+FG+ALL GMGW +  G+G+   + V+   +VRR  ++G
Sbjct: 81   DHLEFLPESASLAAYDAMPVEKFGQALLLGMGWRDGRGIGRRATEDVKATEFVRRPERLG 140

Query: 1134 LGAVPAPKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAIG 1313
            LGA    KP                  DLVA  G +G+VRH V IGEKLVERAK+G   G
Sbjct: 141  LGAA---KP------------------DLVAPVGPDGKVRHRVEIGEKLVERAKRGVYGG 179

Query: 1314 KVVRIVDGKHSGLKGKVLKMEEREGRDAKVL-LRLVQSEEDVLVDGRDLADLGSFEEEQF 1490
            KV+ IV G+H+GL+G+VL  +++     +V+ ++L  S E V VD ++LAD+GS EEE  
Sbjct: 180  KVMTIVSGRHAGLRGEVLGRKDKVANPGEVVGVKLAGSGETVEVDAKNLADVGSVEEENA 239

Query: 1491 LKRYWERKQDRND 1529
            +KR  E +  R D
Sbjct: 240  MKRLKELRIQRGD 252


>ref|XP_001691256.1| hypothetical protein CHLREDRAFT_144949 [Chlamydomonas reinhardtii]
            gi|158279228|gb|EDP04989.1| predicted protein
            [Chlamydomonas reinhardtii]
          Length = 597

 Score =  132 bits (333), Expect = 3e-28
 Identities = 78/177 (44%), Positives = 111/177 (62%), Gaps = 3/177 (1%)
 Frame = +3

Query: 942  DRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGKCRKVVEPIAYVRRD 1121
            ++ +K+ ++ LP+ A  D YE +P+EEFG+A+LRGMGW E  GVG+ RK V+ I YVRR 
Sbjct: 141  EKAYKDSVEDLPEVADLDAYEAMPIEEFGRAMLRGMGWEEGMGVGRNRKQVDAIEYVRRP 200

Query: 1122 GKVGLGAVPAPKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAK-- 1295
             ++GLGA P    E +SK +K  G++ R+ +DLV A   +GR R+V  + EKLV RA   
Sbjct: 201  ERLGLGAQPVKVSEDKSKVVKM-GDKPRK-QDLVLAPDADGRQRNVRTLDEKLVSRATVL 258

Query: 1296 KGPAIGKVVRIVDGKHSGLKGKVLK-MEEREGRDAKVLLRLVQSEEDVLVDGRDLAD 1463
             GP   K +RI+ G HSGL    L+ + + EGR  +  +RL  S EDV V   +L +
Sbjct: 259  PGPQPNKPMRIMTGAHSGLLCTALEALPKPEGRPERWRVRLAASNEDVEVLASELGE 315


>ref|XP_005647237.1| hypothetical protein COCSUDRAFT_66346 [Coccomyxa subellipsoidea
            C-169] gi|384249211|gb|EIE22693.1| hypothetical protein
            COCSUDRAFT_66346 [Coccomyxa subellipsoidea C-169]
          Length = 505

 Score =  131 bits (330), Expect = 8e-28
 Identities = 80/206 (38%), Positives = 118/206 (57%), Gaps = 2/206 (0%)
 Frame = +3

Query: 951  FKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGK 1127
            FK++++SLP+ AG D YE +PVE FG+A+LRGMGW E + VGK  +++V    YVRR  K
Sbjct: 164  FKQDVESLPEVAGLDAYEAMPVEAFGEAMLRGMGWQEGKSVGKNAKEIVTAKEYVRRPEK 223

Query: 1128 VGLGAVPAPKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPA 1307
            +GLGA PA    K  K IK+ GE     +D++    E G  +HV     KLVER K+G  
Sbjct: 224  LGLGATPAAMLPKPKKYIKQ-GESREAKKDMIYMDAE-GHQKHVKDENAKLVEREKRGVH 281

Query: 1308 IGKVVRIVDGKHSGLKGKVLKMEEREG-RDAKVLLRLVQSEEDVLVDGRDLADLGSFEEE 1484
            +GK +R + G H+GL   VL +E + G R  +  +RL  S E V V  ++L +    E  
Sbjct: 282  VGKTMRCIAGTHAGLLCDVLALEPKVGDRSQRATVRLQPSAETVSVRVKELGE--KHESA 339

Query: 1485 QFLKRYWERKQDRNDDAHKKELKYDK 1562
               +    +++ +     KK+ K+D+
Sbjct: 340  PEPENGSSKRKHKEHKHEKKQHKHDR 365


>ref|XP_003078104.1| G-patch nucleic acid binding protein (ISS) [Ostreococcus tauri]
            gi|116056555|emb|CAL52844.1| G-patch nucleic acid binding
            protein (ISS) [Ostreococcus tauri]
          Length = 511

 Score =  129 bits (325), Expect = 3e-27
 Identities = 89/286 (31%), Positives = 148/286 (51%), Gaps = 13/286 (4%)
 Frame = +3

Query: 555  TGLQFVTEEEQSQTDKKVEYGLNLRNPKHNNIPSDDKSKPRVSENQNPENGNKIDSENLK 734
            TG+     E  S  D+ V   +  R      + +  + K R  +++  +N +  D +   
Sbjct: 102  TGVAGNEIEGTSGDDEPVRDKVIARIENTFEVGTGRRRKVRGRKSRERDNASTDDDDFRA 161

Query: 735  EIPRV--SRDEISENVK--------ISGNYDHNSGKIERISENGKNSGRFDHLSKNFDNS 884
            +IP    ++DEI  + K         +G      G    +++ G   G       + D +
Sbjct: 162  QIPSFIPTKDEIERDNKRFHVAETIAAGGELAQQGVAYGLTKMGPKHGAVAAEKSDRDKA 221

Query: 885  SGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNES 1064
               F  KS+ E         + FKE+++ LP+ A  +EYE++P+E+FGKA+LRGMGW E 
Sbjct: 222  GESFIGKSLHEK------ELQAFKEDMEDLPEQASIEEYEDMPIEDFGKAMLRGMGWEEG 275

Query: 1065 EGVGKC-RKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVRENRDLV--AAAG 1235
            + VG+  R +VE + ++ R  ++GLGA PA K   + K IK PGE   +  D+V    A 
Sbjct: 276  KAVGRMHRGMVEAVEFIPRAARLGLGAQPAEKDAPQKKYIK-PGETREKKADMVRDQRAV 334

Query: 1236 ENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKM 1373
             +  +R+V  + EKLV+R + GP  GK++ I DG+H+G+ G++L+M
Sbjct: 335  ADAGMRNVKTLDEKLVKRKEVGPREGKMMYIADGQHAGISGRILRM 380


>emb|CCO66478.1| predicted protein [Bathycoccus prasinos]
          Length = 510

 Score =  126 bits (317), Expect = 2e-26
 Identities = 93/301 (30%), Positives = 156/301 (51%), Gaps = 15/301 (4%)
 Frame = +3

Query: 696  PENGNKIDSENLKEIPRVSRDEISENVKISGNYDHNSGKIERISENGKNSG-RFDHLSKN 872
            P    KID +   E+      E ++  K +         + R++  G  +  R+++  K 
Sbjct: 84   PTESEKIDDDKRFEVSETIGGETAQGNKTTYG-------LTRMAPKGDEAKVRYENKKKE 136

Query: 873  FD-NSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGM 1049
             D N +  F  KS++E      + D  FKE+++ LP+ A  + YE++P+E+FG A+LRGM
Sbjct: 137  QDKNKNESFIGKSLAEK-----ELD-AFKEDVEDLPEQASIEAYESMPIEDFGAAMLRGM 190

Query: 1050 GWNESEGVGKCRKV--VEPIAYVRRDGKVGLGA----VPAPKPEKES----KRIKRPGEE 1199
            GW E EGVG+  K    +P+ +V R G++GLGA    VP     K +    K+I++PGE 
Sbjct: 191  GWKEGEGVGRNAKSGRADPVEFVPRMGRLGLGADTMDVPGATLRKNNNNNDKKIRKPGET 250

Query: 1200 VRENRDLVAAAGENGR---VRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLK 1370
              E  +++       R   +R+V  I EKLV++ + G   GK + +  GKH GL G+VLK
Sbjct: 251  REEKMEILVRDPNASRAPGMRNVKSIDEKLVKKEELGVKEGKRMYVAKGKHEGLTGRVLK 310

Query: 1371 MEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKEL 1550
            +++ +GR       L  S E + +   +L ++ +  + Q      ++K+  +DDA  K+ 
Sbjct: 311  IQKADGR---AQFELDSSGEVITLRCSELDEMVNAPDSQ-----RDKKRKSSDDAGSKKS 362

Query: 1551 K 1553
            K
Sbjct: 363  K 363


>ref|XP_002947360.1| hypothetical protein VOLCADRAFT_87660 [Volvox carteri f. nagariensis]
            gi|300267224|gb|EFJ51408.1| hypothetical protein
            VOLCADRAFT_87660 [Volvox carteri f. nagariensis]
          Length = 733

 Score =  125 bits (313), Expect = 7e-26
 Identities = 76/173 (43%), Positives = 104/173 (60%), Gaps = 7/173 (4%)
 Frame = +3

Query: 945  RVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGKCRKVVEPIAYVRRDG 1124
            R F+E +  LPDA   + YE +PVEEFGKA+LRGMGW E  GVG+ R+ V+ I YVRR  
Sbjct: 145  RAFRESVVELPDAMDVEAYEAMPVEEFGKAMLRGMGWEEGMGVGRNRQKVDAIEYVRRPE 204

Query: 1125 KVGLGAVP---APKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAK 1295
            ++GLGA P   AP P K  K  ++P     + +DLV A   +GR R++  + E+LV R+ 
Sbjct: 205  RLGLGAQPVALAPDPSKPVKMGEKP-----QRQDLVLAPDADGRQRNIRKLDEQLVARST 259

Query: 1296 --KGPAIGKVVRIVDGKHSGLKGKVLKMEER--EGRDAKVLLRLVQSEEDVLV 1442
               GP  GK +RI  G H+GL    L+   R  EG+  +  +RL  S+E+V V
Sbjct: 260  VLPGPQPGKDMRITGGPHAGLACTALEALPRAPEGKPERWRVRLTASQEEVEV 312


>ref|XP_635990.1| hypothetical protein DDB_G0289933 [Dictyostelium discoideum AX4]
            gi|60464330|gb|EAL62479.1| hypothetical protein
            DDB_G0289933 [Dictyostelium discoideum AX4]
          Length = 542

 Score =  122 bits (305), Expect = 6e-25
 Identities = 99/336 (29%), Positives = 165/336 (49%), Gaps = 31/336 (9%)
 Frame = +3

Query: 543  HSTDTGLQFVTEEEQSQTDKKVEYGLNLRNPKHNNIPSDDKSKPRVSE----------NQ 692
            ++ ++ L  +++ ++ +  + + Y   L   K  ++ +D+ S    S+          N+
Sbjct: 43   NNNNSKLSTLSKRKEPEKKEPINYITTLEGTKVTSLYNDETSTLGSSKLKVIPLTEQLNE 102

Query: 693  NPENGNKIDSENLKEIPRVSRDEISENVKISGNYDHNSGKIERISENGK----------- 839
            N EN +KI ++  KEI +    E  EN K   N D +S K  R  E  +           
Sbjct: 103  NYENSDKIVAQVAKEIKK----EEKENEKNENNNDDSSNKKLRTEEKLQPFYKASQTGLQ 158

Query: 840  -NSGR-------FDHLSKNFDNSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFD 995
             N  R        D  ++N  +SSG      + +    +N+ D+ FK +++S PD +  D
Sbjct: 159  LNPNRKQEIKSIVDGKNENSGSSSGIPLIMKLKDIDKYENETDK-FKHDVESRPDESNLD 217

Query: 996  EYENVPVEEFGKALLRGMGWNESEGVGKCRK-VVEPIAYVRRDG-KVGLGAVPAPKPEKE 1169
            +YE  PV   G+ALLRGMGW   + +G   K +VEPI Y++R G ++GLGA    KP  +
Sbjct: 218  DYEETPVSIIGEALLRGMGWVPGKSIGSTNKGLVEPIEYIKRPGFRLGLGA----KPMDD 273

Query: 1170 SKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSG 1349
               I++ G  + +          +G+VRHV GI EK+V   KK    G +V ++ G H G
Sbjct: 274  DDEIRKMGGPIED---------ADGKVRHVKGINEKIVSNVKKMEE-GSLVTVIGGPHKG 323

Query: 1350 LKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDL 1457
            +  K++ M + +    KV + + +S+E V+VD  DL
Sbjct: 324  MNAKIVSMLKND----KVQI-IFKSDEKVIVDKFDL 354


>ref|XP_004246061.1| PREDICTED: protein MOS2-like isoform 1 [Solanum lycopersicum]
            gi|460401091|ref|XP_004246062.1| PREDICTED: protein
            MOS2-like isoform 2 [Solanum lycopersicum]
          Length = 485

 Score =  119 bits (298), Expect = 4e-24
 Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%)
 Frame = +3

Query: 951  FKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGK 1127
            FKE+++ LP+  G DEY ++PVE FG ALL+G GW E  G+G+  ++ V+ + Y R   K
Sbjct: 145  FKEDLKRLPEHNGIDEYTDMPVEGFGAALLKGYGWVEGRGIGRNAKEDVKVVEYKRWTAK 204

Query: 1128 VGLGAVP-APKPEKES----KRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKL-VER 1289
             G+G +P  PKP  ++    K IK+ GE             E  +V H  G  EK+  E+
Sbjct: 205  EGIGFIPEVPKPSSKAEGGVKPIKKKGE-------------EGIKVDHSDGYIEKIDREK 251

Query: 1290 AKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLG 1469
              KG  +GK VR+V GK  G+KG+VL   E   R   V+L+L  ++++V +  RDLA+LG
Sbjct: 252  GGKGLYVGKKVRVVRGKEMGMKGEVL---EVNSRGELVILKL--ADKEVKLQARDLAELG 306

Query: 1470 SFEEEQFLKRYWERK 1514
            S EEE+ LK+  E K
Sbjct: 307  SVEEERCLKKLLELK 321


>ref|XP_004335226.1| KOW motif domain containing protein [Acanthamoeba castellanii str.
            Neff] gi|440791981|gb|ELR13213.1| KOW motif domain
            containing protein [Acanthamoeba castellanii str. Neff]
          Length = 491

 Score =  113 bits (283), Expect = 2e-22
 Identities = 67/186 (36%), Positives = 110/186 (59%), Gaps = 2/186 (1%)
 Frame = +3

Query: 936  DRDRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGKCRK-VVEPIAYV 1112
            D D  F+ +++S P+      Y+ VPVE+FG+A+LRGM W   + +G   K VV+P+ ++
Sbjct: 91   DEDEKFRFDVESRPEETTASAYDRVPVEKFGEAMLRGMLWKPGDPIGNTNKAVVKPVEFI 150

Query: 1113 RRDGKVGLGAVP-APKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVER 1289
             R  ++GLGA P AP+P K  K+  +PGE     + +V    ++GR RHV  + +KLV  
Sbjct: 151  ARHHRLGLGAAPKAPEPTK--KKFIKPGESREPKQMMVLPRDKDGRQRHVRDLDQKLVAF 208

Query: 1290 AKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLG 1469
              +G   G +V +V G+H G+  +++++   +    KV +R  +SEE+V V+  DLA L 
Sbjct: 209  RPEGIHPGNLVGVVSGEHEGMYARIVQLLAGD----KVRIRF-ESEEEVTVNRIDLAPLN 263

Query: 1470 SFEEEQ 1487
            S + +Q
Sbjct: 264  STQLKQ 269


>ref|XP_006355237.1| PREDICTED: protein MOS2-like [Solanum tuberosum]
          Length = 484

 Score =  112 bits (279), Expect = 6e-22
 Identities = 75/205 (36%), Positives = 120/205 (58%), Gaps = 6/205 (2%)
 Frame = +3

Query: 951  FKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGK 1127
            FKE+++ LP+  G DEY ++PVE FG ALL+G GW E  G+G+  ++ V+ + Y +   K
Sbjct: 145  FKEDLKRLPEHNGIDEYTDMPVEGFGAALLKGYGWVEGRGIGRNAKEDVKVVEYKKWTAK 204

Query: 1128 VGLGAVP-APKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKL-VERAKKG 1301
             G+G +P  PKP  + +   +    ++++ D V       +V H  G  EK+  E+A  G
Sbjct: 205  EGIGFIPEVPKPSSKGEGAVK---SIKKSEDGV-------KVDHSDGNIEKIDREKAGNG 254

Query: 1302 PAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEE 1481
              +GK VR+V GK  G+KG++L   E       V+L+L  ++++V +  RDLA+LGS EE
Sbjct: 255  LYVGKKVRVVRGKEMGMKGEIL---EVNSSGDLVILKL--ADKEVKLQARDLAELGSVEE 309

Query: 1482 EQFLKRYWE---RKQDRNDDAHKKE 1547
            E+ LK+  E   R++  N D  +K+
Sbjct: 310  ERCLKKLLELKIREEKSNLDGVRKQ 334


>ref|XP_004350567.1| hypothetical protein DFA_11620 [Dictyostelium fasciculatum]
            gi|328865473|gb|EGG13859.1| hypothetical protein
            DFA_11620 [Dictyostelium fasciculatum]
          Length = 531

 Score =  111 bits (277), Expect = 1e-21
 Identities = 67/177 (37%), Positives = 100/177 (56%), Gaps = 1/177 (0%)
 Frame = +3

Query: 930  DNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGKCRK-VVEPIA 1106
            +ND D+ FK ++ + PD A  D+YE  P++ FGKA+L GMGW   +G+G   K VVEP+ 
Sbjct: 200  NNDEDK-FKFDLSTRPDEANQDDYEETPIDIFGKAMLMGMGWKPGQGIGLTNKGVVEPVQ 258

Query: 1107 YVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVE 1286
            +++R G++GLGA P+    KE K +  P              GE+G+VRH VG+ EKLV 
Sbjct: 259  FLKRAGRLGLGAQPSDVANKEKKYMTAP-------------KGEDGKVRHTVGLSEKLVP 305

Query: 1287 RAKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDL 1457
              K G   G  V ++ G H GL   V  + + +    ++++R  +S+E   VD  DL
Sbjct: 306  -LKFGLQPGDRVLVISGPHEGLNATVESLAQSD----RIVIRF-KSDELAAVDKCDL 356


>gb|EXC18489.1| Protein MOS2 [Morus notabilis]
          Length = 476

 Score =  110 bits (276), Expect = 1e-21
 Identities = 102/352 (28%), Positives = 159/352 (45%), Gaps = 52/352 (14%)
 Frame = +3

Query: 654  SDDKSKP-RVSENQNPENGNKIDSENLKEIPRVSRDEISENVKISGNYDHNSGKIERISE 830
            S  K KP + S+N   +N NK    +      V     SE   ++GN   N+  I  I  
Sbjct: 11   SSSKLKPIKPSQNFEDDNDNKSTENDANSRKYVIEFNASET--LTGNATQNAVVIPPIQN 68

Query: 831  NGKNSGRFDHL----SKNFDNSSG-RFDNKSVSESVH--------------GDNDRD--- 944
              +   R  +L    +   D S G +F+ +S+S++ +              GD+D +   
Sbjct: 69   EWRPHKRMKNLDLPIAAQSDGSGGLQFEVESLSDATNSSMSYGLNLRQTAKGDHDDEING 128

Query: 945  ---------------------RVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNE 1061
                                 +  K ++Q LP+  G  E+E+VPVE FG ALL G GW+E
Sbjct: 129  QDEAKDKNERLRFTPTEDVLLQKLKFDLQRLPEDRGMAEFEDVPVEGFGAALLSGYGWHE 188

Query: 1062 SEGVGK-CRKVVEPIAYVRRDGKVGLGAV-----PAPKPEKES--KRIKRPGEEVRENRD 1217
              G+GK  ++ V+ + Y +R GK GLG V     P P   ++S    I +P +    N +
Sbjct: 189  GRGIGKNAKEDVKVVEYTKRTGKQGLGFVMTDLPPLPNSNRDSLNNSIPKPKDNNNNNNN 248

Query: 1218 LVAAAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDA 1397
                                    + K   IGK VRIV G+  GLKG+VL   E+   D 
Sbjct: 249  ---------------------NSSSNKESLIGKEVRIVRGRELGLKGRVL---EKLSDDN 284

Query: 1398 KVLLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKELK 1553
            ++++RL +S+E V V+ +D+A+LGS E+E  LKR  E +    ++  +K+ K
Sbjct: 285  RLVVRLSRSQETVKVNIQDVAELGSEEDEACLKRLKELRIREEEEKKEKKSK 336


>gb|EFA83563.1| hypothetical protein PPL_02629 [Polysphondylium pallidum PN500]
          Length = 584

 Score =  110 bits (276), Expect = 1e-21
 Identities = 81/249 (32%), Positives = 127/249 (51%), Gaps = 14/249 (5%)
 Frame = +3

Query: 660  DKSKPRVSENQNPENGNKIDSENLKEIPRVSRDEISENVKISGNYDHNSG-----KIERI 824
            DK K   S+N +  + N+ D+ N  +  +     I +  K     D+NS      +I+ +
Sbjct: 148  DKFKRLKSDNNSSSDKNEDDNSNTFKPKQHGLQIIKQKDKNKQKDDNNSSGSNRQEIKSL 207

Query: 825  SENGKNSGRFDHLSKNFDNSSGRFDNKSVSESVHG----DNDRDRVFKEEIQSLPDAAGF 992
              N KN    DH     +        + + E + G    DND D+ FK +++S P+ A  
Sbjct: 208  IGNKKND---DHKDVEMNQV---VKVRPLIEKLDGLDRFDNDDDK-FKFDVESRPEEADN 260

Query: 993  DEYENVPVEEFGKALLRGMGWNESEGVGKCRK-VVEPIAYVRRDG-KVGLGAVPAPKPEK 1166
            ++YE  P+E FG+A+LRGMGW   + +G   K + EPI +V+R G ++GLGA P    +K
Sbjct: 261  EDYEETPIEVFGEAMLRGMGWQPGQAIGLTNKGLNEPIQFVKRPGYRLGLGAQPKDVDDK 320

Query: 1167 ESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPA---IGKVVRIVDG 1337
                         + R LVA  GE+G+VRH+VGI EKLV   K G     +G+ V ++ G
Sbjct: 321  -------------DKRYLVAQKGEDGKVRHMVGISEKLVPMNKSGSQSYNVGERVLVISG 367

Query: 1338 KHSGLKGKV 1364
            +H G+  ++
Sbjct: 368  QHEGMYAEI 376


>gb|EXB45122.1| Protein MOS2 [Morus notabilis]
          Length = 423

 Score =  109 bits (273), Expect = 3e-21
 Identities = 80/237 (33%), Positives = 121/237 (51%), Gaps = 11/237 (4%)
 Frame = +3

Query: 876  DNSSGRFDNKSVSESVHGDNDRD---RVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRG 1046
            D  +G+ + K  +E +      D   +  K ++Q LPD  G  E+E+VPVE FG ALL G
Sbjct: 130  DEINGQDEAKDKNERLRFTPTEDVLLQKLKFDLQRLPDDRGMAEFEDVPVEGFGAALLSG 189

Query: 1047 MGWNESEGVGK-CRKVVEPIAYVRRDGKVGLGAV-----PAPKPEKES--KRIKRPGEEV 1202
             GW+E  G+GK  ++ V+ + Y +R GK GLG V     P P   ++S    I +P +  
Sbjct: 190  YGWHEGRGIGKNAKEDVKVVEYTKRTGKQGLGFVSNVLPPLPNSNRDSLNNSILKPKDNN 249

Query: 1203 RENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEER 1382
              N +                        + K   IGK VRIV G+  GLKG+VL   E+
Sbjct: 250  TNNNN---------------------NSSSNKESLIGKEVRIVHGRELGLKGRVL---EK 285

Query: 1383 EGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKELK 1553
               D ++++RL +S+E V V+ RD+ +LGS E+E  LKR  E +    ++  +K+ K
Sbjct: 286  LSDDNRLVVRLSRSQETVKVNIRDVTELGSEEDEACLKRLKELRIRGEEEKKEKKSK 342


>gb|EOY06252.1| MOS2, putative isoform 1 [Theobroma cacao]
            gi|508714356|gb|EOY06253.1| MOS2, putative isoform 1
            [Theobroma cacao]
          Length = 465

 Score =  107 bits (266), Expect = 2e-20
 Identities = 75/189 (39%), Positives = 108/189 (57%), Gaps = 2/189 (1%)
 Frame = +3

Query: 954  KEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGKV 1130
            KE+++ LP+  GF+E+E+VPVE FGKALL G GW E  G+GK  ++ V+   Y RR  K 
Sbjct: 139  KEDLKRLPEDRGFEEFEDVPVEGFGKALLAGYGWVEGRGIGKNAKEDVKVKQYERRTDKE 198

Query: 1131 GLGAVPAPKPEKESKRIKRPG-EEVRENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPA 1307
            GLG        KE+K  + PG   V++  D                  E++V+  K G  
Sbjct: 199  GLGF-----SSKENKE-RLPGFTNVKQKHDT-----------------EEIVKEDKDGFF 235

Query: 1308 IGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEEEQ 1487
            +GK VR+++G+  GLKG +  ME+  G    ++LRL +SEE V V   ++ADLGS EEE+
Sbjct: 236  VGKDVRVIEGREMGLKGTI--MEKLGG--GWIVLRLKKSEEKVKVRLFEIADLGSREEEK 291

Query: 1488 FLKRYWERK 1514
             L++  E K
Sbjct: 292  CLRKLTELK 300



 Score = 69.7 bits (169), Expect = 4e-09
 Identities = 93/325 (28%), Positives = 132/325 (40%), Gaps = 26/325 (8%)
 Frame = +3

Query: 342  KLSFSMASSSKTRITRPAPIIE--EEDDQKPEYVSEFDXXXXXXXXXXX---VIAKLDNT 506
            KLSFS+ S SK       PI     ED    E+V+EFD              VI    N 
Sbjct: 2    KLSFSLPSKSKPTQKTSIPITSAAHEDQYHREFVTEFDPSKTPADPNSKPSFVIPPKQNE 61

Query: 507  WRPELKMKNIH-PHSTD--TGLQFVTEEEQS----QTDKKVEYGLNLRNPKHNNIPSDDK 665
            WRP  KMKN+H P  +D    LQF  E         +D K+ YGLNLR+    N   D +
Sbjct: 62   WRPYKKMKNLHIPLQSDGSRDLQFELESSSDLPLPNSDAKISYGLNLRDNSAKNDAGDQQ 121

Query: 666  SKPRVSENQNPENGNKIDS--ENLKEIPRVSRDEISENVKISG-------NYDHNSGKIE 818
              P   E+  P     + S  E+LK +P     E  E+V + G        Y    G+  
Sbjct: 122  GIP---ESAAPVEAVLLQSLKEDLKRLPEDRGFEEFEDVPVEGFGKALLAGYGWVEGR-- 176

Query: 819  RISENGKNSGRFDHLSKNFDNSSGRFDNKSVSESVHG-DNDRDRVFKEEIQSLPDAAGFD 995
             I +N K   +     +  D     F +K   E + G  N + +   EEI    D  GF 
Sbjct: 177  GIGKNAKEDVKVKQYERRTDKEGLGFSSKENKERLPGFTNVKQKHDTEEIVK-EDKDGFF 235

Query: 996  EYENVPVEEFGKALLRGMGWNESEG---VGKCRKVVEPIAYVRRDGKVGLGAVPAPKPEK 1166
              ++V V E  +  L+G    +  G   V + +K  E +       KV L  +      +
Sbjct: 236  VGKDVRVIEGREMGLKGTIMEKLGGGWIVLRLKKSEEKV-------KVRLFEIADLGSRE 288

Query: 1167 ESKRIKRPGE-EVRENRDLVAAAGE 1238
            E K +++  E ++RE +DL     E
Sbjct: 289  EEKCLRKLTELKIREAKDLKTKGDE 313


>ref|XP_003589709.1| Pre-mRNA-splicing factor spp2 [Medicago truncatula]
            gi|355478757|gb|AES59960.1| Pre-mRNA-splicing factor spp2
            [Medicago truncatula]
          Length = 385

 Score =  106 bits (264), Expect = 3e-20
 Identities = 86/222 (38%), Positives = 116/222 (52%), Gaps = 10/222 (4%)
 Frame = +3

Query: 897  DNKSVSESVHGDNDRDRV---------FKEEIQSLPDAAGFDEYENVPVEEFGKALLRGM 1049
            D K  S+ V  D  R +          FK++++ LPD  GFDEY++VPVE FG ALL G 
Sbjct: 42   DKKPQSDDVVVDAPRPKASVEVSMLQKFKDDMERLPDDMGFDEYKDVPVEGFGAALLGGY 101

Query: 1050 GWNESEGVGK-CRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVRENRDLVA 1226
            GW E  G+GK  ++ V+ +   RR GK GLG V A  P   SK+ +R             
Sbjct: 102  GWKEGMGIGKNAKEDVKVVEVKRRTGKEGLGFV-ADLPPPSSKKGER------------- 147

Query: 1227 AAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVL 1406
                NGR       GE   ER KK     +VVRIV G+  GLK  V+    R+G D  V+
Sbjct: 148  ----NGR-------GE--TERKKKEE---RVVRIVRGRDVGLKASVVG---RDGEDV-VV 187

Query: 1407 LRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDD 1532
            LR++ S E+V V   D+A+LGS EEE+ L++  + K    D+
Sbjct: 188  LRVLGSGEEVKVKVEDVAELGSVEEERCLRKLKDLKIRGRDE 229

