BLASTX nr result
ID: Ephedra25_contig00011082
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00011082
         (1677 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006846498.1| hypothetical protein AMTR_s00018p00151280 [A...   214   7e-53
ref|XP_002505906.1| predicted protein [Micromonas sp. RCC299] gi...   150   2e-33
ref|XP_003062694.1| predicted protein [Micromonas pusilla CCMP15...   149   3e-33
ref|XP_002966729.1| hypothetical protein SELMODRAFT_85312 [Selag...   137   1e-29
ref|XP_002978017.1| hypothetical protein SELMODRAFT_176683 [Sela...   137   2e-29
ref|XP_001691256.1| hypothetical protein CHLREDRAFT_144949 [Chla...   132   3e-28
ref|XP_005647237.1| hypothetical protein COCSUDRAFT_66346 [Cocco...   131   8e-28
ref|XP_003078104.1| G-patch nucleic acid binding protein (ISS) [...   129   3e-27
emb|CCO66478.1| predicted protein [Bathycoccus prasinos]              126   2e-26
ref|XP_002947360.1| hypothetical protein VOLCADRAFT_87660 [Volvo...   125   7e-26
ref|XP_635990.1| hypothetical protein DDB_G0289933 [Dictyosteliu...   122   6e-25
ref|XP_004246061.1| PREDICTED: protein MOS2-like isoform 1 [Sola...   119   4e-24
ref|XP_004335226.1| KOW motif domain containing protein [Acantha...   113   2e-22
ref|XP_006355237.1| PREDICTED: protein MOS2-like [Solanum tubero...   112   6e-22
ref|XP_004350567.1| hypothetical protein DFA_11620 [Dictyosteliu...   111   1e-21
gb|EXC18489.1| Protein MOS2 [Morus notabilis]                         110   1e-21
gb|EFA83563.1| hypothetical protein PPL_02629 [Polysphondylium p...   110   1e-21
gb|EXB45122.1| Protein MOS2 [Morus notabilis]                         109   3e-21
gb|EOY06252.1| MOS2, putative isoform 1 [Theobroma cacao] gi|508...
107 2e-20 ref|XP_003589709.1| Pre-mRNA-splicing factor spp2 [Medicago trun... 106 3e-20 >ref|XP_006846498.1| hypothetical protein AMTR_s00018p00151280 [Amborella trichopoda] gi|548849308|gb|ERN08173.1| hypothetical protein AMTR_s00018p00151280 [Amborella trichopoda] Length = 540 Score = 214 bits (546), Expect = 7e-53 Identities = 150/451 (33%), Positives = 221/451 (49%), Gaps = 6/451 (1%) Frame = +3 Query: 342 KLSFSMASSSKTRITRPAPIIEE---EDDQKPEYVSEFDXXXXXXXXXXXVIAKLDNTWR 512 KLSFS++S +R RP E E++ K E+V+EFD VI + +++WR Sbjct: 2 KLSFSLSSKRSSR-PRPTDFGERNTNEEEPKAEFVTEFDSSKTPSEKSRLVIPRQESSWR 60 Query: 513 PELKMKNIHPHSTDTGLQFVTEEEQSQTDKKVEYGLNLRNPKHNNIPSDDKSKPRVSENQ 692 E MKNI P T + +T E ++D V YGLNLRN KS S+ + Sbjct: 61 AEKNMKNIKPEETHLEFEIITHETSIESD--VGYGLNLRN----------KSNGGDSKRE 108 Query: 693 NPENGNKIDSENLKEIPRVSRDEI-SENVKISGNYDHNSGKIERISENGKNSGRFDHLSK 869 N + GN L + V E+ ++ K GN S K K Sbjct: 109 NEDMGNS----GLSCMEPVEATEVDAKRKKDMGNSSFPSVK-----------------PK 147 Query: 870 NFDNSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGM 1049 N D+ L + G DE+ ++P+E FG A+L G Sbjct: 148 NLDSE-----------------------------LEEDGGLDEFSDMPIEGFGAAVLAGY 178 Query: 1050 GWNESEGVG-KCRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVRENRDLVA 1226 GW E +G+G K +K ++ + Y+RR G GLG P+ PEK+ K+ +PGE +L+A Sbjct: 179 GWTEGQGIGRKAKKDIQVVQYIRRAGMGGLGFTPSSVPEKKQKKYVKPGESRESRPELIA 238 Query: 1227 AAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVL 1406 G NGR+RH VGI EKLV R KG +GK++R++ G H GLKG+++++ +G K+ Sbjct: 239 PKGSNGRIRHAVGIDEKLVPREIKGFFVGKILRVIGGPHLGLKGQLIEIFGDDGSSQKIG 298 Query: 1407 LRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKELKYDKMXXXXXXX 1586 L+L++SEE V+VD +LA+LGS EE++ LKR E K + D K L+ D+ Sbjct: 299 LKLLKSEEMVVVDREELAELGSLEEDKCLKRMRELKLE-GDGNRLKHLRRDERESHNGEF 357 Query: 1587 XXXXXXXXIHRRDSR-DKKPDNSSGKYDSRD 1676 +H SR D++ + SS K + D Sbjct: 358 GKERKAEPLHGDVSRHDRERERSSSKREKED 388 >ref|XP_002505906.1| predicted protein [Micromonas sp. 
RCC299] gi|226521177|gb|ACO67164.1| predicted protein [Micromonas sp. RCC299] Length = 493 Score = 150 bits (379), Expect = 2e-33 Identities = 81/225 (36%), Positives = 134/225 (59%), Gaps = 4/225 (1%) Frame = +3 Query: 867 KNFDNSSGRFDNKSVSESVHGDNDRDR---VFKEEIQSLPDAAGFDEYENVPVEEFGKAL 1037 K+ D + + K ES G + ++ FKE+++ LP+ A D+YE +P+E+FG A+ Sbjct: 118 KDGDTPKAKAEEKPAQESFIGKSLAEKELQAFKEDVEDLPEQATLDDYEQMPIEDFGAAM 177 Query: 1038 LRGMGWNESEGVGKCRK-VVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVRENR 1214 LRGMGW E + VG+ +V + +V R G++GLGA PAP ++ +K+ +PGE Sbjct: 178 LRGMGWEEGKPVGRNNNGMVAAVEFVPRSGRLGLGADPAPSKQENTKKYIKPGESREAPA 237 Query: 1215 DLVAAAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRD 1394 +V A G G+ R+V + EKLV+ + G GK + +V+G+H GL G+VLK+ ++EGR Sbjct: 238 TMVLAKGPEGQSRNVKTLDEKLVKLEEPGAREGKRMCVVEGRHRGLTGRVLKVVKQEGRS 297 Query: 1395 AKVLLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRND 1529 + L L S E V + +LADLG+ + ++ +++ +R + D Sbjct: 298 DRAQLELDTSGEVVTIRTGELADLGTRDADRAMRKKDDRAGGKGD 342 >ref|XP_003062694.1| predicted protein [Micromonas pusilla CCMP1545] gi|226456211|gb|EEH53513.1| predicted protein [Micromonas pusilla CCMP1545] Length = 496 Score = 149 bits (377), Expect = 3e-33 Identities = 89/233 (38%), Positives = 135/233 (57%), Gaps = 4/233 (1%) Frame = +3 Query: 786 GNYDHNSGKIERISENGKNSG--RFDHLSKNFDNSSGRFDNKSVSESVHGDNDRDRVFKE 959 G SG + +++ G G + + K ++ F KS+ E + F+E Sbjct: 109 GETAQGSGTVYGLTKMGPKDGDVKLEDDEKPPPPANESFIGKSLQEK------ELQAFRE 162 Query: 960 EIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGKCRK-VVEPIAYVRRDGKVGL 1136 ++Q LP+ A D+YE++P+E+FG A+LRGMGW E + VG+ K +V + +V R G++GL Sbjct: 163 DVQDLPEQASLDDYESMPIEDFGAAMLRGMGWEEGKPVGRNSKGLVAAVEFVPRAGRLGL 222 Query: 1137 GAVPAPKPEKESKRIKRPGEEVRENRD-LVAAAGENGRVRHVVGIGEKLVERAKKGPAIG 1313 GA PAPKPE R +PG E RE D +V G G+ R+V + EKLV+R + GP G Sbjct: 223 GAEPAPKPEVSDARRIKPG-ETRERADVMVLPEGPEGKSRNVKSLDEKLVKREEPGPREG 281 Query: 1314 KVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGS 1472 K++ +V+G+H GL+ +VL + + EGR + + L S E V V +LAD GS 
Sbjct: 282 KMMCVVEGRHRGLECRVLTLTKSEGRSERAQVELKTSGEVVTVRTSELADFGS 334 >ref|XP_002966729.1| hypothetical protein SELMODRAFT_85312 [Selaginella moellendorffii] gi|300166149|gb|EFJ32756.1| hypothetical protein SELMODRAFT_85312 [Selaginella moellendorffii] Length = 377 Score = 137 bits (346), Expect = 1e-29 Identities = 101/289 (34%), Positives = 149/289 (51%), Gaps = 3/289 (1%) Frame = +3 Query: 672 PRVSENQNPENGNKIDSENLKEIPRVSRDEISENVKISGNYDH-NSGKIERISENGKNSG 848 PR+ + PE K NL P S E V +G D G R S+ G S Sbjct: 4 PRLENSWKPEKRMK----NLMSAPEDSVQEFVGEVLPAGPVDGVQYGLSVRSSKAGGGSV 59 Query: 849 RFDHLSKNFDNSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFG 1028 + D + + + D++ + + ++ LP++A Y+ +PVE+FG Sbjct: 60 KTDIME---------------ARELRAKLDKEALV-DHLEFLPESASLAAYDAMPVEKFG 103 Query: 1029 KALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVR 1205 +ALL GMGW + G+G+ + V+ +VRR ++GLGA KP Sbjct: 104 QALLLGMGWRDGRGIGRRATEDVKATEFVRRPERLGLGAA---KP--------------- 145 Query: 1206 ENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEERE 1385 DLVA G +G+VRH V IGEKLVERAK+G GKV+ IV G+H+GL+G+VL +++ Sbjct: 146 ---DLVAPVGPDGKVRHRVEIGEKLVERAKRGVYGGKVMTIVSGRHAGLRGEVLGRKDKV 202 Query: 1386 GRDAKVL-LRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRND 1529 +V+ ++L S E V VD ++LAD+GS EEE +KR E + R D Sbjct: 203 ANPGEVVGVKLAGSGETVEVDAKNLADVGSVEEENAMKRLKELRIQRGD 251 >ref|XP_002978017.1| hypothetical protein SELMODRAFT_176683 [Selaginella moellendorffii] gi|300154038|gb|EFJ20674.1| hypothetical protein SELMODRAFT_176683 [Selaginella moellendorffii] Length = 378 Score = 137 bits (344), Expect = 2e-29 Identities = 81/193 (41%), Positives = 118/193 (61%), Gaps = 2/193 (1%) Frame = +3 Query: 957 EEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGKVG 1133 + ++ LP++A Y+ +PVE+FG+ALL GMGW + G+G+ + V+ +VRR ++G Sbjct: 81 DHLEFLPESASLAAYDAMPVEKFGQALLLGMGWRDGRGIGRRATEDVKATEFVRRPERLG 140 Query: 1134 LGAVPAPKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAIG 1313 LGA KP DLVA G +G+VRH V IGEKLVERAK+G G Sbjct: 141 
LGAA---KP------------------DLVAPVGPDGKVRHRVEIGEKLVERAKRGVYGG 179 Query: 1314 KVVRIVDGKHSGLKGKVLKMEEREGRDAKVL-LRLVQSEEDVLVDGRDLADLGSFEEEQF 1490 KV+ IV G+H+GL+G+VL +++ +V+ ++L S E V VD ++LAD+GS EEE Sbjct: 180 KVMTIVSGRHAGLRGEVLGRKDKVANPGEVVGVKLAGSGETVEVDAKNLADVGSVEEENA 239 Query: 1491 LKRYWERKQDRND 1529 +KR E + R D Sbjct: 240 MKRLKELRIQRGD 252 >ref|XP_001691256.1| hypothetical protein CHLREDRAFT_144949 [Chlamydomonas reinhardtii] gi|158279228|gb|EDP04989.1| predicted protein [Chlamydomonas reinhardtii] Length = 597 Score = 132 bits (333), Expect = 3e-28 Identities = 78/177 (44%), Positives = 111/177 (62%), Gaps = 3/177 (1%) Frame = +3 Query: 942 DRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGKCRKVVEPIAYVRRD 1121 ++ +K+ ++ LP+ A D YE +P+EEFG+A+LRGMGW E GVG+ RK V+ I YVRR Sbjct: 141 EKAYKDSVEDLPEVADLDAYEAMPIEEFGRAMLRGMGWEEGMGVGRNRKQVDAIEYVRRP 200 Query: 1122 GKVGLGAVPAPKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAK-- 1295 ++GLGA P E +SK +K G++ R+ +DLV A +GR R+V + EKLV RA Sbjct: 201 ERLGLGAQPVKVSEDKSKVVKM-GDKPRK-QDLVLAPDADGRQRNVRTLDEKLVSRATVL 258 Query: 1296 KGPAIGKVVRIVDGKHSGLKGKVLK-MEEREGRDAKVLLRLVQSEEDVLVDGRDLAD 1463 GP K +RI+ G HSGL L+ + + EGR + +RL S EDV V +L + Sbjct: 259 PGPQPNKPMRIMTGAHSGLLCTALEALPKPEGRPERWRVRLAASNEDVEVLASELGE 315 >ref|XP_005647237.1| hypothetical protein COCSUDRAFT_66346 [Coccomyxa subellipsoidea C-169] gi|384249211|gb|EIE22693.1| hypothetical protein COCSUDRAFT_66346 [Coccomyxa subellipsoidea C-169] Length = 505 Score = 131 bits (330), Expect = 8e-28 Identities = 80/206 (38%), Positives = 118/206 (57%), Gaps = 2/206 (0%) Frame = +3 Query: 951 FKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGK 1127 FK++++SLP+ AG D YE +PVE FG+A+LRGMGW E + VGK +++V YVRR K Sbjct: 164 FKQDVESLPEVAGLDAYEAMPVEAFGEAMLRGMGWQEGKSVGKNAKEIVTAKEYVRRPEK 223 Query: 1128 VGLGAVPAPKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPA 1307 +GLGA PA K K IK+ GE +D++ E G +HV KLVER K+G Sbjct: 224 LGLGATPAAMLPKPKKYIKQ-GESREAKKDMIYMDAE-GHQKHVKDENAKLVEREKRGVH 281 Query: 1308 
IGKVVRIVDGKHSGLKGKVLKMEEREG-RDAKVLLRLVQSEEDVLVDGRDLADLGSFEEE 1484 +GK +R + G H+GL VL +E + G R + +RL S E V V ++L + E Sbjct: 282 VGKTMRCIAGTHAGLLCDVLALEPKVGDRSQRATVRLQPSAETVSVRVKELGE--KHESA 339 Query: 1485 QFLKRYWERKQDRNDDAHKKELKYDK 1562 + +++ + KK+ K+D+ Sbjct: 340 PEPENGSSKRKHKEHKHEKKQHKHDR 365 >ref|XP_003078104.1| G-patch nucleic acid binding protein (ISS) [Ostreococcus tauri] gi|116056555|emb|CAL52844.1| G-patch nucleic acid binding protein (ISS) [Ostreococcus tauri] Length = 511 Score = 129 bits (325), Expect = 3e-27 Identities = 89/286 (31%), Positives = 148/286 (51%), Gaps = 13/286 (4%) Frame = +3 Query: 555 TGLQFVTEEEQSQTDKKVEYGLNLRNPKHNNIPSDDKSKPRVSENQNPENGNKIDSENLK 734 TG+ E S D+ V + R + + + K R +++ +N + D + Sbjct: 102 TGVAGNEIEGTSGDDEPVRDKVIARIENTFEVGTGRRRKVRGRKSRERDNASTDDDDFRA 161 Query: 735 EIPRV--SRDEISENVK--------ISGNYDHNSGKIERISENGKNSGRFDHLSKNFDNS 884 +IP ++DEI + K +G G +++ G G + D + Sbjct: 162 QIPSFIPTKDEIERDNKRFHVAETIAAGGELAQQGVAYGLTKMGPKHGAVAAEKSDRDKA 221 Query: 885 SGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNES 1064 F KS+ E + FKE+++ LP+ A +EYE++P+E+FGKA+LRGMGW E Sbjct: 222 GESFIGKSLHEK------ELQAFKEDMEDLPEQASIEEYEDMPIEDFGKAMLRGMGWEEG 275 Query: 1065 EGVGKC-RKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVRENRDLV--AAAG 1235 + VG+ R +VE + ++ R ++GLGA PA K + K IK PGE + D+V A Sbjct: 276 KAVGRMHRGMVEAVEFIPRAARLGLGAQPAEKDAPQKKYIK-PGETREKKADMVRDQRAV 334 Query: 1236 ENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKM 1373 + +R+V + EKLV+R + GP GK++ I DG+H+G+ G++L+M Sbjct: 335 ADAGMRNVKTLDEKLVKRKEVGPREGKMMYIADGQHAGISGRILRM 380 >emb|CCO66478.1| predicted protein [Bathycoccus prasinos] Length = 510 Score = 126 bits (317), Expect = 2e-26 Identities = 93/301 (30%), Positives = 156/301 (51%), Gaps = 15/301 (4%) Frame = +3 Query: 696 PENGNKIDSENLKEIPRVSRDEISENVKISGNYDHNSGKIERISENGKNSG-RFDHLSKN 872 P KID + E+ E ++ K + + R++ G + R+++ K Sbjct: 84 PTESEKIDDDKRFEVSETIGGETAQGNKTTYG-------LTRMAPKGDEAKVRYENKKKE 136 Query: 873 
FD-NSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGM 1049 D N + F KS++E + D FKE+++ LP+ A + YE++P+E+FG A+LRGM Sbjct: 137 QDKNKNESFIGKSLAEK-----ELD-AFKEDVEDLPEQASIEAYESMPIEDFGAAMLRGM 190 Query: 1050 GWNESEGVGKCRKV--VEPIAYVRRDGKVGLGA----VPAPKPEKES----KRIKRPGEE 1199 GW E EGVG+ K +P+ +V R G++GLGA VP K + K+I++PGE Sbjct: 191 GWKEGEGVGRNAKSGRADPVEFVPRMGRLGLGADTMDVPGATLRKNNNNNDKKIRKPGET 250 Query: 1200 VRENRDLVAAAGENGR---VRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLK 1370 E +++ R +R+V I EKLV++ + G GK + + GKH GL G+VLK Sbjct: 251 REEKMEILVRDPNASRAPGMRNVKSIDEKLVKKEELGVKEGKRMYVAKGKHEGLTGRVLK 310 Query: 1371 MEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKEL 1550 +++ +GR L S E + + +L ++ + + Q ++K+ +DDA K+ Sbjct: 311 IQKADGR---AQFELDSSGEVITLRCSELDEMVNAPDSQ-----RDKKRKSSDDAGSKKS 362 Query: 1551 K 1553 K Sbjct: 363 K 363 >ref|XP_002947360.1| hypothetical protein VOLCADRAFT_87660 [Volvox carteri f. nagariensis] gi|300267224|gb|EFJ51408.1| hypothetical protein VOLCADRAFT_87660 [Volvox carteri f. 
nagariensis] Length = 733 Score = 125 bits (313), Expect = 7e-26 Identities = 76/173 (43%), Positives = 104/173 (60%), Gaps = 7/173 (4%) Frame = +3 Query: 945 RVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGKCRKVVEPIAYVRRDG 1124 R F+E + LPDA + YE +PVEEFGKA+LRGMGW E GVG+ R+ V+ I YVRR Sbjct: 145 RAFRESVVELPDAMDVEAYEAMPVEEFGKAMLRGMGWEEGMGVGRNRQKVDAIEYVRRPE 204 Query: 1125 KVGLGAVP---APKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAK 1295 ++GLGA P AP P K K ++P + +DLV A +GR R++ + E+LV R+ Sbjct: 205 RLGLGAQPVALAPDPSKPVKMGEKP-----QRQDLVLAPDADGRQRNIRKLDEQLVARST 259 Query: 1296 --KGPAIGKVVRIVDGKHSGLKGKVLKMEER--EGRDAKVLLRLVQSEEDVLV 1442 GP GK +RI G H+GL L+ R EG+ + +RL S+E+V V Sbjct: 260 VLPGPQPGKDMRITGGPHAGLACTALEALPRAPEGKPERWRVRLTASQEEVEV 312 >ref|XP_635990.1| hypothetical protein DDB_G0289933 [Dictyostelium discoideum AX4] gi|60464330|gb|EAL62479.1| hypothetical protein DDB_G0289933 [Dictyostelium discoideum AX4] Length = 542 Score = 122 bits (305), Expect = 6e-25 Identities = 99/336 (29%), Positives = 165/336 (49%), Gaps = 31/336 (9%) Frame = +3 Query: 543 HSTDTGLQFVTEEEQSQTDKKVEYGLNLRNPKHNNIPSDDKSKPRVSE----------NQ 692 ++ ++ L +++ ++ + + + Y L K ++ +D+ S S+ N+ Sbjct: 43 NNNNSKLSTLSKRKEPEKKEPINYITTLEGTKVTSLYNDETSTLGSSKLKVIPLTEQLNE 102 Query: 693 NPENGNKIDSENLKEIPRVSRDEISENVKISGNYDHNSGKIERISENGK----------- 839 N EN +KI ++ KEI + E EN K N D +S K R E + Sbjct: 103 NYENSDKIVAQVAKEIKK----EEKENEKNENNNDDSSNKKLRTEEKLQPFYKASQTGLQ 158 Query: 840 -NSGR-------FDHLSKNFDNSSGRFDNKSVSESVHGDNDRDRVFKEEIQSLPDAAGFD 995 N R D ++N +SSG + + +N+ D+ FK +++S PD + D Sbjct: 159 LNPNRKQEIKSIVDGKNENSGSSSGIPLIMKLKDIDKYENETDK-FKHDVESRPDESNLD 217 Query: 996 EYENVPVEEFGKALLRGMGWNESEGVGKCRK-VVEPIAYVRRDG-KVGLGAVPAPKPEKE 1169 +YE PV G+ALLRGMGW + +G K +VEPI Y++R G ++GLGA KP + Sbjct: 218 DYEETPVSIIGEALLRGMGWVPGKSIGSTNKGLVEPIEYIKRPGFRLGLGA----KPMDD 273 Query: 1170 SKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSG 1349 I++ G + + +G+VRHV GI EK+V KK G +V ++ G H G Sbjct: 274 
DDEIRKMGGPIED---------ADGKVRHVKGINEKIVSNVKKMEE-GSLVTVIGGPHKG 323 Query: 1350 LKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDL 1457 + K++ M + + KV + + +S+E V+VD DL Sbjct: 324 MNAKIVSMLKND----KVQI-IFKSDEKVIVDKFDL 354 >ref|XP_004246061.1| PREDICTED: protein MOS2-like isoform 1 [Solanum lycopersicum] gi|460401091|ref|XP_004246062.1| PREDICTED: protein MOS2-like isoform 2 [Solanum lycopersicum] Length = 485 Score = 119 bits (298), Expect = 4e-24 Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 7/195 (3%) Frame = +3 Query: 951 FKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGK 1127 FKE+++ LP+ G DEY ++PVE FG ALL+G GW E G+G+ ++ V+ + Y R K Sbjct: 145 FKEDLKRLPEHNGIDEYTDMPVEGFGAALLKGYGWVEGRGIGRNAKEDVKVVEYKRWTAK 204 Query: 1128 VGLGAVP-APKPEKES----KRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKL-VER 1289 G+G +P PKP ++ K IK+ GE E +V H G EK+ E+ Sbjct: 205 EGIGFIPEVPKPSSKAEGGVKPIKKKGE-------------EGIKVDHSDGYIEKIDREK 251 Query: 1290 AKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLG 1469 KG +GK VR+V GK G+KG+VL E R V+L+L ++++V + RDLA+LG Sbjct: 252 GGKGLYVGKKVRVVRGKEMGMKGEVL---EVNSRGELVILKL--ADKEVKLQARDLAELG 306 Query: 1470 SFEEEQFLKRYWERK 1514 S EEE+ LK+ E K Sbjct: 307 SVEEERCLKKLLELK 321 >ref|XP_004335226.1| KOW motif domain containing protein [Acanthamoeba castellanii str. Neff] gi|440791981|gb|ELR13213.1| KOW motif domain containing protein [Acanthamoeba castellanii str. 
Neff] Length = 491 Score = 113 bits (283), Expect = 2e-22 Identities = 67/186 (36%), Positives = 110/186 (59%), Gaps = 2/186 (1%) Frame = +3 Query: 936 DRDRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGKCRK-VVEPIAYV 1112 D D F+ +++S P+ Y+ VPVE+FG+A+LRGM W + +G K VV+P+ ++ Sbjct: 91 DEDEKFRFDVESRPEETTASAYDRVPVEKFGEAMLRGMLWKPGDPIGNTNKAVVKPVEFI 150 Query: 1113 RRDGKVGLGAVP-APKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVER 1289 R ++GLGA P AP+P K K+ +PGE + +V ++GR RHV + +KLV Sbjct: 151 ARHHRLGLGAAPKAPEPTK--KKFIKPGESREPKQMMVLPRDKDGRQRHVRDLDQKLVAF 208 Query: 1290 AKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLG 1469 +G G +V +V G+H G+ +++++ + KV +R +SEE+V V+ DLA L Sbjct: 209 RPEGIHPGNLVGVVSGEHEGMYARIVQLLAGD----KVRIRF-ESEEEVTVNRIDLAPLN 263 Query: 1470 SFEEEQ 1487 S + +Q Sbjct: 264 STQLKQ 269 >ref|XP_006355237.1| PREDICTED: protein MOS2-like [Solanum tuberosum] Length = 484 Score = 112 bits (279), Expect = 6e-22 Identities = 75/205 (36%), Positives = 120/205 (58%), Gaps = 6/205 (2%) Frame = +3 Query: 951 FKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGK 1127 FKE+++ LP+ G DEY ++PVE FG ALL+G GW E G+G+ ++ V+ + Y + K Sbjct: 145 FKEDLKRLPEHNGIDEYTDMPVEGFGAALLKGYGWVEGRGIGRNAKEDVKVVEYKKWTAK 204 Query: 1128 VGLGAVP-APKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKL-VERAKKG 1301 G+G +P PKP + + + ++++ D V +V H G EK+ E+A G Sbjct: 205 EGIGFIPEVPKPSSKGEGAVK---SIKKSEDGV-------KVDHSDGNIEKIDREKAGNG 254 Query: 1302 PAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEE 1481 +GK VR+V GK G+KG++L E V+L+L ++++V + RDLA+LGS EE Sbjct: 255 LYVGKKVRVVRGKEMGMKGEIL---EVNSSGDLVILKL--ADKEVKLQARDLAELGSVEE 309 Query: 1482 EQFLKRYWE---RKQDRNDDAHKKE 1547 E+ LK+ E R++ N D +K+ Sbjct: 310 ERCLKKLLELKIREEKSNLDGVRKQ 334 >ref|XP_004350567.1| hypothetical protein DFA_11620 [Dictyostelium fasciculatum] gi|328865473|gb|EGG13859.1| hypothetical protein DFA_11620 [Dictyostelium fasciculatum] Length = 531 Score = 111 bits (277), Expect = 1e-21 Identities = 67/177 (37%), Positives = 100/177 (56%), Gaps = 
1/177 (0%) Frame = +3 Query: 930 DNDRDRVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGKCRK-VVEPIA 1106 +ND D+ FK ++ + PD A D+YE P++ FGKA+L GMGW +G+G K VVEP+ Sbjct: 200 NNDEDK-FKFDLSTRPDEANQDDYEETPIDIFGKAMLMGMGWKPGQGIGLTNKGVVEPVQ 258 Query: 1107 YVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVE 1286 +++R G++GLGA P+ KE K + P GE+G+VRH VG+ EKLV Sbjct: 259 FLKRAGRLGLGAQPSDVANKEKKYMTAP-------------KGEDGKVRHTVGLSEKLVP 305 Query: 1287 RAKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDL 1457 K G G V ++ G H GL V + + + ++++R +S+E VD DL Sbjct: 306 -LKFGLQPGDRVLVISGPHEGLNATVESLAQSD----RIVIRF-KSDELAAVDKCDL 356 >gb|EXC18489.1| Protein MOS2 [Morus notabilis] Length = 476 Score = 110 bits (276), Expect = 1e-21 Identities = 102/352 (28%), Positives = 159/352 (45%), Gaps = 52/352 (14%) Frame = +3 Query: 654 SDDKSKP-RVSENQNPENGNKIDSENLKEIPRVSRDEISENVKISGNYDHNSGKIERISE 830 S K KP + S+N +N NK + V SE ++GN N+ I I Sbjct: 11 SSSKLKPIKPSQNFEDDNDNKSTENDANSRKYVIEFNASET--LTGNATQNAVVIPPIQN 68 Query: 831 NGKNSGRFDHL----SKNFDNSSG-RFDNKSVSESVH--------------GDNDRD--- 944 + R +L + D S G +F+ +S+S++ + GD+D + Sbjct: 69 EWRPHKRMKNLDLPIAAQSDGSGGLQFEVESLSDATNSSMSYGLNLRQTAKGDHDDEING 128 Query: 945 ---------------------RVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNE 1061 + K ++Q LP+ G E+E+VPVE FG ALL G GW+E Sbjct: 129 QDEAKDKNERLRFTPTEDVLLQKLKFDLQRLPEDRGMAEFEDVPVEGFGAALLSGYGWHE 188 Query: 1062 SEGVGK-CRKVVEPIAYVRRDGKVGLGAV-----PAPKPEKES--KRIKRPGEEVRENRD 1217 G+GK ++ V+ + Y +R GK GLG V P P ++S I +P + N + Sbjct: 189 GRGIGKNAKEDVKVVEYTKRTGKQGLGFVMTDLPPLPNSNRDSLNNSIPKPKDNNNNNNN 248 Query: 1218 LVAAAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDA 1397 + K IGK VRIV G+ GLKG+VL E+ D Sbjct: 249 ---------------------NSSSNKESLIGKEVRIVRGRELGLKGRVL---EKLSDDN 284 Query: 1398 KVLLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKELK 1553 ++++RL +S+E V V+ +D+A+LGS E+E LKR E + ++ +K+ K Sbjct: 285 RLVVRLSRSQETVKVNIQDVAELGSEEDEACLKRLKELRIREEEEKKEKKSK 336 >gb|EFA83563.1| hypothetical protein PPL_02629 [Polysphondylium 
pallidum PN500] Length = 584 Score = 110 bits (276), Expect = 1e-21 Identities = 81/249 (32%), Positives = 127/249 (51%), Gaps = 14/249 (5%) Frame = +3 Query: 660 DKSKPRVSENQNPENGNKIDSENLKEIPRVSRDEISENVKISGNYDHNSG-----KIERI 824 DK K S+N + + N+ D+ N + + I + K D+NS +I+ + Sbjct: 148 DKFKRLKSDNNSSSDKNEDDNSNTFKPKQHGLQIIKQKDKNKQKDDNNSSGSNRQEIKSL 207 Query: 825 SENGKNSGRFDHLSKNFDNSSGRFDNKSVSESVHG----DNDRDRVFKEEIQSLPDAAGF 992 N KN DH + + + E + G DND D+ FK +++S P+ A Sbjct: 208 IGNKKND---DHKDVEMNQV---VKVRPLIEKLDGLDRFDNDDDK-FKFDVESRPEEADN 260 Query: 993 DEYENVPVEEFGKALLRGMGWNESEGVGKCRK-VVEPIAYVRRDG-KVGLGAVPAPKPEK 1166 ++YE P+E FG+A+LRGMGW + +G K + EPI +V+R G ++GLGA P +K Sbjct: 261 EDYEETPIEVFGEAMLRGMGWQPGQAIGLTNKGLNEPIQFVKRPGYRLGLGAQPKDVDDK 320 Query: 1167 ESKRIKRPGEEVRENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPA---IGKVVRIVDG 1337 + R LVA GE+G+VRH+VGI EKLV K G +G+ V ++ G Sbjct: 321 -------------DKRYLVAQKGEDGKVRHMVGISEKLVPMNKSGSQSYNVGERVLVISG 367 Query: 1338 KHSGLKGKV 1364 +H G+ ++ Sbjct: 368 QHEGMYAEI 376 >gb|EXB45122.1| Protein MOS2 [Morus notabilis] Length = 423 Score = 109 bits (273), Expect = 3e-21 Identities = 80/237 (33%), Positives = 121/237 (51%), Gaps = 11/237 (4%) Frame = +3 Query: 876 DNSSGRFDNKSVSESVHGDNDRD---RVFKEEIQSLPDAAGFDEYENVPVEEFGKALLRG 1046 D +G+ + K +E + D + K ++Q LPD G E+E+VPVE FG ALL G Sbjct: 130 DEINGQDEAKDKNERLRFTPTEDVLLQKLKFDLQRLPDDRGMAEFEDVPVEGFGAALLSG 189 Query: 1047 MGWNESEGVGK-CRKVVEPIAYVRRDGKVGLGAV-----PAPKPEKES--KRIKRPGEEV 1202 GW+E G+GK ++ V+ + Y +R GK GLG V P P ++S I +P + Sbjct: 190 YGWHEGRGIGKNAKEDVKVVEYTKRTGKQGLGFVSNVLPPLPNSNRDSLNNSILKPKDNN 249 Query: 1203 RENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEER 1382 N + + K IGK VRIV G+ GLKG+VL E+ Sbjct: 250 TNNNN---------------------NSSSNKESLIGKEVRIVHGRELGLKGRVL---EK 285 Query: 1383 EGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDDAHKKELK 1553 D ++++RL +S+E V V+ RD+ +LGS E+E LKR E + ++ +K+ K Sbjct: 286 LSDDNRLVVRLSRSQETVKVNIRDVTELGSEEDEACLKRLKELRIRGEEEKKEKKSK 342 >gb|EOY06252.1| MOS2, putative 
isoform 1 [Theobroma cacao] gi|508714356|gb|EOY06253.1| MOS2, putative isoform 1 [Theobroma cacao] Length = 465 Score = 107 bits (266), Expect = 2e-20 Identities = 75/189 (39%), Positives = 108/189 (57%), Gaps = 2/189 (1%) Frame = +3 Query: 954 KEEIQSLPDAAGFDEYENVPVEEFGKALLRGMGWNESEGVGK-CRKVVEPIAYVRRDGKV 1130 KE+++ LP+ GF+E+E+VPVE FGKALL G GW E G+GK ++ V+ Y RR K Sbjct: 139 KEDLKRLPEDRGFEEFEDVPVEGFGKALLAGYGWVEGRGIGKNAKEDVKVKQYERRTDKE 198 Query: 1131 GLGAVPAPKPEKESKRIKRPG-EEVRENRDLVAAAGENGRVRHVVGIGEKLVERAKKGPA 1307 GLG KE+K + PG V++ D E++V+ K G Sbjct: 199 GLGF-----SSKENKE-RLPGFTNVKQKHDT-----------------EEIVKEDKDGFF 235 Query: 1308 IGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVLLRLVQSEEDVLVDGRDLADLGSFEEEQ 1487 +GK VR+++G+ GLKG + ME+ G ++LRL +SEE V V ++ADLGS EEE+ Sbjct: 236 VGKDVRVIEGREMGLKGTI--MEKLGG--GWIVLRLKKSEEKVKVRLFEIADLGSREEEK 291 Query: 1488 FLKRYWERK 1514 L++ E K Sbjct: 292 CLRKLTELK 300 Score = 69.7 bits (169), Expect = 4e-09 Identities = 93/325 (28%), Positives = 132/325 (40%), Gaps = 26/325 (8%) Frame = +3 Query: 342 KLSFSMASSSKTRITRPAPIIE--EEDDQKPEYVSEFDXXXXXXXXXXX---VIAKLDNT 506 KLSFS+ S SK PI ED E+V+EFD VI N Sbjct: 2 KLSFSLPSKSKPTQKTSIPITSAAHEDQYHREFVTEFDPSKTPADPNSKPSFVIPPKQNE 61 Query: 507 WRPELKMKNIH-PHSTD--TGLQFVTEEEQS----QTDKKVEYGLNLRNPKHNNIPSDDK 665 WRP KMKN+H P +D LQF E +D K+ YGLNLR+ N D + Sbjct: 62 WRPYKKMKNLHIPLQSDGSRDLQFELESSSDLPLPNSDAKISYGLNLRDNSAKNDAGDQQ 121 Query: 666 SKPRVSENQNPENGNKIDS--ENLKEIPRVSRDEISENVKISG-------NYDHNSGKIE 818 P E+ P + S E+LK +P E E+V + G Y G+ Sbjct: 122 GIP---ESAAPVEAVLLQSLKEDLKRLPEDRGFEEFEDVPVEGFGKALLAGYGWVEGR-- 176 Query: 819 RISENGKNSGRFDHLSKNFDNSSGRFDNKSVSESVHG-DNDRDRVFKEEIQSLPDAAGFD 995 I +N K + + D F +K E + G N + + EEI D GF Sbjct: 177 GIGKNAKEDVKVKQYERRTDKEGLGFSSKENKERLPGFTNVKQKHDTEEIVK-EDKDGFF 235 Query: 996 EYENVPVEEFGKALLRGMGWNESEG---VGKCRKVVEPIAYVRRDGKVGLGAVPAPKPEK 1166 ++V V E + L+G + G V + +K E + KV L + + Sbjct: 236 VGKDVRVIEGREMGLKGTIMEKLGGGWIVLRLKKSEEKV-------KVRLFEIADLGSRE 288 Query: 1167 ESKRIKRPGE-EVRENRDLVAAAGE 1238 
E K +++ E ++RE +DL E Sbjct: 289 EEKCLRKLTELKIREAKDLKTKGDE 313 >ref|XP_003589709.1| Pre-mRNA-splicing factor spp2 [Medicago truncatula] gi|355478757|gb|AES59960.1| Pre-mRNA-splicing factor spp2 [Medicago truncatula] Length = 385 Score = 106 bits (264), Expect = 3e-20 Identities = 86/222 (38%), Positives = 116/222 (52%), Gaps = 10/222 (4%) Frame = +3 Query: 897 DNKSVSESVHGDNDRDRV---------FKEEIQSLPDAAGFDEYENVPVEEFGKALLRGM 1049 D K S+ V D R + FK++++ LPD GFDEY++VPVE FG ALL G Sbjct: 42 DKKPQSDDVVVDAPRPKASVEVSMLQKFKDDMERLPDDMGFDEYKDVPVEGFGAALLGGY 101 Query: 1050 GWNESEGVGK-CRKVVEPIAYVRRDGKVGLGAVPAPKPEKESKRIKRPGEEVRENRDLVA 1226 GW E G+GK ++ V+ + RR GK GLG V A P SK+ +R Sbjct: 102 GWKEGMGIGKNAKEDVKVVEVKRRTGKEGLGFV-ADLPPPSSKKGER------------- 147 Query: 1227 AAGENGRVRHVVGIGEKLVERAKKGPAIGKVVRIVDGKHSGLKGKVLKMEEREGRDAKVL 1406 NGR GE ER KK +VVRIV G+ GLK V+ R+G D V+ Sbjct: 148 ----NGR-------GE--TERKKKEE---RVVRIVRGRDVGLKASVVG---RDGEDV-VV 187 Query: 1407 LRLVQSEEDVLVDGRDLADLGSFEEEQFLKRYWERKQDRNDD 1532 LR++ S E+V V D+A+LGS EEE+ L++ + K D+ Sbjct: 188 LRVLGSGEEVKVKVEDVAELGSVEEERCLRKLKDLKIRGRDE 229