BLASTX nr result
ID: Akebia22_contig00023060
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia22_contig00023060 (1591 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006857341.1| hypothetical protein AMTR_s00067p00095130 [A... 278 4e-72 ref|XP_007224114.1| hypothetical protein PRUPE_ppa015679mg [Prun... 270 2e-69 ref|XP_004297831.1| PREDICTED: uncharacterized protein LOC101296... 246 3e-62 ref|XP_002314482.2| hypothetical protein POPTR_0010s08070g [Popu... 238 7e-60 ref|XP_002883427.1| nucleic acid binding protein [Arabidopsis ly... 213 3e-52 ref|XP_006406035.1| hypothetical protein EUTSA_v10020149mg [Eutr... 208 6e-51 ref|XP_006297030.1| hypothetical protein CARUB_v10013033mg [Caps... 203 2e-49 ref|XP_004247930.1| PREDICTED: uncharacterized protein LOC101262... 202 5e-49 ref|NP_188979.2| RNA recognition motif-containing protein [Arabi... 199 2e-48 ref|XP_006354470.1| PREDICTED: dentin sialophosphoprotein-like [... 195 4e-47 ref|NP_001118682.1| RNA recognition motif-containing protein [Ar... 193 2e-46 gb|EYU32569.1| hypothetical protein MIMGU_mgv1a003227mg [Mimulus... 183 2e-43 gb|ABF59161.1| unknown protein [Arabidopsis thaliana] 135 4e-29 ref|XP_001778539.1| predicted protein [Physcomitrella patens] gi... 125 5e-26 ref|XP_007035162.1| RNA-binding (RRM/RBD/RNP motifs) family prot... 118 7e-24 ref|XP_001769584.1| predicted protein [Physcomitrella patens] gi... 111 8e-22 gb|ADE76534.1| unknown [Picea sitchensis] 101 1e-18 ref|XP_002991376.1| hypothetical protein SELMODRAFT_429718 [Sela... 97 2e-17 ref|XP_002988242.1| hypothetical protein SELMODRAFT_447288 [Sela... 95 8e-17 ref|XP_002978664.1| hypothetical protein SELMODRAFT_418465 [Sela... 91 2e-15 >ref|XP_006857341.1| hypothetical protein AMTR_s00067p00095130 [Amborella trichopoda] gi|548861434|gb|ERN18808.1| hypothetical protein AMTR_s00067p00095130 [Amborella trichopoda] Length = 773 Score = 278 bits (712), Expect = 4e-72 Identities = 164/379 (43%), Positives = 224/379 (59%), Gaps = 1/379 (0%) Frame = +2 Query: 401 LRNDNEAEPTKKVPFKMFDSQFLXXXXXXXXXXXXXXXXYTKTKPILQIPVPYKKVTPKK 580 +++D + + KVPF+ FDS Y + P +P+P K+ P Sbjct: 409 VKDDIKDDGPLKVPFEKFDSD------EDLSDSEVAASEYGEPDPF--VPLPSKRSFPSF 460 Query: 581 DPNPFLFTVKGRDQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESM-FRDAY 757 P Q T+ V+FL K T DI+ FG G I+E+ +I S K + F +AY Sbjct: 461 SKRPI-------GQNTLFVKFLPKSATDVDIRKAFGGCGEIEELCIIPSLKATARFNNAY 513 Query: 758 VSFKTKEGLQRALAKTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTGLLENP 937 VSF EGLQRAL K+++VI G DVV+E+ PL ++N + NLIGDPD P LLENP Sbjct: 514 VSFLRGEGLQRALEKSNLVINGADVVVEADSPLLK-ITNMVSISNLIGDPDAPLPLLENP 572 Query: 938 THSVMVKGLPPDLTLHHLREALSCGAXXXXXXXXXXXXVAYIEFETEDAKEKAIAESSVS 1117 ++MV+GLP + H L +LS + YIEFETEDAKEKA+A S++S Sbjct: 573 VRTIMVRGLPVGIGSHQLSFSLSNFGSISRFIMGSSLSIGYIEFETEDAKEKALAASTIS 632 Query: 1118 ISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGADIFDVHFN 1297 +SGK + ILRIDAPRT++VRISN+ E+ + T C +G V RVVSR D DVHF Sbjct: 633 LSGKQIQILRIDAPRTSVVRISNIGRETKQIKILST-CETYGAVNRVVSRDGDTVDVHFM 691 Query: 1298 VSEWPNMVNILNSLNGKVVNEHQWIAQPATVIPVEILRSLWSQPDGRMHVNSLIQNLCGK 1477 +SE NM+ ILN LNG VVN+ QW A+PATV P +IL+SLW +GR +V +++QNLC K Sbjct: 692 LSELGNMLQILNHLNGMVVNQCQWWARPATVFPPQILQSLWKSTEGRQYVLTVVQNLCRK 751 Query: 1478 VEENLSGVAKISQIAAQYY 1534 + A+++ + A+YY Sbjct: 752 TCPGSTVHAEMAGLVARYY 770 >ref|XP_007224114.1| hypothetical protein PRUPE_ppa015679mg [Prunus persica] gi|462421050|gb|EMJ25313.1| hypothetical protein PRUPE_ppa015679mg [Prunus persica] Length = 835 Score = 270 bits (689), Expect = 2e-69 Identities = 148/310 (47%), Positives = 205/310 (66%), Gaps = 4/310 (1%) Frame = +2 Query: 566 VTPKKDPN--PFLFTVK-GRDQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKE 736 +T K+D N P F+ K G ++ VLVRFL K V + D G I +I L+S ++ Sbjct: 511 LTSKEDLNKIPITFSQKEGSTESKVLVRFLHKNVKDDAVVNALNDCGEIVKIQLLSVSEG 570 Query: 737 SMFRDAYVSFKTKEGLQRALAKTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVP 916 S FRDA+V FKT QRAL KTD++I +VV+ +T ++ N++ PN+IGD ++P Sbjct: 571 SNFRDAWVHFKTSNESQRALRKTDLIIGNSEVVVVATS--LEDVLNKVSIPNVIGDSELP 628 Query: 917 TGLLENPTHSVMVKGLPPDLTLHHLREALS-CGAXXXXXXXXXXXXVAYIEFETEDAKEK 1093 L++NPT +VM+K L D++LHHL+ AL+ CG+ VA++EFETEDAKE Sbjct: 629 VALIKNPTRTVMIKHLTHDISLHHLKGALAFCGSGISSFFLGSSSSVAFVEFETEDAKET 688 Query: 1094 AIAESSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGA 1273 AIA S+++ GK LSILRID PRTT+VRISN F + +TIC+ HGQVK+ RG Sbjct: 689 AIAACSINVEGKQLSILRIDVPRTTVVRISN-FGGTVSKKRFQTICNSHGQVKQRKDRGR 747 Query: 1274 DIFDVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPATVIPVEILRSLWSQPDGRMHVNS 1453 DI DVHF ++EWPNM+ ILNSLNG V+ ++W+A+PA V P E+L+ LWS+PD R+HV S Sbjct: 748 DIVDVHFKLAEWPNMLTILNSLNGMEVDGNRWLARPAPVFPPEVLQVLWSRPDERIHVIS 807 Query: 1454 LIQNLCGKVE 1483 +++ L E Sbjct: 808 VLRRLLQNTE 817 >ref|XP_004297831.1| PREDICTED: uncharacterized protein LOC101296092 [Fragaria vesca subsp. vesca] Length = 736 Score = 246 bits (627), Expect = 3e-62 Identities = 135/313 (43%), Positives = 199/313 (63%), Gaps = 5/313 (1%) Frame = +2 Query: 620 QTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESMFRDAYVSFKTKEGLQRALA 799 ++ V+VRFL K V + I F D G I I L+ + S+FR YV FKT EG +AL Sbjct: 427 ESKVMVRFLHKFVQESSIYKAFDDCGCITRIQLLPLIEGSIFRAGYVHFKTAEGSHKALR 486 Query: 800 KTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTGLLENPTHSVMVKGLPPDLT 979 K+ +V G VV+++ ++ N+I PNLIGDP+VP L+++PT +VM+K L D++ Sbjct: 487 KSGIVSEGHTVVVDANS--LEDVPNKIAIPNLIGDPEVPLMLVKSPTRTVMIKQLTHDIS 544 Query: 980 LHHLREALS-CGAXXXXXXXXXXXXVAYIEFETEDAKEKAIAESSVSISGKLLSILRIDA 1156 LH L+EAL+ CG+ VAY+EFETEDAKE+AIA S+++ K L ILRID Sbjct: 545 LHQLKEALAFCGSGISSVFLGSSSSVAYVEFETEDAKERAIAAYSINVQEKQLLILRIDV 604 Query: 1157 PRTTIVRISNV----FSESGGTTYIRTICSLHGQVKRVVSRGADIFDVHFNVSEWPNMVN 1324 PRTT++R+++V FS + + I ++CS G V +V R + DV+FN+ WP M++ Sbjct: 605 PRTTVIRMTSVDDALFSNTKVLSDIVSVCSSSGGVGKVKLRNMGMLDVYFNLDHWPKMLS 664 Query: 1325 ILNSLNGKVVNEHQWIAQPATVIPVEILRSLWSQPDGRMHVNSLIQNLCGKVEENLSGVA 1504 ILN LNG V+ H+ IAQPATV P +L+ LWS+PD R+HV S+++ L +N Sbjct: 665 ILNRLNGMEVHGHRVIAQPATVFPPAVLQVLWSKPDERIHVKSVLRRLL----QNTDLPV 720 Query: 1505 KISQIAAQYYGER 1543 ++S +A +Y+G++ Sbjct: 721 ELSNLATKYHGDK 733 >ref|XP_002314482.2| hypothetical protein POPTR_0010s08070g [Populus trichocarpa] gi|550329344|gb|EEF00653.2| hypothetical protein POPTR_0010s08070g [Populus trichocarpa] Length = 696 Score = 238 bits (606), Expect = 7e-60 Identities = 134/325 (41%), Positives = 199/325 (61%), Gaps = 2/325 (0%) Frame = +2 Query: 569 TPKKDPN--PFLFTVKGRDQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESM 742 T +KD N P G + +L+RFL K V DI + F + GPI +I +SS K S Sbjct: 379 TSEKDSNQTPLTSLADGDTENKLLLRFLHKDVGDGDIISCFRNCGPISKIEKVSSVKGSN 438 Query: 743 FRDAYVSFKTKEGLQRALAKTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTG 922 DA++ F+T++GL +AL K +V+I+ + I T ++RI PNLIGD D+ Sbjct: 439 LFDAFLHFETRQGLHKALEKPEVLIKNSNAFIHDT-------ASRISIPNLIGDIDISVA 491 Query: 923 LLENPTHSVMVKGLPPDLTLHHLREALSCGAXXXXXXXXXXXXVAYIEFETEDAKEKAIA 1102 L+++PT +V +K L D++ H L+EALS AY+EFE+EDAKE+A+A Sbjct: 492 LVKHPTRTVKIKQLTDDISSHQLKEALSFCRSGINVFLGASSSNAYVEFESEDAKERALA 551 Query: 1103 ESSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGADIF 1282 + + +SGK LSI R+DAPRTT+VRI N+ + + TIC G++ R+ R +I Sbjct: 552 KHFLQVSGKQLSIFRVDAPRTTVVRILNINPQCRSN--VLTICKSFGKLWRMKLRHENIA 609 Query: 1283 DVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPATVIPVEILRSLWSQPDGRMHVNSLIQ 1462 DV+F + EWPNM+NILNSLNG + +W+AQPA++ P IL++LW+ PD R HV S +Q Sbjct: 610 DVYFKIDEWPNMLNILNSLNGLEADGSRWVAQPASIFPPIILQALWNHPDERRHVISSMQ 669 Query: 1463 NLCGKVEENLSGVAKISQIAAQYYG 1537 L K+E + A+++ +AA++ G Sbjct: 670 CLLKKLEHPMD-TAELNNLAARFCG 693 >ref|XP_002883427.1| nucleic acid binding protein [Arabidopsis lyrata subsp. lyrata] gi|297329267|gb|EFH59686.1| nucleic acid binding protein [Arabidopsis lyrata subsp. lyrata] Length = 785 Score = 213 bits (541), Expect = 3e-52 Identities = 124/312 (39%), Positives = 183/312 (58%), Gaps = 2/312 (0%) Frame = +2 Query: 563 KVTPKKDPNPFLFTVKGRDQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESM 742 KVT K + F + VL+RFL++ I VF FG + + I S + + Sbjct: 467 KVTKK---SLFALSAGEHSPNKVLLRFLQESCQKKHIVEVFSQFGAVLHVQEIPSFEGCI 523 Query: 743 FRDAYVSFKTKEGLQRALAKTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTG 922 ++DA ++F+T +++AL K V + + V+E+T +M RI P+LIGDPDVP Sbjct: 524 YKDALLTFETNTAVKKALEKGRVTVMNNNAVVEATSQ--EDMVERICIPDLIGDPDVPVA 581 Query: 923 LLENPTHSVMVKGLPPDLTLHHLREALS-CGAXXXXXXXXXXXXVAYIEFETEDAKEKAI 1099 L++ P+ +V + L D + + ++EAL C + A++EFETED KE+A+ Sbjct: 582 LVKEPSRTVKIHPLTHDFSSNQIKEALKFCRSNISKFILGSSRTDAFVEFETEDGKERAL 641 Query: 1100 AESSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGADI 1279 AE S+SI L I RID PRTT+ RISN+ +R +C +GQ+K+V RG + Sbjct: 642 AEHSISICNTQLFISRIDIPRTTVARISNL--SKSAMKDVRALCVPYGQIKQVYIRGNGV 699 Query: 1280 FDVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPA-TVIPVEILRSLWSQPDGRMHVNSL 1456 DV F+VSEWPNM+NILNSLNG ++ + + +PA TVIP EILR LW P + +V S+ Sbjct: 700 VDVLFDVSEWPNMLNILNSLNGMGIDGKKLVVRPATTVIPPEILRVLWKDPQEKRYVKSV 759 Query: 1457 IQNLCGKVEENL 1492 IQNL ++E+ L Sbjct: 760 IQNLVREIEQPL 771 >ref|XP_006406035.1| hypothetical protein EUTSA_v10020149mg [Eutrema salsugineum] gi|557107181|gb|ESQ47488.1| hypothetical protein EUTSA_v10020149mg [Eutrema salsugineum] Length = 730 Score = 208 bits (529), Expect = 6e-51 Identities = 124/312 (39%), Positives = 186/312 (59%), Gaps = 9/312 (2%) Frame = +2 Query: 584 PNPFL-------FTVKGRDQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESM 742 PNP L +V V +RFL + +K F +FG + + I S Sbjct: 412 PNPELTNKSLDALSVGEHSPNKVCLRFLPRFDKEEIVKR-FSEFGAVLDFQEIPSFDGCY 470 Query: 743 FRDAYVSFKTKEGLQRALAKTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTG 922 ++DA ++F+T +++AL K V+++ V++E+T N +I P+LIGDPDVP Sbjct: 471 YKDAVLTFETHSAVKKALKKAVVMVKNYSVIVEATSQEDN--VEKICIPDLIGDPDVPIA 528 Query: 923 LLENPTHSVMVKGLPPDLTLHHLREALS-CGAXXXXXXXXXXXXVAYIEFETEDAKEKAI 1099 LL+ PT +V + L ++ + ++EAL C + A++EFETED KE+A+ Sbjct: 529 LLKEPTRTVKIHPLAHGISSNQIKEALRFCRSDISKFILGSSKTAAFVEFETEDGKERAL 588 Query: 1100 AESSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGADI 1279 AE S+SI K L I RID PRTT+ RIS+ FS+ + IR +C+ +G++K+++ RG I Sbjct: 589 AEHSISIFNKQLFISRIDIPRTTVARISH-FSKPCMSD-IRKLCAPYGKIKQLLFRGDGI 646 Query: 1280 FDVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPA-TVIPVEILRSLWSQPDGRMHVNSL 1456 DVHF+VSEWPNM ILNS+NG ++ +W+ +PA TVIP EIL+ LW P G+ +V L Sbjct: 647 ADVHFDVSEWPNMHTILNSMNGMEIDGMKWVVRPATTVIPHEILKVLWEDPQGKRYVKGL 706 Query: 1457 IQNLCGKVEENL 1492 IQNL ++E+ L Sbjct: 707 IQNLVREIEQPL 718 >ref|XP_006297030.1| hypothetical protein CARUB_v10013033mg [Capsella rubella] gi|482565739|gb|EOA29928.1| hypothetical protein CARUB_v10013033mg [Capsella rubella] Length = 764 Score = 203 bits (517), Expect = 2e-49 Identities = 114/294 (38%), Positives = 175/294 (59%), Gaps = 2/294 (0%) Frame = +2 Query: 629 VLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESMFRDAYVSFKTKEGLQRALAKTD 808 VL+RFL++ DI VF FG + ++ I S + +++DA ++F+TK ++ AL K Sbjct: 462 VLLRFLQESFNKNDIVEVFSGFGTVLDVQEIPSLEGCIYKDALLTFETKTAVKDALKKVS 521 Query: 809 VVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTGLLENPTHSVMVKGLPPDLTLHH 988 V+++ V +E+ +M I P+LIGDPDVP L++ P + + + D + + Sbjct: 522 VMVKNYSVCVEAASQ--KDMVETICIPDLIGDPDVPIALVKEPARTAKIHPMTHDFSSNQ 579 Query: 989 LREALS-CGAXXXXXXXXXXXXVAYIEFETEDAKEKAIAESSVSISGKLLSILRIDAPRT 1165 ++EAL C + A++EFETED KE+A+A SVSI L I RID PRT Sbjct: 580 IKEALRFCRSNISKFILGSAGTDAFVEFETEDGKERALAGHSVSICNTQLFISRIDIPRT 639 Query: 1166 TIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGADIFDVHFNVSEWPNMVNILNSLNG 1345 T+ R SN + G +R +C +G++K+V RG I DV F+VSEWPNM++ILNSLNG Sbjct: 640 TVARFSNFSGSAMGD--VRALCVPYGKIKQVYHRGKGIADVRFDVSEWPNMLSILNSLNG 697 Query: 1346 KVVNEHQWIAQPAT-VIPVEILRSLWSQPDGRMHVNSLIQNLCGKVEENLSGVA 1504 V+ ++ + + AT VIP EIL LW P G+ +V S+I+NL ++E+ + + Sbjct: 698 MEVDGNKLVVRAATAVIPPEILSLLWRDPQGKRYVKSVIRNLVREIEQPIDATS 751 >ref|XP_004247930.1| PREDICTED: uncharacterized protein LOC101262563 [Solanum lycopersicum] Length = 851 Score = 202 bits (513), Expect = 5e-49 Identities = 112/289 (38%), Positives = 178/289 (61%), Gaps = 1/289 (0%) Frame = +2 Query: 617 DQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESMFRDAYVSFKTKEGLQRAL 796 D+ + ++F+ T D++ F G I ++ + S K + ++ A++ F++K+G Q+AL Sbjct: 547 DENKLTIKFVNVKATEDDVRDCFKSCGAITKV-VFPSVKSTNYKVAHIYFESKKGRQKAL 605 Query: 797 AKTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTGLLENPTHSVMVKGLPPDL 976 +DVVI+ VV+E+T P R+ P+LIG P+VPT L+++P+ +VM+K L ++ Sbjct: 606 EWSDVVIKNI-VVVEATFPPKGR--ERMCIPDLIGYPEVPTSLVKHPSRTVMIKELKHNV 662 Query: 977 TLHHLREALS-CGAXXXXXXXXXXXXVAYIEFETEDAKEKAIAESSVSISGKLLSILRID 1153 + H + EAL+ C + VAY+EFET + KE AIA+ S+++ G+ LSILRID Sbjct: 663 SFHDIEEALAFCRSNITGIYFGSSSSVAYVEFETVEGKEIAIAKHSLTMLGETLSILRID 722 Query: 1154 APRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGADIFDVHFNVSEWPNMVNILN 1333 APRTTIVRISN+ S + + C GQ + + I DVHF ++EWP M+ I+N Sbjct: 723 APRTTIVRISNIAIPSRAK--VISFCKNLGQTRHFFKKALGIMDVHFKLAEWPRMLEIIN 780 Query: 1334 SLNGKVVNEHQWIAQPATVIPVEILRSLWSQPDGRMHVNSLIQNLCGKV 1480 LNG V+ Q +A+PA + P ++L+ LWSQP+GR H+ + ++ KV Sbjct: 781 RLNGTEVDGQQLVAKPAPIYPPDVLKVLWSQPEGRKHLKTTFNSMLLKV 829 >ref|NP_188979.2| RNA recognition motif-containing protein [Arabidopsis thaliana] gi|11994322|dbj|BAB02281.1| unnamed protein product [Arabidopsis thaliana] gi|332643236|gb|AEE76757.1| RNA recognition motif-containing protein [Arabidopsis thaliana] Length = 811 Score = 199 bits (507), Expect = 2e-48 Identities = 120/313 (38%), Positives = 177/313 (56%), Gaps = 4/313 (1%) Frame = +2 Query: 566 VTPKKDPNPFLFTVKGRDQTT-VLVRFLRKLVTVTDI-KAVFGDFGPIDEISLISSTKES 739 V PK FL G VL+RFL + I KA FG + + I S + Sbjct: 489 VNPKVTKKSFLALSAGEHSPNKVLLRFLPESSMKKHIVKAFSSQFGAVLHVQEIPSIEGC 548 Query: 740 MFRDAYVSFKTKEGLQRALAKTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPT 919 +++DA ++F+T +++AL K V + + V+E+T +M RI P+LIGDPDVP Sbjct: 549 IYKDALLTFETNTAVKKALKKGHVTVMNYNTVVEATSQ--EDMVERICIPDLIGDPDVPV 606 Query: 920 GLLENPTHSVMVKGLPPDLTLHHLREALS-CGAXXXXXXXXXXXXVAYIEFETEDAKEKA 1096 L++ P +V + L D + + ++EAL C + A++EFETED KE+A Sbjct: 607 ALVKEPARTVKIHPLTHDFSSNQIKEALKFCRSNISKFTLGSSRTDAFVEFETEDGKERA 666 Query: 1097 IAESSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGAD 1276 +AE S+SI L I RID PRT + RISN+ +R +C +GQ++ V RG Sbjct: 667 LAEHSISICNTQLFISRIDIPRTIVARISNL--SKSAMRDVRALCVPYGQIRGVYIRGTG 724 Query: 1277 IFDVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPA-TVIPVEILRSLWSQPDGRMHVNS 1453 + DV F++SEWPNM+ ILNS+NG ++ + + +PA TVIP EILR LW P + +V S Sbjct: 725 VADVFFDISEWPNMLAILNSMNGMEIDGKKLVVRPATTVIPPEILRVLWKDPREKRYVKS 784 Query: 1454 LIQNLCGKVEENL 1492 +IQNL ++E+ L Sbjct: 785 VIQNLVREIEQPL 797 >ref|XP_006354470.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum] Length = 928 Score = 195 bits (496), Expect = 4e-47 Identities = 111/289 (38%), Positives = 174/289 (60%), Gaps = 1/289 (0%) Frame = +2 Query: 617 DQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESMFRDAYVSFKTKEGLQRAL 796 D+ + ++F+ T D+ F G I ++ + S + ++ A++ F++K+G Q+AL Sbjct: 624 DENKMTIKFVNVKATEQDVCDGFKGCGAITKV-VFPSVISTNYKVAHIYFESKKGKQKAL 682 Query: 797 AKTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTGLLENPTHSVMVKGLPPDL 976 +D VIR VV+E+T P R+ P+LIG P+VPT L+++P+ +VM+K L ++ Sbjct: 683 KWSDTVIRNV-VVVEATFPPKGR--ERMCIPDLIGHPEVPTSLVKHPSRTVMIKELKHNV 739 Query: 977 TLHHLREALS-CGAXXXXXXXXXXXXVAYIEFETEDAKEKAIAESSVSISGKLLSILRID 1153 + H + E L+ CG+ VAY+EFET + KE AIA+ S+ + G+ LSILRID Sbjct: 740 SFHDIEEVLAFCGSNITGIFFGSSSSVAYVEFETVEGKEIAIAKHSLIMLGETLSILRID 799 Query: 1154 APRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGADIFDVHFNVSEWPNMVNILN 1333 APRTTIVRISN+ S + + C GQ + ++ + DVHF +EWP M+ I+N Sbjct: 800 APRTTIVRISNIPVTSRAK--VISFCKKLGQTRYFFTKAFGVMDVHFKFAEWPRMLEIIN 857 Query: 1334 SLNGKVVNEHQWIAQPATVIPVEILRSLWSQPDGRMHVNSLIQNLCGKV 1480 LNG V+ Q +A+PA + P ++L+ LWSQP+GR H+ + +L KV Sbjct: 858 RLNGIEVDGQQLVAKPAPIYPPDVLKVLWSQPEGRKHLKTTFNSLLQKV 906 >ref|NP_001118682.1| RNA recognition motif-containing protein [Arabidopsis thaliana] gi|332643237|gb|AEE76758.1| RNA recognition motif-containing protein [Arabidopsis thaliana] Length = 771 Score = 193 bits (490), Expect = 2e-46 Identities = 108/277 (38%), Positives = 163/277 (58%), Gaps = 2/277 (0%) Frame = +2 Query: 668 DIKAVFGDFGPIDEISLISSTKESMFRDAYVSFKTKEGLQRALAKTDVVIRGKDVVIEST 847 D V FG + + I S + +++DA ++F+T +++AL K V + + V+E+T Sbjct: 485 DATTVNPKFGAVLHVQEIPSIEGCIYKDALLTFETNTAVKKALKKGHVTVMNYNTVVEAT 544 Query: 848 RPLANNMSNRIFAPNLIGDPDVPTGLLENPTHSVMVKGLPPDLTLHHLREALS-CGAXXX 1024 +M RI P+LIGDPDVP L++ P +V + L D + + ++EAL C + Sbjct: 545 SQ--EDMVERICIPDLIGDPDVPVALVKEPARTVKIHPLTHDFSSNQIKEALKFCRSNIS 602 Query: 1025 XXXXXXXXXVAYIEFETEDAKEKAIAESSVSISGKLLSILRIDAPRTTIVRISNVFSESG 1204 A++EFETED KE+A+AE S+SI L I RID PRT + RISN+ Sbjct: 603 KFTLGSSRTDAFVEFETEDGKERALAEHSISICNTQLFISRIDIPRTIVARISNL--SKS 660 Query: 1205 GTTYIRTICSLHGQVKRVVSRGADIFDVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPA 1384 +R +C +GQ++ V RG + DV F++SEWPNM+ ILNS+NG ++ + + +PA Sbjct: 661 AMRDVRALCVPYGQIRGVYIRGTGVADVFFDISEWPNMLAILNSMNGMEIDGKKLVVRPA 720 Query: 1385 -TVIPVEILRSLWSQPDGRMHVNSLIQNLCGKVEENL 1492 TVIP EILR LW P + +V S+IQNL ++E+ L Sbjct: 721 TTVIPPEILRVLWKDPREKRYVKSVIQNLVREIEQPL 757 >gb|EYU32569.1| hypothetical protein MIMGU_mgv1a003227mg [Mimulus guttatus] Length = 598 Score = 183 bits (465), Expect = 2e-43 Identities = 111/316 (35%), Positives = 172/316 (54%), Gaps = 2/316 (0%) Frame = +2 Query: 539 LQIPVPYKKVTPKKDPNPFLFTVKGRDQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISL 718 LQ P P KK F + + VL+RFLR T I F G I ++ Sbjct: 276 LQRPGPTKKTH---------FDGQKSKENKVLIRFLRSNATDAHIFQCFESCGEISKVE- 325 Query: 719 ISSTKESMFRDAYVSFKTKEGLQRALAKTDVVIRGKDVVIESTRPLAN-NMSNRIFAPNL 895 I + S+F+ Y+ FKT+EG +AL KT +++ G V +ES N+ I P+L Sbjct: 326 IPYAEASLFKSGYIYFKTREGFNKALKKTSLLVAGGIVTVESASSTRKRNVKTPI--PSL 383 Query: 896 IGDPDVPTGLLENPTHSVMVKGLPPDLTLHHLREALS-CGAXXXXXXXXXXXXVAYIEFE 1072 IGD + P L++NPT ++ ++ L +++ +H+ EALS C VAY+EFE Sbjct: 384 IGDHNTPAALVKNPTRTIKIESLSREISSNHIEEALSFCETNISGYFLGSSDSVAYVEFE 443 Query: 1073 TEDAKEKAIAESSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSLHGQVK 1252 +E KE+A+A+ +++ G+ L +LR+D+PRTT+VRI + I C L G+V Sbjct: 444 SEIGKERALAKQWINVLGRRLVMLRVDSPRTTVVRI--IGRNQSKMKNILAECRLLGKVG 501 Query: 1253 RVVSRGADIFDVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPATVIPVEILRSLWSQPD 1432 SR + DVHF + EWPNM+ I+N LNG ++ + A+ A V P ++L +LW +P+ Sbjct: 502 ASFSRTPGVLDVHFELDEWPNMLKIINRLNGMEIDGARVQAESAPVFPPDVLLALWQEPN 561 Query: 1433 GRMHVNSLIQNLCGKV 1480 R H+ + +Q L K+ Sbjct: 562 ERKHLKTTMQVLLHKL 577 >gb|ABF59161.1| unknown protein [Arabidopsis thaliana] Length = 226 Score = 135 bits (341), Expect = 4e-29 Identities = 70/147 (47%), Positives = 98/147 (66%), Gaps = 1/147 (0%) Frame = +2 Query: 1055 AYIEFETEDAKEKAIAESSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICS 1234 A++EFETED KE+A+AE S+SI L I RID PRT + RISN+ +R +C Sbjct: 68 AFVEFETEDGKERALAEHSISICNTQLFISRIDIPRTIVARISNL--SKSAMRDVRALCV 125 Query: 1235 LHGQVKRVVSRGADIFDVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPA-TVIPVEILR 1411 +GQ++ V RG + DV F++SEWPNM+ ILNS+NG ++ + + +PA TVIP EILR Sbjct: 126 PYGQIRGVYIRGTGVADVFFDISEWPNMLAILNSMNGMEIDGKKLVVRPATTVIPPEILR 185 Query: 1412 SLWSQPDGRMHVNSLIQNLCGKVEENL 1492 LW P + +V S+IQNL ++E+ L Sbjct: 186 VLWKDPREKRYVKSVIQNLVREIEQPL 212 >ref|XP_001778539.1| predicted protein [Physcomitrella patens] gi|162670137|gb|EDQ56712.1| predicted protein [Physcomitrella patens] Length = 666 Score = 125 bits (314), Expect = 5e-26 Identities = 93/309 (30%), Positives = 148/309 (47%), Gaps = 17/309 (5%) Frame = +2 Query: 617 DQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESMFRDAYVSFKTKEGLQRAL 796 D+ + +R+L T D+K F D G I I + +V FKT E LQ+AL Sbjct: 303 DRHGLFLRYLSPQATPADLKEAFSDCGEIIRAQAIKPRTHQKYTYGFVDFKTAEALQKAL 362 Query: 797 AKTDVVIRGKDVVIE--STRPLANNMSNRI------FAPNLIGDPDVPTGL-------LE 931 K V I+G + E S+ P + + + FA + + D + L + Sbjct: 363 EKDKVYIKGVRIRKEPSSSTPKIPDRNGNVNPLTVDFAQSRVSDNSPGSFLGPSKSKGIR 422 Query: 932 NPTHSVMVKGLPPDLTLHHLREALSCGAXXXXXXXXXXXX--VAYIEFETEDAKEKAIAE 1105 +SV V+ +P + L ++EALS +A +EF+ EDA++KA++ Sbjct: 423 RTGYSVAVEDVPLHIPLTEVKEALSKYGEIAHSSRKEGHGGYIANVEFKGEDARKKALSA 482 Query: 1106 SSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGADIFD 1285 SV ++G SILR+D +T++VR++N T I+ C LHG+V +V+SR I D Sbjct: 483 KSVQLNGMHYSILRVDPIKTSVVRLNNA-GFVDNTEQIQATCELHGRVDKVISRCDGIVD 541 Query: 1286 VHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPATVIPVEILRSLWSQPDGRMHVNSLIQN 1465 V+F+ SE NM IL+ LN V W AQP++ + SL G+ + + Sbjct: 542 VYFHPSELENMPRILSRLNEVKVEGQVWQAQPSSCMDPSSYLSLMRTRGGQEWLQQESER 601 Query: 1466 LCGKVEENL 1492 + G++E L Sbjct: 602 MLGRIESAL 610 >ref|XP_007035162.1| RNA-binding (RRM/RBD/RNP motifs) family protein [Theobroma cacao] gi|508714191|gb|EOY06088.1| RNA-binding (RRM/RBD/RNP motifs) family protein [Theobroma cacao] Length = 245 Score = 118 bits (296), Expect = 7e-24 Identities = 67/169 (39%), Positives = 98/169 (57%), Gaps = 1/169 (0%) Frame = +2 Query: 587 NPFLFTVKGRDQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESMFRDAYVSF 766 +P T +G + VL RFL + + I A F D PI + +S TK+SMF+D V F Sbjct: 60 SPMRSTKEGSKENMVLDRFLTQNIEKHSILAAFCDCWPIVNVEEVSLTKQSMFKDFVVHF 119 Query: 767 KTKEGLQRALAKTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTGLLENPTHS 946 +T+EG Q L KTD+++ + +E++ + +M + I P+LIGDPD P L++NPT + Sbjct: 120 ETREGYQNTLKKTDLMVLNAEAFVEASS--SEDMDDAISIPDLIGDPDAPVALVKNPTKT 177 Query: 947 VMVKGLPPDLTLHHLREALS-CGAXXXXXXXXXXXXVAYIEFETEDAKE 1090 V VK L D++ L+EAL+ C + V Y+EFETEDAKE Sbjct: 178 VKVKQLSEDISSQQLKEALAFCQSGISSFYLGSTSSVLYVEFETEDAKE 226 >ref|XP_001769584.1| predicted protein [Physcomitrella patens] gi|162679126|gb|EDQ65577.1| predicted protein [Physcomitrella patens] Length = 593 Score = 111 bits (278), Expect = 8e-22 Identities = 87/311 (27%), Positives = 139/311 (44%), Gaps = 23/311 (7%) Frame = +2 Query: 629 VLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKESMFRDAYVSFKTKEGLQRALAKTD 808 + VR+L T D++ F D G I I + + +V FKT E LQ+ALAKT Sbjct: 255 IFVRYLSCRATPKDLREAFADCGDIVRAYAIRARPNVKYTYGFVDFKTAEALQKALAKTR 314 Query: 809 VVIRGKDVVIESTRPLANNMSNR------------IFAPNLIGDPDVPTGLLENPT---- 940 V IRG V E + +++SN+ + + G V ++ P Sbjct: 315 VYIRGSRVTTEPSTTTPHHLSNKSWDDSQNSSAGGVESTTSQGTEKVTGTYVDAPNVKAI 374 Query: 941 ----HSVMVKGLPPDLTLHHLREALSCGAXXXXXXXXXXXXVAY---IEFETEDAKEKAI 1099 + V V G+P ++ L ++ ALS Y +EF+ D+++ A+ Sbjct: 375 RGSGYKVAVMGIPMNVPLSEVQSALSKYGEIVLSDMKQENIGTYSANLEFKAVDSRDDAL 434 Query: 1100 AESSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGADI 1279 + SV ++G I R++ T++VR+SNV E+ I C L G+V VV+R Sbjct: 435 SAKSVQLNGSQYPIFRVNPVNTSVVRLSNVGVEA-NLDQIGATCELFGRVSEVVARCDFS 493 Query: 1280 FDVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPATVIPVEILRSLWSQPDGRMHVNSLI 1459 DV+F +E NM IL LN +N +W AQP+ + +L +G+ + Sbjct: 494 VDVYFQSTELENMSRILARLNEVTMNGRRWHAQPSPCLGSGSYEALLKTREGQEWLQLES 553 Query: 1460 QNLCGKVEENL 1492 + + KVE L Sbjct: 554 ERMLSKVENAL 564 >gb|ADE76534.1| unknown [Picea sitchensis] Length = 179 Score = 101 bits (251), Expect = 1e-18 Identities = 56/145 (38%), Positives = 87/145 (60%) Frame = +2 Query: 1058 YIEFETEDAKEKAIAESSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSL 1237 YIEF+T +AKE+A+A V ++GK LSI D P TTI RI+N+ SE+ TT + ++C Sbjct: 5 YIEFKTIEAKERALAARWVFVNGKQLSICWSDFPVTTIARITNLSSETTATT-VHSVCMS 63 Query: 1238 HGQVKRVVSRGADIFDVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPATVIPVEILRSL 1417 +G V+ + R DVH++++E PNM IL LN +N W+AQPA +P + L Sbjct: 64 YGNVESLQIRKDGSMDVHYSINELPNMPKILEMLNEIAINGSHWMAQPAPRLP----QGL 119 Query: 1418 WSQPDGRMHVNSLIQNLCGKVEENL 1492 P+G+ + + N G ++++L Sbjct: 120 AKTPEGQNWLGKQLSNYIGNIKKHL 144 >ref|XP_002991376.1| hypothetical protein SELMODRAFT_429718 [Selaginella moellendorffii] gi|300140769|gb|EFJ07488.1| hypothetical protein SELMODRAFT_429718 [Selaginella moellendorffii] Length = 595 Score = 97.4 bits (241), Expect = 2e-17 Identities = 89/335 (26%), Positives = 150/335 (44%), Gaps = 22/335 (6%) Frame = +2 Query: 578 KDPNPFLFTVKGRDQTTVLVRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKE-SMFRDA 754 K+P LF + + + ++FL T +D+KA F G I I ++ S + + Sbjct: 253 KNPAQPLFRSDLQPRHGLYIKFLAPQATESDVKAAFRSCGDIHRIQIVRSRNPGAKYIYG 312 Query: 755 YVSFKTKEGLQRALAKTDVVIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTGLLEN 934 +V F T+E L AL K +++IRG V E+++ N D +N Sbjct: 313 FVDFTTEEALNSALKK-EIIIRGIKVQTETSK---NGAPKEKQQKQESDSDDDAAKSNKN 368 Query: 935 PTHSVMVK-----GLPPD-------------LTLHHLREALSCGAXXXXXXXXXXXX--- 1051 T + K LPP+ +L + +AL Sbjct: 369 KTSLAVTKTMDDLALPPEKRRCTLALESIPHASLPQIIDALGQYGEVVNSQTKHSSGGNT 428 Query: 1052 VAYIEFETEDAKEKAIAESSVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTIC 1231 AY+EF++E K+ A+A S V + GK L + R+D P TT+VR+SN+ +++ I + Sbjct: 429 TAYVEFKSEIEKDTALAASKVILGGKSLKLTRVDIPLTTVVRLSNIPADA--REKIASSY 486 Query: 1232 SLHGQVKRVVSRGADIFDVHFNVSEWPNMVNILNSLNGKVVNEHQWIAQPATVIPVEILR 1411 +GQV +V R + D++++ SE NM IL+ LNG V+ + A PA + + + Sbjct: 487 KSYGQVDKVDPRMDGVVDIYYSASEIKNMAKILDKLNGLHVSRRKLRAMPAPISYPKTMA 546 Query: 1412 SLWSQPDGRMHVNSLIQNLCGKVEENLSGVAKISQ 1516 L P G + S + + +E +L+ A S+ Sbjct: 547 ELAGTPQGLKWLESQRELVRKNLEISLAKAAMYSR 581 >ref|XP_002988242.1| hypothetical protein SELMODRAFT_447288 [Selaginella moellendorffii] gi|300143974|gb|EFJ10661.1| hypothetical protein SELMODRAFT_447288 [Selaginella moellendorffii] Length = 595 Score = 95.1 bits (235), Expect = 8e-17 Identities = 84/316 (26%), Positives = 138/316 (43%), Gaps = 22/316 (6%) Frame = +2 Query: 635 VRFLRKLVTVTDIKAVFGDFGPIDEISLISSTKE-SMFRDAYVSFKTKEGLQRALAKTDV 811 ++FL T +D+KA F G I I ++ S + + +V F T+E L AL K ++ Sbjct: 272 IKFLAPQATESDVKAAFRSCGDIHRIQIVRSRNPGAKYIYGFVDFTTEEALNSALKK-EI 330 Query: 812 VIRGKDVVIESTRPLANNMSNRIFAPNLIGDPDVPTGLLENPTHSVMVKGLP-------- 967 +IRG V T P N D +N T + K + Sbjct: 331 IIRGIKV---QTEPSKNGAPKEKQQKQESDSDDDAAKSNKNKTSLAVTKTMDDLALAPEK 387 Query: 968 ----------PDLTLHHLREALSCGAXXXXXXXXXXXX---VAYIEFETEDAKEKAIAES 1108 P +L + +AL AY+EF++E K+ A+A S Sbjct: 388 RRCTLALESIPHASLPQIIDALGQYGEVVNSQTKHSSGGNTTAYVEFKSEIEKDTALAAS 447 Query: 1109 SVSISGKLLSILRIDAPRTTIVRISNVFSESGGTTYIRTICSLHGQVKRVVSRGADIFDV 1288 V + GK L + R+D P TT+VR+SN+ +++ I + +GQV +V R + D+ Sbjct: 448 KVILGGKSLKLTRVDIPLTTVVRLSNIPADA--REKIASSYKSYGQVDKVDPRMDGVVDI 505 Query: 1289 HFNVSEWPNMVNILNSLNGKVVNEHQWIAQPATVIPVEILRSLWSQPDGRMHVNSLIQNL 1468 +++ SE NM IL+ LNG V+ + A PA + + + L P G + S + + Sbjct: 506 YYSASEIKNMAKILDKLNGLHVSRRKLRAMPAPISYPKTMAELAGTPQGLKWLESQRELV 565 Query: 1469 CGKVEENLSGVAKISQ 1516 +E +L+ A S+ Sbjct: 566 RKNLEISLAKAAMYSR 581 >ref|XP_002978664.1| hypothetical protein SELMODRAFT_418465 [Selaginella moellendorffii] gi|300153473|gb|EFJ20111.1| hypothetical protein SELMODRAFT_418465 [Selaginella moellendorffii] Length = 766 Score = 90.5 bits (223), Expect = 2e-15 Identities = 68/220 (30%), Positives = 108/220 (49%), Gaps = 5/220 (2%) Frame = +2 Query: 848 RPLANNMSNRIFAPNL--IGDPDVPTGLLENPTHSVMVKGLPPDLTLHHLREALSCGAXX 1021 RPL N F + +GDP L SV V+ LPP + LR+AL+ Sbjct: 522 RPLPPRNENGGFVSPIPPLGDPQKMRKDL-----SVCVENLPPTTNVTQLRDALAIHGEI 576 Query: 1022 XXXXXXXXXXVA---YIEFETEDAKEKAIAESSVSISGKLLSILRIDAPRTTIVRISNVF 1192 + E+ T++ KE A++ V I+G+L+ I R D P+TT+VR+SN+ Sbjct: 577 YATYLKHRDMNSSTYVFEYLTDEGKESALSAHWVHINGQLVRISRADVPKTTVVRVSNIS 636 Query: 1193 SESGGTTYIRTICSLHGQVKRVVSRGADIFDVHFNVSEWPNMVNILNSLNGKVVNEHQWI 1372 + + +I C GQV+RV SR + DV ++ +E NM+ IL+ LN +N+ +W Sbjct: 637 PSTPDSEFINA-CKSCGQVERVESRRDRVRDVFYHPAETRNMIKILDRLNEVTINQSRWH 695 Query: 1373 AQPATVIPVEILRSLWSQPDGRMHVNSLIQNLCGKVEENL 1492 A+PA + +L ++P+G + I+ L E L Sbjct: 696 AKPAPRCSINML----NRPEGSEFICQQIKALLRNQESML 731