BLASTX nr result
ID: Akebia23_contig00004397
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00004397
         (4251 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002270255.1| PREDICTED: uncharacterized protein LOC100244...   539   e-150
gb|EXC21916.1| Tripartite motif-containing protein 45 [Morus not...   531   e-147
emb|CBI36502.3| unnamed protein product [Vitis vinifera]              527   e-146
ref|XP_007224346.1| hypothetical protein PRUPE_ppa020677mg, part...   520   e-144
ref|XP_003531222.1| PREDICTED: serine/arginine repetitive matrix...   511   e-141
ref|XP_004134373.1| PREDICTED: uncharacterized protein LOC101203...   511   e-141
ref|XP_004157720.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   506   e-140
ref|XP_006373079.1| RNA recognition motif-containing family prot...   503   e-139
ref|XP_003524186.1| PREDICTED: splicing regulatory glutamine/lys...   503   e-139
ref|XP_006373080.1| hypothetical protein POPTR_0017s08510g [Popu...   496   e-137
ref|XP_002300152.2| RNA recognition motif-containing family prot...   487   e-134
ref|XP_007034241.1| RNA recognition motif-containing protein iso...   481   e-133
ref|XP_007034240.1| RNA recognition motif-containing protein iso...   481   e-133
ref|XP_002518040.1| conserved hypothetical protein [Ricinus comm...   473   e-130
gb|EYU31494.1| hypothetical protein MIMGU_mgv1a001194mg [Mimulus...   463   e-127
ref|XP_002885603.1| hypothetical protein ARALYDRAFT_898933 [Arab...   462   e-127
ref|XP_004296963.1| PREDICTED: uncharacterized protein LOC101297...   460   e-126
ref|XP_007153615.1| hypothetical protein PHAVU_003G050400g [Phas...   458   e-125
ref|XP_006421067.1| hypothetical protein CICLE_v10004448mg [Citr...
458 e-125 ref|XP_006492975.1| PREDICTED: uncharacterized protein LOC102615... 457 e-125 >ref|XP_002270255.1| PREDICTED: uncharacterized protein LOC100244513 [Vitis vinifera] Length = 926 Score = 539 bits (1388), Expect = e-150 Identities = 364/884 (41%), Positives = 431/884 (48%), Gaps = 41/884 (4%) Frame = +1 Query: 187 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNT---VKESI 357 M DR+ S+A+ +PIWMKQ TFKD + V +S Sbjct: 1 MGDRT----SSAVTKPIWMKQAEEAKIKSEAEKAAAAKAAFEATFKDAASASAPAVADSS 56 Query: 358 SSDSDT--EENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPN 531 SSDSD E+ E + +KP+GPVDPSKCT SSFVVVTKDSDGRK+PN Sbjct: 57 SSDSDDAEEDAESRLASKPIGPVDPSKCTAAGAGIAGGAACSASSFVVVTKDSDGRKVPN 116 Query: 532 GGAQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPF 711 GGAQ++V++ PGVGVGGS+QEG++KDQGDG+Y VTYVV KRGNYMVHVEC+GK IMGSPF Sbjct: 117 GGAQIRVRVSPGVGVGGSDQEGIIKDQGDGSYTVTYVVSKRGNYMVHVECNGKPIMGSPF 176 Query: 712 PVFFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXX 891 PVFFSA VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 177 PVFFSAGTASGGLLGLAPASTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGIVPGASG 236 Query: 892 XXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXX 1071 E+CREYLNGRC KTDCKF HPPHN S Sbjct: 237 GAVLPGIGASLGEVCREYLNGRCAKTDCKFNHPPHNLLMTALAATTTMGTLSQVPMAPSA 296 Query: 1072 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTT 1251 +DS+ S DK GKAD+LKKTLQ+SNLSPLLT Sbjct: 297 AAMAAAQAIVAAQALQAHAAQVQAQAQSAKDSAGSPDKVGKADALKKTLQVSNLSPLLTV 356 Query: 1252 DQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLP 1431 +QLKQLFS+CGTVVECSITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP Sbjct: 357 EQLKQLFSFCGTVVECSITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLP 416 Query: 1432 PKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMAS 1611 PKPAILNS + SLP NRAATMKSATE+AS Sbjct: 417 PKPAILNSPLASPSLPMVMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKSATELAS 476 Query: 1612 ARAAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1791 ARAAEISKKLKADG EEK Sbjct: 477 
ARAAEISKKLKADGFVEEEKEEKEENRKSRSPSISHARSKSRSKSPLHYRRRRRSRSFSP 536 Query: 1792 XXXXXXXXXXXXXXXXTNYGSERRSHRQVRDS---NDRSGRWERDRTRDHY-XXXXXXXX 1959 +Y R RD+ +DRS R + DR+ DH+ Sbjct: 537 PSRYSREHRSRSPFRSHHYSIHDHGSRSYRDNKDGSDRSRRRDLDRSHDHHLSSSRRNRS 596 Query: 1960 XXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSG 2139 TRKS RA S SPK R ES S RTRKSSR Sbjct: 597 RSRSPRTRKSYRADSESPKRRVES----------------------SSHRTRKSSRVSPK 634 Query: 2140 SPKFHRESLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAE------ 2301 SP+ HR S +SS N +N S RRRSRSKS E Sbjct: 635 SPRHHRGS------RSSPRN----------------DDDNKSKRRRRSRSKSVEGKHYSN 672 Query: 2302 DEAHXXXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPR 2481 ++ + E ++GS SPR +S S +R+RSRSKSAE + Sbjct: 673 EKIDERRDKKSKHRDRRRSRSISAEGKHHKGSGFSPRSFDDSKSKHRKRSRSKSAEGKRV 732 Query: 2482 SDDRKERKKSEKVKLDGRNDEKTDRATKELD--------LSAKDSRDLK----------- 2604 D+ + + EK G++ EK ++ + LS K S +++ Sbjct: 733 LSDKTDEGRDEK----GKHHEKRRSRSRSAEGKYCRLNRLSPKSSDEIRPKHRRHSRSRS 788 Query: 2605 -EYGTSDPRRKDTLL------EDGSSSDEKYGSNHKRSRLDDKD 2715 EY SD + + L+ E + D K S HK +++D + Sbjct: 789 AEYRRSDNKGDEKLMHHKEPKEREVTEDLKEPSKHKMPKIEDME 832 >gb|EXC21916.1| Tripartite motif-containing protein 45 [Morus notabilis] Length = 973 Score = 531 bits (1368), Expect = e-147 Identities = 381/994 (38%), Positives = 467/994 (46%), Gaps = 44/994 (4%) Frame = +1 Query: 187 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESISSD 366 MADRS +ALA+PIW+KQ TFKDVE + K ++ Sbjct: 1 MADRS-----SALAKPIWVKQAEEAKLKSEAEKAAAAKAAFEATFKDVEKSREKGGAAAS 55 Query: 367 SDTEENEDL---VKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGG 537 SD+E +E+ + KP+GP DP+KC SSFVV KD+DGRK PNGG Sbjct: 56 SDSESDEEAEEDLSRKPIGPADPAKCMAAGAGIAGGTACAPSSFVVTAKDADGRKCPNGG 115 Query: 538 AQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPV 717 AQ+KVK+ PGVGVGG+EQEG+VKD GDGTY VTYVVPKRGNYMV+VEC+GK IMGSPFPV Sbjct: 116 AQIKVKVSPGVGVGGTEQEGVVKDMGDGTYTVTYVVPKRGNYMVNVECNGKPIMGSPFPV 175 Query: 718 
FFSAXXXXXXXXXXXXXXXXXXX---VNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXX 888 FFSA VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 176 FFSAGATTPTSGGLLGLAPTSTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGIIPGAS 235 Query: 889 XXXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXX 1068 E+CREYLNGRC KTDCK HPPHN S Sbjct: 236 GGAILPGIGASLGEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTSMGTVSQVPMAPS 295 Query: 1069 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLT 1248 +DSS S DKAGK D+LKKTLQ+SNLSPLLT Sbjct: 296 AAAMAAAQAIVAAQALQAHAAQVQAQAKSGKDSSASPDKAGKDDALKKTLQVSNLSPLLT 355 Query: 1249 TDQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSL 1428 +QLKQLFS+CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRPMNVEMAKSL Sbjct: 356 VEQLKQLFSFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPMNVEMAKSL 415 Query: 1429 PPKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMA 1608 P KPAILNS + SSLP +RAATMKSATE+A Sbjct: 416 PQKPAILNSQLASSSLPMMMQQAVAMQQMQFQQALLMQQTMMTQQAASRAATMKSATELA 475 Query: 1609 SARAAEISKKLKADGVGNE------EKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1770 +ARAAEISKKLKADG+ +E EK Sbjct: 476 AARAAEISKKLKADGLVSEEKEEKEEKEAKPKSRSPSPSRKKSRSKSRSPINYHRRRRSP 535 Query: 1771 XXXXXXXXXXXXXXXXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRDHY-XXXX 1947 ++Y +ERRS R++RD DR R + R+RDH+ Sbjct: 536 SYSPPSRQARDRRSRSPIRSRHYSSYDNERRSFREIRDGGDRYRRRDSGRSRDHHVSSSR 595 Query: 1948 XXXXXXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSR 2127 RKS R S SPK RES R+++R Sbjct: 596 KHRSRSASPGRRKSYRVDSVSPKRHRES-------------------------TPRRATR 630 Query: 2128 AGSGSPKFHR-ESLSPRTKKSSRANXXXXXXXXXXXXXXXXT-------HENVSHY-RRR 2280 AGS SP + R SPR + N HE H RRR Sbjct: 631 AGSRSPSYSRGNRSSPRIDDERKLNHRKRSRSISPDGKYHSNGTRDETRHERSKHRDRRR 690 Query: 2281 SRSKSAEDEAHXXXXXXXXXXXXXXXXXXNDEKT---DNRGSKSSPRRAHESVSHYRRRS 2451 SRS SAED+ H + K+ +R + R +S RRRS Sbjct: 691 SRSVSAEDKHHRMSFARSTNETKSKHRRRSRSKSVEGKHRSVEKDANRDDKSKHRGRRRS 750 Query: 2452 RSKSAEDEPRSDDRKERKKSEKVKLDGRNDEKTDRATKELDLSAKDSRDLKEYGTSDPRR 2631 RS S E + SD + ++E K D R R+ D D D E G ++ + Sbjct: 751 
RSTSLESKRLSDGKMNETRNEDSKHDVRRSRSRSRSESLEDKFHFD--DSVEGGRNEKSK 808 Query: 2632 KDTLLEDGSSSDEKYGSNHK-RSRLDDKDSE--KHDSVFKDKNDLMDVSEGVKSPSVS-- 2796 S S E SNHK + ++DD + KH + ++ ++ +S S S Sbjct: 809 HHAKRRSRSRSVE---SNHKLKEKVDDGRDKRPKHRGRRRSRSVSVEAKHHRRSRSSSRS 865 Query: 2797 --------ARYNDSAPVDDRTHSRTK-DSSRYEKSTSDRRRHEKIDTTRRE--RDTSGMD 2943 +R + S D + + + K + +RY+KS S RR+ + + R +SG Sbjct: 866 SGETKMKHSRRSGSKSPDGKNNFKDKLNETRYKKSKSGRRKRSRSSLLEEKLRRGSSGSQ 925 Query: 2944 RAHIGCEDLS-RHGRLTSENK--KHEKVESIHRE 3036 + E + R + SE + KHE + H E Sbjct: 926 SSSDESESKNIRRSKSDSEGRPSKHEAPITEHLE 959 >emb|CBI36502.3| unnamed protein product [Vitis vinifera] Length = 888 Score = 527 bits (1357), Expect = e-146 Identities = 359/878 (40%), Positives = 424/878 (48%), Gaps = 35/878 (3%) Frame = +1 Query: 187 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNT---VKESI 357 M DR+ S+A+ +PIWMKQ TFKD + V +S Sbjct: 1 MGDRT----SSAVTKPIWMKQAEEAKIKSEAEKAAAAKAAFEATFKDAASASAPAVADSS 56 Query: 358 SSDSDT--EENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPN 531 SSDSD E+ E + +KP+GPVDPSKCT SSFVVVTKDSDGRK+PN Sbjct: 57 SSDSDDAEEDAESRLASKPIGPVDPSKCTAAGAGIAGGAACSASSFVVVTKDSDGRKVPN 116 Query: 532 GGAQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPF 711 GGAQ++V++ PGVGVGGS+QEG++KDQGDG+Y VTYVV KRGNYMVHVEC+GK IMGSPF Sbjct: 117 GGAQIRVRVSPGVGVGGSDQEGIIKDQGDGSYTVTYVVSKRGNYMVHVECNGKPIMGSPF 176 Query: 712 PVFFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXX 891 PVFFSA VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 177 PVFFSAGTASGGLLGLAPASTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGIVPGASG 236 Query: 892 XXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXX 1071 E+CREYLNGRC KTDCKF HPPHN S Sbjct: 237 GAVLPGIGASLGEVCREYLNGRCAKTDCKFNHPPHNLLMTALAATTTMGTLSQVPMAPSA 296 Query: 1072 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTT 1251 +DS+ S DK GKAD+LKKTLQ+SNLSPLLT Sbjct: 297 AAMAAAQAIVAAQALQAHAAQVQAQAQSAKDSAGSPDKVGKADALKKTLQVSNLSPLLTV 356 Query: 1252 DQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLP 1431 
+QLKQLFS+CGTVVECSITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP Sbjct: 357 EQLKQLFSFCGTVVECSITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLP 416 Query: 1432 PKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMAS 1611 PKPAILNS + SLP NRAATMKSATE+AS Sbjct: 417 PKPAILNSPLASPSLPMVMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKSATELAS 476 Query: 1612 ARAAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1791 ARAAEISKKLKADG EEK Sbjct: 477 ARAAEISKKLKADGFVEEEKEEKEENRKSRSPSISHARSKSRSKSPLHYRRRRRSRSFSP 536 Query: 1792 XXXXXXXXXXXXXXXXTNYGSERRSHRQVRDS---NDRSGRWERDRTRDHY-XXXXXXXX 1959 +Y R RD+ +DRS R + DR+ DH+ Sbjct: 537 PSRYSREHRSRSPFRSHHYSIHDHGSRSYRDNKDGSDRSRRRDLDRSHDHHLSSSRRNRS 596 Query: 1960 XXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSG 2139 TRKS RA S SPK R ES S RTRKSSR Sbjct: 597 RSRSPRTRKSYRADSESPKRRVES----------------------SSHRTRKSSR---- 630 Query: 2140 SPKFHRESLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXX 2319 + E + R K S+ H + RRRSRS SAE + H Sbjct: 631 --HYSNEKIDERRDKKSK-------------------HRD----RRRSRSISAEGKHH-- 663 Query: 2320 XXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKE 2499 +GS SPR +S S +R+RSRSKSAE + D+ + Sbjct: 664 -----------------------KGSGFSPRSFDDSKSKHRKRSRSKSAEGKRVLSDKTD 700 Query: 2500 RKKSEKVKLDGRNDEKTDRATKELD--------LSAKDSRDLK------------EYGTS 2619 + EK G++ EK ++ + LS K S +++ EY S Sbjct: 701 EGRDEK----GKHHEKRRSRSRSAEGKYCRLNRLSPKSSDEIRPKHRRHSRSRSAEYRRS 756 Query: 2620 DPRRKDTLL------EDGSSSDEKYGSNHKRSRLDDKD 2715 D + + L+ E + D K S HK +++D + Sbjct: 757 DNKGDEKLMHHKEPKEREVTEDLKEPSKHKMPKIEDME 794 >ref|XP_007224346.1| hypothetical protein PRUPE_ppa020677mg, partial [Prunus persica] gi|462421282|gb|EMJ25545.1| hypothetical protein PRUPE_ppa020677mg, partial [Prunus persica] Length = 764 Score = 520 bits (1338), Expect = e-144 Identities = 344/822 (41%), Positives = 404/822 (49%), Gaps = 16/822 (1%) Frame = +1 Query: 187 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESIS-S 363 MADRS TALA+PIWMKQ TFKDV+ N KE ++ S Sbjct: 1 
MADRS-----TALAKPIWMKQAEEARVKSEAEKAAAAKAAFEATFKDVDKNREKEVVAGS 55 Query: 364 DSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQ 543 DS++EE EDL NKP+GPVDP+KCT SSF+VVTKDSDGRK+P+GG Q Sbjct: 56 DSESEEAEDLA-NKPIGPVDPAKCTAAGAGIAGGTACAPSSFMVVTKDSDGRKVPHGGVQ 114 Query: 544 LKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFF 723 +KVK+ PGVGVGGSEQEGMVKD GDGTY VTYVVPKRGNYMV+V+C+GKAIMGSPFPVFF Sbjct: 115 IKVKVIPGVGVGGSEQEGMVKDMGDGTYTVTYVVPKRGNYMVNVDCNGKAIMGSPFPVFF 174 Query: 724 SAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXX 903 SA VNQTMPNMPNYS SVSGAFPGLLGMIP Sbjct: 175 SAGTSTGGLLGLAPASTFPNLVNQTMPNMPNYSASVSGAFPGLLGMIPGIVPGASGGAIL 234 Query: 904 XXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXX 1083 E+CREYL+GRC KTDCK HPPHN S Sbjct: 235 PGIGASLGEVCREYLSGRCAKTDCKLNHPPHNLLMTALAATTSMSNVSQVPMAPSAAAMA 294 Query: 1084 XXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLK 1263 +DSS S DKAGKAD LKKTLQ+SNLSPLLT +QLK Sbjct: 295 AAQAIVAAQALQAHAAQVQAHAQSNKDSSGSPDKAGKADVLKKTLQVSNLSPLLTVEQLK 354 Query: 1264 QLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPA 1443 QLFS+CGTVVEC+ITDSKHFAYIEYSKPEEA AAL LNNMDVGGRP+NVEMAKSLP KPA Sbjct: 355 QLFSFCGTVVECTITDSKHFAYIEYSKPEEASAALQLNNMDVGGRPLNVEMAKSLPQKPA 414 Query: 1444 ILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAA 1623 I+NSSM SSLP NRAATMK+ATE+A+ARAA Sbjct: 415 IMNSSMASSSLPMVMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKTATELAAARAA 474 Query: 1624 EISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1803 EISKKLKADGV EEK T Sbjct: 475 EISKKLKADGVDIEEKETTEKSRSPSPHFAKSKSKSKSRSRSPINYRRRRKSPSYSPPSR 534 Query: 1804 XXXXXXXXXXXXTNYGSERRSHRQ----VRDSNDRSGRWERDRTRDHYXXXXXXXXXXXX 1971 + + S + R+ +++ +R+ R + DR+ DH+ Sbjct: 535 YPRDRRSRSPLRSRHYSSYDNDRRSFRDIKNEGERTRRRDLDRSHDHH----STHYEKAK 590 Query: 1972 XXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKF 2151 R+ SR+ S KH R L + R+ SR+ S K Sbjct: 591 HRERRRSRSVSTDDKHHRRRLSPRSLDEN--------------KTKHRRRSRSKSVEDKH 636 Query: 2152 
H-----RESLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHX 2316 H E +TK R + S +R R RS+S E Sbjct: 637 HPDDKTNEMRDEKTKHRDRRR-----------------RDKKSKHRDRRRSRSISPEG-- 677 Query: 2317 XXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRK 2496 K D R SSPR ++ +RRRSRSKSAE + RS+DR Sbjct: 678 --------------------KHDRRHG-SSPRSLDDNKLKHRRRSRSKSAERKHRSNDRA 716 Query: 2497 ERKKSEKVKLDGRND------EKTDRATKELDLSAKDSRDLK 2604 + + EK K R E R + L + D ++LK Sbjct: 717 YKSRDEKEKGHRRRRSRSASLEPKRRRGRRLSPRSSDEKELK 758 >ref|XP_003531222.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X1 [Glycine max] gi|571470905|ref|XP_006585151.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X2 [Glycine max] gi|571470908|ref|XP_006585152.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X3 [Glycine max] Length = 975 Score = 511 bits (1316), Expect = e-141 Identities = 364/986 (36%), Positives = 462/986 (46%), Gaps = 40/986 (4%) Frame = +1 Query: 226 ARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMN-----------TVKESISSDSD 372 A+PIWMKQ TFK +E +V ES SDS+ Sbjct: 9 AKPIWMKQAEEAKLKSEAEKDAAAKAAFEATFKALENKHDKGGGGGGGGSVAES-DSDSE 67 Query: 373 TEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKV 552 EE EDL +KP+GPVDPSKCT SSFVVV KD+D RK+ GGAQ+KV Sbjct: 68 EEEYEDLA-HKPIGPVDPSKCTAAGTGIAGGTACAPSSFVVVAKDADERKVSGGGAQIKV 126 Query: 553 KICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAX 732 ++ PG+GVGG+EQEGMVKD GDGTY VTYVVPKRGNYMV VEC+G+ IMGSPFPVFFSA Sbjct: 127 RVTPGLGVGGTEQEGMVKDMGDGTYTVTYVVPKRGNYMVSVECNGRPIMGSPFPVFFSAA 186 Query: 733 XXXXXXXXXXXXXXXXXX-VNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXX 909 VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 187 GNSTGGLLGLAPASSFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGVVAGASGGAILPG 246 Query: 910 XXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXX 1089 E+CR+YLNGRC K DCK HPPHN S Sbjct: 247 IGASLGEVCRDYLNGRCAKVDCKLNHPPHNLLMTALAATTSMGTLSQAPMAPSAAAMAAA 306 Query: 1090 XXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQL 
1269 +DS+ S +KA K D+LKKTLQ+SNLSPLLT +QLKQL Sbjct: 307 QAIVAAQALQAHAAQVQAQSA--KDSTGSPEKASKDDALKKTLQVSNLSPLLTVEQLKQL 364 Query: 1270 FSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAIL 1449 F +CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNN+DVGGRP+NVEMAKSLPPKP++ Sbjct: 365 FGFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNIDVGGRPLNVEMAKSLPPKPSVA 424 Query: 1450 NSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEI 1629 NSS+ SSLP NRAATMKSATE+A+ARAAEI Sbjct: 425 NSSLASSSLPLMMQQAVAMQQMQFQQALLMQQSMTAQQAANRAATMKSATELAAARAAEI 484 Query: 1630 SKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1809 SKKL DGVG E+ Sbjct: 485 SKKLNPDGVGT-EEKETKQKSRSPSPPHGRSRSKSRSPINYRRRRRSRSYSPARHSKDHR 543 Query: 1810 XXXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKS 1989 ++Y ERRS R +R+ +DR R + DR+ DH+ Sbjct: 544 SRSPLRSHHYSSYDRERRSFRDIREHSDRYRRRDLDRSLDHH------------------ 585 Query: 1990 SRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESLS 2169 SAS ++R S VSP TRKS S SPK HRE+ Sbjct: 586 ---SSASRRNRSRS----------------------VSPYTRKS----SVSPKRHRETSP 616 Query: 2170 PRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXXXXXXXXXXXX 2349 R +K SRA+ + + + RRRSRS+S++D H Sbjct: 617 HRGRKQSRADSGSPSRRRGSRSSPKIDEKKLRN-RRRSRSRSSDDRLHSIKNEEISHGKS 675 Query: 2350 XXXXXXN------DEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKERKKS 2511 DEK +R S+SSPR+ ES S +++R RSKS +D S +R + ++ Sbjct: 676 KHRERRRSRSLSVDEK-PHRRSRSSPRKVDESRSRHKKRLRSKSVDDRHGSPERLDENRT 734 Query: 2512 EKVKLDGRNDEKTDRATKELDLSAKDSRDLKEYGTSDPRRK--DTLLEDGSSSDEKYGSN 2685 + R+ +K R ++ +D D++E + + K DT S + K+ Sbjct: 735 RR----SRHSDK--RHSRSRSTETRDQTDVREDERKNQKSKHRDTKRSRSKSVEGKHRFK 788 Query: 2686 HKRSRLDDKDSEKHDS------VFKDKNDLMDVSEGVKSPSVSARYNDSAPVDDRTHSRT 2847 K DK S++ D +DK+D D S + + + + + HS Sbjct: 789 DKSGENRDKKSKRRDRKRSRSISLEDKHDKGDTSPHINFD--ERNFEPTKSPEGKNHSSD 846 Query: 2848 KDSSRYEKS-----TSDRRRHEKIDTT------RRERDTSGMDRAHIGCEDLSRH---GR 2985 K SR EKS T + + E+ D + +E D+ G + G ++ H G Sbjct: 847 
KYGSRGEKSEHQKKTPSKSKSEQFDGSGPLRGNYKEYDSKGKSPSDSGSAEVKHHLSDGE 906 Query: 2986 LTSENKKHEKVESIHREKDHLDDDST 3063 + + + + +E DST Sbjct: 907 NATSEENSKLFGDVFQEPIRTAKDST 932 >ref|XP_004134373.1| PREDICTED: uncharacterized protein LOC101203535 [Cucumis sativus] Length = 936 Score = 511 bits (1315), Expect = e-141 Identities = 334/895 (37%), Positives = 429/895 (47%), Gaps = 29/895 (3%) Frame = +1 Query: 187 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESISSD 366 MADR++ +A+PIWMKQ TFK V+ KE+ SSD Sbjct: 1 MADRNL-----VVAKPIWMKQAEEAKLKSEAEKDAAAKAAFEATFKGVDKIPAKEAASSD 55 Query: 367 SDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQL 546 SD E+NEDL + KP+GPVDP++CT +SF VVTKD DGRK+P+GGA + Sbjct: 56 SDFEDNEDL-ERKPIGPVDPARCTAAGAGIAGGAACVPASFTVVTKDVDGRKVPHGGALI 114 Query: 547 KVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFS 726 KVK+ PGVGVGG+EQ+G+VKD DGTY +TYVVPKRGNYMV++EC+G+ IMGSPFPVFFS Sbjct: 115 KVKVAPGVGVGGTEQDGIVKDMNDGTYTITYVVPKRGNYMVNIECNGRPIMGSPFPVFFS 174 Query: 727 AXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXX 906 A VNQ MPNMPNYSGSVSGAFPGL+GMIP Sbjct: 175 AGTSSGGLLGLAPASSFPNLVNQNMPNMPNYSGSVSGAFPGLMGMIPGIVAGASGGAILP 234 Query: 907 XXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXS-XXXXXXXXXXXX 1083 E+CREYLNG+C KTDCK HPPHN S Sbjct: 235 GIGASLGEVCREYLNGQCAKTDCKLNHPPHNLLMTAIAATTSMGTISQVPMAPSAAAMAA 294 Query: 1084 XXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGK-ADSLKKTLQISNLSPLLTTDQL 1260 +DSS S DK+GK AD+LK+TLQ+SNLSPLLT +QL Sbjct: 295 AQAIVAAQALQAHAAQVQAQQAQSAKDSSGSSDKSGKAADALKRTLQVSNLSPLLTVEQL 354 Query: 1261 KQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKP 1440 KQLFS+CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP KP Sbjct: 355 KQLFSFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLPQKP 414 Query: 1441 AILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARA 1620 A N S+ SSLP NRAATMKSATE+A+ARA Sbjct: 415 AAANPSLASSSLPMMMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKSATELAAARA 474 Query: 1621 
AEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1800 AEISKKLK DG+GNEE T Sbjct: 475 AEISKKLKVDGIGNEETETKEKSRSPSLPRERSKSKSKSPIKYRSRRRSPTYSPPYRHSR 534 Query: 1801 XXXXXXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRD-HYXXXXXXXXXXXXXX 1977 + Y +RR +R+ R++++RS R + DR+R Sbjct: 535 DHRSRSPVRSRHYSRYEDDRRGYRESREASERSRRRDLDRSRSRRSPISRKNRSRSISPR 594 Query: 1978 TRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXG--NVSPRT--------RKSSR 2127 RKS RAGS SP H+RE G SPR R+ SR Sbjct: 595 RRKSYRAGSDSPSHQRERSPQRGRKSDHSDLRSPIRHHGKSRSSPRKDDSDKLKHRRRSR 654 Query: 2128 AGSGSPKFHRE---------SLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRR 2280 + S K H + L R ++ SR + +N+S +RRR Sbjct: 655 SKSVETKHHSDEKINEMQHGKLKNRERRRSR-SASLEDKHSKRRPSPRSLDKNISKHRRR 713 Query: 2281 SRSKSAEDEAHXXXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSK 2460 SRS S E D K D+ + R+ RRRSRSK Sbjct: 714 SRSNSREKVDDKYHGRRRSRSSSSDSKHLPDSKVDSTRYEKLKNRS-------RRRSRSK 766 Query: 2461 SAEDEPRSDDRKERKKSEKVKLDGRNDEKT-------DRATKELDLSAKDSRDLKEYGTS 2619 S + + R ++ +R + ++++ R ++ R T+ S+ +++ + + Sbjct: 767 SVDGKHRRREKSDRSRDKRLRHRDRRSSRSISPEAGHQRVTRLSPTSSDETKSKRRRRSL 826 Query: 2620 DPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDVSEGVKS 2784 P K + +++G ++ ++SR + E +S + + G +S Sbjct: 827 SPEDKPSDIDNGCIAENPKNLGRQQSRSNSISGENGESNLSPSTEENEFKHGEQS 881 >ref|XP_004157720.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101203535 [Cucumis sativus] Length = 936 Score = 506 bits (1304), Expect = e-140 Identities = 332/895 (37%), Positives = 427/895 (47%), Gaps = 29/895 (3%) Frame = +1 Query: 187 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESISSD 366 MADR++ +A+PIWMKQ TFK V+ KE+ SSD Sbjct: 1 MADRNL-----VVAKPIWMKQAEEAKLKSEAEKDAAAKAAFEATFKGVDKIPAKEAASSD 55 Query: 367 SDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQL 546 SD E+NEDL + KP+GPVDP++CT +SF VVTKD DGRK+P+GGA + Sbjct: 56 SDFEDNEDL-ERKPIGPVDPARCTAAGAGIAGGAACVPASFTVVTKDVDGRKVPHGGALI 114 Query: 547 KVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFS 726 KVK+ PGVGVGG+EQ+G+VKD DGTY 
+TYVVPKRGNYMV++EC+G+ IMGSPFPVFFS Sbjct: 115 KVKVAPGVGVGGTEQDGIVKDMNDGTYTITYVVPKRGNYMVNIECNGRPIMGSPFPVFFS 174 Query: 727 AXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXX 906 A VNQ MPNMPNYSGSVSGAFPGL+GMIP Sbjct: 175 AGTSSGGLLGLAPASSFPNLVNQNMPNMPNYSGSVSGAFPGLMGMIPGIVAGASGGAILP 234 Query: 907 XXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXS-XXXXXXXXXXXX 1083 E+CREYLNG+C KTDCK HPPHN S Sbjct: 235 GIGASLGEVCREYLNGQCAKTDCKLNHPPHNLLMTAIAATTSMGTISQVPMAPSAAAMAA 294 Query: 1084 XXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGK-ADSLKKTLQISNLSPLLTTDQL 1260 +DSS S DK+GK AD+LK+TLQ+SNLSPLLT +QL Sbjct: 295 AQAIVAAQALQAHAAQVQAQQAQSAKDSSGSSDKSGKAADALKRTLQVSNLSPLLTVEQL 354 Query: 1261 KQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKP 1440 KQLF +CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP KP Sbjct: 355 KQLFXFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLPQKP 414 Query: 1441 AILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARA 1620 A N S+ SSLP NRAATMKSATE+A+ARA Sbjct: 415 AAANPSLASSSLPMMMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKSATELAAARA 474 Query: 1621 AEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1800 AEIS KLK DG+GNEE T Sbjct: 475 AEISXKLKVDGIGNEETETKEKSRSPSLPRERSKSKSKSPIKYRSRRRSPTYSPPYRHSR 534 Query: 1801 XXXXXXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRD-HYXXXXXXXXXXXXXX 1977 + Y +RR +R+ R++++RS R + DR+R Sbjct: 535 DHRSRSPVRSRHYSRYEDDRRGYRESREASERSRRRDLDRSRSRRSPISRKNRSRSISPR 594 Query: 1978 TRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXG--NVSPRT--------RKSSR 2127 RKS RAGS SP H+RE G SPR R+ SR Sbjct: 595 RRKSYRAGSDSPSHQRERSPQRGRKSDHSDLRSPIRHHGKSRSSPRKDDSDKLKHRRRSR 654 Query: 2128 AGSGSPKFHRE---------SLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRR 2280 + S K H + L R ++ SR + +N+S +RRR Sbjct: 655 SKSVETKHHSDEKINEMQHGKLKNRERRRSR-SASLEDKHSKRRPSPRSLDKNISKHRRR 713 Query: 2281 SRSKSAEDEAHXXXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSK 2460 SRS S E D K D+ + R+ RRRSRSK Sbjct: 714 SRSNSREKVDDKYHGRRRSRSSSSDSKHLPDSKVDSTRYEKLKNRS-------RRRSRSK 766 Query: 2461 
SAEDEPRSDDRKERKKSEKVKLDGRNDEKT-------DRATKELDLSAKDSRDLKEYGTS 2619 S + + R ++ +R + ++++ R ++ R T+ S+ +++ + + Sbjct: 767 SVDGKHRRREKSDRSRDKRLRHRDRRSSRSISPEAGHQRVTRLSPTSSDETKSKRRRRSL 826 Query: 2620 DPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDVSEGVKS 2784 P K + +++G ++ ++SR + E +S + + G +S Sbjct: 827 SPEDKPSDIDNGCIAENPKNLGRQQSRSNSISGENGESNLSPSTEENEFKHGEQS 881 >ref|XP_006373079.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|550319785|gb|ERP50876.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 950 Score = 503 bits (1295), Expect = e-139 Identities = 351/946 (37%), Positives = 439/946 (46%), Gaps = 31/946 (3%) Frame = +1 Query: 187 MADRSVPAVSTALA--------RPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNT 342 M DR+ ++ A +PIWMKQ TFK V + Sbjct: 1 MTDRNNTTITAAATSTTNHSATKPIWMKQAEEAKLKSEAEKTAAAKAAFDATFK-VLSDK 59 Query: 343 VKESISSDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRK 522 ++ SDS+ E+ E+ + NKP+GPVDP+KCT ++F+VVTKD+DGRK Sbjct: 60 AEKPADSDSEEEDAEEDLANKPVGPVDPNKCTAAGGGIAGGTACAPATFMVVTKDADGRK 119 Query: 523 IPNGGAQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMG 702 +PNGGA +KV++ PGVGVGG+EQEG VKD GDGTY VTYVVPKRGNYMV +EC+GKAIMG Sbjct: 120 VPNGGAVIKVRVSPGVGVGGTEQEGNVKDMGDGTYTVTYVVPKRGNYMVTIECNGKAIMG 179 Query: 703 SPFPVFFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXX 882 SPFPVFFSA VNQTMPNMPNYS ++SGAFP LLGM P Sbjct: 180 SPFPVFFSAGTSTGGLLGMAPTTTFPNLVNQTMPNMPNYSANISGAFPALLGMTPGITSS 239 Query: 883 XXXXXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXX 1062 E+CREYL GRC KTDCK HPP + S Sbjct: 240 ASGGAILPGAGASLGEVCREYLYGRCAKTDCKLSHPPQSLLMTLLAPTTSMGTLSQVPMA 299 Query: 1063 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPL 1242 +DSS S DKA K D+LKKTL +SNLSPL Sbjct: 300 PSAAAMAAAQAIVAAKALQAHAAQLQAQARSAKDSSGSPDKARKEDALKKTLHVSNLSPL 359 Query: 1243 LTTDQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAK 1422 LT +QLKQLFS+CGTVVEC+I DSKH AYIEYSKPEEA AALALNNMDVGGRP+NVE AK Sbjct: 360 
LTVEQLKQLFSFCGTVVECTIADSKHSAYIEYSKPEEATAALALNNMDVGGRPLNVETAK 419 Query: 1423 SLPPKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATE 1602 SLP KP ILNSS SSLP N+AATMKSATE Sbjct: 420 SLPQKP-ILNSSFASSSLPMMMQQAVAMQQMQFQQALLMQQTMTAQQAANKAATMKSATE 478 Query: 1603 MASARAAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1782 +A+ARAAEISKKLK DG+ E T Sbjct: 479 LAAARAAEISKKLKDDGLVTGEGETKAESKSPPPPRARSRSKSRSPINYRRRMRSPSYSP 538 Query: 1783 XXXXXXXXXXXXXXXXXXXTNYGSERRSH--RQVRDSNDRSGRWERDRTRDHY-XXXXXX 1953 + Y ERRS+ R RD DR+ R E DR+RDH+ Sbjct: 539 PSRHNRDRRSRSPVRFRYHSRYNYERRSYRDRDSRDDGDRTRRRELDRSRDHHSPVSRRN 598 Query: 1954 XXXXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAG 2133 TRKS RA S SPKHR+ES + R+RK+S +G Sbjct: 599 RSRSASPRTRKSYRADSGSPKHRQES----------------------SAHRSRKASDSG 636 Query: 2134 SGSPKFHRES-LSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKS-AEDE 2307 S SP+ H S SPR S+ YRRRSRS+S + +E Sbjct: 637 SRSPRHHGGSRSSPRNNPDSKL-----------------------RYRRRSRSRSKSVEE 673 Query: 2308 AHXXXXXXXXXXXXXXXXXXNDEKTD--NRGSKSSPRRAHESVSHYRRRSRSKSAEDEPR 2481 A+ + + G + SPR ++E S +R RSRSKS E + Sbjct: 674 ANEKVDEIREKKSKQHERRSRSLSVELKHHGRRPSPRSSNEDDSKHRSRSRSKSVEVKRH 733 Query: 2482 SDD--------------RKERKKS--EKVKLDGRNDEKTDRATKELDLSAKDSRDLKEYG 2613 S++ R+ R KS ++ R +E D+ TK D SR + G Sbjct: 734 SNEKVDKTGDGKLKHRHRRSRSKSVDDRHHYKERGNETRDKKTKHQDRGR--SRSITAEG 791 Query: 2614 TSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDVSEGVKSPSV 2793 R DGS S + H RS + V ++K++ +SPS Sbjct: 792 KHHRSRSSPRGRDGSKSKHR---RHSRSISPEGKRRSSHRVDQNKDEKSKHRHRRRSPSA 848 Query: 2794 SARYNDSAPVDDRTHSRTKDSSRYEKSTSDRRRHEKIDTTRRERDT 2931 ++ S + S+ + R + + R +++ D R E +T Sbjct: 849 EGKHGRSPRSSEENKSKHRRRPRSKSAERKRHSNDEKDIRRGENET 894 >ref|XP_003524186.1| PREDICTED: splicing regulatory glutamine/lysine-rich protein 1-like isoform X1 [Glycine max] gi|571455668|ref|XP_006580150.1| PREDICTED: splicing regulatory glutamine/lysine-rich protein 1-like isoform X2 [Glycine max] Length = 969 Score = 503 bits (1294), Expect = e-139 
Identities = 351/952 (36%), Positives = 448/952 (47%), Gaps = 34/952 (3%) Frame = +1 Query: 226 ARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESIS---SDSDTEENEDLV 396 A+PIWMKQ TFK +E K S SDSD+EE + + Sbjct: 9 AKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFKALENKHDKGGGSVADSDSDSEEEYEDL 68 Query: 397 KNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKICPGVGV 576 +KP+GPV+P+KCT SSFVVVTKD+D RK+ GGAQ+KV++ PG+GV Sbjct: 69 AHKPIGPVEPAKCTAAGTGIAGGTACAPSSFVVVTKDADERKVSGGGAQIKVRVTPGLGV 128 Query: 577 GGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAXXXXXXXXX 756 GG+EQEGMVKD GDGTY VTYVVPKRGNYMV VEC+G+ IMGSPFPVFFSA Sbjct: 129 GGTEQEGMVKDMGDGTYTVTYVVPKRGNYMVSVECNGRPIMGSPFPVFFSAAGNSTGGLL 188 Query: 757 XXXXXXXXXX-VNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXXXXXEI 933 VNQTMPNMPNYSGSVSGAFPGLLGMIP E+ Sbjct: 189 GLAPASSFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGVVAGASGGAILPGIGASLGEV 248 Query: 934 CREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXXXXXXX 1113 CR+YLNGRC K DCK HPPHN S Sbjct: 249 CRDYLNGRCAKVDCKLNHPPHNLLMTALAATTSMGTLSQAPMAPSAAAMAAAQAIVAAQA 308 Query: 1114 XXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSYCGTVV 1293 +DS+ S +KA K D+LKKTLQ+SNLSPLLT +QLKQLF +CGTVV Sbjct: 309 LQAHAAQVQAQSA--KDSAGSPEKASKDDALKKTLQVSNLSPLLTVEQLKQLFGFCGTVV 366 Query: 1294 ECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSSMNQSS 1473 EC+ITDSKHFAYIEYSKPEEA AALALNN+DVGGRP+NVEMAKSLP KP++ NSS+ SS Sbjct: 367 ECAITDSKHFAYIEYSKPEEATAALALNNIDVGGRPLNVEMAKSLPQKPSVANSSLASSS 426 Query: 1474 LPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKKLKADG 1653 LP RAATMKSATE+A+ARAAEISKKL DG Sbjct: 427 LPLMMQQAVAMQQMQFQQALLMQQSMTAQQAATRAATMKSATELAAARAAEISKKLNPDG 486 Query: 1654 VGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1833 VG+ E+ Sbjct: 487 VGS-EEKETKQNSRSSSPPRGRSRSKSRSPISYRRRRRSRSYSPARHSKDHRSRSPLRPH 545 Query: 1834 XXTNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASP 2013 ++Y ERRS+R +R+ +DR R + DR+ DH SAS Sbjct: 546 HYSSYDRERRSYRDIREHSDRYRRRDSDRSLDH---------------------RSSASR 584 Query: 2014 
KHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESLSPRTKKSSR 2193 ++R S VSP TRKS SPK HRE+ R +K SR Sbjct: 585 RNRSRS----------------------VSPYTRKS----PVSPKCHRETSPHRGRKQSR 618 Query: 2194 ANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXXXXXXXXXXXXXXXXXXN- 2370 + + + + RRRSRS+S++D H Sbjct: 619 VDSGSPSHRRGSRPSPKIDEKKLRN-RRRSRSRSSDDRLHSSKNEEVLHGKSKRRERRRS 677 Query: 2371 -----DEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKERKKSEKVKLDGR 2535 DEK +R S+SSPR+ ES S +++RS SKS +D S +R + ++ ++ R Sbjct: 678 KSLSVDEK-PHRRSRSSPRKVDESRSRHKKRSSSKSVDDRHDSPERLDENRNRRL----R 732 Query: 2536 NDEKTDRATKELDLSAKDSRDLKEYGTSDPRRK--DTLLEDGSSSDEKYGSNHKRSRLDD 2709 + +K ++ D +D D++E + + + K DT S + K S K D Sbjct: 733 HSDKRHSRSRSTD--NRDQTDVREDESKNEKSKHRDTKRSRSKSVEGKRRSKDKSGENRD 790 Query: 2710 KDSEKHDS------VFKDKNDLMDVS----------EGVKSPSVSARYNDSAPVDDRTHS 2841 K S+ HD +DK+D S E KSP Y+D + Sbjct: 791 KKSKHHDRRRSRSISLEDKHDKGGTSLHINLDERNFELTKSPEGKNHYSDK-------YG 843 Query: 2842 RTKDSSRYEKSTSDRRRHEKIDTT------RRERDTSGMDRAHIGCEDLSRH 2979 + S ++K T + + + D + +E D+ G + G ++ H Sbjct: 844 NRGEKSEHQKKTPSKSKSGQFDGSGPLRGNYKEDDSKGKSPSDSGSAEVKHH 895 >ref|XP_006373080.1| hypothetical protein POPTR_0017s08510g [Populus trichocarpa] gi|550319786|gb|ERP50877.1| hypothetical protein POPTR_0017s08510g [Populus trichocarpa] Length = 924 Score = 496 bits (1277), Expect = e-137 Identities = 341/895 (38%), Positives = 425/895 (47%), Gaps = 23/895 (2%) Frame = +1 Query: 316 TFKDVEMNTVKESISSDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVV 495 TFK V + ++ SDS+ E+ E+ + NKP+GPVDP+KCT ++F+V Sbjct: 26 TFK-VLSDKAEKPADSDSEEEDAEEDLANKPVGPVDPNKCTAAGGGIAGGTACAPATFMV 84 Query: 496 VTKDSDGRKIPNGGAQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHV 675 VTKD+DGRK+PNGGA +KV++ PGVGVGG+EQEG VKD GDGTY VTYVVPKRGNYMV + Sbjct: 85 VTKDADGRKVPNGGAVIKVRVSPGVGVGGTEQEGNVKDMGDGTYTVTYVVPKRGNYMVTI 144 Query: 676 ECDGKAIMGSPFPVFFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLL 855 EC+GKAIMGSPFPVFFSA VNQTMPNMPNYS ++SGAFP LL Sbjct: 145 
ECNGKAIMGSPFPVFFSAGTSTGGLLGMAPTTTFPNLVNQTMPNMPNYSANISGAFPALL 204 Query: 856 GMIPXXXXXXXXXXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXX 1035 GM P E+CREYL GRC KTDCK HPP + Sbjct: 205 GMTPGITSSASGGAILPGAGASLGEVCREYLYGRCAKTDCKLSHPPQSLLMTLLAPTTSM 264 Query: 1036 XXXSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKT 1215 S +DSS S DKA K D+LKKT Sbjct: 265 GTLSQVPMAPSAAAMAAAQAIVAAKALQAHAAQLQAQARSAKDSSGSPDKARKEDALKKT 324 Query: 1216 LQISNLSPLLTTDQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGG 1395 L +SNLSPLLT +QLKQLFS+CGTVVEC+I DSKH AYIEYSKPEEA AALALNNMDVGG Sbjct: 325 LHVSNLSPLLTVEQLKQLFSFCGTVVECTIADSKHSAYIEYSKPEEATAALALNNMDVGG 384 Query: 1396 RPMNVEMAKSLPPKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNR 1575 RP+NVE AKSLP KP ILNSS SSLP N+ Sbjct: 385 RPLNVETAKSLPQKP-ILNSSFASSSLPMMMQQAVAMQQMQFQQALLMQQTMTAQQAANK 443 Query: 1576 AATMKSATEMASARAAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXX 1755 AATMKSATE+A+ARAAEISKKLK DG+ E T Sbjct: 444 AATMKSATELAAARAAEISKKLKDDGLVTGEGETKAESKSPPPPRARSRSKSRSPINYRR 503 Query: 1756 XXXXXXXXXXXXXXXXXXXXXXXXXXXXTNYGSERRSH--RQVRDSNDRSGRWERDRTRD 1929 + Y ERRS+ R RD DR+ R E DR+RD Sbjct: 504 RMRSPSYSPPSRHNRDRRSRSPVRFRYHSRYNYERRSYRDRDSRDDGDRTRRRELDRSRD 563 Query: 1930 HY-XXXXXXXXXXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSP 2106 H+ TRKS RA S SPKHR+ES + Sbjct: 564 HHSPVSRRNRSRSASPRTRKSYRADSGSPKHRQES----------------------SAH 601 Query: 2107 RTRKSSRAGSGSPKFHRES-LSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRS 2283 R+RK+S +GS SP+ H S SPR S+ YRRRS Sbjct: 602 RSRKASDSGSRSPRHHGGSRSSPRNNPDSKL-----------------------RYRRRS 638 Query: 2284 RSKS-AEDEAHXXXXXXXXXXXXXXXXXXNDEKTD--NRGSKSSPRRAHESVSHYRRRSR 2454 RS+S + +EA+ + + G + SPR ++E S +R RSR Sbjct: 639 RSRSKSVEEANEKVDEIREKKSKQHERRSRSLSVELKHHGRRPSPRSSNEDDSKHRSRSR 698 Query: 2455 SKSAEDEPRSDD--------------RKERKKS--EKVKLDGRNDEKTDRATKELDLSAK 2586 SKS E + S++ R+ R KS ++ R +E D+ TK D Sbjct: 699 SKSVEVKRHSNEKVDKTGDGKLKHRHRRSRSKSVDDRHHYKERGNETRDKKTKHQDRGR- 757 Query: 2587 
DSRDLKEYGTSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDV 2766 SR + G R DGS S + H RS + V ++K++ Sbjct: 758 -SRSITAEGKHHRSRSSPRGRDGSKSKHR---RHSRSISPEGKRRSSHRVDQNKDEKSKH 813 Query: 2767 SEGVKSPSVSARYNDSAPVDDRTHSRTKDSSRYEKSTSDRRRHEKIDTTRRERDT 2931 +SPS ++ S + S+ + R + + R +++ D R E +T Sbjct: 814 RHRRRSPSAEGKHGRSPRSSEENKSKHRRRPRSKSAERKRHSNDEKDIRRGENET 868 >ref|XP_002300152.2| RNA recognition motif-containing family protein [Populus trichocarpa] gi|550348720|gb|EEE84957.2| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 918 Score = 487 bits (1254), Expect = e-134 Identities = 359/970 (37%), Positives = 451/970 (46%), Gaps = 18/970 (1%) Frame = +1 Query: 199 SVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESISSDSDTE 378 +V +T+ A+PIWMKQ TFK V + ++++ SDS+ E Sbjct: 13 AVTTTNTSAAKPIWMKQAEEAKLKSEAENTAAAKAAFDATFK-VLSDKAEKAVDSDSEEE 71 Query: 379 ENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKI 558 + E + NKP+GPVDP KCT ++FVVVTKD+DGRK+PNGGA ++V++ Sbjct: 72 DAEKDLANKPVGPVDPGKCTAAGAGIAGGTACAPATFVVVTKDADGRKVPNGGAVIRVRV 131 Query: 559 CPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAXXX 738 PGVGVGG+EQEG VKD GDGTY VTYVVPKRGNYMV +EC+GKAIMGSPFPVFFSA Sbjct: 132 SPGVGVGGTEQEGAVKDMGDGTYTVTYVVPKRGNYMVTIECNGKAIMGSPFPVFFSAGTS 191 Query: 739 XXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXX 918 VNQTMPNMPNYS SVSGAFP LGM P Sbjct: 192 TGGLLGMAPTTTFPNLVNQTMPNMPNYSASVSGAFPAFLGMTPGIASGASGGAILPGVGA 251 Query: 919 XXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXX 1098 E+CREYL GRC K DCK HPPH+ S Sbjct: 252 SLGEVCREYLYGRCAKMDCKLGHPPHSLLMTLLAPTTTMGTLSHAPMAPSAAAMAAAQAI 311 Query: 1099 XXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSY 1278 +DSS S DKA K D+LKKTL +SNLSPLLT +QLKQLFS+ Sbjct: 312 VAAKALQAHAAQVQAQAQSAKDSSGSPDKARKEDALKKTLHVSNLSPLLTVEQLKQLFSF 371 Query: 1279 CGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSS 1458 CGTVVEC+I DSKH AYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP KP +LNSS Sbjct: 372 
CGTVVECAIADSKHSAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLPQKP-LLNSS 430 Query: 1459 MNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKK 1638 + SSLP N+AA+MKSATE+A+ARAAEISKK Sbjct: 431 LASSSLPMMMQQAVAMQQMQFQQALIMQQTMTAQQAANKAASMKSATELAAARAAEISKK 490 Query: 1639 LKADG--VGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1812 LKADG +G EE Sbjct: 491 LKADGFVIGEEETKAETKSPSPPQARSRSKSRSPINYQRRLRSPSYSPPSRRNRDRRSRS 550 Query: 1813 XXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRD-HYXXXXXXXXXXXXXXTRKS 1989 NYG RRS+R RD DR + DR+R H TRKS Sbjct: 551 PFRFRYHSRYNYG--RRSYRDSRDIVDRMRMQDSDRSRGRHSPVSRRSRSRSASPRTRKS 608 Query: 1990 SRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESLS 2169 R S SPK R ES + R+RK++ +GS SP+ H Sbjct: 609 YRDDSGSPKRRLES----------------------SAQRSRKAADSGSRSPRSH----- 641 Query: 2170 PRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRR--RSRSKSAEDEAHXXXXXXXXXX 2343 ++ SR N ++ Y+R RSRSKS E+ Sbjct: 642 -GGRRLSRRNIT----------------DSKLRYKRHSRSRSKSVEESNDRVNEIQDKKS 684 Query: 2344 XXXXXXXXN-DEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKERKKSEKV 2520 + + + G + S R + E S++R RSRSKS E + S ++ + + Sbjct: 685 KQHERRSRSLSVELKHHGRRPSHRSSDEDESNHRSRSRSKSVEVKRHSYEKVGKTE---- 740 Query: 2521 KLDGRNDEKTDRATKELDLSAKDSRDLKEYGTSDPRRKDTLLED--GSSSDEKYGSNHKR 2694 DGR + R+ + S D KE G ++ R K T D S S ++H+R Sbjct: 741 --DGRLKHRDRRSRSK---SVDDRHCYKERG-NESRDKKTKHRDRVQSRSISAESNHHRR 794 Query: 2695 SRLDDKDSEKHDSVFKDKNDLMDVS-EG---------VKSPSVSARYNDSAPVDDRTHSR 2844 SR K ++ S K + +S EG KS S R + SA + H R Sbjct: 795 SRSSPKGRDESKS--KHRRHSRPISPEGKRRSNHRIDEKSKHCSRRRSVSA---EGKHIR 849 Query: 2845 TKDSSRYEKSTSDRRRHEKIDTTRRERDTSGMDRAHIGCEDLSRHGRLTSENKKHEKVES 3024 + SS E++ S RRRH R S + H E++ R T +K E Sbjct: 850 SPRSS--EENKSKRRRH--------SRSKSAEHKRHSNDEEIKREENETRHEHTSDKTED 899 Query: 3025 IHREKDHLDD 3054 + +++ D Sbjct: 900 ANEDENSFTD 909 >ref|XP_007034241.1| RNA recognition motif-containing protein isoform 2 [Theobroma cacao] gi|508713270|gb|EOY05167.1| RNA recognition motif-containing protein isoform 2 [Theobroma cacao] Length = 
890 Score = 481 bits (1239), Expect = e-133 Identities = 270/499 (54%), Positives = 308/499 (61%), Gaps = 2/499 (0%) Frame = +1 Query: 187 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKE--SIS 360 MADR+ STA ++PIWMKQ TFKDV+ N K+ + S Sbjct: 1 MADRN----STA-SKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFKDVDKNRNKDVAAAS 55 Query: 361 SDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGA 540 SDS++E+ DLV NKP+GPVDP+KC S+F+VVTKD+DGRK+ +GGA Sbjct: 56 SDSESEDTSDLV-NKPIGPVDPAKCMAAGPGIAGGTACAASTFMVVTKDADGRKVQSGGA 114 Query: 541 QLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVF 720 Q+KVK+ PGVGVGGSEQEG+VKD GDGTY VTYVVPKRGNYMV++EC+GK IMGSPFPVF Sbjct: 115 QIKVKVSPGVGVGGSEQEGIVKDMGDGTYTVTYVVPKRGNYMVNIECNGKPIMGSPFPVF 174 Query: 721 FSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXX 900 FSA VNQTMPNMPNY+GSVSGAFPGLLGMIP Sbjct: 175 FSAGTSTGGLLGVAPASTYPNLVNQTMPNMPNYTGSVSGAFPGLLGMIPGIVSGASGGAI 234 Query: 901 XXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXX 1080 E+CREYLNGRC KTDCK HPPHN S Sbjct: 235 LPGMGASLGEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTSMGTLSQVPMAPSAAAM 294 Query: 1081 XXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQL 1260 +DSSDS DKAGKAD+LKKTLQ+SNLSPLLT +QL Sbjct: 295 AAAQAIVAAQALQAHAAQVQAQAQSTKDSSDSPDKAGKADALKKTLQVSNLSPLLTAEQL 354 Query: 1261 KQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKP 1440 KQLFS+CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNNMD+GGRP+NVEMAKSLP KP Sbjct: 355 KQLFSFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNMDIGGRPLNVEMAKSLPQKP 414 Query: 1441 AILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARA 1620 A+ SS+ SSLP NRAA+MKSATE+A+ARA Sbjct: 415 AV--SSLASSSLPMMMQQAVAMQQMQFQQALLMQQTLTAQQAANRAASMKSATELAAARA 472 Query: 1621 AEISKKLKADGVGNEEKGT 1677 AEISKKLKADG+ EEK T Sbjct: 473 AEISKKLKADGLVTEEKET 491 Score = 86.3 bits (212), Expect = 1e-13 Identities = 102/371 (27%), Positives = 142/371 (38%), Gaps = 6/371 (1%) Frame = +1 Query: 1840 TNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASPKH 2019 + Y SERRS+R RD DRS R + DR+RD R S 
S ++ Sbjct: 574 SRYDSERRSYRD-RDDIDRSKRRDLDRSRD---------------------RRSSVSRRN 611 Query: 2020 RRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESLSPRTKKSSRAN 2199 R S +SP+TRKS S SPK RES SPR +KSS + Sbjct: 612 RSRS----------------------ISPQTRKSPPVDSDSPKNSRES-SPRVRKSSHPD 648 Query: 2200 XXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAE-DEAHXXXXXXXXXXXXXXXXXXNDE 2376 E YR+RSRSKS + D+ Sbjct: 649 SRSPRHHRRSRSSPKNDDERKLKYRKRSRSKSVDSDKKRDQIQGEKSKHRSRRRSRSLSL 708 Query: 2377 KTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDR-KERKKSEKVKLDGRNDEKTD 2553 + ++RG S + E+ +RRRSRS S E + RS+ + E KK E D R Sbjct: 709 EGEHRGRSRSSASSDENKLKHRRRSRSVSVERKVRSNSKIDEMKKDESRHSDRRRSRSGS 768 Query: 2554 RATKELDLSAKDSRDLKEYGTSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSE---- 2721 + D K+ D RR S S G +H+ SRL ++S+ Sbjct: 769 AEGRHYTKERSDRSRDKKSKHRDRRR--------SRSRSAEGKHHRESRLFPRNSDGNKI 820 Query: 2722 KHDSVFKDKNDLMDVSEGVKSPSVSARYNDSAPVDDRTHSRTKDSSRYEKSTSDRRRHEK 2901 KH + + K+ +EG S + ++ + DR H + + S S R E Sbjct: 821 KHRRLSRSKS-----TEG--KHRSSDKIDERSKRHDRKHLSSAECRHPRGSRSSPRSSED 873 Query: 2902 IDTTRRERDTS 2934 D+ RR R S Sbjct: 874 NDSRRRRRSRS 884 >ref|XP_007034240.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|590656339|ref|XP_007034242.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|590656342|ref|XP_007034243.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|590656345|ref|XP_007034244.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|590656349|ref|XP_007034245.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|590656352|ref|XP_007034246.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713269|gb|EOY05166.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713271|gb|EOY05168.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713272|gb|EOY05169.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] 
gi|508713273|gb|EOY05170.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713274|gb|EOY05171.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713275|gb|EOY05172.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] Length = 965 Score = 481 bits (1239), Expect = e-133 Identities = 270/499 (54%), Positives = 308/499 (61%), Gaps = 2/499 (0%) Frame = +1 Query: 187 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKE--SIS 360 MADR+ STA ++PIWMKQ TFKDV+ N K+ + S Sbjct: 1 MADRN----STA-SKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFKDVDKNRNKDVAAAS 55 Query: 361 SDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGA 540 SDS++E+ DLV NKP+GPVDP+KC S+F+VVTKD+DGRK+ +GGA Sbjct: 56 SDSESEDTSDLV-NKPIGPVDPAKCMAAGPGIAGGTACAASTFMVVTKDADGRKVQSGGA 114 Query: 541 QLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVF 720 Q+KVK+ PGVGVGGSEQEG+VKD GDGTY VTYVVPKRGNYMV++EC+GK IMGSPFPVF Sbjct: 115 QIKVKVSPGVGVGGSEQEGIVKDMGDGTYTVTYVVPKRGNYMVNIECNGKPIMGSPFPVF 174 Query: 721 FSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXX 900 FSA VNQTMPNMPNY+GSVSGAFPGLLGMIP Sbjct: 175 FSAGTSTGGLLGVAPASTYPNLVNQTMPNMPNYTGSVSGAFPGLLGMIPGIVSGASGGAI 234 Query: 901 XXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXX 1080 E+CREYLNGRC KTDCK HPPHN S Sbjct: 235 LPGMGASLGEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTSMGTLSQVPMAPSAAAM 294 Query: 1081 XXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQL 1260 +DSSDS DKAGKAD+LKKTLQ+SNLSPLLT +QL Sbjct: 295 AAAQAIVAAQALQAHAAQVQAQAQSTKDSSDSPDKAGKADALKKTLQVSNLSPLLTAEQL 354 Query: 1261 KQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKP 1440 KQLFS+CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNNMD+GGRP+NVEMAKSLP KP Sbjct: 355 KQLFSFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNMDIGGRPLNVEMAKSLPQKP 414 Query: 1441 AILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARA 1620 A+ SS+ SSLP NRAA+MKSATE+A+ARA Sbjct: 415 AV--SSLASSSLPMMMQQAVAMQQMQFQQALLMQQTLTAQQAANRAASMKSATELAAARA 472 Query: 1621 
AEISKKLKADGVGNEEKGT 1677 AEISKKLKADG+ EEK T Sbjct: 473 AEISKKLKADGLVTEEKET 491 Score = 86.3 bits (212), Expect = 1e-13 Identities = 102/371 (27%), Positives = 142/371 (38%), Gaps = 6/371 (1%) Frame = +1 Query: 1840 TNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASPKH 2019 + Y SERRS+R RD DRS R + DR+RD R S S ++ Sbjct: 574 SRYDSERRSYRD-RDDIDRSKRRDLDRSRD---------------------RRSSVSRRN 611 Query: 2020 RRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESLSPRTKKSSRAN 2199 R S +SP+TRKS S SPK RES SPR +KSS + Sbjct: 612 RSRS----------------------ISPQTRKSPPVDSDSPKNSRES-SPRVRKSSHPD 648 Query: 2200 XXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAE-DEAHXXXXXXXXXXXXXXXXXXNDE 2376 E YR+RSRSKS + D+ Sbjct: 649 SRSPRHHRRSRSSPKNDDERKLKYRKRSRSKSVDSDKKRDQIQGEKSKHRSRRRSRSLSL 708 Query: 2377 KTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDR-KERKKSEKVKLDGRNDEKTD 2553 + ++RG S + E+ +RRRSRS S E + RS+ + E KK E D R Sbjct: 709 EGEHRGRSRSSASSDENKLKHRRRSRSVSVERKVRSNSKIDEMKKDESRHSDRRRSRSGS 768 Query: 2554 RATKELDLSAKDSRDLKEYGTSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSE---- 2721 + D K+ D RR S S G +H+ SRL ++S+ Sbjct: 769 AEGRHYTKERSDRSRDKKSKHRDRRR--------SRSRSAEGKHHRESRLFPRNSDGNKI 820 Query: 2722 KHDSVFKDKNDLMDVSEGVKSPSVSARYNDSAPVDDRTHSRTKDSSRYEKSTSDRRRHEK 2901 KH + + K+ +EG S + ++ + DR H + + S S R E Sbjct: 821 KHRRLSRSKS-----TEG--KHRSSDKIDERSKRHDRKHLSSAECRHPRGSRSSPRSSED 873 Query: 2902 IDTTRRERDTS 2934 D+ RR R S Sbjct: 874 NDSRRRRRSRS 884 >ref|XP_002518040.1| conserved hypothetical protein [Ricinus communis] gi|223542636|gb|EEF44173.1| conserved hypothetical protein [Ricinus communis] Length = 946 Score = 473 bits (1217), Expect = e-130 Identities = 261/493 (52%), Positives = 297/493 (60%) Frame = +1 Query: 199 SVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESISSDSDTE 378 S + A +PIWMKQ TFK + N +++ SDS+ E Sbjct: 13 SSSTAAAAAPKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFKTLTTNKPEKASDSDSEGE 72 Query: 379 ENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKI 558 E+E+ + NKP+GPVDP+KCT S+F+V TKDSDGRK+ +GGAQ+KVK+ Sbjct: 73 
ESEEYLANKPVGPVDPTKCTAVGAGIAGGTACAPSTFMVATKDSDGRKVMHGGAQIKVKV 132 Query: 559 CPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAXXX 738 PGVGVGG+EQEG+VKD GDG+Y VTYVVPKRGNYMV++EC+GK IMGSPFPVFFSA Sbjct: 133 SPGVGVGGTEQEGIVKDMGDGSYTVTYVVPKRGNYMVNIECNGKPIMGSPFPVFFSAGTS 192 Query: 739 XXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXX 918 VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 193 TGGLLGMAPASTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGIVSGASGGAVLPGIGA 252 Query: 919 XXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXX 1098 E+CREYLNGRC KTDCK HPPHN S Sbjct: 253 SLGEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTSMGTLSQVPMAPSAAAMAAAQAI 312 Query: 1099 XXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSY 1278 +DSS S DKAGK D+LKKTLQ+SNLSPLLT DQLKQLFSY Sbjct: 313 VAAQALQAHAAQVQAQAQSAKDSSGSPDKAGKEDTLKKTLQVSNLSPLLTVDQLKQLFSY 372 Query: 1279 CGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSS 1458 G+VVECSITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP K ++LNSS Sbjct: 373 FGSVVECSITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLPQK-SLLNSS 431 Query: 1459 MNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKK 1638 + SSLP NRAATMKSATE+A+ARAAEISKK Sbjct: 432 VASSSLPLMMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKSATELAAARAAEISKK 491 Query: 1639 LKADGVGNEEKGT 1677 LKADG +EEK T Sbjct: 492 LKADGFVDEEKET 504 Score = 82.8 bits (203), Expect = 1e-12 Identities = 92/334 (27%), Positives = 136/334 (40%), Gaps = 11/334 (3%) Frame = +1 Query: 2095 NVSPRTRKSSRAGSGSPKFHRESLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYR 2274 +VSPR ++S RA SGSPK RES R +KSS +N YR Sbjct: 598 SVSPRMKRSYRADSGSPKRRRESSPRRARKSSHGGSRSPRHHRGSRSSPRNDSDNKLKYR 657 Query: 2275 RRSRSKSAED---EAHXXXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRR 2445 +RSRSKS ED +A +EK N SKSS R E+ +R Sbjct: 658 KRSRSKSVEDSKEKAKEAQDEKFKKQERRSRSLSVEEK--NNVSKSSSRSIDENEPKHRG 715 Query: 2446 RSRSKSAEDEPRSDDRKERKKSEKVK--LDGR--NDEKTDRATKELDLSAKDSRDLKEYG 2613 RSRSKS E R+ +EKV DGR N ++ +K +++ E Sbjct: 716 RSRSKSVE---------ARRSTEKVNETRDGRLKNRDRKRSRSKSVEVRRHSREKGNESR 766 Query: 
2614 TSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDVSEGVKSPSV 2793 + +D S+D G +H+ SR + ++ K K+ S +S + Sbjct: 767 DKKSKHRDRKRSRSISAD---GKHHRGSRSSPRVADD----IKSKHRRHSRSRSPESKKL 819 Query: 2794 SARYNDSAPVD-DRTHSRTKDSSRYEKSTSDRRRHE--KIDTTRRERDTSGMDRAHIG-C 2961 S+ D V+ + SR + S K R E K RR R S + H Sbjct: 820 SSYRMDGTGVEKSKRRSRRRSMSAEGKHCRSPRSSEENKSKHKRRSRSRSAEGKHHSSDI 879 Query: 2962 EDLSRHGRLTSENKKHEKVESIHREKDHLDDDST 3063 +++ R L EN + E++ ++D + D+T Sbjct: 880 KNIKRAENLVHENCVSHETENVTEDQDSVVGDAT 913 >gb|EYU31494.1| hypothetical protein MIMGU_mgv1a001194mg [Mimulus guttatus] Length = 869 Score = 463 bits (1191), Expect = e-127 Identities = 337/941 (35%), Positives = 430/941 (45%), Gaps = 27/941 (2%) Frame = +1 Query: 187 MADRSV-PAVSTALA-----RPIWMKQXXXXXXXXXXXXXXXXXXXXXXTF----KDVEM 336 MADR V A S+ LA +PIWMKQ TF + + + Sbjct: 1 MADRPVNTATSSNLAVAPAPKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFNAQPQQLAL 60 Query: 337 NTVKESISSDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDG 516 ++ ES SSDSD+++ + + +GPVDPSKCT ++F+VVTKD+DG Sbjct: 61 PSIAESSSSDSDSDDERSSERERSVGPVDPSKCTAQGAGIAGGTACAGATFMVVTKDADG 120 Query: 517 RKIPNGGAQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAI 696 K+ GGAQ++V++ PGVGVGG+EQEG+VKD GDG+Y VTYVVPKRGNYMV+VEC+GK I Sbjct: 121 GKVVRGGAQVRVRVSPGVGVGGTEQEGVVKDMGDGSYSVTYVVPKRGNYMVNVECNGKPI 180 Query: 697 MGSPFPVFFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXX 876 MGSPFPVFFSA +NQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 181 MGSPFPVFFSAGTPTGGLLGIAPPASYPNLINQTMPNMPNYSGSVSGAFPGLLGMIPGVV 240 Query: 877 XXXXXXXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXX 1056 E+CREYLNGRC TDCKF HPPHN S Sbjct: 241 NGASGGVVLPGMGSSLGEMCREYLNGRCASTDCKFNHPPHNLLMTAIAATTTMGTLS--- 297 Query: 1057 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLS 1236 +DS + ADSLKK +Q+SNLS Sbjct: 298 --QVPMAPSAAAMAAAQAIVAAQALQAHAQAQSNKDSYGLGNSERNADSLKKMVQVSNLS 355 Query: 1237 PLLTTDQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEM 1416 PLLT DQLKQLF +CGTVVEC ITDSKHFAYIEY K EEA +ALALNNMDVGGRP+NVEM Sbjct: 356 
PLLTVDQLKQLFGFCGTVVECIITDSKHFAYIEYLKAEEATSALALNNMDVGGRPLNVEM 415 Query: 1417 AKSLPPKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSA 1596 AKSLPP+P ILNS + SSLP NRAATMKSA Sbjct: 416 AKSLPPRP-ILNSPLGSSSLPMVMQQAVAMQQMQFQQALLMQQTLTAQQAANRAATMKSA 474 Query: 1597 TEMASARAAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1776 T++A+ARAAEISKKL+ADG+ E K + Sbjct: 475 TDLAAARAAEISKKLQADGLVIEVKDSDRISRSPSPTRAKSKSRSRSKSVSPIKYRPRRR 534 Query: 1777 XXXXXXXXXXXXXXXXXXXXXTNYGSERRSHRQVRDSN---DRSGRWERDRTRDHY-XXX 1944 + S + R R+S DR+ R + R+ D+ Sbjct: 535 SRSYSPPRRNRDYRSRSPVRSRYHSSYEKERRYYRESRDVIDRNRRRDIGRSHDNVSPVS 594 Query: 1945 XXXXXXXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSS 2124 T++S + S + + RRES +TRKSS Sbjct: 595 RRKRSRSLSPRTKRSRKDDSGTSRRRRES----------------------PIEKTRKSS 632 Query: 2125 RAGSGSPKFHRESLSPRTKKSSRANXXXXXXXXXXXXXXXXT---------HENVSHYRR 2277 R S SP+ HR S ++ R+ + ++ S RR Sbjct: 633 RPDSRSPQRHRRRSSSSGDEADRSKQHRNHSLSRSDEVKHHSSDKKDSMKEEKSKSRNRR 692 Query: 2278 RSRSKSAEDEAHXXXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRS 2457 RSRS S ED N PR ES S ++RRSRS Sbjct: 693 RSRSNSVEDR--------------------------NGRRSPPPRVVEESKSRHKRRSRS 726 Query: 2458 KSAEDEPRSDDRKERKKSEKVKLDGRNDEKTDRATKELDLSAKDSRDLKEYGTSDPRRKD 2637 +S D+ +S ++ ER + +K RN +K R ++ K R G+ RR+ Sbjct: 727 RSPVDKHQSSEKYERSREDK----SRNRDK--RRSRSRSTDGKHRR-----GSKASRRR- 774 Query: 2638 TLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDVSEGVKSPSVSAR----Y 2805 SDE + KRSR D ++ D +G +SPS SA Sbjct: 775 --------SDEHKSKHRKRSR------SNKDEASLERPDEHKSKDGKRSPSNSAENDNDL 820 Query: 2806 NDSAPVDDRTHSRTKDSSRYEKSTSDRRRHEKIDTTRRERD 2928 ND PV+ + + ++ R TSD ++++ + RD Sbjct: 821 NDFIPVESKDKALETENDR--SVTSDDVYVDRLNESMHVRD 859 >ref|XP_002885603.1| hypothetical protein ARALYDRAFT_898933 [Arabidopsis lyrata subsp. lyrata] gi|297331443|gb|EFH61862.1| hypothetical protein ARALYDRAFT_898933 [Arabidopsis lyrata subsp. 
lyrata] Length = 985 Score = 462 bits (1190), Expect = e-127 Identities = 342/990 (34%), Positives = 441/990 (44%), Gaps = 34/990 (3%) Frame = +1 Query: 214 STALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNT------------VKESI 357 S A +P WMK TFK V+ T ES Sbjct: 7 SAAAGKPFWMKHAEDAKIKDEGEKDAAAKAAFEATFKGVDQTTHLIEAVAPAPESAPESD 66 Query: 358 SSDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGG 537 S D ++ D + KP+GPVDPSK T S+FVVVTKDSDGRK+PNGG Sbjct: 67 SDSDDDDDESDYLSRKPIGPVDPSKSTASGAGIGGGTACVPSTFVVVTKDSDGRKVPNGG 126 Query: 538 AQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPV 717 A ++V++CPGVGVGG++QEG+VKD GDG+Y VTYVVPKRGNYMV++EC+G AIMGSPFPV Sbjct: 127 ALIRVRVCPGVGVGGTDQEGVVKDVGDGSYAVTYVVPKRGNYMVNIECNGSAIMGSPFPV 186 Query: 718 FFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXX 897 FFS +NQTMPNMPNY+GSVSGAFPGLLGM+P Sbjct: 187 FFS-QGSSSTGLMGSAPASYSNLINQTMPNMPNYTGSVSGAFPGLLGMVPGIASGPSGGA 245 Query: 898 XXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXX 1077 E+CREYLNGRCV + CK HPP N S Sbjct: 246 ILPGVGASLGEVCREYLNGRCVNSMCKLNHPPQNLLMTAIAATTSMGNMSQVPMAPSAAA 305 Query: 1078 XXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQ 1257 + S S +K DSLKK LQ+SNLSP LTT+Q Sbjct: 306 MAAAQAIVAAQALQAHASQMQAQAQSNKGSLGSPEKGENGDSLKKFLQVSNLSPSLTTEQ 365 Query: 1258 LKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPK 1437 L+QLFS+CGTVV+CSITDSKH AYIEYS EEA AALALNN +V GRP+NVE+AKSLP K Sbjct: 366 LRQLFSFCGTVVDCSITDSKHLAYIEYSNSEEATAALALNNTEVFGRPLNVEIAKSLPHK 425 Query: 1438 PAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASAR 1617 P+ NSS SSLP NRAATMKSATE+A+AR Sbjct: 426 PSSNNSS---SSLPLMMQQAVAMQQMQFQQAILMQQAVATQQAANRAATMKSATELAAAR 482 Query: 1618 AAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1797 AAEIS+KL+ DGVGN+ K Sbjct: 483 AAEISRKLRPDGVGNDVKEADQKSRSPSKSPGSRSKSKSPISYRRRRRSRSYSPPFRRPR 542 Query: 1798 XXXXXXXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXX 1977 T Y RRS+R RD ++ S R+ R D + Sbjct: 543 
SHRSRSPLRYHRRST-YEGRRRSYRDSRDISE-SRRYGRS---DEHHSSSSRRSRSVSPK 597 Query: 1978 TRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHR 2157 RKS + +HRR+S S +KSSRAGS SP+ + Sbjct: 598 KRKSGQEDLELSRHRRDS----------------------SSRGDKKSSRAGSRSPRRRK 635 Query: 2158 ESLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXXXXXXXX 2337 E K + R + EN R RSRS+S ED A Sbjct: 636 E-----VKSTPRDD-----------------EENKLKRRTRSRSRSVEDSADMKDESRDE 673 Query: 2338 XXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKERKKSEK 2517 + + D ++ + R + E+ +RRRSRSKS E++ SD+ + Sbjct: 674 ELKHHKKRSRSRSREDRSKTRDASRNSDETKRKHRRRSRSKSLENDNGSDENVD------ 727 Query: 2518 VKLDG-RNDEKTDRATKELDLSAKDSRDLKE-YGTSDPRRKDTLLEDGSSS----DEKYG 2679 V DG N + R + LD + D+KE G S R +T + D G Sbjct: 728 VAQDGDLNSRHSRRRSNSLD----EDYDMKERRGRSRSRSLETKNRSSGKNKLDEDRNTG 783 Query: 2680 SNHKRSR---LDDKDSEKHDSVFKDKNDLMDVSEGVKSPSVSAR----------YNDSAP 2820 S +RSR ++ K S ++ +DK +SPS + Y+D Sbjct: 784 SRRRRSRSKSVEGKRSYNKETRSRDKKSKRRSGRRSRSPSSEGKQGRDIRSSPGYSDEKK 843 Query: 2821 VDDRTHSRTKDSSRYEKSTSDR-RRHEKIDTTRRERDTSGMDR--AHIGCEDLSRHGRLT 2991 + HSR++ + + S R +RHE++ ++ D DR + + ED R + Sbjct: 844 SRHKQHSRSRSTEKKNSSREKRSKRHERLRSSSPVGDKRRGDRSLSPVSSEDHKIKKRHS 903 Query: 2992 SENKKHEKVESIHREKDHLDDDSTYNREGR 3081 EK S + + D D +S ++R R Sbjct: 904 GSKSVKEKPRSDYEKVDDGDANSDFSRPER 933 >ref|XP_004296963.1| PREDICTED: uncharacterized protein LOC101297633 [Fragaria vesca subsp. 
vesca] Length = 1040 Score = 460 bits (1183), Expect = e-126 Identities = 266/500 (53%), Positives = 299/500 (59%), Gaps = 3/500 (0%) Frame = +1 Query: 187 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESIS-- 360 MADR++ A+ +PIWMKQ TFKDV+ + K + + Sbjct: 1 MADRNL-----AVVKPIWMKQAEEARVKSEAEKDAAAKAAFEATFKDVDKSKEKGAAAAG 55 Query: 361 SDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGA 540 SDS++EE E+L NKP+GPVD +KCT SSF V TKDSDGRK+PNGGA Sbjct: 56 SDSESEEAENLA-NKPIGPVDATKCTAAGAGIAGGTACAPSSFTVATKDSDGRKVPNGGA 114 Query: 541 QLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVF 720 Q+KVKI PG+GVGGSEQEGMVKD GDGTY VTYVVPKRGNYMV +EC+G+AIMGSPFPVF Sbjct: 115 QIKVKIMPGLGVGGSEQEGMVKDMGDGTYTVTYVVPKRGNYMVTIECNGRAIMGSPFPVF 174 Query: 721 FSA-XXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXX 897 FSA VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 175 FSAGSTSTGGLLGLAPTSTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGIVPGALGGA 234 Query: 898 XXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXX 1077 E+CREYLNGRC K DCK HPPH S Sbjct: 235 ILPGIGASLGEVCREYLNGRCAKADCKLNHPPHQLLMTALAATTNMGNVS-QVPMAPSAA 293 Query: 1078 XXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQ 1257 +DSS S DKAGKAD LK+TLQ+SNLSPLLT +Q Sbjct: 294 AMAAAQAIVAAQALQAHAAQHAQAQSNKDSSGSPDKAGKADVLKRTLQVSNLSPLLTVEQ 353 Query: 1258 LKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPK 1437 LKQLFS+CGTVVEC+ITDSKHFAYIEY+KPEEA AALALN+MDVGGRP+NVEMAKSLP K Sbjct: 354 LKQLFSFCGTVVECTITDSKHFAYIEYTKPEEATAALALNSMDVGGRPLNVEMAKSLPQK 413 Query: 1438 PAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASAR 1617 A +NS M SSLP NRAATMK+ATE+A+AR Sbjct: 414 SA-MNSQMASSSLPMVMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKTATELAAAR 472 Query: 1618 AAEISKKLKADGVGNEEKGT 1677 AAEISKKLKADGV EE T Sbjct: 473 AAEISKKLKADGVEIEETET 492 Score = 92.4 bits (228), Expect = 2e-15 Identities = 110/428 (25%), Positives = 162/428 (37%), Gaps = 17/428 (3%) Frame = +1 Query: 1846 YGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASPKHRR 2025 Y + RRS+R R+ +DR 
R R D + RKS R S+SPKHRR Sbjct: 548 YDNGRRSYRDFRNGSDRGRR----RDSDDFQSYASRRRSRSVSPRRKSHRVDSSSPKHRR 603 Query: 2026 ESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRES-LSPR-------TK 2181 E SP R+++R S SP + S LSPR + Sbjct: 604 ER-----------------------SP--RRATRDASRSPGHYTGSKLSPRGDEDKSKHR 638 Query: 2182 KSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXXXXXXXXXXXXXXXX 2361 K SR+N + RRRSRS S E + H Sbjct: 639 KRSRSNSPEDKHLLNDKKDETRYETSKHRERRRSRSVSVEGKHHR--------------- 683 Query: 2362 XXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPR-SDDRKERKKSEKVKLDGRN 2538 +SSPR E+ S +RRRSRSKS E +PR +D+ ++RK + R+ Sbjct: 684 -----------KRSSPRSLDENKSKHRRRSRSKSVEVKPRGADETRDRKLKHRSGRQSRS 732 Query: 2539 DEKTDRATKELDLSAKDSRDLKEYGTSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDS 2718 K+ + + D + RD K R + +E S E G K+S+ D+ Sbjct: 733 --KSLESKRHSDEKTSELRDEKSKHRDRRRSRSKSVEGRRHSKEVDGGRDKKSKRRDRRQ 790 Query: 2719 EKHDSVFKDKNDLMDVSEGVKSP-------SVSARYNDSAPVDDRTHSRTK-DSSRYEKS 2874 + S +D + G SP S R + S + + H + D SR EK+ Sbjct: 791 SRSRSRSSSLEPKLD-TRGESSPRRLDEHKSKHRRRSRSKSAEGKQHLNDRADKSRNEKA 849 Query: 2875 TSDRRRHEKIDTTRRERDTSGMDRAHIGCEDLSRHGRLTSENKKHEKVESIHREKDHLDD 3054 RRR + + RER E+ S+ R S ++ E ++ DD Sbjct: 850 KRHRRRRSRSISLERERHRGSRLSPRSSDENDSKQRRRRSRSESSEGKHQRSERDENGDD 909 Query: 3055 DSTYNREG 3078 + + +G Sbjct: 910 ELKHYEDG 917 >ref|XP_007153615.1| hypothetical protein PHAVU_003G050400g [Phaseolus vulgaris] gi|561026969|gb|ESW25609.1| hypothetical protein PHAVU_003G050400g [Phaseolus vulgaris] Length = 957 Score = 458 bits (1178), Expect = e-125 Identities = 256/487 (52%), Positives = 292/487 (59%), Gaps = 3/487 (0%) Frame = +1 Query: 226 ARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKES--ISSDSDTEENEDLVK 399 A+PIWMKQ TFK +E N K + SDS++EE EDL Sbjct: 9 AKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFKGLEKNREKGGGVVQSDSESEEYEDLA- 67 Query: 400 NKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKICPGVGVG 579 NKP+GPVDPSKCT SSFVVV KD+D RK+ NGGAQ+KV++ PG+GVG Sbjct: 68 NKPIGPVDPSKCTAAGTGIAGGTACAPSSFVVVAKDADERKVSNGGAQIKVRVTPGLGVG 127 Query: 580 
GSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFS-AXXXXXXXXX 756 GSEQEGMVKD GDGTY VTYVVPKRGNYMV VEC+G+ IMGSPFPVFFS A Sbjct: 128 GSEQEGMVKDMGDGTYTVTYVVPKRGNYMVSVECNGRPIMGSPFPVFFSAAGNGSGGLLG 187 Query: 757 XXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXXXXXEIC 936 VNQTMPNMPNYSGSVSGAFPGLLGMIP E+C Sbjct: 188 LAPASTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGVVAGASGGAILPGIGASLGEVC 247 Query: 937 REYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXXXXXXXX 1116 R+YLNGRC K DCK HPPHN S Sbjct: 248 RDYLNGRCAKVDCKLNHPPHNLLMTALAATTSMGTLS--QAPMAPSAAAMAAAQAIVAAQ 305 Query: 1117 XXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSYCGTVVE 1296 +DS+ S +K+ K D+LKKTLQ+SNLSPLLT +QLKQLF++CGTVV+ Sbjct: 306 ALQAHAAQVQAQSAKDSAGSPEKSSKDDALKKTLQVSNLSPLLTVEQLKQLFAFCGTVVD 365 Query: 1297 CSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSSMNQSSL 1476 C+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP KP+++NSS+ SSL Sbjct: 366 CTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLPQKPSVVNSSLASSSL 425 Query: 1477 PXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKKLKADGV 1656 P NRAATMKSATE+A+ARAAEISKKL DG+ Sbjct: 426 PLMMQQAVAMQQMQFQQALRMQQTMTAQQAANRAATMKSATELAAARAAEISKKLNPDGL 485 Query: 1657 GNEEKGT 1677 +EEK T Sbjct: 486 ESEEKET 492 Score = 84.3 bits (207), Expect = 4e-13 Identities = 97/411 (23%), Positives = 167/411 (40%), Gaps = 24/411 (5%) Frame = +1 Query: 1840 TNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASPKH 2019 ++Y ERR R R+ +DR + + DR+ DH +R S SPK Sbjct: 544 SSYERERR-FRDSREHSDRYRKRDLDRSLDH---RSSVSRRNKSRSVSPHTRKSSVSPKR 599 Query: 2020 RRESLXXXXXXXXXXXXXXXXXXXGNVSP-RTRKSSRAGSGSPKFHRESLSPRTKKSSRA 2196 RE+ SP R RK SRA SGSP R SP T + Sbjct: 600 HRET-----------------------SPHRGRKQSRADSGSPSRRRGRASPNTDEKKLR 636 Query: 2197 NXXXXXXXXXXXXXXXXTHENVSH------YRRRSRSKSAEDEAHXXXXXXXXXXXXXXX 2358 N +E + H R+RSRS S +++ H Sbjct: 637 NRRHSRSRSSDDRLHSSKNEEILHGKSKHKERKRSRSGSVDEKPH--------------- 681 Query: 2359 XXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKERKKSEKVKLDGRN 2538 R S+SSPR+ ES S Y++RSRSKS +D+ S +R ++ 
++ +++ + Sbjct: 682 ----------RRSRSSPRKVDESRSRYKKRSRSKSVDDKHDSPERLDQNRNRRMRHSDKR 731 Query: 2539 DEKTDRATKELDLS---AKDSRDLKEYGTSDPRRKDTLLEDGSSSDEKYGSNH-KRSRLD 2706 ++ R+T+ DLS +S++ K R + +E S +K G N K+S+ Sbjct: 732 HSRS-RSTENRDLSEVRVDESKNEKSKHRDSKRGRSKSVEGKHRSKDKSGENRDKKSKHR 790 Query: 2707 DKDSEKHDSVFKDKNDLMDVSEGVK--SPSVSARYNDSAPVDDRTHSRTK-----DSSRY 2865 D+ + S+ + ++D S + + + + S + + H K + S + Sbjct: 791 DRRRSRSTSL-EGEHDKSGTSPHINLDERNFEVKQSRSKFPEGKHHFSDKYGNRDEKSEH 849 Query: 2866 EKSTSDRRRHEKIDTTR------RERDTSGMDRAHIGCEDLSRHGRLTSEN 3000 +K T + + E+ D + ++ D+ G ++ G ++ +H EN Sbjct: 850 QKKTPPKSKSEQFDGSGSFQGNFKDYDSKGKSQSDSGSAEI-KHNLSDGEN 899 >ref|XP_006421067.1| hypothetical protein CICLE_v10004448mg [Citrus clementina] gi|557522940|gb|ESR34307.1| hypothetical protein CICLE_v10004448mg [Citrus clementina] Length = 709 Score = 458 bits (1178), Expect = e-125 Identities = 257/491 (52%), Positives = 299/491 (60%), Gaps = 3/491 (0%) Frame = +1 Query: 214 STALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDV--EMNTVKESISSDSDTEEN- 384 +TA ++ IW+KQ TFK + + N + +++SDSD+EE+ Sbjct: 5 NTAASKAIWLKQAEEAKLKSEAEKDAAAKAAFEATFKGLTNKANEKQAAVASDSDSEEDW 64 Query: 385 EDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKICP 564 E+ + NKP+GPVDPSK T S+F+VVTKDSDGRK+P+GGA++KVK+ P Sbjct: 65 EEDLSNKPIGPVDPSKSTAAGAGIAGGNAGAASTFMVVTKDSDGRKVPHGGAEIKVKVAP 124 Query: 565 GVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAXXXXX 744 GVGVGGSEQEG+VKD DGTY VTYVVPKRGNYM+ +EC+GK IMGSPFPVFFSA Sbjct: 125 GVGVGGSEQEGIVKDMNDGTYTVTYVVPKRGNYMLSIECNGKPIMGSPFPVFFSA---GS 181 Query: 745 XXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXXXX 924 VNQTMPNMPNYS SVSGAFPGLLGMIP Sbjct: 182 NSTGGGLLGMAPNLVNQTMPNMPNYSASVSGAFPGLLGMIPGVVSGASGGAILPGMGASL 241 Query: 925 XEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXXXX 1104 E+CREYLNGRC KTDCK HPPHN S Sbjct: 242 GEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTTMGTLS-QVPMAPSAAAMAAAQAIV 300 Query: 1105 XXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSYCG 1284 +D S S 
DKAGKAD+LKKTLQ+SNLSPLLT +QLKQLFS+CG Sbjct: 301 AAQALQAHAAQVQAQQSAKDLSGSPDKAGKADALKKTLQVSNLSPLLTVEQLKQLFSFCG 360 Query: 1285 TVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSSMN 1464 TVVEC+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKS P KP+ LNSS+ Sbjct: 361 TVVECTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSFPQKPSHLNSSLA 420 Query: 1465 QSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKKLK 1644 SSLP NRAA+MKSATE+A+ARAAEISKKLK Sbjct: 421 GSSLPMMMQQAVAMQQMQFQQALLMQQTLTAQQAANRAASMKSATELAAARAAEISKKLK 480 Query: 1645 ADGVGNEEKGT 1677 ADG+ +E+K T Sbjct: 481 ADGLVDEDKET 491 >ref|XP_006492975.1| PREDICTED: uncharacterized protein LOC102615780 isoform X1 [Citrus sinensis] Length = 950 Score = 457 bits (1175), Expect = e-125 Identities = 256/491 (52%), Positives = 299/491 (60%), Gaps = 3/491 (0%) Frame = +1 Query: 214 STALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDV--EMNTVKESISSDSDTEEN- 384 +TA ++ IW+KQ TFK + + N + +++SDSD+EE+ Sbjct: 5 NTAASKAIWLKQAEEAKLKSEAEKDAAAKAAFEATFKGLTNKANEKQAAVASDSDSEEDW 64 Query: 385 EDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKICP 564 E+ + NKP+GPVDPSK T S+F+VVTKDSDGRK+P+GGA++KVK+ P Sbjct: 65 EEDLSNKPIGPVDPSKSTAAGAGIAGGNAGAASTFMVVTKDSDGRKVPHGGAEIKVKVAP 124 Query: 565 GVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAXXXXX 744 GVGVGGSEQEG+VKD DGTY VTYVVPKRGNYM+ +EC+GK IMGSPFPVFFSA Sbjct: 125 GVGVGGSEQEGIVKDMNDGTYTVTYVVPKRGNYMLSIECNGKPIMGSPFPVFFSA---GS 181 Query: 745 XXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXXXX 924 VNQTMPNMPNYS SVSGAFPGLLGMIP Sbjct: 182 NSTGGGLLGMAPNLVNQTMPNMPNYSASVSGAFPGLLGMIPGVVSGASGGAILPGMGASL 241 Query: 925 XEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXXXX 1104 E+CREYLNGRC KTDCK HPPHN S Sbjct: 242 GEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTTMGTLS-QVPMAPSAAAMAAAQAIV 300 Query: 1105 XXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSYCG 1284 +D S S DKAGKAD+LKKTLQ+SNLSPLLT +QL+QLFS+CG Sbjct: 301 AAQALQAHAAQVQAQQSAKDLSGSPDKAGKADALKKTLQVSNLSPLLTVEQLRQLFSFCG 360 Query: 1285 
TVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSSMN 1464 TVVEC+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKS P KP+ LNSS+ Sbjct: 361 TVVECTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSFPQKPSHLNSSLA 420 Query: 1465 QSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKKLK 1644 SSLP NRAA+MKSATE+A+ARAAEISKKLK Sbjct: 421 GSSLPMMMQQAVAMQQMQFQQALLMQQTLTAQQAANRAASMKSATELAAARAAEISKKLK 480 Query: 1645 ADGVGNEEKGT 1677 ADG+ +E+K T Sbjct: 481 ADGLVDEDKET 491 Score = 70.9 bits (172), Expect = 5e-09 Identities = 112/402 (27%), Positives = 144/402 (35%), Gaps = 7/402 (1%) Frame = +1 Query: 1840 TNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASPKH 2019 + Y +ER S R RD DRS R E D + H +RKS R+ S SPKH Sbjct: 564 SKYDNERLSTRDTRDGADRSRR-ESDISH-HSPVPRRRKSRSVSPRSRKSYRSDSGSPKH 621 Query: 2020 RRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESL-SPRTKKSSRA 2196 R+ES RKSSRA S SPK HR S SPR + Sbjct: 622 RQES-------------------------SARKSSRAHSKSPKRHRGSRNSPRNDDAK-- 654 Query: 2197 NXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXXXXXXXXXXXXXXXXXXNDE 2376 YR RSRSKS E ++E Sbjct: 655 ----------------------PKYRHRSRSKSME--------------------RHDEE 672 Query: 2377 KTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKE--RKKSEKVKLDGRNDEKT 2550 K + R KS R R+RSRS S EDE R K KL GR+ + Sbjct: 673 KDEARDGKSQHRE--------RKRSRSLSREDEHHGRGRSPAGNVDENKSKLRGRSRSVS 724 Query: 2551 DRATKELDLSAKDSRD--LKEYGTSDPRRKDTLLEDGSSSDEKYGSNH--KRSRLDDKDS 2718 + A DSRD L+ R K + G S +K +H +RSR D Sbjct: 725 ALDKYKSSEIADDSRDDRLRNRHKRRSRSKSVEGKMGGGSRDKKPKHHDRRRSRSISADG 784 Query: 2719 EKHDSVFKDKNDLMDVSEGVKSPSVSARYNDSAPVDDRTHSRTKDSSRYEKSTSDRRRHE 2898 + H G +SP R D + SR+K R K +R R Sbjct: 785 KHH--------------RGSRSP----RGLDESRTKLGRRSRSKSIER--KLYRNRGRSR 824 Query: 2899 KIDTTRRERDTSGMDRAHIGCEDLSRHGRLTSENKKHEKVES 3024 D RR+ S + A++ + +R S++K H ES Sbjct: 825 SADGRRRK---SMLSPANLDRSESNRRRHSASQSKDHANSES 863