BLASTX nr result
ID: Akebia24_contig00000129
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia24_contig00000129 (4206 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002270255.1| PREDICTED: uncharacterized protein LOC100244... 539 e-150 gb|EXC21916.1| Tripartite motif-containing protein 45 [Morus not... 531 e-147 emb|CBI36502.3| unnamed protein product [Vitis vinifera] 527 e-146 ref|XP_007224346.1| hypothetical protein PRUPE_ppa020677mg, part... 520 e-144 ref|XP_003531222.1| PREDICTED: serine/arginine repetitive matrix... 511 e-141 ref|XP_004134373.1| PREDICTED: uncharacterized protein LOC101203... 511 e-141 ref|XP_004157720.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 506 e-140 ref|XP_006373079.1| RNA recognition motif-containing family prot... 503 e-139 ref|XP_003524186.1| PREDICTED: splicing regulatory glutamine/lys... 503 e-139 ref|XP_006373080.1| hypothetical protein POPTR_0017s08510g [Popu... 496 e-137 ref|XP_002300152.2| RNA recognition motif-containing family prot... 487 e-134 ref|XP_007034241.1| RNA recognition motif-containing protein iso... 481 e-133 ref|XP_007034240.1| RNA recognition motif-containing protein iso... 481 e-133 ref|XP_002518040.1| conserved hypothetical protein [Ricinus comm... 473 e-130 gb|EYU31494.1| hypothetical protein MIMGU_mgv1a001194mg [Mimulus... 463 e-127 ref|XP_002885603.1| hypothetical protein ARALYDRAFT_898933 [Arab... 462 e-127 ref|XP_004296963.1| PREDICTED: uncharacterized protein LOC101297... 460 e-126 ref|XP_007153615.1| hypothetical protein PHAVU_003G050400g [Phas... 458 e-125 ref|XP_006421067.1| hypothetical protein CICLE_v10004448mg [Citr... 458 e-125 ref|XP_006492975.1| PREDICTED: uncharacterized protein LOC102615... 457 e-125 >ref|XP_002270255.1| PREDICTED: uncharacterized protein LOC100244513 [Vitis vinifera] Length = 926 Score = 539 bits (1388), Expect = e-150 Identities = 364/884 (41%), Positives = 431/884 (48%), Gaps = 41/884 (4%) Frame = +2 Query: 182 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNT---VKESI 352 M DR+ S+A+ +PIWMKQ TFKD + V +S Sbjct: 1 MGDRT----SSAVTKPIWMKQAEEAKIKSEAEKAAAAKAAFEATFKDAASASAPAVADSS 56 Query: 353 SSDSDT--EENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPN 526 SSDSD E+ E + +KP+GPVDPSKCT SSFVVVTKDSDGRK+PN Sbjct: 57 SSDSDDAEEDAESRLASKPIGPVDPSKCTAAGAGIAGGAACSASSFVVVTKDSDGRKVPN 116 Query: 527 GGAQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPF 706 GGAQ++V++ PGVGVGGS+QEG++KDQGDG+Y VTYVV KRGNYMVHVEC+GK IMGSPF Sbjct: 117 GGAQIRVRVSPGVGVGGSDQEGIIKDQGDGSYTVTYVVSKRGNYMVHVECNGKPIMGSPF 176 Query: 707 PVFFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXX 886 PVFFSA VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 177 PVFFSAGTASGGLLGLAPASTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGIVPGASG 236 Query: 887 XXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXX 1066 E+CREYLNGRC KTDCKF HPPHN S Sbjct: 237 GAVLPGIGASLGEVCREYLNGRCAKTDCKFNHPPHNLLMTALAATTTMGTLSQVPMAPSA 296 Query: 1067 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTT 1246 +DS+ S DK GKAD+LKKTLQ+SNLSPLLT Sbjct: 297 AAMAAAQAIVAAQALQAHAAQVQAQAQSAKDSAGSPDKVGKADALKKTLQVSNLSPLLTV 356 Query: 1247 DQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLP 1426 +QLKQLFS+CGTVVECSITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP Sbjct: 357 EQLKQLFSFCGTVVECSITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLP 416 Query: 1427 PKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMAS 1606 PKPAILNS + SLP NRAATMKSATE+AS Sbjct: 417 PKPAILNSPLASPSLPMVMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKSATELAS 476 Query: 1607 ARAAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1786 ARAAEISKKLKADG EEK Sbjct: 477 ARAAEISKKLKADGFVEEEKEEKEENRKSRSPSISHARSKSRSKSPLHYRRRRRSRSFSP 536 Query: 1787 XXXXXXXXXXXXXXXXTNYGSERRSHRQVRDS---NDRSGRWERDRTRDHY-XXXXXXXX 1954 +Y R RD+ +DRS R + DR+ DH+ Sbjct: 537 PSRYSREHRSRSPFRSHHYSIHDHGSRSYRDNKDGSDRSRRRDLDRSHDHHLSSSRRNRS 596 Query: 1955 XXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSG 2134 TRKS RA S SPK R ES S RTRKSSR Sbjct: 597 RSRSPRTRKSYRADSESPKRRVES----------------------SSHRTRKSSRVSPK 634 Query: 2135 SPKFHRESLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAE------ 2296 SP+ HR S +SS N +N S RRRSRSKS E Sbjct: 635 SPRHHRGS------RSSPRN----------------DDDNKSKRRRRSRSKSVEGKHYSN 672 Query: 2297 DEAHXXXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPR 2476 ++ + E ++GS SPR +S S +R+RSRSKSAE + Sbjct: 673 EKIDERRDKKSKHRDRRRSRSISAEGKHHKGSGFSPRSFDDSKSKHRKRSRSKSAEGKRV 732 Query: 2477 SDDRKERKKSEKVKLDGRNDEKTDRATKELD--------LSAKDSRDLK----------- 2599 D+ + + EK G++ EK ++ + LS K S +++ Sbjct: 733 LSDKTDEGRDEK----GKHHEKRRSRSRSAEGKYCRLNRLSPKSSDEIRPKHRRHSRSRS 788 Query: 2600 -EYGTSDPRRKDTLL------EDGSSSDEKYGSNHKRSRLDDKD 2710 EY SD + + L+ E + D K S HK +++D + Sbjct: 789 AEYRRSDNKGDEKLMHHKEPKEREVTEDLKEPSKHKMPKIEDME 832 >gb|EXC21916.1| Tripartite motif-containing protein 45 [Morus notabilis] Length = 973 Score = 531 bits (1368), Expect = e-147 Identities = 381/994 (38%), Positives = 467/994 (46%), Gaps = 44/994 (4%) Frame = +2 Query: 182 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESISSD 361 MADRS +ALA+PIW+KQ TFKDVE + K ++ Sbjct: 1 MADRS-----SALAKPIWVKQAEEAKLKSEAEKAAAAKAAFEATFKDVEKSREKGGAAAS 55 Query: 362 SDTEENEDL---VKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGG 532 SD+E +E+ + KP+GP DP+KC SSFVV KD+DGRK PNGG Sbjct: 56 SDSESDEEAEEDLSRKPIGPADPAKCMAAGAGIAGGTACAPSSFVVTAKDADGRKCPNGG 115 Query: 533 AQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPV 712 AQ+KVK+ PGVGVGG+EQEG+VKD GDGTY VTYVVPKRGNYMV+VEC+GK IMGSPFPV Sbjct: 116 AQIKVKVSPGVGVGGTEQEGVVKDMGDGTYTVTYVVPKRGNYMVNVECNGKPIMGSPFPV 175 Query: 713 FFSAXXXXXXXXXXXXXXXXXXX---VNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXX 883 FFSA VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 176 FFSAGATTPTSGGLLGLAPTSTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGIIPGAS 235 Query: 884 XXXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXX 1063 E+CREYLNGRC KTDCK HPPHN S Sbjct: 236 GGAILPGIGASLGEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTSMGTVSQVPMAPS 295 Query: 1064 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLT 1243 +DSS S DKAGK D+LKKTLQ+SNLSPLLT Sbjct: 296 AAAMAAAQAIVAAQALQAHAAQVQAQAKSGKDSSASPDKAGKDDALKKTLQVSNLSPLLT 355 Query: 1244 TDQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSL 1423 +QLKQLFS+CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRPMNVEMAKSL Sbjct: 356 VEQLKQLFSFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPMNVEMAKSL 415 Query: 1424 PPKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMA 1603 P KPAILNS + SSLP +RAATMKSATE+A Sbjct: 416 PQKPAILNSQLASSSLPMMMQQAVAMQQMQFQQALLMQQTMMTQQAASRAATMKSATELA 475 Query: 1604 SARAAEISKKLKADGVGNE------EKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1765 +ARAAEISKKLKADG+ +E EK Sbjct: 476 AARAAEISKKLKADGLVSEEKEEKEEKEAKPKSRSPSPSRKKSRSKSRSPINYHRRRRSP 535 Query: 1766 XXXXXXXXXXXXXXXXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRDHY-XXXX 1942 ++Y +ERRS R++RD DR R + R+RDH+ Sbjct: 536 SYSPPSRQARDRRSRSPIRSRHYSSYDNERRSFREIRDGGDRYRRRDSGRSRDHHVSSSR 595 Query: 1943 XXXXXXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSR 2122 RKS R S SPK RES R+++R Sbjct: 596 KHRSRSASPGRRKSYRVDSVSPKRHRES-------------------------TPRRATR 630 Query: 2123 AGSGSPKFHR-ESLSPRTKKSSRANXXXXXXXXXXXXXXXXT-------HENVSHY-RRR 2275 AGS SP + R SPR + N HE H RRR Sbjct: 631 AGSRSPSYSRGNRSSPRIDDERKLNHRKRSRSISPDGKYHSNGTRDETRHERSKHRDRRR 690 Query: 2276 SRSKSAEDEAHXXXXXXXXXXXXXXXXXXNDEKT---DNRGSKSSPRRAHESVSHYRRRS 2446 SRS SAED+ H + K+ +R + R +S RRRS Sbjct: 691 SRSVSAEDKHHRMSFARSTNETKSKHRRRSRSKSVEGKHRSVEKDANRDDKSKHRGRRRS 750 Query: 2447 RSKSAEDEPRSDDRKERKKSEKVKLDGRNDEKTDRATKELDLSAKDSRDLKEYGTSDPRR 2626 RS S E + SD + ++E K D R R+ D D D E G ++ + Sbjct: 751 RSTSLESKRLSDGKMNETRNEDSKHDVRRSRSRSRSESLEDKFHFD--DSVEGGRNEKSK 808 Query: 2627 KDTLLEDGSSSDEKYGSNHK-RSRLDDKDSE--KHDSVFKDKNDLMDVSEGVKSPSVS-- 2791 S S E SNHK + ++DD + KH + ++ ++ +S S S Sbjct: 809 HHAKRRSRSRSVE---SNHKLKEKVDDGRDKRPKHRGRRRSRSVSVEAKHHRRSRSSSRS 865 Query: 2792 --------ARYNDSAPVDDRTHSRTK-DSSRYEKSTSDRRRHEKIDTTRRE--RDTSGMD 2938 +R + S D + + + K + +RY+KS S RR+ + + R +SG Sbjct: 866 SGETKMKHSRRSGSKSPDGKNNFKDKLNETRYKKSKSGRRKRSRSSLLEEKLRRGSSGSQ 925 Query: 2939 RAHIGCEDLS-RHGRLTSENK--KHEKVESIHRE 3031 + E + R + SE + KHE + H E Sbjct: 926 SSSDESESKNIRRSKSDSEGRPSKHEAPITEHLE 959 >emb|CBI36502.3| unnamed protein product [Vitis vinifera] Length = 888 Score = 527 bits (1357), Expect = e-146 Identities = 359/878 (40%), Positives = 424/878 (48%), Gaps = 35/878 (3%) Frame = +2 Query: 182 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNT---VKESI 352 M DR+ S+A+ +PIWMKQ TFKD + V +S Sbjct: 1 MGDRT----SSAVTKPIWMKQAEEAKIKSEAEKAAAAKAAFEATFKDAASASAPAVADSS 56 Query: 353 SSDSDT--EENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPN 526 SSDSD E+ E + +KP+GPVDPSKCT SSFVVVTKDSDGRK+PN Sbjct: 57 SSDSDDAEEDAESRLASKPIGPVDPSKCTAAGAGIAGGAACSASSFVVVTKDSDGRKVPN 116 Query: 527 GGAQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPF 706 GGAQ++V++ PGVGVGGS+QEG++KDQGDG+Y VTYVV KRGNYMVHVEC+GK IMGSPF Sbjct: 117 GGAQIRVRVSPGVGVGGSDQEGIIKDQGDGSYTVTYVVSKRGNYMVHVECNGKPIMGSPF 176 Query: 707 PVFFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXX 886 PVFFSA VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 177 PVFFSAGTASGGLLGLAPASTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGIVPGASG 236 Query: 887 XXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXX 1066 E+CREYLNGRC KTDCKF HPPHN S Sbjct: 237 GAVLPGIGASLGEVCREYLNGRCAKTDCKFNHPPHNLLMTALAATTTMGTLSQVPMAPSA 296 Query: 1067 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTT 1246 +DS+ S DK GKAD+LKKTLQ+SNLSPLLT Sbjct: 297 AAMAAAQAIVAAQALQAHAAQVQAQAQSAKDSAGSPDKVGKADALKKTLQVSNLSPLLTV 356 Query: 1247 DQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLP 1426 +QLKQLFS+CGTVVECSITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP Sbjct: 357 EQLKQLFSFCGTVVECSITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLP 416 Query: 1427 PKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMAS 1606 PKPAILNS + SLP NRAATMKSATE+AS Sbjct: 417 PKPAILNSPLASPSLPMVMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKSATELAS 476 Query: 1607 ARAAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1786 ARAAEISKKLKADG EEK Sbjct: 477 ARAAEISKKLKADGFVEEEKEEKEENRKSRSPSISHARSKSRSKSPLHYRRRRRSRSFSP 536 Query: 1787 XXXXXXXXXXXXXXXXTNYGSERRSHRQVRDS---NDRSGRWERDRTRDHY-XXXXXXXX 1954 +Y R RD+ +DRS R + DR+ DH+ Sbjct: 537 PSRYSREHRSRSPFRSHHYSIHDHGSRSYRDNKDGSDRSRRRDLDRSHDHHLSSSRRNRS 596 Query: 1955 XXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSG 2134 TRKS RA S SPK R ES S RTRKSSR Sbjct: 597 RSRSPRTRKSYRADSESPKRRVES----------------------SSHRTRKSSR---- 630 Query: 2135 SPKFHRESLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXX 2314 + E + R K S+ H + RRRSRS SAE + H Sbjct: 631 --HYSNEKIDERRDKKSK-------------------HRD----RRRSRSISAEGKHH-- 663 Query: 2315 XXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKE 2494 +GS SPR +S S +R+RSRSKSAE + D+ + Sbjct: 664 -----------------------KGSGFSPRSFDDSKSKHRKRSRSKSAEGKRVLSDKTD 700 Query: 2495 RKKSEKVKLDGRNDEKTDRATKELD--------LSAKDSRDLK------------EYGTS 2614 + EK G++ EK ++ + LS K S +++ EY S Sbjct: 701 EGRDEK----GKHHEKRRSRSRSAEGKYCRLNRLSPKSSDEIRPKHRRHSRSRSAEYRRS 756 Query: 2615 DPRRKDTLL------EDGSSSDEKYGSNHKRSRLDDKD 2710 D + + L+ E + D K S HK +++D + Sbjct: 757 DNKGDEKLMHHKEPKEREVTEDLKEPSKHKMPKIEDME 794 >ref|XP_007224346.1| hypothetical protein PRUPE_ppa020677mg, partial [Prunus persica] gi|462421282|gb|EMJ25545.1| hypothetical protein PRUPE_ppa020677mg, partial [Prunus persica] Length = 764 Score = 520 bits (1338), Expect = e-144 Identities = 344/822 (41%), Positives = 404/822 (49%), Gaps = 16/822 (1%) Frame = +2 Query: 182 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESIS-S 358 MADRS TALA+PIWMKQ TFKDV+ N KE ++ S Sbjct: 1 MADRS-----TALAKPIWMKQAEEARVKSEAEKAAAAKAAFEATFKDVDKNREKEVVAGS 55 Query: 359 DSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQ 538 DS++EE EDL NKP+GPVDP+KCT SSF+VVTKDSDGRK+P+GG Q Sbjct: 56 DSESEEAEDLA-NKPIGPVDPAKCTAAGAGIAGGTACAPSSFMVVTKDSDGRKVPHGGVQ 114 Query: 539 LKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFF 718 +KVK+ PGVGVGGSEQEGMVKD GDGTY VTYVVPKRGNYMV+V+C+GKAIMGSPFPVFF Sbjct: 115 IKVKVIPGVGVGGSEQEGMVKDMGDGTYTVTYVVPKRGNYMVNVDCNGKAIMGSPFPVFF 174 Query: 719 SAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXX 898 SA VNQTMPNMPNYS SVSGAFPGLLGMIP Sbjct: 175 SAGTSTGGLLGLAPASTFPNLVNQTMPNMPNYSASVSGAFPGLLGMIPGIVPGASGGAIL 234 Query: 899 XXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXX 1078 E+CREYL+GRC KTDCK HPPHN S Sbjct: 235 PGIGASLGEVCREYLSGRCAKTDCKLNHPPHNLLMTALAATTSMSNVSQVPMAPSAAAMA 294 Query: 1079 XXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLK 1258 +DSS S DKAGKAD LKKTLQ+SNLSPLLT +QLK Sbjct: 295 AAQAIVAAQALQAHAAQVQAHAQSNKDSSGSPDKAGKADVLKKTLQVSNLSPLLTVEQLK 354 Query: 1259 QLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPA 1438 QLFS+CGTVVEC+ITDSKHFAYIEYSKPEEA AAL LNNMDVGGRP+NVEMAKSLP KPA Sbjct: 355 QLFSFCGTVVECTITDSKHFAYIEYSKPEEASAALQLNNMDVGGRPLNVEMAKSLPQKPA 414 Query: 1439 ILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAA 1618 I+NSSM SSLP NRAATMK+ATE+A+ARAA Sbjct: 415 IMNSSMASSSLPMVMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKTATELAAARAA 474 Query: 1619 EISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1798 EISKKLKADGV EEK T Sbjct: 475 EISKKLKADGVDIEEKETTEKSRSPSPHFAKSKSKSKSRSRSPINYRRRRKSPSYSPPSR 534 Query: 1799 XXXXXXXXXXXXTNYGSERRSHRQ----VRDSNDRSGRWERDRTRDHYXXXXXXXXXXXX 1966 + + S + R+ +++ +R+ R + DR+ DH+ Sbjct: 535 YPRDRRSRSPLRSRHYSSYDNDRRSFRDIKNEGERTRRRDLDRSHDHH----STHYEKAK 590 Query: 1967 XXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKF 2146 R+ SR+ S KH R L + R+ SR+ S K Sbjct: 591 HRERRRSRSVSTDDKHHRRRLSPRSLDEN--------------KTKHRRRSRSKSVEDKH 636 Query: 2147 H-----RESLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHX 2311 H E +TK R + S +R R RS+S E Sbjct: 637 HPDDKTNEMRDEKTKHRDRRR-----------------RDKKSKHRDRRRSRSISPEG-- 677 Query: 2312 XXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRK 2491 K D R SSPR ++ +RRRSRSKSAE + RS+DR Sbjct: 678 --------------------KHDRRHG-SSPRSLDDNKLKHRRRSRSKSAERKHRSNDRA 716 Query: 2492 ERKKSEKVKLDGRND------EKTDRATKELDLSAKDSRDLK 2599 + + EK K R E R + L + D ++LK Sbjct: 717 YKSRDEKEKGHRRRRSRSASLEPKRRRGRRLSPRSSDEKELK 758 >ref|XP_003531222.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X1 [Glycine max] gi|571470905|ref|XP_006585151.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X2 [Glycine max] gi|571470908|ref|XP_006585152.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X3 [Glycine max] Length = 975 Score = 511 bits (1316), Expect = e-141 Identities = 364/986 (36%), Positives = 462/986 (46%), Gaps = 40/986 (4%) Frame = +2 Query: 221 ARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMN-----------TVKESISSDSD 367 A+PIWMKQ TFK +E +V ES SDS+ Sbjct: 9 AKPIWMKQAEEAKLKSEAEKDAAAKAAFEATFKALENKHDKGGGGGGGGSVAES-DSDSE 67 Query: 368 TEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKV 547 EE EDL +KP+GPVDPSKCT SSFVVV KD+D RK+ GGAQ+KV Sbjct: 68 EEEYEDLA-HKPIGPVDPSKCTAAGTGIAGGTACAPSSFVVVAKDADERKVSGGGAQIKV 126 Query: 548 KICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAX 727 ++ PG+GVGG+EQEGMVKD GDGTY VTYVVPKRGNYMV VEC+G+ IMGSPFPVFFSA Sbjct: 127 RVTPGLGVGGTEQEGMVKDMGDGTYTVTYVVPKRGNYMVSVECNGRPIMGSPFPVFFSAA 186 Query: 728 XXXXXXXXXXXXXXXXXX-VNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXX 904 VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 187 GNSTGGLLGLAPASSFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGVVAGASGGAILPG 246 Query: 905 XXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXX 1084 E+CR+YLNGRC K DCK HPPHN S Sbjct: 247 IGASLGEVCRDYLNGRCAKVDCKLNHPPHNLLMTALAATTSMGTLSQAPMAPSAAAMAAA 306 Query: 1085 XXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQL 1264 +DS+ S +KA K D+LKKTLQ+SNLSPLLT +QLKQL Sbjct: 307 QAIVAAQALQAHAAQVQAQSA--KDSTGSPEKASKDDALKKTLQVSNLSPLLTVEQLKQL 364 Query: 1265 FSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAIL 1444 F +CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNN+DVGGRP+NVEMAKSLPPKP++ Sbjct: 365 FGFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNIDVGGRPLNVEMAKSLPPKPSVA 424 Query: 1445 NSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEI 1624 NSS+ SSLP NRAATMKSATE+A+ARAAEI Sbjct: 425 NSSLASSSLPLMMQQAVAMQQMQFQQALLMQQSMTAQQAANRAATMKSATELAAARAAEI 484 Query: 1625 SKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1804 SKKL DGVG E+ Sbjct: 485 SKKLNPDGVGT-EEKETKQKSRSPSPPHGRSRSKSRSPINYRRRRRSRSYSPARHSKDHR 543 Query: 1805 XXXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKS 1984 ++Y ERRS R +R+ +DR R + DR+ DH+ Sbjct: 544 SRSPLRSHHYSSYDRERRSFRDIREHSDRYRRRDLDRSLDHH------------------ 585 Query: 1985 SRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESLS 2164 SAS ++R S VSP TRKS S SPK HRE+ Sbjct: 586 ---SSASRRNRSRS----------------------VSPYTRKS----SVSPKRHRETSP 616 Query: 2165 PRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXXXXXXXXXXXX 2344 R +K SRA+ + + + RRRSRS+S++D H Sbjct: 617 HRGRKQSRADSGSPSRRRGSRSSPKIDEKKLRN-RRRSRSRSSDDRLHSIKNEEISHGKS 675 Query: 2345 XXXXXXN------DEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKERKKS 2506 DEK +R S+SSPR+ ES S +++R RSKS +D S +R + ++ Sbjct: 676 KHRERRRSRSLSVDEK-PHRRSRSSPRKVDESRSRHKKRLRSKSVDDRHGSPERLDENRT 734 Query: 2507 EKVKLDGRNDEKTDRATKELDLSAKDSRDLKEYGTSDPRRK--DTLLEDGSSSDEKYGSN 2680 + R+ +K R ++ +D D++E + + K DT S + K+ Sbjct: 735 RR----SRHSDK--RHSRSRSTETRDQTDVREDERKNQKSKHRDTKRSRSKSVEGKHRFK 788 Query: 2681 HKRSRLDDKDSEKHDS------VFKDKNDLMDVSEGVKSPSVSARYNDSAPVDDRTHSRT 2842 K DK S++ D +DK+D D S + + + + + HS Sbjct: 789 DKSGENRDKKSKRRDRKRSRSISLEDKHDKGDTSPHINFD--ERNFEPTKSPEGKNHSSD 846 Query: 2843 KDSSRYEKS-----TSDRRRHEKIDTT------RRERDTSGMDRAHIGCEDLSRH---GR 2980 K SR EKS T + + E+ D + +E D+ G + G ++ H G Sbjct: 847 KYGSRGEKSEHQKKTPSKSKSEQFDGSGPLRGNYKEYDSKGKSPSDSGSAEVKHHLSDGE 906 Query: 2981 LTSENKKHEKVESIHREKDHLDDDST 3058 + + + + +E DST Sbjct: 907 NATSEENSKLFGDVFQEPIRTAKDST 932 >ref|XP_004134373.1| PREDICTED: uncharacterized protein LOC101203535 [Cucumis sativus] Length = 936 Score = 511 bits (1315), Expect = e-141 Identities = 334/895 (37%), Positives = 429/895 (47%), Gaps = 29/895 (3%) Frame = +2 Query: 182 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESISSD 361 MADR++ +A+PIWMKQ TFK V+ KE+ SSD Sbjct: 1 MADRNL-----VVAKPIWMKQAEEAKLKSEAEKDAAAKAAFEATFKGVDKIPAKEAASSD 55 Query: 362 SDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQL 541 SD E+NEDL + KP+GPVDP++CT +SF VVTKD DGRK+P+GGA + Sbjct: 56 SDFEDNEDL-ERKPIGPVDPARCTAAGAGIAGGAACVPASFTVVTKDVDGRKVPHGGALI 114 Query: 542 KVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFS 721 KVK+ PGVGVGG+EQ+G+VKD DGTY +TYVVPKRGNYMV++EC+G+ IMGSPFPVFFS Sbjct: 115 KVKVAPGVGVGGTEQDGIVKDMNDGTYTITYVVPKRGNYMVNIECNGRPIMGSPFPVFFS 174 Query: 722 AXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXX 901 A VNQ MPNMPNYSGSVSGAFPGL+GMIP Sbjct: 175 AGTSSGGLLGLAPASSFPNLVNQNMPNMPNYSGSVSGAFPGLMGMIPGIVAGASGGAILP 234 Query: 902 XXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXS-XXXXXXXXXXXX 1078 E+CREYLNG+C KTDCK HPPHN S Sbjct: 235 GIGASLGEVCREYLNGQCAKTDCKLNHPPHNLLMTAIAATTSMGTISQVPMAPSAAAMAA 294 Query: 1079 XXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGK-ADSLKKTLQISNLSPLLTTDQL 1255 +DSS S DK+GK AD+LK+TLQ+SNLSPLLT +QL Sbjct: 295 AQAIVAAQALQAHAAQVQAQQAQSAKDSSGSSDKSGKAADALKRTLQVSNLSPLLTVEQL 354 Query: 1256 KQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKP 1435 KQLFS+CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP KP Sbjct: 355 KQLFSFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLPQKP 414 Query: 1436 AILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARA 1615 A N S+ SSLP NRAATMKSATE+A+ARA Sbjct: 415 AAANPSLASSSLPMMMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKSATELAAARA 474 Query: 1616 AEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1795 AEISKKLK DG+GNEE T Sbjct: 475 AEISKKLKVDGIGNEETETKEKSRSPSLPRERSKSKSKSPIKYRSRRRSPTYSPPYRHSR 534 Query: 1796 XXXXXXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRD-HYXXXXXXXXXXXXXX 1972 + Y +RR +R+ R++++RS R + DR+R Sbjct: 535 DHRSRSPVRSRHYSRYEDDRRGYRESREASERSRRRDLDRSRSRRSPISRKNRSRSISPR 594 Query: 1973 TRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXG--NVSPRT--------RKSSR 2122 RKS RAGS SP H+RE G SPR R+ SR Sbjct: 595 RRKSYRAGSDSPSHQRERSPQRGRKSDHSDLRSPIRHHGKSRSSPRKDDSDKLKHRRRSR 654 Query: 2123 AGSGSPKFHRE---------SLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRR 2275 + S K H + L R ++ SR + +N+S +RRR Sbjct: 655 SKSVETKHHSDEKINEMQHGKLKNRERRRSR-SASLEDKHSKRRPSPRSLDKNISKHRRR 713 Query: 2276 SRSKSAEDEAHXXXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSK 2455 SRS S E D K D+ + R+ RRRSRSK Sbjct: 714 SRSNSREKVDDKYHGRRRSRSSSSDSKHLPDSKVDSTRYEKLKNRS-------RRRSRSK 766 Query: 2456 SAEDEPRSDDRKERKKSEKVKLDGRNDEKT-------DRATKELDLSAKDSRDLKEYGTS 2614 S + + R ++ +R + ++++ R ++ R T+ S+ +++ + + Sbjct: 767 SVDGKHRRREKSDRSRDKRLRHRDRRSSRSISPEAGHQRVTRLSPTSSDETKSKRRRRSL 826 Query: 2615 DPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDVSEGVKS 2779 P K + +++G ++ ++SR + E +S + + G +S Sbjct: 827 SPEDKPSDIDNGCIAENPKNLGRQQSRSNSISGENGESNLSPSTEENEFKHGEQS 881 >ref|XP_004157720.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101203535 [Cucumis sativus] Length = 936 Score = 506 bits (1304), Expect = e-140 Identities = 332/895 (37%), Positives = 427/895 (47%), Gaps = 29/895 (3%) Frame = +2 Query: 182 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESISSD 361 MADR++ +A+PIWMKQ TFK V+ KE+ SSD Sbjct: 1 MADRNL-----VVAKPIWMKQAEEAKLKSEAEKDAAAKAAFEATFKGVDKIPAKEAASSD 55 Query: 362 SDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQL 541 SD E+NEDL + KP+GPVDP++CT +SF VVTKD DGRK+P+GGA + Sbjct: 56 SDFEDNEDL-ERKPIGPVDPARCTAAGAGIAGGAACVPASFTVVTKDVDGRKVPHGGALI 114 Query: 542 KVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFS 721 KVK+ PGVGVGG+EQ+G+VKD DGTY +TYVVPKRGNYMV++EC+G+ IMGSPFPVFFS Sbjct: 115 KVKVAPGVGVGGTEQDGIVKDMNDGTYTITYVVPKRGNYMVNIECNGRPIMGSPFPVFFS 174 Query: 722 AXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXX 901 A VNQ MPNMPNYSGSVSGAFPGL+GMIP Sbjct: 175 AGTSSGGLLGLAPASSFPNLVNQNMPNMPNYSGSVSGAFPGLMGMIPGIVAGASGGAILP 234 Query: 902 XXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXS-XXXXXXXXXXXX 1078 E+CREYLNG+C KTDCK HPPHN S Sbjct: 235 GIGASLGEVCREYLNGQCAKTDCKLNHPPHNLLMTAIAATTSMGTISQVPMAPSAAAMAA 294 Query: 1079 XXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGK-ADSLKKTLQISNLSPLLTTDQL 1255 +DSS S DK+GK AD+LK+TLQ+SNLSPLLT +QL Sbjct: 295 AQAIVAAQALQAHAAQVQAQQAQSAKDSSGSSDKSGKAADALKRTLQVSNLSPLLTVEQL 354 Query: 1256 KQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKP 1435 KQLF +CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP KP Sbjct: 355 KQLFXFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLPQKP 414 Query: 1436 AILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARA 1615 A N S+ SSLP NRAATMKSATE+A+ARA Sbjct: 415 AAANPSLASSSLPMMMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKSATELAAARA 474 Query: 1616 AEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1795 AEIS KLK DG+GNEE T Sbjct: 475 AEISXKLKVDGIGNEETETKEKSRSPSLPRERSKSKSKSPIKYRSRRRSPTYSPPYRHSR 534 Query: 1796 XXXXXXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRD-HYXXXXXXXXXXXXXX 1972 + Y +RR +R+ R++++RS R + DR+R Sbjct: 535 DHRSRSPVRSRHYSRYEDDRRGYRESREASERSRRRDLDRSRSRRSPISRKNRSRSISPR 594 Query: 1973 TRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXG--NVSPRT--------RKSSR 2122 RKS RAGS SP H+RE G SPR R+ SR Sbjct: 595 RRKSYRAGSDSPSHQRERSPQRGRKSDHSDLRSPIRHHGKSRSSPRKDDSDKLKHRRRSR 654 Query: 2123 AGSGSPKFHRE---------SLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRR 2275 + S K H + L R ++ SR + +N+S +RRR Sbjct: 655 SKSVETKHHSDEKINEMQHGKLKNRERRRSR-SASLEDKHSKRRPSPRSLDKNISKHRRR 713 Query: 2276 SRSKSAEDEAHXXXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSK 2455 SRS S E D K D+ + R+ RRRSRSK Sbjct: 714 SRSNSREKVDDKYHGRRRSRSSSSDSKHLPDSKVDSTRYEKLKNRS-------RRRSRSK 766 Query: 2456 SAEDEPRSDDRKERKKSEKVKLDGRNDEKT-------DRATKELDLSAKDSRDLKEYGTS 2614 S + + R ++ +R + ++++ R ++ R T+ S+ +++ + + Sbjct: 767 SVDGKHRRREKSDRSRDKRLRHRDRRSSRSISPEAGHQRVTRLSPTSSDETKSKRRRRSL 826 Query: 2615 DPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDVSEGVKS 2779 P K + +++G ++ ++SR + E +S + + G +S Sbjct: 827 SPEDKPSDIDNGCIAENPKNLGRQQSRSNSISGENGESNLSPSTEENEFKHGEQS 881 >ref|XP_006373079.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|550319785|gb|ERP50876.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 950 Score = 503 bits (1295), Expect = e-139 Identities = 351/946 (37%), Positives = 439/946 (46%), Gaps = 31/946 (3%) Frame = +2 Query: 182 MADRSVPAVSTALA--------RPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNT 337 M DR+ ++ A +PIWMKQ TFK V + Sbjct: 1 MTDRNNTTITAAATSTTNHSATKPIWMKQAEEAKLKSEAEKTAAAKAAFDATFK-VLSDK 59 Query: 338 VKESISSDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRK 517 ++ SDS+ E+ E+ + NKP+GPVDP+KCT ++F+VVTKD+DGRK Sbjct: 60 AEKPADSDSEEEDAEEDLANKPVGPVDPNKCTAAGGGIAGGTACAPATFMVVTKDADGRK 119 Query: 518 IPNGGAQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMG 697 +PNGGA +KV++ PGVGVGG+EQEG VKD GDGTY VTYVVPKRGNYMV +EC+GKAIMG Sbjct: 120 VPNGGAVIKVRVSPGVGVGGTEQEGNVKDMGDGTYTVTYVVPKRGNYMVTIECNGKAIMG 179 Query: 698 SPFPVFFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXX 877 SPFPVFFSA VNQTMPNMPNYS ++SGAFP LLGM P Sbjct: 180 SPFPVFFSAGTSTGGLLGMAPTTTFPNLVNQTMPNMPNYSANISGAFPALLGMTPGITSS 239 Query: 878 XXXXXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXX 1057 E+CREYL GRC KTDCK HPP + S Sbjct: 240 ASGGAILPGAGASLGEVCREYLYGRCAKTDCKLSHPPQSLLMTLLAPTTSMGTLSQVPMA 299 Query: 1058 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPL 1237 +DSS S DKA K D+LKKTL +SNLSPL Sbjct: 300 PSAAAMAAAQAIVAAKALQAHAAQLQAQARSAKDSSGSPDKARKEDALKKTLHVSNLSPL 359 Query: 1238 LTTDQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAK 1417 LT +QLKQLFS+CGTVVEC+I DSKH AYIEYSKPEEA AALALNNMDVGGRP+NVE AK Sbjct: 360 LTVEQLKQLFSFCGTVVECTIADSKHSAYIEYSKPEEATAALALNNMDVGGRPLNVETAK 419 Query: 1418 SLPPKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATE 1597 SLP KP ILNSS SSLP N+AATMKSATE Sbjct: 420 SLPQKP-ILNSSFASSSLPMMMQQAVAMQQMQFQQALLMQQTMTAQQAANKAATMKSATE 478 Query: 1598 MASARAAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1777 +A+ARAAEISKKLK DG+ E T Sbjct: 479 LAAARAAEISKKLKDDGLVTGEGETKAESKSPPPPRARSRSKSRSPINYRRRMRSPSYSP 538 Query: 1778 XXXXXXXXXXXXXXXXXXXTNYGSERRSH--RQVRDSNDRSGRWERDRTRDHY-XXXXXX 1948 + Y ERRS+ R RD DR+ R E DR+RDH+ Sbjct: 539 PSRHNRDRRSRSPVRFRYHSRYNYERRSYRDRDSRDDGDRTRRRELDRSRDHHSPVSRRN 598 Query: 1949 XXXXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAG 2128 TRKS RA S SPKHR+ES + R+RK+S +G Sbjct: 599 RSRSASPRTRKSYRADSGSPKHRQES----------------------SAHRSRKASDSG 636 Query: 2129 SGSPKFHRES-LSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKS-AEDE 2302 S SP+ H S SPR S+ YRRRSRS+S + +E Sbjct: 637 SRSPRHHGGSRSSPRNNPDSKL-----------------------RYRRRSRSRSKSVEE 673 Query: 2303 AHXXXXXXXXXXXXXXXXXXNDEKTD--NRGSKSSPRRAHESVSHYRRRSRSKSAEDEPR 2476 A+ + + G + SPR ++E S +R RSRSKS E + Sbjct: 674 ANEKVDEIREKKSKQHERRSRSLSVELKHHGRRPSPRSSNEDDSKHRSRSRSKSVEVKRH 733 Query: 2477 SDD--------------RKERKKS--EKVKLDGRNDEKTDRATKELDLSAKDSRDLKEYG 2608 S++ R+ R KS ++ R +E D+ TK D SR + G Sbjct: 734 SNEKVDKTGDGKLKHRHRRSRSKSVDDRHHYKERGNETRDKKTKHQDRGR--SRSITAEG 791 Query: 2609 TSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDVSEGVKSPSV 2788 R DGS S + H RS + V ++K++ +SPS Sbjct: 792 KHHRSRSSPRGRDGSKSKHR---RHSRSISPEGKRRSSHRVDQNKDEKSKHRHRRRSPSA 848 Query: 2789 SARYNDSAPVDDRTHSRTKDSSRYEKSTSDRRRHEKIDTTRRERDT 2926 ++ S + S+ + R + + R +++ D R E +T Sbjct: 849 EGKHGRSPRSSEENKSKHRRRPRSKSAERKRHSNDEKDIRRGENET 894 >ref|XP_003524186.1| PREDICTED: splicing regulatory glutamine/lysine-rich protein 1-like isoform X1 [Glycine max] gi|571455668|ref|XP_006580150.1| PREDICTED: splicing regulatory glutamine/lysine-rich protein 1-like isoform X2 [Glycine max] Length = 969 Score = 503 bits (1294), Expect = e-139 Identities = 351/952 (36%), Positives = 448/952 (47%), Gaps = 34/952 (3%) Frame = +2 Query: 221 ARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESIS---SDSDTEENEDLV 391 A+PIWMKQ TFK +E K S SDSD+EE + + Sbjct: 9 AKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFKALENKHDKGGGSVADSDSDSEEEYEDL 68 Query: 392 KNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKICPGVGV 571 +KP+GPV+P+KCT SSFVVVTKD+D RK+ GGAQ+KV++ PG+GV Sbjct: 69 AHKPIGPVEPAKCTAAGTGIAGGTACAPSSFVVVTKDADERKVSGGGAQIKVRVTPGLGV 128 Query: 572 GGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAXXXXXXXXX 751 GG+EQEGMVKD GDGTY VTYVVPKRGNYMV VEC+G+ IMGSPFPVFFSA Sbjct: 129 GGTEQEGMVKDMGDGTYTVTYVVPKRGNYMVSVECNGRPIMGSPFPVFFSAAGNSTGGLL 188 Query: 752 XXXXXXXXXX-VNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXXXXXEI 928 VNQTMPNMPNYSGSVSGAFPGLLGMIP E+ Sbjct: 189 GLAPASSFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGVVAGASGGAILPGIGASLGEV 248 Query: 929 CREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXXXXXXX 1108 CR+YLNGRC K DCK HPPHN S Sbjct: 249 CRDYLNGRCAKVDCKLNHPPHNLLMTALAATTSMGTLSQAPMAPSAAAMAAAQAIVAAQA 308 Query: 1109 XXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSYCGTVV 1288 +DS+ S +KA K D+LKKTLQ+SNLSPLLT +QLKQLF +CGTVV Sbjct: 309 LQAHAAQVQAQSA--KDSAGSPEKASKDDALKKTLQVSNLSPLLTVEQLKQLFGFCGTVV 366 Query: 1289 ECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSSMNQSS 1468 EC+ITDSKHFAYIEYSKPEEA AALALNN+DVGGRP+NVEMAKSLP KP++ NSS+ SS Sbjct: 367 ECAITDSKHFAYIEYSKPEEATAALALNNIDVGGRPLNVEMAKSLPQKPSVANSSLASSS 426 Query: 1469 LPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKKLKADG 1648 LP RAATMKSATE+A+ARAAEISKKL DG Sbjct: 427 LPLMMQQAVAMQQMQFQQALLMQQSMTAQQAATRAATMKSATELAAARAAEISKKLNPDG 486 Query: 1649 VGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1828 VG+ E+ Sbjct: 487 VGS-EEKETKQNSRSSSPPRGRSRSKSRSPISYRRRRRSRSYSPARHSKDHRSRSPLRPH 545 Query: 1829 XXTNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASP 2008 ++Y ERRS+R +R+ +DR R + DR+ DH SAS Sbjct: 546 HYSSYDRERRSYRDIREHSDRYRRRDSDRSLDH---------------------RSSASR 584 Query: 2009 KHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESLSPRTKKSSR 2188 ++R S VSP TRKS SPK HRE+ R +K SR Sbjct: 585 RNRSRS----------------------VSPYTRKS----PVSPKCHRETSPHRGRKQSR 618 Query: 2189 ANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXXXXXXXXXXXXXXXXXXN- 2365 + + + + RRRSRS+S++D H Sbjct: 619 VDSGSPSHRRGSRPSPKIDEKKLRN-RRRSRSRSSDDRLHSSKNEEVLHGKSKRRERRRS 677 Query: 2366 -----DEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKERKKSEKVKLDGR 2530 DEK +R S+SSPR+ ES S +++RS SKS +D S +R + ++ ++ R Sbjct: 678 KSLSVDEK-PHRRSRSSPRKVDESRSRHKKRSSSKSVDDRHDSPERLDENRNRRL----R 732 Query: 2531 NDEKTDRATKELDLSAKDSRDLKEYGTSDPRRK--DTLLEDGSSSDEKYGSNHKRSRLDD 2704 + +K ++ D +D D++E + + + K DT S + K S K D Sbjct: 733 HSDKRHSRSRSTD--NRDQTDVREDESKNEKSKHRDTKRSRSKSVEGKRRSKDKSGENRD 790 Query: 2705 KDSEKHDS------VFKDKNDLMDVS----------EGVKSPSVSARYNDSAPVDDRTHS 2836 K S+ HD +DK+D S E KSP Y+D + Sbjct: 791 KKSKHHDRRRSRSISLEDKHDKGGTSLHINLDERNFELTKSPEGKNHYSDK-------YG 843 Query: 2837 RTKDSSRYEKSTSDRRRHEKIDTT------RRERDTSGMDRAHIGCEDLSRH 2974 + S ++K T + + + D + +E D+ G + G ++ H Sbjct: 844 NRGEKSEHQKKTPSKSKSGQFDGSGPLRGNYKEDDSKGKSPSDSGSAEVKHH 895 >ref|XP_006373080.1| hypothetical protein POPTR_0017s08510g [Populus trichocarpa] gi|550319786|gb|ERP50877.1| hypothetical protein POPTR_0017s08510g [Populus trichocarpa] Length = 924 Score = 496 bits (1277), Expect = e-137 Identities = 341/895 (38%), Positives = 425/895 (47%), Gaps = 23/895 (2%) Frame = +2 Query: 311 TFKDVEMNTVKESISSDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVV 490 TFK V + ++ SDS+ E+ E+ + NKP+GPVDP+KCT ++F+V Sbjct: 26 TFK-VLSDKAEKPADSDSEEEDAEEDLANKPVGPVDPNKCTAAGGGIAGGTACAPATFMV 84 Query: 491 VTKDSDGRKIPNGGAQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHV 670 VTKD+DGRK+PNGGA +KV++ PGVGVGG+EQEG VKD GDGTY VTYVVPKRGNYMV + Sbjct: 85 VTKDADGRKVPNGGAVIKVRVSPGVGVGGTEQEGNVKDMGDGTYTVTYVVPKRGNYMVTI 144 Query: 671 ECDGKAIMGSPFPVFFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLL 850 EC+GKAIMGSPFPVFFSA VNQTMPNMPNYS ++SGAFP LL Sbjct: 145 ECNGKAIMGSPFPVFFSAGTSTGGLLGMAPTTTFPNLVNQTMPNMPNYSANISGAFPALL 204 Query: 851 GMIPXXXXXXXXXXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXX 1030 GM P E+CREYL GRC KTDCK HPP + Sbjct: 205 GMTPGITSSASGGAILPGAGASLGEVCREYLYGRCAKTDCKLSHPPQSLLMTLLAPTTSM 264 Query: 1031 XXXSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKT 1210 S +DSS S DKA K D+LKKT Sbjct: 265 GTLSQVPMAPSAAAMAAAQAIVAAKALQAHAAQLQAQARSAKDSSGSPDKARKEDALKKT 324 Query: 1211 LQISNLSPLLTTDQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGG 1390 L +SNLSPLLT +QLKQLFS+CGTVVEC+I DSKH AYIEYSKPEEA AALALNNMDVGG Sbjct: 325 LHVSNLSPLLTVEQLKQLFSFCGTVVECTIADSKHSAYIEYSKPEEATAALALNNMDVGG 384 Query: 1391 RPMNVEMAKSLPPKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNR 1570 RP+NVE AKSLP KP ILNSS SSLP N+ Sbjct: 385 RPLNVETAKSLPQKP-ILNSSFASSSLPMMMQQAVAMQQMQFQQALLMQQTMTAQQAANK 443 Query: 1571 AATMKSATEMASARAAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXX 1750 AATMKSATE+A+ARAAEISKKLK DG+ E T Sbjct: 444 AATMKSATELAAARAAEISKKLKDDGLVTGEGETKAESKSPPPPRARSRSKSRSPINYRR 503 Query: 1751 XXXXXXXXXXXXXXXXXXXXXXXXXXXXTNYGSERRSH--RQVRDSNDRSGRWERDRTRD 1924 + Y ERRS+ R RD DR+ R E DR+RD Sbjct: 504 RMRSPSYSPPSRHNRDRRSRSPVRFRYHSRYNYERRSYRDRDSRDDGDRTRRRELDRSRD 563 Query: 1925 HY-XXXXXXXXXXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSP 2101 H+ TRKS RA S SPKHR+ES + Sbjct: 564 HHSPVSRRNRSRSASPRTRKSYRADSGSPKHRQES----------------------SAH 601 Query: 2102 RTRKSSRAGSGSPKFHRES-LSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRS 2278 R+RK+S +GS SP+ H S SPR S+ YRRRS Sbjct: 602 RSRKASDSGSRSPRHHGGSRSSPRNNPDSKL-----------------------RYRRRS 638 Query: 2279 RSKS-AEDEAHXXXXXXXXXXXXXXXXXXNDEKTD--NRGSKSSPRRAHESVSHYRRRSR 2449 RS+S + +EA+ + + G + SPR ++E S +R RSR Sbjct: 639 RSRSKSVEEANEKVDEIREKKSKQHERRSRSLSVELKHHGRRPSPRSSNEDDSKHRSRSR 698 Query: 2450 SKSAEDEPRSDD--------------RKERKKS--EKVKLDGRNDEKTDRATKELDLSAK 2581 SKS E + S++ R+ R KS ++ R +E D+ TK D Sbjct: 699 SKSVEVKRHSNEKVDKTGDGKLKHRHRRSRSKSVDDRHHYKERGNETRDKKTKHQDRGR- 757 Query: 2582 DSRDLKEYGTSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDV 2761 SR + G R DGS S + H RS + V ++K++ Sbjct: 758 -SRSITAEGKHHRSRSSPRGRDGSKSKHR---RHSRSISPEGKRRSSHRVDQNKDEKSKH 813 Query: 2762 SEGVKSPSVSARYNDSAPVDDRTHSRTKDSSRYEKSTSDRRRHEKIDTTRRERDT 2926 +SPS ++ S + S+ + R + + R +++ D R E +T Sbjct: 814 RHRRRSPSAEGKHGRSPRSSEENKSKHRRRPRSKSAERKRHSNDEKDIRRGENET 868 >ref|XP_002300152.2| RNA recognition motif-containing family protein [Populus trichocarpa] gi|550348720|gb|EEE84957.2| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 918 Score = 487 bits (1254), Expect = e-134 Identities = 359/970 (37%), Positives = 451/970 (46%), Gaps = 18/970 (1%) Frame = +2 Query: 194 SVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESISSDSDTE 373 +V +T+ A+PIWMKQ TFK V + ++++ SDS+ E Sbjct: 13 AVTTTNTSAAKPIWMKQAEEAKLKSEAENTAAAKAAFDATFK-VLSDKAEKAVDSDSEEE 71 Query: 374 ENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKI 553 + E + NKP+GPVDP KCT ++FVVVTKD+DGRK+PNGGA ++V++ Sbjct: 72 DAEKDLANKPVGPVDPGKCTAAGAGIAGGTACAPATFVVVTKDADGRKVPNGGAVIRVRV 131 Query: 554 CPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAXXX 733 PGVGVGG+EQEG VKD GDGTY VTYVVPKRGNYMV +EC+GKAIMGSPFPVFFSA Sbjct: 132 SPGVGVGGTEQEGAVKDMGDGTYTVTYVVPKRGNYMVTIECNGKAIMGSPFPVFFSAGTS 191 Query: 734 XXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXX 913 VNQTMPNMPNYS SVSGAFP LGM P Sbjct: 192 TGGLLGMAPTTTFPNLVNQTMPNMPNYSASVSGAFPAFLGMTPGIASGASGGAILPGVGA 251 Query: 914 XXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXX 1093 E+CREYL GRC K DCK HPPH+ S Sbjct: 252 SLGEVCREYLYGRCAKMDCKLGHPPHSLLMTLLAPTTTMGTLSHAPMAPSAAAMAAAQAI 311 Query: 1094 XXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSY 1273 +DSS S DKA K D+LKKTL +SNLSPLLT +QLKQLFS+ Sbjct: 312 VAAKALQAHAAQVQAQAQSAKDSSGSPDKARKEDALKKTLHVSNLSPLLTVEQLKQLFSF 371 Query: 1274 CGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSS 1453 CGTVVEC+I DSKH AYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP KP +LNSS Sbjct: 372 CGTVVECAIADSKHSAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLPQKP-LLNSS 430 Query: 1454 MNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKK 1633 + SSLP N+AA+MKSATE+A+ARAAEISKK Sbjct: 431 LASSSLPMMMQQAVAMQQMQFQQALIMQQTMTAQQAANKAASMKSATELAAARAAEISKK 490 Query: 1634 LKADG--VGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1807 LKADG +G EE Sbjct: 491 LKADGFVIGEEETKAETKSPSPPQARSRSKSRSPINYQRRLRSPSYSPPSRRNRDRRSRS 550 Query: 1808 XXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRD-HYXXXXXXXXXXXXXXTRKS 1984 NYG RRS+R RD DR + DR+R H TRKS Sbjct: 551 PFRFRYHSRYNYG--RRSYRDSRDIVDRMRMQDSDRSRGRHSPVSRRSRSRSASPRTRKS 608 Query: 1985 SRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESLS 2164 R S SPK R ES + R+RK++ +GS SP+ H Sbjct: 609 YRDDSGSPKRRLES----------------------SAQRSRKAADSGSRSPRSH----- 641 Query: 2165 PRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRR--RSRSKSAEDEAHXXXXXXXXXX 2338 ++ SR N ++ Y+R RSRSKS E+ Sbjct: 642 -GGRRLSRRNIT----------------DSKLRYKRHSRSRSKSVEESNDRVNEIQDKKS 684 Query: 2339 XXXXXXXXN-DEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKERKKSEKV 2515 + + + G + S R + E S++R RSRSKS E + S ++ + + Sbjct: 685 KQHERRSRSLSVELKHHGRRPSHRSSDEDESNHRSRSRSKSVEVKRHSYEKVGKTE---- 740 Query: 2516 KLDGRNDEKTDRATKELDLSAKDSRDLKEYGTSDPRRKDTLLED--GSSSDEKYGSNHKR 2689 DGR + R+ + S D KE G ++ R K T D S S ++H+R Sbjct: 741 --DGRLKHRDRRSRSK---SVDDRHCYKERG-NESRDKKTKHRDRVQSRSISAESNHHRR 794 Query: 2690 SRLDDKDSEKHDSVFKDKNDLMDVS-EG---------VKSPSVSARYNDSAPVDDRTHSR 2839 SR K ++ S K + +S EG KS S R + SA + H R Sbjct: 795 SRSSPKGRDESKS--KHRRHSRPISPEGKRRSNHRIDEKSKHCSRRRSVSA---EGKHIR 849 Query: 2840 TKDSSRYEKSTSDRRRHEKIDTTRRERDTSGMDRAHIGCEDLSRHGRLTSENKKHEKVES 3019 + SS E++ S RRRH R S + H E++ R T +K E Sbjct: 850 SPRSS--EENKSKRRRH--------SRSKSAEHKRHSNDEEIKREENETRHEHTSDKTED 899 Query: 3020 IHREKDHLDD 3049 + +++ D Sbjct: 900 ANEDENSFTD 909 >ref|XP_007034241.1| RNA recognition motif-containing protein isoform 2 [Theobroma cacao] gi|508713270|gb|EOY05167.1| RNA recognition motif-containing protein isoform 2 [Theobroma cacao] Length = 890 Score = 481 bits (1239), Expect = e-133 Identities = 270/499 (54%), Positives = 308/499 (61%), Gaps = 2/499 (0%) Frame = +2 Query: 182 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKE--SIS 355 MADR+ STA ++PIWMKQ TFKDV+ N K+ + S Sbjct: 1 MADRN----STA-SKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFKDVDKNRNKDVAAAS 55 Query: 356 SDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGA 535 SDS++E+ DLV NKP+GPVDP+KC S+F+VVTKD+DGRK+ +GGA Sbjct: 56 SDSESEDTSDLV-NKPIGPVDPAKCMAAGPGIAGGTACAASTFMVVTKDADGRKVQSGGA 114 Query: 536 QLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVF 715 Q+KVK+ PGVGVGGSEQEG+VKD GDGTY VTYVVPKRGNYMV++EC+GK IMGSPFPVF Sbjct: 115 QIKVKVSPGVGVGGSEQEGIVKDMGDGTYTVTYVVPKRGNYMVNIECNGKPIMGSPFPVF 174 Query: 716 FSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXX 895 FSA VNQTMPNMPNY+GSVSGAFPGLLGMIP Sbjct: 175 FSAGTSTGGLLGVAPASTYPNLVNQTMPNMPNYTGSVSGAFPGLLGMIPGIVSGASGGAI 234 Query: 896 XXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXX 1075 E+CREYLNGRC KTDCK HPPHN S Sbjct: 235 LPGMGASLGEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTSMGTLSQVPMAPSAAAM 294 Query: 1076 XXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQL 1255 +DSSDS DKAGKAD+LKKTLQ+SNLSPLLT +QL Sbjct: 295 AAAQAIVAAQALQAHAAQVQAQAQSTKDSSDSPDKAGKADALKKTLQVSNLSPLLTAEQL 354 Query: 1256 KQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKP 1435 KQLFS+CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNNMD+GGRP+NVEMAKSLP KP Sbjct: 355 KQLFSFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNMDIGGRPLNVEMAKSLPQKP 414 Query: 1436 AILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARA 1615 A+ SS+ SSLP NRAA+MKSATE+A+ARA Sbjct: 415 AV--SSLASSSLPMMMQQAVAMQQMQFQQALLMQQTLTAQQAANRAASMKSATELAAARA 472 Query: 1616 AEISKKLKADGVGNEEKGT 1672 AEISKKLKADG+ EEK T Sbjct: 473 AEISKKLKADGLVTEEKET 491 Score = 86.3 bits (212), Expect = 1e-13 Identities = 102/371 (27%), Positives = 142/371 (38%), Gaps = 6/371 (1%) Frame = +2 Query: 1835 TNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASPKH 2014 + Y SERRS+R RD DRS R + DR+RD R S S ++ Sbjct: 574 SRYDSERRSYRD-RDDIDRSKRRDLDRSRD---------------------RRSSVSRRN 611 Query: 2015 RRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESLSPRTKKSSRAN 2194 R S +SP+TRKS S SPK RES SPR +KSS + Sbjct: 612 RSRS----------------------ISPQTRKSPPVDSDSPKNSRES-SPRVRKSSHPD 648 Query: 2195 XXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAE-DEAHXXXXXXXXXXXXXXXXXXNDE 2371 E YR+RSRSKS + D+ Sbjct: 649 SRSPRHHRRSRSSPKNDDERKLKYRKRSRSKSVDSDKKRDQIQGEKSKHRSRRRSRSLSL 708 Query: 2372 KTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDR-KERKKSEKVKLDGRNDEKTD 2548 + ++RG S + E+ +RRRSRS S E + RS+ + E KK E D R Sbjct: 709 EGEHRGRSRSSASSDENKLKHRRRSRSVSVERKVRSNSKIDEMKKDESRHSDRRRSRSGS 768 Query: 2549 RATKELDLSAKDSRDLKEYGTSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSE---- 2716 + D K+ D RR S S G +H+ SRL ++S+ Sbjct: 769 AEGRHYTKERSDRSRDKKSKHRDRRR--------SRSRSAEGKHHRESRLFPRNSDGNKI 820 Query: 2717 KHDSVFKDKNDLMDVSEGVKSPSVSARYNDSAPVDDRTHSRTKDSSRYEKSTSDRRRHEK 2896 KH + + K+ +EG S + ++ + DR H + + S S R E Sbjct: 821 KHRRLSRSKS-----TEG--KHRSSDKIDERSKRHDRKHLSSAECRHPRGSRSSPRSSED 873 Query: 2897 IDTTRRERDTS 2929 D+ RR R S Sbjct: 874 NDSRRRRRSRS 884 >ref|XP_007034240.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|590656339|ref|XP_007034242.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|590656342|ref|XP_007034243.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|590656345|ref|XP_007034244.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|590656349|ref|XP_007034245.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|590656352|ref|XP_007034246.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713269|gb|EOY05166.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713271|gb|EOY05168.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713272|gb|EOY05169.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713273|gb|EOY05170.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713274|gb|EOY05171.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] gi|508713275|gb|EOY05172.1| RNA recognition motif-containing protein isoform 1 [Theobroma cacao] Length = 965 Score = 481 bits (1239), Expect = e-133 Identities = 270/499 (54%), Positives = 308/499 (61%), Gaps = 2/499 (0%) Frame = +2 Query: 182 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKE--SIS 355 MADR+ STA ++PIWMKQ TFKDV+ N K+ + S Sbjct: 1 MADRN----STA-SKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFKDVDKNRNKDVAAAS 55 Query: 356 SDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGA 535 SDS++E+ DLV NKP+GPVDP+KC S+F+VVTKD+DGRK+ +GGA Sbjct: 56 SDSESEDTSDLV-NKPIGPVDPAKCMAAGPGIAGGTACAASTFMVVTKDADGRKVQSGGA 114 Query: 536 QLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVF 715 Q+KVK+ PGVGVGGSEQEG+VKD GDGTY VTYVVPKRGNYMV++EC+GK IMGSPFPVF Sbjct: 115 QIKVKVSPGVGVGGSEQEGIVKDMGDGTYTVTYVVPKRGNYMVNIECNGKPIMGSPFPVF 174 Query: 716 FSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXX 895 FSA VNQTMPNMPNY+GSVSGAFPGLLGMIP Sbjct: 175 FSAGTSTGGLLGVAPASTYPNLVNQTMPNMPNYTGSVSGAFPGLLGMIPGIVSGASGGAI 234 Query: 896 XXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXX 1075 E+CREYLNGRC KTDCK HPPHN S Sbjct: 235 LPGMGASLGEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTSMGTLSQVPMAPSAAAM 294 Query: 1076 XXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQL 1255 +DSSDS DKAGKAD+LKKTLQ+SNLSPLLT +QL Sbjct: 295 AAAQAIVAAQALQAHAAQVQAQAQSTKDSSDSPDKAGKADALKKTLQVSNLSPLLTAEQL 354 Query: 1256 KQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKP 1435 KQLFS+CGTVVEC+ITDSKHFAYIEYSKPEEA AALALNNMD+GGRP+NVEMAKSLP KP Sbjct: 355 KQLFSFCGTVVECTITDSKHFAYIEYSKPEEATAALALNNMDIGGRPLNVEMAKSLPQKP 414 Query: 1436 AILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARA 1615 A+ SS+ SSLP NRAA+MKSATE+A+ARA Sbjct: 415 AV--SSLASSSLPMMMQQAVAMQQMQFQQALLMQQTLTAQQAANRAASMKSATELAAARA 472 Query: 1616 AEISKKLKADGVGNEEKGT 1672 AEISKKLKADG+ EEK T Sbjct: 473 AEISKKLKADGLVTEEKET 491 Score = 86.3 bits (212), Expect = 1e-13 Identities = 102/371 (27%), Positives = 142/371 (38%), Gaps = 6/371 (1%) Frame = +2 Query: 1835 TNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASPKH 2014 + Y SERRS+R RD DRS R + DR+RD R S S ++ Sbjct: 574 SRYDSERRSYRD-RDDIDRSKRRDLDRSRD---------------------RRSSVSRRN 611 Query: 2015 RRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESLSPRTKKSSRAN 2194 R S +SP+TRKS S SPK RES SPR +KSS + Sbjct: 612 RSRS----------------------ISPQTRKSPPVDSDSPKNSRES-SPRVRKSSHPD 648 Query: 2195 XXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAE-DEAHXXXXXXXXXXXXXXXXXXNDE 2371 E YR+RSRSKS + D+ Sbjct: 649 SRSPRHHRRSRSSPKNDDERKLKYRKRSRSKSVDSDKKRDQIQGEKSKHRSRRRSRSLSL 708 Query: 2372 KTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDR-KERKKSEKVKLDGRNDEKTD 2548 + ++RG S + E+ +RRRSRS S E + RS+ + E KK E D R Sbjct: 709 EGEHRGRSRSSASSDENKLKHRRRSRSVSVERKVRSNSKIDEMKKDESRHSDRRRSRSGS 768 Query: 2549 RATKELDLSAKDSRDLKEYGTSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSE---- 2716 + D K+ D RR S S G +H+ SRL ++S+ Sbjct: 769 AEGRHYTKERSDRSRDKKSKHRDRRR--------SRSRSAEGKHHRESRLFPRNSDGNKI 820 Query: 2717 KHDSVFKDKNDLMDVSEGVKSPSVSARYNDSAPVDDRTHSRTKDSSRYEKSTSDRRRHEK 2896 KH + + K+ +EG S + ++ + DR H + + S S R E Sbjct: 821 KHRRLSRSKS-----TEG--KHRSSDKIDERSKRHDRKHLSSAECRHPRGSRSSPRSSED 873 Query: 2897 IDTTRRERDTS 2929 D+ RR R S Sbjct: 874 NDSRRRRRSRS 884 >ref|XP_002518040.1| conserved hypothetical protein [Ricinus communis] gi|223542636|gb|EEF44173.1| conserved hypothetical protein [Ricinus communis] Length = 946 Score = 473 bits (1217), Expect = e-130 Identities = 261/493 (52%), Positives = 297/493 (60%) Frame = +2 Query: 194 SVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESISSDSDTE 373 S + A +PIWMKQ TFK + N +++ SDS+ E Sbjct: 13 SSSTAAAAAPKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFKTLTTNKPEKASDSDSEGE 72 Query: 374 ENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKI 553 E+E+ + NKP+GPVDP+KCT S+F+V TKDSDGRK+ +GGAQ+KVK+ Sbjct: 73 ESEEYLANKPVGPVDPTKCTAVGAGIAGGTACAPSTFMVATKDSDGRKVMHGGAQIKVKV 132 Query: 554 CPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAXXX 733 PGVGVGG+EQEG+VKD GDG+Y VTYVVPKRGNYMV++EC+GK IMGSPFPVFFSA Sbjct: 133 SPGVGVGGTEQEGIVKDMGDGSYTVTYVVPKRGNYMVNIECNGKPIMGSPFPVFFSAGTS 192 Query: 734 XXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXX 913 VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 193 TGGLLGMAPASTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGIVSGASGGAVLPGIGA 252 Query: 914 XXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXX 1093 E+CREYLNGRC KTDCK HPPHN S Sbjct: 253 SLGEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTSMGTLSQVPMAPSAAAMAAAQAI 312 Query: 1094 XXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSY 1273 +DSS S DKAGK D+LKKTLQ+SNLSPLLT DQLKQLFSY Sbjct: 313 VAAQALQAHAAQVQAQAQSAKDSSGSPDKAGKEDTLKKTLQVSNLSPLLTVDQLKQLFSY 372 Query: 1274 CGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSS 1453 G+VVECSITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP K ++LNSS Sbjct: 373 FGSVVECSITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLPQK-SLLNSS 431 Query: 1454 MNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKK 1633 + SSLP NRAATMKSATE+A+ARAAEISKK Sbjct: 432 VASSSLPLMMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKSATELAAARAAEISKK 491 Query: 1634 LKADGVGNEEKGT 1672 LKADG +EEK T Sbjct: 492 LKADGFVDEEKET 504 Score = 82.8 bits (203), Expect = 1e-12 Identities = 92/334 (27%), Positives = 136/334 (40%), Gaps = 11/334 (3%) Frame = +2 Query: 2090 NVSPRTRKSSRAGSGSPKFHRESLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYR 2269 +VSPR ++S RA SGSPK RES R +KSS +N YR Sbjct: 598 SVSPRMKRSYRADSGSPKRRRESSPRRARKSSHGGSRSPRHHRGSRSSPRNDSDNKLKYR 657 Query: 2270 RRSRSKSAED---EAHXXXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRR 2440 +RSRSKS ED +A +EK N SKSS R E+ +R Sbjct: 658 KRSRSKSVEDSKEKAKEAQDEKFKKQERRSRSLSVEEK--NNVSKSSSRSIDENEPKHRG 715 Query: 2441 RSRSKSAEDEPRSDDRKERKKSEKVK--LDGR--NDEKTDRATKELDLSAKDSRDLKEYG 2608 RSRSKS E R+ +EKV DGR N ++ +K +++ E Sbjct: 716 RSRSKSVE---------ARRSTEKVNETRDGRLKNRDRKRSRSKSVEVRRHSREKGNESR 766 Query: 2609 TSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDVSEGVKSPSV 2788 + +D S+D G +H+ SR + ++ K K+ S +S + Sbjct: 767 DKKSKHRDRKRSRSISAD---GKHHRGSRSSPRVADD----IKSKHRRHSRSRSPESKKL 819 Query: 2789 SARYNDSAPVD-DRTHSRTKDSSRYEKSTSDRRRHE--KIDTTRRERDTSGMDRAHIG-C 2956 S+ D V+ + SR + S K R E K RR R S + H Sbjct: 820 SSYRMDGTGVEKSKRRSRRRSMSAEGKHCRSPRSSEENKSKHKRRSRSRSAEGKHHSSDI 879 Query: 2957 EDLSRHGRLTSENKKHEKVESIHREKDHLDDDST 3058 +++ R L EN + E++ ++D + D+T Sbjct: 880 KNIKRAENLVHENCVSHETENVTEDQDSVVGDAT 913 >gb|EYU31494.1| hypothetical protein MIMGU_mgv1a001194mg [Mimulus guttatus] Length = 869 Score = 463 bits (1191), Expect = e-127 Identities = 337/941 (35%), Positives = 430/941 (45%), Gaps = 27/941 (2%) Frame = +2 Query: 182 MADRSV-PAVSTALA-----RPIWMKQXXXXXXXXXXXXXXXXXXXXXXTF----KDVEM 331 MADR V A S+ LA +PIWMKQ TF + + + Sbjct: 1 MADRPVNTATSSNLAVAPAPKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFNAQPQQLAL 60 Query: 332 NTVKESISSDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDG 511 ++ ES SSDSD+++ + + +GPVDPSKCT ++F+VVTKD+DG Sbjct: 61 PSIAESSSSDSDSDDERSSERERSVGPVDPSKCTAQGAGIAGGTACAGATFMVVTKDADG 120 Query: 512 RKIPNGGAQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAI 691 K+ GGAQ++V++ PGVGVGG+EQEG+VKD GDG+Y VTYVVPKRGNYMV+VEC+GK I Sbjct: 121 GKVVRGGAQVRVRVSPGVGVGGTEQEGVVKDMGDGSYSVTYVVPKRGNYMVNVECNGKPI 180 Query: 692 MGSPFPVFFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXX 871 MGSPFPVFFSA +NQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 181 MGSPFPVFFSAGTPTGGLLGIAPPASYPNLINQTMPNMPNYSGSVSGAFPGLLGMIPGVV 240 Query: 872 XXXXXXXXXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXX 1051 E+CREYLNGRC TDCKF HPPHN S Sbjct: 241 NGASGGVVLPGMGSSLGEMCREYLNGRCASTDCKFNHPPHNLLMTAIAATTTMGTLS--- 297 Query: 1052 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLS 1231 +DS + ADSLKK +Q+SNLS Sbjct: 298 --QVPMAPSAAAMAAAQAIVAAQALQAHAQAQSNKDSYGLGNSERNADSLKKMVQVSNLS 355 Query: 1232 PLLTTDQLKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEM 1411 PLLT DQLKQLF +CGTVVEC ITDSKHFAYIEY K EEA +ALALNNMDVGGRP+NVEM Sbjct: 356 PLLTVDQLKQLFGFCGTVVECIITDSKHFAYIEYLKAEEATSALALNNMDVGGRPLNVEM 415 Query: 1412 AKSLPPKPAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSA 1591 AKSLPP+P ILNS + SSLP NRAATMKSA Sbjct: 416 AKSLPPRP-ILNSPLGSSSLPMVMQQAVAMQQMQFQQALLMQQTLTAQQAANRAATMKSA 474 Query: 1592 TEMASARAAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1771 T++A+ARAAEISKKL+ADG+ E K + Sbjct: 475 TDLAAARAAEISKKLQADGLVIEVKDSDRISRSPSPTRAKSKSRSRSKSVSPIKYRPRRR 534 Query: 1772 XXXXXXXXXXXXXXXXXXXXXTNYGSERRSHRQVRDSN---DRSGRWERDRTRDHY-XXX 1939 + S + R R+S DR+ R + R+ D+ Sbjct: 535 SRSYSPPRRNRDYRSRSPVRSRYHSSYEKERRYYRESRDVIDRNRRRDIGRSHDNVSPVS 594 Query: 1940 XXXXXXXXXXXTRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSS 2119 T++S + S + + RRES +TRKSS Sbjct: 595 RRKRSRSLSPRTKRSRKDDSGTSRRRRES----------------------PIEKTRKSS 632 Query: 2120 RAGSGSPKFHRESLSPRTKKSSRANXXXXXXXXXXXXXXXXT---------HENVSHYRR 2272 R S SP+ HR S ++ R+ + ++ S RR Sbjct: 633 RPDSRSPQRHRRRSSSSGDEADRSKQHRNHSLSRSDEVKHHSSDKKDSMKEEKSKSRNRR 692 Query: 2273 RSRSKSAEDEAHXXXXXXXXXXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRS 2452 RSRS S ED N PR ES S ++RRSRS Sbjct: 693 RSRSNSVEDR--------------------------NGRRSPPPRVVEESKSRHKRRSRS 726 Query: 2453 KSAEDEPRSDDRKERKKSEKVKLDGRNDEKTDRATKELDLSAKDSRDLKEYGTSDPRRKD 2632 +S D+ +S ++ ER + +K RN +K R ++ K R G+ RR+ Sbjct: 727 RSPVDKHQSSEKYERSREDK----SRNRDK--RRSRSRSTDGKHRR-----GSKASRRR- 774 Query: 2633 TLLEDGSSSDEKYGSNHKRSRLDDKDSEKHDSVFKDKNDLMDVSEGVKSPSVSAR----Y 2800 SDE + KRSR D ++ D +G +SPS SA Sbjct: 775 --------SDEHKSKHRKRSR------SNKDEASLERPDEHKSKDGKRSPSNSAENDNDL 820 Query: 2801 NDSAPVDDRTHSRTKDSSRYEKSTSDRRRHEKIDTTRRERD 2923 ND PV+ + + ++ R TSD ++++ + RD Sbjct: 821 NDFIPVESKDKALETENDR--SVTSDDVYVDRLNESMHVRD 859 >ref|XP_002885603.1| hypothetical protein ARALYDRAFT_898933 [Arabidopsis lyrata subsp. lyrata] gi|297331443|gb|EFH61862.1| hypothetical protein ARALYDRAFT_898933 [Arabidopsis lyrata subsp. lyrata] Length = 985 Score = 462 bits (1190), Expect = e-127 Identities = 342/990 (34%), Positives = 441/990 (44%), Gaps = 34/990 (3%) Frame = +2 Query: 209 STALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNT------------VKESI 352 S A +P WMK TFK V+ T ES Sbjct: 7 SAAAGKPFWMKHAEDAKIKDEGEKDAAAKAAFEATFKGVDQTTHLIEAVAPAPESAPESD 66 Query: 353 SSDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGG 532 S D ++ D + KP+GPVDPSK T S+FVVVTKDSDGRK+PNGG Sbjct: 67 SDSDDDDDESDYLSRKPIGPVDPSKSTASGAGIGGGTACVPSTFVVVTKDSDGRKVPNGG 126 Query: 533 AQLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPV 712 A ++V++CPGVGVGG++QEG+VKD GDG+Y VTYVVPKRGNYMV++EC+G AIMGSPFPV Sbjct: 127 ALIRVRVCPGVGVGGTDQEGVVKDVGDGSYAVTYVVPKRGNYMVNIECNGSAIMGSPFPV 186 Query: 713 FFSAXXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXX 892 FFS +NQTMPNMPNY+GSVSGAFPGLLGM+P Sbjct: 187 FFS-QGSSSTGLMGSAPASYSNLINQTMPNMPNYTGSVSGAFPGLLGMVPGIASGPSGGA 245 Query: 893 XXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXX 1072 E+CREYLNGRCV + CK HPP N S Sbjct: 246 ILPGVGASLGEVCREYLNGRCVNSMCKLNHPPQNLLMTAIAATTSMGNMSQVPMAPSAAA 305 Query: 1073 XXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQ 1252 + S S +K DSLKK LQ+SNLSP LTT+Q Sbjct: 306 MAAAQAIVAAQALQAHASQMQAQAQSNKGSLGSPEKGENGDSLKKFLQVSNLSPSLTTEQ 365 Query: 1253 LKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPK 1432 L+QLFS+CGTVV+CSITDSKH AYIEYS EEA AALALNN +V GRP+NVE+AKSLP K Sbjct: 366 LRQLFSFCGTVVDCSITDSKHLAYIEYSNSEEATAALALNNTEVFGRPLNVEIAKSLPHK 425 Query: 1433 PAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASAR 1612 P+ NSS SSLP NRAATMKSATE+A+AR Sbjct: 426 PSSNNSS---SSLPLMMQQAVAMQQMQFQQAILMQQAVATQQAANRAATMKSATELAAAR 482 Query: 1613 AAEISKKLKADGVGNEEKGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1792 AAEIS+KL+ DGVGN+ K Sbjct: 483 AAEISRKLRPDGVGNDVKEADQKSRSPSKSPGSRSKSKSPISYRRRRRSRSYSPPFRRPR 542 Query: 1793 XXXXXXXXXXXXXXTNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXX 1972 T Y RRS+R RD ++ S R+ R D + Sbjct: 543 SHRSRSPLRYHRRST-YEGRRRSYRDSRDISE-SRRYGRS---DEHHSSSSRRSRSVSPK 597 Query: 1973 TRKSSRAGSASPKHRRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHR 2152 RKS + +HRR+S S +KSSRAGS SP+ + Sbjct: 598 KRKSGQEDLELSRHRRDS----------------------SSRGDKKSSRAGSRSPRRRK 635 Query: 2153 ESLSPRTKKSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXXXXXXXX 2332 E K + R + EN R RSRS+S ED A Sbjct: 636 E-----VKSTPRDD-----------------EENKLKRRTRSRSRSVEDSADMKDESRDE 673 Query: 2333 XXXXXXXXXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKERKKSEK 2512 + + D ++ + R + E+ +RRRSRSKS E++ SD+ + Sbjct: 674 ELKHHKKRSRSRSREDRSKTRDASRNSDETKRKHRRRSRSKSLENDNGSDENVD------ 727 Query: 2513 VKLDG-RNDEKTDRATKELDLSAKDSRDLKE-YGTSDPRRKDTLLEDGSSS----DEKYG 2674 V DG N + R + LD + D+KE G S R +T + D G Sbjct: 728 VAQDGDLNSRHSRRRSNSLD----EDYDMKERRGRSRSRSLETKNRSSGKNKLDEDRNTG 783 Query: 2675 SNHKRSR---LDDKDSEKHDSVFKDKNDLMDVSEGVKSPSVSAR----------YNDSAP 2815 S +RSR ++ K S ++ +DK +SPS + Y+D Sbjct: 784 SRRRRSRSKSVEGKRSYNKETRSRDKKSKRRSGRRSRSPSSEGKQGRDIRSSPGYSDEKK 843 Query: 2816 VDDRTHSRTKDSSRYEKSTSDR-RRHEKIDTTRRERDTSGMDR--AHIGCEDLSRHGRLT 2986 + HSR++ + + S R +RHE++ ++ D DR + + ED R + Sbjct: 844 SRHKQHSRSRSTEKKNSSREKRSKRHERLRSSSPVGDKRRGDRSLSPVSSEDHKIKKRHS 903 Query: 2987 SENKKHEKVESIHREKDHLDDDSTYNREGR 3076 EK S + + D D +S ++R R Sbjct: 904 GSKSVKEKPRSDYEKVDDGDANSDFSRPER 933 >ref|XP_004296963.1| PREDICTED: uncharacterized protein LOC101297633 [Fragaria vesca subsp. vesca] Length = 1040 Score = 460 bits (1183), Expect = e-126 Identities = 266/500 (53%), Positives = 299/500 (59%), Gaps = 3/500 (0%) Frame = +2 Query: 182 MADRSVPAVSTALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKESIS-- 355 MADR++ A+ +PIWMKQ TFKDV+ + K + + Sbjct: 1 MADRNL-----AVVKPIWMKQAEEARVKSEAEKDAAAKAAFEATFKDVDKSKEKGAAAAG 55 Query: 356 SDSDTEENEDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGA 535 SDS++EE E+L NKP+GPVD +KCT SSF V TKDSDGRK+PNGGA Sbjct: 56 SDSESEEAENLA-NKPIGPVDATKCTAAGAGIAGGTACAPSSFTVATKDSDGRKVPNGGA 114 Query: 536 QLKVKICPGVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVF 715 Q+KVKI PG+GVGGSEQEGMVKD GDGTY VTYVVPKRGNYMV +EC+G+AIMGSPFPVF Sbjct: 115 QIKVKIMPGLGVGGSEQEGMVKDMGDGTYTVTYVVPKRGNYMVTIECNGRAIMGSPFPVF 174 Query: 716 FSA-XXXXXXXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXX 892 FSA VNQTMPNMPNYSGSVSGAFPGLLGMIP Sbjct: 175 FSAGSTSTGGLLGLAPTSTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGIVPGALGGA 234 Query: 893 XXXXXXXXXXEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXX 1072 E+CREYLNGRC K DCK HPPH S Sbjct: 235 ILPGIGASLGEVCREYLNGRCAKADCKLNHPPHQLLMTALAATTNMGNVS-QVPMAPSAA 293 Query: 1073 XXXXXXXXXXXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQ 1252 +DSS S DKAGKAD LK+TLQ+SNLSPLLT +Q Sbjct: 294 AMAAAQAIVAAQALQAHAAQHAQAQSNKDSSGSPDKAGKADVLKRTLQVSNLSPLLTVEQ 353 Query: 1253 LKQLFSYCGTVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPK 1432 LKQLFS+CGTVVEC+ITDSKHFAYIEY+KPEEA AALALN+MDVGGRP+NVEMAKSLP K Sbjct: 354 LKQLFSFCGTVVECTITDSKHFAYIEYTKPEEATAALALNSMDVGGRPLNVEMAKSLPQK 413 Query: 1433 PAILNSSMNQSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASAR 1612 A +NS M SSLP NRAATMK+ATE+A+AR Sbjct: 414 SA-MNSQMASSSLPMVMQQAVAMQQMQFQQALLMQQTMTAQQAANRAATMKTATELAAAR 472 Query: 1613 AAEISKKLKADGVGNEEKGT 1672 AAEISKKLKADGV EE T Sbjct: 473 AAEISKKLKADGVEIEETET 492 Score = 92.4 bits (228), Expect = 2e-15 Identities = 110/428 (25%), Positives = 162/428 (37%), Gaps = 17/428 (3%) Frame = +2 Query: 1841 YGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASPKHRR 2020 Y + RRS+R R+ +DR R R D + RKS R S+SPKHRR Sbjct: 548 YDNGRRSYRDFRNGSDRGRR----RDSDDFQSYASRRRSRSVSPRRKSHRVDSSSPKHRR 603 Query: 2021 ESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRES-LSPR-------TK 2176 E SP R+++R S SP + S LSPR + Sbjct: 604 ER-----------------------SP--RRATRDASRSPGHYTGSKLSPRGDEDKSKHR 638 Query: 2177 KSSRANXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXXXXXXXXXXXXXXXX 2356 K SR+N + RRRSRS S E + H Sbjct: 639 KRSRSNSPEDKHLLNDKKDETRYETSKHRERRRSRSVSVEGKHHR--------------- 683 Query: 2357 XXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPR-SDDRKERKKSEKVKLDGRN 2533 +SSPR E+ S +RRRSRSKS E +PR +D+ ++RK + R+ Sbjct: 684 -----------KRSSPRSLDENKSKHRRRSRSKSVEVKPRGADETRDRKLKHRSGRQSRS 732 Query: 2534 DEKTDRATKELDLSAKDSRDLKEYGTSDPRRKDTLLEDGSSSDEKYGSNHKRSRLDDKDS 2713 K+ + + D + RD K R + +E S E G K+S+ D+ Sbjct: 733 --KSLESKRHSDEKTSELRDEKSKHRDRRRSRSKSVEGRRHSKEVDGGRDKKSKRRDRRQ 790 Query: 2714 EKHDSVFKDKNDLMDVSEGVKSP-------SVSARYNDSAPVDDRTHSRTK-DSSRYEKS 2869 + S +D + G SP S R + S + + H + D SR EK+ Sbjct: 791 SRSRSRSSSLEPKLD-TRGESSPRRLDEHKSKHRRRSRSKSAEGKQHLNDRADKSRNEKA 849 Query: 2870 TSDRRRHEKIDTTRRERDTSGMDRAHIGCEDLSRHGRLTSENKKHEKVESIHREKDHLDD 3049 RRR + + RER E+ S+ R S ++ E ++ DD Sbjct: 850 KRHRRRRSRSISLERERHRGSRLSPRSSDENDSKQRRRRSRSESSEGKHQRSERDENGDD 909 Query: 3050 DSTYNREG 3073 + + +G Sbjct: 910 ELKHYEDG 917 >ref|XP_007153615.1| hypothetical protein PHAVU_003G050400g [Phaseolus vulgaris] gi|561026969|gb|ESW25609.1| hypothetical protein PHAVU_003G050400g [Phaseolus vulgaris] Length = 957 Score = 458 bits (1178), Expect = e-125 Identities = 256/487 (52%), Positives = 292/487 (59%), Gaps = 3/487 (0%) Frame = +2 Query: 221 ARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDVEMNTVKES--ISSDSDTEENEDLVK 394 A+PIWMKQ TFK +E N K + SDS++EE EDL Sbjct: 9 AKPIWMKQAEEAKLKSEAEKAAAAKAAFEATFKGLEKNREKGGGVVQSDSESEEYEDLA- 67 Query: 395 NKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKICPGVGVG 574 NKP+GPVDPSKCT SSFVVV KD+D RK+ NGGAQ+KV++ PG+GVG Sbjct: 68 NKPIGPVDPSKCTAAGTGIAGGTACAPSSFVVVAKDADERKVSNGGAQIKVRVTPGLGVG 127 Query: 575 GSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFS-AXXXXXXXXX 751 GSEQEGMVKD GDGTY VTYVVPKRGNYMV VEC+G+ IMGSPFPVFFS A Sbjct: 128 GSEQEGMVKDMGDGTYTVTYVVPKRGNYMVSVECNGRPIMGSPFPVFFSAAGNGSGGLLG 187 Query: 752 XXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXXXXXEIC 931 VNQTMPNMPNYSGSVSGAFPGLLGMIP E+C Sbjct: 188 LAPASTFPNLVNQTMPNMPNYSGSVSGAFPGLLGMIPGVVAGASGGAILPGIGASLGEVC 247 Query: 932 REYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXXXXXXXX 1111 R+YLNGRC K DCK HPPHN S Sbjct: 248 RDYLNGRCAKVDCKLNHPPHNLLMTALAATTSMGTLS--QAPMAPSAAAMAAAQAIVAAQ 305 Query: 1112 XXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSYCGTVVE 1291 +DS+ S +K+ K D+LKKTLQ+SNLSPLLT +QLKQLF++CGTVV+ Sbjct: 306 ALQAHAAQVQAQSAKDSAGSPEKSSKDDALKKTLQVSNLSPLLTVEQLKQLFAFCGTVVD 365 Query: 1292 CSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSSMNQSSL 1471 C+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKSLP KP+++NSS+ SSL Sbjct: 366 CTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSLPQKPSVVNSSLASSSL 425 Query: 1472 PXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKKLKADGV 1651 P NRAATMKSATE+A+ARAAEISKKL DG+ Sbjct: 426 PLMMQQAVAMQQMQFQQALRMQQTMTAQQAANRAATMKSATELAAARAAEISKKLNPDGL 485 Query: 1652 GNEEKGT 1672 +EEK T Sbjct: 486 ESEEKET 492 Score = 84.3 bits (207), Expect = 4e-13 Identities = 97/411 (23%), Positives = 167/411 (40%), Gaps = 24/411 (5%) Frame = +2 Query: 1835 TNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASPKH 2014 ++Y ERR R R+ +DR + + DR+ DH +R S SPK Sbjct: 544 SSYERERR-FRDSREHSDRYRKRDLDRSLDH---RSSVSRRNKSRSVSPHTRKSSVSPKR 599 Query: 2015 RRESLXXXXXXXXXXXXXXXXXXXGNVSP-RTRKSSRAGSGSPKFHRESLSPRTKKSSRA 2191 RE+ SP R RK SRA SGSP R SP T + Sbjct: 600 HRET-----------------------SPHRGRKQSRADSGSPSRRRGRASPNTDEKKLR 636 Query: 2192 NXXXXXXXXXXXXXXXXTHENVSH------YRRRSRSKSAEDEAHXXXXXXXXXXXXXXX 2353 N +E + H R+RSRS S +++ H Sbjct: 637 NRRHSRSRSSDDRLHSSKNEEILHGKSKHKERKRSRSGSVDEKPH--------------- 681 Query: 2354 XXXNDEKTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKERKKSEKVKLDGRN 2533 R S+SSPR+ ES S Y++RSRSKS +D+ S +R ++ ++ +++ + Sbjct: 682 ----------RRSRSSPRKVDESRSRYKKRSRSKSVDDKHDSPERLDQNRNRRMRHSDKR 731 Query: 2534 DEKTDRATKELDLS---AKDSRDLKEYGTSDPRRKDTLLEDGSSSDEKYGSNH-KRSRLD 2701 ++ R+T+ DLS +S++ K R + +E S +K G N K+S+ Sbjct: 732 HSRS-RSTENRDLSEVRVDESKNEKSKHRDSKRGRSKSVEGKHRSKDKSGENRDKKSKHR 790 Query: 2702 DKDSEKHDSVFKDKNDLMDVSEGVK--SPSVSARYNDSAPVDDRTHSRTK-----DSSRY 2860 D+ + S+ + ++D S + + + + S + + H K + S + Sbjct: 791 DRRRSRSTSL-EGEHDKSGTSPHINLDERNFEVKQSRSKFPEGKHHFSDKYGNRDEKSEH 849 Query: 2861 EKSTSDRRRHEKIDTTR------RERDTSGMDRAHIGCEDLSRHGRLTSEN 2995 +K T + + E+ D + ++ D+ G ++ G ++ +H EN Sbjct: 850 QKKTPPKSKSEQFDGSGSFQGNFKDYDSKGKSQSDSGSAEI-KHNLSDGEN 899 >ref|XP_006421067.1| hypothetical protein CICLE_v10004448mg [Citrus clementina] gi|557522940|gb|ESR34307.1| hypothetical protein CICLE_v10004448mg [Citrus clementina] Length = 709 Score = 458 bits (1178), Expect = e-125 Identities = 257/491 (52%), Positives = 299/491 (60%), Gaps = 3/491 (0%) Frame = +2 Query: 209 STALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDV--EMNTVKESISSDSDTEEN- 379 +TA ++ IW+KQ TFK + + N + +++SDSD+EE+ Sbjct: 5 NTAASKAIWLKQAEEAKLKSEAEKDAAAKAAFEATFKGLTNKANEKQAAVASDSDSEEDW 64 Query: 380 EDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKICP 559 E+ + NKP+GPVDPSK T S+F+VVTKDSDGRK+P+GGA++KVK+ P Sbjct: 65 EEDLSNKPIGPVDPSKSTAAGAGIAGGNAGAASTFMVVTKDSDGRKVPHGGAEIKVKVAP 124 Query: 560 GVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAXXXXX 739 GVGVGGSEQEG+VKD DGTY VTYVVPKRGNYM+ +EC+GK IMGSPFPVFFSA Sbjct: 125 GVGVGGSEQEGIVKDMNDGTYTVTYVVPKRGNYMLSIECNGKPIMGSPFPVFFSA---GS 181 Query: 740 XXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXXXX 919 VNQTMPNMPNYS SVSGAFPGLLGMIP Sbjct: 182 NSTGGGLLGMAPNLVNQTMPNMPNYSASVSGAFPGLLGMIPGVVSGASGGAILPGMGASL 241 Query: 920 XEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXXXX 1099 E+CREYLNGRC KTDCK HPPHN S Sbjct: 242 GEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTTMGTLS-QVPMAPSAAAMAAAQAIV 300 Query: 1100 XXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSYCG 1279 +D S S DKAGKAD+LKKTLQ+SNLSPLLT +QLKQLFS+CG Sbjct: 301 AAQALQAHAAQVQAQQSAKDLSGSPDKAGKADALKKTLQVSNLSPLLTVEQLKQLFSFCG 360 Query: 1280 TVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSSMN 1459 TVVEC+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKS P KP+ LNSS+ Sbjct: 361 TVVECTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSFPQKPSHLNSSLA 420 Query: 1460 QSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKKLK 1639 SSLP NRAA+MKSATE+A+ARAAEISKKLK Sbjct: 421 GSSLPMMMQQAVAMQQMQFQQALLMQQTLTAQQAANRAASMKSATELAAARAAEISKKLK 480 Query: 1640 ADGVGNEEKGT 1672 ADG+ +E+K T Sbjct: 481 ADGLVDEDKET 491 >ref|XP_006492975.1| PREDICTED: uncharacterized protein LOC102615780 isoform X1 [Citrus sinensis] Length = 950 Score = 457 bits (1175), Expect = e-125 Identities = 256/491 (52%), Positives = 299/491 (60%), Gaps = 3/491 (0%) Frame = +2 Query: 209 STALARPIWMKQXXXXXXXXXXXXXXXXXXXXXXTFKDV--EMNTVKESISSDSDTEEN- 379 +TA ++ IW+KQ TFK + + N + +++SDSD+EE+ Sbjct: 5 NTAASKAIWLKQAEEAKLKSEAEKDAAAKAAFEATFKGLTNKANEKQAAVASDSDSEEDW 64 Query: 380 EDLVKNKPLGPVDPSKCTXXXXXXXXXXXXXXSSFVVVTKDSDGRKIPNGGAQLKVKICP 559 E+ + NKP+GPVDPSK T S+F+VVTKDSDGRK+P+GGA++KVK+ P Sbjct: 65 EEDLSNKPIGPVDPSKSTAAGAGIAGGNAGAASTFMVVTKDSDGRKVPHGGAEIKVKVAP 124 Query: 560 GVGVGGSEQEGMVKDQGDGTYMVTYVVPKRGNYMVHVECDGKAIMGSPFPVFFSAXXXXX 739 GVGVGGSEQEG+VKD DGTY VTYVVPKRGNYM+ +EC+GK IMGSPFPVFFSA Sbjct: 125 GVGVGGSEQEGIVKDMNDGTYTVTYVVPKRGNYMLSIECNGKPIMGSPFPVFFSA---GS 181 Query: 740 XXXXXXXXXXXXXXVNQTMPNMPNYSGSVSGAFPGLLGMIPXXXXXXXXXXXXXXXXXXX 919 VNQTMPNMPNYS SVSGAFPGLLGMIP Sbjct: 182 NSTGGGLLGMAPNLVNQTMPNMPNYSASVSGAFPGLLGMIPGVVSGASGGAILPGMGASL 241 Query: 920 XEICREYLNGRCVKTDCKFIHPPHNXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXXXX 1099 E+CREYLNGRC KTDCK HPPHN S Sbjct: 242 GEVCREYLNGRCAKTDCKLNHPPHNLLMTALAATTTMGTLS-QVPMAPSAAAMAAAQAIV 300 Query: 1100 XXXXXXXXXXXXXXXXXXRDSSDSRDKAGKADSLKKTLQISNLSPLLTTDQLKQLFSYCG 1279 +D S S DKAGKAD+LKKTLQ+SNLSPLLT +QL+QLFS+CG Sbjct: 301 AAQALQAHAAQVQAQQSAKDLSGSPDKAGKADALKKTLQVSNLSPLLTVEQLRQLFSFCG 360 Query: 1280 TVVECSITDSKHFAYIEYSKPEEAIAALALNNMDVGGRPMNVEMAKSLPPKPAILNSSMN 1459 TVVEC+ITDSKHFAYIEYSKPEEA AALALNNMDVGGRP+NVEMAKS P KP+ LNSS+ Sbjct: 361 TVVECTITDSKHFAYIEYSKPEEATAALALNNMDVGGRPLNVEMAKSFPQKPSHLNSSLA 420 Query: 1460 QSSLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNRAATMKSATEMASARAAEISKKLK 1639 SSLP NRAA+MKSATE+A+ARAAEISKKLK Sbjct: 421 GSSLPMMMQQAVAMQQMQFQQALLMQQTLTAQQAANRAASMKSATELAAARAAEISKKLK 480 Query: 1640 ADGVGNEEKGT 1672 ADG+ +E+K T Sbjct: 481 ADGLVDEDKET 491 Score = 70.9 bits (172), Expect = 5e-09 Identities = 112/402 (27%), Positives = 144/402 (35%), Gaps = 7/402 (1%) Frame = +2 Query: 1835 TNYGSERRSHRQVRDSNDRSGRWERDRTRDHYXXXXXXXXXXXXXXTRKSSRAGSASPKH 2014 + Y +ER S R RD DRS R E D + H +RKS R+ S SPKH Sbjct: 564 SKYDNERLSTRDTRDGADRSRR-ESDISH-HSPVPRRRKSRSVSPRSRKSYRSDSGSPKH 621 Query: 2015 RRESLXXXXXXXXXXXXXXXXXXXGNVSPRTRKSSRAGSGSPKFHRESL-SPRTKKSSRA 2191 R+ES RKSSRA S SPK HR S SPR + Sbjct: 622 RQES-------------------------SARKSSRAHSKSPKRHRGSRNSPRNDDAK-- 654 Query: 2192 NXXXXXXXXXXXXXXXXTHENVSHYRRRSRSKSAEDEAHXXXXXXXXXXXXXXXXXXNDE 2371 YR RSRSKS E ++E Sbjct: 655 ----------------------PKYRHRSRSKSME--------------------RHDEE 672 Query: 2372 KTDNRGSKSSPRRAHESVSHYRRRSRSKSAEDEPRSDDRKE--RKKSEKVKLDGRNDEKT 2545 K + R KS R R+RSRS S EDE R K KL GR+ + Sbjct: 673 KDEARDGKSQHRE--------RKRSRSLSREDEHHGRGRSPAGNVDENKSKLRGRSRSVS 724 Query: 2546 DRATKELDLSAKDSRD--LKEYGTSDPRRKDTLLEDGSSSDEKYGSNH--KRSRLDDKDS 2713 + A DSRD L+ R K + G S +K +H +RSR D Sbjct: 725 ALDKYKSSEIADDSRDDRLRNRHKRRSRSKSVEGKMGGGSRDKKPKHHDRRRSRSISADG 784 Query: 2714 EKHDSVFKDKNDLMDVSEGVKSPSVSARYNDSAPVDDRTHSRTKDSSRYEKSTSDRRRHE 2893 + H G +SP R D + SR+K R K +R R Sbjct: 785 KHH--------------RGSRSP----RGLDESRTKLGRRSRSKSIER--KLYRNRGRSR 824 Query: 2894 KIDTTRRERDTSGMDRAHIGCEDLSRHGRLTSENKKHEKVES 3019 D RR+ S + A++ + +R S++K H ES Sbjct: 825 SADGRRRK---SMLSPANLDRSESNRRRHSASQSKDHANSES 863