BLASTX nr result
ID: Cinnamomum24_contig00011543
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum24_contig00011543
         (3819 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_010250268.1| PREDICTED: pre-mRNA-processing protein 40C i...  1043   0.0
ref|XP_010906097.1| PREDICTED: pre-mRNA-processing protein 40C i...  1007   0.0
ref|XP_010906099.1| PREDICTED: pre-mRNA-processing protein 40C i...  1004   0.0
ref|XP_010906098.1| PREDICTED: pre-mRNA-processing protein 40C i...   984   0.0
ref|XP_011624657.1| PREDICTED: pre-mRNA-processing protein 40C [...   942   0.0
ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C i...   926   0.0
ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C i...   914   0.0
ref|XP_010906101.1| PREDICTED: pre-mRNA-processing protein 40C i...   910   0.0
ref|XP_009388080.1| PREDICTED: pre-mRNA-processing protein 40C [...   904   0.0
ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C i...   902   0.0
ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C i...   890   0.0
ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C i...   878   0.0
ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [...   836   0.0
gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium r...   832   0.0
gb|KJB15269.1| hypothetical protein B456_002G167700 [Gossypium r...   832   0.0
ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma c...   825   0.0
ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-l...   820   0.0
ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citr...   819   0.0
gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sin...
818 0.0 gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sin... 816 0.0 >ref|XP_010250268.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Nelumbo nucifera] gi|719963615|ref|XP_010250275.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Nelumbo nucifera] Length = 1088 Score = 1043 bits (2696), Expect = 0.0 Identities = 599/1109 (54%), Positives = 701/1109 (63%), Gaps = 27/1109 (2%) Frame = -2 Query: 3653 QSSIPGMTPQAPASGPTVAPS--IQVSXXXXXXXXXXXXXXPNTSEPSNDSVRAKFVTTP 3480 QSS G+T QA G PS S T+EP+ +S+RAKF+T P Sbjct: 8 QSSASGITAQASGLGQATGPSNPTVASPAPVSGPSNPKGPSGTTNEPAQESIRAKFITGP 67 Query: 3479 GFVVPAPSFQYSVIXXXXXXXXXXXXXXXSPAVKFTPPTSAAALQPPVPRQSSGSVPSFS 3300 G+VVPAPSF YSVI SPA+ P SA A QP +P QS S P+FS Sbjct: 68 GYVVPAPSFSYSVIPKQNTASGSSLENSSSPALVSNQPASATAFQPSIPGQSLSSGPTFS 127 Query: 3299 YNLISQPNVGSASGQQLQTGTVTGPGNI---QVGKFVPPNTAASLQPPVPGRP---NQFV 3138 YN+I +GS++ Q+LQ+ T G G + QVG P TAASLQPPVPG+P N F Sbjct: 128 YNIIPPAKIGSSAQQKLQSSTDVGSGPLGHSQVGNSTPSTTAASLQPPVPGQPGHPNTFG 187 Query: 3137 PGTIPQNMPAPMQSPISVPKGHPSVXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQ 2958 PGT Q M + SP+SVPKG PS+ QL Q SSN+SAS AV + Sbjct: 188 PGTGAQFMASQGPSPVSVPKGAPSIATSFSFNRIPQLA------QKDLSSNSSASVAVAR 241 Query: 2957 ETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXX 2778 E GTV ASSSS ++P +VS SS + +P++ P T+W Sbjct: 242 EAGTVSPASSSSVPVSMPFHVSPSS-LAAATSPNLCPATLWMPVAPSFVPPPGMPITPGT 300 Query: 2777 XXXXXXXXXXXXXXA-NARPAAMDPSAS--LRPMXXXXXXXXXXXXXXVHQQLYPPYHSQ 2607 AMD S+S LRP+ QQ++ PY + Sbjct: 301 PGPPGIAPSTPLSSTVTVNSEAMDSSSSTSLRPVVPSTV----------QQQMHSPYPAL 350 Query: 2606 PAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXX 2427 P+M PPPQG WL PPQ+ GLQRPP++PYP LP +PLP+RG+ Sbjct: 351 PSMPPPPQGLWL-PPQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISP 409 Query: 2426 XXXXXXXXXXAGSGQPTSSVGT------------QSPPPGIDQDKQSDGNTSTNGEIAKS 2283 G P+SSVG+ PPPG DQ K D G + Sbjct: 410 LGPP--------GGTPSSSVGSVHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNA 461 Query: 2282 
EDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVL 2103 + D WTAHKTETG VYYYN+LTG+STYERPS F GEPDKV VQ TPVS EKLVGTDW L Sbjct: 462 K-VDAWTAHKTETGVVYYYNALTGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWAL 520 Query: 2102 VSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS-QNASFGMDKGSAPVS 1926 V+TNDGKKYY+N+KTK+SSWQ+P+EV ELR++ D D+L+ +MT QN+ +K SAP+S Sbjct: 521 VTTNDGKKYYYNSKTKISSWQVPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPIS 580 Query: 1925 LSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNG 1746 ++ PA+NTGGREA +LR SG SSSALD++KKKLQD+ P TSSPLP SS P +DLNG Sbjct: 581 VTAPAINTGGREATSLRPSGVAGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNG 640 Query: 1745 LGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERG 1569 PVEA KG QSEN K+K+KD NGDGN+ GP+KEECIIQFKEMLKERG Sbjct: 641 SRPVEAAVKGLQSEN-KDKVKDINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERG 699 Query: 1568 VAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFK 1389 VAPFSKWEKELPKI+FDPRFKAVPGY+ARRALFEHYVRT EGFK Sbjct: 700 VAPFSKWEKELPKIVFDPRFKAVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFK 759 Query: 1388 QLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQR 1209 QLLEEASEDID +TDY +FK KWGSDPRFEALDRK+RE LLNERVLPLKKAAE+K Q R Sbjct: 760 QLLEEASEDIDQRTDYQTFKMKWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIR 819 Query: 1208 AAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYISELXXXXX 1029 AAA S FKS+LR+ GDINTSSRWSRVKDSLR+DPRYKSVKHE+RE+LFNEYISEL Sbjct: 820 AAAASGFKSLLREKGDINTSSRWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADE 879 Query: 1028 XXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKD 849 E+DKL RVRLKV+RKEAVA YQALLVETIKD Sbjct: 880 EAEREAKVKREEEDKLKEREREMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKD 939 Query: 848 PKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAE 669 P+ SWTES P+LEKDPQGRA+N LD D EKLFREHVK LYER AREFR LL EVIT E Sbjct: 940 PQVSWTESRPRLEKDPQGRATNSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTE 999 Query: 668 TAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK-PSDSKE 492 A QMT+DGK LTSWS AKRLLK DPRYSKMPRK+RE++W+R AEE+ ++K SD KE Sbjct: 1000 
AASQMTNDGKTVLTSWSTAKRLLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKE 1059 Query: 491 EKPHSEFKNKISADSERSPAP-RRTHSRR 408 EK + E K + S DS RSP RR+HSRR Sbjct: 1060 EKLNIETKARSSLDSGRSPTGLRRSHSRR 1088 >ref|XP_010906097.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Elaeis guineensis] Length = 1097 Score = 1007 bits (2603), Expect = 0.0 Identities = 570/1087 (52%), Positives = 668/1087 (61%), Gaps = 10/1087 (0%) Frame = -2 Query: 3641 PGMTPQAPASGPTVAPSIQVSXXXXXXXXXXXXXXPNTSEPSNDSVRAKFVTTPGFVVPA 3462 P +P SGP++ ++ VS N + P+ D VRAKF T+ GFVVPA Sbjct: 57 PVTSPSFMDSGPSL--TVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQGFVVPA 114 Query: 3461 PSFQYSVIXXXXXXXXXXXXXXXSPAVKFTPPTSAAALQPPVPRQSSGSVPSFSYNLISQ 3282 PSF Y V SP ++ +PP A ALQPPVP Q G+ PSFSYN++S Sbjct: 115 PSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSYNVVSN 174 Query: 3281 PNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTIPQNMP 3111 N GSA+GQQ Q T T N+Q G+F PP TAASLQPPVP P VPG I + P Sbjct: 175 ANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAITPSCP 234 Query: 3110 APMQSPISVPKGHPSVXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQETGTVPAAS 2931 APMQ P+S+P G S AVV E GT S Sbjct: 235 APMQLPLSIPTG--------------------------------TSDAVVTEAGTSITTS 262 Query: 2930 SSSQSTALPVYVSSSSSMIVPAAPSVYPM-TMWTQXXXXXXXXXXXXXXXXXXXXXXXXX 2754 SQS L V SSSS P+ + Sbjct: 263 IDSQSAQLSATVPSSSSTASGINPNANSSGILMPSTPSFTGHPGMPGLAGTPGLPGIPNS 322 Query: 2753 XXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPAMAPP 2589 ++PA +PS LRPM QQ Y PY S P PP Sbjct: 323 ATVSSTVTSQPAGTNPSP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPGTIPP 381 Query: 2588 PQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXX 2409 PQ WL PPQ GLQR P++PY LPAPF LPV G+ Sbjct: 382 PQALWLHPPQAGGLQRAPFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVANQGP 441 Query: 2408 XXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYY 2229 GS Q S+VG +SP GID +K ++ + +GE K+E+AD WTAHKTE+G VYY Sbjct: 442 ASTTMGSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESGVVYY 500 Query: 2228 
YNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVS 2049 YNS+TG+STYERPSSF GEP+ V QSTPVS EKL GT+W LV+TNDG+KYY++TK KVS Sbjct: 501 YNSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVS 560 Query: 2048 SWQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTS 1869 SWQ+P EV ELRK Q+SD+L+ + + + DKGSAP+S+S PAV TGGR++MALRTS Sbjct: 561 SWQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMALRTS 618 Query: 1868 GAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSENSKEK 1689 GA SSSALD+VKKKLQDAG PVTSSP+P PV SDLNG VE KGQQ NSK+K Sbjct: 619 GAAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTNSKDK 677 Query: 1688 LKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPR 1512 +KD DGNM GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPR Sbjct: 678 VKD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPR 734 Query: 1511 FKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSF 1332 FKAVP Y+AR+ +FEH+VRT + FKQLLEEASE+IDHKTDY +F Sbjct: 735 FKAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTF 794 Query: 1331 KRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINT 1152 KRKWGSDPRF LDRK+RE LLNE+V KAAE+K+Q R AAV+SFKSMLRD+ DI T Sbjct: 795 KRKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITT 850 Query: 1151 SSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXX 972 +SRWSRVK++LRNDPRYK+VKHEER LFNEYISEL EQ+KL Sbjct: 851 TSRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKER 910 Query: 971 XXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGR 792 RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKDPQGR Sbjct: 911 EREMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGR 970 Query: 791 ASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNALTSWSEA 612 A+NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE A Q TDDGK L SWSEA Sbjct: 971 ATNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEA 1030 Query: 611 KRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPA 432 KRLLKPDPRYSKMP KDRE +W+R+AE+M R+QKP+ +EKP ++ +N+ S+D R + Sbjct: 1031 
KRLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFSRR-S 1089 Query: 431 PRRTHSR 411 PRR+H R Sbjct: 1090 PRRSHGR 1096 >ref|XP_010906099.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Elaeis guineensis] Length = 1055 Score = 1004 bits (2595), Expect = 0.0 Identities = 569/1086 (52%), Positives = 667/1086 (61%), Gaps = 9/1086 (0%) Frame = -2 Query: 3641 PGMTPQAPASGPTVAPSIQVSXXXXXXXXXXXXXXPNTSEPSNDSVRAKFVTTPGFVVPA 3462 P +P SGP++ ++ VS N + P+ D VRAKF T+ GFVVPA Sbjct: 57 PVTSPSFMDSGPSL--TVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQGFVVPA 114 Query: 3461 PSFQYSVIXXXXXXXXXXXXXXXSPAVKFTPPTSAAALQPPVPRQSSGSVPSFSYNLISQ 3282 PSF Y V SP ++ +PP A ALQPPVP Q G+ PSFSYN++S Sbjct: 115 PSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSYNVVSN 174 Query: 3281 PNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTIPQNMP 3111 N GSA+GQQ Q T T N+Q G+F PP TAASLQPPVP P VPG I + P Sbjct: 175 ANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAITPSCP 234 Query: 3110 APMQSPISVPKGHPSVXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQETGTVPAAS 2931 APMQ P+S+P G S AVV E GT S Sbjct: 235 APMQLPLSIPTG--------------------------------TSDAVVTEAGTSITTS 262 Query: 2930 SSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXX 2751 SQS L V SSSS ++ Sbjct: 263 IDSQSAQLSATVPSSSSTASVSST------------------------------------ 286 Query: 2750 XXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPAMAPPP 2586 ++PA +PS LRPM QQ Y PY S P PPP Sbjct: 287 -----VTSQPAGTNPSP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPGTIPPP 340 Query: 2585 QGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXX 2406 Q WL PPQ GLQR P++PY LPAPF LPV G+ Sbjct: 341 QALWLHPPQAGGLQRAPFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVANQGPA 400 Query: 2405 XXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYY 2226 GS Q S+VG +SP GID +K ++ + +GE K+E+AD WTAHKTE+G VYYY Sbjct: 401 STTMGSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESGVVYYY 459 Query: 2225 NSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSS 2046 NS+TG+STYERPSSF GEP+ V QSTPVS EKL GT+W 
LV+TNDG+KYY++TK KVSS Sbjct: 460 NSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVSS 519 Query: 2045 WQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSG 1866 WQ+P EV ELRK Q+SD+L+ + + + DKGSAP+S+S PAV TGGR++MALRTSG Sbjct: 520 WQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMALRTSG 577 Query: 1865 AMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSENSKEKL 1686 A SSSALD+VKKKLQDAG PVTSSP+P PV SDLNG VE KGQQ NSK+K+ Sbjct: 578 AAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTNSKDKV 636 Query: 1685 KDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRF 1509 KD DGNM GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPRF Sbjct: 637 KD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPRF 693 Query: 1508 KAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFK 1329 KAVP Y+AR+ +FEH+VRT + FKQLLEEASE+IDHKTDY +FK Sbjct: 694 KAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTFK 753 Query: 1328 RKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTS 1149 RKWGSDPRF LDRK+RE LLNE+V KAAE+K+Q R AAV+SFKSMLRD+ DI T+ Sbjct: 754 RKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITTT 809 Query: 1148 SRWSRVKDSLRNDPRYKSVKHEEREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXX 969 SRWSRVK++LRNDPRYK+VKHEER LFNEYISEL EQ+KL Sbjct: 810 SRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKERE 869 Query: 968 XXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRA 789 RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKDPQGRA Sbjct: 870 REMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGRA 929 Query: 788 SNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNALTSWSEAK 609 +NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE A Q TDDGK L SWSEAK Sbjct: 930 TNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEAK 989 Query: 608 RLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAP 429 RLLKPDPRYSKMP KDRE +W+R+AE+M R+QKP+ +EKP ++ +N+ S+D R +P Sbjct: 990 RLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFSRR-SP 1048 Query: 428 RRTHSR 411 RR+H R Sbjct: 1049 RRSHGR 1054 
>ref|XP_010906098.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Elaeis guineensis] Length = 1066 Score = 984 bits (2544), Expect = 0.0 Identities = 562/1087 (51%), Positives = 660/1087 (60%), Gaps = 10/1087 (0%) Frame = -2 Query: 3641 PGMTPQAPASGPTVAPSIQVSXXXXXXXXXXXXXXPNTSEPSNDSVRAKFVTTPGFVVPA 3462 P +P SGP++ ++ VS N + P+ D VRAKF T+ GFVVPA Sbjct: 57 PVTSPSFMDSGPSL--TVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQGFVVPA 114 Query: 3461 PSFQYSVIXXXXXXXXXXXXXXXSPAVKFTPPTSAAALQPPVPRQSSGSVPSFSYNLISQ 3282 PSF Y V SP ++ +PP A ALQPPVP Q G+ PSFSYN++S Sbjct: 115 PSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSYNVVSN 174 Query: 3281 PNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTIPQNMP 3111 N GSA+GQQ Q T T N+Q G+F PP TAASLQPPVP P VPG I + P Sbjct: 175 ANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAITPSCP 234 Query: 3110 APMQSPISVPKGHPSVXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQETGTVPAAS 2931 APMQ P+S+P G S AVV E GT S Sbjct: 235 APMQLPLSIPTG--------------------------------TSDAVVTEAGTSITTS 262 Query: 2930 SSSQSTALPVYVSSSSSMIVPAAPSVYPM-TMWTQXXXXXXXXXXXXXXXXXXXXXXXXX 2754 SQS L V SSSS P+ + Sbjct: 263 IDSQSAQLSATVPSSSSTASGINPNANSSGILMPSTPSFTGHPGMPGLAGTPGLPGIPNS 322 Query: 2753 XXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPAMAPP 2589 ++PA +PS LRPM QQ Y PY S P PP Sbjct: 323 ATVSSTVTSQPAGTNPSP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPGTIPP 381 Query: 2588 PQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXX 2409 PQ WL PPQ GLQR P++PY P + Sbjct: 382 PQALWLHPPQAGGLQRAPFLPYSVANQGPASTTM-------------------------- 415 Query: 2408 XXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYY 2229 GS Q S+VG +SP GID +K ++ + +GE K+E+AD WTAHKTE+G VYY Sbjct: 416 -----GSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESGVVYY 469 Query: 2228 YNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVS 2049 YNS+TG+STYERPSSF GEP+ V QSTPVS EKL GT+W LV+TNDG+KYY++TK KVS Sbjct: 470 YNSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVS 529 Query: 2048 
SWQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTS 1869 SWQ+P EV ELRK Q+SD+L+ + + + DKGSAP+S+S PAV TGGR++MALRTS Sbjct: 530 SWQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMALRTS 587 Query: 1868 GAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSENSKEK 1689 GA SSSALD+VKKKLQDAG PVTSSP+P PV SDLNG VE KGQQ NSK+K Sbjct: 588 GAAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTNSKDK 646 Query: 1688 LKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPR 1512 +KD DGNM GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPR Sbjct: 647 VKD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPR 703 Query: 1511 FKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSF 1332 FKAVP Y+AR+ +FEH+VRT + FKQLLEEASE+IDHKTDY +F Sbjct: 704 FKAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTF 763 Query: 1331 KRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINT 1152 KRKWGSDPRF LDRK+RE LLNE+V KAAE+K+Q R AAV+SFKSMLRD+ DI T Sbjct: 764 KRKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITT 819 Query: 1151 SSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXX 972 +SRWSRVK++LRNDPRYK+VKHEER LFNEYISEL EQ+KL Sbjct: 820 TSRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKER 879 Query: 971 XXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGR 792 RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKDPQGR Sbjct: 880 EREMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGR 939 Query: 791 ASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNALTSWSEA 612 A+NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE A Q TDDGK L SWSEA Sbjct: 940 ATNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEA 999 Query: 611 KRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPA 432 KRLLKPDPRYSKMP KDRE +W+R+AE+M R+QKP+ +EKP ++ +N+ S+D R + Sbjct: 1000 KRLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFSRR-S 1058 Query: 431 PRRTHSR 411 PRR+H R Sbjct: 1059 PRRSHGR 1065 >ref|XP_011624657.1| PREDICTED: pre-mRNA-processing protein 40C [Amborella trichopoda] Length = 1085 Score = 942 
bits (2435), Expect = 0.0 Identities = 548/1110 (49%), Positives = 674/1110 (60%), Gaps = 25/1110 (2%) Frame = -2 Query: 3662 MSSQSSI-PGMTPQAPASGPTVAPSIQVSXXXXXXXXXXXXXXPNTSEPSNDSVRAKFVT 3486 MSSQ+ + P + P AP P Q + N + +SVRAKFV Sbjct: 1 MSSQAWLSPEVQPSAPGVPPQPLTPGQTTTGGPPGPSPPIPRPQN--DQPQESVRAKFVA 58 Query: 3485 TPGFVVPAPSFQYSVIXXXXXXXXXXXXXXXSPAVKFTP---PTSAAALQPPVPRQSSGS 3315 +PG+++PAPSF Y V+ P P P SA ++QPPVP S+ S Sbjct: 59 SPGYILPAPSFSYGVVSQNNNA----------PRASLPPQSTPLSAVSVQPPVPGHSATS 108 Query: 3314 VPSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR------ 3153 SFSY++ S SA T +Q GK P +AASLQPPVPG+ Sbjct: 109 GASFSYSVASHATTTSA----------TASNPMQGGKPAGPTSAASLQPPVPGQSSVSVH 158 Query: 3152 PNQFVPGTIPQNMPAPMQSPISVPKGHPSVXXXXXXXXXSQLPVAAESPQNKHSSNTSAS 2973 PN + P QN A + P V KG PS V++E Q +SN+ AS Sbjct: 159 PNSWDPERPVQNALAQARPPFLVRKGPPSTSGFSFSGNSQS--VSSEDSQKHQASNSDAS 216 Query: 2972 AAVVQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVY--PMTMWTQXXXXXXXXXX 2799 AAV QE T +SS++Q+T LP SS++S V ++P+ Y P M Sbjct: 217 AAVAQEAKTSQPSSSTAQTTPLPA-PSSTTSRPVSSSPNTYATPFYMPKAPPFPGPPRLP 275 Query: 2798 XXXXXXXXXXXXXXXXXXXXXANARPAAMDP-SASLRPMXXXXXXXXXXXXXXVHQQ--- 2631 N RP+ +D SA +RP Q Sbjct: 276 VTPGTPGPPGIALSAPQLSSSVNIRPSVIDTNSAIMRPNIASSAPGTSNAASVPITQTAQ 335 Query: 2630 --LYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXX 2457 +Y PY + P + PPPQ W+ P Q+ GLQRPP++PYP P PFP+P+R I Sbjct: 336 PPIYSPYPTLPGVVPPPQAMWMHPSQMGGLQRPPFLPYPGTFPGPFPMPLRPITVPPVAM 395 Query: 2456 XXXXXXXXXXXXXXXXXXXXA--GSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKS 2283 A G+G + QSPPPGID++K + T+ + + Sbjct: 396 PDSSQPPGVSPIGPPGGIPLADHGAGIQVTISEEQSPPPGIDKEKDTIDYTNKDDNAVSN 455 Query: 2282 EDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVL 2103 ED D WTAHKT+TGAVYYYN+LTG+STYE+P FKGE DKV +Q TPVS EKLVGTDW L Sbjct: 456 EDTDQWTAHKTDTGAVYYYNALTGESTYEKPPGFKGEVDKVILQRTPVSWEKLVGTDWAL 515 Query: 2102 VSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDS-LQTSMTSQNASFGMDKGSAPVS 1926 V+TNDGKKYY+NTK+K+SSWQ+P EVAELRK+Q++D+ L+ + QNA DKGS S Sbjct: 516 
VATNDGKKYYYNTKSKISSWQVPPEVAELRKKQEADAALKANAPVQNAGISSDKGSVSSS 575 Query: 1925 LSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASS-VPVISDLN 1749 LS PA+NTGGREAM +++ A SSSALD++KKKLQD+GMPVTSS LP+S+ VP SD N Sbjct: 576 LSAPAINTGGREAMTFKSATAPVSSSALDLIKKKLQDSGMPVTSSALPSSTPVPTTSDAN 635 Query: 1748 GLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKER 1572 G V+ KGQQSENSK+KLK A G++ GPTKEEC+IQFKEMLKE+ Sbjct: 636 GQRVVDTTVKGQQSENSKDKLKVAQEVGHVSDSSSDSEDVDSGPTKEECVIQFKEMLKEK 695 Query: 1571 GVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1392 G+APFSKWEKELPKILFDPRFKA+PGYT RR+LFEH+VRT EGF Sbjct: 696 GIAPFSKWEKELPKILFDPRFKAIPGYTERRSLFEHFVRTRAEEERKEKRAAQKAAIEGF 755 Query: 1391 KQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQ 1212 KQLLE ASEDI+HKTDY +FK+KWG DPRF ALDRK+RE LLNERVLPL+KA E+K Q Sbjct: 756 KQLLEGASEDINHKTDYETFKKKWGYDPRFVALDRKEREMLLNERVLPLRKAVEEKTQAI 815 Query: 1211 RAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYISELXXXX 1032 RAAAV+SFKSML + DIN SRWS+VKDSLRNDPRYKSVKHE+REVLF EYISEL Sbjct: 816 RAAAVASFKSMLHEKVDINIGSRWSKVKDSLRNDPRYKSVKHEDREVLFLEYISELKAAE 875 Query: 1031 XXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIK 852 E++KL RVR K RRK+AV SYQALL E IK Sbjct: 876 QEADRAAKAKREEEEKLKERERELRKRKEREEQEVERVRQKARRKDAVVSYQALLTERIK 935 Query: 851 DPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITA 672 DPKASWTES PKLEKDP GRA+NP+L+ AD EKLFREHVK L ER AREFR+LLAEVIT Sbjct: 936 DPKASWTESKPKLEKDPLGRATNPELEPADMEKLFREHVKVLNERCAREFRSLLAEVITP 995 Query: 671 ETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK-PSDSK 495 E A Q ++DGK L SWS AK+LL+PDPRY KMPR++RES+W+R+AE+M RRQ+ S+ K Sbjct: 996 EAAAQASEDGKTLLNSWSTAKKLLRPDPRYEKMPRRERESLWQRYAEDMDRRQRAASEQK 1055 Query: 494 EEKPHSEFKNKISADSER-SPAPRRTHSRR 408 EEK + + ++ A S + SP+ RR+H R+ Sbjct: 1056 EEKTNIDDPSRRPAGSSKSSPSVRRSHGRK 1085 >ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Vitis vinifera] gi|297738259|emb|CBI27460.3| unnamed protein product [Vitis vinifera] Length = 
1046 Score = 926 bits (2392), Expect = 0.0 Identities = 537/1097 (48%), Positives = 645/1097 (58%), Gaps = 7/1097 (0%) Frame = -2 Query: 3677 IKLIKMSSQSSIPGMTPQAPASGPTVAPSIQVSXXXXXXXXXXXXXXPNTSEPSNDSVRA 3498 +++ +SQ+ + G+ P+ GP P+ ++ S +S + Sbjct: 9 VEVQSSASQNPVTGLPAGGPSGGPPT-PTGAIAPASVATIRTSEGASGTASNSIQESAQG 67 Query: 3497 KFVTTPGFVVPAPSFQYSVIXXXXXXXXXXXXXXXSPAVKFTPPTSAAALQPPVPRQSSG 3318 KFV P V+P PSF YS I + P S Q PVP SS Sbjct: 68 KFVNAPPHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTPVPGPSSS 127 Query: 3317 SVPSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGRPNQFV 3138 S PSFSYN I+ G Q Q+ T +I G P AAS Sbjct: 128 SGPSFSYN-IAHKGAGFPGSQPFQSST-----SIASGPRGPTPNAASFS----------- 170 Query: 3137 PGTIPQNMPAPMQSPISVPKGHPSVXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQ 2958 G+P + + Q S N S AV Q Sbjct: 171 ------------------FNGNPQL---------------VQKDQTLKSDN---SGAVAQ 194 Query: 2957 ETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXX 2778 E G++ +AS SQS P SSS+M V ++P + P T+W Sbjct: 195 EAGSMSSASHVSQSVPFP---CSSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGT 251 Query: 2777 XXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAM 2598 A P+A +S + QQ+YP Y S PA Sbjct: 252 PGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPAT 311 Query: 2597 APPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXX 2418 QG WLQPPQ+ GL RPP++PYP P PFPLP G+ Sbjct: 312 NASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGT 371 Query: 2417 XXXXXXXAG-SGQ---PTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKT 2250 A SG TS + ++ PPPGID +K +G + +G A +E D WTAHKT Sbjct: 372 AGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG-AAVNEQVDAWTAHKT 430 Query: 2249 ETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYH 2070 +TG VYYYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+ Sbjct: 431 DTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYY 490 Query: 2069 NTKTKVSSWQLPVEVAELRKRQDSDSL-QTSMTSQNASFGMDKGSAPVSLSVPAVNTGGR 1893 NTKTK+SSWQ+P E+ E+RK+QDS +L + +M + N + +KG +P++LS PAV TGGR Sbjct: 491 
NTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGR 550 Query: 1892 EAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQ 1713 +A LRTS S+SALDM+KKKLQD+G P TSSP+ SS P+ S+LNG +E KG Sbjct: 551 DATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPV-HSSGPIASELNGSRVIEPTVKGL 609 Query: 1712 QSENSKEKLKDANGDGNM-XXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKEL 1536 QSENSK+KLKD NGDGNM SGPTKEECIIQFKEMLKERGVAPFSKWEKEL Sbjct: 610 QSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKEL 669 Query: 1535 PKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDID 1356 PKI+FDPRFKA+PGY+ARR+LFEHYVRT EGFKQLLEEASEDID Sbjct: 670 PKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDID 729 Query: 1355 HKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSML 1176 HKT+Y +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K Q RAAAVSSFKSML Sbjct: 730 HKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSML 789 Query: 1175 RDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYISELXXXXXXXXXXXXXXXX 996 RD GDI TS+RWSRVKDSLRNDPRYK VKHE+RE+LFNEYISEL Sbjct: 790 RDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKE 849 Query: 995 EQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPK 816 EQDKL RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PK Sbjct: 850 EQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPK 909 Query: 815 LEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKN 636 LEKDPQ RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK Sbjct: 910 LEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKT 969 Query: 635 ALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKIS 456 LTSWS AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK + + E+ H+E K + S Sbjct: 970 VLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSS 1029 Query: 455 ADSERSPA-PRRTHSRR 408 DS R P+ RR H RR Sbjct: 1030 VDSGRFPSGSRRAHERR 1046 >ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Nelumbo nucifera] Length = 894 Score = 914 bits (2362), Expect = 0.0 Identities = 516/916 (56%), Positives = 599/916 (65%), Gaps = 
19/916 (2%) Frame = -2 Query: 3098 SPISVPKGHPSVXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQ 2919 SP+SVPKG PS+ QL Q SSN+SAS AV +E GTV ASSSS Sbjct: 7 SPVSVPKGAPSIATSFSFNRIPQLA------QKDLSSNSSASVAVAREAGTVSPASSSSV 60 Query: 2918 STALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2739 ++P +VS SS + +P++ P T+W Sbjct: 61 PVSMPFHVSPSS-LAAATSPNLCPATLWMPVAPSFVPPPGMPITPGTPGPPGIAPSTPLS 119 Query: 2738 XA-NARPAAMDPSAS--LRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQ 2568 AMD S+S LRP+ QQ++ PY + P+M PPPQG WL Sbjct: 120 STVTVNSEAMDSSSSTSLRPVVPSTV----------QQQMHSPYPALPSMPPPPQGLWL- 168 Query: 2567 PPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGS 2388 PPQ+ GLQRPP++PYP LP +PLP+RG+ Sbjct: 169 PPQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISPLGPP--------G 220 Query: 2387 GQPTSSVGT------------QSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTET 2244 G P+SSVG+ PPPG DQ K D G ++ D WTAHKTET Sbjct: 221 GTPSSSVGSVHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNAK-VDAWTAHKTET 279 Query: 2243 GAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNT 2064 G VYYYN+LTG+STYERPS F GEPDKV VQ TPVS EKLVGTDW LV+TNDGKKYY+N+ Sbjct: 280 GVVYYYNALTGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWALVTTNDGKKYYYNS 339 Query: 2063 KTKVSSWQLPVEVAELRKRQDSDSLQTSMTS-QNASFGMDKGSAPVSLSVPAVNTGGREA 1887 KTK+SSWQ+P+EV ELR++ D D+L+ +MT QN+ +K SAP+S++ PA+NTGGREA Sbjct: 340 KTKISSWQVPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPISVTAPAINTGGREA 399 Query: 1886 MALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQS 1707 +LR SG SSSALD++KKKLQD+ P TSSPLP SS P +DLNG PVEA KG QS Sbjct: 400 TSLRPSGVAGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNGSRPVEAAVKGLQS 459 Query: 1706 ENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPK 1530 EN K+K+KD NGDGN+ GP+KEECIIQFKEMLKERGVAPFSKWEKELPK Sbjct: 460 EN-KDKVKDINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERGVAPFSKWEKELPK 518 Query: 1529 ILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHK 1350 I+FDPRFKAVPGY+ARRALFEHYVRT EGFKQLLEEASEDID + Sbjct: 519 IVFDPRFKAVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFKQLLEEASEDIDQR 578 
Query: 1349 TDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRD 1170 TDY +FK KWGSDPRFEALDRK+RE LLNERVLPLKKAAE+K Q RAAA S FKS+LR+ Sbjct: 579 TDYQTFKMKWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIRAAAASGFKSLLRE 638 Query: 1169 SGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYISELXXXXXXXXXXXXXXXXEQ 990 GDINTSSRWSRVKDSLR+DPRYKSVKHE+RE+LFNEYISEL E+ Sbjct: 639 KGDINTSSRWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADEEAEREAKVKREEE 698 Query: 989 DKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLE 810 DKL RVRLKV+RKEAVA YQALLVETIKDP+ SWTES P+LE Sbjct: 699 DKLKEREREMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKDPQVSWTESRPRLE 758 Query: 809 KDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNAL 630 KDPQGRA+N LD D EKLFREHVK LYER AREFR LL EVIT E A QMT+DGK L Sbjct: 759 KDPQGRATNSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTEAASQMTNDGKTVL 818 Query: 629 TSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK-PSDSKEEKPHSEFKNKISA 453 TSWS AKRLLK DPRYSKMPRK+RE++W+R AEE+ ++K SD KEEK + E K + S Sbjct: 819 TSWSTAKRLLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKEEKLNIETKARSSL 878 Query: 452 DSERSPAP-RRTHSRR 408 DS RSP RR+HSRR Sbjct: 879 DSGRSPTGLRRSHSRR 894 >ref|XP_010906101.1| PREDICTED: pre-mRNA-processing protein 40C isoform X5 [Elaeis guineensis] Length = 916 Score = 910 bits (2352), Expect = 0.0 Identities = 513/951 (53%), Positives = 596/951 (62%), Gaps = 10/951 (1%) Frame = -2 Query: 3233 TGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTIPQNMPAPMQSPISVPKGHPSV 3063 T N+Q G+F PP TAASLQPPVP P VPG I + PAPMQ P+S+P G Sbjct: 10 TNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAITPSCPAPMQLPLSIPTG---- 65 Query: 3062 XXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTALPVYVSSSS 2883 S AVV E GT S SQS L V SSS Sbjct: 66 ----------------------------TSDAVVTEAGTSITTSIDSQSAQLSATVPSSS 97 Query: 2882 SMIVPAAPSVYPM-TMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDP 2706 S P+ + ++PA +P Sbjct: 98 STASGINPNANSSGILMPSTPSFTGHPGMPGLAGTPGLPGIPNSATVSSTVTSQPAGTNP 157 Query: 2705 SASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQR 2541 S LRPM QQ Y PY S P PPPQ WL PPQ GLQR 
Sbjct: 158 SP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPGTIPPPQALWLHPPQAGGLQR 216 Query: 2540 PPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGT 2361 P++PY LPAPF LPV G+ GS Q S+VG Sbjct: 217 APFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVANQGPASTTMGSSQSGSNVGI 276 Query: 2360 QSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSF 2181 +SP GID +K ++ + +GE K+E+AD WTAHKTE+G VYYYNS+TG+STYERPSSF Sbjct: 277 ESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESGVVYYYNSVTGESTYERPSSF 335 Query: 2180 KGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQD 2001 GEP+ V QSTPVS EKL GT+W LV+TNDG+KYY++TK KVSSWQ+P EV ELRK Q+ Sbjct: 336 NGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVSSWQVPAEVLELRKSQE 395 Query: 2000 SDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKL 1821 SD+L+ + + + DKGSAP+S+S PAV TGGR++MALRTSGA SSSALD+VKKKL Sbjct: 396 SDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMALRTSGAAVSSSALDLVKKKL 453 Query: 1820 QDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXX 1641 QDAG PVTSSP+P PV SDLNG VE KGQQ NSK+K+KD DGNM Sbjct: 454 QDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTNSKDKVKD---DGNMSDSSSD 509 Query: 1640 XXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEH 1464 GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPRFKAVP Y+AR+ +FEH Sbjct: 510 SDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAVPSYSARKTIFEH 569 Query: 1463 YVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRK 1284 +VRT + FKQLLEEASE+IDHKTDY +FKRKWGSDPRF LDRK Sbjct: 570 FVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTFKRKWGSDPRFGVLDRK 629 Query: 1283 DREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPR 1104 +RE LLNE+V KAAE+K+Q R AAV+SFKSMLRD+ DI T+SRWSRVK++LRNDPR Sbjct: 630 ERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITTTSRWSRVKENLRNDPR 685 Query: 1103 YKSVKHEEREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXX 924 YK+VKHEER LFNEYISEL EQ+KL Sbjct: 686 YKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKEREREMRKRKEREEQEME 745 Query: 923 RVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFR 744 RVRLKVRRKEAVASYQALLVETIKDPKASWTES 
PKLEKDPQGRA+NPDL + D EKLFR Sbjct: 746 RVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGRATNPDLGQGDAEKLFR 805 Query: 743 EHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRK 564 +HVK LYER AR FR LL+EVITAE A Q TDDGK L SWSEAKRLLKPDPRYSKMP K Sbjct: 806 DHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEAKRLLKPDPRYSKMPGK 865 Query: 563 DRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSR 411 DRE +W+R+AE+M R+QKP+ +EKP ++ +N+ S+D R +PRR+H R Sbjct: 866 DREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFSRR-SPRRSHGR 915 >ref|XP_009388080.1| PREDICTED: pre-mRNA-processing protein 40C [Musa acuminata subsp. malaccensis] Length = 1128 Score = 904 bits (2337), Expect = 0.0 Identities = 524/1062 (49%), Positives = 645/1062 (60%), Gaps = 23/1062 (2%) Frame = -2 Query: 3524 EPSNDSVRAKFVTTPGFVVPAPSFQYSVIXXXXXXXXXXXXXXXSPAVKFTPPTSAAALQ 3345 + S DS+RAKF + PGFVV APSF Y VI S +K TPP AAALQ Sbjct: 87 DTSQDSIRAKFSSPPGFVVAAPSFSYGVIPRTNLTSGNPQQSSSS-GLKLTPPVPAAALQ 145 Query: 3344 PPVPRQSSGSVPSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPP 3165 PPVP Q G+ P F YN++S NV A+GQQ+Q TV ++Q GKF+PP+ A+SLQPP Sbjct: 146 PPVPGQFLGTRP-FPYNVVSHANVVPAAGQQIQLNTVPVQAHLQGGKFIPPS-ASSLQPP 203 Query: 3164 VPG---RPNQFVPGTIPQNMPAPMQSPISVPKGHPSVXXXXXXXXXSQLPVAAESPQNKH 2994 VP RP F PG + P+PMQ P+SVP+G +Q A + Sbjct: 204 VPRQPVRPTPFGPGAVSLISPSPMQFPLSVPQGDAIKQTNFSFSGHNQFSTAEKDETILS 263 Query: 2993 SSNTSASAAVVQETG---TVPAASSSSQSTALPVYVSS---------SSSMIVPAAPSVY 2850 S ++ A V+ T T+ + S S ++P+ S+ ++SM++PAAPS Sbjct: 264 SEKCTSDAVAVETTSDSSTLVNSQSVQTSQSMPLGTSTGLGINANACAASMLIPAAPSFT 323 Query: 2849 PMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARP-----AAMDPSASLRPM 2685 ++ RP AA+ P+++ P+ Sbjct: 324 AHAEMPNARGIPGLTGNSSSATASTGATIKPTPTNSSISSPRPIIPVTAALPPTSTSVPV 383 Query: 2684 XXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPA 2505 QQ Y SQP MAP PQ W PPQ +Q + PYP PA Sbjct: 384 PFPVPQNV-------QQQTNVHYSSQPTMAPSPQASWSHPPQAGPMQHVSFSPYPGFFPA 436 Query: 2504 PFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQ 2325 PF LPV+GI AGS QP SS+ +S +DQDK+ 
Sbjct: 437 PFSLPVQGIPPAVPLPFIQPPGVSLMVSQVEPTAVTAGSLQPGSSMVAESSSSVVDQDKK 496 Query: 2324 SDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQST 2145 S+ G+ + +E + WTAHKTETGAVYYYNS+TG+STY++PS+FKGE +K QS Sbjct: 497 SNNLDKDEGDTS-NELENAWTAHKTETGAVYYYNSITGKSTYQKPSNFKGESEKATTQSN 555 Query: 2144 PVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS-Q 1968 VS EKL GTDW +V+T+DG+KYY++TK KVSSW +P EVAELRK Q+S S + S T Q Sbjct: 556 AVSWEKLAGTDWTIVTTSDGRKYYYDTKNKVSSWHVPAEVAELRKNQESGSTEGSATQLQ 615 Query: 1967 NASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSP 1788 +AS DK SAP +++ PA G ++MALR+SGA SSSALDMVKKKLQ+AG P+TS Sbjct: 616 DASTQGDKVSAPANIAAPAAQIGAHDSMALRSSGAPVSSSALDMVKKKLQEAGTPMTSPH 675 Query: 1787 LPASSVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKE 1611 ++SVP SD NGL EA+AKG + K+K KDANG+GNM GP+KE Sbjct: 676 --STSVPATSDANGLKATEAVAKGVIN---KDKAKDANGEGNMSDSSSDSDDEESGPSKE 730 Query: 1610 ECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXX 1431 ECIIQFKEMLKERGVAPFSKW+KELPKI+FDPRFKAVP +ARRALFEHYVRT Sbjct: 731 ECIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAVPSQSARRALFEHYVRTRAEEERK 790 Query: 1430 XXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVL 1251 + FKQLLEEA EDIDHKTDYHSFKRKWG DPRFEA+DRK+RE LLNE+V Sbjct: 791 EKRAAQKAALDAFKQLLEEALEDIDHKTDYHSFKRKWGGDPRFEAIDRKERELLLNEKV- 849 Query: 1250 PLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREV 1071 KAA++K++ R AA +SFKSMLRD+ DI TSSRWSR+K+SLR+DPRYK+VKHE+RE Sbjct: 850 ---KAADEKMRALRMAAATSFKSMLRDNRDITTSSRWSRIKESLRDDPRYKAVKHEQRET 906 Query: 1070 LFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEA 891 LFNEYI+EL EQDKL RV+LKVRRKEA Sbjct: 907 LFNEYIAELKSAVDEVERSAKAKRDEQDKLKERERELRKRKEREEKEMERVKLKVRRKEA 966 Query: 890 VASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSA 711 SY+ LLVE IKDPKASWTES PKLEKDPQGRA+NPDL + D EKLFREHVK LYER Sbjct: 967 EYSYRTLLVEMIKDPKASWTESKPKLEKDPQGRATNPDLTQEDAEKLFREHVKDLYERCV 1026 Query: 710 REFRALLAEVITAETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAE 531 +FR LLAEV+T E 
A DDGK L SWSEAK LLKPDPRYSKMP KDRES+W+R E Sbjct: 1027 NDFRTLLAEVVTVEAAAAKNDDGKTVLNSWSEAKLLLKPDPRYSKMPSKDRESLWRRHTE 1086 Query: 530 EMQRRQKPSDSKEEKPHSEFKNKISADSE-RSPAPRRTHSRR 408 +M RR K +E P + +N++S+ ++ +P R+H RR Sbjct: 1087 DMLRRPKSVSDTKESPGTNGRNRMSSAADPLKRSPGRSHRRR 1128 >ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Vitis vinifera] Length = 1013 Score = 902 bits (2330), Expect = 0.0 Identities = 509/975 (52%), Positives = 604/975 (61%), Gaps = 7/975 (0%) Frame = -2 Query: 3311 PSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGRPNQFVPG 3132 PSFSY+ I S + QQL +G+V P + Q PVPG Sbjct: 80 PSFSYSGIPHVTTASGTSQQLPSGSVISSN--------PLASTVVFQTPVPG-------- 123 Query: 3131 TIPQNMPAPMQSPISVPKGHPSVXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQET 2952 P + P S KG A S+T S AV QE Sbjct: 124 --PSSSSGPSFSYNIAHKG------------------AGFPGSQPFQSSTDNSGAVAQEA 163 Query: 2951 GTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXXXX 2772 G++ +AS SQS P SSS+M V ++P + P T+W Sbjct: 164 GSMSSASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPG 220 Query: 2771 XXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAP 2592 A P+A +S + QQ+YP Y S PA Sbjct: 221 PPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNA 280 Query: 2591 PPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXX 2412 QG WLQPPQ+ GL RPP++PYP P PFPLP G+ Sbjct: 281 SSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAG 340 Query: 2411 XXXXXAG-SGQP---TSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTET 2244 A SG TS + ++ PPPGID +K +G + +G A +E D WTAHKT+T Sbjct: 341 GTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA-AVNEQVDAWTAHKTDT 399 Query: 2243 GAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNT 2064 G VYYYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+NT Sbjct: 400 GVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNT 459 Query: 2063 KTKVSSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKGSAPVSLSVPAVNTGGREA 1887 KTK+SSWQ+P E+ E+RK+QDS +L+ +M + N + +KG +P++LS PAV TGGR+A Sbjct: 460 
KTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDA 519 Query: 1886 MALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQS 1707 LRTS S+SALDM+KKKLQD+G P TSSP+ +S P+ S+LNG +E KG QS Sbjct: 520 TPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIASELNGSRVIEPTVKGLQS 578 Query: 1706 ENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPK 1530 ENSK+KLKD NGDGNM GPTKEECIIQFKEMLKERGVAPFSKWEKELPK Sbjct: 579 ENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPK 638 Query: 1529 ILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHK 1350 I+FDPRFKA+PGY+ARR+LFEHYVRT EGFKQLLEEASEDIDHK Sbjct: 639 IVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHK 698 Query: 1349 TDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRD 1170 T+Y +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K Q RAAAVSSFKSMLRD Sbjct: 699 TEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRD 758 Query: 1169 SGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYISELXXXXXXXXXXXXXXXXEQ 990 GDI TS+RWSRVKDSLRNDPRYK VKHE+RE+LFNEYISEL EQ Sbjct: 759 KGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQ 818 Query: 989 DKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLE 810 DKL RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PKLE Sbjct: 819 DKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLE 878 Query: 809 KDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNAL 630 KDPQ RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK L Sbjct: 879 KDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVL 938 Query: 629 TSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISAD 450 TSWS AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK + + E+ H+E K + S D Sbjct: 939 TSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVD 998 Query: 449 SERSPA-PRRTHSRR 408 S R P+ RR H RR Sbjct: 999 SGRFPSGSRRAHERR 1013 Score = 68.2 bits (165), Expect = 5e-08 Identities = 70/291 (24%), Positives = 103/291 (35%), Gaps = 11/291 (3%) Frame = -2 Query: 3677 IKLIKMSSQSSIPGMTPQAPASGPTVAPSIQVSXXXXXXXXXXXXXXPNTSEPSNDSVRA 3498 +++ +SQ+ + G+ 
P+ GP P+ ++ S +S + Sbjct: 9 VEVQSSASQNPVTGLPAGGPSGGPPT-PTGAIAPASVATIRTSEGASGTASNSIQESAQG 67 Query: 3497 KFVTTPGFVVPAPSFQYSVIXXXXXXXXXXXXXXXSPAVKFTPPTSAAALQPPVPRQSSG 3318 KFV P V+P PSF YS I + P S Q PVP SS Sbjct: 68 KFVNAPPHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTPVPGPSSS 127 Query: 3317 SVPSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGRPNQFV 3138 S PSFSYN I+ G Q Q+ T Q + + S P P + Sbjct: 128 SGPSFSYN-IAHKGAGFPGSQPFQSSTDNSGAVAQEAGSMSSASHVSQSVPFPCSSSTMS 186 Query: 3137 PGTIPQNMPAPMQSP----ISVPKGHPSVXXXXXXXXXSQLPVAAESPQNKHSSNTSAS- 2973 + P+ P + P VP G P +A +P + + + SAS Sbjct: 187 VSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPG-----IAPSTPLSSNLAVPSASM 241 Query: 2972 --AAVVQETGTVPAASSSS----QSTALPVYVSSSSSMIVPAAPSVYPMTM 2838 ++ V PAA SS Q P Y S ++ P + P M Sbjct: 242 DFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQGPWLQPPQM 292 >ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Vitis vinifera] Length = 903 Score = 890 bits (2301), Expect = 0.0 Identities = 487/880 (55%), Positives = 576/880 (65%), Gaps = 7/880 (0%) Frame = -2 Query: 3026 PVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYP 2847 P + Q S N S AV QE G++ +AS SQS P SSS+M V ++P + P Sbjct: 32 PQLVQKDQTLKSDN---SGAVAQEAGSMSSASHVSQSVPFPC---SSSTMSVSSSPKMGP 85 Query: 2846 MTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXX 2667 T+W A P+A +S Sbjct: 86 TTLWMPSNPSFPVPSGMPVTPGTPGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPA 145 Query: 2666 XXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPV 2487 + QQ+YP Y S PA QG WLQPPQ+ GL RPP++PYP P PFPLP Sbjct: 146 APVSSNPAIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPA 205 Query: 2486 RGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAG-SGQP---TSSVGTQSPPPGIDQDKQSD 2319 G+ A SG TS + ++ PPPGID +K + Sbjct: 206 HGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVN 265 Query: 2318 GNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPV 2139 G + +G A +E D WTAHKT+TG VYYYN+LTG+STYE+PS FKGE DKV VQ TPV Sbjct: 266 
GAGTKDGA-AVNEQVDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPV 324 Query: 2138 STEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQT-SMTSQNA 1962 S EKL GTDW LV+TNDGKKYY+NTKTK+SSWQ+P E+ E+RK+QDS +L+ +M + N Sbjct: 325 SWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNT 384 Query: 1961 SFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLP 1782 + +KG +P++LS PAV TGGR+A LRTS S+SALDM+KKKLQD+G P TSSP+ Sbjct: 385 NVSTEKGPSPIALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVH 444 Query: 1781 ASSVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEEC 1605 +S P+ S+LNG +E KG QSENSK+KLKD NGDGNM GPTKEEC Sbjct: 445 SSG-PIASELNGSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEEC 503 Query: 1604 IIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXX 1425 IIQFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+PGY+ARR+LFEHYVRT Sbjct: 504 IIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEK 563 Query: 1424 XXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPL 1245 EGFKQLLEEASEDIDHKT+Y +F++KWG DPRFEALDRKDRE LLNERVLPL Sbjct: 564 RAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPL 623 Query: 1244 KKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLF 1065 K+AAE+K Q RAAAVSSFKSMLRD GDI TS+RWSRVKDSLRNDPRYK VKHE+RE+LF Sbjct: 624 KRAAEEKAQAIRAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILF 683 Query: 1064 NEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVA 885 NEYISEL EQDKL RVRLKVRRKEAV+ Sbjct: 684 NEYISELKAAEEEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVS 743 Query: 884 SYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSARE 705 SYQALLVETIKDP+ SWTES PKLEKDPQ RA+N DLD +D EKLFREH+K L+ER A E Sbjct: 744 SYQALLVETIKDPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHE 803 Query: 704 FRALLAEVITAETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEM 525 FRALL+EV+TAE A Q T+DGK LTSWS AKRLL+ D RY KMPRKDRES+W+R++EEM Sbjct: 804 FRALLSEVLTAEAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEM 863 Query: 524 QRRQKPSDSKEEKPHSEFKNKISADSERSPA-PRRTHSRR 408 R+QK + + E+ H+E K + S DS 
R P+ RR H RR Sbjct: 864 LRKQKLAQDQTEEKHTEVKGRSSVDSGRFPSGSRRAHERR 903 >ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C isoform X4 [Vitis vinifera] Length = 848 Score = 878 bits (2269), Expect = 0.0 Identities = 477/851 (56%), Positives = 563/851 (66%), Gaps = 7/851 (0%) Frame = -2 Query: 2939 AASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXXXXXXXX 2760 +AS SQS P SSS+M V ++P + P T+W Sbjct: 3 SASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPGI 59 Query: 2759 XXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPPPQG 2580 A P+A +S + QQ+YP Y S PA QG Sbjct: 60 APSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQG 119 Query: 2579 HWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXX 2400 WLQPPQ+ GL RPP++PYP P PFPLP G+ Sbjct: 120 PWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPI 179 Query: 2399 XAG-SGQP---TSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVY 2232 A SG TS + ++ PPPGID +K +G + +G A +E D WTAHKT+TG VY Sbjct: 180 SAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA-AVNEQVDAWTAHKTDTGVVY 238 Query: 2231 YYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKV 2052 YYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+NTKTK+ Sbjct: 239 YYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKL 298 Query: 2051 SSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALR 1875 SSWQ+P E+ E+RK+QDS +L+ +M + N + +KG +P++LS PAV TGGR+A LR Sbjct: 299 SSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLR 358 Query: 1874 TSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSENSK 1695 TS S+SALDM+KKKLQD+G P TSSP+ +S P+ S+LNG +E KG QSENSK Sbjct: 359 TSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIASELNGSRVIEPTVKGLQSENSK 417 Query: 1694 EKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFD 1518 +KLKD NGDGNM GPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FD Sbjct: 418 DKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFD 477 Query: 1517 PRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYH 1338 PRFKA+PGY+ARR+LFEHYVRT 
EGFKQLLEEASEDIDHKT+Y Sbjct: 478 PRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQ 537 Query: 1337 SFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDI 1158 +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K Q RAAAVSSFKSMLRD GDI Sbjct: 538 TFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDI 597 Query: 1157 NTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLX 978 TS+RWSRVKDSLRNDPRYK VKHE+RE+LFNEYISEL EQDKL Sbjct: 598 TTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLK 657 Query: 977 XXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQ 798 RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PKLEKDPQ Sbjct: 658 ERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQ 717 Query: 797 GRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNALTSWS 618 RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK LTSWS Sbjct: 718 ARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWS 777 Query: 617 EAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERS 438 AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK + + E+ H+E K + S DS R Sbjct: 778 TAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDSGRF 837 Query: 437 PA-PRRTHSRR 408 P+ RR H RR Sbjct: 838 PSGSRRAHERR 848 >ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [Gossypium raimondii] gi|763747828|gb|KJB15267.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 887 Score = 836 bits (2160), Expect = 0.0 Identities = 463/876 (52%), Positives = 570/876 (65%), Gaps = 8/876 (0%) Frame = -2 Query: 3011 SPQNKHSSNTSASAAVVQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2844 +PQ ++ S + TGT A+SS SQS LPV+ SS +M PS P+ Sbjct: 24 NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83 Query: 2843 TMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2664 T ++ A A PS+++ Sbjct: 84 T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136 Query: 2663 XXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2484 V QQ+YPPY S P+M PQG+W+Q P + G RPP++PYPT P PFP Sbjct: 137 
PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196 Query: 2483 GIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTST 2304 G+ A + Q + ++ T PP GID K + +T Sbjct: 197 GMPLPAPSSDSQPPGVRPLGMSPFAPSAAALANQ-SLAILTGFPPQGIDNRKLVH-DVTT 254 Query: 2303 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2124 E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L Sbjct: 255 KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314 Query: 2123 VGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMD 1947 GTDW LV+TNDGKKYY+N+KTK+SSWQ+P EV ELRK+QDS+ S + +++ N + Sbjct: 315 AGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAE 374 Query: 1946 KGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVP 1767 KGS P+SLS PAVNTGGR+AM LRTS SSSALD++KKKLQD G+P +SSP+P V Sbjct: 375 KGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVT 433 Query: 1766 VISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFK 1590 +LNG V+ KG QSE++K+KLKDANGDG++ GP+KEECI+QFK Sbjct: 434 ATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFK 491 Query: 1589 EMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXX 1410 EMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T Sbjct: 492 EMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQK 551 Query: 1409 XXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAE 1230 EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL LK+AAE Sbjct: 552 AAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAE 611 Query: 1229 QKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYIS 1050 +K + RAAA SSFKSML++ GDIN +SRWSRVKDSLR+DPRYK VKHE+REVLFNEYIS Sbjct: 612 EKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYIS 671 Query: 1049 ELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQAL 870 EL E++KL RVRLKVRRKEAVAS+QAL Sbjct: 672 ELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQAL 731 Query: 869 LVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALL 690 LVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D 
EKLFREH+K L+ER +FRALL Sbjct: 732 LVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALL 791 Query: 689 AEVITAETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK 510 AEVIT + Q T+ GK AL SWS AKRLLKPDPRY+KMPRK+RE++W+R+AE+M R+QK Sbjct: 792 AEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQK 851 Query: 509 PSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 408 + +EE+ H++ K + S S RRTH RR Sbjct: 852 SALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 887 >gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 888 Score = 832 bits (2149), Expect = 0.0 Identities = 464/877 (52%), Positives = 570/877 (64%), Gaps = 9/877 (1%) Frame = -2 Query: 3011 SPQNKHSSNTSASAAVVQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2844 +PQ ++ S + TGT A+SS SQS LPV+ SS +M PS P+ Sbjct: 24 NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83 Query: 2843 TMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2664 T ++ A A PS+++ Sbjct: 84 T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136 Query: 2663 XXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2484 V QQ+YPPY S P+M PQG+W+Q P + G RPP++PYPT P PFP Sbjct: 137 PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196 Query: 2483 GIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTST 2304 G+ A + Q + ++ T PP GID K + +T Sbjct: 197 GMPLPAPSSDSQPPGVRPLGMSPFAPSAAALANQ-SLAILTGFPPQGIDNRKLVH-DVTT 254 Query: 2303 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2124 E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L Sbjct: 255 KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314 Query: 2123 VGTDWVLVSTNDGKKYYHNTKTKV-SSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGM 1950 GTDW LV+TNDGKKYY+N+KTKV SSWQ+P EV ELRK+QDS+ S + +++ N Sbjct: 315 AGTDWALVTTNDGKKYYYNSKTKVISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVA 374 Query: 1949 DKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSV 1770 +KGS P+SLS PAVNTGGR+AM LRTS SSSALD++KKKLQD G+P +SSP+P V Sbjct: 375 
EKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPV 433 Query: 1769 PVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQF 1593 +LNG V+ KG QSE++K+KLKDANGDG++ GP+KEECI+QF Sbjct: 434 TATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQF 491 Query: 1592 KEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXX 1413 KEMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T Sbjct: 492 KEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQ 551 Query: 1412 XXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAA 1233 EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL LK+AA Sbjct: 552 KAAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAA 611 Query: 1232 EQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYI 1053 E+K + RAAA SSFKSML++ GDIN +SRWSRVKDSLR+DPRYK VKHE+REVLFNEYI Sbjct: 612 EEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYI 671 Query: 1052 SELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQA 873 SEL E++KL RVRLKVRRKEAVAS+QA Sbjct: 672 SELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQA 731 Query: 872 LLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRAL 693 LLVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER +FRAL Sbjct: 732 LLVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRAL 791 Query: 692 LAEVITAETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQ 513 LAEVIT + Q T+ GK AL SWS AKRLLKPDPRY+KMPRK+RE++W+R+AE+M R+Q Sbjct: 792 LAEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQ 851 Query: 512 KPSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 408 K + +EE+ H++ K + S S RRTH RR Sbjct: 852 KSALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 888 >gb|KJB15269.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 886 Score = 832 bits (2149), Expect = 0.0 Identities = 462/876 (52%), Positives = 569/876 (64%), Gaps = 8/876 (0%) Frame = -2 Query: 3011 SPQNKHSSNTSASAAVVQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2844 +PQ ++ S + TGT A+SS SQS LPV+ SS +M PS P+ Sbjct: 24 
NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83 Query: 2843 TMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2664 T ++ A A PS+++ Sbjct: 84 T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136 Query: 2663 XXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2484 V QQ+YPPY S P+M PQG+W+Q P + G RPP++PYPT P PFP Sbjct: 137 PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196 Query: 2483 GIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTST 2304 G+ A + Q + ++ T PP GID K + +T Sbjct: 197 GMPLPAPSSDSQPPGVRPLGMSPFAPSAAALANQ-SLAILTGFPPQGIDNRKLVH-DVTT 254 Query: 2303 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2124 E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L Sbjct: 255 KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314 Query: 2123 VGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMD 1947 GTDW LV+TNDGKKYY+N+KTK+SSWQ+P EV ELRK+QDS+ S + +++ N + Sbjct: 315 AGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAE 374 Query: 1946 KGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVP 1767 KGS P+SLS PAVNTGGR+AM LRTS SSSALD++KKKLQD G+P +SSP+P V Sbjct: 375 KGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVT 433 Query: 1766 VISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFK 1590 +LNG V+ KG QSE++K+KLKDANGDG++ GP+KEECI+QFK Sbjct: 434 ATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFK 491 Query: 1589 EMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXX 1410 EMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T Sbjct: 492 EMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQK 551 Query: 1409 XXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAE 1230 EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL LK+AAE Sbjct: 552 AAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAE 611 Query: 1229 QKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYIS 1050 +K + RAAA SSFKSML++ GDIN +SRWSRVKDSLR+DPRYK VKHE+REVLFNEYIS 
Sbjct: 612 EKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYIS 671 Query: 1049 ELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQAL 870 EL ++KL RVRLKVRRKEAVAS+QAL Sbjct: 672 ELKAIEEKAERKDKVKKE-EEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQAL 730 Query: 869 LVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALL 690 LVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER +FRALL Sbjct: 731 LVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALL 790 Query: 689 AEVITAETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK 510 AEVIT + Q T+ GK AL SWS AKRLLKPDPRY+KMPRK+RE++W+R+AE+M R+QK Sbjct: 791 AEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQK 850 Query: 509 PSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 408 + +EE+ H++ K + S S RRTH RR Sbjct: 851 SALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 886 >ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma cacao] gi|508709257|gb|EOY01154.1| Pre-mRNA-processing protein 40C [Theobroma cacao] Length = 816 Score = 825 bits (2131), Expect = 0.0 Identities = 434/749 (57%), Positives = 524/749 (69%), Gaps = 6/749 (0%) Frame = -2 Query: 2636 QQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXX 2457 QQ+YP Y P+MA PQG W+Q P + G RPP++PYPT P PFP G+ Sbjct: 75 QQIYPTYTPLPSMASSPQGFWMQHPPMGGFPRPPFVPYPTIYPGPFPSASSGMPHPAPSS 134 Query: 2456 XXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQS--PPPGIDQDKQSDGNTSTNGEIAKS 2283 + Q + + G Q+ PP GID + N T E A + Sbjct: 135 DSQPPGVSPLATSPFAPSIAIPANQSSVASGIQTGFPPQGID-----NRNVGTRVEAAVN 189 Query: 2282 EDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVL 2103 E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPDKVPVQ TPVS E+L GT+W L Sbjct: 190 EQSDIWTAHKTDTGIVYYYNALTGESTYEKPAGFKGEPDKVPVQPTPVSVEQLAGTEWAL 249 Query: 2102 VSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMDKGSAPVS 1926 V+T+DGKKYY+N+KTK+SSWQ+P EVAELRK+QD+D S + ++ N +KGS P+S Sbjct: 250 VTTSDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDVSKEHAVPVPNIDVVAEKGSTPIS 309 Query: 1925 LSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSP-LPASSVPVISDLN 1749 LS PAV+TGGR+AM LRTS 
SSSALD++KKKLQD+G+P +SS +P V +LN Sbjct: 310 LSAPAVSTGGRDAMPLRTSVVPGSSSALDLIKKKLQDSGVPSSSSSSVPVMPVTAAQELN 369 Query: 1748 GLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKER 1572 G V+ KG QSENSK+KLKDANGDGN+ GP+KEECI+QFKEMLKER Sbjct: 370 GSRAVDV--KGLQSENSKDKLKDANGDGNISDSSSDSEDTDSGPSKEECIMQFKEMLKER 427 Query: 1571 GVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1392 GVAPFSKWEKELPKI+FDPRFKA+P ++ARR LFEHYV+T EGF Sbjct: 428 GVAPFSKWEKELPKIVFDPRFKAIPSHSARRTLFEHYVKTRAEEERREKRAALKAAIEGF 487 Query: 1391 KQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQ 1212 KQLL+EASEDIDH T+Y +FKRKWGSD RFEALDRKDRE LL ERVLPLK+AAE+K Q Sbjct: 488 KQLLDEASEDIDHNTNYQTFKRKWGSDLRFEALDRKDRELLLTERVLPLKRAAEEKAQAI 547 Query: 1211 RAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNEYISELXXXX 1032 RAAA SS KSML++ GDI +SRWSRVKDS+R+DPRYK VKHE+REVLFNEYISEL Sbjct: 548 RAAAASSLKSMLKEKGDITVNSRWSRVKDSIRDDPRYKCVKHEDREVLFNEYISELKAVE 607 Query: 1031 XXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIK 852 E++KL RVRLKVRRKEAVAS+QALLVETIK Sbjct: 608 EKAERKERVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIK 667 Query: 851 DPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITA 672 DP+ASWTES PKLEKDPQGRA+NPDLD +DTEKLFREH+K L+ER +FRALLAEVIT Sbjct: 668 DPQASWTESKPKLEKDPQGRAANPDLDPSDTEKLFREHIKMLFERCTHDFRALLAEVITQ 727 Query: 671 ETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKE 492 + A Q T+ GK SWS AKRLLKPDPRYSKMPRK+RE++W+R+AE+M R+QK + +E Sbjct: 728 DAAAQETEGGKTVFNSWSTAKRLLKPDPRYSKMPRKEREALWRRYAEDMLRKQKSALDQE 787 Query: 491 EKPHSEFKNKISADSER-SPAPRRTHSRR 408 E+ ++ K + S D R S R+ H RR Sbjct: 788 EEKRTDAKVRSSGDLGRFSSGSRKVHERR 816 >ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-like [Citrus sinensis] Length = 978 Score = 820 bits (2118), Expect = 0.0 Identities = 467/937 (49%), Positives = 569/937 (60%), Gaps = 10/937 (1%) Frame = -2 Query: 3188 TAASLQPPVPGRPNQFVPGTIPQ------NMPAPMQSPISVPKGHPSVXXXXXXXXXS-Q 3030 T S+ P + G IPQ N S SV +PSV S Sbjct: 47 
TNDSISGPSQAKSVTATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSAS 106 Query: 3029 LPVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVY 2850 V SP + N + AV ++ G + S++SQ V S S++ +A ++ Sbjct: 107 QTVVGYSPNQQFQPNMNKLEAV-EDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALS 165 Query: 2849 PMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXX 2670 T W ++A SA LRP Sbjct: 166 TTTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRPSVPTPS 224 Query: 2669 XXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLP 2490 HQ +YP Y S P + PQG LQPPQ+ P++PYP P+PFPLP Sbjct: 225 APSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLQPPQMGVRPWLPFLPYPAAYPSPFPLP 283 Query: 2489 VRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSPPPGIDQDKQSDG 2316 G+ A G +S T++PP G D+ K+ Sbjct: 284 AHGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK-KEHVH 342 Query: 2315 NTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVS 2136 + S+ + +E D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPVQ TP+S Sbjct: 343 DVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPIS 402 Query: 2135 TEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASF 1956 E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+ + N + Sbjct: 403 MEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQSVPNTNI 461 Query: 1955 GMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPAS 1776 ++KGS +SLS PAVNTGGR+A ALRTS SSSALD++KKKLQD+G P T+SP P S Sbjct: 462 VIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVS 520 Query: 1775 SVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXXXXSGPTKEECII 1599 S S+ NG VE KG Q+EN+K+KLKD NGDG M +GPTKEECII Sbjct: 521 SAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECII 580 Query: 1598 QFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXX 1419 +FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+ +ARRALFE YV+T Sbjct: 581 KFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRA 640 Query: 1418 XXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKK 1239 EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNERVLPLK+ Sbjct: 641 
AQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKR 700 Query: 1238 AAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNE 1059 AAE+K Q RAAA SSFKSMLR+ GDI SSRWS+VKD LR+DPRYKSV+HE+REV+FNE Sbjct: 701 AAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNE 760 Query: 1058 YISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASY 879 Y+ EL EQ+KL RVRLKVRRKEAV S+ Sbjct: 761 YVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSF 820 Query: 878 QALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFR 699 QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER A +FR Sbjct: 821 QALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFR 880 Query: 698 ALLAEVITAETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQR 519 LLAEVITAE A Q T+DGK L SWS AKR+LKP+PRYSKMPRK+RE++W+R AEE+QR Sbjct: 881 GLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQR 940 Query: 518 RQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 408 + K S + E H + K++ S D R P+ R + R Sbjct: 941 KHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 977 >ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citrus clementina] gi|557539684|gb|ESR50728.1| hypothetical protein CICLE_v10030612mg [Citrus clementina] Length = 1015 Score = 819 bits (2115), Expect = 0.0 Identities = 478/1004 (47%), Positives = 589/1004 (58%), Gaps = 10/1004 (0%) Frame = -2 Query: 3389 PAVKFTPPTSAAALQPPVPRQSSGSVPSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQV 3210 P ++ ++ A PP +Q + + P + +P GS T T G Sbjct: 31 PFIRSDQIMTSPAWLPPEVQQLTANAP-----ISGKPVGGSLVASSTPTPTSNGSDTATN 85 Query: 3209 GKFVPPNTAASLQPPVPGRPNQFVPGTIPQ------NMPAPMQSPISVPKGHPSVXXXXX 3048 P+ A S+ G IPQ N S SV +PSV Sbjct: 86 DSISGPSQAKSVTA---------TGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVS 136 Query: 3047 XXXXS-QLPVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTALPVYVSSSSSMIV 2871 S V SP + N + AV ++ G + S++SQ V S S++ Sbjct: 137 SFTYSASQTVVGYSPNQQFQPNMNKLEAV-EDAGLGSSTSTNSQPVQASVRTFSDSTVAT 195 Query: 2870 PAAPSVYPMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLR 2691 +A ++ T W ++A SA LR Sbjct: 196 
SSATALSTTTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLR 254 Query: 2690 PMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGL 2511 P HQ +YP + S P + PQ LQPPQ+ P++PYP Sbjct: 255 PSVPTPSAPSNSGSAIQHQ-IYPTHPSLPPVGVSPQRPLLQPPQMGVRPWLPFLPYPAAY 313 Query: 2510 PAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSPPPGID 2337 P+PFPLP G+ A G +S T++PP G D Sbjct: 314 PSPFPLPAHGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPGHQLVGTSGNTEAPPSGTD 373 Query: 2336 QDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVP 2157 + K+ + S+ + +E D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVP Sbjct: 374 K-KEHVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVP 432 Query: 2156 VQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSM 1977 VQ TP+S E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+ Sbjct: 433 VQPTPISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQ 491 Query: 1976 TSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVT 1797 + N + ++KGS +SLS PAVNTGGR+A ALRTS SSSALD++KKKLQD+G P T Sbjct: 492 SVPNTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-T 550 Query: 1796 SSPLPASSVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXXXXSGP 1620 +SP P SS S+ NG VE KG Q+EN+K+KLKD NGDG M +GP Sbjct: 551 ASPAPVSSAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGP 610 Query: 1619 TKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXX 1440 TKEECII+FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+ +ARRALFE YV+T Sbjct: 611 TKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEE 670 Query: 1439 XXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNE 1260 EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNE Sbjct: 671 ERKEKRAAQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNE 730 Query: 1259 RVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEE 1080 RVLPLK+AAE+K Q RAAA SSFKSMLR+ GDI SSRWS+VKD LR+DPRYKSV+HE+ Sbjct: 731 RVLPLKRAAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHED 790 Query: 1079 REVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRR 900 REV+FNEY+ EL EQ+KL RVRLKVRR Sbjct: 
791 REVIFNEYVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRR 850 Query: 899 KEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYE 720 KEAV S+QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYE Sbjct: 851 KEAVTSFQALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYE 910 Query: 719 RSAREFRALLAEVITAETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKR 540 R A +FR LLAEVITAE A Q T+DGK L SWS AKR+LKPDPRYSKMPRK+RE++W+R Sbjct: 911 RCAHDFRGLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPDPRYSKMPRKEREALWRR 970 Query: 539 FAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 408 AEE+QR+ K S + E H + K++ S D R P+ R + R Sbjct: 971 HAEEIQRKHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 1014 >gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis] Length = 978 Score = 818 bits (2114), Expect = 0.0 Identities = 466/937 (49%), Positives = 569/937 (60%), Gaps = 10/937 (1%) Frame = -2 Query: 3188 TAASLQPPVPGRPNQFVPGTIPQ------NMPAPMQSPISVPKGHPSVXXXXXXXXXS-Q 3030 T S+ P + G IPQ N S SV +PSV S Sbjct: 47 TNDSISGPSQAKSVTATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSAS 106 Query: 3029 LPVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVY 2850 V SP + N + AV ++ G + S++SQ V S S++ +A ++ Sbjct: 107 QTVVGYSPNQQFQPNMNKLEAV-EDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALS 165 Query: 2849 PMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXX 2670 T W ++A SA LRP Sbjct: 166 TTTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRPSVPTPS 224 Query: 2669 XXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLP 2490 HQ +YP Y S P + PQG L+PPQ+ P++PYP P+PFPLP Sbjct: 225 APSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLRPPQMGVRPWLPFLPYPAAYPSPFPLP 283 Query: 2489 VRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSPPPGIDQDKQSDG 2316 G+ A G +S T++PP G D+ K+ Sbjct: 284 AHGMPNPSVSQIDAQPPGLSSVRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK-KEHVH 342 Query: 2315 NTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVS 2136 + S+ + +E D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPVQ TP+S Sbjct: 343 DVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPIS 402 Query: 
2135 TEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASF 1956 E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+ + N + Sbjct: 403 MEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQSVPNTNI 461 Query: 1955 GMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPAS 1776 ++KGS +SLS PAVNTGGR+A ALRTS SSSALD++KKKLQD+G P T+SP P S Sbjct: 462 VIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVS 520 Query: 1775 SVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXXXXSGPTKEECII 1599 S S+ NG VE KG Q+EN+K+KLKD NGDG M +GPTKEECII Sbjct: 521 SAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECII 580 Query: 1598 QFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXX 1419 +FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+ +ARRALFE YV+T Sbjct: 581 KFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRA 640 Query: 1418 XXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKK 1239 EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNERVLPLK+ Sbjct: 641 AQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKR 700 Query: 1238 AAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEEREVLFNE 1059 AAE+K Q RAAA SSFKSMLR+ GDI SSRWS+VKD LR+DPRYKSV+HE+REV+FNE Sbjct: 701 AAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNE 760 Query: 1058 YISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASY 879 Y+ EL EQ+KL RVRLKVRRKEAV S+ Sbjct: 761 YVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSF 820 Query: 878 QALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFR 699 QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER A +FR Sbjct: 821 QALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFR 880 Query: 698 ALLAEVITAETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQR 519 LLAEVITAE A Q T+DGK L SWS AKR+LKP+PRYSKMPRK+RE++W+R AEE+QR Sbjct: 881 GLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQR 940 Query: 518 RQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 408 + K S + E H + K++ S D R P+ R + R Sbjct: 941 KHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 977 >gb|KDO53044.1| 
hypothetical protein CISIN_1g002026mg [Citrus sinensis] gi|641834042|gb|KDO53045.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis] Length = 857 Score = 816 bits (2108), Expect = 0.0 Identities = 434/769 (56%), Positives = 520/769 (67%), Gaps = 3/769 (0%) Frame = -2 Query: 2705 SASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMP 2526 SA LRP HQ +YP Y S P + PQG L+PPQ+ P++P Sbjct: 92 SAGLRPSVPTPSAPSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLRPPQMGVRPWLPFLP 150 Query: 2525 YPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSP 2352 YP P+PFPLP G+ A G +S T++P Sbjct: 151 YPAAYPSPFPLPAHGMPNPSVSQIDAQPPGLSSVRTAAATSHSAIPGHQLVGTSGNTEAP 210 Query: 2351 PPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGE 2172 P G D+ K+ + S+ + +E D WTAHKT+TG VYYYN++TG+STYE+P+ FKGE Sbjct: 211 PSGTDK-KEHVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGE 269 Query: 2171 PDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDS 1992 PDKVPVQ TP+S E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+ Sbjct: 270 PDKVPVQPTPISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDT 329 Query: 1991 LQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDA 1812 L+ + N + ++KGS +SLS PAVNTGGR+A ALRTS SSSALD++KKKLQD+ Sbjct: 330 LK-EQSVPNTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDS 388 Query: 1811 GMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXX 1635 G P T+SP P SS S+ NG VE KG Q+EN+K+KLKD NGDG M Sbjct: 389 GTP-TASPAPVSSAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSED 447 Query: 1634 XXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVR 1455 +GPTKEECII+FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+ +ARRALFE YV+ Sbjct: 448 GETGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVK 507 Query: 1454 TXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDRE 1275 T EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE Sbjct: 508 TRAEEERKEKRAAQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRE 567 Query: 1274 ALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKS 1095 LLNERVLPLK+AAE+K Q RAAA SSFKSMLR+ 
GDI SSRWS+VKD LR+DPRYKS Sbjct: 568 LLLNERVLPLKRAAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKS 627 Query: 1094 VKHEEREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVR 915 V+HE+REV+FNEY+ EL EQ+KL RVR Sbjct: 628 VRHEDREVIFNEYVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVR 687 Query: 914 LKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHV 735 LKVRRKEAV S+QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+ Sbjct: 688 LKVRRKEAVTSFQALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHI 747 Query: 734 KTLYERSAREFRALLAEVITAETAVQMTDDGKNALTSWSEAKRLLKPDPRYSKMPRKDRE 555 KTLYER A +FR LLAEVITAE A Q T+DGK L SWS AKR+LKP+PRYSKMPRK+RE Sbjct: 748 KTLYERCAHDFRGLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKERE 807 Query: 554 SIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 408 ++W+R AEE+QR+ K S + E H + K++ S D R P+ R + R Sbjct: 808 ALWRRHAEEIQRKHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 856