BLASTX nr result
ID: Cinnamomum25_contig00006944
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum25_contig00006944
         (3811 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_010250268.1| PREDICTED: pre-mRNA-processing protein 40C i...  1042   0.0
ref|XP_010906097.1| PREDICTED: pre-mRNA-processing protein 40C i...  1004   0.0
ref|XP_010906099.1| PREDICTED: pre-mRNA-processing protein 40C i...   999   0.0
ref|XP_010906098.1| PREDICTED: pre-mRNA-processing protein 40C i...   981   0.0
ref|XP_011624657.1| PREDICTED: pre-mRNA-processing protein 40C [...   943   0.0
ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C i...   929   0.0
ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C i...   920   0.0
ref|XP_010906101.1| PREDICTED: pre-mRNA-processing protein 40C i...   907   0.0
ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C i...   905   0.0
ref|XP_009388080.1| PREDICTED: pre-mRNA-processing protein 40C [...   904   0.0
ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C i...   898   0.0
ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C i...   882   0.0
ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [...   843   0.0
gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium r...   838   0.0
gb|KJB15269.1| hypothetical protein B456_002G167700 [Gossypium r...   838   0.0
ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma c...   831   0.0
ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-l...   828   0.0
gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sin...   827   0.0
ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citr...
827 0.0 gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sin... 824 0.0 >ref|XP_010250268.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Nelumbo nucifera] gi|719963615|ref|XP_010250275.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Nelumbo nucifera] Length = 1088 Score = 1042 bits (2694), Expect = 0.0 Identities = 600/1109 (54%), Positives = 702/1109 (63%), Gaps = 27/1109 (2%) Frame = -2 Query: 3651 QSSIPGMTPQAPVSGPTVAPS--IQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTA 3478 QSS G+T QA G PS S ++EP+ +S+RAKF+T Sbjct: 8 QSSASGITAQASGLGQATGPSNPTVASPAPVSGPSNPKGPSGTTNEPAQESIRAKFITGP 67 Query: 3477 GFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFS 3298 G+VVPAPSF YSVI +PA+ P SA A QP +P QS S P+FS Sbjct: 68 GYVVPAPSFSYSVIPKQNTASGSSLENSSSPALVSNQPASATAFQPSIPGQSLSSGPTFS 127 Query: 3297 YNLISQPNVGSANGQQLQTGTVTGPGNI---QVGKFVPPNTAASLQPPVPGRP---NQFV 3136 YN+I +GS+ Q+LQ+ T G G + QVG P TAASLQPPVPG+P N F Sbjct: 128 YNIIPPAKIGSSAQQKLQSSTDVGSGPLGHSQVGNSTPSTTAASLQPPVPGQPGHPNTFG 187 Query: 3135 PGTIPQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQ 2956 PGT Q M + SP+SVPKG PSI QL Q SSN+SAS AVA+ Sbjct: 188 PGTGAQFMASQGPSPVSVPKGAPSIATSFSFNRIPQLA------QKDLSSNSSASVAVAR 241 Query: 2955 ETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXX 2776 E GTV ASSSS ++P +VS SS + +P++ P T+W Sbjct: 242 EAGTVSPASSSSVPVSMPFHVSPSS-LAAATSPNLCPATLWMPVAPSFVPPPGMPITPGT 300 Query: 2775 XXXXXXXXXXXXXXA-NARPAAMDPSAS--LRPMXXXXXXXXXXXXXXVHQQLYPPYHSQ 2605 AMD S+S LRP+ QQ++ PY + Sbjct: 301 PGPPGIAPSTPLSSTVTVNSEAMDSSSSTSLRPVVPSTV----------QQQMHSPYPAL 350 Query: 2604 PAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXX 2425 P+M P PQG WL PPQ+ GLQRPP++PYP LP +PLP+RG+ Sbjct: 351 PSMPPPPQGLWL-PPQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISP 409 Query: 2424 XXXXXXXXXXAGSGQPTSSVGT------------QSPPPGIDQDKQSDGNTSTNGEIAKS 2281 G P+SSVG+ PPPG DQ K D G + Sbjct: 410 LGPP--------GGTPSSSVGSVHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNA 461 Query: 2280 
EDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVL 2101 + D WTAHKTETG VYYYN+LTG+STYERPS F GEPDKV VQ TPVS EKLVGTDW L Sbjct: 462 K-VDAWTAHKTETGVVYYYNALTGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWAL 520 Query: 2100 VSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS-QNASFGMDKGSAPVS 1924 V+TNDGKKYY+N+KTK+SSWQ+P+EV ELR++ D D+L+ +MT QN+ +K SAP+S Sbjct: 521 VTTNDGKKYYYNSKTKISSWQVPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPIS 580 Query: 1923 LSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNG 1744 ++ PA+NTGGREA +LR SG SSSALD++KKKLQD+ P TSSPLP SS P T+DLNG Sbjct: 581 VTAPAINTGGREATSLRPSGVAGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNG 640 Query: 1743 LGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERG 1567 PVEA KG QSEN K+K+KD NGDGN+ GP+KEECIIQFKEMLKERG Sbjct: 641 SRPVEAAVKGLQSEN-KDKVKDINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERG 699 Query: 1566 VAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFK 1387 VAPFSKWEKELPKI+FDPRFKAVPGY+ARRALFEHYVRT EGFK Sbjct: 700 VAPFSKWEKELPKIVFDPRFKAVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFK 759 Query: 1386 QLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQR 1207 QLLEEASEDID +TDY +FK KWGSDPRFEALDRK+RE LLNERVLPLKKAAE+K Q R Sbjct: 760 QLLEEASEDIDQRTDYQTFKMKWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIR 819 Query: 1206 AAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXX 1027 AAA S FKS+LR+ GDINTSSRWSRVKDSLR+DPRYKSVKHEDRE+LFNEYISEL Sbjct: 820 AAAASGFKSLLREKGDINTSSRWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADE 879 Query: 1026 XXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKD 847 E+DKL RVRLKV+RKEAVA YQALLVETIKD Sbjct: 880 EAEREAKVKREEEDKLKEREREMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKD 939 Query: 846 PKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAE 667 P+ SWTES P+LEKDPQGRA+N LD D EKLFREHVK LYER AREFR LL EVIT E Sbjct: 940 PQVSWTESRPRLEKDPQGRATNSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTE 999 Query: 666 TAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK-PSDSKE 490 A+QMT+DGK VLTSWS AKRLLK DPRYSKMPRK+RE++W+R AEE+ ++K SD KE Sbjct: 1000 
AASQMTNDGKTVLTSWSTAKRLLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKE 1059 Query: 489 EKPHSEFKNKISADSERSPAP-RRTHSRR 406 EK + E K + S DS RSP RR+HSRR Sbjct: 1060 EKLNIETKARSSLDSGRSPTGLRRSHSRR 1088 >ref|XP_010906097.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Elaeis guineensis] Length = 1097 Score = 1004 bits (2596), Expect = 0.0 Identities = 566/1091 (51%), Positives = 676/1091 (61%), Gaps = 14/1091 (1%) Frame = -2 Query: 3639 PGMTPQAPVSGPTVAPS-----IQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTAG 3475 P T PV+ P+ S + VS N++ P+ D VRAKF T+ G Sbjct: 50 PATTAITPVTSPSFMDSGPSLTVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQG 109 Query: 3474 FVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFSY 3295 FVVPAPSF Y V +P ++ +PP A ALQPPVP Q G+ PSFSY Sbjct: 110 FVVPAPSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSY 169 Query: 3294 NLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTI 3124 N++S N GSA GQQ Q T T N+Q G+F PP TAASLQPPVP P VPG I Sbjct: 170 NVVSNANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAI 229 Query: 3123 PQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGT 2944 + PA MQ P+S+P G A + S TS + AQ + T Sbjct: 230 TPSCPAPMQLPLSIPTGTSD----------------AVVTEAGTSITTSIDSQSAQLSAT 273 Query: 2943 VPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXX 2764 VP++SS++ ++SS +++P+ PS Sbjct: 274 VPSSSSTASGINPN---ANSSGILMPSTPSF------------TGHPGMPGLAGTPGLPG 318 Query: 2763 XXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPA 2599 ++PA +PS LRPM QQ Y PY S P Sbjct: 319 IPNSATVSSTVTSQPAGTNPSP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPG 377 Query: 2598 MAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXX 2419 P PQ WL PPQ GLQR P++PY LPAPF LPV G+ Sbjct: 378 TIPPPQALWLHPPQAGGLQRAPFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVA 437 Query: 2418 XXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETG 2239 GS Q S+VG +SP GID +K ++ + +GE K+E+AD WTAHKTE+G Sbjct: 438 NQGPASTTMGSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESG 496 Query: 2238 
AVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTK 2059 VYYYNS+TG+STYERPSSF GEP+ V QSTPVS EKL GT+W LV+TNDG+KYY++TK Sbjct: 497 VVYYYNSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTK 556 Query: 2058 TKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMA 1879 KVSSWQ+P EV ELRK Q+SD+L+ + + + DKGSAP+S+S PAV TGGR++MA Sbjct: 557 NKVSSWQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMA 614 Query: 1878 LRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSEN 1699 LRTSGA SSSALD+VKKKLQDAG PVTSSP+P PV SDLNG VE KGQQ N Sbjct: 615 LRTSGAAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTN 673 Query: 1698 SKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKIL 1522 SK+K+KD DGNM GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+ Sbjct: 674 SKDKVKD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIV 730 Query: 1521 FDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTD 1342 FDPRFKAVP Y+AR+ +FEH+VRT + FKQLLEEASE+IDHKTD Sbjct: 731 FDPRFKAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTD 790 Query: 1341 YHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSG 1162 Y +FKRKWGSDPRF LDRK+RE LLNE+V KAAE+K+Q R AAV+SFKSMLRD+ Sbjct: 791 YQTFKRKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNK 846 Query: 1161 DINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDK 982 DI T+SRWSRVK++LRNDPRYK+VKHE+R LFNEYISEL EQ+K Sbjct: 847 DITTTSRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEK 906 Query: 981 LXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKD 802 L RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKD Sbjct: 907 LKEREREMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKD 966 Query: 801 PQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTS 622 PQGRA+NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE AAQ TDDGK +L S Sbjct: 967 PQGRATNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNS 1026 Query: 621 WSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSE 442 WSEAKRLLKPDPRYSKMP KDRE +W+R+AE+M R+QKP+ +EKP ++ +N+ S+D Sbjct: 1027 
WSEAKRLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFS 1086 Query: 441 RSPAPRRTHSR 409 R +PRR+H R Sbjct: 1087 RR-SPRRSHGR 1096 >ref|XP_010906099.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Elaeis guineensis] Length = 1055 Score = 999 bits (2583), Expect = 0.0 Identities = 563/1086 (51%), Positives = 671/1086 (61%), Gaps = 9/1086 (0%) Frame = -2 Query: 3639 PGMTPQAPVSGPTVAPS-----IQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTAG 3475 P T PV+ P+ S + VS N++ P+ D VRAKF T+ G Sbjct: 50 PATTAITPVTSPSFMDSGPSLTVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQG 109 Query: 3474 FVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFSY 3295 FVVPAPSF Y V +P ++ +PP A ALQPPVP Q G+ PSFSY Sbjct: 110 FVVPAPSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSY 169 Query: 3294 NLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTI 3124 N++S N GSA GQQ Q T T N+Q G+F PP TAASLQPPVP P VPG I Sbjct: 170 NVVSNANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAI 229 Query: 3123 PQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGT 2944 + PA MQ P+S+P G A + S TS + AQ + T Sbjct: 230 TPSCPAPMQLPLSIPTGTSD----------------AVVTEAGTSITTSIDSQSAQLSAT 273 Query: 2943 VPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXX 2764 VP+ SSS+ S + V + + P P V P Sbjct: 274 VPS-SSSTASVSSTVTSQPAGTNPSPLRPMVPP--------------------------- 305 Query: 2763 XXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTP 2584 P ++ P+++ P+ QQ Y PY S P P P Sbjct: 306 --------------PVSLPPTSTPVPVQQNI-----------QQQFYQPYPSLPGTIPPP 340 Query: 2583 QGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXX 2404 Q WL PPQ GLQR P++PY LPAPF LPV G+ Sbjct: 341 QALWLHPPQAGGLQRAPFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVANQGPA 400 Query: 2403 XXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYY 2224 GS Q S+VG +SP GID +K ++ + +GE K+E+AD WTAHKTE+G VYYY Sbjct: 401 STTMGSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESGVVYYY 459 Query: 2223 NSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSS 2044 NS+TG+STYERPSSF 
GEP+ V QSTPVS EKL GT+W LV+TNDG+KYY++TK KVSS Sbjct: 460 NSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVSS 519 Query: 2043 WQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSG 1864 WQ+P EV ELRK Q+SD+L+ + + + DKGSAP+S+S PAV TGGR++MALRTSG Sbjct: 520 WQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMALRTSG 577 Query: 1863 AMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSENSKEKL 1684 A SSSALD+VKKKLQDAG PVTSSP+P PV SDLNG VE KGQQ NSK+K+ Sbjct: 578 AAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTNSKDKV 636 Query: 1683 KDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRF 1507 KD DGNM GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPRF Sbjct: 637 KD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPRF 693 Query: 1506 KAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFK 1327 KAVP Y+AR+ +FEH+VRT + FKQLLEEASE+IDHKTDY +FK Sbjct: 694 KAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTFK 753 Query: 1326 RKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTS 1147 RKWGSDPRF LDRK+RE LLNE+V KAAE+K+Q R AAV+SFKSMLRD+ DI T+ Sbjct: 754 RKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITTT 809 Query: 1146 SRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXX 967 SRWSRVK++LRNDPRYK+VKHE+R LFNEYISEL EQ+KL Sbjct: 810 SRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKERE 869 Query: 966 XXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRA 787 RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKDPQGRA Sbjct: 870 REMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGRA 929 Query: 786 SNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTSWSEAK 607 +NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE AAQ TDDGK +L SWSEAK Sbjct: 930 TNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEAK 989 Query: 606 RLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAP 427 RLLKPDPRYSKMP KDRE +W+R+AE+M R+QKP+ +EKP ++ +N+ S+D R +P Sbjct: 990 RLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFSRR-SP 1048 Query: 426 RRTHSR 409 RR+H R Sbjct: 
1049 RRSHGR 1054 >ref|XP_010906098.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Elaeis guineensis] Length = 1066 Score = 981 bits (2537), Expect = 0.0 Identities = 558/1091 (51%), Positives = 668/1091 (61%), Gaps = 14/1091 (1%) Frame = -2 Query: 3639 PGMTPQAPVSGPTVAPS-----IQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTAG 3475 P T PV+ P+ S + VS N++ P+ D VRAKF T+ G Sbjct: 50 PATTAITPVTSPSFMDSGPSLTVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQG 109 Query: 3474 FVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFSY 3295 FVVPAPSF Y V +P ++ +PP A ALQPPVP Q G+ PSFSY Sbjct: 110 FVVPAPSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSY 169 Query: 3294 NLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTI 3124 N++S N GSA GQQ Q T T N+Q G+F PP TAASLQPPVP P VPG I Sbjct: 170 NVVSNANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAI 229 Query: 3123 PQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGT 2944 + PA MQ P+S+P G A + S TS + AQ + T Sbjct: 230 TPSCPAPMQLPLSIPTGTSD----------------AVVTEAGTSITTSIDSQSAQLSAT 273 Query: 2943 VPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXX 2764 VP++SS++ ++SS +++P+ PS Sbjct: 274 VPSSSSTASGINPN---ANSSGILMPSTPSF------------TGHPGMPGLAGTPGLPG 318 Query: 2763 XXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPA 2599 ++PA +PS LRPM QQ Y PY S P Sbjct: 319 IPNSATVSSTVTSQPAGTNPSP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPG 377 Query: 2598 MAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXX 2419 P PQ WL PPQ GLQR P++PY P + Sbjct: 378 TIPPPQALWLHPPQAGGLQRAPFLPYSVANQGPASTTM---------------------- 415 Query: 2418 XXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETG 2239 GS Q S+VG +SP GID +K ++ + +GE K+E+AD WTAHKTE+G Sbjct: 416 ---------GSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESG 465 Query: 2238 AVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTK 2059 VYYYNS+TG+STYERPSSF GEP+ V QSTPVS EKL GT+W LV+TNDG+KYY++TK Sbjct: 466 VVYYYNSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTK 525 Query: 2058 
TKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMA 1879 KVSSWQ+P EV ELRK Q+SD+L+ + + + DKGSAP+S+S PAV TGGR++MA Sbjct: 526 NKVSSWQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMA 583 Query: 1878 LRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSEN 1699 LRTSGA SSSALD+VKKKLQDAG PVTSSP+P PV SDLNG VE KGQQ N Sbjct: 584 LRTSGAAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTN 642 Query: 1698 SKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKIL 1522 SK+K+KD DGNM GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+ Sbjct: 643 SKDKVKD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIV 699 Query: 1521 FDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTD 1342 FDPRFKAVP Y+AR+ +FEH+VRT + FKQLLEEASE+IDHKTD Sbjct: 700 FDPRFKAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTD 759 Query: 1341 YHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSG 1162 Y +FKRKWGSDPRF LDRK+RE LLNE+V KAAE+K+Q R AAV+SFKSMLRD+ Sbjct: 760 YQTFKRKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNK 815 Query: 1161 DINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDK 982 DI T+SRWSRVK++LRNDPRYK+VKHE+R LFNEYISEL EQ+K Sbjct: 816 DITTTSRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEK 875 Query: 981 LXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKD 802 L RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKD Sbjct: 876 LKEREREMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKD 935 Query: 801 PQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTS 622 PQGRA+NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE AAQ TDDGK +L S Sbjct: 936 PQGRATNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNS 995 Query: 621 WSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSE 442 WSEAKRLLKPDPRYSKMP KDRE +W+R+AE+M R+QKP+ +EKP ++ +N+ S+D Sbjct: 996 WSEAKRLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFS 1055 Query: 441 RSPAPRRTHSR 409 R +PRR+H R Sbjct: 1056 RR-SPRRSHGR 1065 >ref|XP_011624657.1| PREDICTED: pre-mRNA-processing protein 40C [Amborella trichopoda] Length = 1085 
Score = 943 bits (2438), Expect = 0.0 Identities = 547/1106 (49%), Positives = 672/1106 (60%), Gaps = 24/1106 (2%) Frame = -2 Query: 3651 QSSIPGMTPQAPVSGPTVAPSIQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTAGF 3472 Q S PG+ PQ G T ++ +SVRAKFV + G+ Sbjct: 12 QPSAPGVPPQPLTPGQTTTGG---------PPGPSPPIPRPQNDQPQESVRAKFVASPGY 62 Query: 3471 VVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTP---PTSAAALQPPVPRQSSGSVPSF 3301 ++PAPSF Y V+ P P P SA ++QPPVP S+ S SF Sbjct: 63 ILPAPSFSYGVVSQNNNA----------PRASLPPQSTPLSAVSVQPPVPGHSATSGASF 112 Query: 3300 SYNLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR------PNQF 3139 SY++ S SA T +Q GK P +AASLQPPVPG+ PN + Sbjct: 113 SYSVASHATTTSA----------TASNPMQGGKPAGPTSAASLQPPVPGQSSVSVHPNSW 162 Query: 3138 VPGTIPQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVA 2959 P QN A + P V KG PS V++E Q Q+SN+ ASAAVA Sbjct: 163 DPERPVQNALAQARPPFLVRKGPPSTSGFSFSGNSQS--VSSEDSQKHQASNSDASAAVA 220 Query: 2958 QETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVY--PMTIWTQXXXXXXXXXXXXXX 2785 QE T +SS++Q+T LP SS++S V ++P+ Y P + Sbjct: 221 QEAKTSQPSSSTAQTTPLPA-PSSTTSRPVSSSPNTYATPFYMPKAPPFPGPPRLPVTPG 279 Query: 2784 XXXXXXXXXXXXXXXXXANARPAAMDP-SASLRPMXXXXXXXXXXXXXXVHQQ-----LY 2623 N RP+ +D SA +RP Q +Y Sbjct: 280 TPGPPGIALSAPQLSSSVNIRPSVIDTNSAIMRPNIASSAPGTSNAASVPITQTAQPPIY 339 Query: 2622 PPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXX 2443 PY + P + P PQ W+ P Q+ GLQRPP++PYP P PFP+P+R I Sbjct: 340 SPYPTLPGVVPPPQAMWMHPSQMGGLQRPPFLPYPGTFPGPFPMPLRPITVPPVAMPDSS 399 Query: 2442 XXXXXXXXXXXXXXXXA--GSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDAD 2269 A G+G + QSPPPGID++K + T+ + +ED D Sbjct: 400 QPPGVSPIGPPGGIPLADHGAGIQVTISEEQSPPPGIDKEKDTIDYTNKDDNAVSNEDTD 459 Query: 2268 LWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTN 2089 WTAHKT+TGAVYYYN+LTG+STYE+P FKGE DKV +Q TPVS EKLVGTDW LV+TN Sbjct: 460 QWTAHKTDTGAVYYYNALTGESTYEKPPGFKGEVDKVILQRTPVSWEKLVGTDWALVATN 519 Query: 2088 DGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDS-LQTSMTSQNASFGMDKGSAPVSLSVP 1912 DGKKYY+NTK+K+SSWQ+P EVAELRK+Q++D+ L+ + QNA DKGS SLS P Sbjct: 520 
DGKKYYYNTKSKISSWQVPPEVAELRKKQEADAALKANAPVQNAGISSDKGSVSSSLSAP 579 Query: 1911 AVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASS-VPVTSDLNGLGP 1735 A+NTGGREAM +++ A SSSALD++KKKLQD+GMPVTSS LP+S+ VP TSD NG Sbjct: 580 AINTGGREAMTFKSATAPVSSSALDLIKKKLQDSGMPVTSSALPSSTPVPTTSDANGQRV 639 Query: 1734 VEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAP 1558 V+ KGQQSENSK+KLK A G++ GPTKEEC+IQFKEMLKE+G+AP Sbjct: 640 VDTTVKGQQSENSKDKLKVAQEVGHVSDSSSDSEDVDSGPTKEECVIQFKEMLKEKGIAP 699 Query: 1557 FSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLL 1378 FSKWEKELPKILFDPRFKA+PGYT RR+LFEH+VRT EGFKQLL Sbjct: 700 FSKWEKELPKILFDPRFKAIPGYTERRSLFEHFVRTRAEEERKEKRAAQKAAIEGFKQLL 759 Query: 1377 EEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAA 1198 E ASEDI+HKTDY +FK+KWG DPRF ALDRK+RE LLNERVLPL+KA E+K Q RAAA Sbjct: 760 EGASEDINHKTDYETFKKKWGYDPRFVALDRKEREMLLNERVLPLRKAVEEKTQAIRAAA 819 Query: 1197 VSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXX 1018 V+SFKSML + DIN SRWS+VKDSLRNDPRYKSVKHEDREVLF EYISEL Sbjct: 820 VASFKSMLHEKVDINIGSRWSKVKDSLRNDPRYKSVKHEDREVLFLEYISELKAAEQEAD 879 Query: 1017 XXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKA 838 E++KL RVR K RRK+AV SYQALL E IKDPKA Sbjct: 880 RAAKAKREEEEKLKERERELRKRKEREEQEVERVRQKARRKDAVVSYQALLTERIKDPKA 939 Query: 837 SWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAA 658 SWTES PKLEKDP GRA+NP+L+ AD EKLFREHVK L ER AREFR+LLAEVIT E AA Sbjct: 940 SWTESKPKLEKDPLGRATNPELEPADMEKLFREHVKVLNERCAREFRSLLAEVITPEAAA 999 Query: 657 QMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK-PSDSKEEKP 481 Q ++DGK +L SWS AK+LL+PDPRY KMPR++RES+W+R+AE+M RRQ+ S+ KEEK Sbjct: 1000 QASEDGKTLLNSWSTAKKLLRPDPRYEKMPRRERESLWQRYAEDMDRRQRAASEQKEEKT 1059 Query: 480 HSEFKNKISADSER-SPAPRRTHSRR 406 + + ++ A S + SP+ RR+H R+ Sbjct: 1060 NIDDPSRRPAGSSKSSPSVRRSHGRK 1085 >ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Vitis vinifera] gi|297738259|emb|CBI27460.3| unnamed protein product [Vitis vinifera] Length = 1046 Score 
= 929 bits (2401), Expect = 0.0 Identities = 536/1091 (49%), Positives = 643/1091 (58%), Gaps = 7/1091 (0%) Frame = -2 Query: 3657 SSQSSIPGMTPQAPVSGPTVAPSIQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTA 3478 +SQ+ + G+ P GP P+ ++ +S +S + KFV Sbjct: 15 ASQNPVTGLPAGGPSGGPPT-PTGAIAPASVATIRTSEGASGTASNSIQESAQGKFVNAP 73 Query: 3477 GFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFS 3298 V+P PSF YS I + P S Q PVP SS S PSFS Sbjct: 74 PHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTPVPGPSSSSGPSFS 133 Query: 3297 YNLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGRPNQFVPGTIPQ 3118 YN I+ G Q Q+ T +I G P AAS Sbjct: 134 YN-IAHKGAGFPGSQPFQSST-----SIASGPRGPTPNAASFSF---------------- 171 Query: 3117 NMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGTVP 2938 G+P + Q Q+ + S AVAQE G++ Sbjct: 172 -------------NGNPQLV------------------QKDQTLKSDNSGAVAQEAGSMS 200 Query: 2937 AASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXXXX 2758 +AS SQS P SSS+M V ++P + P T+W Sbjct: 201 SASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPGI 257 Query: 2757 XXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTPQG 2578 A P+A +S + QQ+YP Y S PA + QG Sbjct: 258 APSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQG 317 Query: 2577 HWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXX 2398 WLQPPQ+ GL RPP++PYP P PFPLP G+ Sbjct: 318 PWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPI 377 Query: 2397 XAG-SGQP---TSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVY 2230 A SG TS + ++ PPPGID +K +G + +G A +E D WTAHKT+TG VY Sbjct: 378 SAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA-AVNEQVDAWTAHKTDTGVVY 436 Query: 2229 YYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKV 2050 YYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+NTKTK+ Sbjct: 437 YYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKL 496 Query: 2049 SSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALR 1873 SSWQ+P E+ E+RK+QDS +L+ +M + N + +KG +P++LS PAV TGGR+A LR Sbjct: 497 
SSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLR 556 Query: 1872 TSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSENSK 1693 TS S+SALDM+KKKLQD+G P TSSP+ +S P+ S+LNG +E KG QSENSK Sbjct: 557 TSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIASELNGSRVIEPTVKGLQSENSK 615 Query: 1692 EKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFD 1516 +KLKD NGDGNM GPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FD Sbjct: 616 DKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFD 675 Query: 1515 PRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYH 1336 PRFKA+PGY+ARR+LFEHYVRT EGFKQLLEEASEDIDHKT+Y Sbjct: 676 PRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQ 735 Query: 1335 SFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDI 1156 +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K Q RAAAVSSFKSMLRD GDI Sbjct: 736 TFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDI 795 Query: 1155 NTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLX 976 TS+RWSRVKDSLRNDPRYK VKHEDRE+LFNEYISEL EQDKL Sbjct: 796 TTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLK 855 Query: 975 XXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQ 796 RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PKLEKDPQ Sbjct: 856 ERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQ 915 Query: 795 GRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTSWS 616 RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK VLTSWS Sbjct: 916 ARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWS 975 Query: 615 EAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERS 436 AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK + + E+ H+E K + S DS R Sbjct: 976 TAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDSGRF 1035 Query: 435 PA-PRRTHSRR 406 P+ RR H RR Sbjct: 1036 PSGSRRAHERR 1046 >ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Nelumbo nucifera] Length = 894 Score = 920 bits (2378), Expect = 0.0 Identities = 521/922 (56%), Positives = 604/922 (65%), Gaps = 19/922 (2%) Frame = -2 
Query: 3114 MPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGTVPA 2935 M + SP+SVPKG PSI QL Q SSN+SAS AVA+E GTV Sbjct: 1 MASQGPSPVSVPKGAPSIATSFSFNRIPQLA------QKDLSSNSSASVAVAREAGTVSP 54 Query: 2934 ASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXXXXX 2755 ASSSS ++P +VS SS + +P++ P T+W Sbjct: 55 ASSSSVPVSMPFHVSPSS-LAAATSPNLCPATLWMPVAPSFVPPPGMPITPGTPGPPGIA 113 Query: 2754 XXXXXXXA-NARPAAMDPSAS--LRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTP 2584 AMD S+S LRP+ QQ++ PY + P+M P P Sbjct: 114 PSTPLSSTVTVNSEAMDSSSSTSLRPVVPSTV----------QQQMHSPYPALPSMPPPP 163 Query: 2583 QGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXX 2404 QG WL PPQ+ GLQRPP++PYP LP +PLP+RG+ Sbjct: 164 QGLWL-PPQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISPLGPP--- 219 Query: 2403 XXXAGSGQPTSSVGT------------QSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWT 2260 G P+SSVG+ PPPG DQ K D G ++ D WT Sbjct: 220 -----GGTPSSSVGSVHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNAK-VDAWT 273 Query: 2259 AHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGK 2080 AHKTETG VYYYN+LTG+STYERPS F GEPDKV VQ TPVS EKLVGTDW LV+TNDGK Sbjct: 274 AHKTETGVVYYYNALTGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWALVTTNDGK 333 Query: 2079 KYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS-QNASFGMDKGSAPVSLSVPAVN 1903 KYY+N+KTK+SSWQ+P+EV ELR++ D D+L+ +MT QN+ +K SAP+S++ PA+N Sbjct: 334 KYYYNSKTKISSWQVPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPISVTAPAIN 393 Query: 1902 TGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAI 1723 TGGREA +LR SG SSSALD++KKKLQD+ P TSSPLP SS P T+DLNG PVEA Sbjct: 394 TGGREATSLRPSGVAGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNGSRPVEAA 453 Query: 1722 AKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKW 1546 KG QSEN K+K+KD NGDGN+ GP+KEECIIQFKEMLKERGVAPFSKW Sbjct: 454 VKGLQSEN-KDKVKDINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERGVAPFSKW 512 Query: 1545 EKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEAS 1366 EKELPKI+FDPRFKAVPGY+ARRALFEHYVRT EGFKQLLEEAS Sbjct: 513 EKELPKIVFDPRFKAVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFKQLLEEAS 572 Query: 1365 
EDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSF 1186 EDID +TDY +FK KWGSDPRFEALDRK+RE LLNERVLPLKKAAE+K Q RAAA S F Sbjct: 573 EDIDQRTDYQTFKMKWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIRAAAASGF 632 Query: 1185 KSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXX 1006 KS+LR+ GDINTSSRWSRVKDSLR+DPRYKSVKHEDRE+LFNEYISEL Sbjct: 633 KSLLREKGDINTSSRWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADEEAEREAK 692 Query: 1005 XXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTE 826 E+DKL RVRLKV+RKEAVA YQALLVETIKDP+ SWTE Sbjct: 693 VKREEEDKLKEREREMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKDPQVSWTE 752 Query: 825 SNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTD 646 S P+LEKDPQGRA+N LD D EKLFREHVK LYER AREFR LL EVIT E A+QMT+ Sbjct: 753 SRPRLEKDPQGRATNSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTEAASQMTN 812 Query: 645 DGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK-PSDSKEEKPHSEF 469 DGK VLTSWS AKRLLK DPRYSKMPRK+RE++W+R AEE+ ++K SD KEEK + E Sbjct: 813 DGKTVLTSWSTAKRLLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKEEKLNIET 872 Query: 468 KNKISADSERSPAP-RRTHSRR 406 K + S DS RSP RR+HSRR Sbjct: 873 KARSSLDSGRSPTGLRRSHSRR 894 >ref|XP_010906101.1| PREDICTED: pre-mRNA-processing protein 40C isoform X5 [Elaeis guineensis] Length = 916 Score = 907 bits (2345), Expect = 0.0 Identities = 509/950 (53%), Positives = 605/950 (63%), Gaps = 9/950 (0%) Frame = -2 Query: 3231 TGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTIPQNMPASMQSPISVPKGHPSI 3061 T N+Q G+F PP TAASLQPPVP P VPG I + PA MQ P+S+P G Sbjct: 10 TNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAITPSCPAPMQLPLSIPTGTSD- 68 Query: 3060 XXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGTVPAASSSSQSTALPVYVSSSS 2881 A + S TS + AQ + TVP++SS++ ++SS Sbjct: 69 ---------------AVVTEAGTSITTSIDSQSAQLSATVPSSSSTASGINPN---ANSS 110 Query: 2880 SMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPS 2701 +++P+ PS ++PA +PS Sbjct: 111 GILMPSTPSF------------TGHPGMPGLAGTPGLPGIPNSATVSSTVTSQPAGTNPS 158 Query: 2700 ASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRP 2536 LRPM QQ Y PY S P P 
PQ WL PPQ GLQR Sbjct: 159 P-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPGTIPPPQALWLHPPQAGGLQRA 217 Query: 2535 PYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQ 2356 P++PY LPAPF LPV G+ GS Q S+VG + Sbjct: 218 PFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVANQGPASTTMGSSQSGSNVGIE 277 Query: 2355 SPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFK 2176 SP GID +K ++ + +GE K+E+AD WTAHKTE+G VYYYNS+TG+STYERPSSF Sbjct: 278 SPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESGVVYYYNSVTGESTYERPSSFN 336 Query: 2175 GEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDS 1996 GEP+ V QSTPVS EKL GT+W LV+TNDG+KYY++TK KVSSWQ+P EV ELRK Q+S Sbjct: 337 GEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVSSWQVPAEVLELRKSQES 396 Query: 1995 DSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQ 1816 D+L+ + + + DKGSAP+S+S PAV TGGR++MALRTSGA SSSALD+VKKKLQ Sbjct: 397 DALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMALRTSGAAVSSSALDLVKKKLQ 454 Query: 1815 DAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXX 1636 DAG PVTSSP+P PV SDLNG VE KGQQ NSK+K+KD DGNM Sbjct: 455 DAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTNSKDKVKD---DGNMSDSSSDS 510 Query: 1635 XXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHY 1459 GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPRFKAVP Y+AR+ +FEH+ Sbjct: 511 DDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAVPSYSARKTIFEHF 570 Query: 1458 VRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKD 1279 VRT + FKQLLEEASE+IDHKTDY +FKRKWGSDPRF LDRK+ Sbjct: 571 VRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTFKRKWGSDPRFGVLDRKE 630 Query: 1278 REALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRY 1099 RE LLNE+V KAAE+K+Q R AAV+SFKSMLRD+ DI T+SRWSRVK++LRNDPRY Sbjct: 631 RELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITTTSRWSRVKENLRNDPRY 686 Query: 1098 KSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXR 919 K+VKHE+R LFNEYISEL EQ+KL R Sbjct: 687 KAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKEREREMRKRKEREEQEMER 746 Query: 918 VRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFRE 739 
VRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKDPQGRA+NPDL + D EKLFR+ Sbjct: 747 VRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGRATNPDLGQGDAEKLFRD 806 Query: 738 HVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKD 559 HVK LYER AR FR LL+EVITAE AAQ TDDGK +L SWSEAKRLLKPDPRYSKMP KD Sbjct: 807 HVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEAKRLLKPDPRYSKMPGKD 866 Query: 558 RESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSR 409 RE +W+R+AE+M R+QKP+ +EKP ++ +N+ S+D R +PRR+H R Sbjct: 867 REYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFSRR-SPRRSHGR 915 >ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Vitis vinifera] Length = 1013 Score = 905 bits (2338), Expect = 0.0 Identities = 512/976 (52%), Positives = 606/976 (62%), Gaps = 8/976 (0%) Frame = -2 Query: 3309 PSFSYNLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGRPNQFVPG 3130 PSFSY+ I S QQL +G+V P + Q PVPG Sbjct: 80 PSFSYSGIPHVTTASGTSQQLPSGSVISSN--------PLASTVVFQTPVPG-------- 123 Query: 3129 TIPQNMPASMQSP-ISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQE 2953 P+S P S H A S+T S AVAQE Sbjct: 124 ------PSSSSGPSFSYNIAHKG---------------AGFPGSQPFQSSTDNSGAVAQE 162 Query: 2952 TGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXX 2773 G++ +AS SQS P SSS+M V ++P + P T+W Sbjct: 163 AGSMSSASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTP 219 Query: 2772 XXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMA 2593 A P+A +S + QQ+YP Y S PA Sbjct: 220 GPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATN 279 Query: 2592 PTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXX 2413 + QG WLQPPQ+ GL RPP++PYP P PFPLP G+ Sbjct: 280 ASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTA 339 Query: 2412 XXXXXXAG-SGQP---TSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTE 2245 A SG TS + ++ PPPGID +K +G + +G A +E D WTAHKT+ Sbjct: 340 GGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA-AVNEQVDAWTAHKTD 398 Query: 2244 TGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHN 2065 TG VYYYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW 
LV+TNDGKKYY+N Sbjct: 399 TGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYN 458 Query: 2064 TKTKVSSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKGSAPVSLSVPAVNTGGRE 1888 TKTK+SSWQ+P E+ E+RK+QDS +L+ +M + N + +KG +P++LS PAV TGGR+ Sbjct: 459 TKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRD 518 Query: 1887 AMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQ 1708 A LRTS S+SALDM+KKKLQD+G P TSSP+ +S P+ S+LNG +E KG Q Sbjct: 519 ATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIASELNGSRVIEPTVKGLQ 577 Query: 1707 SENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELP 1531 SENSK+KLKD NGDGNM GPTKEECIIQFKEMLKERGVAPFSKWEKELP Sbjct: 578 SENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELP 637 Query: 1530 KILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDH 1351 KI+FDPRFKA+PGY+ARR+LFEHYVRT EGFKQLLEEASEDIDH Sbjct: 638 KIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDH 697 Query: 1350 KTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLR 1171 KT+Y +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K Q RAAAVSSFKSMLR Sbjct: 698 KTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLR 757 Query: 1170 DSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXE 991 D GDI TS+RWSRVKDSLRNDPRYK VKHEDRE+LFNEYISEL E Sbjct: 758 DKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEE 817 Query: 990 QDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKL 811 QDKL RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PKL Sbjct: 818 QDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKL 877 Query: 810 EKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNV 631 EKDPQ RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK V Sbjct: 878 EKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTV 937 Query: 630 LTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISA 451 LTSWS AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK + + E+ H+E K + S Sbjct: 938 LTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSV 997 Query: 450 DSERSPA-PRRTHSRR 406 DS R P+ RR H RR Sbjct: 
998 DSGRFPSGSRRAHERR 1013 Score = 63.9 bits (154), Expect = 1e-06 Identities = 73/301 (24%), Positives = 104/301 (34%), Gaps = 30/301 (9%) Frame = -2 Query: 3657 SSQSSIPGMTPQAPVSGPTVAPSIQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTA 3478 +SQ+ + G+ P GP P+ ++ +S +S + KFV Sbjct: 15 ASQNPVTGLPAGGPSGGPPT-PTGAIAPASVATIRTSEGASGTASNSIQESAQGKFVNAP 73 Query: 3477 GFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFS 3298 V+P PSF YS I + P S Q PVP SS S PSFS Sbjct: 74 PHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTPVPGPSSSSGPSFS 133 Query: 3297 YNLISQPNVGSANGQQLQTGTVTGPGNIQ----------VGKFVP----PNTAASLQPPV 3160 YN I+ G Q Q+ T Q V + VP +T + P Sbjct: 134 YN-IAHKGAGFPGSQPFQSSTDNSGAVAQEAGSMSSASHVSQSVPFPCSSSTMSVSSSPK 192 Query: 3159 PGRPNQFVPGT----IPQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVA-------- 3016 G ++P +P MP + +P G P I +P A Sbjct: 193 MGPTTLWMPSNPSFPVPSGMPVTPGTP-----GPPGIAPSTPLSSNLAVPSASMDFSSSV 247 Query: 3015 ---AESPQNKQSSNTSASAAVAQETGTVPAASSSSQSTAL-PVYVSSSSSMIVPAAPSVY 2848 A P SSN + + ++PA ++SSQ L P + P+VY Sbjct: 248 VSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPYPAVY 307 Query: 2847 P 2845 P Sbjct: 308 P 308 >ref|XP_009388080.1| PREDICTED: pre-mRNA-processing protein 40C [Musa acuminata subsp. 
malaccensis] Length = 1128 Score = 904 bits (2335), Expect = 0.0 Identities = 529/1063 (49%), Positives = 647/1063 (60%), Gaps = 24/1063 (2%) Frame = -2 Query: 3522 EPSNDSVRAKFVTTAGFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQ 3343 + S DS+RAKF + GFVV APSF Y VI + +K TPP AAALQ Sbjct: 87 DTSQDSIRAKFSSPPGFVVAAPSFSYGVIPRTNLTSGNPQQSSSS-GLKLTPPVPAAALQ 145 Query: 3342 PPVPRQSSGSVPSFSYNLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPP 3163 PPVP Q G+ P F YN++S NV A GQQ+Q TV ++Q GKF+PP+ A+SLQPP Sbjct: 146 PPVPGQFLGTRP-FPYNVVSHANVVPAAGQQIQLNTVPVQAHLQGGKFIPPS-ASSLQPP 203 Query: 3162 VPG---RPNQFVPGTIPQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQ 2992 VP RP F PG + P+ MQ P+SVP+G +Q A E + Sbjct: 204 VPRQPVRPTPFGPGAVSLISPSPMQFPLSVPQGDAIKQTNFSFSGHNQFSTA-EKDETIL 262 Query: 2991 SSNTSASAAVAQET----GTVPAASSSSQSTALPVYVSS---------SSSMIVPAAPSV 2851 SS S AVA ET T+ + S S ++P+ S+ ++SM++PAAPS Sbjct: 263 SSEKCTSDAVAVETTSDSSTLVNSQSVQTSQSMPLGTSTGLGINANACAASMLIPAAPSF 322 Query: 2850 YPMTIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARP-----AAMDPSASLRP 2686 ++ RP AA+ P+++ P Sbjct: 323 TAHAEMPNARGIPGLTGNSSSATASTGATIKPTPTNSSISSPRPIIPVTAALPPTSTSVP 382 Query: 2685 MXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLP 2506 + QQ Y SQP MAP+PQ W PPQ +Q + PYP P Sbjct: 383 VPFPVPQNV-------QQQTNVHYSSQPTMAPSPQASWSHPPQAGPMQHVSFSPYPGFFP 435 Query: 2505 APFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDK 2326 APF LPV+GI AGS QP SS+ +S +DQDK Sbjct: 436 APFSLPVQGIPPAVPLPFIQPPGVSLMVSQVEPTAVTAGSLQPGSSMVAESSSSVVDQDK 495 Query: 2325 QSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQS 2146 +S+ G+ + +E + WTAHKTETGAVYYYNS+TG+STY++PS+FKGE +K QS Sbjct: 496 KSNNLDKDEGDTS-NELENAWTAHKTETGAVYYYNSITGKSTYQKPSNFKGESEKATTQS 554 Query: 2145 TPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS- 1969 VS EKL GTDW +V+T+DG+KYY++TK KVSSW +P EVAELRK Q+S S + S T Sbjct: 555 NAVSWEKLAGTDWTIVTTSDGRKYYYDTKNKVSSWHVPAEVAELRKNQESGSTEGSATQL 614 Query: 1968 QNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSS 1789 Q+AS DK SAP +++ 
PA G ++MALR+SGA SSSALDMVKKKLQ+AG P+TS Sbjct: 615 QDASTQGDKVSAPANIAAPAAQIGAHDSMALRSSGAPVSSSALDMVKKKLQEAGTPMTSP 674 Query: 1788 PLPASSVPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTK 1612 ++SVP TSD NGL EA+AKG + K+K KDANG+GNM GP+K Sbjct: 675 H--STSVPATSDANGLKATEAVAKGVIN---KDKAKDANGEGNMSDSSSDSDDEESGPSK 729 Query: 1611 EECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXX 1432 EECIIQFKEMLKERGVAPFSKW+KELPKI+FDPRFKAVP +ARRALFEHYVRT Sbjct: 730 EECIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAVPSQSARRALFEHYVRTRAEEER 789 Query: 1431 XXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERV 1252 + FKQLLEEA EDIDHKTDYHSFKRKWG DPRFEA+DRK+RE LLNE+V Sbjct: 790 KEKRAAQKAALDAFKQLLEEALEDIDHKTDYHSFKRKWGGDPRFEAIDRKERELLLNEKV 849 Query: 1251 LPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDRE 1072 KAA++K++ R AA +SFKSMLRD+ DI TSSRWSR+K+SLR+DPRYK+VKHE RE Sbjct: 850 ----KAADEKMRALRMAAATSFKSMLRDNRDITTSSRWSRIKESLRDDPRYKAVKHEQRE 905 Query: 1071 VLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKE 892 LFNEYI+EL EQDKL RV+LKVRRKE Sbjct: 906 TLFNEYIAELKSAVDEVERSAKAKRDEQDKLKERERELRKRKEREEKEMERVKLKVRRKE 965 Query: 891 AVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERS 712 A SY+ LLVE IKDPKASWTES PKLEKDPQGRA+NPDL + D EKLFREHVK LYER Sbjct: 966 AEYSYRTLLVEMIKDPKASWTESKPKLEKDPQGRATNPDLTQEDAEKLFREHVKDLYERC 1025 Query: 711 AREFRALLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFA 532 +FR LLAEV+T E AA DDGK VL SWSEAK LLKPDPRYSKMP KDRES+W+R Sbjct: 1026 VNDFRTLLAEVVTVEAAAAKNDDGKTVLNSWSEAKLLLKPDPRYSKMPSKDRESLWRRHT 1085 Query: 531 EEMQRRQKPSDSKEEKPHSEFKNKISADSE-RSPAPRRTHSRR 406 E+M RR K +E P + +N++S+ ++ +P R+H RR Sbjct: 1086 EDMLRRPKSVSDTKESPGTNGRNRMSSAADPLKRSPGRSHRRR 1128 >ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Vitis vinifera] Length = 903 Score = 898 bits (2320), Expect = 0.0 Identities = 488/873 (55%), Positives = 578/873 (66%), Gaps = 7/873 (0%) Frame = -2 Query: 3003 QNKQSSNTSASAAVAQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQX 2824 Q Q+ 
+ S AVAQE G++ +AS SQS P SSS+M V ++P + P T+W Sbjct: 36 QKDQTLKSDNSGAVAQEAGSMSSASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPS 92 Query: 2823 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXX 2644 A P+A +S Sbjct: 93 NPSFPVPSGMPVTPGTPGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNP 152 Query: 2643 XVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXX 2464 + QQ+YP Y S PA + QG WLQPPQ+ GL RPP++PYP P PFPLP G+ Sbjct: 153 AIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPS 212 Query: 2463 XXXXXXXXXXXXXXXXXXXXXXXAG-SGQP---TSSVGTQSPPPGIDQDKQSDGNTSTNG 2296 A SG TS + ++ PPPGID +K +G + +G Sbjct: 213 VPLPDSQPPGVTPVGTAGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG 272 Query: 2295 EIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVG 2116 A +E D WTAHKT+TG VYYYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL G Sbjct: 273 A-AVNEQVDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTG 331 Query: 2115 TDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKG 1939 TDW LV+TNDGKKYY+NTKTK+SSWQ+P E+ E+RK+QDS +L+ +M + N + +KG Sbjct: 332 TDWALVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKG 391 Query: 1938 SAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVT 1759 +P++LS PAV TGGR+A LRTS S+SALDM+KKKLQD+G P TSSP+ +S P+ Sbjct: 392 PSPIALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIA 450 Query: 1758 SDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEM 1582 S+LNG +E KG QSENSK+KLKD NGDGNM GPTKEECIIQFKEM Sbjct: 451 SELNGSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEM 510 Query: 1581 LKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXX 1402 LKERGVAPFSKWEKELPKI+FDPRFKA+PGY+ARR+LFEHYVRT Sbjct: 511 LKERGVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAA 570 Query: 1401 XEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQK 1222 EGFKQLLEEASEDIDHKT+Y +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K Sbjct: 571 IEGFKQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEK 630 Query: 1221 IQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISEL 1042 Q 
RAAAVSSFKSMLRD GDI TS+RWSRVKDSLRNDPRYK VKHEDRE+LFNEYISEL Sbjct: 631 AQAIRAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISEL 690 Query: 1041 XXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLV 862 EQDKL RVRLKVRRKEAV+SYQALLV Sbjct: 691 KAAEEEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLV 750 Query: 861 ETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAE 682 ETIKDP+ SWTES PKLEKDPQ RA+N DLD +D EKLFREH+K L+ER A EFRALL+E Sbjct: 751 ETIKDPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSE 810 Query: 681 VITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPS 502 V+TAE A Q T+DGK VLTSWS AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK + Sbjct: 811 VLTAEAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLA 870 Query: 501 DSKEEKPHSEFKNKISADSERSPA-PRRTHSRR 406 + E+ H+E K + S DS R P+ RR H RR Sbjct: 871 QDQTEEKHTEVKGRSSVDSGRFPSGSRRAHERR 903 >ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C isoform X4 [Vitis vinifera] Length = 848 Score = 882 bits (2280), Expect = 0.0 Identities = 479/851 (56%), Positives = 565/851 (66%), Gaps = 7/851 (0%) Frame = -2 Query: 2937 AASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXXXX 2758 +AS SQS P SSS+M V ++P + P T+W Sbjct: 3 SASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPGI 59 Query: 2757 XXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTPQG 2578 A P+A +S + QQ+YP Y S PA + QG Sbjct: 60 APSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQG 119 Query: 2577 HWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXX 2398 WLQPPQ+ GL RPP++PYP P PFPLP G+ Sbjct: 120 PWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPI 179 Query: 2397 XAG-SGQP---TSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVY 2230 A SG TS + ++ PPPGID +K +G + +G A +E D WTAHKT+TG VY Sbjct: 180 SAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA-AVNEQVDAWTAHKTDTGVVY 238 Query: 2229 YYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKV 2050 YYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+NTKTK+ Sbjct: 239 
YYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKL 298 Query: 2049 SSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALR 1873 SSWQ+P E+ E+RK+QDS +L+ +M + N + +KG +P++LS PAV TGGR+A LR Sbjct: 299 SSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLR 358 Query: 1872 TSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSENSK 1693 TS S+SALDM+KKKLQD+G P TSSP+ +S P+ S+LNG +E KG QSENSK Sbjct: 359 TSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIASELNGSRVIEPTVKGLQSENSK 417 Query: 1692 EKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFD 1516 +KLKD NGDGNM GPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FD Sbjct: 418 DKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFD 477 Query: 1515 PRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYH 1336 PRFKA+PGY+ARR+LFEHYVRT EGFKQLLEEASEDIDHKT+Y Sbjct: 478 PRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQ 537 Query: 1335 SFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDI 1156 +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K Q RAAAVSSFKSMLRD GDI Sbjct: 538 TFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDI 597 Query: 1155 NTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLX 976 TS+RWSRVKDSLRNDPRYK VKHEDRE+LFNEYISEL EQDKL Sbjct: 598 TTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLK 657 Query: 975 XXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQ 796 RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PKLEKDPQ Sbjct: 658 ERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQ 717 Query: 795 GRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTSWS 616 RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK VLTSWS Sbjct: 718 ARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWS 777 Query: 615 EAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERS 436 AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK + + E+ H+E K + S DS R Sbjct: 778 TAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDSGRF 837 Query: 435 PA-PRRTHSRR 406 P+ RR H RR Sbjct: 838 PSGSRRAHERR 848 >ref|XP_012467146.1| 
PREDICTED: pre-mRNA-processing protein 40C [Gossypium raimondii] gi|763747828|gb|KJB15267.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 887 Score = 843 bits (2177), Expect = 0.0 Identities = 466/876 (53%), Positives = 573/876 (65%), Gaps = 8/876 (0%) Frame = -2 Query: 3009 SPQNKQSSNTSASAAVAQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2842 +PQ Q++ S + TGT A+SS SQS LPV+ SS +M PS P+ Sbjct: 24 NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83 Query: 2841 TIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2662 T ++ A A PS+++ Sbjct: 84 T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136 Query: 2661 XXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2482 V QQ+YPPY S P+M +PQG+W+Q P + G RPP++PYPT P PFP Sbjct: 137 PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196 Query: 2481 GIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTST 2302 G+ A + Q + ++ T PP GID K + +T Sbjct: 197 GMPLPAPSSDSQPPGVRPLGMSPFAPSAAALANQ-SLAILTGFPPQGIDNRKLVH-DVTT 254 Query: 2301 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2122 E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L Sbjct: 255 KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314 Query: 2121 VGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMD 1945 GTDW LV+TNDGKKYY+N+KTK+SSWQ+P EV ELRK+QDS+ S + +++ N + Sbjct: 315 AGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAE 374 Query: 1944 KGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVP 1765 KGS P+SLS PAVNTGGR+AM LRTS SSSALD++KKKLQD G+P +SSP+P V Sbjct: 375 KGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVT 433 Query: 1764 VTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFK 1588 T +LNG V+ KG QSE++K+KLKDANGDG++ GP+KEECI+QFK Sbjct: 434 ATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFK 491 Query: 1587 EMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXX 1408 EMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T Sbjct: 492 
EMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQK 551 Query: 1407 XXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAE 1228 EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL LK+AAE Sbjct: 552 AAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAE 611 Query: 1227 QKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYIS 1048 +K + RAAA SSFKSML++ GDIN +SRWSRVKDSLR+DPRYK VKHEDREVLFNEYIS Sbjct: 612 EKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYIS 671 Query: 1047 ELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQAL 868 EL E++KL RVRLKVRRKEAVAS+QAL Sbjct: 672 ELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQAL 731 Query: 867 LVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALL 688 LVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER +FRALL Sbjct: 732 LVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALL 791 Query: 687 AEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK 508 AEVIT + AQ T+ GK L SWS AKRLLKPDPRY+KMPRK+RE++W+R+AE+M R+QK Sbjct: 792 AEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQK 851 Query: 507 PSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 406 + +EE+ H++ K + S S RRTH RR Sbjct: 852 SALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 887 >gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 888 Score = 838 bits (2166), Expect = 0.0 Identities = 467/877 (53%), Positives = 573/877 (65%), Gaps = 9/877 (1%) Frame = -2 Query: 3009 SPQNKQSSNTSASAAVAQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2842 +PQ Q++ S + TGT A+SS SQS LPV+ SS +M PS P+ Sbjct: 24 NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83 Query: 2841 TIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2662 T ++ A A PS+++ Sbjct: 84 T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136 Query: 2661 XXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2482 V QQ+YPPY S P+M +PQG+W+Q P + G RPP++PYPT P PFP Sbjct: 137 PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196 
Query: 2481 GIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTST 2302 G+ A + Q + ++ T PP GID K + +T Sbjct: 197 GMPLPAPSSDSQPPGVRPLGMSPFAPSAAALANQ-SLAILTGFPPQGIDNRKLVH-DVTT 254 Query: 2301 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2122 E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L Sbjct: 255 KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314 Query: 2121 VGTDWVLVSTNDGKKYYHNTKTKV-SSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGM 1948 GTDW LV+TNDGKKYY+N+KTKV SSWQ+P EV ELRK+QDS+ S + +++ N Sbjct: 315 AGTDWALVTTNDGKKYYYNSKTKVISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVA 374 Query: 1947 DKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSV 1768 +KGS P+SLS PAVNTGGR+AM LRTS SSSALD++KKKLQD G+P +SSP+P V Sbjct: 375 EKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPV 433 Query: 1767 PVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQF 1591 T +LNG V+ KG QSE++K+KLKDANGDG++ GP+KEECI+QF Sbjct: 434 TATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQF 491 Query: 1590 KEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXX 1411 KEMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T Sbjct: 492 KEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQ 551 Query: 1410 XXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAA 1231 EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL LK+AA Sbjct: 552 KAAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAA 611 Query: 1230 EQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYI 1051 E+K + RAAA SSFKSML++ GDIN +SRWSRVKDSLR+DPRYK VKHEDREVLFNEYI Sbjct: 612 EEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYI 671 Query: 1050 SELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQA 871 SEL E++KL RVRLKVRRKEAVAS+QA Sbjct: 672 SELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQA 731 Query: 870 LLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRAL 691 LLVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER +FRAL Sbjct: 732 
LLVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRAL 791 Query: 690 LAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQ 511 LAEVIT + AQ T+ GK L SWS AKRLLKPDPRY+KMPRK+RE++W+R+AE+M R+Q Sbjct: 792 LAEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQ 851 Query: 510 KPSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 406 K + +EE+ H++ K + S S RRTH RR Sbjct: 852 KSALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 888 >gb|KJB15269.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 886 Score = 838 bits (2166), Expect = 0.0 Identities = 465/876 (53%), Positives = 572/876 (65%), Gaps = 8/876 (0%) Frame = -2 Query: 3009 SPQNKQSSNTSASAAVAQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2842 +PQ Q++ S + TGT A+SS SQS LPV+ SS +M PS P+ Sbjct: 24 NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83 Query: 2841 TIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2662 T ++ A A PS+++ Sbjct: 84 T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136 Query: 2661 XXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2482 V QQ+YPPY S P+M +PQG+W+Q P + G RPP++PYPT P PFP Sbjct: 137 PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196 Query: 2481 GIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTST 2302 G+ A + Q + ++ T PP GID K + +T Sbjct: 197 GMPLPAPSSDSQPPGVRPLGMSPFAPSAAALANQ-SLAILTGFPPQGIDNRKLVH-DVTT 254 Query: 2301 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2122 E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L Sbjct: 255 KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314 Query: 2121 VGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMD 1945 GTDW LV+TNDGKKYY+N+KTK+SSWQ+P EV ELRK+QDS+ S + +++ N + Sbjct: 315 AGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAE 374 Query: 1944 KGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVP 1765 KGS P+SLS PAVNTGGR+AM LRTS SSSALD++KKKLQD G+P +SSP+P V Sbjct: 375 KGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVT 433 
Query: 1764 VTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFK 1588 T +LNG V+ KG QSE++K+KLKDANGDG++ GP+KEECI+QFK Sbjct: 434 ATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFK 491 Query: 1587 EMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXX 1408 EMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T Sbjct: 492 EMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQK 551 Query: 1407 XXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAE 1228 EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL LK+AAE Sbjct: 552 AAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAE 611 Query: 1227 QKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYIS 1048 +K + RAAA SSFKSML++ GDIN +SRWSRVKDSLR+DPRYK VKHEDREVLFNEYIS Sbjct: 612 EKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYIS 671 Query: 1047 ELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQAL 868 EL ++KL RVRLKVRRKEAVAS+QAL Sbjct: 672 ELKAIEEKAERKDKVKKE-EEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQAL 730 Query: 867 LVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALL 688 LVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER +FRALL Sbjct: 731 LVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALL 790 Query: 687 AEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK 508 AEVIT + AQ T+ GK L SWS AKRLLKPDPRY+KMPRK+RE++W+R+AE+M R+QK Sbjct: 791 AEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQK 850 Query: 507 PSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 406 + +EE+ H++ K + S S RRTH RR Sbjct: 851 SALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 886 >ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma cacao] gi|508709257|gb|EOY01154.1| Pre-mRNA-processing protein 40C [Theobroma cacao] Length = 816 Score = 831 bits (2146), Expect = 0.0 Identities = 437/749 (58%), Positives = 527/749 (70%), Gaps = 6/749 (0%) Frame = -2 Query: 2634 QQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXX 2455 QQ+YP Y P+MA +PQG W+Q P + G RPP++PYPT P PFP G+ Sbjct: 75 
QQIYPTYTPLPSMASSPQGFWMQHPPMGGFPRPPFVPYPTIYPGPFPSASSGMPHPAPSS 134 Query: 2454 XXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQS--PPPGIDQDKQSDGNTSTNGEIAKS 2281 + Q + + G Q+ PP GID + N T E A + Sbjct: 135 DSQPPGVSPLATSPFAPSIAIPANQSSVASGIQTGFPPQGID-----NRNVGTRVEAAVN 189 Query: 2280 EDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVL 2101 E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPDKVPVQ TPVS E+L GT+W L Sbjct: 190 EQSDIWTAHKTDTGIVYYYNALTGESTYEKPAGFKGEPDKVPVQPTPVSVEQLAGTEWAL 249 Query: 2100 VSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMDKGSAPVS 1924 V+T+DGKKYY+N+KTK+SSWQ+P EVAELRK+QD+D S + ++ N +KGS P+S Sbjct: 250 VTTSDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDVSKEHAVPVPNIDVVAEKGSTPIS 309 Query: 1923 LSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSP-LPASSVPVTSDLN 1747 LS PAV+TGGR+AM LRTS SSSALD++KKKLQD+G+P +SS +P V +LN Sbjct: 310 LSAPAVSTGGRDAMPLRTSVVPGSSSALDLIKKKLQDSGVPSSSSSSVPVMPVTAAQELN 369 Query: 1746 GLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKER 1570 G V+ KG QSENSK+KLKDANGDGN+ GP+KEECI+QFKEMLKER Sbjct: 370 GSRAVDV--KGLQSENSKDKLKDANGDGNISDSSSDSEDTDSGPSKEECIMQFKEMLKER 427 Query: 1569 GVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1390 GVAPFSKWEKELPKI+FDPRFKA+P ++ARR LFEHYV+T EGF Sbjct: 428 GVAPFSKWEKELPKIVFDPRFKAIPSHSARRTLFEHYVKTRAEEERREKRAALKAAIEGF 487 Query: 1389 KQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQ 1210 KQLL+EASEDIDH T+Y +FKRKWGSD RFEALDRKDRE LL ERVLPLK+AAE+K Q Sbjct: 488 KQLLDEASEDIDHNTNYQTFKRKWGSDLRFEALDRKDRELLLTERVLPLKRAAEEKAQAI 547 Query: 1209 RAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXX 1030 RAAA SS KSML++ GDI +SRWSRVKDS+R+DPRYK VKHEDREVLFNEYISEL Sbjct: 548 RAAAASSLKSMLKEKGDITVNSRWSRVKDSIRDDPRYKCVKHEDREVLFNEYISELKAVE 607 Query: 1029 XXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIK 850 E++KL RVRLKVRRKEAVAS+QALLVETIK Sbjct: 608 EKAERKERVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIK 667 Query: 849 DPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITA 670 DP+ASWTES PKLEKDPQGRA+NPDLD 
+DTEKLFREH+K L+ER +FRALLAEVIT Sbjct: 668 DPQASWTESKPKLEKDPQGRAANPDLDPSDTEKLFREHIKMLFERCTHDFRALLAEVITQ 727 Query: 669 ETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKE 490 + AAQ T+ GK V SWS AKRLLKPDPRYSKMPRK+RE++W+R+AE+M R+QK + +E Sbjct: 728 DAAAQETEGGKTVFNSWSTAKRLLKPDPRYSKMPRKEREALWRRYAEDMLRKQKSALDQE 787 Query: 489 EKPHSEFKNKISADSER-SPAPRRTHSRR 406 E+ ++ K + S D R S R+ H RR Sbjct: 788 EEKRTDAKVRSSGDLGRFSSGSRKVHERR 816 >ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-like [Citrus sinensis] Length = 978 Score = 828 bits (2139), Expect = 0.0 Identities = 467/936 (49%), Positives = 569/936 (60%), Gaps = 9/936 (0%) Frame = -2 Query: 3186 TAASLQPPVPGRPNQFVPGTIPQ------NMPASMQSPISVPKGHPSIXXXXXXXXXSQL 3025 T S+ P + G IPQ N S S SV +PS+ S Sbjct: 47 TNDSISGPSQAKSVTATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSAS 106 Query: 3024 PVAAESPQNKQSSNTSASAAVAQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYP 2845 N+Q ++ G + S++SQ V S S++ +A ++ Sbjct: 107 QTVVGYSPNQQFQPNMNKLEAVEDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALST 166 Query: 2844 MTIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXX 2665 T W ++A SA LRP Sbjct: 167 TTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRPSVPTPSA 225 Query: 2664 XXXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPV 2485 HQ +YP Y S P + +PQG LQPPQ+ P++PYP P+PFPLP Sbjct: 226 PSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLQPPQMGVRPWLPFLPYPAAYPSPFPLPA 284 Query: 2484 RGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSPPPGIDQDKQSDGN 2311 G+ A G +S T++PP G D+ K+ + Sbjct: 285 HGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK-KEHVHD 343 Query: 2310 TSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVST 2131 S+ + +E D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPVQ TP+S Sbjct: 344 VSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPISM 403 Query: 2130 EKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFG 1951 E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+ + N + Sbjct: 404 EHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQSVPNTNIV 462 Query: 1950 
MDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASS 1771 ++KGS +SLS PAVNTGGR+A ALRTS SSSALD++KKKLQD+G P T+SP P SS Sbjct: 463 IEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVSS 521 Query: 1770 VPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXXXXSGPTKEECIIQ 1594 TS+ NG VE KG Q+EN+K+KLKD NGDG M +GPTKEECII+ Sbjct: 522 AAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIK 581 Query: 1593 FKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXX 1414 FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+ +ARRALFE YV+T Sbjct: 582 FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAA 641 Query: 1413 XXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKA 1234 EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNERVLPLK+A Sbjct: 642 QKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRA 701 Query: 1233 AEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEY 1054 AE+K Q RAAA SSFKSMLR+ GDI SSRWS+VKD LR+DPRYKSV+HEDREV+FNEY Sbjct: 702 AEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEY 761 Query: 1053 ISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQ 874 + EL EQ+KL RVRLKVRRKEAV S+Q Sbjct: 762 VRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQ 821 Query: 873 ALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRA 694 ALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER A +FR Sbjct: 822 ALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRG 881 Query: 693 LLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRR 514 LLAEVITAE AAQ T+DGK VL SWS AKR+LKP+PRYSKMPRK+RE++W+R AEE+QR+ Sbjct: 882 LLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRK 941 Query: 513 QKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 406 K S + E H + K++ S D R P+ R + R Sbjct: 942 HKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 977 >gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis] Length = 978 Score = 827 bits (2135), Expect = 0.0 Identities = 466/936 (49%), Positives = 569/936 (60%), Gaps = 9/936 (0%) Frame = -2 Query: 3186 
TAASLQPPVPGRPNQFVPGTIPQ------NMPASMQSPISVPKGHPSIXXXXXXXXXSQL 3025 T S+ P + G IPQ N S S SV +PS+ S Sbjct: 47 TNDSISGPSQAKSVTATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSAS 106 Query: 3024 PVAAESPQNKQSSNTSASAAVAQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYP 2845 N+Q ++ G + S++SQ V S S++ +A ++ Sbjct: 107 QTVVGYSPNQQFQPNMNKLEAVEDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALST 166 Query: 2844 MTIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXX 2665 T W ++A SA LRP Sbjct: 167 TTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRPSVPTPSA 225 Query: 2664 XXXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPV 2485 HQ +YP Y S P + +PQG L+PPQ+ P++PYP P+PFPLP Sbjct: 226 PSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLRPPQMGVRPWLPFLPYPAAYPSPFPLPA 284 Query: 2484 RGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSPPPGIDQDKQSDGN 2311 G+ A G +S T++PP G D+ K+ + Sbjct: 285 HGMPNPSVSQIDAQPPGLSSVRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK-KEHVHD 343 Query: 2310 TSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVST 2131 S+ + +E D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPVQ TP+S Sbjct: 344 VSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPISM 403 Query: 2130 EKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFG 1951 E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+ + N + Sbjct: 404 EHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQSVPNTNIV 462 Query: 1950 MDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASS 1771 ++KGS +SLS PAVNTGGR+A ALRTS SSSALD++KKKLQD+G P T+SP P SS Sbjct: 463 IEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVSS 521 Query: 1770 VPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXXXXSGPTKEECIIQ 1594 TS+ NG VE KG Q+EN+K+KLKD NGDG M +GPTKEECII+ Sbjct: 522 AAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIK 581 Query: 1593 FKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXX 1414 FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+ +ARRALFE YV+T Sbjct: 582 FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAA 641 Query: 1413 
XXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKA 1234 EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNERVLPLK+A Sbjct: 642 QKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRA 701 Query: 1233 AEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEY 1054 AE+K Q RAAA SSFKSMLR+ GDI SSRWS+VKD LR+DPRYKSV+HEDREV+FNEY Sbjct: 702 AEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEY 761 Query: 1053 ISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQ 874 + EL EQ+KL RVRLKVRRKEAV S+Q Sbjct: 762 VRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQ 821 Query: 873 ALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRA 694 ALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER A +FR Sbjct: 822 ALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRG 881 Query: 693 LLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRR 514 LLAEVITAE AAQ T+DGK VL SWS AKR+LKP+PRYSKMPRK+RE++W+R AEE+QR+ Sbjct: 882 LLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRK 941 Query: 513 QKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 406 K S + E H + K++ S D R P+ R + R Sbjct: 942 HKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 977 >ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citrus clementina] gi|557539684|gb|ESR50728.1| hypothetical protein CICLE_v10030612mg [Citrus clementina] Length = 1015 Score = 827 bits (2135), Expect = 0.0 Identities = 478/1003 (47%), Positives = 589/1003 (58%), Gaps = 9/1003 (0%) Frame = -2 Query: 3387 PAVKFTPPTSAAALQPPVPRQSSGSVPSFSYNLISQPNVGSANGQQLQTGTVTGPGNIQV 3208 P ++ ++ A PP +Q + + P + +P GS T T G Sbjct: 31 PFIRSDQIMTSPAWLPPEVQQLTANAP-----ISGKPVGGSLVASSTPTPTSNGSDTATN 85 Query: 3207 GKFVPPNTAASLQPPVPGRPNQFVPGTIPQ------NMPASMQSPISVPKGHPSIXXXXX 3046 P+ A S+ G IPQ N S S SV +PS+ Sbjct: 86 DSISGPSQAKSVTA---------TGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVS 136 Query: 3045 XXXXSQLPVAAESPQNKQSSNTSASAAVAQETGTVPAASSSSQSTALPVYVSSSSSMIVP 2866 S N+Q ++ G + S++SQ V S S++ Sbjct: 137 
SFTYSASQTVVGYSPNQQFQPNMNKLEAVEDAGLGSSTSTNSQPVQASVRTFSDSTVATS 196 Query: 2865 AAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRP 2686 +A ++ T W ++A SA LRP Sbjct: 197 SATALSTTTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRP 255 Query: 2685 MXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLP 2506 HQ +YP + S P + +PQ LQPPQ+ P++PYP P Sbjct: 256 SVPTPSAPSNSGSAIQHQ-IYPTHPSLPPVGVSPQRPLLQPPQMGVRPWLPFLPYPAAYP 314 Query: 2505 APFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSPPPGIDQ 2332 +PFPLP G+ A G +S T++PP G D+ Sbjct: 315 SPFPLPAHGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK 374 Query: 2331 DKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPV 2152 K+ + S+ + +E D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPV Sbjct: 375 -KEHVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPV 433 Query: 2151 QSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMT 1972 Q TP+S E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+ + Sbjct: 434 QPTPISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQS 492 Query: 1971 SQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTS 1792 N + ++KGS +SLS PAVNTGGR+A ALRTS SSSALD++KKKLQD+G P T+ Sbjct: 493 VPNTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TA 551 Query: 1791 SPLPASSVPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXXXXSGPT 1615 SP P SS TS+ NG VE KG Q+EN+K+KLKD NGDG M +GPT Sbjct: 552 SPAPVSSAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPT 611 Query: 1614 KEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXX 1435 KEECII+FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+ +ARRALFE YV+T Sbjct: 612 KEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEE 671 Query: 1434 XXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNER 1255 EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNER Sbjct: 672 RKEKRAAQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNER 731 Query: 1254 VLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDR 1075 VLPLK+AAE+K Q RAAA SSFKSMLR+ GDI SSRWS+VKD LR+DPRYKSV+HEDR Sbjct: 732 
VLPLKRAAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDR 791 Query: 1074 EVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRK 895 EV+FNEY+ EL EQ+KL RVRLKVRRK Sbjct: 792 EVIFNEYVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRK 851 Query: 894 EAVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYER 715 EAV S+QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER Sbjct: 852 EAVTSFQALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYER 911 Query: 714 SAREFRALLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRF 535 A +FR LLAEVITAE AAQ T+DGK VL SWS AKR+LKPDPRYSKMPRK+RE++W+R Sbjct: 912 CAHDFRGLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPDPRYSKMPRKEREALWRRH 971 Query: 534 AEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 406 AEE+QR+ K S + E H + K++ S D R P+ R + R Sbjct: 972 AEEIQRKHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 1014 >gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis] gi|641834042|gb|KDO53045.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis] Length = 857 Score = 824 bits (2128), Expect = 0.0 Identities = 438/769 (56%), Positives = 524/769 (68%), Gaps = 3/769 (0%) Frame = -2 Query: 2703 SASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMP 2524 SA LRP HQ +YP Y S P + +PQG L+PPQ+ P++P Sbjct: 92 SAGLRPSVPTPSAPSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLRPPQMGVRPWLPFLP 150 Query: 2523 YPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSP 2350 YP P+PFPLP G+ A G +S T++P Sbjct: 151 YPAAYPSPFPLPAHGMPNPSVSQIDAQPPGLSSVRTAAATSHSAIPGHQLVGTSGNTEAP 210 Query: 2349 PPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGE 2170 P G D+ K+ + S+ + +E D WTAHKT+TG VYYYN++TG+STYE+P+ FKGE Sbjct: 211 PSGTDK-KEHVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGE 269 Query: 2169 PDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDS 1990 PDKVPVQ TP+S E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+ Sbjct: 270 PDKVPVQPTPISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDT 329 Query: 1989 LQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDA 1810 L+ + N 
+ ++KGS +SLS PAVNTGGR+A ALRTS SSSALD++KKKLQD+ Sbjct: 330 LK-EQSVPNTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDS 388 Query: 1809 GMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXX 1633 G P T+SP P SS TS+ NG VE KG Q+EN+K+KLKD NGDG M Sbjct: 389 GTP-TASPAPVSSAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSED 447 Query: 1632 XXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVR 1453 +GPTKEECII+FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+ +ARRALFE YV+ Sbjct: 448 GETGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVK 507 Query: 1452 TXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDRE 1273 T EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE Sbjct: 508 TRAEEERKEKRAAQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRE 567 Query: 1272 ALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKS 1093 LLNERVLPLK+AAE+K Q RAAA SSFKSMLR+ GDI SSRWS+VKD LR+DPRYKS Sbjct: 568 LLLNERVLPLKRAAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKS 627 Query: 1092 VKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVR 913 V+HEDREV+FNEY+ EL EQ+KL RVR Sbjct: 628 VRHEDREVIFNEYVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVR 687 Query: 912 LKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHV 733 LKVRRKEAV S+QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+ Sbjct: 688 LKVRRKEAVTSFQALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHI 747 Query: 732 KTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRE 553 KTLYER A +FR LLAEVITAE AAQ T+DGK VL SWS AKR+LKP+PRYSKMPRK+RE Sbjct: 748 KTLYERCAHDFRGLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKERE 807 Query: 552 SIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 406 ++W+R AEE+QR+ K S + E H + K++ S D R P+ R + R Sbjct: 808 ALWRRHAEEIQRKHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 856