BLASTX nr result

ID: Cinnamomum25_contig00006944 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum25_contig00006944
         (3811 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010250268.1| PREDICTED: pre-mRNA-processing protein 40C i...  1042   0.0  
ref|XP_010906097.1| PREDICTED: pre-mRNA-processing protein 40C i...  1004   0.0  
ref|XP_010906099.1| PREDICTED: pre-mRNA-processing protein 40C i...   999   0.0  
ref|XP_010906098.1| PREDICTED: pre-mRNA-processing protein 40C i...   981   0.0  
ref|XP_011624657.1| PREDICTED: pre-mRNA-processing protein 40C [...   943   0.0  
ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C i...   929   0.0  
ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C i...   920   0.0  
ref|XP_010906101.1| PREDICTED: pre-mRNA-processing protein 40C i...   907   0.0  
ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C i...   905   0.0  
ref|XP_009388080.1| PREDICTED: pre-mRNA-processing protein 40C [...   904   0.0  
ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C i...   898   0.0  
ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C i...   882   0.0  
ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [...   843   0.0  
gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium r...   838   0.0  
gb|KJB15269.1| hypothetical protein B456_002G167700 [Gossypium r...   838   0.0  
ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma c...   831   0.0  
ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-l...   828   0.0  
gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sin...   827   0.0  
ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citr...   827   0.0  
gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sin...   824   0.0  

>ref|XP_010250268.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Nelumbo
            nucifera] gi|719963615|ref|XP_010250275.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X1 [Nelumbo
            nucifera]
          Length = 1088

 Score = 1042 bits (2694), Expect = 0.0
 Identities = 600/1109 (54%), Positives = 702/1109 (63%), Gaps = 27/1109 (2%)
 Frame = -2

Query: 3651 QSSIPGMTPQAPVSGPTVAPS--IQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTA 3478
            QSS  G+T QA   G    PS     S                ++EP+ +S+RAKF+T  
Sbjct: 8    QSSASGITAQASGLGQATGPSNPTVASPAPVSGPSNPKGPSGTTNEPAQESIRAKFITGP 67

Query: 3477 GFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFS 3298
            G+VVPAPSF YSVI               +PA+    P SA A QP +P QS  S P+FS
Sbjct: 68   GYVVPAPSFSYSVIPKQNTASGSSLENSSSPALVSNQPASATAFQPSIPGQSLSSGPTFS 127

Query: 3297 YNLISQPNVGSANGQQLQTGTVTGPGNI---QVGKFVPPNTAASLQPPVPGRP---NQFV 3136
            YN+I    +GS+  Q+LQ+ T  G G +   QVG   P  TAASLQPPVPG+P   N F 
Sbjct: 128  YNIIPPAKIGSSAQQKLQSSTDVGSGPLGHSQVGNSTPSTTAASLQPPVPGQPGHPNTFG 187

Query: 3135 PGTIPQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQ 2956
            PGT  Q M +   SP+SVPKG PSI          QL       Q   SSN+SAS AVA+
Sbjct: 188  PGTGAQFMASQGPSPVSVPKGAPSIATSFSFNRIPQLA------QKDLSSNSSASVAVAR 241

Query: 2955 ETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXX 2776
            E GTV  ASSSS   ++P +VS SS +    +P++ P T+W                   
Sbjct: 242  EAGTVSPASSSSVPVSMPFHVSPSS-LAAATSPNLCPATLWMPVAPSFVPPPGMPITPGT 300

Query: 2775 XXXXXXXXXXXXXXA-NARPAAMDPSAS--LRPMXXXXXXXXXXXXXXVHQQLYPPYHSQ 2605
                                 AMD S+S  LRP+                QQ++ PY + 
Sbjct: 301  PGPPGIAPSTPLSSTVTVNSEAMDSSSSTSLRPVVPSTV----------QQQMHSPYPAL 350

Query: 2604 PAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXX 2425
            P+M P PQG WL PPQ+ GLQRPP++PYP  LP  +PLP+RG+                 
Sbjct: 351  PSMPPPPQGLWL-PPQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISP 409

Query: 2424 XXXXXXXXXXAGSGQPTSSVGT------------QSPPPGIDQDKQSDGNTSTNGEIAKS 2281
                         G P+SSVG+              PPPG DQ K  D      G    +
Sbjct: 410  LGPP--------GGTPSSSVGSVHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNA 461

Query: 2280 EDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVL 2101
            +  D WTAHKTETG VYYYN+LTG+STYERPS F GEPDKV VQ TPVS EKLVGTDW L
Sbjct: 462  K-VDAWTAHKTETGVVYYYNALTGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWAL 520

Query: 2100 VSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS-QNASFGMDKGSAPVS 1924
            V+TNDGKKYY+N+KTK+SSWQ+P+EV ELR++ D D+L+ +MT  QN+    +K SAP+S
Sbjct: 521  VTTNDGKKYYYNSKTKISSWQVPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPIS 580

Query: 1923 LSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNG 1744
            ++ PA+NTGGREA +LR SG   SSSALD++KKKLQD+  P TSSPLP SS P T+DLNG
Sbjct: 581  VTAPAINTGGREATSLRPSGVAGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNG 640

Query: 1743 LGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERG 1567
              PVEA  KG QSEN K+K+KD NGDGN+            GP+KEECIIQFKEMLKERG
Sbjct: 641  SRPVEAAVKGLQSEN-KDKVKDINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERG 699

Query: 1566 VAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFK 1387
            VAPFSKWEKELPKI+FDPRFKAVPGY+ARRALFEHYVRT                 EGFK
Sbjct: 700  VAPFSKWEKELPKIVFDPRFKAVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFK 759

Query: 1386 QLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQR 1207
            QLLEEASEDID +TDY +FK KWGSDPRFEALDRK+RE LLNERVLPLKKAAE+K Q  R
Sbjct: 760  QLLEEASEDIDQRTDYQTFKMKWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIR 819

Query: 1206 AAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXX 1027
            AAA S FKS+LR+ GDINTSSRWSRVKDSLR+DPRYKSVKHEDRE+LFNEYISEL     
Sbjct: 820  AAAASGFKSLLREKGDINTSSRWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADE 879

Query: 1026 XXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKD 847
                       E+DKL                   RVRLKV+RKEAVA YQALLVETIKD
Sbjct: 880  EAEREAKVKREEEDKLKEREREMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKD 939

Query: 846  PKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAE 667
            P+ SWTES P+LEKDPQGRA+N  LD  D EKLFREHVK LYER AREFR LL EVIT E
Sbjct: 940  PQVSWTESRPRLEKDPQGRATNSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTE 999

Query: 666  TAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK-PSDSKE 490
             A+QMT+DGK VLTSWS AKRLLK DPRYSKMPRK+RE++W+R AEE+  ++K  SD KE
Sbjct: 1000 AASQMTNDGKTVLTSWSTAKRLLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKE 1059

Query: 489  EKPHSEFKNKISADSERSPAP-RRTHSRR 406
            EK + E K + S DS RSP   RR+HSRR
Sbjct: 1060 EKLNIETKARSSLDSGRSPTGLRRSHSRR 1088


>ref|XP_010906097.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Elaeis
            guineensis]
          Length = 1097

 Score = 1004 bits (2596), Expect = 0.0
 Identities = 566/1091 (51%), Positives = 676/1091 (61%), Gaps = 14/1091 (1%)
 Frame = -2

Query: 3639 PGMTPQAPVSGPTVAPS-----IQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTAG 3475
            P  T   PV+ P+   S     + VS               N++ P+ D VRAKF T+ G
Sbjct: 50   PATTAITPVTSPSFMDSGPSLTVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQG 109

Query: 3474 FVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFSY 3295
            FVVPAPSF Y V                +P ++ +PP  A ALQPPVP Q  G+ PSFSY
Sbjct: 110  FVVPAPSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSY 169

Query: 3294 NLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTI 3124
            N++S  N GSA GQQ Q  T T   N+Q G+F PP TAASLQPPVP     P   VPG I
Sbjct: 170  NVVSNANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAI 229

Query: 3123 PQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGT 2944
              + PA MQ P+S+P G                   A   +   S  TS  +  AQ + T
Sbjct: 230  TPSCPAPMQLPLSIPTGTSD----------------AVVTEAGTSITTSIDSQSAQLSAT 273

Query: 2943 VPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXX 2764
            VP++SS++         ++SS +++P+ PS                              
Sbjct: 274  VPSSSSTASGINPN---ANSSGILMPSTPSF------------TGHPGMPGLAGTPGLPG 318

Query: 2763 XXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPA 2599
                        ++PA  +PS  LRPM                     QQ Y PY S P 
Sbjct: 319  IPNSATVSSTVTSQPAGTNPSP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPG 377

Query: 2598 MAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXX 2419
              P PQ  WL PPQ  GLQR P++PY   LPAPF LPV G+                   
Sbjct: 378  TIPPPQALWLHPPQAGGLQRAPFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVA 437

Query: 2418 XXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETG 2239
                     GS Q  S+VG +SP  GID +K ++ +   +GE  K+E+AD WTAHKTE+G
Sbjct: 438  NQGPASTTMGSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESG 496

Query: 2238 AVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTK 2059
             VYYYNS+TG+STYERPSSF GEP+ V  QSTPVS EKL GT+W LV+TNDG+KYY++TK
Sbjct: 497  VVYYYNSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTK 556

Query: 2058 TKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMA 1879
             KVSSWQ+P EV ELRK Q+SD+L+ +  +   +   DKGSAP+S+S PAV TGGR++MA
Sbjct: 557  NKVSSWQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMA 614

Query: 1878 LRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSEN 1699
            LRTSGA  SSSALD+VKKKLQDAG PVTSSP+P    PV SDLNG   VE   KGQQ  N
Sbjct: 615  LRTSGAAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTN 673

Query: 1698 SKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKIL 1522
            SK+K+KD   DGNM            GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+
Sbjct: 674  SKDKVKD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIV 730

Query: 1521 FDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTD 1342
            FDPRFKAVP Y+AR+ +FEH+VRT                 + FKQLLEEASE+IDHKTD
Sbjct: 731  FDPRFKAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTD 790

Query: 1341 YHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSG 1162
            Y +FKRKWGSDPRF  LDRK+RE LLNE+V    KAAE+K+Q  R AAV+SFKSMLRD+ 
Sbjct: 791  YQTFKRKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNK 846

Query: 1161 DINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDK 982
            DI T+SRWSRVK++LRNDPRYK+VKHE+R  LFNEYISEL                EQ+K
Sbjct: 847  DITTTSRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEK 906

Query: 981  LXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKD 802
            L                   RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKD
Sbjct: 907  LKEREREMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKD 966

Query: 801  PQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTS 622
            PQGRA+NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE AAQ TDDGK +L S
Sbjct: 967  PQGRATNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNS 1026

Query: 621  WSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSE 442
            WSEAKRLLKPDPRYSKMP KDRE +W+R+AE+M R+QKP+   +EKP ++ +N+ S+D  
Sbjct: 1027 WSEAKRLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFS 1086

Query: 441  RSPAPRRTHSR 409
            R  +PRR+H R
Sbjct: 1087 RR-SPRRSHGR 1096


>ref|XP_010906099.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Elaeis
            guineensis]
          Length = 1055

 Score =  999 bits (2583), Expect = 0.0
 Identities = 563/1086 (51%), Positives = 671/1086 (61%), Gaps = 9/1086 (0%)
 Frame = -2

Query: 3639 PGMTPQAPVSGPTVAPS-----IQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTAG 3475
            P  T   PV+ P+   S     + VS               N++ P+ D VRAKF T+ G
Sbjct: 50   PATTAITPVTSPSFMDSGPSLTVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQG 109

Query: 3474 FVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFSY 3295
            FVVPAPSF Y V                +P ++ +PP  A ALQPPVP Q  G+ PSFSY
Sbjct: 110  FVVPAPSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSY 169

Query: 3294 NLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTI 3124
            N++S  N GSA GQQ Q  T T   N+Q G+F PP TAASLQPPVP     P   VPG I
Sbjct: 170  NVVSNANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAI 229

Query: 3123 PQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGT 2944
              + PA MQ P+S+P G                   A   +   S  TS  +  AQ + T
Sbjct: 230  TPSCPAPMQLPLSIPTGTSD----------------AVVTEAGTSITTSIDSQSAQLSAT 273

Query: 2943 VPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXX 2764
            VP+ SSS+ S +  V    + +   P  P V P                           
Sbjct: 274  VPS-SSSTASVSSTVTSQPAGTNPSPLRPMVPP--------------------------- 305

Query: 2763 XXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTP 2584
                          P ++ P+++  P+                QQ Y PY S P   P P
Sbjct: 306  --------------PVSLPPTSTPVPVQQNI-----------QQQFYQPYPSLPGTIPPP 340

Query: 2583 QGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXX 2404
            Q  WL PPQ  GLQR P++PY   LPAPF LPV G+                        
Sbjct: 341  QALWLHPPQAGGLQRAPFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVANQGPA 400

Query: 2403 XXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYY 2224
                GS Q  S+VG +SP  GID +K ++ +   +GE  K+E+AD WTAHKTE+G VYYY
Sbjct: 401  STTMGSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESGVVYYY 459

Query: 2223 NSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSS 2044
            NS+TG+STYERPSSF GEP+ V  QSTPVS EKL GT+W LV+TNDG+KYY++TK KVSS
Sbjct: 460  NSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVSS 519

Query: 2043 WQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSG 1864
            WQ+P EV ELRK Q+SD+L+ +  +   +   DKGSAP+S+S PAV TGGR++MALRTSG
Sbjct: 520  WQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMALRTSG 577

Query: 1863 AMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSENSKEKL 1684
            A  SSSALD+VKKKLQDAG PVTSSP+P    PV SDLNG   VE   KGQQ  NSK+K+
Sbjct: 578  AAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTNSKDKV 636

Query: 1683 KDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRF 1507
            KD   DGNM            GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPRF
Sbjct: 637  KD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPRF 693

Query: 1506 KAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFK 1327
            KAVP Y+AR+ +FEH+VRT                 + FKQLLEEASE+IDHKTDY +FK
Sbjct: 694  KAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTFK 753

Query: 1326 RKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTS 1147
            RKWGSDPRF  LDRK+RE LLNE+V    KAAE+K+Q  R AAV+SFKSMLRD+ DI T+
Sbjct: 754  RKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITTT 809

Query: 1146 SRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXX 967
            SRWSRVK++LRNDPRYK+VKHE+R  LFNEYISEL                EQ+KL    
Sbjct: 810  SRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKERE 869

Query: 966  XXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRA 787
                           RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKDPQGRA
Sbjct: 870  REMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGRA 929

Query: 786  SNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTSWSEAK 607
            +NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE AAQ TDDGK +L SWSEAK
Sbjct: 930  TNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEAK 989

Query: 606  RLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAP 427
            RLLKPDPRYSKMP KDRE +W+R+AE+M R+QKP+   +EKP ++ +N+ S+D  R  +P
Sbjct: 990  RLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFSRR-SP 1048

Query: 426  RRTHSR 409
            RR+H R
Sbjct: 1049 RRSHGR 1054


>ref|XP_010906098.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Elaeis
            guineensis]
          Length = 1066

 Score =  981 bits (2537), Expect = 0.0
 Identities = 558/1091 (51%), Positives = 668/1091 (61%), Gaps = 14/1091 (1%)
 Frame = -2

Query: 3639 PGMTPQAPVSGPTVAPS-----IQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTAG 3475
            P  T   PV+ P+   S     + VS               N++ P+ D VRAKF T+ G
Sbjct: 50   PATTAITPVTSPSFMDSGPSLTVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQG 109

Query: 3474 FVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFSY 3295
            FVVPAPSF Y V                +P ++ +PP  A ALQPPVP Q  G+ PSFSY
Sbjct: 110  FVVPAPSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSY 169

Query: 3294 NLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTI 3124
            N++S  N GSA GQQ Q  T T   N+Q G+F PP TAASLQPPVP     P   VPG I
Sbjct: 170  NVVSNANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAI 229

Query: 3123 PQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGT 2944
              + PA MQ P+S+P G                   A   +   S  TS  +  AQ + T
Sbjct: 230  TPSCPAPMQLPLSIPTGTSD----------------AVVTEAGTSITTSIDSQSAQLSAT 273

Query: 2943 VPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXX 2764
            VP++SS++         ++SS +++P+ PS                              
Sbjct: 274  VPSSSSTASGINPN---ANSSGILMPSTPSF------------TGHPGMPGLAGTPGLPG 318

Query: 2763 XXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPA 2599
                        ++PA  +PS  LRPM                     QQ Y PY S P 
Sbjct: 319  IPNSATVSSTVTSQPAGTNPSP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPG 377

Query: 2598 MAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXX 2419
              P PQ  WL PPQ  GLQR P++PY      P    +                      
Sbjct: 378  TIPPPQALWLHPPQAGGLQRAPFLPYSVANQGPASTTM---------------------- 415

Query: 2418 XXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETG 2239
                     GS Q  S+VG +SP  GID +K ++ +   +GE  K+E+AD WTAHKTE+G
Sbjct: 416  ---------GSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESG 465

Query: 2238 AVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTK 2059
             VYYYNS+TG+STYERPSSF GEP+ V  QSTPVS EKL GT+W LV+TNDG+KYY++TK
Sbjct: 466  VVYYYNSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTK 525

Query: 2058 TKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMA 1879
             KVSSWQ+P EV ELRK Q+SD+L+ +  +   +   DKGSAP+S+S PAV TGGR++MA
Sbjct: 526  NKVSSWQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMA 583

Query: 1878 LRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSEN 1699
            LRTSGA  SSSALD+VKKKLQDAG PVTSSP+P    PV SDLNG   VE   KGQQ  N
Sbjct: 584  LRTSGAAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTN 642

Query: 1698 SKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKIL 1522
            SK+K+KD   DGNM            GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+
Sbjct: 643  SKDKVKD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIV 699

Query: 1521 FDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTD 1342
            FDPRFKAVP Y+AR+ +FEH+VRT                 + FKQLLEEASE+IDHKTD
Sbjct: 700  FDPRFKAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTD 759

Query: 1341 YHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSG 1162
            Y +FKRKWGSDPRF  LDRK+RE LLNE+V    KAAE+K+Q  R AAV+SFKSMLRD+ 
Sbjct: 760  YQTFKRKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNK 815

Query: 1161 DINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDK 982
            DI T+SRWSRVK++LRNDPRYK+VKHE+R  LFNEYISEL                EQ+K
Sbjct: 816  DITTTSRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEK 875

Query: 981  LXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKD 802
            L                   RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKD
Sbjct: 876  LKEREREMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKD 935

Query: 801  PQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTS 622
            PQGRA+NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE AAQ TDDGK +L S
Sbjct: 936  PQGRATNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNS 995

Query: 621  WSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSE 442
            WSEAKRLLKPDPRYSKMP KDRE +W+R+AE+M R+QKP+   +EKP ++ +N+ S+D  
Sbjct: 996  WSEAKRLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFS 1055

Query: 441  RSPAPRRTHSR 409
            R  +PRR+H R
Sbjct: 1056 RR-SPRRSHGR 1065


>ref|XP_011624657.1| PREDICTED: pre-mRNA-processing protein 40C [Amborella trichopoda]
          Length = 1085

 Score =  943 bits (2438), Expect = 0.0
 Identities = 547/1106 (49%), Positives = 672/1106 (60%), Gaps = 24/1106 (2%)
 Frame = -2

Query: 3651 QSSIPGMTPQAPVSGPTVAPSIQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTAGF 3472
            Q S PG+ PQ    G T                         ++   +SVRAKFV + G+
Sbjct: 12   QPSAPGVPPQPLTPGQTTTGG---------PPGPSPPIPRPQNDQPQESVRAKFVASPGY 62

Query: 3471 VVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTP---PTSAAALQPPVPRQSSGSVPSF 3301
            ++PAPSF Y V+                P     P   P SA ++QPPVP  S+ S  SF
Sbjct: 63   ILPAPSFSYGVVSQNNNA----------PRASLPPQSTPLSAVSVQPPVPGHSATSGASF 112

Query: 3300 SYNLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR------PNQF 3139
            SY++ S     SA          T    +Q GK   P +AASLQPPVPG+      PN +
Sbjct: 113  SYSVASHATTTSA----------TASNPMQGGKPAGPTSAASLQPPVPGQSSVSVHPNSW 162

Query: 3138 VPGTIPQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVA 2959
             P    QN  A  + P  V KG PS              V++E  Q  Q+SN+ ASAAVA
Sbjct: 163  DPERPVQNALAQARPPFLVRKGPPSTSGFSFSGNSQS--VSSEDSQKHQASNSDASAAVA 220

Query: 2958 QETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVY--PMTIWTQXXXXXXXXXXXXXX 2785
            QE  T   +SS++Q+T LP   SS++S  V ++P+ Y  P  +                 
Sbjct: 221  QEAKTSQPSSSTAQTTPLPA-PSSTTSRPVSSSPNTYATPFYMPKAPPFPGPPRLPVTPG 279

Query: 2784 XXXXXXXXXXXXXXXXXANARPAAMDP-SASLRPMXXXXXXXXXXXXXXVHQQ-----LY 2623
                              N RP+ +D  SA +RP                  Q     +Y
Sbjct: 280  TPGPPGIALSAPQLSSSVNIRPSVIDTNSAIMRPNIASSAPGTSNAASVPITQTAQPPIY 339

Query: 2622 PPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXX 2443
             PY + P + P PQ  W+ P Q+ GLQRPP++PYP   P PFP+P+R I           
Sbjct: 340  SPYPTLPGVVPPPQAMWMHPSQMGGLQRPPFLPYPGTFPGPFPMPLRPITVPPVAMPDSS 399

Query: 2442 XXXXXXXXXXXXXXXXA--GSGQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDAD 2269
                            A  G+G   +    QSPPPGID++K +   T+ +     +ED D
Sbjct: 400  QPPGVSPIGPPGGIPLADHGAGIQVTISEEQSPPPGIDKEKDTIDYTNKDDNAVSNEDTD 459

Query: 2268 LWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTN 2089
             WTAHKT+TGAVYYYN+LTG+STYE+P  FKGE DKV +Q TPVS EKLVGTDW LV+TN
Sbjct: 460  QWTAHKTDTGAVYYYNALTGESTYEKPPGFKGEVDKVILQRTPVSWEKLVGTDWALVATN 519

Query: 2088 DGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDS-LQTSMTSQNASFGMDKGSAPVSLSVP 1912
            DGKKYY+NTK+K+SSWQ+P EVAELRK+Q++D+ L+ +   QNA    DKGS   SLS P
Sbjct: 520  DGKKYYYNTKSKISSWQVPPEVAELRKKQEADAALKANAPVQNAGISSDKGSVSSSLSAP 579

Query: 1911 AVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASS-VPVTSDLNGLGP 1735
            A+NTGGREAM  +++ A  SSSALD++KKKLQD+GMPVTSS LP+S+ VP TSD NG   
Sbjct: 580  AINTGGREAMTFKSATAPVSSSALDLIKKKLQDSGMPVTSSALPSSTPVPTTSDANGQRV 639

Query: 1734 VEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAP 1558
            V+   KGQQSENSK+KLK A   G++            GPTKEEC+IQFKEMLKE+G+AP
Sbjct: 640  VDTTVKGQQSENSKDKLKVAQEVGHVSDSSSDSEDVDSGPTKEECVIQFKEMLKEKGIAP 699

Query: 1557 FSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLL 1378
            FSKWEKELPKILFDPRFKA+PGYT RR+LFEH+VRT                 EGFKQLL
Sbjct: 700  FSKWEKELPKILFDPRFKAIPGYTERRSLFEHFVRTRAEEERKEKRAAQKAAIEGFKQLL 759

Query: 1377 EEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAA 1198
            E ASEDI+HKTDY +FK+KWG DPRF ALDRK+RE LLNERVLPL+KA E+K Q  RAAA
Sbjct: 760  EGASEDINHKTDYETFKKKWGYDPRFVALDRKEREMLLNERVLPLRKAVEEKTQAIRAAA 819

Query: 1197 VSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXX 1018
            V+SFKSML +  DIN  SRWS+VKDSLRNDPRYKSVKHEDREVLF EYISEL        
Sbjct: 820  VASFKSMLHEKVDINIGSRWSKVKDSLRNDPRYKSVKHEDREVLFLEYISELKAAEQEAD 879

Query: 1017 XXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKA 838
                    E++KL                   RVR K RRK+AV SYQALL E IKDPKA
Sbjct: 880  RAAKAKREEEEKLKERERELRKRKEREEQEVERVRQKARRKDAVVSYQALLTERIKDPKA 939

Query: 837  SWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAA 658
            SWTES PKLEKDP GRA+NP+L+ AD EKLFREHVK L ER AREFR+LLAEVIT E AA
Sbjct: 940  SWTESKPKLEKDPLGRATNPELEPADMEKLFREHVKVLNERCAREFRSLLAEVITPEAAA 999

Query: 657  QMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK-PSDSKEEKP 481
            Q ++DGK +L SWS AK+LL+PDPRY KMPR++RES+W+R+AE+M RRQ+  S+ KEEK 
Sbjct: 1000 QASEDGKTLLNSWSTAKKLLRPDPRYEKMPRRERESLWQRYAEDMDRRQRAASEQKEEKT 1059

Query: 480  HSEFKNKISADSER-SPAPRRTHSRR 406
            + +  ++  A S + SP+ RR+H R+
Sbjct: 1060 NIDDPSRRPAGSSKSSPSVRRSHGRK 1085


>ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Vitis
            vinifera] gi|297738259|emb|CBI27460.3| unnamed protein
            product [Vitis vinifera]
          Length = 1046

 Score =  929 bits (2401), Expect = 0.0
 Identities = 536/1091 (49%), Positives = 643/1091 (58%), Gaps = 7/1091 (0%)
 Frame = -2

Query: 3657 SSQSSIPGMTPQAPVSGPTVAPSIQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTA 3478
            +SQ+ + G+    P  GP   P+  ++                +S    +S + KFV   
Sbjct: 15   ASQNPVTGLPAGGPSGGPPT-PTGAIAPASVATIRTSEGASGTASNSIQESAQGKFVNAP 73

Query: 3477 GFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFS 3298
              V+P PSF YS I                  +   P  S    Q PVP  SS S PSFS
Sbjct: 74   PHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTPVPGPSSSSGPSFS 133

Query: 3297 YNLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGRPNQFVPGTIPQ 3118
            YN I+    G    Q  Q+ T     +I  G   P   AAS                   
Sbjct: 134  YN-IAHKGAGFPGSQPFQSST-----SIASGPRGPTPNAASFSF---------------- 171

Query: 3117 NMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGTVP 2938
                          G+P +                   Q  Q+  +  S AVAQE G++ 
Sbjct: 172  -------------NGNPQLV------------------QKDQTLKSDNSGAVAQEAGSMS 200

Query: 2937 AASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXXXX 2758
            +AS  SQS   P    SSS+M V ++P + P T+W                         
Sbjct: 201  SASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPGI 257

Query: 2757 XXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTPQG 2578
                      A P+A    +S                  + QQ+YP Y S PA   + QG
Sbjct: 258  APSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQG 317

Query: 2577 HWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXX 2398
             WLQPPQ+ GL RPP++PYP   P PFPLP  G+                          
Sbjct: 318  PWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPI 377

Query: 2397 XAG-SGQP---TSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVY 2230
             A  SG     TS + ++ PPPGID +K  +G  + +G  A +E  D WTAHKT+TG VY
Sbjct: 378  SAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA-AVNEQVDAWTAHKTDTGVVY 436

Query: 2229 YYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKV 2050
            YYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+NTKTK+
Sbjct: 437  YYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKL 496

Query: 2049 SSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALR 1873
            SSWQ+P E+ E+RK+QDS +L+  +M + N +   +KG +P++LS PAV TGGR+A  LR
Sbjct: 497  SSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLR 556

Query: 1872 TSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSENSK 1693
            TS    S+SALDM+KKKLQD+G P TSSP+ +S  P+ S+LNG   +E   KG QSENSK
Sbjct: 557  TSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIASELNGSRVIEPTVKGLQSENSK 615

Query: 1692 EKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFD 1516
            +KLKD NGDGNM            GPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FD
Sbjct: 616  DKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFD 675

Query: 1515 PRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYH 1336
            PRFKA+PGY+ARR+LFEHYVRT                 EGFKQLLEEASEDIDHKT+Y 
Sbjct: 676  PRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQ 735

Query: 1335 SFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDI 1156
            +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K Q  RAAAVSSFKSMLRD GDI
Sbjct: 736  TFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDI 795

Query: 1155 NTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLX 976
             TS+RWSRVKDSLRNDPRYK VKHEDRE+LFNEYISEL                EQDKL 
Sbjct: 796  TTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLK 855

Query: 975  XXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQ 796
                              RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PKLEKDPQ
Sbjct: 856  ERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQ 915

Query: 795  GRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTSWS 616
             RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK VLTSWS
Sbjct: 916  ARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWS 975

Query: 615  EAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERS 436
             AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK +  + E+ H+E K + S DS R 
Sbjct: 976  TAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDSGRF 1035

Query: 435  PA-PRRTHSRR 406
            P+  RR H RR
Sbjct: 1036 PSGSRRAHERR 1046


>ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Nelumbo
            nucifera]
          Length = 894

 Score =  920 bits (2378), Expect = 0.0
 Identities = 521/922 (56%), Positives = 604/922 (65%), Gaps = 19/922 (2%)
 Frame = -2

Query: 3114 MPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGTVPA 2935
            M +   SP+SVPKG PSI          QL       Q   SSN+SAS AVA+E GTV  
Sbjct: 1    MASQGPSPVSVPKGAPSIATSFSFNRIPQLA------QKDLSSNSSASVAVAREAGTVSP 54

Query: 2934 ASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXXXXX 2755
            ASSSS   ++P +VS SS +    +P++ P T+W                          
Sbjct: 55   ASSSSVPVSMPFHVSPSS-LAAATSPNLCPATLWMPVAPSFVPPPGMPITPGTPGPPGIA 113

Query: 2754 XXXXXXXA-NARPAAMDPSAS--LRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTP 2584
                          AMD S+S  LRP+                QQ++ PY + P+M P P
Sbjct: 114  PSTPLSSTVTVNSEAMDSSSSTSLRPVVPSTV----------QQQMHSPYPALPSMPPPP 163

Query: 2583 QGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXX 2404
            QG WL PPQ+ GLQRPP++PYP  LP  +PLP+RG+                        
Sbjct: 164  QGLWL-PPQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISPLGPP--- 219

Query: 2403 XXXAGSGQPTSSVGT------------QSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWT 2260
                  G P+SSVG+              PPPG DQ K  D      G    ++  D WT
Sbjct: 220  -----GGTPSSSVGSVHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNAK-VDAWT 273

Query: 2259 AHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGK 2080
            AHKTETG VYYYN+LTG+STYERPS F GEPDKV VQ TPVS EKLVGTDW LV+TNDGK
Sbjct: 274  AHKTETGVVYYYNALTGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWALVTTNDGK 333

Query: 2079 KYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS-QNASFGMDKGSAPVSLSVPAVN 1903
            KYY+N+KTK+SSWQ+P+EV ELR++ D D+L+ +MT  QN+    +K SAP+S++ PA+N
Sbjct: 334  KYYYNSKTKISSWQVPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPISVTAPAIN 393

Query: 1902 TGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAI 1723
            TGGREA +LR SG   SSSALD++KKKLQD+  P TSSPLP SS P T+DLNG  PVEA 
Sbjct: 394  TGGREATSLRPSGVAGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNGSRPVEAA 453

Query: 1722 AKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKW 1546
             KG QSEN K+K+KD NGDGN+            GP+KEECIIQFKEMLKERGVAPFSKW
Sbjct: 454  VKGLQSEN-KDKVKDINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERGVAPFSKW 512

Query: 1545 EKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEAS 1366
            EKELPKI+FDPRFKAVPGY+ARRALFEHYVRT                 EGFKQLLEEAS
Sbjct: 513  EKELPKIVFDPRFKAVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFKQLLEEAS 572

Query: 1365 EDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSF 1186
            EDID +TDY +FK KWGSDPRFEALDRK+RE LLNERVLPLKKAAE+K Q  RAAA S F
Sbjct: 573  EDIDQRTDYQTFKMKWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIRAAAASGF 632

Query: 1185 KSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXX 1006
            KS+LR+ GDINTSSRWSRVKDSLR+DPRYKSVKHEDRE+LFNEYISEL            
Sbjct: 633  KSLLREKGDINTSSRWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADEEAEREAK 692

Query: 1005 XXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTE 826
                E+DKL                   RVRLKV+RKEAVA YQALLVETIKDP+ SWTE
Sbjct: 693  VKREEEDKLKEREREMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKDPQVSWTE 752

Query: 825  SNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTD 646
            S P+LEKDPQGRA+N  LD  D EKLFREHVK LYER AREFR LL EVIT E A+QMT+
Sbjct: 753  SRPRLEKDPQGRATNSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTEAASQMTN 812

Query: 645  DGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK-PSDSKEEKPHSEF 469
            DGK VLTSWS AKRLLK DPRYSKMPRK+RE++W+R AEE+  ++K  SD KEEK + E 
Sbjct: 813  DGKTVLTSWSTAKRLLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKEEKLNIET 872

Query: 468  KNKISADSERSPAP-RRTHSRR 406
            K + S DS RSP   RR+HSRR
Sbjct: 873  KARSSLDSGRSPTGLRRSHSRR 894


>ref|XP_010906101.1| PREDICTED: pre-mRNA-processing protein 40C isoform X5 [Elaeis
            guineensis]
          Length = 916

 Score =  907 bits (2345), Expect = 0.0
 Identities = 509/950 (53%), Positives = 605/950 (63%), Gaps = 9/950 (0%)
 Frame = -2

Query: 3231 TGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTIPQNMPASMQSPISVPKGHPSI 3061
            T   N+Q G+F PP TAASLQPPVP     P   VPG I  + PA MQ P+S+P G    
Sbjct: 10   TNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAITPSCPAPMQLPLSIPTGTSD- 68

Query: 3060 XXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQETGTVPAASSSSQSTALPVYVSSSS 2881
                           A   +   S  TS  +  AQ + TVP++SS++         ++SS
Sbjct: 69   ---------------AVVTEAGTSITTSIDSQSAQLSATVPSSSSTASGINPN---ANSS 110

Query: 2880 SMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPS 2701
             +++P+ PS                                          ++PA  +PS
Sbjct: 111  GILMPSTPSF------------TGHPGMPGLAGTPGLPGIPNSATVSSTVTSQPAGTNPS 158

Query: 2700 ASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRP 2536
              LRPM                     QQ Y PY S P   P PQ  WL PPQ  GLQR 
Sbjct: 159  P-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPGTIPPPQALWLHPPQAGGLQRA 217

Query: 2535 PYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQ 2356
            P++PY   LPAPF LPV G+                            GS Q  S+VG +
Sbjct: 218  PFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVANQGPASTTMGSSQSGSNVGIE 277

Query: 2355 SPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFK 2176
            SP  GID +K ++ +   +GE  K+E+AD WTAHKTE+G VYYYNS+TG+STYERPSSF 
Sbjct: 278  SPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESGVVYYYNSVTGESTYERPSSFN 336

Query: 2175 GEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDS 1996
            GEP+ V  QSTPVS EKL GT+W LV+TNDG+KYY++TK KVSSWQ+P EV ELRK Q+S
Sbjct: 337  GEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVSSWQVPAEVLELRKSQES 396

Query: 1995 DSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQ 1816
            D+L+ +  +   +   DKGSAP+S+S PAV TGGR++MALRTSGA  SSSALD+VKKKLQ
Sbjct: 397  DALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMALRTSGAAVSSSALDLVKKKLQ 454

Query: 1815 DAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXX 1636
            DAG PVTSSP+P    PV SDLNG   VE   KGQQ  NSK+K+KD   DGNM       
Sbjct: 455  DAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTNSKDKVKD---DGNMSDSSSDS 510

Query: 1635 XXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHY 1459
                 GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPRFKAVP Y+AR+ +FEH+
Sbjct: 511  DDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAVPSYSARKTIFEHF 570

Query: 1458 VRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKD 1279
            VRT                 + FKQLLEEASE+IDHKTDY +FKRKWGSDPRF  LDRK+
Sbjct: 571  VRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTFKRKWGSDPRFGVLDRKE 630

Query: 1278 REALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRY 1099
            RE LLNE+V    KAAE+K+Q  R AAV+SFKSMLRD+ DI T+SRWSRVK++LRNDPRY
Sbjct: 631  RELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITTTSRWSRVKENLRNDPRY 686

Query: 1098 KSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXR 919
            K+VKHE+R  LFNEYISEL                EQ+KL                   R
Sbjct: 687  KAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKEREREMRKRKEREEQEMER 746

Query: 918  VRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFRE 739
            VRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKDPQGRA+NPDL + D EKLFR+
Sbjct: 747  VRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGRATNPDLGQGDAEKLFRD 806

Query: 738  HVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKD 559
            HVK LYER AR FR LL+EVITAE AAQ TDDGK +L SWSEAKRLLKPDPRYSKMP KD
Sbjct: 807  HVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEAKRLLKPDPRYSKMPGKD 866

Query: 558  RESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSR 409
            RE +W+R+AE+M R+QKP+   +EKP ++ +N+ S+D  R  +PRR+H R
Sbjct: 867  REYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFSRR-SPRRSHGR 915


>ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Vitis
            vinifera]
          Length = 1013

 Score =  905 bits (2338), Expect = 0.0
 Identities = 512/976 (52%), Positives = 606/976 (62%), Gaps = 8/976 (0%)
 Frame = -2

Query: 3309 PSFSYNLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGRPNQFVPG 3130
            PSFSY+ I      S   QQL +G+V            P  +    Q PVPG        
Sbjct: 80   PSFSYSGIPHVTTASGTSQQLPSGSVISSN--------PLASTVVFQTPVPG-------- 123

Query: 3129 TIPQNMPASMQSP-ISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQSSNTSASAAVAQE 2953
                  P+S   P  S    H                 A         S+T  S AVAQE
Sbjct: 124  ------PSSSSGPSFSYNIAHKG---------------AGFPGSQPFQSSTDNSGAVAQE 162

Query: 2952 TGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXX 2773
             G++ +AS  SQS   P    SSS+M V ++P + P T+W                    
Sbjct: 163  AGSMSSASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTP 219

Query: 2772 XXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMA 2593
                           A P+A    +S                  + QQ+YP Y S PA  
Sbjct: 220  GPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATN 279

Query: 2592 PTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXX 2413
             + QG WLQPPQ+ GL RPP++PYP   P PFPLP  G+                     
Sbjct: 280  ASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTA 339

Query: 2412 XXXXXXAG-SGQP---TSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTE 2245
                  A  SG     TS + ++ PPPGID +K  +G  + +G  A +E  D WTAHKT+
Sbjct: 340  GGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA-AVNEQVDAWTAHKTD 398

Query: 2244 TGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHN 2065
            TG VYYYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+N
Sbjct: 399  TGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYN 458

Query: 2064 TKTKVSSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKGSAPVSLSVPAVNTGGRE 1888
            TKTK+SSWQ+P E+ E+RK+QDS +L+  +M + N +   +KG +P++LS PAV TGGR+
Sbjct: 459  TKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRD 518

Query: 1887 AMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQ 1708
            A  LRTS    S+SALDM+KKKLQD+G P TSSP+ +S  P+ S+LNG   +E   KG Q
Sbjct: 519  ATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIASELNGSRVIEPTVKGLQ 577

Query: 1707 SENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELP 1531
            SENSK+KLKD NGDGNM            GPTKEECIIQFKEMLKERGVAPFSKWEKELP
Sbjct: 578  SENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELP 637

Query: 1530 KILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDH 1351
            KI+FDPRFKA+PGY+ARR+LFEHYVRT                 EGFKQLLEEASEDIDH
Sbjct: 638  KIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDH 697

Query: 1350 KTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLR 1171
            KT+Y +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K Q  RAAAVSSFKSMLR
Sbjct: 698  KTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLR 757

Query: 1170 DSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXE 991
            D GDI TS+RWSRVKDSLRNDPRYK VKHEDRE+LFNEYISEL                E
Sbjct: 758  DKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEE 817

Query: 990  QDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKL 811
            QDKL                   RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PKL
Sbjct: 818  QDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKL 877

Query: 810  EKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNV 631
            EKDPQ RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK V
Sbjct: 878  EKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTV 937

Query: 630  LTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISA 451
            LTSWS AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK +  + E+ H+E K + S 
Sbjct: 938  LTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSV 997

Query: 450  DSERSPA-PRRTHSRR 406
            DS R P+  RR H RR
Sbjct: 998  DSGRFPSGSRRAHERR 1013



 Score = 63.9 bits (154), Expect = 1e-06
 Identities = 73/301 (24%), Positives = 104/301 (34%), Gaps = 30/301 (9%)
 Frame = -2

Query: 3657 SSQSSIPGMTPQAPVSGPTVAPSIQVSXXXXXXXXXXXXXXPNSSEPSNDSVRAKFVTTA 3478
            +SQ+ + G+    P  GP   P+  ++                +S    +S + KFV   
Sbjct: 15   ASQNPVTGLPAGGPSGGPPT-PTGAIAPASVATIRTSEGASGTASNSIQESAQGKFVNAP 73

Query: 3477 GFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFS 3298
              V+P PSF YS I                  +   P  S    Q PVP  SS S PSFS
Sbjct: 74   PHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTPVPGPSSSSGPSFS 133

Query: 3297 YNLISQPNVGSANGQQLQTGTVTGPGNIQ----------VGKFVP----PNTAASLQPPV 3160
            YN I+    G    Q  Q+ T       Q          V + VP     +T +    P 
Sbjct: 134  YN-IAHKGAGFPGSQPFQSSTDNSGAVAQEAGSMSSASHVSQSVPFPCSSSTMSVSSSPK 192

Query: 3159 PGRPNQFVPGT----IPQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVA-------- 3016
             G    ++P      +P  MP +  +P     G P I           +P A        
Sbjct: 193  MGPTTLWMPSNPSFPVPSGMPVTPGTP-----GPPGIAPSTPLSSNLAVPSASMDFSSSV 247

Query: 3015 ---AESPQNKQSSNTSASAAVAQETGTVPAASSSSQSTAL-PVYVSSSSSMIVPAAPSVY 2848
               A  P    SSN +    +     ++PA ++SSQ   L P  +           P+VY
Sbjct: 248  VSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPYPAVY 307

Query: 2847 P 2845
            P
Sbjct: 308  P 308


>ref|XP_009388080.1| PREDICTED: pre-mRNA-processing protein 40C [Musa acuminata subsp.
            malaccensis]
          Length = 1128

 Score =  904 bits (2335), Expect = 0.0
 Identities = 529/1063 (49%), Positives = 647/1063 (60%), Gaps = 24/1063 (2%)
 Frame = -2

Query: 3522 EPSNDSVRAKFVTTAGFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQ 3343
            + S DS+RAKF +  GFVV APSF Y VI               +  +K TPP  AAALQ
Sbjct: 87   DTSQDSIRAKFSSPPGFVVAAPSFSYGVIPRTNLTSGNPQQSSSS-GLKLTPPVPAAALQ 145

Query: 3342 PPVPRQSSGSVPSFSYNLISQPNVGSANGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPP 3163
            PPVP Q  G+ P F YN++S  NV  A GQQ+Q  TV    ++Q GKF+PP+ A+SLQPP
Sbjct: 146  PPVPGQFLGTRP-FPYNVVSHANVVPAAGQQIQLNTVPVQAHLQGGKFIPPS-ASSLQPP 203

Query: 3162 VPG---RPNQFVPGTIPQNMPASMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKQ 2992
            VP    RP  F PG +    P+ MQ P+SVP+G             +Q   A E  +   
Sbjct: 204  VPRQPVRPTPFGPGAVSLISPSPMQFPLSVPQGDAIKQTNFSFSGHNQFSTA-EKDETIL 262

Query: 2991 SSNTSASAAVAQET----GTVPAASSSSQSTALPVYVSS---------SSSMIVPAAPSV 2851
            SS    S AVA ET     T+  + S   S ++P+  S+         ++SM++PAAPS 
Sbjct: 263  SSEKCTSDAVAVETTSDSSTLVNSQSVQTSQSMPLGTSTGLGINANACAASMLIPAAPSF 322

Query: 2850 YPMTIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARP-----AAMDPSASLRP 2686
                                                   ++ RP     AA+ P+++  P
Sbjct: 323  TAHAEMPNARGIPGLTGNSSSATASTGATIKPTPTNSSISSPRPIIPVTAALPPTSTSVP 382

Query: 2685 MXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLP 2506
            +                QQ    Y SQP MAP+PQ  W  PPQ   +Q   + PYP   P
Sbjct: 383  VPFPVPQNV-------QQQTNVHYSSQPTMAPSPQASWSHPPQAGPMQHVSFSPYPGFFP 435

Query: 2505 APFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDK 2326
            APF LPV+GI                           AGS QP SS+  +S    +DQDK
Sbjct: 436  APFSLPVQGIPPAVPLPFIQPPGVSLMVSQVEPTAVTAGSLQPGSSMVAESSSSVVDQDK 495

Query: 2325 QSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQS 2146
            +S+      G+ + +E  + WTAHKTETGAVYYYNS+TG+STY++PS+FKGE +K   QS
Sbjct: 496  KSNNLDKDEGDTS-NELENAWTAHKTETGAVYYYNSITGKSTYQKPSNFKGESEKATTQS 554

Query: 2145 TPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS- 1969
              VS EKL GTDW +V+T+DG+KYY++TK KVSSW +P EVAELRK Q+S S + S T  
Sbjct: 555  NAVSWEKLAGTDWTIVTTSDGRKYYYDTKNKVSSWHVPAEVAELRKNQESGSTEGSATQL 614

Query: 1968 QNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSS 1789
            Q+AS   DK SAP +++ PA   G  ++MALR+SGA  SSSALDMVKKKLQ+AG P+TS 
Sbjct: 615  QDASTQGDKVSAPANIAAPAAQIGAHDSMALRSSGAPVSSSALDMVKKKLQEAGTPMTSP 674

Query: 1788 PLPASSVPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTK 1612
               ++SVP TSD NGL   EA+AKG  +   K+K KDANG+GNM            GP+K
Sbjct: 675  H--STSVPATSDANGLKATEAVAKGVIN---KDKAKDANGEGNMSDSSSDSDDEESGPSK 729

Query: 1611 EECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXX 1432
            EECIIQFKEMLKERGVAPFSKW+KELPKI+FDPRFKAVP  +ARRALFEHYVRT      
Sbjct: 730  EECIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAVPSQSARRALFEHYVRTRAEEER 789

Query: 1431 XXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERV 1252
                       + FKQLLEEA EDIDHKTDYHSFKRKWG DPRFEA+DRK+RE LLNE+V
Sbjct: 790  KEKRAAQKAALDAFKQLLEEALEDIDHKTDYHSFKRKWGGDPRFEAIDRKERELLLNEKV 849

Query: 1251 LPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDRE 1072
                KAA++K++  R AA +SFKSMLRD+ DI TSSRWSR+K+SLR+DPRYK+VKHE RE
Sbjct: 850  ----KAADEKMRALRMAAATSFKSMLRDNRDITTSSRWSRIKESLRDDPRYKAVKHEQRE 905

Query: 1071 VLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKE 892
             LFNEYI+EL                EQDKL                   RV+LKVRRKE
Sbjct: 906  TLFNEYIAELKSAVDEVERSAKAKRDEQDKLKERERELRKRKEREEKEMERVKLKVRRKE 965

Query: 891  AVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERS 712
            A  SY+ LLVE IKDPKASWTES PKLEKDPQGRA+NPDL + D EKLFREHVK LYER 
Sbjct: 966  AEYSYRTLLVEMIKDPKASWTESKPKLEKDPQGRATNPDLTQEDAEKLFREHVKDLYERC 1025

Query: 711  AREFRALLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFA 532
              +FR LLAEV+T E AA   DDGK VL SWSEAK LLKPDPRYSKMP KDRES+W+R  
Sbjct: 1026 VNDFRTLLAEVVTVEAAAAKNDDGKTVLNSWSEAKLLLKPDPRYSKMPSKDRESLWRRHT 1085

Query: 531  EEMQRRQKPSDSKEEKPHSEFKNKISADSE-RSPAPRRTHSRR 406
            E+M RR K     +E P +  +N++S+ ++    +P R+H RR
Sbjct: 1086 EDMLRRPKSVSDTKESPGTNGRNRMSSAADPLKRSPGRSHRRR 1128


>ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Vitis
            vinifera]
          Length = 903

 Score =  898 bits (2320), Expect = 0.0
 Identities = 488/873 (55%), Positives = 578/873 (66%), Gaps = 7/873 (0%)
 Frame = -2

Query: 3003 QNKQSSNTSASAAVAQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQX 2824
            Q  Q+  +  S AVAQE G++ +AS  SQS   P    SSS+M V ++P + P T+W   
Sbjct: 36   QKDQTLKSDNSGAVAQEAGSMSSASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPS 92

Query: 2823 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXX 2644
                                            A P+A    +S                 
Sbjct: 93   NPSFPVPSGMPVTPGTPGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNP 152

Query: 2643 XVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXX 2464
             + QQ+YP Y S PA   + QG WLQPPQ+ GL RPP++PYP   P PFPLP  G+    
Sbjct: 153  AIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPS 212

Query: 2463 XXXXXXXXXXXXXXXXXXXXXXXAG-SGQP---TSSVGTQSPPPGIDQDKQSDGNTSTNG 2296
                                   A  SG     TS + ++ PPPGID +K  +G  + +G
Sbjct: 213  VPLPDSQPPGVTPVGTAGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG 272

Query: 2295 EIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVG 2116
              A +E  D WTAHKT+TG VYYYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL G
Sbjct: 273  A-AVNEQVDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTG 331

Query: 2115 TDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKG 1939
            TDW LV+TNDGKKYY+NTKTK+SSWQ+P E+ E+RK+QDS +L+  +M + N +   +KG
Sbjct: 332  TDWALVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKG 391

Query: 1938 SAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVT 1759
             +P++LS PAV TGGR+A  LRTS    S+SALDM+KKKLQD+G P TSSP+ +S  P+ 
Sbjct: 392  PSPIALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIA 450

Query: 1758 SDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEM 1582
            S+LNG   +E   KG QSENSK+KLKD NGDGNM            GPTKEECIIQFKEM
Sbjct: 451  SELNGSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEM 510

Query: 1581 LKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXX 1402
            LKERGVAPFSKWEKELPKI+FDPRFKA+PGY+ARR+LFEHYVRT                
Sbjct: 511  LKERGVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAA 570

Query: 1401 XEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQK 1222
             EGFKQLLEEASEDIDHKT+Y +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K
Sbjct: 571  IEGFKQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEK 630

Query: 1221 IQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISEL 1042
             Q  RAAAVSSFKSMLRD GDI TS+RWSRVKDSLRNDPRYK VKHEDRE+LFNEYISEL
Sbjct: 631  AQAIRAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISEL 690

Query: 1041 XXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLV 862
                            EQDKL                   RVRLKVRRKEAV+SYQALLV
Sbjct: 691  KAAEEEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLV 750

Query: 861  ETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAE 682
            ETIKDP+ SWTES PKLEKDPQ RA+N DLD +D EKLFREH+K L+ER A EFRALL+E
Sbjct: 751  ETIKDPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSE 810

Query: 681  VITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPS 502
            V+TAE A Q T+DGK VLTSWS AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK +
Sbjct: 811  VLTAEAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLA 870

Query: 501  DSKEEKPHSEFKNKISADSERSPA-PRRTHSRR 406
              + E+ H+E K + S DS R P+  RR H RR
Sbjct: 871  QDQTEEKHTEVKGRSSVDSGRFPSGSRRAHERR 903


>ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C isoform X4 [Vitis
            vinifera]
          Length = 848

 Score =  882 bits (2280), Expect = 0.0
 Identities = 479/851 (56%), Positives = 565/851 (66%), Gaps = 7/851 (0%)
 Frame = -2

Query: 2937 AASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXXXX 2758
            +AS  SQS   P    SSS+M V ++P + P T+W                         
Sbjct: 3    SASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPGI 59

Query: 2757 XXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTPQG 2578
                      A P+A    +S                  + QQ+YP Y S PA   + QG
Sbjct: 60   APSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQG 119

Query: 2577 HWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXX 2398
             WLQPPQ+ GL RPP++PYP   P PFPLP  G+                          
Sbjct: 120  PWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPI 179

Query: 2397 XAG-SGQP---TSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVY 2230
             A  SG     TS + ++ PPPGID +K  +G  + +G  A +E  D WTAHKT+TG VY
Sbjct: 180  SAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA-AVNEQVDAWTAHKTDTGVVY 238

Query: 2229 YYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKV 2050
            YYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+NTKTK+
Sbjct: 239  YYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKL 298

Query: 2049 SSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALR 1873
            SSWQ+P E+ E+RK+QDS +L+  +M + N +   +KG +P++LS PAV TGGR+A  LR
Sbjct: 299  SSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLR 358

Query: 1872 TSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSENSK 1693
            TS    S+SALDM+KKKLQD+G P TSSP+ +S  P+ S+LNG   +E   KG QSENSK
Sbjct: 359  TSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIASELNGSRVIEPTVKGLQSENSK 417

Query: 1692 EKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFD 1516
            +KLKD NGDGNM            GPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FD
Sbjct: 418  DKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFD 477

Query: 1515 PRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYH 1336
            PRFKA+PGY+ARR+LFEHYVRT                 EGFKQLLEEASEDIDHKT+Y 
Sbjct: 478  PRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQ 537

Query: 1335 SFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDI 1156
            +F++KWG DPRFEALDRKDRE LLNERVLPLK+AAE+K Q  RAAAVSSFKSMLRD GDI
Sbjct: 538  TFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDI 597

Query: 1155 NTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLX 976
             TS+RWSRVKDSLRNDPRYK VKHEDRE+LFNEYISEL                EQDKL 
Sbjct: 598  TTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLK 657

Query: 975  XXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQ 796
                              RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PKLEKDPQ
Sbjct: 658  ERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQ 717

Query: 795  GRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTSWS 616
             RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK VLTSWS
Sbjct: 718  ARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWS 777

Query: 615  EAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERS 436
             AKRLL+ D RY KMPRKDRES+W+R++EEM R+QK +  + E+ H+E K + S DS R 
Sbjct: 778  TAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDSGRF 837

Query: 435  PA-PRRTHSRR 406
            P+  RR H RR
Sbjct: 838  PSGSRRAHERR 848


>ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [Gossypium raimondii]
            gi|763747828|gb|KJB15267.1| hypothetical protein
            B456_002G167700 [Gossypium raimondii]
          Length = 887

 Score =  843 bits (2177), Expect = 0.0
 Identities = 466/876 (53%), Positives = 573/876 (65%), Gaps = 8/876 (0%)
 Frame = -2

Query: 3009 SPQNKQSSNTSASAAVAQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2842
            +PQ  Q++    S +    TGT   A+SS    SQS  LPV+ SS  +M     PS  P+
Sbjct: 24   NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83

Query: 2841 TIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2662
            T  ++                                 A  A   PS+++          
Sbjct: 84   T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136

Query: 2661 XXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2482
                   V QQ+YPPY S P+M  +PQG+W+Q P + G  RPP++PYPT  P PFP    
Sbjct: 137  PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196

Query: 2481 GIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTST 2302
            G+                           A + Q + ++ T  PP GID  K    + +T
Sbjct: 197  GMPLPAPSSDSQPPGVRPLGMSPFAPSAAALANQ-SLAILTGFPPQGIDNRKLVH-DVTT 254

Query: 2301 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2122
              E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L
Sbjct: 255  KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314

Query: 2121 VGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMD 1945
             GTDW LV+TNDGKKYY+N+KTK+SSWQ+P EV ELRK+QDS+ S + +++  N     +
Sbjct: 315  AGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAE 374

Query: 1944 KGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVP 1765
            KGS P+SLS PAVNTGGR+AM LRTS    SSSALD++KKKLQD G+P +SSP+P   V 
Sbjct: 375  KGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVT 433

Query: 1764 VTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFK 1588
             T +LNG   V+   KG QSE++K+KLKDANGDG++            GP+KEECI+QFK
Sbjct: 434  ATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFK 491

Query: 1587 EMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXX 1408
            EMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T              
Sbjct: 492  EMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQK 551

Query: 1407 XXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAE 1228
               EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL LK+AAE
Sbjct: 552  AAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAE 611

Query: 1227 QKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYIS 1048
            +K +  RAAA SSFKSML++ GDIN +SRWSRVKDSLR+DPRYK VKHEDREVLFNEYIS
Sbjct: 612  EKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYIS 671

Query: 1047 ELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQAL 868
            EL                E++KL                   RVRLKVRRKEAVAS+QAL
Sbjct: 672  ELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQAL 731

Query: 867  LVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALL 688
            LVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER   +FRALL
Sbjct: 732  LVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALL 791

Query: 687  AEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK 508
            AEVIT +  AQ T+ GK  L SWS AKRLLKPDPRY+KMPRK+RE++W+R+AE+M R+QK
Sbjct: 792  AEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQK 851

Query: 507  PSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 406
             +  +EE+ H++ K + S       S   RRTH RR
Sbjct: 852  SALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 887


>gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium raimondii]
          Length = 888

 Score =  838 bits (2166), Expect = 0.0
 Identities = 467/877 (53%), Positives = 573/877 (65%), Gaps = 9/877 (1%)
 Frame = -2

Query: 3009 SPQNKQSSNTSASAAVAQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2842
            +PQ  Q++    S +    TGT   A+SS    SQS  LPV+ SS  +M     PS  P+
Sbjct: 24   NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83

Query: 2841 TIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2662
            T  ++                                 A  A   PS+++          
Sbjct: 84   T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136

Query: 2661 XXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2482
                   V QQ+YPPY S P+M  +PQG+W+Q P + G  RPP++PYPT  P PFP    
Sbjct: 137  PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196

Query: 2481 GIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTST 2302
            G+                           A + Q + ++ T  PP GID  K    + +T
Sbjct: 197  GMPLPAPSSDSQPPGVRPLGMSPFAPSAAALANQ-SLAILTGFPPQGIDNRKLVH-DVTT 254

Query: 2301 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2122
              E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L
Sbjct: 255  KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314

Query: 2121 VGTDWVLVSTNDGKKYYHNTKTKV-SSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGM 1948
             GTDW LV+TNDGKKYY+N+KTKV SSWQ+P EV ELRK+QDS+ S + +++  N     
Sbjct: 315  AGTDWALVTTNDGKKYYYNSKTKVISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVA 374

Query: 1947 DKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSV 1768
            +KGS P+SLS PAVNTGGR+AM LRTS    SSSALD++KKKLQD G+P +SSP+P   V
Sbjct: 375  EKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPV 433

Query: 1767 PVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQF 1591
              T +LNG   V+   KG QSE++K+KLKDANGDG++            GP+KEECI+QF
Sbjct: 434  TATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQF 491

Query: 1590 KEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXX 1411
            KEMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T             
Sbjct: 492  KEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQ 551

Query: 1410 XXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAA 1231
                EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL LK+AA
Sbjct: 552  KAAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAA 611

Query: 1230 EQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYI 1051
            E+K +  RAAA SSFKSML++ GDIN +SRWSRVKDSLR+DPRYK VKHEDREVLFNEYI
Sbjct: 612  EEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYI 671

Query: 1050 SELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQA 871
            SEL                E++KL                   RVRLKVRRKEAVAS+QA
Sbjct: 672  SELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQA 731

Query: 870  LLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRAL 691
            LLVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER   +FRAL
Sbjct: 732  LLVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRAL 791

Query: 690  LAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQ 511
            LAEVIT +  AQ T+ GK  L SWS AKRLLKPDPRY+KMPRK+RE++W+R+AE+M R+Q
Sbjct: 792  LAEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQ 851

Query: 510  KPSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 406
            K +  +EE+ H++ K + S       S   RRTH RR
Sbjct: 852  KSALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 888


>gb|KJB15269.1| hypothetical protein B456_002G167700 [Gossypium raimondii]
          Length = 886

 Score =  838 bits (2166), Expect = 0.0
 Identities = 465/876 (53%), Positives = 572/876 (65%), Gaps = 8/876 (0%)
 Frame = -2

Query: 3009 SPQNKQSSNTSASAAVAQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2842
            +PQ  Q++    S +    TGT   A+SS    SQS  LPV+ SS  +M     PS  P+
Sbjct: 24   NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83

Query: 2841 TIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2662
            T  ++                                 A  A   PS+++          
Sbjct: 84   T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136

Query: 2661 XXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2482
                   V QQ+YPPY S P+M  +PQG+W+Q P + G  RPP++PYPT  P PFP    
Sbjct: 137  PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196

Query: 2481 GIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQSPPPGIDQDKQSDGNTST 2302
            G+                           A + Q + ++ T  PP GID  K    + +T
Sbjct: 197  GMPLPAPSSDSQPPGVRPLGMSPFAPSAAALANQ-SLAILTGFPPQGIDNRKLVH-DVTT 254

Query: 2301 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2122
              E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L
Sbjct: 255  KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314

Query: 2121 VGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMD 1945
             GTDW LV+TNDGKKYY+N+KTK+SSWQ+P EV ELRK+QDS+ S + +++  N     +
Sbjct: 315  AGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAE 374

Query: 1944 KGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVP 1765
            KGS P+SLS PAVNTGGR+AM LRTS    SSSALD++KKKLQD G+P +SSP+P   V 
Sbjct: 375  KGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVT 433

Query: 1764 VTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFK 1588
             T +LNG   V+   KG QSE++K+KLKDANGDG++            GP+KEECI+QFK
Sbjct: 434  ATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFK 491

Query: 1587 EMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXX 1408
            EMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T              
Sbjct: 492  EMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQK 551

Query: 1407 XXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAE 1228
               EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL LK+AAE
Sbjct: 552  AAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAE 611

Query: 1227 QKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYIS 1048
            +K +  RAAA SSFKSML++ GDIN +SRWSRVKDSLR+DPRYK VKHEDREVLFNEYIS
Sbjct: 612  EKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYIS 671

Query: 1047 ELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQAL 868
            EL                 ++KL                   RVRLKVRRKEAVAS+QAL
Sbjct: 672  ELKAIEEKAERKDKVKKE-EEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQAL 730

Query: 867  LVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALL 688
            LVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER   +FRALL
Sbjct: 731  LVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALL 790

Query: 687  AEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQK 508
            AEVIT +  AQ T+ GK  L SWS AKRLLKPDPRY+KMPRK+RE++W+R+AE+M R+QK
Sbjct: 791  AEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQK 850

Query: 507  PSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 406
             +  +EE+ H++ K + S       S   RRTH RR
Sbjct: 851  SALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 886


>ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma cacao]
            gi|508709257|gb|EOY01154.1| Pre-mRNA-processing protein
            40C [Theobroma cacao]
          Length = 816

 Score =  831 bits (2146), Expect = 0.0
 Identities = 437/749 (58%), Positives = 527/749 (70%), Gaps = 6/749 (0%)
 Frame = -2

Query: 2634 QQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGIXXXXXXX 2455
            QQ+YP Y   P+MA +PQG W+Q P + G  RPP++PYPT  P PFP    G+       
Sbjct: 75   QQIYPTYTPLPSMASSPQGFWMQHPPMGGFPRPPFVPYPTIYPGPFPSASSGMPHPAPSS 134

Query: 2454 XXXXXXXXXXXXXXXXXXXXAGSGQPTSSVGTQS--PPPGIDQDKQSDGNTSTNGEIAKS 2281
                                  + Q + + G Q+  PP GID     + N  T  E A +
Sbjct: 135  DSQPPGVSPLATSPFAPSIAIPANQSSVASGIQTGFPPQGID-----NRNVGTRVEAAVN 189

Query: 2280 EDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVL 2101
            E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPDKVPVQ TPVS E+L GT+W L
Sbjct: 190  EQSDIWTAHKTDTGIVYYYNALTGESTYEKPAGFKGEPDKVPVQPTPVSVEQLAGTEWAL 249

Query: 2100 VSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMDKGSAPVS 1924
            V+T+DGKKYY+N+KTK+SSWQ+P EVAELRK+QD+D S + ++   N     +KGS P+S
Sbjct: 250  VTTSDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDVSKEHAVPVPNIDVVAEKGSTPIS 309

Query: 1923 LSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSP-LPASSVPVTSDLN 1747
            LS PAV+TGGR+AM LRTS    SSSALD++KKKLQD+G+P +SS  +P   V    +LN
Sbjct: 310  LSAPAVSTGGRDAMPLRTSVVPGSSSALDLIKKKLQDSGVPSSSSSSVPVMPVTAAQELN 369

Query: 1746 GLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKER 1570
            G   V+   KG QSENSK+KLKDANGDGN+            GP+KEECI+QFKEMLKER
Sbjct: 370  GSRAVDV--KGLQSENSKDKLKDANGDGNISDSSSDSEDTDSGPSKEECIMQFKEMLKER 427

Query: 1569 GVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1390
            GVAPFSKWEKELPKI+FDPRFKA+P ++ARR LFEHYV+T                 EGF
Sbjct: 428  GVAPFSKWEKELPKIVFDPRFKAIPSHSARRTLFEHYVKTRAEEERREKRAALKAAIEGF 487

Query: 1389 KQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKAAEQKIQEQ 1210
            KQLL+EASEDIDH T+Y +FKRKWGSD RFEALDRKDRE LL ERVLPLK+AAE+K Q  
Sbjct: 488  KQLLDEASEDIDHNTNYQTFKRKWGSDLRFEALDRKDRELLLTERVLPLKRAAEEKAQAI 547

Query: 1209 RAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEYISELXXXX 1030
            RAAA SS KSML++ GDI  +SRWSRVKDS+R+DPRYK VKHEDREVLFNEYISEL    
Sbjct: 548  RAAAASSLKSMLKEKGDITVNSRWSRVKDSIRDDPRYKCVKHEDREVLFNEYISELKAVE 607

Query: 1029 XXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIK 850
                        E++KL                   RVRLKVRRKEAVAS+QALLVETIK
Sbjct: 608  EKAERKERVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIK 667

Query: 849  DPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITA 670
            DP+ASWTES PKLEKDPQGRA+NPDLD +DTEKLFREH+K L+ER   +FRALLAEVIT 
Sbjct: 668  DPQASWTESKPKLEKDPQGRAANPDLDPSDTEKLFREHIKMLFERCTHDFRALLAEVITQ 727

Query: 669  ETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRRQKPSDSKE 490
            + AAQ T+ GK V  SWS AKRLLKPDPRYSKMPRK+RE++W+R+AE+M R+QK +  +E
Sbjct: 728  DAAAQETEGGKTVFNSWSTAKRLLKPDPRYSKMPRKEREALWRRYAEDMLRKQKSALDQE 787

Query: 489  EKPHSEFKNKISADSER-SPAPRRTHSRR 406
            E+  ++ K + S D  R S   R+ H RR
Sbjct: 788  EEKRTDAKVRSSGDLGRFSSGSRKVHERR 816


>ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-like [Citrus sinensis]
          Length = 978

 Score =  828 bits (2139), Expect = 0.0
 Identities = 467/936 (49%), Positives = 569/936 (60%), Gaps = 9/936 (0%)
 Frame = -2

Query: 3186 TAASLQPPVPGRPNQFVPGTIPQ------NMPASMQSPISVPKGHPSIXXXXXXXXXSQL 3025
            T  S+  P   +      G IPQ      N   S  S  SV   +PS+         S  
Sbjct: 47   TNDSISGPSQAKSVTATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSAS 106

Query: 3024 PVAAESPQNKQSSNTSASAAVAQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYP 2845
                    N+Q           ++ G   + S++SQ     V   S S++   +A ++  
Sbjct: 107  QTVVGYSPNQQFQPNMNKLEAVEDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALST 166

Query: 2844 MTIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXX 2665
             T W                                 ++A       SA LRP       
Sbjct: 167  TTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRPSVPTPSA 225

Query: 2664 XXXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPV 2485
                     HQ +YP Y S P +  +PQG  LQPPQ+      P++PYP   P+PFPLP 
Sbjct: 226  PSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLQPPQMGVRPWLPFLPYPAAYPSPFPLPA 284

Query: 2484 RGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSPPPGIDQDKQSDGN 2311
             G+                           A  G     +S  T++PP G D+ K+   +
Sbjct: 285  HGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK-KEHVHD 343

Query: 2310 TSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVST 2131
             S+    + +E  D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPVQ TP+S 
Sbjct: 344  VSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPISM 403

Query: 2130 EKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFG 1951
            E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+   +  N +  
Sbjct: 404  EHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQSVPNTNIV 462

Query: 1950 MDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASS 1771
            ++KGS  +SLS PAVNTGGR+A ALRTS    SSSALD++KKKLQD+G P T+SP P SS
Sbjct: 463  IEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVSS 521

Query: 1770 VPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXXXXSGPTKEECIIQ 1594
               TS+ NG   VE   KG Q+EN+K+KLKD NGDG M           +GPTKEECII+
Sbjct: 522  AAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIK 581

Query: 1593 FKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXX 1414
            FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+   +ARRALFE YV+T            
Sbjct: 582  FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAA 641

Query: 1413 XXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKA 1234
                 EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNERVLPLK+A
Sbjct: 642  QKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRA 701

Query: 1233 AEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEY 1054
            AE+K Q  RAAA SSFKSMLR+ GDI  SSRWS+VKD LR+DPRYKSV+HEDREV+FNEY
Sbjct: 702  AEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEY 761

Query: 1053 ISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQ 874
            + EL                EQ+KL                   RVRLKVRRKEAV S+Q
Sbjct: 762  VRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQ 821

Query: 873  ALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRA 694
            ALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER A +FR 
Sbjct: 822  ALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRG 881

Query: 693  LLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRR 514
            LLAEVITAE AAQ T+DGK VL SWS AKR+LKP+PRYSKMPRK+RE++W+R AEE+QR+
Sbjct: 882  LLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRK 941

Query: 513  QKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 406
             K S  + E  H + K++ S D  R P+  R +  R
Sbjct: 942  HKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 977


>gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis]
          Length = 978

 Score =  827 bits (2135), Expect = 0.0
 Identities = 466/936 (49%), Positives = 569/936 (60%), Gaps = 9/936 (0%)
 Frame = -2

Query: 3186 TAASLQPPVPGRPNQFVPGTIPQ------NMPASMQSPISVPKGHPSIXXXXXXXXXSQL 3025
            T  S+  P   +      G IPQ      N   S  S  SV   +PS+         S  
Sbjct: 47   TNDSISGPSQAKSVTATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSAS 106

Query: 3024 PVAAESPQNKQSSNTSASAAVAQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYP 2845
                    N+Q           ++ G   + S++SQ     V   S S++   +A ++  
Sbjct: 107  QTVVGYSPNQQFQPNMNKLEAVEDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALST 166

Query: 2844 MTIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXX 2665
             T W                                 ++A       SA LRP       
Sbjct: 167  TTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRPSVPTPSA 225

Query: 2664 XXXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPV 2485
                     HQ +YP Y S P +  +PQG  L+PPQ+      P++PYP   P+PFPLP 
Sbjct: 226  PSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLRPPQMGVRPWLPFLPYPAAYPSPFPLPA 284

Query: 2484 RGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSPPPGIDQDKQSDGN 2311
             G+                           A  G     +S  T++PP G D+ K+   +
Sbjct: 285  HGMPNPSVSQIDAQPPGLSSVRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK-KEHVHD 343

Query: 2310 TSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVST 2131
             S+    + +E  D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPVQ TP+S 
Sbjct: 344  VSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPISM 403

Query: 2130 EKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFG 1951
            E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+   +  N +  
Sbjct: 404  EHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQSVPNTNIV 462

Query: 1950 MDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASS 1771
            ++KGS  +SLS PAVNTGGR+A ALRTS    SSSALD++KKKLQD+G P T+SP P SS
Sbjct: 463  IEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVSS 521

Query: 1770 VPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXXXXSGPTKEECIIQ 1594
               TS+ NG   VE   KG Q+EN+K+KLKD NGDG M           +GPTKEECII+
Sbjct: 522  AAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIK 581

Query: 1593 FKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXX 1414
            FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+   +ARRALFE YV+T            
Sbjct: 582  FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAA 641

Query: 1413 XXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPLKKA 1234
                 EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNERVLPLK+A
Sbjct: 642  QKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRA 701

Query: 1233 AEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDREVLFNEY 1054
            AE+K Q  RAAA SSFKSMLR+ GDI  SSRWS+VKD LR+DPRYKSV+HEDREV+FNEY
Sbjct: 702  AEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEY 761

Query: 1053 ISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQ 874
            + EL                EQ+KL                   RVRLKVRRKEAV S+Q
Sbjct: 762  VRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQ 821

Query: 873  ALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRA 694
            ALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER A +FR 
Sbjct: 822  ALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRG 881

Query: 693  LLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAEEMQRR 514
            LLAEVITAE AAQ T+DGK VL SWS AKR+LKP+PRYSKMPRK+RE++W+R AEE+QR+
Sbjct: 882  LLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRK 941

Query: 513  QKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 406
             K S  + E  H + K++ S D  R P+  R +  R
Sbjct: 942  HKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 977


>ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citrus clementina]
            gi|557539684|gb|ESR50728.1| hypothetical protein
            CICLE_v10030612mg [Citrus clementina]
          Length = 1015

 Score =  827 bits (2135), Expect = 0.0
 Identities = 478/1003 (47%), Positives = 589/1003 (58%), Gaps = 9/1003 (0%)
 Frame = -2

Query: 3387 PAVKFTPPTSAAALQPPVPRQSSGSVPSFSYNLISQPNVGSANGQQLQTGTVTGPGNIQV 3208
            P ++     ++ A  PP  +Q + + P     +  +P  GS       T T  G      
Sbjct: 31   PFIRSDQIMTSPAWLPPEVQQLTANAP-----ISGKPVGGSLVASSTPTPTSNGSDTATN 85

Query: 3207 GKFVPPNTAASLQPPVPGRPNQFVPGTIPQ------NMPASMQSPISVPKGHPSIXXXXX 3046
                 P+ A S+             G IPQ      N   S  S  SV   +PS+     
Sbjct: 86   DSISGPSQAKSVTA---------TGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVS 136

Query: 3045 XXXXSQLPVAAESPQNKQSSNTSASAAVAQETGTVPAASSSSQSTALPVYVSSSSSMIVP 2866
                S          N+Q           ++ G   + S++SQ     V   S S++   
Sbjct: 137  SFTYSASQTVVGYSPNQQFQPNMNKLEAVEDAGLGSSTSTNSQPVQASVRTFSDSTVATS 196

Query: 2865 AAPSVYPMTIWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRP 2686
            +A ++   T W                                 ++A       SA LRP
Sbjct: 197  SATALSTTTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRP 255

Query: 2685 MXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMPYPTGLP 2506
                            HQ +YP + S P +  +PQ   LQPPQ+      P++PYP   P
Sbjct: 256  SVPTPSAPSNSGSAIQHQ-IYPTHPSLPPVGVSPQRPLLQPPQMGVRPWLPFLPYPAAYP 314

Query: 2505 APFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSPPPGIDQ 2332
            +PFPLP  G+                           A  G     +S  T++PP G D+
Sbjct: 315  SPFPLPAHGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK 374

Query: 2331 DKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPV 2152
             K+   + S+    + +E  D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPV
Sbjct: 375  -KEHVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPV 433

Query: 2151 QSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMT 1972
            Q TP+S E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+   +
Sbjct: 434  QPTPISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQS 492

Query: 1971 SQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTS 1792
              N +  ++KGS  +SLS PAVNTGGR+A ALRTS    SSSALD++KKKLQD+G P T+
Sbjct: 493  VPNTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TA 551

Query: 1791 SPLPASSVPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXXXXSGPT 1615
            SP P SS   TS+ NG   VE   KG Q+EN+K+KLKD NGDG M           +GPT
Sbjct: 552  SPAPVSSAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPT 611

Query: 1614 KEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXX 1435
            KEECII+FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+   +ARRALFE YV+T     
Sbjct: 612  KEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEE 671

Query: 1434 XXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNER 1255
                        EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNER
Sbjct: 672  RKEKRAAQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNER 731

Query: 1254 VLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKSVKHEDR 1075
            VLPLK+AAE+K Q  RAAA SSFKSMLR+ GDI  SSRWS+VKD LR+DPRYKSV+HEDR
Sbjct: 732  VLPLKRAAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDR 791

Query: 1074 EVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRK 895
            EV+FNEY+ EL                EQ+KL                   RVRLKVRRK
Sbjct: 792  EVIFNEYVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRK 851

Query: 894  EAVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYER 715
            EAV S+QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER
Sbjct: 852  EAVTSFQALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYER 911

Query: 714  SAREFRALLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRF 535
             A +FR LLAEVITAE AAQ T+DGK VL SWS AKR+LKPDPRYSKMPRK+RE++W+R 
Sbjct: 912  CAHDFRGLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPDPRYSKMPRKEREALWRRH 971

Query: 534  AEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 406
            AEE+QR+ K S  + E  H + K++ S D  R P+  R +  R
Sbjct: 972  AEEIQRKHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 1014


>gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis]
            gi|641834042|gb|KDO53045.1| hypothetical protein
            CISIN_1g002026mg [Citrus sinensis]
          Length = 857

 Score =  824 bits (2128), Expect = 0.0
 Identities = 438/769 (56%), Positives = 524/769 (68%), Gaps = 3/769 (0%)
 Frame = -2

Query: 2703 SASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPTPQGHWLQPPQVSGLQRPPYMP 2524
            SA LRP                HQ +YP Y S P +  +PQG  L+PPQ+      P++P
Sbjct: 92   SAGLRPSVPTPSAPSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLRPPQMGVRPWLPFLP 150

Query: 2523 YPTGLPAPFPLPVRGIXXXXXXXXXXXXXXXXXXXXXXXXXXXAGSGQPT--SSVGTQSP 2350
            YP   P+PFPLP  G+                           A  G     +S  T++P
Sbjct: 151  YPAAYPSPFPLPAHGMPNPSVSQIDAQPPGLSSVRTAAATSHSAIPGHQLVGTSGNTEAP 210

Query: 2349 PPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGE 2170
            P G D+ K+   + S+    + +E  D WTAHKT+TG VYYYN++TG+STYE+P+ FKGE
Sbjct: 211  PSGTDK-KEHVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGE 269

Query: 2169 PDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDS 1990
            PDKVPVQ TP+S E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+
Sbjct: 270  PDKVPVQPTPISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDT 329

Query: 1989 LQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDA 1810
            L+   +  N +  ++KGS  +SLS PAVNTGGR+A ALRTS    SSSALD++KKKLQD+
Sbjct: 330  LK-EQSVPNTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDS 388

Query: 1809 GMPVTSSPLPASSVPVTSDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXX 1633
            G P T+SP P SS   TS+ NG   VE   KG Q+EN+K+KLKD NGDG M         
Sbjct: 389  GTP-TASPAPVSSAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSED 447

Query: 1632 XXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVR 1453
              +GPTKEECII+FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+   +ARRALFE YV+
Sbjct: 448  GETGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVK 507

Query: 1452 TXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDRE 1273
            T                 EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE
Sbjct: 508  TRAEEERKEKRAAQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRE 567

Query: 1272 ALLNERVLPLKKAAEQKIQEQRAAAVSSFKSMLRDSGDINTSSRWSRVKDSLRNDPRYKS 1093
             LLNERVLPLK+AAE+K Q  RAAA SSFKSMLR+ GDI  SSRWS+VKD LR+DPRYKS
Sbjct: 568  LLLNERVLPLKRAAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKS 627

Query: 1092 VKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVR 913
            V+HEDREV+FNEY+ EL                EQ+KL                   RVR
Sbjct: 628  VRHEDREVIFNEYVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVR 687

Query: 912  LKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHV 733
            LKVRRKEAV S+QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+
Sbjct: 688  LKVRRKEAVTSFQALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHI 747

Query: 732  KTLYERSAREFRALLAEVITAETAAQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRE 553
            KTLYER A +FR LLAEVITAE AAQ T+DGK VL SWS AKR+LKP+PRYSKMPRK+RE
Sbjct: 748  KTLYERCAHDFRGLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKERE 807

Query: 552  SIWKRFAEEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 406
            ++W+R AEE+QR+ K S  + E  H + K++ S D  R P+  R +  R
Sbjct: 808  ALWRRHAEEIQRKHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 856

