BLASTX nr result

ID: Cinnamomum23_contig00001821 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00001821
         (4016 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010250268.1| PREDICTED: pre-mRNA-processing protein 40C i...  1041   0.0  
ref|XP_010906097.1| PREDICTED: pre-mRNA-processing protein 40C i...  1005   0.0  
ref|XP_010906099.1| PREDICTED: pre-mRNA-processing protein 40C i...  1002   0.0  
ref|XP_010906098.1| PREDICTED: pre-mRNA-processing protein 40C i...   983   0.0  
ref|XP_011624657.1| PREDICTED: pre-mRNA-processing protein 40C [...   936   0.0  
ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C i...   926   0.0  
ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C i...   913   0.0  
ref|XP_010906101.1| PREDICTED: pre-mRNA-processing protein 40C i...   908   0.0  
ref|XP_009388080.1| PREDICTED: pre-mRNA-processing protein 40C [...   904   0.0  
ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C i...   903   0.0  
ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C i...   892   0.0  
ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C i...   880   0.0  
ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [...   837   0.0  
gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium r...   833   0.0  
gb|KJB15269.1| hypothetical protein B456_002G167700 [Gossypium r...   833   0.0  
ref|XP_008221026.1| PREDICTED: pre-mRNA-processing protein 40C [...   832   0.0  
ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma c...   822   0.0  
ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-l...   819   0.0  
ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citr...   818   0.0  
gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sin...   818   0.0  

>ref|XP_010250268.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Nelumbo
            nucifera] gi|719963615|ref|XP_010250275.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X1 [Nelumbo
            nucifera]
          Length = 1088

 Score = 1041 bits (2691), Expect = 0.0
 Identities = 596/1101 (54%), Positives = 703/1101 (63%), Gaps = 19/1101 (1%)
 Frame = -1

Query: 3794 QSSIPGMTPQAPVSGPTVAPS--IQVSXXXXXXXXXXXXXXPNTSEPSNDSVRAKFVTTP 3621
            QSS  G+T QA   G    PS     S                T+EP+ +S+RAKF+T P
Sbjct: 8    QSSASGITAQASGLGQATGPSNPTVASPAPVSGPSNPKGPSGTTNEPAQESIRAKFITGP 67

Query: 3620 GFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFS 3441
            G+VVPAPSF YSVI               +PA+    P SA A QP +P QS  S P+FS
Sbjct: 68   GYVVPAPSFSYSVIPKQNTASGSSLENSSSPALVSNQPASATAFQPSIPGQSLSSGPTFS 127

Query: 3440 YNLISQPNVGSASGQQLQTGTVTGPGNI---QVGKFVPPNTAASLQPPVPGRP---NQFV 3279
            YN+I    +GS++ Q+LQ+ T  G G +   QVG   P  TAASLQPPVPG+P   N F 
Sbjct: 128  YNIIPPAKIGSSAQQKLQSSTDVGSGPLGHSQVGNSTPSTTAASLQPPVPGQPGHPNTFG 187

Query: 3278 PGTVPQNMPAPMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQ 3099
            PGT  Q M +   SP+SVPKG PSI          QL       Q   SSN+SAS AV +
Sbjct: 188  PGTGAQFMASQGPSPVSVPKGAPSIATSFSFNRIPQLA------QKDLSSNSSASVAVAR 241

Query: 3098 ETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXX 2919
            E GTV  ASSSS   ++P +VS SS +    +P++ P T+W                   
Sbjct: 242  EAGTVSPASSSSVPVSMPFHVSPSS-LAAATSPNLCPATLWMPVAPSFVPPPGMPITPGT 300

Query: 2918 XXXXXXXXXXXXXXA-NARPAAMDPSAS--LRPMXXXXXXXXXXXXXXVHQQLYPPYHSQ 2748
                                 AMD S+S  LRP+                QQ++ PY + 
Sbjct: 301  PGPPGIAPSTPLSSTVTVNSEAMDSSSSTSLRPVVPSTV----------QQQMHSPYPAL 350

Query: 2747 PAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXXXXXXXXXXXX 2568
            P+M PPPQG WL PPQ+ GLQRPP++PYP  LP  +PLP+RG                  
Sbjct: 351  PSMPPPPQGLWL-PPQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISP 409

Query: 2567 XXXXXXXPATA--GLAQPTSSVGTQS--PPPGIDQDKQSDGNTSTNGEIAKSEDADLWTA 2400
                   P+++   +  P+++ G Q   PPPG DQ K  D      G    ++  D WTA
Sbjct: 410  LGPPGGTPSSSVGSVHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNAK-VDAWTA 468

Query: 2399 HKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKK 2220
            HKTETG VYYYN+LTG+STYERPS F GEPDKV VQ TPVS EKLVGTDW LV+TNDGKK
Sbjct: 469  HKTETGVVYYYNALTGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWALVTTNDGKK 528

Query: 2219 YYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS-QNASFGMDKGSAPVSLSVPAVNT 2043
            YY+N+KTK+SSWQ+P+EV ELR++ D D+L+ +MT  QN+    +K SAP+S++ PA+NT
Sbjct: 529  YYYNSKTKISSWQVPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPISVTAPAINT 588

Query: 2042 GGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIA 1863
            GGREA +LR SG   SSSALD++KKKLQD+  P TSSPLP SS P  +DLNG  PVEA  
Sbjct: 589  GGREATSLRPSGVAGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNGSRPVEAAV 648

Query: 1862 KGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWE 1686
            KG QSEN K+K+KD NGDGN+            GP+KEECIIQFKEMLKERGVAPFSKWE
Sbjct: 649  KGLQSEN-KDKVKDINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERGVAPFSKWE 707

Query: 1685 KELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASE 1506
            KELPKI+FDPRFKAVPGY+ARRALFEHYVRT                 EGFKQLLEEASE
Sbjct: 708  KELPKIVFDPRFKAVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFKQLLEEASE 767

Query: 1505 DIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQRAAAVSSFK 1326
            DID +TDY +FK KWGSDPRFEALDRK+RE LLNERVLP+KKAAE+K Q  RAAA S FK
Sbjct: 768  DIDQRTDYQTFKMKWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIRAAAASGFK 827

Query: 1325 SMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXX 1146
            S+LR+ GDINTSSRWSRVKD LR+DPRYKSVKHEDRE+LFNEYISEL             
Sbjct: 828  SLLREKGDINTSSRWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADEEAEREAKV 887

Query: 1145 XXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTES 966
               E+DKL                   RVRLKV+RKEAVA YQALLVETIKDP+ SWTES
Sbjct: 888  KREEEDKLKEREREMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKDPQVSWTES 947

Query: 965  NPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDD 786
             P+LEKDPQGRA+N  LD  D EKLFREHVK LYER AREFR LL EVIT E A QMT+D
Sbjct: 948  RPRLEKDPQGRATNSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTEAASQMTND 1007

Query: 785  GKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQK-PSDSKEEKPHSEFK 609
            GK VLTSWS AKRLLK DPRYSKMPRK+RE++W+R A+E+  ++K  SD KEEK + E K
Sbjct: 1008 GKTVLTSWSTAKRLLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKEEKLNIETK 1067

Query: 608  NKISADSERSPAP-RRTHSRR 549
             + S DS RSP   RR+HSRR
Sbjct: 1068 ARSSLDSGRSPTGLRRSHSRR 1088


>ref|XP_010906097.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Elaeis
            guineensis]
          Length = 1097

 Score = 1005 bits (2598), Expect = 0.0
 Identities = 567/1092 (51%), Positives = 666/1092 (60%), Gaps = 15/1092 (1%)
 Frame = -1

Query: 3782 PGMTPQAPVSGPTVAPS-----IQVSXXXXXXXXXXXXXXPNTSEPSNDSVRAKFVTTPG 3618
            P  T   PV+ P+   S     + VS               N + P+ D VRAKF T+ G
Sbjct: 50   PATTAITPVTSPSFMDSGPSLTVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQG 109

Query: 3617 FVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFSY 3438
            FVVPAPSF Y V                +P ++ +PP  A ALQPPVP Q  G+ PSFSY
Sbjct: 110  FVVPAPSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSY 169

Query: 3437 NLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTV 3267
            N++S  N GSA+GQQ Q  T T   N+Q G+F PP TAASLQPPVP     P   VPG +
Sbjct: 170  NVVSNANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAI 229

Query: 3266 PQNMPAPMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQETGT 3087
              + PAPMQ P+S+P G                                 S AVV E GT
Sbjct: 230  TPSCPAPMQLPLSIPTG--------------------------------TSDAVVTEAGT 257

Query: 3086 VPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPM-TMWTQXXXXXXXXXXXXXXXXXXXX 2910
                S  SQS  L   V SSSS      P+      +                       
Sbjct: 258  SITTSIDSQSAQLSATVPSSSSTASGINPNANSSGILMPSTPSFTGHPGMPGLAGTPGLP 317

Query: 2909 XXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQP 2745
                         ++PA  +PS  LRPM                     QQ Y PY S P
Sbjct: 318  GIPNSATVSSTVTSQPAGTNPSP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLP 376

Query: 2744 AMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXXXXXXXXXXXXX 2565
               PPPQ  WL PPQ  GLQR P++PY   LPAPF LPV G                   
Sbjct: 377  GTIPPPQALWLHPPQAGGLQRAPFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTV 436

Query: 2564 XXXXXXPATAGLAQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTET 2385
                    T G +Q  S+VG +SP  GID +K ++ +   +GE  K+E+AD WTAHKTE+
Sbjct: 437  ANQGPASTTMGSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTES 495

Query: 2384 GAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNT 2205
            G VYYYNS+TG+STYERPSSF GEP+ V  QSTPVS EKL GT+W LV+TNDG+KYY++T
Sbjct: 496  GVVYYYNSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDT 555

Query: 2204 KTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAM 2025
            K KVSSWQ+P EV ELRK Q+SD+L+ +  +   +   DKGSAP+S+S PAV TGGR++M
Sbjct: 556  KNKVSSWQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSM 613

Query: 2024 ALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSE 1845
            ALRTSGA  SSSALD+VKKKLQDAG PVTSSP+P    PV SDLNG   VE   KGQQ  
Sbjct: 614  ALRTSGAAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGT 672

Query: 1844 NSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKI 1668
            NSK+K+KD   DGNM            GPTKEECI QFKEMLKERGVAPFSKWEKELPKI
Sbjct: 673  NSKDKVKD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKI 729

Query: 1667 LFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKT 1488
            +FDPRFKAVP Y+AR+ +FEH+VRT                 + FKQLLEEASE+IDHKT
Sbjct: 730  VFDPRFKAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKT 789

Query: 1487 DYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQRAAAVSSFKSMLRDG 1308
            DY +FKRKWGSDPRF  LDRK+RE LLNE+V    KAAE+K+Q  R AAV+SFKSMLRD 
Sbjct: 790  DYQTFKRKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDN 845

Query: 1307 GDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQD 1128
             DI T+SRWSRVK+ LRNDPRYK+VKHE+R  LFNEYISEL                EQ+
Sbjct: 846  KDITTTSRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQE 905

Query: 1127 KLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEK 948
            KL                   RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEK
Sbjct: 906  KLKEREREMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEK 965

Query: 947  DPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNVLT 768
            DPQGRA+NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE A Q TDDGK +L 
Sbjct: 966  DPQGRATNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILN 1025

Query: 767  SWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQKPSDSKEEKPHSEFKNKISADS 588
            SWSEAKRLLKPDPRYSKMP KDRE +W+R+A++M R+QKP+   +EKP ++ +N+ S+D 
Sbjct: 1026 SWSEAKRLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDF 1085

Query: 587  ERSPAPRRTHSR 552
             R  +PRR+H R
Sbjct: 1086 SRR-SPRRSHGR 1096


>ref|XP_010906099.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Elaeis
            guineensis]
          Length = 1055

 Score = 1002 bits (2590), Expect = 0.0
 Identities = 566/1091 (51%), Positives = 665/1091 (60%), Gaps = 14/1091 (1%)
 Frame = -1

Query: 3782 PGMTPQAPVSGPTVAPS-----IQVSXXXXXXXXXXXXXXPNTSEPSNDSVRAKFVTTPG 3618
            P  T   PV+ P+   S     + VS               N + P+ D VRAKF T+ G
Sbjct: 50   PATTAITPVTSPSFMDSGPSLTVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQG 109

Query: 3617 FVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFSY 3438
            FVVPAPSF Y V                +P ++ +PP  A ALQPPVP Q  G+ PSFSY
Sbjct: 110  FVVPAPSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSY 169

Query: 3437 NLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTV 3267
            N++S  N GSA+GQQ Q  T T   N+Q G+F PP TAASLQPPVP     P   VPG +
Sbjct: 170  NVVSNANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAI 229

Query: 3266 PQNMPAPMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQETGT 3087
              + PAPMQ P+S+P G                                 S AVV E GT
Sbjct: 230  TPSCPAPMQLPLSIPTG--------------------------------TSDAVVTEAGT 257

Query: 3086 VPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXXXXXX 2907
                S  SQS  L   V SSSS    ++                                
Sbjct: 258  SITTSIDSQSAQLSATVPSSSSTASVSST------------------------------- 286

Query: 2906 XXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPA 2742
                        ++PA  +PS  LRPM                     QQ Y PY S P 
Sbjct: 287  ----------VTSQPAGTNPSP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPG 335

Query: 2741 MAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXXXXXXXXXXXXXX 2562
              PPPQ  WL PPQ  GLQR P++PY   LPAPF LPV G                    
Sbjct: 336  TIPPPQALWLHPPQAGGLQRAPFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVA 395

Query: 2561 XXXXXPATAGLAQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETG 2382
                   T G +Q  S+VG +SP  GID +K ++ +   +GE  K+E+AD WTAHKTE+G
Sbjct: 396  NQGPASTTMGSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESG 454

Query: 2381 AVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTK 2202
             VYYYNS+TG+STYERPSSF GEP+ V  QSTPVS EKL GT+W LV+TNDG+KYY++TK
Sbjct: 455  VVYYYNSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTK 514

Query: 2201 TKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMA 2022
             KVSSWQ+P EV ELRK Q+SD+L+ +  +   +   DKGSAP+S+S PAV TGGR++MA
Sbjct: 515  NKVSSWQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMA 572

Query: 2021 LRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSEN 1842
            LRTSGA  SSSALD+VKKKLQDAG PVTSSP+P    PV SDLNG   VE   KGQQ  N
Sbjct: 573  LRTSGAAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTN 631

Query: 1841 SKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKIL 1665
            SK+K+KD   DGNM            GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+
Sbjct: 632  SKDKVKD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIV 688

Query: 1664 FDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTD 1485
            FDPRFKAVP Y+AR+ +FEH+VRT                 + FKQLLEEASE+IDHKTD
Sbjct: 689  FDPRFKAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTD 748

Query: 1484 YHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQRAAAVSSFKSMLRDGG 1305
            Y +FKRKWGSDPRF  LDRK+RE LLNE+V    KAAE+K+Q  R AAV+SFKSMLRD  
Sbjct: 749  YQTFKRKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNK 804

Query: 1304 DINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDK 1125
            DI T+SRWSRVK+ LRNDPRYK+VKHE+R  LFNEYISEL                EQ+K
Sbjct: 805  DITTTSRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEK 864

Query: 1124 LXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKD 945
            L                   RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKD
Sbjct: 865  LKEREREMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKD 924

Query: 944  PQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNVLTS 765
            PQGRA+NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE A Q TDDGK +L S
Sbjct: 925  PQGRATNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNS 984

Query: 764  WSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQKPSDSKEEKPHSEFKNKISADSE 585
            WSEAKRLLKPDPRYSKMP KDRE +W+R+A++M R+QKP+   +EKP ++ +N+ S+D  
Sbjct: 985  WSEAKRLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFS 1044

Query: 584  RSPAPRRTHSR 552
            R  +PRR+H R
Sbjct: 1045 RR-SPRRSHGR 1054


>ref|XP_010906098.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Elaeis
            guineensis]
          Length = 1066

 Score =  983 bits (2540), Expect = 0.0
 Identities = 559/1092 (51%), Positives = 658/1092 (60%), Gaps = 15/1092 (1%)
 Frame = -1

Query: 3782 PGMTPQAPVSGPTVAPS-----IQVSXXXXXXXXXXXXXXPNTSEPSNDSVRAKFVTTPG 3618
            P  T   PV+ P+   S     + VS               N + P+ D VRAKF T+ G
Sbjct: 50   PATTAITPVTSPSFMDSGPSLTVTVSTMTSVGPPPPRGVIVNANTPTQDPVRAKFATSQG 109

Query: 3617 FVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSGSVPSFSY 3438
            FVVPAPSF Y V                +P ++ +PP  A ALQPPVP Q  G+ PSFSY
Sbjct: 110  FVVPAPSFSYGVFPRVNSASGSAHQSSSSPGLRLSPPMPATALQPPVPGQFLGNRPSFSY 169

Query: 3437 NLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTV 3267
            N++S  N GSA+GQQ Q  T T   N+Q G+F PP TAASLQPPVP     P   VPG +
Sbjct: 170  NVVSNANAGSATGQQFQLTTATNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAI 229

Query: 3266 PQNMPAPMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQETGT 3087
              + PAPMQ P+S+P G                                 S AVV E GT
Sbjct: 230  TPSCPAPMQLPLSIPTG--------------------------------TSDAVVTEAGT 257

Query: 3086 VPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPM-TMWTQXXXXXXXXXXXXXXXXXXXX 2910
                S  SQS  L   V SSSS      P+      +                       
Sbjct: 258  SITTSIDSQSAQLSATVPSSSSTASGINPNANSSGILMPSTPSFTGHPGMPGLAGTPGLP 317

Query: 2909 XXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQP 2745
                         ++PA  +PS  LRPM                     QQ Y PY S P
Sbjct: 318  GIPNSATVSSTVTSQPAGTNPSP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLP 376

Query: 2744 AMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXXXXXXXXXXXXX 2565
               PPPQ  WL PPQ  GLQR P++PY      P                          
Sbjct: 377  GTIPPPQALWLHPPQAGGLQRAPFLPYSVANQGP-------------------------- 410

Query: 2564 XXXXXXPATAGLAQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTET 2385
                    T G +Q  S+VG +SP  GID +K ++ +   +GE  K+E+AD WTAHKTE+
Sbjct: 411  -----ASTTMGSSQSGSNVGIESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTES 464

Query: 2384 GAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNT 2205
            G VYYYNS+TG+STYERPSSF GEP+ V  QSTPVS EKL GT+W LV+TNDG+KYY++T
Sbjct: 465  GVVYYYNSVTGESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDT 524

Query: 2204 KTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAM 2025
            K KVSSWQ+P EV ELRK Q+SD+L+ +  +   +   DKGSAP+S+S PAV TGGR++M
Sbjct: 525  KNKVSSWQVPAEVLELRKSQESDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSM 582

Query: 2024 ALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSE 1845
            ALRTSGA  SSSALD+VKKKLQDAG PVTSSP+P    PV SDLNG   VE   KGQQ  
Sbjct: 583  ALRTSGAAVSSSALDLVKKKLQDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGT 641

Query: 1844 NSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKI 1668
            NSK+K+KD   DGNM            GPTKEECI QFKEMLKERGVAPFSKWEKELPKI
Sbjct: 642  NSKDKVKD---DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKI 698

Query: 1667 LFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKT 1488
            +FDPRFKAVP Y+AR+ +FEH+VRT                 + FKQLLEEASE+IDHKT
Sbjct: 699  VFDPRFKAVPSYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKT 758

Query: 1487 DYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQRAAAVSSFKSMLRDG 1308
            DY +FKRKWGSDPRF  LDRK+RE LLNE+V    KAAE+K+Q  R AAV+SFKSMLRD 
Sbjct: 759  DYQTFKRKWGSDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDN 814

Query: 1307 GDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQD 1128
             DI T+SRWSRVK+ LRNDPRYK+VKHE+R  LFNEYISEL                EQ+
Sbjct: 815  KDITTTSRWSRVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQE 874

Query: 1127 KLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEK 948
            KL                   RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEK
Sbjct: 875  KLKEREREMRKRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEK 934

Query: 947  DPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNVLT 768
            DPQGRA+NPDL + D EKLFR+HVK LYER AR FR LL+EVITAE A Q TDDGK +L 
Sbjct: 935  DPQGRATNPDLGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILN 994

Query: 767  SWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQKPSDSKEEKPHSEFKNKISADS 588
            SWSEAKRLLKPDPRYSKMP KDRE +W+R+A++M R+QKP+   +EKP ++ +N+ S+D 
Sbjct: 995  SWSEAKRLLKPDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDF 1054

Query: 587  ERSPAPRRTHSR 552
             R  +PRR+H R
Sbjct: 1055 SRR-SPRRSHGR 1065


>ref|XP_011624657.1| PREDICTED: pre-mRNA-processing protein 40C [Amborella trichopoda]
          Length = 1085

 Score =  936 bits (2419), Expect = 0.0
 Identities = 545/1110 (49%), Positives = 667/1110 (60%), Gaps = 28/1110 (2%)
 Frame = -1

Query: 3794 QSSIPGMTPQAPVSGPTVAPSIQVSXXXXXXXXXXXXXXPNTSEPSND----SVRAKFVT 3627
            Q S PG+ PQ    G T                      P    P ND    SVRAKFV 
Sbjct: 12   QPSAPGVPPQPLTPGQTTTGG-------------PPGPSPPIPRPQNDQPQESVRAKFVA 58

Query: 3626 TPGFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTP---PTSAAALQPPVPRQSSGS 3456
            +PG+++PAPSF Y V+                P     P   P SA ++QPPVP  S+ S
Sbjct: 59   SPGYILPAPSFSYGVVSQNNNA----------PRASLPPQSTPLSAVSVQPPVPGHSATS 108

Query: 3455 VPSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGR------ 3294
              SFSY++ S     SA          T    +Q GK   P +AASLQPPVPG+      
Sbjct: 109  GASFSYSVASHATTTSA----------TASNPMQGGKPAGPTSAASLQPPVPGQSSVSVH 158

Query: 3293 PNQFVPGTVPQNMPAPMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKHSSNTSAS 3114
            PN + P    QN  A  + P  V KG PS              V++E  Q   +SN+ AS
Sbjct: 159  PNSWDPERPVQNALAQARPPFLVRKGPPSTSGFSFSGNSQS--VSSEDSQKHQASNSDAS 216

Query: 3113 AAVVQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVY--PMTMWTQXXXXXXXXXX 2940
            AAV QE  T   +SS++Q+T LP   SS++S  V ++P+ Y  P  M             
Sbjct: 217  AAVAQEAKTSQPSSSTAQTTPLPA-PSSTTSRPVSSSPNTYATPFYMPKAPPFPGPPRLP 275

Query: 2939 XXXXXXXXXXXXXXXXXXXXXANARPAAMDP-SASLRPMXXXXXXXXXXXXXXVHQQ--- 2772
                                  N RP+ +D  SA +RP                  Q   
Sbjct: 276  VTPGTPGPPGIALSAPQLSSSVNIRPSVIDTNSAIMRPNIASSAPGTSNAASVPITQTAQ 335

Query: 2771 --LYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXX 2598
              +Y PY + P + PPPQ  W+ P Q+ GLQRPP++PYP   P PFP+P+R         
Sbjct: 336  PPIYSPYPTLPGVVPPPQAMWMHPSQMGGLQRPPFLPYPGTFPGPFPMPLRPITVPPVAM 395

Query: 2597 XXXXXXXXXXXXXXXXXPATA--GLAQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKS 2424
                                A  G     +    QSPPPGID++K +   T+ +     +
Sbjct: 396  PDSSQPPGVSPIGPPGGIPLADHGAGIQVTISEEQSPPPGIDKEKDTIDYTNKDDNAVSN 455

Query: 2423 EDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVL 2244
            ED D WTAHKT+TGAVYYYN+LTG+STYE+P  FKGE DKV +Q TPVS EKLVGTDW L
Sbjct: 456  EDTDQWTAHKTDTGAVYYYNALTGESTYEKPPGFKGEVDKVILQRTPVSWEKLVGTDWAL 515

Query: 2243 VSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDS-LQTSMTSQNASFGMDKGSAPVS 2067
            V+TNDGKKYY+NTK+K+SSWQ+P EVAELRK+Q++D+ L+ +   QNA    DKGS   S
Sbjct: 516  VATNDGKKYYYNTKSKISSWQVPPEVAELRKKQEADAALKANAPVQNAGISSDKGSVSSS 575

Query: 2066 LSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASS-VPVISDLN 1890
            LS PA+NTGGREAM  +++ A  SSSALD++KKKLQD+GMPVTSS LP+S+ VP  SD N
Sbjct: 576  LSAPAINTGGREAMTFKSATAPVSSSALDLIKKKLQDSGMPVTSSALPSSTPVPTTSDAN 635

Query: 1889 GLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKER 1713
            G   V+   KGQQSENSK+KLK A   G++            GPTKEEC+IQFKEMLKE+
Sbjct: 636  GQRVVDTTVKGQQSENSKDKLKVAQEVGHVSDSSSDSEDVDSGPTKEECVIQFKEMLKEK 695

Query: 1712 GVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1533
            G+APFSKWEKELPKILFDPRFKA+PGYT RR+LFEH+VRT                 EGF
Sbjct: 696  GIAPFSKWEKELPKILFDPRFKAIPGYTERRSLFEHFVRTRAEEERKEKRAAQKAAIEGF 755

Query: 1532 KQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQ 1353
            KQLLE ASEDI+HKTDY +FK+KWG DPRF ALDRK+RE LLNERVLP++KA E+K Q  
Sbjct: 756  KQLLEGASEDINHKTDYETFKKKWGYDPRFVALDRKEREMLLNERVLPLRKAVEEKTQAI 815

Query: 1352 RAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXX 1173
            RAAAV+SFKSML +  DIN  SRWS+VKD LRNDPRYKSVKHEDREVLF EYISEL    
Sbjct: 816  RAAAVASFKSMLHEKVDINIGSRWSKVKDSLRNDPRYKSVKHEDREVLFLEYISELKAAE 875

Query: 1172 XXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIK 993
                        E++KL                   RVR K RRK+AV SYQALL E IK
Sbjct: 876  QEADRAAKAKREEEEKLKERERELRKRKEREEQEVERVRQKARRKDAVVSYQALLTERIK 935

Query: 992  DPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITA 813
            DPKASWTES PKLEKDP GRA+NP+L+ AD EKLFREHVK L ER AREFR+LLAEVIT 
Sbjct: 936  DPKASWTESKPKLEKDPLGRATNPELEPADMEKLFREHVKVLNERCAREFRSLLAEVITP 995

Query: 812  ETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQK-PSDSK 636
            E A Q ++DGK +L SWS AK+LL+PDPRY KMPR++RES+W+R+A++M RRQ+  S+ K
Sbjct: 996  EAAAQASEDGKTLLNSWSTAKKLLRPDPRYEKMPRRERESLWQRYAEDMDRRQRAASEQK 1055

Query: 635  EEKPHSEFKNKISADSER-SPAPRRTHSRR 549
            EEK + +  ++  A S + SP+ RR+H R+
Sbjct: 1056 EEKTNIDDPSRRPAGSSKSSPSVRRSHGRK 1085


>ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Vitis
            vinifera] gi|297738259|emb|CBI27460.3| unnamed protein
            product [Vitis vinifera]
          Length = 1046

 Score =  926 bits (2394), Expect = 0.0
 Identities = 535/1097 (48%), Positives = 645/1097 (58%), Gaps = 7/1097 (0%)
 Frame = -1

Query: 3818 IKLIKMSSQSSIPGMTPQAPVSGPTVAPSIQVSXXXXXXXXXXXXXXPNTSEPSNDSVRA 3639
            +++   +SQ+ + G+    P  GP   P+  ++                 S    +S + 
Sbjct: 9    VEVQSSASQNPVTGLPAGGPSGGPPT-PTGAIAPASVATIRTSEGASGTASNSIQESAQG 67

Query: 3638 KFVTTPGFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSG 3459
            KFV  P  V+P PSF YS I                  +   P  S    Q PVP  SS 
Sbjct: 68   KFVNAPPHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTPVPGPSSS 127

Query: 3458 SVPSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGRPNQFV 3279
            S PSFSYN I+    G    Q  Q+ T     +I  G   P   AAS             
Sbjct: 128  SGPSFSYN-IAHKGAGFPGSQPFQSST-----SIASGPRGPTPNAASFS----------- 170

Query: 3278 PGTVPQNMPAPMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQ 3099
                                G+P +                +  Q   S N   S AV Q
Sbjct: 171  ------------------FNGNPQL---------------VQKDQTLKSDN---SGAVAQ 194

Query: 3098 ETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXX 2919
            E G++ +AS  SQS   P    SSS+M V ++P + P T+W                   
Sbjct: 195  EAGSMSSASHVSQSVPFP---CSSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGT 251

Query: 2918 XXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAM 2739
                            A P+A    +S                  + QQ+YP Y S PA 
Sbjct: 252  PGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPAT 311

Query: 2738 APPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXXXXXXXXXXXXXXX 2559
                QG WLQPPQ+ GL RPP++PYP   P PFPLP  G                     
Sbjct: 312  NASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGT 371

Query: 2558 XXXXPATAGLA----QPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKT 2391
                P +A ++      TS + ++ PPPGID +K  +G  + +G  A +E  D WTAHKT
Sbjct: 372  AGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG-AAVNEQVDAWTAHKT 430

Query: 2390 ETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYH 2211
            +TG VYYYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+
Sbjct: 431  DTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYY 490

Query: 2210 NTKTKVSSWQLPVEVAELRKRQDSDSL-QTSMTSQNASFGMDKGSAPVSLSVPAVNTGGR 2034
            NTKTK+SSWQ+P E+ E+RK+QDS +L + +M + N +   +KG +P++LS PAV TGGR
Sbjct: 491  NTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGR 550

Query: 2033 EAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQ 1854
            +A  LRTS    S+SALDM+KKKLQD+G P TSSP+  SS P+ S+LNG   +E   KG 
Sbjct: 551  DATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPV-HSSGPIASELNGSRVIEPTVKGL 609

Query: 1853 QSENSKEKLKDANGDGNM-XXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKEL 1677
            QSENSK+KLKD NGDGNM           SGPTKEECIIQFKEMLKERGVAPFSKWEKEL
Sbjct: 610  QSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKEL 669

Query: 1676 PKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDID 1497
            PKI+FDPRFKA+PGY+ARR+LFEHYVRT                 EGFKQLLEEASEDID
Sbjct: 670  PKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDID 729

Query: 1496 HKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQRAAAVSSFKSML 1317
            HKT+Y +F++KWG DPRFEALDRKDRE LLNERVLP+K+AAE+K Q  RAAAVSSFKSML
Sbjct: 730  HKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSML 789

Query: 1316 RDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXX 1137
            RD GDI TS+RWSRVKD LRNDPRYK VKHEDRE+LFNEYISEL                
Sbjct: 790  RDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKE 849

Query: 1136 EQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPK 957
            EQDKL                   RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PK
Sbjct: 850  EQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPK 909

Query: 956  LEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKN 777
            LEKDPQ RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK 
Sbjct: 910  LEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKT 969

Query: 776  VLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQKPSDSKEEKPHSEFKNKIS 597
            VLTSWS AKRLL+ D RY KMPRKDRES+W+R+++EM R+QK +  + E+ H+E K + S
Sbjct: 970  VLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSS 1029

Query: 596  ADSERSPA-PRRTHSRR 549
             DS R P+  RR H RR
Sbjct: 1030 VDSGRFPSGSRRAHERR 1046


>ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Nelumbo
            nucifera]
          Length = 894

 Score =  913 bits (2359), Expect = 0.0
 Identities = 514/908 (56%), Positives = 601/908 (66%), Gaps = 11/908 (1%)
 Frame = -1

Query: 3239 SPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQ 3060
            SP+SVPKG PSI          QL       Q   SSN+SAS AV +E GTV  ASSSS 
Sbjct: 7    SPVSVPKGAPSIATSFSFNRIPQLA------QKDLSSNSSASVAVAREAGTVSPASSSSV 60

Query: 3059 STALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2880
              ++P +VS SS +    +P++ P T+W                                
Sbjct: 61   PVSMPFHVSPSS-LAAATSPNLCPATLWMPVAPSFVPPPGMPITPGTPGPPGIAPSTPLS 119

Query: 2879 XA-NARPAAMDPSAS--LRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQ 2709
                    AMD S+S  LRP+                QQ++ PY + P+M PPPQG WL 
Sbjct: 120  STVTVNSEAMDSSSSTSLRPVVPSTV----------QQQMHSPYPALPSMPPPPQGLWL- 168

Query: 2708 PPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXXXXXXXXXXXXXXXXXXXPATA-- 2535
            PPQ+ GLQRPP++PYP  LP  +PLP+RG                         P+++  
Sbjct: 169  PPQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISPLGPPGGTPSSSVG 228

Query: 2534 GLAQPTSSVGTQS--PPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNS 2361
             +  P+++ G Q   PPPG DQ K  D      G    ++  D WTAHKTETG VYYYN+
Sbjct: 229  SVHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNAK-VDAWTAHKTETGVVYYYNA 287

Query: 2360 LTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQ 2181
            LTG+STYERPS F GEPDKV VQ TPVS EKLVGTDW LV+TNDGKKYY+N+KTK+SSWQ
Sbjct: 288  LTGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWALVTTNDGKKYYYNSKTKISSWQ 347

Query: 2180 LPVEVAELRKRQDSDSLQTSMTS-QNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGA 2004
            +P+EV ELR++ D D+L+ +MT  QN+    +K SAP+S++ PA+NTGGREA +LR SG 
Sbjct: 348  VPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPISVTAPAINTGGREATSLRPSGV 407

Query: 2003 MASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSENSKEKLK 1824
              SSSALD++KKKLQD+  P TSSPLP SS P  +DLNG  PVEA  KG QSEN K+K+K
Sbjct: 408  AGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNGSRPVEAAVKGLQSEN-KDKVK 466

Query: 1823 DANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFK 1647
            D NGDGN+            GP+KEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFK
Sbjct: 467  DINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFK 526

Query: 1646 AVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKR 1467
            AVPGY+ARRALFEHYVRT                 EGFKQLLEEASEDID +TDY +FK 
Sbjct: 527  AVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFKQLLEEASEDIDQRTDYQTFKM 586

Query: 1466 KWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQRAAAVSSFKSMLRDGGDINTSS 1287
            KWGSDPRFEALDRK+RE LLNERVLP+KKAAE+K Q  RAAA S FKS+LR+ GDINTSS
Sbjct: 587  KWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIRAAAASGFKSLLREKGDINTSS 646

Query: 1286 RWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXX 1107
            RWSRVKD LR+DPRYKSVKHEDRE+LFNEYISEL                E+DKL     
Sbjct: 647  RWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADEEAEREAKVKREEEDKLKERER 706

Query: 1106 XXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRAS 927
                          RVRLKV+RKEAVA YQALLVETIKDP+ SWTES P+LEKDPQGRA+
Sbjct: 707  EMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKDPQVSWTESRPRLEKDPQGRAT 766

Query: 926  NPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNVLTSWSEAKR 747
            N  LD  D EKLFREHVK LYER AREFR LL EVIT E A QMT+DGK VLTSWS AKR
Sbjct: 767  NSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTEAASQMTNDGKTVLTSWSTAKR 826

Query: 746  LLKPDPRYSKMPRKDRESIWKRFADEMQRRQK-PSDSKEEKPHSEFKNKISADSERSPAP 570
            LLK DPRYSKMPRK+RE++W+R A+E+  ++K  SD KEEK + E K + S DS RSP  
Sbjct: 827  LLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKEEKLNIETKARSSLDSGRSPTG 886

Query: 569  -RRTHSRR 549
             RR+HSRR
Sbjct: 887  LRRSHSRR 894


>ref|XP_010906101.1| PREDICTED: pre-mRNA-processing protein 40C isoform X5 [Elaeis
            guineensis]
          Length = 916

 Score =  908 bits (2347), Expect = 0.0
 Identities = 510/951 (53%), Positives = 595/951 (62%), Gaps = 10/951 (1%)
 Frame = -1

Query: 3374 TGPGNIQVGKFVPPNTAASLQPPVPGR---PNQFVPGTVPQNMPAPMQSPISVPKGHPSI 3204
            T   N+Q G+F PP TAASLQPPVP     P   VPG +  + PAPMQ P+S+P G    
Sbjct: 10   TNQANLQGGRFAPPTTAASLQPPVPRPSICPGANVPGAITPSCPAPMQLPLSIPTG---- 65

Query: 3203 XXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTALPVYVSSSS 3024
                                         S AVV E GT    S  SQS  L   V SSS
Sbjct: 66   ----------------------------TSDAVVTEAGTSITTSIDSQSAQLSATVPSSS 97

Query: 3023 SMIVPAAPSVYPM-TMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDP 2847
            S      P+      +                                    ++PA  +P
Sbjct: 98   STASGINPNANSSGILMPSTPSFTGHPGMPGLAGTPGLPGIPNSATVSSTVTSQPAGTNP 157

Query: 2846 SASLRPMXXXXXXXXXXXXXXV-----HQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQR 2682
            S  LRPM                     QQ Y PY S P   PPPQ  WL PPQ  GLQR
Sbjct: 158  SP-LRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPGTIPPPQALWLHPPQAGGLQR 216

Query: 2681 PPYMPYPTGLPAPFPLPVRGXXXXXXXXXXXXXXXXXXXXXXXXXPATAGLAQPTSSVGT 2502
             P++PY   LPAPF LPV G                           T G +Q  S+VG 
Sbjct: 217  APFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVANQGPASTTMGSSQSGSNVGI 276

Query: 2501 QSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSF 2322
            +SP  GID +K ++ +   +GE  K+E+AD WTAHKTE+G VYYYNS+TG+STYERPSSF
Sbjct: 277  ESPSVGIDHEKHAN-DPHKDGESTKNEEADAWTAHKTESGVVYYYNSVTGESTYERPSSF 335

Query: 2321 KGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQD 2142
             GEP+ V  QSTPVS EKL GT+W LV+TNDG+KYY++TK KVSSWQ+P EV ELRK Q+
Sbjct: 336  NGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVSSWQVPAEVLELRKSQE 395

Query: 2141 SDSLQTSMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKL 1962
            SD+L+ +  +   +   DKGSAP+S+S PAV TGGR++MALRTSGA  SSSALD+VKKKL
Sbjct: 396  SDALKGN--ANQLTNVADKGSAPISMSAPAVETGGRDSMALRTSGAAVSSSALDLVKKKL 453

Query: 1961 QDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXX 1782
            QDAG PVTSSP+P    PV SDLNG   VE   KGQQ  NSK+K+KD   DGNM      
Sbjct: 454  QDAGTPVTSSPVPTPG-PVASDLNGSKAVETAPKGQQGTNSKDKVKD---DGNMSDSSSD 509

Query: 1781 XXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEH 1605
                  GPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPRFKAVP Y+AR+ +FEH
Sbjct: 510  SDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAVPSYSARKTIFEH 569

Query: 1604 YVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRK 1425
            +VRT                 + FKQLLEEASE+IDHKTDY +FKRKWGSDPRF  LDRK
Sbjct: 570  FVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTFKRKWGSDPRFGVLDRK 629

Query: 1424 DREALLNERVLPIKKAAEQKIQEQRAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPR 1245
            +RE LLNE+V    KAAE+K+Q  R AAV+SFKSMLRD  DI T+SRWSRVK+ LRNDPR
Sbjct: 630  ERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITTTSRWSRVKENLRNDPR 685

Query: 1244 YKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXX 1065
            YK+VKHE+R  LFNEYISEL                EQ+KL                   
Sbjct: 686  YKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKEREREMRKRKEREEQEME 745

Query: 1064 RVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFR 885
            RVRLKVRRKEAVASYQALLVETIKDPKASWTES PKLEKDPQGRA+NPDL + D EKLFR
Sbjct: 746  RVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGRATNPDLGQGDAEKLFR 805

Query: 884  EHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRK 705
            +HVK LYER AR FR LL+EVITAE A Q TDDGK +L SWSEAKRLLKPDPRYSKMP K
Sbjct: 806  DHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEAKRLLKPDPRYSKMPGK 865

Query: 704  DRESIWKRFADEMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSR 552
            DRE +W+R+A++M R+QKP+   +EKP ++ +N+ S+D  R  +PRR+H R
Sbjct: 866  DREYLWRRYAEDMMRKQKPASDPKEKPDTDGRNRTSSDFSRR-SPRRSHGR 915


>ref|XP_009388080.1| PREDICTED: pre-mRNA-processing protein 40C [Musa acuminata subsp.
            malaccensis]
          Length = 1128

 Score =  904 bits (2335), Expect = 0.0
 Identities = 522/1062 (49%), Positives = 642/1062 (60%), Gaps = 23/1062 (2%)
 Frame = -1

Query: 3665 EPSNDSVRAKFVTTPGFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQ 3486
            + S DS+RAKF + PGFVV APSF Y VI               +  +K TPP  AAALQ
Sbjct: 87   DTSQDSIRAKFSSPPGFVVAAPSFSYGVIPRTNLTSGNPQQSSSS-GLKLTPPVPAAALQ 145

Query: 3485 PPVPRQSSGSVPSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPP 3306
            PPVP Q  G+ P F YN++S  NV  A+GQQ+Q  TV    ++Q GKF+PP+ A+SLQPP
Sbjct: 146  PPVPGQFLGTRP-FPYNVVSHANVVPAAGQQIQLNTVPVQAHLQGGKFIPPS-ASSLQPP 203

Query: 3305 VPG---RPNQFVPGTVPQNMPAPMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKH 3135
            VP    RP  F PG V    P+PMQ P+SVP+G             +Q   A +      
Sbjct: 204  VPRQPVRPTPFGPGAVSLISPSPMQFPLSVPQGDAIKQTNFSFSGHNQFSTAEKDETILS 263

Query: 3134 SSNTSASAAVVQETG---TVPAASSSSQSTALPVYVSS---------SSSMIVPAAPSVY 2991
            S   ++ A  V+ T    T+  + S   S ++P+  S+         ++SM++PAAPS  
Sbjct: 264  SEKCTSDAVAVETTSDSSTLVNSQSVQTSQSMPLGTSTGLGINANACAASMLIPAAPSFT 323

Query: 2990 PMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARP-----AAMDPSASLRPM 2826
                                                  ++ RP     AA+ P+++  P+
Sbjct: 324  AHAEMPNARGIPGLTGNSSSATASTGATIKPTPTNSSISSPRPIIPVTAALPPTSTSVPV 383

Query: 2825 XXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPA 2646
                            QQ    Y SQP MAP PQ  W  PPQ   +Q   + PYP   PA
Sbjct: 384  PFPVPQNV-------QQQTNVHYSSQPTMAPSPQASWSHPPQAGPMQHVSFSPYPGFFPA 436

Query: 2645 PFPLPVRGXXXXXXXXXXXXXXXXXXXXXXXXXPATAGLAQPTSSVGTQSPPPGIDQDKQ 2466
            PF LPV+G                           TAG  QP SS+  +S    +DQDK+
Sbjct: 437  PFSLPVQGIPPAVPLPFIQPPGVSLMVSQVEPTAVTAGSLQPGSSMVAESSSSVVDQDKK 496

Query: 2465 SDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQST 2286
            S+      G+ + +E  + WTAHKTETGAVYYYNS+TG+STY++PS+FKGE +K   QS 
Sbjct: 497  SNNLDKDEGDTS-NELENAWTAHKTETGAVYYYNSITGKSTYQKPSNFKGESEKATTQSN 555

Query: 2285 PVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS-Q 2109
             VS EKL GTDW +V+T+DG+KYY++TK KVSSW +P EVAELRK Q+S S + S T  Q
Sbjct: 556  AVSWEKLAGTDWTIVTTSDGRKYYYDTKNKVSSWHVPAEVAELRKNQESGSTEGSATQLQ 615

Query: 2108 NASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSP 1929
            +AS   DK SAP +++ PA   G  ++MALR+SGA  SSSALDMVKKKLQ+AG P+TS  
Sbjct: 616  DASTQGDKVSAPANIAAPAAQIGAHDSMALRSSGAPVSSSALDMVKKKLQEAGTPMTSPH 675

Query: 1928 LPASSVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKE 1752
              ++SVP  SD NGL   EA+AKG  +   K+K KDANG+GNM            GP+KE
Sbjct: 676  --STSVPATSDANGLKATEAVAKGVIN---KDKAKDANGEGNMSDSSSDSDDEESGPSKE 730

Query: 1751 ECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXX 1572
            ECIIQFKEMLKERGVAPFSKW+KELPKI+FDPRFKAVP  +ARRALFEHYVRT       
Sbjct: 731  ECIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAVPSQSARRALFEHYVRTRAEEERK 790

Query: 1571 XXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVL 1392
                      + FKQLLEEA EDIDHKTDYHSFKRKWG DPRFEA+DRK+RE LLNE+V 
Sbjct: 791  EKRAAQKAALDAFKQLLEEALEDIDHKTDYHSFKRKWGGDPRFEAIDRKERELLLNEKV- 849

Query: 1391 PIKKAAEQKIQEQRAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREV 1212
               KAA++K++  R AA +SFKSMLRD  DI TSSRWSR+K+ LR+DPRYK+VKHE RE 
Sbjct: 850  ---KAADEKMRALRMAAATSFKSMLRDNRDITTSSRWSRIKESLRDDPRYKAVKHEQRET 906

Query: 1211 LFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEA 1032
            LFNEYI+EL                EQDKL                   RV+LKVRRKEA
Sbjct: 907  LFNEYIAELKSAVDEVERSAKAKRDEQDKLKERERELRKRKEREEKEMERVKLKVRRKEA 966

Query: 1031 VASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSA 852
              SY+ LLVE IKDPKASWTES PKLEKDPQGRA+NPDL + D EKLFREHVK LYER  
Sbjct: 967  EYSYRTLLVEMIKDPKASWTESKPKLEKDPQGRATNPDLTQEDAEKLFREHVKDLYERCV 1026

Query: 851  REFRALLAEVITAETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAD 672
             +FR LLAEV+T E A    DDGK VL SWSEAK LLKPDPRYSKMP KDRES+W+R  +
Sbjct: 1027 NDFRTLLAEVVTVEAAAAKNDDGKTVLNSWSEAKLLLKPDPRYSKMPSKDRESLWRRHTE 1086

Query: 671  EMQRRQKPSDSKEEKPHSEFKNKISADSE-RSPAPRRTHSRR 549
            +M RR K     +E P +  +N++S+ ++    +P R+H RR
Sbjct: 1087 DMLRRPKSVSDTKESPGTNGRNRMSSAADPLKRSPGRSHRRR 1128


>ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Vitis
            vinifera]
          Length = 1013

 Score =  903 bits (2334), Expect = 0.0
 Identities = 507/975 (52%), Positives = 605/975 (62%), Gaps = 7/975 (0%)
 Frame = -1

Query: 3452 PSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQVGKFVPPNTAASLQPPVPGRPNQFVPG 3273
            PSFSY+ I      S + QQL +G+V            P  +    Q PVPG        
Sbjct: 80   PSFSYSGIPHVTTASGTSQQLPSGSVISSN--------PLASTVVFQTPVPG-------- 123

Query: 3272 TVPQNMPAPMQSPISVPKGHPSIXXXXXXXXXSQLPVAAESPQNKHSSNTSASAAVVQET 3093
              P +   P  S     KG                  A         S+T  S AV QE 
Sbjct: 124  --PSSSSGPSFSYNIAHKG------------------AGFPGSQPFQSSTDNSGAVAQEA 163

Query: 3092 GTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXXXX 2913
            G++ +AS  SQS   P    SSS+M V ++P + P T+W                     
Sbjct: 164  GSMSSASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPG 220

Query: 2912 XXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAP 2733
                          A P+A    +S                  + QQ+YP Y S PA   
Sbjct: 221  PPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNA 280

Query: 2732 PPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXXXXXXXXXXXXXXXXX 2553
              QG WLQPPQ+ GL RPP++PYP   P PFPLP  G                       
Sbjct: 281  SSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAG 340

Query: 2552 XXPATAGLA----QPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTET 2385
              P +A ++      TS + ++ PPPGID +K  +G  + +G  A +E  D WTAHKT+T
Sbjct: 341  GTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA-AVNEQVDAWTAHKTDT 399

Query: 2384 GAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNT 2205
            G VYYYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+NT
Sbjct: 400  GVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNT 459

Query: 2204 KTKVSSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKGSAPVSLSVPAVNTGGREA 2028
            KTK+SSWQ+P E+ E+RK+QDS +L+  +M + N +   +KG +P++LS PAV TGGR+A
Sbjct: 460  KTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDA 519

Query: 2027 MALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQS 1848
              LRTS    S+SALDM+KKKLQD+G P TSSP+ +S  P+ S+LNG   +E   KG QS
Sbjct: 520  TPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIASELNGSRVIEPTVKGLQS 578

Query: 1847 ENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPK 1671
            ENSK+KLKD NGDGNM            GPTKEECIIQFKEMLKERGVAPFSKWEKELPK
Sbjct: 579  ENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPK 638

Query: 1670 ILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHK 1491
            I+FDPRFKA+PGY+ARR+LFEHYVRT                 EGFKQLLEEASEDIDHK
Sbjct: 639  IVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHK 698

Query: 1490 TDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQRAAAVSSFKSMLRD 1311
            T+Y +F++KWG DPRFEALDRKDRE LLNERVLP+K+AAE+K Q  RAAAVSSFKSMLRD
Sbjct: 699  TEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRD 758

Query: 1310 GGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQ 1131
             GDI TS+RWSRVKD LRNDPRYK VKHEDRE+LFNEYISEL                EQ
Sbjct: 759  KGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQ 818

Query: 1130 DKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLE 951
            DKL                   RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PKLE
Sbjct: 819  DKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLE 878

Query: 950  KDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNVL 771
            KDPQ RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK VL
Sbjct: 879  KDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVL 938

Query: 770  TSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQKPSDSKEEKPHSEFKNKISAD 591
            TSWS AKRLL+ D RY KMPRKDRES+W+R+++EM R+QK +  + E+ H+E K + S D
Sbjct: 939  TSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVD 998

Query: 590  SERSPA-PRRTHSRR 549
            S R P+  RR H RR
Sbjct: 999  SGRFPSGSRRAHERR 1013



 Score = 66.6 bits (161), Expect = 2e-07
 Identities = 75/307 (24%), Positives = 106/307 (34%), Gaps = 30/307 (9%)
 Frame = -1

Query: 3818 IKLIKMSSQSSIPGMTPQAPVSGPTVAPSIQVSXXXXXXXXXXXXXXPNTSEPSNDSVRA 3639
            +++   +SQ+ + G+    P  GP   P+  ++                 S    +S + 
Sbjct: 9    VEVQSSASQNPVTGLPAGGPSGGPPT-PTGAIAPASVATIRTSEGASGTASNSIQESAQG 67

Query: 3638 KFVTTPGFVVPAPSFQYSVIXXXXXXXXXXXXXXXAPAVKFTPPTSAAALQPPVPRQSSG 3459
            KFV  P  V+P PSF YS I                  +   P  S    Q PVP  SS 
Sbjct: 68   KFVNAPPHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTPVPGPSSS 127

Query: 3458 SVPSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQ----------VGKFVP----PNTAA 3321
            S PSFSYN I+    G    Q  Q+ T       Q          V + VP     +T +
Sbjct: 128  SGPSFSYN-IAHKGAGFPGSQPFQSSTDNSGAVAQEAGSMSSASHVSQSVPFPCSSSTMS 186

Query: 3320 SLQPPVPGRPNQFVPGT----VPQNMPAPMQSPISVPKGHPSIXXXXXXXXXSQLPVA-- 3159
                P  G    ++P      VP  MP    +P     G P I           +P A  
Sbjct: 187  VSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTP-----GPPGIAPSTPLSSNLAVPSASM 241

Query: 3158 ---------AESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTAL-PVYVSSSSSMIVP 3009
                     A  P    SSN +    +     ++PA ++SSQ   L P  +         
Sbjct: 242  DFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFV 301

Query: 3008 AAPSVYP 2988
              P+VYP
Sbjct: 302  PYPAVYP 308


>ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Vitis
            vinifera]
          Length = 903

 Score =  892 bits (2305), Expect = 0.0
 Identities = 485/880 (55%), Positives = 577/880 (65%), Gaps = 7/880 (0%)
 Frame = -1

Query: 3167 PVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYP 2988
            P   +  Q   S N   S AV QE G++ +AS  SQS   P    SSS+M V ++P + P
Sbjct: 32   PQLVQKDQTLKSDN---SGAVAQEAGSMSSASHVSQSVPFPC---SSSTMSVSSSPKMGP 85

Query: 2987 MTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXX 2808
             T+W                                   A P+A    +S          
Sbjct: 86   TTLWMPSNPSFPVPSGMPVTPGTPGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPA 145

Query: 2807 XXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPV 2628
                    + QQ+YP Y S PA     QG WLQPPQ+ GL RPP++PYP   P PFPLP 
Sbjct: 146  APVSSNPAIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPA 205

Query: 2627 RGXXXXXXXXXXXXXXXXXXXXXXXXXPATAGLA----QPTSSVGTQSPPPGIDQDKQSD 2460
             G                         P +A ++      TS + ++ PPPGID +K  +
Sbjct: 206  HGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVN 265

Query: 2459 GNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPV 2280
            G  + +G  A +E  D WTAHKT+TG VYYYN+LTG+STYE+PS FKGE DKV VQ TPV
Sbjct: 266  GAGTKDGA-AVNEQVDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPV 324

Query: 2279 STEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQT-SMTSQNA 2103
            S EKL GTDW LV+TNDGKKYY+NTKTK+SSWQ+P E+ E+RK+QDS +L+  +M + N 
Sbjct: 325  SWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNT 384

Query: 2102 SFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLP 1923
            +   +KG +P++LS PAV TGGR+A  LRTS    S+SALDM+KKKLQD+G P TSSP+ 
Sbjct: 385  NVSTEKGPSPIALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVH 444

Query: 1922 ASSVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEEC 1746
            +S  P+ S+LNG   +E   KG QSENSK+KLKD NGDGNM            GPTKEEC
Sbjct: 445  SSG-PIASELNGSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEEC 503

Query: 1745 IIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXX 1566
            IIQFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+PGY+ARR+LFEHYVRT         
Sbjct: 504  IIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEK 563

Query: 1565 XXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPI 1386
                    EGFKQLLEEASEDIDHKT+Y +F++KWG DPRFEALDRKDRE LLNERVLP+
Sbjct: 564  RAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPL 623

Query: 1385 KKAAEQKIQEQRAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLF 1206
            K+AAE+K Q  RAAAVSSFKSMLRD GDI TS+RWSRVKD LRNDPRYK VKHEDRE+LF
Sbjct: 624  KRAAEEKAQAIRAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILF 683

Query: 1205 NEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVA 1026
            NEYISEL                EQDKL                   RVRLKVRRKEAV+
Sbjct: 684  NEYISELKAAEEEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVS 743

Query: 1025 SYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSARE 846
            SYQALLVETIKDP+ SWTES PKLEKDPQ RA+N DLD +D EKLFREH+K L+ER A E
Sbjct: 744  SYQALLVETIKDPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHE 803

Query: 845  FRALLAEVITAETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEM 666
            FRALL+EV+TAE A Q T+DGK VLTSWS AKRLL+ D RY KMPRKDRES+W+R+++EM
Sbjct: 804  FRALLSEVLTAEAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEM 863

Query: 665  QRRQKPSDSKEEKPHSEFKNKISADSERSPA-PRRTHSRR 549
             R+QK +  + E+ H+E K + S DS R P+  RR H RR
Sbjct: 864  LRKQKLAQDQTEEKHTEVKGRSSVDSGRFPSGSRRAHERR 903


>ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C isoform X4 [Vitis
            vinifera]
          Length = 848

 Score =  880 bits (2273), Expect = 0.0
 Identities = 475/851 (55%), Positives = 564/851 (66%), Gaps = 7/851 (0%)
 Frame = -1

Query: 3080 AASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXXXXXXXXXXX 2901
            +AS  SQS   P    SSS+M V ++P + P T+W                         
Sbjct: 3    SASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPGI 59

Query: 2900 XXXXXXXXANARPAAMDPSASLRPMXXXXXXXXXXXXXXVHQQLYPPYHSQPAMAPPPQG 2721
                      A P+A    +S                  + QQ+YP Y S PA     QG
Sbjct: 60   APSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQG 119

Query: 2720 HWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXXXXXXXXXXXXXXXXXXXPA 2541
             WLQPPQ+ GL RPP++PYP   P PFPLP  G                         P 
Sbjct: 120  PWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPI 179

Query: 2540 TAGLA----QPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKSEDADLWTAHKTETGAVY 2373
            +A ++      TS + ++ PPPGID +K  +G  + +G  A +E  D WTAHKT+TG VY
Sbjct: 180  SAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA-AVNEQVDAWTAHKTDTGVVY 238

Query: 2372 YYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKV 2193
            YYN+LTG+STYE+PS FKGE DKV VQ TPVS EKL GTDW LV+TNDGKKYY+NTKTK+
Sbjct: 239  YYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKL 298

Query: 2192 SSWQLPVEVAELRKRQDSDSLQT-SMTSQNASFGMDKGSAPVSLSVPAVNTGGREAMALR 2016
            SSWQ+P E+ E+RK+QDS +L+  +M + N +   +KG +P++LS PAV TGGR+A  LR
Sbjct: 299  SSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLR 358

Query: 2015 TSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNGLGPVEAIAKGQQSENSK 1836
            TS    S+SALDM+KKKLQD+G P TSSP+ +S  P+ S+LNG   +E   KG QSENSK
Sbjct: 359  TSAVPGSASALDMIKKKLQDSGAPATSSPVHSSG-PIASELNGSRVIEPTVKGLQSENSK 417

Query: 1835 EKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFD 1659
            +KLKD NGDGNM            GPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FD
Sbjct: 418  DKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFD 477

Query: 1658 PRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKTDYH 1479
            PRFKA+PGY+ARR+LFEHYVRT                 EGFKQLLEEASEDIDHKT+Y 
Sbjct: 478  PRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQ 537

Query: 1478 SFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQRAAAVSSFKSMLRDGGDI 1299
            +F++KWG DPRFEALDRKDRE LLNERVLP+K+AAE+K Q  RAAAVSSFKSMLRD GDI
Sbjct: 538  TFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDI 597

Query: 1298 NTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXXXXXXXXXXXXXXEQDKLX 1119
             TS+RWSRVKD LRNDPRYK VKHEDRE+LFNEYISEL                EQDKL 
Sbjct: 598  TTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLK 657

Query: 1118 XXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKDPKASWTESNPKLEKDPQ 939
                              RVRLKVRRKEAV+SYQALLVETIKDP+ SWTES PKLEKDPQ
Sbjct: 658  ERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQ 717

Query: 938  GRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAETAVQMTDDGKNVLTSWS 759
             RA+N DLD +D EKLFREH+K L+ER A EFRALL+EV+TAE A Q T+DGK VLTSWS
Sbjct: 718  ARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWS 777

Query: 758  EAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQKPSDSKEEKPHSEFKNKISADSERS 579
             AKRLL+ D RY KMPRKDRES+W+R+++EM R+QK +  + E+ H+E K + S DS R 
Sbjct: 778  TAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDSGRF 837

Query: 578  PA-PRRTHSRR 549
            P+  RR H RR
Sbjct: 838  PSGSRRAHERR 848


>ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [Gossypium raimondii]
            gi|763747828|gb|KJB15267.1| hypothetical protein
            B456_002G167700 [Gossypium raimondii]
          Length = 887

 Score =  837 bits (2163), Expect = 0.0
 Identities = 462/876 (52%), Positives = 568/876 (64%), Gaps = 8/876 (0%)
 Frame = -1

Query: 3152 SPQNKHSSNTSASAAVVQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2985
            +PQ   ++    S +    TGT   A+SS    SQS  LPV+ SS  +M     PS  P+
Sbjct: 24   NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83

Query: 2984 TMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2805
            T  ++                                 A  A   PS+++          
Sbjct: 84   T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136

Query: 2804 XXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2625
                   V QQ+YPPY S P+M   PQG+W+Q P + G  RPP++PYPT  P PFP    
Sbjct: 137  PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196

Query: 2624 GXXXXXXXXXXXXXXXXXXXXXXXXXPATAGLAQPTSSVGTQSPPPGIDQDKQSDGNTST 2445
            G                          A A LA  + ++ T  PP GID  K    + +T
Sbjct: 197  GMPLPAPSSDSQPPGVRPLGMSPFAPSAAA-LANQSLAILTGFPPQGIDNRKLVH-DVTT 254

Query: 2444 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2265
              E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L
Sbjct: 255  KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314

Query: 2264 VGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMD 2088
             GTDW LV+TNDGKKYY+N+KTK+SSWQ+P EV ELRK+QDS+ S + +++  N     +
Sbjct: 315  AGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAE 374

Query: 2087 KGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVP 1908
            KGS P+SLS PAVNTGGR+AM LRTS    SSSALD++KKKLQD G+P +SSP+P   V 
Sbjct: 375  KGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVT 433

Query: 1907 VISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFK 1731
               +LNG   V+   KG QSE++K+KLKDANGDG++            GP+KEECI+QFK
Sbjct: 434  ATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFK 491

Query: 1730 EMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXX 1551
            EMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T              
Sbjct: 492  EMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQK 551

Query: 1550 XXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAE 1371
               EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL +K+AAE
Sbjct: 552  AAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAE 611

Query: 1370 QKIQEQRAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYIS 1191
            +K +  RAAA SSFKSML++ GDIN +SRWSRVKD LR+DPRYK VKHEDREVLFNEYIS
Sbjct: 612  EKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYIS 671

Query: 1190 ELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQAL 1011
            EL                E++KL                   RVRLKVRRKEAVAS+QAL
Sbjct: 672  ELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQAL 731

Query: 1010 LVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALL 831
            LVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER   +FRALL
Sbjct: 732  LVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALL 791

Query: 830  AEVITAETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQK 651
            AEVIT +   Q T+ GK  L SWS AKRLLKPDPRY+KMPRK+RE++W+R+A++M R+QK
Sbjct: 792  AEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQK 851

Query: 650  PSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 549
             +  +EE+ H++ K + S       S   RRTH RR
Sbjct: 852  SALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 887


>gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium raimondii]
          Length = 888

 Score =  833 bits (2152), Expect = 0.0
 Identities = 463/877 (52%), Positives = 568/877 (64%), Gaps = 9/877 (1%)
 Frame = -1

Query: 3152 SPQNKHSSNTSASAAVVQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2985
            +PQ   ++    S +    TGT   A+SS    SQS  LPV+ SS  +M     PS  P+
Sbjct: 24   NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83

Query: 2984 TMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2805
            T  ++                                 A  A   PS+++          
Sbjct: 84   T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136

Query: 2804 XXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2625
                   V QQ+YPPY S P+M   PQG+W+Q P + G  RPP++PYPT  P PFP    
Sbjct: 137  PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196

Query: 2624 GXXXXXXXXXXXXXXXXXXXXXXXXXPATAGLAQPTSSVGTQSPPPGIDQDKQSDGNTST 2445
            G                          A A LA  + ++ T  PP GID  K    + +T
Sbjct: 197  GMPLPAPSSDSQPPGVRPLGMSPFAPSAAA-LANQSLAILTGFPPQGIDNRKLVH-DVTT 254

Query: 2444 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2265
              E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L
Sbjct: 255  KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314

Query: 2264 VGTDWVLVSTNDGKKYYHNTKTKV-SSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGM 2091
             GTDW LV+TNDGKKYY+N+KTKV SSWQ+P EV ELRK+QDS+ S + +++  N     
Sbjct: 315  AGTDWALVTTNDGKKYYYNSKTKVISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVA 374

Query: 2090 DKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSV 1911
            +KGS P+SLS PAVNTGGR+AM LRTS    SSSALD++KKKLQD G+P +SSP+P   V
Sbjct: 375  EKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPV 433

Query: 1910 PVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQF 1734
                +LNG   V+   KG QSE++K+KLKDANGDG++            GP+KEECI+QF
Sbjct: 434  TATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQF 491

Query: 1733 KEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXX 1554
            KEMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T             
Sbjct: 492  KEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQ 551

Query: 1553 XXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAA 1374
                EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL +K+AA
Sbjct: 552  KAAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAA 611

Query: 1373 EQKIQEQRAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYI 1194
            E+K +  RAAA SSFKSML++ GDIN +SRWSRVKD LR+DPRYK VKHEDREVLFNEYI
Sbjct: 612  EEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYI 671

Query: 1193 SELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQA 1014
            SEL                E++KL                   RVRLKVRRKEAVAS+QA
Sbjct: 672  SELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQA 731

Query: 1013 LLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRAL 834
            LLVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER   +FRAL
Sbjct: 732  LLVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRAL 791

Query: 833  LAEVITAETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQ 654
            LAEVIT +   Q T+ GK  L SWS AKRLLKPDPRY+KMPRK+RE++W+R+A++M R+Q
Sbjct: 792  LAEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQ 851

Query: 653  KPSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 549
            K +  +EE+ H++ K + S       S   RRTH RR
Sbjct: 852  KSALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 888


>gb|KJB15269.1| hypothetical protein B456_002G167700 [Gossypium raimondii]
          Length = 886

 Score =  833 bits (2152), Expect = 0.0
 Identities = 461/876 (52%), Positives = 567/876 (64%), Gaps = 8/876 (0%)
 Frame = -1

Query: 3152 SPQNKHSSNTSASAAVVQETGTVPAASSS----SQSTALPVYVSSSSSMIVPAAPSVYPM 2985
            +PQ   ++    S +    TGT   A+SS    SQS  LPV+ SS  +M     PS  P+
Sbjct: 24   NPQLVQNAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPV 83

Query: 2984 TMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXXXX 2805
            T  ++                                 A  A   PS+++          
Sbjct: 84   T--SRMPTTPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAV-----PGPGA 136

Query: 2804 XXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVR 2625
                   V QQ+YPPY S P+M   PQG+W+Q P + G  RPP++PYPT  P PFP    
Sbjct: 137  PVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSS 196

Query: 2624 GXXXXXXXXXXXXXXXXXXXXXXXXXPATAGLAQPTSSVGTQSPPPGIDQDKQSDGNTST 2445
            G                          A A LA  + ++ T  PP GID  K    + +T
Sbjct: 197  GMPLPAPSSDSQPPGVRPLGMSPFAPSAAA-LANQSLAILTGFPPQGIDNRKLVH-DVTT 254

Query: 2444 NGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKL 2265
              E A +E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPD+V VQ TPVS E+L
Sbjct: 255  KVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQL 314

Query: 2264 VGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMD 2088
             GTDW LV+TNDGKKYY+N+KTK+SSWQ+P EV ELRK+QDS+ S + +++  N     +
Sbjct: 315  AGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAE 374

Query: 2087 KGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVP 1908
            KGS P+SLS PAVNTGGR+AM LRTS    SSSALD++KKKLQD G+P +SSP+P   V 
Sbjct: 375  KGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVT 433

Query: 1907 VISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFK 1731
               +LNG   V+   KG QSE++K+KLKDANGDG++            GP+KEECI+QFK
Sbjct: 434  ATHELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFK 491

Query: 1730 EMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXX 1551
            EMLKERGVAPFSKWEKELPKI+FDPRFKA+P ++ARR+LFEHYV+T              
Sbjct: 492  EMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQK 551

Query: 1550 XXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAE 1371
               EGFKQLL+EASEDIDH T+Y +FKRKWGSDPRFEALDRKDRE LLNERVL +K+AAE
Sbjct: 552  AAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAE 611

Query: 1370 QKIQEQRAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYIS 1191
            +K +  RAAA SSFKSML++ GDIN +SRWSRVKD LR+DPRYK VKHEDREVLFNEYIS
Sbjct: 612  EKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYIS 671

Query: 1190 ELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQAL 1011
            EL                 ++KL                   RVRLKVRRKEAVAS+QAL
Sbjct: 672  ELKAIEEKAERKDKVKKE-EEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQAL 730

Query: 1010 LVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALL 831
            LVETIKDP+ASWTES PKLEKDPQGRA+NPDLD +D EKLFREH+K L+ER   +FRALL
Sbjct: 731  LVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALL 790

Query: 830  AEVITAETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQK 651
            AEVIT +   Q T+ GK  L SWS AKRLLKPDPRY+KMPRK+RE++W+R+A++M R+QK
Sbjct: 791  AEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQK 850

Query: 650  PSDSKEEKPHSEFKNKISAD--SERSPAPRRTHSRR 549
             +  +EE+ H++ K + S       S   RRTH RR
Sbjct: 851  SALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 886


>ref|XP_008221026.1| PREDICTED: pre-mRNA-processing protein 40C [Prunus mume]
          Length = 858

 Score =  832 bits (2148), Expect = 0.0
 Identities = 459/867 (52%), Positives = 556/867 (64%), Gaps = 15/867 (1%)
 Frame = -1

Query: 3107 VVQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVYPMTMWTQXXXXXXXXXXXXXX 2928
            V QETG V  +S+SS S +LP   SSSS+M + +AP++   T W                
Sbjct: 7    VAQETGNVSLSSTSSHSGSLPAPTSSSSTMNLLSAPNMGTTTSWVPTAPSFNLTSGMPGT 66

Query: 2927 XXXXXXXXXXXXXXXXXANARPAAMDPSAS--LRPMXXXXXXXXXXXXXXVHQQLYPPYH 2754
                                 P+A   S+S  LRP                  Q+  PY 
Sbjct: 67   PGTPGPPGIAHPVQISFNPTAPSAPIDSSSVALRPSMQIAPVASSAV----QPQVGAPYP 122

Query: 2753 SQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXXXXXXXXXX 2574
            S  +M  PPQG WLQ PQ+ G  RPP++PYP   P PFP P                   
Sbjct: 123  SLSSMGAPPQGVWLQSPQIGGFPRPPFLPYPAAFPVPFPSPAH------VMPLPSVPLPD 176

Query: 2573 XXXXXXXXXPATAGLAQPTSSVGTQS----------PPPGIDQDKQSDGNTSTNGEIAKS 2424
                       TA ++ P+++ G Q           P PGID  KQS    + N   + +
Sbjct: 177  SQPPGVTPVGNTAAISSPSAASGHQLAGFSGIQIELPLPGIDNRKQSHDAGNEN-RASVN 235

Query: 2423 EDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVL 2244
            E  D WTAHKTETG VYYYN+LTG+STY++P  FK EPDKV +Q TPVST  L GTDWVL
Sbjct: 236  EQLDAWTAHKTETGVVYYYNALTGESTYDKPPGFKEEPDKVSMQPTPVSTVNLSGTDWVL 295

Query: 2243 VSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTS-QNASFGMDKGSAPVS 2067
            V+T+DGKK+YHN+KTKVSSWQ+P EV ELRK+QD+D  +    S  N +   +KGSAP+S
Sbjct: 296  VTTSDGKKFYHNSKTKVSSWQIPNEVIELRKKQDADVPKEHPVSIPNNNVMTEKGSAPIS 355

Query: 2066 LSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPASSVPVISDLNG 1887
            L+ PA+N GGREAMA + S    +SSALD++KKKLQD+G PVTSSP+PA S     + NG
Sbjct: 356  LTAPAINMGGREAMAFKPSAVQGTSSALDLIKKKLQDSGAPVTSSPVPAPS-----ESNG 410

Query: 1886 LGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECIIQFKEMLKERG 1710
               VE+  KGQQS+NSK+KLKD NGDGN+            GPTKEECI QFKEMLKERG
Sbjct: 411  SRGVESTPKGQQSDNSKDKLKDINGDGNLSDSSSDSEDADSGPTKEECITQFKEMLKERG 470

Query: 1709 VAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFK 1530
            VAPFSKW+KELPKI+FDPRFKA+P ++ARR+LFEHYV+T                 EGFK
Sbjct: 471  VAPFSKWDKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFK 530

Query: 1529 QLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQR 1350
            QLL+EASEDIDH TDY SF++KW +DPRFEALDRKDRE LLNERVLP+K+AAE+K Q  R
Sbjct: 531  QLLDEASEDIDHNTDYQSFRKKWANDPRFEALDRKDREHLLNERVLPLKRAAEEKAQAAR 590

Query: 1349 AAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXXX 1170
            AAA +SFKSML++ GDI  SSRWSRVKD LRNDPRYKSV+HEDRE+LFN+YIS+L     
Sbjct: 591  AAASTSFKSMLQEKGDITVSSRWSRVKDSLRNDPRYKSVRHEDREILFNQYISDLKAVEE 650

Query: 1169 XXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIKD 990
                       EQ+KL                   RVRLKVRRKEAVA++QALLVETIKD
Sbjct: 651  EAEREAKAKRDEQEKLRERERELRKRKEREEQETERVRLKVRRKEAVATFQALLVETIKD 710

Query: 989  PKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITAE 810
            P+ASWT S PKLEKDPQ RA+NPDL+ +D EKLFREH+K L ER A EFRALLAEV+TAE
Sbjct: 711  PQASWTGSKPKLEKDPQRRAANPDLEPSDMEKLFREHIKRLNERCAHEFRALLAEVLTAE 770

Query: 809  TAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQKPSDSKEE 630
             A Q T+DGK VL SWS AKRLLKPDPRY+KM RK+RE +W+R+++EM R+QK +   +E
Sbjct: 771  AASQETEDGKTVLNSWSTAKRLLKPDPRYNKMARKEREVLWRRYSEEMLRKQKSALDHKE 830

Query: 629  KPHSEFKNKISADSERSP-APRRTHSR 552
               ++ K++ S D  R P   R TH R
Sbjct: 831  DRKTDAKSRSSVDGGRVPFGSRGTHDR 857


>ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma cacao]
            gi|508709257|gb|EOY01154.1| Pre-mRNA-processing protein
            40C [Theobroma cacao]
          Length = 816

 Score =  822 bits (2123), Expect = 0.0
 Identities = 434/749 (57%), Positives = 522/749 (69%), Gaps = 6/749 (0%)
 Frame = -1

Query: 2777 QQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLPVRGXXXXXXXX 2598
            QQ+YP Y   P+MA  PQG W+Q P + G  RPP++PYPT  P PFP    G        
Sbjct: 75   QQIYPTYTPLPSMASSPQGFWMQHPPMGGFPRPPFVPYPTIYPGPFPSASSGMPHPAPSS 134

Query: 2597 XXXXXXXXXXXXXXXXXPAT--AGLAQPTSSVGTQSPPPGIDQDKQSDGNTSTNGEIAKS 2424
                                  A  +   S + T  PP GID     + N  T  E A +
Sbjct: 135  DSQPPGVSPLATSPFAPSIAIPANQSSVASGIQTGFPPQGID-----NRNVGTRVEAAVN 189

Query: 2423 EDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVSTEKLVGTDWVL 2244
            E +D+WTAHKT+TG VYYYN+LTG+STYE+P+ FKGEPDKVPVQ TPVS E+L GT+W L
Sbjct: 190  EQSDIWTAHKTDTGIVYYYNALTGESTYEKPAGFKGEPDKVPVQPTPVSVEQLAGTEWAL 249

Query: 2243 VSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSD-SLQTSMTSQNASFGMDKGSAPVS 2067
            V+T+DGKKYY+N+KTK+SSWQ+P EVAELRK+QD+D S + ++   N     +KGS P+S
Sbjct: 250  VTTSDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDVSKEHAVPVPNIDVVAEKGSTPIS 309

Query: 2066 LSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMP-VTSSPLPASSVPVISDLN 1890
            LS PAV+TGGR+AM LRTS    SSSALD++KKKLQD+G+P  +SS +P   V    +LN
Sbjct: 310  LSAPAVSTGGRDAMPLRTSVVPGSSSALDLIKKKLQDSGVPSSSSSSVPVMPVTAAQELN 369

Query: 1889 GLGPVEAIAKGQQSENSKEKLKDANGDGNM-XXXXXXXXXXSGPTKEECIIQFKEMLKER 1713
            G   V+   KG QSENSK+KLKDANGDGN+           SGP+KEECI+QFKEMLKER
Sbjct: 370  GSRAVD--VKGLQSENSKDKLKDANGDGNISDSSSDSEDTDSGPSKEECIMQFKEMLKER 427

Query: 1712 GVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1533
            GVAPFSKWEKELPKI+FDPRFKA+P ++ARR LFEHYV+T                 EGF
Sbjct: 428  GVAPFSKWEKELPKIVFDPRFKAIPSHSARRTLFEHYVKTRAEEERREKRAALKAAIEGF 487

Query: 1532 KQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKKAAEQKIQEQ 1353
            KQLL+EASEDIDH T+Y +FKRKWGSD RFEALDRKDRE LL ERVLP+K+AAE+K Q  
Sbjct: 488  KQLLDEASEDIDHNTNYQTFKRKWGSDLRFEALDRKDRELLLTERVLPLKRAAEEKAQAI 547

Query: 1352 RAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISELXXXX 1173
            RAAA SS KSML++ GDI  +SRWSRVKD +R+DPRYK VKHEDREVLFNEYISEL    
Sbjct: 548  RAAAASSLKSMLKEKGDITVNSRWSRVKDSIRDDPRYKCVKHEDREVLFNEYISELKAVE 607

Query: 1172 XXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASYQALLVETIK 993
                        E++KL                   RVRLKVRRKEAVAS+QALLVETIK
Sbjct: 608  EKAERKERVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIK 667

Query: 992  DPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFRALLAEVITA 813
            DP+ASWTES PKLEKDPQGRA+NPDLD +DTEKLFREH+K L+ER   +FRALLAEVIT 
Sbjct: 668  DPQASWTESKPKLEKDPQGRAANPDLDPSDTEKLFREHIKMLFERCTHDFRALLAEVITQ 727

Query: 812  ETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQRRQKPSDSKE 633
            + A Q T+ GK V  SWS AKRLLKPDPRYSKMPRK+RE++W+R+A++M R+QK +  +E
Sbjct: 728  DAAAQETEGGKTVFNSWSTAKRLLKPDPRYSKMPRKEREALWRRYAEDMLRKQKSALDQE 787

Query: 632  EKPHSEFKNKISADSER-SPAPRRTHSRR 549
            E+  ++ K + S D  R S   R+ H RR
Sbjct: 788  EEKRTDAKVRSSGDLGRFSSGSRKVHERR 816


>ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-like [Citrus sinensis]
          Length = 978

 Score =  819 bits (2116), Expect = 0.0
 Identities = 465/937 (49%), Positives = 569/937 (60%), Gaps = 10/937 (1%)
 Frame = -1

Query: 3329 TAASLQPPVPGRPNQFVPGTVPQ------NMPAPMQSPISVPKGHPSIXXXXXXXXXS-Q 3171
            T  S+  P   +      G +PQ      N      S  SV   +PS+         S  
Sbjct: 47   TNDSISGPSQAKSVTATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSAS 106

Query: 3170 LPVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVY 2991
              V   SP  +   N +   AV ++ G   + S++SQ     V   S S++   +A ++ 
Sbjct: 107  QTVVGYSPNQQFQPNMNKLEAV-EDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALS 165

Query: 2990 PMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXX 2811
              T W                                 ++A       SA LRP      
Sbjct: 166  TTTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRPSVPTPS 224

Query: 2810 XXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLP 2631
                      HQ +YP Y S P +   PQG  LQPPQ+      P++PYP   P+PFPLP
Sbjct: 225  APSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLQPPQMGVRPWLPFLPYPAAYPSPFPLP 283

Query: 2630 VRGXXXXXXXXXXXXXXXXXXXXXXXXXPATA--GLAQPTSSVGTQSPPPGIDQDKQSDG 2457
              G                           +A  G     +S  T++PP G D+ K+   
Sbjct: 284  AHGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK-KEHVH 342

Query: 2456 NTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVS 2277
            + S+    + +E  D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPVQ TP+S
Sbjct: 343  DVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPIS 402

Query: 2276 TEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASF 2097
             E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+   +  N + 
Sbjct: 403  MEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQSVPNTNI 461

Query: 2096 GMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPAS 1917
             ++KGS  +SLS PAVNTGGR+A ALRTS    SSSALD++KKKLQD+G P T+SP P S
Sbjct: 462  VIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVS 520

Query: 1916 SVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECII 1740
            S    S+ NG   VE   KG Q+EN+K+KLKD NGDG M            GPTKEECII
Sbjct: 521  SAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECII 580

Query: 1739 QFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXX 1560
            +FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+   +ARRALFE YV+T           
Sbjct: 581  KFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRA 640

Query: 1559 XXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKK 1380
                  EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNERVLP+K+
Sbjct: 641  AQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKR 700

Query: 1379 AAEQKIQEQRAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNE 1200
            AAE+K Q  RAAA SSFKSMLR+ GDI  SSRWS+VKD LR+DPRYKSV+HEDREV+FNE
Sbjct: 701  AAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNE 760

Query: 1199 YISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASY 1020
            Y+ EL                EQ+KL                   RVRLKVRRKEAV S+
Sbjct: 761  YVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSF 820

Query: 1019 QALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFR 840
            QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER A +FR
Sbjct: 821  QALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFR 880

Query: 839  ALLAEVITAETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQR 660
             LLAEVITAE A Q T+DGK VL SWS AKR+LKP+PRYSKMPRK+RE++W+R A+E+QR
Sbjct: 881  GLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQR 940

Query: 659  RQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 549
            + K S  + E  H + K++ S D  R P+  R +  R
Sbjct: 941  KHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 977


>ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citrus clementina]
            gi|557539684|gb|ESR50728.1| hypothetical protein
            CICLE_v10030612mg [Citrus clementina]
          Length = 1015

 Score =  818 bits (2113), Expect = 0.0
 Identities = 476/1001 (47%), Positives = 589/1001 (58%), Gaps = 7/1001 (0%)
 Frame = -1

Query: 3530 PAVKFTPPTSAAALQPPVPRQSSGSVPSFSYNLISQPNVGSASGQQLQTGTVTGPGNIQV 3351
            P ++     ++ A  PP  +Q + + P     +  +P  GS       T T  G      
Sbjct: 31   PFIRSDQIMTSPAWLPPEVQQLTANAP-----ISGKPVGGSLVASSTPTPTSNGSDTATN 85

Query: 3350 GKFVPPNTAASLQPP---VPGRPNQFVPGTVPQNMPAPMQSPISVPKGHPSIXXXXXXXX 3180
                 P+ A S+      +P     F      QN      S  SV   +PS+        
Sbjct: 86   DSISGPSQAKSVTATGGVIPQSSFSF------QNSEGSGHSASSVINSNPSVPPGVSSFT 139

Query: 3179 XS-QLPVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTALPVYVSSSSSMIVPAA 3003
             S    V   SP  +   N +   AV ++ G   + S++SQ     V   S S++   +A
Sbjct: 140  YSASQTVVGYSPNQQFQPNMNKLEAV-EDAGLGSSTSTNSQPVQASVRTFSDSTVATSSA 198

Query: 3002 PSVYPMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMX 2823
             ++   T W                                 ++A       SA LRP  
Sbjct: 199  TALSTTTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRPSV 257

Query: 2822 XXXXXXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAP 2643
                          HQ +YP + S P +   PQ   LQPPQ+      P++PYP   P+P
Sbjct: 258  PTPSAPSNSGSAIQHQ-IYPTHPSLPPVGVSPQRPLLQPPQMGVRPWLPFLPYPAAYPSP 316

Query: 2642 FPLPVRGXXXXXXXXXXXXXXXXXXXXXXXXXPATA--GLAQPTSSVGTQSPPPGIDQDK 2469
            FPLP  G                           +A  G     +S  T++PP G D+ K
Sbjct: 317  FPLPAHGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK-K 375

Query: 2468 QSDGNTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQS 2289
            +   + S+    + +E  D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPVQ 
Sbjct: 376  EHVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQP 435

Query: 2288 TPVSTEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQ 2109
            TP+S E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+   +  
Sbjct: 436  TPISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQSVP 494

Query: 2108 NASFGMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSP 1929
            N +  ++KGS  +SLS PAVNTGGR+A ALRTS    SSSALD++KKKLQD+G P T+SP
Sbjct: 495  NTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASP 553

Query: 1928 LPASSVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKE 1752
             P SS    S+ NG   VE   KG Q+EN+K+KLKD NGDG M            GPTKE
Sbjct: 554  APVSSAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKE 613

Query: 1751 ECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXX 1572
            ECII+FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+   +ARRALFE YV+T       
Sbjct: 614  ECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERK 673

Query: 1571 XXXXXXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVL 1392
                      EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNERVL
Sbjct: 674  EKRAAQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVL 733

Query: 1391 PIKKAAEQKIQEQRAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREV 1212
            P+K+AAE+K Q  RAAA SSFKSMLR+ GDI  SSRWS+VKD LR+DPRYKSV+HEDREV
Sbjct: 734  PLKRAAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREV 793

Query: 1211 LFNEYISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEA 1032
            +FNEY+ EL                EQ+KL                   RVRLKVRRKEA
Sbjct: 794  IFNEYVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEA 853

Query: 1031 VASYQALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSA 852
            V S+QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER A
Sbjct: 854  VTSFQALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCA 913

Query: 851  REFRALLAEVITAETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFAD 672
             +FR LLAEVITAE A Q T+DGK VL SWS AKR+LKPDPRYSKMPRK+RE++W+R A+
Sbjct: 914  HDFRGLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPDPRYSKMPRKEREALWRRHAE 973

Query: 671  EMQRRQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 549
            E+QR+ K S  + E  H + K++ S D  R P+  R +  R
Sbjct: 974  EIQRKHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 1014


>gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis]
          Length = 978

 Score =  818 bits (2112), Expect = 0.0
 Identities = 464/937 (49%), Positives = 569/937 (60%), Gaps = 10/937 (1%)
 Frame = -1

Query: 3329 TAASLQPPVPGRPNQFVPGTVPQ------NMPAPMQSPISVPKGHPSIXXXXXXXXXS-Q 3171
            T  S+  P   +      G +PQ      N      S  SV   +PS+         S  
Sbjct: 47   TNDSISGPSQAKSVTATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSAS 106

Query: 3170 LPVAAESPQNKHSSNTSASAAVVQETGTVPAASSSSQSTALPVYVSSSSSMIVPAAPSVY 2991
              V   SP  +   N +   AV ++ G   + S++SQ     V   S S++   +A ++ 
Sbjct: 107  QTVVGYSPNQQFQPNMNKLEAV-EDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALS 165

Query: 2990 PMTMWTQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXANARPAAMDPSASLRPMXXXXX 2811
              T W                                 ++A       SA LRP      
Sbjct: 166  TTTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-GDFYSSAGLRPSVPTPS 224

Query: 2810 XXXXXXXXXVHQQLYPPYHSQPAMAPPPQGHWLQPPQVSGLQRPPYMPYPTGLPAPFPLP 2631
                      HQ +YP Y S P +   PQG  L+PPQ+      P++PYP   P+PFPLP
Sbjct: 225  APSNSGSAIQHQ-IYPTYPSLPPIGVSPQGPLLRPPQMGVRPWLPFLPYPAAYPSPFPLP 283

Query: 2630 VRGXXXXXXXXXXXXXXXXXXXXXXXXXPATA--GLAQPTSSVGTQSPPPGIDQDKQSDG 2457
              G                           +A  G     +S  T++PP G D+ K+   
Sbjct: 284  AHGMPNPSVSQIDAQPPGLSSVRTAAATSHSAIPGHQLVGTSGNTEAPPSGTDK-KEHVH 342

Query: 2456 NTSTNGEIAKSEDADLWTAHKTETGAVYYYNSLTGQSTYERPSSFKGEPDKVPVQSTPVS 2277
            + S+    + +E  D WTAHKT+TG VYYYN++TG+STYE+P+ FKGEPDKVPVQ TP+S
Sbjct: 343  DVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPIS 402

Query: 2276 TEKLVGTDWVLVSTNDGKKYYHNTKTKVSSWQLPVEVAELRKRQDSDSLQTSMTSQNASF 2097
             E L GTDW LV+TNDGKKYY+N+K KVSSWQ+P EV EL+K++D D+L+   +  N + 
Sbjct: 403  MEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLK-EQSVPNTNI 461

Query: 2096 GMDKGSAPVSLSVPAVNTGGREAMALRTSGAMASSSALDMVKKKLQDAGMPVTSSPLPAS 1917
             ++KGS  +SLS PAVNTGGR+A ALRTS    SSSALD++KKKLQD+G P T+SP P S
Sbjct: 462  VIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVS 520

Query: 1916 SVPVISDLNGLGPVEAIAKGQQSENSKEKLKDANGDGNMXXXXXXXXXXS-GPTKEECII 1740
            S    S+ NG   VE   KG Q+EN+K+KLKD NGDG M            GPTKEECII
Sbjct: 521  SAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECII 580

Query: 1739 QFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYTARRALFEHYVRTXXXXXXXXXXX 1560
            +FKEMLKERGVAPFSKWEKELPKI+FDPRFKA+   +ARRALFE YV+T           
Sbjct: 581  KFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRA 640

Query: 1559 XXXXXXEGFKQLLEEASEDIDHKTDYHSFKRKWGSDPRFEALDRKDREALLNERVLPIKK 1380
                  EGFKQLLEE SEDIDH TDY +FK+KWGSDPRFEALDRKDRE LLNERVLP+K+
Sbjct: 641  AQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKR 700

Query: 1379 AAEQKIQEQRAAAVSSFKSMLRDGGDINTSSRWSRVKDGLRNDPRYKSVKHEDREVLFNE 1200
            AAE+K Q  RAAA SSFKSMLR+ GDI  SSRWS+VKD LR+DPRYKSV+HEDREV+FNE
Sbjct: 701  AAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNE 760

Query: 1199 YISELXXXXXXXXXXXXXXXXEQDKLXXXXXXXXXXXXXXXXXXXRVRLKVRRKEAVASY 1020
            Y+ EL                EQ+KL                   RVRLKVRRKEAV S+
Sbjct: 761  YVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSF 820

Query: 1019 QALLVETIKDPKASWTESNPKLEKDPQGRASNPDLDKADTEKLFREHVKTLYERSAREFR 840
            QALLVETIKDP+ASWTES PKLEKDPQGRA+N DLD +D EKLFREH+KTLYER A +FR
Sbjct: 821  QALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFR 880

Query: 839  ALLAEVITAETAVQMTDDGKNVLTSWSEAKRLLKPDPRYSKMPRKDRESIWKRFADEMQR 660
             LLAEVITAE A Q T+DGK VL SWS AKR+LKP+PRYSKMPRK+RE++W+R A+E+QR
Sbjct: 881  GLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQR 940

Query: 659  RQKPSDSKEEKPHSEFKNKISADSERSPAPRRTHSRR 549
            + K S  + E  H + K++ S D  R P+  R +  R
Sbjct: 941  KHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQER 977

