BLASTX nr result

ID: Zanthoxylum22_contig00002317 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00002317
         (3246 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-l...  1199   0.0  
ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citr...  1199   0.0  
gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sin...  1198   0.0  
gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sin...  1029   0.0  
ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C i...   922   0.0  
ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C i...   909   0.0  
ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C i...   865   0.0  
ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C i...   859   0.0  
ref|XP_012089634.1| PREDICTED: pre-mRNA-processing protein 40C i...   848   0.0  
gb|KDP22962.1| hypothetical protein JCGZ_01659 [Jatropha curcas]      847   0.0  
ref|XP_012089638.1| PREDICTED: pre-mRNA-processing protein 40C i...   839   0.0  
ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma c...   832   0.0  
ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [...   825   0.0  
gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium r...   821   0.0  
ref|XP_008353148.1| PREDICTED: pre-mRNA-processing protein 40C-l...   820   0.0  
ref|XP_009351698.1| PREDICTED: pre-mRNA-processing protein 40C [...   819   0.0  
ref|XP_008221026.1| PREDICTED: pre-mRNA-processing protein 40C [...   815   0.0  
ref|XP_002515795.1| Pre-mRNA-processing protein PRP40, putative ...   805   0.0  
ref|XP_007221939.1| hypothetical protein PRUPE_ppa001490mg [Prun...   789   0.0  
ref|XP_008360017.1| PREDICTED: LOW QUALITY PROTEIN: pre-mRNA-pro...   787   0.0  

>ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-like [Citrus sinensis]
          Length = 978

 Score = 1199 bits (3103), Expect = 0.0
 Identities = 652/982 (66%), Positives = 701/982 (71%), Gaps = 3/982 (0%)
 Frame = -3

Query: 3187 MTSPAWLPPEAQQLTSNAPISAKPAGGPLLASSTPIAPVSNVSGIAPSDSINEHSQEKSV 3008
            MTSPAWLPPE QQLT+NAPIS KP GG L+ASSTPIAP SN S  A +DSI+  SQ KSV
Sbjct: 1    MTSPAWLPPEVQQLTANAPISGKPVGGSLVASSTPIAPTSNGSDTATNDSISGPSQAKSV 60

Query: 3007 TAPGGVVPHPSFAFRNSGGTQHSTSFVINSNPSVAPDVSSLSYSVSQTVAGYSPNRQFQP 2828
            TA GGV+P  SF+F+NS G+ HS S VINSNPSV P VSS +YS SQTV GYSPN+QFQP
Sbjct: 61   TATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSASQTVVGYSPNQQFQP 120

Query: 2827 NTTKPGTVSHAVFGSSTSTNSQPVP--LXXXXXXXXXXXXAHKVGATTSWMPTIPSFPMX 2654
            N  K   V  A  GSSTSTNSQPV   +            A  +  TTSWMPTIPSF   
Sbjct: 121  NMNKLEAVEDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALSTTTSWMPTIPSFSTP 180

Query: 2653 XXXXXXXXXXXXXXXXXXXTKDTLSAVGDFNTSAVLRPSVPMASAPSNSGSTVQHPIHPT 2474
                               TKDT SA GDF +SA LRPSVP  SAPSNSGS +QH I+PT
Sbjct: 181  PGLFVTPQTQAPPGLLTLRTKDTSSAFGDFYSSAGLRPSVPTPSAPSNSGSAIQHQIYPT 240

Query: 2473 YPSXXXXXXXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPP 2294
            YPS                  GV PWLPFLPYPA YPSPFPLPAH MP+PSVS  DAQPP
Sbjct: 241  YPSLPPIGVSPQGPLLQPPQMGVRPWLPFLPYPAAYPSPFPLPAHGMPNPSVSQIDAQPP 300

Query: 2293 GVSYMRTSTANSKHSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDA 2114
            G+S MRT+ A S HSAI GHQLVG+SG  TEA PSG DKKEHVHDVS++ G SVNEQLDA
Sbjct: 301  GLSSMRTAAATS-HSAIPGHQLVGTSG-NTEAPPSGTDKKEHVHDVSSRIGASVNEQLDA 358

Query: 2113 WTAHKTDTGIVYYYNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDWALVTTND 1934
            WTAHKTDTGIVYYYNAVTGESTY+KPAGFKGE DKVPVQPTP+SME L GTDWALVTTND
Sbjct: 359  WTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPISMEHLTGTDWALVTTND 418

Query: 1933 GKKYYYNNKLKVSSWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKGSTT-SLSAPA 1757
            GKKYYYN+K+KVSSWQIPSE+TEL+KKEDDD LKE   S  NTNIVIEKGS   SLS+PA
Sbjct: 419  GKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLKE--QSVPNTNIVIEKGSNAISLSSPA 476

Query: 1756 INTGGRDATALRTSSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSELNGSKVVEV 1577
            +NTGGRDATALRTSS+PGSSSALDLIKKKLQDSG               SE NGSK VEV
Sbjct: 477  VNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTPTASPAPVSSAAATSESNGSKAVEV 536

Query: 1576 TIKGLQNENIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSK 1397
            T+KGLQNEN KDKLKDI                  GPTKEECIIKFKEMLKERGVAPFSK
Sbjct: 537  TVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIKFKEMLKERGVAPFSK 596

Query: 1396 WEKELPKIVFDPRFKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEA 1217
            WEKELPKIVFDPRFKAI SQSARRALFER+VKT                 EGFKQLLEE 
Sbjct: 597  WEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAAQKAAIEGFKQLLEEV 656

Query: 1216 SEDIDQNTDYQTFRKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASS 1037
            SEDID +TDYQTF+KKWGSD RFEALDRKDRELLLNERVLPLK              ASS
Sbjct: 657  SEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAASS 716

Query: 1036 FKSMLREKGDVTLNSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXX 857
            FKSMLREKGD+TL+SRWSKVKDILR+DPRYKSV+HEDREV+FNEYVR             
Sbjct: 717  FKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEYVRELKAAEEEAEREA 776

Query: 856  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWT 677
                                                    VTSFQALLVETIK PQASWT
Sbjct: 777  KARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQALLVETIKDPQASWT 836

Query: 676  ESRPKLEKDPQGRATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXX 497
            ESRPKLEKDPQGRATN +LD SD EKLFREH+K LYERCA+DFR                
Sbjct: 837  ESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRGLLAEVITAEAAAQET 896

Query: 496  EDGKTVLNSWSTAKRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDS 317
            EDGKTVLNSWSTAKRVLKP+PRY+KMPRKERE  WRR+AE++ RK KSSLD+NED+HKDS
Sbjct: 897  EDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRKHKSSLDQNEDNHKDS 956

Query: 316  KSRSSADGGRLPSGSRRNHERR 251
            KSRSS DGGR PS SRRN ERR
Sbjct: 957  KSRSSTDGGRPPSSSRRNQERR 978


>ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citrus clementina]
            gi|557539684|gb|ESR50728.1| hypothetical protein
            CICLE_v10030612mg [Citrus clementina]
          Length = 1015

 Score = 1199 bits (3102), Expect = 0.0
 Identities = 654/989 (66%), Positives = 704/989 (71%), Gaps = 3/989 (0%)
 Frame = -3

Query: 3208 FCAADQIMTSPAWLPPEAQQLTSNAPISAKPAGGPLLASSTPIAPVSNVSGIAPSDSINE 3029
            F  +DQIMTSPAWLPPE QQLT+NAPIS KP GG L+ASSTP  P SN S  A +DSI+ 
Sbjct: 32   FIRSDQIMTSPAWLPPEVQQLTANAPISGKPVGGSLVASSTP-TPTSNGSDTATNDSISG 90

Query: 3028 HSQEKSVTAPGGVVPHPSFAFRNSGGTQHSTSFVINSNPSVAPDVSSLSYSVSQTVAGYS 2849
             SQ KSVTA GGV+P  SF+F+NS G+ HS S VINSNPSV P VSS +YS SQTV GYS
Sbjct: 91   PSQAKSVTATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSASQTVVGYS 150

Query: 2848 PNRQFQPNTTKPGTVSHAVFGSSTSTNSQPVP--LXXXXXXXXXXXXAHKVGATTSWMPT 2675
            PN+QFQPN  K   V  A  GSSTSTNSQPV   +            A  +  TTSWMPT
Sbjct: 151  PNQQFQPNMNKLEAVEDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALSTTTSWMPT 210

Query: 2674 IPSFPMXXXXXXXXXXXXXXXXXXXXTKDTLSAVGDFNTSAVLRPSVPMASAPSNSGSTV 2495
            IPSF                      TKDT SA GDF +SA LRPSVP  SAPSNSGS +
Sbjct: 211  IPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAFGDFYSSAGLRPSVPTPSAPSNSGSAI 270

Query: 2494 QHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVS 2315
            QH I+PT+PS                  GV PWLPFLPYPA YPSPFPLPAH MP+PSVS
Sbjct: 271  QHQIYPTHPSLPPVGVSPQRPLLQPPQMGVRPWLPFLPYPAAYPSPFPLPAHGMPNPSVS 330

Query: 2314 SADAQPPGVSYMRTSTANSKHSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDS 2135
              DAQPPG+S MRT+ A S HSAI GHQLVG+SG  TEA PSG DKKEHVHDVS++ G S
Sbjct: 331  QIDAQPPGLSSMRTAAATS-HSAIPGHQLVGTSG-NTEAPPSGTDKKEHVHDVSSRIGAS 388

Query: 2134 VNEQLDAWTAHKTDTGIVYYYNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDW 1955
            VNEQLDAWTAHKTDTGIVYYYNAVTGESTY+KPAGFKGE DKVPVQPTP+SME L GTDW
Sbjct: 389  VNEQLDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPISMEHLTGTDW 448

Query: 1954 ALVTTNDGKKYYYNNKLKVSSWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKGSTT 1775
            ALVTTNDGKKYYYN+K+KVSSWQIPSE+TEL+KKEDDD LKE   S  NTNIVIEKGS  
Sbjct: 449  ALVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLKE--QSVPNTNIVIEKGSNA 506

Query: 1774 -SLSAPAINTGGRDATALRTSSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSELN 1598
             SLS+PA+NTGGRDATALRTSS+PGSSSALDLIKKKLQDSG               SE N
Sbjct: 507  ISLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTPTASPAPVSSAAATSESN 566

Query: 1597 GSKVVEVTIKGLQNENIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKER 1418
            GSK VEVT+KGLQNEN KDKLKDI                  GPTKEECIIKFKEMLKER
Sbjct: 567  GSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIKFKEMLKER 626

Query: 1417 GVAPFSKWEKELPKIVFDPRFKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGF 1238
            GVAPFSKWEKELPKIVFDPRFKAI SQSARRALFER+VKT                 EGF
Sbjct: 627  GVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAAQKAAIEGF 686

Query: 1237 KQLLEEASEDIDQNTDYQTFRKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXX 1058
            KQLLEE SEDID +TDYQTF+KKWGSD RFEALDRKDRELLLNERVLPLK          
Sbjct: 687  KQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAI 746

Query: 1057 XXXXASSFKSMLREKGDVTLNSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXX 878
                ASSFKSMLREKGD+TL+SRWSKVKDILR+DPRYKSV+HEDREV+FNEYVR      
Sbjct: 747  RAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEYVRELKAAE 806

Query: 877  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIK 698
                                                           VTSFQALLVETIK
Sbjct: 807  EEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQALLVETIK 866

Query: 697  GPQASWTESRPKLEKDPQGRATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXX 518
             PQASWTESRPKLEKDPQGRATN +LD SD EKLFREH+K LYERCA+DFR         
Sbjct: 867  DPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRGLLAEVITA 926

Query: 517  XXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKN 338
                   EDGKTVLNSWSTAKRVLKPDPRY+KMPRKERE  WRR+AE++ RK KSSLD+N
Sbjct: 927  EAAAQETEDGKTVLNSWSTAKRVLKPDPRYSKMPRKEREALWRRHAEEIQRKHKSSLDQN 986

Query: 337  EDSHKDSKSRSSADGGRLPSGSRRNHERR 251
            ED+HKDSKSRSS DGGR PS SRRN ERR
Sbjct: 987  EDNHKDSKSRSSTDGGRPPSSSRRNQERR 1015


>gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis]
          Length = 978

 Score = 1198 bits (3099), Expect = 0.0
 Identities = 651/982 (66%), Positives = 701/982 (71%), Gaps = 3/982 (0%)
 Frame = -3

Query: 3187 MTSPAWLPPEAQQLTSNAPISAKPAGGPLLASSTPIAPVSNVSGIAPSDSINEHSQEKSV 3008
            MTSPAWLPPE QQLT+NAPIS KP GG L+ASSTPIAP SN S  A +DSI+  SQ KSV
Sbjct: 1    MTSPAWLPPEVQQLTANAPISGKPVGGSLVASSTPIAPTSNGSDTATNDSISGPSQAKSV 60

Query: 3007 TAPGGVVPHPSFAFRNSGGTQHSTSFVINSNPSVAPDVSSLSYSVSQTVAGYSPNRQFQP 2828
            TA GGV+P  SF+F+NS G+ HS S VINSNPSV P VSS +YS SQTV GYSPN+QFQP
Sbjct: 61   TATGGVIPQSSFSFQNSEGSGHSASSVINSNPSVPPGVSSFTYSASQTVVGYSPNQQFQP 120

Query: 2827 NTTKPGTVSHAVFGSSTSTNSQPVP--LXXXXXXXXXXXXAHKVGATTSWMPTIPSFPMX 2654
            N  K   V  A  GSSTSTNSQPV   +            A  +  TTSWMPTIPSF   
Sbjct: 121  NMNKLEAVEDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALSTTTSWMPTIPSFSTP 180

Query: 2653 XXXXXXXXXXXXXXXXXXXTKDTLSAVGDFNTSAVLRPSVPMASAPSNSGSTVQHPIHPT 2474
                               TKDT SA GDF +SA LRPSVP  SAPSNSGS +QH I+PT
Sbjct: 181  PGLFVTPQTQAPPGLLTLRTKDTSSAFGDFYSSAGLRPSVPTPSAPSNSGSAIQHQIYPT 240

Query: 2473 YPSXXXXXXXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPP 2294
            YPS                  GV PWLPFLPYPA YPSPFPLPAH MP+PSVS  DAQPP
Sbjct: 241  YPSLPPIGVSPQGPLLRPPQMGVRPWLPFLPYPAAYPSPFPLPAHGMPNPSVSQIDAQPP 300

Query: 2293 GVSYMRTSTANSKHSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDA 2114
            G+S +RT+ A S HSAI GHQLVG+SG  TEA PSG DKKEHVHDVS++ G SVNEQLDA
Sbjct: 301  GLSSVRTAAATS-HSAIPGHQLVGTSG-NTEAPPSGTDKKEHVHDVSSRIGASVNEQLDA 358

Query: 2113 WTAHKTDTGIVYYYNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDWALVTTND 1934
            WTAHKTDTGIVYYYNAVTGESTY+KPAGFKGE DKVPVQPTP+SME L GTDWALVTTND
Sbjct: 359  WTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPISMEHLTGTDWALVTTND 418

Query: 1933 GKKYYYNNKLKVSSWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKGSTT-SLSAPA 1757
            GKKYYYN+K+KVSSWQIPSE+TEL+KKEDDD LKE   S  NTNIVIEKGS   SLS+PA
Sbjct: 419  GKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLKE--QSVPNTNIVIEKGSNAISLSSPA 476

Query: 1756 INTGGRDATALRTSSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSELNGSKVVEV 1577
            +NTGGRDATALRTSS+PGSSSALDLIKKKLQDSG               SE NGSK VEV
Sbjct: 477  VNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTPTASPAPVSSAAATSESNGSKAVEV 536

Query: 1576 TIKGLQNENIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSK 1397
            T+KGLQNEN KDKLKDI                  GPTKEECIIKFKEMLKERGVAPFSK
Sbjct: 537  TVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIKFKEMLKERGVAPFSK 596

Query: 1396 WEKELPKIVFDPRFKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEA 1217
            WEKELPKIVFDPRFKAI SQSARRALFER+VKT                 EGFKQLLEE 
Sbjct: 597  WEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAAQKAAIEGFKQLLEEV 656

Query: 1216 SEDIDQNTDYQTFRKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASS 1037
            SEDID +TDYQTF+KKWGSD RFEALDRKDRELLLNERVLPLK              ASS
Sbjct: 657  SEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAASS 716

Query: 1036 FKSMLREKGDVTLNSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXX 857
            FKSMLREKGD+TL+SRWSKVKDILR+DPRYKSV+HEDREV+FNEYVR             
Sbjct: 717  FKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEYVRELKAAEEEAEREA 776

Query: 856  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWT 677
                                                    VTSFQALLVETIK PQASWT
Sbjct: 777  KARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQALLVETIKDPQASWT 836

Query: 676  ESRPKLEKDPQGRATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXX 497
            ESRPKLEKDPQGRATN +LD SD EKLFREH+K LYERCA+DFR                
Sbjct: 837  ESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRGLLAEVITAEAAAQET 896

Query: 496  EDGKTVLNSWSTAKRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDS 317
            EDGKTVLNSWSTAKRVLKP+PRY+KMPRKERE  WRR+AE++ RK KSSLD+NED+HKDS
Sbjct: 897  EDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRKHKSSLDQNEDNHKDS 956

Query: 316  KSRSSADGGRLPSGSRRNHERR 251
            KSRSS DGGR PS SRRN ERR
Sbjct: 957  KSRSSTDGGRPPSSSRRNQERR 978


>gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis]
            gi|641834042|gb|KDO53045.1| hypothetical protein
            CISIN_1g002026mg [Citrus sinensis]
          Length = 857

 Score = 1029 bits (2660), Expect = 0.0
 Identities = 563/855 (65%), Positives = 603/855 (70%), Gaps = 3/855 (0%)
 Frame = -3

Query: 2806 VSHAVFGSSTSTNSQPVP--LXXXXXXXXXXXXAHKVGATTSWMPTIPSFPMXXXXXXXX 2633
            V  A  GSSTSTNSQPV   +            A  +  TTSWMPTIPSF          
Sbjct: 7    VEDAGLGSSTSTNSQPVQASVRTFSDSTVATSSATALSTTTSWMPTIPSFSTPPGLFVTP 66

Query: 2632 XXXXXXXXXXXXTKDTLSAVGDFNTSAVLRPSVPMASAPSNSGSTVQHPIHPTYPSXXXX 2453
                        TKDT SA GDF +SA LRPSVP  SAPSNSGS +QH I+PTYPS    
Sbjct: 67   QTQAPPGLLTLRTKDTSSAFGDFYSSAGLRPSVPTPSAPSNSGSAIQHQIYPTYPSLPPI 126

Query: 2452 XXXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRT 2273
                          GV PWLPFLPYPA YPSPFPLPAH MP+PSVS  DAQPPG+S +RT
Sbjct: 127  GVSPQGPLLRPPQMGVRPWLPFLPYPAAYPSPFPLPAHGMPNPSVSQIDAQPPGLSSVRT 186

Query: 2272 STANSKHSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTD 2093
            + A S HSAI GHQLVG+SG  TEA PSG DKKEHVHDVS++ G SVNEQLDAWTAHKTD
Sbjct: 187  AAATS-HSAIPGHQLVGTSG-NTEAPPSGTDKKEHVHDVSSRIGASVNEQLDAWTAHKTD 244

Query: 2092 TGIVYYYNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYN 1913
            TGIVYYYNAVTGESTY+KPAGFKGE DKVPVQPTP+SME L GTDWALVTTNDGKKYYYN
Sbjct: 245  TGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPISMEHLTGTDWALVTTNDGKKYYYN 304

Query: 1912 NKLKVSSWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKGSTT-SLSAPAINTGGRD 1736
            +K+KVSSWQIPSE+TEL+KKEDDD LKE   S  NTNIVIEKGS   SLS+PA+NTGGRD
Sbjct: 305  SKMKVSSWQIPSEVTELKKKEDDDTLKE--QSVPNTNIVIEKGSNAISLSSPAVNTGGRD 362

Query: 1735 ATALRTSSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSELNGSKVVEVTIKGLQN 1556
            ATALRTSS+PGSSSALDLIKKKLQDSG               SE NGSK VEVT+KGLQN
Sbjct: 363  ATALRTSSMPGSSSALDLIKKKLQDSGTPTASPAPVSSAAATSESNGSKAVEVTVKGLQN 422

Query: 1555 ENIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPK 1376
            EN KDKLKDI                  GPTKEECIIKFKEMLKERGVAPFSKWEKELPK
Sbjct: 423  ENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIKFKEMLKERGVAPFSKWEKELPK 482

Query: 1375 IVFDPRFKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQN 1196
            IVFDPRFKAI SQSARRALFER+VKT                 EGFKQLLEE SEDID +
Sbjct: 483  IVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAAQKAAIEGFKQLLEEVSEDIDHS 542

Query: 1195 TDYQTFRKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLRE 1016
            TDYQTF+KKWGSD RFEALDRKDRELLLNERVLPLK              ASSFKSMLRE
Sbjct: 543  TDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAASSFKSMLRE 602

Query: 1015 KGDVTLNSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXX 836
            KGD+TL+SRWSKVKDILR+DPRYKSV+HEDREV+FNEYVR                    
Sbjct: 603  KGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEYVRELKAAEEEAEREAKARREEQ 662

Query: 835  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLE 656
                                             VTSFQALLVETIK PQASWTESRPKLE
Sbjct: 663  EKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQALLVETIKDPQASWTESRPKLE 722

Query: 655  KDPQGRATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVL 476
            KDPQGRATN +LD SD EKLFREH+K LYERCA+DFR                EDGKTVL
Sbjct: 723  KDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRGLLAEVITAEAAAQETEDGKTVL 782

Query: 475  NSWSTAKRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSSAD 296
            NSWSTAKRVLKP+PRY+KMPRKERE  WRR+AE++ RK KSSLD+NED+HKDSKSRSS D
Sbjct: 783  NSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRKHKSSLDQNEDNHKDSKSRSSTD 842

Query: 295  GGRLPSGSRRNHERR 251
            GGR PS SRRN ERR
Sbjct: 843  GGRPPSSSRRNQERR 857


>ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Vitis
            vinifera]
          Length = 1013

 Score =  922 bits (2384), Expect = 0.0
 Identities = 526/1017 (51%), Positives = 623/1017 (61%), Gaps = 38/1017 (3%)
 Frame = -3

Query: 3187 MTSPAWLPPEAQQLTSNAPISAKPAGGPLLASSTP---IAPVS--------NVSGIAPSD 3041
            M SPAWLP E Q   S  P++  PAGGP     TP   IAP S          SG A S+
Sbjct: 1    MASPAWLPVEVQSSASQNPVTGLPAGGPSGGPPTPTGAIAPASVATIRTSEGASGTA-SN 59

Query: 3040 SINEHSQEKSVTAPGGVVPHPSFAFRN-------SGGTQHSTS-FVINSNPSVAPDV--- 2894
            SI E +Q K V AP  V+P PSF++         SG +Q   S  VI+SNP  +  V   
Sbjct: 60   SIQESAQGKFVNAPPHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQT 119

Query: 2893 ----------SSLSYSVSQTVAGYSPNRQFQPNTTKPGTVSHAVFGSSTSTN-SQPVPLX 2747
                       S SY+++   AG+  ++ FQ +T   G V+      S++++ SQ VP  
Sbjct: 120  PVPGPSSSSGPSFSYNIAHKGAGFPGSQPFQSSTDNSGAVAQEAGSMSSASHVSQSVPFP 179

Query: 2746 XXXXXXXXXXXAHKVGATTSWMPTIPSFPMXXXXXXXXXXXXXXXXXXXXTKDTLSAVG- 2570
                         K+G TT WMP+ PSFP+                       +  AV  
Sbjct: 180  CSSSTMSVSSSP-KMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPGIAPSTPLSSNLAVPS 238

Query: 2569 ---DFNTSAVLRPSVPMASAPSNSGSTVQHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMP 2399
               DF++S V R   P  +AP +S   +Q  I+P+Y S                  G +P
Sbjct: 239  ASMDFSSSVVSRAIFP--AAPVSSNPAIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLP 296

Query: 2398 WLPFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRTSTANSKHSAISGHQLVGS 2219
              PF+PYPAVYP+PFPLPAH MP PSV   D+QPPGV+ + T+      +A+SGH L  +
Sbjct: 297  RPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAVSGHHLANT 356

Query: 2218 SGMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTDTGIVYYYNAVTGESTYDK 2039
            SGM +E  P GID  +HV+   TK G +VNEQ+DAWTAHKTDTG+VYYYNA+TGESTY+K
Sbjct: 357  SGMLSELPPPGIDDNKHVNGAGTKDGAAVNEQVDAWTAHKTDTGVVYYYNALTGESTYEK 416

Query: 2038 PAGFKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYNNKLKVSSWQIPSELTELR 1859
            P+ FKGE DKV VQPTPVS E L GTDWALVTTNDGKKYYYN K K+SSWQIP+ELTE+R
Sbjct: 417  PSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQIPTELTEMR 476

Query: 1858 KKEDDDALKEHIMSTLNTNIVIEKG-STTSLSAPAINTGGRDATALRTSSVPGSSSALDL 1682
            KK+D  ALKEH M   NTN+  EKG S  +LSAPA+ TGGRDAT LRTS+VPGS+SALD+
Sbjct: 477  KKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAVPGSASALDM 536

Query: 1681 IKKKLQDSGIXXXXXXXXXXXXXXSELNGSKVVEVTIKGLQNENIKDKLKDIXXXXXXXX 1502
            IKKKLQDSG               SELNGS+V+E T+KGLQ+EN KDKLKD         
Sbjct: 537  IKKKLQDSGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSENSKDKLKDTNGDGNMSD 596

Query: 1501 XXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSQSARRA 1322
                      GPTKEECII+FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIP  SARR+
Sbjct: 597  SSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPGYSARRS 656

Query: 1321 LFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQNTDYQTFRKKWGSDRRFEA 1142
            LFE +V+T                 EGFKQLLEEASEDID  T+YQTFRKKWG D RFEA
Sbjct: 657  LFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRKKWGDDPRFEA 716

Query: 1141 LDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLREKGDVTLNSRWSKVKDILR 962
            LDRKDRELLLNERVLPLK               SSFKSMLR+KGD+T ++RWS+VKD LR
Sbjct: 717  LDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTSTRWSRVKDSLR 776

Query: 961  NDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 782
            NDPRYK VKHEDRE++FNEY+                                       
Sbjct: 777  NDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERERELRKRKEREE 836

Query: 781  XXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLEKDPQGRATNPELDPSDAE 602
                           V+S+QALLVETIK PQ SWTES+PKLEKDPQ RATN +LDPSD E
Sbjct: 837  QEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARATNSDLDPSDLE 896

Query: 601  KLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNK 422
            KLFREH+KML+ER A++FR                EDGKTVL SWSTAKR+L+ D RY K
Sbjct: 897  KLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKRLLRSDTRYIK 956

Query: 421  MPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSSADGGRLPSGSRRNHERR 251
            MPRK+RE  WRRY+E+MLRK+K + D+ E+ H + K RSS D GR PSGSRR HERR
Sbjct: 957  MPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDSGRFPSGSRRAHERR 1013


>ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Vitis
            vinifera] gi|297738259|emb|CBI27460.3| unnamed protein
            product [Vitis vinifera]
          Length = 1046

 Score =  909 bits (2348), Expect = 0.0
 Identities = 529/1050 (50%), Positives = 623/1050 (59%), Gaps = 71/1050 (6%)
 Frame = -3

Query: 3187 MTSPAWLPPEAQQLTSNAPISAKPAGGPLLASSTP---IAPVS--------NVSGIAPSD 3041
            M SPAWLP E Q   S  P++  PAGGP     TP   IAP S          SG A S+
Sbjct: 1    MASPAWLPVEVQSSASQNPVTGLPAGGPSGGPPTPTGAIAPASVATIRTSEGASGTA-SN 59

Query: 3040 SINEHSQEKSVTAPGGVVPHPSFAFRN-------SGGTQHSTS-FVINSNPSVAPDV--- 2894
            SI E +Q K V AP  V+P PSF++         SG +Q   S  VI+SNP  +  V   
Sbjct: 60   SIQESAQGKFVNAPPHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQT 119

Query: 2893 ----------SSLSYSVSQTVAGYSPNRQFQPNTT--------KPGTVSHAVFG------ 2786
                       S SY+++   AG+  ++ FQ +T+         P   S +  G      
Sbjct: 120  PVPGPSSSSGPSFSYNIAHKGAGFPGSQPFQSSTSIASGPRGPTPNAASFSFNGNPQLVQ 179

Query: 2785 --------------------SSTSTNSQPVPLXXXXXXXXXXXXAHKVGATTSWMPTIPS 2666
                                SS S  SQ VP               K+G TT WMP+ PS
Sbjct: 180  KDQTLKSDNSGAVAQEAGSMSSASHVSQSVPFPCSSSTMSVSSSP-KMGPTTLWMPSNPS 238

Query: 2665 FPMXXXXXXXXXXXXXXXXXXXXTKDTLSAVG----DFNTSAVLRPSVPMASAPSNSGST 2498
            FP+                       +  AV     DF++S V R   P  +AP +S   
Sbjct: 239  FPVPSGMPVTPGTPGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFP--AAPVSSNPA 296

Query: 2497 VQHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSV 2318
            +Q  I+P+Y S                  G +P  PF+PYPAVYP+PFPLPAH MP PSV
Sbjct: 297  IQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSV 356

Query: 2317 SSADAQPPGVSYMRTSTANSKHSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGD 2138
               D+QPPGV+ + T+      +A+SGH L  +SGM +E  P GID  +HV+   TK G 
Sbjct: 357  PLPDSQPPGVTPVGTAGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA 416

Query: 2137 SVNEQLDAWTAHKTDTGIVYYYNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTD 1958
            +VNEQ+DAWTAHKTDTG+VYYYNA+TGESTY+KP+ FKGE DKV VQPTPVS E L GTD
Sbjct: 417  AVNEQVDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTD 476

Query: 1957 WALVTTNDGKKYYYNNKLKVSSWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKG-S 1781
            WALVTTNDGKKYYYN K K+SSWQIP+ELTE+RKK+D  ALKEH M   NTN+  EKG S
Sbjct: 477  WALVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPS 536

Query: 1780 TTSLSAPAINTGGRDATALRTSSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSEL 1601
              +LSAPA+ TGGRDAT LRTS+VPGS+SALD+IKKKLQDSG               SEL
Sbjct: 537  PIALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVHSSGPIASEL 596

Query: 1600 NGSKVVEVTIKGLQNENIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKE 1421
            NGS+V+E T+KGLQ+EN KDKLKD                   GPTKEECII+FKEMLKE
Sbjct: 597  NGSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKE 656

Query: 1420 RGVAPFSKWEKELPKIVFDPRFKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEG 1241
            RGVAPFSKWEKELPKIVFDPRFKAIP  SARR+LFE +V+T                 EG
Sbjct: 657  RGVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEG 716

Query: 1240 FKQLLEEASEDIDQNTDYQTFRKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXX 1061
            FKQLLEEASEDID  T+YQTFRKKWG D RFEALDRKDRELLLNERVLPLK         
Sbjct: 717  FKQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQA 776

Query: 1060 XXXXXASSFKSMLREKGDVTLNSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXX 881
                  SSFKSMLR+KGD+T ++RWS+VKD LRNDPRYK VKHEDRE++FNEY+      
Sbjct: 777  IRAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAA 836

Query: 880  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETI 701
                                                            V+S+QALLVETI
Sbjct: 837  EEEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETI 896

Query: 700  KGPQASWTESRPKLEKDPQGRATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXX 521
            K PQ SWTES+PKLEKDPQ RATN +LDPSD EKLFREH+KML+ER A++FR        
Sbjct: 897  KDPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLT 956

Query: 520  XXXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDK 341
                    EDGKTVL SWSTAKR+L+ D RY KMPRK+RE  WRRY+E+MLRK+K + D+
Sbjct: 957  AEAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQ 1016

Query: 340  NEDSHKDSKSRSSADGGRLPSGSRRNHERR 251
             E+ H + K RSS D GR PSGSRR HERR
Sbjct: 1017 TEEKHTEVKGRSSVDSGRFPSGSRRAHERR 1046


>ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Vitis
            vinifera]
          Length = 903

 Score =  865 bits (2235), Expect = 0.0
 Identities = 480/914 (52%), Positives = 567/914 (62%), Gaps = 5/914 (0%)
 Frame = -3

Query: 2977 SFAFRNSGGTQHSTSFVINSNPSVAPDVSSLSYSVSQTVAGYSPNRQFQPNTTKPGTVSH 2798
            S A    G T ++ SF  N NP +     +L    S  VA  + +     + ++  +V  
Sbjct: 11   SIASGPRGPTPNAASFSFNGNPQLVQKDQTLKSDNSGAVAQEAGSMSSASHVSQ--SVPF 68

Query: 2797 AVFGSSTSTNSQPVPLXXXXXXXXXXXXAHKVGATTSWMPTIPSFPMXXXXXXXXXXXXX 2618
                S+ S +S P                 K+G TT WMP+ PSFP+             
Sbjct: 69   PCSSSTMSVSSSP-----------------KMGPTTLWMPSNPSFPVPSGMPVTPGTPGP 111

Query: 2617 XXXXXXXTKDTLSAVG----DFNTSAVLRPSVPMASAPSNSGSTVQHPIHPTYPSXXXXX 2450
                      +  AV     DF++S V R   P  +AP +S   +Q  I+P+Y S     
Sbjct: 112  PGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFP--AAPVSSNPAIQQQIYPSYSSLPATN 169

Query: 2449 XXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRTS 2270
                         G +P  PF+PYPAVYP+PFPLPAH MP PSV   D+QPPGV+ + T+
Sbjct: 170  ASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTA 229

Query: 2269 TANSKHSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTDT 2090
                  +A+SGH L  +SGM +E  P GID  +HV+   TK G +VNEQ+DAWTAHKTDT
Sbjct: 230  GGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGAAVNEQVDAWTAHKTDT 289

Query: 2089 GIVYYYNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYNN 1910
            G+VYYYNA+TGESTY+KP+ FKGE DKV VQPTPVS E L GTDWALVTTNDGKKYYYN 
Sbjct: 290  GVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNT 349

Query: 1909 KLKVSSWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKG-STTSLSAPAINTGGRDA 1733
            K K+SSWQIP+ELTE+RKK+D  ALKEH M   NTN+  EKG S  +LSAPA+ TGGRDA
Sbjct: 350  KTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDA 409

Query: 1732 TALRTSSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSELNGSKVVEVTIKGLQNE 1553
            T LRTS+VPGS+SALD+IKKKLQDSG               SELNGS+V+E T+KGLQ+E
Sbjct: 410  TPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSE 469

Query: 1552 NIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPKI 1373
            N KDKLKD                   GPTKEECII+FKEMLKERGVAPFSKWEKELPKI
Sbjct: 470  NSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI 529

Query: 1372 VFDPRFKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQNT 1193
            VFDPRFKAIP  SARR+LFE +V+T                 EGFKQLLEEASEDID  T
Sbjct: 530  VFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKT 589

Query: 1192 DYQTFRKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLREK 1013
            +YQTFRKKWG D RFEALDRKDRELLLNERVLPLK               SSFKSMLR+K
Sbjct: 590  EYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDK 649

Query: 1012 GDVTLNSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXXX 833
            GD+T ++RWS+VKD LRNDPRYK VKHEDRE++FNEY+                      
Sbjct: 650  GDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQD 709

Query: 832  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLEK 653
                                            V+S+QALLVETIK PQ SWTES+PKLEK
Sbjct: 710  KLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEK 769

Query: 652  DPQGRATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVLN 473
            DPQ RATN +LDPSD EKLFREH+KML+ER A++FR                EDGKTVL 
Sbjct: 770  DPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLT 829

Query: 472  SWSTAKRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSSADG 293
            SWSTAKR+L+ D RY KMPRK+RE  WRRY+E+MLRK+K + D+ E+ H + K RSS D 
Sbjct: 830  SWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDS 889

Query: 292  GRLPSGSRRNHERR 251
            GR PSGSRR HERR
Sbjct: 890  GRFPSGSRRAHERR 903


>ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C isoform X4 [Vitis
            vinifera]
          Length = 848

 Score =  859 bits (2220), Expect = 0.0
 Identities = 469/850 (55%), Positives = 544/850 (64%), Gaps = 5/850 (0%)
 Frame = -3

Query: 2785 SSTSTNSQPVPLXXXXXXXXXXXXAHKVGATTSWMPTIPSFPMXXXXXXXXXXXXXXXXX 2606
            SS S  SQ VP               K+G TT WMP+ PSFP+                 
Sbjct: 2    SSASHVSQSVPFPCSSSTMSVSSSP-KMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPGIA 60

Query: 2605 XXXTKDTLSAVG----DFNTSAVLRPSVPMASAPSNSGSTVQHPIHPTYPSXXXXXXXXX 2438
                  +  AV     DF++S V R   P  +AP +S   +Q  I+P+Y S         
Sbjct: 61   PSTPLSSNLAVPSASMDFSSSVVSRAIFP--AAPVSSNPAIQQQIYPSYSSLPATNASSQ 118

Query: 2437 XXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRTSTANS 2258
                     G +P  PF+PYPAVYP+PFPLPAH MP PSV   D+QPPGV+ + T+    
Sbjct: 119  GPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTP 178

Query: 2257 KHSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTDTGIVY 2078
              +A+SGH L  +SGM +E  P GID  +HV+   TK G +VNEQ+DAWTAHKTDTG+VY
Sbjct: 179  ISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGAAVNEQVDAWTAHKTDTGVVY 238

Query: 2077 YYNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYNNKLKV 1898
            YYNA+TGESTY+KP+ FKGE DKV VQPTPVS E L GTDWALVTTNDGKKYYYN K K+
Sbjct: 239  YYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKL 298

Query: 1897 SSWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKG-STTSLSAPAINTGGRDATALR 1721
            SSWQIP+ELTE+RKK+D  ALKEH M   NTN+  EKG S  +LSAPA+ TGGRDAT LR
Sbjct: 299  SSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLR 358

Query: 1720 TSSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSELNGSKVVEVTIKGLQNENIKD 1541
            TS+VPGS+SALD+IKKKLQDSG               SELNGS+V+E T+KGLQ+EN KD
Sbjct: 359  TSAVPGSASALDMIKKKLQDSGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSENSKD 418

Query: 1540 KLKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDP 1361
            KLKD                   GPTKEECII+FKEMLKERGVAPFSKWEKELPKIVFDP
Sbjct: 419  KLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDP 478

Query: 1360 RFKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQNTDYQT 1181
            RFKAIP  SARR+LFE +V+T                 EGFKQLLEEASEDID  T+YQT
Sbjct: 479  RFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQT 538

Query: 1180 FRKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLREKGDVT 1001
            FRKKWG D RFEALDRKDRELLLNERVLPLK               SSFKSMLR+KGD+T
Sbjct: 539  FRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDIT 598

Query: 1000 LNSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXXXXXXX 821
             ++RWS+VKD LRNDPRYK VKHEDRE++FNEY+                          
Sbjct: 599  TSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKE 658

Query: 820  XXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLEKDPQG 641
                                        V+S+QALLVETIK PQ SWTES+PKLEKDPQ 
Sbjct: 659  RERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQA 718

Query: 640  RATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVLNSWST 461
            RATN +LDPSD EKLFREH+KML+ER A++FR                EDGKTVL SWST
Sbjct: 719  RATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWST 778

Query: 460  AKRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSSADGGRLP 281
            AKR+L+ D RY KMPRK+RE  WRRY+E+MLRK+K + D+ E+ H + K RSS D GR P
Sbjct: 779  AKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDSGRFP 838

Query: 280  SGSRRNHERR 251
            SGSRR HERR
Sbjct: 839  SGSRRAHERR 848


>ref|XP_012089634.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Jatropha
            curcas] gi|802761009|ref|XP_012089635.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X1 [Jatropha
            curcas] gi|802761012|ref|XP_012089636.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X1 [Jatropha
            curcas] gi|802761015|ref|XP_012089637.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X1 [Jatropha
            curcas]
          Length = 846

 Score =  848 bits (2191), Expect = 0.0
 Identities = 468/847 (55%), Positives = 544/847 (64%), Gaps = 2/847 (0%)
 Frame = -3

Query: 2785 SSTSTNSQPVPLXXXXXXXXXXXXAHKVGATTSWMPTIPSFPMXXXXXXXXXXXXXXXXX 2606
            SSTST SQ + L            +  +G +TS MP +PS  +                 
Sbjct: 2    SSTSTVSQSISLPLHSPSSSTLPSSPNLGPSTSQMPVVPSLLVPPRLAGTTRAPESSALV 61

Query: 2605 XXXTKDTLSAVGDFNTSAVLRPSVPMASAPSNSGSTVQHPIHPTYPSXXXXXXXXXXXXX 2426
                    S   D  +SAV RP + + + P+ S   VQ   +PTYPS             
Sbjct: 62   SCAPMTLPSVPVDPASSAVQRPMM-LTNTPA-SNPVVQQQAYPTYPSLPAMAAPPQGLWF 119

Query: 2425 XXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRTSTANSKHSA 2246
                 G +P  PFLPYPAV+P PFPLPAHS+P  SVSS D+QPPGV+ + T+ AN   SA
Sbjct: 120  QPPQMGGLPRPPFLPYPAVFPGPFPLPAHSIPRASVSSPDSQPPGVTPVGTAGANPPSSA 179

Query: 2245 ISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTDTGIVYYYNA 2066
             SG QL+G+ GM+ E  P GID K+H+H    K   ++NE LD+WTAHKTDTGIVYYYNA
Sbjct: 180  ASGLQLIGTPGMQKELPPPGIDNKDHIHVFDNKDNVAINEPLDSWTAHKTDTGIVYYYNA 239

Query: 2065 VTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYNNKLKVSSWQ 1886
            +T  STY+KP GFKGE +KVP+QPTPVSME+LAGTDWAL+TTNDGKKYYYNNK K+SSWQ
Sbjct: 240  ITRVSTYEKPLGFKGEPEKVPMQPTPVSMENLAGTDWALITTNDGKKYYYNNKTKLSSWQ 299

Query: 1885 IPSELTELRKKEDDDALKEHIMSTLNTNIVIEKGS-TTSLSAPAINTGGRDATALRTSSV 1709
            IPSE+TEL KK++ +  KE  +S L +N+  EKGS   SLSAPAINTGGRDATALRTSS 
Sbjct: 300  IPSEVTELTKKQEAEVSKELEVSLLRSNVSTEKGSGPVSLSAPAINTGGRDATALRTSSA 359

Query: 1708 PGSSSALDLIKKKLQDSGI-XXXXXXXXXXXXXXSELNGSKVVEVTIKGLQNENIKDKLK 1532
            PG SSALDLIKKKLQ+SG                 E NGS+  E T KGL +E   DKLK
Sbjct: 360  PGPSSALDLIKKKLQESGTPVNSSPALVSLGMGTPESNGSRAAEATAKGLLSETSNDKLK 419

Query: 1531 DIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFK 1352
            D                   GPTKEECII+FKEMLKERG+APFSKWEKELPKIVFDPRFK
Sbjct: 420  DTNGGGNASDSSSDSEDEDSGPTKEECIIQFKEMLKERGIAPFSKWEKELPKIVFDPRFK 479

Query: 1351 AIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQNTDYQTFRK 1172
            AIPS SARR+LFE +VKT                 EGFKQLL EASEDIDQ TDYQTFRK
Sbjct: 480  AIPSHSARRSLFEHYVKTRAEEQRKEKRASQKAAIEGFKQLLVEASEDIDQYTDYQTFRK 539

Query: 1171 KWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLREKGDVTLNS 992
            KW +D RFEALDRKDRE LLNERV+PLK              A+SFKSML++KGD+T+NS
Sbjct: 540  KWENDPRFEALDRKDREHLLNERVIPLKKAAQEKVQAERAAAAASFKSMLQDKGDITINS 599

Query: 991  RWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXXXXXXXXXX 812
            RWSKVK+ LRNDPRYKSVKHEDRE +FNEY+                             
Sbjct: 600  RWSKVKESLRNDPRYKSVKHEDREFLFNEYLSELKAVEEEAEREAKVKKEEQEKLKERER 659

Query: 811  XXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLEKDPQGRAT 632
                                     V+SFQALLVETIK PQASWTES+PKLEKD QGRAT
Sbjct: 660  ELRKRKEREEQEMERVRLKVRRKEAVSSFQALLVETIKDPQASWTESKPKLEKDSQGRAT 719

Query: 631  NPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVLNSWSTAKR 452
            NP+LDPSD EKLFREHVKML+ERC  DF+                E+GKTVL+SWST KR
Sbjct: 720  NPDLDPSDTEKLFREHVKMLHERCTQDFKALLAEVINAETAAQKSENGKTVLDSWSTVKR 779

Query: 451  VLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSSADGGRLPSGS 272
            +LKPDPRYNKMPRKERE  WRRY +D+LRK++++LD+ E+ H DSKSR+SAD GR  SGS
Sbjct: 780  LLKPDPRYNKMPRKEREILWRRYTQDILRKQQTTLDQKEEKHTDSKSRNSADSGRYLSGS 839

Query: 271  RRNHERR 251
            RR H+ R
Sbjct: 840  RRTHDGR 846


>gb|KDP22962.1| hypothetical protein JCGZ_01659 [Jatropha curcas]
          Length = 846

 Score =  847 bits (2189), Expect = 0.0
 Identities = 468/847 (55%), Positives = 543/847 (64%), Gaps = 2/847 (0%)
 Frame = -3

Query: 2785 SSTSTNSQPVPLXXXXXXXXXXXXAHKVGATTSWMPTIPSFPMXXXXXXXXXXXXXXXXX 2606
            SSTST SQ + L            +  +G +TS MP +PS  +                 
Sbjct: 2    SSTSTVSQSISLPLHSPSSSTLPSSPNLGPSTSQMPVVPSLLVPPRLAGTTRAPESSALV 61

Query: 2605 XXXTKDTLSAVGDFNTSAVLRPSVPMASAPSNSGSTVQHPIHPTYPSXXXXXXXXXXXXX 2426
                    S   D  +SAV RP + + + P+ S   VQ   +PTYPS             
Sbjct: 62   SCAPMTLPSVPVDPASSAVQRPMM-LTNTPA-SNPVVQQQAYPTYPSLPAMAAPPQGLWF 119

Query: 2425 XXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRTSTANSKHSA 2246
                 G +P  PFLPYPAV+P PFPLPAHS+P  SVSS D+QPPGV+ + T+ AN   SA
Sbjct: 120  QPPQMGGLPRPPFLPYPAVFPGPFPLPAHSIPRASVSSPDSQPPGVTPVGTAGANPPSSA 179

Query: 2245 ISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTDTGIVYYYNA 2066
             SG QL+G+ GM+ E  P GID K+H+H    K   ++NE LD+WTAHKTDTGIVYYYNA
Sbjct: 180  ASGLQLIGTPGMQKELPPPGIDNKDHIHVFDNKDNVAINEPLDSWTAHKTDTGIVYYYNA 239

Query: 2065 VTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYNNKLKVSSWQ 1886
            +T  STY+KP GFKGE +KVP+QPTPVSME+LAGTDWAL+TTNDGKKYYYNNK KV SWQ
Sbjct: 240  ITRVSTYEKPLGFKGEPEKVPMQPTPVSMENLAGTDWALITTNDGKKYYYNNKTKVCSWQ 299

Query: 1885 IPSELTELRKKEDDDALKEHIMSTLNTNIVIEKGS-TTSLSAPAINTGGRDATALRTSSV 1709
            IPSE+TEL KK++ +  KE  +S L +N+  EKGS   SLSAPAINTGGRDATALRTSS 
Sbjct: 300  IPSEVTELTKKQEAEVSKELEVSLLRSNVSTEKGSGPVSLSAPAINTGGRDATALRTSSA 359

Query: 1708 PGSSSALDLIKKKLQDSGI-XXXXXXXXXXXXXXSELNGSKVVEVTIKGLQNENIKDKLK 1532
            PG SSALDLIKKKLQ+SG                 E NGS+  E T KGL +E   DKLK
Sbjct: 360  PGPSSALDLIKKKLQESGTPVNSSPALVSLGMGTPESNGSRAAEATAKGLLSETSNDKLK 419

Query: 1531 DIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFK 1352
            D                   GPTKEECII+FKEMLKERG+APFSKWEKELPKIVFDPRFK
Sbjct: 420  DTNGGGNASDSSSDSEDEDSGPTKEECIIQFKEMLKERGIAPFSKWEKELPKIVFDPRFK 479

Query: 1351 AIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQNTDYQTFRK 1172
            AIPS SARR+LFE +VKT                 EGFKQLL EASEDIDQ TDYQTFRK
Sbjct: 480  AIPSHSARRSLFEHYVKTRAEEQRKEKRASQKAAIEGFKQLLVEASEDIDQYTDYQTFRK 539

Query: 1171 KWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLREKGDVTLNS 992
            KW +D RFEALDRKDRE LLNERV+PLK              A+SFKSML++KGD+T+NS
Sbjct: 540  KWENDPRFEALDRKDREHLLNERVIPLKKAAQEKVQAERAAAAASFKSMLQDKGDITINS 599

Query: 991  RWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXXXXXXXXXX 812
            RWSKVK+ LRNDPRYKSVKHEDRE +FNEY+                             
Sbjct: 600  RWSKVKESLRNDPRYKSVKHEDREFLFNEYLSELKAVEEEAEREAKVKKEEQEKLKERER 659

Query: 811  XXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLEKDPQGRAT 632
                                     V+SFQALLVETIK PQASWTES+PKLEKD QGRAT
Sbjct: 660  ELRKRKEREEQEMERVRLKVRRKEAVSSFQALLVETIKDPQASWTESKPKLEKDSQGRAT 719

Query: 631  NPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVLNSWSTAKR 452
            NP+LDPSD EKLFREHVKML+ERC  DF+                E+GKTVL+SWST KR
Sbjct: 720  NPDLDPSDTEKLFREHVKMLHERCTQDFKALLAEVINAETAAQKSENGKTVLDSWSTVKR 779

Query: 451  VLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSSADGGRLPSGS 272
            +LKPDPRYNKMPRKERE  WRRY +D+LRK++++LD+ E+ H DSKSR+SAD GR  SGS
Sbjct: 780  LLKPDPRYNKMPRKEREILWRRYTQDILRKQQTTLDQKEEKHTDSKSRNSADSGRYLSGS 839

Query: 271  RRNHERR 251
            RR H+ R
Sbjct: 840  RRTHDGR 846


>ref|XP_012089638.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Jatropha
            curcas] gi|802761021|ref|XP_012089639.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X2 [Jatropha
            curcas] gi|802761024|ref|XP_012089640.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X2 [Jatropha
            curcas]
          Length = 817

 Score =  839 bits (2168), Expect = 0.0
 Identities = 459/818 (56%), Positives = 533/818 (65%), Gaps = 2/818 (0%)
 Frame = -3

Query: 2698 ATTSWMPTIPSFPMXXXXXXXXXXXXXXXXXXXXTKDTLSAVGDFNTSAVLRPSVPMASA 2519
            ++TS MP +PS  +                         S   D  +SAV RP + + + 
Sbjct: 2    SSTSTMPVVPSLLVPPRLAGTTRAPESSALVSCAPMTLPSVPVDPASSAVQRPMM-LTNT 60

Query: 2518 PSNSGSTVQHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAH 2339
            P+ S   VQ   +PTYPS                  G +P  PFLPYPAV+P PFPLPAH
Sbjct: 61   PA-SNPVVQQQAYPTYPSLPAMAAPPQGLWFQPPQMGGLPRPPFLPYPAVFPGPFPLPAH 119

Query: 2338 SMPHPSVSSADAQPPGVSYMRTSTANSKHSAISGHQLVGSSGMRTEALPSGIDKKEHVHD 2159
            S+P  SVSS D+QPPGV+ + T+ AN   SA SG QL+G+ GM+ E  P GID K+H+H 
Sbjct: 120  SIPRASVSSPDSQPPGVTPVGTAGANPPSSAASGLQLIGTPGMQKELPPPGIDNKDHIHV 179

Query: 2158 VSTKGGDSVNEQLDAWTAHKTDTGIVYYYNAVTGESTYDKPAGFKGELDKVPVQPTPVSM 1979
               K   ++NE LD+WTAHKTDTGIVYYYNA+T  STY+KP GFKGE +KVP+QPTPVSM
Sbjct: 180  FDNKDNVAINEPLDSWTAHKTDTGIVYYYNAITRVSTYEKPLGFKGEPEKVPMQPTPVSM 239

Query: 1978 ESLAGTDWALVTTNDGKKYYYNNKLKVSSWQIPSELTELRKKEDDDALKEHIMSTLNTNI 1799
            E+LAGTDWAL+TTNDGKKYYYNNK K+SSWQIPSE+TEL KK++ +  KE  +S L +N+
Sbjct: 240  ENLAGTDWALITTNDGKKYYYNNKTKLSSWQIPSEVTELTKKQEAEVSKELEVSLLRSNV 299

Query: 1798 VIEKGS-TTSLSAPAINTGGRDATALRTSSVPGSSSALDLIKKKLQDSGI-XXXXXXXXX 1625
              EKGS   SLSAPAINTGGRDATALRTSS PG SSALDLIKKKLQ+SG           
Sbjct: 300  STEKGSGPVSLSAPAINTGGRDATALRTSSAPGPSSALDLIKKKLQESGTPVNSSPALVS 359

Query: 1624 XXXXXSELNGSKVVEVTIKGLQNENIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKEECII 1445
                  E NGS+  E T KGL +E   DKLKD                   GPTKEECII
Sbjct: 360  LGMGTPESNGSRAAEATAKGLLSETSNDKLKDTNGGGNASDSSSDSEDEDSGPTKEECII 419

Query: 1444 KFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSQSARRALFERFVKTXXXXXXXXXXX 1265
            +FKEMLKERG+APFSKWEKELPKIVFDPRFKAIPS SARR+LFE +VKT           
Sbjct: 420  QFKEMLKERGIAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEQRKEKRA 479

Query: 1264 XXXXXXEGFKQLLEEASEDIDQNTDYQTFRKKWGSDRRFEALDRKDRELLLNERVLPLKX 1085
                  EGFKQLL EASEDIDQ TDYQTFRKKW +D RFEALDRKDRE LLNERV+PLK 
Sbjct: 480  SQKAAIEGFKQLLVEASEDIDQYTDYQTFRKKWENDPRFEALDRKDREHLLNERVIPLKK 539

Query: 1084 XXXXXXXXXXXXXASSFKSMLREKGDVTLNSRWSKVKDILRNDPRYKSVKHEDREVMFNE 905
                         A+SFKSML++KGD+T+NSRWSKVK+ LRNDPRYKSVKHEDRE +FNE
Sbjct: 540  AAQEKVQAERAAAAASFKSMLQDKGDITINSRWSKVKESLRNDPRYKSVKHEDREFLFNE 599

Query: 904  YVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSF 725
            Y+                                                      V+SF
Sbjct: 600  YLSELKAVEEEAEREAKVKKEEQEKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSF 659

Query: 724  QALLVETIKGPQASWTESRPKLEKDPQGRATNPELDPSDAEKLFREHVKMLYERCAYDFR 545
            QALLVETIK PQASWTES+PKLEKD QGRATNP+LDPSD EKLFREHVKML+ERC  DF+
Sbjct: 660  QALLVETIKDPQASWTESKPKLEKDSQGRATNPDLDPSDTEKLFREHVKMLHERCTQDFK 719

Query: 544  XXXXXXXXXXXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNKMPRKEREPQWRRYAEDMLR 365
                            E+GKTVL+SWST KR+LKPDPRYNKMPRKERE  WRRY +D+LR
Sbjct: 720  ALLAEVINAETAAQKSENGKTVLDSWSTVKRLLKPDPRYNKMPRKEREILWRRYTQDILR 779

Query: 364  KRKSSLDKNEDSHKDSKSRSSADGGRLPSGSRRNHERR 251
            K++++LD+ E+ H DSKSR+SAD GR  SGSRR H+ R
Sbjct: 780  KQQTTLDQKEEKHTDSKSRNSADSGRYLSGSRRTHDGR 817


>ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma cacao]
            gi|508709257|gb|EOY01154.1| Pre-mRNA-processing protein
            40C [Theobroma cacao]
          Length = 816

 Score =  832 bits (2148), Expect = 0.0
 Identities = 463/817 (56%), Positives = 529/817 (64%), Gaps = 3/817 (0%)
 Frame = -3

Query: 2692 TSWMPTIPSFPMXXXXXXXXXXXXXXXXXXXXTKDTLSAVGDFNTSAVLRPSVPMASAPS 2513
            TSWMPT  SFPM                       T SA  D  +SAV RPS     AP 
Sbjct: 13   TSWMPTTQSFPMSTESSGTSGTAGHPGLVPSVQMITASAAVDSPSSAVPRPS-----APV 67

Query: 2512 NSGSTVQHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSM 2333
            +S   VQ  I+PTY                    G  P  PF+PYP +YP PFP  +  M
Sbjct: 68   SSNQAVQQQIYPTYTPLPSMASSPQGFWMQHPPMGGFPRPPFVPYPTIYPGPFPSASSGM 127

Query: 2332 PHPSVSSADAQPPGVSYMRTSTANSKHSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVS 2153
            PHP+ SS D+QPPGVS + TS   +   AI  +Q   +SG++T   P GID +    +V 
Sbjct: 128  PHPAPSS-DSQPPGVSPLATSPF-APSIAIPANQSSVASGIQTGFPPQGIDNR----NVG 181

Query: 2152 TKGGDSVNEQLDAWTAHKTDTGIVYYYNAVTGESTYDKPAGFKGELDKVPVQPTPVSMES 1973
            T+   +VNEQ D WTAHKTDTGIVYYYNA+TGESTY+KPAGFKGE DKVPVQPTPVS+E 
Sbjct: 182  TRVEAAVNEQSDIWTAHKTDTGIVYYYNALTGESTYEKPAGFKGEPDKVPVQPTPVSVEQ 241

Query: 1972 LAGTDWALVTTNDGKKYYYNNKLKVSSWQIPSELTELRKKEDDDALKEHIMSTLNTNIVI 1793
            LAGT+WALVTT+DGKKYYYN+K K+SSWQIPSE+ ELRKK+D+D  KEH +   N ++V 
Sbjct: 242  LAGTEWALVTTSDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDVSKEHAVPVPNIDVVA 301

Query: 1792 EKGST-TSLSAPAINTGGRDATALRTSSVPGSSSALDLIKKKLQDSGI--XXXXXXXXXX 1622
            EKGST  SLSAPA++TGGRDA  LRTS VPGSSSALDLIKKKLQDSG+            
Sbjct: 302  EKGSTPISLSAPAVSTGGRDAMPLRTSVVPGSSSALDLIKKKLQDSGVPSSSSSSVPVMP 361

Query: 1621 XXXXSELNGSKVVEVTIKGLQNENIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKEECIIK 1442
                 ELNGS+ V+V  KGLQ+EN KDKLKD                   GP+KEECI++
Sbjct: 362  VTAAQELNGSRAVDV--KGLQSENSKDKLKDANGDGNISDSSSDSEDTDSGPSKEECIMQ 419

Query: 1441 FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSQSARRALFERFVKTXXXXXXXXXXXX 1262
            FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPS SARR LFE +VKT            
Sbjct: 420  FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRTLFEHYVKTRAEEERREKRAA 479

Query: 1261 XXXXXEGFKQLLEEASEDIDQNTDYQTFRKKWGSDRRFEALDRKDRELLLNERVLPLKXX 1082
                 EGFKQLL+EASEDID NT+YQTF++KWGSD RFEALDRKDRELLL ERVLPLK  
Sbjct: 480  LKAAIEGFKQLLDEASEDIDHNTNYQTFKRKWGSDLRFEALDRKDRELLLTERVLPLKRA 539

Query: 1081 XXXXXXXXXXXXASSFKSMLREKGDVTLNSRWSKVKDILRNDPRYKSVKHEDREVMFNEY 902
                        ASS KSML+EKGD+T+NSRWS+VKD +R+DPRYK VKHEDREV+FNEY
Sbjct: 540  AEEKAQAIRAAAASSLKSMLKEKGDITVNSRWSRVKDSIRDDPRYKCVKHEDREVLFNEY 599

Query: 901  VRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQ 722
            +                                                      V SFQ
Sbjct: 600  ISELKAVEEKAERKERVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQ 659

Query: 721  ALLVETIKGPQASWTESRPKLEKDPQGRATNPELDPSDAEKLFREHVKMLYERCAYDFRX 542
            ALLVETIK PQASWTES+PKLEKDPQGRA NP+LDPSD EKLFREH+KML+ERC +DFR 
Sbjct: 660  ALLVETIKDPQASWTESKPKLEKDPQGRAANPDLDPSDTEKLFREHIKMLFERCTHDFRA 719

Query: 541  XXXXXXXXXXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNKMPRKEREPQWRRYAEDMLRK 362
                           E GKTV NSWSTAKR+LKPDPRY+KMPRKERE  WRRYAEDMLRK
Sbjct: 720  LLAEVITQDAAAQETEGGKTVFNSWSTAKRLLKPDPRYSKMPRKEREALWRRYAEDMLRK 779

Query: 361  RKSSLDKNEDSHKDSKSRSSADGGRLPSGSRRNHERR 251
            +KS+LD+ E+   D+K RSS D GR  SGSR+ HERR
Sbjct: 780  QKSALDQEEEKRTDAKVRSSGDLGRFSSGSRKVHERR 816


>ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [Gossypium raimondii]
            gi|763747828|gb|KJB15267.1| hypothetical protein
            B456_002G167700 [Gossypium raimondii]
          Length = 887

 Score =  825 bits (2132), Expect = 0.0
 Identities = 467/870 (53%), Positives = 539/870 (61%), Gaps = 5/870 (0%)
 Frame = -3

Query: 2845 NRQFQPN---TTKPGTVSHAVFGSSTSTNSQPVPLXXXXXXXXXXXXAHKVGATTSWMPT 2675
            N Q QP+   T   GT + A    ST + S P+P+                   TS MPT
Sbjct: 30   NAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPVTSRMPT 89

Query: 2674 IPSFPMXXXXXXXXXXXXXXXXXXXXTKDTLSAVGDFNTSAVLRPSVPMASAPSNSGSTV 2495
             P FPM                       T SA  D  +SAV  P  P++  P+     V
Sbjct: 90   TPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAVPGPGAPVSLNPA-----V 144

Query: 2494 QHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVS 2315
            Q  ++P Y S                  G  P  PF+PYP VYP PFP  +  MP P+ S
Sbjct: 145  QQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSSGMPLPAPS 204

Query: 2314 SADAQPPGVSYMRTSTANSKHSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDS 2135
            S D+QPPGV  +  S      +A++   L   +G      P GID ++ VHDV+TK   +
Sbjct: 205  S-DSQPPGVRPLGMSPFAPSAAALANQSLAILTGFP----PQGIDNRKLVHDVTTKVESA 259

Query: 2134 VNEQLDAWTAHKTDTGIVYYYNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDW 1955
             NEQ D WTAHKTDTG+VYYYNA+TGESTY+KPAGFKGE D+V VQPTPVS+E LAGTDW
Sbjct: 260  GNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQLAGTDW 319

Query: 1954 ALVTTNDGKKYYYNNKLKVSSWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKGST- 1778
            ALVTTNDGKKYYYN+K K+SSWQIP+E+TELRKK+D +  KE+ +S  N ++V EKGST 
Sbjct: 320  ALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAEKGSTP 379

Query: 1777 TSLSAPAINTGGRDATALRTSSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSELN 1598
             SLSAPA+NTGGRDA  LRTS VPGSSSALDLIKKKLQD G+               ELN
Sbjct: 380  ISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVPSSSPVPVVPVTATHELN 439

Query: 1597 GSKVVEVTIKGLQNENIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKER 1418
            GS+ V+V  KGLQ+E+ KDKLKD                   GP+KEECI++FKEMLKER
Sbjct: 440  GSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKER 497

Query: 1417 GVAPFSKWEKELPKIVFDPRFKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGF 1238
            GVAPFSKWEKELPKIVFDPRFKAIPS SARR+LFE +VKT                 EGF
Sbjct: 498  GVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGF 557

Query: 1237 KQLLEEASEDIDQNTDYQTFRKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXX 1058
            KQLL+EASEDID +T+YQTF++KWGSD RFEALDRKDRELLLNERVL LK          
Sbjct: 558  KQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARAI 617

Query: 1057 XXXXASSFKSMLREKGDVTLNSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXX 878
                ASSFKSML+EKGD+ +NSRWS+VKD LR+DPRYK VKHEDREV+FNEY+       
Sbjct: 618  RAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAIE 677

Query: 877  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIK 698
                                                           V SFQALLVETIK
Sbjct: 678  EKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIK 737

Query: 697  GPQASWTESRPKLEKDPQGRATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXX 518
             PQASWTES+PKLEKDPQGRA NP+LD SD EKLFREH+KML+ERC  DFR         
Sbjct: 738  DPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAEVITQ 797

Query: 517  XXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKN 338
                   E GKT LNSWSTAKR+LKPDPRYNKMPRKERE  WRRYAEDMLRK+KS+LD+ 
Sbjct: 798  DATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKSALDQE 857

Query: 337  EDSHKDSKSRSS-ADGGRLPSGSRRNHERR 251
            E+ H D K RSS  D GR  SG+RR HERR
Sbjct: 858  EEKHTDVKGRSSGGDFGRYSSGTRRTHERR 887


>gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium raimondii]
          Length = 888

 Score =  821 bits (2121), Expect = 0.0
 Identities = 468/871 (53%), Positives = 539/871 (61%), Gaps = 6/871 (0%)
 Frame = -3

Query: 2845 NRQFQPN---TTKPGTVSHAVFGSSTSTNSQPVPLXXXXXXXXXXXXAHKVGATTSWMPT 2675
            N Q QP+   T   GT + A    ST + S P+P+                   TS MPT
Sbjct: 30   NAQIQPSKSDTLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPVTSRMPT 89

Query: 2674 IPSFPMXXXXXXXXXXXXXXXXXXXXTKDTLSAVGDFNTSAVLRPSVPMASAPSNSGSTV 2495
             P FPM                       T SA  D  +SAV  P  P++  P+     V
Sbjct: 90   TPPFPMSSGSSGTSGTPGHPGSIPSIQMITASAAVDSPSSAVPGPGAPVSLNPA-----V 144

Query: 2494 QHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVS 2315
            Q  ++P Y S                  G  P  PF+PYP VYP PFP  +  MP P+ S
Sbjct: 145  QQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSSGMPLPAPS 204

Query: 2314 SADAQPPGVSYMRTSTANSKHSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDS 2135
            S D+QPPGV  +  S      +A++   L   +G      P GID ++ VHDV+TK   +
Sbjct: 205  S-DSQPPGVRPLGMSPFAPSAAALANQSLAILTGFP----PQGIDNRKLVHDVTTKVESA 259

Query: 2134 VNEQLDAWTAHKTDTGIVYYYNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDW 1955
             NEQ D WTAHKTDTG+VYYYNA+TGESTY+KPAGFKGE D+V VQPTPVS+E LAGTDW
Sbjct: 260  GNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQLAGTDW 319

Query: 1954 ALVTTNDGKKYYYNNKLKV-SSWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKGST 1778
            ALVTTNDGKKYYYN+K KV SSWQIP+E+TELRKK+D +  KE+ +S  N ++V EKGST
Sbjct: 320  ALVTTNDGKKYYYNSKTKVISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAEKGST 379

Query: 1777 T-SLSAPAINTGGRDATALRTSSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSEL 1601
              SLSAPA+NTGGRDA  LRTS VPGSSSALDLIKKKLQD G+               EL
Sbjct: 380  PISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVPSSSPVPVVPVTATHEL 439

Query: 1600 NGSKVVEVTIKGLQNENIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKE 1421
            NGS+ V+V  KGLQ+E+ KDKLKD                   GP+KEECI++FKEMLKE
Sbjct: 440  NGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKE 497

Query: 1420 RGVAPFSKWEKELPKIVFDPRFKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEG 1241
            RGVAPFSKWEKELPKIVFDPRFKAIPS SARR+LFE +VKT                 EG
Sbjct: 498  RGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEG 557

Query: 1240 FKQLLEEASEDIDQNTDYQTFRKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXX 1061
            FKQLL+EASEDID +T+YQTF++KWGSD RFEALDRKDRELLLNERVL LK         
Sbjct: 558  FKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARA 617

Query: 1060 XXXXXASSFKSMLREKGDVTLNSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXX 881
                 ASSFKSML+EKGD+ +NSRWS+VKD LR+DPRYK VKHEDREV+FNEY+      
Sbjct: 618  IRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAI 677

Query: 880  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETI 701
                                                            V SFQALLVETI
Sbjct: 678  EEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETI 737

Query: 700  KGPQASWTESRPKLEKDPQGRATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXX 521
            K PQASWTES+PKLEKDPQGRA NP+LD SD EKLFREH+KML+ERC  DFR        
Sbjct: 738  KDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAEVIT 797

Query: 520  XXXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDK 341
                    E GKT LNSWSTAKR+LKPDPRYNKMPRKERE  WRRYAEDMLRK+KS+LD+
Sbjct: 798  QDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKSALDQ 857

Query: 340  NEDSHKDSKSRSS-ADGGRLPSGSRRNHERR 251
             E+ H D K RSS  D GR  SG+RR HERR
Sbjct: 858  EEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 888


>ref|XP_008353148.1| PREDICTED: pre-mRNA-processing protein 40C-like [Malus domestica]
          Length = 981

 Score =  820 bits (2117), Expect = 0.0
 Identities = 492/1014 (48%), Positives = 589/1014 (58%), Gaps = 35/1014 (3%)
 Frame = -3

Query: 3187 MTSPAWLPPEAQQLTSNAPISAKPAGGPLLASSTPIAPVSNVS-------GIAPSDSINE 3029
            M SPAWLP E +       +S  PAGG   ++ T  +P S V        G + +DSI E
Sbjct: 1    MASPAWLPQEVKP-----SVSVSPAGGA--STQTAASPASIVGPTTSSSLGGSVNDSIQE 53

Query: 3028 HSQEKSVTAPGGVVPHPSFAFRN------SGGT--QHSTSFVINSNPSVAPDV------- 2894
              Q     AP   VP PSF++        S GT  Q S S  I SNP  +P V       
Sbjct: 54   PLQNTFGNAPSFAVPGPSFSYNVPPNANISFGTSQQSSPSSAIKSNPPASPVVQAPVHGL 113

Query: 2893 ----SSLSYSVSQTVAGYSPNRQFQPNTTKPGTVSHAVFG---SSTSTNSQPVPLXXXXX 2735
                S  SY++ ++   +  N+QFQ     P  V+        SSTS++S  +P      
Sbjct: 114  SSSASPFSYNIPKSGYSFPSNQQFQSGMNIPPAVAQETGNASLSSTSSHSGSLPAPTTSN 173

Query: 2734 XXXXXXXAHKVGATTSWMPTIPSFPMXXXXXXXXXXXXXXXXXXXXTKDTLSAV--GDFN 2561
                       G  T W+ T PSF M                           V     +
Sbjct: 174  STMNISSTPNAGPKTLWVSTAPSFNMTPGMPGTPRTPGPPGIAHSVQISFNPTVPSAPID 233

Query: 2560 TSAVLRPS---VPMASAPSNSGSTVQHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMPWLP 2390
            +S   RPS   VP+AS      S VQ  +   YPS                    +P  P
Sbjct: 234  SSVANRPSMQAVPVAS------SAVQPHVSAPYPSLSAMGAPWLSSPQIGG----LPRPP 283

Query: 2389 FLPYPAVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRTSTANSKHSAISGHQLVGSSGM 2210
            FLPYPA +P PFPLPAH MP  SV   D+QPPGV+ +  + AN+  S  SGHQL GSS M
Sbjct: 284  FLPYPAAFPGPFPLPAHVMPLASVPLPDSQPPGVTPVGNTAANAVSSVGSGHQLAGSSVM 343

Query: 2209 RTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTDTGIVYYYNAVTGESTYDKPAG 2030
            + E    G+  +            +VNEQL AWTAHKT+TG+VYYYNA+TGESTYDKP G
Sbjct: 344  QKELPHPGVGPENRA---------AVNEQLVAWTAHKTETGVVYYYNALTGESTYDKPPG 394

Query: 2029 FKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYNNKLKVSSWQIPSELTELRKKE 1850
            FK E DKV +QPTPVS  +LAGTDW LVTT+DGKK+Y+N+K KVSSWQIP+E+ EL+K++
Sbjct: 395  FKEEPDKVSMQPTPVSTVNLAGTDWVLVTTSDGKKFYHNSKTKVSSWQIPNEVIELKKQQ 454

Query: 1849 DDDALKEHIMSTLNTNIVIEKGST-TSLSAPAINTGGRDATALRTSSVPGSSSALDLIKK 1673
            D D  KEH +S  N N++IEKGS   S+SAPAINTGGR+A   + S+V G+SSALDLIK+
Sbjct: 455  DSDVPKEHTLSVPNNNLMIEKGSAPVSMSAPAINTGGREAMPFKPSAVLGTSSALDLIKR 514

Query: 1672 KLQDSGIXXXXXXXXXXXXXXSELNGSKVVEVTIKGLQNENIKDKLKDIXXXXXXXXXXX 1493
            KLQD                 SE NG++ VE T KG Q+EN KDKLK+            
Sbjct: 515  KLQD-------PVTSSPIPAPSESNGARGVESTPKGQQSENSKDKLKETNGDGNLSDSSS 567

Query: 1492 XXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSQSARRALFE 1313
                   GPTKEECII+FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPS  ARR+LFE
Sbjct: 568  DSEDADSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHEARRSLFE 627

Query: 1312 RFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQNTDYQTFRKKWGSDRRFEALDR 1133
             +VKT                 EGFKQLL+EASEDID+NTDYQ+FR+KWG+D RFEALDR
Sbjct: 628  HYVKTRAEEERKEKRAAQKAAIEGFKQLLDEASEDIDRNTDYQSFRRKWGNDPRFEALDR 687

Query: 1132 KDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLREKGDVTLNSRWSKVKDILRNDP 953
            KDRE LLNERVLPLK              ++ FKSML+EKGD+T++SRWS+VKD LRNDP
Sbjct: 688  KDREHLLNERVLPLKRAAEEKVQAVRAAASAGFKSMLKEKGDITVSSRWSRVKDNLRNDP 747

Query: 952  RYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 773
            RYK+V+HEDRE +FNEY+                                          
Sbjct: 748  RYKNVRHEDREALFNEYISGLKAVEEEAEREAKAKRDEQEKLRERERELRKRKEREEQET 807

Query: 772  XXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLEKDPQGRATNPELDPSDAEKLF 593
                        V +FQALLVETIK PQASWT SRPKLEKDPQ RA NP+LDPSD EKLF
Sbjct: 808  ERVRLKVRRKEAVATFQALLVETIKDPQASWTGSRPKLEKDPQRRAANPDLDPSDMEKLF 867

Query: 592  REHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNKMPR 413
            REHVKML ERCA++FR                EDGKTVLNSWSTAKR+LK DPRY+K PR
Sbjct: 868  REHVKMLNERCAHEFRTLLAEVLTAEAASQETEDGKTVLNSWSTAKRILKVDPRYDKTPR 927

Query: 412  KEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSSADGGRLPSGSRRNHERR 251
            KERE  WRRY+E+MLRK+KS++D+ ED   D+K+RSSAD GR P GSR  H+RR
Sbjct: 928  KEREVLWRRYSEEMLRKQKSAVDQKEDRKTDAKTRSSADAGRNPYGSRGTHDRR 981


>ref|XP_009351698.1| PREDICTED: pre-mRNA-processing protein 40C [Pyrus x bretschneideri]
          Length = 981

 Score =  819 bits (2115), Expect = 0.0
 Identities = 495/1029 (48%), Positives = 591/1029 (57%), Gaps = 50/1029 (4%)
 Frame = -3

Query: 3187 MTSPAWLPPEAQQLTSNAPISAKPAGGPLLASSTPIAPVSNV-------SGIAPSDSINE 3029
            M SPAWLP E +   S       PAGG   ++ T ++P S V       SG + +DSI E
Sbjct: 1    MASPAWLPQEVKPSAS-----VSPAGGA--STQTAVSPASIVGPTTSSSSGGSVNDSIQE 53

Query: 3028 HSQEKSVTAPGGVVPHPSFAFRN------SGGT--QHSTSFVINSNPSVAPDV------- 2894
              Q K   AP   VP PSF++        S GT  Q S S  I SNP  +P V       
Sbjct: 54   PLQNKFGNAPSFAVPAPSFSYNVPPNANISFGTSQQSSPSSAIKSNPPASPMVQAPVHGL 113

Query: 2893 ----SSLSYSVSQTVAGYSPNRQFQPNTTKPGTVSHAVFG---SSTSTNSQPVPLXXXXX 2735
                S  SY++ ++   +  N+QFQ     P  V+        SSTST+S  +P      
Sbjct: 114  SSSASPFSYNIPKSGYSFPSNQQFQSGMNIPPAVAQETGNALLSSTSTHSGSLPAPTSSN 173

Query: 2734 XXXXXXXAHKVGATTSWMPTIPSFPMXXXXXXXXXXXXXXXXXXXXTKDTLSAVGDFNTS 2555
                       G  T W+ T PSF M                         S    FN +
Sbjct: 174  STMNISSTPNAGPKTLWVSTAPSFNMTPGMPGTPRTPGPPGIAH-------SVQISFNPT 226

Query: 2554 AVLRPSVPMASAPSN---------SGSTVQHPIHPTYPSXXXXXXXXXXXXXXXXXXGVM 2402
            A   PS P+ S+ +N         + S VQ  +   YPS                     
Sbjct: 227  A---PSAPIDSSVANRPSMQAVPVASSAVQPHVGAPYPSLSAMGA--------------- 268

Query: 2401 PWL-----------PFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRTSTANSK 2255
            PWL           PFLPYPA +P PFPLPAH MP  SV   D+QPPGV+ +  + ANS 
Sbjct: 269  PWLSSPQIGGLARPPFLPYPAAFPGPFPLPAHVMPLASVPLPDSQPPGVTPVGNTAANSV 328

Query: 2254 HSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTDTGIVYY 2075
             S  SGHQ  GSS M+ E    G+  +            +VNEQL AWTAHKT+TG+VYY
Sbjct: 329  SSVGSGHQSAGSSVMQKELPHPGVGPENRA---------AVNEQLVAWTAHKTETGVVYY 379

Query: 2074 YNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYNNKLKVS 1895
            YNA+TGESTYDKP GFK E DKV +QPTPVS  +LAGTDW LVTT+DGKK+Y+N+K KVS
Sbjct: 380  YNALTGESTYDKPPGFKEEPDKVSMQPTPVSTVNLAGTDWVLVTTSDGKKFYHNSKTKVS 439

Query: 1894 SWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKG-STTSLSAPAINTGGRDATALRT 1718
            SWQIP+E+ EL++++D D  KEH  S  N N++IEKG +  S+SAPAINTGGR+A   + 
Sbjct: 440  SWQIPNEVIELKEQQDSDVPKEHTPSVPNNNLMIEKGPAPVSMSAPAINTGGREAMPFKP 499

Query: 1717 SSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSELNGSKVVEVTIKGLQNENIKDK 1538
            S+V G+SSALDLIK+KLQD                 SE NG++ VE T KG Q+EN KDK
Sbjct: 500  SAVQGTSSALDLIKRKLQD-------PVTSSPIPAPSESNGARGVESTPKGQQSENSKDK 552

Query: 1537 LKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPR 1358
            LK+                   GP+KEECII+FKEMLKERGVAPFSKWEKELPKIVFDPR
Sbjct: 553  LKETNGDGNLSDSSSDSEDADSGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPR 612

Query: 1357 FKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQNTDYQTF 1178
            FKAIPS  ARR+LFE +VKT                 EGFKQLL+EASEDID+NTDYQ+F
Sbjct: 613  FKAIPSHEARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDEASEDIDRNTDYQSF 672

Query: 1177 RKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLREKGDVTL 998
            RKKWG+D RFEALDRKDRE LLNERVLPLK              ++ FKSML+EKGDVT+
Sbjct: 673  RKKWGNDSRFEALDRKDREHLLNERVLPLKRAAEEKAQAVRAAASAGFKSMLKEKGDVTV 732

Query: 997  NSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXXXXXXXX 818
            +SRWS+VKD LRNDPRYK+V+HEDREV+FNEY+                           
Sbjct: 733  SSRWSRVKDSLRNDPRYKNVRHEDREVLFNEYILGLKAVEEEAEREAKAKRDEQEKLRER 792

Query: 817  XXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLEKDPQGR 638
                                         +FQALLVETIK PQASWT SRPKLEKDPQ R
Sbjct: 793  ERELRKRKEREEQETERVRLKVRRKEAFATFQALLVETIKDPQASWTGSRPKLEKDPQRR 852

Query: 637  ATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVLNSWSTA 458
            A NP+LDPSD EKLFREHVKML ERCA++FR                EDGKTVLNSWSTA
Sbjct: 853  AANPDLDPSDMEKLFREHVKMLNERCAHEFRTLLAEVLTAEAASQETEDGKTVLNSWSTA 912

Query: 457  KRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSSADGGRLPS 278
            KR+LK D RY+K PRKERE  WRRY+E+MLRK+KS++D+ ED   D+K+RSSAD GR P 
Sbjct: 913  KRILKVDTRYDKTPRKEREVLWRRYSEEMLRKQKSAVDQKEDRRTDAKTRSSADAGRNPY 972

Query: 277  GSRRNHERR 251
            GSR  H+RR
Sbjct: 973  GSRGTHDRR 981


>ref|XP_008221026.1| PREDICTED: pre-mRNA-processing protein 40C [Prunus mume]
          Length = 858

 Score =  815 bits (2104), Expect = 0.0
 Identities = 454/848 (53%), Positives = 534/848 (62%), Gaps = 4/848 (0%)
 Frame = -3

Query: 2785 SSTSTNSQPVPLXXXXXXXXXXXXAHKVGATTSWMPTIPSFPMXXXXXXXXXXXXXXXXX 2606
            SSTS++S  +P             A  +G TTSW+PT PSF +                 
Sbjct: 17   SSTSSHSGSLPAPTSSSSTMNLLSAPNMGTTTSWVPTAPSFNLTSGMPGTPGTPGPPGIA 76

Query: 2605 XXXT---KDTLSAVGDFNTSAVLRPSVPMASAPSNSGSTVQHPIHPTYPSXXXXXXXXXX 2435
                     T  +    ++S  LRPS+ +A   S   S VQ  +   YPS          
Sbjct: 77   HPVQISFNPTAPSAPIDSSSVALRPSMQIAPVAS---SAVQPQVGAPYPSLSSMGAPPQG 133

Query: 2434 XXXXXXXXGVMPWLPFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRTSTANSK 2255
                    G  P  PFLPYPA +P PFP PAH MP PSV   D+QPPGV+ +  + A S 
Sbjct: 134  VWLQSPQIGGFPRPPFLPYPAAFPVPFPSPAHVMPLPSVPLPDSQPPGVTPVGNTAAISS 193

Query: 2254 HSAISGHQLVGSSGMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTDTGIVYY 2075
             SA SGHQL G SG++ E    GID ++  HD   +   SVNEQLDAWTAHKT+TG+VYY
Sbjct: 194  PSAASGHQLAGFSGIQIELPLPGIDNRKQSHDAGNENRASVNEQLDAWTAHKTETGVVYY 253

Query: 2074 YNAVTGESTYDKPAGFKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYNNKLKVS 1895
            YNA+TGESTYDKP GFK E DKV +QPTPVS  +L+GTDW LVTT+DGKK+Y+N+K KVS
Sbjct: 254  YNALTGESTYDKPPGFKEEPDKVSMQPTPVSTVNLSGTDWVLVTTSDGKKFYHNSKTKVS 313

Query: 1894 SWQIPSELTELRKKEDDDALKEHIMSTLNTNIVIEKGST-TSLSAPAINTGGRDATALRT 1718
            SWQIP+E+ ELRKK+D D  KEH +S  N N++ EKGS   SL+APAIN GGR+A A + 
Sbjct: 314  SWQIPNEVIELRKKQDADVPKEHPVSIPNNNVMTEKGSAPISLTAPAINMGGREAMAFKP 373

Query: 1717 SSVPGSSSALDLIKKKLQDSGIXXXXXXXXXXXXXXSELNGSKVVEVTIKGLQNENIKDK 1538
            S+V G+SSALDLIKKKLQDSG               SE NGS+ VE T KG Q++N KDK
Sbjct: 374  SAVQGTSSALDLIKKKLQDSG----APVTSSPVPAPSESNGSRGVESTPKGQQSDNSKDK 429

Query: 1537 LKDIXXXXXXXXXXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPR 1358
            LKDI                  GPTKEECI +FKEMLKERGVAPFSKW+KELPKIVFDPR
Sbjct: 430  LKDINGDGNLSDSSSDSEDADSGPTKEECITQFKEMLKERGVAPFSKWDKELPKIVFDPR 489

Query: 1357 FKAIPSQSARRALFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQNTDYQTF 1178
            FKAIPS SARR+LFE +VKT                 EGFKQLL+EASEDID NTDYQ+F
Sbjct: 490  FKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDEASEDIDHNTDYQSF 549

Query: 1177 RKKWGSDRRFEALDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLREKGDVTL 998
            RKKW +D RFEALDRKDRE LLNERVLPLK              ++SFKSML+EKGD+T+
Sbjct: 550  RKKWANDPRFEALDRKDREHLLNERVLPLKRAAEEKAQAARAAASTSFKSMLQEKGDITV 609

Query: 997  NSRWSKVKDILRNDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXXXXXXXX 818
            +SRWS+VKD LRNDPRYKSV+HEDRE++FN+Y+                           
Sbjct: 610  SSRWSRVKDSLRNDPRYKSVRHEDREILFNQYISDLKAVEEEAEREAKAKRDEQEKLRER 669

Query: 817  XXXXXXXXXXXXXXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLEKDPQGR 638
                                       V +FQALLVETIK PQASWT S+PKLEKDPQ R
Sbjct: 670  ERELRKRKEREEQETERVRLKVRRKEAVATFQALLVETIKDPQASWTGSKPKLEKDPQRR 729

Query: 637  ATNPELDPSDAEKLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVLNSWSTA 458
            A NP+L+PSD EKLFREH+K L ERCA++FR                EDGKTVLNSWSTA
Sbjct: 730  AANPDLEPSDMEKLFREHIKRLNERCAHEFRALLAEVLTAEAASQETEDGKTVLNSWSTA 789

Query: 457  KRVLKPDPRYNKMPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSSADGGRLPS 278
            KR+LKPDPRYNKM RKERE  WRRY+E+MLRK+KS+LD  ED   D+KSRSS DGGR+P 
Sbjct: 790  KRLLKPDPRYNKMARKEREVLWRRYSEEMLRKQKSALDHKEDRKTDAKSRSSVDGGRVPF 849

Query: 277  GSRRNHER 254
            GSR  H+R
Sbjct: 850  GSRGTHDR 857


>ref|XP_002515795.1| Pre-mRNA-processing protein PRP40, putative [Ricinus communis]
            gi|223545064|gb|EEF46576.1| Pre-mRNA-processing protein
            PRP40, putative [Ricinus communis]
          Length = 886

 Score =  805 bits (2080), Expect = 0.0
 Identities = 461/898 (51%), Positives = 551/898 (61%), Gaps = 7/898 (0%)
 Frame = -3

Query: 2923 NSNPSV-APDVS--SLSYSVSQTVAGYSPNRQFQPNTTKPGTVSHAVFGSSTSTNSQPVP 2753
            NSNP V  P  +  S SY++SQ+   +S N+QF   +    +V  A     T+ +S P+ 
Sbjct: 12   NSNPPVPVPGFTPPSFSYNISQSALHFSANQQFHSTSDASASVPQA-----TALSSAPIV 66

Query: 2752 LXXXXXXXXXXXXAHKVGATTSWMPTIPSFPMXXXXXXXXXXXXXXXXXXXXTKDTLSAV 2573
                              ++TS   T  S P                             
Sbjct: 67   SH---------------SSSTSTKTTSLSSPSFLVPPGLAGTPGPAGSVSCGPMILPPVT 111

Query: 2572 GDFNTSAVLRPSVPMASAPSNSGSTVQHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMPWL 2393
             D  TS+V RP +P  +  SN    VQ   + TYPS                  G MP  
Sbjct: 112  VDSATSSVQRPVMPTVTHASNP--VVQQQSYHTYPSLPAMAASAQGLWFHPPQMGGMPRT 169

Query: 2392 PFLPYP-AVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRTSTANSKHSAISGHQLVGSS 2216
            PFLPYP AV+P  +PLPAH +  PS+SS D QP G   +    AN   SA SGHQL+G+ 
Sbjct: 170  PFLPYPPAVFPGSYPLPAHGISRPSISSPDFQPSGAPPVGIPGANPPSSAASGHQLMGTP 229

Query: 2215 GMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTDTGIVYYYNAVTGESTYDKP 2036
            GM+ E  P GID +  +HD  TK   + ++ LDAWTAHKTD G+VYYYNAVTG STY+KP
Sbjct: 230  GMQKEIPPPGIDNRSQIHDFGTKNNAATSDSLDAWTAHKTDAGVVYYYNAVTGVSTYEKP 289

Query: 2035 AGFKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYNNKLKVSSWQIPSELTELRK 1856
             GFK E +KVP+QPTPVSME+LAGTDWAL+TTNDGK YYYNNK K+SSWQIPSE+TEL+K
Sbjct: 290  PGFKSEPEKVPMQPTPVSMENLAGTDWALITTNDGKNYYYNNKTKLSSWQIPSEVTELKK 349

Query: 1855 KEDDDALKEHIMSTLNTNIVIEKGST-TSLSAPAINTGGRDATALRTSSVPGSSSALDLI 1679
            K++ + LKE  MS  +++++ EKGS   SLSAPAINTGGRDATALR S+  G+SSALDLI
Sbjct: 350  KQEAE-LKEQEMSVSSSSVLNEKGSVQISLSAPAINTGGRDATALRASNALGASSALDLI 408

Query: 1678 KKKLQDSGI-XXXXXXXXXXXXXXSELNGSKVVEVTIKGLQNENIKDKLKDIXXXXXXXX 1502
            KKKLQDSG                 E NGS+ +E T KGL +EN K+KLKD         
Sbjct: 409  KKKLQDSGTPVTSSPAPVSLGITTPESNGSRAMEATSKGLPSENSKEKLKDANGDANASD 468

Query: 1501 XXXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSQSARRA 1322
                      GPTKEECII+FK+MLKERG+APFSKWEK LPKIVFDPRF+AIPS SARR+
Sbjct: 469  SSSDSEEEDNGPTKEECIIQFKDMLKERGIAPFSKWEKVLPKIVFDPRFQAIPSHSARRS 528

Query: 1321 LFERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQNTDYQTFRKKWGSDRRFEA 1142
            LFE +VKT                 EGF+QLLEEASE+ID NTDYQ+FR+KWG+D RFEA
Sbjct: 529  LFEHYVKTRAEEERKEKRAAQKAAIEGFRQLLEEASEEIDHNTDYQSFRRKWGNDPRFEA 588

Query: 1141 LDRKDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLREKGDVTLNSRWSKVKDILR 962
            +DRKDRE LL+ERVLPLK              A+SFKSML++KGD+T+NSRWSKVK+ LR
Sbjct: 589  VDRKDREHLLHERVLPLKKAAQEKAQAERAAAAASFKSMLQDKGDLTVNSRWSKVKESLR 648

Query: 961  NDPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 782
            NDPRYKSVKHE+REV+FNEY+                                       
Sbjct: 649  NDPRYKSVKHEEREVLFNEYLSELKAAEEEAEWKAKVKREEQEKLKERERELRKRKEREE 708

Query: 781  XXXXXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLEKDPQGRATNPELDPSDAE 602
                           V SFQALLVETIK PQASWTES+ +LEKDPQGR TNP LDPSD E
Sbjct: 709  QEMERVREKVRRKEAVASFQALLVETIKDPQASWTESKTRLEKDPQGRGTNPNLDPSDTE 768

Query: 601  KLFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNK 422
            KLFREHVKML+ERC  +F+                EDGKTVL+SW+TAKRVLK DPRYNK
Sbjct: 769  KLFREHVKMLHERCTNEFKALLAEVINAEAASQKTEDGKTVLDSWTTAKRVLKLDPRYNK 828

Query: 421  MPRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSS-ADGGRLPSGSRRNHERR 251
            MPRKERE  WRR+AEDMLRK+K++LD+ ED H D + RSS  D GR  SGS+R H+RR
Sbjct: 829  MPRKEREVLWRRHAEDMLRKQKTTLDEKEDKHTDPRGRSSTTDSGRHLSGSKRTHDRR 886


>ref|XP_007221939.1| hypothetical protein PRUPE_ppa001490mg [Prunus persica]
            gi|462418875|gb|EMJ23138.1| hypothetical protein
            PRUPE_ppa001490mg [Prunus persica]
          Length = 814

 Score =  789 bits (2038), Expect = 0.0
 Identities = 444/822 (54%), Positives = 517/822 (62%), Gaps = 4/822 (0%)
 Frame = -3

Query: 2704 VGATTSWMPTIPSFPMXXXXXXXXXXXXXXXXXXXXT---KDTLSAVGDFNTSAVLRPSV 2534
            +G TTSW+PT PSF +                          T  +    ++S  LRPS+
Sbjct: 9    MGTTTSWVPTGPSFNLTSGMPGTPGTPGPPGIAHPVQISFNPTAPSAPIDSSSVALRPSM 68

Query: 2533 PMASAPSNSGSTVQHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMPWLPFLPYPAVYPSPF 2354
             +A   S   S VQ  +   Y S                  G  P  PFLPYPA +P PF
Sbjct: 69   QIAPVAS---SAVQPQVGAPYLSLSSMGAPPQGVWLQSPQIGGFPRPPFLPYPAAFPGPF 125

Query: 2353 PLPAHSMPHPSVSSADAQPPGVSYMRTSTANSKHSAISGHQLVGSSGMRTEALPSGIDKK 2174
            PLPAH MP PSV   D+QPPGV  +  + A S  SA SGHQL GSSG++ E    GI  +
Sbjct: 126  PLPAHVMPLPSVPLPDSQPPGVIPVGNTAAISSPSAASGHQLAGSSGIQIELPHPGIGNE 185

Query: 2173 EHVHDVSTKGGDSVNEQLDAWTAHKTDTGIVYYYNAVTGESTYDKPAGFKGELDKVPVQP 1994
                        SVNEQLDAWTAHKT+TG+VYYYNA+TGESTYDKP GFK E DKV +QP
Sbjct: 186  NRA---------SVNEQLDAWTAHKTETGVVYYYNALTGESTYDKPPGFKEEPDKVSMQP 236

Query: 1993 TPVSMESLAGTDWALVTTNDGKKYYYNNKLKVSSWQIPSELTELRKKEDDDALKEHIMST 1814
            TPVS  +L+GTDW LVTT+DGKK+Y+N K KVSSWQIP+E+ ELRKK+D D  KEH +S 
Sbjct: 237  TPVSTVNLSGTDWVLVTTSDGKKFYHNGKTKVSSWQIPNEVIELRKKQDADVPKEHPVSI 296

Query: 1813 LNTNIVIEKGST-TSLSAPAINTGGRDATALRTSSVPGSSSALDLIKKKLQDSGIXXXXX 1637
               N++ EKGS   SL+APAINTGGR+A A + S+V G+SSALDLIKKKLQDSG      
Sbjct: 297  PINNVMTEKGSAPISLTAPAINTGGREAMAFKPSAVQGTSSALDLIKKKLQDSG----AP 352

Query: 1636 XXXXXXXXXSELNGSKVVEVTIKGLQNENIKDKLKDIXXXXXXXXXXXXXXXXXXGPTKE 1457
                     SE NGS+ VE T KG Q++N KDKLKDI                  GPTKE
Sbjct: 353  VTSSPVPAPSESNGSRGVESTPKGQQSDNSKDKLKDINGDGNLSDSSSDSEDADSGPTKE 412

Query: 1456 ECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSQSARRALFERFVKTXXXXXXX 1277
            ECI +FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPS SARR+LFE +VKT       
Sbjct: 413  ECITQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERK 472

Query: 1276 XXXXXXXXXXEGFKQLLEEASEDIDQNTDYQTFRKKWGSDRRFEALDRKDRELLLNERVL 1097
                      EGFKQLL+EASEDID  TDYQ+FRKKW +D RFEALDRKDRE LLNERVL
Sbjct: 473  EKRAAQKAAIEGFKQLLDEASEDIDHKTDYQSFRKKWANDPRFEALDRKDREHLLNERVL 532

Query: 1096 PLKXXXXXXXXXXXXXXASSFKSMLREKGDVTLNSRWSKVKDILRNDPRYKSVKHEDREV 917
            PLK              A+SFKSML+EKGD+T++SRWS+VKD LRNDPRYKS++HEDRE+
Sbjct: 533  PLKRAAEEKAQAVRAAAATSFKSMLQEKGDITVSSRWSRVKDSLRNDPRYKSLRHEDREI 592

Query: 916  MFNEYVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 737
            +FN+Y+                                                      
Sbjct: 593  LFNQYISDLKAVEEEAEREAKAKRDEQEKLRERERELRKRKEREEQETERVRLKVRRKEA 652

Query: 736  VTSFQALLVETIKGPQASWTESRPKLEKDPQGRATNPELDPSDAEKLFREHVKMLYERCA 557
            V +FQALLVETIK PQASWT S+PKLEKDPQ RA NP+L+PSD EKLFREH+K L ERCA
Sbjct: 653  VATFQALLVETIKDPQASWTGSKPKLEKDPQRRAANPDLEPSDMEKLFREHIKRLNERCA 712

Query: 556  YDFRXXXXXXXXXXXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNKMPRKEREPQWRRYAE 377
            ++FR                EDGKTVLNSWSTAKR+LKPDPRYNKM RKERE  WRR++E
Sbjct: 713  HEFRALLAEVLTAEAASQETEDGKTVLNSWSTAKRLLKPDPRYNKMARKEREVLWRRFSE 772

Query: 376  DMLRKRKSSLDKNEDSHKDSKSRSSADGGRLPSGSRRNHERR 251
            +MLRK+KS+LD  ED   D+KSRSS D GR+P GSR  H+RR
Sbjct: 773  EMLRKQKSALDHKEDRKTDAKSRSSVDSGRVPFGSRGTHDRR 814


>ref|XP_008360017.1| PREDICTED: LOW QUALITY PROTEIN: pre-mRNA-processing protein 40C-like
            [Malus domestica]
          Length = 982

 Score =  787 bits (2032), Expect = 0.0
 Identities = 482/1016 (47%), Positives = 576/1016 (56%), Gaps = 37/1016 (3%)
 Frame = -3

Query: 3187 MTSPAWLPPEAQQLTSNAPISAKPAGGPLL-ASSTPIAPVSNVSGIAPSDSINEHSQE-- 3017
            M SPA LP E +       +S  PAGG     +++P + V   +  +P  S+N+  QE  
Sbjct: 1    MASPASLPQEVKP-----SVSVSPAGGASTQTAASPASSVGPTTSSSPCGSVNDSVQEPL 55

Query: 3016 --KSVTAPGGVVPHPSFAFRN------SGGT--QHSTSFVINSNPSVAPDVSS------- 2888
              K   AP   VP PSF++        S GT  Q S S  I SNP   P V +       
Sbjct: 56   QIKFGNAPTFAVPAPSFSYNVPPNANISFGTSQQSSPSSAIKSNPPAPPMVQAPVHGLSS 115

Query: 2887 -----LSYSVSQTVAGYSPNRQFQPNTT-KPGTVSHAVFGSSTSTN----SQPVPLXXXX 2738
                  SY++ ++   +  N+QFQ  T   P T       S +ST+    S P P     
Sbjct: 116  SASPPFSYNIPKSGYSFPNNQQFQSGTNITPATAQETGNASLSSTSLHSGSLPAPTSSST 175

Query: 2737 XXXXXXXXAHKVGATTSWMPTIPSF---PMXXXXXXXXXXXXXXXXXXXXTKDTLSAVGD 2567
                        G   S +PT+PSF   P                        T  +   
Sbjct: 176  VNISSAP---NAGPKASLVPTVPSFNMTPGMPGTPRTPGLPGIAHSVQISFNPTAPSAXI 232

Query: 2566 FNTSAVLRPS---VPMASAPSNSGSTVQHPIHPTYPSXXXXXXXXXXXXXXXXXXGVMPW 2396
             ++S  LRP+   VP+AS      S V   + P YPS                    +P 
Sbjct: 233  DSSSVALRPNMQAVPVAS------SAVHPHVGPPYPSLSAMGAPWLPSPQIGG----LPR 282

Query: 2395 LPFLPYPAVYPSPFPLPAHSMPHPSVSSADAQPPGVSYMRTSTANSKHSAISGHQLVGSS 2216
             PFLPYPA +P PFPLP H MP  SV   D+QPPGV+ +  + AN   S  S HQL GSS
Sbjct: 283  PPFLPYPAAFPGPFPLPVHVMPLSSVPLPDSQPPGVTPVGNTVANXLSSVGSRHQLAGSS 342

Query: 2215 GMRTEALPSGIDKKEHVHDVSTKGGDSVNEQLDAWTAHKTDTGIVYYYNAVTGESTYDKP 2036
            GM+ E    G         V T G   VNEQLDAWTAHKT+TG+VYYYNA+TGEST DKP
Sbjct: 343  GMQKELPHPG---------VGTDGRAVVNEQLDAWTAHKTETGVVYYYNALTGESTXDKP 393

Query: 2035 AGFKGELDKVPVQPTPVSMESLAGTDWALVTTNDGKKYYYNNKLKVSSWQIPSELTELRK 1856
             GF+ E  KV +QPTPVS  +L GTDW LVTT+DGKK+Y+N+K KVSSWQIP E+ ELR 
Sbjct: 394  PGFREEPGKVSMQPTPVSTVNLTGTDWVLVTTSDGKKFYHNSKTKVSSWQIPDEVIELRN 453

Query: 1855 KEDDDALKEHIMSTLNTNIVIEKGST-TSLSAPAINTGGRDATALRTSSVPGSSSALDLI 1679
            K+D D  KEH +S  N N++IE GS   SLSAPAINTGGR+A   + S V G  SALDLI
Sbjct: 454  KQDSDVPKEHTISVPNNNLMIEXGSAPVSLSAPAINTGGREAMPFKPSXVQGXXSALDLI 513

Query: 1678 KKKLQDSGIXXXXXXXXXXXXXXSELNGSKVVEVTIKGLQNENIKDKLKDIXXXXXXXXX 1499
            K+KLQD                 SE NG++ VE T KG Q+EN +DKLKD          
Sbjct: 514  KRKLQD-------PVTSSPISAPSESNGARGVESTPKGQQSENXEDKLKDTNGDGNLSDS 566

Query: 1498 XXXXXXXXXGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSQSARRAL 1319
                      PTKEECII+FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPS  ARR+L
Sbjct: 567  SSDSEDADSXPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHEARRSL 626

Query: 1318 FERFVKTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDQNTDYQTFRKKWGSDRRFEAL 1139
            FE +VKT                 E FKQLL+E SE+ID NTDYQ+FRKKW +D RF+AL
Sbjct: 627  FEHYVKTRAEEERKEKRAAQKAAIEEFKQLLDETSENIDHNTDYQSFRKKWSNDPRFKAL 686

Query: 1138 DRKDRELLLNERVLPLKXXXXXXXXXXXXXXASSFKSMLREKGDVTLNSRWSKVKDILRN 959
            DRKDRE LLNERVLPLK              A+ FKSML+EKGD+T++SRWS+VKD LRN
Sbjct: 687  DRKDREHLLNERVLPLKRAAEEKAQAEXAAAAAGFKSMLKEKGDITVSSRWSRVKDSLRN 746

Query: 958  DPRYKSVKHEDREVMFNEYVRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 779
            DPRYK+V+HEDREV+FN+Y+                                        
Sbjct: 747  DPRYKNVRHEDREVLFNQYISGLKAVEEEAEREAKAKRDEQEKLRERERELRKRKEREEQ 806

Query: 778  XXXXXXXXXXXXXXVTSFQALLVETIKGPQASWTESRPKLEKDPQGRATNPELDPSDAEK 599
                          V +FQALLVETIK PQASWT S+PKLEKDPQ RA NP+LDPSD +K
Sbjct: 807  ETERVRLKVXRKEAVATFQALLVETIKDPQASWTGSKPKLEKDPQRRAANPDLDPSDMDK 866

Query: 598  LFREHVKMLYERCAYDFRXXXXXXXXXXXXXXXXEDGKTVLNSWSTAKRVLKPDPRYNKM 419
            LFREHVKML ERCA++FR                EDGKTVLNSWSTAKR+L+ DPRY+K 
Sbjct: 867  LFREHVKMLNERCAHEFRTLLAEVLTAEAASQETEDGKTVLNSWSTAKRILRTDPRYDKT 926

Query: 418  PRKEREPQWRRYAEDMLRKRKSSLDKNEDSHKDSKSRSSADGGRLPSGSRRNHERR 251
            PRKERE  WRRY+E+MLRK+KS++ + ED   D+KSRSS D GR P GSR  H+RR
Sbjct: 927  PRKEREVLWRRYSEEMLRKQKSAVGRKEDRKTDAKSRSSIDAGRNPYGSRGTHDRR 982

