BLASTX nr result

ID: Forsythia21_contig00013274 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00013274
         (3441 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011073766.1| PREDICTED: pre-mRNA-processing protein 40C [...   961   0.0  
ref|XP_012842923.1| PREDICTED: pre-mRNA-processing protein 40C [...   957   0.0  
ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C i...   913   0.0  
ref|XP_010319354.1| PREDICTED: pre-mRNA-processing protein 40C i...   907   0.0  
ref|XP_006360860.1| PREDICTED: pre-mRNA-processing protein 40C-l...   906   0.0  
ref|XP_006360861.1| PREDICTED: pre-mRNA-processing protein 40C-l...   904   0.0  
ref|XP_004236882.1| PREDICTED: pre-mRNA-processing protein 40C i...   904   0.0  
ref|XP_006360858.1| PREDICTED: pre-mRNA-processing protein 40C-l...   903   0.0  
ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C i...   900   0.0  
ref|XP_010319355.1| PREDICTED: pre-mRNA-processing protein 40C i...   892   0.0  
ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C i...   888   0.0  
ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C i...   881   0.0  
ref|XP_010250268.1| PREDICTED: pre-mRNA-processing protein 40C i...   874   0.0  
ref|XP_008221026.1| PREDICTED: pre-mRNA-processing protein 40C [...   859   0.0  
ref|XP_008353148.1| PREDICTED: pre-mRNA-processing protein 40C-l...   845   0.0  
ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C i...   840   0.0  
ref|XP_007221939.1| hypothetical protein PRUPE_ppa001490mg [Prun...   840   0.0  
ref|XP_012089634.1| PREDICTED: pre-mRNA-processing protein 40C i...   838   0.0  
ref|XP_012089638.1| PREDICTED: pre-mRNA-processing protein 40C i...   835   0.0  
ref|XP_009351698.1| PREDICTED: pre-mRNA-processing protein 40C [...   835   0.0  

>ref|XP_011073766.1| PREDICTED: pre-mRNA-processing protein 40C [Sesamum indicum]
          Length = 758

 Score =  961 bits (2483), Expect = 0.0
 Identities = 498/724 (68%), Positives = 560/724 (77%), Gaps = 1/724 (0%)
 Frame = -3

Query: 2512 GPWLQSPQISGVVRPPFSPYPNVIPGPF-LPTRPMLPLSVSFPNAQPPGVNLEXXXXXXX 2336
            GPWLQ  QIS   RPPFSP+  VIPGP+  PTR   P+SV+ P+ QPPGV+         
Sbjct: 31   GPWLQPQQISAFARPPFSPFAAVIPGPYPTPTRGTPPVSVALPDIQPPGVSPAVSAVGAP 90

Query: 2335 XXXXXSGDQSTVGSTQEELPPGIDSSKRVINDESKDEASVREQLDAWTAHRTESGVVYYY 2156
                 +G Q  +G    ELPPG++++K V N E+KDEA ++EQLDAWTAHRTE+G VYYY
Sbjct: 91   TSSSTAGGQPAIGFGLAELPPGVENNKYVGNAETKDEAPIKEQLDAWTAHRTETGTVYYY 150

Query: 2155 SSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNTRTQLSS 1976
            ++LTG STYEKP GFK E DKA VQPTPISWEKL GTDW  VTTNDGKRYYYNT TQLSS
Sbjct: 151  NALTGESTYEKPPGFKGESDKATVQPTPISWEKLTGTDWTLVTTNDGKRYYYNTTTQLSS 210

Query: 1975 WQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDATALRPL 1796
            WQIP+EV EL+KKQDAD+LKAQS+SV  TN+ITE+G   V+LSTPAANTGGRDATA+RP 
Sbjct: 211  WQIPSEVTELRKKQDADALKAQSVSVTATNIITERGPDAVNLSTPAANTGGRDATAIRPS 270

Query: 1795 GVSGPSSALDLIKRKLQDSGIAAATSPGPALSGGMVLELNGSKPSEAVTKVSQHEDCIEK 1616
             VS  SSALDLIK+KLQDSG+  ++SPGP+LS  + LELNGSKP EA  K   +E+  EK
Sbjct: 271  SVSA-SSALDLIKKKLQDSGMPDSSSPGPSLSSAVALELNGSKPMEASIKGLLNENNKEK 329

Query: 1615 RKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELPKIVFDPR 1436
            RKDAN               D GPTKEECI+QFKEMLKERGVAPFSKW+KELPKIVFDPR
Sbjct: 330  RKDANTDGDISNSSSDSEDEDGGPTKEECILQFKEMLKERGVAPFSKWEKELPKIVFDPR 389

Query: 1435 FKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDYDTDYQTF 1256
            FKAIP+HSARRALFEHY                    EGFKQLLEEAKEDID++TDYQTF
Sbjct: 390  FKAIPNHSARRALFEHYVRTRAEEERKEKRAAQKAALEGFKQLLEEAKEDIDHNTDYQTF 449

Query: 1255 KRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQDRGDITS 1076
            KR+WG+DPRF+AL RKERE LLNERVLPLKRTA+EKAQAE  A ISNFKSML D+GDITS
Sbjct: 450  KRRWGEDPRFQALDRKEREALLNERVLPLKRTAQEKAQAERVAAISNFKSMLHDKGDITS 509

Query: 1075 SSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDXXXXXXXX 896
            SSRWSKVK+SLK D RYKS+KHEDREKLFNEY++ELKAAE+    KAK KQD        
Sbjct: 510  SSRWSKVKESLKCDPRYKSVKHEDREKLFNEYVAELKAAEEETVRKAKAKQDEEEKLKER 569

Query: 895  XXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETIKDPQASLTESKPKLEKDPQGR 716
                                KARR EA+ESY+ALLVETIKDPQAS TESKPKLEKDPQGR
Sbjct: 570  ERALRKRKEREEQEVERVRQKARRKEALESYQALLVETIKDPQASWTESKPKLEKDPQGR 629

Query: 715  AANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVITADAAARETEDGKTVVNSWSTA 536
            AANPHLD+SD EKLFREHVKTL ERCAV+FKALL EVI+ADAAA+ET+DGKT + SWSTA
Sbjct: 630  AANPHLDKSDLEKLFREHVKTLYERCAVEFKALLTEVISADAAAQETQDGKTAITSWSTA 689

Query: 535  KQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSSVDSDKYMS 356
            KQLLKNDPRYNKMPRK+RESLW RH EEI RKQK V DQE EK AEG+SR+SVDS K++S
Sbjct: 690  KQLLKNDPRYNKMPRKERESLWRRHAEEIQRKQKKVHDQEGEKPAEGKSRTSVDSGKHLS 749

Query: 355  GSRR 344
            GSRR
Sbjct: 750  GSRR 753


>ref|XP_012842923.1| PREDICTED: pre-mRNA-processing protein 40C [Erythranthe guttatus]
            gi|604322248|gb|EYU32634.1| hypothetical protein
            MIMGU_mgv1a001237mg [Erythranthe guttata]
          Length = 858

 Score =  957 bits (2474), Expect = 0.0
 Identities = 534/908 (58%), Positives = 626/908 (68%), Gaps = 5/908 (0%)
 Frame = -3

Query: 3046 PGSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNGNQQMMQNDLSLKTNVRT--TQE 2873
            PGSF  G+  Q M           +G+S HSANFSFNGN Q  Q D   +TNVR   TQE
Sbjct: 4    PGSFATGSAVQAM-----------EGNSLHSANFSFNGNVQSAQADQPNRTNVRGDGTQE 52

Query: 2872 IGXXXXXXXXXXXXS-RPALTNPSPSVTVFAANSFSSMSVRLPPVPSFQVPPGMPRTPLT 2696
             G            S +PA  N SPS T FA+N FS+ +  +P  P+FQVP G+ +TP T
Sbjct: 53   TGAITSSPAFMQSSSSQPARPNSSPSTTHFASNKFSN-TTWMPTAPTFQVPTGILKTP-T 110

Query: 2695 PGPPGIASSVPSSSNIIAVPSSVDSPALPRSFMSTAPVLSSXXXXXXXXXXXXXXXXXXX 2516
            PGPPG+ SS PS       PS++DS AL R FM T P LS+                   
Sbjct: 111  PGPPGLTSSAPS-------PSNLDSGALIRPFMHTGPFLSNPSIQHNAAPP--------- 154

Query: 2515 QGPWLQSPQISGVVRPPFSPYPNVIPGPF-LPTRPMLPLSVSFPNAQPPGVNLEXXXXXX 2339
             GPW +  QI    RPPFSPY  VIPGP+ +PTR   P+SVSFP+ QPPGV+        
Sbjct: 155  -GPWFRPQQIGAFGRPPFSPYAAVIPGPYPMPTRGTQPVSVSFPDIQPPGVS-------- 205

Query: 2338 XXXXXXSGDQSTVGSTQEELPPGIDSSKRVIND-ESKDEASVREQLDAWTAHRTESGVVY 2162
                  +   S  G T  ELPPG D+SK   N   +KDEA  +E LDAWTAHR E+G +Y
Sbjct: 206  -----HAASASISGPT--ELPPGTDNSKHGGNAVTTKDEAPTKE-LDAWTAHRAETGTIY 257

Query: 2161 YYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNTRTQL 1982
            YY++LTG STYEKPSGFK E +K  +QPTPISWEKL GTDW  VTTNDGK YYYN  TQL
Sbjct: 258  YYNALTGESTYEKPSGFKGESNKPTMQPTPISWEKLIGTDWTTVTTNDGKVYYYNAATQL 317

Query: 1981 SSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDATALR 1802
            SSWQ+P+EV EL+KKQDAD+LKAQSLS   TNV+ EKGS PVSLSTPAANTGGRDATA++
Sbjct: 318  SSWQVPSEVTELRKKQDADALKAQSLSATYTNVVAEKGSDPVSLSTPAANTGGRDATAVK 377

Query: 1801 PLGVSGPSSALDLIKRKLQDSGIAAATSPGPALSGGMVLELNGSKPSEAVTKVSQHEDCI 1622
               VSG SSALDLIK+KLQDSG+  +TSPGP+LS     E+NGSK  E +    ++E+  
Sbjct: 378  SSSVSGSSSALDLIKKKLQDSGLPDSTSPGPSLS-----EINGSKSIEFL----ENENNK 428

Query: 1621 EKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELPKIVFD 1442
            +KRKDAN               D GPTKEECI+QFKEMLKERGVAPFSKW+KELPKIVFD
Sbjct: 429  DKRKDANGDGDLSNSSSDSEDEDGGPTKEECILQFKEMLKERGVAPFSKWEKELPKIVFD 488

Query: 1441 PRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDYDTDYQ 1262
             RFKAI +HSARRALFEHY                    EGFKQLLEEAKEDID++TDY+
Sbjct: 489  ARFKAISNHSARRALFEHYVRTRAEEERKEKRAAQKAASEGFKQLLEEAKEDIDHNTDYE 548

Query: 1261 TFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQDRGDI 1082
            TFKRKWG+D RF+AL RKEREFLLNERV PL++ A+E+AQAE AA  S+FKSML+D GD+
Sbjct: 549  TFKRKWGQDHRFQALERKEREFLLNERVSPLRKIAQERAQAERAAATSDFKSMLKDNGDV 608

Query: 1081 TSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDXXXXXX 902
            TS+SRWSKVKDSLK D RY S+KH+DREKLFNEY++ELKAAE+    KA+  QD      
Sbjct: 609  TSTSRWSKVKDSLKSDPRYMSVKHDDREKLFNEYVAELKAAEEETVRKARAVQDEEDKIK 668

Query: 901  XXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETIKDPQASLTESKPKLEKDPQ 722
                                  KARR EA+ESY+ALLVETIKDPQAS T SKPKL+KDPQ
Sbjct: 669  ERERALRKRKEREEQEVERVRQKARRKEAIESYQALLVETIKDPQASWTASKPKLDKDPQ 728

Query: 721  GRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVITADAAARETEDGKTVVNSWS 542
            GRAANPHLD+SD EKLFREHVK+L+ERC  +F+ALL +VITA+A+ARETEDGKTV+ SWS
Sbjct: 729  GRAANPHLDKSDLEKLFREHVKSLHERCVGEFRALLTDVITAEASARETEDGKTVITSWS 788

Query: 541  TAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSSVDSDKY 362
            TAKQ+LK+DPRYNKMPRK+RESLW RH EEI RK K   DQ  EK  EG+SR+S +  K+
Sbjct: 789  TAKQVLKSDPRYNKMPRKERESLWRRHSEEIQRKLKKDSDQ-GEKPVEGKSRASAEPGKH 847

Query: 361  MSGSRRNY 338
            +SGS R +
Sbjct: 848  LSGSGRTH 855


>ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Vitis
            vinifera] gi|297738259|emb|CBI27460.3| unnamed protein
            product [Vitis vinifera]
          Length = 1046

 Score =  913 bits (2360), Expect = 0.0
 Identities = 514/992 (51%), Positives = 638/992 (64%), Gaps = 15/992 (1%)
 Frame = -3

Query: 3274 QDASEPKQNSATAYAVVRPSFSY--LNENNLPSGSSQQLSASPAVVQGHSPAGKNASSPT 3101
            Q++++ K  +A  + +  PSFSY  +      SG+SQQL +   +      +     +P 
Sbjct: 62   QESAQGKFVNAPPHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTPV 121

Query: 3100 P----SAQPAFFHPPAPSHTSRPGSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNG 2933
            P    S+ P+F +  A      PGS    ++  + +          +G + ++A+FSFNG
Sbjct: 122  PGPSSSSGPSFSYNIAHKGAGFPGSQPFQSSTSIASGP--------RGPTPNAASFSFNG 173

Query: 2932 NQQMMQNDLSLKTNVR--TTQEIGXXXXXXXXXXXXSRPALTNPSPSVTVFAANSFSSMS 2759
            N Q++Q D +LK++      QE G              P     S +++V ++      +
Sbjct: 174  NPQLVQKDQTLKSDNSGAVAQEAGSMSSASHVSQSVPFPC---SSSTMSVSSSPKMGPTT 230

Query: 2758 VRLPPVPSFQVPPGMPRTPLTPGPPGIASSVPSSSNIIAVPSSVD--SPALPRSFMSTAP 2585
            + +P  PSF VP GMP TP TPGPPGIA S P SSN+    +S+D  S  + R+    AP
Sbjct: 231  LWMPSNPSFPVPSGMPVTPGTPGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAP 290

Query: 2584 VLSSXXXXXXXXXXXXXXXXXXXQ--GPWLQSPQISGVVRPPFSPYPNVIPGPF-LPTRP 2414
            V S+                      GPWLQ PQ+ G+ RPPF PYP V P PF LP   
Sbjct: 291  VSSNPAIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHG 350

Query: 2413 MLPLSVSFPNAQPPGVNLEXXXXXXXXXXXXSGDQ--STVGSTQEELPPGIDSSKRVIND 2240
            M   SV  P++QPPGV               SG    +T G   E  PPGID +K V   
Sbjct: 351  MPLPSVPLPDSQPPGVTPVGTAGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGA 410

Query: 2239 ESKDEASVREQLDAWTAHRTESGVVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWE 2060
             +KD A+V EQ+DAWTAH+T++GVVYYY++LTG STYEKPS FK E DK  VQPTP+SWE
Sbjct: 411  GTKDGAAVNEQVDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWE 470

Query: 2059 KLAGTDWAAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNVI 1880
            KL GTDWA VTTNDGK+YYYNT+T+LSSWQIP E+ E++KKQD+ +LK  ++   NTNV 
Sbjct: 471  KLTGTDWALVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVS 530

Query: 1879 TEKGSAPVSLSTPAANTGGRDATALRPLGVSGPSSALDLIKRKLQDSGIAAATSPGPALS 1700
            TEKG +P++LS PA  TGGRDAT LR   V G +SALD+IK+KLQDSG  A +SP  + S
Sbjct: 531  TEKGPSPIALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVHS-S 589

Query: 1699 GGMVLELNGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQ 1520
            G +  ELNGS+  E   K  Q E+  +K KD N               D GPTKEECIIQ
Sbjct: 590  GPIASELNGSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQ 649

Query: 1519 FKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXX 1340
            FKEMLKERGVAPFSKW+KELPKIVFDPRFKAIP +SARR+LFEHY               
Sbjct: 650  FKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAA 709

Query: 1339 XXXXXEGFKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKRT 1160
                 EGFKQLLEEA EDID+ T+YQTF++KWG DPRFEAL RK+RE LLNERVLPLKR 
Sbjct: 710  QRAAIEGFKQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRA 769

Query: 1159 AEEKAQAEHAAVISNFKSMLQDRGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEY 980
            AEEKAQA  AA +S+FKSML+D+GDIT+S+RWS+VKDSL+ D RYK +KHEDRE LFNEY
Sbjct: 770  AEEKAQAIRAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEY 829

Query: 979  ISELKAAEKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYK 800
            ISELKAAE+ +E +AK+K++                            K RR EAV SY+
Sbjct: 830  ISELKAAEEEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQ 889

Query: 799  ALLVETIKDPQASLTESKPKLEKDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFKA 620
            ALLVETIKDPQ S TESKPKLEKDPQ RA N  LD SD EKLFREH+K L+ER A +F+A
Sbjct: 890  ALLVETIKDPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRA 949

Query: 619  LLAEVITADAAARETEDGKTVVNSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRK 440
            LL+EV+TA+AA +ETEDGKTV+ SWSTAK+LL++D RY KMPRKDRES+W R+ EE+LRK
Sbjct: 950  LLSEVLTAEAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRK 1009

Query: 439  QKSVRDQEAEKHAEGRSRSSVDSDKYMSGSRR 344
            QK  +DQ  EKH E + RSSVDS ++ SGSRR
Sbjct: 1010 QKLAQDQTEEKHTEVKGRSSVDSGRFPSGSRR 1041


>ref|XP_010319354.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Solanum
            lycopersicum]
          Length = 1040

 Score =  907 bits (2345), Expect = 0.0
 Identities = 527/1021 (51%), Positives = 636/1021 (62%), Gaps = 53/1021 (5%)
 Frame = -3

Query: 3274 QDASEPKQNSATAYAVVRPSFSYLNENNLPSGSSQQLSASPAV--------------VQG 3137
            Q+A++ K  S   Y+V R SFSY+N N +PSGSSQQ S+SP +              + G
Sbjct: 41   QEAAQGKFISPPGYSVCRASFSYMNAN-VPSGSSQQSSSSPVIPSTSAGSSASLQPPIPG 99

Query: 3136 HS------------------------------PAGKNASSPTPSAQPAFFHPPAPSHTSR 3047
             S                              PA  + S    ++  A   PP P  ++R
Sbjct: 100  QSANVGSSFSYNISQTDNNFSSGLQFSSSTLRPAAPDHSVDINASPAASLQPPLPLVSTR 159

Query: 3046 PGSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNGNQQMMQNDLSLKTNVRT--TQE 2873
              SF+PGT A               G     +N SFNG  QMMQ D ++K N R    QE
Sbjct: 160  LSSFMPGTAASA-------------GPLISGSNLSFNGGPQMMQTDQTMKPNRRVDLAQE 206

Query: 2872 IGXXXXXXXXXXXXSRPALTNPSPSVTVFAANSFSSMSV-RLPPVPSFQVPPGMPRTPLT 2696
             G            S+    +   S   F  +   S ++ R+P  P FQVP G+PR+P+T
Sbjct: 207  TGGMTSATLVMHSVSQAVHMSSGSSTAAFPTSHMGSPNIIRMPHAPQFQVPAGVPRSPVT 266

Query: 2695 PGPPGIASSVPSSSNIIAVPSSVDSPALP-RSFMSTAPVLS--SXXXXXXXXXXXXXXXX 2525
            PGPPG+  ++PSSSN+ A  S    P+LP R       VL+  S                
Sbjct: 267  PGPPGLGPAIPSSSNLTATVSP-GGPSLPLRPNAPPVHVLANPSVQQQTYSPYHSPAPIA 325

Query: 2524 XXXQGPWLQSPQISGVVRPPFSPYPNVIPGPFLPTRPMLPLS-VSFPNAQPPGVNLEXXX 2348
               QGPWLQ P ++ ++RPPF  YP     P+  +    PLS V+ P+ +PPGV      
Sbjct: 326  PSHQGPWLQPPPVTTMLRPPFPSYPAGFAVPYPLSATGAPLSSVTLPDTRPPGV----AP 381

Query: 2347 XXXXXXXXXSGDQSTVGS-TQEELPPGIDSSKRVINDESKDEASVREQLDAWTAHRTESG 2171
                     +  QST  S  Q ELPPG+DS K V + ++K  AS  EQL+ WTAHRTE+G
Sbjct: 382  VAAPPGVPTTASQSTHASGLQPELPPGVDSGKHVNDADTKQGASTSEQLETWTAHRTETG 441

Query: 2170 VVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNTR 1991
             +YYY+SLTG STYEKP+GF+ EP K A QPTP+SWE+LAGTDWA V TNDG++YYYNT+
Sbjct: 442  AIYYYNSLTGESTYEKPAGFRGEPGKVAAQPTPVSWERLAGTDWALVATNDGQKYYYNTK 501

Query: 1990 TQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDAT 1811
            T+LSSWQIP EV ELKKK DAD+L+AQS S++N N   EKGSAP+SLS PA +TGGRDAT
Sbjct: 502  TKLSSWQIPIEVTELKKKHDADALQAQSPSILNVNESAEKGSAPISLSIPAVSTGGRDAT 561

Query: 1810 ALRPLGVSGPSSALDLIKRKLQDSGI-AAATSPGPALSGGMVLELNGSKPSEAVTKVSQH 1634
            +LRP  V G SSALDL+K+KL D G   A +SP PA SG +  E+NGSK  E+ T++ Q 
Sbjct: 562  SLRPSLVPG-SSALDLVKKKLMDFGTPLAVSSPAPASSGVISSEVNGSKALESTTRIPQK 620

Query: 1633 EDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELPK 1454
            E+  EK K+AN               +  PTKE+CIIQFKEMLKERGVAPFSKW+KELPK
Sbjct: 621  ENSKEKSKEANDNGNLSESSSDSEDDESVPTKEDCIIQFKEMLKERGVAPFSKWEKELPK 680

Query: 1453 IVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDYD 1274
            IVFDPRFKAIPS+SAR+ LFEHY                    EGFKQLLEEAKEDI  D
Sbjct: 681  IVFDPRFKAIPSYSARKTLFEHYVKTRADEERKEKRAAQKAAVEGFKQLLEEAKEDISED 740

Query: 1273 TDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQD 1094
            TDYQ+FK+KW  DPRFE+L RKERE LLNERVL L++ A+EKA A  AAVIS FKSML++
Sbjct: 741  TDYQSFKKKWSHDPRFESLDRKEREVLLNERVLQLRKAAQEKAHAVRAAVISQFKSMLRE 800

Query: 1093 RGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDXX 914
            +GDIT ++RWSKVKDSL+ D RYKS+KHEDRE LFNEY+SELKAAE+ +   AK K D  
Sbjct: 801  QGDITLNTRWSKVKDSLRSDPRYKSVKHEDRETLFNEYLSELKAAEQEVARIAKAKHDEE 860

Query: 913  XXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETIKDPQASLTESKPKLE 734
                                      KARR EAVESY+ALLVE IKDPQAS TESKPKLE
Sbjct: 861  DKLKERERALRKRKEREEQEVERVRSKARRKEAVESYQALLVEIIKDPQASWTESKPKLE 920

Query: 733  KDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVITADAAARETEDGKTVV 554
            KDPQGRAANPHLDQSD EKLFREHVK L ERC  +FK LLAEVIT +A +RETEDGKTV 
Sbjct: 921  KDPQGRAANPHLDQSDLEKLFREHVKVLYERCVQEFKVLLAEVITVEACSRETEDGKTVA 980

Query: 553  NSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSSVD 374
            NSWSTAKQ+LK D RY+KM RKD E+LW R+VE+I R+QKS  D EA+K    RS+ S D
Sbjct: 981  NSWSTAKQVLKGDLRYSKMARKDSETLWRRYVEDIHRRQKSTLD-EADK---ARSKGSSD 1036

Query: 373  S 371
            S
Sbjct: 1037 S 1037


>ref|XP_006360860.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X3 [Solanum
            tuberosum]
          Length = 1036

 Score =  906 bits (2341), Expect = 0.0
 Identities = 533/1022 (52%), Positives = 641/1022 (62%), Gaps = 54/1022 (5%)
 Frame = -3

Query: 3274 QDASEPKQNSATAYAVVRPSFSYLNENNLPSGSSQQLSASPAV--------------VQG 3137
            Q+A++ K  S   Y+V R SFSY+N N +PSGSSQQ S+SP +              + G
Sbjct: 41   QEAAQGKFISPPGYSVCRASFSYMNAN-VPSGSSQQPSSSPVIPSTSAGSSALLQPPIPG 99

Query: 3136 HS------------------------------PAGKNASSPTPSAQPAF-FHPPAPSHTS 3050
             S                              PA  + S    +A PA    PP P  ++
Sbjct: 100  QSANVGSSFSYNISQTDNNFSSGLQFSSSTLRPAAPDHSVDVKNASPAASLQPPLPLVST 159

Query: 3049 RPGSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNGNQQMMQNDLSLKTNVRT--TQ 2876
            R  SF+PG TA               G     +N SFNG  QMMQ D ++K N R    Q
Sbjct: 160  RLSSFMPGITAAA-------------GPLISGSNLSFNGGPQMMQTDQTMKPNRRVDVAQ 206

Query: 2875 EIGXXXXXXXXXXXXSRPALTNPSPSVTVFAANSFSSMSV-RLPPVPSFQVPPGMPRTPL 2699
            E G            S+    +   S   F  +   S ++ R+P  P FQVP G+P++P+
Sbjct: 207  ETGGMTSATFVMHSVSQAVHMSSGSSTAAFPTSHMGSPNIIRMPHAPQFQVPAGVPKSPV 266

Query: 2698 TPGPPGIASSVPSSSNIIAVPSSVDSPALP-RSFMSTAPVLS--SXXXXXXXXXXXXXXX 2528
            TPGP     ++PSSSN+ A  +S   P+LP R   S   VL+  S               
Sbjct: 267  TPGP-----AIPSSSNLTAT-ASPGGPSLPLRPNASPVHVLANPSVQQQTYSPYFSPTPI 320

Query: 2527 XXXXQGPWLQSPQISGVVRPPFSPYPNVIPGPFLPTRPMLPLS-VSFPNAQPPGVNLEXX 2351
                QGPWLQ P ++ ++RPPF  YP     PF  +    PLS V+ P+ +PPGV     
Sbjct: 321  TPSHQGPWLQPPPVTTMLRPPFPSYPAGFAVPFPLSATGAPLSSVTLPDTRPPGV----A 376

Query: 2350 XXXXXXXXXXSGDQSTVGS-TQEELPPGIDSSKRVINDESKDEASVREQLDAWTAHRTES 2174
                      +  Q T  S  Q ELPPG+DS K V + ++K  AS  EQL+ WTAHRTE+
Sbjct: 377  PVAAPPGVPTTASQPTHASGLQPELPPGVDSGKHVNDADTKQGASTSEQLETWTAHRTET 436

Query: 2173 GVVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNT 1994
            G +YYY+SLTG STYEKP+GF+ EP K A QPTP+SWE+LAGTDWA V TNDG+RYYYNT
Sbjct: 437  GAIYYYNSLTGESTYEKPAGFRGEPGKVAAQPTPVSWERLAGTDWALVATNDGQRYYYNT 496

Query: 1993 RTQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDA 1814
            +T+LSSWQIP+EV ELKKK DAD+L+AQS S++N N  TEKGSAP+SLS PA +TGGRDA
Sbjct: 497  KTKLSSWQIPSEVTELKKKHDADALQAQSPSILNVNESTEKGSAPISLSIPAVSTGGRDA 556

Query: 1813 TALRPLGVSGPSSALDLIKRKLQDSGI-AAATSPGPALSGGMVLELNGSKPSEAVTKVSQ 1637
            T+LRP  V G SSALDL+K+KL D G   A +SP PA SG +  E+NGSK  E+ T+V Q
Sbjct: 557  TSLRPSLVPG-SSALDLVKKKLMDFGAPLAVSSPVPASSGVISSEVNGSKALESTTRVPQ 615

Query: 1636 HEDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELP 1457
             E+  EK K+ N               +  PTKE+CIIQFKEMLKERGVAPFSKW+KELP
Sbjct: 616  KENSKEKSKEVNDNGNLSESSSDSEDDESVPTKEDCIIQFKEMLKERGVAPFSKWEKELP 675

Query: 1456 KIVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDY 1277
            KIVFDPRFKAIPS+SAR+ALFEHY                    EGFKQLLEEAKEDI+ 
Sbjct: 676  KIVFDPRFKAIPSYSARKALFEHYVKTRADEERKEKRAAQKAAVEGFKQLLEEAKEDINE 735

Query: 1276 DTDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQ 1097
            DTDYQ+FK+KWG DPRFE+L RKERE LLNERVL L++ A+EKA A  AAVIS FKSML+
Sbjct: 736  DTDYQSFKKKWGHDPRFESLDRKEREVLLNERVLQLRKAAQEKAHAVRAAVISQFKSMLR 795

Query: 1096 DRGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDX 917
            ++GDIT ++RWSKVKDSL+ D RYKS+KHEDRE LFNEY+SELKAAE+ +   AK K D 
Sbjct: 796  EQGDITLNTRWSKVKDSLRSDPRYKSVKHEDRETLFNEYLSELKAAEQEVARIAKAKHDE 855

Query: 916  XXXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETIKDPQASLTESKPKL 737
                                       KARR EAVESY+ALLVE IKDPQAS TESKPKL
Sbjct: 856  EDKLKLRERALRKRKEREEQEVERVRSKARRKEAVESYQALLVEIIKDPQASWTESKPKL 915

Query: 736  EKDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVITADAAARETEDGKTV 557
            EKDPQGRAANPHLDQSD EKLFREHVK L ERCA +FK LLAEVIT +A +RETE+GKTV
Sbjct: 916  EKDPQGRAANPHLDQSDLEKLFREHVKVLYERCAQEFKVLLAEVITVEACSRETENGKTV 975

Query: 556  VNSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSSV 377
             NSWSTAKQLLK D RY+KM RKDRE+LW R+VE+I R+QKS  D EA+K    RS+ S 
Sbjct: 976  ANSWSTAKQLLKGDLRYSKMARKDRETLWRRYVEDIHRRQKSTLD-EADK---ARSKGSS 1031

Query: 376  DS 371
            DS
Sbjct: 1032 DS 1033


>ref|XP_006360861.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X4 [Solanum
            tuberosum]
          Length = 1027

 Score =  904 bits (2336), Expect = 0.0
 Identities = 529/1013 (52%), Positives = 637/1013 (62%), Gaps = 45/1013 (4%)
 Frame = -3

Query: 3274 QDASEPKQNSATAYAVVRPSFSYLNENNLPSGSSQQLSASPAVVQGHSPAGKNASSPTP- 3098
            Q+A++ K  S   Y+V R SFSY+N N +PSGSSQQ S+SP +    + +      P P 
Sbjct: 41   QEAAQGKFISPPGYSVCRASFSYMNAN-VPSGSSQQPSSSPVIPSTSAGSSALLQPPIPG 99

Query: 3097 --------------------------------SAQPAF-FHPPAPSHTSRPGSFVPGTTA 3017
                                            +A PA    PP P  ++R  SF+PG TA
Sbjct: 100  QSANVGSSFSYNISQTDNNFSSGLQFSSSTLRNASPAASLQPPLPLVSTRLSSFMPGITA 159

Query: 3016 QLMNXXXXXXXXXPQGSSSHSANFSFNGNQQMMQNDLSLKT----NVRTTQEIGXXXXXX 2849
                           G     +N SFNG  QMMQ D ++K      V   QE G      
Sbjct: 160  AA-------------GPLISGSNLSFNGGPQMMQTDQTMKPLQNRRVDVAQETGGMTSAT 206

Query: 2848 XXXXXXSRPALTNPSPSVTVFAANSFSSMSV-RLPPVPSFQVPPGMPRTPLTPGPPGIAS 2672
                  S+    +   S   F  +   S ++ R+P  P FQVP G+P++P+TPGP     
Sbjct: 207  FVMHSVSQAVHMSSGSSTAAFPTSHMGSPNIIRMPHAPQFQVPAGVPKSPVTPGP----- 261

Query: 2671 SVPSSSNIIAVPSSVDSPALP-RSFMSTAPVLS--SXXXXXXXXXXXXXXXXXXXQGPWL 2501
            ++PSSSN+ A  +S   P+LP R   S   VL+  S                   QGPWL
Sbjct: 262  AIPSSSNLTAT-ASPGGPSLPLRPNASPVHVLANPSVQQQTYSPYFSPTPITPSHQGPWL 320

Query: 2500 QSPQISGVVRPPFSPYPNVIPGPFLPTRPMLPLS-VSFPNAQPPGVNLEXXXXXXXXXXX 2324
            Q P ++ ++RPPF  YP     PF  +    PLS V+ P+ +PPGV              
Sbjct: 321  QPPPVTTMLRPPFPSYPAGFAVPFPLSATGAPLSSVTLPDTRPPGV----APVAAPPGVP 376

Query: 2323 XSGDQSTVGS-TQEELPPGIDSSKRVINDESKDEASVREQLDAWTAHRTESGVVYYYSSL 2147
             +  Q T  S  Q ELPPG+DS K V + ++K  AS  EQL+ WTAHRTE+G +YYY+SL
Sbjct: 377  TTASQPTHASGLQPELPPGVDSGKHVNDADTKQGASTSEQLETWTAHRTETGAIYYYNSL 436

Query: 2146 TGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNTRTQLSSWQI 1967
            TG STYEKP+GF+ EP K A QPTP+SWE+LAGTDWA V TNDG+RYYYNT+T+LSSWQI
Sbjct: 437  TGESTYEKPAGFRGEPGKVAAQPTPVSWERLAGTDWALVATNDGQRYYYNTKTKLSSWQI 496

Query: 1966 PNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDATALRPLGVS 1787
            P+EV ELKKK DAD+L+AQS S++N N  TEKGSAP+SLS PA +TGGRDAT+LRP  V 
Sbjct: 497  PSEVTELKKKHDADALQAQSPSILNVNESTEKGSAPISLSIPAVSTGGRDATSLRPSLVP 556

Query: 1786 GPSSALDLIKRKLQDSGI-AAATSPGPALSGGMVLELNGSKPSEAVTKVSQHEDCIEKRK 1610
            G SSALDL+K+KL D G   A +SP PA SG +  E+NGSK  E+ T+V Q E+  EK K
Sbjct: 557  G-SSALDLVKKKLMDFGAPLAVSSPVPASSGVISSEVNGSKALESTTRVPQKENSKEKSK 615

Query: 1609 DANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFK 1430
            + N               +  PTKE+CIIQFKEMLKERGVAPFSKW+KELPKIVFDPRFK
Sbjct: 616  EVNDNGNLSESSSDSEDDESVPTKEDCIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFK 675

Query: 1429 AIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDYDTDYQTFKR 1250
            AIPS+SAR+ALFEHY                    EGFKQLLEEAKEDI+ DTDYQ+FK+
Sbjct: 676  AIPSYSARKALFEHYVKTRADEERKEKRAAQKAAVEGFKQLLEEAKEDINEDTDYQSFKK 735

Query: 1249 KWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQDRGDITSSS 1070
            KWG DPRFE+L RKERE LLNERVL L++ A+EKA A  AAVIS FKSML+++GDIT ++
Sbjct: 736  KWGHDPRFESLDRKEREVLLNERVLQLRKAAQEKAHAVRAAVISQFKSMLREQGDITLNT 795

Query: 1069 RWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDXXXXXXXXXX 890
            RWSKVKDSL+ D RYKS+KHEDRE LFNEY+SELKAAE+ +   AK K D          
Sbjct: 796  RWSKVKDSLRSDPRYKSVKHEDRETLFNEYLSELKAAEQEVARIAKAKHDEEDKLKLRER 855

Query: 889  XXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETIKDPQASLTESKPKLEKDPQGRAA 710
                              KARR EAVESY+ALLVE IKDPQAS TESKPKLEKDPQGRAA
Sbjct: 856  ALRKRKEREEQEVERVRSKARRKEAVESYQALLVEIIKDPQASWTESKPKLEKDPQGRAA 915

Query: 709  NPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVITADAAARETEDGKTVVNSWSTAKQ 530
            NPHLDQSD EKLFREHVK L ERCA +FK LLAEVIT +A +RETE+GKTV NSWSTAKQ
Sbjct: 916  NPHLDQSDLEKLFREHVKVLYERCAQEFKVLLAEVITVEACSRETENGKTVANSWSTAKQ 975

Query: 529  LLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSSVDS 371
            LLK D RY+KM RKDRE+LW R+VE+I R+QKS  D EA+K    RS+ S DS
Sbjct: 976  LLKGDLRYSKMARKDRETLWRRYVEDIHRRQKSTLD-EADK---ARSKGSSDS 1024


>ref|XP_004236882.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Solanum
            lycopersicum]
          Length = 1042

 Score =  904 bits (2336), Expect = 0.0
 Identities = 526/1023 (51%), Positives = 635/1023 (62%), Gaps = 55/1023 (5%)
 Frame = -3

Query: 3274 QDASEPKQNSATAYAVVRPSFSYLNENNLPSGSSQQLSASPAV--------------VQG 3137
            Q+A++ K  S   Y+V R SFSY+N N +PSGSSQQ S+SP +              + G
Sbjct: 41   QEAAQGKFISPPGYSVCRASFSYMNAN-VPSGSSQQSSSSPVIPSTSAGSSASLQPPIPG 99

Query: 3136 HS------------------------------PAGKNASSPTPSAQPAFFHPPAPSHTSR 3047
             S                              PA  + S    ++  A   PP P  ++R
Sbjct: 100  QSANVGSSFSYNISQTDNNFSSGLQFSSSTLRPAAPDHSVDINASPAASLQPPLPLVSTR 159

Query: 3046 PGSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNGNQQMMQNDLSLKT----NVRTT 2879
              SF+PGT A               G     +N SFNG  QMMQ D ++K      V   
Sbjct: 160  LSSFMPGTAASA-------------GPLISGSNLSFNGGPQMMQTDQTMKPLQNRRVDLA 206

Query: 2878 QEIGXXXXXXXXXXXXSRPALTNPSPSVTVFAANSFSSMSV-RLPPVPSFQVPPGMPRTP 2702
            QE G            S+    +   S   F  +   S ++ R+P  P FQVP G+PR+P
Sbjct: 207  QETGGMTSATLVMHSVSQAVHMSSGSSTAAFPTSHMGSPNIIRMPHAPQFQVPAGVPRSP 266

Query: 2701 LTPGPPGIASSVPSSSNIIAVPSSVDSPALP-RSFMSTAPVLS--SXXXXXXXXXXXXXX 2531
            +TPGPPG+  ++PSSSN+ A  S    P+LP R       VL+  S              
Sbjct: 267  VTPGPPGLGPAIPSSSNLTATVSP-GGPSLPLRPNAPPVHVLANPSVQQQTYSPYHSPAP 325

Query: 2530 XXXXXQGPWLQSPQISGVVRPPFSPYPNVIPGPFLPTRPMLPLS-VSFPNAQPPGVNLEX 2354
                 QGPWLQ P ++ ++RPPF  YP     P+  +    PLS V+ P+ +PPGV    
Sbjct: 326  IAPSHQGPWLQPPPVTTMLRPPFPSYPAGFAVPYPLSATGAPLSSVTLPDTRPPGV---- 381

Query: 2353 XXXXXXXXXXXSGDQSTVGS-TQEELPPGIDSSKRVINDESKDEASVREQLDAWTAHRTE 2177
                       +  QST  S  Q ELPPG+DS K V + ++K  AS  EQL+ WTAHRTE
Sbjct: 382  APVAAPPGVPTTASQSTHASGLQPELPPGVDSGKHVNDADTKQGASTSEQLETWTAHRTE 441

Query: 2176 SGVVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYN 1997
            +G +YYY+SLTG STYEKP+GF+ EP K A QPTP+SWE+LAGTDWA V TNDG++YYYN
Sbjct: 442  TGAIYYYNSLTGESTYEKPAGFRGEPGKVAAQPTPVSWERLAGTDWALVATNDGQKYYYN 501

Query: 1996 TRTQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRD 1817
            T+T+LSSWQIP EV ELKKK DAD+L+AQS S++N N   EKGSAP+SLS PA +TGGRD
Sbjct: 502  TKTKLSSWQIPIEVTELKKKHDADALQAQSPSILNVNESAEKGSAPISLSIPAVSTGGRD 561

Query: 1816 ATALRPLGVSGPSSALDLIKRKLQDSGI-AAATSPGPALSGGMVLELNGSKPSEAVTKVS 1640
            AT+LRP  V G SSALDL+K+KL D G   A +SP PA SG +  E+NGSK  E+ T++ 
Sbjct: 562  ATSLRPSLVPG-SSALDLVKKKLMDFGTPLAVSSPAPASSGVISSEVNGSKALESTTRIP 620

Query: 1639 QHEDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKEL 1460
            Q E+  EK K+AN               +  PTKE+CIIQFKEMLKERGVAPFSKW+KEL
Sbjct: 621  QKENSKEKSKEANDNGNLSESSSDSEDDESVPTKEDCIIQFKEMLKERGVAPFSKWEKEL 680

Query: 1459 PKIVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDID 1280
            PKIVFDPRFKAIPS+SAR+ LFEHY                    EGFKQLLEEAKEDI 
Sbjct: 681  PKIVFDPRFKAIPSYSARKTLFEHYVKTRADEERKEKRAAQKAAVEGFKQLLEEAKEDIS 740

Query: 1279 YDTDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSML 1100
             DTDYQ+FK+KW  DPRFE+L RKERE LLNERVL L++ A+EKA A  AAVIS FKSML
Sbjct: 741  EDTDYQSFKKKWSHDPRFESLDRKEREVLLNERVLQLRKAAQEKAHAVRAAVISQFKSML 800

Query: 1099 QDRGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQD 920
            +++GDIT ++RWSKVKDSL+ D RYKS+KHEDRE LFNEY+SELKAAE+ +   AK K D
Sbjct: 801  REQGDITLNTRWSKVKDSLRSDPRYKSVKHEDRETLFNEYLSELKAAEQEVARIAKAKHD 860

Query: 919  XXXXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETIKDPQASLTESKPK 740
                                        KARR EAVESY+ALLVE IKDPQAS TESKPK
Sbjct: 861  EEDKLKERERALRKRKEREEQEVERVRSKARRKEAVESYQALLVEIIKDPQASWTESKPK 920

Query: 739  LEKDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVITADAAARETEDGKT 560
            LEKDPQGRAANPHLDQSD EKLFREHVK L ERC  +FK LLAEVIT +A +RETEDGKT
Sbjct: 921  LEKDPQGRAANPHLDQSDLEKLFREHVKVLYERCVQEFKVLLAEVITVEACSRETEDGKT 980

Query: 559  VVNSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSS 380
            V NSWSTAKQ+LK D RY+KM RKD E+LW R+VE+I R+QKS  D EA+K    RS+ S
Sbjct: 981  VANSWSTAKQVLKGDLRYSKMARKDSETLWRRYVEDIHRRQKSTLD-EADK---ARSKGS 1036

Query: 379  VDS 371
             DS
Sbjct: 1037 SDS 1039


>ref|XP_006360858.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Solanum
            tuberosum] gi|565390252|ref|XP_006360859.1| PREDICTED:
            pre-mRNA-processing protein 40C-like isoform X2 [Solanum
            tuberosum]
          Length = 1038

 Score =  903 bits (2333), Expect = 0.0
 Identities = 532/1024 (51%), Positives = 640/1024 (62%), Gaps = 56/1024 (5%)
 Frame = -3

Query: 3274 QDASEPKQNSATAYAVVRPSFSYLNENNLPSGSSQQLSASPAV--------------VQG 3137
            Q+A++ K  S   Y+V R SFSY+N N +PSGSSQQ S+SP +              + G
Sbjct: 41   QEAAQGKFISPPGYSVCRASFSYMNAN-VPSGSSQQPSSSPVIPSTSAGSSALLQPPIPG 99

Query: 3136 HS------------------------------PAGKNASSPTPSAQPAF-FHPPAPSHTS 3050
             S                              PA  + S    +A PA    PP P  ++
Sbjct: 100  QSANVGSSFSYNISQTDNNFSSGLQFSSSTLRPAAPDHSVDVKNASPAASLQPPLPLVST 159

Query: 3049 RPGSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNGNQQMMQNDLSLKT----NVRT 2882
            R  SF+PG TA               G     +N SFNG  QMMQ D ++K      V  
Sbjct: 160  RLSSFMPGITAAA-------------GPLISGSNLSFNGGPQMMQTDQTMKPLQNRRVDV 206

Query: 2881 TQEIGXXXXXXXXXXXXSRPALTNPSPSVTVFAANSFSSMSV-RLPPVPSFQVPPGMPRT 2705
             QE G            S+    +   S   F  +   S ++ R+P  P FQVP G+P++
Sbjct: 207  AQETGGMTSATFVMHSVSQAVHMSSGSSTAAFPTSHMGSPNIIRMPHAPQFQVPAGVPKS 266

Query: 2704 PLTPGPPGIASSVPSSSNIIAVPSSVDSPALP-RSFMSTAPVLS--SXXXXXXXXXXXXX 2534
            P+TPGP     ++PSSSN+ A  +S   P+LP R   S   VL+  S             
Sbjct: 267  PVTPGP-----AIPSSSNLTAT-ASPGGPSLPLRPNASPVHVLANPSVQQQTYSPYFSPT 320

Query: 2533 XXXXXXQGPWLQSPQISGVVRPPFSPYPNVIPGPFLPTRPMLPLS-VSFPNAQPPGVNLE 2357
                  QGPWLQ P ++ ++RPPF  YP     PF  +    PLS V+ P+ +PPGV   
Sbjct: 321  PITPSHQGPWLQPPPVTTMLRPPFPSYPAGFAVPFPLSATGAPLSSVTLPDTRPPGV--- 377

Query: 2356 XXXXXXXXXXXXSGDQSTVGS-TQEELPPGIDSSKRVINDESKDEASVREQLDAWTAHRT 2180
                        +  Q T  S  Q ELPPG+DS K V + ++K  AS  EQL+ WTAHRT
Sbjct: 378  -APVAAPPGVPTTASQPTHASGLQPELPPGVDSGKHVNDADTKQGASTSEQLETWTAHRT 436

Query: 2179 ESGVVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYY 2000
            E+G +YYY+SLTG STYEKP+GF+ EP K A QPTP+SWE+LAGTDWA V TNDG+RYYY
Sbjct: 437  ETGAIYYYNSLTGESTYEKPAGFRGEPGKVAAQPTPVSWERLAGTDWALVATNDGQRYYY 496

Query: 1999 NTRTQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGR 1820
            NT+T+LSSWQIP+EV ELKKK DAD+L+AQS S++N N  TEKGSAP+SLS PA +TGGR
Sbjct: 497  NTKTKLSSWQIPSEVTELKKKHDADALQAQSPSILNVNESTEKGSAPISLSIPAVSTGGR 556

Query: 1819 DATALRPLGVSGPSSALDLIKRKLQDSGI-AAATSPGPALSGGMVLELNGSKPSEAVTKV 1643
            DAT+LRP  V G SSALDL+K+KL D G   A +SP PA SG +  E+NGSK  E+ T+V
Sbjct: 557  DATSLRPSLVPG-SSALDLVKKKLMDFGAPLAVSSPVPASSGVISSEVNGSKALESTTRV 615

Query: 1642 SQHEDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKE 1463
             Q E+  EK K+ N               +  PTKE+CIIQFKEMLKERGVAPFSKW+KE
Sbjct: 616  PQKENSKEKSKEVNDNGNLSESSSDSEDDESVPTKEDCIIQFKEMLKERGVAPFSKWEKE 675

Query: 1462 LPKIVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDI 1283
            LPKIVFDPRFKAIPS+SAR+ALFEHY                    EGFKQLLEEAKEDI
Sbjct: 676  LPKIVFDPRFKAIPSYSARKALFEHYVKTRADEERKEKRAAQKAAVEGFKQLLEEAKEDI 735

Query: 1282 DYDTDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSM 1103
            + DTDYQ+FK+KWG DPRFE+L RKERE LLNERVL L++ A+EKA A  AAVIS FKSM
Sbjct: 736  NEDTDYQSFKKKWGHDPRFESLDRKEREVLLNERVLQLRKAAQEKAHAVRAAVISQFKSM 795

Query: 1102 LQDRGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQ 923
            L+++GDIT ++RWSKVKDSL+ D RYKS+KHEDRE LFNEY+SELKAAE+ +   AK K 
Sbjct: 796  LREQGDITLNTRWSKVKDSLRSDPRYKSVKHEDRETLFNEYLSELKAAEQEVARIAKAKH 855

Query: 922  DXXXXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETIKDPQASLTESKP 743
            D                            KARR EAVESY+ALLVE IKDPQAS TESKP
Sbjct: 856  DEEDKLKLRERALRKRKEREEQEVERVRSKARRKEAVESYQALLVEIIKDPQASWTESKP 915

Query: 742  KLEKDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVITADAAARETEDGK 563
            KLEKDPQGRAANPHLDQSD EKLFREHVK L ERCA +FK LLAEVIT +A +RETE+GK
Sbjct: 916  KLEKDPQGRAANPHLDQSDLEKLFREHVKVLYERCAQEFKVLLAEVITVEACSRETENGK 975

Query: 562  TVVNSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRS 383
            TV NSWSTAKQLLK D RY+KM RKDRE+LW R+VE+I R+QKS  D EA+K    RS+ 
Sbjct: 976  TVANSWSTAKQLLKGDLRYSKMARKDRETLWRRYVEDIHRRQKSTLD-EADK---ARSKG 1031

Query: 382  SVDS 371
            S DS
Sbjct: 1032 SSDS 1035


>ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Vitis
            vinifera]
          Length = 903

 Score =  900 bits (2326), Expect = 0.0
 Identities = 491/886 (55%), Positives = 595/886 (67%), Gaps = 9/886 (1%)
 Frame = -3

Query: 2974 QGSSSHSANFSFNGNQQMMQNDLSLKTNVR--TTQEIGXXXXXXXXXXXXSRPALTNPSP 2801
            +G + ++A+FSFNGN Q++Q D +LK++      QE G              P     S 
Sbjct: 17   RGPTPNAASFSFNGNPQLVQKDQTLKSDNSGAVAQEAGSMSSASHVSQSVPFPC---SSS 73

Query: 2800 SVTVFAANSFSSMSVRLPPVPSFQVPPGMPRTPLTPGPPGIASSVPSSSNIIAVPSSVD- 2624
            +++V ++      ++ +P  PSF VP GMP TP TPGPPGIA S P SSN+    +S+D 
Sbjct: 74   TMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPGIAPSTPLSSNLAVPSASMDF 133

Query: 2623 -SPALPRSFMSTAPVLSSXXXXXXXXXXXXXXXXXXXQ--GPWLQSPQISGVVRPPFSPY 2453
             S  + R+    APV S+                      GPWLQ PQ+ G+ RPPF PY
Sbjct: 134  SSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPY 193

Query: 2452 PNVIPGPF-LPTRPMLPLSVSFPNAQPPGVNLEXXXXXXXXXXXXSGDQ--STVGSTQEE 2282
            P V P PF LP   M   SV  P++QPPGV               SG    +T G   E 
Sbjct: 194  PAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAVSGHHLANTSGMLSEL 253

Query: 2281 LPPGIDSSKRVINDESKDEASVREQLDAWTAHRTESGVVYYYSSLTGVSTYEKPSGFKDE 2102
             PPGID +K V    +KD A+V EQ+DAWTAH+T++GVVYYY++LTG STYEKPS FK E
Sbjct: 254  PPPGIDDNKHVNGAGTKDGAAVNEQVDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGE 313

Query: 2101 PDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKKKQDADS 1922
             DK  VQPTP+SWEKL GTDWA VTTNDGK+YYYNT+T+LSSWQIP E+ E++KKQD+ +
Sbjct: 314  ADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVA 373

Query: 1921 LKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDATALRPLGVSGPSSALDLIKRKLQD 1742
            LK  ++   NTNV TEKG +P++LS PA  TGGRDAT LR   V G +SALD+IK+KLQD
Sbjct: 374  LKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQD 433

Query: 1741 SGIAAATSPGPALSGGMVLELNGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXXXXXXXX 1562
            SG  A +SP  + SG +  ELNGS+  E   K  Q E+  +K KD N             
Sbjct: 434  SGAPATSSPVHS-SGPIASELNGSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSE 492

Query: 1561 XXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSHSARRALFEHYX 1382
              D GPTKEECIIQFKEMLKERGVAPFSKW+KELPKIVFDPRFKAIP +SARR+LFEHY 
Sbjct: 493  DVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYV 552

Query: 1381 XXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEALSRKER 1202
                               EGFKQLLEEA EDID+ T+YQTF++KWG DPRFEAL RK+R
Sbjct: 553  RTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDR 612

Query: 1201 EFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQDRGDITSSSRWSKVKDSLKGDARYK 1022
            E LLNERVLPLKR AEEKAQA  AA +S+FKSML+D+GDIT+S+RWS+VKDSL+ D RYK
Sbjct: 613  ELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYK 672

Query: 1021 SIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXXXXXXXX 842
             +KHEDRE LFNEYISELKAAE+ +E +AK+K++                          
Sbjct: 673  CVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERV 732

Query: 841  XXKARRMEAVESYKALLVETIKDPQASLTESKPKLEKDPQGRAANPHLDQSDSEKLFREH 662
              K RR EAV SY+ALLVETIKDPQ S TESKPKLEKDPQ RA N  LD SD EKLFREH
Sbjct: 733  RLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREH 792

Query: 661  VKTLNERCAVDFKALLAEVITADAAARETEDGKTVVNSWSTAKQLLKNDPRYNKMPRKDR 482
            +K L+ER A +F+ALL+EV+TA+AA +ETEDGKTV+ SWSTAK+LL++D RY KMPRKDR
Sbjct: 793  IKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDR 852

Query: 481  ESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSSVDSDKYMSGSRR 344
            ES+W R+ EE+LRKQK  +DQ  EKH E + RSSVDS ++ SGSRR
Sbjct: 853  ESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDSGRFPSGSRR 898


>ref|XP_010319355.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Solanum
            lycopersicum]
          Length = 1014

 Score =  892 bits (2306), Expect = 0.0
 Identities = 522/1022 (51%), Positives = 627/1022 (61%), Gaps = 54/1022 (5%)
 Frame = -3

Query: 3274 QDASEPKQNSATAYAVVRPSFSYLNENNLPSGSSQQLSASPAV--------------VQG 3137
            Q+A++ K  S   Y+V R SFSY+N N +PSGSSQQ S+SP +              + G
Sbjct: 41   QEAAQGKFISPPGYSVCRASFSYMNAN-VPSGSSQQSSSSPVIPSTSAGSSASLQPPIPG 99

Query: 3136 HS------------------------------PAGKNASSPTPSAQPAFFHPPAPSHTSR 3047
             S                              PA  + S    ++  A   PP P  ++R
Sbjct: 100  QSANVGSSFSYNISQTDNNFSSGLQFSSSTLRPAAPDHSVDINASPAASLQPPLPLVSTR 159

Query: 3046 PGSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNGNQQMMQNDLSLKT----NVRTT 2879
              SF+PGT A               G     +N SFNG  QMMQ D ++K      V   
Sbjct: 160  LSSFMPGTAASA-------------GPLISGSNLSFNGGPQMMQTDQTMKPLQNRRVDLA 206

Query: 2878 QEIGXXXXXXXXXXXXSRPALTNPSPSVTVFAANSFSSMSV-RLPPVPSFQVPPGMPRTP 2702
            QE G            S+    +   S   F  +   S ++ R+P  P FQVP G+PR+P
Sbjct: 207  QETGGMTSATLVMHSVSQAVHMSSGSSTAAFPTSHMGSPNIIRMPHAPQFQVPAGVPRSP 266

Query: 2701 LTPGPPGIASSVPSSSNIIAVPSSVDSPALP-RSFMSTAPVLS--SXXXXXXXXXXXXXX 2531
            +TPGPPG+  ++PSSSN+ A  S    P+LP R       VL+  S              
Sbjct: 267  VTPGPPGLGPAIPSSSNLTATVSP-GGPSLPLRPNAPPVHVLANPSVQQQTYSPYHSPAP 325

Query: 2530 XXXXXQGPWLQSPQISGVVRPPFSPYPNVIPGPFLPTRPMLPLSVSFPNAQPPGVNLEXX 2351
                 QGPWLQ P ++ ++RPPF  YP                  + P A PPGV     
Sbjct: 326  IAPSHQGPWLQPPPVTTMLRPPFPSYP------------------AAPVAAPPGV----- 362

Query: 2350 XXXXXXXXXXSGDQSTVGS-TQEELPPGIDSSKRVINDESKDEASVREQLDAWTAHRTES 2174
                      +  QST  S  Q ELPPG+DS K V + ++K  AS  EQL+ WTAHRTE+
Sbjct: 363  --------PTTASQSTHASGLQPELPPGVDSGKHVNDADTKQGASTSEQLETWTAHRTET 414

Query: 2173 GVVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNT 1994
            G +YYY+SLTG STYEKP+GF+ EP K A QPTP+SWE+LAGTDWA V TNDG++YYYNT
Sbjct: 415  GAIYYYNSLTGESTYEKPAGFRGEPGKVAAQPTPVSWERLAGTDWALVATNDGQKYYYNT 474

Query: 1993 RTQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDA 1814
            +T+LSSWQIP EV ELKKK DAD+L+AQS S++N N   EKGSAP+SLS PA +TGGRDA
Sbjct: 475  KTKLSSWQIPIEVTELKKKHDADALQAQSPSILNVNESAEKGSAPISLSIPAVSTGGRDA 534

Query: 1813 TALRPLGVSGPSSALDLIKRKLQDSGI-AAATSPGPALSGGMVLELNGSKPSEAVTKVSQ 1637
            T+LRP  V G SSALDL+K+KL D G   A +SP PA SG +  E+NGSK  E+ T++ Q
Sbjct: 535  TSLRPSLVPG-SSALDLVKKKLMDFGTPLAVSSPAPASSGVISSEVNGSKALESTTRIPQ 593

Query: 1636 HEDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELP 1457
             E+  EK K+AN               +  PTKE+CIIQFKEMLKERGVAPFSKW+KELP
Sbjct: 594  KENSKEKSKEANDNGNLSESSSDSEDDESVPTKEDCIIQFKEMLKERGVAPFSKWEKELP 653

Query: 1456 KIVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDY 1277
            KIVFDPRFKAIPS+SAR+ LFEHY                    EGFKQLLEEAKEDI  
Sbjct: 654  KIVFDPRFKAIPSYSARKTLFEHYVKTRADEERKEKRAAQKAAVEGFKQLLEEAKEDISE 713

Query: 1276 DTDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQ 1097
            DTDYQ+FK+KW  DPRFE+L RKERE LLNERVL L++ A+EKA A  AAVIS FKSML+
Sbjct: 714  DTDYQSFKKKWSHDPRFESLDRKEREVLLNERVLQLRKAAQEKAHAVRAAVISQFKSMLR 773

Query: 1096 DRGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDX 917
            ++GDIT ++RWSKVKDSL+ D RYKS+KHEDRE LFNEY+SELKAAE+ +   AK K D 
Sbjct: 774  EQGDITLNTRWSKVKDSLRSDPRYKSVKHEDRETLFNEYLSELKAAEQEVARIAKAKHDE 833

Query: 916  XXXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETIKDPQASLTESKPKL 737
                                       KARR EAVESY+ALLVE IKDPQAS TESKPKL
Sbjct: 834  EDKLKERERALRKRKEREEQEVERVRSKARRKEAVESYQALLVEIIKDPQASWTESKPKL 893

Query: 736  EKDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVITADAAARETEDGKTV 557
            EKDPQGRAANPHLDQSD EKLFREHVK L ERC  +FK LLAEVIT +A +RETEDGKTV
Sbjct: 894  EKDPQGRAANPHLDQSDLEKLFREHVKVLYERCVQEFKVLLAEVITVEACSRETEDGKTV 953

Query: 556  VNSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSSV 377
             NSWSTAKQ+LK D RY+KM RKD E+LW R+VE+I R+QKS  D EA+K    RS+ S 
Sbjct: 954  ANSWSTAKQVLKGDLRYSKMARKDSETLWRRYVEDIHRRQKSTLD-EADK---ARSKGSS 1009

Query: 376  DS 371
            DS
Sbjct: 1010 DS 1011


>ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Vitis
            vinifera]
          Length = 1013

 Score =  888 bits (2295), Expect = 0.0
 Identities = 508/986 (51%), Positives = 615/986 (62%), Gaps = 9/986 (0%)
 Frame = -3

Query: 3274 QDASEPKQNSATAYAVVRPSFSY--LNENNLPSGSSQQLSASPAVVQGHSPAGKNASSPT 3101
            Q++++ K  +A  + +  PSFSY  +      SG+SQQL           P+G   SS  
Sbjct: 62   QESAQGKFVNAPPHVLPGPSFSYSGIPHVTTASGTSQQL-----------PSGSVISS-N 109

Query: 3100 PSAQPAFFHPPAPSHTSRPGSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNGNQQM 2921
            P A    F  P P  +S  G       A                         F G+Q  
Sbjct: 110  PLASTVVFQTPVPGPSSSSGPSFSYNIAH--------------------KGAGFPGSQPF 149

Query: 2920 MQNDLSLKTNVRTTQEIGXXXXXXXXXXXXSRPALTNPSPSVTVFAANSFSSMSVRLPPV 2741
                 S   +    QE G              P     S +++V ++      ++ +P  
Sbjct: 150  QS---STDNSGAVAQEAGSMSSASHVSQSVPFPC---SSSTMSVSSSPKMGPTTLWMPSN 203

Query: 2740 PSFQVPPGMPRTPLTPGPPGIASSVPSSSNIIAVPSSVD--SPALPRSFMSTAPVLSSXX 2567
            PSF VP GMP TP TPGPPGIA S P SSN+    +S+D  S  + R+    APV S+  
Sbjct: 204  PSFPVPSGMPVTPGTPGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPA 263

Query: 2566 XXXXXXXXXXXXXXXXXQ--GPWLQSPQISGVVRPPFSPYPNVIPGPF-LPTRPMLPLSV 2396
                                GPWLQ PQ+ G+ RPPF PYP V P PF LP   M   SV
Sbjct: 264  IQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSV 323

Query: 2395 SFPNAQPPGVNLEXXXXXXXXXXXXSGDQ--STVGSTQEELPPGIDSSKRVINDESKDEA 2222
              P++QPPGV               SG    +T G   E  PPGID +K V    +KD A
Sbjct: 324  PLPDSQPPGVTPVGTAGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGA 383

Query: 2221 SVREQLDAWTAHRTESGVVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTD 2042
            +V EQ+DAWTAH+T++GVVYYY++LTG STYEKPS FK E DK  VQPTP+SWEKL GTD
Sbjct: 384  AVNEQVDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTD 443

Query: 2041 WAAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSA 1862
            WA VTTNDGK+YYYNT+T+LSSWQIP E+ E++KKQD+ +LK  ++   NTNV TEKG +
Sbjct: 444  WALVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPS 503

Query: 1861 PVSLSTPAANTGGRDATALRPLGVSGPSSALDLIKRKLQDSGIAAATSPGPALSGGMVLE 1682
            P++LS PA  TGGRDAT LR   V G +SALD+IK+KLQDSG  A +SP  + SG +  E
Sbjct: 504  PIALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPVHS-SGPIASE 562

Query: 1681 LNGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLK 1502
            LNGS+  E   K  Q E+  +K KD N               D GPTKEECIIQFKEMLK
Sbjct: 563  LNGSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLK 622

Query: 1501 ERGVAPFSKWDKELPKIVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXE 1322
            ERGVAPFSKW+KELPKIVFDPRFKAIP +SARR+LFEHY                    E
Sbjct: 623  ERGVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIE 682

Query: 1321 GFKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQ 1142
            GFKQLLEEA EDID+ T+YQTF++KWG DPRFEAL RK+RE LLNERVLPLKR AEEKAQ
Sbjct: 683  GFKQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQ 742

Query: 1141 AEHAAVISNFKSMLQDRGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKA 962
            A  AA +S+FKSML+D+GDIT+S+RWS+VKDSL+ D RYK +KHEDRE LFNEYISELKA
Sbjct: 743  AIRAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKA 802

Query: 961  AEKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVET 782
            AE+ +E +AK+K++                            K RR EAV SY+ALLVET
Sbjct: 803  AEEEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVET 862

Query: 781  IKDPQASLTESKPKLEKDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVI 602
            IKDPQ S TESKPKLEKDPQ RA N  LD SD EKLFREH+K L+ER A +F+ALL+EV+
Sbjct: 863  IKDPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVL 922

Query: 601  TADAAARETEDGKTVVNSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRD 422
            TA+AA +ETEDGKTV+ SWSTAK+LL++D RY KMPRKDRES+W R+ EE+LRKQK  +D
Sbjct: 923  TAEAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQD 982

Query: 421  QEAEKHAEGRSRSSVDSDKYMSGSRR 344
            Q  EKH E + RSSVDS ++ SGSRR
Sbjct: 983  QTEEKHTEVKGRSSVDSGRFPSGSRR 1008


>ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C isoform X4 [Vitis
            vinifera]
          Length = 848

 Score =  881 bits (2277), Expect = 0.0
 Identities = 474/828 (57%), Positives = 568/828 (68%), Gaps = 7/828 (0%)
 Frame = -3

Query: 2806 SPSVTVFAANSFSSMSVRLPPVPSFQVPPGMPRTPLTPGPPGIASSVPSSSNIIAVPSSV 2627
            S +++V ++      ++ +P  PSF VP GMP TP TPGPPGIA S P SSN+    +S+
Sbjct: 17   SSTMSVSSSPKMGPTTLWMPSNPSFPVPSGMPVTPGTPGPPGIAPSTPLSSNLAVPSASM 76

Query: 2626 D--SPALPRSFMSTAPVLSSXXXXXXXXXXXXXXXXXXXQ--GPWLQSPQISGVVRPPFS 2459
            D  S  + R+    APV S+                      GPWLQ PQ+ G+ RPPF 
Sbjct: 77   DFSSSVVSRAIFPAAPVSSNPAIQQQIYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFV 136

Query: 2458 PYPNVIPGPF-LPTRPMLPLSVSFPNAQPPGVNLEXXXXXXXXXXXXSGDQ--STVGSTQ 2288
            PYP V P PF LP   M   SV  P++QPPGV               SG    +T G   
Sbjct: 137  PYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAVSGHHLANTSGMLS 196

Query: 2287 EELPPGIDSSKRVINDESKDEASVREQLDAWTAHRTESGVVYYYSSLTGVSTYEKPSGFK 2108
            E  PPGID +K V    +KD A+V EQ+DAWTAH+T++GVVYYY++LTG STYEKPS FK
Sbjct: 197  ELPPPGIDDNKHVNGAGTKDGAAVNEQVDAWTAHKTDTGVVYYYNALTGESTYEKPSDFK 256

Query: 2107 DEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKKKQDA 1928
             E DK  VQPTP+SWEKL GTDWA VTTNDGK+YYYNT+T+LSSWQIP E+ E++KKQD+
Sbjct: 257  GEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDS 316

Query: 1927 DSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDATALRPLGVSGPSSALDLIKRKL 1748
             +LK  ++   NTNV TEKG +P++LS PA  TGGRDAT LR   V G +SALD+IK+KL
Sbjct: 317  VALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKL 376

Query: 1747 QDSGIAAATSPGPALSGGMVLELNGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXXXXXX 1568
            QDSG  A +SP  + SG +  ELNGS+  E   K  Q E+  +K KD N           
Sbjct: 377  QDSGAPATSSPVHS-SGPIASELNGSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSD 435

Query: 1567 XXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSHSARRALFEH 1388
                D GPTKEECIIQFKEMLKERGVAPFSKW+KELPKIVFDPRFKAIP +SARR+LFEH
Sbjct: 436  SEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEH 495

Query: 1387 YXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEALSRK 1208
            Y                    EGFKQLLEEA EDID+ T+YQTF++KWG DPRFEAL RK
Sbjct: 496  YVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRK 555

Query: 1207 EREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQDRGDITSSSRWSKVKDSLKGDAR 1028
            +RE LLNERVLPLKR AEEKAQA  AA +S+FKSML+D+GDIT+S+RWS+VKDSL+ D R
Sbjct: 556  DRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPR 615

Query: 1027 YKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXXXXXX 848
            YK +KHEDRE LFNEYISELKAAE+ +E +AK+K++                        
Sbjct: 616  YKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERERELRKRKEREEQEME 675

Query: 847  XXXXKARRMEAVESYKALLVETIKDPQASLTESKPKLEKDPQGRAANPHLDQSDSEKLFR 668
                K RR EAV SY+ALLVETIKDPQ S TESKPKLEKDPQ RA N  LD SD EKLFR
Sbjct: 676  RVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFR 735

Query: 667  EHVKTLNERCAVDFKALLAEVITADAAARETEDGKTVVNSWSTAKQLLKNDPRYNKMPRK 488
            EH+K L+ER A +F+ALL+EV+TA+AA +ETEDGKTV+ SWSTAK+LL++D RY KMPRK
Sbjct: 736  EHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRK 795

Query: 487  DRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSSVDSDKYMSGSRR 344
            DRES+W R+ EE+LRKQK  +DQ  EKH E + RSSVDS ++ SGSRR
Sbjct: 796  DRESVWRRYSEEMLRKQKLAQDQTEEKHTEVKGRSSVDSGRFPSGSRR 843


>ref|XP_010250268.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Nelumbo
            nucifera] gi|719963615|ref|XP_010250275.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X1 [Nelumbo
            nucifera]
          Length = 1088

 Score =  874 bits (2258), Expect = 0.0
 Identities = 498/970 (51%), Positives = 609/970 (62%), Gaps = 8/970 (0%)
 Frame = -3

Query: 3220 PSFSYLNENNLPSGSSQQLSASPAVVQGHSPAGKN-ASSPTPSAQPAFFHPPAPSHTSRP 3044
            P+FSY        GSS Q     +   G  P G +   + TPS   A   PP P     P
Sbjct: 124  PTFSYNIIPPAKIGSSAQQKLQSSTDVGSGPLGHSQVGNSTPSTTAASLQPPVPGQPGHP 183

Query: 3043 GSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNGNQQMMQNDLSLKTN--VRTTQEI 2870
             +F PGT AQ M          P+G+ S + +FSFN   Q+ Q DLS  ++  V   +E 
Sbjct: 184  NTFGPGTGAQFMASQGPSPVSVPKGAPSIATSFSFNRIPQLAQKDLSSNSSASVAVAREA 243

Query: 2869 GXXXXXXXXXXXXSRPALTNPSPSVTVFAANSFSSMSVRLPPVPSFQVPPGMPRTPLTPG 2690
            G            S P   +PS S+    + +    ++ +P  PSF  PPGMP TP TPG
Sbjct: 244  GTVSPASSSSVPVSMPFHVSPS-SLAAATSPNLCPATLWMPVAPSFVPPPGMPITPGTPG 302

Query: 2689 PPGIASSVPSSSNIIAVPSSVDSPALPRSFMSTAPVL-SSXXXXXXXXXXXXXXXXXXXQ 2513
            PPGIA S P SS +     ++DS     S  S  PV+ S+                   Q
Sbjct: 303  PPGIAPSTPLSSTVTVNSEAMDSS----SSTSLRPVVPSTVQQQMHSPYPALPSMPPPPQ 358

Query: 2512 GPWLQSPQISGVVRPPFSPYPNVIPGPF-LPTRPMLPLSVSFPNAQPPGVNL--EXXXXX 2342
            G WL  PQI G+ RPPF PYP V+PG + LP R M   SV  P++QPPG++         
Sbjct: 359  GLWLP-PQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISPLGPPGGTP 417

Query: 2341 XXXXXXXSGDQSTVGSTQEELPPGIDSSKRVINDESKDEASVREQLDAWTAHRTESGVVY 2162
                       +T G   +  PPG D  K + +   K  A+V  ++DAWTAH+TE+GVVY
Sbjct: 418  SSSVGSVHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNAKVDAWTAHKTETGVVY 477

Query: 2161 YYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNTRTQL 1982
            YY++LTG STYE+PS F  EPDK  VQPTP+S EKL GTDWA VTTNDGK+YYYN++T++
Sbjct: 478  YYNALTGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWALVTTNDGKKYYYNSKTKI 537

Query: 1981 SSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDATALR 1802
            SSWQ+P EV EL++K D D+LK     V N+   +EK SAP+S++ PA NTGGR+AT+LR
Sbjct: 538  SSWQVPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPISVTAPAINTGGREATSLR 597

Query: 1801 PLGVSGPSSALDLIKRKLQDSGIAAATSPGPALSGGMVLELNGSKPSEAVTKVSQHEDCI 1622
            P GV+G SSALDLIK+KLQDS   A +SP P  SG    +LNGS+P EA  K  Q E+  
Sbjct: 598  PSGVAGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNGSRPVEAAVKGLQSEN-K 656

Query: 1621 EKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELPKIVFD 1442
            +K KD N               D GP+KEECIIQFKEMLKERGVAPFSKW+KELPKIVFD
Sbjct: 657  DKVKDINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFD 716

Query: 1441 PRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDYDTDYQ 1262
            PRFKA+P +SARRALFEHY                    EGFKQLLEEA EDID  TDYQ
Sbjct: 717  PRFKAVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFKQLLEEASEDIDQRTDYQ 776

Query: 1261 TFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQDRGDI 1082
            TFK KWG DPRFEAL RKERE LLNERVLPLK+ AEEKAQA  AA  S FKS+L+++GDI
Sbjct: 777  TFKMKWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIRAAAASGFKSLLREKGDI 836

Query: 1081 TSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDXXXXXX 902
             +SSRWS+VKDSL+ D RYKS+KHEDRE LFNEYISELKAA++  E +AK K++      
Sbjct: 837  NTSSRWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADEEAEREAKVKREEEDKLK 896

Query: 901  XXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETIKDPQASLTESKPKLEKDPQ 722
                                  K +R EAV  Y+ALLVETIKDPQ S TES+P+LEKDPQ
Sbjct: 897  EREREMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKDPQVSWTESRPRLEKDPQ 956

Query: 721  GRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVITADAAARETEDGKTVVNSWS 542
            GRA N  LD  D+EKLFREHVK L ERCA +F+ LL EVIT +AA++ T DGKTV+ SWS
Sbjct: 957  GRATNSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTEAASQMTNDGKTVLTSWS 1016

Query: 541  TAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQEAEK-HAEGRSRSSVDSDK 365
            TAK+LLK DPRY+KMPRK+RE+LW RH EEIL K+K V D + EK + E ++RSS+DS +
Sbjct: 1017 TAKRLLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKEEKLNIETKARSSLDSGR 1076

Query: 364  YMSGSRRNYS 335
              +G RR++S
Sbjct: 1077 SPTGLRRSHS 1086


>ref|XP_008221026.1| PREDICTED: pre-mRNA-processing protein 40C [Prunus mume]
          Length = 858

 Score =  859 bits (2219), Expect = 0.0
 Identities = 465/831 (55%), Positives = 567/831 (68%), Gaps = 5/831 (0%)
 Frame = -3

Query: 2824 PALTNPSPSVTVFAANSFSSMSVRLPPVPSFQVPPGMPRTPLTPGPPGIASSVPSSSNII 2645
            PA T+ S ++ + +A +  + +  +P  PSF +  GMP TP TPGPPGIA  V  S N  
Sbjct: 27   PAPTSSSSTMNLLSAPNMGTTTSWVPTAPSFNLTSGMPGTPGTPGPPGIAHPVQISFNPT 86

Query: 2644 AVPSSVDSPALP-RSFMSTAPVLSSXXXXXXXXXXXXXXXXXXXQ-GPWLQSPQISGVVR 2471
            A  + +DS ++  R  M  APV SS                     G WLQSPQI G  R
Sbjct: 87   APSAPIDSSSVALRPSMQIAPVASSAVQPQVGAPYPSLSSMGAPPQGVWLQSPQIGGFPR 146

Query: 2470 PPFSPYPNVIPGPFLPTRPMLPL-SVSFPNAQPPGVNLEXXXXXXXXXXXXSGDQ-STVG 2297
            PPF PYP   P PF     ++PL SV  P++QPPGV               SG Q +   
Sbjct: 147  PPFLPYPAAFPVPFPSPAHVMPLPSVPLPDSQPPGVTPVGNTAAISSPSAASGHQLAGFS 206

Query: 2296 STQEELP-PGIDSSKRVINDESKDEASVREQLDAWTAHRTESGVVYYYSSLTGVSTYEKP 2120
              Q ELP PGID+ K+  +  +++ ASV EQLDAWTAH+TE+GVVYYY++LTG STY+KP
Sbjct: 207  GIQIELPLPGIDNRKQSHDAGNENRASVNEQLDAWTAHKTETGVVYYYNALTGESTYDKP 266

Query: 2119 SGFKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKK 1940
             GFK+EPDK ++QPTP+S   L+GTDW  VTT+DGK++Y+N++T++SSWQIPNEV+EL+K
Sbjct: 267  PGFKEEPDKVSMQPTPVSTVNLSGTDWVLVTTSDGKKFYHNSKTKVSSWQIPNEVIELRK 326

Query: 1939 KQDADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDATALRPLGVSGPSSALDLI 1760
            KQDAD  K   +S+ N NV+TEKGSAP+SL+ PA N GGR+A A +P  V G SSALDLI
Sbjct: 327  KQDADVPKEHPVSIPNNNVMTEKGSAPISLTAPAINMGGREAMAFKPSAVQGTSSALDLI 386

Query: 1759 KRKLQDSGIAAATSPGPALSGGMVLELNGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXX 1580
            K+KLQDSG    +SP PA S     E NGS+  E+  K  Q ++  +K KD N       
Sbjct: 387  KKKLQDSGAPVTSSPVPAPS-----ESNGSRGVESTPKGQQSDNSKDKLKDINGDGNLSD 441

Query: 1579 XXXXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSHSARRA 1400
                    D GPTKEECI QFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSHSARR+
Sbjct: 442  SSSDSEDADSGPTKEECITQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSHSARRS 501

Query: 1399 LFEHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEA 1220
            LFEHY                    EGFKQLL+EA EDID++TDYQ+F++KW  DPRFEA
Sbjct: 502  LFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDEASEDIDHNTDYQSFRKKWANDPRFEA 561

Query: 1219 LSRKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQDRGDITSSSRWSKVKDSLK 1040
            L RK+RE LLNERVLPLKR AEEKAQA  AA  ++FKSMLQ++GDIT SSRWS+VKDSL+
Sbjct: 562  LDRKDREHLLNERVLPLKRAAEEKAQAARAAASTSFKSMLQEKGDITVSSRWSRVKDSLR 621

Query: 1039 GDARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXX 860
             D RYKS++HEDRE LFN+YIS+LKA E+  E +AK K+D                    
Sbjct: 622  NDPRYKSVRHEDREILFNQYISDLKAVEEEAEREAKAKRDEQEKLRERERELRKRKEREE 681

Query: 859  XXXXXXXXKARRMEAVESYKALLVETIKDPQASLTESKPKLEKDPQGRAANPHLDQSDSE 680
                    K RR EAV +++ALLVETIKDPQAS T SKPKLEKDPQ RAANP L+ SD E
Sbjct: 682  QETERVRLKVRRKEAVATFQALLVETIKDPQASWTGSKPKLEKDPQRRAANPDLEPSDME 741

Query: 679  KLFREHVKTLNERCAVDFKALLAEVITADAAARETEDGKTVVNSWSTAKQLLKNDPRYNK 500
            KLFREH+K LNERCA +F+ALLAEV+TA+AA++ETEDGKTV+NSWSTAK+LLK DPRYNK
Sbjct: 742  KLFREHIKRLNERCAHEFRALLAEVLTAEAASQETEDGKTVLNSWSTAKRLLKPDPRYNK 801

Query: 499  MPRKDRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSSVDSDKYMSGSR 347
            M RK+RE LW R+ EE+LRKQKS  D + ++  + +SRSSVD  +   GSR
Sbjct: 802  MARKEREVLWRRYSEEMLRKQKSALDHKEDRKTDAKSRSSVDGGRVPFGSR 852


>ref|XP_008353148.1| PREDICTED: pre-mRNA-processing protein 40C-like [Malus domestica]
          Length = 981

 Score =  845 bits (2184), Expect = 0.0
 Identities = 492/984 (50%), Positives = 606/984 (61%), Gaps = 12/984 (1%)
 Frame = -3

Query: 3262 EPKQNS---ATAYAVVRPSFSYL--NENNLPSGSSQQLSASPAVVQGHSPAGKNASSP-- 3104
            EP QN+   A ++AV  PSFSY      N+  G+SQQ S S A+ + + PA     +P  
Sbjct: 53   EPLQNTFGNAPSFAVPGPSFSYNVPPNANISFGTSQQSSPSSAI-KSNPPASPVVQAPVH 111

Query: 3103 --TPSAQPAFFHPPAPSHTSRPGSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNGN 2930
              + SA P  ++ P                                      + +SF  N
Sbjct: 112  GLSSSASPFSYNIP-------------------------------------KSGYSFPSN 134

Query: 2929 QQMMQNDLSLKTNVRTTQEIGXXXXXXXXXXXXSRPALTNPSPSVTVFAANSFSSMSVRL 2750
            QQ  Q+ +++   V   QE G            S PA T  + ++ + +  +    ++ +
Sbjct: 135  QQF-QSGMNIPPAV--AQETGNASLSSTSSHSGSLPAPTTSNSTMNISSTPNAGPKTLWV 191

Query: 2749 PPVPSFQVPPGMPRTPLTPGPPGIASSVPSSSNIIAVPSSVDSPALPRSFMSTAPVLSSX 2570
               PSF + PGMP TP TPGPPGIA SV  S N     + +DS    R  M   PV SS 
Sbjct: 192  STAPSFNMTPGMPGTPRTPGPPGIAHSVQISFNPTVPSAPIDSSVANRPSMQAVPVASSA 251

Query: 2569 XXXXXXXXXXXXXXXXXXQGPWLQSPQISGVVRPPFSPYPNVIPGPF-LPTRPMLPLSVS 2393
                                PWL SPQI G+ RPPF PYP   PGPF LP   M   SV 
Sbjct: 252  VQPHVSAPYPSLSAMG---APWLSSPQIGGLPRPPFLPYPAAFPGPFPLPAHVMPLASVP 308

Query: 2392 FPNAQPPGVNLEXXXXXXXXXXXXSGDQSTVGST-QEELP-PGIDSSKRVINDESKDEAS 2219
             P++QPPGV               SG Q    S  Q+ELP PG+    R         A+
Sbjct: 309  LPDSQPPGVTPVGNTAANAVSSVGSGHQLAGSSVMQKELPHPGVGPENR---------AA 359

Query: 2218 VREQLDAWTAHRTESGVVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDW 2039
            V EQL AWTAH+TE+GVVYYY++LTG STY+KP GFK+EPDK ++QPTP+S   LAGTDW
Sbjct: 360  VNEQLVAWTAHKTETGVVYYYNALTGESTYDKPPGFKEEPDKVSMQPTPVSTVNLAGTDW 419

Query: 2038 AAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAP 1859
              VTT+DGK++Y+N++T++SSWQIPNEV+ELKK+QD+D  K  +LSV N N++ EKGSAP
Sbjct: 420  VLVTTSDGKKFYHNSKTKVSSWQIPNEVIELKKQQDSDVPKEHTLSVPNNNLMIEKGSAP 479

Query: 1858 VSLSTPAANTGGRDATALRPLGVSGPSSALDLIKRKLQDSGIAAATSPGPALSGGMVLEL 1679
            VS+S PA NTGGR+A   +P  V G SSALDLIKRKLQD      +SP PA S     E 
Sbjct: 480  VSMSAPAINTGGREAMPFKPSAVLGTSSALDLIKRKLQD---PVTSSPIPAPS-----ES 531

Query: 1678 NGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKE 1499
            NG++  E+  K  Q E+  +K K+ N               D GPTKEECIIQFKEMLKE
Sbjct: 532  NGARGVESTPKGQQSENSKDKLKETNGDGNLSDSSSDSEDADSGPTKEECIIQFKEMLKE 591

Query: 1498 RGVAPFSKWDKELPKIVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEG 1319
            RGVAPFSKW+KELPKIVFDPRFKAIPSH ARR+LFEHY                    EG
Sbjct: 592  RGVAPFSKWEKELPKIVFDPRFKAIPSHEARRSLFEHYVKTRAEEERKEKRAAQKAAIEG 651

Query: 1318 FKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQA 1139
            FKQLL+EA EDID +TDYQ+F+RKWG DPRFEAL RK+RE LLNERVLPLKR AEEK QA
Sbjct: 652  FKQLLDEASEDIDRNTDYQSFRRKWGNDPRFEALDRKDREHLLNERVLPLKRAAEEKVQA 711

Query: 1138 EHAAVISNFKSMLQDRGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAA 959
              AA  + FKSML+++GDIT SSRWS+VKD+L+ D RYK+++HEDRE LFNEYIS LKA 
Sbjct: 712  VRAAASAGFKSMLKEKGDITVSSRWSRVKDNLRNDPRYKNVRHEDREALFNEYISGLKAV 771

Query: 958  EKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETI 779
            E+  E +AK K+D                            K RR EAV +++ALLVETI
Sbjct: 772  EEEAEREAKAKRDEQEKLRERERELRKRKEREEQETERVRLKVRRKEAVATFQALLVETI 831

Query: 778  KDPQASLTESKPKLEKDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVIT 599
            KDPQAS T S+PKLEKDPQ RAANP LD SD EKLFREHVK LNERCA +F+ LLAEV+T
Sbjct: 832  KDPQASWTGSRPKLEKDPQRRAANPDLDPSDMEKLFREHVKMLNERCAHEFRTLLAEVLT 891

Query: 598  ADAAARETEDGKTVVNSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQ 419
            A+AA++ETEDGKTV+NSWSTAK++LK DPRY+K PRK+RE LW R+ EE+LRKQKS  DQ
Sbjct: 892  AEAASQETEDGKTVLNSWSTAKRILKVDPRYDKTPRKEREVLWRRYSEEMLRKQKSAVDQ 951

Query: 418  EAEKHAEGRSRSSVDSDKYMSGSR 347
            + ++  + ++RSS D+ +   GSR
Sbjct: 952  KEDRKTDAKTRSSADAGRNPYGSR 975


>ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Nelumbo
            nucifera]
          Length = 894

 Score =  840 bits (2170), Expect = 0.0
 Identities = 471/887 (53%), Positives = 577/887 (65%), Gaps = 7/887 (0%)
 Frame = -3

Query: 2974 QGSSSHSANFSFNGNQQMMQNDLSLKTN--VRTTQEIGXXXXXXXXXXXXSRPALTNPSP 2801
            +G+ S + +FSFN   Q+ Q DLS  ++  V   +E G            S P   +PS 
Sbjct: 13   KGAPSIATSFSFNRIPQLAQKDLSSNSSASVAVAREAGTVSPASSSSVPVSMPFHVSPS- 71

Query: 2800 SVTVFAANSFSSMSVRLPPVPSFQVPPGMPRTPLTPGPPGIASSVPSSSNIIAVPSSVDS 2621
            S+    + +    ++ +P  PSF  PPGMP TP TPGPPGIA S P SS +     ++DS
Sbjct: 72   SLAAATSPNLCPATLWMPVAPSFVPPPGMPITPGTPGPPGIAPSTPLSSTVTVNSEAMDS 131

Query: 2620 PALPRSFMSTAPVL-SSXXXXXXXXXXXXXXXXXXXQGPWLQSPQISGVVRPPFSPYPNV 2444
                 S  S  PV+ S+                   QG WL  PQI G+ RPPF PYP V
Sbjct: 132  S----SSTSLRPVVPSTVQQQMHSPYPALPSMPPPPQGLWLP-PQIGGLQRPPFLPYPGV 186

Query: 2443 IPGPF-LPTRPMLPLSVSFPNAQPPGVNL--EXXXXXXXXXXXXSGDQSTVGSTQEELPP 2273
            +PG + LP R M   SV  P++QPPG++                    +T G   +  PP
Sbjct: 187  LPGSYPLPMRGMPLPSVPVPDSQPPGISPLGPPGGTPSSSVGSVHLPSNTTGKQPDLPPP 246

Query: 2272 GIDSSKRVINDESKDEASVREQLDAWTAHRTESGVVYYYSSLTGVSTYEKPSGFKDEPDK 2093
            G D  K + +   K  A+V  ++DAWTAH+TE+GVVYYY++LTG STYE+PS F  EPDK
Sbjct: 247  GTDQHKHIDDLADKVGATVNAKVDAWTAHKTETGVVYYYNALTGESTYERPSEFHGEPDK 306

Query: 2092 AAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKKKQDADSLKA 1913
              VQPTP+S EKL GTDWA VTTNDGK+YYYN++T++SSWQ+P EV EL++K D D+LK 
Sbjct: 307  VTVQPTPVSCEKLVGTDWALVTTNDGKKYYYNSKTKISSWQVPMEVTELRRKYDDDALKG 366

Query: 1912 QSLSVINTNVITEKGSAPVSLSTPAANTGGRDATALRPLGVSGPSSALDLIKRKLQDSGI 1733
                V N+   +EK SAP+S++ PA NTGGR+AT+LRP GV+G SSALDLIK+KLQDS  
Sbjct: 367  NMTLVQNSVAFSEKLSAPISVTAPAINTGGREATSLRPSGVAGSSSALDLIKKKLQDSIA 426

Query: 1732 AAATSPGPALSGGMVLELNGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXXXXXXXXXXD 1553
             A +SP P  SG    +LNGS+P EA  K  Q E+  +K KD N               D
Sbjct: 427  PATSSPLPTSSGPTTADLNGSRPVEAAVKGLQSEN-KDKVKDINGDGNISDSSSDSEDED 485

Query: 1552 RGPTKEECIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSHSARRALFEHYXXXX 1373
             GP+KEECIIQFKEMLKERGVAPFSKW+KELPKIVFDPRFKA+P +SARRALFEHY    
Sbjct: 486  SGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAVPGYSARRALFEHYVRTR 545

Query: 1372 XXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEALSRKEREFL 1193
                            EGFKQLLEEA EDID  TDYQTFK KWG DPRFEAL RKERE L
Sbjct: 546  AEEERKEKRAAQKAAIEGFKQLLEEASEDIDQRTDYQTFKMKWGSDPRFEALDRKERELL 605

Query: 1192 LNERVLPLKRTAEEKAQAEHAAVISNFKSMLQDRGDITSSSRWSKVKDSLKGDARYKSIK 1013
            LNERVLPLK+ AEEKAQA  AA  S FKS+L+++GDI +SSRWS+VKDSL+ D RYKS+K
Sbjct: 606  LNERVLPLKKAAEEKAQAIRAAAASGFKSLLREKGDINTSSRWSRVKDSLRSDPRYKSVK 665

Query: 1012 HEDREKLFNEYISELKAAEKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXK 833
            HEDRE LFNEYISELKAA++  E +AK K++                            K
Sbjct: 666  HEDRELLFNEYISELKAADEEAEREAKVKREEEDKLKEREREMRKRKEREEQEMERVRLK 725

Query: 832  ARRMEAVESYKALLVETIKDPQASLTESKPKLEKDPQGRAANPHLDQSDSEKLFREHVKT 653
             +R EAV  Y+ALLVETIKDPQ S TES+P+LEKDPQGRA N  LD  D+EKLFREHVK 
Sbjct: 726  VQRKEAVACYQALLVETIKDPQVSWTESRPRLEKDPQGRATNSVLDSGDAEKLFREHVKI 785

Query: 652  LNERCAVDFKALLAEVITADAAARETEDGKTVVNSWSTAKQLLKNDPRYNKMPRKDRESL 473
            L ERCA +F+ LL EVIT +AA++ T DGKTV+ SWSTAK+LLK DPRY+KMPRK+RE+L
Sbjct: 786  LYERCAREFRTLLCEVITTEAASQMTNDGKTVLTSWSTAKRLLKTDPRYSKMPRKEREAL 845

Query: 472  WWRHVEEILRKQKSVRDQEAEK-HAEGRSRSSVDSDKYMSGSRRNYS 335
            W RH EEIL K+K V D + EK + E ++RSS+DS +  +G RR++S
Sbjct: 846  WRRHAEEILWKKKLVSDPKEEKLNIETKARSSLDSGRSPTGLRRSHS 892


>ref|XP_007221939.1| hypothetical protein PRUPE_ppa001490mg [Prunus persica]
            gi|462418875|gb|EMJ23138.1| hypothetical protein
            PRUPE_ppa001490mg [Prunus persica]
          Length = 814

 Score =  840 bits (2169), Expect = 0.0
 Identities = 462/807 (57%), Positives = 546/807 (67%), Gaps = 5/807 (0%)
 Frame = -3

Query: 2752 LPPVPSFQVPPGMPRTPLTPGPPGIASSVPSSSNIIAVPSSVDSPALP-RSFMSTAPVLS 2576
            +P  PSF +  GMP TP TPGPPGIA  V  S N  A  + +DS ++  R  M  APV S
Sbjct: 16   VPTGPSFNLTSGMPGTPGTPGPPGIAHPVQISFNPTAPSAPIDSSSVALRPSMQIAPVAS 75

Query: 2575 SXXXXXXXXXXXXXXXXXXXQ-GPWLQSPQISGVVRPPFSPYPNVIPGPF-LPTRPMLPL 2402
            S                     G WLQSPQI G  RPPF PYP   PGPF LP   M   
Sbjct: 76   SAVQPQVGAPYLSLSSMGAPPQGVWLQSPQIGGFPRPPFLPYPAAFPGPFPLPAHVMPLP 135

Query: 2401 SVSFPNAQPPGVNLEXXXXXXXXXXXXSGDQSTVGS-TQEELP-PGIDSSKRVINDESKD 2228
            SV  P++QPPGV               SG Q    S  Q ELP PGI +  R        
Sbjct: 136  SVPLPDSQPPGVIPVGNTAAISSPSAASGHQLAGSSGIQIELPHPGIGNENR-------- 187

Query: 2227 EASVREQLDAWTAHRTESGVVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAG 2048
             ASV EQLDAWTAH+TE+GVVYYY++LTG STY+KP GFK+EPDK ++QPTP+S   L+G
Sbjct: 188  -ASVNEQLDAWTAHKTETGVVYYYNALTGESTYDKPPGFKEEPDKVSMQPTPVSTVNLSG 246

Query: 2047 TDWAAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKG 1868
            TDW  VTT+DGK++Y+N +T++SSWQIPNEV+EL+KKQDAD  K   +S+   NV+TEKG
Sbjct: 247  TDWVLVTTSDGKKFYHNGKTKVSSWQIPNEVIELRKKQDADVPKEHPVSIPINNVMTEKG 306

Query: 1867 SAPVSLSTPAANTGGRDATALRPLGVSGPSSALDLIKRKLQDSGIAAATSPGPALSGGMV 1688
            SAP+SL+ PA NTGGR+A A +P  V G SSALDLIK+KLQDSG    +SP PA S    
Sbjct: 307  SAPISLTAPAINTGGREAMAFKPSAVQGTSSALDLIKKKLQDSGAPVTSSPVPAPS---- 362

Query: 1687 LELNGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEM 1508
             E NGS+  E+  K  Q ++  +K KD N               D GPTKEECI QFKEM
Sbjct: 363  -ESNGSRGVESTPKGQQSDNSKDKLKDINGDGNLSDSSSDSEDADSGPTKEECITQFKEM 421

Query: 1507 LKERGVAPFSKWDKELPKIVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXX 1328
            LKERGVAPFSKW+KELPKIVFDPRFKAIPSHSARR+LFEHY                   
Sbjct: 422  LKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAA 481

Query: 1327 XEGFKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEK 1148
             EGFKQLL+EA EDID+ TDYQ+F++KW  DPRFEAL RK+RE LLNERVLPLKR AEEK
Sbjct: 482  IEGFKQLLDEASEDIDHKTDYQSFRKKWANDPRFEALDRKDREHLLNERVLPLKRAAEEK 541

Query: 1147 AQAEHAAVISNFKSMLQDRGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISEL 968
            AQA  AA  ++FKSMLQ++GDIT SSRWS+VKDSL+ D RYKS++HEDRE LFN+YIS+L
Sbjct: 542  AQAVRAAAATSFKSMLQEKGDITVSSRWSRVKDSLRNDPRYKSLRHEDREILFNQYISDL 601

Query: 967  KAAEKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLV 788
            KA E+  E +AK K+D                            K RR EAV +++ALLV
Sbjct: 602  KAVEEEAEREAKAKRDEQEKLRERERELRKRKEREEQETERVRLKVRRKEAVATFQALLV 661

Query: 787  ETIKDPQASLTESKPKLEKDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAE 608
            ETIKDPQAS T SKPKLEKDPQ RAANP L+ SD EKLFREH+K LNERCA +F+ALLAE
Sbjct: 662  ETIKDPQASWTGSKPKLEKDPQRRAANPDLEPSDMEKLFREHIKRLNERCAHEFRALLAE 721

Query: 607  VITADAAARETEDGKTVVNSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSV 428
            V+TA+AA++ETEDGKTV+NSWSTAK+LLK DPRYNKM RK+RE LW R  EE+LRKQKS 
Sbjct: 722  VLTAEAASQETEDGKTVLNSWSTAKRLLKPDPRYNKMARKEREVLWRRFSEEMLRKQKSA 781

Query: 427  RDQEAEKHAEGRSRSSVDSDKYMSGSR 347
             D + ++  + +SRSSVDS +   GSR
Sbjct: 782  LDHKEDRKTDAKSRSSVDSGRVPFGSR 808


>ref|XP_012089634.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Jatropha
            curcas] gi|802761009|ref|XP_012089635.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X1 [Jatropha
            curcas] gi|802761012|ref|XP_012089636.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X1 [Jatropha
            curcas] gi|802761015|ref|XP_012089637.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X1 [Jatropha
            curcas]
          Length = 846

 Score =  838 bits (2164), Expect = 0.0
 Identities = 454/832 (54%), Positives = 551/832 (66%), Gaps = 5/832 (0%)
 Frame = -3

Query: 2818 LTNPSPSVTVFAANSFSSMSVRLPPVPSFQVPPGMPRTPLTPGPPGIASSVPSSSNIIAV 2639
            L +PS S    + N   S S ++P VPS  VPP +  T   P    + S  P +  + +V
Sbjct: 15   LHSPSSSTLPSSPNLGPSTS-QMPVVPSLLVPPRLAGTTRAPESSALVSCAPMT--LPSV 71

Query: 2638 PSSVDSPALPRSFMSTAPVLSSXXXXXXXXXXXXXXXXXXXQ--GPWLQSPQISGVVRPP 2465
            P    S A+ R  M T    S+                      G W Q PQ+ G+ RPP
Sbjct: 72   PVDPASSAVQRPMMLTNTPASNPVVQQQAYPTYPSLPAMAAPPQGLWFQPPQMGGLPRPP 131

Query: 2464 FSPYPNVIPGPF-LPTRPMLPLSVSFPNAQPPGVNLEXXXXXXXXXXXXSGDQ--STVGS 2294
            F PYP V PGPF LP   +   SVS P++QPPGV               SG Q   T G 
Sbjct: 132  FLPYPAVFPGPFPLPAHSIPRASVSSPDSQPPGVTPVGTAGANPPSSAASGLQLIGTPGM 191

Query: 2293 TQEELPPGIDSSKRVINDESKDEASVREQLDAWTAHRTESGVVYYYSSLTGVSTYEKPSG 2114
             +E  PPGID+   +   ++KD  ++ E LD+WTAH+T++G+VYYY+++T VSTYEKP G
Sbjct: 192  QKELPPPGIDNKDHIHVFDNKDNVAINEPLDSWTAHKTDTGIVYYYNAITRVSTYEKPLG 251

Query: 2113 FKDEPDKAAVQPTPISWEKLAGTDWAAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKKKQ 1934
            FK EP+K  +QPTP+S E LAGTDWA +TTNDGK+YYYN +T+LSSWQIP+EV EL KKQ
Sbjct: 252  FKGEPEKVPMQPTPVSMENLAGTDWALITTNDGKKYYYNNKTKLSSWQIPSEVTELTKKQ 311

Query: 1933 DADSLKAQSLSVINTNVITEKGSAPVSLSTPAANTGGRDATALRPLGVSGPSSALDLIKR 1754
            +A+  K   +S++ +NV TEKGS PVSLS PA NTGGRDATALR     GPSSALDLIK+
Sbjct: 312  EAEVSKELEVSLLRSNVSTEKGSGPVSLSAPAINTGGRDATALRTSSAPGPSSALDLIKK 371

Query: 1753 KLQDSGIAAATSPGPALSGGMVLELNGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXXXX 1574
            KLQ+SG    +SP     G    E NGS+ +EA  K    E   +K KD N         
Sbjct: 372  KLQESGTPVNSSPALVSLGMGTPESNGSRAAEATAKGLLSETSNDKLKDTNGGGNASDSS 431

Query: 1573 XXXXXXDRGPTKEECIIQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSHSARRALF 1394
                  D GPTKEECIIQFKEMLKERG+APFSKW+KELPKIVFDPRFKAIPSHSARR+LF
Sbjct: 432  SDSEDEDSGPTKEECIIQFKEMLKERGIAPFSKWEKELPKIVFDPRFKAIPSHSARRSLF 491

Query: 1393 EHYXXXXXXXXXXXXXXXXXXXXEGFKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEALS 1214
            EHY                    EGFKQLL EA EDID  TDYQTF++KW  DPRFEAL 
Sbjct: 492  EHYVKTRAEEQRKEKRASQKAAIEGFKQLLVEASEDIDQYTDYQTFRKKWENDPRFEALD 551

Query: 1213 RKEREFLLNERVLPLKRTAEEKAQAEHAAVISNFKSMLQDRGDITSSSRWSKVKDSLKGD 1034
            RK+RE LLNERV+PLK+ A+EK QAE AA  ++FKSMLQD+GDIT +SRWSKVK+SL+ D
Sbjct: 552  RKDREHLLNERVIPLKKAAQEKVQAERAAAAASFKSMLQDKGDITINSRWSKVKESLRND 611

Query: 1033 ARYKSIKHEDREKLFNEYISELKAAEKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXXXX 854
             RYKS+KHEDRE LFNEY+SELKA E+  E +AK K++                      
Sbjct: 612  PRYKSVKHEDREFLFNEYLSELKAVEEEAEREAKVKKEEQEKLKERERELRKRKEREEQE 671

Query: 853  XXXXXXKARRMEAVESYKALLVETIKDPQASLTESKPKLEKDPQGRAANPHLDQSDSEKL 674
                  K RR EAV S++ALLVETIKDPQAS TESKPKLEKD QGRA NP LD SD+EKL
Sbjct: 672  MERVRLKVRRKEAVSSFQALLVETIKDPQASWTESKPKLEKDSQGRATNPDLDPSDTEKL 731

Query: 673  FREHVKTLNERCAVDFKALLAEVITADAAARETEDGKTVVNSWSTAKQLLKNDPRYNKMP 494
            FREHVK L+ERC  DFKALLAEVI A+ AA+++E+GKTV++SWST K+LLK DPRYNKMP
Sbjct: 732  FREHVKMLHERCTQDFKALLAEVINAETAAQKSENGKTVLDSWSTVKRLLKPDPRYNKMP 791

Query: 493  RKDRESLWWRHVEEILRKQKSVRDQEAEKHAEGRSRSSVDSDKYMSGSRRNY 338
            RK+RE LW R+ ++ILRKQ++  DQ+ EKH + +SR+S DS +Y+SGSRR +
Sbjct: 792  RKEREILWRRYTQDILRKQQTTLDQKEEKHTDSKSRNSADSGRYLSGSRRTH 843


>ref|XP_012089638.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Jatropha
            curcas] gi|802761021|ref|XP_012089639.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X2 [Jatropha
            curcas] gi|802761024|ref|XP_012089640.1| PREDICTED:
            pre-mRNA-processing protein 40C isoform X2 [Jatropha
            curcas]
          Length = 817

 Score =  835 bits (2157), Expect = 0.0
 Identities = 448/815 (54%), Positives = 543/815 (66%), Gaps = 5/815 (0%)
 Frame = -3

Query: 2767 SMSVRLPPVPSFQVPPGMPRTPLTPGPPGIASSVPSSSNIIAVPSSVDSPALPRSFMSTA 2588
            S +  +P VPS  VPP +  T   P    + S  P +  + +VP    S A+ R  M T 
Sbjct: 2    SSTSTMPVVPSLLVPPRLAGTTRAPESSALVSCAPMT--LPSVPVDPASSAVQRPMMLTN 59

Query: 2587 PVLSSXXXXXXXXXXXXXXXXXXXQ--GPWLQSPQISGVVRPPFSPYPNVIPGPF-LPTR 2417
               S+                      G W Q PQ+ G+ RPPF PYP V PGPF LP  
Sbjct: 60   TPASNPVVQQQAYPTYPSLPAMAAPPQGLWFQPPQMGGLPRPPFLPYPAVFPGPFPLPAH 119

Query: 2416 PMLPLSVSFPNAQPPGVNLEXXXXXXXXXXXXSGDQ--STVGSTQEELPPGIDSSKRVIN 2243
             +   SVS P++QPPGV               SG Q   T G  +E  PPGID+   +  
Sbjct: 120  SIPRASVSSPDSQPPGVTPVGTAGANPPSSAASGLQLIGTPGMQKELPPPGIDNKDHIHV 179

Query: 2242 DESKDEASVREQLDAWTAHRTESGVVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISW 2063
             ++KD  ++ E LD+WTAH+T++G+VYYY+++T VSTYEKP GFK EP+K  +QPTP+S 
Sbjct: 180  FDNKDNVAINEPLDSWTAHKTDTGIVYYYNAITRVSTYEKPLGFKGEPEKVPMQPTPVSM 239

Query: 2062 EKLAGTDWAAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNV 1883
            E LAGTDWA +TTNDGK+YYYN +T+LSSWQIP+EV EL KKQ+A+  K   +S++ +NV
Sbjct: 240  ENLAGTDWALITTNDGKKYYYNNKTKLSSWQIPSEVTELTKKQEAEVSKELEVSLLRSNV 299

Query: 1882 ITEKGSAPVSLSTPAANTGGRDATALRPLGVSGPSSALDLIKRKLQDSGIAAATSPGPAL 1703
             TEKGS PVSLS PA NTGGRDATALR     GPSSALDLIK+KLQ+SG    +SP    
Sbjct: 300  STEKGSGPVSLSAPAINTGGRDATALRTSSAPGPSSALDLIKKKLQESGTPVNSSPALVS 359

Query: 1702 SGGMVLELNGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECII 1523
             G    E NGS+ +EA  K    E   +K KD N               D GPTKEECII
Sbjct: 360  LGMGTPESNGSRAAEATAKGLLSETSNDKLKDTNGGGNASDSSSDSEDEDSGPTKEECII 419

Query: 1522 QFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXX 1343
            QFKEMLKERG+APFSKW+KELPKIVFDPRFKAIPSHSARR+LFEHY              
Sbjct: 420  QFKEMLKERGIAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEQRKEKRA 479

Query: 1342 XXXXXXEGFKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKR 1163
                  EGFKQLL EA EDID  TDYQTF++KW  DPRFEAL RK+RE LLNERV+PLK+
Sbjct: 480  SQKAAIEGFKQLLVEASEDIDQYTDYQTFRKKWENDPRFEALDRKDREHLLNERVIPLKK 539

Query: 1162 TAEEKAQAEHAAVISNFKSMLQDRGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNE 983
             A+EK QAE AA  ++FKSMLQD+GDIT +SRWSKVK+SL+ D RYKS+KHEDRE LFNE
Sbjct: 540  AAQEKVQAERAAAAASFKSMLQDKGDITINSRWSKVKESLRNDPRYKSVKHEDREFLFNE 599

Query: 982  YISELKAAEKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESY 803
            Y+SELKA E+  E +AK K++                            K RR EAV S+
Sbjct: 600  YLSELKAVEEEAEREAKVKKEEQEKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSF 659

Query: 802  KALLVETIKDPQASLTESKPKLEKDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFK 623
            +ALLVETIKDPQAS TESKPKLEKD QGRA NP LD SD+EKLFREHVK L+ERC  DFK
Sbjct: 660  QALLVETIKDPQASWTESKPKLEKDSQGRATNPDLDPSDTEKLFREHVKMLHERCTQDFK 719

Query: 622  ALLAEVITADAAARETEDGKTVVNSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILR 443
            ALLAEVI A+ AA+++E+GKTV++SWST K+LLK DPRYNKMPRK+RE LW R+ ++ILR
Sbjct: 720  ALLAEVINAETAAQKSENGKTVLDSWSTVKRLLKPDPRYNKMPRKEREILWRRYTQDILR 779

Query: 442  KQKSVRDQEAEKHAEGRSRSSVDSDKYMSGSRRNY 338
            KQ++  DQ+ EKH + +SR+S DS +Y+SGSRR +
Sbjct: 780  KQQTTLDQKEEKHTDSKSRNSADSGRYLSGSRRTH 814


>ref|XP_009351698.1| PREDICTED: pre-mRNA-processing protein 40C [Pyrus x bretschneideri]
          Length = 981

 Score =  835 bits (2157), Expect = 0.0
 Identities = 486/984 (49%), Positives = 604/984 (61%), Gaps = 12/984 (1%)
 Frame = -3

Query: 3262 EPKQN---SATAYAVVRPSFSYL--NENNLPSGSSQQLSASPAVVQGHSPAGKNASSP-- 3104
            EP QN   +A ++AV  PSFSY      N+  G+SQQ S S A+ + + PA     +P  
Sbjct: 53   EPLQNKFGNAPSFAVPAPSFSYNVPPNANISFGTSQQSSPSSAI-KSNPPASPMVQAPVH 111

Query: 3103 --TPSAQPAFFHPPAPSHTSRPGSFVPGTTAQLMNXXXXXXXXXPQGSSSHSANFSFNGN 2930
              + SA P  ++ P                                      + +SF  N
Sbjct: 112  GLSSSASPFSYNIP-------------------------------------KSGYSFPSN 134

Query: 2929 QQMMQNDLSLKTNVRTTQEIGXXXXXXXXXXXXSRPALTNPSPSVTVFAANSFSSMSVRL 2750
            QQ  Q+ +++   V   QE G            S PA T+ + ++ + +  +    ++ +
Sbjct: 135  QQF-QSGMNIPPAV--AQETGNALLSSTSTHSGSLPAPTSSNSTMNISSTPNAGPKTLWV 191

Query: 2749 PPVPSFQVPPGMPRTPLTPGPPGIASSVPSSSNIIAVPSSVDSPALPRSFMSTAPVLSSX 2570
               PSF + PGMP TP TPGPPGIA SV  S N  A  + +DS    R  M   PV SS 
Sbjct: 192  STAPSFNMTPGMPGTPRTPGPPGIAHSVQISFNPTAPSAPIDSSVANRPSMQAVPVASSA 251

Query: 2569 XXXXXXXXXXXXXXXXXXQGPWLQSPQISGVVRPPFSPYPNVIPGPF-LPTRPMLPLSVS 2393
                                PWL SPQI G+ RPPF PYP   PGPF LP   M   SV 
Sbjct: 252  VQPHVGAPYPSLSAMG---APWLSSPQIGGLARPPFLPYPAAFPGPFPLPAHVMPLASVP 308

Query: 2392 FPNAQPPGVNLEXXXXXXXXXXXXSGDQSTVGST-QEELP-PGIDSSKRVINDESKDEAS 2219
             P++QPPGV               SG QS   S  Q+ELP PG+    R         A+
Sbjct: 309  LPDSQPPGVTPVGNTAANSVSSVGSGHQSAGSSVMQKELPHPGVGPENR---------AA 359

Query: 2218 VREQLDAWTAHRTESGVVYYYSSLTGVSTYEKPSGFKDEPDKAAVQPTPISWEKLAGTDW 2039
            V EQL AWTAH+TE+GVVYYY++LTG STY+KP GFK+EPDK ++QPTP+S   LAGTDW
Sbjct: 360  VNEQLVAWTAHKTETGVVYYYNALTGESTYDKPPGFKEEPDKVSMQPTPVSTVNLAGTDW 419

Query: 2038 AAVTTNDGKRYYYNTRTQLSSWQIPNEVMELKKKQDADSLKAQSLSVINTNVITEKGSAP 1859
              VTT+DGK++Y+N++T++SSWQIPNEV+ELK++QD+D  K  + SV N N++ EKG AP
Sbjct: 420  VLVTTSDGKKFYHNSKTKVSSWQIPNEVIELKEQQDSDVPKEHTPSVPNNNLMIEKGPAP 479

Query: 1858 VSLSTPAANTGGRDATALRPLGVSGPSSALDLIKRKLQDSGIAAATSPGPALSGGMVLEL 1679
            VS+S PA NTGGR+A   +P  V G SSALDLIKRKLQD      +SP PA S     E 
Sbjct: 480  VSMSAPAINTGGREAMPFKPSAVQGTSSALDLIKRKLQD---PVTSSPIPAPS-----ES 531

Query: 1678 NGSKPSEAVTKVSQHEDCIEKRKDANXXXXXXXXXXXXXXXDRGPTKEECIIQFKEMLKE 1499
            NG++  E+  K  Q E+  +K K+ N               D GP+KEECIIQFKEMLKE
Sbjct: 532  NGARGVESTPKGQQSENSKDKLKETNGDGNLSDSSSDSEDADSGPSKEECIIQFKEMLKE 591

Query: 1498 RGVAPFSKWDKELPKIVFDPRFKAIPSHSARRALFEHYXXXXXXXXXXXXXXXXXXXXEG 1319
            RGVAPFSKW+KELPKIVFDPRFKAIPSH ARR+LFEHY                    EG
Sbjct: 592  RGVAPFSKWEKELPKIVFDPRFKAIPSHEARRSLFEHYVKTRAEEERKEKRAAQKAAIEG 651

Query: 1318 FKQLLEEAKEDIDYDTDYQTFKRKWGKDPRFEALSRKEREFLLNERVLPLKRTAEEKAQA 1139
            FKQLL+EA EDID +TDYQ+F++KWG D RFEAL RK+RE LLNERVLPLKR AEEKAQA
Sbjct: 652  FKQLLDEASEDIDRNTDYQSFRKKWGNDSRFEALDRKDREHLLNERVLPLKRAAEEKAQA 711

Query: 1138 EHAAVISNFKSMLQDRGDITSSSRWSKVKDSLKGDARYKSIKHEDREKLFNEYISELKAA 959
              AA  + FKSML+++GD+T SSRWS+VKDSL+ D RYK+++HEDRE LFNEYI  LKA 
Sbjct: 712  VRAAASAGFKSMLKEKGDVTVSSRWSRVKDSLRNDPRYKNVRHEDREVLFNEYILGLKAV 771

Query: 958  EKSIEGKAKTKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXKARRMEAVESYKALLVETI 779
            E+  E +AK K+D                            K RR EA  +++ALLVETI
Sbjct: 772  EEEAEREAKAKRDEQEKLRERERELRKRKEREEQETERVRLKVRRKEAFATFQALLVETI 831

Query: 778  KDPQASLTESKPKLEKDPQGRAANPHLDQSDSEKLFREHVKTLNERCAVDFKALLAEVIT 599
            KDPQAS T S+PKLEKDPQ RAANP LD SD EKLFREHVK LNERCA +F+ LLAEV+T
Sbjct: 832  KDPQASWTGSRPKLEKDPQRRAANPDLDPSDMEKLFREHVKMLNERCAHEFRTLLAEVLT 891

Query: 598  ADAAARETEDGKTVVNSWSTAKQLLKNDPRYNKMPRKDRESLWWRHVEEILRKQKSVRDQ 419
            A+AA++ETEDGKTV+NSWSTAK++LK D RY+K PRK+RE LW R+ EE+LRKQKS  DQ
Sbjct: 892  AEAASQETEDGKTVLNSWSTAKRILKVDTRYDKTPRKEREVLWRRYSEEMLRKQKSAVDQ 951

Query: 418  EAEKHAEGRSRSSVDSDKYMSGSR 347
            + ++  + ++RSS D+ +   GSR
Sbjct: 952  KEDRRTDAKTRSSADAGRNPYGSR 975

