BLASTX nr result

ID: Paeonia24_contig00004030 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00004030
         (3288 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI19367.3| unnamed protein product [Vitis vinifera]             1131   0.0  
ref|XP_002283496.2| PREDICTED: pre-mRNA-processing factor 40 hom...  1098   0.0  
ref|XP_007018436.1| Pre-mRNA-processing protein 40A isoform 1 [T...  1042   0.0  
ref|XP_007227030.1| hypothetical protein PRUPE_ppa000697mg [Prun...  1039   0.0  
ref|XP_007018438.1| Pre-mRNA-processing protein 40A isoform 3 [T...  1036   0.0  
ref|XP_007018440.1| Pre-mRNA-processing protein 40A isoform 5 [T...  1024   0.0  
ref|XP_007018439.1| Pre-mRNA-processing protein 40A isoform 4 [T...   964   0.0  
ref|XP_002320019.2| FF domain-containing family protein [Populus...   952   0.0  
gb|EXC51391.1| Pre-mRNA-processing factor 40-A-like protein [Mor...   938   0.0  
ref|XP_007018441.1| Pre-mRNA-processing protein 40A isoform 6 [T...   917   0.0  
ref|XP_007018442.1| Pre-mRNA-processing protein 40A isoform 7 [T...   903   0.0  
ref|XP_007018443.1| Pre-mRNA-processing protein 40A isoform 8 [T...   898   0.0  
ref|XP_004292768.1| PREDICTED: pre-mRNA-processing protein 40A-l...   892   0.0  
ref|XP_002510055.1| protein binding protein, putative [Ricinus c...   884   0.0  
ref|XP_006827042.1| hypothetical protein AMTR_s00010p00227470 [A...   883   0.0  
ref|XP_004141297.1| PREDICTED: pre-mRNA-processing protein 40A-l...   879   0.0  
ref|XP_006343435.1| PREDICTED: pre-mRNA-processing protein 40A-l...   873   0.0  
ref|XP_006343434.1| PREDICTED: pre-mRNA-processing protein 40A-l...   873   0.0  
ref|XP_006343433.1| PREDICTED: pre-mRNA-processing protein 40A-l...   873   0.0  
ref|XP_004242948.1| PREDICTED: pre-mRNA-processing protein 40A-l...   855   0.0  

>emb|CBI19367.3| unnamed protein product [Vitis vinifera]
          Length = 1030

 Score = 1131 bits (2925), Expect = 0.0
 Identities = 620/986 (62%), Positives = 696/986 (70%), Gaps = 8/986 (0%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPP-SMQFRPAVNPQQGQSFIQAASQQFRPIGQG 2762
            M+N PQSSGAQP RPP +GSMGPQ+FGPP SMQFRPAV  QQG  FI AASQQFRPIGQ 
Sbjct: 1    MANNPQSSGAQPLRPPAVGSMGPQNFGPPLSMQFRPAVPGQQGHPFIPAASQQFRPIGQN 60

Query: 2761 ISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPGH---ATQAIQMPYIQQNRPLTSGSPQS 2591
            ISS  VG PSGQ+Q  QFSQ MQQ PPRP QPG    ++Q I MPYIQQNRPLTS SPQ 
Sbjct: 61   ISSPNVGGPSGQNQPPQFSQAMQQLPPRPNQPGPIAPSSQPIPMPYIQQNRPLTSSSPQP 120

Query: 2590 QQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQMQSSNVPVGG 2414
             QTA PL++ MPGL G GMP SSSYTFAP+SFGQPQS +NA  QFQ + QM +   PVGG
Sbjct: 121  NQTAPPLNSHMPGLAGPGMPFSSSYTFAPASFGQPQSTINASAQFQPISQMHA---PVGG 177

Query: 2413 QPWLSSGNQSAALISPVQQTGHSSVNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYYN 2234
            QPWLSSG+QS AL++PV Q G      A +PA N PN T QSSSDWQEHTSADGRRYYYN
Sbjct: 178  QPWLSSGSQSGALVTPVHQAGQQPSVTADIPAGNVPNPTHQSSSDWQEHTSADGRRYYYN 237

Query: 2233 KKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKWT 2054
            KKTR SSWEKP+ELMTPIE          RADASTVWKEFTTPEGRKYYYNKVTKQSKWT
Sbjct: 238  KKTRLSSWEKPLELMTPIE----------RADASTVWKEFTTPEGRKYYYNKVTKQSKWT 287

Query: 2053 IPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPVI 1874
            IPEELK+AREQAEK+ SQ  Q E                      ++SVS TTSST+  +
Sbjct: 288  IPEELKLAREQAEKSVSQETQSEMGTTSNEPAVVAVSLAETPSTASVSVSSTTSSTISGM 347

Query: 1873 ASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGNT 1694
             SSP+PVTPVVA   +P P+ VSG+SAIP+AQ  + T+ VG+   P   TPLPAA+ G+T
Sbjct: 348  TSSPVPVTPVVAVV-NPPPVVVSGTSAIPIAQSAVTTSAVGVQ--PSMGTPLPAAVSGST 404

Query: 1693 VVPDASANASTTSM-SLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKTP 1517
             V  A  N + TSM S +N+S+   +    G SMQDIEEAKKG+AVAGKINVTP+EEKT 
Sbjct: 405  GVAAAFINPNATSMTSFENLSADATN----GASMQDIEEAKKGVAVAGKINVTPLEEKTL 460

Query: 1516 DEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFNE 1337
            D+EPLVY+ K EAK+AFKALLESA+VESDW+W+QAM+ IINDKRYGALKTLGERKQAFNE
Sbjct: 461  DDEPLVYSTKLEAKNAFKALLESANVESDWTWDQAMKAIINDKRYGALKTLGERKQAFNE 520

Query: 1336 YLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERPR 1157
            YLGQRKK+EAEE+R+RQKKAREEFT MLEE +ELTSSI+W +AV MF++DERF AVER R
Sbjct: 521  YLGQRKKIEAEERRMRQKKAREEFTTMLEECKELTSSIKWSKAVDMFQDDERFKAVERSR 580

Query: 1156 DREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDER 977
            DREDLF+ +I               KRN ME+RQFLESCDFIKVNSQWRKVQDRLEDDER
Sbjct: 581  DREDLFENFIMELQKKERTKALEEQKRNRMEYRQFLESCDFIKVNSQWRKVQDRLEDDER 640

Query: 976  CLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTLT 797
            C RLEKI RL +FQ                      RAERKNRDEFRKLM+EHVAAGTLT
Sbjct: 641  CSRLEKIDRLEIFQEYIRDLEREEEEQRKIQKEQLRRAERKNRDEFRKLMEEHVAAGTLT 700

Query: 796  AKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRARIKDAVKLGK 617
            AKTHWRDYCMKVKDS+ YLAV+SNTSGSTPKDLFED+AEELEKQYHED+ARIKDA+KL K
Sbjct: 701  AKTHWRDYCMKVKDSSPYLAVASNTSGSTPKDLFEDVAEELEKQYHEDKARIKDAMKLSK 760

Query: 616  ITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDFT 437
            +T+ASTWT  DFKAAIL+DVGSP ISD+N                           +DF 
Sbjct: 761  VTIASTWTFGDFKAAILDDVGSPNISDVNLKLVFEELLDRIKEKEEKEAKKRQRLADDFN 820

Query: 436  KLLRS-KEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQXXXXXXXXXXX 260
             LLRS KEI ASSNWEDCKPL E+SQEYRSIGEESF +EIFEEYIAHLQ           
Sbjct: 821  DLLRSKKEITASSNWEDCKPLFEESQEYRSIGEESFGREIFEEYIAHLQEKAKEKERKRE 880

Query: 259  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDETESEIVDIVDVHGYXXXXX 80
                                                  KDETESE VD+   +GY     
Sbjct: 881  EEKAKKEKEREEKEKRKEKERKEKDRDREREKGKERSRKDETESENVDVTGSYGYKEDKK 940

Query: 79   XXXXXXXXXXXXHQSAVDEGSSDKDE 2
                        HQSAVD+ SSDK+E
Sbjct: 941  REKDKDRKHRKRHQSAVDDASSDKEE 966


>ref|XP_002283496.2| PREDICTED: pre-mRNA-processing factor 40 homolog B-like [Vitis
            vinifera]
          Length = 1020

 Score = 1098 bits (2841), Expect = 0.0
 Identities = 607/997 (60%), Positives = 688/997 (69%), Gaps = 9/997 (0%)
 Frame = -3

Query: 2965 LLKGFCVCAGMSNTPQSSGAQPFRPP-IGSMGPQSFGPP-SMQFRPAVNPQQGQSFIQAA 2792
            L +  C+CAGM+N PQSSGAQP RPP +GSMGPQ+FGPP SMQFRPAV  QQG  FI AA
Sbjct: 5    LFEEACLCAGMANNPQSSGAQPLRPPAVGSMGPQNFGPPLSMQFRPAVPGQQGHPFIPAA 64

Query: 2791 SQQFRPIGQGISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPGH---ATQAIQMPYIQQN 2621
            SQQFRPIGQ ISS  VG PSGQ+Q  QFSQ MQQ PPRP QPG    ++Q I MPYIQQN
Sbjct: 65   SQQFRPIGQNISSPNVGGPSGQNQPPQFSQAMQQLPPRPNQPGPIAPSSQPIPMPYIQQN 124

Query: 2620 RPLTSGSPQSQQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQ 2444
            RPLTS SPQ  QTA PL++ MPGL            FAP+SFGQPQS +NA  QFQ + Q
Sbjct: 125  RPLTSSSPQPNQTAPPLNSHMPGL------------FAPASFGQPQSTINASAQFQPISQ 172

Query: 2443 MQSSNVPVGGQPWLSSGNQSAALISPVQQTGHSSVNAATVP--AINGPNSTLQSSSDWQE 2270
            M +   PVGGQPWLSSG+QS AL++PV Q G      A +P  A N PN T QSSSDWQE
Sbjct: 173  MHA---PVGGQPWLSSGSQSGALVTPVHQAGQQPSVTADIPVSAGNVPNPTHQSSSDWQE 229

Query: 2269 HTSADGRRYYYNKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKY 2090
            HTSADGRRYYYNKKTR SSWEKP+ELMTPIE          RADASTVWKEFTTPEGRKY
Sbjct: 230  HTSADGRRYYYNKKTRLSSWEKPLELMTPIE----------RADASTVWKEFTTPEGRKY 279

Query: 2089 YYNKVTKQSKWTIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAIS 1910
            YYNKVTKQSKWTIPEELK+AREQAEK+ SQ  Q E                      ++S
Sbjct: 280  YYNKVTKQSKWTIPEELKLAREQAEKSVSQETQSEMGTTSNEPAVVAVSLAETPSTASVS 339

Query: 1909 VSHTTSSTLPVIASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVT 1730
            VS TTSST+  + SSP+PVTPVVA   +P P+ VSG+SAIP+AQ  + T+ VG+   P  
Sbjct: 340  VSSTTSSTISGMTSSPVPVTPVVAVV-NPPPVVVSGTSAIPIAQSAVTTSAVGVQ--PSM 396

Query: 1729 VTPLPAALLGNTVVPDASANASTTSMSLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGK 1550
             TPLPAA+ G+T                  +++ +++ + +G SMQDIEEAKKG+AVAGK
Sbjct: 397  GTPLPAAVSGST-----------------GVAANLSADATNGASMQDIEEAKKGVAVAGK 439

Query: 1549 INVTPVEEKTPDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALK 1370
            INVTP+EEKT D+EPLVY+ K EAK+AFKALLESA+VESDW+W+QAM+ IINDKRYGALK
Sbjct: 440  INVTPLEEKTLDDEPLVYSTKLEAKNAFKALLESANVESDWTWDQAMKAIINDKRYGALK 499

Query: 1369 TLGERKQAFNEYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFEN 1190
            TLGERKQAFNEYLGQRKK+EAEE+R+RQKKAREEFT MLEE +ELTSSI+W +AV MF++
Sbjct: 500  TLGERKQAFNEYLGQRKKIEAEERRMRQKKAREEFTTMLEECKELTSSIKWSKAVDMFQD 559

Query: 1189 DERFTAVERPRDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWR 1010
            DERF AVER RDREDLF+ +I               KRN ME+RQFLESCDFIKVNSQWR
Sbjct: 560  DERFKAVERSRDREDLFENFIMELQKKERTKALEEQKRNRMEYRQFLESCDFIKVNSQWR 619

Query: 1009 KVQDRLEDDERCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKL 830
            KVQDRLEDDERC RLEKI RL +FQ                      RAERKNRDEFRKL
Sbjct: 620  KVQDRLEDDERCSRLEKIDRLEIFQEYIRDLEREEEEQRKIQKEQLRRAERKNRDEFRKL 679

Query: 829  MDEHVAAGTLTAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDR 650
            M+EHVAAGTLTAKTHWRDYCMKVKDS+ YLAV+SNTSGSTPKDLFED+AEELEKQYHED+
Sbjct: 680  MEEHVAAGTLTAKTHWRDYCMKVKDSSPYLAVASNTSGSTPKDLFEDVAEELEKQYHEDK 739

Query: 649  ARIKDAVKLGKITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXX 470
            ARIKDA+KL K+T+ASTWT  DFKAAIL+DVGSP ISD+N                    
Sbjct: 740  ARIKDAMKLSKVTIASTWTFGDFKAAILDDVGSPNISDVNLKLVFEELLDRIKEKEEKEA 799

Query: 469  XXXXXXXEDFTKLLRS-KEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQ 293
                   +DF  LLRS KEI ASSNWEDCKPL E+SQEYRSIGEESF +EIFEEYIAHLQ
Sbjct: 800  KKRQRLADDFNDLLRSKKEITASSNWEDCKPLFEESQEYRSIGEESFGREIFEEYIAHLQ 859

Query: 292  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDETESEIVDI 113
                                                             KDETESE VD+
Sbjct: 860  EKAKEKERKREEEKAKKEKEREEKEKRKEKERKEKDRDREREKGKERSRKDETESENVDV 919

Query: 112  VDVHGYXXXXXXXXXXXXXXXXXHQSAVDEGSSDKDE 2
               +GY                 HQSAVD+ SSDK+E
Sbjct: 920  TGSYGYKEDKKREKDKDRKHRKRHQSAVDDASSDKEE 956


>ref|XP_007018436.1| Pre-mRNA-processing protein 40A isoform 1 [Theobroma cacao]
            gi|590596803|ref|XP_007018437.1| Pre-mRNA-processing
            protein 40A isoform 1 [Theobroma cacao]
            gi|508723764|gb|EOY15661.1| Pre-mRNA-processing protein
            40A isoform 1 [Theobroma cacao]
            gi|508723765|gb|EOY15662.1| Pre-mRNA-processing protein
            40A isoform 1 [Theobroma cacao]
          Length = 1032

 Score = 1042 bits (2695), Expect = 0.0
 Identities = 580/987 (58%), Positives = 669/987 (67%), Gaps = 9/987 (0%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPP-SMQFRPAVNPQQGQSFIQAASQQFRPIGQG 2762
            M+N  Q S AQP  PP +GS+GPQS+G P S QFRP V  QQGQ F+ AASQQFRP+GQ 
Sbjct: 1    MANNSQPSSAQPHWPPAVGSLGPQSYGSPLSSQFRPVVPMQQGQHFVPAASQQFRPVGQ- 59

Query: 2761 ISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPG---HATQAIQMPYIQQNRPLTSGSPQS 2591
            + SS VGMP+ Q+Q +QFSQPMQQ+PPRP QPG    + Q + +P+ Q NRPLTSGSPQS
Sbjct: 60   VPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQS 119

Query: 2590 QQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQMQSSNVPVGG 2414
             QTA PL++ MPGL   GMP SSSY++ PSSFGQPQ+NV+A  QFQ   Q+ +S  PV G
Sbjct: 120  HQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAG 179

Query: 2413 QPWLSSGNQSAALISPVQQTGHSS-VNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYY 2237
            QPWLSSGNQS +L  P+QQTG    + ++   A N P  T  S+SDWQEHTSADGRRYYY
Sbjct: 180  QPWLSSGNQSVSLAIPIQQTGQQPPLISSADTAANAPIHTPPSASDWQEHTSADGRRYYY 239

Query: 2236 NKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKW 2057
            NKKTRQSSWEKP+ELMTPIE          RADASTVWKEFTTPEGRKYYYNKVTKQSKW
Sbjct: 240  NKKTRQSSWEKPLELMTPIE----------RADASTVWKEFTTPEGRKYYYNKVTKQSKW 289

Query: 2056 TIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPV 1877
            TIPEELK+AREQA+  ASQG   +                       I VS  TS     
Sbjct: 290  TIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMPAAA-IPVSSNTSQ---- 344

Query: 1876 IASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGN 1697
             ASSP+ VTPV AA  +PSP  VSGS+ +PV+Q   AT    + S  V VTPLPA   G 
Sbjct: 345  -ASSPVSVTPV-AAVANPSPTLVSGSTVVPVSQSA-ATNASEVQSPAVAVTPLPAVSSGG 401

Query: 1696 TVVPDASANASTTSM-SLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKT 1520
            +  P  S NA+TT + SL++ +SQ +    +G S QDIEEAKKGMA AGK+NVTPVEEK 
Sbjct: 402  STTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATAGKVNVTPVEEKV 461

Query: 1519 PDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFN 1340
            PD+EPLVYANKQEAK+AFK+LLESA+V+SDW+WEQ MR IINDKRYGALKTLGERKQAFN
Sbjct: 462  PDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGALKTLGERKQAFN 521

Query: 1339 EYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERP 1160
            EYLGQRKKLEAEE+R+RQKKAREEFTKMLEES+ELTSS+RW +A S+FENDERF AVER 
Sbjct: 522  EYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLFENDERFKAVERA 581

Query: 1159 RDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDE 980
            RDREDLF+ YI               +RN+ E+R+FLESCDFIK NSQWRKVQDRLEDDE
Sbjct: 582  RDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKANSQWRKVQDRLEDDE 641

Query: 979  RCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTL 800
            RC RLEKI RL +FQ                      RAERKNRD FRKLMDEHV  GTL
Sbjct: 642  RCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKEQLRRAERKNRDAFRKLMDEHVVDGTL 701

Query: 799  TAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRARIKDAVKLG 620
            TAKT+WRDYC+KVKD   YLAV+SNTSGSTPKDLFED+ EELEKQY +D+  IKDA+K G
Sbjct: 702  TAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVVEELEKQYQQDKTHIKDAMKSG 761

Query: 619  KITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDF 440
            KI++ STWT+EDFKAAI EDVGS PISDIN                           +DF
Sbjct: 762  KISMVSTWTVEDFKAAISEDVGSLPISDINLKLVYEELLKSAKEKEEKEAKKRQRLADDF 821

Query: 439  TKLLRS-KEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQXXXXXXXXXX 263
            TKLL + KEI ASS+WED +PL E+SQEYRSI EES ++EIFEEYIA+LQ          
Sbjct: 822  TKLLHTYKEITASSDWEDSRPLFEESQEYRSIAEESLRREIFEEYIAYLQEKAKEKERKR 881

Query: 262  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDETESEIVDIVDVHGYXXXX 83
                                                   KDET+SE +DI D HG+    
Sbjct: 882  EEEKAKKEKEREEKEKRKEKERKEKEREREREKGKERTKKDETDSENLDISDSHGHKEDK 941

Query: 82   XXXXXXXXXXXXXHQSAVDEGSSDKDE 2
                         HQS  D+GSSDKD+
Sbjct: 942  KKEKEKDRKHRKRHQSGGDDGSSDKDD 968


>ref|XP_007227030.1| hypothetical protein PRUPE_ppa000697mg [Prunus persica]
            gi|462423966|gb|EMJ28229.1| hypothetical protein
            PRUPE_ppa000697mg [Prunus persica]
          Length = 1031

 Score = 1039 bits (2686), Expect = 0.0
 Identities = 573/987 (58%), Positives = 681/987 (68%), Gaps = 9/987 (0%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGP-PSMQFRPAVNPQQGQSFIQAASQQFRPIGQG 2762
            M+N PQSS AQPFRPP + S+GPQSFG  PS+Q+RP V  QQGQ FIQ+ASQQF+P+GQG
Sbjct: 1    MANNPQSSAAQPFRPPPVASLGPQSFGSSPSLQYRPVVPTQQGQQFIQSASQQFQPVGQG 60

Query: 2761 ISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPGHAT---QAIQMPYIQQNRPLTSGSPQS 2591
            I SS VGMP+ QSQ LQFSQPMQ YP RP+QPGHAT   QA+ M Y+Q  RP+TS   QS
Sbjct: 61   IPSSNVGMPASQSQQLQFSQPMQPYPLRPSQPGHATPSSQALPMQYMQ-TRPITSAPSQS 119

Query: 2590 QQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQMQSSNVPVGG 2414
            QQ A P +NQMPGL G GMP SSSY FAP S+ QPQ+NV++  QFQ + Q+Q+ +V V G
Sbjct: 120  QQPALPFNNQMPGLAGGGMPYSSSYIFAPPSYAQPQNNVSSSSQFQPISQVQA-HVSVTG 178

Query: 2413 QPWLSSGNQSAALISPVQQTGHS-SVNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYY 2237
            QPW+SSGNQ AA+ +PV Q+G   S    T  A+N P+ T QSSSDWQEHTS DGRRYY+
Sbjct: 179  QPWVSSGNQGAAVPTPVPQSGQQPSSTTFTDSAVNVPSQTQQSSSDWQEHTSGDGRRYYF 238

Query: 2236 NKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKW 2057
            N++T+QSSWEKP+ELMTP+E          RADASTVWKE+T+ +G+KYYYNKVT++SKW
Sbjct: 239  NRRTKQSSWEKPLELMTPME----------RADASTVWKEYTSSDGKKYYYNKVTRESKW 288

Query: 2056 TIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPV 1877
            TIPEELK+AREQA++  +QG + E                        SV  +TSS LP 
Sbjct: 289  TIPEELKLAREQAQRELAQGTRSEMNLTSHAPPAVASAETPMGSS---SVGPSTSSALPG 345

Query: 1876 IASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGN 1697
            + SSP+ V PV ++  +PSP+A +GSS    AQ  I T  VG+    VTVTP PA++ G+
Sbjct: 346  MVSSPVAVIPV-SSFSNPSPIAPTGSSVASGAQSSI-TGGVGIQPPVVTVTPPPASVSGS 403

Query: 1696 TVVPDASANASTTSMS-LDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKT 1520
            T VP    NA T S+S  +N++SQ   S+ DG   QDIEEAK+GMAVAGK+NVTP EEKT
Sbjct: 404  TGVPPTLVNAITKSVSTFENVTSQDIGSADDGAFTQDIEEAKRGMAVAGKVNVTPSEEKT 463

Query: 1519 PDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFN 1340
             DEEPLVYA+KQEAK+AFKALLESA+V SDW+WEQ MR IINDKRYGALKTLGERKQAFN
Sbjct: 464  VDEEPLVYASKQEAKNAFKALLESANVHSDWTWEQTMREIINDKRYGALKTLGERKQAFN 523

Query: 1339 EYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERP 1160
            EYLGQRKKLE EE+R+RQKKAREEF+KMLEES+EL S+ RW +AVSMFENDERF AVER 
Sbjct: 524  EYLGQRKKLENEERRMRQKKAREEFSKMLEESKELMSATRWSKAVSMFENDERFKAVERA 583

Query: 1159 RDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDE 980
            RDREDL++ YI               K+N+ E+R+FLESCDFIKVNSQWRKVQDRLEDDE
Sbjct: 584  RDREDLYESYIVELERKEKEKAAEDHKQNIAEYRKFLESCDFIKVNSQWRKVQDRLEDDE 643

Query: 979  RCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTL 800
            RCLRLEK+ RL +FQ                      R ERKNRDEFRKLM+EHVA GTL
Sbjct: 644  RCLRLEKLDRLLIFQDYIRDLEKEEEEQKKIQKEQLRRVERKNRDEFRKLMEEHVADGTL 703

Query: 799  TAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRARIKDAVKLG 620
            TAKT+WRDYCMKVKD ++Y AV+SNTSGSTPK+LFED+AEELEKQYHED+ARIKDA+KLG
Sbjct: 704  TAKTYWRDYCMKVKDLSSYEAVASNTSGSTPKELFEDVAEELEKQYHEDKARIKDAMKLG 763

Query: 619  KITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDF 440
            K+TLAST T E+FK AILED+G P ISDIN                           +DF
Sbjct: 764  KVTLASTLTFEEFKVAILEDIGFPSISDINFKLVYEELLERAKEKEEKEAKKRQRLGDDF 823

Query: 439  TKLLRS-KEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQXXXXXXXXXX 263
             KLL + KEI ASSNWEDCK L E++QEYRSIGEE+F +E+FEEYI +LQ          
Sbjct: 824  NKLLHTFKEITASSNWEDCKHLFEETQEYRSIGEENFSREVFEEYITNLQEKAKEKERKR 883

Query: 262  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDETESEIVDIVDVHGYXXXX 83
                                                   KDET+SE VDI D HG+    
Sbjct: 884  EEEKAKKEREREEKEKRKDKERKEKEREREKEKGKERSKKDETDSENVDITDSHGHKEDK 943

Query: 82   XXXXXXXXXXXXXHQSAVDEGSSDKDE 2
                         HQS++D+  SDK+E
Sbjct: 944  KREKDKDRKHRKRHQSSMDDVGSDKEE 970


>ref|XP_007018438.1| Pre-mRNA-processing protein 40A isoform 3 [Theobroma cacao]
            gi|508723766|gb|EOY15663.1| Pre-mRNA-processing protein
            40A isoform 3 [Theobroma cacao]
          Length = 1041

 Score = 1036 bits (2679), Expect = 0.0
 Identities = 581/996 (58%), Positives = 670/996 (67%), Gaps = 18/996 (1%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPP-SMQFRPAVNPQQGQSFIQAASQQFRPIGQG 2762
            M+N  Q S AQP  PP +GS+GPQS+G P S QFRP V  QQGQ F+ AASQQFRP+GQ 
Sbjct: 1    MANNSQPSSAQPHWPPAVGSLGPQSYGSPLSSQFRPVVPMQQGQHFVPAASQQFRPVGQ- 59

Query: 2761 ISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPG---HATQAIQMPYIQQNRPLTSGSPQS 2591
            + SS VGMP+ Q+Q +QFSQPMQQ+PPRP QPG    + Q + +P+ Q NRPLTSGSPQS
Sbjct: 60   VPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQS 119

Query: 2590 QQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQMQSSNVPVGG 2414
             QTA PL++ MPGL   GMP SSSY++ PSSFGQPQ+NV+A  QFQ   Q+ +S  PV G
Sbjct: 120  HQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAG 179

Query: 2413 QPWLSSGNQSAALISPVQQTGHSS-VNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYY 2237
            QPWLSSGNQS +L  P+QQTG    + ++   A N P  T  S+SDWQEHTSADGRRYYY
Sbjct: 180  QPWLSSGNQSVSLAIPIQQTGQQPPLISSADTAANAPIHTPPSASDWQEHTSADGRRYYY 239

Query: 2236 NKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKW 2057
            NKKTRQSSWEKP+ELMTPIE          RADASTVWKEFTTPEGRKYYYNKVTKQSKW
Sbjct: 240  NKKTRQSSWEKPLELMTPIE----------RADASTVWKEFTTPEGRKYYYNKVTKQSKW 289

Query: 2056 TIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPV 1877
            TIPEELK+AREQA+  ASQG   +                       I VS  TS     
Sbjct: 290  TIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMPAAA-IPVSSNTSQ---- 344

Query: 1876 IASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGN 1697
             ASSP+ VTPV AA  +PSP  VSGS+ +PV+Q   AT    + S  V VTPLPA   G 
Sbjct: 345  -ASSPVSVTPV-AAVANPSPTLVSGSTVVPVSQSA-ATNASEVQSPAVAVTPLPAVSSGG 401

Query: 1696 TVVPDASANASTTSM-SLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKT 1520
            +  P  S NA+TT + SL++ +SQ +    +G S QDIEEAKKGMA AGK+NVTPVEEK 
Sbjct: 402  STTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATAGKVNVTPVEEKV 461

Query: 1519 PDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFN 1340
            PD+EPLVYANKQEAK+AFK+LLESA+V+SDW+WEQ MR IINDKRYGALKTLGERKQAFN
Sbjct: 462  PDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGALKTLGERKQAFN 521

Query: 1339 EYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERP 1160
            EYLGQRKKLEAEE+R+RQKKAREEFTKMLEES+ELTSS+RW +A S+FENDERF AVER 
Sbjct: 522  EYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLFENDERFKAVERA 581

Query: 1159 RDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKV---------NSQWRK 1007
            RDREDLF+ YI               +RN+ E+R+FLESCDFIKV         NSQWRK
Sbjct: 582  RDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKVQHFQKRIQANSQWRK 641

Query: 1006 VQDRLEDDERCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLM 827
            VQDRLEDDERC RLEKI RL +FQ                      RAERKNRD FRKLM
Sbjct: 642  VQDRLEDDERCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKEQLRRAERKNRDAFRKLM 701

Query: 826  DEHVAAGTLTAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRA 647
            DEHV  GTLTAKT+WRDYC+KVKD   YLAV+SNTSGSTPKDLFED+ EELEKQY +D+ 
Sbjct: 702  DEHVVDGTLTAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVVEELEKQYQQDKT 761

Query: 646  RIKDAVKLGKITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXX 467
             IKDA+K GKI++ STWT+EDFKAAI EDVGS PISDIN                     
Sbjct: 762  HIKDAMKSGKISMVSTWTVEDFKAAISEDVGSLPISDINLKLVYEELLKSAKEKEEKEAK 821

Query: 466  XXXXXXEDFTKLLRS-KEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQX 290
                  +DFTKLL + KEI ASS+WED +PL E+SQEYRSI EES ++EIFEEYIA+LQ 
Sbjct: 822  KRQRLADDFTKLLHTYKEITASSDWEDSRPLFEESQEYRSIAEESLRREIFEEYIAYLQE 881

Query: 289  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDETESEIVDIV 110
                                                            KDET+SE +DI 
Sbjct: 882  KAKEKERKREEEKAKKEKEREEKEKRKEKERKEKEREREREKGKERTKKDETDSENLDIS 941

Query: 109  DVHGYXXXXXXXXXXXXXXXXXHQSAVDEGSSDKDE 2
            D HG+                 HQS  D+GSSDKD+
Sbjct: 942  DSHGHKEDKKKEKEKDRKHRKRHQSGGDDGSSDKDD 977


>ref|XP_007018440.1| Pre-mRNA-processing protein 40A isoform 5 [Theobroma cacao]
            gi|508723768|gb|EOY15665.1| Pre-mRNA-processing protein
            40A isoform 5 [Theobroma cacao]
          Length = 904

 Score = 1024 bits (2647), Expect = 0.0
 Identities = 560/899 (62%), Positives = 644/899 (71%), Gaps = 18/899 (2%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPP-SMQFRPAVNPQQGQSFIQAASQQFRPIGQG 2762
            M+N  Q S AQP  PP +GS+GPQS+G P S QFRP V  QQGQ F+ AASQQFRP+GQ 
Sbjct: 1    MANNSQPSSAQPHWPPAVGSLGPQSYGSPLSSQFRPVVPMQQGQHFVPAASQQFRPVGQ- 59

Query: 2761 ISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPG---HATQAIQMPYIQQNRPLTSGSPQS 2591
            + SS VGMP+ Q+Q +QFSQPMQQ+PPRP QPG    + Q + +P+ Q NRPLTSGSPQS
Sbjct: 60   VPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQS 119

Query: 2590 QQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQMQSSNVPVGG 2414
             QTA PL++ MPGL   GMP SSSY++ PSSFGQPQ+NV+A  QFQ   Q+ +S  PV G
Sbjct: 120  HQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAG 179

Query: 2413 QPWLSSGNQSAALISPVQQTGHSS-VNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYY 2237
            QPWLSSGNQS +L  P+QQTG    + ++   A N P  T  S+SDWQEHTSADGRRYYY
Sbjct: 180  QPWLSSGNQSVSLAIPIQQTGQQPPLISSADTAANAPIHTPPSASDWQEHTSADGRRYYY 239

Query: 2236 NKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKW 2057
            NKKTRQSSWEKP+ELMTPIE          RADASTVWKEFTTPEGRKYYYNKVTKQSKW
Sbjct: 240  NKKTRQSSWEKPLELMTPIE----------RADASTVWKEFTTPEGRKYYYNKVTKQSKW 289

Query: 2056 TIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPV 1877
            TIPEELK+AREQA+  ASQG   +                       I VS  TS     
Sbjct: 290  TIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMPAAA-IPVSSNTSQ---- 344

Query: 1876 IASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGN 1697
             ASSP+ VTPV AA  +PSP  VSGS+ +PV+Q   AT    + S  V VTPLPA   G 
Sbjct: 345  -ASSPVSVTPV-AAVANPSPTLVSGSTVVPVSQSA-ATNASEVQSPAVAVTPLPAVSSGG 401

Query: 1696 TVVPDASANASTTSM-SLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKT 1520
            +  P  S NA+TT + SL++ +SQ +    +G S QDIEEAKKGMA AGK+NVTPVEEK 
Sbjct: 402  STTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATAGKVNVTPVEEKV 461

Query: 1519 PDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFN 1340
            PD+EPLVYANKQEAK+AFK+LLESA+V+SDW+WEQ MR IINDKRYGALKTLGERKQAFN
Sbjct: 462  PDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGALKTLGERKQAFN 521

Query: 1339 EYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERP 1160
            EYLGQRKKLEAEE+R+RQKKAREEFTKMLEES+ELTSS+RW +A S+FENDERF AVER 
Sbjct: 522  EYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLFENDERFKAVERA 581

Query: 1159 RDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKV---------NSQWRK 1007
            RDREDLF+ YI               +RN+ E+R+FLESCDFIKV         NSQWRK
Sbjct: 582  RDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKVQHFQKRIQANSQWRK 641

Query: 1006 VQDRLEDDERCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLM 827
            VQDRLEDDERC RLEKI RL +FQ                      RAERKNRD FRKLM
Sbjct: 642  VQDRLEDDERCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKEQLRRAERKNRDAFRKLM 701

Query: 826  DEHVAAGTLTAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRA 647
            DEHV  GTLTAKT+WRDYC+KVKD   YLAV+SNTSGSTPKDLFED+ EELEKQY +D+ 
Sbjct: 702  DEHVVDGTLTAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVVEELEKQYQQDKT 761

Query: 646  RIKDAVKLGKITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXX 467
             IKDA+K GKI++ STWT+EDFKAAI EDVGS PISDIN                     
Sbjct: 762  HIKDAMKSGKISMVSTWTVEDFKAAISEDVGSLPISDINLKLVYEELLKSAKEKEEKEAK 821

Query: 466  XXXXXXEDFTKLLRS-KEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQ 293
                  +DFTKLL + KEI ASS+WED +PL E+SQEYRSI EES ++EIFEEYIA+LQ
Sbjct: 822  KRQRLADDFTKLLHTYKEITASSDWEDSRPLFEESQEYRSIAEESLRREIFEEYIAYLQ 880


>ref|XP_007018439.1| Pre-mRNA-processing protein 40A isoform 4 [Theobroma cacao]
            gi|508723767|gb|EOY15664.1| Pre-mRNA-processing protein
            40A isoform 4 [Theobroma cacao]
          Length = 844

 Score =  964 bits (2491), Expect = 0.0
 Identities = 528/858 (61%), Positives = 608/858 (70%), Gaps = 17/858 (1%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPP-SMQFRPAVNPQQGQSFIQAASQQFRPIGQG 2762
            M+N  Q S AQP  PP +GS+GPQS+G P S QFRP V  QQGQ F+ AASQQFRP+GQ 
Sbjct: 1    MANNSQPSSAQPHWPPAVGSLGPQSYGSPLSSQFRPVVPMQQGQHFVPAASQQFRPVGQ- 59

Query: 2761 ISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPG---HATQAIQMPYIQQNRPLTSGSPQS 2591
            + SS VGMP+ Q+Q +QFSQPMQQ+PPRP QPG    + Q + +P+ Q NRPLTSGSPQS
Sbjct: 60   VPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQS 119

Query: 2590 QQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQMQSSNVPVGG 2414
             QTA PL++ MPGL   GMP SSSY++ PSSFGQPQ+NV+A  QFQ   Q+ +S  PV G
Sbjct: 120  HQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAG 179

Query: 2413 QPWLSSGNQSAALISPVQQTGHSS-VNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYY 2237
            QPWLSSGNQS +L  P+QQTG    + ++   A N P  T  S+SDWQEHTSADGRRYYY
Sbjct: 180  QPWLSSGNQSVSLAIPIQQTGQQPPLISSADTAANAPIHTPPSASDWQEHTSADGRRYYY 239

Query: 2236 NKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKW 2057
            NKKTRQSSWEKP+ELMTPIE          RADASTVWKEFTTPEGRKYYYNKVTKQSKW
Sbjct: 240  NKKTRQSSWEKPLELMTPIE----------RADASTVWKEFTTPEGRKYYYNKVTKQSKW 289

Query: 2056 TIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPV 1877
            TIPEELK+AREQA+  ASQG   +                       I VS  TS     
Sbjct: 290  TIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMPAAA-IPVSSNTSQ---- 344

Query: 1876 IASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGN 1697
             ASSP+ VTPV AA  +PSP  VSGS+ +PV+Q   AT    + S  V VTPLPA   G 
Sbjct: 345  -ASSPVSVTPV-AAVANPSPTLVSGSTVVPVSQSA-ATNASEVQSPAVAVTPLPAVSSGG 401

Query: 1696 TVVPDASANASTTSM-SLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKT 1520
            +  P  S NA+TT + SL++ +SQ +    +G S QDIEEAKKGMA AGK+NVTPVEEK 
Sbjct: 402  STTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATAGKVNVTPVEEKV 461

Query: 1519 PDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFN 1340
            PD+EPLVYANKQEAK+AFK+LLESA+V+SDW+WEQ MR IINDKRYGALKTLGERKQAFN
Sbjct: 462  PDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGALKTLGERKQAFN 521

Query: 1339 EYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERP 1160
            EYLGQRKKLEAEE+R+RQKKAREEFTKMLEES+ELTSS+RW +A S+FENDERF AVER 
Sbjct: 522  EYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLFENDERFKAVERA 581

Query: 1159 RDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKV---------NSQWRK 1007
            RDREDLF+ YI               +RN+ E+R+FLESCDFIKV         NSQWRK
Sbjct: 582  RDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKVQHFQKRIQANSQWRK 641

Query: 1006 VQDRLEDDERCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLM 827
            VQDRLEDDERC RLEKI RL +FQ                      RAERKNRD FRKLM
Sbjct: 642  VQDRLEDDERCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKEQLRRAERKNRDAFRKLM 701

Query: 826  DEHVAAGTLTAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRA 647
            DEHV  GTLTAKT+WRDYC+KVKD   YLAV+SNTSGSTPKDLFED+ EELEKQY +D+ 
Sbjct: 702  DEHVVDGTLTAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVVEELEKQYQQDKT 761

Query: 646  RIKDAVKLGKITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXX 467
             IKDA+K GKI++ STWT+EDFKAAI EDVGS PISDIN                     
Sbjct: 762  HIKDAMKSGKISMVSTWTVEDFKAAISEDVGSLPISDINLKLVYEELLKSAKEKEEKEAK 821

Query: 466  XXXXXXEDFTKLLRSKEI 413
                  +DFTKLL + ++
Sbjct: 822  KRQRLADDFTKLLHTYKV 839


>ref|XP_002320019.2| FF domain-containing family protein [Populus trichocarpa]
            gi|550323102|gb|EEE98334.2| FF domain-containing family
            protein [Populus trichocarpa]
          Length = 1019

 Score =  952 bits (2462), Expect = 0.0
 Identities = 512/865 (59%), Positives = 613/865 (70%), Gaps = 5/865 (0%)
 Frame = -3

Query: 2872 PQSFGPPSMQFRPAVNPQQGQSFIQAASQQFRPIGQGISSSPVGMPSGQSQTLQFSQPMQ 2693
            PQS G    QFRP V  QQGQ FIQ ASQQFRP+GQG+ SS VGMP+ QSQ LQFSQP+Q
Sbjct: 5    PQSSGG---QFRPMVPTQQGQPFIQVASQQFRPVGQGMPSSHVGMPAAQSQHLQFSQPIQ 61

Query: 2692 QYPPRPTQPGH-ATQAIQMPYIQQNRPLTSGSPQSQQTAHPLSNQMPGLVGSGMPLSSSY 2516
            Q PP P QPG  + QA+ MPY Q NRPLTS  PQ  Q A PLSN M  +  SG+P SS Y
Sbjct: 62   QLPPWPNQPGAPSAQALSMPYGQLNRPLTSSQPQ--QNAPPLSNHMHVVGTSGVPNSSPY 119

Query: 2515 TFAPSSFGQPQSNVNA-PQFQSLPQMQSSNVPVGGQPWLSSGNQSAALISPVQQTG-HSS 2342
             FAPSSFG  Q++ +A PQF  + QM +  VP+GGQPWLSSG+  A+L+ PVQ      S
Sbjct: 120  AFAPSSFGLTQNSASALPQFPPMSQMHAHVVPMGGQPWLSSGSHGASLVPPVQPAVVQPS 179

Query: 2341 VNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYYNKKTRQSSWEKPMELMTPIEXXXXX 2162
            +++++   +   +++ QS SDWQEHT++DGRRYYYN++T+QSSW+KP ELMTPIE     
Sbjct: 180  ISSSSDSTVAVSSNSQQSLSDWQEHTASDGRRYYYNRRTKQSSWDKPFELMTPIE----- 234

Query: 2161 XXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPEELKMAREQAEKAASQGIQLEX 1982
                 RADASTVWKEFTT EG+KYYYNKVTKQSKW+IPEELKMAREQA++   QG Q E 
Sbjct: 235  -----RADASTVWKEFTTQEGKKYYYNKVTKQSKWSIPEELKMAREQAQQTVGQGNQSET 289

Query: 1981 XXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPVIASSPIPVTPVVAAEPSPSPLAVSG 1802
                                  +SVS ++S  LP ++SSPI VT V     +P P+ VSG
Sbjct: 290  DAASNVPTAVAVTSSETSTTA-VSVS-SSSVMLPGVSSSPISVTAVA----NPPPVVVSG 343

Query: 1801 SSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGNTVVPDASANASTTSMS-LDNISSQV 1625
            S A+PVA    A+ V     +  +VTPLP A+   T  P A+ +A TTS+S +DN+ SQ 
Sbjct: 344  SPALPVAHSTTASAV----GVQPSVTPLPTAVSVGTGAPAAAVDAKTTSLSSIDNLLSQS 399

Query: 1624 ASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKTPDEEPLVYANKQEAKSAFKALLESA 1445
            A++S+DG SM D  E  K     GK N +P+EEKTPDEEPLV+ANK EAK+AFKALLESA
Sbjct: 400  AANSVDGASMMDTAEFNKVSMDMGKTNASPLEEKTPDEEPLVFANKLEAKNAFKALLESA 459

Query: 1444 HVESDWSWEQAMRVIINDKRYGALKTLGERKQAFNEYLGQRKKLEAEEKRLRQKKAREEF 1265
            +V+SDW+WEQ MR IINDKRY ALKTLGERKQAFNEYLGQRKKLEAEE+R+RQKKAREEF
Sbjct: 460  NVQSDWTWEQTMREIINDKRYAALKTLGERKQAFNEYLGQRKKLEAEERRVRQKKAREEF 519

Query: 1264 TKMLEESQELTSSIRWREAVSMFENDERFTAVERPRDREDLFDGYIXXXXXXXXXXXXXX 1085
             KMLEES+ELTSS++W +A+S+FENDER+ A+ER RDREDLFD YI              
Sbjct: 520  AKMLEESKELTSSMKWSKAISLFENDERYKALERARDREDLFDSYIVDLERKEKEKAAED 579

Query: 1084 XKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDERCLRLEKIQRLFVFQXXXXXXXXXX 905
             +RNV E+R+FLESCDFIK +SQWRK+QDRLEDDERCL LEK+ RL +FQ          
Sbjct: 580  RRRNVAEYRKFLESCDFIKASSQWRKIQDRLEDDERCLCLEKLDRLLIFQDYIRDLEKEE 639

Query: 904  XXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTLTAKTHWRDYCMKVKDSTAYLAVSSN 725
                        RAERKNRDEFRKL++EHVA+G+LTAKTHW DYC+KVKD   Y AV++N
Sbjct: 640  EEQKKIQKEQLRRAERKNRDEFRKLLEEHVASGSLTAKTHWLDYCLKVKDLPPYQAVATN 699

Query: 724  TSGSTPKDLFEDIAEELEKQYHEDRARIKDAVKLGKITLASTWTLEDFKAAILEDVGSPP 545
            TSGS PKDLFED++EELEKQYH+D+ RIKDA+KLGKIT+ STWT EDFK A+ +D+GSPP
Sbjct: 700  TSGSKPKDLFEDVSEELEKQYHDDKTRIKDAMKLGKITMVSTWTFEDFKGAVADDIGSPP 759

Query: 544  ISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDFTKLLRS-KEIIASSNWEDCKPLLED 368
            ISDIN                           +DFTKLL + KE+  SSNWEDCKPL E+
Sbjct: 760  ISDINLKLLYEELVERAKEKEEKEAKKQQRLADDFTKLLYTLKEVTPSSNWEDCKPLFEE 819

Query: 367  SQEYRSIGEESFKKEIFEEYIAHLQ 293
            SQEYRSIGEES  KEIFEEY+ HLQ
Sbjct: 820  SQEYRSIGEESLSKEIFEEYVTHLQ 844


>gb|EXC51391.1| Pre-mRNA-processing factor 40-A-like protein [Morus notabilis]
          Length = 994

 Score =  938 bits (2425), Expect = 0.0
 Identities = 512/854 (59%), Positives = 608/854 (71%), Gaps = 11/854 (1%)
 Frame = -3

Query: 2821 QQGQSFIQAASQQFRPIGQGISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPGH---ATQ 2651
            Q GQ FI + SQQF+P+GQGI    +GM    SQ +QFSQ MQQYPPRP+QPGH   ++Q
Sbjct: 5    QHGQPFIPS-SQQFQPVGQGIPPPNLGMHPAHSQPVQFSQQMQQYPPRPSQPGHPMPSSQ 63

Query: 2650 AIQMPYIQQNRPLTSGSPQSQQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVN 2471
             + M YIQ  RP+  G PQSQQ A P +NQMP   G  MP SSSY++APSSF QPQ+N +
Sbjct: 64   GLPMSYIQ-TRPIAPGPPQSQQHAAPFTNQMPP--GGAMPFSSSYSYAPSSFVQPQNNAS 120

Query: 2470 A-PQFQSLPQMQSSNVPVGGQPWLSSGNQSAALISPVQQTGH-----SSVNAATVPAING 2309
            +  QFQ + QMQ+   P  GQPWLSSG  SA  ++P QQ G      SS +AAT    N 
Sbjct: 121  SVSQFQQMSQMQAPTAPGPGQPWLSSGIHSAPPVAPGQQVGQPPSAASSADAAT----NV 176

Query: 2308 PNSTLQSSSDWQEHTSADGRRYYYNKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADAST 2129
            P++T QSSSDWQEHTS+DGRRYYYNK+T+QS W+KP+ELMTPIE          RADAST
Sbjct: 177  PSTTQQSSSDWQEHTSSDGRRYYYNKRTKQSVWDKPVELMTPIE----------RADAST 226

Query: 2128 VWKEFTTPEGRKYYYNKVTKQSKWTIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXX 1949
            VWKE+++P+GRKYYYNKVTKQSKWTIPEELK+AREQA+K +SQG+Q E            
Sbjct: 227  VWKEYSSPDGRKYYYNKVTKQSKWTIPEELKLAREQAQKESSQGMQSETGLASHGPVAVG 286

Query: 1948 XXXXXXXXXXAISVSHTTSSTLPVIASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVI 1769
                        S +   ++    +ASSP+ VTPV A+ P+ S + +SGSSA P +Q  +
Sbjct: 287  SSEMPSAGTPVASGAPLVATG---VASSPVAVTPV-ASLPNSS-MTISGSSATPGSQSAV 341

Query: 1768 ATTVVGLHSLPVTVTPLPAALLGNTVVPDASANASTTSM-SLDNISSQVASSSLDGTSMQ 1592
            A+ V  +    VTVTPL  A+ G+T V  A  NA+TT + + DN  SQ  +SS+DG S+ 
Sbjct: 342  ASAVA-VQPPMVTVTPLNPAISGSTGVSPALGNANTTPVRTYDNRVSQDIASSVDGASIL 400

Query: 1591 DIEEAKKGMAVAGKINVTPVEEKTPDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQA 1412
            DIEEAKKGMAVAGKINVTPVEEK  D+EPLV+ANKQEAK+AFK+LLESA+V+SDW+WEQA
Sbjct: 401  DIEEAKKGMAVAGKINVTPVEEKPVDDEPLVFANKQEAKNAFKSLLESANVQSDWTWEQA 460

Query: 1411 MRVIINDKRYGALKTLGERKQAFNEYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELT 1232
            MR IINDKRYGALKTLGERKQAFNEYLGQRKKLEAEE+R+RQKKAREEFT MLEES+ELT
Sbjct: 461  MREIINDKRYGALKTLGERKQAFNEYLGQRKKLEAEERRMRQKKAREEFTIMLEESKELT 520

Query: 1231 SSIRWREAVSMFENDERFTAVERPRDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQF 1052
            SS RW +AVSMFENDERF AVER RDREDLF+ YI               +RN  E+R+F
Sbjct: 521  SSTRWSKAVSMFENDERFKAVERARDREDLFESYIVELERKEKEKAAEEHRRNAAEYRKF 580

Query: 1051 LESCDFIKVNSQWRKVQDRLEDDERCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXX 872
            LESCDFIKVNSQWRKVQ RLEDDERCLRLEK+ RL +FQ                     
Sbjct: 581  LESCDFIKVNSQWRKVQVRLEDDERCLRLEKLDRLLIFQDYIRDLEKEEEEQKKIQKEQL 640

Query: 871  XRAERKNRDEFRKLMDEHVAAGTLTAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFE 692
             R ERKNRDEFRKLM+EH+ A  LTAKT WRDYC+KVKD   Y AV+SNTSGSTPKDLFE
Sbjct: 641  RRVERKNRDEFRKLMEEHIDAAALTAKTPWRDYCLKVKDLPQYEAVASNTSGSTPKDLFE 700

Query: 691  DIAEELEKQYHEDRARIKDAVKLGKITLASTWTLEDFKAAILEDVGSPPISDINTXXXXX 512
            D+ EELEKQYH+D+AR+KD +KLGK++  S+WT +DFKAAILED+GSPPI +IN      
Sbjct: 701  DVTEELEKQYHDDKARVKDTLKLGKVSFESSWTFDDFKAAILEDIGSPPILEINLKLVYE 760

Query: 511  XXXXXXXXXXXXXXXXXXXXXEDFTKLLRS-KEIIASSNWEDCKPLLEDSQEYRSIGEES 335
                                 +DFTKLL S KEI  +SNWEDC+ L E+ QEYR+IGEES
Sbjct: 761  ELLERAKEKEEKETKKRQRLADDFTKLLHSKKEITTTSNWEDCRQLFEECQEYRAIGEES 820

Query: 334  FKKEIFEEYIAHLQ 293
              ++IFEEYI HLQ
Sbjct: 821  VTRDIFEEYITHLQ 834


>ref|XP_007018441.1| Pre-mRNA-processing protein 40A isoform 6 [Theobroma cacao]
            gi|508723769|gb|EOY15666.1| Pre-mRNA-processing protein
            40A isoform 6 [Theobroma cacao]
          Length = 774

 Score =  917 bits (2371), Expect = 0.0
 Identities = 499/791 (63%), Positives = 573/791 (72%), Gaps = 17/791 (2%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPP-SMQFRPAVNPQQGQSFIQAASQQFRPIGQG 2762
            M+N  Q S AQP  PP +GS+GPQS+G P S QFRP V  QQGQ F+ AASQQFRP+GQ 
Sbjct: 1    MANNSQPSSAQPHWPPAVGSLGPQSYGSPLSSQFRPVVPMQQGQHFVPAASQQFRPVGQ- 59

Query: 2761 ISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPG---HATQAIQMPYIQQNRPLTSGSPQS 2591
            + SS VGMP+ Q+Q +QFSQPMQQ+PPRP QPG    + Q + +P+ Q NRPLTSGSPQS
Sbjct: 60   VPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQS 119

Query: 2590 QQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQMQSSNVPVGG 2414
             QTA PL++ MPGL   GMP SSSY++ PSSFGQPQ+NV+A  QFQ   Q+ +S  PV G
Sbjct: 120  HQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAG 179

Query: 2413 QPWLSSGNQSAALISPVQQTGHSS-VNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYY 2237
            QPWLSSGNQS +L  P+QQTG    + ++   A N P  T  S+SDWQEHTSADGRRYYY
Sbjct: 180  QPWLSSGNQSVSLAIPIQQTGQQPPLISSADTAANAPIHTPPSASDWQEHTSADGRRYYY 239

Query: 2236 NKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKW 2057
            NKKTRQSSWEKP+ELMTPIE          RADASTVWKEFTTPEGRKYYYNKVTKQSKW
Sbjct: 240  NKKTRQSSWEKPLELMTPIE----------RADASTVWKEFTTPEGRKYYYNKVTKQSKW 289

Query: 2056 TIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPV 1877
            TIPEELK+AREQA+  ASQG   +                       I VS  TS     
Sbjct: 290  TIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMPAAA-IPVSSNTSQ---- 344

Query: 1876 IASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGN 1697
             ASSP+ VTPV AA  +PSP  VSGS+ +PV+Q   AT    + S  V VTPLPA   G 
Sbjct: 345  -ASSPVSVTPV-AAVANPSPTLVSGSTVVPVSQSA-ATNASEVQSPAVAVTPLPAVSSGG 401

Query: 1696 TVVPDASANASTTSM-SLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKT 1520
            +  P  S NA+TT + SL++ +SQ +    +G S QDIEEAKKGMA AGK+NVTPVEEK 
Sbjct: 402  STTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATAGKVNVTPVEEKV 461

Query: 1519 PDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFN 1340
            PD+EPLVYANKQEAK+AFK+LLESA+V+SDW+WEQ MR IINDKRYGALKTLGERKQAFN
Sbjct: 462  PDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGALKTLGERKQAFN 521

Query: 1339 EYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERP 1160
            EYLGQRKKLEAEE+R+RQKKAREEFTKMLEES+ELTSS+RW +A S+FENDERF AVER 
Sbjct: 522  EYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLFENDERFKAVERA 581

Query: 1159 RDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKV---------NSQWRK 1007
            RDREDLF+ YI               +RN+ E+R+FLESCDFIKV         NSQWRK
Sbjct: 582  RDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKVQHFQKRIQANSQWRK 641

Query: 1006 VQDRLEDDERCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLM 827
            VQDRLEDDERC RLEKI RL +FQ                      RAERKNRD FRKLM
Sbjct: 642  VQDRLEDDERCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKEQLRRAERKNRDAFRKLM 701

Query: 826  DEHVAAGTLTAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRA 647
            DEHV  GTLTAKT+WRDYC+KVKD   YLAV+SNTSGSTPKDLFED+ EELEKQY +D+ 
Sbjct: 702  DEHVVDGTLTAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVVEELEKQYQQDKT 761

Query: 646  RIKDAVKLGKI 614
             IKDA+K GK+
Sbjct: 762  HIKDAMKSGKV 772


>ref|XP_007018442.1| Pre-mRNA-processing protein 40A isoform 7 [Theobroma cacao]
            gi|508723770|gb|EOY15667.1| Pre-mRNA-processing protein
            40A isoform 7 [Theobroma cacao]
          Length = 787

 Score =  903 bits (2333), Expect = 0.0
 Identities = 489/765 (63%), Positives = 559/765 (73%), Gaps = 8/765 (1%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPP-SMQFRPAVNPQQGQSFIQAASQQFRPIGQG 2762
            M+N  Q S AQP  PP +GS+GPQS+G P S QFRP V  QQGQ F+ AASQQFRP+GQ 
Sbjct: 1    MANNSQPSSAQPHWPPAVGSLGPQSYGSPLSSQFRPVVPMQQGQHFVPAASQQFRPVGQ- 59

Query: 2761 ISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPG---HATQAIQMPYIQQNRPLTSGSPQS 2591
            + SS VGMP+ Q+Q +QFSQPMQQ+PPRP QPG    + Q + +P+ Q NRPLTSGSPQS
Sbjct: 60   VPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQS 119

Query: 2590 QQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQMQSSNVPVGG 2414
             QTA PL++ MPGL   GMP SSSY++ PSSFGQPQ+NV+A  QFQ   Q+ +S  PV G
Sbjct: 120  HQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAG 179

Query: 2413 QPWLSSGNQSAALISPVQQTGHSS-VNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYY 2237
            QPWLSSGNQS +L  P+QQTG    + ++   A N P  T  S+SDWQEHTSADGRRYYY
Sbjct: 180  QPWLSSGNQSVSLAIPIQQTGQQPPLISSADTAANAPIHTPPSASDWQEHTSADGRRYYY 239

Query: 2236 NKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKW 2057
            NKKTRQSSWEKP+ELMTPIE          RADASTVWKEFTTPEGRKYYYNKVTKQSKW
Sbjct: 240  NKKTRQSSWEKPLELMTPIE----------RADASTVWKEFTTPEGRKYYYNKVTKQSKW 289

Query: 2056 TIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPV 1877
            TIPEELK+AREQA+  ASQG   +                       I VS  TS     
Sbjct: 290  TIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMPAAA-IPVSSNTSQ---- 344

Query: 1876 IASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGN 1697
             ASSP+ VTPV AA  +PSP  VSGS+ +PV+Q   AT    + S  V VTPLPA   G 
Sbjct: 345  -ASSPVSVTPV-AAVANPSPTLVSGSTVVPVSQSA-ATNASEVQSPAVAVTPLPAVSSGG 401

Query: 1696 TVVPDASANASTTSM-SLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKT 1520
            +  P  S NA+TT + SL++ +SQ +    +G S QDIEEAKKGMA AGK+NVTPVEEK 
Sbjct: 402  STTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATAGKVNVTPVEEKV 461

Query: 1519 PDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFN 1340
            PD+EPLVYANKQEAK+AFK+LLESA+V+SDW+WEQ MR IINDKRYGALKTLGERKQAFN
Sbjct: 462  PDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGALKTLGERKQAFN 521

Query: 1339 EYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERP 1160
            EYLGQRKKLEAEE+R+RQKKAREEFTKMLEES+ELTSS+RW +A S+FENDERF AVER 
Sbjct: 522  EYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLFENDERFKAVERA 581

Query: 1159 RDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDE 980
            RDREDLF+ YI               +RN+ E+R+FLESCDFIK NSQWRKVQDRLEDDE
Sbjct: 582  RDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKANSQWRKVQDRLEDDE 641

Query: 979  RCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTL 800
            RC RLEKI RL +FQ                      RAERKNRD FRKLMDEHV  GTL
Sbjct: 642  RCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKEQLRRAERKNRDAFRKLMDEHVVDGTL 701

Query: 799  TAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQ 665
            TAKT+WRDYC+KVKD   YLAV+SNTSGSTPKDLFED+ EELEKQ
Sbjct: 702  TAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVVEELEKQ 746


>ref|XP_007018443.1| Pre-mRNA-processing protein 40A isoform 8 [Theobroma cacao]
            gi|508723771|gb|EOY15668.1| Pre-mRNA-processing protein
            40A isoform 8 [Theobroma cacao]
          Length = 789

 Score =  898 bits (2320), Expect = 0.0
 Identities = 489/767 (63%), Positives = 559/767 (72%), Gaps = 10/767 (1%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPP-SMQFRPAVNPQQGQSFIQAASQQFRPIGQG 2762
            M+N  Q S AQP  PP +GS+GPQS+G P S QFRP V  QQGQ F+ AASQQFRP+GQ 
Sbjct: 1    MANNSQPSSAQPHWPPAVGSLGPQSYGSPLSSQFRPVVPMQQGQHFVPAASQQFRPVGQ- 59

Query: 2761 ISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPG---HATQAIQMPYIQQNRPLTSGSPQS 2591
            + SS VGMP+ Q+Q +QFSQPMQQ+PPRP QPG    + Q + +P+ Q NRPLTSGSPQS
Sbjct: 60   VPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQS 119

Query: 2590 QQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQMQSSNVPVGG 2414
             QTA PL++ MPGL   GMP SSSY++ PSSFGQPQ+NV+A  QFQ   Q+ +S  PV G
Sbjct: 120  HQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAG 179

Query: 2413 QPWLSSGNQSAALISPVQQTGHSS-VNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYY 2237
            QPWLSSGNQS +L  P+QQTG    + ++   A N P  T  S+SDWQEHTSADGRRYYY
Sbjct: 180  QPWLSSGNQSVSLAIPIQQTGQQPPLISSADTAANAPIHTPPSASDWQEHTSADGRRYYY 239

Query: 2236 NKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKW 2057
            NKKTRQSSWEKP+ELMTPIE          RADASTVWKEFTTPEGRKYYYNKVTKQSKW
Sbjct: 240  NKKTRQSSWEKPLELMTPIE----------RADASTVWKEFTTPEGRKYYYNKVTKQSKW 289

Query: 2056 TIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPV 1877
            TIPEELK+AREQA+  ASQG   +                       I VS  TS     
Sbjct: 290  TIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMPAAA-IPVSSNTSQ---- 344

Query: 1876 IASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGN 1697
             ASSP+ VTPV AA  +PSP  VSGS+ +PV+Q   AT    + S  V VTPLPA   G 
Sbjct: 345  -ASSPVSVTPV-AAVANPSPTLVSGSTVVPVSQSA-ATNASEVQSPAVAVTPLPAVSSGG 401

Query: 1696 TVVPDASANASTTSM-SLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKT 1520
            +  P  S NA+TT + SL++ +SQ +    +G S QDIEEAKKGMA AGK+NVTPVEEK 
Sbjct: 402  STTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATAGKVNVTPVEEKV 461

Query: 1519 PDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFN 1340
            PD+EPLVYANKQEAK+AFK+LLESA+V+SDW+WEQ MR IINDKRYGALKTLGERKQAFN
Sbjct: 462  PDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGALKTLGERKQAFN 521

Query: 1339 EYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERP 1160
            EYLGQRKKLEAEE+R+RQKKAREEFTKMLEES+ELTSS+RW +A S+FENDERF AVER 
Sbjct: 522  EYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLFENDERFKAVERA 581

Query: 1159 RDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDE 980
            RDREDLF+ YI               +RN+ E+R+FLESCDFIK NSQWRKVQDRLEDDE
Sbjct: 582  RDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKANSQWRKVQDRLEDDE 641

Query: 979  RCLRLEKIQRLFVFQ--XXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAG 806
            RC RLEKI RL +FQ                        RAERKNRD FRKLMDEHV  G
Sbjct: 642  RCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKVEEQLRRAERKNRDAFRKLMDEHVVDG 701

Query: 805  TLTAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQ 665
            TLTAKT+WRDYC+KVKD   YLAV+SNTSGSTPKDLFED+ EELEKQ
Sbjct: 702  TLTAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVVEELEKQ 748


>ref|XP_004292768.1| PREDICTED: pre-mRNA-processing protein 40A-like [Fragaria vesca
            subsp. vesca]
          Length = 990

 Score =  892 bits (2305), Expect = 0.0
 Identities = 500/985 (50%), Positives = 615/985 (62%), Gaps = 7/985 (0%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPPIGSMGPQSFGPPSMQFRPAVNPQQGQSFIQAASQQFRPIGQGIS 2756
            M+N PQSS AQ                    +RP V  QQGQ FI   SQQF+P+GQG  
Sbjct: 1    MANNPQSSAAQ--------------------YRPMVPAQQGQHFISPGSQQFQPVGQG-- 38

Query: 2755 SSPVGMPSGQSQTLQFSQPMQQYPPRPTQPGHA---TQAIQMPYIQQNRPLTSGSPQSQQ 2585
                       Q LQ+SQ MQ YP RP QPGHA   +QA+ MPY Q  RP+TS  P SQQ
Sbjct: 39   -----------QPLQYSQQMQPYPLRPNQPGHAQPSSQALPMPYYQP-RPVTSVPPHSQQ 86

Query: 2584 TAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAP-QFQSLPQMQSSNVPVGGQP 2408
             A P +NQMPG     MP  SSY +A  S+ QPQ+N N+  QFQ + Q Q+  VP  GQP
Sbjct: 87   PAPPFNNQMPG-----MPYPSSYMYAQPSYAQPQNNANSSSQFQPMSQDQAHGVPTAGQP 141

Query: 2407 WLSSGNQSAALISPVQQTGHSSVNAATV-PAINGPNSTLQSSSDWQEHTSADGRRYYYNK 2231
            W+SS +   A ++P QQ      +     PA+N PN    SSSDWQEH ++DGRRYY+N+
Sbjct: 142  WMSSSSHQGAAVTPQQQPSQQPTSTPFPDPAVNAPNLAQPSSSDWQEHMASDGRRYYFNR 201

Query: 2230 KTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKWTI 2051
             TRQSSWEKP+ELMTP+E          RADASTVWKE+T+ +G+KYYYNKVT++SKWTI
Sbjct: 202  STRQSSWEKPLELMTPLE----------RADASTVWKEYTSADGKKYYYNKVTRESKWTI 251

Query: 2050 PEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPVIA 1871
            PEELK+AREQA++  +QG Q E                        SV  +TSS  P   
Sbjct: 252  PEELKLAREQAQREHTQGTQSEMTSTSHAPPATASAEIHAGAS---SVGPSTSSAQPGTV 308

Query: 1870 SSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGNTV 1691
            SSP+ VTP+ +A  +PSP   SG S  P  Q  +AT  VG+    V V+PLPA+ +G+T 
Sbjct: 309  SSPVAVTPI-SAFSNPSPTTPSGLSVAPGVQSSMATGSVGVQPAVVNVSPLPASNVGSTG 367

Query: 1690 VPDASANASTTSMSLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKTPDE 1511
            +P    N  T S++ +N + Q ++SS+DG S QDIEEAKKGMAVAGK+NVTP EEK  D+
Sbjct: 368  LPSTLVNTITKSVN-ENQAPQDSASSIDGASSQDIEEAKKGMAVAGKVNVTPSEEKAIDD 426

Query: 1510 EPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFNEYL 1331
            EPLVYA+KQEAK+AFK+LLESA+V SDW+WEQAMR IINDKRYGAL+TLGERKQAFNEYL
Sbjct: 427  EPLVYASKQEAKNAFKSLLESANVHSDWTWEQAMREIINDKRYGALRTLGERKQAFNEYL 486

Query: 1330 GQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERPRDR 1151
            GQRKKLE EE+R+RQK+AREEFTKMLEES+ELTS+IRW +AV+MFENDERF AVER RDR
Sbjct: 487  GQRKKLENEERRIRQKRAREEFTKMLEESKELTSTIRWSKAVTMFENDERFKAVERARDR 546

Query: 1150 EDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDERCL 971
            EDL++ YI               +RN+ E+++FLESCDFIK    WRKVQDRLEDDERCL
Sbjct: 547  EDLYESYIVELERKEKEIAAEEHRRNISEYKEFLESCDFIK----WRKVQDRLEDDERCL 602

Query: 970  RLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTLTAK 791
            RL+K  RL +FQ                      R ERKNRDEFRK+++EH A GTLTAK
Sbjct: 603  RLDKFDRLLIFQDHIRDLEKEEEEQKKIQKEQLRRIERKNRDEFRKILEEHAADGTLTAK 662

Query: 790  THWRDYCMKVKDSTAYLAVSSNTSG-STPKDLFEDIAEELEKQYHEDRARIKDAVKLGKI 614
            T WRDYCMKVKD   Y AV++NT G STPKDLFED+AE+LEKQ+ ED+AR+KDA+K G+I
Sbjct: 663  TQWRDYCMKVKDLPQYEAVAANTHGSSTPKDLFEDVAEDLEKQFVEDKARVKDAMKQGQI 722

Query: 613  TLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDFTK 434
            T+ S+WT E+FKAA++ D+G P IS++N                           +DF K
Sbjct: 723  TMVSSWTFEEFKAAVVNDIGFPSISELNLKLAYEDILERAREKEEKEAKKRLRIADDFHK 782

Query: 433  LLRS-KEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQXXXXXXXXXXXX 257
            LL + KEI  SS+WEDCK L E++QEYRS+G+E F +EIFEEYI  L             
Sbjct: 783  LLHTFKEITVSSSWEDCKQLFEETQEYRSVGDEDFGREIFEEYITSLHERAKEKERKREE 842

Query: 256  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDETESEIVDIVDVHGYXXXXXX 77
                                                 KDET+SE VD+ D H +      
Sbjct: 843  EKAKKEKEREEKEKRKDKERKEKDREREKEKGKERSKKDETDSETVDMTDSHDHKEDKKR 902

Query: 76   XXXXXXXXXXXHQSAVDEGSSDKDE 2
                       HQS++D+  SDK+E
Sbjct: 903  EKDKDRKHRKRHQSSIDDVGSDKEE 927


>ref|XP_002510055.1| protein binding protein, putative [Ricinus communis]
            gi|223550756|gb|EEF52242.1| protein binding protein,
            putative [Ricinus communis]
          Length = 970

 Score =  884 bits (2285), Expect = 0.0
 Identities = 518/987 (52%), Positives = 617/987 (62%), Gaps = 9/987 (0%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPPIGSMGPQSFGPPSMQFRPAVNPQQGQSFIQAASQQFRPIGQGIS 2756
            M NT QSSG Q                    FRPA   QQGQ F+    QQF P+ QG+ 
Sbjct: 1    MDNTSQSSGTQ--------------------FRPA---QQGQPFMP---QQFLPVVQGMP 34

Query: 2755 SSP-VGMPSGQSQTLQFSQPMQQYPPRPTQPGHATQAIQM----PYIQQNRP-LTSGSPQ 2594
            S+  + MP+GQ+QTLQFSQPMQ  PP P  P H   + Q     PY+ QNRP LTSG PQ
Sbjct: 35   SNVGMPMPAGQTQTLQFSQPMQP-PPWPNHPAHVAPSSQPVPLPPYVHQNRPPLTSGPPQ 93

Query: 2593 SQQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSN-VNAPQFQSLPQMQSSNVPVG 2417
             QQTA                      FAPSS+GQ Q+N +++ QFQ +PQM +  VP G
Sbjct: 94   LQQTAS--------------------LFAPSSYGQLQNNAISSSQFQPMPQMHTPVVPAG 133

Query: 2416 GQPWLSSGNQSAALISPVQQTGHS-SVNAATVPAINGPNSTLQSSSDWQEHTSADGRRYY 2240
            GQ WL SG+   A+ +PVQ TG   SV++++   +N PN   QS SDWQEHT++DGRRYY
Sbjct: 134  GQHWLPSGSNGVAVATPVQPTGQQPSVSSSSDSVLNVPNQ--QSLSDWQEHTASDGRRYY 191

Query: 2239 YNKKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSK 2060
            YNK+T+QSSWEKP+ELMTP+E          RADASTVWKEFTTPEG+KYYYNK+TKQSK
Sbjct: 192  YNKRTKQSSWEKPLELMTPLE----------RADASTVWKEFTTPEGKKYYYNKITKQSK 241

Query: 2059 WTIPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLP 1880
            W++P+ELK+AREQA++ A+QG + E                       I V    SST  
Sbjct: 242  WSMPDELKLAREQAQQTATQGTKSEADAASHASVTVNASSGEMSTTV-IPVGSGFSSTSG 300

Query: 1879 VIASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLG 1700
            V ASSP+PVTPVVA   S    AVS SSA+PVAQ +IA    G+    VT+T LPAA  G
Sbjct: 301  V-ASSPVPVTPVVAV--SNPVAAVSSSSALPVAQSIIANAA-GVQPPAVTMTVLPAAAGG 356

Query: 1699 NTVVPDASANASTTSMSLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKT 1520
                              DN++S+ A+ S+DG S+Q+ EE KKG  V+ K +    EEK 
Sbjct: 357  -----------------FDNVASKGAAPSVDGASIQNSEEVKKGSGVSIKSDANLTEEKN 399

Query: 1519 PDEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFN 1340
             D+EPL +A+KQEAK+AFKALLESA+V+SDW+WEQ MR IINDKRYGALKTLGERKQAFN
Sbjct: 400  LDDEPLTFASKQEAKNAFKALLESANVQSDWTWEQTMREIINDKRYGALKTLGERKQAFN 459

Query: 1339 EYLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERP 1160
            EYLGQRKK+EAEE+R+RQK+AREEFTKMLEES+ELTSS++W +AVS+FENDERF AVE+ 
Sbjct: 460  EYLGQRKKIEAEERRMRQKRAREEFTKMLEESKELTSSMKWSKAVSLFENDERFKAVEKA 519

Query: 1159 RDREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDE 980
            RDREDLFD YI               +RNV EF++FLESCDFIKVNSQWRKVQDRLEDDE
Sbjct: 520  RDREDLFDNYIVELERKEREKAAEDHRRNVTEFKKFLESCDFIKVNSQWRKVQDRLEDDE 579

Query: 979  RCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTL 800
            RCLRLEK+ RL VFQ                      RAERKNRD FRKL++EHVA G+L
Sbjct: 580  RCLRLEKLDRLLVFQDYIRDLEKEEEEQKKIQKEQLRRAERKNRDGFRKLLEEHVADGSL 639

Query: 799  TAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRARIKDAVKLG 620
            TAK HW DYC+KVKD   Y AV++NTSGSTPKDLFED+AEELEKQY +D+AR+KDA+K G
Sbjct: 640  TAKAHWLDYCLKVKDLPQYHAVATNTSGSTPKDLFEDVAEELEKQYRDDKARVKDAIKSG 699

Query: 619  KITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDF 440
            KI + STW  EDFKAAIL+DV SPP+SDIN                           +D 
Sbjct: 700  KIIMTSTWIFEDFKAAILDDVSSPPVSDINLQLIYDELLERAKEKEEKEAKKRQRLADDL 759

Query: 439  TKLLRS-KEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQXXXXXXXXXX 263
            TKLL + KEI+ASS+WEDC+PL E+SQEYR+IGEES  KEIFEEYIAHLQ          
Sbjct: 760  TKLLHTYKEIMASSSWEDCRPLFEESQEYRAIGEESVIKEIFEEYIAHLQEKAKEKERKR 819

Query: 262  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKDETESEIVDIVDVHGYXXXX 83
                                                   KDET+SE VD  D +G+    
Sbjct: 820  EEEKVKKEKEREEKEKRKERERKEKEKEREREKAKERIKKDETDSENVDTTDSYGHKEDK 879

Query: 82   XXXXXXXXXXXXXHQSAVDEGSSDKDE 2
                         H S  DE SSDKDE
Sbjct: 880  KREKDKDRKHRKRHHSGTDEVSSDKDE 906


>ref|XP_006827042.1| hypothetical protein AMTR_s00010p00227470 [Amborella trichopoda]
            gi|548831471|gb|ERM94279.1| hypothetical protein
            AMTR_s00010p00227470 [Amborella trichopoda]
          Length = 985

 Score =  883 bits (2281), Expect = 0.0
 Identities = 503/881 (57%), Positives = 589/881 (66%), Gaps = 16/881 (1%)
 Frame = -3

Query: 2887 IGSMGPQSFGPP-SMQFRPAVNPQQGQSFIQAASQQFRPIGQGISSSPVGMPSG-QSQTL 2714
            +G  GPQ++G P SMQFRP V  QQ Q FI A SQQFRP+GQGI +S +G PS  Q+Q  
Sbjct: 1    MGPGGPQNYGTPMSMQFRPMVPTQQSQPFISAPSQQFRPVGQGIPASNIGSPSPVQAQQA 60

Query: 2713 QFSQPMQQYPPRPTQPGHAT---QAIQMPYIQQNRPLTSGSPQSQQTAHPLSNQMPGLVG 2543
            Q++  MQQ PPRP Q        Q + + YIQ NRP+TSG  Q  Q    ++   PGL G
Sbjct: 61   QYALGMQQLPPRPAQTAQVAPSPQTVPLSYIQPNRPMTSGPLQIPQNPQHVNIHPPGLGG 120

Query: 2542 SGMPLSSSYTF-APSSFGQPQSNVN-APQFQSLPQMQSSNVPVG--GQPWLSSGNQSAAL 2375
             G  LSSSYTF APSS+  PQ+N+N + Q+Q   QMQ   VP G  GQPWLSSG+QS  +
Sbjct: 121  PGTVLSSSYTFTAPSSYVHPQNNINISSQYQPSSQMQVPGVPSGSGGQPWLSSGSQSTTV 180

Query: 2374 ISPVQQTGH-SSVNAATVP-AINGPNSTLQSSSDWQEHTSADGRRYYYNKKTRQSSWEKP 2201
            I PV Q    SS  A+T P A   PN T QSSSDWQEHTSADGRRYYYNKKTRQSSWEKP
Sbjct: 181  IPPVVQASQQSSFAASTAPVATPQPNPTSQSSSDWQEHTSADGRRYYYNKKTRQSSWEKP 240

Query: 2200 MELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPEELKMAREQ 2021
            +ELMTPIE          RADASTVWKEFTTPEGRKYYYNKVTKQSKWTIP+ELK+AREQ
Sbjct: 241  LELMTPIE----------RADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPDELKLAREQ 290

Query: 2020 AEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPVIASSPIPVTPVV 1841
            AEK    G QL                             T S T  V+ASS  PVT  V
Sbjct: 291  AEK---NGTQL-----------------------------TNSETTDVVASS-TPVTVTV 317

Query: 1840 AAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTP---LPAALLGNTVVPDASAN 1670
                 PS +A     AI   Q  + +T  G+ + PV VTP   +PAA      V  +SA 
Sbjct: 318  PLTEMPSTVA-----AISATQSAMPST-SGMATSPVLVTPVVSVPAA-----AVDPSSAG 366

Query: 1669 ASTTSMSLDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPV-EEKTPDEEPLVYA 1493
            A+   + +DN+S +  +   D TS QD+EEA+K M VAGK+N+TP  +EKT DEEPLV+A
Sbjct: 367  AAYEKIKVDNVSPESIAQVADETSAQDLEEARKAMPVAGKVNITPTSDEKTVDEEPLVFA 426

Query: 1492 NKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFNEYLGQRKKL 1313
            +KQEAK+AFK LL SAHVESDW+W+QAMRVIINDKRYGALKTLGERKQAFNEYLGQRKKL
Sbjct: 427  SKQEAKNAFKELLVSAHVESDWTWDQAMRVIINDKRYGALKTLGERKQAFNEYLGQRKKL 486

Query: 1312 EAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERPRDREDLFDG 1133
            EAEEKR RQKKARE+F KMLEES+ELTS+ +W +A++MFE+DERF AVER RDRE+LF+ 
Sbjct: 487  EAEEKRTRQKKAREDFVKMLEESKELTSATKWSKAITMFEDDERFRAVERGRDREELFEM 546

Query: 1132 YIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDERCLRLEKIQ 953
            ++               +RNV E+R FLESCDFIK +SQWRKVQDRLEDDERC RLEKI 
Sbjct: 547  HLEELHRKERAKAQEEHRRNVQEYRAFLESCDFIKASSQWRKVQDRLEDDERCARLEKID 606

Query: 952  RLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTLTAKTHWRDY 773
            RL +FQ                      RAERKNRD+FRKLM+ H+AAG LTAKTHWR+Y
Sbjct: 607  RLEIFQEYIRDLEKEEEEQRKLQKEHLRRAERKNRDDFRKLMEGHIAAGILTAKTHWREY 666

Query: 772  CMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRARIKDAVKLGKITLASTWT 593
            CMKVKD  AYLAVSSNTSGSTPKDLFED AEEL+KQY EDR RIKDAVK+ +  + STW+
Sbjct: 667  CMKVKDLPAYLAVSSNTSGSTPKDLFEDTAEELDKQYQEDRTRIKDAVKMARFVMTSTWS 726

Query: 592  LEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDFTKLLRS-KE 416
             E+FK AI ED     IS+ N                           +D   LL S K+
Sbjct: 727  FENFKEAISEDNNLKSISETNLKLVFDELLERLKEKEEKEAKKRQRMADDLKDLLYSIKD 786

Query: 415  IIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQ 293
            I ASS WE+CKPLLE++Q YRSI +ESF ++IFEEY+A+LQ
Sbjct: 787  ISASSRWEECKPLLEENQAYRSINDESFARQIFEEYVAYLQ 827


>ref|XP_004141297.1| PREDICTED: pre-mRNA-processing protein 40A-like [Cucumis sativus]
          Length = 985

 Score =  879 bits (2272), Expect = 0.0
 Identities = 506/955 (52%), Positives = 612/955 (64%), Gaps = 7/955 (0%)
 Frame = -3

Query: 2845 QFRPAVNPQQGQSFIQAASQQFRPIGQGISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQP 2666
            QFRP +  Q GQ+FI +++QQF+  GQ ISSS VG+P+GQ Q  Q+ Q M Q   RP  P
Sbjct: 11   QFRPVIPAQPGQAFISSSAQQFQLAGQNISSSNVGVPAGQVQPHQYPQSMPQLVQRPGHP 70

Query: 2665 GHAT---QAIQMPYIQQNRPLTSGSPQSQQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSF 2495
             + T   Q IQMPY+Q  RPLTS  PQSQQ     +N M GL   G+PLSS YTF     
Sbjct: 71   SYVTPSSQPIQMPYVQ-TRPLTSVPPQSQQNVAAPNNHMHGLGAHGLPLSSPYTF----- 124

Query: 2494 GQPQSNVNAPQFQSLPQMQSSNVPVGG-QPWLSSGNQSAALISPVQQTG-HSSVNAATVP 2321
             QP S ++AP            V VG  QPWLSS +Q+  L+SP+ Q   HSSV+A   P
Sbjct: 125  -QPMSQMHAP------------VSVGNSQPWLSSASQTTNLVSPIDQANQHSSVSAVN-P 170

Query: 2320 AINGPNSTLQSSSDWQEHTSADGRRYYYNKKTRQSSWEKPMELMTPIEXXXXXXXXXXRA 2141
            A N P    Q SSDWQEH SADGRRYYYNKKT+QSSWEKP+ELMTP+E          RA
Sbjct: 171  AANAPVFNQQLSSDWQEHASADGRRYYYNKKTKQSSWEKPLELMTPLE----------RA 220

Query: 2140 DASTVWKEFTTPEGRKYYYNKVTKQSKWTIPEELKMAREQAEKAASQGIQLEXXXXXXXX 1961
            DASTVWKEFT P+GRKYYYNKVTK+SKWT+PEELK+AREQA+K A+QG Q +        
Sbjct: 221  DASTVWKEFTAPDGRKYYYNKVTKESKWTMPEELKLAREQAQKEATQGTQTDISVMAPQP 280

Query: 1960 XXXXXXXXXXXXXXAISVSHTTSSTLPVIASSPIPVTPVVAAEPSPSPLAVSGSSAIPVA 1781
                          + SV+ + S T+  +A+SP+PVTP V+   SPS + V+GSSAI   
Sbjct: 281  TLAAGLSHAETPAIS-SVNSSISPTVSGVATSPVPVTPFVSVSNSPSVM-VTGSSAI-TG 337

Query: 1780 QPVIATTVVGLHSLPVTVTPLPAALLGNTVVPDA-SANASTTSMSLDNISSQVASSSLDG 1604
             P+ +TT     S+  TV+    A  G T  P    ANAS+ +   ++++SQ   +++DG
Sbjct: 338  TPIASTT-----SVSGTVSSQSVAASGGTGPPAVVHANASSVT-PFESLASQDVKNTVDG 391

Query: 1603 TSMQDIEEAKKGMAVAGKINVTPVEEKTPDEEPLVYANKQEAKSAFKALLESAHVESDWS 1424
            TS +DIEEA+KGMAVAGK+N T +EEK+ D+EPLV+ANKQEAK+AFKALLES +V+SDW+
Sbjct: 392  TSTEDIEEARKGMAVAGKVNETVLEEKSADDEPLVFANKQEAKNAFKALLESVNVQSDWT 451

Query: 1423 WEQAMRVIINDKRYGALKTLGERKQAFNEYLGQRKKLEAEEKRLRQKKAREEFTKMLEES 1244
            WEQAMR IINDKRYGALKTLGERKQAF+EYLG RKKL+AEE+R+RQKKAREEFTKMLEES
Sbjct: 452  WEQAMREIINDKRYGALKTLGERKQAFHEYLGHRKKLDAEERRIRQKKAREEFTKMLEES 511

Query: 1243 QELTSSIRWREAVSMFENDERFTAVERPRDREDLFDGYIXXXXXXXXXXXXXXXKRNVME 1064
            +ELTSS RW +AVSMFENDERF AVER RDREDLF+ YI               K+N+ E
Sbjct: 512  KELTSSTRWSKAVSMFENDERFKAVERSRDREDLFESYIVELERKEKERAAEEHKKNIAE 571

Query: 1063 FRQFLESCDFIKVNSQWRKVQDRLEDDERCLRLEKIQRLFVFQXXXXXXXXXXXXXXXXX 884
            +R+FLESCD+IKV+SQWRKVQDRLEDDERC RLEK+ RL +FQ                 
Sbjct: 572  YRKFLESCDYIKVSSQWRKVQDRLEDDERCSRLEKLDRLLIFQDYIRDLEKEEEDQKKIQ 631

Query: 883  XXXXXRAERKNRDEFRKLMDEHVAAGTLTAKTHWRDYCMKVKDSTAYLAVSSNTSGSTPK 704
                 R ERKNRDEFRKLM+EH+AAG  TAKT WRDYC+KVK+   Y AV+SNTSGSTPK
Sbjct: 632  KERVRRIERKNRDEFRKLMEEHIAAGVFTAKTFWRDYCLKVKELPQYQAVASNTSGSTPK 691

Query: 703  DLFEDIAEELEKQYHEDRARIKDAVKLGKITLASTWTLEDFKAAILEDVGSPPISDINTX 524
            DLFED+ E+LE +YHE++ +IKD VK  KIT+ S+WT +DFKAAI E+ GS  +SDIN  
Sbjct: 692  DLFEDVLEDLENKYHEEKTQIKDVVKAAKITITSSWTFDDFKAAI-EESGSLAVSDINFK 750

Query: 523  XXXXXXXXXXXXXXXXXXXXXXXXXEDFTKLLRS-KEIIASSNWEDCKPLLEDSQEYRSI 347
                                     +DF+ LL+S KEI  SSNWED K L E+S+EYRSI
Sbjct: 751  LVYEDLLERAKEKEEKEAKRRQRLADDFSGLLQSLKEITTSSNWEDSKQLFEESEEYRSI 810

Query: 346  GEESFKKEIFEEYIAHLQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 167
            GEESF KE+FEE+I HLQ                                          
Sbjct: 811  GEESFAKEVFEEHITHLQ--EKAKEKERKREEEKAKKEKEREEKEKRKEKERKEKDRERE 868

Query: 166  XXXXXXXKDETESEIVDIVDVHGYXXXXXXXXXXXXXXXXXHQSAVDEGSSDKDE 2
                   KDET+SE VD+ D H Y                 H SA D+G+SDKDE
Sbjct: 869  KEKGRVKKDETDSENVDVSDTHVYREDKKRDKDKDRKHRKRHHSATDDGASDKDE 923


>ref|XP_006343435.1| PREDICTED: pre-mRNA-processing protein 40A-like isoform X3 [Solanum
            tuberosum]
          Length = 864

 Score =  873 bits (2256), Expect = 0.0
 Identities = 492/889 (55%), Positives = 584/889 (65%), Gaps = 8/889 (0%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPPSMQFRPAVNPQQGQSFIQ--AASQQFRPIGQ 2765
            M++ P  SG QP  PP +GS  PQ FG   MQFRPA++ QQGQ F    +AS Q+RP+GQ
Sbjct: 1    MASNPPPSGPQPLWPPSVGSTPPQGFGSFPMQFRPALSTQQGQHFAPPISASPQYRPVGQ 60

Query: 2764 GISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPGHAT---QAIQMPYIQQNRPLTSGSPQ 2594
               +   GMP GQ Q  QFSQ MQQ+PPRP Q GH T   QAIQM YIQ      S  PQ
Sbjct: 61   ---TPNAGMPPGQGQIPQFSQTMQQFPPRPGQSGHGTPSSQAIQMSYIQ------SSIPQ 111

Query: 2593 SQQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAPQFQSLPQMQSSNVPVGG 2414
             QQ   PL++ MPG+ G+G P SSSYT   SS                 QM     P GG
Sbjct: 112  PQQVNPPLNSHMPGVSGAGNPFSSSYTVQSSS-----------------QMHGPTFPAGG 154

Query: 2413 QPWLSSGNQSAALISPVQQTGHSSVNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYYN 2234
            Q WLSSG+Q+  + +P   + H    +A  PA+    ++ Q++SDWQE+ +ADGRRYYYN
Sbjct: 155  QTWLSSGSQTTPVAAPTPPSSHQL--SAVAPAVPASTASQQTASDWQEYEAADGRRYYYN 212

Query: 2233 KKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKWT 2054
            K T+QSSWEKP+ELMTP+E          RADASTVWKEFTT +GRKYYYNK TKQSKWT
Sbjct: 213  KNTKQSSWEKPLELMTPLE----------RADASTVWKEFTTADGRKYYYNKETKQSKWT 262

Query: 2053 IPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPVI 1874
            IP+ELK+ARE AE AA Q +Q                           VS T SST+  +
Sbjct: 263  IPDELKLARELAENAAGQVVQT-GTSTNSGVQVSEAVTPAEQPSAVTPVSSTPSSTVSGV 321

Query: 1873 ASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGNT 1694
            ASSP+PVTP V+   +P PL VSGSSAIP     + T+  G+ S  V+ +   AAL    
Sbjct: 322  ASSPVPVTPAVSDVNTP-PLVVSGSSAIPSVSLAV-TSSAGVSSPAVSGSTESAAL---- 375

Query: 1693 VVPDASANASTTSMS-LDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKTP 1517
                  ANA  T MS ++N+S QVASS L G S QDIEEAKKGMAVAGKINV P EEK+ 
Sbjct: 376  ------ANAYQTQMSGIENLSPQVASS-LSGASSQDIEEAKKGMAVAGKINVVPAEEKSA 428

Query: 1516 DEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFNE 1337
            DEEP +YA KQEAK+AFKALLESA+VESDW+WEQ MRVIINDKRYGALKTLGERKQAFNE
Sbjct: 429  DEEPFLYATKQEAKNAFKALLESANVESDWTWEQTMRVIINDKRYGALKTLGERKQAFNE 488

Query: 1336 YLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERPR 1157
            YL QRKK EAEE+RLRQ+KA+EEFTKMLEES+ELTSS RW +AV+MFE+DERF AVER  
Sbjct: 489  YLMQRKKQEAEERRLRQRKAKEEFTKMLEESKELTSSTRWSKAVTMFEDDERFKAVEREA 548

Query: 1156 DREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDER 977
            DREDLF  Y+               +RN +E++QFLE+C FIKV++QWRKVQD LEDDER
Sbjct: 549  DREDLFRNYLVDLQKKERSKAQEEYRRNRLEYKQFLETCGFIKVDTQWRKVQDLLEDDER 608

Query: 976  CLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTLT 797
            C RLEK+ RL +FQ                      RAERKNRD FRK+++EH+AAG LT
Sbjct: 609  CSRLEKLDRLEIFQEYIRDLEKEDEEQRKLQKEQLRRAERKNRDAFRKMIEEHIAAGMLT 668

Query: 796  AKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRARIKDAVKLGK 617
            AKT WRDYC  VK+  AY AV+SNTSGSTPKDLFED+ EELEKQYHED+ R+KD VK  K
Sbjct: 669  AKTSWRDYCQMVKEFVAYQAVASNTSGSTPKDLFEDVTEELEKQYHEDKIRVKDVVKSEK 728

Query: 616  ITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDFT 437
            IT++STWT EDFK AI E +GSP I D+N                           +DFT
Sbjct: 729  ITISSTWTFEDFKVAIFEGIGSPSIHDVNLQLIFEDLVERAKEKEEKEAKKHQRLAKDFT 788

Query: 436  -KLLRSKEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQ 293
             KL   KEI  SS+WE+ K L+EDS E+R+IGEE+  + +FEEY+A LQ
Sbjct: 789  DKLSSIKEITDSSSWEESKELVEDSSEFRAIGEETISRAVFEEYVAWLQ 837


>ref|XP_006343434.1| PREDICTED: pre-mRNA-processing protein 40A-like isoform X2 [Solanum
            tuberosum]
          Length = 872

 Score =  873 bits (2256), Expect = 0.0
 Identities = 492/889 (55%), Positives = 584/889 (65%), Gaps = 8/889 (0%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPPSMQFRPAVNPQQGQSFIQ--AASQQFRPIGQ 2765
            M++ P  SG QP  PP +GS  PQ FG   MQFRPA++ QQGQ F    +AS Q+RP+GQ
Sbjct: 1    MASNPPPSGPQPLWPPSVGSTPPQGFGSFPMQFRPALSTQQGQHFAPPISASPQYRPVGQ 60

Query: 2764 GISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPGHAT---QAIQMPYIQQNRPLTSGSPQ 2594
               +   GMP GQ Q  QFSQ MQQ+PPRP Q GH T   QAIQM YIQ      S  PQ
Sbjct: 61   ---TPNAGMPPGQGQIPQFSQTMQQFPPRPGQSGHGTPSSQAIQMSYIQ------SSIPQ 111

Query: 2593 SQQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAPQFQSLPQMQSSNVPVGG 2414
             QQ   PL++ MPG+ G+G P SSSYT   SS                 QM     P GG
Sbjct: 112  PQQVNPPLNSHMPGVSGAGNPFSSSYTVQSSS-----------------QMHGPTFPAGG 154

Query: 2413 QPWLSSGNQSAALISPVQQTGHSSVNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYYN 2234
            Q WLSSG+Q+  + +P   + H    +A  PA+    ++ Q++SDWQE+ +ADGRRYYYN
Sbjct: 155  QTWLSSGSQTTPVAAPTPPSSHQL--SAVAPAVPASTASQQTASDWQEYEAADGRRYYYN 212

Query: 2233 KKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKWT 2054
            K T+QSSWEKP+ELMTP+E          RADASTVWKEFTT +GRKYYYNK TKQSKWT
Sbjct: 213  KNTKQSSWEKPLELMTPLE----------RADASTVWKEFTTADGRKYYYNKETKQSKWT 262

Query: 2053 IPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPVI 1874
            IP+ELK+ARE AE AA Q +Q                           VS T SST+  +
Sbjct: 263  IPDELKLARELAENAAGQVVQT-GTSTNSGVQVSEAVTPAEQPSAVTPVSSTPSSTVSGV 321

Query: 1873 ASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGNT 1694
            ASSP+PVTP V+   +P PL VSGSSAIP     + T+  G+ S  V+ +   AAL    
Sbjct: 322  ASSPVPVTPAVSDVNTP-PLVVSGSSAIPSVSLAV-TSSAGVSSPAVSGSTESAAL---- 375

Query: 1693 VVPDASANASTTSMS-LDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKTP 1517
                  ANA  T MS ++N+S QVASS L G S QDIEEAKKGMAVAGKINV P EEK+ 
Sbjct: 376  ------ANAYQTQMSGIENLSPQVASS-LSGASSQDIEEAKKGMAVAGKINVVPAEEKSA 428

Query: 1516 DEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFNE 1337
            DEEP +YA KQEAK+AFKALLESA+VESDW+WEQ MRVIINDKRYGALKTLGERKQAFNE
Sbjct: 429  DEEPFLYATKQEAKNAFKALLESANVESDWTWEQTMRVIINDKRYGALKTLGERKQAFNE 488

Query: 1336 YLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERPR 1157
            YL QRKK EAEE+RLRQ+KA+EEFTKMLEES+ELTSS RW +AV+MFE+DERF AVER  
Sbjct: 489  YLMQRKKQEAEERRLRQRKAKEEFTKMLEESKELTSSTRWSKAVTMFEDDERFKAVEREA 548

Query: 1156 DREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDER 977
            DREDLF  Y+               +RN +E++QFLE+C FIKV++QWRKVQD LEDDER
Sbjct: 549  DREDLFRNYLVDLQKKERSKAQEEYRRNRLEYKQFLETCGFIKVDTQWRKVQDLLEDDER 608

Query: 976  CLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTLT 797
            C RLEK+ RL +FQ                      RAERKNRD FRK+++EH+AAG LT
Sbjct: 609  CSRLEKLDRLEIFQEYIRDLEKEDEEQRKLQKEQLRRAERKNRDAFRKMIEEHIAAGMLT 668

Query: 796  AKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRARIKDAVKLGK 617
            AKT WRDYC  VK+  AY AV+SNTSGSTPKDLFED+ EELEKQYHED+ R+KD VK  K
Sbjct: 669  AKTSWRDYCQMVKEFVAYQAVASNTSGSTPKDLFEDVTEELEKQYHEDKIRVKDVVKSEK 728

Query: 616  ITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDFT 437
            IT++STWT EDFK AI E +GSP I D+N                           +DFT
Sbjct: 729  ITISSTWTFEDFKVAIFEGIGSPSIHDVNLQLIFEDLVERAKEKEEKEAKKHQRLAKDFT 788

Query: 436  -KLLRSKEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQ 293
             KL   KEI  SS+WE+ K L+EDS E+R+IGEE+  + +FEEY+A LQ
Sbjct: 789  DKLSSIKEITDSSSWEESKELVEDSSEFRAIGEETISRAVFEEYVAWLQ 837


>ref|XP_006343433.1| PREDICTED: pre-mRNA-processing protein 40A-like isoform X1 [Solanum
            tuberosum]
          Length = 1031

 Score =  873 bits (2256), Expect = 0.0
 Identities = 492/889 (55%), Positives = 584/889 (65%), Gaps = 8/889 (0%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPPSMQFRPAVNPQQGQSFIQ--AASQQFRPIGQ 2765
            M++ P  SG QP  PP +GS  PQ FG   MQFRPA++ QQGQ F    +AS Q+RP+GQ
Sbjct: 1    MASNPPPSGPQPLWPPSVGSTPPQGFGSFPMQFRPALSTQQGQHFAPPISASPQYRPVGQ 60

Query: 2764 GISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPGHAT---QAIQMPYIQQNRPLTSGSPQ 2594
               +   GMP GQ Q  QFSQ MQQ+PPRP Q GH T   QAIQM YIQ      S  PQ
Sbjct: 61   ---TPNAGMPPGQGQIPQFSQTMQQFPPRPGQSGHGTPSSQAIQMSYIQ------SSIPQ 111

Query: 2593 SQQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAPQFQSLPQMQSSNVPVGG 2414
             QQ   PL++ MPG+ G+G P SSSYT   SS                 QM     P GG
Sbjct: 112  PQQVNPPLNSHMPGVSGAGNPFSSSYTVQSSS-----------------QMHGPTFPAGG 154

Query: 2413 QPWLSSGNQSAALISPVQQTGHSSVNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYYN 2234
            Q WLSSG+Q+  + +P   + H    +A  PA+    ++ Q++SDWQE+ +ADGRRYYYN
Sbjct: 155  QTWLSSGSQTTPVAAPTPPSSHQL--SAVAPAVPASTASQQTASDWQEYEAADGRRYYYN 212

Query: 2233 KKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKWT 2054
            K T+QSSWEKP+ELMTP+E          RADASTVWKEFTT +GRKYYYNK TKQSKWT
Sbjct: 213  KNTKQSSWEKPLELMTPLE----------RADASTVWKEFTTADGRKYYYNKETKQSKWT 262

Query: 2053 IPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPVI 1874
            IP+ELK+ARE AE AA Q +Q                           VS T SST+  +
Sbjct: 263  IPDELKLARELAENAAGQVVQT-GTSTNSGVQVSEAVTPAEQPSAVTPVSSTPSSTVSGV 321

Query: 1873 ASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGNT 1694
            ASSP+PVTP V+   +P PL VSGSSAIP     + T+  G+ S  V+ +   AAL    
Sbjct: 322  ASSPVPVTPAVSDVNTP-PLVVSGSSAIPSVSLAV-TSSAGVSSPAVSGSTESAAL---- 375

Query: 1693 VVPDASANASTTSMS-LDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKTP 1517
                  ANA  T MS ++N+S QVASS L G S QDIEEAKKGMAVAGKINV P EEK+ 
Sbjct: 376  ------ANAYQTQMSGIENLSPQVASS-LSGASSQDIEEAKKGMAVAGKINVVPAEEKSA 428

Query: 1516 DEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFNE 1337
            DEEP +YA KQEAK+AFKALLESA+VESDW+WEQ MRVIINDKRYGALKTLGERKQAFNE
Sbjct: 429  DEEPFLYATKQEAKNAFKALLESANVESDWTWEQTMRVIINDKRYGALKTLGERKQAFNE 488

Query: 1336 YLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERPR 1157
            YL QRKK EAEE+RLRQ+KA+EEFTKMLEES+ELTSS RW +AV+MFE+DERF AVER  
Sbjct: 489  YLMQRKKQEAEERRLRQRKAKEEFTKMLEESKELTSSTRWSKAVTMFEDDERFKAVEREA 548

Query: 1156 DREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDER 977
            DREDLF  Y+               +RN +E++QFLE+C FIKV++QWRKVQD LEDDER
Sbjct: 549  DREDLFRNYLVDLQKKERSKAQEEYRRNRLEYKQFLETCGFIKVDTQWRKVQDLLEDDER 608

Query: 976  CLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTLT 797
            C RLEK+ RL +FQ                      RAERKNRD FRK+++EH+AAG LT
Sbjct: 609  CSRLEKLDRLEIFQEYIRDLEKEDEEQRKLQKEQLRRAERKNRDAFRKMIEEHIAAGMLT 668

Query: 796  AKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRARIKDAVKLGK 617
            AKT WRDYC  VK+  AY AV+SNTSGSTPKDLFED+ EELEKQYHED+ R+KD VK  K
Sbjct: 669  AKTSWRDYCQMVKEFVAYQAVASNTSGSTPKDLFEDVTEELEKQYHEDKIRVKDVVKSEK 728

Query: 616  ITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDFT 437
            IT++STWT EDFK AI E +GSP I D+N                           +DFT
Sbjct: 729  ITISSTWTFEDFKVAIFEGIGSPSIHDVNLQLIFEDLVERAKEKEEKEAKKHQRLAKDFT 788

Query: 436  -KLLRSKEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQ 293
             KL   KEI  SS+WE+ K L+EDS E+R+IGEE+  + +FEEY+A LQ
Sbjct: 789  DKLSSIKEITDSSSWEESKELVEDSSEFRAIGEETISRAVFEEYVAWLQ 837


>ref|XP_004242948.1| PREDICTED: pre-mRNA-processing protein 40A-like [Solanum
            lycopersicum]
          Length = 998

 Score =  855 bits (2209), Expect = 0.0
 Identities = 485/889 (54%), Positives = 576/889 (64%), Gaps = 8/889 (0%)
 Frame = -3

Query: 2935 MSNTPQSSGAQPFRPP-IGSMGPQSFGPPSMQFRPAVNPQQGQSFIQ--AASQQFRPIGQ 2765
            M++ P  SG QP  PP +GS  PQ FG   MQFRPA++ QQGQ F    +AS Q+RP+GQ
Sbjct: 1    MASNPPPSGPQPLWPPSVGSTPPQGFGSFPMQFRPALSTQQGQHFAPPISASPQYRPVGQ 60

Query: 2764 GISSSPVGMPSGQSQTLQFSQPMQQYPPRPTQPGHAT---QAIQMPYIQQNRPLTSGSPQ 2594
               +   GMP GQ Q  QFSQ MQQ+PPRP QPGH T   QAIQM Y Q      S   Q
Sbjct: 61   ---TPNAGMPPGQGQIPQFSQTMQQFPPRPGQPGHGTPSSQAIQMSYNQ------SSISQ 111

Query: 2593 SQQTAHPLSNQMPGLVGSGMPLSSSYTFAPSSFGQPQSNVNAPQFQSLPQMQSSNVPVGG 2414
             QQ   PL++ MPG+ G+G P SSSYT   SS                 QM     P GG
Sbjct: 112  PQQVNPPLNSHMPGVSGAGNPFSSSYTVQSSS-----------------QMHGPTFPAGG 154

Query: 2413 QPWLSSGNQSAALISPVQQTGHSSVNAATVPAINGPNSTLQSSSDWQEHTSADGRRYYYN 2234
            QPWLSSG+Q+  +  P   + H  +  A  PA+    ++ Q++SDWQE+ +ADGRRYYYN
Sbjct: 155  QPWLSSGSQTTPVGDPTPPSSHQLL--AVAPAVPASTASQQTASDWQEYEAADGRRYYYN 212

Query: 2233 KKTRQSSWEKPMELMTPIEXXXXXXXXXXRADASTVWKEFTTPEGRKYYYNKVTKQSKWT 2054
            K T+QSSWEKP+ELMTP+E          RADASTVWKEFTT +GRKYYYNK TKQSKWT
Sbjct: 213  KNTKQSSWEKPLELMTPLE----------RADASTVWKEFTTADGRKYYYNKETKQSKWT 262

Query: 2053 IPEELKMAREQAEKAASQGIQLEXXXXXXXXXXXXXXXXXXXXXXAISVSHTTSSTLPVI 1874
            +P+ELK+ARE AE  ASQ +Q                           VS T SST+  +
Sbjct: 263  MPDELKLARELAENVASQVVQT-GTSTNSGVQVSEAVTSTEQPSAVTPVSSTPSSTVSGV 321

Query: 1873 ASSPIPVTPVVAAEPSPSPLAVSGSSAIPVAQPVIATTVVGLHSLPVTVTPLPAALLGNT 1694
             SSP+PVTP V+   +P PL VSGSSAIP     + T+  G+ S  V+     AAL    
Sbjct: 322  PSSPVPVTPAVSDVNTP-PLVVSGSSAIPTVSFAV-TSSAGISSPAVSGNTRSAAL---- 375

Query: 1693 VVPDASANASTTSMS-LDNISSQVASSSLDGTSMQDIEEAKKGMAVAGKINVTPVEEKTP 1517
                  ANA  T MS ++N+S QVASS L G S QDIEEAKKGMAVAGKINV P EEK+ 
Sbjct: 376  ------ANAYQTQMSGIENLSPQVASS-LSGASSQDIEEAKKGMAVAGKINVVPAEEKSA 428

Query: 1516 DEEPLVYANKQEAKSAFKALLESAHVESDWSWEQAMRVIINDKRYGALKTLGERKQAFNE 1337
            DEEP +YA KQEAK AFK+LLESA VESDW+WEQ MRVIINDKRYGALKTLGERKQAFNE
Sbjct: 429  DEEPFLYATKQEAKHAFKSLLESATVESDWTWEQTMRVIINDKRYGALKTLGERKQAFNE 488

Query: 1336 YLGQRKKLEAEEKRLRQKKAREEFTKMLEESQELTSSIRWREAVSMFENDERFTAVERPR 1157
            YL QRKK EAEE+RLRQ+KA+EEFTKMLEES+ELTSS RW +AV+MFE+DERF  VER  
Sbjct: 489  YLMQRKKQEAEERRLRQRKAKEEFTKMLEESKELTSSTRWSKAVTMFEDDERFKGVEREA 548

Query: 1156 DREDLFDGYIXXXXXXXXXXXXXXXKRNVMEFRQFLESCDFIKVNSQWRKVQDRLEDDER 977
            DREDLF  Y+               +RN +E++QFLE+C FIKV++QWRKVQD LEDDER
Sbjct: 549  DREDLFRNYLVDLQKKERSKAQEEYRRNRLEYKQFLETCGFIKVDTQWRKVQDLLEDDER 608

Query: 976  CLRLEKIQRLFVFQXXXXXXXXXXXXXXXXXXXXXXRAERKNRDEFRKLMDEHVAAGTLT 797
            C RLEK+ RL +FQ                      RAERKNRD FRK+++EH+AAG LT
Sbjct: 609  CSRLEKLDRLDIFQEYIRDLEKEDEEQRKLQKEQLRRAERKNRDAFRKMIEEHIAAGMLT 668

Query: 796  AKTHWRDYCMKVKDSTAYLAVSSNTSGSTPKDLFEDIAEELEKQYHEDRARIKDAVKLGK 617
            AKT+WRDY   VK+S AY AV+SNTSGSTPKDLFED+ EELEKQYHED+  +KD VK  K
Sbjct: 669  AKTYWRDYWQMVKESVAYQAVASNTSGSTPKDLFEDVTEELEKQYHEDKIHVKDVVKSEK 728

Query: 616  ITLASTWTLEDFKAAILEDVGSPPISDINTXXXXXXXXXXXXXXXXXXXXXXXXXXEDFT 437
            IT++ T T EDFK AILE + SP I D+N                           +DFT
Sbjct: 729  ITISPTCTFEDFKVAILEGISSPSIQDVNLQLIFEDLVERAKEKEEKEAKKRQRLAKDFT 788

Query: 436  -KLLRSKEIIASSNWEDCKPLLEDSQEYRSIGEESFKKEIFEEYIAHLQ 293
             KL   KEI  SS+WE+ K L+EDS E+R+IGEE+  + +FEEY+A LQ
Sbjct: 789  DKLSSIKEITDSSSWEESKELVEDSSEFRAIGEETISRAVFEEYVAWLQ 837

