BLASTX nr result

ID: Catharanthus22_contig00007966 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00007966
         (2907 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI19367.3| unnamed protein product [Vitis vinifera]              677   0.0  
ref|XP_006352103.1| PREDICTED: pre-mRNA-processing protein 40A-l...   662   0.0  
ref|XP_002283496.2| PREDICTED: pre-mRNA-processing factor 40 hom...   660   0.0  
ref|XP_006352104.1| PREDICTED: pre-mRNA-processing protein 40A-l...   660   0.0  
ref|XP_004250825.1| PREDICTED: pre-mRNA-processing protein 40A-l...   654   0.0  
ref|XP_006486888.1| PREDICTED: pre-mRNA-processing protein 40A-l...   649   0.0  
gb|EOY15661.1| Pre-mRNA-processing protein 40A isoform 1 [Theobr...   646   0.0  
ref|XP_002522113.1| protein binding protein, putative [Ricinus c...   644   0.0  
ref|XP_006422754.1| hypothetical protein CICLE_v100277412mg, par...   642   0.0  
gb|EOY15663.1| Pre-mRNA-processing protein 40A isoform 3 [Theobr...   638   e-180
gb|EMJ28229.1| hypothetical protein PRUPE_ppa000697mg [Prunus pe...   638   e-180
gb|EOY15665.1| Pre-mRNA-processing protein 40A isoform 5 [Theobr...   634   e-179
ref|XP_006422757.1| hypothetical protein CICLE_v10027732mg [Citr...   632   e-178
ref|XP_006422756.1| hypothetical protein CICLE_v10027732mg [Citr...   632   e-178
ref|XP_006827042.1| hypothetical protein AMTR_s00010p00227470 [A...   630   e-177
ref|XP_006486884.1| PREDICTED: pre-mRNA-processing protein 40A-l...   627   e-177
ref|XP_004496865.1| PREDICTED: pre-mRNA-processing protein 40A-l...   622   e-175
ref|XP_006606005.1| PREDICTED: pre-mRNA-processing protein 40A-l...   622   e-175
ref|XP_006589614.1| PREDICTED: pre-mRNA-processing protein 40A-l...   621   e-175
ref|XP_003535678.1| PREDICTED: pre-mRNA-processing protein 40A-l...   621   e-175

>emb|CBI19367.3| unnamed protein product [Vitis vinifera]
          Length = 1030

 Score =  677 bits (1748), Expect = 0.0
 Identities = 407/888 (45%), Positives = 499/888 (56%), Gaps = 30/888 (3%)
 Frame = +3

Query: 3    FPQQMTQLPTRPG--ADVMPQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGL 176
            F Q M QLP RP     + P SQ IP+P  QQ+R                  ++PG AG 
Sbjct: 78   FSQAMQQLPPRPNQPGPIAPSSQPIPMPYIQQNRPLTSSSPQPNQTAPPLNSHMPGLAGP 137

Query: 177  GVPXXXXXXXXXXXAGQQQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFTPA 356
            G+P            GQ Q+  +++ Q+QPISQ      PV GQPW S+G+QS    TP 
Sbjct: 138  GMPFSSSYTFAPASFGQPQSTINASAQFQPISQMHA---PVGGQPWLSSGSQSGALVTPV 194

Query: 357  QQTGEQSS-TAINDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLELMTP 533
             Q G+Q S TA   A   P    +  S W EHT+ +G++YYYN+ T++SSWEKPLELMTP
Sbjct: 195  HQAGQQPSVTADIPAGNVPNPTHQSSSDWQEHTSADGRRYYYNKKTRLSSWEKPLELMTP 254

Query: 534  IERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQ---- 701
            IERADAST W+E T P+GR YYYNKVTKQSKW IP+ELKLARE            +    
Sbjct: 255  IERADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPEELKLAREQAEKSVSQETQSEMGTT 314

Query: 702  -------AVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXX 860
                   AV   ++ +TA VS+S   +S ++   SS      V +++   PV        
Sbjct: 315  SNEPAVVAVSLAETPSTASVSVSSTTSSTISGMTSSPVPVTPVVAVVNPPPVVVSGTSAI 374

Query: 861  XXXXXXXXXXAV----AMMDPXXXXXXXXXXXXXXXXDANITSATE-KSFTINSSDTLLA 1025
                      AV    +M  P                + N TS T  ++ + ++++    
Sbjct: 375  PIAQSAVTTSAVGVQPSMGTPLPAAVSGSTGVAAAFINPNATSMTSFENLSADATNGASM 434

Query: 1026 QDAVTSVVGVSPGNAEKEAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKALLEMANV 1205
            QD   +  GV        AV G       EEK ++  P+VY  KLEAKNAFKALLE ANV
Sbjct: 435  QDIEEAKKGV--------AVAGKINVTPLEEKTLDDEPLVYSTKLEAKNAFKALLESANV 486

Query: 1206 GSDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRK 1385
             SDW WDQAM+AIINDKRYGAL+TLGERKQAFNE++G                   +F  
Sbjct: 487  ESDWTWDQAMKAIINDKRYGALKTLGERKQAFNEYLGQRKKIEAEERRMRQKKAREEFTT 546

Query: 1386 MLEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHK 1565
            MLEE KELTSS +WSK +  F+DDERFKAVER RDRE+LFEN+I EL++KER KALEE K
Sbjct: 547  MLEECKELTSSIKWSKAVDMFQDDERFKAVERSRDREDLFENFIMELQKKERTKALEEQK 606

Query: 1566 RYRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXX 1745
            R R+EY +FL+SCDFIK NSQWRKVQDRLE DERCSRLEKIDRLEIFQEY RD       
Sbjct: 607  RNRMEYRQFLESCDFIKVNSQWRKVQDRLEDDERCSRLEKIDRLEIFQEYIRDLEREEEE 666

Query: 1746 XXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTS 1925
                        ERKNRDEFRKLMEEHV+AG LT+KTHWRDYCMKVKDS  Y+AVASNTS
Sbjct: 667  QRKIQKEQLRRAERKNRDEFRKLMEEHVAAGTLTAKTHWRDYCMKVKDSSPYLAVASNTS 726

Query: 1926 GSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVS 2105
            GST KDLFEDV EELEKQ+ EDKA+IKDAMK  +V + ST TF DFK AIL D     +S
Sbjct: 727  GSTPKDLFEDVAEELEKQYHEDKARIKDAMKLSKVTIASTWTFGDFKAAILDDVGSPNIS 786

Query: 2106 DYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDE-- 2279
            D N+K+VF+                     DD  + + + K+IT SS WED K L +E  
Sbjct: 787  DVNLKLVFEELLDRIKEKEEKEAKKRQRLADDFNDLLRSKKEITASSNWEDCKPLFEESQ 846

Query: 2280 --RFVGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRSEENR 2453
              R +GEESF REIF++ I                                  +R E++R
Sbjct: 847  EYRSIGEESFGREIFEEYIAHLQEKAKEKERKREEEKAKKEKEREEKEKRKEKERKEKDR 906

Query: 2454 -------KRKDRYNKXXXXXXXXXXXYGGYEDERRSGKDRSRDFRSRH 2576
                   K + R ++           YG  ED++R  KD+ R  R RH
Sbjct: 907  DREREKGKERSRKDETESENVDVTGSYGYKEDKKRE-KDKDRKHRKRH 953


>ref|XP_006352103.1| PREDICTED: pre-mRNA-processing protein 40A-like isoform X1 [Solanum
            tuberosum]
          Length = 1003

 Score =  662 bits (1708), Expect = 0.0
 Identities = 402/895 (44%), Positives = 503/895 (56%), Gaps = 12/895 (1%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVMPQSQAIPVP-DFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGLG 179
            FPQ M Q+  RP       SQ  P P DFQ++                   ++PG  G  
Sbjct: 71   FPQPMQQVAGRPVVGGHSMSQGPPNPHDFQRN-------------PPMSNNHMPGSGGPS 117

Query: 180  VPXXXXXXXXXXXAGQQQTNTDSAT-QYQPISQTTISSFPVEGQPWASAGNQSITTFTPA 356
             P           +   Q N DS++ QYQ  +Q     FP   QPW    N ++ + T  
Sbjct: 118  FPLS---------SSYNQVNADSSSSQYQ--TQIHDHRFPSGVQPWMPTSNHNVNSATTM 166

Query: 357  QQTGEQSSTAI-NDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLELMTP 533
            Q TGE ++  +  +A  + ++ E  PS WIEHT+RNGKKYYYNR T++SSWEKPLELMT 
Sbjct: 167  QNTGELAAPLVLPEANNRVDSAETTPSDWIEHTSRNGKKYYYNRRTRISSWEKPLELMTE 226

Query: 534  IERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQAVKD 713
            +ERADASTDWRE T P GR YYYNKVT++SKW++PDE+KLARE               KD
Sbjct: 227  MERADASTDWREFTSPAGRKYYYNKVTRKSKWKMPDEVKLARE---------------KD 271

Query: 714  VDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXXXXXXXA 893
              S A+   S+S + TS    D S    Q  + S I V+PV                  +
Sbjct: 272  TISHASDFGSISSIKTSSPGADGSFVSAQGAMTSPIAVSPVANLPAIVASESSSLSGKVS 331

Query: 894  VAMMDPXXXXXXXXXXXXXXXXDANITSATEKSFTIN---SSDTLLAQDAVTSVVGVSPG 1064
               +D                    I  A     ++    SS+T  AQDAV    GVSP 
Sbjct: 332  SPTIDAVEMQNSSEPASPAVANSEKIGIAVTLGNSVTIPVSSETTSAQDAVACGNGVSPE 391

Query: 1065 NAEK----EAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKALLEMANVGSDWNWDQA 1232
            N E+     A+ G   +  SEEK VE GP+VYE+K+EAKNAFK LLE AN+GSD  WDQA
Sbjct: 392  NREEVKQDAAITGIGSATPSEEKTVELGPLVYESKVEAKNAFKTLLESANIGSDCTWDQA 451

Query: 1233 MRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRKMLEESKELT 1412
            MRAIIND+RYGAL++LGERKQAFNE++                    DFR MLE+ KEL+
Sbjct: 452  MRAIINDRRYGALKSLGERKQAFNEYLSQRKKLEAEERRVKQKKAREDFRIMLEDCKELS 511

Query: 1413 SSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHKRYRVEYLEF 1592
             SSRWSK IS FE DERFKAVER +DRE+LFE+Y +ELE+KERA+ALEE KR RVEYLEF
Sbjct: 512  PSSRWSKAISIFELDERFKAVERAKDREDLFEDYKEELEKKERARALEEQKRNRVEYLEF 571

Query: 1593 LKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXXXXXXXXXXX 1772
            LKSCDFIKA+SQWRKVQDRLEADERC RLEKIDRLEIFQEY RD                
Sbjct: 572  LKSCDFIKASSQWRKVQDRLEADERCPRLEKIDRLEIFQEYIRDLEREEEEQRKLRMEEL 631

Query: 1773 XXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTSGSTAKDLFE 1952
               ERKNRDEFRKLMEEHV+ G+L +KT WRDYC+K+KD  AY+AV+SNTSGS+AKDLF 
Sbjct: 632  RKAERKNRDEFRKLMEEHVAVGMLNAKTIWRDYCIKIKDIAAYLAVSSNTSGSSAKDLFA 691

Query: 1953 DVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVSDYNMKVVFD 2132
            DVV+EL+KQ+L+DK++I+DA++  E  +TST T +DFK AI KD     +SD N+K+VF+
Sbjct: 692  DVVDELDKQYLDDKSRIRDAVRMTENGLTSTWTLDDFKDAIAKDISSPPISDTNLKLVFE 751

Query: 2133 XXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDERFVGEESFFRE 2312
                                 D+ YE +  SK+IT SS+WED K L  +R +G+ES   E
Sbjct: 752  ELLERAREREEKEAKKRKRLADEFYELLHASKEITASSKWEDCKSLFGDRIMGDESLLLE 811

Query: 2313 IFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRSEENRKRKDRYNKXXXXX 2492
            IFDK + E                                 ++ + ++RKDR  K     
Sbjct: 812  IFDKFVNE------------------LKEKAKEKDRKRQEDKARKEKERKDREKKKEKHR 853

Query: 2493 XXXXXXYGGYEDERRSGKDRSRDFRSRHN--DDRKKMKRVXXXXXSRTSGKRL*R 2651
                       D+ R  K R    RS+ +  D  K++KR       R S K + R
Sbjct: 854  ----------RDKHRGDKSRKERERSKKDSTDSDKEIKRSGSDRDKRDSDKEIRR 898


>ref|XP_002283496.2| PREDICTED: pre-mRNA-processing factor 40 homolog B-like [Vitis
            vinifera]
          Length = 1020

 Score =  660 bits (1704), Expect = 0.0
 Identities = 405/889 (45%), Positives = 498/889 (56%), Gaps = 31/889 (3%)
 Frame = +3

Query: 3    FPQQMTQLPTRPG--ADVMPQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGL 176
            F Q M QLP RP     + P SQ IP+P  QQ+R                  ++PG   L
Sbjct: 92   FSQAMQQLPPRPNQPGPIAPSSQPIPMPYIQQNRPLTSSSPQPNQTAPPLNSHMPG---L 148

Query: 177  GVPXXXXXXXXXXXAGQQQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFTPA 356
              P            GQ Q+  +++ Q+QPISQ      PV GQPW S+G+QS    TP 
Sbjct: 149  FAPASF---------GQPQSTINASAQFQPISQMHA---PVGGQPWLSSGSQSGALVTPV 196

Query: 357  QQTGEQSSTAINDAIPK---PETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLELM 527
             Q G+Q S   +  +     P    +  S W EHT+ +G++YYYN+ T++SSWEKPLELM
Sbjct: 197  HQAGQQPSVTADIPVSAGNVPNPTHQSSSDWQEHTSADGRRYYYNKKTRLSSWEKPLELM 256

Query: 528  TPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQAV 707
            TPIERADAST W+E T P+GR YYYNKVTKQSKW IP+ELKLARE            +  
Sbjct: 257  TPIERADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPEELKLAREQAEKSVSQETQSEMG 316

Query: 708  KDVDSQATAPVSLSGV-NTSPVNVDVSSFPCQAGVPSL-IPVAPVDXXXXXXXXXXXXXX 881
               +  A   VSL+   +T+ V+V  ++    +G+ S  +PV PV               
Sbjct: 317  TTSNEPAVVAVSLAETPSTASVSVSSTTSSTISGMTSSPVPVTPV--------------- 361

Query: 882  XXXAVAMMDPXXXXXXXXXXXXXXXXDANITSA--------TEKSFTINSSDTLLAQDAV 1037
                VA+++P                 A  TSA        T     ++ S  + A  + 
Sbjct: 362  ----VAVVNPPPVVVSGTSAIPIAQ-SAVTTSAVGVQPSMGTPLPAAVSGSTGVAANLSA 416

Query: 1038 TSVVGVSPGNAEKEAVIGTQESGKS-----EEKKVEQGPVVYENKLEAKNAFKALLEMAN 1202
             +  G S  + E EA  G   +GK      EEK ++  P+VY  KLEAKNAFKALLE AN
Sbjct: 417  DATNGASMQDIE-EAKKGVAVAGKINVTPLEEKTLDDEPLVYSTKLEAKNAFKALLESAN 475

Query: 1203 VGSDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFR 1382
            V SDW WDQAM+AIINDKRYGAL+TLGERKQAFNE++G                   +F 
Sbjct: 476  VESDWTWDQAMKAIINDKRYGALKTLGERKQAFNEYLGQRKKIEAEERRMRQKKAREEFT 535

Query: 1383 KMLEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEH 1562
             MLEE KELTSS +WSK +  F+DDERFKAVER RDRE+LFEN+I EL++KER KALEE 
Sbjct: 536  TMLEECKELTSSIKWSKAVDMFQDDERFKAVERSRDREDLFENFIMELQKKERTKALEEQ 595

Query: 1563 KRYRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXX 1742
            KR R+EY +FL+SCDFIK NSQWRKVQDRLE DERCSRLEKIDRLEIFQEY RD      
Sbjct: 596  KRNRMEYRQFLESCDFIKVNSQWRKVQDRLEDDERCSRLEKIDRLEIFQEYIRDLEREEE 655

Query: 1743 XXXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNT 1922
                         ERKNRDEFRKLMEEHV+AG LT+KTHWRDYCMKVKDS  Y+AVASNT
Sbjct: 656  EQRKIQKEQLRRAERKNRDEFRKLMEEHVAAGTLTAKTHWRDYCMKVKDSSPYLAVASNT 715

Query: 1923 SGSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAV 2102
            SGST KDLFEDV EELEKQ+ EDKA+IKDAMK  +V + ST TF DFK AIL D     +
Sbjct: 716  SGSTPKDLFEDVAEELEKQYHEDKARIKDAMKLSKVTIASTWTFGDFKAAILDDVGSPNI 775

Query: 2103 SDYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDE- 2279
            SD N+K+VF+                     DD  + + + K+IT SS WED K L +E 
Sbjct: 776  SDVNLKLVFEELLDRIKEKEEKEAKKRQRLADDFNDLLRSKKEITASSNWEDCKPLFEES 835

Query: 2280 ---RFVGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRSEEN 2450
               R +GEESF REIF++ I                                  +R E++
Sbjct: 836  QEYRSIGEESFGREIFEEYIAHLQEKAKEKERKREEEKAKKEKEREEKEKRKEKERKEKD 895

Query: 2451 R-------KRKDRYNKXXXXXXXXXXXYGGYEDERRSGKDRSRDFRSRH 2576
            R       K + R ++           YG  ED++R  KD+ R  R RH
Sbjct: 896  RDREREKGKERSRKDETESENVDVTGSYGYKEDKKRE-KDKDRKHRKRH 943


>ref|XP_006352104.1| PREDICTED: pre-mRNA-processing protein 40A-like isoform X2 [Solanum
            tuberosum]
          Length = 1003

 Score =  660 bits (1703), Expect = 0.0
 Identities = 401/895 (44%), Positives = 502/895 (56%), Gaps = 12/895 (1%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVMPQSQAIPVP-DFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGLG 179
            FPQ M Q+  RP       SQ  P P DFQ++                   ++PG  G  
Sbjct: 72   FPQPMQQVAGRPVVGGHSMSQGPPNPHDFQRN-------------PPMSNNHMPGSGGPS 118

Query: 180  VPXXXXXXXXXXXAGQQQTNTDSAT-QYQPISQTTISSFPVEGQPWASAGNQSITTFTPA 356
             P           +     N DS++ QYQ  +Q     FP   QPW    N ++ + T  
Sbjct: 119  FPL----------SSSYNVNADSSSSQYQ--TQIHDHRFPSGVQPWMPTSNHNVNSATTM 166

Query: 357  QQTGEQSSTAI-NDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLELMTP 533
            Q TGE ++  +  +A  + ++ E  PS WIEHT+RNGKKYYYNR T++SSWEKPLELMT 
Sbjct: 167  QNTGELAAPLVLPEANNRVDSAETTPSDWIEHTSRNGKKYYYNRRTRISSWEKPLELMTE 226

Query: 534  IERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQAVKD 713
            +ERADASTDWRE T P GR YYYNKVT++SKW++PDE+KLARE               KD
Sbjct: 227  MERADASTDWREFTSPAGRKYYYNKVTRKSKWKMPDEVKLARE---------------KD 271

Query: 714  VDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXXXXXXXA 893
              S A+   S+S + TS    D S    Q  + S I V+PV                  +
Sbjct: 272  TISHASDFGSISSIKTSSPGADGSFVSAQGAMTSPIAVSPVANLPAIVASESSSLSGKVS 331

Query: 894  VAMMDPXXXXXXXXXXXXXXXXDANITSATEKSFTIN---SSDTLLAQDAVTSVVGVSPG 1064
               +D                    I  A     ++    SS+T  AQDAV    GVSP 
Sbjct: 332  SPTIDAVEMQNSSEPASPAVANSEKIGIAVTLGNSVTIPVSSETTSAQDAVACGNGVSPE 391

Query: 1065 NAEK----EAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKALLEMANVGSDWNWDQA 1232
            N E+     A+ G   +  SEEK VE GP+VYE+K+EAKNAFK LLE AN+GSD  WDQA
Sbjct: 392  NREEVKQDAAITGIGSATPSEEKTVELGPLVYESKVEAKNAFKTLLESANIGSDCTWDQA 451

Query: 1233 MRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRKMLEESKELT 1412
            MRAIIND+RYGAL++LGERKQAFNE++                    DFR MLE+ KEL+
Sbjct: 452  MRAIINDRRYGALKSLGERKQAFNEYLSQRKKLEAEERRVKQKKAREDFRIMLEDCKELS 511

Query: 1413 SSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHKRYRVEYLEF 1592
             SSRWSK IS FE DERFKAVER +DRE+LFE+Y +ELE+KERA+ALEE KR RVEYLEF
Sbjct: 512  PSSRWSKAISIFELDERFKAVERAKDREDLFEDYKEELEKKERARALEEQKRNRVEYLEF 571

Query: 1593 LKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXXXXXXXXXXX 1772
            LKSCDFIKA+SQWRKVQDRLEADERC RLEKIDRLEIFQEY RD                
Sbjct: 572  LKSCDFIKASSQWRKVQDRLEADERCPRLEKIDRLEIFQEYIRDLEREEEEQRKLRMEEL 631

Query: 1773 XXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTSGSTAKDLFE 1952
               ERKNRDEFRKLMEEHV+ G+L +KT WRDYC+K+KD  AY+AV+SNTSGS+AKDLF 
Sbjct: 632  RKAERKNRDEFRKLMEEHVAVGMLNAKTIWRDYCIKIKDIAAYLAVSSNTSGSSAKDLFA 691

Query: 1953 DVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVSDYNMKVVFD 2132
            DVV+EL+KQ+L+DK++I+DA++  E  +TST T +DFK AI KD     +SD N+K+VF+
Sbjct: 692  DVVDELDKQYLDDKSRIRDAVRMTENGLTSTWTLDDFKDAIAKDISSPPISDTNLKLVFE 751

Query: 2133 XXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDERFVGEESFFRE 2312
                                 D+ YE +  SK+IT SS+WED K L  +R +G+ES   E
Sbjct: 752  ELLERAREREEKEAKKRKRLADEFYELLHASKEITASSKWEDCKSLFGDRIMGDESLLLE 811

Query: 2313 IFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRSEENRKRKDRYNKXXXXX 2492
            IFDK + E                                 ++ + ++RKDR  K     
Sbjct: 812  IFDKFVNE------------------LKEKAKEKDRKRQEDKARKEKERKDREKKKEKHR 853

Query: 2493 XXXXXXYGGYEDERRSGKDRSRDFRSRHN--DDRKKMKRVXXXXXSRTSGKRL*R 2651
                       D+ R  K R    RS+ +  D  K++KR       R S K + R
Sbjct: 854  ----------RDKHRGDKSRKERERSKKDSTDSDKEIKRSGSDRDKRDSDKEIRR 898


>ref|XP_004250825.1| PREDICTED: pre-mRNA-processing protein 40A-like [Solanum
            lycopersicum]
          Length = 1044

 Score =  654 bits (1687), Expect = 0.0
 Identities = 400/890 (44%), Positives = 499/890 (56%), Gaps = 23/890 (2%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVMPQSQAIPVP-DFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGLG 179
            FPQ M Q+  RP       SQ  P P DFQ++                   ++ G  G G
Sbjct: 71   FPQPMQQVAGRPVVGGHNMSQGPPNPHDFQRN-------------PPMSNNHMTGSGGPG 117

Query: 180  VPXXXXXXXXXXXAGQQQTNTDSAT-QYQPISQTTISSFPVEGQPWASAGNQSITTFTPA 356
             P           +   Q N DS++ QYQ  +Q     FP   QPW    NQ++ + T  
Sbjct: 118  FPLS---------SSYNQVNADSSSSQYQ--TQIHDHRFPSGVQPWMPTSNQNVNSATTM 166

Query: 357  QQTGEQSSTAINDAIPKPETG----EKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLEL 524
            Q TGE ++  +   +P+   G    E  PS WIEHT+RNGKKYYYNR T++SSWEKPLEL
Sbjct: 167  QNTGELAAPLV---VPEANNGVDSVETTPSDWIEHTSRNGKKYYYNRRTRISSWEKPLEL 223

Query: 525  MTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQA 704
            MT +ERADASTDWRE T P GR YYYNKVT++S W++PDE+KLARE              
Sbjct: 224  MTEMERADASTDWREFTSPAGRKYYYNKVTRKSNWKMPDEVKLARE-------------- 269

Query: 705  VKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXXXXX 884
             K  DS A+   S+S V TS    D S    Q  + S I V+PV                
Sbjct: 270  -KHTDSHASDFGSISSVKTSSPGADGSFVSAQGAMTSPIAVSPVANLPTIVASESSLSGK 328

Query: 885  XX-----AVAMMDPXXXXXXXXXXXXXXXXDANITSATEKSFTINS-SDTLLAQDAVTSV 1046
                   AV M +                    IT     S  I + S+T  AQD V   
Sbjct: 329  LSSPTVDAVEMQNSSEPASPAVANSEKI----GITVTLGNSVVIPARSETTSAQDEVACD 384

Query: 1047 VGVSPGNAEK----EAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKALLEMANVGSD 1214
             G SP N E+     A+ G   +  SEEK VE GP+VYE+K+EAKNAF+ LLE AN+GSD
Sbjct: 385  DGASPDNREEVKHVAAITGIGSAAPSEEKTVELGPLVYESKVEAKNAFRTLLESANIGSD 444

Query: 1215 WNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRKMLE 1394
              WDQAMRAIIND+RYGAL++LGERKQAFNE++                    DFR MLE
Sbjct: 445  CTWDQAMRAIINDRRYGALKSLGERKQAFNEYLSQRKKLEAEERRVKQKKAREDFRIMLE 504

Query: 1395 ESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHKRYR 1574
            + KEL+ SSRWSK+IS FE DERFKAVER +DRE+LFE+Y +ELE+KERA+ALEE KR R
Sbjct: 505  DCKELSPSSRWSKVISIFEHDERFKAVERAKDREDLFEDYKEELEKKERARALEEQKRNR 564

Query: 1575 VEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXXXXX 1754
            VEYLEFLKSCDFIKA+SQWRKVQDRLEADERC RLEKIDRLEIFQEY RD          
Sbjct: 565  VEYLEFLKSCDFIKASSQWRKVQDRLEADERCPRLEKIDRLEIFQEYIRDLEREEEEQRK 624

Query: 1755 XXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTSGST 1934
                     ERKNRDEFRKLMEEHV+ G+L +KT WRDYC+K+KD  AY AV+SNTSGS+
Sbjct: 625  LRMEELRKAERKNRDEFRKLMEEHVAVGMLNAKTIWRDYCIKIKDIAAYQAVSSNTSGSS 684

Query: 1935 AKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVSDYN 2114
            AKDLF DVV+EL+KQ+L+DK++I+DA++  E+ +TST T +DFK AI K      +SD N
Sbjct: 685  AKDLFADVVDELDKQYLDDKSRIRDAVRMTEIGLTSTWTLDDFKDAIAKYISSPPISDTN 744

Query: 2115 MKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDERFVGE 2294
            +K+VF+                     D+ YE +  SK+IT SS+WED K L  +R  G+
Sbjct: 745  LKLVFEELLERAREREEKEAKKRKRLADEFYELLHASKEITASSKWEDCKSLFGDRITGD 804

Query: 2295 ESFFREIFDKVILE-----HXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRSEENRKR 2459
            ES   EIFDK + E                                      R +++RK 
Sbjct: 805  ESLLLEIFDKFVNELKEKAKEKDRKRQEDKARKEKERKDREKKKEKHRRDKHRGDKSRKE 864

Query: 2460 KDRYNKXXXXXXXXXXXYGGYEDERRSGKD--RSRDFRSRHNDDRKKMKR 2603
            ++R  +            G   D+R S K+  RS D + + +D  K++KR
Sbjct: 865  RERSKRDSSDSDKEIRRSGSDRDKRDSDKEIKRSGDKKIKRSDSDKEIKR 914


>ref|XP_006486888.1| PREDICTED: pre-mRNA-processing protein 40A-like [Citrus sinensis]
          Length = 1001

 Score =  649 bits (1673), Expect = 0.0
 Identities = 383/889 (43%), Positives = 501/889 (56%), Gaps = 24/889 (2%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVM----PQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFA 170
            FPQ M QLP RPG        P  Q +P+P+ QQS H                 Y  G  
Sbjct: 55   FPQLMHQLPARPGQPAPSHGPPPPQVVPLPNAQQSNHIASGSSLPQANVQAPTNYASGLG 114

Query: 171  GLGVPXXXXXXXXXXXAGQQQ--TNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITT 344
            GL  P            GQ Q   N ++  QYQP+SQ  + S P  GQ   S    S +T
Sbjct: 115  GLARPFSASYTFAPSSYGQPQGTVNVNTGNQYQPMSQMHVPSNPAGGQLGVSI---SQST 171

Query: 345  FTPAQQTGEQ--SSTAINDAIP-KPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKP 515
             TP Q T EQ  ++TA   A   +P++ E   + WIEHTA +G++YYYN+ T+ S+W+KP
Sbjct: 172  STPLQHTNEQVAANTASTMASTFQPKSAEVAQTDWIEHTAADGRRYYYNKRTRQSTWDKP 231

Query: 516  LELMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXX 695
            LELMTPIERADA++DW+E T PDGR YYYNKVTKQSKW IPDELKLARE           
Sbjct: 232  LELMTPIERADAASDWKEFTSPDGRKYYYNKVTKQSKWSIPDELKLAREQAERASTKGTQ 291

Query: 696  XQAVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXX 875
             +A  ++ +  + P S   V  SP N D+SS   Q    S + V P+             
Sbjct: 292  SEASPNLQTSNSVPSS--AVTASP-NADISSSTVQVVASSPVSVVPIIAASSIQPAMVSA 348

Query: 876  XXXXXAVAMMDPXXXXXXXXXXXXXXXXDANITSATEKSFTINS-------SDTLLAQDA 1034
                  +A                      +++S+   + T+N+       S  L A + 
Sbjct: 349  SSASPVIASSVAVSADGIQTTVDALTPM-TSVSSSVGDAVTVNTDTETKNYSSNLSASNV 407

Query: 1035 VTSVVGVSPGNAE---KEAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKALLEMANV 1205
            V + V V     E   K+AV G +   + EEK V Q  + Y NKLEAKNAFKALLE ANV
Sbjct: 408  VAAAVEVPAQETEEMRKDAVTGEKIGDELEEKTVGQEHLAYANKLEAKNAFKALLESANV 467

Query: 1206 GSDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRK 1385
            GSDW+WDQAM+AIIND+RYGAL+TLGERKQAFNE++G                   D++K
Sbjct: 468  GSDWSWDQAMQAIINDRRYGALKTLGERKQAFNEYLGQRKKQEAEERRFKLKKAREDYKK 527

Query: 1386 MLEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHK 1565
            MLEES ELTSS+RWSK ++ FE+DERFKA++R+RDR +LF+++++EL +KERAKA EE +
Sbjct: 528  MLEESVELTSSTRWSKAVTMFENDERFKALDRERDRRDLFDDHLEELRQKERAKAQEERR 587

Query: 1566 RYRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXX 1745
            ++ +EY +FL+SCDFIKA++QWRKVQDRLEADERCSRLEKIDRLEIF+EY  D       
Sbjct: 588  QHLIEYRQFLESCDFIKASTQWRKVQDRLEADERCSRLEKIDRLEIFKEYIIDLEKEEEE 647

Query: 1746 XXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTS 1925
                        ERKNRDEFRKL+E  V++G LT+KTHWRDYCMKVKD  AY+AVASNTS
Sbjct: 648  QRKIQKEVLRRAERKNRDEFRKLLEGDVASGTLTAKTHWRDYCMKVKDLHAYMAVASNTS 707

Query: 1926 GSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVS 2105
            GST KDLFEDV EEL+KQ+ EDK +IKDA+K  ++ ++ST TFEDFK +IL+D     +S
Sbjct: 708  GSTPKDLFEDVAEELQKQYQEDKTRIKDAVKLKKISLSSTWTFEDFKASILEDVTSPPIS 767

Query: 2106 DYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDE-- 2279
            D N+K+VFD                     DD +  + + K+I+ SS WED  +L +   
Sbjct: 768  DVNIKLVFDDLLERVKEKEEKEAKKRKRLADDFFALLCSIKEISASSAWEDCIQLFEGSR 827

Query: 2280 --RFVGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRSEENR 2453
                +GEES  REIFD+ + +                                Q  E++R
Sbjct: 828  EFSSIGEESICREIFDEYVTQLKEQAKENERKRKEEKSKKEKEREDRDRKKQKQGREKDR 887

Query: 2454 KR-KDRYNKXXXXXXXXXXXYGGYEDERRSGKDRSRDFRSRHNDDRKKM 2597
             R +++ +                 + +RSGKD  +  R RH+  +  +
Sbjct: 888  AREREKEDHSKKDGAESDHDDSAEYENKRSGKDSDKKHRKRHHSGQDSL 936


>gb|EOY15661.1| Pre-mRNA-processing protein 40A isoform 1 [Theobroma cacao]
            gi|508723765|gb|EOY15662.1| Pre-mRNA-processing protein
            40A isoform 1 [Theobroma cacao]
          Length = 1032

 Score =  646 bits (1666), Expect = 0.0
 Identities = 397/918 (43%), Positives = 494/918 (53%), Gaps = 38/918 (4%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVM--PQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGL 176
            F Q M Q P RP    +  P +Q + VP  Q +R                  ++PG    
Sbjct: 77   FSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQSHQTAPPLNSHMPGLGAP 136

Query: 177  GVPXXXXXXXXXXXAGQQQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFTPA 356
            G+P            GQ Q N  +++Q+QP SQ   S  PV GQPW S+GNQS++   P 
Sbjct: 137  GMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAGQPWLSSGNQSVSLAIPI 196

Query: 357  QQTGEQ------SSTAINDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPL 518
            QQTG+Q      + TA N  I  P +     S W EHT+ +G++YYYN+ T+ SSWEKPL
Sbjct: 197  QQTGQQPPLISSADTAANAPIHTPPSA----SDWQEHTSADGRRYYYNKKTRQSSWEKPL 252

Query: 519  ELMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXX 698
            ELMTPIERADAST W+E T P+GR YYYNKVTKQSKW IP+ELKLARE            
Sbjct: 253  ELMTPIERADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPEELKLAREQAQVVASQGAPS 312

Query: 699  QAVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVP-SLIPVAPVDXXXXXXXXXXXX 875
                 V SQA    ++S        + VSS   QA  P S+ PVA V             
Sbjct: 313  DT--GVASQAPVAGAVSSAEMPAAAIPVSSNTSQASSPVSVTPVAAVANPSPTLVSGSTV 370

Query: 876  XXXXXAVAM----MDPXXXXXXXXXXXXXXXXDANITSATEKSFTINSSDTLLAQDAVTS 1043
                 + A     +                     +TS    +  I S ++  +QD+V  
Sbjct: 371  VPVSQSAATNASEVQSPAVAVTPLPAVSSGGSTTPVTSVNANTTMIRSLESTASQDSVHF 430

Query: 1044 VVGVSPGNAEKEAVIGTQESGKS-----EEKKVEQGPVVYENKLEAKNAFKALLEMANVG 1208
              G S  + E EA  G   +GK      EEK  +  P+VY NK EAKNAFK+LLE ANV 
Sbjct: 431  TNGASAQDIE-EAKKGMATAGKVNVTPVEEKVPDDEPLVYANKQEAKNAFKSLLESANVQ 489

Query: 1209 SDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRKM 1388
            SDW W+Q MR IINDKRYGAL+TLGERKQAFNE++G                   +F KM
Sbjct: 490  SDWTWEQTMREIINDKRYGALKTLGERKQAFNEYLGQRKKLEAEERRMRQKKAREEFTKM 549

Query: 1389 LEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHKR 1568
            LEESKELTSS RWSK  S FE+DERFKAVER RDRE+LFENYI ELERKER  A EE +R
Sbjct: 550  LEESKELTSSMRWSKAQSLFENDERFKAVERARDREDLFENYIVELERKERENAAEEKRR 609

Query: 1569 YRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXXX 1748
               EY +FL+SCDFIKANSQWRKVQDRLE DERCSRLEKIDRL +FQ+Y  D        
Sbjct: 610  NIAEYRKFLESCDFIKANSQWRKVQDRLEDDERCSRLEKIDRLVMFQDYIHDLEKEEEEK 669

Query: 1749 XXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTSG 1928
                       ERKNRD FRKLM+EHV  G LT+KT+WRDYC+KVKD P Y+AVASNTSG
Sbjct: 670  KKMQKEQLRRAERKNRDAFRKLMDEHVVDGTLTAKTYWRDYCLKVKDLPPYLAVASNTSG 729

Query: 1929 STAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVSD 2108
            ST KDLFEDVVEELEKQ+ +DK  IKDAMK+G++ + ST T EDFK AI +D     +SD
Sbjct: 730  STPKDLFEDVVEELEKQYQQDKTHIKDAMKSGKISMVSTWTVEDFKAAISEDVGSLPISD 789

Query: 2109 YNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDE--- 2279
             N+K+V++                     DD  + +   K+IT SS WEDS+ L +E   
Sbjct: 790  INLKLVYEELLKSAKEKEEKEAKKRQRLADDFTKLLHTYKEITASSDWEDSRPLFEESQE 849

Query: 2280 -RFVGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRSEENRK 2456
             R + EES  REIF++ I                                  +R E+ R+
Sbjct: 850  YRSIAEESLRREIFEEYIAYLQEKAKEKERKREEEKAKKEKEREEKEKRKEKERKEKERE 909

Query: 2457 R-----KDRYNKXXXXXXXXXXXYG-GYEDERRSGKDRSRDFRSRH----------NDDR 2588
            R     K+R  K              G++++++  K++ R  R RH           DDR
Sbjct: 910  REREKGKERTKKDETDSENLDISDSHGHKEDKKKEKEKDRKHRKRHQSGGDDGSSDKDDR 969

Query: 2589 KKMKRVXXXXXSRTSGKR 2642
            ++ K+       R   ++
Sbjct: 970  EESKKSRRHGSDRKKSRK 987


>ref|XP_002522113.1| protein binding protein, putative [Ricinus communis]
            gi|223538712|gb|EEF40313.1| protein binding protein,
            putative [Ricinus communis]
          Length = 956

 Score =  644 bits (1661), Expect = 0.0
 Identities = 368/784 (46%), Positives = 460/784 (58%), Gaps = 6/784 (0%)
 Frame = +3

Query: 3    FPQQMTQLPTRPG--ADVMPQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGL 176
            FP  + Q P+RPG      P SQ I +P+ Q +RH                 Y PG  G 
Sbjct: 58   FPPSVQQFPSRPGQPGHGPPPSQVISLPNAQANRHVTSGSSLPPPSVPTSINYAPGLGGP 117

Query: 177  GVPXXXXXXXXXXXAGQQQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFTPA 356
            G P            GQ     ++ +QYQPISQ    S P  G   +S+ NQSIT  TP 
Sbjct: 118  GAPLSSSYTFVPSSYGQPPVAANTVSQYQPISQMRPPSIPAGGLAGSSSVNQSITPVTPM 177

Query: 357  QQTGEQSSTAINDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLELMTPI 536
            Q  GEQSS   ND  P  +  E+    W EH A NG++YYYN+ T+ SSWEKP ELMTPI
Sbjct: 178  QLNGEQSSVT-NDLHPT-KPNEETTMDWKEHLAANGRRYYYNKRTRQSSWEKPFELMTPI 235

Query: 537  ERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQAVKDV 716
            ERADASTDW+E   P+GRTYYYNK TKQSKW IP+ELKLAR+              + + 
Sbjct: 236  ERADASTDWKEFASPEGRTYYYNKTTKQSKWEIPEELKLARKRLEKASLVEAQADTLANS 295

Query: 717  DSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXXXXXXXAV 896
               A  P S   V+ +P   D SS   Q    S +PV PV                    
Sbjct: 296  HVPAFVPPS---VDKAPSVADASSLTAQVTPSSPVPVTPV-------------------A 333

Query: 897  AMMDPXXXXXXXXXXXXXXXXDANITSATEKSFTINSSDTLLAQDAVTSVVGVSPGNAEK 1076
            A +D                 ++   +    S T NS +    ++ V++V     G +EK
Sbjct: 334  AAVD----------LQSQPASESPGLAVMASSLTSNSDEVQTTENIVSTV----SGRSEK 379

Query: 1077 EAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKALLEMANVGSDWNWDQAMRAIINDK 1256
               IG       EEK V Q P+ Y +KLEAKNAFKALLE A+VGSDW WDQAMR IIND+
Sbjct: 380  VNSIGI------EEKIVSQEPLTYTDKLEAKNAFKALLESASVGSDWTWDQAMRVIINDR 433

Query: 1257 RYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRKMLEESKELTSSSRWSKI 1436
            RYGALRTLGERKQAFNE++                    +F+ MLEESKELTS+ RWSK 
Sbjct: 434  RYGALRTLGERKQAFNEYLSQKKKQDAEERRSKQKKAREEFKNMLEESKELTSTMRWSKA 493

Query: 1437 ISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHKRYRVEYLEFLKSCDFIK 1616
            ++ FE+DERFKAVER+RDR ++F+++++EL  KERAKA EE KR  +EY +FL+SCDFIK
Sbjct: 494  VTLFENDERFKAVERERDRRDIFDSFLQELGDKERAKAQEERKRNIMEYRQFLESCDFIK 553

Query: 1617 ANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXXXXXXXXXXXXXTERKNR 1796
            A++QWRKVQDRLEADERCSRLEKIDRLEIFQ+Y RD                   ERKNR
Sbjct: 554  ASTQWRKVQDRLEADERCSRLEKIDRLEIFQDYLRDLEKEEEEQRKIQKEEQRKAERKNR 613

Query: 1797 DEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTSGSTAKDLFEDVVEELEK 1976
            DEFRKL+EEHV+AG +T+KTHWRDY +KVKD PAY+AVASNTSGST KDLFEDV+EELEK
Sbjct: 614  DEFRKLLEEHVAAGTMTAKTHWRDYYLKVKDLPAYLAVASNTSGSTPKDLFEDVLEELEK 673

Query: 1977 QFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVSDYNMKVVFDXXXXXXXX 2156
            Q+ EDK++IKDA+K  +V + ST T +D K AI++D    ++SD N+K+VFD        
Sbjct: 674  QYHEDKSRIKDAVKLKKVAMASTWTLDDLKAAIVEDISSPSISDMNLKIVFDELLERAKE 733

Query: 2157 XXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDE----RFVGEESFFREIFDK 2324
                         DD    + ++KDIT SS+WE  K L +       + EES  ++IF++
Sbjct: 734  KEEKDAKKRKRLADDFLNLLHSTKDITASSKWESCKELFEGSREFSSINEESICQDIFEE 793

Query: 2325 VILE 2336
             I +
Sbjct: 794  YIAQ 797


>ref|XP_006422754.1| hypothetical protein CICLE_v100277412mg, partial [Citrus clementina]
            gi|557524688|gb|ESR35994.1| hypothetical protein
            CICLE_v100277412mg, partial [Citrus clementina]
          Length = 864

 Score =  642 bits (1655), Expect = 0.0
 Identities = 370/801 (46%), Positives = 476/801 (59%), Gaps = 23/801 (2%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVM----PQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFA 170
            FPQ M QLP RPG        P  Q +P+P+ QQS H                 Y     
Sbjct: 55   FPQLMHQLPARPGQPAPSHGPPPPQVVPLPNAQQSNHIASGSSLPQANVQAPTSYASSLG 114

Query: 171  GLGVPXXXXXXXXXXXAGQQQ--TNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITT 344
            GL  P            GQ Q   N ++  QYQP+SQ  + S P  GQ   S    S +T
Sbjct: 115  GLARPFSASYTFAPSSYGQPQGTVNVNTGNQYQPMSQMHVPSNPAGGQLGVSI---SQST 171

Query: 345  FTPAQQTGEQ--SSTAINDAIP-KPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKP 515
             TP Q T EQ  ++TA   A   +P++ E   + WIEHTA +G++YYYN+ T+ S+W+KP
Sbjct: 172  STPLQHTHEQVAANTAPTMASTFQPKSAEVAQTDWIEHTAADGRRYYYNKRTRQSTWDKP 231

Query: 516  LELMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXX 695
            LELMTPIERADA++DW+E T PDGR YYYNKVTKQSKW IPDELKLARE           
Sbjct: 232  LELMTPIERADAASDWKEFTSPDGRKYYYNKVTKQSKWSIPDELKLAREQAERASTKGTQ 291

Query: 696  XQAVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXX 875
             +A  ++ +  + P S   V  SP N D+SS   Q    S + V P+             
Sbjct: 292  SEASPNLQTSNSVPSS--AVTASP-NADISSSTVQVVASSPVSVVPIIAASSIQPAMVSA 348

Query: 876  XXXXXAVAMMDPXXXXXXXXXXXXXXXXDANITSATEKSFTINS-------SDTLLAQDA 1034
                  +A                      +++S+   + T+N+       S  L A + 
Sbjct: 349  SSASPVIASSVAVSADGIQTTVDALTPM-ISVSSSVGDAVTVNTDTETKNYSSNLPASNV 407

Query: 1035 VTSVVGVSPGNAE---KEAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKALLEMANV 1205
            V + V V     E   K+AV G +   + EEK V Q  + Y NKLEAKNAFKALLE ANV
Sbjct: 408  VAAAVEVPAQETEEMRKDAVTGEKIGDELEEKTVGQEHLAYANKLEAKNAFKALLESANV 467

Query: 1206 GSDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRK 1385
            GSDW+WDQAM+AIIND+RYGAL+TLGERKQAFNE++G                   D++K
Sbjct: 468  GSDWSWDQAMQAIINDRRYGALKTLGERKQAFNEYLGQRKKQEAEERRFKLKKAREDYKK 527

Query: 1386 MLEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHK 1565
            MLEES ELTSS+RWSK ++ FE+DERFKA++R+RDR +LF+++++EL +KERAKA EE +
Sbjct: 528  MLEESVELTSSTRWSKAVTMFENDERFKALDRERDRRDLFDDHLEELRQKERAKAQEERR 587

Query: 1566 RYRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXX 1745
            ++ +EY +FL+SCDFIKA++QWRKVQDRLEADERCSRLEKIDRLEIF+EY  D       
Sbjct: 588  QHLIEYRQFLESCDFIKASTQWRKVQDRLEADERCSRLEKIDRLEIFKEYIIDLEKEEEE 647

Query: 1746 XXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTS 1925
                        ERKNRDEFRKL+E  V++G LT+KTHWRDYCMKVKD  AY+AVASNTS
Sbjct: 648  QRKIQKEVLRRAERKNRDEFRKLLEGDVASGTLTAKTHWRDYCMKVKDLHAYMAVASNTS 707

Query: 1926 GSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVS 2105
            GST KDLFEDV EEL+KQ+ EDK +IKDA+K  ++ ++ST TFEDFK +IL+D     +S
Sbjct: 708  GSTPKDLFEDVAEELQKQYQEDKTRIKDAVKLKKISLSSTWTFEDFKASILEDVTSPPIS 767

Query: 2106 DYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDE-- 2279
            D N+K+VFD                     DD +  + + K+I+ SS WED  +L +   
Sbjct: 768  DVNIKLVFDDLLERVKEKEEKEAKKRKRLADDFFALLCSIKEISASSAWEDCIQLFEGSR 827

Query: 2280 --RFVGEESFFREIFDKVILE 2336
                +GEES  REIFD+ + +
Sbjct: 828  EFSSIGEESICREIFDEYVTQ 848


>gb|EOY15663.1| Pre-mRNA-processing protein 40A isoform 3 [Theobroma cacao]
          Length = 1041

 Score =  638 bits (1646), Expect = e-180
 Identities = 397/927 (42%), Positives = 494/927 (53%), Gaps = 47/927 (5%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVM--PQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGL 176
            F Q M Q P RP    +  P +Q + VP  Q +R                  ++PG    
Sbjct: 77   FSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQSHQTAPPLNSHMPGLGAP 136

Query: 177  GVPXXXXXXXXXXXAGQQQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFTPA 356
            G+P            GQ Q N  +++Q+QP SQ   S  PV GQPW S+GNQS++   P 
Sbjct: 137  GMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAGQPWLSSGNQSVSLAIPI 196

Query: 357  QQTGEQ------SSTAINDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPL 518
            QQTG+Q      + TA N  I  P +     S W EHT+ +G++YYYN+ T+ SSWEKPL
Sbjct: 197  QQTGQQPPLISSADTAANAPIHTPPSA----SDWQEHTSADGRRYYYNKKTRQSSWEKPL 252

Query: 519  ELMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXX 698
            ELMTPIERADAST W+E T P+GR YYYNKVTKQSKW IP+ELKLARE            
Sbjct: 253  ELMTPIERADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPEELKLAREQAQVVASQGAPS 312

Query: 699  QAVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVP-SLIPVAPVDXXXXXXXXXXXX 875
                 V SQA    ++S        + VSS   QA  P S+ PVA V             
Sbjct: 313  DT--GVASQAPVAGAVSSAEMPAAAIPVSSNTSQASSPVSVTPVAAVANPSPTLVSGSTV 370

Query: 876  XXXXXAVAM----MDPXXXXXXXXXXXXXXXXDANITSATEKSFTINSSDTLLAQDAVTS 1043
                 + A     +                     +TS    +  I S ++  +QD+V  
Sbjct: 371  VPVSQSAATNASEVQSPAVAVTPLPAVSSGGSTTPVTSVNANTTMIRSLESTASQDSVHF 430

Query: 1044 VVGVSPGNAEKEAVIGTQESGKS-----EEKKVEQGPVVYENKLEAKNAFKALLEMANVG 1208
              G S  + E EA  G   +GK      EEK  +  P+VY NK EAKNAFK+LLE ANV 
Sbjct: 431  TNGASAQDIE-EAKKGMATAGKVNVTPVEEKVPDDEPLVYANKQEAKNAFKSLLESANVQ 489

Query: 1209 SDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRKM 1388
            SDW W+Q MR IINDKRYGAL+TLGERKQAFNE++G                   +F KM
Sbjct: 490  SDWTWEQTMREIINDKRYGALKTLGERKQAFNEYLGQRKKLEAEERRMRQKKAREEFTKM 549

Query: 1389 LEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHKR 1568
            LEESKELTSS RWSK  S FE+DERFKAVER RDRE+LFENYI ELERKER  A EE +R
Sbjct: 550  LEESKELTSSMRWSKAQSLFENDERFKAVERARDREDLFENYIVELERKERENAAEEKRR 609

Query: 1569 YRVEYLEFLKSCDFIK---------ANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTR 1721
               EY +FL+SCDFIK         ANSQWRKVQDRLE DERCSRLEKIDRL +FQ+Y  
Sbjct: 610  NIAEYRKFLESCDFIKVQHFQKRIQANSQWRKVQDRLEDDERCSRLEKIDRLVMFQDYIH 669

Query: 1722 DXXXXXXXXXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAY 1901
            D                   ERKNRD FRKLM+EHV  G LT+KT+WRDYC+KVKD P Y
Sbjct: 670  DLEKEEEEKKKMQKEQLRRAERKNRDAFRKLMDEHVVDGTLTAKTYWRDYCLKVKDLPPY 729

Query: 1902 IAVASNTSGSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILK 2081
            +AVASNTSGST KDLFEDVVEELEKQ+ +DK  IKDAMK+G++ + ST T EDFK AI +
Sbjct: 730  LAVASNTSGSTPKDLFEDVVEELEKQYQQDKTHIKDAMKSGKISMVSTWTVEDFKAAISE 789

Query: 2082 DSKLSAVSDYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDS 2261
            D     +SD N+K+V++                     DD  + +   K+IT SS WEDS
Sbjct: 790  DVGSLPISDINLKLVYEELLKSAKEKEEKEAKKRQRLADDFTKLLHTYKEITASSDWEDS 849

Query: 2262 KRLVDE----RFVGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2429
            + L +E    R + EES  REIF++ I                                 
Sbjct: 850  RPLFEESQEYRSIAEESLRREIFEEYIAYLQEKAKEKERKREEEKAKKEKEREEKEKRKE 909

Query: 2430 XQRSEENRKR-----KDRYNKXXXXXXXXXXXYG-GYEDERRSGKDRSRDFRSRH----- 2576
             +R E+ R+R     K+R  K              G++++++  K++ R  R RH     
Sbjct: 910  KERKEKEREREREKGKERTKKDETDSENLDISDSHGHKEDKKKEKEKDRKHRKRHQSGGD 969

Query: 2577 -----NDDRKKMKRVXXXXXSRTSGKR 2642
                  DDR++ K+       R   ++
Sbjct: 970  DGSSDKDDREESKKSRRHGSDRKKSRK 996


>gb|EMJ28229.1| hypothetical protein PRUPE_ppa000697mg [Prunus persica]
          Length = 1031

 Score =  638 bits (1645), Expect = e-180
 Identities = 388/894 (43%), Positives = 499/894 (55%), Gaps = 36/894 (4%)
 Frame = +3

Query: 3    FPQQMTQLPTRPG--ADVMPQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGL 176
            F Q M   P RP       P SQA+P+  + Q+R                   +PG AG 
Sbjct: 78   FSQPMQPYPLRPSQPGHATPSSQALPM-QYMQTRPITSAPSQSQQPALPFNNQMPGLAGG 136

Query: 177  GVPXXXXXXXXXXXAGQQQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFTPA 356
            G+P             Q Q N  S++Q+QPISQ   +   V GQPW S+GNQ     TP 
Sbjct: 137  GMPYSSSYIFAPPSYAQPQNNVSSSSQFQPISQVQ-AHVSVTGQPWVSSGNQGAAVPTPV 195

Query: 357  QQTGEQ--SSTAINDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLELMT 530
             Q+G+Q  S+T  + A+  P   ++  S W EHT+ +G++YY+NR TK SSWEKPLELMT
Sbjct: 196  PQSGQQPSSTTFTDSAVNVPSQTQQSSSDWQEHTSGDGRRYYFNRRTKQSSWEKPLELMT 255

Query: 531  PIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQ--- 701
            P+ERADAST W+E+T  DG+ YYYNKVT++SKW IP+ELKLARE            +   
Sbjct: 256  PMERADASTVWKEYTSSDGKKYYYNKVTRESKWTIPEELKLAREQAQRELAQGTRSEMNL 315

Query: 702  ----------AVKDVDSQATAPVS---LSGVNTSPVNV-DVSSFPCQAGVPSLIPVAPVD 839
                      A   + S +  P +   L G+ +SPV V  VSSF      PS  P+AP  
Sbjct: 316  TSHAPPAVASAETPMGSSSVGPSTSSALPGMVSSPVAVIPVSSF----SNPS--PIAPTG 369

Query: 840  XXXXXXXXXXXXXXXXXAVAMMDPXXXXXXXXXXXXXXXX-DANITSATEKSFTINSSDT 1016
                              V +  P                    + +A  KS  +++ + 
Sbjct: 370  SSVASGAQSSITG----GVGIQPPVVTVTPPPASVSGSTGVPPTLVNAITKS--VSTFEN 423

Query: 1017 LLAQDAVTSVVGVSPGNAEKE----AVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKA 1184
            + +QD  ++  G    + E+     AV G      SEEK V++ P+VY +K EAKNAFKA
Sbjct: 424  VTSQDIGSADDGAFTQDIEEAKRGMAVAGKVNVTPSEEKTVDEEPLVYASKQEAKNAFKA 483

Query: 1185 LLEMANVGSDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXX 1364
            LLE ANV SDW W+Q MR IINDKRYGAL+TLGERKQAFNE++G                
Sbjct: 484  LLESANVHSDWTWEQTMREIINDKRYGALKTLGERKQAFNEYLGQRKKLENEERRMRQKK 543

Query: 1365 XXXDFRKMLEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERA 1544
               +F KMLEESKEL S++RWSK +S FE+DERFKAVER RDRE+L+E+YI ELERKE+ 
Sbjct: 544  AREEFSKMLEESKELMSATRWSKAVSMFENDERFKAVERARDREDLYESYIVELERKEKE 603

Query: 1545 KALEEHKRYRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRD 1724
            KA E+HK+   EY +FL+SCDFIK NSQWRKVQDRLE DERC RLEK+DRL IFQ+Y RD
Sbjct: 604  KAAEDHKQNIAEYRKFLESCDFIKVNSQWRKVQDRLEDDERCLRLEKLDRLLIFQDYIRD 663

Query: 1725 XXXXXXXXXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYI 1904
                               ERKNRDEFRKLMEEHV+ G LT+KT+WRDYCMKVKD  +Y 
Sbjct: 664  LEKEEEEQKKIQKEQLRRVERKNRDEFRKLMEEHVADGTLTAKTYWRDYCMKVKDLSSYE 723

Query: 1905 AVASNTSGSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKD 2084
            AVASNTSGST K+LFEDV EELEKQ+ EDKA+IKDAMK G+V + ST+TFE+FK+AIL+D
Sbjct: 724  AVASNTSGSTPKELFEDVAEELEKQYHEDKARIKDAMKLGKVTLASTLTFEEFKVAILED 783

Query: 2085 SKLSAVSDYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSK 2264
                ++SD N K+V++                    GDD  + +   K+IT SS WED K
Sbjct: 784  IGFPSISDINFKLVYEELLERAKEKEEKEAKKRQRLGDDFNKLLHTFKEITASSNWEDCK 843

Query: 2265 RLVDE----RFVGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2432
             L +E    R +GEE+F RE+F++ I                                  
Sbjct: 844  HLFEETQEYRSIGEENFSREVFEEYITNLQEKAKEKERKREEEKAKKEREREEKEKRKDK 903

Query: 2433 QRSEENRKR-----KDRYNKXXXXXXXXXXXYG-GYEDERRSGKDRSRDFRSRH 2576
            +R E+ R+R     K+R  K              G++++++  KD+ R  R RH
Sbjct: 904  ERKEKEREREKEKGKERSKKDETDSENVDITDSHGHKEDKKREKDKDRKHRKRH 957


>gb|EOY15665.1| Pre-mRNA-processing protein 40A isoform 5 [Theobroma cacao]
          Length = 904

 Score =  634 bits (1636), Expect = e-179
 Identities = 379/807 (46%), Positives = 459/807 (56%), Gaps = 31/807 (3%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVM--PQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGL 176
            F Q M Q P RP    +  P +Q + VP  Q +R                  ++PG    
Sbjct: 77   FSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQSHQTAPPLNSHMPGLGAP 136

Query: 177  GVPXXXXXXXXXXXAGQQQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFTPA 356
            G+P            GQ Q N  +++Q+QP SQ   S  PV GQPW S+GNQS++   P 
Sbjct: 137  GMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAGQPWLSSGNQSVSLAIPI 196

Query: 357  QQTGEQ------SSTAINDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPL 518
            QQTG+Q      + TA N  I  P +     S W EHT+ +G++YYYN+ T+ SSWEKPL
Sbjct: 197  QQTGQQPPLISSADTAANAPIHTPPSA----SDWQEHTSADGRRYYYNKKTRQSSWEKPL 252

Query: 519  ELMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXX 698
            ELMTPIERADAST W+E T P+GR YYYNKVTKQSKW IP+ELKLARE            
Sbjct: 253  ELMTPIERADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPEELKLAREQAQVVASQGAPS 312

Query: 699  QAVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVP-SLIPVAPVDXXXXXXXXXXXX 875
                 V SQA    ++S        + VSS   QA  P S+ PVA V             
Sbjct: 313  DT--GVASQAPVAGAVSSAEMPAAAIPVSSNTSQASSPVSVTPVAAVANPSPTLVSGSTV 370

Query: 876  XXXXXAVAM----MDPXXXXXXXXXXXXXXXXDANITSATEKSFTINSSDTLLAQDAVTS 1043
                 + A     +                     +TS    +  I S ++  +QD+V  
Sbjct: 371  VPVSQSAATNASEVQSPAVAVTPLPAVSSGGSTTPVTSVNANTTMIRSLESTASQDSVHF 430

Query: 1044 VVGVSPGNAEKEAVIGTQESGKS-----EEKKVEQGPVVYENKLEAKNAFKALLEMANVG 1208
              G S  + E EA  G   +GK      EEK  +  P+VY NK EAKNAFK+LLE ANV 
Sbjct: 431  TNGASAQDIE-EAKKGMATAGKVNVTPVEEKVPDDEPLVYANKQEAKNAFKSLLESANVQ 489

Query: 1209 SDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRKM 1388
            SDW W+Q MR IINDKRYGAL+TLGERKQAFNE++G                   +F KM
Sbjct: 490  SDWTWEQTMREIINDKRYGALKTLGERKQAFNEYLGQRKKLEAEERRMRQKKAREEFTKM 549

Query: 1389 LEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHKR 1568
            LEESKELTSS RWSK  S FE+DERFKAVER RDRE+LFENYI ELERKER  A EE +R
Sbjct: 550  LEESKELTSSMRWSKAQSLFENDERFKAVERARDREDLFENYIVELERKERENAAEEKRR 609

Query: 1569 YRVEYLEFLKSCDFIK---------ANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTR 1721
               EY +FL+SCDFIK         ANSQWRKVQDRLE DERCSRLEKIDRL +FQ+Y  
Sbjct: 610  NIAEYRKFLESCDFIKVQHFQKRIQANSQWRKVQDRLEDDERCSRLEKIDRLVMFQDYIH 669

Query: 1722 DXXXXXXXXXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAY 1901
            D                   ERKNRD FRKLM+EHV  G LT+KT+WRDYC+KVKD P Y
Sbjct: 670  DLEKEEEEKKKMQKEQLRRAERKNRDAFRKLMDEHVVDGTLTAKTYWRDYCLKVKDLPPY 729

Query: 1902 IAVASNTSGSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILK 2081
            +AVASNTSGST KDLFEDVVEELEKQ+ +DK  IKDAMK+G++ + ST T EDFK AI +
Sbjct: 730  LAVASNTSGSTPKDLFEDVVEELEKQYQQDKTHIKDAMKSGKISMVSTWTVEDFKAAISE 789

Query: 2082 DSKLSAVSDYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDS 2261
            D     +SD N+K+V++                     DD  + +   K+IT SS WEDS
Sbjct: 790  DVGSLPISDINLKLVYEELLKSAKEKEEKEAKKRQRLADDFTKLLHTYKEITASSDWEDS 849

Query: 2262 KRLVDE----RFVGEESFFREIFDKVI 2330
            + L +E    R + EES  REIF++ I
Sbjct: 850  RPLFEESQEYRSIAEESLRREIFEEYI 876


>ref|XP_006422757.1| hypothetical protein CICLE_v10027732mg [Citrus clementina]
            gi|557524691|gb|ESR35997.1| hypothetical protein
            CICLE_v10027732mg [Citrus clementina]
          Length = 1029

 Score =  632 bits (1629), Expect = e-178
 Identities = 378/926 (40%), Positives = 506/926 (54%), Gaps = 47/926 (5%)
 Frame = +3

Query: 3    FPQQMTQLPTRPG----ADVMPQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFA 170
            F   M  LP RPG    + V P  Q + +P+ Q S H +               Y PG  
Sbjct: 82   FRPLMHPLPARPGPPAPSHVPPPPQVMSLPNAQPSNH-IPPSSLPRPNVQALSSYPPGLG 140

Query: 171  GLGVPXXXXXXXXXXXAGQQQT--NTDSATQYQPISQTTISSFPVEGQPWASAGNQSITT 344
            GLG P            GQ Q   N ++ +Q QP+SQ  + S    GQ   S  +QS  +
Sbjct: 141  GLGRPVAASYTFAPSSYGQPQLIGNVNTGSQ-QPMSQMHVPSISAGGQLGVSV-SQSTVS 198

Query: 345  FTPAQQTGEQ-SSTAINDAIP--KPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKP 515
             TP Q T EQ ++T  +  +P  +P++ E V + W EHT+ +G++YY+N+ T+VS+W+KP
Sbjct: 199  STPVQPTDEQMAATTASAPLPTLQPKSAEGVQTDWKEHTSADGRRYYFNKRTRVSTWDKP 258

Query: 516  LELMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXX 695
             ELMT IERADASTDW+E T PDGR YYYNKVTKQSKW +PDELKLARE           
Sbjct: 259  FELMTTIERADASTDWKEFTSPDGRKYYYNKVTKQSKWSLPDELKLAREQAEKASIKGTQ 318

Query: 696  XQAVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXX 875
             +   +  +  + P   S V  +P + D+SS   +  V S + V P+             
Sbjct: 319  SETSPNSQTSISFP---SSVVKAPSSADISSSTVEVIVSSPVAVVPIIAASETQPALVSV 375

Query: 876  XXXXXAVAM------------MDPXXXXXXXXXXXXXXXXDANITSATEKSFTINSSDTL 1019
                  +              +D                 D  +  A      +++SD +
Sbjct: 376  PSTSPVITSSVVANADGFPKTVDAIAPMIDVSSSIGEAVTDNTVAEAKNNLSNMSASDLV 435

Query: 1020 LAQDAVTSVVGVSPGNAEKEAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKALLEMA 1199
             A D V   V        K+AV G + S   EEK VEQ    Y NKLEAKNAFKALLE A
Sbjct: 436  GASDKVPPPV---TEETRKDAVRGEKVSDALEEKTVEQEHFAYANKLEAKNAFKALLESA 492

Query: 1200 NVGSDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDF 1379
            NVGSDW WDQA+RAIIND+RYGALRTLGERK AFNE++G                   D+
Sbjct: 493  NVGSDWTWDQALRAIINDRRYGALRTLGERKTAFNEYLGQKKKQDAEERRLKLKKARDDY 552

Query: 1380 RKMLEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEE 1559
            +KMLEES ELTSS+RWSK ++ FE+DERFKA+ER+RDR+++F++++ EL++KERAKA EE
Sbjct: 553  KKMLEESVELTSSTRWSKAVTMFENDERFKALERERDRKDMFDDHLDELKQKERAKAQEE 612

Query: 1560 HKRYRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXX 1739
             KR  +EY +FL+SCDFIKAN+QWRKVQDRLEADERCSRL+K+DRLEIFQEY  D     
Sbjct: 613  RKRNIIEYRKFLESCDFIKANTQWRKVQDRLEADERCSRLDKMDRLEIFQEYLNDLEKEE 672

Query: 1740 XXXXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASN 1919
                         TERKNRDEFRKLME  V+ G LT+KT+WRDYC+KVKDSP Y+AVASN
Sbjct: 673  EEQRKIQKEELSKTERKNRDEFRKLMEADVALGTLTAKTNWRDYCIKVKDSPPYMAVASN 732

Query: 1920 TSGSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSA 2099
            TSGST KDLFEDVVEEL+KQF EDK +IKDA+K  ++ ++ST TFEDFK ++L+D+    
Sbjct: 733  TSGSTPKDLFEDVVEELQKQFQEDKTRIKDAVKLRKITLSSTWTFEDFKASVLEDATSPP 792

Query: 2100 VSDYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVD- 2276
            +SD N+K++FD                     D+ ++ + + K+I+ +S WE+ ++L++ 
Sbjct: 793  ISDVNLKLIFDDLLIKVKEKEEKEAKKRKRLEDEFFDLLCSVKEISATSTWENCRQLLEG 852

Query: 2277 -ERF--VGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ-RSE 2444
             + F  +G+ES  R +FD+ + +                                Q R +
Sbjct: 853  SQEFSSIGDESICRGVFDEFVTQLKEQAKDYERKRKEEKAKREKEREERDRRKLKQGRDK 912

Query: 2445 ENRKRKDRYNKXXXXXXXXXXXYGGYEDERRSGKDRSRDFRS------------------ 2570
            E  + +++ +                 D +RSGKD  +  R                   
Sbjct: 913  ERAREREKEDHSKKDGADSDHDDSAENDSKRSGKDNDKKHRKRHQSAHDSLDENEKDRSK 972

Query: 2571 ---RHNDDRKKMKRVXXXXXSRTSGK 2639
               RHN DRKK +R+     S    +
Sbjct: 973  NPHRHNSDRKKPRRLASTPESENESR 998


>ref|XP_006422756.1| hypothetical protein CICLE_v10027732mg [Citrus clementina]
            gi|557524690|gb|ESR35996.1| hypothetical protein
            CICLE_v10027732mg [Citrus clementina]
          Length = 996

 Score =  632 bits (1629), Expect = e-178
 Identities = 378/926 (40%), Positives = 506/926 (54%), Gaps = 47/926 (5%)
 Frame = +3

Query: 3    FPQQMTQLPTRPG----ADVMPQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFA 170
            F   M  LP RPG    + V P  Q + +P+ Q S H +               Y PG  
Sbjct: 49   FRPLMHPLPARPGPPAPSHVPPPPQVMSLPNAQPSNH-IPPSSLPRPNVQALSSYPPGLG 107

Query: 171  GLGVPXXXXXXXXXXXAGQQQT--NTDSATQYQPISQTTISSFPVEGQPWASAGNQSITT 344
            GLG P            GQ Q   N ++ +Q QP+SQ  + S    GQ   S  +QS  +
Sbjct: 108  GLGRPVAASYTFAPSSYGQPQLIGNVNTGSQ-QPMSQMHVPSISAGGQLGVSV-SQSTVS 165

Query: 345  FTPAQQTGEQ-SSTAINDAIP--KPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKP 515
             TP Q T EQ ++T  +  +P  +P++ E V + W EHT+ +G++YY+N+ T+VS+W+KP
Sbjct: 166  STPVQPTDEQMAATTASAPLPTLQPKSAEGVQTDWKEHTSADGRRYYFNKRTRVSTWDKP 225

Query: 516  LELMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXX 695
             ELMT IERADASTDW+E T PDGR YYYNKVTKQSKW +PDELKLARE           
Sbjct: 226  FELMTTIERADASTDWKEFTSPDGRKYYYNKVTKQSKWSLPDELKLAREQAEKASIKGTQ 285

Query: 696  XQAVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXX 875
             +   +  +  + P   S V  +P + D+SS   +  V S + V P+             
Sbjct: 286  SETSPNSQTSISFP---SSVVKAPSSADISSSTVEVIVSSPVAVVPIIAASETQPALVSV 342

Query: 876  XXXXXAVAM------------MDPXXXXXXXXXXXXXXXXDANITSATEKSFTINSSDTL 1019
                  +              +D                 D  +  A      +++SD +
Sbjct: 343  PSTSPVITSSVVANADGFPKTVDAIAPMIDVSSSIGEAVTDNTVAEAKNNLSNMSASDLV 402

Query: 1020 LAQDAVTSVVGVSPGNAEKEAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKALLEMA 1199
             A D V   V        K+AV G + S   EEK VEQ    Y NKLEAKNAFKALLE A
Sbjct: 403  GASDKVPPPV---TEETRKDAVRGEKVSDALEEKTVEQEHFAYANKLEAKNAFKALLESA 459

Query: 1200 NVGSDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDF 1379
            NVGSDW WDQA+RAIIND+RYGALRTLGERK AFNE++G                   D+
Sbjct: 460  NVGSDWTWDQALRAIINDRRYGALRTLGERKTAFNEYLGQKKKQDAEERRLKLKKARDDY 519

Query: 1380 RKMLEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEE 1559
            +KMLEES ELTSS+RWSK ++ FE+DERFKA+ER+RDR+++F++++ EL++KERAKA EE
Sbjct: 520  KKMLEESVELTSSTRWSKAVTMFENDERFKALERERDRKDMFDDHLDELKQKERAKAQEE 579

Query: 1560 HKRYRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXX 1739
             KR  +EY +FL+SCDFIKAN+QWRKVQDRLEADERCSRL+K+DRLEIFQEY  D     
Sbjct: 580  RKRNIIEYRKFLESCDFIKANTQWRKVQDRLEADERCSRLDKMDRLEIFQEYLNDLEKEE 639

Query: 1740 XXXXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASN 1919
                         TERKNRDEFRKLME  V+ G LT+KT+WRDYC+KVKDSP Y+AVASN
Sbjct: 640  EEQRKIQKEELSKTERKNRDEFRKLMEADVALGTLTAKTNWRDYCIKVKDSPPYMAVASN 699

Query: 1920 TSGSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSA 2099
            TSGST KDLFEDVVEEL+KQF EDK +IKDA+K  ++ ++ST TFEDFK ++L+D+    
Sbjct: 700  TSGSTPKDLFEDVVEELQKQFQEDKTRIKDAVKLRKITLSSTWTFEDFKASVLEDATSPP 759

Query: 2100 VSDYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVD- 2276
            +SD N+K++FD                     D+ ++ + + K+I+ +S WE+ ++L++ 
Sbjct: 760  ISDVNLKLIFDDLLIKVKEKEEKEAKKRKRLEDEFFDLLCSVKEISATSTWENCRQLLEG 819

Query: 2277 -ERF--VGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ-RSE 2444
             + F  +G+ES  R +FD+ + +                                Q R +
Sbjct: 820  SQEFSSIGDESICRGVFDEFVTQLKEQAKDYERKRKEEKAKREKEREERDRRKLKQGRDK 879

Query: 2445 ENRKRKDRYNKXXXXXXXXXXXYGGYEDERRSGKDRSRDFRS------------------ 2570
            E  + +++ +                 D +RSGKD  +  R                   
Sbjct: 880  ERAREREKEDHSKKDGADSDHDDSAENDSKRSGKDNDKKHRKRHQSAHDSLDENEKDRSK 939

Query: 2571 ---RHNDDRKKMKRVXXXXXSRTSGK 2639
               RHN DRKK +R+     S    +
Sbjct: 940  NPHRHNSDRKKPRRLASTPESENESR 965


>ref|XP_006827042.1| hypothetical protein AMTR_s00010p00227470 [Amborella trichopoda]
            gi|548831471|gb|ERM94279.1| hypothetical protein
            AMTR_s00010p00227470 [Amborella trichopoda]
          Length = 985

 Score =  630 bits (1624), Expect = e-177
 Identities = 376/877 (42%), Positives = 489/877 (55%), Gaps = 23/877 (2%)
 Frame = +3

Query: 15   MTQLPTRPG--ADVMPQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGLGVPX 188
            M QLP RP   A V P  Q +P+   Q +R                  + PG  G G   
Sbjct: 66   MQQLPPRPAQTAQVAPSPQTVPLSYIQPNRPMTSGPLQIPQNPQHVNIHPPGLGGPGTVL 125

Query: 189  XXXXXXXXXXAG-QQQTNTDSATQYQPISQTTISSFPVE--GQPWASAGNQSITTFTPAQ 359
                      +    Q N + ++QYQP SQ  +   P    GQPW S+G+QS T   P  
Sbjct: 126  SSSYTFTAPSSYVHPQNNINISSQYQPSSQMQVPGVPSGSGGQPWLSSGSQSTTVIPPVV 185

Query: 360  QTGEQSSTAINDA---IPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLELMT 530
            Q  +QSS A + A    P+P    +  S W EHT+ +G++YYYN+ T+ SSWEKPLELMT
Sbjct: 186  QASQQSSFAASTAPVATPQPNPTSQSSSDWQEHTSADGRRYYYNKKTRQSSWEKPLELMT 245

Query: 531  PIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQAVK 710
            PIERADAST W+E T P+GR YYYNKVTKQSKW IPDELKLARE            +   
Sbjct: 246  PIERADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPDELKLAREQAEKNGTQLTNSETTD 305

Query: 711  DVDSQA----TAPVSLSGVNTSPVNVDVSSFPCQAGVP-SLIPVAPVDXXXXXXXXXXXX 875
             V S      T P++      + ++   S+ P  +G+  S + V PV             
Sbjct: 306  VVASSTPVTVTVPLTEMPSTVAAISATQSAMPSTSGMATSPVLVTPV------------- 352

Query: 876  XXXXXAVAMMDPXXXXXXXXXXXXXXXXDANITSATEKSFTINSSDTLLAQDA-VTSVVG 1052
                   A +DP                 ++  +A EK    N S   +AQ A  TS   
Sbjct: 353  --VSVPAAAVDP-----------------SSAGAAYEKIKVDNVSPESIAQVADETSAQD 393

Query: 1053 VSPGNAEKEAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKALLEMANVGSDWNWDQA 1232
            +               +  S+EK V++ P+V+ +K EAKNAFK LL  A+V SDW WDQA
Sbjct: 394  LEEARKAMPVAGKVNITPTSDEKTVDEEPLVFASKQEAKNAFKELLVSAHVESDWTWDQA 453

Query: 1233 MRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRKMLEESKELT 1412
            MR IINDKRYGAL+TLGERKQAFNE++G                   DF KMLEESKELT
Sbjct: 454  MRVIINDKRYGALKTLGERKQAFNEYLGQRKKLEAEEKRTRQKKAREDFVKMLEESKELT 513

Query: 1413 SSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHKRYRVEYLEF 1592
            S+++WSK I+ FEDDERF+AVER RDREELFE +++EL RKERAKA EEH+R   EY  F
Sbjct: 514  SATKWSKAITMFEDDERFRAVERGRDREELFEMHLEELHRKERAKAQEEHRRNVQEYRAF 573

Query: 1593 LKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXXXXXXXXXXX 1772
            L+SCDFIKA+SQWRKVQDRLE DERC+RLEKIDRLEIFQEY RD                
Sbjct: 574  LESCDFIKASSQWRKVQDRLEDDERCARLEKIDRLEIFQEYIRDLEKEEEEQRKLQKEHL 633

Query: 1773 XXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTSGSTAKDLFE 1952
               ERKNRD+FRKLME H++AG+LT+KTHWR+YCMKVKD PAY+AV+SNTSGST KDLFE
Sbjct: 634  RRAERKNRDDFRKLMEGHIAAGILTAKTHWREYCMKVKDLPAYLAVSSNTSGSTPKDLFE 693

Query: 1953 DVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVSDYNMKVVFD 2132
            D  EEL+KQ+ ED+ +IKDA+K    ++TST +FE+FK AI +D+ L ++S+ N+K+VFD
Sbjct: 694  DTAEELDKQYQEDRTRIKDAVKMARFVMTSTWSFENFKEAISEDNNLKSISETNLKLVFD 753

Query: 2133 XXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDE----RFVGEES 2300
                                 DD+ + + + KDI+ SSRWE+ K L++E    R + +ES
Sbjct: 754  ELLERLKEKEEKEAKKRQRMADDLKDLLYSIKDISASSRWEECKPLLEENQAYRSINDES 813

Query: 2301 FFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRSEE----NRKRKDR 2468
            F R+IF++ +                                  +R E+    +R++KDR
Sbjct: 814  FARQIFEEYVAYLQEKIKEKERKREEEKARKEKEREEKEKRKEKERKEKERDRDREKKDR 873

Query: 2469 YNKXXXXXXXXXXXYG-GYEDERRSGKDRSRDFRSRH 2576
              +              G++D+++  K++ R  R RH
Sbjct: 874  ARRDEMDVENLDVINDFGHKDDKKREKEKDRRHRKRH 910


>ref|XP_006486884.1| PREDICTED: pre-mRNA-processing protein 40A-like isoform X1 [Citrus
            sinensis] gi|568867105|ref|XP_006486885.1| PREDICTED:
            pre-mRNA-processing protein 40A-like isoform X2 [Citrus
            sinensis] gi|568867107|ref|XP_006486886.1| PREDICTED:
            pre-mRNA-processing protein 40A-like isoform X3 [Citrus
            sinensis] gi|568867109|ref|XP_006486887.1| PREDICTED:
            pre-mRNA-processing protein 40A-like isoform X4 [Citrus
            sinensis]
          Length = 1029

 Score =  627 bits (1618), Expect = e-177
 Identities = 379/931 (40%), Positives = 507/931 (54%), Gaps = 52/931 (5%)
 Frame = +3

Query: 3    FPQQMTQLPTRPG----ADVMPQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFA 170
            F   M  LP RPG    + V P  Q + +P+ Q S H +               Y PG  
Sbjct: 82   FRPLMHPLPARPGPPAPSHVPPPPQVMSLPNAQPSNH-IPPSSLPRPNVQALSSYPPGLG 140

Query: 171  GLGVPXXXXXXXXXXXAGQQQT--NTDSATQYQPISQTTISSFPVEGQPWASAGNQSITT 344
            GLG P            GQ Q   N +  +Q QP+SQ  + S    GQ   S  +QS  +
Sbjct: 141  GLGRPVAASYTFAPSSYGQPQLIGNVNIGSQ-QPMSQMHVPSISAGGQLGVSV-SQSTVS 198

Query: 345  FTPAQQTGEQ-SSTAINDAIP--KPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKP 515
             TP Q T EQ ++T  +  +P  +P++ E V + W EHT+ +G++YY+N+ T+VS+W+KP
Sbjct: 199  STPVQPTDEQMAATTASAPLPTLQPKSAEGVQTDWKEHTSADGRRYYFNKRTRVSTWDKP 258

Query: 516  LELMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXX 695
             ELMT IERADASTDW+E T PDGR YYYNKVTKQSKW +PDELKLARE           
Sbjct: 259  FELMTTIERADASTDWKEFTSPDGRKYYYNKVTKQSKWSLPDELKLAREQAEKA------ 312

Query: 696  XQAVKDVDSQAT----APVSL-SGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXX 860
              ++K   S+ +     P+S  S V  +P + D+SS   +    S + V P+        
Sbjct: 313  --SIKGTQSETSPNSQTPISFPSSVVKAPSSADISSSTVEVIASSPVAVVPIIAASETQP 370

Query: 861  XXXXXXXXXXAVAM------------MDPXXXXXXXXXXXXXXXXDANITSATEKSFTIN 1004
                       +              +D                 D  +  A      ++
Sbjct: 371  ALVSVPSTSPVITSSVVANADGVPKTVDAIAPMIDVSSSIGEAVTDNTVAEAKNNLSNMS 430

Query: 1005 SSDTLLAQDAVTSVVGVSPGNAEKEAVIGTQESGKSEEKKVEQGPVVYENKLEAKNAFKA 1184
            +SD + A D V   V        K+AV G + S   EEK VEQ    Y NKLEAKNAFKA
Sbjct: 431  ASDLVGASDKVPPPV---TEETRKDAVRGEKVSDALEEKTVEQEHFAYANKLEAKNAFKA 487

Query: 1185 LLEMANVGSDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXX 1364
            LLE ANVGSDW WDQA+RAIIND+RYGALRTLGERK AFNE++G                
Sbjct: 488  LLESANVGSDWTWDQALRAIINDRRYGALRTLGERKTAFNEYLGQKKKQDAEERRLKLKK 547

Query: 1365 XXXDFRKMLEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERA 1544
               D++KMLEES ELTSS+RWSK ++ FE+DERFKA+ER+RDR+++F++++ EL++KERA
Sbjct: 548  ARDDYKKMLEESVELTSSTRWSKAVTMFENDERFKALERERDRKDMFDDHLDELKQKERA 607

Query: 1545 KALEEHKRYRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRD 1724
            K  EE KR  +EY +FL+SCDFIKAN+QWRKVQDRLEADERCSRL+K+DRLEIFQEY  D
Sbjct: 608  KVQEERKRNIIEYRKFLESCDFIKANTQWRKVQDRLEADERCSRLDKMDRLEIFQEYLND 667

Query: 1725 XXXXXXXXXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYI 1904
                              TERKNRDEFRKLME  V+ G LT+KT+WRDYC+KVKDSP Y+
Sbjct: 668  LEKEEEEQRKIQKEELSKTERKNRDEFRKLMEADVALGTLTAKTNWRDYCIKVKDSPPYM 727

Query: 1905 AVASNTSGSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKD 2084
            AVASNTSGST KDLFEDVVEEL+KQF EDK +IKDA+K  ++ ++ST TFEDFK ++L+D
Sbjct: 728  AVASNTSGSTPKDLFEDVVEELQKQFQEDKTRIKDAVKLRKITLSSTWTFEDFKASVLED 787

Query: 2085 SKLSAVSDYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSK 2264
            +    +SD N+K++FD                     D+ ++ + + K+I+ +S WE+ +
Sbjct: 788  ATSPPISDVNLKLIFDDLLIKVKEKEEKEAKKRKRLEDEFFDLLCSVKEISATSTWENCR 847

Query: 2265 RLVD--ERF--VGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2432
            +L++  + F  +G+ES  R +FD+ + +                                
Sbjct: 848  QLLEGSQEFSSIGDESICRGVFDEFVTQLKEQAKDYERKRKEEKAKREKEREERDRRKLK 907

Query: 2433 Q-RSEENRKRKDRYNKXXXXXXXXXXXYGGYEDERRSGKDRSRDFRS------------- 2570
            Q R +E  + +++ +                 D +RSGKD  +  R              
Sbjct: 908  QGRDKERAREREKEDHSKKDGADSDHDDSAENDSKRSGKDNDKKHRKRHQSAHDSLDENE 967

Query: 2571 --------RHNDDRKKMKRVXXXXXSRTSGK 2639
                    RHN DRKK +R+     S    +
Sbjct: 968  KDRSKNPHRHNSDRKKPRRLASTPESENESR 998


>ref|XP_004496865.1| PREDICTED: pre-mRNA-processing protein 40A-like isoform X1 [Cicer
            arietinum]
          Length = 1013

 Score =  622 bits (1605), Expect = e-175
 Identities = 358/801 (44%), Positives = 471/801 (58%), Gaps = 23/801 (2%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVMPQSQAIPVPDFQQSRHGMXXXXXXXXXXXXXXXYVPGFAGLG- 179
            F Q + Q P R G  +   SQ IP+P  + +                   Y PG  G G 
Sbjct: 73   FSQPIQQFPPRAGQQLPHPSQVIPMPVVRPNMQPSSESMMPQPDSQAPNGYTPGLGGPGN 132

Query: 180  -VPXXXXXXXXXXXAGQ-QQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFTP 353
             +P            GQ QQ N  S +QYQP+ Q    S          + +QSIT  T 
Sbjct: 133  GMPLSSSYMFAPSSYGQAQQPNFISTSQYQPVPQIQAPS---------GSSSQSITPGTS 183

Query: 354  AQQTGEQSSTAI---NDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLEL 524
             Q  GEQ + A    +  + +P   +  PS WIEH +  G+++YYN+ TK+SSWEKP EL
Sbjct: 184  HQSNGEQPTVATFMHSATVVQPHLAKVGPSDWIEHNSSTGRRFYYNKRTKLSSWEKPFEL 243

Query: 525  MTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQA 704
            MTPIER DAST+W+E++ PDGR YYYNK+TK+SKW IP+ELKLARE            +A
Sbjct: 244  MTPIERVDASTNWKEYSSPDGRKYYYNKITKESKWLIPEELKLAREQVEKAMINGALPEA 303

Query: 705  VKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPV---DXXXXXXXXXXXX 875
            V +  +Q +A    + V+ +  + D SS P Q    S +PVAPV                
Sbjct: 304  VLNPRTQPSA----TSVSEAMPSADNSSLPGQGEPSSPVPVAPVVTTSPSNLQSEIAPGS 359

Query: 876  XXXXXAVAMMDPXXXXXXXXXXXXXXXXDANITSATEKSFTINSSDTLL-------AQDA 1034
                   A+                   DA++ S       IN+S T +       AQD 
Sbjct: 360  CGSPSTSAVTGTKVDEPEAPPVNTINPSDASVGSDRAIVSDINTSVTPMNDTNNFAAQDT 419

Query: 1035 VTSVVGVSPGNAEKEAVIGTQESGK---SEEKKVEQGPVVYENKLEAKNAFKALLEMANV 1205
            V S  GV   + +   +  T E+     SE K VE   +VY NK+EAK+AFK+LLE  NV
Sbjct: 420  VGSADGVPGEDKDDGKIDSTGENVNEVASETKTVEPESLVYANKMEAKDAFKSLLESVNV 479

Query: 1206 GSDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRK 1385
            G DW+WD+AMR IINDKRYGAL++LGERKQAFNE++                    DFRK
Sbjct: 480  GPDWSWDRAMRLIINDKRYGALKSLGERKQAFNEYLSQRKKQEAEEKRMKHKKAREDFRK 539

Query: 1386 MLEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHK 1565
            MLEES ELTSS+R+SK ++ FE+DERFKAVER RDR+++F+++++EL  KERAKALEE K
Sbjct: 540  MLEESTELTSSTRFSKAVAIFENDERFKAVERDRDRKDMFDSFLEELMNKERAKALEERK 599

Query: 1566 RYRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXX 1745
            R  VEY +FL+SCDFIKAN+QWRKVQDRLEADERCSRLEKIDRLEIFQ+Y RD       
Sbjct: 600  RNTVEYRKFLESCDFIKANTQWRKVQDRLEADERCSRLEKIDRLEIFQDYLRDLEKEEEE 659

Query: 1746 XXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTS 1925
                       TERKNRDEFRKLM+EH+++G+LT+KTHWRDY  KVKD PAY+AVASNTS
Sbjct: 660  QKKIQKEELRKTERKNRDEFRKLMDEHIASGILTAKTHWRDYHFKVKDLPAYLAVASNTS 719

Query: 1926 GSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVS 2105
            GST K+LFEDV EELEKQ++E+K+QIKDA+K  ++ + +T TFEDFK A+ +     + S
Sbjct: 720  GSTPKELFEDVAEELEKQYVEEKSQIKDAVKLAKITLLTTWTFEDFKSALSEHISSPSTS 779

Query: 2106 DYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDE-- 2279
            D N+K+VF+                     DD +  + + KDIT SS+WEDS+ L+++  
Sbjct: 780  DSNLKLVFEELLERAKEKEEKEAKKRKRLADDFFRLLFSIKDITESSKWEDSEPLLEDSQ 839

Query: 2280 --RFVGEESFFREIFDKVILE 2336
              R +G+ S  +++F++ + +
Sbjct: 840  EFRSIGDASLCKQMFEEYVAQ 860


>ref|XP_006606005.1| PREDICTED: pre-mRNA-processing protein 40A-like isoform X1 [Glycine
            max] gi|571567022|ref|XP_006606006.1| PREDICTED:
            pre-mRNA-processing protein 40A-like isoform X2 [Glycine
            max]
          Length = 1008

 Score =  622 bits (1603), Expect = e-175
 Identities = 370/900 (41%), Positives = 485/900 (53%), Gaps = 38/900 (4%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVMPQSQAIPVPDFQQSRH----GMXXXXXXXXXXXXXXXYVPGFA 170
            F Q + QLP RP   + P SQAIP+P  + + H     M               Y PG  
Sbjct: 81   FSQPIQQLPPRPSPQLPPPSQAIPMPVARPNMHIPSESMMQQSDSQAHSQAPNGYTPGLG 140

Query: 171  GLGVPXXXXXXXXXXXAGQQQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFT 350
            G G+P            GQ Q N +S  Q+QP+ Q               + +QSITT  
Sbjct: 141  GPGMPLSSSYTFAPSTYGQVQANFNSTGQFQPVPQI---------HALTGSSSQSITTGA 191

Query: 351  PAQQTGEQS--STAINDA-IPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLE 521
              Q  G Q   +T +  A I +P+  +  P+ WIEHT+  G+ +YYN+ TKVSSWEKP E
Sbjct: 192  TLQSNGGQPLVTTVMPLATIAQPQLTKNGPTDWIEHTSATGRTFYYNKKTKVSSWEKPFE 251

Query: 522  LMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQ 701
            LMTPIER DA+T+W+E+T PDGR YYYNK+T +SKW IP+ELKLARE            +
Sbjct: 252  LMTPIERVDATTNWKEYTSPDGRKYYYNKITNESKWSIPEELKLAREQVEKAIVSGSRPE 311

Query: 702  AVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXXXX 881
            A+ +   Q   P        +  N D S+ P Q    S + VAPV               
Sbjct: 312  ALLNSHPQ---PSPTPSATEATPNTDNSTLPSQGEPSSPVSVAPV--------------- 353

Query: 882  XXXAVAMMDPXXXXXXXXXXXXXXXXDANITSATEKSFTINSSDTLLAQD---------A 1034
               +++                     A +        T+  SDT +  D         A
Sbjct: 354  VTTSISNPQSEMPSGPSLSTSANAITGAKVDELEAPVNTVTPSDTCVGSDKAVVTDINTA 413

Query: 1035 VTSVVGVSPGNAEK------EAVIGTQESGKS------------EEKKVEQGPVVYENKL 1160
            VT +  V+  +A+          +  +E GK+            E K VE  P VY NK+
Sbjct: 414  VTPMNDVNNDSAQDTLGSADRVPVEDKEDGKNDLIGEKSNDVAAETKAVEPEPPVYANKM 473

Query: 1161 EAKNAFKALLEMANVGSDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXX 1340
            EAK+AFKALLE  NVGSDW WD++MR IINDKRYGAL+TL ERKQAFNE++         
Sbjct: 474  EAKDAFKALLESVNVGSDWTWDRSMRLIINDKRYGALKTLVERKQAFNEYLNQRKKQEAE 533

Query: 1341 XXXXXXXXXXXDFRKMLEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIK 1520
                       DF+KMLEES +LTSS+RWSK +S FE+DERFKAVER RDR ++FE++++
Sbjct: 534  EKRMKQKKAREDFKKMLEESTDLTSSTRWSKAVSIFENDERFKAVERDRDRRDMFESFLE 593

Query: 1521 ELERKERAKALEEHKRYRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLE 1700
            EL  KERAK  EE KR  +EY +FL+SCDFIKA++QWRKVQDRLEADERCSRLEKIDRLE
Sbjct: 594  ELLNKERAKVQEERKRNIMEYRKFLESCDFIKASTQWRKVQDRLEADERCSRLEKIDRLE 653

Query: 1701 IFQEYTRDXXXXXXXXXXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMK 1880
            IFQ+Y RD                  TERKNR+EFRKLM EH+++G+LT+KTHWRDY  K
Sbjct: 654  IFQDYLRDLEKEEEEQKKIQKEEVRKTERKNREEFRKLMGEHIASGILTAKTHWRDYYTK 713

Query: 1881 VKDSPAYIAVASNTSGSTAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFED 2060
            VKD  AY+AVASNTSGST KDLFEDV EELEKQ+ E+K++IKDA+K  ++ ++ST+TFED
Sbjct: 714  VKDLHAYVAVASNTSGSTPKDLFEDVAEELEKQYHEEKSRIKDAVKLTKITLSSTLTFED 773

Query: 2061 FKIAILKDSKLSAVSDYNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITH 2240
            FK  +LKD     +SD+N+K+VFD                     DD +  + ++KD T 
Sbjct: 774  FKSVLLKDISTPPISDFNLKLVFDELLERVKEKEEKEAKKRKRLADDFFHLLHSTKDFTV 833

Query: 2241 SSRWEDSKRLVDE----RFVGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXX 2408
            SS+WED + LV++    R +G+ES  +E+F++ I +                        
Sbjct: 834  SSKWEDCRPLVEDSQEFRSIGDESLCKEVFEEYIAQ------------------------ 869

Query: 2409 XXXXXXXXQRSEENRKRKDRYNKXXXXXXXXXXXYGGYEDERRSGKDRSRDFRSRHNDDR 2588
                    +  E  RKRK+   K            G    E+  G++R +D    H  D+
Sbjct: 870  -----LKEEAKENERKRKEERAKKEKDREERERRKGKQRKEKEGGRERGKD--EAHKKDK 922


>ref|XP_006589614.1| PREDICTED: pre-mRNA-processing protein 40A-like isoform X2 [Glycine
            max]
          Length = 1017

 Score =  621 bits (1602), Expect = e-175
 Identities = 368/884 (41%), Positives = 483/884 (54%), Gaps = 22/884 (2%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVMPQSQAIPVPDFQQSRH----GMXXXXXXXXXXXXXXXYVPGFA 170
            F Q + QLP RP   + P SQAIP+P  + + H     M               Y PG  
Sbjct: 81   FSQPIQQLPPRPSPQLPPPSQAIPMPVARPNMHIPSESMMHQPDSQVHSQAPNGYTPGLG 140

Query: 171  GLGVPXXXXXXXXXXXAGQQQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFT 350
            G  +P            GQ QTN  S  Q+QP+ Q               + +QSITT  
Sbjct: 141  GPAMPLSASYTFAPSAYGQVQTNFSSTGQFQPVPQI---------HALTGSSSQSITTGA 191

Query: 351  PAQQTGEQSSTAI---NDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLE 521
              Q  G Q S      +  I +P+  +  P+ WIEHT+  G+ +YYN+ TKVSSWEKP E
Sbjct: 192  TLQSNGGQPSVTTVMPSATIAQPQLAKNGPTDWIEHTSATGRTFYYNKKTKVSSWEKPFE 251

Query: 522  LMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQ 701
            LMTPIER DA+T+W+E+T PDGR YYYNK+T +SKW +P+ELKLARE            +
Sbjct: 252  LMTPIERVDATTNWKEYTSPDGRKYYYNKITNESKWSVPEELKLARELVEKAIVSGARPE 311

Query: 702  AVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXXXX 881
            A+  ++S      + S +  +P N D SS P Q    S + V+PV               
Sbjct: 312  AL--LNSHPQPSPTPSAIEATP-NADNSSLPSQGEPSSPVSVSPVVTTSISNLQSEMPSG 368

Query: 882  XXXAVA-------MMDPXXXXXXXXXXXXXXXXDANITSATEKSFT-INSSDTLLAQDAV 1037
               + A       + +                 D  I +    + T +N  D   AQ  +
Sbjct: 369  SSPSPADAITGTKVDELEAPLNTVTPSDTSVGSDKAIVTDINTAVTPMNDVDNDSAQATL 428

Query: 1038 TSVVGVSPGNAE--KEAVIGTQESGKSEEKK-VEQGPVVYENKLEAKNAFKALLEMANVG 1208
             S  GVS  + E  K   IG + + ++ E K VE  P VY NK+EAK+AFKALLE  NVG
Sbjct: 429  GSADGVSAEDKEDGKNDSIGEKSNDEAAETKAVEPEPPVYANKMEAKDAFKALLESVNVG 488

Query: 1209 SDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRKM 1388
            SDW WD++MR IINDKRYGAL+TLGERKQAFNE++                    DF+KM
Sbjct: 489  SDWTWDRSMRLIINDKRYGALKTLGERKQAFNEYLNQRKKQEAEEKRMKQKKAREDFKKM 548

Query: 1389 LEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHKR 1568
            LEES +LTSS+RWSK +S FE+DERFKAVER RDR ++FE++++EL  KERAK  EE KR
Sbjct: 549  LEESTDLTSSARWSKAVSIFENDERFKAVERDRDRRDMFESFLEELLNKERAKVQEERKR 608

Query: 1569 YRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXXX 1748
              +EY +FL+SCDFIKA++QWRKVQDRLEADERCSRLEKIDRLEIFQ+Y  D        
Sbjct: 609  NIMEYKKFLESCDFIKASTQWRKVQDRLEADERCSRLEKIDRLEIFQDYLHDLEKEEEEQ 668

Query: 1749 XXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTSG 1928
                      TERKNR+EFRKLMEEH+++G+LT+KTHWRDY  KVKD  AY+AVASNTSG
Sbjct: 669  KKIQKEELRKTERKNREEFRKLMEEHIASGILTAKTHWRDYYTKVKDLHAYVAVASNTSG 728

Query: 1929 STAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVSD 2108
            ST KDLFEDV EELEKQ+ E+K++IKD +K  ++ ++ST  FEDFK A+ K      +SD
Sbjct: 729  STPKDLFEDVAEELEKQYHEEKSRIKDTVKLAKITLSSTWAFEDFKSALSKAISTPPISD 788

Query: 2109 YNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDE--- 2279
            +N+K+VFD                     DD +  + ++KDIT S +WED +  V++   
Sbjct: 789  FNLKLVFDELLERAKEKEEKEAKKRKRLSDDFFHLLHSTKDITVSLKWEDCRPHVEDSQE 848

Query: 2280 -RFVGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRSEENRK 2456
             R +G+ES  +E+F++ I +                                +  E  RK
Sbjct: 849  FRSIGDESLCKEVFEEYIAQ-----------------------------LKEEAKESERK 879

Query: 2457 RKDRYNKXXXXXXXXXXXYGGYEDERRSGKDRSRDFRSRHNDDR 2588
            RK+   K            G    E+  G++R +D    H  D+
Sbjct: 880  RKEERAKKEKDREERERRKGKQRKEKEGGRERGKD--EAHKKDK 921


>ref|XP_003535678.1| PREDICTED: pre-mRNA-processing protein 40A-like isoform X1 [Glycine
            max]
          Length = 1017

 Score =  621 bits (1602), Expect = e-175
 Identities = 368/884 (41%), Positives = 483/884 (54%), Gaps = 22/884 (2%)
 Frame = +3

Query: 3    FPQQMTQLPTRPGADVMPQSQAIPVPDFQQSRH----GMXXXXXXXXXXXXXXXYVPGFA 170
            F Q + QLP RP   + P SQAIP+P  + + H     M               Y PG  
Sbjct: 81   FSQPIQQLPPRPSPQLPPPSQAIPMPVARPNMHIPSESMMHQPDSQVHSQAPNGYTPGLG 140

Query: 171  GLGVPXXXXXXXXXXXAGQQQTNTDSATQYQPISQTTISSFPVEGQPWASAGNQSITTFT 350
            G  +P            GQ QTN  S  Q+QP+ Q               + +QSITT  
Sbjct: 141  GPAMPLSASYTFAPSAYGQVQTNFSSTGQFQPVPQI---------HALTGSSSQSITTGA 191

Query: 351  PAQQTGEQSSTAI---NDAIPKPETGEKVPSVWIEHTARNGKKYYYNRITKVSSWEKPLE 521
              Q  G Q S      +  I +P+  +  P+ WIEHT+  G+ +YYN+ TKVSSWEKP E
Sbjct: 192  TLQSNGGQPSVTTVMPSATIAQPQLAKNGPTDWIEHTSATGRTFYYNKKTKVSSWEKPFE 251

Query: 522  LMTPIERADASTDWREHTGPDGRTYYYNKVTKQSKWRIPDELKLAREXXXXXXXXXXXXQ 701
            LMTPIER DA+T+W+E+T PDGR YYYNK+T +SKW +P+ELKLARE            +
Sbjct: 252  LMTPIERVDATTNWKEYTSPDGRKYYYNKITNESKWSVPEELKLARELVEKAIVSGARPE 311

Query: 702  AVKDVDSQATAPVSLSGVNTSPVNVDVSSFPCQAGVPSLIPVAPVDXXXXXXXXXXXXXX 881
            A+  ++S      + S +  +P N D SS P Q    S + V+PV               
Sbjct: 312  AL--LNSHPQPSPTPSAIEATP-NADNSSLPSQGEPSSPVSVSPVVTTSISNLQSEMPSG 368

Query: 882  XXXAVA-------MMDPXXXXXXXXXXXXXXXXDANITSATEKSFT-INSSDTLLAQDAV 1037
               + A       + +                 D  I +    + T +N  D   AQ  +
Sbjct: 369  SSPSPADAITGTKVDELEAPLNTVTPSDTSVGSDKAIVTDINTAVTPMNDVDNDSAQATL 428

Query: 1038 TSVVGVSPGNAE--KEAVIGTQESGKSEEKK-VEQGPVVYENKLEAKNAFKALLEMANVG 1208
             S  GVS  + E  K   IG + + ++ E K VE  P VY NK+EAK+AFKALLE  NVG
Sbjct: 429  GSADGVSAEDKEDGKNDSIGEKSNDEAAETKAVEPEPPVYANKMEAKDAFKALLESVNVG 488

Query: 1209 SDWNWDQAMRAIINDKRYGALRTLGERKQAFNEFVGXXXXXXXXXXXXXXXXXXXDFRKM 1388
            SDW WD++MR IINDKRYGAL+TLGERKQAFNE++                    DF+KM
Sbjct: 489  SDWTWDRSMRLIINDKRYGALKTLGERKQAFNEYLNQRKKQEAEEKRMKQKKAREDFKKM 548

Query: 1389 LEESKELTSSSRWSKIISRFEDDERFKAVERQRDREELFENYIKELERKERAKALEEHKR 1568
            LEES +LTSS+RWSK +S FE+DERFKAVER RDR ++FE++++EL  KERAK  EE KR
Sbjct: 549  LEESTDLTSSARWSKAVSIFENDERFKAVERDRDRRDMFESFLEELLNKERAKVQEERKR 608

Query: 1569 YRVEYLEFLKSCDFIKANSQWRKVQDRLEADERCSRLEKIDRLEIFQEYTRDXXXXXXXX 1748
              +EY +FL+SCDFIKA++QWRKVQDRLEADERCSRLEKIDRLEIFQ+Y  D        
Sbjct: 609  NIMEYKKFLESCDFIKASTQWRKVQDRLEADERCSRLEKIDRLEIFQDYLHDLEKEEEEQ 668

Query: 1749 XXXXXXXXXXTERKNRDEFRKLMEEHVSAGVLTSKTHWRDYCMKVKDSPAYIAVASNTSG 1928
                      TERKNR+EFRKLMEEH+++G+LT+KTHWRDY  KVKD  AY+AVASNTSG
Sbjct: 669  KKIQKEELRKTERKNREEFRKLMEEHIASGILTAKTHWRDYYTKVKDLHAYVAVASNTSG 728

Query: 1929 STAKDLFEDVVEELEKQFLEDKAQIKDAMKNGEVIVTSTMTFEDFKIAILKDSKLSAVSD 2108
            ST KDLFEDV EELEKQ+ E+K++IKD +K  ++ ++ST  FEDFK A+ K      +SD
Sbjct: 729  STPKDLFEDVAEELEKQYHEEKSRIKDTVKLAKITLSSTWAFEDFKSALSKAISTPPISD 788

Query: 2109 YNMKVVFDXXXXXXXXXXXXXXXXXXXXGDDVYEFMINSKDITHSSRWEDSKRLVDE--- 2279
            +N+K+VFD                     DD +  + ++KDIT S +WED +  V++   
Sbjct: 789  FNLKLVFDELLERAKEKEEKEAKKRKRLSDDFFHLLHSTKDITVSLKWEDCRPHVEDSQE 848

Query: 2280 -RFVGEESFFREIFDKVILEHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRSEENRK 2456
             R +G+ES  +E+F++ I +                                +  E  RK
Sbjct: 849  FRSIGDESLCKEVFEEYIAQ-----------------------------LKEEAKESERK 879

Query: 2457 RKDRYNKXXXXXXXXXXXYGGYEDERRSGKDRSRDFRSRHNDDR 2588
            RK+   K            G    E+  G++R +D    H  D+
Sbjct: 880  RKEERAKKEKDREERERRKGKQRKEKEGGRERGKD--EAHKKDK 921

