BLASTX nr result

ID: Paeonia22_contig00013046 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00013046
         (3532 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002272014.2| PREDICTED: transcription elongation regulato...  1043   0.0  
ref|XP_007221939.1| hypothetical protein PRUPE_ppa001490mg [Prun...   898   0.0  
ref|XP_002315059.2| hypothetical protein POPTR_0010s17750g [Popu...   889   0.0  
ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citr...   882   0.0  
ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-l...   879   0.0  
ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma c...   875   0.0  
ref|XP_002515795.1| Pre-mRNA-processing protein PRP40, putative ...   874   0.0  
ref|XP_006590812.1| PREDICTED: pre-mRNA-processing protein 40C-l...   860   0.0  
ref|XP_006590824.1| PREDICTED: pre-mRNA-processing protein 40C-l...   858   0.0  
ref|XP_006590813.1| PREDICTED: pre-mRNA-processing protein 40C-l...   854   0.0  
ref|XP_003538973.2| PREDICTED: pre-mRNA-processing protein 40C-l...   852   0.0  
gb|EXC33082.1| Transcription elongation regulator 1 [Morus notab...   847   0.0  
ref|XP_004505734.1| PREDICTED: pre-mRNA-processing protein 40C-l...   834   0.0  
ref|XP_003540642.1| PREDICTED: pre-mRNA-processing protein 40C-l...   825   0.0  
ref|XP_007131663.1| hypothetical protein PHAVU_011G031500g [Phas...   817   0.0  
ref|XP_006592053.1| PREDICTED: pre-mRNA-processing protein 40C-l...   815   0.0  
ref|XP_006592054.1| PREDICTED: pre-mRNA-processing protein 40C-l...   809   0.0  
ref|XP_004236882.1| PREDICTED: pre-mRNA-processing protein 40C-l...   798   0.0  
ref|XP_003607201.1| Transcription elongation regulator [Medicago...   791   0.0  
ref|XP_006360861.1| PREDICTED: pre-mRNA-processing protein 40C-l...   786   0.0  

>ref|XP_002272014.2| PREDICTED: transcription elongation regulator 1-like [Vitis vinifera]
            gi|297738259|emb|CBI27460.3| unnamed protein product
            [Vitis vinifera]
          Length = 1046

 Score = 1043 bits (2698), Expect = 0.0
 Identities = 582/1047 (55%), Positives = 660/1047 (63%), Gaps = 46/1047 (4%)
 Frame = -1

Query: 3202 MASPAWLPQEVQSSTSQAPVSGLXXXXXXXXXXXXXXXXA--QIAGXXXXXXXXXXXXXS 3029
            MASPAWLP EVQSS SQ PV+GL                A   +A              S
Sbjct: 1    MASPAWLPVEVQSSASQNPVTGLPAGGPSGGPPTPTGAIAPASVATIRTSEGASGTASNS 60

Query: 3028 MHESTQAKVVNAPGFVVPASSFSYSVLPNAXXXXXXXXXXSPTAVIKSNPPVSTVVXXXX 2849
            + ES Q K VNAP  V+P  SFSYS +P+              +VI SNP  STVV    
Sbjct: 61   IQESAQGKFVNAPPHVLPGPSFSYSGIPHVTTASGTSQQLPSGSVISSNPLASTVVFQTP 120

Query: 2848 XXXXXXXXXXXSY--ISHTSAGLPDGQQFQS----------------------------- 2762
                           I+H  AG P  Q FQS                             
Sbjct: 121  VPGPSSSSGPSFSYNIAHKGAGFPGSQPFQSSTSIASGPRGPTPNAASFSFNGNPQLVQK 180

Query: 2761 ----KSNTSAADVNILSSALSIXXXXXXXXXXXXXXXXXXXXXXXLVSETPWMRAGQAFP 2594
                KS+ S A      S  S                        +   T WM +  +FP
Sbjct: 181  DQTLKSDNSGAVAQEAGSMSSASHVSQSVPFPCSSSTMSVSSSPKMGPTTLWMPSNPSFP 240

Query: 2593 VPARISGTLGMPAPPGILQSVSSSSNPIVPYXXXXXXXXXXVRPVMPTAPILSNSAVQQQ 2414
            VP+ +  T G P PPGI  S   SSN  VP            R + P AP+ SN A+QQQ
Sbjct: 241  VPSGMPVTPGTPGPPGIAPSTPLSSNLAVPSASMDFSSSVVSRAIFPAAPVSSNPAIQQQ 300

Query: 2413 TYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYPSVVPGPFPFPAHGLPLPSVPLPD 2234
             Y +Y +LPA     Q  WL+PPQMGGLPR PF+ YP+V P PFP PAHG+PLPSVPLPD
Sbjct: 301  IYPSYSSLPATNASSQGPWLQPPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPD 360

Query: 2233 SHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSPPGIGNNKHVNGAGIKDGFAVNE 2054
            S PPGVTPVG++G    SA  S H L  +SGM  EL PPGI +NKHVNGAG KDG AVNE
Sbjct: 361  SQPPGVTPVGTAGGTPISAAVSGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDGAAVNE 420

Query: 2053 QSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVTIEPIPVSWEKLAGTDWALV 1874
            Q D+WT HKTDTGVVYYYNALTG STYEKPS FK E+DKVT++P PVSWEKL GTDWALV
Sbjct: 421  QVDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALV 480

Query: 1873 TTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEILNEHMMLVPDTNVLPEKGSAPISL 1694
            TTNDG KYYYNTKT LSSWQIPTE+ E++KKQ    L EH ML P+TNV  EKG +PI+L
Sbjct: 481  TTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIAL 540

Query: 1693 SASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDSGAPDTSSAVPATSGLAVSELNGV 1514
            SA A+ TGGRDATP R+SA PGS+SALD++KKKLQDSGAP TSS V  +SG   SELNG 
Sbjct: 541  SAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPATSSPV-HSSGPIASELNGS 599

Query: 1513 RAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXDADGGPTKEQCIIQFKEMLKDRGV 1334
            R +E TV+ LQSENSKDKL D   DGN+        D D GPTKE+CIIQFKEMLK+RGV
Sbjct: 600  RVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGV 659

Query: 1333 APFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVKTXXXXXXXXXXXXXXXXXEIFKK 1154
            APFSKWEKELPKIVFD RFKAIP +SARRSLF+HYV+T                 E FK+
Sbjct: 660  APFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQ 719

Query: 1153 LLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRELLLNERVLPLXXXXXXXXXXXXX 974
            LLEEASEDIDH T+YQTFRKKWG+DPRFE LDRKDRELLLNERVLPL             
Sbjct: 720  LLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRA 779

Query: 973  XXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKSVKHDDREVLFNEYISELKTAEVE 794
               S+FKSMLRDKGDIT +TRWSRVKDSLRND RYK VKH+DRE+LFNEYISELK AE E
Sbjct: 780  AAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEE 839

Query: 793  AVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXXXXKEAVASYQALLVEAIRDP 614
              RE                           EME        KEAV+SYQALLVE I+DP
Sbjct: 840  VEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDP 899

Query: 613  QASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLLHERCVQDFRDLLAEVL---S 443
            Q SWTESKPKLEKDPQ RATN DLD SD+EKLFREH+K+LHER   +FR LL+EVL   +
Sbjct: 900  QVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEA 959

Query: 442  GGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLWRRYGEEMQRRQKLADD---- 275
              Q++EDGKTVL SWSTAKRLL+ D RY KMPRK+RE +WRRY EEM R+QKLA D    
Sbjct: 960  ATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEE 1019

Query: 274  --TQAKTRNSFDSGRLTLPSKRSYEQR 200
              T+ K R+S DSGR    S+R++E+R
Sbjct: 1020 KHTEVKGRSSVDSGRFPSGSRRAHERR 1046


>ref|XP_007221939.1| hypothetical protein PRUPE_ppa001490mg [Prunus persica]
            gi|462418875|gb|EMJ23138.1| hypothetical protein
            PRUPE_ppa001490mg [Prunus persica]
          Length = 814

 Score =  898 bits (2321), Expect = 0.0
 Identities = 480/818 (58%), Positives = 567/818 (69%), Gaps = 9/818 (1%)
 Frame = -1

Query: 2626 TPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXXXXXXXXXXVRPVMPTA 2447
            T W+  G +F + + + GT G P PPGI   V  S NP  P           +RP M  A
Sbjct: 13   TSWVPTGPSFNLTSGMPGTPGTPGPPGIAHPVQISFNPTAP-SAPIDSSSVALRPSMQIA 71

Query: 2446 PILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYPSVVPGPFPFPAH 2267
            P+ S SAVQ Q    Y +L +M  PPQ +WL+ PQ+GG PR PFL YP+  PGPFP PAH
Sbjct: 72   PVAS-SAVQPQVGAPYLSLSSMGAPPQGVWLQSPQIGGFPRPPFLPYPAAFPGPFPLPAH 130

Query: 2266 GLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSPPGIGNNKHVNG 2087
             +PLPSVPLPDS PPGV PVG++ AI + +  S H L  SSG+  EL  PGIGN    + 
Sbjct: 131  VMPLPSVPLPDSQPPGVIPVGNTAAISSPSAASGHQLAGSSGIQIELPHPGIGNENRAS- 189

Query: 2086 AGIKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVTIEPIPVSW 1907
                    VNEQ D+WT HKT+TGVVYYYNALTG STY+KP GFKEE DKV+++P PVS 
Sbjct: 190  --------VNEQLDAWTAHKTETGVVYYYNALTGESTYDKPPGFKEEPDKVSMQPTPVST 241

Query: 1906 EKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEILNEHMMLVPDTNV 1727
              L+GTDW LVTT+DG K+Y+N KT +SSWQIP EV+EL+KKQ  ++  EH + +P  NV
Sbjct: 242  VNLSGTDWVLVTTSDGKKFYHNGKTKVSSWQIPNEVIELRKKQDADVPKEHPVSIPINNV 301

Query: 1726 LPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDSGAPDTSSAVPAT 1547
            + EKGSAPISL+A AI TGGR+A  F+ SA  G+SSALDL+KKKLQDSGAP TSS VPA 
Sbjct: 302  MTEKGSAPISLTAPAINTGGREAMAFKPSAVQGTSSALDLIKKKLQDSGAPVTSSPVPAP 361

Query: 1546 SGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXDADGGPTKEQCII 1367
                 SE NG R VE+T +  QS+NSKDKL D   DGN+        DAD GPTKE+CI 
Sbjct: 362  -----SESNGSRGVESTPKGQQSDNSKDKLKDINGDGNLSDSSSDSEDADSGPTKEECIT 416

Query: 1366 QFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVKTXXXXXXXXXXX 1187
            QFKEMLK+RGVAPFSKWEKELPKIVFD RFKAIPSHSARRSLF+HYVKT           
Sbjct: 417  QFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRA 476

Query: 1186 XXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRELLLNERVLPLXX 1007
                  E FK+LL+EASEDIDH TDYQ+FRKKW NDPRFE LDRKDRE LLNERVLPL  
Sbjct: 477  AQKAAIEGFKQLLDEASEDIDHKTDYQSFRKKWANDPRFEALDRKDREHLLNERVLPLKR 536

Query: 1006 XXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKSVKHDDREVLFNE 827
                         A++FKSML++KGDIT ++RWSRVKDSLRND RYKS++H+DRE+LFN+
Sbjct: 537  AAEEKAQAVRAAAATSFKSMLQEKGDITVSSRWSRVKDSLRNDPRYKSLRHEDREILFNQ 596

Query: 826  YISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXXXXKEAVASY 647
            YIS+LK  E EA RE                           E E        KEAVA++
Sbjct: 597  YISDLKAVEEEAEREAKAKRDEQEKLRERERELRKRKEREEQETERVRLKVRRKEAVATF 656

Query: 646  QALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLLHERCVQDFR 467
            QALLVE I+DPQASWT SKPKLEKDPQ RA NPDL+ SD+EKLFREH+K L+ERC  +FR
Sbjct: 657  QALLVETIKDPQASWTGSKPKLEKDPQRRAANPDLEPSDMEKLFREHIKRLNERCAHEFR 716

Query: 466  DLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLWRRYGEEMQR 296
             LLAEVL   +  Q++EDGKTVLNSWSTAKRLLKPDPRY+KM RKERE LWRR+ EEM R
Sbjct: 717  ALLAEVLTAEAASQETEDGKTVLNSWSTAKRLLKPDPRYNKMARKEREVLWRRFSEEMLR 776

Query: 295  RQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            +QK A D      T AK+R+S DSGR+   S+ ++++R
Sbjct: 777  KQKSALDHKEDRKTDAKSRSSVDSGRVPFGSRGTHDRR 814


>ref|XP_002315059.2| hypothetical protein POPTR_0010s17750g [Populus trichocarpa]
            gi|550330031|gb|EEF01230.2| hypothetical protein
            POPTR_0010s17750g [Populus trichocarpa]
          Length = 963

 Score =  889 bits (2296), Expect = 0.0
 Identities = 508/947 (53%), Positives = 594/947 (62%), Gaps = 21/947 (2%)
 Frame = -1

Query: 2977 PASSFSYSVLPNAXXXXXXXXXXSPTAVIKSNPPVSTVVXXXXXXXXXXXXXXXSYISHT 2798
            PA +F+Y+V PN              A + SNPP   V                  I  T
Sbjct: 45   PAPTFTYNVTPNMSSG----------AALNSNPPGQPVPVPGPASSVGLSFSYK--IPQT 92

Query: 2797 SAGLPDGQQFQSKSNTSAADVNILSSALSIXXXXXXXXXXXXXXXXXXXXXXXL-----V 2633
              G P  QQ QS  + S A      SA S+                              
Sbjct: 93   GPGFPGNQQLQSSVDKSPAIAQ--GSAPSVAPIASQSASFPLHSPSSSYTSLSSNLGPTP 150

Query: 2632 SETPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXXXXXXXXXXVRPVMP 2453
            S+TP   A  +F +P  +  T G  AP G++ S   +     P            RP+MP
Sbjct: 151  SQTP---ATASFYLPPGLPRTPGTLAPQGLVPSAPMTQ----PSVAADSLPLGVQRPIMP 203

Query: 2452 TAPILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYPSVVPGPFPFP 2273
            T P  S++AVQQQTY TYP+LP MA  PQ+LW+ PP +GG+PR PFLSYP+  PG FP P
Sbjct: 204  TMP--SSNAVQQQTYPTYPSLPVMAASPQALWMHPPPIGGMPRQPFLSYPAAFPGSFPPP 261

Query: 2272 AHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSPPGIGNNKHV 2093
             HG+P PSV LPDS PPGV PVG S AI  S+  S H L  + GM  EL PPGI N+ H+
Sbjct: 262  GHGMPYPSVSLPDSQPPGVVPVGHSYAIPMSSSASVHQLPGAPGMQTELPPPGIDNHNHL 321

Query: 2092 NGAGIKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVTIEPIPV 1913
            + +GI+D  AV+E S +WT HKTDTGV YYYNA+TG STYEKP GFKE  +KV ++P PV
Sbjct: 322  HHSGIRDNAAVSEPSHAWTAHKTDTGVFYYYNAVTGVSTYEKPPGFKEP-EKVPVQPTPV 380

Query: 1912 SWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEILNEHMMLVPDT 1733
            S E LAGTDW L+TTND  KYYYN KT LSSWQIP+EV EL+K Q  E+   + M V   
Sbjct: 381  SMENLAGTDWVLITTNDSKKYYYNNKTKLSSWQIPSEVTELRKNQEAEVSKGNAMSVSQV 440

Query: 1732 NVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDSGAPDTSSAVP 1553
            N L EKGSAPISLSA A  TGGRDAT  R  + PG+SSALDL+KKKLQ+ GAP  S+AV 
Sbjct: 441  NALTEKGSAPISLSAPAANTGGRDATALRVLSVPGTSSALDLIKKKLQEFGAPAISAAVS 500

Query: 1552 ATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXDADGGPTKEQC 1373
             +SG A SE NG R VEA  + L SE SKDKL DA  DGNI        D D GP+KE+C
Sbjct: 501  VSSGAAASESNGSRVVEAAAKGLPSEISKDKLKDANGDGNISDSSTDSEDEDDGPSKEEC 560

Query: 1372 IIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVKTXXXXXXXXX 1193
            IIQFKEMLK+RGVAPFSKWEKELP        +AIPSHSARRSLF+HYVKT         
Sbjct: 561  IIQFKEMLKERGVAPFSKWEKELPN---SDLLQAIPSHSARRSLFEHYVKTRAEEKRKEK 617

Query: 1192 XXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRELLLNERVLPL 1013
                    E FK+LLEEASEDIDHNTDYQTFRKKWGNDPRFE LDRKDRE LLNER+  L
Sbjct: 618  RAAQKAAVEGFKQLLEEASEDIDHNTDYQTFRKKWGNDPRFEALDRKDREHLLNERIHLL 677

Query: 1012 XXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKSVKHDDREVLF 833
                           A++FKSMLRDKGDIT ++RWSRVKDSLRND RYKSVKH+DREV F
Sbjct: 678  KKAAQEKAQAERAYAAASFKSMLRDKGDITVSSRWSRVKDSLRNDPRYKSVKHEDREVFF 737

Query: 832  NEYISELKTAEVEAVRETN-------VXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXX 674
            NEY+ ELK AE EA R+         +                        EME      
Sbjct: 738  NEYLYELKAAE-EAERDARGKTEEQLLSSSVQDKLKERERELRKRKEREEQEMERVRVKV 796

Query: 673  XXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLL 494
              KEAVAS+QALLVE ++DPQASWTESKPKL+KDPQ RAT+PDLD SD EKLFREH+K+L
Sbjct: 797  RRKEAVASFQALLVETLKDPQASWTESKPKLDKDPQRRATHPDLDPSDTEKLFREHMKML 856

Query: 493  HERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLW 323
            HERC  DF+ LLAEV+   +  QK++DGKTVL+SWSTAKRL+KPDPRY+KMPRKERE LW
Sbjct: 857  HERCTNDFKALLAEVITAETAAQKTDDGKTVLDSWSTAKRLIKPDPRYNKMPRKERETLW 916

Query: 322  RRYGEEMQRRQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            RRY EEM R+QK   D      T +K R++ DSGR    S+R+ ++R
Sbjct: 917  RRYAEEMLRKQKFEPDPKEDKHTDSKNRSANDSGRYHSGSRRTNDRR 963


>ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citrus clementina]
            gi|557539684|gb|ESR50728.1| hypothetical protein
            CICLE_v10030612mg [Citrus clementina]
          Length = 1015

 Score =  882 bits (2279), Expect = 0.0
 Identities = 512/1022 (50%), Positives = 624/1022 (61%), Gaps = 17/1022 (1%)
 Frame = -1

Query: 3214 SDNIMASPAWLPQEVQSSTSQAPVSGLXXXXXXXXXXXXXXXXAQIAGXXXXXXXXXXXX 3035
            SD IM SPAWLP EVQ  T+ AP+SG                   +              
Sbjct: 35   SDQIMTSPAWLPPEVQQLTANAPISG-------------KPVGGSLVASSTPTPTSNGSD 81

Query: 3034 XSMHES----TQAKVVNAPGFVVPASSFSYSVLPNAXXXXXXXXXXSPTAVIKSNPPVST 2867
             + ++S    +QAK V A G V+P SSFS+     +            ++VI SNP V  
Sbjct: 82   TATNDSISGPSQAKSVTATGGVIPQSSFSFQNSEGSGHSA--------SSVINSNPSVPP 133

Query: 2866 VVXXXXXXXXXXXXXXXSYISHTSAGLPDGQQFQSKSNTSAA--DVNILSSALSIXXXXX 2693
             V                  S T  G    QQFQ   N   A  D  + SS  +      
Sbjct: 134  GVSSFTYSA-----------SQTVVGYSPNQQFQPNMNKLEAVEDAGLGSSTSTNSQPVQ 182

Query: 2692 XXXXXXXXXXXXXXXXXXLVSETPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNP 2513
                              L + T WM    +F  P  +  T    APPG+L   +  ++ 
Sbjct: 183  ASVRTFSDSTVATSSATALSTTTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSS 242

Query: 2512 IVPYXXXXXXXXXXVRPVMPT--APILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQM 2339
                          +RP +PT  AP  S SA+Q Q Y T+P+LP + + PQ   L+PPQM
Sbjct: 243  AF----GDFYSSAGLRPSVPTPSAPSNSGSAIQHQIYPTHPSLPPVGVSPQRPLLQPPQM 298

Query: 2338 GGLPRLPFLSYPSVVPGPFPFPAHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHP 2159
            G  P LPFL YP+  P PFP PAHG+P PSV   D+ PPG++ + ++ A   SA+   H 
Sbjct: 299  GVRPWLPFLPYPAAYPSPFPLPAHGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPG-HQ 357

Query: 2158 LLCSSGMHPELSPPGIGNNKHVNGAGIKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGAS 1979
            L+ +SG + E  P G    +HV+    + G +VNEQ D+WT HKTDTG+VYYYNA+TG S
Sbjct: 358  LVGTSG-NTEAPPSGTDKKEHVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGES 416

Query: 1978 TYEKPSGFKEESDKVTIEPIPVSWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEV 1799
            TYEKP+GFK E DKV ++P P+S E L GTDWALVTTNDG KYYYN+K  +SSWQIP+EV
Sbjct: 417  TYEKPAGFKGEPDKVPVQPTPISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEV 476

Query: 1798 MELKKKQGGEILNEHMMLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSS 1619
             ELKKK+  + L E    VP+TN++ EKGS  ISLS+ A+ TGGRDAT  R+S+ PGSSS
Sbjct: 477  TELKKKEDDDTLKEQS--VPNTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSS 534

Query: 1618 ALDLVKKKLQDSGAPDTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRD 1439
            ALDL+KKKLQDSG P T+S  P +S  A SE NG +AVE TV+ LQ+EN+KDKL D   D
Sbjct: 535  ALDLIKKKLQDSGTP-TASPAPVSSAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGD 593

Query: 1438 GNIXXXXXXXXDADGGPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSH 1259
            G +        D + GPTKE+CII+FKEMLK+RGVAPFSKWEKELPKIVFD RFKAI S 
Sbjct: 594  GTMSDSSSDSEDGETGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQ 653

Query: 1258 SARRSLFDHYVKTXXXXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGND 1079
            SARR+LF+ YVKT                 E FK+LLEE SEDIDH+TDYQTF+KKWG+D
Sbjct: 654  SARRALFERYVKTRAEEERKEKRAAQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSD 713

Query: 1078 PRFEVLDRKDRELLLNERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRV 899
            PRFE LDRKDRELLLNERVLPL               AS+FKSMLR+KGDIT ++RWS+V
Sbjct: 714  PRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKV 773

Query: 898  KDSLRNDSRYKSVKHDDREVLFNEYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXX 719
            KD LR+D RYKSV+H+DREV+FNEY+ ELK AE EA RE                     
Sbjct: 774  KDILRDDPRYKSVRHEDREVIFNEYVRELKAAEEEAEREAKARREEQEKLKEREREMRKR 833

Query: 718  XXXXXXEMEWXXXXXXXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLD 539
                  EME        KEAV S+QALLVE I+DPQASWTES+PKLEKDPQGRATN DLD
Sbjct: 834  KEREEQEMERVRLKVRRKEAVTSFQALLVETIKDPQASWTESRPKLEKDPQGRATNADLD 893

Query: 538  LSDIEKLFREHVKLLHERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPD 368
             SD EKLFREH+K L+ERC  DFR LLAEV+   +  Q++EDGKTVLNSWSTAKR+LKPD
Sbjct: 894  SSDREKLFREHIKTLYERCAHDFRGLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPD 953

Query: 367  PRYHKMPRKEREPLWRRYGEEMQRRQKLADD------TQAKTRNSFDSGRLTLPSKRSYE 206
            PRY KMPRKERE LWRR+ EE+QR+ K + D        +K+R+S D GR    S+R+ E
Sbjct: 954  PRYSKMPRKEREALWRRHAEEIQRKHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQE 1013

Query: 205  QR 200
            +R
Sbjct: 1014 RR 1015


>ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C-like [Citrus sinensis]
          Length = 978

 Score =  879 bits (2270), Expect = 0.0
 Identities = 510/1018 (50%), Positives = 622/1018 (61%), Gaps = 17/1018 (1%)
 Frame = -1

Query: 3202 MASPAWLPQEVQSSTSQAPVSGLXXXXXXXXXXXXXXXXAQIAGXXXXXXXXXXXXXSMH 3023
            M SPAWLP EVQ  T+ AP+SG                   +A              + +
Sbjct: 1    MTSPAWLPPEVQQLTANAPISGKPVGGSL------------VASSTPIAPTSNGSDTATN 48

Query: 3022 ES----TQAKVVNAPGFVVPASSFSYSVLPNAXXXXXXXXXXSPTAVIKSNPPVSTVVXX 2855
            +S    +QAK V A G V+P SSFS+     +            ++VI SNP V   V  
Sbjct: 49   DSISGPSQAKSVTATGGVIPQSSFSFQNSEGSGHSA--------SSVINSNPSVPPGVSS 100

Query: 2854 XXXXXXXXXXXXXSYISHTSAGLPDGQQFQSKSNTSAA--DVNILSSALSIXXXXXXXXX 2681
                            S T  G    QQFQ   N   A  D  + SS  +          
Sbjct: 101  FTYSA-----------SQTVVGYSPNQQFQPNMNKLEAVEDAGLGSSTSTNSQPVQASVR 149

Query: 2680 XXXXXXXXXXXXXXLVSETPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPY 2501
                          L + T WM    +F  P  +  T    APPG+L   +  ++     
Sbjct: 150  TFSDSTVATSSATALSTTTSWMPTIPSFSTPPGLFVTPQTQAPPGLLTLRTKDTSSAF-- 207

Query: 2500 XXXXXXXXXXVRPVMPT--APILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLP 2327
                      +RP +PT  AP  S SA+Q Q Y TYP+LP + + PQ   L+PPQMG  P
Sbjct: 208  --GDFYSSAGLRPSVPTPSAPSNSGSAIQHQIYPTYPSLPPIGVSPQGPLLQPPQMGVRP 265

Query: 2326 RLPFLSYPSVVPGPFPFPAHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCS 2147
             LPFL YP+  P PFP PAHG+P PSV   D+ PPG++ + ++ A   SA+   H L+ +
Sbjct: 266  WLPFLPYPAAYPSPFPLPAHGMPNPSVSQIDAQPPGLSSMRTAAATSHSAIPG-HQLVGT 324

Query: 2146 SGMHPELSPPGIGNNKHVNGAGIKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEK 1967
            SG + E  P G    +HV+    + G +VNEQ D+WT HKTDTG+VYYYNA+TG STYEK
Sbjct: 325  SG-NTEAPPSGTDKKEHVHDVSSRIGASVNEQLDAWTAHKTDTGIVYYYNAVTGESTYEK 383

Query: 1966 PSGFKEESDKVTIEPIPVSWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELK 1787
            P+GFK E DKV ++P P+S E L GTDWALVTTNDG KYYYN+K  +SSWQIP+EV ELK
Sbjct: 384  PAGFKGEPDKVPVQPTPISMEHLTGTDWALVTTNDGKKYYYNSKMKVSSWQIPSEVTELK 443

Query: 1786 KKQGGEILNEHMMLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDL 1607
            KK+  + L E    VP+TN++ EKGS  ISLS+ A+ TGGRDAT  R+S+ PGSSSALDL
Sbjct: 444  KKEDDDTLKEQS--VPNTNIVIEKGSNAISLSSPAVNTGGRDATALRTSSMPGSSSALDL 501

Query: 1606 VKKKLQDSGAPDTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIX 1427
            +KKKLQDSG P T+S  P +S  A SE NG +AVE TV+ LQ+EN+KDKL D   DG + 
Sbjct: 502  IKKKLQDSGTP-TASPAPVSSAAATSESNGSKAVEVTVKGLQNENTKDKLKDINGDGTMS 560

Query: 1426 XXXXXXXDADGGPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARR 1247
                   D + GPTKE+CII+FKEMLK+RGVAPFSKWEKELPKIVFD RFKAI S SARR
Sbjct: 561  DSSSDSEDGETGPTKEECIIKFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIQSQSARR 620

Query: 1246 SLFDHYVKTXXXXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFE 1067
            +LF+ YVKT                 E FK+LLEE SEDIDH+TDYQTF+KKWG+DPRFE
Sbjct: 621  ALFERYVKTRAEEERKEKRAAQKAAIEGFKQLLEEVSEDIDHSTDYQTFKKKWGSDPRFE 680

Query: 1066 VLDRKDRELLLNERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSL 887
             LDRKDRELLLNERVLPL               AS+FKSMLR+KGDIT ++RWS+VKD L
Sbjct: 681  ALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAASSFKSMLREKGDITLSSRWSKVKDIL 740

Query: 886  RNDSRYKSVKHDDREVLFNEYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXX 707
            R+D RYKSV+H+DREV+FNEY+ ELK AE EA RE                         
Sbjct: 741  RDDPRYKSVRHEDREVIFNEYVRELKAAEEEAEREAKARREEQEKLKEREREMRKRKERE 800

Query: 706  XXEMEWXXXXXXXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDI 527
              EME        KEAV S+QALLVE I+DPQASWTES+PKLEKDPQGRATN DLD SD 
Sbjct: 801  EQEMERVRLKVRRKEAVTSFQALLVETIKDPQASWTESRPKLEKDPQGRATNADLDSSDR 860

Query: 526  EKLFREHVKLLHERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYH 356
            EKLFREH+K L+ERC  DFR LLAEV+   +  Q++EDGKTVLNSWSTAKR+LKP+PRY 
Sbjct: 861  EKLFREHIKTLYERCAHDFRGLLAEVITAEAAAQETEDGKTVLNSWSTAKRVLKPEPRYS 920

Query: 355  KMPRKEREPLWRRYGEEMQRRQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            KMPRKERE LWRR+ EE+QR+ K + D        +K+R+S D GR    S+R+ E+R
Sbjct: 921  KMPRKEREALWRRHAEEIQRKHKSSLDQNEDNHKDSKSRSSTDGGRPPSSSRRNQERR 978


>ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma cacao]
            gi|508709257|gb|EOY01154.1| Pre-mRNA-processing protein
            40C [Theobroma cacao]
          Length = 816

 Score =  875 bits (2261), Expect = 0.0
 Identities = 476/827 (57%), Positives = 562/827 (67%), Gaps = 18/827 (2%)
 Frame = -1

Query: 2626 TPWMRAGQAFPVPARISGTLGMPAPPGILQSVS--------SSSNPIVPYXXXXXXXXXX 2471
            T WM   Q+FP+    SGT G    PG++ SV          S +  VP           
Sbjct: 13   TSWMPTTQSFPMSTESSGTSGTAGHPGLVPSVQMITASAAVDSPSSAVP----------- 61

Query: 2470 VRPVMPTAPILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYPSVVP 2291
                 P+AP+ SN AVQQQ Y TY  LP+MA  PQ  W++ P MGG PR PF+ YP++ P
Sbjct: 62   ----RPSAPVSSNQAVQQQIYPTYTPLPSMASSPQGFWMQHPPMGGFPRPPFVPYPTIYP 117

Query: 2290 GPFPFPAHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSPPGI 2111
            GPFP  + G+P P+ P  DS PPGV+P+ +S    + A+  A+    +SG+     P GI
Sbjct: 118  GPFPSASSGMPHPA-PSSDSQPPGVSPLATSPFAPSIAIP-ANQSSVASGIQTGFPPQGI 175

Query: 2110 GNNKHVNGAGIKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVT 1931
             N       G +   AVNEQSD WT HKTDTG+VYYYNALTG STYEKP+GFK E DKV 
Sbjct: 176  DNRN----VGTRVEAAVNEQSDIWTAHKTDTGIVYYYNALTGESTYEKPAGFKGEPDKVP 231

Query: 1930 IEPIPVSWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEILNEHM 1751
            ++P PVS E+LAGT+WALVTT+DG KYYYN+KT +SSWQIP+EV EL+KKQ  ++  EH 
Sbjct: 232  VQPTPVSVEQLAGTEWALVTTSDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDVSKEHA 291

Query: 1750 MLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDSGAP- 1574
            + VP+ +V+ EKGS PISLSA A+ TGGRDA P R+S  PGSSSALDL+KKKLQDSG P 
Sbjct: 292  VPVPNIDVVAEKGSTPISLSAPAVSTGGRDAMPLRTSVVPGSSSALDLIKKKLQDSGVPS 351

Query: 1573 DTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXDADG 1394
             +SS+VP     A  ELNG RAV+  V+ LQSENSKDKL DA  DGNI        D D 
Sbjct: 352  SSSSSVPVMPVTAAQELNGSRAVD--VKGLQSENSKDKLKDANGDGNISDSSSDSEDTDS 409

Query: 1393 GPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVKTXX 1214
            GP+KE+CI+QFKEMLK+RGVAPFSKWEKELPKIVFD RFKAIPSHSARR+LF+HYVKT  
Sbjct: 410  GPSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRTLFEHYVKTRA 469

Query: 1213 XXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRELLL 1034
                           E FK+LL+EASEDIDHNT+YQTF++KWG+D RFE LDRKDRELLL
Sbjct: 470  EEERREKRAALKAAIEGFKQLLDEASEDIDHNTNYQTFKRKWGSDLRFEALDRKDRELLL 529

Query: 1033 NERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKSVKH 854
             ERVLPL               AS+ KSML++KGDIT  +RWSRVKDS+R+D RYK VKH
Sbjct: 530  TERVLPLKRAAEEKAQAIRAAAASSLKSMLKEKGDITVNSRWSRVKDSIRDDPRYKCVKH 589

Query: 853  DDREVLFNEYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXX 674
            +DREVLFNEYISELK  E +A R+  V                        EME      
Sbjct: 590  EDREVLFNEYISELKAVEEKAERKERVKKEEEEKLKERERELRKRKEREEQEMERVRLKV 649

Query: 673  XXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLL 494
              KEAVAS+QALLVE I+DPQASWTESKPKLEKDPQGRA NPDLD SD EKLFREH+K+L
Sbjct: 650  RRKEAVASFQALLVETIKDPQASWTESKPKLEKDPQGRAANPDLDPSDTEKLFREHIKML 709

Query: 493  HERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLW 323
             ERC  DFR LLAEV+   +  Q++E GKTV NSWSTAKRLLKPDPRY KMPRKERE LW
Sbjct: 710  FERCTHDFRALLAEVITQDAAAQETEGGKTVFNSWSTAKRLLKPDPRYSKMPRKEREALW 769

Query: 322  RRYGEEMQRRQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            RRY E+M R+QK A D      T AK R+S D GR +  S++ +E+R
Sbjct: 770  RRYAEDMLRKQKSALDQEEEKRTDAKVRSSGDLGRFSSGSRKVHERR 816


>ref|XP_002515795.1| Pre-mRNA-processing protein PRP40, putative [Ricinus communis]
            gi|223545064|gb|EEF46576.1| Pre-mRNA-processing protein
            PRP40, putative [Ricinus communis]
          Length = 886

 Score =  874 bits (2257), Expect = 0.0
 Identities = 471/813 (57%), Positives = 556/813 (68%), Gaps = 12/813 (1%)
 Frame = -1

Query: 2602 AFPVPARISGTLGMPAPPGILQSVSSSSNP-IVPYXXXXXXXXXXVRPVMPTAPILSNSA 2426
            +F VP  ++GT   P P G     S S  P I+P            RPVMPT    SN  
Sbjct: 83   SFLVPPGLAGT---PGPAG-----SVSCGPMILPPVTVDSATSSVQRPVMPTVTHASNPV 134

Query: 2425 VQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYP-SVVPGPFPFPAHGLPLPS 2249
            VQQQ+YHTYP+LPAMA   Q LW  PPQMGG+PR PFL YP +V PG +P PAHG+  PS
Sbjct: 135  VQQQSYHTYPSLPAMAASAQGLWFHPPQMGGMPRTPFLPYPPAVFPGSYPLPAHGISRPS 194

Query: 2248 VPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSPPGIGNNKHVNGAGIKDG 2069
            +  PD  P G  PVG  GA   S+  S H L+ + GM  E+ PPGI N   ++  G K+ 
Sbjct: 195  ISSPDFQPSGAPPVGIPGANPPSSAASGHQLMGTPGMQKEIPPPGIDNRSQIHDFGTKNN 254

Query: 2068 FAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVTIEPIPVSWEKLAGT 1889
             A ++  D+WT HKTD GVVYYYNA+TG STYEKP GFK E +KV ++P PVS E LAGT
Sbjct: 255  AATSDSLDAWTAHKTDAGVVYYYNAVTGVSTYEKPPGFKSEPEKVPMQPTPVSMENLAGT 314

Query: 1888 DWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEILNEHMMLVPDTNVLPEKGS 1709
            DWAL+TTNDG  YYYN KT LSSWQIP+EV ELKKKQ  E L E  M V  ++VL EKGS
Sbjct: 315  DWALITTNDGKNYYYNNKTKLSSWQIPSEVTELKKKQEAE-LKEQEMSVSSSSVLNEKGS 373

Query: 1708 APISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDSGAPDTSSAVPATSGLAVS 1529
              ISLSA AI TGGRDAT  R+S A G+SSALDL+KKKLQDSG P TSS  P + G+   
Sbjct: 374  VQISLSAPAINTGGRDATALRASNALGASSALDLIKKKLQDSGTPVTSSPAPVSLGITTP 433

Query: 1528 ELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXDADGGPTKEQCIIQFKEML 1349
            E NG RA+EAT + L SENSK+KL DA  D N         + D GPTKE+CIIQFK+ML
Sbjct: 434  ESNGSRAMEATSKGLPSENSKEKLKDANGDANASDSSSDSEEEDNGPTKEECIIQFKDML 493

Query: 1348 KDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVKTXXXXXXXXXXXXXXXXX 1169
            K+RG+APFSKWEK LPKIVFD RF+AIPSHSARRSLF+HYVKT                 
Sbjct: 494  KERGIAPFSKWEKVLPKIVFDPRFQAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAI 553

Query: 1168 EIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRELLLNERVLPLXXXXXXXX 989
            E F++LLEEASE+IDHNTDYQ+FR+KWGNDPRFE +DRKDRE LL+ERVLPL        
Sbjct: 554  EGFRQLLEEASEEIDHNTDYQSFRRKWGNDPRFEAVDRKDREHLLHERVLPLKKAAQEKA 613

Query: 988  XXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKSVKHDDREVLFNEYISELK 809
                   A++FKSML+DKGD+T  +RWS+VK+SLRND RYKSVKH++REVLFNEY+SELK
Sbjct: 614  QAERAAAAASFKSMLQDKGDLTVNSRWSKVKESLRNDPRYKSVKHEEREVLFNEYLSELK 673

Query: 808  TAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXXXXKEAVASYQALLVE 629
             AE EA  +  V                        EME        KEAVAS+QALLVE
Sbjct: 674  AAEEEAEWKAKVKREEQEKLKERERELRKRKEREEQEMERVREKVRRKEAVASFQALLVE 733

Query: 628  AIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLLHERCVQDFRDLLAEV 449
             I+DPQASWTESK +LEKDPQGR TNP+LD SD EKLFREHVK+LHERC  +F+ LLAEV
Sbjct: 734  TIKDPQASWTESKTRLEKDPQGRGTNPNLDPSDTEKLFREHVKMLHERCTNEFKALLAEV 793

Query: 448  L---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLWRRYGEEMQRRQKLAD 278
            +   +  QK+EDGKTVL+SW+TAKR+LK DPRY+KMPRKERE LWRR+ E+M R+QK   
Sbjct: 794  INAEAASQKTEDGKTVLDSWTTAKRVLKLDPRYNKMPRKEREVLWRRHAEDMLRKQKTTL 853

Query: 277  D------TQAKTRNS-FDSGRLTLPSKRSYEQR 200
            D      T  + R+S  DSGR    SKR++++R
Sbjct: 854  DEKEDKHTDPRGRSSTTDSGRHLSGSKRTHDRR 886


>ref|XP_006590812.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine
            max]
          Length = 980

 Score =  860 bits (2223), Expect = 0.0
 Identities = 496/1017 (48%), Positives = 612/1017 (60%), Gaps = 16/1017 (1%)
 Frame = -1

Query: 3202 MASPAWLPQEVQSSTSQAPVSGLXXXXXXXXXXXXXXXXAQIAGXXXXXXXXXXXXXSMH 3023
            MASPAWLPQE     SQ PVSG                                   +  
Sbjct: 1    MASPAWLPQE-----SQPPVSG------------------------ETPLPMASSAHTTP 31

Query: 3022 ESTQAKVVNAPGFVVPASS--FSYSVLPNAXXXXXXXXXXSPTAVIKSNPPVSTVVXXXX 2849
             +  AK+ NA  FV PA +  F+Y +L N           +   + KSN  V+ +V    
Sbjct: 32   STAHAKLFNATAFVAPAPAPPFAYGMLQNVNASGSSQQSSTHPGM-KSNSAVNPMVVQPP 90

Query: 2848 XXXXXXXXXXXSYISHTSAGLPDGQQF-QSKSN---TSAADVNILSSALSIXXXXXXXXX 2681
                         I  + A     QQ  QS +N   + A DV  LSSA SI         
Sbjct: 91   GVSLHAAPSFSYNIPQSGAIFSSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTS 150

Query: 2680 XXXXXXXXXXXXXXLVSETPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPY 2501
                            S   WM    +FPV   +  T G P PPG+  S   SSNP  P 
Sbjct: 151  TSIMPPPSDPNYRPATS---WMPTAMSFPV-LPVMPTQGNPGPPGLASSAIISSNPAAPS 206

Query: 2500 XXXXXXXXXXVRPVMPTAPILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRL 2321
                      +RP MPT+ I S+    Q+    YP++PAMA PPQ LWL+PPQM G+ R 
Sbjct: 207  TGTDSSPAALLRPNMPTSAIASDPTAPQKGL-PYPSVPAMAAPPQGLWLQPPQMSGVLRP 265

Query: 2320 PFLSYPSVVPGPFPFPAHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSG 2141
            P+L YP+  PGPFPFPA G+ LP+VP+PDS PPGVTPVG++G   TS   S+H L  ++ 
Sbjct: 266  PYLQYPAPFPGPFPFPARGVALPAVPIPDSQPPGVTPVGAAGG--TSTPSSSHQLRGTTA 323

Query: 2140 MHPELSPPGIGNNKHVNGAG-IKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKP 1964
            +  E+      + K +N    + +  A N+Q D+WT HKT+ G++YYYNA+TG STY+KP
Sbjct: 324  LQTEVISGPADDKKKLNSVDTVNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKP 383

Query: 1963 SGFKEESDKVTIEPIPVSWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKK 1784
            +GFK ES +V+ +PIPVS   L GTDW LV+T+DG KYYYN +T  S WQIP EV ELKK
Sbjct: 384  AGFKGESHQVSAQPIPVSMMDLPGTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKK 443

Query: 1783 KQGGEILNEHMMLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLV 1604
            KQ G++  +H+M V +TNVL ++GS  ++L+A AI TGGRDA   + S+   S SALDL+
Sbjct: 444  KQDGDVTKDHLMSVSNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLI 503

Query: 1603 KKKLQDSGAPDTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXX 1424
            KKKLQDSG P  SS++PA S     E NG + V++T + LQ +N+KDK  D   D N+  
Sbjct: 504  KKKLQDSGTPVASSSIPAPSVQTGPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSD 563

Query: 1423 XXXXXXDADGGPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRS 1244
                  D D GP+KE+CIIQFKEMLK+RGVAPFSKWEKELPKIVFD RFKAIPS+SARRS
Sbjct: 564  TSSDSEDEDNGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRS 623

Query: 1243 LFDHYVKTXXXXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEV 1064
            LF+HYVKT                 E FK+LL+EASEDI++NTDYQTFRKKW NDPRFE 
Sbjct: 624  LFEHYVKTRAEEERKEKRAAQKAAIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEA 683

Query: 1063 LDRKDRELLLNERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLR 884
            LDRK++E LLNERVLPL               A++FKSML+++GDI+  +RWSRVK++LR
Sbjct: 684  LDRKEQEHLLNERVLPLKKAAEEKAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLR 743

Query: 883  NDSRYKSVKHDDREVLFNEYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXX 704
            +D RYK V+H+DREVLFNEYISELK AE  A RET                         
Sbjct: 744  DDPRYKCVRHEDREVLFNEYISELKAAEHAAERETKAKMEEQDKLRERERELRKRKEREE 803

Query: 703  XEMEWXXXXXXXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIE 524
             EME        K+AV  +QALLVE I+DP  SWTESKPKLEKD Q RATNPDLD  D E
Sbjct: 804  QEMERVRLKIRRKDAVTLFQALLVETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTE 863

Query: 523  KLFREHVKLLHERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHK 353
            KLFREHVK+L ERC  +FR LLAEVL   +  Q+++DGKTVLNSWSTAKRLLK DPRY+K
Sbjct: 864  KLFREHVKMLQERCAHEFRVLLAEVLTSDAASQETDDGKTVLNSWSTAKRLLKSDPRYNK 923

Query: 352  MPRKEREPLWRRYGEEMQRRQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            +PRKERE LWRRY E+M RRQK + D      T A+ RN  +S +    S RSYE+R
Sbjct: 924  VPRKEREALWRRYAEDMLRRQKASHDSREEKHTDAEGRNYLESSKHPFESGRSYERR 980


>ref|XP_006590824.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Glycine
            max]
          Length = 980

 Score =  858 bits (2216), Expect = 0.0
 Identities = 495/1017 (48%), Positives = 611/1017 (60%), Gaps = 16/1017 (1%)
 Frame = -1

Query: 3202 MASPAWLPQEVQSSTSQAPVSGLXXXXXXXXXXXXXXXXAQIAGXXXXXXXXXXXXXSMH 3023
            MASPAWLPQE     SQ PVSG                                   +  
Sbjct: 1    MASPAWLPQE-----SQPPVSG------------------------ETPLPMASSAHTTP 31

Query: 3022 ESTQAKVVNAPGFVVPASS--FSYSVLPNAXXXXXXXXXXSPTAVIKSNPPVSTVVXXXX 2849
             +  AK+ NA  FV PA +  F+Y +L N           +   + KSN  V+ +V    
Sbjct: 32   STAHAKLFNATAFVAPAPAPPFAYGMLQNVNASGSSQQSSTHPGM-KSNSAVNPMVVQPP 90

Query: 2848 XXXXXXXXXXXSYISHTSAGLPDGQQF-QSKSN---TSAADVNILSSALSIXXXXXXXXX 2681
                         I  + A     QQ  QS +N   + A DV  LSSA SI         
Sbjct: 91   GVSLHAAPSFSYNIPQSGAIFSSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTS 150

Query: 2680 XXXXXXXXXXXXXXLVSETPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPY 2501
                            S   WM    +FPV   +  T G P PPG+  S   SSNP  P 
Sbjct: 151  TSIMPPPSDPNYRPATS---WMPTAMSFPV-LPVMPTQGNPGPPGLASSAIISSNPAAPS 206

Query: 2500 XXXXXXXXXXVRPVMPTAPILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRL 2321
                      +RP MPT+ I S+    Q+    YP++PAMA PPQ LWL+PPQM G+ R 
Sbjct: 207  TGTDSSPAALLRPNMPTSAIASDPTAPQKGL-PYPSVPAMAAPPQGLWLQPPQMSGVLRP 265

Query: 2320 PFLSYPSVVPGPFPFPAHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSG 2141
            P+L YP+  PGPFPFPA G+ LP+VP+PDS PPGVTPVG++G   TS   S+H L  ++ 
Sbjct: 266  PYLQYPAPFPGPFPFPARGVALPAVPIPDSQPPGVTPVGAAGG--TSTPSSSHQLRGTTA 323

Query: 2140 MHPELSPPGIGNNKHVNGAG-IKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKP 1964
            +  E+      + K +N    + +  A N+Q D+WT HKT+ G++YYYNA+TG STY+KP
Sbjct: 324  LQTEVISGPADDKKKLNSVDTVNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKP 383

Query: 1963 SGFKEESDKVTIEPIPVSWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKK 1784
            +GFK ES +V+ +PIPVS   L GTDW LV+T+DG KYYYN +T  S WQIP EV ELKK
Sbjct: 384  AGFKGESHQVSAQPIPVSMMDLPGTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKK 443

Query: 1783 KQGGEILNEHMMLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLV 1604
            KQ G++  +H+M V +TNVL ++GS  ++L+A AI TGGRDA   + S+   S SALDL+
Sbjct: 444  KQDGDVTKDHLMSVSNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLI 503

Query: 1603 KKKLQDSGAPDTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXX 1424
            KKKLQDSG P  SS++PA S     E NG + V++T + LQ +N+KDK  D   D N+  
Sbjct: 504  KKKLQDSGTPVASSSIPAPSVQTGPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSD 563

Query: 1423 XXXXXXDADGGPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRS 1244
                  D D GP+KE+CIIQFKEMLK+RGVAPFSKWEKELPKIVFD RFKAIPS+SARRS
Sbjct: 564  TSSDSEDEDNGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRS 623

Query: 1243 LFDHYVKTXXXXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEV 1064
            LF+HYVKT                 E FK+LL+EASEDI++NTDYQTFRKKW NDPRFE 
Sbjct: 624  LFEHYVKTRAEEERKEKRAALKAAIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEA 683

Query: 1063 LDRKDRELLLNERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLR 884
            LDRK++E LLNERVLPL               A++FKSML+++GDI+  +RWSRVK++LR
Sbjct: 684  LDRKEQEHLLNERVLPLKKAAEEKAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLR 743

Query: 883  NDSRYKSVKHDDREVLFNEYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXX 704
            +D RYK V+H+DREVLFNEYISELK AE  A RET                         
Sbjct: 744  DDPRYKCVRHEDREVLFNEYISELKAAEHAAERETKAKREEQDKLRERERELRKRKEREE 803

Query: 703  XEMEWXXXXXXXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIE 524
             EME        K+AV  +QALLVE I+DP  SWTESKPKLEKD Q RATNPDLD  D E
Sbjct: 804  QEMERVRLKIRRKDAVTLFQALLVETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTE 863

Query: 523  KLFREHVKLLHERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHK 353
            KLFREHVK+L ERC  +FR LLAEVL   +  Q+++DGKTVLNSWSTAKRLLK DPRY+K
Sbjct: 864  KLFREHVKMLQERCAHEFRVLLAEVLTSDAASQETDDGKTVLNSWSTAKRLLKSDPRYNK 923

Query: 352  MPRKEREPLWRRYGEEMQRRQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            +PRKERE LWRRY E+M R QK + D      T A+ RN  +S +    S RSYE+R
Sbjct: 924  VPRKEREALWRRYAEDMLRGQKASHDSREEKHTDAEGRNYLESSKPPFESGRSYERR 980


>ref|XP_006590813.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Glycine
            max]
          Length = 968

 Score =  854 bits (2207), Expect = 0.0
 Identities = 494/1015 (48%), Positives = 606/1015 (59%), Gaps = 14/1015 (1%)
 Frame = -1

Query: 3202 MASPAWLPQEVQSSTSQAPVSGLXXXXXXXXXXXXXXXXAQIAGXXXXXXXXXXXXXSMH 3023
            MASPAWLPQE     SQ PVSG                                    M 
Sbjct: 1    MASPAWLPQE-----SQPPVSG-------------------------------ETPLPMA 24

Query: 3022 ESTQAKVVNAPGFVVPASSFSYSVLPNAXXXXXXXXXXSPTAVIKSNPPVSTVVXXXXXX 2843
             S       AP    PA  F+Y +L N           +   + KSN  V+ +V      
Sbjct: 25   SSAHTTPSTAPA---PAPPFAYGMLQNVNASGSSQQSSTHPGM-KSNSAVNPMVVQPPGV 80

Query: 2842 XXXXXXXXXSYISHTSAGLPDGQQF-QSKSN---TSAADVNILSSALSIXXXXXXXXXXX 2675
                       I  + A     QQ  QS +N   + A DV  LSSA SI           
Sbjct: 81   SLHAAPSFSYNIPQSGAIFSSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTS 140

Query: 2674 XXXXXXXXXXXXLVSETPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXX 2495
                          S   WM    +FPV   +  T G P PPG+  S   SSNP  P   
Sbjct: 141  IMPPPSDPNYRPATS---WMPTAMSFPV-LPVMPTQGNPGPPGLASSAIISSNPAAPSTG 196

Query: 2494 XXXXXXXXVRPVMPTAPILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPF 2315
                    +RP MPT+ I S+    Q+    YP++PAMA PPQ LWL+PPQM G+ R P+
Sbjct: 197  TDSSPAALLRPNMPTSAIASDPTAPQKGL-PYPSVPAMAAPPQGLWLQPPQMSGVLRPPY 255

Query: 2314 LSYPSVVPGPFPFPAHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMH 2135
            L YP+  PGPFPFPA G+ LP+VP+PDS PPGVTPVG++G   TS   S+H L  ++ + 
Sbjct: 256  LQYPAPFPGPFPFPARGVALPAVPIPDSQPPGVTPVGAAGG--TSTPSSSHQLRGTTALQ 313

Query: 2134 PELSPPGIGNNKHVNGAG-IKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSG 1958
             E+      + K +N    + +  A N+Q D+WT HKT+ G++YYYNA+TG STY+KP+G
Sbjct: 314  TEVISGPADDKKKLNSVDTVNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAG 373

Query: 1957 FKEESDKVTIEPIPVSWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQ 1778
            FK ES +V+ +PIPVS   L GTDW LV+T+DG KYYYN +T  S WQIP EV ELKKKQ
Sbjct: 374  FKGESHQVSAQPIPVSMMDLPGTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQ 433

Query: 1777 GGEILNEHMMLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKK 1598
             G++  +H+M V +TNVL ++GS  ++L+A AI TGGRDA   + S+   S SALDL+KK
Sbjct: 434  DGDVTKDHLMSVSNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKK 493

Query: 1597 KLQDSGAPDTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXX 1418
            KLQDSG P  SS++PA S     E NG + V++T + LQ +N+KDK  D   D N+    
Sbjct: 494  KLQDSGTPVASSSIPAPSVQTGPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTS 553

Query: 1417 XXXXDADGGPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLF 1238
                D D GP+KE+CIIQFKEMLK+RGVAPFSKWEKELPKIVFD RFKAIPS+SARRSLF
Sbjct: 554  SDSEDEDNGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLF 613

Query: 1237 DHYVKTXXXXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLD 1058
            +HYVKT                 E FK+LL+EASEDI++NTDYQTFRKKW NDPRFE LD
Sbjct: 614  EHYVKTRAEEERKEKRAAQKAAIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEALD 673

Query: 1057 RKDRELLLNERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRND 878
            RK++E LLNERVLPL               A++FKSML+++GDI+  +RWSRVK++LR+D
Sbjct: 674  RKEQEHLLNERVLPLKKAAEEKAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLRDD 733

Query: 877  SRYKSVKHDDREVLFNEYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXE 698
             RYK V+H+DREVLFNEYISELK AE  A RET                          E
Sbjct: 734  PRYKCVRHEDREVLFNEYISELKAAEHAAERETKAKMEEQDKLRERERELRKRKEREEQE 793

Query: 697  MEWXXXXXXXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKL 518
            ME        K+AV  +QALLVE I+DP  SWTESKPKLEKD Q RATNPDLD  D EKL
Sbjct: 794  MERVRLKIRRKDAVTLFQALLVETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTEKL 853

Query: 517  FREHVKLLHERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMP 347
            FREHVK+L ERC  +FR LLAEVL   +  Q+++DGKTVLNSWSTAKRLLK DPRY+K+P
Sbjct: 854  FREHVKMLQERCAHEFRVLLAEVLTSDAASQETDDGKTVLNSWSTAKRLLKSDPRYNKVP 913

Query: 346  RKEREPLWRRYGEEMQRRQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            RKERE LWRRY E+M RRQK + D      T A+ RN  +S +    S RSYE+R
Sbjct: 914  RKEREALWRRYAEDMLRRQKASHDSREEKHTDAEGRNYLESSKHPFESGRSYERR 968


>ref|XP_003538973.2| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine
            max]
          Length = 968

 Score =  852 bits (2200), Expect = 0.0
 Identities = 493/1015 (48%), Positives = 605/1015 (59%), Gaps = 14/1015 (1%)
 Frame = -1

Query: 3202 MASPAWLPQEVQSSTSQAPVSGLXXXXXXXXXXXXXXXXAQIAGXXXXXXXXXXXXXSMH 3023
            MASPAWLPQE     SQ PVSG                                    M 
Sbjct: 1    MASPAWLPQE-----SQPPVSG-------------------------------ETPLPMA 24

Query: 3022 ESTQAKVVNAPGFVVPASSFSYSVLPNAXXXXXXXXXXSPTAVIKSNPPVSTVVXXXXXX 2843
             S       AP    PA  F+Y +L N           +   + KSN  V+ +V      
Sbjct: 25   SSAHTTPSTAPA---PAPPFAYGMLQNVNASGSSQQSSTHPGM-KSNSAVNPMVVQPPGV 80

Query: 2842 XXXXXXXXXSYISHTSAGLPDGQQF-QSKSN---TSAADVNILSSALSIXXXXXXXXXXX 2675
                       I  + A     QQ  QS +N   + A DV  LSSA SI           
Sbjct: 81   SLHAAPSFSYNIPQSGAIFSSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTS 140

Query: 2674 XXXXXXXXXXXXLVSETPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXX 2495
                          S   WM    +FPV   +  T G P PPG+  S   SSNP  P   
Sbjct: 141  IMPPPSDPNYRPATS---WMPTAMSFPV-LPVMPTQGNPGPPGLASSAIISSNPAAPSTG 196

Query: 2494 XXXXXXXXVRPVMPTAPILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPF 2315
                    +RP MPT+ I S+    Q+    YP++PAMA PPQ LWL+PPQM G+ R P+
Sbjct: 197  TDSSPAALLRPNMPTSAIASDPTAPQKGL-PYPSVPAMAAPPQGLWLQPPQMSGVLRPPY 255

Query: 2314 LSYPSVVPGPFPFPAHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMH 2135
            L YP+  PGPFPFPA G+ LP+VP+PDS PPGVTPVG++G   TS   S+H L  ++ + 
Sbjct: 256  LQYPAPFPGPFPFPARGVALPAVPIPDSQPPGVTPVGAAGG--TSTPSSSHQLRGTTALQ 313

Query: 2134 PELSPPGIGNNKHVNGAG-IKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSG 1958
             E+      + K +N    + +  A N+Q D+WT HKT+ G++YYYNA+TG STY+KP+G
Sbjct: 314  TEVISGPADDKKKLNSVDTVNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAG 373

Query: 1957 FKEESDKVTIEPIPVSWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQ 1778
            FK ES +V+ +PIPVS   L GTDW LV+T+DG KYYYN +T  S WQIP EV ELKKKQ
Sbjct: 374  FKGESHQVSAQPIPVSMMDLPGTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQ 433

Query: 1777 GGEILNEHMMLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKK 1598
             G++  +H+M V +TNVL ++GS  ++L+A AI TGGRDA   + S+   S SALDL+KK
Sbjct: 434  DGDVTKDHLMSVSNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKK 493

Query: 1597 KLQDSGAPDTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXX 1418
            KLQDSG P  SS++PA S     E NG + V++T + LQ +N+KDK  D   D N+    
Sbjct: 494  KLQDSGTPVASSSIPAPSVQTGPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTS 553

Query: 1417 XXXXDADGGPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLF 1238
                D D GP+KE+CIIQFKEMLK+RGVAPFSKWEKELPKIVFD RFKAIPS+SARRSLF
Sbjct: 554  SDSEDEDNGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLF 613

Query: 1237 DHYVKTXXXXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLD 1058
            +HYVKT                 E FK+LL+EASEDI++NTDYQTFRKKW NDPRFE LD
Sbjct: 614  EHYVKTRAEEERKEKRAALKAAIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEALD 673

Query: 1057 RKDRELLLNERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRND 878
            RK++E LLNERVLPL               A++FKSML+++GDI+  +RWSRVK++LR+D
Sbjct: 674  RKEQEHLLNERVLPLKKAAEEKAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLRDD 733

Query: 877  SRYKSVKHDDREVLFNEYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXE 698
             RYK V+H+DREVLFNEYISELK AE  A RET                          E
Sbjct: 734  PRYKCVRHEDREVLFNEYISELKAAEHAAERETKAKREEQDKLRERERELRKRKEREEQE 793

Query: 697  MEWXXXXXXXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKL 518
            ME        K+AV  +QALLVE I+DP  SWTESKPKLEKD Q RATNPDLD  D EKL
Sbjct: 794  MERVRLKIRRKDAVTLFQALLVETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTEKL 853

Query: 517  FREHVKLLHERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMP 347
            FREHVK+L ERC  +FR LLAEVL   +  Q+++DGKTVLNSWSTAKRLLK DPRY+K+P
Sbjct: 854  FREHVKMLQERCAHEFRVLLAEVLTSDAASQETDDGKTVLNSWSTAKRLLKSDPRYNKVP 913

Query: 346  RKEREPLWRRYGEEMQRRQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            RKERE LWRRY E+M R QK + D      T A+ RN  +S +    S RSYE+R
Sbjct: 914  RKEREALWRRYAEDMLRGQKASHDSREEKHTDAEGRNYLESSKPPFESGRSYERR 968


>gb|EXC33082.1| Transcription elongation regulator 1 [Morus notabilis]
          Length = 829

 Score =  847 bits (2188), Expect = 0.0
 Identities = 464/815 (56%), Positives = 549/815 (67%), Gaps = 14/815 (1%)
 Frame = -1

Query: 2602 AFPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXXXXXXXXXXVRPVMPT--APILSNS 2429
            AF +P    G  G P PPGILQS   SSN I              RP+MP+    + SNS
Sbjct: 19   AFTMPPGTPGAPGTPGPPGILQSTHISSN-ITVGPVAVDTSLTVQRPIMPSPMGAMASNS 77

Query: 2428 AVQQQTYHTYPALPAMALPPQSLWLRP-PQMGGLPRLPFLSYPSVVPGPFPFPAHGLPLP 2252
            AVQQQ    Y +LP+MA PPQ  WL+P PQMGG+PRLP L Y +  PGPFP  A G+P P
Sbjct: 78   AVQQQIGVPYQSLPSMAAPPQGPWLQPSPQMGGVPRLPNLLYHAAFPGPFPSMARGIP-P 136

Query: 2251 SVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLC-SSGMHPELSPPGIGNNKHVNGAGIK 2075
            SVP PDS PPG+ PVG++    T    S  P++  SSG   EL         HV     +
Sbjct: 137  SVPGPDSQPPGIAPVGNTRLTPTPFAASVQPVVAGSSGTRMELHTSD--EQTHVRDVRSQ 194

Query: 2074 DGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVTIEPIPVSWEKLA 1895
                VNEQSD+WT HKT+ GVVYYYN LTG STY+KP GFK E +KV+++P+PVS   L 
Sbjct: 195  VSADVNEQSDAWTAHKTEAGVVYYYNTLTGESTYDKPPGFKGEPEKVSVQPVPVSMVNLP 254

Query: 1894 GTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEILNEHMMLVPDTNVLPEK 1715
            GTDW LV+T+DG KYYYN KT +SSWQIP EV EL+KKQ  +I  E+   VP+ NVL EK
Sbjct: 255  GTDWVLVSTSDGKKYYYNNKTKVSSWQIPNEVTELRKKQESDIPKENSTSVPNNNVLAEK 314

Query: 1714 GSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDSGAPDTSSAVPATSGLA 1535
            GS PI+L+A AI TGGRDA   RS++A GSSSALDL+KKKLQ+ G P TSS+     G+A
Sbjct: 315  GSTPINLNAPAINTGGRDAMALRSTSAQGSSSALDLIKKKLQEFGTPVTSSSGQVQPGIA 374

Query: 1534 VSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXDADGGPTKEQCIIQFKE 1355
             SE NG RAVE T +  QSE+SKDK  DA  D N+        DAD GPTKE+CIIQFKE
Sbjct: 375  ASESNGSRAVEPTAKGQQSESSKDKPKDANGDRNMTDSSSDSEDADSGPTKEECIIQFKE 434

Query: 1354 MLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVKTXXXXXXXXXXXXXXX 1175
            MLK+RGVAPFSKWEKELPKIVFD RFKAIPS+S RRSLF+HYVKT               
Sbjct: 435  MLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSLRRSLFEHYVKTRVEEERKEKRAALKA 494

Query: 1174 XXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRELLLNERVLPLXXXXXX 995
              E FKKLL+EASEDIDH T YQTFRKKWG+DPRF  LDRKDRE LLNERVLPL      
Sbjct: 495  AIEGFKKLLDEASEDIDHKTYYQTFRKKWGDDPRFLALDRKDREHLLNERVLPLKRATEE 554

Query: 994  XXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKSVKHDDREVLFNEYISE 815
                     ASNFKSMLR+KGD+T  +RWSRVK+SLR+D RYKSVKH+DREVLFNEY+S+
Sbjct: 555  KAQAIRAAAASNFKSMLREKGDVTVNSRWSRVKESLRDDPRYKSVKHEDREVLFNEYLSD 614

Query: 814  LKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXXXXKEAVASYQALL 635
            L+ AE E  RE                           EME        KEAV S+QALL
Sbjct: 615  LRAAEEEVEREAKAKRDEQDKLKERERELRKRKEREEQEMERVRIKVRRKEAVVSFQALL 674

Query: 634  VEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLLHERCVQDFRDLLA 455
            VE I+DPQASWTESK KLEKDPQGRA+NPDLD S++EKLFREH+K L ERC ++++ LLA
Sbjct: 675  VETIKDPQASWTESKSKLEKDPQGRASNPDLDSSEMEKLFREHIKTLQERCAREYKALLA 734

Query: 454  EVLSGG---QKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLWRRYGEEMQRRQKL 284
            E+L+     ++++DGKTVLNSWSTAKRLLKPDPRY+KMPRK+RE LWRRY E+M R+Q+ 
Sbjct: 735  ELLTADAAERETDDGKTVLNSWSTAKRLLKPDPRYNKMPRKDRETLWRRYAEDMLRKQQK 794

Query: 283  ADDT-------QAKTRNSFDSGRLTLPSKRSYEQR 200
            ++           + R S DSGRL    + ++E+R
Sbjct: 795  SEPNSKEDKKIDPRNRTSVDSGRLPSGLRGTHERR 829


>ref|XP_004505734.1| PREDICTED: pre-mRNA-processing protein 40C-like [Cicer arietinum]
          Length = 953

 Score =  834 bits (2154), Expect = 0.0
 Identities = 446/818 (54%), Positives = 542/818 (66%), Gaps = 9/818 (1%)
 Frame = -1

Query: 2626 TPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXXXXXXXXXXVRPVMPTA 2447
            T WM     FPV   + GT   P PPG+ +     SNP  P            RP MPTA
Sbjct: 143  TLWMPTAPTFPVHTLMPGT---PGPPGLAKPGIMPSNPAAPSSNTDFPSSAVPRPNMPTA 199

Query: 2446 PILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYPSVVPGPFPFPAH 2267
            PI S+     +    YP +P+M  PPQ  WL+PPQM G+ R PFL YP+  PGPFPFPA 
Sbjct: 200  PIGSDPNASHKGL-PYPPIPSMVAPPQGFWLQPPQMSGVHRPPFLQYPAAFPGPFPFPAR 258

Query: 2266 GLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSPPGIGNNKHVNG 2087
            G+ LP+VP+PDS PPGVTPVG++G    S   S+H L  +SG+   +      ++K +N 
Sbjct: 259  GVTLPAVPVPDSQPPGVTPVGAAGISAFSV--SSHQLRGTSGLQTVVISAH-ADDKKLNA 315

Query: 2086 AGIKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVTIEPIPVSW 1907
                +  A N+Q D+WT HKT+ G+VYYYNALTG STY+KP+GFK E+ +V+++P PVS 
Sbjct: 316  TVTHNEDAANDQLDAWTAHKTEAGIVYYYNALTGESTYDKPAGFKGEAHQVSVQPTPVSV 375

Query: 1906 EKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEILNEHMMLVPDTNV 1727
              L GTDW LV+T+DG KYYYN +T  S WQIP EV ELKKKQ G+   +H+M V +  V
Sbjct: 376  VDLPGTDWQLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDAAKDHLMPVLNATV 435

Query: 1726 LPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDSGAPDTSSAVPAT 1547
            LP++G   ++L+A AI TGGRDA   +  +   S SALDL+KKKLQ+SG P TSS++P  
Sbjct: 436  LPDRGFGMVTLNAPAITTGGRDAATVKPFSVQSSPSALDLIKKKLQESGTPITSSSIPMP 495

Query: 1546 SGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXDADGGPTKEQCII 1367
            S    SE NG +A ++T + LQ++NSKD+  DA  D N         D D GP+KE+CI 
Sbjct: 496  SVQPGSESNGSKATDSTAKSLQNDNSKDRQKDANGDANASDTSSDSEDEDSGPSKEECIN 555

Query: 1366 QFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVKTXXXXXXXXXXX 1187
            QFKEMLK+RGVAPFSKWEKELPK VFD RFKAIPS+SARRSLF+HYVKT           
Sbjct: 556  QFKEMLKERGVAPFSKWEKELPKFVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRA 615

Query: 1186 XXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRELLLNERVLPLXX 1007
                  E FK+LL+EASEDI+HNTDY TFRKKW ND RFE LDRK+RE LLNERVLPL  
Sbjct: 616  AQKAAIEGFKQLLDEASEDINHNTDYHTFRKKWANDSRFEALDRKEREHLLNERVLPLKK 675

Query: 1006 XXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKSVKHDDREVLFNE 827
                         A+ FKSML+++GDIT  +RWSR+K+SLR+D RYKSVKH+DREVLFNE
Sbjct: 676  AVEEKAQAMWDAAAAGFKSMLKEQGDITFNSRWSRIKESLRDDPRYKSVKHEDREVLFNE 735

Query: 826  YISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXXXXKEAVASY 647
            YISELK AE  A RE+                          EME        KEAV S 
Sbjct: 736  YISELKAAEHAAERESRAKKEEQEKLRERERELRKRKEREEHEMERVRLKIRRKEAVTSL 795

Query: 646  QALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLLHERCVQDFR 467
            QALLVE I+DP ASWTESKPKLEKDPQGRATN DLD +D+EKLFR+H+K+L ERC  DFR
Sbjct: 796  QALLVETIKDPMASWTESKPKLEKDPQGRATNSDLDSADMEKLFRDHIKMLQERCAHDFR 855

Query: 466  DLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLWRRYGEEMQR 296
             LLAEVL   +  Q+++DGKTVLNSWSTAKRLLK DPRY+K PRK+RE LWRRY E+M R
Sbjct: 856  ALLAEVLTSEAASQETDDGKTVLNSWSTAKRLLKSDPRYNKFPRKDREALWRRYVEDMLR 915

Query: 295  RQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            RQK + D      T A+ RNS  S +L L S RS+E+R
Sbjct: 916  RQKSSHDSKEDKHTDARGRNSQQSSKLPLESGRSHERR 953


>ref|XP_003540642.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine
            max]
          Length = 930

 Score =  825 bits (2132), Expect = 0.0
 Identities = 482/1012 (47%), Positives = 593/1012 (58%), Gaps = 11/1012 (1%)
 Frame = -1

Query: 3202 MASPAWLPQEVQSSTSQAPVSGLXXXXXXXXXXXXXXXXAQIAGXXXXXXXXXXXXXSMH 3023
            MASPAWLPQE     +Q PVSG                                    M 
Sbjct: 1    MASPAWLPQE-----AQPPVSG-------------------------------ETPLPMA 24

Query: 3022 ESTQAKVVNAPGFV-VPASSFSYSVLPNAXXXXXXXXXXSPTAVIKSNPPVSTVVXXXXX 2846
             ST       P     PAS F++ +L N           +  A+I SN  V+ +V     
Sbjct: 25   SSTPNSAPATPSTAPAPASPFAHGMLQNVNASGSSQLLSTHPAII-SNSAVNPMVVQPPG 83

Query: 2845 XXXXXXXXXXSYISHTSAGLPDGQQFQSKSNTSAADVNILSSALSIXXXXXXXXXXXXXX 2666
                        I  + A     QQ       S+ DV+ LSSA SI              
Sbjct: 84   VSSHAAPSFSYNIPQSGAIFSSNQQHAQ----SSTDVSKLSSASSIPHSVPAHTSTSLMP 139

Query: 2665 XXXXXXXXXLVSETPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXXXXX 2486
                       S   WM    +FPV   +  T G P PPG+  S   SSNP         
Sbjct: 140  PPSDPNYCPATS---WMPTALSFPVHP-VMPTQGNPGPPGLASSAIISSNPAA------- 188

Query: 2485 XXXXXVRPVMPTAPILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSY 2306
                                         P++PA+A PPQ LWL+PPQM G+ R P+L Y
Sbjct: 189  -----------------------------PSIPALAAPPQGLWLQPPQMSGVLRPPYLQY 219

Query: 2305 PSVVPGPFPFPAHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPEL 2126
            P+  PGPFPFPA G+ LP+VP+PDS PPGVTPVG++G   T +  S++ L  ++ +  E+
Sbjct: 220  PAPFPGPFPFPARGVALPAVPIPDSQPPGVTPVGAAGGTPTPSA-SSYQLRGTTALQTEV 278

Query: 2125 SPPGIGNNKHVNGAG-IKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKE 1949
                  + K +N    + +  A N+Q D+WT HKT+ G++YYYNA+TG STY KPSGFK 
Sbjct: 279  ISGSADDKKKLNSVDTLNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYHKPSGFKG 338

Query: 1948 ESDKVTIEPIPVSWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGE 1769
            ES +V+ +P PVS   L GTDW LV+T+DG KYYYN  T  S WQIP EV ELKKKQ G+
Sbjct: 339  ESHQVSAQPTPVSMIDLPGTDWRLVSTSDGKKYYYNNLTKTSCWQIPNEVAELKKKQDGD 398

Query: 1768 ILNEHMMLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQ 1589
            +  +H+M VP+TNVL ++GS  ++L+A AI TGGRDA   + S    SSSALDL+KKKLQ
Sbjct: 399  VTKDHLMSVPNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSTLQNSSSALDLIKKKLQ 458

Query: 1588 DSGAPDTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXX 1409
            DSG P T S++ A S     E NG + V++T + +Q +N+KDK  D   D ++       
Sbjct: 459  DSGTPITPSSIHAPSVQIGPESNGSKTVDSTAKGVQVDNNKDKQKDTNGDADVSDTSSDS 518

Query: 1408 XDADGGPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHY 1229
             D D GP+KE+CIIQFKEMLK+RGVAPFSKWEKELPKIVFD RFKAIPS+SARRSLF+HY
Sbjct: 519  EDEDNGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHY 578

Query: 1228 VKTXXXXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKD 1049
            VKT                 E FK+LL+EASEDI++NTD+QTFRKKWGNDPRFE LDRK+
Sbjct: 579  VKTRAEEERKEKRAAQKAAIEGFKRLLDEASEDINYNTDFQTFRKKWGNDPRFEALDRKE 638

Query: 1048 RELLLNERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRY 869
            +E LLNERVLPL               A++FKSML+++GD++  +RW+RVK+SLR+D RY
Sbjct: 639  QEHLLNERVLPLKKAAEEKAQAMRAAAAASFKSMLKERGDMSFNSRWARVKESLRDDPRY 698

Query: 868  KSVKHDDREVLFNEYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEW 689
            KSV+H+DREVLFNEYISELK AE  A RET                          EME 
Sbjct: 699  KSVRHEDREVLFNEYISELKAAEHAAERETKAKREEQDKLRERERELRKRKEREEQEMER 758

Query: 688  XXXXXXXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFRE 509
                   KEAV S+QALLVE I+DP ASWTESKPKLEKDPQ RATNPDLD SD EKLFRE
Sbjct: 759  VRLKIRRKEAVTSFQALLVETIKDPLASWTESKPKLEKDPQRRATNPDLDPSDTEKLFRE 818

Query: 508  HVKLLHERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKE 338
            HVK+L ERC  +FR LLAEVL   +  Q++ DGKTVLNSWSTAKRLLK DPRY+K+PRKE
Sbjct: 819  HVKMLQERCAHEFRVLLAEVLTSDAASQETNDGKTVLNSWSTAKRLLKSDPRYNKVPRKE 878

Query: 337  REPLWRRYGEEMQRRQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            RE LWRRY E+M RRQK + D      T AK R   +S +  L S RS+E+R
Sbjct: 879  REALWRRYAEDMLRRQKASYDSREEKHTDAKGRTYLESSKHPLESGRSHERR 930


>ref|XP_007131663.1| hypothetical protein PHAVU_011G031500g [Phaseolus vulgaris]
            gi|561004663|gb|ESW03657.1| hypothetical protein
            PHAVU_011G031500g [Phaseolus vulgaris]
          Length = 977

 Score =  817 bits (2110), Expect = 0.0
 Identities = 470/958 (49%), Positives = 586/958 (61%), Gaps = 18/958 (1%)
 Frame = -1

Query: 3019 STQAKVVNAPGFVVPASSFSYSVLPNAXXXXXXXXXXSPTAVIKSNPPVSTVVXXXXXXX 2840
            S+ A    AP    P   F Y VL NA          +   VIKSN  V+ VV       
Sbjct: 30   SSNATPSTAPA-PAPVPPFPYGVLQNANASGSSQQSSAHN-VIKSNSIVNPVVFQPPVPG 87

Query: 2839 XXXXXXXXSY--ISHTSAGLPDGQQ-FQSKSNTS---AADVNILSSALSIXXXXXXXXXX 2678
                        I  + A  P  QQ  QS S  S   A DV  LSSA S           
Sbjct: 88   VSSHAALSFSYNIPPSGAAFPSNQQNTQSSSEISDSVAQDVTKLSSASSTPHSVPAHTST 147

Query: 2677 XXXXXXXXXXXXXLVSETPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPYX 2498
                             T WM    + PV   +  T G P PPG+  S   S NP VP  
Sbjct: 148  PIMPPSDPNYRPT----TSWMPTAMSLPVHP-VMPTPGNPGPPGLASSSMISINPAVPST 202

Query: 2497 XXXXXXXXXVRPVMPTAPILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLP 2318
                     +RP MP + I S+     +    YP++P+MA PPQ LWL+ PQM G+ R P
Sbjct: 203  GTDSSSAALLRPNMPISAIASDPTNPLKGL-PYPSMPSMAAPPQGLWLQTPQMSGVFRPP 261

Query: 2317 FLSYPSVVPGPFPFPAHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGM 2138
            +L YP+  PGPFPFPA G+ LP+VP+PDS P GVTPV  SG   T +  S++ L  ++ +
Sbjct: 262  YLQYPAPFPGPFPFPARGVTLPAVPIPDSQPRGVTPV--SGGSSTFSPASSNQLRGTTAL 319

Query: 2137 HPELSPPGIGNNKHVNGA-GIKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPS 1961
              E+      + K +N      +  + N+Q ++WT HKT+ G++YYYNA+TG STY+KP+
Sbjct: 320  QTEVISGPADDKKKLNAVIAPNEDTSNNDQLEAWTAHKTEAGIIYYYNAMTGESTYDKPA 379

Query: 1960 GFKEESDKVTIEPIPVSWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKK 1781
            GF  ES +V+ +P PVS   L GTDW LV+T+DG KYYYN +T  S WQIP EV ELKKK
Sbjct: 380  GFIGESHQVSAQPTPVSMTDLPGTDWLLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKK 439

Query: 1780 QGGEILNEHMMLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVK 1601
            Q G++  + +M VP+ NVL ++GS  ++L+A AI TGGRDA   + S    SSSALDL+K
Sbjct: 440  QDGDVTKDQLMSVPNNNVLSDRGSGMVTLNAPAINTGGRDAAALKPSNLQNSSSALDLIK 499

Query: 1600 KKLQDSGAPDTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXX 1421
            KKLQDSG P TSS++PA S    SE NG +AVE+T + +Q++NSKDK  D+    N+   
Sbjct: 500  KKLQDSGTPVTSSSIPAPSVQTGSESNGSKAVESTSKGMQADNSKDKQKDSNGAANVSDT 559

Query: 1420 XXXXXDADGGPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSL 1241
                 D D GP+KE+CIIQFKEMLK+RGVAPFSKWEKELPKIVFD RFKAIPS+SARRSL
Sbjct: 560  SSDSEDEDSGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSL 619

Query: 1240 FDHYVKTXXXXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVL 1061
            F+HYVKT                 E FK+LL+EASEDI++NTDYQ+FRKKW NDPRFE L
Sbjct: 620  FEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDEASEDINYNTDYQSFRKKWANDPRFEAL 679

Query: 1060 DRKDRELLLNERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRN 881
            DRK++E LLN+RV PL               A++FKSML+D+GDI+  +RWSRVK+SLR+
Sbjct: 680  DRKEQEHLLNDRVFPLKKAAEEKTQAMRAAAAASFKSMLKDRGDISFNSRWSRVKESLRD 739

Query: 880  DSRYKSVKHDDREVLFNEYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXX 701
            D RYKSV+H+DREVLFNEY+SELK AE  A RET                          
Sbjct: 740  DPRYKSVRHEDREVLFNEYLSELKAAEYAAERETKAKREEQDKLRERERELRKRKEREEQ 799

Query: 700  EMEWXXXXXXXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEK 521
            EME        KEAV S+QALLVE I+DP ASWTESKPKLEKDPQGRATNP+LD SD EK
Sbjct: 800  EMERVRLKIRRKEAVTSFQALLVEIIKDPLASWTESKPKLEKDPQGRATNPELDSSDTEK 859

Query: 520  LFREHVKLLHERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKM 350
            LFREHVK+L ERC  +FR L+A+VL   +   +++DGKTVLNSWSTAKR+LK DPRY+K+
Sbjct: 860  LFREHVKMLQERCAHEFRVLIADVLTSDAASHENDDGKTVLNSWSTAKRVLKSDPRYNKV 919

Query: 349  PRKEREPLWRRYGEEMQRRQKLADD--------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            PRKERE LWRRY E+M RRQK +          +  + RN  +S +  L S RS+++R
Sbjct: 920  PRKEREALWRRYAEDMLRRQKASHSHDSREDKHSDGRGRNPLESSKYPLQSGRSHDRR 977


>ref|XP_006592053.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Glycine
            max]
          Length = 854

 Score =  815 bits (2105), Expect = 0.0
 Identities = 438/819 (53%), Positives = 538/819 (65%), Gaps = 10/819 (1%)
 Frame = -1

Query: 2626 TPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXXXXXXXXXXVRPVMPTA 2447
            T WM    +FPV   +  T G P PPG+  S   SSNP                      
Sbjct: 74   TSWMPTALSFPVHP-VMPTQGNPGPPGLASSAIISSNPAA-------------------- 112

Query: 2446 PILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYPSVVPGPFPFPAH 2267
                            P++PA+A PPQ LWL+PPQM G+ R P+L YP+  PGPFPFPA 
Sbjct: 113  ----------------PSIPALAAPPQGLWLQPPQMSGVLRPPYLQYPAPFPGPFPFPAR 156

Query: 2266 GLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSPPGIGNNKHVNG 2087
            G+ LP+VP+PDS PPGVTPVG++G   T +  S++ L  ++ +  E+      + K +N 
Sbjct: 157  GVALPAVPIPDSQPPGVTPVGAAGGTPTPSA-SSYQLRGTTALQTEVISGSADDKKKLNS 215

Query: 2086 AG-IKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVTIEPIPVS 1910
               + +  A N+Q D+WT HKT+ G++YYYNA+TG STY KPSGFK ES +V+ +P PVS
Sbjct: 216  VDTLNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYHKPSGFKGESHQVSAQPTPVS 275

Query: 1909 WEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEILNEHMMLVPDTN 1730
               L GTDW LV+T+DG KYYYN  T  S WQIP EV ELKKKQ G++  +H+M VP+TN
Sbjct: 276  MIDLPGTDWRLVSTSDGKKYYYNNLTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVPNTN 335

Query: 1729 VLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDSGAPDTSSAVPA 1550
            VL ++GS  ++L+A AI TGGRDA   + S    SSSALDL+KKKLQDSG P T S++ A
Sbjct: 336  VLSDRGSGMVTLNAPAINTGGRDAAALKPSTLQNSSSALDLIKKKLQDSGTPITPSSIHA 395

Query: 1549 TSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXDADGGPTKEQCI 1370
             S     E NG + V++T + +Q +N+KDK  D   D ++        D D GP+KE+CI
Sbjct: 396  PSVQIGPESNGSKTVDSTAKGVQVDNNKDKQKDTNGDADVSDTSSDSEDEDNGPSKEECI 455

Query: 1369 IQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVKTXXXXXXXXXX 1190
            IQFKEMLK+RGVAPFSKWEKELPKIVFD RFKAIPS+SARRSLF+HYVKT          
Sbjct: 456  IQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKR 515

Query: 1189 XXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRELLLNERVLPLX 1010
                   E FK+LL+EASEDI++NTD+QTFRKKWGNDPRFE LDRK++E LLNERVLPL 
Sbjct: 516  AAQKAAIEGFKRLLDEASEDINYNTDFQTFRKKWGNDPRFEALDRKEQEHLLNERVLPLK 575

Query: 1009 XXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKSVKHDDREVLFN 830
                          A++FKSML+++GD++  +RW+RVK+SLR+D RYKSV+H+DREVLFN
Sbjct: 576  KAAEEKAQAMRAAAAASFKSMLKERGDMSFNSRWARVKESLRDDPRYKSVRHEDREVLFN 635

Query: 829  EYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXXXXKEAVAS 650
            EYISELK AE  A RET                          EME        KEAV S
Sbjct: 636  EYISELKAAEHAAERETKAKREEQDKLRERERELRKRKEREEQEMERVRLKIRRKEAVTS 695

Query: 649  YQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLLHERCVQDF 470
            +QALLVE I+DP ASWTESKPKLEKDPQ RATNPDLD SD EKLFREHVK+L ERC  +F
Sbjct: 696  FQALLVETIKDPLASWTESKPKLEKDPQRRATNPDLDPSDTEKLFREHVKMLQERCAHEF 755

Query: 469  RDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLWRRYGEEMQ 299
            R LLAEVL   +  Q++ DGKTVLNSWSTAKRLLK DPRY+K+PRKERE LWRRY E+M 
Sbjct: 756  RVLLAEVLTSDAASQETNDGKTVLNSWSTAKRLLKSDPRYNKVPRKEREALWRRYAEDML 815

Query: 298  RRQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
            RRQK + D      T AK R   +S +  L S RS+E+R
Sbjct: 816  RRQKASYDSREEKHTDAKGRTYLESSKHPLESGRSHERR 854


>ref|XP_006592054.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X3 [Glycine
            max]
          Length = 778

 Score =  809 bits (2090), Expect = 0.0
 Identities = 430/770 (55%), Positives = 530/770 (68%), Gaps = 15/770 (1%)
 Frame = -1

Query: 2464 PVMPTA-----PILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYPS 2300
            PVMPT      P L++SA+        P++PA+A PPQ LWL+PPQM G+ R P+L YP+
Sbjct: 11   PVMPTQGNPGPPGLASSAIISSN-PAAPSIPALAAPPQGLWLQPPQMSGVLRPPYLQYPA 69

Query: 2299 VVPGPFPFPAHGLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSP 2120
              PGPFPFPA G+ LP+VP+PDS PPGVTPVG++G   T +  S++ L  ++ +  E+  
Sbjct: 70   PFPGPFPFPARGVALPAVPIPDSQPPGVTPVGAAGGTPTPSA-SSYQLRGTTALQTEVIS 128

Query: 2119 PGIGNNKHVNGAG-IKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEES 1943
                + K +N    + +  A N+Q D+WT HKT+ G++YYYNA+TG STY KPSGFK ES
Sbjct: 129  GSADDKKKLNSVDTLNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYHKPSGFKGES 188

Query: 1942 DKVTIEPIPVSWEKLAGTDWALVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEIL 1763
             +V+ +P PVS   L GTDW LV+T+DG KYYYN  T  S WQIP EV ELKKKQ G++ 
Sbjct: 189  HQVSAQPTPVSMIDLPGTDWRLVSTSDGKKYYYNNLTKTSCWQIPNEVAELKKKQDGDVT 248

Query: 1762 NEHMMLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDS 1583
             +H+M VP+TNVL ++GS  ++L+A AI TGGRDA   + S    SSSALDL+KKKLQDS
Sbjct: 249  KDHLMSVPNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSTLQNSSSALDLIKKKLQDS 308

Query: 1582 GAPDTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXD 1403
            G P T S++ A S     E NG + V++T + +Q +N+KDK  D   D ++        D
Sbjct: 309  GTPITPSSIHAPSVQIGPESNGSKTVDSTAKGVQVDNNKDKQKDTNGDADVSDTSSDSED 368

Query: 1402 ADGGPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVK 1223
             D GP+KE+CIIQFKEMLK+RGVAPFSKWEKELPKIVFD RFKAIPS+SARRSLF+HYVK
Sbjct: 369  EDNGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVK 428

Query: 1222 TXXXXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRE 1043
            T                 E FK+LL+EASEDI++NTD+QTFRKKWGNDPRFE LDRK++E
Sbjct: 429  TRAEEERKEKRAAQKAAIEGFKRLLDEASEDINYNTDFQTFRKKWGNDPRFEALDRKEQE 488

Query: 1042 LLLNERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKS 863
             LLNERVLPL               A++FKSML+++GD++  +RW+RVK+SLR+D RYKS
Sbjct: 489  HLLNERVLPLKKAAEEKAQAMRAAAAASFKSMLKERGDMSFNSRWARVKESLRDDPRYKS 548

Query: 862  VKHDDREVLFNEYISELKTAEVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXX 683
            V+H+DREVLFNEYISELK AE  A RET                          EME   
Sbjct: 549  VRHEDREVLFNEYISELKAAEHAAERETKAKREEQDKLRERERELRKRKEREEQEMERVR 608

Query: 682  XXXXXKEAVASYQALLVEAIRDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHV 503
                 KEAV S+QALLVE I+DP ASWTESKPKLEKDPQ RATNPDLD SD EKLFREHV
Sbjct: 609  LKIRRKEAVTSFQALLVETIKDPLASWTESKPKLEKDPQRRATNPDLDPSDTEKLFREHV 668

Query: 502  KLLHERCVQDFRDLLAEVL---SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKERE 332
            K+L ERC  +FR LLAEVL   +  Q++ DGKTVLNSWSTAKRLLK DPRY+K+PRKERE
Sbjct: 669  KMLQERCAHEFRVLLAEVLTSDAASQETNDGKTVLNSWSTAKRLLKSDPRYNKVPRKERE 728

Query: 331  PLWRRYGEEMQRRQKLADD------TQAKTRNSFDSGRLTLPSKRSYEQR 200
             LWRRY E+M RRQK + D      T AK R   +S +  L S RS+E+R
Sbjct: 729  ALWRRYAEDMLRRQKASYDSREEKHTDAKGRTYLESSKHPLESGRSHERR 778


>ref|XP_004236882.1| PREDICTED: pre-mRNA-processing protein 40C-like [Solanum
            lycopersicum]
          Length = 1042

 Score =  798 bits (2061), Expect = 0.0
 Identities = 436/794 (54%), Positives = 528/794 (66%), Gaps = 6/794 (0%)
 Frame = -1

Query: 2599 FPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXXXXXXXXXXVRPVMPTAPILSNSAVQ 2420
            F VPA +  +   P PPG+  ++ SSSN  +            +RP  P   +L+N +VQ
Sbjct: 256  FQVPAGVPRSPVTPGPPGLGPAIPSSSN--LTATVSPGGPSLPLRPNAPPVHVLANPSVQ 313

Query: 2419 QQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYPSVVPGPFPFPAHGLPLPSVPL 2240
            QQTY  Y +   +A   Q  WL+PP +  + R PF SYP+    P+P  A G PL SV L
Sbjct: 314  QQTYSPYHSPAPIAPSHQGPWLQPPPVTTMLRPPFPSYPAGFAVPYPLSATGAPLSSVTL 373

Query: 2239 PDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSPPGIGNNKHVNGAGIKDGFAV 2060
            PD+ PPGV PV +   + T+A  S H    +SG+ PEL PPG+ + KHVN A  K G + 
Sbjct: 374  PDTRPPGVAPVAAPPGVPTTASQSTH----ASGLQPEL-PPGVDSGKHVNDADTKQGAST 428

Query: 2059 NEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVTIEPIPVSWEKLAGTDWA 1880
            +EQ ++WT H+T+TG +YYYN+LTG STYEKP+GF+ E  KV  +P PVSWE+LAGTDWA
Sbjct: 429  SEQLETWTAHRTETGAIYYYNSLTGESTYEKPAGFRGEPGKVAAQPTPVSWERLAGTDWA 488

Query: 1879 LVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEILNEHMMLVPDTNVLPEKGSAPI 1700
            LV TNDG KYYYNTKT LSSWQIP EV ELKKK   + L      + + N   EKGSAPI
Sbjct: 489  LVATNDGQKYYYNTKTKLSSWQIPIEVTELKKKHDADALQAQSPSILNVNESAEKGSAPI 548

Query: 1699 SLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDSGAP-DTSSAVPATSGLAVSEL 1523
            SLS  A+ TGGRDAT  R S  PGSS ALDLVKKKL D G P   SS  PA+SG+  SE+
Sbjct: 549  SLSIPAVSTGGRDATSLRPSLVPGSS-ALDLVKKKLMDFGTPLAVSSPAPASSGVISSEV 607

Query: 1522 NGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXDADGGPTKEQCIIQFKEMLKD 1343
            NG +A+E+T R  Q ENSK+K  +A  +GN+        D +  PTKE CIIQFKEMLK+
Sbjct: 608  NGSKALESTTRIPQKENSKEKSKEANDNGNLSESSSDSEDDESVPTKEDCIIQFKEMLKE 667

Query: 1342 RGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVKTXXXXXXXXXXXXXXXXXEI 1163
            RGVAPFSKWEKELPKIVFD RFKAIPS+SAR++LF+HYVKT                 E 
Sbjct: 668  RGVAPFSKWEKELPKIVFDPRFKAIPSYSARKTLFEHYVKTRADEERKEKRAAQKAAVEG 727

Query: 1162 FKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRELLLNERVLPLXXXXXXXXXX 983
            FK+LLEEA EDI  +TDYQ+F+KKW +DPRFE LDRK+RE+LLNERVL L          
Sbjct: 728  FKQLLEEAKEDISEDTDYQSFKKKWSHDPRFESLDRKEREVLLNERVLQLRKAAQEKAHA 787

Query: 982  XXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKSVKHDDREVLFNEYISELKTA 803
                  S FKSMLR++GDIT  TRWS+VKDSLR+D RYKSVKH+DRE LFNEY+SELK A
Sbjct: 788  VRAAVISQFKSMLREQGDITLNTRWSKVKDSLRSDPRYKSVKHEDRETLFNEYLSELKAA 847

Query: 802  EVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXXXXKEAVASYQALLVEAI 623
            E E  R                            E+E        KEAV SYQALLVE I
Sbjct: 848  EQEVARIAKAKHDEEDKLKERERALRKRKEREEQEVERVRSKARRKEAVESYQALLVEII 907

Query: 622  RDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLLHERCVQDFRDLLAEVL- 446
            +DPQASWTESKPKLEKDPQGRA NP LD SD+EKLFREHVK+L+ERCVQ+F+ LLAEV+ 
Sbjct: 908  KDPQASWTESKPKLEKDPQGRAANPHLDQSDLEKLFREHVKVLYERCVQEFKVLLAEVIT 967

Query: 445  --SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLWRRYGEEMQRRQK--LAD 278
              +  +++EDGKTV NSWSTAK++LK D RY KM RK+ E LWRRY E++ RRQK  L +
Sbjct: 968  VEACSRETEDGKTVANSWSTAKQVLKGDLRYSKMARKDSETLWRRYVEDIHRRQKSTLDE 1027

Query: 277  DTQAKTRNSFDSGR 236
              +A+++ S DS R
Sbjct: 1028 ADKARSKGSSDSRR 1041


>ref|XP_003607201.1| Transcription elongation regulator [Medicago truncatula]
            gi|355508256|gb|AES89398.1| Transcription elongation
            regulator [Medicago truncatula]
          Length = 1013

 Score =  791 bits (2043), Expect = 0.0
 Identities = 439/865 (50%), Positives = 538/865 (62%), Gaps = 56/865 (6%)
 Frame = -1

Query: 2626 TPWMRAGQAFPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXXXXXXXXXXVRPVMPTA 2447
            T WM     FP+   + GT G P PPG+ + V   SNP  P            R  MPTA
Sbjct: 155  TLWMPTAPTFPIHPVMPGTPGTPGPPGLTKPVMIPSNPAAP-STTGFPSAAVPRQNMPTA 213

Query: 2446 PILSNSAVQQQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYPSVVPGPFPFPAH 2267
               S+     +    YP +P+M  PPQ  WL+PPQM G+ R PF  YP+  PGPFPFPA 
Sbjct: 214  ---SDPNASHRGGLPYPPIPSMVAPPQGYWLQPPQMSGVLRPPFHQYPAAFPGPFPFPAR 270

Query: 2266 GLPLPSVPLPDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSPPGIGNNKHVNG 2087
            G  LP+VP+PDS PPGVTPVG+  A +++   S H L  +SG+  E+      +   +N 
Sbjct: 271  GGALPAVPVPDSQPPGVTPVGA--ASISAPSSSNHLLRGTSGVQTEVISAHTDDKHKLNA 328

Query: 2086 AGIKDGFAVNEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVTIEPIPVSW 1907
               ++  A N+Q D+WT HKT+ G+VYYYNALTG STY+KP+GFK E+ +V+++P PVS 
Sbjct: 329  TVTQNEDAANDQLDAWTAHKTEAGIVYYYNALTGQSTYDKPAGFKGEAHQVSVQPTPVSM 388

Query: 1906 EKLAGTDWALVTTNDGNKYYYNTKTL---------------------------LSSWQIP 1808
              L GTDW LV+T+DG KYYYN +T                             S WQIP
Sbjct: 389  VDLPGTDWQLVSTSDGKKYYYNNRTKRNKTGAENSWTIQQAAEYNHNQKQHINTSCWQIP 448

Query: 1807 TEVMELKKKQGGEILNEHMMLVPDTNVLPEKGSAPISLSASAIVTGGRDATPFRSSAAPG 1628
             EV ELKKKQ  ++  +H   VP+TNVL E+GS  ++L+A AI TGGRDA   +      
Sbjct: 449  NEVAELKKKQDSDVTKDHPTPVPNTNVLSERGSGMVALNAPAITTGGRDAVASKPFIVQS 508

Query: 1627 SSSALDLVKKKLQDSGAPDTSSAVPATSGLAVSELNGVRAVEATVRCLQSENSKDKLNDA 1448
            S SALDL+KKKLQ+SGAP TSS++P  S    SE NG +A ++T + LQ++NSKDK  DA
Sbjct: 509  SPSALDLIKKKLQESGAPVTSSSIPTPSVQPGSESNGSKATDSTAKSLQNDNSKDKQKDA 568

Query: 1447 IRDGNIXXXXXXXXDADGGPTKEQCIIQFKEMLKDRGVAPFSKWEKELPKIVFDSRFKAI 1268
              D N+        D D GP+KE+CI QFKEMLK+RGVAPFSKWEKELPKIVFD RFKAI
Sbjct: 569  NGDANVSDTSSDSEDEDSGPSKEECINQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAI 628

Query: 1267 PSHSARRSLFDHYVKTXXXXXXXXXXXXXXXXXEIFKKLLEEASEDIDHNTDYQTFRKKW 1088
            PS+SARRSLF+HYVK                  E FK+LL+EASEDID  TD  TFRKKW
Sbjct: 629  PSYSARRSLFEHYVKNRAEEERKEKRAAQKAAIEGFKQLLDEASEDIDDKTDSHTFRKKW 688

Query: 1087 GNDPRFEVLDRKDRELLLNERVLPLXXXXXXXXXXXXXXXASNFKSMLRDKGDITATTRW 908
            GNDPRFE LDRK+RE LLNERVLPL               A +FKSML+++G+IT  +RW
Sbjct: 689  GNDPRFEALDRKEREHLLNERVLPLKKATEEKAQAMRDAAADSFKSMLKEQGEITFNSRW 748

Query: 907  SR--------------------VKDSLRNDSRYKSVKHDDREVLFNEYISELKTAEVEAV 788
            SR                    VK+SLR+D RYKSVKH+DRE+LFNEYISELK  E  A 
Sbjct: 749  SRMLYGTKCWAVKNQHENKVSLVKESLRDDPRYKSVKHEDRELLFNEYISELKAVEHAAE 808

Query: 787  RETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXXXXKEAVASYQALLVEAIRDPQA 608
            RET                          EME        KEAV S+QALLVE I+DP A
Sbjct: 809  RETRAKREEQDKLRERERELRKRKEREEHEMERVRLKIRRKEAVTSFQALLVERIKDPMA 868

Query: 607  SWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLLHERCVQDFRDLLAEVL---SGG 437
            SWTESKPKLEKDPQGRATN DLD +D+EKLFR+HVK+L ER  +DFR LLAE L   +  
Sbjct: 869  SWTESKPKLEKDPQGRATNSDLDSADMEKLFRDHVKMLQERRARDFRALLAEFLTSEAAS 928

Query: 436  QKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLWRRYGEEMQRRQKLADD------ 275
            Q+++DGKTVLNSWSTAKRL+K DPRY+K+P ++RE LWRRY E+M RRQK + D      
Sbjct: 929  QETDDGKTVLNSWSTAKRLIKSDPRYNKVPSEDREALWRRYAEDMIRRQKSSHDSKEEKH 988

Query: 274  TQAKTRNSFDSGRLTLPSKRSYEQR 200
            T A+ R S +S +  L S RS+E+R
Sbjct: 989  TDARGRKSLESSKNPLESGRSHERR 1013


>ref|XP_006360861.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X4 [Solanum
            tuberosum]
          Length = 1027

 Score =  786 bits (2030), Expect = 0.0
 Identities = 435/794 (54%), Positives = 526/794 (66%), Gaps = 6/794 (0%)
 Frame = -1

Query: 2599 FPVPARISGTLGMPAPPGILQSVSSSSNPIVPYXXXXXXXXXXVRPVMPTAPILSNSAVQ 2420
            F VPA   G    P  PG   ++ SSSN  +            +RP      +L+N +VQ
Sbjct: 246  FQVPA---GVPKSPVTPG--PAIPSSSN--LTATASPGGPSLPLRPNASPVHVLANPSVQ 298

Query: 2419 QQTYHTYPALPAMALPPQSLWLRPPQMGGLPRLPFLSYPSVVPGPFPFPAHGLPLPSVPL 2240
            QQTY  Y +   +    Q  WL+PP +  + R PF SYP+    PFP  A G PL SV L
Sbjct: 299  QQTYSPYFSPTPITPSHQGPWLQPPPVTTMLRPPFPSYPAGFAVPFPLSATGAPLSSVTL 358

Query: 2239 PDSHPPGVTPVGSSGAILTSAVDSAHPLLCSSGMHPELSPPGIGNNKHVNGAGIKDGFAV 2060
            PD+ PPGV PV +   + T+A    H    +SG+ PEL PPG+ + KHVN A  K G + 
Sbjct: 359  PDTRPPGVAPVAAPPGVPTTASQPTH----ASGLQPEL-PPGVDSGKHVNDADTKQGAST 413

Query: 2059 NEQSDSWTTHKTDTGVVYYYNALTGASTYEKPSGFKEESDKVTIEPIPVSWEKLAGTDWA 1880
            +EQ ++WT H+T+TG +YYYN+LTG STYEKP+GF+ E  KV  +P PVSWE+LAGTDWA
Sbjct: 414  SEQLETWTAHRTETGAIYYYNSLTGESTYEKPAGFRGEPGKVAAQPTPVSWERLAGTDWA 473

Query: 1879 LVTTNDGNKYYYNTKTLLSSWQIPTEVMELKKKQGGEILNEHMMLVPDTNVLPEKGSAPI 1700
            LV TNDG +YYYNTKT LSSWQIP+EV ELKKK   + L      + + N   EKGSAPI
Sbjct: 474  LVATNDGQRYYYNTKTKLSSWQIPSEVTELKKKHDADALQAQSPSILNVNESTEKGSAPI 533

Query: 1699 SLSASAIVTGGRDATPFRSSAAPGSSSALDLVKKKLQDSGAP-DTSSAVPATSGLAVSEL 1523
            SLS  A+ TGGRDAT  R S  PGSS ALDLVKKKL D GAP   SS VPA+SG+  SE+
Sbjct: 534  SLSIPAVSTGGRDATSLRPSLVPGSS-ALDLVKKKLMDFGAPLAVSSPVPASSGVISSEV 592

Query: 1522 NGVRAVEATVRCLQSENSKDKLNDAIRDGNIXXXXXXXXDADGGPTKEQCIIQFKEMLKD 1343
            NG +A+E+T R  Q ENSK+K  +   +GN+        D +  PTKE CIIQFKEMLK+
Sbjct: 593  NGSKALESTTRVPQKENSKEKSKEVNDNGNLSESSSDSEDDESVPTKEDCIIQFKEMLKE 652

Query: 1342 RGVAPFSKWEKELPKIVFDSRFKAIPSHSARRSLFDHYVKTXXXXXXXXXXXXXXXXXEI 1163
            RGVAPFSKWEKELPKIVFD RFKAIPS+SAR++LF+HYVKT                 E 
Sbjct: 653  RGVAPFSKWEKELPKIVFDPRFKAIPSYSARKALFEHYVKTRADEERKEKRAAQKAAVEG 712

Query: 1162 FKKLLEEASEDIDHNTDYQTFRKKWGNDPRFEVLDRKDRELLLNERVLPLXXXXXXXXXX 983
            FK+LLEEA EDI+ +TDYQ+F+KKWG+DPRFE LDRK+RE+LLNERVL L          
Sbjct: 713  FKQLLEEAKEDINEDTDYQSFKKKWGHDPRFESLDRKEREVLLNERVLQLRKAAQEKAHA 772

Query: 982  XXXXXASNFKSMLRDKGDITATTRWSRVKDSLRNDSRYKSVKHDDREVLFNEYISELKTA 803
                  S FKSMLR++GDIT  TRWS+VKDSLR+D RYKSVKH+DRE LFNEY+SELK A
Sbjct: 773  VRAAVISQFKSMLREQGDITLNTRWSKVKDSLRSDPRYKSVKHEDRETLFNEYLSELKAA 832

Query: 802  EVEAVRETNVXXXXXXXXXXXXXXXXXXXXXXXXEMEWXXXXXXXKEAVASYQALLVEAI 623
            E E  R                            E+E        KEAV SYQALLVE I
Sbjct: 833  EQEVARIAKAKHDEEDKLKLRERALRKRKEREEQEVERVRSKARRKEAVESYQALLVEII 892

Query: 622  RDPQASWTESKPKLEKDPQGRATNPDLDLSDIEKLFREHVKLLHERCVQDFRDLLAEVL- 446
            +DPQASWTESKPKLEKDPQGRA NP LD SD+EKLFREHVK+L+ERC Q+F+ LLAEV+ 
Sbjct: 893  KDPQASWTESKPKLEKDPQGRAANPHLDQSDLEKLFREHVKVLYERCAQEFKVLLAEVIT 952

Query: 445  --SGGQKSEDGKTVLNSWSTAKRLLKPDPRYHKMPRKEREPLWRRYGEEMQRRQK--LAD 278
              +  +++E+GKTV NSWSTAK+LLK D RY KM RK+RE LWRRY E++ RRQK  L +
Sbjct: 953  VEACSRETENGKTVANSWSTAKQLLKGDLRYSKMARKDRETLWRRYVEDIHRRQKSTLDE 1012

Query: 277  DTQAKTRNSFDSGR 236
              +A+++ S DS R
Sbjct: 1013 ADKARSKGSSDSRR 1026