BLASTX nr result

ID: Magnolia22_contig00002717 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Magnolia22_contig00002717
         (3590 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010250268.1 PREDICTED: pre-mRNA-processing protein 40C [Nelum...   887   0.0  
XP_011624657.1 PREDICTED: pre-mRNA-processing protein 40C [Ambor...   880   0.0  
XP_010906097.1 PREDICTED: pre-mRNA-processing protein 40C isofor...   879   0.0  
XP_010654542.1 PREDICTED: pre-mRNA-processing protein 40C isofor...   861   0.0  
XP_010654535.1 PREDICTED: pre-mRNA-processing protein 40C isofor...   861   0.0  
XP_010654529.1 PREDICTED: pre-mRNA-processing protein 40C isofor...   861   0.0  
XP_002272014.2 PREDICTED: pre-mRNA-processing protein 40C isofor...   861   0.0  
XP_012467146.1 PREDICTED: pre-mRNA-processing protein 40C [Gossy...   833   0.0  
KJB15269.1 hypothetical protein B456_002G167700 [Gossypium raimo...   829   0.0  
KJB15270.1 hypothetical protein B456_002G167700 [Gossypium raimo...   829   0.0  
XP_016707727.1 PREDICTED: pre-mRNA-processing protein 40C-like i...   827   0.0  
XP_016707728.1 PREDICTED: pre-mRNA-processing protein 40C-like i...   823   0.0  
XP_017637434.1 PREDICTED: pre-mRNA-processing protein 40C [Gossy...   822   0.0  
GAV80419.1 WW domain-containing protein/FF domain-containing pro...   823   0.0  
XP_016703241.1 PREDICTED: pre-mRNA-processing protein 40C-like i...   818   0.0  
XP_018840821.1 PREDICTED: pre-mRNA-processing protein 40C isofor...   822   0.0  
JAT41262.1 Transcription elongation regulator 1, partial [Anthur...   828   0.0  
XP_018840830.1 PREDICTED: pre-mRNA-processing protein 40C isofor...   819   0.0  
XP_015895736.1 PREDICTED: pre-mRNA-processing protein 40C isofor...   817   0.0  
XP_016703242.1 PREDICTED: pre-mRNA-processing protein 40C-like i...   813   0.0  

>XP_010250268.1 PREDICTED: pre-mRNA-processing protein 40C [Nelumbo nucifera]
          Length = 1088

 Score =  887 bits (2293), Expect = 0.0
 Identities = 473/777 (60%), Positives = 542/777 (69%), Gaps = 5/777 (0%)
 Frame = -1

Query: 3068 TVRPAIMDSSVSLRPMLSPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHP 2889
            TV    MDSS S             S  P +   VQQQ++ PYP+LP+M PPPQ LWL P
Sbjct: 317  TVNSEAMDSSSS------------TSLRPVVPSTVQQQMHSPYPALPSMPPPPQGLWL-P 363

Query: 2888 PQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGTSPAIIGA 2709
            PQ+GGLQR               + +RG+PLPSVP+PDSQPPG+S +GPPGGT  + +G+
Sbjct: 364  PQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISPLGPPGGTPSSSVGS 423

Query: 2708 VQ-----TXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNAL 2544
            V      T         G DQ+K  +DL +K G T   +  D WTAHKTETG VYYYNAL
Sbjct: 424  VHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNAK-VDAWTAHKTETGVVYYYNAL 482

Query: 2543 TGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQV 2364
            TGESTYE+P+ F GEPDKVT   TPVS EKL GTDW LVTTNDGKKYYYN++ K+SSWQV
Sbjct: 483  TGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWALVTTNDGKKYYYNSKTKISSWQV 542

Query: 2363 PLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGP 2184
            P+EV E+R+K + ++LK N   VQN+ A ++K   P+S++AP++NTGGR+A +LR SG  
Sbjct: 543  PMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPISVTAPAINTGGREATSLRPSGVA 602

Query: 2183 VSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKD 2004
             SSSALDLIKKKLQD                   DLNG + VEA  KG QSEN KDK+KD
Sbjct: 603  GSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNGSRPVEAAVKGLQSEN-KDKVKD 661

Query: 2003 ANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKA 1824
             NGDGN+           SGP+KEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFKA
Sbjct: 662  INGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKA 721

Query: 1823 VPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRK 1644
            VPGYSARRALFEH+VRT                 EGFKQLLEEASEDID + DYQTFK K
Sbjct: 722  VPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFKQLLEEASEDIDQRTDYQTFKMK 781

Query: 1643 WGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSR 1464
            WG+DPRFE LDRKERELLLNERVLPLKKAAEEK +AIR AA S FKS+LREK DINT+SR
Sbjct: 782  WGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIRAAAASGFKSLLREKGDINTSSR 841

Query: 1463 WSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1284
            WSRVKD LR+DPRYKSVKHEDRE+LFNEYIS                             
Sbjct: 842  WSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADEEAEREAKVKREEEDKLKERERE 901

Query: 1283 XXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATN 1104
                           RLKV+RKEAVA YQALLVETIKDP+ SWTES+P+L+KDPQGRATN
Sbjct: 902  MRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKDPQVSWTESRPRLEKDPQGRATN 961

Query: 1103 PDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHL 924
              LD  D EKLFREHVK+LYERCAR+FR +L EVIT E A+Q+T+DGKTVLTSWS AK L
Sbjct: 962  SVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTEAASQMTNDGKTVLTSWSTAKRL 1021

Query: 923  LKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753
            LK DPRYSKMPRKERE+LWRR+AEE+  K+KL SD KEEK N E + R S DS RSP
Sbjct: 1022 LKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKEEKLNIETKARSSLDSGRSP 1078



 Score = 62.4 bits (150), Expect = 6e-06
 Identities = 34/57 (59%), Positives = 37/57 (64%), Gaps = 1/57 (1%)
 Frame = -1

Query: 3578 GNSQGGKLTP-TTAASLQPPAPGQSGHANQFVPGKFPQNMAAPLQPPYPVPRGHPSI 3411
            G+SQ G  TP TTAASLQPP PGQ GH N F PG   Q MA+    P  VP+G PSI
Sbjct: 156  GHSQVGNSTPSTTAASLQPPVPGQPGHPNTFGPGTGAQFMASQGPSPVSVPKGAPSI 212


>XP_011624657.1 PREDICTED: pre-mRNA-processing protein 40C [Amborella trichopoda]
          Length = 1085

 Score =  880 bits (2273), Expect = 0.0
 Identities = 503/959 (52%), Positives = 598/959 (62%), Gaps = 11/959 (1%)
 Frame = -1

Query: 3590 ATAPGNSQGGKLT-PTTAASLQPPAPGQSG---HANQFVPGKFPQNMAAPLQPPYPVPRG 3423
            ATA    QGGK   PT+AASLQPP PGQS    H N + P +  QN  A  +PP+ V +G
Sbjct: 125  ATASNPMQGGKPAGPTSAASLQPPVPGQSSVSVHPNSWDPERPVQNALAQARPPFLVRKG 184

Query: 3422 HPSIXXXXXXXXXSQLPATAEASPKXXXXXXXXXXXXXXXXXXXXXXXXXTQSIVLPAHT 3243
             PS            +  ++E S K                          Q+  LPA +
Sbjct: 185  PPSTSGFSFSGNSQSV--SSEDSQKHQASNSDASAAVAQEAKTSQPSSSTAQTTPLPAPS 242

Query: 3242 XXXXSMIPPVPPNMYPTSSMWVQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGNANTV 3063
                  +    PN Y T   +                                  ++  +
Sbjct: 243  STTSRPVSS-SPNTYATP--FYMPKAPPFPGPPRLPVTPGTPGPPGIALSAPQLSSSVNI 299

Query: 3062 RPAIMDS-SVSLRPML-SPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHP 2889
            RP+++D+ S  +RP + S A    N+    + Q  Q  IY PYP+LP + PPPQA+W+HP
Sbjct: 300  RPSVIDTNSAIMRPNIASSAPGTSNAASVPITQTAQPPIYSPYPTLPGVVPPPQAMWMHP 359

Query: 2888 PQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDS-QPPGVSLIGPPGGTSPAIIG 2712
             QMGGLQR               + +R + +P V +PDS QPPGVS IGPPGG   A  G
Sbjct: 360  SQMGGLQRPPFLPYPGTFPGPFPMPLRPITVPPVAMPDSSQPPGVSPIGPPGGIPLADHG 419

Query: 2711 A-VQ-TXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTG 2538
            A +Q T         GID+ K   D TNKD +    ED D WTAHKT+TGAVYYYNALTG
Sbjct: 420  AGIQVTISEEQSPPPGIDKEKDTIDYTNKDDNAVSNEDTDQWTAHKTDTGAVYYYNALTG 479

Query: 2537 ESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPL 2358
            ESTYEKP GFKGE DKV    TPVS EKL GTDW LV TNDGKKYYYNT++K+SSWQVP 
Sbjct: 480  ESTYEKPPGFKGEVDKVILQRTPVSWEKLVGTDWALVATNDGKKYYYNTKSKISSWQVPP 539

Query: 2357 EVAEMRKKQESE-SLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPV 2181
            EVAE+RKKQE++ +LKAN A VQNA   +DKG V  SLSAP++NTGGR+A+  +++  PV
Sbjct: 540  EVAELRKKQEADAALKAN-APVQNAGISSDKGSVSSSLSAPAINTGGREAMTFKSATAPV 598

Query: 2180 SSSALDLIKKKLQD-XXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKD 2004
            SSSALDLIKKKLQD                    D NG + V+ T KGQQSENSKDKLK 
Sbjct: 599  SSSALDLIKKKLQDSGMPVTSSALPSSTPVPTTSDANGQRVVDTTVKGQQSENSKDKLKV 658

Query: 2003 ANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKA 1824
            A   G++           SGPTKEEC+IQFKEMLKE+G+APFSKWEKELPKILFDPRFKA
Sbjct: 659  AQEVGHVSDSSSDSEDVDSGPTKEECVIQFKEMLKEKGIAPFSKWEKELPKILFDPRFKA 718

Query: 1823 VPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRK 1644
            +PGY+ RR+LFEHFVRT                 EGFKQLLE ASEDI+HK DY+TFK+K
Sbjct: 719  IPGYTERRSLFEHFVRTRAEEERKEKRAAQKAAIEGFKQLLEGASEDINHKTDYETFKKK 778

Query: 1643 WGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSR 1464
            WG DPRF  LDRKERE+LLNERVLPL+KA EEK +AIR AAV+SFKSML EK DIN  SR
Sbjct: 779  WGYDPRFVALDRKEREMLLNERVLPLRKAVEEKTQAIRAAAVASFKSMLHEKVDINIGSR 838

Query: 1463 WSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1284
            WS+VKD LRNDPRYKSVKHEDREVLF EYIS                             
Sbjct: 839  WSKVKDSLRNDPRYKSVKHEDREVLFLEYISELKAAEQEADRAAKAKREEEEKLKERERE 898

Query: 1283 XXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATN 1104
                           R K RRK+AV SYQALL E IKDPKASWTESKPKL+KDP GRATN
Sbjct: 899  LRKRKEREEQEVERVRQKARRKDAVVSYQALLTERIKDPKASWTESKPKLEKDPLGRATN 958

Query: 1103 PDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHL 924
            P+L+ AD EKLFREHVKVL ERCAR+FR++L+EVIT E AAQ ++DGKT+L SWS AK L
Sbjct: 959  PELEPADMEKLFREHVKVLNERCAREFRSLLAEVITPEAAAQASEDGKTLLNSWSTAKKL 1018

Query: 923  LKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSPPA 747
            L+PDPRY KMPR+ERESLW+RYAE+M R+Q+  S+ KEEK+N +  +R  + S++S P+
Sbjct: 1019 LRPDPRYEKMPRRERESLWQRYAEDMDRRQRAASEQKEEKTNIDDPSRRPAGSSKSSPS 1077


>XP_010906097.1 PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Elaeis
            guineensis]
          Length = 1097

 Score =  879 bits (2271), Expect = 0.0
 Identities = 474/776 (61%), Positives = 540/776 (69%), Gaps = 3/776 (0%)
 Frame = -1

Query: 3068 TVRPAIMDSSVSLRPMLSP-ASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLH 2892
            T +PA  + S  LRPM+ P  S PP ST   + QN+QQQ Y PYPSLP   PPPQALWLH
Sbjct: 330  TSQPAGTNPS-PLRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPGTIPPPQALWLH 388

Query: 2891 PPQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGTSPAIIG 2712
            PPQ GGLQR               + + G+P P++PLP  QPPGV  +   G  S  + G
Sbjct: 389  PPQAGGLQRAPFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVANQGPASTTM-G 447

Query: 2711 AVQTXXXXXXXXXG--IDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTG 2538
            + Q+            ID  K AND  +KDG++ K E+AD WTAHKTE+G VYYYN++TG
Sbjct: 448  SSQSGSNVGIESPSVGIDHEKHAND-PHKDGESTKNEEADAWTAHKTESGVVYYYNSVTG 506

Query: 2537 ESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPL 2358
            ESTYE+P+ F GEP+ VT  STPVS EKLAGT+W LVTTNDG+KYYY+T+NKVSSWQVP 
Sbjct: 507  ESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVSSWQVPA 566

Query: 2357 EVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVS 2178
            EV E+RK QES++LK NA  + N   +ADKG  P+S+SAP+V TGGRD++ALRTSG  VS
Sbjct: 567  EVLELRKSQESDALKGNANQLTN---VADKGSAPISMSAPAVETGGRDSMALRTSGAAVS 623

Query: 2177 SSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDAN 1998
            SSALDL+KKKLQD                   DLNG K+VE   KGQQ  NSKDK+KD  
Sbjct: 624  SSALDLVKKKLQD-AGTPVTSSPVPTPGPVASDLNGSKAVETAPKGQQGTNSKDKVKD-- 680

Query: 1997 GDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVP 1818
             DGNM           SGPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPRFKAVP
Sbjct: 681  -DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAVP 739

Query: 1817 GYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWG 1638
             YSAR+ +FEHFVRT                 + FKQLLEEASE+IDHK DYQTFKRKWG
Sbjct: 740  SYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTFKRKWG 799

Query: 1637 NDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWS 1458
            +DPRF  LDRKERELLLNE+V    KAAEEK++AIR AAV+SFKSMLR+ +DI TTSRWS
Sbjct: 800  SDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITTTSRWS 855

Query: 1457 RVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1278
            RVK+ LRNDPRYK+VKHE+R  LFNEYIS                               
Sbjct: 856  RVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKEREREMR 915

Query: 1277 XXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPD 1098
                         RLKVRRKEAVASYQALLVETIKDPKASWTESKPKL+KDPQGRATNPD
Sbjct: 916  KRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGRATNPD 975

Query: 1097 LDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLK 918
            L + D EKLFR+HVK LYERCAR FR +LSEVITAE AAQ TDDGKT+L SWSEAK LLK
Sbjct: 976  LGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEAKRLLK 1035

Query: 917  PDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSPP 750
            PDPRYSKMP K+RE LWRRYAE+M RKQK  SD K EK + + RNR SSD +R  P
Sbjct: 1036 PDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPK-EKPDTDGRNRTSSDFSRRSP 1090


>XP_010654542.1 PREDICTED: pre-mRNA-processing protein 40C isoform X4 [Vitis
            vinifera]
          Length = 848

 Score =  861 bits (2225), Expect = 0.0
 Identities = 472/778 (60%), Positives = 533/778 (68%), Gaps = 7/778 (0%)
 Frame = -1

Query: 3065 VRPAIMDSSVSLRPMLSPASFP--PNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLH 2892
            V  A MD S S+   +S A FP  P S+ P     +QQQIYP Y SLPA     Q  WL 
Sbjct: 71   VPSASMDFSSSV---VSRAIFPAAPVSSNPA----IQQQIYPSYSSLPATNASSQGPWLQ 123

Query: 2891 PPQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGT--SPAI 2718
            PPQMGGL R               +   G+PLPSVPLPDSQPPGV+ +G  GGT  S A+
Sbjct: 124  PPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAV 183

Query: 2717 IG---AVQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNA 2547
             G   A  +         GID NK  N    KDG  A  E  D WTAHKT+TG VYYYNA
Sbjct: 184  SGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG-AAVNEQVDAWTAHKTDTGVVYYYNA 242

Query: 2546 LTGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQ 2367
            LTGESTYEKP+ FKGE DKVT   TPVS EKL GTDW LVTTNDGKKYYYNT+ K+SSWQ
Sbjct: 243  LTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQ 302

Query: 2366 VPLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGG 2187
            +P E+ EMRKKQ+S +LK +A    N +   +KG  P++LSAP+V TGGRDA  LRTS  
Sbjct: 303  IPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAV 362

Query: 2186 PVSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLK 2007
            P S+SALD+IKKKLQD                   +LNG + +E T KG QSENSKDKLK
Sbjct: 363  PGSASALDMIKKKLQD-SGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSENSKDKLK 421

Query: 2006 DANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFK 1827
            D NGDGNM           SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFK
Sbjct: 422  DTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFK 481

Query: 1826 AVPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKR 1647
            A+PGYSARR+LFEH+VRT                 EGFKQLLEEASEDIDHK +YQTF++
Sbjct: 482  AIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRK 541

Query: 1646 KWGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTS 1467
            KWG+DPRFE LDRK+RELLLNERVLPLK+AAEEK +AIR AAVSSFKSMLR+K DI T++
Sbjct: 542  KWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTST 601

Query: 1466 RWSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1287
            RWSRVKD LRNDPRYK VKHEDRE+LFNEYIS                            
Sbjct: 602  RWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERER 661

Query: 1286 XXXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRAT 1107
                            RLKVRRKEAV+SYQALLVETIKDP+ SWTESKPKL+KDPQ RAT
Sbjct: 662  ELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARAT 721

Query: 1106 NPDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKH 927
            N DLD +D EKLFREH+K+L+ER A +FRA+LSEV+TAE A Q T+DGKTVLTSWS AK 
Sbjct: 722  NSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKR 781

Query: 926  LLKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753
            LL+ D RY KMPRK+RES+WRRY+EEM RKQKL  D  EEK + EV+ R S DS R P
Sbjct: 782  LLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEK-HTEVKGRSSVDSGRFP 838


>XP_010654535.1 PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Vitis
            vinifera]
          Length = 903

 Score =  861 bits (2225), Expect = 0.0
 Identities = 472/778 (60%), Positives = 533/778 (68%), Gaps = 7/778 (0%)
 Frame = -1

Query: 3065 VRPAIMDSSVSLRPMLSPASFP--PNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLH 2892
            V  A MD S S+   +S A FP  P S+ P     +QQQIYP Y SLPA     Q  WL 
Sbjct: 126  VPSASMDFSSSV---VSRAIFPAAPVSSNPA----IQQQIYPSYSSLPATNASSQGPWLQ 178

Query: 2891 PPQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGT--SPAI 2718
            PPQMGGL R               +   G+PLPSVPLPDSQPPGV+ +G  GGT  S A+
Sbjct: 179  PPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAV 238

Query: 2717 IG---AVQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNA 2547
             G   A  +         GID NK  N    KDG  A  E  D WTAHKT+TG VYYYNA
Sbjct: 239  SGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG-AAVNEQVDAWTAHKTDTGVVYYYNA 297

Query: 2546 LTGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQ 2367
            LTGESTYEKP+ FKGE DKVT   TPVS EKL GTDW LVTTNDGKKYYYNT+ K+SSWQ
Sbjct: 298  LTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQ 357

Query: 2366 VPLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGG 2187
            +P E+ EMRKKQ+S +LK +A    N +   +KG  P++LSAP+V TGGRDA  LRTS  
Sbjct: 358  IPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAV 417

Query: 2186 PVSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLK 2007
            P S+SALD+IKKKLQD                   +LNG + +E T KG QSENSKDKLK
Sbjct: 418  PGSASALDMIKKKLQD-SGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSENSKDKLK 476

Query: 2006 DANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFK 1827
            D NGDGNM           SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFK
Sbjct: 477  DTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFK 536

Query: 1826 AVPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKR 1647
            A+PGYSARR+LFEH+VRT                 EGFKQLLEEASEDIDHK +YQTF++
Sbjct: 537  AIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRK 596

Query: 1646 KWGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTS 1467
            KWG+DPRFE LDRK+RELLLNERVLPLK+AAEEK +AIR AAVSSFKSMLR+K DI T++
Sbjct: 597  KWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTST 656

Query: 1466 RWSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1287
            RWSRVKD LRNDPRYK VKHEDRE+LFNEYIS                            
Sbjct: 657  RWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERER 716

Query: 1286 XXXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRAT 1107
                            RLKVRRKEAV+SYQALLVETIKDP+ SWTESKPKL+KDPQ RAT
Sbjct: 717  ELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARAT 776

Query: 1106 NPDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKH 927
            N DLD +D EKLFREH+K+L+ER A +FRA+LSEV+TAE A Q T+DGKTVLTSWS AK 
Sbjct: 777  NSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKR 836

Query: 926  LLKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753
            LL+ D RY KMPRK+RES+WRRY+EEM RKQKL  D  EEK + EV+ R S DS R P
Sbjct: 837  LLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEK-HTEVKGRSSVDSGRFP 893


>XP_010654529.1 PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Vitis
            vinifera]
          Length = 1013

 Score =  861 bits (2225), Expect = 0.0
 Identities = 472/778 (60%), Positives = 533/778 (68%), Gaps = 7/778 (0%)
 Frame = -1

Query: 3065 VRPAIMDSSVSLRPMLSPASFP--PNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLH 2892
            V  A MD S S+   +S A FP  P S+ P     +QQQIYP Y SLPA     Q  WL 
Sbjct: 236  VPSASMDFSSSV---VSRAIFPAAPVSSNPA----IQQQIYPSYSSLPATNASSQGPWLQ 288

Query: 2891 PPQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGT--SPAI 2718
            PPQMGGL R               +   G+PLPSVPLPDSQPPGV+ +G  GGT  S A+
Sbjct: 289  PPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAV 348

Query: 2717 IG---AVQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNA 2547
             G   A  +         GID NK  N    KDG  A  E  D WTAHKT+TG VYYYNA
Sbjct: 349  SGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG-AAVNEQVDAWTAHKTDTGVVYYYNA 407

Query: 2546 LTGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQ 2367
            LTGESTYEKP+ FKGE DKVT   TPVS EKL GTDW LVTTNDGKKYYYNT+ K+SSWQ
Sbjct: 408  LTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQ 467

Query: 2366 VPLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGG 2187
            +P E+ EMRKKQ+S +LK +A    N +   +KG  P++LSAP+V TGGRDA  LRTS  
Sbjct: 468  IPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAV 527

Query: 2186 PVSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLK 2007
            P S+SALD+IKKKLQD                   +LNG + +E T KG QSENSKDKLK
Sbjct: 528  PGSASALDMIKKKLQD-SGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSENSKDKLK 586

Query: 2006 DANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFK 1827
            D NGDGNM           SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFK
Sbjct: 587  DTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFK 646

Query: 1826 AVPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKR 1647
            A+PGYSARR+LFEH+VRT                 EGFKQLLEEASEDIDHK +YQTF++
Sbjct: 647  AIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRK 706

Query: 1646 KWGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTS 1467
            KWG+DPRFE LDRK+RELLLNERVLPLK+AAEEK +AIR AAVSSFKSMLR+K DI T++
Sbjct: 707  KWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTST 766

Query: 1466 RWSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1287
            RWSRVKD LRNDPRYK VKHEDRE+LFNEYIS                            
Sbjct: 767  RWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERER 826

Query: 1286 XXXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRAT 1107
                            RLKVRRKEAV+SYQALLVETIKDP+ SWTESKPKL+KDPQ RAT
Sbjct: 827  ELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARAT 886

Query: 1106 NPDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKH 927
            N DLD +D EKLFREH+K+L+ER A +FRA+LSEV+TAE A Q T+DGKTVLTSWS AK 
Sbjct: 887  NSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKR 946

Query: 926  LLKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753
            LL+ D RY KMPRK+RES+WRRY+EEM RKQKL  D  EEK + EV+ R S DS R P
Sbjct: 947  LLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEK-HTEVKGRSSVDSGRFP 1003


>XP_002272014.2 PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Vitis
            vinifera] CBI27460.3 unnamed protein product, partial
            [Vitis vinifera]
          Length = 1046

 Score =  861 bits (2225), Expect = 0.0
 Identities = 472/778 (60%), Positives = 533/778 (68%), Gaps = 7/778 (0%)
 Frame = -1

Query: 3065 VRPAIMDSSVSLRPMLSPASFP--PNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLH 2892
            V  A MD S S+   +S A FP  P S+ P     +QQQIYP Y SLPA     Q  WL 
Sbjct: 269  VPSASMDFSSSV---VSRAIFPAAPVSSNPA----IQQQIYPSYSSLPATNASSQGPWLQ 321

Query: 2891 PPQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGT--SPAI 2718
            PPQMGGL R               +   G+PLPSVPLPDSQPPGV+ +G  GGT  S A+
Sbjct: 322  PPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAV 381

Query: 2717 IG---AVQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNA 2547
             G   A  +         GID NK  N    KDG  A  E  D WTAHKT+TG VYYYNA
Sbjct: 382  SGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG-AAVNEQVDAWTAHKTDTGVVYYYNA 440

Query: 2546 LTGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQ 2367
            LTGESTYEKP+ FKGE DKVT   TPVS EKL GTDW LVTTNDGKKYYYNT+ K+SSWQ
Sbjct: 441  LTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQ 500

Query: 2366 VPLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGG 2187
            +P E+ EMRKKQ+S +LK +A    N +   +KG  P++LSAP+V TGGRDA  LRTS  
Sbjct: 501  IPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAV 560

Query: 2186 PVSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLK 2007
            P S+SALD+IKKKLQD                   +LNG + +E T KG QSENSKDKLK
Sbjct: 561  PGSASALDMIKKKLQD-SGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSENSKDKLK 619

Query: 2006 DANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFK 1827
            D NGDGNM           SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFK
Sbjct: 620  DTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFK 679

Query: 1826 AVPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKR 1647
            A+PGYSARR+LFEH+VRT                 EGFKQLLEEASEDIDHK +YQTF++
Sbjct: 680  AIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRK 739

Query: 1646 KWGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTS 1467
            KWG+DPRFE LDRK+RELLLNERVLPLK+AAEEK +AIR AAVSSFKSMLR+K DI T++
Sbjct: 740  KWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTST 799

Query: 1466 RWSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1287
            RWSRVKD LRNDPRYK VKHEDRE+LFNEYIS                            
Sbjct: 800  RWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERER 859

Query: 1286 XXXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRAT 1107
                            RLKVRRKEAV+SYQALLVETIKDP+ SWTESKPKL+KDPQ RAT
Sbjct: 860  ELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARAT 919

Query: 1106 NPDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKH 927
            N DLD +D EKLFREH+K+L+ER A +FRA+LSEV+TAE A Q T+DGKTVLTSWS AK 
Sbjct: 920  NSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKR 979

Query: 926  LLKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753
            LL+ D RY KMPRK+RES+WRRY+EEM RKQKL  D  EEK + EV+ R S DS R P
Sbjct: 980  LLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEK-HTEVKGRSSVDSGRFP 1036


>XP_012467146.1 PREDICTED: pre-mRNA-processing protein 40C [Gossypium raimondii]
            KJB15267.1 hypothetical protein B456_002G167700
            [Gossypium raimondii]
          Length = 887

 Score =  833 bits (2152), Expect = 0.0
 Identities = 449/750 (59%), Positives = 518/750 (69%), Gaps = 2/750 (0%)
 Frame = -1

Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841
            SP+S  P    P +L   VQQQ+YPPY SLP+M   PQ  W+  P MGG  R        
Sbjct: 126  SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPT 185

Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664
                       G+PLP+ P  DSQPPGV  +G  P   S A +              GID
Sbjct: 186  VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAILTGFPPQGID 244

Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484
              K  +D+T K  ++A  E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT
Sbjct: 245  NRKLVHDVTTKV-ESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 303

Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304
               TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE  K NA
Sbjct: 304  VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 363

Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124
             SV N   +A+KG  P+SLSAP+VNTGGRDA+ LRTS  P SSSALDLIKKKLQD     
Sbjct: 364  VSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 422

Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944
                          +LNG ++V+   KG QSE++KDKLKDANGDG++           SG
Sbjct: 423  SSSPVPVVPVTATHELNGSRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 480

Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764
            P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T   
Sbjct: 481  PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 540

Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584
                          EGFKQLL+EASEDIDH  +YQTFKRKWG+DPRFE LDRK+RELLLN
Sbjct: 541  EERKEKRAAQKAAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLN 600

Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404
            ERVL LK+AAEEK RAIR AA SSFKSML+EK DIN  SRWSRVKD LR+DPRYK VKHE
Sbjct: 601  ERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 660

Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224
            DREVLFNEYIS                                            RLKVR
Sbjct: 661  DREVLFNEYISELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVR 720

Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044
            RKEAVAS+QALLVETIKDP+ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+
Sbjct: 721  RKEAVASFQALLVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLF 780

Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864
            ERC  DFRA+L+EVIT +  AQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR
Sbjct: 781  ERCVNDFRALLAEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 840

Query: 863  RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774
            RYAE+M RKQK   D +EEK + +V+ R S
Sbjct: 841  RYAEDMLRKQKSALDQEEEK-HTDVKGRSS 869


>KJB15269.1 hypothetical protein B456_002G167700 [Gossypium raimondii]
          Length = 886

 Score =  829 bits (2141), Expect = 0.0
 Identities = 449/750 (59%), Positives = 518/750 (69%), Gaps = 2/750 (0%)
 Frame = -1

Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841
            SP+S  P    P +L   VQQQ+YPPY SLP+M   PQ  W+  P MGG  R        
Sbjct: 126  SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPT 185

Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664
                       G+PLP+ P  DSQPPGV  +G  P   S A +              GID
Sbjct: 186  VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAILTGFPPQGID 244

Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484
              K  +D+T K  ++A  E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT
Sbjct: 245  NRKLVHDVTTKV-ESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 303

Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304
               TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE  K NA
Sbjct: 304  VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 363

Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124
             SV N   +A+KG  P+SLSAP+VNTGGRDA+ LRTS  P SSSALDLIKKKLQD     
Sbjct: 364  VSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 422

Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944
                          +LNG ++V+   KG QSE++KDKLKDANGDG++           SG
Sbjct: 423  SSSPVPVVPVTATHELNGSRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 480

Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764
            P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T   
Sbjct: 481  PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 540

Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584
                          EGFKQLL+EASEDIDH  +YQTFKRKWG+DPRFE LDRK+RELLLN
Sbjct: 541  EERKEKRAAQKAAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLN 600

Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404
            ERVL LK+AAEEK RAIR AA SSFKSML+EK DIN  SRWSRVKD LR+DPRYK VKHE
Sbjct: 601  ERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 660

Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224
            DREVLFNEYIS                                            RLKVR
Sbjct: 661  DREVLFNEYIS-ELKAIEEKAERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVR 719

Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044
            RKEAVAS+QALLVETIKDP+ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+
Sbjct: 720  RKEAVASFQALLVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLF 779

Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864
            ERC  DFRA+L+EVIT +  AQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR
Sbjct: 780  ERCVNDFRALLAEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 839

Query: 863  RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774
            RYAE+M RKQK   D +EEK + +V+ R S
Sbjct: 840  RYAEDMLRKQKSALDQEEEK-HTDVKGRSS 868


>KJB15270.1 hypothetical protein B456_002G167700 [Gossypium raimondii]
          Length = 888

 Score =  829 bits (2141), Expect = 0.0
 Identities = 450/751 (59%), Positives = 518/751 (68%), Gaps = 3/751 (0%)
 Frame = -1

Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841
            SP+S  P    P +L   VQQQ+YPPY SLP+M   PQ  W+  P MGG  R        
Sbjct: 126  SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPT 185

Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664
                       G+PLP+ P  DSQPPGV  +G  P   S A +              GID
Sbjct: 186  VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAILTGFPPQGID 244

Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484
              K  +D+T K  ++A  E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT
Sbjct: 245  NRKLVHDVTTKV-ESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 303

Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKV-SSWQVPLEVAEMRKKQESESLKAN 2307
               TPVS E+LAGTDW LVTTNDGKKYYYN++ KV SSWQ+P EV E+RKKQ+SE  K N
Sbjct: 304  VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKVISSWQIPNEVTELRKKQDSEVSKEN 363

Query: 2306 AASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXX 2127
            A SV N   +A+KG  P+SLSAP+VNTGGRDA+ LRTS  P SSSALDLIKKKLQD    
Sbjct: 364  AVSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP 423

Query: 2126 XXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXS 1947
                           +LNG ++V+   KG QSE++KDKLKDANGDG++           S
Sbjct: 424  SSSPVPVVPVTATH-ELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADS 480

Query: 1946 GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXX 1767
            GP+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T  
Sbjct: 481  GPSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRA 540

Query: 1766 XXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLL 1587
                           EGFKQLL+EASEDIDH  +YQTFKRKWG+DPRFE LDRK+RELLL
Sbjct: 541  EEERKEKRAAQKAAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLL 600

Query: 1586 NERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKH 1407
            NERVL LK+AAEEK RAIR AA SSFKSML+EK DIN  SRWSRVKD LR+DPRYK VKH
Sbjct: 601  NERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKH 660

Query: 1406 EDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKV 1227
            EDREVLFNEYIS                                            RLKV
Sbjct: 661  EDREVLFNEYISELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKV 720

Query: 1226 RRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVL 1047
            RRKEAVAS+QALLVETIKDP+ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L
Sbjct: 721  RRKEAVASFQALLVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKML 780

Query: 1046 YERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLW 867
            +ERC  DFRA+L+EVIT +  AQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LW
Sbjct: 781  FERCVNDFRALLAEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALW 840

Query: 866  RRYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774
            RRYAE+M RKQK   D +EEK + +V+ R S
Sbjct: 841  RRYAEDMLRKQKSALDQEEEK-HTDVKGRSS 870


>XP_016707727.1 PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Gossypium
            hirsutum]
          Length = 886

 Score =  827 bits (2136), Expect = 0.0
 Identities = 447/750 (59%), Positives = 517/750 (68%), Gaps = 2/750 (0%)
 Frame = -1

Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841
            SP+S  P    P +L   VQQQ+YPPY SLP+M   PQ  W+  P MGG  R        
Sbjct: 125  SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPT 184

Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664
                       G+PLP+ P  DSQPPG   +G  P   S A +              GID
Sbjct: 185  VYPGPFPSTSSGMPLPA-PSSDSQPPGFRPLGMSPFAPSAAALANQSLAILTGFPPQGID 243

Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484
              K  +D+T K  ++A  E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT
Sbjct: 244  NRKLVHDVTTKV-ESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 302

Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304
               TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE  K NA
Sbjct: 303  VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 362

Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124
             SV N   +A+KG  P+SLSAP+VNTGGRDA+ LRTS  P SSSALDLIKKKLQD     
Sbjct: 363  VSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 421

Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944
                          +LNG ++V+   KG QSE++KDKLKDANGDG++           SG
Sbjct: 422  SSSPVPVMPVTATHELNGLRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 479

Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764
            P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T   
Sbjct: 480  PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 539

Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584
                          EGFKQLL+EASEDI H  +YQTFKRKWG+DPRFE LDRK+RELLLN
Sbjct: 540  EERKEKRAAQKAAIEGFKQLLDEASEDIGHDTNYQTFKRKWGSDPRFEALDRKDRELLLN 599

Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404
            ERVL LK+AAEEK RAIR AA SSFKSML+EK DIN  SRWSRVKD LR+DPRYK VKHE
Sbjct: 600  ERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 659

Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224
            DREVLFNEYIS                                            RLKVR
Sbjct: 660  DREVLFNEYISELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVR 719

Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044
            RKEAVAS+QALLVETIKD +ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+
Sbjct: 720  RKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLF 779

Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864
            ERC  DFRA+L++VIT + AAQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR
Sbjct: 780  ERCVNDFRALLAKVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 839

Query: 863  RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774
            RYAE+M RKQKL  D +EEK + +V+ R S
Sbjct: 840  RYAEDMLRKQKLALDQEEEK-HTDVKGRSS 868


>XP_016707728.1 PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Gossypium
            hirsutum]
          Length = 885

 Score =  823 bits (2125), Expect = 0.0
 Identities = 447/750 (59%), Positives = 517/750 (68%), Gaps = 2/750 (0%)
 Frame = -1

Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841
            SP+S  P    P +L   VQQQ+YPPY SLP+M   PQ  W+  P MGG  R        
Sbjct: 125  SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPT 184

Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664
                       G+PLP+ P  DSQPPG   +G  P   S A +              GID
Sbjct: 185  VYPGPFPSTSSGMPLPA-PSSDSQPPGFRPLGMSPFAPSAAALANQSLAILTGFPPQGID 243

Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484
              K  +D+T K  ++A  E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT
Sbjct: 244  NRKLVHDVTTKV-ESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 302

Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304
               TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE  K NA
Sbjct: 303  VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 362

Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124
             SV N   +A+KG  P+SLSAP+VNTGGRDA+ LRTS  P SSSALDLIKKKLQD     
Sbjct: 363  VSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 421

Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944
                          +LNG ++V+   KG QSE++KDKLKDANGDG++           SG
Sbjct: 422  SSSPVPVMPVTATHELNGLRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 479

Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764
            P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T   
Sbjct: 480  PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 539

Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584
                          EGFKQLL+EASEDI H  +YQTFKRKWG+DPRFE LDRK+RELLLN
Sbjct: 540  EERKEKRAAQKAAIEGFKQLLDEASEDIGHDTNYQTFKRKWGSDPRFEALDRKDRELLLN 599

Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404
            ERVL LK+AAEEK RAIR AA SSFKSML+EK DIN  SRWSRVKD LR+DPRYK VKHE
Sbjct: 600  ERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 659

Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224
            DREVLFNEYIS                                            RLKVR
Sbjct: 660  DREVLFNEYIS-ELKAIEEKAERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVR 718

Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044
            RKEAVAS+QALLVETIKD +ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+
Sbjct: 719  RKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLF 778

Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864
            ERC  DFRA+L++VIT + AAQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR
Sbjct: 779  ERCVNDFRALLAKVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 838

Query: 863  RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774
            RYAE+M RKQKL  D +EEK + +V+ R S
Sbjct: 839  RYAEDMLRKQKLALDQEEEK-HTDVKGRSS 867


>XP_017637434.1 PREDICTED: pre-mRNA-processing protein 40C [Gossypium arboreum]
          Length = 885

 Score =  822 bits (2123), Expect = 0.0
 Identities = 444/750 (59%), Positives = 517/750 (68%), Gaps = 2/750 (0%)
 Frame = -1

Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841
            SP+S  P    P +L   VQQQ+YPPY SLP+M   PQ  W+  P +GG  R        
Sbjct: 125  SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPLGGFPRPPFVPYPT 184

Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664
                       G+PLP+ P  DSQPPGV  +G  P   S A +              GID
Sbjct: 185  VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAIQTGFPPQGID 243

Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484
              K  +D++ +  ++A  E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT
Sbjct: 244  NRKLGHDVSTRV-ESAVNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 302

Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304
               TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE  K NA
Sbjct: 303  VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPYEVTELRKKQDSEVSKENA 362

Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124
              V N   +A+KG  P+SLSAP+VNTGGRDA+ LRTS  P SSSALDLIKKKLQD     
Sbjct: 363  VPVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 421

Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944
                          +LNG ++V+   KG QSE++KDKLKDANGDG++           SG
Sbjct: 422  SSSPVPVMPVTATHELNGSRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 479

Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764
            P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T   
Sbjct: 480  PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 539

Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584
                          EGF+QLL+EASEDIDH  +YQTFKRKWG+DPRFE LDRK+RELLLN
Sbjct: 540  EERKEKRAAQKAAIEGFRQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLN 599

Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404
            ERVL LK+AAEEK R IR AA SSFKSML+EK DIN  SRWSRVKD LR+DPRYK VKHE
Sbjct: 600  ERVLLLKRAAEEKARVIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 659

Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224
            DREVLFNEYIS                                            RLKVR
Sbjct: 660  DREVLFNEYIS-ELKAIEEKAERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVR 718

Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044
            RKEAVAS+QALLVETIKD +ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+
Sbjct: 719  RKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLF 778

Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864
            ERC  DFRA+L+EVIT + AAQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR
Sbjct: 779  ERCVNDFRALLAEVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 838

Query: 863  RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774
            RYAE+M RKQKL  D +EEK + +V+ R S
Sbjct: 839  RYAEDMLRKQKLALDQEEEK-HTDVKGRSS 867


>GAV80419.1 WW domain-containing protein/FF domain-containing protein, partial
            [Cephalotus follicularis]
          Length = 980

 Score =  823 bits (2127), Expect = 0.0
 Identities = 441/766 (57%), Positives = 512/766 (66%), Gaps = 5/766 (0%)
 Frame = -1

Query: 3035 SLRPMLSPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXX 2856
            SL P+   A    +++       VQQQ+YP YPSLPAM   PQ LW+HPPQMGG+ R   
Sbjct: 209  SLVPVTKGAPSNADTSTAVSQAGVQQQMYPTYPSLPAMAASPQGLWVHPPQMGGMPRPPF 268

Query: 2855 XXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-----PPGGTSPAIIGAVQTXXX 2691
                           R V LPSV   DSQPPGV+ +G     P    +P     V T   
Sbjct: 269  LPYPAVYPGPFLAPARNVALPSVLSLDSQPPGVTPMGTTGAIPMSSAAPGHHLVVTTGIQ 328

Query: 2690 XXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAG 2511
                  GID     +D+TN     A  + ++ WTA +T+TG VYYYNA+TGESTYEKP G
Sbjct: 329  TELPPPGIDDRTHYHDVTNNGA--AFNKQSEVWTAFRTDTGNVYYYNAITGESTYEKPPG 386

Query: 2510 FKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQ 2331
            FK EPDKV    +P   E L GTDWVLV+TNDGKKYYYN++ K+SSWQ+P EVAE+RKKQ
Sbjct: 387  FKVEPDKVPMQPSPTLMEYLPGTDWVLVSTNDGKKYYYNSKTKLSSWQIPTEVAELRKKQ 446

Query: 2330 ESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKK 2151
            + +  K +  SV N + L +KG  P+SLSAP+VNTGGRDA ALRTSG P SSSALDLIKK
Sbjct: 447  DDDVSKEHPISVPNTNVLTEKGSSPISLSAPAVNTGGRDATALRTSGVPGSSSALDLIKK 506

Query: 2150 KLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXX 1971
            KLQD                   + NG ++VEAT KG QSENSKDKLKDANGDGN+    
Sbjct: 507  KLQDPGAPITSSLTPASSGTAALESNGSRAVEATVKGLQSENSKDKLKDANGDGNVSDSS 566

Query: 1970 XXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALF 1791
                   SGPTKE C++QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LF
Sbjct: 567  SDSEDVDSGPTKEVCLVQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLF 626

Query: 1790 EHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLD 1611
            EH+V+T                 EGFKQLLEEASEDIDH  DYQTFK+KW +DPRFE LD
Sbjct: 627  EHYVKTRAEEERKEKRAAQKVAIEGFKQLLEEASEDIDHYTDYQTFKKKWDSDPRFEALD 686

Query: 1610 RKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRND 1431
            RK+RELLLNERVLPLK+AAEEK +AIR AA S FKSMLREK DI   SRWS+VKD LRND
Sbjct: 687  RKDRELLLNERVLPLKRAAEEKAQAIRVAAASDFKSMLREKGDITAISRWSKVKDVLRND 746

Query: 1430 PRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1251
            PRYKSVKHEDRE+LF++YI+                                        
Sbjct: 747  PRYKSVKHEDREILFSQYIAELKAVEEEAEREAKAKKHEQERLKERERELRKRKEREEQE 806

Query: 1250 XXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKL 1071
                R+KVRRKEAVAS QALLVETIKDP+ASWTESKPKL+KDPQGRATNPD D  D EKL
Sbjct: 807  VERVRVKVRRKEAVASLQALLVETIKDPQASWTESKPKLEKDPQGRATNPDFDPYDIEKL 866

Query: 1070 FREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMP 891
            FREH+K+L++RCA DF+A+LSEV+T E A Q   DGKT L SWS AK LLKPD RY++MP
Sbjct: 867  FREHIKILHQRCAHDFKALLSEVVTTEAAVQ-KSDGKTALNSWSTAKRLLKPDARYNRMP 925

Query: 890  RKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753
            RK+RE LWRRY EEM RKQK D D K+EK + + + R S DS R P
Sbjct: 926  RKDREGLWRRYVEEMLRKQKPDFDQKDEK-HKDAKGRSSIDSGRLP 970


>XP_016703241.1 PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Gossypium
            hirsutum]
          Length = 886

 Score =  818 bits (2112), Expect = 0.0
 Identities = 440/750 (58%), Positives = 516/750 (68%), Gaps = 2/750 (0%)
 Frame = -1

Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841
            SP+S  P    P +L   VQQQ+YPPY SLP+M   PQ  W+  P +GG  R        
Sbjct: 125  SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPLGGFPRPPFVPYPT 184

Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664
                       G+PLP+ P  DSQPPGV  +G  P   S A +              GID
Sbjct: 185  VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAIQTGFPPQGID 243

Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484
              K  +D++ +  ++A  E +D WTAHKT+TG VYYYNALTGES+YEKPAGFKGEPD+VT
Sbjct: 244  NRKLGHDVSTRV-ESAVNEQSDVWTAHKTDTGVVYYYNALTGESSYEKPAGFKGEPDQVT 302

Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304
               TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE  K NA
Sbjct: 303  VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 362

Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124
              V N   +A+KG  P+SLSAP+VNTGGRDA+ LRTS  P SSSALDLIKKKLQD     
Sbjct: 363  VPVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 421

Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944
                          +LNG ++V+   KG QSE++KDKLKDANGDG++           SG
Sbjct: 422  SSSPVPVMPVTATHELNGSRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 479

Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764
            P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T   
Sbjct: 480  PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 539

Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584
                          EGF+QLL+EASEDIDH  +YQTFKR+WG+DPRFE LDRK+R LLLN
Sbjct: 540  EERKEKRAAQKAAIEGFRQLLDEASEDIDHDTNYQTFKRQWGSDPRFEALDRKDRGLLLN 599

Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404
            ERVL LK+AAEEK R IR AA SSFKSML+EK DIN  SRWSRVKD LR+DPRYK VKHE
Sbjct: 600  ERVLLLKRAAEEKARVIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 659

Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224
            DREVLF+EYIS                                            RLKVR
Sbjct: 660  DREVLFDEYISELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVR 719

Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044
            RKEAVAS+QALLVETIKD +ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+
Sbjct: 720  RKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAVNPDLDSSDMEKLFREHIKMLF 779

Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864
            ERC  DFRA+L+EVIT + AAQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR
Sbjct: 780  ERCVNDFRALLAEVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 839

Query: 863  RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774
            RYAE+M RKQKL  D +EEK + +V+ R S
Sbjct: 840  RYAEDMLRKQKLALDQEEEK-HTDVKGRSS 868


>XP_018840821.1 PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Juglans regia]
          Length = 1013

 Score =  822 bits (2122), Expect = 0.0
 Identities = 440/775 (56%), Positives = 517/775 (66%), Gaps = 5/775 (0%)
 Frame = -1

Query: 3068 TVRPAIMDSSVSLRPMLSPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHP 2889
            TV     DSS S  P       P   T P L+ +  Q    PY S PAM  PPQ +WL P
Sbjct: 237  TVLSVATDSSSSAVPR------PTMPTAPVLSSSAVQTANYPYASFPAMAAPPQGMWLQP 290

Query: 2888 PQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG--PPGGTSPAII 2715
             QMGGL R               +  RG+ LPSVPLPDSQPPGV+ +G  P    S A  
Sbjct: 291  SQMGGLPRSPFQPYPAAFPGPFPLPARGMALPSVPLPDSQPPGVTPLGTAPTISVSSAAS 350

Query: 2714 G---AVQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNAL 2544
            G   A            GID  K   ++  +DG  A KE  D WTAHKTE G VYYYNA+
Sbjct: 351  GHMLAGTLRMQPELPPPGIDNRKNVEEVGTQDG-AAVKEQLDAWTAHKTEAGVVYYYNAV 409

Query: 2543 TGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQV 2364
            TGESTY+KP GFKGE DKV    TPVS+  + GTDWVLVTT+DGKKYYYN++ K+SSWQ+
Sbjct: 410  TGESTYDKPLGFKGEHDKVHVQPTPVSTTSILGTDWVLVTTSDGKKYYYNSKTKISSWQI 469

Query: 2363 PLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGP 2184
            P EV E++KKQ+ E    ++ S+ +A+   +KG  P+SL+AP+++TGGRDA+AL+    P
Sbjct: 470  PSEVTELKKKQDGE----HSISLPHANLSTEKGSAPISLNAPAISTGGRDAMALKALAVP 525

Query: 2183 VSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKD 2004
             SSSALD+IKKKLQD                   +LNG ++V+ T KG QSE+S+DKLKD
Sbjct: 526  GSSSALDMIKKKLQDSGSPITSSPNPAPSGIAASELNGSRAVDTTVKGLQSEDSRDKLKD 585

Query: 2003 ANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKA 1824
            ANGDGNM           SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFKA
Sbjct: 586  ANGDGNMSDSSSDSEDADSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKA 645

Query: 1823 VPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRK 1644
            +P YSARR+LFEH+V+T                 EGFKQLL EASEDIDH  DYQTF++K
Sbjct: 646  IPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLGEASEDIDHNTDYQTFRKK 705

Query: 1643 WGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSR 1464
            WG DPRFE LDRK+RE LLNERV PLKKAAEEK++A+R AA +SFKSMLREK DI   SR
Sbjct: 706  WGADPRFEVLDRKDREHLLNERVFPLKKAAEEKVQALRAAAATSFKSMLREKRDITANSR 765

Query: 1463 WSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1284
            WS+VKD LRND RYKS KHEDRE+ FNEYIS                             
Sbjct: 766  WSKVKDSLRNDSRYKSAKHEDREIFFNEYISELKAGEEQSEREAKAKREEQEKLKERERE 825

Query: 1283 XXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATN 1104
                           RLKVRRKEAVAS+QALLVE IKDP+ASWTESKPKL+KDPQGRATN
Sbjct: 826  LRKRKEREEQEMERVRLKVRRKEAVASFQALLVEIIKDPQASWTESKPKLEKDPQGRATN 885

Query: 1103 PDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHL 924
             DLD +D EKLFREH+K+L ERC ++FR +L+EV+TAE AAQ T++GKTVL SWS AK L
Sbjct: 886  TDLDPSDIEKLFREHIKMLNERCVQEFRYLLAEVLTAEAAAQETEEGKTVLNSWSTAKRL 945

Query: 923  LKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSAR 759
            LKPDPRY+KMPRKERE LWRRYA+E+ R+QK+  D KEEK + E + R S+DS R
Sbjct: 946  LKPDPRYNKMPRKEREVLWRRYADEILRRQKVALDQKEEKKHVESKGRNSADSGR 1000


>JAT41262.1 Transcription elongation regulator 1, partial [Anthurium amnicola]
          Length = 1216

 Score =  828 bits (2139), Expect = 0.0
 Identities = 489/954 (51%), Positives = 572/954 (59%), Gaps = 8/954 (0%)
 Frame = -1

Query: 3590 ATAPGNS--QGGKLTPTTAA-SLQPPAPGQSGHANQFVPGKFPQNMAAPLQPPYPVPRGH 3420
            +TA GN+  QG  L P++A  SLQ    GQS      +PG   QN    +Q P      +
Sbjct: 277  STAIGNNNLQGETLAPSSAPPSLQSSVRGQSSALRSTLPGTAKQNPPTLMQLPSSTSFSY 336

Query: 3419 PSIXXXXXXXXXSQLPATAEASPKXXXXXXXXXXXXXXXXXXXXXXXXXTQSIVLPAHTX 3240
                        +  P   E S K                         +QS+ + A   
Sbjct: 337  SG----------NSQPGIVETSEKTVSPNSNASSAIAAEPVAAAVAPISSQSMQMSAQVP 386

Query: 3239 XXXSMIPPVPPNMYPTSSMWVQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGNANTVR 3060
               S   P  PN    +   VQ                                NA TVR
Sbjct: 387  PSFSTNVPSSPNPNVAT---VQVPVIPSFARPPGIPGNVGPGPAGLASCVSPSSNA-TVR 442

Query: 3059 PAIMDSSVSLRPML-SPASFPPNS-TVPT-LAQNVQQQIYPPYPSLPAMTPPPQALWLHP 2889
            P ++DSS S RP+L +PAS P NS + P  + QNVQQQ YPPYPS+ A  PPPQA WLH 
Sbjct: 443  PVLVDSS-SARPILPAPASIPTNSVSAPAPIPQNVQQQSYPPYPSITA-APPPQAPWLHA 500

Query: 2888 PQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGTSPAIIGA 2709
                  Q                + ++ +P P VPLP  QPPGVS I   GGT  A I  
Sbjct: 501  SHAVSFQHAPFLPYPGALCTPFPLPMQSMPSPYVPLPSLQPPGVSTIVVSGGTKSASIEP 560

Query: 2708 VQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGEST 2529
            VQ              NK A D T KDGD AKK+ +  WTAHKT+ GA+YYYN+LTGEST
Sbjct: 561  VQPGNNFIAQSPSGTDNKLATDPTIKDGDIAKKDGSGPWTAHKTDAGAIYYYNSLTGEST 620

Query: 2528 YEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVA 2349
            YEKP+GFKGEP KV    TPVS EKLAGTDW LVTTNDGKKYYYN++ KVSSWQ+P EVA
Sbjct: 621  YEKPSGFKGEPGKVVCQPTPVSWEKLAGTDWSLVTTNDGKKYYYNSKTKVSSWQIPSEVA 680

Query: 2348 EMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSA 2169
            E++  + S+  K    S+QNAS   DKG   VSL+AP+V TGGRDA   +T    +SSSA
Sbjct: 681  ELKNNEVSDHSKEGTNSIQNASVTDDKGSSLVSLNAPAVQTGGRDAATSKTPAPLISSSA 740

Query: 2168 LDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDG 1989
            LDLIKKKLQD                   DL+GPK+VE T KGQ SENSKDKLK  NGD 
Sbjct: 741  LDLIKKKLQD-AGTPMTSLPLPTSVPTLSDLSGPKAVETTAKGQHSENSKDKLKGINGDA 799

Query: 1988 NMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYS 1809
            N+           SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFKAV  +S
Sbjct: 800  NLSESSSDSDDADSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIIFDPRFKAVQSHS 859

Query: 1808 ARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDP 1629
             RR+LFEH+VRT                 EGFKQLL+E SEDI+HK DYQ+FKRKWG DP
Sbjct: 860  VRRSLFEHYVRTRADEERKEKRAAQKALIEGFKQLLDEVSEDINHKTDYQSFKRKWGRDP 919

Query: 1628 RFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVK 1449
            RFE L RKE+E LL ER+L LKK  EEK +A+R    ++FK +LREK +++ +SRWSRVK
Sbjct: 920  RFEALGRKEKEALLTERILSLKKVVEEKTQAVR----ANFKCLLREKAEVSASSRWSRVK 975

Query: 1448 DGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1269
            D LRNDPRY++VKHEDREV FNE+IS                                  
Sbjct: 976  DSLRNDPRYRAVKHEDREVFFNEHISELKEAEAEAQLAVKAKIEEQEKLKKREQEMRKRK 1035

Query: 1268 XXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDK 1089
                      RL+VRRKEA +SYQALLVETIKDPKASWTESKPKL+KDPQGRA NPDLD+
Sbjct: 1036 QREEQEMEAVRLRVRRKEAESSYQALLVETIKDPKASWTESKPKLEKDPQGRAANPDLDQ 1095

Query: 1088 ADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDP 909
            AD EKLFREHVK LYERCAR++RA+L+E+ITAE AA+VTDDGKTVLTSWSEAK LLKPD 
Sbjct: 1096 ADMEKLFREHVKNLYERCAREYRALLAELITAEVAARVTDDGKTVLTSWSEAKKLLKPDS 1155

Query: 908  RYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSD--SARSP 753
            RYSKMP KERES+W R+A+E+ RK K  SD+K E+ + EV+ R S      RSP
Sbjct: 1156 RYSKMPSKERESIWSRHADEIHRKLKSASDIK-ERVDGEVKGRASCTDIGGRSP 1208


>XP_018840830.1 PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Juglans regia]
          Length = 1011

 Score =  819 bits (2116), Expect = 0.0
 Identities = 437/777 (56%), Positives = 516/777 (66%), Gaps = 7/777 (0%)
 Frame = -1

Query: 3068 TVRPAIMDSSVSLRPMLSPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHP 2889
            TV     DSS S  P       P   T P L+ +  Q    PY S PAM  PPQ +WL P
Sbjct: 237  TVLSVATDSSSSAVPR------PTMPTAPVLSSSAVQTANYPYASFPAMAAPPQGMWLQP 290

Query: 2888 PQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGTSPAIIGA 2709
             QMGGL R               +  RG+ LPSVPLPDSQPPGV+    P GT+P I  +
Sbjct: 291  SQMGGLPRSPFQPYPAAFPGPFPLPARGMALPSVPLPDSQPPGVT----PLGTAPTISVS 346

Query: 2708 VQTXXXXXXXXXGI-------DQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYN 2550
                         +       D  K   ++  +DG  A KE  D WTAHKTE G VYYYN
Sbjct: 347  SAASGHMLAGTLRMQPELPPPDNRKNVEEVGTQDG-AAVKEQLDAWTAHKTEAGVVYYYN 405

Query: 2549 ALTGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSW 2370
            A+TGESTY+KP GFKGE DKV    TPVS+  + GTDWVLVTT+DGKKYYYN++ K+SSW
Sbjct: 406  AVTGESTYDKPLGFKGEHDKVHVQPTPVSTTSILGTDWVLVTTSDGKKYYYNSKTKISSW 465

Query: 2369 QVPLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSG 2190
            Q+P EV E++KKQ+ E    ++ S+ +A+   +KG  P+SL+AP+++TGGRDA+AL+   
Sbjct: 466  QIPSEVTELKKKQDGE----HSISLPHANLSTEKGSAPISLNAPAISTGGRDAMALKALA 521

Query: 2189 GPVSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKL 2010
             P SSSALD+IKKKLQD                   +LNG ++V+ T KG QSE+S+DKL
Sbjct: 522  VPGSSSALDMIKKKLQDSGSPITSSPNPAPSGIAASELNGSRAVDTTVKGLQSEDSRDKL 581

Query: 2009 KDANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRF 1830
            KDANGDGNM           SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRF
Sbjct: 582  KDANGDGNMSDSSSDSEDADSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRF 641

Query: 1829 KAVPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFK 1650
            KA+P YSARR+LFEH+V+T                 EGFKQLL EASEDIDH  DYQTF+
Sbjct: 642  KAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLGEASEDIDHNTDYQTFR 701

Query: 1649 RKWGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTT 1470
            +KWG DPRFE LDRK+RE LLNERV PLKKAAEEK++A+R AA +SFKSMLREK DI   
Sbjct: 702  KKWGADPRFEVLDRKDREHLLNERVFPLKKAAEEKVQALRAAAATSFKSMLREKRDITAN 761

Query: 1469 SRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXX 1290
            SRWS+VKD LRND RYKS KHEDRE+ FNEYIS                           
Sbjct: 762  SRWSKVKDSLRNDSRYKSAKHEDREIFFNEYISELKAGEEQSEREAKAKREEQEKLKERE 821

Query: 1289 XXXXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRA 1110
                             RLKVRRKEAVAS+QALLVE IKDP+ASWTESKPKL+KDPQGRA
Sbjct: 822  RELRKRKEREEQEMERVRLKVRRKEAVASFQALLVEIIKDPQASWTESKPKLEKDPQGRA 881

Query: 1109 TNPDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAK 930
            TN DLD +D EKLFREH+K+L ERC ++FR +L+EV+TAE AAQ T++GKTVL SWS AK
Sbjct: 882  TNTDLDPSDIEKLFREHIKMLNERCVQEFRYLLAEVLTAEAAAQETEEGKTVLNSWSTAK 941

Query: 929  HLLKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSAR 759
             LLKPDPRY+KMPRKERE LWRRYA+E+ R+QK+  D KEEK + E + R S+DS R
Sbjct: 942  RLLKPDPRYNKMPRKEREVLWRRYADEILRRQKVALDQKEEKKHVESKGRNSADSGR 998


>XP_015895736.1 PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Ziziphus
            jujuba]
          Length = 982

 Score =  817 bits (2111), Expect = 0.0
 Identities = 436/768 (56%), Positives = 515/768 (67%), Gaps = 4/768 (0%)
 Frame = -1

Query: 3044 SSVSLRPMLSPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQR 2865
            SS+  RP +     P NS++       Q QI   YPSLPA+   PQ LWL PPQMGG+ R
Sbjct: 214  SSMVQRPGMPTGPVPLNSSI-------QPQIGASYPSLPALAGHPQGLWLQPPQMGGMPR 266

Query: 2864 XXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGG----TSPAIIGAVQTX 2697
                           +   G+ LPSVP+PD QPPGV+ +   G     ++ + +  V   
Sbjct: 267  QPVVPYSAAFPGPLPLMAHGMHLPSVPVPDPQPPGVTPVENSGSIPVSSTASSLQLVGPS 326

Query: 2696 XXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKP 2517
                          + ND+  +D   A  E  D WTAHKT+TG VYYYNALTGESTY KP
Sbjct: 327  GMHTLVHKSAGDRTKVNDVGVQDR-AAINEQLDAWTAHKTDTGVVYYYNALTGESTYAKP 385

Query: 2516 AGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRK 2337
            A FKGEPDKV+    PVS   L GTDWVLVTT+DGKKYY N + KVSSWQ+P EV E++K
Sbjct: 386  ADFKGEPDKVSVQPIPVSMVNLPGTDWVLVTTSDGKKYYCNNKTKVSSWQIPNEVTELKK 445

Query: 2336 KQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLI 2157
            K + E  K +  SV N S + +KG   +SLS P++NTGGRDAIALR+SG   SSSALDLI
Sbjct: 446  KPDGEVSKEHLMSVPNTSVVMEKGSTTISLSTPAINTGGRDAIALRSSGVQPSSSALDLI 505

Query: 2156 KKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXX 1977
            KKKLQD                   + NG K+VEATTKGQQSENSKDKLKDANGDGN   
Sbjct: 506  KKKLQDSGAPVVSSPVPAPSGMTGSESNGSKAVEATTKGQQSENSKDKLKDANGDGNFSD 565

Query: 1976 XXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRA 1797
                     SGPTKEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P YSARR+
Sbjct: 566  SSSDSEDADSGPTKEECIVQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRS 625

Query: 1796 LFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFET 1617
            LFEH+V+T                 EGFKQLL+EASE+IDH+ DYQTF++KWGNDPRF  
Sbjct: 626  LFEHYVKTRVEEERKEKRAAQKAAIEGFKQLLDEASEEIDHETDYQTFRKKWGNDPRFMA 685

Query: 1616 LDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLR 1437
            LDRK+RE LLNERVLPLK+AAEEK +AIR AA S FKSMLREK DI   SRWSRVKD LR
Sbjct: 686  LDRKDRENLLNERVLPLKRAAEEKAQAIRAAAASGFKSMLREKGDITVNSRWSRVKDSLR 745

Query: 1436 NDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1257
            NDPRYKSVKHEDREVLFNEY++                                      
Sbjct: 746  NDPRYKSVKHEDREVLFNEYLADLRATEEEAEREAKLKRQEQDKLKERERELRKRKEREE 805

Query: 1256 XXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTE 1077
                  R+KVRRKEA+AS+QALLVETIKDP+ASWTESK KL+KDPQGRA NPDLD  + E
Sbjct: 806  QEMERVRVKVRRKEAIASFQALLVETIKDPQASWTESKTKLEKDPQGRAANPDLDSLEME 865

Query: 1076 KLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSK 897
            KLFREH+K+L+ERCAR+F+ +L+EV+TA+ AAQ T+DGKTVL SWS AK LLK DPRY+K
Sbjct: 866  KLFREHIKMLHERCAREFKTLLAEVLTADAAAQETEDGKTVLNSWSTAKRLLKRDPRYNK 925

Query: 896  MPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753
            MPRK+RE+LWRR+AEEM RKQK + + KE+K   + ++R + +S R P
Sbjct: 926  MPRKDREALWRRHAEEMLRKQKSELERKEDK-KIDAKSRSTIESGRFP 972


>XP_016703242.1 PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Gossypium
            hirsutum]
          Length = 885

 Score =  813 bits (2101), Expect = 0.0
 Identities = 440/750 (58%), Positives = 516/750 (68%), Gaps = 2/750 (0%)
 Frame = -1

Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841
            SP+S  P    P +L   VQQQ+YPPY SLP+M   PQ  W+  P +GG  R        
Sbjct: 125  SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPLGGFPRPPFVPYPT 184

Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664
                       G+PLP+ P  DSQPPGV  +G  P   S A +              GID
Sbjct: 185  VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAIQTGFPPQGID 243

Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484
              K  +D++ +  ++A  E +D WTAHKT+TG VYYYNALTGES+YEKPAGFKGEPD+VT
Sbjct: 244  NRKLGHDVSTRV-ESAVNEQSDVWTAHKTDTGVVYYYNALTGESSYEKPAGFKGEPDQVT 302

Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304
               TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE  K NA
Sbjct: 303  VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 362

Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124
              V N   +A+KG  P+SLSAP+VNTGGRDA+ LRTS  P SSSALDLIKKKLQD     
Sbjct: 363  VPVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 421

Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944
                          +LNG ++V+   KG QSE++KDKLKDANGDG++           SG
Sbjct: 422  SSSPVPVMPVTATHELNGSRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 479

Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764
            P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T   
Sbjct: 480  PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 539

Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584
                          EGF+QLL+EASEDIDH  +YQTFKR+WG+DPRFE LDRK+R LLLN
Sbjct: 540  EERKEKRAAQKAAIEGFRQLLDEASEDIDHDTNYQTFKRQWGSDPRFEALDRKDRGLLLN 599

Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404
            ERVL LK+AAEEK R IR AA SSFKSML+EK DIN  SRWSRVKD LR+DPRYK VKHE
Sbjct: 600  ERVLLLKRAAEEKARVIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 659

Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224
            DREVLF+EYIS                                            RLKVR
Sbjct: 660  DREVLFDEYIS-ELKAIEEKAERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVR 718

Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044
            RKEAVAS+QALLVETIKD +ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+
Sbjct: 719  RKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAVNPDLDSSDMEKLFREHIKMLF 778

Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864
            ERC  DFRA+L+EVIT + AAQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR
Sbjct: 779  ERCVNDFRALLAEVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 838

Query: 863  RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774
            RYAE+M RKQKL  D +EEK + +V+ R S
Sbjct: 839  RYAEDMLRKQKLALDQEEEK-HTDVKGRSS 867

