BLASTX nr result
ID: Magnolia22_contig00002717
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Magnolia22_contig00002717 (3590 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value XP_010250268.1 PREDICTED: pre-mRNA-processing protein 40C [Nelum... 887 0.0 XP_011624657.1 PREDICTED: pre-mRNA-processing protein 40C [Ambor... 880 0.0 XP_010906097.1 PREDICTED: pre-mRNA-processing protein 40C isofor... 879 0.0 XP_010654542.1 PREDICTED: pre-mRNA-processing protein 40C isofor... 861 0.0 XP_010654535.1 PREDICTED: pre-mRNA-processing protein 40C isofor... 861 0.0 XP_010654529.1 PREDICTED: pre-mRNA-processing protein 40C isofor... 861 0.0 XP_002272014.2 PREDICTED: pre-mRNA-processing protein 40C isofor... 861 0.0 XP_012467146.1 PREDICTED: pre-mRNA-processing protein 40C [Gossy... 833 0.0 KJB15269.1 hypothetical protein B456_002G167700 [Gossypium raimo... 829 0.0 KJB15270.1 hypothetical protein B456_002G167700 [Gossypium raimo... 829 0.0 XP_016707727.1 PREDICTED: pre-mRNA-processing protein 40C-like i... 827 0.0 XP_016707728.1 PREDICTED: pre-mRNA-processing protein 40C-like i... 823 0.0 XP_017637434.1 PREDICTED: pre-mRNA-processing protein 40C [Gossy... 822 0.0 GAV80419.1 WW domain-containing protein/FF domain-containing pro... 823 0.0 XP_016703241.1 PREDICTED: pre-mRNA-processing protein 40C-like i... 818 0.0 XP_018840821.1 PREDICTED: pre-mRNA-processing protein 40C isofor... 822 0.0 JAT41262.1 Transcription elongation regulator 1, partial [Anthur... 828 0.0 XP_018840830.1 PREDICTED: pre-mRNA-processing protein 40C isofor... 819 0.0 XP_015895736.1 PREDICTED: pre-mRNA-processing protein 40C isofor... 817 0.0 XP_016703242.1 PREDICTED: pre-mRNA-processing protein 40C-like i... 813 0.0 >XP_010250268.1 PREDICTED: pre-mRNA-processing protein 40C [Nelumbo nucifera] Length = 1088 Score = 887 bits (2293), Expect = 0.0 Identities = 473/777 (60%), Positives = 542/777 (69%), Gaps = 5/777 (0%) Frame = -1 Query: 3068 TVRPAIMDSSVSLRPMLSPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHP 2889 TV MDSS S S P + VQQQ++ PYP+LP+M PPPQ LWL P Sbjct: 317 TVNSEAMDSSSS------------TSLRPVVPSTVQQQMHSPYPALPSMPPPPQGLWL-P 363 Query: 2888 PQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGTSPAIIGA 2709 PQ+GGLQR + +RG+PLPSVP+PDSQPPG+S +GPPGGT + +G+ Sbjct: 364 PQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISPLGPPGGTPSSSVGS 423 Query: 2708 VQ-----TXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNAL 2544 V T G DQ+K +DL +K G T + D WTAHKTETG VYYYNAL Sbjct: 424 VHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNAK-VDAWTAHKTETGVVYYYNAL 482 Query: 2543 TGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQV 2364 TGESTYE+P+ F GEPDKVT TPVS EKL GTDW LVTTNDGKKYYYN++ K+SSWQV Sbjct: 483 TGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWALVTTNDGKKYYYNSKTKISSWQV 542 Query: 2363 PLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGP 2184 P+EV E+R+K + ++LK N VQN+ A ++K P+S++AP++NTGGR+A +LR SG Sbjct: 543 PMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPISVTAPAINTGGREATSLRPSGVA 602 Query: 2183 VSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKD 2004 SSSALDLIKKKLQD DLNG + VEA KG QSEN KDK+KD Sbjct: 603 GSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNGSRPVEAAVKGLQSEN-KDKVKD 661 Query: 2003 ANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKA 1824 NGDGN+ SGP+KEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFKA Sbjct: 662 INGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKA 721 Query: 1823 VPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRK 1644 VPGYSARRALFEH+VRT EGFKQLLEEASEDID + DYQTFK K Sbjct: 722 VPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFKQLLEEASEDIDQRTDYQTFKMK 781 Query: 1643 WGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSR 1464 WG+DPRFE LDRKERELLLNERVLPLKKAAEEK +AIR AA S FKS+LREK DINT+SR Sbjct: 782 WGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIRAAAASGFKSLLREKGDINTSSR 841 Query: 1463 WSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1284 WSRVKD LR+DPRYKSVKHEDRE+LFNEYIS Sbjct: 842 WSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADEEAEREAKVKREEEDKLKERERE 901 Query: 1283 XXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATN 1104 RLKV+RKEAVA YQALLVETIKDP+ SWTES+P+L+KDPQGRATN Sbjct: 902 MRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKDPQVSWTESRPRLEKDPQGRATN 961 Query: 1103 PDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHL 924 LD D EKLFREHVK+LYERCAR+FR +L EVIT E A+Q+T+DGKTVLTSWS AK L Sbjct: 962 SVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTEAASQMTNDGKTVLTSWSTAKRL 1021 Query: 923 LKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753 LK DPRYSKMPRKERE+LWRR+AEE+ K+KL SD KEEK N E + R S DS RSP Sbjct: 1022 LKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKEEKLNIETKARSSLDSGRSP 1078 Score = 62.4 bits (150), Expect = 6e-06 Identities = 34/57 (59%), Positives = 37/57 (64%), Gaps = 1/57 (1%) Frame = -1 Query: 3578 GNSQGGKLTP-TTAASLQPPAPGQSGHANQFVPGKFPQNMAAPLQPPYPVPRGHPSI 3411 G+SQ G TP TTAASLQPP PGQ GH N F PG Q MA+ P VP+G PSI Sbjct: 156 GHSQVGNSTPSTTAASLQPPVPGQPGHPNTFGPGTGAQFMASQGPSPVSVPKGAPSI 212 >XP_011624657.1 PREDICTED: pre-mRNA-processing protein 40C [Amborella trichopoda] Length = 1085 Score = 880 bits (2273), Expect = 0.0 Identities = 503/959 (52%), Positives = 598/959 (62%), Gaps = 11/959 (1%) Frame = -1 Query: 3590 ATAPGNSQGGKLT-PTTAASLQPPAPGQSG---HANQFVPGKFPQNMAAPLQPPYPVPRG 3423 ATA QGGK PT+AASLQPP PGQS H N + P + QN A +PP+ V +G Sbjct: 125 ATASNPMQGGKPAGPTSAASLQPPVPGQSSVSVHPNSWDPERPVQNALAQARPPFLVRKG 184 Query: 3422 HPSIXXXXXXXXXSQLPATAEASPKXXXXXXXXXXXXXXXXXXXXXXXXXTQSIVLPAHT 3243 PS + ++E S K Q+ LPA + Sbjct: 185 PPSTSGFSFSGNSQSV--SSEDSQKHQASNSDASAAVAQEAKTSQPSSSTAQTTPLPAPS 242 Query: 3242 XXXXSMIPPVPPNMYPTSSMWVQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGNANTV 3063 + PN Y T + ++ + Sbjct: 243 STTSRPVSS-SPNTYATP--FYMPKAPPFPGPPRLPVTPGTPGPPGIALSAPQLSSSVNI 299 Query: 3062 RPAIMDS-SVSLRPML-SPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHP 2889 RP+++D+ S +RP + S A N+ + Q Q IY PYP+LP + PPPQA+W+HP Sbjct: 300 RPSVIDTNSAIMRPNIASSAPGTSNAASVPITQTAQPPIYSPYPTLPGVVPPPQAMWMHP 359 Query: 2888 PQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDS-QPPGVSLIGPPGGTSPAIIG 2712 QMGGLQR + +R + +P V +PDS QPPGVS IGPPGG A G Sbjct: 360 SQMGGLQRPPFLPYPGTFPGPFPMPLRPITVPPVAMPDSSQPPGVSPIGPPGGIPLADHG 419 Query: 2711 A-VQ-TXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTG 2538 A +Q T GID+ K D TNKD + ED D WTAHKT+TGAVYYYNALTG Sbjct: 420 AGIQVTISEEQSPPPGIDKEKDTIDYTNKDDNAVSNEDTDQWTAHKTDTGAVYYYNALTG 479 Query: 2537 ESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPL 2358 ESTYEKP GFKGE DKV TPVS EKL GTDW LV TNDGKKYYYNT++K+SSWQVP Sbjct: 480 ESTYEKPPGFKGEVDKVILQRTPVSWEKLVGTDWALVATNDGKKYYYNTKSKISSWQVPP 539 Query: 2357 EVAEMRKKQESE-SLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPV 2181 EVAE+RKKQE++ +LKAN A VQNA +DKG V SLSAP++NTGGR+A+ +++ PV Sbjct: 540 EVAELRKKQEADAALKAN-APVQNAGISSDKGSVSSSLSAPAINTGGREAMTFKSATAPV 598 Query: 2180 SSSALDLIKKKLQD-XXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKD 2004 SSSALDLIKKKLQD D NG + V+ T KGQQSENSKDKLK Sbjct: 599 SSSALDLIKKKLQDSGMPVTSSALPSSTPVPTTSDANGQRVVDTTVKGQQSENSKDKLKV 658 Query: 2003 ANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKA 1824 A G++ SGPTKEEC+IQFKEMLKE+G+APFSKWEKELPKILFDPRFKA Sbjct: 659 AQEVGHVSDSSSDSEDVDSGPTKEECVIQFKEMLKEKGIAPFSKWEKELPKILFDPRFKA 718 Query: 1823 VPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRK 1644 +PGY+ RR+LFEHFVRT EGFKQLLE ASEDI+HK DY+TFK+K Sbjct: 719 IPGYTERRSLFEHFVRTRAEEERKEKRAAQKAAIEGFKQLLEGASEDINHKTDYETFKKK 778 Query: 1643 WGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSR 1464 WG DPRF LDRKERE+LLNERVLPL+KA EEK +AIR AAV+SFKSML EK DIN SR Sbjct: 779 WGYDPRFVALDRKEREMLLNERVLPLRKAVEEKTQAIRAAAVASFKSMLHEKVDINIGSR 838 Query: 1463 WSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1284 WS+VKD LRNDPRYKSVKHEDREVLF EYIS Sbjct: 839 WSKVKDSLRNDPRYKSVKHEDREVLFLEYISELKAAEQEADRAAKAKREEEEKLKERERE 898 Query: 1283 XXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATN 1104 R K RRK+AV SYQALL E IKDPKASWTESKPKL+KDP GRATN Sbjct: 899 LRKRKEREEQEVERVRQKARRKDAVVSYQALLTERIKDPKASWTESKPKLEKDPLGRATN 958 Query: 1103 PDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHL 924 P+L+ AD EKLFREHVKVL ERCAR+FR++L+EVIT E AAQ ++DGKT+L SWS AK L Sbjct: 959 PELEPADMEKLFREHVKVLNERCAREFRSLLAEVITPEAAAQASEDGKTLLNSWSTAKKL 1018 Query: 923 LKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSPPA 747 L+PDPRY KMPR+ERESLW+RYAE+M R+Q+ S+ KEEK+N + +R + S++S P+ Sbjct: 1019 LRPDPRYEKMPRRERESLWQRYAEDMDRRQRAASEQKEEKTNIDDPSRRPAGSSKSSPS 1077 >XP_010906097.1 PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Elaeis guineensis] Length = 1097 Score = 879 bits (2271), Expect = 0.0 Identities = 474/776 (61%), Positives = 540/776 (69%), Gaps = 3/776 (0%) Frame = -1 Query: 3068 TVRPAIMDSSVSLRPMLSP-ASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLH 2892 T +PA + S LRPM+ P S PP ST + QN+QQQ Y PYPSLP PPPQALWLH Sbjct: 330 TSQPAGTNPS-PLRPMVPPPVSLPPTSTPVPVQQNIQQQFYQPYPSLPGTIPPPQALWLH 388 Query: 2891 PPQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGTSPAIIG 2712 PPQ GGLQR + + G+P P++PLP QPPGV + G S + G Sbjct: 389 PPQAGGLQRAPFLPYSGVLPAPFQLPVHGMPPPAIPLPSIQPPGVPTVANQGPASTTM-G 447 Query: 2711 AVQTXXXXXXXXXG--IDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTG 2538 + Q+ ID K AND +KDG++ K E+AD WTAHKTE+G VYYYN++TG Sbjct: 448 SSQSGSNVGIESPSVGIDHEKHAND-PHKDGESTKNEEADAWTAHKTESGVVYYYNSVTG 506 Query: 2537 ESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPL 2358 ESTYE+P+ F GEP+ VT STPVS EKLAGT+W LVTTNDG+KYYY+T+NKVSSWQVP Sbjct: 507 ESTYERPSSFNGEPENVTAQSTPVSWEKLAGTNWTLVTTNDGRKYYYDTKNKVSSWQVPA 566 Query: 2357 EVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVS 2178 EV E+RK QES++LK NA + N +ADKG P+S+SAP+V TGGRD++ALRTSG VS Sbjct: 567 EVLELRKSQESDALKGNANQLTN---VADKGSAPISMSAPAVETGGRDSMALRTSGAAVS 623 Query: 2177 SSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDAN 1998 SSALDL+KKKLQD DLNG K+VE KGQQ NSKDK+KD Sbjct: 624 SSALDLVKKKLQD-AGTPVTSSPVPTPGPVASDLNGSKAVETAPKGQQGTNSKDKVKD-- 680 Query: 1997 GDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVP 1818 DGNM SGPTKEECI QFKEMLKERGVAPFSKWEKELPKI+FDPRFKAVP Sbjct: 681 -DGNMSDSSSDSDDEESGPTKEECISQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAVP 739 Query: 1817 GYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWG 1638 YSAR+ +FEHFVRT + FKQLLEEASE+IDHK DYQTFKRKWG Sbjct: 740 SYSARKTIFEHFVRTRVEEERKEKRAAQKAAIDAFKQLLEEASEEIDHKTDYQTFKRKWG 799 Query: 1637 NDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWS 1458 +DPRF LDRKERELLLNE+V KAAEEK++AIR AAV+SFKSMLR+ +DI TTSRWS Sbjct: 800 SDPRFGVLDRKERELLLNEKV----KAAEEKMQAIRMAAVTSFKSMLRDNKDITTTSRWS 855 Query: 1457 RVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1278 RVK+ LRNDPRYK+VKHE+R LFNEYIS Sbjct: 856 RVKENLRNDPRYKAVKHEERVTLFNEYISELKAVEEEAERSARAKRDEQEKLKEREREMR 915 Query: 1277 XXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPD 1098 RLKVRRKEAVASYQALLVETIKDPKASWTESKPKL+KDPQGRATNPD Sbjct: 916 KRKEREEQEMERVRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLEKDPQGRATNPD 975 Query: 1097 LDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLK 918 L + D EKLFR+HVK LYERCAR FR +LSEVITAE AAQ TDDGKT+L SWSEAK LLK Sbjct: 976 LGQGDAEKLFRDHVKDLYERCARGFRLLLSEVITAEAAAQTTDDGKTILNSWSEAKRLLK 1035 Query: 917 PDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSPP 750 PDPRYSKMP K+RE LWRRYAE+M RKQK SD K EK + + RNR SSD +R P Sbjct: 1036 PDPRYSKMPGKDREYLWRRYAEDMMRKQKPASDPK-EKPDTDGRNRTSSDFSRRSP 1090 >XP_010654542.1 PREDICTED: pre-mRNA-processing protein 40C isoform X4 [Vitis vinifera] Length = 848 Score = 861 bits (2225), Expect = 0.0 Identities = 472/778 (60%), Positives = 533/778 (68%), Gaps = 7/778 (0%) Frame = -1 Query: 3065 VRPAIMDSSVSLRPMLSPASFP--PNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLH 2892 V A MD S S+ +S A FP P S+ P +QQQIYP Y SLPA Q WL Sbjct: 71 VPSASMDFSSSV---VSRAIFPAAPVSSNPA----IQQQIYPSYSSLPATNASSQGPWLQ 123 Query: 2891 PPQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGT--SPAI 2718 PPQMGGL R + G+PLPSVPLPDSQPPGV+ +G GGT S A+ Sbjct: 124 PPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAV 183 Query: 2717 IG---AVQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNA 2547 G A + GID NK N KDG A E D WTAHKT+TG VYYYNA Sbjct: 184 SGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG-AAVNEQVDAWTAHKTDTGVVYYYNA 242 Query: 2546 LTGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQ 2367 LTGESTYEKP+ FKGE DKVT TPVS EKL GTDW LVTTNDGKKYYYNT+ K+SSWQ Sbjct: 243 LTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQ 302 Query: 2366 VPLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGG 2187 +P E+ EMRKKQ+S +LK +A N + +KG P++LSAP+V TGGRDA LRTS Sbjct: 303 IPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAV 362 Query: 2186 PVSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLK 2007 P S+SALD+IKKKLQD +LNG + +E T KG QSENSKDKLK Sbjct: 363 PGSASALDMIKKKLQD-SGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSENSKDKLK 421 Query: 2006 DANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFK 1827 D NGDGNM SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFK Sbjct: 422 DTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFK 481 Query: 1826 AVPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKR 1647 A+PGYSARR+LFEH+VRT EGFKQLLEEASEDIDHK +YQTF++ Sbjct: 482 AIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRK 541 Query: 1646 KWGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTS 1467 KWG+DPRFE LDRK+RELLLNERVLPLK+AAEEK +AIR AAVSSFKSMLR+K DI T++ Sbjct: 542 KWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTST 601 Query: 1466 RWSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1287 RWSRVKD LRNDPRYK VKHEDRE+LFNEYIS Sbjct: 602 RWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERER 661 Query: 1286 XXXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRAT 1107 RLKVRRKEAV+SYQALLVETIKDP+ SWTESKPKL+KDPQ RAT Sbjct: 662 ELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARAT 721 Query: 1106 NPDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKH 927 N DLD +D EKLFREH+K+L+ER A +FRA+LSEV+TAE A Q T+DGKTVLTSWS AK Sbjct: 722 NSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKR 781 Query: 926 LLKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753 LL+ D RY KMPRK+RES+WRRY+EEM RKQKL D EEK + EV+ R S DS R P Sbjct: 782 LLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEK-HTEVKGRSSVDSGRFP 838 >XP_010654535.1 PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Vitis vinifera] Length = 903 Score = 861 bits (2225), Expect = 0.0 Identities = 472/778 (60%), Positives = 533/778 (68%), Gaps = 7/778 (0%) Frame = -1 Query: 3065 VRPAIMDSSVSLRPMLSPASFP--PNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLH 2892 V A MD S S+ +S A FP P S+ P +QQQIYP Y SLPA Q WL Sbjct: 126 VPSASMDFSSSV---VSRAIFPAAPVSSNPA----IQQQIYPSYSSLPATNASSQGPWLQ 178 Query: 2891 PPQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGT--SPAI 2718 PPQMGGL R + G+PLPSVPLPDSQPPGV+ +G GGT S A+ Sbjct: 179 PPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAV 238 Query: 2717 IG---AVQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNA 2547 G A + GID NK N KDG A E D WTAHKT+TG VYYYNA Sbjct: 239 SGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG-AAVNEQVDAWTAHKTDTGVVYYYNA 297 Query: 2546 LTGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQ 2367 LTGESTYEKP+ FKGE DKVT TPVS EKL GTDW LVTTNDGKKYYYNT+ K+SSWQ Sbjct: 298 LTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQ 357 Query: 2366 VPLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGG 2187 +P E+ EMRKKQ+S +LK +A N + +KG P++LSAP+V TGGRDA LRTS Sbjct: 358 IPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAV 417 Query: 2186 PVSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLK 2007 P S+SALD+IKKKLQD +LNG + +E T KG QSENSKDKLK Sbjct: 418 PGSASALDMIKKKLQD-SGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSENSKDKLK 476 Query: 2006 DANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFK 1827 D NGDGNM SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFK Sbjct: 477 DTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFK 536 Query: 1826 AVPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKR 1647 A+PGYSARR+LFEH+VRT EGFKQLLEEASEDIDHK +YQTF++ Sbjct: 537 AIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRK 596 Query: 1646 KWGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTS 1467 KWG+DPRFE LDRK+RELLLNERVLPLK+AAEEK +AIR AAVSSFKSMLR+K DI T++ Sbjct: 597 KWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTST 656 Query: 1466 RWSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1287 RWSRVKD LRNDPRYK VKHEDRE+LFNEYIS Sbjct: 657 RWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERER 716 Query: 1286 XXXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRAT 1107 RLKVRRKEAV+SYQALLVETIKDP+ SWTESKPKL+KDPQ RAT Sbjct: 717 ELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARAT 776 Query: 1106 NPDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKH 927 N DLD +D EKLFREH+K+L+ER A +FRA+LSEV+TAE A Q T+DGKTVLTSWS AK Sbjct: 777 NSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKR 836 Query: 926 LLKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753 LL+ D RY KMPRK+RES+WRRY+EEM RKQKL D EEK + EV+ R S DS R P Sbjct: 837 LLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEK-HTEVKGRSSVDSGRFP 893 >XP_010654529.1 PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Vitis vinifera] Length = 1013 Score = 861 bits (2225), Expect = 0.0 Identities = 472/778 (60%), Positives = 533/778 (68%), Gaps = 7/778 (0%) Frame = -1 Query: 3065 VRPAIMDSSVSLRPMLSPASFP--PNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLH 2892 V A MD S S+ +S A FP P S+ P +QQQIYP Y SLPA Q WL Sbjct: 236 VPSASMDFSSSV---VSRAIFPAAPVSSNPA----IQQQIYPSYSSLPATNASSQGPWLQ 288 Query: 2891 PPQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGT--SPAI 2718 PPQMGGL R + G+PLPSVPLPDSQPPGV+ +G GGT S A+ Sbjct: 289 PPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAV 348 Query: 2717 IG---AVQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNA 2547 G A + GID NK N KDG A E D WTAHKT+TG VYYYNA Sbjct: 349 SGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG-AAVNEQVDAWTAHKTDTGVVYYYNA 407 Query: 2546 LTGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQ 2367 LTGESTYEKP+ FKGE DKVT TPVS EKL GTDW LVTTNDGKKYYYNT+ K+SSWQ Sbjct: 408 LTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQ 467 Query: 2366 VPLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGG 2187 +P E+ EMRKKQ+S +LK +A N + +KG P++LSAP+V TGGRDA LRTS Sbjct: 468 IPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAV 527 Query: 2186 PVSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLK 2007 P S+SALD+IKKKLQD +LNG + +E T KG QSENSKDKLK Sbjct: 528 PGSASALDMIKKKLQD-SGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSENSKDKLK 586 Query: 2006 DANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFK 1827 D NGDGNM SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFK Sbjct: 587 DTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFK 646 Query: 1826 AVPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKR 1647 A+PGYSARR+LFEH+VRT EGFKQLLEEASEDIDHK +YQTF++ Sbjct: 647 AIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRK 706 Query: 1646 KWGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTS 1467 KWG+DPRFE LDRK+RELLLNERVLPLK+AAEEK +AIR AAVSSFKSMLR+K DI T++ Sbjct: 707 KWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTST 766 Query: 1466 RWSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1287 RWSRVKD LRNDPRYK VKHEDRE+LFNEYIS Sbjct: 767 RWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERER 826 Query: 1286 XXXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRAT 1107 RLKVRRKEAV+SYQALLVETIKDP+ SWTESKPKL+KDPQ RAT Sbjct: 827 ELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARAT 886 Query: 1106 NPDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKH 927 N DLD +D EKLFREH+K+L+ER A +FRA+LSEV+TAE A Q T+DGKTVLTSWS AK Sbjct: 887 NSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKR 946 Query: 926 LLKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753 LL+ D RY KMPRK+RES+WRRY+EEM RKQKL D EEK + EV+ R S DS R P Sbjct: 947 LLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEK-HTEVKGRSSVDSGRFP 1003 >XP_002272014.2 PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Vitis vinifera] CBI27460.3 unnamed protein product, partial [Vitis vinifera] Length = 1046 Score = 861 bits (2225), Expect = 0.0 Identities = 472/778 (60%), Positives = 533/778 (68%), Gaps = 7/778 (0%) Frame = -1 Query: 3065 VRPAIMDSSVSLRPMLSPASFP--PNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLH 2892 V A MD S S+ +S A FP P S+ P +QQQIYP Y SLPA Q WL Sbjct: 269 VPSASMDFSSSV---VSRAIFPAAPVSSNPA----IQQQIYPSYSSLPATNASSQGPWLQ 321 Query: 2891 PPQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGT--SPAI 2718 PPQMGGL R + G+PLPSVPLPDSQPPGV+ +G GGT S A+ Sbjct: 322 PPQMGGLPRPPFVPYPAVYPTPFPLPAHGMPLPSVPLPDSQPPGVTPVGTAGGTPISAAV 381 Query: 2717 IG---AVQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNA 2547 G A + GID NK N KDG A E D WTAHKT+TG VYYYNA Sbjct: 382 SGHHLANTSGMLSELPPPGIDDNKHVNGAGTKDG-AAVNEQVDAWTAHKTDTGVVYYYNA 440 Query: 2546 LTGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQ 2367 LTGESTYEKP+ FKGE DKVT TPVS EKL GTDW LVTTNDGKKYYYNT+ K+SSWQ Sbjct: 441 LTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVTTNDGKKYYYNTKTKLSSWQ 500 Query: 2366 VPLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGG 2187 +P E+ EMRKKQ+S +LK +A N + +KG P++LSAP+V TGGRDA LRTS Sbjct: 501 IPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALSAPAVTTGGRDATPLRTSAV 560 Query: 2186 PVSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLK 2007 P S+SALD+IKKKLQD +LNG + +E T KG QSENSKDKLK Sbjct: 561 PGSASALDMIKKKLQD-SGAPATSSPVHSSGPIASELNGSRVIEPTVKGLQSENSKDKLK 619 Query: 2006 DANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFK 1827 D NGDGNM SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFK Sbjct: 620 DTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFK 679 Query: 1826 AVPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKR 1647 A+PGYSARR+LFEH+VRT EGFKQLLEEASEDIDHK +YQTF++ Sbjct: 680 AIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQLLEEASEDIDHKTEYQTFRK 739 Query: 1646 KWGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTS 1467 KWG+DPRFE LDRK+RELLLNERVLPLK+AAEEK +AIR AAVSSFKSMLR+K DI T++ Sbjct: 740 KWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRAAAVSSFKSMLRDKGDITTST 799 Query: 1466 RWSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1287 RWSRVKD LRNDPRYK VKHEDRE+LFNEYIS Sbjct: 800 RWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEEVEREAKSKKEEQDKLKERER 859 Query: 1286 XXXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRAT 1107 RLKVRRKEAV+SYQALLVETIKDP+ SWTESKPKL+KDPQ RAT Sbjct: 860 ELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDPQVSWTESKPKLEKDPQARAT 919 Query: 1106 NPDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKH 927 N DLD +D EKLFREH+K+L+ER A +FRA+LSEV+TAE A Q T+DGKTVLTSWS AK Sbjct: 920 NSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEAATQETEDGKTVLTSWSTAKR 979 Query: 926 LLKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753 LL+ D RY KMPRK+RES+WRRY+EEM RKQKL D EEK + EV+ R S DS R P Sbjct: 980 LLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEEK-HTEVKGRSSVDSGRFP 1036 >XP_012467146.1 PREDICTED: pre-mRNA-processing protein 40C [Gossypium raimondii] KJB15267.1 hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 887 Score = 833 bits (2152), Expect = 0.0 Identities = 449/750 (59%), Positives = 518/750 (69%), Gaps = 2/750 (0%) Frame = -1 Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841 SP+S P P +L VQQQ+YPPY SLP+M PQ W+ P MGG R Sbjct: 126 SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPT 185 Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664 G+PLP+ P DSQPPGV +G P S A + GID Sbjct: 186 VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAILTGFPPQGID 244 Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484 K +D+T K ++A E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT Sbjct: 245 NRKLVHDVTTKV-ESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 303 Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304 TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE K NA Sbjct: 304 VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 363 Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124 SV N +A+KG P+SLSAP+VNTGGRDA+ LRTS P SSSALDLIKKKLQD Sbjct: 364 VSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 422 Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944 +LNG ++V+ KG QSE++KDKLKDANGDG++ SG Sbjct: 423 SSSPVPVVPVTATHELNGSRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 480 Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764 P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T Sbjct: 481 PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 540 Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584 EGFKQLL+EASEDIDH +YQTFKRKWG+DPRFE LDRK+RELLLN Sbjct: 541 EERKEKRAAQKAAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLN 600 Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404 ERVL LK+AAEEK RAIR AA SSFKSML+EK DIN SRWSRVKD LR+DPRYK VKHE Sbjct: 601 ERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 660 Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224 DREVLFNEYIS RLKVR Sbjct: 661 DREVLFNEYISELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVR 720 Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044 RKEAVAS+QALLVETIKDP+ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+ Sbjct: 721 RKEAVASFQALLVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLF 780 Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864 ERC DFRA+L+EVIT + AQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR Sbjct: 781 ERCVNDFRALLAEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 840 Query: 863 RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774 RYAE+M RKQK D +EEK + +V+ R S Sbjct: 841 RYAEDMLRKQKSALDQEEEK-HTDVKGRSS 869 >KJB15269.1 hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 886 Score = 829 bits (2141), Expect = 0.0 Identities = 449/750 (59%), Positives = 518/750 (69%), Gaps = 2/750 (0%) Frame = -1 Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841 SP+S P P +L VQQQ+YPPY SLP+M PQ W+ P MGG R Sbjct: 126 SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPT 185 Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664 G+PLP+ P DSQPPGV +G P S A + GID Sbjct: 186 VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAILTGFPPQGID 244 Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484 K +D+T K ++A E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT Sbjct: 245 NRKLVHDVTTKV-ESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 303 Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304 TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE K NA Sbjct: 304 VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 363 Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124 SV N +A+KG P+SLSAP+VNTGGRDA+ LRTS P SSSALDLIKKKLQD Sbjct: 364 VSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 422 Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944 +LNG ++V+ KG QSE++KDKLKDANGDG++ SG Sbjct: 423 SSSPVPVVPVTATHELNGSRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 480 Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764 P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T Sbjct: 481 PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 540 Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584 EGFKQLL+EASEDIDH +YQTFKRKWG+DPRFE LDRK+RELLLN Sbjct: 541 EERKEKRAAQKAAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLN 600 Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404 ERVL LK+AAEEK RAIR AA SSFKSML+EK DIN SRWSRVKD LR+DPRYK VKHE Sbjct: 601 ERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 660 Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224 DREVLFNEYIS RLKVR Sbjct: 661 DREVLFNEYIS-ELKAIEEKAERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVR 719 Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044 RKEAVAS+QALLVETIKDP+ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+ Sbjct: 720 RKEAVASFQALLVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLF 779 Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864 ERC DFRA+L+EVIT + AQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR Sbjct: 780 ERCVNDFRALLAEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 839 Query: 863 RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774 RYAE+M RKQK D +EEK + +V+ R S Sbjct: 840 RYAEDMLRKQKSALDQEEEK-HTDVKGRSS 868 >KJB15270.1 hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 888 Score = 829 bits (2141), Expect = 0.0 Identities = 450/751 (59%), Positives = 518/751 (68%), Gaps = 3/751 (0%) Frame = -1 Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841 SP+S P P +L VQQQ+YPPY SLP+M PQ W+ P MGG R Sbjct: 126 SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPT 185 Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664 G+PLP+ P DSQPPGV +G P S A + GID Sbjct: 186 VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAILTGFPPQGID 244 Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484 K +D+T K ++A E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT Sbjct: 245 NRKLVHDVTTKV-ESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 303 Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKV-SSWQVPLEVAEMRKKQESESLKAN 2307 TPVS E+LAGTDW LVTTNDGKKYYYN++ KV SSWQ+P EV E+RKKQ+SE K N Sbjct: 304 VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKVISSWQIPNEVTELRKKQDSEVSKEN 363 Query: 2306 AASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXX 2127 A SV N +A+KG P+SLSAP+VNTGGRDA+ LRTS P SSSALDLIKKKLQD Sbjct: 364 AVSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP 423 Query: 2126 XXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXS 1947 +LNG ++V+ KG QSE++KDKLKDANGDG++ S Sbjct: 424 SSSPVPVVPVTATH-ELNGSRAVDV--KGLQSESNKDKLKDANGDGSISDSSSDSEDADS 480 Query: 1946 GPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXX 1767 GP+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T Sbjct: 481 GPSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRA 540 Query: 1766 XXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLL 1587 EGFKQLL+EASEDIDH +YQTFKRKWG+DPRFE LDRK+RELLL Sbjct: 541 EEERKEKRAAQKAAIEGFKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLL 600 Query: 1586 NERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKH 1407 NERVL LK+AAEEK RAIR AA SSFKSML+EK DIN SRWSRVKD LR+DPRYK VKH Sbjct: 601 NERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKH 660 Query: 1406 EDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKV 1227 EDREVLFNEYIS RLKV Sbjct: 661 EDREVLFNEYISELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKV 720 Query: 1226 RRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVL 1047 RRKEAVAS+QALLVETIKDP+ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L Sbjct: 721 RRKEAVASFQALLVETIKDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKML 780 Query: 1046 YERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLW 867 +ERC DFRA+L+EVIT + AQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LW Sbjct: 781 FERCVNDFRALLAEVITQDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALW 840 Query: 866 RRYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774 RRYAE+M RKQK D +EEK + +V+ R S Sbjct: 841 RRYAEDMLRKQKSALDQEEEK-HTDVKGRSS 870 >XP_016707727.1 PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Gossypium hirsutum] Length = 886 Score = 827 bits (2136), Expect = 0.0 Identities = 447/750 (59%), Positives = 517/750 (68%), Gaps = 2/750 (0%) Frame = -1 Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841 SP+S P P +L VQQQ+YPPY SLP+M PQ W+ P MGG R Sbjct: 125 SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPT 184 Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664 G+PLP+ P DSQPPG +G P S A + GID Sbjct: 185 VYPGPFPSTSSGMPLPA-PSSDSQPPGFRPLGMSPFAPSAAALANQSLAILTGFPPQGID 243 Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484 K +D+T K ++A E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT Sbjct: 244 NRKLVHDVTTKV-ESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 302 Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304 TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE K NA Sbjct: 303 VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 362 Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124 SV N +A+KG P+SLSAP+VNTGGRDA+ LRTS P SSSALDLIKKKLQD Sbjct: 363 VSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 421 Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944 +LNG ++V+ KG QSE++KDKLKDANGDG++ SG Sbjct: 422 SSSPVPVMPVTATHELNGLRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 479 Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764 P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T Sbjct: 480 PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 539 Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584 EGFKQLL+EASEDI H +YQTFKRKWG+DPRFE LDRK+RELLLN Sbjct: 540 EERKEKRAAQKAAIEGFKQLLDEASEDIGHDTNYQTFKRKWGSDPRFEALDRKDRELLLN 599 Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404 ERVL LK+AAEEK RAIR AA SSFKSML+EK DIN SRWSRVKD LR+DPRYK VKHE Sbjct: 600 ERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 659 Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224 DREVLFNEYIS RLKVR Sbjct: 660 DREVLFNEYISELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVR 719 Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044 RKEAVAS+QALLVETIKD +ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+ Sbjct: 720 RKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLF 779 Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864 ERC DFRA+L++VIT + AAQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR Sbjct: 780 ERCVNDFRALLAKVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 839 Query: 863 RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774 RYAE+M RKQKL D +EEK + +V+ R S Sbjct: 840 RYAEDMLRKQKLALDQEEEK-HTDVKGRSS 868 >XP_016707728.1 PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Gossypium hirsutum] Length = 885 Score = 823 bits (2125), Expect = 0.0 Identities = 447/750 (59%), Positives = 517/750 (68%), Gaps = 2/750 (0%) Frame = -1 Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841 SP+S P P +L VQQQ+YPPY SLP+M PQ W+ P MGG R Sbjct: 125 SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPT 184 Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664 G+PLP+ P DSQPPG +G P S A + GID Sbjct: 185 VYPGPFPSTSSGMPLPA-PSSDSQPPGFRPLGMSPFAPSAAALANQSLAILTGFPPQGID 243 Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484 K +D+T K ++A E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT Sbjct: 244 NRKLVHDVTTKV-ESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 302 Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304 TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE K NA Sbjct: 303 VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 362 Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124 SV N +A+KG P+SLSAP+VNTGGRDA+ LRTS P SSSALDLIKKKLQD Sbjct: 363 VSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 421 Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944 +LNG ++V+ KG QSE++KDKLKDANGDG++ SG Sbjct: 422 SSSPVPVMPVTATHELNGLRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 479 Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764 P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T Sbjct: 480 PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 539 Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584 EGFKQLL+EASEDI H +YQTFKRKWG+DPRFE LDRK+RELLLN Sbjct: 540 EERKEKRAAQKAAIEGFKQLLDEASEDIGHDTNYQTFKRKWGSDPRFEALDRKDRELLLN 599 Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404 ERVL LK+AAEEK RAIR AA SSFKSML+EK DIN SRWSRVKD LR+DPRYK VKHE Sbjct: 600 ERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 659 Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224 DREVLFNEYIS RLKVR Sbjct: 660 DREVLFNEYIS-ELKAIEEKAERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVR 718 Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044 RKEAVAS+QALLVETIKD +ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+ Sbjct: 719 RKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLF 778 Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864 ERC DFRA+L++VIT + AAQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR Sbjct: 779 ERCVNDFRALLAKVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 838 Query: 863 RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774 RYAE+M RKQKL D +EEK + +V+ R S Sbjct: 839 RYAEDMLRKQKLALDQEEEK-HTDVKGRSS 867 >XP_017637434.1 PREDICTED: pre-mRNA-processing protein 40C [Gossypium arboreum] Length = 885 Score = 822 bits (2123), Expect = 0.0 Identities = 444/750 (59%), Positives = 517/750 (68%), Gaps = 2/750 (0%) Frame = -1 Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841 SP+S P P +L VQQQ+YPPY SLP+M PQ W+ P +GG R Sbjct: 125 SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPLGGFPRPPFVPYPT 184 Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664 G+PLP+ P DSQPPGV +G P S A + GID Sbjct: 185 VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAIQTGFPPQGID 243 Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484 K +D++ + ++A E +D WTAHKT+TG VYYYNALTGESTYEKPAGFKGEPD+VT Sbjct: 244 NRKLGHDVSTRV-ESAVNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVT 302 Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304 TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE K NA Sbjct: 303 VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPYEVTELRKKQDSEVSKENA 362 Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124 V N +A+KG P+SLSAP+VNTGGRDA+ LRTS P SSSALDLIKKKLQD Sbjct: 363 VPVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 421 Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944 +LNG ++V+ KG QSE++KDKLKDANGDG++ SG Sbjct: 422 SSSPVPVMPVTATHELNGSRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 479 Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764 P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T Sbjct: 480 PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 539 Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584 EGF+QLL+EASEDIDH +YQTFKRKWG+DPRFE LDRK+RELLLN Sbjct: 540 EERKEKRAAQKAAIEGFRQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLN 599 Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404 ERVL LK+AAEEK R IR AA SSFKSML+EK DIN SRWSRVKD LR+DPRYK VKHE Sbjct: 600 ERVLLLKRAAEEKARVIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 659 Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224 DREVLFNEYIS RLKVR Sbjct: 660 DREVLFNEYIS-ELKAIEEKAERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVR 718 Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044 RKEAVAS+QALLVETIKD +ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+ Sbjct: 719 RKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLF 778 Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864 ERC DFRA+L+EVIT + AAQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR Sbjct: 779 ERCVNDFRALLAEVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 838 Query: 863 RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774 RYAE+M RKQKL D +EEK + +V+ R S Sbjct: 839 RYAEDMLRKQKLALDQEEEK-HTDVKGRSS 867 >GAV80419.1 WW domain-containing protein/FF domain-containing protein, partial [Cephalotus follicularis] Length = 980 Score = 823 bits (2127), Expect = 0.0 Identities = 441/766 (57%), Positives = 512/766 (66%), Gaps = 5/766 (0%) Frame = -1 Query: 3035 SLRPMLSPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXX 2856 SL P+ A +++ VQQQ+YP YPSLPAM PQ LW+HPPQMGG+ R Sbjct: 209 SLVPVTKGAPSNADTSTAVSQAGVQQQMYPTYPSLPAMAASPQGLWVHPPQMGGMPRPPF 268 Query: 2855 XXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-----PPGGTSPAIIGAVQTXXX 2691 R V LPSV DSQPPGV+ +G P +P V T Sbjct: 269 LPYPAVYPGPFLAPARNVALPSVLSLDSQPPGVTPMGTTGAIPMSSAAPGHHLVVTTGIQ 328 Query: 2690 XXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAG 2511 GID +D+TN A + ++ WTA +T+TG VYYYNA+TGESTYEKP G Sbjct: 329 TELPPPGIDDRTHYHDVTNNGA--AFNKQSEVWTAFRTDTGNVYYYNAITGESTYEKPPG 386 Query: 2510 FKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQ 2331 FK EPDKV +P E L GTDWVLV+TNDGKKYYYN++ K+SSWQ+P EVAE+RKKQ Sbjct: 387 FKVEPDKVPMQPSPTLMEYLPGTDWVLVSTNDGKKYYYNSKTKLSSWQIPTEVAELRKKQ 446 Query: 2330 ESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKK 2151 + + K + SV N + L +KG P+SLSAP+VNTGGRDA ALRTSG P SSSALDLIKK Sbjct: 447 DDDVSKEHPISVPNTNVLTEKGSSPISLSAPAVNTGGRDATALRTSGVPGSSSALDLIKK 506 Query: 2150 KLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXX 1971 KLQD + NG ++VEAT KG QSENSKDKLKDANGDGN+ Sbjct: 507 KLQDPGAPITSSLTPASSGTAALESNGSRAVEATVKGLQSENSKDKLKDANGDGNVSDSS 566 Query: 1970 XXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALF 1791 SGPTKE C++QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LF Sbjct: 567 SDSEDVDSGPTKEVCLVQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLF 626 Query: 1790 EHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLD 1611 EH+V+T EGFKQLLEEASEDIDH DYQTFK+KW +DPRFE LD Sbjct: 627 EHYVKTRAEEERKEKRAAQKVAIEGFKQLLEEASEDIDHYTDYQTFKKKWDSDPRFEALD 686 Query: 1610 RKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRND 1431 RK+RELLLNERVLPLK+AAEEK +AIR AA S FKSMLREK DI SRWS+VKD LRND Sbjct: 687 RKDRELLLNERVLPLKRAAEEKAQAIRVAAASDFKSMLREKGDITAISRWSKVKDVLRND 746 Query: 1430 PRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1251 PRYKSVKHEDRE+LF++YI+ Sbjct: 747 PRYKSVKHEDREILFSQYIAELKAVEEEAEREAKAKKHEQERLKERERELRKRKEREEQE 806 Query: 1250 XXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKL 1071 R+KVRRKEAVAS QALLVETIKDP+ASWTESKPKL+KDPQGRATNPD D D EKL Sbjct: 807 VERVRVKVRRKEAVASLQALLVETIKDPQASWTESKPKLEKDPQGRATNPDFDPYDIEKL 866 Query: 1070 FREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMP 891 FREH+K+L++RCA DF+A+LSEV+T E A Q DGKT L SWS AK LLKPD RY++MP Sbjct: 867 FREHIKILHQRCAHDFKALLSEVVTTEAAVQ-KSDGKTALNSWSTAKRLLKPDARYNRMP 925 Query: 890 RKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753 RK+RE LWRRY EEM RKQK D D K+EK + + + R S DS R P Sbjct: 926 RKDREGLWRRYVEEMLRKQKPDFDQKDEK-HKDAKGRSSIDSGRLP 970 >XP_016703241.1 PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Gossypium hirsutum] Length = 886 Score = 818 bits (2112), Expect = 0.0 Identities = 440/750 (58%), Positives = 516/750 (68%), Gaps = 2/750 (0%) Frame = -1 Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841 SP+S P P +L VQQQ+YPPY SLP+M PQ W+ P +GG R Sbjct: 125 SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPLGGFPRPPFVPYPT 184 Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664 G+PLP+ P DSQPPGV +G P S A + GID Sbjct: 185 VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAIQTGFPPQGID 243 Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484 K +D++ + ++A E +D WTAHKT+TG VYYYNALTGES+YEKPAGFKGEPD+VT Sbjct: 244 NRKLGHDVSTRV-ESAVNEQSDVWTAHKTDTGVVYYYNALTGESSYEKPAGFKGEPDQVT 302 Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304 TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE K NA Sbjct: 303 VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 362 Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124 V N +A+KG P+SLSAP+VNTGGRDA+ LRTS P SSSALDLIKKKLQD Sbjct: 363 VPVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 421 Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944 +LNG ++V+ KG QSE++KDKLKDANGDG++ SG Sbjct: 422 SSSPVPVMPVTATHELNGSRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 479 Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764 P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T Sbjct: 480 PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 539 Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584 EGF+QLL+EASEDIDH +YQTFKR+WG+DPRFE LDRK+R LLLN Sbjct: 540 EERKEKRAAQKAAIEGFRQLLDEASEDIDHDTNYQTFKRQWGSDPRFEALDRKDRGLLLN 599 Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404 ERVL LK+AAEEK R IR AA SSFKSML+EK DIN SRWSRVKD LR+DPRYK VKHE Sbjct: 600 ERVLLLKRAAEEKARVIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 659 Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224 DREVLF+EYIS RLKVR Sbjct: 660 DREVLFDEYISELKAIEEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVR 719 Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044 RKEAVAS+QALLVETIKD +ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+ Sbjct: 720 RKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAVNPDLDSSDMEKLFREHIKMLF 779 Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864 ERC DFRA+L+EVIT + AAQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR Sbjct: 780 ERCVNDFRALLAEVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 839 Query: 863 RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774 RYAE+M RKQKL D +EEK + +V+ R S Sbjct: 840 RYAEDMLRKQKLALDQEEEK-HTDVKGRSS 868 >XP_018840821.1 PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Juglans regia] Length = 1013 Score = 822 bits (2122), Expect = 0.0 Identities = 440/775 (56%), Positives = 517/775 (66%), Gaps = 5/775 (0%) Frame = -1 Query: 3068 TVRPAIMDSSVSLRPMLSPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHP 2889 TV DSS S P P T P L+ + Q PY S PAM PPQ +WL P Sbjct: 237 TVLSVATDSSSSAVPR------PTMPTAPVLSSSAVQTANYPYASFPAMAAPPQGMWLQP 290 Query: 2888 PQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG--PPGGTSPAII 2715 QMGGL R + RG+ LPSVPLPDSQPPGV+ +G P S A Sbjct: 291 SQMGGLPRSPFQPYPAAFPGPFPLPARGMALPSVPLPDSQPPGVTPLGTAPTISVSSAAS 350 Query: 2714 G---AVQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNAL 2544 G A GID K ++ +DG A KE D WTAHKTE G VYYYNA+ Sbjct: 351 GHMLAGTLRMQPELPPPGIDNRKNVEEVGTQDG-AAVKEQLDAWTAHKTEAGVVYYYNAV 409 Query: 2543 TGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQV 2364 TGESTY+KP GFKGE DKV TPVS+ + GTDWVLVTT+DGKKYYYN++ K+SSWQ+ Sbjct: 410 TGESTYDKPLGFKGEHDKVHVQPTPVSTTSILGTDWVLVTTSDGKKYYYNSKTKISSWQI 469 Query: 2363 PLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGP 2184 P EV E++KKQ+ E ++ S+ +A+ +KG P+SL+AP+++TGGRDA+AL+ P Sbjct: 470 PSEVTELKKKQDGE----HSISLPHANLSTEKGSAPISLNAPAISTGGRDAMALKALAVP 525 Query: 2183 VSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKD 2004 SSSALD+IKKKLQD +LNG ++V+ T KG QSE+S+DKLKD Sbjct: 526 GSSSALDMIKKKLQDSGSPITSSPNPAPSGIAASELNGSRAVDTTVKGLQSEDSRDKLKD 585 Query: 2003 ANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKA 1824 ANGDGNM SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFKA Sbjct: 586 ANGDGNMSDSSSDSEDADSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKA 645 Query: 1823 VPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRK 1644 +P YSARR+LFEH+V+T EGFKQLL EASEDIDH DYQTF++K Sbjct: 646 IPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLGEASEDIDHNTDYQTFRKK 705 Query: 1643 WGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSR 1464 WG DPRFE LDRK+RE LLNERV PLKKAAEEK++A+R AA +SFKSMLREK DI SR Sbjct: 706 WGADPRFEVLDRKDREHLLNERVFPLKKAAEEKVQALRAAAATSFKSMLREKRDITANSR 765 Query: 1463 WSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1284 WS+VKD LRND RYKS KHEDRE+ FNEYIS Sbjct: 766 WSKVKDSLRNDSRYKSAKHEDREIFFNEYISELKAGEEQSEREAKAKREEQEKLKERERE 825 Query: 1283 XXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATN 1104 RLKVRRKEAVAS+QALLVE IKDP+ASWTESKPKL+KDPQGRATN Sbjct: 826 LRKRKEREEQEMERVRLKVRRKEAVASFQALLVEIIKDPQASWTESKPKLEKDPQGRATN 885 Query: 1103 PDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHL 924 DLD +D EKLFREH+K+L ERC ++FR +L+EV+TAE AAQ T++GKTVL SWS AK L Sbjct: 886 TDLDPSDIEKLFREHIKMLNERCVQEFRYLLAEVLTAEAAAQETEEGKTVLNSWSTAKRL 945 Query: 923 LKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSAR 759 LKPDPRY+KMPRKERE LWRRYA+E+ R+QK+ D KEEK + E + R S+DS R Sbjct: 946 LKPDPRYNKMPRKEREVLWRRYADEILRRQKVALDQKEEKKHVESKGRNSADSGR 1000 >JAT41262.1 Transcription elongation regulator 1, partial [Anthurium amnicola] Length = 1216 Score = 828 bits (2139), Expect = 0.0 Identities = 489/954 (51%), Positives = 572/954 (59%), Gaps = 8/954 (0%) Frame = -1 Query: 3590 ATAPGNS--QGGKLTPTTAA-SLQPPAPGQSGHANQFVPGKFPQNMAAPLQPPYPVPRGH 3420 +TA GN+ QG L P++A SLQ GQS +PG QN +Q P + Sbjct: 277 STAIGNNNLQGETLAPSSAPPSLQSSVRGQSSALRSTLPGTAKQNPPTLMQLPSSTSFSY 336 Query: 3419 PSIXXXXXXXXXSQLPATAEASPKXXXXXXXXXXXXXXXXXXXXXXXXXTQSIVLPAHTX 3240 + P E S K +QS+ + A Sbjct: 337 SG----------NSQPGIVETSEKTVSPNSNASSAIAAEPVAAAVAPISSQSMQMSAQVP 386 Query: 3239 XXXSMIPPVPPNMYPTSSMWVQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGNANTVR 3060 S P PN + VQ NA TVR Sbjct: 387 PSFSTNVPSSPNPNVAT---VQVPVIPSFARPPGIPGNVGPGPAGLASCVSPSSNA-TVR 442 Query: 3059 PAIMDSSVSLRPML-SPASFPPNS-TVPT-LAQNVQQQIYPPYPSLPAMTPPPQALWLHP 2889 P ++DSS S RP+L +PAS P NS + P + QNVQQQ YPPYPS+ A PPPQA WLH Sbjct: 443 PVLVDSS-SARPILPAPASIPTNSVSAPAPIPQNVQQQSYPPYPSITA-APPPQAPWLHA 500 Query: 2888 PQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGTSPAIIGA 2709 Q + ++ +P P VPLP QPPGVS I GGT A I Sbjct: 501 SHAVSFQHAPFLPYPGALCTPFPLPMQSMPSPYVPLPSLQPPGVSTIVVSGGTKSASIEP 560 Query: 2708 VQTXXXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGEST 2529 VQ NK A D T KDGD AKK+ + WTAHKT+ GA+YYYN+LTGEST Sbjct: 561 VQPGNNFIAQSPSGTDNKLATDPTIKDGDIAKKDGSGPWTAHKTDAGAIYYYNSLTGEST 620 Query: 2528 YEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVA 2349 YEKP+GFKGEP KV TPVS EKLAGTDW LVTTNDGKKYYYN++ KVSSWQ+P EVA Sbjct: 621 YEKPSGFKGEPGKVVCQPTPVSWEKLAGTDWSLVTTNDGKKYYYNSKTKVSSWQIPSEVA 680 Query: 2348 EMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSA 2169 E++ + S+ K S+QNAS DKG VSL+AP+V TGGRDA +T +SSSA Sbjct: 681 ELKNNEVSDHSKEGTNSIQNASVTDDKGSSLVSLNAPAVQTGGRDAATSKTPAPLISSSA 740 Query: 2168 LDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDG 1989 LDLIKKKLQD DL+GPK+VE T KGQ SENSKDKLK NGD Sbjct: 741 LDLIKKKLQD-AGTPMTSLPLPTSVPTLSDLSGPKAVETTAKGQHSENSKDKLKGINGDA 799 Query: 1988 NMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYS 1809 N+ SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRFKAV +S Sbjct: 800 NLSESSSDSDDADSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIIFDPRFKAVQSHS 859 Query: 1808 ARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDP 1629 RR+LFEH+VRT EGFKQLL+E SEDI+HK DYQ+FKRKWG DP Sbjct: 860 VRRSLFEHYVRTRADEERKEKRAAQKALIEGFKQLLDEVSEDINHKTDYQSFKRKWGRDP 919 Query: 1628 RFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVK 1449 RFE L RKE+E LL ER+L LKK EEK +A+R ++FK +LREK +++ +SRWSRVK Sbjct: 920 RFEALGRKEKEALLTERILSLKKVVEEKTQAVR----ANFKCLLREKAEVSASSRWSRVK 975 Query: 1448 DGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1269 D LRNDPRY++VKHEDREV FNE+IS Sbjct: 976 DSLRNDPRYRAVKHEDREVFFNEHISELKEAEAEAQLAVKAKIEEQEKLKKREQEMRKRK 1035 Query: 1268 XXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDK 1089 RL+VRRKEA +SYQALLVETIKDPKASWTESKPKL+KDPQGRA NPDLD+ Sbjct: 1036 QREEQEMEAVRLRVRRKEAESSYQALLVETIKDPKASWTESKPKLEKDPQGRAANPDLDQ 1095 Query: 1088 ADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDP 909 AD EKLFREHVK LYERCAR++RA+L+E+ITAE AA+VTDDGKTVLTSWSEAK LLKPD Sbjct: 1096 ADMEKLFREHVKNLYERCAREYRALLAELITAEVAARVTDDGKTVLTSWSEAKKLLKPDS 1155 Query: 908 RYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSD--SARSP 753 RYSKMP KERES+W R+A+E+ RK K SD+K E+ + EV+ R S RSP Sbjct: 1156 RYSKMPSKERESIWSRHADEIHRKLKSASDIK-ERVDGEVKGRASCTDIGGRSP 1208 >XP_018840830.1 PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Juglans regia] Length = 1011 Score = 819 bits (2116), Expect = 0.0 Identities = 437/777 (56%), Positives = 516/777 (66%), Gaps = 7/777 (0%) Frame = -1 Query: 3068 TVRPAIMDSSVSLRPMLSPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHP 2889 TV DSS S P P T P L+ + Q PY S PAM PPQ +WL P Sbjct: 237 TVLSVATDSSSSAVPR------PTMPTAPVLSSSAVQTANYPYASFPAMAAPPQGMWLQP 290 Query: 2888 PQMGGLQRXXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGGTSPAIIGA 2709 QMGGL R + RG+ LPSVPLPDSQPPGV+ P GT+P I + Sbjct: 291 SQMGGLPRSPFQPYPAAFPGPFPLPARGMALPSVPLPDSQPPGVT----PLGTAPTISVS 346 Query: 2708 VQTXXXXXXXXXGI-------DQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYN 2550 + D K ++ +DG A KE D WTAHKTE G VYYYN Sbjct: 347 SAASGHMLAGTLRMQPELPPPDNRKNVEEVGTQDG-AAVKEQLDAWTAHKTEAGVVYYYN 405 Query: 2549 ALTGESTYEKPAGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSW 2370 A+TGESTY+KP GFKGE DKV TPVS+ + GTDWVLVTT+DGKKYYYN++ K+SSW Sbjct: 406 AVTGESTYDKPLGFKGEHDKVHVQPTPVSTTSILGTDWVLVTTSDGKKYYYNSKTKISSW 465 Query: 2369 QVPLEVAEMRKKQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSG 2190 Q+P EV E++KKQ+ E ++ S+ +A+ +KG P+SL+AP+++TGGRDA+AL+ Sbjct: 466 QIPSEVTELKKKQDGE----HSISLPHANLSTEKGSAPISLNAPAISTGGRDAMALKALA 521 Query: 2189 GPVSSSALDLIKKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKL 2010 P SSSALD+IKKKLQD +LNG ++V+ T KG QSE+S+DKL Sbjct: 522 VPGSSSALDMIKKKLQDSGSPITSSPNPAPSGIAASELNGSRAVDTTVKGLQSEDSRDKL 581 Query: 2009 KDANGDGNMXXXXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRF 1830 KDANGDGNM SGPTKEECIIQFKEMLKERGVAPFSKWEKELPKI+FDPRF Sbjct: 582 KDANGDGNMSDSSSDSEDADSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRF 641 Query: 1829 KAVPGYSARRALFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFK 1650 KA+P YSARR+LFEH+V+T EGFKQLL EASEDIDH DYQTF+ Sbjct: 642 KAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLGEASEDIDHNTDYQTFR 701 Query: 1649 RKWGNDPRFETLDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTT 1470 +KWG DPRFE LDRK+RE LLNERV PLKKAAEEK++A+R AA +SFKSMLREK DI Sbjct: 702 KKWGADPRFEVLDRKDREHLLNERVFPLKKAAEEKVQALRAAAATSFKSMLREKRDITAN 761 Query: 1469 SRWSRVKDGLRNDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXX 1290 SRWS+VKD LRND RYKS KHEDRE+ FNEYIS Sbjct: 762 SRWSKVKDSLRNDSRYKSAKHEDREIFFNEYISELKAGEEQSEREAKAKREEQEKLKERE 821 Query: 1289 XXXXXXXXXXXXXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRA 1110 RLKVRRKEAVAS+QALLVE IKDP+ASWTESKPKL+KDPQGRA Sbjct: 822 RELRKRKEREEQEMERVRLKVRRKEAVASFQALLVEIIKDPQASWTESKPKLEKDPQGRA 881 Query: 1109 TNPDLDKADTEKLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAK 930 TN DLD +D EKLFREH+K+L ERC ++FR +L+EV+TAE AAQ T++GKTVL SWS AK Sbjct: 882 TNTDLDPSDIEKLFREHIKMLNERCVQEFRYLLAEVLTAEAAAQETEEGKTVLNSWSTAK 941 Query: 929 HLLKPDPRYSKMPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSAR 759 LLKPDPRY+KMPRKERE LWRRYA+E+ R+QK+ D KEEK + E + R S+DS R Sbjct: 942 RLLKPDPRYNKMPRKEREVLWRRYADEILRRQKVALDQKEEKKHVESKGRNSADSGR 998 >XP_015895736.1 PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Ziziphus jujuba] Length = 982 Score = 817 bits (2111), Expect = 0.0 Identities = 436/768 (56%), Positives = 515/768 (67%), Gaps = 4/768 (0%) Frame = -1 Query: 3044 SSVSLRPMLSPASFPPNSTVPTLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQR 2865 SS+ RP + P NS++ Q QI YPSLPA+ PQ LWL PPQMGG+ R Sbjct: 214 SSMVQRPGMPTGPVPLNSSI-------QPQIGASYPSLPALAGHPQGLWLQPPQMGGMPR 266 Query: 2864 XXXXXXXXXXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIGPPGG----TSPAIIGAVQTX 2697 + G+ LPSVP+PD QPPGV+ + G ++ + + V Sbjct: 267 QPVVPYSAAFPGPLPLMAHGMHLPSVPVPDPQPPGVTPVENSGSIPVSSTASSLQLVGPS 326 Query: 2696 XXXXXXXXGIDQNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKP 2517 + ND+ +D A E D WTAHKT+TG VYYYNALTGESTY KP Sbjct: 327 GMHTLVHKSAGDRTKVNDVGVQDR-AAINEQLDAWTAHKTDTGVVYYYNALTGESTYAKP 385 Query: 2516 AGFKGEPDKVTTHSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRK 2337 A FKGEPDKV+ PVS L GTDWVLVTT+DGKKYY N + KVSSWQ+P EV E++K Sbjct: 386 ADFKGEPDKVSVQPIPVSMVNLPGTDWVLVTTSDGKKYYCNNKTKVSSWQIPNEVTELKK 445 Query: 2336 KQESESLKANAASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLI 2157 K + E K + SV N S + +KG +SLS P++NTGGRDAIALR+SG SSSALDLI Sbjct: 446 KPDGEVSKEHLMSVPNTSVVMEKGSTTISLSTPAINTGGRDAIALRSSGVQPSSSALDLI 505 Query: 2156 KKKLQDXXXXXXXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXX 1977 KKKLQD + NG K+VEATTKGQQSENSKDKLKDANGDGN Sbjct: 506 KKKLQDSGAPVVSSPVPAPSGMTGSESNGSKAVEATTKGQQSENSKDKLKDANGDGNFSD 565 Query: 1976 XXXXXXXXXSGPTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRA 1797 SGPTKEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P YSARR+ Sbjct: 566 SSSDSEDADSGPTKEECIVQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRS 625 Query: 1796 LFEHFVRTXXXXXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFET 1617 LFEH+V+T EGFKQLL+EASE+IDH+ DYQTF++KWGNDPRF Sbjct: 626 LFEHYVKTRVEEERKEKRAAQKAAIEGFKQLLDEASEEIDHETDYQTFRKKWGNDPRFMA 685 Query: 1616 LDRKERELLLNERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLR 1437 LDRK+RE LLNERVLPLK+AAEEK +AIR AA S FKSMLREK DI SRWSRVKD LR Sbjct: 686 LDRKDRENLLNERVLPLKRAAEEKAQAIRAAAASGFKSMLREKGDITVNSRWSRVKDSLR 745 Query: 1436 NDPRYKSVKHEDREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1257 NDPRYKSVKHEDREVLFNEY++ Sbjct: 746 NDPRYKSVKHEDREVLFNEYLADLRATEEEAEREAKLKRQEQDKLKERERELRKRKEREE 805 Query: 1256 XXXXXXRLKVRRKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTE 1077 R+KVRRKEA+AS+QALLVETIKDP+ASWTESK KL+KDPQGRA NPDLD + E Sbjct: 806 QEMERVRVKVRRKEAIASFQALLVETIKDPQASWTESKTKLEKDPQGRAANPDLDSLEME 865 Query: 1076 KLFREHVKVLYERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSK 897 KLFREH+K+L+ERCAR+F+ +L+EV+TA+ AAQ T+DGKTVL SWS AK LLK DPRY+K Sbjct: 866 KLFREHIKMLHERCAREFKTLLAEVLTADAAAQETEDGKTVLNSWSTAKRLLKRDPRYNK 925 Query: 896 MPRKERESLWRRYAEEMQRKQKLDSDLKEEKSNPEVRNRISSDSARSP 753 MPRK+RE+LWRR+AEEM RKQK + + KE+K + ++R + +S R P Sbjct: 926 MPRKDREALWRRHAEEMLRKQKSELERKEDK-KIDAKSRSTIESGRFP 972 >XP_016703242.1 PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Gossypium hirsutum] Length = 885 Score = 813 bits (2101), Expect = 0.0 Identities = 440/750 (58%), Positives = 516/750 (68%), Gaps = 2/750 (0%) Frame = -1 Query: 3017 SPASFPPNSTVP-TLAQNVQQQIYPPYPSLPAMTPPPQALWLHPPQMGGLQRXXXXXXXX 2841 SP+S P P +L VQQQ+YPPY SLP+M PQ W+ P +GG R Sbjct: 125 SPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPLGGFPRPPFVPYPT 184 Query: 2840 XXXXXXXVQIRGVPLPSVPLPDSQPPGVSLIG-PPGGTSPAIIGAVQTXXXXXXXXXGID 2664 G+PLP+ P DSQPPGV +G P S A + GID Sbjct: 185 VYPGPFPSTSSGMPLPA-PSSDSQPPGVRPLGMSPFAPSAAALANQSLAIQTGFPPQGID 243 Query: 2663 QNKQANDLTNKDGDTAKKEDADTWTAHKTETGAVYYYNALTGESTYEKPAGFKGEPDKVT 2484 K +D++ + ++A E +D WTAHKT+TG VYYYNALTGES+YEKPAGFKGEPD+VT Sbjct: 244 NRKLGHDVSTRV-ESAVNEQSDVWTAHKTDTGVVYYYNALTGESSYEKPAGFKGEPDQVT 302 Query: 2483 THSTPVSSEKLAGTDWVLVTTNDGKKYYYNTRNKVSSWQVPLEVAEMRKKQESESLKANA 2304 TPVS E+LAGTDW LVTTNDGKKYYYN++ K+SSWQ+P EV E+RKKQ+SE K NA Sbjct: 303 VQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENA 362 Query: 2303 ASVQNASALADKGYVPVSLSAPSVNTGGRDAIALRTSGGPVSSSALDLIKKKLQDXXXXX 2124 V N +A+KG P+SLSAP+VNTGGRDA+ LRTS P SSSALDLIKKKLQD Sbjct: 363 VPVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQD-PGVP 421 Query: 2123 XXXXXXXXXXXXXPDLNGPKSVEATTKGQQSENSKDKLKDANGDGNMXXXXXXXXXXXSG 1944 +LNG ++V+ KG QSE++KDKLKDANGDG++ SG Sbjct: 422 SSSPVPVMPVTATHELNGSRAVD--VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSG 479 Query: 1943 PTKEECIIQFKEMLKERGVAPFSKWEKELPKILFDPRFKAVPGYSARRALFEHFVRTXXX 1764 P+KEECI+QFKEMLKERGVAPFSKWEKELPKI+FDPRFKA+P +SARR+LFEH+V+T Sbjct: 480 PSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAE 539 Query: 1763 XXXXXXXXXXXXXXEGFKQLLEEASEDIDHKIDYQTFKRKWGNDPRFETLDRKERELLLN 1584 EGF+QLL+EASEDIDH +YQTFKR+WG+DPRFE LDRK+R LLLN Sbjct: 540 EERKEKRAAQKAAIEGFRQLLDEASEDIDHDTNYQTFKRQWGSDPRFEALDRKDRGLLLN 599 Query: 1583 ERVLPLKKAAEEKLRAIRTAAVSSFKSMLREKEDINTTSRWSRVKDGLRNDPRYKSVKHE 1404 ERVL LK+AAEEK R IR AA SSFKSML+EK DIN SRWSRVKD LR+DPRYK VKHE Sbjct: 600 ERVLLLKRAAEEKARVIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHE 659 Query: 1403 DREVLFNEYISXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRLKVR 1224 DREVLF+EYIS RLKVR Sbjct: 660 DREVLFDEYIS-ELKAIEEKAERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVR 718 Query: 1223 RKEAVASYQALLVETIKDPKASWTESKPKLDKDPQGRATNPDLDKADTEKLFREHVKVLY 1044 RKEAVAS+QALLVETIKD +ASWTESKPKL+KDPQGRA NPDLD +D EKLFREH+K+L+ Sbjct: 719 RKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAVNPDLDSSDMEKLFREHIKMLF 778 Query: 1043 ERCARDFRAVLSEVITAERAAQVTDDGKTVLTSWSEAKHLLKPDPRYSKMPRKERESLWR 864 ERC DFRA+L+EVIT + AAQ T+ GKT L SWS AK LLKPDPRY+KMPRKERE+LWR Sbjct: 779 ERCVNDFRALLAEVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWR 838 Query: 863 RYAEEMQRKQKLDSDLKEEKSNPEVRNRIS 774 RYAE+M RKQKL D +EEK + +V+ R S Sbjct: 839 RYAEDMLRKQKLALDQEEEK-HTDVKGRSS 867