BLASTX nr result
ID: Acanthopanax21_contig00021484
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Acanthopanax21_contig00021484 (2170 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|KZN05762.1| hypothetical protein DCAR_006599 [Daucus carota s... 832 0.0 ref|XP_017235946.1| PREDICTED: pre-mRNA-processing protein 40C [... 832 0.0 ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C i... 785 0.0 ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C i... 785 0.0 ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C i... 785 0.0 ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C i... 785 0.0 ref|XP_021896567.1| LOW QUALITY PROTEIN: pre-mRNA-processing pro... 755 0.0 ref|XP_011073766.1| pre-mRNA-processing protein 40C [Sesamum ind... 753 0.0 dbj|GAV80419.1| WW domain-containing protein/FF domain-containin... 759 0.0 ref|XP_021292779.1| pre-mRNA-processing protein 40C [Herrania um... 752 0.0 ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [... 754 0.0 gb|EOY01154.1| Pre-mRNA-processing protein 40C [Theobroma cacao] 748 0.0 gb|KJB15269.1| hypothetical protein B456_002G167700 [Gossypium r... 750 0.0 gb|OVA12114.1| WW domain [Macleaya cordata] 755 0.0 gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium r... 749 0.0 ref|XP_016707727.1| PREDICTED: pre-mRNA-processing protein 40C-l... 747 0.0 ref|XP_007045322.2| PREDICTED: pre-mRNA-processing protein 40C, ... 748 0.0 ref|XP_017637434.1| PREDICTED: pre-mRNA-processing protein 40C [... 747 0.0 gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sin... 743 0.0 ref|XP_016703241.1| PREDICTED: pre-mRNA-processing protein 40C-l... 744 0.0 >gb|KZN05762.1| hypothetical protein DCAR_006599 [Daucus carota subsp. sativus] Length = 841 Score = 832 bits (2149), Expect = 0.0 Identities = 425/628 (67%), Positives = 475/628 (75%) Frame = +2 Query: 2 LDAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVT 181 LDAWTAH+TETGAVYYYNAVTGESTYEKP+ FKGE E+VA QPTP+SWERL +TDWTLVT Sbjct: 215 LDAWTAHKTETGAVYYYNAVTGESTYEKPAGFKGEAERVATQPTPISWERLGTTDWTLVT 274 Query: 182 TNDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 TNDGKRYYYNTKTKLSSWQ+PTEVTELKKKQ +DA KEQ SVP+ +TEKESAP+ S Sbjct: 275 TNDGKRYYYNTKTKLSSWQVPTEVTELKKKQDLDATKEQPTSVPNAVAVTEKESAPIILS 334 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 APAVNTGGRDA LRSP VPG+SSALD++KRKLQ+ G PA NGS Sbjct: 335 APAVNTGGRDAAPLRSPNVPGASSALDMVKRKLQDSGTPATPTPVSSVTGTVASELNGSR 394 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 +E A NGT++E KDK KD NGD + P+KEECIIQFK MLKERGV Sbjct: 395 TLENA-GNGTQVEIHKDKHKDDNGDDPMSDSSSDSEDVDTRPSKEECIIQFKAMLKERGV 453 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAI GYS RRSLFEHY F+Q Sbjct: 454 APFSKWEKELPKIVFDPRFKAIAGYSARRSLFEHYVRTRAEEERKEKRAAQKAAVEAFKQ 513 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LLEEAK+DIDHNTD++ FKKKWGHDPRFEALDRKERE L+NERVLPLKK AQ K QAM A Sbjct: 514 LLEEAKKDIDHNTDHHAFKKKWGHDPRFEALDRKERENLLNERVLPLKKEAQAKDQAMRA 573 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AAAS+FKSML+DRGDI+ SSRWSKVKD +RN+ YKSVKH+DREVLFNE+IS+LKS Sbjct: 574 AAASNFKSMLRDRGDITASSRWSKVKDSIRNEQWYKSVKHEDREVLFNEFISDLKSAEQE 633 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 ARRKEAVESYQALLVETIKD Sbjct: 634 AERIVKAKRDEEEKLKEIERQTRKRKEREEQEVERVRSKARRKEAVESYQALLVETIKDA 693 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 QASW ESK KLEKDPQGRAT+ LDQSDLEKLFREHVKML+ERC RDF+ALLAEV+T EA Sbjct: 694 QASWTESKLKLEKDPQGRATKYHLDQSDLEKLFREHVKMLNERCTRDFRALLAEVITAEA 753 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 ++E++DGKTVFTSWSTAKHLLK D+RYTKMPRKERESLWRRHVDDMQRR KL+LN++ E Sbjct: 754 AMKERDDGKTVFTSWSTAKHLLKADVRYTKMPRKERESLWRRHVDDMQRRLKLSLNEQTE 813 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 KH +E K+ P V++GK+ SGSRR HE+R Sbjct: 814 KHSLEAKNHPAVEAGKHHSGSRRNHEKR 841 >ref|XP_017235946.1| PREDICTED: pre-mRNA-processing protein 40C [Daucus carota subsp. sativus] Length = 858 Score = 832 bits (2149), Expect = 0.0 Identities = 425/628 (67%), Positives = 475/628 (75%) Frame = +2 Query: 2 LDAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVT 181 LDAWTAH+TETGAVYYYNAVTGESTYEKP+ FKGE E+VA QPTP+SWERL +TDWTLVT Sbjct: 232 LDAWTAHKTETGAVYYYNAVTGESTYEKPAGFKGEAERVATQPTPISWERLGTTDWTLVT 291 Query: 182 TNDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 TNDGKRYYYNTKTKLSSWQ+PTEVTELKKKQ +DA KEQ SVP+ +TEKESAP+ S Sbjct: 292 TNDGKRYYYNTKTKLSSWQVPTEVTELKKKQDLDATKEQPTSVPNAVAVTEKESAPIILS 351 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 APAVNTGGRDA LRSP VPG+SSALD++KRKLQ+ G PA NGS Sbjct: 352 APAVNTGGRDAAPLRSPNVPGASSALDMVKRKLQDSGTPATPTPVSSVTGTVASELNGSR 411 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 +E A NGT++E KDK KD NGD + P+KEECIIQFK MLKERGV Sbjct: 412 TLENA-GNGTQVEIHKDKHKDDNGDDPMSDSSSDSEDVDTRPSKEECIIQFKAMLKERGV 470 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAI GYS RRSLFEHY F+Q Sbjct: 471 APFSKWEKELPKIVFDPRFKAIAGYSARRSLFEHYVRTRAEEERKEKRAAQKAAVEAFKQ 530 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LLEEAK+DIDHNTD++ FKKKWGHDPRFEALDRKERE L+NERVLPLKK AQ K QAM A Sbjct: 531 LLEEAKKDIDHNTDHHAFKKKWGHDPRFEALDRKERENLLNERVLPLKKEAQAKDQAMRA 590 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AAAS+FKSML+DRGDI+ SSRWSKVKD +RN+ YKSVKH+DREVLFNE+IS+LKS Sbjct: 591 AAASNFKSMLRDRGDITASSRWSKVKDSIRNEQWYKSVKHEDREVLFNEFISDLKSAEQE 650 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 ARRKEAVESYQALLVETIKD Sbjct: 651 AERIVKAKRDEEEKLKEIERQTRKRKEREEQEVERVRSKARRKEAVESYQALLVETIKDA 710 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 QASW ESK KLEKDPQGRAT+ LDQSDLEKLFREHVKML+ERC RDF+ALLAEV+T EA Sbjct: 711 QASWTESKLKLEKDPQGRATKYHLDQSDLEKLFREHVKMLNERCTRDFRALLAEVITAEA 770 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 ++E++DGKTVFTSWSTAKHLLK D+RYTKMPRKERESLWRRHVDDMQRR KL+LN++ E Sbjct: 771 AMKERDDGKTVFTSWSTAKHLLKADVRYTKMPRKERESLWRRHVDDMQRRLKLSLNEQTE 830 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 KH +E K+ P V++GK+ SGSRR HE+R Sbjct: 831 KHSLEAKNHPAVEAGKHHSGSRRNHEKR 858 >ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C isoform X4 [Vitis vinifera] Length = 848 Score = 785 bits (2027), Expect = 0.0 Identities = 400/628 (63%), Positives = 458/628 (72%) Frame = +2 Query: 2 LDAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVT 181 +DAWTAH+T+TG VYYYNA+TGESTYEKPSDFKGE +KV VQPTPVSWE+L TDW LVT Sbjct: 224 VDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVT 283 Query: 182 TNDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 TNDGK+YYYNTKTKLSSWQIPTE+TE++KKQ A KE ++ P+ + TEK +P++ S Sbjct: 284 TNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALS 343 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 APAV TGGRDAT LR+ VPGS+SALD+IK+KLQ+ GAPA NGS Sbjct: 344 APAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPA-TSSPVHSSGPIASELNGSR 402 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 +E V G + ENSKDKLKD NGDG + GPTKEECIIQFKEMLKERGV Sbjct: 403 VIE-PTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGV 461 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAIPGYS RRSLFEHY F+Q Sbjct: 462 APFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQ 521 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LLEEA EDIDH T+Y TF+KKWG DPRFEALDRK+RE+L+NERVLPLK+AA+EK QA+ A Sbjct: 522 LLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRA 581 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AA S FKSML+D+GDI+TS+RWS+VKD LRNDPRYK VKH+DRE+LFNEYISELK+ Sbjct: 582 AAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEE 641 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 RRKEAV SYQALLVETIKDP Sbjct: 642 VEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDP 701 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 Q SW ESKPKLEKDPQ RAT + LD SDLEKLFREH+KMLHER +F+ALL+EV+T EA Sbjct: 702 QVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEA 761 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 QE EDGKTV TSWSTAK LL+ D RY KMPRK+RES+WRR+ ++M R+QKLA +Q E Sbjct: 762 ATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEE 821 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 KH E K R VDSG++PSGSRR HERR Sbjct: 822 KH-TEVKGRSSVDSGRFPSGSRRAHERR 848 >ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Vitis vinifera] Length = 903 Score = 785 bits (2027), Expect = 0.0 Identities = 400/628 (63%), Positives = 458/628 (72%) Frame = +2 Query: 2 LDAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVT 181 +DAWTAH+T+TG VYYYNA+TGESTYEKPSDFKGE +KV VQPTPVSWE+L TDW LVT Sbjct: 279 VDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVT 338 Query: 182 TNDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 TNDGK+YYYNTKTKLSSWQIPTE+TE++KKQ A KE ++ P+ + TEK +P++ S Sbjct: 339 TNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALS 398 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 APAV TGGRDAT LR+ VPGS+SALD+IK+KLQ+ GAPA NGS Sbjct: 399 APAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPA-TSSPVHSSGPIASELNGSR 457 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 +E V G + ENSKDKLKD NGDG + GPTKEECIIQFKEMLKERGV Sbjct: 458 VIE-PTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGV 516 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAIPGYS RRSLFEHY F+Q Sbjct: 517 APFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQ 576 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LLEEA EDIDH T+Y TF+KKWG DPRFEALDRK+RE+L+NERVLPLK+AA+EK QA+ A Sbjct: 577 LLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRA 636 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AA S FKSML+D+GDI+TS+RWS+VKD LRNDPRYK VKH+DRE+LFNEYISELK+ Sbjct: 637 AAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEE 696 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 RRKEAV SYQALLVETIKDP Sbjct: 697 VEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDP 756 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 Q SW ESKPKLEKDPQ RAT + LD SDLEKLFREH+KMLHER +F+ALL+EV+T EA Sbjct: 757 QVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEA 816 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 QE EDGKTV TSWSTAK LL+ D RY KMPRK+RES+WRR+ ++M R+QKLA +Q E Sbjct: 817 ATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEE 876 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 KH E K R VDSG++PSGSRR HERR Sbjct: 877 KH-TEVKGRSSVDSGRFPSGSRRAHERR 903 >ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Vitis vinifera] Length = 1013 Score = 785 bits (2027), Expect = 0.0 Identities = 400/628 (63%), Positives = 458/628 (72%) Frame = +2 Query: 2 LDAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVT 181 +DAWTAH+T+TG VYYYNA+TGESTYEKPSDFKGE +KV VQPTPVSWE+L TDW LVT Sbjct: 389 VDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVT 448 Query: 182 TNDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 TNDGK+YYYNTKTKLSSWQIPTE+TE++KKQ A KE ++ P+ + TEK +P++ S Sbjct: 449 TNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALS 508 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 APAV TGGRDAT LR+ VPGS+SALD+IK+KLQ+ GAPA NGS Sbjct: 509 APAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPA-TSSPVHSSGPIASELNGSR 567 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 +E V G + ENSKDKLKD NGDG + GPTKEECIIQFKEMLKERGV Sbjct: 568 VIE-PTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGV 626 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAIPGYS RRSLFEHY F+Q Sbjct: 627 APFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQ 686 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LLEEA EDIDH T+Y TF+KKWG DPRFEALDRK+RE+L+NERVLPLK+AA+EK QA+ A Sbjct: 687 LLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRA 746 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AA S FKSML+D+GDI+TS+RWS+VKD LRNDPRYK VKH+DRE+LFNEYISELK+ Sbjct: 747 AAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEE 806 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 RRKEAV SYQALLVETIKDP Sbjct: 807 VEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDP 866 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 Q SW ESKPKLEKDPQ RAT + LD SDLEKLFREH+KMLHER +F+ALL+EV+T EA Sbjct: 867 QVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEA 926 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 QE EDGKTV TSWSTAK LL+ D RY KMPRK+RES+WRR+ ++M R+QKLA +Q E Sbjct: 927 ATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEE 986 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 KH E K R VDSG++PSGSRR HERR Sbjct: 987 KH-TEVKGRSSVDSGRFPSGSRRAHERR 1013 >ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Vitis vinifera] emb|CBI27460.3| unnamed protein product, partial [Vitis vinifera] Length = 1046 Score = 785 bits (2027), Expect = 0.0 Identities = 400/628 (63%), Positives = 458/628 (72%) Frame = +2 Query: 2 LDAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVT 181 +DAWTAH+T+TG VYYYNA+TGESTYEKPSDFKGE +KV VQPTPVSWE+L TDW LVT Sbjct: 422 VDAWTAHKTDTGVVYYYNALTGESTYEKPSDFKGEADKVTVQPTPVSWEKLTGTDWALVT 481 Query: 182 TNDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 TNDGK+YYYNTKTKLSSWQIPTE+TE++KKQ A KE ++ P+ + TEK +P++ S Sbjct: 482 TNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPIALS 541 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 APAV TGGRDAT LR+ VPGS+SALD+IK+KLQ+ GAPA NGS Sbjct: 542 APAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAPA-TSSPVHSSGPIASELNGSR 600 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 +E V G + ENSKDKLKD NGDG + GPTKEECIIQFKEMLKERGV Sbjct: 601 VIE-PTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKERGV 659 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAIPGYS RRSLFEHY F+Q Sbjct: 660 APFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGFKQ 719 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LLEEA EDIDH T+Y TF+KKWG DPRFEALDRK+RE+L+NERVLPLK+AA+EK QA+ A Sbjct: 720 LLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRA 779 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AA S FKSML+D+GDI+TS+RWS+VKD LRNDPRYK VKH+DRE+LFNEYISELK+ Sbjct: 780 AAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAEEE 839 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 RRKEAV SYQALLVETIKDP Sbjct: 840 VEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIKDP 899 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 Q SW ESKPKLEKDPQ RAT + LD SDLEKLFREH+KMLHER +F+ALL+EV+T EA Sbjct: 900 QVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTAEA 959 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 QE EDGKTV TSWSTAK LL+ D RY KMPRK+RES+WRR+ ++M R+QKLA +Q E Sbjct: 960 ATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQTEE 1019 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 KH E K R VDSG++PSGSRR HERR Sbjct: 1020 KH-TEVKGRSSVDSGRFPSGSRRAHERR 1046 >ref|XP_021896567.1| LOW QUALITY PROTEIN: pre-mRNA-processing protein 40C [Carica papaya] Length = 700 Score = 755 bits (1949), Expect = 0.0 Identities = 384/628 (61%), Positives = 453/628 (72%) Frame = +2 Query: 2 LDAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVT 181 LDAWTAH+TE+G VYYYNA+T +STYEKP DFKGE +KV VQPTPVS E L+ +DW LVT Sbjct: 77 LDAWTAHKTESGIVYYYNALTRQSTYEKPPDFKGEPDKVPVQPTPVSMECLSGSDWALVT 136 Query: 182 TNDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 TNDGK+YY+N+KTK+SSWQ+P EVT+L+KKQG + ++E S+SVP++D +TEK SA S S Sbjct: 137 TNDGKKYYFNSKTKISSWQVPNEVTDLRKKQGNEIFREHSLSVPNVDPVTEKGSASTSLS 196 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 PA+NTGGRDA A+R+ G PGSSSALD +K+KLQ+PGAPA NGS Sbjct: 197 TPAINTGGRDAIAIRTSGPPGSSSALDXVKKKLQDPGAPA-TSLPAPVSSVAASEVNGSK 255 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 VE A G + ENSKDKLKD GDG + GPTKEECIIQFKEMLKERG+ Sbjct: 256 VVELA--KGLQSENSKDKLKDTTGDGNMSDSSSDSEDADSGPTKEECIIQFKEMLKERGI 313 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAIP +S RRS+FEHY F+Q Sbjct: 314 APFSKWEKELPKIVFDPRFKAIPSHSARRSIFEHYVKTRAEEKRKEKRAAQKAAIEGFKQ 373 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LL+EA +DIDHNTDY TF+ KWG DPRFEAL+RK+REVL+NERVLPLK+AA++K QA+ Sbjct: 374 LLDEAFKDIDHNTDYQTFRMKWGGDPRFEALERKDREVLLNERVLPLKRAAEQKAQAIRE 433 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AAAS FKSML++R DI+ +SRWS+VKD LRND RYKSV H+DREVLFNEYISELK++ Sbjct: 434 AAASTFKSMLRERVDINVNSRWSRVKDSLRNDLRYKSVGHEDREVLFNEYISELKAIQAE 493 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 RRKEAV SY+ALLVET+KDP Sbjct: 494 ANREAKAKREEQEKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYKALLVETVKDP 553 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 QASW +SKPKLEKDPQGRAT LD SD EKLFREH+ MLHERC DFK +LAEV+T+EA Sbjct: 554 QASWTDSKPKLEKDPQGRATNPDLDPSDTEKLFREHITMLHERCVHDFKIMLAEVITLEA 613 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 QE E+GKT+ SWSTAK LLKVD RY+K+PRKERE WRR+V+DM RRQK A +Q+ E Sbjct: 614 AAQETEEGKTILNSWSTAKRLLKVDPRYSKIPRKERELFWRRYVEDMSRRQKAAHDQKEE 673 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 KH K R +DSG+ PSGSRR HE+R Sbjct: 674 KH-TNGKGRSSIDSGRLPSGSRRTHEQR 700 >ref|XP_011073766.1| pre-mRNA-processing protein 40C [Sesamum indicum] Length = 758 Score = 753 bits (1944), Expect = 0.0 Identities = 391/628 (62%), Positives = 446/628 (71%) Frame = +2 Query: 2 LDAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVT 181 LDAWTAHRTETG VYYYNA+TGESTYEKP FKGE +K VQPTP+SWE+L TDWTLVT Sbjct: 134 LDAWTAHRTETGTVYYYNALTGESTYEKPPGFKGESDKATVQPTPISWEKLTGTDWTLVT 193 Query: 182 TNDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 TNDGKRYYYNT T+LSSWQIP+EVTEL+KKQ DA K QS+SV + +TE+ V+ S Sbjct: 194 TNDGKRYYYNTTTQLSSWQIPSEVTELRKKQDADALKAQSVSVTATNIITERGPDAVNLS 253 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 PA NTGGRDATA+R P +SSALDLIK+KLQ+ G P NGS Sbjct: 254 TPAANTGGRDATAIR-PSSVSASSALDLIKKKLQDSGMPDSSSPGPSLSSAVALELNGSK 312 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 +EA+ + G EN+K+K KDAN DG I GPTKEECI+QFKEMLKERGV Sbjct: 313 PMEAS-IKGLLNENNKEKRKDANTDGDISNSSSDSEDEDGGPTKEECILQFKEMLKERGV 371 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAIP +S RR+LFEHY F+Q Sbjct: 372 APFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTRAEEERKEKRAAQKAALEGFKQ 431 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LLEEAKEDIDHNTDY TFK++WG DPRF+ALDRKERE L+NERVLPLK+ AQEK QA Sbjct: 432 LLEEAKEDIDHNTDYQTFKRRWGEDPRFQALDRKEREALLNERVLPLKRTAQEKAQAERV 491 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AA S+FKSML D+GDI++SSRWSKVK+ L+ DPRYKSVKH+DRE LFNEY++ELK+ Sbjct: 492 AAISNFKSMLHDKGDITSSSRWSKVKESLKCDPRYKSVKHEDREKLFNEYVAELKAAEEE 551 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 ARRKEA+ESYQALLVETIKDP Sbjct: 552 TVRKAKAKQDEEEKLKERERALRKRKEREEQEVERVRQKARRKEALESYQALLVETIKDP 611 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 QASW ESKPKLEKDPQGRA LD+SDLEKLFREHVK L+ERC +FKALL EV++ +A Sbjct: 612 QASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLYERCAVEFKALLTEVISADA 671 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 QE +DGKT TSWSTAK LLK D RY KMPRKERESLWRRH +++QR+QK +QE E Sbjct: 672 AAQETQDGKTAITSWSTAKQLLKNDPRYNKMPRKERESLWRRHAEEIQRKQKKVHDQEGE 731 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 K E KSR VDSGK+ SGSRR H+RR Sbjct: 732 K-PAEGKSRTSVDSGKHLSGSRRAHDRR 758 >dbj|GAV80419.1| WW domain-containing protein/FF domain-containing protein, partial [Cephalotus follicularis] Length = 980 Score = 759 bits (1959), Expect = 0.0 Identities = 393/627 (62%), Positives = 449/627 (71%) Frame = +2 Query: 5 DAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVTT 184 + WTA RT+TG VYYYNA+TGESTYEKP FK E +KV +QP+P E L TDW LV+T Sbjct: 357 EVWTAFRTDTGNVYYYNAITGESTYEKPPGFKVEPDKVPMQPSPTLMEYLPGTDWVLVST 416 Query: 185 NDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFSA 364 NDGK+YYYN+KTKLSSWQIPTEV EL+KKQ D KE ISVP+ + LTEK S+P+S SA Sbjct: 417 NDGKKYYYNSKTKLSSWQIPTEVAELRKKQDDDVSKEHPISVPNTNVLTEKGSSPISLSA 476 Query: 365 PAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSGA 544 PAVNTGGRDATALR+ GVPGSSSALDLIK+KLQ+PGAP NGS A Sbjct: 477 PAVNTGGRDATALRTSGVPGSSSALDLIKKKLQDPGAPITSSLTPASSGTAALESNGSRA 536 Query: 545 VEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGVA 724 VE A V G + ENSKDKLKDANGDG + GPTKE C++QFKEMLKERGVA Sbjct: 537 VE-ATVKGLQSENSKDKLKDANGDGNVSDSSSDSEDVDSGPTKEVCLVQFKEMLKERGVA 595 Query: 725 PFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQL 904 PFSKWEKELPKIVFDPRFKAIP +S RRSLFEHY F+QL Sbjct: 596 PFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKVAIEGFKQL 655 Query: 905 LEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHAA 1084 LEEA EDIDH TDY TFKKKW DPRFEALDRK+RE+L+NERVLPLK+AA+EK QA+ A Sbjct: 656 LEEASEDIDHYTDYQTFKKKWDSDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRVA 715 Query: 1085 AASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXXX 1264 AASDFKSML+++GDI+ SRWSKVKD LRNDPRYKSVKH+DRE+LF++YI+ELK+V Sbjct: 716 AASDFKSMLREKGDITAISRWSKVKDVLRNDPRYKSVKHEDREILFSQYIAELKAVEEEA 775 Query: 1265 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDPQ 1444 RRKEAV S QALLVETIKDPQ Sbjct: 776 EREAKAKKHEQERLKERERELRKRKEREEQEVERVRVKVRRKEAVASLQALLVETIKDPQ 835 Query: 1445 ASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEAG 1624 ASW ESKPKLEKDPQGRAT D D+EKLFREH+K+LH+RC DFKALL+EVVT EA Sbjct: 836 ASWTESKPKLEKDPQGRATNPDFDPYDIEKLFREHIKILHQRCAHDFKALLSEVVTTEAA 895 Query: 1625 LQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAEK 1804 +Q K DGKT SWSTAK LLK D RY +MPRK+RE LWRR+V++M R+QK +Q+ EK Sbjct: 896 VQ-KSDGKTALNSWSTAKRLLKPDARYNRMPRKDREGLWRRYVEEMLRKQKPDFDQKDEK 954 Query: 1805 HKIETKSRPPVDSGKYPSGSRRIHERR 1885 HK + K R +DSG+ PSGSRR ERR Sbjct: 955 HK-DAKGRSSIDSGRLPSGSRRTRERR 980 >ref|XP_021292779.1| pre-mRNA-processing protein 40C [Herrania umbratica] Length = 815 Score = 752 bits (1941), Expect = 0.0 Identities = 387/627 (61%), Positives = 452/627 (72%) Frame = +2 Query: 5 DAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVTT 184 D WTAH+T+TG VYYYNA+TGESTYEKP+ FKGE +KV VQPTPVS E+LA TDW LVTT Sbjct: 193 DIWTAHKTDTGIVYYYNALTGESTYEKPAGFKGEPDKVPVQPTPVSVEQLAGTDWALVTT 252 Query: 185 NDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFSA 364 +DGK+YYYN+KTK+SSWQIP+EV EL+KKQ D KE ++ VP+ID + EK S P+S SA Sbjct: 253 SDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDFSKEHAVPVPNIDVVAEKGSTPISLSA 312 Query: 365 PAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSGA 544 PAVNTGGRDA LR+ VPGSSSALDLIK+KLQ+ G P+ NGS Sbjct: 313 PAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDSGVPSSSSPVPVMPVTAAQELNGSRT 372 Query: 545 VEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGVA 724 V+ V G + ENSKDKLKDA GDG I GP+KEECI+QFKEMLKERGVA Sbjct: 373 VD---VKGLQSENSKDKLKDATGDGNISDSSSDSEDTDSGPSKEECIMQFKEMLKERGVA 429 Query: 725 PFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQL 904 PFSKWEKELPKIVFDPRFKAIP +S RR+LFEHY F+QL Sbjct: 430 PFSKWEKELPKIVFDPRFKAIPSHSARRTLFEHYVKTRAEEERREKRAALKAAIEGFKQL 489 Query: 905 LEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHAA 1084 L+EA EDIDHNT+Y TFK+KWG D RFEALDRK+RE+L+ ERVLPLK+AA+EK QA+ AA Sbjct: 490 LDEASEDIDHNTNYQTFKRKWGSDLRFEALDRKDRELLLTERVLPLKRAAEEKAQAIRAA 549 Query: 1085 AASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXXX 1264 AAS FKSML+++GDI+ +SRWS+VKD +R+D RYK VKH+DREVLFNEYISELK+V Sbjct: 550 AASSFKSMLKEKGDITVNSRWSRVKDSIRDDMRYKCVKHEDREVLFNEYISELKAVEEKA 609 Query: 1265 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDPQ 1444 RRKEAV S+QALLVETIKDPQ Sbjct: 610 ERKERVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIKDPQ 669 Query: 1445 ASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEAG 1624 ASW ESKPKLEKDPQGRA LD SD EKLFREH+KML ERC DF+ALLAEV+T +A Sbjct: 670 ASWTESKPKLEKDPQGRAANPDLDASDTEKLFREHIKMLFERCTHDFRALLAEVITQDAA 729 Query: 1625 LQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAEK 1804 QE E GKTVF SWSTAK LLK D RY+KMPRKERE+LWRR+ +DM R+QK+AL+QE EK Sbjct: 730 AQETEGGKTVFNSWSTAKRLLKPDPRYSKMPRKEREALWRRYAEDMLRKQKVALDQEEEK 789 Query: 1805 HKIETKSRPPVDSGKYPSGSRRIHERR 1885 + + K R D G++ SGSR++HERR Sbjct: 790 -RTDAKGRSSGDLGRFSSGSRKVHERR 815 >ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [Gossypium raimondii] gb|KJB15267.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 887 Score = 754 bits (1946), Expect = 0.0 Identities = 388/627 (61%), Positives = 452/627 (72%) Frame = +2 Query: 5 DAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVTT 184 D WTAH+T+TG VYYYNA+TGESTYEKP+ FKGE ++V VQPTPVS E+LA TDW LVTT Sbjct: 265 DVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQLAGTDWALVTT 324 Query: 185 NDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFSA 364 NDGK+YYYN+KTK+SSWQIP EVTEL+KKQ + KE ++SVP+ID + EK S P+S SA Sbjct: 325 NDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAEKGSTPISLSA 384 Query: 365 PAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSGA 544 PAVNTGGRDA LR+ VPGSSSALDLIK+KLQ+PG P+ NGS A Sbjct: 385 PAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVPS-SSPVPVVPVTATHELNGSRA 443 Query: 545 VEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGVA 724 V+ V G + E++KDKLKDANGDG+I GP+KEECI+QFKEMLKERGVA Sbjct: 444 VD---VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKERGVA 500 Query: 725 PFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQL 904 PFSKWEKELPKIVFDPRFKAIP +S RRSLFEHY F+QL Sbjct: 501 PFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQL 560 Query: 905 LEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHAA 1084 L+EA EDIDH+T+Y TFK+KWG DPRFEALDRK+RE+L+NERVL LK+AA+EK +A+ AA Sbjct: 561 LDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARAIRAA 620 Query: 1085 AASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXXX 1264 AAS FKSML+++GDI+ +SRWS+VKD LR+DPRYK VKH+DREVLFNEYISELK++ Sbjct: 621 AASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAIEEKA 680 Query: 1265 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDPQ 1444 RRKEAV S+QALLVETIKDPQ Sbjct: 681 ERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIKDPQ 740 Query: 1445 ASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEAG 1624 ASW ESKPKLEKDPQGRA LD SD+EKLFREH+KML ERC DF+ALLAEV+T +A Sbjct: 741 ASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAEVITQDAT 800 Query: 1625 LQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAEK 1804 QE E GKT SWSTAK LLK D RY KMPRKERE+LWRR+ +DM R+QK AL+QE EK Sbjct: 801 AQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKSALDQEEEK 860 Query: 1805 HKIETKSRPPVDSGKYPSGSRRIHERR 1885 H D G+Y SG+RR HERR Sbjct: 861 HTDVKGRSSGGDFGRYSSGTRRTHERR 887 >gb|EOY01154.1| Pre-mRNA-processing protein 40C [Theobroma cacao] Length = 816 Score = 748 bits (1930), Expect = 0.0 Identities = 387/628 (61%), Positives = 453/628 (72%), Gaps = 1/628 (0%) Frame = +2 Query: 5 DAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVTT 184 D WTAH+T+TG VYYYNA+TGESTYEKP+ FKGE +KV VQPTPVS E+LA T+W LVTT Sbjct: 193 DIWTAHKTDTGIVYYYNALTGESTYEKPAGFKGEPDKVPVQPTPVSVEQLAGTEWALVTT 252 Query: 185 NDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFSA 364 +DGK+YYYN+KTK+SSWQIP+EV EL+KKQ D KE ++ VP+ID + EK S P+S SA Sbjct: 253 SDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDVSKEHAVPVPNIDVVAEKGSTPISLSA 312 Query: 365 PAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAP-AIXXXXXXXXXXXXXXXNGSG 541 PAV+TGGRDA LR+ VPGSSSALDLIK+KLQ+ G P + NGS Sbjct: 313 PAVSTGGRDAMPLRTSVVPGSSSALDLIKKKLQDSGVPSSSSSSVPVMPVTAAQELNGSR 372 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 AV+ V G + ENSKDKLKDANGDG I GP+KEECI+QFKEMLKERGV Sbjct: 373 AVD---VKGLQSENSKDKLKDANGDGNISDSSSDSEDTDSGPSKEECIMQFKEMLKERGV 429 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAIP +S RR+LFEHY F+Q Sbjct: 430 APFSKWEKELPKIVFDPRFKAIPSHSARRTLFEHYVKTRAEEERREKRAALKAAIEGFKQ 489 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LL+EA EDIDHNT+Y TFK+KWG D RFEALDRK+RE+L+ ERVLPLK+AA+EK QA+ A Sbjct: 490 LLDEASEDIDHNTNYQTFKRKWGSDLRFEALDRKDRELLLTERVLPLKRAAEEKAQAIRA 549 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AAAS KSML+++GDI+ +SRWS+VKD +R+DPRYK VKH+DREVLFNEYISELK+V Sbjct: 550 AAASSLKSMLKEKGDITVNSRWSRVKDSIRDDPRYKCVKHEDREVLFNEYISELKAVEEK 609 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 RRKEAV S+QALLVETIKDP Sbjct: 610 AERKERVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIKDP 669 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 QASW ESKPKLEKDPQGRA LD SD EKLFREH+KML ERC DF+ALLAEV+T +A Sbjct: 670 QASWTESKPKLEKDPQGRAANPDLDPSDTEKLFREHIKMLFERCTHDFRALLAEVITQDA 729 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 QE E GKTVF SWSTAK LLK D RY+KMPRKERE+LWRR+ +DM R+QK AL+QE E Sbjct: 730 AAQETEGGKTVFNSWSTAKRLLKPDPRYSKMPRKEREALWRRYAEDMLRKQKSALDQEEE 789 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 K + + K R D G++ SGSR++HERR Sbjct: 790 K-RTDAKVRSSGDLGRFSSGSRKVHERR 816 >gb|KJB15269.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 886 Score = 750 bits (1936), Expect = 0.0 Identities = 388/627 (61%), Positives = 452/627 (72%) Frame = +2 Query: 5 DAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVTT 184 D WTAH+T+TG VYYYNA+TGESTYEKP+ FKGE ++V VQPTPVS E+LA TDW LVTT Sbjct: 265 DVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQLAGTDWALVTT 324 Query: 185 NDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFSA 364 NDGK+YYYN+KTK+SSWQIP EVTEL+KKQ + KE ++SVP+ID + EK S P+S SA Sbjct: 325 NDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAEKGSTPISLSA 384 Query: 365 PAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSGA 544 PAVNTGGRDA LR+ VPGSSSALDLIK+KLQ+PG P+ NGS A Sbjct: 385 PAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVPS-SSPVPVVPVTATHELNGSRA 443 Query: 545 VEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGVA 724 V+ V G + E++KDKLKDANGDG+I GP+KEECI+QFKEMLKERGVA Sbjct: 444 VD---VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKERGVA 500 Query: 725 PFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQL 904 PFSKWEKELPKIVFDPRFKAIP +S RRSLFEHY F+QL Sbjct: 501 PFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQL 560 Query: 905 LEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHAA 1084 L+EA EDIDH+T+Y TFK+KWG DPRFEALDRK+RE+L+NERVL LK+AA+EK +A+ AA Sbjct: 561 LDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARAIRAA 620 Query: 1085 AASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXXX 1264 AAS FKSML+++GDI+ +SRWS+VKD LR+DPRYK VKH+DREVLFNEYISELK++ Sbjct: 621 AASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAI-EEK 679 Query: 1265 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDPQ 1444 RRKEAV S+QALLVETIKDPQ Sbjct: 680 AERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIKDPQ 739 Query: 1445 ASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEAG 1624 ASW ESKPKLEKDPQGRA LD SD+EKLFREH+KML ERC DF+ALLAEV+T +A Sbjct: 740 ASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAEVITQDAT 799 Query: 1625 LQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAEK 1804 QE E GKT SWSTAK LLK D RY KMPRKERE+LWRR+ +DM R+QK AL+QE EK Sbjct: 800 AQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKSALDQEEEK 859 Query: 1805 HKIETKSRPPVDSGKYPSGSRRIHERR 1885 H D G+Y SG+RR HERR Sbjct: 860 HTDVKGRSSGGDFGRYSSGTRRTHERR 886 >gb|OVA12114.1| WW domain [Macleaya cordata] Length = 1041 Score = 755 bits (1949), Expect = 0.0 Identities = 390/627 (62%), Positives = 450/627 (71%) Frame = +2 Query: 2 LDAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVT 181 +DAWTAH+T+TGAVYYYNAVTGESTYEKPS FKGE KV VQPTPVSWE+LA TDW+LVT Sbjct: 417 VDAWTAHKTDTGAVYYYNAVTGESTYEKPSGFKGEPGKVTVQPTPVSWEKLAGTDWSLVT 476 Query: 182 TNDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 T+DGK+YYYN KTK+SSWQIP EVTEL+KKQ VD K +S + N EK S P++ S Sbjct: 477 TDDGKKYYYNNKTKVSSWQIPNEVTELRKKQDVDTLKTNLLSTQNA-NAPEKGSGPINLS 535 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 APAVNTGGRDAT+LR P P SSSALDLIK+KLQ+ G PA NGS Sbjct: 536 APAVNTGGRDATSLR-PCAPASSSALDLIKKKLQDSGFPAPASPLQASSGPTTSDINGSR 594 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 AV+A V G ENSKDK KDANGD + GP+KEECI QFKEMLKERG+ Sbjct: 595 AVDATV-KGNLGENSKDKQKDANGDENMSDSSSDSEDVDSGPSKEECITQFKEMLKERGI 653 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 PFSKWEKELPKIVFDPRFKA+PGYS RRSLFEHY F+Q Sbjct: 654 LPFSKWEKELPKIVFDPRFKAVPGYSTRRSLFEHYVRTRAEEERKEKRAAQKAAIEGFKQ 713 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LLEEA EDIDH DY +FKKKWG+DPRFEALDRKERE+L+NERVLPLKKAA+EK +++ Sbjct: 714 LLEEASEDIDHKADYQSFKKKWGNDPRFEALDRKERELLLNERVLPLKKAAEEKIRSVRE 773 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AAAS FKSML+++GDI TS+RWS+VKD LRNDPRY+SVKH+DRE+LFN+YISELKS Sbjct: 774 AAASSFKSMLREKGDIDTSTRWSRVKDSLRNDPRYRSVKHEDREILFNDYISELKSAEEE 833 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 RRKEAV SYQALLVETI+DP Sbjct: 834 AERTSKAKREEQDKLKERERETRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIRDP 893 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 QASW ES+PKLEKDPQGRA LD++D EKLFREHVK+L+ERC R+F+ALLAEV+T EA Sbjct: 894 QASWTESRPKLEKDPQGRAINPDLDKADTEKLFREHVKILYERCAREFQALLAEVLTAEA 953 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 Q EDGKTV TSWS AK LLK D RY+KMPRKERESLWRR+ +DM+R+QK+ + + + Sbjct: 954 SEQVTEDGKTVLTSWSAAKRLLKSDPRYSKMPRKERESLWRRYAEDMERKQKVGSDSKED 1013 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHER 1882 K +TKSR +DS + P GSRRIH R Sbjct: 1014 KLNTDTKSRSSLDSRRSPGGSRRIHGR 1040 >gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 888 Score = 749 bits (1934), Expect = 0.0 Identities = 388/628 (61%), Positives = 452/628 (71%), Gaps = 1/628 (0%) Frame = +2 Query: 5 DAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVTT 184 D WTAH+T+TG VYYYNA+TGESTYEKP+ FKGE ++V VQPTPVS E+LA TDW LVTT Sbjct: 265 DVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQLAGTDWALVTT 324 Query: 185 NDGKRYYYNTKTK-LSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 NDGK+YYYN+KTK +SSWQIP EVTEL+KKQ + KE ++SVP+ID + EK S P+S S Sbjct: 325 NDGKKYYYNSKTKVISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAEKGSTPISLS 384 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 APAVNTGGRDA LR+ VPGSSSALDLIK+KLQ+PG P+ NGS Sbjct: 385 APAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVPS-SSPVPVVPVTATHELNGSR 443 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 AV+ V G + E++KDKLKDANGDG+I GP+KEECI+QFKEMLKERGV Sbjct: 444 AVD---VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKERGV 500 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAIP +S RRSLFEHY F+Q Sbjct: 501 APFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQ 560 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LL+EA EDIDH+T+Y TFK+KWG DPRFEALDRK+RE+L+NERVL LK+AA+EK +A+ A Sbjct: 561 LLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARAIRA 620 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AAAS FKSML+++GDI+ +SRWS+VKD LR+DPRYK VKH+DREVLFNEYISELK++ Sbjct: 621 AAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAIEEK 680 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 RRKEAV S+QALLVETIKDP Sbjct: 681 AERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIKDP 740 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 QASW ESKPKLEKDPQGRA LD SD+EKLFREH+KML ERC DF+ALLAEV+T +A Sbjct: 741 QASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAEVITQDA 800 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 QE E GKT SWSTAK LLK D RY KMPRKERE+LWRR+ +DM R+QK AL+QE E Sbjct: 801 TAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKSALDQEEE 860 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 KH D G+Y SG+RR HERR Sbjct: 861 KHTDVKGRSSGGDFGRYSSGTRRTHERR 888 >ref|XP_016707727.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Gossypium hirsutum] Length = 886 Score = 747 bits (1929), Expect = 0.0 Identities = 385/627 (61%), Positives = 450/627 (71%) Frame = +2 Query: 5 DAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVTT 184 D WTAH+T+TG VYYYNA+TGESTYEKP+ FKGE ++V VQPTPVS E+LA TDW LVTT Sbjct: 264 DVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQLAGTDWALVTT 323 Query: 185 NDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFSA 364 NDGK+YYYN+KTK+SSWQIP EVTEL+KKQ + KE ++SVP+ID + EK S P+S SA Sbjct: 324 NDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAEKGSTPISLSA 383 Query: 365 PAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSGA 544 PAVNTGGRDA LR+ VPGSSSALDLIK+KLQ+PG P+ NG A Sbjct: 384 PAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVPS-SSPVPVMPVTATHELNGLRA 442 Query: 545 VEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGVA 724 V+ V G + E++KDKLKDANGDG+I GP+KEECI+QFKEMLKERGVA Sbjct: 443 VD---VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKERGVA 499 Query: 725 PFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQL 904 PFSKWEKELPKIVFDPRFKAIP +S RRSLFEHY F+QL Sbjct: 500 PFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQL 559 Query: 905 LEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHAA 1084 L+EA EDI H+T+Y TFK+KWG DPRFEALDRK+RE+L+NERVL LK+AA+EK +A+ AA Sbjct: 560 LDEASEDIGHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARAIRAA 619 Query: 1085 AASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXXX 1264 AAS FKSML+++GDI+ +SRWS+VKD LR+DPRYK VKH+DREVLFNEYISELK++ Sbjct: 620 AASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAIEEKA 679 Query: 1265 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDPQ 1444 RRKEAV S+QALLVETIKD Q Sbjct: 680 ERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIKDSQ 739 Query: 1445 ASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEAG 1624 ASW ESKPKLEKDPQGRA LD SD+EKLFREH+KML ERC DF+ALLA+V+T +A Sbjct: 740 ASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAKVITQDAA 799 Query: 1625 LQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAEK 1804 QE E GKT SWSTAK LLK D RY KMPRKERE+LWRR+ +DM R+QKLAL+QE EK Sbjct: 800 AQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKLALDQEEEK 859 Query: 1805 HKIETKSRPPVDSGKYPSGSRRIHERR 1885 H D G+Y SG+RR HERR Sbjct: 860 HTDVKGRSSGGDFGRYSSGTRRTHERR 886 >ref|XP_007045322.2| PREDICTED: pre-mRNA-processing protein 40C, partial [Theobroma cacao] Length = 899 Score = 748 bits (1930), Expect = 0.0 Identities = 387/628 (61%), Positives = 453/628 (72%), Gaps = 1/628 (0%) Frame = +2 Query: 5 DAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVTT 184 D WTAH+T+TG VYYYNA+TGESTYEKP+ FKGE +KV VQPTPVS E+LA T+W LVTT Sbjct: 276 DIWTAHKTDTGIVYYYNALTGESTYEKPAGFKGEPDKVPVQPTPVSVEQLAGTEWALVTT 335 Query: 185 NDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFSA 364 +DGK+YYYN+KTK+SSWQIP+EV EL+KKQ D KE ++ VP+ID + EK S P+S SA Sbjct: 336 SDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDVSKEHAVPVPNIDVVAEKGSTPISLSA 395 Query: 365 PAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAP-AIXXXXXXXXXXXXXXXNGSG 541 PAV+TGGRDA LR+ VPGSSSALDLIK+KLQ+ G P + NGS Sbjct: 396 PAVSTGGRDAMPLRTSVVPGSSSALDLIKKKLQDSGVPSSSSSSVPVMPVTAAQELNGSR 455 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 AV+ V G + ENSKDKLKDANGDG I GP+KEECI+QFKEMLKERGV Sbjct: 456 AVD---VKGLQSENSKDKLKDANGDGNISDSSSDSEDTDSGPSKEECIMQFKEMLKERGV 512 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAIP +S RR+LFEHY F+Q Sbjct: 513 APFSKWEKELPKIVFDPRFKAIPSHSARRTLFEHYVKTRAEEERREKRAALKAAIEGFKQ 572 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LL+EA EDIDHNT+Y TFK+KWG D RFEALDRK+RE+L+ ERVLPLK+AA+EK QA+ A Sbjct: 573 LLDEASEDIDHNTNYQTFKRKWGSDLRFEALDRKDRELLLTERVLPLKRAAEEKAQAIRA 632 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AAAS KSML+++GDI+ +SRWS+VKD +R+DPRYK VKH+DREVLFNEYISELK+V Sbjct: 633 AAASSLKSMLKEKGDITVNSRWSRVKDSIRDDPRYKCVKHEDREVLFNEYISELKAVEEK 692 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 RRKEAV S+QALLVETIKDP Sbjct: 693 AERKERVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIKDP 752 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 QASW ESKPKLEKDPQGRA LD SD EKLFREH+KML ERC DF+ALLAEV+T +A Sbjct: 753 QASWTESKPKLEKDPQGRAANPDLDPSDTEKLFREHIKMLFERCTHDFRALLAEVITQDA 812 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 QE E GKTVF SWSTAK LLK D RY+KMPRKERE+LWRR+ +DM R+QK AL+QE E Sbjct: 813 AAQETEGGKTVFNSWSTAKRLLKPDPRYSKMPRKEREALWRRYAEDMLRKQKSALDQEEE 872 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 K + + K R D G++ SGSR++HERR Sbjct: 873 K-RTDAKVRSSGDLGRFSSGSRKVHERR 899 >ref|XP_017637434.1| PREDICTED: pre-mRNA-processing protein 40C [Gossypium arboreum] Length = 885 Score = 747 bits (1928), Expect = 0.0 Identities = 387/627 (61%), Positives = 450/627 (71%) Frame = +2 Query: 5 DAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVTT 184 D WTAH+T+TG VYYYNA+TGESTYEKP+ FKGE ++V VQPTPVS E+LA TDW LVTT Sbjct: 264 DVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQLAGTDWALVTT 323 Query: 185 NDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFSA 364 NDGK+YYYN+KTK+SSWQIP EVTEL+KKQ + KE ++ VP+ID + EK S P+S SA Sbjct: 324 NDGKKYYYNSKTKISSWQIPYEVTELRKKQDSEVSKENAVPVPNIDVVAEKGSTPISLSA 383 Query: 365 PAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSGA 544 PAVNTGGRDA LR+ VPGSSSALDLIK+KLQ+PG P+ NGS A Sbjct: 384 PAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVPS-SSPVPVMPVTATHELNGSRA 442 Query: 545 VEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGVA 724 V+ V G + E++KDKLKDANGDG+I GP+KEECI+QFKEMLKERGVA Sbjct: 443 VD---VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKERGVA 499 Query: 725 PFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQL 904 PFSKWEKELPKIVFDPRFKAIP +S RRSLFEHY FRQL Sbjct: 500 PFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFRQL 559 Query: 905 LEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHAA 1084 L+EA EDIDH+T+Y TFK+KWG DPRFEALDRK+RE+L+NERVL LK+AA+EK + + AA Sbjct: 560 LDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARVIRAA 619 Query: 1085 AASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXXX 1264 AAS FKSML+++GDI+ +SRWS+VKD LR+DPRYK VKH+DREVLFNEYISELK++ Sbjct: 620 AASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAI-EEK 678 Query: 1265 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDPQ 1444 RRKEAV S+QALLVETIKD Q Sbjct: 679 AERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIKDSQ 738 Query: 1445 ASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEAG 1624 ASW ESKPKLEKDPQGRA LD SD+EKLFREH+KML ERC DF+ALLAEV+T +A Sbjct: 739 ASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAEVITQDAA 798 Query: 1625 LQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAEK 1804 QE E GKT SWSTAK LLK D RY KMPRKERE+LWRR+ +DM R+QKLAL+QE EK Sbjct: 799 AQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKLALDQEEEK 858 Query: 1805 HKIETKSRPPVDSGKYPSGSRRIHERR 1885 H D G+Y SG+RR HERR Sbjct: 859 HTDVKGRSSGGDFGRYSSGTRRTHERR 885 >gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis] gb|KDO53045.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis] Length = 857 Score = 743 bits (1918), Expect = 0.0 Identities = 386/628 (61%), Positives = 448/628 (71%) Frame = +2 Query: 2 LDAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVT 181 LDAWTAH+T+TG VYYYNAVTGESTYEKP+ FKGE +KV VQPTP+S E L TDW LVT Sbjct: 235 LDAWTAHKTDTGIVYYYNAVTGESTYEKPAGFKGEPDKVPVQPTPISMEHLTGTDWALVT 294 Query: 182 TNDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFS 361 TNDGK+YYYN+K K+SSWQIP+EVTELKKK+ D KEQS VP+ + + EK S +S S Sbjct: 295 TNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLKEQS--VPNTNIVIEKGSNAISLS 352 Query: 362 APAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSG 541 +PAVNTGGRDATALR+ +PGSSSALDLIK+KLQ+ G P NGS Sbjct: 353 SPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTPTASPAPVSSAAATSES-NGSK 411 Query: 542 AVEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGV 721 AVE V G + EN+KDKLKD NGDGT+ GPTKEECII+FKEMLKERGV Sbjct: 412 AVEVTV-KGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIKFKEMLKERGV 470 Query: 722 APFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQ 901 APFSKWEKELPKIVFDPRFKAI S RR+LFE Y F+Q Sbjct: 471 APFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAAQKAAIEGFKQ 530 Query: 902 LLEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHA 1081 LLEE EDIDH+TDY TFKKKWG DPRFEALDRK+RE+L+NERVLPLK+AA+EK QA+ A Sbjct: 531 LLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAIRA 590 Query: 1082 AAASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXX 1261 AAAS FKSML+++GDI+ SSRWSKVKD LR+DPRYKSV+H+DREV+FNEY+ ELK+ Sbjct: 591 AAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEYVRELKAAEEE 650 Query: 1262 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDP 1441 RRKEAV S+QALLVETIKDP Sbjct: 651 AEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQALLVETIKDP 710 Query: 1442 QASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEA 1621 QASW ES+PKLEKDPQGRAT LD SD EKLFREH+K L+ERC DF+ LLAEV+T EA Sbjct: 711 QASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRGLLAEVITAEA 770 Query: 1622 GLQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAE 1801 QE EDGKTV SWSTAK +LK + RY+KMPRKERE+LWRRH +++QR+ K +L+Q + Sbjct: 771 AAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRKHKSSLDQNED 830 Query: 1802 KHKIETKSRPPVDSGKYPSGSRRIHERR 1885 HK ++KSR D G+ PS SRR ERR Sbjct: 831 NHK-DSKSRSSTDGGRPPSSSRRNQERR 857 >ref|XP_016703241.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Gossypium hirsutum] Length = 886 Score = 744 bits (1920), Expect = 0.0 Identities = 383/627 (61%), Positives = 449/627 (71%) Frame = +2 Query: 5 DAWTAHRTETGAVYYYNAVTGESTYEKPSDFKGELEKVAVQPTPVSWERLASTDWTLVTT 184 D WTAH+T+TG VYYYNA+TGES+YEKP+ FKGE ++V VQPTPVS E+LA TDW LVTT Sbjct: 264 DVWTAHKTDTGVVYYYNALTGESSYEKPAGFKGEPDQVTVQPTPVSVEQLAGTDWALVTT 323 Query: 185 NDGKRYYYNTKTKLSSWQIPTEVTELKKKQGVDAYKEQSISVPDIDNLTEKESAPVSFSA 364 NDGK+YYYN+KTK+SSWQIP EVTEL+KKQ + KE ++ VP+ID + EK S P+S SA Sbjct: 324 NDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVPVPNIDVVAEKGSTPISLSA 383 Query: 365 PAVNTGGRDATALRSPGVPGSSSALDLIKRKLQEPGAPAIXXXXXXXXXXXXXXXNGSGA 544 PAVNTGGRDA LR+ VPGSSSALDLIK+KLQ+PG P+ NGS A Sbjct: 384 PAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVPS-SSPVPVMPVTATHELNGSRA 442 Query: 545 VEAAVVNGTKIENSKDKLKDANGDGTIXXXXXXXXXXXXGPTKEECIIQFKEMLKERGVA 724 V+ V G + E++KDKLKDANGDG+I GP+KEECI+QFKEMLKERGVA Sbjct: 443 VD---VKGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKERGVA 499 Query: 725 PFSKWEKELPKIVFDPRFKAIPGYSVRRSLFEHYXXXXXXXXXXXXXXXXXXXXXXFRQL 904 PFSKWEKELPKIVFDPRFKAIP +S RRSLFEHY FRQL Sbjct: 500 PFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFRQL 559 Query: 905 LEEAKEDIDHNTDYYTFKKKWGHDPRFEALDRKEREVLINERVLPLKKAAQEKTQAMHAA 1084 L+EA EDIDH+T+Y TFK++WG DPRFEALDRK+R +L+NERVL LK+AA+EK + + AA Sbjct: 560 LDEASEDIDHDTNYQTFKRQWGSDPRFEALDRKDRGLLLNERVLLLKRAAEEKARVIRAA 619 Query: 1085 AASDFKSMLQDRGDISTSSRWSKVKDGLRNDPRYKSVKHDDREVLFNEYISELKSVGXXX 1264 AAS FKSML+++GDI+ +SRWS+VKD LR+DPRYK VKH+DREVLF+EYISELK++ Sbjct: 620 AASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFDEYISELKAIEEKA 679 Query: 1265 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARRKEAVESYQALLVETIKDPQ 1444 RRKEAV S+QALLVETIKD Q Sbjct: 680 ERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIKDSQ 739 Query: 1445 ASWMESKPKLEKDPQGRATRNQLDQSDLEKLFREHVKMLHERCERDFKALLAEVVTVEAG 1624 ASW ESKPKLEKDPQGRA LD SD+EKLFREH+KML ERC DF+ALLAEV+T +A Sbjct: 740 ASWTESKPKLEKDPQGRAVNPDLDSSDMEKLFREHIKMLFERCVNDFRALLAEVITQDAA 799 Query: 1625 LQEKEDGKTVFTSWSTAKHLLKVDLRYTKMPRKERESLWRRHVDDMQRRQKLALNQEAEK 1804 QE E GKT SWSTAK LLK D RY KMPRKERE+LWRR+ +DM R+QKLAL+QE EK Sbjct: 800 AQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKLALDQEEEK 859 Query: 1805 HKIETKSRPPVDSGKYPSGSRRIHERR 1885 H D G+Y SG+RR HERR Sbjct: 860 HTDVKGRSSGGDFGRYSSGTRRTHERR 886