BLASTX nr result
ID: Rehmannia28_contig00016914
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00016914
         (2074 letters)

Database: ./nr
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_011073766.1| PREDICTED: pre-mRNA-processing protein 40C [...   807   0.0
ref|XP_012842923.1| PREDICTED: pre-mRNA-processing protein 40C [...   758   0.0
ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C i...   679   0.0
gb|KJB15268.1| hypothetical protein B456_002G167700 [Gossypium r...   675   0.0
ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C i...   679   0.0
ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma c...   676   0.0
ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C i...   679   0.0
ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [...   675   0.0
ref|XP_002272014.2| PREDICTED: pre-mRNA-processing protein 40C i...   679   0.0
gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sin...   669   0.0
gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium r...   670   0.0
ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citr...   671   0.0
gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sin...   669   0.0
ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C [...   669   0.0
ref|XP_002515795.1| PREDICTED: pre-mRNA-processing protein 40C i...   661   0.0
ref|XP_012089638.1| PREDICTED: pre-mRNA-processing protein 40C i...   658   0.0
ref|XP_012089634.1| PREDICTED: pre-mRNA-processing protein 40C i...   658   0.0
gb|KDP22962.1| hypothetical protein JCGZ_01659 [Jatropha curcas]      656   0.0
ref|XP_010112279.1| Transcription elongation regulator 1 [Morus ...
655 0.0 ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C i... 655 0.0 >ref|XP_011073766.1| PREDICTED: pre-mRNA-processing protein 40C [Sesamum indicum] Length = 758 Score = 807 bits (2084), Expect = 0.0 Identities = 425/569 (74%), Positives = 454/569 (79%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGKRYYYN+ TQ SSWQIPSEVTELRKKQ+ADA KAQSV V TN +TE+G D V Sbjct: 191 LVTTNDGKRYYYNTTTQLSSWQIPSEVTELRKKQDADALKAQSVSVTATNIITERGPDAV 250 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 +LSTPAANTGGRDATA R S VS +SSALDLIKKKLQDSG+PDS+S GP+ S + A ELN Sbjct: 251 NLSTPAANTGGRDATAIRPSSVS-ASSALDLIKKKLQDSGMPDSSSPGPSLSSAVALELN 309 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GSKP+EA K L ENNK+K+KDA GPTKEECILQFKEMLKER Sbjct: 310 GSKPMEASIKGLLNENNKEKRKDANTDGDISNSSSDSEDEDGGPTKEECILQFKEMLKER 369 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRT EGF Sbjct: 370 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTRAEEERKEKRAAQKAALEGF 429 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLLEEAKE+IDHNTDYQTFKRRWGEDPRFQALDRK+RE LLNERV PLKRT Sbjct: 430 KQLLEEAKEDIDHNTDYQTFKRRWGEDPRFQALDRKEREALLNERVLPLKRTAQEKAQAE 489 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A++SNFKS+L DKGDI S+SRWSKVK+SLK DPRYKSVKH+DREKLFNEYVAELKAAE Sbjct: 490 RVAAISNFKSMLHDKGDITSSSRWSKVKESLKCDPRYKSVKHEDREKLFNEYVAELKAAE 549 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EETVRKAKAKQD EA+ESYQALLVETIK Sbjct: 550 EETVRKAKAKQDEEEKLKERERALRKRKEREEQEVERVRQKARRKEALESYQALLVETIK 609 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTL ERC +EF+ALLTEVI+A Sbjct: 610 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLYERCAVEFKALLTEVISA 669 Query: 632 
EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ- 456 +A+AQET+DGKT ITSWSTAKQLLK+DPRYNKMPRKERESLWRRHAEEI RKQKK HDQ Sbjct: 670 DAAAQETQDGKTAITSWSTAKQLLKNDPRYNKMPRKERESLWRRHAEEIQRKQKKVHDQE 729 Query: 455 GEKPTEGKVRTSVDSGKHLSGSRRPHDRR 369 GEKP EGK RTSVDSGKHLSGSRR HDRR Sbjct: 730 GEKPAEGKSRTSVDSGKHLSGSRRAHDRR 758 >ref|XP_012842923.1| PREDICTED: pre-mRNA-processing protein 40C [Erythranthe guttata] gi|604322248|gb|EYU32634.1| hypothetical protein MIMGU_mgv1a001237mg [Erythranthe guttata] Length = 858 Score = 758 bits (1957), Expect = 0.0 Identities = 403/567 (71%), Positives = 436/567 (76%) Frame = -3 Query: 2069 VTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPVS 1890 VTTNDGK YYYN+ TQ SSWQ+PSEVTELRKKQ+ADA KAQS+ TN V EKGSDPVS Sbjct: 301 VTTNDGKVYYYNAATQLSSWQVPSEVTELRKKQDADALKAQSLSATYTNVVAEKGSDPVS 360 Query: 1889 LSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELNG 1710 LSTPAANTGGRDATA ++S VSGSSSALDLIKKKLQDSG+PDSTS GP+ S E+NG Sbjct: 361 LSTPAANTGGRDATAVKSSSVSGSSSALDLIKKKLQDSGLPDSTSPGPSLS-----EING 415 Query: 1709 SKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKERG 1530 SK IE L+ ENNKDK+KDA GPTKEECILQFKEMLKERG Sbjct: 416 SKSIEF----LENENNKDKRKDANGDGDLSNSSSDSEDEDGGPTKEECILQFKEMLKERG 471 Query: 1529 VAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGFK 1350 VAPFSKWEKELPKIVFD RFKAI NHSARRALFEHYVRT EGFK Sbjct: 472 VAPFSKWEKELPKIVFDARFKAISNHSARRALFEHYVRTRAEEERKEKRAAQKAASEGFK 531 Query: 1349 QLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXXX 1170 QLLEEAKE+IDHNTDY+TFKR+WG+D RFQAL+RK+RE LLNERV PL++ Sbjct: 532 QLLEEAKEDIDHNTDYETFKRKWGQDHRFQALERKEREFLLNERVSPLRKIAQERAQAER 591 Query: 1169 XASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAEE 990 A+ S+FKS+L+D GD+ S SRWSKVKDSLKSDPRY SVKHDDREKLFNEYVAELKAAEE Sbjct: 592 AAATSDFKSMLKDNGDVTSTSRWSKVKDSLKSDPRYMSVKHDDREKLFNEYVAELKAAEE 651 Query: 989 ETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIKD 810 ETVRKA+A QD EAIESYQALLVETIKD Sbjct: 652 
ETVRKARAVQDEEDKIKERERALRKRKEREEQEVERVRQKARRKEAIESYQALLVETIKD 711 Query: 809 PQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITAE 630 PQASWT SKPKL+KDPQGRAANPHLDKSDLEKLFREHVK+L ERCV EFRALLT+VITAE Sbjct: 712 PQASWTASKPKLDKDPQGRAANPHLDKSDLEKLFREHVKSLHERCVGEFRALLTDVITAE 771 Query: 629 ASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQGE 450 ASA+ETEDGKT+ITSWSTAKQ+LKSDPRYNKMPRKERESLWRRH+EEI RK KK DQGE Sbjct: 772 ASARETEDGKTVITSWSTAKQVLKSDPRYNKMPRKERESLWRRHSEEIQRKLKKDSDQGE 831 Query: 449 KPTEGKVRTSVDSGKHLSGSRRPHDRR 369 KP EGK R S + GKHLSGS R H RR Sbjct: 832 KPVEGKSRASAEPGKHLSGSGRTHHRR 858 >ref|XP_010654542.1| PREDICTED: pre-mRNA-processing protein 40C isoform X4 [Vitis vinifera] Length = 848 Score = 679 bits (1753), Expect = 0.0 Identities = 358/569 (62%), Positives = 421/569 (73%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYN++T+ SSWQIP+E+TE+RKKQ++ A K ++ NTN TEKG P+ Sbjct: 281 LVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPI 340 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 +LS PA TGGRDAT RTS V GS+SALD+IKKKLQDSG P +TS SSG A+ELN Sbjct: 341 ALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAP-ATSSPVHSSGPIASELN 399 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ IE K LQ EN+KDK KD GPTKEECI+QFKEMLKER Sbjct: 400 GSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKER 459 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAIP +SARR+LFEHYVRT EGF Sbjct: 460 GVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGF 519 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLLEEA E+IDH T+YQTF+++WG+DPRF+ALDRKDRELLLNERV PLKR Sbjct: 520 KQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAI 579 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+VS+FKS+L+DKGDI +++RWS+VKDSL++DPRYK VKH+DRE LFNEY++ELKAAE Sbjct: 580 
RAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAE 639 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AK+K++ EA+ SYQALLVETIK Sbjct: 640 EEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIK 699 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQ SWTESKPKLEKDPQ RA N LD SDLEKLFREH+K L ER EFRALL+EV+TA Sbjct: 700 DPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTA 759 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ- 456 EA+ QETEDGKT++TSWSTAK+LL+SD RY KMPRK+RES+WRR++EE+LRKQK A DQ Sbjct: 760 EAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQT 819 Query: 455 GEKPTEGKVRTSVDSGKHLSGSRRPHDRR 369 EK TE K R+SVDSG+ SGSRR H+RR Sbjct: 820 EEKHTEVKGRSSVDSGRFPSGSRRAHERR 848 >gb|KJB15268.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 736 Score = 675 bits (1741), Expect = 0.0 Identities = 358/570 (62%), Positives = 419/570 (73%), Gaps = 2/570 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYNS+T+ SSWQIP+EVTELRKKQ+++ K +V V N + V EKGS P+ Sbjct: 170 LVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAEKGSTPI 229 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 SLS PA NTGGRDA RTS+V GSSSALDLIKKKLQD G+P S+S P +A ELN Sbjct: 230 SLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVTATHELN 288 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ ++ G LQ E+NKDK KDA GP+KEECI+QFKEMLKER Sbjct: 289 GSRAVDVKG--LQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKER 346 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAIP+HSARR+LFEHYV+T EGF Sbjct: 347 GVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGF 406 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLL+EA E+IDH+T+YQTFKR+WG DPRF+ALDRKDRELLLNERV LKR Sbjct: 407 
KQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARAI 466 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ S+FKS+L++KGDIN NSRWS+VKDSL+ DPRYK VKH+DRE LFNEY++ELKA E Sbjct: 467 RAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAIE 526 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 E+ RK K K++ EA+ S+QALLVETIK Sbjct: 527 EKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIK 586 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTESKPKLEKDPQGRAANP LD SD+EKLFREH+K L ERCV +FRALL EVIT Sbjct: 587 DPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAEVITQ 646 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ- 456 +A+AQETE GKT + SWSTAK+LLK DPRYNKMPRKERE+LWRR+AE++LRKQK A DQ Sbjct: 647 DATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKSALDQE 706 Query: 455 GEKPTEGKVRTS-VDSGKHLSGSRRPHDRR 369 EK T+ K R+S D G++ SG+RR H+RR Sbjct: 707 EEKHTDVKGRSSGGDFGRYSSGTRRTHERR 736 >ref|XP_010654535.1| PREDICTED: pre-mRNA-processing protein 40C isoform X3 [Vitis vinifera] Length = 903 Score = 679 bits (1753), Expect = 0.0 Identities = 358/569 (62%), Positives = 421/569 (73%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYN++T+ SSWQIP+E+TE+RKKQ++ A K ++ NTN TEKG P+ Sbjct: 336 LVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPI 395 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 +LS PA TGGRDAT RTS V GS+SALD+IKKKLQDSG P +TS SSG A+ELN Sbjct: 396 ALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAP-ATSSPVHSSGPIASELN 454 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ IE K LQ EN+KDK KD GPTKEECI+QFKEMLKER Sbjct: 455 GSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKER 514 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAIP +SARR+LFEHYVRT EGF Sbjct: 515 
GVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGF 574 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLLEEA E+IDH T+YQTF+++WG+DPRF+ALDRKDRELLLNERV PLKR Sbjct: 575 KQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAI 634 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+VS+FKS+L+DKGDI +++RWS+VKDSL++DPRYK VKH+DRE LFNEY++ELKAAE Sbjct: 635 RAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAE 694 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AK+K++ EA+ SYQALLVETIK Sbjct: 695 EEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIK 754 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQ SWTESKPKLEKDPQ RA N LD SDLEKLFREH+K L ER EFRALL+EV+TA Sbjct: 755 DPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTA 814 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ- 456 EA+ QETEDGKT++TSWSTAK+LL+SD RY KMPRK+RES+WRR++EE+LRKQK A DQ Sbjct: 815 EAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQT 874 Query: 455 GEKPTEGKVRTSVDSGKHLSGSRRPHDRR 369 EK TE K R+SVDSG+ SGSRR H+RR Sbjct: 875 EEKHTEVKGRSSVDSGRFPSGSRRAHERR 903 >ref|XP_007045322.1| Pre-mRNA-processing protein 40C [Theobroma cacao] gi|508709257|gb|EOY01154.1| Pre-mRNA-processing protein 40C [Theobroma cacao] Length = 816 Score = 676 bits (1743), Expect = 0.0 Identities = 355/570 (62%), Positives = 414/570 (72%), Gaps = 2/570 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTT+DGK+YYYNS+T+ SSWQIPSEV ELRKKQ+ D K +VPV N + V EKGS P+ Sbjct: 249 LVTTSDGKKYYYNSKTKISSWQIPSEVAELRKKQDNDVSKEHAVPVPNIDVVAEKGSTPI 308 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQG-PASSGSAATEL 1716 SLS PA +TGGRDA RTS+V GSSSALDLIKKKLQDSG+P S+S P +AA EL Sbjct: 309 SLSAPAVSTGGRDAMPLRTSVVPGSSSALDLIKKKLQDSGVPSSSSSSVPVMPVTAAQEL 368 Query: 1715 NGSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKE 1536 NGS+ ++ G LQ 
EN+KDK KDA GP+KEECI+QFKEMLKE Sbjct: 369 NGSRAVDVKG--LQSENSKDKLKDANGDGNISDSSSDSEDTDSGPSKEECIMQFKEMLKE 426 Query: 1535 RGVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEG 1356 RGVAPFSKWEKELPKIVFDPRFKAIP+HSARR LFEHYV+T EG Sbjct: 427 RGVAPFSKWEKELPKIVFDPRFKAIPSHSARRTLFEHYVKTRAEEERREKRAALKAAIEG 486 Query: 1355 FKQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXX 1176 FKQLL+EA E+IDHNT+YQTFKR+WG D RF+ALDRKDRELLL ERV PLKR Sbjct: 487 FKQLLDEASEDIDHNTNYQTFKRKWGSDLRFEALDRKDRELLLTERVLPLKRAAEEKAQA 546 Query: 1175 XXXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAA 996 A+ S+ KS+L++KGDI NSRWS+VKDS++ DPRYK VKH+DRE LFNEY++ELKA Sbjct: 547 IRAAAASSLKSMLKEKGDITVNSRWSRVKDSIRDDPRYKCVKHEDREVLFNEYISELKAV 606 Query: 995 EEETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETI 816 EE+ RK + K++ EA+ S+QALLVETI Sbjct: 607 EEKAERKERVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETI 666 Query: 815 KDPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVIT 636 KDPQASWTESKPKLEKDPQGRAANP LD SD EKLFREH+K L ERC +FRALL EVIT Sbjct: 667 KDPQASWTESKPKLEKDPQGRAANPDLDPSDTEKLFREHIKMLFERCTHDFRALLAEVIT 726 Query: 635 AEASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ 456 +A+AQETE GKT+ SWSTAK+LLK DPRY+KMPRKERE+LWRR+AE++LRKQK A DQ Sbjct: 727 QDAAAQETEGGKTVFNSWSTAKRLLKPDPRYSKMPRKEREALWRRYAEDMLRKQKSALDQ 786 Query: 455 -GEKPTEGKVRTSVDSGKHLSGSRRPHDRR 369 EK T+ KVR+S D G+ SGSR+ H+RR Sbjct: 787 EEEKRTDAKVRSSGDLGRFSSGSRKVHERR 816 >ref|XP_010654529.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Vitis vinifera] Length = 1013 Score = 679 bits (1753), Expect = 0.0 Identities = 358/569 (62%), Positives = 421/569 (73%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYN++T+ SSWQIP+E+TE+RKKQ++ A K ++ NTN TEKG P+ Sbjct: 446 LVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPI 505 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 +LS PA TGGRDAT RTS V GS+SALD+IKKKLQDSG P 
+TS SSG A+ELN Sbjct: 506 ALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAP-ATSSPVHSSGPIASELN 564 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ IE K LQ EN+KDK KD GPTKEECI+QFKEMLKER Sbjct: 565 GSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKER 624 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAIP +SARR+LFEHYVRT EGF Sbjct: 625 GVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGF 684 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLLEEA E+IDH T+YQTF+++WG+DPRF+ALDRKDRELLLNERV PLKR Sbjct: 685 KQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAI 744 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+VS+FKS+L+DKGDI +++RWS+VKDSL++DPRYK VKH+DRE LFNEY++ELKAAE Sbjct: 745 RAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAE 804 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AK+K++ EA+ SYQALLVETIK Sbjct: 805 EEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIK 864 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQ SWTESKPKLEKDPQ RA N LD SDLEKLFREH+K L ER EFRALL+EV+TA Sbjct: 865 DPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTA 924 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ- 456 EA+ QETEDGKT++TSWSTAK+LL+SD RY KMPRK+RES+WRR++EE+LRKQK A DQ Sbjct: 925 EAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQT 984 Query: 455 GEKPTEGKVRTSVDSGKHLSGSRRPHDRR 369 EK TE K R+SVDSG+ SGSRR H+RR Sbjct: 985 EEKHTEVKGRSSVDSGRFPSGSRRAHERR 1013 >ref|XP_012467146.1| PREDICTED: pre-mRNA-processing protein 40C [Gossypium raimondii] gi|763747828|gb|KJB15267.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 887 Score = 675 bits (1741), Expect = 0.0 Identities = 358/570 (62%), Positives = 419/570 (73%), Gaps = 2/570 (0%) Frame = -3 Query: 2072 
LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYNS+T+ SSWQIP+EVTELRKKQ+++ K +V V N + V EKGS P+ Sbjct: 321 LVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAEKGSTPI 380 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 SLS PA NTGGRDA RTS+V GSSSALDLIKKKLQD G+P S+S P +A ELN Sbjct: 381 SLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVTATHELN 439 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ ++ G LQ E+NKDK KDA GP+KEECI+QFKEMLKER Sbjct: 440 GSRAVDVKG--LQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKER 497 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAIP+HSARR+LFEHYV+T EGF Sbjct: 498 GVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGF 557 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLL+EA E+IDH+T+YQTFKR+WG DPRF+ALDRKDRELLLNERV LKR Sbjct: 558 KQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARAI 617 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ S+FKS+L++KGDIN NSRWS+VKDSL+ DPRYK VKH+DRE LFNEY++ELKA E Sbjct: 618 RAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAIE 677 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 E+ RK K K++ EA+ S+QALLVETIK Sbjct: 678 EKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIK 737 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTESKPKLEKDPQGRAANP LD SD+EKLFREH+K L ERCV +FRALL EVIT Sbjct: 738 DPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAEVITQ 797 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ- 456 +A+AQETE GKT + SWSTAK+LLK DPRYNKMPRKERE+LWRR+AE++LRKQK A DQ Sbjct: 798 DATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKSALDQE 857 Query: 455 GEKPTEGKVRTS-VDSGKHLSGSRRPHDRR 369 EK T+ K R+S D G++ SG+RR H+RR Sbjct: 858 EEKHTDVKGRSSGGDFGRYSSGTRRTHERR 887 >ref|XP_002272014.2| PREDICTED: pre-mRNA-processing 
protein 40C isoform X1 [Vitis vinifera] gi|297738259|emb|CBI27460.3| unnamed protein product [Vitis vinifera] Length = 1046 Score = 679 bits (1753), Expect = 0.0 Identities = 358/569 (62%), Positives = 421/569 (73%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYN++T+ SSWQIP+E+TE+RKKQ++ A K ++ NTN TEKG P+ Sbjct: 479 LVTTNDGKKYYYNTKTKLSSWQIPTELTEMRKKQDSVALKEHAMLAPNTNVSTEKGPSPI 538 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 +LS PA TGGRDAT RTS V GS+SALD+IKKKLQDSG P +TS SSG A+ELN Sbjct: 539 ALSAPAVTTGGRDATPLRTSAVPGSASALDMIKKKLQDSGAP-ATSSPVHSSGPIASELN 597 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ IE K LQ EN+KDK KD GPTKEECI+QFKEMLKER Sbjct: 598 GSRVIEPTVKGLQSENSKDKLKDTNGDGNMSDSSSDSEDVDSGPTKEECIIQFKEMLKER 657 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAIP +SARR+LFEHYVRT EGF Sbjct: 658 GVAPFSKWEKELPKIVFDPRFKAIPGYSARRSLFEHYVRTRAEEERKEKRAAQRAAIEGF 717 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLLEEA E+IDH T+YQTF+++WG+DPRF+ALDRKDRELLLNERV PLKR Sbjct: 718 KQLLEEASEDIDHKTEYQTFRKKWGDDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAI 777 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+VS+FKS+L+DKGDI +++RWS+VKDSL++DPRYK VKH+DRE LFNEY++ELKAAE Sbjct: 778 RAAAVSSFKSMLRDKGDITTSTRWSRVKDSLRNDPRYKCVKHEDREILFNEYISELKAAE 837 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AK+K++ EA+ SYQALLVETIK Sbjct: 838 EEVEREAKSKKEEQDKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSYQALLVETIK 897 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQ SWTESKPKLEKDPQ RA N LD SDLEKLFREH+K L ER EFRALL+EV+TA Sbjct: 898 DPQVSWTESKPKLEKDPQARATNSDLDPSDLEKLFREHIKMLHERRAHEFRALLSEVLTA 957 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ- 456 EA+ QETEDGKT++TSWSTAK+LL+SD RY KMPRK+RES+WRR++EE+LRKQK A 
DQ Sbjct: 958 EAATQETEDGKTVLTSWSTAKRLLRSDTRYIKMPRKDRESVWRRYSEEMLRKQKLAQDQT 1017 Query: 455 GEKPTEGKVRTSVDSGKHLSGSRRPHDRR 369 EK TE K R+SVDSG+ SGSRR H+RR Sbjct: 1018 EEKHTEVKGRSSVDSGRFPSGSRRAHERR 1046 >gb|KDO53044.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis] gi|641834042|gb|KDO53045.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis] Length = 857 Score = 669 bits (1727), Expect = 0.0 Identities = 355/569 (62%), Positives = 410/569 (72%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYNS+ + SSWQIPSEVTEL+KK++ D K QSVP NTN V EKGS+ + Sbjct: 292 LVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLKEQSVP--NTNIVIEKGSNAI 349 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 SLS+PA NTGGRDATA RTS + GSSSALDLIKKKLQDSG P + S P SS +A +E N Sbjct: 350 SLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVSSAAATSESN 408 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GSK +E K LQ EN KDK KD GPTKEECI++FKEMLKER Sbjct: 409 GSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIKFKEMLKER 468 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAI + SARRALFE YV+T EGF Sbjct: 469 GVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAAQKAAIEGF 528 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLLEE E+IDH+TDYQTFK++WG DPRF+ALDRKDRELLLNERV PLKR Sbjct: 529 KQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAI 588 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ S+FKS+L++KGDI +SRWSKVKD L+ DPRYKSV+H+DRE +FNEYV ELKAAE Sbjct: 589 RAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEYVRELKAAE 648 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AKA+++ EA+ S+QALLVETIK Sbjct: 649 EEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQALLVETIK 708 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 
DPQASWTES+PKLEKDPQGRA N LD SD EKLFREH+KTL ERC +FR LL EVITA Sbjct: 709 DPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRGLLAEVITA 768 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQG 453 EA+AQETEDGKT++ SWSTAK++LK +PRY+KMPRKERE+LWRRHAEEI RK K + DQ Sbjct: 769 EAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRKHKSSLDQN 828 Query: 452 E-KPTEGKVRTSVDSGKHLSGSRRPHDRR 369 E + K R+S D G+ S SRR +RR Sbjct: 829 EDNHKDSKSRSSTDGGRPPSSSRRNQERR 857 >gb|KJB15270.1| hypothetical protein B456_002G167700 [Gossypium raimondii] Length = 888 Score = 670 bits (1729), Expect = 0.0 Identities = 358/571 (62%), Positives = 419/571 (73%), Gaps = 3/571 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQS-SSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDP 1896 LVTTNDGK+YYYNS+T+ SSWQIP+EVTELRKKQ+++ K +V V N + V EKGS P Sbjct: 321 LVTTNDGKKYYYNSKTKVISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAEKGSTP 380 Query: 1895 VSLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATEL 1716 +SLS PA NTGGRDA RTS+V GSSSALDLIKKKLQD G+P S+S P +A EL Sbjct: 381 ISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVP-SSSPVPVVPVTATHEL 439 Query: 1715 NGSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKE 1536 NGS+ ++ G LQ E+NKDK KDA GP+KEECI+QFKEMLKE Sbjct: 440 NGSRAVDVKG--LQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKE 497 Query: 1535 RGVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEG 1356 RGVAPFSKWEKELPKIVFDPRFKAIP+HSARR+LFEHYV+T EG Sbjct: 498 RGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEG 557 Query: 1355 FKQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXX 1176 FKQLL+EA E+IDH+T+YQTFKR+WG DPRF+ALDRKDRELLLNERV LKR Sbjct: 558 FKQLLDEASEDIDHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARA 617 Query: 1175 XXXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAA 996 A+ S+FKS+L++KGDIN NSRWS+VKDSL+ DPRYK VKH+DRE LFNEY++ELKA Sbjct: 618 IRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAI 677 Query: 995 EEETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETI 816 EE+ RK K K++ EA+ 
S+QALLVETI Sbjct: 678 EEKAERKDKVKKEEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETI 737 Query: 815 KDPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVIT 636 KDPQASWTESKPKLEKDPQGRAANP LD SD+EKLFREH+K L ERCV +FRALL EVIT Sbjct: 738 KDPQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAEVIT 797 Query: 635 AEASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ 456 +A+AQETE GKT + SWSTAK+LLK DPRYNKMPRKERE+LWRR+AE++LRKQK A DQ Sbjct: 798 QDATAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKSALDQ 857 Query: 455 -GEKPTEGKVRTS-VDSGKHLSGSRRPHDRR 369 EK T+ K R+S D G++ SG+RR H+RR Sbjct: 858 EEEKHTDVKGRSSGGDFGRYSSGTRRTHERR 888 >ref|XP_006437488.1| hypothetical protein CICLE_v10030612mg [Citrus clementina] gi|557539684|gb|ESR50728.1| hypothetical protein CICLE_v10030612mg [Citrus clementina] Length = 1015 Score = 671 bits (1731), Expect = 0.0 Identities = 356/569 (62%), Positives = 410/569 (72%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYNS+ + SSWQIPSEVTEL+KK++ D K QSVP NTN V EKGS+ + Sbjct: 450 LVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLKEQSVP--NTNIVIEKGSNAI 507 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 SLS+PA NTGGRDATA RTS + GSSSALDLIKKKLQDSG P + S P SS +A +E N Sbjct: 508 SLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVSSAAATSESN 566 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GSK +E K LQ EN KDK KD GPTKEECI++FKEMLKER Sbjct: 567 GSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIKFKEMLKER 626 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAI + SARRALFE YV+T EGF Sbjct: 627 GVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAAQKAAIEGF 686 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLLEE E+IDH+TDYQTFK++WG DPRF+ALDRKDRELLLNERV PLKR Sbjct: 687 KQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAI 746 Query: 1172 
XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ S+FKS+L++KGDI +SRWSKVKD L+ DPRYKSV+H+DRE +FNEYV ELKAAE Sbjct: 747 RAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEYVRELKAAE 806 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AKA+++ EA+ S+QALLVETIK Sbjct: 807 EEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQALLVETIK 866 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTES+PKLEKDPQGRA N LD SD EKLFREH+KTL ERC +FR LL EVITA Sbjct: 867 DPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRGLLAEVITA 926 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQG 453 EA+AQETEDGKT++ SWSTAK++LK DPRY+KMPRKERE+LWRRHAEEI RK K + DQ Sbjct: 927 EAAAQETEDGKTVLNSWSTAKRVLKPDPRYSKMPRKEREALWRRHAEEIQRKHKSSLDQN 986 Query: 452 E-KPTEGKVRTSVDSGKHLSGSRRPHDRR 369 E + K R+S D G+ S SRR +RR Sbjct: 987 EDNHKDSKSRSSTDGGRPPSSSRRNQERR 1015 >gb|KDO53043.1| hypothetical protein CISIN_1g002026mg [Citrus sinensis] Length = 978 Score = 669 bits (1727), Expect = 0.0 Identities = 355/569 (62%), Positives = 410/569 (72%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYNS+ + SSWQIPSEVTEL+KK++ D K QSVP NTN V EKGS+ + Sbjct: 413 LVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLKEQSVP--NTNIVIEKGSNAI 470 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 SLS+PA NTGGRDATA RTS + GSSSALDLIKKKLQDSG P + S P SS +A +E N Sbjct: 471 SLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVSSAAATSESN 529 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GSK +E K LQ EN KDK KD GPTKEECI++FKEMLKER Sbjct: 530 GSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIKFKEMLKER 589 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAI + SARRALFE YV+T EGF Sbjct: 590 GVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAAQKAAIEGF 649 Query: 1352 
KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLLEE E+IDH+TDYQTFK++WG DPRF+ALDRKDRELLLNERV PLKR Sbjct: 650 KQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAI 709 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ S+FKS+L++KGDI +SRWSKVKD L+ DPRYKSV+H+DRE +FNEYV ELKAAE Sbjct: 710 RAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEYVRELKAAE 769 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AKA+++ EA+ S+QALLVETIK Sbjct: 770 EEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQALLVETIK 829 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTES+PKLEKDPQGRA N LD SD EKLFREH+KTL ERC +FR LL EVITA Sbjct: 830 DPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRGLLAEVITA 889 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQG 453 EA+AQETEDGKT++ SWSTAK++LK +PRY+KMPRKERE+LWRRHAEEI RK K + DQ Sbjct: 890 EAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRKHKSSLDQN 949 Query: 452 E-KPTEGKVRTSVDSGKHLSGSRRPHDRR 369 E + K R+S D G+ S SRR +RR Sbjct: 950 EDNHKDSKSRSSTDGGRPPSSSRRNQERR 978 >ref|XP_006484634.1| PREDICTED: pre-mRNA-processing protein 40C [Citrus sinensis] Length = 978 Score = 669 bits (1727), Expect = 0.0 Identities = 355/569 (62%), Positives = 410/569 (72%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYNS+ + SSWQIPSEVTEL+KK++ D K QSVP NTN V EKGS+ + Sbjct: 413 LVTTNDGKKYYYNSKMKVSSWQIPSEVTELKKKEDDDTLKEQSVP--NTNIVIEKGSNAI 470 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 SLS+PA NTGGRDATA RTS + GSSSALDLIKKKLQDSG P + S P SS +A +E N Sbjct: 471 SLSSPAVNTGGRDATALRTSSMPGSSSALDLIKKKLQDSGTP-TASPAPVSSAAATSESN 529 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GSK +E K LQ EN KDK KD GPTKEECI++FKEMLKER Sbjct: 530 GSKAVEVTVKGLQNENTKDKLKDINGDGTMSDSSSDSEDGETGPTKEECIIKFKEMLKER 589 Query: 1532 
GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAI + SARRALFE YV+T EGF Sbjct: 590 GVAPFSKWEKELPKIVFDPRFKAIQSQSARRALFERYVKTRAEEERKEKRAAQKAAIEGF 649 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLLEE E+IDH+TDYQTFK++WG DPRF+ALDRKDRELLLNERV PLKR Sbjct: 650 KQLLEEVSEDIDHSTDYQTFKKKWGSDPRFEALDRKDRELLLNERVLPLKRAAEEKAQAI 709 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ S+FKS+L++KGDI +SRWSKVKD L+ DPRYKSV+H+DRE +FNEYV ELKAAE Sbjct: 710 RAAAASSFKSMLREKGDITLSSRWSKVKDILRDDPRYKSVRHEDREVIFNEYVRELKAAE 769 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AKA+++ EA+ S+QALLVETIK Sbjct: 770 EEAEREAKARREEQEKLKEREREMRKRKEREEQEMERVRLKVRRKEAVTSFQALLVETIK 829 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTES+PKLEKDPQGRA N LD SD EKLFREH+KTL ERC +FR LL EVITA Sbjct: 830 DPQASWTESRPKLEKDPQGRATNADLDSSDREKLFREHIKTLYERCAHDFRGLLAEVITA 889 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQG 453 EA+AQETEDGKT++ SWSTAK++LK +PRY+KMPRKERE+LWRRHAEEI RK K + DQ Sbjct: 890 EAAAQETEDGKTVLNSWSTAKRVLKPEPRYSKMPRKEREALWRRHAEEIQRKHKSSLDQN 949 Query: 452 E-KPTEGKVRTSVDSGKHLSGSRRPHDRR 369 E + K R+S D G+ S SRR +RR Sbjct: 950 EDNHKDSKSRSSTDGGRPPSSSRRNQERR 978 >ref|XP_002515795.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Ricinus communis] gi|223545064|gb|EEF46576.1| Pre-mRNA-processing protein PRP40, putative [Ricinus communis] Length = 886 Score = 661 bits (1706), Expect = 0.0 Identities = 346/570 (60%), Positives = 415/570 (72%), Gaps = 2/570 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 L+TTNDGK YYYN++T+ SSWQIPSEVTEL+KKQEA+ K Q + V +++ + EKGS + Sbjct: 318 LITTNDGKNYYYNNKTKLSSWQIPSEVTELKKKQEAE-LKEQEMSVSSSSVLNEKGSVQI 376 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 SLS PA NTGGRDATA R S G+SSALDLIKKKLQDSG P ++S P S G E N Sbjct: 377 
SLSAPAINTGGRDATALRASNALGASSALDLIKKKLQDSGTPVTSSPAPVSLGITTPESN 436 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ +EA K L EN+K+K KDA GPTKEECI+QFK+MLKER Sbjct: 437 GSRAMEATSKGLPSENSKEKLKDANGDANASDSSSDSEEEDNGPTKEECIIQFKDMLKER 496 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 G+APFSKWEK LPKIVFDPRF+AIP+HSARR+LFEHYV+T EGF Sbjct: 497 GIAPFSKWEKVLPKIVFDPRFQAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGF 556 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 +QLLEEA EEIDHNTDYQ+F+R+WG DPRF+A+DRKDRE LL+ERV PLK+ Sbjct: 557 RQLLEEASEEIDHNTDYQSFRRKWGNDPRFEAVDRKDREHLLHERVLPLKKAAQEKAQAE 616 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ ++FKS+LQDKGD+ NSRWSKVK+SL++DPRYKSVKH++RE LFNEY++ELKAAE Sbjct: 617 RAAAAASFKSMLQDKGDLTVNSRWSKVKESLRNDPRYKSVKHEEREVLFNEYLSELKAAE 676 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE KAK K++ EA+ S+QALLVETIK Sbjct: 677 EEAEWKAKVKREEQEKLKERERELRKRKEREEQEMERVREKVRRKEAVASFQALLVETIK 736 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTESK +LEKDPQGR NP+LD SD EKLFREHVK L ERC EF+ALL EVI A Sbjct: 737 DPQASWTESKTRLEKDPQGRGTNPNLDPSDTEKLFREHVKMLHERCTNEFKALLAEVINA 796 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQG 453 EA++Q+TEDGKT++ SW+TAK++LK DPRYNKMPRKERE LWRRHAE++LRKQK D+ Sbjct: 797 EAASQKTEDGKTVLDSWTTAKRVLKLDPRYNKMPRKEREVLWRRHAEDMLRKQKTTLDEK 856 Query: 452 E-KPTEGKVRTS-VDSGKHLSGSRRPHDRR 369 E K T+ + R+S DSG+HLSGS+R HDRR Sbjct: 857 EDKHTDPRGRSSTTDSGRHLSGSKRTHDRR 886 >ref|XP_012089638.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Jatropha curcas] gi|802761021|ref|XP_012089639.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Jatropha curcas] gi|802761024|ref|XP_012089640.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Jatropha curcas] Length = 817 Score = 658 bits (1697), Expect = 0.0 Identities = 346/569 (60%), Positives = 
402/569 (70%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 L+TTNDGK+YYYN++T+ SSWQIPSEVTEL KKQEA+ K V ++ +N TEKGS PV Sbjct: 249 LITTNDGKKYYYNNKTKLSSWQIPSEVTELTKKQEAEVSKELEVSLLRSNVSTEKGSGPV 308 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 SLS PA NTGGRDATA RTS G SSALDLIKKKLQ+SG P ++S S G E N Sbjct: 309 SLSAPAINTGGRDATALRTSSAPGPSSALDLIKKKLQESGTPVNSSPALVSLGMGTPESN 368 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ EA K L E + DK KD GPTKEECI+QFKEMLKER Sbjct: 369 GSRAAEATAKGLLSETSNDKLKDTNGGGNASDSSSDSEDEDSGPTKEECIIQFKEMLKER 428 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 G+APFSKWEKELPKIVFDPRFKAIP+HSARR+LFEHYV+T EGF Sbjct: 429 GIAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEQRKEKRASQKAAIEGF 488 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLL EA E+ID TDYQTF+++W DPRF+ALDRKDRE LLNERV PLK+ Sbjct: 489 KQLLVEASEDIDQYTDYQTFRKKWENDPRFEALDRKDREHLLNERVIPLKKAAQEKVQAE 548 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ ++FKS+LQDKGDI NSRWSKVK+SL++DPRYKSVKH+DRE LFNEY++ELKA E Sbjct: 549 RAAAAASFKSMLQDKGDITINSRWSKVKESLRNDPRYKSVKHEDREFLFNEYLSELKAVE 608 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AK K++ EA+ S+QALLVETIK Sbjct: 609 EEAEREAKVKKEEQEKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSFQALLVETIK 668 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTESKPKLEKD QGRA NP LD SD EKLFREHVK L ERC +F+ALL EVI A Sbjct: 669 DPQASWTESKPKLEKDSQGRATNPDLDPSDTEKLFREHVKMLHERCTQDFKALLAEVINA 728 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ- 456 E +AQ++E+GKT++ SWST K+LLK DPRYNKMPRKERE LWRR+ ++ILRKQ+ DQ Sbjct: 729 ETAAQKSENGKTVLDSWSTVKRLLKPDPRYNKMPRKEREILWRRYTQDILRKQQTTLDQK 788 Query: 455 GEKPTEGKVRTSVDSGKHLSGSRRPHDRR 369 EK T+ K R S DSG++LSGSRR HD R Sbjct: 789 EEKHTDSKSRNSADSGRYLSGSRRTHDGR 817 
>ref|XP_012089634.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Jatropha curcas] gi|802761009|ref|XP_012089635.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Jatropha curcas] gi|802761012|ref|XP_012089636.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Jatropha curcas] gi|802761015|ref|XP_012089637.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Jatropha curcas] Length = 846 Score = 658 bits (1697), Expect = 0.0 Identities = 346/569 (60%), Positives = 402/569 (70%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 L+TTNDGK+YYYN++T+ SSWQIPSEVTEL KKQEA+ K V ++ +N TEKGS PV Sbjct: 278 LITTNDGKKYYYNNKTKLSSWQIPSEVTELTKKQEAEVSKELEVSLLRSNVSTEKGSGPV 337 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 SLS PA NTGGRDATA RTS G SSALDLIKKKLQ+SG P ++S S G E N Sbjct: 338 SLSAPAINTGGRDATALRTSSAPGPSSALDLIKKKLQESGTPVNSSPALVSLGMGTPESN 397 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ EA K L E + DK KD GPTKEECI+QFKEMLKER Sbjct: 398 GSRAAEATAKGLLSETSNDKLKDTNGGGNASDSSSDSEDEDSGPTKEECIIQFKEMLKER 457 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 G+APFSKWEKELPKIVFDPRFKAIP+HSARR+LFEHYV+T EGF Sbjct: 458 GIAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEQRKEKRASQKAAIEGF 517 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLL EA E+ID TDYQTF+++W DPRF+ALDRKDRE LLNERV PLK+ Sbjct: 518 KQLLVEASEDIDQYTDYQTFRKKWENDPRFEALDRKDREHLLNERVIPLKKAAQEKVQAE 577 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ ++FKS+LQDKGDI NSRWSKVK+SL++DPRYKSVKH+DRE LFNEY++ELKA E Sbjct: 578 RAAAAASFKSMLQDKGDITINSRWSKVKESLRNDPRYKSVKHEDREFLFNEYLSELKAVE 637 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AK K++ EA+ S+QALLVETIK Sbjct: 638 EEAEREAKVKKEEQEKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSFQALLVETIK 697 Query: 812 
DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTESKPKLEKD QGRA NP LD SD EKLFREHVK L ERC +F+ALL EVI A Sbjct: 698 DPQASWTESKPKLEKDSQGRATNPDLDPSDTEKLFREHVKMLHERCTQDFKALLAEVINA 757 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ- 456 E +AQ++E+GKT++ SWST K+LLK DPRYNKMPRKERE LWRR+ ++ILRKQ+ DQ Sbjct: 758 ETAAQKSENGKTVLDSWSTVKRLLKPDPRYNKMPRKEREILWRRYTQDILRKQQTTLDQK 817 Query: 455 GEKPTEGKVRTSVDSGKHLSGSRRPHDRR 369 EK T+ K R S DSG++LSGSRR HD R Sbjct: 818 EEKHTDSKSRNSADSGRYLSGSRRTHDGR 846 >gb|KDP22962.1| hypothetical protein JCGZ_01659 [Jatropha curcas] Length = 846 Score = 656 bits (1692), Expect = 0.0 Identities = 345/569 (60%), Positives = 401/569 (70%), Gaps = 1/569 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 L+TTNDGK+YYYN++T+ SWQIPSEVTEL KKQEA+ K V ++ +N TEKGS PV Sbjct: 278 LITTNDGKKYYYNNKTKVCSWQIPSEVTELTKKQEAEVSKELEVSLLRSNVSTEKGSGPV 337 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 SLS PA NTGGRDATA RTS G SSALDLIKKKLQ+SG P ++S S G E N Sbjct: 338 SLSAPAINTGGRDATALRTSSAPGPSSALDLIKKKLQESGTPVNSSPALVSLGMGTPESN 397 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ EA K L E + DK KD GPTKEECI+QFKEMLKER Sbjct: 398 GSRAAEATAKGLLSETSNDKLKDTNGGGNASDSSSDSEDEDSGPTKEECIIQFKEMLKER 457 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 G+APFSKWEKELPKIVFDPRFKAIP+HSARR+LFEHYV+T EGF Sbjct: 458 GIAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEQRKEKRASQKAAIEGF 517 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLL EA E+ID TDYQTF+++W DPRF+ALDRKDRE LLNERV PLK+ Sbjct: 518 KQLLVEASEDIDQYTDYQTFRKKWENDPRFEALDRKDREHLLNERVIPLKKAAQEKVQAE 577 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ ++FKS+LQDKGDI NSRWSKVK+SL++DPRYKSVKH+DRE LFNEY++ELKA E Sbjct: 578 RAAAAASFKSMLQDKGDITINSRWSKVKESLRNDPRYKSVKHEDREFLFNEYLSELKAVE 637 Query: 992 
EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AK K++ EA+ S+QALLVETIK Sbjct: 638 EEAEREAKVKKEEQEKLKERERELRKRKEREEQEMERVRLKVRRKEAVSSFQALLVETIK 697 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTESKPKLEKD QGRA NP LD SD EKLFREHVK L ERC +F+ALL EVI A Sbjct: 698 DPQASWTESKPKLEKDSQGRATNPDLDPSDTEKLFREHVKMLHERCTQDFKALLAEVINA 757 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQ- 456 E +AQ++E+GKT++ SWST K+LLK DPRYNKMPRKERE LWRR+ ++ILRKQ+ DQ Sbjct: 758 ETAAQKSENGKTVLDSWSTVKRLLKPDPRYNKMPRKEREILWRRYTQDILRKQQTTLDQK 817 Query: 455 GEKPTEGKVRTSVDSGKHLSGSRRPHDRR 369 EK T+ K R S DSG++LSGSRR HD R Sbjct: 818 EEKHTDSKSRNSADSGRYLSGSRRTHDGR 846 >ref|XP_010112279.1| Transcription elongation regulator 1 [Morus notabilis] gi|587946758|gb|EXC33082.1| Transcription elongation regulator 1 [Morus notabilis] Length = 829 Score = 655 bits (1690), Expect = 0.0 Identities = 336/570 (58%), Positives = 416/570 (72%), Gaps = 2/570 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LV+T+DGK+YYYN++T+ SSWQIP+EVTELRKKQE+D K S V N N + EKGS P+ Sbjct: 260 LVSTSDGKKYYYNNKTKVSSWQIPNEVTELRKKQESDIPKENSTSVPNNNVLAEKGSTPI 319 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 +L+ PA NTGGRDA A R++ GSSSALDLIKKKLQ+ G P ++S G G AA+E N Sbjct: 320 NLNAPAINTGGRDAMALRSTSAQGSSSALDLIKKKLQEFGTPVTSSSGQVQPGIAASESN 379 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+ +E K Q E++KDK KDA GPTKEECI+QFKEMLKER Sbjct: 380 GSRAVEPTAKGQQSESSKDKPKDANGDRNMTDSSSDSEDADSGPTKEECIIQFKEMLKER 439 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKAIP++S RR+LFEHYV+T EGF Sbjct: 440 GVAPFSKWEKELPKIVFDPRFKAIPSYSLRRSLFEHYVKTRVEEERKEKRAALKAAIEGF 499 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 K+LL+EA E+IDH T YQTF+++WG+DPRF ALDRKDRE LLNERV PLKR Sbjct: 500 
KKLLDEASEDIDHKTYYQTFRKKWGDDPRFLALDRKDREHLLNERVLPLKRATEEKAQAI 559 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ SNFKS+L++KGD+ NSRWS+VK+SL+ DPRYKSVKH+DRE LFNEY+++L+AAE Sbjct: 560 RAAAASNFKSMLREKGDVTVNSRWSRVKESLRDDPRYKSVKHEDREVLFNEYLSDLRAAE 619 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AKAK+D EA+ S+QALLVETIK Sbjct: 620 EEVEREAKAKRDEQDKLKERERELRKRKEREEQEMERVRIKVRRKEAVVSFQALLVETIK 679 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQASWTESK KLEKDPQGRA+NP LD S++EKLFREH+KTL ERC E++ALL E++TA Sbjct: 680 DPQASWTESKSKLEKDPQGRASNPDLDSSEMEKLFREHIKTLQERCAREYKALLAELLTA 739 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKA--HD 459 +A+ +ET+DGKT++ SWSTAK+LLK DPRYNKMPRK+RE+LWRR+AE++LRKQ+K+ + Sbjct: 740 DAAERETDDGKTVLNSWSTAKRLLKPDPRYNKMPRKDRETLWRRYAEDMLRKQQKSEPNS 799 Query: 458 QGEKPTEGKVRTSVDSGKHLSGSRRPHDRR 369 + +K + + RTSVDSG+ SG R H+RR Sbjct: 800 KEDKKIDPRNRTSVDSGRLPSGLRGTHERR 829 >ref|XP_010250283.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Nelumbo nucifera] Length = 894 Score = 655 bits (1689), Expect = 0.0 Identities = 347/570 (60%), Positives = 406/570 (71%), Gaps = 2/570 (0%) Frame = -3 Query: 2072 LVTTNDGKRYYYNSRTQSSSWQIPSEVTELRKKQEADAFKAQSVPVINTNAVTEKGSDPV 1893 LVTTNDGK+YYYNS+T+ SSWQ+P EVTELR+K + DA K V N+ A +EK S P+ Sbjct: 326 LVTTNDGKKYYYNSKTKISSWQVPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPI 385 Query: 1892 SLSTPAANTGGRDATAARTSIVSGSSSALDLIKKKLQDSGIPDSTSQGPASSGSAATELN 1713 S++ PA NTGGR+AT+ R S V+GSSSALDLIKKKLQDS P ++S P SSG +LN Sbjct: 386 SVTAPAINTGGREATSLRPSGVAGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLN 445 Query: 1712 GSKPIEAPGKDLQMENNKDKQKDAXXXXXXXXXXXXXXXXXXGPTKEECILQFKEMLKER 1533 GS+P+EA K LQ EN KDK KD GP+KEECI+QFKEMLKER Sbjct: 446 GSRPVEAAVKGLQSEN-KDKVKDINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKER 504 Query: 1532 GVAPFSKWEKELPKIVFDPRFKAIPNHSARRALFEHYVRTXXXXXXXXXXXXXXXXXEGF 1353 GVAPFSKWEKELPKIVFDPRFKA+P +SARRALFEHYVRT EGF Sbjct: 505 
GVAPFSKWEKELPKIVFDPRFKAVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGF 564 Query: 1352 KQLLEEAKEEIDHNTDYQTFKRRWGEDPRFQALDRKDRELLLNERVFPLKRTXXXXXXXX 1173 KQLLEEA E+ID TDYQTFK +WG DPRF+ALDRK+RELLLNERV PLK+ Sbjct: 565 KQLLEEASEDIDQRTDYQTFKMKWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAI 624 Query: 1172 XXASVSNFKSLLQDKGDINSNSRWSKVKDSLKSDPRYKSVKHDDREKLFNEYVAELKAAE 993 A+ S FKSLL++KGDIN++SRWS+VKDSL+SDPRYKSVKH+DRE LFNEY++ELKAA+ Sbjct: 625 RAAAASGFKSLLREKGDINTSSRWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAAD 684 Query: 992 EETVRKAKAKQDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEAIESYQALLVETIK 813 EE R+AK K++ EA+ YQALLVETIK Sbjct: 685 EEAEREAKVKREEEDKLKEREREMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIK 744 Query: 812 DPQASWTESKPKLEKDPQGRAANPHLDKSDLEKLFREHVKTLLERCVLEFRALLTEVITA 633 DPQ SWTES+P+LEKDPQGRA N LD D EKLFREHVK L ERC EFR LL EVIT Sbjct: 745 DPQVSWTESRPRLEKDPQGRATNSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITT 804 Query: 632 EASAQETEDGKTIITSWSTAKQLLKSDPRYNKMPRKERESLWRRHAEEILRKQKKAHDQG 453 EA++Q T DGKT++TSWSTAK+LLK+DPRY+KMPRKERE+LWRRHAEEIL K+K D Sbjct: 805 EAASQMTNDGKTVLTSWSTAKRLLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPK 864 Query: 452 EKP--TEGKVRTSVDSGKHLSGSRRPHDRR 369 E+ E K R+S+DSG+ +G RR H RR Sbjct: 865 EEKLNIETKARSSLDSGRSPTGLRRSHSRR 894