BLASTX nr result
ID: Astragalus23_contig00000723
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00000723 (2918 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004505734.1| PREDICTED: pre-mRNA-processing protein 40C i...  1291   0.0
ref|XP_003538973.2| PREDICTED: pre-mRNA-processing protein 40C-l...  1249   0.0
ref|XP_006590824.1| PREDICTED: pre-mRNA-processing protein 40C-l...  1249   0.0
ref|XP_006590812.2| PREDICTED: pre-mRNA-processing protein 40C-l...  1248   0.0
ref|XP_014619436.1| PREDICTED: pre-mRNA-processing protein 40C-l...  1248   0.0
gb|KHN45824.1| Transcription elongation regulator 1 [Glycine soja]   1244   0.0
ref|XP_020214216.1| pre-mRNA-processing protein 40C isoform X1 [...  1236   0.0
ref|XP_003607201.2| pre-mRNA-processing protein 40C [Medicago tr...  1224   0.0
gb|KHN05434.1| Transcription elongation regulator 1, partial [Gl...  1219   0.0
ref|XP_014493851.1| pre-mRNA-processing protein 40C isoform X1 [...  1180   0.0
ref|XP_017433133.1| PREDICTED: pre-mRNA-processing protein 40C [...  1179   0.0
gb|KYP68375.1| Transcription elongation regulator 1 [Cajanus cajan]  1175   0.0
ref|XP_015950130.1| pre-mRNA-processing protein 40C isoform X1 [...  1174   0.0
ref|XP_007131663.1| hypothetical protein PHAVU_011G031500g [Phas...  1167   0.0
ref|XP_020991278.1| pre-mRNA-processing protein 40C isoform X2 [...  1162   0.0
ref|XP_003540642.1| PREDICTED: pre-mRNA-processing protein 40C-l...  1158   0.0
ref|XP_006592053.1| PREDICTED: pre-mRNA-processing protein 40C-l...
1132 0.0 ref|XP_019413264.1| PREDICTED: pre-mRNA-processing protein 40C [... 1124 0.0 gb|OIV99579.1| hypothetical protein TanjilG_17389 [Lupinus angus... 1108 0.0 ref|XP_014619437.1| PREDICTED: pre-mRNA-processing protein 40C-l... 1103 0.0 >ref|XP_004505734.1| PREDICTED: pre-mRNA-processing protein 40C isoform X1 [Cicer arietinum] ref|XP_012572707.1| PREDICTED: pre-mRNA-processing protein 40C isoform X2 [Cicer arietinum] Length = 953 Score = 1291 bits (3341), Expect = 0.0 Identities = 675/926 (72%), Positives = 721/926 (77%), Gaps = 1/926 (0%) Frame = -3 Query: 2913 SYNMHHNVNASGNSQQSPAHPGMKSNSPMVVQPPVPPGLSPHAA-PSFLYNTSQNVPPFS 2737 SY ++ NVNASGNSQQS +H GMK NS V PP+ PG P AA PSF YN SQ+V PF+ Sbjct: 33 SYGVNQNVNASGNSQQSSSHSGMKPNSG--VNPPLVPGFPPRAATPSFSYNVSQSVAPFT 90 Query: 2736 SNQHLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWMPTAX 2557 NQH S TNM AQ+ SKVSSASS HP PAP SIS+MPPPSDPNYRPTT WMPTA Sbjct: 91 GNQHAQSSTNMSDSIAQDFSKVSSASSNPHPIPAPTSISAMPPPSDPNYRPTTLWMPTAP 150 Query: 2556 XXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIASDPTA 2377 ++P +PAAPS+ TD S+AV R NM TAPI SDP A Sbjct: 151 TFPVHTLMPGTPGPPGLAKPG---IMPSNPAAPSSNTDFPSSAVPRPNMPTAPIGSDPNA 207 Query: 2376 SQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVHLPAV 2197 S KGLPYPPIPSMVAPPQGFWLQPPQMSGV RPPFLQYP ARGV LPAV Sbjct: 208 SHKGLPYPPIPSMVAPPQGFWLQPPQMSGVHRPPFLQYP--AAFPGPFPFPARGVTLPAV 265 Query: 2196 PVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKKLNAVVTQNEDATN 2017 PVPDSQPPGVTPVGA G A S S HQ RGT GLQT VISAH DDKKLNA VT NEDA N Sbjct: 266 PVPDSQPPGVTPVGAAGISAFSVSSHQLRGTSGLQTVVISAHADDKKLNATVTHNEDAAN 325 Query: 2016 DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLPGTDWQL 1837 DQLDAWTAHKTEAG+VYYYNA+T ESTY KPAGFKGE+HQVSVQPTP SVVDLPGTDWQL Sbjct: 326 DQLDAWTAHKTEAGIVYYYNALTGESTYDKPAGFKGEAHQVSVQPTPVSVVDLPGTDWQL 385 Query: 1836 VSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDRGSGMVT 1657 VSTSDGKKYYYNN+TKTSCWQIPNEVAELKKKQDGD A+DHLM V VL DRG GMVT Sbjct: 386 
VSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDAAKDHLMPVLNATVLPDRGFGMVT 445 Query: 1656 LNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQSGSESNG 1477 LNAPAI TGGRDAA KP SVQS SALDLIKKKLQESGTP+ SS+IP PSVQ GSESNG Sbjct: 446 LNAPAITTGGRDAATVKPFSVQSSPSALDLIKKKLQESGTPITSSSIPMPSVQPGSESNG 505 Query: 1476 SKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKEMLKERG 1297 SKAT+STAK LQNDNSKD++KDANGD N D+GPSKEECINQFKEMLKERG Sbjct: 506 SKATDSTAKSLQNDNSKDRQKDANGDANASDTSSDSEDEDSGPSKEECINQFKEMLKERG 565 Query: 1296 VAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXXXXEGFK 1117 VAPFSKWEKELPK VFDPRFKAIPS+SARRSLFEHYVKT EGFK Sbjct: 566 VAPFSKWEKELPKFVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFK 625 Query: 1116 QLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXXXXXXXX 937 QLLDEASEDINH+TDYHTFRKKW ND RFEA+DRKEREHLLNERVLPLKKA Sbjct: 626 QLLDEASEDINHNTDYHTFRKKWANDSRFEALDRKEREHLLNERVLPLKKAVEEKAQAMW 685 Query: 936 XXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISELKAIEH 757 A FKSML+E+GDI NSRWSR+KESLRDDPRYKSVKHEDRE LFNEYISELKA EH Sbjct: 686 DAAAAGFKSMLKEQGDITFNSRWSRIKESLRDDPRYKSVKHEDREVLFNEYISELKAAEH 745 Query: 756 AAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALLVETIKD 577 AAERE+RAK++EQ+KL RLKIRRK+AVTS QALLVETIKD Sbjct: 746 AAERESRAKKEEQEKLRERERELRKRKEREEHEMERVRLKIRRKEAVTSLQALLVETIKD 805 Query: 576 PMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLAEVLTSE 397 PMASWTESKPKLEKDPQGRA NSDLD AD EKLFR+HIKMLQERC +FR LLAEVLTSE Sbjct: 806 PMASWTESKPKLEKDPQGRATNSDLDSADMEKLFRDHIKMLQERCAHDFRALLAEVLTSE 865 Query: 396 AASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKSANDSKE 217 AASQET+D KT LNSWSTAKRLLKSD RY+K PR +RE LWRRYVEDMLRR+KS++DSKE Sbjct: 866 AASQETDDGKTVLNSWSTAKRLLKSDPRYNKFPRKDREALWRRYVEDMLRRQKSSHDSKE 925 Query: 216 EKRTDARSKYSLESSKLPGESRRSLE 139 +K TDAR + S +SSKLP ES RS E Sbjct: 926 DKHTDARGRNSQQSSKLPLESGRSHE 951 >ref|XP_003538973.2| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Glycine max] ref|XP_014619433.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 
[Glycine max] gb|KRH29195.1| hypothetical protein GLYMA_11G103100 [Glycine max] gb|KRH29196.1| hypothetical protein GLYMA_11G103100 [Glycine max] Length = 968 Score = 1249 bits (3233), Expect = 0.0 Identities = 664/932 (71%), Positives = 719/932 (77%), Gaps = 6/932 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F+Y M NVNASG+SQQS HPGMKSNS PMVVQPP G+S HAAPSF YN Q+ Sbjct: 41 FAYGMLQNVNASGSSQQSSTHPGMKSNSAVNPMVVQPP---GVSLHAAPSFSYNIPQSGA 97 Query: 2745 PFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSSNQ H S TNM AQ+V K+SSASSI H PA S S MPPPSDPNYRP TSWM Sbjct: 98 IFSSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDPNYRPATSWM 157 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I +PAAPSTGTDSS A+LR NM T+ IAS Sbjct: 158 PTAMSFPVLPVMPTQGNPGPPGLASSA-IISSNPAAPSTGTDSSPAALLRPNMPTSAIAS 216 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPTA QKGLPYP +P+M APPQG WLQPPQMSGVLRPP+LQYP ARGV Sbjct: 217 DPTAPQKGLPYPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYP--APFPGPFPFPARGVA 274 Query: 2208 LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQPPGVTPVGA G ++ +S HQ RGT LQTEVIS DDKK LN+V T N Sbjct: 275 LPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDTVN 334 Query: 2031 EDATN-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 EDA N DQLDAWTAHKTEAG++YYYNAVT ESTY KPAGFKGESHQVS QP P S++DLP Sbjct: 335 EDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMDLP 394 Query: 1854 GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW+LVSTSDGKKYYYNN+TKTSCWQIPNEVAELKKKQDGDV +DHLMSVS TNVLSDR Sbjct: 395 GTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLSDR 454 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSGMVTLNAPAINTGGRDAAA KPSS+Q+ SALDLIKKKLQ+SGTPVASS+IP PSVQ+ Sbjct: 455 GSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKKKLQDSGTPVASSSIPAPSVQT 514 Query: 1494 
GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 G ESNGSK +STAKGLQ DN+KDK KD NGD NV DNGPSKEECI QFKE Sbjct: 515 GPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTSSDSEDEDNGPSKEECIIQFKE 574 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 575 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAALKA 634 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EGFK+LLDEASEDIN++TDY TFRKKW ND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 635 AIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEALDRKEQEHLLNERVLPLKKAAEE 694 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 ASFKSML+ERGDI+ NSRWSRVKE+LRDDPRYK V+HEDRE LFNEYISE Sbjct: 695 KAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLRDDPRYKCVRHEDREVLFNEYISE 754 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA EHAAERET+AKR+EQDKL RLKIRRKDAVT FQALL Sbjct: 755 LKAAEHAAERETKAKREEQDKLRERERELRKRKEREEQEMERVRLKIRRKDAVTLFQALL 814 Query: 594 VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLA 415 VETIKDP+ SWTESKPKLEKD Q RA N DLDP DTEKLFREH+KMLQERC EFR LLA Sbjct: 815 VETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTEKLFREHVKMLQERCAHEFRVLLA 874 Query: 414 EVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKS 235 EVLTS+AASQET+D KT LNSWSTAKRLLKSD RY+KVPR ERE LWRRY EDMLR +K+ Sbjct: 875 EVLTSDAASQETDDGKTVLNSWSTAKRLLKSDPRYNKVPRKEREALWRRYAEDMLRGQKA 934 Query: 234 ANDSKEEKRTDARSKYSLESSKLPGESRRSLE 139 ++DS+EEK TDA + LESSK P ES RS E Sbjct: 935 SHDSREEKHTDAEGRNYLESSKPPFESGRSYE 966 >ref|XP_006590824.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine max] ref|XP_014619432.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine max] gb|KRH29197.1| hypothetical protein GLYMA_11G103100 [Glycine max] gb|KRH29198.1| hypothetical protein GLYMA_11G103100 [Glycine max] Length = 980 Score = 1249 bits (3233), Expect = 0.0 Identities = 664/932 (71%), Positives = 719/932 (77%), Gaps = 6/932 (0%) Frame 
= -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F+Y M NVNASG+SQQS HPGMKSNS PMVVQPP G+S HAAPSF YN Q+ Sbjct: 53 FAYGMLQNVNASGSSQQSSTHPGMKSNSAVNPMVVQPP---GVSLHAAPSFSYNIPQSGA 109 Query: 2745 PFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSSNQ H S TNM AQ+V K+SSASSI H PA S S MPPPSDPNYRP TSWM Sbjct: 110 IFSSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDPNYRPATSWM 169 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I +PAAPSTGTDSS A+LR NM T+ IAS Sbjct: 170 PTAMSFPVLPVMPTQGNPGPPGLASSA-IISSNPAAPSTGTDSSPAALLRPNMPTSAIAS 228 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPTA QKGLPYP +P+M APPQG WLQPPQMSGVLRPP+LQYP ARGV Sbjct: 229 DPTAPQKGLPYPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYP--APFPGPFPFPARGVA 286 Query: 2208 LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQPPGVTPVGA G ++ +S HQ RGT LQTEVIS DDKK LN+V T N Sbjct: 287 LPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDTVN 346 Query: 2031 EDATN-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 EDA N DQLDAWTAHKTEAG++YYYNAVT ESTY KPAGFKGESHQVS QP P S++DLP Sbjct: 347 EDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMDLP 406 Query: 1854 GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW+LVSTSDGKKYYYNN+TKTSCWQIPNEVAELKKKQDGDV +DHLMSVS TNVLSDR Sbjct: 407 GTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLSDR 466 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSGMVTLNAPAINTGGRDAAA KPSS+Q+ SALDLIKKKLQ+SGTPVASS+IP PSVQ+ Sbjct: 467 GSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKKKLQDSGTPVASSSIPAPSVQT 526 Query: 1494 GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 G ESNGSK +STAKGLQ DN+KDK KD NGD NV DNGPSKEECI QFKE Sbjct: 527 GPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTSSDSEDEDNGPSKEECIIQFKE 586 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT 
Sbjct: 587 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAALKA 646 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EGFK+LLDEASEDIN++TDY TFRKKW ND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 647 AIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEALDRKEQEHLLNERVLPLKKAAEE 706 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 ASFKSML+ERGDI+ NSRWSRVKE+LRDDPRYK V+HEDRE LFNEYISE Sbjct: 707 KAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLRDDPRYKCVRHEDREVLFNEYISE 766 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA EHAAERET+AKR+EQDKL RLKIRRKDAVT FQALL Sbjct: 767 LKAAEHAAERETKAKREEQDKLRERERELRKRKEREEQEMERVRLKIRRKDAVTLFQALL 826 Query: 594 VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLA 415 VETIKDP+ SWTESKPKLEKD Q RA N DLDP DTEKLFREH+KMLQERC EFR LLA Sbjct: 827 VETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTEKLFREHVKMLQERCAHEFRVLLA 886 Query: 414 EVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKS 235 EVLTS+AASQET+D KT LNSWSTAKRLLKSD RY+KVPR ERE LWRRY EDMLR +K+ Sbjct: 887 EVLTSDAASQETDDGKTVLNSWSTAKRLLKSDPRYNKVPRKEREALWRRYAEDMLRGQKA 946 Query: 234 ANDSKEEKRTDARSKYSLESSKLPGESRRSLE 139 ++DS+EEK TDA + LESSK P ES RS E Sbjct: 947 SHDSREEKHTDAEGRNYLESSKPPFESGRSYE 978 >ref|XP_006590812.2| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine max] ref|XP_014619435.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine max] Length = 980 Score = 1248 bits (3230), Expect = 0.0 Identities = 663/932 (71%), Positives = 718/932 (77%), Gaps = 6/932 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F+Y M NVNASG+SQQS HPGMKSNS PMVVQPP G+S HAAPSF YN Q+ Sbjct: 53 FAYGMLQNVNASGSSQQSSTHPGMKSNSAVNPMVVQPP---GVSLHAAPSFSYNIPQSGA 109 Query: 2745 PFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSSNQ H S TNM AQ+V K+SSASSI H PA S S MPPPSDPNYRP TSWM Sbjct: 110 IFSSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDPNYRPATSWM 169 Query: 2568 
PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I +PAAPSTGTDSS A+LR NM T+ IAS Sbjct: 170 PTAMSFPVLPVMPTQGNPGPPGLASSA-IISSNPAAPSTGTDSSPAALLRPNMPTSAIAS 228 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPTA QKGLPYP +P+M APPQG WLQPPQMSGVLRPP+LQYP ARGV Sbjct: 229 DPTAPQKGLPYPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYP--APFPGPFPFPARGVA 286 Query: 2208 LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQPPGVTPVGA G ++ +S HQ RGT LQTEVIS DDKK LN+V T N Sbjct: 287 LPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDTVN 346 Query: 2031 EDATN-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 EDA N DQLDAWTAHKTEAG++YYYNAVT ESTY KPAGFKGESHQVS QP P S++DLP Sbjct: 347 EDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMDLP 406 Query: 1854 GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW+LVSTSDGKKYYYNN+TKTSCWQIPNEVAELKKKQDGDV +DHLMSVS TNVLSDR Sbjct: 407 GTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLSDR 466 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSGMVTLNAPAINTGGRDAAA KPSS+Q+ SALDLIKKKLQ+SGTPVASS+IP PSVQ+ Sbjct: 467 GSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKKKLQDSGTPVASSSIPAPSVQT 526 Query: 1494 GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 G ESNGSK +STAKGLQ DN+KDK KD NGD NV DNGPSKEECI QFKE Sbjct: 527 GPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTSSDSEDEDNGPSKEECIIQFKE 586 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGV PFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 587 MLKERGVVPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKA 646 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EGFK+LLDEASEDIN++TDY TFRKKW ND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 647 AIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEALDRKEQEHLLNERVLPLKKAAEE 706 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 ASFKSML+ERGDI+ NSRWSRVKE+LRDDPRYK V+HEDRE LFNEYISE Sbjct: 707 
KAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLRDDPRYKCVRHEDREVLFNEYISE 766 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA EHAAERET+AK +EQDKL RLKIRRKDAVT FQALL Sbjct: 767 LKAAEHAAERETKAKMEEQDKLRERERELRKRKEREEQEMERVRLKIRRKDAVTLFQALL 826 Query: 594 VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLA 415 VETIKDP+ SWTESKPKLEKD Q RA N DLDP DTEKLFREH+KMLQERC EFR LLA Sbjct: 827 VETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTEKLFREHVKMLQERCAHEFRVLLA 886 Query: 414 EVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKS 235 EVLTS+AASQET+D KT LNSWSTAKRLLKSD RY+KVPR ERE LWRRY EDMLRR+K+ Sbjct: 887 EVLTSDAASQETDDGKTVLNSWSTAKRLLKSDPRYNKVPRKEREALWRRYAEDMLRRQKA 946 Query: 234 ANDSKEEKRTDARSKYSLESSKLPGESRRSLE 139 ++DS+EEK TDA + LESSK P ES RS E Sbjct: 947 SHDSREEKHTDAEGRNYLESSKHPFESGRSYE 978 >ref|XP_014619436.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Glycine max] ref|XP_006590813.2| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Glycine max] gb|KRH29182.1| hypothetical protein GLYMA_11G102600 [Glycine max] gb|KRH29183.1| hypothetical protein GLYMA_11G102600 [Glycine max] Length = 968 Score = 1248 bits (3230), Expect = 0.0 Identities = 663/932 (71%), Positives = 718/932 (77%), Gaps = 6/932 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F+Y M NVNASG+SQQS HPGMKSNS PMVVQPP G+S HAAPSF YN Q+ Sbjct: 41 FAYGMLQNVNASGSSQQSSTHPGMKSNSAVNPMVVQPP---GVSLHAAPSFSYNIPQSGA 97 Query: 2745 PFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSSNQ H S TNM AQ+V K+SSASSI H PA S S MPPPSDPNYRP TSWM Sbjct: 98 IFSSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDPNYRPATSWM 157 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I +PAAPSTGTDSS A+LR NM T+ IAS Sbjct: 158 PTAMSFPVLPVMPTQGNPGPPGLASSA-IISSNPAAPSTGTDSSPAALLRPNMPTSAIAS 216 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPTA QKGLPYP +P+M APPQG WLQPPQMSGVLRPP+LQYP ARGV Sbjct: 
217 DPTAPQKGLPYPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYP--APFPGPFPFPARGVA 274 Query: 2208 LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQPPGVTPVGA G ++ +S HQ RGT LQTEVIS DDKK LN+V T N Sbjct: 275 LPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDTVN 334 Query: 2031 EDATN-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 EDA N DQLDAWTAHKTEAG++YYYNAVT ESTY KPAGFKGESHQVS QP P S++DLP Sbjct: 335 EDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMDLP 394 Query: 1854 GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW+LVSTSDGKKYYYNN+TKTSCWQIPNEVAELKKKQDGDV +DHLMSVS TNVLSDR Sbjct: 395 GTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLSDR 454 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSGMVTLNAPAINTGGRDAAA KPSS+Q+ SALDLIKKKLQ+SGTPVASS+IP PSVQ+ Sbjct: 455 GSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKKKLQDSGTPVASSSIPAPSVQT 514 Query: 1494 GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 G ESNGSK +STAKGLQ DN+KDK KD NGD NV DNGPSKEECI QFKE Sbjct: 515 GPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTSSDSEDEDNGPSKEECIIQFKE 574 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGV PFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 575 MLKERGVVPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKA 634 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EGFK+LLDEASEDIN++TDY TFRKKW ND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 635 AIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEALDRKEQEHLLNERVLPLKKAAEE 694 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 ASFKSML+ERGDI+ NSRWSRVKE+LRDDPRYK V+HEDRE LFNEYISE Sbjct: 695 KAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLRDDPRYKCVRHEDREVLFNEYISE 754 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA EHAAERET+AK +EQDKL RLKIRRKDAVT FQALL Sbjct: 755 LKAAEHAAERETKAKMEEQDKLRERERELRKRKEREEQEMERVRLKIRRKDAVTLFQALL 814 Query: 594 
VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLA 415 VETIKDP+ SWTESKPKLEKD Q RA N DLDP DTEKLFREH+KMLQERC EFR LLA Sbjct: 815 VETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTEKLFREHVKMLQERCAHEFRVLLA 874 Query: 414 EVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKS 235 EVLTS+AASQET+D KT LNSWSTAKRLLKSD RY+KVPR ERE LWRRY EDMLRR+K+ Sbjct: 875 EVLTSDAASQETDDGKTVLNSWSTAKRLLKSDPRYNKVPRKEREALWRRYAEDMLRRQKA 934 Query: 234 ANDSKEEKRTDARSKYSLESSKLPGESRRSLE 139 ++DS+EEK TDA + LESSK P ES RS E Sbjct: 935 SHDSREEKHTDAEGRNYLESSKHPFESGRSYE 966 >gb|KHN45824.1| Transcription elongation regulator 1 [Glycine soja] Length = 924 Score = 1244 bits (3218), Expect = 0.0 Identities = 662/928 (71%), Positives = 716/928 (77%), Gaps = 6/928 (0%) Frame = -3 Query: 2904 MHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVPPFSS 2734 M NVNASG+SQQS HPGMKSNS PMVVQPP G+S HAAPSF YN Q+ FSS Sbjct: 1 MLQNVNASGSSQQSSTHPGMKSNSAVNPMVVQPP---GVSLHAAPSFSYNIPQSGAIFSS 57 Query: 2733 NQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWMPTAX 2557 NQ H S TNM AQ+V K+SSASSI H PA S S MPPPSDPNY P TSWMPTA Sbjct: 58 NQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSLMPPPSDPNYCPATSWMPTAM 117 Query: 2556 XXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIASDPTA 2377 +I +PAAPSTGTDSS A+LR NM T+ IASDPTA Sbjct: 118 SFPVLPVMPTQGNPGPPGLASSA-IISSNPAAPSTGTDSSPAALLRPNMPTSAIASDPTA 176 Query: 2376 SQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVHLPAV 2197 QKGLPYP +P+M APPQG WLQPPQMSGVLRPP+LQYP ARGV LPAV Sbjct: 177 PQKGLPYPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYP--APFPGPFPFPARGVALPAV 234 Query: 2196 PVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQNEDAT 2020 P+PDSQPPGVTPVGA G ++ +S HQ RGT LQTEVIS DDKK LN+V T NEDA Sbjct: 235 PIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDTVNEDAA 294 Query: 2019 N-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLPGTDW 1843 N DQLDAWTAHKTEAG++YYYNAVT ESTY KPAGFKGESHQVS QP P S++DLPGTDW Sbjct: 295 NNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMDLPGTDW 354 Query: 1842 
QLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDRGSGM 1663 +LVSTSDGKKYYYNN+TKTSCWQIPNEVAELKKKQDGDV +DHLMSVS TNVLSDRGSGM Sbjct: 355 RLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLSDRGSGM 414 Query: 1662 VTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQSGSES 1483 VTLNAPAINTGGRDAAA KPSS+Q+ SALDLIKKKLQ+SGTPVASS+IP PSVQ+G ES Sbjct: 415 VTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKKKLQDSGTPVASSSIPAPSVQTGPES 474 Query: 1482 NGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKEMLKE 1303 NGSK +STAKGLQ DN+KDK KD NGD NV DNGPSKEECI QFKEMLKE Sbjct: 475 NGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTSSDSEDEDNGPSKEECIIQFKEMLKE 534 Query: 1302 RGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXXXXEG 1123 RGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT EG Sbjct: 535 RGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAALKAAIEG 594 Query: 1122 FKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXXXXXX 943 FK+LLDEASEDIN++TDY TFRKKW ND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 595 FKRLLDEASEDINYNTDYQTFRKKWRNDPRFEALDRKEQEHLLNERVLPLKKAAEEKAQA 654 Query: 942 XXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISELKAI 763 ASFKSML+ERGDI+ NSRWSRVKE+LRDDPRYK V+HEDRE LFNEYISELKA Sbjct: 655 MRAAAAASFKSMLKERGDISFNSRWSRVKENLRDDPRYKCVRHEDREVLFNEYISELKAA 714 Query: 762 EHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALLVETI 583 EHAAERET+AKR+EQDKL RLKIRRKDAVT FQALLVETI Sbjct: 715 EHAAERETKAKREEQDKLRERERELRKRKEREEQEMERVRLKIRRKDAVTLFQALLVETI 774 Query: 582 KDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLAEVLT 403 KDP+ SWTESKPKLEKD Q RA N DLDP DTEKLFREH+KMLQERC EFR LLAEVLT Sbjct: 775 KDPLVSWTESKPKLEKDAQRRATNPDLDPLDTEKLFREHVKMLQERCAHEFRVLLAEVLT 834 Query: 402 SEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKSANDS 223 S+AASQET+D KT LNSWSTAKRLLKSD RY+KVPR ERE LWRRY EDMLRR+K+++DS Sbjct: 835 SDAASQETDDGKTVLNSWSTAKRLLKSDPRYNKVPRKEREALWRRYAEDMLRRQKASHDS 894 Query: 222 KEEKRTDARSKYSLESSKLPGESRRSLE 139 +EEK TDA + LESSK P ES RS E Sbjct: 895 REEKHTDAEGRNYLESSKPPFESGRSYE 922 >ref|XP_020214216.1| 
pre-mRNA-processing protein 40C isoform X1 [Cajanus cajan] Length = 967 Score = 1236 bits (3198), Expect = 0.0 Identities = 652/922 (70%), Positives = 713/922 (77%), Gaps = 6/922 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F+Y + H+VNAS +SQ S HP MKSNS PM VQP VP G+S HAAPSF YN Q+ Sbjct: 45 FAYGVLHSVNASVSSQHSSTHPAMKSNSSANPMAVQPQVP-GVSSHAAPSFSYNIPQSGA 103 Query: 2745 PFSSN-QHLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSSN H S TNM AQ+VSK+SSASSI H PA S S MPPPSDPNYRP TSWM Sbjct: 104 SFSSNLHHAQSNTNMSDSAAQDVSKLSSASSIPHSVPAHTSTSIMPPPSDPNYRPATSWM 163 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I +PAAPSTGTDSS A+LR NM T+ IAS Sbjct: 164 PTALPFPVHPLMPTPGNPGPPGLASSA-IISSNPAAPSTGTDSSPAALLRPNMPTSAIAS 222 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPTA QKGLPYP +P+M APPQG WLQPPQMSGVLRPP+LQYP ARGV Sbjct: 223 DPTAPQKGLPYPSMPAMAAPPQGVWLQPPQMSGVLRPPYLQYP--APFPAPFPFPARGVA 280 Query: 2208 LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQPPGVTPVGAVG ++ S HQ RGT LQTEVIS DDKK LNAV TQN Sbjct: 281 LPAVPIPDSQPPGVTPVGAVGGTSTLVSGHQLRGTIALQTEVISGPADDKKKLNAVETQN 340 Query: 2031 EDA-TNDQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 +DA +NDQLDAWTAHKTEAG++YYYNAVT ESTY KPAGFKGE HQVS QPTP S+ DLP Sbjct: 341 QDAASNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGEPHQVSAQPTPVSMTDLP 400 Query: 1854 GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW LVSTSDGKKYYYNN+TKTSCWQIPNEV ELKKKQDGDV +DHLMSV TNVLSDR Sbjct: 401 GTDWMLVSTSDGKKYYYNNRTKTSCWQIPNEVTELKKKQDGDVTKDHLMSVPNTNVLSDR 460 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSG+VTLNAPAINTGGRDAAA K SS Q+ SSALDLIKKKLQ+SGTPVASSAIP PSVQ+ Sbjct: 461 GSGLVTLNAPAINTGGRDAAALKSSSQQTSSSALDLIKKKLQDSGTPVASSAIPAPSVQT 520 Query: 1494 GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 GSESNGSK EST KGLQ D +KDK+KD NGD N+ D+GPSKEECI QFKE Sbjct: 521 
GSESNGSKIVESTTKGLQVDTNKDKQKDTNGDANISDTSSDSEEEDSGPSKEECIIQFKE 580 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 581 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQRA 640 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EGFKQLLDEA EDINH+TDY TFRKKWGND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 641 AIEGFKQLLDEALEDINHNTDYQTFRKKWGNDPRFEALDRKEQEHLLNERVLPLKKAAEE 700 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 A FKSML+ERGDI NSRWSRVKESLRDDPRYKSV+HEDRE LFNEYISE Sbjct: 701 KAQAMRAAAAAGFKSMLKERGDIIFNSRWSRVKESLRDDPRYKSVRHEDREVLFNEYISE 760 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA E+AAERET+AKR+EQDKL R+KIRRK+AVTSFQALL Sbjct: 761 LKAAEYAAERETKAKREEQDKLRERERELRKRKEREEQEMERVRVKIRRKEAVTSFQALL 820 Query: 594 VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLA 415 VETIKDP+ASWTESKPKLEKDPQGRA NSDLDP DTEKLFREH+KMLQERC EFR LLA Sbjct: 821 VETIKDPLASWTESKPKLEKDPQGRATNSDLDPTDTEKLFREHVKMLQERCAHEFRVLLA 880 Query: 414 EVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKS 235 EVLTSEAAS+E++D KT LNSWSTAKR+LKSD RY+KVPR ERE LWRRY EDMLRR+K+ Sbjct: 881 EVLTSEAASRESDDGKTVLNSWSTAKRVLKSDPRYNKVPRKEREALWRRYAEDMLRRQKA 940 Query: 234 ANDSKEEKRTDARSKYSLESSK 169 ++DS+E+K D++ + SLES + Sbjct: 941 SHDSREDKHKDSKPRNSLESGR 962 >ref|XP_003607201.2| pre-mRNA-processing protein 40C [Medicago truncatula] gb|AES89398.2| pre-mRNA-processing protein 40C [Medicago truncatula] Length = 959 Score = 1224 bits (3168), Expect = 0.0 Identities = 658/930 (70%), Positives = 704/930 (75%), Gaps = 5/930 (0%) Frame = -3 Query: 2913 SYNMHHNVNASGNSQQSP--AHPGMKSNSPMVVQPPVPPGLSPHAAPSFLYNTSQNVPP- 2743 SY H NVN+S NSQQ +H GM NS VV PP AAPSF YN Q+ PP Sbjct: 36 SYAPHQNVNSSANSQQQQQASHSGMNPNS--VVNPPFHTHTPRPAAPSFSYNFPQSAPPA 93 Query: 2742 FSSNQHLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWMPT 2563 F+ NQH S TNM Q+ SKV SAS LH APAP SIS+M P SDPNYRPTT 
WMPT Sbjct: 94 FTGNQHGQSNTNMPDSVTQDFSKVPSASINLHSAPAPTSISAMAPRSDPNYRPTTLWMPT 153 Query: 2562 AXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIASDP 2383 A +IP +PAAPST T S AV RQNM P ASDP Sbjct: 154 APTFPIHPVMPGTPGTPGPPGLTKPVMIPSNPAAPST-TGFPSAAVPRQNM---PTASDP 209 Query: 2382 TASQKG-LPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVHL 2206 AS +G LPYPPIPSMVAPPQG+WLQPPQMSGVLRPPF QYP ARG L Sbjct: 210 NASHRGGLPYPPIPSMVAPPQGYWLQPPQMSGVLRPPFHQYP--AAFPGPFPFPARGGAL 267 Query: 2205 PAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDK-KLNAVVTQNE 2029 PAVPVPDSQPPGVTPVGA A S+S H RGT G+QTEVISAH DDK KLNA VTQNE Sbjct: 268 PAVPVPDSQPPGVTPVGAASISAPSSSNHLLRGTSGVQTEVISAHTDDKHKLNATVTQNE 327 Query: 2028 DATNDQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLPGT 1849 DA NDQLDAWTAHKTEAG+VYYYNA+T +STY KPAGFKGE+HQVSVQPTP S+VDLPGT Sbjct: 328 DAANDQLDAWTAHKTEAGIVYYYNALTGQSTYDKPAGFKGEAHQVSVQPTPVSMVDLPGT 387 Query: 1848 DWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDRGS 1669 DWQLVSTSDGKKYYYNN+TKTSCWQIPNEVAELKKKQD DV +DH V TNVLS+RGS Sbjct: 388 DWQLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDSDVTKDHPTPVPNTNVLSERGS 447 Query: 1668 GMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQSGS 1489 GMV LNAPAI TGGRDA A KP VQS SALDLIKKKLQESG PV SS+IPTPSVQ GS Sbjct: 448 GMVALNAPAITTGGRDAVASKPFIVQSSPSALDLIKKKLQESGAPVTSSSIPTPSVQPGS 507 Query: 1488 ESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKEML 1309 ESNGSKAT+STAK LQNDNSKDK+KDANGD NV D+GPSKEECINQFKEML Sbjct: 508 ESNGSKATDSTAKSLQNDNSKDKQKDANGDANVSDTSSDSEDEDSGPSKEECINQFKEML 567 Query: 1308 KERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXXXX 1129 KERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVK Sbjct: 568 KERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKNRAEEERKEKRAAQKAAI 627 Query: 1128 EGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXXXX 949 EGFKQLLDEASEDI+ TD HTFRKKWGND RFEA+DRKEREHLLNERVLPLKKAT Sbjct: 628 EGFKQLLDEASEDIDDKTDSHTFRKKWGNDPRFEALDRKEREHLLNERVLPLKKATEEKA 687 Query: 948 
XXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISELK 769 SFKSML+E+G+I NSRWSRVKESLRDDPRYKSVKHEDRE LFNEYISELK Sbjct: 688 QAMRDAAADSFKSMLKEQGEITFNSRWSRVKESLRDDPRYKSVKHEDRELLFNEYISELK 747 Query: 768 AIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALLVE 589 A+EHAAERETRAKR+EQDKL RLKIRRK+AVTSFQALLVE Sbjct: 748 AVEHAAERETRAKREEQDKLRERERELRKRKEREEHEMERVRLKIRRKEAVTSFQALLVE 807 Query: 588 TIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLAEV 409 IKDPMASWTESKPKLEKDPQGRA NSDLD AD EKLFR+H+KMLQER ++FR LLAE Sbjct: 808 RIKDPMASWTESKPKLEKDPQGRATNSDLDSADMEKLFRDHVKMLQERRARDFRALLAEF 867 Query: 408 LTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKSAN 229 LTSEAASQET+D KT LNSWSTAKRL+KSD RY+KVP +RE LWRRY EDM+RR+KS++ Sbjct: 868 LTSEAASQETDDGKTVLNSWSTAKRLIKSDPRYNKVPSEDREALWRRYAEDMIRRQKSSH 927 Query: 228 DSKEEKRTDARSKYSLESSKLPGESRRSLE 139 DSKEEK TDAR + SLESSK P ES RS E Sbjct: 928 DSKEEKHTDARGRKSLESSKNPLESGRSHE 957 >gb|KHN05434.1| Transcription elongation regulator 1, partial [Glycine soja] Length = 996 Score = 1219 bits (3153), Expect = 0.0 Identities = 650/932 (69%), Positives = 712/932 (76%), Gaps = 6/932 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F++ M NVNASG+SQ HP + SNS PMVVQPP G+S HAAPSF YN Q+ Sbjct: 77 FAHGMLQNVNASGSSQLLSTHPAIISNSAVNPMVVQPP---GVSSHAAPSFSYNIPQSGA 133 Query: 2745 PFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSSNQ H S T+ VSK+SSASSI H PA S S MPPPSDPNY P TSWM Sbjct: 134 IFSSNQQHAQSSTD--------VSKLSSASSIPHSVPAHTSTSLMPPPSDPNYCPATSWM 185 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I +PAAPSTGTDSS A LR NM T IAS Sbjct: 186 PTALSFPVHPVMPTQGNPGPPGLASSA-IISSNPAAPSTGTDSSPAAFLRPNMPTPAIAS 244 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPTA QKGLPYP IP++ APPQG WLQPPQMSGVLRPP+LQYP ARGV Sbjct: 245 DPTAPQKGLPYPSIPALAAPPQGLWLQPPQMSGVLRPPYLQYP--APFPGPFPFPARGVA 302 Query: 2208 
LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQPPGVTPVGA G ++ +S HQ RGT LQTEVIS DDKK LN+V T N Sbjct: 303 LPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGSADDKKKLNSVDTLN 362 Query: 2031 EDATN-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 EDA N DQLDAWTAHKTEAG++YYYNAVT ESTY KP+GFKGESHQVS QPTP S++DLP Sbjct: 363 EDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYHKPSGFKGESHQVSAQPTPVSMIDLP 422 Query: 1854 GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW+LVSTSDGKKYYYNN TKTSCWQIPNEVAELKKKQDGDV +DHLMSV TNVLSDR Sbjct: 423 GTDWRLVSTSDGKKYYYNNLTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVPNTNVLSDR 482 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSGMVTLNAPAINTGGRDAAA KPS++Q+ SSALDLIKKKLQ+SGTP+ S+I PSVQ Sbjct: 483 GSGMVTLNAPAINTGGRDAAALKPSTLQNSSSALDLIKKKLQDSGTPITPSSIHAPSVQI 542 Query: 1494 GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 G ESNGSK +STAKG+Q DN+KDK+KD NGD +V DNGPSKEECI QFKE Sbjct: 543 GPESNGSKTVDSTAKGVQVDNNKDKQKDTNGDADVSDTSSDSEDEDNGPSKEECIIQFKE 602 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 603 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKA 662 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EGFK+LLDEASEDIN++TD+ TFRKKWGND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 663 AIEGFKRLLDEASEDINYNTDFQTFRKKWGNDPRFEALDRKEQEHLLNERVLPLKKAAEE 722 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 ASFKSML+ERGD++ NSRW+RVKESLRDDPRYKSV+HEDRE LFNEYISE Sbjct: 723 KAQAMRAAAAASFKSMLKERGDMSFNSRWARVKESLRDDPRYKSVRHEDREVLFNEYISE 782 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA EHAAERET+AKR+EQDKL RLKIRRK+AVTSFQALL Sbjct: 783 LKAAEHAAERETKAKREEQDKLRERERELRKRKEREEQEMERVRLKIRRKEAVTSFQALL 842 Query: 594 VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLA 415 VETIKDP+ASWTESKPKLEKDPQ RA N DLDP+DTEKLFREH+KMLQERC EFR LLA Sbjct: 843 
VETIKDPLASWTESKPKLEKDPQRRATNPDLDPSDTEKLFREHVKMLQERCAHEFRVLLA 902 Query: 414 EVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKS 235 EVLTS+AASQET D KT LNSWSTAKRLLKSD RY+KVPR ERE LWRRY EDMLRR+K+ Sbjct: 903 EVLTSDAASQETNDGKTVLNSWSTAKRLLKSDPRYNKVPRKEREALWRRYAEDMLRRQKA 962 Query: 234 ANDSKEEKRTDARSKYSLESSKLPGESRRSLE 139 + DS+EEK TDA+ + LESSK P ES RS E Sbjct: 963 SYDSREEKHTDAKGRTYLESSKHPLESGRSHE 994 >ref|XP_014493851.1| pre-mRNA-processing protein 40C isoform X1 [Vigna radiata var. radiata] Length = 975 Score = 1180 bits (3053), Expect = 0.0 Identities = 633/932 (67%), Positives = 705/932 (75%), Gaps = 8/932 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F + + N N SG+ QQS H +KSNS P+V QPPVP G+S HAA SF YN Q+ Sbjct: 45 FPHGVLQNANVSGSPQQSSTHNVIKSNSTVNPVVFQPPVP-GVSSHAALSFSYNVPQSGG 103 Query: 2745 PFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSS+Q H S + AQ+V+K+SSA+S H PA S MPP SDPNYRPTTSWM Sbjct: 104 AFSSSQQHTQSSGKISESVAQDVTKLSSAASTPHSVPAHTSTMIMPP-SDPNYRPTTSWM 162 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I + A PSTGTDSSS A+ R NM + IAS Sbjct: 163 PTAMSFPLHPVMPTPGNPGPPGLTSSS-IISINTAVPSTGTDSSSAALPRPNMPISAIAS 221 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPTA KGLPYP +PSM APPQG WLQ PQMSGV RPP+LQYP ARGV Sbjct: 222 DPTAPLKGLPYPSMPSMAAPPQGLWLQAPQMSGVFRPPYLQYP--APFPGPFPFPARGVT 279 Query: 2208 LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQPPGVTPV ++ S +QPRGT LQTE IS DDKK LNAVVTQN Sbjct: 280 LPAVPIPDSQPPGVTPVSGGSGTSTLASSNQPRGTTALQTEAISGPADDKKKLNAVVTQN 339 Query: 2031 EDATN-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 E A N DQL+AWTAHKTEAG++YYYNA+T ESTY KPAGF GE HQVS QPTP S++DLP Sbjct: 340 EGAANNDQLEAWTAHKTEAGIIYYYNALTGESTYDKPAGFIGEPHQVSAQPTPVSMMDLP 399 Query: 1854 GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW VSTSDGKKYYYNN+TKTSCWQIPNEV+ELKKKQDGDVA+D LMSV TNVLSDR Sbjct: 
400 GTDWLSVSTSDGKKYYYNNRTKTSCWQIPNEVSELKKKQDGDVAKDQLMSVPNTNVLSDR 459 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSGMVTLNAPAINTGGRDAAA KPS++QSPSSALDLIKKKLQ+SGTPV SS+IP PSVQ+ Sbjct: 460 GSGMVTLNAPAINTGGRDAAALKPSNLQSPSSALDLIKKKLQDSGTPVTSSSIPVPSVQT 519 Query: 1494 GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 GSESNGSKA EST+KG+Q DNSKDK+KD NG TNV D+GPSKEECI QFKE Sbjct: 520 GSESNGSKAIESTSKGMQADNSKDKQKDTNGATNVSDTSSDSEDEDSGPSKEECIIQFKE 579 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 580 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKA 639 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EG+KQLLDEASEDIN++TDY TFRKKWGND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 640 AIEGYKQLLDEASEDINYNTDYQTFRKKWGNDPRFEALDRKEQEHLLNERVLPLKKAAEE 699 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 ASFKSML+ERGDI+ NSRWSRVKESLRDDPRYKSV+HEDRE LFNEY+SE Sbjct: 700 KTQAMRAAAAASFKSMLKERGDISFNSRWSRVKESLRDDPRYKSVRHEDREGLFNEYLSE 759 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA E+A ERET+AKR+EQDKL RLKIRRK+AVTSFQALL Sbjct: 760 LKAAEYATERETKAKREEQDKLRERERELRKRKEREEQEMERVRLKIRRKEAVTSFQALL 819 Query: 594 VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLA 415 VETIKDP+ASWTESKPKLEKDPQGRA N +LD +DTEKLFREH+KMLQERC EFR LLA Sbjct: 820 VETIKDPLASWTESKPKLEKDPQGRATNPELDSSDTEKLFREHVKMLQERCAHEFRVLLA 879 Query: 414 EVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKS 235 EVLT++AAS E +D KT LNSWSTAKR+LKSD RY+KVPR ERE LWRRY EDMLRR+K+ Sbjct: 880 EVLTTDAASHENDDGKTVLNSWSTAKRVLKSDPRYNKVPRKEREALWRRYAEDMLRRQKA 939 Query: 234 --ANDSKEEKRTDARSKYSLESSKLPGESRRS 145 ++DS+E+K TDA+ + SLESSK ES RS Sbjct: 940 SHSHDSREDKHTDAKGRSSLESSKHQLESGRS 971 >ref|XP_017433133.1| PREDICTED: pre-mRNA-processing protein 40C [Vigna angularis] ref|XP_017433134.1| PREDICTED: pre-mRNA-processing protein 40C [Vigna 
angularis] gb|KOM51044.1| hypothetical protein LR48_Vigan08g187100 [Vigna angularis] dbj|BAT91083.1| hypothetical protein VIGAN_06238900 [Vigna angularis var. angularis] Length = 975 Score = 1179 bits (3051), Expect = 0.0 Identities = 632/932 (67%), Positives = 704/932 (75%), Gaps = 8/932 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F + + N N SG+ Q S H +KSNS P+V QPPVP G+S HAA SF YN Q+ Sbjct: 45 FPHGVLQNANVSGSPQLSSTHNVIKSNSTVNPVVFQPPVP-GVSSHAALSFSYNVPQSGG 103 Query: 2745 PFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSS+Q H S + AQ+V+K+SSA+S H PA S + MPP SDPNYRPTTSWM Sbjct: 104 AFSSSQQHTQSSGKISESVAQDVTKLSSAASTPHSIPAHTSTTIMPP-SDPNYRPTTSWM 162 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I + PSTGTDSSS A+ R NM + IAS Sbjct: 163 PTAMSFPLHPVMPTPGNPGPPGLTSSS-IISINTVVPSTGTDSSSAALPRPNMPISAIAS 221 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPTA KGLPYP +PSM APPQG WLQ PQMSGV RPP+LQYP ARG+ Sbjct: 222 DPTAPLKGLPYPSMPSMAAPPQGLWLQAPQMSGVFRPPYLQYP--APFPGPFPFPARGIT 279 Query: 2208 LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQPPGVTPV G ++ S +Q RGT LQTEVIS DDKK LNAVVTQN Sbjct: 280 LPAVPIPDSQPPGVTPVSGGGGTSTPASSNQLRGTTALQTEVISGPADDKKKLNAVVTQN 339 Query: 2031 EDATN-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 EDA N DQL+AWTAHKTEAG++YYYNA+T ESTY KPAGF GE HQVS QPTP S++DLP Sbjct: 340 EDAANNDQLEAWTAHKTEAGIIYYYNALTGESTYDKPAGFIGEPHQVSAQPTPVSMMDLP 399 Query: 1854 GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW VSTSDGKKYYYNN+TKTSCWQIPNEV+ELKKKQDGDV +D LMSV TNVLSDR Sbjct: 400 GTDWLSVSTSDGKKYYYNNRTKTSCWQIPNEVSELKKKQDGDVTKDQLMSVPNTNVLSDR 459 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSGMVTLNAPAINTGGRDAAA KPS++QSPSSALDLIKKKLQ+SGTP+ SS+IP PSVQ+ Sbjct: 460 GSGMVTLNAPAINTGGRDAAALKPSNLQSPSSALDLIKKKLQDSGTPITSSSIPVPSVQT 519 Query: 1494 
GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 GSESNGSKA EST+KG+Q DNSKDK+KD NG NV D+GPSKEECI QFKE Sbjct: 520 GSESNGSKAVESTSKGMQADNSKDKQKDTNGAANVSDTSSDSEDEDSGPSKEECIIQFKE 579 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 580 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKA 639 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EGFKQLLDEA EDIN++TDY TFRKKWGND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 640 AIEGFKQLLDEALEDINYNTDYQTFRKKWGNDPRFEALDRKEQEHLLNERVLPLKKAAEE 699 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 ASFKSML+ERGDI+ NSRWSRVKESLRDDPRYKSV+HEDRE LFNEYISE Sbjct: 700 KTQAMRAAAAASFKSMLKERGDISFNSRWSRVKESLRDDPRYKSVRHEDREGLFNEYISE 759 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA E+A ERET+AKR+EQDKL RLKIRRK+AVTSFQALL Sbjct: 760 LKAAEYATERETKAKREEQDKLRERERELRKRKEREEQEMERVRLKIRRKEAVTSFQALL 819 Query: 594 VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLA 415 VETIKDP+ASWTESKPKLEKDPQGRA N +LD +DTEKLFREH+KMLQERC EFR LLA Sbjct: 820 VETIKDPLASWTESKPKLEKDPQGRATNPELDSSDTEKLFREHVKMLQERCAHEFRVLLA 879 Query: 414 EVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKS 235 EVLT++AAS E ED KT LNSWSTAKR+LKSD RY+KVPR ERE LWRRY EDMLRR+K+ Sbjct: 880 EVLTTDAASHENEDGKTVLNSWSTAKRVLKSDPRYNKVPRKEREALWRRYAEDMLRRQKA 939 Query: 234 --ANDSKEEKRTDARSKYSLESSKLPGESRRS 145 ++DS+E+K TDA+ + SLESSK P ES RS Sbjct: 940 SHSHDSREDKHTDAKGRNSLESSKHPLESGRS 971 >gb|KYP68375.1| Transcription elongation regulator 1 [Cajanus cajan] Length = 918 Score = 1175 bits (3040), Expect = 0.0 Identities = 627/922 (68%), Positives = 685/922 (74%), Gaps = 6/922 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F+Y + H+VNAS +SQ S HP MKSNS PM VQP VP G+S HAAPSF YN Q+ Sbjct: 45 FAYGVLHSVNASVSSQHSSTHPAMKSNSSANPMAVQPQVP-GVSSHAAPSFSYNIPQSGA 103 Query: 2745 
PFSSN-QHLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSSN H S TNM AQ+VSK+SSASSI H PA S S MPPPSDPNYRP TSWM Sbjct: 104 SFSSNLHHAQSNTNMSDSAAQDVSKLSSASSIPHSVPAHTSTSIMPPPSDPNYRPATSWM 163 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I +PAAPSTGTDSS A+LR NM T+ IAS Sbjct: 164 PTALPFPVHPLMPTPGNPGPPGLASSA-IISSNPAAPSTGTDSSPAALLRPNMPTSAIAS 222 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPTA QKGLPYP +P+M APPQG WLQPPQMSGVLRPP+LQYP ARGV Sbjct: 223 DPTAPQKGLPYPSMPAMAAPPQGVWLQPPQMSGVLRPPYLQYP--APFPAPFPFPARGVA 280 Query: 2208 LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQPPGVTPVGAVG ++ S HQ RGT LQTEVIS DDKK LNAV TQN Sbjct: 281 LPAVPIPDSQPPGVTPVGAVGGTSTLVSGHQLRGTIALQTEVISGPADDKKKLNAVETQN 340 Query: 2031 EDA-TNDQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 +DA +NDQLDAWTAHKTEAG++YYYNAVT ESTY KPAGFKGE HQVS QPTP S+ DLP Sbjct: 341 QDAASNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGEPHQVSAQPTPVSMTDLP 400 Query: 1854 GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW LVSTSDGKKYYYNN+TKTSCWQIPNEV ELKKKQDGDV +DHLMSV TNVLSDR Sbjct: 401 GTDWMLVSTSDGKKYYYNNRTKTSCWQIPNEVTELKKKQDGDVTKDHLMSVPNTNVLSDR 460 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSG+VTLNAPAINTGGRDAAA K SS Q+ SSALDLIKKKLQ+SGTPVASSAIP PSVQ+ Sbjct: 461 GSGLVTLNAPAINTGGRDAAALKSSSQQTSSSALDLIKKKLQDSGTPVASSAIPAPSVQT 520 Query: 1494 GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 GSESNGSK EST KGLQ D +KDK+KD NGD N+ D+GPSKEECI QFKE Sbjct: 521 GSESNGSKIVESTTKGLQVDTNKDKQKDTNGDANISDTSSDSEEEDSGPSKEECIIQFKE 580 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 581 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQRA 640 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EGFKQLLDEA EDINH+TDY TFRKKWGND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 641 
AIEGFKQLLDEALEDINHNTDYQTFRKKWGNDPRFEALDRKEQEHLLNERVLPLKKAAEE 700 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 A FKSML+ERGDI NSRWSRVKESLRDDPRYKSV+HEDRE LFNEYISE Sbjct: 701 KAQAMRAAAAAGFKSMLKERGDIIFNSRWSRVKESLRDDPRYKSVRHEDREVLFNEYISE 760 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA E+AAERET+AKR+EQ Sbjct: 761 LKAAEYAAERETKAKREEQ----------------------------------------- 779 Query: 594 VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLA 415 ASWTESKPKLEKDPQGRA NSDLDP DTEKLFREH+KMLQERC EFR LLA Sbjct: 780 --------ASWTESKPKLEKDPQGRATNSDLDPTDTEKLFREHVKMLQERCAHEFRVLLA 831 Query: 414 EVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKS 235 EVLTSEAAS+E++D KT LNSWSTAKR+LKSD RY+KVPR ERE LWRRY EDMLRR+K+ Sbjct: 832 EVLTSEAASRESDDGKTVLNSWSTAKRVLKSDPRYNKVPRKEREALWRRYAEDMLRRQKA 891 Query: 234 ANDSKEEKRTDARSKYSLESSK 169 ++DS+E+K D++ + SLES + Sbjct: 892 SHDSREDKHKDSKPRNSLESGR 913 >ref|XP_015950130.1| pre-mRNA-processing protein 40C isoform X1 [Arachis duranensis] ref|XP_015950131.1| pre-mRNA-processing protein 40C isoform X1 [Arachis duranensis] Length = 955 Score = 1174 bits (3036), Expect = 0.0 Identities = 636/931 (68%), Positives = 691/931 (74%), Gaps = 5/931 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 FSY MH NVNASGN QQS HPGMKSN+ P VQPPVP GL PHAAPSF YN Q+ P Sbjct: 37 FSYGMHQNVNASGNPQQS-IHPGMKSNTTMMPTAVQPPVP-GLPPHAAPSFSYNIWQSGP 94 Query: 2745 PFSSNQHLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWMP 2566 FSSNQ S TN P Q+VSK SS SS+ H PA SI MPPPSDPN+RPTTSWMP Sbjct: 95 AFSSNQLTQSNTNKSDPVVQDVSKASSVSSVPHSVPAHTSI--MPPPSDPNFRPTTSWMP 152 Query: 2565 TAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIASD 2386 T + A PST SSS A LR NM A IASD Sbjct: 153 TPLSFPGHPVMPGAPGNPAPPGLTSSIISTNLAAPPSTV--SSSAAPLRPNMPAAAIASD 210 Query: 2385 PTASQKGLPYPPIPSMVAPP-QGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 PT +QKG PY +P MVAPP QGFWLQPPQMSG+LRPPFLQYP RGV+ Sbjct: 211 
PTLTQKGTPYASMPRMVAPPPQGFWLQPPQMSGILRPPFLQYP--AAFPGPFPYPVRGVN 268 Query: 2208 LPAVPVPDSQPPGVTPVGAVG-TFASSTSIHQPRGTGGLQTEVISAHPDDKKLNAVVTQN 2032 PAV +PDSQPPGVTPV A T A S S +Q R LQT++IS DDKK N VTQN Sbjct: 269 PPAVTLPDSQPPGVTPVNATAATSAPSASDNQLRQGTDLQTDLISGPADDKKSN--VTQN 326 Query: 2031 EDATNDQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLPG 1852 A N++LDAWTAHKTE GVVYYYNA+T ESTY KP GFKGE HQ++VQPTP S+VD+PG Sbjct: 327 VGAANEKLDAWTAHKTETGVVYYYNALTGESTYDKPVGFKGEPHQIAVQPTPVSMVDIPG 386 Query: 1851 TDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDRG 1672 TDW LVSTSDGKKYYYN QTKTS W++PNEVAELKKKQDGDV +DH MSV TNVLSDRG Sbjct: 387 TDWMLVSTSDGKKYYYNKQTKTSSWEVPNEVAELKKKQDGDVTKDHSMSVPNTNVLSDRG 446 Query: 1671 SGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQSG 1492 SGMVTLN PAINTGGRDAAA KPSSVQS SSALDLIKKKLQESG PVASS++P VQ+G Sbjct: 447 SGMVTLNTPAINTGGRDAAALKPSSVQSSSSALDLIKKKLQESGMPVASSSVPVSPVQTG 506 Query: 1491 SESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKEM 1312 SESNGSKA ES AKGLQNDN KDK+KDANGD N D+GPSKEECI QFKEM Sbjct: 507 SESNGSKAAESAAKGLQNDN-KDKQKDANGDANTSDTSSDSEDEDSGPSKEECIIQFKEM 565 Query: 1311 LKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXXX 1132 LKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 566 LKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAA 625 Query: 1131 XEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXXX 952 EGFKQLLDEASEDI+H+TDY TFRKKWGND RFEA+DRKER+HLL+ERVLPLKKA Sbjct: 626 IEGFKQLLDEASEDIDHNTDYQTFRKKWGNDPRFEALDRKERQHLLSERVLPLKKAAEQK 685 Query: 951 XXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISEL 772 SFKSML+ERGDI++NSRWSRVKESLRDDPRYKSV+HEDRE LFNEYISEL Sbjct: 686 AQASRIAAATSFKSMLKERGDISINSRWSRVKESLRDDPRYKSVRHEDRELLFNEYISEL 745 Query: 771 KAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALLV 592 KA EHAAERE +AKR+EQDKL R+KIRRK+A+TSFQALLV Sbjct: 746 KATEHAAERENKAKREEQDKLRERERELRKRKEREEQEMERVRVKIRRKEAITSFQALLV 805 Query: 591 ETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLAE 412 
ETIKDP+ASWTESK KLEKDPQGRA N DLDPADTEKLFREHIKMLQERC +FRTLLAE Sbjct: 806 ETIKDPLASWTESKHKLEKDPQGRATNPDLDPADTEKLFREHIKMLQERCAHDFRTLLAE 865 Query: 411 VLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKSA 232 VLT EAAS E ED KT LNSWSTAKRLLKSD RY+KVPR +RE LWRRY ED+ RR+KS Sbjct: 866 VLTLEAASHEGEDGKTVLNSWSTAKRLLKSDPRYNKVPRKDREPLWRRYTEDVQRRQKS- 924 Query: 231 NDSKEEKRTDARSKYSLESSKLPGESRRSLE 139 S+EEK D + + +LES KL E+ RS E Sbjct: 925 --SQEEKNADTKGRNTLESIKLALEAGRSHE 953 >ref|XP_007131663.1| hypothetical protein PHAVU_011G031500g [Phaseolus vulgaris] gb|ESW03657.1| hypothetical protein PHAVU_011G031500g [Phaseolus vulgaris] Length = 977 Score = 1167 bits (3019), Expect = 0.0 Identities = 625/932 (67%), Positives = 701/932 (75%), Gaps = 8/932 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F Y + N NASG+SQQS AH +KSNS P+V QPPVP G+S HAA SF YN + Sbjct: 47 FPYGVLQNANASGSSQQSSAHNVIKSNSIVNPVVFQPPVP-GVSSHAALSFSYNIPPSGA 105 Query: 2745 PFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 F SNQ + S + + AQ+V+K+SSASS H PA S MPP SDPNYRPTTSWM Sbjct: 106 AFPSNQQNTQSSSEISDSVAQDVTKLSSASSTPHSVPAHTSTPIMPP-SDPNYRPTTSWM 164 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I +PA PSTGTDSSS A+LR NM + IAS Sbjct: 165 PTAMSLPVHPVMPTPGNPGPPGLASSS-MISINPAVPSTGTDSSSAALLRPNMPISAIAS 223 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPT KGLPYP +PSM APPQG WLQ PQMSGV RPP+LQYP ARGV Sbjct: 224 DPTNPLKGLPYPSMPSMAAPPQGLWLQTPQMSGVFRPPYLQYP--APFPGPFPFPARGVT 281 Query: 2208 LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQP GVTPV + S S +Q RGT LQTEVIS DDKK LNAV+ N Sbjct: 282 LPAVPIPDSQPRGVTPVSGGSSTFSPASSNQLRGTTALQTEVISGPADDKKKLNAVIAPN 341 Query: 2031 EDATN-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 ED +N DQL+AWTAHKTEAG++YYYNA+T ESTY KPAGF GESHQVS QPTP S+ DLP Sbjct: 342 EDTSNNDQLEAWTAHKTEAGIIYYYNAMTGESTYDKPAGFIGESHQVSAQPTPVSMTDLP 401 Query: 1854 
GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW LVSTSDGKKYYYNN+TKTSCWQIPNEVAELKKKQDGDV +D LMSV NVLSDR Sbjct: 402 GTDWLLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDQLMSVPNNNVLSDR 461 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSGMVTLNAPAINTGGRDAAA KPS++Q+ SSALDLIKKKLQ+SGTPV SS+IP PSVQ+ Sbjct: 462 GSGMVTLNAPAINTGGRDAAALKPSNLQNSSSALDLIKKKLQDSGTPVTSSSIPAPSVQT 521 Query: 1494 GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 GSESNGSKA EST+KG+Q DNSKDK+KD+NG NV D+GPSKEECI QFKE Sbjct: 522 GSESNGSKAVESTSKGMQADNSKDKQKDSNGAANVSDTSSDSEDEDSGPSKEECIIQFKE 581 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 582 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKA 641 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EGFKQLLDEASEDIN++TDY +FRKKW ND RFEA+DRKE+EHLLN+RV PLKKA Sbjct: 642 AIEGFKQLLDEASEDINYNTDYQSFRKKWANDPRFEALDRKEQEHLLNDRVFPLKKAAEE 701 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 ASFKSML++RGDI+ NSRWSRVKESLRDDPRYKSV+HEDRE LFNEY+SE Sbjct: 702 KTQAMRAAAAASFKSMLKDRGDISFNSRWSRVKESLRDDPRYKSVRHEDREVLFNEYLSE 761 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA E+AAERET+AKR+EQDKL RLKIRRK+AVTSFQALL Sbjct: 762 LKAAEYAAERETKAKREEQDKLRERERELRKRKEREEQEMERVRLKIRRKEAVTSFQALL 821 Query: 594 VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLA 415 VE IKDP+ASWTESKPKLEKDPQGRA N +LD +DTEKLFREH+KMLQERC EFR L+A Sbjct: 822 VEIIKDPLASWTESKPKLEKDPQGRATNPELDSSDTEKLFREHVKMLQERCAHEFRVLIA 881 Query: 414 EVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKS 235 +VLTS+AAS E +D KT LNSWSTAKR+LKSD RY+KVPR ERE LWRRY EDMLRR+K+ Sbjct: 882 DVLTSDAASHENDDGKTVLNSWSTAKRVLKSDPRYNKVPRKEREALWRRYAEDMLRRQKA 941 Query: 234 --ANDSKEEKRTDARSKYSLESSKLPGESRRS 145 ++DS+E+K +D R + LESSK P +S RS Sbjct: 942 SHSHDSREDKHSDGRGRNPLESSKYPLQSGRS 973 >ref|XP_020991278.1| 
pre-mRNA-processing protein 40C isoform X2 [Arachis duranensis] Length = 947 Score = 1162 bits (3005), Expect = 0.0 Identities = 633/931 (67%), Positives = 688/931 (73%), Gaps = 5/931 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 FSY MH NVNASGN QQS HPGMKSN+ P VQPPVP GL PHAAPSF YN Q+ P Sbjct: 37 FSYGMHQNVNASGNPQQS-IHPGMKSNTTMMPTAVQPPVP-GLPPHAAPSFSYNIWQSGP 94 Query: 2745 PFSSNQHLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWMP 2566 FSSNQ S T+ VSK SS SS+ H PA SI MPPPSDPN+RPTTSWMP Sbjct: 95 AFSSNQLTQSNTD--------VSKASSVSSVPHSVPAHTSI--MPPPSDPNFRPTTSWMP 144 Query: 2565 TAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIASD 2386 T + A PST SSS A LR NM A IASD Sbjct: 145 TPLSFPGHPVMPGAPGNPAPPGLTSSIISTNLAAPPSTV--SSSAAPLRPNMPAAAIASD 202 Query: 2385 PTASQKGLPYPPIPSMVAPP-QGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 PT +QKG PY +P MVAPP QGFWLQPPQMSG+LRPPFLQYP RGV+ Sbjct: 203 PTLTQKGTPYASMPRMVAPPPQGFWLQPPQMSGILRPPFLQYP--AAFPGPFPYPVRGVN 260 Query: 2208 LPAVPVPDSQPPGVTPVGAVG-TFASSTSIHQPRGTGGLQTEVISAHPDDKKLNAVVTQN 2032 PAV +PDSQPPGVTPV A T A S S +Q R LQT++IS DDKK N VTQN Sbjct: 261 PPAVTLPDSQPPGVTPVNATAATSAPSASDNQLRQGTDLQTDLISGPADDKKSN--VTQN 318 Query: 2031 EDATNDQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLPG 1852 A N++LDAWTAHKTE GVVYYYNA+T ESTY KP GFKGE HQ++VQPTP S+VD+PG Sbjct: 319 VGAANEKLDAWTAHKTETGVVYYYNALTGESTYDKPVGFKGEPHQIAVQPTPVSMVDIPG 378 Query: 1851 TDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDRG 1672 TDW LVSTSDGKKYYYN QTKTS W++PNEVAELKKKQDGDV +DH MSV TNVLSDRG Sbjct: 379 TDWMLVSTSDGKKYYYNKQTKTSSWEVPNEVAELKKKQDGDVTKDHSMSVPNTNVLSDRG 438 Query: 1671 SGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQSG 1492 SGMVTLN PAINTGGRDAAA KPSSVQS SSALDLIKKKLQESG PVASS++P VQ+G Sbjct: 439 SGMVTLNTPAINTGGRDAAALKPSSVQSSSSALDLIKKKLQESGMPVASSSVPVSPVQTG 498 Query: 1491 SESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKEM 1312 SESNGSKA ES AKGLQNDN KDK+KDANGD N D+GPSKEECI QFKEM Sbjct: 499 
SESNGSKAAESAAKGLQNDN-KDKQKDANGDANTSDTSSDSEDEDSGPSKEECIIQFKEM 557 Query: 1311 LKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXXX 1132 LKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 558 LKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAA 617 Query: 1131 XEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXXX 952 EGFKQLLDEASEDI+H+TDY TFRKKWGND RFEA+DRKER+HLL+ERVLPLKKA Sbjct: 618 IEGFKQLLDEASEDIDHNTDYQTFRKKWGNDPRFEALDRKERQHLLSERVLPLKKAAEQK 677 Query: 951 XXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISEL 772 SFKSML+ERGDI++NSRWSRVKESLRDDPRYKSV+HEDRE LFNEYISEL Sbjct: 678 AQASRIAAATSFKSMLKERGDISINSRWSRVKESLRDDPRYKSVRHEDRELLFNEYISEL 737 Query: 771 KAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALLV 592 KA EHAAERE +AKR+EQDKL R+KIRRK+A+TSFQALLV Sbjct: 738 KATEHAAERENKAKREEQDKLRERERELRKRKEREEQEMERVRVKIRRKEAITSFQALLV 797 Query: 591 ETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLAE 412 ETIKDP+ASWTESK KLEKDPQGRA N DLDPADTEKLFREHIKMLQERC +FRTLLAE Sbjct: 798 ETIKDPLASWTESKHKLEKDPQGRATNPDLDPADTEKLFREHIKMLQERCAHDFRTLLAE 857 Query: 411 VLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKSA 232 VLT EAAS E ED KT LNSWSTAKRLLKSD RY+KVPR +RE LWRRY ED+ RR+KS Sbjct: 858 VLTLEAASHEGEDGKTVLNSWSTAKRLLKSDPRYNKVPRKDREPLWRRYTEDVQRRQKS- 916 Query: 231 NDSKEEKRTDARSKYSLESSKLPGESRRSLE 139 S+EEK D + + +LES KL E+ RS E Sbjct: 917 --SQEEKNADTKGRNTLESIKLALEAGRSHE 945 >ref|XP_003540642.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X1 [Glycine max] gb|KRH24206.1| hypothetical protein GLYMA_12G028100 [Glycine max] Length = 930 Score = 1158 bits (2995), Expect = 0.0 Identities = 629/933 (67%), Positives = 691/933 (74%), Gaps = 7/933 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F++ M NVNASG+SQ HP + SNS PMVVQPP G+S HAAPSF YN Q+ Sbjct: 45 FAHGMLQNVNASGSSQLLSTHPAIISNSAVNPMVVQPP---GVSSHAAPSFSYNIPQSGA 101 Query: 2745 PFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSSNQ H 
S T+ VSK+SSASSI H PA S S MPPPSDPNY P TSWM Sbjct: 102 IFSSNQQHAQSSTD--------VSKLSSASSIPHSVPAHTSTSLMPPPSDPNYCPATSWM 153 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA P P P+ G +A I+S Sbjct: 154 PTALS------------------------FPVHPVMPTQGNPGPPGLAS-----SAIISS 184 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 +P A P IP++ APPQG WLQPPQMSGVLRPP+LQYP ARGV Sbjct: 185 NPAA-------PSIPALAAPPQGLWLQPPQMSGVLRPPYLQYP--APFPGPFPFPARGVA 235 Query: 2208 LPAVPVPDSQPPGVTPVGAVG-TFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQ 2035 LPAVP+PDSQPPGVTPVGA G T S S +Q RGT LQTEVIS DDKK LN+V T Sbjct: 236 LPAVPIPDSQPPGVTPVGAAGGTPTPSASSYQLRGTTALQTEVISGSADDKKKLNSVDTL 295 Query: 2034 NEDATN-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDL 1858 NEDA N DQLDAWTAHKTEAG++YYYNAVT ESTY KP+GFKGESHQVS QPTP S++DL Sbjct: 296 NEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYHKPSGFKGESHQVSAQPTPVSMIDL 355 Query: 1857 PGTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSD 1678 PGTDW+LVSTSDGKKYYYNN TKTSCWQIPNEVAELKKKQDGDV +DHLMSV TNVLSD Sbjct: 356 PGTDWRLVSTSDGKKYYYNNLTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVPNTNVLSD 415 Query: 1677 RGSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQ 1498 RGSGMVTLNAPAINTGGRDAAA KPS++Q+ SSALDLIKKKLQ+SGTP+ S+I PSVQ Sbjct: 416 RGSGMVTLNAPAINTGGRDAAALKPSTLQNSSSALDLIKKKLQDSGTPITPSSIHAPSVQ 475 Query: 1497 SGSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFK 1318 G ESNGSK +STAKG+Q DN+KDK+KD NGD +V DNGPSKEECI QFK Sbjct: 476 IGPESNGSKTVDSTAKGVQVDNNKDKQKDTNGDADVSDTSSDSEDEDNGPSKEECIIQFK 535 Query: 1317 EMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXX 1138 EMLKERGVAPFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 536 EMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQK 595 Query: 1137 XXXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATX 958 EGFK+LLDEASEDIN++TD+ TFRKKWGND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 596 AAIEGFKRLLDEASEDINYNTDFQTFRKKWGNDPRFEALDRKEQEHLLNERVLPLKKAAE 655 Query: 957 
XXXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYIS 778 ASFKSML+ERGD++ NSRW+RVKESLRDDPRYKSV+HEDRE LFNEYIS Sbjct: 656 EKAQAMRAAAAASFKSMLKERGDMSFNSRWARVKESLRDDPRYKSVRHEDREVLFNEYIS 715 Query: 777 ELKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQAL 598 ELKA EHAAERET+AKR+EQDKL RLKIRRK+AVTSFQAL Sbjct: 716 ELKAAEHAAERETKAKREEQDKLRERERELRKRKEREEQEMERVRLKIRRKEAVTSFQAL 775 Query: 597 LVETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLL 418 LVETIKDP+ASWTESKPKLEKDPQ RA N DLDP+DTEKLFREH+KMLQERC EFR LL Sbjct: 776 LVETIKDPLASWTESKPKLEKDPQRRATNPDLDPSDTEKLFREHVKMLQERCAHEFRVLL 835 Query: 417 AEVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKK 238 AEVLTS+AASQET D KT LNSWSTAKRLLKSD RY+KVPR ERE LWRRY EDMLRR+K Sbjct: 836 AEVLTSDAASQETNDGKTVLNSWSTAKRLLKSDPRYNKVPRKEREALWRRYAEDMLRRQK 895 Query: 237 SANDSKEEKRTDARSKYSLESSKLPGESRRSLE 139 ++ DS+EEK TDA+ + LESSK P ES RS E Sbjct: 896 ASYDSREEKHTDAKGRTYLESSKHPLESGRSHE 928 >ref|XP_006592053.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X2 [Glycine max] Length = 854 Score = 1132 bits (2929), Expect = 0.0 Identities = 613/901 (68%), Positives = 671/901 (74%), Gaps = 4/901 (0%) Frame = -3 Query: 2829 MVVQPPVPPGLSPHAAPSFLYNTSQNVPPFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSI 2653 MVVQPP G+S HAAPSF YN Q+ FSSNQ H S T+ VSK+SSASSI Sbjct: 1 MVVQPP---GVSSHAAPSFSYNIPQSGAIFSSNQQHAQSSTD--------VSKLSSASSI 49 Query: 2652 LHPAPAPNSISSMPPPSDPNYRPTTSWMPTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPC 2473 H PA S S MPPPSDPNY P TSWMPTA P Sbjct: 50 PHSVPAHTSTSLMPPPSDPNYCPATSWMPTALS------------------------FPV 85 Query: 2472 SPAAPSTGTDSSSTAVLRQNMLTAPIASDPTASQKGLPYPPIPSMVAPPQGFWLQPPQMS 2293 P P+ G +A I+S+P A P IP++ APPQG WLQPPQMS Sbjct: 86 HPVMPTQGNPGPPGLAS-----SAIISSNPAA-------PSIPALAAPPQGLWLQPPQMS 133 Query: 2292 GVLRPPFLQYPHXXXXXXXXXXXARGVHLPAVPVPDSQPPGVTPVGAVG-TFASSTSIHQ 2116 GVLRPP+LQYP ARGV LPAVP+PDSQPPGVTPVGA G T S S +Q Sbjct: 134 GVLRPPYLQYP--APFPGPFPFPARGVALPAVPIPDSQPPGVTPVGAAGGTPTPSASSYQ 191 Query: 2115 
PRGTGGLQTEVISAHPDDKK-LNAVVTQNEDATN-DQLDAWTAHKTEAGVVYYYNAVTRE 1942 RGT LQTEVIS DDKK LN+V T NEDA N DQLDAWTAHKTEAG++YYYNAVT E Sbjct: 192 LRGTTALQTEVISGSADDKKKLNSVDTLNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGE 251 Query: 1941 STYAKPAGFKGESHQVSVQPTPASVVDLPGTDWQLVSTSDGKKYYYNNQTKTSCWQIPNE 1762 STY KP+GFKGESHQVS QPTP S++DLPGTDW+LVSTSDGKKYYYNN TKTSCWQIPNE Sbjct: 252 STYHKPSGFKGESHQVSAQPTPVSMIDLPGTDWRLVSTSDGKKYYYNNLTKTSCWQIPNE 311 Query: 1761 VAELKKKQDGDVARDHLMSVSKTNVLSDRGSGMVTLNAPAINTGGRDAAAPKPSSVQSPS 1582 VAELKKKQDGDV +DHLMSV TNVLSDRGSGMVTLNAPAINTGGRDAAA KPS++Q+ S Sbjct: 312 VAELKKKQDGDVTKDHLMSVPNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSTLQNSS 371 Query: 1581 SALDLIKKKLQESGTPVASSAIPTPSVQSGSESNGSKATESTAKGLQNDNSKDKEKDANG 1402 SALDLIKKKLQ+SGTP+ S+I PSVQ G ESNGSK +STAKG+Q DN+KDK+KD NG Sbjct: 372 SALDLIKKKLQDSGTPITPSSIHAPSVQIGPESNGSKTVDSTAKGVQVDNNKDKQKDTNG 431 Query: 1401 DTNVXXXXXXXXXXDNGPSKEECINQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPS 1222 D +V DNGPSKEECI QFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPS Sbjct: 432 DADVSDTSSDSEDEDNGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPS 491 Query: 1221 HSARRSLFEHYVKTXXXXXXXXXXXXXXXXXEGFKQLLDEASEDINHDTDYHTFRKKWGN 1042 +SARRSLFEHYVKT EGFK+LLDEASEDIN++TD+ TFRKKWGN Sbjct: 492 YSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKRLLDEASEDINYNTDFQTFRKKWGN 551 Query: 1041 DLRFEAVDRKEREHLLNERVLPLKKATXXXXXXXXXXXXASFKSMLEERGDIALNSRWSR 862 D RFEA+DRKE+EHLLNERVLPLKKA ASFKSML+ERGD++ NSRW+R Sbjct: 552 DPRFEALDRKEQEHLLNERVLPLKKAAEEKAQAMRAAAAASFKSMLKERGDMSFNSRWAR 611 Query: 861 VKESLRDDPRYKSVKHEDREFLFNEYISELKAIEHAAERETRAKRDEQDKLXXXXXXXXX 682 VKESLRDDPRYKSV+HEDRE LFNEYISELKA EHAAERET+AKR+EQDKL Sbjct: 612 VKESLRDDPRYKSVRHEDREVLFNEYISELKAAEHAAERETKAKREEQDKLRERERELRK 671 Query: 681 XXXXXXXXXXXXRLKIRRKDAVTSFQALLVETIKDPMASWTESKPKLEKDPQGRAANSDL 502 RLKIRRK+AVTSFQALLVETIKDP+ASWTESKPKLEKDPQ RA N DL Sbjct: 672 RKEREEQEMERVRLKIRRKEAVTSFQALLVETIKDPLASWTESKPKLEKDPQRRATNPDL 731 Query: 501 DPADTEKLFREHIKMLQERCVQEFRTLLAEVLTSEAASQETEDEKTPLNSWSTAKRLLKS 322 DP+DTEKLFREH+KMLQERC EFR LLAEVLTS+AASQET D KT LNSWSTAKRLLKS Sbjct: 732 
DPSDTEKLFREHVKMLQERCAHEFRVLLAEVLTSDAASQETNDGKTVLNSWSTAKRLLKS 791 Query: 321 DLRYSKVPRNEREGLWRRYVEDMLRRKKSANDSKEEKRTDARSKYSLESSKLPGESRRSL 142 D RY+KVPR ERE LWRRY EDMLRR+K++ DS+EEK TDA+ + LESSK P ES RS Sbjct: 792 DPRYNKVPRKEREALWRRYAEDMLRRQKASYDSREEKHTDAKGRTYLESSKHPLESGRSH 851 Query: 141 E 139 E Sbjct: 852 E 852 >ref|XP_019413264.1| PREDICTED: pre-mRNA-processing protein 40C [Lupinus angustifolius] Length = 930 Score = 1124 bits (2907), Expect = 0.0 Identities = 608/911 (66%), Positives = 671/911 (73%), Gaps = 2/911 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNSPMVVQPPVPPGLSPHAAPSFLYNTSQNVP-PF 2740 FSYN N Q+ + S++ VVQ P+P +S H APSF +N Q F Sbjct: 23 FSYN--------NNMPQNVINTSHHSSTYSVVQHPLP-AMSSHPAPSFSFNIPQTGHHAF 73 Query: 2739 SSNQHLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWMPTA 2560 S+N H S NM AQ+VSKVSSASSI + P +S +M P DPNYRP TSWMP+A Sbjct: 74 STNHHPQSTMNMSDSVAQDVSKVSSASSIPNSVPPHSSNPTMRPTYDPNYRPPTSWMPSA 133 Query: 2559 XXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIASDPT 2380 VI + +APST +DSSS AV RQNM A IASDPT Sbjct: 134 PSFPMHHVIPGTPGNPAPPGLTPALVISSNLSAPSTSSDSSSAAVPRQNMPNAAIASDPT 193 Query: 2379 ASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVHLPA 2200 QKG PYP IP M A PQG WL PPQ+SGVLRPPFL YP ARGV LPA Sbjct: 194 LQQKGTPYPSIPVMAASPQGLWLPPPQISGVLRPPFLPYP--AAFPGPFPFPARGVTLPA 251 Query: 2199 VPVPDSQPPGVTPVGAVG-TFASSTSIHQPRGTGGLQTEVISAHPDDKKLNAVVTQNEDA 2023 VPVPDSQPPGVTP+ A G T AS S HQ RGT G QTEVI H D KK+ V QNED Sbjct: 252 VPVPDSQPPGVTPMTAAGVTSASPASSHQLRGTTGFQTEVIPGHADYKKI-LNVAQNEDP 310 Query: 2022 TNDQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLPGTDW 1843 ND LDAWTAHKTEAG+VYYYNA T ESTY KPAGFKGE HQV+VQPTP S+V+LPGTDW Sbjct: 311 ANDHLDAWTAHKTEAGIVYYYNASTGESTYDKPAGFKGEPHQVAVQPTPVSMVNLPGTDW 370 Query: 1842 QLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDRGSGM 1663 LVSTSDGKKYYYNN+TKTSCWQIPNEVAELKKKQDGDV +DHLMSV TNVLSD+G GM Sbjct: 371 VLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVPNTNVLSDKGPGM 430 Query: 1662 
VTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQSGSES 1483 VTLNAPAINTGGRDAAA KPSSVQS SSALDLIKKKLQ+SGTPV SS+IP SV GSES Sbjct: 431 VTLNAPAINTGGRDAAALKPSSVQSSSSALDLIKKKLQDSGTPVTSSSIPNSSV--GSES 488 Query: 1482 NGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKEMLKE 1303 NGSKA ESTAKGLQ DN+KDK+KD NGD NV D+GPSKEECI QFKEMLKE Sbjct: 489 NGSKAAESTAKGLQIDNNKDKQKDTNGDANVSDTSSDSEDEDSGPSKEECILQFKEMLKE 548 Query: 1302 RGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXXXXEG 1123 RGVAPFSKW+KELPKIVFDPRFKAIPS+SARRSLFEHYVKT EG Sbjct: 549 RGVAPFSKWDKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEG 608 Query: 1122 FKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXXXXXX 943 FK+LLDEA EDINH+TDY TFR+KWG+D RF+A+DRKEREHLL+ERVLPLKKA Sbjct: 609 FKKLLDEALEDINHNTDYQTFREKWGDDPRFKALDRKEREHLLSERVLPLKKAAEEKAQA 668 Query: 942 XXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISELKAI 763 +SFKS+++E+GDI NSRWSRVKESLRDDPRYKSV+HEDRE LFNEYISELKA Sbjct: 669 MREAATSSFKSLIKEKGDITFNSRWSRVKESLRDDPRYKSVRHEDREILFNEYISELKAA 728 Query: 762 EHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALLVETI 583 EHAAERETRAKR+EQ+KL RLKIRRK+AV SFQALLVETI Sbjct: 729 EHAAERETRAKREEQEKLRERERELRKRKEREEHEMERVRLKIRRKEAVASFQALLVETI 788 Query: 582 KDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQERCVQEFRTLLAEVLT 403 KDP+ASWTESK KLEKDPQGRA N +L PAD EKLFR+HIKMLQER V EFR LLAEVLT Sbjct: 789 KDPLASWTESKSKLEKDPQGRANNPELGPADMEKLFRDHIKMLQERRVNEFRVLLAEVLT 848 Query: 402 SEAASQETEDEKTPLNSWSTAKRLLKSDLRYSKVPRNEREGLWRRYVEDMLRRKKSANDS 223 EAAS+ETED KT LNSWSTAKR+LKSD RY+KVPR ERE LW RY ED+LRR+KS++D Sbjct: 849 IEAASRETEDGKTVLNSWSTAKRVLKSDPRYNKVPREERETLWHRYAEDVLRRQKSSHDP 908 Query: 222 KEEKRTDARSK 190 +EEK TD++ + Sbjct: 909 REEKHTDSKGR 919 >gb|OIV99579.1| hypothetical protein TanjilG_17389 [Lupinus angustifolius] Length = 891 Score = 1108 bits (2865), Expect = 0.0 Identities = 595/881 (67%), Positives = 658/881 (74%), Gaps = 4/881 (0%) Frame = -3 Query: 2820 QPPVPPGLSPH-AAPSFLYNTSQNVPPFSSNQHLHSGT--NMLVPTAQNVSKVSSASSIL 2650 + P P +P+ 
+APS ++ + N+P N HS T NM AQ+VSKVSSASSI Sbjct: 5 ETPSPSPSTPNTSAPSAPFSYNNNMPQNVINTSHHSSTYSNMSDSVAQDVSKVSSASSIP 64 Query: 2649 HPAPAPNSISSMPPPSDPNYRPTTSWMPTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCS 2470 + P +S +M P DPNYRP TSWMP+A VI + Sbjct: 65 NSVPPHSSNPTMRPTYDPNYRPPTSWMPSAPSFPMHHVIPGTPGNPAPPGLTPALVISSN 124 Query: 2469 PAAPSTGTDSSSTAVLRQNMLTAPIASDPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSG 2290 +APST +DSSS AV RQNM A IASDPT QKG PYP IP M A PQG WL PPQ+SG Sbjct: 125 LSAPSTSSDSSSAAVPRQNMPNAAIASDPTLQQKGTPYPSIPVMAASPQGLWLPPPQISG 184 Query: 2289 VLRPPFLQYPHXXXXXXXXXXXARGVHLPAVPVPDSQPPGVTPVGAVG-TFASSTSIHQP 2113 VLRPPFL YP ARGV LPAVPVPDSQPPGVTP+ A G T AS S HQ Sbjct: 185 VLRPPFLPYP--AAFPGPFPFPARGVTLPAVPVPDSQPPGVTPMTAAGVTSASPASSHQL 242 Query: 2112 RGTGGLQTEVISAHPDDKKLNAVVTQNEDATNDQLDAWTAHKTEAGVVYYYNAVTRESTY 1933 RGT G QTEVI H D KK+ V QNED ND LDAWTAHKTEAG+VYYYNA T ESTY Sbjct: 243 RGTTGFQTEVIPGHADYKKI-LNVAQNEDPANDHLDAWTAHKTEAGIVYYYNASTGESTY 301 Query: 1932 AKPAGFKGESHQVSVQPTPASVVDLPGTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAE 1753 KPAGFKGE HQV+VQPTP S+V+LPGTDW LVSTSDGKKYYYNN+TKTSCWQIPNEVAE Sbjct: 302 DKPAGFKGEPHQVAVQPTPVSMVNLPGTDWVLVSTSDGKKYYYNNRTKTSCWQIPNEVAE 361 Query: 1752 LKKKQDGDVARDHLMSVSKTNVLSDRGSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSAL 1573 LKKKQDGDV +DHLMSV TNVLSD+G GMVTLNAPAINTGGRDAAA KPSSVQS SSAL Sbjct: 362 LKKKQDGDVTKDHLMSVPNTNVLSDKGPGMVTLNAPAINTGGRDAAALKPSSVQSSSSAL 421 Query: 1572 DLIKKKLQESGTPVASSAIPTPSVQSGSESNGSKATESTAKGLQNDNSKDKEKDANGDTN 1393 DLIKKKLQ+SGTPV SS+IP SV GSESNGSKA ESTAKGLQ DN+KDK+KD NGD N Sbjct: 422 DLIKKKLQDSGTPVTSSSIPNSSV--GSESNGSKAAESTAKGLQIDNNKDKQKDTNGDAN 479 Query: 1392 VXXXXXXXXXXDNGPSKEECINQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSA 1213 V D+GPSKEECI QFKEMLKERGVAPFSKW+KELPKIVFDPRFKAIPS+SA Sbjct: 480 VSDTSSDSEDEDSGPSKEECILQFKEMLKERGVAPFSKWDKELPKIVFDPRFKAIPSYSA 539 Query: 1212 RRSLFEHYVKTXXXXXXXXXXXXXXXXXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLR 1033 RRSLFEHYVKT EGFK+LLDEA EDINH+TDY TFR+KWG+D R Sbjct: 540 RRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKKLLDEALEDINHNTDYQTFREKWGDDPR 599 Query: 1032 
FEAVDRKEREHLLNERVLPLKKATXXXXXXXXXXXXASFKSMLEERGDIALNSRWSRVKE 853 F+A+DRKEREHLL+ERVLPLKKA +SFKS+++E+GDI NSRWSRVKE Sbjct: 600 FKALDRKEREHLLSERVLPLKKAAEEKAQAMREAATSSFKSLIKEKGDITFNSRWSRVKE 659 Query: 852 SLRDDPRYKSVKHEDREFLFNEYISELKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXX 673 SLRDDPRYKSV+HEDRE LFNEYISELKA EHAAERETRAKR+EQ+KL Sbjct: 660 SLRDDPRYKSVRHEDREILFNEYISELKAAEHAAERETRAKREEQEKLRERERELRKRKE 719 Query: 672 XXXXXXXXXRLKIRRKDAVTSFQALLVETIKDPMASWTESKPKLEKDPQGRAANSDLDPA 493 RLKIRRK+AV SFQALLVETIKDP+ASWTESK KLEKDPQGRA N +L PA Sbjct: 720 REEHEMERVRLKIRRKEAVASFQALLVETIKDPLASWTESKSKLEKDPQGRANNPELGPA 779 Query: 492 DTEKLFREHIKMLQERCVQEFRTLLAEVLTSEAASQETEDEKTPLNSWSTAKRLLKSDLR 313 D EKLFR+HIKMLQER V EFR LLAEVLT EAAS+ETED KT LNSWSTAKR+LKSD R Sbjct: 780 DMEKLFRDHIKMLQERRVNEFRVLLAEVLTIEAASRETEDGKTVLNSWSTAKRVLKSDPR 839 Query: 312 YSKVPRNEREGLWRRYVEDMLRRKKSANDSKEEKRTDARSK 190 Y+KVPR ERE LW RY ED+LRR+KS++D +EEK TD++ + Sbjct: 840 YNKVPREERETLWHRYAEDVLRRQKSSHDPREEKHTDSKGR 880 >ref|XP_014619437.1| PREDICTED: pre-mRNA-processing protein 40C-like isoform X3 [Glycine max] Length = 876 Score = 1103 bits (2852), Expect = 0.0 Identities = 587/829 (70%), Positives = 633/829 (76%), Gaps = 6/829 (0%) Frame = -3 Query: 2916 FSYNMHHNVNASGNSQQSPAHPGMKSNS---PMVVQPPVPPGLSPHAAPSFLYNTSQNVP 2746 F+Y M NVNASG+SQQS HPGMKSNS PMVVQPP G+S HAAPSF YN Q+ Sbjct: 53 FAYGMLQNVNASGSSQQSSTHPGMKSNSAVNPMVVQPP---GVSLHAAPSFSYNIPQSGA 109 Query: 2745 PFSSNQ-HLHSGTNMLVPTAQNVSKVSSASSILHPAPAPNSISSMPPPSDPNYRPTTSWM 2569 FSSNQ H S TNM AQ+V K+SSASSI H PA S S MPPPSDPNYRP TSWM Sbjct: 110 IFSSNQQHAQSSTNMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDPNYRPATSWM 169 Query: 2568 PTAXXXXXXXXXXXXXXXXXXXXXXXXGVIPCSPAAPSTGTDSSSTAVLRQNMLTAPIAS 2389 PTA +I +PAAPSTGTDSS A+LR NM T+ IAS Sbjct: 170 PTAMSFPVLPVMPTQGNPGPPGLASSA-IISSNPAAPSTGTDSSPAALLRPNMPTSAIAS 228 Query: 2388 DPTASQKGLPYPPIPSMVAPPQGFWLQPPQMSGVLRPPFLQYPHXXXXXXXXXXXARGVH 2209 DPTA QKGLPYP +P+M APPQG WLQPPQMSGVLRPP+LQYP ARGV Sbjct: 229 DPTAPQKGLPYPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYP--APFPGPFPFPARGVA 286 
Query: 2208 LPAVPVPDSQPPGVTPVGAVGTFASSTSIHQPRGTGGLQTEVISAHPDDKK-LNAVVTQN 2032 LPAVP+PDSQPPGVTPVGA G ++ +S HQ RGT LQTEVIS DDKK LN+V T N Sbjct: 287 LPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDTVN 346 Query: 2031 EDATN-DQLDAWTAHKTEAGVVYYYNAVTRESTYAKPAGFKGESHQVSVQPTPASVVDLP 1855 EDA N DQLDAWTAHKTEAG++YYYNAVT ESTY KPAGFKGESHQVS QP P S++DLP Sbjct: 347 EDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMDLP 406 Query: 1854 GTDWQLVSTSDGKKYYYNNQTKTSCWQIPNEVAELKKKQDGDVARDHLMSVSKTNVLSDR 1675 GTDW+LVSTSDGKKYYYNN+TKTSCWQIPNEVAELKKKQDGDV +DHLMSVS TNVLSDR Sbjct: 407 GTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLSDR 466 Query: 1674 GSGMVTLNAPAINTGGRDAAAPKPSSVQSPSSALDLIKKKLQESGTPVASSAIPTPSVQS 1495 GSGMVTLNAPAINTGGRDAAA KPSS+Q+ SALDLIKKKLQ+SGTPVASS+IP PSVQ+ Sbjct: 467 GSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKKKLQDSGTPVASSSIPAPSVQT 526 Query: 1494 GSESNGSKATESTAKGLQNDNSKDKEKDANGDTNVXXXXXXXXXXDNGPSKEECINQFKE 1315 G ESNGSK +STAKGLQ DN+KDK KD NGD NV DNGPSKEECI QFKE Sbjct: 527 GPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTSSDSEDEDNGPSKEECIIQFKE 586 Query: 1314 MLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTXXXXXXXXXXXXXXX 1135 MLKERGV PFSKWEKELPKIVFDPRFKAIPS+SARRSLFEHYVKT Sbjct: 587 MLKERGVVPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKA 646 Query: 1134 XXEGFKQLLDEASEDINHDTDYHTFRKKWGNDLRFEAVDRKEREHLLNERVLPLKKATXX 955 EGFK+LLDEASEDIN++TDY TFRKKW ND RFEA+DRKE+EHLLNERVLPLKKA Sbjct: 647 AIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEALDRKEQEHLLNERVLPLKKAAEE 706 Query: 954 XXXXXXXXXXASFKSMLEERGDIALNSRWSRVKESLRDDPRYKSVKHEDREFLFNEYISE 775 ASFKSML+ERGDI+ NSRWSRVKE+LRDDPRYK V+HEDRE LFNEYISE Sbjct: 707 KAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLRDDPRYKCVRHEDREVLFNEYISE 766 Query: 774 LKAIEHAAERETRAKRDEQDKLXXXXXXXXXXXXXXXXXXXXXRLKIRRKDAVTSFQALL 595 LKA EHAAERET+AK +EQDKL RLKIRRKDAVT FQALL Sbjct: 767 LKAAEHAAERETKAKMEEQDKLRERERELRKRKEREEQEMERVRLKIRRKDAVTLFQALL 826 Query: 594 VETIKDPMASWTESKPKLEKDPQGRAANSDLDPADTEKLFREHIKMLQE 448 VETIKDP+ SWTESKPKLEKD Q RA N DLDP DTEKLFREH+KMLQE Sbjct: 827 
VETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTEKLFREHVKMLQE 875