BLASTX nr result
ID: Sinomenium21_contig00020123
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00020123 (2089 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CBI29964.3| unnamed protein product [Vitis vinifera]              851   0.0
ref|XP_002276675.1| PREDICTED: pre-mRNA-splicing factor rse1-lik...   850   0.0
ref|XP_007204299.1| hypothetical protein PRUPE_ppa000262mg [Prun...   836   0.0
ref|XP_006481686.1| PREDICTED: uncharacterized protein LOC102624...   828   0.0
ref|XP_006481685.1| PREDICTED: uncharacterized protein LOC102624...   828   0.0
ref|XP_002308344.2| hypothetical protein POPTR_0006s21160g [Popu...   822   0.0
gb|EXB29323.1| DNA damage-binding protein 1b [Morus notabilis]        809   0.0
ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-lik...   800   0.0
ref|XP_006838801.1| hypothetical protein AMTR_s00002p00260810 [A...   794   0.0
ref|XP_007029116.1| Cleavage and polyadenylation specificity fac...   794   0.0
ref|XP_006351358.1| PREDICTED: pre-mRNA-splicing factor prp12-li...   792   0.0
ref|XP_004249760.1| PREDICTED: pre-mRNA-splicing factor prp12-li...   790   0.0
ref|XP_004303372.1| PREDICTED: pre-mRNA-splicing factor rse-1-li...   788   0.0
ref|XP_006407388.1| hypothetical protein EUTSA_v10019900mg [Eutr...   762   0.0
ref|XP_002531586.1| spliceosomal protein sap, putative [Ricinus ...   761   0.0
ref|XP_007163031.1| hypothetical protein PHAVU_001G200200g [Phas...   760   0.0
ref|XP_006577113.1| PREDICTED: splicing factor 3B subunit 3-like...   756   0.0
ref|XP_006577112.1| PREDICTED: splicing factor 3B subunit 3-like...   756   0.0
ref|XP_004494300.1| PREDICTED: uncharacterized protein LOC101490...
748 0.0 ref|XP_007029117.1| Cleavage and polyadenylation specificity fac... 746 0.0 >emb|CBI29964.3| unnamed protein product [Vitis vinifera] Length = 1363 Score = 851 bits (2198), Expect = 0.0 Identities = 456/685 (66%), Positives = 516/685 (75%), Gaps = 8/685 (1%) Frame = -2 Query: 2085 NRPGAGLPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVP 1906 N A L IGV I FVIGTHKPSVEILSF+P+ GLRILASG ISLTNTLGTA+SGCVP Sbjct: 687 NSSAAALLIGVNIGRIFVIGTHKPSVEILSFLPDEGLRILASGAISLTNTLGTAVSGCVP 746 Query: 1905 QDVRLVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXX 1726 QD RLVLVDR Y+LSGLRNGMLLRFE P +S F SEL + SPS TNIN+ Sbjct: 747 QDARLVLVDRFYVLSGLRNGMLLRFELPAASMVFSSELSSHSPS-----TNINS------ 795 Query: 1725 XXXXIRQQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPW 1546 P+ LQLIAIRRIGITPVFLVPL D L+ADIIALSDRPW Sbjct: 796 ----------------------PVNLQLIAIRRIGITPVFLVPLSDSLEADIIALSDRPW 833 Query: 1545 LLQTARHSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHL 1366 LLQ+ARHSLSYTSISFQPSTHVTPVCS++CP GILFVAE+SLHLVEMVHSKRLNVQKF+L Sbjct: 834 LLQSARHSLSYTSISFQPSTHVTPVCSMECPMGILFVAENSLHLVEMVHSKRLNVQKFYL 893 Query: 1365 GGTPRKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETG 1186 GGTPRKVLYHSE+RLLLVMRT+L SSDIC VDPLSGS+L+++ LE GETG Sbjct: 894 GGTPRKVLYHSESRLLLVMRTELSQDTY------SSDICCVDPLSGSVLSSFKLELGETG 947 Query: 1185 KSMQLVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPN 1015 KSM+LV+V NEQVLV+GTS S+G +MPSGEAESTKGRL+VL + H++ S C Sbjct: 948 KSMELVRVVNEQVLVIGTSLSSGPAMMPSGEAESTKGRLIVLCLEHMQNSDSGSMTFCSK 1007 Query: 1014 XXXXXXXXSPLGDIVGRATEQ--XXXXXXXXXXXXXXXXXXNGCREWELELVFQTTLPGA 841 SP +IVG A EQ W+L L + T PG Sbjct: 1008 AGSSSQRTSPFREIVGYAAEQLSGSSLCSSPDDTSCDGVRLEESEAWQLRLAYTATWPGM 1067 Query: 840 VLAVCPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDC 661 VLA+CPYLDRYFLASAGN +V GF ++NP RVR+FA GRTRF I LT FTRIAVGDC Sbjct: 1068 VLAICPYLDRYFLASAGNSFYVCGFPNDNPQRVRRFAVGRTRFMIMSLTAHFTRIAVGDC 1127 Query: 660 RDGILFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDN 481 RDG++FYSY ED 
+KLEQLYCDP+QRLVADC LMD+DTAVVSDR+G+ AVLS N+LEDN Sbjct: 1128 RDGVVFYSYHEDSRKLEQLYCDPEQRLVADCILMDVDTAVVSDRKGSIAVLSCSNHLEDN 1187 Query: 480 ASPECNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLL 301 ASPECNLTL+CSYY+GE MSI+KGSFSYKLP DDVLKGC ++ ++D +SI+A TLL Sbjct: 1188 ASPECNLTLNCSYYMGEIAMSIKKGSFSYKLPADDVLKGCDGSNTIIDFSENSIMAGTLL 1247 Query: 300 GSVMIFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRV---GVPKILDGD 130 GS+++ IPIS EEHELLEAVQ+RL VH LTAPILGNDHNEFR R++ V GV KILDGD Sbjct: 1248 GSIIMLIPISREEHELLEAVQARLAVHQLTAPILGNDHNEFRSRENSVRKAGVSKILDGD 1307 Query: 129 MLAQFLELTSMQQEAVLALPLGLSE 55 MLAQFLELTSMQQEAVLALPLG E Sbjct: 1308 MLAQFLELTSMQQEAVLALPLGSLE 1332 >ref|XP_002276675.1| PREDICTED: pre-mRNA-splicing factor rse1-like [Vitis vinifera] Length = 1387 Score = 850 bits (2196), Expect = 0.0 Identities = 454/695 (65%), Positives = 517/695 (74%), Gaps = 18/695 (2%) Frame = -2 Query: 2085 NRPGAGLPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVP 1906 N A L IGV I FVIGTHKPSVEILSF+P+ GLRILASG ISLTNTLGTA+SGCVP Sbjct: 687 NSSAAALLIGVNIGRIFVIGTHKPSVEILSFLPDEGLRILASGAISLTNTLGTAVSGCVP 746 Query: 1905 QDVRLVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXX 1726 QD RLVLVDR Y+LSGLRNGMLLRFE P +S F SEL + SPS+S C N Sbjct: 747 QDARLVLVDRFYVLSGLRNGMLLRFELPAASMVFSSELSSHSPSVSSCSVN--------- 797 Query: 1725 XXXXIRQQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPW 1546 + + + P+ LQLIAIRRIGITPVFLVPL D L+ADIIALSDRPW Sbjct: 798 ----------DADTNLSKNINSPVNLQLIAIRRIGITPVFLVPLSDSLEADIIALSDRPW 847 Query: 1545 LLQTARHSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHL 1366 LLQ+ARHSLSYTSISFQPSTHVTPVCS++CP GILFVAE+SLHLVEMVHSKRLNVQKF+L Sbjct: 848 LLQSARHSLSYTSISFQPSTHVTPVCSMECPMGILFVAENSLHLVEMVHSKRLNVQKFYL 907 Query: 1365 GGTPRKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETG 1186 GGTPRKVLYHSE+RLLLVMRT+L SSDIC VDPLSGS+L+++ LE GETG Sbjct: 908 GGTPRKVLYHSESRLLLVMRTELSQDTY------SSDICCVDPLSGSVLSSFKLELGETG 961 Query: 1185 KSMQLVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPN 1015 KSM+LV+V 
NEQVLV+GTS S+G +MPSGEAESTKGRL+VL + H++ S C Sbjct: 962 KSMELVRVVNEQVLVIGTSLSSGPAMMPSGEAESTKGRLIVLCLEHMQNSDSGSMTFCSK 1021 Query: 1014 XXXXXXXXSPLGDIVGRATEQXXXXXXXXXXXXXXXXXXN--GCREWELELVFQTTLPGA 841 SP +IVG A EQ W+L L + T PG Sbjct: 1022 AGSSSQRTSPFREIVGYAAEQLSGSSLCSSPDDTSCDGVRLEESEAWQLRLAYTATWPGM 1081 Query: 840 VLAVCPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDC 661 VLA+CPYLDRYFLASAGN +V GF ++NP RVR+FA GRTRF I LT FTRIAVGDC Sbjct: 1082 VLAICPYLDRYFLASAGNSFYVCGFPNDNPQRVRRFAVGRTRFMIMSLTAHFTRIAVGDC 1141 Query: 660 RDGILFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLE-- 487 RDG++FYSY ED +KLEQLYCDP+QRLVADC LMD+DTAVVSDR+G+ AVLS N+LE Sbjct: 1142 RDGVVFYSYHEDSRKLEQLYCDPEQRLVADCILMDVDTAVVSDRKGSIAVLSCSNHLEEL 1201 Query: 486 -----------DNASPECNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVV 340 DNASPECNLTL+CSYY+GE MSI+KGSFSYKLP DDVLKGC ++ ++ Sbjct: 1202 HGFKFLIISCPDNASPECNLTLNCSYYMGEIAMSIKKGSFSYKLPADDVLKGCDGSNTII 1261 Query: 339 DLLHSSIVASTLLGSVMIFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSR 160 D +SI+A TLLGS+++ IPIS EEHELLEAVQ+RL VH LTAPILGNDHNEFR R++ Sbjct: 1262 DFSENSIMAGTLLGSIIMLIPISREEHELLEAVQARLAVHQLTAPILGNDHNEFRSRENS 1321 Query: 159 VGVPKILDGDMLAQFLELTSMQQEAVLALPLGLSE 55 GV KILDGDMLAQFLELTSMQQEAVLALPLG E Sbjct: 1322 AGVSKILDGDMLAQFLELTSMQQEAVLALPLGSLE 1356 >ref|XP_007204299.1| hypothetical protein PRUPE_ppa000262mg [Prunus persica] gi|462399830|gb|EMJ05498.1| hypothetical protein PRUPE_ppa000262mg [Prunus persica] Length = 1378 Score = 836 bits (2159), Expect = 0.0 Identities = 447/679 (65%), Positives = 506/679 (74%), Gaps = 5/679 (0%) Frame = -2 Query: 2085 NRPGAGLPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVP 1906 N A LP GV I + FVIGTHKPSVE+LS VP GLR+LASG ISLTNTLGTAISGC+P Sbjct: 686 NSCDATLPFGVDISNIFVIGTHKPSVEVLSLVPNEGLRVLASGTISLTNTLGTAISGCIP 745 Query: 1905 QDVRLVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXX 1726 QDVRLVLVDRLY+LSGLRNGMLLRFEWP S + +P S S+ N N Sbjct: 746 QDVRLVLVDRLYVLSGLRNGMLLRFEWPASPT-----MPVGSLSV-----NTNTVFPSVS 795 Query: 1725 
XXXXIRQQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPW 1546 + + S + D PI LQLIA RRIGITPVFLVPL D LD DI+ LSDRPW Sbjct: 796 AANSFGPKIYDVKFSEKTKDKFPIELQLIATRRIGITPVFLVPLSDSLDGDIVVLSDRPW 855 Query: 1545 LLQTARHSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHL 1366 LL TARHSLSYTSISFQ STHVTPVC V+CPKGILFVAE+ LHLVEMVHSKRLNVQKFHL Sbjct: 856 LLHTARHSLSYTSISFQSSTHVTPVCYVECPKGILFVAENCLHLVEMVHSKRLNVQKFHL 915 Query: 1365 GGTPRKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETG 1186 GGTPR+VLYHSE+RLLLVMRTDL SSSDIC VDPLSGS+L+++ LEPGETG Sbjct: 916 GGTPREVLYHSESRLLLVMRTDLSNDT------SSSDICCVDPLSGSVLSSFKLEPGETG 969 Query: 1185 KSMQLVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSSRH---CPN 1015 KSM+LV+VGNEQVLVVGTS S+G IMPSGEAESTKGRL+VL + H++ S C Sbjct: 970 KSMELVRVGNEQVLVVGTSLSSGPAIMPSGEAESTKGRLIVLCLEHVQNSDSGSMTLCSK 1029 Query: 1014 XXXXXXXXSPLGDIVGRATEQXXXXXXXXXXXXXXXXXXN--GCREWELELVFQTTLPGA 841 SP +IVG ATEQ W+ L + T PG Sbjct: 1030 AGSSSQRASPFHEIVGYATEQLSSSSLCSSPDDTSCDGIKLEETEAWQFRLAYVTKWPGM 1089 Query: 840 VLAVCPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDC 661 VLA+CPYLDRYFLAS+GN +V GF ++N RVRKFA RTRF IT LT FT IAVGDC Sbjct: 1090 VLAICPYLDRYFLASSGNAFYVCGFPNDNSQRVRKFAWARTRFMITSLTAHFTTIAVGDC 1149 Query: 660 RDGILFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDN 481 RDG+LFY+Y ED KKL+QLY DP QRLVADC LMD++TAVVSDR+G+ AVLS +YLED Sbjct: 1150 RDGVLFYAYHEDSKKLQQLYFDPCQRLVADCILMDVNTAVVSDRKGSIAVLSCADYLEDT 1209 Query: 480 ASPECNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLL 301 ASPECNLT+SC+YY+GE MSIRKGSFSYKLP DDVLKGC D +D ++I+ STLL Sbjct: 1210 ASPECNLTVSCAYYMGEIAMSIRKGSFSYKLPADDVLKGC---DGNIDFSQNAIIVSTLL 1266 Query: 300 GSVMIFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLA 121 GS++ F+PIS EE+ELLEAVQ RLVVHPLTAPILGNDHNE+R R++ VGVPKILDGDML+ Sbjct: 1267 GSIITFVPISREEYELLEAVQDRLVVHPLTAPILGNDHNEYRSRENPVGVPKILDGDMLS 1326 Query: 120 QFLELTSMQQEAVLALPLG 64 QFLELT MQQEAVL+ PLG Sbjct: 1327 QFLELTGMQQEAVLSSPLG 1345 >ref|XP_006481686.1| PREDICTED: uncharacterized protein 
LOC102624787 isoform X2 [Citrus sinensis] Length = 1265 Score = 828 bits (2138), Expect = 0.0 Identities = 439/673 (65%), Positives = 507/673 (75%), Gaps = 5/673 (0%) Frame = -2 Query: 2067 LPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVRLV 1888 LP GV I TFVIGTH+PSVE+LSFVP+ GLR+LASG I LTNT+GTAISGC+PQDVRLV Sbjct: 569 LPAGVIIGYTFVIGTHRPSVEVLSFVPKEGLRVLASGSIVLTNTMGTAISGCIPQDVRLV 628 Query: 1887 LVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXXIR 1708 L D+ Y+L+GLRNGMLLRFEWP S+ S P SP IS N Sbjct: 629 LADQFYVLAGLRNGMLLRFEWPPDSNIPSSVAPIHSP-ISATFRNTENIRSGIAATSSFG 687 Query: 1707 QQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQTAR 1528 + A S D++PI LQLIA RRIGITPVFLVPL D LDAD+IALSDRPWLLQTAR Sbjct: 688 SEMSAFNLSEESKDELPINLQLIATRRIGITPVFLVPLSDLLDADMIALSDRPWLLQTAR 747 Query: 1527 HSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTPRK 1348 HSL+YTSISFQPSTH TPVCSV+CPKGILFVAE+SL+LVEMVH+KRLNV KFHLGGTP+K Sbjct: 748 HSLAYTSISFQPSTHATPVCSVECPKGILFVAENSLNLVEMVHNKRLNVPKFHLGGTPKK 807 Query: 1347 VLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQLV 1168 VLYHSE+RLL+VMRT+L SSDIC VDPLSGS+L+++ LE GETGKSM+LV Sbjct: 808 VLYHSESRLLIVMRTELNNDTC------SSDICCVDPLSGSVLSSFKLELGETGKSMELV 861 Query: 1167 KVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGD---SSRHCPNXXXXXX 997 +VG+EQVLVVGTS S+G IMPSGEAESTKGRL+VL I H++ S C Sbjct: 862 RVGHEQVLVVGTSLSSGPAIMPSGEAESTKGRLIVLCIEHMQNSDCGSMTFCSKAGSSSQ 921 Query: 996 XXSPLGDIVGRATEQXXXXXXXXXXXXXXXXXXN--GCREWELELVFQTTLPGAVLAVCP 823 SP +IVG ATEQ W+L L + TT PG VLA+CP Sbjct: 922 RTSPFREIVGYATEQLSSSSLCSSPDDASCDGIKLEETETWQLRLAYSTTWPGMVLAICP 981 Query: 822 YLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGILF 643 YLDRYFLASAGN +V GF ++NP RVR+FA GRTRF I LT FTRIAVGDCRDGILF Sbjct: 982 YLDRYFLASAGNAFYVCGFPNDNPQRVRRFAVGRTRFMIMLLTAHFTRIAVGDCRDGILF 1041 Query: 642 YSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPECN 463 YSY ED +KLEQ+YCDP QRLVADC LMD+DTAVVSDR+G+ AVLS + LEDNASPECN Sbjct: 1042 
YSYHEDARKLEQIYCDPSQRLVADCVLMDVDTAVVSDRKGSIAVLSCSDRLEDNASPECN 1101 Query: 462 LTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVMIF 283 LT +C+Y++GE +SIRKGSF YKLP DD L C + + ++I+ASTLLGS++IF Sbjct: 1102 LTPNCAYHMGEIAVSIRKGSFIYKLPADDTLGDCLAS---FESSQTTIIASTLLGSIVIF 1158 Query: 282 IPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQFLELT 103 IPISSEE+ELLEAVQ+RL +HPLTAP+LGNDHNEFR R++ VGVPKILDGDML+QFLELT Sbjct: 1159 IPISSEEYELLEAVQARLAIHPLTAPLLGNDHNEFRSRENPVGVPKILDGDMLSQFLELT 1218 Query: 102 SMQQEAVLALPLG 64 S QQEAVL+ LG Sbjct: 1219 STQQEAVLSFTLG 1231 >ref|XP_006481685.1| PREDICTED: uncharacterized protein LOC102624787 isoform X1 [Citrus sinensis] Length = 1394 Score = 828 bits (2138), Expect = 0.0 Identities = 439/673 (65%), Positives = 507/673 (75%), Gaps = 5/673 (0%) Frame = -2 Query: 2067 LPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVRLV 1888 LP GV I TFVIGTH+PSVE+LSFVP+ GLR+LASG I LTNT+GTAISGC+PQDVRLV Sbjct: 698 LPAGVIIGYTFVIGTHRPSVEVLSFVPKEGLRVLASGSIVLTNTMGTAISGCIPQDVRLV 757 Query: 1887 LVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXXIR 1708 L D+ Y+L+GLRNGMLLRFEWP S+ S P SP IS N Sbjct: 758 LADQFYVLAGLRNGMLLRFEWPPDSNIPSSVAPIHSP-ISATFRNTENIRSGIAATSSFG 816 Query: 1707 QQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQTAR 1528 + A S D++PI LQLIA RRIGITPVFLVPL D LDAD+IALSDRPWLLQTAR Sbjct: 817 SEMSAFNLSEESKDELPINLQLIATRRIGITPVFLVPLSDLLDADMIALSDRPWLLQTAR 876 Query: 1527 HSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTPRK 1348 HSL+YTSISFQPSTH TPVCSV+CPKGILFVAE+SL+LVEMVH+KRLNV KFHLGGTP+K Sbjct: 877 HSLAYTSISFQPSTHATPVCSVECPKGILFVAENSLNLVEMVHNKRLNVPKFHLGGTPKK 936 Query: 1347 VLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQLV 1168 VLYHSE+RLL+VMRT+L SSDIC VDPLSGS+L+++ LE GETGKSM+LV Sbjct: 937 VLYHSESRLLIVMRTELNNDTC------SSDICCVDPLSGSVLSSFKLELGETGKSMELV 990 Query: 1167 KVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGD---SSRHCPNXXXXXX 997 +VG+EQVLVVGTS S+G IMPSGEAESTKGRL+VL I H++ S C Sbjct: 991 
RVGHEQVLVVGTSLSSGPAIMPSGEAESTKGRLIVLCIEHMQNSDCGSMTFCSKAGSSSQ 1050 Query: 996 XXSPLGDIVGRATEQXXXXXXXXXXXXXXXXXXN--GCREWELELVFQTTLPGAVLAVCP 823 SP +IVG ATEQ W+L L + TT PG VLA+CP Sbjct: 1051 RTSPFREIVGYATEQLSSSSLCSSPDDASCDGIKLEETETWQLRLAYSTTWPGMVLAICP 1110 Query: 822 YLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGILF 643 YLDRYFLASAGN +V GF ++NP RVR+FA GRTRF I LT FTRIAVGDCRDGILF Sbjct: 1111 YLDRYFLASAGNAFYVCGFPNDNPQRVRRFAVGRTRFMIMLLTAHFTRIAVGDCRDGILF 1170 Query: 642 YSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPECN 463 YSY ED +KLEQ+YCDP QRLVADC LMD+DTAVVSDR+G+ AVLS + LEDNASPECN Sbjct: 1171 YSYHEDARKLEQIYCDPSQRLVADCVLMDVDTAVVSDRKGSIAVLSCSDRLEDNASPECN 1230 Query: 462 LTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVMIF 283 LT +C+Y++GE +SIRKGSF YKLP DD L C + + ++I+ASTLLGS++IF Sbjct: 1231 LTPNCAYHMGEIAVSIRKGSFIYKLPADDTLGDCLAS---FESSQTTIIASTLLGSIVIF 1287 Query: 282 IPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQFLELT 103 IPISSEE+ELLEAVQ+RL +HPLTAP+LGNDHNEFR R++ VGVPKILDGDML+QFLELT Sbjct: 1288 IPISSEEYELLEAVQARLAIHPLTAPLLGNDHNEFRSRENPVGVPKILDGDMLSQFLELT 1347 Query: 102 SMQQEAVLALPLG 64 S QQEAVL+ LG Sbjct: 1348 STQQEAVLSFTLG 1360 >ref|XP_002308344.2| hypothetical protein POPTR_0006s21160g [Populus trichocarpa] gi|550336774|gb|EEE91867.2| hypothetical protein POPTR_0006s21160g [Populus trichocarpa] Length = 1397 Score = 822 bits (2124), Expect = 0.0 Identities = 432/675 (64%), Positives = 507/675 (75%), Gaps = 5/675 (0%) Frame = -2 Query: 2073 AGLPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVR 1894 A LP+GV +TFVIGTHKPSVE++SFVP +GLRI+ASG ISLT++LGT +SGC+PQDVR Sbjct: 695 AALPVGVDTGNTFVIGTHKPSVEVVSFVPGDGLRIIASGTISLTSSLGTTVSGCIPQDVR 754 Query: 1893 LVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXX 1714 LVL DR Y+LSGLRNGMLLRFEWP +SS F E+P+ SI C+ + + Sbjct: 755 LVLADRFYVLSGLRNGMLLRFEWPSASSMFSVEIPSHGCSIGSCMLSSDTAISNTAAISL 814 Query: 1713 IRQQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQT 1534 + A + DD+PI LQLIA 
RRIGITPVFLVPL D LD+D+IALSDRPWLL Sbjct: 815 -EPKMLAVDSIDNTMDDLPINLQLIATRRIGITPVFLVPLSDSLDSDMIALSDRPWLLHA 873 Query: 1533 ARHSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTP 1354 ARHSLSYTSISFQPSTH TPVCSV+CPKGILFVA++SLHLVEMVHS RLNVQKFHLGGTP Sbjct: 874 ARHSLSYTSISFQPSTHATPVCSVECPKGILFVADNSLHLVEMVHSTRLNVQKFHLGGTP 933 Query: 1353 RKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQ 1174 RKV YHSE++LLLVMRT+L SSDIC VDPLSGS ++++ LE GETGKSM+ Sbjct: 934 RKVQYHSESKLLLVMRTELSNDNDTC----SSDICCVDPLSGSTVSSFKLERGETGKSME 989 Query: 1173 LVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPNXXXX 1003 LVK+GNEQVLV+GTS S+G IMPSGEAESTKGR++VL + +++ S C Sbjct: 990 LVKIGNEQVLVIGTSLSSGPAIMPSGEAESTKGRVIVLCLENLQNSDSGSMTFCSKAGSS 1049 Query: 1002 XXXXSPLGDIVGRATEQXXXXXXXXXXXXXXXXXXN--GCREWELELVFQTTLPGAVLAV 829 SP +IVG A EQ W+L V TTLPG VLA+ Sbjct: 1050 SQRTSPFREIVGYAAEQLSSSSLCSSPDDTSCDGVKLEETETWQLRFVSATTLPGMVLAI 1109 Query: 828 CPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGI 649 CPYLDR+FLASAGN +V GF ++N RV+KFA GRTRF I LT TRIAVGDCRDGI Sbjct: 1110 CPYLDRFFLASAGNSFYVCGFANDNK-RVKKFAVGRTRFMIMSLTAYHTRIAVGDCRDGI 1168 Query: 648 LFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPE 469 LFY+Y + KKLEQLYCDP QRLVA C LMD+DTAVVSDR+G+ AVLS + E SPE Sbjct: 1169 LFYAYHVESKKLEQLYCDPSQRLVAGCVLMDVDTAVVSDRKGSIAVLSRSDRFECTGSPE 1228 Query: 468 CNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVM 289 CNLTL+C+YY+GE MSIRKGSF+YKLP DD+L GC +D +++IVASTLLGS++ Sbjct: 1229 CNLTLNCAYYMGEIAMSIRKGSFTYKLPADDILTGCDGVITKMDASNNTIVASTLLGSII 1288 Query: 288 IFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQFLE 109 +FIP+S EE ELL+AVQSRLVVHPLTAP+LGNDH+EFR R++ VGVPKILDGDMLAQFLE Sbjct: 1289 VFIPLSREEFELLQAVQSRLVVHPLTAPVLGNDHHEFRSRENPVGVPKILDGDMLAQFLE 1348 Query: 108 LTSMQQEAVLALPLG 64 LTS QQEAVL+LPLG Sbjct: 1349 LTSSQQEAVLSLPLG 1363 >gb|EXB29323.1| DNA damage-binding protein 1b [Morus notabilis] Length = 1388 Score = 809 bits (2090), Expect = 0.0 Identities = 441/688 (64%), Positives = 
503/688 (73%), Gaps = 15/688 (2%) Frame = -2 Query: 2073 AGLPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVR 1894 + LP V I FV+GTHKPSVE+L F P+ GLR++A+G I+LT +GTA+SGCVPQDVR Sbjct: 691 SALPSEVDISKAFVVGTHKPSVEVLVFDPDEGLRVIANGTIALTTIMGTAVSGCVPQDVR 750 Query: 1893 LVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXX 1714 LV V+RLYILSGLRNGMLLRFEWP SAF T SPS+ L N NA Sbjct: 751 LVYVNRLYILSGLRNGMLLRFEWP---SAF-----TFSPSV---LANRNALSSVLVDAGP 799 Query: 1713 IRQQCCAGERSGRPGDDV----------PIYLQLIAIRRIGITPVFLVPLHDCLDADIIA 1564 + A G +DV PI LQLIAIRRIGITPVFLVPL LDADIIA Sbjct: 800 VFSSTSAPNSFGLKANDVKLSEKAKSKNPINLQLIAIRRIGITPVFLVPLSSSLDADIIA 859 Query: 1563 LSDRPWLLQTARHSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLN 1384 LSDRPWLL TARHSLSYTSISFQ STHVTPVCS +CPKGILFVAE+SLHLVEMVH KRLN Sbjct: 860 LSDRPWLLHTARHSLSYTSISFQASTHVTPVCSAECPKGILFVAENSLHLVEMVHCKRLN 919 Query: 1383 VQKFHLGGTPRKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVL 1204 VQK LGGTPRKVLYHSE+RLLLVMRTDL SSDIC VDPLSG++L+++ L Sbjct: 920 VQKLSLGGTPRKVLYHSESRLLLVMRTDLTNDTC------SSDICCVDPLSGTVLSSFKL 973 Query: 1203 EPGETGKSMQLVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS-- 1030 + GETGKSM+LV+VGNEQVLVVGT S+G IMPSGEAESTKGRL+VL + H + S Sbjct: 974 DHGETGKSMELVRVGNEQVLVVGTRLSSGPAIMPSGEAESTKGRLIVLCLEHAQNSDSGS 1033 Query: 1029 -RHCPNXXXXXXXXSPLGDIVGRATEQXXXXXXXXXXXXXXXXXXN--GCREWELELVFQ 859 SP +IVG ATEQ W+L L + Sbjct: 1034 MTFSSKAGSSSQRASPFREIVGYATEQLSSSSLCSSPDDTSCDGIKLEETEAWQLRLAYS 1093 Query: 858 TTLPGAVLAVCPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTR 679 PG VLA+CPYL+RYFLASAGN +V GF ++N RVRKFA GRTRF IT LT FTR Sbjct: 1094 VMWPGMVLAICPYLERYFLASAGNSFYVCGFPNDNSQRVRKFAVGRTRFMITSLTAHFTR 1153 Query: 678 IAVGDCRDGILFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSK 499 IAVGDCRDGILF+SY ED +KLEQLYCDP QRLVADC LMDLDTAVVSDR+G+ AVLS Sbjct: 1154 IAVGDCRDGILFFSYHEDARKLEQLYCDPSQRLVADCLLMDLDTAVVSDRKGSIAVLSCA 1213 Query: 498 NYLEDNASPECNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSI 319 ++LEDNASPECNL +SC+YY+GE MSI+KGSFSY LP DDVLKG ++ 
+D ++I Sbjct: 1214 DHLEDNASPECNLNVSCAYYMGEIAMSIKKGSFSYSLPADDVLKG---SNMKIDSARNTI 1270 Query: 318 VASTLLGSVMIFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKIL 139 +ASTLLGS++ FIP+S +E+ELLEAVQSRLVVHPLTAPILGNDHNEFR R++ GVPKIL Sbjct: 1271 IASTLLGSIITFIPLSRDEYELLEAVQSRLVVHPLTAPILGNDHNEFRSRENPPGVPKIL 1330 Query: 138 DGDMLAQFLELTSMQQEAVLALPLGLSE 55 DGDML QFLELT MQQEAVL+LPLG + Sbjct: 1331 DGDMLTQFLELTRMQQEAVLSLPLGTKD 1358 >ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-like [Cucumis sativus] Length = 1376 Score = 800 bits (2067), Expect = 0.0 Identities = 420/670 (62%), Positives = 504/670 (75%), Gaps = 6/670 (0%) Frame = -2 Query: 2055 VQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVRLVLVDR 1876 V D+ VIGTH+PSVEILSFVP GL +LASG ISL N LG A+SGC+PQDVRLVLVDR Sbjct: 693 VSCDTIIVIGTHRPSVEILSFVPSIGLTVLASGTISLMNILGNAVSGCIPQDVRLVLVDR 752 Query: 1875 LYILSGLRNGMLLRFEWPVSSSAFPSELP-TQSPSISHCLTNINAXXXXXXXXXXIRQQC 1699 Y+L+GLRNGMLLRFEWP +++ S++P T P + C + + ++ Sbjct: 753 FYVLTGLRNGMLLRFEWPHTATMNSSDMPHTVVPFLLSCSDSFS-------------KEF 799 Query: 1698 CAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQTARHSL 1519 + + D++P LQLIAIRRIGITPVFLVPL D LD+DIIALSDRPWLL +ARHSL Sbjct: 800 HNADILEKHEDEIPSCLQLIAIRRIGITPVFLVPLTDRLDSDIIALSDRPWLLHSARHSL 859 Query: 1518 SYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTPRKVLY 1339 SYTSISFQPSTHVTPVCS DCP G+LFVAESSLHLVEMVH+KRLNVQKFHLGGTPRKVLY Sbjct: 860 SYTSISFQPSTHVTPVCSADCPSGLLFVAESSLHLVEMVHTKRLNVQKFHLGGTPRKVLY 919 Query: 1338 HSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQLVKVG 1159 HSE++LLLVMRT L + SSSDIC VDPLSGS+L+++ LE GETGKSM+LV+ G Sbjct: 920 HSESKLLLVMRTQL------INDTSSSDICCVDPLSGSILSSHKLEIGETGKSMELVRNG 973 Query: 1158 NEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGD---SSRHCPNXXXXXXXXS 988 NEQVLVVGTS S+G IM SGEAESTKGRL+VL + H++ S C S Sbjct: 974 NEQVLVVGTSLSSGPAIMASGEAESTKGRLIVLCLEHVQNSDTGSMTFCSKAGLSSLQAS 1033 Query: 987 PLGDIVGRATEQXXXXXXXXXXXXXXXXXXN--GCREWELELVFQTTLPGAVLAVCPYLD 814 P +IVG ATEQ W+L +V+ T+LPG VLA+CPYLD Sbjct: 1034 
PFREIVGYATEQLSSSSLCSSPDDASSDGIKLEETEAWQLRVVYSTSLPGMVLAICPYLD 1093 Query: 813 RYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGILFYSY 634 RYFLASAGN +V GF +++ RV++FA GRTRF IT LT RIAVGDCRDGILF+SY Sbjct: 1094 RYFLASAGNAFYVCGFPNDSFQRVKRFAVGRTRFMITSLTAHVNRIAVGDCRDGILFFSY 1153 Query: 633 QEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPECNLTL 454 QED KKLEQ+Y DP QRLVADC L+D+DTAVVSDR+G+ A+LS + LEDNASPECNLTL Sbjct: 1154 QEDAKKLEQIYSDPSQRLVADCTLLDVDTAVVSDRKGSIAILSCSDRLEDNASPECNLTL 1213 Query: 453 SCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVMIFIPI 274 +C+YY+GE M++RKGSFSYKLP DD+L+GC D H++I+ASTLLGS++IF P+ Sbjct: 1214 NCAYYMGEIAMTLRKGSFSYKLPADDLLRGCAVPGSDFDSSHNTIIASTLLGSIVIFTPL 1273 Query: 273 SSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQFLELTSMQ 94 S +E+ELLEAVQ++L VHPLT+PILGNDH E+R R++ +GVPKILDGD+L QFLELTSMQ Sbjct: 1274 SRDEYELLEAVQAKLAVHPLTSPILGNDHYEYRSRENPIGVPKILDGDILTQFLELTSMQ 1333 Query: 93 QEAVLALPLG 64 QE VL+ +G Sbjct: 1334 QELVLSSSVG 1343 >ref|XP_006838801.1| hypothetical protein AMTR_s00002p00260810 [Amborella trichopoda] gi|548841307|gb|ERN01370.1| hypothetical protein AMTR_s00002p00260810 [Amborella trichopoda] Length = 1396 Score = 794 bits (2051), Expect = 0.0 Identities = 426/686 (62%), Positives = 501/686 (73%), Gaps = 12/686 (1%) Frame = -2 Query: 2073 AGLPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVR 1894 AG P G++I T VIGTHKPSVE++SFVP G R+LA G ISLTNT+G++ISGC+PQDVR Sbjct: 698 AGFPSGIEIGKTCVIGTHKPSVELVSFVPNEGFRLLAIGAISLTNTMGSSISGCIPQDVR 757 Query: 1893 LVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXX 1714 LV VDR YILSGLRNGMLLRFEWPV SS PSELP S S+ C T + Sbjct: 758 LVYVDRYYILSGLRNGMLLRFEWPVISSTNPSELPNLS-SLLPC-TGTSDSPLSKSTVPI 815 Query: 1713 IRQQCCAGERSGRPGDD-VPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQ 1537 +QC RP ++ +PI LQLIA+RRIG++PV LVPL + L ADIIALSDRPWLLQ Sbjct: 816 FYEQCIGVNMMERPAENSLPIQLQLIAVRRIGVSPVILVPLCESLHADIIALSDRPWLLQ 875 Query: 1536 TARHS--LSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLG 1363 TARHS 
++YTSISFQP+TH TPVC DCP G+LFVAE+SLHLVEMVH+KRLNVQKF LG Sbjct: 876 TARHSQRIAYTSISFQPATHATPVCLDDCPSGVLFVAENSLHLVEMVHTKRLNVQKFGLG 935 Query: 1362 GTPRKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGK 1183 GTPR+VLYHSE+R L V+RTD SSDIC VDPLSGS+L+ + +PGET K Sbjct: 936 GTPRRVLYHSESRTLQVLRTDCNYGS-----GISSDICCVDPLSGSVLSGFKFDPGETAK 990 Query: 1182 SMQLVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSSRHCPNXXXX 1003 MQL+K+ NEQVLVVGTS S+G IMP+GEAES +GRL+V + H++ S + Sbjct: 991 CMQLMKLRNEQVLVVGTSISSGPAIMPNGEAESIRGRLIVFGLDHMQHSDSSSLASDSKL 1050 Query: 1002 XXXXS---PLGDIVGRATEQXXXXXXXXXXXXXXXXXXN--GCREWELELVFQTTLPGAV 838 P +IVG ATEQ C L + + TLPG V Sbjct: 1051 GSSSQLSSPFREIVGYATEQLSCSSICSSPDDASGDGVKLEECEACNLRVKWSFTLPGVV 1110 Query: 837 LAVCPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCR 658 LA+CPYLDRY L SAGN LFVYG L+ENP R+R+F S RTRFTITC+T RIAVGDCR Sbjct: 1111 LAICPYLDRYILVSAGNNLFVYGILNENPQRLRRFTSARTRFTITCITAHLNRIAVGDCR 1170 Query: 657 DGILFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNA 478 DG+LFYSYQEDL+KLEQLYCDP QR+VADC+L+DLDT VVSDRRGN LS NY EDN Sbjct: 1171 DGLLFYSYQEDLRKLEQLYCDPVQRIVADCSLLDLDTGVVSDRRGNICFLSCANYSEDNV 1230 Query: 477 SPECNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLG 298 SPE NLT+SCSYY+GET+ SIRKGSFSY+ D +LKG D ++D S IVASTLLG Sbjct: 1231 SPERNLTISCSYYVGETISSIRKGSFSYRNSGDGILKGSRIIDPLLDCADSHIVASTLLG 1290 Query: 297 SVMIFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQ 118 SV+IFI IS EE++LL+AVQ+RL VHPLTAPILGN+H++FRGR S VGVPKILDGDMLAQ Sbjct: 1291 SVVIFIRISREEYDLLDAVQARLAVHPLTAPILGNNHDDFRGRGSPVGVPKILDGDMLAQ 1350 Query: 117 FLELTSMQQEAVLAL----PLGLSER 52 FLELTS+QQ+A+LA P+G S + Sbjct: 1351 FLELTSLQQKAILASEMPNPVGTSSK 1376 >ref|XP_007029116.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein isoform 1 [Theobroma cacao] gi|508717721|gb|EOY09618.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein isoform 1 [Theobroma cacao] Length = 1391 Score = 794 bits (2051), Expect = 0.0 Identities = 426/674 
(63%), Positives = 491/674 (72%), Gaps = 5/674 (0%) Frame = -2 Query: 2073 AGLPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVR 1894 A LP+GV + TFVIGTH+PSVEILSF P+ GLR+LA+G ISL + + TA+SGC+PQDVR Sbjct: 696 AVLPVGVGMGITFVIGTHRPSVEILSFTPQ-GLRVLATGTISLASAMETAVSGCIPQDVR 754 Query: 1893 LVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXX 1714 LVLVD+ Y+LSGLRNGMLLRFEWP + + SE + + + N++ Sbjct: 755 LVLVDQFYVLSGLRNGMLLRFEWPSAVATSSSECCSSTSPLPE---NVDRVLLNTKTANL 811 Query: 1713 IRQQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQT 1534 + CA S + DD+PI LQLIA RRIGITPVFLVPL D LDADIIALSDRPWLL T Sbjct: 812 FGSEICAVNVSEK--DDLPINLQLIATRRIGITPVFLVPLSDSLDADIIALSDRPWLLHT 869 Query: 1533 ARHSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTP 1354 ARHSLSYTSISFQPSTH TPVCS +CPKGILFV E+SLHLVEMVH RLNVQKFHLGGTP Sbjct: 870 ARHSLSYTSISFQPSTHATPVCSAECPKGILFVTENSLHLVEMVHGNRLNVQKFHLGGTP 929 Query: 1353 RKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQ 1174 RKVLYHSE++LL+VMRTDL SSDIC VDPL+ S++ ++ LE GETGK M+ Sbjct: 930 RKVLYHSESKLLIVMRTDLSNDTC------SSDICCVDPLTVSVVASFKLELGETGKCME 983 Query: 1173 LVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPNXXXX 1003 LV+ GNEQVLVVGTS S G IMPSGEAESTKGRL+VL I H++ S Sbjct: 984 LVRAGNEQVLVVGTSLSPGPAIMPSGEAESTKGRLIVLCIEHVQNSDSGSMTFSSMAGSS 1043 Query: 1002 XXXXSPLGDIVGRATEQXXXXXXXXXXXXXXXXXXN--GCREWELELVFQTTLPGAVLAV 829 SP +IVG A EQ W+L L + TT P VLA+ Sbjct: 1044 SQRNSPFCEIVGHANEQLSSSSICSSPDDTSCDGIKLEETEAWQLRLAYATTWPAMVLAI 1103 Query: 828 CPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGI 649 CPYLD YFLASAGN +V FLS NP RVR+FA RTRF I LT TRIAVGDCRDGI Sbjct: 1104 CPYLDHYFLASAGNTFYVCAFLSGNPQRVRRFALARTRFMIMSLTAHSTRIAVGDCRDGI 1163 Query: 648 LFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPE 469 LFYSY E+ KKL+Q YCDP QRLVADC L D+DTAVVSDR+G+ AVLS + LEDNASPE Sbjct: 1164 LFYSYHEETKKLDQTYCDPSQRLVADCVLTDVDTAVVSDRKGSVAVLSCSDRLEDNASPE 1223 Query: 468 CNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVM 289 NLTL+ +YY+GE 
MSIRKGSF YKLP DD+L C + VD H +I+ASTLLGS+M Sbjct: 1224 RNLTLTSAYYMGEIAMSIRKGSFIYKLPADDMLNSCEGLNASVDPSHGTIMASTLLGSIM 1283 Query: 288 IFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQFLE 109 IFIPIS EEHELLEAVQ+RL+VHPLTAP+LGNDHNE+R ++ GVPKILDGDMLAQFLE Sbjct: 1284 IFIPISREEHELLEAVQARLIVHPLTAPVLGNDHNEYRSCENPAGVPKILDGDMLAQFLE 1343 Query: 108 LTSMQQEAVLALPL 67 LTSMQQEAVL+ + Sbjct: 1344 LTSMQQEAVLSFSI 1357 >ref|XP_006351358.1| PREDICTED: pre-mRNA-splicing factor prp12-like isoform X1 [Solanum tuberosum] Length = 1393 Score = 792 bits (2045), Expect = 0.0 Identities = 420/678 (61%), Positives = 489/678 (72%), Gaps = 6/678 (0%) Frame = -2 Query: 2079 PGAGLPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQD 1900 P LP+G+ I + FVIGTHKPSVE+LSF + G +LA G I+LTNTLGT +SGC+PQD Sbjct: 691 PLGSLPVGLDISNIFVIGTHKPSVEVLSFTSDKGPSVLAVGSITLTNTLGTTVSGCIPQD 750 Query: 1899 VRLVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXX 1720 VRLVLVDRLY+LSGLRNGMLLRFEWP S+ P + C+ +N Sbjct: 751 VRLVLVDRLYVLSGLRNGMLLRFEWPSISAVSSLVSPGLQTFDNSCM--VNCTSSSIFAS 808 Query: 1719 XXIRQQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLL 1540 R Q D P+YLQL+A+RRIGITPVFL+PL+D LDAD+IALSDRPWLL Sbjct: 809 QNFRTQPTQVTSLLDKTKDFPVYLQLVAVRRIGITPVFLIPLNDSLDADVIALSDRPWLL 868 Query: 1539 QTARHSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGG 1360 QTARHSLSYTSISF PSTHVTPVCS +CPKGI+FVAE+SLHLVEMV SKRLNVQKFH GG Sbjct: 869 QTARHSLSYTSISFPPSTHVTPVCSTECPKGIIFVAENSLHLVEMVPSKRLNVQKFHFGG 928 Query: 1359 TPRKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKS 1180 TPRKVLYHS++RLLLV+RTDL SSD+C +DPLSGS+L+++ EPGE GK Sbjct: 929 TPRKVLYHSDSRLLLVLRTDLSDD------LCSSDVCCIDPLSGSVLSSFKFEPGEIGKC 982 Query: 1179 MQLVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPNXX 1009 M LVK GNEQVLVVGT S+G IMPSGEAESTKGRL+VL + ++ S Sbjct: 983 MDLVKAGNEQVLVVGTGLSSGPAIMPSGEAESTKGRLIVLCLEQMQNSDSGSIAFSSRAG 1042 Query: 1008 XXXXXXSPLGDIVGRATEQ--XXXXXXXXXXXXXXXXXXNGCREWELELVFQTTLPGAVL 835 SP +I G A EQ W L L + TT PG VL Sbjct: 1043 
SSSQRTSPFREIGGYAAEQLSSSSLCSSPDDNSCDGIKLEESEAWHLRLGYSTTWPGMVL 1102 Query: 834 AVCPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRD 655 AVCPYLDR+FLASA N +V GF ++N RVR+ A GRTRF I LT FTRIAVGDCRD Sbjct: 1103 AVCPYLDRFFLASAANCFYVCGFPNDNAQRVRRLAVGRTRFMIMTLTAHFTRIAVGDCRD 1162 Query: 654 GILFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDN-A 478 GILFYSYQED +KL+Q+YCDP QRLV+DC LMD DTA VSDR+G+ A+LS N+LEDN Sbjct: 1163 GILFYSYQEDARKLDQVYCDPVQRLVSDCTLMDGDTAAVSDRKGSLAILSCLNHLEDNFN 1222 Query: 477 SPECNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLG 298 SPE NL L+CS+Y+GE + IRKGSFSYKLP DD L+GC A V D+ +SI+ASTLLG Sbjct: 1223 SPERNLALTCSFYMGEIAIRIRKGSFSYKLPADDALRGCQVASNVGDISQNSIMASTLLG 1282 Query: 297 SVMIFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQ 118 S++IFIP++ EE++LLEAVQ+RLV+HPLTAPILGNDH E+R R S PK LDGDMLAQ Sbjct: 1283 SIIIFIPLTREEYDLLEAVQARLVIHPLTAPILGNDHTEYRCRGSTARAPKALDGDMLAQ 1342 Query: 117 FLELTSMQQEAVLALPLG 64 FLELTSMQQEAVLALPLG Sbjct: 1343 FLELTSMQQEAVLALPLG 1360 >ref|XP_004249760.1| PREDICTED: pre-mRNA-splicing factor prp12-like [Solanum lycopersicum] Length = 1394 Score = 790 bits (2039), Expect = 0.0 Identities = 421/684 (61%), Positives = 493/684 (72%), Gaps = 10/684 (1%) Frame = -2 Query: 2085 NRPGA---GLPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISG 1915 NR G LP+G+ I +TFVIGTHKPSVE+LSF + GL +LA G I+LTNTLGT +SG Sbjct: 686 NRSGVRLDSLPVGLDISNTFVIGTHKPSVEVLSFTSDKGLSVLAVGSITLTNTLGTTVSG 745 Query: 1914 CVPQDVRLVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXX 1735 C+PQD+RLVLVDRLY+LSGLRNGMLLRFEWP S+ + P + C+ N Sbjct: 746 CIPQDIRLVLVDRLYVLSGLRNGMLLRFEWPSISAIYSLVSPGLQTFDNSCMAN--CISS 803 Query: 1734 XXXXXXXIRQQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSD 1555 R Q D P+YLQL+A+RRIGITPVFL+PL+D LDAD+IALSD Sbjct: 804 STSASQNFRSQPTQVTSLLDKTKDFPVYLQLVAVRRIGITPVFLIPLNDSLDADVIALSD 863 Query: 1554 RPWLLQTARHSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQK 1375 RPWLLQTARHSLSYTSISF PSTHVTPVCS +CPKGI+FVAE+SLHLVEMV SKRLNVQK Sbjct: 864 
RPWLLQTARHSLSYTSISFPPSTHVTPVCSTECPKGIIFVAENSLHLVEMVPSKRLNVQK 923 Query: 1374 FHLGGTPRKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPG 1195 FH GGTPRKVLYHS++RLLLV+RTDL SSD+C +DPLSGS+L+++ E G Sbjct: 924 FHFGGTPRKVLYHSDSRLLLVLRTDLSDD------LCSSDVCCIDPLSGSVLSSFKFELG 977 Query: 1194 ETGKSMQLVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RH 1024 E GK M+LVK GNEQVLVVGT S+G IMPSGEAESTKGRL+VL + ++ S Sbjct: 978 EIGKCMELVKAGNEQVLVVGTGLSSGPAIMPSGEAESTKGRLIVLCVEQMQNSDSGSIAF 1037 Query: 1023 CPNXXXXXXXXSPLGDIVGRATEQ--XXXXXXXXXXXXXXXXXXNGCREWELELVFQTTL 850 SP ++ G A EQ W L L + TT Sbjct: 1038 SSRAGSSSQRTSPFREVGGYAAEQLSSSSICSSPDDNSCDGIKLEESEAWHLRLGYSTTW 1097 Query: 849 PGAVLAVCPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAV 670 PG VLAVCPYLDR+FLASA N +V GF ++N RVR+ A GRTRF I LT FTRIAV Sbjct: 1098 PGMVLAVCPYLDRFFLASAANCFYVCGFPNDNAQRVRRLAVGRTRFMIMTLTAHFTRIAV 1157 Query: 669 GDCRDGILFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYL 490 GDCRDGILFYSYQED +KL+Q+YCDP QRLV+DC LMD DTA VSDR+G+FA+LS NY+ Sbjct: 1158 GDCRDGILFYSYQEDSRKLDQIYCDPVQRLVSDCTLMDGDTAAVSDRKGSFAILSCLNYM 1217 Query: 489 E-DN-ASPECNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIV 316 E DN SPE NL +CS+Y+GE + IRKGSFSYKLP DD L+GC V D+ +SI+ Sbjct: 1218 EADNFNSPERNLAQTCSFYMGEIAIRIRKGSFSYKLPADDALRGCQATSIVGDISQNSIM 1277 Query: 315 ASTLLGSVMIFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILD 136 ASTLLGS++IFIP++ EE++LLEAVQ+RLV+HPLTAPILGNDH E+R R S VPK LD Sbjct: 1278 ASTLLGSIIIFIPLTREEYDLLEAVQARLVIHPLTAPILGNDHTEYRCRGSMARVPKALD 1337 Query: 135 GDMLAQFLELTSMQQEAVLALPLG 64 GDMLAQFLELTSMQQEAVLALPLG Sbjct: 1338 GDMLAQFLELTSMQQEAVLALPLG 1361 >ref|XP_004303372.1| PREDICTED: pre-mRNA-splicing factor rse-1-like [Fragaria vesca subsp. 
vesca] Length = 1396 Score = 788 bits (2036), Expect = 0.0 Identities = 424/674 (62%), Positives = 488/674 (72%), Gaps = 8/674 (1%) Frame = -2 Query: 2064 PIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVRLVL 1885 P GV I + FVIGTHKPSVEILS P GLR+LASG ISLTNTLGTAISGC+PQDVRLVL Sbjct: 700 PFGVDISNIFVIGTHKPSVEILSLAPSEGLRVLASGAISLTNTLGTAISGCIPQDVRLVL 759 Query: 1884 VDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXXIRQ 1705 VDRLY+LSGLRNGMLLRFEWP ++S PS + QSP + + + + Sbjct: 760 VDRLYVLSGLRNGMLLRFEWP-TASRMPSSVVPQSP-VDWLSVSTDTVLSSVSAANSYGR 817 Query: 1704 QCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQTARH 1525 Q + S D P+ LQLIAIRRIGITPVFLVPL D LD DII LSDRPWLL TARH Sbjct: 818 QVYTTKLSENIKDKFPVDLQLIAIRRIGITPVFLVPLSDSLDGDIIVLSDRPWLLHTARH 877 Query: 1524 SLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTPRKV 1345 SLSYTSISFQ STHVTPVC V+CPKGILFVAE+ LHLVEMVHSKRLNVQK LGGTPR+V Sbjct: 878 SLSYTSISFQSSTHVTPVCYVECPKGILFVAENCLHLVEMVHSKRLNVQKLQLGGTPRRV 937 Query: 1344 LYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQLVK 1165 YHSE+RLL+VMRT+L SDIC VDPLSGS+L+++ LE GETGKSM+L++ Sbjct: 938 FYHSESRLLIVMRTNLSDDT------CLSDICCVDPLSGSVLSSFKLEFGETGKSMELMR 991 Query: 1164 VGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPNXXXXXXX 994 VG+EQVL+VGTS S+G IMP GEAESTKGRL+VL + +++ S Sbjct: 992 VGSEQVLLVGTSLSSGSAIMPCGEAESTKGRLIVLCLENMQNSDSGSMTFSSKAGSSSLR 1051 Query: 993 XSPLGDIVGRATEQ--XXXXXXXXXXXXXXXXXXNGCREWELELVFQTTLPGAVLAVCPY 820 SP +IVG A EQ W+ L F PG VLA+CPY Sbjct: 1052 ASPFHEIVGYAAEQLSSSSLCSSPDDTSCDGIKLEETETWQFRLAFSMPWPGMVLAICPY 1111 Query: 819 LDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGILFY 640 LDRYFLASAGN ++ GF EN RV+K+A RTRFTIT LT FTRI VGDCRDGILFY Sbjct: 1112 LDRYFLASAGNAFYLCGFPHENSQRVKKWAVARTRFTITSLTAHFTRIVVGDCRDGILFY 1171 Query: 639 SYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLED---NASPE 469 Y ED KKL+QLYCDP QRLV DC LMD++TAVVSDR+G+ AVLS +YLE ASPE Sbjct: 1172 DYNEDSKKLQQLYCDPYQRLVGDCILMDVNTAVVSDRKGSIAVLSCADYLEGKHYTASPE 1231 Query: 468 
CNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVM 289 CNLT+SC+YY+GE MSI+KGSFSYKLP DD +KG D +D + I+ STLLGS++ Sbjct: 1232 CNLTVSCAYYMGEIAMSIKKGSFSYKLPADDAMKG---GDGSIDFAQNGIIVSTLLGSII 1288 Query: 288 IFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQFLE 109 F+PIS EE+ELLEAVQ RL VHPLTAPILGNDHNEFR R++ VGVPKILD DML QFLE Sbjct: 1289 TFVPISREEYELLEAVQDRLAVHPLTAPILGNDHNEFRSRENPVGVPKILDADMLTQFLE 1348 Query: 108 LTSMQQEAVLALPL 67 LTS+QQEAVL+ P+ Sbjct: 1349 LTSVQQEAVLSSPI 1362 >ref|XP_006407388.1| hypothetical protein EUTSA_v10019900mg [Eutrema salsugineum] gi|557108534|gb|ESQ48841.1| hypothetical protein EUTSA_v10019900mg [Eutrema salsugineum] Length = 1367 Score = 762 bits (1968), Expect = 0.0 Identities = 406/675 (60%), Positives = 490/675 (72%), Gaps = 7/675 (1%) Frame = -2 Query: 2073 AGLPIGVQIDSTFVIGTHKPSVEILSFVPEN-GLRILASGIISLTNTLGTAISGCVPQDV 1897 A +P G++ TF+IGTHKPSVE+LSF + G+R+LASG++SLTNT+GTAISGC+PQDV Sbjct: 688 AAIPSGMERGYTFLIGTHKPSVEVLSFSEDGAGVRVLASGLVSLTNTMGTAISGCIPQDV 747 Query: 1896 RLVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXX 1717 RLVLVD+LY+LSGLRNGMLLRFEWP S + P +SHC ++ Sbjct: 748 RLVLVDQLYVLSGLRNGMLLRFEWPPFSHSSGLNCPDY---LSHCKEEMDI--------- 795 Query: 1716 XIRQQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQ 1537 GER D++PI L LIA RRIGITPVFLVP D LD+DIIALSDRPWLLQ Sbjct: 796 ------AVGER-----DNLPIDLLLIATRRIGITPVFLVPFSDSLDSDIIALSDRPWLLQ 844 Query: 1536 TARHSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGT 1357 TAR SLSYTSISFQPSTH TPVCS +CP+GILFVAE+ LHLVEMVHSKRLN QKFHLGGT Sbjct: 845 TARQSLSYTSISFQPSTHATPVCSSECPQGILFVAENCLHLVEMVHSKRLNAQKFHLGGT 904 Query: 1356 PRKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSM 1177 PRKVLYHSE++LL+VMRTDL +SDIC VDPLSGSLL++Y L+PGETGKSM Sbjct: 905 PRKVLYHSESKLLIVMRTDLYDA-------CTSDICCVDPLSGSLLSSYKLKPGETGKSM 957 Query: 1176 QLVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSSRH---CPNXXX 1006 +L++VGNEQVLVVGTS S+G I+PSGEAESTKGRL++L + HI+ S C Sbjct: 958 
ELLRVGNEQVLVVGTSLSSGPAILPSGEAESTKGRLIILYLEHIQNSDSGSITICSKAGS 1017 Query: 1005 XXXXXSPLGDIVGRATEQ--XXXXXXXXXXXXXXXXXXNGCREWELELVFQTTLPGAVLA 832 SP D+ G TEQ + W+L L TT PG VLA Sbjct: 1018 SSQRTSPFRDVAGFTTEQLSSSSLCSSPDDNSYDGIKLDEAETWQLRLASATTWPGMVLA 1077 Query: 831 VCPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDG 652 +CPYLD YFLASAGN +V GF +++P R+++FA GRTRF IT L T FTRI VGDCRDG Sbjct: 1078 ICPYLDNYFLASAGNAFYVCGFPNDSPERMKRFAVGRTRFMITSLRTYFTRIVVGDCRDG 1137 Query: 651 ILFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLE-DNAS 475 +LFYSY ED+KKL Q+YCDP QRLVADC LMD ++ VSDR+G+ A+LS K++ + + +S Sbjct: 1138 VLFYSYHEDVKKLHQIYCDPAQRLVADCFLMDANSVAVSDRKGSVAILSCKDHSDFEYSS 1197 Query: 474 PECNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGS 295 PE NL L+C+YY+GE M+I+KG YKLP DDVL+ Y + +D +I+A TL+GS Sbjct: 1198 PESNLNLNCAYYMGEIAMAIKKGCNIYKLPADDVLRS-YGPCKSIDAADDTIIAGTLMGS 1256 Query: 294 VMIFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQF 115 + +F PIS EE+ELLEAVQ +LVVHPLTAP+LGNDH EFRGR++ KILDGDMLAQF Sbjct: 1257 IYVFAPISREEYELLEAVQEKLVVHPLTAPVLGNDHEEFRGRENPSQATKILDGDMLAQF 1316 Query: 114 LELTSMQQEAVLALP 70 LELT+ QQE+VLA P Sbjct: 1317 LELTNRQQESVLATP 1331 >ref|XP_002531586.1| spliceosomal protein sap, putative [Ricinus communis] gi|223528782|gb|EEF30789.1| spliceosomal protein sap, putative [Ricinus communis] Length = 1220 Score = 761 bits (1965), Expect = 0.0 Identities = 407/625 (65%), Positives = 468/625 (74%), Gaps = 7/625 (1%) Frame = -2 Query: 2010 VEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVRLVLVDRLYILSGLRNGMLLRF 1831 VE+L FVP+ GLR+LA G ISLTNTLGTAISGCVPQDVRLVLVDRLY+LSGLRNGMLLRF Sbjct: 596 VEVLCFVPDEGLRVLARGTISLTNTLGTAISGCVPQDVRLVLVDRLYVLSGLRNGMLLRF 655 Query: 1830 EWPVSSSAFPS--ELPTQSPSISHCLTNINAXXXXXXXXXXIRQQCCAGERSGRPGDDVP 1657 EWP SSS+ S E+P I C+TN Q C+ + +G D P Sbjct: 656 EWPSSSSSSISSMEIPYYGYPIDSCMTNA-CSGLSTTTAVFPESQTCSVDLTGGAMDGPP 714 Query: 1656 IYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQTARHSLSYTSISFQPSTHVT 1477 I LQLIA RRIG+TPVFLVPL D LDAD+IALSDRPWLLQTARH 
LSYTSISFQPSTH T Sbjct: 715 INLQLIATRRIGVTPVFLVPLTDSLDADMIALSDRPWLLQTARHGLSYTSISFQPSTHST 774 Query: 1476 PVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTPRKVLYHSETRLLLVMRTDL 1297 PVCSV+CPKG+LFVAE+SLHLVEMVHSKRLNVQKFHLGGTPRKVLYHSE+RLLLVMRT+L Sbjct: 775 PVCSVECPKGLLFVAENSLHLVEMVHSKRLNVQKFHLGGTPRKVLYHSESRLLLVMRTEL 834 Query: 1296 EIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQLVKVGNEQVLVVGTSQSAG 1117 SSDIC VDPLSGS+++++ LE GETGKSM+LV+VG EQVLVVGTS S+G Sbjct: 835 SNDTC------SSDICCVDPLSGSVVSSFKLEHGETGKSMELVRVGTEQVLVVGTSLSSG 888 Query: 1116 RPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPNXXXXXXXXSPLGDIVGRATEQXX 946 IMPSGEAESTKGRL+VL + H++ S C SP ++VG EQ Sbjct: 889 PAIMPSGEAESTKGRLIVLCLEHLQSSDSGSMTFCSKAGSSSQRTSPFCEVVGYTAEQLS 948 Query: 945 XXXXXXXXXXXXXXXXNGCRE-WELELVFQTTLPGAVLAVCPYLDRYFLASAGNILFVYG 769 E W+L L + T PG L +CPYLDRYFLASAG+ +V G Sbjct: 949 SSSLCSSPDDSCDGVKLEESEAWQLRLAYATKWPGMALTICPYLDRYFLASAGSAFYVCG 1008 Query: 768 FLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGILFYSYQEDLKKLEQLYCDPD 589 F ++NP RVRKFA RTRFTI LT FTRIAVGDCRDGILFYSY ED +KLEQ+YCDP Sbjct: 1009 FPNDNPQRVRKFAIARTRFTIISLTAHFTRIAVGDCRDGILFYSYHEDTRKLEQVYCDPS 1068 Query: 588 QRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPECNLTLSCSYYIGETVMSIRK 409 QRLVADC L+D+DTAVVSDR+G+ AVLS E NASPECNLTL+C+YY+GE MSIRK Sbjct: 1069 QRLVADCILLDVDTAVVSDRKGSIAVLSCSGDSERNASPECNLTLTCAYYMGEIAMSIRK 1128 Query: 408 GSFSYKLPVDDVLKGCYDADRVVDLL-HSSIVASTLLGSVMIFIPISSEEHELLEAVQSR 232 GSFSY+LP DD+L G YDA + H++I+ASTLLGS++IFIP++ EEHELLEAVQ+R Sbjct: 1129 GSFSYRLPADDMLMG-YDAVTPNNYASHNTIMASTLLGSIIIFIPLTREEHELLEAVQAR 1187 Query: 231 LVVHPLTAPILGNDHNEFRGRQSRV 157 LVVHPLTAPILGNDH+EFR R++ V Sbjct: 1188 LVVHPLTAPILGNDHSEFRSRENPV 1212 >ref|XP_007163031.1| hypothetical protein PHAVU_001G200200g [Phaseolus vulgaris] gi|561036495|gb|ESW35025.1| hypothetical protein PHAVU_001G200200g [Phaseolus vulgaris] Length = 1362 Score = 760 bits (1962), Expect = 0.0 Identities = 406/668 (60%), Positives = 487/668 (72%), Gaps = 7/668 (1%) Frame = -2 Query: 2058 
GVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVRLVLVD 1879 GV I+ TFVIGTH+PSVEI F P G+ ++A G ISLTNT+GTAISGCVPQDVRLV VD Sbjct: 687 GVDINKTFVIGTHRPSVEIWFFSPGGGITVVACGTISLTNTIGTAISGCVPQDVRLVFVD 746 Query: 1878 RLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSP--SISHCLTNINAXXXXXXXXXXIRQ 1705 + Y+++GLRNGMLLRFEWPV PS SP + L++IN Sbjct: 747 KYYVVAGLRNGMLLRFEWPVEPC--PS-----SPINMVDTALSSINLVN----------- 788 Query: 1704 QCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQTARH 1525 + + +D+P+ LQLIAIRRIGITPVFLVPL D LDADIIALSDRPWLL +ARH Sbjct: 789 ---SASNAFDMRNDLPLTLQLIAIRRIGITPVFLVPLGDTLDADIIALSDRPWLLHSARH 845 Query: 1524 SLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTPRKV 1345 SLSYTSISFQPSTHVTPVCSV+CPKGILFVAE+ LHLVEMVHSKRLN+QKFHL GTPRKV Sbjct: 846 SLSYTSISFQPSTHVTPVCSVECPKGILFVAENCLHLVEMVHSKRLNMQKFHLEGTPRKV 905 Query: 1344 LYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQLVK 1165 LYH E+++LLVMRT+L SDIC VDPLSGS+L+++ LE GETGKSM+LV+ Sbjct: 906 LYHDESKMLLVMRTELNCGT------CLSDICCVDPLSGSVLSSFRLELGETGKSMELVR 959 Query: 1164 VGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPNXXXXXXX 994 VG+EQVL+VGTS S+G +MPSGEAES KGRLLVL + H++ S C Sbjct: 960 VGSEQVLIVGTSLSSGPAVMPSGEAESCKGRLLVLCLVHVQNSDSGSMTFCSKAGSSSQK 1019 Query: 993 XSPLGDIVGRATEQ--XXXXXXXXXXXXXXXXXXNGCREWELELVFQTTLPGAVLAVCPY 820 SP +IV A EQ + W+ L + G V +CPY Sbjct: 1020 TSPFHEIVSYAPEQLSSSSLGSSPDDNSSDGIKLDENEVWQFRLAYARKWQGVVFKICPY 1079 Query: 819 LDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGILFY 640 LDRYFLASAGN +V GFL++NP RVR++A GRT IT L+ FTRIAVGDCRDGI+ + Sbjct: 1080 LDRYFLASAGNTFYVCGFLNDNPQRVRRYAMGRTHHMITSLSAHFTRIAVGDCRDGIILF 1139 Query: 639 SYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPECNL 460 SY E+ +KLEQL CDP +RLVADC LMD DTAVVSDR+G A+L S N+LEDNAS ECN+ Sbjct: 1140 SYHEESRKLEQLCCDPSRRLVADCILMDADTAVVSDRKGGIAILCS-NHLEDNASTECNM 1198 Query: 459 TLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVMIFI 280 TLSC+Y++ E +S++KGS+SY+LP DDVL+G VD L ++I+ASTLLGS+MIFI Sbjct: 1199 
TLSCAYFMAEIALSVQKGSYSYRLPADDVLQGGNGPKTNVDSLQNTIIASTLLGSIMIFI 1258 Query: 279 PISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQFLELTS 100 P+S EE+ELLEAVQ RLVVH LTAP+LGNDHNEFR R++R GVPKILDGD+L QFLELTS Sbjct: 1259 PLSREEYELLEAVQERLVVHQLTAPVLGNDHNEFRSRETRGGVPKILDGDVLTQFLELTS 1318 Query: 99 MQQEAVLA 76 MQQ+ +L+ Sbjct: 1319 MQQKMILS 1326 >ref|XP_006577113.1| PREDICTED: splicing factor 3B subunit 3-like isoform X2 [Glycine max] Length = 1373 Score = 756 bits (1952), Expect = 0.0 Identities = 407/669 (60%), Positives = 484/669 (72%), Gaps = 5/669 (0%) Frame = -2 Query: 2058 GVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVRLVLVD 1879 GV I+ TFVIGTH+PSVEI F P G+ ++A G ISLTNT+GTAISGCVPQDVRLV V Sbjct: 698 GVDINKTFVIGTHRPSVEIWYFAPGGGITVVACGTISLTNTVGTAISGCVPQDVRLVFVG 757 Query: 1878 RLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXXIRQQC 1699 + Y+L+GLRNGMLLRFEWP P S I+ T +++ ++ Sbjct: 758 KYYVLAGLRNGMLLRFEWPAE--------PCPSSPINIVDTALSSINLVNSVTNAFDKR- 808 Query: 1698 CAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQTARHSL 1519 +D P LQLIAIRRIGITPVFLVPL D LDADII LSDRPWLL +ARHSL Sbjct: 809 ----------NDFPSMLQLIAIRRIGITPVFLVPLGDTLDADIITLSDRPWLLHSARHSL 858 Query: 1518 SYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTPRKVLY 1339 SY+SISFQPSTHVTPVCSV+CPKGILFVAE+SLHLVEMVHSKRLN+QKFHL GTPRKVLY Sbjct: 859 SYSSISFQPSTHVTPVCSVECPKGILFVAENSLHLVEMVHSKRLNMQKFHLEGTPRKVLY 918 Query: 1338 HSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQLVKVG 1159 H E+++LLVMRT+L SDIC++DPLSGS+L+++ LE GETGKSM+LV+VG Sbjct: 919 HDESKMLLVMRTELNCGT------CLSDICIMDPLSGSVLSSFRLELGETGKSMELVRVG 972 Query: 1158 NEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPNXXXXXXXXS 988 +EQVLVVGTS S+G M +GEAES KGRLLVL + H++ S C S Sbjct: 973 SEQVLVVGTSLSSGPHTMATGEAESCKGRLLVLCLDHVQNSDSGSVTFCSKAGSSSQKTS 1032 Query: 987 PLGDIVGRATEQ--XXXXXXXXXXXXXXXXXXNGCREWELELVFQTTLPGAVLAVCPYLD 814 P +IV A EQ + W+ L F T PG VL +CPYLD Sbjct: 1033 PFREIVTYAPEQLSSSSLGSSPDDNSSDGIKLDENEVWQFRLTFATKWPGVVLKICPYLD 1092 Query: 813 
RYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGILFYSY 634 RYFLA+AGN +V GF ++NP RVR++A GR RF IT LT FTRIAVGDCRDGIL YSY Sbjct: 1093 RYFLATAGNAFYVCGFPNDNPQRVRRYAMGRARFMITSLTAHFTRIAVGDCRDGILLYSY 1152 Query: 633 QEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPECNLTL 454 E+ KKLE LY DP RLVADC LMD DTAVVSDR+G+ AVL S ++LEDNA +CN+ L Sbjct: 1153 HEEAKKLELLYNDPSLRLVADCILMDADTAVVSDRKGSIAVLCS-DHLEDNAGAQCNMAL 1211 Query: 453 SCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVMIFIPI 274 SC+Y++ E MSI+KGS+SY+LP DDVL+G VD L ++I+A+TLLGS+MIFIP+ Sbjct: 1212 SCAYFMAEIAMSIKKGSYSYRLPADDVLQGGNGPKTNVDSLQNTIIATTLLGSIMIFIPL 1271 Query: 273 SSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQFLELTSMQ 94 S EE+ELLEAVQ+RLVVH LTAP+LGNDHNEFR R++RVGVPKILDGDML QFLELTSMQ Sbjct: 1272 SREEYELLEAVQARLVVHHLTAPVLGNDHNEFRSRENRVGVPKILDGDMLTQFLELTSMQ 1331 Query: 93 QEAVLALPL 67 Q+ +L+L L Sbjct: 1332 QKMILSLEL 1340 >ref|XP_006577112.1| PREDICTED: splicing factor 3B subunit 3-like isoform X1 [Glycine max] Length = 1387 Score = 756 bits (1952), Expect = 0.0 Identities = 407/669 (60%), Positives = 484/669 (72%), Gaps = 5/669 (0%) Frame = -2 Query: 2058 GVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVRLVLVD 1879 GV I+ TFVIGTH+PSVEI F P G+ ++A G ISLTNT+GTAISGCVPQDVRLV V Sbjct: 698 GVDINKTFVIGTHRPSVEIWYFAPGGGITVVACGTISLTNTVGTAISGCVPQDVRLVFVG 757 Query: 1878 RLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXXIRQQC 1699 + Y+L+GLRNGMLLRFEWP P S I+ T +++ ++ Sbjct: 758 KYYVLAGLRNGMLLRFEWPAE--------PCPSSPINIVDTALSSINLVNSVTNAFDKR- 808 Query: 1698 CAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQTARHSL 1519 +D P LQLIAIRRIGITPVFLVPL D LDADII LSDRPWLL +ARHSL Sbjct: 809 ----------NDFPSMLQLIAIRRIGITPVFLVPLGDTLDADIITLSDRPWLLHSARHSL 858 Query: 1518 SYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTPRKVLY 1339 SY+SISFQPSTHVTPVCSV+CPKGILFVAE+SLHLVEMVHSKRLN+QKFHL GTPRKVLY Sbjct: 859 SYSSISFQPSTHVTPVCSVECPKGILFVAENSLHLVEMVHSKRLNMQKFHLEGTPRKVLY 918 Query: 1338 
HSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQLVKVG 1159 H E+++LLVMRT+L SDIC++DPLSGS+L+++ LE GETGKSM+LV+VG Sbjct: 919 HDESKMLLVMRTELNCGT------CLSDICIMDPLSGSVLSSFRLELGETGKSMELVRVG 972 Query: 1158 NEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPNXXXXXXXXS 988 +EQVLVVGTS S+G M +GEAES KGRLLVL + H++ S C S Sbjct: 973 SEQVLVVGTSLSSGPHTMATGEAESCKGRLLVLCLDHVQNSDSGSVTFCSKAGSSSQKTS 1032 Query: 987 PLGDIVGRATEQ--XXXXXXXXXXXXXXXXXXNGCREWELELVFQTTLPGAVLAVCPYLD 814 P +IV A EQ + W+ L F T PG VL +CPYLD Sbjct: 1033 PFREIVTYAPEQLSSSSLGSSPDDNSSDGIKLDENEVWQFRLTFATKWPGVVLKICPYLD 1092 Query: 813 RYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGILFYSY 634 RYFLA+AGN +V GF ++NP RVR++A GR RF IT LT FTRIAVGDCRDGIL YSY Sbjct: 1093 RYFLATAGNAFYVCGFPNDNPQRVRRYAMGRARFMITSLTAHFTRIAVGDCRDGILLYSY 1152 Query: 633 QEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPECNLTL 454 E+ KKLE LY DP RLVADC LMD DTAVVSDR+G+ AVL S ++LEDNA +CN+ L Sbjct: 1153 HEEAKKLELLYNDPSLRLVADCILMDADTAVVSDRKGSIAVLCS-DHLEDNAGAQCNMAL 1211 Query: 453 SCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVMIFIPI 274 SC+Y++ E MSI+KGS+SY+LP DDVL+G VD L ++I+A+TLLGS+MIFIP+ Sbjct: 1212 SCAYFMAEIAMSIKKGSYSYRLPADDVLQGGNGPKTNVDSLQNTIIATTLLGSIMIFIPL 1271 Query: 273 SSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQFLELTSMQ 94 S EE+ELLEAVQ+RLVVH LTAP+LGNDHNEFR R++RVGVPKILDGDML QFLELTSMQ Sbjct: 1272 SREEYELLEAVQARLVVHHLTAPVLGNDHNEFRSRENRVGVPKILDGDMLTQFLELTSMQ 1331 Query: 93 QEAVLALPL 67 Q+ +L+L L Sbjct: 1332 QKMILSLEL 1340 >ref|XP_004494300.1| PREDICTED: uncharacterized protein LOC101490576 isoform X1 [Cicer arietinum] gi|502112345|ref|XP_004494301.1| PREDICTED: uncharacterized protein LOC101490576 isoform X2 [Cicer arietinum] Length = 1362 Score = 748 bits (1931), Expect = 0.0 Identities = 402/666 (60%), Positives = 484/666 (72%), Gaps = 5/666 (0%) Frame = -2 Query: 2058 GVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVRLVLVD 1879 GV I+ TFVIGTH+PSVEI SF PE G+ ++A G ISLT+T+GTA S C+PQDVRLV VD Sbjct: 691 
GVDINKTFVIGTHRPSVEIWSFAPEGGVTVVACGTISLTSTMGTAKSFCIPQDVRLVFVD 750 Query: 1878 RLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXXIRQQC 1699 + Y+L+GLRNGMLLRFEWP PT + L++IN Sbjct: 751 KYYVLAGLRNGMLLRFEWPTE--------PTCINVVDTALSSINLVNSLT---------- 792 Query: 1698 CAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQTARHSL 1519 +S +D+P LQLIAIRRIGITPVFLVPL D LDADIIALSDRPWLL +ARHSL Sbjct: 793 ----KSFDMRNDLPSMLQLIAIRRIGITPVFLVPLDDTLDADIIALSDRPWLLHSARHSL 848 Query: 1518 SYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTPRKVLY 1339 SYTSISFQPS+H TPVCS+DCPKGILFVAE+SLHLVEMVHSKRLN++KFHL GTPRKVLY Sbjct: 849 SYTSISFQPSSHATPVCSIDCPKGILFVAENSLHLVEMVHSKRLNMRKFHLEGTPRKVLY 908 Query: 1338 HSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQLVKVG 1159 H+E+R LLVMRT+L SDIC VDPLSGS+L+++ LE GETG SM+L++ G Sbjct: 909 HNESRTLLVMRTELNYGT------CLSDICCVDPLSGSVLSSFRLELGETGTSMELIRFG 962 Query: 1158 NEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSSR---HCPNXXXXXXXXS 988 +E+VLVVGTS S+G P+MPSGEAES KGRLLV+ + H++ S +C S Sbjct: 963 SERVLVVGTSLSSGPPVMPSGEAESAKGRLLVICLEHVQNSDSGSMIYCSKAGSTSQKTS 1022 Query: 987 PLGDIVGRATEQ--XXXXXXXXXXXXXXXXXXNGCREWELELVFQTTLPGAVLAVCPYLD 814 P +IVG A EQ + W+ L + TT PG V A+CPYLD Sbjct: 1023 PFNEIVGYAPEQQSSSSLGSSPDDNSSDGIKLDDNEMWQFRLAYATTWPGIVHAICPYLD 1082 Query: 813 RYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGILFYSY 634 RYFLASAGN +V GF ++ P RVR++A GRTRF I+ LT F+RIAVGD RDGI+F+SY Sbjct: 1083 RYFLASAGNAFYVCGFPNDTPHRVRRYAVGRTRFMISSLTAYFSRIAVGDLRDGIIFFSY 1142 Query: 633 QEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPECNLTL 454 E+ +KLEQLY DP RLVADC LMD TA+VSDR+G+ AVL S ++LED AS E NL L Sbjct: 1143 HEEARKLEQLYGDPSCRLVADCILMDDHTAIVSDRKGSIAVLCS-DHLEDCASAERNLKL 1201 Query: 453 SCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVMIFIPI 274 SC+Y++ E +SIRKGS+SY+LP DDVL G VD L ++I+ASTLLGS+MIFIP+ Sbjct: 1202 SCAYFMAEIAVSIRKGSYSYRLPADDVLSGGIGPKTNVDSLQNTIIASTLLGSIMIFIPL 1261 Query: 273 SSEEHELLEAVQSRLVVHPLTAPILGNDHNEFRGRQSRVGVPKILDGDMLAQFLELTSMQ 94 S EE+ELLEAVQ+RLVVH 
LTAPILGNDHNEFR R++ VG+PKILDGDML QFLELT+MQ Sbjct: 1262 SREEYELLEAVQARLVVHHLTAPILGNDHNEFRSRENPVGIPKILDGDMLTQFLELTNMQ 1321 Query: 93 QEAVLA 76 Q A+L+ Sbjct: 1322 QNAILS 1327 >ref|XP_007029117.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein isoform 2, partial [Theobroma cacao] gi|508717722|gb|EOY09619.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein isoform 2, partial [Theobroma cacao] Length = 1237 Score = 746 bits (1925), Expect = 0.0 Identities = 400/638 (62%), Positives = 461/638 (72%), Gaps = 5/638 (0%) Frame = -2 Query: 2073 AGLPIGVQIDSTFVIGTHKPSVEILSFVPENGLRILASGIISLTNTLGTAISGCVPQDVR 1894 A LP+GV + TFVIGTH+PSVEILSF P+ GLR+LA+G ISL + + TA+SGC+PQDVR Sbjct: 608 AVLPVGVGMGITFVIGTHRPSVEILSFTPQ-GLRVLATGTISLASAMETAVSGCIPQDVR 666 Query: 1893 LVLVDRLYILSGLRNGMLLRFEWPVSSSAFPSELPTQSPSISHCLTNINAXXXXXXXXXX 1714 LVLVD+ Y+LSGLRNGMLLRFEWP + + SE + + + N++ Sbjct: 667 LVLVDQFYVLSGLRNGMLLRFEWPSAVATSSSECCSSTSPLPE---NVDRVLLNTKTANL 723 Query: 1713 IRQQCCAGERSGRPGDDVPIYLQLIAIRRIGITPVFLVPLHDCLDADIIALSDRPWLLQT 1534 + CA S + DD+PI LQLIA RRIGITPVFLVPL D LDADIIALSDRPWLL T Sbjct: 724 FGSEICAVNVSEK--DDLPINLQLIATRRIGITPVFLVPLSDSLDADIIALSDRPWLLHT 781 Query: 1533 ARHSLSYTSISFQPSTHVTPVCSVDCPKGILFVAESSLHLVEMVHSKRLNVQKFHLGGTP 1354 ARHSLSYTSISFQPSTH TPVCS +CPKGILFV E+SLHLVEMVH RLNVQKFHLGGTP Sbjct: 782 ARHSLSYTSISFQPSTHATPVCSAECPKGILFVTENSLHLVEMVHGNRLNVQKFHLGGTP 841 Query: 1353 RKVLYHSETRLLLVMRTDLEIQKIDLRLFSSSDICLVDPLSGSLLTTYVLEPGETGKSMQ 1174 RKVLYHSE++LL+VMRTDL SSDIC VDPL+ S++ ++ LE GETGK M+ Sbjct: 842 RKVLYHSESKLLIVMRTDLSNDTC------SSDICCVDPLTVSVVASFKLELGETGKCME 895 Query: 1173 LVKVGNEQVLVVGTSQSAGRPIMPSGEAESTKGRLLVLSIAHIKGDSS---RHCPNXXXX 1003 LV+ GNEQVLVVGTS S G IMPSGEAESTKGRL+VL I H++ S Sbjct: 896 LVRAGNEQVLVVGTSLSPGPAIMPSGEAESTKGRLIVLCIEHVQNSDSGSMTFSSMAGSS 955 Query: 1002 XXXXSPLGDIVGRATEQXXXXXXXXXXXXXXXXXXN--GCREWELELVFQTTLPGAVLAV 829 SP +IVG A EQ W+L L + TT P VLA+ Sbjct: 956 SQRNSPFCEIVGHANEQLSSSSICSSPDDTSCDGIKLEETEAWQLRLAYATTWPAMVLAI 1015 
Query: 828 CPYLDRYFLASAGNILFVYGFLSENPLRVRKFASGRTRFTITCLTTQFTRIAVGDCRDGI 649 CPYLD YFLASAGN +V FLS NP RVR+FA RTRF I LT TRIAVGDCRDGI Sbjct: 1016 CPYLDHYFLASAGNTFYVCAFLSGNPQRVRRFALARTRFMIMSLTAHSTRIAVGDCRDGI 1075 Query: 648 LFYSYQEDLKKLEQLYCDPDQRLVADCNLMDLDTAVVSDRRGNFAVLSSKNYLEDNASPE 469 LFYSY E+ KKL+Q YCDP QRLVADC L D+DTAVVSDR+G+ AVLS + LEDNASPE Sbjct: 1076 LFYSYHEETKKLDQTYCDPSQRLVADCVLTDVDTAVVSDRKGSVAVLSCSDRLEDNASPE 1135 Query: 468 CNLTLSCSYYIGETVMSIRKGSFSYKLPVDDVLKGCYDADRVVDLLHSSIVASTLLGSVM 289 NLTL+ +YY+GE MSIRKGSF YKLP DD+L C + VD H +I+ASTLLGS+M Sbjct: 1136 RNLTLTSAYYMGEIAMSIRKGSFIYKLPADDMLNSCEGLNASVDPSHGTIMASTLLGSIM 1195 Query: 288 IFIPISSEEHELLEAVQSRLVVHPLTAPILGNDHNEFR 175 IFIPIS EEHELLEAVQ+RL+VHPLTAP+LGNDHNE+R Sbjct: 1196 IFIPISREEHELLEAVQARLIVHPLTAPVLGNDHNEYR 1233