BLASTX nr result
ID: Achyranthes23_contig00009372
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00009372 (2253 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

emb|CBI29964.3| unnamed protein product [Vitis vinifera]             845   0.0
ref|XP_002276675.1| PREDICTED: pre-mRNA-splicing factor rse1-lik...  839   0.0
gb|EXB29323.1| DNA damage-binding protein 1b [Morus notabilis]       826   0.0
gb|EMJ05498.1| hypothetical protein PRUPE_ppa000262mg [Prunus pe...  824   0.0
ref|XP_006481686.1| PREDICTED: uncharacterized protein LOC102624...  810   0.0
ref|XP_006481685.1| PREDICTED: uncharacterized protein LOC102624...  810   0.0
ref|XP_002308344.2| hypothetical protein POPTR_0006s21160g [Popu...  799   0.0
ref|XP_006351358.1| PREDICTED: pre-mRNA-splicing factor prp12-li...  797   0.0
gb|EOY09618.1| Cleavage and polyadenylation specificity factor (...  792   0.0
ref|XP_004249760.1| PREDICTED: pre-mRNA-splicing factor prp12-li...  789   0.0
ref|XP_004303372.1| PREDICTED: pre-mRNA-splicing factor rse-1-li...  788   0.0
ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-lik...  784   0.0
ref|XP_002531586.1| spliceosomal protein sap, putative [Ricinus ...  752   0.0
ref|XP_006296833.1| hypothetical protein CARUB_v10012818mg [Caps...  749   0.0
ref|XP_006407388.1| hypothetical protein EUTSA_v10019900mg [Eutr...  748   0.0
ref|XP_006577113.1| PREDICTED: splicing factor 3B subunit 3-like...  748   0.0
gb|ESW35025.1| hypothetical protein PHAVU_001G200200g [Phaseolus...  745   0.0
ref|XP_002882757.1| predicted protein [Arabidopsis lyrata subsp....  738   0.0
ref|NP_187802.2| Cleavage and polyadenylation specificity factor...
733 0.0 gb|EOY09619.1| Cleavage and polyadenylation specificity factor (... 731 0.0 >emb|CBI29964.3| unnamed protein product [Vitis vinifera] Length = 1363 Score = 845 bits (2183), Expect = 0.0 Identities = 424/570 (74%), Positives = 491/570 (86%), Gaps = 6/570 (1%) Frame = +3 Query: 30 NDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFEPST 209 N PV LQLIA+RRIGITPVFLVPLS+SL+ D+IALSDRPWL+Q+ARHSLSY SISF+PST Sbjct: 794 NSPVNLQLIAIRRIGITPVFLVPLSDSLEADIIALSDRPWLLQSARHSLSYTSISFQPST 853 Query: 210 YVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLLVLR 389 +VTPVCS ECP G+LFVAEN LHLVEMV+SKRLNVQKF+LGGTPRKVLYHS+SRLLLV+R Sbjct: 854 HVTPVCSMECPMGILFVAENSLHLVEMVHSKRLNVQKFYLGGTPRKVLYHSESRLLLVMR 913 Query: 390 TDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSGPAI 569 T+L D +SSDICCVDPLSGSV+SSF +LGETGK ME VRV EQ+L++GTSLSSGPA+ Sbjct: 914 TELSQDTYSSDICCVDPLSGSVLSSFKLELGETGKSMELVRVVNEQVLVIGTSLSSGPAM 973 Query: 570 MPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMSNSS 746 MPSGEAESTKGRLI+L + + NSDSGSMT C KAGSS+QR SP+ E GY E++S SS Sbjct: 974 MPSGEAESTKGRLIVLCLEHMQNSDSGSMTFCSKAGSSSQRTSPFREIVGYAAEQLSGSS 1033 Query: 747 LCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVCGFQ 926 LCSSPDD SCD ++LE+SEAW L L + TWPG+VL+ICPYLD YFLAS+GN+FYVCGF Sbjct: 1034 LCSSPDDTSCDGVRLEESEAWQLRLAYTATWPGMVLAICPYLDRYFLASAGNSFYVCGFP 1093 Query: 927 NDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDPGQR 1106 NDN +R++R AV RTRFMI+SLTA+FTRIAVGDCRDGV+FYSYHED++KLEQ+YCDP QR Sbjct: 1094 NDNPQRVRRFAVGRTRFMIMSLTAHFTRIAVGDCRDGVVFYSYHEDSRKLEQLYCDPEQR 1153 Query: 1107 LVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMSIRKGS 1286 LVADC+L ++DTA VSDRKGSIAVL+ S HLEDNASPECNL+++CSYY+GEIAMSI+KGS Sbjct: 1154 LVADCILMDVDTAVVSDRKGSIAVLSCSNHLEDNASPECNLTLNCSYYMGEIAMSIKKGS 1213 Query: 1287 FSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQARLAV 1466 FSYKL A+D LKGCD SN IID S N IMAGTLLGSI++ IPISREE+ELL+ VQARLAV Sbjct: 1214 FSYKLPADDVLKGCDGSNTIIDFSENSIMAGTLLGSIIMLIPISREEHELLEAVQARLAV 1273 Query: 1467 
HPLTAPILGNNHSEFRSRENQIL---VPTILDGDMLAQFLELTSIQQEAVLGLPCATSEV 1637 H LTAPILGN+H+EFRSREN + V ILDGDMLAQFLELTS+QQEAVL LP + E Sbjct: 1274 HQLTAPILGNDHNEFRSRENSVRKAGVSKILDGDMLAQFLELTSMQQEAVLALPLGSLET 1333 Query: 1638 XXXXXXXH--APVSVNQVVQLLERVHYVLN 1721 +P+SVN+VVQLLERVHY LN Sbjct: 1334 VTSSSKQTLLSPISVNRVVQLLERVHYALN 1363 >ref|XP_002276675.1| PREDICTED: pre-mRNA-splicing factor rse1-like [Vitis vinifera] Length = 1387 Score = 839 bits (2167), Expect = 0.0 Identities = 424/580 (73%), Positives = 490/580 (84%), Gaps = 16/580 (2%) Frame = +3 Query: 30 NDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFEPST 209 N PV LQLIA+RRIGITPVFLVPLS+SL+ D+IALSDRPWL+Q+ARHSLSY SISF+PST Sbjct: 808 NSPVNLQLIAIRRIGITPVFLVPLSDSLEADIIALSDRPWLLQSARHSLSYTSISFQPST 867 Query: 210 YVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLLVLR 389 +VTPVCS ECP G+LFVAEN LHLVEMV+SKRLNVQKF+LGGTPRKVLYHS+SRLLLV+R Sbjct: 868 HVTPVCSMECPMGILFVAENSLHLVEMVHSKRLNVQKFYLGGTPRKVLYHSESRLLLVMR 927 Query: 390 TDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSGPAI 569 T+L D +SSDICCVDPLSGSV+SSF +LGETGK ME VRV EQ+L++GTSLSSGPA+ Sbjct: 928 TELSQDTYSSDICCVDPLSGSVLSSFKLELGETGKSMELVRVVNEQVLVIGTSLSSGPAM 987 Query: 570 MPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMSNSS 746 MPSGEAESTKGRLI+L + + NSDSGSMT C KAGSS+QR SP+ E GY E++S SS Sbjct: 988 MPSGEAESTKGRLIVLCLEHMQNSDSGSMTFCSKAGSSSQRTSPFREIVGYAAEQLSGSS 1047 Query: 747 LCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVCGFQ 926 LCSSPDD SCD ++LE+SEAW L L + TWPG+VL+ICPYLD YFLAS+GN+FYVCGF Sbjct: 1048 LCSSPDDTSCDGVRLEESEAWQLRLAYTATWPGMVLAICPYLDRYFLASAGNSFYVCGFP 1107 Query: 927 NDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDPGQR 1106 NDN +R++R AV RTRFMI+SLTA+FTRIAVGDCRDGV+FYSYHED++KLEQ+YCDP QR Sbjct: 1108 NDNPQRVRRFAVGRTRFMIMSLTAHFTRIAVGDCRDGVVFYSYHEDSRKLEQLYCDPEQR 1167 Query: 1107 LVADCLLTNLDTAFVSDRKGSIAVLTSSTHLE-------------DNASPECNLSVSCSY 1247 LVADC+L ++DTA VSDRKGSIAVL+ S HLE DNASPECNL+++CSY Sbjct: 1168 
LVADCILMDVDTAVVSDRKGSIAVLSCSNHLEELHGFKFLIISCPDNASPECNLTLNCSY 1227 Query: 1248 YIGEIAMSIRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREE 1427 Y+GEIAMSI+KGSFSYKL A+D LKGCD SN IID S N IMAGTLLGSI++ IPISREE Sbjct: 1228 YMGEIAMSIKKGSFSYKLPADDVLKGCDGSNTIIDFSENSIMAGTLLGSIIMLIPISREE 1287 Query: 1428 YELLKPVQARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAV 1607 +ELL+ VQARLAVH LTAPILGN+H+EFRSREN V ILDGDMLAQFLELTS+QQEAV Sbjct: 1288 HELLEAVQARLAVHQLTAPILGNDHNEFRSRENSAGVSKILDGDMLAQFLELTSMQQEAV 1347 Query: 1608 LGLPCATSEVXXXXXXXH--APVSVNQVVQLLERVHYVLN 1721 L LP + E +P+SVN+VVQLLERVHY LN Sbjct: 1348 LALPLGSLETVTSSSKQTLLSPISVNRVVQLLERVHYALN 1387 >gb|EXB29323.1| DNA damage-binding protein 1b [Morus notabilis] Length = 1388 Score = 826 bits (2134), Expect = 0.0 Identities = 419/571 (73%), Positives = 481/571 (84%), Gaps = 2/571 (0%) Frame = +3 Query: 15 EKPKDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISIS 194 EK K +P+ LQLIA+RRIGITPVFLVPLS SLD D+IALSDRPWL+ ARHSLSY SIS Sbjct: 821 EKAKSKNPINLQLIAIRRIGITPVFLVPLSSSLDADIIALSDRPWLLHTARHSLSYTSIS 880 Query: 195 FEPSTYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRL 374 F+ ST+VTPVCSAECPKG+LFVAEN LHLVEMV+ KRLNVQK LGGTPRKVLYHS+SRL Sbjct: 881 FQASTHVTPVCSAECPKGILFVAENSLHLVEMVHCKRLNVQKLSLGGTPRKVLYHSESRL 940 Query: 375 LLVLRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLS 554 LLV+RTDL ND SSDICCVDPLSG+V+SSF D GETGK ME VRVG EQ+L+VGT LS Sbjct: 941 LLVMRTDLTNDTCSSDICCVDPLSGTVLSSFKLDHGETGKSMELVRVGNEQVLVVGTRLS 1000 Query: 555 SGPAIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTER 731 SGPAIMPSGEAESTKGRLI+L + NSDSGSMT KAGSS+QR SP+ E GY TE+ Sbjct: 1001 SGPAIMPSGEAESTKGRLIVLCLEHAQNSDSGSMTFSSKAGSSSQRASPFREIVGYATEQ 1060 Query: 732 MSNSSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFY 911 +S+SSLCSSPDD SCD +KLE++EAW L L +++ WPG+VL+ICPYL+ YFLAS+GN+FY Sbjct: 1061 LSSSSLCSSPDDTSCDGIKLEETEAWQLRLAYSVMWPGMVLAICPYLERYFLASAGNSFY 1120 Query: 912 VCGFQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYC 1091 VCGF NDN +R+++ AV 
RTRFMI SLTA+FTRIAVGDCRDG+LF+SYHEDA+KLEQ+YC Sbjct: 1121 VCGFPNDNSQRVRKFAVGRTRFMITSLTAHFTRIAVGDCRDGILFFSYHEDARKLEQLYC 1180 Query: 1092 DPGQRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMS 1271 DP QRLVADCLL +LDTA VSDRKGSIAVL+ + HLEDNASPECNL+VSC+YY+GEIAMS Sbjct: 1181 DPSQRLVADCLLMDLDTAVVSDRKGSIAVLSCADHLEDNASPECNLNVSCAYYMGEIAMS 1240 Query: 1272 IRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQ 1451 I+KGSFSY L A+D LKG SN ID +RN I+A TLLGSI+ FIP+SR+EYELL+ VQ Sbjct: 1241 IKKGSFSYSLPADDVLKG---SNMKIDSARNTIIASTLLGSIITFIPLSRDEYELLEAVQ 1297 Query: 1452 ARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPCATS 1631 +RL VHPLTAPILGN+H+EFRSREN VP ILDGDML QFLELT +QQEAVL LP T Sbjct: 1298 SRLVVHPLTAPILGNDHNEFRSRENPPGVPKILDGDMLTQFLELTRMQQEAVLSLPLGTK 1357 Query: 1632 E-VXXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 + V P+ VNQVVQLLERVHY LN Sbjct: 1358 DAVSSSSKTTPPPIPVNQVVQLLERVHYALN 1388 >gb|EMJ05498.1| hypothetical protein PRUPE_ppa000262mg [Prunus persica] Length = 1378 Score = 824 bits (2128), Expect = 0.0 Identities = 415/571 (72%), Positives = 479/571 (83%), Gaps = 2/571 (0%) Frame = +3 Query: 15 EKPKDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISIS 194 EK KD P+ LQLIA RRIGITPVFLVPLS+SLD D++ LSDRPWL+ ARHSLSY SIS Sbjct: 811 EKTKDKFPIELQLIATRRIGITPVFLVPLSDSLDGDIVVLSDRPWLLHTARHSLSYTSIS 870 Query: 195 FEPSTYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRL 374 F+ ST+VTPVC ECPKG+LFVAENCLHLVEMV+SKRLNVQKFHLGGTPR+VLYHS+SRL Sbjct: 871 FQSSTHVTPVCYVECPKGILFVAENCLHLVEMVHSKRLNVQKFHLGGTPREVLYHSESRL 930 Query: 375 LLVLRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLS 554 LLV+RTDL ND SSDICCVDPLSGSV+SSF + GETGK ME VRVG EQ+L+VGTSLS Sbjct: 931 LLVMRTDLSNDTSSSDICCVDPLSGSVLSSFKLEPGETGKSMELVRVGNEQVLVVGTSLS 990 Query: 555 SGPAIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTER 731 SGPAIMPSGEAESTKGRLI+L + + NSDSGSMT C KAGSS+QR SP+ E GY TE+ Sbjct: 991 SGPAIMPSGEAESTKGRLIVLCLEHVQNSDSGSMTLCSKAGSSSQRASPFHEIVGYATEQ 1050 Query: 732 
MSNSSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFY 911 +S+SSLCSSPDD SCD +KLE++EAW L + WPG+VL+ICPYLD YFLASSGNAFY Sbjct: 1051 LSSSSLCSSPDDTSCDGIKLEETEAWQFRLAYVTKWPGMVLAICPYLDRYFLASSGNAFY 1110 Query: 912 VCGFQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYC 1091 VCGF NDN +R+++ A RTRFMI SLTA+FT IAVGDCRDGVLFY+YHED+KKL+Q+Y Sbjct: 1111 VCGFPNDNSQRVRKFAWARTRFMITSLTAHFTTIAVGDCRDGVLFYAYHEDSKKLQQLYF 1170 Query: 1092 DPGQRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMS 1271 DP QRLVADC+L +++TA VSDRKGSIAVL+ + +LED ASPECNL+VSC+YY+GEIAMS Sbjct: 1171 DPCQRLVADCILMDVNTAVVSDRKGSIAVLSCADYLEDTASPECNLTVSCAYYMGEIAMS 1230 Query: 1272 IRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQ 1451 IRKGSFSYKL A+D LKGCD + ID S+N I+ TLLGSI+ F+PISREEYELL+ VQ Sbjct: 1231 IRKGSFSYKLPADDVLKGCDGN---IDFSQNAIIVSTLLGSIITFVPISREEYELLEAVQ 1287 Query: 1452 ARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPC-AT 1628 RL VHPLTAPILGN+H+E+RSREN + VP ILDGDML+QFLELT +QQEAVL P A Sbjct: 1288 DRLVVHPLTAPILGNDHNEYRSRENPVGVPKILDGDMLSQFLELTGMQQEAVLSSPLGAQ 1347 Query: 1629 SEVXXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 V +A + VNQVVQLLERVHY LN Sbjct: 1348 GTVKPSLKSRYALIPVNQVVQLLERVHYALN 1378 >ref|XP_006481686.1| PREDICTED: uncharacterized protein LOC102624787 isoform X2 [Citrus sinensis] Length = 1265 Score = 810 bits (2092), Expect = 0.0 Identities = 410/572 (71%), Positives = 480/572 (83%), Gaps = 3/572 (0%) Frame = +3 Query: 15 EKPKDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISIS 194 E+ KD P+ LQLIA RRIGITPVFLVPLS+ LD D+IALSDRPWL+Q ARHSL+Y SIS Sbjct: 697 EESKDELPINLQLIATRRIGITPVFLVPLSDLLDADMIALSDRPWLLQTARHSLAYTSIS 756 Query: 195 FEPSTYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRL 374 F+PST+ TPVCS ECPKG+LFVAEN L+LVEMV++KRLNV KFHLGGTP+KVLYHS+SRL Sbjct: 757 FQPSTHATPVCSVECPKGILFVAENSLNLVEMVHNKRLNVPKFHLGGTPKKVLYHSESRL 816 Query: 375 LLVLRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLS 554 L+V+RT+L+ND SSDICCVDPLSGSV+SSF +LGETGK ME VRVG EQ+L+VGTSLS Sbjct: 817 
LIVMRTELNNDTCSSDICCVDPLSGSVLSSFKLELGETGKSMELVRVGHEQVLVVGTSLS 876 Query: 555 SGPAIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTER 731 SGPAIMPSGEAESTKGRLI+L + + NSD GSMT C KAGSS+QR SP+ E GY TE+ Sbjct: 877 SGPAIMPSGEAESTKGRLIVLCIEHMQNSDCGSMTFCSKAGSSSQRTSPFREIVGYATEQ 936 Query: 732 MSNSSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFY 911 +S+SSLCSSPDD SCD +KLE++E W L L ++ TWPG+VL+ICPYLD YFLAS+GNAFY Sbjct: 937 LSSSSLCSSPDDASCDGIKLEETETWQLRLAYSTTWPGMVLAICPYLDRYFLASAGNAFY 996 Query: 912 VCGFQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYC 1091 VCGF NDN +R++R AV RTRFMI+ LTA+FTRIAVGDCRDG+LFYSYHEDA+KLEQIYC Sbjct: 997 VCGFPNDNPQRVRRFAVGRTRFMIMLLTAHFTRIAVGDCRDGILFYSYHEDARKLEQIYC 1056 Query: 1092 DPGQRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMS 1271 DP QRLVADC+L ++DTA VSDRKGSIAVL+ S LEDNASPECNL+ +C+Y++GEIA+S Sbjct: 1057 DPSQRLVADCVLMDVDTAVVSDRKGSIAVLSCSDRLEDNASPECNLTPNCAYHMGEIAVS 1116 Query: 1272 IRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQ 1451 IRKGSF YKL A+D L C S + S+ I+A TLLGSIVIFIPIS EEYELL+ VQ Sbjct: 1117 IRKGSFIYKLPADDTLGDCLAS---FESSQTTIIASTLLGSIVIFIPISSEEYELLEAVQ 1173 Query: 1452 ARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPCATS 1631 ARLA+HPLTAP+LGN+H+EFRSREN + VP ILDGDML+QFLELTS QQEAVL + Sbjct: 1174 ARLAIHPLTAPLLGNDHNEFRSRENPVGVPKILDGDMLSQFLELTSTQQEAVLSFTLGSF 1233 Query: 1632 EV--XXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 + +P+ VNQVVQLLERVHY LN Sbjct: 1234 DTIKASSKLPPSSPIPVNQVVQLLERVHYALN 1265 >ref|XP_006481685.1| PREDICTED: uncharacterized protein LOC102624787 isoform X1 [Citrus sinensis] Length = 1394 Score = 810 bits (2092), Expect = 0.0 Identities = 410/572 (71%), Positives = 480/572 (83%), Gaps = 3/572 (0%) Frame = +3 Query: 15 EKPKDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISIS 194 E+ KD P+ LQLIA RRIGITPVFLVPLS+ LD D+IALSDRPWL+Q ARHSL+Y SIS Sbjct: 826 EESKDELPINLQLIATRRIGITPVFLVPLSDLLDADMIALSDRPWLLQTARHSLAYTSIS 885 Query: 195 FEPSTYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRL 374 F+PST+ TPVCS 
ECPKG+LFVAEN L+LVEMV++KRLNV KFHLGGTP+KVLYHS+SRL Sbjct: 886 FQPSTHATPVCSVECPKGILFVAENSLNLVEMVHNKRLNVPKFHLGGTPKKVLYHSESRL 945 Query: 375 LLVLRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLS 554 L+V+RT+L+ND SSDICCVDPLSGSV+SSF +LGETGK ME VRVG EQ+L+VGTSLS Sbjct: 946 LIVMRTELNNDTCSSDICCVDPLSGSVLSSFKLELGETGKSMELVRVGHEQVLVVGTSLS 1005 Query: 555 SGPAIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTER 731 SGPAIMPSGEAESTKGRLI+L + + NSD GSMT C KAGSS+QR SP+ E GY TE+ Sbjct: 1006 SGPAIMPSGEAESTKGRLIVLCIEHMQNSDCGSMTFCSKAGSSSQRTSPFREIVGYATEQ 1065 Query: 732 MSNSSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFY 911 +S+SSLCSSPDD SCD +KLE++E W L L ++ TWPG+VL+ICPYLD YFLAS+GNAFY Sbjct: 1066 LSSSSLCSSPDDASCDGIKLEETETWQLRLAYSTTWPGMVLAICPYLDRYFLASAGNAFY 1125 Query: 912 VCGFQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYC 1091 VCGF NDN +R++R AV RTRFMI+ LTA+FTRIAVGDCRDG+LFYSYHEDA+KLEQIYC Sbjct: 1126 VCGFPNDNPQRVRRFAVGRTRFMIMLLTAHFTRIAVGDCRDGILFYSYHEDARKLEQIYC 1185 Query: 1092 DPGQRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMS 1271 DP QRLVADC+L ++DTA VSDRKGSIAVL+ S LEDNASPECNL+ +C+Y++GEIA+S Sbjct: 1186 DPSQRLVADCVLMDVDTAVVSDRKGSIAVLSCSDRLEDNASPECNLTPNCAYHMGEIAVS 1245 Query: 1272 IRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQ 1451 IRKGSF YKL A+D L C S + S+ I+A TLLGSIVIFIPIS EEYELL+ VQ Sbjct: 1246 IRKGSFIYKLPADDTLGDCLAS---FESSQTTIIASTLLGSIVIFIPISSEEYELLEAVQ 1302 Query: 1452 ARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPCATS 1631 ARLA+HPLTAP+LGN+H+EFRSREN + VP ILDGDML+QFLELTS QQEAVL + Sbjct: 1303 ARLAIHPLTAPLLGNDHNEFRSRENPVGVPKILDGDMLSQFLELTSTQQEAVLSFTLGSF 1362 Query: 1632 EV--XXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 + +P+ VNQVVQLLERVHY LN Sbjct: 1363 DTIKASSKLPPSSPIPVNQVVQLLERVHYALN 1394 >ref|XP_002308344.2| hypothetical protein POPTR_0006s21160g [Populus trichocarpa] gi|550336774|gb|EEE91867.2| hypothetical protein POPTR_0006s21160g [Populus trichocarpa] Length = 1397 Score = 799 bits (2064), Expect = 0.0 Identities = 403/578 (69%), 
Positives = 476/578 (82%), Gaps = 5/578 (0%) Frame = +3 Query: 3 VGSYEKPKDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSY 182 V S + D+ P+ LQLIA RRIGITPVFLVPLS+SLD D+IALSDRPWL+ AARHSLSY Sbjct: 821 VDSIDNTMDDLPINLQLIATRRIGITPVFLVPLSDSLDSDMIALSDRPWLLHAARHSLSY 880 Query: 183 ISISFEPSTYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHS 362 SISF+PST+ TPVCS ECPKG+LFVA+N LHLVEMV+S RLNVQKFHLGGTPRKV YHS Sbjct: 881 TSISFQPSTHATPVCSVECPKGILFVADNSLHLVEMVHSTRLNVQKFHLGGTPRKVQYHS 940 Query: 363 DSRLLLVLRTDL--DNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILL 536 +S+LLLV+RT+L DND SSDICCVDPLSGS VSSF + GETGK ME V++G EQ+L+ Sbjct: 941 ESKLLLVMRTELSNDNDTCSSDICCVDPLSGSTVSSFKLERGETGKSMELVKIGNEQVLV 1000 Query: 537 VGTSLSSGPAIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGT 713 +GTSLSSGPAIMPSGEAESTKGR+I+L + L NSDSGSMT C KAGSS+QR SP+ E Sbjct: 1001 IGTSLSSGPAIMPSGEAESTKGRVIVLCLENLQNSDSGSMTFCSKAGSSSQRTSPFREIV 1060 Query: 714 GYTTERMSNSSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLAS 893 GY E++S+SSLCSSPDD SCD +KLE++E W L + A T PG+VL+ICPYLD +FLAS Sbjct: 1061 GYAAEQLSSSSLCSSPDDTSCDGVKLEETETWQLRFVSATTLPGMVLAICPYLDRFFLAS 1120 Query: 894 SGNAFYVCGFQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKK 1073 +GN+FYVCGF NDN KR+K+ AV RTRFMI+SLTAY TRIAVGDCRDG+LFY+YH ++KK Sbjct: 1121 AGNSFYVCGFANDN-KRVKKFAVGRTRFMIMSLTAYHTRIAVGDCRDGILFYAYHVESKK 1179 Query: 1074 LEQIYCDPGQRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYI 1253 LEQ+YCDP QRLVA C+L ++DTA VSDRKGSIAVL+ S E SPECNL+++C+YY+ Sbjct: 1180 LEQLYCDPSQRLVAGCVLMDVDTAVVSDRKGSIAVLSRSDRFECTGSPECNLTLNCAYYM 1239 Query: 1254 GEIAMSIRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYE 1433 GEIAMSIRKGSF+YKL A+D L GCD +D S N I+A TLLGSI++FIP+SREE+E Sbjct: 1240 GEIAMSIRKGSFTYKLPADDILTGCDGVITKMDASNNTIVASTLLGSIIVFIPLSREEFE 1299 Query: 1434 LLKPVQARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLG 1613 LL+ VQ+RL VHPLTAP+LGN+H EFRSREN + VP ILDGDMLAQFLELTS QQEAVL Sbjct: 1300 LLQAVQSRLVVHPLTAPVLGNDHHEFRSRENPVGVPKILDGDMLAQFLELTSSQQEAVLS 1359 Query: 1614 
LPCATSEVXXXXXXXHA--PVSVNQVVQLLERVHYVLN 1721 LP + + P+S++QVVQLLERVHY LN Sbjct: 1360 LPLGPPDTIKTNLKPFSTLPISISQVVQLLERVHYALN 1397 >ref|XP_006351358.1| PREDICTED: pre-mRNA-splicing factor prp12-like isoform X1 [Solanum tuberosum] Length = 1393 Score = 797 bits (2059), Expect = 0.0 Identities = 397/572 (69%), Positives = 480/572 (83%), Gaps = 3/572 (0%) Frame = +3 Query: 15 EKPKDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISIS 194 +K KD PV LQL+AVRRIGITPVFL+PL++SLD DVIALSDRPWL+Q ARHSLSY SIS Sbjct: 823 DKTKDF-PVYLQLVAVRRIGITPVFLIPLNDSLDADVIALSDRPWLLQTARHSLSYTSIS 881 Query: 195 FEPSTYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRL 374 F PST+VTPVCS ECPKG++FVAEN LHLVEMV SKRLNVQKFH GGTPRKVLYHSDSRL Sbjct: 882 FPPSTHVTPVCSTECPKGIIFVAENSLHLVEMVPSKRLNVQKFHFGGTPRKVLYHSDSRL 941 Query: 375 LLVLRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLS 554 LLVLRTDL +D+ SSD+CC+DPLSGSV+SSF F+ GE GKCM+ V+ G EQ+L+VGT LS Sbjct: 942 LLVLRTDLSDDLCSSDVCCIDPLSGSVLSSFKFEPGEIGKCMDLVKAGNEQVLVVGTGLS 1001 Query: 555 SGPAIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTER 731 SGPAIMPSGEAESTKGRLI+L + + NSDSGS+ +AGSS+QR SP+ E GY E+ Sbjct: 1002 SGPAIMPSGEAESTKGRLIVLCLEQMQNSDSGSIAFSSRAGSSSQRTSPFREIGGYAAEQ 1061 Query: 732 MSNSSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFY 911 +S+SSLCSSPDDNSCD +KLE+SEAW L L ++ TWPG+VL++CPYLD +FLAS+ N FY Sbjct: 1062 LSSSSLCSSPDDNSCDGIKLEESEAWHLRLGYSTTWPGMVLAVCPYLDRFFLASAANCFY 1121 Query: 912 VCGFQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYC 1091 VCGF NDN +R++R AV RTRFMI++LTA+FTRIAVGDCRDG+LFYSY EDA+KL+Q+YC Sbjct: 1122 VCGFPNDNAQRVRRLAVGRTRFMIMTLTAHFTRIAVGDCRDGILFYSYQEDARKLDQVYC 1181 Query: 1092 DPGQRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDN-ASPECNLSVSCSYYIGEIAM 1268 DP QRLV+DC L + DTA VSDRKGS+A+L+ HLEDN SPE NL+++CS+Y+GEIA+ Sbjct: 1182 DPVQRLVSDCTLMDGDTAAVSDRKGSLAILSCLNHLEDNFNSPERNLALTCSFYMGEIAI 1241 Query: 1269 SIRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPV 1448 IRKGSFSYKL A+D L+GC ++N+ D+S+N IMA TLLGSI+IFIP++REEY+LL+ V Sbjct: 
1242 RIRKGSFSYKLPADDALRGCQVASNVGDISQNSIMASTLLGSIIIFIPLTREEYDLLEAV 1301 Query: 1449 QARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPC-A 1625 QARL +HPLTAPILGN+H+E+R R + P LDGDMLAQFLELTS+QQEAVL LP A Sbjct: 1302 QARLVIHPLTAPILGNDHTEYRCRGSTARAPKALDGDMLAQFLELTSMQQEAVLALPLGA 1361 Query: 1626 TSEVXXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 + + P++VNQVV+LLER+HY LN Sbjct: 1362 QNTIMFNSKQSPDPITVNQVVRLLERIHYALN 1393 >gb|EOY09618.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein isoform 1 [Theobroma cacao] Length = 1391 Score = 792 bits (2045), Expect = 0.0 Identities = 397/568 (69%), Positives = 469/568 (82%), Gaps = 2/568 (0%) Frame = +3 Query: 24 KDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFEP 203 KD+ P+ LQLIA RRIGITPVFLVPLS+SLD D+IALSDRPWL+ ARHSLSY SISF+P Sbjct: 824 KDDLPINLQLIATRRIGITPVFLVPLSDSLDADIIALSDRPWLLHTARHSLSYTSISFQP 883 Query: 204 STYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLLV 383 ST+ TPVCSAECPKG+LFV EN LHLVEMV+ RLNVQKFHLGGTPRKVLYHS+S+LL+V Sbjct: 884 STHATPVCSAECPKGILFVTENSLHLVEMVHGNRLNVQKFHLGGTPRKVLYHSESKLLIV 943 Query: 384 LRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSGP 563 +RTDL ND SSDICCVDPL+ SVV+SF +LGETGKCME VR G EQ+L+VGTSLS GP Sbjct: 944 MRTDLSNDTCSSDICCVDPLTVSVVASFKLELGETGKCMELVRAGNEQVLVVGTSLSPGP 1003 Query: 564 AIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMSN 740 AIMPSGEAESTKGRLI+L + + NSDSGSMT AGSS+QR SP+CE G+ E++S+ Sbjct: 1004 AIMPSGEAESTKGRLIVLCIEHVQNSDSGSMTFSSMAGSSSQRNSPFCEIVGHANEQLSS 1063 Query: 741 SSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVCG 920 SS+CSSPDD SCD +KLE++EAW L L +A TWP +VL+ICPYLD+YFLAS+GN FYVC Sbjct: 1064 SSICSSPDDTSCDGIKLEETEAWQLRLAYATTWPAMVLAICPYLDHYFLASAGNTFYVCA 1123 Query: 921 FQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDPG 1100 F + N +R++R A+ RTRFMI+SLTA+ TRIAVGDCRDG+LFYSYHE+ KKL+Q YCDP Sbjct: 1124 FLSGNPQRVRRFALARTRFMIMSLTAHSTRIAVGDCRDGILFYSYHEETKKLDQTYCDPS 1183 Query: 1101 
QRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMSIRK 1280 QRLVADC+LT++DTA VSDRKGS+AVL+ S LEDNASPE NL+++ +YY+GEIAMSIRK Sbjct: 1184 QRLVADCVLTDVDTAVVSDRKGSVAVLSCSDRLEDNASPERNLTLTSAYYMGEIAMSIRK 1243 Query: 1281 GSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQARL 1460 GSF YKL A+D L C+ N +D S IMA TLLGSI+IFIPISREE+ELL+ VQARL Sbjct: 1244 GSFIYKLPADDMLNSCEGLNASVDPSHGTIMASTLLGSIMIFIPISREEHELLEAVQARL 1303 Query: 1461 AVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPCATSEV- 1637 VHPLTAP+LGN+H+E+RS EN VP ILDGDMLAQFLELTS+QQEAVL + + Sbjct: 1304 IVHPLTAPVLGNDHNEYRSCENPAGVPKILDGDMLAQFLELTSMQQEAVLSFSIVSPDTH 1363 Query: 1638 XXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 +P+ V +VVQLLERVHY LN Sbjct: 1364 KLSSKQPPSPIPVKKVVQLLERVHYALN 1391 >ref|XP_004249760.1| PREDICTED: pre-mRNA-splicing factor prp12-like [Solanum lycopersicum] Length = 1394 Score = 789 bits (2037), Expect = 0.0 Identities = 396/573 (69%), Positives = 479/573 (83%), Gaps = 4/573 (0%) Frame = +3 Query: 15 EKPKDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISIS 194 +K KD PV LQL+AVRRIGITPVFL+PL++SLD DVIALSDRPWL+Q ARHSLSY SIS Sbjct: 823 DKTKDF-PVYLQLVAVRRIGITPVFLIPLNDSLDADVIALSDRPWLLQTARHSLSYTSIS 881 Query: 195 FEPSTYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRL 374 F PST+VTPVCS ECPKG++FVAEN LHLVEMV SKRLNVQKFH GGTPRKVLYHSDSRL Sbjct: 882 FPPSTHVTPVCSTECPKGIIFVAENSLHLVEMVPSKRLNVQKFHFGGTPRKVLYHSDSRL 941 Query: 375 LLVLRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLS 554 LLVLRTDL +D+ SSD+CC+DPLSGSV+SSF F+LGE GKCME V+ G EQ+L+VGT LS Sbjct: 942 LLVLRTDLSDDLCSSDVCCIDPLSGSVLSSFKFELGEIGKCMELVKAGNEQVLVVGTGLS 1001 Query: 555 SGPAIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTER 731 SGPAIMPSGEAESTKGRLI+L + + NSDSGS+ +AGSS+QR SP+ E GY E+ Sbjct: 1002 SGPAIMPSGEAESTKGRLIVLCVEQMQNSDSGSIAFSSRAGSSSQRTSPFREVGGYAAEQ 1061 Query: 732 MSNSSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFY 911 +S+SS+CSSPDDNSCD +KLE+SEAW L L ++ TWPG+VL++CPYLD +FLAS+ N FY Sbjct: 1062 
LSSSSICSSPDDNSCDGIKLEESEAWHLRLGYSTTWPGMVLAVCPYLDRFFLASAANCFY 1121 Query: 912 VCGFQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYC 1091 VCGF NDN +R++R AV RTRFMI++LTA+FTRIAVGDCRDG+LFYSY ED++KL+QIYC Sbjct: 1122 VCGFPNDNAQRVRRLAVGRTRFMIMTLTAHFTRIAVGDCRDGILFYSYQEDSRKLDQIYC 1181 Query: 1092 DPGQRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLE-DN-ASPECNLSVSCSYYIGEIA 1265 DP QRLV+DC L + DTA VSDRKGS A+L+ ++E DN SPE NL+ +CS+Y+GEIA Sbjct: 1182 DPVQRLVSDCTLMDGDTAAVSDRKGSFAILSCLNYMEADNFNSPERNLAQTCSFYMGEIA 1241 Query: 1266 MSIRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKP 1445 + IRKGSFSYKL A+D L+GC ++ + D+S+N IMA TLLGSI+IFIP++REEY+LL+ Sbjct: 1242 IRIRKGSFSYKLPADDALRGCQATSIVGDISQNSIMASTLLGSIIIFIPLTREEYDLLEA 1301 Query: 1446 VQARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPC- 1622 VQARL +HPLTAPILGN+H+E+R R + VP LDGDMLAQFLELTS+QQEAVL LP Sbjct: 1302 VQARLVIHPLTAPILGNDHTEYRCRGSMARVPKALDGDMLAQFLELTSMQQEAVLALPLG 1361 Query: 1623 ATSEVXXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 A + + P++VNQVV+LLER+HY LN Sbjct: 1362 AQNTIMFNSKQSPDPITVNQVVRLLERIHYALN 1394 >ref|XP_004303372.1| PREDICTED: pre-mRNA-splicing factor rse-1-like [Fragaria vesca subsp. 
vesca] Length = 1396 Score = 788 bits (2035), Expect = 0.0 Identities = 399/574 (69%), Positives = 469/574 (81%), Gaps = 5/574 (0%) Frame = +3 Query: 15 EKPKDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISIS 194 E KD PV LQLIA+RRIGITPVFLVPLS+SLD D+I LSDRPWL+ ARHSLSY SIS Sbjct: 826 ENIKDKFPVDLQLIAIRRIGITPVFLVPLSDSLDGDIIVLSDRPWLLHTARHSLSYTSIS 885 Query: 195 FEPSTYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRL 374 F+ ST+VTPVC ECPKG+LFVAENCLHLVEMV+SKRLNVQK LGGTPR+V YHS+SRL Sbjct: 886 FQSSTHVTPVCYVECPKGILFVAENCLHLVEMVHSKRLNVQKLQLGGTPRRVFYHSESRL 945 Query: 375 LLVLRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLS 554 L+V+RT+L +D SDICCVDPLSGSV+SSF + GETGK ME +RVG EQ+LLVGTSLS Sbjct: 946 LIVMRTNLSDDTCLSDICCVDPLSGSVLSSFKLEFGETGKSMELMRVGSEQVLLVGTSLS 1005 Query: 555 SGPAIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTER 731 SG AIMP GEAESTKGRLI+L + + NSDSGSMT KAGSS+ R SP+ E GY E+ Sbjct: 1006 SGSAIMPCGEAESTKGRLIVLCLENMQNSDSGSMTFSSKAGSSSLRASPFHEIVGYAAEQ 1065 Query: 732 MSNSSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFY 911 +S+SSLCSSPDD SCD +KLE++E W L ++ WPG+VL+ICPYLD YFLAS+GNAFY Sbjct: 1066 LSSSSLCSSPDDTSCDGIKLEETETWQFRLAFSMPWPGMVLAICPYLDRYFLASAGNAFY 1125 Query: 912 VCGFQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYC 1091 +CGF ++N +R+K+ AV RTRF I SLTA+FTRI VGDCRDG+LFY Y+ED+KKL+Q+YC Sbjct: 1126 LCGFPHENSQRVKKWAVARTRFTITSLTAHFTRIVVGDCRDGILFYDYNEDSKKLQQLYC 1185 Query: 1092 DPGQRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLED---NASPECNLSVSCSYYIGEI 1262 DP QRLV DC+L +++TA VSDRKGSIAVL+ + +LE ASPECNL+VSC+YY+GEI Sbjct: 1186 DPYQRLVGDCILMDVNTAVVSDRKGSIAVLSCADYLEGKHYTASPECNLTVSCAYYMGEI 1245 Query: 1263 AMSIRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLK 1442 AMSI+KGSFSYKL A+D +KG D S ID ++NGI+ TLLGSI+ F+PISREEYELL+ Sbjct: 1246 AMSIKKGSFSYKLPADDAMKGGDGS---IDFAQNGIIVSTLLGSIITFVPISREEYELLE 1302 Query: 1443 PVQARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLP- 1619 VQ RLAVHPLTAPILGN+H+EFRSREN + VP ILD DML QFLELTS+QQEAVL P Sbjct: 1303 
AVQDRLAVHPLTAPILGNDHNEFRSRENPVGVPKILDADMLTQFLELTSVQQEAVLSSPI 1362 Query: 1620 CATSEVXXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 C S V +PV VNQVVQLLERVHY LN Sbjct: 1363 CVRSTVKSRLKFRSSPVPVNQVVQLLERVHYALN 1396 >ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-like [Cucumis sativus] Length = 1376 Score = 784 bits (2025), Expect = 0.0 Identities = 396/571 (69%), Positives = 473/571 (82%), Gaps = 2/571 (0%) Frame = +3 Query: 15 EKPKDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISIS 194 EK +D P LQLIA+RRIGITPVFLVPL++ LD D+IALSDRPWL+ +ARHSLSY SIS Sbjct: 806 EKHEDEIPSCLQLIAIRRIGITPVFLVPLTDRLDSDIIALSDRPWLLHSARHSLSYTSIS 865 Query: 195 FEPSTYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRL 374 F+PST+VTPVCSA+CP GLLFVAE+ LHLVEMV++KRLNVQKFHLGGTPRKVLYHS+S+L Sbjct: 866 FQPSTHVTPVCSADCPSGLLFVAESSLHLVEMVHTKRLNVQKFHLGGTPRKVLYHSESKL 925 Query: 375 LLVLRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLS 554 LLV+RT L ND SSDICCVDPLSGS++SS ++GETGK ME VR G EQ+L+VGTSLS Sbjct: 926 LLVMRTQLINDTSSSDICCVDPLSGSILSSHKLEIGETGKSMELVRNGNEQVLVVGTSLS 985 Query: 555 SGPAIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTER 731 SGPAIM SGEAESTKGRLI+L + + NSD+GSMT C KAG S+ + SP+ E GY TE+ Sbjct: 986 SGPAIMASGEAESTKGRLIVLCLEHVQNSDTGSMTFCSKAGLSSLQASPFREIVGYATEQ 1045 Query: 732 MSNSSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFY 911 +S+SSLCSSPDD S D +KLE++EAW L ++++ + PG+VL+ICPYLD YFLAS+GNAFY Sbjct: 1046 LSSSSLCSSPDDASSDGIKLEETEAWQLRVVYSTSLPGMVLAICPYLDRYFLASAGNAFY 1105 Query: 912 VCGFQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYC 1091 VCGF ND+ +R+KR AV RTRFMI SLTA+ RIAVGDCRDG+LF+SY EDAKKLEQIY Sbjct: 1106 VCGFPNDSFQRVKRFAVGRTRFMITSLTAHVNRIAVGDCRDGILFFSYQEDAKKLEQIYS 1165 Query: 1092 DPGQRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMS 1271 DP QRLVADC L ++DTA VSDRKGSIA+L+ S LEDNASPECNL+++C+YY+GEIAM+ Sbjct: 1166 DPSQRLVADCTLLDVDTAVVSDRKGSIAILSCSDRLEDNASPECNLTLNCAYYMGEIAMT 1225 Query: 1272 IRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQ 1451 +RKGSFSYKL 
A+D L+GC + D S N I+A TLLGSIVIF P+SR+EYELL+ VQ Sbjct: 1226 LRKGSFSYKLPADDLLRGCAVPGSDFDSSHNTIIASTLLGSIVIFTPLSRDEYELLEAVQ 1285 Query: 1452 ARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPCAT- 1628 A+LAVHPLT+PILGN+H E+RSREN I VP ILDGD+L QFLELTS+QQE VL + Sbjct: 1286 AKLAVHPLTSPILGNDHYEYRSRENPIGVPKILDGDILTQFLELTSMQQELVLSSSVGSL 1345 Query: 1629 SEVXXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 S V A + +NQVVQLLER+HY LN Sbjct: 1346 SAVKPSSKSMPASIPINQVVQLLERIHYALN 1376 >ref|XP_002531586.1| spliceosomal protein sap, putative [Ricinus communis] gi|223528782|gb|EEF30789.1| spliceosomal protein sap, putative [Ricinus communis] Length = 1220 Score = 752 bits (1941), Expect = 0.0 Identities = 374/503 (74%), Positives = 432/503 (85%), Gaps = 1/503 (0%) Frame = +3 Query: 27 DNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFEPS 206 D P+ LQLIA RRIG+TPVFLVPL++SLD D+IALSDRPWL+Q ARH LSY SISF+PS Sbjct: 711 DGPPINLQLIATRRIGVTPVFLVPLTDSLDADMIALSDRPWLLQTARHGLSYTSISFQPS 770 Query: 207 TYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLLVL 386 T+ TPVCS ECPKGLLFVAEN LHLVEMV+SKRLNVQKFHLGGTPRKVLYHS+SRLLLV+ Sbjct: 771 THSTPVCSVECPKGLLFVAENSLHLVEMVHSKRLNVQKFHLGGTPRKVLYHSESRLLLVM 830 Query: 387 RTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSGPA 566 RT+L ND SSDICCVDPLSGSVVSSF + GETGK ME VRVG EQ+L+VGTSLSSGPA Sbjct: 831 RTELSNDTCSSDICCVDPLSGSVVSSFKLEHGETGKSMELVRVGTEQVLVVGTSLSSGPA 890 Query: 567 IMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMSNS 743 IMPSGEAESTKGRLI+L + L +SDSGSMT C KAGSS+QR SP+CE GYT E++S+S Sbjct: 891 IMPSGEAESTKGRLIVLCLEHLQSSDSGSMTFCSKAGSSSQRTSPFCEVVGYTAEQLSSS 950 Query: 744 SLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVCGF 923 SLCSSPDD SCD +KLE+SEAW L L +A WPG+ L+ICPYLD YFLAS+G+AFYVCGF Sbjct: 951 SLCSSPDD-SCDGVKLEESEAWQLRLAYATKWPGMALTICPYLDRYFLASAGSAFYVCGF 1009 Query: 924 QNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDPGQ 1103 NDN +R+++ A+ RTRF I+SLTA+FTRIAVGDCRDG+LFYSYHED +KLEQ+YCDP Q Sbjct: 1010 
PNDNPQRVRKFAIARTRFTIISLTAHFTRIAVGDCRDGILFYSYHEDTRKLEQVYCDPSQ 1069 Query: 1104 RLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMSIRKG 1283 RLVADC+L ++DTA VSDRKGSIAVL+ S E NASPECNL+++C+YY+GEIAMSIRKG Sbjct: 1070 RLVADCILLDVDTAVVSDRKGSIAVLSCSGDSERNASPECNLTLTCAYYMGEIAMSIRKG 1129 Query: 1284 SFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQARLA 1463 SFSY+L A+D L G D S N IMA TLLGSI+IFIP++REE+ELL+ VQARL Sbjct: 1130 SFSYRLPADDMLMGYDAVTPNNYASHNTIMASTLLGSIIIFIPLTREEHELLEAVQARLV 1189 Query: 1464 VHPLTAPILGNNHSEFRSRENQI 1532 VHPLTAPILGN+HSEFRSREN + Sbjct: 1190 VHPLTAPILGNDHSEFRSRENPV 1212 >ref|XP_006296833.1| hypothetical protein CARUB_v10012818mg [Capsella rubella] gi|482565542|gb|EOA29731.1| hypothetical protein CARUB_v10012818mg [Capsella rubella] Length = 1368 Score = 749 bits (1933), Expect = 0.0 Identities = 379/570 (66%), Positives = 461/570 (80%), Gaps = 4/570 (0%) Frame = +3 Query: 24 KDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFEP 203 +D+ P+ L LIA RRIGITPVFLVP S+SLD D+IALSDRPWL+Q AR SLSY SISF+P Sbjct: 801 RDDLPINLLLIATRRIGITPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQP 860 Query: 204 STYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLLV 383 ST+ TPVCS+ECP+G+LFV+ENCLHLVEMV+SKRLN QKFHLGGTPRKV+YHS+S+LL+V Sbjct: 861 STHATPVCSSECPQGVLFVSENCLHLVEMVHSKRLNAQKFHLGGTPRKVIYHSESKLLIV 920 Query: 384 LRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSGP 563 +RTDL D +SDICCVDPLSGSV+SS+ GETGK ME VRVG E +L+VGTSLSSGP Sbjct: 921 MRTDL-YDTCTSDICCVDPLSGSVLSSYKLKPGETGKSMELVRVGNEHVLVVGTSLSSGP 979 Query: 564 AIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMSN 740 AI+PSGEAESTKGRLI+L + HNSDSGSMT C KAGSS+QR SP+ + GY +E++S+ Sbjct: 980 AILPSGEAESTKGRLIILSLEHTHNSDSGSMTICSKAGSSSQRTSPFRDVVGYASEQLSS 1039 Query: 741 SSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVCG 920 SSLCSSPDDNS D +KL+++E W L L + TWPG+VL+ICPYLD+YFLAS+GNAFYVCG Sbjct: 1040 SSLCSSPDDNSYDGIKLDEAETWQLRLASSTTWPGMVLAICPYLDHYFLASAGNAFYVCG 1099 Query: 921 
FQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDPG 1100 F NDN +R+KR AV RTRFMI SL YFTRI VGDCRDGVLFYSYHED+KKL QIYCDP Sbjct: 1100 FPNDNPERMKRFAVGRTRFMITSLRTYFTRIVVGDCRDGVLFYSYHEDSKKLLQIYCDPA 1159 Query: 1101 QRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLE-DNASPECNLSVSCSYYIGEIAMSIR 1277 QRLVADC L + ++ VSDRKGSIA+L+ H + + +SPE NL+++C+Y++GEIAM+I+ Sbjct: 1160 QRLVADCFLMDGNSVAVSDRKGSIAILSCKDHSDFEYSSPESNLNLNCAYFMGEIAMAIK 1219 Query: 1278 KGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQAR 1457 KG YKL A+DGL+ + + I+ + + I+AGTLLGSI +F PIS EEYELLK VQA+ Sbjct: 1220 KGCNIYKLPADDGLQS-NGLSKSINTADDTIIAGTLLGSIFVFAPISSEEYELLKAVQAK 1278 Query: 1458 LAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGL--PCATS 1631 L +HPLTAP+LGN+H EFR RENQ ILDGDMLAQFLELT+ QQE+VL P ++ Sbjct: 1279 LGIHPLTAPVLGNDHKEFRGRENQSQATKILDGDMLAQFLELTNRQQESVLSTPQPSQST 1338 Query: 1632 EVXXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 P+ ++QVVQLLERVHY L+ Sbjct: 1339 SKASSKQLSFPPLMLHQVVQLLERVHYALH 1368 >ref|XP_006407388.1| hypothetical protein EUTSA_v10019900mg [Eutrema salsugineum] gi|557108534|gb|ESQ48841.1| hypothetical protein EUTSA_v10019900mg [Eutrema salsugineum] Length = 1367 Score = 748 bits (1932), Expect = 0.0 Identities = 379/570 (66%), Positives = 459/570 (80%), Gaps = 4/570 (0%) Frame = +3 Query: 24 KDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFEP 203 +DN P+ L LIA RRIGITPVFLVP S+SLD D+IALSDRPWL+Q AR SLSY SISF+P Sbjct: 800 RDNLPIDLLLIATRRIGITPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQP 859 Query: 204 STYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLLV 383 ST+ TPVCS+ECP+G+LFVAENCLHLVEMV+SKRLN QKFHLGGTPRKVLYHS+S+LL+V Sbjct: 860 STHATPVCSSECPQGILFVAENCLHLVEMVHSKRLNAQKFHLGGTPRKVLYHSESKLLIV 919 Query: 384 LRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSGP 563 +RTDL D +SDICCVDPLSGS++SS+ GETGK ME +RVG EQ+L+VGTSLSSGP Sbjct: 920 MRTDL-YDACTSDICCVDPLSGSLLSSYKLKPGETGKSMELLRVGNEQVLVVGTSLSSGP 978 Query: 564 AIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMSN 740 AI+PSGEAESTKGRLI+L + + 
NSDSGS+T C KAGSS+QR SP+ + G+TTE++S+ Sbjct: 979 AILPSGEAESTKGRLIILYLEHIQNSDSGSITICSKAGSSSQRTSPFRDVAGFTTEQLSS 1038 Query: 741 SSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVCG 920 SSLCSSPDDNS D +KL+++E W L L A TWPG+VL+ICPYLDNYFLAS+GNAFYVCG Sbjct: 1039 SSLCSSPDDNSYDGIKLDEAETWQLRLASATTWPGMVLAICPYLDNYFLASAGNAFYVCG 1098 Query: 921 FQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDPG 1100 F ND+ +R+KR AV RTRFMI SL YFTRI VGDCRDGVLFYSYHED KKL QIYCDP Sbjct: 1099 FPNDSPERMKRFAVGRTRFMITSLRTYFTRIVVGDCRDGVLFYSYHEDVKKLHQIYCDPA 1158 Query: 1101 QRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLE-DNASPECNLSVSCSYYIGEIAMSIR 1277 QRLVADC L + ++ VSDRKGS+A+L+ H + + +SPE NL+++C+YY+GEIAM+I+ Sbjct: 1159 QRLVADCFLMDANSVAVSDRKGSVAILSCKDHSDFEYSSPESNLNLNCAYYMGEIAMAIK 1218 Query: 1278 KGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQAR 1457 KG YKL A+D L+ + ID + + I+AGTL+GSI +F PISREEYELL+ VQ + Sbjct: 1219 KGCNIYKLPADDVLRSYGPCKS-IDAADDTIIAGTLMGSIYVFAPISREEYELLEAVQEK 1277 Query: 1458 LAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGL--PCATS 1631 L VHPLTAP+LGN+H EFR REN ILDGDMLAQFLELT+ QQE+VL P ++ Sbjct: 1278 LVVHPLTAPVLGNDHEEFRGRENPSQATKILDGDMLAQFLELTNRQQESVLATPQPLPST 1337 Query: 1632 EVXXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 P+ ++QVVQLLERVHY L+ Sbjct: 1338 SKASLKQRSSPPLMLHQVVQLLERVHYALH 1367 >ref|XP_006577113.1| PREDICTED: splicing factor 3B subunit 3-like isoform X2 [Glycine max] Length = 1373 Score = 748 bits (1930), Expect = 0.0 Identities = 380/568 (66%), Positives = 459/568 (80%), Gaps = 2/568 (0%) Frame = +3 Query: 24 KDND-PVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFE 200 K ND P +LQLIA+RRIGITPVFLVPL ++LD D+I LSDRPWL+ +ARHSLSY SISF+ Sbjct: 807 KRNDFPSMLQLIAIRRIGITPVFLVPLGDTLDADIITLSDRPWLLHSARHSLSYSSISFQ 866 Query: 201 PSTYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLL 380 PST+VTPVCS ECPKG+LFVAEN LHLVEMV+SKRLN+QKFHL GTPRKVLYH +S++LL Sbjct: 867 PSTHVTPVCSVECPKGILFVAENSLHLVEMVHSKRLNMQKFHLEGTPRKVLYHDESKMLL 926 Query: 381 
VLRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSG 560 V+RT+L+ SDIC +DPLSGSV+SSF +LGETGK ME VRVG EQ+L+VGTSLSSG Sbjct: 927 VMRTELNCGTCLSDICIMDPLSGSVLSSFRLELGETGKSMELVRVGSEQVLVVGTSLSSG 986 Query: 561 PAIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMS 737 P M +GEAES KGRL++L D + NSDSGS+T C KAGSS+Q+ SP+ E Y E++S Sbjct: 987 PHTMATGEAESCKGRLLVLCLDHVQNSDSGSVTFCSKAGSSSQKTSPFREIVTYAPEQLS 1046 Query: 738 NSSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVC 917 +SSL SSPDDNS D +KL+++E W L A WPGVVL ICPYLD YFLA++GNAFYVC Sbjct: 1047 SSSLGSSPDDNSSDGIKLDENEVWQFRLTFATKWPGVVLKICPYLDRYFLATAGNAFYVC 1106 Query: 918 GFQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDP 1097 GF NDN +R++R+A+ R RFMI SLTA+FTRIAVGDCRDG+L YSYHE+AKKLE +Y DP Sbjct: 1107 GFPNDNPQRVRRYAMGRARFMITSLTAHFTRIAVGDCRDGILLYSYHEEAKKLELLYNDP 1166 Query: 1098 GQRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMSIR 1277 RLVADC+L + DTA VSDRKGSIAVL S HLEDNA +CN+++SC+Y++ EIAMSI+ Sbjct: 1167 SLRLVADCILMDADTAVVSDRKGSIAVLCSD-HLEDNAGAQCNMALSCAYFMAEIAMSIK 1225 Query: 1278 KGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQAR 1457 KGS+SY+L A+D L+G + +D +N I+A TLLGSI+IFIP+SREEYELL+ VQAR Sbjct: 1226 KGSYSYRLPADDVLQGGNGPKTNVDSLQNTIIATTLLGSIMIFIPLSREEYELLEAVQAR 1285 Query: 1458 LAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPCATSEV 1637 L VH LTAP+LGN+H+EFRSREN++ VP ILDGDML QFLELTS+QQ+ +L L Sbjct: 1286 LVVHHLTAPVLGNDHNEFRSRENRVGVPKILDGDMLTQFLELTSMQQKMILSLELPDMVK 1345 Query: 1638 XXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 + VSVNQVVQLLERVHY LN Sbjct: 1346 PSLKPLLPSHVSVNQVVQLLERVHYALN 1373 >gb|ESW35025.1| hypothetical protein PHAVU_001G200200g [Phaseolus vulgaris] Length = 1362 Score = 745 bits (1923), Expect = 0.0 Identities = 371/563 (65%), Positives = 456/563 (80%), Gaps = 1/563 (0%) Frame = +3 Query: 36 PVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFEPSTYV 215 P+ LQLIA+RRIGITPVFLVPL ++LD D+IALSDRPWL+ +ARHSLSY SISF+PST+V Sbjct: 801 
PLTLQLIAIRRIGITPVFLVPLGDTLDADIIALSDRPWLLHSARHSLSYTSISFQPSTHV 860 Query: 216 TPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLLVLRTD 395 TPVCS ECPKG+LFVAENCLHLVEMV+SKRLN+QKFHL GTPRKVLYH +S++LLV+RT+ Sbjct: 861 TPVCSVECPKGILFVAENCLHLVEMVHSKRLNMQKFHLEGTPRKVLYHDESKMLLVMRTE 920 Query: 396 LDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSGPAIMP 575 L+ SDICCVDPLSGSV+SSF +LGETGK ME VRVG EQ+L+VGTSLSSGPA+MP Sbjct: 921 LNCGTCLSDICCVDPLSGSVLSSFRLELGETGKSMELVRVGSEQVLIVGTSLSSGPAVMP 980 Query: 576 SGEAESTKGRLIML-RFDLHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMSNSSLC 752 SGEAES KGRL++L + NSDSGSMT C KAGSS+Q+ SP+ E Y E++S+SSL Sbjct: 981 SGEAESCKGRLLVLCLVHVQNSDSGSMTFCSKAGSSSQKTSPFHEIVSYAPEQLSSSSLG 1040 Query: 753 SSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVCGFQND 932 SSPDDNS D +KL+++E W L +A W GVV ICPYLD YFLAS+GN FYVCGF ND Sbjct: 1041 SSPDDNSSDGIKLDENEVWQFRLAYARKWQGVVFKICPYLDRYFLASAGNTFYVCGFLND 1100 Query: 933 NLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDPGQRLV 1112 N +R++R+A+ RT MI SL+A+FTRIAVGDCRDG++ +SYHE+++KLEQ+ CDP +RLV Sbjct: 1101 NPQRVRRYAMGRTHHMITSLSAHFTRIAVGDCRDGIILFSYHEESRKLEQLCCDPSRRLV 1160 Query: 1113 ADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMSIRKGSFS 1292 ADC+L + DTA VSDRKG IA+L S+ HLEDNAS ECN+++SC+Y++ EIA+S++KGS+S Sbjct: 1161 ADCILMDADTAVVSDRKGGIAILCSN-HLEDNASTECNMTLSCAYFMAEIALSVQKGSYS 1219 Query: 1293 YKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQARLAVHP 1472 Y+L A+D L+G + +D +N I+A TLLGSI+IFIP+SREEYELL+ VQ RL VH Sbjct: 1220 YRLPADDVLQGGNGPKTNVDSLQNTIIASTLLGSIMIFIPLSREEYELLEAVQERLVVHQ 1279 Query: 1473 LTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPCATSEVXXXXX 1652 LTAP+LGN+H+EFRSRE + VP ILDGD+L QFLELTS+QQ+ +L Sbjct: 1280 LTAPVLGNDHNEFRSRETRGGVPKILDGDVLTQFLELTSMQQKMILSSEPPDIAKPSLKP 1339 Query: 1653 XXHAPVSVNQVVQLLERVHYVLN 1721 VSVNQVVQLLERVHY LN Sbjct: 1340 LLSPHVSVNQVVQLLERVHYALN 1362 >ref|XP_002882757.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297328597|gb|EFH59016.1| predicted protein [Arabidopsis lyrata subsp. 
lyrata] Length = 1384 Score = 738 bits (1904), Expect = 0.0 Identities = 379/583 (65%), Positives = 457/583 (78%), Gaps = 17/583 (2%) Frame = +3 Query: 24 KDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFEP 203 KDN P+ L LIA RRIGITPVFLVP S+SLD D+IALSDRPWL+Q AR SLSY SISF+P Sbjct: 804 KDNLPINLLLIATRRIGITPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQP 863 Query: 204 STYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLLV 383 ST+ TPVCS+ECP+G+LFV+ENCLHLVEMV+SKR N QKFHLGGTPRKV+YHS+S+LL+V Sbjct: 864 STHATPVCSSECPQGILFVSENCLHLVEMVHSKRRNAQKFHLGGTPRKVIYHSESKLLIV 923 Query: 384 LRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSGP 563 +RTDL D +SDICCVDPLSGSV+SS+ GETGK ME VRVG E +L+VGTSLSSGP Sbjct: 924 MRTDL-YDTCTSDICCVDPLSGSVLSSYKLKPGETGKSMELVRVGNEHVLVVGTSLSSGP 982 Query: 564 AIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMSN 740 AI+PSGEAESTKGRLI+L + NSDSGSMT C KAGSS+QR SP+ + GYTTE++S+ Sbjct: 983 AILPSGEAESTKGRLIILCLEHTQNSDSGSMTICSKAGSSSQRTSPFRDVVGYTTEQLSS 1042 Query: 741 SSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVCG 920 SS CSSPDDNS D +K +++E W L L A TWPG+VL+ICPYLD+YFLAS+GNAFYVCG Sbjct: 1043 SSHCSSPDDNSYDGIKFDEAETWQLRLASATTWPGMVLAICPYLDHYFLASAGNAFYVCG 1102 Query: 921 FQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDPG 1100 F ND+ +R+KR AV RTRFMI SL YFTRI VGDCRDGVLFYSYHE++KKL QIYCDP Sbjct: 1103 FPNDSPERMKRFAVGRTRFMITSLRTYFTRIVVGDCRDGVLFYSYHEESKKLHQIYCDPA 1162 Query: 1101 QRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLE--------------DNASPECNLSVS 1238 QRLVADC L + ++ VSDRKGSIA+L+ H E + +SPE NL+++ Sbjct: 1163 QRLVADCFLMDANSVAVSDRKGSIAILSCQDHSEFGTKHLAFSPRDDPEYSSPESNLNLN 1222 Query: 1239 CSYYIGEIAMSIRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPIS 1418 C+YY+GEIAM+I+KG YKL A+D L+ S + ID + + I+AGTLLGSI +F PIS Sbjct: 1223 CAYYMGEIAMAIKKGCNIYKLPADDVLRSYGLSKS-IDTADDTIIAGTLLGSIFVFAPIS 1281 Query: 1419 REEYELLKPVQARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQ 1598 EEYELL+ VQA+L +HPLTAP+LGN+H+EFR REN ILDGDMLAQFLELT+ QQ Sbjct: 1282 
SEEYELLEAVQAKLGIHPLTAPVLGNDHNEFRGRENPSQATKILDGDMLAQFLELTNRQQ 1341 Query: 1599 EAVL--GLPCATSEVXXXXXXXHAPVSVNQVVQLLERVHYVLN 1721 E+VL P ++ P+ ++QVVQLLERVHY L+ Sbjct: 1342 ESVLLTPQPSPSTSKASSKQRSSPPLMLHQVVQLLERVHYALH 1384 >ref|NP_187802.2| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein [Arabidopsis thaliana] gi|29824376|gb|AAP04148.1| unknown protein [Arabidopsis thaliana] gi|110739103|dbj|BAF01468.1| hypothetical protein [Arabidopsis thaliana] gi|332641608|gb|AEE75129.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein [Arabidopsis thaliana] Length = 1379 Score = 733 bits (1891), Expect = 0.0 Identities = 377/575 (65%), Positives = 454/575 (78%), Gaps = 9/575 (1%) Frame = +3 Query: 24 KDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFEP 203 KDN PV L LIA RRIGITPVFLVP S+SLD D+IALSDRPWL+Q AR SLSY SISF+P Sbjct: 807 KDNLPVNLLLIATRRIGITPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQP 866 Query: 204 STYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLLV 383 ST+ TPVCS ECP+G+LFV+ENCLHLVEMV+SKR N QKF LGGTPRKV+YHS+S+LL+V Sbjct: 867 STHATPVCSFECPQGILFVSENCLHLVEMVHSKRRNAQKFQLGGTPRKVIYHSESKLLIV 926 Query: 384 LRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSGP 563 +RTDL D +SDICCVDPLSGSV+SS+ GETGK ME VRVG E +L+VGTSLSSGP Sbjct: 927 MRTDL-YDTCTSDICCVDPLSGSVLSSYKLKPGETGKSMELVRVGNEHVLVVGTSLSSGP 985 Query: 564 AIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMSN 740 AI+PSGEAESTKGR+I+L + NSDSGSMT C KA SS+QR SP+ + GYTTE +S+ Sbjct: 986 AILPSGEAESTKGRVIILCLEHTQNSDSGSMTICSKACSSSQRTSPFHDVVGYTTENLSS 1045 Query: 741 SSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVCG 920 SSLCSSPDD S D +KL+++E W L L + TWPG+VL+ICPYLD+YFLAS+GNAFYVCG Sbjct: 1046 SSLCSSPDDYSYDGIKLDEAETWQLRLASSTTWPGMVLAICPYLDHYFLASAGNAFYVCG 1105 Query: 921 FQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDPG 1100 F ND+ +R+KR AV RTRFMI SL YFTRI VGDCRDGVLFYSYHE++KKL QIYCDP Sbjct: 1106 FPNDSPERMKRFAVGRTRFMITSLRTYFTRIVVGDCRDGVLFYSYHEESKKLHQIYCDPA 
1165 Query: 1101 QRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLE------DNASPECNLSVSCSYYIGEI 1262 QRLVADC L + ++ VSDRKGSIA+L+ H + + +SPE NL+++C+YY+GEI Sbjct: 1166 QRLVADCFLMDANSVAVSDRKGSIAILSCKDHSDFGMKHLEYSSPESNLNLNCAYYMGEI 1225 Query: 1263 AMSIRKGSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLK 1442 AMSI+KG YKL A+D L+ S + ID + + I+AGTLLGSI +F PIS EEYELL+ Sbjct: 1226 AMSIKKGCNIYKLPADDVLRSYGLSKS-IDTADDTIIAGTLLGSIFVFAPISSEEYELLE 1284 Query: 1443 PVQARLAVHPLTAPILGNNHSEFRSRENQILVPTILDGDMLAQFLELTSIQQEAVLGLPC 1622 VQA+L +HPLTAP+LGN+H+EFR REN ILDGDMLAQFLELT+ QQE+VL P Sbjct: 1285 GVQAKLGIHPLTAPVLGNDHNEFRGRENPSQARKILDGDMLAQFLELTNRQQESVLSTPQ 1344 Query: 1623 ATSEVXXXXXXXHA--PVSVNQVVQLLERVHYVLN 1721 + + P+ ++QVVQLLERVHY L+ Sbjct: 1345 PSPSTSKASSKQRSFPPLMLHQVVQLLERVHYALH 1379 >gb|EOY09619.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein isoform 2, partial [Theobroma cacao] Length = 1237 Score = 731 bits (1887), Expect = 0.0 Identities = 360/502 (71%), Positives = 426/502 (84%), Gaps = 1/502 (0%) Frame = +3 Query: 24 KDNDPVLLQLIAVRRIGITPVFLVPLSESLDDDVIALSDRPWLVQAARHSLSYISISFEP 203 KD+ P+ LQLIA RRIGITPVFLVPLS+SLD D+IALSDRPWL+ ARHSLSY SISF+P Sbjct: 736 KDDLPINLQLIATRRIGITPVFLVPLSDSLDADIIALSDRPWLLHTARHSLSYTSISFQP 795 Query: 204 STYVTPVCSAECPKGLLFVAENCLHLVEMVYSKRLNVQKFHLGGTPRKVLYHSDSRLLLV 383 ST+ TPVCSAECPKG+LFV EN LHLVEMV+ RLNVQKFHLGGTPRKVLYHS+S+LL+V Sbjct: 796 STHATPVCSAECPKGILFVTENSLHLVEMVHGNRLNVQKFHLGGTPRKVLYHSESKLLIV 855 Query: 384 LRTDLDNDMFSSDICCVDPLSGSVVSSFHFDLGETGKCMEFVRVGIEQILLVGTSLSSGP 563 +RTDL ND SSDICCVDPL+ SVV+SF +LGETGKCME VR G EQ+L+VGTSLS GP Sbjct: 856 MRTDLSNDTCSSDICCVDPLTVSVVASFKLELGETGKCMELVRAGNEQVLVVGTSLSPGP 915 Query: 564 AIMPSGEAESTKGRLIMLRFD-LHNSDSGSMTTCLKAGSSTQRYSPYCEGTGYTTERMSN 740 AIMPSGEAESTKGRLI+L + + NSDSGSMT AGSS+QR SP+CE G+ E++S+ Sbjct: 916 AIMPSGEAESTKGRLIVLCIEHVQNSDSGSMTFSSMAGSSSQRNSPFCEIVGHANEQLSS 975 Query: 741 SSLCSSPDDNSCDEMKLEDSEAWSLELIHAITWPGVVLSICPYLDNYFLASSGNAFYVCG 920 SS+CSSPDD SCD +KLE++EAW L L +A TWP +VL+ICPYLD+YFLAS+GN 
FYVC Sbjct: 976 SSICSSPDDTSCDGIKLEETEAWQLRLAYATTWPAMVLAICPYLDHYFLASAGNTFYVCA 1035 Query: 921 FQNDNLKRLKRHAVERTRFMIVSLTAYFTRIAVGDCRDGVLFYSYHEDAKKLEQIYCDPG 1100 F + N +R++R A+ RTRFMI+SLTA+ TRIAVGDCRDG+LFYSYHE+ KKL+Q YCDP Sbjct: 1036 FLSGNPQRVRRFALARTRFMIMSLTAHSTRIAVGDCRDGILFYSYHEETKKLDQTYCDPS 1095 Query: 1101 QRLVADCLLTNLDTAFVSDRKGSIAVLTSSTHLEDNASPECNLSVSCSYYIGEIAMSIRK 1280 QRLVADC+LT++DTA VSDRKGS+AVL+ S LEDNASPE NL+++ +YY+GEIAMSIRK Sbjct: 1096 QRLVADCVLTDVDTAVVSDRKGSVAVLSCSDRLEDNASPERNLTLTSAYYMGEIAMSIRK 1155 Query: 1281 GSFSYKLAAEDGLKGCDNSNNIIDMSRNGIMAGTLLGSIVIFIPISREEYELLKPVQARL 1460 GSF YKL A+D L C+ N +D S IMA TLLGSI+IFIPISREE+ELL+ VQARL Sbjct: 1156 GSFIYKLPADDMLNSCEGLNASVDPSHGTIMASTLLGSIMIFIPISREEHELLEAVQARL 1215 Query: 1461 AVHPLTAPILGNNHSEFRSREN 1526 VHPLTAP+LGN+H+E+RS EN Sbjct: 1216 IVHPLTAPVLGNDHNEYRSCEN 1237