BLASTX nr result
ID: Rehmannia28_contig00007497
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00007497
         (1994 letters)

Database: ./nr
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_011101139.1| PREDICTED: cleavage and polyadenylation spec...  1157   0.0
ref|XP_011101138.1| PREDICTED: cleavage and polyadenylation spec...  1157   0.0
ref|XP_012858363.1| PREDICTED: cleavage and polyadenylation spec...  1132   0.0
ref|XP_012858362.1| PREDICTED: cleavage and polyadenylation spec...  1129   0.0
ref|XP_007220310.1| hypothetical protein PRUPE_ppa000211mg [Prun...  1032   0.0
emb|CBI24510.3| unnamed protein product [Vitis vinifera]             1030   0.0
ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation spec...  1030   0.0
ref|XP_008234350.1| PREDICTED: cleavage and polyadenylation spec...  1027   0.0
ref|XP_007038474.1| Cleavage and polyadenylation specificity fac...  1024   0.0
ref|XP_007038473.1| Cleavage and polyadenylation specificity fac...  1024   0.0
ref|XP_015877866.1| PREDICTED: cleavage and polyadenylation spec...  1023   0.0
emb|CDP05292.1| unnamed protein product [Coffea canephora]           1019   0.0
ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation spec...  1018   0.0
ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citr...  1017   0.0
ref|XP_012090388.1| PREDICTED: cleavage and polyadenylation spec...  1014   0.0
gb|KDO65373.1| hypothetical protein CISIN_1g0005452mg, partial [...  1014   0.0
ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation spec...  1014   0.0
ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citr...  1013   0.0
ref|XP_012484369.1| PREDICTED: cleavage and polyadenylation spec...
1011 0.0 ref|XP_012484368.1| PREDICTED: cleavage and polyadenylation spec... 1011 0.0 >ref|XP_011101139.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Sesamum indicum] Length = 1250 Score = 1157 bits (2992), Expect = 0.0 Identities = 583/664 (87%), Positives = 607/664 (91%), Gaps = 3/664 (0%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGDVYCVLCYENGNLE+ DVPN GK+HILD F HGPANDPV+LM +YS Sbjct: 573 DQGDVYCVLCYENGNLEMFDVPNFSSVFSVDKFVSGKSHILDAFFHGPANDPVQLMKRYS 632 Query: 190 ED-VGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 +D VGHGRKE H +KVVELSMQRW EHSRPFLFG+LSDGSILCYHAY++EV ENASKA Sbjct: 633 DDAVGHGRKETTHGIKVVELSMQRWAQEHSRPFLFGLLSDGSILCYHAYVYEVPENASKA 692 Query: 367 EGVXXXXXXXXXXXXXX-RLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 EGV RLKNLRFVRV L+ YAREE PSG SSQRIT+FKNV GLQGLF Sbjct: 693 EGVVSSQSSLNLSSISASRLKNLRFVRVLLDPYAREEAPSGTSSQRITVFKNVSGLQGLF 752 Query: 544 LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSGSRP WFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQL AL Sbjct: 753 LSGSRPAWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLPAL- 811 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFEHD 903 SYDN+WPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSL+DQE GNQFEHD Sbjct: 812 SYDNYWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLVDQEAGNQFEHD 871 Query: 904 -MSMEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRNETL 1080 +S EGTY VEEFEVRIMEPE+S+GPWQTRATIPMQSSENALTVRVVTLFNTTTQ NETL Sbjct: 872 NLSSEGTYPVEEFEVRIMEPEKSSGPWQTRATIPMQSSENALTVRVVTLFNTTTQGNETL 931 Query: 1081 LAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHLLLA 1260 LAIGTAYVQGEDVAARGRVLLYSVE+ SDNVQ++VSEVYSKELKGAISALASLQGHLL+A Sbjct: 932 LAIGTAYVQGEDVAARGRVLLYSVERTSDNVQAKVSEVYSKELKGAISALASLQGHLLIA 991 Query: 1261 SGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGSQLN 1440 SGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ SQLN Sbjct: 992 
SGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQVSQLN 1051 Query: 1441 LLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHV 1620 LLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPK+SESWKGQKLLSRAEFHV Sbjct: 1052 LLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKVSESWKGQKLLSRAEFHV 1111 Query: 1621 GAHITKFLRLQLLPTSADRTNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQSLQK 1800 GAHITKFLRLQLLPTSADRT PGSDKTNRFGLLFGTLDGSIGCIAPLDEL FRRLQSLQ+ Sbjct: 1112 GAHITKFLRLQLLPTSADRTTPGSDKTNRFGLLFGTLDGSIGCIAPLDELNFRRLQSLQR 1171 Query: 1801 KLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIAQQIGT 1980 KLVDAV HVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSH+EMLPLE+QLDIA QIGT Sbjct: 1172 KLVDAVPHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHYEMLPLEQQLDIANQIGT 1231 Query: 1981 TRTQ 1992 TRTQ Sbjct: 1232 TRTQ 1235 >ref|XP_011101138.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Sesamum indicum] Length = 1451 Score = 1157 bits (2992), Expect = 0.0 Identities = 583/664 (87%), Positives = 607/664 (91%), Gaps = 3/664 (0%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGDVYCVLCYENGNLE+ DVPN GK+HILD F HGPANDPV+LM +YS Sbjct: 774 DQGDVYCVLCYENGNLEMFDVPNFSSVFSVDKFVSGKSHILDAFFHGPANDPVQLMKRYS 833 Query: 190 ED-VGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 +D VGHGRKE H +KVVELSMQRW EHSRPFLFG+LSDGSILCYHAY++EV ENASKA Sbjct: 834 DDAVGHGRKETTHGIKVVELSMQRWAQEHSRPFLFGLLSDGSILCYHAYVYEVPENASKA 893 Query: 367 EGVXXXXXXXXXXXXXX-RLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 EGV RLKNLRFVRV L+ YAREE PSG SSQRIT+FKNV GLQGLF Sbjct: 894 EGVVSSQSSLNLSSISASRLKNLRFVRVLLDPYAREEAPSGTSSQRITVFKNVSGLQGLF 953 Query: 544 LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSGSRP WFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQL AL Sbjct: 954 LSGSRPAWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLPAL- 1012 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFEHD 903 SYDN+WPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSL+DQE GNQFEHD Sbjct: 1013 
SYDNYWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLVDQEAGNQFEHD 1072 Query: 904 -MSMEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRNETL 1080 +S EGTY VEEFEVRIMEPE+S+GPWQTRATIPMQSSENALTVRVVTLFNTTTQ NETL Sbjct: 1073 NLSSEGTYPVEEFEVRIMEPEKSSGPWQTRATIPMQSSENALTVRVVTLFNTTTQGNETL 1132 Query: 1081 LAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHLLLA 1260 LAIGTAYVQGEDVAARGRVLLYSVE+ SDNVQ++VSEVYSKELKGAISALASLQGHLL+A Sbjct: 1133 LAIGTAYVQGEDVAARGRVLLYSVERTSDNVQAKVSEVYSKELKGAISALASLQGHLLIA 1192 Query: 1261 SGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGSQLN 1440 SGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ SQLN Sbjct: 1193 SGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQVSQLN 1252 Query: 1441 LLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHV 1620 LLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPK+SESWKGQKLLSRAEFHV Sbjct: 1253 LLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKVSESWKGQKLLSRAEFHV 1312 Query: 1621 GAHITKFLRLQLLPTSADRTNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQSLQK 1800 GAHITKFLRLQLLPTSADRT PGSDKTNRFGLLFGTLDGSIGCIAPLDEL FRRLQSLQ+ Sbjct: 1313 GAHITKFLRLQLLPTSADRTTPGSDKTNRFGLLFGTLDGSIGCIAPLDELNFRRLQSLQR 1372 Query: 1801 KLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIAQQIGT 1980 KLVDAV HVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSH+EMLPLE+QLDIA QIGT Sbjct: 1373 KLVDAVPHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHYEMLPLEQQLDIANQIGT 1432 Query: 1981 TRTQ 1992 TRTQ Sbjct: 1433 TRTQ 1436 >ref|XP_012858363.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Erythranthe guttata] gi|604299650|gb|EYU19493.1| hypothetical protein MIMGU_mgv1a000203mg [Erythranthe guttata] Length = 1437 Score = 1132 bits (2929), Expect = 0.0 Identities = 571/665 (85%), Positives = 603/665 (90%), Gaps = 1/665 (0%) Frame = +1 Query: 1 TTQDQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMN 180 TT DQGDVY VLCYENGNLE+ DVPN GK+HILDTF HGPANDPVKLMN Sbjct: 768 TTHDQGDVYLVLCYENGNLEMFDVPNFSSVFSVDKFVSGKSHILDTFFHGPANDPVKLMN 827 Query: 181 
KYSEDVGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENAS 360 K EDVG GRKE HN+KVVEL MQRW+ E SRPFLFGILSDGSILCYHAYI+E S+NAS Sbjct: 828 KDPEDVGRGRKETAHNIKVVELCMQRWDAEQSRPFLFGILSDGSILCYHAYIYEDSDNAS 887 Query: 361 KAEGVXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGL 540 K + RL+NLRFVRV L++YAREETPSG SSQRI++FKNVGGLQGL Sbjct: 888 KTD---------LGSISSSRLRNLRFVRVCLDSYAREETPSGTSSQRISVFKNVGGLQGL 938 Query: 541 FLSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSAL 720 FLSGS P WFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFI ITSEGALKICQL AL Sbjct: 939 FLSGSSPAWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFICITSEGALKICQLPAL 998 Query: 721 TSYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFEH 900 SYDN+WPVQK+ALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQE GNQFE Sbjct: 999 -SYDNYWPVQKVALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEAGNQFEP 1057 Query: 901 D-MSMEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRNET 1077 D S EGTY +EEFE+RIMEPE+S GPWQTRATIPMQ+SENALT+RVVTLFN+TTQRNET Sbjct: 1058 DNFSSEGTYPMEEFEIRIMEPEKSAGPWQTRATIPMQTSENALTLRVVTLFNSTTQRNET 1117 Query: 1078 LLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHLLL 1257 LLAIGTAYVQGEDVAARGRVLLYSVEK+SD+ Q++V+EVYSKELKGAISALASLQGHLL+ Sbjct: 1118 LLAIGTAYVQGEDVAARGRVLLYSVEKSSDSAQTKVTEVYSKELKGAISALASLQGHLLI 1177 Query: 1258 ASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGSQL 1437 ASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGSQL Sbjct: 1178 ASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGSQL 1237 Query: 1438 NLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFH 1617 NLLAKDFGSLD LATEFLIDGSTLSLIVSD+QKNVQIFYYAPKMSESWKGQKLL RAEFH Sbjct: 1238 NLLAKDFGSLDTLATEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLPRAEFH 1297 Query: 1618 VGAHITKFLRLQLLPTSADRTNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQSLQ 1797 VGAHITKFLRLQLLPTSADRTNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQSLQ Sbjct: 1298 VGAHITKFLRLQLLPTSADRTNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQSLQ 1357 Query: 1798 KKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIAQQIG 1977 KKLVD+V+H 
AGLNPRSFRHFHSNGKAHRPGPDSIVDCELL +FEML LEEQ++IAQQIG Sbjct: 1358 KKLVDSVSHFAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLFNFEMLRLEEQIEIAQQIG 1417 Query: 1978 TTRTQ 1992 TTRTQ Sbjct: 1418 TTRTQ 1422 >ref|XP_012858362.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Erythranthe guttata] Length = 1440 Score = 1129 bits (2919), Expect = 0.0 Identities = 572/668 (85%), Positives = 603/668 (90%), Gaps = 4/668 (0%) Frame = +1 Query: 1 TTQDQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMN 180 TT DQGDVY VLCYENGNLE+ DVPN GK+HILDTF HGPANDPVKLMN Sbjct: 768 TTHDQGDVYLVLCYENGNLEMFDVPNFSSVFSVDKFVSGKSHILDTFFHGPANDPVKLMN 827 Query: 181 KYSEDVGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENAS 360 K EDVG GRKE HN+KVVEL MQRW+ E SRPFLFGILSDGSILCYHAYI+E S+NAS Sbjct: 828 KDPEDVGRGRKETAHNIKVVELCMQRWDAEQSRPFLFGILSDGSILCYHAYIYEDSDNAS 887 Query: 361 KAEGVXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGL 540 K + RL+NLRFVRV L++YAREETPSG SSQRI++FKNVGGLQGL Sbjct: 888 KTD---------LGSISSSRLRNLRFVRVCLDSYAREETPSGTSSQRISVFKNVGGLQGL 938 Query: 541 FLSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSAL 720 FLSGS P WFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFI ITSEGALKICQL AL Sbjct: 939 FLSGSSPAWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFICITSEGALKICQLPAL 998 Query: 721 TSYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFEH 900 SYDN+WPVQK+ALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQE GNQFE Sbjct: 999 -SYDNYWPVQKVALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEAGNQFEP 1057 Query: 901 D-MSMEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRNET 1077 D S EGTY +EEFE+RIMEPE+S GPWQTRATIPMQ+SENALT+RVVTLFN+TTQRNET Sbjct: 1058 DNFSSEGTYPMEEFEIRIMEPEKSAGPWQTRATIPMQTSENALTLRVVTLFNSTTQRNET 1117 Query: 1078 LLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQS---QVSEVYSKELKGAISALASLQGH 1248 LLAIGTAYVQGEDVAARGRVLLYSVEK+SD+ Q+ QV+EVYSKELKGAISALASLQGH Sbjct: 1118 LLAIGTAYVQGEDVAARGRVLLYSVEKSSDSAQTKSFQVTEVYSKELKGAISALASLQGH 1177 Query: 1249 LLLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1428 
LL+ASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG Sbjct: 1178 LLIASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1237 Query: 1429 SQLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRA 1608 SQLNLLAKDFGSLD LATEFLIDGSTLSLIVSD+QKNVQIFYYAPKMSESWKGQKLL RA Sbjct: 1238 SQLNLLAKDFGSLDTLATEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLPRA 1297 Query: 1609 EFHVGAHITKFLRLQLLPTSADRTNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQ 1788 EFHVGAHITKFLRLQLLPTSADRTNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQ Sbjct: 1298 EFHVGAHITKFLRLQLLPTSADRTNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQ 1357 Query: 1789 SLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIAQ 1968 SLQKKLVD+V+H AGLNPRSFRHFHSNGKAHRPGPDSIVDCELL +FEML LEEQ++IAQ Sbjct: 1358 SLQKKLVDSVSHFAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLFNFEMLRLEEQIEIAQ 1417 Query: 1969 QIGTTRTQ 1992 QIGTTRTQ Sbjct: 1418 QIGTTRTQ 1425 >ref|XP_007220310.1| hypothetical protein PRUPE_ppa000211mg [Prunus persica] gi|462416772|gb|EMJ21509.1| hypothetical protein PRUPE_ppa000211mg [Prunus persica] Length = 1459 Score = 1032 bits (2668), Expect = 0.0 Identities = 514/671 (76%), Positives = 576/671 (85%), Gaps = 8/671 (1%) Frame = +1 Query: 4 TQDQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNK 183 + DQGDVYCV+CYE+G+LEI DVPN G H++DT P DP KL+NK Sbjct: 774 SHDQGDVYCVVCYESGSLEIFDVPNFNCVFSVDKFVSGNAHLIDTLMRDPPKDPQKLINK 833 Query: 184 YSEDV-GHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENAS 360 SE+V G GRKE NMKVVEL+MQRW G+HSRPFLFGIL+DG ILCYHAY+FE E AS Sbjct: 834 SSEEVSGQGRKENIQNMKVVELAMQRWSGQHSRPFLFGILNDGMILCYHAYLFEGPETAS 893 Query: 361 KAE-GVXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQG 537 K E RL+NLRFVRV L+TYA+++T + S QR+TIFKN+ G QG Sbjct: 894 KTEDSASAQNTTGVSNLSASRLRNLRFVRVPLDTYAKKDTSNETSCQRMTIFKNIAGYQG 953 Query: 538 LFLSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSA 717 LFLSGSRP WFM+FRERLRIHPQ+CDG +VA TVLHNVNCNHG IY+TS+G LKICQL Sbjct: 954 LFLSGSRPAWFMVFRERLRIHPQLCDGSVVAVTVLHNVNCNHGLIYVTSQGILKICQLPP 1013 Query: 718 
LTSYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE 897 +TSYDN+WPVQKI LKGTPHQVTYFAEKNLYPLIVSVPV KPLNQVLSSL+DQEVG+Q E Sbjct: 1014 ITSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVLSSLVDQEVGHQVE 1073 Query: 898 -HDMS---MEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQ 1065 H++S + TY V+EFE+RIMEP++S GPWQT+ATIPMQ+SENALTVRVVTLFNTTT+ Sbjct: 1074 NHNLSSDELHRTYSVDEFEIRIMEPDKSGGPWQTKATIPMQTSENALTVRVVTLFNTTTK 1133 Query: 1066 RNETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQG 1245 NETLLAIGTAYVQGEDVA RGRVLL+S K++DN Q+ VSEVYSKELKGAISALASLQG Sbjct: 1134 ENETLLAIGTAYVQGEDVAGRGRVLLFSAGKSADNTQTLVSEVYSKELKGAISALASLQG 1193 Query: 1246 HLLLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1425 HLL+ASGPKIILHKW G+ELNGVAF+DVPPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQ Sbjct: 1194 HLLIASGPKIILHKWNGTELNGVAFFDVPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQ 1253 Query: 1426 GSQLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSR 1605 G+QL LLAKDFG+LDC ATEFLIDGSTLSL+V+D+QKN+QIFYYAPKMSESWKGQKLLSR Sbjct: 1254 GAQLTLLAKDFGNLDCFATEFLIDGSTLSLVVADEQKNIQIFYYAPKMSESWKGQKLLSR 1313 Query: 1606 AEFHVGAHITKFLRLQLLPTSADR--TNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFR 1779 AEFHVG H+TKFLRLQ+L TS+DR TNPGSDKTNR+ LLFGTLDGSIGCIAPLDELTFR Sbjct: 1314 AEFHVGTHVTKFLRLQMLSTSSDRTGTNPGSDKTNRYALLFGTLDGSIGCIAPLDELTFR 1373 Query: 1780 RLQSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLD 1959 RLQSLQKKLVDAV HVAGLNPR+FR F SNGKAHRPGPD+IVDCELLSH+EMLPLEEQL+ Sbjct: 1374 RLQSLQKKLVDAVHHVAGLNPRAFRQFQSNGKAHRPGPDTIVDCELLSHYEMLPLEEQLE 1433 Query: 1960 IAQQIGTTRTQ 1992 IA QIGTTR+Q Sbjct: 1434 IANQIGTTRSQ 1444 >emb|CBI24510.3| unnamed protein product [Vitis vinifera] Length = 1448 Score = 1030 bits (2664), Expect = 0.0 Identities = 512/670 (76%), Positives = 575/670 (85%), Gaps = 8/670 (1%) Frame = +1 Query: 7 QDQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKY 186 QDQGD+YCV+ YE+G+LEI DVPN G H++DT P+ D K+M+K Sbjct: 764 QDQGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKN 823 Query: 187 
SED-VGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASK 363 SE+ GRKE HN+KVVEL+MQRW G+HSRPFLFGIL+DG+ILCYHAY++E E+ K Sbjct: 824 SEEEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPK 883 Query: 364 AE-GVXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGL 540 E V RL+NLRFVRV L+TY REE SG +S R+T+FKN+GG QGL Sbjct: 884 TEEAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGL 943 Query: 541 FLSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSAL 720 FLSGSRP+WFM+FRER+R+HPQ+CDG IVAFTVLHN+NCNHG IY+TS+G LKICQL A+ Sbjct: 944 FLSGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAV 1003 Query: 721 TSYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFEH 900 +SYDN+WPVQKI LKGTPHQVTYFAEKNLYPLIVSVPVLKPLN VLSSL+DQE G+Q E+ Sbjct: 1004 SSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLEN 1063 Query: 901 DM----SMEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQR 1068 D + +Y V+EFEVR++EPE+S PWQTRATIPMQSSENALTVRVVTLFNTTT+ Sbjct: 1064 DNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1123 Query: 1069 NETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGH 1248 NETLLAIGTAYVQGEDVAARGRVLL+SV KN+DN Q+ VSE+YSKELKGAISA+ASLQGH Sbjct: 1124 NETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGH 1183 Query: 1249 LLLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1428 LL+ASGPKIILHKWTG+ELNGVAF+D PPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQG Sbjct: 1184 LLIASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQG 1243 Query: 1429 SQLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRA 1608 +QLNLLAKDFGSLDC ATEFLIDGSTLSLIVSDDQKN+QIFYYAPKMSESWKGQKLLSRA Sbjct: 1244 AQLNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRA 1303 Query: 1609 EFHVGAHITKFLRLQLLPTSADRTN--PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRR 1782 EFHVGAH+TKFLRLQ+LP S+DRT+ GSDKTNRF LLFGTLDGSIGCIAPLDELTFRR Sbjct: 1304 EFHVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1363 Query: 1783 LQSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDI 1962 LQSLQKKLVDAV HVAGLNPRSFR 
F SNGKAHRPGPD+IVDCELL H+EMLP EEQL+I Sbjct: 1364 LQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEI 1423 Query: 1963 AQQIGTTRTQ 1992 AQQIGTTR Q Sbjct: 1424 AQQIGTTRMQ 1433 >ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Vitis vinifera] gi|731423119|ref|XP_010662374.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Vitis vinifera] Length = 1442 Score = 1030 bits (2664), Expect = 0.0 Identities = 512/670 (76%), Positives = 575/670 (85%), Gaps = 8/670 (1%) Frame = +1 Query: 7 QDQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKY 186 QDQGD+YCV+ YE+G+LEI DVPN G H++DT P+ D K+M+K Sbjct: 758 QDQGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKN 817 Query: 187 SED-VGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASK 363 SE+ GRKE HN+KVVEL+MQRW G+HSRPFLFGIL+DG+ILCYHAY++E E+ K Sbjct: 818 SEEEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPK 877 Query: 364 AE-GVXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGL 540 E V RL+NLRFVRV L+TY REE SG +S R+T+FKN+GG QGL Sbjct: 878 TEEAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGL 937 Query: 541 FLSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSAL 720 FLSGSRP+WFM+FRER+R+HPQ+CDG IVAFTVLHN+NCNHG IY+TS+G LKICQL A+ Sbjct: 938 FLSGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAV 997 Query: 721 TSYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFEH 900 +SYDN+WPVQKI LKGTPHQVTYFAEKNLYPLIVSVPVLKPLN VLSSL+DQE G+Q E+ Sbjct: 998 SSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLEN 1057 Query: 901 DM----SMEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQR 1068 D + +Y V+EFEVR++EPE+S PWQTRATIPMQSSENALTVRVVTLFNTTT+ Sbjct: 1058 DNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1117 Query: 1069 NETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGH 1248 NETLLAIGTAYVQGEDVAARGRVLL+SV KN+DN Q+ VSE+YSKELKGAISA+ASLQGH Sbjct: 1118 
NETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGH 1177 Query: 1249 LLLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1428 LL+ASGPKIILHKWTG+ELNGVAF+D PPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQG Sbjct: 1178 LLIASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQG 1237 Query: 1429 SQLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRA 1608 +QLNLLAKDFGSLDC ATEFLIDGSTLSLIVSDDQKN+QIFYYAPKMSESWKGQKLLSRA Sbjct: 1238 AQLNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRA 1297 Query: 1609 EFHVGAHITKFLRLQLLPTSADRTN--PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRR 1782 EFHVGAH+TKFLRLQ+LP S+DRT+ GSDKTNRF LLFGTLDGSIGCIAPLDELTFRR Sbjct: 1298 EFHVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1357 Query: 1783 LQSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDI 1962 LQSLQKKLVDAV HVAGLNPRSFR F SNGKAHRPGPD+IVDCELL H+EMLP EEQL+I Sbjct: 1358 LQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEI 1417 Query: 1963 AQQIGTTRTQ 1992 AQQIGTTR Q Sbjct: 1418 AQQIGTTRMQ 1427 >ref|XP_008234350.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 [Prunus mume] Length = 1459 Score = 1027 bits (2655), Expect = 0.0 Identities = 513/671 (76%), Positives = 575/671 (85%), Gaps = 8/671 (1%) Frame = +1 Query: 4 TQDQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNK 183 + DQGDVYCV+CYE+G+LEI DVPN G H++D P DP KL+NK Sbjct: 774 SHDQGDVYCVVCYESGSLEIFDVPNFNCVFSVDKFVSGNAHLVDALMRDPPKDPQKLINK 833 Query: 184 YSEDV-GHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENAS 360 SE+V G GRKE NMKVVEL+MQRW G+HSRPFLFGIL+DG ILCYHAY+FE E AS Sbjct: 834 SSEEVSGQGRKENIQNMKVVELAMQRWLGQHSRPFLFGILNDGMILCYHAYLFEDPETAS 893 Query: 361 KAE-GVXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQG 537 K E RL+NLRFVRV L+TYA+++T + S QR+TIFKN+ G QG Sbjct: 894 KTEDSASAQNTAGVSNLNASRLRNLRFVRVPLDTYAKKDTSNETSCQRMTIFKNIAGYQG 953 Query: 538 LFLSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSA 717 LFLSGSRP WFM+FRERLRIHPQ+CDG +VA TVLHNVNCNHG IY+TS+G LKICQL Sbjct: 954 
LFLSGSRPAWFMVFRERLRIHPQLCDGSVVAVTVLHNVNCNHGLIYVTSQGILKICQLPP 1013 Query: 718 LTSYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE 897 +TSYDN+WPVQKI LKGTPHQVTYFAEKNLYPLIVSVPV KPLNQVLSSL+DQEVG+Q E Sbjct: 1014 ITSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVLSSLVDQEVGHQVE 1073 Query: 898 -HDMS---MEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQ 1065 H++S + TY V+EFE+RIMEP++S GPWQT+ATIPMQ+SENALTVRVVTLFNTTT+ Sbjct: 1074 NHNLSSDELHRTYSVDEFEIRIMEPDKSGGPWQTKATIPMQTSENALTVRVVTLFNTTTK 1133 Query: 1066 RNETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQG 1245 NETLLAIGTAYVQGEDVA RGRVLL+S K++DN Q+ VSEVYSKELKGAISALASLQG Sbjct: 1134 ENETLLAIGTAYVQGEDVAGRGRVLLFSAGKSADNTQTLVSEVYSKELKGAISALASLQG 1193 Query: 1246 HLLLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1425 HLL+ASGPKIILHKW G+ELNGVAF+DVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ Sbjct: 1194 HLLIASGPKIILHKWNGTELNGVAFFDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1253 Query: 1426 GSQLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSR 1605 G+QL+LLAKDFG+LDC ATEFLIDGSTLSL+V+D+QKN+QIFYYAPKMSESWKGQKLLSR Sbjct: 1254 GAQLSLLAKDFGNLDCFATEFLIDGSTLSLVVADEQKNIQIFYYAPKMSESWKGQKLLSR 1313 Query: 1606 AEFHVGAHITKFLRLQLLPTSADR--TNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFR 1779 AEFHVG H+TKFLRLQ+L TS+DR TNPGSDKTNR+ LLFGTLDGSIGCIAPLDELTFR Sbjct: 1314 AEFHVGTHVTKFLRLQMLSTSSDRTGTNPGSDKTNRYALLFGTLDGSIGCIAPLDELTFR 1373 Query: 1780 RLQSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLD 1959 RLQSLQKKLVDAV HVAGLNPR+FR F SNGKAHRPGPD+IVDCELLSH+EMLPL EQL+ Sbjct: 1374 RLQSLQKKLVDAVPHVAGLNPRAFRQFRSNGKAHRPGPDTIVDCELLSHYEMLPLGEQLE 1433 Query: 1960 IAQQIGTTRTQ 1992 IA QIGTTR+Q Sbjct: 1434 IANQIGTTRSQ 1444 >ref|XP_007038474.1| Cleavage and polyadenylation specificity factor 160 isoform 2 [Theobroma cacao] gi|508775719|gb|EOY22975.1| Cleavage and polyadenylation specificity factor 160 isoform 2 [Theobroma cacao] Length = 1257 Score = 1024 bits (2647), Expect = 0.0 Identities = 505/669 (75%), Positives = 578/669 (86%), Gaps = 8/669 (1%) Frame = +1 Query: 
10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+YCV+CYE+G LEI DVPN G+ ++D ++ + D K++NK S Sbjct: 574 DQGDIYCVVCYESGALEIFDVPNFNCVFSMEKFASGRTRLVDAYTLESSKDSEKVINKSS 633 Query: 190 EDV-GHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 E++ G GRKE N+KVVEL+MQRW HSRPFLFGIL+DG+ILCYHAY+FE SENASK Sbjct: 634 EELTGQGRKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKV 693 Query: 367 E-GVXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 E V RL+NLRF+R+ L+ Y REE +G SQRITIFKN+ G QG F Sbjct: 694 EDSVVAQNSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFF 753 Query: 544 LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSGSRP WFM+FRERLR+HPQ+CDG IVAFTVLHNVNCNHGFIY+TS+G LKICQ+ + + Sbjct: 754 LSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSAS 813 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE-H 900 +YDN+WPVQKI L+GTPHQVTYFAE+NLYP+IVSVPV KP+NQVLSSL+DQEVG+Q + H Sbjct: 814 NYDNYWPVQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNH 873 Query: 901 DMS---MEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRN 1071 ++S ++ TY V+EFEVRI+EPE+S GPW+T+ATIPMQSSENALTVRVVTLFNTTT+ N Sbjct: 874 NLSSDELQRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKEN 933 Query: 1072 ETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHL 1251 E+LLAIGTAY+QGEDVAARGRV+L S+ +N+DN+Q+ VSEVYSKELKGAISALASLQGHL Sbjct: 934 ESLLAIGTAYIQGEDVAARGRVILCSIGRNTDNLQNLVSEVYSKELKGAISALASLQGHL 993 Query: 1252 LLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGS 1431 L+ASGPKIILH WTGSELNG+AFYD PPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQG+ Sbjct: 994 LIASGPKIILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGA 1053 Query: 1432 QLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAE 1611 QL+LLAKDFGSLDC ATEFLIDGSTLSL+VSD+QKN+QIFYYAPKMSESWKGQKLLSRAE Sbjct: 1054 QLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1113 Query: 1612 FHVGAHITKFLRLQLLPTSADRTN--PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRL 1785 FHVGAH+TKFLRLQ+L TS+DRT+ GSDKTNRF 
LLFGTLDGSIGCIAPLDELTFRRL Sbjct: 1114 FHVGAHVTKFLRLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1173 Query: 1786 QSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIA 1965 QSLQKKLVDAV HVAGLNPRSFR FHSNGKAHRPGPDSIVDCELL H+EMLPLEEQLDIA Sbjct: 1174 QSLQKKLVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIA 1233 Query: 1966 QQIGTTRTQ 1992 QIGTTR+Q Sbjct: 1234 HQIGTTRSQ 1242 >ref|XP_007038473.1| Cleavage and polyadenylation specificity factor 160 isoform 1 [Theobroma cacao] gi|508775718|gb|EOY22974.1| Cleavage and polyadenylation specificity factor 160 isoform 1 [Theobroma cacao] Length = 1457 Score = 1024 bits (2647), Expect = 0.0 Identities = 505/669 (75%), Positives = 578/669 (86%), Gaps = 8/669 (1%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+YCV+CYE+G LEI DVPN G+ ++D ++ + D K++NK S Sbjct: 774 DQGDIYCVVCYESGALEIFDVPNFNCVFSMEKFASGRTRLVDAYTLESSKDSEKVINKSS 833 Query: 190 EDV-GHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 E++ G GRKE N+KVVEL+MQRW HSRPFLFGIL+DG+ILCYHAY+FE SENASK Sbjct: 834 EELTGQGRKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKV 893 Query: 367 E-GVXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 E V RL+NLRF+R+ L+ Y REE +G SQRITIFKN+ G QG F Sbjct: 894 EDSVVAQNSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFF 953 Query: 544 LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSGSRP WFM+FRERLR+HPQ+CDG IVAFTVLHNVNCNHGFIY+TS+G LKICQ+ + + Sbjct: 954 LSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSAS 1013 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE-H 900 +YDN+WPVQKI L+GTPHQVTYFAE+NLYP+IVSVPV KP+NQVLSSL+DQEVG+Q + H Sbjct: 1014 NYDNYWPVQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNH 1073 Query: 901 DMS---MEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRN 1071 ++S ++ TY V+EFEVRI+EPE+S GPW+T+ATIPMQSSENALTVRVVTLFNTTT+ N Sbjct: 1074 NLSSDELQRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKEN 1133 Query: 1072 
ETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHL 1251 E+LLAIGTAY+QGEDVAARGRV+L S+ +N+DN+Q+ VSEVYSKELKGAISALASLQGHL Sbjct: 1134 ESLLAIGTAYIQGEDVAARGRVILCSIGRNTDNLQNLVSEVYSKELKGAISALASLQGHL 1193 Query: 1252 LLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGS 1431 L+ASGPKIILH WTGSELNG+AFYD PPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQG+ Sbjct: 1194 LIASGPKIILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGA 1253 Query: 1432 QLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAE 1611 QL+LLAKDFGSLDC ATEFLIDGSTLSL+VSD+QKN+QIFYYAPKMSESWKGQKLLSRAE Sbjct: 1254 QLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1313 Query: 1612 FHVGAHITKFLRLQLLPTSADRTN--PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRL 1785 FHVGAH+TKFLRLQ+L TS+DRT+ GSDKTNRF LLFGTLDGSIGCIAPLDELTFRRL Sbjct: 1314 FHVGAHVTKFLRLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1373 Query: 1786 QSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIA 1965 QSLQKKLVDAV HVAGLNPRSFR FHSNGKAHRPGPDSIVDCELL H+EMLPLEEQLDIA Sbjct: 1374 QSLQKKLVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIA 1433 Query: 1966 QQIGTTRTQ 1992 QIGTTR+Q Sbjct: 1434 HQIGTTRSQ 1442 >ref|XP_015877866.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 [Ziziphus jujuba] Length = 1453 Score = 1023 bits (2645), Expect = 0.0 Identities = 515/669 (76%), Positives = 576/669 (86%), Gaps = 8/669 (1%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+YCV+CYE+G+LEI DVPN GK ++LDT + DP KLMN+ S Sbjct: 773 DQGDIYCVVCYESGSLEIYDVPNFNCVFSVEKFISGKMNLLDTLVEEQSKDPQKLMNRSS 832 Query: 190 EDV-GHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 EDV G RKE NMK+VEL+MQRW G+HSRPFLFGILSDG+ILCYHAY+FE E+ASK Sbjct: 833 EDVSGQARKENVQNMKIVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLFEGPESASKT 892 Query: 367 E-GVXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 E V RL+NLRFVRVAL+TYA+EETP+ S QRI+IFKN+ G QGLF Sbjct: 893 EDSVSAQSLSGLSNNSASRLRNLRFVRVALDTYAKEETPNATSCQRISIFKNIAGYQGLF 952 Query: 544 
LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSGSRP WFM+FRERLR+HPQ+CDG IVAFTVLHNVNCNHG IY+TS+G LKICQL ++T Sbjct: 953 LSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGILKICQLPSIT 1012 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE-H 900 SYD++WPVQKI LKGTPHQVTYFAEKNLYPLIVSVPV KPLNQV+SSLIDQEVG+Q E H Sbjct: 1013 SYDSYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVISSLIDQEVGHQAENH 1072 Query: 901 DMSMEG---TYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRN 1071 ++S + TY V+EFEVRI+EPE S GPWQT+ATIPMQ+SENALTVRVVTLFNTTT+ N Sbjct: 1073 NLSSDDLHRTYTVDEFEVRILEPEISGGPWQTKATIPMQTSENALTVRVVTLFNTTTKEN 1132 Query: 1072 ETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHL 1251 ETLLAIGTAYVQGEDVAARGRVLL+S+ N N+ VSEVY+K+LKGAISALASLQGHL Sbjct: 1133 ETLLAIGTAYVQGEDVAARGRVLLFSIGNNPQNL---VSEVYTKDLKGAISALASLQGHL 1189 Query: 1252 LLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGS 1431 L+ASGPKIILHKWTG ELN VAF+DVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG+ Sbjct: 1190 LMASGPKIILHKWTGGELNAVAFFDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1249 Query: 1432 QLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAE 1611 QL+LLAKDFGSLDC ATEFLIDGSTLSL+VSD++KN+QIFYYAPKMSESWKGQKLLSRAE Sbjct: 1250 QLSLLAKDFGSLDCFATEFLIDGSTLSLVVSDNRKNIQIFYYAPKMSESWKGQKLLSRAE 1309 Query: 1612 FHVGAHITKFLRLQLLPTSADRTNPG--SDKTNRFGLLFGTLDGSIGCIAPLDELTFRRL 1785 FHVGAH+TK LRLQ+L T++DRT SDKTNRF LLFGTLDGS+GCIAPLDELTFRRL Sbjct: 1310 FHVGAHVTKLLRLQMLSTTSDRTGTASVSDKTNRFALLFGTLDGSVGCIAPLDELTFRRL 1369 Query: 1786 QSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIA 1965 QSLQKKLVDAV+HVAGLNPRSFR F SNGKAHRPGPDSIVDCELL H+EMLPLEEQL+IA Sbjct: 1370 QSLQKKLVDAVSHVAGLNPRSFRQFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIA 1429 Query: 1966 QQIGTTRTQ 1992 QIGTTR+Q Sbjct: 1430 HQIGTTRSQ 1438 >emb|CDP05292.1| unnamed protein product [Coffea canephora] Length = 1501 Score = 1019 bits (2636), Expect = 0.0 Identities = 507/666 (76%), Positives = 569/666 (85%), Gaps = 3/666 (0%) Frame = +1 Query: 4 
TQDQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNK 183 + D GDVYC++CY++G LEI DVPN GK ++DTFS PA +++ Sbjct: 822 SHDLGDVYCIVCYQSGGLEIFDVPNFTCVFSVENFASGKAILMDTFSPHPAKSNQEVVQM 881 Query: 184 YSEDVGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASK 363 + RK+ + VVEL+M +W G+HSRPFLFGILSDG+ILCYHA++FE SE S+ Sbjct: 882 IEDVNAQERKDNSQKIGVVELAMHKWAGQHSRPFLFGILSDGTILCYHAFVFENSETGSR 941 Query: 364 AEG-VXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGL 540 E V RL+NLRF+R++L+TYAR+E PSG S+R+TIFKNVGG QGL Sbjct: 942 DEKPVISQNSGNLSSMNGSRLRNLRFIRISLDTYARDEIPSGTPSKRLTIFKNVGGFQGL 1001 Query: 541 FLSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSAL 720 FLSGSRP WFMMFRERLR HPQ+CDGPIVAFTVLHNVNCNHGFIY+TS+G LKICQL + Sbjct: 1002 FLSGSRPTWFMMFRERLRTHPQLCDGPIVAFTVLHNVNCNHGFIYVTSQGTLKICQLPSS 1061 Query: 721 TSYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFEH 900 YDN+WPVQK LKGTPHQVTYFAEKNLYPLIVS PVLKPLNQVLSSL+DQEVG+Q E+ Sbjct: 1062 LLYDNYWPVQKTTLKGTPHQVTYFAEKNLYPLIVSYPVLKPLNQVLSSLVDQEVGHQLEN 1121 Query: 901 D-MSMEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRNET 1077 + M+ EG Y VEEFE+RIMEPE S PWQTRATIPMQSSENALTVR VTLFN TT+ NET Sbjct: 1122 ETMNFEGMYPVEEFEIRIMEPENSR-PWQTRATIPMQSSENALTVRAVTLFNCTTRENET 1180 Query: 1078 LLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHLLL 1257 LLA+GTAYVQGEDVAARGR+LL+S+E+++DN Q VSEVY+KELKGAISALASLQGHLL+ Sbjct: 1181 LLAVGTAYVQGEDVAARGRILLFSIERSADNSQILVSEVYAKELKGAISALASLQGHLLI 1240 Query: 1258 ASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGSQL 1437 ASGPKIILH+WTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGSQL Sbjct: 1241 ASGPKIILHEWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGSQL 1300 Query: 1438 NLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFH 1617 NLLAKDFGSLDCLATEFLIDG+TLSL+VSDDQKNVQ+F Y+PK+SESWKGQKLLSRAEFH Sbjct: 1301 NLLAKDFGSLDCLATEFLIDGNTLSLMVSDDQKNVQVFSYSPKLSESWKGQKLLSRAEFH 1360 Query: 1618 VGAHITKFLRLQLLPTSADRTN-PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQSL 1794 +GAH+TKFLRL LLPTS DRTN 
PGSDKTNRFGLLFGTLDGSIGC+APLDELTFRRLQSL Sbjct: 1361 IGAHVTKFLRLHLLPTSPDRTNTPGSDKTNRFGLLFGTLDGSIGCVAPLDELTFRRLQSL 1420 Query: 1795 QKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIAQQI 1974 QKKLVDAV+HVAGLNPRSFR F SNG+AHRPGPDSIVDCELL H+EMLPLEEQL+IA QI Sbjct: 1421 QKKLVDAVSHVAGLNPRSFRQFRSNGRAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQI 1480 Query: 1975 GTTRTQ 1992 GTTR Q Sbjct: 1481 GTTRMQ 1486 >ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Citrus sinensis] Length = 1457 Score = 1018 bits (2633), Expect = 0.0 Identities = 510/669 (76%), Positives = 568/669 (84%), Gaps = 8/669 (1%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+Y V+CYE+G LEI DVPN G+ HI+DT+ D +N S Sbjct: 774 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 833 Query: 190 ED-VGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 E+ G GRKE H+MKVVEL+MQRW G HSRPFLF IL+DG+ILCY AY+FE EN SK+ Sbjct: 834 EEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 893 Query: 367 EG-VXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 + V RL+NLRF R+ L+ Y REETP G QRITIFKN+ G QG F Sbjct: 894 DDPVSTSRSLSVSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFF 953 Query: 544 LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSGSRP W M+FRERLR+HPQ+CDG IVAFTVLHNVNCNHGFIY+TS+G LKICQL + + Sbjct: 954 LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGS 1013 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE-H 900 +YDN+WPVQKI LK TPHQ+TYFAEKNLYPLIVSVPVLKPLNQVLS LIDQEVG+Q + H Sbjct: 1014 TYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1073 Query: 901 DMS---MEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRN 1071 ++S + TY VEE+EVRI+EP+R+ GPWQTRATIPMQSSENALTVRVVTLFNTTT+ N Sbjct: 1074 NLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1133 Query: 1072 ETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHL 1251 ETLLAIGTAYVQGEDVAARGRVLL+S +N+DN Q+ 
V+EVYSKELKGAISALASLQGHL Sbjct: 1134 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1193 Query: 1252 LLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGS 1431 L+ASGPKIILHKWTG+ELNG+AFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG+ Sbjct: 1194 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1253 Query: 1432 QLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAE 1611 QLNLLAKDFGSLDC ATEFLIDGSTLSL+VSD+QKN+QIFYYAPKMSESWKGQKLLSRAE Sbjct: 1254 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1313 Query: 1612 FHVGAHITKFLRLQLLPTSADRTN--PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRL 1785 FHVGAH+TKFLRLQ+L TS+DRT PGSDKTNRF LLFGTLDGSIGCIAPLDELTFRRL Sbjct: 1314 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1373 Query: 1786 QSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIA 1965 QSLQKKLVD+V HVAGLNPRSFR FHSNGKAHRPGPDSIVDCELLSH+EMLPLEEQL+IA Sbjct: 1374 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1433 Query: 1966 QQIGTTRTQ 1992 Q GTTR+Q Sbjct: 1434 HQTGTTRSQ 1442 >ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] gi|557523633|gb|ESR35000.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] Length = 1457 Score = 1017 bits (2630), Expect = 0.0 Identities = 510/669 (76%), Positives = 568/669 (84%), Gaps = 8/669 (1%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+Y V+CYE+G LEI DVPN G+ HI+DT+ D +N S Sbjct: 774 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 833 Query: 190 ED-VGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 E+ G GRKE H+MKVVEL+MQRW G HSRPFLF IL+DG+ILCY AY+FE SEN SK+ Sbjct: 834 EEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKS 893 Query: 367 EG-VXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 + V RL+NLRF R L+ Y REETP G QRITIFKN+ G QG F Sbjct: 894 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 953 Query: 544 
LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSGSRP W M+FRERLR+HPQ+CDG IVAFTVLHNVNCNHGFIY+TS+G LKICQL + + Sbjct: 954 LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGS 1013 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE-H 900 +YDN+WPVQKI LK TPHQ+TYFAEKNLYPLIVSVPVLKPLNQVLS LIDQEVG+Q + H Sbjct: 1014 TYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1073 Query: 901 DMS---MEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRN 1071 ++S + TY VEE+EVRI+EP+R+ GPWQTRATIPMQSSENALTVRVVTLFNTTT+ N Sbjct: 1074 NLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1133 Query: 1072 ETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHL 1251 +TLLAIGTAYVQGEDVAARGRVLL+S +N+DN Q+ V+EVYSKELKGAISALASLQGHL Sbjct: 1134 DTLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1193 Query: 1252 LLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGS 1431 L+ASGPKIILHKWTG+ELNG+AFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG+ Sbjct: 1194 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1253 Query: 1432 QLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAE 1611 QLNLLAKDFGSLDC ATEFLIDGSTLSL+VSD+QKN+QIFYYAPKMSESWKGQKLLSRAE Sbjct: 1254 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1313 Query: 1612 FHVGAHITKFLRLQLLPTSADRTN--PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRL 1785 FHVGAH+TKFLRLQ+L TS+DRT PGSDKTNRF LLFGTLDGSIGCIAPLDELTFRRL Sbjct: 1314 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1373 Query: 1786 QSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIA 1965 QSLQKKLVD+V HVAGLNPRSFR FHSNGKAHRPGPDSIVDCELLSH+EMLPLEEQL+IA Sbjct: 1374 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1433 Query: 1966 QQIGTTRTQ 1992 Q GTTR+Q Sbjct: 1434 HQTGTTRSQ 1442 >ref|XP_012090388.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 [Jatropha curcas] gi|643706250|gb|KDP22382.1| hypothetical protein JCGZ_26213 [Jatropha curcas] Length = 1456 Score = 1014 bits 
(2623), Expect = 0.0 Identities = 507/669 (75%), Positives = 572/669 (85%), Gaps = 8/669 (1%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+YC++CYE+G LE+ DVPN GK +++DT+ P D +++NK S Sbjct: 774 DQGDIYCIVCYESGALEVLDVPNFNSVFSVEKFISGKTNLVDTYVREPPKDTQQMVNKSS 833 Query: 190 EDV-GHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 E+V G GRKE HNMKVVEL+MQRW G HSRPFLFGIL+DG+ILCYHAY+FE + SK Sbjct: 834 EEVAGLGRKESMHNMKVVELAMQRWSGHHSRPFLFGILTDGTILCYHAYLFEGPDGTSKT 893 Query: 367 E-GVXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 E V RL+NLRFVRV L++Y REET S SSQRITIFKN+ G QG F Sbjct: 894 EDSVSAQNSIDLGINSSSRLRNLRFVRVPLDSYTREET-SIESSQRITIFKNISGYQGFF 952 Query: 544 LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 L GSRP WFM+FRER+R+HPQ+CDG IVAFTVLHNVNCNHG IY+TS+G LKICQL +++ Sbjct: 953 LIGSRPAWFMVFRERMRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSVS 1012 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE-H 900 SYDN+WPVQK+ LK TPHQVTYFAEKNLYPLIVSVPV KP+NQVLSSL+DQE G+Q E H Sbjct: 1013 SYDNYWPVQKVPLKATPHQVTYFAEKNLYPLIVSVPVQKPVNQVLSSLVDQEAGHQIENH 1072 Query: 901 DMS---MEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRN 1071 ++S + TY VEEFEVRI+EPER GPWQT+A IPMQSSENALTVRVVTLFNTTT+ N Sbjct: 1073 NLSSDELHRTYSVEEFEVRILEPERPGGPWQTKAVIPMQSSENALTVRVVTLFNTTTKEN 1132 Query: 1072 ETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHL 1251 ETLLAIGTAYVQGEDVAARGRVLL+SV K +DN Q V+EVYSKELKGAISALASLQGHL Sbjct: 1133 ETLLAIGTAYVQGEDVAARGRVLLFSVVKTADNPQVLVTEVYSKELKGAISALASLQGHL 1192 Query: 1252 LLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGS 1431 L+ASGPKIILHKWTG+ELNGVAF+D PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG+ Sbjct: 1193 LIASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1252 Query: 1432 QLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAE 1611 QL+LLAKDFGSLDC ATEFLIDGSTLSL+V+D+QKN+QIFYYAPKMSESWKGQKLLSRAE Sbjct: 1253 QLSLLAKDFGSLDCFATEFLIDGSTLSLVVADEQKNIQIFYYAPKMSESWKGQKLLSRAE 1312 
Query: 1612 FHVGAHITKFLRLQLLPTSADRTN--PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRL 1785 FHVGAH+TKF+RLQ+L TS+DR+ PGSDKTNRF LLFGTLDGSIGCIAPLDELTFRRL Sbjct: 1313 FHVGAHVTKFMRLQMLSTSSDRSGVAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1372 Query: 1786 QSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIA 1965 QSLQKKL+DAV HVAGLNPRSFR F S+G+ HRPGP+SIVDCELLSH+EMLPLEEQL+IA Sbjct: 1373 QSLQKKLIDAVPHVAGLNPRSFRQFQSDGRVHRPGPESIVDCELLSHYEMLPLEEQLEIA 1432 Query: 1966 QQIGTTRTQ 1992 QQIGTTR Q Sbjct: 1433 QQIGTTRAQ 1441 >gb|KDO65373.1| hypothetical protein CISIN_1g0005452mg, partial [Citrus sinensis] Length = 890 Score = 1014 bits (2622), Expect = 0.0 Identities = 509/669 (76%), Positives = 566/669 (84%), Gaps = 8/669 (1%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+Y V+CYE+G LEI DVPN G+ HI+DT+ D +N S Sbjct: 207 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 266 Query: 190 ED-VGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 E+ G GRKE H+MKVVEL+MQRW HSRPFLF IL+DG+ILCY AY+FE EN SK+ Sbjct: 267 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 326 Query: 367 EG-VXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 + V RL+NLRF R L+ Y REETP G QRITIFKN+ G QG F Sbjct: 327 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 386 Query: 544 LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSGSRP W M+FRERLR+HPQ+CDG IVAFTVLHNVNCNHGFIY+TS+G LKICQL + + Sbjct: 387 LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGS 446 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE-H 900 +YDN+WPVQKI LK TPHQ+TYFAEKNLYPLIVSVPVLKPLNQVLS LIDQEVG+Q + H Sbjct: 447 TYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 506 Query: 901 DMS---MEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRN 1071 ++S + TY VEE+EVRI+EP+R+ GPWQTRATIPMQSSENALTVRVVTLFNTTT+ N Sbjct: 507 NLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 566 Query: 1072 
ETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHL 1251 ETLLAIGTAYVQGEDVAARGRVLL+S +N+DN Q+ V+EVYSKELKGAISALASLQGHL Sbjct: 567 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 626 Query: 1252 LLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGS 1431 L+ASGPKIILHKWTG+ELNG+AFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG+ Sbjct: 627 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 686 Query: 1432 QLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAE 1611 QLNLLAKDFGSLDC ATEFLIDGSTLSL+VSD+QKN+QIFYYAPKMSESWKGQKLLSRAE Sbjct: 687 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 746 Query: 1612 FHVGAHITKFLRLQLLPTSADRTN--PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRL 1785 FHVGAH+TKFLRLQ+L TS+DRT PGSDKTNRF LLFGTLDGSIGCIAPLDELTFRRL Sbjct: 747 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 806 Query: 1786 QSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIA 1965 QSLQKKLVD+V HVAGLNPRSFR FHSNGKAHRPGPDSIVDCELLSH+EMLPLEEQL+IA Sbjct: 807 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 866 Query: 1966 QQIGTTRTQ 1992 Q GTTR+Q Sbjct: 867 HQTGTTRSQ 875 >ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Citrus sinensis] Length = 1458 Score = 1014 bits (2621), Expect = 0.0 Identities = 510/670 (76%), Positives = 568/670 (84%), Gaps = 9/670 (1%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+Y V+CYE+G LEI DVPN G+ HI+DT+ D +N S Sbjct: 774 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 833 Query: 190 ED-VGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 E+ G GRKE H+MKVVEL+MQRW G HSRPFLF IL+DG+ILCY AY+FE EN SK+ Sbjct: 834 EEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 893 Query: 367 EG-VXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 + V RL+NLRF R+ L+ Y REETP G QRITIFKN+ G QG F Sbjct: 894 DDPVSTSRSLSVSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFF 953 Query: 544 
LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSGSRP W M+FRERLR+HPQ+CDG IVAFTVLHNVNCNHGFIY+TS+G LKICQL + + Sbjct: 954 LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGS 1013 Query: 724 SYDNHWPVQK-IALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE- 897 +YDN+WPVQK I LK TPHQ+TYFAEKNLYPLIVSVPVLKPLNQVLS LIDQEVG+Q + Sbjct: 1014 TYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1073 Query: 898 HDMS---MEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQR 1068 H++S + TY VEE+EVRI+EP+R+ GPWQTRATIPMQSSENALTVRVVTLFNTTT+ Sbjct: 1074 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1133 Query: 1069 NETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGH 1248 NETLLAIGTAYVQGEDVAARGRVLL+S +N+DN Q+ V+EVYSKELKGAISALASLQGH Sbjct: 1134 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGH 1193 Query: 1249 LLLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1428 LL+ASGPKIILHKWTG+ELNG+AFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG Sbjct: 1194 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1253 Query: 1429 SQLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRA 1608 +QLNLLAKDFGSLDC ATEFLIDGSTLSL+VSD+QKN+QIFYYAPKMSESWKGQKLLSRA Sbjct: 1254 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1313 Query: 1609 EFHVGAHITKFLRLQLLPTSADRTN--PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRR 1782 EFHVGAH+TKFLRLQ+L TS+DRT PGSDKTNRF LLFGTLDGSIGCIAPLDELTFRR Sbjct: 1314 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1373 Query: 1783 LQSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDI 1962 LQSLQKKLVD+V HVAGLNPRSFR FHSNGKAHRPGPDSIVDCELLSH+EMLPLEEQL+I Sbjct: 1374 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1433 Query: 1963 AQQIGTTRTQ 1992 A Q GTTR+Q Sbjct: 1434 AHQTGTTRSQ 1443 >ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] gi|557523632|gb|ESR34999.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] Length = 1458 Score = 1013 bits (2618), Expect = 0.0 
Identities = 510/670 (76%), Positives = 568/670 (84%), Gaps = 9/670 (1%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+Y V+CYE+G LEI DVPN G+ HI+DT+ D +N S Sbjct: 774 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 833 Query: 190 ED-VGHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 E+ G GRKE H+MKVVEL+MQRW G HSRPFLF IL+DG+ILCY AY+FE SEN SK+ Sbjct: 834 EEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKS 893 Query: 367 EG-VXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 + V RL+NLRF R L+ Y REETP G QRITIFKN+ G QG F Sbjct: 894 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 953 Query: 544 LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSGSRP W M+FRERLR+HPQ+CDG IVAFTVLHNVNCNHGFIY+TS+G LKICQL + + Sbjct: 954 LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGS 1013 Query: 724 SYDNHWPVQK-IALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFE- 897 +YDN+WPVQK I LK TPHQ+TYFAEKNLYPLIVSVPVLKPLNQVLS LIDQEVG+Q + Sbjct: 1014 TYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1073 Query: 898 HDMS---MEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQR 1068 H++S + TY VEE+EVRI+EP+R+ GPWQTRATIPMQSSENALTVRVVTLFNTTT+ Sbjct: 1074 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1133 Query: 1069 NETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGH 1248 N+TLLAIGTAYVQGEDVAARGRVLL+S +N+DN Q+ V+EVYSKELKGAISALASLQGH Sbjct: 1134 NDTLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGH 1193 Query: 1249 LLLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1428 LL+ASGPKIILHKWTG+ELNG+AFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG Sbjct: 1194 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1253 Query: 1429 SQLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRA 1608 +QLNLLAKDFGSLDC ATEFLIDGSTLSL+VSD+QKN+QIFYYAPKMSESWKGQKLLSRA Sbjct: 1254 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1313 Query: 1609 
EFHVGAHITKFLRLQLLPTSADRTN--PGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRR 1782 EFHVGAH+TKFLRLQ+L TS+DRT PGSDKTNRF LLFGTLDGSIGCIAPLDELTFRR Sbjct: 1314 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1373 Query: 1783 LQSLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDI 1962 LQSLQKKLVD+V HVAGLNPRSFR FHSNGKAHRPGPDSIVDCELLSH+EMLPLEEQL+I Sbjct: 1374 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1433 Query: 1963 AQQIGTTRTQ 1992 A Q GTTR+Q Sbjct: 1434 AHQTGTTRSQ 1443 >ref|XP_012484369.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Gossypium raimondii] Length = 1349 Score = 1011 bits (2614), Expect = 0.0 Identities = 500/668 (74%), Positives = 566/668 (84%), Gaps = 7/668 (1%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+YCV+CYENG LEI DVPN G+ H++D +S + K +NK S Sbjct: 667 DQGDIYCVICYENGALEIFDVPNFNCVFSVEKFASGRAHLVDAYSQESSEGSEKPINKSS 726 Query: 190 EDV-GHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 E++ G RKE HN+KVVEL+MQRW G HSRPF+FGIL+DG+ILCYHAY+FE +NASK Sbjct: 727 EELAGQSRKENVHNLKVVELAMQRWSGNHSRPFIFGILTDGTILCYHAYLFEGPDNASKV 786 Query: 367 EG-VXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 EG RL+NLRF+RV+L+ Y REET +G SQRITIFKN+ G QG F Sbjct: 787 EGSASAQNSVGLSNVNASRLRNLRFIRVSLDAYTREETSNGTLSQRITIFKNISGYQGFF 846 Query: 544 LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSG RP WFM+FR+RLRIHPQ+CDG IVAFTVLHNVNCNHGFIY+TS+G LKICQ+ + + Sbjct: 847 LSGLRPAWFMVFRQRLRIHPQICDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQMPSTS 906 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFEH- 900 +YDN+WPVQKI L+GTPHQVTYFAE+NLYPLIVSVPV KP+NQVLSSL+DQE G+Q ++ Sbjct: 907 NYDNYWPVQKIPLRGTPHQVTYFAERNLYPLIVSVPVHKPVNQVLSSLVDQEAGHQMDNL 966 Query: 901 ---DMSMEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRN 1071 + TY VEEFEVRI+EPE+S GPW+T+ATIPMQSSENALTVRVVTLFNTTT+ N Sbjct: 967 NLSSDELHRTYTVEEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKEN 1026 Query: 1072 
ETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHL 1251 ETLLAIGTAYVQGEDVAARGRVLL+S+ +++DN Q+ VSEVYSKELKGAISALASLQGHL Sbjct: 1027 ETLLAIGTAYVQGEDVAARGRVLLFSIGRSTDNNQNLVSEVYSKELKGAISALASLQGHL 1086 Query: 1252 LLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGS 1431 L+ASGPKIILH WTGSELNG+AFYD PPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQG+ Sbjct: 1087 LIASGPKIILHIWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGA 1146 Query: 1432 QLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAE 1611 QL+LLAKDFGSLDC ATEFLIDGSTLSL+VSDDQKN+Q+FYYAPKMSESW+GQKLLSRAE Sbjct: 1147 QLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDDQKNIQVFYYAPKMSESWRGQKLLSRAE 1206 Query: 1612 FHVGAHITKFLRLQLLPTSA-DRTNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQ 1788 FHVGA +TKFLRLQ+L TS G DKTNRF LLFGTLDGSIGCIAPLDELTFRRLQ Sbjct: 1207 FHVGARVTKFLRLQMLSTSGRTSATAGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1266 Query: 1789 SLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIAQ 1968 SLQKKLVDAV HVAGLNPRSFRHF SNGKAHRPGPDSIVDCELL H+EMLPLEEQL+IA Sbjct: 1267 SLQKKLVDAVPHVAGLNPRSFRHFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAH 1326 Query: 1969 QIGTTRTQ 1992 QIGTTR+Q Sbjct: 1327 QIGTTRSQ 1334 >ref|XP_012484368.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Gossypium raimondii] gi|763767219|gb|KJB34434.1| hypothetical protein B456_006G065300 [Gossypium raimondii] Length = 1456 Score = 1011 bits (2614), Expect = 0.0 Identities = 500/668 (74%), Positives = 566/668 (84%), Gaps = 7/668 (1%) Frame = +1 Query: 10 DQGDVYCVLCYENGNLEICDVPNXXXXXXXXXXXXGKNHILDTFSHGPANDPVKLMNKYS 189 DQGD+YCV+CYENG LEI DVPN G+ H++D +S + K +NK S Sbjct: 774 DQGDIYCVICYENGALEIFDVPNFNCVFSVEKFASGRAHLVDAYSQESSEGSEKPINKSS 833 Query: 190 EDV-GHGRKEIPHNMKVVELSMQRWEGEHSRPFLFGILSDGSILCYHAYIFEVSENASKA 366 E++ G RKE HN+KVVEL+MQRW G HSRPF+FGIL+DG+ILCYHAY+FE +NASK Sbjct: 834 EELAGQSRKENVHNLKVVELAMQRWSGNHSRPFIFGILTDGTILCYHAYLFEGPDNASKV 893 Query: 367 EG-VXXXXXXXXXXXXXXRLKNLRFVRVALETYAREETPSGISSQRITIFKNVGGLQGLF 543 EG RL+NLRF+RV+L+ Y REET +G SQRITIFKN+ G QG F 
Sbjct: 894 EGSASAQNSVGLSNVNASRLRNLRFIRVSLDAYTREETSNGTLSQRITIFKNISGYQGFF 953 Query: 544 LSGSRPVWFMMFRERLRIHPQVCDGPIVAFTVLHNVNCNHGFIYITSEGALKICQLSALT 723 LSG RP WFM+FR+RLRIHPQ+CDG IVAFTVLHNVNCNHGFIY+TS+G LKICQ+ + + Sbjct: 954 LSGLRPAWFMVFRQRLRIHPQICDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQMPSTS 1013 Query: 724 SYDNHWPVQKIALKGTPHQVTYFAEKNLYPLIVSVPVLKPLNQVLSSLIDQEVGNQFEH- 900 +YDN+WPVQKI L+GTPHQVTYFAE+NLYPLIVSVPV KP+NQVLSSL+DQE G+Q ++ Sbjct: 1014 NYDNYWPVQKIPLRGTPHQVTYFAERNLYPLIVSVPVHKPVNQVLSSLVDQEAGHQMDNL 1073 Query: 901 ---DMSMEGTYLVEEFEVRIMEPERSTGPWQTRATIPMQSSENALTVRVVTLFNTTTQRN 1071 + TY VEEFEVRI+EPE+S GPW+T+ATIPMQSSENALTVRVVTLFNTTT+ N Sbjct: 1074 NLSSDELHRTYTVEEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKEN 1133 Query: 1072 ETLLAIGTAYVQGEDVAARGRVLLYSVEKNSDNVQSQVSEVYSKELKGAISALASLQGHL 1251 ETLLAIGTAYVQGEDVAARGRVLL+S+ +++DN Q+ VSEVYSKELKGAISALASLQGHL Sbjct: 1134 ETLLAIGTAYVQGEDVAARGRVLLFSIGRSTDNNQNLVSEVYSKELKGAISALASLQGHL 1193 Query: 1252 LLASGPKIILHKWTGSELNGVAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGS 1431 L+ASGPKIILH WTGSELNG+AFYD PPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQG+ Sbjct: 1194 LIASGPKIILHIWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGA 1253 Query: 1432 QLNLLAKDFGSLDCLATEFLIDGSTLSLIVSDDQKNVQIFYYAPKMSESWKGQKLLSRAE 1611 QL+LLAKDFGSLDC ATEFLIDGSTLSL+VSDDQKN+Q+FYYAPKMSESW+GQKLLSRAE Sbjct: 1254 QLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDDQKNIQVFYYAPKMSESWRGQKLLSRAE 1313 Query: 1612 FHVGAHITKFLRLQLLPTSA-DRTNPGSDKTNRFGLLFGTLDGSIGCIAPLDELTFRRLQ 1788 FHVGA +TKFLRLQ+L TS G DKTNRF LLFGTLDGSIGCIAPLDELTFRRLQ Sbjct: 1314 FHVGARVTKFLRLQMLSTSGRTSATAGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1373 Query: 1789 SLQKKLVDAVAHVAGLNPRSFRHFHSNGKAHRPGPDSIVDCELLSHFEMLPLEEQLDIAQ 1968 SLQKKLVDAV HVAGLNPRSFRHF SNGKAHRPGPDSIVDCELL H+EMLPLEEQL+IA Sbjct: 1374 SLQKKLVDAVPHVAGLNPRSFRHFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAH 1433 Query: 1969 QIGTTRTQ 1992 QIGTTR+Q Sbjct: 1434 QIGTTRSQ 1441