BLASTX nr result
ID: Rauwolfia21_contig00001145
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rauwolfia21_contig00001145 (3824 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EMJ21509.1| hypothetical protein PRUPE_ppa000211mg [Prunus pe... 1884 0.0 ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation spec... 1872 0.0 gb|EOY22975.1| Cleavage and polyadenylation specificity factor 1... 1871 0.0 gb|EOY22974.1| Cleavage and polyadenylation specificity factor 1... 1871 0.0 emb|CBI24510.3| unnamed protein product [Vitis vinifera] 1865 0.0 ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation spec... 1860 0.0 ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation spec... 1846 0.0 ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation spec... 1836 0.0 ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citr... 1835 0.0 ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation spec... 1831 0.0 ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citr... 1830 0.0 ref|XP_002510905.1| cleavage and polyadenylation specificity fac... 1821 0.0 ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation spec... 1812 0.0 ref|XP_006587381.1| PREDICTED: cleavage and polyadenylation spec... 1810 0.0 ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation spec... 1810 0.0 gb|ESW24391.1| hypothetical protein PHAVU_004G126600g [Phaseolus... 1804 0.0 gb|EXC20897.1| Cleavage and polyadenylation specificity factor s... 1798 0.0 ref|XP_002318462.2| cleavage and polyadenylation specificity fac... 1795 0.0 ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation spec... 1789 0.0 ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation spec... 1789 0.0 >gb|EMJ21509.1| hypothetical protein PRUPE_ppa000211mg [Prunus persica] Length = 1459 Score = 1884 bits (4881), Expect = 0.0 Identities = 924/1190 (77%), Positives = 1046/1190 (87%), Gaps = 8/1190 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 270 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVISAN 329 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 +IHYHSQS+SC LALN++A+S+D SQ+M RSSF VELD ANATWL NDVA+LSTKTGELL Sbjct: 330 SIHYHSQSASCALALNSYAVSADNSQEMPRSSFTVELDTANATWLLNDVALLSTKTGELL 389 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGSLPLG 542 LTLVYDGR+VQRL+LSKS+A+VLTS IT +G+S FFLGSRLGDSLLVQFTCG G L Sbjct: 390 LLTLVYDGRVVQRLDLSKSKASVLTSGITKVGNSLFFLGSRLGDSLLVQFTCGVGGSVLS 449 Query: 543 --VKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 +K+EVGDIEG APLAKRLR SSSDALQDMV+GEELSLYGS PNNA SAQ+SFSF VRD Sbjct: 450 SDMKDEVGDIEGDAPLAKRLRMSSSDALQDMVSGEELSLYGSAPNNAESAQKSFSFAVRD 509 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNGALCVL++SIRPEMIT+ Sbjct: 510 SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 569 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHKN R DSSKIAA DDE+HAYLIISLE RTMVL+TA+ L EVTE Sbjct: 570 VELPGCKGIWTVYHKNARGHNADSSKIAASDDEFHAYLIISLEARTMVLETADLLSEVTE 629 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDY+ QG T+AAGNLFGRRRV+QVY GARILDG+FMTQ+LSF NSE+GSGSE V Sbjct: 630 SVDYFVQGRTIAAGNLFGRRRVVQVYERGARILDGSFMTQDLSFGTSNSEMGSGSESSTV 689 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSI DPYVLLRM+DG I+LLVGDPS CTVS P FESSK+SISACTLYHDKGPEPW Sbjct: 690 LSVSIVDPYVLLRMSDGGIRLLVGDPSLCTVSTSIPAAFESSKKSISACTLYHDKGPEPW 749 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTGI EAIDG+D SHDQGD+YC+VCYESG LEIFDVPNF+ V+SVDKFV Sbjct: 750 LRKTSTDAWLSTGIDEAIDGADGVSHDQGDVYCVVCYESGSLEIFDVPNFNCVFSVDKFV 809 Query: 1617 SGKANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790 SG A+L+D ++ DP K+ K++++V+ G+KE IQN+++VELAM RW GQHSRPFL Sbjct: 810 SGNAHLIDTLMRDPPKDPQKLINKSSEEVSGQGRKENIQNMKVVELAMQRWSGQHSRPFL 869 Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970 FGIL+DG ILCYH+Y+F+ + ASK E++ ++Q + ++S SRLRNLRF+RV LDTYA Sbjct: 870 FGILNDGMILCYHAYLFEGPETASKTEDSASAQNTTGVSNLSASRLRNLRFVRVPLDTYA 929 Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150 +++TS T CQR+TIFKN+ G QGLF+SGSRP+W+M+FRERLRIHPQLCDG++VA TVLH Sbjct: 930 KKDTSNETSCQRMTIFKNIAGYQGLFLSGSRPAWFMVFRERLRIHPQLCDGSVVAVTVLH 989 Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVS 2330 NVNCNHG IYVTSQG LKICQLP SYDNYWPVQKIPLKGTPHQVTYFA+KNLYPLIVS Sbjct: 990 NVNCNHGLIYVTSQGILKICQLPPITSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVS 1049 Query: 2331 FPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTRA 2498 PV KPLNQVLSSLVDQE G Q+EN +Y V+EFE+RIMEPD+SGGPWQT+A Sbjct: 1050 VPVHKPLNQVLSSLVDQEVGHQVENHNLSSDELHRTYSVDEFEIRIMEPDKSGGPWQTKA 1109 Query: 2499 TIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNS 2678 TIPMQ+SENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVA RGRVLLFS K+ DN+ Sbjct: 1110 TIPMQTSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAGRGRVLLFSAGKSADNT 1169 Query: 2679 QTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSM 2858 QTLVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNGVAFFD PPLYVVS+ Sbjct: 1170 QTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWNGTELNGVAFFDVPPLYVVSL 1229 Query: 2859 NIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQ 3038 NIVKNFILLGD+HKSIYFL+WKEQG+QL LL+KDFG LDC ATEFLIDGSTLSL+V+DEQ Sbjct: 1230 NIVKNFILLGDVHKSIYFLSWKEQGAQLTLLAKDFGNLDCFATEFLIDGSTLSLVVADEQ 1289 Query: 3039 KNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINR 3218 KNIQ+FYYAPK+ SWKGQKLLSRAEFH+G HVTKF+RLQ+L TSSD+ PGSDK NR Sbjct: 1290 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGTHVTKFLRLQMLSTSSDRTGTNPGSDKTNR 1349 Query: 3219 FGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRP 3398 + LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNPR+FRQF+SNG+AHRP Sbjct: 1350 YALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVHHVAGLNPRAFRQFQSNGKAHRP 1409 Query: 3399 GPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 GPD IVDCELL HYEMLPLEEQLEIA Q+GTTR+QI SNLN+L++GTSFL Sbjct: 1410 GPDTIVDCELLSHYEMLPLEEQLEIANQIGTTRSQIFSNLNDLSIGTSFL 1459 >ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Vitis vinifera] Length = 1442 Score = 1872 bits (4849), Expect = 0.0 Identities = 922/1188 (77%), Positives = 1042/1188 (87%), Gaps = 6/1188 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLL VPSPIGGV+V+ AN Sbjct: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLPVPSPIGGVVVISAN 327 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 +IHYHSQS+SC LALNN+A+S+D SQ+M RSSF+VELDAANATWL+NDVAMLSTKTGELL Sbjct: 328 SIHYHSQSASCALALNNYAVSADNSQEMPRSSFSVELDAANATWLSNDVAMLSTKTGELL 387 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGSLPLG 542 LTL YDGR+V RL+LSKSRA+VLTS I IG+S FFLGSRLGDSLLVQFT L Sbjct: 388 LLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGNSLFFLGSRLGDSLLVQFT---SILSSS 444 Query: 543 VKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRDSL 722 VKEEVGDIEG P AKRLR+SSSDALQDMVNGEELSLYGS PN+ ++Q++FSF+VRDS Sbjct: 445 VKEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEELSLYGSAPNSTETSQKTFSFSVRDSF 504 Query: 723 INIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQES 902 IN+GPLKDF+ GLR NAD A+G AKQSNYELVCCSGHGKNGALC+LQ+SIRPEMIT+ Sbjct: 505 INVGPLKDFAYGLRINADPKATGIAKQSNYELVCCSGHGKNGALCILQQSIRPEMITEVE 564 Query: 903 LPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTENV 1082 LPGCKG+WTVYHKN R DS+K+A DDEYHAYLIISLE+RTMVL+TA+ L EVTE+V Sbjct: 565 LPGCKGIWTVYHKNTRGHNADSTKMATKDDEYHAYLIISLESRTMVLETADLLGEVTESV 624 Query: 1083 DYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMVSS 1262 DYY QG T++AGNLFGRRRV+QVYA GARILDGAFMTQ+L SE V S Sbjct: 625 DYYVQGCTISAGNLFGRRRVVQVYARGARILDGAFMTQDLPI----------SESSTVLS 674 Query: 1263 VSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPWLR 1442 VSIADPYVLLRM+DG+IQLLVGDPS CTVSI P +FESSK+SISACTLYHDKGPEPWLR Sbjct: 675 VSIADPYVLLRMSDGNIQLLVGDPSTCTVSINIPAVFESSKKSISACTLYHDKGPEPWLR 734 Query: 1443 RTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFVSG 1622 +TSTDAWLSTGIGEAIDG+D A+ DQGDIYC+V YESG LEIFDVPNF+ V+SVDKF+SG Sbjct: 735 KTSTDAWLSTGIGEAIDGADGAAQDQGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSG 794 Query: 1623 KANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796 A+LVD +IL+P++ KV K +++ G+KE NI++VELAM RW GQHSRPFLFG Sbjct: 795 NAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFG 854 Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976 IL+DGTILCYH+Y+++ ++ K EEAV++Q +I ++S SRLRNLRF+RV LDTY RE Sbjct: 855 ILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTRE 914 Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156 E GT R+T+FKN+GGCQGLF+SGSRP W+M+FRER+R+HPQLCDG+IVAFTVLHN+ Sbjct: 915 EALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNI 974 Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336 NCNHG IYVTSQG LKICQLP+ SYDNYWPVQKIPLKGTPHQVTYFA+KNLYPLIVS P Sbjct: 975 NCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVP 1034 Query: 2337 VLKPLNQVLSSLVDQEAGQQIENDV----SFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504 VLKPLN VLSSLVDQEAG Q+END SY V+EFEVR++EP++SG PWQTRATI Sbjct: 1035 VLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATI 1094 Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684 PMQSSENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVAARGRVLLFSV KN DNSQ Sbjct: 1095 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQN 1154 Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864 LVSE+YSKELKGAISA+ASLQGHLLIASGPKI LHKW G+ELNGVAFFDAPPLYVVS+NI Sbjct: 1155 LVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNI 1214 Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044 VKNFILLGDIH+SIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+VSD+QKN Sbjct: 1215 VKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKN 1274 Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224 IQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+LP SSD+ +A GSDK NRF Sbjct: 1275 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFA 1334 Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404 LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNPRSFRQFRSNG+AHRPGP Sbjct: 1335 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGP 1394 Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 DNIVDCELLCHYEMLP EEQLEIAQQ+GTTR QI+SNLN+L+LGTSFL Sbjct: 1395 DNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQILSNLNDLSLGTSFL 1442 >gb|EOY22975.1| Cleavage and polyadenylation specificity factor 160 isoform 2 [Theobroma cacao] Length = 1257 Score = 1871 bits (4847), Expect = 0.0 Identities = 914/1190 (76%), Positives = 1045/1190 (87%), Gaps = 8/1190 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 68 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVISAN 127 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQS+SC LALNN+AIS D SQD+ RS+F+VELDAANATWL NDVA+LSTKTGELL Sbjct: 128 TIHYHSQSASCALALNNYAISVDNSQDLPRSNFSVELDAANATWLLNDVALLSTKTGELL 187 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536 LTL+YDGR+VQRL+LSKS+A+VLTS+ITTIG+S FFLGSRLGDSLLVQF+ G+G +LP Sbjct: 188 LLTLIYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRLGDSLLVQFSGGSGVSALP 247 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 G+KEEVGDIEG PLAKRLRRSSSDALQDMV GEELSLYGS PNN SAQ++F F VRD Sbjct: 248 SGLKEEVGDIEGDVPLAKRLRRSSSDALQDMVGGEELSLYGSAPNNTESAQKTFLFAVRD 307 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SL N+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNGALCVL++SIRPEMIT+ Sbjct: 308 SLTNVGPLKDFSYGLRINADVNATGIAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 367 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 L GCKG+WTVYHK+ RS D SK+ DDDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 368 VELTGCKGIWTVYHKSTRSHSADLSKVTDDDDEYHAYLIISLEARTMVLETADLLTEVTE 427 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDYY QG T+AAGNLFGRRRV+QVY GARILDG+FMTQELS NSE GSE V Sbjct: 428 SVDYYVQGRTIAAGNLFGRRRVVQVYERGARILDGSFMTQELSIPSPNSESSPGSENSTV 487 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADPYVLLRMTDGSI LLVGDP+ CTVSI P FE SK+ +SACTLYHDKGPEPW Sbjct: 488 ISVSIADPYVLLRMTDGSILLLVGDPATCTVSINTPTAFEGSKKMVSACTLYHDKGPEPW 547 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+ STDAWLSTG+GE+IDG+D HDQGDIYC+VCYESG LEIFDVPNF+ V+S++KF Sbjct: 548 LRKASTDAWLSTGVGESIDGADGGPHDQGDIYCVVCYESGALEIFDVPNFNCVFSMEKFA 607 Query: 1617 SGKANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790 SG+ LVDA L+ + + KV K+++++T G+KE +QN+++VELAM RW HSRPFL Sbjct: 608 SGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKENVQNLKVVELAMQRWSANHSRPFL 667 Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970 FGIL+DGTILCYH+Y+F+ S+ ASKVE++V +Q + +++ SRLRNLRFIR+ LD Y Sbjct: 668 FGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVGLSNINASRLRNLRFIRIPLDAYT 727 Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150 REE S GT QRITIFKN+ G QG F+SGSRP+W+M+FRERLR+HPQLCDG+IVAFTVLH Sbjct: 728 REEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLH 787 Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVS 2330 NVNCNHGFIYVTSQG LKICQ+PS +YDNYWPVQKIPL+GTPHQVTYFA++NLYP+IVS Sbjct: 788 NVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKIPLRGTPHQVTYFAERNLYPIIVS 847 Query: 2331 FPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTRA 2498 PV KP+NQVLSSLVDQE G Q++N + +Y V+EFEVRI+EP++SGGPW+T+A Sbjct: 848 VPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTYTVDEFEVRILEPEKSGGPWETKA 907 Query: 2499 TIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNS 2678 TIPMQSSENALTVRVVTL+N+TT+ENE+LLAIGTAY+QGEDVAARGRV+L S+ +N DN Sbjct: 908 TIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYIQGEDVAARGRVILCSIGRNTDNL 967 Query: 2679 QTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSM 2858 Q LVSEVYSKELKGAISALASLQGHLLIASGPKI LH W GSELNG+AF+DAPPLYVVS+ Sbjct: 968 QNLVSEVYSKELKGAISALASLQGHLLIASGPKIILHNWTGSELNGIAFYDAPPLYVVSL 1027 Query: 2859 NIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQ 3038 NIVKNFILLGD+HKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSDEQ Sbjct: 1028 NIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDEQ 1087 Query: 3039 KNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINR 3218 KNIQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+ +A GSDK NR Sbjct: 1088 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSSDRTSATAGSDKTNR 1147 Query: 3219 FGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRP 3398 F LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNPRSFRQF SNG+AHRP Sbjct: 1148 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFHSNGKAHRP 1207 Query: 3399 GPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 GPD+IVDCELLCHYEMLPLEEQL+IA Q+GTTR+QI+SNLN+LTLGTSFL Sbjct: 1208 GPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLNDLTLGTSFL 1257 >gb|EOY22974.1| Cleavage and polyadenylation specificity factor 160 isoform 1 [Theobroma cacao] Length = 1457 Score = 1871 bits (4847), Expect = 0.0 Identities = 914/1190 (76%), Positives = 1045/1190 (87%), Gaps = 8/1190 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVISAN 327 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQS+SC LALNN+AIS D SQD+ RS+F+VELDAANATWL NDVA+LSTKTGELL Sbjct: 328 TIHYHSQSASCALALNNYAISVDNSQDLPRSNFSVELDAANATWLLNDVALLSTKTGELL 387 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536 LTL+YDGR+VQRL+LSKS+A+VLTS+ITTIG+S FFLGSRLGDSLLVQF+ G+G +LP Sbjct: 388 LLTLIYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRLGDSLLVQFSGGSGVSALP 447 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 G+KEEVGDIEG PLAKRLRRSSSDALQDMV GEELSLYGS PNN SAQ++F F VRD Sbjct: 448 SGLKEEVGDIEGDVPLAKRLRRSSSDALQDMVGGEELSLYGSAPNNTESAQKTFLFAVRD 507 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SL N+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNGALCVL++SIRPEMIT+ Sbjct: 508 SLTNVGPLKDFSYGLRINADVNATGIAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 567 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 L GCKG+WTVYHK+ RS D SK+ DDDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 568 VELTGCKGIWTVYHKSTRSHSADLSKVTDDDDEYHAYLIISLEARTMVLETADLLTEVTE 627 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDYY QG T+AAGNLFGRRRV+QVY GARILDG+FMTQELS NSE GSE V Sbjct: 628 SVDYYVQGRTIAAGNLFGRRRVVQVYERGARILDGSFMTQELSIPSPNSESSPGSENSTV 687 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADPYVLLRMTDGSI LLVGDP+ CTVSI P FE SK+ +SACTLYHDKGPEPW Sbjct: 688 ISVSIADPYVLLRMTDGSILLLVGDPATCTVSINTPTAFEGSKKMVSACTLYHDKGPEPW 747 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+ STDAWLSTG+GE+IDG+D HDQGDIYC+VCYESG LEIFDVPNF+ V+S++KF Sbjct: 748 LRKASTDAWLSTGVGESIDGADGGPHDQGDIYCVVCYESGALEIFDVPNFNCVFSMEKFA 807 Query: 1617 SGKANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790 SG+ LVDA L+ + + KV K+++++T G+KE +QN+++VELAM RW HSRPFL Sbjct: 808 SGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKENVQNLKVVELAMQRWSANHSRPFL 867 Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970 FGIL+DGTILCYH+Y+F+ S+ ASKVE++V +Q + +++ SRLRNLRFIR+ LD Y Sbjct: 868 FGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVGLSNINASRLRNLRFIRIPLDAYT 927 Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150 REE S GT QRITIFKN+ G QG F+SGSRP+W+M+FRERLR+HPQLCDG+IVAFTVLH Sbjct: 928 REEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLH 987 Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVS 2330 NVNCNHGFIYVTSQG LKICQ+PS +YDNYWPVQKIPL+GTPHQVTYFA++NLYP+IVS Sbjct: 988 NVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKIPLRGTPHQVTYFAERNLYPIIVS 1047 Query: 2331 FPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTRA 2498 PV KP+NQVLSSLVDQE G Q++N + +Y V+EFEVRI+EP++SGGPW+T+A Sbjct: 1048 VPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTYTVDEFEVRILEPEKSGGPWETKA 1107 Query: 2499 TIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNS 2678 TIPMQSSENALTVRVVTL+N+TT+ENE+LLAIGTAY+QGEDVAARGRV+L S+ +N DN Sbjct: 1108 TIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYIQGEDVAARGRVILCSIGRNTDNL 1167 Query: 2679 QTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSM 2858 Q LVSEVYSKELKGAISALASLQGHLLIASGPKI LH W GSELNG+AF+DAPPLYVVS+ Sbjct: 1168 QNLVSEVYSKELKGAISALASLQGHLLIASGPKIILHNWTGSELNGIAFYDAPPLYVVSL 1227 Query: 2859 NIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQ 3038 NIVKNFILLGD+HKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSDEQ Sbjct: 1228 NIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDEQ 1287 Query: 3039 KNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINR 3218 KNIQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+ +A GSDK NR Sbjct: 1288 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSSDRTSATAGSDKTNR 1347 Query: 3219 FGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRP 3398 F LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNPRSFRQF SNG+AHRP Sbjct: 1348 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFHSNGKAHRP 1407 Query: 3399 GPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 GPD+IVDCELLCHYEMLPLEEQL+IA Q+GTTR+QI+SNLN+LTLGTSFL Sbjct: 1408 GPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLNDLTLGTSFL 1457 >emb|CBI24510.3| unnamed protein product [Vitis vinifera] Length = 1448 Score = 1865 bits (4832), Expect = 0.0 Identities = 922/1194 (77%), Positives = 1042/1194 (87%), Gaps = 12/1194 (1%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLL VPSPIGGV+V+ AN Sbjct: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLPVPSPIGGVVVISAN 327 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 +IHYHSQS+SC LALNN+A+S+D SQ+M RSSF+VELDAANATWL+NDVAMLSTKTGELL Sbjct: 328 SIHYHSQSASCALALNNYAVSADNSQEMPRSSFSVELDAANATWLSNDVAMLSTKTGELL 387 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGSLPLG 542 LTL YDGR+V RL+LSKSRA+VLTS I IG+S FFLGSRLGDSLLVQFT L Sbjct: 388 LLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGNSLFFLGSRLGDSLLVQFT---SILSSS 444 Query: 543 VKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQ------ESFSF 704 VKEEVGDIEG P AKRLR+SSSDALQDMVNGEELSLYGS PN+ ++Q ++FSF Sbjct: 445 VKEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEELSLYGSAPNSTETSQVEAQVGKTFSF 504 Query: 705 TVRDSLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPE 884 +VRDS IN+GPLKDF+ GLR NAD A+G AKQSNYELVCCSGHGKNGALC+LQ+SIRPE Sbjct: 505 SVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYELVCCSGHGKNGALCILQQSIRPE 564 Query: 885 MITQESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLE 1064 MIT+ LPGCKG+WTVYHKN R DS+K+A DDEYHAYLIISLE+RTMVL+TA+ L Sbjct: 565 MITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDEYHAYLIISLESRTMVLETADLLG 624 Query: 1065 EVTENVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSE 1244 EVTE+VDYY QG T++AGNLFGRRRV+QVYA GARILDGAFMTQ+L SE Sbjct: 625 EVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARILDGAFMTQDLPI----------SE 674 Query: 1245 GQMVSSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKG 1424 V SVSIADPYVLLRM+DG+IQLLVGDPS CTVSI P +FESSK+SISACTLYHDKG Sbjct: 675 SSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSINIPAVFESSKKSISACTLYHDKG 734 Query: 1425 PEPWLRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSV 1604 PEPWLR+TSTDAWLSTGIGEAIDG+D A+ DQGDIYC+V YESG LEIFDVPNF+ V+SV Sbjct: 735 PEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYCVVSYESGDLEIFDVPNFNCVFSV 794 Query: 1605 DKFVSGKANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHS 1778 DKF+SG A+LVD +IL+P++ KV K +++ G+KE NI++VELAM RW GQHS Sbjct: 795 DKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKENAHNIKVVELAMQRWSGQHS 854 Query: 1779 RPFLFGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTL 1958 RPFLFGIL+DGTILCYH+Y+++ ++ K EEAV++Q +I ++S SRLRNLRF+RV L Sbjct: 855 RPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSLSISNVSASRLRNLRFVRVPL 914 Query: 1959 DTYAREETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAF 2138 DTY REE GT R+T+FKN+GGCQGLF+SGSRP W+M+FRER+R+HPQLCDG+IVAF Sbjct: 915 DTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFMVFRERIRVHPQLCDGSIVAF 974 Query: 2139 TVLHNVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYP 2318 TVLHN+NCNHG IYVTSQG LKICQLP+ SYDNYWPVQKIPLKGTPHQVTYFA+KNLYP Sbjct: 975 TVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQKIPLKGTPHQVTYFAEKNLYP 1034 Query: 2319 LIVSFPVLKPLNQVLSSLVDQEAGQQIENDV----SFEGSYPVEEFEVRIMEPDRSGGPW 2486 LIVS PVLKPLN VLSSLVDQEAG Q+END SY V+EFEVR++EP++SG PW Sbjct: 1035 LIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRSYSVDEFEVRVLEPEKSGAPW 1094 Query: 2487 QTRATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKN 2666 QTRATIPMQSSENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVAARGRVLLFSV KN Sbjct: 1095 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSVGKN 1154 Query: 2667 NDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLY 2846 DNSQ LVSE+YSKELKGAISA+ASLQGHLLIASGPKI LHKW G+ELNGVAFFDAPPLY Sbjct: 1155 TDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWTGTELNGVAFFDAPPLY 1214 Query: 2847 VVSMNIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMV 3026 VVS+NIVKNFILLGDIH+SIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+V Sbjct: 1215 VVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLIV 1274 Query: 3027 SDEQKNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSD 3206 SD+QKNIQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+LP SSD+ +A GSD Sbjct: 1275 SDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLPASSDRTSATQGSD 1334 Query: 3207 KINRFGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGR 3386 K NRF LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNPRSFRQFRSNG+ Sbjct: 1335 KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFRSNGK 1394 Query: 3387 AHRPGPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 AHRPGPDNIVDCELLCHYEMLP EEQLEIAQQ+GTTR QI+SNLN+L+LGTSFL Sbjct: 1395 AHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQILSNLNDLSLGTSFL 1448 >ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Solanum tuberosum] Length = 1447 Score = 1860 bits (4817), Expect = 0.0 Identities = 914/1188 (76%), Positives = 1050/1188 (88%), Gaps = 6/1188 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT +GRVSWKHHTCMVSA SISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 262 LTWSGRVSWKHHTCMVSAFSISTTLKQHPLIWSAANLPHDAYKLLAVPSPIGGVLVIGAN 321 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQSSSC LALNN A D SQ+M RSSFNVELDAANATWLT+DVAMLSTKTGELL Sbjct: 322 TIHYHSQSSSCSLALNNFAFFGDNSQEMPRSSFNVELDAANATWLTSDVAMLSTKTGELL 381 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536 LT++YDGRIVQ+L+LSKSRA+VLTS ITTIG S FFLGSRLGDSLLVQF+CG G +LP Sbjct: 382 LLTIIYDGRIVQKLDLSKSRASVLTSGITTIGDSLFFLGSRLGDSLLVQFSCGLGGSNLP 441 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 GV+EEVGDIE AP AKRLR SSSDALQDM+NGEELSLYG+ PNNA SAQ++FSF VRD Sbjct: 442 PGVQEEVGDIESDAPSAKRLRMSSSDALQDMINGEELSLYGTAPNNAQSAQKTFSFAVRD 501 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SLIN+GPLKDFS G+R NAD NA+G AKQSNYELVCCSGHGKNG+LCVLQ+SIRPE ITQ Sbjct: 502 SLINVGPLKDFSYGMRINADLNATGIAKQSNYELVCCSGHGKNGSLCVLQQSIRPETITQ 561 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 E+LPGCKG+WTVYHKN R + +SS++A ++DEYHAYLIISLE RTMVLQTANNLEEVTE Sbjct: 562 EALPGCKGIWTVYHKNTRIHLSESSRMADEEDEYHAYLIISLEARTMVLQTANNLEEVTE 621 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 NVDYY QG+T+AAGNLFGRRRVIQV+A GARILDGAFMTQELSFK N E GS S+ +V Sbjct: 622 NVDYYVQGTTLAAGNLFGRRRVIQVFAHGARILDGAFMTQELSFKASNVESGSSSDTSIV 681 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 +SVSIADPYVLLRMT+GS+QLLVGDPS+C+VS+ P +FESSK+SISACTLYHDKGPEPW Sbjct: 682 ASVSIADPYVLLRMTNGSLQLLVGDPSSCSVSLTVPSVFESSKKSISACTLYHDKGPEPW 741 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLS+G+GEAIDG+D + DQGD+YC+VCYE+G LEIFDVPNF+ V+SVDKF+ Sbjct: 742 LRKTSTDAWLSSGMGEAIDGADGVTQDQGDVYCVVCYENGTLEIFDVPNFTCVFSVDKFI 801 Query: 1617 SGKANLVDAIILDPTDSAH-KVKTAQDVTAPGKKEQIQNIRI--VELAMHRWPGQHSRPF 1787 SG+ LVD + D + H K +DV PG+KE ++++I VEL MHRW G+HSRPF Sbjct: 802 SGRTYLVDTFMQDSVNGLHAHSKNTEDVIRPGQKENSKDVKINVVELMMHRWIGKHSRPF 861 Query: 1788 LFGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTY 1967 LFGIL+DGTIL YH+YVF+ S+ +SKVE +V+SQ ++ S + SRLRNLRF+RV +D Y Sbjct: 862 LFGILADGTILSYHAYVFEGSENSSKVEGSVSSQNSISLSSTNASRLRNLRFVRVPVDNY 921 Query: 1968 AREETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVL 2147 AREE GT+ QR+ ++KN+GG QG+F++GSRPSW+M+FRERLRIHPQLCDG IVAFTVL Sbjct: 922 AREEMPSGTQLQRMNVYKNIGGSQGIFLTGSRPSWFMVFRERLRIHPQLCDGPIVAFTVL 981 Query: 2148 HNVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIV 2327 HNVNCNHG IYVT+ G LKICQLPSFLSYDNYWPVQKIPLKGTPHQV YFA+KN+Y +IV Sbjct: 982 HNVNCNHGLIYVTALGTLKICQLPSFLSYDNYWPVQKIPLKGTPHQVAYFAEKNVYSVIV 1041 Query: 2328 SFPVLKPLNQVLSSLVDQEAGQQIEND-VSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504 S PVLKPLNQVLS++ DQE GQQ + D +++EGSYP+EEFEVRI+EP++SGG W+TRA+I Sbjct: 1042 SVPVLKPLNQVLSTIADQEVGQQFDPDNLNYEGSYPIEEFEVRIVEPEKSGGLWKTRASI 1101 Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684 PMQSSENALTVR+VTL N+TTRENETLLA+GTAYVQGEDVAARGRVLLFS+++ DNS+T Sbjct: 1102 PMQSSENALTVRMVTLLNTTTRENETLLAVGTAYVQGEDVAARGRVLLFSIDRTADNSRT 1161 Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864 LVSEVYSKELKGAI ALASLQGHLLIASGPKI LHKW GSELNGVAF D PPL+ VS+NI Sbjct: 1162 LVSEVYSKELKGAIPALASLQGHLLIASGPKIILHKWTGSELNGVAFCDYPPLHAVSLNI 1221 Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044 VKNFILLGDIHKSI F++WKE QL+LL+KDF LDCLATEFLIDGSTLSL+VSD+QKN Sbjct: 1222 VKNFILLGDIHKSISFVSWKE--PQLSLLAKDFSPLDCLATEFLIDGSTLSLVVSDDQKN 1279 Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224 +Q+FYYAPK+ SWKGQKLLSRAEFH+G+ +TKF+RLQLLPT+S++ PGSDK NRF Sbjct: 1280 VQIFYYAPKVSESWKGQKLLSRAEFHVGSRITKFLRLQLLPTTSERTATTPGSDKTNRFA 1339 Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404 +FGTL+GS+GCI+PLDELTFRRLQSLQ+KLV AV H+AGLNPRSFRQFRSNG+AHRPGP Sbjct: 1340 TVFGTLEGSLGCIAPLDELTFRRLQSLQKKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGP 1399 Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 DNIVDCELL HYEMLPLEEQLEIAQQ+GTTR QIMSNLN++ LGTSFL Sbjct: 1400 DNIVDCELLSHYEMLPLEEQLEIAQQIGTTRMQIMSNLNDMILGTSFL 1447 >ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Solanum lycopersicum] Length = 1447 Score = 1846 bits (4782), Expect = 0.0 Identities = 908/1188 (76%), Positives = 1046/1188 (88%), Gaps = 6/1188 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT +GRVSWKHHTCMVSA SISTTLKQ PLIWSATNLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 262 LTWSGRVSWKHHTCMVSAFSISTTLKQHPLIWSATNLPHDAYKLLAVPSPIGGVLVIGAN 321 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQSSSC LALNN D SQ+M RSS NVELDAANATWLT+DVAMLSTKTGELL Sbjct: 322 TIHYHSQSSSCSLALNNFVFFGDNSQEMPRSSINVELDAANATWLTSDVAMLSTKTGELL 381 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536 LT++YDGRIVQ+L+LSKSRA+VLTS ITTIG S FFLGSRLGDSLLVQF+ G G +LP Sbjct: 382 LLTIIYDGRIVQKLDLSKSRASVLTSGITTIGDSLFFLGSRLGDSLLVQFSSGLGGSNLP 441 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 GV+EEVGDIE AP AKRLR SSSDALQDM+NGEELSLYG+ PNNA SAQ++FSF VRD Sbjct: 442 PGVQEEVGDIESDAPSAKRLRMSSSDALQDMINGEELSLYGTAPNNAQSAQKTFSFAVRD 501 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SLIN+GPLKDFS G+R NAD NA+G AKQSNYELVCCSGHGKNG+L VLQ+SIRPE ITQ Sbjct: 502 SLINVGPLKDFSYGMRINADLNATGIAKQSNYELVCCSGHGKNGSLSVLQQSIRPETITQ 561 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 SLPGCKG+WTVYHKN R + +SS++A ++DEYHAYLIISLE RTMVLQTANNLEEVTE Sbjct: 562 VSLPGCKGIWTVYHKNTRIHLSESSRMADEEDEYHAYLIISLEARTMVLQTANNLEEVTE 621 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 NVDYY QG+T+AAGNLFGRRRVIQV+A GARILDGAFMTQELSFK N E GS S+ +V Sbjct: 622 NVDYYVQGTTLAAGNLFGRRRVIQVFAHGARILDGAFMTQELSFKASNVESGSSSDTSIV 681 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 +SVSIADPYVLLRMT+GS+QLLVGDPS+C+VS+ P +FESSK+SISACTLYHDKGPEPW Sbjct: 682 ASVSIADPYVLLRMTNGSLQLLVGDPSSCSVSLTVPSVFESSKKSISACTLYHDKGPEPW 741 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLS+G+GEAIDG+D DQGD+YC+VCYE+G LEIFDVP+F+ V+SVDKF+ Sbjct: 742 LRKTSTDAWLSSGMGEAIDGADGVIQDQGDVYCVVCYENGTLEIFDVPSFTCVFSVDKFI 801 Query: 1617 SGKANLVDAIILDPTDSAH-KVKTAQDVTAPGKKEQIQNIRI--VELAMHRWPGQHSRPF 1787 SG+ LVD + D + H K +DV PG+KE ++++I VEL MHRW G+HSRPF Sbjct: 802 SGRTYLVDTFMQDSVNGLHAHSKNTEDVIRPGQKENSKDVKINVVELMMHRWIGKHSRPF 861 Query: 1788 LFGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTY 1967 LFGIL+DGTIL YH+YVF+ S+ +SKV+ +V+SQ ++ S + SRLRNLRF+RV +D Y Sbjct: 862 LFGILADGTILSYHAYVFEGSENSSKVDGSVSSQNSISLSSTNASRLRNLRFVRVPVDNY 921 Query: 1968 AREETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVL 2147 AREE G++ QR+ ++KN+GG QG+F++GSRPSW+M+FRERLRIHPQLCDG IVAFTVL Sbjct: 922 AREEMPSGSQLQRMNVYKNIGGSQGIFLTGSRPSWFMVFRERLRIHPQLCDGPIVAFTVL 981 Query: 2148 HNVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIV 2327 HNVNCNHG IYVT+ G LKICQLPSFLSYDNYWPVQKIPLKGTPHQV YFA+KN+Y +IV Sbjct: 982 HNVNCNHGLIYVTALGTLKICQLPSFLSYDNYWPVQKIPLKGTPHQVAYFAEKNVYSVIV 1041 Query: 2328 SFPVLKPLNQVLSSLVDQEAGQQIEND-VSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504 S PVLKPLNQVLSS+ DQE GQQ + D +++EGSYP+EEFEVRI+EP++SGGPW+TRA+I Sbjct: 1042 SVPVLKPLNQVLSSIADQEVGQQFDPDNLNYEGSYPIEEFEVRILEPEKSGGPWKTRASI 1101 Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684 PMQSSENALTVR+VTL+N+ T+ENETLLA+GTAYVQGEDVAARGRVLLFS+++ DNS+T Sbjct: 1102 PMQSSENALTVRMVTLFNTKTKENETLLAVGTAYVQGEDVAARGRVLLFSIDRTADNSRT 1161 Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864 LVSEVYSKELKGAI ALASLQGHLLIASGPKI LHKW GSELNGVAF D PPL+ VS+NI Sbjct: 1162 LVSEVYSKELKGAIPALASLQGHLLIASGPKIILHKWTGSELNGVAFCDYPPLHAVSLNI 1221 Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044 VKNFILLGDIHKSI F++WKE QL+LL+KDF LDCLATEFLIDGSTLSL+VSD+QKN Sbjct: 1222 VKNFILLGDIHKSISFVSWKE--PQLSLLAKDFSPLDCLATEFLIDGSTLSLVVSDDQKN 1279 Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224 +Q+FYYAPK+ SWKGQKLLSRAEFH+G+ +TKF+RLQLLPT+S++ PGSDK NRF Sbjct: 1280 VQIFYYAPKVSESWKGQKLLSRAEFHVGSRITKFLRLQLLPTTSERTATTPGSDKTNRFA 1339 Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404 +FGTL+GS+GCI+PLDELTFRRLQSLQ+KLV AV H+AGLNPRSFRQFRSNG+AHRPGP Sbjct: 1340 TVFGTLEGSLGCIAPLDELTFRRLQSLQKKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGP 1399 Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 DNIVDCELL HYEMLPLEEQLEIAQQ+GTTR QIMSNLN++ LGTSFL Sbjct: 1400 DNIVDCELLSHYEMLPLEEQLEIAQQIGTTRMQIMSNLNDMILGTSFL 1447 >ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like isoform X2 [Citrus sinensis] Length = 1457 Score = 1836 bits (4756), Expect = 0.0 Identities = 905/1190 (76%), Positives = 1035/1190 (86%), Gaps = 8/1190 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQS+SC LALNN+A+S D SQ++ RSSF+VELDAA+ATWL NDVA+LSTKTG+L+ Sbjct: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536 LT+VYDGR+VQRL+LSK+ +VLTS+ITTIG+S FFLGSRLGDSLLVQFTCG+G+ L Sbjct: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 G+KEE GDIE AP KRLRRSSSDALQDMVNGEELSLYGS NN SAQ++FSF VRD Sbjct: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SL+NIGPLKDFS GLR NAD +A+G +KQSNYELVCCSGHGKNGALCVL++SIRPEMIT+ Sbjct: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 567 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHK+ R DSS++AA DDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 568 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 627 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDY+ QG T+AAGNLFGRRRVIQV+ GARILDG++MTQ+LSF NSE GSGSE V Sbjct: 628 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 687 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADPYVLL M+DGSI+LLVGDPS CTVS+ P ESSK+ +S+CTLYHDKGPEPW Sbjct: 688 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 747 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTG+GEAIDG+D DQGDIY +VCYESG LEIFDVPNF+ V++VDKFV Sbjct: 748 LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 807 Query: 1617 SGKANLVDAIILDPT-DSAHKVKTA-QDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790 SG+ ++VD + + DS ++ ++ ++ T G+KE I ++++VELAM RW G HSRPFL Sbjct: 808 SGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFL 867 Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970 F IL+DGTILCY +Y+F+ + SK ++ V++ + ++ ++S SRLRNLRF R+ LD Y Sbjct: 868 FAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFARIPLDAYT 927 Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150 REET G CQRITIFKN+ G QG F+SGSRP W M+FRERLR+HPQLCDG+IVAFTVLH Sbjct: 928 REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 987 Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVS 2330 NVNCNHGFIYVTSQG LKICQLPS +YDNYWPVQKIPLK TPHQ+TYFA+KNLYPLIVS Sbjct: 988 NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1047 Query: 2331 FPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTRA 2498 PVLKPLNQVLS L+DQE G QI+N V +Y VEE+EVRI+EPDR+GGPWQTRA Sbjct: 1048 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1107 Query: 2499 TIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNS 2678 TIPMQSSENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVAARGRVLLFS +N DN Sbjct: 1108 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1167 Query: 2679 QTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSM 2858 Q LV+EVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AF+DAPPLYVVS+ Sbjct: 1168 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1227 Query: 2859 NIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQ 3038 NIVKNFILLGDIHKSIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+VSDEQ Sbjct: 1228 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1287 Query: 3039 KNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINR 3218 KNIQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+ A PGSDK NR Sbjct: 1288 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1347 Query: 3219 FGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRP 3398 F LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVD+V H+AGLNPRSFRQF SNG+AHRP Sbjct: 1348 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1407 Query: 3399 GPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 GPD+IVDCELL HYEMLPLEEQLEIA Q GTTR+QI+SNLN+L LGTSFL Sbjct: 1408 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1457 >ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] gi|557523633|gb|ESR35000.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] Length = 1457 Score = 1835 bits (4752), Expect = 0.0 Identities = 906/1190 (76%), Positives = 1034/1190 (86%), Gaps = 8/1190 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQS+SC LALNN+A+S D SQ++ RSSF+VELDAA+ATWL NDVA+LSTKTG+L+ Sbjct: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536 LT+VYDGR+VQRL+LSK+ +VLTS+ITTIG+S FFLGSRLGDSLLVQFTCG+G+ L Sbjct: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 G KEE GDIE AP KRLRRSSSDALQDMVNGEELSLYGS NN SAQ++FSF VRD Sbjct: 448 SGPKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SL+NIGPLKDFS GLR NAD +A+G +KQSNYELVCCSGHGKNGALCVL++SIRPEMIT+ Sbjct: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 567 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHK+ R DSS++AA DDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 568 VELPGCKGIWTVYHKSSRGHNTDSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 627 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDY+ QG T+AAGNLFGRRRVIQV+ GARILDG++MTQ+LSF NSE GSGSE V Sbjct: 628 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 687 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADPYVLL M+DGSI+LLVGDPS CTVS+ P ESSK+ +SACTLYHDKGPEPW Sbjct: 688 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSACTLYHDKGPEPW 747 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTG+GEAIDG+D DQGDIY +VCYESG LEIFDVPNF+ V++VDKFV Sbjct: 748 LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 807 Query: 1617 SGKANLVDAIILDPT-DSAHKVKTA-QDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790 SG+ ++VD + + DS ++ ++ ++ T G+KE I ++++VELAM RW G HSRPFL Sbjct: 808 SGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFL 867 Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970 F IL+DGTILCY +Y+F+ S+ SK ++ V++ + ++ ++S SRLRNLRF R LD Y Sbjct: 868 FAILTDGTILCYQAYLFEGSENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT 927 Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150 REET G CQRITIFKN+ G QG F+SGSRP W M+FRERLR+HPQLCDG+IVAFTVLH Sbjct: 928 REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 987 Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVS 2330 NVNCNHGFIYVTSQG LKICQLPS +YDNYWPVQKIPLK TPHQ+TYFA+KNLYPLIVS Sbjct: 988 NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1047 Query: 2331 FPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTRA 2498 PVLKPLNQVLS L+DQE G QI+N V +Y VEE+EVRI+EPDR+GGPWQTRA Sbjct: 1048 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1107 Query: 2499 TIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNS 2678 TIPMQSSENALTVRVVTL+N+TT+EN+TLLAIGTAYVQGEDVAARGRVLLFS +N DN Sbjct: 1108 TIPMQSSENALTVRVVTLFNTTTKENDTLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1167 Query: 2679 QTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSM 2858 Q LV+EVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AF+DAPPLYVVS+ Sbjct: 1168 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1227 Query: 2859 NIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQ 3038 NIVKNFILLGDIHKSIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+VSDEQ Sbjct: 1228 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1287 Query: 3039 KNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINR 3218 KNIQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+ A PGSDK NR Sbjct: 1288 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1347 Query: 3219 FGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRP 3398 F LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVD+V H+AGLNPRSFRQF SNG+AHRP Sbjct: 1348 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1407 Query: 3399 GPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 GPD+IVDCELL HYEMLPLEEQLEIA Q GTTR+QI+SNLN+L LGTSFL Sbjct: 1408 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1457 >ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like isoform X1 [Citrus sinensis] Length = 1458 Score = 1832 bits (4744), Expect = 0.0 Identities = 905/1191 (75%), Positives = 1035/1191 (86%), Gaps = 9/1191 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQS+SC LALNN+A+S D SQ++ RSSF+VELDAA+ATWL NDVA+LSTKTG+L+ Sbjct: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536 LT+VYDGR+VQRL+LSK+ +VLTS+ITTIG+S FFLGSRLGDSLLVQFTCG+G+ L Sbjct: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 G+KEE GDIE AP KRLRRSSSDALQDMVNGEELSLYGS NN SAQ++FSF VRD Sbjct: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SL+NIGPLKDFS GLR NAD +A+G +KQSNYELVCCSGHGKNGALCVL++SIRPEMIT+ Sbjct: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 567 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHK+ R DSS++AA DDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 568 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 627 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDY+ QG T+AAGNLFGRRRVIQV+ GARILDG++MTQ+LSF NSE GSGSE V Sbjct: 628 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 687 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADPYVLL M+DGSI+LLVGDPS CTVS+ P ESSK+ +S+CTLYHDKGPEPW Sbjct: 688 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 747 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTG+GEAIDG+D DQGDIY +VCYESG LEIFDVPNF+ V++VDKFV Sbjct: 748 LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 807 Query: 1617 SGKANLVDAIILDPT-DSAHKVKTA-QDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790 SG+ ++VD + + DS ++ ++ ++ T G+KE I ++++VELAM RW G HSRPFL Sbjct: 808 SGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFL 867 Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970 F IL+DGTILCY +Y+F+ + SK ++ V++ + ++ ++S SRLRNLRF R+ LD Y Sbjct: 868 FAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFARIPLDAYT 927 Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150 REET G CQRITIFKN+ G QG F+SGSRP W M+FRERLR+HPQLCDG+IVAFTVLH Sbjct: 928 REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 987 Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQK-IPLKGTPHQVTYFADKNLYPLIV 2327 NVNCNHGFIYVTSQG LKICQLPS +YDNYWPVQK IPLK TPHQ+TYFA+KNLYPLIV Sbjct: 988 NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1047 Query: 2328 SFPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTR 2495 S PVLKPLNQVLS L+DQE G QI+N V +Y VEE+EVRI+EPDR+GGPWQTR Sbjct: 1048 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1107 Query: 2496 ATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDN 2675 ATIPMQSSENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVAARGRVLLFS +N DN Sbjct: 1108 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1167 Query: 2676 SQTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVS 2855 Q LV+EVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AF+DAPPLYVVS Sbjct: 1168 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1227 Query: 2856 MNIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDE 3035 +NIVKNFILLGDIHKSIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+VSDE Sbjct: 1228 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1287 Query: 3036 QKNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKIN 3215 QKNIQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+ A PGSDK N Sbjct: 1288 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1347 Query: 3216 RFGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHR 3395 RF LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVD+V H+AGLNPRSFRQF SNG+AHR Sbjct: 1348 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1407 Query: 3396 PGPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 PGPD+IVDCELL HYEMLPLEEQLEIA Q GTTR+QI+SNLN+L LGTSFL Sbjct: 1408 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1458 >ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] gi|557523632|gb|ESR34999.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] Length = 1458 Score = 1830 bits (4740), Expect = 0.0 Identities = 906/1191 (76%), Positives = 1034/1191 (86%), Gaps = 9/1191 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQS+SC LALNN+A+S D SQ++ RSSF+VELDAA+ATWL NDVA+LSTKTG+L+ Sbjct: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536 LT+VYDGR+VQRL+LSK+ +VLTS+ITTIG+S FFLGSRLGDSLLVQFTCG+G+ L Sbjct: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 G KEE GDIE AP KRLRRSSSDALQDMVNGEELSLYGS NN SAQ++FSF VRD Sbjct: 448 SGPKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SL+NIGPLKDFS GLR NAD +A+G +KQSNYELVCCSGHGKNGALCVL++SIRPEMIT+ Sbjct: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 567 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHK+ R DSS++AA DDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 568 VELPGCKGIWTVYHKSSRGHNTDSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 627 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDY+ QG T+AAGNLFGRRRVIQV+ GARILDG++MTQ+LSF NSE GSGSE V Sbjct: 628 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 687 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADPYVLL M+DGSI+LLVGDPS CTVS+ P ESSK+ +SACTLYHDKGPEPW Sbjct: 688 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSACTLYHDKGPEPW 747 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTG+GEAIDG+D DQGDIY +VCYESG LEIFDVPNF+ V++VDKFV Sbjct: 748 LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 807 Query: 1617 SGKANLVDAIILDPT-DSAHKVKTA-QDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790 SG+ ++VD + + DS ++ ++ ++ T G+KE I ++++VELAM RW G HSRPFL Sbjct: 808 SGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFL 867 Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970 F IL+DGTILCY +Y+F+ S+ SK ++ V++ + ++ ++S SRLRNLRF R LD Y Sbjct: 868 FAILTDGTILCYQAYLFEGSENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT 927 Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150 REET G CQRITIFKN+ G QG F+SGSRP W M+FRERLR+HPQLCDG+IVAFTVLH Sbjct: 928 REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 987 Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQK-IPLKGTPHQVTYFADKNLYPLIV 2327 NVNCNHGFIYVTSQG LKICQLPS +YDNYWPVQK IPLK TPHQ+TYFA+KNLYPLIV Sbjct: 988 NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1047 Query: 2328 SFPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTR 2495 S PVLKPLNQVLS L+DQE G QI+N V +Y VEE+EVRI+EPDR+GGPWQTR Sbjct: 1048 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1107 Query: 2496 ATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDN 2675 ATIPMQSSENALTVRVVTL+N+TT+EN+TLLAIGTAYVQGEDVAARGRVLLFS +N DN Sbjct: 1108 ATIPMQSSENALTVRVVTLFNTTTKENDTLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1167 Query: 2676 SQTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVS 2855 Q LV+EVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AF+DAPPLYVVS Sbjct: 1168 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1227 Query: 2856 MNIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDE 3035 +NIVKNFILLGDIHKSIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+VSDE Sbjct: 1228 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1287 Query: 3036 QKNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKIN 3215 QKNIQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+ A PGSDK N Sbjct: 1288 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1347 Query: 3216 RFGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHR 3395 RF LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVD+V H+AGLNPRSFRQF SNG+AHR Sbjct: 1348 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1407 Query: 3396 PGPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 PGPD+IVDCELL HYEMLPLEEQLEIA Q GTTR+QI+SNLN+L LGTSFL Sbjct: 1408 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1458 >ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative [Ricinus communis] gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative [Ricinus communis] Length = 1461 Score = 1821 bits (4717), Expect = 0.0 Identities = 901/1193 (75%), Positives = 1033/1193 (86%), Gaps = 11/1193 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ LIWS NLPHDAYKLLAVP PIGGVLV+CAN Sbjct: 269 LTWAGRVSWKHHTCMISALSISTTLKQPTLIWSVVNLPHDAYKLLAVPPPIGGVLVICAN 328 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHS+S++ LALNN+A+S D SQ++ R+SF+VELDA A WL NDVA+LS K GELL Sbjct: 329 TIHYHSESATYALALNNYAVSIDSSQELPRASFSVELDAVKAAWLLNDVALLSAKNGELL 388 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536 L+LVYDGR+VQRL+LSKS+A+VLTS+ITTIG+S FFLGSRLGDSLLVQFT G G + Sbjct: 389 LLSLVYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRLGDSLLVQFTNGLGPSVVS 448 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 G+KEEVG+IEG P AKRL+RS+SD LQDMV+GEELSLYGS NN SAQ+SFSF VRD Sbjct: 449 SGLKEEVGEIEGDVPSAKRLKRSASDGLQDMVSGEELSLYGSTANNTESAQKSFSFAVRD 508 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SLIN+GPLKDFS GLR+N D +A+G AKQSNY+LVCCSGHGKNG LC+L++SIRPEMIT+ Sbjct: 509 SLINVGPLKDFSYGLRSNYDASATGIAKQSNYDLVCCSGHGKNGTLCILRQSIRPEMITE 568 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGC+G+WTVYHKN R D SK+AA DEYHAYLIIS+E RTMVL+TA+ L EVTE Sbjct: 569 VDLPGCRGIWTVYHKNARGHNVDLSKMAAAADEYHAYLIISMEARTMVLETADLLSEVTE 628 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDY+ QG T+AAGNLFGRRRVIQV+ GARILDG+FMTQ+LS NSE GSE V Sbjct: 629 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSFMTQDLSIGSSNSESSPGSESATV 688 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SSVSIADPYVL++MTDGSI+LL+GD S C VSI P FE+S+RS+SACTLYHDKGPEPW Sbjct: 689 SSVSIADPYVLIKMTDGSIRLLIGDSSTCMVSINTPSAFENSERSVSACTLYHDKGPEPW 748 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAAS---HDQGDIYCIVCYESGVLEIFDVPNFSSVYSVD 1607 LR+ STDAWLSTG+ EAIDG+++A HDQGDIYCIVCYESG LEIFDVPNF+ V+SVD Sbjct: 749 LRKASTDAWLSTGVSEAIDGAESADGGPHDQGDIYCIVCYESGALEIFDVPNFNRVFSVD 808 Query: 1608 KFVSGKANLVDAIILDPT-DSAHKV-KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSR 1781 KFVSGK +L DA + +P DS K + +++V G+KE N++ VELAM RW G HSR Sbjct: 809 KFVSGKTHLADAYVREPPKDSQEKTNRISEEVAGLGRKENAHNMKAVELAMQRWSGHHSR 868 Query: 1782 PFLFGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLD 1961 PFLFG+L+DGTILCYH+Y+F++ DA SK E++V++Q P + S+S SRLRNLRF+RV LD Sbjct: 869 PFLFGVLTDGTILCYHAYLFEAPDATSKTEDSVSAQNPVGLGSISASRLRNLRFVRVPLD 928 Query: 1962 TYAREETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFT 2141 +Y +EETS CQRITIF N+ G QG F+ GSRP+W+M+FRERLR+HPQLCDG+IVAFT Sbjct: 929 SYIKEETSTENSCQRITIFNNISGHQGFFLLGSRPAWFMVFRERLRVHPQLCDGSIVAFT 988 Query: 2142 VLHNVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPL 2321 VLHNVNCNHG IYVTSQGNLKICQLPSF +YDNYWPVQKIPLKGTPHQVTYF +KNLYPL Sbjct: 989 VLHNVNCNHGLIYVTSQGNLKICQLPSFSNYDNYWPVQKIPLKGTPHQVTYFPEKNLYPL 1048 Query: 2322 IVSFPVLKPLNQVLSSLVDQEAGQQIEN-DVSFEG---SYPVEEFEVRIMEPDRSGGPWQ 2489 IVS PV KP+NQVLSSLVDQE G QIEN ++S + +Y VEEFEVRI+E + GGPWQ Sbjct: 1049 IVSVPVHKPVNQVLSSLVDQEVGHQIENHNLSSDELLQTYSVEEFEVRILESENGGGPWQ 1108 Query: 2490 TRATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNN 2669 T+ATIPMQSSENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVAARGRVLLFSV K+ Sbjct: 1109 TKATIPMQSSENALTVRVVTLFNATTKENETLLAIGTAYVQGEDVAARGRVLLFSVVKST 1168 Query: 2670 DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYV 2849 +NSQ LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNGVAF+DAPPLYV Sbjct: 1169 ENSQVLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGVAFYDAPPLYV 1228 Query: 2850 VSMNIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVS 3029 SMNIVKNFILLGDIHKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSL+VS Sbjct: 1229 ASMNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLVVS 1288 Query: 3030 DEQKNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDK 3209 DEQKNIQ+FYYAPK+L SWKGQKLLSRAEFH+GAH+TKF+RL +L TSSD+ A PG DK Sbjct: 1289 DEQKNIQIFYYAPKMLESWKGQKLLSRAEFHVGAHITKFIRLSMLSTSSDRSGAAPGPDK 1348 Query: 3210 INRFGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRA 3389 NRF LLFGTLDGSIGCI+PLDELTFRRLQSLQRKLVDAV H+AGLNPRSFRQFRS+G+ Sbjct: 1349 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQRKLVDAVPHVAGLNPRSFRQFRSDGKV 1408 Query: 3390 HRPGPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 HRPGP++IVDCELL H+EMLPLEEQLEIAQQVGTTR QI+SNLN+L+LGTSFL Sbjct: 1409 HRPGPESIVDCELLSHFEMLPLEEQLEIAQQVGTTRAQILSNLNDLSLGTSFL 1461 >ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Glycine max] Length = 1447 Score = 1812 bits (4694), Expect = 0.0 Identities = 897/1188 (75%), Positives = 1037/1188 (87%), Gaps = 6/1188 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 264 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVISAN 323 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQS+SC LALN++A++ D SQ++ RSSFNVELDAANATWL +DVA+LSTKTGELL Sbjct: 324 TIHYHSQSASCALALNSYAVTLDSSQEIPRSSFNVELDAANATWLLSDVALLSTKTGELL 383 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536 LTLVYDGR+VQRL+LSKS+A+VL+S ITTIG+S FFL SRLGDS+LVQF+CG+G L Sbjct: 384 LLTLVYDGRVVQRLDLSKSKASVLSSGITTIGNSLFFLASRLGDSMLVQFSCGSGVSMLS 443 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 +KEEVGDIE AP +KRLRRS SDALQDMV+GEELSLYGS PN SAQ+SFSF VRD Sbjct: 444 SNLKEEVGDIEADAP-SKRLRRSPSDALQDMVSGEELSLYGSAPNRTESAQKSFSFAVRD 502 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNG+LCVL++SIRPE+IT+ Sbjct: 503 SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGSLCVLRQSIRPEVITE 562 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHK+ RS DSSK+A DDDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 563 VELPGCKGIWTVYHKSTRSHNADSSKMADDDDEYHAYLIISLEARTMVLETADLLSEVTE 622 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDYY QG T+AAGNLFGR RVIQVY GARILDG+FMTQ++SF N E GS S+ + Sbjct: 623 SVDYYVQGKTLAAGNLFGRCRVIQVYERGARILDGSFMTQDVSFGASNLESGSASDSAIA 682 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADP+VLLRM+DGSI+LL+GDPS CT+S+ +P FESSK S+S+CTLYHDKGPEPW Sbjct: 683 LSVSIADPFVLLRMSDGSIRLLIGDPSTCTISVTSPASFESSKGSVSSCTLYHDKGPEPW 742 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTG+GE IDG+D A+ D GDIYC+VC+++G LEIFDVPNF+ V+SV+ F+ Sbjct: 743 LRKTSTDAWLSTGVGETIDGTDGAAQDHGDIYCVVCFDNGNLEIFDVPNFNCVFSVENFM 802 Query: 1617 SGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796 SGK++LVDA++ + + K V G+KE I ++++VELAM RW GQHSRPFLFG Sbjct: 803 SGKSHLVDALMKEVLKDS-KQGDRDGVINQGRKENIPDMKVVELAMQRWSGQHSRPFLFG 861 Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976 ILSDGTILCYH+Y+++S D+ SKVE++ ++ + S + SRLRNLRF+RV LD YARE Sbjct: 862 ILSDGTILCYHAYLYESPDSTSKVEDSASAGGSIGLSSTNVSRLRNLRFVRVPLDAYARE 921 Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156 +TS G CQ+ITIFKN+G +G F+SGSRP+W M+ RERLR+HPQLCDG+IVAFTVLHNV Sbjct: 922 DTSNGPPCQQITIFKNIGSYEGFFLSGSRPAWVMVLRERLRVHPQLCDGSIVAFTVLHNV 981 Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336 NCN G IYVTSQG LKICQLPS +YD+YWPVQKIPLK TPHQVTYFA+KNLYPLIVSFP Sbjct: 982 NCNQGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPLKATPHQVTYFAEKNLYPLIVSFP 1041 Query: 2337 VLKPLNQVLSSLVDQEAGQQIE----NDVSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504 VLKPLNQV+ SLVDQ+ Q E N YP++EFEVRIMEP++SGGPWQT+ATI Sbjct: 1042 VLKPLNQVI-SLVDQDINHQNESQNMNPDEQNRFYPIDEFEVRIMEPEKSGGPWQTKATI 1100 Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684 PMQSSENALTVR+VTL N+T++ENETLLAIGTAYVQGEDVAARGR+LLFS+ KN DN QT Sbjct: 1101 PMQSSENALTVRMVTLVNTTSKENETLLAIGTAYVQGEDVAARGRILLFSLGKNTDNPQT 1160 Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864 LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AFFDAPPL+VVS+NI Sbjct: 1161 LVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWNGTELNGIAFFDAPPLHVVSLNI 1220 Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044 VKNFIL+GDIHKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSD+ +N Sbjct: 1221 VKNFILIGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDDNRN 1280 Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224 IQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+L T SD+ AVPGSDK NRF Sbjct: 1281 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLST-SDRAGAVPGSDKTNRFA 1339 Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404 LLFGTLDGSIGCI+PLDE+TFRRLQSLQRKLVDAV H+AGLNPR+FR FRSNG+AHRPGP Sbjct: 1340 LLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGP 1399 Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 D+IVDCELLCHYEMLPLEEQLEIA QVGTTR+QI+SNL++L+LGTSFL Sbjct: 1400 DSIVDCELLCHYEMLPLEEQLEIAHQVGTTRSQILSNLSDLSLGTSFL 1447 >ref|XP_006587381.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like isoform X2 [Glycine max] Length = 1217 Score = 1810 bits (4688), Expect = 0.0 Identities = 895/1188 (75%), Positives = 1036/1188 (87%), Gaps = 6/1188 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSW HHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 34 LTWAGRVSWTHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVIGAN 93 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQS+SC LALNN+A++ D SQ++ RSSFNVELDAANATWL +DVA+LSTKTGELL Sbjct: 94 TIHYHSQSASCALALNNYAVTLDSSQEIPRSSFNVELDAANATWLLSDVALLSTKTGELL 153 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536 L LVYDGR+VQRL+LSKS+A+VL+S ITTIG+S FFL SRLGDS+LVQF+CG+G + Sbjct: 154 LLMLVYDGRVVQRLDLSKSKASVLSSGITTIGNSLFFLASRLGDSMLVQFSCGSGVSMMS 213 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 +KEEVGDIE AP +KRLRRS SDALQDMV+GEELSLYGS N SAQ+SFSF VRD Sbjct: 214 SNLKEEVGDIEVDAP-SKRLRRSPSDALQDMVSGEELSLYGSATNRTESAQKSFSFAVRD 272 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNG+LCVL++SIRPE+IT+ Sbjct: 273 SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGSLCVLRQSIRPEVITE 332 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHK+ RS DSSK+A DDDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 333 VELPGCKGIWTVYHKSTRSHNADSSKMADDDDEYHAYLIISLEARTMVLETADLLSEVTE 392 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDYY QG T+AAGNLFGRRRVIQVY GARILDG+FMTQ++SF NSE GS SE + Sbjct: 393 SVDYYVQGKTLAAGNLFGRRRVIQVYERGARILDGSFMTQDVSFGASNSESGSASESAIA 452 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADP+VLLRM+DGSI+LL+GDPS CT+S+ +P FESSK S+S+CTLYHDKGPEPW Sbjct: 453 LSVSIADPFVLLRMSDGSIRLLIGDPSTCTISVTSPASFESSKGSVSSCTLYHDKGPEPW 512 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTG+GEAIDG+D A+ D GDIYC+VC+++G LEIFD+PNF+ V+SV+ F+ Sbjct: 513 LRKTSTDAWLSTGVGEAIDGTDGAAQDHGDIYCVVCFDNGNLEIFDIPNFNCVFSVENFM 572 Query: 1617 SGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796 SGK++LVDA++ + + K V G+K+ I N+++VELAM RW GQHSRPFLFG Sbjct: 573 SGKSHLVDALMKEVLKDS-KQGDRDGVVNQGRKDNIPNMKVVELAMQRWSGQHSRPFLFG 631 Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976 ILSDGTILCYH+Y+++S D SKVE++ ++ + S + SRLRNLRF+RV LD Y RE Sbjct: 632 ILSDGTILCYHAYLYESPDGTSKVEDSASAGGSIGLSSTNVSRLRNLRFVRVPLDAYPRE 691 Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156 +TS G+ CQ+ITIFKN+G QG F+SGSRP+W M+ RERLR+HPQLCDG+IVAFTVLHNV Sbjct: 692 DTSNGSPCQQITIFKNIGSYQGFFLSGSRPAWVMVLRERLRVHPQLCDGSIVAFTVLHNV 751 Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336 NCNHG IYVTSQG LKICQLPS +YD+YWPVQKIPLK TPHQVTYFA+KNLYPLIVSFP Sbjct: 752 NCNHGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPLKATPHQVTYFAEKNLYPLIVSFP 811 Query: 2337 VLKPLNQVLSSLVDQEAGQQIE----NDVSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504 VLKPLNQV+ SLVDQ+ Q E N YP++EFEVRIMEP++SGGPWQT+ATI Sbjct: 812 VLKPLNQVI-SLVDQDFNHQNESQNMNPDEQNRFYPIDEFEVRIMEPEKSGGPWQTKATI 870 Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684 PMQSSENALTVR+VTL N+T++ENETLLAIGTAYVQGEDVAARGR+LLFS+ K DN QT Sbjct: 871 PMQSSENALTVRMVTLLNTTSKENETLLAIGTAYVQGEDVAARGRILLFSLGKITDNPQT 930 Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864 LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AFFDAPPL+VVS+NI Sbjct: 931 LVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWNGTELNGIAFFDAPPLHVVSLNI 990 Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044 VKNFIL+GDIHKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSD+ +N Sbjct: 991 VKNFILIGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDDNRN 1050 Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224 IQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+L T SD+ +VPGSDK NRF Sbjct: 1051 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLST-SDRAGSVPGSDKTNRFA 1109 Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404 LLFGTLDGSIGCI+PLDE+TFRRLQSLQRKLVDAV H+AGLNPR+FR FRSNG+AHRPGP Sbjct: 1110 LLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGP 1169 Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 D+IVDCELLCHYEMLPLEEQLEIA Q+GTTR+QI+SNL++L+LGTSFL Sbjct: 1170 DSIVDCELLCHYEMLPLEEQLEIANQIGTTRSQILSNLSDLSLGTSFL 1217 >ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like isoform X1 [Glycine max] Length = 1449 Score = 1810 bits (4688), Expect = 0.0 Identities = 895/1188 (75%), Positives = 1036/1188 (87%), Gaps = 6/1188 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSW HHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 266 LTWAGRVSWTHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVIGAN 325 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQS+SC LALNN+A++ D SQ++ RSSFNVELDAANATWL +DVA+LSTKTGELL Sbjct: 326 TIHYHSQSASCALALNNYAVTLDSSQEIPRSSFNVELDAANATWLLSDVALLSTKTGELL 385 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536 L LVYDGR+VQRL+LSKS+A+VL+S ITTIG+S FFL SRLGDS+LVQF+CG+G + Sbjct: 386 LLMLVYDGRVVQRLDLSKSKASVLSSGITTIGNSLFFLASRLGDSMLVQFSCGSGVSMMS 445 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 +KEEVGDIE AP +KRLRRS SDALQDMV+GEELSLYGS N SAQ+SFSF VRD Sbjct: 446 SNLKEEVGDIEVDAP-SKRLRRSPSDALQDMVSGEELSLYGSATNRTESAQKSFSFAVRD 504 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNG+LCVL++SIRPE+IT+ Sbjct: 505 SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGSLCVLRQSIRPEVITE 564 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHK+ RS DSSK+A DDDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 565 VELPGCKGIWTVYHKSTRSHNADSSKMADDDDEYHAYLIISLEARTMVLETADLLSEVTE 624 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDYY QG T+AAGNLFGRRRVIQVY GARILDG+FMTQ++SF NSE GS SE + Sbjct: 625 SVDYYVQGKTLAAGNLFGRRRVIQVYERGARILDGSFMTQDVSFGASNSESGSASESAIA 684 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADP+VLLRM+DGSI+LL+GDPS CT+S+ +P FESSK S+S+CTLYHDKGPEPW Sbjct: 685 LSVSIADPFVLLRMSDGSIRLLIGDPSTCTISVTSPASFESSKGSVSSCTLYHDKGPEPW 744 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTG+GEAIDG+D A+ D GDIYC+VC+++G LEIFD+PNF+ V+SV+ F+ Sbjct: 745 LRKTSTDAWLSTGVGEAIDGTDGAAQDHGDIYCVVCFDNGNLEIFDIPNFNCVFSVENFM 804 Query: 1617 SGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796 SGK++LVDA++ + + K V G+K+ I N+++VELAM RW GQHSRPFLFG Sbjct: 805 SGKSHLVDALMKEVLKDS-KQGDRDGVVNQGRKDNIPNMKVVELAMQRWSGQHSRPFLFG 863 Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976 ILSDGTILCYH+Y+++S D SKVE++ ++ + S + SRLRNLRF+RV LD Y RE Sbjct: 864 ILSDGTILCYHAYLYESPDGTSKVEDSASAGGSIGLSSTNVSRLRNLRFVRVPLDAYPRE 923 Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156 +TS G+ CQ+ITIFKN+G QG F+SGSRP+W M+ RERLR+HPQLCDG+IVAFTVLHNV Sbjct: 924 DTSNGSPCQQITIFKNIGSYQGFFLSGSRPAWVMVLRERLRVHPQLCDGSIVAFTVLHNV 983 Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336 NCNHG IYVTSQG LKICQLPS +YD+YWPVQKIPLK TPHQVTYFA+KNLYPLIVSFP Sbjct: 984 NCNHGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPLKATPHQVTYFAEKNLYPLIVSFP 1043 Query: 2337 VLKPLNQVLSSLVDQEAGQQIE----NDVSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504 VLKPLNQV+ SLVDQ+ Q E N YP++EFEVRIMEP++SGGPWQT+ATI Sbjct: 1044 VLKPLNQVI-SLVDQDFNHQNESQNMNPDEQNRFYPIDEFEVRIMEPEKSGGPWQTKATI 1102 Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684 PMQSSENALTVR+VTL N+T++ENETLLAIGTAYVQGEDVAARGR+LLFS+ K DN QT Sbjct: 1103 PMQSSENALTVRMVTLLNTTSKENETLLAIGTAYVQGEDVAARGRILLFSLGKITDNPQT 1162 Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864 LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AFFDAPPL+VVS+NI Sbjct: 1163 LVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWNGTELNGIAFFDAPPLHVVSLNI 1222 Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044 VKNFIL+GDIHKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSD+ +N Sbjct: 1223 VKNFILIGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDDNRN 1282 Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224 IQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+L T SD+ +VPGSDK NRF Sbjct: 1283 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLST-SDRAGSVPGSDKTNRFA 1341 Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404 LLFGTLDGSIGCI+PLDE+TFRRLQSLQRKLVDAV H+AGLNPR+FR FRSNG+AHRPGP Sbjct: 1342 LLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGP 1401 Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 D+IVDCELLCHYEMLPLEEQLEIA Q+GTTR+QI+SNL++L+LGTSFL Sbjct: 1402 DSIVDCELLCHYEMLPLEEQLEIANQIGTTRSQILSNLSDLSLGTSFL 1449 >gb|ESW24391.1| hypothetical protein PHAVU_004G126600g [Phaseolus vulgaris] Length = 1445 Score = 1804 bits (4672), Expect = 0.0 Identities = 888/1188 (74%), Positives = 1039/1188 (87%), Gaps = 6/1188 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 262 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVIGAN 321 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 T+HYHSQS+SC LALN++A+S D SQ++ RSSFNVELD+ANATWL +DVA+LSTKTGELL Sbjct: 322 TVHYHSQSASCALALNSYAVSLDNSQEIPRSSFNVELDSANATWLLSDVALLSTKTGELL 381 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536 LTLVYDGR+VQRL+LSKS+A+VL+S ITTIG+S FFL SRLGDS+LVQF+CG+G L Sbjct: 382 LLTLVYDGRVVQRLDLSKSKASVLSSGITTIGNSLFFLASRLGDSMLVQFSCGSGGSMLS 441 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 +KEEVGDIE AP +KRLRRS SD LQD+V+GEELSLYGS PN SAQ+SFSF VRD Sbjct: 442 SNLKEEVGDIEADAP-SKRLRRSPSDTLQDVVSGEELSLYGSAPNRTESAQKSFSFAVRD 500 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNG+LCVL++SIRPE+IT+ Sbjct: 501 SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGSLCVLRQSIRPEVITE 560 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHK+ RS DSSK+A DDDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 561 VELPGCKGIWTVYHKSTRSHNTDSSKLADDDDEYHAYLIISLEARTMVLETADLLSEVTE 620 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDYY QG T+AAGNLFGRRRVIQVY GARILDG+FMTQ+++F NSE S SE + Sbjct: 621 SVDYYVQGKTLAAGNLFGRRRVIQVYERGARILDGSFMTQDVTFGASNSESASASESAIA 680 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADP+VLLRM+DGS++LL+GDP CT+S+ +P FES+K S+S+CTLYHDKGPEPW Sbjct: 681 LSVSIADPFVLLRMSDGSVRLLIGDPITCTISVTSPASFESTKGSVSSCTLYHDKGPEPW 740 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTG+GEAIDG+D A+ D GDIYC+VC+++G LEIFDVPNF+ V+SV F+ Sbjct: 741 LRKTSTDAWLSTGVGEAIDGTDGAAQDHGDIYCVVCFDNGNLEIFDVPNFNCVFSVGNFM 800 Query: 1617 SGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796 SGK++LVDA++ + + K V G+KE + ++++VELAM RW GQHSRPFLFG Sbjct: 801 SGKSHLVDALMKEVLKDSKK-GDRDGVIIQGRKENVPDMKVVELAMQRWSGQHSRPFLFG 859 Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976 ILSDGTILCYH+Y+++S D SKVE++ ++ + + + SRLRNLRF+RV+LD YARE Sbjct: 860 ILSDGTILCYHAYLYESPDGTSKVEDSASAGGSIGLGTTNISRLRNLRFVRVSLDAYARE 919 Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156 ETS G+ Q+ITIFKN+G QG F+SGSRP+W M+ RERLR+HPQLCDG+IVAFTVLHNV Sbjct: 920 ETSNGSLHQQITIFKNIGSYQGFFLSGSRPAWVMVLRERLRVHPQLCDGSIVAFTVLHNV 979 Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336 NCNHG IYVTSQG LKICQLPS +YD+YWPVQKIPLK TPHQVTYFA+KNLYPLIVSFP Sbjct: 980 NCNHGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPLKATPHQVTYFAEKNLYPLIVSFP 1039 Query: 2337 VLKPLNQVLSSLVDQEAGQQIE----NDVSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504 VLKPL+QV+ SLVDQ+ Q E N YP++EFEVRIMEP++SGGPWQT+ATI Sbjct: 1040 VLKPLSQVI-SLVDQDVNHQNESQNMNSDEQNRFYPIDEFEVRIMEPEKSGGPWQTKATI 1098 Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684 PMQSSENALTVR+VTL N+T++ENETLLAIGTAYVQGEDVAARGR+LLFS+ KN DN Q+ Sbjct: 1099 PMQSSENALTVRMVTLLNTTSKENETLLAIGTAYVQGEDVAARGRILLFSLGKNTDNPQS 1158 Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864 LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AFFDAPPL+VVS+NI Sbjct: 1159 LVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWNGTELNGIAFFDAPPLHVVSLNI 1218 Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044 VKNFIL+GDIHKSIYFL+WKEQG+QL+LL+KDF LDC ATEFLIDGSTLSLMVSD+++N Sbjct: 1219 VKNFILIGDIHKSIYFLSWKEQGAQLSLLAKDFSSLDCFATEFLIDGSTLSLMVSDDKRN 1278 Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224 IQ+FYYAPK+ SWKGQKLLSRAEFH+GAHVTKF+RLQ+LPT SD+ + PGSDK NRF Sbjct: 1279 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLPT-SDRAGSAPGSDKTNRFA 1337 Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404 LLFGTLDGSIGCI+PLDE+TFRRLQSLQ+KLVDAVAH+AGLNPR+FR+F+SNG+AHRPGP Sbjct: 1338 LLFGTLDGSIGCIAPLDEITFRRLQSLQKKLVDAVAHVAGLNPRAFRKFQSNGKAHRPGP 1397 Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 D+IVDCELLCHYEMLPLEEQLEIA QVGTTR+QI+SNL++L+LGTSFL Sbjct: 1398 DSIVDCELLCHYEMLPLEEQLEIAHQVGTTRSQILSNLSDLSLGTSFL 1445 >gb|EXC20897.1| Cleavage and polyadenylation specificity factor subunit 1 [Morus notabilis] Length = 1479 Score = 1798 bits (4658), Expect = 0.0 Identities = 893/1220 (73%), Positives = 1036/1220 (84%), Gaps = 38/1220 (3%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRV WKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+CAN Sbjct: 270 LTWAGRVLWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVICAN 329 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 T+HY SQS+SC LALN++A+S D SQ+M R+ F+VELDAANATWL+NDV +LSTK GELL Sbjct: 330 TLHYQSQSNSCTLALNSYAVSVDSSQEMRRAPFSVELDAANATWLSNDVVLLSTKAGELL 389 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536 LTLVYDGR+VQRL+LSKS+A+VLTS ITTIG+S FFLGSRLGDSLLVQFT G G+ L Sbjct: 390 LLTLVYDGRVVQRLDLSKSKASVLTSGITTIGNSLFFLGSRLGDSLLVQFTYGLGTSMLS 449 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 G+K+EVGDIEG A LAKRLRRSSSD LQDM +GEELSLY S PNN+ S Q+SFSFTVRD Sbjct: 450 SGLKDEVGDIEGDAHLAKRLRRSSSDVLQDMTSGEELSLYVSAPNNSESTQKSFSFTVRD 509 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SL+N+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNGALCVL++SIRPEMIT+ Sbjct: 510 SLVNVGPLKDFSYGLRINADPNATGVAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 569 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHK+ RS DSSK+ A DDEYHAYLIISLE RTMVL+TA+ L EVTE Sbjct: 570 VELPGCKGIWTVYHKSTRSH--DSSKLVAADDEYHAYLIISLEARTMVLETADLLTEVTE 627 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDYY QG T+AAGNLFGRRRV+QVY GARILDG+FMTQ+LSF SE SGSE +V Sbjct: 628 SVDYYVQGRTIAAGNLFGRRRVVQVYERGARILDGSFMTQDLSFGPAPSESSSGSENAVV 687 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 +SVSIADPYV+LRM+DGSI+LLVGDP++CTVS+ P FESSK ISACTLY DKGPEPW Sbjct: 688 TSVSIADPYVVLRMSDGSIRLLVGDPTSCTVSVSTPADFESSKSIISACTLYRDKGPEPW 747 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTG+ EAIDG+D DQGDIYC+VCYESG L+I+DVP+F+ V+SVD F+ Sbjct: 748 LRKTSTDAWLSTGVDEAIDGADETLQDQGDIYCVVCYESGSLDIYDVPSFNYVFSVDNFI 807 Query: 1617 SGKANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790 SG+ +LVDA + + K K +++ G+KE +QN++IVELAM RW G+HSRPFL Sbjct: 808 SGRPHLVDAFVQEQPKDLQKATNKNSEESAGQGRKENVQNMKIVELAMQRWSGKHSRPFL 867 Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970 GIL+DG+ILCYH+Y+F+ ++ S+ E++V+S ++ SGSRLRNLRF+RV LD+YA Sbjct: 868 LGILTDGSILCYHAYLFEGPESTSRTEDSVSS------RNSSGSRLRNLRFVRVPLDSYA 921 Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150 REETS G CQRI++FKN+ G QGLF+SGSRP+W+M+FRERLR+HPQLCDG+IVAFTVLH Sbjct: 922 REETSDGMPCQRISVFKNIAGYQGLFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLH 981 Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQK-IPLKGTPHQVTYFADKNLYPLIV 2327 NVNCNHGFIYVTS+G LKICQLPS SYDNYWPVQK IPLKGTPHQVTYFA++NLYPLIV Sbjct: 982 NVNCNHGFIYVTSEGILKICQLPSITSYDNYWPVQKVIPLKGTPHQVTYFAERNLYPLIV 1041 Query: 2328 SFPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTR 2495 S PV KPLNQV+SSL+DQE G Q EN +Y ++EFEVRI+EP+RSGGPWQT+ Sbjct: 1042 SVPVPKPLNQVMSSLLDQEVGHQFENPNLSPDDLNRTYTIDEFEVRILEPERSGGPWQTK 1101 Query: 2496 ATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLL--------- 2648 TIPMQSSENALT+RVVTL+N+TT ENETLLAIGTAYVQGEDVAARGR++L Sbjct: 1102 VTIPMQSSENALTIRVVTLFNTTTNENETLLAIGTAYVQGEDVAARGRIILRALAPWWER 1161 Query: 2649 --------------------FSVEKNNDNSQTLVSEVYSKELKGAISALASLQGHLLIAS 2768 F ++ + + Q VSE+YSKELKGAISALASLQGHLLIAS Sbjct: 1162 LHLHPGSRVQIPEMASPSGVFKIDSADFHLQ--VSEIYSKELKGAISALASLQGHLLIAS 1219 Query: 2769 GPKITLHKWIGSELNGVAFFDAPPLYVVSMNIVKNFILLGDIHKSIYFLNWKEQGSQLNL 2948 GPKI LHKW G+ELNG+AFFDAPPLYVVS+NIVKNFIL+GD+HKSIYFL+WKEQG+QL+L Sbjct: 1220 GPKIILHKWTGTELNGIAFFDAPPLYVVSLNIVKNFILIGDVHKSIYFLSWKEQGAQLSL 1279 Query: 2949 LSKDFGFLDCLATEFLIDGSTLSLMVSDEQKNIQVFYYAPKLLNSWKGQKLLSRAEFHIG 3128 L+KDFG LDC ATEFLIDGSTLSL+VSD+QKNIQ+FYYAPK+ SWKGQ+LLSRAEFH+G Sbjct: 1280 LAKDFGSLDCFATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMSESWKGQRLLSRAEFHVG 1339 Query: 3129 AHVTKFMRLQLLPTSSDQINAVPGSDKINRFGLLFGTLDGSIGCISPLDELTFRRLQSLQ 3308 AHVTKF+RLQ+LPTS+D+ + PGSDK NRF LLFG LDGSIGCI+PLDELTFRRLQSLQ Sbjct: 1340 AHVTKFLRLQMLPTSTDRTGSTPGSDKTNRFALLFGALDGSIGCIAPLDELTFRRLQSLQ 1399 Query: 3309 RKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGPDNIVDCELLCHYEMLPLEEQLEIAQQVG 3488 +KLVDAV H+AGLNPRSFRQF SNG+AHRPGPD+IVDCELLCHYEMLPLEEQLEIA +G Sbjct: 1400 KKLVDAVPHVAGLNPRSFRQFCSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHLIG 1459 Query: 3489 TTRTQIMSNLNELTLGTSFL 3548 TTR+QI+SNLN+L LGTSFL Sbjct: 1460 TTRSQILSNLNDLFLGTSFL 1479 >ref|XP_002318462.2| cleavage and polyadenylation specificity factor family protein [Populus trichocarpa] gi|550326263|gb|EEE96682.2| cleavage and polyadenylation specificity factor family protein [Populus trichocarpa] Length = 1455 Score = 1795 bits (4648), Expect = 0.0 Identities = 892/1193 (74%), Positives = 1027/1193 (86%), Gaps = 11/1193 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRV WKHHTCM+SALSISTTLKQ LIWS NLPHDAYKLLAVPSPIGGVLV+ N Sbjct: 268 LTWAGRVVWKHHTCMISALSISTTLKQPTLIWSIGNLPHDAYKLLAVPSPIGGVLVIGVN 327 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHS+S+SC LALN++A S D SQ++ R++F+VELDAANATWL DVA+LSTKTGELL Sbjct: 328 TIHYHSESASCALALNSYAASVDSSQELPRATFSVELDAANATWLLKDVALLSTKTGELL 387 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGSLPL- 539 LTLVYDGR+VQRL+LSKS+A+VLTS+ITT+G+SFFFLGSRLGDSLLVQFT G GS L Sbjct: 388 LLTLVYDGRVVQRLDLSKSKASVLTSDITTLGNSFFFLGSRLGDSLLVQFTSGLGSSMLS 447 Query: 540 -GVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQ-----ESFS 701 G+KEEVGDIEG P AKRL+ SSSDALQDMV+GEELSLY S PNNA S+Q ++FS Sbjct: 448 PGLKEEVGDIEGDLPSAKRLKVSSSDALQDMVSGEELSLYSSAPNNAESSQVVSVIKTFS 507 Query: 702 FTVRDSLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRP 881 FTVRDSLIN+GPLKDF+ GLR NAD NA+G +KQSNYELVCCSGHGKNGALCVLQ+SIRP Sbjct: 508 FTVRDSLINVGPLKDFAYGLRINADANATGISKQSNYELVCCSGHGKNGALCVLQQSIRP 567 Query: 882 EMITQESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNL 1061 EMIT+ LPGCKG+WTVYHKN RS DS K+A+DD EYHAYLIIS+E RTMVL+TA++L Sbjct: 568 EMITEVELPGCKGIWTVYHKNARSHSVDSLKMASDD-EYHAYLIISMEARTMVLETADHL 626 Query: 1062 EEVTENVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGS 1241 EVTE+VDY+ QG T+AAGNLFGRRRV+QV+ GARILDG+FMTQ+LSF NSE G S Sbjct: 627 TEVTESVDYFVQGRTIAAGNLFGRRRVVQVFERGARILDGSFMTQDLSFGGSNSETGR-S 685 Query: 1242 EGQMVSSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDK 1421 E V VSI DPYVL+RM DGSIQ+LVGDPSACTVS+ P F+SS +S+SACTLYHDK Sbjct: 686 ESSTVMHVSIVDPYVLVRMADGSIQILVGDPSACTVSVNTPSAFQSSTKSVSACTLYHDK 745 Query: 1422 GPEPWLRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYS 1601 GPEPWLR+TSTDAWLSTGI EAIDG+D+ +H+QGDIYC+VCYE+G LEIFDVPNF+SV+ Sbjct: 746 GPEPWLRKTSTDAWLSTGISEAIDGADSGAHEQGDIYCVVCYETGALEIFDVPNFNSVFF 805 Query: 1602 VDKFVSGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSR 1781 VDKFVSGK +L+D +P K ++V G+KE QN+++VEL M RW G+HSR Sbjct: 806 VDKFVSGKTHLLDTCTGEPAKDMMK-GVKEEVAGAGRKESTQNMKVVELTMLRWSGRHSR 864 Query: 1782 PFLFGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLD 1961 PFLFGIL+DGTILCYH+Y+F+ D SK+E++V++Q ++S SRLRNLRF+RV LD Sbjct: 865 PFLFGILTDGTILCYHAYLFEGPDGTSKLEDSVSAQNSVGASTISASRLRNLRFVRVPLD 924 Query: 1962 TYAREETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFT 2141 TY REETS T CQRIT FKN+ G QG F+SGSRP+W+M+FRERLR+HPQLCDG+IVAFT Sbjct: 925 TYTREETSSETSCQRITTFKNISGYQGFFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFT 984 Query: 2142 VLHNVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPL 2321 VLH VNCNHG IYVTSQGNLKIC L S SYDNYWPVQKIPLKGTPHQVTYFA++NLYPL Sbjct: 985 VLHTVNCNHGLIYVTSQGNLKICHLSSVSSYDNYWPVQKIPLKGTPHQVTYFAERNLYPL 1044 Query: 2322 IVSFPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQ 2489 IVS PV KP+NQVLSSLVDQE G QIEN +Y V+EFEVRI+EP S GPWQ Sbjct: 1045 IVSVPVQKPVNQVLSSLVDQEVGHQIENHNLSSEEIHRTYSVDEFEVRILEP--SNGPWQ 1102 Query: 2490 TRATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNN 2669 +ATIPMQ+SENALTVR+V+L+N++T+ENETLLA+GTAYVQGEDVAARGR+LLFSV KN Sbjct: 1103 VKATIPMQTSENALTVRMVSLFNTSTKENETLLAVGTAYVQGEDVAARGRILLFSVVKNP 1162 Query: 2670 DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYV 2849 +NSQ LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+EL GVAF DAPPLYV Sbjct: 1163 ENSQILVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELTGVAFSDAPPLYV 1222 Query: 2850 VSMNIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVS 3029 VS+NIVKNFILLGDIHKSIYFL+WKEQG+QL+LL+KDF LDC +TEFLIDGSTLSL+VS Sbjct: 1223 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFASLDCFSTEFLIDGSTLSLVVS 1282 Query: 3030 DEQKNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDK 3209 DEQKN+Q+FYYAPK+ SWKGQKLLSRAEFH+GA VTKFMRLQ+L S D+ A P SDK Sbjct: 1283 DEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGALVTKFMRLQMLSPSLDRSGAAPVSDK 1342 Query: 3210 INRFGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRA 3389 NRF LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNP+SFRQFRS+G+A Sbjct: 1343 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPKSFRQFRSDGKA 1402 Query: 3390 HRPGPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 HRPGP++IVDCE+L +YEM+PLEEQ+EIAQQ+GTTR QI+SNLN+LTLGTSFL Sbjct: 1403 HRPGPESIVDCEMLSYYEMIPLEEQVEIAQQIGTTRAQILSNLNDLTLGTSFL 1455 >ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Cicer arietinum] Length = 1447 Score = 1789 bits (4633), Expect = 0.0 Identities = 885/1189 (74%), Positives = 1028/1189 (86%), Gaps = 7/1189 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 266 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVIGAN 325 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 TIHYHSQS+SC LALN++A+S D SQ+M RSSFNVELDAANATWL NDVA+LSTKTGELL Sbjct: 326 TIHYHSQSASCALALNSYAVSVDNSQEMPRSSFNVELDAANATWLLNDVALLSTKTGELL 385 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536 LTL+YDGR+VQRL+LSKS+A+VL+S +TTIG+S FFL SRLGDS+LVQF+ G+G L Sbjct: 386 LLTLIYDGRVVQRLDLSKSKASVLSSGVTTIGNSLFFLASRLGDSMLVQFSSGSGVSMLS 445 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 +KEEVGD + A AKR+RRS SD LQDMV+GEELSLYGS N SAQ+SFSF VRD Sbjct: 446 SNLKEEVGDFDVDASSAKRMRRSPSDTLQDMVSGEELSLYGSATNRTESAQKSFSFAVRD 505 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNG+LCVL++SIRPE+IT+ Sbjct: 506 SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGSLCVLRQSIRPEVITE 565 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 LPGCKG+WTVYHK+ RS DSSK+A D+DEYHAYLIISLE+RTMVL+TA+ L EVTE Sbjct: 566 VELPGCKGIWTVYHKSTRSLNADSSKLADDEDEYHAYLIISLESRTMVLETADLLSEVTE 625 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 +VDYY QG T+AAGNLFGRRRVIQVY GARILDG+FMTQ++SF NSE GSE + Sbjct: 626 SVDYYVQGKTLAAGNLFGRRRVIQVYERGARILDGSFMTQDVSFGASNSEANYGSESALA 685 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSIADPYVLL+M+DGS++LLVGDPS CT+S+ +P FESSK S+S CTLYHDKGPEPW Sbjct: 686 LSVSIADPYVLLKMSDGSVRLLVGDPSTCTISVTSPASFESSKGSVSTCTLYHDKGPEPW 745 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR+TSTDAWLSTG+GEAIDG+D A+ D GDIYC+VCYE+ LEIFDVPNFS V+SV+ F+ Sbjct: 746 LRKTSTDAWLSTGVGEAIDGTDGAAQDHGDIYCVVCYENDSLEIFDVPNFSCVFSVENFL 805 Query: 1617 SGKANLVDAIILD-PTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLF 1793 SGK++LVDA+ + P DS K + V + G+K+ + N+++VELAM RW G+H RPFLF Sbjct: 806 SGKSHLVDALTKEVPKDSQKGDKVSDGVVSQGRKDAL-NMKVVELAMQRWSGKHGRPFLF 864 Query: 1794 GILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYAR 1973 GILSDGT LCYH+Y+++S D SKVE++V++ + + S SRLRNLRF+RV LD +AR Sbjct: 865 GILSDGTTLCYHAYLYESPDGTSKVEDSVSA----GLSNSSVSRLRNLRFVRVPLDVHAR 920 Query: 1974 EETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHN 2153 EETS G CQ+I IFKN+G +G F+SGSRP+W ML RERLR+HPQLCDG+IVAFTVLHN Sbjct: 921 EETSNGPPCQQINIFKNIGSYEGFFLSGSRPAWVMLLRERLRVHPQLCDGSIVAFTVLHN 980 Query: 2154 VNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSF 2333 VNCNHG IYVTSQG LKICQLPS +YD YWPVQK+PLK TPHQVTYFA+KNLYPLIVS+ Sbjct: 981 VNCNHGLIYVTSQGVLKICQLPSGSNYDCYWPVQKVPLKATPHQVTYFAEKNLYPLIVSY 1040 Query: 2334 PVLKPLNQVLSSLVDQEAGQQIE----NDVSFEGSYPVEEFEVRIMEPDRSGGPWQTRAT 2501 PV KPLNQV+ +LVDQ+A Q E N+ Y +EEFEVRIMEP++SGGPWQ +AT Sbjct: 1041 PVPKPLNQVI-ALVDQDANQLTESQNLNNDEQSHLYTIEEFEVRIMEPEKSGGPWQLKAT 1099 Query: 2502 IPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQ 2681 IPMQSSENALTVR+VTL N++++ENETLLAIGTAYVQGEDVAARGR+LLFS+ KN DN Q Sbjct: 1100 IPMQSSENALTVRMVTLMNTSSKENETLLAIGTAYVQGEDVAARGRILLFSLGKNTDNPQ 1159 Query: 2682 TLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMN 2861 LVSEVYSKELKGAISALA+LQGHLL+ASGPKI LHKW G+ELNGVAFFD PPL+VVS+N Sbjct: 1160 NLVSEVYSKELKGAISALAALQGHLLVASGPKIILHKWTGTELNGVAFFDVPPLHVVSLN 1219 Query: 2862 IVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQK 3041 IVKNFIL+GD+HKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSDEQK Sbjct: 1220 IVKNFILIGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDEQK 1279 Query: 3042 NIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRF 3221 NIQ+FYYAPK+ SWKGQKLLSRAEFH+GAH+TKF+RLQ+L T SD+ + PGSDK NRF Sbjct: 1280 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHITKFLRLQMLST-SDKTGSGPGSDKTNRF 1338 Query: 3222 GLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPG 3401 LLFGTLDGSIGCI+PLDE+TFRRLQSLQ+KLVDAV H+AGLNPR+FR F SNG+AHRPG Sbjct: 1339 ALLFGTLDGSIGCIAPLDEITFRRLQSLQKKLVDAVPHVAGLNPRAFRLFHSNGKAHRPG 1398 Query: 3402 PDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 PD+IVDCELLCHYEML LEEQLEIA QVGTTR+QI+SNL++L+LGTSFL Sbjct: 1399 PDSIVDCELLCHYEMLQLEEQLEIAHQVGTTRSQILSNLSDLSLGTSFL 1447 >ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Fragaria vesca subsp. vesca] Length = 1439 Score = 1789 bits (4633), Expect = 0.0 Identities = 886/1188 (74%), Positives = 1019/1188 (85%), Gaps = 6/1188 (0%) Frame = +3 Query: 3 LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182 LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN Sbjct: 270 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAINLPHDAYKLLAVPSPIGGVLVISAN 329 Query: 183 TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362 +IHYHSQS+SC LALN++A S D SQ+M RSSF VELDAANA+WL+NDV +LSTKTGELL Sbjct: 330 SIHYHSQSASCALALNSYAGSVDSSQEMPRSSFTVELDAANASWLSNDVILLSTKTGELL 389 Query: 363 SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536 LTLVYDGR+V RL+LSKS+A+VLTS I T+G+S FFLGSRLGDSLLVQFT G G+ L Sbjct: 390 LLTLVYDGRVVHRLDLSKSKASVLTSGIATVGNSLFFLGSRLGDSLLVQFTSGVGASMLS 449 Query: 537 LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716 +K+EVGDIEG AP AKRLR SSSDALQDM++GEELSLYGS NNA SAQ SFSF VRD Sbjct: 450 ADLKDEVGDIEGDAPSAKRLRMSSSDALQDMISGEELSLYGSAQNNAESAQRSFSFAVRD 509 Query: 717 SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896 SL+N+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNGALCVL++SIRPEMIT+ Sbjct: 510 SLVNVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 569 Query: 897 ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076 +LPGCKG+WTVYHKN R +S DDEYHA+LIISLE RTMVL+TA++L EVT+ Sbjct: 570 VALPGCKGIWTVYHKNARGHNAESY-----DDEYHAFLIISLEARTMVLETADHLSEVTD 624 Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256 VDY+ QG T+AAGNLFGRRRV+Q+Y GARIL+G +MTQ+LSF NSE GSGSE V Sbjct: 625 KVDYFLQGRTIAAGNLFGRRRVVQIYERGARILEGYYMTQDLSFGASNSESGSGSESATV 684 Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436 SVSI DPYVLLRM+DG I+LLVGDPS+CTVS+ NP FESSK+ +SACTLYHD+GPEPW Sbjct: 685 LSVSIVDPYVLLRMSDGGIRLLVGDPSSCTVSVSNPAAFESSKKLVSACTLYHDEGPEPW 744 Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616 LR++STDAWLSTGI EAIDG HDQGD+YC++CYESG LEIFDVPNF+ V+SV+KFV Sbjct: 745 LRKSSTDAWLSTGIDEAIDG---VLHDQGDVYCVICYESGSLEIFDVPNFNCVFSVEKFV 801 Query: 1617 SGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796 SGK LVD + DP K +++++V+ +KE++QN+R+VEL M RW GQHSRPFLFG Sbjct: 802 SGKPLLVDTFMGDP----QKSQSSEEVSGLSRKEKLQNMRVVELTMQRWSGQHSRPFLFG 857 Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976 IL+DG I CYH+Y+++S D+ SK E + +SQ + + SRLRNLRF+RV LDTY+R Sbjct: 858 ILNDGMIFCYHAYLYESMDSTSKTEVSASSQ------NTTASRLRNLRFVRVPLDTYSRN 911 Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156 + S GT CQR+T+FKN+ G QGLF++GSRP+W M+FRER+R+HPQLCDG+IVAFTVLHNV Sbjct: 912 DLSNGTSCQRMTVFKNIAGNQGLFLAGSRPAWLMVFRERIRVHPQLCDGSIVAFTVLHNV 971 Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336 NCNHG IYVTS+G +KICQLPS SYDNYWPVQKIPLKGTPHQVTYFA+KNLYPLIVS P Sbjct: 972 NCNHGLIYVTSEGIMKICQLPSITSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSIP 1031 Query: 2337 VLKPLNQVLSSLVDQEAGQQIEND----VSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504 V KPLNQVLSSLVDQE Q+EN +Y V+EFEVRIMEP++SGGPWQTRATI Sbjct: 1032 VQKPLNQVLSSLVDQEFSHQVENHNLSPEELHRTYTVDEFEVRIMEPEKSGGPWQTRATI 1091 Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684 PMQ+SENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVA RGRVLLFS E N DN Q Sbjct: 1092 PMQTSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAGRGRVLLFSAENNVDNPQN 1151 Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864 LVSEV+SKELKGAISALASLQG+LLIASGPKI LHKW GS+L G+AFFD PPLYVVS+NI Sbjct: 1152 LVSEVFSKELKGAISALASLQGNLLIASGPKIILHKWTGSDLTGIAFFDVPPLYVVSLNI 1211 Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044 VKNFIL+GDIHKSIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL V+D QKN Sbjct: 1212 VKNFILIGDIHKSIYFLSWKEQGAQLNLLAKDFGNLDCFATEFLIDGSTLSLAVADAQKN 1271 Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224 IQ+ YYAPK+ SW+GQKLL+RAEFH+GAHVTKF+RLQ+L TSSD+ PGSDK R+ Sbjct: 1272 IQILYYAPKISESWRGQKLLTRAEFHVGAHVTKFLRLQMLSTSSDRTGKNPGSDKTVRYA 1331 Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404 LLFGTLDG IG I+PL+ELTFRRLQSLQ KLVDAV H+AGLNPRSFRQFRSNG+AHRPGP Sbjct: 1332 LLFGTLDGGIGSIAPLEELTFRRLQSLQNKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGP 1391 Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548 D+IVDCELL HYEML LEEQLEIAQQ+GTTR QI+SNL++L+LGTSFL Sbjct: 1392 DSIVDCELLFHYEMLSLEEQLEIAQQIGTTRLQILSNLDDLSLGTSFL 1439