BLASTX nr result
ID: Paeonia24_contig00002825
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00002825
         (5122 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation spec...  2432   0.0
emb|CBI24510.3| unnamed protein product [Vitis vinifera]             2426   0.0
ref|XP_007220310.1| hypothetical protein PRUPE_ppa000211mg [Prun...  2411   0.0
ref|XP_007038473.1| Cleavage and polyadenylation specificity fac...  2410   0.0
ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation spec...  2350   0.0
ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation spec...  2346   0.0
ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citr...  2342   0.0
ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citr...  2337   0.0
ref|XP_002510905.1| cleavage and polyadenylation specificity fac...  2313   0.0
gb|EXC20897.1| Cleavage and polyadenylation specificity factor s...  2302   0.0
ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation spec...  2302   0.0
ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation spec...  2268   0.0
ref|XP_007152397.1| hypothetical protein PHAVU_004G126600g [Phas...  2264   0.0
ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation spec...  2256   0.0
ref|XP_002318462.2| cleavage and polyadenylation specificity fac...  2255   0.0
ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation spec...  2222   0.0
ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arab...  2181   0.0
ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation spec...  2170   0.0
ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation spec...
2170 0.0 ref|NP_199979.2| cleavage and polyadenylation specificity factor... 2163 0.0 >ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Vitis vinifera] Length = 1442 Score = 2432 bits (6304), Expect = 0.0 Identities = 1208/1456 (82%), Positives = 1313/1456 (90%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531 MSYAAYKMMHWPTGIENCASGF+TH ADFAPQI +QTDDLES+WP +R IGP+PNLI+ Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60 Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351 TA N+LEVY++RVQE+D R+SR S E KRGGV+AGISGAALELVC YRLHGNVETM VL Sbjct: 61 TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120 Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171 GGGD RRRDSIILAFQDAKISVLEFDDS+HGLRTSSMH FEGPEW HLKRG ESFARG Sbjct: 121 SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180 Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991 PLVK DPQGRC+GVLVYGLQMIILK +QAG GLVGD++AL+SG AVSARVESSY+ISLRD Sbjct: 181 PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240 Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811 L MKHVKDF FVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS Sbjct: 241 LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300 Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631 A+NLPHDAYKLL VPSPIGGV+VISAN+IHYHSQSASCALA+NN+AV+AD+SQ+MPRSS Sbjct: 301 AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360 Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451 SVELD ANA WLSNDVAMLSTKTGELLLLTL YDGRVVHRLDL+KSRASVLTSGI IG+ Sbjct: 361 SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420 Query: 3450 SLFFLGSRLGDSLLVQYTCGVGASSGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVNGEE 3271 SLFFLGSRLGDSLLVQ+T + SS VKEEVGDIEGD PSAKRLR+SSSDALQD+VNGEE Sbjct: 421 
SLFFLGSRLGDSLLVQFTSIL--SSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEE 478 Query: 3270 LSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYELVC 3091 LSLYGSAPN+ E++QK FSF+VRDS INVGPLKDF+YGLRINADP A GIAKQSNYELVC Sbjct: 479 LSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYELVC 538 Query: 3090 CSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDEYHA 2911 CSGHGKNGALC+LQQSI PE+ITEVEL GC+GIWTVYHKNTRGHNADS+KM+ DDEYHA Sbjct: 539 CSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDEYHA 598 Query: 2910 YLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRILDGS 2731 YLIISLESRTMVLET D+LGEVTESVDYYVQG TI+AGNLFGRRRVVQV+ARG+RILDG+ Sbjct: 599 YLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARILDGA 658 Query: 2730 FMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSINIP 2551 FMTQDL I SESSTV SIADPYVLLRM+DG+IQLLVGDPSTCTVSINIP Sbjct: 659 FMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSINIP 708 Query: 2550 SVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYCVVC 2371 +VFESSKK IS CTLYHDKGPEPWLRK STDAWLSTG+GEAIDGADGA DQGDIYCVV Sbjct: 709 AVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYCVVS 768 Query: 2370 YESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQARKE 2191 YESG LEIFDVPNF+CVFSV FMSG +LVDT + EPS+D + + NSEE A Q RKE Sbjct: 769 YESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKE 828 Query: 2190 NTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQNSV 2011 N N+KVVE+ MQRWSG HS PFLFGILTDGTILCYHAYL+EGPE+T K EEA+S QNS+ Sbjct: 829 NAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSL 888 Query: 2010 NLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPAWFM 1831 ++++ SASRLRNLRFVRV LDTYTREE +G TS RMTVFKN+GG QGLFLSGSRP WFM Sbjct: 889 SISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFM 948 Query: 1830 VVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWPVQK 1651 V RER+RVHPQ+CDGSIVAFTVLHN+NCNHGLIYVTS+G LKICQLP+V+SYDNYWPVQK Sbjct: 949 VFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQK 1008 Query: 1650 
IPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSNDLHQT 1471 IPLKGTPHQVTYFAEKNLYPLIVSV VLKPLN V+SSLVDQEA HQ+ENDNLSS++LH++ Sbjct: 1009 IPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRS 1068 Query: 1470 YVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAIGTAY 1291 Y VDEFEVR+LEPEKSG PWQTRATIPMQSSENALTVRVVTL+N TTKENETLLAIGTAY Sbjct: 1069 YSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY 1128 Query: 1290 LQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGPKITL 1111 +QGEDVA RGRVLLFSVG+NTDN+QNLVSE++SKE KGAISA+ASLQGHLL+ASGPKI L Sbjct: 1129 VQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIIL 1188 Query: 1110 YKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLAKDYA 931 +KW TEL VAFFD PPL+VVSLNIVKNFILLGDIH+SIYFLSWKEQG QL+LLAKD+ Sbjct: 1189 HKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFG 1248 Query: 930 SLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAHVTKF 751 SLDCFATEFLIDGSTLSL+VSDDQKN+QIFYYAPK SESWKGQKLLSRAEFHVGAHVTKF Sbjct: 1249 SLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1308 Query: 750 QRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQRKLV 571 RLQML DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ+KLV Sbjct: 1309 LRLQML--PASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1366 Query: 570 DAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIGTTRS 391 DAV HVAGLNPRSFRQF+S+GKAH+PGPDNIVDCELLCHYEMLP EEQLEIA QIGTTR Sbjct: 1367 DAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRM 1426 Query: 390 QIISNLNDLSLGTSFL 343 QI+SNLNDLSLGTSFL Sbjct: 1427 QILSNLNDLSLGTSFL 1442 >emb|CBI24510.3| unnamed protein product [Vitis vinifera] Length = 1448 Score = 2426 bits (6287), Expect = 0.0 Identities = 1208/1462 (82%), Positives = 1313/1462 (89%), Gaps = 6/1462 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531 MSYAAYKMMHWPTGIENCASGF+TH ADFAPQI +QTDDLES+WP +R IGP+PNLI+ Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60 Query: 4530 
TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351 TA N+LEVY++RVQE+D R+SR S E KRGGV+AGISGAALELVC YRLHGNVETM VL Sbjct: 61 TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120 Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171 GGGD RRRDSIILAFQDAKISVLEFDDS+HGLRTSSMH FEGPEW HLKRG ESFARG Sbjct: 121 SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180 Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991 PLVK DPQGRC+GVLVYGLQMIILK +QAG GLVGD++AL+SG AVSARVESSY+ISLRD Sbjct: 181 PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240 Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811 L MKHVKDF FVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS Sbjct: 241 LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300 Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631 A+NLPHDAYKLL VPSPIGGV+VISAN+IHYHSQSASCALA+NN+AV+AD+SQ+MPRSS Sbjct: 301 AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360 Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451 SVELD ANA WLSNDVAMLSTKTGELLLLTL YDGRVVHRLDL+KSRASVLTSGI IG+ Sbjct: 361 SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420 Query: 3450 SLFFLGSRLGDSLLVQYTCGVGASSGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVNGEE 3271 SLFFLGSRLGDSLLVQ+T + SS VKEEVGDIEGD PSAKRLR+SSSDALQD+VNGEE Sbjct: 421 SLFFLGSRLGDSLLVQFTSIL--SSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEE 478 Query: 3270 LSLYGSAPNNAESAQ------KNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQS 3109 LSLYGSAPN+ E++Q K FSF+VRDS INVGPLKDF+YGLRINADP A GIAKQS Sbjct: 479 LSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQS 538 Query: 3108 NYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAV 2929 NYELVCCSGHGKNGALC+LQQSI PE+ITEVEL GC+GIWTVYHKNTRGHNADS+KM+ Sbjct: 539 NYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATK 598 Query: 2928 DDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGS 2749 
DDEYHAYLIISLESRTMVLET D+LGEVTESVDYYVQG TI+AGNLFGRRRVVQV+ARG+ Sbjct: 599 DDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGA 658 Query: 2748 RILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCT 2569 RILDG+FMTQDL I SESSTV SIADPYVLLRM+DG+IQLLVGDPSTCT Sbjct: 659 RILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCT 708 Query: 2568 VSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGD 2389 VSINIP+VFESSKK IS CTLYHDKGPEPWLRK STDAWLSTG+GEAIDGADGA DQGD Sbjct: 709 VSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGD 768 Query: 2388 IYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVA 2209 IYCVV YESG LEIFDVPNF+CVFSV FMSG +LVDT + EPS+D + + NSEE A Sbjct: 769 IYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEA 828 Query: 2208 GQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAI 2029 Q RKEN N+KVVE+ MQRWSG HS PFLFGILTDGTILCYHAYL+EGPE+T K EEA+ Sbjct: 829 DQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAV 888 Query: 2028 SGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGS 1849 S QNS+++++ SASRLRNLRFVRV LDTYTREE +G TS RMTVFKN+GG QGLFLSGS Sbjct: 889 SAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGS 948 Query: 1848 RPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDN 1669 RP WFMV RER+RVHPQ+CDGSIVAFTVLHN+NCNHGLIYVTS+G LKICQLP+V+SYDN Sbjct: 949 RPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDN 1008 Query: 1668 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSS 1489 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSV VLKPLN V+SSLVDQEA HQ+ENDNLSS Sbjct: 1009 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSS 1068 Query: 1488 NDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLL 1309 ++LH++Y VDEFEVR+LEPEKSG PWQTRATIPMQSSENALTVRVVTL+N TTKENETLL Sbjct: 1069 DELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1128 Query: 1308 AIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVAS 1129 AIGTAY+QGEDVA RGRVLLFSVG+NTDN+QNLVSE++SKE KGAISA+ASLQGHLL+AS Sbjct: 1129 
AIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIAS 1188 Query: 1128 GPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSL 949 GPKI L+KW TEL VAFFD PPL+VVSLNIVKNFILLGDIH+SIYFLSWKEQG QL+L Sbjct: 1189 GPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNL 1248 Query: 948 LAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVG 769 LAKD+ SLDCFATEFLIDGSTLSL+VSDDQKN+QIFYYAPK SESWKGQKLLSRAEFHVG Sbjct: 1249 LAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1308 Query: 768 AHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQS 589 AHVTKF RLQML DKTNRFALLF TLDGSIGCIAPLD LTFRRLQS Sbjct: 1309 AHVTKFLRLQML--PASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1366 Query: 588 LQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQ 409 LQ+KLVDAV HVAGLNPRSFRQF+S+GKAH+PGPDNIVDCELLCHYEMLP EEQLEIA Q Sbjct: 1367 LQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQ 1426 Query: 408 IGTTRSQIISNLNDLSLGTSFL 343 IGTTR QI+SNLNDLSLGTSFL Sbjct: 1427 IGTTRMQILSNLNDLSLGTSFL 1448 >ref|XP_007220310.1| hypothetical protein PRUPE_ppa000211mg [Prunus persica] gi|462416772|gb|EMJ21509.1| hypothetical protein PRUPE_ppa000211mg [Prunus persica] Length = 1459 Score = 2411 bits (6249), Expect = 0.0 Identities = 1180/1461 (80%), Positives = 1316/1461 (90%), Gaps = 5/1461 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWP-ARRGIGPVPNLI 4534 MS+AAYKMMHWPTGIENCASGFI+H +DF P+IP +QT+DLES+WP +RR IGP+P+L+ Sbjct: 1 MSFAAYKMMHWPTGIENCASGFISHSRSDFVPRIPPIQTEDLESEWPTSRREIGPIPDLV 60 Query: 4533 ITAGNVLEVYIIRVQEED-VRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAV 4357 +TAGNVLEVY++RVQEED R R SGE KRGG++ G+SGA+LELVCHYRLHGNV TMAV Sbjct: 61 VTAGNVLEVYVVRVQEEDGTRGPRASGEPKRGGLMDGVSGASLELVCHYRLHGNVVTMAV 120 Query: 4356 LSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFA 4177 LS GGGDG RRRDSIIL F+DAKISVLEFDDS+HGLRTSSMH FEGPEWLHL+RGRESFA Sbjct: 121 LSSGGGDGSRRRDSIILTFEDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLRRGRESFA 180 Query: 4176 
RGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISL 3997 RGPLVK DPQGRC +LVYGLQMIILK +Q GSGLVGDDD+ SGGA+S+R+ESSYI++L Sbjct: 181 RGPLVKVDPQGRCGSILVYGLQMIILKASQGGSGLVGDDDSFGSGGAISSRIESSYIVNL 240 Query: 3996 RDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLI 3817 RD+ MKHVKDF F+HGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLI Sbjct: 241 RDMDMKHVKDFTFLHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 300 Query: 3816 WSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRS 3637 WSA+NLPHDAYKLLAVPSPIGGVLVISAN+IHYHSQSASCALA+N++AV+AD+SQ+MPRS Sbjct: 301 WSAVNLPHDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSYAVSADNSQEMPRS 360 Query: 3636 SISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTI 3457 S +VELD ANA WL NDVA+LSTKTGELLLLTLVYDGRVV RLDL+KS+ASVLTSGIT + Sbjct: 361 SFTVELDTANATWLLNDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITKV 420 Query: 3456 GSSLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDI 3286 G+SLFFLGSRLGDSLLVQ+TCGVG S S +K+EVGDIEGDAP AKRLR SSSDALQD+ Sbjct: 421 GNSLFFLGSRLGDSLLVQFTCGVGGSVLSSDMKDEVGDIEGDAPLAKRLRMSSSDALQDM 480 Query: 3285 VNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSN 3106 V+GEELSLYGSAPNNAESAQK+FSFAVRDS+INVGPLKDFSYGLRINAD NA GIAKQSN Sbjct: 481 VSGEELSLYGSAPNNAESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 540 Query: 3105 YELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVD 2926 YELVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHKN RGHNADSSK++A D Sbjct: 541 YELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKNARGHNADSSKIAASD 600 Query: 2925 DEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSR 2746 DE+HAYLIISLE+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRVVQV+ RG+R Sbjct: 601 DEFHAYLIISLEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVVQVYERGAR 660 Query: 2745 ILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTV 2566 ILDGSFMTQDLS GT N+E G GSESSTV SI DPYVLLRM+DG I+LLVGDPS CTV Sbjct: 661 ILDGSFMTQDLSFGTSNSEMGSGSESSTVLSVSIVDPYVLLRMSDGGIRLLVGDPSLCTV 720 Query: 2565 SINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDI 2386 S 
+IP+ FESSKK IS CTLYHDKGPEPWLRK STDAWLSTG+ EAIDGADG HDQGD+ Sbjct: 721 STSIPAAFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIDEAIDGADGVSHDQGDV 780 Query: 2385 YCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAG 2206 YCVVCYESG+LEIFDVPNF+CVFSV F+SG +L+DT +R+P KDP+ N +SEEV+G Sbjct: 781 YCVVCYESGSLEIFDVPNFNCVFSVDKFVSGNAHLIDTLMRDPPKDPQKLINKSSEEVSG 840 Query: 2205 QARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026 Q RKEN +NMKVVE+ MQRWSG HS PFLFGIL DG ILCYHAYLFEGPE SK E++ S Sbjct: 841 QGRKENIQNMKVVELAMQRWSGQHSRPFLFGILNDGMILCYHAYLFEGPETASKTEDSAS 900 Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846 QN+ +++ SASRLRNLRFVRV LDTY +++T + QRMT+FKN+ GYQGLFLSGSR Sbjct: 901 AQNTTGVSNLSASRLRNLRFVRVPLDTYAKKDTSNETSCQRMTIFKNIAGYQGLFLSGSR 960 Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666 PAWFMV RERLR+HPQ+CDGS+VA TVLHNVNCNHGLIYVTS+G+LKICQLP +TSYDNY Sbjct: 961 PAWFMVFRERLRIHPQLCDGSVVAVTVLHNVNCNHGLIYVTSQGILKICQLPPITSYDNY 1020 Query: 1665 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSN 1486 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSV V KPLNQV+SSLVDQE HQ+EN NLSS+ Sbjct: 1021 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVLSSLVDQEVGHQVENHNLSSD 1080 Query: 1485 DLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLA 1306 +LH+TY VDEFE+RI+EP+KSGGPWQT+ATIPMQ+SENALTVRVVTL+N TTKENETLLA Sbjct: 1081 ELHRTYSVDEFEIRIMEPDKSGGPWQTKATIPMQTSENALTVRVVTLFNTTTKENETLLA 1140 Query: 1305 IGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASG 1126 IGTAY+QGEDVAGRGRVLLFS G++ DNTQ LVSEV+SKE KGAISALASLQGHLL+ASG Sbjct: 1141 IGTAYVQGEDVAGRGRVLLFSAGKSADNTQTLVSEVYSKELKGAISALASLQGHLLIASG 1200 Query: 1125 PKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLL 946 PKI L+KWN TEL VAFFDVPPL+VVSLNIVKNFILLGD+HKSIYFLSWKEQG QL+LL Sbjct: 1201 PKIILHKWNGTELNGVAFFDVPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLTLL 1260 Query: 945 AKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGA 766 AKD+ +LDCFATEFLIDGSTLSL+V+D+QKN+QIFYYAPK SESWKGQKLLSRAEFHVG Sbjct: 1261 
AKDFGNLDCFATEFLIDGSTLSLVVADEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGT 1320 Query: 765 HVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSL 586 HVTKF RLQML DKTNR+ALLF TLDGSIGCIAPLD LTFRRLQSL Sbjct: 1321 HVTKFLRLQML--STSSDRTGTNPGSDKTNRYALLFGTLDGSIGCIAPLDELTFRRLQSL 1378 Query: 585 QRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQI 406 Q+KLVDAVHHVAGLNPR+FRQF+S+GKAH+PGPD IVDCELL HYEMLPLEEQLEIA+QI Sbjct: 1379 QKKLVDAVHHVAGLNPRAFRQFQSNGKAHRPGPDTIVDCELLSHYEMLPLEEQLEIANQI 1438 Query: 405 GTTRSQIISNLNDLSLGTSFL 343 GTTRSQI SNLNDLS+GTSFL Sbjct: 1439 GTTRSQIFSNLNDLSIGTSFL 1459 >ref|XP_007038473.1| Cleavage and polyadenylation specificity factor 160 isoform 1 [Theobroma cacao] gi|508775718|gb|EOY22974.1| Cleavage and polyadenylation specificity factor 160 isoform 1 [Theobroma cacao] Length = 1457 Score = 2410 bits (6247), Expect = 0.0 Identities = 1184/1459 (81%), Positives = 1309/1459 (89%), Gaps = 3/1459 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531 MSYAAYKMMHWPTGIENCASGF+THC ADF PQIP QT+DLES+WPARRGIGPVPNLI+ Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHCRADFTPQIPLNQTEDLESEWPARRGIGPVPNLIV 60 Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351 TA N+LE+Y++RVQEE R++R S E KRGGV+ G+SG +LELVC+YRLHGNVE+MAVLS Sbjct: 61 TAANLLEIYVVRVQEEGRREARNSTEVKRGGVLDGVSGVSLELVCNYRLHGNVESMAVLS 120 Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171 IGGGDG RRRDSIILAF+DAKISVLEFDDS+HGLRT+SMH FEGPEWLHLKRGRESFARG Sbjct: 121 IGGGDGSRRRDSIILAFKDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFARG 180 Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991 PLVK DPQGRC GVLVY LQMIILK +QAGSG VG+DDA SGGAVSARVESSYII+LRD Sbjct: 181 PLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINLRD 240 Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811 L +KH+KDFIFVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS Sbjct: 241 LDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300 
Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631 A+NLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALA+NN+A++ D+SQD+PRS+ Sbjct: 301 AVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRSNF 360 Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451 SVELD ANA WL NDVA+LSTKTGELLLLTL+YDGRVV RLDL+KS+ASVLTS ITTIG+ Sbjct: 361 SVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTIGN 420 Query: 3450 SLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280 SLFFLGSRLGDSLLVQ++ G G S SG+KEEVGDIEGD P AKRLRRSSSDALQD+V Sbjct: 421 SLFFLGSRLGDSLLVQFSGGSGVSALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDMVG 480 Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100 GEELSLYGSAPNN ESAQK F FAVRDS+ NVGPLKDFSYGLRINAD NA GIAKQSNYE Sbjct: 481 GEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSNYE 540 Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920 LVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR H+AD SK++ DDE Sbjct: 541 LVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDDDDE 600 Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740 YHAYLIISLE+RTMVLET D+L EVTESVDYYVQG TIAAGNLFGRRRVVQV+ RG+RIL Sbjct: 601 YHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGARIL 660 Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560 DGSFMTQ+LSI +PN+ES GSE+STV SIADPYVLLRMTDGSI LLVGDP+TCTVSI Sbjct: 661 DGSFMTQELSIPSPNSESSPGSENSTVISVSIADPYVLLRMTDGSILLLVGDPATCTVSI 720 Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380 N P+ FE SKK++S CTLYHDKGPEPWLRKASTDAWLSTGVGE+IDGADG PHDQGDIYC Sbjct: 721 NTPTAFEGSKKMVSACTLYHDKGPEPWLRKASTDAWLSTGVGESIDGADGGPHDQGDIYC 780 Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200 VVCYESG LEIFDVPNF+CVFS+ F SG+ LVD E SKD + N +SEE+ GQ Sbjct: 781 VVCYESGALEIFDVPNFNCVFSMEKFASGRTRLVDAYTLESSKDSEKVINKSSEELTGQG 840 Query: 2199 RKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQ 2020 
RKEN +N+KVVE+ MQRWS HS PFLFGILTDGTILCYHAYLFEG EN SK+E+++ Q Sbjct: 841 RKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVAQ 900 Query: 2019 NSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPA 1840 NSV L++ +ASRLRNLRF+R+ LD YTREE G SQR+T+FKN+ GYQG FLSGSRPA Sbjct: 901 NSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRPA 960 Query: 1839 WFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWP 1660 WFMV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTS+G+LKICQ+PS ++YDNYWP Sbjct: 961 WFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYWP 1020 Query: 1659 VQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSNDL 1480 VQKIPL+GTPHQVTYFAE+NLYP+IVSV V KP+NQV+SSLVDQE HQ++N NLSS++L Sbjct: 1021 VQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDEL 1080 Query: 1479 HQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAIG 1300 +TY VDEFEVRILEPEKSGGPW+T+ATIPMQSSENALTVRVVTL+N TTKENE+LLAIG Sbjct: 1081 QRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAIG 1140 Query: 1299 TAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGPK 1120 TAY+QGEDVA RGRV+L S+GRNTDN QNLVSEV+SKE KGAISALASLQGHLL+ASGPK Sbjct: 1141 TAYIQGEDVAARGRVILCSIGRNTDNLQNLVSEVYSKELKGAISALASLQGHLLIASGPK 1200 Query: 1119 ITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLAK 940 I L+ W +EL +AF+D PPL+VVSLNIVKNFILLGD+HKSIYFLSWKEQG QLSLLAK Sbjct: 1201 IILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAK 1260 Query: 939 DYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAHV 760 D+ SLDCFATEFLIDGSTLSLMVSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHVGAHV Sbjct: 1261 DFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1320 Query: 759 TKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQR 580 TKF RLQML DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ+ Sbjct: 1321 TKFLRLQML--STSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1378 Query: 579 KLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIGT 400 KLVDAV HVAGLNPRSFRQF S+GKAH+PGPD+IVDCELLCHYEMLPLEEQL+IAHQIGT Sbjct: 1379 
KLVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIGT 1438 Query: 399 TRSQIISNLNDLSLGTSFL 343 TRSQI+SNLNDL+LGTSFL Sbjct: 1439 TRSQILSNLNDLTLGTSFL 1457 >ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like isoform X2 [Citrus sinensis] Length = 1457 Score = 2350 bits (6091), Expect = 0.0 Identities = 1152/1459 (78%), Positives = 1291/1459 (88%), Gaps = 3/1459 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531 MS+AAYKMMHWPTGI NC SGFITH AD+ PQIP +QT++L+S+ P++RGIGPVPNL++ Sbjct: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60 Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351 TA NV+E+Y++RVQEE ++S+ SGE KR ++ GIS A+LELVCHYRLHGNVE++A+LS Sbjct: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120 Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171 GG D RRRDSIILAF+DAKISVLEFDDS+HGLR +SMH FE PEWLHLKRGRESFARG Sbjct: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180 Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991 PLVK DPQGRC GVLVYGLQMIILK +Q GSGLVGD+D SGG SAR+ESS++I+LRD Sbjct: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240 Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811 L MKHVKDFIFVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS Sbjct: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300 Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631 A+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+AV+ DSSQ++PRSS Sbjct: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360 Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451 SVELD A+A WL NDVA+LSTKTG+L+LLT+VYDGRVV RLDL+K+ SVLTS ITTIG+ Sbjct: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420 Query: 3450 SLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280 
SLFFLGSRLGDSLLVQ+TCG G S SG+KEE GDIE DAPS KRLRRSSSDALQD+VN Sbjct: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480 Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100 GEELSLYGSA NN ESAQK FSFAVRDS++N+GPLKDFSYGLRINAD +A GI+KQSNYE Sbjct: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540 Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920 LVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK++RGHNADSS+M+A DDE Sbjct: 541 LVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 600 Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740 YHAYLIISLE+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRV+QVF RG+RIL Sbjct: 601 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 660 Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560 DGS+MTQDLS G N+ESG GSE+STV SIADPYVLL M+DGSI+LLVGDPSTCTVS+ Sbjct: 661 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 720 Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380 P+ ESSKK +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDGADG P DQGDIY Sbjct: 721 QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 780 Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200 VVCYESG LEIFDVPNF+CVF+V F+SG+ ++VDT +RE KD + N++SEE GQ Sbjct: 781 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 840 Query: 2199 RKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQ 2020 RKEN +MKVVE+ MQRWSG HS PFLF ILTDGTILCY AYLFEGPENTSK ++ +S Sbjct: 841 RKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 900 Query: 2019 NSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPA 1840 S+++++ SASRLRNLRF R+ LD YTREETP G QR+T+FKN+ G+QG FLSGSRP Sbjct: 901 RSLSVSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 960 Query: 1839 WFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWP 1660 W MV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTS+G+LKICQLPS ++YDNYWP Sbjct: 961 
WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 1020 Query: 1659 VQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSNDL 1480 VQKIPLK TPHQ+TYFAEKNLYPLIVSV VLKPLNQV+S L+DQE HQI+N NLSS DL Sbjct: 1021 VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1080 Query: 1479 HQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAIG 1300 H+TY V+E+EVRILEP+++GGPWQTRATIPMQSSENALTVRVVTL+N TTKENETLLAIG Sbjct: 1081 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIG 1140 Query: 1299 TAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGPK 1120 TAY+QGEDVA RGRVLLFS GRN DN QNLV+EV+SKE KGAISALASLQGHLL+ASGPK Sbjct: 1141 TAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1200 Query: 1119 ITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLAK 940 I L+KW TEL +AF+D PPL+VVSLNIVKNFILLGDIHKSIYFLSWKEQG QL+LLAK Sbjct: 1201 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1260 Query: 939 DYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAHV 760 D+ SLDCFATEFLIDGSTLSL+VSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHVGAHV Sbjct: 1261 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1320 Query: 759 TKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQR 580 TKF RLQML DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ+ Sbjct: 1321 TKFLRLQML--ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1378 Query: 579 KLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIGT 400 KLVD+V HVAGLNPRSFRQF S+GKAH+PGPD+IVDCELL HYEMLPLEEQLEIAHQ GT Sbjct: 1379 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1438 Query: 399 TRSQIISNLNDLSLGTSFL 343 TRSQI+SNLNDL+LGTSFL Sbjct: 1439 TRSQILSNLNDLALGTSFL 1457 >ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like isoform X1 [Citrus sinensis] Length = 1458 Score = 2346 bits (6079), Expect = 0.0 Identities = 1152/1460 (78%), Positives = 1291/1460 (88%), Gaps = 4/1460 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531 
MS+AAYKMMHWPTGI NC SGFITH AD+ PQIP +QT++L+S+ P++RGIGPVPNL++ Sbjct: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60 Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351 TA NV+E+Y++RVQEE ++S+ SGE KR ++ GIS A+LELVCHYRLHGNVE++A+LS Sbjct: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120 Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171 GG D RRRDSIILAF+DAKISVLEFDDS+HGLR +SMH FE PEWLHLKRGRESFARG Sbjct: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180 Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991 PLVK DPQGRC GVLVYGLQMIILK +Q GSGLVGD+D SGG SAR+ESS++I+LRD Sbjct: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240 Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811 L MKHVKDFIFVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS Sbjct: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300 Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631 A+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+AV+ DSSQ++PRSS Sbjct: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360 Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451 SVELD A+A WL NDVA+LSTKTG+L+LLT+VYDGRVV RLDL+K+ SVLTS ITTIG+ Sbjct: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420 Query: 3450 SLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280 SLFFLGSRLGDSLLVQ+TCG G S SG+KEE GDIE DAPS KRLRRSSSDALQD+VN Sbjct: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480 Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100 GEELSLYGSA NN ESAQK FSFAVRDS++N+GPLKDFSYGLRINAD +A GI+KQSNYE Sbjct: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540 Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920 LVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK++RGHNADSS+M+A DDE Sbjct: 541 
LVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 600 Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740 YHAYLIISLE+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRV+QVF RG+RIL Sbjct: 601 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 660 Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560 DGS+MTQDLS G N+ESG GSE+STV SIADPYVLL M+DGSI+LLVGDPSTCTVS+ Sbjct: 661 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 720 Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380 P+ ESSKK +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDGADG P DQGDIY Sbjct: 721 QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 780 Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200 VVCYESG LEIFDVPNF+CVF+V F+SG+ ++VDT +RE KD + N++SEE GQ Sbjct: 781 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 840 Query: 2199 RKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQ 2020 RKEN +MKVVE+ MQRWSG HS PFLF ILTDGTILCY AYLFEGPENTSK ++ +S Sbjct: 841 RKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 900 Query: 2019 NSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPA 1840 S+++++ SASRLRNLRF R+ LD YTREETP G QR+T+FKN+ G+QG FLSGSRP Sbjct: 901 RSLSVSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 960 Query: 1839 WFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWP 1660 W MV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTS+G+LKICQLPS ++YDNYWP Sbjct: 961 WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 1020 Query: 1659 VQK-IPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSND 1483 VQK IPLK TPHQ+TYFAEKNLYPLIVSV VLKPLNQV+S L+DQE HQI+N NLSS D Sbjct: 1021 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1080 Query: 1482 LHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAI 1303 LH+TY V+E+EVRILEP+++GGPWQTRATIPMQSSENALTVRVVTL+N TTKENETLLAI Sbjct: 1081 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1140 Query: 1302 
GTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGP 1123 GTAY+QGEDVA RGRVLLFS GRN DN QNLV+EV+SKE KGAISALASLQGHLL+ASGP Sbjct: 1141 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1200 Query: 1122 KITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLA 943 KI L+KW TEL +AF+D PPL+VVSLNIVKNFILLGDIHKSIYFLSWKEQG QL+LLA Sbjct: 1201 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1260 Query: 942 KDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAH 763 KD+ SLDCFATEFLIDGSTLSL+VSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHVGAH Sbjct: 1261 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1320 Query: 762 VTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQ 583 VTKF RLQML DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ Sbjct: 1321 VTKFLRLQML--ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1378 Query: 582 RKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIG 403 +KLVD+V HVAGLNPRSFRQF S+GKAH+PGPD+IVDCELL HYEMLPLEEQLEIAHQ G Sbjct: 1379 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1438 Query: 402 TTRSQIISNLNDLSLGTSFL 343 TTRSQI+SNLNDL+LGTSFL Sbjct: 1439 TTRSQILSNLNDLALGTSFL 1458 >ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] gi|557523633|gb|ESR35000.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] Length = 1457 Score = 2342 bits (6069), Expect = 0.0 Identities = 1149/1459 (78%), Positives = 1287/1459 (88%), Gaps = 3/1459 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531 MS+AAYKMMHWPTGI NC SGFITH AD+ PQIP +QT++L+S+ P++RGIGPVPNL++ Sbjct: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60 Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351 TA NV+E+Y++RVQEE ++S+ SGE KR ++ GIS A+LELVCHYRLHGNVE++A+LS Sbjct: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120 Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171 GG D RRRDSIILAF+DAKISVLEFDDS+HGLR +SMH FE 
PEWLHLKRGRESFARG Sbjct: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180 Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991 PLVK DPQGRC GVLVYGLQMIILK +Q GSGLVGD+D SGG SAR+ESS++I+LRD Sbjct: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240 Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811 L MKHVKDFIFVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS Sbjct: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300 Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631 A+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+AV+ DSSQ++PRSS Sbjct: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360 Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451 SVELD A+A WL NDVA+LSTKTG+L+LLT+VYDGRVV RLDL+K+ SVLTS ITTIG+ Sbjct: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420 Query: 3450 SLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280 SLFFLGSRLGDSLLVQ+TCG G S SG KEE GDIE DAPS KRLRRSSSDALQD+VN Sbjct: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGPKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480 Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100 GEELSLYGSA NN ESAQK FSFAVRDS++N+GPLKDFSYGLRINAD +A GI+KQSNYE Sbjct: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540 Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920 LVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK++RGHN DSS+M+A DDE Sbjct: 541 LVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNTDSSRMAAYDDE 600 Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740 YHAYLIISLE+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRV+QVF RG+RIL Sbjct: 601 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 660 Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560 DGS+MTQDLS G N+ESG GSE+STV SIADPYVLL M+DGSI+LLVGDPSTCTVS+ Sbjct: 661 
DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 720 Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380 P+ ESSKK +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDGADG P DQGDIY Sbjct: 721 QTPAAIESSKKPVSACTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 780 Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200 VVCYESG LEIFDVPNF+CVF+V F+SG+ ++VDT +RE KD + N++SEE GQ Sbjct: 781 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 840 Query: 2199 RKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQ 2020 RKEN +MKVVE+ MQRWSG HS PFLF ILTDGTILCY AYLFEG ENTSK ++ +S Sbjct: 841 RKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKSDDPVSTS 900 Query: 2019 NSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPA 1840 S+++++ SASRLRNLRF R LD YTREETP G QR+T+FKN+ G+QG FLSGSRP Sbjct: 901 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 960 Query: 1839 WFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWP 1660 W MV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTS+G+LKICQLPS ++YDNYWP Sbjct: 961 WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 1020 Query: 1659 VQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSNDL 1480 VQKIPLK TPHQ+TYFAEKNLYPLIVSV VLKPLNQV+S L+DQE HQI+N NLSS DL Sbjct: 1021 VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1080 Query: 1479 HQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAIG 1300 H+TY V+E+EVRILEP+++GGPWQTRATIPMQSSENALTVRVVTL+N TTKEN+TLLAIG Sbjct: 1081 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENDTLLAIG 1140 Query: 1299 TAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGPK 1120 TAY+QGEDVA RGRVLLFS GRN DN QNLV+EV+SKE KGAISALASLQGHLL+ASGPK Sbjct: 1141 TAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1200 Query: 1119 ITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLAK 940 I L+KW TEL +AF+D PPL+VVSLNIVKNFILLGDIHKSIYFLSWKEQG QL+LLAK Sbjct: 1201 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1260 Query: 939 
DYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAHV 760 D+ SLDCFATEFLIDGSTLSL+VSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHVGAHV Sbjct: 1261 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1320 Query: 759 TKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQR 580 TKF RLQML DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ+ Sbjct: 1321 TKFLRLQML--ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1378 Query: 579 KLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIGT 400 KLVD+V HVAGLNPRSFRQF S+GKAH+PGPD+IVDCELL HYEMLPLEEQLEIAHQ GT Sbjct: 1379 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1438 Query: 399 TRSQIISNLNDLSLGTSFL 343 TRSQI+SNLNDL+LGTSFL Sbjct: 1439 TRSQILSNLNDLALGTSFL 1457 >ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] gi|557523632|gb|ESR34999.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] Length = 1458 Score = 2337 bits (6057), Expect = 0.0 Identities = 1149/1460 (78%), Positives = 1287/1460 (88%), Gaps = 4/1460 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531 MS+AAYKMMHWPTGI NC SGFITH AD+ PQIP +QT++L+S+ P++RGIGPVPNL++ Sbjct: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60 Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351 TA NV+E+Y++RVQEE ++S+ SGE KR ++ GIS A+LELVCHYRLHGNVE++A+LS Sbjct: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120 Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171 GG D RRRDSIILAF+DAKISVLEFDDS+HGLR +SMH FE PEWLHLKRGRESFARG Sbjct: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180 Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991 PLVK DPQGRC GVLVYGLQMIILK +Q GSGLVGD+D SGG SAR+ESS++I+LRD Sbjct: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240 Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811 L MKHVKDFIFVHGYIEPVMVILHERELTW+GR+SWKHHTC 
ISALSISTTLKQHPLIWS Sbjct: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300 Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631 A+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+AV+ DSSQ++PRSS Sbjct: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360 Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451 SVELD A+A WL NDVA+LSTKTG+L+LLT+VYDGRVV RLDL+K+ SVLTS ITTIG+ Sbjct: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420 Query: 3450 SLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280 SLFFLGSRLGDSLLVQ+TCG G S SG KEE GDIE DAPS KRLRRSSSDALQD+VN Sbjct: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGPKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480 Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100 GEELSLYGSA NN ESAQK FSFAVRDS++N+GPLKDFSYGLRINAD +A GI+KQSNYE Sbjct: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540 Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920 LVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK++RGHN DSS+M+A DDE Sbjct: 541 LVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNTDSSRMAAYDDE 600 Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740 YHAYLIISLE+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRV+QVF RG+RIL Sbjct: 601 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 660 Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560 DGS+MTQDLS G N+ESG GSE+STV SIADPYVLL M+DGSI+LLVGDPSTCTVS+ Sbjct: 661 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 720 Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380 P+ ESSKK +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDGADG P DQGDIY Sbjct: 721 QTPAAIESSKKPVSACTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 780 Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200 VVCYESG LEIFDVPNF+CVF+V F+SG+ ++VDT +RE KD + N++SEE GQ Sbjct: 781 
VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 840 Query: 2199 RKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQ 2020 RKEN +MKVVE+ MQRWSG HS PFLF ILTDGTILCY AYLFEG ENTSK ++ +S Sbjct: 841 RKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKSDDPVSTS 900 Query: 2019 NSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPA 1840 S+++++ SASRLRNLRF R LD YTREETP G QR+T+FKN+ G+QG FLSGSRP Sbjct: 901 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 960 Query: 1839 WFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWP 1660 W MV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTS+G+LKICQLPS ++YDNYWP Sbjct: 961 WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 1020 Query: 1659 VQK-IPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSND 1483 VQK IPLK TPHQ+TYFAEKNLYPLIVSV VLKPLNQV+S L+DQE HQI+N NLSS D Sbjct: 1021 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1080 Query: 1482 LHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAI 1303 LH+TY V+E+EVRILEP+++GGPWQTRATIPMQSSENALTVRVVTL+N TTKEN+TLLAI Sbjct: 1081 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENDTLLAI 1140 Query: 1302 GTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGP 1123 GTAY+QGEDVA RGRVLLFS GRN DN QNLV+EV+SKE KGAISALASLQGHLL+ASGP Sbjct: 1141 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1200 Query: 1122 KITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLA 943 KI L+KW TEL +AF+D PPL+VVSLNIVKNFILLGDIHKSIYFLSWKEQG QL+LLA Sbjct: 1201 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1260 Query: 942 KDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAH 763 KD+ SLDCFATEFLIDGSTLSL+VSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHVGAH Sbjct: 1261 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1320 Query: 762 VTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQ 583 VTKF RLQML DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ Sbjct: 1321 VTKFLRLQML--ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1378 Query: 582 
RKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIG 403 +KLVD+V HVAGLNPRSFRQF S+GKAH+PGPD+IVDCELL HYEMLPLEEQLEIAHQ G Sbjct: 1379 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1438 Query: 402 TTRSQIISNLNDLSLGTSFL 343 TTRSQI+SNLNDL+LGTSFL Sbjct: 1439 TTRSQILSNLNDLALGTSFL 1458 >ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative [Ricinus communis] gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative [Ricinus communis] Length = 1461 Score = 2313 bits (5995), Expect = 0.0 Identities = 1146/1463 (78%), Positives = 1286/1463 (87%), Gaps = 7/1463 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWP-ARRGIGPVPNLI 4534 MSYAAYKM+HWPTGIE+CASG+ITH ADF PQIP +QTD+L+S+WP ++RGIGP+PNLI Sbjct: 1 MSYAAYKMLHWPTGIESCASGYITHSRADFVPQIPPIQTDNLDSEWPPSKRGIGPMPNLI 60 Query: 4533 ITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVL 4354 +TAG+VLEVY++RVQE+ R+SR S E KRGG++ G+SGA+LELVCHYRLHGNVE+M VL Sbjct: 61 VTAGSVLEVYVVRVQEDGSRESRSSRETKRGGLMDGVSGASLELVCHYRLHGNVESMVVL 120 Query: 4353 SIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFAR 4174 GGD RRRDSIILAF+DAKISVLEFDDS+HGLRTSSMH FEGPEWLHLKRGRESFAR Sbjct: 121 PTEGGDSSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFAR 180 Query: 4173 GPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLR 3994 GPL+K DPQGRC G+LVY +QMIIL+ AQA SGLVGDDDAL+SGG++SARV+SSY+I+LR Sbjct: 181 GPLLKVDPQGRCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINLR 240 Query: 3993 DLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIW 3814 D+ MKHVKDFIF+H YIEPV+VILHERELTW+GR+SWKHHTC ISALSISTTLKQ LIW Sbjct: 241 DMDMKHVKDFIFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLIW 300 Query: 3813 SAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSS 3634 S +NLPHDAYKLLAVP PIGGVLVI ANTIHYHS+SA+ ALA+NN+AV+ DSSQ++PR+S Sbjct: 301 SVVNLPHDAYKLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRAS 360 Query: 3633 
ISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIG 3454 SVELD AAWL NDVA+LS K GELLLL+LVYDGRVV RLDL+KS+ASVLTS ITTIG Sbjct: 361 FSVELDAVKAAWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTIG 420 Query: 3453 SSLFFLGSRLGDSLLVQYTCGVG---ASSGVKEEVGDIEGDAPSAKRLRRSSSDALQDIV 3283 +SLFFLGSRLGDSLLVQ+T G+G SSG+KEEVG+IEGD PSAKRL+RS+SD LQD+V Sbjct: 421 NSLFFLGSRLGDSLLVQFTNGLGPSVVSSGLKEEVGEIEGDVPSAKRLKRSASDGLQDMV 480 Query: 3282 NGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNY 3103 +GEELSLYGS NN ESAQK+FSFAVRDS+INVGPLKDFSYGLR N D +A GIAKQSNY Sbjct: 481 SGEELSLYGSTANNTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSNY 540 Query: 3102 ELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDD 2923 +LVCCSGHGKNG LC+L+QSI PE+ITEV+L GCRGIWTVYHKN RGHN D SKM+A D Sbjct: 541 DLVCCSGHGKNGTLCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAAD 600 Query: 2922 EYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRI 2743 EYHAYLIIS+E+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRV+QVF RG+RI Sbjct: 601 EYHAYLIISMEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 660 Query: 2742 LDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVS 2563 LDGSFMTQDLSIG+ N+ES GSES+TVS SIADPYVL++MTDGSI+LL+GD STC VS Sbjct: 661 LDGSFMTQDLSIGSSNSESSPGSESATVSSVSIADPYVLIKMTDGSIRLLIGDSSTCMVS 720 Query: 2562 INIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGA---DGAPHDQG 2392 IN PS FE+S++ +S CTLYHDKGPEPWLRKASTDAWLSTGV EAIDGA DG PHDQG Sbjct: 721 INTPSAFENSERSVSACTLYHDKGPEPWLRKASTDAWLSTGVSEAIDGAESADGGPHDQG 780 Query: 2391 DIYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEV 2212 DIYC+VCYESG LEIFDVPNF+ VFSV F+SGK +L D +REP KD + TN SEEV Sbjct: 781 DIYCIVCYESGALEIFDVPNFNRVFSVDKFVSGKTHLADAYVREPPKDSQEKTNRISEEV 840 Query: 2211 AGQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEA 2032 AG RKEN NMK VE+ MQRWSG HS PFLFG+LTDGTILCYHAYLFE P+ TSK E++ Sbjct: 841 AGLGRKENAHNMKAVELAMQRWSGHHSRPFLFGVLTDGTILCYHAYLFEAPDATSKTEDS 900 Query: 2031 ISGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSG 1852 +S QN V L S 
SASRLRNLRFVRV LD+Y +EET T + QR+T+F N+ G+QG FL G Sbjct: 901 VSAQNPVGLGSISASRLRNLRFVRVPLDSYIKEETSTENSCQRITIFNNISGHQGFFLLG 960 Query: 1851 SRPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYD 1672 SRPAWFMV RERLRVHPQ+CDGSIVAFTVLHNVNCNHGLIYVTS+G LKICQLPS ++YD Sbjct: 961 SRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSFSNYD 1020 Query: 1671 NYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLS 1492 NYWPVQKIPLKGTPHQVTYF EKNLYPLIVSV V KP+NQV+SSLVDQE HQIEN NLS Sbjct: 1021 NYWPVQKIPLKGTPHQVTYFPEKNLYPLIVSVPVHKPVNQVLSSLVDQEVGHQIENHNLS 1080 Query: 1491 SNDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETL 1312 S++L QTY V+EFEVRILE E GGPWQT+ATIPMQSSENALTVRVVTL+NATTKENETL Sbjct: 1081 SDELLQTYSVEEFEVRILESENGGGPWQTKATIPMQSSENALTVRVVTLFNATTKENETL 1140 Query: 1311 LAIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVA 1132 LAIGTAY+QGEDVA RGRVLLFSV ++T+N+Q LVSEV+SKE KGAISALASLQGHLL+A Sbjct: 1141 LAIGTAYVQGEDVAARGRVLLFSVVKSTENSQVLVSEVYSKELKGAISALASLQGHLLIA 1200 Query: 1131 SGPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLS 952 SGPKI L+KW TEL VAF+D PPL+V S+NIVKNFILLGDIHKSIYFLSWKEQG QLS Sbjct: 1201 SGPKIILHKWTGTELNGVAFYDAPPLYVASMNIVKNFILLGDIHKSIYFLSWKEQGAQLS 1260 Query: 951 LLAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHV 772 LLAKD+ SLDCFATEFLIDGSTLSL+VSD+QKN+QIFYYAPK ESWKGQKLLSRAEFHV Sbjct: 1261 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMLESWKGQKLLSRAEFHV 1320 Query: 771 GAHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQ 592 GAH+TKF RL ML DKTNRFALLF TLDGSIGCIAPLD LTFRRLQ Sbjct: 1321 GAHITKFIRLSML--STSSDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1378 Query: 591 SLQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAH 412 SLQRKLVDAV HVAGLNPRSFRQF+S GK H+PGP++IVDCELL H+EMLPLEEQLEIA Sbjct: 1379 SLQRKLVDAVPHVAGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEEQLEIAQ 1438 Query: 411 QIGTTRSQIISNLNDLSLGTSFL 343 Q+GTTR+QI+SNLNDLSLGTSFL Sbjct: 1439 QVGTTRAQILSNLNDLSLGTSFL 1461 >gb|EXC20897.1| Cleavage and polyadenylation specificity factor 
subunit 1 [Morus notabilis] Length = 1479 Score = 2302 bits (5965), Expect = 0.0 Identities = 1148/1489 (77%), Positives = 1292/1489 (86%), Gaps = 33/1489 (2%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPA-RRGIGPVPNLI 4534 MS+AAYKMMHWPTGIENCA+GF++H ADF P+IP +Q+DDL+SDWPA RR GPVPNL+ Sbjct: 1 MSFAAYKMMHWPTGIENCAAGFVSHSRADFVPRIPPIQSDDLDSDWPAGRRETGPVPNLV 60 Query: 4533 ITAGNVLEVYIIRVQEED-VRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAV 4357 +TAGNVLEVY++R+QEED R SR E++RGG++ G+SGA+LELVCHYRLHGNV+T+AV Sbjct: 61 VTAGNVLEVYVVRLQEEDDTRSSRAPAESRRGGLMDGLSGASLELVCHYRLHGNVQTIAV 120 Query: 4356 LSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFA 4177 LS GGGDG RRRDSIIL+FQDAKISVLEFDDS+HGLRTSSMH FEGPEWL+LKRGRESFA Sbjct: 121 LSSGGGDGSRRRDSIILSFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWLYLKRGRESFA 180 Query: 4176 RGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISL 3997 RGPLVK DPQGRCAGVL Y +QMI+LK AQAGSGLVG++DAL SGGAVSAR+ESSYII+L Sbjct: 181 RGPLVKVDPQGRCAGVLAYNIQMIMLKAAQAGSGLVGEEDALGSGGAVSARIESSYIINL 240 Query: 3996 RDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLI 3817 RDL MKH+KDF+FVHGYIEPVMVILHERELTW+GR+ WKHHTC ISALSISTTLKQHPLI Sbjct: 241 RDLDMKHIKDFVFVHGYIEPVMVILHERELTWAGRVLWKHHTCMISALSISTTLKQHPLI 300 Query: 3816 WSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRS 3637 WSA+NLPHDAYKLLAVPSPIGGVLVI ANT+HY SQS SC LA+N++AV+ DSSQ+M R+ Sbjct: 301 WSAVNLPHDAYKLLAVPSPIGGVLVICANTLHYQSQSNSCTLALNSYAVSVDSSQEMRRA 360 Query: 3636 SISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTI 3457 SVELD ANA WLSNDV +LSTK GELLLLTLVYDGRVV RLDL+KS+ASVLTSGITTI Sbjct: 361 PFSVELDAANATWLSNDVVLLSTKAGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITTI 420 Query: 3456 GSSLFFLGSRLGDSLLVQYTCGVGA---SSGVKEEVGDIEGDAPSAKRLRRSSSDALQDI 3286 G+SLFFLGSRLGDSLLVQ+T G+G SSG+K+EVGDIEGDA AKRLRRSSSD LQD+ Sbjct: 421 GNSLFFLGSRLGDSLLVQFTYGLGTSMLSSGLKDEVGDIEGDAHLAKRLRRSSSDVLQDM 480 Query: 3285 VNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSN 3106 +GEELSLY SAPNN+ES QK+FSF 
VRDS++NVGPLKDFSYGLRINADPNA G+AKQSN Sbjct: 481 TSGEELSLYVSAPNNSESTQKSFSFTVRDSLVNVGPLKDFSYGLRINADPNATGVAKQSN 540 Query: 3105 YELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVD 2926 YELVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR H DSSK+ A D Sbjct: 541 YELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSTRSH--DSSKLVAAD 598 Query: 2925 DEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSR 2746 DEYHAYLIISLE+RTMVLET D+L EVTESVDYYVQG TIAAGNLFGRRRVVQV+ RG+R Sbjct: 599 DEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGAR 658 Query: 2745 ILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTV 2566 ILDGSFMTQDLS G +ES GSE++ V+ SIADPYV+LRM+DGSI+LLVGDP++CTV Sbjct: 659 ILDGSFMTQDLSFGPAPSESSSGSENAVVTSVSIADPYVVLRMSDGSIRLLVGDPTSCTV 718 Query: 2565 SINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDI 2386 S++ P+ FESSK +IS CTLY DKGPEPWLRK STDAWLSTGV EAIDGAD DQGDI Sbjct: 719 SVSTPADFESSKSIISACTLYRDKGPEPWLRKTSTDAWLSTGVDEAIDGADETLQDQGDI 778 Query: 2385 YCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAG 2206 YCVVCYESG+L+I+DVP+F+ VFSV NF+SG+P+LVD ++E KD + ATN NSEE AG Sbjct: 779 YCVVCYESGSLDIYDVPSFNYVFSVDNFISGRPHLVDAFVQEQPKDLQKATNKNSEESAG 838 Query: 2205 QARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026 Q RKEN +NMK+VE+ MQRWSG HS PFL GILTDG+ILCYHAYLFEGPE+TS+ E+++S Sbjct: 839 QGRKENVQNMKIVELAMQRWSGKHSRPFLLGILTDGSILCYHAYLFEGPESTSRTEDSVS 898 Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846 +NS S SRLRNLRFVRV LD+Y REET G+ QR++VFKN+ GYQGLFLSGSR Sbjct: 899 SRNS------SGSRLRNLRFVRVPLDSYAREETSDGMPCQRISVFKNIAGYQGLFLSGSR 952 Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666 PAWFMV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTSEG+LKICQLPS+TSYDNY Sbjct: 953 PAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSEGILKICQLPSITSYDNY 1012 Query: 1665 WPVQK-IPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSS 1489 WPVQK IPLKGTPHQVTYFAE+NLYPLIVSV V KPLNQV+SSL+DQE HQ EN NLS Sbjct: 1013 
WPVQKVIPLKGTPHQVTYFAERNLYPLIVSVPVPKPLNQVMSSLLDQEVGHQFENPNLSP 1072 Query: 1488 NDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLL 1309 +DL++TY +DEFEVRILEPE+SGGPWQT+ TIPMQSSENALT+RVVTL+N TT ENETLL Sbjct: 1073 DDLNRTYTIDEFEVRILEPERSGGPWQTKVTIPMQSSENALTIRVVTLFNTTTNENETLL 1132 Query: 1308 AIGTAYLQGEDVAGRGRVLLFSVGR---------------------------NTDNTQNL 1210 AIGTAY+QGEDVA RGR++L ++ ++ + Sbjct: 1133 AIGTAYVQGEDVAARGRIILRALAPWWERLHLHPGSRVQIPEMASPSGVFKIDSADFHLQ 1192 Query: 1209 VSEVFSKEYKGAISALASLQGHLLVASGPKITLYKWNATELTPVAFFDVPPLHVVSLNIV 1030 VSE++SKE KGAISALASLQGHLL+ASGPKI L+KW TEL +AFFD PPL+VVSLNIV Sbjct: 1193 VSEIYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFFDAPPLYVVSLNIV 1252 Query: 1029 KNFILLGDIHKSIYFLSWKEQGCQLSLLAKDYASLDCFATEFLIDGSTLSLMVSDDQKNV 850 KNFIL+GD+HKSIYFLSWKEQG QLSLLAKD+ SLDCFATEFLIDGSTLSL+VSDDQKN+ Sbjct: 1253 KNFILIGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLVVSDDQKNI 1312 Query: 849 QIFYYAPKQSESWKGQKLLSRAEFHVGAHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRF 670 QIFYYAPK SESWKGQ+LLSRAEFHVGAHVTKF RLQML DKTNRF Sbjct: 1313 QIFYYAPKMSESWKGQRLLSRAEFHVGAHVTKFLRLQML--PTSTDRTGSTPGSDKTNRF 1370 Query: 669 ALLFATLDGSIGCIAPLDFLTFRRLQSLQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPG 490 ALLF LDGSIGCIAPLD LTFRRLQSLQ+KLVDAV HVAGLNPRSFRQF S+GKAH+PG Sbjct: 1371 ALLFGALDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFCSNGKAHRPG 1430 Query: 489 PDNIVDCELLCHYEMLPLEEQLEIAHQIGTTRSQIISNLNDLSLGTSFL 343 PD+IVDCELLCHYEMLPLEEQLEIAH IGTTRSQI+SNLNDL LGTSFL Sbjct: 1431 PDSIVDCELLCHYEMLPLEEQLEIAHLIGTTRSQILSNLNDLFLGTSFL 1479 >ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Fragaria vesca subsp. 
vesca] Length = 1439 Score = 2302 bits (5965), Expect = 0.0 Identities = 1147/1461 (78%), Positives = 1285/1461 (87%), Gaps = 5/1461 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPA-RRGIGPVPNLI 4534 MSYAA+KMMHWPTGIENCA+GFITH ADF P+IP +QTDDL+SDWPA RR IGPVPNL+ Sbjct: 1 MSYAAHKMMHWPTGIENCAAGFITHSRADFVPRIPQIQTDDLDSDWPAPRREIGPVPNLV 60 Query: 4533 ITAGNVLEVYIIRVQEEDV-RDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAV 4357 +TA NVLEVY++RVQE+D R SR SGE+KRGG++ G++GA+LELVCHYRLHGNV TMAV Sbjct: 61 VTAANVLEVYVVRVQEQDTARGSRASGESKRGGLMDGVAGASLELVCHYRLHGNVMTMAV 120 Query: 4356 LSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFA 4177 LS GGGDG +RRD+IIL F+DAKISVLEFDDS+HGLRTSSMH FEGPEWLHL+RGRESFA Sbjct: 121 LSSGGGDGSKRRDAIILTFEDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLRRGRESFA 180 Query: 4176 RGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISL 3997 RGP VK DPQGRC GVLVY LQ+IILK AQ G GLVGDDD SG A+SARVESSYIISL Sbjct: 181 RGPSVKVDPQGRCGGVLVYDLQLIILKAAQGGYGLVGDDDGFASGAAISARVESSYIISL 240 Query: 3996 RDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLI 3817 RD+ MKHVKDF FVHGYIEPV+VILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLI Sbjct: 241 RDMDMKHVKDFTFVHGYIEPVLVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 300 Query: 3816 WSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRS 3637 WSAINLPHDAYKLLAVPSPIGGVLVISAN+IHYHSQSASCALA+N++A + DSSQ+MPRS Sbjct: 301 WSAINLPHDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSYAGSVDSSQEMPRS 360 Query: 3636 SISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTI 3457 S +VELD ANA+WLSNDV +LSTKTGELLLLTLVYDGRVVHRLDL+KS+ASVLTSGI T+ Sbjct: 361 SFTVELDAANASWLSNDVILLSTKTGELLLLTLVYDGRVVHRLDLSKSKASVLTSGIATV 420 Query: 3456 GSSLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDI 3286 G+SLFFLGSRLGDSLLVQ+T GVGAS + +K+EVGDIEGDAPSAKRLR SSSDALQD+ Sbjct: 421 GNSLFFLGSRLGDSLLVQFTSGVGASMLSADLKDEVGDIEGDAPSAKRLRMSSSDALQDM 480 Query: 3285 VNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSN 3106 ++GEELSLYGSA NNAESAQ++FSFAVRDS++NVGPLKDFSYGLRINAD NA GIAKQSN 
Sbjct: 481 ISGEELSLYGSAQNNAESAQRSFSFAVRDSLVNVGPLKDFSYGLRINADANATGIAKQSN 540 Query: 3105 YELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVD 2926 YELVCCSGHGKNGALCVL+QSI PE+ITEV L GC+GIWTVYHKN RGHNA+S D Sbjct: 541 YELVCCSGHGKNGALCVLRQSIRPEMITEVALPGCKGIWTVYHKNARGHNAES-----YD 595 Query: 2925 DEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSR 2746 DEYHA+LIISLE+RTMVLET D L EVT+ VDY++QG TIAAGNLFGRRRVVQ++ RG+R Sbjct: 596 DEYHAFLIISLEARTMVLETADHLSEVTDKVDYFLQGRTIAAGNLFGRRRVVQIYERGAR 655 Query: 2745 ILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTV 2566 IL+G +MTQDLS G N+ESG GSES+TV SI DPYVLLRM+DG I+LLVGDPS+CTV Sbjct: 656 ILEGYYMTQDLSFGASNSESGSGSESATVLSVSIVDPYVLLRMSDGGIRLLVGDPSSCTV 715 Query: 2565 SINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDI 2386 S++ P+ FESSKKL+S CTLYHD+GPEPWLRK+STDAWLSTG+ EAIDG HDQGD+ Sbjct: 716 SVSNPAAFESSKKLVSACTLYHDEGPEPWLRKSSTDAWLSTGIDEAIDGV---LHDQGDV 772 Query: 2385 YCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAG 2206 YCV+CYESG+LEIFDVPNF+CVFSV F+SGKP LVDT + +P K + +SEEV+G Sbjct: 773 YCVICYESGSLEIFDVPNFNCVFSVEKFVSGKPLLVDTFMGDPQK------SQSSEEVSG 826 Query: 2205 QARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026 +RKE +NM+VVE+TMQRWSG HS PFLFGIL DG I CYHAYL+E ++TSK E + S Sbjct: 827 LSRKEKLQNMRVVELTMQRWSGQHSRPFLFGILNDGMIFCYHAYLYESMDSTSKTEVSAS 886 Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846 QN T+ASRLRNLRFVRV LDTY+R + G + QRMTVFKN+ G QGLFL+GSR Sbjct: 887 SQN------TTASRLRNLRFVRVPLDTYSRNDLSNGTSCQRMTVFKNIAGNQGLFLAGSR 940 Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666 PAW MV RER+RVHPQ+CDGSIVAFTVLHNVNCNHGLIYVTSEG++KICQLPS+TSYDNY Sbjct: 941 PAWLMVFRERIRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSEGIMKICQLPSITSYDNY 1000 Query: 1665 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSN 1486 WPVQKIPLKGTPHQVTYFAEKNLYPLIVS+ V KPLNQV+SSLVDQE SHQ+EN NLS Sbjct: 1001 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSIPVQKPLNQVLSSLVDQEFSHQVENHNLSPE 1060 Query: 1485 
DLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLA 1306 +LH+TY VDEFEVRI+EPEKSGGPWQTRATIPMQ+SENALTVRVVTL+N TTKENETLLA Sbjct: 1061 ELHRTYTVDEFEVRIMEPEKSGGPWQTRATIPMQTSENALTVRVVTLFNTTTKENETLLA 1120 Query: 1305 IGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASG 1126 IGTAY+QGEDVAGRGRVLLFS N DN QNLVSEVFSKE KGAISALASLQG+LL+ASG Sbjct: 1121 IGTAYVQGEDVAGRGRVLLFSAENNVDNPQNLVSEVFSKELKGAISALASLQGNLLIASG 1180 Query: 1125 PKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLL 946 PKI L+KW ++LT +AFFDVPPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQG QL+LL Sbjct: 1181 PKIILHKWTGSDLTGIAFFDVPPLYVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLNLL 1240 Query: 945 AKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGA 766 AKD+ +LDCFATEFLIDGSTLSL V+D QKN+QI YYAPK SESW+GQKLL+RAEFHVGA Sbjct: 1241 AKDFGNLDCFATEFLIDGSTLSLAVADAQKNIQILYYAPKISESWRGQKLLTRAEFHVGA 1300 Query: 765 HVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSL 586 HVTKF RLQML DKT R+ALLF TLDG IG IAPL+ LTFRRLQSL Sbjct: 1301 HVTKFLRLQML--STSSDRTGKNPGSDKTVRYALLFGTLDGGIGSIAPLEELTFRRLQSL 1358 Query: 585 QRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQI 406 Q KLVDAV HVAGLNPRSFRQF+S+GKAH+PGPD+IVDCELL HYEML LEEQLEIA QI Sbjct: 1359 QNKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDSIVDCELLFHYEMLSLEEQLEIAQQI 1418 Query: 405 GTTRSQIISNLNDLSLGTSFL 343 GTTR QI+SNL+DLSLGTSFL Sbjct: 1419 GTTRLQILSNLDDLSLGTSFL 1439 >ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Glycine max] Length = 1447 Score = 2268 bits (5876), Expect = 0.0 Identities = 1123/1461 (76%), Positives = 1277/1461 (87%), Gaps = 5/1461 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPAR--RGIGPVPNL 4537 MS+AAYKMM PTGI+NCA+GF+TH +DF P LQ DDL+++WP+R +G +PNL Sbjct: 1 MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAEWPSRPRHHVGSLPNL 56 Query: 4536 IITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAV 4357 ++TA NVLEVY +R+QE+ + + + +++RG ++ GI+GA+LELVCHYRLHGNVETMAV Sbjct: 57 
VVTAANVLEVYAVRLQED--QPPKAAADSRRGALLDGIAGASLELVCHYRLHGNVETMAV 114 Query: 4356 LSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFA 4177 LSIGGGD RRRDSI+L F DAKISVLE+DDS+HGLRTSS+H FEGPEWLHLKRGRE FA Sbjct: 115 LSIGGGDVSRRRDSIMLTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFA 174 Query: 4176 RGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISL 3997 RGP+VK DPQGRC GVL+Y LQMIILK QAGSGLVG+DDAL S GAV+AR+ESSY+I+L Sbjct: 175 RGPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMINL 234 Query: 3996 RDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLI 3817 RDL M+HVKDF FVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLI Sbjct: 235 RDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 294 Query: 3816 WSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRS 3637 WSA+NLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALA+N++AV DSSQ++PRS Sbjct: 295 WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPRS 354 Query: 3636 SISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTI 3457 S +VELD ANA WL +DVA+LSTKTGELLLLTLVYDGRVV RLDL+KS+ASVL+SGITTI Sbjct: 355 SFNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTI 414 Query: 3456 GSSLFFLGSRLGDSLLVQYTCGVGA---SSGVKEEVGDIEGDAPSAKRLRRSSSDALQDI 3286 G+SLFFL SRLGDS+LVQ++CG G SS +KEEVGDIE DAPS KRLRRS SDALQD+ Sbjct: 415 GNSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPS-KRLRRSPSDALQDM 473 Query: 3285 VNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSN 3106 V+GEELSLYGSAPN ESAQK+FSFAVRDS+INVGPLKDFSYGLRINAD NA GIAKQSN Sbjct: 474 VSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 533 Query: 3105 YELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVD 2926 YELVCCSGHGKNG+LCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR HNADSSKM+ D Sbjct: 534 YELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDD 593 Query: 2925 DEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSR 2746 DEYHAYLIISLE+RTMVLET D+L EVTESVDYYVQG T+AAGNLFGR RV+QV+ RG+R Sbjct: 594 DEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGAR 653 Query: 
2745 ILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTV 2566 ILDGSFMTQD+S G N ESG S+S+ SIADP+VLLRM+DGSI+LL+GDPSTCT+ Sbjct: 654 ILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCTI 713 Query: 2565 SINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDI 2386 S+ P+ FESSK +S CTLYHDKGPEPWLRK STDAWLSTGVGE IDG DGA D GDI Sbjct: 714 SVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGETIDGTDGAAQDHGDI 773 Query: 2385 YCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAG 2206 YCVVC+++G LEIFDVPNF+CVFSV NFMSGK +LVD ++E KD K + + V Sbjct: 774 YCVVCFDNGNLEIFDVPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVIN 830 Query: 2205 QARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026 Q RKEN +MKVVE+ MQRWSG HS PFLFGIL+DGTILCYHAYL+E P++TSK+E++ S Sbjct: 831 QGRKENIPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDSTSKVEDSAS 890 Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846 S+ L+ST+ SRLRNLRFVRV LD Y RE+T G Q++T+FKN+G Y+G FLSGSR Sbjct: 891 AGGSIGLSSTNVSRLRNLRFVRVPLDAYAREDTSNGPPCQQITIFKNIGSYEGFFLSGSR 950 Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666 PAW MV+RERLRVHPQ+CDGSIVAFTVLHNVNCN GLIYVTS+G+LKICQLPS ++YD+Y Sbjct: 951 PAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSY 1010 Query: 1665 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSN 1486 WPVQKIPLK TPHQVTYFAEKNLYPLIVS VLKPLNQV+ SLVDQ+ +HQ E+ N++ + Sbjct: 1011 WPVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVI-SLVDQDINHQNESQNMNPD 1069 Query: 1485 DLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLA 1306 + ++ Y +DEFEVRI+EPEKSGGPWQT+ATIPMQSSENALTVR+VTL N T+KENETLLA Sbjct: 1070 EQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLVNTTSKENETLLA 1129 Query: 1305 IGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASG 1126 IGTAY+QGEDVA RGR+LLFS+G+NTDN Q LVSEV+SKE KGAISALASLQGHLL+ASG Sbjct: 1130 IGTAYVQGEDVAARGRILLFSLGKNTDNPQTLVSEVYSKELKGAISALASLQGHLLIASG 1189 Query: 1125 PKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLL 946 PKI L+KWN 
TEL +AFFD PPLHVVSLNIVKNFIL+GDIHKSIYFLSWKEQG QLSLL Sbjct: 1190 PKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLL 1249 Query: 945 AKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGA 766 AKD+ SLDCFATEFLIDGSTLSLMVSDD +N+QIFYYAPK SESWKGQKLLSRAEFHVGA Sbjct: 1250 AKDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1309 Query: 765 HVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSL 586 HVTKF RLQML DKTNRFALLF TLDGSIGCIAPLD +TFRRLQSL Sbjct: 1310 HVTKFLRLQML---STSDRAGAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSL 1366 Query: 585 QRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQI 406 QRKLVDAV HVAGLNPR+FR F+S+GKAH+PGPD+IVDCELLCHYEMLPLEEQLEIAHQ+ Sbjct: 1367 QRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQV 1426 Query: 405 GTTRSQIISNLNDLSLGTSFL 343 GTTRSQI+SNL+DLSLGTSFL Sbjct: 1427 GTTRSQILSNLSDLSLGTSFL 1447 >ref|XP_007152397.1| hypothetical protein PHAVU_004G126600g [Phaseolus vulgaris] gi|561025706|gb|ESW24391.1| hypothetical protein PHAVU_004G126600g [Phaseolus vulgaris] Length = 1445 Score = 2264 bits (5867), Expect = 0.0 Identities = 1117/1460 (76%), Positives = 1277/1460 (87%), Gaps = 4/1460 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPAR-RGIGPVPNLI 4534 MS+AAYKMM TGI+NCA+GF+TH AD P LQ +DL+++WP+R R +GP+PNL+ Sbjct: 1 MSFAAYKMMQCSTGIDNCAAGFLTHSRADSVP----LQPEDLDAEWPSRPRRVGPLPNLV 56 Query: 4533 ITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVL 4354 +TA NVLEVY +R+QE+ + + +RG ++ GI GA+LELVCHYRLHGNVETMAVL Sbjct: 57 VTAANVLEVYTVRIQEDQPPKA---ADPRRGTLLDGIDGASLELVCHYRLHGNVETMAVL 113 Query: 4353 SIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFAR 4174 SIGGGD R+RDSIIL F DAKISVLE+DDS+HGLRTSS+H FEGPEWLHLKRGRE FAR Sbjct: 114 SIGGGDASRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFAR 173 Query: 4173 GPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLR 3994 GP+VK DPQGRC G L+Y LQMIILK QAGSGLVGDDDAL GAV+AR+ESSY+I+LR Sbjct: 174 
GPVVKVDPQGRCGGTLIYDLQMIILKATQAGSGLVGDDDALGFSGAVAARIESSYMINLR 233 Query: 3993 DLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIW 3814 DL M+HVKDF FVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIW Sbjct: 234 DLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 293 Query: 3813 SAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSS 3634 SA+NLPHDAYKLLAVPSPIGGVLVI ANT+HYHSQSASCALA+N++AV+ D+SQ++PRSS Sbjct: 294 SAVNLPHDAYKLLAVPSPIGGVLVIGANTVHYHSQSASCALALNSYAVSLDNSQEIPRSS 353 Query: 3633 ISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIG 3454 +VELD+ANA WL +DVA+LSTKTGELLLLTLVYDGRVV RLDL+KS+ASVL+SGITTIG Sbjct: 354 FNVELDSANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTIG 413 Query: 3453 SSLFFLGSRLGDSLLVQYTCGVGA---SSGVKEEVGDIEGDAPSAKRLRRSSSDALQDIV 3283 +SLFFL SRLGDS+LVQ++CG G SS +KEEVGDIE DAPS KRLRRS SD LQD+V Sbjct: 414 NSLFFLASRLGDSMLVQFSCGSGGSMLSSNLKEEVGDIEADAPS-KRLRRSPSDTLQDVV 472 Query: 3282 NGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNY 3103 +GEELSLYGSAPN ESAQK+FSFAVRDS+INVGPLKDFSYGLRINAD NA GIAKQSNY Sbjct: 473 SGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSNY 532 Query: 3102 ELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDD 2923 ELVCCSGHGKNG+LCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR HN DSSK++ DD Sbjct: 533 ELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNTDSSKLADDDD 592 Query: 2922 EYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRI 2743 EYHAYLIISLE+RTMVLET D+L EVTESVDYYVQG T+AAGNLFGRRRV+QV+ RG+RI Sbjct: 593 EYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGARI 652 Query: 2742 LDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVS 2563 LDGSFMTQD++ G N+ES SES+ SIADP+VLLRM+DGS++LL+GDP TCT+S Sbjct: 653 LDGSFMTQDVTFGASNSESASASESAIALSVSIADPFVLLRMSDGSVRLLIGDPITCTIS 712 Query: 2562 INIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIY 2383 + P+ FES+K +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDG DGA D GDIY Sbjct: 713 VTSPASFESTKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGDIY 772 Query: 2382 
CVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQ 2203 CVVC+++G LEIFDVPNF+CVFSVGNFMSGK +LVD ++E KD K + + V Q Sbjct: 773 CVVCFDNGNLEIFDVPNFNCVFSVGNFMSGKSHLVDALMKEVLKDSK---KGDRDGVIIQ 829 Query: 2202 ARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISG 2023 RKEN +MKVVE+ MQRWSG HS PFLFGIL+DGTILCYHAYL+E P+ TSK+E++ S Sbjct: 830 GRKENVPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSASA 889 Query: 2022 QNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRP 1843 S+ L +T+ SRLRNLRFVRVSLD Y REET G Q++T+FKN+G YQG FLSGSRP Sbjct: 890 GGSIGLGTTNISRLRNLRFVRVSLDAYAREETSNGSLHQQITIFKNIGSYQGFFLSGSRP 949 Query: 1842 AWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYW 1663 AW MV+RERLRVHPQ+CDGSIVAFTVLHNVNCNHGLIYVTS+G+LKICQLPS ++YD+YW Sbjct: 950 AWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDSYW 1009 Query: 1662 PVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSND 1483 PVQKIPLK TPHQVTYFAEKNLYPLIVS VLKPL+QV+ SLVDQ+ +HQ E+ N++S++ Sbjct: 1010 PVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLSQVI-SLVDQDVNHQNESQNMNSDE 1068 Query: 1482 LHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAI 1303 ++ Y +DEFEVRI+EPEKSGGPWQT+ATIPMQSSENALTVR+VTL N T+KENETLLAI Sbjct: 1069 QNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETLLAI 1128 Query: 1302 GTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGP 1123 GTAY+QGEDVA RGR+LLFS+G+NTDN Q+LVSEV+SKE KGAISALASLQGHLL+ASGP Sbjct: 1129 GTAYVQGEDVAARGRILLFSLGKNTDNPQSLVSEVYSKELKGAISALASLQGHLLIASGP 1188 Query: 1122 KITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLA 943 KI L+KWN TEL +AFFD PPLHVVSLNIVKNFIL+GDIHKSIYFLSWKEQG QLSLLA Sbjct: 1189 KIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLLA 1248 Query: 942 KDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAH 763 KD++SLDCFATEFLIDGSTLSLMVSDD++N+QIFYYAPK SESWKGQKLLSRAEFHVGAH Sbjct: 1249 KDFSSLDCFATEFLIDGSTLSLMVSDDKRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1308 Query: 762 VTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQ 583 VTKF RLQML 
DKTNRFALLF TLDGSIGCIAPLD +TFRRLQSLQ Sbjct: 1309 VTKFLRLQML---PTSDRAGSAPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQ 1365 Query: 582 RKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIG 403 +KLVDAV HVAGLNPR+FR+F+S+GKAH+PGPD+IVDCELLCHYEMLPLEEQLEIAHQ+G Sbjct: 1366 KKLVDAVAHVAGLNPRAFRKFQSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVG 1425 Query: 402 TTRSQIISNLNDLSLGTSFL 343 TTRSQI+SNL+DLSLGTSFL Sbjct: 1426 TTRSQILSNLSDLSLGTSFL 1445 >ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like isoform X1 [Glycine max] Length = 1449 Score = 2256 bits (5846), Expect = 0.0 Identities = 1123/1462 (76%), Positives = 1272/1462 (87%), Gaps = 6/1462 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLES-DWPAR--RGIGPVPN 4540 MS+AAYKMM PTGI+NCA+GF+TH +DF P LQ DDL++ +WP+R +GP+PN Sbjct: 1 MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAAEWPSRPRHHVGPLPN 56 Query: 4539 LIITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMA 4360 L++TA NVLEVY +R+QE D + S +++RG ++ GI+GA+LEL CHYRLHGNVETMA Sbjct: 57 LVVTAANVLEVYAVRLQE-DQQPKDASDDSRRGTLLDGIAGASLELECHYRLHGNVETMA 115 Query: 4359 VLSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESF 4180 VLSIGGGD R+RDSIIL F DAKISVLE+DDS+HGLRTSS+H FEGPEWLHLKRGRE F Sbjct: 116 VLSIGGGDVSRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQF 175 Query: 4179 ARGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIIS 4000 ARGP+VK DPQGRC GVL+Y LQMIILK Q GSGLVGDDDA S GAV+AR+ESSY+I+ Sbjct: 176 ARGPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMIN 235 Query: 3999 LRDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPL 3820 LRDL M+HVKDF FV+GYIEPVMVILHERELTW+GR+SW HHTC ISALSISTTLKQHPL Sbjct: 236 LRDLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHPL 295 Query: 3819 IWSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPR 3640 IWSA+NLPHDAYKLLAVPSPIGGVLVI ANTIHYHSQSASCALA+NN+AV DSSQ++PR Sbjct: 296 IWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIPR 355 Query: 3639 
SSISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITT 3460 SS +VELD ANA WL +DVA+LSTKTGELLLL LVYDGRVV RLDL+KS+ASVL+SGITT Sbjct: 356 SSFNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGITT 415 Query: 3459 IGSSLFFLGSRLGDSLLVQYTCGVGA---SSGVKEEVGDIEGDAPSAKRLRRSSSDALQD 3289 IG+SLFFL SRLGDS+LVQ++CG G SS +KEEVGDIE DAPS KRLRRS SDALQD Sbjct: 416 IGNSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPS-KRLRRSPSDALQD 474 Query: 3288 IVNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQS 3109 +V+GEELSLYGSA N ESAQK+FSFAVRDS+INVGPLKDFSYGLRINAD NA GIAKQS Sbjct: 475 MVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 534 Query: 3108 NYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAV 2929 NYELVCCSGHGKNG+LCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR HNADSSKM+ Sbjct: 535 NYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADD 594 Query: 2928 DDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGS 2749 DDEYHAYLIISLE+RTMVLET D+L EVTESVDYYVQG T+AAGNLFGRRRV+QV+ RG+ Sbjct: 595 DDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGA 654 Query: 2748 RILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCT 2569 RILDGSFMTQD+S G N+ESG SES+ SIADP+VLLRM+DGSI+LL+GDPSTCT Sbjct: 655 RILDGSFMTQDVSFGASNSESGSASESAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCT 714 Query: 2568 VSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGD 2389 +S+ P+ FESSK +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDG DGA D GD Sbjct: 715 ISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGD 774 Query: 2388 IYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVA 2209 IYCVVC+++G LEIFD+PNF+CVFSV NFMSGK +LVD ++E KD K + + V Sbjct: 775 IYCVVCFDNGNLEIFDIPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVV 831 Query: 2208 GQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAI 2029 Q RK+N NMKVVE+ MQRWSG HS PFLFGIL+DGTILCYHAYL+E P+ TSK+E++ Sbjct: 832 NQGRKDNIPNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSA 891 Query: 2028 SGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGS 1849 S S+ L+ST+ SRLRNLRFVRV 
LD Y RE+T G Q++T+FKN+G YQG FLSGS Sbjct: 892 SAGGSIGLSSTNVSRLRNLRFVRVPLDAYPREDTSNGSPCQQITIFKNIGSYQGFFLSGS 951 Query: 1848 RPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDN 1669 RPAW MV+RERLRVHPQ+CDGSIVAFTVLHNVNCNHGLIYVTS+G+LKICQLPS ++YD+ Sbjct: 952 RPAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDS 1011 Query: 1668 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSS 1489 YWPVQKIPLK TPHQVTYFAEKNLYPLIVS VLKPLNQV+ SLVDQ+ +HQ E+ N++ Sbjct: 1012 YWPVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVI-SLVDQDFNHQNESQNMNP 1070 Query: 1488 NDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLL 1309 ++ ++ Y +DEFEVRI+EPEKSGGPWQT+ATIPMQSSENALTVR+VTL N T+KENETLL Sbjct: 1071 DEQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETLL 1130 Query: 1308 AIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVAS 1129 AIGTAY+QGEDVA RGR+LLFS+G+ TDN Q LVSEV+SKE KGAISALASLQGHLL+AS Sbjct: 1131 AIGTAYVQGEDVAARGRILLFSLGKITDNPQTLVSEVYSKELKGAISALASLQGHLLIAS 1190 Query: 1128 GPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSL 949 GPKI L+KWN TEL +AFFD PPLHVVSLNIVKNFIL+GDIHKSIYFLSWKEQG QLSL Sbjct: 1191 GPKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSL 1250 Query: 948 LAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVG 769 LAKD+ SLDCFATEFLIDGSTLSLMVSDD +N+QIFYYAPK SESWKGQKLLSRAEFHVG Sbjct: 1251 LAKDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1310 Query: 768 AHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQS 589 AHVTKF RLQML DKTNRFALLF TLDGSIGCIAPLD +TFRRLQS Sbjct: 1311 AHVTKFLRLQML---STSDRAGSVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQS 1367 Query: 588 LQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQ 409 LQRKLVDAV HVAGLNPR+FR F+S+GKAH+PGPD+IVDCELLCHYEMLPLEEQLEIA+Q Sbjct: 1368 LQRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQ 1427 Query: 408 IGTTRSQIISNLNDLSLGTSFL 343 IGTTRSQI+SNL+DLSLGTSFL Sbjct: 1428 IGTTRSQILSNLSDLSLGTSFL 1449 >ref|XP_002318462.2| cleavage and polyadenylation specificity factor family 
protein [Populus trichocarpa] gi|550326263|gb|EEE96682.2| cleavage and polyadenylation specificity factor family protein [Populus trichocarpa] Length = 1455 Score = 2255 bits (5844), Expect = 0.0 Identities = 1135/1468 (77%), Positives = 1268/1468 (86%), Gaps = 12/1468 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARR----GIGPVP 4543 MSYAAYKMMHWPT I+ C SGF+TH ++ A +P L TDDL+SDWP+RR GIGP P Sbjct: 1 MSYAAYKMMHWPTTIDTCVSGFVTHSRSESA-HLPQLHTDDLDSDWPSRRRHGGGIGPTP 59 Query: 4542 NLIITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETM 4363 NLI+ +GNVLE+Y++RVQEE +R SGE KRGGV+ G++GA+LELVCHYRLHGNVE+M Sbjct: 60 NLIVASGNVLELYVVRVQEEG---ARSSGELKRGGVMDGVAGASLELVCHYRLHGNVESM 116 Query: 4362 AVLSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRES 4183 VLS+ GGD RRRDSIILAF+DAKISVLEFDDS+HGLRTSSMH FEGP+W HLKRGRES Sbjct: 117 GVLSVEGGDDSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRES 176 Query: 4182 FARGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYII 4003 FARGPLVK DPQGRC GVLVY LQMIILK AQAGS LV D+DA SG A+SA + SSYII Sbjct: 177 FARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYII 236 Query: 4002 SLRDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHP 3823 +LRDL MKHVKDFIFVH YIEPV+V+LHERELTW+GR+ WKHHTC ISALSISTTLKQ Sbjct: 237 NLRDLDMKHVKDFIFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQPT 296 Query: 3822 LIWSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMP 3643 LIWS NLPHDAYKLLAVPSPIGGVLVI NTIHYHS+SASCALA+N++A + DSSQ++P Sbjct: 297 LIWSIGNLPHDAYKLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQELP 356 Query: 3642 RSSISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGIT 3463 R++ SVELD ANA WL DVA+LSTKTGELLLLTLVYDGRVV RLDL+KS+ASVLTS IT Sbjct: 357 RATFSVELDAANATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDIT 416 Query: 3462 TIGSSLFFLGSRLGDSLLVQYTCGVGA---SSGVKEEVGDIEGDAPSAKRLRRSSSDALQ 3292 T+G+S FFLGSRLGDSLLVQ+T G+G+ S G+KEEVGDIEGD PSAKRL+ SSSDALQ Sbjct: 417 TLGNSFFFLGSRLGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDLPSAKRLKVSSSDALQ 476 Query: 3291 
DIVNGEELSLYGSAPNNAESAQ-----KNFSFAVRDSVINVGPLKDFSYGLRINADPNAA 3127 D+V+GEELSLY SAPNNAES+Q K FSF VRDS+INVGPLKDF+YGLRINAD NA Sbjct: 477 DMVSGEELSLYSSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANAT 536 Query: 3126 GIAKQSNYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADS 2947 GI+KQSNYELVCCSGHGKNGALCVLQQSI PE+ITEVEL GC+GIWTVYHKN R H+ DS Sbjct: 537 GISKQSNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARSHSVDS 596 Query: 2946 SKMSAVDDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQ 2767 KM A DDEYHAYLIIS+E+RTMVLET D L EVTESVDY+VQG TIAAGNLFGRRRVVQ Sbjct: 597 LKM-ASDDEYHAYLIISMEARTMVLETADHLTEVTESVDYFVQGRTIAAGNLFGRRRVVQ 655 Query: 2766 VFARGSRILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVG 2587 VF RG+RILDGSFMTQDLS G N+E+G SESSTV SI DPYVL+RM DGSIQ+LVG Sbjct: 656 VFERGARILDGSFMTQDLSFGGSNSETG-RSESSTVMHVSIVDPYVLVRMADGSIQILVG 714 Query: 2586 DPSTCTVSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGA 2407 DPS CTVS+N PS F+SS K +S CTLYHDKGPEPWLRK STDAWLSTG+ EAIDGAD Sbjct: 715 DPSACTVSVNTPSAFQSSTKSVSACTLYHDKGPEPWLRKTSTDAWLSTGISEAIDGADSG 774 Query: 2406 PHDQGDIYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNT 2227 H+QGDIYCVVCYE+G LEIFDVPNF+ VF V F+SGK +L+DT EP+KD Sbjct: 775 AHEQGDIYCVVCYETGALEIFDVPNFNSVFFVDKFVSGKTHLLDTCTGEPAKDMMKGV-- 832 Query: 2226 NSEEVAGQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTS 2047 EEVAG RKE+T+NMKVVE+TM RWSG HS PFLFGILTDGTILCYHAYLFEGP+ TS Sbjct: 833 -KEEVAGAGRKESTQNMKVVELTMLRWSGRHSRPFLFGILTDGTILCYHAYLFEGPDGTS 891 Query: 2046 KMEEAISGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQG 1867 K+E+++S QNSV ++ SASRLRNLRFVRV LDTYTREET + + QR+T FKN+ GYQG Sbjct: 892 KLEDSVSAQNSVGASTISASRLRNLRFVRVPLDTYTREETSSETSCQRITTFKNISGYQG 951 Query: 1866 LFLSGSRPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPS 1687 FLSGSRPAWFMV RERLRVHPQ+CDGSIVAFTVLH VNCNHGLIYVTS+G LKIC L S Sbjct: 952 FFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSS 1011 Query: 1686 VTSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIE 1507 
V+SYDNYWPVQKIPLKGTPHQVTYFAE+NLYPLIVSV V KP+NQV+SSLVDQE HQIE Sbjct: 1012 VSSYDNYWPVQKIPLKGTPHQVTYFAERNLYPLIVSVPVQKPVNQVLSSLVDQEVGHQIE 1071 Query: 1506 NDNLSSNDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTK 1327 N NLSS ++H+TY VDEFEVRILEP S GPWQ +ATIPMQ+SENALTVR+V+L+N +TK Sbjct: 1072 NHNLSSEEIHRTYSVDEFEVRILEP--SNGPWQVKATIPMQTSENALTVRMVSLFNTSTK 1129 Query: 1326 ENETLLAIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQG 1147 ENETLLA+GTAY+QGEDVA RGR+LLFSV +N +N+Q LVSEV+SKE KGAISALASLQG Sbjct: 1130 ENETLLAVGTAYVQGEDVAARGRILLFSVVKNPENSQILVSEVYSKELKGAISALASLQG 1189 Query: 1146 HLLVASGPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQ 967 HLL+ASGPKI L+KW TELT VAF D PPL+VVSLNIVKNFILLGDIHKSIYFLSWKEQ Sbjct: 1190 HLLIASGPKIILHKWTGTELTGVAFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1249 Query: 966 GCQLSLLAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSR 787 G QLSLLAKD+ASLDCF+TEFLIDGSTLSL+VSD+QKNVQIFYYAPK SESWKGQKLLSR Sbjct: 1250 GAQLSLLAKDFASLDCFSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQKLLSR 1309 Query: 786 AEFHVGAHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLT 607 AEFHVGA VTKF RLQML DKTNRFALLF TLDGSIGCIAPLD LT Sbjct: 1310 AEFHVGALVTKFMRLQML--SPSLDRSGAAPVSDKTNRFALLFGTLDGSIGCIAPLDELT 1367 Query: 606 FRRLQSLQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQ 427 FRRLQSLQ+KLVDAV HVAGLNP+SFRQF+S GKAH+PGP++IVDCE+L +YEM+PLEEQ Sbjct: 1368 FRRLQSLQKKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQ 1427 Query: 426 LEIAHQIGTTRSQIISNLNDLSLGTSFL 343 +EIA QIGTTR+QI+SNLNDL+LGTSFL Sbjct: 1428 VEIAQQIGTTRAQILSNLNDLTLGTSFL 1455 >ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Cicer arietinum] Length = 1447 Score = 2222 bits (5759), Expect = 0.0 Identities = 1102/1463 (75%), Positives = 1261/1463 (86%), Gaps = 7/1463 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQ---TDDLESDW-PARRGIGPVP 4543 MS+AAYKMM WPTGI+NCASGF+TH +D P+IP +Q DD++SDW P R + P+P Sbjct: 1 MSFAAYKMMQWPTGIQNCASGFLTHSRSDSTPRIPPIQHNDDDDIDSDWVPQPRDLAPLP 60 
Query: 4542 NLIITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETM 4363 NL+ITA N+LEVY +R+Q++ + S V+ G++GA+LELVCHYRLHGNVE++ Sbjct: 61 NLVITAANILEVYTVRIQQDPPKSSADPR------VLDGLAGASLELVCHYRLHGNVESV 114 Query: 4362 AVLSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRES 4183 AVLS+GGGD RRRDSIIL F+DAKISVLE+DDS+HGLRTSS+H FEGPEWLHLKRGRE Sbjct: 115 AVLSVGGGDASRRRDSIILTFKDAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREH 174 Query: 4182 FARGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYII 4003 FARGP+ K DPQGRC GVLVY LQMIILKT QAGSGLVG+DD L SGGAV+AR+ESSY+I Sbjct: 175 FARGPVAKVDPQGRCGGVLVYDLQMIILKTTQAGSGLVGEDDVLGSGGAVAARIESSYMI 234 Query: 4002 SLRDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHP 3823 +LRDL M+HVKDF F+HGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHP Sbjct: 235 NLRDLDMRHVKDFTFLHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 294 Query: 3822 LIWSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMP 3643 LIWSA+NLPHDAYKLLAVPSPIGGVLVI ANTIHYHSQSASCALA+N++AV+ D+SQ+MP Sbjct: 295 LIWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNSYAVSVDNSQEMP 354 Query: 3642 RSSISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGIT 3463 RSS +VELD ANA WL NDVA+LSTKTGELLLLTL+YDGRVV RLDL+KS+ASVL+SG+T Sbjct: 355 RSSFNVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLSSGVT 414 Query: 3462 TIGSSLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQ 3292 TIG+SLFFL SRLGDS+LVQ++ G G S S +KEEVGD + DA SAKR+RRS SD LQ Sbjct: 415 TIGNSLFFLASRLGDSMLVQFSSGSGVSMLSSNLKEEVGDFDVDASSAKRMRRSPSDTLQ 474 Query: 3291 DIVNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQ 3112 D+V+GEELSLYGSA N ESAQK+FSFAVRDS+INVGPLKDFSYGLRINAD NA GIAKQ Sbjct: 475 DMVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQ 534 Query: 3111 SNYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSA 2932 SNYELVCCSGHGKNG+LCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR NADSSK++ Sbjct: 535 SNYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSLNADSSKLAD 594 Query: 2931 VDDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARG 2752 
+DEYHAYLIISLESRTMVLET D+L EVTESVDYYVQG T+AAGNLFGRRRV+QV+ RG Sbjct: 595 DEDEYHAYLIISLESRTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERG 654 Query: 2751 SRILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTC 2572 +RILDGSFMTQD+S G N+E+ GSES+ SIADPYVLL+M+DGS++LLVGDPSTC Sbjct: 655 ARILDGSFMTQDVSFGASNSEANYGSESALALSVSIADPYVLLKMSDGSVRLLVGDPSTC 714 Query: 2571 TVSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQG 2392 T+S+ P+ FESSK +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDG DGA D G Sbjct: 715 TISVTSPASFESSKGSVSTCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHG 774 Query: 2391 DIYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEV 2212 DIYCVVCYE+ +LEIFDVPNFSCVFSV NF+SGK +LVD +E KD + S+ V Sbjct: 775 DIYCVVCYENDSLEIFDVPNFSCVFSVENFLSGKSHLVDALTKEVPKDSQKGDKV-SDGV 833 Query: 2211 AGQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEA 2032 Q RK+ NMKVVE+ MQRWSG H PFLFGIL+DGT LCYHAYL+E P+ TSK+E++ Sbjct: 834 VSQGRKD-ALNMKVVELAMQRWSGKHGRPFLFGILSDGTTLCYHAYLYESPDGTSKVEDS 892 Query: 2031 ISGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSG 1852 + S L+++S SRLRNLRFVRV LD + REET G Q++ +FKN+G Y+G FLSG Sbjct: 893 V----SAGLSNSSVSRLRNLRFVRVPLDVHAREETSNGPPCQQINIFKNIGSYEGFFLSG 948 Query: 1851 SRPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYD 1672 SRPAW M++RERLRVHPQ+CDGSIVAFTVLHNVNCNHGLIYVTS+G+LKICQLPS ++YD Sbjct: 949 SRPAWVMLLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYD 1008 Query: 1671 NYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLS 1492 YWPVQK+PLK TPHQVTYFAEKNLYPLIVS V KPLNQV+ +LVDQ+A+ E+ NL+ Sbjct: 1009 CYWPVQKVPLKATPHQVTYFAEKNLYPLIVSYPVPKPLNQVI-ALVDQDANQLTESQNLN 1067 Query: 1491 SNDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETL 1312 +++ Y ++EFEVRI+EPEKSGGPWQ +ATIPMQSSENALTVR+VTL N ++KENETL Sbjct: 1068 NDEQSHLYTIEEFEVRIMEPEKSGGPWQLKATIPMQSSENALTVRMVTLMNTSSKENETL 1127 Query: 1311 LAIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVA 1132 LAIGTAY+QGEDVA RGR+LLFS+G+NTDN QNLVSEV+SKE KGAISALA+LQGHLLVA Sbjct: 1128 
LAIGTAYVQGEDVAARGRILLFSLGKNTDNPQNLVSEVYSKELKGAISALAALQGHLLVA 1187 Query: 1131 SGPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLS 952 SGPKI L+KW TEL VAFFDVPPLHVVSLNIVKNFIL+GD+HKSIYFLSWKEQG QLS Sbjct: 1188 SGPKIILHKWTGTELNGVAFFDVPPLHVVSLNIVKNFILIGDVHKSIYFLSWKEQGAQLS 1247 Query: 951 LLAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHV 772 LLAKD+ SLDCFATEFLIDGSTLSLMVSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHV Sbjct: 1248 LLAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1307 Query: 771 GAHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQ 592 GAH+TKF RLQML DKTNRFALLF TLDGSIGCIAPLD +TFRRLQ Sbjct: 1308 GAHITKFLRLQML---STSDKTGSGPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQ 1364 Query: 591 SLQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAH 412 SLQ+KLVDAV HVAGLNPR+FR F S+GKAH+PGPD+IVDCELLCHYEML LEEQLEIAH Sbjct: 1365 SLQKKLVDAVPHVAGLNPRAFRLFHSNGKAHRPGPDSIVDCELLCHYEMLQLEEQLEIAH 1424 Query: 411 QIGTTRSQIISNLNDLSLGTSFL 343 Q+GTTRSQI+SNL+DLSLGTSFL Sbjct: 1425 QVGTTRSQILSNLSDLSLGTSFL 1447 >ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp. lyrata] gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp. 
lyrata] Length = 1444 Score = 2181 bits (5652), Expect = 0.0 Identities = 1073/1462 (73%), Positives = 1255/1462 (85%), Gaps = 6/1462 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQ-TDDLESDWPA-RRGIGPVPNL 4537 MS+AA+KMMHWPTG+ENCASG+ITH +D QIP + DD+E++WP +RGIGP+PN+ Sbjct: 1 MSFAAFKMMHWPTGVENCASGYITHSLSDSTLQIPIVSGDDDMEAEWPNHKRGIGPLPNV 60 Query: 4536 IITAGNVLEVYIIRVQEE-DVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMA 4360 +ITAGN+LEVYI+R QEE + ++ R KRGGV+ G+SG +LELVCHYRLHGNVE++A Sbjct: 61 VITAGNILEVYIVRAQEEGNTQELRIPKLVKRGGVMDGVSGVSLELVCHYRLHGNVESIA 120 Query: 4359 VLSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESF 4180 VL +GGG+ + RDSIIL F+DAKISVLEFDDS+H LR +SMH FEGP+WLHLKRGRESF Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180 Query: 4179 ARGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIIS 4000 RGPLVK DPQGRC GVLVYGLQMIILK +Q GSGLVGDDDA +SGG VSARVESSYII+ Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKASQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240 Query: 3999 LRDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPL 3820 LRDL MKHVKDF+F+HGYIEPV+VIL E E TW+GR+SWKHHTC +SALSI+TTLKQHP+ Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINTTLKQHPV 300 Query: 3819 IWSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPR 3640 IWSAINLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+A +ADSSQ++P Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360 Query: 3639 SSISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITT 3460 S+ SVELD A+ W+S+DVA+LSTK+GELLLLTL+YDGR V RLDL+KS+ASVL S IT+ Sbjct: 361 SNFSVELDAAHGTWISSDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420 Query: 3459 IGSSLFFLGSRLGDSLLVQYTCGVGASS---GVKEEVGDIEGDAPSAKRLRRSSSDALQD 3289 +G+SLFFLGSRLGDSLLVQ++C G ++ G+++E DIEG+ AKRLR SSD QD Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLR-ISSDTFQD 479 Query: 3288 IVNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQS 3109 + EELSL+GS PNN++SAQK+FSFAVRDS++NVGP+KDF+YGLRINAD NA G++KQS Sbjct: 480 
TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539 Query: 3108 NYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAV 2929 NYELVCCSGHGKNGALCVL+QS+ PE+ITEVEL GC+GIWTVYHK++RGHNADSSKM+A Sbjct: 540 NYELVCCSGHGKNGALCVLRQSVRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599 Query: 2928 DDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGS 2749 +DEYHAYLIIS+E+RTMVLET D+L EVTESVDYYVQG TIAAGNLFGRRRV+QVF G+ Sbjct: 600 EDEYHAYLIISVEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659 Query: 2748 RILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCT 2569 RILDGSFM Q+LS G PN+ES GSESSTVS SIADPYVLLRMTD SI+LLVGDPSTCT Sbjct: 660 RILDGSFMNQELSFGAPNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719 Query: 2568 VSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGD 2389 VSI+ PSV E SKK IS CTL+HDKGPEPWLRKASTDAWLS+GVGEA+D ADG P DQGD Sbjct: 720 VSISSPSVLEGSKKKISACTLFHDKGPEPWLRKASTDAWLSSGVGEAVDSADGGPQDQGD 779 Query: 2388 IYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVA 2209 IYCV+CYESG LEIFDVP F+CVFSV F SG+ +L D + E + N NSE+ A Sbjct: 780 IYCVLCYESGALEIFDVPGFNCVFSVDKFASGRRHLSDMPIHELEYE----LNKNSEDNA 835 Query: 2208 GQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAI 2029 +R E +N KVVE++MQRWSGPH+ PFLF +L DGTILCYHAYLFEG ++T K E ++ Sbjct: 836 S-SRNEEIKNTKVVELSMQRWSGPHTRPFLFAVLADGTILCYHAYLFEGVDST-KAENSV 893 Query: 2028 SGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGS 1849 S +N LNS+ +S+LRNL+F+R+ DT TRE T G+ SQR+T+FKN+ G+QG FLSGS Sbjct: 894 SSENPAALNSSGSSKLRNLKFLRIPFDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 953 Query: 1848 RPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDN 1669 RP W M+ RERLR H Q+CDGSI AFTVLHNVNCNHG IYVTS+ +LKICQLPS + YDN Sbjct: 954 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTSQVVLKICQLPSASIYDN 1013 Query: 1668 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSS 1489 YWPVQKIPLK TPHQVTY+AEKNLYPLIVS V KP+NQV+SSLVDQEA QI+N NLSS Sbjct: 1014 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPINQVLSSLVDQEAGQQIDNHNLSS 1073 Query: 1488 
NDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLL 1309 +DL +TY V+EFE++ILEPE+SGGPW+T+ATIPMQSSE+ALTVRVVTL NA+T ENETLL Sbjct: 1074 DDLQRTYTVEEFEIQILEPERSGGPWETKATIPMQSSEHALTVRVVTLLNASTGENETLL 1133 Query: 1308 AIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVAS 1129 A+GTAY+QGEDVA RGRVLLFS G+N DN+QN+V+EV+S+E KGAISA+AS+QGHLL++S Sbjct: 1134 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1193 Query: 1128 GPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSL 949 GPKI L+KWN TEL VAFFD PPL+VVS+N+VK FILLGD+HKSIYFLSWKEQG QLSL Sbjct: 1194 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKTFILLGDVHKSIYFLSWKEQGSQLSL 1253 Query: 948 LAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVG 769 LAKD+ SLDCFATEFLIDG+TLSL VSD+QKN+Q+FYYAPK +ESWKGQKLLSRAEFHVG Sbjct: 1254 LAKDFGSLDCFATEFLIDGNTLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHVG 1313 Query: 768 AHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQS 589 +HVTKF RLQM+ DKTNRFALLF TLDGS GCIAPLD +TFRRLQS Sbjct: 1314 SHVTKFLRLQMV-----------TSGADKTNRFALLFGTLDGSFGCIAPLDEVTFRRLQS 1362 Query: 588 LQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQ 409 LQ+KLVDAV HVAGLNP SFRQF++ GKA + GPD+I+DCELLCHYEMLPLEEQLE+AHQ Sbjct: 1363 LQKKLVDAVPHVAGLNPHSFRQFRTSGKARRSGPDSIIDCELLCHYEMLPLEEQLELAHQ 1422 Query: 408 IGTTRSQIISNLNDLSLGTSFL 343 IGTTRS I+ NL +LS+GTSFL Sbjct: 1423 IGTTRSVILLNLVELSVGTSFL 1444 >ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Solanum lycopersicum] Length = 1447 Score = 2170 bits (5624), Expect = 0.0 Identities = 1082/1461 (74%), Positives = 1254/1461 (85%), Gaps = 5/1461 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531 MS+AA K MH PTGIENCASGFITH +AD PQI QT D++SDWPA + IGPVPNL++ Sbjct: 1 MSFAACKTMHCPTGIENCASGFITHSAADITPQI---QTADVDSDWPATKPIGPVPNLVV 57 Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351 +AGNVL+VY+IRV++ RD+ + KRGG+VAGIS A+LELVC YRLHGN+ +M V++ Sbjct: 58 
SAGNVLDVYLIRVEQASSRDA--AEVVKRGGLVAGISAASLELVCTYRLHGNIYSMGVIT 115 Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171 GG DG +RRDSIIL+F+DAK+SVLEFDD+ HGLRTSSMH FEGP+W HLKRGRESF +G Sbjct: 116 AGGADGGKRRDSIILSFEDAKMSVLEFDDATHGLRTSSMHFFEGPDWFHLKRGRESFDKG 175 Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991 P++K DPQGRCAGV + QMI+LK A+ S L G+D A ++GGA SAR+ESSYII+LRD Sbjct: 176 PIIKVDPQGRCAGVFAFEQQMIVLKAAEVNSSLAGEDSAFSAGGA-SARIESSYIITLRD 234 Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811 L ++HVKDF F+HGYIEPVMVILHERELTWSGR+SWKHHTC +SA SISTTLKQHPLIWS Sbjct: 235 LDVRHVKDFTFLHGYIEPVMVILHERELTWSGRVSWKHHTCMVSAFSISTTLKQHPLIWS 294 Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631 A NLPHDAYKLLAVPSPIGGVLVI ANTIHYHSQS+SC+LA+NNF D+SQ+MPRSSI Sbjct: 295 ATNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSSSCSLALNNFVFFGDNSQEMPRSSI 354 Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451 +VELD ANA WL++DVAMLSTKTGELLLLT++YDGR+V +LDL+KSRASVLTSGITTIG Sbjct: 355 NVELDAANATWLTSDVAMLSTKTGELLLLTIIYDGRIVQKLDLSKSRASVLTSGITTIGD 414 Query: 3450 SLFFLGSRLGDSLLVQYTCGVGASS---GVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280 SLFFLGSRLGDSLLVQ++ G+G S+ GV+EEVGDIE DAPSAKRLR SSSDALQD++N Sbjct: 415 SLFFLGSRLGDSLLVQFSSGLGGSNLPPGVQEEVGDIESDAPSAKRLRMSSSDALQDMIN 474 Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100 GEELSLYG+APNNA+SAQK FSFAVRDS+INVGPLKDFSYG+RINAD NA GIAKQSNYE Sbjct: 475 GEELSLYGTAPNNAQSAQKTFSFAVRDSLINVGPLKDFSYGMRINADLNATGIAKQSNYE 534 Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920 LVCCSGHGKNG+L VLQQSI PE IT+V L GC+GIWTVYHKNTR H ++SS+M+ +DE Sbjct: 535 LVCCSGHGKNGSLSVLQQSIRPETITQVSLPGCKGIWTVYHKNTRIHLSESSRMADEEDE 594 Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740 YHAYLIISLE+RTMVL+T + L EVTE+VDYYVQG+T+AAGNLFGRRRV+QVFA G+RIL Sbjct: 595 YHAYLIISLEARTMVLQTANNLEEVTENVDYYVQGTTLAAGNLFGRRRVIQVFAHGARIL 654 Query: 
2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560 DG+FMTQ+LS N ESG S++S V+ SIADPYVLLRMT+GS+QLLVGDPS+C+VS+ Sbjct: 655 DGAFMTQELSFKASNVESGSSSDTSIVASVSIADPYVLLRMTNGSLQLLVGDPSSCSVSL 714 Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380 +PSVFESSKK IS CTLYHDKGPEPWLRK STDAWLS+G+GEAIDGADG DQGD+YC Sbjct: 715 TVPSVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSSGMGEAIDGADGVIQDQGDVYC 774 Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200 VVCYE+GTLEIFDVP+F+CVFSV F+SG+ LVDT +++ S + A + N+E+V Sbjct: 775 VVCYENGTLEIFDVPSFTCVFSVDKFISGRTYLVDTFMQD-SVNGLHAHSKNTEDVIRPG 833 Query: 2199 RKENTENMK--VVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026 +KEN++++K VVE+ M RW G HS PFLFGIL DGTIL YHAY+FEG EN+SK++ ++S Sbjct: 834 QKENSKDVKINVVELMMHRWIGKHSRPFLFGILADGTILSYHAYVFEGSENSSKVDGSVS 893 Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846 QNS++L+ST+ASRLRNLRFVRV +D Y REE P+G QRM V+KN+GG QG+FL+GSR Sbjct: 894 SQNSISLSSTNASRLRNLRFVRVPVDNYAREEMPSGSQLQRMNVYKNIGGSQGIFLTGSR 953 Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666 P+WFMV RERLR+HPQ+CDG IVAFTVLHNVNCNHGLIYVT+ G LKICQLPS SYDNY Sbjct: 954 PSWFMVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTALGTLKICQLPSFLSYDNY 1013 Query: 1665 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSN 1486 WPVQKIPLKGTPHQV YFAEKN+Y +IVSV VLKPLNQV+SS+ DQE Q + DNL Sbjct: 1014 WPVQKIPLKGTPHQVAYFAEKNVYSVIVSVPVLKPLNQVLSSIADQEVGQQFDPDNL--- 1070 Query: 1485 DLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLA 1306 + +Y ++EFEVRILEPEKSGGPW+TRA+IPMQSSENALTVR+VTL+N TKENETLLA Sbjct: 1071 NYEGSYPIEEFEVRILEPEKSGGPWKTRASIPMQSSENALTVRMVTLFNTKTKENETLLA 1130 Query: 1305 IGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASG 1126 +GTAY+QGEDVA RGRVLLFS+ R DN++ LVSEV+SKE KGAI ALASLQGHLL+ASG Sbjct: 1131 VGTAYVQGEDVAARGRVLLFSIDRTADNSRTLVSEVYSKELKGAIPALASLQGHLLIASG 1190 Query: 1125 PKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLL 946 PKI L+KW +EL VAF 
D PPLH VSLNIVKNFILLGDIHKSI F+SWKE QLSLL Sbjct: 1191 PKIILHKWTGSELNGVAFCDYPPLHAVSLNIVKNFILLGDIHKSISFVSWKEP--QLSLL 1248 Query: 945 AKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGA 766 AKD++ LDC ATEFLIDGSTLSL+VSDDQKNVQIFYYAPK SESWKGQKLLSRAEFHVG+ Sbjct: 1249 AKDFSPLDCLATEFLIDGSTLSLVVSDDQKNVQIFYYAPKVSESWKGQKLLSRAEFHVGS 1308 Query: 765 HVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSL 586 +TKF RLQ+L DKTNRFA +F TL+GS+GCIAPLD LTFRRLQSL Sbjct: 1309 RITKFLRLQLL--PTTSERTATTPGSDKTNRFATVFGTLEGSLGCIAPLDELTFRRLQSL 1366 Query: 585 QRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQI 406 Q+KLV AV HVAGLNPRSFRQF+S+GKAH+PGPDNIVDCELL HYEMLPLEEQLEIA QI Sbjct: 1367 QKKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLSHYEMLPLEEQLEIAQQI 1426 Query: 405 GTTRSQIISNLNDLSLGTSFL 343 GTTR QI+SNLND+ LGTSFL Sbjct: 1427 GTTRMQIMSNLNDMILGTSFL 1447 >ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Solanum tuberosum] Length = 1447 Score = 2170 bits (5622), Expect = 0.0 Identities = 1078/1461 (73%), Positives = 1255/1461 (85%), Gaps = 5/1461 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531 MS+AA K MH PTGIENCASGFITH +A+ PQI +T D++SDWPA + +GP+PNL++ Sbjct: 1 MSFAACKTMHCPTGIENCASGFITHSAAEITPQI---RTADVDSDWPATKPVGPMPNLVV 57 Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351 +AGNVLEVY+IR+++ RD+ + KRGG++AGIS A+LELVC YRLHGN+ +M V++ Sbjct: 58 SAGNVLEVYLIRIEQASSRDA--AEVVKRGGLMAGISAASLELVCTYRLHGNIYSMGVIT 115 Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171 GG DG +RRDSIIL+F+DAK+SVLEFDD+ HGLRTSSMH FEGP+WLHLKRGRESF +G Sbjct: 116 AGGADGGKRRDSIILSFEDAKMSVLEFDDATHGLRTSSMHFFEGPDWLHLKRGRESFDKG 175 Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991 P++K DPQGRCAGV + QMI+LK A+ S L G+D A ++GGA SAR+ESSYII+LRD Sbjct: 176 PIIKVDPQGRCAGVFAFEQQMIVLKAAEVNSSLAGEDSAFSAGGA-SARIESSYIITLRD 234 Query: 3990 
LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811 L ++HVKDF F+HGYIEPVMVILHERELTWSGR+SWKHHTC +SA SISTTLKQHPLIWS Sbjct: 235 LDVRHVKDFTFLHGYIEPVMVILHERELTWSGRVSWKHHTCMVSAFSISTTLKQHPLIWS 294 Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631 A NLPHDAYKLLAVPSPIGGVLVI ANTIHYHSQS+SC+LA+NNFA D+SQ+MPRSS Sbjct: 295 AANLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSSSCSLALNNFAFFGDNSQEMPRSSF 354 Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451 +VELD ANA WL++DVAMLSTKTGELLLLT++YDGR+V +LDL+KSRASVLTSGITTIG Sbjct: 355 NVELDAANATWLTSDVAMLSTKTGELLLLTIIYDGRIVQKLDLSKSRASVLTSGITTIGD 414 Query: 3450 SLFFLGSRLGDSLLVQYTCGVGASS---GVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280 SLFFLGSRLGDSLLVQ++CG+G S+ GV+EEVGDIE DAPSAKRLR SSSDALQD++N Sbjct: 415 SLFFLGSRLGDSLLVQFSCGLGGSNLPPGVQEEVGDIESDAPSAKRLRMSSSDALQDMIN 474 Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100 GEELSLYG+APNNA+SAQK FSFAVRDS+INVGPLKDFSYG+RINAD NA GIAKQSNYE Sbjct: 475 GEELSLYGTAPNNAQSAQKTFSFAVRDSLINVGPLKDFSYGMRINADLNATGIAKQSNYE 534 Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920 LVCCSGHGKNG+LCVLQQSI PE IT+ L GC+GIWTVYHKNTR H ++SS+M+ +DE Sbjct: 535 LVCCSGHGKNGSLCVLQQSIRPETITQEALPGCKGIWTVYHKNTRIHLSESSRMADEEDE 594 Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740 YHAYLIISLE+RTMVL+T + L EVTE+VDYYVQG+T+AAGNLFGRRRV+QVFA G+RIL Sbjct: 595 YHAYLIISLEARTMVLQTANNLEEVTENVDYYVQGTTLAAGNLFGRRRVIQVFAHGARIL 654 Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560 DG+FMTQ+LS N ESG S++S V+ SIADPYVLLRMT+GS+QLLVGDPS+C+VS+ Sbjct: 655 DGAFMTQELSFKASNVESGSSSDTSIVASVSIADPYVLLRMTNGSLQLLVGDPSSCSVSL 714 Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380 +PSVFESSKK IS CTLYHDKGPEPWLRK STDAWLS+G+GEAIDGADG DQGD+YC Sbjct: 715 TVPSVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSSGMGEAIDGADGVTQDQGDVYC 774 Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200 
VVCYE+GTLEIFDVPNF+CVFSV F+SG+ LVDT +++ S + A + N+E+V Sbjct: 775 VVCYENGTLEIFDVPNFTCVFSVDKFISGRTYLVDTFMQD-SVNGLHAHSKNTEDVIRPG 833 Query: 2199 RKENTENMK--VVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026 +KEN++++K VVE+ M RW G HS PFLFGIL DGTIL YHAY+FEG EN+SK+E ++S Sbjct: 834 QKENSKDVKINVVELMMHRWIGKHSRPFLFGILADGTILSYHAYVFEGSENSSKVEGSVS 893 Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846 QNS++L+ST+ASRLRNLRFVRV +D Y REE P+G QRM V+KN+GG QG+FL+GSR Sbjct: 894 SQNSISLSSTNASRLRNLRFVRVPVDNYAREEMPSGTQLQRMNVYKNIGGSQGIFLTGSR 953 Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666 P+WFMV RERLR+HPQ+CDG IVAFTVLHNVNCNHGLIYVT+ G LKICQLPS SYDNY Sbjct: 954 PSWFMVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTALGTLKICQLPSFLSYDNY 1013 Query: 1665 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSN 1486 WPVQKIPLKGTPHQV YFAEKN+Y +IVSV VLKPLNQV+S++ DQE Q + DNL Sbjct: 1014 WPVQKIPLKGTPHQVAYFAEKNVYSVIVSVPVLKPLNQVLSTIADQEVGQQFDPDNL--- 1070 Query: 1485 DLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLA 1306 + +Y ++EFEVRI+EPEKSGG W+TRA+IPMQSSENALTVR+VTL N TT+ENETLLA Sbjct: 1071 NYEGSYPIEEFEVRIVEPEKSGGLWKTRASIPMQSSENALTVRMVTLLNTTTRENETLLA 1130 Query: 1305 IGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASG 1126 +GTAY+QGEDVA RGRVLLFS+ R DN++ LVSEV+SKE KGAI ALASLQGHLL+ASG Sbjct: 1131 VGTAYVQGEDVAARGRVLLFSIDRTADNSRTLVSEVYSKELKGAIPALASLQGHLLIASG 1190 Query: 1125 PKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLL 946 PKI L+KW +EL VAF D PPLH VSLNIVKNFILLGDIHKSI F+SWKE QLSLL Sbjct: 1191 PKIILHKWTGSELNGVAFCDYPPLHAVSLNIVKNFILLGDIHKSISFVSWKEP--QLSLL 1248 Query: 945 AKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGA 766 AKD++ LDC ATEFLIDGSTLSL+VSDDQKNVQIFYYAPK SESWKGQKLLSRAEFHVG+ Sbjct: 1249 AKDFSPLDCLATEFLIDGSTLSLVVSDDQKNVQIFYYAPKVSESWKGQKLLSRAEFHVGS 1308 Query: 765 HVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSL 586 +TKF RLQ+L DKTNRFA +F TL+GS+GCIAPLD LTFRRLQSL Sbjct: 1309 
RITKFLRLQLL--PTTSERTATTPGSDKTNRFATVFGTLEGSLGCIAPLDELTFRRLQSL 1366 Query: 585 QRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQI 406 Q+KLV AV HVAGLNPRSFRQF+S+GKAH+PGPDNIVDCELL HYEMLPLEEQLEIA QI Sbjct: 1367 QKKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLSHYEMLPLEEQLEIAQQI 1426 Query: 405 GTTRSQIISNLNDLSLGTSFL 343 GTTR QI+SNLND+ LGTSFL Sbjct: 1427 GTTRMQIMSNLNDMILGTSFL 1447 >ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1 [Arabidopsis thaliana] gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit 1; AltName: Full=Cleavage and polyadenylation specificity factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160 kDa subunit gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1 [Arabidopsis thaliana] Length = 1442 Score = 2163 bits (5605), Expect = 0.0 Identities = 1066/1462 (72%), Positives = 1247/1462 (85%), Gaps = 6/1462 (0%) Frame = -2 Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQT-DDLESDWP-ARRGIGPVPNL 4537 MS+AAYKMMHWPTG+ENCASG+ITH +D QIP + DD+E++WP +RGIGP+PN+ Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60 Query: 4536 IITAGNVLEVYIIRVQEE-DVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMA 4360 +ITA N+LEVYI+R QEE + ++ R AKRGGV+ G+ G +LELVCHYRLHGNVE++A Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120 Query: 4359 VLSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESF 4180 VL +GGG+ + RDSIIL F+DAKISVLEFDDS+H LR +SMH FEGP+WLHLKRGRESF Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180 Query: 4179 ARGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIIS 4000 RGPLVK DPQGRC GVLVYGLQMIILKT+Q GSGLVGDDDA +SGG VSARVESSYII+ Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240 Query: 3999 LRDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPL 3820 LRDL MKHVKDF+F+HGYIEPV+VIL E E TW+GR+SWKHHTC +SALSI++TLKQHP+ Sbjct: 241 
LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300 Query: 3819 IWSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPR 3640 IWSAINLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+A +ADSSQ++P Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360 Query: 3639 SSISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITT 3460 S+ SVELD A+ W+SNDVA+LSTK+GELLLLTL+YDGR V RLDL+KS+ASVL S IT+ Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420 Query: 3459 IGSSLFFLGSRLGDSLLVQYTCGVGASS---GVKEEVGDIEGDAPSAKRLRRSSSDALQD 3289 +G+SLFFLGSRLGDSLLVQ++C G ++ G+++E DIEG+ AKRLR +S D QD Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479 Query: 3288 IVNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQS 3109 + EELSL+GS PNN++SAQK+FSFAVRDS++NVGP+KDF+YGLRINAD NA G++KQS Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539 Query: 3108 NYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAV 2929 NYELVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK++RGHNADSSKM+A Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599 Query: 2928 DDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGS 2749 +DEYHAYLIISLE+RTMVLET D+L EVTESVDYYVQG TIAAGNLFGRRRV+QVF G+ Sbjct: 600 EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659 Query: 2748 RILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCT 2569 RILDGSFM Q+LS G N+ES GSESSTVS SIADPYVLLRMTD SI+LLVGDPSTCT Sbjct: 660 RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719 Query: 2568 VSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGD 2389 VSI+ PSV E SK+ IS CTLYHDKGPEPWLRKASTDAWLS+GVGEA+D DG P DQGD Sbjct: 720 VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779 Query: 2388 IYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVA 2209 IYCVVCYESG LEIFDVP+F+CVFSV F SG+ +L D + E + N NSE+ Sbjct: 780 IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHELEYE----LNKNSEDNT 835 Query: 2208 
GQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAI 2029 +NT +VVE+ MQRWSG H+ PFLF +L DGTILCYHAYLF+G ++T K E ++ Sbjct: 836 SSKEIKNT---RVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891 Query: 2028 SGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGS 1849 S +N LNS+ +S+LRNL+F+R+ LDT TRE T G+ SQR+T+FKN+ G+QG FLSGS Sbjct: 892 SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951 Query: 1848 RPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDN 1669 RP W M+ RERLR H Q+CDGSI AFTVLHNVNCNHG IYVT++G+LKICQLPS + YDN Sbjct: 952 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011 Query: 1668 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSS 1489 YWPVQKIPLK TPHQVTY+AEKNLYPLIVS V KPLNQV+SSLVDQEA Q++N N+SS Sbjct: 1012 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSS 1071 Query: 1488 NDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLL 1309 +DL +TY V+EFE++ILEPE+SGGPW+T+A IPMQ+SE+ALTVRVVTL NA+T ENETLL Sbjct: 1072 DDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLL 1131 Query: 1308 AIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVAS 1129 A+GTAY+QGEDVA RGRVLLFS G+N DN+QN+V+EV+S+E KGAISA+AS+QGHLL++S Sbjct: 1132 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1191 Query: 1128 GPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSL 949 GPKI L+KWN TEL VAFFD PPL+VVS+N+VK+FILLGD+HKSIYFLSWKEQG QLSL Sbjct: 1192 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSL 1251 Query: 948 LAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVG 769 LAKD+ SLDCFATEFLIDGSTLSL VSD+QKN+Q+FYYAPK ESWKG KLLSRAEFHVG Sbjct: 1252 LAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVG 1311 Query: 768 AHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQS 589 AHV+KF RLQM+ DK NRFALLF TLDGS GCIAPLD +TFRRLQS Sbjct: 1312 AHVSKFLRLQMV-----------SSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQS 1360 Query: 588 LQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQ 409 LQ+KLVDAV HVAGLNP 
+FRQF+S GKA + GPD+IVDCELLCHYEMLPLEEQLE+AHQ Sbjct: 1361 LQKKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQ 1420 Query: 408 IGTTRSQIISNLNDLSLGTSFL 343 IGTTR I+ +L DLS+GTSFL Sbjct: 1421 IGTTRYSILKDLVDLSVGTSFL 1442