BLASTX nr result
ID: Akebia23_contig00003492
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia23_contig00003492 (4895 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation spec... 2379 0.0 ref|XP_007220310.1| hypothetical protein PRUPE_ppa000211mg [Prun... 2375 0.0 emb|CBI24510.3| unnamed protein product [Vitis vinifera] 2372 0.0 ref|XP_007038473.1| Cleavage and polyadenylation specificity fac... 2341 0.0 ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citr... 2321 0.0 ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation spec... 2321 0.0 ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citr... 2317 0.0 ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation spec... 2316 0.0 gb|EXC20897.1| Cleavage and polyadenylation specificity factor s... 2286 0.0 ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation spec... 2283 0.0 ref|XP_002318462.2| cleavage and polyadenylation specificity fac... 2266 0.0 ref|XP_002510905.1| cleavage and polyadenylation specificity fac... 2262 0.0 ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation spec... 2260 0.0 ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation spec... 2249 0.0 ref|XP_007152397.1| hypothetical protein PHAVU_004G126600g [Phas... 2236 0.0 ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation spec... 2227 0.0 ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation spec... 2145 0.0 ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation spec... 2142 0.0 ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation spec... 2140 0.0 ref|XP_006851814.1| hypothetical protein AMTR_s00041p00037520 [A... 2130 0.0 >ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Vitis vinifera] Length = 1442 Score = 2379 bits (6165), Expect = 0.0 Identities = 1171/1454 (80%), Positives = 1302/1454 (89%), Gaps = 3/1454 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203 MSYAA+KMMHWPTGIENCASGF++H +DFAPQI I +DL+S+WP ++ IGP+PNL+V Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60 Query: 204 TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383 TA NILEVY++RVQE+ D R SR+ +E KRGG MAG+SGA+LELVC Y+LHGNVETM VL Sbjct: 61 TAANILEVYMVRVQED-DSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVL 119 Query: 384 SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563 G D +RRDSIILAFQDAKISVLEFDDSI+GLRTSSMHCFEGPEW HLKRG ESFAR Sbjct: 120 PSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFAR 179 Query: 564 GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743 GPLVKVDPQGRC GVLVYGLQMIILK +ESSYVISLR Sbjct: 180 GPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLR 239 Query: 744 DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923 DLDMKHVKDF+FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW Sbjct: 240 DLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299 Query: 924 SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103 SAVNLPHDAYKL+ VPSPIGGV+VI ANSIHYHSQS SCALA+N+YAV+ DNSQE+PRSS Sbjct: 300 SAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSS 359 Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283 F+VELDAANATWLS+DV MLSTKTGELLLLT+ YDGRVV RL+LSKS+ASVLTSG+ IG Sbjct: 360 FSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIG 419 Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSVPKEEVGDIEGDVPSAKRLRMASSDALQDIVSGEE 1463 NS FLGSRLGDS+LVQFT + +SV KEEVGDIEGDVPSAKRLR +SSDALQD+V+GEE Sbjct: 420 NSLFFLGSRLGDSLLVQFTSILSSSV-KEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEE 478 Query: 1464 LSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVC 1643 LSLY SAPN++E +QKTFSF+VRDS IN+GPLKDF+YGLRINADP ATGIAKQSNYELVC Sbjct: 479 LSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYELVC 538 Query: 1644 CSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDDEYHA 1823 CSGHGKNGALC+LQQSIRPE+ITEVELPGCKGIWTVYHK+ RGHNADS+KM + DDEYHA Sbjct: 539 CSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDEYHA 598 Query: 1824 FLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARVLDGS 2003 +LIISLESRTMVLETADLLGEVTESVDYYVQG TI+AGNLFGRRRV+QV+ARGAR+LDG+ Sbjct: 599 YLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARILDGA 658 Query: 2004 YMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVSISIP 2183 +MTQDL ++ PYVLLRM+DG+IQLLVGDPSTCTVSI+IP Sbjct: 659 FMTQDLPISESST----------VLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSINIP 708 Query: 2184 PAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIYCVVC 2363 FESSK SI+ACTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADGA DQGDIYCVV Sbjct: 709 AVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYCVVS 768 Query: 2364 YESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQN--SKN-EVVAGQVKKE 2534 YESG LEIFDVP+F CVFSV+KF+SG HLVDT I E ++D+Q SKN E A Q +KE Sbjct: 769 YESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKE 828 Query: 2535 TAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSGQNTV 2714 A N+K+VELAMQRW+G+H+RP+LFG+LTDGT+LCYHAYLYEG EST K EE+VS QN++ Sbjct: 829 NAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSL 888 Query: 2715 DLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRPAWFM 2894 +S++S SRLRNLRFVRVPL+TYTREE + + RMTVFKN+GG QGLFLSGSRP WFM Sbjct: 889 SISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFM 948 Query: 2895 VCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYWPVQK 3074 V RER+RVHPQLCDGSIVAF VLHN+NCNHGLIYVTS+GFLKICQLP+V SYDNYWPVQK Sbjct: 949 VFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQK 1008 Query: 3075 IPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDELHKT 3254 IPLKGTPHQVTYFAEKNLYPLIVSVPV++PLN VLSSLVDQ+AGHQ+E+DN S DELH++ Sbjct: 1009 IPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRS 1068 Query: 3255 YTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAIGTAY 3434 Y+VDE+EVR+LEPEKSGAPWQTRATIPMQ+SENALTVR+VTLFNTT+KENETLLAIGTAY Sbjct: 1069 YSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY 1128 Query: 3435 VQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGPKVIL 3614 VQGEDVAARGRVLLFSVG+NTDNSQNLVSE+YSKELKGAISA+ASLQGHLLIASGPK+IL Sbjct: 1129 VQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIIL 1188 Query: 3615 HKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLAKDFG 3794 HKWTG EL GVAF+D PPLYVVSLNIVKNFIL+GDIH+SIYFLSWKEQGAQL+LLAKDFG Sbjct: 1189 HKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFG 1248 Query: 3795 NLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 3974 +LDCF+TEFLIDGSTLSLIVSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF Sbjct: 1249 SLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1308 Query: 3975 LRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKKLVDS 4154 LRLQMLP SSDRT AT GSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQKKLVD+ Sbjct: 1309 LRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDA 1368 Query: 4155 VPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTTRTQI 4334 VPH+AGLNPR+FR FRSNGKAHRPGPDN++DCELL HYEMLP E+QLE+A QIGTTR QI Sbjct: 1369 VPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQI 1428 Query: 4335 LTNLNDLSLGTSFL 4376 L+NLNDLSLGTSFL Sbjct: 1429 LSNLNDLSLGTSFL 1442 >ref|XP_007220310.1| hypothetical protein PRUPE_ppa000211mg [Prunus persica] gi|462416772|gb|EMJ21509.1| hypothetical protein PRUPE_ppa000211mg [Prunus persica] Length = 1459 Score = 2375 bits (6156), Expect = 0.0 Identities = 1158/1459 (79%), Positives = 1294/1459 (88%), Gaps = 8/1459 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWP-ARKGIGPIPNLV 200 MS+AA+KMMHWPTGIENCASGFISH SDF P+IP I EDL+S+WP +R+ IGPIP+LV Sbjct: 1 MSFAAYKMMHWPTGIENCASGFISHSRSDFVPRIPPIQTEDLESEWPTSRREIGPIPDLV 60 Query: 201 VTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAV 380 VTAGN+LEVYV+RVQEE+ R R+ E KRGG M G+SGASLELVCHY+LHGNV TMAV Sbjct: 61 VTAGNVLEVYVVRVQEEDGTRGPRASGEPKRGGLMDGVSGASLELVCHYRLHGNVVTMAV 120 Query: 381 LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560 LS G DG +RRDSIIL F+DAKISVLEFDDSI+GLRTSSMHCFEGPEW HL+RGRESFA Sbjct: 121 LSSGGGDGSRRRDSIILTFEDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLRRGRESFA 180 Query: 561 RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740 RGPLVKVDPQGRCG +LVYGLQMIILK IESSY+++L Sbjct: 181 RGPLVKVDPQGRCGSILVYGLQMIILKASQGGSGLVGDDDSFGSGGAISSRIESSYIVNL 240 Query: 741 RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920 RD+DMKHVKDF+F+HGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLI Sbjct: 241 RDMDMKHVKDFTFLHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 300 Query: 921 WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100 WSAVNLPHDAYKL++VPSPIGGVLVI ANSIHYHSQS SCALA+NSYAV+ DNSQE+PRS Sbjct: 301 WSAVNLPHDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSYAVSADNSQEMPRS 360 Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280 SF VELD ANATWL +DV +LSTKTGELLLLT+VYDGRVVQRL+LSKSKASVLTSG+T + Sbjct: 361 SFTVELDTANATWLLNDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITKV 420 Query: 1281 GNSFIFLGSRLGDSMLVQFTCGMGTSV----PKEEVGDIEGDVPSAKRLRMASSDALQDI 1448 GNS FLGSRLGDS+LVQFTCG+G SV K+EVGDIEGD P AKRLRM+SSDALQD+ Sbjct: 421 GNSLFFLGSRLGDSLLVQFTCGVGGSVLSSDMKDEVGDIEGDAPLAKRLRMSSSDALQDM 480 Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628 VSGEELSLY SAPN +E AQK+FSFAVRDSLIN+GPLKDFSYGLRINAD NATGIAKQSN Sbjct: 481 VSGEELSLYGSAPNNAESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 540 Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808 YELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHK+ARGHNADSSK+ + D Sbjct: 541 YELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKNARGHNADSSKIAASD 600 Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988 DE+HA+LIISLE+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRV+QV+ RGAR Sbjct: 601 DEFHAYLIISLEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVVQVYERGAR 660 Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168 +LDGS+MTQDLSFG NSE PYVLLRM+DG I+LLVGDPS CTV Sbjct: 661 ILDGSFMTQDLSFGTSNSEMGSGSESSTVLSVSIVDPYVLLRMSDGGIRLLVGDPSLCTV 720 Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348 S SIP AFESSK SI+ACTLYHDKGPEPWLRKTSTDAWLSTG+ EAIDGADG HDQGD+ Sbjct: 721 STSIPAAFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIDEAIDGADGVSHDQGDV 780 Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAG 2519 YCVVCYESG+LEIFDVP+F CVFSV+KF+SG HL+DT +R+ KD Q N +E V+G Sbjct: 781 YCVVCYESGSLEIFDVPNFNCVFSVDKFVSGNAHLIDTLMRDPPKDPQKLINKSSEEVSG 840 Query: 2520 QVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVS 2699 Q +KE QNMK+VELAMQRW+G+H+RP+LFG+L DG +LCYHAYL+EG E+ +K E+S S Sbjct: 841 QGRKENIQNMKVVELAMQRWSGQHSRPFLFGILNDGMILCYHAYLFEGPETASKTEDSAS 900 Query: 2700 GQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSR 2879 QNT +S++S SRLRNLRFVRVPL+TY +++TS + CQRMT+FKN+ GYQGLFLSGSR Sbjct: 901 AQNTTGVSNLSASRLRNLRFVRVPLDTYAKKDTSNETSCQRMTIFKNIAGYQGLFLSGSR 960 Query: 2880 PAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNY 3059 PAWFMV RERLR+HPQLCDGS+VA VLHNVNCNHGLIYVTS+G LKICQLP + SYDNY Sbjct: 961 PAWFMVFRERLRIHPQLCDGSVVAVTVLHNVNCNHGLIYVTSQGILKICQLPPITSYDNY 1020 Query: 3060 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPD 3239 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPV +PLNQVLSSLVDQ+ GHQVE+ N S D Sbjct: 1021 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVLSSLVDQEVGHQVENHNLSSD 1080 Query: 3240 ELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLA 3419 ELH+TY+VDE+E+RI+EP+KSG PWQT+ATIPMQ SENALTVR+VTLFNTT+KENETLLA Sbjct: 1081 ELHRTYSVDEFEIRIMEPDKSGGPWQTKATIPMQTSENALTVRVVTLFNTTTKENETLLA 1140 Query: 3420 IGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASG 3599 IGTAYVQGEDVA RGRVLLFS G++ DN+Q LVSEVYSKELKGAISALASLQGHLLIASG Sbjct: 1141 IGTAYVQGEDVAGRGRVLLFSAGKSADNTQTLVSEVYSKELKGAISALASLQGHLLIASG 1200 Query: 3600 PKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLL 3779 PK+ILHKW G EL GVAF+D PPLYVVSLNIVKNFIL+GD+HKSIYFLSWKEQGAQL+LL Sbjct: 1201 PKIILHKWNGTELNGVAFFDVPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLTLL 1260 Query: 3780 AKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGA 3959 AKDFGNLDCF+TEFLIDGSTLSL+V+DEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVG Sbjct: 1261 AKDFGNLDCFATEFLIDGSTLSLVVADEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGT 1320 Query: 3960 HVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQK 4139 HVTKFLRLQML TSSDRTG PGSDKTNR+ALLFGTLDGS+GCIAPLDELTFRRLQ+LQK Sbjct: 1321 HVTKFLRLQMLSTSSDRTGTNPGSDKTNRYALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1380 Query: 4140 KLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGT 4319 KLVD+V H+AGLNPRAFR F+SNGKAHRPGPD ++DCELL HYEMLPLE+QLE+A+QIGT Sbjct: 1381 KLVDAVHHVAGLNPRAFRQFQSNGKAHRPGPDTIVDCELLSHYEMLPLEEQLEIANQIGT 1440 Query: 4320 TRTQILTNLNDLSLGTSFL 4376 TR+QI +NLNDLS+GTSFL Sbjct: 1441 TRSQIFSNLNDLSIGTSFL 1459 >emb|CBI24510.3| unnamed protein product [Vitis vinifera] Length = 1448 Score = 2372 bits (6148), Expect = 0.0 Identities = 1171/1460 (80%), Positives = 1302/1460 (89%), Gaps = 9/1460 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203 MSYAA+KMMHWPTGIENCASGF++H +DFAPQI I +DL+S+WP ++ IGP+PNL+V Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60 Query: 204 TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383 TA NILEVY++RVQE+ D R SR+ +E KRGG MAG+SGA+LELVC Y+LHGNVETM VL Sbjct: 61 TAANILEVYMVRVQED-DSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVL 119 Query: 384 SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563 G D +RRDSIILAFQDAKISVLEFDDSI+GLRTSSMHCFEGPEW HLKRG ESFAR Sbjct: 120 PSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFAR 179 Query: 564 GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743 GPLVKVDPQGRC GVLVYGLQMIILK +ESSYVISLR Sbjct: 180 GPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLR 239 Query: 744 DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923 DLDMKHVKDF+FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW Sbjct: 240 DLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299 Query: 924 SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103 SAVNLPHDAYKL+ VPSPIGGV+VI ANSIHYHSQS SCALA+N+YAV+ DNSQE+PRSS Sbjct: 300 SAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSS 359 Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283 F+VELDAANATWLS+DV MLSTKTGELLLLT+ YDGRVV RL+LSKS+ASVLTSG+ IG Sbjct: 360 FSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIG 419 Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSVPKEEVGDIEGDVPSAKRLRMASSDALQDIVSGEE 1463 NS FLGSRLGDS+LVQFT + +SV KEEVGDIEGDVPSAKRLR +SSDALQD+V+GEE Sbjct: 420 NSLFFLGSRLGDSLLVQFTSILSSSV-KEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEE 478 Query: 1464 LSLYSSAPNTSELAQ------KTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQS 1625 LSLY SAPN++E +Q KTFSF+VRDS IN+GPLKDF+YGLRINADP ATGIAKQS Sbjct: 479 LSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQS 538 Query: 1626 NYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSL 1805 NYELVCCSGHGKNGALC+LQQSIRPE+ITEVELPGCKGIWTVYHK+ RGHNADS+KM + Sbjct: 539 NYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATK 598 Query: 1806 DDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGA 1985 DDEYHA+LIISLESRTMVLETADLLGEVTESVDYYVQG TI+AGNLFGRRRV+QV+ARGA Sbjct: 599 DDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGA 658 Query: 1986 RVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCT 2165 R+LDG++MTQDL ++ PYVLLRM+DG+IQLLVGDPSTCT Sbjct: 659 RILDGAFMTQDLPISESST----------VLSVSIADPYVLLRMSDGNIQLLVGDPSTCT 708 Query: 2166 VSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGD 2345 VSI+IP FESSK SI+ACTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADGA DQGD Sbjct: 709 VSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGD 768 Query: 2346 IYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQN--SKN-EVVA 2516 IYCVV YESG LEIFDVP+F CVFSV+KF+SG HLVDT I E ++D+Q SKN E A Sbjct: 769 IYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEA 828 Query: 2517 GQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESV 2696 Q +KE A N+K+VELAMQRW+G+H+RP+LFG+LTDGT+LCYHAYLYEG EST K EE+V Sbjct: 829 DQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAV 888 Query: 2697 SGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGS 2876 S QN++ +S++S SRLRNLRFVRVPL+TYTREE + + RMTVFKN+GG QGLFLSGS Sbjct: 889 SAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGS 948 Query: 2877 RPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDN 3056 RP WFMV RER+RVHPQLCDGSIVAF VLHN+NCNHGLIYVTS+GFLKICQLP+V SYDN Sbjct: 949 RPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDN 1008 Query: 3057 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSP 3236 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPV++PLN VLSSLVDQ+AGHQ+E+DN S Sbjct: 1009 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSS 1068 Query: 3237 DELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLL 3416 DELH++Y+VDE+EVR+LEPEKSGAPWQTRATIPMQ+SENALTVR+VTLFNTT+KENETLL Sbjct: 1069 DELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1128 Query: 3417 AIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIAS 3596 AIGTAYVQGEDVAARGRVLLFSVG+NTDNSQNLVSE+YSKELKGAISA+ASLQGHLLIAS Sbjct: 1129 AIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIAS 1188 Query: 3597 GPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSL 3776 GPK+ILHKWTG EL GVAF+D PPLYVVSLNIVKNFIL+GDIH+SIYFLSWKEQGAQL+L Sbjct: 1189 GPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNL 1248 Query: 3777 LAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVG 3956 LAKDFG+LDCF+TEFLIDGSTLSLIVSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVG Sbjct: 1249 LAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1308 Query: 3957 AHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQ 4136 AHVTKFLRLQMLP SSDRT AT GSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQ Sbjct: 1309 AHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1368 Query: 4137 KKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIG 4316 KKLVD+VPH+AGLNPR+FR FRSNGKAHRPGPDN++DCELL HYEMLP E+QLE+A QIG Sbjct: 1369 KKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIG 1428 Query: 4317 TTRTQILTNLNDLSLGTSFL 4376 TTR QIL+NLNDLSLGTSFL Sbjct: 1429 TTRMQILSNLNDLSLGTSFL 1448 >ref|XP_007038473.1| Cleavage and polyadenylation specificity factor 160 isoform 1 [Theobroma cacao] gi|508775718|gb|EOY22974.1| Cleavage and polyadenylation specificity factor 160 isoform 1 [Theobroma cacao] Length = 1457 Score = 2342 bits (6068), Expect = 0.0 Identities = 1142/1458 (78%), Positives = 1290/1458 (88%), Gaps = 7/1458 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203 MSYAA+KMMHWPTGIENCASGF++HC +DF PQIP EDL+S+WPAR+GIGP+PNL+V Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHCRADFTPQIPLNQTEDLESEWPARRGIGPVPNLIV 60 Query: 204 TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383 TA N+LE+YV+RVQEE R +R+ +E KRGG + G+SG SLELVC+Y+LHGNVE+MAVL Sbjct: 61 TAANLLEIYVVRVQEE-GRREARNSTEVKRGGVLDGVSGVSLELVCNYRLHGNVESMAVL 119 Query: 384 SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563 S+ G DG +RRDSIILAF+DAKISVLEFDDSI+GLRT+SMHCFEGPEW HLKRGRESFAR Sbjct: 120 SIGGGDGSRRRDSIILAFKDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFAR 179 Query: 564 GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743 GPLVKVDPQGRCGGVLVY LQMIILK +ESSY+I+LR Sbjct: 180 GPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINLR 239 Query: 744 DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923 DLD+KH+KDF FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW Sbjct: 240 DLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299 Query: 924 SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103 SAVNLPHDAYKL++VPSPIGGVLVI AN+IHYHSQS SCALA+N+YA++VDNSQ+LPRS+ Sbjct: 300 SAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRSN 359 Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283 F+VELDAANATWL +DV +LSTKTGELLLLT++YDGRVVQRL+LSKSKASVLTS +TTIG Sbjct: 360 FSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTIG 419 Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDIV 1451 NS FLGSRLGDS+LVQF+ G G S KEEVGDIEGDVP AKRLR +SSDALQD+V Sbjct: 420 NSLFFLGSRLGDSLLVQFSGGSGVSALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDMV 479 Query: 1452 SGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNY 1631 GEELSLY SAPN +E AQKTF FAVRDSL N+GPLKDFSYGLRINAD NATGIAKQSNY Sbjct: 480 GGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSNY 539 Query: 1632 ELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDD 1811 ELVCCSGHGKNGALCVL+QSIRPE+ITEVEL GCKGIWTVYHKS R H+AD SK+ DD Sbjct: 540 ELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDDDD 599 Query: 1812 EYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARV 1991 EYHA+LIISLE+RTMVLETADLL EVTESVDYYVQG TIAAGNLFGRRRV+QV+ RGAR+ Sbjct: 600 EYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGARI 659 Query: 1992 LDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVS 2171 LDGS+MTQ+LS +PNSE PYVLLRMTDGSI LLVGDP+TCTVS Sbjct: 660 LDGSFMTQELSIPSPNSESSPGSENSTVISVSIADPYVLLRMTDGSILLLVGDPATCTVS 719 Query: 2172 ISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIY 2351 I+ P AFE SK ++ACTLYHDKGPEPWLRK STDAWLSTG+GE+IDGADG HDQGDIY Sbjct: 720 INTPTAFEGSKKMVSACTLYHDKGPEPWLRKASTDAWLSTGVGESIDGADGGPHDQGDIY 779 Query: 2352 CVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAGQ 2522 CVVCYESG LEIFDVP+F CVFS+EKF SG T LVD + E++KDS+ N +E + GQ Sbjct: 780 CVVCYESGALEIFDVPNFNCVFSMEKFASGRTRLVDAYTLESSKDSEKVINKSSEELTGQ 839 Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702 +KE QN+K+VELAMQRW+ H+RP+LFG+LTDGT+LCYHAYL+EG E+ +K+E+SV Sbjct: 840 GRKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVA 899 Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882 QN+V LS+I+ SRLRNLRF+R+PL+ YTREE S + QR+T+FKN+ GYQG FLSGSRP Sbjct: 900 QNSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRP 959 Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062 AWFMV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTS+G LKICQ+PS +YDNYW Sbjct: 960 AWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYW 1019 Query: 3063 PVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDE 3242 PVQKIPL+GTPHQVTYFAE+NLYP+IVSVPV +P+NQVLSSLVDQ+ GHQ+++ N S DE Sbjct: 1020 PVQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDE 1079 Query: 3243 LHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAI 3422 L +TYTVDE+EVRILEPEKSG PW+T+ATIPMQ+SENALTVR+VTLFNTT+KENE+LLAI Sbjct: 1080 LQRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAI 1139 Query: 3423 GTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGP 3602 GTAY+QGEDVAARGRV+L S+GRNTDN QNLVSEVYSKELKGAISALASLQGHLLIASGP Sbjct: 1140 GTAYIQGEDVAARGRVILCSIGRNTDNLQNLVSEVYSKELKGAISALASLQGHLLIASGP 1199 Query: 3603 KVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLA 3782 K+ILH WTG+EL G+AFYD PPLYVVSLNIVKNFIL+GD+HKSIYFLSWKEQGAQLSLLA Sbjct: 1200 KIILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSLLA 1259 Query: 3783 KDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAH 3962 KDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAH Sbjct: 1260 KDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1319 Query: 3963 VTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKK 4142 VTKFLRLQML TSSDRT AT GSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQKK Sbjct: 1320 VTKFLRLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1379 Query: 4143 LVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTT 4322 LVD+VPH+AGLNPR+FR F SNGKAHRPGPD+++DCELL HYEMLPLE+QL++AHQIGTT Sbjct: 1380 LVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTT 1439 Query: 4323 RTQILTNLNDLSLGTSFL 4376 R+QIL+NLNDL+LGTSFL Sbjct: 1440 RSQILSNLNDLTLGTSFL 1457 >ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] gi|557523633|gb|ESR35000.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] Length = 1457 Score = 2321 bits (6016), Expect = 0.0 Identities = 1133/1458 (77%), Positives = 1281/1458 (87%), Gaps = 7/1458 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203 MS+AA+KMMHWPTGI NC SGFI+H +D+ PQIP I E+LDS+ P+++GIGP+PNLVV Sbjct: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60 Query: 204 TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383 TA N++E+YV+RVQEE + S++ E KR M G+S ASLELVCHY+LHGNVE++A+L Sbjct: 61 TAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119 Query: 384 SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563 S G+D +RRDSIILAF+DAKISVLEFDDSI+GLR +SMHCFE PEW HLKRGRESFAR Sbjct: 120 SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179 Query: 564 GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743 GPLVKVDPQGRCGGVLVYGLQMIILK IESS+VI+LR Sbjct: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239 Query: 744 DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923 DLDMKHVKDF FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW Sbjct: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299 Query: 924 SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103 SA+NLPHDAYKL++VPSPIGGVLV+GAN+IHYHSQS SCALA+N+YAV++D+SQELPRSS Sbjct: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359 Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283 F+VELDAA+ATWL +DV +LSTKTG+L+LLTVVYDGRVVQRL+LSK+ SVLTS +TTIG Sbjct: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419 Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSV----PKEEVGDIEGDVPSAKRLRMASSDALQDIV 1451 NS FLGSRLGDS+LVQFTCG GTS+ PKEE GDIE D PS KRLR +SSDALQD+V Sbjct: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGPKEEFGDIEADAPSTKRLRRSSSDALQDMV 479 Query: 1452 SGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNY 1631 +GEELSLY SA N +E AQKTFSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNY Sbjct: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539 Query: 1632 ELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDD 1811 ELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHKS+RGHN DSS+M + DD Sbjct: 540 ELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNTDSSRMAAYDD 599 Query: 1812 EYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARV 1991 EYHA+LIISLE+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRVIQVF RGAR+ Sbjct: 600 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 659 Query: 1992 LDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVS 2171 LDGSYMTQDLSFG NSE PYVLL M+DGSI+LLVGDPSTCTVS Sbjct: 660 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 719 Query: 2172 ISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIY 2351 + P A ESSK ++ACTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG DQGDIY Sbjct: 720 VQTPAAIESSKKPVSACTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 779 Query: 2352 CVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAGQ 2522 VVCYESG LEIFDVP+F CVF+V+KF+SG TH+VDT++RE KDS+ NS +E GQ Sbjct: 780 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 839 Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702 +KE +MK+VELAMQRW+G H+RP+LF +LTDGT+LCY AYL+EG E+T+K ++ VS Sbjct: 840 GRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKSDDPVST 899 Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882 ++ +S++S SRLRNLRF R PL+ YTREET +PCQR+T+FKN+ G+QG FLSGSRP Sbjct: 900 SRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRP 959 Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062 W MV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTS+G LKICQLPS +YDNYW Sbjct: 960 CWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 1019 Query: 3063 PVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDE 3242 PVQKIPLK TPHQ+TYFAEKNLYPLIVSVPV++PLNQVLS L+DQ+ GHQ+++ N S + Sbjct: 1020 PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1079 Query: 3243 LHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAI 3422 LH+TYTV+EYEVRILEP+++G PWQTRATIPMQ+SENALTVR+VTLFNTT+KEN+TLLAI Sbjct: 1080 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENDTLLAI 1139 Query: 3423 GTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGP 3602 GTAYVQGEDVAARGRVLLFS GRN DN QNLV+EVYSKELKGAISALASLQGHLLIASGP Sbjct: 1140 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1199 Query: 3603 KVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLA 3782 K+ILHKWTG EL G+AFYD PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LLA Sbjct: 1200 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1259 Query: 3783 KDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAH 3962 KDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAH Sbjct: 1260 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1319 Query: 3963 VTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKK 4142 VTKFLRLQML TSSDRTGA PGSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQKK Sbjct: 1320 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1379 Query: 4143 LVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTT 4322 LVDSVPH+AGLNPR+FR F SNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ GTT Sbjct: 1380 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1439 Query: 4323 RTQILTNLNDLSLGTSFL 4376 R+QIL+NLNDL+LGTSFL Sbjct: 1440 RSQILSNLNDLALGTSFL 1457 >ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like isoform X2 [Citrus sinensis] Length = 1457 Score = 2321 bits (6014), Expect = 0.0 Identities = 1133/1458 (77%), Positives = 1282/1458 (87%), Gaps = 7/1458 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203 MS+AA+KMMHWPTGI NC SGFI+H +D+ PQIP I E+LDS+ P+++GIGP+PNLVV Sbjct: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60 Query: 204 TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383 TA N++E+YV+RVQEE + S++ E KR M G+S ASLELVCHY+LHGNVE++A+L Sbjct: 61 TAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119 Query: 384 SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563 S G+D +RRDSIILAF+DAKISVLEFDDSI+GLR +SMHCFE PEW HLKRGRESFAR Sbjct: 120 SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179 Query: 564 GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743 GPLVKVDPQGRCGGVLVYGLQMIILK IESS+VI+LR Sbjct: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239 Query: 744 DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923 DLDMKHVKDF FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW Sbjct: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299 Query: 924 SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103 SA+NLPHDAYKL++VPSPIGGVLV+GAN+IHYHSQS SCALA+N+YAV++D+SQELPRSS Sbjct: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359 Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283 F+VELDAA+ATWL +DV +LSTKTG+L+LLTVVYDGRVVQRL+LSK+ SVLTS +TTIG Sbjct: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419 Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDIV 1451 NS FLGSRLGDS+LVQFTCG GTS+ KEE GDIE D PS KRLR +SSDALQD+V Sbjct: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479 Query: 1452 SGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNY 1631 +GEELSLY SA N +E AQKTFSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNY Sbjct: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539 Query: 1632 ELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDD 1811 ELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHKS+RGHNADSS+M + DD Sbjct: 540 ELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 599 Query: 1812 EYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARV 1991 EYHA+LIISLE+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRVIQVF RGAR+ Sbjct: 600 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 659 Query: 1992 LDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVS 2171 LDGSYMTQDLSFG NSE PYVLL M+DGSI+LLVGDPSTCTVS Sbjct: 660 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 719 Query: 2172 ISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIY 2351 + P A ESSK +++CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG DQGDIY Sbjct: 720 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 779 Query: 2352 CVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAGQ 2522 VVCYESG LEIFDVP+F CVF+V+KF+SG TH+VDT++RE KDS+ NS +E GQ Sbjct: 780 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 839 Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702 +KE +MK+VELAMQRW+G H+RP+LF +LTDGT+LCY AYL+EG E+T+K ++ VS Sbjct: 840 GRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVST 899 Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882 ++ +S++S SRLRNLRF R+PL+ YTREET +PCQR+T+FKN+ G+QG FLSGSRP Sbjct: 900 SRSLSVSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRP 959 Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062 W MV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTS+G LKICQLPS +YDNYW Sbjct: 960 CWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 1019 Query: 3063 PVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDE 3242 PVQKIPLK TPHQ+TYFAEKNLYPLIVSVPV++PLNQVLS L+DQ+ GHQ+++ N S + Sbjct: 1020 PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1079 Query: 3243 LHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAI 3422 LH+TYTV+EYEVRILEP+++G PWQTRATIPMQ+SENALTVR+VTLFNTT+KENETLLAI Sbjct: 1080 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1139 Query: 3423 GTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGP 3602 GTAYVQGEDVAARGRVLLFS GRN DN QNLV+EVYSKELKGAISALASLQGHLLIASGP Sbjct: 1140 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1199 Query: 3603 KVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLA 3782 K+ILHKWTG EL G+AFYD PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LLA Sbjct: 1200 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1259 Query: 3783 KDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAH 3962 KDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAH Sbjct: 1260 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1319 Query: 3963 VTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKK 4142 VTKFLRLQML TSSDRTGA PGSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQKK Sbjct: 1320 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1379 Query: 4143 LVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTT 4322 LVDSVPH+AGLNPR+FR F SNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ GTT Sbjct: 1380 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1439 Query: 4323 RTQILTNLNDLSLGTSFL 4376 R+QIL+NLNDL+LGTSFL Sbjct: 1440 RSQILSNLNDLALGTSFL 1457 >ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] gi|557523632|gb|ESR34999.1| hypothetical protein CICLE_v10004147mg [Citrus clementina] Length = 1458 Score = 2317 bits (6004), Expect = 0.0 Identities = 1133/1459 (77%), Positives = 1281/1459 (87%), Gaps = 8/1459 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203 MS+AA+KMMHWPTGI NC SGFI+H +D+ PQIP I E+LDS+ P+++GIGP+PNLVV Sbjct: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60 Query: 204 TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383 TA N++E+YV+RVQEE + S++ E KR M G+S ASLELVCHY+LHGNVE++A+L Sbjct: 61 TAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119 Query: 384 SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563 S G+D +RRDSIILAF+DAKISVLEFDDSI+GLR +SMHCFE PEW HLKRGRESFAR Sbjct: 120 SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179 Query: 564 GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743 GPLVKVDPQGRCGGVLVYGLQMIILK IESS+VI+LR Sbjct: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239 Query: 744 DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923 DLDMKHVKDF FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW Sbjct: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299 Query: 924 SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103 SA+NLPHDAYKL++VPSPIGGVLV+GAN+IHYHSQS SCALA+N+YAV++D+SQELPRSS Sbjct: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359 Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283 F+VELDAA+ATWL +DV +LSTKTG+L+LLTVVYDGRVVQRL+LSK+ SVLTS +TTIG Sbjct: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419 Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSV----PKEEVGDIEGDVPSAKRLRMASSDALQDIV 1451 NS FLGSRLGDS+LVQFTCG GTS+ PKEE GDIE D PS KRLR +SSDALQD+V Sbjct: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGPKEEFGDIEADAPSTKRLRRSSSDALQDMV 479 Query: 1452 SGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNY 1631 +GEELSLY SA N +E AQKTFSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNY Sbjct: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539 Query: 1632 ELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDD 1811 ELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHKS+RGHN DSS+M + DD Sbjct: 540 ELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNTDSSRMAAYDD 599 Query: 1812 EYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARV 1991 EYHA+LIISLE+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRVIQVF RGAR+ Sbjct: 600 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 659 Query: 1992 LDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVS 2171 LDGSYMTQDLSFG NSE PYVLL M+DGSI+LLVGDPSTCTVS Sbjct: 660 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 719 Query: 2172 ISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIY 2351 + P A ESSK ++ACTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG DQGDIY Sbjct: 720 VQTPAAIESSKKPVSACTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 779 Query: 2352 CVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAGQ 2522 VVCYESG LEIFDVP+F CVF+V+KF+SG TH+VDT++RE KDS+ NS +E GQ Sbjct: 780 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 839 Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702 +KE +MK+VELAMQRW+G H+RP+LF +LTDGT+LCY AYL+EG E+T+K ++ VS Sbjct: 840 GRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKSDDPVST 899 Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882 ++ +S++S SRLRNLRF R PL+ YTREET +PCQR+T+FKN+ G+QG FLSGSRP Sbjct: 900 SRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRP 959 Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062 W MV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTS+G LKICQLPS +YDNYW Sbjct: 960 CWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 1019 Query: 3063 PVQK-IPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPD 3239 PVQK IPLK TPHQ+TYFAEKNLYPLIVSVPV++PLNQVLS L+DQ+ GHQ+++ N S Sbjct: 1020 PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1079 Query: 3240 ELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLA 3419 +LH+TYTV+EYEVRILEP+++G PWQTRATIPMQ+SENALTVR+VTLFNTT+KEN+TLLA Sbjct: 1080 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENDTLLA 1139 Query: 3420 IGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASG 3599 IGTAYVQGEDVAARGRVLLFS GRN DN QNLV+EVYSKELKGAISALASLQGHLLIASG Sbjct: 1140 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1199 Query: 3600 PKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLL 3779 PK+ILHKWTG EL G+AFYD PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LL Sbjct: 1200 PKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1259 Query: 3780 AKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGA 3959 AKDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGA Sbjct: 1260 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1319 Query: 3960 HVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQK 4139 HVTKFLRLQML TSSDRTGA PGSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQK Sbjct: 1320 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1379 Query: 4140 KLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGT 4319 KLVDSVPH+AGLNPR+FR F SNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ GT Sbjct: 1380 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1439 Query: 4320 TRTQILTNLNDLSLGTSFL 4376 TR+QIL+NLNDL+LGTSFL Sbjct: 1440 TRSQILSNLNDLALGTSFL 1458 >ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like isoform X1 [Citrus sinensis] Length = 1458 Score = 2316 bits (6002), Expect = 0.0 Identities = 1133/1459 (77%), Positives = 1282/1459 (87%), Gaps = 8/1459 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203 MS+AA+KMMHWPTGI NC SGFI+H +D+ PQIP I E+LDS+ P+++GIGP+PNLVV Sbjct: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60 Query: 204 TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383 TA N++E+YV+RVQEE + S++ E KR M G+S ASLELVCHY+LHGNVE++A+L Sbjct: 61 TAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119 Query: 384 SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563 S G+D +RRDSIILAF+DAKISVLEFDDSI+GLR +SMHCFE PEW HLKRGRESFAR Sbjct: 120 SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179 Query: 564 GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743 GPLVKVDPQGRCGGVLVYGLQMIILK IESS+VI+LR Sbjct: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239 Query: 744 DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923 DLDMKHVKDF FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW Sbjct: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299 Query: 924 SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103 SA+NLPHDAYKL++VPSPIGGVLV+GAN+IHYHSQS SCALA+N+YAV++D+SQELPRSS Sbjct: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359 Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283 F+VELDAA+ATWL +DV +LSTKTG+L+LLTVVYDGRVVQRL+LSK+ SVLTS +TTIG Sbjct: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419 Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDIV 1451 NS FLGSRLGDS+LVQFTCG GTS+ KEE GDIE D PS KRLR +SSDALQD+V Sbjct: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479 Query: 1452 SGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNY 1631 +GEELSLY SA N +E AQKTFSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNY Sbjct: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539 Query: 1632 ELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDD 1811 ELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHKS+RGHNADSS+M + DD Sbjct: 540 ELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 599 Query: 1812 EYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARV 1991 EYHA+LIISLE+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRVIQVF RGAR+ Sbjct: 600 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 659 Query: 1992 LDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVS 2171 LDGSYMTQDLSFG NSE PYVLL M+DGSI+LLVGDPSTCTVS Sbjct: 660 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 719 Query: 2172 ISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIY 2351 + P A ESSK +++CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG DQGDIY Sbjct: 720 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 779 Query: 2352 CVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAGQ 2522 VVCYESG LEIFDVP+F CVF+V+KF+SG TH+VDT++RE KDS+ NS +E GQ Sbjct: 780 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 839 Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702 +KE +MK+VELAMQRW+G H+RP+LF +LTDGT+LCY AYL+EG E+T+K ++ VS Sbjct: 840 GRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVST 899 Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882 ++ +S++S SRLRNLRF R+PL+ YTREET +PCQR+T+FKN+ G+QG FLSGSRP Sbjct: 900 SRSLSVSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRP 959 Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062 W MV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTS+G LKICQLPS +YDNYW Sbjct: 960 CWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 1019 Query: 3063 PVQK-IPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPD 3239 PVQK IPLK TPHQ+TYFAEKNLYPLIVSVPV++PLNQVLS L+DQ+ GHQ+++ N S Sbjct: 1020 PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1079 Query: 3240 ELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLA 3419 +LH+TYTV+EYEVRILEP+++G PWQTRATIPMQ+SENALTVR+VTLFNTT+KENETLLA Sbjct: 1080 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLA 1139 Query: 3420 IGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASG 3599 IGTAYVQGEDVAARGRVLLFS GRN DN QNLV+EVYSKELKGAISALASLQGHLLIASG Sbjct: 1140 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1199 Query: 3600 PKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLL 3779 PK+ILHKWTG EL G+AFYD PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LL Sbjct: 1200 PKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1259 Query: 3780 AKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGA 3959 AKDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGA Sbjct: 1260 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1319 Query: 3960 HVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQK 4139 HVTKFLRLQML TSSDRTGA PGSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQK Sbjct: 1320 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1379 Query: 4140 KLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGT 4319 KLVDSVPH+AGLNPR+FR F SNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ GT Sbjct: 1380 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1439 Query: 4320 TRTQILTNLNDLSLGTSFL 4376 TR+QIL+NLNDL+LGTSFL Sbjct: 1440 TRSQILSNLNDLALGTSFL 1458 >gb|EXC20897.1| Cleavage and polyadenylation specificity factor subunit 1 [Morus notabilis] Length = 1479 Score = 2286 bits (5923), Expect = 0.0 Identities = 1132/1487 (76%), Positives = 1283/1487 (86%), Gaps = 36/1487 (2%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPA-RKGIGPIPNLV 200 MS+AA+KMMHWPTGIENCA+GF+SH +DF P+IP I ++DLDSDWPA R+ GP+PNLV Sbjct: 1 MSFAAYKMMHWPTGIENCAAGFVSHSRADFVPRIPPIQSDDLDSDWPAGRRETGPVPNLV 60 Query: 201 VTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAV 380 VTAGN+LEVYV+R+QEE+D R SR+P+E++RGG M GLSGASLELVCHY+LHGNV+T+AV Sbjct: 61 VTAGNVLEVYVVRLQEEDDTRSSRAPAESRRGGLMDGLSGASLELVCHYRLHGNVQTIAV 120 Query: 381 LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560 LS G DG +RRDSIIL+FQDAKISVLEFDDSI+GLRTSSMHCFEGPEW +LKRGRESFA Sbjct: 121 LSSGGGDGSRRRDSIILSFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWLYLKRGRESFA 180 Query: 561 RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740 RGPLVKVDPQGRC GVL Y +QMI+LK IESSY+I+L Sbjct: 181 RGPLVKVDPQGRCAGVLAYNIQMIMLKAAQAGSGLVGEEDALGSGGAVSARIESSYIINL 240 Query: 741 RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920 RDLDMKH+KDF FVHGYIEPVMVILHERELTWAGR+ WKHHTCMISALSISTTLKQ PLI Sbjct: 241 RDLDMKHIKDFVFVHGYIEPVMVILHERELTWAGRVLWKHHTCMISALSISTTLKQHPLI 300 Query: 921 WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100 WSAVNLPHDAYKL++VPSPIGGVLVI AN++HY SQS SC LA+NSYAV+VD+SQE+ R+ Sbjct: 301 WSAVNLPHDAYKLLAVPSPIGGVLVICANTLHYQSQSNSCTLALNSYAVSVDSSQEMRRA 360 Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280 F+VELDAANATWLS+DVV+LSTK GELLLLT+VYDGRVVQRL+LSKSKASVLTSG+TTI Sbjct: 361 PFSVELDAANATWLSNDVVLLSTKAGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITTI 420 Query: 1281 GNSFIFLGSRLGDSMLVQFTCGMGTSV----PKEEVGDIEGDVPSAKRLRMASSDALQDI 1448 GNS FLGSRLGDS+LVQFT G+GTS+ K+EVGDIEGD AKRLR +SSD LQD+ Sbjct: 421 GNSLFFLGSRLGDSLLVQFTYGLGTSMLSSGLKDEVGDIEGDAHLAKRLRRSSSDVLQDM 480 Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628 SGEELSLY SAPN SE QK+FSF VRDSL+N+GPLKDFSYGLRINADPNATG+AKQSN Sbjct: 481 TSGEELSLYVSAPNNSESTQKSFSFTVRDSLVNVGPLKDFSYGLRINADPNATGVAKQSN 540 Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808 YELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHKS R H DSSK+V+ D Sbjct: 541 YELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSTRSH--DSSKLVAAD 598 Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988 DEYHA+LIISLE+RTMVLETADLL EVTESVDYYVQG TIAAGNLFGRRRV+QV+ RGAR Sbjct: 599 DEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGAR 658 Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168 +LDGS+MTQDLSFG SE PYV+LRM+DGSI+LLVGDP++CTV Sbjct: 659 ILDGSFMTQDLSFGPAPSESSSGSENAVVTSVSIADPYVVLRMSDGSIRLLVGDPTSCTV 718 Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348 S+S P FESSK I+ACTLY DKGPEPWLRKTSTDAWLSTG+ EAIDGAD L DQGDI Sbjct: 719 SVSTPADFESSKSIISACTLYRDKGPEPWLRKTSTDAWLSTGVDEAIDGADETLQDQGDI 778 Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAG 2519 YCVVCYESG+L+I+DVPSF VFSV+ FISG HLVD F++E KD Q N +E AG Sbjct: 779 YCVVCYESGSLDIYDVPSFNYVFSVDNFISGRPHLVDAFVQEQPKDLQKATNKNSEESAG 838 Query: 2520 QVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVS 2699 Q +KE QNMKIVELAMQRW+G+H+RP+L G+LTDG++LCYHAYL+EG EST++ E+SVS Sbjct: 839 QGRKENVQNMKIVELAMQRWSGKHSRPFLLGILTDGSILCYHAYLFEGPESTSRTEDSVS 898 Query: 2700 GQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSR 2879 +N+ S SRLRNLRFVRVPL++Y REETS PCQR++VFKN+ GYQGLFLSGSR Sbjct: 899 SRNS------SGSRLRNLRFVRVPLDSYAREETSDGMPCQRISVFKNIAGYQGLFLSGSR 952 Query: 2880 PAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNY 3059 PAWFMV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTSEG LKICQLPS+ SYDNY Sbjct: 953 PAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSEGILKICQLPSITSYDNY 1012 Query: 3060 WPVQK-IPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSP 3236 WPVQK IPLKGTPHQVTYFAE+NLYPLIVSVPV +PLNQV+SSL+DQ+ GHQ E+ N SP Sbjct: 1013 WPVQKVIPLKGTPHQVTYFAERNLYPLIVSVPVPKPLNQVMSSLLDQEVGHQFENPNLSP 1072 Query: 3237 DELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLL 3416 D+L++TYT+DE+EVRILEPE+SG PWQT+ TIPMQ+SENALT+R+VTLFNTT+ ENETLL Sbjct: 1073 DDLNRTYTIDEFEVRILEPERSGGPWQTKVTIPMQSSENALTIRVVTLFNTTTNENETLL 1132 Query: 3417 AIGTAYVQGEDVAARGRVLLFSVGR---------------------------NTDNSQNL 3515 AIGTAYVQGEDVAARGR++L ++ ++ + Sbjct: 1133 AIGTAYVQGEDVAARGRIILRALAPWWERLHLHPGSRVQIPEMASPSGVFKIDSADFHLQ 1192 Query: 3516 VSEVYSKELKGAISALASLQGHLLIASGPKVILHKWTGAELTGVAFYDTPPLYVVSLNIV 3695 VSE+YSKELKGAISALASLQGHLLIASGPK+ILHKWTG EL G+AF+D PPLYVVSLNIV Sbjct: 1193 VSEIYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFFDAPPLYVVSLNIV 1252 Query: 3696 KNFILVGDIHKSIYFLSWKEQGAQLSLLAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNV 3875 KNFIL+GD+HKSIYFLSWKEQGAQLSLLAKDFG+LDCF+TEFLIDGSTLSL+VSD+QKN+ Sbjct: 1253 KNFILIGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLVVSDDQKNI 1312 Query: 3876 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLPTSSDRTGATPGSDKTNRFAL 4055 QIFYYAPKMSESWKGQ+LLSRAEFHVGAHVTKFLRLQMLPTS+DRTG+TPGSDKTNRFAL Sbjct: 1313 QIFYYAPKMSESWKGQRLLSRAEFHVGAHVTKFLRLQMLPTSTDRTGSTPGSDKTNRFAL 1372 Query: 4056 LFGTLDGSVGCIAPLDELTFRRLQTLQKKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPD 4235 LFG LDGS+GCIAPLDELTFRRLQ+LQKKLVD+VPH+AGLNPR+FR F SNGKAHRPGPD Sbjct: 1373 LFGALDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFCSNGKAHRPGPD 1432 Query: 4236 NMIDCELLYHYEMLPLEDQLEMAHQIGTTRTQILTNLNDLSLGTSFL 4376 +++DCELL HYEMLPLE+QLE+AH IGTTR+QIL+NLNDL LGTSFL Sbjct: 1433 SIVDCELLCHYEMLPLEEQLEIAHLIGTTRSQILSNLNDLFLGTSFL 1479 >ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Fragaria vesca subsp. vesca] Length = 1439 Score = 2283 bits (5917), Expect = 0.0 Identities = 1125/1456 (77%), Positives = 1271/1456 (87%), Gaps = 5/1456 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPA-RKGIGPIPNLV 200 MSYAA KMMHWPTGIENCA+GFI+H +DF P+IP I +DLDSDWPA R+ IGP+PNLV Sbjct: 1 MSYAAHKMMHWPTGIENCAAGFITHSRADFVPRIPQIQTDDLDSDWPAPRREIGPVPNLV 60 Query: 201 VTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAV 380 VTA N+LEVYV+RVQE++ R SR+ E+KRGG M G++GASLELVCHY+LHGNV TMAV Sbjct: 61 VTAANVLEVYVVRVQEQDTARGSRASGESKRGGLMDGVAGASLELVCHYRLHGNVMTMAV 120 Query: 381 LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560 LS G DG KRRD+IIL F+DAKISVLEFDDSI+GLRTSSMHCFEGPEW HL+RGRESFA Sbjct: 121 LSSGGGDGSKRRDAIILTFEDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLRRGRESFA 180 Query: 561 RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740 RGP VKVDPQGRCGGVLVY LQ+IILK +ESSY+ISL Sbjct: 181 RGPSVKVDPQGRCGGVLVYDLQLIILKAAQGGYGLVGDDDGFASGAAISARVESSYIISL 240 Query: 741 RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920 RD+DMKHVKDF+FVHGYIEPV+VILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLI Sbjct: 241 RDMDMKHVKDFTFVHGYIEPVLVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 300 Query: 921 WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100 WSA+NLPHDAYKL++VPSPIGGVLVI ANSIHYHSQS SCALA+NSYA +VD+SQE+PRS Sbjct: 301 WSAINLPHDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSYAGSVDSSQEMPRS 360 Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280 SF VELDAANA+WLS+DV++LSTKTGELLLLT+VYDGRVV RL+LSKSKASVLTSG+ T+ Sbjct: 361 SFTVELDAANASWLSNDVILLSTKTGELLLLTLVYDGRVVHRLDLSKSKASVLTSGIATV 420 Query: 1281 GNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDI 1448 GNS FLGSRLGDS+LVQFT G+G S+ K+EVGDIEGD PSAKRLRM+SSDALQD+ Sbjct: 421 GNSLFFLGSRLGDSLLVQFTSGVGASMLSADLKDEVGDIEGDAPSAKRLRMSSSDALQDM 480 Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628 +SGEELSLY SA N +E AQ++FSFAVRDSL+N+GPLKDFSYGLRINAD NATGIAKQSN Sbjct: 481 ISGEELSLYGSAQNNAESAQRSFSFAVRDSLVNVGPLKDFSYGLRINADANATGIAKQSN 540 Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808 YELVCCSGHGKNGALCVL+QSIRPE+ITEV LPGCKGIWTVYHK+ARGHNA+S D Sbjct: 541 YELVCCSGHGKNGALCVLRQSIRPEMITEVALPGCKGIWTVYHKNARGHNAES-----YD 595 Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988 DEYHAFLIISLE+RTMVLETAD L EVT+ VDY++QG TIAAGNLFGRRRV+Q++ RGAR Sbjct: 596 DEYHAFLIISLEARTMVLETADHLSEVTDKVDYFLQGRTIAAGNLFGRRRVVQIYERGAR 655 Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168 +L+G YMTQDLSFGA NSE PYVLLRM+DG I+LLVGDPS+CTV Sbjct: 656 ILEGYYMTQDLSFGASNSESGSGSESATVLSVSIVDPYVLLRMSDGGIRLLVGDPSSCTV 715 Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348 S+S P AFESSK ++ACTLYHD+GPEPWLRK+STDAWLSTG+ EAIDG LHDQGD+ Sbjct: 716 SVSNPAAFESSKKLVSACTLYHDEGPEPWLRKSSTDAWLSTGIDEAIDGV---LHDQGDV 772 Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEVVAGQVK 2528 YCV+CYESG+LEIFDVP+F CVFSVEKF+SG LVDTF+ + K SQ+S E V+G + Sbjct: 773 YCVICYESGSLEIFDVPNFNCVFSVEKFVSGKPLLVDTFMGDPQK-SQSS--EEVSGLSR 829 Query: 2529 KETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSGQN 2708 KE QNM++VEL MQRW+G+H+RP+LFG+L DG + CYHAYLYE +ST+K E S S QN Sbjct: 830 KEKLQNMRVVELTMQRWSGQHSRPFLFGILNDGMIFCYHAYLYESMDSTSKTEVSASSQN 889 Query: 2709 TVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRPAW 2888 T + SRLRNLRFVRVPL+TY+R + S + CQRMTVFKN+ G QGLFL+GSRPAW Sbjct: 890 T------TASRLRNLRFVRVPLDTYSRNDLSNGTSCQRMTVFKNIAGNQGLFLAGSRPAW 943 Query: 2889 FMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYWPV 3068 MV RER+RVHPQLCDGSIVAF VLHNVNCNHGLIYVTSEG +KICQLPS+ SYDNYWPV Sbjct: 944 LMVFRERIRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSEGIMKICQLPSITSYDNYWPV 1003 Query: 3069 QKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDELH 3248 QKIPLKGTPHQVTYFAEKNLYPLIVS+PV +PLNQVLSSLVDQ+ HQVE+ N SP+ELH Sbjct: 1004 QKIPLKGTPHQVTYFAEKNLYPLIVSIPVQKPLNQVLSSLVDQEFSHQVENHNLSPEELH 1063 Query: 3249 KTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAIGT 3428 +TYTVDE+EVRI+EPEKSG PWQTRATIPMQ SENALTVR+VTLFNTT+KENETLLAIGT Sbjct: 1064 RTYTVDEFEVRIMEPEKSGGPWQTRATIPMQTSENALTVRVVTLFNTTTKENETLLAIGT 1123 Query: 3429 AYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGPKV 3608 AYVQGEDVA RGRVLLFS N DN QNLVSEV+SKELKGAISALASLQG+LLIASGPK+ Sbjct: 1124 AYVQGEDVAGRGRVLLFSAENNVDNPQNLVSEVFSKELKGAISALASLQGNLLIASGPKI 1183 Query: 3609 ILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLAKD 3788 ILHKWTG++LTG+AF+D PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LLAKD Sbjct: 1184 ILHKWTGSDLTGIAFFDVPPLYVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLNLLAKD 1243 Query: 3789 FGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 3968 FGNLDCF+TEFLIDGSTLSL V+D QKN+QI YYAPK+SESW+GQKLL+RAEFHVGAHVT Sbjct: 1244 FGNLDCFATEFLIDGSTLSLAVADAQKNIQILYYAPKISESWRGQKLLTRAEFHVGAHVT 1303 Query: 3969 KFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKKLV 4148 KFLRLQML TSSDRTG PGSDKT R+ALLFGTLDG +G IAPL+ELTFRRLQ+LQ KLV Sbjct: 1304 KFLRLQMLSTSSDRTGKNPGSDKTVRYALLFGTLDGGIGSIAPLEELTFRRLQSLQNKLV 1363 Query: 4149 DSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTTRT 4328 D+VPH+AGLNPR+FR FRSNGKAHRPGPD+++DCELL+HYEML LE+QLE+A QIGTTR Sbjct: 1364 DAVPHVAGLNPRSFRQFRSNGKAHRPGPDSIVDCELLFHYEMLSLEEQLEIAQQIGTTRL 1423 Query: 4329 QILTNLNDLSLGTSFL 4376 QIL+NL+DLSLGTSFL Sbjct: 1424 QILSNLDDLSLGTSFL 1439 >ref|XP_002318462.2| cleavage and polyadenylation specificity factor family protein [Populus trichocarpa] gi|550326263|gb|EEE96682.2| cleavage and polyadenylation specificity factor family protein [Populus trichocarpa] Length = 1455 Score = 2266 bits (5871), Expect = 0.0 Identities = 1123/1464 (76%), Positives = 1271/1464 (86%), Gaps = 13/1464 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARK----GIGPIP 191 MSYAA+KMMHWPT I+ C SGF++H S+ A +P + +DLDSDWP+R+ GIGP P Sbjct: 1 MSYAAYKMMHWPTTIDTCVSGFVTHSRSESA-HLPQLHTDDLDSDWPSRRRHGGGIGPTP 59 Query: 192 NLVVTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVET 371 NL+V +GN+LE+YV+RVQEE +RS E KRGG M G++GASLELVCHY+LHGNVE+ Sbjct: 60 NLIVASGNVLELYVVRVQEEG----ARSSGELKRGGVMDGVAGASLELVCHYRLHGNVES 115 Query: 372 MAVLSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRE 551 M VLS+EG D +RRDSIILAF+DAKISVLEFDDSI+GLRTSSMHCFEGP+W HLKRGRE Sbjct: 116 MGVLSVEGGDDSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRE 175 Query: 552 SFARGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYV 731 SFARGPLVKVDPQGRCGGVLVY LQMIILK HI SSY+ Sbjct: 176 SFARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYI 235 Query: 732 ISLRDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQ 911 I+LRDLDMKHVKDF FVH YIEPV+V+LHERELTWAGR+ WKHHTCMISALSISTTLKQ Sbjct: 236 INLRDLDMKHVKDFIFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQP 295 Query: 912 PLIWSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQEL 1091 LIWS NLPHDAYKL++VPSPIGGVLVIG N+IHYHS+S SCALA+NSYA +VD+SQEL Sbjct: 296 TLIWSIGNLPHDAYKLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQEL 355 Query: 1092 PRSSFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGV 1271 PR++F+VELDAANATWL DV +LSTKTGELLLLT+VYDGRVVQRL+LSKSKASVLTS + Sbjct: 356 PRATFSVELDAANATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDI 415 Query: 1272 TTIGNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDAL 1439 TT+GNSF FLGSRLGDS+LVQFT G+G+S+ KEEVGDIEGD+PSAKRL+++SSDAL Sbjct: 416 TTLGNSFFFLGSRLGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDLPSAKRLKVSSSDAL 475 Query: 1440 QDIVSGEELSLYSSAPNTSELAQ-----KTFSFAVRDSLINIGPLKDFSYGLRINADPNA 1604 QD+VSGEELSLYSSAPN +E +Q KTFSF VRDSLIN+GPLKDF+YGLRINAD NA Sbjct: 476 QDMVSGEELSLYSSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANA 535 Query: 1605 TGIAKQSNYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNAD 1784 TGI+KQSNYELVCCSGHGKNGALCVLQQSIRPE+ITEVELPGCKGIWTVYHK+AR H+ D Sbjct: 536 TGISKQSNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARSHSVD 595 Query: 1785 SSKMVSLDDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVI 1964 S KM S DDEYHA+LIIS+E+RTMVLETAD L EVTESVDY+VQG TIAAGNLFGRRRV+ Sbjct: 596 SLKMAS-DDEYHAYLIISMEARTMVLETADHLTEVTESVDYFVQGRTIAAGNLFGRRRVV 654 Query: 1965 QVFARGARVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLV 2144 QVF RGAR+LDGS+MTQDLSFG NSE PYVL+RM DGSIQ+LV Sbjct: 655 QVFERGARILDGSFMTQDLSFGGSNSETGRSESSTVMHVSIVD-PYVLVRMADGSIQILV 713 Query: 2145 GDPSTCTVSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADG 2324 GDPS CTVS++ P AF+SS S++ACTLYHDKGPEPWLRKTSTDAWLSTG+ EAIDGAD Sbjct: 714 GDPSACTVSVNTPSAFQSSTKSVSACTLYHDKGPEPWLRKTSTDAWLSTGISEAIDGADS 773 Query: 2325 ALHDQGDIYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKN 2504 H+QGDIYCVVCYE+G LEIFDVP+F VF V+KF+SG THL+DT E AKD Sbjct: 774 GAHEQGDIYCVVCYETGALEIFDVPNFNSVFFVDKFVSGKTHLLDTCTGEPAKDMMKGVK 833 Query: 2505 EVVAGQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKI 2684 E VAG +KE+ QNMK+VEL M RW+G H+RP+LFG+LTDGT+LCYHAYL+EG + T+K+ Sbjct: 834 EEVAGAGRKESTQNMKVVELTMLRWSGRHSRPFLFGILTDGTILCYHAYLFEGPDGTSKL 893 Query: 2685 EESVSGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLF 2864 E+SVS QN+V S+IS SRLRNLRFVRVPL+TYTREETS+ + CQR+T FKN+ GYQG F Sbjct: 894 EDSVSAQNSVGASTISASRLRNLRFVRVPLDTYTREETSSETSCQRITTFKNISGYQGFF 953 Query: 2865 LSGSRPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVL 3044 LSGSRPAWFMV RERLRVHPQLCDGSIVAF VLH VNCNHGLIYVTS+G LKIC L SV Sbjct: 954 LSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSSVS 1013 Query: 3045 SYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHD 3224 SYDNYWPVQKIPLKGTPHQVTYFAE+NLYPLIVSVPV +P+NQVLSSLVDQ+ GHQ+E+ Sbjct: 1014 SYDNYWPVQKIPLKGTPHQVTYFAERNLYPLIVSVPVQKPVNQVLSSLVDQEVGHQIENH 1073 Query: 3225 NFSPDELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKEN 3404 N S +E+H+TY+VDE+EVRILEP S PWQ +ATIPMQ SENALTVRMV+LFNT++KEN Sbjct: 1074 NLSSEEIHRTYSVDEFEVRILEP--SNGPWQVKATIPMQTSENALTVRMVSLFNTSTKEN 1131 Query: 3405 ETLLAIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHL 3584 ETLLA+GTAYVQGEDVAARGR+LLFSV +N +NSQ LVSEVYSKELKGAISALASLQGHL Sbjct: 1132 ETLLAVGTAYVQGEDVAARGRILLFSVVKNPENSQILVSEVYSKELKGAISALASLQGHL 1191 Query: 3585 LIASGPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGA 3764 LIASGPK+ILHKWTG ELTGVAF D PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGA Sbjct: 1192 LIASGPKIILHKWTGTELTGVAFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1251 Query: 3765 QLSLLAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAE 3944 QLSLLAKDF +LDCFSTEFLIDGSTLSL+VSDEQKNVQIFYYAPKMSESWKGQKLLSRAE Sbjct: 1252 QLSLLAKDFASLDCFSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQKLLSRAE 1311 Query: 3945 FHVGAHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRL 4124 FHVGA VTKF+RLQML S DR+GA P SDKTNRFALLFGTLDGS+GCIAPLDELTFRRL Sbjct: 1312 FHVGALVTKFMRLQMLSPSLDRSGAAPVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1371 Query: 4125 QTLQKKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMA 4304 Q+LQKKLVD+VPH+AGLNP++FR FRS+GKAHRPGP++++DCE+L +YEM+PLE+Q+E+A Sbjct: 1372 QSLQKKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQVEIA 1431 Query: 4305 HQIGTTRTQILTNLNDLSLGTSFL 4376 QIGTTR QIL+NLNDL+LGTSFL Sbjct: 1432 QQIGTTRAQILSNLNDLTLGTSFL 1455 >ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative [Ricinus communis] gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative [Ricinus communis] Length = 1461 Score = 2262 bits (5861), Expect = 0.0 Identities = 1109/1462 (75%), Positives = 1268/1462 (86%), Gaps = 11/1462 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWP-ARKGIGPIPNLV 200 MSYAA+KM+HWPTGIE+CASG+I+H +DF PQIP I ++LDS+WP +++GIGP+PNL+ Sbjct: 1 MSYAAYKMLHWPTGIESCASGYITHSRADFVPQIPPIQTDNLDSEWPPSKRGIGPMPNLI 60 Query: 201 VTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAV 380 VTAG++LEVYV+RVQE+ R SRS E KRGG M G+SGASLELVCHY+LHGNVE+M V Sbjct: 61 VTAGSVLEVYVVRVQED-GSRESRSSRETKRGGLMDGVSGASLELVCHYRLHGNVESMVV 119 Query: 381 LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560 L EG D +RRDSIILAF+DAKISVLEFDDSI+GLRTSSMHCFEGPEW HLKRGRESFA Sbjct: 120 LPTEGGDSSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFA 179 Query: 561 RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740 RGPL+KVDPQGRCGG+LVY +QMIIL+ ++SSYVI+L Sbjct: 180 RGPLLKVDPQGRCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINL 239 Query: 741 RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920 RD+DMKHVKDF F+H YIEPV+VILHERELTWAGR+SWKHHTCMISALSISTTLKQ LI Sbjct: 240 RDMDMKHVKDFIFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLI 299 Query: 921 WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100 WS VNLPHDAYKL++VP PIGGVLVI AN+IHYHS+S + ALA+N+YAV++D+SQELPR+ Sbjct: 300 WSVVNLPHDAYKLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRA 359 Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280 SF+VELDA A WL +DV +LS K GELLLL++VYDGRVVQRL+LSKSKASVLTS +TTI Sbjct: 360 SFSVELDAVKAAWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTI 419 Query: 1281 GNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDI 1448 GNS FLGSRLGDS+LVQFT G+G SV KEEVG+IEGDVPSAKRL+ ++SD LQD+ Sbjct: 420 GNSLFFLGSRLGDSLLVQFTNGLGPSVVSSGLKEEVGEIEGDVPSAKRLKRSASDGLQDM 479 Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628 VSGEELSLY S N +E AQK+FSFAVRDSLIN+GPLKDFSYGLR N D +ATGIAKQSN Sbjct: 480 VSGEELSLYGSTANNTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSN 539 Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808 Y+LVCCSGHGKNG LC+L+QSIRPE+ITEV+LPGC+GIWTVYHK+ARGHN D SKM + Sbjct: 540 YDLVCCSGHGKNGTLCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAA 599 Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988 DEYHA+LIIS+E+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRVIQVF RGAR Sbjct: 600 DEYHAYLIISMEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 659 Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168 +LDGS+MTQDLS G+ NSE PYVL++MTDGSI+LL+GD STC V Sbjct: 660 ILDGSFMTQDLSIGSSNSESSPGSESATVSSVSIADPYVLIKMTDGSIRLLIGDSSTCMV 719 Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGA---DGALHDQ 2339 SI+ P AFE+S+ S++ACTLYHDKGPEPWLRK STDAWLSTG+ EAIDGA DG HDQ Sbjct: 720 SINTPSAFENSERSVSACTLYHDKGPEPWLRKASTDAWLSTGVSEAIDGAESADGGPHDQ 779 Query: 2340 GDIYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEV--- 2510 GDIYC+VCYESG LEIFDVP+F VFSV+KF+SG THL D ++RE KDSQ N + Sbjct: 780 GDIYCIVCYESGALEIFDVPNFNRVFSVDKFVSGKTHLADAYVREPPKDSQEKTNRISEE 839 Query: 2511 VAGQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEE 2690 VAG +KE A NMK VELAMQRW+G H+RP+LFG+LTDGT+LCYHAYL+E ++T+K E+ Sbjct: 840 VAGLGRKENAHNMKAVELAMQRWSGHHSRPFLFGVLTDGTILCYHAYLFEAPDATSKTED 899 Query: 2691 SVSGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLS 2870 SVS QN V L SIS SRLRNLRFVRVPL++Y +EETS + CQR+T+F N+ G+QG FL Sbjct: 900 SVSAQNPVGLGSISASRLRNLRFVRVPLDSYIKEETSTENSCQRITIFNNISGHQGFFLL 959 Query: 2871 GSRPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSY 3050 GSRPAWFMV RERLRVHPQLCDGSIVAF VLHNVNCNHGLIYVTS+G LKICQLPS +Y Sbjct: 960 GSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSFSNY 1019 Query: 3051 DNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNF 3230 DNYWPVQKIPLKGTPHQVTYF EKNLYPLIVSVPV +P+NQVLSSLVDQ+ GHQ+E+ N Sbjct: 1020 DNYWPVQKIPLKGTPHQVTYFPEKNLYPLIVSVPVHKPVNQVLSSLVDQEVGHQIENHNL 1079 Query: 3231 SPDELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENET 3410 S DEL +TY+V+E+EVRILE E G PWQT+ATIPMQ+SENALTVR+VTLFN T+KENET Sbjct: 1080 SSDELLQTYSVEEFEVRILESENGGGPWQTKATIPMQSSENALTVRVVTLFNATTKENET 1139 Query: 3411 LLAIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLI 3590 LLAIGTAYVQGEDVAARGRVLLFSV ++T+NSQ LVSEVYSKELKGAISALASLQGHLLI Sbjct: 1140 LLAIGTAYVQGEDVAARGRVLLFSVVKSTENSQVLVSEVYSKELKGAISALASLQGHLLI 1199 Query: 3591 ASGPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQL 3770 ASGPK+ILHKWTG EL GVAFYD PPLYV S+NIVKNFIL+GDIHKSIYFLSWKEQGAQL Sbjct: 1200 ASGPKIILHKWTGTELNGVAFYDAPPLYVASMNIVKNFILLGDIHKSIYFLSWKEQGAQL 1259 Query: 3771 SLLAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFH 3950 SLLAKDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKM ESWKGQKLLSRAEFH Sbjct: 1260 SLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMLESWKGQKLLSRAEFH 1319 Query: 3951 VGAHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQT 4130 VGAH+TKF+RL ML TSSDR+GA PG DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+ Sbjct: 1320 VGAHITKFIRLSMLSTSSDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1379 Query: 4131 LQKKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQ 4310 LQ+KLVD+VPH+AGLNPR+FR FRS+GK HRPGP++++DCELL H+EMLPLE+QLE+A Q Sbjct: 1380 LQRKLVDAVPHVAGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEEQLEIAQQ 1439 Query: 4311 IGTTRTQILTNLNDLSLGTSFL 4376 +GTTR QIL+NLNDLSLGTSFL Sbjct: 1440 VGTTRAQILSNLNDLSLGTSFL 1461 >ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Glycine max] Length = 1447 Score = 2260 bits (5856), Expect = 0.0 Identities = 1108/1457 (76%), Positives = 1267/1457 (86%), Gaps = 6/1457 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKG--IGPIPNL 197 MS+AA+KMM PTGI+NCA+GF++H SDF P P +DLD++WP+R +G +PNL Sbjct: 1 MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVPLQP----DDLDAEWPSRPRHHVGSLPNL 56 Query: 198 VVTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMA 377 VVTA N+LEVY +R+QE++ + + ++++RG + G++GASLELVCHY+LHGNVETMA Sbjct: 57 VVTAANVLEVYAVRLQEDQPPKAA---ADSRRGALLDGIAGASLELVCHYRLHGNVETMA 113 Query: 378 VLSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESF 557 VLS+ G D +RRDSI+L F DAKISVLE+DDSI+GLRTSS+HCFEGPEW HLKRGRE F Sbjct: 114 VLSIGGGDVSRRRDSIMLTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQF 173 Query: 558 ARGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVIS 737 ARGP+VKVDPQGRCGGVL+Y LQMIILK IESSY+I+ Sbjct: 174 ARGPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMIN 233 Query: 738 LRDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPL 917 LRDLDM+HVKDF+FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PL Sbjct: 234 LRDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 293 Query: 918 IWSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPR 1097 IWSAVNLPHDAYKL++VPSPIGGVLVI AN+IHYHSQS SCALA+NSYAV +D+SQE+PR Sbjct: 294 IWSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPR 353 Query: 1098 SSFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTT 1277 SSFNVELDAANATWL DV +LSTKTGELLLLT+VYDGRVVQRL+LSKSKASVL+SG+TT Sbjct: 354 SSFNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITT 413 Query: 1278 IGNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQD 1445 IGNS FL SRLGDSMLVQF+CG G S+ KEEVGDIE D PS KRLR + SDALQD Sbjct: 414 IGNSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPS-KRLRRSPSDALQD 472 Query: 1446 IVSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQS 1625 +VSGEELSLY SAPN +E AQK+FSFAVRDSLIN+GPLKDFSYGLRINAD NATGIAKQS Sbjct: 473 MVSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 532 Query: 1626 NYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSL 1805 NYELVCCSGHGKNG+LCVL+QSIRPE+ITEVELPGCKGIWTVYHKS R HNADSSKM Sbjct: 533 NYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADD 592 Query: 1806 DDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGA 1985 DDEYHA+LIISLE+RTMVLETADLL EVTESVDYYVQG T+AAGNLFGR RVIQV+ RGA Sbjct: 593 DDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGA 652 Query: 1986 RVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCT 2165 R+LDGS+MTQD+SFGA N E P+VLLRM+DGSI+LL+GDPSTCT Sbjct: 653 RILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCT 712 Query: 2166 VSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGD 2345 +S++ P +FESSK S+++CTLYHDKGPEPWLRKTSTDAWLSTG+GE IDG DGA D GD Sbjct: 713 ISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGETIDGTDGAAQDHGD 772 Query: 2346 IYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEVVAGQV 2525 IYCVVC+++G LEIFDVP+F CVFSVE F+SG +HLVD ++E KDS+ + V Q Sbjct: 773 IYCVVCFDNGNLEIFDVPNFNCVFSVENFMSGKSHLVDALMKEVLKDSKQGDRDGVINQG 832 Query: 2526 KKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSGQ 2705 +KE +MK+VELAMQRW+G+H+RP+LFG+L+DGT+LCYHAYLYE +ST+K+E+S S Sbjct: 833 RKENIPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDSTSKVEDSASAG 892 Query: 2706 NTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRPA 2885 ++ LSS + SRLRNLRFVRVPL+ Y RE+TS PCQ++T+FKN+G Y+G FLSGSRPA Sbjct: 893 GSIGLSSTNVSRLRNLRFVRVPLDAYAREDTSNGPPCQQITIFKNIGSYEGFFLSGSRPA 952 Query: 2886 WFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYWP 3065 W MV RERLRVHPQLCDGSIVAF VLHNVNCN GLIYVTS+G LKICQLPS +YD+YWP Sbjct: 953 WVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSYWP 1012 Query: 3066 VQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDEL 3245 VQKIPLK TPHQVTYFAEKNLYPLIVS PV++PLNQV+ SLVDQD HQ E N +PDE Sbjct: 1013 VQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVI-SLVDQDINHQNESQNMNPDEQ 1071 Query: 3246 HKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAIG 3425 ++ Y +DE+EVRI+EPEKSG PWQT+ATIPMQ+SENALTVRMVTL NTTSKENETLLAIG Sbjct: 1072 NRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLVNTTSKENETLLAIG 1131 Query: 3426 TAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGPK 3605 TAYVQGEDVAARGR+LLFS+G+NTDN Q LVSEVYSKELKGAISALASLQGHLLIASGPK Sbjct: 1132 TAYVQGEDVAARGRILLFSLGKNTDNPQTLVSEVYSKELKGAISALASLQGHLLIASGPK 1191 Query: 3606 VILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLAK 3785 +ILHKW G EL G+AF+D PPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQLSLLAK Sbjct: 1192 IILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLLAK 1251 Query: 3786 DFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 3965 DFG+LDCF+TEFLIDGSTLSL+VSD+ +N+QIFYYAPKMSESWKGQKLLSRAEFHVGAHV Sbjct: 1252 DFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1311 Query: 3966 TKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKKL 4145 TKFLRLQML T SDR GA PGSDKTNRFALLFGTLDGS+GCIAPLDE+TFRRLQ+LQ+KL Sbjct: 1312 TKFLRLQMLST-SDRAGAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKL 1370 Query: 4146 VDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTTR 4325 VD+VPH+AGLNPRAFR+FRSNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ+GTTR Sbjct: 1371 VDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVGTTR 1430 Query: 4326 TQILTNLNDLSLGTSFL 4376 +QIL+NL+DLSLGTSFL Sbjct: 1431 SQILSNLSDLSLGTSFL 1447 >ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like isoform X1 [Glycine max] Length = 1449 Score = 2249 bits (5829), Expect = 0.0 Identities = 1105/1458 (75%), Positives = 1267/1458 (86%), Gaps = 7/1458 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDS-DWPARKG--IGPIPN 194 MS+AA+KMM PTGI+NCA+GF++H SDF P P +DLD+ +WP+R +GP+PN Sbjct: 1 MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVPLQP----DDLDAAEWPSRPRHHVGPLPN 56 Query: 195 LVVTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETM 374 LVVTA N+LEVY +R+QE++ + + +++RG + G++GASLEL CHY+LHGNVETM Sbjct: 57 LVVTAANVLEVYAVRLQEDQQPK--DASDDSRRGTLLDGIAGASLELECHYRLHGNVETM 114 Query: 375 AVLSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRES 554 AVLS+ G D ++RDSIIL F DAKISVLE+DDSI+GLRTSS+HCFEGPEW HLKRGRE Sbjct: 115 AVLSIGGGDVSRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQ 174 Query: 555 FARGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVI 734 FARGP+VK+DPQGRCGGVL+Y LQMIILK IESSY+I Sbjct: 175 FARGPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMI 234 Query: 735 SLRDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQP 914 +LRDLDM+HVKDF+FV+GYIEPVMVILHERELTWAGR+SW HHTCMISALSISTTLKQ P Sbjct: 235 NLRDLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHP 294 Query: 915 LIWSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELP 1094 LIWSAVNLPHDAYKL++VPSPIGGVLVIGAN+IHYHSQS SCALA+N+YAV +D+SQE+P Sbjct: 295 LIWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIP 354 Query: 1095 RSSFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVT 1274 RSSFNVELDAANATWL DV +LSTKTGELLLL +VYDGRVVQRL+LSKSKASVL+SG+T Sbjct: 355 RSSFNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGIT 414 Query: 1275 TIGNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQ 1442 TIGNS FL SRLGDSMLVQF+CG G S+ KEEVGDIE D PS KRLR + SDALQ Sbjct: 415 TIGNSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPS-KRLRRSPSDALQ 473 Query: 1443 DIVSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQ 1622 D+VSGEELSLY SA N +E AQK+FSFAVRDSLIN+GPLKDFSYGLRINAD NATGIAKQ Sbjct: 474 DMVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQ 533 Query: 1623 SNYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVS 1802 SNYELVCCSGHGKNG+LCVL+QSIRPE+ITEVELPGCKGIWTVYHKS R HNADSSKM Sbjct: 534 SNYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMAD 593 Query: 1803 LDDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARG 1982 DDEYHA+LIISLE+RTMVLETADLL EVTESVDYYVQG T+AAGNLFGRRRVIQV+ RG Sbjct: 594 DDDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERG 653 Query: 1983 ARVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTC 2162 AR+LDGS+MTQD+SFGA NSE P+VLLRM+DGSI+LL+GDPSTC Sbjct: 654 ARILDGSFMTQDVSFGASNSESGSASESAIALSVSIADPFVLLRMSDGSIRLLIGDPSTC 713 Query: 2163 TVSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQG 2342 T+S++ P +FESSK S+++CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDG DGA D G Sbjct: 714 TISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHG 773 Query: 2343 DIYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEVVAGQ 2522 DIYCVVC+++G LEIFD+P+F CVFSVE F+SG +HLVD ++E KDS+ + V Q Sbjct: 774 DIYCVVCFDNGNLEIFDIPNFNCVFSVENFMSGKSHLVDALMKEVLKDSKQGDRDGVVNQ 833 Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702 +K+ NMK+VELAMQRW+G+H+RP+LFG+L+DGT+LCYHAYLYE + T+K+E+S S Sbjct: 834 GRKDNIPNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSASA 893 Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882 ++ LSS + SRLRNLRFVRVPL+ Y RE+TS SPCQ++T+FKN+G YQG FLSGSRP Sbjct: 894 GGSIGLSSTNVSRLRNLRFVRVPLDAYPREDTSNGSPCQQITIFKNIGSYQGFFLSGSRP 953 Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062 AW MV RERLRVHPQLCDGSIVAF VLHNVNCNHGLIYVTS+G LKICQLPS +YD+YW Sbjct: 954 AWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDSYW 1013 Query: 3063 PVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDE 3242 PVQKIPLK TPHQVTYFAEKNLYPLIVS PV++PLNQV+ SLVDQD HQ E N +PDE Sbjct: 1014 PVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVI-SLVDQDFNHQNESQNMNPDE 1072 Query: 3243 LHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAI 3422 ++ Y +DE+EVRI+EPEKSG PWQT+ATIPMQ+SENALTVRMVTL NTTSKENETLLAI Sbjct: 1073 QNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETLLAI 1132 Query: 3423 GTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGP 3602 GTAYVQGEDVAARGR+LLFS+G+ TDN Q LVSEVYSKELKGAISALASLQGHLLIASGP Sbjct: 1133 GTAYVQGEDVAARGRILLFSLGKITDNPQTLVSEVYSKELKGAISALASLQGHLLIASGP 1192 Query: 3603 KVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLA 3782 K+ILHKW G EL G+AF+D PPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQLSLLA Sbjct: 1193 KIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLLA 1252 Query: 3783 KDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAH 3962 KDFG+LDCF+TEFLIDGSTLSL+VSD+ +N+QIFYYAPKMSESWKGQKLLSRAEFHVGAH Sbjct: 1253 KDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1312 Query: 3963 VTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKK 4142 VTKFLRLQML T SDR G+ PGSDKTNRFALLFGTLDGS+GCIAPLDE+TFRRLQ+LQ+K Sbjct: 1313 VTKFLRLQMLST-SDRAGSVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRK 1371 Query: 4143 LVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTT 4322 LVD+VPH+AGLNPRAFR+FRSNGKAHRPGPD+++DCELL HYEMLPLE+QLE+A+QIGTT Sbjct: 1372 LVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQIGTT 1431 Query: 4323 RTQILTNLNDLSLGTSFL 4376 R+QIL+NL+DLSLGTSFL Sbjct: 1432 RSQILSNLSDLSLGTSFL 1449 >ref|XP_007152397.1| hypothetical protein PHAVU_004G126600g [Phaseolus vulgaris] gi|561025706|gb|ESW24391.1| hypothetical protein PHAVU_004G126600g [Phaseolus vulgaris] Length = 1445 Score = 2236 bits (5793), Expect = 0.0 Identities = 1095/1456 (75%), Positives = 1260/1456 (86%), Gaps = 5/1456 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPAR-KGIGPIPNLV 200 MS+AA+KMM TGI+NCA+GF++H +D P P EDLD++WP+R + +GP+PNLV Sbjct: 1 MSFAAYKMMQCSTGIDNCAAGFLTHSRADSVPLQP----EDLDAEWPSRPRRVGPLPNLV 56 Query: 201 VTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAV 380 VTA N+LEVY +R+QE++ + + + +RG + G+ GASLELVCHY+LHGNVETMAV Sbjct: 57 VTAANVLEVYTVRIQEDQPPKAA----DPRRGTLLDGIDGASLELVCHYRLHGNVETMAV 112 Query: 381 LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560 LS+ G D ++RDSIIL F DAKISVLE+DDSI+GLRTSS+HCFEGPEW HLKRGRE FA Sbjct: 113 LSIGGGDASRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFA 172 Query: 561 RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740 RGP+VKVDPQGRCGG L+Y LQMIILK IESSY+I+L Sbjct: 173 RGPVVKVDPQGRCGGTLIYDLQMIILKATQAGSGLVGDDDALGFSGAVAARIESSYMINL 232 Query: 741 RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920 RDLDM+HVKDF+FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLI Sbjct: 233 RDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 292 Query: 921 WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100 WSAVNLPHDAYKL++VPSPIGGVLVIGAN++HYHSQS SCALA+NSYAV++DNSQE+PRS Sbjct: 293 WSAVNLPHDAYKLLAVPSPIGGVLVIGANTVHYHSQSASCALALNSYAVSLDNSQEIPRS 352 Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280 SFNVELD+ANATWL DV +LSTKTGELLLLT+VYDGRVVQRL+LSKSKASVL+SG+TTI Sbjct: 353 SFNVELDSANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTI 412 Query: 1281 GNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDI 1448 GNS FL SRLGDSMLVQF+CG G S+ KEEVGDIE D PS KRLR + SD LQD+ Sbjct: 413 GNSLFFLASRLGDSMLVQFSCGSGGSMLSSNLKEEVGDIEADAPS-KRLRRSPSDTLQDV 471 Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628 VSGEELSLY SAPN +E AQK+FSFAVRDSLIN+GPLKDFSYGLRINAD NATGIAKQSN Sbjct: 472 VSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 531 Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808 YELVCCSGHGKNG+LCVL+QSIRPE+ITEVELPGCKGIWTVYHKS R HN DSSK+ D Sbjct: 532 YELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNTDSSKLADDD 591 Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988 DEYHA+LIISLE+RTMVLETADLL EVTESVDYYVQG T+AAGNLFGRRRVIQV+ RGAR Sbjct: 592 DEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGAR 651 Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168 +LDGS+MTQD++FGA NSE P+VLLRM+DGS++LL+GDP TCT+ Sbjct: 652 ILDGSFMTQDVTFGASNSESASASESAIALSVSIADPFVLLRMSDGSVRLLIGDPITCTI 711 Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348 S++ P +FES+K S+++CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDG DGA D GDI Sbjct: 712 SVTSPASFESTKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGDI 771 Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEVVAGQVK 2528 YCVVC+++G LEIFDVP+F CVFSV F+SG +HLVD ++E KDS+ + V Q + Sbjct: 772 YCVVCFDNGNLEIFDVPNFNCVFSVGNFMSGKSHLVDALMKEVLKDSKKGDRDGVIIQGR 831 Query: 2529 KETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSGQN 2708 KE +MK+VELAMQRW+G+H+RP+LFG+L+DGT+LCYHAYLYE + T+K+E+S S Sbjct: 832 KENVPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSASAGG 891 Query: 2709 TVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRPAW 2888 ++ L + + SRLRNLRFVRV L+ Y REETS S Q++T+FKN+G YQG FLSGSRPAW Sbjct: 892 SIGLGTTNISRLRNLRFVRVSLDAYAREETSNGSLHQQITIFKNIGSYQGFFLSGSRPAW 951 Query: 2889 FMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYWPV 3068 MV RERLRVHPQLCDGSIVAF VLHNVNCNHGLIYVTS+G LKICQLPS +YD+YWPV Sbjct: 952 VMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDSYWPV 1011 Query: 3069 QKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDELH 3248 QKIPLK TPHQVTYFAEKNLYPLIVS PV++PL+QV+ SLVDQD HQ E N + DE + Sbjct: 1012 QKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLSQVI-SLVDQDVNHQNESQNMNSDEQN 1070 Query: 3249 KTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAIGT 3428 + Y +DE+EVRI+EPEKSG PWQT+ATIPMQ+SENALTVRMVTL NTTSKENETLLAIGT Sbjct: 1071 RFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETLLAIGT 1130 Query: 3429 AYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGPKV 3608 AYVQGEDVAARGR+LLFS+G+NTDN Q+LVSEVYSKELKGAISALASLQGHLLIASGPK+ Sbjct: 1131 AYVQGEDVAARGRILLFSLGKNTDNPQSLVSEVYSKELKGAISALASLQGHLLIASGPKI 1190 Query: 3609 ILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLAKD 3788 ILHKW G EL G+AF+D PPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQLSLLAKD Sbjct: 1191 ILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLLAKD 1250 Query: 3789 FGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 3968 F +LDCF+TEFLIDGSTLSL+VSD+++N+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVT Sbjct: 1251 FSSLDCFATEFLIDGSTLSLMVSDDKRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 1310 Query: 3969 KFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKKLV 4148 KFLRLQMLPT SDR G+ PGSDKTNRFALLFGTLDGS+GCIAPLDE+TFRRLQ+LQKKLV Sbjct: 1311 KFLRLQMLPT-SDRAGSAPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQKKLV 1369 Query: 4149 DSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTTRT 4328 D+V H+AGLNPRAFR F+SNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ+GTTR+ Sbjct: 1370 DAVAHVAGLNPRAFRKFQSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVGTTRS 1429 Query: 4329 QILTNLNDLSLGTSFL 4376 QIL+NL+DLSLGTSFL Sbjct: 1430 QILSNLSDLSLGTSFL 1445 >ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Cicer arietinum] Length = 1447 Score = 2227 bits (5771), Expect = 0.0 Identities = 1102/1461 (75%), Positives = 1256/1461 (85%), Gaps = 10/1461 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITN---EDLDSDW-PARKGIGPIP 191 MS+AA+KMM WPTGI+NCASGF++H SD P+IP I + +D+DSDW P + + P+P Sbjct: 1 MSFAAYKMMQWPTGIQNCASGFLTHSRSDSTPRIPPIQHNDDDDIDSDWVPQPRDLAPLP 60 Query: 192 NLVVTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVET 371 NLV+TA NILEVY +R+Q++ P + + GL+GASLELVCHY+LHGNVE+ Sbjct: 61 NLVITAANILEVYTVRIQQDP-------PKSSADPRVLDGLAGASLELVCHYRLHGNVES 113 Query: 372 MAVLSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRE 551 +AVLS+ G D +RRDSIIL F+DAKISVLE+DDSI+GLRTSS+HCFEGPEW HLKRGRE Sbjct: 114 VAVLSVGGGDASRRRDSIILTFKDAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGRE 173 Query: 552 SFARGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYV 731 FARGP+ KVDPQGRCGGVLVY LQMIILK IESSY+ Sbjct: 174 HFARGPVAKVDPQGRCGGVLVYDLQMIILKTTQAGSGLVGEDDVLGSGGAVAARIESSYM 233 Query: 732 ISLRDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQ 911 I+LRDLDM+HVKDF+F+HGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ Sbjct: 234 INLRDLDMRHVKDFTFLHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 293 Query: 912 PLIWSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQEL 1091 PLIWSAVNLPHDAYKL++VPSPIGGVLVIGAN+IHYHSQS SCALA+NSYAV+VDNSQE+ Sbjct: 294 PLIWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNSYAVSVDNSQEM 353 Query: 1092 PRSSFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGV 1271 PRSSFNVELDAANATWL +DV +LSTKTGELLLLT++YDGRVVQRL+LSKSKASVL+SGV Sbjct: 354 PRSSFNVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLSSGV 413 Query: 1272 TTIGNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDAL 1439 TTIGNS FL SRLGDSMLVQF+ G G S+ KEEVGD + D SAKR+R + SD L Sbjct: 414 TTIGNSLFFLASRLGDSMLVQFSSGSGVSMLSSNLKEEVGDFDVDASSAKRMRRSPSDTL 473 Query: 1440 QDIVSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAK 1619 QD+VSGEELSLY SA N +E AQK+FSFAVRDSLIN+GPLKDFSYGLRINAD NATGIAK Sbjct: 474 QDMVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAK 533 Query: 1620 QSNYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMV 1799 QSNYELVCCSGHGKNG+LCVL+QSIRPE+ITEVELPGCKGIWTVYHKS R NADSSK+ Sbjct: 534 QSNYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSLNADSSKLA 593 Query: 1800 SLDDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFAR 1979 +DEYHA+LIISLESRTMVLETADLL EVTESVDYYVQG T+AAGNLFGRRRVIQV+ R Sbjct: 594 DDEDEYHAYLIISLESRTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYER 653 Query: 1980 GARVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPST 2159 GAR+LDGS+MTQD+SFGA NSE PYVLL+M+DGS++LLVGDPST Sbjct: 654 GARILDGSFMTQDVSFGASNSEANYGSESALALSVSIADPYVLLKMSDGSVRLLVGDPST 713 Query: 2160 CTVSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQ 2339 CT+S++ P +FESSK S++ CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDG DGA D Sbjct: 714 CTISVTSPASFESSKGSVSTCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDH 773 Query: 2340 GDIYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSK--NEVV 2513 GDIYCVVCYE+ +LEIFDVP+F CVFSVE F+SG +HLVD +E KDSQ ++ V Sbjct: 774 GDIYCVVCYENDSLEIFDVPNFSCVFSVENFLSGKSHLVDALTKEVPKDSQKGDKVSDGV 833 Query: 2514 AGQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEES 2693 Q +K+ A NMK+VELAMQRW+G+H RP+LFG+L+DGT LCYHAYLYE + T+K+E+S Sbjct: 834 VSQGRKD-ALNMKVVELAMQRWSGKHGRPFLFGILSDGTTLCYHAYLYESPDGTSKVEDS 892 Query: 2694 VSGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSG 2873 VS LS+ S SRLRNLRFVRVPL+ + REETS PCQ++ +FKN+G Y+G FLSG Sbjct: 893 VSA----GLSNSSVSRLRNLRFVRVPLDVHAREETSNGPPCQQINIFKNIGSYEGFFLSG 948 Query: 2874 SRPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYD 3053 SRPAW M+ RERLRVHPQLCDGSIVAF VLHNVNCNHGLIYVTS+G LKICQLPS +YD Sbjct: 949 SRPAWVMLLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYD 1008 Query: 3054 NYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFS 3233 YWPVQK+PLK TPHQVTYFAEKNLYPLIVS PV +PLNQV++ LVDQDA E N + Sbjct: 1009 CYWPVQKVPLKATPHQVTYFAEKNLYPLIVSYPVPKPLNQVIA-LVDQDANQLTESQNLN 1067 Query: 3234 PDELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETL 3413 DE YT++E+EVRI+EPEKSG PWQ +ATIPMQ+SENALTVRMVTL NT+SKENETL Sbjct: 1068 NDEQSHLYTIEEFEVRIMEPEKSGGPWQLKATIPMQSSENALTVRMVTLMNTSSKENETL 1127 Query: 3414 LAIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIA 3593 LAIGTAYVQGEDVAARGR+LLFS+G+NTDN QNLVSEVYSKELKGAISALA+LQGHLL+A Sbjct: 1128 LAIGTAYVQGEDVAARGRILLFSLGKNTDNPQNLVSEVYSKELKGAISALAALQGHLLVA 1187 Query: 3594 SGPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLS 3773 SGPK+ILHKWTG EL GVAF+D PPL+VVSLNIVKNFIL+GD+HKSIYFLSWKEQGAQLS Sbjct: 1188 SGPKIILHKWTGTELNGVAFFDVPPLHVVSLNIVKNFILIGDVHKSIYFLSWKEQGAQLS 1247 Query: 3774 LLAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHV 3953 LLAKDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHV Sbjct: 1248 LLAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1307 Query: 3954 GAHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTL 4133 GAH+TKFLRLQML T SD+TG+ PGSDKTNRFALLFGTLDGS+GCIAPLDE+TFRRLQ+L Sbjct: 1308 GAHITKFLRLQMLST-SDKTGSGPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSL 1366 Query: 4134 QKKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQI 4313 QKKLVD+VPH+AGLNPRAFR+F SNGKAHRPGPD+++DCELL HYEML LE+QLE+AHQ+ Sbjct: 1367 QKKLVDAVPHVAGLNPRAFRLFHSNGKAHRPGPDSIVDCELLCHYEMLQLEEQLEIAHQV 1426 Query: 4314 GTTRTQILTNLNDLSLGTSFL 4376 GTTR+QIL+NL+DLSLGTSFL Sbjct: 1427 GTTRSQILSNLSDLSLGTSFL 1447 >ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Cucumis sativus] Length = 1504 Score = 2145 bits (5559), Expect = 0.0 Identities = 1076/1508 (71%), Positives = 1242/1508 (82%), Gaps = 57/1508 (3%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203 MS+AA++MMHWPTGIENC S +I+H +DF P + S ++DLDSDW R+ IGP+PNLVV Sbjct: 1 MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTS-HSDDLDSDWHPRRDIGPVPNLVV 59 Query: 204 TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383 TAGN+LEVYV+RV EE GR S+S E +RGG M G+SGASLELVCHY+LHGNVE+MA+L Sbjct: 60 TAGNVLEVYVVRVLEE-GGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAIL 118 Query: 384 SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563 S G DG K+RDSIIL FQ+AKISVLEFDDS + LRTSSMHCF+GP+W HLKRGRESFAR Sbjct: 119 SSRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFAR 178 Query: 564 GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743 GP+VKVDPQGRCGGVLVYGLQMIILK +ESSY+I+LR Sbjct: 179 GPVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLR 238 Query: 744 DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923 DLD+KHVKDF FVHGYIEPVMVILHE+ELTWAGR+SWKHHTCM+SALSISTTLKQ PLIW Sbjct: 239 DLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIW 298 Query: 924 SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103 SA NLPHDAYKL++VPSPIGGVLVI ANSIHY+SQS SC LA+N+YAV+ D+SQ++PRS+ Sbjct: 299 SASNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSN 358 Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283 FNVELDAANATWL +DV +LSTKTGELLLL +VYDGRVVQRL+LSKSKASVLTSG+ +IG Sbjct: 359 FNVELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIG 418 Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTS----------------------------------- 1358 NS FLGSRLGDS+LVQF+CG+G+S Sbjct: 419 NSLFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEITYYTQNLQKEMVPPTLPSALVHESK 478 Query: 1359 ----------------VPKEEVGDIEGDVPSAKRL---RMASSD-ALQDI-VSGEELSLY 1475 V + V +E D+ + L R+A D L D V G+ELSLY Sbjct: 479 PTQAKGTIELNNNNLCVENDIVDVVEVDITNMTILGENRIARRDETLTDTQVGGDELSLY 538 Query: 1476 SSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGH 1655 SA N +E AQK FSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGH Sbjct: 539 GSAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGH 598 Query: 1656 GKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDDEYHAFLII 1835 GKNGALC+L+QSIRPE+ITEVELPGCKGIWTVYHK+ RG ADSS+MV DDEYHA+LII Sbjct: 599 GKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEYHAYLII 658 Query: 1836 SLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARVLDGSYMTQ 2015 SLE+RTMVL T +LL EVTESVDY+V G TIAAGNLFGRRRVIQV+ GAR+LDGS+MTQ Sbjct: 659 SLEARTMVLVTGELLTEVTESVDYFVHGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQ 718 Query: 2016 DLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVSISIPPAFE 2195 DL+ +E PYVLL MTDGSI+LLVGD S+C+VS+S P AF Sbjct: 719 DLNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVSAPAAFG 778 Query: 2196 SSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIYCVVCYESG 2375 SSK +++CTLY DKG EPWLR TSTDAWLSTG+GE IDG DG+L DQGDIYCV CY++G Sbjct: 779 SSKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVACYDNG 838 Query: 2376 TLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKN-EVVAGQVKKETAQNMK 2552 LEIFDVP+F VF V+KF+SG +HLVD I + K S+ +N + + + E++QNMK Sbjct: 839 DLEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNSQELISHGRNESSQNMK 898 Query: 2553 IVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSGQNTVDLSSIS 2732 ++E+AMQRW+G+H+RP+LFG+LTDGT+LCYHAYL+E +S +KI++SVS N+V S++S Sbjct: 899 VIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNSVSSSNMS 958 Query: 2733 TSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRPAWFMVCRERL 2912 +SRLRNLRF+RVPL+ RE+ + R+++FKN+ GYQGLFL GSRPAWFMV RERL Sbjct: 959 SSRLRNLRFLRVPLDIQGREDMPNGTLSCRLSIFKNISGYQGLFLCGSRPAWFMVFRERL 1018 Query: 2913 RVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYWPVQKIPLKGT 3092 RVHPQLCDG IVAFAVLHNVNCNHGLIYVTS+G LKICQLPS +YDNYWPVQK+PLKGT Sbjct: 1019 RVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKVPLKGT 1078 Query: 3093 PHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDELHKTYTVDEY 3272 PHQVTYF EKNLYP+I+S PV +PLNQVLSS+VDQD GH VE+ N S DEL +TY+V+E+ Sbjct: 1079 PHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGH-VENHNLSADELQQTYSVEEF 1137 Query: 3273 EVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAIGTAYVQGEDV 3452 E+RILEPEKSG PWQTRATI M +SENALT+R+VTL NTT+KENETLLA+GTAYVQGEDV Sbjct: 1138 EIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAYVQGEDV 1197 Query: 3453 AARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGPKVILHKWTGA 3632 AARGRVLLFSVG++ DNSQ LVSEVYSKELKGAISALASLQGHLLIASGPK+ILHKWTGA Sbjct: 1198 AARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGA 1257 Query: 3633 ELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLAKDFGNLDCFS 3812 EL G+AFYD PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQLSLLAKDFG+LDC++ Sbjct: 1258 ELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCYA 1317 Query: 3813 TEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 3992 TEFLIDGSTLSL VSD+QKN+QIFYYAPK +ESWKGQKLLSRAEFHVGAHVTKFLRLQML Sbjct: 1318 TEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQML 1377 Query: 3993 PTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKKLVDSVPHMAG 4172 TSSD+ +T SDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQKKL D+VPH+ G Sbjct: 1378 STSSDKACSTV-SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAVPHVGG 1436 Query: 4173 LNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTTRTQILTNLND 4352 LNPR+FR F SNGK HR GPD+++DCELL HYEMLPLE+QL++AHQIGTTR+QIL+NLND Sbjct: 1437 LNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLND 1496 Query: 4353 LSLGTSFL 4376 LSLGTSFL Sbjct: 1497 LSLGTSFL 1504 >ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Solanum lycopersicum] Length = 1447 Score = 2142 bits (5550), Expect = 0.0 Identities = 1054/1460 (72%), Positives = 1245/1460 (85%), Gaps = 9/1460 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203 MS+AA K MH PTGIENCASGFI+H ++D PQI + D+DSDWPA K IGP+PNLVV Sbjct: 1 MSFAACKTMHCPTGIENCASGFITHSAADITPQIQTA---DVDSDWPATKPIGPVPNLVV 57 Query: 204 TAGNILEVYVIRVQEEEDGRVSRSPSEA-KRGGFMAGLSGASLELVCHYKLHGNVETMAV 380 +AGN+L+VY+IRV++ SR +E KRGG +AG+S ASLELVC Y+LHGN+ +M V Sbjct: 58 SAGNVLDVYLIRVEQAS----SRDAAEVVKRGGLVAGISAASLELVCTYRLHGNIYSMGV 113 Query: 381 LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560 ++ G+DG KRRDSIIL+F+DAK+SVLEFDD+ +GLRTSSMH FEGP+W HLKRGRESF Sbjct: 114 ITAGGADGGKRRDSIILSFEDAKMSVLEFDDATHGLRTSSMHFFEGPDWFHLKRGRESFD 173 Query: 561 RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740 +GP++KVDPQGRC GV + QMI+LK IESSY+I+L Sbjct: 174 KGPIIKVDPQGRCAGVFAFEQQMIVLKAAEVNSSLAGEDSAFSAGGASA-RIESSYIITL 232 Query: 741 RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920 RDLD++HVKDF+F+HGYIEPVMVILHERELTW+GR+SWKHHTCM+SA SISTTLKQ PLI Sbjct: 233 RDLDVRHVKDFTFLHGYIEPVMVILHERELTWSGRVSWKHHTCMVSAFSISTTLKQHPLI 292 Query: 921 WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100 WSA NLPHDAYKL++VPSPIGGVLVIGAN+IHYHSQS SC+LA+N++ DNSQE+PRS Sbjct: 293 WSATNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSSSCSLALNNFVFFGDNSQEMPRS 352 Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280 S NVELDAANATWL+ DV MLSTKTGELLLLT++YDGR+VQ+L+LSKS+ASVLTSG+TTI Sbjct: 353 SINVELDAANATWLTSDVAMLSTKTGELLLLTIIYDGRIVQKLDLSKSRASVLTSGITTI 412 Query: 1281 GNSFIFLGSRLGDSMLVQFTCGM-GTSVP---KEEVGDIEGDVPSAKRLRMASSDALQDI 1448 G+S FLGSRLGDS+LVQF+ G+ G+++P +EEVGDIE D PSAKRLRM+SSDALQD+ Sbjct: 413 GDSLFFLGSRLGDSLLVQFSSGLGGSNLPPGVQEEVGDIESDAPSAKRLRMSSSDALQDM 472 Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628 ++GEELSLY +APN ++ AQKTFSFAVRDSLIN+GPLKDFSYG+RINAD NATGIAKQSN Sbjct: 473 INGEELSLYGTAPNNAQSAQKTFSFAVRDSLINVGPLKDFSYGMRINADLNATGIAKQSN 532 Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808 YELVCCSGHGKNG+L VLQQSIRPE IT+V LPGCKGIWTVYHK+ R H ++SS+M + Sbjct: 533 YELVCCSGHGKNGSLSVLQQSIRPETITQVSLPGCKGIWTVYHKNTRIHLSESSRMADEE 592 Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988 DEYHA+LIISLE+RTMVL+TA+ L EVTE+VDYYVQG+T+AAGNLFGRRRVIQVFA GAR Sbjct: 593 DEYHAYLIISLEARTMVLQTANNLEEVTENVDYYVQGTTLAAGNLFGRRRVIQVFAHGAR 652 Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168 +LDG++MTQ+LSF A N E PYVLLRMT+GS+QLLVGDPS+C+V Sbjct: 653 ILDGAFMTQELSFKASNVESGSSSDTSIVASVSIADPYVLLRMTNGSLQLLVGDPSSCSV 712 Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348 S+++P FESSK SI+ACTLYHDKGPEPWLRKTSTDAWLS+GMGEAIDGADG + DQGD+ Sbjct: 713 SLTVPSVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSSGMGEAIDGADGVIQDQGDV 772 Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAK----DSQNSKNEVVA 2516 YCVVCYE+GTLEIFDVPSF CVFSV+KFISG T+LVDTF++++ S+N+++ + Sbjct: 773 YCVVCYENGTLEIFDVPSFTCVFSVDKFISGRTYLVDTFMQDSVNGLHAHSKNTEDVIRP 832 Query: 2517 GQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESV 2696 GQ + + +VEL M RW G+H+RP+LFG+L DGT+L YHAY++EG E+++K++ SV Sbjct: 833 GQKENSKDVKINVVELMMHRWIGKHSRPFLFGILADGTILSYHAYVFEGSENSSKVDGSV 892 Query: 2697 SGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGS 2876 S QN++ LSS + SRLRNLRFVRVP++ Y REE + S QRM V+KN+GG QG+FL+GS Sbjct: 893 SSQNSISLSSTNASRLRNLRFVRVPVDNYAREEMPSGSQLQRMNVYKNIGGSQGIFLTGS 952 Query: 2877 RPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDN 3056 RP+WFMV RERLR+HPQLCDG IVAF VLHNVNCNHGLIYVT+ G LKICQLPS LSYDN Sbjct: 953 RPSWFMVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTALGTLKICQLPSFLSYDN 1012 Query: 3057 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSP 3236 YWPVQKIPLKGTPHQV YFAEKN+Y +IVSVPV++PLNQVLSS+ DQ+ G Q + DN + Sbjct: 1013 YWPVQKIPLKGTPHQVAYFAEKNVYSVIVSVPVLKPLNQVLSSIADQEVGQQFDPDNLN- 1071 Query: 3237 DELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLL 3416 +Y ++E+EVRILEPEKSG PW+TRA+IPMQ+SENALTVRMVTLFNT +KENETLL Sbjct: 1072 --YEGSYPIEEFEVRILEPEKSGGPWKTRASIPMQSSENALTVRMVTLFNTKTKENETLL 1129 Query: 3417 AIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIAS 3596 A+GTAYVQGEDVAARGRVLLFS+ R DNS+ LVSEVYSKELKGAI ALASLQGHLLIAS Sbjct: 1130 AVGTAYVQGEDVAARGRVLLFSIDRTADNSRTLVSEVYSKELKGAIPALASLQGHLLIAS 1189 Query: 3597 GPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSL 3776 GPK+ILHKWTG+EL GVAF D PPL+ VSLNIVKNFIL+GDIHKSI F+SWKE QLSL Sbjct: 1190 GPKIILHKWTGSELNGVAFCDYPPLHAVSLNIVKNFILLGDIHKSISFVSWKE--PQLSL 1247 Query: 3777 LAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVG 3956 LAKDF LDC +TEFLIDGSTLSL+VSD+QKNVQIFYYAPK+SESWKGQKLLSRAEFHVG Sbjct: 1248 LAKDFSPLDCLATEFLIDGSTLSLVVSDDQKNVQIFYYAPKVSESWKGQKLLSRAEFHVG 1307 Query: 3957 AHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQ 4136 + +TKFLRLQ+LPT+S+RT TPGSDKTNRFA +FGTL+GS+GCIAPLDELTFRRLQ+LQ Sbjct: 1308 SRITKFLRLQLLPTTSERTATTPGSDKTNRFATVFGTLEGSLGCIAPLDELTFRRLQSLQ 1367 Query: 4137 KKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIG 4316 KKLV +V H+AGLNPR+FR FRSNGKAHRPGPDN++DCELL HYEMLPLE+QLE+A QIG Sbjct: 1368 KKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLSHYEMLPLEEQLEIAQQIG 1427 Query: 4317 TTRTQILTNLNDLSLGTSFL 4376 TTR QI++NLND+ LGTSFL Sbjct: 1428 TTRMQIMSNLNDMILGTSFL 1447 >ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Solanum tuberosum] Length = 1447 Score = 2140 bits (5544), Expect = 0.0 Identities = 1051/1460 (71%), Positives = 1246/1460 (85%), Gaps = 9/1460 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203 MS+AA K MH PTGIENCASGFI+H +++ PQI + D+DSDWPA K +GP+PNLVV Sbjct: 1 MSFAACKTMHCPTGIENCASGFITHSAAEITPQIRTA---DVDSDWPATKPVGPMPNLVV 57 Query: 204 TAGNILEVYVIRVQEEEDGRVSRSPSEA-KRGGFMAGLSGASLELVCHYKLHGNVETMAV 380 +AGN+LEVY+IR+++ SR +E KRGG MAG+S ASLELVC Y+LHGN+ +M V Sbjct: 58 SAGNVLEVYLIRIEQAS----SRDAAEVVKRGGLMAGISAASLELVCTYRLHGNIYSMGV 113 Query: 381 LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560 ++ G+DG KRRDSIIL+F+DAK+SVLEFDD+ +GLRTSSMH FEGP+W HLKRGRESF Sbjct: 114 ITAGGADGGKRRDSIILSFEDAKMSVLEFDDATHGLRTSSMHFFEGPDWLHLKRGRESFD 173 Query: 561 RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740 +GP++KVDPQGRC GV + QMI+LK IESSY+I+L Sbjct: 174 KGPIIKVDPQGRCAGVFAFEQQMIVLKAAEVNSSLAGEDSAFSAGGASA-RIESSYIITL 232 Query: 741 RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920 RDLD++HVKDF+F+HGYIEPVMVILHERELTW+GR+SWKHHTCM+SA SISTTLKQ PLI Sbjct: 233 RDLDVRHVKDFTFLHGYIEPVMVILHERELTWSGRVSWKHHTCMVSAFSISTTLKQHPLI 292 Query: 921 WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100 WSA NLPHDAYKL++VPSPIGGVLVIGAN+IHYHSQS SC+LA+N++A DNSQE+PRS Sbjct: 293 WSAANLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSSSCSLALNNFAFFGDNSQEMPRS 352 Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280 SFNVELDAANATWL+ DV MLSTKTGELLLLT++YDGR+VQ+L+LSKS+ASVLTSG+TTI Sbjct: 353 SFNVELDAANATWLTSDVAMLSTKTGELLLLTIIYDGRIVQKLDLSKSRASVLTSGITTI 412 Query: 1281 GNSFIFLGSRLGDSMLVQFTCGM-GTSVP---KEEVGDIEGDVPSAKRLRMASSDALQDI 1448 G+S FLGSRLGDS+LVQF+CG+ G+++P +EEVGDIE D PSAKRLRM+SSDALQD+ Sbjct: 413 GDSLFFLGSRLGDSLLVQFSCGLGGSNLPPGVQEEVGDIESDAPSAKRLRMSSSDALQDM 472 Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628 ++GEELSLY +APN ++ AQKTFSFAVRDSLIN+GPLKDFSYG+RINAD NATGIAKQSN Sbjct: 473 INGEELSLYGTAPNNAQSAQKTFSFAVRDSLINVGPLKDFSYGMRINADLNATGIAKQSN 532 Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808 YELVCCSGHGKNG+LCVLQQSIRPE IT+ LPGCKGIWTVYHK+ R H ++SS+M + Sbjct: 533 YELVCCSGHGKNGSLCVLQQSIRPETITQEALPGCKGIWTVYHKNTRIHLSESSRMADEE 592 Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988 DEYHA+LIISLE+RTMVL+TA+ L EVTE+VDYYVQG+T+AAGNLFGRRRVIQVFA GAR Sbjct: 593 DEYHAYLIISLEARTMVLQTANNLEEVTENVDYYVQGTTLAAGNLFGRRRVIQVFAHGAR 652 Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168 +LDG++MTQ+LSF A N E PYVLLRMT+GS+QLLVGDPS+C+V Sbjct: 653 ILDGAFMTQELSFKASNVESGSSSDTSIVASVSIADPYVLLRMTNGSLQLLVGDPSSCSV 712 Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348 S+++P FESSK SI+ACTLYHDKGPEPWLRKTSTDAWLS+GMGEAIDGADG DQGD+ Sbjct: 713 SLTVPSVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSSGMGEAIDGADGVTQDQGDV 772 Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAK----DSQNSKNEVVA 2516 YCVVCYE+GTLEIFDVP+F CVFSV+KFISG T+LVDTF++++ S+N+++ + Sbjct: 773 YCVVCYENGTLEIFDVPNFTCVFSVDKFISGRTYLVDTFMQDSVNGLHAHSKNTEDVIRP 832 Query: 2517 GQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESV 2696 GQ + + +VEL M RW G+H+RP+LFG+L DGT+L YHAY++EG E+++K+E SV Sbjct: 833 GQKENSKDVKINVVELMMHRWIGKHSRPFLFGILADGTILSYHAYVFEGSENSSKVEGSV 892 Query: 2697 SGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGS 2876 S QN++ LSS + SRLRNLRFVRVP++ Y REE + + QRM V+KN+GG QG+FL+GS Sbjct: 893 SSQNSISLSSTNASRLRNLRFVRVPVDNYAREEMPSGTQLQRMNVYKNIGGSQGIFLTGS 952 Query: 2877 RPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDN 3056 RP+WFMV RERLR+HPQLCDG IVAF VLHNVNCNHGLIYVT+ G LKICQLPS LSYDN Sbjct: 953 RPSWFMVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTALGTLKICQLPSFLSYDN 1012 Query: 3057 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSP 3236 YWPVQKIPLKGTPHQV YFAEKN+Y +IVSVPV++PLNQVLS++ DQ+ G Q + DN + Sbjct: 1013 YWPVQKIPLKGTPHQVAYFAEKNVYSVIVSVPVLKPLNQVLSTIADQEVGQQFDPDNLN- 1071 Query: 3237 DELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLL 3416 +Y ++E+EVRI+EPEKSG W+TRA+IPMQ+SENALTVRMVTL NTT++ENETLL Sbjct: 1072 --YEGSYPIEEFEVRIVEPEKSGGLWKTRASIPMQSSENALTVRMVTLLNTTTRENETLL 1129 Query: 3417 AIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIAS 3596 A+GTAYVQGEDVAARGRVLLFS+ R DNS+ LVSEVYSKELKGAI ALASLQGHLLIAS Sbjct: 1130 AVGTAYVQGEDVAARGRVLLFSIDRTADNSRTLVSEVYSKELKGAIPALASLQGHLLIAS 1189 Query: 3597 GPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSL 3776 GPK+ILHKWTG+EL GVAF D PPL+ VSLNIVKNFIL+GDIHKSI F+SWKE QLSL Sbjct: 1190 GPKIILHKWTGSELNGVAFCDYPPLHAVSLNIVKNFILLGDIHKSISFVSWKE--PQLSL 1247 Query: 3777 LAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVG 3956 LAKDF LDC +TEFLIDGSTLSL+VSD+QKNVQIFYYAPK+SESWKGQKLLSRAEFHVG Sbjct: 1248 LAKDFSPLDCLATEFLIDGSTLSLVVSDDQKNVQIFYYAPKVSESWKGQKLLSRAEFHVG 1307 Query: 3957 AHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQ 4136 + +TKFLRLQ+LPT+S+RT TPGSDKTNRFA +FGTL+GS+GCIAPLDELTFRRLQ+LQ Sbjct: 1308 SRITKFLRLQLLPTTSERTATTPGSDKTNRFATVFGTLEGSLGCIAPLDELTFRRLQSLQ 1367 Query: 4137 KKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIG 4316 KKLV +V H+AGLNPR+FR FRSNGKAHRPGPDN++DCELL HYEMLPLE+QLE+A QIG Sbjct: 1368 KKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLSHYEMLPLEEQLEIAQQIG 1427 Query: 4317 TTRTQILTNLNDLSLGTSFL 4376 TTR QI++NLND+ LGTSFL Sbjct: 1428 TTRMQIMSNLNDMILGTSFL 1447 >ref|XP_006851814.1| hypothetical protein AMTR_s00041p00037520 [Amborella trichopoda] gi|548855397|gb|ERN13281.1| hypothetical protein AMTR_s00041p00037520 [Amborella trichopoda] Length = 1459 Score = 2130 bits (5520), Expect = 0.0 Identities = 1039/1463 (71%), Positives = 1232/1463 (84%), Gaps = 12/1463 (0%) Frame = +3 Query: 24 MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDW---PARKGIGPIPN 194 MS++AFK+MHW T +ENCASG+I+H SDFA +P+I N+D DS+W P + IGPIPN Sbjct: 1 MSFSAFKLMHWATAVENCASGYITHSFSDFAASVPAIQNDDPDSEWDQPPVKARIGPIPN 60 Query: 195 LVVTAGNILEVYVIRVQEEED--GRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVE 368 LV++A N+LEVYVIRVQEE+ R S + S +K GG +AG+SGA LELVCHY+LHGNVE Sbjct: 61 LVISAANVLEVYVIRVQEEDSKASRHSGASSGSKGGGVVAGVSGAWLELVCHYRLHGNVE 120 Query: 369 TMAVLSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGR 548 TMA+LS DG ++RDSIIL F++AKISVLE+DDS++GLRTSSMHCFEGPEW +LKRGR Sbjct: 121 TMAILSAGSDDGRRKRDSIILTFKEAKISVLEYDDSLHGLRTSSMHCFEGPEWQYLKRGR 180 Query: 549 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSY 728 E+FARGPLV VDPQGRCGGVL+ QM+ILK IESSY Sbjct: 181 ENFARGPLVYVDPQGRCGGVLICDSQMLILKASKAGYGFVGDDDANGPTGTVSIRIESSY 240 Query: 729 VISLRDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQ 908 VISLR+LDMKHVKDF FVHGYIEPVMVILHE+ELTWAGR+SWKHHTCMISA SISTTLKQ Sbjct: 241 VISLRELDMKHVKDFVFVHGYIEPVMVILHEKELTWAGRLSWKHHTCMISAFSISTTLKQ 300 Query: 909 QPLIWSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQE 1088 PLIWSA NLPHDAYKL+SVPSPIGGVLVI ANSIHYHSQS+SCALA+N +AVA +NSQE Sbjct: 301 HPLIWSASNLPHDAYKLLSVPSPIGGVLVICANSIHYHSQSMSCALALNDFAVAGENSQE 360 Query: 1089 LPRSSFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSG 1268 +PRS+ NVELDAA+ATWLS+DV + S KTGELLLLT+VYDGR VQRLELSKS+ASVLTSG Sbjct: 361 MPRSNVNVELDAAHATWLSNDVALFSIKTGELLLLTLVYDGRTVQRLELSKSRASVLTSG 420 Query: 1269 VTTIGNSFIFLGSRLGDSMLVQFTCGMGTSV-----PKEEVGDIEGDVPSAKRLRMASSD 1433 +TTI N F FLGSRLGDS+LVQF G S+ K+E +IEGD P+AKRLR ASSD Sbjct: 421 ITTISNHFFFLGSRLGDSLLVQFNSGASQSILTSRQGKDEGVEIEGDAPAAKRLRRASSD 480 Query: 1434 ALQDIVSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGI 1613 QD+ EELSLY SAPN + AQK+FSFAVRDSL+N+GPLKDFSYGLRINADPNA+G+ Sbjct: 481 VSQDV--NEELSLYVSAPNNLDPAQKSFSFAVRDSLLNVGPLKDFSYGLRINADPNASGV 538 Query: 1614 AKQSNYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSK 1793 AKQSNYELV CSGHGKNGALC+LQQSIRPEL+TEVEL G KGIWTVYHK+ R H+ DSSK Sbjct: 539 AKQSNYELVSCSGHGKNGALCLLQQSIRPELVTEVELTGVKGIWTVYHKNPRSHSTDSSK 598 Query: 1794 MVSLDDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVF 1973 + + DE+HA+LIISLESRTMVLETADLLGEVTESVDYYVQGSTI AGNLFGRRRV+Q++ Sbjct: 599 VAAEGDEFHAYLIISLESRTMVLETADLLGEVTESVDYYVQGSTIVAGNLFGRRRVVQIY 658 Query: 1974 ARGARVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDP 2153 RGAR+LDG+YMTQDL FG +++ PYVLLRM DGSIQLL GDP Sbjct: 659 VRGARILDGAYMTQDLPFGHSSTDSTSNSEASTVSSASIADPYVLLRMVDGSIQLLTGDP 718 Query: 2154 STCTVSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALH 2333 +TCT++ S+P FE+ D I ACT YHDKGPEPWLRK S DAWLS+G+ EA+DG+DG+ H Sbjct: 719 ATCTITSSLPAVFENLSDPITACTFYHDKGPEPWLRKASPDAWLSSGVAEALDGSDGSQH 778 Query: 2334 DQGDIYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEVV 2513 DQGDIYC+VCYESG LEIFDVPSFKC+FSV++F++ THL D +E+ KD +K+ Sbjct: 779 DQGDIYCLVCYESGRLEIFDVPSFKCLFSVDRFLTVRTHLFDAHHQESFKDLHKAKSSYS 838 Query: 2514 --AGQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIE 2687 + Q++KE +NMK+VEL M RW+G++ RP+LF L DG+ LCYHAYLYEGQ++ TK++ Sbjct: 839 EDSDQLEKEAVKNMKVVELCMHRWSGQYGRPFLFAGLADGSTLCYHAYLYEGQDNMTKLD 898 Query: 2688 ESVSGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFL 2867 +S S +N V+ S+ STSRLRNLRF+RV L++ TREE+ V +++ F N+GG+QG F+ Sbjct: 899 DSNSAENVVEASNTSTSRLRNLRFIRVSLQSLTREESPGVVSYRKIITFMNIGGHQGAFI 958 Query: 2868 SGSRPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLS 3047 +GSRPAW +VCRERLR+H QLCDG IVAF VLHNVNCN+G IYVTS+GFLKIC+LPS L+ Sbjct: 959 AGSRPAWLIVCRERLRIHHQLCDGPIVAFTVLHNVNCNYGCIYVTSQGFLKICRLPSWLN 1018 Query: 3048 YDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDN 3227 YDNYWPVQKIPL+GTPHQVTY AEKNLY LI+S+PV +P +QVL+SLVD + HQ EHDN Sbjct: 1019 YDNYWPVQKIPLRGTPHQVTYIAEKNLYALILSIPVAKPSSQVLTSLVDHEVSHQSEHDN 1078 Query: 3228 FSPDELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENE 3407 +P++L+++Y +DE+EVRILEP K+G PW+T+ATIPMQ+SE+A+T+R+V+L NT +KE + Sbjct: 1079 LNPEDLNRSYFIDEFEVRILEPAKAGGPWETKATIPMQSSEHAITIRLVSLSNTLTKETD 1138 Query: 3408 TLLAIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLL 3587 TLLA+GTAYVQGEDVAARGR++L+SVG+N DN QN V+EV+SKELKGAISALA LQGHLL Sbjct: 1139 TLLAVGTAYVQGEDVAARGRIILYSVGQNADNPQNWVTEVHSKELKGAISALAPLQGHLL 1198 Query: 3588 IASGPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQ 3767 IASGPK++LHKW G ELT VAF+D PLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQ Sbjct: 1199 IASGPKIVLHKWNGTELTAVAFFDA-PLYVVSLNIVKNFILMGDIHKSIYFLSWKEQGAQ 1257 Query: 3768 LSLLAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEF 3947 LSLLAKDFG+L+C++TEFLIDGSTLSL+VSD+QKN+QIFYYAPK+ ESWKGQKLL +AEF Sbjct: 1258 LSLLAKDFGSLNCYTTEFLIDGSTLSLVVSDDQKNIQIFYYAPKLMESWKGQKLLPKAEF 1317 Query: 3948 HVGAHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQ 4127 HVG+HVT+ +RLQMLPT SDRT A+ G+DKTNRFALLFGTLDGS+GCIAPL+ELTFRRLQ Sbjct: 1318 HVGSHVTRLMRLQMLPT-SDRTSASSGTDKTNRFALLFGTLDGSIGCIAPLEELTFRRLQ 1376 Query: 4128 TLQKKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAH 4307 LQ+KLVD VPH GLNPRAFR FRSNGKAH+PGPDNM+DCELL HYEMLPLE+QL++AH Sbjct: 1377 MLQRKLVDRVPHACGLNPRAFRQFRSNGKAHKPGPDNMVDCELLCHYEMLPLEEQLDIAH 1436 Query: 4308 QIGTTRTQILTNLNDLSLGTSFL 4376 QIGTTR QI+TNL+DL+LGTSFL Sbjct: 1437 QIGTTRAQIVTNLSDLTLGTSFL 1459