BLASTX nr result
ID: Ophiopogon26_contig00019213
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ophiopogon26_contig00019213 (4584 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_020260753.1| cleavage and polyadenylation specificity fac... 2443 0.0 ref|XP_020260752.1| cleavage and polyadenylation specificity fac... 2439 0.0 ref|XP_008791964.1| PREDICTED: cleavage and polyadenylation spec... 2286 0.0 ref|XP_009383988.1| PREDICTED: cleavage and polyadenylation spec... 2283 0.0 ref|XP_008791963.1| PREDICTED: cleavage and polyadenylation spec... 2282 0.0 ref|XP_018676515.1| PREDICTED: cleavage and polyadenylation spec... 2276 0.0 ref|XP_008791966.1| PREDICTED: cleavage and polyadenylation spec... 2275 0.0 ref|XP_018676516.1| PREDICTED: cleavage and polyadenylation spec... 2274 0.0 ref|XP_010918168.1| PREDICTED: cleavage and polyadenylation spec... 2266 0.0 ref|XP_010918167.1| PREDICTED: cleavage and polyadenylation spec... 2261 0.0 ref|XP_020084456.1| probable cleavage and polyadenylation specif... 2237 0.0 gb|OAY63492.1| putative cleavage and polyadenylation specificity... 2196 0.0 ref|XP_020672482.1| cleavage and polyadenylation specificity fac... 2194 0.0 ref|XP_010277399.1| PREDICTED: cleavage and polyadenylation spec... 2161 0.0 ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation spec... 2153 0.0 emb|CBI24510.3| unnamed protein product, partial [Vitis vinifera] 2146 0.0 ref|XP_019081674.1| PREDICTED: cleavage and polyadenylation spec... 2146 0.0 ref|XP_015877866.1| PREDICTED: cleavage and polyadenylation spec... 2133 0.0 gb|EOY22974.1| Cleavage and polyadenylation specificity factor 1... 2128 0.0 ref|XP_017972864.1| PREDICTED: cleavage and polyadenylation spec... 2127 0.0 >ref|XP_020260753.1| cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Asparagus officinalis] gb|ONK71681.1| uncharacterized protein A4U43_C04F11250 [Asparagus officinalis] Length = 1454 Score = 2443 bits (6332), Expect = 0.0 Identities = 1219/1454 (83%), Positives = 1301/1454 (89%), Gaps = 5/1454 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAAFKMMHWPTGIENCASGFITH RRIGPIPNL Sbjct: 1 MSYAAFKMMHWPTGIENCASGFITHSPADAAASVILPHLQPDDLDSDFAPTRRIGPIPNL 60 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTA NV+EVY VRVQEDD R A EPRGGGG+MDGL GARLELVCHYRLHGNV+SM Sbjct: 61 VVTAGNVVEVYIVRVQEDDARVSQAPIEPRGGGGIMDGLRGARLELVCHYRLHGNVQSMA 120 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 VLSLGGDE SKRRDSIVLAFQDAK+TVLE+DDSTHGLRTSSMHCFEGPDWL+LK+GR SF Sbjct: 121 VLSLGGDERSKRRDSIVLAFQDAKLTVLEYDDSTHGLRTSSMHCFEGPDWLFLKKGRESF 180 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTS-VCIESSFVL 738 A GP+VKADPLGRCGG LVYD QMIVLKAAQ GQGLV D+EP+S+GG+T+ V IESS+V+ Sbjct: 181 AMGPLVKADPLGRCGGALVYDLQMIVLKAAQTGQGLVTDDEPTSVGGSTAAVRIESSYVI 240 Query: 739 NLRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHP 918 NLR+LDM+HVKDFTFVHGYIEPVMV LHE+EPTWAGRISWKHHTC+ISALSISTTLKQHP Sbjct: 241 NLRELDMKHVKDFTFVHGYIEPVMVFLHEREPTWAGRISWKHHTCLISALSISTTLKQHP 300 Query: 919 LIWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMP 1098 LIWSA NLPHDAYKLLAVPSPIGG+LVICANSIHYHSQS+SC+LGLN FA+QM++SSDMP Sbjct: 301 LIWSAANLPHDAYKLLAVPSPIGGVLVICANSIHYHSQSSSCALGLNSFAMQMDSSSDMP 360 Query: 1099 KTNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGIT 1278 K +FNVELDAAKATWLS DV +FSSK GELLLLTLV DGR+VQKLDL KSKASVLTSGIT Sbjct: 361 KASFNVELDAAKATWLSPDVVIFSSKNGELLLLTLVSDGRIVQKLDLMKSKASVLTSGIT 420 Query: 1279 SIGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQD 1458 ++G+SFFFLGSRLGDSLLVQYSYGASSP+SAHVKDE +IEGD+P KRL+ MSSDSLQD Sbjct: 421 TMGSSFFFLGSRLGDSLLVQYSYGASSPTSAHVKDEIAEIEGDVPAVKRLKRMSSDSLQD 480 Query: 1459 VASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQS 1638 VAS EELSLYAT PNSSES QKVFSFAVRD+LINVGPLKDFSYGLRINADPNATG+AKQS Sbjct: 481 VASAEELSLYATAPNSSESTQKVFSFAVRDTLINVGPLKDFSYGLRINADPNATGLAKQS 540 Query: 1639 NYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTE 1818 NYELVCCSGHGKNG+LCVLQQSIRPELITEVELPGCKGIWTVYHKSSR HAVDSSKT TE Sbjct: 541 NYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSSRRHAVDSSKTLTE 600 Query: 1819 DDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGA 1998 DDEYHAYLIISLETRTMVLETADDLGEVTESV YYVQGSTIAAGNLFGRRRVVQIFARGA Sbjct: 601 DDEYHAYLIISLETRTMVLETADDLGEVTESVGYYVQGSTIAAGNLFGRRRVVQIFARGA 660 Query: 1999 RILDGSYMTQELSFVVH---XXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPS 2169 RILDGSYMTQELSF ++ IADPYVLLKMTDGSIQLLAGD S Sbjct: 661 RILDGSYMTQELSFAMNNSESASSASNSEPSTVSSVSIADPYVLLKMTDGSIQLLAGDTS 720 Query: 2170 TCTVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPD 2349 TCTVS+NVPARFASLTD ++ACTLYHDK PEPWLRKASSDAWLSTG+AE ID +DGSYPD Sbjct: 721 TCTVSVNVPARFASLTDPISACTLYHDKGPEPWLRKASSDAWLSTGIAEAIDGSDGSYPD 780 Query: 2350 QGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKTKGLDETAKK 2529 QGDIYCLVCYESG LEIFDVPNFRSVFS DKF+SGKSHLMD CI+P K K + +ET KK Sbjct: 781 QGDIYCLVCYESGILEIFDVPNFRSVFSVDKFMSGKSHLMDMCIVPQKTKPRNFEETIKK 840 Query: 2530 EPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVYPLST 2709 EP QNLKV+EL MQRWSGQYSRPFLF ILNDGTML YHAYLYEG E APK E A+ P S+ Sbjct: 841 EPTQNLKVMELVMQRWSGQYSRPFLFAILNDGTMLSYHAYLYEGQENAPKAEDALSPHSS 900 Query: 2710 GDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSRPAWL 2889 ++++TG SR RNLRF RVSID NI+E+SSN+T RPRITIFKNVGG QGLFL+GSRPAW Sbjct: 901 SNMNSTGTSRMRNLRFQRVSIDTNIQEDSSNVTVRPRITIFKNVGGYQGLFLTGSRPAWF 960 Query: 2890 MLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNYWPVQ 3069 MLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPS CNYDNYWPVQ Sbjct: 961 MLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSACNYDNYWPVQ 1020 Query: 3070 KIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDN-NQMDQDSASSDSLQK 3246 KIPLR TPHQVTYF EK+LYPVIVSVPVVRPLSQVLSSLVDQD+ NQMDQD+ SSD LQ Sbjct: 1021 KIPLRGTPHQVTYFAEKSLYPVIVSVPVVRPLSQVLSSLVDQDSGNQMDQDTVSSDGLQN 1080 Query: 3247 SYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLAIGTA 3426 +Y IDEFEVRIFEPEKSGGHWEPKATIPMQ SENALTVR VTL+NSTT ENE+LLAIGTA Sbjct: 1081 NYCIDEFEVRIFEPEKSGGHWEPKATIPMQTSENALTVRMVTLLNSTTKENESLLAIGTA 1140 Query: 3427 YVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASGTKIT 3606 YVQGED+AARGR+LLF+IGR++EN QNLVSEVYSKELKGAISALASLQGHLLIASG KI Sbjct: 1141 YVQGEDIAARGRVLLFSIGRNSENLQNLVSEVYSKELKGAISALASLQGHLLIASGPKII 1200 Query: 3607 LNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLAKDFS 3786 L+KWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLLAKDF Sbjct: 1201 LHKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLLAKDFG 1260 Query: 3787 SLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAHVSKF 3966 SLDCFATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLLSRAEFHAGAH++KF Sbjct: 1261 SLDCFATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLSRAEFHAGAHITKF 1320 Query: 3967 LRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXXXVDV 4146 LRLQMLPTSSDR+S PGSDKTNRFA+LFGTLDGSIGCVAPLEE VD Sbjct: 1321 LRLQMLPTSSDRASGAPGSDKTNRFALLFGTLDGSIGCVAPLEELTFRRLQTLQKRLVDA 1380 Query: 4147 VPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTTRSQI 4326 VPHVCGLNPR+FRQF SNGKAHRPGPDNM+D ELLSHYE+LPLEEQLEIAHQIGTTRSQI Sbjct: 1381 VPHVCGLNPRSFRQFHSNGKAHRPGPDNMVDFELLSHYELLPLEEQLEIAHQIGTTRSQI 1440 Query: 4327 LSNLNDLSLGTSFL 4368 LSNLNDLSLGTSFL Sbjct: 1441 LSNLNDLSLGTSFL 1454 >ref|XP_020260752.1| cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Asparagus officinalis] Length = 1455 Score = 2439 bits (6320), Expect = 0.0 Identities = 1219/1455 (83%), Positives = 1301/1455 (89%), Gaps = 6/1455 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAAFKMMHWPTGIENCASGFITH RRIGPIPNL Sbjct: 1 MSYAAFKMMHWPTGIENCASGFITHSPADAAASVILPHLQPDDLDSDFAPTRRIGPIPNL 60 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTA NV+EVY VRVQEDD R A EPRGGGG+MDGL GARLELVCHYRLHGNV+SM Sbjct: 61 VVTAGNVVEVYIVRVQEDDARVSQAPIEPRGGGGIMDGLRGARLELVCHYRLHGNVQSMA 120 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 VLSLGGDE SKRRDSIVLAFQDAK+TVLE+DDSTHGLRTSSMHCFEGPDWL+LK+GR SF Sbjct: 121 VLSLGGDERSKRRDSIVLAFQDAKLTVLEYDDSTHGLRTSSMHCFEGPDWLFLKKGRESF 180 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTS-VCIESSFVL 738 A GP+VKADPLGRCGG LVYD QMIVLKAAQ GQGLV D+EP+S+GG+T+ V IESS+V+ Sbjct: 181 AMGPLVKADPLGRCGGALVYDLQMIVLKAAQTGQGLVTDDEPTSVGGSTAAVRIESSYVI 240 Query: 739 NLRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHP 918 NLR+LDM+HVKDFTFVHGYIEPVMV LHE+EPTWAGRISWKHHTC+ISALSISTTLKQHP Sbjct: 241 NLRELDMKHVKDFTFVHGYIEPVMVFLHEREPTWAGRISWKHHTCLISALSISTTLKQHP 300 Query: 919 LIWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMP 1098 LIWSA NLPHDAYKLLAVPSPIGG+LVICANSIHYHSQS+SC+LGLN FA+QM++SSDMP Sbjct: 301 LIWSAANLPHDAYKLLAVPSPIGGVLVICANSIHYHSQSSSCALGLNSFAMQMDSSSDMP 360 Query: 1099 KTNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGIT 1278 K +FNVELDAAKATWLS DV +FSSK GELLLLTLV DGR+VQKLDL KSKASVLTSGIT Sbjct: 361 KASFNVELDAAKATWLSPDVVIFSSKNGELLLLTLVSDGRIVQKLDLMKSKASVLTSGIT 420 Query: 1279 SIGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQD 1458 ++G+SFFFLGSRLGDSLLVQYSYGASSP+SAHVKDE +IEGD+P KRL+ MSSDSLQD Sbjct: 421 TMGSSFFFLGSRLGDSLLVQYSYGASSPTSAHVKDEIAEIEGDVPAVKRLKRMSSDSLQD 480 Query: 1459 VASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQS 1638 VAS EELSLYAT PNSSES QKVFSFAVRD+LINVGPLKDFSYGLRINADPNATG+AKQS Sbjct: 481 VASAEELSLYATAPNSSESTQKVFSFAVRDTLINVGPLKDFSYGLRINADPNATGLAKQS 540 Query: 1639 NYELVCCSGHGKNGSLCVLQQSIRPELITE-VELPGCKGIWTVYHKSSRTHAVDSSKTTT 1815 NYELVCCSGHGKNG+LCVLQQSIRPELITE VELPGCKGIWTVYHKSSR HAVDSSKT T Sbjct: 541 NYELVCCSGHGKNGALCVLQQSIRPELITEQVELPGCKGIWTVYHKSSRRHAVDSSKTLT 600 Query: 1816 EDDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARG 1995 EDDEYHAYLIISLETRTMVLETADDLGEVTESV YYVQGSTIAAGNLFGRRRVVQIFARG Sbjct: 601 EDDEYHAYLIISLETRTMVLETADDLGEVTESVGYYVQGSTIAAGNLFGRRRVVQIFARG 660 Query: 1996 ARILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXX---IADPYVLLKMTDGSIQLLAGDP 2166 ARILDGSYMTQELSF ++ IADPYVLLKMTDGSIQLLAGD Sbjct: 661 ARILDGSYMTQELSFAMNNSESASSASNSEPSTVSSVSIADPYVLLKMTDGSIQLLAGDT 720 Query: 2167 STCTVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYP 2346 STCTVS+NVPARFASLTD ++ACTLYHDK PEPWLRKASSDAWLSTG+AE ID +DGSYP Sbjct: 721 STCTVSVNVPARFASLTDPISACTLYHDKGPEPWLRKASSDAWLSTGIAEAIDGSDGSYP 780 Query: 2347 DQGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKTKGLDETAK 2526 DQGDIYCLVCYESG LEIFDVPNFRSVFS DKF+SGKSHLMD CI+P K K + +ET K Sbjct: 781 DQGDIYCLVCYESGILEIFDVPNFRSVFSVDKFMSGKSHLMDMCIVPQKTKPRNFEETIK 840 Query: 2527 KEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVYPLS 2706 KEP QNLKV+EL MQRWSGQYSRPFLF ILNDGTML YHAYLYEG E APK E A+ P S Sbjct: 841 KEPTQNLKVMELVMQRWSGQYSRPFLFAILNDGTMLSYHAYLYEGQENAPKAEDALSPHS 900 Query: 2707 TGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSRPAW 2886 + ++++TG SR RNLRF RVSID NI+E+SSN+T RPRITIFKNVGG QGLFL+GSRPAW Sbjct: 901 SSNMNSTGTSRMRNLRFQRVSIDTNIQEDSSNVTVRPRITIFKNVGGYQGLFLTGSRPAW 960 Query: 2887 LMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNYWPV 3066 MLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPS CNYDNYWPV Sbjct: 961 FMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSACNYDNYWPV 1020 Query: 3067 QKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDN-NQMDQDSASSDSLQ 3243 QKIPLR TPHQVTYF EK+LYPVIVSVPVVRPLSQVLSSLVDQD+ NQMDQD+ SSD LQ Sbjct: 1021 QKIPLRGTPHQVTYFAEKSLYPVIVSVPVVRPLSQVLSSLVDQDSGNQMDQDTVSSDGLQ 1080 Query: 3244 KSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLAIGT 3423 +Y IDEFEVRIFEPEKSGGHWEPKATIPMQ SENALTVR VTL+NSTT ENE+LLAIGT Sbjct: 1081 NNYCIDEFEVRIFEPEKSGGHWEPKATIPMQTSENALTVRMVTLLNSTTKENESLLAIGT 1140 Query: 3424 AYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASGTKI 3603 AYVQGED+AARGR+LLF+IGR++EN QNLVSEVYSKELKGAISALASLQGHLLIASG KI Sbjct: 1141 AYVQGEDIAARGRVLLFSIGRNSENLQNLVSEVYSKELKGAISALASLQGHLLIASGPKI 1200 Query: 3604 TLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLAKDF 3783 L+KWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLLAKDF Sbjct: 1201 ILHKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLLAKDF 1260 Query: 3784 SSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAHVSK 3963 SLDCFATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLLSRAEFHAGAH++K Sbjct: 1261 GSLDCFATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLSRAEFHAGAHITK 1320 Query: 3964 FLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXXXVD 4143 FLRLQMLPTSSDR+S PGSDKTNRFA+LFGTLDGSIGCVAPLEE VD Sbjct: 1321 FLRLQMLPTSSDRASGAPGSDKTNRFALLFGTLDGSIGCVAPLEELTFRRLQTLQKRLVD 1380 Query: 4144 VVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTTRSQ 4323 VPHVCGLNPR+FRQF SNGKAHRPGPDNM+D ELLSHYE+LPLEEQLEIAHQIGTTRSQ Sbjct: 1381 AVPHVCGLNPRSFRQFHSNGKAHRPGPDNMVDFELLSHYELLPLEEQLEIAHQIGTTRSQ 1440 Query: 4324 ILSNLNDLSLGTSFL 4368 ILSNLNDLSLGTSFL Sbjct: 1441 ILSNLNDLSLGTSFL 1455 >ref|XP_008791964.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Phoenix dactylifera] Length = 1455 Score = 2286 bits (5925), Expect = 0.0 Identities = 1137/1458 (77%), Positives = 1255/1458 (86%), Gaps = 9/1458 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYA+FKMMHW TGIENCA+GFITH KRRIGPIPNL Sbjct: 1 MSYASFKMMHWATGIENCAAGFITHSRADFAAQIPPIQADDLESEWSP--KRRIGPIPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTA NV+E+Y VR+QEDD R + GGG+MDGL GARLELVCHYRLHGNVESM Sbjct: 59 VVTAGNVLEIYLVRIQEDDGRPDARPGGEQRGGGMMDGLSGARLELVCHYRLHGNVESMA 118 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 +LSLG D+ SKRRDSIVLAFQDAK+TVLE+DDSTHGLRTSSMHCFEGPDW +LKRGR F Sbjct: 119 ILSLGVDDRSKRRDSIVLAFQDAKLTVLEYDDSTHGLRTSSMHCFEGPDWHFLKRGRERF 178 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP++KADPLGRCGG LVY QMI+LK+AQAGQGLV D+EP+SLGGT IESS+V+N Sbjct: 179 ARGPLIKADPLGRCGGALVYGLQMIILKSAQAGQGLVGDDEPASLGGTFPARIESSYVIN 238 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 L DLDM+HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHTCMI ALSISTTLKQHP+ Sbjct: 239 LHDLDMKHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCMICALSISTTLKQHPM 298 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDA KLLAVPSPIGG+LVICAN+IHYHSQSASCSLGLN FA Q ENSS+MPK Sbjct: 299 IWSAVNLPHDASKLLAVPSPIGGVLVICANTIHYHSQSASCSLGLNNFANQPENSSEMPK 358 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 +N NVELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGRVVQ+LDL KSKASVLTSGIT+ Sbjct: 359 SNINVELDAANATWLSHDVAMFSSKTGELLLLTLVYDGRVVQRLDLMKSKASVLTSGITT 418 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IG+SFFFLGSRLGDSLLVQYS G S+P++A++KDE DIEGD P AKRLR MSSD+LQ+V Sbjct: 419 IGSSFFFLGSRLGDSLLVQYSCGTSTPTTANMKDEVADIEGDGPSAKRLRRMSSDALQEV 478 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 SGEELSLY+T PNSSESAQK+FSFAVRDSL+N+GPLKDFSY LRINADPNATGIAKQSN Sbjct: 479 VSGEELSLYSTAPNSSESAQKIFSFAVRDSLVNIGPLKDFSYSLRINADPNATGIAKQSN 538 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVCCSGHGKNG+LC+LQQSIRPELITEVELPGCKGIWTVYHKSSR H D SKT +ED Sbjct: 539 YELVCCSGHGKNGALCMLQQSIRPELITEVELPGCKGIWTVYHKSSRGHTADPSKTMSED 598 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2001 DEYHAYLIISLE RTMVLETADDLGEVTE+VDYY +GSTI+AGNLFGRRRVVQIFA GAR Sbjct: 599 DEYHAYLIISLEARTMVLETADDLGEVTETVDYY-KGSTISAGNLFGRRRVVQIFAHGAR 657 Query: 2002 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCTV 2181 ILDGSYMTQEL+F H IADP++LLKMTDGSIQLL GDPS CTV Sbjct: 658 ILDGSYMTQELNFGGHNSDSTTNSELSAVASVSIADPFILLKMTDGSIQLLVGDPSACTV 717 Query: 2182 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2361 S++VPA FA+ T S++ CTLYHDK PEPWLRK S+DAWLSTG+AE ID NDGSY DQGD+ Sbjct: 718 SVSVPAIFANSTKSISTCTLYHDKGPEPWLRKTSTDAWLSTGIAETIDGNDGSYHDQGDV 777 Query: 2362 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKT--------KGLDE 2517 YCLVCY++G L+IFDVP F+ VFS + FISGKSHL+DA I+ T +G+ Sbjct: 778 YCLVCYDNGRLDIFDVPTFKCVFSVENFISGKSHLVDAYILEPTISTQVNKEKMSEGVKV 837 Query: 2518 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2697 AKKE P+N+K+VELAM +WS QYSRPFLF ILNDGTMLCYHAY+YEG E A K E V Sbjct: 838 QAKKETPENMKIVELAMHQWSSQYSRPFLFAILNDGTMLCYHAYIYEGPEHAAKVEDVVS 897 Query: 2698 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 2877 P + DISN SR RNLRF RVS+DI REES + RPRIT+FKNVGG QGLFL+GSR Sbjct: 898 PHNAIDISNISTSRLRNLRFVRVSLDITAREESPDSVMRPRITMFKNVGGYQGLFLTGSR 957 Query: 2878 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 3057 PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPSV NYD+Y Sbjct: 958 PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSVFNYDSY 1017 Query: 3058 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 3234 WPVQK+PLR TPHQVTYF EKNLYP+I+SVPVVRPL+QVLSSL DQD +Q D++S SSD Sbjct: 1018 WPVQKVPLRGTPHQVTYFAEKNLYPLIISVPVVRPLNQVLSSLADQDIGHQTDRESVSSD 1077 Query: 3235 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 3414 LQK YS+DEFEVRI EPEKSGGHWE +A +PMQ SENALTVR +TL N+TT ENE+LLA Sbjct: 1078 DLQKFYSVDEFEVRILEPEKSGGHWETRAILPMQTSENALTVRVITLFNTTTRENESLLA 1137 Query: 3415 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 3594 IGTAYVQGEDVAARGR+LL++ +++EN QNLV EVYSKELKGA+SALA+LQGHLLIASG Sbjct: 1138 IGTAYVQGEDVAARGRVLLYSFSKNSENSQNLVLEVYSKELKGAVSALAALQGHLLIASG 1197 Query: 3595 TKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLA 3774 KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLLA Sbjct: 1198 PKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLLA 1257 Query: 3775 KDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAH 3954 KDF +LDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLL RAEFHAGAH Sbjct: 1258 KDFGTLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLCRAEFHAGAH 1317 Query: 3955 VSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXX 4134 V+KFLRLQMLPTS+DR++A GSDKTNRFA+LFGTLDGSIGC+APL+E Sbjct: 1318 VTKFLRLQMLPTSADRTTAATGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQTLQRK 1377 Query: 4135 XVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTT 4314 VD VPHVCGLNPR+FRQFR+NGKAHRPGPDNM+DCELLSHYEMLPL+EQLEIA QIGTT Sbjct: 1378 LVDAVPHVCGLNPRSFRQFRANGKAHRPGPDNMVDCELLSHYEMLPLDEQLEIAQQIGTT 1437 Query: 4315 RSQILSNLNDLSLGTSFL 4368 RSQILSNLNDLSLGTSFL Sbjct: 1438 RSQILSNLNDLSLGTSFL 1455 >ref|XP_009383988.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Musa acuminata subsp. malaccensis] Length = 1458 Score = 2283 bits (5915), Expect = 0.0 Identities = 1132/1458 (77%), Positives = 1245/1458 (85%), Gaps = 9/1458 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAAFKMMHWPTGIE+CASGFITH +RR+GP+PNL Sbjct: 1 MSYAAFKMMHWPTGIEHCASGFITHSPSDSSPPQIPPAQADDLDSSEWPTQRRVGPVPNL 60 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTAAN++EVY VR+QEDD R + GGGG MDGL GARLELVCHYRLHGNVESM Sbjct: 61 VVTAANILEVYLVRIQEDDDRIPRPTGDQHGGGGTMDGLAGARLELVCHYRLHGNVESMA 120 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 +LSLG D+ +KRRDSI+LAFQDAKITVLE+DDS+H LRTSSMHCFEGPDWLYLKRGR SF Sbjct: 121 ILSLGVDDHNKRRDSIMLAFQDAKITVLEYDDSSHELRTSSMHCFEGPDWLYLKRGRESF 180 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGPVVKADPLGRC G L+Y QMIVLKAAQ GQGLV D+EP+S GG SV IESS+V+N Sbjct: 181 ARGPVVKADPLGRCSGALIYGLQMIVLKAAQVGQGLVGDDEPTSTGGAVSVRIESSYVIN 240 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLDM+HVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHP+ Sbjct: 241 LRDLDMKHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPM 300 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSA N+PHDA KLLAVPSPIGG+LVICAN+IHYHSQSA+CSL LN FA Q E SS+MPK Sbjct: 301 IWSASNIPHDANKLLAVPSPIGGVLVICANTIHYHSQSATCSLALNSFATQPEGSSEMPK 360 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 F VELDAA TWLS DVA+FSSKTG+LLLLTL+YDGRVVQ+L+L KSKASVLTSGIT+ Sbjct: 361 AKFTVELDAANVTWLSPDVAMFSSKTGDLLLLTLIYDGRVVQRLELMKSKASVLTSGITT 420 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IG+SFFFLGSRLGDSLLVQYS G S P+SA+ KDE D EGDL +AKRLR SD+LQ+ Sbjct: 421 IGSSFFFLGSRLGDSLLVQYSTGTSGPTSANGKDEVADTEGDLHLAKRLRRTPSDALQEF 480 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 ASGEELSLY TTP+SSE+AQK FSF VRDSLINVGPLKDFSYGLRINADPNATGIAKQSN Sbjct: 481 ASGEELSLYTTTPDSSETAQKFFSFIVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 540 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVCCSGHGKNG+LCVLQQSIRPELITEVELPGCKGIW+VYHK SR HA DSSKT ED Sbjct: 541 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWSVYHKGSRGHAADSSKTMMED 600 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2001 DEYHAYLIISLETRTMVLETADDLGEVTE+VDYYVQGSTIAAGNLFGRRRVVQIFARGAR Sbjct: 601 DEYHAYLIISLETRTMVLETADDLGEVTETVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 660 Query: 2002 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCTV 2181 ILDGSYMTQEL+F VH IADPYVLLKMTDGSIQLL GDPSTCTV Sbjct: 661 ILDGSYMTQELTFGVHNSELISNSEAPVVASVSIADPYVLLKMTDGSIQLLVGDPSTCTV 720 Query: 2182 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2361 S+NVPA FAS T+ ++AC LYHDK PEPWLRK S+DAWLSTG+AE +D +DG Y DQGDI Sbjct: 721 SVNVPATFASSTELISACALYHDKGPEPWLRKTSTDAWLSTGIAEAVDGHDGLYNDQGDI 780 Query: 2362 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKTKGL-----DE--- 2517 YCLVCYESGTLEIFDV NF+ VFS D F+SGK+HL D ++G+ DE Sbjct: 781 YCLVCYESGTLEIFDVSNFKCVFSVDNFVSGKTHLFDKYAREPSRNSQGIKSKVTDEANG 840 Query: 2518 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2697 + KKE PQ++K+VELAMQRWSGQYSRPFLFGIL+DGTMLCYHAYLYEG E PK E AV Sbjct: 841 SVKKELPQDMKIVELAMQRWSGQYSRPFLFGILSDGTMLCYHAYLYEGLENTPKVEDAVS 900 Query: 2698 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 2877 P + ++ N ASR NLRF RV++D EE+SN+ +RPRIT+FKNVGG QGLFLSGSR Sbjct: 901 PHRSAEMGNVSASRLHNLRFIRVAVDTATSEEASNIVSRPRITVFKNVGGYQGLFLSGSR 960 Query: 2878 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 3057 PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLP+VCNYDNY Sbjct: 961 PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPAVCNYDNY 1020 Query: 3058 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDNNQM-DQDSASSD 3234 WPVQK+PLR TPHQVTY+ EKNLYP+IVSVPVV+PLSQVLSSLVDQD + D DS +SD Sbjct: 1021 WPVQKVPLRGTPHQVTYYSEKNLYPLIVSVPVVKPLSQVLSSLVDQDTVHLSDNDSVNSD 1080 Query: 3235 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 3414 LQK Y++DEFE+RI E KSGGHWE +ATIPMQ SENALTVR VTL N+TT ENETLL+ Sbjct: 1081 DLQKIYTVDEFEIRILESGKSGGHWETRATIPMQTSENALTVRVVTLFNTTTRENETLLS 1140 Query: 3415 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 3594 +GTAYVQGEDVAARGR+LL++ G++ EN QNLVSEVYSKELKGA+SALASLQGHLL+ASG Sbjct: 1141 VGTAYVQGEDVAARGRMLLYSFGKNTENAQNLVSEVYSKELKGAVSALASLQGHLLVASG 1200 Query: 3595 TKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLA 3774 KITL+KWTG+EL G+AFYDAPLHVVSLNIVKNF+LLGDIHKSIYFL WKEQGAQLSLLA Sbjct: 1201 PKITLHKWTGTELNGIAFYDAPLHVVSLNIVKNFVLLGDIHKSIYFLNWKEQGAQLSLLA 1260 Query: 3775 KDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAH 3954 KDF SLDC+ATEFLIDG TLSLVVSDD KNIQIFYYAP+ +ESWKGQKLL RAEFHAG H Sbjct: 1261 KDFGSLDCYATEFLIDGSTLSLVVSDDDKNIQIFYYAPKTLESWKGQKLLPRAEFHAGTH 1320 Query: 3955 VSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXX 4134 V+KFLRLQML +S DR++ PGSDKTNRFA+LF TLDGSIGC+APL+E Sbjct: 1321 VTKFLRLQMLSSSFDRANTAPGSDKTNRFALLFSTLDGSIGCIAPLDELTFRRLQTLQRK 1380 Query: 4135 XVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTT 4314 VD VPH CGLNPR+FRQFRSNGKAHRPGPDNM+DCELLS YEMLPL++QLEIA QIGTT Sbjct: 1381 LVDAVPHTCGLNPRSFRQFRSNGKAHRPGPDNMVDCELLSQYEMLPLDKQLEIAFQIGTT 1440 Query: 4315 RSQILSNLNDLSLGTSFL 4368 RSQILSNLNDLSLGTSFL Sbjct: 1441 RSQILSNLNDLSLGTSFL 1458 >ref|XP_008791963.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Phoenix dactylifera] Length = 1456 Score = 2282 bits (5913), Expect = 0.0 Identities = 1137/1459 (77%), Positives = 1255/1459 (86%), Gaps = 10/1459 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYA+FKMMHW TGIENCA+GFITH KRRIGPIPNL Sbjct: 1 MSYASFKMMHWATGIENCAAGFITHSRADFAAQIPPIQADDLESEWSP--KRRIGPIPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTA NV+E+Y VR+QEDD R + GGG+MDGL GARLELVCHYRLHGNVESM Sbjct: 59 VVTAGNVLEIYLVRIQEDDGRPDARPGGEQRGGGMMDGLSGARLELVCHYRLHGNVESMA 118 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 +LSLG D+ SKRRDSIVLAFQDAK+TVLE+DDSTHGLRTSSMHCFEGPDW +LKRGR F Sbjct: 119 ILSLGVDDRSKRRDSIVLAFQDAKLTVLEYDDSTHGLRTSSMHCFEGPDWHFLKRGRERF 178 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP++KADPLGRCGG LVY QMI+LK+AQAGQGLV D+EP+SLGGT IESS+V+N Sbjct: 179 ARGPLIKADPLGRCGGALVYGLQMIILKSAQAGQGLVGDDEPASLGGTFPARIESSYVIN 238 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 L DLDM+HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHTCMI ALSISTTLKQHP+ Sbjct: 239 LHDLDMKHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCMICALSISTTLKQHPM 298 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDA KLLAVPSPIGG+LVICAN+IHYHSQSASCSLGLN FA Q ENSS+MPK Sbjct: 299 IWSAVNLPHDASKLLAVPSPIGGVLVICANTIHYHSQSASCSLGLNNFANQPENSSEMPK 358 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 +N NVELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGRVVQ+LDL KSKASVLTSGIT+ Sbjct: 359 SNINVELDAANATWLSHDVAMFSSKTGELLLLTLVYDGRVVQRLDLMKSKASVLTSGITT 418 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IG+SFFFLGSRLGDSLLVQYS G S+P++A++KDE DIEGD P AKRLR MSSD+LQ+V Sbjct: 419 IGSSFFFLGSRLGDSLLVQYSCGTSTPTTANMKDEVADIEGDGPSAKRLRRMSSDALQEV 478 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 SGEELSLY+T PNSSESAQK+FSFAVRDSL+N+GPLKDFSY LRINADPNATGIAKQSN Sbjct: 479 VSGEELSLYSTAPNSSESAQKIFSFAVRDSLVNIGPLKDFSYSLRINADPNATGIAKQSN 538 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVCCSGHGKNG+LC+LQQSIRPELITEVELPGCKGIWTVYHKSSR H D SKT +ED Sbjct: 539 YELVCCSGHGKNGALCMLQQSIRPELITEVELPGCKGIWTVYHKSSRGHTADPSKTMSED 598 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2001 DEYHAYLIISLE RTMVLETADDLGEVTE+VDYY +GSTI+AGNLFGRRRVVQIFA GAR Sbjct: 599 DEYHAYLIISLEARTMVLETADDLGEVTETVDYY-KGSTISAGNLFGRRRVVQIFAHGAR 657 Query: 2002 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCTV 2181 ILDGSYMTQEL+F H IADP++LLKMTDGSIQLL GDPS CTV Sbjct: 658 ILDGSYMTQELNFGGHNSDSTTNSELSAVASVSIADPFILLKMTDGSIQLLVGDPSACTV 717 Query: 2182 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2361 S++VPA FA+ T S++ CTLYHDK PEPWLRK S+DAWLSTG+AE ID NDGSY DQGD+ Sbjct: 718 SVSVPAIFANSTKSISTCTLYHDKGPEPWLRKTSTDAWLSTGIAETIDGNDGSYHDQGDV 777 Query: 2362 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKT--------KGLDE 2517 YCLVCY++G L+IFDVP F+ VFS + FISGKSHL+DA I+ T +G+ Sbjct: 778 YCLVCYDNGRLDIFDVPTFKCVFSVENFISGKSHLVDAYILEPTISTQVNKEKMSEGVKV 837 Query: 2518 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2697 AKKE P+N+K+VELAM +WS QYSRPFLF ILNDGTMLCYHAY+YEG E A K E V Sbjct: 838 QAKKETPENMKIVELAMHQWSSQYSRPFLFAILNDGTMLCYHAYIYEGPEHAAKVEDVVS 897 Query: 2698 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 2877 P + DISN SR RNLRF RVS+DI REES + RPRIT+FKNVGG QGLFL+GSR Sbjct: 898 PHNAIDISNISTSRLRNLRFVRVSLDITAREESPDSVMRPRITMFKNVGGYQGLFLTGSR 957 Query: 2878 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 3057 PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPSV NYD+Y Sbjct: 958 PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSVFNYDSY 1017 Query: 3058 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 3234 WPVQK+PLR TPHQVTYF EKNLYP+I+SVPVVRPL+QVLSSL DQD +Q D++S SSD Sbjct: 1018 WPVQKVPLRGTPHQVTYFAEKNLYPLIISVPVVRPLNQVLSSLADQDIGHQTDRESVSSD 1077 Query: 3235 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 3414 LQK YS+DEFEVRI EPEKSGGHWE +A +PMQ SENALTVR +TL N+TT ENE+LLA Sbjct: 1078 DLQKFYSVDEFEVRILEPEKSGGHWETRAILPMQTSENALTVRVITLFNTTTRENESLLA 1137 Query: 3415 IGTAYVQGEDVAARGRILLFTIGRDNENPQNL-VSEVYSKELKGAISALASLQGHLLIAS 3591 IGTAYVQGEDVAARGR+LL++ +++EN QNL V EVYSKELKGA+SALA+LQGHLLIAS Sbjct: 1138 IGTAYVQGEDVAARGRVLLYSFSKNSENSQNLQVLEVYSKELKGAVSALAALQGHLLIAS 1197 Query: 3592 GTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLL 3771 G KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLL Sbjct: 1198 GPKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLL 1257 Query: 3772 AKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGA 3951 AKDF +LDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLL RAEFHAGA Sbjct: 1258 AKDFGTLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLCRAEFHAGA 1317 Query: 3952 HVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXX 4131 HV+KFLRLQMLPTS+DR++A GSDKTNRFA+LFGTLDGSIGC+APL+E Sbjct: 1318 HVTKFLRLQMLPTSADRTTAATGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQTLQR 1377 Query: 4132 XXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGT 4311 VD VPHVCGLNPR+FRQFR+NGKAHRPGPDNM+DCELLSHYEMLPL+EQLEIA QIGT Sbjct: 1378 KLVDAVPHVCGLNPRSFRQFRANGKAHRPGPDNMVDCELLSHYEMLPLDEQLEIAQQIGT 1437 Query: 4312 TRSQILSNLNDLSLGTSFL 4368 TRSQILSNLNDLSLGTSFL Sbjct: 1438 TRSQILSNLNDLSLGTSFL 1456 >ref|XP_018676515.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Musa acuminata subsp. malaccensis] Length = 1457 Score = 2276 bits (5898), Expect = 0.0 Identities = 1131/1458 (77%), Positives = 1244/1458 (85%), Gaps = 9/1458 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAAFKMMHWPTGIE+CASGFITH +RR+GP+PNL Sbjct: 1 MSYAAFKMMHWPTGIEHCASGFITHSPSDSSPPQIPPAQADDLDSSEWPTQRRVGPVPNL 60 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTAAN++EVY VR+QEDD R + GGGG MDGL GARLELVCHYRLHGNVESM Sbjct: 61 VVTAANILEVYLVRIQEDDDRIPRPTGDQHGGGGTMDGLAGARLELVCHYRLHGNVESMA 120 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 +LSLG D+ +KRRDSI+LAFQDAKITVLE+DDS+H LRTSSMHCFEGPDWLYLKRGR SF Sbjct: 121 ILSLGVDDHNKRRDSIMLAFQDAKITVLEYDDSSHELRTSSMHCFEGPDWLYLKRGRESF 180 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGPVVKADPLGRC G L+Y QMIVLKAAQ GQGLV D+EP+S GG SV IESS+V+N Sbjct: 181 ARGPVVKADPLGRCSGALIYGLQMIVLKAAQVGQGLVGDDEPTSTGGAVSVRIESSYVIN 240 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLDM+HVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHP+ Sbjct: 241 LRDLDMKHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPM 300 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSA N+PHDA KLLAVPSPIGG+LVICAN+IHYHSQSA+CSL LN FA Q E SS+MPK Sbjct: 301 IWSASNIPHDANKLLAVPSPIGGVLVICANTIHYHSQSATCSLALNSFATQPEGSSEMPK 360 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 F VELDAA TWLS DVA+FSSKTG+LLLLTL+YDGRVVQ+L+L KSKASVLTSGIT+ Sbjct: 361 AKFTVELDAANVTWLSPDVAMFSSKTGDLLLLTLIYDGRVVQRLELMKSKASVLTSGITT 420 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IG+SFFFLGSRLGDSLLVQYS G S P+SA+ KDE D EGDL +AKRLR SD+LQ+ Sbjct: 421 IGSSFFFLGSRLGDSLLVQYSTGTSGPTSANGKDEVADTEGDLHLAKRLRRTPSDALQEF 480 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 ASGEELSLY TTP+SSE+AQ FSF VRDSLINVGPLKDFSYGLRINADPNATGIAKQSN Sbjct: 481 ASGEELSLYTTTPDSSETAQ-FFSFIVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 539 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVCCSGHGKNG+LCVLQQSIRPELITEVELPGCKGIW+VYHK SR HA DSSKT ED Sbjct: 540 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWSVYHKGSRGHAADSSKTMMED 599 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2001 DEYHAYLIISLETRTMVLETADDLGEVTE+VDYYVQGSTIAAGNLFGRRRVVQIFARGAR Sbjct: 600 DEYHAYLIISLETRTMVLETADDLGEVTETVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 659 Query: 2002 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCTV 2181 ILDGSYMTQEL+F VH IADPYVLLKMTDGSIQLL GDPSTCTV Sbjct: 660 ILDGSYMTQELTFGVHNSELISNSEAPVVASVSIADPYVLLKMTDGSIQLLVGDPSTCTV 719 Query: 2182 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2361 S+NVPA FAS T+ ++AC LYHDK PEPWLRK S+DAWLSTG+AE +D +DG Y DQGDI Sbjct: 720 SVNVPATFASSTELISACALYHDKGPEPWLRKTSTDAWLSTGIAEAVDGHDGLYNDQGDI 779 Query: 2362 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKTKGL-----DE--- 2517 YCLVCYESGTLEIFDV NF+ VFS D F+SGK+HL D ++G+ DE Sbjct: 780 YCLVCYESGTLEIFDVSNFKCVFSVDNFVSGKTHLFDKYAREPSRNSQGIKSKVTDEANG 839 Query: 2518 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2697 + KKE PQ++K+VELAMQRWSGQYSRPFLFGIL+DGTMLCYHAYLYEG E PK E AV Sbjct: 840 SVKKELPQDMKIVELAMQRWSGQYSRPFLFGILSDGTMLCYHAYLYEGLENTPKVEDAVS 899 Query: 2698 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 2877 P + ++ N ASR NLRF RV++D EE+SN+ +RPRIT+FKNVGG QGLFLSGSR Sbjct: 900 PHRSAEMGNVSASRLHNLRFIRVAVDTATSEEASNIVSRPRITVFKNVGGYQGLFLSGSR 959 Query: 2878 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 3057 PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLP+VCNYDNY Sbjct: 960 PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPAVCNYDNY 1019 Query: 3058 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDNNQM-DQDSASSD 3234 WPVQK+PLR TPHQVTY+ EKNLYP+IVSVPVV+PLSQVLSSLVDQD + D DS +SD Sbjct: 1020 WPVQKVPLRGTPHQVTYYSEKNLYPLIVSVPVVKPLSQVLSSLVDQDTVHLSDNDSVNSD 1079 Query: 3235 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 3414 LQK Y++DEFE+RI E KSGGHWE +ATIPMQ SENALTVR VTL N+TT ENETLL+ Sbjct: 1080 DLQKIYTVDEFEIRILESGKSGGHWETRATIPMQTSENALTVRVVTLFNTTTRENETLLS 1139 Query: 3415 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 3594 +GTAYVQGEDVAARGR+LL++ G++ EN QNLVSEVYSKELKGA+SALASLQGHLL+ASG Sbjct: 1140 VGTAYVQGEDVAARGRMLLYSFGKNTENAQNLVSEVYSKELKGAVSALASLQGHLLVASG 1199 Query: 3595 TKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLA 3774 KITL+KWTG+EL G+AFYDAPLHVVSLNIVKNF+LLGDIHKSIYFL WKEQGAQLSLLA Sbjct: 1200 PKITLHKWTGTELNGIAFYDAPLHVVSLNIVKNFVLLGDIHKSIYFLNWKEQGAQLSLLA 1259 Query: 3775 KDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAH 3954 KDF SLDC+ATEFLIDG TLSLVVSDD KNIQIFYYAP+ +ESWKGQKLL RAEFHAG H Sbjct: 1260 KDFGSLDCYATEFLIDGSTLSLVVSDDDKNIQIFYYAPKTLESWKGQKLLPRAEFHAGTH 1319 Query: 3955 VSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXX 4134 V+KFLRLQML +S DR++ PGSDKTNRFA+LF TLDGSIGC+APL+E Sbjct: 1320 VTKFLRLQMLSSSFDRANTAPGSDKTNRFALLFSTLDGSIGCIAPLDELTFRRLQTLQRK 1379 Query: 4135 XVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTT 4314 VD VPH CGLNPR+FRQFRSNGKAHRPGPDNM+DCELLS YEMLPL++QLEIA QIGTT Sbjct: 1380 LVDAVPHTCGLNPRSFRQFRSNGKAHRPGPDNMVDCELLSQYEMLPLDKQLEIAFQIGTT 1439 Query: 4315 RSQILSNLNDLSLGTSFL 4368 RSQILSNLNDLSLGTSFL Sbjct: 1440 RSQILSNLNDLSLGTSFL 1457 >ref|XP_008791966.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X3 [Phoenix dactylifera] Length = 1455 Score = 2275 bits (5896), Expect = 0.0 Identities = 1136/1459 (77%), Positives = 1254/1459 (85%), Gaps = 10/1459 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYA+FKMMHW TGIENCA+GFITH KRRIGPIPNL Sbjct: 1 MSYASFKMMHWATGIENCAAGFITHSRADFAAQIPPIQADDLESEWSP--KRRIGPIPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTA NV+E+Y VR+QEDD R + GGG+MDGL GARLELVCHYRLHGNVESM Sbjct: 59 VVTAGNVLEIYLVRIQEDDGRPDARPGGEQRGGGMMDGLSGARLELVCHYRLHGNVESMA 118 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 +LSLG D+ SKRRDSIVLAFQDAK+TVLE+DDSTHGLRTSSMHCFEGPDW +LKRGR F Sbjct: 119 ILSLGVDDRSKRRDSIVLAFQDAKLTVLEYDDSTHGLRTSSMHCFEGPDWHFLKRGRERF 178 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP++KADPLGRCGG LVY QMI+LK+AQAGQGLV D+EP+SLGGT IESS+V+N Sbjct: 179 ARGPLIKADPLGRCGGALVYGLQMIILKSAQAGQGLVGDDEPASLGGTFPARIESSYVIN 238 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 L DLDM+HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHTCMI ALSISTTLKQHP+ Sbjct: 239 LHDLDMKHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCMICALSISTTLKQHPM 298 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDA KLLAVPSPIGG+LVICAN+IHYHSQSASCSLGLN FA Q ENSS+MPK Sbjct: 299 IWSAVNLPHDASKLLAVPSPIGGVLVICANTIHYHSQSASCSLGLNNFANQPENSSEMPK 358 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 +N NVELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGRVVQ+LDL KSKASVLTSGIT+ Sbjct: 359 SNINVELDAANATWLSHDVAMFSSKTGELLLLTLVYDGRVVQRLDLMKSKASVLTSGITT 418 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IG+SFFFLGSRLGDSLLVQYS G S+P++A++KDE DIEGD P AKRLR MSSD+LQ+V Sbjct: 419 IGSSFFFLGSRLGDSLLVQYSCGTSTPTTANMKDEVADIEGDGPSAKRLRRMSSDALQEV 478 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 SGEELSLY+T PNSSESAQ +FSFAVRDSL+N+GPLKDFSY LRINADPNATGIAKQSN Sbjct: 479 VSGEELSLYSTAPNSSESAQ-IFSFAVRDSLVNIGPLKDFSYSLRINADPNATGIAKQSN 537 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVCCSGHGKNG+LC+LQQSIRPELITEVELPGCKGIWTVYHKSSR H D SKT +ED Sbjct: 538 YELVCCSGHGKNGALCMLQQSIRPELITEVELPGCKGIWTVYHKSSRGHTADPSKTMSED 597 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2001 DEYHAYLIISLE RTMVLETADDLGEVTE+VDYY +GSTI+AGNLFGRRRVVQIFA GAR Sbjct: 598 DEYHAYLIISLEARTMVLETADDLGEVTETVDYY-KGSTISAGNLFGRRRVVQIFAHGAR 656 Query: 2002 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCTV 2181 ILDGSYMTQEL+F H IADP++LLKMTDGSIQLL GDPS CTV Sbjct: 657 ILDGSYMTQELNFGGHNSDSTTNSELSAVASVSIADPFILLKMTDGSIQLLVGDPSACTV 716 Query: 2182 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2361 S++VPA FA+ T S++ CTLYHDK PEPWLRK S+DAWLSTG+AE ID NDGSY DQGD+ Sbjct: 717 SVSVPAIFANSTKSISTCTLYHDKGPEPWLRKTSTDAWLSTGIAETIDGNDGSYHDQGDV 776 Query: 2362 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKT--------KGLDE 2517 YCLVCY++G L+IFDVP F+ VFS + FISGKSHL+DA I+ T +G+ Sbjct: 777 YCLVCYDNGRLDIFDVPTFKCVFSVENFISGKSHLVDAYILEPTISTQVNKEKMSEGVKV 836 Query: 2518 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2697 AKKE P+N+K+VELAM +WS QYSRPFLF ILNDGTMLCYHAY+YEG E A K E V Sbjct: 837 QAKKETPENMKIVELAMHQWSSQYSRPFLFAILNDGTMLCYHAYIYEGPEHAAKVEDVVS 896 Query: 2698 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 2877 P + DISN SR RNLRF RVS+DI REES + RPRIT+FKNVGG QGLFL+GSR Sbjct: 897 PHNAIDISNISTSRLRNLRFVRVSLDITAREESPDSVMRPRITMFKNVGGYQGLFLTGSR 956 Query: 2878 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 3057 PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPSV NYD+Y Sbjct: 957 PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSVFNYDSY 1016 Query: 3058 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 3234 WPVQK+PLR TPHQVTYF EKNLYP+I+SVPVVRPL+QVLSSL DQD +Q D++S SSD Sbjct: 1017 WPVQKVPLRGTPHQVTYFAEKNLYPLIISVPVVRPLNQVLSSLADQDIGHQTDRESVSSD 1076 Query: 3235 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 3414 LQK YS+DEFEVRI EPEKSGGHWE +A +PMQ SENALTVR +TL N+TT ENE+LLA Sbjct: 1077 DLQKFYSVDEFEVRILEPEKSGGHWETRAILPMQTSENALTVRVITLFNTTTRENESLLA 1136 Query: 3415 IGTAYVQGEDVAARGRILLFTIGRDNENPQNL-VSEVYSKELKGAISALASLQGHLLIAS 3591 IGTAYVQGEDVAARGR+LL++ +++EN QNL V EVYSKELKGA+SALA+LQGHLLIAS Sbjct: 1137 IGTAYVQGEDVAARGRVLLYSFSKNSENSQNLQVLEVYSKELKGAVSALAALQGHLLIAS 1196 Query: 3592 GTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLL 3771 G KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLL Sbjct: 1197 GPKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLL 1256 Query: 3772 AKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGA 3951 AKDF +LDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLL RAEFHAGA Sbjct: 1257 AKDFGTLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLCRAEFHAGA 1316 Query: 3952 HVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXX 4131 HV+KFLRLQMLPTS+DR++A GSDKTNRFA+LFGTLDGSIGC+APL+E Sbjct: 1317 HVTKFLRLQMLPTSADRTTAATGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQTLQR 1376 Query: 4132 XXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGT 4311 VD VPHVCGLNPR+FRQFR+NGKAHRPGPDNM+DCELLSHYEMLPL+EQLEIA QIGT Sbjct: 1377 KLVDAVPHVCGLNPRSFRQFRANGKAHRPGPDNMVDCELLSHYEMLPLDEQLEIAQQIGT 1436 Query: 4312 TRSQILSNLNDLSLGTSFL 4368 TRSQILSNLNDLSLGTSFL Sbjct: 1437 TRSQILSNLNDLSLGTSFL 1455 >ref|XP_018676516.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X3 [Musa acuminata subsp. malaccensis] Length = 1455 Score = 2274 bits (5893), Expect = 0.0 Identities = 1130/1458 (77%), Positives = 1243/1458 (85%), Gaps = 9/1458 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAAFKMMHWPTGIE+CASGFITH +RR+GP+PNL Sbjct: 1 MSYAAFKMMHWPTGIEHCASGFITHSPSDSSPPQIPPAQADDLDSSEWPTQRRVGPVPNL 60 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTAAN++EVY VR+QEDD R + GGGG MDGL GARLELVCHYRLHGNVESM Sbjct: 61 VVTAANILEVYLVRIQEDDDRIPRPTGDQHGGGGTMDGLAGARLELVCHYRLHGNVESMA 120 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 +LSLG D+ +KRRDSI+LAFQDAKITVLE+DDS+H LRTSSMHCFEGPDWLYLKRGR SF Sbjct: 121 ILSLGVDDHNKRRDSIMLAFQDAKITVLEYDDSSHELRTSSMHCFEGPDWLYLKRGRESF 180 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGPVVKADPLGRC G L+Y QMIVLKAAQ GQGLV D+EP+S GG SV IESS+V+N Sbjct: 181 ARGPVVKADPLGRCSGALIYGLQMIVLKAAQVGQGLVGDDEPTSTGGAVSVRIESSYVIN 240 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLDM+HVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHP+ Sbjct: 241 LRDLDMKHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPM 300 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSA N+PHDA KLLAVPSPIGG+LVICAN+IHYHSQSA+CSL LN FA Q E +MPK Sbjct: 301 IWSASNIPHDANKLLAVPSPIGGVLVICANTIHYHSQSATCSLALNSFATQPE---EMPK 357 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 F VELDAA TWLS DVA+FSSKTG+LLLLTL+YDGRVVQ+L+L KSKASVLTSGIT+ Sbjct: 358 AKFTVELDAANVTWLSPDVAMFSSKTGDLLLLTLIYDGRVVQRLELMKSKASVLTSGITT 417 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IG+SFFFLGSRLGDSLLVQYS G S P+SA+ KDE D EGDL +AKRLR SD+LQ+ Sbjct: 418 IGSSFFFLGSRLGDSLLVQYSTGTSGPTSANGKDEVADTEGDLHLAKRLRRTPSDALQEF 477 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 ASGEELSLY TTP+SSE+AQK FSF VRDSLINVGPLKDFSYGLRINADPNATGIAKQSN Sbjct: 478 ASGEELSLYTTTPDSSETAQKFFSFIVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 537 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVCCSGHGKNG+LCVLQQSIRPELITEVELPGCKGIW+VYHK SR HA DSSKT ED Sbjct: 538 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWSVYHKGSRGHAADSSKTMMED 597 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2001 DEYHAYLIISLETRTMVLETADDLGEVTE+VDYYVQGSTIAAGNLFGRRRVVQIFARGAR Sbjct: 598 DEYHAYLIISLETRTMVLETADDLGEVTETVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 657 Query: 2002 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCTV 2181 ILDGSYMTQEL+F VH IADPYVLLKMTDGSIQLL GDPSTCTV Sbjct: 658 ILDGSYMTQELTFGVHNSELISNSEAPVVASVSIADPYVLLKMTDGSIQLLVGDPSTCTV 717 Query: 2182 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2361 S+NVPA FAS T+ ++AC LYHDK PEPWLRK S+DAWLSTG+AE +D +DG Y DQGDI Sbjct: 718 SVNVPATFASSTELISACALYHDKGPEPWLRKTSTDAWLSTGIAEAVDGHDGLYNDQGDI 777 Query: 2362 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKTKGL-----DE--- 2517 YCLVCYESGTLEIFDV NF+ VFS D F+SGK+HL D ++G+ DE Sbjct: 778 YCLVCYESGTLEIFDVSNFKCVFSVDNFVSGKTHLFDKYAREPSRNSQGIKSKVTDEANG 837 Query: 2518 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2697 + KKE PQ++K+VELAMQRWSGQYSRPFLFGIL+DGTMLCYHAYLYEG E PK E AV Sbjct: 838 SVKKELPQDMKIVELAMQRWSGQYSRPFLFGILSDGTMLCYHAYLYEGLENTPKVEDAVS 897 Query: 2698 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 2877 P + ++ N ASR NLRF RV++D EE+SN+ +RPRIT+FKNVGG QGLFLSGSR Sbjct: 898 PHRSAEMGNVSASRLHNLRFIRVAVDTATSEEASNIVSRPRITVFKNVGGYQGLFLSGSR 957 Query: 2878 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 3057 PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLP+VCNYDNY Sbjct: 958 PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPAVCNYDNY 1017 Query: 3058 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDNNQM-DQDSASSD 3234 WPVQK+PLR TPHQVTY+ EKNLYP+IVSVPVV+PLSQVLSSLVDQD + D DS +SD Sbjct: 1018 WPVQKVPLRGTPHQVTYYSEKNLYPLIVSVPVVKPLSQVLSSLVDQDTVHLSDNDSVNSD 1077 Query: 3235 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 3414 LQK Y++DEFE+RI E KSGGHWE +ATIPMQ SENALTVR VTL N+TT ENETLL+ Sbjct: 1078 DLQKIYTVDEFEIRILESGKSGGHWETRATIPMQTSENALTVRVVTLFNTTTRENETLLS 1137 Query: 3415 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 3594 +GTAYVQGEDVAARGR+LL++ G++ EN QNLVSEVYSKELKGA+SALASLQGHLL+ASG Sbjct: 1138 VGTAYVQGEDVAARGRMLLYSFGKNTENAQNLVSEVYSKELKGAVSALASLQGHLLVASG 1197 Query: 3595 TKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLA 3774 KITL+KWTG+EL G+AFYDAPLHVVSLNIVKNF+LLGDIHKSIYFL WKEQGAQLSLLA Sbjct: 1198 PKITLHKWTGTELNGIAFYDAPLHVVSLNIVKNFVLLGDIHKSIYFLNWKEQGAQLSLLA 1257 Query: 3775 KDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAH 3954 KDF SLDC+ATEFLIDG TLSLVVSDD KNIQIFYYAP+ +ESWKGQKLL RAEFHAG H Sbjct: 1258 KDFGSLDCYATEFLIDGSTLSLVVSDDDKNIQIFYYAPKTLESWKGQKLLPRAEFHAGTH 1317 Query: 3955 VSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXX 4134 V+KFLRLQML +S DR++ PGSDKTNRFA+LF TLDGSIGC+APL+E Sbjct: 1318 VTKFLRLQMLSSSFDRANTAPGSDKTNRFALLFSTLDGSIGCIAPLDELTFRRLQTLQRK 1377 Query: 4135 XVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTT 4314 VD VPH CGLNPR+FRQFRSNGKAHRPGPDNM+DCELLS YEMLPL++QLEIA QIGTT Sbjct: 1378 LVDAVPHTCGLNPRSFRQFRSNGKAHRPGPDNMVDCELLSQYEMLPLDKQLEIAFQIGTT 1437 Query: 4315 RSQILSNLNDLSLGTSFL 4368 RSQILSNLNDLSLGTSFL Sbjct: 1438 RSQILSNLNDLSLGTSFL 1455 >ref|XP_010918168.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Elaeis guineensis] Length = 1455 Score = 2266 bits (5872), Expect = 0.0 Identities = 1131/1458 (77%), Positives = 1245/1458 (85%), Gaps = 9/1458 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAAFKMMHW TGIENCA+GFITH RRIGPIPNL Sbjct: 1 MSYAAFKMMHWATGIENCAAGFITHSLVDFAAQMPPIQADDLESEWSPM--RRIGPIPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTA NV+E+Y VR+QEDD R + G G+ DGL GARLELVCHYRLHGNVESM Sbjct: 59 VVTAGNVLEIYLVRIQEDDGRPAARPGGEQRGVGITDGLSGARLELVCHYRLHGNVESMA 118 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 +LSLG D+ SKRRDSIVLAFQDAK+TVLE+DDSTHGLR SSMHCFEGPDW +LKRGR F Sbjct: 119 ILSLGVDDRSKRRDSIVLAFQDAKLTVLEYDDSTHGLRASSMHCFEGPDWHFLKRGRERF 178 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP+VKADPLGRC G L+Y QMI+LK+AQAGQGLV D+EP+S G T IESS+V+N Sbjct: 179 ARGPLVKADPLGRCCGALIYGLQMIILKSAQAGQGLVGDDEPASSGSTFPARIESSYVIN 238 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 L DLDM+HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHTC ISALSISTTLKQHP+ Sbjct: 239 LHDLDMKHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCTISALSISTTLKQHPM 298 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDAYKLLAVPSPIGG+LVICAN+IHYHSQSASCSLGLN FA Q ENSS+MPK Sbjct: 299 IWSAVNLPHDAYKLLAVPSPIGGVLVICANTIHYHSQSASCSLGLNNFANQPENSSEMPK 358 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 +N NVELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGRVVQ+LDL KSKASVLTSGIT+ Sbjct: 359 SNINVELDAANATWLSHDVAMFSSKTGELLLLTLVYDGRVVQRLDLMKSKASVLTSGITA 418 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IG+SFFFLGSRLGDSLLVQY G S+P++A++KDE DIE D AKRLR MSSD+LQ+V Sbjct: 419 IGSSFFFLGSRLGDSLLVQYGCGTSTPTTANMKDEVADIEVDGLSAKRLRRMSSDALQEV 478 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 SGEELSLY+T PNSSESAQK FSFAVRDSL+N+GPLKDFSY LRINADPNATGIAKQSN Sbjct: 479 VSGEELSLYSTAPNSSESAQKSFSFAVRDSLVNIGPLKDFSYSLRINADPNATGIAKQSN 538 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVCCSGHGKNG+LCVLQQS+RPELITEVELPGCKGIWTVYHKSSR H DSSKT ED Sbjct: 539 YELVCCSGHGKNGALCVLQQSVRPELITEVELPGCKGIWTVYHKSSRGHTADSSKTMPED 598 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2001 DEYHAYLIISLE RTMVLETADDLGEVTE+VDYY +GSTI+AGNLFGRRRVVQIFA GAR Sbjct: 599 DEYHAYLIISLEARTMVLETADDLGEVTETVDYY-KGSTISAGNLFGRRRVVQIFAHGAR 657 Query: 2002 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCTV 2181 ILDGSYMTQEL+F H IADPYVLLKMTDGSIQLL GDPS CTV Sbjct: 658 ILDGSYMTQELNFGGHNSDSTTNSEVSAVASVSIADPYVLLKMTDGSIQLLVGDPSACTV 717 Query: 2182 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2361 S+NVP FA+ T ++ACTLYHDK PEPWLRK S+DAWLSTG+AEPID +DGSY DQGD+ Sbjct: 718 SVNVPPIFANSTKLISACTLYHDKGPEPWLRKTSTDAWLSTGIAEPIDGSDGSYHDQGDV 777 Query: 2362 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKT--------KGLDE 2517 YCLVCYE+G L+IFDVP F+ VFS + FISGK+HL+DA I T +G Sbjct: 778 YCLVCYENGRLDIFDVPTFKCVFSVENFISGKTHLVDAYIPEPTTSTQVNKEKMFEGAKV 837 Query: 2518 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2697 AKKE P+N+K+VELAMQRW GQYSRPFLF ILNDGTMLCYHAY+YEG E APK E V Sbjct: 838 QAKKETPENMKIVELAMQRWFGQYSRPFLFAILNDGTMLCYHAYIYEGPENAPKVEDVVS 897 Query: 2698 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 2877 P + DISN +SR RNLRF RV++DI REES + +PRI +FKNVGG QGLFL+GSR Sbjct: 898 PHNGMDISNMSSSRLRNLRFIRVALDITAREESPDSVTQPRIMVFKNVGGYQGLFLTGSR 957 Query: 2878 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 3057 PAW M+CRERLR+HPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPSV NYD+Y Sbjct: 958 PAWFMVCRERLRIHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSVFNYDSY 1017 Query: 3058 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 3234 WPVQK+PLR TPHQVTYF EKNLYP+I+SVPVV+PL+QVLSSL DQD +Q D++S +SD Sbjct: 1018 WPVQKVPLRGTPHQVTYFAEKNLYPLIISVPVVKPLNQVLSSLADQDIGHQTDRESINSD 1077 Query: 3235 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 3414 LQK YS+DEFEVRI EPEKSGG WE +ATIPMQ SENALTVR +TL N+TT ENE+LLA Sbjct: 1078 DLQKFYSVDEFEVRILEPEKSGGCWETRATIPMQTSENALTVRVITLFNTTTRENESLLA 1137 Query: 3415 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 3594 IGTAYVQGEDVAARGR+LL++ +++EN QNLV EVYSKELKGA+SALASLQGHLLIASG Sbjct: 1138 IGTAYVQGEDVAARGRVLLYSFSKNSENSQNLVLEVYSKELKGAVSALASLQGHLLIASG 1197 Query: 3595 TKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLA 3774 KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLLA Sbjct: 1198 PKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLLA 1257 Query: 3775 KDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAH 3954 KDF +LDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLL RAEFHAGAH Sbjct: 1258 KDFGTLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLCRAEFHAGAH 1317 Query: 3955 VSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXX 4134 V+KFLRLQMLPTS+DR++A GSDKTNRFA+LF TLDGSIGC+APL+E Sbjct: 1318 VTKFLRLQMLPTSADRTTAATGSDKTNRFALLFSTLDGSIGCIAPLDELTFRRLQTLQRK 1377 Query: 4135 XVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTT 4314 VD VPHVCGLNPR+FRQFR+NGKAHRPGPDNM+DCELLSHYEMLPL+EQLEIAHQIGTT Sbjct: 1378 LVDAVPHVCGLNPRSFRQFRANGKAHRPGPDNMVDCELLSHYEMLPLDEQLEIAHQIGTT 1437 Query: 4315 RSQILSNLNDLSLGTSFL 4368 RSQILSNLNDLSLGTSFL Sbjct: 1438 RSQILSNLNDLSLGTSFL 1455 >ref|XP_010918167.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Elaeis guineensis] Length = 1456 Score = 2261 bits (5860), Expect = 0.0 Identities = 1131/1459 (77%), Positives = 1245/1459 (85%), Gaps = 10/1459 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAAFKMMHW TGIENCA+GFITH RRIGPIPNL Sbjct: 1 MSYAAFKMMHWATGIENCAAGFITHSLVDFAAQMPPIQADDLESEWSPM--RRIGPIPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTA NV+E+Y VR+QEDD R + G G+ DGL GARLELVCHYRLHGNVESM Sbjct: 59 VVTAGNVLEIYLVRIQEDDGRPAARPGGEQRGVGITDGLSGARLELVCHYRLHGNVESMA 118 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 +LSLG D+ SKRRDSIVLAFQDAK+TVLE+DDSTHGLR SSMHCFEGPDW +LKRGR F Sbjct: 119 ILSLGVDDRSKRRDSIVLAFQDAKLTVLEYDDSTHGLRASSMHCFEGPDWHFLKRGRERF 178 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP+VKADPLGRC G L+Y QMI+LK+AQAGQGLV D+EP+S G T IESS+V+N Sbjct: 179 ARGPLVKADPLGRCCGALIYGLQMIILKSAQAGQGLVGDDEPASSGSTFPARIESSYVIN 238 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 L DLDM+HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHTC ISALSISTTLKQHP+ Sbjct: 239 LHDLDMKHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCTISALSISTTLKQHPM 298 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDAYKLLAVPSPIGG+LVICAN+IHYHSQSASCSLGLN FA Q ENSS+MPK Sbjct: 299 IWSAVNLPHDAYKLLAVPSPIGGVLVICANTIHYHSQSASCSLGLNNFANQPENSSEMPK 358 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 +N NVELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGRVVQ+LDL KSKASVLTSGIT+ Sbjct: 359 SNINVELDAANATWLSHDVAMFSSKTGELLLLTLVYDGRVVQRLDLMKSKASVLTSGITA 418 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IG+SFFFLGSRLGDSLLVQY G S+P++A++KDE DIE D AKRLR MSSD+LQ+V Sbjct: 419 IGSSFFFLGSRLGDSLLVQYGCGTSTPTTANMKDEVADIEVDGLSAKRLRRMSSDALQEV 478 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 SGEELSLY+T PNSSESAQK FSFAVRDSL+N+GPLKDFSY LRINADPNATGIAKQSN Sbjct: 479 VSGEELSLYSTAPNSSESAQKSFSFAVRDSLVNIGPLKDFSYSLRINADPNATGIAKQSN 538 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVCCSGHGKNG+LCVLQQS+RPELITEVELPGCKGIWTVYHKSSR H DSSKT ED Sbjct: 539 YELVCCSGHGKNGALCVLQQSVRPELITEVELPGCKGIWTVYHKSSRGHTADSSKTMPED 598 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2001 DEYHAYLIISLE RTMVLETADDLGEVTE+VDYY +GSTI+AGNLFGRRRVVQIFA GAR Sbjct: 599 DEYHAYLIISLEARTMVLETADDLGEVTETVDYY-KGSTISAGNLFGRRRVVQIFAHGAR 657 Query: 2002 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCTV 2181 ILDGSYMTQEL+F H IADPYVLLKMTDGSIQLL GDPS CTV Sbjct: 658 ILDGSYMTQELNFGGHNSDSTTNSEVSAVASVSIADPYVLLKMTDGSIQLLVGDPSACTV 717 Query: 2182 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2361 S+NVP FA+ T ++ACTLYHDK PEPWLRK S+DAWLSTG+AEPID +DGSY DQGD+ Sbjct: 718 SVNVPPIFANSTKLISACTLYHDKGPEPWLRKTSTDAWLSTGIAEPIDGSDGSYHDQGDV 777 Query: 2362 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKT--------KGLDE 2517 YCLVCYE+G L+IFDVP F+ VFS + FISGK+HL+DA I T +G Sbjct: 778 YCLVCYENGRLDIFDVPTFKCVFSVENFISGKTHLVDAYIPEPTTSTQVNKEKMFEGAKV 837 Query: 2518 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2697 AKKE P+N+K+VELAMQRW GQYSRPFLF ILNDGTMLCYHAY+YEG E APK E V Sbjct: 838 QAKKETPENMKIVELAMQRWFGQYSRPFLFAILNDGTMLCYHAYIYEGPENAPKVEDVVS 897 Query: 2698 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 2877 P + DISN +SR RNLRF RV++DI REES + +PRI +FKNVGG QGLFL+GSR Sbjct: 898 PHNGMDISNMSSSRLRNLRFIRVALDITAREESPDSVTQPRIMVFKNVGGYQGLFLTGSR 957 Query: 2878 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 3057 PAW M+CRERLR+HPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPSV NYD+Y Sbjct: 958 PAWFMVCRERLRIHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSVFNYDSY 1017 Query: 3058 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 3234 WPVQK+PLR TPHQVTYF EKNLYP+I+SVPVV+PL+QVLSSL DQD +Q D++S +SD Sbjct: 1018 WPVQKVPLRGTPHQVTYFAEKNLYPLIISVPVVKPLNQVLSSLADQDIGHQTDRESINSD 1077 Query: 3235 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 3414 LQK YS+DEFEVRI EPEKSGG WE +ATIPMQ SENALTVR +TL N+TT ENE+LLA Sbjct: 1078 DLQKFYSVDEFEVRILEPEKSGGCWETRATIPMQTSENALTVRVITLFNTTTRENESLLA 1137 Query: 3415 IGTAYVQGEDVAARGRILLFTIGRDNENPQNL-VSEVYSKELKGAISALASLQGHLLIAS 3591 IGTAYVQGEDVAARGR+LL++ +++EN QNL V EVYSKELKGA+SALASLQGHLLIAS Sbjct: 1138 IGTAYVQGEDVAARGRVLLYSFSKNSENSQNLQVLEVYSKELKGAVSALASLQGHLLIAS 1197 Query: 3592 GTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLL 3771 G KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLL Sbjct: 1198 GPKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLL 1257 Query: 3772 AKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGA 3951 AKDF +LDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLL RAEFHAGA Sbjct: 1258 AKDFGTLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLCRAEFHAGA 1317 Query: 3952 HVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXX 4131 HV+KFLRLQMLPTS+DR++A GSDKTNRFA+LF TLDGSIGC+APL+E Sbjct: 1318 HVTKFLRLQMLPTSADRTTAATGSDKTNRFALLFSTLDGSIGCIAPLDELTFRRLQTLQR 1377 Query: 4132 XXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGT 4311 VD VPHVCGLNPR+FRQFR+NGKAHRPGPDNM+DCELLSHYEMLPL+EQLEIAHQIGT Sbjct: 1378 KLVDAVPHVCGLNPRSFRQFRANGKAHRPGPDNMVDCELLSHYEMLPLDEQLEIAHQIGT 1437 Query: 4312 TRSQILSNLNDLSLGTSFL 4368 TRSQILSNLNDLSLGTSFL Sbjct: 1438 TRSQILSNLNDLSLGTSFL 1456 >ref|XP_020084456.1| probable cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Ananas comosus] Length = 1465 Score = 2237 bits (5796), Expect = 0.0 Identities = 1126/1470 (76%), Positives = 1243/1470 (84%), Gaps = 21/1470 (1%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXX---------- 171 MSYAAFKMMHWPTGIENCA+GF TH Sbjct: 1 MSYAAFKMMHWPTGIENCAAGFFTHSPSSSSSSSAAAAEATAAAEIPPLPGDELEAAEWQ 60 Query: 172 ----KRRIGPIPNLVVTAANVIEVYTVRVQEDDTRAVPAAAEPR---GGGGVMDGLCGAR 330 +R +G +P+LVVTA NV+E+Y VR QED+ RA A+ E + GGGGV+DG+ GAR Sbjct: 61 QGRRRRGVGAVPDLVVTAGNVLEIYVVRAQEDEGRAPRASGEQKRGGGGGGVVDGISGAR 120 Query: 331 LELVCHYRLHGNVESMTVLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMH 510 LELVCHYRLHGNVESM VLS+G D S RRDSIVLAFQDAKITVLE+DDS HGLRTSSMH Sbjct: 121 LELVCHYRLHGNVESMAVLSVGADNRSNRRDSIVLAFQDAKITVLEYDDSLHGLRTSSMH 180 Query: 511 CFEGPDWLYLKRGRGSFARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPS 690 CFEGPDW YLKRGR SFARGP+VKADP GRCGG LVY QMI+LKAAQAGQ LV D+EP+ Sbjct: 181 CFEGPDWQYLKRGRESFARGPIVKADPSGRCGGALVYGLQMIILKAAQAGQSLVGDDEPN 240 Query: 691 SLGGTTSVCIESSFVLNLRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHT 870 S GGT SV IESS+V+NLR+LDM HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHT Sbjct: 241 SAGGTISVRIESSYVINLRELDMNHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHT 300 Query: 871 CMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSL 1050 CMISALS+STTLKQHP+IWSA NLPHDAYKLLAVPSPIGG+LVICANSIHYHSQSASCSL Sbjct: 301 CMISALSVSTTLKQHPMIWSASNLPHDAYKLLAVPSPIGGVLVICANSIHYHSQSASCSL 360 Query: 1051 GLNKFAVQMENSSDMPKTNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQK 1230 LN F+ Q + S +MPK+NF VELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGR+VQ+ Sbjct: 361 SLNSFSSQPDGSLEMPKSNFAVELDAAHATWLSHDVAMFSSKTGELLLLTLVYDGRIVQR 420 Query: 1231 LDLTKSKASVLTSGITSIGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDL 1410 LDL KSKASVLTSG+T+IG+SFFFLGSRLGDSLLVQYS G S P+S+ VKDE+ DI+GD+ Sbjct: 421 LDLVKSKASVLTSGLTTIGSSFFFLGSRLGDSLLVQYSCGTSVPTSSQVKDEATDIDGDV 480 Query: 1411 PMAKRLRGMSSDSLQDVASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYG 1590 P AKRLR MSSD+LQDV S EELSLY PNSSESAQK FSFAVRDSLINVGPLKDFSYG Sbjct: 481 PSAKRLRRMSSDALQDVTSVEELSLYNNAPNSSESAQKSFSFAVRDSLINVGPLKDFSYG 540 Query: 1591 LRINADPNATGIAKQSNYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYH 1770 LRINADPN+TG+AKQSNYELVCCSGHGKNG+LCVLQQSIRPELITEV L GCKGIWTVYH Sbjct: 541 LRINADPNSTGLAKQSNYELVCCSGHGKNGALCVLQQSIRPELITEVVLAGCKGIWTVYH 600 Query: 1771 KSSRTHAVDSSKTTTEDDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAG 1950 KSSR HA DSSKT TE+DEYHAYLIISLE+RTMVLETADDLGEVTE+VDYYV GSTIAAG Sbjct: 601 KSSRGHATDSSKTMTENDEYHAYLIISLESRTMVLETADDLGEVTETVDYYVHGSTIAAG 660 Query: 1951 NLFGRRRVVQIFARGARILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKM 2130 NLFGRRRV+QI+A+GARILDGSYMTQEL+FV H IADPYVLLKM Sbjct: 661 NLFGRRRVIQIYAKGARILDGSYMTQELNFVAH-NSEQTSSELPTVASVSIADPYVLLKM 719 Query: 2131 TDGSIQLLAGDPSTCTVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGV 2310 TDGSIQLL GDP+ CTVS+N PA F+S T+ ++ACTLYHDK PEPWLRK S+DAWLSTGV Sbjct: 720 TDGSIQLLLGDPAACTVSLNAPAIFSSSTEPISACTLYHDKGPEPWLRKTSTDAWLSTGV 779 Query: 2311 AEPIDENDGSYPDQGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACI-IP 2487 AEPID NDGSY D GDIYCLVCYE+G LEIFDVP+F+SV+S D F+SGK++L D P Sbjct: 780 AEPIDGNDGSYHDHGDIYCLVCYENGKLEIFDVPSFKSVYSVDNFVSGKTYLADTYTKDP 839 Query: 2488 HK-AKTKGLDETAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGG 2664 +K TKG KEP QN++VVELAMQRWSG+YSRPFLFG+L+DGT+LCYHAY YEG Sbjct: 840 NKYPDTKGY---LNKEPVQNMRVVELAMQRWSGRYSRPFLFGMLSDGTILCYHAYFYEGT 896 Query: 2665 EIAPKNEVAVYPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVG 2844 E A K V P + D S+ SR RNLRF RVSIDI REE N +RPRIT+F NVG Sbjct: 897 ENAVKGGDPVSPRGSADTSSMSISRLRNLRFLRVSIDITTREEMLNAVSRPRITVFNNVG 956 Query: 2845 GCQGLFLSGSRPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKIC 3024 G QGLFLSGSRPAWLM+CRER+RVHPQLCDGSI AF VLHNVNCNHGLIYVTSQG+LKIC Sbjct: 957 GYQGLFLSGSRPAWLMVCRERIRVHPQLCDGSIAAFAVLHNVNCNHGLIYVTSQGYLKIC 1016 Query: 3025 QLPSVCNYDNYWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-N 3201 QLPS NYDN+WPVQKIPL TPHQVTY+ EKNLYP+I+SVPV+RPL+QVLSSL+DQ+ + Sbjct: 1017 QLPSSFNYDNHWPVQKIPLLGTPHQVTYYAEKNLYPLILSVPVIRPLNQVLSSLLDQEMS 1076 Query: 3202 NQMDQDSASSDSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVN 3381 Q+D D+ +SD LQK+YS+DEFEVRI EP+KS GHW+ KAT+PMQ SENALTVR VTL N Sbjct: 1077 QQIDNDNFNSDDLQKTYSVDEFEVRILEPDKS-GHWDTKATVPMQTSENALTVRIVTLFN 1135 Query: 3382 STTNENETLLAIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALA 3561 +TT ENE+L+AIGTAYVQGEDVAARGR+LLF+ + NE+ QNL+SEVYSKELKGA+S LA Sbjct: 1136 TTTKENESLMAIGTAYVQGEDVAARGRVLLFSFAKTNESSQNLISEVYSKELKGAVSNLA 1195 Query: 3562 SLQGHLLIASGTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIW 3741 SLQGHLLIASG KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFIL GD+HKSIYFL W Sbjct: 1196 SLQGHLLIASGPKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILFGDVHKSIYFLNW 1255 Query: 3742 KEQGAQLSLLAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKL 3921 KEQGAQLSLLAKDF L+C+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKL Sbjct: 1256 KEQGAQLSLLAKDFGPLNCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKL 1315 Query: 3922 LSRAEFHAGAHVSKFLRLQMLPTSSDRSSAT-PGSDKTNRFAILFGTLDGSIGCVAPLEE 4098 LSRAEFHAGA VSKFLRLQMLPTSSDRS AT PGS+KTNRFA+LF TLDGSIGC+APLEE Sbjct: 1316 LSRAEFHAGALVSKFLRLQMLPTSSDRSGATVPGSEKTNRFALLFSTLDGSIGCIAPLEE 1375 Query: 4099 XXXXXXXXXXXXXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLE 4278 VD VPHVCGLNPR+FRQF SNGKAHRPGPD+M+D ELLSHYEMLPLE Sbjct: 1376 LTFRRLQTLQRKLVDAVPHVCGLNPRSFRQFHSNGKAHRPGPDHMVDFELLSHYEMLPLE 1435 Query: 4279 EQLEIAHQIGTTRSQILSNLNDLSLGTSFL 4368 EQLEIAHQIGTTRSQILSN NDLSLGTSFL Sbjct: 1436 EQLEIAHQIGTTRSQILSNFNDLSLGTSFL 1465 >gb|OAY63492.1| putative cleavage and polyadenylation specificity factor subunit 1 [Ananas comosus] Length = 1469 Score = 2196 bits (5691), Expect = 0.0 Identities = 1118/1484 (75%), Positives = 1235/1484 (83%), Gaps = 35/1484 (2%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXX---------- 171 MSYAAFKMMHWPTGIENCA+GF TH Sbjct: 1 MSYAAFKMMHWPTGIENCAAGFFTHSPSSSSSSSAAAAEATAAAEIPPLPGDELEAAEWQ 60 Query: 172 ----KRRIGPIPNLVVTAANVIEVYTVRVQEDDTRAVPAAAEPR---GGGGVMDGLCGAR 330 +R +G +P+LVVTA NV+E+Y VR QED+ RA A+ E + GGGGV+DG+ GAR Sbjct: 61 QGRRRRGVGAVPDLVVTAGNVLEIYVVRAQEDEGRAPRASGEQKRGGGGGGVVDGISGAR 120 Query: 331 LELVCHYRLHGNVESMTVLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMH 510 LELVCHYRLHGNVESM VLS+G D S RRDSIVLAFQDAKITVLE+DDS HGLRTSSMH Sbjct: 121 LELVCHYRLHGNVESMAVLSVGADNRSNRRDSIVLAFQDAKITVLEYDDSLHGLRTSSMH 180 Query: 511 CFEGPDWLYLKRGRGSFARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPS 690 CFEGPDW YLKRGR SFARGP+VKADP GRCGG LVY QMI+LKAAQAGQ LV D+EP+ Sbjct: 181 CFEGPDWQYLKRGRESFARGPIVKADPSGRCGGALVYGLQMIILKAAQAGQSLVGDDEPN 240 Query: 691 SLGGTTSVCIESSFVLNLRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHT 870 S GGT SV IESS+V+NLR+LDM HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHT Sbjct: 241 SAGGTISVRIESSYVINLRELDMNHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHT 300 Query: 871 CMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSL 1050 CMISALS+STTLKQHP+IWSA NLPHDAYKLLAVPSPIGG+LVICANSIHYHSQSASCSL Sbjct: 301 CMISALSVSTTLKQHPMIWSASNLPHDAYKLLAVPSPIGGVLVICANSIHYHSQSASCSL 360 Query: 1051 GLNKFAVQMENSSDMPKTNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGR---- 1218 LN F++Q + S +MPK+NF VELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGR Sbjct: 361 SLNSFSLQPDGSLEMPKSNFAVELDAAHATWLSHDVAMFSSKTGELLLLTLVYDGRYNPK 420 Query: 1219 --------VVQKLDLTKSKASVLTSGITSIGNSFFFLGSRLGDSLLVQYSYGASSPSSAH 1374 +VQ+LDL KSKASVLTSG+T+IG+SFFFLGSRLGDSLLVQYS G S P+S+ Sbjct: 421 TLLGKLKTIVQRLDLVKSKASVLTSGLTTIGSSFFFLGSRLGDSLLVQYSCGTSVPTSSQ 480 Query: 1375 VKD--ESVDIEGDLPMAKRLRGMSSDSLQDVASGEELSLYATTPNSSESAQKVFSFAVRD 1548 VKD E+ DI+GD+P AKRLR MSSD+LQDV S EELSLY PNSSESAQK FSFAVRD Sbjct: 481 VKDEVEATDIDGDVPSAKRLRRMSSDALQDVTSVEELSLYNNAPNSSESAQKSFSFAVRD 540 Query: 1549 SLINVGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGSLCVLQQSIRPELITE 1728 SLINVGPLKDFSYGLRINADPN+TG+AKQSNYELVCCSGHGKNG+LCVLQQSIRPELITE Sbjct: 541 SLINVGPLKDFSYGLRINADPNSTGLAKQSNYELVCCSGHGKNGALCVLQQSIRPELITE 600 Query: 1729 VELPGCKGIWTVYHKSSRTHAVDSSKTTTEDDEYHAYLIISLETRTMVLETADDLGEVTE 1908 V L GCKGIWTVYHKSSR HA DSSKT TE+DEYHAYLIISLE+RTMVLETADDLGEVTE Sbjct: 601 VVLAGCKGIWTVYHKSSRGHATDSSKTMTENDEYHAYLIISLESRTMVLETADDLGEVTE 660 Query: 1909 SVDYYVQGSTIAAGNLFGRRRVVQIFARGARILDGSYMTQELSFVVHXXXXXXXXXXXXX 2088 +VDYYV GSTIAAGNLFGRRRV+QI+A+GARILDGSYMTQEL+FV H Sbjct: 661 TVDYYVHGSTIAAGNLFGRRRVIQIYAKGARILDGSYMTQELNFVAH-NSEQTSTELPTV 719 Query: 2089 XXXXIADPYVLLKMTDGSIQLLAGDPSTCTVSINVPARFASLTDSVAACTLYHDKSPEPW 2268 IADPYVLLKMTDGSIQLL GDP+ CTVS+N PA F+S T+ ++ACTLYHDK EPW Sbjct: 720 ASVSIADPYVLLKMTDGSIQLLLGDPAACTVSLNAPAIFSSSTEPISACTLYHDKGLEPW 779 Query: 2269 LRKASSDAWLSTGVAEPIDENDGSYPDQGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFI 2448 LRK S+DAWLSTGVAEPID NDGSY D GDIYCLVCYE+G LEIFDVP+F+SV+S D F+ Sbjct: 780 LRKTSTDAWLSTGVAEPIDGNDGSYHDHGDIYCLVCYENGKLEIFDVPSFKSVYSVDNFV 839 Query: 2449 SGKSHLMDACI-IPHK-AKTKGLDETAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILND 2622 SGK++L+D P+K TKG KEP QN++VVELAMQRWSG+YSRPFLFG+L+D Sbjct: 840 SGKTYLVDTYTKDPNKYPDTKGY---LNKEPVQNMRVVELAMQRWSGRYSRPFLFGMLSD 896 Query: 2623 GTMLCYHAYLYEGGEIAPKNEVAVYPLSTGDISNTGASRFRNLRFYRVSIDINIREESSN 2802 GT+LCYHAY YEG E A K V P + D S+ SR RNLRF RVSIDI REE SN Sbjct: 897 GTILCYHAYFYEGTENAVKGGDPVSPRGSADTSSMSISRLRNLRFLRVSIDITTREEMSN 956 Query: 2803 LTARPRITIFKNVGGCQGLFLSGSRPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNH 2982 +RPRIT+F NVGG QGLFLSGSRPAWLM+CRER+RVHPQLCDGSI AF VLHNVNCNH Sbjct: 957 AVSRPRITVFNNVGGYQGLFLSGSRPAWLMVCRERIRVHPQLCDGSIAAFAVLHNVNCNH 1016 Query: 2983 GLIYVTSQGFLKICQLPSVCNYDNYWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRP 3162 GLIYVTSQG+LKICQLPS NYDN+WPVQKIPL TPHQVTY+ EKNLYP+I+SVPV+RP Sbjct: 1017 GLIYVTSQGYLKICQLPSSFNYDNHWPVQKIPLLGTPHQVTYYAEKNLYPLILSVPVIRP 1076 Query: 3163 LSQVLSSLVDQD-NNQMDQDSASSDSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQA 3339 L+QVLSSL+DQ+ + Q+D D+ +SD LQK+YS+DEFEVRI EP+KS GHW+ KAT+PMQ Sbjct: 1077 LNQVLSSLLDQEMSQQIDNDNFNSDDLQKTYSVDEFEVRILEPDKS-GHWDTKATVPMQT 1135 Query: 3340 SENALTVRNVTLVNSTTNENETLLAIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSE 3519 SENALTVR VTL N+TT ENE+L+AIGTAYVQGEDVAARGR+LLF+ + NE+ QNL+SE Sbjct: 1136 SENALTVRIVTLFNTTTKENESLMAIGTAYVQGEDVAARGRVLLFSFAKTNESSQNLISE 1195 Query: 3520 VYSKELKGAISALASLQGHLLIASGTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFI 3699 VYSKELKGA+S LASLQGHLLIASG KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFI Sbjct: 1196 VYSKELKGAVSNLASLQGHLLIASGPKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFI 1255 Query: 3700 LLGDIHKSIYFLIWKEQGAQLSLLAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFY 3879 L GD+HKSIYFL WKEQGAQLSLLAKDF L+C+ATEFLIDG TLSLVVSDDQKNIQ Sbjct: 1256 LFGDVHKSIYFLNWKEQGAQLSLLAKDFGPLNCYATEFLIDGSTLSLVVSDDQKNIQ--- 1312 Query: 3880 YAPRMMESWKGQKLLSRAEFHAGAHVSKFLRLQMLPTSSDRSSAT-PGSDKTNRFAILFG 4056 SWKGQKLLSRAEFHAGA VSKFLRLQMLPTSSDRS AT PGS+KTNRFA+LF Sbjct: 1313 -------SWKGQKLLSRAEFHAGALVSKFLRLQMLPTSSDRSGATVPGSEKTNRFALLFS 1365 Query: 4057 TLDGSIGCVAPLEEXXXXXXXXXXXXXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMI 4236 TLDGSIGC+APLEE VD VPHVCGLNPR+FRQF SNGKAHRPGPD+M+ Sbjct: 1366 TLDGSIGCIAPLEELTFRRLQTLQRKLVDAVPHVCGLNPRSFRQFHSNGKAHRPGPDHMV 1425 Query: 4237 DCELLSHYEMLPLEEQLEIAHQIGTTRSQILSNLNDLSLGTSFL 4368 D ELLSHYEMLPLEEQLEIAHQIGTTRSQILSN NDLSLGTSFL Sbjct: 1426 DFELLSHYEMLPLEEQLEIAHQIGTTRSQILSNFNDLSLGTSFL 1469 >ref|XP_020672482.1| cleavage and polyadenylation specificity factor subunit 1 [Dendrobium catenatum] Length = 1453 Score = 2194 bits (5685), Expect = 0.0 Identities = 1096/1457 (75%), Positives = 1228/1457 (84%), Gaps = 8/1457 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAAFKMMHW TGIENCASGFITH K+RIGPIPN+ Sbjct: 1 MSYAAFKMMHWATGIENCASGFITHSPAELAAQISPIQAVDLESEWPA--KKRIGPIPNV 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTAANV+E+YTVRV E+D R+ ++EPR GG VMDGL GA LELVCHYRLHGN+ESM Sbjct: 59 VVTAANVLELYTVRVHEEDGRSSQLSSEPRNGG-VMDGLSGAHLELVCHYRLHGNIESMV 117 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 +LS GGDE SKRRDSI+L+FQDAKITVLE+DD+ H LRTSSMHCFEGPDW YLKRGR SF Sbjct: 118 ILSSGGDERSKRRDSILLSFQDAKITVLEYDDAAHELRTSSMHCFEGPDWHYLKRGRESF 177 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 +RGP +K DP+GRC G LVYD QM+++KAAQAGQGL D+EPS++G + SV IESS+++N Sbjct: 178 SRGPTMKIDPVGRCVGALVYDLQMVIMKAAQAGQGLAVDDEPSNMGASASVRIESSYIIN 237 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLDM HVKDFTF+HGYIEPVMVIL EKEPTWAGRI+WKHHTCMISALSISTTLKQHPL Sbjct: 238 LRDLDMMHVKDFTFIHGYIEPVMVILQEKEPTWAGRITWKHHTCMISALSISTTLKQHPL 297 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSA NLPHDAYKLL VPSPIGG+LVICANSIHYHSQS SCSLGLN F Q+ENS DMPK Sbjct: 298 IWSAANLPHDAYKLLPVPSPIGGVLVICANSIHYHSQSVSCSLGLNSFCAQLENSLDMPK 357 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 +NF VELDAA ATWLSHDVA+FSSKTGELLLLTLV DGRVVQ+LDL KSKASVLTSGIT+ Sbjct: 358 SNFTVELDAANATWLSHDVAMFSSKTGELLLLTLVNDGRVVQRLDLMKSKASVLTSGITT 417 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 +G SFFFLGSRLGDSLLVQYS G S +S H K+E+ D++ D P+AK+LR MSSDSLQD Sbjct: 418 VGCSFFFLGSRLGDSLLVQYSSGTSLSASVHAKEEAADVDSDAPLAKKLRRMSSDSLQDY 477 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 SGEELSL++TT NSSES QK F+FAVRDSLINVGPLKDF+YGLRINAD NATGI+KQSN Sbjct: 478 TSGEELSLFSTTSNSSESTQKSFTFAVRDSLINVGPLKDFAYGLRINADLNATGISKQSN 537 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVC SGHGKNG++CV+Q S+RP+LITEVELPGCKGIWTVYHKSSR HAVD+ K + ED Sbjct: 538 YELVCASGHGKNGAICVMQHSVRPDLITEVELPGCKGIWTVYHKSSRGHAVDALKPSKED 597 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2001 DE+HAYLIISLE+RTMVLETADDLGEVTE VDYYVQG+TIAAGNLFGRRRVVQIFARGAR Sbjct: 598 DEFHAYLIISLESRTMVLETADDLGEVTERVDYYVQGTTIAAGNLFGRRRVVQIFARGAR 657 Query: 2002 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCTV 2181 ILDGSYMTQEL F VH IADPYVLLKM DGSIQLL GD STCTV Sbjct: 658 ILDGSYMTQELPFGVHNSDHNSSPDSCTVSSASIADPYVLLKMVDGSIQLLIGDLSTCTV 717 Query: 2182 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2361 SI VP+ A+ TDSV+ACTLY DK P+ WLR+AS+DAWLS+G+AE ID NDGSY +QGDI Sbjct: 718 SIYVPSILANSTDSVSACTLYIDKGPDSWLRRASTDAWLSSGIAETIDGNDGSYHEQGDI 777 Query: 2362 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACI----IPH---KAKT-KGLDE 2517 YCL+CYESGTL+I+DVPNF+ VFS DKF+SGKSHL+D + P+ K+KT K + Sbjct: 778 YCLLCYESGTLDIYDVPNFKCVFSCDKFVSGKSHLVDRSVREPFKPYQKIKSKTNKESTD 837 Query: 2518 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2697 +KE + +KVVEL MQRWSG Y RPFLF +LNDGTMLCY AYLYEG + KNE V Sbjct: 838 LGRKETTEKMKVVELVMQRWSGPYCRPFLFAVLNDGTMLCYQAYLYEGLDSYSKNEEVVS 897 Query: 2698 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 2877 P ++ D S+ ASRFRNLRF RV D REE SN+ +P+IT+FKN+ G QGLFLSGSR Sbjct: 898 PNNSVDPSSISASRFRNLRFIRVPTDFTAREEPSNMVIQPKITVFKNIAGYQGLFLSGSR 957 Query: 2878 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 3057 PAW M+CRERLRVHPQLCDGSI AFTVLHNVNCNHG IYVT QGFLKICQLPS NYDNY Sbjct: 958 PAWFMVCRERLRVHPQLCDGSIAAFTVLHNVNCNHGFIYVTFQGFLKICQLPSAFNYDNY 1017 Query: 3058 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDNNQMDQDSASSDS 3237 W V+KIPLR TPHQ+TY EKNLY +I+S PV RP++QVLSS++DQD QM++++ SSD Sbjct: 1018 WAVRKIPLRGTPHQLTYCAEKNLYSIILSFPVARPINQVLSSMLDQDIGQMERENTSSDV 1077 Query: 3238 LQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLAI 3417 LQK+Y++D+FEVR+ EPEKSGG+W+ +ATIPMQ ENALTVR VTL NS T ENE ++AI Sbjct: 1078 LQKTYTLDDFEVRLLEPEKSGGNWDTRATIPMQPCENALTVRMVTLFNSATKENEAIMAI 1137 Query: 3418 GTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASGT 3597 GTAYVQGEDVA+RGRILL +IG++ EN QNLV+EVYSKELKGAISA+ASLQGHLLIASG Sbjct: 1138 GTAYVQGEDVASRGRILLLSIGKNGENSQNLVTEVYSKELKGAISAVASLQGHLLIASGP 1197 Query: 3598 KITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLAK 3777 KITL+KWTGSEL GVAFYD PL+VVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLLAK Sbjct: 1198 KITLHKWTGSELTGVAFYDTPLYVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLLAK 1257 Query: 3778 DFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAHV 3957 DF SLDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLLSRAEFHAGAHV Sbjct: 1258 DFGSLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMVESWKGQKLLSRAEFHAGAHV 1317 Query: 3958 SKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXXX 4137 +KFLRLQMLPT SDR SAT GSDKTNRFA+LF TLDGSIGCVAPL+E Sbjct: 1318 TKFLRLQMLPT-SDRMSATLGSDKTNRFALLFSTLDGSIGCVAPLDELTFRRLQTLQRKL 1376 Query: 4138 VDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTTR 4317 VD VPHVCGLNPR+FRQFRSNGKAH PGPDN++DCELLS YEMLPLE QLEIA+QIGTTR Sbjct: 1377 VDSVPHVCGLNPRSFRQFRSNGKAHHPGPDNIVDCELLSQYEMLPLENQLEIANQIGTTR 1436 Query: 4318 SQILSNLNDLSLGTSFL 4368 +QILSNL D SL TSFL Sbjct: 1437 AQILSNLTDYSLSTSFL 1453 >ref|XP_010277399.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 [Nelumbo nucifera] Length = 1457 Score = 2161 bits (5600), Expect = 0.0 Identities = 1077/1460 (73%), Positives = 1220/1460 (83%), Gaps = 11/1460 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAAFK+MHW TGIENC +GFITH ++ GPIPNL Sbjct: 1 MSYAAFKVMHWATGIENCVAGFITHSSADAAPKLPPLQTDEFESEWTA--RKETGPIPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTA N++EVY VRVQE+D R+ ++ E + GG VM GL GA LELVC Y+LHGNVE+M Sbjct: 59 VVTAGNILEVYVVRVQEEDNRSSRSSTEAKRGG-VMAGLSGASLELVCSYKLHGNVETMA 117 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 VLS+GG + S++RDSI+LAFQDAKI+VLEFDDS HGL SSMHCFEGP+W YLKRGR SF Sbjct: 118 VLSMGGGDGSRKRDSIILAFQDAKISVLEFDDSIHGLCISSMHCFEGPEWHYLKRGRESF 177 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP+VK DP GRCGG LVYD QMI+LK+AQAG G D+E S G T S +ESS+V++ Sbjct: 178 ARGPIVKVDPQGRCGGVLVYDLQMIILKSAQAGYGFAGDDEASGSGSTISARVESSYVIS 237 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLDM+HVKDF FVHGYIEPVMVILHE+E TWAGR+SWKHHTCM+SALSISTTLKQHPL Sbjct: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMVSALSISTTLKQHPL 297 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDAYKLLAVPSPIGG+LVI +N+IHYHSQS SC L LN FAV +++S D+P+ Sbjct: 298 IWSAVNLPHDAYKLLAVPSPIGGVLVIGSNTIHYHSQSVSCVLALNNFAVPLDSSQDIPR 357 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 ++FNVELDAA ATWL HDVA+ S+KTGELLLLTLVYDGRVVQ+L+L+KSKASVLTSGIT+ Sbjct: 358 SSFNVELDAANATWLLHDVAILSTKTGELLLLTLVYDGRVVQRLELSKSKASVLTSGITT 417 Query: 1282 IGNSFFFLGSRLGDSLLVQYS--YGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQ 1455 IGNSFFFLGSRLGDSLLVQY+ G S+ +S HVK+E DIE D P KRLR SD LQ Sbjct: 418 IGNSFFFLGSRLGDSLLVQYTCGMGTSTTTSGHVKEEVGDIETDAPSVKRLRRSPSDPLQ 477 Query: 1456 DVASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQ 1635 D+ GEELSLY + PN+SES QK+FSF VRDSLINVGPLKDFSYGLR+NADPNATGIAKQ Sbjct: 478 DIVGGEELSLYGSAPNNSESVQKIFSFTVRDSLINVGPLKDFSYGLRLNADPNATGIAKQ 537 Query: 1636 SNYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTT 1815 SNYELVCCSGHGKNG+LCVLQQSIRPE+ITEVELPGCKGIWTVYHK++R H DSSK + Sbjct: 538 SNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNSDSSKMVS 597 Query: 1816 EDDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARG 1995 EDDEYHAYLIISLE+RTMVLETAD LGEVTE+V+YYV GST+ AGNLFGRRRVVQIFARG Sbjct: 598 EDDEYHAYLIISLESRTMVLETADLLGEVTETVEYYVLGSTVTAGNLFGRRRVVQIFARG 657 Query: 1996 ARILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTC 2175 AR+LDGSYMTQ++S IADPYVLL+M+DGSIQLL GDPSTC Sbjct: 658 ARVLDGSYMTQDISLGSPNPGSSSGSDNLTVSSASIADPYVLLRMSDGSIQLLIGDPSTC 717 Query: 2176 TVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQG 2355 TVS+ VPA F SL +S++ACTLYHDK PEPWLRK S+DAWLSTG+ E ID DG+ DQG Sbjct: 718 TVSVTVPAVFESLKESISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAPSDQG 777 Query: 2356 DIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACI-----IPHKAKTKGLDET 2520 DIYCLVCYESGTLEIF+VP+F+ VFS DKF+SGK+HL+D I PH ++ K DE Sbjct: 778 DIYCLVCYESGTLEIFEVPSFKCVFSVDKFVSGKTHLVDTVIGEPSKDPHVSRNKNSDEM 837 Query: 2521 A---KKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVA 2691 A KKE N+KVVELAMQRW GQ++RPFLFGIL DGT+ CYHA+LYEG E + K E A Sbjct: 838 AGKVKKENVLNMKVVELAMQRWLGQHTRPFLFGILTDGTVFCYHAFLYEGSENSLKTEEA 897 Query: 2692 VYPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSG 2871 ++ +S+ SR RNLRF RV ++ REE+S L+ RITIFKNVGG QGLF+SG Sbjct: 898 TSLQNSVSLSSISTSRLRNLRFVRVPLESYTREETSGLSTCQRITIFKNVGGYQGLFVSG 957 Query: 2872 SRPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYD 3051 SRPAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSV +YD Sbjct: 958 SRPAWFMICRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVSSYD 1017 Query: 3052 NYWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDN-NQMDQDSAS 3228 NYWPVQKIPL+ATPHQVTYF EKNLYP+IVS+PVV+PL+QVLSSLVDQ+ +Q+D D S Sbjct: 1018 NYWPVQKIPLKATPHQVTYFAEKNLYPLIVSIPVVKPLNQVLSSLVDQEGGHQIDHDGLS 1077 Query: 3229 SDSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETL 3408 D L ++Y++DEFEVRI EPEKSGG W+ K TIPMQ+ E+ALTVR VTL N+TT ENETL Sbjct: 1078 PDELHRTYTVDEFEVRIMEPEKSGGPWQTKVTIPMQSCESALTVRMVTLFNTTTKENETL 1137 Query: 3409 LAIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIA 3588 LAIGTAYVQGEDVAARGR+LLF+IGR+ +NPQNLVSEVYSKELKGAISALASLQGHLLIA Sbjct: 1138 LAIGTAYVQGEDVAARGRVLLFSIGRNTDNPQNLVSEVYSKELKGAISALASLQGHLLIA 1197 Query: 3589 SGTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSL 3768 SG KI L+KWTG+EL GVAF+DAPL+VVSLNIVKNFILLGDIHKSIYFL WKEQGAQL+L Sbjct: 1198 SGPKIILHKWTGTELNGVAFFDAPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1257 Query: 3769 LAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAG 3948 LAKDF +LDCFATEFLIDG TLSLVVSDDQKN+QIFYYAP+M ESWKG KLLSRAEFH G Sbjct: 1258 LAKDFGNLDCFATEFLIDGTTLSLVVSDDQKNVQIFYYAPKMSESWKGHKLLSRAEFHVG 1317 Query: 3949 AHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXX 4128 AHV+KFLRLQMLPTSSDR++A P SDKTNRFA+LFGTLDGSIGC+APL+E Sbjct: 1318 AHVTKFLRLQMLPTSSDRTTAAPSSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQTLQ 1377 Query: 4129 XXXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIG 4308 +D VPHV GLNPRAFRQF SNGKAHRPGP+N++DCELL HYEML LEEQL++AHQIG Sbjct: 1378 RKLIDAVPHVAGLNPRAFRQFHSNGKAHRPGPENIVDCELLCHYEMLQLEEQLDVAHQIG 1437 Query: 4309 TTRSQILSNLNDLSLGTSFL 4368 TTR QILSNLNDLSLGTSFL Sbjct: 1438 TTREQILSNLNDLSLGTSFL 1457 >ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Vitis vinifera] Length = 1442 Score = 2153 bits (5578), Expect = 0.0 Identities = 1076/1459 (73%), Positives = 1223/1459 (83%), Gaps = 10/1459 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAA+KMMHWPTGIENCASGF+TH KR+IGP+PNL Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPT--KRQIGPLPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 +VTAAN++EVY VRVQEDD+R A+AE + GG VM G+ GA LELVC YRLHGNVE+MT Sbjct: 59 IVTAANILEVYMVRVQEDDSRESRASAETKRGG-VMAGISGAALELVCQYRLHGNVETMT 117 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 VL GG + S+RRDSI+LAFQDAKI+VLEFDDS HGLRTSSMHCFEGP+W +LKRG SF Sbjct: 118 VLPSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESF 177 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP+VK DP GRC G LVY QMI+LKA+QAG GLV DEE S G S +ESS+V++ Sbjct: 178 ARGPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVIS 237 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLDM+HVKDFTFVHGYIEPVMVILHE+E TWAGR+SWKHHTCMISALSISTTLKQHPL Sbjct: 238 LRDLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDAYKLL VPSPIGG++VI ANSIHYHSQSASC+L LN +AV +NS +MP+ Sbjct: 298 IWSAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPR 357 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 ++F+VELDAA ATWLS+DVA+ S+KTGELLLLTL YDGRVV +LDL+KS+ASVLTSGI + Sbjct: 358 SSFSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAA 417 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IGNS FFLGSRLGDSLLVQ+ +S S+ VK+E DIEGD+P AKRLR SSD+LQD+ Sbjct: 418 IGNSLFFLGSRLGDSLLVQF----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDM 473 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 +GEELSLY + PNS+E++QK FSF+VRDS INVGPLKDF+YGLRINADP ATGIAKQSN Sbjct: 474 VNGEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSN 533 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVCCSGHGKNG+LC+LQQSIRPE+ITEVELPGCKGIWTVYHK++R H DS+K T+D Sbjct: 534 YELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKD 593 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2001 DEYHAYLIISLE+RTMVLETAD LGEVTESVDYYVQG TI+AGNLFGRRRVVQ++ARGAR Sbjct: 594 DEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGAR 653 Query: 2002 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCTV 2181 ILDG++MTQ+L IADPYVLL+M+DG+IQLL GDPSTCTV Sbjct: 654 ILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTV 703 Query: 2182 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2361 SIN+PA F S S++ACTLYHDK PEPWLRK S+DAWLSTG+ E ID DG+ DQGDI Sbjct: 704 SINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDI 763 Query: 2362 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACII-----PHKAKTKGLDETA- 2523 YC+V YESG LEIFDVPNF VFS DKF+SG +HL+D I+ K +K +E A Sbjct: 764 YCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEAD 823 Query: 2524 --KKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2697 +KE N+KVVELAMQRWSGQ+SRPFLFGIL DGT+LCYHAYLYEG E PK E AV Sbjct: 824 QGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVS 883 Query: 2698 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 2877 ++ ISN ASR RNLRF RV +D REE+ + T PR+T+FKN+GGCQGLFLSGSR Sbjct: 884 AQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSR 943 Query: 2878 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 3057 P W M+ RER+RVHPQLCDGSIVAFTVLHN+NCNHGLIYVTSQGFLKICQLP+V +YDNY Sbjct: 944 PLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNY 1003 Query: 3058 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 3234 WPVQKIPL+ TPHQVTYF EKNLYP+IVSVPV++PL+ VLSSLVDQ+ +Q++ D+ SSD Sbjct: 1004 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSD 1063 Query: 3235 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 3414 L +SYS+DEFEVR+ EPEKSG W+ +ATIPMQ+SENALTVR VTL N+TT ENETLLA Sbjct: 1064 ELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLA 1123 Query: 3415 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 3594 IGTAYVQGEDVAARGR+LLF++G++ +N QNLVSE+YSKELKGAISA+ASLQGHLLIASG Sbjct: 1124 IGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASG 1183 Query: 3595 TKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLL 3771 KI L+KWTG+EL GVAF+DA PL+VVSLNIVKNFILLGDIH+SIYFL WKEQGAQL+LL Sbjct: 1184 PKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLL 1243 Query: 3772 AKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGA 3951 AKDF SLDCFATEFLIDG TLSL+VSDDQKNIQIFYYAP+M ESWKGQKLLSRAEFH GA Sbjct: 1244 AKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1303 Query: 3952 HVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXX 4131 HV+KFLRLQMLP SSDR+SAT GSDKTNRFA+LFGTLDGSIGC+APL+E Sbjct: 1304 HVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1363 Query: 4132 XXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGT 4311 VD VPHV GLNPR+FRQFRSNGKAHRPGPDN++DCELL HYEMLP EEQLEIA QIGT Sbjct: 1364 KLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGT 1423 Query: 4312 TRSQILSNLNDLSLGTSFL 4368 TR QILSNLNDLSLGTSFL Sbjct: 1424 TRMQILSNLNDLSLGTSFL 1442 >emb|CBI24510.3| unnamed protein product, partial [Vitis vinifera] Length = 1448 Score = 2146 bits (5561), Expect = 0.0 Identities = 1076/1465 (73%), Positives = 1223/1465 (83%), Gaps = 16/1465 (1%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAA+KMMHWPTGIENCASGF+TH KR+IGP+PNL Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPT--KRQIGPLPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 +VTAAN++EVY VRVQEDD+R A+AE + GG VM G+ GA LELVC YRLHGNVE+MT Sbjct: 59 IVTAANILEVYMVRVQEDDSRESRASAETKRGG-VMAGISGAALELVCQYRLHGNVETMT 117 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 VL GG + S+RRDSI+LAFQDAKI+VLEFDDS HGLRTSSMHCFEGP+W +LKRG SF Sbjct: 118 VLPSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESF 177 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP+VK DP GRC G LVY QMI+LKA+QAG GLV DEE S G S +ESS+V++ Sbjct: 178 ARGPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVIS 237 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLDM+HVKDFTFVHGYIEPVMVILHE+E TWAGR+SWKHHTCMISALSISTTLKQHPL Sbjct: 238 LRDLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDAYKLL VPSPIGG++VI ANSIHYHSQSASC+L LN +AV +NS +MP+ Sbjct: 298 IWSAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPR 357 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 ++F+VELDAA ATWLS+DVA+ S+KTGELLLLTL YDGRVV +LDL+KS+ASVLTSGI + Sbjct: 358 SSFSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAA 417 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IGNS FFLGSRLGDSLLVQ+ +S S+ VK+E DIEGD+P AKRLR SSD+LQD+ Sbjct: 418 IGNSLFFLGSRLGDSLLVQF----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDM 473 Query: 1462 ASGEELSLYATTPNSSESAQ------KVFSFAVRDSLINVGPLKDFSYGLRINADPNATG 1623 +GEELSLY + PNS+E++Q K FSF+VRDS INVGPLKDF+YGLRINADP ATG Sbjct: 474 VNGEELSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATG 533 Query: 1624 IAKQSNYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSS 1803 IAKQSNYELVCCSGHGKNG+LC+LQQSIRPE+ITEVELPGCKGIWTVYHK++R H DS+ Sbjct: 534 IAKQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADST 593 Query: 1804 KTTTEDDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQI 1983 K T+DDEYHAYLIISLE+RTMVLETAD LGEVTESVDYYVQG TI+AGNLFGRRRVVQ+ Sbjct: 594 KMATKDDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQV 653 Query: 1984 FARGARILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGD 2163 +ARGARILDG++MTQ+L IADPYVLL+M+DG+IQLL GD Sbjct: 654 YARGARILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGD 703 Query: 2164 PSTCTVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSY 2343 PSTCTVSIN+PA F S S++ACTLYHDK PEPWLRK S+DAWLSTG+ E ID DG+ Sbjct: 704 PSTCTVSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAA 763 Query: 2344 PDQGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACII-----PHKAKTKG 2508 DQGDIYC+V YESG LEIFDVPNF VFS DKF+SG +HL+D I+ K +K Sbjct: 764 QDQGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKN 823 Query: 2509 LDETA---KKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPK 2679 +E A +KE N+KVVELAMQRWSGQ+SRPFLFGIL DGT+LCYHAYLYEG E PK Sbjct: 824 SEEEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPK 883 Query: 2680 NEVAVYPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGL 2859 E AV ++ ISN ASR RNLRF RV +D REE+ + T PR+T+FKN+GGCQGL Sbjct: 884 TEEAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGL 943 Query: 2860 FLSGSRPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSV 3039 FLSGSRP W M+ RER+RVHPQLCDGSIVAFTVLHN+NCNHGLIYVTSQGFLKICQLP+V Sbjct: 944 FLSGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAV 1003 Query: 3040 CNYDNYWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQ 3216 +YDNYWPVQKIPL+ TPHQVTYF EKNLYP+IVSVPV++PL+ VLSSLVDQ+ +Q++ Sbjct: 1004 SSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLEN 1063 Query: 3217 DSASSDSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNE 3396 D+ SSD L +SYS+DEFEVR+ EPEKSG W+ +ATIPMQ+SENALTVR VTL N+TT E Sbjct: 1064 DNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1123 Query: 3397 NETLLAIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGH 3576 NETLLAIGTAYVQGEDVAARGR+LLF++G++ +N QNLVSE+YSKELKGAISA+ASLQGH Sbjct: 1124 NETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGH 1183 Query: 3577 LLIASGTKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQG 3753 LLIASG KI L+KWTG+EL GVAF+DA PL+VVSLNIVKNFILLGDIH+SIYFL WKEQG Sbjct: 1184 LLIASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQG 1243 Query: 3754 AQLSLLAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRA 3933 AQL+LLAKDF SLDCFATEFLIDG TLSL+VSDDQKNIQIFYYAP+M ESWKGQKLLSRA Sbjct: 1244 AQLNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRA 1303 Query: 3934 EFHAGAHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXX 4113 EFH GAHV+KFLRLQMLP SSDR+SAT GSDKTNRFA+LFGTLDGSIGC+APL+E Sbjct: 1304 EFHVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1363 Query: 4114 XXXXXXXXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEI 4293 VD VPHV GLNPR+FRQFRSNGKAHRPGPDN++DCELL HYEMLP EEQLEI Sbjct: 1364 LQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEI 1423 Query: 4294 AHQIGTTRSQILSNLNDLSLGTSFL 4368 A QIGTTR QILSNLNDLSLGTSFL Sbjct: 1424 AQQIGTTRMQILSNLNDLSLGTSFL 1448 >ref|XP_019081674.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Vitis vinifera] ref|XP_019081675.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Vitis vinifera] Length = 1449 Score = 2146 bits (5560), Expect = 0.0 Identities = 1076/1466 (73%), Positives = 1223/1466 (83%), Gaps = 17/1466 (1%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAA+KMMHWPTGIENCASGF+TH KR+IGP+PNL Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPT--KRQIGPLPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 +VTAAN++EVY VRVQEDD+R A+AE + GG VM G+ GA LELVC YRLHGNVE+MT Sbjct: 59 IVTAANILEVYMVRVQEDDSRESRASAETKRGG-VMAGISGAALELVCQYRLHGNVETMT 117 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 VL GG + S+RRDSI+LAFQDAKI+VLEFDDS HGLRTSSMHCFEGP+W +LKRG SF Sbjct: 118 VLPSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESF 177 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP+VK DP GRC G LVY QMI+LKA+QAG GLV DEE S G S +ESS+V++ Sbjct: 178 ARGPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVIS 237 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLDM+HVKDFTFVHGYIEPVMVILHE+E TWAGR+SWKHHTCMISALSISTTLKQHPL Sbjct: 238 LRDLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDAYKLL VPSPIGG++VI ANSIHYHSQSASC+L LN +AV +NS +MP+ Sbjct: 298 IWSAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPR 357 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 ++F+VELDAA ATWLS+DVA+ S+KTGELLLLTL YDGRVV +LDL+KS+ASVLTSGI + Sbjct: 358 SSFSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAA 417 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 1461 IGNS FFLGSRLGDSLLVQ+ +S S+ VK+E DIEGD+P AKRLR SSD+LQD+ Sbjct: 418 IGNSLFFLGSRLGDSLLVQF----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDM 473 Query: 1462 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 1641 +GEELSLY + PNS+E++QK FSF+VRDS INVGPLKDF+YGLRINADP ATGIAKQSN Sbjct: 474 VNGEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSN 533 Query: 1642 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 1821 YELVCCSGHGKNG+LC+LQQSIRPE+ITEVELPGCKGIWTVYHK++R H DS+K T+D Sbjct: 534 YELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKD 593 Query: 1822 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFG-------RRRVVQ 1980 DEYHAYLIISLE+RTMVLETAD LGEVTESVDYYVQG TI+AGNLFG RRRVVQ Sbjct: 594 DEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRLIFCLFRRRVVQ 653 Query: 1981 IFARGARILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAG 2160 ++ARGARILDG++MTQ+L IADPYVLL+M+DG+IQLL G Sbjct: 654 VYARGARILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVG 703 Query: 2161 DPSTCTVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGS 2340 DPSTCTVSIN+PA F S S++ACTLYHDK PEPWLRK S+DAWLSTG+ E ID DG+ Sbjct: 704 DPSTCTVSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGA 763 Query: 2341 YPDQGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACII-----PHKAKTK 2505 DQGDIYC+V YESG LEIFDVPNF VFS DKF+SG +HL+D I+ K +K Sbjct: 764 AQDQGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSK 823 Query: 2506 GLDETA---KKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAP 2676 +E A +KE N+KVVELAMQRWSGQ+SRPFLFGIL DGT+LCYHAYLYEG E P Sbjct: 824 NSEEEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTP 883 Query: 2677 KNEVAVYPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQG 2856 K E AV ++ ISN ASR RNLRF RV +D REE+ + T PR+T+FKN+GGCQG Sbjct: 884 KTEEAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQG 943 Query: 2857 LFLSGSRPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPS 3036 LFLSGSRP W M+ RER+RVHPQLCDGSIVAFTVLHN+NCNHGLIYVTSQGFLKICQLP+ Sbjct: 944 LFLSGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPA 1003 Query: 3037 VCNYDNYWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMD 3213 V +YDNYWPVQKIPL+ TPHQVTYF EKNLYP+IVSVPV++PL+ VLSSLVDQ+ +Q++ Sbjct: 1004 VSSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLE 1063 Query: 3214 QDSASSDSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTN 3393 D+ SSD L +SYS+DEFEVR+ EPEKSG W+ +ATIPMQ+SENALTVR VTL N+TT Sbjct: 1064 NDNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTK 1123 Query: 3394 ENETLLAIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQG 3573 ENETLLAIGTAYVQGEDVAARGR+LLF++G++ +N QNLVSE+YSKELKGAISA+ASLQG Sbjct: 1124 ENETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQG 1183 Query: 3574 HLLIASGTKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQ 3750 HLLIASG KI L+KWTG+EL GVAF+DA PL+VVSLNIVKNFILLGDIH+SIYFL WKEQ Sbjct: 1184 HLLIASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQ 1243 Query: 3751 GAQLSLLAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSR 3930 GAQL+LLAKDF SLDCFATEFLIDG TLSL+VSDDQKNIQIFYYAP+M ESWKGQKLLSR Sbjct: 1244 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSR 1303 Query: 3931 AEFHAGAHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXX 4110 AEFH GAHV+KFLRLQMLP SSDR+SAT GSDKTNRFA+LFGTLDGSIGC+APL+E Sbjct: 1304 AEFHVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1363 Query: 4111 XXXXXXXXXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLE 4290 VD VPHV GLNPR+FRQFRSNGKAHRPGPDN++DCELL HYEMLP EEQLE Sbjct: 1364 RLQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLE 1423 Query: 4291 IAHQIGTTRSQILSNLNDLSLGTSFL 4368 IA QIGTTR QILSNLNDLSLGTSFL Sbjct: 1424 IAQQIGTTRMQILSNLNDLSLGTSFL 1449 >ref|XP_015877866.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 [Ziziphus jujuba] Length = 1453 Score = 2133 bits (5527), Expect = 0.0 Identities = 1076/1460 (73%), Positives = 1217/1460 (83%), Gaps = 11/1460 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MS+AAFKMMHWPTGIENCASGFITH +R IGPIPNL Sbjct: 1 MSFAAFKMMHWPTGIENCASGFITHSRADFVPRIPPIQNDDLDSDWSAS-RREIGPIPNL 59 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 VVTA NV+EVY VR+QE+ R+ A+ E R GG VMDGL GA LELVCHYRLHGNVE+M Sbjct: 60 VVTAGNVLEVYVVRIQEESNRSSRASGESRRGG-VMDGLSGASLELVCHYRLHGNVETMA 118 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 VLS GG E S+RRDSI+L+FQDAKI+VL+FDDSTHGLRTSSMHCFEGP WL+LKRGR SF Sbjct: 119 VLSTGGGESSRRRDSIILSFQDAKISVLDFDDSTHGLRTSSMHCFEGPKWLHLKRGRESF 178 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP+VK DP GRCGG LVYD QMI+LKAAQAG GLV DE+ SS GG S IESS+++N Sbjct: 179 ARGPLVKVDPQGRCGGVLVYDFQMIILKAAQAGSGLVVDEDTSSSGGAVSAHIESSYIIN 238 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLDM+H+KDF FVHGYIEPVMVILHE+E TWAGR++WKHHTCM+SALSISTTLKQHPL Sbjct: 239 LRDLDMKHIKDFIFVHGYIEPVMVILHERELTWAGRVAWKHHTCMVSALSISTTLKQHPL 298 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSA NLPHDAYKLLAVPSPIGG+LVI ANSIHYHSQS SC+L LN FAV +++S +MP+ Sbjct: 299 IWSAANLPHDAYKLLAVPSPIGGVLVIGANSIHYHSQSTSCALALNNFAVSVDSSQEMPR 358 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 ++FNVELDAA ATWL +DVA+ S+KTGELLLLT+VYDGRVVQ+LDL+KSKASVLTSGIT+ Sbjct: 359 SSFNVELDAANATWLLNDVALLSTKTGELLLLTIVYDGRVVQRLDLSKSKASVLTSGITT 418 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYG-ASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQD 1458 IGNS FFLGSRLGDSLLVQ++ G SS S+ +KDE DIEGD P AKRLR +SSD+ QD Sbjct: 419 IGNSLFFLGSRLGDSLLVQFTCGVGSSIMSSALKDEVGDIEGDAPSAKRLRRLSSDASQD 478 Query: 1459 VASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQS 1638 +ASGEELSLY + PN++ESAQK FSFAVRDSLINVGP+KDFSYGLR+NAD NATGIAKQS Sbjct: 479 MASGEELSLYGSAPNNTESAQKSFSFAVRDSLINVGPIKDFSYGLRVNADTNATGIAKQS 538 Query: 1639 NYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTE 1818 NYELVCCSGHGKNG+LCVL+QSIRPE+ITEVELPGCKGIWTVYHKS+R H VDS+K+ Sbjct: 539 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSTRGHNVDSAKSAAA 598 Query: 1819 DDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGA 1998 DDEYHAYLIISLE RTMVLETAD L EVTESVDYYVQG TIAAGNLFGRRRVVQ++ RGA Sbjct: 599 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGA 658 Query: 1999 RILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCT 2178 RILDGS+MTQ+LS V IADPYV+L+MTDGSI+LL GDPS+CT Sbjct: 659 RILDGSFMTQDLSIVA--ANSESGSESATVLSVSIADPYVVLRMTDGSIRLLIGDPSSCT 716 Query: 2179 VSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGD 2358 VSI+ PA F S ++ACTLYHD PEPWLRK S+DAWLSTGV E +D DGS DQGD Sbjct: 717 VSISTPAAFESSKKLISACTLYHDDGPEPWLRKTSTDAWLSTGVDEAVDGADGSLHDQGD 776 Query: 2359 IYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACI-----IPHKAKTKGLDET- 2520 IYC+VCYESG+LEI+DVPNF VFS +KFISGK +L+D + P K + ++ Sbjct: 777 IYCVVCYESGSLEIYDVPNFNCVFSVEKFISGKMNLLDTLVEEQSKDPQKLMNRSSEDVS 836 Query: 2521 --AKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAV 2694 A+KE QN+K+VELAMQRWSGQ+SRPFLFGIL+DGT+LCYHAYL+EG E A K E +V Sbjct: 837 GQARKENVQNMKIVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLFEGPESASKTEDSV 896 Query: 2695 YPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGS 2874 S +SN ASR RNLRF RV++D +EE+ N T+ RI+IFKN+ G QGLFLSGS Sbjct: 897 SAQSLSGLSNNSASRLRNLRFVRVALDTYAKEETPNATSCQRISIFKNIAGYQGLFLSGS 956 Query: 2875 RPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDN 3054 RPAW M+ RERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG LKICQLPS+ +YD+ Sbjct: 957 RPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGILKICQLPSITSYDS 1016 Query: 3055 YWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASS 3231 YWPVQKIPL+ TPHQVTYF EKNLYP+IVSVPV +PL+QV+SSL+DQ+ +Q + + SS Sbjct: 1017 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVISSLIDQEVGHQAENHNLSS 1076 Query: 3232 DSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLL 3411 D L ++Y++DEFEVRI EPE SGG W+ KATIPMQ SENALTVR VTL N+TT ENETLL Sbjct: 1077 DDLHRTYTVDEFEVRILEPEISGGPWQTKATIPMQTSENALTVRVVTLFNTTTKENETLL 1136 Query: 3412 AIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIAS 3591 AIGTAYVQGEDVAARGR+LLF+IG NPQNLVSEVY+K+LKGAISALASLQGHLL+AS Sbjct: 1137 AIGTAYVQGEDVAARGRVLLFSIG---NNPQNLVSEVYTKDLKGAISALASLQGHLLMAS 1193 Query: 3592 GTKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSL 3768 G KI L+KWTG EL VAF+D PL+VVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSL Sbjct: 1194 GPKIILHKWTGGELNAVAFFDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSL 1253 Query: 3769 LAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAG 3948 LAKDF SLDCFATEFLIDG TLSLVVSD++KNIQIFYYAP+M ESWKGQKLLSRAEFH G Sbjct: 1254 LAKDFGSLDCFATEFLIDGSTLSLVVSDNRKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1313 Query: 3949 AHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXX 4128 AHV+K LRLQML T+SDR+ SDKTNRFA+LFGTLDGS+GC+APL+E Sbjct: 1314 AHVTKLLRLQMLSTTSDRTGTASVSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQ 1373 Query: 4129 XXXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIG 4308 VD V HV GLNPR+FRQFRSNGKAHRPGPD+++DCELL HYEMLPLEEQLEIAHQIG Sbjct: 1374 KKLVDAVSHVAGLNPRSFRQFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQIG 1433 Query: 4309 TTRSQILSNLNDLSLGTSFL 4368 TTRSQILSNLNDLSLGTSFL Sbjct: 1434 TTRSQILSNLNDLSLGTSFL 1453 >gb|EOY22974.1| Cleavage and polyadenylation specificity factor 160 isoform 1 [Theobroma cacao] Length = 1457 Score = 2128 bits (5514), Expect = 0.0 Identities = 1069/1460 (73%), Positives = 1210/1460 (82%), Gaps = 11/1460 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAA+KMMHWPTGIENCASGF+TH +R IGP+PNL Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHCRADFTPQIPLNQTEDLESEWPA--RRGIGPVPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 +VTAAN++E+Y VRVQE+ R + E + GG V+DG+ G LELVC+YRLHGNVESM Sbjct: 59 IVTAANLLEIYVVRVQEEGRREARNSTEVKRGG-VLDGVSGVSLELVCNYRLHGNVESMA 117 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 VLS+GG + S+RRDSI+LAF+DAKI+VLEFDDS HGLRT+SMHCFEGP+WL+LKRGR SF Sbjct: 118 VLSIGGGDGSRRRDSIILAFKDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESF 177 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP+VK DP GRCGG LVYD QMI+LKA+QAG G V +++ GG S +ESS+++N Sbjct: 178 ARGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIIN 237 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLD++H+KDF FVHGYIEPVMVILHE+E TWAGR+SWKHHTCMISALSISTTLKQHPL Sbjct: 238 LRDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDAYKLLAVPSPIGG+LVI AN+IHYHSQSASC+L LN +A+ ++NS D+P+ Sbjct: 298 IWSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPR 357 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 +NF+VELDAA ATWL +DVA+ S+KTGELLLLTL+YDGRVVQ+LDL+KSKASVLTS IT+ Sbjct: 358 SNFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITT 417 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGAS-SPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQD 1458 IGNS FFLGSRLGDSLLVQ+S G+ S + +K+E DIEGD+P+AKRLR SSD+LQD Sbjct: 418 IGNSLFFLGSRLGDSLLVQFSGGSGVSALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQD 477 Query: 1459 VASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQS 1638 + GEELSLY + PN++ESAQK F FAVRDSL NVGPLKDFSYGLRINAD NATGIAKQS Sbjct: 478 MVGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQS 537 Query: 1639 NYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTE 1818 NYELVCCSGHGKNG+LCVL+QSIRPE+ITEVEL GCKGIWTVYHKS+R+H+ D SK T + Sbjct: 538 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDD 597 Query: 1819 DDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGA 1998 DDEYHAYLIISLE RTMVLETAD L EVTESVDYYVQG TIAAGNLFGRRRVVQ++ RGA Sbjct: 598 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGA 657 Query: 1999 RILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCT 2178 RILDGS+MTQELS IADPYVLL+MTDGSI LL GDP+TCT Sbjct: 658 RILDGSFMTQELSIPSPNSESSPGSENSTVISVSIADPYVLLRMTDGSILLLVGDPATCT 717 Query: 2179 VSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGD 2358 VSIN P F V+ACTLYHDK PEPWLRKAS+DAWLSTGV E ID DG DQGD Sbjct: 718 VSINTPTAFEGSKKMVSACTLYHDKGPEPWLRKASTDAWLSTGVGESIDGADGGPHDQGD 777 Query: 2359 IYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACII-----PHKAKTKGLDE-- 2517 IYC+VCYESG LEIFDVPNF VFS +KF SG++ L+DA + K K +E Sbjct: 778 IYCVVCYESGALEIFDVPNFNCVFSMEKFASGRTRLVDAYTLESSKDSEKVINKSSEELT 837 Query: 2518 -TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAV 2694 +KE QNLKVVELAMQRWS +SRPFLFGIL DGT+LCYHAYL+EG E A K E +V Sbjct: 838 GQGRKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSV 897 Query: 2695 YPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGS 2874 ++ +SN ASR RNLRF R+ +D REE SN T RITIFKN+ G QG FLSGS Sbjct: 898 VAQNSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGS 957 Query: 2875 RPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDN 3054 RPAW M+ RERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG LKICQ+PS NYDN Sbjct: 958 RPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDN 1017 Query: 3055 YWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASS 3231 YWPVQKIPLR TPHQVTYF E+NLYP+IVSVPV +P++QVLSSLVDQ+ +QMD + SS Sbjct: 1018 YWPVQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSS 1077 Query: 3232 DSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLL 3411 D LQ++Y++DEFEVRI EPEKSGG WE KATIPMQ+SENALTVR VTL N+TT ENE+LL Sbjct: 1078 DELQRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLL 1137 Query: 3412 AIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIAS 3591 AIGTAY+QGEDVAARGR++L +IGR+ +N QNLVSEVYSKELKGAISALASLQGHLLIAS Sbjct: 1138 AIGTAYIQGEDVAARGRVILCSIGRNTDNLQNLVSEVYSKELKGAISALASLQGHLLIAS 1197 Query: 3592 GTKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSL 3768 G KI L+ WTGSEL G+AFYDA PL+VVSLNIVKNFILLGD+HKSIYFL WKEQGAQLSL Sbjct: 1198 GPKIILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSL 1257 Query: 3769 LAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAG 3948 LAKDF SLDCFATEFLIDG TLSL+VSD+QKNIQIFYYAP+M ESWKGQKLLSRAEFH G Sbjct: 1258 LAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1317 Query: 3949 AHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXX 4128 AHV+KFLRLQML TSSDR+SAT GSDKTNRFA+LFGTLDGSIGC+APL+E Sbjct: 1318 AHVTKFLRLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1377 Query: 4129 XXXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIG 4308 VD VPHV GLNPR+FRQF SNGKAHRPGPD+++DCELL HYEMLPLEEQL+IAHQIG Sbjct: 1378 KKLVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIG 1437 Query: 4309 TTRSQILSNLNDLSLGTSFL 4368 TTRSQILSNLNDL+LGTSFL Sbjct: 1438 TTRSQILSNLNDLTLGTSFL 1457 >ref|XP_017972864.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Theobroma cacao] Length = 1457 Score = 2127 bits (5512), Expect = 0.0 Identities = 1068/1460 (73%), Positives = 1210/1460 (82%), Gaps = 11/1460 (0%) Frame = +1 Query: 22 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXXXXXXXXXKRRIGPIPNL 201 MSYAA+KMMHWPTGI+NCASGF+TH +R IGP+PNL Sbjct: 1 MSYAAYKMMHWPTGIDNCASGFVTHCRADFTPQIPLNQTEDLESEWPA--RRGIGPVPNL 58 Query: 202 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 381 +VTAAN++E+Y VRVQE+ R + E + GG V+DG+ LELVC+YRLHGNVESM Sbjct: 59 IVTAANLLEIYVVRVQEEGRREARNSTEVKRGG-VLDGVSRVSLELVCNYRLHGNVESMA 117 Query: 382 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 561 VLS+GG + S+RRDSI+LAFQDAKI+VLEFDDS HGLRT+SMHCFEGP+WL+LKRGR SF Sbjct: 118 VLSIGGGDGSRRRDSIILAFQDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESF 177 Query: 562 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 741 ARGP+VK DP GRCGG LVYD QMI+LKA+QAG G V +++ GG S +ESS+++N Sbjct: 178 ARGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIIN 237 Query: 742 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 921 LRDLD++H+KDF FVHGYIEPVMVILHE+E TWAGR+SWKHHTCMISALSISTTLKQHPL Sbjct: 238 LRDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297 Query: 922 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 1101 IWSAVNLPHDAYKLLAVPSPIGG+LVI AN+IHYHSQSASC+L LN +A+ ++NS D+P+ Sbjct: 298 IWSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPR 357 Query: 1102 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 1281 +NF+VELDAA ATWL +DVA+ S+KTGELLLLTL+YDGRVVQ+LDL+KSKASVLTS IT+ Sbjct: 358 SNFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITT 417 Query: 1282 IGNSFFFLGSRLGDSLLVQYSYGA-SSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQD 1458 IGNS FFLGSRLGDSLLVQ+S G+ +S + +K+E DIEGD+P+AKRLR SSD+LQD Sbjct: 418 IGNSLFFLGSRLGDSLLVQFSGGSGASALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQD 477 Query: 1459 VASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQS 1638 + GEELSLY + PN++ESAQK F FAVRDSL NVGPLKDFSYGLRINAD NATGIAKQS Sbjct: 478 MVGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQS 537 Query: 1639 NYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTE 1818 NYELVCCSGHGKNG+LCVL+QSIRPE+ITEVEL GCKGIWTVYHKS+R+H+ D SK T + Sbjct: 538 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDD 597 Query: 1819 DDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGA 1998 DDEYHAYLIISLE RTMVLETAD L EVTESVDYYVQG TIAAGNLFGRRRVVQ++ RGA Sbjct: 598 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGA 657 Query: 1999 RILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXXIADPYVLLKMTDGSIQLLAGDPSTCT 2178 RILDGS+MTQELS IADPYVLL+MTDGSI LL GDP+TCT Sbjct: 658 RILDGSFMTQELSIPSPNSESSPGSENSTVISVSIADPYVLLRMTDGSILLLVGDPATCT 717 Query: 2179 VSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGD 2358 VSIN P F V+ACTLYHDK PEPWLRKAS+DAWLSTGV E ID DG DQGD Sbjct: 718 VSINTPTAFEGSKKMVSACTLYHDKGPEPWLRKASTDAWLSTGVGESIDGADGGPHDQGD 777 Query: 2359 IYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACII-----PHKAKTKGLDE-- 2517 IYC+VCYESG LEIFDVPNF VFS + F SG++ L+DA + K K +E Sbjct: 778 IYCVVCYESGALEIFDVPNFNCVFSMENFSSGRTRLVDAYTLESSKDSEKVINKSSEELT 837 Query: 2518 -TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAV 2694 +KE QNLKVVELAMQRWS +SRPFLFGIL DGT+LCYHAYL+EG E A K E +V Sbjct: 838 GQGRKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSV 897 Query: 2695 YPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGS 2874 ++ +SN ASR RNLRF R+ +D REE SN T RITIFKN+ G QG FLSGS Sbjct: 898 VAQNSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGS 957 Query: 2875 RPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDN 3054 RPAW M+ RERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG LKICQ+PS NYDN Sbjct: 958 RPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDN 1017 Query: 3055 YWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASS 3231 YWPVQKIPLR TPHQVTYF E+NLYP+IVSVPV +P++QVLSSLVDQ+ +QMD + SS Sbjct: 1018 YWPVQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSS 1077 Query: 3232 DSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLL 3411 D LQ++Y++DEFEVRI EPEKSGG WE KATIPMQ+SENALTVR VTL N+TT ENE+LL Sbjct: 1078 DELQRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLL 1137 Query: 3412 AIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIAS 3591 AIGTAY+QGEDVAARGR++L +IGR+ +NPQNLVSEVYSKELKGAISALASLQGHLLIAS Sbjct: 1138 AIGTAYIQGEDVAARGRVILCSIGRNTDNPQNLVSEVYSKELKGAISALASLQGHLLIAS 1197 Query: 3592 GTKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSL 3768 G KI L+ WTGSEL G+AFYDA PL+VVSLNIVKNFILLGD+HKSIYFL WKEQGAQLSL Sbjct: 1198 GPKIILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSL 1257 Query: 3769 LAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAG 3948 LAKDF SLDCFATEFLIDG TLSL+VSD+QKNIQIFYYAP+M ESWKGQKLLSRAEFH G Sbjct: 1258 LAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1317 Query: 3949 AHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXX 4128 AHV+KFLRLQML TSSDR+SAT GSDKTNRFA+LFGTLDGSIGC+APL+E Sbjct: 1318 AHVTKFLRLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1377 Query: 4129 XXXVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIG 4308 VD VPHV GLNPR+FRQF SNGKAHRPGPD+++DCELL HYEMLPLEEQL+IAHQIG Sbjct: 1378 KKLVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIG 1437 Query: 4309 TTRSQILSNLNDLSLGTSFL 4368 TTRSQILSNLNDL+LGTSFL Sbjct: 1438 TTRSQILSNLNDLTLGTSFL 1457