BLASTX nr result
ID: Angelica27_contig00024992
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00024992 (1865 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

Sequences producing significant alignments:    Score (bits)    E Value

XP_017247188.1 PREDICTED: cleavage and polyadenylation specifici...    1062    0.0
XP_017235464.1 PREDICTED: cleavage and polyadenylation specifici...    1056    0.0
XP_017972869.1 PREDICTED: cleavage and polyadenylation specifici...    890    0.0
XP_017972868.1 PREDICTED: cleavage and polyadenylation specifici...    890    0.0
EOY22974.1 Cleavage and polyadenylation specificity factor 160 i...    892    0.0
XP_017972867.1 PREDICTED: cleavage and polyadenylation specifici...    890    0.0
XP_017972864.1 PREDICTED: cleavage and polyadenylation specifici...    890    0.0
XP_017649186.1 PREDICTED: cleavage and polyadenylation specifici...    877    0.0
XP_017972865.1 PREDICTED: cleavage and polyadenylation specifici...    884    0.0
XP_012484368.1 PREDICTED: cleavage and polyadenylation specifici...    880    0.0
XP_007220310.1 hypothetical protein PRUPE_ppa000211mg [Prunus pe...    880    0.0
XP_017649185.1 PREDICTED: cleavage and polyadenylation specifici...    877    0.0
XP_016672502.1 PREDICTED: cleavage and polyadenylation specifici...    876    0.0
XP_016668425.1 PREDICTED: cleavage and polyadenylation specifici...    876    0.0
XP_008234350.1 PREDICTED: cleavage and polyadenylation specifici...    876    0.0
XP_015877866.1 PREDICTED: cleavage and polyadenylation specifici...    872    0.0
XP_015965921.1 PREDICTED: cleavage and polyadenylation specifici...    870    0.0
GAV61868.1 CPSF_A domain-containing protein/MMS1_N domain-contai...    870    0.0
XP_016204143.1 PREDICTED: cleavage and polyadenylation specifici...
868 0.0 KCW46268.1 hypothetical protein EUGRSUZ_K00143 [Eucalyptus grandis] 866 0.0 >XP_017247188.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Daucus carota subsp. sativus] KZM96626.1 hypothetical protein DCAR_016012 [Daucus carota subsp. sativus] Length = 1446 Score = 1062 bits (2747), Expect = 0.0 Identities = 541/594 (91%), Positives = 559/594 (94%), Gaps = 6/594 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCNHLP---AIQTQDSSDWLSIKPNVITSIPNLVVT 1594 MSYAAFKMMHSPTAIETCASGYITH N LP +IQT+DS DWLSIKPNV SIPNLVVT Sbjct: 1 MSYAAFKMMHSPTAIETCASGYITHSNQLPKLPSIQTEDS-DWLSIKPNVTASIPNLVVT 59 Query: 1593 AANVLQVYVVRVSDDSV---KGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAILPC 1423 AANVL+VYVVRVS+DS KGS KRGGVMDGVSGASLELVCHYRLHGNIYSMAILPC Sbjct: 60 AANVLEVYVVRVSEDSGGSGKGSVVDKRGGVMDGVSGASLELVCHYRLHGNIYSMAILPC 119 Query: 1422 XXXXXXXXDSIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFPTGPL 1243 DSIILTFDDAKISVLEFDDSVHGL TSSMHCFEGPEWL+L+RGRE FPTGPL Sbjct: 120 GGADGRRRDSIILTFDDAKISVLEFDDSVHGLRTSSMHCFEGPEWLYLRRGRENFPTGPL 179 Query: 1242 VKVDPQGRCAGVLVYGLQMIILKAAQAGGFVGDDSTLGTGGACCARVESSYIISLRDLEM 1063 VKVDPQGRCAGVLVYGLQMI+LKA+QAGGFVGDDSTLG GGA CAR+ESSYIISLRD+EM Sbjct: 180 VKVDPQGRCAGVLVYGLQMIVLKASQAGGFVGDDSTLGAGGASCARIESSYIISLRDMEM 239 Query: 1062 KHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLIWSASN 883 KHVKDFVFINGYIEPVLVIL+EHELTWAGRVSWKHHTCGISALSIST+LKQHPLIWSASN Sbjct: 240 KHVKDFVFINGYIEPVLVILYEHELTWAGRVSWKHHTCGISALSISTTLKQHPLIWSASN 299 Query: 882 LPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRSNFNLE 703 LPHEAYKLLAVPSPIGGVIVISTN+IHYHSQSASC+LALNNFAVSVDGSQETTRSNF+LE Sbjct: 300 LPHEAYKLLAVPSPIGGVIVISTNSIHYHSQSASCILALNNFAVSVDGSQETTRSNFSLE 359 Query: 702 LDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTIGNSLF 523 LDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTIGNSLF Sbjct: 360 LDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTIGNSLF 419 Query: 522 FLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDMVNGDE 343 
FLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDID D+H AKR+RRSSSDALQDMVN DE Sbjct: 420 FLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDVDVHQAKRLRRSSSDALQDMVN-DE 478 Query: 342 LSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSNYELVC 163 LSLY SGPNNAEST K FSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSNYELVC Sbjct: 479 LSLYGSGPNNAESTEKIFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSNYELVC 538 Query: 162 CSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 CSGHGKNGALCVLQ+SIRPEVITQEPIPGCK LWTVYHKTSRSHTIDSSKM SD Sbjct: 539 CSGHGKNGALCVLQKSIRPEVITQEPIPGCKGLWTVYHKTSRSHTIDSSKMASD 592 >XP_017235464.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Daucus carota subsp. sativus] KZN04680.1 hypothetical protein DCAR_005517 [Daucus carota subsp. sativus] Length = 1446 Score = 1056 bits (2730), Expect = 0.0 Identities = 540/595 (90%), Positives = 558/595 (93%), Gaps = 7/595 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCNHLP---AIQTQDSSDWLSIKPNVITSIPNLVVT 1594 MSYAAFKMMHSPT+IETCASGYITH N LP +IQT+DS DWLSIKPNV SIPNLVVT Sbjct: 1 MSYAAFKMMHSPTSIETCASGYITHSNQLPKLPSIQTEDS-DWLSIKPNVTASIPNLVVT 59 Query: 1593 AANVLQVYVVRVSDDSV---KGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAILPC 1423 AANVL+VYVVRVS+DS KGS KRGGVMDG+SGASLELVCHYRLHGNIYSMAILPC Sbjct: 60 AANVLEVYVVRVSEDSGGSGKGSVVDKRGGVMDGISGASLELVCHYRLHGNIYSMAILPC 119 Query: 1422 XXXXXXXXDSIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFPTGPL 1243 DSIILTFDDAKISVLEFDDSVHGL TSSMHCFEGPEWL+L+RGRE FPTGPL Sbjct: 120 GGADGRRRDSIILTFDDAKISVLEFDDSVHGLRTSSMHCFEGPEWLYLRRGRENFPTGPL 179 Query: 1242 VKVDPQGRCAGVLVYGLQMIILKAAQAGGFVGDDSTLGTGGACCARVESSYIISLRDLEM 1063 VKVDPQGRCAGVLVYGLQMI+LKA+QAGGFVGDDSTLG GGA CAR+ESSYIISLRDLEM Sbjct: 180 VKVDPQGRCAGVLVYGLQMIVLKASQAGGFVGDDSTLGAGGASCARIESSYIISLRDLEM 239 Query: 1062 KHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLIWSASN 883 KHVKDFVFINGYIEPVLVIL EHELTWAGRVSWKHHTCGISALSIST+LKQHPLIWSASN Sbjct: 240 KHVKDFVFINGYIEPVLVILFEHELTWAGRVSWKHHTCGISALSISTTLKQHPLIWSASN 299 Query: 882 
LPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRSNFNLE 703 LPHEAYKLLAVPSPIGGVIVISTN+IHYHSQSASC+LALNNFAVSVDGSQETTRSNF+LE Sbjct: 300 LPHEAYKLLAVPSPIGGVIVISTNSIHYHSQSASCILALNNFAVSVDGSQETTRSNFSLE 359 Query: 702 LDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTIGNSLF 523 LDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTIGNSLF Sbjct: 360 LDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTIGNSLF 419 Query: 522 FLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDMVNGDE 343 FLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDID D+H AKR+RRSSSDALQDMVN DE Sbjct: 420 FLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDVDVHQAKRLRRSSSDALQDMVN-DE 478 Query: 342 LSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSNYELVC 163 LSLY SGPNNAEST K FSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSNYELVC Sbjct: 479 LSLYGSGPNNAESTEKIFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSNYELVC 538 Query: 162 CSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTID-SSKMTSD 1 CSGHGKNGALCVLQ+SIRPEVITQEPIPGCK LWTVYHKTSRSHTID SSKM SD Sbjct: 539 CSGHGKNGALCVLQKSIRPEVITQEPIPGCKGLWTVYHKTSRSHTIDSSSKMASD 593 >XP_017972869.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X5 [Theobroma cacao] Length = 1253 Score = 890 bits (2299), Expect = 0.0 Identities = 447/599 (74%), Positives = 508/599 (84%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT I+ CASG++THC +P QT+D S+W + + I +PNL Sbjct: 1 MSYAAYKMMHWPTGIDNCASGFVTHCRADFTPQIPLNQTEDLESEWPARRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDS---VKGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAAN+L++YVVRV ++ + S+E KRGGV+DGVS SLELVC+YRLHGN+ SMA+ Sbjct: 59 IVTAANLLEIYVVRVQEEGRREARNSTEVKRGGVLDGVSRVSLELVCNYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIIL F DAKISVLEFDDS+HGL T+SMHCFEGPEWLHLKRGRE+F Sbjct: 119 LSIGGGDGSRRRDSIILAFQDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVKVDPQGRC GVLVY 
LQMIILKA+QAG GFVG+D G+GGA ARVESSYII+L Sbjct: 179 RGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL++KH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA NLPH+AYKLLAVPSPIGGV+VIS NTIHYHSQSASC LALNN+A+SVD SQ+ RS Sbjct: 299 WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRS 358 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 NF++ELDAANATWL NDVA+LSTKTGELLLL L+YDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 NFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSLFFL SRLGDSLLVQF+ G GAS LP G+KEEVGDI+GD+ LAKR+RRSSSDALQDM Sbjct: 419 GNSLFFLGSRLGDSLLVQFSGGSGASALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDM 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V G+ELSLY S PNN ES KTF F VRDSL NVGPLKDFSYGLRINAD NATGIAKQSN Sbjct: 479 VGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSN 538 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++RSH+ D SK+T D Sbjct: 539 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDD 597 >XP_017972868.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X4 [Theobroma cacao] Length = 1292 Score = 890 bits (2299), Expect = 0.0 Identities = 447/599 (74%), Positives = 508/599 (84%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT I+ CASG++THC +P QT+D S+W + + I +PNL Sbjct: 1 MSYAAYKMMHWPTGIDNCASGFVTHCRADFTPQIPLNQTEDLESEWPARRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDS---VKGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAAN+L++YVVRV ++ + S+E KRGGV+DGVS SLELVC+YRLHGN+ SMA+ Sbjct: 59 
IVTAANLLEIYVVRVQEEGRREARNSTEVKRGGVLDGVSRVSLELVCNYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIIL F DAKISVLEFDDS+HGL T+SMHCFEGPEWLHLKRGRE+F Sbjct: 119 LSIGGGDGSRRRDSIILAFQDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVKVDPQGRC GVLVY LQMIILKA+QAG GFVG+D G+GGA ARVESSYII+L Sbjct: 179 RGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL++KH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA NLPH+AYKLLAVPSPIGGV+VIS NTIHYHSQSASC LALNN+A+SVD SQ+ RS Sbjct: 299 WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRS 358 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 NF++ELDAANATWL NDVA+LSTKTGELLLL L+YDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 NFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSLFFL SRLGDSLLVQF+ G GAS LP G+KEEVGDI+GD+ LAKR+RRSSSDALQDM Sbjct: 419 GNSLFFLGSRLGDSLLVQFSGGSGASALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDM 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V G+ELSLY S PNN ES KTF F VRDSL NVGPLKDFSYGLRINAD NATGIAKQSN Sbjct: 479 VGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSN 538 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++RSH+ D SK+T D Sbjct: 539 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDD 597 >EOY22974.1 Cleavage and polyadenylation specificity factor 160 isoform 1 [Theobroma cacao] Length = 1457 Score = 892 bits (2305), Expect = 0.0 Identities = 448/599 (74%), Positives = 508/599 (84%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 
MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT IE CASG++THC +P QT+D S+W + + I +PNL Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHCRADFTPQIPLNQTEDLESEWPARRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDS---VKGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAAN+L++YVVRV ++ + S+E KRGGV+DGVSG SLELVC+YRLHGN+ SMA+ Sbjct: 59 IVTAANLLEIYVVRVQEEGRREARNSTEVKRGGVLDGVSGVSLELVCNYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIIL F DAKISVLEFDDS+HGL T+SMHCFEGPEWLHLKRGRE+F Sbjct: 119 LSIGGGDGSRRRDSIILAFKDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVKVDPQGRC GVLVY LQMIILKA+QAG GFVG+D G+GGA ARVESSYII+L Sbjct: 179 RGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL++KH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA NLPH+AYKLLAVPSPIGGV+VIS NTIHYHSQSASC LALNN+A+SVD SQ+ RS Sbjct: 299 WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRS 358 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 NF++ELDAANATWL NDVA+LSTKTGELLLL L+YDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 NFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSLFFL SRLGDSLLVQF+ G G S LP G+KEEVGDI+GD+ LAKR+RRSSSDALQDM Sbjct: 419 GNSLFFLGSRLGDSLLVQFSGGSGVSALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDM 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V G+ELSLY S PNN ES KTF F VRDSL NVGPLKDFSYGLRINAD NATGIAKQSN Sbjct: 479 VGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSN 538 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++RSH+ 
D SK+T D Sbjct: 539 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDD 597 >XP_017972867.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X3 [Theobroma cacao] Length = 1395 Score = 890 bits (2299), Expect = 0.0 Identities = 447/599 (74%), Positives = 508/599 (84%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT I+ CASG++THC +P QT+D S+W + + I +PNL Sbjct: 1 MSYAAYKMMHWPTGIDNCASGFVTHCRADFTPQIPLNQTEDLESEWPARRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDS---VKGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAAN+L++YVVRV ++ + S+E KRGGV+DGVS SLELVC+YRLHGN+ SMA+ Sbjct: 59 IVTAANLLEIYVVRVQEEGRREARNSTEVKRGGVLDGVSRVSLELVCNYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIIL F DAKISVLEFDDS+HGL T+SMHCFEGPEWLHLKRGRE+F Sbjct: 119 LSIGGGDGSRRRDSIILAFQDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVKVDPQGRC GVLVY LQMIILKA+QAG GFVG+D G+GGA ARVESSYII+L Sbjct: 179 RGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL++KH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA NLPH+AYKLLAVPSPIGGV+VIS NTIHYHSQSASC LALNN+A+SVD SQ+ RS Sbjct: 299 WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRS 358 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 NF++ELDAANATWL NDVA+LSTKTGELLLL L+YDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 NFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSLFFL SRLGDSLLVQF+ G GAS LP G+KEEVGDI+GD+ LAKR+RRSSSDALQDM Sbjct: 419 
GNSLFFLGSRLGDSLLVQFSGGSGASALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDM 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V G+ELSLY S PNN ES KTF F VRDSL NVGPLKDFSYGLRINAD NATGIAKQSN Sbjct: 479 VGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSN 538 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++RSH+ D SK+T D Sbjct: 539 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDD 597 >XP_017972864.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Theobroma cacao] Length = 1457 Score = 890 bits (2299), Expect = 0.0 Identities = 447/599 (74%), Positives = 508/599 (84%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT I+ CASG++THC +P QT+D S+W + + I +PNL Sbjct: 1 MSYAAYKMMHWPTGIDNCASGFVTHCRADFTPQIPLNQTEDLESEWPARRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDS---VKGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAAN+L++YVVRV ++ + S+E KRGGV+DGVS SLELVC+YRLHGN+ SMA+ Sbjct: 59 IVTAANLLEIYVVRVQEEGRREARNSTEVKRGGVLDGVSRVSLELVCNYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIIL F DAKISVLEFDDS+HGL T+SMHCFEGPEWLHLKRGRE+F Sbjct: 119 LSIGGGDGSRRRDSIILAFQDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVKVDPQGRC GVLVY LQMIILKA+QAG GFVG+D G+GGA ARVESSYII+L Sbjct: 179 RGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL++KH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA NLPH+AYKLLAVPSPIGGV+VIS NTIHYHSQSASC LALNN+A+SVD SQ+ RS Sbjct: 299 WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRS 358 Query: 717 
NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 NF++ELDAANATWL NDVA+LSTKTGELLLL L+YDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 NFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSLFFL SRLGDSLLVQF+ G GAS LP G+KEEVGDI+GD+ LAKR+RRSSSDALQDM Sbjct: 419 GNSLFFLGSRLGDSLLVQFSGGSGASALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDM 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V G+ELSLY S PNN ES KTF F VRDSL NVGPLKDFSYGLRINAD NATGIAKQSN Sbjct: 479 VGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSN 538 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++RSH+ D SK+T D Sbjct: 539 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDD 597 >XP_017649186.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Gossypium arboreum] Length = 1198 Score = 877 bits (2265), Expect = 0.0 Identities = 443/599 (73%), Positives = 500/599 (83%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHC-----NHLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT IE CASG++T+C +P T+D SDW S + I +PNL Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTNCLADFTPQIPLNHTEDLESDWSSRRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDSVK---GSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAANVL++YVVRV ++ + S+E KRGG+MDGVS SLELVC YRLHGN+ SMA+ Sbjct: 59 IVTAANVLELYVVRVQEEGTREARNSTEVKRGGIMDGVSAVSLELVCSYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIILTF DAKI+VLEFDDS H L TSSMHCFEGPEW HLKRGRE+F Sbjct: 119 LSIGGGDVSRRRDSIILTFQDAKIAVLEFDDSTHSLQTSSMHCFEGPEWFHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVK DPQGRC+GVLVYGLQMIILKAAQAG GFVG+D G+G ARVESSYII+L Sbjct: 179 RGPLVKADPQGRCSGVLVYGLQMIILKAAQAGSGFVGEDDAFGSGATVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 
RDL+MKH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDMKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA+NLPH+AYKLLAVPSPIGGV+V+S N IHYHSQSASC LALN++A SVD SQE RS Sbjct: 299 WSAANLPHDAYKLLAVPSPIGGVLVLSANMIHYHSQSASCALALNSYAASVDNSQELPRS 358 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 +FN+ELDAANATWL NDVA+LS+KTGELLLL LVYDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 SFNVELDAANATWLLNDVALLSSKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSL FL SRLGDSLLVQF+SG GASTLP G+KEEVGDI+GD+ LAKR+RRSSSDALQD Sbjct: 419 GNSLVFLGSRLGDSLLVQFSSGSGASTLPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDA 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V +ELSLY S PNN+ES K F F VRDSLINVGPLKDFSYGLR+NAD NATGIAKQSN Sbjct: 479 VGSEELSLYGSTPNNSESAQKAFLFAVRDSLINVGPLKDFSYGLRVNADANATGIAKQSN 538 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++R H DSSK+ D Sbjct: 539 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRGHNADSSKLADD 597 >XP_017972865.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X2 [Theobroma cacao] Length = 1456 Score = 884 bits (2285), Expect = 0.0 Identities = 446/599 (74%), Positives = 508/599 (84%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT I+ CASG++THC +P QT+D S+W + + I +PNL Sbjct: 1 MSYAAYKMMHWPTGIDNCASGFVTHCRADFTPQIPLNQTEDLESEWPARRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDS---VKGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAAN+L++YVVRV ++ + S+E KRGGV+DGVS SLELVC+YRLHGN+ SMA+ Sbjct: 59 IVTAANLLEIYVVRVQEEGRREARNSTEVKRGGVLDGVSRVSLELVCNYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIIL F DAKISVLEFDDS+HGL T+SMHCFEGPEWLHLKRGRE+F Sbjct: 119 
LSIGGGDGSRRRDSIILAFQDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVKVDPQGRC GVLVY LQMIILKA+QAG GFVG+D G+GGA ARVESSYII+L Sbjct: 179 RGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL++KH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA NLPH+AYKLLAVPSPIGGV+VIS NTIHYHSQSASC LALNN+A+SVD SQ+ RS Sbjct: 299 WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRS 358 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 NF++ELDAANATWL NDVA+LSTKTGELLLL L+YDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 NFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSLFFL SRLGDSLLVQF+ G GAS LP G+KEEVGDI+GD+ LAKR+RRSSSDALQDM Sbjct: 419 GNSLFFLGSRLGDSLLVQFSGGSGASALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDM 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V G+ELSLY S PNN ES +TF F VRDSL NVGPLKDFSYGLRINAD NATGIAKQSN Sbjct: 479 VGGEELSLYGSAPNNTESA-QTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSN 537 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++RSH+ D SK+T D Sbjct: 538 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDD 596 >XP_012484368.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Gossypium raimondii] KJB34434.1 hypothetical protein B456_006G065300 [Gossypium raimondii] Length = 1456 Score = 880 bits (2275), Expect = 0.0 Identities = 446/599 (74%), Positives = 500/599 (83%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT IE CASG++T+C +P T+D SDW S + I +PNL 
Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTNCRADFTPQIPLNHTEDLESDWSSRRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDSVK---GSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAANVL++YVVRV ++ + S+E KRGG+MDGVS SLELVC YRLHGN+ SMA+ Sbjct: 59 IVTAANVLELYVVRVQEEGTREARNSTEVKRGGIMDGVSAVSLELVCSYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIILTF DAKI+VLEFDDS H L TSSMHCFEGPEWLHLKRGRE+F Sbjct: 119 LSIGGGDVSRRRDSIILTFQDAKIAVLEFDDSTHSLQTSSMHCFEGPEWLHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVK DPQGRC+GVLVYGLQMIILKAAQAG GFVG+D G+G ARVESSYII+L Sbjct: 179 RGPLVKADPQGRCSGVLVYGLQMIILKAAQAGSGFVGEDDAFGSGATVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL+MKH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDMKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA+NLPH+AYKLLAVPSPIGGV+VIS N IHYHSQSA+C LALNN+A SVD SQE RS Sbjct: 299 WSAANLPHDAYKLLAVPSPIGGVLVISANMIHYHSQSATCALALNNYAASVDNSQELPRS 358 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 +FN+ELDAANATWL NDVA+LS KTGELLLL LVYDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 SFNVELDAANATWLLNDVALLSAKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSL FL SRLGDSLLVQF+SG GASTLP G+KEEVGDI+GD+ LAKR+RRSSSDALQD Sbjct: 419 GNSLVFLGSRLGDSLLVQFSSGSGASTLPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDA 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V +ELSLY S PNN+ES K F F VRDSLINVGPLKDFSYGLRINAD NATGIAKQSN Sbjct: 479 VGSEELSLYGSTPNNSESAQKAFLFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 538 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++R H DSSK+ D Sbjct: 539 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRGHNADSSKLADD 597 >XP_007220310.1 hypothetical protein 
PRUPE_ppa000211mg [Prunus persica] ONI25129.1 hypothetical protein PRUPE_2G282700 [Prunus persica] Length = 1459 Score = 880 bits (2273), Expect = 0.0 Identities = 446/599 (74%), Positives = 504/599 (84%), Gaps = 12/599 (2%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MS+AA+KMMH PT IE CASG+I+H +P IQT+D S+W + + I IP+L Sbjct: 1 MSFAAYKMMHWPTGIENCASGFISHSRSDFVPRIPPIQTEDLESEWPTSRRE-IGPIPDL 59 Query: 1602 VVTAANVLQVYVVRVSD-DSVKG---SSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMA 1435 VVTA NVL+VYVVRV + D +G S E KRGG+MDGVSGASLELVCHYRLHGN+ +MA Sbjct: 60 VVTAGNVLEVYVVRVQEEDGTRGPRASGEPKRGGLMDGVSGASLELVCHYRLHGNVVTMA 119 Query: 1434 ILPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETF 1258 +L SIILTF+DAKISVLEFDDS+HGL TSSMHCFEGPEWLHL+RGRE+F Sbjct: 120 VLSSGGGDGSRRRDSIILTFEDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLRRGRESF 179 Query: 1257 PTGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIIS 1081 GPLVKVDPQGRC +LVYGLQMIILKA+Q G G VGDD + G+GGA +R+ESSYI++ Sbjct: 180 ARGPLVKVDPQGRCGSILVYGLQMIILKASQGGSGLVGDDDSFGSGGAISSRIESSYIVN 239 Query: 1080 LRDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPL 901 LRD++MKHVKDF F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPL Sbjct: 240 LRDMDMKHVKDFTFLHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 299 Query: 900 IWSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTR 721 IWSA NLPH+AYKLLAVPSPIGGV+VIS N+IHYHSQSASC LALN++AVS D SQE R Sbjct: 300 IWSAVNLPHDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSYAVSADNSQEMPR 359 Query: 720 SNFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITT 541 S+F +ELD ANATWL NDVA+LSTKTGELLLL LVYDGRVVQRLDLSKSKASVLTSGIT Sbjct: 360 SSFTVELDTANATWLLNDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITK 419 Query: 540 IGNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQD 361 +GNSLFFL SRLGDSLLVQFT GVG S L MK+EVGDI+GD LAKR+R SSSDALQD Sbjct: 420 VGNSLFFLGSRLGDSLLVQFTCGVGGSVLSSDMKDEVGDIEGDAPLAKRLRMSSSDALQD 479 Query: 360 MVNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQS 181 MV+G+ELSLY S 
PNNAES K+FSF VRDSLINVGPLKDFSYGLRINAD NATGIAKQS Sbjct: 480 MVSGEELSLYGSAPNNAESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 539 Query: 180 NYELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTS 4 NYELVCCSGHGKNGALCVL+QSIRPE+IT+ +PGCK +WTVYHK +R H DSSK+ + Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKNARGHNADSSKIAA 598 >XP_017649185.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Gossypium arboreum] Length = 1456 Score = 877 bits (2265), Expect = 0.0 Identities = 443/599 (73%), Positives = 500/599 (83%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHC-----NHLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT IE CASG++T+C +P T+D SDW S + I +PNL Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTNCLADFTPQIPLNHTEDLESDWSSRRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDSVK---GSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAANVL++YVVRV ++ + S+E KRGG+MDGVS SLELVC YRLHGN+ SMA+ Sbjct: 59 IVTAANVLELYVVRVQEEGTREARNSTEVKRGGIMDGVSAVSLELVCSYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIILTF DAKI+VLEFDDS H L TSSMHCFEGPEW HLKRGRE+F Sbjct: 119 LSIGGGDVSRRRDSIILTFQDAKIAVLEFDDSTHSLQTSSMHCFEGPEWFHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVK DPQGRC+GVLVYGLQMIILKAAQAG GFVG+D G+G ARVESSYII+L Sbjct: 179 RGPLVKADPQGRCSGVLVYGLQMIILKAAQAGSGFVGEDDAFGSGATVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL+MKH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDMKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA+NLPH+AYKLLAVPSPIGGV+V+S N IHYHSQSASC LALN++A SVD SQE RS Sbjct: 299 WSAANLPHDAYKLLAVPSPIGGVLVLSANMIHYHSQSASCALALNSYAASVDNSQELPRS 358 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 +FN+ELDAANATWL NDVA+LS+KTGELLLL LVYDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 
SFNVELDAANATWLLNDVALLSSKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSL FL SRLGDSLLVQF+SG GASTLP G+KEEVGDI+GD+ LAKR+RRSSSDALQD Sbjct: 419 GNSLVFLGSRLGDSLLVQFSSGSGASTLPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDA 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V +ELSLY S PNN+ES K F F VRDSLINVGPLKDFSYGLR+NAD NATGIAKQSN Sbjct: 479 VGSEELSLYGSTPNNSESAQKAFLFAVRDSLINVGPLKDFSYGLRVNADANATGIAKQSN 538 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++R H DSSK+ D Sbjct: 539 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRGHNADSSKLADD 597 >XP_016672502.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Gossypium hirsutum] Length = 1456 Score = 876 bits (2263), Expect = 0.0 Identities = 443/599 (73%), Positives = 499/599 (83%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT IE CASG++T+C +P T+D SDW S + I +PNL Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTNCRADFTPQIPLNHTEDLESDWSSRRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDSVK---GSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAANVL++YVVRV ++ + S+E KRGG+MDGVS SLELVC YRLHGN+ SMA+ Sbjct: 59 IVTAANVLELYVVRVQEEGTREARNSTEVKRGGIMDGVSAVSLELVCSYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIILTF DAKI+VLEFDDS H L TSSMHCFEGPEWLHLKRGRE+F Sbjct: 119 LSIGGGDVSRRRDSIILTFQDAKIAVLEFDDSTHSLQTSSMHCFEGPEWLHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVK DPQGRC+GVLVYGLQMI+LKAAQAG GFVG+D G+G ARVESSYII+L Sbjct: 179 RGPLVKADPQGRCSGVLVYGLQMIVLKAAQAGSGFVGEDDAFGSGATVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL+MKH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDMKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 
WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA+NLPH+AYKLLAVPSPIGGV+VIS N IHYHSQSA+C LALN++A SVD SQE RS Sbjct: 299 WSAANLPHDAYKLLAVPSPIGGVLVISANMIHYHSQSATCALALNSYAASVDNSQELPRS 358 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 +FN+ELDAANATWL NDVA+LS KTGELLLL LVYDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 SFNVELDAANATWLLNDVALLSAKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSL FL SRLGDSLLVQF+SG GASTLP G+KEEVGDI+GD+ LAKR+RRSSSDALQD Sbjct: 419 GNSLVFLGSRLGDSLLVQFSSGSGASTLPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDA 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V +ELSLY S PNN+ES K F F VRDSLINVGPLKDFSYGLRINAD NA GIAKQSN Sbjct: 479 VGSEELSLYGSTPNNSESAQKAFLFAVRDSLINVGPLKDFSYGLRINADANAMGIAKQSN 538 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++R H DSSK+ D Sbjct: 539 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRGHNADSSKLADD 597 >XP_016668425.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Gossypium hirsutum] Length = 1456 Score = 876 bits (2263), Expect = 0.0 Identities = 442/599 (73%), Positives = 499/599 (83%), Gaps = 11/599 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MSYAA+KMMH PT IE CASG++T+C +P +D SDW S + I +PNL Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTNCRADFTPQIPLNHIEDLESDWSSRRG--IGPVPNL 58 Query: 1602 VVTAANVLQVYVVRVSDDSVK---GSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 +VTAANVL++YVVRV ++ + S+E KRGG+MDGVS SLELVC YRLHGN+ SMA+ Sbjct: 59 IVTAANVLELYVVRVQEEGTREARNSTEVKRGGIMDGVSAVSLELVCSYRLHGNVESMAV 118 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIILTF DAKI+VLEFDDS H L TSSMHCFEGPEW HLKRGRE+F Sbjct: 119 LSIGGGDVSRRRDSIILTFQDAKIAVLEFDDSTHSLQTSSMHCFEGPEWFHLKRGRESFA 178 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVK 
DPQGRC+GVLVYGLQMIILKAAQAG GFVG+D G+G ARVESSYII+L Sbjct: 179 RGPLVKADPQGRCSGVLVYGLQMIILKAAQAGSGFVGEDDAFGSGATVSARVESSYIINL 238 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL+MKH+KDF+F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPLI Sbjct: 239 RDLDMKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA+NLPH+AYKLLAVPSPIGGV+V+S N IHYHSQSASC LALN++A SVD SQE RS Sbjct: 299 WSAANLPHDAYKLLAVPSPIGGVLVLSANMIHYHSQSASCALALNSYAASVDNSQELPRS 358 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 +FN+ELDAANATWL NDVA+LS KTGELLLL LVYDGRVVQRLDLSKSKASVLTS ITTI Sbjct: 359 SFNVELDAANATWLLNDVALLSAKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDITTI 418 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSL FL SRLGDSLLVQF+SG+GASTLP G+KEEVGDI+GD+ LAKR+RRSSSDALQD Sbjct: 419 GNSLVFLGSRLGDSLLVQFSSGLGASTLPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDA 478 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 V +ELSLY S PNN+ES K F F VRDSLINVGPLKDFSYGLR+NAD NATGIAKQSN Sbjct: 479 VGSEELSLYGSTPNNSESAQKAFLFAVRDSLINVGPLKDFSYGLRVNADANATGIAKQSN 538 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 YELVCCSGHGKNGALCVL+QSIRPE+IT+ + GCK +WTVYHK++R H DSSK+ D Sbjct: 539 YELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRDHNADSSKLADD 597 >XP_008234350.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 [Prunus mume] Length = 1459 Score = 876 bits (2263), Expect = 0.0 Identities = 446/599 (74%), Positives = 503/599 (83%), Gaps = 12/599 (2%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MS+AA+KMMH PT IE CASG+I+H + IQT+D S+W + + I IP+L Sbjct: 1 MSFAAYKMMHWPTGIENCASGFISHSRSDFVPRILPIQTEDLESEWPTSRRE-IGPIPDL 59 Query: 1602 VVTAANVLQVYVVRVSD-DSVKG---SSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMA 1435 VVTA NVL+VYVVRV + D +G S E KRGG+MDGVSGASLELVCHYRLHGN+ +MA Sbjct: 60 
VVTAGNVLEVYVVRVQEEDGTRGPRASGEPKRGGLMDGVSGASLELVCHYRLHGNVVTMA 119 Query: 1434 ILPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETF 1258 +L SIILTF+DAKISVLEFDDS+HGL TSSMHCFEGPEWLHL+RGRE+F Sbjct: 120 VLSSGGGDGSRRRDSIILTFEDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLRRGRESF 179 Query: 1257 PTGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIIS 1081 GPLVKVDPQGRC +LVYGLQMIILKA+Q G G VGDD + G+GGA AR+ESSYI++ Sbjct: 180 ARGPLVKVDPQGRCGSILVYGLQMIILKASQGGSGLVGDDDSFGSGGAISARIESSYIVN 239 Query: 1080 LRDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPL 901 LRD++MKHVKDF F++GYIEPV+VILHE ELTWAGRVSWKHHTC ISALSIST+LKQHPL Sbjct: 240 LRDMDMKHVKDFTFLHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPL 299 Query: 900 IWSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTR 721 IWSA NLPH+AYKLLAVPSPIGGV+VIS N+IHYHSQSASC LALN++AVS D SQE R Sbjct: 300 IWSAVNLPHDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSYAVSADNSQEVPR 359 Query: 720 SNFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITT 541 S+F +ELDAANATWL NDVA+LSTKTGELLLL LVYDGRVVQRLDLSKSKASVLTSGIT Sbjct: 360 SSFPVELDAANATWLLNDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITK 419 Query: 540 IGNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQD 361 +GNSLFFL SRLGDSLLVQFT GVG S L MK+EVGDI+GD AKR+R SSSDALQD Sbjct: 420 VGNSLFFLGSRLGDSLLVQFTCGVGGSVLSSDMKDEVGDIEGDAPSAKRLRMSSSDALQD 479 Query: 360 MVNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQS 181 MV+G+ELSLY S PNNAES K+FSF VRDSLINVGPLKDFSYGLRINAD NATGIAKQS Sbjct: 480 MVSGEELSLYGSAPNNAESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 539 Query: 180 NYELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTS 4 NYELVCCSGHGKNGALCVL+QSIRPE+IT+ +PGCK +WTVYHK +R H DSSK+ + Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKNARGHNADSSKIAA 598 >XP_015877866.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 [Ziziphus jujuba] Length = 1453 Score = 872 bits (2254), Expect = 0.0 Identities = 439/595 (73%), Positives = 498/595 (83%), Gaps = 11/595 (1%) Frame = -3 Query: 1764 
MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQD-SSDWLSIKPNVITSIPNL 1603 MS+AAFKMMH PT IE CASG+ITH +P IQ D SDW S I IPNL Sbjct: 1 MSFAAFKMMHWPTGIENCASGFITHSRADFVPRIPPIQNDDLDSDW-SASRREIGPIPNL 59 Query: 1602 VVTAANVLQVYVVRVSDDS---VKGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAI 1432 VVTA NVL+VYVVR+ ++S + S E++RGGVMDG+SGASLELVCHYRLHGN+ +MA+ Sbjct: 60 VVTAGNVLEVYVVRIQEESNRSSRASGESRRGGVMDGLSGASLELVCHYRLHGNVETMAV 119 Query: 1431 LPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFP 1255 L SIIL+F DAKISVL+FDDS HGL TSSMHCFEGP+WLHLKRGRE+F Sbjct: 120 LSTGGGESSRRRDSIILSFQDAKISVLDFDDSTHGLRTSSMHCFEGPKWLHLKRGRESFA 179 Query: 1254 TGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISL 1078 GPLVKVDPQGRC GVLVY QMIILKAAQAG G V D+ T +GGA A +ESSYII+L Sbjct: 180 RGPLVKVDPQGRCGGVLVYDFQMIILKAAQAGSGLVVDEDTSSSGGAVSAHIESSYIINL 239 Query: 1077 RDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLI 898 RDL+MKH+KDF+F++GYIEPV+VILHE ELTWAGRV+WKHHTC +SALSIST+LKQHPLI Sbjct: 240 RDLDMKHIKDFIFVHGYIEPVMVILHERELTWAGRVAWKHHTCMVSALSISTTLKQHPLI 299 Query: 897 WSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRS 718 WSA+NLPH+AYKLLAVPSPIGGV+VI N+IHYHSQS SC LALNNFAVSVD SQE RS Sbjct: 300 WSAANLPHDAYKLLAVPSPIGGVLVIGANSIHYHSQSTSCALALNNFAVSVDSSQEMPRS 359 Query: 717 NFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTI 538 +FN+ELDAANATWL NDVA+LSTKTGELLLL +VYDGRVVQRLDLSKSKASVLTSGITTI Sbjct: 360 SFNVELDAANATWLLNDVALLSTKTGELLLLTIVYDGRVVQRLDLSKSKASVLTSGITTI 419 Query: 537 GNSLFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDM 358 GNSLFFL SRLGDSLLVQFT GVG+S + +K+EVGDI+GD AKR+RR SSDA QDM Sbjct: 420 GNSLFFLGSRLGDSLLVQFTCGVGSSIMSSALKDEVGDIEGDAPSAKRLRRLSSDASQDM 479 Query: 357 VNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSN 178 +G+ELSLY S PNN ES K+FSF VRDSLINVGP+KDFSYGLR+NAD NATGIAKQSN Sbjct: 480 ASGEELSLYGSAPNNTESAQKSFSFAVRDSLINVGPIKDFSYGLRVNADTNATGIAKQSN 539 Query: 177 YELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSK 13 YELVCCSGHGKNGALCVL+QSIRPE+IT+ +PGCK +WTVYHK++R H +DS+K 
Sbjct: 540 YELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSTRGHNVDSAK 594 >XP_015965921.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 [Arachis duranensis] Length = 1458 Score = 870 bits (2249), Expect = 0.0 Identities = 444/607 (73%), Positives = 504/607 (83%), Gaps = 19/607 (3%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCNH--------LPAIQTQDSSDWLSIKPNVITS-- 1615 MS+AA+KMMH PT I+ CASG++TH LPA SDW PN T Sbjct: 1 MSFAAYKMMHCPTGIDNCASGFLTHSRADYVPRVPPLPADDLDPDSDW----PNPATRRD 56 Query: 1614 ---IPNLVVTAANVLQVYVVRVSDDSVKG---SSEAKRGGVMDGVSGASLELVCHYRLHG 1453 IPNL++T+ANVL+VY VRV ++S KG ++E+ RGGV DGV+GASLELVCHYRLHG Sbjct: 57 LGPIPNLILTSANVLEVYAVRVHEESAKGPPAAAESSRGGVFDGVTGASLELVCHYRLHG 116 Query: 1452 NIYSMAILPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLK 1276 N+ +MA+L SIILTF DAKISVLE+DDS+HGL TSS+HCFEGPEWLHLK Sbjct: 117 NVEAMAVLSIGAGDGSRRRDSIILTFKDAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLK 176 Query: 1275 RGRETFPTGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVE 1099 RGRE F +GPLVKVDPQGRC GVLVY LQMIILKA QAG G VGDD TLG+GGA AR+E Sbjct: 177 RGREQFASGPLVKVDPQGRCGGVLVYDLQMIILKATQAGSGLVGDDDTLGSGGAVAARIE 236 Query: 1098 SSYIISLRDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTS 919 SSY+I+LRDL+M+HVKDF F++GYIEPV+VILHE ELTWAGR+SWKHHTC ISALSIST+ Sbjct: 237 SSYMINLRDLDMRHVKDFTFVHGYIEPVMVILHECELTWAGRLSWKHHTCMISALSISTT 296 Query: 918 LKQHPLIWSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDG 739 LKQHPLIWSA NLPH+AYKLLAVPSPIGGV+VI NTIHYHSQSASC LALN++AVS+D Sbjct: 297 LKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNSYAVSLDS 356 Query: 738 SQETTRSNFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVL 559 SQE RS FN+ELDAANATWLSNDVA+LSTKTGELLLL LVYDGRVVQRLDLSKSKASVL Sbjct: 357 SQEMPRSTFNVELDAANATWLSNDVALLSTKTGELLLLVLVYDGRVVQRLDLSKSKASVL 416 Query: 558 TSGITTIGNSLFFLASRLGDSLLVQFTSGVGAS-TLPPGMKEEVGDIDGDIHLAKRIRRS 382 +SGITTIGNSLFFLASRLGDS+LVQF+ G G S + +KEEVGDI+ D +KR+RRS Sbjct: 417 SSGITTIGNSLFFLASRLGDSMLVQFSCGSGVSMSSSNNLKEEVGDIEVDAPSSKRLRRS 476 Query: 381 
SSDALQDMVNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNA 202 SDALQD+V+G+ELSLY S PN ES KTFSF VRDSLINVGPLKDFSYGLRINAD NA Sbjct: 477 PSDALQDLVSGEELSLYGSAPNRTESAQKTFSFAVRDSLINVGPLKDFSYGLRINADANA 536 Query: 201 TGIAKQSNYELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTID 22 TGIAKQSNYELVCCSGHGKNG++CVL+QSIRPEVIT+ +PGCK +WTVYHK+SRSH+ D Sbjct: 537 TGIAKQSNYELVCCSGHGKNGSICVLRQSIRPEVITEVELPGCKGIWTVYHKSSRSHSAD 596 Query: 21 SSKMTSD 1 SSKM +D Sbjct: 597 SSKMAND 603 >GAV61868.1 CPSF_A domain-containing protein/MMS1_N domain-containing protein [Cephalotus follicularis] Length = 1449 Score = 870 bits (2248), Expect = 0.0 Identities = 440/593 (74%), Positives = 502/593 (84%), Gaps = 8/593 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHC--NHLPAIQTQD-SSDWLSIKPNVITSIPNLVVT 1594 MS+AA+KMMH PTAIE CASG++THC + +P IQT + S+W + + IPNL+VT Sbjct: 1 MSFAAYKMMHWPTAIENCASGFVTHCRADFVPQIQTDELESEWAPTRG--VAPIPNLIVT 58 Query: 1593 AANVLQVYVVRVSDDSV---KGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAILPC 1423 AANVL++Y VRV ++ + S+++K VMDG+SGASLELVCHYRLHGN+ SMA+L Sbjct: 59 AANVLEIYAVRVQEEGSGDSRISTDSKHAVVMDGLSGASLELVCHYRLHGNVESMAVLLL 118 Query: 1422 XXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFPTGP 1246 SIILTF DAKISVLEFDDS+HGL TSSMHCFEGPEWLHLKRGRE+F GP Sbjct: 119 GGNDGSKKRDSIILTFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFAGGP 178 Query: 1245 LVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISLRDL 1069 VKVDPQGRC GVLVYGLQMIIL++AQAG G VGD+ +G A ARV+SSYII+LRDL Sbjct: 179 SVKVDPQGRCGGVLVYGLQMIILESAQAGSGLVGDEDASSSGVAASARVKSSYIINLRDL 238 Query: 1068 EMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLIWSA 889 EMKHVKDFVF++GYIEPV+V+LHE ELTWAGR+SWKHHTC ISALSIST+LKQHPLIWSA Sbjct: 239 EMKHVKDFVFVHGYIEPVMVVLHERELTWAGRLSWKHHTCMISALSISTTLKQHPLIWSA 298 Query: 888 SNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRSNFN 709 NLPH+AYKLLAVPSPIGGV+V+ NT+HYHSQSASC LALNN+AVSVDG QE RS+F+ Sbjct: 299 INLPHDAYKLLAVPSPIGGVLVVCANTVHYHSQSASCTLALNNYAVSVDGGQELPRSSFS 358 Query: 708 
LELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTIGNS 529 +ELDAA+ATWLSNDVA+LSTKTGELLLL LVYDGRVVQRLDLSKSKASVLTSGITTIGNS Sbjct: 359 VELDAAHATWLSNDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITTIGNS 418 Query: 528 LFFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDMVNG 349 LFFL SRLGDSLLVQFT G GAS L G+KEEVGDI+ D +K++RRS SDALQDMV+G Sbjct: 419 LFFLGSRLGDSLLVQFTCGSGASILSSGLKEEVGDIEDDAPSSKQLRRSPSDALQDMVSG 478 Query: 348 DELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSNYEL 169 +ELSLY S NN ES KTFSF VRDSLIN+GPLKDFSYGLRINAD NATGIAKQSNYEL Sbjct: 479 EELSLYVSDTNNTESAQKTFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 538 Query: 168 VCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKM 10 VCCSGHGKNG+LCVL+QSIRPE+IT+ +PGCK +WTVYHK +R H+ DSSKM Sbjct: 539 VCCSGHGKNGSLCVLRQSIRPEMITEVDLPGCKGIWTVYHKNTRGHSADSSKM 591 >XP_016204143.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1 [Arachis ipaensis] Length = 1458 Score = 868 bits (2244), Expect = 0.0 Identities = 443/607 (72%), Positives = 502/607 (82%), Gaps = 19/607 (3%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCNH--------LPAIQTQDSSDWLSIKPNVITS-- 1615 MS+AA+KMMH PT I+ CASG++TH LPA SDW PN T Sbjct: 1 MSFAAYKMMHCPTGIDNCASGFLTHSRADYVPRVPPLPADDLDPDSDW----PNPATRRD 56 Query: 1614 ---IPNLVVTAANVLQVYVVRVSDDSVKG---SSEAKRGGVMDGVSGASLELVCHYRLHG 1453 IPNL++T+ANVL+VY VRV ++S KG ++E+ RGGV DGV+GASLELVCHYRLHG Sbjct: 57 LGPIPNLILTSANVLEVYAVRVHEESAKGPPAAAESSRGGVFDGVTGASLELVCHYRLHG 116 Query: 1452 NIYSMAILPCXXXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLK 1276 N+ +M +L SIILTF DAKISVLE+DDS+HGL TSS+HCFEGPEWLHLK Sbjct: 117 NVEAMGVLSIGAGDGSRRRDSIILTFKDAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLK 176 Query: 1275 RGRETFPTGPLVKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVE 1099 RGRE F GPLVKVDPQGRC GVLVY LQMIILKA QAG G VGDD TLG+GGA AR+E Sbjct: 177 RGREQFANGPLVKVDPQGRCGGVLVYDLQMIILKATQAGSGLVGDDDTLGSGGAVAARIE 236 Query: 1098 SSYIISLRDLEMKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTS 919 SSY+I+LRDL+M+HVKDF F++GYIEPV+VILHE ELTWAGR+SWKHHTC 
ISALSIST+ Sbjct: 237 SSYMINLRDLDMRHVKDFTFVHGYIEPVMVILHECELTWAGRLSWKHHTCMISALSISTT 296 Query: 918 LKQHPLIWSASNLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDG 739 LKQHPLIWSA NLPH+AYKLLAVPSPIGGV+VI NTIHYHSQSASC LALN++AVS+D Sbjct: 297 LKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNSYAVSLDS 356 Query: 738 SQETTRSNFNLELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVL 559 SQE RS FN+ELDAANATWLSNDVA+LSTKTGELLLL LVYDGRVVQRLDLSKSKASVL Sbjct: 357 SQEMPRSTFNVELDAANATWLSNDVALLSTKTGELLLLVLVYDGRVVQRLDLSKSKASVL 416 Query: 558 TSGITTIGNSLFFLASRLGDSLLVQFTSGVGAS-TLPPGMKEEVGDIDGDIHLAKRIRRS 382 +SGITTIGNSLFFLASRLGDS+LVQF+ G G S + +KEEVGDI+ D +KR+RRS Sbjct: 417 SSGITTIGNSLFFLASRLGDSMLVQFSCGSGVSMSSSNNLKEEVGDIEVDAPSSKRLRRS 476 Query: 381 SSDALQDMVNGDELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNA 202 SDALQD+V+G+ELSLY S PN ES KTFSF VRDSLINVGPLKDFSYGLRINAD NA Sbjct: 477 PSDALQDLVSGEELSLYGSAPNRTESAQKTFSFAVRDSLINVGPLKDFSYGLRINADANA 536 Query: 201 TGIAKQSNYELVCCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTID 22 TGIAKQSNYELVCCSGHGKNG++CVL+QSIRPEVIT+ +PGCK +WTVYHK+SRSH+ D Sbjct: 537 TGIAKQSNYELVCCSGHGKNGSICVLRQSIRPEVITEVELPGCKGIWTVYHKSSRSHSAD 596 Query: 21 SSKMTSD 1 SSKM +D Sbjct: 597 SSKMAND 603 >KCW46268.1 hypothetical protein EUGRSUZ_K00143 [Eucalyptus grandis] Length = 1415 Score = 866 bits (2238), Expect = 0.0 Identities = 438/595 (73%), Positives = 496/595 (83%), Gaps = 7/595 (1%) Frame = -3 Query: 1764 MSYAAFKMMHSPTAIETCASGYITHCN-----HLPAIQTQDSSDWLSIKPNVITSIPNLV 1600 MSYAA+KMMH PT I+ C SG+ITH +P QT D + I +PNLV Sbjct: 1 MSYAAYKMMHWPTGIDNCGSGFITHSPSDFPPRIPPSQTDDLEPDYAPPRREIGPVPNLV 60 Query: 1599 VTAANVLQVYVVRVSDDSVKGSSEAKRGGVMDGVSGASLELVCHYRLHGNIYSMAILPCX 1420 VTAANVL+VYVVRV +D K S E+KRGG MDGVSGASLELVCHYRLHGN+ SMA+L Sbjct: 61 VTAANVLEVYVVRVQEDGDKDSGESKRGGAMDGVSGASLELVCHYRLHGNVESMAVLSTG 120 Query: 1419 XXXXXXXD-SIILTFDDAKISVLEFDDSVHGLHTSSMHCFEGPEWLHLKRGRETFPTGPL 1243 SIILTF DAKIS+LEFDDS+HGL T+SMHCFEGP+WLHLKRGRE+F GPL Sbjct: 121 GGNGSRSRDSIILTFQDAKISILEFDDSIHGLRTTSMHCFEGPDWLHLKRGRESFARGPL 
180 Query: 1242 VKVDPQGRCAGVLVYGLQMIILKAAQAG-GFVGDDSTLGTGGACCARVESSYIISLRDLE 1066 VKVDPQGRC GVLVYGLQMI+LKA+Q G G VGD+ T + GA RVESSYIISLR+LE Sbjct: 181 VKVDPQGRCGGVLVYGLQMIMLKASQVGSGLVGDEDTFESAGAVSIRVESSYIISLRELE 240 Query: 1065 MKHVKDFVFINGYIEPVLVILHEHELTWAGRVSWKHHTCGISALSISTSLKQHPLIWSAS 886 MKHVKDFVF++GYIEPV+VILHE ELTWAGRVSWK+HTC ISALSIST+LKQHPLIWSAS Sbjct: 241 MKHVKDFVFVHGYIEPVMVILHERELTWAGRVSWKNHTCMISALSISTTLKQHPLIWSAS 300 Query: 885 NLPHEAYKLLAVPSPIGGVIVISTNTIHYHSQSASCVLALNNFAVSVDGSQETTRSNFNL 706 NLPH+AYKLLAVPSPIGGV+VIS N IHYHSQSASCVLALN++A S DGSQE +S+F++ Sbjct: 301 NLPHDAYKLLAVPSPIGGVLVISANAIHYHSQSASCVLALNSYASSADGSQEMPKSSFSV 360 Query: 705 ELDAANATWLSNDVAMLSTKTGELLLLKLVYDGRVVQRLDLSKSKASVLTSGITTIGNSL 526 ELDAA+ATWL NDV +LSTKTGELLLL LVYDGRVVQRLDL+KSKASVLTSGITTIGNSL Sbjct: 361 ELDAASATWLLNDVVLLSTKTGELLLLTLVYDGRVVQRLDLAKSKASVLTSGITTIGNSL 420 Query: 525 FFLASRLGDSLLVQFTSGVGASTLPPGMKEEVGDIDGDIHLAKRIRRSSSDALQDMVNGD 346 FFL SRLGDSLLVQ+T G G S G+KEEVGDI+GD LAKR+RRSSSDALQDMV G+ Sbjct: 421 FFLGSRLGDSLLVQYTCGFGTSKPSSGLKEEVGDIEGDAPLAKRLRRSSSDALQDMVGGE 480 Query: 345 ELSLYSSGPNNAESTLKTFSFTVRDSLINVGPLKDFSYGLRINADHNATGIAKQSNYELV 166 ELS++ P+NAES KTFSF VRDSLIN+GPLKDF+YGLRINAD NATG+AKQSNYELV Sbjct: 481 ELSIHGLTPSNAESAQKTFSFAVRDSLINIGPLKDFAYGLRINADANATGVAKQSNYELV 540 Query: 165 CCSGHGKNGALCVLQQSIRPEVITQEPIPGCKRLWTVYHKTSRSHTIDSSKMTSD 1 CCSGHGKNG+LCVL+QS+RPE+IT+ +PGCK +WTVYHK +R +DSSK+ D Sbjct: 541 CCSGHGKNGSLCVLRQSVRPEIITEVELPGCKGIWTVYHKNTRG--LDSSKVGVD 593