BLASTX nr result
ID: Akebia24_contig00012893
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia24_contig00012893 (2417 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation spec... 825 0.0 ref|XP_007041140.1| Cleavage and polyadenylation specificity fac... 823 0.0 ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr... 794 0.0 ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm... 788 0.0 ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation spec... 786 0.0 gb|EXB51974.1| Cleavage and polyadenylation specificity factor C... 783 0.0 ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec... 778 0.0 ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phas... 775 0.0 ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation spec... 775 0.0 ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec... 774 0.0 ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation spec... 744 0.0 ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prun... 729 0.0 ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citr... 727 0.0 ref|XP_002300333.2| zinc finger family protein [Populus trichoca... 722 0.0 ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation spec... 715 0.0 gb|EYU43238.1| hypothetical protein MIMGU_mgv1a002387mg [Mimulus... 711 0.0 ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation spec... 709 0.0 ref|XP_004233145.1| PREDICTED: cleavage and polyadenylation spec... 704 0.0 gb|AHN05783.1| YTH domain-contained RNA binding protein 14 [Malu... 702 0.0 ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation spec... 698 0.0 >ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Vitis vinifera] Length = 673 Score = 825 bits (2131), Expect = 0.0 Identities = 441/714 (61%), Positives = 482/714 (67%), Gaps = 21/714 (2%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108 MED EGVLSFDFEGGLD AP + PL+ +D + AA+ P V ++P PG Sbjct: 1 MEDAEGVLSFDFEGGLDAAPGTAATVAPLIQSDATAAAAA-------PSSVVSAEPTPGG 53 Query: 2107 YPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHTN 1928 PGRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGECREQDCVYKHTN Sbjct: 54 APGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTN 113 Query: 1927 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRFFQNRN 1748 EDIKECNMYKLGFCPNG DCRYRH KLPGPPP +EEVFQKIQ LSSFNYG NRF+QNRN Sbjct: 114 EDIKECNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRFYQNRN 173 Query: 1747 AGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXXXXXQN 1568 Y QQ E+ Q QGSN +N G AK STT E+ N+ N Sbjct: 174 P-YNQQTEKSQILQGSNAVNLGTVAKSSTT-EAINVQQQQVQPPQQQVSQTPMQ-----N 226 Query: 1567 IPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTT 1388 +P+G+PNQ NKTA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+ Sbjct: 227 LPNGLPNQANKTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSV 286 Query: 1387 ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLCELSFH 1208 ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKY+HG+AHYGRNFSVKWLKLCELSFH Sbjct: 287 ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 346 Query: 1207 KTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1028 KTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAIS+ Sbjct: 347 KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKREEEKA 406 Query: 1027 KGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGMLWPPHMTL 851 KGVNPD+G ENPDIVPF S+GQALG A QGRGRGRG++WPPHM L Sbjct: 407 KGVNPDNGGENPDIVPFEDNEEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPPHMPL 466 Query: 850 XXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPPDFTG 671 PVMMGADGF+Y + PDG M P++F + PR FP YGPRF DFT Sbjct: 467 ARGARPIPSMRGFPPVMMGADGFSYSAVPPDGFAM-PDIFGVGPRAFPPYGPRFSGDFT- 524 Query: 670 LGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXXXXXXMV 491 G SGMMF GR QPG +FP ASG GM+ Sbjct: 525 ---------------GPASGMMFPGR-GQPGAVFP-ASGYGMMMGPGRAPFMGGMGVPAA 567 Query: 490 A--------------TPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMAS 353 A PP N NDRYS GSD G+GQ+MA Sbjct: 568 APTRAGRPVGMPPMFPPPPPPN---SQNNRTKRDQRTPVNDRNDRYSGGSDQGRGQDMAG 624 Query: 352 PGDRTTYDETKYQPGGLKAQSK------NIYRNDESESEDEAPRRSRHGEGKKR 209 P D T Y + GLK+Q N +RNDESESEDEAPRRSRHGEGKK+ Sbjct: 625 PDDETQYLQ------GLKSQQDDQFGGGNSFRNDESESEDEAPRRSRHGEGKKK 672 >ref|XP_007041140.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao] gi|508705075|gb|EOX96971.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao] Length = 698 Score = 823 bits (2125), Expect = 0.0 Identities = 441/715 (61%), Positives = 488/715 (68%), Gaps = 22/715 (3%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAA---SGNPALVPGPGQSVISDP- 2120 M+D EG LSFDFEGGLD P+ P+A++P+V +DPS AA S N + VPG + +DP Sbjct: 1 MDDSEGGLSFDFEGGLDAGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTNDPA 60 Query: 2119 --VPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDC 1946 V G GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRL+GECREQDC Sbjct: 61 AAVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120 Query: 1945 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNR 1766 VYKHTNEDIKECNMYKLGFCPNG DCRYRH KLPGPPPPVEEV QKIQ LSS+NY N+ Sbjct: 121 VYKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNY---NK 177 Query: 1765 FFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXX 1586 FFQ RN+G+AQQ E+ Q PQG N +NQG KPSTT ES NM Sbjct: 178 FFQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTT-ESANMHPQQQVQQPQQQVSQTQI 236 Query: 1585 XXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1406 N+P+G NQ NKTA+PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN Sbjct: 237 Q----NVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 292 Query: 1405 EAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKL 1226 EAFD+ ENVILIFSVNRTRHFQGCAKMTSKIGG V GGNWKY+HG+AHYGRNFSVKWLKL Sbjct: 293 EAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKL 352 Query: 1225 CELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXX 1046 CELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAISV Sbjct: 353 CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAELK 412 Query: 1045 XXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRGRGMLWP 866 KGVN D+G ENPDIVPF S+ A QGRGRGRG++WP Sbjct: 413 REEEKAKGVNSDNGGENPDIVPFEDNEEEEEEESEEEDESFS---AAAQGRGRGRGVMWP 469 Query: 865 PHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFP 686 PHM L P+MMG DGF+YG +TPDG +P +LF APR FP YGPRF Sbjct: 470 PHMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVP-DLFG-APRPFPPYGPRFS 527 Query: 685 PDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXXX 506 DFTG SGMMF GRP QPG +FPA GLGM+ Sbjct: 528 GDFTGPA----------------SGMMFPGRPPQPGAMFPAG-GLGMMMGPGRAPFMGGM 570 Query: 505 XXXM---------VATPPVHANXXXXXXXXXXXXXXXXXXP-TNDRYSYGSDPGKGQEMA 356 V+ PP+ TNDRY GS+ G+GQEMA Sbjct: 571 GPTGANPVRGGRPVSMPPMFPPPPAPSSQNSGRAVKRDQRTPTNDRYGAGSEQGRGQEMA 630 Query: 355 SPGDRTTYDETKYQPGGLKAQSK------NIYRNDESESEDEAPRRSRHGEGKKR 209 PG R DET+YQ G KA + N +RNDESESEDEAPRRSR+GEGKK+ Sbjct: 631 GPGGRLD-DETQYQQEGQKAHHEDQFAAGNSFRNDESESEDEAPRRSRYGEGKKK 684 >ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] gi|557551535|gb|ESR62164.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] Length = 701 Score = 794 bits (2051), Expect = 0.0 Identities = 430/716 (60%), Positives = 481/716 (67%), Gaps = 23/716 (3%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASG-----NPALVPGPGQSV--I 2129 MED EG LSFDFEGGLD P P+A+ P + +D + AA+ N A + G + Sbjct: 1 MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60 Query: 2128 SDPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQD 1949 S PVP ++ GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRL+GECREQD Sbjct: 61 SAPVP-HHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQD 119 Query: 1948 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPN 1769 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPP VEEV QKIQ +SS+N+G PN Sbjct: 120 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPN 179 Query: 1768 RFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXX 1589 + FQ R A ++ QI++ QF QG N +NQG A K S+TAES N+ Sbjct: 180 KLFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGK-SSTAESANVHQQQLVQQPQQQGTQTT 237 Query: 1588 XXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1409 N+P+G+PNQ N+ A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKL Sbjct: 238 QMQ---NLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 294 Query: 1408 NEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLK 1229 NEAFD+ ENVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKY+HG+AHYGRNFSVKWLK Sbjct: 295 NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLK 354 Query: 1228 LCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXX 1049 LCELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLA+LLYLEPDSELMAISV Sbjct: 355 LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEA 414 Query: 1048 XXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRGRGMLW 869 KGVNPD+G +NPDIVPF S G A+QGRGRGRGM+W Sbjct: 415 KREEEKAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEEESLGT---ASQGRGRGRGMMW 471 Query: 868 PPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRF 689 P M L P+M+GADGF+YG +TPDG PM P+LF +APR F YGPRF Sbjct: 472 PGPMPLARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPM-PDLFGVAPRPFAPYGPRF 529 Query: 688 PPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMV--------TXX 533 DFTG G GMMF GRP QPG +FP GM+ Sbjct: 530 SGDFTGPG-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGG 572 Query: 532 XXXXXXXXXXXXMVATPPVHAN---XXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQE 362 V PP N NDRYS GSD G+ QE Sbjct: 573 MGPAATNPRGGRPVGVPPPFPNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQE 632 Query: 361 MASPGDRTTYDETKYQPGGLKAQSKNIY-----RNDESESEDEAPRRSRHGEGKKR 209 M PG R DE +YQ G KA ++ Y RNDESESEDEAPRRSRHGEGKK+ Sbjct: 633 MGGPG-RGPDDEVQYQQEGSKANQEDQYGSRNFRNDESESEDEAPRRSRHGEGKKK 687 >ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis] gi|223537608|gb|EEF39232.1| conserved hypothetical protein [Ricinus communis] Length = 702 Score = 788 bits (2035), Expect = 0.0 Identities = 424/719 (58%), Positives = 483/719 (67%), Gaps = 26/719 (3%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTA-PSNPSAAVPLVPTD--PSIAASGNPALVPGPGQSVISDPV 2117 M+D +G LSFDFEGGLD++ P+NP+A++P +P+D ++AA+ N ++VP + DP Sbjct: 1 MDDTDGGLSFDFEGGLDSSGPTNPTASIPAIPSDNTAAVAAATNNSIVPNVSSN---DPA 57 Query: 2116 PG------NYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECRE 1955 N GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGECRE Sbjct: 58 SAAAAAANNQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECRE 117 Query: 1954 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGF 1775 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ L+S+NYG Sbjct: 118 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGS 177 Query: 1774 PNRFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXX 1595 N+FFQ R AG+ Q ++ QF QG N + QG+AAKP T ES N+ Sbjct: 178 SNKFFQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGT-ESANVQQPQQQQPQPGQGQQ 236 Query: 1594 XXXXXXXQ---NIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRS 1424 N+P+G PNQ N+TA+PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRS Sbjct: 237 SQQQATQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRS 296 Query: 1423 NEAKLNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFS 1244 NEAKLNEAFD+ ENVILIFSVNRTRHFQGCAKMTSKIG VGGGNWKY+HG+AHYGRNFS Sbjct: 297 NEAKLNEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFS 356 Query: 1243 VKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAIS 1064 VKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+G QLA LLY EPDSELMAIS Sbjct: 357 VKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAIS 416 Query: 1063 VXXXXXXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGAT-QGRGR 887 + KGVNP++G +NPDIVPF S+GQALGA QGRGR Sbjct: 417 LAAEAKREEEKAKGVNPENGGDNPDIVPFEDNEEEEEEESEEEEESFGQALGAPGQGRGR 476 Query: 886 GRGMLWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFP 707 GRG++W PHM L P+MMGAD F+YG +TPDG M P+LF +APR F Sbjct: 477 GRGIIW-PHMPLARGARPIPGMRGFPPMMMGADSFSYGPVTPDGFGM-PDLFGVAPRGFT 534 Query: 706 SYGPRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXX 527 Y PRF DFT G SGMMF GRP QPG +FP G GM+ Sbjct: 535 PYAPRFSGDFT----------------GAASGMMFPGRPPQPGGVFP-NGGFGMMMGPGR 577 Query: 526 XXXXXXXXXXMVATPPVHAN-------XXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKG 368 +T P+ N NDRYS GSD +G Sbjct: 578 APFMGGMGPN--STNPLRGNWPGGMPFPPLPTPSPQRPVKRDQRMTANDRYSTGSD--QG 633 Query: 367 QEMASPGDRTTYDETKYQPGGLKAQSK------NIYRNDESESEDEAPRRSRHGEGKKR 209 + A D DE +YQ GLKA + N +RNDESESEDEAPRRSRHGEGKK+ Sbjct: 634 RNTAGEPD----DEARYQQEGLKASHEDQFGAGNSFRNDESESEDEAPRRSRHGEGKKK 688 >ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Citrus sinensis] Length = 683 Score = 786 bits (2029), Expect = 0.0 Identities = 426/709 (60%), Positives = 472/709 (66%), Gaps = 16/709 (2%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108 MED EG LSFDFEGGLD P P+A+ P S AA + S PVP + Sbjct: 1 MEDSEGGLSFDFEGGLDAGPGMPTASNPAAAPSSSGAAPDHA-----------SAPVP-H 48 Query: 2107 YPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHTN 1928 + GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRL+GECREQDCVYKHTN Sbjct: 49 HSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCVYKHTN 108 Query: 1927 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRFFQNRN 1748 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPP VEEV QKIQ +SS+N+G PN+ FQ R Sbjct: 109 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNKHFQQRG 168 Query: 1747 AGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXXXXXQN 1568 A ++ Q ++ QF QG N +NQG A K S+TAES N+ N Sbjct: 169 A-FSHQTDKSQFSQGPNAVNQGAAGK-SSTAESANVHQQQLVQQPQQQGTQTTQMQ---N 223 Query: 1567 IPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTT 1388 +P+G+PNQ N+ A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+ Sbjct: 224 LPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSA 283 Query: 1387 ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLCELSFH 1208 ENVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKY+HG+AHYGRNFSVKWLKLCELSFH Sbjct: 284 ENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 343 Query: 1207 KTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1028 KTRHLRNP+NENLPVKISRDCQELE SIGEQLA+LLYLEPDSELMAISV Sbjct: 344 KTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEEKA 403 Query: 1027 KGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRGRGMLWPPHMTLX 848 KGVNPD+G +NPDIVPF S G A+QGRGRGRGM+WP M L Sbjct: 404 KGVNPDNGGDNPDIVPFEDNEEEEEEESEEEEESLGT---ASQGRGRGRGMMWPGPMPLA 460 Query: 847 XXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPPDFTGL 668 P+M+GADGF+YG +TPDG PM P+LF +APR F YGPRF DFTG Sbjct: 461 RGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPM-PDLFGVAPRPFAPYGPRFSGDFTGP 518 Query: 667 GQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMV--------TXXXXXXXXX 512 G GMMF GRP QPG +FP GM+ Sbjct: 519 G-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATN 561 Query: 511 XXXXXMVATPPVHAN---XXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMASPGDR 341 V PP N NDRYS GSD G+ QEM PG R Sbjct: 562 PRGGRPVGVPPPFPNQPQSSQNSSRAAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPG-R 620 Query: 340 TTYDETKYQPGGLKAQSKNIY-----RNDESESEDEAPRRSRHGEGKKR 209 DE +YQ G KA ++ Y RNDESESEDEAPRRSRHGEGKK+ Sbjct: 621 GPDDEVQYQQEGSKANQEDQYGSRNFRNDESESEDEAPRRSRHGEGKKK 669 >gb|EXB51974.1| Cleavage and polyadenylation specificity factor CPSF30 [Morus notabilis] Length = 710 Score = 783 bits (2021), Expect = 0.0 Identities = 425/722 (58%), Positives = 473/722 (65%), Gaps = 29/722 (4%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTA-----PSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISD 2123 MED EGVLSFDFEGGLDT P+ +A+ L+ D S AA+ N + +V +D Sbjct: 1 MEDSEGVLSFDFEGGLDTTAGGCPPNAAAASAALIHPDSSAAAASNN--LAASNSAVSAD 58 Query: 2122 PVPG-----NYPGR-RSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGEC 1961 P G + PGR RSFRQTVCRHWLR LCMKGEACGFLHQYDKSRMP+CRFFRLYGEC Sbjct: 59 PTSGGGGGASNPGRGRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGEC 118 Query: 1960 REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNY 1781 REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPP VEEV QKIQ+LSS+NY Sbjct: 119 REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQHLSSYNY 178 Query: 1780 GFPNRFFQNRNAG-YAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXX 1604 N+FFQ RNAG +AQ E+P P G N ++QGV KPS ES N+ Sbjct: 179 -HSNKFFQQRNAGGFAQLGEKPLLPLGPNAVSQGVVGKPSIL-ESANVQQPQQQVQPSQQ 236 Query: 1603 XXXXXXXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRS 1424 N+ G+PNQ N+T PLP G+SRYFIVKSCNRENLELSVQQGVWATQRS Sbjct: 237 PVGQNQIQ---NVFTGLPNQANRTVAPLPPGISRYFIVKSCNRENLELSVQQGVWATQRS 293 Query: 1423 NEAKLNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFS 1244 NEAKLNEAFD ENVILIFSVNRTRHFQGCAKM S+IGG + GGNWKY+HG+AHYGRNFS Sbjct: 294 NEAKLNEAFDCAENVILIFSVNRTRHFQGCAKMISRIGGSISGGNWKYAHGTAHYGRNFS 353 Query: 1243 VKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAIS 1064 VKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAIS Sbjct: 354 VKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS 413 Query: 1063 VXXXXXXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRG 884 + KGV+PD+G ENPDIVPF S+ Q LGA QGRGRG Sbjct: 414 LAAESKREEEKAKGVDPDNGGENPDIVPFEDNEEDEEEESEDEEESFSQVLGANQGRGRG 473 Query: 883 RGMLWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPS 704 RG++WPPHM L PVM+GADG YG +TPDG PM P+LFN+ PR F Sbjct: 474 RGVMWPPHMPLSRGARPMPSMQGFPPVMIGADGSPYGPVTPDGFPM-PDLFNVGPRAFNP 532 Query: 703 YGPRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXX 524 YGPRFP DF G TSGMMF GRP+QPG +FP G GM+ Sbjct: 533 YGPRFPGDF----------------MGPTSGMMFRGRPTQPGAVFP-GGGFGMMMGPGRA 575 Query: 523 XXXXXXXXXMV---------ATPPVHANXXXXXXXXXXXXXXXXXXPTND---RYSYGSD 380 A PP+ ND RY GSD Sbjct: 576 PCMGGMGVQGTSPARPMRPGAMPPMFQQPPPPSQNMNRPPRRDQRGLANDRNERYGAGSD 635 Query: 379 PGKGQEMASPGDRTTYDETKYQPGGLKAQ-----SKNIYRNDESESEDEAPRRSRHGEGK 215 +GQEM+ P D+ YQ G Q + N +RNDESESEDEAPRRSRHG+GK Sbjct: 636 QVRGQEMSGPAGGPE-DDAHYQLGAKARQEDQYGAGNSFRNDESESEDEAPRRSRHGDGK 694 Query: 214 KR 209 K+ Sbjct: 695 KK 696 >ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Glycine max] Length = 691 Score = 778 bits (2009), Expect = 0.0 Identities = 427/719 (59%), Positives = 473/719 (65%), Gaps = 26/719 (3%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVP---LVPTDPSIAAS-----GNPALVPGPGQSV 2132 MED EGVLSFDFEGGLD APS+ +AAVP LV D S AAS G+ A P Sbjct: 1 MEDSEGVLSFDFEGGLDAAPSSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPST---- 56 Query: 2131 ISDPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQ 1952 +DP GN PGRRSFRQTVCRHWLR LCMKG+ACGFLHQYDK+RMP+CRFFRLYGECREQ Sbjct: 57 -ADPAGGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQ 115 Query: 1951 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFP 1772 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRH K PGPPPPVEEV QKIQ+L S+NY Sbjct: 116 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNYNSS 175 Query: 1771 NRFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXX 1592 N+FFQ R A Y QQ E+PQ PQG+N+ NQGV KP AES N Sbjct: 176 NKFFQQRGASYNQQAEKPQLPQGTNSTNQGVTGKP-LPAESGNAQPQQQVQQSQQQVNQS 234 Query: 1591 XXXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAK 1412 N+ +G PNQ N+TA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+K Sbjct: 235 QMQ----NVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESK 290 Query: 1411 LNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWL 1232 LNEAFD+ ENVIL+FSVNRTRHFQGCAKMTS+IGG V GGNWKY+HG+AHYGRNFSVKWL Sbjct: 291 LNEAFDSVENVILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWL 350 Query: 1231 KLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXX 1052 KLCELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAISV Sbjct: 351 KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAE 410 Query: 1051 XXXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGM 875 KGVNPD+G ENPDIVPF S+ +G A QGRGRGRGM Sbjct: 411 SKREEEKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFSHGVGPAGQGRGRGRGM 470 Query: 874 LWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITP---DGLPMPPELFNMAPRVFPS 704 +WPPHM L PVMMG DG +YG + P DG MP +LF + PR F Sbjct: 471 MWPPHMPLGRGARPMPGMQGFNPVMMG-DGLSYGPVGPVGPDGFGMP-DLFGVGPRGFAP 528 Query: 703 YGPRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXX 524 YGPRF DF G + MMF GRPSQPG +FP + G GM+ Sbjct: 529 YGPRFSGDF----------------GGPPAAMMFRGRPSQPG-MFP-SGGFGMMMNPGRG 570 Query: 523 XXXXXXXXXMVATP----PVH------ANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPG 374 P PV+ NDR+ GS+ G Sbjct: 571 PFMGGMGVGGANPPRGGRPVNMPPMFPPPPPLPQNANRAAKRDQRTADRNDRFGSGSEQG 630 Query: 373 KGQEMASPGDRTTYDETKYQPGGLKAQ----SKNIYRNDESESEDEAPRRSRHGEGKKR 209 K Q+M S D+ +YQ G Q + N +RND+SESEDEAPRRSRHGEGKK+ Sbjct: 631 KSQDMLSQSGGPD-DDAQYQQGYKGNQDDHPAVNNFRNDDSESEDEAPRRSRHGEGKKK 688 >ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris] gi|561020727|gb|ESW19498.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris] Length = 697 Score = 775 bits (2002), Expect = 0.0 Identities = 422/714 (59%), Positives = 469/714 (65%), Gaps = 21/714 (2%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSA-AVPLVPTDPSIAAS-----GNPALVPGPGQSVIS 2126 MED EGVLSFDFEGGLDTAPS +A + PLV D S AAS G PA P + Sbjct: 1 MEDSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSG-----T 55 Query: 2125 DPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDC 1946 +P N PGRRSFRQTVCRHWLR LCMKG+ACGFLHQYDK+RMP+CRFFRLYGECREQDC Sbjct: 56 EPAAVNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDC 115 Query: 1945 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNR 1766 VYKHTNEDIKECNMYKLGFCPNGPDCRYRH K PGPPPPVEEV QKIQ+L S+NY N+ Sbjct: 116 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNK 175 Query: 1765 FFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXX 1586 FFQ R + Y QQ E+ Q PQG+N+ NQGV KP AES N Sbjct: 176 FFQQRGSSYTQQAEKSQLPQGTNSTNQGVTGKP-LPAESGNAQPQQQVQQSQQQQVSQNQ 234 Query: 1585 XXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1406 N+ +G PNQ ++ A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLN Sbjct: 235 IQ---NVANGQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLN 291 Query: 1405 EAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKL 1226 EAFD+ ENVILIFSVNRTRHFQGCAKMTS+IGG V GGNWKY+HG+AHYGRNFSVKWLKL Sbjct: 292 EAFDSVENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKL 351 Query: 1225 CELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXX 1046 CELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPD ELMA+SV Sbjct: 352 CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESK 411 Query: 1045 XXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGMLW 869 KGVNPD+G ENPDIVPF S+G +G A QGRGRGRGM+W Sbjct: 412 REEEKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEDESFGHGVGPAGQGRGRGRGMMW 471 Query: 868 PPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRF 689 PPHM L PVMMG DG +YG + PDG MP +LF++ PR F YGPRF Sbjct: 472 PPHMPLPRGARPMPGMQGFNPVMMG-DGLSYGPVAPDGFGMP-DLFSVGPRAFAPYGPRF 529 Query: 688 PPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXX 509 DF G + MMF GRPSQPG +FP G GM+ Sbjct: 530 SGDF----------------GGPPAAMMFRGRPSQPG-MFP-GGGFGMMMNPGRGPFMGG 571 Query: 508 XXXXMVATP----PVH------ANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEM 359 P PV+ NDRY GS+ GK Q+M Sbjct: 572 MGVAGANPPRGGRPVNMPPMFPPPPPLPQNTNRLAKRDQRTTDRNDRYGSGSEQGKSQDM 631 Query: 358 ASPGDRTTYDETKYQPGGLKAQ----SKNIYRNDESESEDEAPRRSRHGEGKKR 209 S D+ +YQ G Q + N +RND+SESEDEAPRRSRHGEGKK+ Sbjct: 632 LSQSGAPD-DDMQYQQGYKANQDDHPAVNNFRNDDSESEDEAPRRSRHGEGKKK 684 >ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Cicer arietinum] Length = 677 Score = 775 bits (2001), Expect = 0.0 Identities = 422/706 (59%), Positives = 466/706 (66%), Gaps = 11/706 (1%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108 MED EGVLSFDFEGGLD AP PSAA VP PS + +P S + PV GN Sbjct: 1 MEDSEGVLSFDFEGGLDAAP--PSAATVSVPAPPSGPIVHPDSSLPPSISSNGAAPVSGN 58 Query: 2107 YPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHTN 1928 PGRRSFRQTVCRHWLR LCMKGEACGFLHQYDK+RMP+CRFFRLYGECREQDCVYKHTN Sbjct: 59 IPGRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHTN 118 Query: 1927 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRFFQNRN 1748 EDIKECNMYKLGFCPNGPDCRYRH K PGPPPP+EEV QKIQ+L S+N+ ++F Q R Sbjct: 119 EDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSYNFNNSHKFIQQRG 178 Query: 1747 AGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXXXXXQN 1568 + Y QQ+E+ QFPQG N+ NQGVA KP AES N+ N Sbjct: 179 SSYTQQVEKSQFPQGINSANQGVAGKP-LAAESGNVQQQQQVQQSQQQVSQIQTQ----N 233 Query: 1567 IPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTT 1388 + +G PNQ N+TA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFD+ Sbjct: 234 LANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSV 293 Query: 1387 ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLCELSFH 1208 ENVILIFSVNRTRHFQGCAKMTS+IGG V GGNWKY+HG+AHYGRNFSVKWLKLCELSFH Sbjct: 294 ENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 353 Query: 1207 KTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1028 KTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAIS+ Sbjct: 354 KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKA 413 Query: 1027 KGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQA-LGATQGRGRGRGMLWPPHMTL 851 KGVNPD+ ENPDIVPF S+ QA + QGRGRGRGM+WPPHM L Sbjct: 414 KGVNPDNAGENPDIVPFEDNEEEEEEESDEEEESFVQAVVPVGQGRGRGRGMMWPPHMPL 473 Query: 850 XXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPPDFTG 671 PVMMG DG +YG PDG M P+LF M PR F YGPRF DF Sbjct: 474 GRGARPMPGMQGFNPVMMG-DGLSYGPGAPDGFGM-PDLFGMGPRGFGPYGPRFSGDF-- 529 Query: 670 LGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVT---------XXXXXXX 518 AG + MMF GRPSQPG +FP G GM+ Sbjct: 530 --------------AGPPAAMMFRGRPSQPG-MFP-GGGFGMMMNPGRGPFMGGMGVPGP 573 Query: 517 XXXXXXXMVATPPVH-ANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMASPGDR 341 + PP+ NDRYS G + GK Q+M S Sbjct: 574 NPPRGGRPLNMPPMFPPPPPPPQNVNRIAKRDQRTNDRNDRYSSGQEQGKSQDMLSQSGG 633 Query: 340 TTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR*G 203 DE +YQ G A N +RN++SESEDEAPRRSRHGEGKKR G Sbjct: 634 PD-DEMQYQQSGAPA---NNFRNEDSESEDEAPRRSRHGEGKKRKG 675 >ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Glycine max] Length = 681 Score = 774 bits (1998), Expect = 0.0 Identities = 422/710 (59%), Positives = 464/710 (65%), Gaps = 17/710 (2%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAV--PLVPTDPSIAAS----GNPALVPGPGQSVIS 2126 MED EGVLSFDFEGGLD APS+ +AA PL+P D S AAS G PA P S + Sbjct: 1 MEDSEGVLSFDFEGGLDAAPSSAAAAPSGPLIPHDSSAAASAVSNGGPA---APAPSAVD 57 Query: 2125 DPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDC 1946 GN PGRRSFRQTVCRHWLR LCMKG+ACGFLHQYDK+RMP+CRFFRLYGECREQDC Sbjct: 58 PVGGGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDC 117 Query: 1945 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNR 1766 VYKHTNEDIKECNMYKLGFCPNGPDCRYRH K PGPPPPVEEV QKIQ+L S+NY N+ Sbjct: 118 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNK 177 Query: 1765 FFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXX 1586 FFQ R A Y QQ E+P PQG+N+ NQGV P P Sbjct: 178 FFQQRGASYNQQAEKPLLPQGNNSTNQGVTGNPL-----PAELGNAQPQQQVQQSQQQVN 232 Query: 1585 XXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1406 QN+ +G PNQ N+TA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLN Sbjct: 233 QSQMQNVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLN 292 Query: 1405 EAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKL 1226 EAFD+ ENVILIFSVNRTRHFQGCAKMTSKIGG V GGNWKY+HG+AHYGRNFSVKWLKL Sbjct: 293 EAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKL 352 Query: 1225 CELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXX 1046 CELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAISV Sbjct: 353 CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESK 412 Query: 1045 XXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGMLW 869 KGVNPD+G ENPDIVPF S+G +G A QGRGRGRGM+W Sbjct: 413 REEEKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPAGQGRGRGRGMMW 472 Query: 868 PPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRF 689 PPHM L PVMMG DG +YG + PDG MP +LF + PR F YGPRF Sbjct: 473 PPHMPLGRGARPMPGMQGFNPVMMG-DGLSYGPVGPDGFGMP-DLFGVGPRGFAPYGPRF 530 Query: 688 PPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXX 509 DF G + MMF GRPSQPG +FP G GM+ Sbjct: 531 SGDF----------------GGPPAAMMFRGRPSQPG-MFP-GGGFGMMLNPGRGPFMGG 572 Query: 508 XXXXMVATP----PVH------ANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEM 359 P PV+ NDR+ GS+ GK Q+M Sbjct: 573 IGVGGANPPRGGRPVNMPPMFPPPPPLPQNANRAAKRDQRTADRNDRFGSGSEQGKSQDM 632 Query: 358 ASPGDRTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209 S D+ +YQ G Q + D+SESEDEAPRRSRHGEGKK+ Sbjct: 633 LSQSGGPD-DDPQYQQGYKGNQDDH---PDDSESEDEAPRRSRHGEGKKK 678 >ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Cucumis sativus] Length = 707 Score = 744 bits (1921), Expect = 0.0 Identities = 409/722 (56%), Positives = 461/722 (63%), Gaps = 29/722 (4%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSA--AVPLVPTD----PSIAASGNPALVPGPGQSVIS 2126 MED EGVLSFDFEGGLD P+NP+A ++P++ +D P+ +A NP L G +V + Sbjct: 1 MEDSEGVLSFDFEGGLDAGPTNPAATSSLPIINSDSSAPPAASAVSNP-LSGALGPAVSA 59 Query: 2125 DPVP---GNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECRE 1955 +P GN RRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMPICRFFRLYGECRE Sbjct: 60 EPTGAPHGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECRE 119 Query: 1954 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGF 1775 QDCVYKHTNEDIKECNMYK GFCPNGPDCRYRH KLPGPPPP+EE+ QKIQ+L S+NYG Sbjct: 120 QDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPLEEILQKIQHLGSYNYGP 179 Query: 1774 PNRFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXX 1595 N+FF R G +QQ E+ QFPQ + QGV KPS AES N+ Sbjct: 180 SNKFFTQRGVGLSQQNEKSQFPQVPALVTQGVTGKPSA-AESVNVQQQQGQQSAPQASQT 238 Query: 1594 XXXXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEA 1415 ++ +G PNQ N+ A LPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEA Sbjct: 239 PVQ-----SLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEA 293 Query: 1414 KLNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKW 1235 KLNEAFD+ +NVILIFSVNRTRHFQGCAKM S+IGG V GGNWKY+HG+ HYG+NFS+KW Sbjct: 294 KLNEAFDSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTPHYGQNFSLKW 353 Query: 1234 LKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXX 1055 LKLCELSF KTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPD ELMA+SV Sbjct: 354 LKLCELSFQKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDGELMAVSVAA 413 Query: 1054 XXXXXXXXXKGVNPDDGVENPDIVPF-XXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGR 881 KGVNPD G ENPDIVPF S+GQ+ G QGRGRGR Sbjct: 414 ESKREEEKAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSAGLPPQGRGRGR 473 Query: 880 GMLWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSY 701 GM+WPPHM + P MMG DG +YG +TPDG PM P++F M PR F Y Sbjct: 474 GMMWPPHMPMGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPM-PDIFGMTPRGFGPY 532 Query: 700 G--PRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXX 527 G PRF DF G + MMF GRPSQP +FP SG GM+ Sbjct: 533 GPTPRFSGDF----------------MGPPTAMMFRGRPSQPAAMFP-PSGFGMMMGQGR 575 Query: 526 XXXXXXXXXXMV----------ATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDP 377 +P TNDRY G D Sbjct: 576 GPFMGGMGVAGANPARPGRPVGVSPLYPPPAVPSSQNMNRAIKRDQRGLTNDRYIVGMDQ 635 Query: 376 GKGQEMASPGDRTTYDETKYQPGGLKAQSKNIY------RNDESESEDEAPRRSRHGEGK 215 KG E+ S G DE G KA S Y RN+ESESEDEAPRRSRHGEGK Sbjct: 636 NKGVEIQSSG----RDEEMQYKQGSKAYSDEQYGTGTTFRNEESESEDEAPRRSRHGEGK 691 Query: 214 KR 209 K+ Sbjct: 692 KK 693 >ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica] gi|462410040|gb|EMJ15374.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica] Length = 695 Score = 729 bits (1881), Expect = 0.0 Identities = 410/720 (56%), Positives = 459/720 (63%), Gaps = 27/720 (3%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLD-TAPSNPSAAVP----LVPTDPSIAA-SGNPALV-PGPGQSVI 2129 MED +G ++FDFEGGLD TA + P+ P L+ +D +AA NPA P P Sbjct: 1 MEDSDGDINFDFEGGLDATAAAGPTNPGPPSNSLMQSDSGVAAVDTNPAAAAPQPNH--- 57 Query: 2128 SDPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQD 1949 P P N G RS+RQTVCRHWLR LCMKGEACGFLHQYDKSRMP+CRFFRLYGECREQD Sbjct: 58 --PNP-NRSGGRSYRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECREQD 114 Query: 1948 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPN 1769 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ+L+S+NY N Sbjct: 115 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNTSN 174 Query: 1768 RFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXX 1589 +F+Q RNAG+ QQ ++ Q QG N++ QGV KPST ES N+ Sbjct: 175 KFYQQRNAGFPQQADKYQSAQGPNSVYQGVVGKPST-GESANVHQQQQVQQTQQQVGHTQ 233 Query: 1588 XXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1409 N+P+G+ NQ N++A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KL Sbjct: 234 TQ----NLPNGLANQANRSA-PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKL 288 Query: 1408 NEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLK 1229 NEAFD+ ENVILIFSVNRTRHFQGCAKM S+IGG V GGNWKY+HGSAHYGRNFSVKWLK Sbjct: 289 NEAFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGSAHYGRNFSVKWLK 348 Query: 1228 LCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXX 1049 LCELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMA+S+ Sbjct: 349 LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSIAAES 408 Query: 1048 XXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGR-GM 875 KGVNP++G ENPDIVPF S+G G +GRGRGR G+ Sbjct: 409 KREEEKAKGVNPENGGENPDIVPFEDNEEEEEEESDDEEESFGPVPGVGNEGRGRGRGGI 468 Query: 874 LWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGP 695 +WPPHM L P MMGAD YG PDG M P F + PR F YGP Sbjct: 469 MWPPHMPLARGGRPMPGMQGFPPGMMGADAMPYGP-APDGFGM-PNPFGVGPRGFNPYGP 526 Query: 694 RFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGM---------- 545 RF DFT G T GMMF GRP QPG FP G GM Sbjct: 527 RFSGDFT----------------GPTPGMMFRGRPQQPG--FP-PGGYGMMMGPGRAPFM 567 Query: 544 ----VTXXXXXXXXXXXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDP 377 V + PP N N+RYS GS Sbjct: 568 GGMGVGGANPGRPGRPTGMSPMFPPPSSQN----TNRMQKRDPRGPSNDRNERYSAGSGQ 623 Query: 376 GKGQEM----ASPGDRTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209 GKGQE+ P D Y + + + N RND+SESEDEAPRRSRHGEGKK+ Sbjct: 624 GKGQEIPGLAGGPDDEARYQQASKAYREDQYGAGNNSRNDDSESEDEAPRRSRHGEGKKK 683 >ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] gi|557551536|gb|ESR62165.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] Length = 672 Score = 727 bits (1877), Expect = 0.0 Identities = 403/716 (56%), Positives = 453/716 (63%), Gaps = 23/716 (3%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASG-----NPALVPGPGQSV--I 2129 MED EG LSFDFEGGLD P P+A+ P + +D + AA+ N A + G + Sbjct: 1 MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60 Query: 2128 SDPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQD 1949 S PVP ++ GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRL+GECREQD Sbjct: 61 SAPVP-HHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQD 119 Query: 1948 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPN 1769 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPP VEEV QKIQ +SS+N+G PN Sbjct: 120 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPN 179 Query: 1768 RFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXX 1589 + FQ R A ++ QI++ QF QG N +NQG A K S+TAES N+ Sbjct: 180 KLFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGK-SSTAESANVHQQQLVQQPQQQGTQTT 237 Query: 1588 XXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1409 N+P+G+PNQ N+ A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKL Sbjct: 238 QMQ---NLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 294 Query: 1408 NEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLK 1229 NEAFD+ ENVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKY+HG+AHYGRNFSVKWLK Sbjct: 295 NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLK 354 Query: 1228 LCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXX 1049 LCELSFHKTRHLRNP+NENLPVK AISV Sbjct: 355 LCELSFHKTRHLRNPYNENLPVK-----------------------------AISVAAEA 385 Query: 1048 XXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRGRGMLW 869 KGVNPD+G +NPDIVPF S G A +QGRGRGRGM+W Sbjct: 386 KREEEKAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEEESLGTA---SQGRGRGRGMMW 442 Query: 868 PPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRF 689 P M L P+M+GADGF+YG +TPDG PM P+LF +APR F YGPRF Sbjct: 443 PGPMPLARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPM-PDLFGVAPRPFAPYGPRF 500 Query: 688 PPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMV--------TXX 533 DFTG G GMMF GRP QPG +FP GM+ Sbjct: 501 SGDFTGPG-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGG 543 Query: 532 XXXXXXXXXXXXMVATPPVHAN---XXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQE 362 V PP N NDRYS GSD G+ QE Sbjct: 544 MGPAATNPRGGRPVGVPPPFPNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQE 603 Query: 361 MASPGDRTTYDETKYQPGGLKAQSKNIY-----RNDESESEDEAPRRSRHGEGKKR 209 M PG R DE +YQ G KA ++ Y RNDESESEDEAPRRSRHGEGKK+ Sbjct: 604 MGGPG-RGPDDEVQYQQEGSKANQEDQYGSRNFRNDESESEDEAPRRSRHGEGKKK 658 >ref|XP_002300333.2| zinc finger family protein [Populus trichocarpa] gi|550349048|gb|EEE85138.2| zinc finger family protein [Populus trichocarpa] Length = 669 Score = 722 bits (1864), Expect = 0.0 Identities = 405/711 (56%), Positives = 449/711 (63%), Gaps = 18/711 (2%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108 MED EGVLSFDFEGGLD+ P+NP A++P +P+D AA+ A P + + N Sbjct: 1 MEDSEGVLSFDFEGGLDSGPANPIASIPAIPSDNYGAAT---AAAPNTTNTTTNTTNNSN 57 Query: 2107 ------YPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDC 1946 GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGECREQDC Sbjct: 58 SGAADIQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 117 Query: 1945 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNR 1766 VYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ L+S+N N+ Sbjct: 118 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVVQKIQQLNSYNGVTSNK 177 Query: 1765 FFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXX 1586 FQ RNAG++QQIE+ NTI KPS T ES N+ Sbjct: 178 NFQQRNAGFSQQIEK-----SPNTI-----IKPSGT-ESANVQQQQQQQQQTQTPHLTNG 226 Query: 1585 XXXXQNIPDGMPNQGNKTALPLPQGLSR-----------YFIVKSCNRENLELSVQQGVW 1439 PN N+ A PLPQG+S YFIVKSCNRENLELSVQQGVW Sbjct: 227 QHQQPQ----QPNPLNRIATPLPQGISSFFSCVSPSQFVYFIVKSCNRENLELSVQQGVW 282 Query: 1438 ATQRSNEAKLNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHY 1259 ATQRSNE KLNEA D+ +NVILIFSVNRTRHFQGCAKM SKIG VGGGNWKY+HG+AHY Sbjct: 283 ATQRSNEIKLNEALDSADNVILIFSVNRTRHFQGCAKMASKIGASVGGGNWKYAHGTAHY 342 Query: 1258 GRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSE 1079 GRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELE SIGEQLASLLYLEPDSE Sbjct: 343 GRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSE 402 Query: 1078 LMAISVXXXXXXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-AT 902 LMA+S+ KGVNPD G ENPDIVPF S+GQ LG A Sbjct: 403 LMAVSLAAEAKREEEKEKGVNPDSGGENPDIVPFEDNEEEEEEESEEEEESFGQPLGPAA 462 Query: 901 QGRGRGRGMLWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMA 722 QGRGRGRGM+WP H + P+MMGADGF+YG +TPD M P+LF +A Sbjct: 463 QGRGRGRGMMWPSHNPMARGARPIPGIRGFPPMMMGADGFSYGAVTPDSFGM-PDLFGVA 521 Query: 721 PRVFPSYGPRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMV 542 R FP YGPRF DFT G SGMMF GRPSQPG +FP A G GM+ Sbjct: 522 SRGFPPYGPRFSGDFT----------------GAASGMMFPGRPSQPGAVFP-AGGFGMM 564 Query: 541 TXXXXXXXXXXXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQE 362 P +N N+ S K + Sbjct: 565 MGPGRPPFIG-------GMGPTPSNLLRGPRPGGMFAPFPAPSSQNNSRSV-----KRDQ 612 Query: 361 MASPGDRTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209 A+ DR ++ Q G + N RNDESESEDEAPRRSRHGEGKK+ Sbjct: 613 RAAANDR---NDRHNQFGAV-----NSIRNDESESEDEAPRRSRHGEGKKK 655 >ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Fragaria vesca subsp. vesca] Length = 689 Score = 715 bits (1845), Expect = 0.0 Identities = 396/714 (55%), Positives = 444/714 (62%), Gaps = 21/714 (2%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDP----SIAASGNPALVPGPGQSVISDP 2120 MEDP+GVL+FDFEGGLD+A + L + P S A+ P P P Sbjct: 1 MEDPDGVLNFDFEGGLDSAAVSAPTHTGLASSAPIQSDSFASQPKNQAAPAP------QP 54 Query: 2119 VPGNYP-GRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCV 1943 P P GR+SFRQTVCRHWLR LCMKGEACGFLHQYDKSRMP+CRFFR+YGECREQDCV Sbjct: 55 DPNVNPSGRKSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRMYGECREQDCV 114 Query: 1942 YKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRF 1763 YKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ+L+S+NY N+F Sbjct: 115 YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNNSNKF 174 Query: 1762 FQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXX 1583 Q RN G+ QQ +R Q Q +N+ NQ V +PS AES N+ Sbjct: 175 SQPRNGGFPQQHDRSQPAQVTNSFNQ-VVVRPSA-AESANVQQPQQFQQTQQPVAQTQAQ 232 Query: 1582 XXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 1403 ++P+G+ +Q N+ ALPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNE Sbjct: 233 ----SVPNGLASQANRAALPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNE 288 Query: 1402 AFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLC 1223 AFD+ ENVILIFSVNRTRHFQGCAKM S+IGG V GGNWKY+HG+AHYGRNFSVKWLKLC Sbjct: 289 AFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLC 348 Query: 1222 ELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXX 1043 ELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAIS+ Sbjct: 349 ELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKR 408 Query: 1042 XXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRGRGMLWPP 863 KGVNP++G ENPDIVPF Y GA + RGRGR ++WPP Sbjct: 409 EEEKAKGVNPENGGENPDIVPF-EDNEEEEEEESDDEEDYQVPGGAIENRGRGR-VMWPP 466 Query: 862 HMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPP 683 HM L P MMG D YG +TPDG MP PR F YGPRF Sbjct: 467 HMPLGGRGGRPMPGMQGFPGMMGPDAMPYGPVTPDGFVMPNPFGMGGPRGFNPYGPRFSG 526 Query: 682 DFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAA--------------SGLGM 545 DF G GMMF GRP QPG +FP G+G+ Sbjct: 527 DF----------------GGPNPGMMFRGRPPQPGGMFPPGPYGMMMGPGRGPFMGGMGV 570 Query: 544 VTXXXXXXXXXXXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQ 365 M P N N+RYS GS GK Sbjct: 571 GGNNPARGGRPGGMPPMFPPHPPSQN----NNRLQKRDPRGSGNDRNERYSAGSGHGKEM 626 Query: 364 EMASPGDRTTYDET--KYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209 + P D Y + YQ + N RND+SESEDEAPRRSRHGEGKK+ Sbjct: 627 QAGGPDDENHYQHSSKSYQE---DYGAGNNGRNDDSESEDEAPRRSRHGEGKKK 677 >gb|EYU43238.1| hypothetical protein MIMGU_mgv1a002387mg [Mimulus guttatus] Length = 681 Score = 711 bits (1834), Expect = 0.0 Identities = 399/731 (54%), Positives = 451/731 (61%), Gaps = 26/731 (3%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPG- 2111 M+D EG LSFDFEGGLD PS+P+A+VP++ + + + A P + + PVP Sbjct: 1 MDDGEGGLSFDFEGGLDIGPSHPTASVPVIQSSANANTASAAAAAANP-YNPSAAPVPAT 59 Query: 2110 ------NYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQD 1949 N GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMPICRFFRLYGECREQD Sbjct: 60 QAAEGMNNGGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQD 119 Query: 1948 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPN 1769 CVYKHTNED+KECNMYKLGFCPNGPDCRYRH KLPGPPP VEEV QKIQ L+S+NYG N Sbjct: 120 CVYKHTNEDVKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQQLTSYNYGKSN 179 Query: 1768 RFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXX 1589 FFQNRN+ +AQQ E+PQFPQG N +Q + AE N+ Sbjct: 180 NFFQNRNSNFAQQTEKPQFPQGPNGTHQ---VGKTNAAEPGNLNQPAQQSQQPGSQGQLQ 236 Query: 1588 XXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1409 +IP+ NQ ++ A PLPQG SRYF+VKSCNRENLELSVQQGVWATQRSNEAKL Sbjct: 237 ------SIPNDQQNQASRNATPLPQGASRYFVVKSCNRENLELSVQQGVWATQRSNEAKL 290 Query: 1408 NEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLK 1229 NEAF++ EN+ILIFSVN+TRHFQGCAKMTS+IGG VGGGNWK++HG+AHYGRNF++KWLK Sbjct: 291 NEAFESVENIILIFSVNKTRHFQGCAKMTSRIGGSVGGGNWKHAHGTAHYGRNFALKWLK 350 Query: 1228 LCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXX 1049 LCEL+F KTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDS+LMAI++ Sbjct: 351 LCELTFDKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSDLMAIAIAAEL 410 Query: 1048 XXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSY-----GQALGATQGRGRG 884 KGVN D+G ENPDIVPF GQA GA QGRG G Sbjct: 411 KREEEKAKGVNIDNGAENPDIVPFEDNEEEEEEEEEEEESEDEDEFPGQAFGA-QGRGVG 469 Query: 883 RGMLWPPHMT-LXXXXXXXXXXXXXXPVMMGADGFTYGTITP---DGLPMPPELFNMAPR 716 RGM+W PHM L P MMG DGF YG P DG PM + F M PR Sbjct: 470 RGMMWGPHMPPLGRGPRPFPGVRGFPPNMMGGDGFPYGHGPPLNHDGFPMH-DPFGMVPR 528 Query: 715 VFPSYGPRFPPDFTGLGQSSAM-------GFAPLGGAGLTS--GMMFHGRP-SQPGPIFP 566 F +GPRF DF G M GF P+ G G G GRP P P FP Sbjct: 529 GFGQFGPRFGGDFAGPASGPMMFAGRPPGGFGPMMGQGRGPFMGGGRGGRPVGMPPPFFP 588 Query: 565 AASGLGMVTXXXXXXXXXXXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYG 386 PPV A ND Sbjct: 589 PPP------------------------PPVAAQPPPQNSNWVKRDQKAPYSDRNDV---- 620 Query: 385 SDPGKGQEMASPGDRTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR* 206 SD GKGQE+ S G A+ + YRNDESESEDEAPRRSRHGEGKK+ Sbjct: 621 SDQGKGQEIVSGSSNR----------GNAAKREESYRNDESESEDEAPRRSRHGEGKKKR 670 Query: 205 GS*HREMEREY 173 E + E+ Sbjct: 671 RGSEAETDGEF 681 >ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Solanum tuberosum] Length = 677 Score = 709 bits (1829), Expect = 0.0 Identities = 395/705 (56%), Positives = 456/705 (64%), Gaps = 12/705 (1%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNP----ALVPGPGQSVISDP 2120 M+D EG L+FDFEGGLDT P++P+A+VP++ + I P ALVP PG V Sbjct: 1 MDDGEGGLNFDFEGGLDTGPTHPTASVPVLQSAGHITTGPAPNASVALVP-PGGGV-GQG 58 Query: 2119 VPGNYPG-RRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCV 1943 G++ G RRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGECREQDCV Sbjct: 59 GDGSFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCV 118 Query: 1942 YKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRF 1763 YKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPV EV Q+IQNL+S YG+ NRF Sbjct: 119 YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVVEVLQRIQNLTS--YGYSNRF 176 Query: 1762 FQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXX 1583 FQNRN Y+ Q ++ Q PQ N +NQ V ST AE P Sbjct: 177 FQNRNTNYSTQADKSQIPQVPNVMNQAVK---STAAEPPIGQPHQPHQQQVQQPQHQGAP 233 Query: 1582 XXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 1403 Q +P +Q N+ A+PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE Sbjct: 234 TQTQTLPS---SQQNQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 290 Query: 1402 AFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLC 1223 AFD+ ENVIL+FS+NRTRHFQG AKMTS+IGG GGNWK+ HG+AHYGRNFS+KWLKLC Sbjct: 291 AFDSVENVILVFSINRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSLKWLKLC 350 Query: 1222 ELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXX 1043 ELSF KTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLY+EPDSELMA+S+ Sbjct: 351 ELSFQKTRHLRNPYNENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAVSLAAESKR 410 Query: 1042 XXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXS--YGQALG-ATQGRGRGRGML 872 KGVNPD+G ENPDIVPF +GQA G A GRGRGRG++ Sbjct: 411 EEERAKGVNPDNGNENPDIVPFEDNEEEEEEESEEEEEDEGFGQAFGPAALGRGRGRGIV 470 Query: 871 WPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPR 692 WPP + P MM +DGF+YG++TPDG PMP + + M R F +GPR Sbjct: 471 WPPLVPFGRGARPFPGMRGFPPGMM-SDGFSYGSMTPDGFPMP-DPYGMGGRPFGPFGPR 528 Query: 691 FPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXX 512 FP D + A G G G+ MM GRP G + P A G Sbjct: 529 FPGDMMFHSRPPAAG-----GFGM---MMGPGRPPFMGGMGPGAPG-----PPRGGRPMG 575 Query: 511 XXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMAS----PGD 344 + TPP N NDR+S G D G+GQE+A P + Sbjct: 576 IHPSFIPPTPPPSQNPRVKKDQRAPFNER------NDRFSSGPDQGRGQEIAGSVGGPAE 629 Query: 343 RTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209 Y +T+ N +RNDESESEDEAPRRSRHG+GKK+ Sbjct: 630 GVHYPQTE-----------NSFRNDESESEDEAPRRSRHGDGKKK 663 >ref|XP_004233145.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Solanum lycopersicum] Length = 671 Score = 704 bits (1818), Expect = 0.0 Identities = 391/698 (56%), Positives = 451/698 (64%), Gaps = 5/698 (0%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108 M+D EG L+FDFEGGLDT P++P+A+VP++ P+ AS + PG G + D G+ Sbjct: 1 MDDGEGGLNFDFEGGLDTGPTHPTASVPVIQAGPAPNASV-AVVPPGGGVGLGGD---GS 56 Query: 2107 YPG-RRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHT 1931 + G RRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGECREQDCVYKHT Sbjct: 57 FVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHT 116 Query: 1930 NEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRFFQNR 1751 NEDIKECNM+KLGFCPNGPDCRYRH K+PGPPPPV EV QKIQNL+S +G+ NRFFQNR Sbjct: 117 NEDIKECNMFKLGFCPNGPDCRYRHAKMPGPPPPVVEVLQKIQNLTS--HGYSNRFFQNR 174 Query: 1750 NAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXXXXXQ 1571 N Y+ Q ++ Q PQ N +NQ V ST E P Q Sbjct: 175 NTNYSTQADKSQIPQVPNVMNQAVK---STATEPPIGQPHQPHQQQVQQPQHQGPPTQTQ 231 Query: 1570 NIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDT 1391 +P Q N+ A+PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+ Sbjct: 232 TLPG---TQQNQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 288 Query: 1390 TENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLCELSF 1211 ENVILIFS+NRTRHFQG AKMTS+IGG GGNWK+ HG+AHYGRNFSVKWLKLCELSF Sbjct: 289 VENVILIFSINRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSVKWLKLCELSF 348 Query: 1210 HKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXXXXXX 1031 KTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLY+EPDSELMAIS+ Sbjct: 349 QKTRHLRNPYNENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAISLAAESKREEER 408 Query: 1030 XKGVNPDDGVENPDIVPF---XXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGMLWPP 863 KGVNPD+G ENPDIVPF +GQALG A RGRGRG++WPP Sbjct: 409 AKGVNPDNGNENPDIVPFEDNEEEEEEESEEEDEEDEGFGQALGPAALDRGRGRGIVWPP 468 Query: 862 HMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPP 683 + +M +DGF+YG++TPDG PM P+ + M R F +GPRFP Sbjct: 469 LVPFRGARPFPGMRGFPPGIM--SDGFSYGSMTPDGFPM-PDPYGMGGRPFGPFGPRFPG 525 Query: 682 DFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXXXX 503 D + A GG G+ MM RP G + P A G Sbjct: 526 DMMFHSRPPA-----AGGFGM---MMGPARPPFMGGMGPGAPG-----PPRGGRPMGMHP 572 Query: 502 XXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMASPGDRTTYDET 323 PP N NDR+S G D G+GQE A G DE Sbjct: 573 SFTPPPPPPSQN------PRVKKDQRAPFNERNDRFSSGPDQGRGQETA--GSVVGPDEG 624 Query: 322 KYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209 + P Q++N +RNDESESEDEAPRRSRHG+GKK+ Sbjct: 625 VHYP-----QTENSFRNDESESEDEAPRRSRHGDGKKK 657 >gb|AHN05783.1| YTH domain-contained RNA binding protein 14 [Malus domestica] Length = 667 Score = 702 bits (1812), Expect = 0.0 Identities = 392/718 (54%), Positives = 444/718 (61%), Gaps = 25/718 (3%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLD-----TAPSNPSAAVP-----LVPTDPSIAASG--NPALVPGP 2144 MED +G L+FDFEGGLD +A + P+ VP ++ +D ++ G A P P Sbjct: 1 MEDSDGGLNFDFEGGLDAPATVSASAGPANTVPTSNYSVMQSDSAVTGLGANQAAAAPQP 60 Query: 2143 GQSVISDPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGE 1964 Q+ N G RS+RQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGE Sbjct: 61 NQNA-------NRTGGRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGE 113 Query: 1963 CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFN 1784 CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ+L+S+N Sbjct: 114 CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLTSYN 173 Query: 1783 YGFPNRFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXX 1604 Y ++F+Q RNAG+ QQ ++ Q QG N KP TTAE N+ Sbjct: 174 YNNSSKFYQQRNAGFPQQGDKHQPAQGPNNF----VGKP-TTAEPGNVQQQQQQQLQQTQ 228 Query: 1603 XXXXXXXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRS 1424 +P+G+ NQ N++ALPLPQG SRYFIVKSCNRENLELSVQQG+WATQRS Sbjct: 229 QHVGPTQTQ--TLPNGLANQANRSALPLPQGTSRYFIVKSCNRENLELSVQQGLWATQRS 286 Query: 1423 NEAKLNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFS 1244 NE+KLNEAFD+ ENVILIFSVNRTRHFQGCAKM S+IGG VGGGNWKY+HG+AHYGRNFS Sbjct: 287 NESKLNEAFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVGGGNWKYAHGTAHYGRNFS 346 Query: 1243 VKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAIS 1064 VKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAIS Sbjct: 347 VKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELELSVGEQLASLLYLEPDSELMAIS 406 Query: 1063 VXXXXXXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGA---TQGR 893 + KGVNP++G ENPDIVPF S+GQ GA +GR Sbjct: 407 IAAESKREEEKAKGVNPENGGENPDIVPFEDNEEEEEEESEDEEDSFGQVPGAGNDGRGR 466 Query: 892 GRGRGMLWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRV 713 GRG G++WPPHM L P MMG D Y PDG M P F MAPR Sbjct: 467 GRGGGVMWPPHMALPRGGRPMPGMQGFPPGMMGHDAMPY---VPDGFVM-PNPFGMAPRG 522 Query: 712 FPSYGPRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFP----------A 563 F YGPRF DFT G GMMF GRP QPG FP Sbjct: 523 FNPYGPRFSGDFT----------------GPNPGMMFRGRPQQPG--FPPGGFGIMGPGR 564 Query: 562 ASGLGMVTXXXXXXXXXXXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGS 383 A +G + PP N S Sbjct: 565 APFMGGIHPGRGGRPTGMSPMFPPPPPPSSQNPNRMPKRDPRG---------------AS 609 Query: 382 DPGKGQEMASPGDRTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209 KGQ+M+ P D T Y + N RND+SESEDEAPRRSRHG+GKK+ Sbjct: 610 TDRKGQDMSGPDDETHYG------------AGNSSRNDDSESEDEAPRRSRHGDGKKK 655 >ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Solanum lycopersicum] Length = 689 Score = 698 bits (1801), Expect = 0.0 Identities = 385/701 (54%), Positives = 447/701 (63%), Gaps = 8/701 (1%) Frame = -2 Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108 M++ EG L+FDFEGGLDT P++P+A+VP++ + AA+ + A + P + Sbjct: 1 MDEGEGGLNFDFEGGLDTGPTHPTASVPVIQSFDHTAAAASSANINPPTVPAVGGQGDVG 60 Query: 2107 YPG-RRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHT 1931 + G RRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHT Sbjct: 61 FVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHT 120 Query: 1930 NEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRFFQNR 1751 EDIKECNMYKLGFCPNGPDCRYRH K+PGPPPPVEE+ QKIQ+L+S NYG+ NRF QNR Sbjct: 121 IEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASNNYGYSNRFNQNR 180 Query: 1750 NAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXXXXXQ 1571 NA Y+ Q ++ Q Q N + V ST E+P + Sbjct: 181 NANYSTQTDKSQASQAQNGTSLAVK---STATETPIIQQHQPHQQVQPPQLQGGPTQAQI 237 Query: 1570 NIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDT 1391 + P+G NQ ++TA+ LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+ Sbjct: 238 H-PNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 296 Query: 1390 TENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLCELSF 1211 ENVILIFSVNRTRHFQGC KMTS+IGG GGNWK+ HG+AHYGRNFS+KWLKLCELSF Sbjct: 297 VENVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSLKWLKLCELSF 356 Query: 1210 HKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXXXXXX 1031 KT HLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAIS+ Sbjct: 357 QKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRLEEK 416 Query: 1030 XKGVNPDDGVENPDIVPF--XXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGMLWPPH 860 KGVNPD+G +NPDIVPF ++ Q G A GRGRGRG+ WPP Sbjct: 417 AKGVNPDNGKDNPDIVPFEDNEEEEDEEEESEDEDENFDQGFGPAALGRGRGRGIAWPPI 476 Query: 859 MTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPPD 680 M P MMG DGF+YG +TP+G PM + F M PR FP YGPRF D Sbjct: 477 MPFGHGPRPPPGMRGFPPGMMG-DGFSYGAMTPEGFPM-TDHFGMGPRPFPPYGPRFSSD 534 Query: 679 FTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXXXXX 500 G+ P GG G+ M+ GRP G + P A+G Sbjct: 535 LMFHGR------PPAGGFGM---MIGPGRPPFVGGMGPGATGPPRAGRAVRMHPSFIPPS 585 Query: 499 XMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMASPGDRTTYDETK 320 + P A NDR+S SD GKGQEM G D Sbjct: 586 SQPSQYPYRAK----------REQRAPVSDRNDRFS--SDQGKGQEMM--GSVNGPDGVH 631 Query: 319 YQPGGLKAQSK----NIYRNDESESEDEAPRRSRHGEGKKR 209 Q G + ++ N +ND SESEDEAPRRSRHG+GKK+ Sbjct: 632 MQIGKSEHDNQFGAGNSLKNDGSESEDEAPRRSRHGDGKKK 672