BLASTX nr result
ID: Paeonia22_contig00008238
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00008238
         (2518 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr...   783   0.0
ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation spec...   781   0.0
ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation spec...   778   0.0
ref|XP_007041140.1| Cleavage and polyadenylation specificity fac...   771   0.0
ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prun...   751   0.0
ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm...   750   0.0
ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phas...   746   0.0
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   745   0.0
ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation spec...   736   0.0
gb|EXB51974.1| Cleavage and polyadenylation specificity factor C...   734   0.0
ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation spec...   731   0.0
ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation spec...   731   0.0
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   731   0.0
gb|AHN05783.1| YTH domain-contained RNA binding protein 14 [Malu...   725   0.0
ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citr...   714   0.0
ref|XP_002300333.2| zinc finger family protein [Populus trichoca...   712   0.0
gb|EYU43238.1| hypothetical protein MIMGU_mgv1a002387mg [Mimulus...   711   0.0
ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation spec...   690   0.0
ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation spec...
686 0.0 ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation spec... 683 0.0 >ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] gi|557551535|gb|ESR62164.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] Length = 701 Score = 783 bits (2022), Expect = 0.0 Identities = 428/705 (60%), Positives = 468/705 (66%), Gaps = 42/705 (5%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQSDSXXXXXXXXXXXXXV--------PSAE 2178 MEDSEG LSFDFEGGL+ P TA+ P IQSDS P Sbjct: 1 MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60 Query: 2177 PPAMNNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDC 1998 + + SGRRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR++GECREQDC Sbjct: 61 SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120 Query: 1997 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNK 1818 VYKHTNEDIKECNMYKLGFCPNGPDCRYRH KL GPPP+VEEVLQKIQQ++S+++GN NK Sbjct: 121 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180 Query: 1817 FYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQ 1638 +Q RG A +Q +K Q QGPN N GA KSSTAES Sbjct: 181 LFQQRG-AFSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQM 239 Query: 1637 -TLPNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 1461 LPNGLPNQ N+ +PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD Sbjct: 240 QNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 299 Query: 1460 SVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELT 1281 S ENVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKYAHGT HYGRNFSVKWLKLCEL+ Sbjct: 300 SAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS 359 Query: 1280 FHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXX 1101 FHKTRHLRNPYNENLPVKISRDCQELEPSIGE+LA+LLYLEPDSELMAISV Sbjct: 360 FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEE 419 Query: 1100 XXKGVNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQTXXXXXXXXXXXXXXXXXPL 921 KGVN +NG +NPDIVPF+DN SLG T PL Sbjct: 420 KAKGVNPDNGGDNPDIVPFEDN-EEEEEEESEEEEESLG-TASQGRGRGRGMMWPGPMPL 
477 Query: 920 ARGARPMPGVRGFPPVMMGPDGFSYG--PDGFPIPDLFGVGPRPFAPYGPRFSGDFSGP- 750 ARGARP+PG+RGFPP+M+G DGFSYG PDGFP+PDLFGV PRPFAPYGPRFSGDF+GP Sbjct: 478 ARGARPVPGMRGFPPMMIGADGFSYGVTPDGFPMPDLFGVAPRPFAPYGPRFSGDFTGPG 537 Query: 749 GMMY--RPQQ--------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 618 GMM+ RP Q Sbjct: 538 GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPFPNQPQ 597 Query: 617 PQQNNNRMVKRDQRGPVSDRNERY--------------XXXXXXXXXXXXXXXXXXXXXX 480 QN++R+ KRD RG ++DRN+RY Sbjct: 598 SSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSKANQED 657 Query: 479 QFGSGSKFRNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 Q+GS FRN+ESESEDEAPRRSRHGEGKKKRR SE D SSD+ Sbjct: 658 QYGS-RNFRNDESESEDEAPRRSRHGEGKKKRRDSEGDAAASSDN 701 >ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Vitis vinifera] Length = 673 Score = 781 bits (2016), Expect = 0.0 Identities = 426/682 (62%), Positives = 454/682 (66%), Gaps = 32/682 (4%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQSDSXXXXXXXXXXXXXVPSAEPPAMNNVS 2154 MED+EGVLSFDFEGGL+ P A PLIQSD+ SAEP Sbjct: 1 MEDAEGVLSFDFEGGLDAAPGTAATVAPLIQSDATAAAAAPSSVV----SAEPTP-GGAP 55 Query: 2153 GRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDCVYKHTNED 1974 GRRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR+YGECREQDCVYKHTNED Sbjct: 56 GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNED 115 Query: 1973 IKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNKFYQHRGPA 1794 IKECNMYKLGFCPNG DCRYRHAKL GPPP +EEV QKIQQL+SF+YG+ N+FYQ+R P Sbjct: 116 IKECNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRFYQNRNPY 175 Query: 1793 PPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQTLPNGLPN 1614 Q+EK Q+ QG N N G V KSST E+ Q LPNGLPN Sbjct: 176 NQ-QTEKSQILQGSNAVNLGTVAKSSTTEA-INVQQQQVQPPQQQVSQTPMQNLPNGLPN 233 Query: 1613 QANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILIF 1434 QANK SPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILIF Sbjct: 234 QANKTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILIF 293 Query: 1433 
SVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELTFHKTRHLRN 1254 SVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGT HYGRNFSVKWLKLCEL+FHKTRHLRN Sbjct: 294 SVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRN 353 Query: 1253 PYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXXXXKGVNSNN 1074 PYNENLPVKISRDCQELEPSIGE+LASLLYLEPDSELMAIS+ KGVN +N Sbjct: 354 PYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKREEEKAKGVNPDN 413 Query: 1073 GAENPDIVPFDDNXXXXXXXXXXXXXXSLGQT---XXXXXXXXXXXXXXXXXPLARGARP 903 G ENPDIVPF+DN S GQ PLARGARP Sbjct: 414 GGENPDIVPFEDN-EEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPPHMPLARGARP 472 Query: 902 MPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAPYGPRFSGDFSGP--GMM- 741 +P +RGFPPVMMG DGFSY PDGF +PD+FGVGPR F PYGPRFSGDF+GP GMM Sbjct: 473 IPSMRGFPPVMMGADGFSYSAVPPDGFAMPDIFGVGPRAFPPYGPRFSGDFTGPASGMMF 532 Query: 740 --------------YRPQQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPQQNN 603 Y P N Sbjct: 533 PGRGQPGAVFPASGYGMMMGPGRAPFMGGMGVPAAAPTRAGRPVGMPPMFPPPPPPNSQN 592 Query: 602 NRMVKRDQRGPVSDRNERY---------XXXXXXXXXXXXXXXXXXXXXXQFGSGSKFRN 450 NR KRDQR PV+DRN+RY QFG G+ FRN Sbjct: 593 NR-TKRDQRTPVNDRNDRYSGGSDQGRGQDMAGPDDETQYLQGLKSQQDDQFGGGNSFRN 651 Query: 449 EESESEDEAPRRSRHGEGKKKR 384 +ESESEDEAPRRSRHGEGKKKR Sbjct: 652 DESESEDEAPRRSRHGEGKKKR 673 >ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Citrus sinensis] Length = 683 Score = 778 bits (2008), Expect = 0.0 Identities = 424/697 (60%), Positives = 464/697 (66%), Gaps = 34/697 (4%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQSDSXXXXXXXXXXXXXVPSAEPPAMNNVS 2154 MEDSEG LSFDFEGGL+ P TA+ P S P + + S Sbjct: 1 MEDSEGGLSFDFEGGLDAGPGMPTASNPAAAPSSSGAA----------PDHASAPVPHHS 50 Query: 2153 GRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDCVYKHTNED 1974 GRRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR++GECREQDCVYKHTNED Sbjct: 51 GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCVYKHTNED 110 Query: 1973 IKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNKFYQHRGPA 1794 IKECNMYKLGFCPNGPDCRYRH KL 
GPPP+VEEVLQKIQQ++S+++GN NK +Q RG A Sbjct: 111 IKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNKHFQQRG-A 169 Query: 1793 PPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQ-TLPNGLP 1617 +Q++K Q QGPN N GA KSSTAES LPNGLP Sbjct: 170 FSHQTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQMQNLPNGLP 229 Query: 1616 NQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILI 1437 NQ N+ +PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS ENVILI Sbjct: 230 NQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAENVILI 289 Query: 1436 FSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELTFHKTRHLR 1257 FSVNRTRHFQGCAKMTSKIGG VGGGNWKYAHGT HYGRNFSVKWLKLCEL+FHKTRHLR Sbjct: 290 FSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLR 349 Query: 1256 NPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXXXXKGVNSN 1077 NPYNENLPVKISRDCQELEPSIGE+LA+LLYLEPDSELMAISV KGVN + Sbjct: 350 NPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEEKAKGVNPD 409 Query: 1076 NGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQTXXXXXXXXXXXXXXXXXPLARGARPMP 897 NG +NPDIVPF+DN SLG T PLARGARP+P Sbjct: 410 NGGDNPDIVPFEDN-EEEEEEESEEEEESLG-TASQGRGRGRGMMWPGPMPLARGARPVP 467 Query: 896 GVRGFPPVMMGPDGFSYG--PDGFPIPDLFGVGPRPFAPYGPRFSGDFSGP-GMMY--RP 732 G+RGFPP+M+G DGFSYG PDGFP+PDLFGV PRPFAPYGPRFSGDF+GP GMM+ RP Sbjct: 468 GMRGFPPMMIGADGFSYGVTPDGFPMPDLFGVAPRPFAPYGPRFSGDFTGPGGMMFPGRP 527 Query: 731 QQ--------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPQQNNNRM 594 Q QN++R Sbjct: 528 PQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPFPNQPQSSQNSSRA 587 Query: 593 VKRDQRGPVSDRNERY--------------XXXXXXXXXXXXXXXXXXXXXXQFGSGSKF 456 KRD RG ++DRN+RY Q+GS F Sbjct: 588 AKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSKANQEDQYGS-RNF 646 Query: 455 RNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 RN+ESESEDEAPRRSRHGEGKKKRR SE D SSD+ Sbjct: 647 RNDESESEDEAPRRSRHGEGKKKRRDSEGDAAASSDN 683 >ref|XP_007041140.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao] gi|508705075|gb|EOX96971.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao] Length 
= 698 Score = 771 bits (1991), Expect = 0.0 Identities = 422/704 (59%), Positives = 455/704 (64%), Gaps = 41/704 (5%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQSDSXXXXXXXXXXXXXVPSAEPPAMNNVS 2154 M+DSEG LSFDFEGGL+ P TA+MP++ SD VP A P + N+ + Sbjct: 1 MDDSEGGLSFDFEGGLDAGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTNDPA 60 Query: 2153 --------GRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDC 1998 GRRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR++GECREQDC Sbjct: 61 AAVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120 Query: 1997 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNK 1818 VYKHTNEDIKECNMYKLGFCPNG DCRYRHAKL GPPP VEEVLQKIQQL+S++Y NK Sbjct: 121 VYKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNY---NK 177 Query: 1817 FYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQ 1638 F+Q R Q+EK Q+PQG N N GA K ST ES Q Sbjct: 178 FFQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTTESANMHPQQQVQQPQQQVSQTQIQ 237 Query: 1637 TLPNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 1458 +PNG NQANK PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS Sbjct: 238 NVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 297 Query: 1457 VENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELTF 1278 ENVILIFSVNRTRHFQGCAKMTSKIGG V GGNWKYAHGT HYGRNFSVKWLKLCEL+F Sbjct: 298 AENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSF 357 Query: 1277 HKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXXX 1098 HKTRHLRNPYNENLPVKISRDCQELEPSIGE+LASLLYLEPDSELMAISV Sbjct: 358 HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAELKREEEK 417 Query: 1097 XKGVNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQTXXXXXXXXXXXXXXXXXPLA 918 KGVNS+NG ENPDIVPF+DN PLA Sbjct: 418 AKGVNSDNGGENPDIVPFEDNEEEEEEESEEEDESF--SAAAQGRGRGRGVMWPPHMPLA 475 Query: 917 RGARPMPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAPYGPRFSGDFSGP- 750 RGARPMPG+RGFPP+MMG DGFSYG PDGF +PDLFG PRPF PYGPRFSGDF+GP Sbjct: 476 RGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVPDLFG-APRPFPPYGPRFSGDFTGPA 534 Query: 749 
-GMMY--RPQQ---------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 624 GMM+ RP Q Sbjct: 535 SGMMFPGRPPQPGAMFPAGGLGMMMGPGRAPFMGGMGPTGANPVRGGRPVSMPPMFPPPP 594 Query: 623 XXPQQNNNRMVKRDQRGPVSDR-----------NERYXXXXXXXXXXXXXXXXXXXXXXQ 477 QN+ R VKRDQR P +DR Q Sbjct: 595 APSSQNSGRAVKRDQRTPTNDRYGAGSEQGRGQEMAGPGGRLDDETQYQQEGQKAHHEDQ 654 Query: 476 FGSGSKFRNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 F +G+ FRN+ESESEDEAPRRSR+GEGKKKRRS E DD SDH Sbjct: 655 FAAGNSFRNDESESEDEAPRRSRYGEGKKKRRSLEGDDANGSDH 698 >ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica] gi|462410040|gb|EMJ15374.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica] Length = 695 Score = 751 bits (1939), Expect = 0.0 Identities = 414/704 (58%), Positives = 452/704 (64%), Gaps = 41/704 (5%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETV----PTN-ATATMPLIQSDSXXXXXXXXXXXXXVPSAEPPA 2169 MEDS+G ++FDFEGGL+ PTN + L+QSDS P+A P Sbjct: 1 MEDSDGDINFDFEGGLDATAAAGPTNPGPPSNSLMQSDSGVAAVDTN------PAAAAPQ 54 Query: 2168 MN----NVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQD 2001 N N SG RSYRQTVCRHWLRSLCMKG+ACGFLHQYDKSRMPVCRFFR+YGECREQD Sbjct: 55 PNHPNPNRSGGRSYRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECREQD 114 Query: 2000 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQN 1821 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL GPPP VEEVLQKIQ LNS++Y N Sbjct: 115 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNTSN 174 Query: 1820 KFYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXX 1641 KFYQ R P Q++K+Q QGPN G V K ST ES Sbjct: 175 KFYQQRNAGFPQQADKYQSAQGPNSVYQGVVGKPSTGESANVHQQQQVQQTQQQVGHTQT 234 Query: 1640 QTLPNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 1461 Q LPNGL NQAN+ +PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFD Sbjct: 235 QNLPNGLANQANRS-APLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFD 293 Query: 1460 SVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELT 1281 S ENVILIFSVNRTRHFQGCAKM S+IGG V GGNWKYAHG+ HYGRNFSVKWLKLCEL+ Sbjct: 294 
SAENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGSAHYGRNFSVKWLKLCELS 353 Query: 1280 FHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXX 1101 FHKTRHLRNPYNENLPVKISRDCQELEPSIGE+LASLLYLEPDSELMA+S+ Sbjct: 354 FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSIAAESKREEE 413 Query: 1100 XXKGVNSNNGAENPDIVPFDDN---XXXXXXXXXXXXXXSLGQTXXXXXXXXXXXXXXXX 930 KGVN NG ENPDIVPF+DN G Sbjct: 414 KAKGVNPENGGENPDIVPFEDNEEEEEEESDDEEESFGPVPGVGNEGRGRGRGGIMWPPH 473 Query: 929 XPLARGARPMPGVRGFPPVMMGPDGFSYG--PDGFPIPDLFGVGPRPFAPYGPRFSGDFS 756 PLARG RPMPG++GFPP MMG D YG PDGF +P+ FGVGPR F PYGPRFSGDF+ Sbjct: 474 MPLARGGRPMPGMQGFPPGMMGADAMPYGPAPDGFGMPNPFGVGPRGFNPYGPRFSGDFT 533 Query: 755 G--PGMMY--RPQQ----------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 618 G PGMM+ RPQQ Sbjct: 534 GPTPGMMFRGRPQQPGFPPGGYGMMMGPGRAPFMGGMGVGGANPGRPGRPTGMSPMFPPP 593 Query: 617 PQQNNNRMVKRDQRGPVSDRNERY-------------XXXXXXXXXXXXXXXXXXXXXXQ 477 QN NRM KRD RGP +DRNERY Q Sbjct: 594 SSQNTNRMQKRDPRGPSNDRNERYSAGSGQGKGQEIPGLAGGPDDEARYQQASKAYREDQ 653 Query: 476 FGSGSKFRNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 +G+G+ RN++SESEDEAPRRSRHGEGKKK R SE D +S+H Sbjct: 654 YGAGNNSRNDDSESEDEAPRRSRHGEGKKKGRGSEGD--VTSEH 695 >ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis] gi|223537608|gb|EEF39232.1| conserved hypothetical protein [Ricinus communis] Length = 702 Score = 750 bits (1937), Expect = 0.0 Identities = 414/703 (58%), Positives = 451/703 (64%), Gaps = 40/703 (5%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETV-PTNATATMPLIQSDSXXXXXXXXXXXXXV-------PSAE 2178 M+D++G LSFDFEGGL++ PTN TA++P I SD+ SA Sbjct: 1 MDDTDGGLSFDFEGGLDSSGPTNPTASIPAIPSDNTAAVAAATNNSIVPNVSSNDPASAA 60 Query: 2177 PPAMNNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDC 1998 A NN +GRRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR+YGECREQDC Sbjct: 61 AAAANNQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 120 Query: 1997 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNK 1818 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL GPPP VEEVLQKIQQLNS++YG+ NK Sbjct: 121 
VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGSSNK 180 Query: 1817 FYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVK-----SSTAESPXXXXXXXXXXXXXXXX 1653 F+Q RG ++K Q QGPN G K S+ + P Sbjct: 181 FFQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGTESANVQQPQQQQPQPGQGQQSQQQ 240 Query: 1652 XXXXQT--LPNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK 1479 T LPNG PNQAN+ PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK Sbjct: 241 ATQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK 300 Query: 1478 LNEAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWL 1299 LNEAFDS ENVILIFSVNRTRHFQGCAKMTSKIG VGGGNWKYAHGT HYGRNFSVKWL Sbjct: 301 LNEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSVKWL 360 Query: 1298 KLCELTFHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXX 1119 KLCEL+FHKTRHLRNPYNENLPVKISRDCQELEPS+G +LA LLY EPDSELMAIS+ Sbjct: 361 KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAISLAAE 420 Query: 1118 XXXXXXXXKGVNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQT--XXXXXXXXXXX 945 KGVN NG +NPDIVPF+DN S GQ Sbjct: 421 AKREEEKAKGVNPENGGDNPDIVPFEDN-EEEEEEESEEEEESFGQALGAPGQGRGRGRG 479 Query: 944 XXXXXXPLARGARPMPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAPYGPR 774 PLARGARP+PG+RGFPP+MMG D FSYG PDGF +PDLFGV PR F PY PR Sbjct: 480 IIWPHMPLARGARPIPGMRGFPPMMMGADSFSYGPVTPDGFGMPDLFGVAPRGFTPYAPR 539 Query: 773 FSGDFSG--PGMMY--RPQQ----------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 636 FSGDF+G GMM+ RP Q Sbjct: 540 FSGDFTGAASGMMFPGRPPQPGGVFPNGGFGMMMGPGRAPFMGGMGPNSTNPLRGNWPGG 599 Query: 635 XXXXXXPQQNNNRMVKRDQRGPVSDR------NERYXXXXXXXXXXXXXXXXXXXXXXQF 474 P + R VKRDQR +DR R QF Sbjct: 600 MPFPPLPTPSPQRPVKRDQRMTANDRYSTGSDQGRNTAGEPDDEARYQQEGLKASHEDQF 659 Query: 473 GSGSKFRNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 G+G+ FRN+ESESEDEAPRRSRHGEGKKKRR SE D SDH Sbjct: 660 GAGNSFRNDESESEDEAPRRSRHGEGKKKRRGSEGDATPGSDH 702 >ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris] gi|561020727|gb|ESW19498.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris] Length = 697 Score = 746 bits (1926), Expect = 0.0 Identities = 
415/702 (59%), Positives = 452/702 (64%), Gaps = 39/702 (5%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATM-PLIQSDSXXXXXXXXXXXXXVP--SAEPPAMN 2163 MEDSEGVLSFDFEGGL+T P+ A A PL+Q DS P S PA Sbjct: 1 MEDSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSGTEPAAV 60 Query: 2162 NVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDCVYKHT 1983 NV GRRS+RQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFR+YGECREQDCVYKHT Sbjct: 61 NVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHT 120 Query: 1982 NEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNKFYQHR 1803 NEDIKECNMYKLGFCPNGPDCRYRHAK GPPP VEEVLQKIQ L S++Y + NKF+Q R Sbjct: 121 NEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQQR 180 Query: 1802 GPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQ-TLPN 1626 G + Q+EK Q+PQG N N G K AES + N Sbjct: 181 GSSYTQQAEKSQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQQVSQNQIQNVAN 240 Query: 1625 GLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENV 1446 G PNQA++ +PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDSVENV Sbjct: 241 GQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVENV 300 Query: 1445 ILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELTFHKTR 1266 ILIFSVNRTRHFQGCAKMTS+IGG V GGNWKYAHGT HYGRNFSVKWLKLCEL+FHKTR Sbjct: 301 ILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 360 Query: 1265 HLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXXXXKGV 1086 HLRNPYNENLPVKISRDCQELEPSIGE+LASLLYLEPD ELMA+SV KGV Sbjct: 361 HLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKREEEKAKGV 420 Query: 1085 NSNNGAENPDIVPFDDN---XXXXXXXXXXXXXXSLGQTXXXXXXXXXXXXXXXXXPLAR 915 N +NG ENPDIVPF+DN +G PL R Sbjct: 421 NPDNGGENPDIVPFEDNEEEEEEESDEEDESFGHGVGPA-GQGRGRGRGMMWPPHMPLPR 479 Query: 914 GARPMPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAPYGPRFSGDFSGP-- 750 GARPMPG++GF PVMMG DG SYG PDGF +PDLF VGPR FAPYGPRFSGDF GP Sbjct: 480 GARPMPGMQGFNPVMMG-DGLSYGPVAPDGFGMPDLFSVGPRAFAPYGPRFSGDFGGPPA 538 Query: 749 GMMY--RPQQ-------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXP 615 MM+ RP Q Sbjct: 539 
AMMFRGRPSQPGMFPGGGFGMMMNPGRGPFMGGMGVAGANPPRGGRPVNMPPMFPPPPPL 598 Query: 614 QQNNNRMVKRDQRGPVSDRNERY-XXXXXXXXXXXXXXXXXXXXXXQFGSGSK------- 459 QN NR+ KRDQR +DRN+RY Q+ G K Sbjct: 599 PQNTNRLAKRDQR--TTDRNDRYGSGSEQGKSQDMLSQSGAPDDDMQYQQGYKANQDDHP 656 Query: 458 ----FRNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 FRN++SESEDEAPRRSRHGEGKKKRR E D T+ +H Sbjct: 657 AVNNFRNDDSESEDEAPRRSRHGEGKKKRRGPE-DVNTNYNH 697 >ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Glycine max] Length = 691 Score = 745 bits (1923), Expect = 0.0 Identities = 410/693 (59%), Positives = 446/693 (64%), Gaps = 42/693 (6%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMP---LIQSDSXXXXXXXXXXXXXVP--SAEPPA 2169 MEDSEGVLSFDFEGGL+ P++A A +P L+Q DS P S PA Sbjct: 1 MEDSEGVLSFDFEGGLDAAPSSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPSTADPA 60 Query: 2168 MNNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDCVYK 1989 NV GRRS+RQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFR+YGECREQDCVYK Sbjct: 61 GGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 120 Query: 1988 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNKFYQ 1809 HTNEDIKECNMYKLGFCPNGPDCRYRHAK GPPP VEEVLQKIQ L S++Y + NKF+Q Sbjct: 121 HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNYNSSNKFFQ 180 Query: 1808 HRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQTLP 1629 RG + Q+EK Q+PQG N N G K AES Q + Sbjct: 181 QRGASYNQQAEKPQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQVNQSQMQNVA 240 Query: 1628 NGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEN 1449 NG PNQAN+ +PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDSVEN Sbjct: 241 NGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVEN 300 Query: 1448 VILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELTFHKT 1269 VIL+FSVNRTRHFQGCAKMTS+IGG V GGNWKYAHGT HYGRNFSVKWLKLCEL+FHKT Sbjct: 301 VILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 360 Query: 1268 RHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXXXXKG 1089 RHLRNPYNENLPVKISRDCQELEPSIGE+LASLLYLEPDSELMAISV 
KG Sbjct: 361 RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKAKG 420 Query: 1088 VNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQT--XXXXXXXXXXXXXXXXXPLAR 915 VN +NG ENPDIVPF+DN PL R Sbjct: 421 VNPDNGGENPDIVPFEDNEEEEEEESDEEEESFSHGVGPAGQGRGRGRGMMWPPHMPLGR 480 Query: 914 GARPMPGVRGFPPVMMGPDGFSY------GPDGFPIPDLFGVGPRPFAPYGPRFSGDFSG 753 GARPMPG++GF PVMMG DG SY GPDGF +PDLFGVGPR FAPYGPRFSGDF G Sbjct: 481 GARPMPGMQGFNPVMMG-DGLSYGPVGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGDFGG 539 Query: 752 P--GMMY--RPQQ-------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 624 P MM+ RP Q Sbjct: 540 PPAAMMFRGRPSQPGMFPSGGFGMMMNPGRGPFMGGMGVGGANPPRGGRPVNMPPMFPPP 599 Query: 623 XXPQQNNNRMVKRDQRGPVSDRNERY-XXXXXXXXXXXXXXXXXXXXXXQFGSGSK---- 459 QN NR KRDQR +DRN+R+ Q+ G K Sbjct: 600 PPLPQNANRAAKRDQR--TADRNDRFGSGSEQGKSQDMLSQSGGPDDDAQYQQGYKGNQD 657 Query: 458 -------FRNEESESEDEAPRRSRHGEGKKKRR 381 FRN++SESEDEAPRRSRHGEGKKK + Sbjct: 658 DHPAVNNFRNDDSESEDEAPRRSRHGEGKKKHK 690 >ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Cucumis sativus] Length = 707 Score = 736 bits (1899), Expect = 0.0 Identities = 404/701 (57%), Positives = 446/701 (63%), Gaps = 45/701 (6%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTN--ATATMPLIQSDSXXXXXXXXXXXXXVP------SAE 2178 MEDSEGVLSFDFEGGL+ PTN AT+++P+I SDS SAE Sbjct: 1 MEDSEGVLSFDFEGGLDAGPTNPAATSSLPIINSDSSAPPAASAVSNPLSGALGPAVSAE 60 Query: 2177 PPAM--NNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQ 2004 P NV RRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR+YGECREQ Sbjct: 61 PTGAPHGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQ 120 Query: 2003 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQ 1824 DCVYKHTNEDIKECNMYK GFCPNGPDCRYRHAKL GPPP +EE+LQKIQ L S++YG Sbjct: 121 DCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPLEEILQKIQHLGSYNYGPS 180 Query: 1823 NKFYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXX 1644 NKF+ RG Q+EK Q PQ P + G K S AES Sbjct: 181 NKFFTQRGVGLSQQNEKSQFPQVPALVTQGVTGKPSAAES-VNVQQQQGQQSAPQASQTP 239 Query: 1643 
XQTLPNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 1464 Q+L NG PNQ N+ + LPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF Sbjct: 240 VQSLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 299 Query: 1463 DSVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCEL 1284 DS +NVILIFSVNRTRHFQGCAKM S+IGG V GGNWKYAHGTPHYG+NFS+KWLKLCEL Sbjct: 300 DSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTPHYGQNFSLKWLKLCEL 359 Query: 1283 TFHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXX 1104 +F KTRHLRNPYNENLPVKISRDCQELEPS+GE+LASLLYLEPD ELMA+SV Sbjct: 360 SFQKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDGELMAVSVAAESKREE 419 Query: 1103 XXXKGVNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQT---XXXXXXXXXXXXXXX 933 KGVN + G+ENPDIVPF+DN S GQ+ Sbjct: 420 EKAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSAGLPPQGRGRGRGMMWPP 479 Query: 932 XXPLARGARPMPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAPYG--PRFS 768 P+ RGARP G++GFPP MMGPDG SYG PDGFP+PD+FG+ PR F PYG PRFS Sbjct: 480 HMPMGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMTPRGFGPYGPTPRFS 539 Query: 767 GDFSGP--GMMY--RPQQ---------------XXXXXXXXXXXXXXXXXXXXXXXXXXX 645 GDF GP MM+ RP Q Sbjct: 540 GDFMGPPTAMMFRGRPSQPAAMFPPSGFGMMMGQGRGPFMGGMGVAGANPARPGRPVGVS 599 Query: 644 XXXXXXXXXPQQNNNRMVKRDQRGPVSDR--------NERYXXXXXXXXXXXXXXXXXXX 489 QN NR +KRDQRG +DR Sbjct: 600 PLYPPPAVPSSQNMNRAIKRDQRGLTNDRYIVGMDQNKGVEIQSSGRDEEMQYKQGSKAY 659 Query: 488 XXXQFGSGSKFRNEESESEDEAPRRSRHGEGKKKRRSSERD 366 Q+G+G+ FRNEESESEDEAPRRSRHGEGKKKRR SE D Sbjct: 660 SDEQYGTGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGD 700 >gb|EXB51974.1| Cleavage and polyadenylation specificity factor CPSF30 [Morus notabilis] Length = 710 Score = 734 bits (1896), Expect = 0.0 Identities = 412/713 (57%), Positives = 452/713 (63%), Gaps = 50/713 (7%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETV-----PTNATATMPLIQSDSXXXXXXXXXXXXXVP-SAEPP 2172 MEDSEGVLSFDFEGGL+T P A A+ LI DS SA+P Sbjct: 1 MEDSEGVLSFDFEGGLDTTAGGCPPNAAAASAALIHPDSSAAAASNNLAASNSAVSADPT 60 Query: 2171 A-----MNNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECRE 2007 + +N 
RS+RQTVCRHWLRSLCMKG+ACGFLHQYDKSRMPVCRFFR+YGECRE Sbjct: 61 SGGGGGASNPGRGRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECRE 120 Query: 2006 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGN 1827 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL GPPP+VEEVLQKIQ L+S++Y + Sbjct: 121 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQHLSSYNY-H 179 Query: 1826 QNKFYQHRGPAPPYQ-SEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXX 1650 NKF+Q R Q EK +P GPN + G V K S ES Sbjct: 180 SNKFFQQRNAGGFAQLGEKPLLPLGPNAVSQGVVGKPSILESANVQQPQQQVQPSQQPVG 239 Query: 1649 XXXQ-TLPNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1473 + GLPNQAN+ V+PLP GISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN Sbjct: 240 QNQIQNVFTGLPNQANRTVAPLPPGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 299 Query: 1472 EAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKL 1293 EAFD ENVILIFSVNRTRHFQGCAKM S+IGG + GGNWKYAHGT HYGRNFSVKWLKL Sbjct: 300 EAFDCAENVILIFSVNRTRHFQGCAKMISRIGGSISGGNWKYAHGTAHYGRNFSVKWLKL 359 Query: 1292 CELTFHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXX 1113 CEL+FHKTRHLRNPYNENLPVKISRDCQELEPSIGE+LASLLYLEPDSELMAIS+ Sbjct: 360 CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESK 419 Query: 1112 XXXXXXKGVNSNNGAENPDIVPFDDN---XXXXXXXXXXXXXXSLGQTXXXXXXXXXXXX 942 KGV+ +NG ENPDIVPF+DN LG Sbjct: 420 REEEKAKGVDPDNGGENPDIVPFEDNEEDEEEESEDEEESFSQVLGAN--QGRGRGRGVM 477 Query: 941 XXXXXPLARGARPMPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAPYGPRF 771 PL+RGARPMP ++GFPPVM+G DG YG PDGFP+PDLF VGPR F PYGPRF Sbjct: 478 WPPHMPLSRGARPMPSMQGFPPVMIGADGSPYGPVTPDGFPMPDLFNVGPRAFNPYGPRF 537 Query: 770 SGDFSGP--GMMY--RPQQ--------------XXXXXXXXXXXXXXXXXXXXXXXXXXX 645 GDF GP GMM+ RP Q Sbjct: 538 PGDFMGPTSGMMFRGRPTQPGAVFPGGGFGMMMGPGRAPCMGGMGVQGTSPARPMRPGAM 597 Query: 644 XXXXXXXXXPQQNNNRMVKRDQRGPVSDRNERY-------------XXXXXXXXXXXXXX 504 P QN NR +RDQRG +DRNERY Sbjct: 598 PPMFQQPPPPSQNMNRPPRRDQRGLANDRNERYGAGSDQVRGQEMSGPAGGPEDDAHYQL 657 Query: 503 XXXXXXXXQFGSGSKFRNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 Q+G+G+ FRN+ESESEDEAPRRSRHG+GKKKRRSSE D T SDH 
Sbjct: 658 GAKARQEDQYGAGNSFRNDESESEDEAPRRSRHGDGKKKRRSSEEDAATGSDH 710 >ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Cicer arietinum] Length = 677 Score = 731 bits (1887), Expect = 0.0 Identities = 394/677 (58%), Positives = 431/677 (63%), Gaps = 27/677 (3%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQSDSXXXXXXXXXXXXXVPSAEPPAMNNVS 2154 MEDSEGVLSFDFEGGL+ P +A + P N+ Sbjct: 1 MEDSEGVLSFDFEGGLDAAPPSAATVSVPAPPSGPIVHPDSSLPPSISSNGAAPVSGNIP 60 Query: 2153 GRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDCVYKHTNED 1974 GRRS+RQTVCRHWLRSLCMKG+ACGFLHQYDK+RMPVCRFFR+YGECREQDCVYKHTNED Sbjct: 61 GRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHTNED 120 Query: 1973 IKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNKFYQHRGPA 1794 IKECNMYKLGFCPNGPDCRYRHAK GPPP +EEVLQKIQ L S+++ N +KF Q RG + Sbjct: 121 IKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSYNFNNSHKFIQQRGSS 180 Query: 1793 PPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQTLPNGLPN 1614 Q EK Q PQG N N G K AES Q L NG PN Sbjct: 181 YTQQVEKSQFPQGINSANQGVAGKPLAAESGNVQQQQQVQQSQQQVSQIQTQNLANGQPN 240 Query: 1613 QANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILIF 1434 QAN+ +PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDSVENVILIF Sbjct: 241 QANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVENVILIF 300 Query: 1433 SVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELTFHKTRHLRN 1254 SVNRTRHFQGCAKMTS+IGG V GGNWKYAHGT HYGRNFSVKWLKLCEL+FHKTRHLRN Sbjct: 301 SVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRN 360 Query: 1253 PYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXXXXKGVNSNN 1074 PYNENLPVKISRDCQELEPSIGE+LASLLYLEPDSELMAIS+ KGVN +N Sbjct: 361 PYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKAKGVNPDN 420 Query: 1073 GAENPDIVPFDDNXXXXXXXXXXXXXXSLGQT--XXXXXXXXXXXXXXXXXPLARGARPM 900 ENPDIVPF+DN + PL RGARPM Sbjct: 421 AGENPDIVPFEDNEEEEEEESDEEEESFVQAVVPVGQGRGRGRGMMWPPHMPLGRGARPM 480 Query: 899 PGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAPYGPRFSGDFSGP--GMMY- 738 
PG++GF PVMMG DG SYG PDGF +PDLFG+GPR F PYGPRFSGDF+GP MM+ Sbjct: 481 PGMQGFNPVMMG-DGLSYGPGAPDGFGMPDLFGMGPRGFGPYGPRFSGDFAGPPAAMMFR 539 Query: 737 -RPQQ-------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPQQNNN 600 RP Q P QN N Sbjct: 540 GRPSQPGMFPGGGFGMMMNPGRGPFMGGMGVPGPNPPRGGRPLNMPPMFPPPPPPPQNVN 599 Query: 599 RMVKRDQRGPVSDRNERY-----XXXXXXXXXXXXXXXXXXXXXXQFGSGSKFRNEESES 435 R+ KRDQR +DRN+RY + FRNE+SES Sbjct: 600 RIAKRDQR--TNDRNDRYSSGQEQGKSQDMLSQSGGPDDEMQYQQSGAPANNFRNEDSES 657 Query: 434 EDEAPRRSRHGEGKKKR 384 EDEAPRRSRHGEGKK++ Sbjct: 658 EDEAPRRSRHGEGKKRK 674 >ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Fragaria vesca subsp. vesca] Length = 689 Score = 731 bits (1887), Expect = 0.0 Identities = 403/697 (57%), Positives = 445/697 (63%), Gaps = 34/697 (4%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNAT-----ATMPLIQSDSXXXXXXXXXXXXXVPSAEPPA 2169 MED +GVL+FDFEGGL++ +A A+ IQSDS P+ +P Sbjct: 1 MEDPDGVLNFDFEGGLDSAAVSAPTHTGLASSAPIQSDSFASQPKNQAA----PAPQPDP 56 Query: 2168 MNNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDCVYK 1989 N SGR+S+RQTVCRHWLRSLCMKG+ACGFLHQYDKSRMPVCRFFRMYGECREQDCVYK Sbjct: 57 NVNPSGRKSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRMYGECREQDCVYK 116 Query: 1988 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNKFYQ 1809 HTNEDIKECNMYKLGFCPNGPDCRYRHAKL GPPP VEEVLQKIQ LNS++Y N NKF Q Sbjct: 117 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNNSNKFSQ 176 Query: 1808 HRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQTLP 1629 R P Q ++ Q Q N N VV+ S AES Q++P Sbjct: 177 PRNGGFPQQHDRSQPAQVTNSFNQ-VVVRPSAAESANVQQPQQFQQTQQPVAQTQAQSVP 235 Query: 1628 NGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEN 1449 NGL +QAN+ PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS EN Sbjct: 236 NGLASQANRAALPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSAEN 295 Query: 1448 VILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELTFHKT 1269 VILIFSVNRTRHFQGCAKM S+IGG V GGNWKYAHGT HYGRNFSVKWLKLCEL+FHKT Sbjct: 296 
VILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 355 Query: 1268 RHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXXXXKG 1089 RHLRNPYNENLPVKISRDCQELEPSIGE+LASLLYLEPDSELMAIS+ KG Sbjct: 356 RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKAKG 415 Query: 1088 VNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQTXXXXXXXXXXXXXXXXXPLARGA 909 VN NG ENPDIVPF+DN + RG Sbjct: 416 VNPENGGENPDIVPFEDNEEEEEEESDDEEDYQVPGGAIENRGRGRVMWPPHMPLGGRGG 475 Query: 908 RPMPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGV-GPRPFAPYGPRFSGDFSG--PG 747 RPMPG++GFP MMGPD YG PDGF +P+ FG+ GPR F PYGPRFSGDF G PG Sbjct: 476 RPMPGMQGFPG-MMGPDAMPYGPVTPDGFVMPNPFGMGGPRGFNPYGPRFSGDFGGPNPG 534 Query: 746 MMYR---PQ------------QXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPQ 612 MM+R PQ P Sbjct: 535 MMFRGRPPQPGGMFPPGPYGMMMGPGRGPFMGGMGVGGNNPARGGRPGGMPPMFPPHPPS 594 Query: 611 QNNNRMVKRDQRGPVSDRNERY--------XXXXXXXXXXXXXXXXXXXXXXQFGSGSKF 456 QNNNR+ KRD RG +DRNERY +G+G+ Sbjct: 595 QNNNRLQKRDPRGSGNDRNERYSAGSGHGKEMQAGGPDDENHYQHSSKSYQEDYGAGNNG 654 Query: 455 RNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 RN++SESEDEAPRRSRHGEGKKKRR SE D +S+H Sbjct: 655 RNDDSESEDEAPRRSRHGEGKKKRRDSEGD--ATSEH 689 >ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Glycine max] Length = 681 Score = 731 bits (1886), Expect = 0.0 Identities = 405/684 (59%), Positives = 440/684 (64%), Gaps = 33/684 (4%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATM--PLIQSDSXXXXXXXXXXXXXVPS---AEPPA 2169 MEDSEGVLSFDFEGGL+ P++A A PLI DS P+ +P Sbjct: 1 MEDSEGVLSFDFEGGLDAAPSSAAAAPSGPLIPHDSSAAASAVSNGGPAAPAPSAVDPVG 60 Query: 2168 MNNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDCVYK 1989 NV GRRS+RQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFR+YGECREQDCVYK Sbjct: 61 GGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 120 Query: 1988 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNKFYQ 1809 HTNEDIKECNMYKLGFCPNGPDCRYRHAK GPPP VEEVLQKIQ L S++Y + NKF+Q Sbjct: 121 HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQ 180 
Query: 1808 HRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQTLP 1629 RG + Q+EK +PQG N N G AE Q + Sbjct: 181 QRGASYNQQAEKPLLPQGNNSTNQGVTGNPLPAELGNAQPQQQVQQSQQQVNQSQMQNVA 240 Query: 1628 NGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEN 1449 NG PNQAN+ +PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDSVEN Sbjct: 241 NGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVEN 300 Query: 1448 VILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELTFHKT 1269 VILIFSVNRTRHFQGCAKMTSKIGG V GGNWKYAHGT HYGRNFSVKWLKLCEL+FHKT Sbjct: 301 VILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 360 Query: 1268 RHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXXXXKG 1089 RHLRNPYNENLPVKISRDCQELEPSIGE+LASLLYLEPDSELMAISV KG Sbjct: 361 RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKAKG 420 Query: 1088 VNSNNGAENPDIVPFDDN---XXXXXXXXXXXXXXSLGQTXXXXXXXXXXXXXXXXXPLA 918 VN +NG ENPDIVPF+DN +G PL Sbjct: 421 VNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPA-GQGRGRGRGMMWPPHMPLG 479 Query: 917 RGARPMPGVRGFPPVMMGPDGFSY---GPDGFPIPDLFGVGPRPFAPYGPRFSGDFSGP- 750 RGARPMPG++GF PVMMG DG SY GPDGF +PDLFGVGPR FAPYGPRFSGDF GP Sbjct: 480 RGARPMPGMQGFNPVMMG-DGLSYGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGDFGGPP 538 Query: 749 -GMMY--RPQQ-------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 618 MM+ RP Q Sbjct: 539 AAMMFRGRPSQPGMFPGGGFGMMLNPGRGPFMGGIGVGGANPPRGGRPVNMPPMFPPPPP 598 Query: 617 PQQNNNRMVKRDQRGPVSDRNERY-XXXXXXXXXXXXXXXXXXXXXXQFGSGSKFRN--- 450 QN NR KRDQR +DRN+R+ Q+ G K Sbjct: 599 LPQNANRAAKRDQR--TADRNDRFGSGSEQGKSQDMLSQSGGPDDDPQYQQGYKGNQDDH 656 Query: 449 -EESESEDEAPRRSRHGEGKKKRR 381 ++SESEDEAPRRSRHGEGKKK + Sbjct: 657 PDDSESEDEAPRRSRHGEGKKKHK 680 >gb|AHN05783.1| YTH domain-contained RNA binding protein 14 [Malus domestica] Length = 667 Score = 725 bits (1871), Expect = 0.0 Identities = 400/691 (57%), Positives = 443/691 (64%), Gaps = 28/691 (4%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLE---TVPTNA-------TATMPLIQSDSXXXXXXXXXXXXXVPS 2184 MEDS+G L+FDFEGGL+ TV +A T+ ++QSDS + Sbjct: 1 
MEDSDGGLNFDFEGGLDAPATVSASAGPANTVPTSNYSVMQSDSAVTGLGANQAAA---A 57 Query: 2183 AEPPAMNNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQ 2004 +P N +G RSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR+YGECREQ Sbjct: 58 PQPNQNANRTGGRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 117 Query: 2003 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQ 1824 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL GPPP VEEVLQKIQ L S++Y N Sbjct: 118 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLTSYNYNNS 177 Query: 1823 NKFYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAE--SPXXXXXXXXXXXXXXXXX 1650 +KFYQ R P Q +K Q QGPN V K +TAE + Sbjct: 178 SKFYQQRNAGFPQQGDKHQPAQGPN----NFVGKPTTAEPGNVQQQQQQQLQQTQQHVGP 233 Query: 1649 XXXQTLPNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 1470 QTLPNGL NQAN+ PLPQG SRYFIVKSCNRENLELSVQQG+WATQRSNE+KLNE Sbjct: 234 TQTQTLPNGLANQANRSALPLPQGTSRYFIVKSCNRENLELSVQQGLWATQRSNESKLNE 293 Query: 1469 AFDSVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLC 1290 AFDS ENVILIFSVNRTRHFQGCAKM S+IGG VGGGNWKYAHGT HYGRNFSVKWLKLC Sbjct: 294 AFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLC 353 Query: 1289 ELTFHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXX 1110 EL+FHKTRHLRNPYNENLPVKISRDCQELE S+GE+LASLLYLEPDSELMAIS+ Sbjct: 354 ELSFHKTRHLRNPYNENLPVKISRDCQELELSVGEQLASLLYLEPDSELMAISIAAESKR 413 Query: 1109 XXXXXKGVNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQ-----TXXXXXXXXXXX 945 KGVN NG ENPDIVPF+DN S GQ Sbjct: 414 EEEKAKGVNPENGGENPDIVPFEDN-EEEEEEESEDEEDSFGQVPGAGNDGRGRGRGGGV 472 Query: 944 XXXXXXPLARGARPMPGVRGFPPVMMGPDGFSYGPDGFPIPDLFGVGPRPFAPYGPRFSG 765 L RG RPMPG++GFPP MMG D Y PDGF +P+ FG+ PR F PYGPRFSG Sbjct: 473 MWPPHMALPRGGRPMPGMQGFPPGMMGHDAMPYVPDGFVMPNPFGMAPRGFNPYGPRFSG 532 Query: 764 DFSG--PGMMY--RPQQ-------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 618 DF+G PGMM+ RPQQ Sbjct: 533 DFTGPNPGMMFRGRPQQPGFPPGGFGIMGPGRAPFMGGIHPGRGGRPTGMSPMFPPPPPP 592 Query: 617 PQQNNNRMVKRDQRGPVSDRNERYXXXXXXXXXXXXXXXXXXXXXXQFGSGSKFRNEESE 438 QN NRM KRD RG +DR + +G+G+ RN++SE Sbjct: 593 
SSQNPNRMPKRDPRGASTDRKGQ--------------DMSGPDDETHYGAGNSSRNDDSE 638 Query: 437 SEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 SEDEAPRRSRHG+GKKKRR SE D +S+H Sbjct: 639 SEDEAPRRSRHGDGKKKRRDSEGD--ATSEH 667 >ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] gi|557551536|gb|ESR62165.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] Length = 672 Score = 714 bits (1843), Expect = 0.0 Identities = 401/705 (56%), Positives = 439/705 (62%), Gaps = 42/705 (5%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQSDSXXXXXXXXXXXXXV--------PSAE 2178 MEDSEG LSFDFEGGL+ P TA+ P IQSDS P Sbjct: 1 MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60 Query: 2177 PPAMNNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDC 1998 + + SGRRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR++GECREQDC Sbjct: 61 SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120 Query: 1997 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNK 1818 VYKHTNEDIKECNMYKLGFCPNGPDCRYRH KL GPPP+VEEVLQKIQQ++S+++GN NK Sbjct: 121 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180 Query: 1817 FYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQ 1638 +Q RG A +Q +K Q QGPN N GA KSSTAES Sbjct: 181 LFQQRG-AFSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQM 239 Query: 1637 -TLPNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 1461 LPNGLPNQ N+ +PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD Sbjct: 240 QNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 299 Query: 1460 SVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELT 1281 S ENVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKYAHGT HYGRNFSVKWLKLCEL+ Sbjct: 300 SAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS 359 Query: 1280 FHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXX 1101 FHKTRHLRNPYNENLPVK AISV Sbjct: 360 FHKTRHLRNPYNENLPVK-----------------------------AISVAAEAKREEE 390 Query: 1100 XXKGVNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQTXXXXXXXXXXXXXXXXXPL 921 KGVN +NG +NPDIVPF+DN SLG T 
PL Sbjct: 391 KAKGVNPDNGGDNPDIVPFEDN-EEEEEEESEEEEESLG-TASQGRGRGRGMMWPGPMPL 448 Query: 920 ARGARPMPGVRGFPPVMMGPDGFSYG--PDGFPIPDLFGVGPRPFAPYGPRFSGDFSGP- 750 ARGARP+PG+RGFPP+M+G DGFSYG PDGFP+PDLFGV PRPFAPYGPRFSGDF+GP Sbjct: 449 ARGARPVPGMRGFPPMMIGADGFSYGVTPDGFPMPDLFGVAPRPFAPYGPRFSGDFTGPG 508 Query: 749 GMMY--RPQQ--------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 618 GMM+ RP Q Sbjct: 509 GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPFPNQPQ 568 Query: 617 PQQNNNRMVKRDQRGPVSDRNERY--------------XXXXXXXXXXXXXXXXXXXXXX 480 QN++R+ KRD RG ++DRN+RY Sbjct: 569 SSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSKANQED 628 Query: 479 QFGSGSKFRNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 Q+GS FRN+ESESEDEAPRRSRHGEGKKKRR SE D SSD+ Sbjct: 629 QYGS-RNFRNDESESEDEAPRRSRHGEGKKKRRDSEGDAAASSDN 672 >ref|XP_002300333.2| zinc finger family protein [Populus trichocarpa] gi|550349048|gb|EEE85138.2| zinc finger family protein [Populus trichocarpa] Length = 669 Score = 712 bits (1837), Expect = 0.0 Identities = 395/701 (56%), Positives = 439/701 (62%), Gaps = 38/701 (5%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQSDSXXXXXXXXXXXXXVPS-----AEPPA 2169 MEDSEGVLSFDFEGGL++ P N A++P I SD+ + + A Sbjct: 1 MEDSEGVLSFDFEGGLDSGPANPIASIPAIPSDNYGAATAAAPNTTNTTTNTTNNSNSGA 60 Query: 2168 MNNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDCVYK 1989 + +GRRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR+YGECREQDCVYK Sbjct: 61 ADIQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 120 Query: 1988 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNKFYQ 1809 HTNEDIKECNMYKLGFCPNGPDCRYRHAKL GPPP VEEV+QKIQQLNS++ NK +Q Sbjct: 121 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVVQKIQQLNSYNGVTSNKNFQ 180 Query: 1808 HRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQTLP 1629 R Q EK +P ++K S ES Q Sbjct: 181 QRNAGFSQQIEK----------SPNTIIKPSGTESANVQQQQQQQQQTQTPHLTNGQHQQ 230 Query: 1628 NGLPNQANKIVSPLPQGISR-----------YFIVKSCNRENLELSVQQGVWATQRSNEA 1482 PN N+I +PLPQGIS YFIVKSCNRENLELSVQQGVWATQRSNE Sbjct: 231 
PQQPNPLNRIATPLPQGISSFFSCVSPSQFVYFIVKSCNRENLELSVQQGVWATQRSNEI 290 Query: 1481 KLNEAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKW 1302 KLNEA DS +NVILIFSVNRTRHFQGCAKM SKIG VGGGNWKYAHGT HYGRNFSVKW Sbjct: 291 KLNEALDSADNVILIFSVNRTRHFQGCAKMASKIGASVGGGNWKYAHGTAHYGRNFSVKW 350 Query: 1301 LKLCELTFHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXX 1122 LKLCEL+FHKTRHLRNP+NENLPVKISRDCQELEPSIGE+LASLLYLEPDSELMA+S+ Sbjct: 351 LKLCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSLAA 410 Query: 1121 XXXXXXXXXKGVNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQ---TXXXXXXXXX 951 KGVN ++G ENPDIVPF+DN S GQ Sbjct: 411 EAKREEEKEKGVNPDSGGENPDIVPFEDN-EEEEEEESEEEEESFGQPLGPAAQGRGRGR 469 Query: 950 XXXXXXXXPLARGARPMPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAPYG 780 P+ARGARP+PG+RGFPP+MMG DGFSYG PD F +PDLFGV R F PYG Sbjct: 470 GMMWPSHNPMARGARPIPGIRGFPPMMMGADGFSYGAVTPDSFGMPDLFGVASRGFPPYG 529 Query: 779 PRFSGDFSG--PGMMY--RPQQ------------XXXXXXXXXXXXXXXXXXXXXXXXXX 648 PRFSGDF+G GMM+ RP Q Sbjct: 530 PRFSGDFTGAASGMMFPGRPSQPGAVFPAGGFGMMMGPGRPPFIGGMGPTPSNLLRGPRP 589 Query: 647 XXXXXXXXXXPQQNNNRMVKRDQRGPVSDRNERYXXXXXXXXXXXXXXXXXXXXXXQFGS 468 QNN+R VKRDQR +DRN+R+ QFG+ Sbjct: 590 GGMFAPFPAPSSQNNSRSVKRDQRAAANDRNDRH---------------------NQFGA 628 Query: 467 GSKFRNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 + RN+ESESEDEAPRRSRHGEGKKKRR S D S+H Sbjct: 629 VNSIRNDESESEDEAPRRSRHGEGKKKRRGSGDDATPGSEH 669 >gb|EYU43238.1| hypothetical protein MIMGU_mgv1a002387mg [Mimulus guttatus] Length = 681 Score = 711 bits (1835), Expect = 0.0 Identities = 394/690 (57%), Positives = 447/690 (64%), Gaps = 34/690 (4%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQS--DSXXXXXXXXXXXXXVPSAEP-PA-- 2169 M+D EG LSFDFEGGL+ P++ TA++P+IQS ++ PSA P PA Sbjct: 1 MDDGEGGLSFDFEGGLDIGPSHPTASVPVIQSSANANTASAAAAAANPYNPSAAPVPATQ 60 Query: 2168 ----MNNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQD 2001 MNN GRRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR+YGECREQD Sbjct: 61 AAEGMNN-GGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQD 119 Query: 
2000 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQN 1821 CVYKHTNED+KECNMYKLGFCPNGPDCRYRHAKL GPPP+VEEVLQKIQQL S++YG N Sbjct: 120 CVYKHTNEDVKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQQLTSYNYGKSN 179 Query: 1820 KFYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXX 1641 F+Q+R Q+EK Q PQGPN V K++ AE Sbjct: 180 NFFQNRNSNFAQQTEKPQFPQGPN--GTHQVGKTNAAEP--GNLNQPAQQSQQPGSQGQL 235 Query: 1640 QTLPNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 1461 Q++PN NQA++ +PLPQG SRYF+VKSCNRENLELSVQQGVWATQRSNEAKLNEAF+ Sbjct: 236 QSIPNDQQNQASRNATPLPQGASRYFVVKSCNRENLELSVQQGVWATQRSNEAKLNEAFE 295 Query: 1460 SVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELT 1281 SVEN+ILIFSVN+TRHFQGCAKMTS+IGG VGGGNWK+AHGT HYGRNF++KWLKLCELT Sbjct: 296 SVENIILIFSVNKTRHFQGCAKMTSRIGGSVGGGNWKHAHGTAHYGRNFALKWLKLCELT 355 Query: 1280 FHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXX 1101 F KTRHLRNPYNENLPVKISRDCQELEPSIGE+LASLLYLEPDS+LMAI++ Sbjct: 356 FDKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSDLMAIAIAAELKREEE 415 Query: 1100 XXKGVNSNNGAENPDIVPFDDN----XXXXXXXXXXXXXXSLGQT--XXXXXXXXXXXXX 939 KGVN +NGAENPDIVPF+DN GQ Sbjct: 416 KAKGVNIDNGAENPDIVPFEDNEEEEEEEEEEEESEDEDEFPGQAFGAQGRGVGRGMMWG 475 Query: 938 XXXXPLARGARPMPGVRGFPPVMMGPDGFSYG------PDGFPIPDLFGVGPRPFAPYGP 777 PL RG RP PGVRGFPP MMG DGF YG DGFP+ D FG+ PR F +GP Sbjct: 476 PHMPPLGRGPRPFPGVRGFPPNMMGGDGFPYGHGPPLNHDGFPMHDPFGMVPRGFGQFGP 535 Query: 776 RFSGDFSGPG---MMY--RPQ----QXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 624 RF GDF+GP MM+ RP Sbjct: 536 RFGGDFAGPASGPMMFAGRPPGGFGPMMGQGRGPFMGGGRGGRPVGMPPPFFPPPPPPVA 595 Query: 623 XXPQQNNNRMVKRDQRGPVSDRNERYXXXXXXXXXXXXXXXXXXXXXXQFGSGSK----F 456 P N+ VKRDQ+ P SDRN+ G+ +K + Sbjct: 596 AQPPPQNSNWVKRDQKAPYSDRND---------VSDQGKGQEIVSGSSNRGNAAKREESY 646 Query: 455 RNEESESEDEAPRRSRHGEGKKKRRSSERD 366 RN+ESESEDEAPRRSRHGEGKKKRR SE + Sbjct: 647 RNDESESEDEAPRRSRHGEGKKKRRGSEAE 676 >ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Solanum tuberosum] Length = 692 
Score = 690 bits (1780), Expect = 0.0 Identities = 382/704 (54%), Positives = 430/704 (61%), Gaps = 41/704 (5%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQSDSXXXXXXXXXXXXXVPSAE--PPAMNN 2160 M++ EG L+FDFEGGL+T PT+ TA++P+IQS PSA PP ++ Sbjct: 1 MDEGEGGLNFDFEGGLDTGPTHPTASVPVIQS--------FDHTAAAAPSANINPPTVSA 52 Query: 2159 VSG----------RRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECR 2010 G RRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR+YGECR Sbjct: 53 AVGGQSDVGFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECR 112 Query: 2009 EQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYG 1830 EQDCVYKHT EDIKECNMYKLGFCPNGPDCRYRHAK+ GPPP VEE+LQKIQ L S++YG Sbjct: 113 EQDCVYKHTIEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASYNYG 172 Query: 1829 NQNKFYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXX 1650 N+F Q+R QS+K Q Q N VKS+ E+P Sbjct: 173 YSNRFNQNRNANYSTQSDKSQASQAQN--GMSLAVKSTATETPIIQQHQPNQQVQPPQLQ 230 Query: 1649 XXXQTL---PNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK 1479 PNG NQA++ LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAK Sbjct: 231 GGPTQAQIHPNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAK 290 Query: 1478 LNEAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWL 1299 LNEAFDSVENVILIFSVNRTRHFQGC KMTS+IGG GGNWK+ HGT HYGRNFSVKWL Sbjct: 291 LNEAFDSVENVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSVKWL 350 Query: 1298 KLCELTFHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXX 1119 KLCEL+F KT HLRNPYNENLPVKISRDCQELEPS+GE+LASLLYLEPDSELMAIS+ Sbjct: 351 KLCELSFQKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAE 410 Query: 1118 XXXXXXXXKGVNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQ------TXXXXXXX 957 KGVN +NG +NPDIVPF+DN Sbjct: 411 SKRQEEKAKGVNPDNGKDNPDIVPFEDNEEEEEEEEEEESEDEDESFDQGFGPAALGRGR 470 Query: 956 XXXXXXXXXXPLARGARPMPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAP 786 P G RP PG+RGFPP MMG DGFSYG P+GFP+PD FG+GPRPF P Sbjct: 471 GRGIAWPPIMPFGHGPRPPPGMRGFPPGMMG-DGFSYGAMTPEGFPMPDHFGMGPRPFGP 529 Query: 785 YGPRFSGDF--------SGPGMMYRPQQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 630 YGP FS 
D G GMM P + Sbjct: 530 YGPPFSSDLMFHGRPPAGGFGMMMGPGRPPFMGGMGPGATGPPRAGRAVGMHPSFVPPSS 589 Query: 629 XXXXPQQNNNRMVKRDQRGPVSDRNERY---------XXXXXXXXXXXXXXXXXXXXXXQ 477 KR+QR PVSDRN+R+ Q Sbjct: 590 QPSQYPYK----AKREQRAPVSDRNDRFSSDQGKGQEMMGSVGGPDGVHMQIGKSEHDNQ 645 Query: 476 FGSGSKFRNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 FG+G+ +NEESESEDEAPRRSRHG+GKKKRR + D T S++ Sbjct: 646 FGAGNSQKNEESESEDEAPRRSRHGDGKKKRRDVDEDAATGSEN 689 >ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Solanum tuberosum] Length = 677 Score = 686 bits (1769), Expect = 0.0 Identities = 378/689 (54%), Positives = 434/689 (62%), Gaps = 27/689 (3%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQSDSXXXXXXXXXXXXXVPSAEPPAM---- 2166 M+D EG L+FDFEGGL+T PT+ TA++P++QS + PP Sbjct: 1 MDDGEGGLNFDFEGGLDTGPTHPTASVPVLQSAGHITTGPAPNASV---ALVPPGGGVGQ 57 Query: 2165 ----NNVSGRRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDC 1998 + V RRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR+YGECREQDC Sbjct: 58 GGDGSFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 117 Query: 1997 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQNK 1818 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL GPPP V EVLQ+IQ L S YG N+ Sbjct: 118 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVVEVLQRIQNLTS--YGYSNR 175 Query: 1817 FYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXXQ 1638 F+Q+R Q++K Q+PQ PN+ N VKS+ AE P Sbjct: 176 FFQNRNTNYSTQADKSQIPQVPNVMNQA--VKSTAAEPPIGQPHQPHQQQVQQPQHQGAP 233 Query: 1637 TLPNGLPN-QANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 1461 T LP+ Q N+ PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD Sbjct: 234 TQTQTLPSSQQNQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 293 Query: 1460 SVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLCELT 1281 SVENVIL+FS+NRTRHFQG AKMTS+IGG GGNWK+ HGT HYGRNFS+KWLKLCEL+ Sbjct: 294 SVENVILVFSINRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSLKWLKLCELS 353 Query: 1280 FHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXXXXX 1101 F KTRHLRNPYNENLPVKISRDCQELE 
S+GE+LASLLY+EPDSELMA+S+ Sbjct: 354 FQKTRHLRNPYNENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAVSLAAESKREEE 413 Query: 1100 XXKGVNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXS-LGQTXXXXXXXXXXXXXXXXXP 924 KGVN +NG ENPDIVPF+DN GQ P Sbjct: 414 RAKGVNPDNGNENPDIVPFEDNEEEEEEESEEEEEDEGFGQAFGPAALGRGRGRGIVWPP 473 Query: 923 LA---RGARPMPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAPYGPRFSGD 762 L RGARP PG+RGFPP MM DGFSYG PDGFP+PD +G+G RPF P+GPRF GD Sbjct: 474 LVPFGRGARPFPGMRGFPPGMMS-DGFSYGSMTPDGFPMPDPYGMGGRPFGPFGPRFPGD 532 Query: 761 F---------SGPGMMYRPQQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPQQ 609 G GMM P + P Sbjct: 533 MMFHSRPPAAGGFGMMMGPGR-----PPFMGGMGPGAPGPPRGGRPMGIHPSFIPPTPPP 587 Query: 608 NNNRMVKRDQRGPVSDRNERYXXXXXXXXXXXXXXXXXXXXXXQF--GSGSKFRNEESES 435 + N VK+DQR P ++RN+R+ + + FRN+ESES Sbjct: 588 SQNPRVKKDQRAPFNERNDRFSSGPDQGRGQEIAGSVGGPAEGVHYPQTENSFRNDESES 647 Query: 434 EDEAPRRSRHGEGKKKRRSSERDDPTSSD 348 EDEAPRRSRHG+GKKK+ S + D T ++ Sbjct: 648 EDEAPRRSRHGDGKKKKNSMDGDATTGTE 676 >ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Solanum lycopersicum] Length = 689 Score = 683 bits (1763), Expect = 0.0 Identities = 376/699 (53%), Positives = 426/699 (60%), Gaps = 36/699 (5%) Frame = -3 Query: 2333 MEDSEGVLSFDFEGGLETVPTNATATMPLIQSDSXXXXXXXXXXXXXVPSAEPPAMNNVS 2154 M++ EG L+FDFEGGL+T PT+ TA++P+IQS + PP + V Sbjct: 1 MDEGEGGLNFDFEGGLDTGPTHPTASVPVIQS------FDHTAAAASSANINPPTVPAVG 54 Query: 2153 G---------RRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQD 2001 G RRS+RQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR+YGECREQD Sbjct: 55 GQGDVGFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQD 114 Query: 2000 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLSGPPPAVEEVLQKIQQLNSFSYGNQN 1821 CVYKHT EDIKECNMYKLGFCPNGPDCRYRHAK+ GPPP VEE+LQKIQ L S +YG N Sbjct: 115 CVYKHTIEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASNNYGYSN 174 Query: 1820 KFYQHRGPAPPYQSEKFQVPQGPNIGNPGAVVKSSTAESPXXXXXXXXXXXXXXXXXXXX 1641 +F Q+R Q++K Q Q N VKS+ E+P Sbjct: 175 RFNQNRNANYSTQTDKSQASQAQN--GTSLAVKSTATETPIIQQHQPHQQVQPPQLQGGP 232 
Query: 1640 QTL---PNGLPNQANKIVSPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 1470 PNG NQA++ LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE Sbjct: 233 TQAQIHPNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 292 Query: 1469 AFDSVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTPHYGRNFSVKWLKLC 1290 AFDSVENVILIFSVNRTRHFQGC KMTS+IGG GGNWK+ HGT HYGRNFS+KWLKLC Sbjct: 293 AFDSVENVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSLKWLKLC 352 Query: 1289 ELTFHKTRHLRNPYNENLPVKISRDCQELEPSIGEELASLLYLEPDSELMAISVXXXXXX 1110 EL+F KT HLRNPYNENLPVKISRDCQELEPS+GE+LASLLYLEPDSELMAIS+ Sbjct: 353 ELSFQKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKR 412 Query: 1109 XXXXXKGVNSNNGAENPDIVPFDDNXXXXXXXXXXXXXXSLGQ----TXXXXXXXXXXXX 942 KGVN +NG +NPDIVPF+DN Sbjct: 413 LEEKAKGVNPDNGKDNPDIVPFEDNEEEEDEEEESEDEDENFDQGFGPAALGRGRGRGIA 472 Query: 941 XXXXXPLARGARPMPGVRGFPPVMMGPDGFSYG---PDGFPIPDLFGVGPRPFAPYGPRF 771 P G RP PG+RGFPP MMG DGFSYG P+GFP+ D FG+GPRPF PYGPRF Sbjct: 473 WPPIMPFGHGPRPPPGMRGFPPGMMG-DGFSYGAMTPEGFPMTDHFGMGPRPFPPYGPRF 531 Query: 770 SGDF--------SGPGMMYRPQQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXP 615 S D G GMM P + Sbjct: 532 SSDLMFHGRPPAGGFGMMIGPGRPPFVGGMGPGATGPPRAGRAVRMHPSFIPPSSQPSQY 591 Query: 614 QQNNNRMVKRDQRGPVSDRNERY---------XXXXXXXXXXXXXXXXXXXXXXQFGSGS 462 KR+QR PVSDRN+R+ QFG+G+ Sbjct: 592 PYR----AKREQRAPVSDRNDRFSSDQGKGQEMMGSVNGPDGVHMQIGKSEHDNQFGAGN 647 Query: 461 KFRNEESESEDEAPRRSRHGEGKKKRRSSERDDPTSSDH 345 +N+ SESEDEAPRRSRHG+GKKKRR + D T S++ Sbjct: 648 SLKNDGSESEDEAPRRSRHGDGKKKRRDVDEDAATGSEN 686