BLASTX nr result
ID: Akebia22_contig00011706
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00011706 (2625 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation spec...   828   0.0
ref|XP_007041140.1| Cleavage and polyadenylation specificity fac...   815   0.0
ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm...   796   0.0
ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr...   788   0.0
ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation spec...   788   0.0
ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation spec...   787   0.0
ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phas...   786   0.0
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   780   0.0
gb|EXB51974.1| Cleavage and polyadenylation specificity factor C...   776   0.0
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   771   0.0
ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation spec...   761   0.0
ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prun...   740   0.0
ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation spec...   731   0.0
ref|XP_002300333.2| zinc finger family protein [Populus trichoca...   728   0.0
gb|EYU43238.1| hypothetical protein MIMGU_mgv1a002387mg [Mimulus...   726   0.0
gb|AHN05783.1| YTH domain-contained RNA binding protein 14 [Malu...   724   0.0
ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citr...   721   0.0
ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation spec...   711   0.0
ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation spec...
709 0.0 ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation spec... 698 0.0 >ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Vitis vinifera] Length = 673 Score = 828 bits (2140), Expect = 0.0 Identities = 443/709 (62%), Positives = 484/709 (68%), Gaps = 14/709 (1%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXXXSEPVAGNN 2193 +D EGVLSFDFEGGLD AP + + A PLI +D++ + EP G Sbjct: 2 EDAEGVLSFDFEGGLDAAPGTAATVA-PLIQSDATAAAAAPSSVVSA------EPTPGGA 54 Query: 2192 IARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHSNE 2013 RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RLYGECREQDCVYKH+NE Sbjct: 55 PGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNE 114 Query: 2012 DIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQRNA 1833 DIKECNMYKLGFCPNG DCRYRH KLPGPPP EEV QKIQ S+FNYGSSNRF+Q RN Sbjct: 115 DIKECNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRFYQNRNP 174 Query: 1832 SYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQNLPN 1653 Y QTE+SQ QGSN VN K STT QNLPN Sbjct: 175 -YNQQTEKSQILQGSNAVNLGTVAKSSTTE----AINVQQQQVQPPQQQVSQTPMQNLPN 229 Query: 1652 SLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSIDNV 1473 LP +ANKTA+PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS++NV Sbjct: 230 GLPNQANKTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENV 289 Query: 1472 ILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 1293 ILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR Sbjct: 290 ILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 349 Query: 1292 HLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKGV 1113 HLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAIS+ KGV Sbjct: 350 HLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKREEEKAKGV 409 Query: 1112 NLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLS-AAQXXXXXXGMMWAPHMPLAR 936 N D+ ENPDIVPF SF Q L AAQ G+MW PHMPLAR Sbjct: 410 NPDNGGENPDIVPF-EDNEEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPPHMPLAR 468 Query: 935 
GARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDLSGLGQ 756 GARP+P +RGFPPVMMG DGF+Y A+ PDGF MPD+FG+ PRAF PYGPRFSGD Sbjct: 469 GARPIPSMRGFPPVMMGADGFSYSAVPPDGFAMPDIFGVGPRAFPPYGPRFSGDF----- 523 Query: 755 SSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXMSVAAP 576 TGP SGMMF GR QPG VF + AAP Sbjct: 524 -----------TGPASGMMFPGR-GQPGAVF--PASGYGMMMGPGRAPFMGGMGVPAAAP 569 Query: 575 VRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLD-------------MGQEMAGPG 435 RA P +QNN K+DQR P++ GQ+MAGP Sbjct: 570 TRAGRPVGMPPMFPPPPPPNSQNNR---TKRDQRTPVNDRNDRYSGGSDQGRGQDMAGPD 626 Query: 434 MLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRK 288 D+ +Y G+K Q +D FGG NSFRNDESESEDEAPRRSRHGEGKK++ Sbjct: 627 --DETQYLQGLKSQQDDQFGGGNSFRNDESESEDEAPRRSRHGEGKKKR 673 >ref|XP_007041140.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao] gi|508705075|gb|EOX96971.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao] Length = 698 Score = 815 bits (2105), Expect = 0.0 Identities = 442/724 (61%), Positives = 490/724 (67%), Gaps = 20/724 (2%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVI----SNTXXXXXXXXXXXXSEPV 2205 DD EG LSFDFEGGLD A P+ P+A++P++ +D S SN ++P Sbjct: 2 DDSEGGLSFDFEGGLD-AGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTNDPA 60 Query: 2204 A---GNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDC 2034 A G RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RL+GECREQDC Sbjct: 61 AAVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120 Query: 2033 VYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNR 1854 VYKH+NEDIKECNMYKLGFCPNG DCRYRH KLPGPPPP EEV+QKIQ S++NY N+ Sbjct: 121 VYKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNY---NK 177 Query: 1853 FFQQRNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXI 1674 FFQQRN+ + QTE+SQ PQG N VNQ K STT Sbjct: 178 FFQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTTESANMHPQQQVQQPQQQVSQT--- 234 Query: 1673 ETQNLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1494 + QN+PN +ANKTA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA Sbjct: 
235 QIQNVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 294 Query: 1493 FDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCE 1314 FDS +NVILIFSVNRTRHFQGCAKMTSKIGG V GGNWKYAHGTAHYGRNFSVKWLKLCE Sbjct: 295 FDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCE 354 Query: 1313 LSFHKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXX 1134 LSFHKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAISV Sbjct: 355 LSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAELKRE 414 Query: 1133 XXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAP 954 KGVN D+ ENPDIVPF SFS +AAQ G+MW P Sbjct: 415 EEKAKGVNSDNGGENPDIVPF-EDNEEEEEEESEEEDESFS---AAAQGRGRGRGVMWPP 470 Query: 953 HMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGD 774 HMPLARGARPMPG+RGFPP+MMGGDGF+YG +TPDGF +PDLFG APR F PYGPRFSGD Sbjct: 471 HMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVPDLFG-APRPFPPYGPRFSGD 529 Query: 773 LSGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXX 594 TGP SGMMF GRP QPG +F Sbjct: 530 F----------------TGPASGMMFPGRPPQPGAMF--PAGGLGMMMGPGRAPFMGGMG 571 Query: 593 MSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLD----------MGQEMA 444 + A PVR P +Q N+ R VK+DQR P + GQEMA Sbjct: 572 PTGANPVRGGRPVSMPPMFPPPPAPSSQ-NSGRAVKRDQRTPTNDRYGAGSEQGRGQEMA 630 Query: 443 GPG--MLDDGKY-QSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRKDSEVD 273 GPG + D+ +Y Q G K ED F NSFRNDESESEDEAPRRSR+GEGKK++ S Sbjct: 631 GPGGRLDDETQYQQEGQKAHHEDQFAAGNSFRNDESESEDEAPRRSRYGEGKKKRRSLEG 690 Query: 272 EQQN 261 + N Sbjct: 691 DDAN 694 >ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis] gi|223537608|gb|EEF39232.1| conserved hypothetical protein [Ricinus communis] Length = 702 Score = 796 bits (2056), Expect = 0.0 Identities = 426/721 (59%), Positives = 479/721 (66%), Gaps = 21/721 (2%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXXXSEPV---- 2205 DD +G LSFDFEGGLD++ P+NP+A++P I +D++ ++P Sbjct: 2 DDTDGGLSFDFEGGLDSSGPTNPTASIPAIPSDNTAAVAAATNNSIVPNVSSNDPASAAA 61 Query: 2204 
--AGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCV 2031 A N RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RLYGECREQDCV Sbjct: 62 AAANNQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCV 121 Query: 2030 YKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRF 1851 YKH+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EEV+QKIQ +++NYGSSN+F Sbjct: 122 YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGSSNKF 181 Query: 1850 FQQRNASYTHQTERSQFPQGSNIVNQVVAVK----QSTTADXXXXXXXXXXXXXXXXXXX 1683 FQQR A + ++SQF QG N + Q +A K +S Sbjct: 182 FQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGTESANVQQPQQQQPQPGQGQQSQQQA 241 Query: 1682 XXIETQNLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1503 TQNLPN P +AN+TA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKL Sbjct: 242 TQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 301 Query: 1502 NEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLK 1323 NEAFDS +NVILIFSVNRTRHFQGCAKMTSKIG VGGGNWKYAHGTAHYGRNFSVKWLK Sbjct: 302 NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSVKWLK 361 Query: 1322 LCELSFHKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXX 1143 LCELSFHKTRHLRNPYNENLPVKISRDCQELEP VG QLA LLY EPDSELMAIS+ Sbjct: 362 LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAISLAAEA 421 Query: 1142 XXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSA-AQXXXXXXGM 966 KGVN ++ +NPDIVPF SF Q L A Q G+ Sbjct: 422 KREEEKAKGVNPENGGDNPDIVPF-EDNEEEEEEESEEEEESFGQALGAPGQGRGRGRGI 480 Query: 965 MWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPR 786 +W PHMPLARGARP+PG+RGFPP+MMG D F+YG +TPDGF MPDLFG+APR F PY PR Sbjct: 481 IW-PHMPLARGARPIPGMRGFPPMMMGADSFSYGPVTPDGFGMPDLFGVAPRGFTPYAPR 539 Query: 785 FSGDLSGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXX 606 FSGD TG SGMMF GRP QPG VF Sbjct: 540 FSGDF----------------TGAASGMMFPGRPPQPGGVF--PNGGFGMMMGPGRAPFM 581 Query: 605 XXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPL--------DMGQE 450 + P+R + PL + R VK+DQR D G+ Sbjct: 582 GGMGPNSTNPLRGN------WPGGMPFPPLPTPSPQRPVKRDQRMTANDRYSTGSDQGRN 635 Query: 449 
MAGPGMLDDGKY-QSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKRKDSEV 276 AG D+ +Y Q G+K ED FG NSFRNDESESEDEAPRRSRHGEG KKR+ SE Sbjct: 636 TAGEPD-DEARYQQEGLKASHEDQFGAGNSFRNDESESEDEAPRRSRHGEGKKKRRGSEG 694 Query: 275 D 273 D Sbjct: 695 D 695 >ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] gi|557551535|gb|ESR62164.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] Length = 701 Score = 788 bits (2036), Expect = 0.0 Identities = 428/724 (59%), Positives = 480/724 (66%), Gaps = 24/724 (3%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVIS-------NTXXXXXXXXXXXXS 2214 +D EG LSFDFEGGLD A P P+A+ P I +DS+ + N + Sbjct: 2 EDSEGGLSFDFEGGLD-AGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60 Query: 2213 EPVAGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDC 2034 ++ RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RL+GECREQDC Sbjct: 61 SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120 Query: 2033 VYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNR 1854 VYKH+NEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPP EEV+QKIQ S++N+G+ N+ Sbjct: 121 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180 Query: 1853 FFQQRNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXI 1674 FQQR A ++HQ ++SQF QG N VNQ A K ST Sbjct: 181 LFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTT-- 237 Query: 1673 ETQNLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1494 + QNLPN LP + N+ ATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA Sbjct: 238 QMQNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 297 Query: 1493 FDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCE 1314 FDS +NVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKYAHGTAHYGRNFSVKWLKLCE Sbjct: 298 FDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCE 357 Query: 1313 LSFHKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXX 1134 LSFHKTRHLRNPYNENLPVKISRDCQELEP +GEQLA+LLYLEPDSELMAISV Sbjct: 358 LSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKRE 417 Query: 1133 
XXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAP 954 KGVN D+ +NPDIVPF +A+Q GMMW Sbjct: 418 EEKAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEE----ESLGTASQGRGRGRGMMWPG 473 Query: 953 HMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGD 774 MPLARGARP+PG+RGFPP+M+G DGF+YG +TPDGFPMPDLFG+APR FAPYGPRFSGD Sbjct: 474 PMPLARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRFSGD 532 Query: 773 LSGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXX 594 +G G GMMF GRP QPG+VF Sbjct: 533 FTGPG-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMG- 574 Query: 593 MSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPL-----------DMG--Q 453 A P + N++RV K+D R + D G Q Sbjct: 575 ---PAATNPRGGRPVGVPPPFPNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQ 631 Query: 452 EMAGPGMLDDGK---YQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKRKD 285 EM GPG D + Q G K ED +G RN FRNDESESEDEAPRRSRHGEG KKR+D Sbjct: 632 EMGGPGRGPDDEVQYQQEGSKANQEDQYGSRN-FRNDESESEDEAPRRSRHGEGKKKRRD 690 Query: 284 SEVD 273 SE D Sbjct: 691 SEGD 694 >ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Cicer arietinum] Length = 677 Score = 788 bits (2035), Expect = 0.0 Identities = 418/703 (59%), Positives = 463/703 (65%), Gaps = 8/703 (1%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXXXSEPVAGNN 2193 +D EGVLSFDFEGGLD APPS + +VP A S I + + PV+GN Sbjct: 2 EDSEGVLSFDFEGGLDAAPPSAATVSVP--APPSGPIVHPDSSLPPSISSNGAAPVSGNI 59 Query: 2192 IARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHSNE 2013 RR FRQTVCRHWLRSLCMKG+ACGFLHQYDKARMP+CRF+RLYGECREQDCVYKH+NE Sbjct: 60 PGRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHTNE 119 Query: 2012 DIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQRNA 1833 DIKECNMYKLGFCPNGPDCRYRH K PGPPPP EEV+QKIQH ++N+ +S++F QQR + Sbjct: 120 DIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSYNFNNSHKFIQQRGS 179 Query: 1832 SYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQNLPN 1653 SYT Q E+SQFPQG N NQ VA K +TQNL N Sbjct: 180 
SYTQQVEKSQFPQGINSANQGVAGKPLAAESGNVQQQQQVQQSQQQVSQI---QTQNLAN 236 Query: 1652 SLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSIDNV 1473 P +AN+TATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS++NV Sbjct: 237 GQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVENV 296 Query: 1472 ILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 1293 ILIFSVNRTRHFQGCAKMTS+IGG V GGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR Sbjct: 297 ILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 356 Query: 1292 HLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKGV 1113 HLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAIS+ KGV Sbjct: 357 HLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKAKGV 416 Query: 1112 NLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPHMPLARG 933 N D+ ENPDIVPF + Q GMMW PHMPL RG Sbjct: 417 NPDNAGENPDIVPFEDNEEEEEEESDEEEESFVQAVVPVGQGRGRGRGMMWPPHMPLGRG 476 Query: 932 ARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDLSGLGQS 753 ARPMPG++GF PVMM GDG +YG PDGF MPDLFGM PR F PYGPRFSGD + Sbjct: 477 ARPMPGMQGFNPVMM-GDGLSYGPGAPDGFGMPDLFGMGPRGFGPYGPRFSGDFA----- 530 Query: 752 SAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXMSVAAPV 573 GP + MMF GRP+QPG M V P Sbjct: 531 -----------GPPAAMMFRGRPSQPG-----MFPGGGFGMMMNPGRGPFMGGMGVPGPN 574 Query: 572 RASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRR-----PLDMGQEMAGPGMLDDGKYQS 408 P N NR+ K+DQR GQE G D QS Sbjct: 575 PPRGGRPLNMPPMFPPPPPPPQNVNRIAKRDQRTNDRNDRYSSGQEQ---GKSQDMLSQS 631 Query: 407 G---IKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRK 288 G ++Q + S N+FRN++SESEDEAPRRSRHGEGKKRK Sbjct: 632 GGPDDEMQYQQSGAPANNFRNEDSESEDEAPRRSRHGEGKKRK 674 >ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Citrus sinensis] Length = 683 Score = 787 bits (2032), Expect = 0.0 Identities = 430/717 (59%), Positives = 479/717 (66%), Gaps = 17/717 (2%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXXXSEPVAGNN 2193 +D EG LSFDFEGGLD A P P+A+ P A SS + S PV ++ Sbjct: 2 
EDSEGGLSFDFEGGLD-AGPGMPTASNPAAAPSSSGAA----------PDHASAPVPHHS 50 Query: 2192 IARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHSNE 2013 RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RL+GECREQDCVYKH+NE Sbjct: 51 -GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCVYKHTNE 109 Query: 2012 DIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQRNA 1833 DIKECNMYKLGFCPNGPDCRYRHVKLPGPPP EEV+QKIQ S++N+G+ N+ FQQR A Sbjct: 110 DIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNKHFQQRGA 169 Query: 1832 SYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQNLPN 1653 ++HQT++SQF QG N VNQ A K ST + QNLPN Sbjct: 170 -FSHQTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTT--QMQNLPN 226 Query: 1652 SLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSIDNV 1473 LP + N+ ATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS +NV Sbjct: 227 GLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAENV 286 Query: 1472 ILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 1293 ILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR Sbjct: 287 ILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 346 Query: 1292 HLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKGV 1113 HLRNPYNENLPVKISRDCQELEP +GEQLA+LLYLEPDSELMAISV KGV Sbjct: 347 HLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEEKAKGV 406 Query: 1112 NLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPHMPLARG 933 N D+ +NPDIVPF +A+Q GMMW MPLARG Sbjct: 407 NPDNGGDNPDIVPFEDNEEEEEEESEEEE----ESLGTASQGRGRGRGMMWPGPMPLARG 462 Query: 932 ARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDLSGLGQS 753 ARP+PG+RGFPP+M+G DGF+YG +TPDGFPMPDLFG+APR FAPYGPRFSGD +G G Sbjct: 463 ARPVPGMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRFSGDFTGPG-- 519 Query: 752 SAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXMSVAAPV 573 GMMF GRP QPG+VF A Sbjct: 520 ---------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMG----PAAT 560 Query: 572 RASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPL-----------DMG--QEMAGPGM 432 P + N++R K+D R + D G QEM GPG Sbjct: 561 
NPRGGRPVGVPPPFPNQPQSSQNSSRAAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGR 620 Query: 431 LDDGK---YQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKRKDSEVD 273 D + Q G K ED +G RN FRNDESESEDEAPRRSRHGEG KKR+DSE D Sbjct: 621 GPDDEVQYQQEGSKANQEDQYGSRN-FRNDESESEDEAPRRSRHGEGKKKRRDSEGD 676 >ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris] gi|561020727|gb|ESW19498.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris] Length = 697 Score = 786 bits (2029), Expect = 0.0 Identities = 416/720 (57%), Positives = 465/720 (64%), Gaps = 16/720 (2%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXXXS-EPVAGN 2196 +D EGVLSFDFEGGLDTAP + + + PL+ DSS ++ EP A N Sbjct: 2 EDSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSGTEPAAVN 61 Query: 2195 NIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHSN 2016 RR FRQTVCRHWLRSLCMKGDACGFLHQYDKARMP+CRF+RLYGECREQDCVYKH+N Sbjct: 62 VPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHTN 121 Query: 2015 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQRN 1836 EDIKECNMYKLGFCPNGPDCRYRH K PGPPPP EEV+QKIQH ++NY SSN+FFQQR Sbjct: 122 EDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQQRG 181 Query: 1835 ASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQNLP 1656 +SYT Q E+SQ PQG+N NQ V K + QN+ Sbjct: 182 SSYTQQAEKSQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQQVSQN--QIQNVA 239 Query: 1655 NSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSIDN 1476 N P +A++ ATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS++N Sbjct: 240 NGQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVEN 299 Query: 1475 VILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 1296 VILIFSVNRTRHFQGCAKMTS+IGG V GGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT Sbjct: 300 VILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 359 Query: 1295 RHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKG 1116 RHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPD ELMA+SV KG Sbjct: 360 
RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKREEEKAKG 419 Query: 1115 VNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPHMPLAR 936 VN D+ ENPDIVPF A Q GMMW PHMPL R Sbjct: 420 VNPDNGGENPDIVPFEDNEEEEEEESDEEDESFGHGVGPAGQGRGRGRGMMWPPHMPLPR 479 Query: 935 GARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDLSGLGQ 756 GARPMPG++GF PVMM GDG +YG + PDGF MPDLF + PRAFAPYGPRFSGD Sbjct: 480 GARPMPGMQGFNPVMM-GDGLSYGPVAPDGFGMPDLFSVGPRAFAPYGPRFSGDFG---- 534 Query: 755 SSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXMSVAAP 576 GP + MMF GRP+QPG ++ A P Sbjct: 535 ------------GPPAAMMFRGRPSQPG---MFPGGGFGMMMNPGRGPFMGGMGVAGANP 579 Query: 575 VRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR---------------RPLDMGQEMAG 441 R N NR+ K+DQR + DM + Sbjct: 580 PRGGRPVNMPPMFPPPPP--LPQNTNRLAKRDQRTTDRNDRYGSGSEQGKSQDMLSQSGA 637 Query: 440 PGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRKDSEVDEQQN 261 P DD +YQ G K +D N+FRND+SESEDEAPRRSRHGEGKK++ D N Sbjct: 638 PD--DDMQYQQGYKAN-QDDHPAVNNFRNDDSESEDEAPRRSRHGEGKKKRRGPEDVNTN 694 >ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Glycine max] Length = 691 Score = 780 bits (2013), Expect = 0.0 Identities = 419/716 (58%), Positives = 464/716 (64%), Gaps = 22/716 (3%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVP---LIATDSSVISNTXXXXXXXXXXXXSEPVA 2202 +D EGVLSFDFEGGLD AP S+ +AAVP L+ DSS ++ + A Sbjct: 2 EDSEGVLSFDFEGGLDAAP-SSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPSTADPA 60 Query: 2201 GNNI-ARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYK 2025 G N+ RR FRQTVCRHWLRSLCMKGDACGFLHQYDKARMP+CRF+RLYGECREQDCVYK Sbjct: 61 GGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 120 Query: 2024 HSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQ 1845 H+NEDIKECNMYKLGFCPNGPDCRYRH K PGPPPP EEV+QKIQH ++NY SSN+FFQ Sbjct: 121 HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNYNSSNKFFQ 180 Query: 1844 QRNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQ 1665 QR ASY Q E+ Q PQG+N NQ V K + Q Sbjct: 181 
QRGASYNQQAEKPQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQVNQS---QMQ 237 Query: 1664 NLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 1485 N+ N P +AN+TATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS Sbjct: 238 NVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDS 297 Query: 1484 IDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSF 1305 ++NVIL+FSVNRTRHFQGCAKMTS+IGG V GGNWKYAHGTAHYGRNFSVKWLKLCELSF Sbjct: 298 VENVILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSF 357 Query: 1304 HKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXXXX 1125 HKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAISV Sbjct: 358 HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEK 417 Query: 1124 XKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPHMP 945 KGVN D+ ENPDIVPF A Q GMMW PHMP Sbjct: 418 AKGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFSHGVGPAGQGRGRGRGMMWPPHMP 477 Query: 944 LARGARPMPGLRGFPPVMMGGDGFTY---GAITPDGFPMPDLFGMAPRAFAPYGPRFSGD 774 L RGARPMPG++GF PVMM GDG +Y G + PDGF MPDLFG+ PR FAPYGPRFSGD Sbjct: 478 LGRGARPMPGMQGFNPVMM-GDGLSYGPVGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGD 536 Query: 773 LSGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXX 594 GP + MMF GRP+QPG Sbjct: 537 FG----------------GPPAAMMFRGRPSQPG---MFPSGGFGMMMNPGRGPFMGGMG 577 Query: 593 MSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR---------------RPLDM 459 + A P R N NR K+DQR + DM Sbjct: 578 VGGANPPRGGRPVNMPPMFPPPPP--LPQNANRAAKRDQRTADRNDRFGSGSEQGKSQDM 635 Query: 458 GQEMAGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKR 291 + GP DD +YQ G K +D N+FRND+SESEDEAPRRSRHGEGKK+ Sbjct: 636 LSQSGGPD--DDAQYQQGYKGN-QDDHPAVNNFRNDDSESEDEAPRRSRHGEGKKK 688 >gb|EXB51974.1| Cleavage and polyadenylation specificity factor CPSF30 [Morus notabilis] Length = 710 Score = 776 bits (2005), Expect = 0.0 Identities = 424/728 (58%), Positives = 473/728 (64%), Gaps = 28/728 (3%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTA----PPSNPSAAVPLIATDSSVISNTXXXXXXXXXXXXSEPV 2205 +D EGVLSFDFEGGLDT PP+ +A+ LI DSS + + +P Sbjct: 2 
EDSEGVLSFDFEGGLDTTAGGCPPNAAAASAALIHPDSSAAAASNNLAASNSAVSA-DPT 60 Query: 2204 AG-----NNIAR-RCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECRE 2043 +G +N R R FRQTVCRHWLRSLCMKG+ACGFLHQYDK+RMP+CRF+RLYGECRE Sbjct: 61 SGGGGGASNPGRGRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECRE 120 Query: 2042 QDCVYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGS 1863 QDCVYKH+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPP EEV+QKIQH S++NY Sbjct: 121 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQHLSSYNY-H 179 Query: 1862 SNRFFQQRNAS-YTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXX 1686 SN+FFQQRNA + E+ P G N V+Q V K S Sbjct: 180 SNKFFQQRNAGGFAQLGEKPLLPLGPNAVSQGVVGKPSILESANVQQPQQQVQPSQQPVG 239 Query: 1685 XXXIETQNLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAK 1506 + QN+ LP +AN+T PLP G+SRYFIVKSCNRENLELSVQQGVWATQRSNEAK Sbjct: 240 QN--QIQNVFTGLPNQANRTVAPLPPGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK 297 Query: 1505 LNEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWL 1326 LNEAFD +NVILIFSVNRTRHFQGCAKM S+IGG + GGNWKYAHGTAHYGRNFSVKWL Sbjct: 298 LNEAFDCAENVILIFSVNRTRHFQGCAKMISRIGGSISGGNWKYAHGTAHYGRNFSVKWL 357 Query: 1325 KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXX 1146 KLCELSFHKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAIS+ Sbjct: 358 KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAE 417 Query: 1145 XXXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGM 966 KGV+ D+ ENPDIVPF SFSQ L A Q G+ Sbjct: 418 SKREEEKAKGVDPDNGGENPDIVPF-EDNEEDEEEESEDEEESFSQVLGANQGRGRGRGV 476 Query: 965 MWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPR 786 MW PHMPL+RGARPMP ++GFPPVM+G DG YG +TPDGFPMPDLF + PRAF PYGPR Sbjct: 477 MWPPHMPLSRGARPMPSMQGFPPVMIGADGSPYGPVTPDGFPMPDLFNVGPRAFNPYGPR 536 Query: 785 FSGDLSGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVF-XXXXXXXXXXXXXXXXXX 609 F GD GPTSGMMF GRP QPG VF Sbjct: 537 FPGDF----------------MGPTSGMMFRGRPTQPGAVFPGGGFGMMMGPGRAPCMGG 580 Query: 608 XXXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLD----------- 462 S A P+R N NR ++DQR + Sbjct: 581 
MGVQGTSPARPMRPGAMPPMFQQPPP-----PSQNMNRPPRRDQRGLANDRNERYGAGSD 635 Query: 461 --MGQEMAGP--GMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-K 297 GQEM+GP G DD YQ G K + ED +G NSFRNDESESEDEAPRRSRHG+G K Sbjct: 636 QVRGQEMSGPAGGPEDDAHYQLGAKARQEDQYGAGNSFRNDESESEDEAPRRSRHGDGKK 695 Query: 296 KRKDSEVD 273 KR+ SE D Sbjct: 696 KRRSSEED 703 >ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Glycine max] Length = 681 Score = 771 bits (1991), Expect = 0.0 Identities = 415/712 (58%), Positives = 456/712 (64%), Gaps = 18/712 (2%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSA-AVPLIATDSSVISNTXXXXXXXXXXXXS-EPVAG 2199 +D EGVLSFDFEGGLD AP S +A + PLI DSS ++ + +PV G Sbjct: 2 EDSEGVLSFDFEGGLDAAPSSAAAAPSGPLIPHDSSAAASAVSNGGPAAPAPSAVDPVGG 61 Query: 2198 NNI-ARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKH 2022 N+ RR FRQTVCRHWLRSLCMKGDACGFLHQYDKARMP+CRF+RLYGECREQDCVYKH Sbjct: 62 GNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYKH 121 Query: 2021 SNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQ 1842 +NEDIKECNMYKLGFCPNGPDCRYRH K PGPPPP EEV+QKIQH ++NY SSN+FFQQ Sbjct: 122 TNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQQ 181 Query: 1841 RNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQN 1662 R ASY Q E+ PQG+N NQ V + QN Sbjct: 182 RGASYNQQAEKPLLPQGNNSTNQGVT---GNPLPAELGNAQPQQQVQQSQQQVNQSQMQN 238 Query: 1661 LPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSI 1482 + N P +AN+TATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS+ Sbjct: 239 VANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSV 298 Query: 1481 DNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 1302 +NVILIFSVNRTRHFQGCAKMTSKIGG V GGNWKYAHGTAHYGRNFSVKWLKLCELSFH Sbjct: 299 ENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 358 Query: 1301 KTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1122 KTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAISV Sbjct: 359 
KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKA 418 Query: 1121 KGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPHMPL 942 KGVN D+ ENPDIVPF A Q GMMW PHMPL Sbjct: 419 KGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPAGQGRGRGRGMMWPPHMPL 478 Query: 941 ARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDLSGL 762 RGARPMPG++GF PVMM GDG +YG + PDGF MPDLFG+ PR FAPYGPRFSGD Sbjct: 479 GRGARPMPGMQGFNPVMM-GDGLSYGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGDFG-- 535 Query: 761 GQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXMSVA 582 GP + MMF GRP+QPG + A Sbjct: 536 --------------GPPAAMMFRGRPSQPG---MFPGGGFGMMLNPGRGPFMGGIGVGGA 578 Query: 581 APVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR---------------RPLDMGQEM 447 P R N NR K+DQR + DM + Sbjct: 579 NPPRGGRPVNMPPMFPPPPP--LPQNANRAAKRDQRTADRNDRFGSGSEQGKSQDMLSQS 636 Query: 446 AGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKR 291 GP DD +YQ G K G D+SESEDEAPRRSRHGEGKK+ Sbjct: 637 GGPD--DDPQYQQGYK--------GNQDDHPDDSESEDEAPRRSRHGEGKKK 678 >ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Cucumis sativus] Length = 707 Score = 761 bits (1964), Expect = 0.0 Identities = 409/723 (56%), Positives = 465/723 (64%), Gaps = 23/723 (3%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSA--AVPLIATDSSV------ISNTXXXXXXXXXXXX 2217 +D EGVLSFDFEGGLD A P+NP+A ++P+I +DSS +SN Sbjct: 2 EDSEGVLSFDFEGGLD-AGPTNPAATSSLPIINSDSSAPPAASAVSNPLSGALGPAVSAE 60 Query: 2216 SEPVAGNNIA-RRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQ 2040 N+ RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPICRF+RLYGECREQ Sbjct: 61 PTGAPHGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQ 120 Query: 2039 DCVYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSS 1860 DCVYKH+NEDIKECNMYK GFCPNGPDCRYRH KLPGPPPP EE++QKIQH ++NYG S Sbjct: 121 DCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPLEEILQKIQHLGSYNYGPS 180 Query: 1859 NRFFQQRNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXX 1680 N+FF QR + Q E+SQFPQ +V Q V K S Sbjct: 181 
NKFFTQRGVGLSQQNEKSQFPQVPALVTQGVTGKPSAAESVNVQQQQGQQSAPQASQTP- 239 Query: 1679 XIETQNLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1500 Q+L N P + N+ AT LPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN Sbjct: 240 ---VQSLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 296 Query: 1499 EAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKL 1320 EAFDS DNVILIFSVNRTRHFQGCAKM S+IGG V GGNWKYAHGT HYG+NFS+KWLKL Sbjct: 297 EAFDSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTPHYGQNFSLKWLKL 356 Query: 1319 CELSFHKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXX 1140 CELSF KTRHLRNPYNENLPVKISRDCQELEP VGEQLASLLYLEPD ELMA+SV Sbjct: 357 CELSFQKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDGELMAVSVAAESK 416 Query: 1139 XXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLS-AAQXXXXXXGMM 963 KGVN D +ENPDIVPF SF Q+ Q GMM Sbjct: 417 REEEKAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSAGLPPQGRGRGRGMM 476 Query: 962 WAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYG--P 789 W PHMP+ RGARP G++GFPP MMG DG +YG +TPDGFPMPD+FGM PR F PYG P Sbjct: 477 WPPHMPMGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMTPRGFGPYGPTP 536 Query: 788 RFSGDLSGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXX 609 RFSGD GP + MMF GRP+QP +F Sbjct: 537 RFSGDF----------------MGPPTAMMFRGRPSQPAAMF--PPSGFGMMMGQGRGPF 578 Query: 608 XXXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR----------RPLDM 459 ++ A P R P +Q N NR +K+DQR + Sbjct: 579 MGGMGVAGANPARPGRPVGVSPLYPPPAVPSSQ-NMNRAIKRDQRGLTNDRYIVGMDQNK 637 Query: 458 GQEMAGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKRKDS 282 G E+ G ++ +Y+ G K ++ +G +FRN+ESESEDEAPRRSRHGEG KKR+ S Sbjct: 638 GVEIQSSGRDEEMQYKQGSKAYSDEQYGTGTTFRNEESESEDEAPRRSRHGEGKKKRRGS 697 Query: 281 EVD 273 E D Sbjct: 698 EGD 700 >ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica] gi|462410040|gb|EMJ15374.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica] Length = 695 Score = 740 bits (1910), Expect = 0.0 Identities = 401/722 (55%), Positives = 458/722 (63%), Gaps = 22/722 (3%) Frame = -2 
Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVP----LIATDSSVISNTXXXXXXXXXXXXSEPV 2205 +D +G ++FDFEGGLD + P+ P L+ +DS V + P Sbjct: 2 EDSDGDINFDFEGGLDATAAAGPTNPGPPSNSLMQSDSGVAAVDTNPAAAAPQPNHPNP- 60 Query: 2204 AGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYK 2025 N R +RQTVCRHWLRSLCMKG+ACGFLHQYDK+RMP+CRF+RLYGECREQDCVYK Sbjct: 61 --NRSGGRSYRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 118 Query: 2024 HSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQ 1845 H+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EEV+QKIQH +++NY +SN+F+Q Sbjct: 119 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNTSNKFYQ 178 Query: 1844 QRNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQ 1665 QRNA + Q ++ Q QG N V Q V K ST +TQ Sbjct: 179 QRNAGFPQQADKYQSAQGPNSVYQGVVGKPSTGESANVHQQQQVQQTQQQVGHT---QTQ 235 Query: 1664 NLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 1485 NLPN L +AN++A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS Sbjct: 236 NLPNGLANQANRSA-PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDS 294 Query: 1484 IDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSF 1305 +NVILIFSVNRTRHFQGCAKM S+IGG V GGNWKYAHG+AHYGRNFSVKWLKLCELSF Sbjct: 295 AENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGSAHYGRNFSVKWLKLCELSF 354 Query: 1304 HKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXXXX 1125 HKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMA+S+ Sbjct: 355 HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSIAAESKREEEK 414 Query: 1124 XKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQ--XXXXXXGMMWAPH 951 KGVN ++ ENPDIVPF SF G+MW PH Sbjct: 415 AKGVNPENGGENPDIVPF-EDNEEEEEEESDDEEESFGPVPGVGNEGRGRGRGGIMWPPH 473 Query: 950 MPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDL 771 MPLARG RPMPG++GFPP MMG D YG PDGF MP+ FG+ PR F PYGPRFSGD Sbjct: 474 MPLARGGRPMPGMQGFPPGMMGADAMPYGP-APDGFGMPNPFGVGPRGFNPYGPRFSGDF 532 Query: 770 SGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXM 591 TGPT GMMF GRP QPG + Sbjct: 533 ----------------TGPTPGMMFRGRPQQPG----FPPGGYGMMMGPGRAPFMGGMGV 572 
Query: 590 SVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLD-------------MGQE 450 A P R + N NR+ K+D R P + GQE Sbjct: 573 GGANPGRPGRPTGMSPMFPPP----SSQNTNRMQKRDPRGPSNDRNERYSAGSGQGKGQE 628 Query: 449 MAG--PGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKR-KDSE 279 + G G D+ +YQ K ED +G N+ RND+SESEDEAPRRSRHGEGKK+ + SE Sbjct: 629 IPGLAGGPDDEARYQQASKAYREDQYGAGNNSRNDDSESEDEAPRRSRHGEGKKKGRGSE 688 Query: 278 VD 273 D Sbjct: 689 GD 690 >ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Fragaria vesca subsp. vesca] Length = 689 Score = 731 bits (1886), Expect = 0.0 Identities = 396/715 (55%), Positives = 455/715 (63%), Gaps = 15/715 (2%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXXXSEPVAG-N 2196 +D +GVL+FDFEGGLD+A S P+ +A+ + + S++ +P N Sbjct: 2 EDPDGVLNFDFEGGLDSAAVSAPTHTG--LASSAPIQSDSFASQPKNQAAPAPQPDPNVN 59 Query: 2195 NIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHSN 2016 R+ FRQTVCRHWLRSLCMKG+ACGFLHQYDK+RMP+CRF+R+YGECREQDCVYKH+N Sbjct: 60 PSGRKSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRMYGECREQDCVYKHTN 119 Query: 2015 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQRN 1836 EDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EEV+QKIQH +++NY +SN+F Q RN Sbjct: 120 EDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNNSNKFSQPRN 179 Query: 1835 ASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQNLP 1656 + Q +RSQ Q +N NQVV + + + Q++P Sbjct: 180 GGFPQQHDRSQPAQVTNSFNQVVVRPSAAES----ANVQQPQQFQQTQQPVAQTQAQSVP 235 Query: 1655 NSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSIDN 1476 N L ++AN+ A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS +N Sbjct: 236 NGLASQANRAALPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSAEN 295 Query: 1475 VILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 1296 VILIFSVNRTRHFQGCAKM S+IGG V GGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT Sbjct: 296 VILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 355 Query: 1295 RHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKG 1116 
RHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAIS+ KG Sbjct: 356 RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKAKG 415 Query: 1115 VNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPHMPL-A 939 VN ++ ENPDIVPF Q A +MW PHMPL Sbjct: 416 VNPENGGENPDIVPFEDNEEEEEEESDDEEDY---QVPGGAIENRGRGRVMWPPHMPLGG 472 Query: 938 RGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGM-APRAFAPYGPRFSGDLSGL 762 RG RPMPG++GFP MMG D YG +TPDGF MP+ FGM PR F PYGPRFSGD Sbjct: 473 RGGRPMPGMQGFPG-MMGPDAMPYGPVTPDGFVMPNPFGMGGPRGFNPYGPRFSGDFG-- 529 Query: 761 GQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXMSVA 582 GP GMMF GRP QPG +F + Sbjct: 530 --------------GPNPGMMFRGRPPQPGGMFPPGPYGMMMGPGRGPFMGGMGVGGN-- 573 Query: 581 APVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR-----------RPLDMGQEMAGPG 435 P R NNNR+ K+D R G+EM G Sbjct: 574 NPARGGRPGGMPPMFPPHP---PSQNNNRLQKRDPRGSGNDRNERYSAGSGHGKEMQAGG 630 Query: 434 MLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKRKDSEVD 273 D+ YQ K ED +G N+ RND+SESEDEAPRRSRHGEG KKR+DSE D Sbjct: 631 PDDENHYQHSSKSYQED-YGAGNNGRNDDSESEDEAPRRSRHGEGKKKRRDSEGD 684 >ref|XP_002300333.2| zinc finger family protein [Populus trichocarpa] gi|550349048|gb|EEE85138.2| zinc finger family protein [Populus trichocarpa] Length = 669 Score = 728 bits (1878), Expect = 0.0 Identities = 399/727 (54%), Positives = 451/727 (62%), Gaps = 26/727 (3%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDS----SVISNTXXXXXXXXXXXXSEPV 2205 +D EGVLSFDFEGGLD+ P +NP A++P I +D+ + + + Sbjct: 2 EDSEGVLSFDFEGGLDSGP-ANPIASIPAIPSDNYGAATAAAPNTTNTTTNTTNNSNSGA 60 Query: 2204 AGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYK 2025 A RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RLYGECREQDCVYK Sbjct: 61 ADIQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 120 Query: 2024 HSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQ 1845 H+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EEVVQKIQ +++N +SN+ FQ Sbjct: 121 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVVQKIQQLNSYNGVTSNKNFQ 180 Query: 1844 
QRNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQ 1665 QRNA ++ Q E+S N ++ + +A+ + Q Sbjct: 181 QRNAGFSQQIEKSP--------NTIIKPSGTESANVQQQQQQQQQTQTPHLTNGQHQQPQ 232 Query: 1664 NLPNSLPTEANKTATPLPQGLSR-----------YFIVKSCNRENLELSVQQGVWATQRS 1518 P N+ ATPLPQG+S YFIVKSCNRENLELSVQQGVWATQRS Sbjct: 233 Q-----PNPLNRIATPLPQGISSFFSCVSPSQFVYFIVKSCNRENLELSVQQGVWATQRS 287 Query: 1517 NEAKLNEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFS 1338 NE KLNEA DS DNVILIFSVNRTRHFQGCAKM SKIG VGGGNWKYAHGTAHYGRNFS Sbjct: 288 NEIKLNEALDSADNVILIFSVNRTRHFQGCAKMASKIGASVGGGNWKYAHGTAHYGRNFS 347 Query: 1337 VKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAIS 1158 VKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELEP +GEQLASLLYLEPDSELMA+S Sbjct: 348 VKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVS 407 Query: 1157 VXXXXXXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLS-AAQXXX 981 + KGVN D ENPDIVPF SF Q L AAQ Sbjct: 408 LAAEAKREEEKEKGVNPDSGGENPDIVPF-EDNEEEEEEESEEEEESFGQPLGPAAQGRG 466 Query: 980 XXXGMMWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFA 801 GMMW H P+ARGARP+PG+RGFPP+MMG DGF+YGA+TPD F MPDLFG+A R F Sbjct: 467 RGRGMMWPSHNPMARGARPIPGIRGFPPMMMGADGFSYGAVTPDSFGMPDLFGVASRGFP 526 Query: 800 PYGPRFSGDLSGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXX 621 PYGPRFSGD TG SGMMF GRP+QPG VF Sbjct: 527 PYGPRFSGDF----------------TGAASGMMFPGRPSQPGAVFPAGGFGMMMGPGRP 570 Query: 620 XXXXXXXXXMS----------VAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRR 471 S + AP A + NN+R VK+DQR Sbjct: 571 PFIGGMGPTPSNLLRGPRPGGMFAPFPAP----------------SSQNNSRSVKRDQRA 614 Query: 470 PLDMGQEMAGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKR 291 + + + FG NS RNDESESEDEAPRRSRHGEGKK+ Sbjct: 615 AANDRNDR-------------------HNQFGAVNSIRNDESESEDEAPRRSRHGEGKKK 655 Query: 290 KDSEVDE 270 + D+ Sbjct: 656 RRGSGDD 662 >gb|EYU43238.1| hypothetical protein MIMGU_mgv1a002387mg [Mimulus guttatus] Length = 681 Score = 726 bits (1875), Expect = 0.0 Identities = 395/727 (54%), Positives = 458/727 (62%), Gaps = 27/727 (3%) Frame = -2 
Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXXXSEPVAG-- 2199 DD EG LSFDFEGGLD P S+P+A+VP+I + ++ + + + PV Sbjct: 2 DDGEGGLSFDFEGGLDIGP-SHPTASVPVIQSSANANTASAAAAAANPYNPSAAPVPATQ 60 Query: 2198 -----NNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDC 2034 NN RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPICRF+RLYGECREQDC Sbjct: 61 AAEGMNNGGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDC 120 Query: 2033 VYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNR 1854 VYKH+NED+KECNMYKLGFCPNGPDCRYRH KLPGPPP EEV+QKIQ +++NYG SN Sbjct: 121 VYKHTNEDVKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQQLTSYNYGKSNN 180 Query: 1853 FFQQRNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXI 1674 FFQ RN+++ QTE+ QFPQG N +QV + + Sbjct: 181 FFQNRNSNFAQQTEKPQFPQGPNGTHQVGKTNAAEPGNLNQPAQQSQQPGSQG------- 233 Query: 1673 ETQNLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1494 + Q++PN +A++ ATPLPQG SRYF+VKSCNRENLELSVQQGVWATQRSNEAKLNEA Sbjct: 234 QLQSIPNDQQNQASRNATPLPQGASRYFVVKSCNRENLELSVQQGVWATQRSNEAKLNEA 293 Query: 1493 FDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCE 1314 F+S++N+ILIFSVN+TRHFQGCAKMTS+IGG VGGGNWK+AHGTAHYGRNF++KWLKLCE Sbjct: 294 FESVENIILIFSVNKTRHFQGCAKMTSRIGGSVGGGNWKHAHGTAHYGRNFALKWLKLCE 353 Query: 1313 LSFHKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXX 1134 L+F KTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDS+LMAI++ Sbjct: 354 LTFDKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSDLMAIAIAAELKRE 413 Query: 1133 XXXXKGVNLDDETENPDIVPF---XXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMM 963 KGVN+D+ ENPDIVPF F AQ GMM Sbjct: 414 EEKAKGVNIDNGAENPDIVPFEDNEEEEEEEEEEEESEDEDEFPGQAFGAQGRGVGRGMM 473 Query: 962 WAPHM-PLARGARPMPGLRGFPPVMMGGDGFTYGAITP---DGFPMPDLFGMAPRAFAPY 795 W PHM PL RG RP PG+RGFPP MMGGDGF YG P DGFPM D FGM PR F + Sbjct: 474 WGPHMPPLGRGPRPFPGVRGFPPNMMGGDGFPYGHGPPLNHDGFPMHDPFGMVPRGFGQF 533 Query: 794 GPRFSGDLSGLGQSSAM-------GFTPV--DGTGPTSGMMFHGRP-NQPGNVFXXXXXX 645 GPRF GD +G M GF P+ G GP G GRP P F Sbjct: 534 
GPRFGGDFAGPASGPMMFAGRPPGGFGPMMGQGRGPFMGGGRGGRPVGMPPPFFPPP--- 590 Query: 644 XXXXXXXXXXXXXXXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPL 465 PV A N+ VK+DQ+ P Sbjct: 591 --------------------PPPVAAQPPP----------------QNSNWVKRDQKAPY 614 Query: 464 DMGQEMAGPGMLDDGKYQSGIKVQCEDSFGGR--NSFRNDESESEDEAPRRSRHGEG-KK 294 +++ D GK Q + + S+RNDESESEDEAPRRSRHGEG KK Sbjct: 615 SDRNDVS-----DQGKGQEIVSGSSNRGNAAKREESYRNDESESEDEAPRRSRHGEGKKK 669 Query: 293 RKDSEVD 273 R+ SE + Sbjct: 670 RRGSEAE 676 >gb|AHN05783.1| YTH domain-contained RNA binding protein 14 [Malus domestica] Length = 667 Score = 724 bits (1869), Expect = 0.0 Identities = 399/723 (55%), Positives = 455/723 (62%), Gaps = 23/723 (3%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDT----APPSNPSAAVPLIATDSSVISNTXXXXXXXXXXXXSEPV 2205 +D +G L+FDFEGGLD + + P+ VP ++ SV+ + + P Sbjct: 2 EDSDGGLNFDFEGGLDAPATVSASAGPANTVP--TSNYSVMQSDSAVTGLGANQAAAAPQ 59 Query: 2204 AGNNIAR---RCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDC 2034 N R R +RQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RLYGECREQDC Sbjct: 60 PNQNANRTGGRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 119 Query: 2033 VYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNR 1854 VYKH+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EEV+QKIQH +++NY +S++ Sbjct: 120 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLTSYNYNNSSK 179 Query: 1853 FFQQRNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXI 1674 F+QQRNA + Q ++ Q QG N V + TTA+ Sbjct: 180 FYQQRNAGFPQQGDKHQPAQGPNNF-----VGKPTTAEPGNVQQQQQQQLQQTQQHVGPT 234 Query: 1673 ETQNLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1494 +TQ LPN L +AN++A PLPQG SRYFIVKSCNRENLELSVQQG+WATQRSNE+KLNEA Sbjct: 235 QTQTLPNGLANQANRSALPLPQGTSRYFIVKSCNRENLELSVQQGLWATQRSNESKLNEA 294 Query: 1493 FDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCE 1314 FDS +NVILIFSVNRTRHFQGCAKM S+IGG VGGGNWKYAHGTAHYGRNFSVKWLKLCE Sbjct: 295 FDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCE 354 Query: 1313 LSFHKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXX 
1134 LSFHKTRHLRNPYNENLPVKISRDCQELE VGEQLASLLYLEPDSELMAIS+ Sbjct: 355 LSFHKTRHLRNPYNENLPVKISRDCQELELSVGEQLASLLYLEPDSELMAISIAAESKRE 414 Query: 1133 XXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSA---AQXXXXXXGMM 963 KGVN ++ ENPDIVPF SF Q A + G+M Sbjct: 415 EEKAKGVNPENGGENPDIVPF-EDNEEEEEEESEDEEDSFGQVPGAGNDGRGRGRGGGVM 473 Query: 962 WAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRF 783 W PHM L RG RPMPG++GFPP MMG D Y PDGF MP+ FGMAPR F PYGPRF Sbjct: 474 WPPHMALPRGGRPMPGMQGFPPGMMGHDAMPY---VPDGFVMPNPFGMAPRGFNPYGPRF 530 Query: 782 SGDLSGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXX 603 SGD TGP GMMF GRP QPG Sbjct: 531 SGDF----------------TGPNPGMMFRGRPQQPG---------------------FP 553 Query: 602 XXXMSVAAPVRASXXXXXXXXXXXXXXPL----------AQNNNNRVVKKDQRRPLD--M 459 + P RA + + N NR+ K+D R Sbjct: 554 PGGFGIMGPGRAPFMGGIHPGRGGRPTGMSPMFPPPPPPSSQNPNRMPKRDPRGASTDRK 613 Query: 458 GQEMAGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKRKDS 282 GQ+M+GP DD E +G NS RND+SESEDEAPRRSRHG+G KKR+DS Sbjct: 614 GQDMSGP---DD-----------ETHYGAGNSSRNDDSESEDEAPRRSRHGDGKKKRRDS 659 Query: 281 EVD 273 E D Sbjct: 660 EGD 662 >ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] gi|557551536|gb|ESR62165.1| hypothetical protein CICLE_v10014454mg [Citrus clementina] Length = 672 Score = 721 bits (1861), Expect = 0.0 Identities = 402/724 (55%), Positives = 452/724 (62%), Gaps = 24/724 (3%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVIS-------NTXXXXXXXXXXXXS 2214 +D EG LSFDFEGGLD A P P+A+ P I +DS+ + N + Sbjct: 2 EDSEGGLSFDFEGGLD-AGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60 Query: 2213 EPVAGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDC 2034 ++ RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RL+GECREQDC Sbjct: 61 SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120 Query: 2033 VYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNR 1854 VYKH+NEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPP EEV+QKIQ S++N+G+ N+ Sbjct: 121 
VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180 Query: 1853 FFQQRNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXI 1674 FQQR A ++HQ ++SQF QG N VNQ A K ST Sbjct: 181 LFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTT-- 237 Query: 1673 ETQNLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1494 + QNLPN LP + N+ ATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA Sbjct: 238 QMQNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 297 Query: 1493 FDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCE 1314 FDS +NVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKYAHGTAHYGRNFSVKWLKLCE Sbjct: 298 FDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCE 357 Query: 1313 LSFHKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXX 1134 LSFHKTRHLRNPYNENLPVK AISV Sbjct: 358 LSFHKTRHLRNPYNENLPVK-----------------------------AISVAAEAKRE 388 Query: 1133 XXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAP 954 KGVN D+ +NPDIVPF +A+Q GMMW Sbjct: 389 EEKAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEE----ESLGTASQGRGRGRGMMWPG 444 Query: 953 HMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGD 774 MPLARGARP+PG+RGFPP+M+G DGF+YG +TPDGFPMPDLFG+APR FAPYGPRFSGD Sbjct: 445 PMPLARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRFSGD 503 Query: 773 LSGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXX 594 +G G GMMF GRP QPG+VF Sbjct: 504 FTGPG-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMG- 545 Query: 593 MSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPL-----------DMG--Q 453 A P + N++RV K+D R + D G Q Sbjct: 546 ---PAATNPRGGRPVGVPPPFPNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQ 602 Query: 452 EMAGPGMLDDGK---YQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKRKD 285 EM GPG D + Q G K ED +G RN FRNDESESEDEAPRRSRHGEG KKR+D Sbjct: 603 EMGGPGRGPDDEVQYQQEGSKANQEDQYGSRN-FRNDESESEDEAPRRSRHGEGKKKRRD 661 Query: 284 SEVD 273 SE D Sbjct: 662 SEGD 665 >ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Solanum tuberosum] Length = 692 Score = 711 bits 
(1834), Expect = 0.0 Identities = 391/710 (55%), Positives = 452/710 (63%), Gaps = 9/710 (1%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIAT---DSSVISNTXXXXXXXXXXXXSEPVA 2202 D+ EG L+FDFEGGLDT P ++P+A+VP+I + ++ + + Sbjct: 2 DEGEGGLNFDFEGGLDTGP-THPTASVPVIQSFDHTAAAAPSANINPPTVSAAVGGQSDV 60 Query: 2201 GNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKH 2022 G RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPICRF+RLYGECREQDCVYKH Sbjct: 61 GFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDCVYKH 120 Query: 2021 SNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQ 1842 + EDIKECNMYKLGFCPNGPDCRYRH K+PGPPPP EE++QKIQH +++NYG SNRF Q Sbjct: 121 TIEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASYNYGYSNRFNQN 180 Query: 1841 RNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQN 1662 RNA+Y+ Q+++SQ Q N ++ +AVK + T + Q Sbjct: 181 RNANYSTQSDKSQASQAQNGMS--LAVKSTATETPIIQQHQPNQQVQPPQLQGGPTQAQI 238 Query: 1661 LPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSI 1482 PN +A++TA LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS+ Sbjct: 239 HPNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSV 298 Query: 1481 DNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 1302 +NVILIFSVNRTRHFQGC KMTS+IGG GGNWK+ HGTAHYGRNFSVKWLKLCELSF Sbjct: 299 ENVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSVKWLKLCELSFQ 358 Query: 1301 KTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1122 KT HLRNPYNENLPVKISRDCQELEP VGEQLASLLYLEPDSELMAIS+ Sbjct: 359 KTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRQEEKA 418 Query: 1121 KGVNLDDETENPDIVPF---XXXXXXXXXXXXXXXXXSFSQTLS-AAQXXXXXXGMMWAP 954 KGVN D+ +NPDIVPF SF Q AA G+ W P Sbjct: 419 KGVNPDNGKDNPDIVPFEDNEEEEEEEEEEESEDEDESFDQGFGPAALGRGRGRGIAWPP 478 Query: 953 HMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGD 774 MP G RP PG+RGFPP MM GDGF+YGA+TP+GFPMPD FGM PR F PYGP FS D Sbjct: 479 IMPFGHGPRPPPGMRGFPPGMM-GDGFSYGAMTPEGFPMPDHFGMGPRPFGPYGPPFSSD 537 Query: 773 
LSGLGQSSAMGFTPVDGTG--PTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXX 600 L G+ A GF + G G P G M G P Sbjct: 538 LMFHGRPPAGGFGMMMGPGRPPFMGGMGPGATGPPRAGRAVGMHPSFVPPSSQPSQYPYK 597 Query: 599 XXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLDMGQEMAGPGMLDDG 420 APV ++ N DQ + GQEM G DG Sbjct: 598 AKREQRAPV---------------------SDRNDRFSSDQGK----GQEMMGSVGGPDG 632 Query: 419 KYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRKDSEVDE 270 + K + ++ FG NS +N+ESESEDEAPRRSRHG+GKK++ +VDE Sbjct: 633 VHMQIGKSEHDNQFGAGNSQKNEESESEDEAPRRSRHGDGKKKR-RDVDE 681 >ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Solanum lycopersicum] Length = 689 Score = 709 bits (1831), Expect = 0.0 Identities = 390/707 (55%), Positives = 451/707 (63%), Gaps = 6/707 (0%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIAT--DSSVISNTXXXXXXXXXXXXSEPVAG 2199 D+ EG L+FDFEGGLDT P ++P+A+VP+I + ++ +++ + G Sbjct: 2 DEGEGGLNFDFEGGLDTGP-THPTASVPVIQSFDHTAAAASSANINPPTVPAVGGQGDVG 60 Query: 2198 NNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHS 2019 RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPICRF+RLYGECREQDCVYKH+ Sbjct: 61 FVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHT 120 Query: 2018 NEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQR 1839 EDIKECNMYKLGFCPNGPDCRYRH K+PGPPPP EE++QKIQH ++ NYG SNRF Q R Sbjct: 121 IEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASNNYGYSNRFNQNR 180 Query: 1838 NASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXIETQNL 1659 NA+Y+ QT++SQ Q N + +AVK + T + Q Sbjct: 181 NANYSTQTDKSQASQAQNGTS--LAVKSTATETPIIQQHQPHQQVQPPQLQGGPTQAQIH 238 Query: 1658 PNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSID 1479 PN +A++TA LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS++ Sbjct: 239 PNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVE 298 Query: 1478 NVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHK 1299 NVILIFSVNRTRHFQGC KMTS+IGG GGNWK+ HGTAHYGRNFS+KWLKLCELSF K Sbjct: 299 NVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSLKWLKLCELSFQK 358 
Query: 1298 TRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXXXXXK 1119 T HLRNPYNENLPVKISRDCQELEP VGEQLASLLYLEPDSELMAIS+ K Sbjct: 359 THHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRLEEKAK 418 Query: 1118 GVNLDDETENPDIVPF-XXXXXXXXXXXXXXXXXSFSQTLS-AAQXXXXXXGMMWAPHMP 945 GVN D+ +NPDIVPF +F Q AA G+ W P MP Sbjct: 419 GVNPDNGKDNPDIVPFEDNEEEEDEEEESEDEDENFDQGFGPAALGRGRGRGIAWPPIMP 478 Query: 944 LARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDLSG 765 G RP PG+RGFPP MM GDGF+YGA+TP+GFPM D FGM PR F PYGPRFS DL Sbjct: 479 FGHGPRPPPGMRGFPPGMM-GDGFSYGAMTPEGFPMTDHFGMGPRPFPPYGPRFSSDLMF 537 Query: 764 LGQSSAMGFTPVDGTG--PTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXM 591 G+ A GF + G G P G M G P Sbjct: 538 HGRPPAGGFGMMIGPGRPPFVGGMGPGATGPPRAGRAVRMHPSFIPPSSQPSQYPYRAKR 597 Query: 590 SVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLDMGQEMAGPGMLDDGKYQ 411 APV ++ N DQ + GQEM G DG + Sbjct: 598 EQRAPV---------------------SDRNDRFSSDQGK----GQEMMGSVNGPDGVHM 632 Query: 410 SGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRKDSEVDE 270 K + ++ FG NS +ND SESEDEAPRRSRHG+GKK++ +VDE Sbjct: 633 QIGKSEHDNQFGAGNSLKNDGSESEDEAPRRSRHGDGKKKR-RDVDE 678 >ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Solanum tuberosum] Length = 677 Score = 698 bits (1801), Expect = 0.0 Identities = 390/715 (54%), Positives = 454/715 (63%), Gaps = 18/715 (2%) Frame = -2 Query: 2372 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXXXSEPVAGNN 2193 DD EG L+FDFEGGLDT P ++P+A+VP++ + + + G + Sbjct: 2 DDGEGGLNFDFEGGLDTGP-THPTASVPVLQSAGHITTGPAPNASVALVPPGGGVGQGGD 60 Query: 2192 IA----RRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYK 2025 + RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RLYGECREQDCVYK Sbjct: 61 GSFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 120 Query: 2024 HSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQ 1845 H+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EV+Q+IQ+ ++ YG SNRFFQ Sbjct: 121 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVVEVLQRIQNLTS--YGYSNRFFQ 178 Query: 1844 
QRNASYTHQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXI--E 1671 RN +Y+ Q ++SQ PQ N++NQ V +ST A+ + Sbjct: 179 NRNTNYSTQADKSQIPQVPNVMNQAV---KSTAAEPPIGQPHQPHQQQVQQPQHQGAPTQ 235 Query: 1670 TQNLPNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 1491 TQ LP+S + N+ A PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF Sbjct: 236 TQTLPSS---QQNQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 292 Query: 1490 DSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCEL 1311 DS++NVIL+FS+NRTRHFQG AKMTS+IGG GGNWK+ HGTAHYGRNFS+KWLKLCEL Sbjct: 293 DSVENVILVFSINRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSLKWLKLCEL 352 Query: 1310 SFHKTRHLRNPYNENLPVKISRDCQELEPFVGEQLASLLYLEPDSELMAISVXXXXXXXX 1131 SF KTRHLRNPYNENLPVKISRDCQELE VGEQLASLLY+EPDSELMA+S+ Sbjct: 353 SFQKTRHLRNPYNENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAVSLAAESKREE 412 Query: 1130 XXXKGVNLDDETENPDIVPF-XXXXXXXXXXXXXXXXXSFSQTLS-AAQXXXXXXGMMWA 957 KGVN D+ ENPDIVPF F Q AA G++W Sbjct: 413 ERAKGVNPDNGNENPDIVPFEDNEEEEEEESEEEEEDEGFGQAFGPAALGRGRGRGIVWP 472 Query: 956 PHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSG 777 P +P RGARP PG+RGFPP MM DGF+YG++TPDGFPMPD +GM R F P+GPRF G Sbjct: 473 PLVPFGRGARPFPGMRGFPPGMM-SDGFSYGSMTPDGFPMPDPYGMGGRPFGPFGPRFPG 531 Query: 776 DLSGLGQSSAMGFTPVDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXX 597 D+ + A G G G MM GRP G + Sbjct: 532 DMMFHSRPPAAG-----GFGM---MMGPGRPPFMGGM----------------------- 560 Query: 596 XMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLDMGQEMAGPGMLDDGK 417 P R P +QN VKKDQR P + + G D G+ Sbjct: 561 GPGAPGPPRGGRPMGIHPSFIPPTPPPSQNPR---VKKDQRAPFNERNDRFSSGP-DQGR 616 Query: 416 YQSGIKVQCEDSFGG----------RNSFRNDESESEDEAPRRSRHGEGKKRKDS 282 Q + S GG NSFRNDESESEDEAPRRSRHG+GKK+K+S Sbjct: 617 GQ-----EIAGSVGGPAEGVHYPQTENSFRNDESESEDEAPRRSRHGDGKKKKNS 666