BLASTX nr result
ID: Anemarrhena21_contig00012264
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00012264
         (1550 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_008799098.1| PREDICTED: zinc finger CCCH domain-containin...   669   0.0
ref|XP_010926956.1| PREDICTED: zinc finger CCCH domain-containin...   657   0.0
ref|XP_008775232.1| PREDICTED: zinc finger CCCH domain-containin...   652   0.0
ref|XP_010941538.1| PREDICTED: zinc finger CCCH domain-containin...   647   0.0
ref|XP_010941539.1| PREDICTED: zinc finger CCCH domain-containin...   628   e-177
ref|XP_007041140.1| Cleavage and polyadenylation specificity fac...   603   e-169
ref|XP_009419568.1| PREDICTED: zinc finger CCCH domain-containin...   603   e-169
ref|XP_010241185.1| PREDICTED: cleavage and polyadenylation spec...   601   e-169
ref|XP_002281594.1| PREDICTED: 30-kDa cleavage and polyadenylati...   601   e-169
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   601   e-169
ref|XP_006846022.1| PREDICTED: 30-kDa cleavage and polyadenylati...   597   e-168
ref|XP_012436534.1| PREDICTED: 30-kDa cleavage and polyadenylati...   597   e-168
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   597   e-167
ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phas...   596   e-167
ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm...   594   e-167
ref|XP_012569987.1| PREDICTED: 30-kDa cleavage and polyadenylati...   593   e-166
gb|KJB47903.1| hypothetical protein B456_008G046800 [Gossypium r...   592   e-166
ref|XP_011085214.1| PREDICTED: 30-kDa cleavage and polyadenylati...   592   e-166
ref|XP_008459517.1| PREDICTED: cleavage and polyadenylation spec...
588 e-165 gb|KDO75297.1| hypothetical protein CISIN_1g005338mg [Citrus sin... 588 e-165 >ref|XP_008799098.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like [Phoenix dactylifera] Length = 697 Score = 669 bits (1727), Expect = 0.0 Identities = 336/456 (73%), Positives = 357/456 (78%), Gaps = 8/456 (1%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNAT----ASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSV 1182 M+D +GALSFDFEGGLD +SAPT+L AS A G + Sbjct: 1 MDDADGALSFDFEGGLDAGAPAPASSAPTSLMASDPT------------VAAANAGAAAG 48 Query: 1181 HGDPAASSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGEC 1002 G + G GG RR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGEC Sbjct: 49 PGPSDLAGGGGGPGRRTFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGEC 108 Query: 1001 REQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNY 822 REQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPP+EEV QKIQHLSSFNY Sbjct: 109 REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLSSFNY 168 Query: 821 GSGNRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAA---XXXXXXXXXXXXXXXX 651 GS NRF+QH+N GY+QQAEKPQF GS GANQ AVKPP + Sbjct: 169 GSSNRFYQHRNTGYNQQAEKPQFSQGSAGANQNAAVKPPISVEPPNVQPPQSQIQQSQQQ 228 Query: 650 XXXXXXXXXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRS 471 NI NGL N RTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQ+S Sbjct: 229 PPQPTTENPVQNISNGLLNQATRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQKS 288 Query: 470 NEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFS 291 NEAKLNEAFESSENVILIFS+NRTRHFQGCAKMTSKIGG+IGGGNWKYAHGTAHYGRNFS Sbjct: 289 NEAKLNEAFESSENVILIFSINRTRHFQGCAKMTSKIGGYIGGGNWKYAHGTAHYGRNFS 348 Query: 290 VKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEML 111 VKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDG+LM ML Sbjct: 349 VKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGELMAML 408 Query: 110 XXXXXXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG+S+DDA+DNPDIVLFEDNEEEE Sbjct: 409 IAAESKREEEKAKGVSTDDATDNPDIVLFEDNEEEE 444 >ref|XP_010926956.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like [Elaeis guineensis] Length = 686 
Score = 657 bits (1695), Expect = 0.0 Identities = 331/452 (73%), Positives = 353/452 (78%), Gaps = 4/452 (0%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDP 1170 M+D +GALSFDFEGGLD A AP + +++ P A T GDP Sbjct: 1 MDDADGALSFDFEGGLD---AGAPAHASSA---PASLMPSDPTVAAANAGTAAAPGPGDP 54 Query: 1169 AASSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 990 A GPG RR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQD Sbjct: 55 VAGGGPG---RRTFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQD 111 Query: 989 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGSGN 810 CVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KL GPPPP+EEV QKIQHLSSFNYGS N Sbjct: 112 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLLGPPPPVEEVLQKIQHLSSFNYGSSN 171 Query: 809 RFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAA---XXXXXXXXXXXXXXXXXXXX 639 RFFQH+N GY+QQAEK QF GS +NQ AV+PP + Sbjct: 172 RFFQHRNTGYNQQAEKAQFVQGSAVSNQNAAVRPPPSVEPPNVQQPQSQIQQSQQQPPQP 231 Query: 638 XXXXXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAK 459 NI NGL N RTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQ+SNEAK Sbjct: 232 TTENPVQNISNGLLNQATRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQKSNEAK 291 Query: 458 LNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKWL 279 LNEAFESSENVILIFS+NRTRHFQGCAKMTSKIGG+IGGGNWKYAHGTAHYGRNFSVKWL Sbjct: 292 LNEAFESSENVILIFSINRTRHFQGCAKMTSKIGGYIGGGNWKYAHGTAHYGRNFSVKWL 351 Query: 278 KLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXXX 99 KLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPD +LM ML Sbjct: 352 KLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDSELMAMLIAAE 411 Query: 98 XXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG+S+DDA+DNPDIVLFEDNEEEE Sbjct: 412 SKREEEKAKGVSTDDATDNPDIVLFEDNEEEE 443 >ref|XP_008775232.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like [Phoenix dactylifera] Length = 696 Score = 652 bits (1682), Expect = 0.0 Identities = 326/453 (71%), Positives = 353/453 (77%), Gaps = 5/453 (1%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDP 1170 MED EGALSFDFEGGLD 
AP + +++ + + T V GD Sbjct: 1 MEDAEGALSFDFEGGLDTG---APAHASSAPASLMPSDPTAAAANAGAVATPVA---GDA 54 Query: 1169 AASSG--PGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRE 996 A++ G PG RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFR+YGECRE Sbjct: 55 ASTGGNIPG---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRIYGECRE 111 Query: 995 QDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGS 816 QDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPP++EVFQKIQHLS+FNYGS Sbjct: 112 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVDEVFQKIQHLSAFNYGS 171 Query: 815 GNRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPP--AAXXXXXXXXXXXXXXXXXXX 642 NR+FQH+N Y+QQ+E+PQ GS ANQ A KPP Sbjct: 172 SNRYFQHRNTSYNQQSERPQLSQGSAVANQNAAAKPPIPVELSNVQQPQSQIQQSQQPPQ 231 Query: 641 XXXXXXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEA 462 +I NGLS RTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEA Sbjct: 232 PPADNQVQHISNGLSKQATRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEA 291 Query: 461 KLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKW 282 KLNEAFESSENVILIFS+NRTRHFQGCAKMTSKIGG++GGGNWKYAHGTAHYGRNFSVKW Sbjct: 292 KLNEAFESSENVILIFSINRTRHFQGCAKMTSKIGGYVGGGNWKYAHGTAHYGRNFSVKW 351 Query: 281 LKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXX 102 LKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPD +LM ML Sbjct: 352 LKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDSELMAMLIAA 411 Query: 101 XXXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG+S+D+A+DNPDIVLFEDNEEEE Sbjct: 412 ESKCEEEKAKGVSTDEAADNPDIVLFEDNEEEE 444 >ref|XP_010941538.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like isoform X1 [Elaeis guineensis] Length = 683 Score = 647 bits (1668), Expect = 0.0 Identities = 329/458 (71%), Positives = 348/458 (75%), Gaps = 10/458 (2%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNA----TASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSV 1182 MED EGALSFDFEGGLD +SAP +L S G+V Sbjct: 1 MEDPEGALSFDFEGGLDAGGPAHASSAPASLMPSDPTAAA--------------ANAGAV 46 Query: 1181 HGDPAASSGPGGNQ---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLY 1011 A + P G 
RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLY Sbjct: 47 APPVAGDAAPSGGNIQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLY 106 Query: 1010 GECREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSS 831 GECREQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPP+EEVFQKIQHLS+ Sbjct: 107 GECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVFQKIQHLSA 166 Query: 830 FNY-GSGNRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKP-PAAXXXXXXXXXXXXXX 657 FNY GS NR+FQH+N Y+QQ+E+PQ GS ANQ A KP P Sbjct: 167 FNYYGSSNRYFQHRNTSYNQQSERPQLSQGSAVANQNAAAKPIPVEPSNVQQPQTQIQQS 226 Query: 656 XXXXXXXXXXXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQ 477 NI N L N RTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQ Sbjct: 227 QPPPQPPPENQVQNISNALLNQATRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQ 286 Query: 476 RSNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRN 297 RSNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGG++GGGNWKYAHGTAHYGRN Sbjct: 287 RSNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGYVGGGNWKYAHGTAHYGRN 346 Query: 296 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLME 117 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPD +LM Sbjct: 347 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDSELMA 406 Query: 116 MLXXXXXXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 ML KG+S+D+A+DNPDIVLFEDNEEEE Sbjct: 407 MLIAAESKRDEEKAKGVSTDEAADNPDIVLFEDNEEEE 444 >ref|XP_010941539.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like isoform X2 [Elaeis guineensis] Length = 677 Score = 628 bits (1620), Expect = e-177 Identities = 323/458 (70%), Positives = 342/458 (74%), Gaps = 10/458 (2%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNA----TASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSV 1182 MED EGALSFDFEGGLD +SAP +L S G+V Sbjct: 1 MEDPEGALSFDFEGGLDAGGPAHASSAPASLMPSDPTAAA--------------ANAGAV 46 Query: 1181 HGDPAASSGPGGNQ---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLY 1011 A + P G RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLY Sbjct: 47 APPVAGDAAPSGGNIQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLY 106 Query: 1010 GECREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSS 831 
GECREQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPP+EEVFQKIQHLS+ Sbjct: 107 GECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVFQKIQHLSA 166 Query: 830 FNY-GSGNRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKP-PAAXXXXXXXXXXXXXX 657 FNY GS NR+FQH+N Y+QQ+E+PQ GS ANQ A KP P Sbjct: 167 FNYYGSSNRYFQHRNTSYNQQSERPQLSQGSAVANQNAAAKPIPVEPSNVQQPQTQIQQS 226 Query: 656 XXXXXXXXXXXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQ 477 NI N L N RTASPLPQGQS SCNRENLEISVQQGVWATQ Sbjct: 227 QPPPQPPPENQVQNISNALLNQATRTASPLPQGQS------SCNRENLEISVQQGVWATQ 280 Query: 476 RSNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRN 297 RSNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGG++GGGNWKYAHGTAHYGRN Sbjct: 281 RSNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGYVGGGNWKYAHGTAHYGRN 340 Query: 296 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLME 117 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPD +LM Sbjct: 341 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDSELMA 400 Query: 116 MLXXXXXXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 ML KG+S+D+A+DNPDIVLFEDNEEEE Sbjct: 401 MLIAAESKRDEEKAKGVSTDEAADNPDIVLFEDNEEEE 438 >ref|XP_007041140.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao] gi|508705075|gb|EOX96971.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao] Length = 698 Score = 603 bits (1555), Expect = e-169 Identities = 300/447 (67%), Positives = 331/447 (74%) Frame = -1 Query: 1343 EDEGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDPAA 1164 + EG LSFDFEGGLD A+ ++ + +P S + DPAA Sbjct: 3 DSEGGLSFDFEGGLDAGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTN-DPAA 61 Query: 1163 SSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCV 984 + G GG RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECREQDCV Sbjct: 62 AVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCV 121 Query: 983 YKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGSGNRF 804 YKHTN+DIKECNMYKLGFCPNG DCRYRH KLPGPPPP+EEV QKIQ LSS+NY N+F Sbjct: 122 YKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNY---NKF 178 
Query: 803 FQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXXXXXXXXX 624 FQ +N G++QQ EK Q P G NQ KP Sbjct: 179 FQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTT---ESANMHPQQQVQQPQQQVSQTQ 235 Query: 623 XXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNEAF 444 N+PNG SN N+TA PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAKLNEAF Sbjct: 236 IQNVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 295 Query: 443 ESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKWLKLCEL 264 +S+ENVILIFSVNRTRHFQGCAKMTSKIGG + GGNWKYAHGTAHYGRNFSVKWLKLCEL Sbjct: 296 DSAENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCEL 355 Query: 263 SFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXXXXXXXX 84 SF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + Sbjct: 356 SFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAELKREE 415 Query: 83 XXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++SD+ +NPDIV FEDNEEEE Sbjct: 416 EKAKGVNSDNGGENPDIVPFEDNEEEE 442 >ref|XP_009419568.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like [Musa acuminata subsp. 
malaccensis] Length = 700 Score = 603 bits (1554), Expect = e-169 Identities = 308/458 (67%), Positives = 337/458 (73%), Gaps = 11/458 (2%) Frame = -1 Query: 1343 EDEGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDPAA 1164 E EG+L+FDFEGGLD A AP+ ++ PL A + G+ D A Sbjct: 3 EPEGSLNFDFEGGLDVA---APSVAAVAASGPLAPSDPTAAAASAGASSPSGTA--DRMA 57 Query: 1163 SSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCV 984 +G + RRSFRQTVCRHWLR LCMKGDACGFLHQYDK RMPVCRFFR YGECREQDCV Sbjct: 58 VAGGNVSGRRSFRQTVCRHWLRGLCMKGDACGFLHQYDKDRMPVCRFFRQYGECREQDCV 117 Query: 983 YKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGSGNRF 804 YKHTN+DIKECNMYK GFCPNGPDCRYRH KLPGPPPP+EEV QKIQHL+S YGS NRF Sbjct: 118 YKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSA-YGSSNRF 176 Query: 803 FQHKNIG--YSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXXXXXXX 630 + H+N Y+QQ +K Q G NQ T VKP ++ Sbjct: 177 YHHRNNNNSYNQQPDKNQLSSTPGLPNQNTGVKPVSSFEPSDVKLPQSLVQQSEQQQQQQ 236 Query: 629 XXXXN---------IPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQ 477 I N LSN T RTASPLPQGQSRYFIVKSCNRENLEISVQQG+WATQ Sbjct: 237 QQLPIPSLENQVPSISNALSNQTVRTASPLPQGQSRYFIVKSCNRENLEISVQQGMWATQ 296 Query: 476 RSNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRN 297 RSNEAKLNEAFES+ENVILIFS+N+TRHFQGC KMTS+IGGF+GGGNWKY+HGTAHYGRN Sbjct: 297 RSNEAKLNEAFESTENVILIFSINKTRHFQGCGKMTSRIGGFVGGGNWKYSHGTAHYGRN 356 Query: 296 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLME 117 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPD +LM Sbjct: 357 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDSELMA 416 Query: 116 MLXXXXXXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 ML KG +D+A+DNPDIVLFEDNEEEE Sbjct: 417 MLVAAESKRDEEKAKGGGADEATDNPDIVLFEDNEEEE 454 >ref|XP_010241185.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30 [Nelumbo nucifera] Length = 715 Score = 601 bits (1549), Expect = e-169 Identities = 306/452 (67%), Positives = 331/452 (73%), Gaps = 4/452 (0%) Frame = -1 Query: 1346 
MED-EGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDP 1170 MED EG LSFDFEGGLDN PTN S+ PL + V +P Sbjct: 1 MEDPEGVLSFDFEGGLDNG----PTNPTPSA--PLIPADSSIAAAA---NSAVAPAVVEP 51 Query: 1169 AASSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 990 A G RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFR+YGECREQD Sbjct: 52 VAGGHAG---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQD 108 Query: 989 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGSGN 810 CVYKHTN+DIKECNMYK GFCPNGPDCRYRH K PGPPPP+EEVFQKIQHL SFNYGS N Sbjct: 109 CVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKQPGPPPPVEEVFQKIQHLGSFNYGSSN 168 Query: 809 RFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXXXXXXX 630 RFFQ + Y Q+E+ QFP GS NQ A KP A Sbjct: 169 RFFQQRIGSYVPQSERSQFPQGSSNVNQGIASKPSTAAESPNVQQQQQQSQIQQPQQQQQ 228 Query: 629 XXXXNI---PNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAK 459 + NGL N +RTA+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAK Sbjct: 229 VNQTQMQNPQNGLPNQASRTATPLPQGSSRYFIVKSCNRENLELSVQQGVWATQRSNEAK 288 Query: 458 LNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKWL 279 LNEAF+S ENVILIFSVNRTRHFQGCAKMTSKIGG +GGGNWKYAHGTAHYGRNFSVKWL Sbjct: 289 LNEAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWL 348 Query: 278 KLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXXX 99 KLCELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + Sbjct: 349 KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAE 408 Query: 98 XXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++ D+ +DN DIV FEDNE+EE Sbjct: 409 SKREEEKAKGVNPDEGADNHDIVPFEDNEDEE 440 >ref|XP_002281594.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30 [Vitis vinifera] Length = 673 Score = 601 bits (1549), Expect = e-169 Identities = 308/449 (68%), Positives = 330/449 (73%), Gaps = 1/449 (0%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDP 1170 MED EG LSFDFEGGLD A +A T V PL V +P Sbjct: 1 MEDAEGVLSFDFEGGLDAAPGTAAT------VAPLIQSDATAAAAAPS-----SVVSAEP 49 Query: 1169 
AASSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 990 PG RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQD Sbjct: 50 TPGGAPG---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQD 106 Query: 989 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGSGN 810 CVYKHTN+DIKECNMYKLGFCPNG DCRYRH KLPGPPP +EEVFQKIQ LSSFNYGS N Sbjct: 107 CVYKHTNEDIKECNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSN 166 Query: 809 RFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXXXXXXX 630 RF+Q++N Y+QQ EK Q GS N T K Sbjct: 167 RFYQNRN-PYNQQTEKSQILQGSNAVNLGTVAKSSTT----EAINVQQQQVQPPQQQVSQ 221 Query: 629 XXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNE 450 N+PNGL N N+TASPLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAKLNE Sbjct: 222 TPMQNLPNGLPNQANKTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 281 Query: 449 AFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKWLKLC 270 AF+S ENVILIFSVNRTRHFQGCAKMTSKIGGF+GGGNWKYAHGTAHYGRNFSVKWLKLC Sbjct: 282 AFDSVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLC 341 Query: 269 ELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXXXXXX 90 ELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + Sbjct: 342 ELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKR 401 Query: 89 XXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++ D+ +NPDIV FEDNEEEE Sbjct: 402 EEEKAKGVNPDNGGENPDIVPFEDNEEEE 430 >ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Glycine max] Length = 691 Score = 601 bits (1549), Expect = e-169 Identities = 302/449 (67%), Positives = 330/449 (73%), Gaps = 1/449 (0%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDP 1170 MED EG LSFDFEGGLD A +SA + + + S DP Sbjct: 1 MEDSEGVLSFDFEGGLDAAPSSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPST-ADP 59 Query: 1169 AASSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 990 A + PG RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD Sbjct: 60 AGGNVPG---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 116 Query: 989 
CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGSGN 810 CVYKHTN+DIKECNMYKLGFCPNGPDCRYRH K PGPPPP+EEV QKIQHL S+NY S N Sbjct: 117 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNYNSSN 176 Query: 809 RFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXXXXXXX 630 +FFQ + Y+QQAEKPQ P G+ NQ KP A Sbjct: 177 KFFQQRGASYNQQAEKPQLPQGTNSTNQGVTGKPLPA---ESGNAQPQQQVQQSQQQVNQ 233 Query: 629 XXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNE 450 N+ NG N NRTA+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNE+KLNE Sbjct: 234 SQMQNVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNE 293 Query: 449 AFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKWLKLC 270 AF+S ENVIL+FSVNRTRHFQGCAKMTS+IGG + GGNWKYAHGTAHYGRNFSVKWLKLC Sbjct: 294 AFDSVENVILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLC 353 Query: 269 ELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXXXXXX 90 ELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + Sbjct: 354 ELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKR 413 Query: 89 XXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++ D+ +NPDIV FEDNEEEE Sbjct: 414 EEEKAKGVNPDNGGENPDIVPFEDNEEEE 442 >ref|XP_006846022.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30 [Amborella trichopoda] gi|548848778|gb|ERN07697.1| hypothetical protein AMTR_s00155p00079840 [Amborella trichopoda] Length = 701 Score = 597 bits (1540), Expect = e-168 Identities = 300/452 (66%), Positives = 336/452 (74%), Gaps = 5/452 (1%) Frame = -1 Query: 1343 EDEGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDPAA 1164 E EG LSFDFEGGL+ PT ++ +P + V DPAA Sbjct: 3 EPEGGLSFDFEGGLETTNNPNPTAISLIQNDP----------NAPISSNSVAGNLPDPAA 52 Query: 1163 SSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCV 984 + G RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQDCV Sbjct: 53 MNLQG---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCV 109 Query: 983 YKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSS-FNYGSGNR 807 YKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPP+EE+FQKIQ LSS FN GS NR Sbjct: 
110 YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEIFQKIQQLSSSFNQGSSNR 169 Query: 806 FFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXXXXXXXX 627 FFQH+N GY Q +K Q GS NQ A+KP +A Sbjct: 170 FFQHRNTGYVPQVDKNQMQQGSAVVNQGAALKP-SATVDSSGSQQQQQQIQQPQQNASPN 228 Query: 626 XXXNIPNGLSNPTNRT--ASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLN 453 ++PNGL NP NR ASPLPQGQSRYFIVKSCNRENLE+SVQ+G+WATQRSNE+KLN Sbjct: 229 QMQSMPNGLLNPINRVSAASPLPQGQSRYFIVKSCNRENLELSVQKGIWATQRSNESKLN 288 Query: 452 EAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKWLKL 273 EAF+SSENV+LIFS+NRTRHFQGCAKMTSKIGG++GGG WKYAHGTAHYGRNFS+KWLKL Sbjct: 289 EAFDSSENVVLIFSINRTRHFQGCAKMTSKIGGYVGGGGWKYAHGTAHYGRNFSLKWLKL 348 Query: 272 CELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXXXXX 93 CELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + Sbjct: 349 CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIAVAAKSK 408 Query: 92 XXXXXXKGLS--SDDASDNPDIVLFEDNEEEE 3 KG+S D S+NP+IV FEDN+++E Sbjct: 409 REEERAKGVSPGGGDGSENPEIVPFEDNDDDE 440 >ref|XP_012436534.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30 [Gossypium raimondii] gi|763780831|gb|KJB47902.1| hypothetical protein B456_008G046800 [Gossypium raimondii] Length = 700 Score = 597 bits (1539), Expect = e-168 Identities = 307/457 (67%), Positives = 336/457 (73%), Gaps = 9/457 (1%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNA----TASAPT-NLNASSVNPLXXXXXXXXXXXXXAQTGVGS 1185 M+D EG LSFDFEGGLD TAS P N + S+ N GV + Sbjct: 1 MDDAEGGLSFDFEGGLDAGPPAPTASMPVVNSDPSAANNTNNFTAPG---------GVQA 51 Query: 1184 VHGDPAASSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGE 1005 DP A+ G GG RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GE Sbjct: 52 SINDPVANQG-GGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGE 110 Query: 1004 CREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFN 825 CREQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPP+EEV QKIQ LS++N Sbjct: 111 CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLSAYN 170 Query: 824 
YGSGNRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXX 645 Y N+F+Q +N G+ QQ EK Q P NQ A KP A Sbjct: 171 Y--NNKFYQQRNAGFPQQTEKSQIPQAQNNVNQGAAGKPSATESTNVQQQQLQQQQQQIQ 228 Query: 644 XXXXXXXXXNI---PNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQR 474 I PNG SN NRTA PLPQG SRYFIVKSCNRENLE+SVQQGVWATQR Sbjct: 229 QPQQQVSQTQIQNVPNGQSNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQR 288 Query: 473 SNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNF 294 SNEAKLNEAF+S+ENVIL+FSVNRTRHFQGCAKMTSKIGG + GGNWKYAHGTAHYGRNF Sbjct: 289 SNEAKLNEAFDSAENVILVFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNF 348 Query: 293 SVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEM 114 SVKWLKLCELSF+KT HLRNPYN+NLPVKISRDCQELEP +GEQLASLLYLEPD +LM + Sbjct: 349 SVKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAI 408 Query: 113 LXXXXXXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++SD+A +NPDIV FEDNEEEE Sbjct: 409 SLAAESKREEEKAKGVNSDNA-ENPDIVPFEDNEEEE 444 >ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Glycine max] Length = 681 Score = 597 bits (1538), Expect = e-167 Identities = 303/452 (67%), Positives = 330/452 (73%), Gaps = 4/452 (0%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDP 1170 MED EG LSFDFEGGLD +AP++ A+ PL + G P Sbjct: 1 MEDSEGVLSFDFEGGLD----AAPSSAAAAPSGPLIPHDSSAAASAV---SNGGPAAPAP 53 Query: 1169 AASSGPGGNQ---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECR 999 +A GG RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECR Sbjct: 54 SAVDPVGGGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECR 113 Query: 998 EQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYG 819 EQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH K PGPPPP+EEV QKIQHL S+NY Sbjct: 114 EQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYN 173 Query: 818 SGNRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXXXX 639 S N+FFQ + Y+QQAEKP P G+ NQ P A Sbjct: 174 SSNKFFQQRGASYNQQAEKPLLPQGNNSTNQGVTGNPLPA---ELGNAQPQQQVQQSQQQ 230 Query: 638 
XXXXXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAK 459 N+ NG N NRTA+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNE+K Sbjct: 231 VNQSQMQNVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESK 290 Query: 458 LNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKWL 279 LNEAF+S ENVILIFSVNRTRHFQGCAKMTSKIGG + GGNWKYAHGTAHYGRNFSVKWL Sbjct: 291 LNEAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWL 350 Query: 278 KLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXXX 99 KLCELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + Sbjct: 351 KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAE 410 Query: 98 XXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++ D+ +NPDIV FEDNEEEE Sbjct: 411 SKREEEKAKGVNPDNGGENPDIVPFEDNEEEE 442 >ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris] gi|561020727|gb|ESW19498.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris] Length = 697 Score = 596 bits (1536), Expect = e-167 Identities = 301/449 (67%), Positives = 330/449 (73%), Gaps = 1/449 (0%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDP 1170 MED EG LSFDFEGGLD A ++A + A T G+ +P Sbjct: 1 MEDSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSGT---EP 57 Query: 1169 AASSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 990 AA + PG RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD Sbjct: 58 AAVNVPG---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 114 Query: 989 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGSGN 810 CVYKHTN+DIKECNMYKLGFCPNGPDCRYRH K PGPPPP+EEV QKIQHL S+NY S N Sbjct: 115 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSN 174 Query: 809 RFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXXXXXXX 630 +FFQ + Y+QQAEK Q P G+ NQ KP A Sbjct: 175 KFFQQRGSSYTQQAEKSQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQQVSQNQ 234 Query: 629 XXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNE 450 + NG N +R A+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNE+KLNE Sbjct: 235 
IQN--VANGQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNE 292 Query: 449 AFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKWLKLC 270 AF+S ENVILIFSVNRTRHFQGCAKMTS+IGG + GGNWKYAHGTAHYGRNFSVKWLKLC Sbjct: 293 AFDSVENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLC 352 Query: 269 ELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXXXXXX 90 ELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPDG+LM + Sbjct: 353 ELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKR 412 Query: 89 XXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++ D+ +NPDIV FEDNEEEE Sbjct: 413 EEEKAKGVNPDNGGENPDIVPFEDNEEEE 441 >ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis] gi|223537608|gb|EEF39232.1| conserved hypothetical protein [Ricinus communis] Length = 702 Score = 594 bits (1531), Expect = e-167 Identities = 300/455 (65%), Positives = 331/455 (72%), Gaps = 8/455 (1%) Frame = -1 Query: 1343 EDEGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHG-DPA 1167 + +G LSFDFEGGLD+ S PTN AS P + V +V DPA Sbjct: 3 DTDGGLSFDFEGGLDS---SGPTNPTASI--PAIPSDNTAAVAAATNNSIVPNVSSNDPA 57 Query: 1166 ASSGPGGNQ---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRE 996 +++ N RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECRE Sbjct: 58 SAAAAAANNQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECRE 117 Query: 995 QDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGS 816 QDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPP+EEV QKIQ L+S+NYGS Sbjct: 118 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGS 177 Query: 815 GNRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPP----AAXXXXXXXXXXXXXXXXX 648 N+FFQ + G+ Q A+K QF G Q A KPP A Sbjct: 178 SNKFFQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGTESANVQQPQQQQPQPGQGQQS 237 Query: 647 XXXXXXXXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSN 468 N+PNG N NRTA PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSN Sbjct: 238 QQQATQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSN 297 Query: 467 EAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSV 288 EAKLNEAF+S+ENVILIFSVNRTRHFQGCAKMTSKIG 
+GGGNWKYAHGTAHYGRNFSV Sbjct: 298 EAKLNEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSV 357 Query: 287 KWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLX 108 KWLKLCELSF+KT HLRNPYN+NLPVKISRDCQELEP +G QLA LLY EPD +LM + Sbjct: 358 KWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAISL 417 Query: 107 XXXXXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++ ++ DNPDIV FEDNEEEE Sbjct: 418 AAEAKREEEKAKGVNPENGGDNPDIVPFEDNEEEE 452 >ref|XP_012569987.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30 [Cicer arietinum] Length = 677 Score = 593 bits (1529), Expect = e-166 Identities = 302/450 (67%), Positives = 332/450 (73%), Gaps = 2/450 (0%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNATASAPT-NLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGD 1173 MED EG LSFDFEGGLD A SA T ++ A P+ G +V G+ Sbjct: 1 MEDSEGVLSFDFEGGLDAAPPSAATVSVPAPPSGPIVHPDSSLPPSIS--SNGAAAVSGN 58 Query: 1172 PAASSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQ 993 PG RRSFRQTVCRHWLRSLCMKG+ACGFLHQYDKARMPVCRFFRLYGECREQ Sbjct: 59 I-----PG---RRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYGECREQ 110 Query: 992 DCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGSG 813 DCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH K PGPPPP+EEV QKIQHL S+N+ + Sbjct: 111 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSYNFNNS 170 Query: 812 NRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXXXXXX 633 ++F Q + Y+QQ EK QFP G ANQ A KP AA Sbjct: 171 HKFIQQRGSSYTQQVEKSQFPQGINSANQGVAGKPLAAESGNVQQQQQVQQSQQQVSQIQ 230 Query: 632 XXXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLN 453 + NG N NRTA+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNE+KLN Sbjct: 231 TQN---LANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLN 287 Query: 452 EAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKWLKL 273 EAF+S ENVILIFSVNRTRHFQGCAKMTS+IGG + GGNWKYAHGTAHYGRNFSVKWLKL Sbjct: 288 EAFDSVENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKL 347 Query: 272 CELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXXXXX 93 CELSF+KT HLRNPYN+NLPVKISRDCQELEP 
IGEQLASLLYLEPD +LM + Sbjct: 348 CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESK 407 Query: 92 XXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++ D+A +NPDIV FEDNEEEE Sbjct: 408 REEEKAKGVNPDNAGENPDIVPFEDNEEEE 437 >gb|KJB47903.1| hypothetical protein B456_008G046800 [Gossypium raimondii] Length = 701 Score = 592 bits (1527), Expect = e-166 Identities = 307/458 (67%), Positives = 336/458 (73%), Gaps = 10/458 (2%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNA----TASAPT-NLNASSVNPLXXXXXXXXXXXXXAQTGVGS 1185 M+D EG LSFDFEGGLD TAS P N + S+ N GV + Sbjct: 1 MDDAEGGLSFDFEGGLDAGPPAPTASMPVVNSDPSAANNTNNFTAPG---------GVQA 51 Query: 1184 VHGDPAASSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGE 1005 DP A+ G GG RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GE Sbjct: 52 SINDPVANQG-GGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGE 110 Query: 1004 CREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFN 825 CREQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPP+EEV QKIQ LS++N Sbjct: 111 CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLSAYN 170 Query: 824 YGSGNRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXX 645 Y N+F+Q +N G+ QQ EK Q P NQ A KP A Sbjct: 171 Y--NNKFYQQRNAGFPQQTEKSQIPQAQNNVNQGAAGKPSATESTNVQQQQLQQQQQQIQ 228 Query: 644 XXXXXXXXXNI---PNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQR 474 I PNG SN NRTA PLPQG SRYFIVKSCNRENLE+SVQQGVWATQR Sbjct: 229 QPQQQVSQTQIQNVPNGQSNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQR 288 Query: 473 SNEAKLNEAFESSENVILIFSVNRTRHFQ-GCAKMTSKIGGFIGGGNWKYAHGTAHYGRN 297 SNEAKLNEAF+S+ENVIL+FSVNRTRHFQ GCAKMTSKIGG + GGNWKYAHGTAHYGRN Sbjct: 289 SNEAKLNEAFDSAENVILVFSVNRTRHFQVGCAKMTSKIGGSVAGGNWKYAHGTAHYGRN 348 Query: 296 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLME 117 FSVKWLKLCELSF+KT HLRNPYN+NLPVKISRDCQELEP +GEQLASLLYLEPD +LM Sbjct: 349 FSVKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMA 408 Query: 116 MLXXXXXXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 + KG++SD+A +NPDIV FEDNEEEE Sbjct: 409 ISLAAESKREEEKAKGVNSDNA-ENPDIVPFEDNEEEE 445 
>ref|XP_011085214.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30-like [Sesamum indicum] Length = 688 Score = 592 bits (1527), Expect = e-166 Identities = 299/461 (64%), Positives = 333/461 (72%), Gaps = 14/461 (3%) Frame = -1 Query: 1343 EDEGALSFDFEGGLDNA----TASAPT--------NLNASSVNPLXXXXXXXXXXXXXAQ 1200 + EG LSFDFEGGLD TAS P +A+S NP Sbjct: 3 DGEGGLSFDFEGGLDTGPAHPTASVPVIQSSADAKTASAASGNP--------------NN 48 Query: 1199 TGVGSVHGDPAASS--GPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCR 1026 G V PAA + G GG RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCR Sbjct: 49 PSAGLV---PAAQTAEGMGGGARRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCR 105 Query: 1025 FFRLYGECREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKI 846 FFRLYGECREQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPP+EEV QKI Sbjct: 106 FFRLYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKI 165 Query: 845 QHLSSFNYGSGNRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXX 666 Q L+S+N+G+ N+FFQ++N Y+QQ EK Q P G G NQ P + Sbjct: 166 QQLTSYNHGNTNKFFQNRNTTYTQQTEKTQLPQGPNGVNQAGKTNPIESSNINQQAQVQQ 225 Query: 665 XXXXXXXXXXXXXXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVW 486 P G N +RTA+PLPQG SRYF+VKSCNRENLE+SVQQGVW Sbjct: 226 SQQQGSQGQIQNT-----PGGQQNQASRTATPLPQGTSRYFVVKSCNRENLELSVQQGVW 280 Query: 485 ATQRSNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHY 306 ATQRSNEAKLNEAFES ENVILIFSVN+TRHFQGCAKMTSKIGG +GGGNWK+AHGTAHY Sbjct: 281 ATQRSNEAKLNEAFESVENVILIFSVNKTRHFQGCAKMTSKIGGSVGGGNWKHAHGTAHY 340 Query: 305 GRNFSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQ 126 GRNF+VKWLKLCELSF+KT HL+NPYN+NLPVKISRDCQELEP +GEQLASLLYLEPD Sbjct: 341 GRNFAVKWLKLCELSFDKTRHLKNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSD 400 Query: 125 LMEMLXXXXXXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 LM + KG++ D+ ++NPDIV FEDNEEEE Sbjct: 401 LMAVSLAAELKREEEKAKGVNLDNGTENPDIVPFEDNEEEE 441 >ref|XP_008459517.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30-like [Cucumis melo] Length = 708 Score = 588 bits (1517), Expect = e-165 Identities = 297/454 (65%), 
Positives = 328/454 (72%), Gaps = 6/454 (1%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDP 1170 MED EG LSFDFEGGLD + PTN A+S PL + G Sbjct: 1 MEDSEGVLSFDFEGGLD----AGPTNPAATSSLPLINSDSSAPPAASAVSNSLSGALGPA 56 Query: 1169 AASSGPG---GN--QRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGE 1005 ++ PG GN RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRFFRLYGE Sbjct: 57 VSAEPPGAPPGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGE 116 Query: 1004 CREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFN 825 CREQDCVYKHTN+DIKECNMYK GFCPNGPDCRYRH KLPGPPPP+EE+ QKIQHL S+N Sbjct: 117 CREQDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPVEEILQKIQHLGSYN 176 Query: 824 YGSGNRFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXX 645 YG N+FF + +G SQQ EK QFP Q KP AA Sbjct: 177 YGPSNKFFTQRGVGLSQQNEKSQFPQVPAITTQGVTGKPSAA----ESANVQQQQGQQSA 232 Query: 644 XXXXXXXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNE 465 N+ NG N NR A+ LPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNE Sbjct: 233 PQASQTPVQNLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE 292 Query: 464 AKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVK 285 AKLNEAF++++NVILIFSVNRTRHFQGCAKM S+IGG + GGNWKYAHGTAHYG+NFS+K Sbjct: 293 AKLNEAFDTADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLK 352 Query: 284 WLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXX 105 WLKLCELSF KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPDG+LM + Sbjct: 353 WLKLCELSFQKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIA 412 Query: 104 XXXXXXXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++ D S+NPDIV FEDNEEEE Sbjct: 413 AESKREEEKAKGVNPDIGSENPDIVPFEDNEEEE 446 >gb|KDO75297.1| hypothetical protein CISIN_1g005338mg [Citrus sinensis] Length = 701 Score = 588 bits (1517), Expect = e-165 Identities = 298/449 (66%), Positives = 329/449 (73%), Gaps = 1/449 (0%) Frame = -1 Query: 1346 MED-EGALSFDFEGGLDNATASAPTNLNASSVNPLXXXXXXXXXXXXXAQTGVGSVHGDP 1170 MED EG LSFDFEGGLD A PT N + + A D Sbjct: 1 MEDSEGGLSFDFEGGLD-AGPGMPTASNPAIQSDSTAAAAAAAANANHAAPSSSGAAPDH 59 Query: 1169 
AASSGPGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 990 A++ P + RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECREQD Sbjct: 60 ASAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQD 119 Query: 989 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPLEEVFQKIQHLSSFNYGSGN 810 CVYKHTN+DIKECNMYKLGFCPNGPDCRYRHVKLPGPPP +EEV QKIQ +SS+N+G+ N Sbjct: 120 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPN 179 Query: 809 RFFQHKNIGYSQQAEKPQFPHGSGGANQPTAVKPPAAXXXXXXXXXXXXXXXXXXXXXXX 630 + FQ + +S Q +K QF G NQ A K A Sbjct: 180 KHFQQRG-AFSHQTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQ 238 Query: 629 XXXXNIPNGLSNPTNRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNE 450 +PNGL N TNR A+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAKLNE Sbjct: 239 MQN--LPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 296 Query: 449 AFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYAHGTAHYGRNFSVKWLKLC 270 AF+S+ENVILIFSVNRTRHFQGCAKMTSKIGG +GGGNWKYAHGTAHYGRNFSVKWLKLC Sbjct: 297 AFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLC 356 Query: 269 ELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLXXXXXXX 90 ELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLA+LLYLEPD +LM + Sbjct: 357 ELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKR 416 Query: 89 XXXXXKGLSSDDASDNPDIVLFEDNEEEE 3 KG++ D+ DNPDIV FEDNEEEE Sbjct: 417 EEEKAKGVNPDNGGDNPDIVPFEDNEEEE 445