BLASTX nr result
ID: Cinnamomum24_contig00003716
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum24_contig00003716 (3023 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_010916255.1| PREDICTED: nuclear poly(A) polymerase 1 [Ela...  1001   0.0
ref|XP_010265872.1| PREDICTED: poly(A) polymerase PAPalpha-like ...   999   0.0
ref|XP_009387260.1| PREDICTED: poly(A) polymerase PAPalpha isofo...   991   0.0
ref|XP_010257444.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) poly...   991   0.0
ref|XP_008775850.1| PREDICTED: poly(A) polymerase PAPalpha [Phoe...   981   0.0
ref|XP_002279968.2| PREDICTED: nuclear poly(A) polymerase 1 [Vit...   971   0.0
ref|XP_007036647.1| Poly(A) polymerase 1 isoform 1 [Theobroma ca...   968   0.0
ref|XP_011009627.1| PREDICTED: nuclear poly(A) polymerase 1 [Pop...   964   0.0
ref|XP_012486421.1| PREDICTED: nuclear poly(A) polymerase 1 isof...   956   0.0
ref|XP_002322074.2| hypothetical protein POPTR_0015s04100g [Popu...   956   0.0
gb|KJB37195.1| hypothetical protein B456_006G193600 [Gossypium r...   950   0.0
ref|XP_010110105.1| Poly(A) polymerase [Morus notabilis] gi|5879...   943   0.0
emb|CDO98397.1| unnamed protein product [Coffea canephora]            941   0.0
ref|XP_003534153.1| PREDICTED: poly(A) polymerase-like isoform X...   940   0.0
ref|XP_004512881.1| PREDICTED: nuclear poly(A) polymerase 1 [Cic...   936   0.0
ref|XP_011627554.1| PREDICTED: nuclear poly(A) polymerase 1 [Amb...   934   0.0
ref|XP_010036910.1| PREDICTED: poly(A) polymerase type 3 [Eucaly...   934   0.0
ref|XP_004137491.1| PREDICTED: nuclear poly(A) polymerase 1 isof...   934   0.0
ref|XP_007210342.1| hypothetical protein PRUPE_ppa001856mg [Prun...
934 0.0 ref|XP_009387262.1| PREDICTED: poly(A) polymerase PAPalpha isofo... 934 0.0 >ref|XP_010916255.1| PREDICTED: nuclear poly(A) polymerase 1 [Elaeis guineensis] Length = 768 Score = 1001 bits (2587), Expect = 0.0 Identities = 531/786 (67%), Positives = 584/786 (74%), Gaps = 34/786 (4%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 M S+GL R NG YLGVTEPIS GP+EFD+ KT ELEK LAD GLYESQEEAVSREEV Sbjct: 1 MASSGLAKRGNG--YLGVTEPISWSGPTEFDITKTHELEKYLADAGLYESQEEAVSREEV 58 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVKKVSRAKGFNEQ V EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 59 LGRLDQIVKVWVKKVSRAKGFNEQFVLEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 118 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF EL NML EMPEVTELHPVPDAHVPVMKFKF+GVSIDLLYAKLSLWVIPEDLD Sbjct: 119 TREEDFFTELHNMLAEMPEVTELHPVPDAHVPVMKFKFNGVSIDLLYAKLSLWVIPEDLD 178 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRC+RFWAKRRGVYSNV+GF Sbjct: 179 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCLRFWAKRRGVYSNVAGF 238 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYP ALPSMLV+RFFRVYTQWRWPNPVMLC IEEG+LGLP+WDPR Sbjct: 239 LGGINWALLVARICQLYPKALPSMLVSRFFRVYTQWRWPNPVMLCDIEEGTLGLPVWDPR 298 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 +N +DRLHQMPIITPAYPCMNSSYNV SSTLRVMTEEFQRG+EICE M+ NKADW+ LF Sbjct: 299 KNYKDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEEMEANKADWNKLFA 358 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 PYPFFEAYK+YLEIDITA NEDDLRKWKGWVESRLR LTLKIERHTFGMLQCHPHPGDFS Sbjct: 359 PYPFFEAYKHYLEIDITAANEDDLRKWKGWVESRLRTLTLKIERHTFGMLQCHPHPGDFS 418 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSRPFHCC+FMGLQRKQG P NEGEQFDIR TVEEFKHSVG YTLWK GMEIQVSHI+R Sbjct: 419 
DKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRVTVEEFKHSVGMYTLWKPGMEIQVSHIKR 478 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKA---------SDTVQADKNVGVADETRK 850 RN+P FVFPGG RPSRP K G +G ++ K K+ SDT+ + +AD++ Sbjct: 479 RNVPSFVFPGGTRPSRPLKAAGSEGHTISKTKSSSLVLAGKPSDTLSGSCDTHMADDSST 538 Query: 849 RK-LAEGNGESSFTHIKHFKAMDSGCGGTEE-SEICKSHTSVIGSCSLDSDA-------- 700 RK LA G D G+E S I + +S C+ +++ Sbjct: 539 RKQLAAGT-----------PVGDQVVQGSERCSPITMTSSSASSLCTKEAEGSAINLVGN 587 Query: 699 ----------ARQRQHVEDNNVKNNSTDGK---CHTASASEGVAGEGSEIGSALPIIAPG 559 +R+R+HVED + +NS D + H+A E V GS I +A + PG Sbjct: 588 ANGILNVTVESRKRKHVEDTD--SNSIDAQRLAAHSAKLPESVGMAGSGIIAA---VGPG 642 Query: 558 AATST--SREAEELAIEKIMSAQTINHTGFPEELDELEEYGMTDHERDLGGVMNGRANES 385 T++ S+EAE LAI+KI S N PE LDELE + ++D GV G + S Sbjct: 643 NCTASLCSKEAEALAIKKITSGSPTNLASLPEGLDELELFEPQGQDKDFDGVAGGCSVVS 702 Query: 384 LTTKPVQGLSVGSRDSCXXXXXXXXXXXXXXXXPCHTRVPASSSNPQRKPLIRLSLSSMA 205 K + VG P ASSSN QRKPLIRL LSS+ Sbjct: 703 SAAKDAP-MQVGKLHDSSKNEGIEELEPAELSAPTFGGPTASSSNTQRKPLIRLRLSSVV 761 Query: 204 KTTGTS 187 K S Sbjct: 762 KAADKS 767 >ref|XP_010265872.1| PREDICTED: poly(A) polymerase PAPalpha-like [Nelumbo nucifera] Length = 756 Score = 999 bits (2583), Expect = 0.0 Identities = 524/766 (68%), Positives = 601/766 (78%), Gaps = 13/766 (1%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 MGS GLN R+NG +LGVTEPISL GP+EFDVVKT+ELEK L D GLYESQEEAV+REEV Sbjct: 1 MGSPGLNVRNNG--HLGVTEPISLSGPTEFDVVKTRELEKFLVDAGLYESQEEAVAREEV 58 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK W+K VSRAKGFNEQLV EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 59 LGRLDQIVKKWIKMVSRAKGFNEQLVLEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 118 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF+EL NML EMPEV+ELHPVPDAHVPVM+FKF+GVSIDLLYAKLSLWVIPEDLD Sbjct: 119 TREEDFFIELHNMLAEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLD 178 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 
1723 ISQD +LQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF Sbjct: 179 ISQDMVLQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 238 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALPSMLV+RFFRV+TQWRWPNPVMLCAIEEG+LGLP+WDPR Sbjct: 239 LGGINWALLVARICQLYPNALPSMLVSRFFRVFTQWRWPNPVMLCAIEEGTLGLPVWDPR 298 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 +N RDRLHQMPIITPAYPCMNSSYNV SSTLRVMTEEFQRGNEICEAM+ NKADW+TLFE Sbjct: 299 KNYRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGNEICEAMEKNKADWNTLFE 358 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 P FFEAYKNYL+I+I+AEN+D LRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS Sbjct: 359 PCRFFEAYKNYLQIEISAENDDHLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 418 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSR FHCC+FMGL+ KQG EG+QFDIRATVE+FKHSVG YTLWK GMEI VSHI+R Sbjct: 419 DKSRLFHCCYFMGLRLKQGVSMQEGKQFDIRATVEDFKHSVGLYTLWKPGMEIYVSHIKR 478 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKNVGVA-------DETRKRK 844 RNIPLFVFP GVRPSR AK ++GKS PK +++ A+++ +A D+ RKRK Sbjct: 479 RNIPLFVFPDGVRPSRSAKE-AWEGKSASNPKLCNSISAEESCEIATGSMDGTDDIRKRK 537 Query: 843 LAEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSDAARQRQHVEDNNV 664 L++ NGE++ K A ++ G S S + I S+ +A Q +++N+ Sbjct: 538 LSDDNGENNPRSTKFLAATNTAYGVLGGS---GSGSPPIVRTSVREEAREGGQ--QEDNL 592 Query: 663 KNNSTDGKCHTASASEGVAGEGSEIGSALPIIAPGAAT---STSREAEELAIEKIMSAQT 493 + +S + C T ++ + E E + P +A S ++EAE+LAIEKI S + Sbjct: 593 RGSSINATCPTEITTD-IGREAEEPARCSQSVGPPSANSGLSCTKEAEKLAIEKIASGPS 651 Query: 492 I-NHTGFPEELDELE-EYGMTDHERDLGGVMNGRANESLT-TKPVQGLSVGSRDSCXXXX 322 + H GFPEELDELE ++ + H + G M + + T V G+ + + Sbjct: 652 VGGHGGFPEELDELEDDFNSSYHVKGFGRDMPSKVLVAKAGTVEVNGVHPPT-EFLQHGG 710 Query: 321 XXXXXXXXXXXXPCHTRVPASSSNPQRKPLIRLSLSSMAKTTGTSS 184 P VPAS+S PQRKPLIRL+L+SMAK TG ++ Sbjct: 711 GLEELEPAELTTPFSNLVPASTSIPQRKPLIRLNLTSMAKATGRNT 756 >ref|XP_009387260.1| PREDICTED: poly(A) polymerase 
PAPalpha isoform X1 [Musa acuminata subsp. malaccensis] gi|695079670|ref|XP_009387261.1| PREDICTED: poly(A) polymerase PAPalpha isoform X1 [Musa acuminata subsp. malaccensis] Length = 773 Score = 991 bits (2563), Expect = 0.0 Identities = 528/779 (67%), Positives = 576/779 (73%), Gaps = 26/779 (3%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 M S+GL RSNG +LGVTEPIS GP+E+DV+KTQELEK LAD GLYESQEEAVSREE+ Sbjct: 1 MESSGLVKRSNG--HLGVTEPISWSGPTEYDVIKTQELEKYLADAGLYESQEEAVSREEI 58 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVKKVSRAKGFNEQ VQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 59 LGRLDQIVKIWVKKVSRAKGFNEQFVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 118 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF EL NML EMPEVTELHPVPDAHVPVM+FKFSGVSIDLLYAKLSLWVIPEDLD Sbjct: 119 TREEDFFTELHNMLSEMPEVTELHPVPDAHVPVMRFKFSGVSIDLLYAKLSLWVIPEDLD 178 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+GF Sbjct: 179 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 238 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALPSMLV+RFFRVYTQWRWPNPVMLC I+EG+LGLPIWDPR Sbjct: 239 LGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCEIQEGTLGLPIWDPR 298 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 RN RDRLHQMPIITPAYPCMNSSYNV SSTLRVMTEEFQRGNEICEAM+ NKADW TLFE Sbjct: 299 RNFRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGNEICEAMEANKADWDTLFE 358 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 PYPFFEAYKNYLEIDITA+NE DLRKWKGWVESRLR LTLKIERHTFGML CHP P DFS Sbjct: 359 PYPFFEAYKNYLEIDITADNESDLRKWKGWVESRLRTLTLKIERHTFGMLHCHPCPRDFS 418 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSRPFHCC+FMGLQRKQG P E EQFDIR TV++FK+SV YTLWK GMEIQVSH +R Sbjct: 419 
DKSRPFHCCYFMGLQRKQGVPVQESEQFDIRGTVDDFKNSVSMYTLWKPGMEIQVSHRKR 478 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKNVG----VADETRKRKLAE 835 RN+PLFVFPGGVRPSRP K G DG +V K SD V A K G VAD + RK E Sbjct: 479 RNVPLFVFPGGVRPSRPPKVAGVDGHAVSGRKVSDMVHAGKPAGNVSHVADASTDRKQME 538 Query: 834 GNGES------SFTHIKHFKAMDSGCGGT------------EESEICKSHTSVIGSCSL- 712 G G S S + + K +D+ + SE+ + G + Sbjct: 539 GKGASCDPIVESSSESRKGKQLDNRTDSNAANMNNLVDHILKPSEMGTPSSFANGVLDVP 598 Query: 711 DSDAARQRQHVEDNNVKNNSTDGKCHTASASEGVAGEGSEIGSALPIIAPGAATSTSREA 532 D R+ V ++ S H+ E A + +G + G + S+EA Sbjct: 599 DESRKRKCMDVTTDSFATGSEFQADHSFKRPETSAAIAASVGPVTE-VDNGESIFCSKEA 657 Query: 531 EELAIEKIMSAQTINHTGFPEELDELEEYGMTDHERDLGGVMNGRANESLTTKPV---QG 361 E LAI KI S N PE LDELE + H++ GG + G + ES T K G Sbjct: 658 ETLAISKITSVPPSNLAALPEGLDELEYFESQGHDKGFGGPVGGHSVESSTVKDAITQLG 717 Query: 360 LSVGSRDSCXXXXXXXXXXXXXXXXPCHTRVPASSSNPQRKPLIRLSLSSMAKTTGTSS 184 S GS PAS++N QRKPL RL LS++AK+ G S Sbjct: 718 SSYGSNTKNGGVEELEKSSELSAPYL--GGAPASTANTQRKPL-RLRLSTVAKSAGERS 773 >ref|XP_010257444.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) polymerase PAPalpha-like [Nelumbo nucifera] Length = 741 Score = 991 bits (2561), Expect = 0.0 Identities = 508/698 (72%), Positives = 567/698 (81%), Gaps = 11/698 (1%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 MGS G N R+NG+ +LGVTEPISLGGP+EFDV+KT+ELEK LA+ GLYESQEEAVSREEV Sbjct: 1 MGSPGSNVRNNGR-HLGVTEPISLGGPTEFDVIKTRELEKFLAEAGLYESQEEAVSREEV 59 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LG LDQ+VK W+K VSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGAD+DTLCVGPRHA Sbjct: 60 LGSLDQVVKKWIKAVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADVDTLCVGPRHA 119 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF+EL ML EMPEVTELHPVPDAHVPVMKFKF+GVSIDLLYAKLSLWVIPEDLD Sbjct: 120 TREEDFFVELHKMLAEMPEVTELHPVPDAHVPVMKFKFNGVSIDLLYAKLSLWVIPEDLD 179 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 
ISQDSILQNADEQTVRSLNGCRVTD+ILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+GF Sbjct: 180 ISQDSILQNADEQTVRSLNGCRVTDRILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 239 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALPSMLV+RFFRV+ QWRWPNPVMLCAIEEGSLGLP+WDPR Sbjct: 240 LGGINWALLVARICQLYPNALPSMLVSRFFRVFAQWRWPNPVMLCAIEEGSLGLPVWDPR 299 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 +N RDRLHQMPIITPAYPCMNSSYNV SSTLRVM +EFQRGNEICE M+ NKADW+ LFE Sbjct: 300 KNYRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMXQEFQRGNEICEPMEKNKADWNALFE 359 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 PYPFFEAYKNYL+IDI+AEN+DDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS Sbjct: 360 PYPFFEAYKNYLQIDISAENDDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 419 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSRPFHC +FMGL RKQG +EG+QFDIRATVEEFK SVG YTLWK MEI VSHI+R Sbjct: 420 DKSRPFHCSYFMGLSRKQGVSVHEGKQFDIRATVEEFKLSVGMYTLWKPRMEIHVSHIKR 479 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKN-------VGVADETRKRK 844 RNIPLFVFPGG+RPSRPAK G + K V K ++VQA K+ +GVAD+ RKRK Sbjct: 480 RNIPLFVFPGGIRPSRPAKEDG-ESKPVSNLKLCNSVQASKSCESAVGAMGVADDIRKRK 538 Query: 843 LAEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSDAARQRQHVEDNNV 664 L N E++ K ++ + G S S + S+ + QH +NN+ Sbjct: 539 LGYDNDENNPRAAK-LLSVTTTEGSVSRSSPTASTCTTATFYDAPSEVRERGQH--ENNL 595 Query: 663 KNNSTDGKCHTASASEGVAGEGSEIGSALPIIAPGAATS---TSREAEELAIEKIMSAQT 493 ++ C T S G EGS S P++ P + S S+EAE+LAIEKI S + Sbjct: 596 GDSPISATCLTGVPSHGGEAEGSVRCS--PLVKPSSTNSDLVCSKEAEKLAIEKIASGPS 653 Query: 492 INHTGFPEELDELE-EYGMTDHERDLGGVMNGRANESL 382 ++ GF EELDELE + G TD + G G ++ESL Sbjct: 654 VSQQGFLEELDELEDDIGSTDQVKVFGVSRKGISSESL 691 >ref|XP_008775850.1| PREDICTED: poly(A) polymerase PAPalpha [Phoenix dactylifera] Length = 767 Score = 981 bits (2537), Expect = 0.0 Identities = 529/781 (67%), Positives = 587/781 (75%), Gaps = 28/781 (3%) Frame = -3 Query: 2442 
MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 M S+GL+ RSNG YLGVTEPIS GP+EFD+ KTQELEK LAD GLYESQE AVSREEV Sbjct: 1 MASSGLSKRSNG--YLGVTEPISWSGPTEFDITKTQELEKYLADAGLYESQEGAVSREEV 58 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WV+KVSRAKGFNEQ VQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 59 LGRLDQIVKVWVRKVSRAKGFNEQFVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 118 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF EL NML EMPEVTELHPVPDAHVPVMKFKF+GVSIDLLYAKLSLWVIPEDLD Sbjct: 119 TREEDFFTELHNMLAEMPEVTELHPVPDAHVPVMKFKFNGVSIDLLYAKLSLWVIPEDLD 178 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+GF Sbjct: 179 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 238 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALPSMLV+RFFRVYTQWRWPNPVMLC IEEG+LGL +WDPR Sbjct: 239 LGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCDIEEGTLGLSVWDPR 298 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 +N +DRLHQMPIITPAYP MNSSYNV SSTLRVMT+EFQRG+ ICE M+ NKADWS LFE Sbjct: 299 KNFKDRLHQMPIITPAYPSMNSSYNVSSSTLRVMTDEFQRGHVICEEMEANKADWSKLFE 358 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 PYPFFEAYK+YLEIDITA NEDDLRKWKGWVESRLR LTLKIERHTFGMLQCHPHPGDFS Sbjct: 359 PYPFFEAYKHYLEIDITAANEDDLRKWKGWVESRLRTLTLKIERHTFGMLQCHPHPGDFS 418 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSR FHCC+FMGLQRKQG P NEGEQFDIR TVE+FKHSVG YTLWK GMEIQVSHI+R Sbjct: 419 DKSRLFHCCYFMGLQRKQGVPVNEGEQFDIRVTVEDFKHSVGMYTLWKPGMEIQVSHIKR 478 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQA----DKNVGVADETRKRKLAE 835 RN+P FVFP G+RPSRP K +G +V K K+S +VQA D G D T +AE Sbjct: 479 RNVPSFVFPSGIRPSRPPKAAVSEGHTVSKIKSSSSVQAGKPSDTLAGSGDTT--THMAE 536 Query: 834 GNGESSFTHIKHFKAM--DSGCGGTEE-SEICKSHTSVIGSCSLDSDA------------ 700 +SS T + + D G+E S I 
+ +S C+ +++ Sbjct: 537 ---DSSTTKLLAAGILIGDQVVEGSERCSPITMTSSSASSLCTKEAEGSAINLVGNANGI 593 Query: 699 ------ARQRQHVEDNNVKNNSTDGK-CHTASASEGVAGEGSEIGSALPIIAPGAATST- 544 +R+R+H ED + +NS D K +A E V S I +A PG TS+ Sbjct: 594 LNVTIESRKRKHEEDTD--SNSIDAKRLASAKPPESVGMAASGIIAAED---PGNCTSSL 648 Query: 543 -SREAEELAIEKIMSAQTINHTGFPEELDELEEYGMTDHERDLGGVMNGRANESLTTKPV 367 S+EAE LAI+KI S N PE LDELE + + ++D GGV +G + S K Sbjct: 649 CSKEAEALAIKKITSGSPTNLASLPEGLDELELFELHGQDKDFGGVASGCSVVSSAAKDA 708 Query: 366 QGLSVGSRDSCXXXXXXXXXXXXXXXXPCHTRVPASSSNPQRKPLIRLSLSSMAKTTGTS 187 + VG P AS+ N QRKPL +L LSS+ + TG S Sbjct: 709 P-MQVGKLHDSSKNGGIEELEPAELSAPTFGGPTASTLNAQRKPL-KLRLSSVVRATGKS 766 Query: 186 S 184 + Sbjct: 767 A 767 >ref|XP_002279968.2| PREDICTED: nuclear poly(A) polymerase 1 [Vitis vinifera] Length = 757 Score = 971 bits (2510), Expect = 0.0 Identities = 506/766 (66%), Positives = 574/766 (74%), Gaps = 13/766 (1%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 M + GLNNR+N Q LG+TEPISLGGP+E DV KTQELEK LA GLYESQEEAVSREEV Sbjct: 1 MSNLGLNNRNNSGQRLGITEPISLGGPNELDVTKTQELEKFLAAAGLYESQEEAVSREEV 60 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVK +SRAKG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 61 LGRLDQIVKIWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF EL ML EMPEVTELHPVPDAHVPVM+FKFSGVSIDLLYAKLSLWVIPEDLD Sbjct: 121 TREEDFFGELHKMLSEMPEVTELHPVPDAHVPVMRFKFSGVSIDLLYAKLSLWVIPEDLD 180 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 +SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLR MRFWAKRRGVYSNV+GF Sbjct: 181 VSQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRFMRFWAKRRGVYSNVAGF 240 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALPSMLV+RFFRVYTQWRWPNPVMLCAIEEG+LGL +WDPR Sbjct: 241 LGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGTLGLQVWDPR 300 Query: 1542 
RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 + P+DR H MPIITPAYPCMNSSYNV SSTLR+M+EEF+RGNEI E M+ NKADW+TL E Sbjct: 301 KYPKDRFHLMPIITPAYPCMNSSYNVSSSTLRIMSEEFKRGNEISEVMEANKADWATLCE 360 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 PYPFFEAYKNYL+I+I AEN DDLRKWKGWVESRLRQLTLKIERHT+ MLQCHPHPGDFS Sbjct: 361 PYPFFEAYKNYLQIEIAAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFS 420 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSRPFHCC+FMGLQRKQG PA+EGEQFDIR TV+EFKHSVG YTLWK GMEI V H+RR Sbjct: 421 DKSRPFHCCYFMGLQRKQGVPASEGEQFDIRLTVDEFKHSVGMYTLWKPGMEIHVIHVRR 480 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKNVGVADETRKRKLAEGNGE 823 RNIP FVFPGGVRPSRP K + + VL+P S + A++++KRK + N E Sbjct: 481 RNIPNFVFPGGVRPSRPTK-VASERRRVLEPNVSTQAVLEG----AEDSKKRKREDENVE 535 Query: 822 SSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSL--DSDAARQRQHVEDNNVKNNST 649 T+ ++ K + + + E S + +CS+ DS V+NN Sbjct: 536 ---TNSRNAKCLVAAASSSHEVLSSNPLVSTVNACSIKVDSMDINMLGKTRKEKVENNIE 592 Query: 648 DGKCHTASASEGVAGEGSEIGSA-----LPIIAPGAATSTSREAEELAIEKIMSAQTINH 484 G + ++ E G GS + ++ + +S EAE++AIEKIMS ++H Sbjct: 593 HGLKNLNNSVEVPPQNGEVDGSVRCSHPIKTLSSSGGSPSSTEAEKIAIEKIMSGPYVSH 652 Query: 483 TGFPEELDELE-EYGMTDHERDLGGVMNGRANESLTTKPVQGLSVGSRDSCXXXXXXXXX 307 FP ELDELE + + +D G G + ES + V + + Sbjct: 653 QAFPGELDELEDDVEYKNQVKDFTGSTKGSSAES-SKANVAEEPLTTTSGTVPCTILSPN 711 Query: 306 XXXXXXXPCHTRVPAS-----SSNPQRKPLIRLSLSSMAKTTGTSS 184 P P S SS Q+KP+IRLS +S+AK TG S+ Sbjct: 712 GGLEELEPAELMPPLSYGNRPSSTEQKKPIIRLSFTSLAKATGKST 757 >ref|XP_007036647.1| Poly(A) polymerase 1 isoform 1 [Theobroma cacao] gi|590665102|ref|XP_007036648.1| Poly(A) polymerase 1 isoform 1 [Theobroma cacao] gi|508773892|gb|EOY21148.1| Poly(A) polymerase 1 isoform 1 [Theobroma cacao] gi|508773893|gb|EOY21149.1| Poly(A) polymerase 1 isoform 1 [Theobroma cacao] Length = 762 Score = 968 bits (2502), Expect = 0.0 Identities = 511/768 (66%), Positives = 576/768 (75%), Gaps = 16/768 (2%) Frame = -3 
Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 MGS GL NR+NGQ+ LG+TEPISLGGP+++DV+KT+ELEK L +VGLYESQEEAV REEV Sbjct: 1 MGSPGLGNRNNGQR-LGITEPISLGGPTDYDVIKTRELEKYLQNVGLYESQEEAVGREEV 59 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQ VK WVK +SRAKG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 60 LGRLDQTVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 119 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF EL ML EMPEV+ELHPVPDAHVPVMKFKF GVSIDLLYAKLSLWVIPEDLD Sbjct: 120 TREEDFFGELYKMLSEMPEVSELHPVPDAHVPVMKFKFKGVSIDLLYAKLSLWVIPEDLD 179 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQDSILQN DEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+GF Sbjct: 180 ISQDSILQNTDEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 239 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALP+MLV+RFFRVYTQWRWPNPVMLCAIEEGSLGL +WDPR Sbjct: 240 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPR 299 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 +NP+DR H MPIITPAYPCMNSSYNV SSTLR+MT+EFQRG+EICEAM+ NKADW LFE Sbjct: 300 KNPKDRYHLMPIITPAYPCMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDILFE 359 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 Y FFEAYKNYL+IDI+AEN DDLRKWKGWVESRLRQLTLKIERHT+ MLQCHPHPGDF Sbjct: 360 SYAFFEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQ 419 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSRPFH +FMGLQRKQG P NEGEQFDIR TVEEFKHSV YTLWK GMEI+V+H++R Sbjct: 420 DKSRPFHGSYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNMYTLWKPGMEIRVTHVKR 479 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKN---VGVA---DETRKRKL 841 RNIP FVFPGGVRPSRP+K +D V K S DK+ GVA D+ +KRK Sbjct: 480 RNIPSFVFPGGVRPSRPSK-VTWDSMRVSDAKVSGHAGPDKSGEVKGVADGQDDGKKRKR 538 Query: 840 AEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSDAARQRQHVEDNNVK 661 + NG++ K+ A+ S 
+ E + S S + SCS D + +E K Sbjct: 539 VDDNGDAQLRSSKYITAVPS---SSLEGRV-GSPVSTVSSCSTKGDYSDATGLIETTREK 594 Query: 660 --NNSTDGKCHTASASEGVAGEGSEIGS--ALPIIAPGAATSTSREAEELAIEKIMSAQT 493 +N T+G ++ S E + G GS P I A S+ EAE LAIEKIMS Sbjct: 595 AESNMTNGLINSRSLEELSSHNGEVDGSVGCNPPIKVSADASSCTEAENLAIEKIMSGPY 654 Query: 492 INHTGFPEELDELE-EYGMTDHERDLGGVMNGRANESLT----TKPVQGLS-VGSRDSCX 331 H FP+EL+ELE + + R + +G S++ PV + G S Sbjct: 655 GAHQAFPQELEELEDDLEFRNQVRSVENTKSGPVESSMSDLAGAAPVTSSNGAGPSTSLH 714 Query: 330 XXXXXXXXXXXXXXXPCHTRVPASSSNPQRKPLIRLSLSSMAKTTGTS 187 R+P S+ QRKPLIRL+ +S+ K + S Sbjct: 715 ASGGIEELEPAELTAMISNRIP-SAPVAQRKPLIRLNFTSLGKASEKS 761 >ref|XP_011009627.1| PREDICTED: nuclear poly(A) polymerase 1 [Populus euphratica] Length = 776 Score = 964 bits (2492), Expect = 0.0 Identities = 508/781 (65%), Positives = 581/781 (74%), Gaps = 28/781 (3%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQY--LGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSRE 2269 MGS GL NR+NGQQ LG+TEPISLGGP+E+DV KT+ELEK L D GLYESQEEAVSRE Sbjct: 1 MGSPGLINRNNGQQQQRLGITEPISLGGPTEYDVTKTRELEKFLQDAGLYESQEEAVSRE 60 Query: 2268 EVLGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPR 2089 EVLGRLDQIVK WVK +SRAKG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPR Sbjct: 61 EVLGRLDQIVKNWVKVISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPR 120 Query: 2088 HATREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPED 1909 HATREEDFF EL ML EMPEVTELHPVPDAHVPVM+FKF GVSIDLLYAKLSLWVIPED Sbjct: 121 HATREEDFFGELHRMLSEMPEVTELHPVPDAHVPVMRFKFKGVSIDLLYAKLSLWVIPED 180 Query: 1908 LDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVS 1729 LD+SQDS+L NADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVS Sbjct: 181 LDVSQDSMLHNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVS 240 Query: 1728 GFLGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWD 1549 GFLGGINWALL ARICQL+PNALP+MLV+RFFRVYTQWRWPNPVMLCAIEEGSLGLP+WD Sbjct: 241 GFLGGINWALLAARICQLFPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLPVWD 300 Query: 1548 PRRNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTL 
1369 PRRNP+DR H MPIITPAYP MNSSYNV SSTLR+MTEEFQRGNEICEAM+V+KA+W TL Sbjct: 301 PRRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGNEICEAMEVSKAEWDTL 360 Query: 1368 FEPYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGD 1189 FEP+ FFEAYKNYL+IDI+AENEDDLR+WKGWVESRLRQLTLKIERHT+ MLQCHPHPG+ Sbjct: 361 FEPFSFFEAYKNYLQIDISAENEDDLRQWKGWVESRLRQLTLKIERHTYNMLQCHPHPGE 420 Query: 1188 FSDKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHI 1009 FSDKSRP HC +FMGLQRKQG P NEGEQFDIR TV+EFKHSV YT K GMEI V+H+ Sbjct: 421 FSDKSRPLHCSYFMGLQRKQGVPVNEGEQFDIRITVDEFKHSVKMYTSRKPGMEIHVTHV 480 Query: 1008 RRRNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKNVGV-----ADETRKRK 844 +RRNIP FVFP GVRPSRP+K +DG+ + K ++ ADK G +DE +KRK Sbjct: 481 KRRNIPNFVFPNGVRPSRPSKA-TWDGRRSSEAKVANNSSADKIEGKGVLDGSDEGKKRK 539 Query: 843 LAEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSD--AARQRQHVEDN 670 + + E++ + K + AM G E + SCS SD ++ Sbjct: 540 RIDDDTENNLRNPKGYAAMPPSSGEVLEG---SPPVGNVSSCSTQSDLVITNSLGELKGE 596 Query: 669 NVKNNSTDGKCHTASASEGVAGEGSEIGSALPIIAPGAA------TSTSREAEELAIEKI 508 NN T+ ++ + + G+ + E+ L PG TS+S+EAE+LAI+KI Sbjct: 597 KADNNETESLNNSQNLA-GIFAQNGELDGILRCNLPGKGLPANNNTSSSKEAEKLAIDKI 655 Query: 507 MSAQTINHTGFPEELDELE-EYGMTDHERDLGGVMNGRANES--------LTTKPVQGL- 358 MS + H P+ELDELE ++ T+ + G ES LT + + + Sbjct: 656 MSGPYVAHQALPQELDELEDDFVYTNQGKGSEWAAKGSPVESSLSNTAAELTNESIAAVA 715 Query: 357 -SVGSRDSCXXXXXXXXXXXXXXXXPCHTRVPASSSNP--QRKPLIRLSLSSMAKTTGTS 187 S G+ S SS+ P Q KPLIRL+ +S+ K G S Sbjct: 716 CSNGAGPSAYLYPNGGSDELEXAELMAPLFNGISSAPPVAQPKPLIRLNFTSLGKAAGKS 775 Query: 186 S 184 + Sbjct: 776 T 776 >ref|XP_012486421.1| PREDICTED: nuclear poly(A) polymerase 1 isoform X1 [Gossypium raimondii] gi|823176367|ref|XP_012486422.1| PREDICTED: nuclear poly(A) polymerase 1 isoform X1 [Gossypium raimondii] gi|823176370|ref|XP_012486423.1| PREDICTED: nuclear poly(A) polymerase 1 isoform X1 [Gossypium raimondii] gi|763769978|gb|KJB37193.1| hypothetical protein B456_006G193600 [Gossypium raimondii] gi|763769981|gb|KJB37196.1| hypothetical 
protein B456_006G193600 [Gossypium raimondii] Length = 762 Score = 956 bits (2470), Expect = 0.0 Identities = 499/772 (64%), Positives = 571/772 (73%), Gaps = 20/772 (2%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 MGS GL ++GQ+ LG+TEPISLGGP+E+DV+KT+ELEK L +VGLYESQEEAVSREEV Sbjct: 1 MGSPGLGTGNSGQR-LGITEPISLGGPTEYDVIKTRELEKYLQNVGLYESQEEAVSREEV 59 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVK +SRAKG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 60 LGRLDQIVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 119 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF EL ML EMPEV+ELHPVPDAHVP+MKFKF GVSIDLLYAKLSLWVIPEDLD Sbjct: 120 TREEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIPEDLD 179 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQDSILQN D+QTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+GF Sbjct: 180 ISQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 239 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALP+MLV+RFFRVYTQWRWPNPVMLCAI+EGSLGL +WDPR Sbjct: 240 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIKEGSLGLQVWDPR 299 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 +NP+DR H MPIITPAYP MNSSYNV SSTLR+MT+EFQRG+EICEAM+ NKADW LFE Sbjct: 300 KNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDALFE 359 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 Y FFEAYKNYL+IDI+AEN+DDLR WKGWVESRLRQLTLKIERHT+ MLQCHPHPGDF Sbjct: 360 AYAFFEAYKNYLQIDISAENDDDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQ 419 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 D SRPFHC +FMGLQRK G P NEGEQFDIR TVEEFKHSV YTLWK GMEI+VSH++R Sbjct: 420 DNSRPFHCSYFMGLQRKLGVPVNEGEQFDIRLTVEEFKHSVNTYTLWKPGMEIRVSHVKR 479 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKN---VGVAD---ETRKRKL 841 R+IP FVFPGGVRPSRP+K +D + K S 
+DK G AD + +KRK Sbjct: 480 RSIPSFVFPGGVRPSRPSKA-TWDSRRASDAKVSGHAGSDKPGEVKGAADGQVDGKKRKR 538 Query: 840 AEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSDAARQRQHVEDNNVK 661 A+ + ++ + K+ A+ S + S + CSL D VE K Sbjct: 539 ADDSADTQLKNSKYITAVPSSSAEVQAG----SPGGTVSPCSLKGDNVDATGLVEPTRGK 594 Query: 660 NNSTDGKCHTASASEGVAGEGSEIGSALPIIAP------GAATSTSREAEELAIEKIMSA 499 + S S+++ ++ SE+ +L I P A S+S+EAE+LAIE+IMS Sbjct: 595 DESNMTNGSKTSSTDELSSLNSEVDGSLRCIPPHTGLHVTADASSSKEAEKLAIEQIMSG 654 Query: 498 QTINHTGFPEELDELEEYGMTDHERDLGGVMNGRANESLTTKPVQGL--------SVGSR 343 ++H FPEE +ELE+ D E V G N PV S G+ Sbjct: 655 PYVSHQAFPEEPEELED----DLEFRNRVVSVGNTNNGPLQAPVSDAAGAAPIISSNGAG 710 Query: 342 DSCXXXXXXXXXXXXXXXXPCHTRVPASSSNPQRKPLIRLSLSSMAKTTGTS 187 S T +P + Q+KPLIRL+ +S+ K + S Sbjct: 711 PSISLHASGSIEELEPAELTAMTSIPVAPV-VQKKPLIRLNFTSLGKASEKS 761 >ref|XP_002322074.2| hypothetical protein POPTR_0015s04100g [Populus trichocarpa] gi|550321905|gb|EEF06201.2| hypothetical protein POPTR_0015s04100g [Populus trichocarpa] Length = 780 Score = 956 bits (2470), Expect = 0.0 Identities = 505/788 (64%), Positives = 579/788 (73%), Gaps = 35/788 (4%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQY---LGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSR 2272 MGS GL NR+NGQQ LG+TEPISLGGP+E+DV KT+ELEK L D GLYESQEEAVSR Sbjct: 1 MGSPGLINRNNGQQQQQRLGITEPISLGGPTEYDVTKTRELEKFLQDAGLYESQEEAVSR 60 Query: 2271 EEVLGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGP 2092 EEVLGRLDQIVK WVK +SRAK NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGP Sbjct: 61 EEVLGRLDQIVKNWVKVISRAKRLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGP 120 Query: 2091 RHATREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPE 1912 RHATREEDFF EL ML EMPEVTELHPVPDAHVPVM+FKF GVSIDLLYAKLSLWVIPE Sbjct: 121 RHATREEDFFGELHRMLSEMPEVTELHPVPDAHVPVMRFKFKGVSIDLLYAKLSLWVIPE 180 Query: 1911 DLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQ---NFRTTLRCMRFWAKRRGVY 1741 DLD+SQDS+L NADEQTVRSLNGCRVTDQILRLVPNIQ NFRTTLRCMRFWAKRRGVY Sbjct: 181 DLDVSQDSMLHNADEQTVRSLNGCRVTDQILRLVPNIQAMQNFRTTLRCMRFWAKRRGVY 240 Query: 1740 
SNVSGFLGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGL 1561 SNVSGFLGGINWALLVARICQL+PNALP+MLV+RFFRVYTQWRWPNPVMLCAIEEGSLGL Sbjct: 241 SNVSGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGL 300 Query: 1560 PIWDPRRNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKAD 1381 +WDPRRNP+DR H MPIITPAYP MNSSYNV SSTLR+MTEEFQRGNEICEAM+V+KA+ Sbjct: 301 SVWDPRRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGNEICEAMEVSKAE 360 Query: 1380 WSTLFEPYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHP 1201 W TLFEP+ FFEAYKNYL+IDI+AENEDDLR+WKGWVESRLRQLTLKIERHT+ MLQCHP Sbjct: 361 WDTLFEPFSFFEAYKNYLQIDISAENEDDLRQWKGWVESRLRQLTLKIERHTYNMLQCHP 420 Query: 1200 HPGDFSDKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQ 1021 HPG+FSDKSRP HC +FMGLQRKQG P NEGEQFDIR TV+EFK+SV YTLWK GMEI+ Sbjct: 421 HPGEFSDKSRPLHCSYFMGLQRKQGVPVNEGEQFDIRITVDEFKNSVNMYTLWKPGMEIR 480 Query: 1020 VSHIRRRNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKNVGV-----ADET 856 V+H+++RNIP FVFP GVRPSRP+K +DG+ + K ++ ADK G +DE Sbjct: 481 VTHVKKRNIPNFVFPSGVRPSRPSKA-TWDGRRSSEAKVANNSSADKIEGKGVLDGSDEG 539 Query: 855 RKRKLAEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSD--AARQRQH 682 +KRK + + E++ + K + AM G E + SCS SD Sbjct: 540 KKRKRIDEDTENNLRNPKGYAAMPPSGGEVHEG---SPPVGNVSSCSTQSDLVITNSLGE 596 Query: 681 VEDNNVKNNSTDGKCHTASASEGVAGEGSEIGSALPIIAPGAA------TSTSREAEELA 520 ++ NN T+ ++ + + G+ + E+ L P TS+S+EAE+LA Sbjct: 597 LKGEKADNNETESLSNSQNLA-GIFAQNGELDGILRCNLPDKGLPANNDTSSSKEAEKLA 655 Query: 519 IEKIMSAQTINHTGFPEELDELE-EYGMTDH-------------ERDLGGVMNGRANESL 382 I+KIMS + H P+ELDELE ++ T+ E L + NES+ Sbjct: 656 IDKIMSGPYVAHQALPQELDELEDDFVYTNQGKGSEWAAKGSPVESSLSNTAVEQTNESI 715 Query: 381 TTKPVQGLSVGSRDSCXXXXXXXXXXXXXXXXPCHTRVPASSSNP--QRKPLIRLSLSSM 208 S G+ S SS+ P Q KPLIRL+ +S+ Sbjct: 716 A---AVACSNGAGPSAYLYPNGGSEELEPAELMAPLFNGISSAPPVAQPKPLIRLNFTSL 772 Query: 207 AKTTGTSS 184 K G S+ Sbjct: 773 GKAAGKST 780 >gb|KJB37195.1| hypothetical protein B456_006G193600 [Gossypium raimondii] Length = 748 Score = 950 bits (2456), Expect = 0.0 Identities = 477/677 
(70%), Positives = 541/677 (79%), Gaps = 12/677 (1%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 MGS GL ++GQ+ LG+TEPISLGGP+E+DV+KT+ELEK L +VGLYESQEEAVSREEV Sbjct: 1 MGSPGLGTGNSGQR-LGITEPISLGGPTEYDVIKTRELEKYLQNVGLYESQEEAVSREEV 59 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVK +SRAKG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 60 LGRLDQIVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 119 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF EL ML EMPEV+ELHPVPDAHVP+MKFKF GVSIDLLYAKLSLWVIPEDLD Sbjct: 120 TREEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIPEDLD 179 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQDSILQN D+QTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+GF Sbjct: 180 ISQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 239 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALP+MLV+RFFRVYTQWRWPNPVMLCAI+EGSLGL +WDPR Sbjct: 240 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIKEGSLGLQVWDPR 299 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 +NP+DR H MPIITPAYP MNSSYNV SSTLR+MT+EFQRG+EICEAM+ NKADW LFE Sbjct: 300 KNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDALFE 359 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 Y FFEAYKNYL+IDI+AEN+DDLR WKGWVESRLRQLTLKIERHT+ MLQCHPHPGDF Sbjct: 360 AYAFFEAYKNYLQIDISAENDDDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQ 419 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 D SRPFHC +FMGLQRK G P NEGEQFDIR TVEEFKHSV YTLWK GMEI+VSH++R Sbjct: 420 DNSRPFHCSYFMGLQRKLGVPVNEGEQFDIRLTVEEFKHSVNTYTLWKPGMEIRVSHVKR 479 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKN---VGVAD---ETRKRKL 841 R+IP FVFPGGVRPSRP+K +D + K S +DK G AD + +KRK Sbjct: 480 RSIPSFVFPGGVRPSRPSKA-TWDSRRASDAKVSGHAGSDKPGEVKGAADGQVDGKKRKR 538 Query: 840 
AEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSDAARQRQHVEDNNVK 661 A+ + ++ + K+ A+ S + S + CSL D VE K Sbjct: 539 ADDSADTQLKNSKYITAVPSSSAEVQAG----SPGGTVSPCSLKGDNVDATGLVEPTRGK 594 Query: 660 NNSTDGKCHTASASEGVAGEGSEIGSALPIIAP------GAATSTSREAEELAIEKIMSA 499 + S S+++ ++ SE+ +L I P A S+S+EAE+LAIE+IMS Sbjct: 595 DESNMTNGSKTSSTDELSSLNSEVDGSLRCIPPHTGLHVTADASSSKEAEKLAIEQIMSG 654 Query: 498 QTINHTGFPEELDELEE 448 ++H FPEE +ELE+ Sbjct: 655 PYVSHQAFPEEPEELED 671 >ref|XP_010110105.1| Poly(A) polymerase [Morus notabilis] gi|587938462|gb|EXC25192.1| Poly(A) polymerase [Morus notabilis] Length = 838 Score = 943 bits (2438), Expect = 0.0 Identities = 500/784 (63%), Positives = 571/784 (72%), Gaps = 45/784 (5%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQY-------------------------LGVTEPISLGGPSEFDVVKT 2338 M + GL+NR+NGQ+ LG+TEPISLGGP+E+DV+K+ Sbjct: 1 MANHGLSNRNNGQRLGITEPISLGGPTEYDVMKSQELEKRLGITEPISLGGPTEYDVMKS 60 Query: 2337 QELEKCLADVGLYESQEEAVSREEVLGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFT 2158 QELEK L D GLYESQEEAVSREEVLGRLDQIVK WVK +SRAKG NEQLVQEANAKIFT Sbjct: 61 QELEKYLQDAGLYESQEEAVSREEVLGRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFT 120 Query: 2157 FGSYRLGVHGPGADIDTLCVGPRHATREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMK 1978 FGSYRLGVHGPGADIDTLCVGPRHATREEDFF EL ML EMPEVTE+HPVPDAHVPV++ Sbjct: 121 FGSYRLGVHGPGADIDTLCVGPRHATREEDFFGELHRMLVEMPEVTEVHPVPDAHVPVLR 180 Query: 1977 FKFSGVSIDLLYAKLSLWVIPEDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQ 1798 FKF+GVSIDLLYAKLSLWVIPEDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQ Sbjct: 181 FKFNGVSIDLLYAKLSLWVIPEDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQ 240 Query: 1797 NFRTTLRCMRFWAKRRGVYSNVSGFLGGINWALLVARICQLYPNALPSMLVARFFRVYTQ 1618 NFRTTLRCMR WAKRRGVYSNVSGFLGGINWALLVARICQLYPNALP+MLV+RFFRVYTQ Sbjct: 241 NFRTTLRCMRLWAKRRGVYSNVSGFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQ 300 Query: 1617 WRWPNPVMLCAIEEGSLGLPIWDPRRNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMT 1438 WRWPNPVMLCAIEEGSLGL +WDPRRNP+DR H MPIITPAYPCMNSSYNV +STLR+M+ Sbjct: 301 WRWPNPVMLCAIEEGSLGLQVWDPRRNPKDRYHLMPIITPAYPCMNSSYNVSASTLRIMS 360 Query: 1437 
EEFQRGNEICEAMKVNKADWSTLFEPYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRL 1258 EEFQRG EICEAM+ +KADW TLFEPYPFFEAYKNYL+IDI+AEN+DDLRKWKGWVESRL Sbjct: 361 EEFQRGREICEAMETDKADWDTLFEPYPFFEAYKNYLQIDISAENDDDLRKWKGWVESRL 420 Query: 1257 RQLTLKIERHTFGMLQCHPHPGDFSDKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVE 1078 RQLTLKIERHT+ LQCHPHPG+FSDKS+PFHC +FMGLQRKQG PANE FDIR TVE Sbjct: 421 RQLTLKIERHTYNKLQCHPHPGEFSDKSKPFHCSYFMGLQRKQGVPANESGHFDIRLTVE 480 Query: 1077 EFKHSVGNYTLWKRGMEIQVSHIRRRNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASD 898 EFK+SV Y LWK GM I VSH++R+NIP FVFPG VRP RP K +D K + KAS Sbjct: 481 EFKNSVNMYMLWKPGMLIHVSHVKRKNIPNFVFPGRVRPGRPVK-ITWDMKRASELKASG 539 Query: 897 TVQADKN------VGVADETRKRKLAEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHT 736 Q DK+ + +D+ KRK + N ESS ++K + T E S Sbjct: 540 LAQPDKSDESKTVLNGSDDGSKRKRVDDNVESSLRNVKPRASF------TGEVLEASSPI 593 Query: 735 SVIGSCSLDSDAARQRQHVEDNNVK--NNSTDG--KCHTASASEGVAGEGSEIGS----- 583 S + S S+ D+ + VE K NN D KC ++ GE +E+ S Sbjct: 594 STLSSSSVKFDSMDMNRLVESQREKSDNNFVDSFKKCENSADIPSQNGE-NEVSSRCSPP 652 Query: 582 --ALPIIAPGAATSTSREAEELAIEKIMSAQTINHTGFPEELDELEEYGMTDHERDL-GG 412 A+P+ A A S+S+EAE++AI+ IMS +H PEELDELE++ + +D G Sbjct: 653 TKAVPVAAVDA--SSSKEAEKMAIDNIMSGPYDSHQALPEELDELEDFEYRNQAKDFSGS 710 Query: 411 VMNGRANESLTTKPVQGLSVGSRDSCXXXXXXXXXXXXXXXXPCHTRVPASSSNP--QRK 238 M+ + S +P ++ + V SS P QRK Sbjct: 711 TMDSQVETSKGNQPAAPITSNTGTGPSTGSYFNGGLEELEPAELMAPVSNGSSAPVAQRK 770 Query: 237 PLIR 226 P+IR Sbjct: 771 PIIR 774 >emb|CDO98397.1| unnamed protein product [Coffea canephora] Length = 754 Score = 941 bits (2432), Expect = 0.0 Identities = 490/760 (64%), Positives = 572/760 (75%), Gaps = 7/760 (0%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 M G N+S+GQ+ LG+TEPIS GP+E+D++KT+ELEK LADVGLYESQEEA+SREEV Sbjct: 1 MAGPGFGNQSSGQR-LGITEPISWSGPTEYDMIKTRELEKFLADVGLYESQEEAISREEV 59 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVKTWVK VSRAKG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 60 
LGRLDQIVKTWVKNVSRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 119 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TR++DFF ELQ ML EMPEV+ELHPVPDAHVPV+KFKFSG+SIDLLYAKLSLWVIPEDLD Sbjct: 120 TRDDDFFGELQRMLSEMPEVSELHPVPDAHVPVLKFKFSGISIDLLYAKLSLWVIPEDLD 179 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQ+SILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMR+WAKRRGVYSNV+GF Sbjct: 180 ISQESILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRYWAKRRGVYSNVAGF 239 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALP+MLV+RFFRVYTQWRWPNPVMLC IE+GSLGLP+WDPR Sbjct: 240 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCEIEDGSLGLPVWDPR 299 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 RNP+DR H MPIITPAYPCMNSSYNV SSTLR+MT EFQRGNEICEAM NK +W LFE Sbjct: 300 RNPKDRFHLMPIITPAYPCMNSSYNVSSSTLRIMTNEFQRGNEICEAMDANKCNWDKLFE 359 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 YPFFEAYKNYL+ID+TA N DL WKGWVESRLRQLTLKIERHT MLQCHPHPGDFS Sbjct: 360 LYPFFEAYKNYLQIDVTAANAADLMNWKGWVESRLRQLTLKIERHTLNMLQCHPHPGDFS 419 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSRPF+CC+FMGLQRKQG ANEGEQFDIR TVEEFKH+VG Y WK GMEI V H++R Sbjct: 420 DKSRPFYCCYFMGLQRKQGVAANEGEQFDIRLTVEEFKHAVGMYNTWKPGMEIHVCHVKR 479 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKNVGVADETRKRKLAEGNGE 823 R+IP FVFPGGVRP RP K G +G+ + K S + + KRK + + Sbjct: 480 RSIPAFVFPGGVRP-RPTKVAG-EGRRPSQTKVSSHTEDSSFPKALNGGSKRKRDDTDTA 537 Query: 822 SSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSDAARQRQHVEDNNVKNNSTDG 643 +S + +SG E TS +G+ SL++ + VED N+ N + Sbjct: 538 TSLNAKRIAGVGESGELVHEGRPSGCIGTSYLGNASLETPGKIFNEKVED-NMGNGLENP 596 Query: 642 KCHTASASEGVAGEGSEIGSAL---PIIAPGAATSTSREAEELAIEKIMSAQTINHTGFP 472 C ++S+ G E+ ++L P + + +S+EAE+LAIEK+M+ + H FP Sbjct: 597 ICLPQASSQ----NGGELDASLRLDPSTPADSISLSSKEAEKLAIEKMMTGPYVAHQTFP 652 Query: 471 
EELDELEEYGMTDHERDL-GGVMNGRANESLTTKPVQGLSVGSRDSCXXXXXXXXXXXXX 295 +ELDELE+ ++ + GG + G + ES TK +S+ + + Sbjct: 653 QELDELEDDPEYKNQGKITGGSVKGSSMESSATKGSLIVSLTTSTAAGSCSSLQSSGKLE 712 Query: 294 XXXPCHTRVPAS---SSNPQRKPLIRLSLSSMAKTTGTSS 184 P PAS S+ KP++R + +S+AK TG S+ Sbjct: 713 ELEPPELLPPASRLNSATSAPKPVLRFNFTSLAKATGEST 752 >ref|XP_003534153.1| PREDICTED: poly(A) polymerase-like isoform X1 [Glycine max] gi|571478167|ref|XP_006587485.1| PREDICTED: poly(A) polymerase-like isoform X2 [Glycine max] gi|734382895|gb|KHN23742.1| Poly(A) polymerase [Glycine soja] Length = 757 Score = 940 bits (2430), Expect = 0.0 Identities = 493/769 (64%), Positives = 567/769 (73%), Gaps = 16/769 (2%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 MG GL+N++NGQQ LG+TEPISL GP+E DV+KT+ELEK L VGLYESQEEAV REEV Sbjct: 1 MGIPGLSNQNNGQQRLGITEPISLAGPTEDDVIKTRELEKYLQGVGLYESQEEAVGREEV 60 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVK +SRAKGFNEQLV EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 61 LGRLDQIVKIWVKNISRAKGFNEQLVHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 +R+EDFF ELQ ML EM EVTELHPVPDAHVPVMKFKF+GVS+DLLYA+L+LWVIP+DLD Sbjct: 121 SRDEDFFGELQKMLSEMQEVTELHPVPDAHVPVMKFKFNGVSVDLLYARLALWVIPDDLD 180 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQ+SILQN DEQTV SLNGCRVTDQ+LRLVPNIQ FRTTLRCMRFWAKRRGVYSNV+GF Sbjct: 181 ISQESILQNVDEQTVLSLNGCRVTDQVLRLVPNIQTFRTTLRCMRFWAKRRGVYSNVAGF 240 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGIN ALLVARICQLYPNALP+MLV+RFFRVYTQWRWPNPVMLCAIEEGSLGL +WDPR Sbjct: 241 LGGINLALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLSVWDPR 300 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 RNP+DR H MPIITPAYPCMNS+YNV SSTLRVM++EF+RG+EICEAM+ +KADW TLFE Sbjct: 301 RNPKDRYHLMPIITPAYPCMNSTYNVTSSTLRVMSDEFRRGSEICEAMEASKADWDTLFE 360 Query: 1362 
PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 PYPFFE+YKNYL+IDITAEN DDLR+WKGWVESRLRQLTLKIERHT+GMLQCHPHPG+FS Sbjct: 361 PYPFFESYKNYLQIDITAENADDLRQWKGWVESRLRQLTLKIERHTYGMLQCHPHPGEFS 420 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 D SRPFH C+FMGLQRKQG P NEGEQFDIR TVEEFKHSV YTLWK GM I VSH++R Sbjct: 421 DNSRPFHHCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNAYTLWKPGMNIHVSHVKR 480 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKNVG------VADETRKRKL 841 RNIP ++FPGGVRP+ P+K + K K + QA+K G AD+ RKRK Sbjct: 481 RNIPNYIFPGGVRPTFPSKVTA-ENKQSSKSRVPGHGQAEKPQGGKTVVVGADDVRKRKR 539 Query: 840 AEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSDAARQRQHVEDNNVK 661 +E + + ++ K+ S + E S S SCS+ D + E N++ Sbjct: 540 SE---DIMDNNPRNSKSPVSLAPPSREVNEDISPISASSSCSMKFDES------EVNSIG 590 Query: 660 NNSTDGKC-----HTASASEGVAGEGSEIGSALPIIAPGAATSTSREAEELAIEKIMSAQ 496 ++ C S G G + P++A A TS S+E E+LAIEKIMS Sbjct: 591 GQKSEKPCLNSPGEIPSGDSGTNGSVTNNQQVNPVLA-AADTSNSKEEEKLAIEKIMSGP 649 Query: 495 TINHTGFPEELDELE-EYGMTDHERDLGGVMNGRANESLTTKPVQGLSVGSRDSCXXXXX 319 H FPEE +ELE + + ++D GG M ESL +KP Sbjct: 650 YDAHQAFPEEPEELEDDTQYKNQDKDSGGNMKNNM-ESLLSKPAVAEEPVISKEITCSTH 708 Query: 318 XXXXXXXXXXXPCHTRVPASSSN----PQRKPLIRLSLSSMAKTTGTSS 184 P P S P +KPLIRL+ +S+ K S+ Sbjct: 709 LFSNEILEELEPAELSAPLLSGPPAPLPMKKPLIRLNFTSLGKAADKSA 757 >ref|XP_004512881.1| PREDICTED: nuclear poly(A) polymerase 1 [Cicer arietinum] Length = 753 Score = 936 bits (2419), Expect = 0.0 Identities = 495/767 (64%), Positives = 574/767 (74%), Gaps = 14/767 (1%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 MG GL+N++NG+Q+LG+TEPISL GP+E DVVK+QELEK L GLYESQ EAV REEV Sbjct: 1 MGIPGLSNQNNGKQWLGITEPISLAGPTEEDVVKSQELEKYLQGAGLYESQHEAVGREEV 60 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVK +SRAKGFNEQLV EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 61 LGRLDQIVKIWVKTISRAKGFNEQLVHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120 Query: 2082 
TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF EL+ ML EM EVTELHPVPDAHVPVMKFKF+G+S+DLLYA+L+LWVIPEDLD Sbjct: 121 TREEDFFGELRKMLSEMEEVTELHPVPDAHVPVMKFKFNGISVDLLYARLALWVIPEDLD 180 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQ+SILQNADEQTV SLNGCRVTDQ+LRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+GF Sbjct: 181 ISQESILQNADEQTVLSLNGCRVTDQVLRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 240 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGIN ALLV RICQLYPNALP+MLV+RFFRVYTQWRWPNPVMLCAIEEGSLGL +WDPR Sbjct: 241 LGGINLALLVGRICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLSVWDPR 300 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 RNP+DR H MPIITPAYPCMNS+YNV STLR+M+EEF+RG+EICEAM+ +KADW TLFE Sbjct: 301 RNPKDRYHLMPIITPAYPCMNSTYNVTLSTLRIMSEEFKRGSEICEAMEASKADWDTLFE 360 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 PYPFFEAYKNYL+IDITAEN DDLR+WKGWVESRLRQLTLKIER+T+GMLQCHP+PG+FS Sbjct: 361 PYPFFEAYKNYLQIDITAENADDLRQWKGWVESRLRQLTLKIERYTYGMLQCHPYPGEFS 420 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSR FH C+FMGLQRKQG P NEGEQFDIR TVEEFKHSV YTLWK GM+I VSH++R Sbjct: 421 DKSRTFHQCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNAYTLWKPGMDIHVSHVKR 480 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKNVG---VADETRKRKLAEG 832 RNIP F+FPGGVRP P+K G + + K + S QA+K+ G +E RKRK +E Sbjct: 481 RNIPNFIFPGGVRPLLPSKATG-ENRQSSKSRVSGHSQAEKSQGGKAATNEARKRKRSEE 539 Query: 831 NGESSFTHI-KHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSDAARQRQHVEDNNVKNN 655 N E++ + I K F ++ E E SV SCS+ D + E N++ Sbjct: 540 NVENNNSKISKSFVSLSP--PNKEVHEDITPIISVTSSCSMKFDDS------EVNSISAQ 591 Query: 654 STDGKCHTASASEGVAGEGSEIGSAL---PIIAPGAATSTSREAEELAIEKIMSAQTINH 484 ++ C E +G+ GS + + AP A S ++E E LAIE+IMS H Sbjct: 592 KSEKPC-LKLVGEIPSGDSQAYGSVMGNQQLTAPDA--SNTKEEERLAIEQIMSGPYEVH 648 Query: 483 TGFPEELDELE-EYGMTDHERDLGG-VMNGRANESLTTKPVQGLSVGSRDSCXXXXXXXX 310 EE DELE + G + +D GG V + + S+ V V +++ Sbjct: 649 
QALAEESDELEDDMGYRNQVKDNGGSVKSNNFDISIPKFVVAEEQVIPKETICSTHLFSN 708 Query: 309 XXXXXXXXPCHTR-----VPASSSNPQRKPLIRLSLSSMAKTTGTSS 184 T +PA PQRKPLIRL+ +S+ K SS Sbjct: 709 GGLDELEPAELTAPLLCGIPAPV--PQRKPLIRLNFTSLGKALDKSS 753 >ref|XP_011627554.1| PREDICTED: nuclear poly(A) polymerase 1 [Amborella trichopoda] gi|769798422|ref|XP_011627555.1| PREDICTED: nuclear poly(A) polymerase 1 [Amborella trichopoda] Length = 533 Score = 934 bits (2415), Expect = 0.0 Identities = 455/512 (88%), Positives = 476/512 (92%), Gaps = 2/512 (0%) Frame = -3 Query: 2397 LGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEVLGRLDQIVKTWVKKV 2218 LGVTEPISLGGPSEFDV+KTQELEK L GLYESQEE+VSREEVLGRLDQIVK W+KKV Sbjct: 8 LGVTEPISLGGPSEFDVLKTQELEKFLEGAGLYESQEESVSREEVLGRLDQIVKVWIKKV 67 Query: 2217 SRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFMELQNMLE 2038 SRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFF EL ML Sbjct: 68 SRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFQELYAMLV 127 Query: 2037 EMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLDISQDSILQNADEQTV 1858 EMPEVTELHPVPDAHVPVMKFKF+GVSIDLLYAKLSLW+IPEDLDISQDSILQNADEQTV Sbjct: 128 EMPEVTELHPVPDAHVPVMKFKFNGVSIDLLYAKLSLWIIPEDLDISQDSILQNADEQTV 187 Query: 1857 RSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGFLGGINWALLVARICQ 1678 RSLNGCRVTDQILRLVPNIQ+FRTTLRCMRFWAKRRGVYSNV+GFLGGINWALLVARICQ Sbjct: 188 RSLNGCRVTDQILRLVPNIQSFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQ 247 Query: 1677 LYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPRRNPRDRLHQMPIITP 1498 LYPNALPSMLV+RFFRVYTQWRWPNPVMLCAIEEG+LGLP+WDPR+NPRD+LHQMPIITP Sbjct: 248 LYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGTLGLPVWDPRKNPRDKLHQMPIITP 307 Query: 1497 AYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFEPYPFFEAYKNYLEID 1318 AYPCMNSSYNV SSTLRVM EEFQRGNEICEAM++NK DWSTLFEPYPFFEAYKNYLEID Sbjct: 308 AYPCMNSSYNVSSSTLRVMMEEFQRGNEICEAMEINKCDWSTLFEPYPFFEAYKNYLEID 367 Query: 1317 ITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDKSRPFHCCFFMGLQ 1138 +TAENEDDLRKWKGWVESRLRQLTLKIER TF MLQCHPHP DFSDKSR FHCC+FMGLQ Sbjct: 368 
VTAENEDDLRKWKGWVESRLRQLTLKIERDTFRMLQCHPHPNDFSDKSRTFHCCYFMGLQ 427 Query: 1137 RKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRRRNIPLFVFPGGVRPS 958 RK+G P EGEQFDIRATVEEFKHSVG YTLWK GM+IQVSHIRRRN+P FVFPGGVRPS Sbjct: 428 RKKGVPILEGEQFDIRATVEEFKHSVGMYTLWKPGMDIQVSHIRRRNVPHFVFPGGVRPS 487 Query: 957 RPAKGWGFDGKSV-LKPKASDTVQADK-NVGV 868 RP K G + K V K KA D Q DK +VGV Sbjct: 488 RPLKTAGGEVKKVGSKRKAPDLAQGDKSSVGV 519 >ref|XP_010036910.1| PREDICTED: poly(A) polymerase type 3 [Eucalyptus grandis] gi|702495144|ref|XP_010036911.1| PREDICTED: poly(A) polymerase type 3 [Eucalyptus grandis] gi|702495149|ref|XP_010036912.1| PREDICTED: poly(A) polymerase type 3 [Eucalyptus grandis] gi|629082125|gb|KCW48570.1| hypothetical protein EUGRSUZ_K02240 [Eucalyptus grandis] Length = 732 Score = 934 bits (2415), Expect = 0.0 Identities = 499/762 (65%), Positives = 557/762 (73%), Gaps = 12/762 (1%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 MGS L+NRS GQ+ LG+TEPISL GP+E+DV+KT ELEK L D GLYESQ EAV REEV Sbjct: 1 MGSPLLSNRSGGQR-LGITEPISLSGPTEYDVIKTCELEKYLQDAGLYESQAEAVRREEV 59 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVK +SRAKG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 60 LGRLDQIVKIWVKSISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 119 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF ELQ ML EM EV+EL PVPDA+VPVM FKF+GVSIDLLYAKLSLWVIPEDLD Sbjct: 120 TREEDFFGELQRMLSEMSEVSELRPVPDAYVPVMGFKFNGVSIDLLYAKLSLWVIPEDLD 179 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQDSILQNAD+QTVRSLNGCRVTDQILRLVPNIQNFR TLRCM+FWAKRRGVYSNV+GF Sbjct: 180 ISQDSILQNADDQTVRSLNGCRVTDQILRLVPNIQNFRMTLRCMKFWAKRRGVYSNVAGF 239 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGG+NWALLVARICQLYPNALP+MLV+RFFRVYTQWRWPNPVMLC IEEGSLGL IWDPR Sbjct: 240 LGGVNWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCNIEEGSLGLQIWDPR 299 Query: 1542 
RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 RNP+DR H MPIITPAYPCMNSSYNV +STL++M+EEFQRG+++CEAM+ K +W TLFE Sbjct: 300 RNPKDRFHLMPIITPAYPCMNSSYNVSASTLQIMSEEFQRGSDVCEAMEAGKVEWDTLFE 359 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 P+ FFEAYKNYL+IDI+AEN DDLRKWKGWVESRLRQLTLKIERHT+ MLQCHPHPGDFS Sbjct: 360 PFGFFEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFS 419 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSRPFH CFFMGLQRKQG P NEGEQFDIR TVEEFK +V YT WK GMEI VSH+RR Sbjct: 420 DKSRPFHHCFFMGLQRKQGVPVNEGEQFDIRVTVEEFKQAVNLYTSWKPGMEIYVSHVRR 479 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKN---VGVADET---RKRKL 841 +NIP FVFPGG RP RP+K +D + + KA+ + Q DK+ G DE RKRK Sbjct: 480 KNIPDFVFPGGARPPRPSKA-TWDSRRAAELKAASSSQVDKSSEGQGTPDEKDDGRKRK- 537 Query: 840 AEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSDAARQRQHVEDNNVK 661 E ES+ +K A+ S G +ES +L SDA + + V+ Sbjct: 538 REEEVESNLKSVKVLAALPSSTGEAQES-------------ALASDATNGAGDIGHDVVQ 584 Query: 660 NNSTDGKCHTASASEGVAGEGSEIGSALPIIAPGAATSTSREAEELAIEKIMSAQTINHT 481 N+ T G + G GS A S SREAE +AIEKI S + H Sbjct: 585 NHIT---------GTGGSAYGKPSGSV-------ADESNSREAENIAIEKITSVPYVGHQ 628 Query: 480 GFPEELDELE-EYGMTDHERDLGGVMNGRANESLTTKPVQGLSVGSRDSCXXXXXXXXXX 304 F +ELDELE + D D SLT+ SV S + Sbjct: 629 DFSQELDELEDDVQQKDKYEDTAKGGKSPPMVSLTSN-ASSTSVTSSNGMSTSVSFYTSG 687 Query: 303 XXXXXXPCHTRVPASSSNP-----QRKPLIRLSLSSMAKTTG 193 P P NP QRKPLIRLSL+S+ K TG Sbjct: 688 DLEELEPAELMAPPPIVNPPAPPTQRKPLIRLSLTSLGKATG 729 >ref|XP_004137491.1| PREDICTED: nuclear poly(A) polymerase 1 isoform X1 [Cucumis sativus] gi|700209059|gb|KGN64155.1| hypothetical protein Csa_1G042640 [Cucumis sativus] Length = 748 Score = 934 bits (2415), Expect = 0.0 Identities = 486/767 (63%), Positives = 568/767 (74%), Gaps = 15/767 (1%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 MGS L R+NGQQ LG+T+PISL GP+E+DV+KT+ELEK L D GLYESQE+AV+REEV Sbjct: 1 
MGSPALCGRNNGQQRLGITDPISLSGPTEYDVLKTRELEKYLQDAGLYESQEDAVNREEV 60 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVK +SRAKG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 61 LGRLDQIVKIWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF EL ML EMPEV+ELHPVPDAHVPVM+FK SGVSIDLLYAKLSLWVIPEDLD Sbjct: 121 TREEDFFGELHKMLSEMPEVSELHPVPDAHVPVMRFKLSGVSIDLLYAKLSLWVIPEDLD 180 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQDSILQN DEQTVRSLNGCRVTD+ILRLVPNIQ+FRTTLRCMRFWAKRRGVYSNVSGF Sbjct: 181 ISQDSILQNTDEQTVRSLNGCRVTDRILRLVPNIQSFRTTLRCMRFWAKRRGVYSNVSGF 240 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALP+MLV+RFFRV+TQWRWPNPVMLCA EEGSLGL +WDPR Sbjct: 241 LGGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCANEEGSLGLQVWDPR 300 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 RNP+DR H MPIITPAYPCMNSSYNV +STLR+MTEEF+RG++ICE M+ NK+DW TLFE Sbjct: 301 RNPKDRYHLMPIITPAYPCMNSSYNVSASTLRIMTEEFRRGHDICEVMEENKSDWDTLFE 360 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 PYPFFEAYKNYL+IDITAEN+DD+R WKGWVESRLRQLTLKIERHT+ MLQCHP+PGDFS Sbjct: 361 PYPFFEAYKNYLQIDITAENDDDIRIWKGWVESRLRQLTLKIERHTYNMLQCHPYPGDFS 420 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSRPFH C+FMGLQRKQG PA+ GEQFDIR TV+EFKHSV YT KRGMEI VSH++R Sbjct: 421 DKSRPFHHCYFMGLQRKQGGPASGGEQFDIRLTVDEFKHSVNVYTQRKRGMEIYVSHVKR 480 Query: 1002 RNIPLFVFPGGVRPSRPAK-GWGFDGKSVLKPKASDTVQADKNVGVA-----DETRKRKL 841 R+IP FVFPGGVRPSR +K W S LK ASD+ Q D D+ RKR Sbjct: 481 RSIPNFVFPGGVRPSRASKLTWDIRRSSELK--ASDSTQVDSPSEATESLDGDDRRKRIR 538 Query: 840 AEGNGESSFTHIKHFKAMDSGCGGTEE----SEICKSHTSVIGSCSLDSDAARQRQHVED 673 + N T++++ + + EE S++ + + I + +A +++ D Sbjct: 539 IDDNAN---TNLRNGECLAMAHSHPEEVHEVSQVSNTSSCSIKDVNFIPTSANNLENLAD 595 Query: 672 
NNVKNNSTDGKCHTASASEGVAGEGSEIGSALPIIAPGAATSTSREAEELAIEKIMSAQT 493 + +NN G + ++ V+ ++ TS +EAE+LAI+KI+S Sbjct: 596 VSSQNNGDHGSLRVSPSTNNVSDAAAD-------------TSNCKEAEKLAIQKILSDSY 642 Query: 492 INHTGFPEELDELEEYGMTDHERDLGGVMNGRANES--LTTKPVQGLSVGSRDSCXXXXX 319 +H FP E +ELE++ + +D G G S T P+ L S + Sbjct: 643 DSHQDFPCETEELEDFDYNNQAKDFGATKQGSPMMSSVANTSPLV-LPTVSCNEARQSSS 701 Query: 318 XXXXXXXXXXXPCHTRVPASSSNP---QRKPLIRLSLSSMAKTTGTS 187 P P S+ +RKP+IRLS +S+ K +S Sbjct: 702 SYYNGGLEELEPAEIVAPLSTGTAPVAERKPIIRLSFTSLGKAGKSS 748 >ref|XP_007210342.1| hypothetical protein PRUPE_ppa001856mg [Prunus persica] gi|462406077|gb|EMJ11541.1| hypothetical protein PRUPE_ppa001856mg [Prunus persica] Length = 755 Score = 934 bits (2414), Expect = 0.0 Identities = 499/770 (64%), Positives = 569/770 (73%), Gaps = 17/770 (2%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 M S GL+NR+NG++ LG+TEPISLGGP+E+DV+KT+ELEK L D LYESQEEAVSREEV Sbjct: 1 MASPGLSNRNNGKR-LGITEPISLGGPTEYDVIKTRELEKYLQDARLYESQEEAVSREEV 59 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVK +SR KG NEQLV EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA Sbjct: 60 LGRLDQIVKIWVKTISRTKGLNEQLVHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 119 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 TREEDFF ELQ ML EMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD Sbjct: 120 TREEDFFGELQRMLSEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 179 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQDSILQNADEQTVRSLNGCRVTDQILRLVP+IQNFRTTLRCMR WAKRRGVYSNV+GF Sbjct: 180 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPSIQNFRTTLRCMRLWAKRRGVYSNVAGF 239 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALP+MLV+RFFRVYTQWRWPNPVMLCAIEEGSLGL +WDPR Sbjct: 240 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPR 299 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 RNP+D+ H MPIITPAYP MNSSYNV SSTLR+M 
EEFQRGNEICEAM+ NKADW TLFE Sbjct: 300 RNPKDKYHLMPIITPAYPSMNSSYNVSSSTLRIMLEEFQRGNEICEAMEANKADWDTLFE 359 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 Y FFEAYKNYL+IDI+AEN DD RKWKGWVESRLRQLTLKIERHT+GMLQCHPHPGDFS Sbjct: 360 SYDFFEAYKNYLQIDISAENADDFRKWKGWVESRLRQLTLKIERHTYGMLQCHPHPGDFS 419 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSRPFH +FMGLQRKQG P EGEQFDIRATVEEFK SV YTL +RGMEI+VSH++R Sbjct: 420 DKSRPFHSSYFMGLQRKQGVPVTEGEQFDIRATVEEFKQSVNLYTLLERGMEIRVSHVKR 479 Query: 1002 RNIPLFVFPGGVRPSRPAK-GWGFDGKSVLKPKASDTVQADK------NVGVADETRKRK 844 RNIP FVFPG VRP R +K WG S L K S Q DK ++ +D +KRK Sbjct: 480 RNIPNFVFPGEVRPLRLSKVTWGSRRGSEL--KVSGDSQPDKLCEGKTDLDGSDGGQKRK 537 Query: 843 LAEGNGESSFTHIKHFKAMDSGCGGTEESEICKSHTSVIGSCSLDSDAARQRQHVEDNNV 664 + N E T+ ++ K++ G E S I SCS ++ + V+D+ Sbjct: 538 RVDDNVE---TNSRYAKSLHLSSG---EVHAASPPISNISSCSTKCESMDANKKVDDSIA 591 Query: 663 KNNSTDGKCHTASASEGVAGEGSEIGSALPIIAPGAATSTSREAEELAIEKIMSAQTINH 484 + G S + A TS+S+EAE++A+ K M+ ++H Sbjct: 592 DSLEKIENPADIPYQNGQIEVSSRCKPPNDSLPAAANTSSSKEAEKMALGKNMAGPYVSH 651 Query: 483 TGFPEELDELEE-----YGMTDHERDLGGVMNGRANESLTTKPVQGLSVGSRDSCXXXXX 319 P ELDELE+ + + D R++ + ES++ S G+ S Sbjct: 652 QALP-ELDELEDDSEHGHQVKDFSRNMKSSQMEPSEESVSVSAPVNSSNGAGPS-----T 705 Query: 318 XXXXXXXXXXXPCHTRVPASSSNP-----QRKPLIRLSLSSMAKTTGTSS 184 P VP+S+ P Q+K +IRL+ +S+AK +G SS Sbjct: 706 DSYNGGLEELEPAELMVPSSNGTPPEPVAQKKSIIRLNFTSLAKASGKSS 755 >ref|XP_009387262.1| PREDICTED: poly(A) polymerase PAPalpha isoform X2 [Musa acuminata subsp. 
malaccensis] Length = 752 Score = 934 bits (2413), Expect = 0.0 Identities = 507/779 (65%), Positives = 555/779 (71%), Gaps = 26/779 (3%) Frame = -3 Query: 2442 MGSAGLNNRSNGQQYLGVTEPISLGGPSEFDVVKTQELEKCLADVGLYESQEEAVSREEV 2263 M S+GL RSNG +LGVTEPIS GP+E+DV+KTQELEK LAD GLYESQEEAVSREE+ Sbjct: 1 MESSGLVKRSNG--HLGVTEPISWSGPTEYDVIKTQELEKYLADAGLYESQEEAVSREEI 58 Query: 2262 LGRLDQIVKTWVKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 2083 LGRLDQIVK WVKKVSRAKGFNEQ VQEANAKIFTFGSYRLG Sbjct: 59 LGRLDQIVKIWVKKVSRAKGFNEQFVQEANAKIFTFGSYRLG------------------ 100 Query: 2082 TREEDFFMELQNMLEEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLD 1903 EDFF EL NML EMPEVTELHPVPDAHVPVM+FKFSGVSIDLLYAKLSLWVIPEDLD Sbjct: 101 ---EDFFTELHNMLSEMPEVTELHPVPDAHVPVMRFKFSGVSIDLLYAKLSLWVIPEDLD 157 Query: 1902 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGF 1723 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+GF Sbjct: 158 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 217 Query: 1722 LGGINWALLVARICQLYPNALPSMLVARFFRVYTQWRWPNPVMLCAIEEGSLGLPIWDPR 1543 LGGINWALLVARICQLYPNALPSMLV+RFFRVYTQWRWPNPVMLC I+EG+LGLPIWDPR Sbjct: 218 LGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCEIQEGTLGLPIWDPR 277 Query: 1542 RNPRDRLHQMPIITPAYPCMNSSYNVLSSTLRVMTEEFQRGNEICEAMKVNKADWSTLFE 1363 RN RDRLHQMPIITPAYPCMNSSYNV SSTLRVMTEEFQRGNEICEAM+ NKADW TLFE Sbjct: 278 RNFRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGNEICEAMEANKADWDTLFE 337 Query: 1362 PYPFFEAYKNYLEIDITAENEDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 1183 PYPFFEAYKNYLEIDITA+NE DLRKWKGWVESRLR LTLKIERHTFGML CHP P DFS Sbjct: 338 PYPFFEAYKNYLEIDITADNESDLRKWKGWVESRLRTLTLKIERHTFGMLHCHPCPRDFS 397 Query: 1182 DKSRPFHCCFFMGLQRKQGKPANEGEQFDIRATVEEFKHSVGNYTLWKRGMEIQVSHIRR 1003 DKSRPFHCC+FMGLQRKQG P E EQFDIR TV++FK+SV YTLWK GMEIQVSH +R Sbjct: 398 DKSRPFHCCYFMGLQRKQGVPVQESEQFDIRGTVDDFKNSVSMYTLWKPGMEIQVSHRKR 457 Query: 1002 RNIPLFVFPGGVRPSRPAKGWGFDGKSVLKPKASDTVQADKNVG----VADETRKRKLAE 835 RN+PLFVFPGGVRPSRP K G DG +V K SD V A K G VAD + RK E Sbjct: 458 
RNVPLFVFPGGVRPSRPPKVAGVDGHAVSGRKVSDMVHAGKPAGNVSHVADASTDRKQME 517 Query: 834 GNGES------SFTHIKHFKAMDSGCGGT------------EESEICKSHTSVIGSCSL- 712 G G S S + + K +D+ + SE+ + G + Sbjct: 518 GKGASCDPIVESSSESRKGKQLDNRTDSNAANMNNLVDHILKPSEMGTPSSFANGVLDVP 577 Query: 711 DSDAARQRQHVEDNNVKNNSTDGKCHTASASEGVAGEGSEIGSALPIIAPGAATSTSREA 532 D R+ V ++ S H+ E A + +G + G + S+EA Sbjct: 578 DESRKRKCMDVTTDSFATGSEFQADHSFKRPETSAAIAASVGPVTE-VDNGESIFCSKEA 636 Query: 531 EELAIEKIMSAQTINHTGFPEELDELEEYGMTDHERDLGGVMNGRANESLTTKPV---QG 361 E LAI KI S N PE LDELE + H++ GG + G + ES T K G Sbjct: 637 ETLAISKITSVPPSNLAALPEGLDELEYFESQGHDKGFGGPVGGHSVESSTVKDAITQLG 696 Query: 360 LSVGSRDSCXXXXXXXXXXXXXXXXPCHTRVPASSSNPQRKPLIRLSLSSMAKTTGTSS 184 S GS PAS++N QRKPL RL LS++AK+ G S Sbjct: 697 SSYGSNTKNGGVEELEKSSELSAPYL--GGAPASTANTQRKPL-RLRLSTVAKSAGERS 752