BLASTX nr result

ID: Papaver29_contig00008321 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver29_contig00008321
         (2653 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010257444.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) poly...   910   0.0  
gb|KJB37195.1| hypothetical protein B456_006G193600 [Gossypium r...   895   0.0  
ref|XP_012486421.1| PREDICTED: nuclear poly(A) polymerase 1 isof...   895   0.0  
ref|XP_007036647.1| Poly(A) polymerase 1 isoform 1 [Theobroma ca...   895   0.0  
ref|XP_002279968.2| PREDICTED: nuclear poly(A) polymerase 1 [Vit...   894   0.0  
ref|XP_010265872.1| PREDICTED: poly(A) polymerase PAPalpha-like ...   892   0.0  
emb|CDO98397.1| unnamed protein product [Coffea canephora]            889   0.0  
ref|XP_010916255.1| PREDICTED: nuclear poly(A) polymerase 1 [Ela...   884   0.0  
gb|KJB37194.1| hypothetical protein B456_006G193600 [Gossypium r...   879   0.0  
ref|XP_008775850.1| PREDICTED: poly(A) polymerase PAPalpha [Phoe...   879   0.0  
ref|XP_009387260.1| PREDICTED: poly(A) polymerase PAPalpha isofo...   877   0.0  
ref|XP_011627554.1| PREDICTED: nuclear poly(A) polymerase 1 [Amb...   874   0.0  
ref|XP_011009627.1| PREDICTED: nuclear poly(A) polymerase 1 [Pop...   872   0.0  
gb|ERN17250.1| hypothetical protein AMTR_s00044p00207970 [Ambore...   870   0.0  
ref|XP_002322074.2| hypothetical protein POPTR_0015s04100g [Popu...   869   0.0  
ref|XP_010110105.1| Poly(A) polymerase [Morus notabilis] gi|5879...   868   0.0  
ref|XP_007210342.1| hypothetical protein PRUPE_ppa001856mg [Prun...   860   0.0  
ref|XP_012075422.1| PREDICTED: nuclear poly(A) polymerase 1-like...   859   0.0  
ref|XP_006428723.1| hypothetical protein CICLE_v10011139mg [Citr...   857   0.0  
ref|XP_004512881.1| PREDICTED: nuclear poly(A) polymerase 1 [Cic...   857   0.0  

>ref|XP_010257444.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) polymerase PAPalpha-like
            [Nelumbo nucifera]
          Length = 741

 Score =  910 bits (2351), Expect = 0.0
 Identities = 479/738 (64%), Positives = 547/738 (74%), Gaps = 11/738 (1%)
 Frame = -3

Query: 2210 MANPGVT---NVQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVL 2040
            M +PG     N + LG+TEPISL GP+E D+ KT+ELEKFL+  GLYE   EAVSREEVL
Sbjct: 1    MGSPGSNVRNNGRHLGVTEPISLGGPTEFDVIKTRELEKFLAEAGLYESQEEAVSREEVL 60

Query: 2039 GRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHAT 1860
            G LDQ+VK W+K V+R +G+N+QLVQEANAKI+TFGSYRLGVHGPGAD+DTLC+GP+HAT
Sbjct: 61   GSLDQVVKKWIKAVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADVDTLCVGPRHAT 120

Query: 1859 REEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDI 1680
            REEDFFVEL+ ML EMPEV+ELHPVPDAHVPVMKFKF GVSIDLLYAKLSL +IPEDLDI
Sbjct: 121  REEDFFVELHKMLAEMPEVTELHPVPDAHVPVMKFKFNGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1679 SQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFL 1500
            SQD+ILQN D+QTVRSLNGCRVTD+IL LVPN+QNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181  SQDSILQNADEQTVRSLNGCRVTDRILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 1499 GGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRR 1326
            GGINWALLVARICQLYPNALPSMLVSRFFRV+ QWRWPNPVMLCAIEEGS  LP+WDPR+
Sbjct: 241  GGINWALLVARICQLYPNALPSMLVSRFFRVFAQWRWPNPVMLCAIEEGSLGLPVWDPRK 300

Query: 1325 NHRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQ 1146
            N+RDRLHQMPIITPAYPCMNSSYNVSSSTLRVM +EFQRG+EICE ME NKADWN LFE 
Sbjct: 301  NYRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMXQEFQRGNEICEPMEKNKADWNALFEP 360

Query: 1145 YPFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSD 966
            YPFF+AYKNYLQI+ISAENDDDLR WKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSD
Sbjct: 361  YPFFEAYKNYLQIDISAENDDDLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSD 420

Query: 965  TSKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRR 786
             S+  HC YFMGL R QGV VHEG+QFDIRATVEEFKLSVGMYTLWKP MEI+VSHI+RR
Sbjct: 421  KSRPFHCSYFMGLSRKQGVSVHEGKQFDIRATVEEFKLSVGMYTLWKPRMEIHVSHIKRR 480

Query: 785  NIPSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVE---KGRKXX 615
            NIP FVFPGG+RPSR  K  GE +PVSN +                   V    + RK  
Sbjct: 481  NIPLFVFPGGIRPSRPAKEDGESKPVSNLKLCNSVQASKSCESAVGAMGVADDIRKRKLG 540

Query: 614  XXXDESATNIKSLPSIGAAANGRILEDSGDCSASFTRPSGSITTTTSTAMSVGNMVDALI 435
               DE+      L S+           + + S S + P+ S T TT+T     + V    
Sbjct: 541  YDNDENNPRAAKLLSV----------TTTEGSVSRSSPTAS-TCTTATFYDAPSEVRERG 589

Query: 434  LQDGAIKTEQVEYINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSKEAGQC 255
              +  +    +      +   L+ VPS    +  +V   P+ +  S ++   CSKEA + 
Sbjct: 590  QHENNLGDSPI------SATCLTGVPSHGGEAEGSVRCSPLVKPSSTNSDLVCSKEAEKL 643

Query: 254  ATDYI-SGPSAGREVF-SELNELEDESGLV-QDNGGNTMEGSFAESSTVKQIALLATNGA 84
            A + I SGPS  ++ F  EL+ELED+ G   Q           +  S  + +   A NG 
Sbjct: 644  AIEKIASGPSVSQQGFLEELDELEDDIGSTDQVKVFGVSRKGISSESLAENVRAAAVNGI 703

Query: 83   GSSSSIGRFQNGGLEELE 30
               S  G  QNGGLEELE
Sbjct: 704  --CSPAGFIQNGGLEELE 719


>gb|KJB37195.1| hypothetical protein B456_006G193600 [Gossypium raimondii]
          Length = 748

 Score =  895 bits (2314), Expect = 0.0
 Identities = 473/743 (63%), Positives = 542/743 (72%), Gaps = 11/743 (1%)
 Frame = -3

Query: 2210 MANPGV---TNVQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVL 2040
            M +PG+    + Q+LGITEPISL GP+E D+ KT+ELEK+L  VGLYE   EAVSREEVL
Sbjct: 1    MGSPGLGTGNSGQRLGITEPISLGGPTEYDVIKTRELEKYLQNVGLYESQEEAVSREEVL 60

Query: 2039 GRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHAT 1860
            GRLDQIVK WVK ++R +G N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HAT
Sbjct: 61   GRLDQIVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1859 REEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDI 1680
            REEDFF EL+ ML+EMPEVSELHPVPDAHVP+MKFKF GVSIDLLYAKLSL +IPEDLDI
Sbjct: 121  REEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1679 SQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFL 1500
            SQD+ILQN DDQTVRSLNGCRVTD+IL LVPN+QNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181  SQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 1499 GGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRR 1326
            GGINWALLVARICQLYPNALP+MLVSRFFRVYTQWRWPNPVMLCAI+EGS  L +WDPR+
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIKEGSLGLQVWDPRK 300

Query: 1325 NHRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQ 1146
            N +DR H MPIITPAYP MNSSYNVSSSTLR+MT+EFQRG EICEAME NKADW+ LFE 
Sbjct: 301  NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDALFEA 360

Query: 1145 YPFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSD 966
            Y FF+AYKNYLQI+ISAENDDDLRNWKGWVESRLRQLTLKIERHT+ MLQCHPHPGDF D
Sbjct: 361  YAFFEAYKNYLQIDISAENDDDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQD 420

Query: 965  TSKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRR 786
             S+  HC YFMGLQR  GVPV+EGEQFDIR TVEEFK SV  YTLWKPGMEI VSH++RR
Sbjct: 421  NSRPFHCSYFMGLQRKLGVPVNEGEQFDIRLTVEEFKHSVNTYTLWKPGMEIRVSHVKRR 480

Query: 785  NIPSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXX 606
            +IPSFVFPGGVRPSR  K   + R  S+ +                 D    G+K     
Sbjct: 481  SIPSFVFPGGVRPSRPSKATWDSRRASDAKVSGHAGSDKPGEVKGAADGQVDGKKRKRAD 540

Query: 605  DESATNIKSLPSIGAAANGRILEDSGDCSASFTRPSGSITTTTSTAMSVGNMVDALILQD 426
            D + T +K+   I A             S+S    +GS   T S     G+ VDA  L +
Sbjct: 541  DSADTQLKNSKYITAVP-----------SSSAEVQAGSPGGTVSPCSLKGDNVDATGLVE 589

Query: 425  GAIKTEQVEYINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSKEAGQCATD 246
                 ++    N        E+ SL+     ++  +P    L   A +  SKEA + A +
Sbjct: 590  PTRGKDESNMTNGSKTSSTDELSSLNSEVDGSLRCIPPHTGLHVTADASSSKEAEKLAIE 649

Query: 245  YI-SGPSAGREVF-SELNELEDESGLVQD--NGGNTMEGSFAE--SSTVKQIALLATNGA 84
             I SGP    + F  E  ELED+        + GNT  G      S       ++++NGA
Sbjct: 650  QIMSGPYVSHQAFPEEPEELEDDLEFRNRVVSVGNTNNGPLQAPVSDAAGAAPIISSNGA 709

Query: 83   GSSSSIGRFQNGGLEELEPAELT 15
            G S S+    +G +EELEPAELT
Sbjct: 710  GPSISL--HASGSIEELEPAELT 730


>ref|XP_012486421.1| PREDICTED: nuclear poly(A) polymerase 1 isoform X1 [Gossypium
            raimondii] gi|823176367|ref|XP_012486422.1| PREDICTED:
            nuclear poly(A) polymerase 1 isoform X1 [Gossypium
            raimondii] gi|823176370|ref|XP_012486423.1| PREDICTED:
            nuclear poly(A) polymerase 1 isoform X1 [Gossypium
            raimondii] gi|763769978|gb|KJB37193.1| hypothetical
            protein B456_006G193600 [Gossypium raimondii]
            gi|763769981|gb|KJB37196.1| hypothetical protein
            B456_006G193600 [Gossypium raimondii]
          Length = 762

 Score =  895 bits (2314), Expect = 0.0
 Identities = 473/743 (63%), Positives = 542/743 (72%), Gaps = 11/743 (1%)
 Frame = -3

Query: 2210 MANPGV---TNVQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVL 2040
            M +PG+    + Q+LGITEPISL GP+E D+ KT+ELEK+L  VGLYE   EAVSREEVL
Sbjct: 1    MGSPGLGTGNSGQRLGITEPISLGGPTEYDVIKTRELEKYLQNVGLYESQEEAVSREEVL 60

Query: 2039 GRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHAT 1860
            GRLDQIVK WVK ++R +G N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HAT
Sbjct: 61   GRLDQIVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1859 REEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDI 1680
            REEDFF EL+ ML+EMPEVSELHPVPDAHVP+MKFKF GVSIDLLYAKLSL +IPEDLDI
Sbjct: 121  REEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1679 SQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFL 1500
            SQD+ILQN DDQTVRSLNGCRVTD+IL LVPN+QNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181  SQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 1499 GGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRR 1326
            GGINWALLVARICQLYPNALP+MLVSRFFRVYTQWRWPNPVMLCAI+EGS  L +WDPR+
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIKEGSLGLQVWDPRK 300

Query: 1325 NHRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQ 1146
            N +DR H MPIITPAYP MNSSYNVSSSTLR+MT+EFQRG EICEAME NKADW+ LFE 
Sbjct: 301  NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDALFEA 360

Query: 1145 YPFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSD 966
            Y FF+AYKNYLQI+ISAENDDDLRNWKGWVESRLRQLTLKIERHT+ MLQCHPHPGDF D
Sbjct: 361  YAFFEAYKNYLQIDISAENDDDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQD 420

Query: 965  TSKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRR 786
             S+  HC YFMGLQR  GVPV+EGEQFDIR TVEEFK SV  YTLWKPGMEI VSH++RR
Sbjct: 421  NSRPFHCSYFMGLQRKLGVPVNEGEQFDIRLTVEEFKHSVNTYTLWKPGMEIRVSHVKRR 480

Query: 785  NIPSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXX 606
            +IPSFVFPGGVRPSR  K   + R  S+ +                 D    G+K     
Sbjct: 481  SIPSFVFPGGVRPSRPSKATWDSRRASDAKVSGHAGSDKPGEVKGAADGQVDGKKRKRAD 540

Query: 605  DESATNIKSLPSIGAAANGRILEDSGDCSASFTRPSGSITTTTSTAMSVGNMVDALILQD 426
            D + T +K+   I A             S+S    +GS   T S     G+ VDA  L +
Sbjct: 541  DSADTQLKNSKYITAVP-----------SSSAEVQAGSPGGTVSPCSLKGDNVDATGLVE 589

Query: 425  GAIKTEQVEYINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSKEAGQCATD 246
                 ++    N        E+ SL+     ++  +P    L   A +  SKEA + A +
Sbjct: 590  PTRGKDESNMTNGSKTSSTDELSSLNSEVDGSLRCIPPHTGLHVTADASSSKEAEKLAIE 649

Query: 245  YI-SGPSAGREVF-SELNELEDESGLVQD--NGGNTMEGSFAE--SSTVKQIALLATNGA 84
             I SGP    + F  E  ELED+        + GNT  G      S       ++++NGA
Sbjct: 650  QIMSGPYVSHQAFPEEPEELEDDLEFRNRVVSVGNTNNGPLQAPVSDAAGAAPIISSNGA 709

Query: 83   GSSSSIGRFQNGGLEELEPAELT 15
            G S S+    +G +EELEPAELT
Sbjct: 710  GPSISL--HASGSIEELEPAELT 730


>ref|XP_007036647.1| Poly(A) polymerase 1 isoform 1 [Theobroma cacao]
            gi|590665102|ref|XP_007036648.1| Poly(A) polymerase 1
            isoform 1 [Theobroma cacao] gi|508773892|gb|EOY21148.1|
            Poly(A) polymerase 1 isoform 1 [Theobroma cacao]
            gi|508773893|gb|EOY21149.1| Poly(A) polymerase 1 isoform
            1 [Theobroma cacao]
          Length = 762

 Score =  895 bits (2312), Expect = 0.0
 Identities = 470/747 (62%), Positives = 548/747 (73%), Gaps = 15/747 (2%)
 Frame = -3

Query: 2210 MANPGV---TNVQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVL 2040
            M +PG+    N Q+LGITEPISL GP++ D+ KT+ELEK+L  VGLYE   EAV REEVL
Sbjct: 1    MGSPGLGNRNNGQRLGITEPISLGGPTDYDVIKTRELEKYLQNVGLYESQEEAVGREEVL 60

Query: 2039 GRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHAT 1860
            GRLDQ VK WVK ++R +G N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HAT
Sbjct: 61   GRLDQTVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1859 REEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDI 1680
            REEDFF ELY ML+EMPEVSELHPVPDAHVPVMKFKF GVSIDLLYAKLSL +IPEDLDI
Sbjct: 121  REEDFFGELYKMLSEMPEVSELHPVPDAHVPVMKFKFKGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1679 SQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFL 1500
            SQD+ILQN D+QTVRSLNGCRVTD+IL LVPN+QNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181  SQDSILQNTDEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 1499 GGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRR 1326
            GGINWALLVARICQLYPNALP+MLVSRFFRVYTQWRWPNPVMLCAIEEGS  L +WDPR+
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRK 300

Query: 1325 NHRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQ 1146
            N +DR H MPIITPAYPCMNSSYNVSSSTLR+MT+EFQRG EICEAME NKADW+ LFE 
Sbjct: 301  NPKDRYHLMPIITPAYPCMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDILFES 360

Query: 1145 YPFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSD 966
            Y FF+AYKNYLQI+ISAEN DDLR WKGWVESRLRQLTLKIERHT+ MLQCHPHPGDF D
Sbjct: 361  YAFFEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQD 420

Query: 965  TSKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRR 786
             S+  H  YFMGLQR QGVPV+EGEQFDIR TVEEFK SV MYTLWKPGMEI V+H++RR
Sbjct: 421  KSRPFHGSYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNMYTLWKPGMEIRVTHVKRR 480

Query: 785  NIPSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXX 606
            NIPSFVFPGGVRPSR  KV  +   VS+ +                 D  + G+K     
Sbjct: 481  NIPSFVFPGGVRPSRPSKVTWDSMRVSDAKVSGHAGPDKSGEVKGVADGQDDGKKRKRVD 540

Query: 605  DESATNIKSLPSIGAAANGRILEDSGDCSASFTRPSGSITTTTSTAMSVGNMVDALILQD 426
            D     ++S   I A             S+S     GS  +T S+  + G+  DA     
Sbjct: 541  DNGDAQLRSSKYITAVP-----------SSSLEGRVGSPVSTVSSCSTKGDYSDA----T 585

Query: 425  GAIKTEQVEYINEEANPLLS--EVPSLSENSGSTVSLLPISRTLSADA-ASFCSKEAGQC 255
            G I+T + +  +   N L++   +  LS ++G     +  +  +   A AS C++     
Sbjct: 586  GLIETTREKAESNMTNGLINSRSLEELSSHNGEVDGSVGCNPPIKVSADASSCTEAENLA 645

Query: 254  ATDYISGPSAGREVF-SELNELEDESGL------VQDNGGNTMEGSFAESSTVKQIALLA 96
                +SGP    + F  EL ELED+         V++     +E S ++ +    +   +
Sbjct: 646  IEKIMSGPYGAHQAFPQELEELEDDLEFRNQVRSVENTKSGPVESSMSDLAGAAPVT--S 703

Query: 95   TNGAGSSSSIGRFQNGGLEELEPAELT 15
            +NGAG S+S+    +GG+EELEPAELT
Sbjct: 704  SNGAGPSTSL--HASGGIEELEPAELT 728


>ref|XP_002279968.2| PREDICTED: nuclear poly(A) polymerase 1 [Vitis vinifera]
          Length = 757

 Score =  894 bits (2309), Expect = 0.0
 Identities = 476/743 (64%), Positives = 550/743 (74%), Gaps = 12/743 (1%)
 Frame = -3

Query: 2210 MANPGVTNV----QQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEV 2043
            M+N G+ N     Q+LGITEPISL GP+E+D+TKTQELEKFL+  GLYE   EAVSREEV
Sbjct: 1    MSNLGLNNRNNSGQRLGITEPISLGGPNELDVTKTQELEKFLAAAGLYESQEEAVSREEV 60

Query: 2042 LGRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHA 1863
            LGRLDQIVK WVK ++R +G N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HA
Sbjct: 61   LGRLDQIVKIWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120

Query: 1862 TREEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLD 1683
            TREEDFF EL+ ML+EMPEV+ELHPVPDAHVPVM+FKF GVSIDLLYAKLSL +IPEDLD
Sbjct: 121  TREEDFFGELHKMLSEMPEVTELHPVPDAHVPVMRFKFSGVSIDLLYAKLSLWVIPEDLD 180

Query: 1682 ISQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGF 1503
            +SQD+ILQN D+QTVRSLNGCRVTD+IL LVPN+QNFRTTLR MRFWAKRRGVYSNVAGF
Sbjct: 181  VSQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRFMRFWAKRRGVYSNVAGF 240

Query: 1502 LGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPR 1329
            LGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEG+  L +WDPR
Sbjct: 241  LGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGTLGLQVWDPR 300

Query: 1328 RNHRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFE 1149
            +  +DR H MPIITPAYPCMNSSYNVSSSTLR+M+EEF+RG+EI E ME NKADW  L E
Sbjct: 301  KYPKDRFHLMPIITPAYPCMNSSYNVSSSTLRIMSEEFKRGNEISEVMEANKADWATLCE 360

Query: 1148 QYPFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 969
             YPFF+AYKNYLQIEI+AEN DDLR WKGWVESRLRQLTLKIERHT+ MLQCHPHPGDFS
Sbjct: 361  PYPFFEAYKNYLQIEIAAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFS 420

Query: 968  DTSKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRR 789
            D S+  HCCYFMGLQR QGVP  EGEQFDIR TV+EFK SVGMYTLWKPGMEI+V H+RR
Sbjct: 421  DKSRPFHCCYFMGLQRKQGVPASEGEQFDIRLTVDEFKHSVGMYTLWKPGMEIHVIHVRR 480

Query: 788  RNIPSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXX 609
            RNIP+FVFPGGVRPSR  KVA E R V                    ++  E  +K    
Sbjct: 481  RNIPNFVFPGGVRPSRPTKVASERRRVLEPN----------VSTQAVLEGAEDSKKRKRE 530

Query: 608  XDESATNIKSLPSIGAAANGRILEDSGDCSASFTRPSGSITTTTSTAMSVGNMVDALILQ 429
             +   TN ++   + AAA+          S      +  ++T  + ++ V +M   ++ +
Sbjct: 531  DENVETNSRNAKCLVAAASS---------SHEVLSSNPLVSTVNACSIKVDSMDINMLGK 581

Query: 428  DGAIKTE-QVEYINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSKEAGQCA 252
                K E  +E+  +  N  +   P   E  GS     PI +TLS+   S  S EA + A
Sbjct: 582  TRKEKVENNIEHGLKNLNNSVEVPPQNGEVDGSVRCSHPI-KTLSSSGGSPSSTEAEKIA 640

Query: 251  TDYI-SGPSAGREVF-SELNELEDE---SGLVQDNGGNTMEGSFAESSTVKQIALLATNG 87
             + I SGP    + F  EL+ELED+      V+D  G+T +GS AESS         T  
Sbjct: 641  IEKIMSGPYVSHQAFPGELDELEDDVEYKNQVKDFTGST-KGSSAESSKANVAEEPLTTT 699

Query: 86   AGSSSSIGRFQNGGLEELEPAEL 18
            +G+        NGGLEELEPAEL
Sbjct: 700  SGTVPCTILSPNGGLEELEPAEL 722


>ref|XP_010265872.1| PREDICTED: poly(A) polymerase PAPalpha-like [Nelumbo nucifera]
          Length = 756

 Score =  892 bits (2305), Expect = 0.0
 Identities = 478/747 (63%), Positives = 541/747 (72%), Gaps = 12/747 (1%)
 Frame = -3

Query: 2210 MANPG--VTNVQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVLG 2037
            M +PG  V N   LG+TEPISL+GP+E D+ KT+ELEKFL   GLYE   EAV+REEVLG
Sbjct: 1    MGSPGLNVRNNGHLGVTEPISLSGPTEFDVVKTRELEKFLVDAGLYESQEEAVAREEVLG 60

Query: 2036 RLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHATR 1857
            RLDQIVK W+K V+R +G+N+QLV EANAKI+TFGSYRLGVHGPGADIDTLC+GP+HATR
Sbjct: 61   RLDQIVKKWIKMVSRAKGFNEQLVLEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATR 120

Query: 1856 EEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDIS 1677
            EEDFF+EL+ ML EMPEVSELHPVPDAHVPVM+FKF GVSIDLLYAKLSL +IPEDLDIS
Sbjct: 121  EEDFFIELHNMLAEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDIS 180

Query: 1676 QDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFLG 1497
            QD +LQN D+QTVRSLNGCRVTD+IL LVPN+QNFRTTLRCMRFWAKRRGVYSNV+GFLG
Sbjct: 181  QDMVLQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVSGFLG 240

Query: 1496 GINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRRN 1323
            GINWALLVARICQLYPNALPSMLVSRFFRV+TQWRWPNPVMLCAIEEG+  LP+WDPR+N
Sbjct: 241  GINWALLVARICQLYPNALPSMLVSRFFRVFTQWRWPNPVMLCAIEEGTLGLPVWDPRKN 300

Query: 1322 HRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQY 1143
            +RDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRG+EICEAME NKADWN LFE  
Sbjct: 301  YRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGNEICEAMEKNKADWNTLFEPC 360

Query: 1142 PFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDT 963
             FF+AYKNYLQIEISAENDD LR WKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSD 
Sbjct: 361  RFFEAYKNYLQIEISAENDDHLRKWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDK 420

Query: 962  SKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRRN 783
            S+  HCCYFMGL+  QGV + EG+QFDIRATVE+FK SVG+YTLWKPGMEIYVSHI+RRN
Sbjct: 421  SRLFHCCYFMGLRLKQGVSMQEGKQFDIRATVEDFKHSVGLYTLWKPGMEIYVSHIKRRN 480

Query: 782  IPSFVFPGGVRPSRSPKVAGEGRPVSN-KRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXX 606
            IP FVFP GVRPSRS K A EG+  SN K                 +D  +  RK     
Sbjct: 481  IPLFVFPDGVRPSRSAKEAWEGKSASNPKLCNSISAEESCEIATGSMDGTDDIRKRKLSD 540

Query: 605  DESATNIKSLPSIGAAANG-RILEDSGDCSASFTRPSGSITTTTSTAMSVGNMVDALILQ 429
            D    N +S   + A      +L  SG  S    R     T+    A   G   D   L+
Sbjct: 541  DNGENNPRSTKFLAATNTAYGVLGGSGSGSPPIVR-----TSVREEAREGGQQEDN--LR 593

Query: 428  DGAIKTEQVEYINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSKEAGQCAT 249
              +I       I  +      E    S++ G            SA++   C+KEA + A 
Sbjct: 594  GSSINATCPTEITTDIGREAEEPARCSQSVGPP----------SANSGLSCTKEAEKLAI 643

Query: 248  DYI-SGPSAGRE--VFSELNELEDESGLVQDNGGNTMEG---SFAESSTVKQIALLATNG 87
            + I SGPS G       EL+ELED+      N    ++G          V +   +  NG
Sbjct: 644  EKIASGPSVGGHGGFPEELDELEDDF-----NSSYHVKGFGRDMPSKVLVAKAGTVEVNG 698

Query: 86   AGSSSSIGRFQNGGLEELEPAELTGSF 6
                +   +   GGLEELEPAELT  F
Sbjct: 699  VHPPTEFLQ-HGGGLEELEPAELTTPF 724


>emb|CDO98397.1| unnamed protein product [Coffea canephora]
          Length = 754

 Score =  889 bits (2298), Expect = 0.0
 Identities = 471/745 (63%), Positives = 554/745 (74%), Gaps = 14/745 (1%)
 Frame = -3

Query: 2210 MANPGVTNV---QQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVL 2040
            MA PG  N    Q+LGITEPIS +GP+E D+ KT+ELEKFL+ VGLYE   EA+SREEVL
Sbjct: 1    MAGPGFGNQSSGQRLGITEPISWSGPTEYDMIKTRELEKFLADVGLYESQEEAISREEVL 60

Query: 2039 GRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHAT 1860
            GRLDQIVKTWVK V+R +G N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HAT
Sbjct: 61   GRLDQIVKTWVKNVSRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1859 REEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDI 1680
            R++DFF EL  ML+EMPEVSELHPVPDAHVPV+KFKF G+SIDLLYAKLSL +IPEDLDI
Sbjct: 121  RDDDFFGELQRMLSEMPEVSELHPVPDAHVPVLKFKFSGISIDLLYAKLSLWVIPEDLDI 180

Query: 1679 SQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFL 1500
            SQ++ILQN D+QTVRSLNGCRVTD+IL LVPN+QNFRTTLRCMR+WAKRRGVYSNVAGFL
Sbjct: 181  SQESILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRYWAKRRGVYSNVAGFL 240

Query: 1499 GGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRR 1326
            GGINWALLVARICQLYPNALP+MLVSRFFRVYTQWRWPNPVMLC IE+GS  LP+WDPRR
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCEIEDGSLGLPVWDPRR 300

Query: 1325 NHRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQ 1146
            N +DR H MPIITPAYPCMNSSYNVSSSTLR+MT EFQRG+EICEAM+ NK +W+ LFE 
Sbjct: 301  NPKDRFHLMPIITPAYPCMNSSYNVSSSTLRIMTNEFQRGNEICEAMDANKCNWDKLFEL 360

Query: 1145 YPFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSD 966
            YPFF+AYKNYLQI+++A N  DL NWKGWVESRLRQLTLKIERHT  MLQCHPHPGDFSD
Sbjct: 361  YPFFEAYKNYLQIDVTAANAADLMNWKGWVESRLRQLTLKIERHTLNMLQCHPHPGDFSD 420

Query: 965  TSKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRR 786
             S+  +CCYFMGLQR QGV  +EGEQFDIR TVEEFK +VGMY  WKPGMEI+V H++RR
Sbjct: 421  KSRPFYCCYFMGLQRKQGVAANEGEQFDIRLTVEEFKHAVGMYNTWKPGMEIHVCHVKRR 480

Query: 785  NIPSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXX 606
            +IP+FVFPGGVRP R  KVAGEGR  S  +                  A+  G K     
Sbjct: 481  SIPAFVFPGGVRP-RPTKVAGEGRRPSQTK------VSSHTEDSSFPKALNGGSKRKRDD 533

Query: 605  DESATNIKSLPSIGAAANGRILEDSGDCSASFTRPSGSITTTTSTAMSVGNMVDALILQD 426
             ++AT++ +    G   +G ++ +         RPSG I T+      +GN   A +   
Sbjct: 534  TDTATSLNAKRIAGVGESGELVHEG--------RPSGCIGTS-----YLGN---ASLETP 577

Query: 425  GAIKTEQVE--YINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSKEAGQCA 252
            G I  E+VE    N   NP+     S S+N G   + L +  +  AD+ S  SKEA + A
Sbjct: 578  GKIFNEKVEDNMGNGLENPICLPQAS-SQNGGELDASLRLDPSTPADSISLSSKEAEKLA 636

Query: 251  TD-YISGPSAGREVF-SELNELEDESGLVQDN--GGNTMEGSFAESSTVKQ---IALLAT 93
             +  ++GP    + F  EL+ELED+          G +++GS  ESS  K    ++L  +
Sbjct: 637  IEKMMTGPYVAHQTFPQELDELEDDPEYKNQGKITGGSVKGSSMESSATKGSLIVSLTTS 696

Query: 92   NGAGSSSSIGRFQNGGLEELEPAEL 18
              AGS SS+    +G LEELEP EL
Sbjct: 697  TAAGSCSSLQ--SSGKLEELEPPEL 719


>ref|XP_010916255.1| PREDICTED: nuclear poly(A) polymerase 1 [Elaeis guineensis]
          Length = 768

 Score =  884 bits (2285), Expect = 0.0
 Identities = 468/734 (63%), Positives = 543/734 (73%), Gaps = 13/734 (1%)
 Frame = -3

Query: 2177 LGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVLGRLDQIVKTWVKKV 1998
            LG+TEPIS +GP+E DITKT ELEK+L+  GLYE   EAVSREEVLGRLDQIVK WVKKV
Sbjct: 14   LGVTEPISWSGPTEFDITKTHELEKYLADAGLYESQEEAVSREEVLGRLDQIVKVWVKKV 73

Query: 1997 TRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHATREEDFFVELYGMLT 1818
            +R +G+N+Q V EANAKI+TFGSYRLGVHGPGADIDTLC+GP+HATREEDFF EL+ ML 
Sbjct: 74   SRAKGFNEQFVLEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFTELHNMLA 133

Query: 1817 EMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDISQDAILQNVDDQTV 1638
            EMPEV+ELHPVPDAHVPVMKFKF GVSIDLLYAKLSL +IPEDLDISQD+ILQN D+QTV
Sbjct: 134  EMPEVTELHPVPDAHVPVMKFKFNGVSIDLLYAKLSLWVIPEDLDISQDSILQNADEQTV 193

Query: 1637 RSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQ 1458
            RSLNGCRVTD+IL LVPN+QNFRTTLRC+RFWAKRRGVYSNVAGFLGGINWALLVARICQ
Sbjct: 194  RSLNGCRVTDQILRLVPNIQNFRTTLRCLRFWAKRRGVYSNVAGFLGGINWALLVARICQ 253

Query: 1457 LYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRRNHRDRLHQMPIITP 1284
            LYP ALPSMLVSRFFRVYTQWRWPNPVMLC IEEG+  LP+WDPR+N++DRLHQMPIITP
Sbjct: 254  LYPKALPSMLVSRFFRVYTQWRWPNPVMLCDIEEGTLGLPVWDPRKNYKDRLHQMPIITP 313

Query: 1283 AYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQYPFFDAYKNYLQIE 1104
            AYPCMNSSYNVSSSTLRVMTEEFQRGHEICE ME NKADWN LF  YPFF+AYK+YL+I+
Sbjct: 314  AYPCMNSSYNVSSSTLRVMTEEFQRGHEICEEMEANKADWNKLFAPYPFFEAYKHYLEID 373

Query: 1103 ISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDTSKRCHCCYFMGLQ 924
            I+A N+DDLR WKGWVESRLR LTLKIERHTFGMLQCHPHPGDFSD S+  HCCYFMGLQ
Sbjct: 374  ITAANEDDLRKWKGWVESRLRTLTLKIERHTFGMLQCHPHPGDFSDKSRPFHCCYFMGLQ 433

Query: 923  RNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRRNIPSFVFPGGVRPS 744
            R QGVPV+EGEQFDIR TVEEFK SVGMYTLWKPGMEI VSHI+RRN+PSFVFPGG RPS
Sbjct: 434  RKQGVPVNEGEQFDIRVTVEEFKHSVGMYTLWKPGMEIQVSHIKRRNVPSFVFPGGTRPS 493

Query: 743  RSPKVAG-EGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXXDESATNIKSLPSI 567
            R  K AG EG  +S  +                      G       D+S+T  K L + 
Sbjct: 494  RPLKAAGSEGHTISKTKSSSLVLAGKPSDTL-------SGSCDTHMADDSSTR-KQL-AA 544

Query: 566  GAAANGRILEDSGDCS-ASFTRPSGSITTTTSTAMSVGNMV-DALILQDGAIKTEQVEYI 393
            G     ++++ S  CS  + T  S S   T     S  N+V +A  + +  +++ + +++
Sbjct: 545  GTPVGDQVVQGSERCSPITMTSSSASSLCTKEAEGSAINLVGNANGILNVTVESRKRKHV 604

Query: 392  NE------EANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSKEAGQCATDYIS-- 237
             +      +A  L +    L E+ G   S +  +       AS CSKEA   A   I+  
Sbjct: 605  EDTDSNSIDAQRLAAHSAKLPESVGMAGSGIIAAVGPGNCTASLCSKEAEALAIKKITSG 664

Query: 236  GPSAGREVFSELNELEDESGLVQDNGGNTMEGSFAESSTVKQIALLATNGAGSSSSIGRF 57
             P+    +   L+ELE      QD   + + G  +  S+  + A +       SS     
Sbjct: 665  SPTNLASLPEGLDELELFEPQGQDKDFDGVAGGCSVVSSAAKDAPMQVGKLHDSS----- 719

Query: 56   QNGGLEELEPAELT 15
            +N G+EELEPAEL+
Sbjct: 720  KNEGIEELEPAELS 733


>gb|KJB37194.1| hypothetical protein B456_006G193600 [Gossypium raimondii]
          Length = 793

 Score =  879 bits (2272), Expect = 0.0
 Identities = 473/774 (61%), Positives = 542/774 (70%), Gaps = 42/774 (5%)
 Frame = -3

Query: 2210 MANPGV---TNVQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVL 2040
            M +PG+    + Q+LGITEPISL GP+E D+ KT+ELEK+L  VGLYE   EAVSREEVL
Sbjct: 1    MGSPGLGTGNSGQRLGITEPISLGGPTEYDVIKTRELEKYLQNVGLYESQEEAVSREEVL 60

Query: 2039 GRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHAT 1860
            GRLDQIVK WVK ++R +G N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HAT
Sbjct: 61   GRLDQIVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1859 REEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDI 1680
            REEDFF EL+ ML+EMPEVSELHPVPDAHVP+MKFKF GVSIDLLYAKLSL +IPEDLDI
Sbjct: 121  REEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1679 SQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQ-------------------------- 1578
            SQD+ILQN DDQTVRSLNGCRVTD+IL LVPN+Q                          
Sbjct: 181  SQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQVVATTLYSVHYICFFPPEILDFDYVL 240

Query: 1577 -----NFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQLYPNALPSMLVSRFF 1413
                 NFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQLYPNALP+MLVSRFF
Sbjct: 241  LLDLQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQLYPNALPNMLVSRFF 300

Query: 1412 RVYTQWRWPNPVMLCAIEEGSL--PIWDPRRNHRDRLHQMPIITPAYPCMNSSYNVSSST 1239
            RVYTQWRWPNPVMLCAI+EGSL   +WDPR+N +DR H MPIITPAYP MNSSYNVSSST
Sbjct: 301  RVYTQWRWPNPVMLCAIKEGSLGLQVWDPRKNPKDRYHLMPIITPAYPSMNSSYNVSSST 360

Query: 1238 LRVMTEEFQRGHEICEAMELNKADWNNLFEQYPFFDAYKNYLQIEISAENDDDLRNWKGW 1059
            LR+MT+EFQRG EICEAME NKADW+ LFE Y FF+AYKNYLQI+ISAENDDDLRNWKGW
Sbjct: 361  LRIMTDEFQRGSEICEAMEANKADWDALFEAYAFFEAYKNYLQIDISAENDDDLRNWKGW 420

Query: 1058 VESRLRQLTLKIERHTFGMLQCHPHPGDFSDTSKRCHCCYFMGLQRNQGVPVHEGEQFDI 879
            VESRLRQLTLKIERHT+ MLQCHPHPGDF D S+  HC YFMGLQR  GVPV+EGEQFDI
Sbjct: 421  VESRLRQLTLKIERHTYNMLQCHPHPGDFQDNSRPFHCSYFMGLQRKLGVPVNEGEQFDI 480

Query: 878  RATVEEFKLSVGMYTLWKPGMEIYVSHIRRRNIPSFVFPGGVRPSRSPKVAGEGRPVSNK 699
            R TVEEFK SV  YTLWKPGMEI VSH++RR+IPSFVFPGGVRPSR  K   + R  S+ 
Sbjct: 481  RLTVEEFKHSVNTYTLWKPGMEIRVSHVKRRSIPSFVFPGGVRPSRPSKATWDSRRASDA 540

Query: 698  RXXXXXXXXXXXXXXXXVDAVEKGRKXXXXXDESATNIKSLPSIGAAANGRILEDSGDCS 519
            +                 D    G+K     D + T +K+   I A             S
Sbjct: 541  KVSGHAGSDKPGEVKGAADGQVDGKKRKRADDSADTQLKNSKYITAVP-----------S 589

Query: 518  ASFTRPSGSITTTTSTAMSVGNMVDALILQDGAIKTEQVEYINEEANPLLSEVPSLSENS 339
            +S    +GS   T S     G+ VDA  L +     ++    N        E+ SL+   
Sbjct: 590  SSAEVQAGSPGGTVSPCSLKGDNVDATGLVEPTRGKDESNMTNGSKTSSTDELSSLNSEV 649

Query: 338  GSTVSLLPISRTLSADAASFCSKEAGQCATDYI-SGPSAGREVF-SELNELEDESGLVQD 165
              ++  +P    L   A +  SKEA + A + I SGP    + F  E  ELED+      
Sbjct: 650  DGSLRCIPPHTGLHVTADASSSKEAEKLAIEQIMSGPYVSHQAFPEEPEELEDDLEFRNR 709

Query: 164  --NGGNTMEGSFAE--SSTVKQIALLATNGAGSSSSIGRFQNGGLEELEPAELT 15
              + GNT  G      S       ++++NGAG S S+    +G +EELEPAELT
Sbjct: 710  VVSVGNTNNGPLQAPVSDAAGAAPIISSNGAGPSISL--HASGSIEELEPAELT 761


>ref|XP_008775850.1| PREDICTED: poly(A) polymerase PAPalpha [Phoenix dactylifera]
          Length = 767

 Score =  879 bits (2271), Expect = 0.0
 Identities = 471/750 (62%), Positives = 538/750 (71%), Gaps = 18/750 (2%)
 Frame = -3

Query: 2210 MANPGVTNVQQ--LGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVLG 2037
            MA+ G++      LG+TEPIS +GP+E DITKTQELEK+L+  GLYE    AVSREEVLG
Sbjct: 1    MASSGLSKRSNGYLGVTEPISWSGPTEFDITKTQELEKYLADAGLYESQEGAVSREEVLG 60

Query: 2036 RLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHATR 1857
            RLDQIVK WV+KV+R +G+N+Q VQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HATR
Sbjct: 61   RLDQIVKVWVRKVSRAKGFNEQFVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATR 120

Query: 1856 EEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDIS 1677
            EEDFF EL+ ML EMPEV+ELHPVPDAHVPVMKFKF GVSIDLLYAKLSL +IPEDLDIS
Sbjct: 121  EEDFFTELHNMLAEMPEVTELHPVPDAHVPVMKFKFNGVSIDLLYAKLSLWVIPEDLDIS 180

Query: 1676 QDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFLG 1497
            QD+ILQN D+QTVRSLNGCRVTD+IL LVPN+QNFRTTLRCMRFWAKRRGVYSNVAGFLG
Sbjct: 181  QDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLG 240

Query: 1496 GINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRRN 1323
            GINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLC IEEG+  L +WDPR+N
Sbjct: 241  GINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCDIEEGTLGLSVWDPRKN 300

Query: 1322 HRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQY 1143
             +DRLHQMPIITPAYP MNSSYNVSSSTLRVMT+EFQRGH ICE ME NKADW+ LFE Y
Sbjct: 301  FKDRLHQMPIITPAYPSMNSSYNVSSSTLRVMTDEFQRGHVICEEMEANKADWSKLFEPY 360

Query: 1142 PFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDT 963
            PFF+AYK+YL+I+I+A N+DDLR WKGWVESRLR LTLKIERHTFGMLQCHPHPGDFSD 
Sbjct: 361  PFFEAYKHYLEIDITAANEDDLRKWKGWVESRLRTLTLKIERHTFGMLQCHPHPGDFSDK 420

Query: 962  SKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRRN 783
            S+  HCCYFMGLQR QGVPV+EGEQFDIR TVE+FK SVGMYTLWKPGMEI VSHI+RRN
Sbjct: 421  SRLFHCCYFMGLQRKQGVPVNEGEQFDIRVTVEDFKHSVGMYTLWKPGMEIQVSHIKRRN 480

Query: 782  IPSFVFPGGVRPSRSPKVA-GEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXX 606
            +PSFVFP G+RPSR PK A  EG  VS  +                 D +          
Sbjct: 481  VPSFVFPSGIRPSRPPKAAVSEGHTVSKIK------SSSSVQAGKPSDTLAGSGDTTTHM 534

Query: 605  DESATNIKSLPSIGAAANGRILEDSGDCS-ASFTRPSGSITTTTSTAMSVGNMV------ 447
             E ++  K L + G     +++E S  CS  + T  S S   T     S  N+V      
Sbjct: 535  AEDSSTTKLL-AAGILIGDQVVEGSERCSPITMTSSSASSLCTKEAEGSAINLVGNANGI 593

Query: 446  -DALILQDGAIKTEQVEYINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSK 270
             +  I        E  +  + +A  L S  P   E+ G   S +  +       +S CSK
Sbjct: 594  LNVTIESRKRKHEEDTDSNSIDAKRLASAKP--PESVGMAASGIIAAEDPGNCTSSLCSK 651

Query: 269  EAGQCATDYISGPSAGREVFSELNELEDESGLVQDNG-----GNTMEGSFAESSTVKQIA 105
            EA   A   I+  S      + L E  DE  L + +G     G    G    SS  K   
Sbjct: 652  EAEALAIKKITSGSPTN--LASLPEGLDELELFELHGQDKDFGGVASGCSVVSSAAKDAP 709

Query: 104  LLATNGAGSSSSIGRFQNGGLEELEPAELT 15
            +       SS      +NGG+EELEPAEL+
Sbjct: 710  MQVGKLHDSS------KNGGIEELEPAELS 733


>ref|XP_009387260.1| PREDICTED: poly(A) polymerase PAPalpha isoform X1 [Musa acuminata
            subsp. malaccensis] gi|695079670|ref|XP_009387261.1|
            PREDICTED: poly(A) polymerase PAPalpha isoform X1 [Musa
            acuminata subsp. malaccensis]
          Length = 773

 Score =  877 bits (2265), Expect = 0.0
 Identities = 467/747 (62%), Positives = 547/747 (73%), Gaps = 31/747 (4%)
 Frame = -3

Query: 2177 LGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVLGRLDQIVKTWVKKV 1998
            LG+TEPIS +GP+E D+ KTQELEK+L+  GLYE   EAVSREE+LGRLDQIVK WVKKV
Sbjct: 14   LGVTEPISWSGPTEYDVIKTQELEKYLADAGLYESQEEAVSREEILGRLDQIVKIWVKKV 73

Query: 1997 TRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHATREEDFFVELYGMLT 1818
            +R +G+N+Q VQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HATREEDFF EL+ ML+
Sbjct: 74   SRAKGFNEQFVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFTELHNMLS 133

Query: 1817 EMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDISQDAILQNVDDQTV 1638
            EMPEV+ELHPVPDAHVPVM+FKF GVSIDLLYAKLSL +IPEDLDISQD+ILQN D+QTV
Sbjct: 134  EMPEVTELHPVPDAHVPVMRFKFSGVSIDLLYAKLSLWVIPEDLDISQDSILQNADEQTV 193

Query: 1637 RSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQ 1458
            RSLNGCRVTD+IL LVPN+QNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQ
Sbjct: 194  RSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQ 253

Query: 1457 LYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRRNHRDRLHQMPIITP 1284
            LYPNALPSMLVSRFFRVYTQWRWPNPVMLC I+EG+  LPIWDPRRN RDRLHQMPIITP
Sbjct: 254  LYPNALPSMLVSRFFRVYTQWRWPNPVMLCEIQEGTLGLPIWDPRRNFRDRLHQMPIITP 313

Query: 1283 AYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQYPFFDAYKNYLQIE 1104
            AYPCMNSSYNVSSSTLRVMTEEFQRG+EICEAME NKADW+ LFE YPFF+AYKNYL+I+
Sbjct: 314  AYPCMNSSYNVSSSTLRVMTEEFQRGNEICEAMEANKADWDTLFEPYPFFEAYKNYLEID 373

Query: 1103 ISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDTSKRCHCCYFMGLQ 924
            I+A+N+ DLR WKGWVESRLR LTLKIERHTFGML CHP P DFSD S+  HCCYFMGLQ
Sbjct: 374  ITADNESDLRKWKGWVESRLRTLTLKIERHTFGMLHCHPCPRDFSDKSRPFHCCYFMGLQ 433

Query: 923  RNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRRNIPSFVFPGGVRPS 744
            R QGVPV E EQFDIR TV++FK SV MYTLWKPGMEI VSH +RRN+P FVFPGGVRPS
Sbjct: 434  RKQGVPVQESEQFDIRGTVDDFKNSVSMYTLWKPGMEIQVSHRKRRNVPLFVFPGGVRPS 493

Query: 743  RSPKVAG-EGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRK--XXXXXDESATNIKSLP 573
            R PKVAG +G  VS ++                 D V  G+         +++T+ K + 
Sbjct: 494  RPPKVAGVDGHAVSGRK---------------VSDMVHAGKPAGNVSHVADASTDRKQME 538

Query: 572  SIGAAANGRILEDSGDCSASFTRPSGSI-TTTTSTAMSVGNMVDALILQD-----GAIKT 411
              GA+ +  I+E     S+S +R    +   T S A ++ N+VD ++         +   
Sbjct: 539  GKGASCD-PIVE-----SSSESRKGKQLDNRTDSNAANMNNLVDHILKPSEMGTPSSFAN 592

Query: 410  EQVEYINEEANPLLSEVPSLSENSGS-----------------TVSLLPISRTLSADAAS 282
              ++  +E       +V + S  +GS                   S+ P++   + ++  
Sbjct: 593  GVLDVPDESRKRKCMDVTTDSFATGSEFQADHSFKRPETSAAIAASVGPVTEVDNGESI- 651

Query: 281  FCSKEAGQCATDYISG--PSAGREVFSELNELEDESGLVQDNG-GNTMEGSFAESSTVKQ 111
            FCSKEA   A   I+   PS    +   L+ELE       D G G  + G   ESSTVK 
Sbjct: 652  FCSKEAETLAISKITSVPPSNLAALPEGLDELEYFESQGHDKGFGGPVGGHSVESSTVKD 711

Query: 110  IALLATNGAGSSSSIGRFQNGGLEELE 30
                  +  GS++     +NGG+EELE
Sbjct: 712  AITQLGSSYGSNT-----KNGGVEELE 733


>ref|XP_011627554.1| PREDICTED: nuclear poly(A) polymerase 1 [Amborella trichopoda]
            gi|769798422|ref|XP_011627555.1| PREDICTED: nuclear
            poly(A) polymerase 1 [Amborella trichopoda]
          Length = 533

 Score =  874 bits (2257), Expect = 0.0
 Identities = 423/527 (80%), Positives = 461/527 (87%), Gaps = 3/527 (0%)
 Frame = -3

Query: 2198 GVTNVQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVLGRLDQIV 2019
            G T +  LG+TEPISL GPSE D+ KTQELEKFL G GLYE   E+VSREEVLGRLDQIV
Sbjct: 3    GATRI--LGVTEPISLGGPSEFDVLKTQELEKFLEGAGLYESQEESVSREEVLGRLDQIV 60

Query: 2018 KTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHATREEDFFV 1839
            K W+KKV+R +G+N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HATREEDFF 
Sbjct: 61   KVWIKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFQ 120

Query: 1838 ELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDISQDAILQ 1659
            ELY ML EMPEV+ELHPVPDAHVPVMKFKF GVSIDLLYAKLSL IIPEDLDISQD+ILQ
Sbjct: 121  ELYAMLVEMPEVTELHPVPDAHVPVMKFKFNGVSIDLLYAKLSLWIIPEDLDISQDSILQ 180

Query: 1658 NVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWAL 1479
            N D+QTVRSLNGCRVTD+IL LVPN+Q+FRTTLRCMRFWAKRRGVYSNVAGFLGGINWAL
Sbjct: 181  NADEQTVRSLNGCRVTDQILRLVPNIQSFRTTLRCMRFWAKRRGVYSNVAGFLGGINWAL 240

Query: 1478 LVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRRNHRDRLH 1305
            LVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEG+  LP+WDPR+N RD+LH
Sbjct: 241  LVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGTLGLPVWDPRKNPRDKLH 300

Query: 1304 QMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQYPFFDAY 1125
            QMPIITPAYPCMNSSYNVSSSTLRVM EEFQRG+EICEAME+NK DW+ LFE YPFF+AY
Sbjct: 301  QMPIITPAYPCMNSSYNVSSSTLRVMMEEFQRGNEICEAMEINKCDWSTLFEPYPFFEAY 360

Query: 1124 KNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDTSKRCHC 945
            KNYL+I+++AEN+DDLR WKGWVESRLRQLTLKIER TF MLQCHPHP DFSD S+  HC
Sbjct: 361  KNYLEIDVTAENEDDLRKWKGWVESRLRQLTLKIERDTFRMLQCHPHPNDFSDKSRTFHC 420

Query: 944  CYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRRNIPSFVF 765
            CYFMGLQR +GVP+ EGEQFDIRATVEEFK SVGMYTLWKPGM+I VSHIRRRN+P FVF
Sbjct: 421  CYFMGLQRKKGVPILEGEQFDIRATVEEFKHSVGMYTLWKPGMDIQVSHIRRRNVPHFVF 480

Query: 764  PGGVRPSRSPKVA-GEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKG 627
            PGGVRPSR  K A GE + V +KR                 D VE G
Sbjct: 481  PGGVRPSRPLKTAGGEVKKVGSKRKAPDLAQGDKSSVGVERDEVENG 527


>ref|XP_011009627.1| PREDICTED: nuclear poly(A) polymerase 1 [Populus euphratica]
          Length = 776

 Score =  872 bits (2254), Expect = 0.0
 Identities = 467/755 (61%), Positives = 559/755 (74%), Gaps = 24/755 (3%)
 Frame = -3

Query: 2210 MANPGVTN------VQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSRE 2049
            M +PG+ N       Q+LGITEPISL GP+E D+TKT+ELEKFL   GLYE   EAVSRE
Sbjct: 1    MGSPGLINRNNGQQQQRLGITEPISLGGPTEYDVTKTRELEKFLQDAGLYESQEEAVSRE 60

Query: 2048 EVLGRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPK 1869
            EVLGRLDQIVK WVK ++R +G N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+
Sbjct: 61   EVLGRLDQIVKNWVKVISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPR 120

Query: 1868 HATREEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPED 1689
            HATREEDFF EL+ ML+EMPEV+ELHPVPDAHVPVM+FKF GVSIDLLYAKLSL +IPED
Sbjct: 121  HATREEDFFGELHRMLSEMPEVTELHPVPDAHVPVMRFKFKGVSIDLLYAKLSLWVIPED 180

Query: 1688 LDISQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVA 1509
            LD+SQD++L N D+QTVRSLNGCRVTD+IL LVPN+QNFRTTLRCMRFWAKRRGVYSNV+
Sbjct: 181  LDVSQDSMLHNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVS 240

Query: 1508 GFLGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWD 1335
            GFLGGINWALL ARICQL+PNALP+MLVSRFFRVYTQWRWPNPVMLCAIEEGS  LP+WD
Sbjct: 241  GFLGGINWALLAARICQLFPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLPVWD 300

Query: 1334 PRRNHRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNL 1155
            PRRN +DR H MPIITPAYP MNSSYNVSSSTLR+MTEEFQRG+EICEAME++KA+W+ L
Sbjct: 301  PRRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGNEICEAMEVSKAEWDTL 360

Query: 1154 FEQYPFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGD 975
            FE + FF+AYKNYLQI+ISAEN+DDLR WKGWVESRLRQLTLKIERHT+ MLQCHPHPG+
Sbjct: 361  FEPFSFFEAYKNYLQIDISAENEDDLRQWKGWVESRLRQLTLKIERHTYNMLQCHPHPGE 420

Query: 974  FSDTSKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHI 795
            FSD S+  HC YFMGLQR QGVPV+EGEQFDIR TV+EFK SV MYT  KPGMEI+V+H+
Sbjct: 421  FSDKSRPLHCSYFMGLQRKQGVPVNEGEQFDIRITVDEFKHSVKMYTSRKPGMEIHVTHV 480

Query: 794  RRRNIPSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXX 615
            +RRNIP+FVFP GVRPSR  K   +GR  S++                 +D  ++G+K  
Sbjct: 481  KRRNIPNFVFPNGVRPSRPSKATWDGRR-SSEAKVANNSSADKIEGKGVLDGSDEGKKRK 539

Query: 614  XXXDESATNIKSLPSIGA--AANGRILEDSGDCSASFTRPSGSITT-TTSTAMSVGNMVD 444
               D++  N+++     A   ++G +LE S         P G++++ +T + + + N + 
Sbjct: 540  RIDDDTENNLRNPKGYAAMPPSSGEVLEGS--------PPVGNVSSCSTQSDLVITNSLG 591

Query: 443  ALILQDGAIKTEQVEYINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSKEA 264
               L+       + E +N   N L        E  G     LP  + L A+  +  SKEA
Sbjct: 592  E--LKGEKADNNETESLNNSQN-LAGIFAQNGELDGILRCNLP-GKGLPANNNTSSSKEA 647

Query: 263  GQCATDYI-SGPSAGREVF-SELNELEDESGLVQDNGGN--TMEGSFAESS--------T 120
             + A D I SGP    +    EL+ELED+        G+    +GS  ESS        T
Sbjct: 648  EKLAIDKIMSGPYVAHQALPQELDELEDDFVYTNQGKGSEWAAKGSPVESSLSNTAAELT 707

Query: 119  VKQIALLA-TNGAGSSSSIGRFQNGGLEELEPAEL 18
             + IA +A +NGAG S+ +  + NGG +ELE AEL
Sbjct: 708  NESIAAVACSNGAGPSAYL--YPNGGSDELEXAEL 740


>gb|ERN17250.1| hypothetical protein AMTR_s00044p00207970 [Amborella trichopoda]
          Length = 649

 Score =  870 bits (2248), Expect = 0.0
 Identities = 415/494 (84%), Positives = 451/494 (91%), Gaps = 2/494 (0%)
 Frame = -3

Query: 2198 GVTNVQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVLGRLDQIV 2019
            G T +  LG+TEPISL GPSE D+ KTQELEKFL G GLYE   E+VSREEVLGRLDQIV
Sbjct: 3    GATRI--LGVTEPISLGGPSEFDVLKTQELEKFLEGAGLYESQEESVSREEVLGRLDQIV 60

Query: 2018 KTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHATREEDFFV 1839
            K W+KKV+R +G+N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HATREEDFF 
Sbjct: 61   KVWIKKVSRAKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFQ 120

Query: 1838 ELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDISQDAILQ 1659
            ELY ML EMPEV+ELHPVPDAHVPVMKFKF GVSIDLLYAKLSL IIPEDLDISQD+ILQ
Sbjct: 121  ELYAMLVEMPEVTELHPVPDAHVPVMKFKFNGVSIDLLYAKLSLWIIPEDLDISQDSILQ 180

Query: 1658 NVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWAL 1479
            N D+QTVRSLNGCRVTD+IL LVPN+Q+FRTTLRCMRFWAKRRGVYSNVAGFLGGINWAL
Sbjct: 181  NADEQTVRSLNGCRVTDQILRLVPNIQSFRTTLRCMRFWAKRRGVYSNVAGFLGGINWAL 240

Query: 1478 LVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRRNHRDRLH 1305
            LVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEG+  LP+WDPR+N RD+LH
Sbjct: 241  LVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGTLGLPVWDPRKNPRDKLH 300

Query: 1304 QMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQYPFFDAY 1125
            QMPIITPAYPCMNSSYNVSSSTLRVM EEFQRG+EICEAME+NK DW+ LFE YPFF+AY
Sbjct: 301  QMPIITPAYPCMNSSYNVSSSTLRVMMEEFQRGNEICEAMEINKCDWSTLFEPYPFFEAY 360

Query: 1124 KNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDTSKRCHC 945
            KNYL+I+++AEN+DDLR WKGWVESRLRQLTLKIER TF MLQCHPHP DFSD S+  HC
Sbjct: 361  KNYLEIDVTAENEDDLRKWKGWVESRLRQLTLKIERDTFRMLQCHPHPNDFSDKSRTFHC 420

Query: 944  CYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRRNIPSFVF 765
            CYFMGLQR +GVP+ EGEQFDIRATVEEFK SVGMYTLWKPGM+I VSHIRRRN+P FVF
Sbjct: 421  CYFMGLQRKKGVPILEGEQFDIRATVEEFKHSVGMYTLWKPGMDIQVSHIRRRNVPHFVF 480

Query: 764  PGGVRPSRSPKVAG 723
            PGGVRPSR  K AG
Sbjct: 481  PGGVRPSRPLKTAG 494


>ref|XP_002322074.2| hypothetical protein POPTR_0015s04100g [Populus trichocarpa]
            gi|550321905|gb|EEF06201.2| hypothetical protein
            POPTR_0015s04100g [Populus trichocarpa]
          Length = 780

 Score =  869 bits (2246), Expect = 0.0
 Identities = 461/751 (61%), Positives = 556/751 (74%), Gaps = 22/751 (2%)
 Frame = -3

Query: 2204 NPGVTNVQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVLGRLDQ 2025
            N G    Q+LGITEPISL GP+E D+TKT+ELEKFL   GLYE   EAVSREEVLGRLDQ
Sbjct: 10   NNGQQQQQRLGITEPISLGGPTEYDVTKTRELEKFLQDAGLYESQEEAVSREEVLGRLDQ 69

Query: 2024 IVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHATREEDF 1845
            IVK WVK ++R +  N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HATREEDF
Sbjct: 70   IVKNWVKVISRAKRLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDF 129

Query: 1844 FVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDISQDAI 1665
            F EL+ ML+EMPEV+ELHPVPDAHVPVM+FKF GVSIDLLYAKLSL +IPEDLD+SQD++
Sbjct: 130  FGELHRMLSEMPEVTELHPVPDAHVPVMRFKFKGVSIDLLYAKLSLWVIPEDLDVSQDSM 189

Query: 1664 LQNVDDQTVRSLNGCRVTDKILHLVPNVQ---NFRTTLRCMRFWAKRRGVYSNVAGFLGG 1494
            L N D+QTVRSLNGCRVTD+IL LVPN+Q   NFRTTLRCMRFWAKRRGVYSNV+GFLGG
Sbjct: 190  LHNADEQTVRSLNGCRVTDQILRLVPNIQAMQNFRTTLRCMRFWAKRRGVYSNVSGFLGG 249

Query: 1493 INWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGSL--PIWDPRRNH 1320
            INWALLVARICQL+PNALP+MLVSRFFRVYTQWRWPNPVMLCAIEEGSL   +WDPRRN 
Sbjct: 250  INWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLSVWDPRRNP 309

Query: 1319 RDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQYP 1140
            +DR H MPIITPAYP MNSSYNVSSSTLR+MTEEFQRG+EICEAME++KA+W+ LFE + 
Sbjct: 310  KDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGNEICEAMEVSKAEWDTLFEPFS 369

Query: 1139 FFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDTS 960
            FF+AYKNYLQI+ISAEN+DDLR WKGWVESRLRQLTLKIERHT+ MLQCHPHPG+FSD S
Sbjct: 370  FFEAYKNYLQIDISAENEDDLRQWKGWVESRLRQLTLKIERHTYNMLQCHPHPGEFSDKS 429

Query: 959  KRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRRNI 780
            +  HC YFMGLQR QGVPV+EGEQFDIR TV+EFK SV MYTLWKPGMEI V+H+++RNI
Sbjct: 430  RPLHCSYFMGLQRKQGVPVNEGEQFDIRITVDEFKNSVNMYTLWKPGMEIRVTHVKKRNI 489

Query: 779  PSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXXDE 600
            P+FVFP GVRPSR  K   +GR  S++                 +D  ++G+K     ++
Sbjct: 490  PNFVFPSGVRPSRPSKATWDGRR-SSEAKVANNSSADKIEGKGVLDGSDEGKKRKRIDED 548

Query: 599  SATNIKSLPSIGAAANGRILEDSGDCSASFTRPSGSITT-TTSTAMSVGNMVDALILQDG 423
            +  N+++     A      +  SG      + P G++++ +T + + + N +       G
Sbjct: 549  TENNLRNPKGYAA------MPPSGGEVHEGSPPVGNVSSCSTQSDLVITNSL-------G 595

Query: 422  AIKTEQVEYINEEANPLLSEVPSLSENSGSTVSLLPIS---RTLSADAASFCSKEAGQCA 252
             +K E+ +    E+      +  +   +G    +L  +   + L A+  +  SKEA + A
Sbjct: 596  ELKGEKADNNETESLSNSQNLAGIFAQNGELDGILRCNLPDKGLPANNDTSSSKEAEKLA 655

Query: 251  TDYI-SGPSAGREVF-SELNELEDESGLVQDNGGN--TMEGSFAESS--------TVKQI 108
             D I SGP    +    EL+ELED+        G+    +GS  ESS        T + I
Sbjct: 656  IDKIMSGPYVAHQALPQELDELEDDFVYTNQGKGSEWAAKGSPVESSLSNTAVEQTNESI 715

Query: 107  ALLA-TNGAGSSSSIGRFQNGGLEELEPAEL 18
            A +A +NGAG S+ +  + NGG EELEPAEL
Sbjct: 716  AAVACSNGAGPSAYL--YPNGGSEELEPAEL 744


>ref|XP_010110105.1| Poly(A) polymerase [Morus notabilis] gi|587938462|gb|EXC25192.1|
            Poly(A) polymerase [Morus notabilis]
          Length = 838

 Score =  868 bits (2242), Expect = 0.0
 Identities = 454/729 (62%), Positives = 540/729 (74%), Gaps = 7/729 (0%)
 Frame = -3

Query: 2183 QQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVLGRLDQIVKTWVK 2004
            ++LGITEPISL GP+E D+ K+QELEK+L   GLYE   EAVSREEVLGRLDQIVK WVK
Sbjct: 39   KRLGITEPISLGGPTEYDVMKSQELEKYLQDAGLYESQEEAVSREEVLGRLDQIVKLWVK 98

Query: 2003 KVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHATREEDFFVELYGM 1824
             ++R +G N+QLVQEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HATREEDFF EL+ M
Sbjct: 99   TISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFGELHRM 158

Query: 1823 LTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDISQDAILQNVDDQ 1644
            L EMPEV+E+HPVPDAHVPV++FKF GVSIDLLYAKLSL +IPEDLDISQD+ILQN D+Q
Sbjct: 159  LVEMPEVTEVHPVPDAHVPVLRFKFNGVSIDLLYAKLSLWVIPEDLDISQDSILQNADEQ 218

Query: 1643 TVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARI 1464
            TVRSLNGCRVTD+IL LVPN+QNFRTTLRCMR WAKRRGVYSNV+GFLGGINWALLVARI
Sbjct: 219  TVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRLWAKRRGVYSNVSGFLGGINWALLVARI 278

Query: 1463 CQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRRNHRDRLHQMPII 1290
            CQLYPNALP+MLVSRFFRVYTQWRWPNPVMLCAIEEGS  L +WDPRRN +DR H MPII
Sbjct: 279  CQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRRNPKDRYHLMPII 338

Query: 1289 TPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQYPFFDAYKNYLQ 1110
            TPAYPCMNSSYNVS+STLR+M+EEFQRG EICEAME +KADW+ LFE YPFF+AYKNYLQ
Sbjct: 339  TPAYPCMNSSYNVSASTLRIMSEEFQRGREICEAMETDKADWDTLFEPYPFFEAYKNYLQ 398

Query: 1109 IEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDTSKRCHCCYFMG 930
            I+ISAENDDDLR WKGWVESRLRQLTLKIERHT+  LQCHPHPG+FSD SK  HC YFMG
Sbjct: 399  IDISAENDDDLRKWKGWVESRLRQLTLKIERHTYNKLQCHPHPGEFSDKSKPFHCSYFMG 458

Query: 929  LQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRRNIPSFVFPGGVR 750
            LQR QGVP +E   FDIR TVEEFK SV MY LWKPGM I+VSH++R+NIP+FVFPG VR
Sbjct: 459  LQRKQGVPANESGHFDIRLTVEEFKNSVNMYMLWKPGMLIHVSHVKRKNIPNFVFPGRVR 518

Query: 749  PSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXXDESATNIKSLPS 570
            P R  K+  + +  S  +                ++  + G K     D   ++++++  
Sbjct: 519  PGRPVKITWDMKRASELKASGLAQPDKSDESKTVLNGSDDGSKRKRVDDNVESSLRNVKP 578

Query: 569  IGAAANGRILEDSGDCSASFTRPSGSITTTTSTAMSVGNMVDALILQDGAIKTEQVEYIN 390
              A+  G +LE            S  I+T +S+++   +M    +++    K++     +
Sbjct: 579  -RASFTGEVLE-----------ASSPISTLSSSSVKFDSMDMNRLVESQREKSDNNFVDS 626

Query: 389  EEANPLLSEVPSLS-ENSGSTVSLLPISRTLSADAASFCSKEAGQCATDYI-SGPSAGRE 216
             +     +++PS + EN  S+    P      A   +  SKEA + A D I SGP    +
Sbjct: 627  FKKCENSADIPSQNGENEVSSRCSPPTKAVPVAAVDASSSKEAEKMAIDNIMSGPYDSHQ 686

Query: 215  VF-SELNELED--ESGLVQDNGGNTMEGSFAESSTVKQIALLATNGAGSSSSIGRFQNGG 45
                EL+ELED       +D  G+TM+ S  E+S   Q A   T+  G+  S G + NGG
Sbjct: 687  ALPEELDELEDFEYRNQAKDFSGSTMD-SQVETSKGNQPAAPITSNTGTGPSTGSYFNGG 745

Query: 44   LEELEPAEL 18
            LEELEPAEL
Sbjct: 746  LEELEPAEL 754


>ref|XP_007210342.1| hypothetical protein PRUPE_ppa001856mg [Prunus persica]
            gi|462406077|gb|EMJ11541.1| hypothetical protein
            PRUPE_ppa001856mg [Prunus persica]
          Length = 755

 Score =  860 bits (2222), Expect = 0.0
 Identities = 466/743 (62%), Positives = 539/743 (72%), Gaps = 12/743 (1%)
 Frame = -3

Query: 2210 MANPGVTNV---QQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVL 2040
            MA+PG++N    ++LGITEPISL GP+E D+ KT+ELEK+L    LYE   EAVSREEVL
Sbjct: 1    MASPGLSNRNNGKRLGITEPISLGGPTEYDVIKTRELEKYLQDARLYESQEEAVSREEVL 60

Query: 2039 GRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHAT 1860
            GRLDQIVK WVK ++R +G N+QLV EANAKI+TFGSYRLGVHGPGADIDTLC+GP+HAT
Sbjct: 61   GRLDQIVKIWVKTISRTKGLNEQLVHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1859 REEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDI 1680
            REEDFF EL  ML+EMPEV+ELHPVPDAHVPVMKFKF GVSIDLLYAKLSL +IPEDLDI
Sbjct: 121  REEDFFGELQRMLSEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1679 SQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFL 1500
            SQD+ILQN D+QTVRSLNGCRVTD+IL LVP++QNFRTTLRCMR WAKRRGVYSNVAGFL
Sbjct: 181  SQDSILQNADEQTVRSLNGCRVTDQILRLVPSIQNFRTTLRCMRLWAKRRGVYSNVAGFL 240

Query: 1499 GGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRR 1326
            GGINWALLVARICQLYPNALP+MLVSRFFRVYTQWRWPNPVMLCAIEEGS  L +WDPRR
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR 300

Query: 1325 NHRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFEQ 1146
            N +D+ H MPIITPAYP MNSSYNVSSSTLR+M EEFQRG+EICEAME NKADW+ LFE 
Sbjct: 301  NPKDKYHLMPIITPAYPSMNSSYNVSSSTLRIMLEEFQRGNEICEAMEANKADWDTLFES 360

Query: 1145 YPFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSD 966
            Y FF+AYKNYLQI+ISAEN DD R WKGWVESRLRQLTLKIERHT+GMLQCHPHPGDFSD
Sbjct: 361  YDFFEAYKNYLQIDISAENADDFRKWKGWVESRLRQLTLKIERHTYGMLQCHPHPGDFSD 420

Query: 965  TSKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRR 786
             S+  H  YFMGLQR QGVPV EGEQFDIRATVEEFK SV +YTL + GMEI VSH++RR
Sbjct: 421  KSRPFHSSYFMGLQRKQGVPVTEGEQFDIRATVEEFKQSVNLYTLLERGMEIRVSHVKRR 480

Query: 785  NIPSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXX 606
            NIP+FVFPG VRP R  KV    R  S  +                +D  + G+K     
Sbjct: 481  NIPNFVFPGEVRPLRLSKVTWGSRRGSELKVSGDSQPDKLCEGKTDLDGSDGGQKRKRVD 540

Query: 605  DESATNIKSLPSIGAAANGRILEDSGDCSASFTRPSG-SITTTTSTAMSVGNMVDALILQ 429
            D   TN +   S+  +        SG+  A+    S  S  +T   +M     VD  I  
Sbjct: 541  DNVETNSRYAKSLHLS--------SGEVHAASPPISNISSCSTKCESMDANKKVDDSI-- 590

Query: 428  DGAIKTEQVEYINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSKEAGQCAT 249
              A   E++E      NP  +++P  +     +    P + +L A A +  SKEA + A 
Sbjct: 591  --ADSLEKIE------NP--ADIPYQNGQIEVSSRCKPPNDSLPAAANTSSSKEAEKMAL 640

Query: 248  -DYISGPSAGREVFSELNELEDES---GLVQDNGGNTMEGSF--AESSTVKQIALLATNG 87
               ++GP    +   EL+ELED+S     V+D   N        +E S      + ++NG
Sbjct: 641  GKNMAGPYVSHQALPELDELEDDSEHGHQVKDFSRNMKSSQMEPSEESVSVSAPVNSSNG 700

Query: 86   AGSSSSIGRFQNGGLEELEPAEL 18
            AG S+      NGGLEELEPAEL
Sbjct: 701  AGPSTD---SYNGGLEELEPAEL 720


>ref|XP_012075422.1| PREDICTED: nuclear poly(A) polymerase 1-like [Jatropha curcas]
            gi|643726451|gb|KDP35158.1| hypothetical protein
            JCGZ_10692 [Jatropha curcas]
          Length = 754

 Score =  859 bits (2219), Expect = 0.0
 Identities = 465/757 (61%), Positives = 539/757 (71%), Gaps = 26/757 (3%)
 Frame = -3

Query: 2210 MANPGVT---NVQQ---LGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSRE 2049
            M +PG++   N QQ   LGITEPISL GP+E D+ KT+ELEK+L  VGLYE   EAVSRE
Sbjct: 1    MGSPGLSTQSNGQQQRRLGITEPISLGGPTEYDVIKTRELEKYLQNVGLYESQEEAVSRE 60

Query: 2048 EVLGRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPK 1869
            EVLGRLDQIVK WVK ++R +G N+QLVQEANAKI+TFGSY LGVHGPGADIDTLC+GP+
Sbjct: 61   EVLGRLDQIVKNWVKVISRAKGLNEQLVQEANAKIFTFGSYWLGVHGPGADIDTLCVGPR 120

Query: 1868 HATREEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPED 1689
            HATREEDFF EL+ ML+EMPEV+ELHPVPDAHVPVM+FKF GVSIDLLYAKLSL +IPED
Sbjct: 121  HATREEDFFCELHRMLSEMPEVTELHPVPDAHVPVMRFKFKGVSIDLLYAKLSLWVIPED 180

Query: 1688 LDISQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVA 1509
            LDISQD+ILQN D+QTVRSLNGCRVTD+IL LVPN+QNFRTTLRCMRFWAKRRGVYSNV+
Sbjct: 181  LDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVS 240

Query: 1508 GFLGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWD 1335
            GFLGGINWALLVARICQLYPNALP+MLVSRFFRVYTQWRWPNPVMLCAIEEG+  L +WD
Sbjct: 241  GFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGTLGLQVWD 300

Query: 1334 PRRNHRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEE-----FQRGHEICEAMELNKA 1170
            PRRN +DR H MPIITPAYPCMNSSYNVSSSTLR+M EE     F RG+EICEAME   A
Sbjct: 301  PRRNPKDRFHLMPIITPAYPCMNSSYNVSSSTLRLMLEESGRWTFGRGNEICEAMEARLA 360

Query: 1169 DWNNLFEQYPFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCH 990
            DW+ LFE + FF+AY+NYLQI+I A+N+DDLR WKGWVESRLRQLTLKIERHT  MLQCH
Sbjct: 361  DWDTLFEPFSFFEAYRNYLQIDIKADNEDDLRQWKGWVESRLRQLTLKIERHTHNMLQCH 420

Query: 989  PHPGDFSDTSKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEI 810
            PHPG+F D S+  HC YFMGLQR QGVP++EGE FDIR  VEEFK SV +Y  WKPGMEI
Sbjct: 421  PHPGEFMDKSRPLHCSYFMGLQRKQGVPINEGEHFDIRLAVEEFKHSVNIYASWKPGMEI 480

Query: 809  YVSHIRRRNIPSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEK 630
             V H++R+N+PSFVFPGGVRPSR  K   + R  S                    D  + 
Sbjct: 481  QVIHVKRKNMPSFVFPGGVRPSRPSKATWDSRRSS----------AGMSSGGRVSDGSDD 530

Query: 629  GRKXXXXXDESATNIKSLPSIGAAANGRILEDSGDCSASFTRPSGSITTTTSTAMSVGNM 450
            GRK     D  A  +K++ S  AA     L+D                     ++SVGN+
Sbjct: 531  GRKKRRIDDNVANTMKNMKSFTAAP----LDDG--------------------SLSVGNV 566

Query: 449  VDALILQDGAIKTEQVEYINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSK 270
               +I         +++   EE    L ++ +L+        L   S+ LSA   + CSK
Sbjct: 567  AVGVI-----FSANEMQENREEKTDGLKDLENLAGIPAQNADLNLQSKDLSATRDTPCSK 621

Query: 269  EAGQCATDYI-SGPSAGREVFS-ELNELEDESGL---VQDNGGNTMEGSFAESSTVKQIA 105
             A + A + I SGP    +  S E++ELED+      V+D+ GNT EGS   SST    A
Sbjct: 622  GAEKLAIETILSGPYVTNQALSQEVDELEDDRDCGIQVKDSVGNTKEGSVESSSTGMTAA 681

Query: 104  LLAT--------NGAGSSSSIGRFQNGGLEELEPAEL 18
             LA         +G  S S I    NGGLEELE AEL
Sbjct: 682  KLANAPIAVPIISGNRSGSFISFSTNGGLEELELAEL 718


>ref|XP_006428723.1| hypothetical protein CICLE_v10011139mg [Citrus clementina]
            gi|557530780|gb|ESR41963.1| hypothetical protein
            CICLE_v10011139mg [Citrus clementina]
          Length = 748

 Score =  857 bits (2215), Expect = 0.0
 Identities = 456/743 (61%), Positives = 545/743 (73%), Gaps = 12/743 (1%)
 Frame = -3

Query: 2198 GVTNVQQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEVLGRLDQIV 2019
            G +N Q+LGITEPISLAGP++ D+ +T++LEK+L  V LYE   EAVSREEVLGRLDQIV
Sbjct: 2    GSSNGQRLGITEPISLAGPTDDDLMRTRKLEKYLRDVNLYESQEEAVSREEVLGRLDQIV 61

Query: 2018 KTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHATREEDFFV 1839
            K WVKK++R +G NDQL+QEANAKI+TFGSYRLGVHGPGADIDTLC+GP+HATREEDFF 
Sbjct: 62   KIWVKKISRAKGLNDQLLQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFG 121

Query: 1838 ELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLDISQDAILQ 1659
            EL+ MLTEMPEV+ELHPVPDAHVPVMKFKF GVSIDLLYA+LSL +IPEDLDISQD+ILQ
Sbjct: 122  ELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIPEDLDISQDSILQ 181

Query: 1658 NVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWAL 1479
            N D+QTVRSLNGCRVTD+IL LVP +QNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWAL
Sbjct: 182  NADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWAL 241

Query: 1478 LVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPRRNHRDRLH 1305
            LVARICQLYPNA+PSMLVSRFFRVYTQWRWPNPV+LCAIEEGS  L +WDPRRN +D+ H
Sbjct: 242  LVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQVWDPRRNPKDKYH 301

Query: 1304 QMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKA--DWNNLFEQYPFFD 1131
             MPIITPAYPCMNSSYNVS+STLR+M +EFQRGHEICEAME N+A  DW+ LFE + FF+
Sbjct: 302  LMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVDWDTLFEPFTFFE 361

Query: 1130 AYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFSDTSKRC 951
            AYKNYL+I+ISAEN DDLRNWKGWVESRLRQLTLKIERHT+ MLQCHPHPGDFSD SK  
Sbjct: 362  AYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFSDKSKPL 421

Query: 950  HCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRRRNIPSF 771
            +C YFMGLQR QGVPV EGEQFDIR TV+EFK +V MYTL KPGM+I V+H+ RRN+P+F
Sbjct: 422  YCSYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQISVAHVTRRNLPNF 481

Query: 770  VFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXXXDESAT 591
            VFPGGVRPSR  K   + R    ++                    + GRK     D   T
Sbjct: 482  VFPGGVRPSRPSKGTWDSRRALERK-----------VSSHTKPGADDGRKRKQTDDNVDT 530

Query: 590  NIKSLPSIGAAANGRILEDSGDCSASFTRPSGSITTTTSTAMSVG-NMVDALILQDGAIK 414
            ++++     A  +  +   SG+    F   S  ++T +S+++++    +DA  L  G+ +
Sbjct: 531  HLRN-----AKCHATMPSSSGE----FREGSPIMSTISSSSINLQFEHMDANELA-GSNR 580

Query: 413  TEQVEYINEEANPLLSEVPSLSENSGSTVSLLPISRTLSADAASFCSKEAGQCATDYI-S 237
             +    + +      + V   S N      ++   R       S  SK+A + A + I S
Sbjct: 581  EKVENNLTDSIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMS 640

Query: 236  GPSAGREVFS-ELNELEDESGL---VQDNGGNTMEGSFAESST--VKQIALLATNGAGSS 75
            GP    + F  EL++LED+  L    +D  G+T   S    +     +  L + NG  SS
Sbjct: 641  GPYVADQAFPLELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSS 700

Query: 74   SSIGRFQNGGLEELEPAELTGSF 6
            S++    NGGL ELEP ELT  F
Sbjct: 701  SALS--PNGGLGELEPVELTAPF 721


>ref|XP_004512881.1| PREDICTED: nuclear poly(A) polymerase 1 [Cicer arietinum]
          Length = 753

 Score =  857 bits (2213), Expect = 0.0
 Identities = 451/744 (60%), Positives = 539/744 (72%), Gaps = 12/744 (1%)
 Frame = -3

Query: 2210 MANPGVTNV----QQLGITEPISLAGPSEIDITKTQELEKFLSGVGLYECPAEAVSREEV 2043
            M  PG++N     Q LGITEPISLAGP+E D+ K+QELEK+L G GLYE   EAV REEV
Sbjct: 1    MGIPGLSNQNNGKQWLGITEPISLAGPTEEDVVKSQELEKYLQGAGLYESQHEAVGREEV 60

Query: 2042 LGRLDQIVKTWVKKVTRNRGYNDQLVQEANAKIYTFGSYRLGVHGPGADIDTLCIGPKHA 1863
            LGRLDQIVK WVK ++R +G+N+QLV EANAKI+TFGSYRLGVHGPGADIDTLC+GP+HA
Sbjct: 61   LGRLDQIVKIWVKTISRAKGFNEQLVHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120

Query: 1862 TREEDFFVELYGMLTEMPEVSELHPVPDAHVPVMKFKFCGVSIDLLYAKLSLPIIPEDLD 1683
            TREEDFF EL  ML+EM EV+ELHPVPDAHVPVMKFKF G+S+DLLYA+L+L +IPEDLD
Sbjct: 121  TREEDFFGELRKMLSEMEEVTELHPVPDAHVPVMKFKFNGISVDLLYARLALWVIPEDLD 180

Query: 1682 ISQDAILQNVDDQTVRSLNGCRVTDKILHLVPNVQNFRTTLRCMRFWAKRRGVYSNVAGF 1503
            ISQ++ILQN D+QTV SLNGCRVTD++L LVPN+QNFRTTLRCMRFWAKRRGVYSNVAGF
Sbjct: 181  ISQESILQNADEQTVLSLNGCRVTDQVLRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 240

Query: 1502 LGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGS--LPIWDPR 1329
            LGGIN ALLV RICQLYPNALP+MLVSRFFRVYTQWRWPNPVMLCAIEEGS  L +WDPR
Sbjct: 241  LGGINLALLVGRICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLSVWDPR 300

Query: 1328 RNHRDRLHQMPIITPAYPCMNSSYNVSSSTLRVMTEEFQRGHEICEAMELNKADWNNLFE 1149
            RN +DR H MPIITPAYPCMNS+YNV+ STLR+M+EEF+RG EICEAME +KADW+ LFE
Sbjct: 301  RNPKDRYHLMPIITPAYPCMNSTYNVTLSTLRIMSEEFKRGSEICEAMEASKADWDTLFE 360

Query: 1148 QYPFFDAYKNYLQIEISAENDDDLRNWKGWVESRLRQLTLKIERHTFGMLQCHPHPGDFS 969
             YPFF+AYKNYLQI+I+AEN DDLR WKGWVESRLRQLTLKIER+T+GMLQCHP+PG+FS
Sbjct: 361  PYPFFEAYKNYLQIDITAENADDLRQWKGWVESRLRQLTLKIERYTYGMLQCHPYPGEFS 420

Query: 968  DTSKRCHCCYFMGLQRNQGVPVHEGEQFDIRATVEEFKLSVGMYTLWKPGMEIYVSHIRR 789
            D S+  H CYFMGLQR QGVPV+EGEQFDIR TVEEFK SV  YTLWKPGM+I+VSH++R
Sbjct: 421  DKSRTFHQCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNAYTLWKPGMDIHVSHVKR 480

Query: 788  RNIPSFVFPGGVRPSRSPKVAGEGRPVSNKRXXXXXXXXXXXXXXXXVDAVEKGRKXXXX 609
            RNIP+F+FPGGVRP    K  GE R  S  R                 +   K ++    
Sbjct: 481  RNIPNFIFPGGVRPLLPSKATGENRQSSKSRVSGHSQAEKSQGGKAATNEARKRKRSEEN 540

Query: 608  XDESATNI-KSLPSIGAAANGRILEDSGDCSASFTRPSGSITTTTSTAMSVGNMVDALIL 432
             + + + I KS  S+ +  N  + ED          P  S+T++ S           +  
Sbjct: 541  VENNNSKISKSFVSL-SPPNKEVHED--------ITPIISVTSSCS-----------MKF 580

Query: 431  QDGAIKTEQVEYINEEANPLLSEVPS-LSENSGSTVSLLPISRTLSADAASFCSKEAGQC 255
             D  + +   +   +    L+ E+PS  S+  GS +     ++ L+A  AS   +E    
Sbjct: 581  DDSEVNSISAQKSEKPCLKLVGEIPSGDSQAYGSVMG----NQQLTAPDASNTKEEERLA 636

Query: 254  ATDYISGP-SAGREVFSELNELEDESGL---VQDNGGNTMEGSFAESSTVKQIALLATNG 87
                +SGP    + +  E +ELED+ G    V+DNGG+    +F  S     +A      
Sbjct: 637  IEQIMSGPYEVHQALAEESDELEDDMGYRNQVKDNGGSVKSNNFDISIPKFVVAEEQVIP 696

Query: 86   AGSSSSIGRFQNGGLEELEPAELT 15
              +  S   F NGGL+ELEPAELT
Sbjct: 697  KETICSTHLFSNGGLDELEPAELT 720

