BLASTX nr result

ID: Cornus23_contig00006308 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00006308
         (2943 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007012184.1| Poly(A) polymerase 1 isoform 3 [Theobroma ca...  1046   0.0  
ref|XP_007012183.1| Poly(A) polymerase 1 isoform 2 [Theobroma ca...  1041   0.0  
ref|XP_007012185.1| Poly(A) polymerase 1 isoform 4 [Theobroma ca...  1038   0.0  
ref|XP_007012182.1| Poly(A) polymerase 1 isoform 1 [Theobroma ca...  1038   0.0  
ref|XP_012077399.1| PREDICTED: LOW QUALITY PROTEIN: nuclear poly...  1025   0.0  
ref|XP_007012186.1| Poly(A) polymerase 1 isoform 5 [Theobroma ca...  1025   0.0  
ref|XP_011019566.1| PREDICTED: LOW QUALITY PROTEIN: nuclear poly...  1024   0.0  
ref|XP_011010887.1| PREDICTED: nuclear poly(A) polymerase 4 isof...  1006   0.0  
ref|XP_012434911.1| PREDICTED: nuclear poly(A) polymerase 4 isof...   994   0.0  
ref|XP_012434772.1| PREDICTED: nuclear poly(A) polymerase 4 isof...   989   0.0  
ref|XP_002324162.2| hypothetical protein POPTR_0018s04870g [Popu...   988   0.0  
ref|XP_011010888.1| PREDICTED: nuclear poly(A) polymerase 4 isof...   985   0.0  
ref|XP_007012187.1| Poly(A) polymerase 1 isoform 6 [Theobroma ca...   979   0.0  
gb|KDP34179.1| hypothetical protein JCGZ_07750 [Jatropha curcas]      976   0.0  
ref|XP_012435059.1| PREDICTED: nuclear poly(A) polymerase 4 isof...   975   0.0  
ref|XP_012435194.1| PREDICTED: nuclear poly(A) polymerase 4 isof...   973   0.0  
ref|XP_009587895.1| PREDICTED: poly(A) polymerase type 3-like is...   971   0.0  
ref|XP_009779470.1| PREDICTED: poly(A) polymerase type 3-like is...   965   0.0  
ref|XP_009587889.1| PREDICTED: poly(A) polymerase PAPalpha-like ...   959   0.0  
gb|KJB07462.1| hypothetical protein B456_001G025900 [Gossypium r...   958   0.0  

>ref|XP_007012184.1| Poly(A) polymerase 1 isoform 3 [Theobroma cacao]
            gi|508782547|gb|EOY29803.1| Poly(A) polymerase 1 isoform
            3 [Theobroma cacao]
          Length = 811

 Score = 1046 bits (2704), Expect = 0.0
 Identities = 540/793 (68%), Positives = 622/793 (78%), Gaps = 20/793 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I++IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADVQRNTELEKFLIESGLYESKEEAVKREEVLGHINEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSY LGVHGPGAD+DTLC+GPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYCLGVHGPGADIDTLCIGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVP++LDISHGSVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPDNLDISHGSVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 207  VADQILKLVPNVEHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAIP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNK++WN+ FEPYLFFEAYKNY           
Sbjct: 327  SYNVSISTLRVMMEQFQCGNRICEEIELNKSQWNALFEPYLFFEAYKNYLQVDIVSAEAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGYKRPR SRH 
Sbjct: 447  GQEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYKRPRSSRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRS---EGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGS 1279
             Q   +T ++  +  RS+S   E   KRKH  E  D K +KP KRSSISPQRL SV   S
Sbjct: 507  GQ---QTGKICEDITRSQSGSVERQIKRKHEDEAFDEKMDKPDKRSSISPQRLESVSPES 563

Query: 1278 S-------------EEIKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAH 1138
            S             + + LE      VD NS  R S G+L+SE+  VG  +++    D  
Sbjct: 564  SASRSGGTSHISDGQMVTLERPTTWDVDSNSVLRQSSGLLDSEKRNVGISIQQARTVD-Q 622

Query: 1137 RSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKED 967
             S+TL+   SL+          V+  E + EP   +E  +PCEV   +   T K  + ++
Sbjct: 623  GSLTLSGQTSLDVVHNLSVVRNVESAEQMGEPFLRQESHSPCEVPDSELRETCKTGVNQE 682

Query: 966  KVEDLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQN 787
            K  D +   +++AETG +RR+L       G D+E++KPC +TAV E   SV GS+S+ QN
Sbjct: 683  KTGDYSSAYMNDAETGSSRRILNWKGGGVGVDQEVVKPCNQTAVVEIAESVFGSSSNAQN 742

Query: 786  LNSKGSLQGDVHATDSDSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESV 610
            LN     +G V + D DSLL NG  N NG+++N+L+EELKPN ALG +V +QDGA SE++
Sbjct: 743  LN----CEGVVCSADLDSLLENGHLNANGVFQNSLSEELKPNIALGKVVNSQDGARSETL 798

Query: 609  QKPVLRLSLESTA 571
            QKPV+RLSL+S A
Sbjct: 799  QKPVMRLSLKSMA 811


>ref|XP_007012183.1| Poly(A) polymerase 1 isoform 2 [Theobroma cacao]
            gi|508782546|gb|EOY29802.1| Poly(A) polymerase 1 isoform
            2 [Theobroma cacao]
          Length = 812

 Score = 1041 bits (2692), Expect = 0.0
 Identities = 540/794 (68%), Positives = 622/794 (78%), Gaps = 21/794 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I++IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADVQRNTELEKFLIESGLYESKEEAVKREEVLGHINEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSY LGVHGPGAD+DTLC+GPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYCLGVHGPGADIDTLCIGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVP++LDISHGSVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPDNLDISHGSVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 207  VADQILKLVPNVEHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAIP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNK++WN+ FEPYLFFEAYKNY           
Sbjct: 327  SYNVSISTLRVMMEQFQCGNRICEEIELNKSQWNALFEPYLFFEAYKNYLQVDIVSAEAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGYKRPR SRH 
Sbjct: 447  GQEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYKRPRSSRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRS---EGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGS 1279
             Q   +T ++  +  RS+S   E   KRKH  E  D K +KP KRSSISPQRL SV   S
Sbjct: 507  GQ---QTGKICEDITRSQSGSVERQIKRKHEDEAFDEKMDKPDKRSSISPQRLESVSPES 563

Query: 1278 S-------------EEIKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAH 1138
            S             + + LE      VD NS  R S G+L+SE+  VG  +++    D  
Sbjct: 564  SASRSGGTSHISDGQMVTLERPTTWDVDSNSVLRQSSGLLDSEKRNVGISIQQARTVD-Q 622

Query: 1137 RSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKED 967
             S+TL+   SL+          V+  E + EP   +E  +PCEV   +   T K  + ++
Sbjct: 623  GSLTLSGQTSLDVVHNLSVVRNVESAEQMGEPFLRQESHSPCEVPDSELRETCKTGVNQE 682

Query: 966  KVEDLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQN 787
            K  D +   +++AETG +RR+L       G D+E++KPC +TAV E   SV GS+S+ QN
Sbjct: 683  KTGDYSSAYMNDAETGSSRRILNWKGGGVGVDQEVVKPCNQTAVVEIAESVFGSSSNAQN 742

Query: 786  LNSKGSLQGDVHATDSDSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESV 610
            LN     +G V + D DSLL NG  N NG+++N+L+EELKPN ALG +V +QDGA SE++
Sbjct: 743  LN----CEGVVCSADLDSLLENGHLNANGVFQNSLSEELKPNIALGKVVNSQDGARSETL 798

Query: 609  QKPVL-RLSLESTA 571
            QKPV+ RLSL+S A
Sbjct: 799  QKPVMSRLSLKSMA 812


>ref|XP_007012185.1| Poly(A) polymerase 1 isoform 4 [Theobroma cacao]
            gi|508782548|gb|EOY29804.1| Poly(A) polymerase 1 isoform
            4 [Theobroma cacao]
          Length = 804

 Score = 1038 bits (2684), Expect = 0.0
 Identities = 535/786 (68%), Positives = 616/786 (78%), Gaps = 20/786 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I++IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADVQRNTELEKFLIESGLYESKEEAVKREEVLGHINEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSY LGVHGPGAD+DTLC+GPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYCLGVHGPGADIDTLCIGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVP++LDISHGSVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPDNLDISHGSVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 207  VADQILKLVPNVEHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAIP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNK++WN+ FEPYLFFEAYKNY           
Sbjct: 327  SYNVSISTLRVMMEQFQCGNRICEEIELNKSQWNALFEPYLFFEAYKNYLQVDIVSAEAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGYKRPR SRH 
Sbjct: 447  GQEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYKRPRSSRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRS---EGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGS 1279
             Q   +T ++  +  RS+S   E   KRKH  E  D K +KP KRSSISPQRL SV   S
Sbjct: 507  GQ---QTGKICEDITRSQSGSVERQIKRKHEDEAFDEKMDKPDKRSSISPQRLESVSPES 563

Query: 1278 S-------------EEIKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAH 1138
            S             + + LE      VD NS  R S G+L+SE+  VG  +++    D  
Sbjct: 564  SASRSGGTSHISDGQMVTLERPTTWDVDSNSVLRQSSGLLDSEKRNVGISIQQARTVD-Q 622

Query: 1137 RSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKED 967
             S+TL+   SL+          V+  E + EP   +E  +PCEV   +   T K  + ++
Sbjct: 623  GSLTLSGQTSLDVVHNLSVVRNVESAEQMGEPFLRQESHSPCEVPDSELRETCKTGVNQE 682

Query: 966  KVEDLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQN 787
            K  D +   +++AETG +RR+L       G D+E++KPC +TAV E   SV GS+S+ QN
Sbjct: 683  KTGDYSSAYMNDAETGSSRRILNWKGGGVGVDQEVVKPCNQTAVVEIAESVFGSSSNAQN 742

Query: 786  LNSKGSLQGDVHATDSDSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESV 610
            LN     +G V + D DSLL NG  N NG+++N+L+EELKPN ALG +V +QDGA SE++
Sbjct: 743  LN----CEGVVCSADLDSLLENGHLNANGVFQNSLSEELKPNIALGKVVNSQDGARSETL 798

Query: 609  QKPVLR 592
            QKPV+R
Sbjct: 799  QKPVMR 804


>ref|XP_007012182.1| Poly(A) polymerase 1 isoform 1 [Theobroma cacao]
            gi|508782545|gb|EOY29801.1| Poly(A) polymerase 1 isoform
            1 [Theobroma cacao]
          Length = 817

 Score = 1038 bits (2684), Expect = 0.0
 Identities = 535/786 (68%), Positives = 616/786 (78%), Gaps = 20/786 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I++IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADVQRNTELEKFLIESGLYESKEEAVKREEVLGHINEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSY LGVHGPGAD+DTLC+GPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYCLGVHGPGADIDTLCIGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVP++LDISHGSVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPDNLDISHGSVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 207  VADQILKLVPNVEHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAIP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNK++WN+ FEPYLFFEAYKNY           
Sbjct: 327  SYNVSISTLRVMMEQFQCGNRICEEIELNKSQWNALFEPYLFFEAYKNYLQVDIVSAEAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGYKRPR SRH 
Sbjct: 447  GQEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYKRPRSSRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRS---EGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGS 1279
             Q   +T ++  +  RS+S   E   KRKH  E  D K +KP KRSSISPQRL SV   S
Sbjct: 507  GQ---QTGKICEDITRSQSGSVERQIKRKHEDEAFDEKMDKPDKRSSISPQRLESVSPES 563

Query: 1278 S-------------EEIKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAH 1138
            S             + + LE      VD NS  R S G+L+SE+  VG  +++    D  
Sbjct: 564  SASRSGGTSHISDGQMVTLERPTTWDVDSNSVLRQSSGLLDSEKRNVGISIQQARTVD-Q 622

Query: 1137 RSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKED 967
             S+TL+   SL+          V+  E + EP   +E  +PCEV   +   T K  + ++
Sbjct: 623  GSLTLSGQTSLDVVHNLSVVRNVESAEQMGEPFLRQESHSPCEVPDSELRETCKTGVNQE 682

Query: 966  KVEDLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQN 787
            K  D +   +++AETG +RR+L       G D+E++KPC +TAV E   SV GS+S+ QN
Sbjct: 683  KTGDYSSAYMNDAETGSSRRILNWKGGGVGVDQEVVKPCNQTAVVEIAESVFGSSSNAQN 742

Query: 786  LNSKGSLQGDVHATDSDSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESV 610
            LN     +G V + D DSLL NG  N NG+++N+L+EELKPN ALG +V +QDGA SE++
Sbjct: 743  LN----CEGVVCSADLDSLLENGHLNANGVFQNSLSEELKPNIALGKVVNSQDGARSETL 798

Query: 609  QKPVLR 592
            QKPV+R
Sbjct: 799  QKPVMR 804


>ref|XP_012077399.1| PREDICTED: LOW QUALITY PROTEIN: nuclear poly(A) polymerase 4
            [Jatropha curcas]
          Length = 833

 Score = 1025 bits (2651), Expect = 0.0
 Identities = 539/818 (65%), Positives = 617/818 (75%), Gaps = 45/818 (5%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGPTEADL RNAELEKFL+ SGLYESKE + +REEV+ RIDQIVK WVKQLT QRGYT
Sbjct: 28   SLAGPTEADLHRNAELEKFLVDSGLYESKEKTMKREEVLGRIDQIVKGWVKQLTHQRGYT 87

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNREEDFFI+LHDIL+EM+EV+E
Sbjct: 88   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFIILHDILSEMDEVTE 147

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYD+DEQTVRSLNGCR
Sbjct: 148  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDIDEQTVRSLNGCR 207

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFRTTLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 208  VADQILKLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARLCQLYPNAIP 267

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKN RDR HHMPIITPAYPCMNS
Sbjct: 268  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNHRDRFHHMPIITPAYPCMNS 327

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICE----------------------------EIELNKAE 1894
            SYNVS STLRVMM+QF +GN+ICE                            EIELNKA+
Sbjct: 328  SYNVSISTLRVMMEQFQYGNKICEVCAXVFTFYFAAHIILLMPSLWSIFSMQEIELNKAQ 387

Query: 1893 WNSFFEPYLFFEAYKNYXXXXXXXXXXXXXXAWRGWVESRLRQLTLKIERDTYGMLQCHP 1714
            W++ FEPYLFFEAYKNY              AW+GWVESRLRQLTLKIERDT G+LQCHP
Sbjct: 388  WSALFEPYLFFEAYKNYLQIDIIAADADDLLAWKGWVESRLRQLTLKIERDTNGVLQCHP 447

Query: 1713 YPNEYTDTSKPCPHCAFFMGLQRKQGVKVQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDI 1534
            YPNEY DTSK C HCAFFMGLQR++GV  QEGQQFDIRGTVDEFRQE+NMYMFWKPGMDI
Sbjct: 448  YPNEYVDTSKQCSHCAFFMGLQRREGVTGQEGQQFDIRGTVDEFRQEINMYMFWKPGMDI 507

Query: 1533 YVSHVRRKQIPAYVFPDGYKRPRPSRHTSQGVERTPEVDAEGCRSRSEGHPKRKHGAEMV 1354
            YVSHVRR+Q+PA+VFPDGYKR RPSRH +Q V ++ +  A       EGH KRK+  E V
Sbjct: 508  YVSHVRRRQLPAFVFPDGYKRSRPSRHLNQQVSKSNDGAATSRIGSPEGHLKRKNDHEEV 567

Query: 1353 DVKAEKPGKRSSISPQRLGSV-------------FFGSSEEIKLECLVAGGVDRNSENRS 1213
            D++ +KP KR+SISPQRL SV                 SE IKL C  A  VD NSE RS
Sbjct: 568  DLRPDKPEKRASISPQRLQSVSPESSTSRCGGTSLANFSERIKLGCSTAADVDNNSEARS 627

Query: 1212 SGGILESERGKVGCDVEKLGETDAHRSITLNEHNSLNA-CDTSRAWSEVKPDEPVAEPPP 1036
              G   +E   +G +V ++GET     + L +   +    + +   +EV+P +   E   
Sbjct: 628  CRGPSSNENCILG-NVMQVGET----VMGLYDPAVVRGDVEPAECRNEVEPTDLAVETIL 682

Query: 1035 SKELFAPCEVKTEVTLKIE--LKEDKVEDLALTDVDNAETGKARRLLKQTEMVFGGDEEL 862
             +EL  P E+ +    KI     E++  DL    ++N   G   RLLK    V   +EEL
Sbjct: 683  KQELLDPYEISSSEIRKIHNVTNENRNGDLISASLEN---GSPNRLLKWGGEVIEVEEEL 739

Query: 861  LKPCKRTAVSEDDRSVLGSNSSMQNLNSKGSLQGDVHATDSDSLLGNGCPNGNGIYENNL 682
            ++PC +TAV E   SV+ SN+S QNLN +G+    + A D DSLL NGC N +G ++N+L
Sbjct: 740  VRPCNQTAVVELAESVICSNTSAQNLNCEGA----ICAADLDSLLENGCLNASGAFQNSL 795

Query: 681  TEELKPNTALG-MVEAQDGASSESVQKPVLRLSLESTA 571
             EEL+P+TA+G +V +QDG  SES+QKPV+RL+L+S A
Sbjct: 796  PEELEPSTAIGKVVNSQDGDRSESLQKPVIRLNLKSKA 833


>ref|XP_007012186.1| Poly(A) polymerase 1 isoform 5 [Theobroma cacao]
            gi|508782549|gb|EOY29805.1| Poly(A) polymerase 1 isoform
            5 [Theobroma cacao]
          Length = 801

 Score = 1025 bits (2650), Expect = 0.0
 Identities = 534/793 (67%), Positives = 613/793 (77%), Gaps = 20/793 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I++IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADVQRNTELEKFLIESGLYESKEEAVKREEVLGHINEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSY LGVHGPGAD+DTLC+GPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYCLGVHGPGADIDTLCIGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVP++LDISHGSVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPDNLDISHGSVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 207  VADQILKLVPNVEHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAIP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNK++WN+ FEPYLFFEAYKNY           
Sbjct: 327  SYNVSISTLRVMMEQFQCGNRICEEIELNKSQWNALFEPYLFFEAYKNYLQVDIVSAEAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGYKRPR SRH 
Sbjct: 447  GQEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYKRPRSSRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRS---EGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGS 1279
             Q   +T ++  +  RS+S   E   KRKH  E  D K +KP KRSSISPQRL SV   S
Sbjct: 507  GQ---QTGKICEDITRSQSGSVERQIKRKHEDEAFDEKMDKPDKRSSISPQRLESVSPES 563

Query: 1278 S-------------EEIKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAH 1138
            S             + + LE      VD NS  R S G+L+SE+  VG  +++    D  
Sbjct: 564  SASRSGGTSHISDGQMVTLERPTTWDVDSNSVLRQSSGLLDSEKRNVGISIQQARTVD-Q 622

Query: 1137 RSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKED 967
             S+TL+   SL+          V+  E + EP   +E  +PCEV   +   T K  + ++
Sbjct: 623  GSLTLSGQTSLDVVHNLSVVRNVESAEQMGEPFLRQESHSPCEVPDSELRETCKTGVNQE 682

Query: 966  KVEDLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQN 787
            K  D +   +++AETG +RR+L       G D+E++KPC +TAV E   SV GS+S+ QN
Sbjct: 683  KTGDYSSAYMNDAETGSSRRILNWKGGGVGVDQEVVKPCNQTAVVEIAESVFGSSSNAQN 742

Query: 786  LNSKGSLQGDVHATDSDSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESV 610
            LN     +G V + D DSLL NG  N NG+++N+L+EELKPN ALG +V +QDGA     
Sbjct: 743  LN----CEGVVCSADLDSLLENGHLNANGVFQNSLSEELKPNIALGKVVNSQDGA----- 793

Query: 609  QKPVLRLSLESTA 571
                 RLSL+S A
Sbjct: 794  -----RLSLKSMA 801


>ref|XP_011019566.1| PREDICTED: LOW QUALITY PROTEIN: nuclear poly(A) polymerase 4-like
            [Populus euphratica]
          Length = 823

 Score = 1024 bits (2647), Expect = 0.0
 Identities = 539/808 (66%), Positives = 614/808 (75%), Gaps = 35/808 (4%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            S+AGPTE DL RNAELEKFL+ SGLYESKE++ +REEV+ RIDQIVK WVK+LTRQRGYT
Sbjct: 22   SVAGPTEPDLHRNAELEKFLVDSGLYESKEEAMKREEVLGRIDQIVKDWVKRLTRQRGYT 81

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNRE DFFIVLHD LAEMEEV+E
Sbjct: 82   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREGDFFIVLHDKLAEMEEVTE 141

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDIS+GSVLY+VDEQTVRSLNGCR
Sbjct: 142  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISNGSVLYEVDEQTVRSLNGCR 201

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 202  VADQILKLVPNVEHFRATLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAIP 261

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEED LGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 262  SMLVSRFFRVYTQWRWPNPVMLCSIEEDALGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 321

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVSTSTLRVM +QF  GNRICEEIELNKA+W++ FEPYLFFEAYKNY           
Sbjct: 322  SYNVSTSTLRVMTEQFQSGNRICEEIELNKAQWSALFEPYLFFEAYKNYLQVDIVAADAV 381

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY D SK C HCAFFMGLQRK+GV 
Sbjct: 382  DLLAWKGWVESRLRQLTLKIERDTDGMLQCHPYPNEYIDPSKQCAHCAFFMGLQRKEGVT 441

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQ++NMY+ WKPGMDIYVSHVRR+Q+P +VFPDGYKR RPSRH 
Sbjct: 442  GQEGQQFDIRGTVDEFRQDINMYLPWKPGMDIYVSHVRRRQLPGFVFPDGYKRSRPSRHM 501

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSV------- 1291
            +Q   RT E  A      +E H KRK+  EM D+K  KP KR+S SPQRL SV       
Sbjct: 502  NQQTNRTSEDVARSLSGSAERHVKRKNDCEMADLKPVKPEKRASTSPQRLQSVSPSSSAG 561

Query: 1290 ------FFGSSEEIKLECLVAG-------GVDRNSENRSSGGILESERGKVGCDVEKLGE 1150
                    GS E + L C   G        V  NSE RS+ G LESE+G +G D  +LG 
Sbjct: 562  RSGVTSLAGSCEGVILGCSTIGDIVSNCEDVASNSEVRSTSGQLESEKGDLG-DARQLGV 620

Query: 1149 TDAHRSITLNEHNSLNACDTSRAWSEVKP------DEPVA---EPPPSKELFAPCEV--- 1006
            T  ++   LN+  S++  D+    +E++P       EP+    +P   +EL +  EV   
Sbjct: 621  T-VYQESPLNQQTSMDVHDSPIVRNELEPADHMNGSEPMGLMFDPITKQELVSSHEVPNF 679

Query: 1005 KTEVTLKIELKEDKVEDLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSED 826
            +T    K     +K+EDL    ++N  + K    +         D+EL+KPC +TAV E 
Sbjct: 680  ETGEKHKEVGVNEKIEDLGSNFLENGSSRKLMNWVGGASRGMEVDQELVKPCSQTAVVEF 739

Query: 825  DRSVLGSNSSMQNLNSKGSLQGDVHATDSDSLLGNGCPNGNG--IYENNLTEELKPNTAL 652
              SV+ S+S  QNLN     +G+V A D+DSLL +GC N +G  + +N L EEL+P TA+
Sbjct: 740  AESVISSHSGSQNLN----YEGNVCAVDADSLLESGCLNVSGXVLLQNGLPEELEPKTAI 795

Query: 651  GMV-EAQDGASSESVQKPVLRLSLESTA 571
            G V  +QDGA SES+QKPV+RLSL+S+A
Sbjct: 796  GKVLNSQDGARSESLQKPVIRLSLKSSA 823


>ref|XP_011010887.1| PREDICTED: nuclear poly(A) polymerase 4 isoform X1 [Populus
            euphratica]
          Length = 799

 Score = 1006 bits (2601), Expect = 0.0
 Identities = 529/790 (66%), Positives = 600/790 (75%), Gaps = 17/790 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            S+AGPTE DL RNAELEKFL+ SGL ESK+++ +REEV+ RIDQIVK WVKQLTRQRGYT
Sbjct: 22   SVAGPTEPDLHRNAELEKFLVDSGLNESKDETIKREEVLGRIDQIVKDWVKQLTRQRGYT 81

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNREEDFFI LHD LAEMEEV+E
Sbjct: 82   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFITLHDKLAEMEEVTE 141

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDIS+GSVLY+VDEQTVRSLNGCR
Sbjct: 142  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISNGSVLYEVDEQTVRSLNGCR 201

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFRTTLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 202  VADQILKLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAIP 261

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEED LGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 262  SMLVSRFFRVYTQWRWPNPVMLCSIEEDALGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 321

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVSTSTLRVM +QF  GNRICEEIELNKA+W++ FEPYLFFEAYKNY           
Sbjct: 322  SYNVSTSTLRVMTEQFQSGNRICEEIELNKAQWSALFEPYLFFEAYKNYLQVDIVAAVAA 381

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY D SK CPHCAFFMGLQRK+GV 
Sbjct: 382  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYIDASKQCPHCAFFMGLQRKEGVT 441

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQE+NMYMFWKPGM+IYVSHVRR+Q+P +VFPDGYKR R SRH 
Sbjct: 442  SQEGQQFDIRGTVDEFRQEINMYMFWKPGMEIYVSHVRRRQLPGFVFPDGYKRSRSSRHV 501

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQ---------RLG 1297
            +Q   +T E  A      +E   KRK+  EM D+K EK    S I PQ         R G
Sbjct: 502  NQHTSKTGEDVARSQSGSAERPVKRKNDCEMEDLKPEKRALNSPIRPQSVSPSSSVSRSG 561

Query: 1296 SVFFGSS-EEIKLECLV------AGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAH 1138
                 SS E +KL C           V  NSE RSS G LESE+  +G D  +LGET  +
Sbjct: 562  VTSLASSCEGVKLGCSTIDIGSNCKDVASNSEVRSSSGQLESEKDGLG-DAMQLGET-VY 619

Query: 1137 RSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCEVKTEVTLKIELKEDKVE 958
            +   LN   S++  D+    +E++P   +    P ++ F   E K E  +      DK+ 
Sbjct: 620  QDSPLNRQISMDVHDSKIVRNELEPANNMNGIEPMEQNFETGE-KHETGV-----NDKIA 673

Query: 957  DLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQNLNS 778
             L    +++  + K    +  T      D+EL+KPC +TAV E   SV+ S+S  QNLN 
Sbjct: 674  GLGSNIMESGSSRKLLNWVAGTSQAVEVDQELVKPCCQTAVVEYADSVIKSHSGTQNLN- 732

Query: 777  KGSLQGDVHATDSDSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESVQKP 601
                +G+V A D+D +L NGC N N + +  L EEL+P TA+G +V +QDGA SES+QKP
Sbjct: 733  ---CEGNVCAVDADVVLENGCLNMNRVLQKGLPEELEPKTAIGKVVNSQDGARSESLQKP 789

Query: 600  VLRLSLESTA 571
            ++RLSL+STA
Sbjct: 790  MIRLSLKSTA 799


>ref|XP_012434911.1| PREDICTED: nuclear poly(A) polymerase 4 isoform X2 [Gossypium
            raimondii] gi|823120688|ref|XP_012434982.1| PREDICTED:
            nuclear poly(A) polymerase 4 isoform X2 [Gossypium
            raimondii] gi|763739960|gb|KJB07459.1| hypothetical
            protein B456_001G025900 [Gossypium raimondii]
            gi|763739961|gb|KJB07460.1| hypothetical protein
            B456_001G025900 [Gossypium raimondii]
            gi|763739967|gb|KJB07466.1| hypothetical protein
            B456_001G025900 [Gossypium raimondii]
          Length = 775

 Score =  994 bits (2570), Expect = 0.0
 Identities = 517/777 (66%), Positives = 596/777 (76%), Gaps = 4/777 (0%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I +IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADIQRNTELEKFLIESGLYESKEEAAKREEVLGHISEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVM+FKFQGISIDLLYASISLLVVP+DLDIS  SVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMRFKFQGISIDLLYASISLLVVPDDLDISRESVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN++HFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNA+ 
Sbjct: 207  VADQILKLVPNVKHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAVP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNKA+W++ FEP+LFFEAYKNY           
Sbjct: 327  SYNVSLSTLRVMMEQFQFGNRICEEIELNKAQWSALFEPHLFFEAYKNYLQVDIVSADAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
              EGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGY+RPR  RH 
Sbjct: 447  GLEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYRRPRSLRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGSSEE 1270
            SQ   +T E         +E   KRK   E VD K  KP KR+SISP R+ SV    S +
Sbjct: 507  SQQTGKTCEDVTTSRSGSAERQIKRKRDDETVDEKLNKPEKRASISPLRMESV----SPD 562

Query: 1269 IKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAHRSITLNEHNSLNACDT 1090
            I     V       S N +   +    RG V  D             +L    SL+  D+
Sbjct: 563  IITSKSVG-----TSHNSNGQAVKVEHRGTVDLD-------------SLRGQTSLDIDDS 604

Query: 1089 SRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKEDKVEDLALTDVDNAETG 919
            S   S V+  E +   P  +EL +PCEV   +T  T K  L ++K  D     +++ E G
Sbjct: 605  SVVRS-VESAEQIG-LPFRQELLSPCEVSDFETRETCKAGLNQEKTADSTSAFINDPEIG 662

Query: 918  KARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQNLNSKGSLQGDVHATDS 739
             +RR+L    +    D+E++K C +TA  E   SV GS+S+ QNLN KGS+ G     D 
Sbjct: 663  SSRRILNWKGVGAEVDQEVVKACNQTAAVEIAESVFGSSSNAQNLNCKGSVCG----ADL 718

Query: 738  DSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESVQKPVLRLSLESTA 571
            DSLL  G  N + +++N+L+EELKP+ ++G +V +QDGA SE++QKPV+RLSL+STA
Sbjct: 719  DSLLEKGHLNASAVFQNSLSEELKPSISVGKVVNSQDGARSETLQKPVMRLSLKSTA 775


>ref|XP_012434772.1| PREDICTED: nuclear poly(A) polymerase 4 isoform X1 [Gossypium
            raimondii] gi|823120684|ref|XP_012434842.1| PREDICTED:
            nuclear poly(A) polymerase 4 isoform X1 [Gossypium
            raimondii] gi|763739964|gb|KJB07463.1| hypothetical
            protein B456_001G025900 [Gossypium raimondii]
          Length = 776

 Score =  989 bits (2558), Expect = 0.0
 Identities = 517/778 (66%), Positives = 596/778 (76%), Gaps = 5/778 (0%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I +IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADIQRNTELEKFLIESGLYESKEEAAKREEVLGHISEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVM+FKFQGISIDLLYASISLLVVP+DLDIS  SVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMRFKFQGISIDLLYASISLLVVPDDLDISRESVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN++HFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNA+ 
Sbjct: 207  VADQILKLVPNVKHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAVP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNKA+W++ FEP+LFFEAYKNY           
Sbjct: 327  SYNVSLSTLRVMMEQFQFGNRICEEIELNKAQWSALFEPHLFFEAYKNYLQVDIVSADAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
              EGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGY+RPR  RH 
Sbjct: 447  GLEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYRRPRSLRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGSSEE 1270
            SQ   +T E         +E   KRK   E VD K  KP KR+SISP R+ SV    S +
Sbjct: 507  SQQTGKTCEDVTTSRSGSAERQIKRKRDDETVDEKLNKPEKRASISPLRMESV----SPD 562

Query: 1269 IKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAHRSITLNEHNSLNACDT 1090
            I     V       S N +   +    RG V  D             +L    SL+  D+
Sbjct: 563  IITSKSVG-----TSHNSNGQAVKVEHRGTVDLD-------------SLRGQTSLDIDDS 604

Query: 1089 SRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKEDKVEDLALTDVDNAETG 919
            S   S V+  E +   P  +EL +PCEV   +T  T K  L ++K  D     +++ E G
Sbjct: 605  SVVRS-VESAEQIG-LPFRQELLSPCEVSDFETRETCKAGLNQEKTADSTSAFINDPEIG 662

Query: 918  KARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQNLNSKGSLQGDVHATDS 739
             +RR+L    +    D+E++K C +TA  E   SV GS+S+ QNLN KGS+ G     D 
Sbjct: 663  SSRRILNWKGVGAEVDQEVVKACNQTAAVEIAESVFGSSSNAQNLNCKGSVCG----ADL 718

Query: 738  DSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESVQKPVL-RLSLESTA 571
            DSLL  G  N + +++N+L+EELKP+ ++G +V +QDGA SE++QKPV+ RLSL+STA
Sbjct: 719  DSLLEKGHLNASAVFQNSLSEELKPSISVGKVVNSQDGARSETLQKPVMSRLSLKSTA 776


>ref|XP_002324162.2| hypothetical protein POPTR_0018s04870g [Populus trichocarpa]
            gi|550318063|gb|EEF02727.2| hypothetical protein
            POPTR_0018s04870g [Populus trichocarpa]
          Length = 835

 Score =  988 bits (2554), Expect = 0.0
 Identities = 532/819 (64%), Positives = 608/819 (74%), Gaps = 46/819 (5%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            S+AGPTE DL RNAELEKFL+ SGL ESK+++ +REEV+ RIDQIVK WVKQLTRQRGYT
Sbjct: 22   SVAGPTEPDLHRNAELEKFLVDSGLNESKDETIKREEVLGRIDQIVKDWVKQLTRQRGYT 81

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNREEDFFI LHD LAE EEV+E
Sbjct: 82   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFITLHDKLAETEEVTE 141

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDIS+GSVLY+VDEQTVRSLNGCR
Sbjct: 142  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISNGSVLYEVDEQTVRSLNGCR 201

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFRTTLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 202  VADQILKLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAIP 261

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEED+LGFPVWDPRKNPRDR H MPIITPAYPCMNS
Sbjct: 262  SMLVSRFFRVYTQWRWPNPVMLCSIEEDDLGFPVWDPRKNPRDRFHLMPIITPAYPCMNS 321

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVSTSTLRVM +QF  GNRICEEIELNKA+W++ FEPYLFFEAYKNY           
Sbjct: 322  SYNVSTSTLRVMTEQFQSGNRICEEIELNKAQWSALFEPYLFFEAYKNYLQVDIVAAVAA 381

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY D SK CPHCAFFMGLQRK+GV 
Sbjct: 382  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYIDASKQCPHCAFFMGLQRKEGVT 441

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQE+NMYMFWKPGM+IYVSHVRR+Q+P +VFPDGYKR R SRH 
Sbjct: 442  GQEGQQFDIRGTVDEFRQEINMYMFWKPGMEIYVSHVRRRQLPGFVFPDGYKRSRSSRHI 501

Query: 1449 SQ------GVE-----------RTPEVDAEGCRSRSEGHP-KRKHGAEMVDVKAEK---- 1336
            +Q      G+E           R   V      SRS   P KRK+  EM D+K EK    
Sbjct: 502  NQHTSKTGGMEIYVSHACYSPVRPQSVSPSSSVSRSGVAPVKRKNDCEMEDLKPEKQACY 561

Query: 1335 -PGKRSSISP----QRLGSVFFGSS-EEIKLECLV-------AGGVDRNSENRSSGGILE 1195
             P +  S+SP     R G     SS E +KL C            V  NSE RSS G LE
Sbjct: 562  SPVRPQSVSPSSSVSRSGVTSLASSWEGVKLGCSTIRDIGSNCKDVASNSEVRSSSGQLE 621

Query: 1194 SERGKVGCDVEKLGET-----DAHRSITLNEHNS---LNACDTSRAWSEVKPDEPVAEPP 1039
            SE+  +G D  +LGET       +R I+++ H+S    N  + +   + ++P E +    
Sbjct: 622  SEKDGLG-DSMQLGETVYQDSPLNRQISMDVHDSPIVRNELEPANHMNGIEPMESMVNTI 680

Query: 1038 PSKELFAPCEVKT-EVTLKIEL-KEDKVEDLALTDVDNAETGKARRLLKQTEMVFGGDEE 865
              +E+ +P E+   E   K E    DK+  L    ++N  + K    +  T      D+E
Sbjct: 681  TKQEMLSPQEIPNFETGEKHETGVNDKIAGLGSNLMENGSSRKLLNWVAGTSQAMEVDQE 740

Query: 864  LLKPCKRTAVSEDDRSVLGSNSSMQNLNSKGSLQGDVHATDSDSLLGNGCPNGNGIYENN 685
            L+KPC +TAV E   SV+ S+S  QNLN     +G+V A D+D +L +GC N + +    
Sbjct: 741  LVKPCCQTAVVEYAESVIRSHSGTQNLN----CEGNVCAVDADVVLESGCLNMSRVLPKG 796

Query: 684  LTEELKPNTALG-MVEAQDGASSESVQKPVLRLSLESTA 571
            L EEL+P TA+G +V +QDGA SES+QKP++RLSL+STA
Sbjct: 797  LPEELEPKTAIGKVVNSQDGARSESLQKPMIRLSLKSTA 835


>ref|XP_011010888.1| PREDICTED: nuclear poly(A) polymerase 4 isoform X2 [Populus
            euphratica]
          Length = 789

 Score =  985 bits (2547), Expect = 0.0
 Identities = 523/790 (66%), Positives = 591/790 (74%), Gaps = 17/790 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            S+AGPTE DL RNAELEKFL+ SGL ESK+++ +REEV+ RIDQIVK WVKQLTRQRGYT
Sbjct: 22   SVAGPTEPDLHRNAELEKFLVDSGLNESKDETIKREEVLGRIDQIVKDWVKQLTRQRGYT 81

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNREEDFFI LHD LAEMEEV+E
Sbjct: 82   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFITLHDKLAEMEEVTE 141

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDIS+GSVLY+VDEQTVRSLNGCR
Sbjct: 142  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISNGSVLYEVDEQTVRSLNGCR 201

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFRTTLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 202  VADQILKLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAIP 261

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEED LGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 262  SMLVSRFFRVYTQWRWPNPVMLCSIEEDALGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 321

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVSTSTLRVM +QF  GNRICEEIELNKA+W++ FEPYLFFEAYKNY           
Sbjct: 322  SYNVSTSTLRVMTEQFQSGNRICEEIELNKAQWSALFEPYLFFEAYKNYLQVDIVAAVAA 381

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY D SK CPHCAFFMGLQRK+GV 
Sbjct: 382  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYIDASKQCPHCAFFMGLQRKEGVT 441

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQE+NMYMFWKPGM+IYVSHVRR+Q+P +VFPDGYKR R SRH 
Sbjct: 442  SQEGQQFDIRGTVDEFRQEINMYMFWKPGMEIYVSHVRRRQLPGFVFPDGYKRSRSSRHV 501

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQ---------RLG 1297
            +Q   +T E  A      +E   KRK+  EM D+K EK    S I PQ         R G
Sbjct: 502  NQHTSKTGEDVARSQSGSAERPVKRKNDCEMEDLKPEKRALNSPIRPQSVSPSSSVSRSG 561

Query: 1296 SVFFGSS-EEIKLECLV------AGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAH 1138
                 SS E +KL C           V  NSE RSS G LESE+  +G D  +LGET  +
Sbjct: 562  VTSLASSCEGVKLGCSTIDIGSNCKDVASNSEVRSSSGQLESEKDGLG-DAMQLGET-VY 619

Query: 1137 RSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCEVKTEVTLKIELKEDKVE 958
            +   LN   S++  D+    +E++P   +    P ++ F   E K E  +      DK+ 
Sbjct: 620  QDSPLNRQISMDVHDSKIVRNELEPANNMNGIEPMEQNFETGE-KHETGV-----NDKIA 673

Query: 957  DLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQNLNS 778
             L    +++  + K    +  T      D+EL+KPC +TAV E   SV+ S+S  QNLN 
Sbjct: 674  GLGSNIMESGSSRKLLNWVAGTSQAVEVDQELVKPCCQTAVVEYADSVIKSHSGTQNLN- 732

Query: 777  KGSLQGDVHATDSDSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESVQKP 601
                +G+V A D+D +L NGC N N + +  L EEL+P TA+G +V +QDGA        
Sbjct: 733  ---CEGNVCAVDADVVLENGCLNMNRVLQKGLPEELEPKTAIGKVVNSQDGA-------- 781

Query: 600  VLRLSLESTA 571
              RLSL+STA
Sbjct: 782  --RLSLKSTA 789


>ref|XP_007012187.1| Poly(A) polymerase 1 isoform 6 [Theobroma cacao]
            gi|508782550|gb|EOY29806.1| Poly(A) polymerase 1 isoform
            6 [Theobroma cacao]
          Length = 751

 Score =  979 bits (2532), Expect = 0.0
 Identities = 501/726 (69%), Positives = 571/726 (78%), Gaps = 19/726 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I++IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADVQRNTELEKFLIESGLYESKEEAVKREEVLGHINEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSY LGVHGPGAD+DTLC+GPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYCLGVHGPGADIDTLCIGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVP++LDISHGSVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPDNLDISHGSVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 207  VADQILKLVPNVEHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAIP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNK++WN+ FEPYLFFEAYKNY           
Sbjct: 327  SYNVSISTLRVMMEQFQCGNRICEEIELNKSQWNALFEPYLFFEAYKNYLQVDIVSAEAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGYKRPR SRH 
Sbjct: 447  GQEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYKRPRSSRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRS---EGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGS 1279
             Q   +T ++  +  RS+S   E   KRKH  E  D K +KP KRSSISPQRL SV   S
Sbjct: 507  GQ---QTGKICEDITRSQSGSVERQIKRKHEDEAFDEKMDKPDKRSSISPQRLESVSPES 563

Query: 1278 S-------------EEIKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAH 1138
            S             + + LE      VD NS  R S G+L+SE+  VG  +++    D  
Sbjct: 564  SASRSGGTSHISDGQMVTLERPTTWDVDSNSVLRQSSGLLDSEKRNVGISIQQARTVD-Q 622

Query: 1137 RSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKED 967
             S+TL+   SL+          V+  E + EP   +E  +PCEV   +   T K  + ++
Sbjct: 623  GSLTLSGQTSLDVVHNLSVVRNVESAEQMGEPFLRQESHSPCEVPDSELRETCKTGVNQE 682

Query: 966  KVEDLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQN 787
            K  D +   +++AETG +RR+L       G D+E++KPC +TAV E   SV GS+S+ QN
Sbjct: 683  KTGDYSSAYMNDAETGSSRRILNWKGGGVGVDQEVVKPCNQTAVVEIAESVFGSSSNAQN 742

Query: 786  LNSKGS 769
            LN + S
Sbjct: 743  LNCEVS 748


>gb|KDP34179.1| hypothetical protein JCGZ_07750 [Jatropha curcas]
          Length = 745

 Score =  976 bits (2523), Expect = 0.0
 Identities = 503/723 (69%), Positives = 566/723 (78%), Gaps = 16/723 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGPTEADL RNAELEKFL+ SGLYESKE + +REEV+ RIDQIVK WVKQLT QRGYT
Sbjct: 28   SLAGPTEADLHRNAELEKFLVDSGLYESKEKTMKREEVLGRIDQIVKGWVKQLTHQRGYT 87

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNREEDFFI+LHDIL+EM+EV+E
Sbjct: 88   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFIILHDILSEMDEVTE 147

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYD+DEQTVRSLNGCR
Sbjct: 148  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDIDEQTVRSLNGCR 207

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN+EHFRTTLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNAI 
Sbjct: 208  VADQILKLVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARLCQLYPNAIP 267

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKN RDR HHMPIITPAYPCMNS
Sbjct: 268  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNHRDRFHHMPIITPAYPCMNS 327

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF +GN+ICEEIELNKA+W++ FEPYLFFEAYKNY           
Sbjct: 328  SYNVSISTLRVMMEQFQYGNKICEEIELNKAQWSALFEPYLFFEAYKNYLQIDIIAADAD 387

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT G+LQCHPYPNEY DTSK C HCAFFMGLQR++GV 
Sbjct: 388  DLLAWKGWVESRLRQLTLKIERDTNGVLQCHPYPNEYVDTSKQCSHCAFFMGLQRREGVT 447

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
             QEGQQFDIRGTVDEFRQE+NMYMFWKPGMDIYVSHVRR+Q+PA+VFPDGYKR RPSRH 
Sbjct: 448  GQEGQQFDIRGTVDEFRQEINMYMFWKPGMDIYVSHVRRRQLPAFVFPDGYKRSRPSRHL 507

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSV------- 1291
            +Q V ++ +  A       EGH KRK+  E VD++ +KP KR+SISPQRL SV       
Sbjct: 508  NQQVSKSNDGAATSRIGSPEGHLKRKNDHEEVDLRPDKPEKRASISPQRLQSVSPESSTS 567

Query: 1290 ------FFGSSEEIKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAHRSI 1129
                      SE IKL C  A  VD NSE RS  G   +E   +G +V ++GET     +
Sbjct: 568  RCGGTSLANFSERIKLGCSTAADVDNNSEARSCRGPSSNENCILG-NVMQVGET----VM 622

Query: 1128 TLNEHNSLNA-CDTSRAWSEVKPDEPVAEPPPSKELFAPCEVKTEVTLKIE--LKEDKVE 958
             L +   +    + +   +EV+P +   E    +EL  P E+ +    KI     E++  
Sbjct: 623  GLYDPAVVRGDVEPAECRNEVEPTDLAVETILKQELLDPYEISSSEIRKIHNVTNENRNG 682

Query: 957  DLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQNLNS 778
            DL    ++N   G   RLLK    V   +EEL++PC +TAV E   SV+ SN+S QNLN 
Sbjct: 683  DLISASLEN---GSPNRLLKWGGEVIEVEEELVRPCNQTAVVELAESVICSNTSAQNLNC 739

Query: 777  KGS 769
            + S
Sbjct: 740  EVS 742


>ref|XP_012435059.1| PREDICTED: nuclear poly(A) polymerase 4 isoform X3 [Gossypium
            raimondii] gi|823120692|ref|XP_012435128.1| PREDICTED:
            nuclear poly(A) polymerase 4 isoform X3 [Gossypium
            raimondii] gi|763739965|gb|KJB07464.1| hypothetical
            protein B456_001G025900 [Gossypium raimondii]
          Length = 766

 Score =  975 bits (2521), Expect = 0.0
 Identities = 512/777 (65%), Positives = 588/777 (75%), Gaps = 4/777 (0%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I +IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADIQRNTELEKFLIESGLYESKEEAAKREEVLGHISEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVM+FKFQGISIDLLYASISLLVVP+DLDIS  SVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMRFKFQGISIDLLYASISLLVVPDDLDISRESVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN++HFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNA+ 
Sbjct: 207  VADQILKLVPNVKHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAVP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNKA+W++ FEP+LFFEAYKNY           
Sbjct: 327  SYNVSLSTLRVMMEQFQFGNRICEEIELNKAQWSALFEPHLFFEAYKNYLQVDIVSADAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
              EGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGY+RPR  RH 
Sbjct: 447  GLEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYRRPRSLRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGSSEE 1270
            SQ   +T E         +E   KRK   E VD K  KP KR+SISP R+ SV    S +
Sbjct: 507  SQQTGKTCEDVTTSRSGSAERQIKRKRDDETVDEKLNKPEKRASISPLRMESV----SPD 562

Query: 1269 IKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAHRSITLNEHNSLNACDT 1090
            I     V       S N +   +    RG V  D             +L    SL+  D+
Sbjct: 563  IITSKSVG-----TSHNSNGQAVKVEHRGTVDLD-------------SLRGQTSLDIDDS 604

Query: 1089 SRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKEDKVEDLALTDVDNAETG 919
            S   S V+  E +   P  +EL +PCEV   +T  T K  L ++K  D     +++ E G
Sbjct: 605  SVVRS-VESAEQIG-LPFRQELLSPCEVSDFETRETCKAGLNQEKTADSTSAFINDPEIG 662

Query: 918  KARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQNLNSKGSLQGDVHATDS 739
             +RR+L    +    D+E++K C +TA  E   SV GS+S+ QNLN KGS+ G     D 
Sbjct: 663  SSRRILNWKGVGAEVDQEVVKACNQTAAVEIAESVFGSSSNAQNLNCKGSVCG----ADL 718

Query: 738  DSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESVQKPVLRLSLESTA 571
            DSLL  G  N + +++N+L+EELKP+ ++G +V +QDGAS         RLSL+STA
Sbjct: 719  DSLLEKGHLNASAVFQNSLSEELKPSISVGKVVNSQDGAS---------RLSLKSTA 766


>ref|XP_012435194.1| PREDICTED: nuclear poly(A) polymerase 4 isoform X4 [Gossypium
            raimondii] gi|823120696|ref|XP_012435268.1| PREDICTED:
            nuclear poly(A) polymerase 4 isoform X4 [Gossypium
            raimondii] gi|763739959|gb|KJB07458.1| hypothetical
            protein B456_001G025900 [Gossypium raimondii]
            gi|763739966|gb|KJB07465.1| hypothetical protein
            B456_001G025900 [Gossypium raimondii]
          Length = 765

 Score =  973 bits (2516), Expect = 0.0
 Identities = 511/777 (65%), Positives = 587/777 (75%), Gaps = 4/777 (0%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I +IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADIQRNTELEKFLIESGLYESKEEAAKREEVLGHISEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVM+FKFQGISIDLLYASISLLVVP+DLDIS  SVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMRFKFQGISIDLLYASISLLVVPDDLDISRESVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN++HFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNA+ 
Sbjct: 207  VADQILKLVPNVKHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAVP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNKA+W++ FEP+LFFEAYKNY           
Sbjct: 327  SYNVSLSTLRVMMEQFQFGNRICEEIELNKAQWSALFEPHLFFEAYKNYLQVDIVSADAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
              EGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGY+RPR  RH 
Sbjct: 447  GLEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYRRPRSLRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGSSEE 1270
            SQ   +T E         +E   KRK   E VD K  KP KR+SISP R+ SV    S +
Sbjct: 507  SQQTGKTCEDVTTSRSGSAERQIKRKRDDETVDEKLNKPEKRASISPLRMESV----SPD 562

Query: 1269 IKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAHRSITLNEHNSLNACDT 1090
            I     V       S N +   +    RG V  D             +L    SL+  D+
Sbjct: 563  IITSKSVG-----TSHNSNGQAVKVEHRGTVDLD-------------SLRGQTSLDIDDS 604

Query: 1089 SRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKEDKVEDLALTDVDNAETG 919
            S   S V+  E +   P  +EL +PCEV   +T  T K  L ++K  D     +++ E G
Sbjct: 605  SVVRS-VESAEQIG-LPFRQELLSPCEVSDFETRETCKAGLNQEKTADSTSAFINDPEIG 662

Query: 918  KARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQNLNSKGSLQGDVHATDS 739
             +RR+L    +    D+E++K C +TA  E   SV GS+S+ QNLN KGS+ G     D 
Sbjct: 663  SSRRILNWKGVGAEVDQEVVKACNQTAAVEIAESVFGSSSNAQNLNCKGSVCG----ADL 718

Query: 738  DSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSESVQKPVLRLSLESTA 571
            DSLL  G  N + +++N+L+EELKP+ ++G +V +QDGA          RLSL+STA
Sbjct: 719  DSLLEKGHLNASAVFQNSLSEELKPSISVGKVVNSQDGA----------RLSLKSTA 765


>ref|XP_009587895.1| PREDICTED: poly(A) polymerase type 3-like isoform X2 [Nicotiana
            tomentosiformis]
          Length = 782

 Score =  971 bits (2509), Expect = 0.0
 Identities = 516/794 (64%), Positives = 589/794 (74%), Gaps = 21/794 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            S AGPT+ADLQRNA LEKFL  SGLYES+E+++RREEV++++DQIVK WVK+LT QRGYT
Sbjct: 29   SSAGPTDADLQRNAALEKFLKDSGLYESEEETERREEVLRQLDQIVKSWVKELTHQRGYT 88

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVEDANA+IFTFGSYRLGVHGPGAD+DTLCVGPSYVNR+EDFFI+LHDILAE EEVSE
Sbjct: 89   DQMVEDANAIIFTFGSYRLGVHGPGADIDTLCVGPSYVNRDEDFFIILHDILAEKEEVSE 148

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGIS+DLLYASISLLVVPEDLDIS  SVLY VDE TVRSLNGCR
Sbjct: 149  LQPVPDAHVPVMKFKFQGISVDLLYASISLLVVPEDLDISDRSVLYSVDEPTVRSLNGCR 208

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN EHFRTTLRCLKFWAK RGVYSNVTGFLGGVNWALLVARICQ YPNA+ 
Sbjct: 209  VADQILKLVPNAEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARICQFYPNAVP 268

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLC IEEDELGF VWDPRKNP+DRTHHMPIITPAYPCMNS
Sbjct: 269  SMLVSRFFRVYTQWRWPNPVMLCPIEEDELGFLVWDPRKNPKDRTHHMPIITPAYPCMNS 328

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMMDQF  GN+ICEEIELNKA+W + FE YLFFE YKNY           
Sbjct: 329  SYNVSPSTLRVMMDQFQFGNKICEEIELNKAQWAALFEHYLFFEVYKNYLQVDIVAADSD 388

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AWRGWVESRLRQLTLKIERDT GMLQCHPYPNE+ D SKPCPHCAFFMGLQRKQGVK
Sbjct: 389  DLLAWRGWVESRLRQLTLKIERDTNGMLQCHPYPNEFVDLSKPCPHCAFFMGLQRKQGVK 448

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
            VQEGQQFDIRGTVDEF+Q+V+MY +W+PGMDIYVSHVRRKQIP +VFPDGYKRPR SR+T
Sbjct: 449  VQEGQQFDIRGTVDEFKQDVSMYTYWRPGMDIYVSHVRRKQIPPFVFPDGYKRPRQSRNT 508

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSV--FFGSS 1276
            S     TPE  A GC S  E HPKRK  AE V V   K GKR+SISPQR+GSV    GSS
Sbjct: 509  SHS---TPEKVARGCMSPEERHPKRKQEAETVGVNWGKLGKRASISPQRIGSVSPLGGSS 565

Query: 1275 ---------------EEIKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDA 1141
                           +E++  CL     D NS +R S          + C  + L     
Sbjct: 566  RSDGSSQIIISDESQKELESSCL-RDSSDDNSLHRCSRNDASLSDSSI-CAPDSL----- 618

Query: 1140 HRSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCE---VKTEVTLKIELKE 970
              + T++ +++L+         EV  D    +  PS+E+ +P +    +   T ++   +
Sbjct: 619  --NYTMSRNSTLSGLP-----REVDLDSSNTKSFPSQEMRSPIQDICTRNVQTFQVLQND 671

Query: 969  DKVEDLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQ 790
            +K E L     DN  TG                 +L +P  +T  +E    V  SNS++Q
Sbjct: 672  EKGEILGSLHQDN--TG-----------------QLNEPGVQTGCAERGERVHVSNSNIQ 712

Query: 789  NLNSKGSLQGDVHATDSDSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSES 613
            NL    + +GD+   D  S LG+GC +GNG+  N L E+ +PN +L   +E+QDGASSE+
Sbjct: 713  NL----TCEGDISLADRISQLGDGCLSGNGVLGNGLAEKSQPNHSLSRAMESQDGASSEA 768

Query: 612  VQKPVLRLSLESTA 571
            VQ+P +RLSLESTA
Sbjct: 769  VQEPAIRLSLESTA 782


>ref|XP_009779470.1| PREDICTED: poly(A) polymerase type 3-like isoform X3 [Nicotiana
            sylvestris]
          Length = 780

 Score =  965 bits (2494), Expect = 0.0
 Identities = 519/795 (65%), Positives = 581/795 (73%), Gaps = 22/795 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            S AGPT+ADLQRNA LEKFL  SGLYES+E+++RREEV++++DQIVK WVKQLT QRGYT
Sbjct: 29   SSAGPTDADLQRNASLEKFLKDSGLYESEEETERREEVLRQLDQIVKSWVKQLTHQRGYT 88

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVEDANA+IFTFGSYRLGVHGPGAD+DTLCVGPSYVNR+EDFFI+LHDILAE EEVSE
Sbjct: 89   DQMVEDANAIIFTFGSYRLGVHGPGADIDTLCVGPSYVNRDEDFFIILHDILAEKEEVSE 148

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDIS  SVLY VDE TVRSLNGCR
Sbjct: 149  LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISDQSVLYSVDEPTVRSLNGCR 208

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN EHFRTTLRCLKFWAK RGVYSNVTGFLGGVNWALLVARICQ YPNAI 
Sbjct: 209  VADQILKLVPNAEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARICQFYPNAIP 268

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLC IEEDELGF VWDPRKNP+DRTHHMPIITPAYPCMNS
Sbjct: 269  SMLVSRFFRVYTQWRWPNPVMLCPIEEDELGFLVWDPRKNPKDRTHHMPIITPAYPCMNS 328

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMMDQF  GN+ICEEIELNKA+W + F+ YLFFE YKNY           
Sbjct: 329  SYNVSPSTLRVMMDQFQFGNKICEEIELNKAQWAALFKHYLFFEVYKNYLQVDIVAADND 388

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AWRGWVESRLRQLTLKIERDT GMLQCHPYPNE+ D SKPCPHCAFFMGLQRKQGVK
Sbjct: 389  DLLAWRGWVESRLRQLTLKIERDTNGMLQCHPYPNEFVDLSKPCPHCAFFMGLQRKQGVK 448

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
            VQEGQQFDIRGTVDEF+Q+V+MY +W+PGMDIYVSHVRRKQIP +VFPDGYKRPR SR+ 
Sbjct: 449  VQEGQQFDIRGTVDEFKQDVSMYTYWRPGMDIYVSHVRRKQIPPFVFPDGYKRPRQSRNA 508

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQ-RLGSV--FFGS 1279
            S     TPE  A GC S  E HPKRK  AE V V   K GKR+SISPQ R+GSV    GS
Sbjct: 509  SHS---TPEKVARGCMSPEERHPKRKQEAETVGVNWGKLGKRASISPQRRIGSVSPLGGS 565

Query: 1278 S---------------EEIKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETD 1144
            S               +E++  CL+    D NS +R S          V C  + L  T 
Sbjct: 566  SRSDGSSQIIISDESQKELESSCLLDTS-DDNSLHRCSRNDASLSDSSV-CAPDSLNYTI 623

Query: 1143 AHRSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCE---VKTEVTLKIELK 973
            + +SI             S    EV  D    +  PS+E+  P +    +   T ++   
Sbjct: 624  SRKSI------------LSGLPREVDLDSSNTKSFPSQEMLRPFQDICTRNVQTFQVLQN 671

Query: 972  EDKVEDLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSM 793
            ++K E L     DN  TG+   L                    T  +E    V  SNS++
Sbjct: 672  DEKGETLGSFHQDN--TGQLNEL--------------------TGCAERGERVAVSNSNI 709

Query: 792  QNLNSKGSLQGDVHATDSDSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSE 616
            QNL    + +GD    D  S LG+GC +GNG+  N L E+ +PN +L   +E+QDGASSE
Sbjct: 710  QNL----TCEGDTSLADRISQLGDGCLSGNGVLGNGLAEKSQPNHSLARAMESQDGASSE 765

Query: 615  SVQKPVLRLSLESTA 571
            +VQ+P +RLSLESTA
Sbjct: 766  AVQEPAIRLSLESTA 780


>ref|XP_009587889.1| PREDICTED: poly(A) polymerase PAPalpha-like isoform X1 [Nicotiana
            tomentosiformis]
          Length = 807

 Score =  959 bits (2479), Expect = 0.0
 Identities = 509/787 (64%), Positives = 582/787 (73%), Gaps = 21/787 (2%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            S AGPT+ADLQRNA LEKFL  SGLYES+E+++RREEV++++DQIVK WVK+LT QRGYT
Sbjct: 29   SSAGPTDADLQRNAALEKFLKDSGLYESEEETERREEVLRQLDQIVKSWVKELTHQRGYT 88

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVEDANA+IFTFGSYRLGVHGPGAD+DTLCVGPSYVNR+EDFFI+LHDILAE EEVSE
Sbjct: 89   DQMVEDANAIIFTFGSYRLGVHGPGADIDTLCVGPSYVNRDEDFFIILHDILAEKEEVSE 148

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVMKFKFQGIS+DLLYASISLLVVPEDLDIS  SVLY VDE TVRSLNGCR
Sbjct: 149  LQPVPDAHVPVMKFKFQGISVDLLYASISLLVVPEDLDISDRSVLYSVDEPTVRSLNGCR 208

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN EHFRTTLRCLKFWAK RGVYSNVTGFLGGVNWALLVARICQ YPNA+ 
Sbjct: 209  VADQILKLVPNAEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARICQFYPNAVP 268

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLC IEEDELGF VWDPRKNP+DRTHHMPIITPAYPCMNS
Sbjct: 269  SMLVSRFFRVYTQWRWPNPVMLCPIEEDELGFLVWDPRKNPKDRTHHMPIITPAYPCMNS 328

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMMDQF  GN+ICEEIELNKA+W + FE YLFFE YKNY           
Sbjct: 329  SYNVSPSTLRVMMDQFQFGNKICEEIELNKAQWAALFEHYLFFEVYKNYLQVDIVAADSD 388

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AWRGWVESRLRQLTLKIERDT GMLQCHPYPNE+ D SKPCPHCAFFMGLQRKQGVK
Sbjct: 389  DLLAWRGWVESRLRQLTLKIERDTNGMLQCHPYPNEFVDLSKPCPHCAFFMGLQRKQGVK 448

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
            VQEGQQFDIRGTVDEF+Q+V+MY +W+PGMDIYVSHVRRKQIP +VFPDGYKRPR SR+T
Sbjct: 449  VQEGQQFDIRGTVDEFKQDVSMYTYWRPGMDIYVSHVRRKQIPPFVFPDGYKRPRQSRNT 508

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSV--FFGSS 1276
            S     TPE  A GC S  E HPKRK  AE V V   K GKR+SISPQR+GSV    GSS
Sbjct: 509  SHS---TPEKVARGCMSPEERHPKRKQEAETVGVNWGKLGKRASISPQRIGSVSPLGGSS 565

Query: 1275 ---------------EEIKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDA 1141
                           +E++  CL     D NS +R S          + C  + L     
Sbjct: 566  RSDGSSQIIISDESQKELESSCL-RDSSDDNSLHRCSRNDASLSDSSI-CAPDSL----- 618

Query: 1140 HRSITLNEHNSLNACDTSRAWSEVKPDEPVAEPPPSKELFAPCE---VKTEVTLKIELKE 970
              + T++ +++L+         EV  D    +  PS+E+ +P +    +   T ++   +
Sbjct: 619  --NYTMSRNSTLSGLP-----REVDLDSSNTKSFPSQEMRSPIQDICTRNVQTFQVLQND 671

Query: 969  DKVEDLALTDVDNAETGKARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQ 790
            +K E L     DN  TG                 +L +P  +T  +E    V  SNS++Q
Sbjct: 672  EKGEILGSLHQDN--TG-----------------QLNEPGVQTGCAERGERVHVSNSNIQ 712

Query: 789  NLNSKGSLQGDVHATDSDSLLGNGCPNGNGIYENNLTEELKPNTALG-MVEAQDGASSES 613
            NL    + +GD+   D  S LG+GC +GNG+  N L E+ +PN +L   +E+QDGASSE+
Sbjct: 713  NL----TCEGDISLADRISQLGDGCLSGNGVLGNGLAEKSQPNHSLSRAMESQDGASSEA 768

Query: 612  VQKPVLR 592
            VQ+P +R
Sbjct: 769  VQEPAIR 775


>gb|KJB07462.1| hypothetical protein B456_001G025900 [Gossypium raimondii]
          Length = 760

 Score =  958 bits (2477), Expect = 0.0
 Identities = 500/751 (66%), Positives = 570/751 (75%), Gaps = 4/751 (0%)
 Frame = -1

Query: 2889 SLAGPTEADLQRNAELEKFLIKSGLYESKEDSKRREEVIQRIDQIVKYWVKQLTRQRGYT 2710
            SLAGP+EAD+QRN ELEKFLI+SGLYESKE++ +REEV+  I +IVK WVKQLTRQRGYT
Sbjct: 27   SLAGPSEADIQRNTELEKFLIESGLYESKEEAAKREEVLGHISEIVKSWVKQLTRQRGYT 86

Query: 2709 DQMVEDANAVIFTFGSYRLGVHGPGADMDTLCVGPSYVNREEDFFIVLHDILAEMEEVSE 2530
            DQMVE+ANAVIFTFGSYRLGVHGPGAD+DTLCVGPSYVNREEDFFI+LHDILAEMEEV+E
Sbjct: 87   DQMVEEANAVIFTFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFIILHDILAEMEEVTE 146

Query: 2529 LQPVPDAHVPVMKFKFQGISIDLLYASISLLVVPEDLDISHGSVLYDVDEQTVRSLNGCR 2350
            LQPVPDAHVPVM+FKFQGISIDLLYASISLLVVP+DLDIS  SVL++VDEQTVRSLNGCR
Sbjct: 147  LQPVPDAHVPVMRFKFQGISIDLLYASISLLVVPDDLDISRESVLHNVDEQTVRSLNGCR 206

Query: 2349 VADQILKLVPNIEHFRTTLRCLKFWAKTRGVYSNVTGFLGGVNWALLVARICQLYPNAIS 2170
            VADQILKLVPN++HFR TLRCLKFWAK RGVYSNVTGFLGGVNWALLVAR+CQLYPNA+ 
Sbjct: 207  VADQILKLVPNVKHFRMTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAVP 266

Query: 2169 SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRTHHMPIITPAYPCMNS 1990
            SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDR HHMPIITPAYPCMNS
Sbjct: 267  SMLVSRFFRVYTQWRWPNPVMLCSIEEDELGFPVWDPRKNPRDRFHHMPIITPAYPCMNS 326

Query: 1989 SYNVSTSTLRVMMDQFHHGNRICEEIELNKAEWNSFFEPYLFFEAYKNYXXXXXXXXXXX 1810
            SYNVS STLRVMM+QF  GNRICEEIELNKA+W++ FEP+LFFEAYKNY           
Sbjct: 327  SYNVSLSTLRVMMEQFQFGNRICEEIELNKAQWSALFEPHLFFEAYKNYLQVDIVSADAD 386

Query: 1809 XXXAWRGWVESRLRQLTLKIERDTYGMLQCHPYPNEYTDTSKPCPHCAFFMGLQRKQGVK 1630
               AW+GWVESRLRQLTLKIERDT GMLQCHPYPNEY DTSK  PHCAFFMGLQRK+GV 
Sbjct: 387  DLLAWKGWVESRLRQLTLKIERDTNGMLQCHPYPNEYVDTSKQFPHCAFFMGLQRKEGVS 446

Query: 1629 VQEGQQFDIRGTVDEFRQEVNMYMFWKPGMDIYVSHVRRKQIPAYVFPDGYKRPRPSRHT 1450
              EGQQFDIRGTVDEFRQE++MYM+WKPGMDIYVSHVRR+Q+PA+VFPDGY+RPR  RH 
Sbjct: 447  GLEGQQFDIRGTVDEFRQEISMYMYWKPGMDIYVSHVRRRQLPAFVFPDGYRRPRSLRHP 506

Query: 1449 SQGVERTPEVDAEGCRSRSEGHPKRKHGAEMVDVKAEKPGKRSSISPQRLGSVFFGSSEE 1270
            SQ   +T E         +E   KRK   E VD K  KP KR+SISP R+ SV    S +
Sbjct: 507  SQQTGKTCEDVTTSRSGSAERQIKRKRDDETVDEKLNKPEKRASISPLRMESV----SPD 562

Query: 1269 IKLECLVAGGVDRNSENRSSGGILESERGKVGCDVEKLGETDAHRSITLNEHNSLNACDT 1090
            I     V       S N +   +    RG V  D             +L    SL+  D+
Sbjct: 563  IITSKSVG-----TSHNSNGQAVKVEHRGTVDLD-------------SLRGQTSLDIDDS 604

Query: 1089 SRAWSEVKPDEPVAEPPPSKELFAPCEV---KTEVTLKIELKEDKVEDLALTDVDNAETG 919
            S   S V+  E +   P  +EL +PCEV   +T  T K  L ++K  D     +++ E G
Sbjct: 605  SVVRS-VESAEQIG-LPFRQELLSPCEVSDFETRETCKAGLNQEKTADSTSAFINDPEIG 662

Query: 918  KARRLLKQTEMVFGGDEELLKPCKRTAVSEDDRSVLGSNSSMQNLNSKGSLQGDVHATDS 739
             +RR+L    +    D+E++K C +TA  E   SV GS+S+ QNLN KGS+ G     D 
Sbjct: 663  SSRRILNWKGVGAEVDQEVVKACNQTAAVEIAESVFGSSSNAQNLNCKGSVCG----ADL 718

Query: 738  DSLLGNGCPNGNGIYENNLTEELKP-NTALG 649
            DSLL  G  N + +++N+L+EELK  N  LG
Sbjct: 719  DSLLEKGHLNASAVFQNSLSEELKVLNLTLG 749

