BLASTX nr result

ID: Zanthoxylum22_contig00008100 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00008100
         (2000 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006493032.1| PREDICTED: poly(A) polymerase-like isoform X...  1038   0.0  
ref|XP_006493030.1| PREDICTED: poly(A) polymerase-like isoform X...  1038   0.0  
ref|XP_006428724.1| hypothetical protein CICLE_v10011139mg [Citr...  1038   0.0  
ref|XP_006428723.1| hypothetical protein CICLE_v10011139mg [Citr...  1038   0.0  
gb|KDO42643.1| hypothetical protein CISIN_1g004767mg [Citrus sin...   996   0.0  
gb|KDO42639.1| hypothetical protein CISIN_1g004767mg [Citrus sin...   996   0.0  
ref|XP_006428725.1| hypothetical protein CICLE_v10011139mg [Citr...   996   0.0  
gb|KDO42644.1| hypothetical protein CISIN_1g004767mg [Citrus sin...   962   0.0  
ref|XP_006493031.1| PREDICTED: poly(A) polymerase-like isoform X...   960   0.0  
ref|XP_006428722.1| hypothetical protein CICLE_v10011139mg [Citr...   960   0.0  
gb|KDO42642.1| hypothetical protein CISIN_1g004767mg [Citrus sin...   919   0.0  
gb|KDO42640.1| hypothetical protein CISIN_1g004767mg [Citrus sin...   907   0.0  
ref|XP_007036647.1| Poly(A) polymerase 1 isoform 1 [Theobroma ca...   881   0.0  
ref|XP_002322074.2| hypothetical protein POPTR_0015s04100g [Popu...   879   0.0  
ref|XP_011009627.1| PREDICTED: nuclear poly(A) polymerase 1 [Pop...   877   0.0  
ref|XP_012486424.1| PREDICTED: nuclear poly(A) polymerase 1 isof...   860   0.0  
ref|XP_012486421.1| PREDICTED: nuclear poly(A) polymerase 1 isof...   860   0.0  
gb|KJB37195.1| hypothetical protein B456_006G193600 [Gossypium r...   850   0.0  
ref|XP_007036649.1| Poly(A) polymerase 1 isoform 3 [Theobroma ca...   850   0.0  
ref|XP_010110105.1| Poly(A) polymerase [Morus notabilis] gi|5879...   845   0.0  

>ref|XP_006493032.1| PREDICTED: poly(A) polymerase-like isoform X3 [Citrus sinensis]
          Length = 712

 Score = 1038 bits (2683), Expect = 0.0
 Identities = 517/640 (80%), Positives = 558/640 (87%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 74   PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 133

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 134  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 193

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 194  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 253

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 254  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 313

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 314  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHP 373

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDFSD+SKP +CSYFMGLQRKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 374  HPGDFSDKSKPLYCSYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 433

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 434  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 493

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 494  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 553

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 554  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 613

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGGX 201
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG 
Sbjct: 614  PLELDQLEVDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGGL 673

Query: 200  XXXXXXXLMAPSSNVISSAPVPQRKPIIRLSFTSLNNKAT 81
                   L AP SNVI SAPVPQRKP+IRL+FTSL NKAT
Sbjct: 674  GELEPVELTAPFSNVIPSAPVPQRKPLIRLNFTSL-NKAT 712


>ref|XP_006493030.1| PREDICTED: poly(A) polymerase-like isoform X1 [Citrus sinensis]
          Length = 748

 Score = 1038 bits (2683), Expect = 0.0
 Identities = 517/640 (80%), Positives = 558/640 (87%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 110  PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 169

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 170  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 229

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 230  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 289

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 290  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 349

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 350  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHP 409

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDFSD+SKP +CSYFMGLQRKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 410  HPGDFSDKSKPLYCSYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 469

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 470  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 529

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 530  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 589

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 590  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 649

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGGX 201
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG 
Sbjct: 650  PLELDQLEVDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGGL 709

Query: 200  XXXXXXXLMAPSSNVISSAPVPQRKPIIRLSFTSLNNKAT 81
                   L AP SNVI SAPVPQRKP+IRL+FTSL NKAT
Sbjct: 710  GELEPVELTAPFSNVIPSAPVPQRKPLIRLNFTSL-NKAT 748


>ref|XP_006428724.1| hypothetical protein CICLE_v10011139mg [Citrus clementina]
            gi|557530781|gb|ESR41964.1| hypothetical protein
            CICLE_v10011139mg [Citrus clementina]
          Length = 730

 Score = 1038 bits (2683), Expect = 0.0
 Identities = 517/640 (80%), Positives = 558/640 (87%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 92   PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 151

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 152  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 211

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 212  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 271

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 272  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 331

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 332  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHP 391

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDFSD+SKP +CSYFMGLQRKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 392  HPGDFSDKSKPLYCSYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 451

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 452  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 511

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 512  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 571

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 572  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 631

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGGX 201
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG 
Sbjct: 632  PLELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGGL 691

Query: 200  XXXXXXXLMAPSSNVISSAPVPQRKPIIRLSFTSLNNKAT 81
                   L AP SNVI SAPVPQRKP+IRL+FTSL NKAT
Sbjct: 692  GELEPVELTAPFSNVIPSAPVPQRKPLIRLNFTSL-NKAT 730


>ref|XP_006428723.1| hypothetical protein CICLE_v10011139mg [Citrus clementina]
            gi|557530780|gb|ESR41963.1| hypothetical protein
            CICLE_v10011139mg [Citrus clementina]
          Length = 748

 Score = 1038 bits (2683), Expect = 0.0
 Identities = 517/640 (80%), Positives = 558/640 (87%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 110  PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 169

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 170  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 229

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 230  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 289

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 290  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 349

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 350  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHP 409

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDFSD+SKP +CSYFMGLQRKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 410  HPGDFSDKSKPLYCSYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 469

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 470  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 529

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 530  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 589

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 590  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 649

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGGX 201
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG 
Sbjct: 650  PLELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGGL 709

Query: 200  XXXXXXXLMAPSSNVISSAPVPQRKPIIRLSFTSLNNKAT 81
                   L AP SNVI SAPVPQRKP+IRL+FTSL NKAT
Sbjct: 710  GELEPVELTAPFSNVIPSAPVPQRKPLIRLNFTSL-NKAT 748


>gb|KDO42643.1| hypothetical protein CISIN_1g004767mg [Citrus sinensis]
          Length = 655

 Score =  996 bits (2576), Expect = 0.0
 Identities = 490/599 (81%), Positives = 529/599 (88%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 33   PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 92

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 93   EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 152

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 153  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 212

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 213  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 272

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 273  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHP 332

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDFSD+SKP +CSYFMGLQRKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 333  HPGDFSDKSKPLYCSYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 392

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 393  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 452

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 453  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 512

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 513  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 572

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGG 204
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG
Sbjct: 573  PLELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGG 631


>gb|KDO42639.1| hypothetical protein CISIN_1g004767mg [Citrus sinensis]
          Length = 696

 Score =  996 bits (2576), Expect = 0.0
 Identities = 490/599 (81%), Positives = 529/599 (88%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 74   PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 133

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 134  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 193

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 194  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 253

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 254  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 313

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 314  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHP 373

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDFSD+SKP +CSYFMGLQRKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 374  HPGDFSDKSKPLYCSYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 433

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 434  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 493

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 494  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 553

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 554  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 613

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGG 204
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG
Sbjct: 614  PLELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGG 672


>ref|XP_006428725.1| hypothetical protein CICLE_v10011139mg [Citrus clementina]
            gi|557530782|gb|ESR41965.1| hypothetical protein
            CICLE_v10011139mg [Citrus clementina]
            gi|641823208|gb|KDO42641.1| hypothetical protein
            CISIN_1g004767mg [Citrus sinensis]
          Length = 732

 Score =  996 bits (2576), Expect = 0.0
 Identities = 490/599 (81%), Positives = 529/599 (88%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 110  PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 169

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 170  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 229

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 230  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 289

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 290  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 349

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 350  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHP 409

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDFSD+SKP +CSYFMGLQRKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 410  HPGDFSDKSKPLYCSYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 469

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 470  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 529

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 530  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 589

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 590  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 649

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGG 204
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG
Sbjct: 650  PLELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGG 708


>gb|KDO42644.1| hypothetical protein CISIN_1g004767mg [Citrus sinensis]
          Length = 607

 Score =  962 bits (2486), Expect = 0.0
 Identities = 474/583 (81%), Positives = 513/583 (87%)
 Frame = -1

Query: 1952 MLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIPEDLDISQDSILQSADE 1773
            ML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIPEDLDISQDSILQ+ADE
Sbjct: 1    MLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIPEDLDISQDSILQNADE 60

Query: 1772 QTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSNVAGFLGGINWALLVAR 1593
            QTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSNVAGFLGGINWALLVAR
Sbjct: 61   QTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVAR 120

Query: 1592 ICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQVWDPRRNPKDRYHLMPI 1413
            ICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQVWDPRRNPKD+YHLMPI
Sbjct: 121  ICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQVWDPRRNPKDKYHLMPI 180

Query: 1412 ITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVDWDTLFEPFTFSEAYKN 1233
            ITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VDWDTLFEPFTF EAYKN
Sbjct: 181  ITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVDWDTLFEPFTFFEAYKN 240

Query: 1232 YLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFSDESKPFHCSY 1053
            YL+IDISAENADDLR WKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFSD+SKP +CSY
Sbjct: 241  YLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFSDKSKPLYCSY 300

Query: 1052 FMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEISVSHVKRRNLPNFIFPG 873
            FMGLQRKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+ISV+HV RRNLPNF+FPG
Sbjct: 301  FMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQISVAHVTRRNLPNFVFPG 360

Query: 872  GVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVDTHLRNAKCYAAMPSSS 693
            GVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VDTHLRNAKC+A MPSSS
Sbjct: 361  GVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVDTHLRNAKCHATMPSSS 420

Query: 692  GEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPDSLRSSRNLAEVSSPNG 513
            GE  EG          SINL+ E+MDAN L  SNRE+VEN L DS+R SRN  EVSS NG
Sbjct: 421  GEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTDSIRGSRNSVEVSSHNG 480

Query: 512  DIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAFPXXXXXXXXXXXLKNQ 333
             +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AFP           LKNQ
Sbjct: 481  KVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAFPLELDQLEDDLELKNQ 540

Query: 332  AKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGG 204
            AK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG
Sbjct: 541  AKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGG 583


>ref|XP_006493031.1| PREDICTED: poly(A) polymerase-like isoform X2 [Citrus sinensis]
          Length = 732

 Score =  960 bits (2482), Expect = 0.0
 Identities = 488/640 (76%), Positives = 534/640 (83%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 110  PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 169

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 170  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 229

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 230  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 289

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 290  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 349

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLK+   T +      
Sbjct: 350  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKVNATTLH------ 403

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
                       +  ++F  ++RKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 404  ----------AWLFAFFTSIKRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 453

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 454  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 513

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 514  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 573

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 574  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 633

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGGX 201
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG 
Sbjct: 634  PLELDQLEVDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGGL 693

Query: 200  XXXXXXXLMAPSSNVISSAPVPQRKPIIRLSFTSLNNKAT 81
                   L AP SNVI SAPVPQRKP+IRL+FTSL NKAT
Sbjct: 694  GELEPVELTAPFSNVIPSAPVPQRKPLIRLNFTSL-NKAT 732


>ref|XP_006428722.1| hypothetical protein CICLE_v10011139mg [Citrus clementina]
            gi|557530779|gb|ESR41962.1| hypothetical protein
            CICLE_v10011139mg [Citrus clementina]
          Length = 732

 Score =  960 bits (2482), Expect = 0.0
 Identities = 488/640 (76%), Positives = 534/640 (83%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 110  PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 169

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 170  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 229

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 230  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 289

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 290  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 349

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLK+   T +      
Sbjct: 350  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKVNATTLH------ 403

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
                       +  ++F  ++RKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 404  ----------AWLFAFFTSIKRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 453

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 454  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 513

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 514  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 573

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 574  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 633

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGGX 201
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG 
Sbjct: 634  PLELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGGL 693

Query: 200  XXXXXXXLMAPSSNVISSAPVPQRKPIIRLSFTSLNNKAT 81
                   L AP SNVI SAPVPQRKP+IRL+FTSL NKAT
Sbjct: 694  GELEPVELTAPFSNVIPSAPVPQRKPLIRLNFTSL-NKAT 732


>gb|KDO42642.1| hypothetical protein CISIN_1g004767mg [Citrus sinensis]
          Length = 716

 Score =  919 bits (2375), Expect = 0.0
 Identities = 461/599 (76%), Positives = 505/599 (84%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 110  PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 169

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 170  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 229

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 230  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 289

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 290  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 349

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLK+   T +      
Sbjct: 350  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLKVNATTLH------ 403

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
                       +  ++F  ++RKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 404  ----------TWLFAFFTSIKRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 453

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 454  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 513

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 514  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 573

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 574  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 633

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGG 204
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG
Sbjct: 634  PLELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGG 692


>gb|KDO42640.1| hypothetical protein CISIN_1g004767mg [Citrus sinensis]
          Length = 698

 Score =  907 bits (2345), Expect = 0.0
 Identities = 459/599 (76%), Positives = 496/599 (82%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELHQML+EMPEVTELHPVPDAHVPVM+FK SGVSIDLLYARLSLWVIP
Sbjct: 110  PRHATREEDFFGELHQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIP 169

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVP IQNF TTLRCM+FWAKRRGVYSN
Sbjct: 170  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSN 229

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNA+P+MLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV
Sbjct: 230  VAGFLGGINWALLVARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 289

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKD+YHLMPIITPAYPCMNSSYNVS STLRIM DEFQRGHEICEAME NEA+VD
Sbjct: 290  WDPRRNPKDKYHLMPIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVD 349

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPFTF EAYKNYL+IDISAENADDLR WKGWVESRLRQLTLK             
Sbjct: 350  WDTLFEPFTFFEAYKNYLRIDISAENADDLRNWKGWVESRLRQLTLK------------- 396

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
                                 RKQGVPVGEGEQFDIRLTV+EFKQAV++YTLRKPGM+IS
Sbjct: 397  ---------------------RKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQIS 435

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKSAADDGSKRKRMDDSVD 741
            V+HV RRNLPNF+FPGGVRPSR SKGTWDSRRA E KVSS  K  ADDG KRK+ DD+VD
Sbjct: 436  VAHVTRRNLPNFVFPGGVRPSRPSKGTWDSRRALERKVSSHTKPGADDGRKRKQTDDNVD 495

Query: 740  THLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPD 561
            THLRNAKC+A MPSSSGE  EG          SINL+ E+MDAN L  SNRE+VEN L D
Sbjct: 496  THLRNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTD 555

Query: 560  SLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAF 381
            S+R SRN  EVSS NG +DG +I +P NK LS +++NSKDAEKLAIEKIMSGP + D+AF
Sbjct: 556  SIRGSRNSVEVSSHNGKVDGPMIGDPRNKGLSFNSSNSKDAEKLAIEKIMSGPYVADQAF 615

Query: 380  PXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGG 204
            P           LKNQAK F G+TQ++++GS AVN A EAT TSMNGG SSSAL  NGG
Sbjct: 616  PLELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGG 674


>ref|XP_007036647.1| Poly(A) polymerase 1 isoform 1 [Theobroma cacao]
            gi|590665102|ref|XP_007036648.1| Poly(A) polymerase 1
            isoform 1 [Theobroma cacao] gi|508773892|gb|EOY21148.1|
            Poly(A) polymerase 1 isoform 1 [Theobroma cacao]
            gi|508773893|gb|EOY21149.1| Poly(A) polymerase 1 isoform
            1 [Theobroma cacao]
          Length = 762

 Score =  881 bits (2276), Expect = 0.0
 Identities = 453/646 (70%), Positives = 508/646 (78%), Gaps = 11/646 (1%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGEL++MLSEMPEV+ELHPVPDAHVPVM+FK  GVSIDLLYA+LSLWVIP
Sbjct: 116  PRHATREEDFFGELYKMLSEMPEVSELHPVPDAHVPVMKFKFKGVSIDLLYAKLSLWVIP 175

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ DEQTVRSLNGCRVTDQ+LRLVPNIQNF TTLRCM+FWAKRRGVYSN
Sbjct: 176  EDLDISQDSILQNTDEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSN 235

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNALPNMLVSRFFRVYTQWRWPNPV+LCAIEEGSLGLQV
Sbjct: 236  VAGFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQV 295

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPR+NPKDRYHLMPIITPAYPCMNSSYNVS STLRIMTDEFQRG EICEAME N+A  D
Sbjct: 296  WDPRKNPKDRYHLMPIITPAYPCMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKA--D 353

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WD LFE + F EAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 354  WDILFESYAFFEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 413

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDF D+S+PFH SYFMGLQRKQGVPV EGEQFDIRLTVEEFK +VN+YTL KPGMEI 
Sbjct: 414  HPGDFQDKSRPFHGSYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNMYTLWKPGMEIR 473

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLA-----------KSAADDG 774
            V+HVKRRN+P+F+FPGGVRPSR SK TWDS R S+ KVS  A               DDG
Sbjct: 474  VTHVKRRNIPSFVFPGGVRPSRPSKVTWDSMRVSDAKVSGHAGPDKSGEVKGVADGQDDG 533

Query: 773  SKRKRMDDSVDTHLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGES 594
             KRKR+DD+ D  LR++K   A+PSSS    EG          S + K +Y DA GL E+
Sbjct: 534  KKRKRVDDNGDAQLRSSKYITAVPSSS---LEGRVGSPVSTVSSCSTKGDYSDATGLIET 590

Query: 593  NRERVENILPDSLRSSRNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKI 414
             RE+ E+ + + L +SR+L E+SS NG++DG +  NP  K+ S D ++  +AE LAIEKI
Sbjct: 591  TREKAESNMTNGLINSRSLEELSSHNGEVDGSVGCNPPIKV-SADASSCTEAENLAIEKI 649

Query: 413  MSGPCITDEAFPXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGG 234
            MSGP    +AFP            +NQ +    NT+   + S   + A  A  TS NG G
Sbjct: 650  MSGPYGAHQAFPQELEELEDDLEFRNQVRSV-ENTKSGPVESSMSDLAGAAPVTSSNGAG 708

Query: 233  SSSALHTNGGXXXXXXXXLMAPSSNVISSAPVPQRKPIIRLSFTSL 96
             S++LH +GG        L A  SN I SAPV QRKP+IRL+FTSL
Sbjct: 709  PSTSLHASGGIEELEPAELTAMISNRIPSAPVAQRKPLIRLNFTSL 754


>ref|XP_002322074.2| hypothetical protein POPTR_0015s04100g [Populus trichocarpa]
            gi|550321905|gb|EEF06201.2| hypothetical protein
            POPTR_0015s04100g [Populus trichocarpa]
          Length = 780

 Score =  879 bits (2270), Expect = 0.0
 Identities = 452/661 (68%), Positives = 519/661 (78%), Gaps = 22/661 (3%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELH+MLSEMPEVTELHPVPDAHVPVMRFK  GVSIDLLYA+LSLWVIP
Sbjct: 120  PRHATREEDFFGELHRMLSEMPEVTELHPVPDAHVPVMRFKFKGVSIDLLYAKLSLWVIP 179

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQ---NFHTTLRCMKFWAKRRGV 1650
            EDLD+SQDS+L +ADEQTVRSLNGCRVTDQ+LRLVPNIQ   NF TTLRCM+FWAKRRGV
Sbjct: 180  EDLDVSQDSMLHNADEQTVRSLNGCRVTDQILRLVPNIQAMQNFRTTLRCMRFWAKRRGV 239

Query: 1649 YSNVAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLG 1470
            YSNV+GFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPV+LCAIEEGSLG
Sbjct: 240  YSNVSGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLG 299

Query: 1469 LQVWDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEA 1290
            L VWDPRRNPKDRYHLMPIITPAYP MNSSYNVS STLRIMT+EFQRG+EICEAME+++A
Sbjct: 300  LSVWDPRRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGNEICEAMEVSKA 359

Query: 1289 NVDWDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQ 1110
              +WDTLFEPF+F EAYKNYLQIDISAEN DDLR+WKGWVESRLRQLTLKIERHTYNMLQ
Sbjct: 360  --EWDTLFEPFSFFEAYKNYLQIDISAENEDDLRQWKGWVESRLRQLTLKIERHTYNMLQ 417

Query: 1109 CHPHPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGM 930
            CHPHPG+FSD+S+P HCSYFMGLQRKQGVPV EGEQFDIR+TV+EFK +VN+YTL KPGM
Sbjct: 418  CHPHPGEFSDKSRPLHCSYFMGLQRKQGVPVNEGEQFDIRITVDEFKNSVNMYTLWKPGM 477

Query: 929  EISVSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKS----------AAD 780
            EI V+HVK+RN+PNF+FP GVRPSR SK TWD RR+SE KV++ + +           +D
Sbjct: 478  EIRVTHVKKRNIPNFVFPSGVRPSRPSKATWDGRRSSEAKVANNSSADKIEGKGVLDGSD 537

Query: 779  DGSKRKRMDDSVDTHLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLG 600
            +G KRKR+D+  + +LRN K YAAMP S GEVHEG          S + + + +  N LG
Sbjct: 538  EGKKRKRIDEDTENNLRNPKGYAAMPPSGGEVHEG--SPPVGNVSSCSTQSDLVITNSLG 595

Query: 599  ESNRERVENILPDSLRSSRNLAEVSSPNGDIDGHLIDNPVNKIL--SLDTANSKDAEKLA 426
            E   E+ +N   +SL +S+NLA + + NG++DG L  N  +K L  + DT++SK+AEKLA
Sbjct: 596  ELKGEKADNNETESLSNSQNLAGIFAQNGELDGILRCNLPDKGLPANNDTSSSKEAEKLA 655

Query: 425  IEKIMSGPCITDEAFPXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSM 246
            I+KIMSGP +  +A P             NQ K      + S + S   NTAVE T  S+
Sbjct: 656  IDKIMSGPYVAHQALPQELDELEDDFVYTNQGKGSEWAAKGSPVESSLSNTAVEQTNESI 715

Query: 245  ------NGGGSSSALHTNGGXXXXXXXXLMAPSSNVISSA-PVPQRKPIIRLSFTSLNNK 87
                  NG G S+ L+ NGG        LMAP  N ISSA PV Q KP+IRL+FTSL   
Sbjct: 716  AAVACSNGAGPSAYLYPNGGSEELEPAELMAPLFNGISSAPPVAQPKPLIRLNFTSLGKA 775

Query: 86   A 84
            A
Sbjct: 776  A 776


>ref|XP_011009627.1| PREDICTED: nuclear poly(A) polymerase 1 [Populus euphratica]
          Length = 776

 Score =  877 bits (2266), Expect = 0.0
 Identities = 451/658 (68%), Positives = 516/658 (78%), Gaps = 19/658 (2%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELH+MLSEMPEVTELHPVPDAHVPVMRFK  GVSIDLLYA+LSLWVIP
Sbjct: 119  PRHATREEDFFGELHRMLSEMPEVTELHPVPDAHVPVMRFKFKGVSIDLLYAKLSLWVIP 178

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLD+SQDS+L +ADEQTVRSLNGCRVTDQ+LRLVPNIQNF TTLRCM+FWAKRRGVYSN
Sbjct: 179  EDLDVSQDSMLHNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSN 238

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            V+GFLGGINWALL ARICQLFPNALPNMLVSRFFRVYTQWRWPNPV+LCAIEEGSLGL V
Sbjct: 239  VSGFLGGINWALLAARICQLFPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLPV 298

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKDRYHLMPIITPAYP MNSSYNVS STLRIMT+EFQRG+EICEAME+++A  +
Sbjct: 299  WDPRRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGNEICEAMEVSKA--E 356

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEPF+F EAYKNYLQIDISAEN DDLR+WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 357  WDTLFEPFSFFEAYKNYLQIDISAENEDDLRQWKGWVESRLRQLTLKIERHTYNMLQCHP 416

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPG+FSD+S+P HCSYFMGLQRKQGVPV EGEQFDIR+TV+EFK +V +YT RKPGMEI 
Sbjct: 417  HPGEFSDKSRPLHCSYFMGLQRKQGVPVNEGEQFDIRITVDEFKHSVKMYTSRKPGMEIH 476

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAKS----------AADDGS 771
            V+HVKRRN+PNF+FP GVRPSR SK TWD RR+SE KV++ + +           +D+G 
Sbjct: 477  VTHVKRRNIPNFVFPNGVRPSRPSKATWDGRRSSEAKVANNSSADKIEGKGVLDGSDEGK 536

Query: 770  KRKRMDDSVDTHLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESN 591
            KRKR+DD  + +LRN K YAAMP SSGEV EG          S + + + +  N LGE  
Sbjct: 537  KRKRIDDDTENNLRNPKGYAAMPPSSGEVLEG--SPPVGNVSSCSTQSDLVITNSLGELK 594

Query: 590  RERVENILPDSLRSSRNLAEVSSPNGDIDGHLIDNPVNKIL--SLDTANSKDAEKLAIEK 417
             E+ +N   +SL +S+NLA + + NG++DG L  N   K L  + +T++SK+AEKLAI+K
Sbjct: 595  GEKADNNETESLNNSQNLAGIFAQNGELDGILRCNLPGKGLPANNNTSSSKEAEKLAIDK 654

Query: 416  IMSGPCITDEAFPXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSM--- 246
            IMSGP +  +A P             NQ K      + S + S   NTA E T  S+   
Sbjct: 655  IMSGPYVAHQALPQELDELEDDFVYTNQGKGSEWAAKGSPVESSLSNTAAELTNESIAAV 714

Query: 245  ---NGGGSSSALHTNGGXXXXXXXXLMAPSSNVISSA-PVPQRKPIIRLSFTSLNNKA 84
               NG G S+ L+ NGG        LMAP  N ISSA PV Q KP+IRL+FTSL   A
Sbjct: 715  ACSNGAGPSAYLYPNGGSDELEXAELMAPLFNGISSAPPVAQPKPLIRLNFTSLGKAA 772


>ref|XP_012486424.1| PREDICTED: nuclear poly(A) polymerase 1 isoform X2 [Gossypium
            raimondii]
          Length = 726

 Score =  860 bits (2221), Expect = 0.0
 Identities = 446/648 (68%), Positives = 505/648 (77%), Gaps = 13/648 (2%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELH+MLSEMPEV+ELHPVPDAHVP+M+FK  GVSIDLLYA+LSLWVIP
Sbjct: 80   PRHATREEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIP 139

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ D+QTVRSLNGCRVTDQ+LRLVPNIQNF TTLRCM+FWAKRRGVYSN
Sbjct: 140  EDLDISQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSN 199

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNALPNMLVSRFFRVYTQWRWPNPV+LCAI+EGSLGLQV
Sbjct: 200  VAGFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIKEGSLGLQV 259

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPR+NPKDRYHLMPIITPAYP MNSSYNVS STLRIMTDEFQRG EICEAME N+A  D
Sbjct: 260  WDPRKNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKA--D 317

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WD LFE + F EAYKNYLQIDISAEN DDLR WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 318  WDALFEAYAFFEAYKNYLQIDISAENDDDLRNWKGWVESRLRQLTLKIERHTYNMLQCHP 377

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDF D S+PFHCSYFMGLQRK GVPV EGEQFDIRLTVEEFK +VN YTL KPGMEI 
Sbjct: 378  HPGDFQDNSRPFHCSYFMGLQRKLGVPVNEGEQFDIRLTVEEFKHSVNTYTLWKPGMEIR 437

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLA--------KSAAD---DG 774
            VSHVKRR++P+F+FPGGVRPSR SK TWDSRRAS+ KVS  A        K AAD   DG
Sbjct: 438  VSHVKRRSIPSFVFPGGVRPSRPSKATWDSRRASDAKVSGHAGSDKPGEVKGAADGQVDG 497

Query: 773  SKRKRMDDSVDTHLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGES 594
             KRKR DDS DT L+N+K   A+PSSS EV  G            +LK + +DA GL E 
Sbjct: 498  KKRKRADDSADTQLKNSKYITAVPSSSAEVQAG---SPGGTVSPCSLKGDNVDATGLVEP 554

Query: 593  NRERVENILPDSLRSSRNLAEVSSPNGDIDGHLIDNPVNKIL--SLDTANSKDAEKLAIE 420
             R + E+ + +  ++S +  E+SS N ++DG L   P +  L  + D ++SK+AEKLAIE
Sbjct: 555  TRGKDESNMTNGSKTS-STDELSSLNSEVDGSLRCIPPHTGLHVTADASSSKEAEKLAIE 613

Query: 419  KIMSGPCITDEAFPXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNG 240
            +IMSGP ++ +AFP            +N+     GNT +  + +P  + A  A   S NG
Sbjct: 614  QIMSGPYVSHQAFPEEPEELEDDLEFRNRVVSV-GNTNNGPLQAPVSDAAGAAPIISSNG 672

Query: 239  GGSSSALHTNGGXXXXXXXXLMAPSSNVISSAPVPQRKPIIRLSFTSL 96
             G S +LH +G         L A +S  I  APV Q+KP+IRL+FTSL
Sbjct: 673  AGPSISLHASGSIEELEPAELTAMTS--IPVAPVVQKKPLIRLNFTSL 718


>ref|XP_012486421.1| PREDICTED: nuclear poly(A) polymerase 1 isoform X1 [Gossypium
            raimondii] gi|823176367|ref|XP_012486422.1| PREDICTED:
            nuclear poly(A) polymerase 1 isoform X1 [Gossypium
            raimondii] gi|823176370|ref|XP_012486423.1| PREDICTED:
            nuclear poly(A) polymerase 1 isoform X1 [Gossypium
            raimondii] gi|763769978|gb|KJB37193.1| hypothetical
            protein B456_006G193600 [Gossypium raimondii]
            gi|763769981|gb|KJB37196.1| hypothetical protein
            B456_006G193600 [Gossypium raimondii]
          Length = 762

 Score =  860 bits (2221), Expect = 0.0
 Identities = 446/648 (68%), Positives = 505/648 (77%), Gaps = 13/648 (2%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELH+MLSEMPEV+ELHPVPDAHVP+M+FK  GVSIDLLYA+LSLWVIP
Sbjct: 116  PRHATREEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIP 175

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ D+QTVRSLNGCRVTDQ+LRLVPNIQNF TTLRCM+FWAKRRGVYSN
Sbjct: 176  EDLDISQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSN 235

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNALPNMLVSRFFRVYTQWRWPNPV+LCAI+EGSLGLQV
Sbjct: 236  VAGFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIKEGSLGLQV 295

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPR+NPKDRYHLMPIITPAYP MNSSYNVS STLRIMTDEFQRG EICEAME N+A  D
Sbjct: 296  WDPRKNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKA--D 353

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WD LFE + F EAYKNYLQIDISAEN DDLR WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 354  WDALFEAYAFFEAYKNYLQIDISAENDDDLRNWKGWVESRLRQLTLKIERHTYNMLQCHP 413

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDF D S+PFHCSYFMGLQRK GVPV EGEQFDIRLTVEEFK +VN YTL KPGMEI 
Sbjct: 414  HPGDFQDNSRPFHCSYFMGLQRKLGVPVNEGEQFDIRLTVEEFKHSVNTYTLWKPGMEIR 473

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLA--------KSAAD---DG 774
            VSHVKRR++P+F+FPGGVRPSR SK TWDSRRAS+ KVS  A        K AAD   DG
Sbjct: 474  VSHVKRRSIPSFVFPGGVRPSRPSKATWDSRRASDAKVSGHAGSDKPGEVKGAADGQVDG 533

Query: 773  SKRKRMDDSVDTHLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGES 594
             KRKR DDS DT L+N+K   A+PSSS EV  G            +LK + +DA GL E 
Sbjct: 534  KKRKRADDSADTQLKNSKYITAVPSSSAEVQAG---SPGGTVSPCSLKGDNVDATGLVEP 590

Query: 593  NRERVENILPDSLRSSRNLAEVSSPNGDIDGHLIDNPVNKIL--SLDTANSKDAEKLAIE 420
             R + E+ + +  ++S +  E+SS N ++DG L   P +  L  + D ++SK+AEKLAIE
Sbjct: 591  TRGKDESNMTNGSKTS-STDELSSLNSEVDGSLRCIPPHTGLHVTADASSSKEAEKLAIE 649

Query: 419  KIMSGPCITDEAFPXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNG 240
            +IMSGP ++ +AFP            +N+     GNT +  + +P  + A  A   S NG
Sbjct: 650  QIMSGPYVSHQAFPEEPEELEDDLEFRNRVVSV-GNTNNGPLQAPVSDAAGAAPIISSNG 708

Query: 239  GGSSSALHTNGGXXXXXXXXLMAPSSNVISSAPVPQRKPIIRLSFTSL 96
             G S +LH +G         L A +S  I  APV Q+KP+IRL+FTSL
Sbjct: 709  AGPSISLHASGSIEELEPAELTAMTS--IPVAPVVQKKPLIRLNFTSL 754


>gb|KJB37195.1| hypothetical protein B456_006G193600 [Gossypium raimondii]
          Length = 748

 Score =  850 bits (2197), Expect = 0.0
 Identities = 441/642 (68%), Positives = 499/642 (77%), Gaps = 13/642 (2%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELH+MLSEMPEV+ELHPVPDAHVP+M+FK  GVSIDLLYA+LSLWVIP
Sbjct: 116  PRHATREEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIP 175

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ D+QTVRSLNGCRVTDQ+LRLVPNIQNF TTLRCM+FWAKRRGVYSN
Sbjct: 176  EDLDISQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSN 235

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            VAGFLGGINWALLVARICQL+PNALPNMLVSRFFRVYTQWRWPNPV+LCAI+EGSLGLQV
Sbjct: 236  VAGFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIKEGSLGLQV 295

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPR+NPKDRYHLMPIITPAYP MNSSYNVS STLRIMTDEFQRG EICEAME N+A  D
Sbjct: 296  WDPRKNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKA--D 353

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WD LFE + F EAYKNYLQIDISAEN DDLR WKGWVESRLRQLTLKIERHTYNMLQCHP
Sbjct: 354  WDALFEAYAFFEAYKNYLQIDISAENDDDLRNWKGWVESRLRQLTLKIERHTYNMLQCHP 413

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPGDF D S+PFHCSYFMGLQRK GVPV EGEQFDIRLTVEEFK +VN YTL KPGMEI 
Sbjct: 414  HPGDFQDNSRPFHCSYFMGLQRKLGVPVNEGEQFDIRLTVEEFKHSVNTYTLWKPGMEIR 473

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLA--------KSAAD---DG 774
            VSHVKRR++P+F+FPGGVRPSR SK TWDSRRAS+ KVS  A        K AAD   DG
Sbjct: 474  VSHVKRRSIPSFVFPGGVRPSRPSKATWDSRRASDAKVSGHAGSDKPGEVKGAADGQVDG 533

Query: 773  SKRKRMDDSVDTHLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGES 594
             KRKR DDS DT L+N+K   A+PSSS EV  G            +LK + +DA GL E 
Sbjct: 534  KKRKRADDSADTQLKNSKYITAVPSSSAEVQAG---SPGGTVSPCSLKGDNVDATGLVEP 590

Query: 593  NRERVENILPDSLRSSRNLAEVSSPNGDIDGHLIDNPVNKIL--SLDTANSKDAEKLAIE 420
             R + E+ + +  ++S +  E+SS N ++DG L   P +  L  + D ++SK+AEKLAIE
Sbjct: 591  TRGKDESNMTNGSKTS-STDELSSLNSEVDGSLRCIPPHTGLHVTADASSSKEAEKLAIE 649

Query: 419  KIMSGPCITDEAFPXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNG 240
            +IMSGP ++ +AFP            +N+     GNT +  + +P  + A  A   S NG
Sbjct: 650  QIMSGPYVSHQAFPEEPEELEDDLEFRNRVVSV-GNTNNGPLQAPVSDAAGAAPIISSNG 708

Query: 239  GGSSSALHTNGGXXXXXXXXLMAPSSNVISSAPVPQRKPIIR 114
             G S +LH +G         L A +S  I  APV Q+KP+IR
Sbjct: 709  AGPSISLHASGSIEELEPAELTAMTS--IPVAPVVQKKPLIR 748


>ref|XP_007036649.1| Poly(A) polymerase 1 isoform 3 [Theobroma cacao]
            gi|508773894|gb|EOY21150.1| Poly(A) polymerase 1 isoform
            3 [Theobroma cacao]
          Length = 631

 Score =  850 bits (2196), Expect = 0.0
 Identities = 439/630 (69%), Positives = 492/630 (78%), Gaps = 11/630 (1%)
 Frame = -1

Query: 1952 MLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIPEDLDISQDSILQSADE 1773
            MLSEMPEV+ELHPVPDAHVPVM+FK  GVSIDLLYA+LSLWVIPEDLDISQDSILQ+ DE
Sbjct: 1    MLSEMPEVSELHPVPDAHVPVMKFKFKGVSIDLLYAKLSLWVIPEDLDISQDSILQNTDE 60

Query: 1772 QTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSNVAGFLGGINWALLVAR 1593
            QTVRSLNGCRVTDQ+LRLVPNIQNF TTLRCM+FWAKRRGVYSNVAGFLGGINWALLVAR
Sbjct: 61   QTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVAR 120

Query: 1592 ICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQVWDPRRNPKDRYHLMPI 1413
            ICQL+PNALPNMLVSRFFRVYTQWRWPNPV+LCAIEEGSLGLQVWDPR+NPKDRYHLMPI
Sbjct: 121  ICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRKNPKDRYHLMPI 180

Query: 1412 ITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVDWDTLFEPFTFSEAYKN 1233
            ITPAYPCMNSSYNVS STLRIMTDEFQRG EICEAME N+A  DWD LFE + F EAYKN
Sbjct: 181  ITPAYPCMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKA--DWDILFESYAFFEAYKN 238

Query: 1232 YLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFSDESKPFHCSY 1053
            YLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDF D+S+PFH SY
Sbjct: 239  YLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQDKSRPFHGSY 298

Query: 1052 FMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEISVSHVKRRNLPNFIFPG 873
            FMGLQRKQGVPV EGEQFDIRLTVEEFK +VN+YTL KPGMEI V+HVKRRN+P+F+FPG
Sbjct: 299  FMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNMYTLWKPGMEIRVTHVKRRNIPSFVFPG 358

Query: 872  GVRPSRSSKGTWDSRRASELKVSSLA-----------KSAADDGSKRKRMDDSVDTHLRN 726
            GVRPSR SK TWDS R S+ KVS  A               DDG KRKR+DD+ D  LR+
Sbjct: 359  GVRPSRPSKVTWDSMRVSDAKVSGHAGPDKSGEVKGVADGQDDGKKRKRVDDNGDAQLRS 418

Query: 725  AKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGESNRERVENILPDSLRSS 546
            +K   A+PSSS    EG          S + K +Y DA GL E+ RE+ E+ + + L +S
Sbjct: 419  SKYITAVPSSS---LEGRVGSPVSTVSSCSTKGDYSDATGLIETTREKAESNMTNGLINS 475

Query: 545  RNLAEVSSPNGDIDGHLIDNPVNKILSLDTANSKDAEKLAIEKIMSGPCITDEAFPXXXX 366
            R+L E+SS NG++DG +  NP  K+ S D ++  +AE LAIEKIMSGP    +AFP    
Sbjct: 476  RSLEELSSHNGEVDGSVGCNPPIKV-SADASSCTEAENLAIEKIMSGPYGAHQAFPQELE 534

Query: 365  XXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMNGGGSSSALHTNGGXXXXXX 186
                    +NQ +    NT+   + S   + A  A  TS NG G S++LH +GG      
Sbjct: 535  ELEDDLEFRNQVRSV-ENTKSGPVESSMSDLAGAAPVTSSNGAGPSTSLHASGGIEELEP 593

Query: 185  XXLMAPSSNVISSAPVPQRKPIIRLSFTSL 96
              L A  SN I SAPV QRKP+IRL+FTSL
Sbjct: 594  AELTAMISNRIPSAPVAQRKPLIRLNFTSL 623


>ref|XP_010110105.1| Poly(A) polymerase [Morus notabilis] gi|587938462|gb|EXC25192.1|
            Poly(A) polymerase [Morus notabilis]
          Length = 838

 Score =  845 bits (2183), Expect = 0.0
 Identities = 439/643 (68%), Positives = 498/643 (77%), Gaps = 14/643 (2%)
 Frame = -1

Query: 2000 PRHATREEDFFGELHQMLSEMPEVTELHPVPDAHVPVMRFKLSGVSIDLLYARLSLWVIP 1821
            PRHATREEDFFGELH+ML EMPEVTE+HPVPDAHVPV+RFK +GVSIDLLYA+LSLWVIP
Sbjct: 142  PRHATREEDFFGELHRMLVEMPEVTEVHPVPDAHVPVLRFKFNGVSIDLLYAKLSLWVIP 201

Query: 1820 EDLDISQDSILQSADEQTVRSLNGCRVTDQVLRLVPNIQNFHTTLRCMKFWAKRRGVYSN 1641
            EDLDISQDSILQ+ADEQTVRSLNGCRVTDQ+LRLVPNIQNF TTLRCM+ WAKRRGVYSN
Sbjct: 202  EDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRLWAKRRGVYSN 261

Query: 1640 VAGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQV 1461
            V+GFLGGINWALLVARICQL+PNALPNMLVSRFFRVYTQWRWPNPV+LCAIEEGSLGLQV
Sbjct: 262  VSGFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQV 321

Query: 1460 WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSLSTLRIMTDEFQRGHEICEAMEINEANVD 1281
            WDPRRNPKDRYHLMPIITPAYPCMNSSYNVS STLRIM++EFQRG EICEAME ++A  D
Sbjct: 322  WDPRRNPKDRYHLMPIITPAYPCMNSSYNVSASTLRIMSEEFQRGREICEAMETDKA--D 379

Query: 1280 WDTLFEPFTFSEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHP 1101
            WDTLFEP+ F EAYKNYLQIDISAEN DDLRKWKGWVESRLRQLTLKIERHTYN LQCHP
Sbjct: 380  WDTLFEPYPFFEAYKNYLQIDISAENDDDLRKWKGWVESRLRQLTLKIERHTYNKLQCHP 439

Query: 1100 HPGDFSDESKPFHCSYFMGLQRKQGVPVGEGEQFDIRLTVEEFKQAVNLYTLRKPGMEIS 921
            HPG+FSD+SKPFHCSYFMGLQRKQGVP  E   FDIRLTVEEFK +VN+Y L KPGM I 
Sbjct: 440  HPGEFSDKSKPFHCSYFMGLQRKQGVPANESGHFDIRLTVEEFKNSVNMYMLWKPGMLIH 499

Query: 920  VSHVKRRNLPNFIFPGGVRPSRSSKGTWDSRRASELKVSSLAK-----------SAADDG 774
            VSHVKR+N+PNF+FPG VRP R  K TWD +RASELK S LA+           + +DDG
Sbjct: 500  VSHVKRKNIPNFVFPGRVRPGRPVKITWDMKRASELKASGLAQPDKSDESKTVLNGSDDG 559

Query: 773  SKRKRMDDSVDTHLRNAKCYAAMPSSSGEVHEGXXXXXXXXXXSINLKVEYMDANGLGES 594
            SKRKR+DD+V++ LRN K  A   S +GEV E             ++K + MD N L ES
Sbjct: 560  SKRKRVDDNVESSLRNVKPRA---SFTGEVLEASSPISTLSSS--SVKFDSMDMNRLVES 614

Query: 593  NRERVENILPDSLRSSRNLAEVSSPNGDIDGHLIDNPVNK---ILSLDTANSKDAEKLAI 423
             RE+ +N   DS +   N A++ S NG+ +     +P  K   + ++D ++SK+AEK+AI
Sbjct: 615  QREKSDNNFVDSFKKCENSADIPSQNGENEVSSRCSPPTKAVPVAAVDASSSKEAEKMAI 674

Query: 422  EKIMSGPCITDEAFPXXXXXXXXXXXLKNQAKFFGGNTQDSTMGSPAVNTAVEATPTSMN 243
            + IMSGP  + +A P            +NQAK F G+T DS + +   N    A  TS  
Sbjct: 675  DNIMSGPYDSHQALP-EELDELEDFEYRNQAKDFSGSTMDSQVETSKGNQPA-APITSNT 732

Query: 242  GGGSSSALHTNGGXXXXXXXXLMAPSSNVISSAPVPQRKPIIR 114
            G G S+  + NGG        LMAP SN  SSAPV QRKPIIR
Sbjct: 733  GTGPSTGSYFNGGLEELEPAELMAPVSNG-SSAPVAQRKPIIR 774


Top