BLASTX nr result

ID: Cornus23_contig00007141 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00007141
         (2770 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007036647.1| Poly(A) polymerase 1 isoform 1 [Theobroma ca...  1025   0.0  
ref|XP_002279968.2| PREDICTED: nuclear poly(A) polymerase 1 [Vit...  1023   0.0  
gb|KJB37195.1| hypothetical protein B456_006G193600 [Gossypium r...   990   0.0  
ref|XP_012486421.1| PREDICTED: nuclear poly(A) polymerase 1 isof...   990   0.0  
ref|XP_010110105.1| Poly(A) polymerase [Morus notabilis] gi|5879...   988   0.0  
ref|XP_002322074.2| hypothetical protein POPTR_0015s04100g [Popu...   976   0.0  
ref|XP_011009627.1| PREDICTED: nuclear poly(A) polymerase 1 [Pop...   976   0.0  
ref|XP_007210342.1| hypothetical protein PRUPE_ppa001856mg [Prun...   971   0.0  
ref|XP_008240214.1| PREDICTED: poly(A) polymerase pla1 isoform X...   970   0.0  
emb|CDO98397.1| unnamed protein product [Coffea canephora]            962   0.0  
ref|XP_002524874.1| Poly(A) polymerase alpha, putative [Ricinus ...   951   0.0  
ref|XP_004137491.1| PREDICTED: nuclear poly(A) polymerase 1 isof...   949   0.0  
ref|XP_003534153.1| PREDICTED: poly(A) polymerase-like isoform X...   946   0.0  
ref|XP_006428723.1| hypothetical protein CICLE_v10011139mg [Citr...   945   0.0  
ref|XP_006493030.1| PREDICTED: poly(A) polymerase-like isoform X...   943   0.0  
ref|XP_008461688.1| PREDICTED: poly(A) polymerase beta isoform X...   942   0.0  
ref|XP_014513245.1| PREDICTED: nuclear poly(A) polymerase 1-like...   938   0.0  
gb|KOM54431.1| hypothetical protein LR48_Vigan10g032300 [Vigna a...   936   0.0  
ref|XP_007163961.1| hypothetical protein PHAVU_L002300g [Phaseol...   935   0.0  
ref|XP_006428725.1| hypothetical protein CICLE_v10011139mg [Citr...   934   0.0  

>ref|XP_007036647.1| Poly(A) polymerase 1 isoform 1 [Theobroma cacao]
            gi|590665102|ref|XP_007036648.1| Poly(A) polymerase 1
            isoform 1 [Theobroma cacao] gi|508773892|gb|EOY21148.1|
            Poly(A) polymerase 1 isoform 1 [Theobroma cacao]
            gi|508773893|gb|EOY21149.1| Poly(A) polymerase 1 isoform
            1 [Theobroma cacao]
          Length = 762

 Score = 1025 bits (2650), Expect = 0.0
 Identities = 526/732 (71%), Positives = 574/732 (78%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQLGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEVL 2021
            MGS G  NRNNGQ+LGITEPIS GGPT++DV KT ELEK+L +VGLYESQEEAV REEVL
Sbjct: 1    MGSPGLGNRNNGQRLGITEPISLGGPTDYDVIKTRELEKYLQNVGLYESQEEAVGREEVL 60

Query: 2020 GRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 1841
            GRLDQTVK WVK ISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61   GRLDQTVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1840 REEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDI 1661
            REEDFFGEL++MLSEMPEVSELHPVPDAHVPVM+FKF GVSIDLLYAKLSLWVIPEDLDI
Sbjct: 121  REEDFFGELYKMLSEMPEVSELHPVPDAHVPVMKFKFKGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1660 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 1481
            SQDSILQN DEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181  SQDSILQNTDEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 1480 GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR 1301
            GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPR+
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRK 300

Query: 1300 CPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFEP 1121
             PKDR+HLMPIITPAYP          STLRIMT+EFQRG+EICEAMEANKADW+ LFE 
Sbjct: 301  NPKDRYHLMPIITPAYPCMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDILFES 360

Query: 1120 YPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFSD 941
            Y FFE+YKNYL+IDI+A NADDLRKWKGWVESRLRQLTLKIERHT+NMLQCHPHPGDF D
Sbjct: 361  YAFFEAYKNYLQIDISAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQD 420

Query: 940  KSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRRR 761
            KSRPFH  YFMGLQRKQGVPVNEGEQFDIRLTVEEFKH+V MYTLWKPGMEI VTHV+RR
Sbjct: 421  KSRPFHGSYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNMYTLWKPGMEIRVTHVKRR 480

Query: 760  SIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXXX 581
            +IP+FVF                      KVSGH   ++S E K V  G           
Sbjct: 481  NIPSFVFPGGVRPSRPSKVTWDSMRVSDAKVSGHAGPDKSGEVKGVADGQDDGKKRKRVD 540

Query: 580  DIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTDANGLVETRGEKVESEIK 401
            D     LR++K + A  SS  E   GS +ST SS S     +DA GL+ET  EK ES + 
Sbjct: 541  DNGDAQLRSSKYITAVPSSSLEGRVGSPVSTVSSCSTKGDYSDATGLIETTREKAESNMT 600

Query: 400  EGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNAH 221
             GL   ++  EL   +      V C P I+V   + ++SSC EAE LAIEK+MSGPY AH
Sbjct: 601  NGLINSRSLEELSSHNGEVDGSVGCNPPIKV---SADASSCTEAENLAIEKIMSGPYGAH 657

Query: 220  QAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESSSNAKVAISLANVNPAGPRSSSDLNG 41
            QAFP      E+D E+RNQV+    +  G    S S+   A  + + N AGP +S   +G
Sbjct: 658  QAFPQELEELEDDLEFRNQVRSVENTKSGPVESSMSDLAGAAPVTSSNGAGPSTSLHASG 717

Query: 40   GLEELEPTELVA 5
            G+EELEP EL A
Sbjct: 718  GIEELEPAELTA 729


>ref|XP_002279968.2| PREDICTED: nuclear poly(A) polymerase 1 [Vitis vinifera]
          Length = 757

 Score = 1023 bits (2645), Expect = 0.0
 Identities = 529/737 (71%), Positives = 575/737 (78%), Gaps = 4/737 (0%)
 Frame = -1

Query: 2200 MGSHGFNNRNN-GQQLGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEV 2024
            M + G NNRNN GQ+LGITEPIS GGP   DV KT ELEKFLA  GLYESQEEAVSREEV
Sbjct: 1    MSNLGLNNRNNSGQRLGITEPISLGGPNELDVTKTQELEKFLAAAGLYESQEEAVSREEV 60

Query: 2023 LGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 1844
            LGRLDQ VKIWVK ISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA
Sbjct: 61   LGRLDQIVKIWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120

Query: 1843 TREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLD 1664
            TREEDFFGELH+MLSEMPEV+ELHPVPDAHVPVMRFKF+GVSIDLLYAKLSLWVIPEDLD
Sbjct: 121  TREEDFFGELHKMLSEMPEVTELHPVPDAHVPVMRFKFSGVSIDLLYAKLSLWVIPEDLD 180

Query: 1663 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 1484
            +SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLR MRFWAKRRGVYSNVAGF
Sbjct: 181  VSQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRFMRFWAKRRGVYSNVAGF 240

Query: 1483 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPR 1304
            LGGINWALLVARICQLYPNALP+MLVSRFFRVYTQWRWPNPVMLCAIEEG+LGLQVWDPR
Sbjct: 241  LGGINWALLVARICQLYPNALPSMLVSRFFRVYTQWRWPNPVMLCAIEEGTLGLQVWDPR 300

Query: 1303 RCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFE 1124
            + PKDRFHLMPIITPAYP          STLRIM+EEF+RGNEI E MEANKADW TL E
Sbjct: 301  KYPKDRFHLMPIITPAYPCMNSSYNVSSSTLRIMSEEFKRGNEISEVMEANKADWATLCE 360

Query: 1123 PYPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFS 944
            PYPFFE+YKNYL+I+I A NADDLRKWKGWVESRLRQLTLKIERHT+NMLQCHPHPGDFS
Sbjct: 361  PYPFFEAYKNYLQIEIAAENADDLRKWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFS 420

Query: 943  DKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRR 764
            DKSRPFHCCYFMGLQRKQGVP +EGEQFDIRLTV+EFKH+V MYTLWKPGMEI+V HVRR
Sbjct: 421  DKSRPFHCCYFMGLQRKQGVPASEGEQFDIRLTVDEFKHSVGMYTLWKPGMEIHVIHVRR 480

Query: 763  RSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNES--EESPEGKLVLGGAXXXXXXX 590
            R+IPNFVF                      KV+       E +   + VL GA       
Sbjct: 481  RNIPNFVF------------PGGVRPSRPTKVASERRRVLEPNVSTQAVLEGAEDSKKRK 528

Query: 589  XXXDIAGTNLRAAKCLAAGDSSQRESYEGSTL-STNSSPSIIVGNTDANGLVETRGEKVE 413
               +   TN R AKCL A  SS  E    + L ST ++ SI V + D N L +TR EKVE
Sbjct: 529  REDENVETNSRNAKCLVAAASSSHEVLSSNPLVSTVNACSIKVDSMDINMLGKTRKEKVE 588

Query: 412  SEIKEGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGP 233
            + I+ GL+   N  E+PP++      VRC   I+ LS++  S S  EAEK+AIEK+MSGP
Sbjct: 589  NNIEHGLKNLNNSVEVPPQNGEVDGSVRCSHPIKTLSSSGGSPSSTEAEKIAIEKIMSGP 648

Query: 232  YNAHQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESSSNAKVAISLANVNPAGPRSSS 53
            Y +HQAFP      E+D EY+NQVKDF  ST G   ESS        L   +   P +  
Sbjct: 649  YVSHQAFPGELDELEDDVEYKNQVKDFTGSTKGSSAESSKANVAEEPLTTTSGTVPCTIL 708

Query: 52   DLNGGLEELEPTELVAP 2
              NGGLEELEP EL+ P
Sbjct: 709  SPNGGLEELEPAELMPP 725


>gb|KJB37195.1| hypothetical protein B456_006G193600 [Gossypium raimondii]
          Length = 748

 Score =  990 bits (2560), Expect = 0.0
 Identities = 511/732 (69%), Positives = 560/732 (76%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQLGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEVL 2021
            MGS G    N+GQ+LGITEPIS GGPT +DV KT ELEK+L +VGLYESQEEAVSREEVL
Sbjct: 1    MGSPGLGTGNSGQRLGITEPISLGGPTEYDVIKTRELEKYLQNVGLYESQEEAVSREEVL 60

Query: 2020 GRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 1841
            GRLDQ VK WVK ISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61   GRLDQIVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1840 REEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDI 1661
            REEDFFGELH+MLSEMPEVSELHPVPDAHVP+M+FKF GVSIDLLYAKLSLWVIPEDLDI
Sbjct: 121  REEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1660 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 1481
            SQDSILQN D+QTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181  SQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 1480 GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR 1301
            GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAI+EGSLGLQVWDPR+
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIKEGSLGLQVWDPRK 300

Query: 1300 CPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFEP 1121
             PKDR+HLMPIITPAYP          STLRIMT+EFQRG+EICEAMEANKADW+ LFE 
Sbjct: 301  NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDALFEA 360

Query: 1120 YPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFSD 941
            Y FFE+YKNYL+IDI+A N DDLR WKGWVESRLRQLTLKIERHT+NMLQCHPHPGDF D
Sbjct: 361  YAFFEAYKNYLQIDISAENDDDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQD 420

Query: 940  KSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRRR 761
             SRPFHC YFMGLQRK GVPVNEGEQFDIRLTVEEFKH+V  YTLWKPGMEI V+HV+RR
Sbjct: 421  NSRPFHCSYFMGLQRKLGVPVNEGEQFDIRLTVEEFKHSVNTYTLWKPGMEIRVSHVKRR 480

Query: 760  SIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXXX 581
            SIP+FVF                      KVSGH  S++  E K    G           
Sbjct: 481  SIPSFVFPGGVRPSRPSKATWDSRRASDAKVSGHAGSDKPGEVKGAADGQVDGKKRKRAD 540

Query: 580  DIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTDANGLVETRGEKVESEIK 401
            D A T L+ +K + A  SS  E   GS   T S  S+   N DA GLVE    K ES + 
Sbjct: 541  DSADTQLKNSKYITAVPSSSAEVQAGSPGGTVSPCSLKGDNVDATGLVEPTRGKDESNMT 600

Query: 400  EGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNAH 221
             G +   +  EL   ++     +RC P    L    ++SS KEAEKLAIE++MSGPY +H
Sbjct: 601  NGSK-TSSTDELSSLNSEVDGSLRCIPPHTGLHVTADASSSKEAEKLAIEQIMSGPYVSH 659

Query: 220  QAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESSSNAKVAISLANVNPAGPRSSSDLNG 41
            QAFP      E+D E+RN+V   G +  G      S+A  A  + + N AGP  S   +G
Sbjct: 660  QAFPEEPEELEDDLEFRNRVVSVGNTNNGPLQAPVSDAAGAAPIISSNGAGPSISLHASG 719

Query: 40   GLEELEPTELVA 5
             +EELEP EL A
Sbjct: 720  SIEELEPAELTA 731


>ref|XP_012486421.1| PREDICTED: nuclear poly(A) polymerase 1 isoform X1 [Gossypium
            raimondii] gi|823176367|ref|XP_012486422.1| PREDICTED:
            nuclear poly(A) polymerase 1 isoform X1 [Gossypium
            raimondii] gi|823176370|ref|XP_012486423.1| PREDICTED:
            nuclear poly(A) polymerase 1 isoform X1 [Gossypium
            raimondii] gi|763769978|gb|KJB37193.1| hypothetical
            protein B456_006G193600 [Gossypium raimondii]
            gi|763769981|gb|KJB37196.1| hypothetical protein
            B456_006G193600 [Gossypium raimondii]
          Length = 762

 Score =  990 bits (2560), Expect = 0.0
 Identities = 511/732 (69%), Positives = 560/732 (76%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQLGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEVL 2021
            MGS G    N+GQ+LGITEPIS GGPT +DV KT ELEK+L +VGLYESQEEAVSREEVL
Sbjct: 1    MGSPGLGTGNSGQRLGITEPISLGGPTEYDVIKTRELEKYLQNVGLYESQEEAVSREEVL 60

Query: 2020 GRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 1841
            GRLDQ VK WVK ISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61   GRLDQIVKNWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1840 REEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDI 1661
            REEDFFGELH+MLSEMPEVSELHPVPDAHVP+M+FKF GVSIDLLYAKLSLWVIPEDLDI
Sbjct: 121  REEDFFGELHKMLSEMPEVSELHPVPDAHVPIMKFKFKGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1660 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 1481
            SQDSILQN D+QTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL
Sbjct: 181  SQDSILQNTDDQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 240

Query: 1480 GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR 1301
            GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAI+EGSLGLQVWDPR+
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIKEGSLGLQVWDPRK 300

Query: 1300 CPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFEP 1121
             PKDR+HLMPIITPAYP          STLRIMT+EFQRG+EICEAMEANKADW+ LFE 
Sbjct: 301  NPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTDEFQRGSEICEAMEANKADWDALFEA 360

Query: 1120 YPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFSD 941
            Y FFE+YKNYL+IDI+A N DDLR WKGWVESRLRQLTLKIERHT+NMLQCHPHPGDF D
Sbjct: 361  YAFFEAYKNYLQIDISAENDDDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFQD 420

Query: 940  KSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRRR 761
             SRPFHC YFMGLQRK GVPVNEGEQFDIRLTVEEFKH+V  YTLWKPGMEI V+HV+RR
Sbjct: 421  NSRPFHCSYFMGLQRKLGVPVNEGEQFDIRLTVEEFKHSVNTYTLWKPGMEIRVSHVKRR 480

Query: 760  SIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXXX 581
            SIP+FVF                      KVSGH  S++  E K    G           
Sbjct: 481  SIPSFVFPGGVRPSRPSKATWDSRRASDAKVSGHAGSDKPGEVKGAADGQVDGKKRKRAD 540

Query: 580  DIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTDANGLVETRGEKVESEIK 401
            D A T L+ +K + A  SS  E   GS   T S  S+   N DA GLVE    K ES + 
Sbjct: 541  DSADTQLKNSKYITAVPSSSAEVQAGSPGGTVSPCSLKGDNVDATGLVEPTRGKDESNMT 600

Query: 400  EGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNAH 221
             G +   +  EL   ++     +RC P    L    ++SS KEAEKLAIE++MSGPY +H
Sbjct: 601  NGSK-TSSTDELSSLNSEVDGSLRCIPPHTGLHVTADASSSKEAEKLAIEQIMSGPYVSH 659

Query: 220  QAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESSSNAKVAISLANVNPAGPRSSSDLNG 41
            QAFP      E+D E+RN+V   G +  G      S+A  A  + + N AGP  S   +G
Sbjct: 660  QAFPEEPEELEDDLEFRNRVVSVGNTNNGPLQAPVSDAAGAAPIISSNGAGPSISLHASG 719

Query: 40   GLEELEPTELVA 5
             +EELEP EL A
Sbjct: 720  SIEELEPAELTA 731


>ref|XP_010110105.1| Poly(A) polymerase [Morus notabilis] gi|587938462|gb|EXC25192.1|
            Poly(A) polymerase [Morus notabilis]
          Length = 838

 Score =  988 bits (2555), Expect = 0.0
 Identities = 509/760 (66%), Positives = 573/760 (75%), Gaps = 27/760 (3%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQLGITEPISYGGPTNHDVAKTHELEK--------------------- 2084
            M +HG +NRNNGQ+LGITEPIS GGPT +DV K+ ELEK                     
Sbjct: 1    MANHGLSNRNNGQRLGITEPISLGGPTEYDVMKSQELEKRLGITEPISLGGPTEYDVMKS 60

Query: 2083 -----FLADVGLYESQEEAVSREEVLGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFT 1919
                 +L D GLYESQEEAVSREEVLGRLDQ VK+WVKTISRAKGLNEQLVQEANAKIFT
Sbjct: 61   QELEKYLQDAGLYESQEEAVSREEVLGRLDQIVKLWVKTISRAKGLNEQLVQEANAKIFT 120

Query: 1918 FGSYRLGVHGPGADIDTLCVGPRHATREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMR 1739
            FGSYRLGVHGPGADIDTLCVGPRHATREEDFFGELHRML EMPEV+E+HPVPDAHVPV+R
Sbjct: 121  FGSYRLGVHGPGADIDTLCVGPRHATREEDFFGELHRMLVEMPEVTEVHPVPDAHVPVLR 180

Query: 1738 FKFNGVSIDLLYAKLSLWVIPEDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQ 1559
            FKFNGVSIDLLYAKLSLWVIPEDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQ
Sbjct: 181  FKFNGVSIDLLYAKLSLWVIPEDLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQ 240

Query: 1558 NFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQ 1379
            NFRTTLRCMR WAKRRGVYSNV+GFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQ
Sbjct: 241  NFRTTLRCMRLWAKRRGVYSNVSGFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQ 300

Query: 1378 WRWPNPVMLCAIEEGSLGLQVWDPRRCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMT 1199
            WRWPNPVMLCAIEEGSLGLQVWDPRR PKDR+HLMPIITPAYP          STLRIM+
Sbjct: 301  WRWPNPVMLCAIEEGSLGLQVWDPRRNPKDRYHLMPIITPAYPCMNSSYNVSASTLRIMS 360

Query: 1198 EEFQRGNEICEAMEANKADWNTLFEPYPFFESYKNYLEIDITAANADDLRKWKGWVESRL 1019
            EEFQRG EICEAME +KADW+TLFEPYPFFE+YKNYL+IDI+A N DDLRKWKGWVESRL
Sbjct: 361  EEFQRGREICEAMETDKADWDTLFEPYPFFEAYKNYLQIDISAENDDDLRKWKGWVESRL 420

Query: 1018 RQLTLKIERHTFNMLQCHPHPGDFSDKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVE 839
            RQLTLKIERHT+N LQCHPHPG+FSDKS+PFHC YFMGLQRKQGVP NE   FDIRLTVE
Sbjct: 421  RQLTLKIERHTYNKLQCHPHPGEFSDKSKPFHCSYFMGLQRKQGVPANESGHFDIRLTVE 480

Query: 838  EFKHTVAMYTLWKPGMEIYVTHVRRRSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGH 659
            EFK++V MY LWKPGM I+V+HV+R++IPNFVF                      K SG 
Sbjct: 481  EFKNSVNMYMLWKPGMLIHVSHVKRKNIPNFVFPGRVRPGRPVKITWDMKRASELKASGL 540

Query: 658  NESEESPEGKLVLGGAXXXXXXXXXXDIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSS 479
             + ++S E K VL G+          D   ++LR  K  A+      E+   S +ST SS
Sbjct: 541  AQPDKSDESKTVLNGSDDGSKRKRVDDNVESSLRNVKPRASFTGEVLEA--SSPISTLSS 598

Query: 478  PSIIVGNTDANGLVETRGEKVESEIKEGLEGFKNPSELPPKSAGAIDEVRCR-PTIEVLS 302
             S+   + D N LVE++ EK ++   +  +  +N +++P ++       RC  PT  V  
Sbjct: 599  SSVKFDSMDMNRLVESQREKSDNNFVDSFKKCENSADIPSQNGENEVSSRCSPPTKAVPV 658

Query: 301  ANDESSSCKEAEKLAIEKLMSGPYNAHQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTE 122
            A  ++SS KEAEK+AI+ +MSGPY++HQA P        D EYRNQ KDF  ST   Q E
Sbjct: 659  AAVDASSSKEAEKMAIDNIMSGPYDSHQALP-EELDELEDFEYRNQAKDFSGSTMDSQVE 717

Query: 121  SSSNAKVAISLANVNPAGPRSSSDLNGGLEELEPTELVAP 2
            +S   + A  + +    GP + S  NGGLEELEP EL+AP
Sbjct: 718  TSKGNQPAAPITSNTGTGPSTGSYFNGGLEELEPAELMAP 757


>ref|XP_002322074.2| hypothetical protein POPTR_0015s04100g [Populus trichocarpa]
            gi|550321905|gb|EEF06201.2| hypothetical protein
            POPTR_0015s04100g [Populus trichocarpa]
          Length = 780

 Score =  976 bits (2524), Expect = 0.0
 Identities = 518/748 (69%), Positives = 569/748 (76%), Gaps = 15/748 (2%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQ----LGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSR 2033
            MGS G  NRNNGQQ    LGITEPIS GGPT +DV KT ELEKFL D GLYESQEEAVSR
Sbjct: 1    MGSPGLINRNNGQQQQQRLGITEPISLGGPTEYDVTKTRELEKFLQDAGLYESQEEAVSR 60

Query: 2032 EEVLGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGP 1853
            EEVLGRLDQ VK WVK ISRAK LNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGP
Sbjct: 61   EEVLGRLDQIVKNWVKVISRAKRLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGP 120

Query: 1852 RHATREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPE 1673
            RHATREEDFFGELHRMLSEMPEV+ELHPVPDAHVPVMRFKF GVSIDLLYAKLSLWVIPE
Sbjct: 121  RHATREEDFFGELHRMLSEMPEVTELHPVPDAHVPVMRFKFKGVSIDLLYAKLSLWVIPE 180

Query: 1672 DLDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQ---NFRTTLRCMRFWAKRRGVY 1502
            DLD+SQDS+L NADEQTVRSLNGCRVTDQILRLVPNIQ   NFRTTLRCMRFWAKRRGVY
Sbjct: 181  DLDVSQDSMLHNADEQTVRSLNGCRVTDQILRLVPNIQAMQNFRTTLRCMRFWAKRRGVY 240

Query: 1501 SNVAGFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGL 1322
            SNV+GFLGGINWALLVARICQL+PNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGL
Sbjct: 241  SNVSGFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGL 300

Query: 1321 QVWDPRRCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKAD 1142
             VWDPRR PKDR+HLMPIITPAYP          STLRIMTEEFQRGNEICEAME +KA+
Sbjct: 301  SVWDPRRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGNEICEAMEVSKAE 360

Query: 1141 WNTLFEPYPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHP 962
            W+TLFEP+ FFE+YKNYL+IDI+A N DDLR+WKGWVESRLRQLTLKIERHT+NMLQCHP
Sbjct: 361  WDTLFEPFSFFEAYKNYLQIDISAENEDDLRQWKGWVESRLRQLTLKIERHTYNMLQCHP 420

Query: 961  HPGDFSDKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIY 782
            HPG+FSDKSRP HC YFMGLQRKQGVPVNEGEQFDIR+TV+EFK++V MYTLWKPGMEI 
Sbjct: 421  HPGEFSDKSRPLHCSYFMGLQRKQGVPVNEGEQFDIRITVDEFKNSVNMYTLWKPGMEIR 480

Query: 781  VTHVRRRSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXX 602
            VTHV++R+IPNFVF                      KV+ +N S +  EGK VL G+   
Sbjct: 481  VTHVKKRNIPNFVFPSGVRPSRPSKATWDGRRSSEAKVA-NNSSADKIEGKGVLDGSDEG 539

Query: 601  XXXXXXXDIAGTNLRAAKCLAAGDSSQRESYEGSTLSTN-SSPSIIVGNTDANGLVETRG 425
                   +    NLR  K  AA   S  E +EGS    N SS S        N L E +G
Sbjct: 540  KKRKRIDEDTENNLRNPKGYAAMPPSGGEVHEGSPPVGNVSSCSTQSDLVITNSLGELKG 599

Query: 424  EKVESEIKEGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKL 245
            EK ++   E L   +N + +  ++      +RC    + L AN+++SS KEAEKLAI+K+
Sbjct: 600  EKADNNETESLSNSQNLAGIFAQNGELDGILRCNLPDKGLPANNDTSSSKEAEKLAIDKI 659

Query: 244  MSGPYNAHQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESS-SNAKV------AISLA 86
            MSGPY AHQA P      E+D  Y NQ K    +  G   ESS SN  V        ++A
Sbjct: 660  MSGPYVAHQALPQELDELEDDFVYTNQGKGSEWAAKGSPVESSLSNTAVEQTNESIAAVA 719

Query: 85   NVNPAGPRSSSDLNGGLEELEPTELVAP 2
              N AGP +    NGG EELEP EL+AP
Sbjct: 720  CSNGAGPSAYLYPNGGSEELEPAELMAP 747


>ref|XP_011009627.1| PREDICTED: nuclear poly(A) polymerase 1 [Populus euphratica]
          Length = 776

 Score =  976 bits (2522), Expect = 0.0
 Identities = 516/745 (69%), Positives = 566/745 (75%), Gaps = 12/745 (1%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQ---LGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSRE 2030
            MGS G  NRNNGQQ   LGITEPIS GGPT +DV KT ELEKFL D GLYESQEEAVSRE
Sbjct: 1    MGSPGLINRNNGQQQQRLGITEPISLGGPTEYDVTKTRELEKFLQDAGLYESQEEAVSRE 60

Query: 2029 EVLGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPR 1850
            EVLGRLDQ VK WVK ISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPR
Sbjct: 61   EVLGRLDQIVKNWVKVISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPR 120

Query: 1849 HATREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPED 1670
            HATREEDFFGELHRMLSEMPEV+ELHPVPDAHVPVMRFKF GVSIDLLYAKLSLWVIPED
Sbjct: 121  HATREEDFFGELHRMLSEMPEVTELHPVPDAHVPVMRFKFKGVSIDLLYAKLSLWVIPED 180

Query: 1669 LDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVA 1490
            LD+SQDS+L NADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+
Sbjct: 181  LDVSQDSMLHNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVS 240

Query: 1489 GFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWD 1310
            GFLGGINWALL ARICQL+PNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGL VWD
Sbjct: 241  GFLGGINWALLAARICQLFPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLPVWD 300

Query: 1309 PRRCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTL 1130
            PRR PKDR+HLMPIITPAYP          STLRIMTEEFQRGNEICEAME +KA+W+TL
Sbjct: 301  PRRNPKDRYHLMPIITPAYPSMNSSYNVSSSTLRIMTEEFQRGNEICEAMEVSKAEWDTL 360

Query: 1129 FEPYPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGD 950
            FEP+ FFE+YKNYL+IDI+A N DDLR+WKGWVESRLRQLTLKIERHT+NMLQCHPHPG+
Sbjct: 361  FEPFSFFEAYKNYLQIDISAENEDDLRQWKGWVESRLRQLTLKIERHTYNMLQCHPHPGE 420

Query: 949  FSDKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHV 770
            FSDKSRP HC YFMGLQRKQGVPVNEGEQFDIR+TV+EFKH+V MYT  KPGMEI+VTHV
Sbjct: 421  FSDKSRPLHCSYFMGLQRKQGVPVNEGEQFDIRITVDEFKHSVKMYTSRKPGMEIHVTHV 480

Query: 769  RRRSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXX 590
            +RR+IPNFVF                      KV+ +N S +  EGK VL G+       
Sbjct: 481  KRRNIPNFVFPNGVRPSRPSKATWDGRRSSEAKVA-NNSSADKIEGKGVLDGSDEGKKRK 539

Query: 589  XXXDIAGTNLRAAKCLAAGDSSQRESYEGSTLSTN-SSPSIIVGNTDANGLVETRGEKVE 413
               D    NLR  K  AA   S  E  EGS    N SS S        N L E +GEK +
Sbjct: 540  RIDDDTENNLRNPKGYAAMPPSSGEVLEGSPPVGNVSSCSTQSDLVITNSLGELKGEKAD 599

Query: 412  SEIKEGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGP 233
            +   E L   +N + +  ++      +RC    + L AN+ +SS KEAEKLAI+K+MSGP
Sbjct: 600  NNETESLNNSQNLAGIFAQNGELDGILRCNLPGKGLPANNNTSSSKEAEKLAIDKIMSGP 659

Query: 232  YNAHQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESS--------SNAKVAISLANVN 77
            Y AHQA P      E+D  Y NQ K    +  G   ESS        +N  +A ++A  N
Sbjct: 660  YVAHQALPQELDELEDDFVYTNQGKGSEWAAKGSPVESSLSNTAAELTNESIA-AVACSN 718

Query: 76   PAGPRSSSDLNGGLEELEPTELVAP 2
             AGP +    NGG +ELE  EL+AP
Sbjct: 719  GAGPSAYLYPNGGSDELEXAELMAP 743


>ref|XP_007210342.1| hypothetical protein PRUPE_ppa001856mg [Prunus persica]
            gi|462406077|gb|EMJ11541.1| hypothetical protein
            PRUPE_ppa001856mg [Prunus persica]
          Length = 755

 Score =  971 bits (2511), Expect = 0.0
 Identities = 508/734 (69%), Positives = 564/734 (76%), Gaps = 1/734 (0%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQLGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEVL 2021
            M S G +NRNNG++LGITEPIS GGPT +DV KT ELEK+L D  LYESQEEAVSREEVL
Sbjct: 1    MASPGLSNRNNGKRLGITEPISLGGPTEYDVIKTRELEKYLQDARLYESQEEAVSREEVL 60

Query: 2020 GRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 1841
            GRLDQ VKIWVKTISR KGLNEQLV EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61   GRLDQIVKIWVKTISRTKGLNEQLVHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1840 REEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDI 1661
            REEDFFGEL RMLSEMPEV+ELHPVPDAHVPVM+FKF+GVSIDLLYAKLSLWVIPEDLDI
Sbjct: 121  REEDFFGELQRMLSEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1660 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 1481
            SQDSILQNADEQTVRSLNGCRVTDQILRLVP+IQNFRTTLRCMR WAKRRGVYSNVAGFL
Sbjct: 181  SQDSILQNADEQTVRSLNGCRVTDQILRLVPSIQNFRTTLRCMRLWAKRRGVYSNVAGFL 240

Query: 1480 GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR 1301
            GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR 300

Query: 1300 CPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFEP 1121
             PKD++HLMPIITPAYP          STLRIM EEFQRGNEICEAMEANKADW+TLFE 
Sbjct: 301  NPKDKYHLMPIITPAYPSMNSSYNVSSSTLRIMLEEFQRGNEICEAMEANKADWDTLFES 360

Query: 1120 YPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFSD 941
            Y FFE+YKNYL+IDI+A NADD RKWKGWVESRLRQLTLKIERHT+ MLQCHPHPGDFSD
Sbjct: 361  YDFFEAYKNYLQIDISAENADDFRKWKGWVESRLRQLTLKIERHTYGMLQCHPHPGDFSD 420

Query: 940  KSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRRR 761
            KSRPFH  YFMGLQRKQGVPV EGEQFDIR TVEEFK +V +YTL + GMEI V+HV+RR
Sbjct: 421  KSRPFHSSYFMGLQRKQGVPVTEGEQFDIRATVEEFKQSVNLYTLLERGMEIRVSHVKRR 480

Query: 760  SIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXXX 581
            +IPNFVF                      KVSG ++ ++  EGK  L G+          
Sbjct: 481  NIPNFVFPGEVRPLRLSKVTWGSRRGSELKVSGDSQPDKLCEGKTDLDGSDGGQKRKRVD 540

Query: 580  DIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTDANGLVETRGEKVESEIK 401
            D   TN R AK L    SS         +S  SS S    + DAN       +KV+  I 
Sbjct: 541  DNVETNSRYAKSLHL--SSGEVHAASPPISNISSCSTKCESMDAN-------KKVDDSIA 591

Query: 400  EGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNAH 221
            + LE  +NP+++P ++       RC+P  + L A   +SS KEAEK+A+ K M+GPY +H
Sbjct: 592  DSLEKIENPADIPYQNGQIEVSSRCKPPNDSLPAAANTSSSKEAEKMALGKNMAGPYVSH 651

Query: 220  QAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTE-SSSNAKVAISLANVNPAGPRSSSDLN 44
            QA P      E+D+E+ +QVKDF  +    Q E S  +  V+  + + N AGP S+   N
Sbjct: 652  QALP-ELDELEDDSEHGHQVKDFSRNMKSSQMEPSEESVSVSAPVNSSNGAGP-STDSYN 709

Query: 43   GGLEELEPTELVAP 2
            GGLEELEP EL+ P
Sbjct: 710  GGLEELEPAELMVP 723


>ref|XP_008240214.1| PREDICTED: poly(A) polymerase pla1 isoform X1 [Prunus mume]
          Length = 755

 Score =  970 bits (2507), Expect = 0.0
 Identities = 508/734 (69%), Positives = 563/734 (76%), Gaps = 1/734 (0%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQLGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEVL 2021
            M S G +NRNNG++LGITEPIS GGPT +DV KT ELEK+L D  LYESQEEAVSREEVL
Sbjct: 1    MASPGLSNRNNGKRLGITEPISLGGPTEYDVIKTRELEKYLQDARLYESQEEAVSREEVL 60

Query: 2020 GRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 1841
            GRLDQ VKIWVKTISRAKGLNEQLV EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61   GRLDQIVKIWVKTISRAKGLNEQLVHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1840 REEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDI 1661
            REEDFFGEL RMLSEMPEV+ELHPVPDAHVPVM+FKF+GVSIDLLYAKLSLWVIPEDLDI
Sbjct: 121  REEDFFGELQRMLSEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYAKLSLWVIPEDLDI 180

Query: 1660 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 1481
            SQDSILQNADEQTVRSLNGCRVTDQILRLVP+IQNFRTTLRCMR WAKRRGVYSNVAGFL
Sbjct: 181  SQDSILQNADEQTVRSLNGCRVTDQILRLVPSIQNFRTTLRCMRLWAKRRGVYSNVAGFL 240

Query: 1480 GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR 1301
            GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR 300

Query: 1300 CPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFEP 1121
             PKD++HLMPIITP+YP          STLRIM EEFQRGNEICEAME+NKADW+TLFE 
Sbjct: 301  NPKDKYHLMPIITPSYPSMNSSYNVSSSTLRIMLEEFQRGNEICEAMESNKADWDTLFES 360

Query: 1120 YPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFSD 941
            Y FFE+YKNYL+IDI+A NADD RKWKGWVESRLRQLTLKIERHT++MLQCHPHPGDFSD
Sbjct: 361  YNFFEAYKNYLQIDISAENADDFRKWKGWVESRLRQLTLKIERHTYDMLQCHPHPGDFSD 420

Query: 940  KSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRRR 761
            KSRPFH  YFMGLQRKQGVPV EGEQFDIR TVEEFK +V  YTL + G EI V+HV+RR
Sbjct: 421  KSRPFHSSYFMGLQRKQGVPVTEGEQFDIRATVEEFKQSVNRYTLLERGREIRVSHVKRR 480

Query: 760  SIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXXX 581
            +IPNFVF                      KVSG  + ++  EGK  L G+          
Sbjct: 481  NIPNFVFPGEVRPLRPSKVTWGSRRGSELKVSGDAQPDKLCEGKTDLEGSDGGQKRKRVD 540

Query: 580  DIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTDANGLVETRGEKVESEIK 401
            D   T+ R AK L     S         +S  SS S    + DAN       +KV+  I 
Sbjct: 541  DTVETDSRYAKSLHL--CSGEVHAASPPISNISSRSTKCESMDAN-------KKVDDSIA 591

Query: 400  EGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNAH 221
              LE  +NP+++P ++       RC P  + L A   +SS KEAEK+A+EK M+GPY +H
Sbjct: 592  VSLEKIENPADIPYQNGQIEVSSRCNPPNDSLPAAANTSSFKEAEKMALEKNMAGPYVSH 651

Query: 220  QAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTE-SSSNAKVAISLANVNPAGPRSSSDLN 44
            QAFP      E+D+EYR+QVKDF  +    Q E S  +  V+  + + N AGP S+   N
Sbjct: 652  QAFP-ELDELEDDSEYRHQVKDFSRNMKSSQMEPSEESVSVSARVNSSNGAGP-STDSYN 709

Query: 43   GGLEELEPTELVAP 2
            GGLEELEP EL+ P
Sbjct: 710  GGLEELEPAELMVP 723


>emb|CDO98397.1| unnamed protein product [Coffea canephora]
          Length = 754

 Score =  962 bits (2488), Expect = 0.0
 Identities = 508/743 (68%), Positives = 559/743 (75%), Gaps = 10/743 (1%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQLGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEVL 2021
            M   GF N+++GQ+LGITEPIS+ GPT +D+ KT ELEKFLADVGLYESQEEA+SREEVL
Sbjct: 1    MAGPGFGNQSSGQRLGITEPISWSGPTEYDMIKTRELEKFLADVGLYESQEEAISREEVL 60

Query: 2020 GRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 1841
            GRLDQ VK WVK +SRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT
Sbjct: 61   GRLDQIVKTWVKNVSRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHAT 120

Query: 1840 REEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDI 1661
            R++DFFGEL RMLSEMPEVSELHPVPDAHVPV++FKF+G+SIDLLYAKLSLWVIPEDLDI
Sbjct: 121  RDDDFFGELQRMLSEMPEVSELHPVPDAHVPVLKFKFSGISIDLLYAKLSLWVIPEDLDI 180

Query: 1660 SQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFL 1481
            SQ+SILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMR+WAKRRGVYSNVAGFL
Sbjct: 181  SQESILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRYWAKRRGVYSNVAGFL 240

Query: 1480 GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRR 1301
            GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLC IE+GSLGL VWDPRR
Sbjct: 241  GGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCEIEDGSLGLPVWDPRR 300

Query: 1300 CPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFEP 1121
             PKDRFHLMPIITPAYP          STLRIMT EFQRGNEICEAM+ANK +W+ LFE 
Sbjct: 301  NPKDRFHLMPIITPAYPCMNSSYNVSSSTLRIMTNEFQRGNEICEAMDANKCNWDKLFEL 360

Query: 1120 YPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFSD 941
            YPFFE+YKNYL+ID+TAANA DL  WKGWVESRLRQLTLKIERHT NMLQCHPHPGDFSD
Sbjct: 361  YPFFEAYKNYLQIDVTAANAADLMNWKGWVESRLRQLTLKIERHTLNMLQCHPHPGDFSD 420

Query: 940  KSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRRR 761
            KSRPF+CCYFMGLQRKQGV  NEGEQFDIRLTVEEFKH V MY  WKPGMEI+V HV+RR
Sbjct: 421  KSRPFYCCYFMGLQRKQGVAANEGEQFDIRLTVEEFKHAVGMYNTWKPGMEIHVCHVKRR 480

Query: 760  SIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXXX 581
            SIP FVF                      KVS H E    P  K + GG+          
Sbjct: 481  SIPAFVF-PGGVRPRPTKVAGEGRRPSQTKVSSHTEDSSFP--KALNGGSKRKRDDTD-- 535

Query: 580  DIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTD-ANGLVETRG----EKV 416
                T+L A +    G+S +        L     PS  +G +   N  +ET G    EKV
Sbjct: 536  --TATSLNAKRIAGVGESGE--------LVHEGRPSGCIGTSYLGNASLETPGKIFNEKV 585

Query: 415  ESEIKEGLEGFKNPSELPPKSA---GAID-EVRCRPTIEVLSANDESSSCKEAEKLAIEK 248
            E  +  GLE   NP  LP  S+   G +D  +R  P+     A+  S S KEAEKLAIEK
Sbjct: 586  EDNMGNGLE---NPICLPQASSQNGGELDASLRLDPS---TPADSISLSSKEAEKLAIEK 639

Query: 247  LMSGPYNAHQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESS-SNAKVAISLANVNPA 71
            +M+GPY AHQ FP      E+D EY+NQ K  G S  G   ESS +   + +SL     A
Sbjct: 640  MMTGPYVAHQTFPQELDELEDDPEYKNQGKITGGSVKGSSMESSATKGSLIVSLTTSTAA 699

Query: 70   GPRSSSDLNGGLEELEPTELVAP 2
            G  SS   +G LEELEP EL+ P
Sbjct: 700  GSCSSLQSSGKLEELEPPELLPP 722


>ref|XP_002524874.1| Poly(A) polymerase alpha, putative [Ricinus communis]
            gi|223535837|gb|EEF37498.1| Poly(A) polymerase alpha,
            putative [Ricinus communis]
          Length = 770

 Score =  951 bits (2457), Expect = 0.0
 Identities = 511/753 (67%), Positives = 563/753 (74%), Gaps = 20/753 (2%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQ---LGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSRE 2030
            MGS G + RNNGQQ   LGIT+PIS GGPT +D+ K+ ELEKFL D+GLYES+EE+VSRE
Sbjct: 1    MGSPGLSTRNNGQQQLRLGITDPISLGGPTEYDLIKSRELEKFLQDMGLYESREESVSRE 60

Query: 2029 EVLGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPR 1850
            EVLGRLDQ VK WVK ISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPR
Sbjct: 61   EVLGRLDQIVKHWVKVISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPR 120

Query: 1849 HATREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPED 1670
            HATREEDFFGELHRMLSEMPEV+ELHPVPDAHVPV++FKF GVSIDLLYAKLSLWVIPED
Sbjct: 121  HATREEDFFGELHRMLSEMPEVTELHPVPDAHVPVLKFKFKGVSIDLLYAKLSLWVIPED 180

Query: 1669 LDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVA 1490
            LDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNV+
Sbjct: 181  LDISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVS 240

Query: 1489 GFLGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWD 1310
            GFLGGINWALLVARICQL+PNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWD
Sbjct: 241  GFLGGINWALLVARICQLFPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWD 300

Query: 1309 PRRCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTL 1130
            PRR PKDR+HLMPIITPAYP          STLRIM+EEFQRGNEICEAMEA+KADWNTL
Sbjct: 301  PRRNPKDRYHLMPIITPAYPSMNSSYNVSASTLRIMSEEFQRGNEICEAMEASKADWNTL 360

Query: 1129 FEPYPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGD 950
            FE + FF++YKNYL+IDI A N DDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPG+
Sbjct: 361  FESFSFFDAYKNYLQIDIGAENEDDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGE 420

Query: 949  FSDKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHV 770
            F+DKSRP HC YFMGLQRKQGVP NEGEQFDIR+TVEEFK +V MYT WKPGMEI+VTHV
Sbjct: 421  FTDKSRPLHCSYFMGLQRKQGVPANEGEQFDIRITVEEFKISVNMYTSWKPGMEIHVTHV 480

Query: 769  RRRSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXX 590
            +RR+IP+FVF                        S  + +E+S EGK V  G+       
Sbjct: 481  KRRNIPSFVFPGGIRPSRPSKTTWD---------SKRSSAEKSGEGKGVSDGSDDKGKRK 531

Query: 589  XXXDIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPS---IIVGNTDANGLVETRGEK 419
                    N+  A  +A  DS+   S E      N SPS   + +  T  N + E R E 
Sbjct: 532  R----IDDNVANATKIAKPDSTSPLSGE----VNNGSPSAGTVSLLLTSTNAVGEPRDEP 583

Query: 418  VESEIKEGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMS 239
            VE+ +    + F N  +L    A         P  +VL   +E+ S K AEKLAIE +MS
Sbjct: 584  VENNL---TDDFSNSKDLTGFYA---HNGELNPPNKVLLGINEAPS-KVAEKLAIETIMS 636

Query: 238  GPYNAHQAFPXXXXXXENDTEYRNQVKDFGV-----STGGGQTESSS---------NAKV 101
            GPY  +QA P      E+D E R QVKDFG       T     ESSS         NA V
Sbjct: 637  GPYVTNQALPQELDELEDDFECRTQVKDFGAGANTKDTNDSPIESSSASTTAKTLPNAPV 696

Query: 100  AISLANVNPAGPRSSSDLNGGLEELEPTELVAP 2
            A  + + N   P ++   NGGLEELEP ELVAP
Sbjct: 697  AAPVISSNGTDPSTALCPNGGLEELEPAELVAP 729


>ref|XP_004137491.1| PREDICTED: nuclear poly(A) polymerase 1 isoform X1 [Cucumis sativus]
            gi|700209059|gb|KGN64155.1| hypothetical protein
            Csa_1G042640 [Cucumis sativus]
          Length = 748

 Score =  949 bits (2454), Expect = 0.0
 Identities = 497/737 (67%), Positives = 555/737 (75%), Gaps = 4/737 (0%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQ-LGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEV 2024
            MGS     RNNGQQ LGIT+PIS  GPT +DV KT ELEK+L D GLYESQE+AV+REEV
Sbjct: 1    MGSPALCGRNNGQQRLGITDPISLSGPTEYDVLKTRELEKYLQDAGLYESQEDAVNREEV 60

Query: 2023 LGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 1844
            LGRLDQ VKIWVK ISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA
Sbjct: 61   LGRLDQIVKIWVKAISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120

Query: 1843 TREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLD 1664
            TREEDFFGELH+MLSEMPEVSELHPVPDAHVPVMRFK +GVSIDLLYAKLSLWVIPEDLD
Sbjct: 121  TREEDFFGELHKMLSEMPEVSELHPVPDAHVPVMRFKLSGVSIDLLYAKLSLWVIPEDLD 180

Query: 1663 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 1484
            ISQDSILQN DEQTVRSLNGCRVTD+ILRLVPNIQ+FRTTLRCMRFWAKRRGVYSNV+GF
Sbjct: 181  ISQDSILQNTDEQTVRSLNGCRVTDRILRLVPNIQSFRTTLRCMRFWAKRRGVYSNVSGF 240

Query: 1483 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPR 1304
            LGGINWALLVARICQLYPNALPNMLVSRFFRV+TQWRWPNPVMLCA EEGSLGLQVWDPR
Sbjct: 241  LGGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCANEEGSLGLQVWDPR 300

Query: 1303 RCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFE 1124
            R PKDR+HLMPIITPAYP          STLRIMTEEF+RG++ICE ME NK+DW+TLFE
Sbjct: 301  RNPKDRYHLMPIITPAYPCMNSSYNVSASTLRIMTEEFRRGHDICEVMEENKSDWDTLFE 360

Query: 1123 PYPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFS 944
            PYPFFE+YKNYL+IDITA N DD+R WKGWVESRLRQLTLKIERHT+NMLQCHP+PGDFS
Sbjct: 361  PYPFFEAYKNYLQIDITAENDDDIRIWKGWVESRLRQLTLKIERHTYNMLQCHPYPGDFS 420

Query: 943  DKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRR 764
            DKSRPFH CYFMGLQRKQG P + GEQFDIRLTV+EFKH+V +YT  K GMEIYV+HV+R
Sbjct: 421  DKSRPFHHCYFMGLQRKQGGPASGGEQFDIRLTVDEFKHSVNVYTQRKRGMEIYVSHVKR 480

Query: 763  RSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXX 584
            RSIPNFVF                      K S   + +   E    L G          
Sbjct: 481  RSIPNFVFPGGVRPSRASKLTWDIRRSSELKASDSTQVDSPSEATESLDGDDRRKRIRID 540

Query: 583  XDIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTDANGLVETRGEKVESEI 404
             + A TNLR  +CLA   S   E +E S +S  SS SI     D N  + T    +E   
Sbjct: 541  DN-ANTNLRNGECLAMAHSHPEEVHEVSQVSNTSSCSI----KDVN-FIPTSANNLE--- 591

Query: 403  KEGLEGFKNPSELPPKSAGAIDEVRCRP-TIEVLSANDESSSCKEAEKLAIEKLMSGPYN 227
                    N +++  ++ G    +R  P T  V  A  ++S+CKEAEKLAI+K++S  Y+
Sbjct: 592  --------NLADVSSQNNGDHGSLRVSPSTNNVSDAAADTSNCKEAEKLAIQKILSDSYD 643

Query: 226  AHQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESS--SNAKVAISLANVNPAGPRSSS 53
            +HQ FP        D +Y NQ KDFG +  G    SS  + + + +   + N A   SSS
Sbjct: 644  SHQDFP-CETEELEDFDYNNQAKDFGATKQGSPMMSSVANTSPLVLPTVSCNEARQSSSS 702

Query: 52   DLNGGLEELEPTELVAP 2
              NGGLEELEP E+VAP
Sbjct: 703  YYNGGLEELEPAEIVAP 719


>ref|XP_003534153.1| PREDICTED: poly(A) polymerase-like isoform X1 [Glycine max]
            gi|571478167|ref|XP_006587485.1| PREDICTED: poly(A)
            polymerase-like isoform X2 [Glycine max]
            gi|734382895|gb|KHN23742.1| Poly(A) polymerase [Glycine
            soja] gi|947090460|gb|KRH39125.1| hypothetical protein
            GLYMA_09G179500 [Glycine max]
          Length = 757

 Score =  946 bits (2444), Expect = 0.0
 Identities = 488/735 (66%), Positives = 550/735 (74%), Gaps = 2/735 (0%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQ-LGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEV 2024
            MG  G +N+NNGQQ LGITEPIS  GPT  DV KT ELEK+L  VGLYESQEEAV REEV
Sbjct: 1    MGIPGLSNQNNGQQRLGITEPISLAGPTEDDVIKTRELEKYLQGVGLYESQEEAVGREEV 60

Query: 2023 LGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 1844
            LGRLDQ VKIWVK ISRAKG NEQLV EANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA
Sbjct: 61   LGRLDQIVKIWVKNISRAKGFNEQLVHEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120

Query: 1843 TREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLD 1664
            +R+EDFFGEL +MLSEM EV+ELHPVPDAHVPVM+FKFNGVS+DLLYA+L+LWVIP+DLD
Sbjct: 121  SRDEDFFGELQKMLSEMQEVTELHPVPDAHVPVMKFKFNGVSVDLLYARLALWVIPDDLD 180

Query: 1663 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 1484
            ISQ+SILQN DEQTV SLNGCRVTDQ+LRLVPNIQ FRTTLRCMRFWAKRRGVYSNVAGF
Sbjct: 181  ISQESILQNVDEQTVLSLNGCRVTDQVLRLVPNIQTFRTTLRCMRFWAKRRGVYSNVAGF 240

Query: 1483 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPR 1304
            LGGIN ALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGL VWDPR
Sbjct: 241  LGGINLALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLSVWDPR 300

Query: 1303 RCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFE 1124
            R PKDR+HLMPIITPAYP          STLR+M++EF+RG+EICEAMEA+KADW+TLFE
Sbjct: 301  RNPKDRYHLMPIITPAYPCMNSTYNVTSSTLRVMSDEFRRGSEICEAMEASKADWDTLFE 360

Query: 1123 PYPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFS 944
            PYPFFESYKNYL+IDITA NADDLR+WKGWVESRLRQLTLKIERHT+ MLQCHPHPG+FS
Sbjct: 361  PYPFFESYKNYLQIDITAENADDLRQWKGWVESRLRQLTLKIERHTYGMLQCHPHPGEFS 420

Query: 943  DKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRR 764
            D SRPFH CYFMGLQRKQGVPVNEGEQFDIRLTVEEFKH+V  YTLWKPGM I+V+HV+R
Sbjct: 421  DNSRPFHHCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNAYTLWKPGMNIHVSHVKR 480

Query: 763  RSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXX 584
            R+IPN++F                      +V GH ++E+   GK V+ GA         
Sbjct: 481  RNIPNYIFPGGVRPTFPSKVTAENKQSSKSRVPGHGQAEKPQGGKTVVVGADDVRKRKRS 540

Query: 583  XDIAGTNLRAAKCLAAGDSSQRESYEG-STLSTNSSPSIIVGNTDANGLVETRGEKVESE 407
             DI   N R +K   +     RE  E  S +S +SS S+    ++ N +   + EK    
Sbjct: 541  EDIMDNNPRNSKSPVSLAPPSREVNEDISPISASSSCSMKFDESEVNSIGGQKSEK---- 596

Query: 406  IKEGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYN 227
                     +P E+P   +G    V     +  + A  ++S+ KE EKLAIEK+MSGPY+
Sbjct: 597  -----PCLNSPGEIPSGDSGTNGSVTNNQQVNPVLAAADTSNSKEEEKLAIEKIMSGPYD 651

Query: 226  AHQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESSSNAKVAISLANVNPAGPRSSSDL 47
            AHQAFP      E+DT+Y+NQ KD G +         S   VA            +    
Sbjct: 652  AHQAFPEEPEELEDDTQYKNQDKDSGGNMKNNMESLLSKPAVAEEPVISKEITCSTHLFS 711

Query: 46   NGGLEELEPTELVAP 2
            N  LEELEP EL AP
Sbjct: 712  NEILEELEPAELSAP 726


>ref|XP_006428723.1| hypothetical protein CICLE_v10011139mg [Citrus clementina]
            gi|557530780|gb|ESR41963.1| hypothetical protein
            CICLE_v10011139mg [Citrus clementina]
          Length = 748

 Score =  945 bits (2442), Expect = 0.0
 Identities = 496/730 (67%), Positives = 556/730 (76%), Gaps = 6/730 (0%)
 Frame = -1

Query: 2173 NNGQQLGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEVLGRLDQTVKI 1994
            +NGQ+LGITEPIS  GPT+ D+ +T +LEK+L DV LYESQEEAVSREEVLGRLDQ VKI
Sbjct: 4    SNGQRLGITEPISLAGPTDDDLMRTRKLEKYLRDVNLYESQEEAVSREEVLGRLDQIVKI 63

Query: 1993 WVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFGEL 1814
            WVK ISRAKGLN+QL+QEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFGEL
Sbjct: 64   WVKKISRAKGLNDQLLQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFGEL 123

Query: 1813 HRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDISQDSILQNA 1634
            H+ML+EMPEV+ELHPVPDAHVPVM+FKF+GVSIDLLYA+LSLWVIPEDLDISQDSILQNA
Sbjct: 124  HQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIPEDLDISQDSILQNA 183

Query: 1633 DEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLV 1454
            DEQTVRSLNGCRVTDQILRLVP IQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLV
Sbjct: 184  DEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLV 243

Query: 1453 ARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRRCPKDRFHLM 1274
            ARICQLYPNA+P+MLVSRFFRVYTQWRWPNPV+LCAIEEGSLGLQVWDPRR PKD++HLM
Sbjct: 244  ARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQVWDPRRNPKDKYHLM 303

Query: 1273 PIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKA--DWNTLFEPYPFFESY 1100
            PIITPAYP          STLRIM +EFQRG+EICEAME N+A  DW+TLFEP+ FFE+Y
Sbjct: 304  PIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVDWDTLFEPFTFFEAY 363

Query: 1099 KNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFSDKSRPFHC 920
            KNYL IDI+A NADDLR WKGWVESRLRQLTLKIERHT+NMLQCHPHPGDFSDKS+P +C
Sbjct: 364  KNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFSDKSKPLYC 423

Query: 919  CYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRRRSIPNFVF 740
             YFMGLQRKQGVPV EGEQFDIRLTV+EFK  V+MYTL KPGM+I V HV RR++PNFVF
Sbjct: 424  SYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQISVAHVTRRNLPNFVF 483

Query: 739  XXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXXXDIAGTNL 560
                                  KVS H +            GA          D   T+L
Sbjct: 484  PGGVRPSRPSKGTWDSRRALERKVSSHTKP-----------GADDGRKRKQTDDNVDTHL 532

Query: 559  RAAKCLAAGDSSQRESYEGS-TLSTNSSPSIIV--GNTDANGLVETRGEKVESEIKEGLE 389
            R AKC A   SS  E  EGS  +ST SS SI +   + DAN L  +  EKVE+ + + + 
Sbjct: 533  RNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTDSIR 592

Query: 388  GFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNAHQAFP 209
            G +N  E+   +      +   P  + LS N  SS+ K+AEKLAIEK+MSGPY A QAFP
Sbjct: 593  GSRNSVEVSSHNGKVDGPMIGDPRNKGLSFN--SSNSKDAEKLAIEKIMSGPYVADQAFP 650

Query: 208  XXXXXXENDTEYRNQVKDFGVSTGGGQTESSS-NAKVAISLANVNPAGPRSSSDLNGGLE 32
                  E+D E +NQ KDF  ST      S + N     +L ++N     S+   NGGL 
Sbjct: 651  LELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGGLG 710

Query: 31   ELEPTELVAP 2
            ELEP EL AP
Sbjct: 711  ELEPVELTAP 720


>ref|XP_006493030.1| PREDICTED: poly(A) polymerase-like isoform X1 [Citrus sinensis]
          Length = 748

 Score =  943 bits (2438), Expect = 0.0
 Identities = 496/730 (67%), Positives = 555/730 (76%), Gaps = 6/730 (0%)
 Frame = -1

Query: 2173 NNGQQLGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEVLGRLDQTVKI 1994
            +NGQ+LGITEPIS  GPT+ D+ +T +LEK+L DV LYESQEEAVSREEVLGRLDQ VKI
Sbjct: 4    SNGQRLGITEPISLAGPTDDDLMRTRKLEKYLRDVNLYESQEEAVSREEVLGRLDQIVKI 63

Query: 1993 WVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFGEL 1814
            WVK ISRAKGLN+QL+QEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFGEL
Sbjct: 64   WVKKISRAKGLNDQLLQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFGEL 123

Query: 1813 HRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDISQDSILQNA 1634
            H+ML+EMPEV+ELHPVPDAHVPVM+FKF+GVSIDLLYA+LSLWVIPEDLDISQDSILQNA
Sbjct: 124  HQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIPEDLDISQDSILQNA 183

Query: 1633 DEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLV 1454
            DEQTVRSLNGCRVTDQILRLVP IQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLV
Sbjct: 184  DEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLV 243

Query: 1453 ARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRRCPKDRFHLM 1274
            ARICQLYPNA+P+MLVSRFFRVYTQWRWPNPV+LCAIEEGSLGLQVWDPRR PKD++HLM
Sbjct: 244  ARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQVWDPRRNPKDKYHLM 303

Query: 1273 PIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKA--DWNTLFEPYPFFESY 1100
            PIITPAYP          STLRIM +EFQRG+EICEAME N+A  DW+TLFEP+ FFE+Y
Sbjct: 304  PIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVDWDTLFEPFTFFEAY 363

Query: 1099 KNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFSDKSRPFHC 920
            KNYL IDI+A NADDLR WKGWVESRLRQLTLKIERHT+NMLQCHPHPGDFSDKS+P +C
Sbjct: 364  KNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFSDKSKPLYC 423

Query: 919  CYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRRRSIPNFVF 740
             YFMGLQRKQGVPV EGEQFDIRLTV+EFK  V+MYTL KPGM+I V HV RR++PNFVF
Sbjct: 424  SYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQISVAHVTRRNLPNFVF 483

Query: 739  XXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXXXDIAGTNL 560
                                  KVS H +            GA          D   T+L
Sbjct: 484  PGGVRPSRPSKGTWDSRRALERKVSSHTKP-----------GADDGRKRKQTDDNVDTHL 532

Query: 559  RAAKCLAAGDSSQRESYEGS-TLSTNSSPSIIV--GNTDANGLVETRGEKVESEIKEGLE 389
            R AKC A   SS  E  EGS  +ST SS SI +   + DAN L  +  EKVE+ + + + 
Sbjct: 533  RNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTDSIR 592

Query: 388  GFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNAHQAFP 209
            G +N  E+   +      +   P  + LS N  SS+ K+AEKLAIEK+MSGPY A QAFP
Sbjct: 593  GSRNSVEVSSHNGKVDGPMIGDPRNKGLSFN--SSNSKDAEKLAIEKIMSGPYVADQAFP 650

Query: 208  XXXXXXENDTEYRNQVKDFGVSTGGGQTESSS-NAKVAISLANVNPAGPRSSSDLNGGLE 32
                  E D E +NQ KDF  ST      S + N     +L ++N     S+   NGGL 
Sbjct: 651  LELDQLEVDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGGLG 710

Query: 31   ELEPTELVAP 2
            ELEP EL AP
Sbjct: 711  ELEPVELTAP 720


>ref|XP_008461688.1| PREDICTED: poly(A) polymerase beta isoform X1 [Cucumis melo]
          Length = 747

 Score =  942 bits (2435), Expect = 0.0
 Identities = 492/736 (66%), Positives = 549/736 (74%), Gaps = 3/736 (0%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQ-LGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEV 2024
            MGS     RNNGQQ LGIT+PIS  GPT +DV KT ELEK+L D GLYESQEEAV+REEV
Sbjct: 1    MGSPALCGRNNGQQRLGITDPISLSGPTEYDVLKTRELEKYLQDAGLYESQEEAVNREEV 60

Query: 2023 LGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 1844
            LGRLDQ VKIWVK ISR+KGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA
Sbjct: 61   LGRLDQIVKIWVKAISRSKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120

Query: 1843 TREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLD 1664
            TREEDFFGELH+MLSEMPEVSELHPVPDAHVPVMRFK +GVSIDLLYAKLSLWVIPEDLD
Sbjct: 121  TREEDFFGELHKMLSEMPEVSELHPVPDAHVPVMRFKLSGVSIDLLYAKLSLWVIPEDLD 180

Query: 1663 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 1484
            ISQ+SILQN DEQTVRSLNGCRVTD+ILRLVPNIQ+FRTTLRCMRFWAKRRGVYSNV+GF
Sbjct: 181  ISQESILQNTDEQTVRSLNGCRVTDRILRLVPNIQSFRTTLRCMRFWAKRRGVYSNVSGF 240

Query: 1483 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPR 1304
            LGGINWALLVARICQLYPNALPNMLVSRFFRV+TQWRWPNPVMLCA EEGSLGL VWDPR
Sbjct: 241  LGGINWALLVARICQLYPNALPNMLVSRFFRVFTQWRWPNPVMLCANEEGSLGLPVWDPR 300

Query: 1303 RCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFE 1124
            R PKDR+HLMPIITPAYP          STLRIMTEEFQRG++ICE ME NKADW+TLFE
Sbjct: 301  RNPKDRYHLMPIITPAYPCMNSSYNVSASTLRIMTEEFQRGHDICEVMEENKADWDTLFE 360

Query: 1123 PYPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFS 944
            PYPFFE+YKNYL+IDITA N DD+R WKGWVESRLRQLTLKIERHT+NMLQCHP+PGDFS
Sbjct: 361  PYPFFEAYKNYLQIDITAENDDDIRIWKGWVESRLRQLTLKIERHTYNMLQCHPYPGDFS 420

Query: 943  DKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRR 764
            DKSRPFH CYFMGLQRKQG P +  EQFDIRLTV+EFK +V +YT  K GMEIYV+HV+R
Sbjct: 421  DKSRPFHHCYFMGLQRKQGGPASGSEQFDIRLTVDEFKRSVNVYTQRKRGMEIYVSHVKR 480

Query: 763  RSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXX 584
            RSIPNFVF                      K S   + +   E    L G          
Sbjct: 481  RSIPNFVFPGGVRPSRASKLTWDIRRSSELKASDSTQVDSPSEVTESLDGDDRRKKIRID 540

Query: 583  XDIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTDANGLVETRGEKVESEI 404
             D A TNLR  +CLAA +S   E +E S +S  SS SI                K  + I
Sbjct: 541  DD-ANTNLRNGECLAAANSHHEEVHEVSQVSNTSSCSI----------------KDVNFI 583

Query: 403  KEGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNA 224
                   +N +++  ++ G    +   P+  V     ++S+CKEAEKLAI+K++S  Y++
Sbjct: 584  PTSTNNLENLADVSSQNNGDHGSMGVNPSKNVSDTAADTSNCKEAEKLAIQKILSDSYDS 643

Query: 223  HQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESS--SNAKVAISLANVNPAGPRSSSD 50
            HQ FP        D +Y  Q KDFG +  G    SS  + + V +   + N A   SSS 
Sbjct: 644  HQDFP-CEPEELEDFDYNKQAKDFGATKQGSPMMSSVANTSPVVLPTVSCNEARQSSSSY 702

Query: 49   LNGGLEELEPTELVAP 2
             NGGLEELEP E+VAP
Sbjct: 703  SNGGLEELEPAEIVAP 718


>ref|XP_014513245.1| PREDICTED: nuclear poly(A) polymerase 1-like [Vigna radiata var.
            radiata]
          Length = 749

 Score =  938 bits (2425), Expect = 0.0
 Identities = 490/736 (66%), Positives = 551/736 (74%), Gaps = 3/736 (0%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQ-LGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEV 2024
            MG  G +++NNGQQ LGITEPIS  GPT  D+ KT ELEK+L  VGLYESQEEAV REEV
Sbjct: 1    MGIPGLSDQNNGQQRLGITEPISLAGPTEDDLIKTRELEKYLQGVGLYESQEEAVGREEV 60

Query: 2023 LGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 1844
            LGRLDQ VKIWVK ISR KG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA
Sbjct: 61   LGRLDQIVKIWVKNISRGKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120

Query: 1843 TREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLD 1664
            +R+EDFFGEL +MLSEM EV+ELHPVPDAHVPVM+FKFNGVS+DLLYA+L+LWVIPEDLD
Sbjct: 121  SRDEDFFGELRKMLSEMQEVTELHPVPDAHVPVMKFKFNGVSVDLLYARLALWVIPEDLD 180

Query: 1663 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 1484
            ISQ+SILQN DEQTV SLNGCRVTDQ+LRLVPNIQ FRTTLRCMRFWAKRRGVYSNVAGF
Sbjct: 181  ISQESILQNVDEQTVLSLNGCRVTDQVLRLVPNIQTFRTTLRCMRFWAKRRGVYSNVAGF 240

Query: 1483 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPR 1304
            LGGIN ALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGL VWDPR
Sbjct: 241  LGGINLALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLPVWDPR 300

Query: 1303 RCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFE 1124
            R PKDR+HLMPIITPAYP          STLR+M++EFQRG+EICEAMEA+KADWN LFE
Sbjct: 301  RNPKDRYHLMPIITPAYPCMNSTYNVTSSTLRVMSDEFQRGSEICEAMEASKADWNALFE 360

Query: 1123 PYPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFS 944
            PYPFFESYKNYL+IDITA NADDLR+WKGWVESRLRQLTLKIERHT+ MLQCHPHPG+FS
Sbjct: 361  PYPFFESYKNYLQIDITAENADDLRQWKGWVESRLRQLTLKIERHTYGMLQCHPHPGEFS 420

Query: 943  DKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRR 764
            DKSRPFH CYFMGLQRKQGVPVNEGEQFDIRLTVEEFKH+V  YTLWKPGM+I+V+HV+R
Sbjct: 421  DKSRPFHHCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNAYTLWKPGMDIHVSHVKR 480

Query: 763  RSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXX 584
            R+IP ++F                      + SGH ++E+S  GK V  GA         
Sbjct: 481  RNIPAYIFPGGVRPSGPSKVTAENKQSSKLRASGHGQAEKSQGGKGVAVGADDVKKRRRS 540

Query: 583  XDIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTDANGLVETRGEKVESEI 404
             D    + + +K   +     RE  E  T        I    ++ N +   + +K+    
Sbjct: 541  EDDMDNSSKNSKSPVSLPPPSREVNEDMT-------PITWDESEVNSIDGQKSKKL---- 589

Query: 403  KEGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNA 224
                    +P E+PP  +G    V     +  + A  + SS KE EKLAIEK+MSGPY+A
Sbjct: 590  -----CLTSPGEIPPGDSGTNGSVASNQPVNPILAATDISSSKEEEKLAIEKIMSGPYDA 644

Query: 223  HQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESSSNAKVAISLANVNPAGPRSSSDL- 47
            HQAFP      E+DT+YRNQVKD   S     TES +  K A++   V       S+ L 
Sbjct: 645  HQAFPEEPEELEDDTQYRNQVKDTAGSL-KNITESLA-LKPAVAEEPVVCVETTCSTSLC 702

Query: 46   -NGGLEELEPTELVAP 2
             N G EELE  EL AP
Sbjct: 703  SNEGSEELESAELTAP 718


>gb|KOM54431.1| hypothetical protein LR48_Vigan10g032300 [Vigna angularis]
          Length = 749

 Score =  936 bits (2420), Expect = 0.0
 Identities = 487/742 (65%), Positives = 554/742 (74%), Gaps = 9/742 (1%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQ-LGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEV 2024
            MG  G +++NNGQQ LGITEPIS  GP+  D+ KT ELEK+L  VGLYESQEEAV REEV
Sbjct: 1    MGIPGLSDQNNGQQRLGITEPISLAGPSEDDLIKTRELEKYLQGVGLYESQEEAVGREEV 60

Query: 2023 LGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 1844
            LGRLDQ VKIWVK ISR KG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA
Sbjct: 61   LGRLDQIVKIWVKNISRGKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120

Query: 1843 TREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLD 1664
            TR+EDFFGEL +MLSEM EV+ELHPVPDAHVPVM+FKFNGVS+DLLYA+L+LWVIPEDLD
Sbjct: 121  TRDEDFFGELRKMLSEMQEVTELHPVPDAHVPVMKFKFNGVSVDLLYARLALWVIPEDLD 180

Query: 1663 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 1484
            ISQ+SILQN DEQTV SLNGCRVTDQ+LRLVPNIQ FRTTLRCMRFWAKRRGVYSNVAGF
Sbjct: 181  ISQESILQNVDEQTVLSLNGCRVTDQVLRLVPNIQTFRTTLRCMRFWAKRRGVYSNVAGF 240

Query: 1483 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPR 1304
            LGGIN ALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGL VWDPR
Sbjct: 241  LGGINLALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLPVWDPR 300

Query: 1303 RCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFE 1124
            R PKDR+HLMPIITPAYP          STLR+M++EFQRG+EICEAMEA+KADWN LFE
Sbjct: 301  RNPKDRYHLMPIITPAYPCMNSTYNVTSSTLRVMSDEFQRGSEICEAMEASKADWNALFE 360

Query: 1123 PYPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFS 944
            PYPFFESYKNYL+IDITA NADDLR+WKGWVESRLRQLTLKIERHT+ MLQCHPHPG+FS
Sbjct: 361  PYPFFESYKNYLQIDITAENADDLRQWKGWVESRLRQLTLKIERHTYGMLQCHPHPGEFS 420

Query: 943  DKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRR 764
            DKSRPFH CYFMGLQRKQGVPVNEGEQFDIRLTVEEFKH+V  YTLWKPGM+I+V+HV+R
Sbjct: 421  DKSRPFHHCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNAYTLWKPGMDIHVSHVKR 480

Query: 763  RSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXX 584
            R+IP ++F                      + SGH ++E+S  GK V  GA         
Sbjct: 481  RNIPAYIFPGGVRPSGPSKVTAENKQSSKLRASGHGQAEKSQGGKGVSVGADDVKKR--- 537

Query: 583  XDIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTDAN-GLVETRGEKVESE 407
                                  +  + S+ ++ S  S+   + + N  +   + ++ E  
Sbjct: 538  ------------------RRSEDDMDNSSKNSKSPVSLPPPSREVNEDMTPIKWDESEVN 579

Query: 406  IKEGLEGFK----NPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMS 239
              +G +  K    +P E+PP  +G    V     +  + A  + SS KE EKLAIEK+MS
Sbjct: 580  SSDGQKSKKLCLTSPGEIPPGDSGTNGSVASNQPVNPILAATDISSSKEEEKLAIEKIMS 639

Query: 238  GPYNAHQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESSSNA-KVAISLANVNPAGPR 62
            GPY+AHQAFP      E+DT+YR QVKD   + G  +  + S A K A++   V      
Sbjct: 640  GPYDAHQAFPEEPEELEDDTQYRTQVKD---TAGSLENITESLALKPAVAEEPVVYMETT 696

Query: 61   SSSDL--NGGLEELEPTELVAP 2
             S+ L  N GLEELE  EL AP
Sbjct: 697  CSNSLCSNEGLEELESAELTAP 718


>ref|XP_007163961.1| hypothetical protein PHAVU_L002300g [Phaseolus vulgaris]
            gi|561039848|gb|ESW35955.1| hypothetical protein
            PHAVU_L002300g [Phaseolus vulgaris]
          Length = 749

 Score =  935 bits (2417), Expect = 0.0
 Identities = 486/734 (66%), Positives = 545/734 (74%), Gaps = 1/734 (0%)
 Frame = -1

Query: 2200 MGSHGFNNRNNGQQ-LGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEV 2024
            MG  G +++NNGQQ LGITEPIS  GPT  DV KT ELEK+L  VGLYESQEEAV REEV
Sbjct: 1    MGIPGLSDQNNGQQRLGITEPISLAGPTEDDVIKTRELEKYLQGVGLYESQEEAVGREEV 60

Query: 2023 LGRLDQTVKIWVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 1844
            LGRLDQTVKIWVK ISR KG NEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA
Sbjct: 61   LGRLDQTVKIWVKNISRGKGFNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHA 120

Query: 1843 TREEDFFGELHRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLD 1664
            TR+EDFFGEL  MLSEM EV+ELHPVPDAHVPVM+FKFNGVS+DLLYA+L+LWVIPEDLD
Sbjct: 121  TRDEDFFGELKNMLSEMQEVTELHPVPDAHVPVMKFKFNGVSVDLLYARLALWVIPEDLD 180

Query: 1663 ISQDSILQNADEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGF 1484
            ISQ+SILQN DEQTV SLNGCRVTDQ+LRLVPNIQ FRTTLRCMRFWAKRRGVYSNVAGF
Sbjct: 181  ISQESILQNVDEQTVLSLNGCRVTDQVLRLVPNIQTFRTTLRCMRFWAKRRGVYSNVAGF 240

Query: 1483 LGGINWALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPR 1304
            LGGIN ALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGL VWDPR
Sbjct: 241  LGGINLALLVARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLSVWDPR 300

Query: 1303 RCPKDRFHLMPIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKADWNTLFE 1124
            R PKDR+HLMPIITPAYP          STLR+M++EFQRG+EICE MEA+KADW+TLFE
Sbjct: 301  RNPKDRYHLMPIITPAYPCMNSTYNVTSSTLRVMSDEFQRGSEICEVMEASKADWDTLFE 360

Query: 1123 PYPFFESYKNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFS 944
            PYPFFESYKNYL+IDITA NADDLR+WKGWVESRLRQLTLKIERHT+ MLQCHPHPG+FS
Sbjct: 361  PYPFFESYKNYLQIDITAENADDLRQWKGWVESRLRQLTLKIERHTYGMLQCHPHPGEFS 420

Query: 943  DKSRPFHCCYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRR 764
            DKSRPFH  YFMGLQRKQGVPVNEGEQFDIRLTVEEFKH+V  YTLWKPGM+I+V+HV+R
Sbjct: 421  DKSRPFHHSYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHSVNAYTLWKPGMDIHVSHVKR 480

Query: 763  RSIPNFVFXXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXX 584
            R+IP ++F                      + SGH ++E+   GK V  GA         
Sbjct: 481  RNIPAYIFPGGVRPSCPSKVSSENKQSSKLRASGHGQAEKPQGGKGVAVGADDVKKRRRS 540

Query: 583  XDIAGTNLRAAKCLAAGDSSQRESYEGSTLSTNSSPSIIVGNTDANGLVETRGEKVESEI 404
             D   +  + +K   +     RE  E         PSI +  ++ N +   + +K+    
Sbjct: 541  EDDMDSISKNSKSPVSLPPPSREVNE-------DMPSIKLDESEVNSIDGQKSKKL---- 589

Query: 403  KEGLEGFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNA 224
                   ++P E+P   +   + V     +  + A  + SS KE EKLAIEK+MSGPY+A
Sbjct: 590  -----CLRSPGEIPSGDSVTDESVPSNQLVNPILAATDMSSSKEEEKLAIEKIMSGPYDA 644

Query: 223  HQAFPXXXXXXENDTEYRNQVKDFGVSTGGGQTESSSNAKVAISLANVNPAGPRSSSDLN 44
            HQAFP      E+DT+YRNQVKD G S             VA   A        +S   N
Sbjct: 645  HQAFPEEPEELEDDTQYRNQVKDTGGSMKNVTESLVPKPAVAEEPAVSMKTTCSTSLCSN 704

Query: 43   GGLEELEPTELVAP 2
              LEELE  EL AP
Sbjct: 705  EDLEELESAELTAP 718


>ref|XP_006428725.1| hypothetical protein CICLE_v10011139mg [Citrus clementina]
            gi|557530782|gb|ESR41965.1| hypothetical protein
            CICLE_v10011139mg [Citrus clementina]
            gi|641823208|gb|KDO42641.1| hypothetical protein
            CISIN_1g004767mg [Citrus sinensis]
          Length = 732

 Score =  934 bits (2415), Expect = 0.0
 Identities = 491/723 (67%), Positives = 551/723 (76%), Gaps = 6/723 (0%)
 Frame = -1

Query: 2173 NNGQQLGITEPISYGGPTNHDVAKTHELEKFLADVGLYESQEEAVSREEVLGRLDQTVKI 1994
            +NGQ+LGITEPIS  GPT+ D+ +T +LEK+L DV LYESQEEAVSREEVLGRLDQ VKI
Sbjct: 4    SNGQRLGITEPISLAGPTDDDLMRTRKLEKYLRDVNLYESQEEAVSREEVLGRLDQIVKI 63

Query: 1993 WVKTISRAKGLNEQLVQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFGEL 1814
            WVK ISRAKGLN+QL+QEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFGEL
Sbjct: 64   WVKKISRAKGLNDQLLQEANAKIFTFGSYRLGVHGPGADIDTLCVGPRHATREEDFFGEL 123

Query: 1813 HRMLSEMPEVSELHPVPDAHVPVMRFKFNGVSIDLLYAKLSLWVIPEDLDISQDSILQNA 1634
            H+ML+EMPEV+ELHPVPDAHVPVM+FKF+GVSIDLLYA+LSLWVIPEDLDISQDSILQNA
Sbjct: 124  HQMLTEMPEVTELHPVPDAHVPVMKFKFSGVSIDLLYARLSLWVIPEDLDISQDSILQNA 183

Query: 1633 DEQTVRSLNGCRVTDQILRLVPNIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLV 1454
            DEQTVRSLNGCRVTDQILRLVP IQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLV
Sbjct: 184  DEQTVRSLNGCRVTDQILRLVPKIQNFRTTLRCMRFWAKRRGVYSNVAGFLGGINWALLV 243

Query: 1453 ARICQLYPNALPNMLVSRFFRVYTQWRWPNPVMLCAIEEGSLGLQVWDPRRCPKDRFHLM 1274
            ARICQLYPNA+P+MLVSRFFRVYTQWRWPNPV+LCAIEEGSLGLQVWDPRR PKD++HLM
Sbjct: 244  ARICQLYPNAVPSMLVSRFFRVYTQWRWPNPVLLCAIEEGSLGLQVWDPRRNPKDKYHLM 303

Query: 1273 PIITPAYPXXXXXXXXXXSTLRIMTEEFQRGNEICEAMEANKA--DWNTLFEPYPFFESY 1100
            PIITPAYP          STLRIM +EFQRG+EICEAME N+A  DW+TLFEP+ FFE+Y
Sbjct: 304  PIITPAYPCMNSSYNVSTSTLRIMMDEFQRGHEICEAMEKNEADVDWDTLFEPFTFFEAY 363

Query: 1099 KNYLEIDITAANADDLRKWKGWVESRLRQLTLKIERHTFNMLQCHPHPGDFSDKSRPFHC 920
            KNYL IDI+A NADDLR WKGWVESRLRQLTLKIERHT+NMLQCHPHPGDFSDKS+P +C
Sbjct: 364  KNYLRIDISAENADDLRNWKGWVESRLRQLTLKIERHTYNMLQCHPHPGDFSDKSKPLYC 423

Query: 919  CYFMGLQRKQGVPVNEGEQFDIRLTVEEFKHTVAMYTLWKPGMEIYVTHVRRRSIPNFVF 740
             YFMGLQRKQGVPV EGEQFDIRLTV+EFK  V+MYTL KPGM+I V HV RR++PNFVF
Sbjct: 424  SYFMGLQRKQGVPVGEGEQFDIRLTVKEFKQAVSMYTLRKPGMQISVAHVTRRNLPNFVF 483

Query: 739  XXXXXXXXXXXXXXXXXXXXXPKVSGHNESEESPEGKLVLGGAXXXXXXXXXXDIAGTNL 560
                                  KVS H +            GA          D   T+L
Sbjct: 484  PGGVRPSRPSKGTWDSRRALERKVSSHTKP-----------GADDGRKRKQTDDNVDTHL 532

Query: 559  RAAKCLAAGDSSQRESYEGS-TLSTNSSPSIIV--GNTDANGLVETRGEKVESEIKEGLE 389
            R AKC A   SS  E  EGS  +ST SS SI +   + DAN L  +  EKVE+ + + + 
Sbjct: 533  RNAKCHATMPSSSGEFREGSPIMSTISSSSINLQFEHMDANELAGSNREKVENNLTDSIR 592

Query: 388  GFKNPSELPPKSAGAIDEVRCRPTIEVLSANDESSSCKEAEKLAIEKLMSGPYNAHQAFP 209
            G +N  E+   +      +   P  + LS N  SS+ K+AEKLAIEK+MSGPY A QAFP
Sbjct: 593  GSRNSVEVSSHNGKVDGPMIGDPRNKGLSFN--SSNSKDAEKLAIEKIMSGPYVADQAFP 650

Query: 208  XXXXXXENDTEYRNQVKDFGVSTGGGQTESSS-NAKVAISLANVNPAGPRSSSDLNGGLE 32
                  E+D E +NQ KDF  ST      S + N     +L ++N     S+   NGGL 
Sbjct: 651  LELDQLEDDLELKNQAKDFAGSTQNNSLGSCAVNIAAEATLTSMNGGSSSSALSPNGGLG 710

Query: 31   ELE 23
            ELE
Sbjct: 711  ELE 713

