BLASTX nr result

ID: Mentha23_contig00005690 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00005690
         (2174 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265454.1| PREDICTED: DNA-directed RNA polymerase III s...  1214   0.0  
ref|XP_006494574.1| PREDICTED: DNA-directed RNA polymerase III s...  1213   0.0  
ref|XP_006494572.1| PREDICTED: DNA-directed RNA polymerase III s...  1213   0.0  
ref|XP_006432385.1| hypothetical protein CICLE_v10000078mg [Citr...  1212   0.0  
ref|XP_007010803.1| DNA-directed RNA polymerase isoform 5 [Theob...  1212   0.0  
ref|XP_007010802.1| DNA-directed RNA polymerase isoform 4 [Theob...  1212   0.0  
ref|XP_007010801.1| DNA-directed RNA polymerase isoform 3 [Theob...  1212   0.0  
ref|XP_007010800.1| DNA-directed RNA polymerase isoform 2 [Theob...  1212   0.0  
ref|XP_007010799.1| DNA-directed RNA polymerase isoform 1 [Theob...  1212   0.0  
ref|XP_007010805.1| DNA-directed RNA polymerase isoform 7 [Theob...  1200   0.0  
ref|XP_007010804.1| DNA-directed RNA polymerase isoform 6 [Theob...  1200   0.0  
ref|XP_004249732.1| PREDICTED: DNA-directed RNA polymerase III s...  1198   0.0  
ref|XP_006362214.1| PREDICTED: DNA-directed RNA polymerase III s...  1197   0.0  
ref|XP_002525541.1| DNA-directed RNA polymerase III subunit, put...  1179   0.0  
gb|EYU26925.1| hypothetical protein MIMGU_mgv1a000512mg [Mimulus...  1167   0.0  
ref|XP_006398163.1| hypothetical protein EUTSA_v10000748mg [Eutr...  1164   0.0  
ref|XP_006279912.1| hypothetical protein CARUB_v10025768mg [Caps...  1163   0.0  
ref|XP_004141655.1| PREDICTED: DNA-directed RNA polymerase III s...  1160   0.0  
ref|XP_002321360.2| hypothetical protein POPTR_0015s005302g, par...  1157   0.0  
ref|XP_002321356.2| hypothetical protein POPTR_0015s005302g, par...  1157   0.0  

>ref|XP_002265454.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC2 [Vitis
            vinifera]
          Length = 1142

 Score = 1214 bits (3140), Expect = 0.0
 Identities = 596/728 (81%), Positives = 658/728 (90%), Gaps = 4/728 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQ-NNTRPEKEGRALDILRDTFLANIPVREN 177
            LLPS+EEC    ++TQ +ALE+LE KV++    N   EKEGR + ILRDTF+AN+PVR+N
Sbjct: 263  LLPSMEECASHGIYTQQQALEFLERKVKKLPFYNPSLEKEGRGMAILRDTFIANVPVRQN 322

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC+YVAVMLRRMM+AILNKD MDDKDYVGNKRLELSGQL++LLFEDLFKTM  E  
Sbjct: 323  NFRPKCLYVAVMLRRMMDAILNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTMISEVK 382

Query: 358  RKMEVVLSRLSRSNRFDISQ---LIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAVA 528
            + ++ +L++ SRS+RFD SQ    I  +SIT GLERTLSTGN+D+KRF+MHRKGM+Q VA
Sbjct: 383  KTIDAILAKPSRSSRFDFSQCLRFIVRDSITVGLERTLSTGNWDVKRFRMHRKGMSQVVA 442

Query: 529  RLSYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTH 708
            RLSYI +LG MTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTH
Sbjct: 443  RLSYIGSLGHMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTH 502

Query: 709  VTTDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRK 888
            VTTDE+E PLISLCY+LGVEDLELLS EELH PNS+LIIFNGLILGKHRRPQ+FAN++RK
Sbjct: 503  VTTDEEESPLISLCYSLGVEDLELLSGEELHTPNSFLIIFNGLILGKHRRPQRFANALRK 562

Query: 889  LRRAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSF 1068
            LRRAGKIGEFVS+FVNEKQ CVYIASDGGRVCRP+VIADKG SRIKEHHMKEL DGVR+F
Sbjct: 563  LRRAGKIGEFVSVFVNEKQHCVYIASDGGRVCRPVVIADKGKSRIKEHHMKELIDGVRTF 622

Query: 1069 QSFLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSP 1248
              FL++GLIEYLDVNEENNALIALYE +AK ETTHIEIEPFTILGVCAGLIP+PHHNQSP
Sbjct: 623  DDFLRDGLIEYLDVNEENNALIALYEADAKPETTHIEIEPFTILGVCAGLIPFPHHNQSP 682

Query: 1249 RNTYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASV 1428
            RNTYQCAMGKQAMGNIAYNQLCRMD+L+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+V
Sbjct: 683  RNTYQCAMGKQAMGNIAYNQLCRMDSLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATV 742

Query: 1429 AVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAE 1608
            AVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKK SAV Q+YEN  SDRI+ P + GH AE
Sbjct: 743  AVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKFSAVNQRYENNASDRIVRPLKVGHDAE 802

Query: 1609 HMQILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPE 1788
             MQILDDDGLAAPGEII+P DIYINK++P+ TK  G   + V L DS YK S+QT+KGPE
Sbjct: 803  RMQILDDDGLAAPGEIIKPNDIYINKESPIITK--GPLISPVGLPDSAYKPSRQTFKGPE 860

Query: 1789 GETPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSE 1968
            GE  VVDRVALCSD+N+N+C+KF+IRHTRRPEVGDKFSSRHGQKGVCGTI+QQEDFPFSE
Sbjct: 861  GEASVVDRVALCSDKNSNLCIKFLIRHTRRPEVGDKFSSRHGQKGVCGTIIQQEDFPFSE 920

Query: 1969 RGVCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISET 2148
            RG+CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHADKVE IS+T
Sbjct: 921  RGICPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADKVETISKT 980

Query: 2149 LVKHGFSY 2172
            LVKHGFSY
Sbjct: 981  LVKHGFSY 988


>ref|XP_006494574.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC2-like isoform
            X3 [Citrus sinensis]
          Length = 1058

 Score = 1213 bits (3138), Expect = 0.0
 Identities = 592/725 (81%), Positives = 655/725 (90%), Gaps = 1/725 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQNNTRP-EKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    ++TQ KALEYLE KV+R    + P ++EGRA  ILRD FLAN+PV  N
Sbjct: 182  LLPSIEECANLDIYTQEKALEYLEGKVKRSTFGSPPNDREGRAFSILRDVFLANVPVHNN 241

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC YVAVMLRRM+EA+LNKD MDDKDYVGNKRLELSGQL++LLFEDLFKTM  E  
Sbjct: 242  NFRPKCFYVAVMLRRMVEAMLNKDAMDDKDYVGNKRLELSGQLVSLLFEDLFKTMISEVQ 301

Query: 358  RKMEVVLSRLSRSNRFDISQLIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARLS 537
            + ++++LS+ SRS+RFD+SQ I  +SIT GLERTLSTGNFD+KRFKMHRKGMTQ +ARLS
Sbjct: 302  KTVDIILSKPSRSSRFDLSQFIVRDSITVGLERTLSTGNFDVKRFKMHRKGMTQVLARLS 361

Query: 538  YIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT 717
            +I TLG MT++SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT
Sbjct: 362  FIGTLGHMTRVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT 421

Query: 718  DEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLRR 897
            DE+EGPLISLCY LGVEDLELLS EELH PNS+L+IFNGLILGKHRRP+ FA+ MRKLRR
Sbjct: 422  DEEEGPLISLCYCLGVEDLELLSGEELHNPNSFLVIFNGLILGKHRRPKCFADVMRKLRR 481

Query: 898  AGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQSF 1077
            AGKIGEFVS+FVNEKQRCVYIASDGGRVCRPLVIADKG+SRIKEHHMKEL DGVRSF  F
Sbjct: 482  AGKIGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGISRIKEHHMKELLDGVRSFDDF 541

Query: 1078 LKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRNT 1257
            L+EGLIEYLDVNEENNALIALYE +A  +TTHIEIEPFTILGV AGLIPYPHHNQSPRNT
Sbjct: 542  LREGLIEYLDVNEENNALIALYEGDATPDTTHIEIEPFTILGVIAGLIPYPHHNQSPRNT 601

Query: 1258 YQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAVM 1437
            YQCAMGKQAMGNIA+NQLCRMD+L+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAVM
Sbjct: 602  YQCAMGKQAMGNIAFNQLCRMDSLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAVM 661

Query: 1438 SYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHMQ 1617
            SYSGYDIEDAIVMNKSSLDRGFGRCIV+KK +A+ QKY N TSDRI+ P R G GAE MQ
Sbjct: 662  SYSGYDIEDAIVMNKSSLDRGFGRCIVVKKYTAINQKYANSTSDRILRPDRTGPGAERMQ 721

Query: 1618 ILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGET 1797
            ILDDDGLAAPGEII+P D+YINK++P++T+  G   +     DS+Y+S++QTYKGP+GET
Sbjct: 722  ILDDDGLAAPGEIIKPNDVYINKESPLETR--GSIMSPTGQTDSRYRSARQTYKGPDGET 779

Query: 1798 PVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERGV 1977
             VVDRVALCSD+N ++C+KF+IRHTRRPE+GDKFSSRHGQKGVCGTIVQQEDFPFSERG+
Sbjct: 780  CVVDRVALCSDKNGDLCIKFLIRHTRRPELGDKFSSRHGQKGVCGTIVQQEDFPFSERGI 839

Query: 1978 CPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISETLVK 2157
            CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD VE ISETLVK
Sbjct: 840  CPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADTVESISETLVK 899

Query: 2158 HGFSY 2172
            HGFSY
Sbjct: 900  HGFSY 904


>ref|XP_006494572.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC2-like isoform
            X1 [Citrus sinensis] gi|568883651|ref|XP_006494573.1|
            PREDICTED: DNA-directed RNA polymerase III subunit
            RPC2-like isoform X2 [Citrus sinensis]
          Length = 1149

 Score = 1213 bits (3138), Expect = 0.0
 Identities = 592/725 (81%), Positives = 655/725 (90%), Gaps = 1/725 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQNNTRP-EKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    ++TQ KALEYLE KV+R    + P ++EGRA  ILRD FLAN+PV  N
Sbjct: 273  LLPSIEECANLDIYTQEKALEYLEGKVKRSTFGSPPNDREGRAFSILRDVFLANVPVHNN 332

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC YVAVMLRRM+EA+LNKD MDDKDYVGNKRLELSGQL++LLFEDLFKTM  E  
Sbjct: 333  NFRPKCFYVAVMLRRMVEAMLNKDAMDDKDYVGNKRLELSGQLVSLLFEDLFKTMISEVQ 392

Query: 358  RKMEVVLSRLSRSNRFDISQLIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARLS 537
            + ++++LS+ SRS+RFD+SQ I  +SIT GLERTLSTGNFD+KRFKMHRKGMTQ +ARLS
Sbjct: 393  KTVDIILSKPSRSSRFDLSQFIVRDSITVGLERTLSTGNFDVKRFKMHRKGMTQVLARLS 452

Query: 538  YIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT 717
            +I TLG MT++SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT
Sbjct: 453  FIGTLGHMTRVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT 512

Query: 718  DEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLRR 897
            DE+EGPLISLCY LGVEDLELLS EELH PNS+L+IFNGLILGKHRRP+ FA+ MRKLRR
Sbjct: 513  DEEEGPLISLCYCLGVEDLELLSGEELHNPNSFLVIFNGLILGKHRRPKCFADVMRKLRR 572

Query: 898  AGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQSF 1077
            AGKIGEFVS+FVNEKQRCVYIASDGGRVCRPLVIADKG+SRIKEHHMKEL DGVRSF  F
Sbjct: 573  AGKIGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGISRIKEHHMKELLDGVRSFDDF 632

Query: 1078 LKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRNT 1257
            L+EGLIEYLDVNEENNALIALYE +A  +TTHIEIEPFTILGV AGLIPYPHHNQSPRNT
Sbjct: 633  LREGLIEYLDVNEENNALIALYEGDATPDTTHIEIEPFTILGVIAGLIPYPHHNQSPRNT 692

Query: 1258 YQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAVM 1437
            YQCAMGKQAMGNIA+NQLCRMD+L+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAVM
Sbjct: 693  YQCAMGKQAMGNIAFNQLCRMDSLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAVM 752

Query: 1438 SYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHMQ 1617
            SYSGYDIEDAIVMNKSSLDRGFGRCIV+KK +A+ QKY N TSDRI+ P R G GAE MQ
Sbjct: 753  SYSGYDIEDAIVMNKSSLDRGFGRCIVVKKYTAINQKYANSTSDRILRPDRTGPGAERMQ 812

Query: 1618 ILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGET 1797
            ILDDDGLAAPGEII+P D+YINK++P++T+  G   +     DS+Y+S++QTYKGP+GET
Sbjct: 813  ILDDDGLAAPGEIIKPNDVYINKESPLETR--GSIMSPTGQTDSRYRSARQTYKGPDGET 870

Query: 1798 PVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERGV 1977
             VVDRVALCSD+N ++C+KF+IRHTRRPE+GDKFSSRHGQKGVCGTIVQQEDFPFSERG+
Sbjct: 871  CVVDRVALCSDKNGDLCIKFLIRHTRRPELGDKFSSRHGQKGVCGTIVQQEDFPFSERGI 930

Query: 1978 CPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISETLVK 2157
            CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD VE ISETLVK
Sbjct: 931  CPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADTVESISETLVK 990

Query: 2158 HGFSY 2172
            HGFSY
Sbjct: 991  HGFSY 995


>ref|XP_006432385.1| hypothetical protein CICLE_v10000078mg [Citrus clementina]
            gi|557534507|gb|ESR45625.1| hypothetical protein
            CICLE_v10000078mg [Citrus clementina]
          Length = 1149

 Score = 1212 bits (3136), Expect = 0.0
 Identities = 591/725 (81%), Positives = 655/725 (90%), Gaps = 1/725 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQNNTRP-EKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    ++TQ KALEYLE KV+R    + P ++EGRA  ILRD FLAN+PV  N
Sbjct: 273  LLPSIEECANLDIYTQEKALEYLEGKVKRSTFGSPPNDREGRAFSILRDVFLANVPVHNN 332

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC YVAVMLRRM+EA+LNKD MDDKDYVGNKRLELSGQL++LLFEDLFKTM  E  
Sbjct: 333  NFRPKCFYVAVMLRRMVEAMLNKDAMDDKDYVGNKRLELSGQLVSLLFEDLFKTMISEVQ 392

Query: 358  RKMEVVLSRLSRSNRFDISQLIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARLS 537
            + ++++LS+ SRS+RFD+SQ I  +SIT GLERTLSTGNFD+KRFKMHRKGMTQ +ARLS
Sbjct: 393  KTVDIILSKPSRSSRFDLSQFIVRDSITVGLERTLSTGNFDVKRFKMHRKGMTQVLARLS 452

Query: 538  YIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT 717
            +I TLG MT++SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT
Sbjct: 453  FIGTLGHMTRVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT 512

Query: 718  DEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLRR 897
            DE+EGPLISLCY LGVEDLELLS EELH PNS+L+IFNGLILGKHRRP+ FA+ MRKLRR
Sbjct: 513  DEEEGPLISLCYCLGVEDLELLSGEELHNPNSFLVIFNGLILGKHRRPKCFADVMRKLRR 572

Query: 898  AGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQSF 1077
            AGKIGEFVS+FVNEKQRCVYIASDGGRVCRPLVIADKG+SRIKEHHMKEL DGVRSF  F
Sbjct: 573  AGKIGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGISRIKEHHMKELSDGVRSFDDF 632

Query: 1078 LKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRNT 1257
            L+EGLIEYLDVNEENNALIALYE +A  +TTHIEIEPFTILGV AGLIPYPHHNQSPRNT
Sbjct: 633  LREGLIEYLDVNEENNALIALYEGDATPDTTHIEIEPFTILGVIAGLIPYPHHNQSPRNT 692

Query: 1258 YQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAVM 1437
            YQCAMGKQAMGNIA+NQLCRMD+L+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAVM
Sbjct: 693  YQCAMGKQAMGNIAFNQLCRMDSLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAVM 752

Query: 1438 SYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHMQ 1617
            SYSGYDIEDAIVMNKSSLDRGFGRCIV+KK +A+ QKY N TSDRI+ P R G G+E MQ
Sbjct: 753  SYSGYDIEDAIVMNKSSLDRGFGRCIVVKKYTAINQKYANSTSDRILRPDRTGPGSERMQ 812

Query: 1618 ILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGET 1797
            ILDDDGLAAPGEII+P D+YINK++P++T+  G   +     DS+Y+S++QTYKGP+GET
Sbjct: 813  ILDDDGLAAPGEIIKPNDVYINKESPLETR--GSIMSPTGQTDSRYRSARQTYKGPDGET 870

Query: 1798 PVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERGV 1977
             VVDRVALCSD+N ++C+KF+IRHTRRPE+GDKFSSRHGQKGVCGTIVQQEDFPFSERG+
Sbjct: 871  CVVDRVALCSDKNGDLCIKFLIRHTRRPELGDKFSSRHGQKGVCGTIVQQEDFPFSERGI 930

Query: 1978 CPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISETLVK 2157
            CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD VE ISETLVK
Sbjct: 931  CPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADTVESISETLVK 990

Query: 2158 HGFSY 2172
            HGFSY
Sbjct: 991  HGFSY 995


>ref|XP_007010803.1| DNA-directed RNA polymerase isoform 5 [Theobroma cacao]
            gi|508727716|gb|EOY19613.1| DNA-directed RNA polymerase
            isoform 5 [Theobroma cacao]
          Length = 1006

 Score = 1212 bits (3135), Expect = 0.0
 Identities = 597/726 (82%), Positives = 652/726 (89%), Gaps = 2/726 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQ-NNTRPEKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    ++TQ +ALEYLE KV+R        EKEGRAL ILRD FLAN+PVR N
Sbjct: 276  LLPSIEECAGVGIYTQEQALEYLETKVKRVMYTGPASEKEGRALSILRDVFLANVPVRSN 335

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC+YVAVMLRRM+EAILNKD MDDKDYVGNKRLELSGQL++LLFEDLFKT   E  
Sbjct: 336  NFRPKCLYVAVMLRRMVEAILNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTTISEVQ 395

Query: 358  RKMEVVLSRLSRSNRFDISQLIGG-ESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARL 534
            + +++VLS+ SRS+  D SQ +   E+IT GLERTLSTGNFDIKRFKMHRKGMTQ +ARL
Sbjct: 396  KMIDLVLSKPSRSSALDPSQFLRSRETITFGLERTLSTGNFDIKRFKMHRKGMTQVLARL 455

Query: 535  SYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 714
            S+I TLG+MTK+SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT
Sbjct: 456  SFIGTLGYMTKVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 515

Query: 715  TDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLR 894
            TDEDEGPLISLCY LGVEDLELLS EELH PNS+L+I NGLILGKHRRPQ FA +MRKLR
Sbjct: 516  TDEDEGPLISLCYCLGVEDLELLSGEELHTPNSFLVILNGLILGKHRRPQHFAVAMRKLR 575

Query: 895  RAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQS 1074
            RAGK+GEFVS+FVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKEL DGVR+F  
Sbjct: 576  RAGKVGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELLDGVRTFDD 635

Query: 1075 FLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 1254
            FL++GLIEYLDVNEENNALIALYE  A  ETTHIEIEPFTILGVCAGLIPYPHHNQSPRN
Sbjct: 636  FLRDGLIEYLDVNEENNALIALYEGEATPETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 695

Query: 1255 TYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAV 1434
            TYQCAMGKQAMGNIAYNQLCRMDTL+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAV
Sbjct: 696  TYQCAMGKQAMGNIAYNQLCRMDTLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAV 755

Query: 1435 MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHM 1614
            MSYSGYDIEDAIVMNKSSLDRGFGRCIVMK+ SAV QKYE G SDRI+ PQR G G+E M
Sbjct: 756  MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKRYSAVNQKYETGASDRILRPQRTGPGSERM 815

Query: 1615 QILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGE 1794
            QILDDDG+A PGEIIRP DIYINK++ + T+  G   ++ +L DS Y+ ++QTYKGPEGE
Sbjct: 816  QILDDDGIATPGEIIRPNDIYINKESSIHTR--GSRVSSESLPDSAYRPARQTYKGPEGE 873

Query: 1795 TPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERG 1974
            + VVDRVALC+DRN+N+ +KF+IRHTRRPEVGDKFSSRHGQKGVCGTI+QQEDFPFSERG
Sbjct: 874  SCVVDRVALCTDRNSNLSIKFLIRHTRRPEVGDKFSSRHGQKGVCGTIIQQEDFPFSERG 933

Query: 1975 VCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISETLV 2154
            +CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD+VE ISETL+
Sbjct: 934  ICPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADRVEAISETLI 993

Query: 2155 KHGFSY 2172
            KHGFSY
Sbjct: 994  KHGFSY 999


>ref|XP_007010802.1| DNA-directed RNA polymerase isoform 4 [Theobroma cacao]
            gi|508727715|gb|EOY19612.1| DNA-directed RNA polymerase
            isoform 4 [Theobroma cacao]
          Length = 1034

 Score = 1212 bits (3135), Expect = 0.0
 Identities = 597/726 (82%), Positives = 652/726 (89%), Gaps = 2/726 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQ-NNTRPEKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    ++TQ +ALEYLE KV+R        EKEGRAL ILRD FLAN+PVR N
Sbjct: 281  LLPSIEECAGVGIYTQEQALEYLETKVKRVMYTGPASEKEGRALSILRDVFLANVPVRSN 340

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC+YVAVMLRRM+EAILNKD MDDKDYVGNKRLELSGQL++LLFEDLFKT   E  
Sbjct: 341  NFRPKCLYVAVMLRRMVEAILNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTTISEVQ 400

Query: 358  RKMEVVLSRLSRSNRFDISQLIGG-ESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARL 534
            + +++VLS+ SRS+  D SQ +   E+IT GLERTLSTGNFDIKRFKMHRKGMTQ +ARL
Sbjct: 401  KMIDLVLSKPSRSSALDPSQFLRSRETITFGLERTLSTGNFDIKRFKMHRKGMTQVLARL 460

Query: 535  SYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 714
            S+I TLG+MTK+SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT
Sbjct: 461  SFIGTLGYMTKVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 520

Query: 715  TDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLR 894
            TDEDEGPLISLCY LGVEDLELLS EELH PNS+L+I NGLILGKHRRPQ FA +MRKLR
Sbjct: 521  TDEDEGPLISLCYCLGVEDLELLSGEELHTPNSFLVILNGLILGKHRRPQHFAVAMRKLR 580

Query: 895  RAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQS 1074
            RAGK+GEFVS+FVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKEL DGVR+F  
Sbjct: 581  RAGKVGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELLDGVRTFDD 640

Query: 1075 FLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 1254
            FL++GLIEYLDVNEENNALIALYE  A  ETTHIEIEPFTILGVCAGLIPYPHHNQSPRN
Sbjct: 641  FLRDGLIEYLDVNEENNALIALYEGEATPETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 700

Query: 1255 TYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAV 1434
            TYQCAMGKQAMGNIAYNQLCRMDTL+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAV
Sbjct: 701  TYQCAMGKQAMGNIAYNQLCRMDTLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAV 760

Query: 1435 MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHM 1614
            MSYSGYDIEDAIVMNKSSLDRGFGRCIVMK+ SAV QKYE G SDRI+ PQR G G+E M
Sbjct: 761  MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKRYSAVNQKYETGASDRILRPQRTGPGSERM 820

Query: 1615 QILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGE 1794
            QILDDDG+A PGEIIRP DIYINK++ + T+  G   ++ +L DS Y+ ++QTYKGPEGE
Sbjct: 821  QILDDDGIATPGEIIRPNDIYINKESSIHTR--GSRVSSESLPDSAYRPARQTYKGPEGE 878

Query: 1795 TPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERG 1974
            + VVDRVALC+DRN+N+ +KF+IRHTRRPEVGDKFSSRHGQKGVCGTI+QQEDFPFSERG
Sbjct: 879  SCVVDRVALCTDRNSNLSIKFLIRHTRRPEVGDKFSSRHGQKGVCGTIIQQEDFPFSERG 938

Query: 1975 VCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISETLV 2154
            +CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD+VE ISETL+
Sbjct: 939  ICPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADRVEAISETLI 998

Query: 2155 KHGFSY 2172
            KHGFSY
Sbjct: 999  KHGFSY 1004


>ref|XP_007010801.1| DNA-directed RNA polymerase isoform 3 [Theobroma cacao]
            gi|508727714|gb|EOY19611.1| DNA-directed RNA polymerase
            isoform 3 [Theobroma cacao]
          Length = 1153

 Score = 1212 bits (3135), Expect = 0.0
 Identities = 597/726 (82%), Positives = 652/726 (89%), Gaps = 2/726 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQ-NNTRPEKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    ++TQ +ALEYLE KV+R        EKEGRAL ILRD FLAN+PVR N
Sbjct: 276  LLPSIEECAGVGIYTQEQALEYLETKVKRVMYTGPASEKEGRALSILRDVFLANVPVRSN 335

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC+YVAVMLRRM+EAILNKD MDDKDYVGNKRLELSGQL++LLFEDLFKT   E  
Sbjct: 336  NFRPKCLYVAVMLRRMVEAILNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTTISEVQ 395

Query: 358  RKMEVVLSRLSRSNRFDISQLIGG-ESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARL 534
            + +++VLS+ SRS+  D SQ +   E+IT GLERTLSTGNFDIKRFKMHRKGMTQ +ARL
Sbjct: 396  KMIDLVLSKPSRSSALDPSQFLRSRETITFGLERTLSTGNFDIKRFKMHRKGMTQVLARL 455

Query: 535  SYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 714
            S+I TLG+MTK+SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT
Sbjct: 456  SFIGTLGYMTKVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 515

Query: 715  TDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLR 894
            TDEDEGPLISLCY LGVEDLELLS EELH PNS+L+I NGLILGKHRRPQ FA +MRKLR
Sbjct: 516  TDEDEGPLISLCYCLGVEDLELLSGEELHTPNSFLVILNGLILGKHRRPQHFAVAMRKLR 575

Query: 895  RAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQS 1074
            RAGK+GEFVS+FVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKEL DGVR+F  
Sbjct: 576  RAGKVGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELLDGVRTFDD 635

Query: 1075 FLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 1254
            FL++GLIEYLDVNEENNALIALYE  A  ETTHIEIEPFTILGVCAGLIPYPHHNQSPRN
Sbjct: 636  FLRDGLIEYLDVNEENNALIALYEGEATPETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 695

Query: 1255 TYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAV 1434
            TYQCAMGKQAMGNIAYNQLCRMDTL+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAV
Sbjct: 696  TYQCAMGKQAMGNIAYNQLCRMDTLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAV 755

Query: 1435 MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHM 1614
            MSYSGYDIEDAIVMNKSSLDRGFGRCIVMK+ SAV QKYE G SDRI+ PQR G G+E M
Sbjct: 756  MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKRYSAVNQKYETGASDRILRPQRTGPGSERM 815

Query: 1615 QILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGE 1794
            QILDDDG+A PGEIIRP DIYINK++ + T+  G   ++ +L DS Y+ ++QTYKGPEGE
Sbjct: 816  QILDDDGIATPGEIIRPNDIYINKESSIHTR--GSRVSSESLPDSAYRPARQTYKGPEGE 873

Query: 1795 TPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERG 1974
            + VVDRVALC+DRN+N+ +KF+IRHTRRPEVGDKFSSRHGQKGVCGTI+QQEDFPFSERG
Sbjct: 874  SCVVDRVALCTDRNSNLSIKFLIRHTRRPEVGDKFSSRHGQKGVCGTIIQQEDFPFSERG 933

Query: 1975 VCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISETLV 2154
            +CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD+VE ISETL+
Sbjct: 934  ICPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADRVEAISETLI 993

Query: 2155 KHGFSY 2172
            KHGFSY
Sbjct: 994  KHGFSY 999


>ref|XP_007010800.1| DNA-directed RNA polymerase isoform 2 [Theobroma cacao]
            gi|508727713|gb|EOY19610.1| DNA-directed RNA polymerase
            isoform 2 [Theobroma cacao]
          Length = 1011

 Score = 1212 bits (3135), Expect = 0.0
 Identities = 597/726 (82%), Positives = 652/726 (89%), Gaps = 2/726 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQ-NNTRPEKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    ++TQ +ALEYLE KV+R        EKEGRAL ILRD FLAN+PVR N
Sbjct: 281  LLPSIEECAGVGIYTQEQALEYLETKVKRVMYTGPASEKEGRALSILRDVFLANVPVRSN 340

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC+YVAVMLRRM+EAILNKD MDDKDYVGNKRLELSGQL++LLFEDLFKT   E  
Sbjct: 341  NFRPKCLYVAVMLRRMVEAILNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTTISEVQ 400

Query: 358  RKMEVVLSRLSRSNRFDISQLIGG-ESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARL 534
            + +++VLS+ SRS+  D SQ +   E+IT GLERTLSTGNFDIKRFKMHRKGMTQ +ARL
Sbjct: 401  KMIDLVLSKPSRSSALDPSQFLRSRETITFGLERTLSTGNFDIKRFKMHRKGMTQVLARL 460

Query: 535  SYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 714
            S+I TLG+MTK+SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT
Sbjct: 461  SFIGTLGYMTKVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 520

Query: 715  TDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLR 894
            TDEDEGPLISLCY LGVEDLELLS EELH PNS+L+I NGLILGKHRRPQ FA +MRKLR
Sbjct: 521  TDEDEGPLISLCYCLGVEDLELLSGEELHTPNSFLVILNGLILGKHRRPQHFAVAMRKLR 580

Query: 895  RAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQS 1074
            RAGK+GEFVS+FVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKEL DGVR+F  
Sbjct: 581  RAGKVGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELLDGVRTFDD 640

Query: 1075 FLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 1254
            FL++GLIEYLDVNEENNALIALYE  A  ETTHIEIEPFTILGVCAGLIPYPHHNQSPRN
Sbjct: 641  FLRDGLIEYLDVNEENNALIALYEGEATPETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 700

Query: 1255 TYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAV 1434
            TYQCAMGKQAMGNIAYNQLCRMDTL+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAV
Sbjct: 701  TYQCAMGKQAMGNIAYNQLCRMDTLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAV 760

Query: 1435 MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHM 1614
            MSYSGYDIEDAIVMNKSSLDRGFGRCIVMK+ SAV QKYE G SDRI+ PQR G G+E M
Sbjct: 761  MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKRYSAVNQKYETGASDRILRPQRTGPGSERM 820

Query: 1615 QILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGE 1794
            QILDDDG+A PGEIIRP DIYINK++ + T+  G   ++ +L DS Y+ ++QTYKGPEGE
Sbjct: 821  QILDDDGIATPGEIIRPNDIYINKESSIHTR--GSRVSSESLPDSAYRPARQTYKGPEGE 878

Query: 1795 TPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERG 1974
            + VVDRVALC+DRN+N+ +KF+IRHTRRPEVGDKFSSRHGQKGVCGTI+QQEDFPFSERG
Sbjct: 879  SCVVDRVALCTDRNSNLSIKFLIRHTRRPEVGDKFSSRHGQKGVCGTIIQQEDFPFSERG 938

Query: 1975 VCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISETLV 2154
            +CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD+VE ISETL+
Sbjct: 939  ICPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADRVEAISETLI 998

Query: 2155 KHGFSY 2172
            KHGFSY
Sbjct: 999  KHGFSY 1004


>ref|XP_007010799.1| DNA-directed RNA polymerase isoform 1 [Theobroma cacao]
            gi|508727712|gb|EOY19609.1| DNA-directed RNA polymerase
            isoform 1 [Theobroma cacao]
          Length = 1158

 Score = 1212 bits (3135), Expect = 0.0
 Identities = 597/726 (82%), Positives = 652/726 (89%), Gaps = 2/726 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQ-NNTRPEKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    ++TQ +ALEYLE KV+R        EKEGRAL ILRD FLAN+PVR N
Sbjct: 281  LLPSIEECAGVGIYTQEQALEYLETKVKRVMYTGPASEKEGRALSILRDVFLANVPVRSN 340

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC+YVAVMLRRM+EAILNKD MDDKDYVGNKRLELSGQL++LLFEDLFKT   E  
Sbjct: 341  NFRPKCLYVAVMLRRMVEAILNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTTISEVQ 400

Query: 358  RKMEVVLSRLSRSNRFDISQLIGG-ESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARL 534
            + +++VLS+ SRS+  D SQ +   E+IT GLERTLSTGNFDIKRFKMHRKGMTQ +ARL
Sbjct: 401  KMIDLVLSKPSRSSALDPSQFLRSRETITFGLERTLSTGNFDIKRFKMHRKGMTQVLARL 460

Query: 535  SYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 714
            S+I TLG+MTK+SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT
Sbjct: 461  SFIGTLGYMTKVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 520

Query: 715  TDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLR 894
            TDEDEGPLISLCY LGVEDLELLS EELH PNS+L+I NGLILGKHRRPQ FA +MRKLR
Sbjct: 521  TDEDEGPLISLCYCLGVEDLELLSGEELHTPNSFLVILNGLILGKHRRPQHFAVAMRKLR 580

Query: 895  RAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQS 1074
            RAGK+GEFVS+FVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKEL DGVR+F  
Sbjct: 581  RAGKVGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELLDGVRTFDD 640

Query: 1075 FLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 1254
            FL++GLIEYLDVNEENNALIALYE  A  ETTHIEIEPFTILGVCAGLIPYPHHNQSPRN
Sbjct: 641  FLRDGLIEYLDVNEENNALIALYEGEATPETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 700

Query: 1255 TYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAV 1434
            TYQCAMGKQAMGNIAYNQLCRMDTL+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAV
Sbjct: 701  TYQCAMGKQAMGNIAYNQLCRMDTLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAV 760

Query: 1435 MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHM 1614
            MSYSGYDIEDAIVMNKSSLDRGFGRCIVMK+ SAV QKYE G SDRI+ PQR G G+E M
Sbjct: 761  MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKRYSAVNQKYETGASDRILRPQRTGPGSERM 820

Query: 1615 QILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGE 1794
            QILDDDG+A PGEIIRP DIYINK++ + T+  G   ++ +L DS Y+ ++QTYKGPEGE
Sbjct: 821  QILDDDGIATPGEIIRPNDIYINKESSIHTR--GSRVSSESLPDSAYRPARQTYKGPEGE 878

Query: 1795 TPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERG 1974
            + VVDRVALC+DRN+N+ +KF+IRHTRRPEVGDKFSSRHGQKGVCGTI+QQEDFPFSERG
Sbjct: 879  SCVVDRVALCTDRNSNLSIKFLIRHTRRPEVGDKFSSRHGQKGVCGTIIQQEDFPFSERG 938

Query: 1975 VCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISETLV 2154
            +CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD+VE ISETL+
Sbjct: 939  ICPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADRVEAISETLI 998

Query: 2155 KHGFSY 2172
            KHGFSY
Sbjct: 999  KHGFSY 1004


>ref|XP_007010805.1| DNA-directed RNA polymerase isoform 7 [Theobroma cacao]
            gi|508727718|gb|EOY19615.1| DNA-directed RNA polymerase
            isoform 7 [Theobroma cacao]
          Length = 1082

 Score = 1200 bits (3105), Expect = 0.0
 Identities = 598/749 (79%), Positives = 653/749 (87%), Gaps = 25/749 (3%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQ-NNTRPEKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    ++TQ +ALEYLE KV+R        EKEGRAL ILRD FLAN+PVR N
Sbjct: 182  LLPSIEECAGVGIYTQEQALEYLETKVKRVMYTGPASEKEGRALSILRDVFLANVPVRSN 241

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC+YVAVMLRRM+EAILNKD MDDKDYVGNKRLELSGQL++LLFEDLFKT   E  
Sbjct: 242  NFRPKCLYVAVMLRRMVEAILNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTTISEVQ 301

Query: 358  RKMEVVLSRLSRSNRFDISQLIGG-ESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARL 534
            + +++VLS+ SRS+  D SQ +   E+IT GLERTLSTGNFDIKRFKMHRKGMTQ +ARL
Sbjct: 302  KMIDLVLSKPSRSSALDPSQFLRSRETITFGLERTLSTGNFDIKRFKMHRKGMTQVLARL 361

Query: 535  SYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 714
            S+I TLG+MTK+SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT
Sbjct: 362  SFIGTLGYMTKVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 421

Query: 715  TDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLR 894
            TDEDEGPLISLCY LGVEDLELLS EELH PNS+L+I NGLILGKHRRPQ FA +MRKLR
Sbjct: 422  TDEDEGPLISLCYCLGVEDLELLSGEELHTPNSFLVILNGLILGKHRRPQHFAVAMRKLR 481

Query: 895  RAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQS 1074
            RAGK+GEFVS+FVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKEL DGVR+F  
Sbjct: 482  RAGKVGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELLDGVRTFDD 541

Query: 1075 FLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 1254
            FL++GLIEYLDVNEENNALIALYE  A  ETTHIEIEPFTILGVCAGLIPYPHHNQSPRN
Sbjct: 542  FLRDGLIEYLDVNEENNALIALYEGEATPETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 601

Query: 1255 TYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVS-------------- 1392
            TYQCAMGKQAMGNIAYNQLCRMDTL+YLLVYPQRPLLTTRTIELVS              
Sbjct: 602  TYQCAMGKQAMGNIAYNQLCRMDTLLYLLVYPQRPLLTTRTIELVSGLYVVDYSTPFCPV 661

Query: 1393 ---------YDKLGAGQNASVAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQ 1545
                     YDKLGAGQNA+VAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMK+ SAV Q
Sbjct: 662  NCLLCFQVGYDKLGAGQNATVAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKRYSAVNQ 721

Query: 1546 KYENGTSDRIIGPQRDGHGAEHMQILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAH 1725
            KYE G SDRI+ PQR G G+E MQILDDDG+A PGEIIRP DIYINK++ + T+  G   
Sbjct: 722  KYETGASDRILRPQRTGPGSERMQILDDDGIATPGEIIRPNDIYINKESSIHTR--GSRV 779

Query: 1726 NTVALKDSQYKSSKQTYKGPEGETPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSS 1905
            ++ +L DS Y+ ++QTYKGPEGE+ VVDRVALC+DRN+N+ +KF+IRHTRRPEVGDKFSS
Sbjct: 780  SSESLPDSAYRPARQTYKGPEGESCVVDRVALCTDRNSNLSIKFLIRHTRRPEVGDKFSS 839

Query: 1906 RHGQKGVCGTIVQQEDFPFSERGVCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFH 2085
            RHGQKGVCGTI+QQEDFPFSERG+CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFH
Sbjct: 840  RHGQKGVCGTIIQQEDFPFSERGICPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFH 899

Query: 2086 YGSAFGEPSGHADKVEVISETLVKHGFSY 2172
            YGSAFGEPSGHAD+VE ISETL+KHGFSY
Sbjct: 900  YGSAFGEPSGHADRVEAISETLIKHGFSY 928


>ref|XP_007010804.1| DNA-directed RNA polymerase isoform 6 [Theobroma cacao]
            gi|508727717|gb|EOY19614.1| DNA-directed RNA polymerase
            isoform 6 [Theobroma cacao]
          Length = 1034

 Score = 1200 bits (3105), Expect = 0.0
 Identities = 598/749 (79%), Positives = 653/749 (87%), Gaps = 25/749 (3%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQ-NNTRPEKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    ++TQ +ALEYLE KV+R        EKEGRAL ILRD FLAN+PVR N
Sbjct: 281  LLPSIEECAGVGIYTQEQALEYLETKVKRVMYTGPASEKEGRALSILRDVFLANVPVRSN 340

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKC+YVAVMLRRM+EAILNKD MDDKDYVGNKRLELSGQL++LLFEDLFKT   E  
Sbjct: 341  NFRPKCLYVAVMLRRMVEAILNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTTISEVQ 400

Query: 358  RKMEVVLSRLSRSNRFDISQLIGG-ESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARL 534
            + +++VLS+ SRS+  D SQ +   E+IT GLERTLSTGNFDIKRFKMHRKGMTQ +ARL
Sbjct: 401  KMIDLVLSKPSRSSALDPSQFLRSRETITFGLERTLSTGNFDIKRFKMHRKGMTQVLARL 460

Query: 535  SYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 714
            S+I TLG+MTK+SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT
Sbjct: 461  SFIGTLGYMTKVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 520

Query: 715  TDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLR 894
            TDEDEGPLISLCY LGVEDLELLS EELH PNS+L+I NGLILGKHRRPQ FA +MRKLR
Sbjct: 521  TDEDEGPLISLCYCLGVEDLELLSGEELHTPNSFLVILNGLILGKHRRPQHFAVAMRKLR 580

Query: 895  RAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQS 1074
            RAGK+GEFVS+FVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKEL DGVR+F  
Sbjct: 581  RAGKVGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELLDGVRTFDD 640

Query: 1075 FLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 1254
            FL++GLIEYLDVNEENNALIALYE  A  ETTHIEIEPFTILGVCAGLIPYPHHNQSPRN
Sbjct: 641  FLRDGLIEYLDVNEENNALIALYEGEATPETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 700

Query: 1255 TYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVS-------------- 1392
            TYQCAMGKQAMGNIAYNQLCRMDTL+YLLVYPQRPLLTTRTIELVS              
Sbjct: 701  TYQCAMGKQAMGNIAYNQLCRMDTLLYLLVYPQRPLLTTRTIELVSGLYVVDYSTPFCPV 760

Query: 1393 ---------YDKLGAGQNASVAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQ 1545
                     YDKLGAGQNA+VAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMK+ SAV Q
Sbjct: 761  NCLLCFQVGYDKLGAGQNATVAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKRYSAVNQ 820

Query: 1546 KYENGTSDRIIGPQRDGHGAEHMQILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAH 1725
            KYE G SDRI+ PQR G G+E MQILDDDG+A PGEIIRP DIYINK++ + T+  G   
Sbjct: 821  KYETGASDRILRPQRTGPGSERMQILDDDGIATPGEIIRPNDIYINKESSIHTR--GSRV 878

Query: 1726 NTVALKDSQYKSSKQTYKGPEGETPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSS 1905
            ++ +L DS Y+ ++QTYKGPEGE+ VVDRVALC+DRN+N+ +KF+IRHTRRPEVGDKFSS
Sbjct: 879  SSESLPDSAYRPARQTYKGPEGESCVVDRVALCTDRNSNLSIKFLIRHTRRPEVGDKFSS 938

Query: 1906 RHGQKGVCGTIVQQEDFPFSERGVCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFH 2085
            RHGQKGVCGTI+QQEDFPFSERG+CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFH
Sbjct: 939  RHGQKGVCGTIIQQEDFPFSERGICPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFH 998

Query: 2086 YGSAFGEPSGHADKVEVISETLVKHGFSY 2172
            YGSAFGEPSGHAD+VE ISETL+KHGFSY
Sbjct: 999  YGSAFGEPSGHADRVEAISETLIKHGFSY 1027


>ref|XP_004249732.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC2-like [Solanum
            lycopersicum]
          Length = 1183

 Score = 1198 bits (3099), Expect = 0.0
 Identities = 601/729 (82%), Positives = 652/729 (89%), Gaps = 5/729 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLEN----KVQRYQNNTRP-EKEGRALDILRDTFLANIP 165
            LLPSIEEC    ++TQ +ALE+LE+    K+  Y  +T P EK  RAL ILRD FLAN+P
Sbjct: 305  LLPSIEECADLKLYTQQQALEFLESDKMLKMPSY--STGPIEKGARALSILRDIFLANVP 362

Query: 166  VRENNFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMN 345
            V ++NFR KCIYVAVM+RRMMEAILNKD MDDKDYVGNKRLELSGQLL+LLFEDLFKTMN
Sbjct: 363  VHQHNFRKKCIYVAVMMRRMMEAILNKDAMDDKDYVGNKRLELSGQLLSLLFEDLFKTMN 422

Query: 346  EEASRKMEVVLSRLSRSNRFDISQLIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAV 525
            +EA R ++ +L+R SRS+R DISQ I  +SIT GLERTLSTGN+D+KRF+MHRKGMTQ V
Sbjct: 423  DEARRTIDTLLARPSRSSRLDISQYIIKDSITMGLERTLSTGNWDVKRFRMHRKGMTQVV 482

Query: 526  ARLSYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMT 705
            ARLSYI +LG MTKISPQFEKSRKVSGPRALQPSQ+GMLCPCDTPEGEACGLVKNLALMT
Sbjct: 483  ARLSYIGSLGHMTKISPQFEKSRKVSGPRALQPSQFGMLCPCDTPEGEACGLVKNLALMT 542

Query: 706  HVTTDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMR 885
            HVTTDEDE P++SLCY LGVEDLE LS EELHMP SYLI  NGLILGKH+ PQ+FAN+MR
Sbjct: 543  HVTTDEDERPIMSLCYCLGVEDLEQLSPEELHMPTSYLITLNGLILGKHKSPQRFANAMR 602

Query: 886  KLRRAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRS 1065
            +LRRAGK+GEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHM+ELRDG+R 
Sbjct: 603  RLRRAGKVGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMRELRDGLRD 662

Query: 1066 FQSFLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQS 1245
            F SFLK+GLIEYLDVNEENN LIALYE+ A  ETTHIEIEPFTILGVCAGLIPYPHHNQS
Sbjct: 663  FDSFLKDGLIEYLDVNEENNTLIALYEKEATPETTHIEIEPFTILGVCAGLIPYPHHNQS 722

Query: 1246 PRNTYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNAS 1425
            PRNTYQCAMGKQAMGNIAYNQL RMD L+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+
Sbjct: 723  PRNTYQCAMGKQAMGNIAYNQLNRMDGLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNAT 782

Query: 1426 VAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGA 1605
            VAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKK SA+ QKYENGTSDRII PQR G  A
Sbjct: 783  VAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKYSAICQKYENGTSDRIIKPQRQGPEA 842

Query: 1606 EHMQILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGP 1785
            + MQILDDDG+AAPGE IR  DIYINK++P  T++     + + L DS YKSSKQTYKGP
Sbjct: 843  DRMQILDDDGMAAPGETIRNHDIYINKESPTVTRT--PVTSPMGLPDSAYKSSKQTYKGP 900

Query: 1786 EGETPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFS 1965
            EGET VVDRVAL SDRNNN+ +KFMIRHTRRPE+GDKFSSRHGQKGVCGTIVQQEDFPFS
Sbjct: 901  EGETAVVDRVALYSDRNNNLSIKFMIRHTRRPELGDKFSSRHGQKGVCGTIVQQEDFPFS 960

Query: 1966 ERGVCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISE 2145
            ERG+CPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHAD V+ ISE
Sbjct: 961  ERGICPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADTVDAISE 1020

Query: 2146 TLVKHGFSY 2172
            TLVKHGFSY
Sbjct: 1021 TLVKHGFSY 1029


>ref|XP_006362214.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC2-like isoform
            X1 [Solanum tuberosum] gi|565393087|ref|XP_006362215.1|
            PREDICTED: DNA-directed RNA polymerase III subunit
            RPC2-like isoform X2 [Solanum tuberosum]
          Length = 1153

 Score = 1197 bits (3098), Expect = 0.0
 Identities = 601/729 (82%), Positives = 653/729 (89%), Gaps = 5/729 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLEN----KVQRYQNNTRP-EKEGRALDILRDTFLANIP 165
            LLPSIEEC    ++TQ++ALE+LE+    K+  Y  +T P EK  RAL ILRD FLAN+P
Sbjct: 275  LLPSIEECADLKLYTQHQALEFLESDKMLKMPSY--STGPVEKGARALSILRDIFLANVP 332

Query: 166  VRENNFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMN 345
            V ++NFR KCIYVAVM+RRMMEAILNKD MDDKDYVGNKRLELSGQLL+LLFEDLFKTMN
Sbjct: 333  VHQHNFRKKCIYVAVMMRRMMEAILNKDAMDDKDYVGNKRLELSGQLLSLLFEDLFKTMN 392

Query: 346  EEASRKMEVVLSRLSRSNRFDISQLIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAV 525
            +EA R ++ +L+R SRS+R DISQ I  +SIT GLERTLSTGN+D+KRF+MHRKGMTQ V
Sbjct: 393  DEARRTIDTLLARPSRSSRLDISQYIVKDSITMGLERTLSTGNWDVKRFRMHRKGMTQVV 452

Query: 526  ARLSYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMT 705
            ARLSYI +LG MTKISPQFEKSRKVSGPRALQPSQ+GMLCPCDTPEGEACGLVKNLALMT
Sbjct: 453  ARLSYIGSLGHMTKISPQFEKSRKVSGPRALQPSQFGMLCPCDTPEGEACGLVKNLALMT 512

Query: 706  HVTTDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMR 885
            HVTTDEDE P++SLCY LGVEDLE LS EELHMP SYLI  NGLILGKH+ PQ+FAN+MR
Sbjct: 513  HVTTDEDERPIMSLCYCLGVEDLEQLSPEELHMPTSYLITLNGLILGKHKSPQRFANAMR 572

Query: 886  KLRRAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRS 1065
            +LRRAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHM+ELRDGVR 
Sbjct: 573  RLRRAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMRELRDGVRD 632

Query: 1066 FQSFLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQS 1245
            F SFLK+GLIEYLDVNEENN LIALYE+ A  ETTHIEIEPFTILGVCAGLIPYPHHNQS
Sbjct: 633  FDSFLKDGLIEYLDVNEENNTLIALYEKEATPETTHIEIEPFTILGVCAGLIPYPHHNQS 692

Query: 1246 PRNTYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNAS 1425
            PRNTYQCAMGKQAMGNIAYNQL RMD L+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+
Sbjct: 693  PRNTYQCAMGKQAMGNIAYNQLNRMDGLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNAT 752

Query: 1426 VAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGA 1605
            VAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKK SA+ QKYENGTSDRII PQR G  A
Sbjct: 753  VAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKYSAICQKYENGTSDRIIKPQRQGFEA 812

Query: 1606 EHMQILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGP 1785
            + MQILDDDG+AAPGE IR  DIYINK++P  T++     + + L DS YK+S+QTYKGP
Sbjct: 813  DKMQILDDDGMAAPGETIRNHDIYINKESPTVTRT--PITSPMGLPDSAYKTSRQTYKGP 870

Query: 1786 EGETPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFS 1965
            EGET VVDRVAL SDRNNN+ +KFMIRHTRRPE+GDKFSSRHGQKGVCGTIVQQEDFPFS
Sbjct: 871  EGETAVVDRVALYSDRNNNLSIKFMIRHTRRPEIGDKFSSRHGQKGVCGTIVQQEDFPFS 930

Query: 1966 ERGVCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISE 2145
            ERG+CPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHAD V+ ISE
Sbjct: 931  ERGICPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADTVDSISE 990

Query: 2146 TLVKHGFSY 2172
            TLVKHGFSY
Sbjct: 991  TLVKHGFSY 999


>ref|XP_002525541.1| DNA-directed RNA polymerase III subunit, putative [Ricinus communis]
            gi|223535120|gb|EEF36800.1| DNA-directed RNA polymerase
            III subunit, putative [Ricinus communis]
          Length = 1139

 Score = 1179 bits (3050), Expect = 0.0
 Identities = 579/724 (79%), Positives = 640/724 (88%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQNNTRPEKEGRALDILRDTFLANIPVRENN 180
            LLPSIEEC   S++TQ KALEYL+ K            E RAL ILRD FLAN+PV +NN
Sbjct: 279  LLPSIEECAGLSIYTQQKALEYLDGK------------ENRALTILRDVFLANVPVHKNN 326

Query: 181  FRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEASR 360
            FRPKC+YVAVMLRRMMEA+LNKD MDDKDYVGNKRLELSGQL++LLFEDLFKTM  E  R
Sbjct: 327  FRPKCLYVAVMLRRMMEAMLNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTMITEVQR 386

Query: 361  KMEVVLSRLSRSNRFDISQLIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARLSY 540
             ++ VL++ +RS+RFD++Q I  ++ITNGLERTLSTGNFD+KRFKMHRKGMTQ + RLSY
Sbjct: 387  TIDTVLTKQNRSSRFDLAQYIVRDNITNGLERTLSTGNFDVKRFKMHRKGMTQVLVRLSY 446

Query: 541  IATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTTD 720
            IA+LG MT++SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTTD
Sbjct: 447  IASLGMMTRVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTTD 506

Query: 721  EDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLRRA 900
            ++EGPLISLCY LGVEDLELLS EELH PNS+L+IFNGLILGKHRRPQ F NSMRKLRRA
Sbjct: 507  DEEGPLISLCYCLGVEDLELLSGEELHTPNSFLVIFNGLILGKHRRPQYFVNSMRKLRRA 566

Query: 901  GKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQSFL 1080
            GKIGEFVS+FVNEKQR VY+ASDGGRVCRPLVIAD+GVSRIKEHHMKELRDGVR+F  FL
Sbjct: 567  GKIGEFVSVFVNEKQRAVYLASDGGRVCRPLVIADRGVSRIKEHHMKELRDGVRTFDDFL 626

Query: 1081 KEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRNTY 1260
            ++GLIEYLDVNEENNAL+ALYE  A  ETTHIEIEPFTILGVCAGLIP+PHHNQSPRNTY
Sbjct: 627  RDGLIEYLDVNEENNALVALYEGEATPETTHIEIEPFTILGVCAGLIPFPHHNQSPRNTY 686

Query: 1261 QCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAVMS 1440
            QCAMGKQAMGNIAYNQL RMD+L+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAVMS
Sbjct: 687  QCAMGKQAMGNIAYNQLFRMDSLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAVMS 746

Query: 1441 YSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHMQI 1620
            YSGYDIEDAIVMNK+SLDRGFGRCIVMKK  A+ QKYENG SDRI+ P R     E  ++
Sbjct: 747  YSGYDIEDAIVMNKASLDRGFGRCIVMKKYPAIRQKYENGASDRILRPDRT---VERERV 803

Query: 1621 LDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGETP 1800
            LD DGLAAPGEII+P DIY+ K+ P+DT   G   ++ AL++ +Y+ S  +YKGPEGE+P
Sbjct: 804  LDYDGLAAPGEIIKPSDIYVKKECPIDTM--GPVKSSAALENIKYRPSPLSYKGPEGESP 861

Query: 1801 VVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERGVC 1980
            V+DRVAL SDRNNN+C+K MIRHTRRPEVGDKFSSRHGQKGVCGTI+QQEDFPFSERG+C
Sbjct: 862  VIDRVALSSDRNNNLCIKVMIRHTRRPEVGDKFSSRHGQKGVCGTIIQQEDFPFSERGIC 921

Query: 1981 PDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISETLVKH 2160
            PDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD VE ISETLV  
Sbjct: 922  PDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADTVEAISETLVNR 981

Query: 2161 GFSY 2172
            GFSY
Sbjct: 982  GFSY 985


>gb|EYU26925.1| hypothetical protein MIMGU_mgv1a000512mg [Mimulus guttatus]
          Length = 1103

 Score = 1167 bits (3020), Expect = 0.0
 Identities = 581/697 (83%), Positives = 631/697 (90%), Gaps = 1/697 (0%)
 Frame = +1

Query: 85   RYQNNTRPE-KEGRALDILRDTFLANIPVRENNFRPKCIYVAVMLRRMMEAILNKDTMDD 261
            RY     P  +EGRAL ILRD FLANIPV ++NFR KCIYVAVM+RRMMEAILNKD MDD
Sbjct: 255  RYSELLLPSIEEGRALSILRDIFLANIPVDQDNFRSKCIYVAVMVRRMMEAILNKDAMDD 314

Query: 262  KDYVGNKRLELSGQLLALLFEDLFKTMNEEASRKMEVVLSRLSRSNRFDISQLIGGESIT 441
            KDYVGNKRLELSGQ+L+LLFEDLFK+MNEEA + ++ +L++ SRS+R DISQ I  +SIT
Sbjct: 315  KDYVGNKRLELSGQMLSLLFEDLFKSMNEEAVKSIDKILAKPSRSSRLDISQYIVKDSIT 374

Query: 442  NGLERTLSTGNFDIKRFKMHRKGMTQAVARLSYIATLGFMTKISPQFEKSRKVSGPRALQ 621
             GLERTLSTGNFD+KRF ++RKGM QAVARLSYIAT+G MTKI PQFEKSRKVSGPRALQ
Sbjct: 375  FGLERTLSTGNFDLKRFGVNRKGMAQAVARLSYIATVGHMTKIIPQFEKSRKVSGPRALQ 434

Query: 622  PSQWGMLCPCDTPEGEACGLVKNLALMTHVTTDEDEGPLISLCYTLGVEDLELLSAEELH 801
            PSQWGMLCPCDTPEGEACGLVKNLALMTHVTTDEDEGPLISLCYTLGVE  E LS EE+H
Sbjct: 435  PSQWGMLCPCDTPEGEACGLVKNLALMTHVTTDEDEGPLISLCYTLGVESFEYLSGEEVH 494

Query: 802  MPNSYLIIFNGLILGKHRRPQKFANSMRKLRRAGKIGEFVSIFVNEKQRCVYIASDGGRV 981
            MPNSYLIIFNGLI+GKHRRPQ+F+N+MR LRRAGKIGEF+SIFVNEKQRCVYIASDGGRV
Sbjct: 495  MPNSYLIIFNGLIIGKHRRPQRFSNAMRTLRRAGKIGEFISIFVNEKQRCVYIASDGGRV 554

Query: 982  CRPLVIADKGVSRIKEHHMKELRDGVRSFQSFLKEGLIEYLDVNEENNALIALYEENAKE 1161
            CRPLVIADKGVSRIKEHHMKEL+DGVR+F SFLK+GLIEYLDVNEENNALIALYE+NAKE
Sbjct: 555  CRPLVIADKGVSRIKEHHMKELKDGVRTFDSFLKDGLIEYLDVNEENNALIALYEDNAKE 614

Query: 1162 ETTHIEIEPFTILGVCAGLIPYPHHNQSPRNTYQCAMGKQAMGNIAYNQLCRMDTLIYLL 1341
            +TTHIEIEPFTILGVCAGLIPYPHHNQSPRNTYQCAMGKQAMGNIAYNQ CR DTL+ LL
Sbjct: 615  KTTHIEIEPFTILGVCAGLIPYPHHNQSPRNTYQCAMGKQAMGNIAYNQYCRTDTLMCLL 674

Query: 1342 VYPQRPLLTTRTIELVSYDKLGAGQNASVAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVM 1521
            VYPQRPLLTTRTIELVS+DKLGAGQNA+VAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVM
Sbjct: 675  VYPQRPLLTTRTIELVSFDKLGAGQNATVAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVM 734

Query: 1522 KKTSAVVQKYENGTSDRIIGPQRDGHGAEHMQILDDDGLAAPGEIIRPFDIYINKQTPMD 1701
            K+  +V  KYEN TSDRII PQR+G   E MQILDDDGLAAPG+ IR  DIYINKQ+P+ 
Sbjct: 735  KRMVSVCLKYENSTSDRIIKPQREGQDCEKMQILDDDGLAAPGQTIRADDIYINKQSPIV 794

Query: 1702 TKSGGRAHNTVALKDSQYKSSKQTYKGPEGETPVVDRVALCSDRNNNMCVKFMIRHTRRP 1881
            T+      + +AL DS Y+SS++ YKG  GE+ V+DRVAL SDR+NNM +KFM R TRRP
Sbjct: 795  TRV--PVKSPMALPDSSYRSSREHYKGSVGESAVIDRVALFSDRDNNMKIKFMTRQTRRP 852

Query: 1882 EVGDKFSSRHGQKGVCGTIVQQEDFPFSERGVCPDLIMNPHGFPSRMTVGKMIELLGSKA 2061
            EVGDKFSSRHGQKGVCGTIVQQEDFPFSERG+CPDLIMNPHGFPSRMTVGKMIELLGSKA
Sbjct: 853  EVGDKFSSRHGQKGVCGTIVQQEDFPFSERGICPDLIMNPHGFPSRMTVGKMIELLGSKA 912

Query: 2062 GVSCGRFHYGSAFGEPSGHADKVEVISETLVKHGFSY 2172
            GVSCGRFHYGSAFGEPSGHADKVE ISETLVKHGFSY
Sbjct: 913  GVSCGRFHYGSAFGEPSGHADKVETISETLVKHGFSY 949


>ref|XP_006398163.1| hypothetical protein EUTSA_v10000748mg [Eutrema salsugineum]
            gi|557099252|gb|ESQ39616.1| hypothetical protein
            EUTSA_v10000748mg [Eutrema salsugineum]
          Length = 1164

 Score = 1164 bits (3012), Expect = 0.0
 Identities = 573/740 (77%), Positives = 648/740 (87%), Gaps = 16/740 (2%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQNNTRPEKEGRALDILRDTFLANIPVRENN 180
            LLPSIEECV   V+T+ +AL+YLE KV++      PEK+GRAL ILRD FLA++PVR+NN
Sbjct: 274  LLPSIEECVSEGVNTRKQALDYLEAKVKKTSYGPPPEKDGRALYILRDVFLAHVPVRDNN 333

Query: 181  FRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEASR 360
            FR KC YV VMLRRM+EA+LNKD MDDKDYVGNKRLELSGQL++LLFEDLFKTM  EA +
Sbjct: 334  FRQKCFYVGVMLRRMIEAMLNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTMITEAVK 393

Query: 361  KMEVVLSRLSRSNRFDISQLIGGE---SITNGLERTLSTGNFDIKRFKMHRKGMTQAVAR 531
            K++ +L + +R++RFD SQ + GE   +I+ GLERTLSTGNFDIKRF+MHRKGMTQ + R
Sbjct: 394  KVDGILQKPNRASRFDFSQCLTGEKNHNISFGLERTLSTGNFDIKRFRMHRKGMTQVLTR 453

Query: 532  LSYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHV 711
            LS+I +LGF+TKISPQFEKSRKVSGPR+LQPSQWGMLCPCDTPEGE+CGLVKNLALMTHV
Sbjct: 454  LSFIGSLGFITKISPQFEKSRKVSGPRSLQPSQWGMLCPCDTPEGESCGLVKNLALMTHV 513

Query: 712  TTDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKL 891
            TTDE+EGPL+++CY LGV DLE+LSAEELH P+S+L+IFNGLILGKHRRPQ FANS+R+L
Sbjct: 514  TTDEEEGPLVAMCYKLGVTDLEVLSAEELHTPDSFLVIFNGLILGKHRRPQYFANSLRRL 573

Query: 892  RRAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQ 1071
            RRAGKIGEFVS+F+NEKQ CVY+ASDGGRVCRPLVIADKGVSR+K+HHMKEL+DGVR+F 
Sbjct: 574  RRAGKIGEFVSVFINEKQHCVYVASDGGRVCRPLVIADKGVSRVKQHHMKELQDGVRTFD 633

Query: 1072 SFLKEGLIEYLDVNEENNALIALYEENA-------------KEETTHIEIEPFTILGVCA 1212
             F+++GLIEYLDVNEENNALIALYE +A             +E TTHIEIEPFTILGV A
Sbjct: 634  DFIRDGLIEYLDVNEENNALIALYESDATSEMAETAEAAKIREGTTHIEIEPFTILGVVA 693

Query: 1213 GLIPYPHHNQSPRNTYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVS 1392
            GLIPYPHHNQSPRNTYQCAMGKQAMGNIAYNQL RMDTL+YLLVYPQRPLLTTRTIELV 
Sbjct: 694  GLIPYPHHNQSPRNTYQCAMGKQAMGNIAYNQLNRMDTLLYLLVYPQRPLLTTRTIELVG 753

Query: 1393 YDKLGAGQNASVAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDR 1572
            YDKLGAGQNA+VAVMS +GYDIEDAIVMNK+SLDRGFGRCIVMKK  A  QKY N   DR
Sbjct: 754  YDKLGAGQNATVAVMSNTGYDIEDAIVMNKASLDRGFGRCIVMKKIVATCQKYGNDAVDR 813

Query: 1573 IIGPQRDGHGAEHMQILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQ 1752
            I+ PQR G  AE MQILDDDG+AAPGEIIRP D+YINKQ P+DT    R + T  L DSQ
Sbjct: 814  ILRPQRTGPDAEKMQILDDDGIAAPGEIIRPNDVYINKQIPVDT----RDNITSPLSDSQ 869

Query: 1753 YKSSKQTYKGPEGETPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCG 1932
            Y+ +++ +KGPEGET VVDRVALCSD+N N+C+K++IRHTRRPE+GDKFSSRHGQKGVCG
Sbjct: 870  YRPAREYFKGPEGETQVVDRVALCSDKNGNLCIKYIIRHTRRPELGDKFSSRHGQKGVCG 929

Query: 1933 TIVQQEDFPFSERGVCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPS 2112
            TIVQQEDFPFSERG+CPDLIMNPHGFPSRMTVGKMIELLGSKAGV  GRFHYGSAFGE S
Sbjct: 930  TIVQQEDFPFSERGICPDLIMNPHGFPSRMTVGKMIELLGSKAGVCSGRFHYGSAFGERS 989

Query: 2113 GHADKVEVISETLVKHGFSY 2172
            GHADKVE IS TLVKHGFSY
Sbjct: 990  GHADKVETISATLVKHGFSY 1009


>ref|XP_006279912.1| hypothetical protein CARUB_v10025768mg [Capsella rubella]
            gi|482548616|gb|EOA12810.1| hypothetical protein
            CARUB_v10025768mg [Capsella rubella]
          Length = 1160

 Score = 1163 bits (3008), Expect = 0.0
 Identities = 565/737 (76%), Positives = 649/737 (88%), Gaps = 13/737 (1%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQNNTRPEKEGRALDILRDTFLANIPVRENN 180
            LLPSIEECV   V+TQ +AL+YLE KV++       +K+GRAL ILRD FLA++PVR+NN
Sbjct: 274  LLPSIEECVSEGVNTQKQALDYLEAKVKKTSYGPPLQKDGRALYILRDLFLAHVPVRDNN 333

Query: 181  FRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEASR 360
            FR KC YV VMLRRM+EA+LNKD MDDKDYVGNKRLELSGQL++LLFEDLFKTM  EA +
Sbjct: 334  FRQKCFYVGVMLRRMIEAMLNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTMITEAIK 393

Query: 361  KMEVVLSRLSRSNRFDISQLIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARLSY 540
            K++V+L++ +R++RFD SQ + G+ I+ GLERTLSTGNFDIKRF+MHRKGMTQ + RLS+
Sbjct: 394  KVDVILTKQNRASRFDFSQCLSGDIISLGLERTLSTGNFDIKRFRMHRKGMTQVLTRLSF 453

Query: 541  IATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTTD 720
            I +LGF+TKISPQFEKSRKVSGPR+LQPSQWGMLCPCDTPEGE+CGLVKNLALMTHVTTD
Sbjct: 454  IGSLGFITKISPQFEKSRKVSGPRSLQPSQWGMLCPCDTPEGESCGLVKNLALMTHVTTD 513

Query: 721  EDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLRRA 900
            E+EGPL+++CY LGV DLE+LSAEELH P+S+L+I NGLILGKHRRPQ FANS+R+LRRA
Sbjct: 514  EEEGPLVAMCYKLGVTDLEVLSAEELHTPDSFLVILNGLILGKHRRPQYFANSLRRLRRA 573

Query: 901  GKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQSFL 1080
            GKIGEFVS+F NEKQ CVY+ASDGGRVCRPLVIADKG+SR+K+HHMKEL+DGVR+F  F+
Sbjct: 574  GKIGEFVSVFTNEKQHCVYVASDGGRVCRPLVIADKGISRVKQHHMKELQDGVRTFDDFI 633

Query: 1081 KEGLIEYLDVNEENNALIALYEENA-------------KEETTHIEIEPFTILGVCAGLI 1221
            ++GLIEYLDVNEENNALIALYE +A             + +TTHIEIEPFTILGV AGLI
Sbjct: 634  RDGLIEYLDVNEENNALIALYESDATTELDEGGEAAKIRADTTHIEIEPFTILGVVAGLI 693

Query: 1222 PYPHHNQSPRNTYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDK 1401
            PYPHHNQSPRNTYQCAMGKQAMGNIAYNQL RMDTL+YLLVYPQRPLLTTRTIELV YDK
Sbjct: 694  PYPHHNQSPRNTYQCAMGKQAMGNIAYNQLNRMDTLLYLLVYPQRPLLTTRTIELVGYDK 753

Query: 1402 LGAGQNASVAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIG 1581
            LGAGQNA+VAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKK  A  QKY+N T DRI+ 
Sbjct: 754  LGAGQNATVAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKIVATSQKYDNNTQDRILR 813

Query: 1582 PQRDGHGAEHMQILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKS 1761
            PQR G  AE MQILD+DG+A+PGEIIRP D+YINKQ+P+DTK+      T  L DSQY+ 
Sbjct: 814  PQRTGPDAEKMQILDNDGIASPGEIIRPNDVYINKQSPVDTKN----KMTTQLSDSQYRP 869

Query: 1762 SKQTYKGPEGETPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIV 1941
            +++ +KGPEGET VVDRVALCS++++++C+K++IRHTRRPE+GDKFSSRHGQKGVCGTI+
Sbjct: 870  AREYFKGPEGETQVVDRVALCSNKHDHLCIKYIIRHTRRPELGDKFSSRHGQKGVCGTII 929

Query: 1942 QQEDFPFSERGVCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHA 2121
            QQEDFPFSE G+CPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGE  GHA
Sbjct: 930  QQEDFPFSELGICPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGERGGHA 989

Query: 2122 DKVEVISETLVKHGFSY 2172
            DKVE IS TLV  GFSY
Sbjct: 990  DKVETISATLVNKGFSY 1006


>ref|XP_004141655.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC2-like [Cucumis
            sativus]
          Length = 1158

 Score = 1160 bits (3000), Expect = 0.0
 Identities = 570/725 (78%), Positives = 638/725 (88%), Gaps = 1/725 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQ-NNTRPEKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC K  ++TQ +ALEYLE KV+++Q  +  PEKEGRAL ILRD FLAN+PV +N
Sbjct: 284  LLPSIEECAKEKIYTQEQALEYLETKVKKFQFASAPPEKEGRALGILRDVFLANVPVYKN 343

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NF PKCIYVAVM+RRMM+AIL+KD MDDKDYVGNKRLELSGQL++LLFEDLFKTM  E  
Sbjct: 344  NFHPKCIYVAVMMRRMMDAILSKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTMVSEVK 403

Query: 358  RKMEVVLSRLSRSNRFDISQLIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARLS 537
            + ++ +L + SRS+RFD SQ +    I+ GLERTLSTGN+D+KRF+MHRKGM+Q +ARLS
Sbjct: 404  KTIDKLLGKHSRSSRFDFSQHLNSNIISFGLERTLSTGNWDVKRFRMHRKGMSQVLARLS 463

Query: 538  YIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT 717
            +I+T+G +T++SPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT
Sbjct: 464  FISTMGHVTRVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT 523

Query: 718  DEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLRR 897
            D++EGPLISLCY LGVEDLELLSAEELH PNS+L+IFNG ILGKHRRPQ FA  MR LRR
Sbjct: 524  DQEEGPLISLCYCLGVEDLELLSAEELHTPNSFLVIFNGRILGKHRRPQYFATGMRMLRR 583

Query: 898  AGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQSF 1077
            AGKIGEFVS+FVNEKQ CVYIASDGGRVCRPLVIADKGVSRIKE+HMKEL DGVR+F  F
Sbjct: 584  AGKIGEFVSVFVNEKQHCVYIASDGGRVCRPLVIADKGVSRIKEYHMKELSDGVRTFDDF 643

Query: 1078 LKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRNT 1257
            L++GLIEYLDVNEENNALIALYE  A  ETTHIEIEPFTILGV AGLIPYPHHNQSPRNT
Sbjct: 644  LRDGLIEYLDVNEENNALIALYEGEATPETTHIEIEPFTILGVVAGLIPYPHHNQSPRNT 703

Query: 1258 YQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAVM 1437
            YQCAMGKQAMGNIAYNQL RMDTL+YLLVYPQRPLLTT+TIELV YDKLGAGQNA+VAVM
Sbjct: 704  YQCAMGKQAMGNIAYNQLRRMDTLLYLLVYPQRPLLTTKTIELVGYDKLGAGQNATVAVM 763

Query: 1438 SYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHMQ 1617
            SYSGYDIEDAIVMNKSSLDRGFGRCIV KK S+V QKYEN T+DRI+ P R+     +MQ
Sbjct: 764  SYSGYDIEDAIVMNKSSLDRGFGRCIVFKKYSSVNQKYENNTADRIVRPNRNEDFTGNMQ 823

Query: 1618 ILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGET 1797
            ILDDDGLAAPGEIIRP DIY+NKQ+P+  K          + D+ Y+  +Q +KG EGE 
Sbjct: 824  ILDDDGLAAPGEIIRPNDIYVNKQSPIIMKGS----PLPGIPDNAYRPCRQIFKGSEGEP 879

Query: 1798 PVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERGV 1977
             VVDRVAL +D+N+ +C+KF+IR TRRPE+GDKFSSRHGQKGVCGTIVQQEDFPFSERG+
Sbjct: 880  TVVDRVALSTDKNDCLCIKFLIRQTRRPELGDKFSSRHGQKGVCGTIVQQEDFPFSERGI 939

Query: 1978 CPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVISETLVK 2157
            CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHADKV+ ISETL+K
Sbjct: 940  CPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADKVDAISETLIK 999

Query: 2158 HGFSY 2172
             GFSY
Sbjct: 1000 RGFSY 1004


>ref|XP_002321360.2| hypothetical protein POPTR_0015s005302g, partial [Populus
            trichocarpa] gi|550321669|gb|EEF05487.2| hypothetical
            protein POPTR_0015s005302g, partial [Populus trichocarpa]
          Length = 1035

 Score = 1157 bits (2994), Expect = 0.0
 Identities = 574/732 (78%), Positives = 641/732 (87%), Gaps = 8/732 (1%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQR--YQNNTRPEKEGRALDILRDTFLANIPVRE 174
            LLPSIEEC    V+TQ +ALEYLE  V+R  Y +++  ++E RAL ILRD F+AN+PVR+
Sbjct: 277  LLPSIEECASHGVYTQQQALEYLEAMVKRSTYSSSSTEKQENRALAILRDVFIANVPVRK 336

Query: 175  NNFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEA 354
            NNFRPKCIYVAVMLRRMMEA+LNKD MDDKDYVGNKRLELSGQL++LLFEDLFKTM  E 
Sbjct: 337  NNFRPKCIYVAVMLRRMMEALLNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTMITEV 396

Query: 355  SRKMEVVLSRLSRSNRFDISQLIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARL 534
             +  + +L + +RS+RFD SQ I  +SITNGLER LSTGN+D+KRF+M+RKG+TQ + RL
Sbjct: 397  QKTADTLLVKQNRSSRFDFSQYIVRDSITNGLERALSTGNWDVKRFRMNRKGVTQVLVRL 456

Query: 535  SYIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 714
            SY+A+LG MT+ISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT
Sbjct: 457  SYMASLGHMTRISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVT 516

Query: 715  TDEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLR 894
            TDE+E PLISLC  LGVEDLELLS EELH PNS+L+IFNGLILGKHRRPQ+FAN+MRKLR
Sbjct: 517  TDEEESPLISLCKCLGVEDLELLSGEELHTPNSFLVIFNGLILGKHRRPQQFANAMRKLR 576

Query: 895  RAGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQS 1074
            RAGKIGEFVS+FVNEKQR VYIASDGGRVCRPLVIADKGVSRIKEHHM+EL DG R+F  
Sbjct: 577  RAGKIGEFVSVFVNEKQRAVYIASDGGRVCRPLVIADKGVSRIKEHHMRELMDGARTFDD 636

Query: 1075 FLKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRN 1254
            FL EGLIEYLDVNEENNALIALYE  A  ETTHIEIEPFTILGV AGLIPYPHHNQSPRN
Sbjct: 637  FLHEGLIEYLDVNEENNALIALYEWEATPETTHIEIEPFTILGVVAGLIPYPHHNQSPRN 696

Query: 1255 TYQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAV 1434
            TYQCAMGKQAMGNIAYNQ  RMD+L+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAV
Sbjct: 697  TYQCAMGKQAMGNIAYNQASRMDSLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAV 756

Query: 1435 MSYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHM 1614
            MSYSGYDIEDAIVMNK+SLDRGFGRCIV+KK +   QKYENG SDRI+ P+++    E  
Sbjct: 757  MSYSGYDIEDAIVMNKASLDRGFGRCIVLKKYTCTNQKYENGASDRILRPRKN---EERE 813

Query: 1615 QILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGE 1794
            ++LDDDGLAAPGEIIR  DIYINK++P++T+  G   +  AL D +Y+   Q +KG EGE
Sbjct: 814  RVLDDDGLAAPGEIIRHGDIYINKESPIETR--GPLKSAAALADVKYRPCAQIFKGTEGE 871

Query: 1795 TPVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERG 1974
            + VVDRVALCSD+NNN+C+K+ IRHTRRPEVGDKFSSRHGQKGVCGTI+QQEDFPFSERG
Sbjct: 872  SCVVDRVALCSDKNNNLCIKYKIRHTRRPEVGDKFSSRHGQKGVCGTIIQQEDFPFSERG 931

Query: 1975 VCPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVIS---- 2142
            +CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD+VE I     
Sbjct: 932  ICPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADRVEAIRHFLS 991

Query: 2143 --ETLVKHGFSY 2172
              ETLVKHGFSY
Sbjct: 992  LIETLVKHGFSY 1003


>ref|XP_002321356.2| hypothetical protein POPTR_0015s005302g, partial [Populus
            trichocarpa] gi|550321670|gb|EEF05483.2| hypothetical
            protein POPTR_0015s005302g, partial [Populus trichocarpa]
          Length = 1034

 Score = 1157 bits (2993), Expect = 0.0
 Identities = 575/731 (78%), Positives = 640/731 (87%), Gaps = 7/731 (0%)
 Frame = +1

Query: 1    LLPSIEECVKASVHTQNKALEYLENKVQRYQ-NNTRPEKEGRALDILRDTFLANIPVREN 177
            LLPSIEEC    V+TQ +ALEYLE  V+R   +++  EKE RAL ILRD F+AN+PVR+N
Sbjct: 277  LLPSIEECASHGVYTQQQALEYLEAMVKRSTYSSSSTEKENRALAILRDVFIANVPVRKN 336

Query: 178  NFRPKCIYVAVMLRRMMEAILNKDTMDDKDYVGNKRLELSGQLLALLFEDLFKTMNEEAS 357
            NFRPKCIYVAVMLRRMMEA+LNKD MDDKDYVGNKRLELSGQL++LLFEDLFKTM  E  
Sbjct: 337  NFRPKCIYVAVMLRRMMEALLNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTMITEVQ 396

Query: 358  RKMEVVLSRLSRSNRFDISQLIGGESITNGLERTLSTGNFDIKRFKMHRKGMTQAVARLS 537
            +  + +L + +RS+RFD SQ I  +SITNGLER LSTGN+D+KRF+M+RKG+TQ + RLS
Sbjct: 397  KTADTLLVKQNRSSRFDFSQYIVRDSITNGLERALSTGNWDVKRFRMNRKGVTQVLVRLS 456

Query: 538  YIATLGFMTKISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT 717
            Y+A+LG MT+ISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT
Sbjct: 457  YMASLGHMTRISPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTT 516

Query: 718  DEDEGPLISLCYTLGVEDLELLSAEELHMPNSYLIIFNGLILGKHRRPQKFANSMRKLRR 897
            DE+E PLISLC  LGVEDLELLS EELH PNS+L+IFNGLILGKHRRPQ+FAN+MRKLRR
Sbjct: 517  DEEESPLISLCKCLGVEDLELLSGEELHTPNSFLVIFNGLILGKHRRPQQFANAMRKLRR 576

Query: 898  AGKIGEFVSIFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELRDGVRSFQSF 1077
            AGKIGEFVS+FVNEKQR VYIASDGGRVCRPLVIADKGVSRIKEHHM+EL DG R+F  F
Sbjct: 577  AGKIGEFVSVFVNEKQRAVYIASDGGRVCRPLVIADKGVSRIKEHHMRELMDGARTFDDF 636

Query: 1078 LKEGLIEYLDVNEENNALIALYEENAKEETTHIEIEPFTILGVCAGLIPYPHHNQSPRNT 1257
            L EGLIEYLDVNEENNALIALYE  A  ETTHIEIEPFTILGV AGLIPYPHHNQSPRNT
Sbjct: 637  LHEGLIEYLDVNEENNALIALYEWEATPETTHIEIEPFTILGVVAGLIPYPHHNQSPRNT 696

Query: 1258 YQCAMGKQAMGNIAYNQLCRMDTLIYLLVYPQRPLLTTRTIELVSYDKLGAGQNASVAVM 1437
            YQCAMGKQAMGNIAYNQ  RMD+L+YLLVYPQRPLLTTRTIELV YDKLGAGQNA+VAVM
Sbjct: 697  YQCAMGKQAMGNIAYNQASRMDSLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNATVAVM 756

Query: 1438 SYSGYDIEDAIVMNKSSLDRGFGRCIVMKKTSAVVQKYENGTSDRIIGPQRDGHGAEHMQ 1617
            SYSGYDIEDAIVMNK+SLDRGFGRCIV+KK +   QKYENG SDRI+ P+++    E  +
Sbjct: 757  SYSGYDIEDAIVMNKASLDRGFGRCIVLKKYTCTNQKYENGASDRILRPRKN---EERER 813

Query: 1618 ILDDDGLAAPGEIIRPFDIYINKQTPMDTKSGGRAHNTVALKDSQYKSSKQTYKGPEGET 1797
            +LDDDGLAAPGEIIR  DIYINK++P++T+  G   +  AL D +Y+   Q +KG EGE+
Sbjct: 814  VLDDDGLAAPGEIIRHGDIYINKESPIETR--GPLKSAAALADVKYRPCAQIFKGTEGES 871

Query: 1798 PVVDRVALCSDRNNNMCVKFMIRHTRRPEVGDKFSSRHGQKGVCGTIVQQEDFPFSERGV 1977
             VVDRVALCSD+NNN+C+K+ IRHTRRPEVGDKFSSRHGQKGVCGTI+QQEDFPFSERG+
Sbjct: 872  CVVDRVALCSDKNNNLCIKYKIRHTRRPEVGDKFSSRHGQKGVCGTIIQQEDFPFSERGI 931

Query: 1978 CPDLIMNPHGFPSRMTVGKMIELLGSKAGVSCGRFHYGSAFGEPSGHADKVEVIS----- 2142
            CPDLIMNPHGFPSRMTVGKMIELLG KAGVSCGRFHYGSAFGEPSGHAD+VE I      
Sbjct: 932  CPDLIMNPHGFPSRMTVGKMIELLGGKAGVSCGRFHYGSAFGEPSGHADRVEAIRHFLSL 991

Query: 2143 -ETLVKHGFSY 2172
             ETLVKHGFSY
Sbjct: 992  IETLVKHGFSY 1002


Top