BLASTX nr result

ID: Mentha23_contig00040417 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00040417
         (815 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAN04919.1| Putative polyprotein [Oryza sativa Japonica Group...   216   3e-72
ref|XP_007050063.1| Uncharacterized protein TCM_003746 [Theobrom...   214   6e-72
gb|AAR89003.1| putative polyprotein [Oryza sativa Japonica Group...   215   8e-71
gb|ABF95081.1| retrotransposon protein, putative, Ty3-gypsy subc...   215   3e-68
gb|AAR89842.1| putative polyprotein [Oryza sativa Japonica Group]     207   3e-67
emb|CAE01541.2| OSJNBa0033G05.1 [Oryza sativa Japonica Group] gi...   197   7e-67
gb|ABB47446.1| retrotransposon protein, putative, Ty3-gypsy subc...   197   1e-66
gb|ABA97518.1| retrotransposon protein, putative, Ty3-gypsy subc...   196   7e-66
gb|ABA97617.1| retrotransposon protein, putative, Ty3-gypsy subc...   195   1e-65
gb|AAQ56379.1| putative polyprotein [Oryza sativa Japonica Group]     199   2e-65
gb|ABA94145.1| retrotransposon protein, putative, Ty3-gypsy subc...   196   4e-64
gb|AAF18642.1|AC006228_13 F5J5.15 [Arabidopsis thaliana]              191   5e-64
emb|CAH67760.1| H0124E07.7 [Oryza sativa Indica Group]                187   2e-63
ref|XP_007037177.1| DNA/RNA polymerases superfamily protein [The...   239   9e-61
ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [The...   237   3e-60
ref|XP_007099735.1| Uncharacterized protein TCM_045699 [Theobrom...   237   3e-60
ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass,...   237   3e-60
ref|XP_007099730.1| DNA/RNA polymerases superfamily protein [The...   237   4e-60
ref|XP_007044132.1| Uncharacterized protein TCM_009073 [Theobrom...   236   6e-60
ref|XP_007032400.1| DNA/RNA polymerases superfamily protein [The...   236   7e-60

>gb|AAN04919.1| Putative polyprotein [Oryza sativa Japonica Group]
            gi|31430212|gb|AAP52158.1| retrotransposon protein,
            putative, Ty3-gypsy subclass, expressed [Oryza sativa
            Japonica Group]
          Length = 1719

 Score =  216 bits (549), Expect(2) = 3e-72
 Identities = 102/171 (59%), Positives = 128/171 (74%)
 Frame = +3

Query: 303  ITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDRGGH 482
            I SDR S+FTS+ W  LQ EL T+LNFSTA+HPQTDGQ+ER  Q LEDMLR   LD GG 
Sbjct: 1422 IVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTDGQTERVNQILEDMLRACALDFGGA 1481

Query: 483  WEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMVETI 662
            W+  LP  EF+ NNSYQA++ M P+EALYGRKCR+PL+WD+ G R+L G + + E  E +
Sbjct: 1482 WDKSLPYAEFSYNNSYQASLQMAPFEALYGRKCRTPLFWDQTGERQLFGTEVLAEAEEKV 1541

Query: 663  RQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
            R IRER++ AQ RQKSYADNRR ++ FE GD  +L+V+P +G+ RF  KGK
Sbjct: 1542 RIIRERLRIAQSRQKSYADNRRRELTFEAGDHVYLRVTPLRGVHRFQTKGK 1592



 Score = 83.6 bits (205), Expect(2) = 3e-72
 Identities = 38/69 (55%), Positives = 52/69 (75%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            RLC+P ++ L++ I++EAH T Y+ HPGS KMYQDLK++F    M+++IA FV  C  CQ
Sbjct: 1325 RLCVPDDKELKDLILTEAHQTQYSIHPGSTKMYQDLKEKFWWVSMRREIAEFVALCDVCQ 1384

Query: 269  QVKALHQRP 295
            +VKA HQRP
Sbjct: 1385 RVKAEHQRP 1393


>ref|XP_007050063.1| Uncharacterized protein TCM_003746 [Theobroma cacao]
           gi|508702324|gb|EOX94220.1| Uncharacterized protein
           TCM_003746 [Theobroma cacao]
          Length = 267

 Score =  214 bits (544), Expect(2) = 6e-72
 Identities = 108/165 (65%), Positives = 124/165 (75%)
 Frame = +3

Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
           P+SI SDRD +FTSR W   Q  L TKL FSTAFHPQTDGQ ERTIQTLEDMLR  V+D 
Sbjct: 100 PVSIVSDRDPRFTSRFWPKFQEALGTKLKFSTAFHPQTDGQLERTIQTLEDMLRACVIDF 159

Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
            G W+  LPL+EFA NNS+Q+ I M PYEALYGRKCR+PL WDEVG RKL+  + I    
Sbjct: 160 IGSWDRHLPLVEFAYNNSFQSIIGMAPYEALYGRKCRTPLCWDEVGERKLVSVELIELTN 219

Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKG 788
           + I+ IRER+K AQDR+KSYAD RR  ++FEI D  FLKVSP KG
Sbjct: 220 DKIKVIRERLKVAQDRRKSYADKRRKDLEFEIDDNVFLKVSPWKG 264



 Score = 84.7 bits (208), Expect(2) = 6e-72
 Identities = 37/73 (50%), Positives = 48/73 (65%)
 Frame = +2

Query: 89  RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
           R+C+P    LR  IM EAH + Y  HPGS KMY+ +++ +    MK+D+A FV +CL CQ
Sbjct: 5   RVCVPEGNQLRQAIMEEAHSSAYALHPGSTKMYRTIRENYWWPSMKRDVAEFVAKCLVCQ 64

Query: 269 QVKALHQRPYLNH 307
           QVKA HQRP   H
Sbjct: 65  QVKAEHQRPAAIH 77


>gb|AAR89003.1| putative polyprotein [Oryza sativa Japonica Group]
            gi|108709031|gb|ABF96826.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 934

 Score =  215 bits (547), Expect(2) = 8e-71
 Identities = 103/174 (59%), Positives = 129/174 (74%)
 Frame = +3

Query: 294  PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
            P  I SDR S+FTS+ W  LQ EL T+LNFSTA+HPQT+GQ+ER  Q LEDMLR   LD 
Sbjct: 634  PKKIVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTNGQTERVNQILEDMLRACALDF 693

Query: 474  GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
            GG W+  LP  EF+ NNSYQA++ MVP+EALYGRKCR+PL+WD+ G R+L G + + E  
Sbjct: 694  GGAWDKSLPYAEFSYNNSYQASLQMVPFEALYGRKCRTPLFWDQAGERQLFGIEVLAEAE 753

Query: 654  ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
            E +R IRER++ AQ RQKSYADNRR ++ FE GD  +L V+P +G+ RF  KGK
Sbjct: 754  EKVRTIRERLRIAQSRQKSYADNRRRELTFEAGDYVYLLVTPLRGVHRFQTKGK 807



 Score = 79.7 bits (195), Expect(2) = 8e-71
 Identities = 37/69 (53%), Positives = 51/69 (73%)
 Frame = +2

Query: 89  RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
           RLC+  ++ L++ I++EAH T Y+ HPGS KMYQDLK++F    M+++IA FV  C  CQ
Sbjct: 540 RLCVLDDKELKDLILTEAHQTQYSIHPGSTKMYQDLKEKFWWVSMRREIAEFVALCDVCQ 599

Query: 269 QVKALHQRP 295
           +VKA HQRP
Sbjct: 600 RVKAEHQRP 608


>gb|ABF95081.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1329

 Score =  215 bits (548), Expect(2) = 3e-68
 Identities = 101/174 (58%), Positives = 129/174 (74%)
 Frame = +3

Query: 294  PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
            P  I SDR S+FTS+ W  LQ EL T+LNFSTA+HPQTDGQ+ER  Q LEDMLR   LD 
Sbjct: 1029 PKKIVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTDGQTERVNQILEDMLRACALDF 1088

Query: 474  GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
            GG W+  LP  EF+ NNSYQ+++ M P+EALYGRKCR+PL+WD+ G R+L G + + E  
Sbjct: 1089 GGAWDKSLPYAEFSYNNSYQSSLQMAPFEALYGRKCRTPLFWDQTGERQLFGTEVLTEAE 1148

Query: 654  ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
            E +R +RER++ AQ RQKSYADNRR ++ FE GD  +L+V+P +G+ RF  KGK
Sbjct: 1149 EKVRTVRERLRIAQSRQKSYADNRRRELTFEAGDYVYLRVTPLRGVHRFQTKGK 1202



 Score = 70.9 bits (172), Expect(2) = 3e-68
 Identities = 35/77 (45%), Positives = 49/77 (63%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            RLC+P ++ L++ I++EAH T Y+ HPGS KMYQDLK++F    M+K+IA FV  C  CQ
Sbjct: 954  RLCVPDDKELKDLILTEAHQTHYSIHPGSTKMYQDLKEKFWWVSMRKEIAEFVALCDVCQ 1013

Query: 269  QVKALHQRPYLNHLGPR 319
              +    R    H  P+
Sbjct: 1014 LAELYLSRIMCLHGVPK 1030


>gb|AAR89842.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1789

 Score =  207 bits (526), Expect(2) = 3e-67
 Identities = 101/174 (58%), Positives = 124/174 (71%)
 Frame = +3

Query: 294  PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
            P  I SDR S+FTS  W  LQ EL T+LNFSTA+HPQTDGQ+ER  Q LEDMLR  VLD 
Sbjct: 1516 PKKIVSDRGSQFTSHFWKKLQEELGTRLNFSTAYHPQTDGQTERLNQILEDMLRACVLDF 1575

Query: 474  GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
            G  W+  LP  EF+ NNSYQA+I M PYEALYGRKCR+PL WD+VG  ++ G D + E  
Sbjct: 1576 GKTWDKSLPYAEFSYNNSYQASIQMAPYEALYGRKCRTPLMWDQVGESQVFGTDILREAE 1635

Query: 654  ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
              +R IR+ +K AQ RQKSYADNRR  ++F + D  +L+V+P +G+ RF  KGK
Sbjct: 1636 AKVRTIRDNLKVAQSRQKSYADNRRRDLEFAVDDFVYLRVTPLRGVHRFRTKGK 1689



 Score = 75.9 bits (185), Expect(2) = 3e-67
 Identities = 34/69 (49%), Positives = 48/69 (69%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            R+C+P N  L+  I+ EAH++ Y+ HPGS KMY DL++++    MK++IA F   C  CQ
Sbjct: 1417 RVCVPDNRELKQLILQEAHESPYSIHPGSTKMYLDLREKYWWVSMKREIAEFEALCDVCQ 1476

Query: 269  QVKALHQRP 295
            +VKA HQRP
Sbjct: 1477 RVKAEHQRP 1485


>emb|CAE01541.2| OSJNBa0033G05.1 [Oryza sativa Japonica Group]
            gi|38347324|emb|CAE05974.2| OSJNBa0063C18.15 [Oryza
            sativa Japonica Group]
          Length = 1764

 Score =  197 bits (500), Expect(2) = 7e-67
 Identities = 96/174 (55%), Positives = 123/174 (70%)
 Frame = +3

Query: 294  PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
            P  I SDR ++FTSR W  L   L T LNFSTA+HPQTDGQ+ER  Q LEDMLR+  LD 
Sbjct: 1475 PKKIVSDRGTQFTSRFWKQLHEALGTDLNFSTAYHPQTDGQTERVNQILEDMLRSCALDF 1534

Query: 474  GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
             G W+  L   EF+ NN YQA+I M+P EA++GRKCR+PL W+EVG   + GPD +    
Sbjct: 1535 EGTWDRCLLYAEFSYNNGYQASIQMLPNEAMFGRKCRTPLCWNEVGKALVFGPDILKSAE 1594

Query: 654  ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
            E ++  RER+K AQ+RQK+YADNRR  ++FE GD  +L+VSP +G+ RFG+ GK
Sbjct: 1595 EQVKLTRERLKTAQNRQKNYADNRRRDLEFEKGDHVYLRVSPLRGMRRFGMSGK 1648



 Score = 84.7 bits (208), Expect(2) = 7e-67
 Identities = 37/69 (53%), Positives = 50/69 (72%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            R+C+P  + LR+ I+ EAH++ Y+ HPGS KMYQD+K  F   GMK+D+A +V  C  CQ
Sbjct: 1376 RICVPAKKELRDLILKEAHESAYSIHPGSTKMYQDIKAYFWWTGMKRDVAEYVALCDVCQ 1435

Query: 269  QVKALHQRP 295
            +VKA HQRP
Sbjct: 1436 RVKAEHQRP 1444


>gb|ABB47446.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1705

 Score =  197 bits (501), Expect(2) = 1e-66
 Identities = 98/177 (55%), Positives = 125/177 (70%), Gaps = 3/177 (1%)
 Frame = +3

Query: 294  PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
            P  I SDR S+FTS+ W  LQ EL T+LNFSTA+HPQTDGQ+ER    LEDMLR   LD 
Sbjct: 1402 PKKIVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTDGQTERVNLILEDMLRACALDF 1461

Query: 474  GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRS---PLYWDEVG*RKLLGPDAIG 644
            GG W+  LP  EF+ NNSYQA++ M P+EALYGRK  +    L+WD+ G R+L G + + 
Sbjct: 1462 GGAWDKSLPYAEFSYNNSYQASLQMAPFEALYGRKLYTAICALFWDQTGERQLFGTEVLA 1521

Query: 645  EMVETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
            E+ E +R IRER++ AQ RQKSYADNRR ++ FE GD  +L+V+P +G+  F  KGK
Sbjct: 1522 EVEEKVRIIRERLRIAQSRQKSYADNRRRELTFEEGDYVYLRVTPLRGVHHFQTKGK 1578



 Score = 83.6 bits (205), Expect(2) = 1e-66
 Identities = 38/69 (55%), Positives = 52/69 (75%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            RLC+P ++ L++ I++EAH T Y+ HPGS KMYQDLK++F    M+++IA FV  C  CQ
Sbjct: 1302 RLCVPDDKELKDLILTEAHQTQYSIHPGSTKMYQDLKEKFWWVSMRREIAEFVALCDVCQ 1361

Query: 269  QVKALHQRP 295
            +VKA HQRP
Sbjct: 1362 RVKAEHQRP 1370


>gb|ABA97518.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1524

 Score =  196 bits (499), Expect(2) = 7e-66
 Identities = 94/160 (58%), Positives = 118/160 (73%)
 Frame = +3

Query: 336  RLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDRGGHWEPILPLIEFA 515
            RL   LQ EL T+LNFSTA+HPQTDGQ+ER  Q LEDMLR   LD GG W+  LP  EF+
Sbjct: 1238 RLTKKLQEELGTRLNFSTAYHPQTDGQTERVNQILEDMLRACALDFGGAWDKSLPYAEFS 1297

Query: 516  *NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMVETIRQIRERIKEAQ 695
             NNSYQA++ M P+EALY RKCR+PL+WD+ G R+L G + + E  E +R IRER++ AQ
Sbjct: 1298 YNNSYQASLQMAPFEALYDRKCRTPLFWDQTGERQLFGTEVLAEAEEKVRTIRERLRIAQ 1357

Query: 696  DRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
             RQKSYADNRR ++ FE GD  +L V+P +G+ RF +KGK
Sbjct: 1358 SRQKSYADNRRRELTFEAGDYVYLHVTPLRGVHRFQIKGK 1397



 Score = 81.6 bits (200), Expect(2) = 7e-66
 Identities = 37/69 (53%), Positives = 51/69 (73%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            RLC+P ++  ++ I++EAH T Y+ HPGS KMYQDLK++F    M+++IA FV  C  CQ
Sbjct: 1129 RLCVPDDKEQKDLILTEAHQTQYSIHPGSTKMYQDLKEKFWWVSMRREIAEFVALCNVCQ 1188

Query: 269  QVKALHQRP 295
            +VKA HQRP
Sbjct: 1189 RVKAEHQRP 1197


>gb|ABA97617.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1243

 Score =  195 bits (495), Expect(2) = 1e-65
 Identities = 93/162 (57%), Positives = 117/162 (72%)
 Frame = +3

Query: 294  PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
            P  I SDR S+FTS+ W  LQ EL T+LNFSTA+HPQTDGQ+ER  Q  EDMLR   LD 
Sbjct: 1068 PKKIVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTDGQTERVNQIWEDMLRACALDF 1127

Query: 474  GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
            GG W+  LP  EF+ NN+YQA++ M P+EALYGRKC +PL+WD+ G R+L G + + E  
Sbjct: 1128 GGAWDKNLPYAEFSYNNNYQASLQMAPFEALYGRKCHTPLFWDQTGERQLFGTEVLAEAE 1187

Query: 654  ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSP 779
            E +R IRER++ AQ RQKSYADN R ++ FE GD  +L V+P
Sbjct: 1188 EKVRIIRERLRIAQSRQKSYADNPRRELTFEAGDYVYLHVTP 1229



 Score = 82.4 bits (202), Expect(2) = 1e-65
 Identities = 38/69 (55%), Positives = 51/69 (73%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            RLC+P ++ L++ I++EAH T Y+ HPGS KMYQDLK++F    M ++IA FV  C  CQ
Sbjct: 974  RLCVPDDKELKDLILTEAHQTQYSIHPGSTKMYQDLKEKFWWVSMTREIAEFVALCDVCQ 1033

Query: 269  QVKALHQRP 295
            +VKA HQRP
Sbjct: 1034 RVKAEHQRP 1042


>gb|AAQ56379.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1689

 Score =  199 bits (505), Expect(2) = 2e-65
 Identities = 98/171 (57%), Positives = 121/171 (70%)
 Frame = +3

Query: 303  ITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDRGGH 482
            I SDR S+FTS  W  LQ EL T+LNFSTA+HPQTDGQ+ER  Q LEDMLR  VLD G  
Sbjct: 1392 IVSDRGSQFTSHFWKKLQEELGTRLNFSTAYHPQTDGQTERLNQILEDMLRACVLDFGKT 1451

Query: 483  WEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMVETI 662
            W+  L   EF+ NNSYQA+I M PYEALYGRKCR+PL WD+VG  ++ G D + E    +
Sbjct: 1452 WDKSLLYAEFSYNNSYQASIQMAPYEALYGRKCRTPLLWDQVGESQVFGTDILREAEAKV 1511

Query: 663  RQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
            R IR+ +K AQ RQKSYAD RR  ++F + D  +L+V+P +G+ RF  KGK
Sbjct: 1512 RTIRDNLKVAQSRQKSYADTRRRNLEFAMDDFVYLRVTPLRGVHRFQTKGK 1562



 Score = 77.8 bits (190), Expect(2) = 2e-65
 Identities = 35/69 (50%), Positives = 50/69 (72%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            R+C+P ++ L+  I+ EAH++ Y+ HPGS KMY DLK+++    MK++IA FV  C  CQ
Sbjct: 1286 RVCVPNDKELKQLILQEAHESPYSIHPGSTKMYFDLKEKYWWVSMKREIAKFVALCDVCQ 1345

Query: 269  QVKALHQRP 295
            +VKA HQRP
Sbjct: 1346 RVKAEHQRP 1354


>gb|ABA94145.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 927

 Score =  196 bits (497), Expect(2) = 4e-64
 Identities = 94/161 (58%), Positives = 120/161 (74%), Gaps = 2/161 (1%)
 Frame = +3

Query: 339  LWISLQR--ELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDRGGHWEPILPLIEF 512
            +W+ + R     T+L FSTAFHPQ DGQSERTIQTLEDMLR+ +L   G WE  LPL+EF
Sbjct: 724  IWVVVDRLKAFDTQLKFSTAFHPQADGQSERTIQTLEDMLRSCILSWKGSWEDHLPLVEF 783

Query: 513  A*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMVETIRQIRERIKEA 692
              NNS+ A+I + PYEALYGRKCRS L WD +G R +LGPD + +  E I +IR+ +  A
Sbjct: 784  TYNNSFHASIQVAPYEALYGRKCRSLLCWDSIGERAILGPDWVQQTTERIAEIRQHMLAA 843

Query: 693  QDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
            Q RQKSYAD +R +++FE+GD+  LKVSP+KG++RFG KGK
Sbjct: 844  QSRQKSYADVKRRELEFEVGDQVLLKVSPTKGVVRFGTKGK 884



 Score = 76.6 bits (187), Expect(2) = 4e-64
 Identities = 37/85 (43%), Positives = 50/85 (58%)
 Frame = +2

Query: 41  TKKWTMRLQSKEGL*GRLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DG 220
           T+ +T+         GRLC+P    ++ +I+ EAH T YT HPG  KMY DLKK +    
Sbjct: 605 TRDFTLDSSGAVRFHGRLCVPQKAKVKEEILREAHRTPYTVHPGENKMYHDLKKIYWWKR 664

Query: 221 MKKDIASFVERCLACQQVKALHQRP 295
           MK D+A +V  C  CQ+VKA H+ P
Sbjct: 665 MKVDVAKYVASCGVCQRVKAEHKSP 689


>gb|AAF18642.1|AC006228_13 F5J5.15 [Arabidopsis thaliana]
          Length = 1617

 Score =  191 bits (484), Expect(2) = 5e-64
 Identities = 92/153 (60%), Positives = 115/153 (75%)
 Frame = +3

Query: 294  PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
            P+SI S RDSKFTS  W + Q E+ TK+  STA+HPQTDGQSERTIQTLEDML+  VLD 
Sbjct: 1117 PVSILSHRDSKFTSAFWRAFQVEMGTKVQMSTAYHPQTDGQSERTIQTLEDMLQMCVLDW 1176

Query: 474  GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
            GGHW   L L++FA NNSYQA+I M P+EALYGR CR+ L W +VG + + G D + E  
Sbjct: 1177 GGHWADHLSLVKFAYNNSYQASIGMAPFEALYGRPCRTLLCWTQVGEKSIYGADYVQETT 1236

Query: 654  ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIG 752
            E IR ++  +KEAQDRQ+SYAD RR +++FE+G
Sbjct: 1237 ERIRVLKLNMKEAQDRQRSYADKRRRELEFEVG 1269



 Score = 81.3 bits (199), Expect(2) = 5e-64
 Identities = 35/69 (50%), Positives = 52/69 (75%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            R+C+P +E LR +I+SEAH ++++ HPG+ KMY+DLK+ +   GMK+D+A++V  C  CQ
Sbjct: 1013 RVCLPKDEELRREILSEAHASMFSIHPGATKMYRDLKRHYQWVGMKRDVANWVTECDVCQ 1072

Query: 269  QVKALHQRP 295
             VKA HQ P
Sbjct: 1073 LVKAEHQVP 1081


>emb|CAH67760.1| H0124E07.7 [Oryza sativa Indica Group]
          Length = 1430

 Score =  187 bits (474), Expect(2) = 2e-63
 Identities = 93/176 (52%), Positives = 122/176 (69%)
 Frame = +3

Query: 288  NDPISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVL 467
            +D I +  DR +K  +     LQ E+ +KLNFSTA+HPQTDGQ+ER  Q LEDMLR   L
Sbjct: 1129 HDSIWVIVDRLTK-VAHFIPKLQEEMGSKLNFSTAYHPQTDGQTERVNQILEDMLRVCAL 1187

Query: 468  DRGGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGE 647
            D GG W+  LP  EF+ NNSYQA++ M PYEALYGRKCR+PL WD+ G R++ G D + +
Sbjct: 1188 DFGGSWDKNLPYAEFSYNNSYQASLQMAPYEALYGRKCRTPLLWDQTGERQVFGTDILRK 1247

Query: 648  MVETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
              E ++ I+ER++ AQ   KSYADNRR  + FE GD  +L+V+P +G+ RF  KGK
Sbjct: 1248 AEEKVKIIQERLRVAQSSHKSYADNRRRDLSFEEGDYVYLRVTPLRGVHRFHTKGK 1303



 Score = 83.6 bits (205), Expect(2) = 2e-63
 Identities = 37/69 (53%), Positives = 49/69 (71%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            R+C+P N+ L++ I+ EAHDTLY+ HP S KMYQDLK+ F    MK +I  +V  C  CQ
Sbjct: 1029 RICVPDNKDLKDAILKEAHDTLYSIHPSSTKMYQDLKERFWWASMKHEITEYVAVCDVCQ 1088

Query: 269  QVKALHQRP 295
            +VKA HQ+P
Sbjct: 1089 RVKAEHQKP 1097


>ref|XP_007037177.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508774422|gb|EOY21678.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 448

 Score =  239 bits (610), Expect = 9e-61
 Identities = 117/174 (67%), Positives = 137/174 (78%)
 Frame = +3

Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
           PISI SDR ++FTSR W  LQ  L TKL+FSTAFHPQTDGQSERTIQTLEDMLR  V+D 
Sbjct: 158 PISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 217

Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
           G  WE  LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + +  
Sbjct: 218 GVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 277

Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
           E I  IR+R+  AQ RQKSYADNRR  ++F++GD  FLKVSP+KGIMRFG KGK
Sbjct: 278 EKIHMIRQRMLTAQSRQKSYADNRRRYLEFQVGDHVFLKVSPTKGIMRFGKKGK 331



 Score = 89.7 bits (221), Expect = 1e-15
 Identities = 38/69 (55%), Positives = 51/69 (73%)
 Frame = +2

Query: 89  RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
           RL +P  +GLR +I+ EAH   Y  HPG+ KMYQDLK+ +  +G+K+D+A FV +CL CQ
Sbjct: 15  RLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQ 74

Query: 269 QVKALHQRP 295
           QVKA HQ+P
Sbjct: 75  QVKAEHQKP 83


>ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508779195|gb|EOY26451.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 679

 Score =  237 bits (605), Expect = 3e-60
 Identities = 115/174 (66%), Positives = 136/174 (78%)
 Frame = +3

Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
           PISI SDR ++FTSR W  LQ  L TKL+FSTAFHPQTDGQSERTIQTLEDMLR  V+D 
Sbjct: 389 PISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 448

Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
           G  WE  LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + +  
Sbjct: 449 GVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 508

Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
           E I  IR+R+  AQ RQKSYADNRR  ++F++GD  FLK SP+KG+MRFG KGK
Sbjct: 509 EKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKFSPTKGVMRFGKKGK 562



 Score = 89.7 bits (221), Expect = 1e-15
 Identities = 38/69 (55%), Positives = 51/69 (73%)
 Frame = +2

Query: 89  RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
           RL +P  +GLR +I+ EAH   Y  HPG+ KMYQDLK+ +  +G+K+D+A FV +CL CQ
Sbjct: 246 RLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQ 305

Query: 269 QVKALHQRP 295
           QVKA HQ+P
Sbjct: 306 QVKAEHQKP 314


>ref|XP_007099735.1| Uncharacterized protein TCM_045699 [Theobroma cacao]
           gi|508728383|gb|EOY20280.1| Uncharacterized protein
           TCM_045699 [Theobroma cacao]
          Length = 415

 Score =  237 bits (605), Expect = 3e-60
 Identities = 114/174 (65%), Positives = 137/174 (78%)
 Frame = +3

Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
           PISI SDR+++FTSR W  LQ  L TKL+FSTAFHPQTDGQSERTIQTLEDMLR  V+D 
Sbjct: 125 PISIVSDREAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 184

Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
           G  WE  LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + +  
Sbjct: 185 GVKWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 244

Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
           E I  IR+++   Q RQKSYADNRR  ++F++GD  FLKVSP+KG+MRFG KGK
Sbjct: 245 EKIHMIRQKMLTTQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 298



 Score = 68.9 bits (167), Expect = 2e-09
 Identities = 28/47 (59%), Positives = 37/47 (78%)
 Frame = +2

Query: 155 YTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQQVKALHQRP 295
           Y  HPG+ KMYQDLK+ +  +G+K+D+A FV +CL CQQVKA HQ+P
Sbjct: 4   YVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKP 50


>ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma
           cacao] gi|508716770|gb|EOY08667.1| Retrotransposon
           protein, Ty3-gypsy subclass, putative [Theobroma cacao]
          Length = 521

 Score =  237 bits (605), Expect = 3e-60
 Identities = 115/174 (66%), Positives = 136/174 (78%)
 Frame = +3

Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
           PISI SDR ++FTSR W  LQ  L TKL+FSTAFHPQTDGQSERTIQTLEDMLR  V+D 
Sbjct: 231 PISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 290

Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
           G  WE  LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + +  
Sbjct: 291 GVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 350

Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
           E I  IR+R+  AQ R KSYADNRR  ++F++GD  FLKVSP+KG+MRFG KGK
Sbjct: 351 EKIHMIRQRMLTAQSRHKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 404



 Score = 89.7 bits (221), Expect = 1e-15
 Identities = 38/69 (55%), Positives = 51/69 (73%)
 Frame = +2

Query: 89  RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
           RL +P  +GLR +I+ EAH   Y  HPG+ KMYQDLK+ +  +G+K+D+A FV +CL CQ
Sbjct: 88  RLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQ 147

Query: 269 QVKALHQRP 295
           QVKA HQ+P
Sbjct: 148 QVKAEHQKP 156


>ref|XP_007099730.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508728378|gb|EOY20275.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 562

 Score =  237 bits (604), Expect = 4e-60
 Identities = 114/174 (65%), Positives = 137/174 (78%)
 Frame = +3

Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
           PISI SDR+++FTSR W  LQ  L TKL+FSTAFHPQTDGQSERTIQTLEDMLR  V+D 
Sbjct: 338 PISIVSDREAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 397

Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
           G  WE  LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + +  
Sbjct: 398 GVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 457

Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
           E I  I +++  AQ RQKSYADNRR  ++F++GD  FLKVSP+KG+MRFG KGK
Sbjct: 458 EKIHMISQKMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 511



 Score = 71.6 bits (174), Expect = 3e-10
 Identities = 38/102 (37%), Positives = 50/102 (49%), Gaps = 33/102 (32%)
 Frame = +2

Query: 89  RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKK---------------------- 202
           RL +P  +GLR +I+ EAH   Y  HPG+ KMYQDLK+                      
Sbjct: 162 RLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVXXXXXXXXXXXXXXXXXXXXX 221

Query: 203 -----------EFL*DGMKKDIASFVERCLACQQVKALHQRP 295
                          +G+K+D+A FV +CL CQQVKA HQ+P
Sbjct: 222 XXXXXXXXXXXXXWWEGLKRDVAEFVSKCLVCQQVKAEHQKP 263


>ref|XP_007044132.1| Uncharacterized protein TCM_009073 [Theobroma cacao]
           gi|508708067|gb|EOX99963.1| Uncharacterized protein
           TCM_009073 [Theobroma cacao]
          Length = 421

 Score =  236 bits (603), Expect = 6e-60
 Identities = 115/174 (66%), Positives = 137/174 (78%)
 Frame = +3

Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
           PISI SDR ++FTSR W  LQ  L TKL+FSTAFHPQTDGQSERTIQTLEDMLR  V+D 
Sbjct: 117 PISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 176

Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
           G  WE  LPL+EFA NNS+Q +I M P++ALYGR+CRSP+ W EVG RKLLGP+ + +  
Sbjct: 177 GVRWEQYLPLVEFAYNNSFQTSIQMAPFKALYGRRCRSPIGWLEVGERKLLGPELVQDAT 236

Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
           E I  IR+R+  AQ RQKSYADNRR  ++F++GD  FLKVSP+KG+MRFG KGK
Sbjct: 237 EKIHIIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 290


>ref|XP_007032400.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508711429|gb|EOY03326.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1447

 Score =  236 bits (602), Expect = 7e-60
 Identities = 115/174 (66%), Positives = 136/174 (78%)
 Frame = +3

Query: 294  PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473
            PISI SDR ++FTSR W  LQ  L TKL+FSTAFHPQTDGQSERTIQTLE MLR  V+D 
Sbjct: 1157 PISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEAMLRACVIDL 1216

Query: 474  GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653
            G  WE  LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + +  
Sbjct: 1217 GVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 1276

Query: 654  ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815
            E I  IR+R+  AQ RQKSYADNRR  ++F++GD  FLKVSP+KG+MRFG KGK
Sbjct: 1277 EKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 1330



 Score = 89.7 bits (221), Expect = 1e-15
 Identities = 38/69 (55%), Positives = 51/69 (73%)
 Frame = +2

Query: 89   RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268
            RL +P  +GLR +I+ EAH   Y  HPG+ KMYQDLK+ +  +G+K+D+A FV +CL CQ
Sbjct: 1014 RLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQ 1073

Query: 269  QVKALHQRP 295
            QVKA HQ+P
Sbjct: 1074 QVKAEHQKP 1082

