BLASTX nr result
ID: Mentha23_contig00040417
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00040417
         (815 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAN04919.1| Putative polyprotein [Oryza sativa Japonica Group...  216   3e-72
ref|XP_007050063.1| Uncharacterized protein TCM_003746 [Theobrom...  214   6e-72
gb|AAR89003.1| putative polyprotein [Oryza sativa Japonica Group...  215   8e-71
gb|ABF95081.1| retrotransposon protein, putative, Ty3-gypsy subc...  215   3e-68
gb|AAR89842.1| putative polyprotein [Oryza sativa Japonica Group]    207   3e-67
emb|CAE01541.2| OSJNBa0033G05.1 [Oryza sativa Japonica Group] gi...  197   7e-67
gb|ABB47446.1| retrotransposon protein, putative, Ty3-gypsy subc...  197   1e-66
gb|ABA97518.1| retrotransposon protein, putative, Ty3-gypsy subc...  196   7e-66
gb|ABA97617.1| retrotransposon protein, putative, Ty3-gypsy subc...  195   1e-65
gb|AAQ56379.1| putative polyprotein [Oryza sativa Japonica Group]    199   2e-65
gb|ABA94145.1| retrotransposon protein, putative, Ty3-gypsy subc...  196   4e-64
gb|AAF18642.1|AC006228_13 F5J5.15 [Arabidopsis thaliana]             191   5e-64
emb|CAH67760.1| H0124E07.7 [Oryza sativa Indica Group]               187   2e-63
ref|XP_007037177.1| DNA/RNA polymerases superfamily protein [The...  239   9e-61
ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [The...  237   3e-60
ref|XP_007099735.1| Uncharacterized protein TCM_045699 [Theobrom...  237   3e-60
ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass,...  237   3e-60
ref|XP_007099730.1| DNA/RNA polymerases superfamily protein [The...  237   4e-60
ref|XP_007044132.1| Uncharacterized protein TCM_009073 [Theobrom...
236 6e-60 ref|XP_007032400.1| DNA/RNA polymerases superfamily protein [The... 236 7e-60 >gb|AAN04919.1| Putative polyprotein [Oryza sativa Japonica Group] gi|31430212|gb|AAP52158.1| retrotransposon protein, putative, Ty3-gypsy subclass, expressed [Oryza sativa Japonica Group] Length = 1719 Score = 216 bits (549), Expect(2) = 3e-72 Identities = 102/171 (59%), Positives = 128/171 (74%) Frame = +3 Query: 303 ITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDRGGH 482 I SDR S+FTS+ W LQ EL T+LNFSTA+HPQTDGQ+ER Q LEDMLR LD GG Sbjct: 1422 IVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTDGQTERVNQILEDMLRACALDFGGA 1481 Query: 483 WEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMVETI 662 W+ LP EF+ NNSYQA++ M P+EALYGRKCR+PL+WD+ G R+L G + + E E + Sbjct: 1482 WDKSLPYAEFSYNNSYQASLQMAPFEALYGRKCRTPLFWDQTGERQLFGTEVLAEAEEKV 1541 Query: 663 RQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 R IRER++ AQ RQKSYADNRR ++ FE GD +L+V+P +G+ RF KGK Sbjct: 1542 RIIRERLRIAQSRQKSYADNRRRELTFEAGDHVYLRVTPLRGVHRFQTKGK 1592 Score = 83.6 bits (205), Expect(2) = 3e-72 Identities = 38/69 (55%), Positives = 52/69 (75%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 RLC+P ++ L++ I++EAH T Y+ HPGS KMYQDLK++F M+++IA FV C CQ Sbjct: 1325 RLCVPDDKELKDLILTEAHQTQYSIHPGSTKMYQDLKEKFWWVSMRREIAEFVALCDVCQ 1384 Query: 269 QVKALHQRP 295 +VKA HQRP Sbjct: 1385 RVKAEHQRP 1393 >ref|XP_007050063.1| Uncharacterized protein TCM_003746 [Theobroma cacao] gi|508702324|gb|EOX94220.1| Uncharacterized protein TCM_003746 [Theobroma cacao] Length = 267 Score = 214 bits (544), Expect(2) = 6e-72 Identities = 108/165 (65%), Positives = 124/165 (75%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 P+SI SDRD +FTSR W Q L TKL FSTAFHPQTDGQ ERTIQTLEDMLR V+D Sbjct: 100 PVSIVSDRDPRFTSRFWPKFQEALGTKLKFSTAFHPQTDGQLERTIQTLEDMLRACVIDF 159 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 G W+ LPL+EFA NNS+Q+ I M PYEALYGRKCR+PL WDEVG RKL+ + I 
Sbjct: 160 IGSWDRHLPLVEFAYNNSFQSIIGMAPYEALYGRKCRTPLCWDEVGERKLVSVELIELTN 219 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKG 788 + I+ IRER+K AQDR+KSYAD RR ++FEI D FLKVSP KG Sbjct: 220 DKIKVIRERLKVAQDRRKSYADKRRKDLEFEIDDNVFLKVSPWKG 264 Score = 84.7 bits (208), Expect(2) = 6e-72 Identities = 37/73 (50%), Positives = 48/73 (65%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 R+C+P LR IM EAH + Y HPGS KMY+ +++ + MK+D+A FV +CL CQ Sbjct: 5 RVCVPEGNQLRQAIMEEAHSSAYALHPGSTKMYRTIRENYWWPSMKRDVAEFVAKCLVCQ 64 Query: 269 QVKALHQRPYLNH 307 QVKA HQRP H Sbjct: 65 QVKAEHQRPAAIH 77 >gb|AAR89003.1| putative polyprotein [Oryza sativa Japonica Group] gi|108709031|gb|ABF96826.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 934 Score = 215 bits (547), Expect(2) = 8e-71 Identities = 103/174 (59%), Positives = 129/174 (74%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 P I SDR S+FTS+ W LQ EL T+LNFSTA+HPQT+GQ+ER Q LEDMLR LD Sbjct: 634 PKKIVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTNGQTERVNQILEDMLRACALDF 693 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 GG W+ LP EF+ NNSYQA++ MVP+EALYGRKCR+PL+WD+ G R+L G + + E Sbjct: 694 GGAWDKSLPYAEFSYNNSYQASLQMVPFEALYGRKCRTPLFWDQAGERQLFGIEVLAEAE 753 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E +R IRER++ AQ RQKSYADNRR ++ FE GD +L V+P +G+ RF KGK Sbjct: 754 EKVRTIRERLRIAQSRQKSYADNRRRELTFEAGDYVYLLVTPLRGVHRFQTKGK 807 Score = 79.7 bits (195), Expect(2) = 8e-71 Identities = 37/69 (53%), Positives = 51/69 (73%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 RLC+ ++ L++ I++EAH T Y+ HPGS KMYQDLK++F M+++IA FV C CQ Sbjct: 540 RLCVLDDKELKDLILTEAHQTQYSIHPGSTKMYQDLKEKFWWVSMRREIAEFVALCDVCQ 599 Query: 269 QVKALHQRP 295 +VKA HQRP Sbjct: 600 RVKAEHQRP 608 >gb|ABF95081.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length 
= 1329 Score = 215 bits (548), Expect(2) = 3e-68 Identities = 101/174 (58%), Positives = 129/174 (74%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 P I SDR S+FTS+ W LQ EL T+LNFSTA+HPQTDGQ+ER Q LEDMLR LD Sbjct: 1029 PKKIVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTDGQTERVNQILEDMLRACALDF 1088 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 GG W+ LP EF+ NNSYQ+++ M P+EALYGRKCR+PL+WD+ G R+L G + + E Sbjct: 1089 GGAWDKSLPYAEFSYNNSYQSSLQMAPFEALYGRKCRTPLFWDQTGERQLFGTEVLTEAE 1148 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E +R +RER++ AQ RQKSYADNRR ++ FE GD +L+V+P +G+ RF KGK Sbjct: 1149 EKVRTVRERLRIAQSRQKSYADNRRRELTFEAGDYVYLRVTPLRGVHRFQTKGK 1202 Score = 70.9 bits (172), Expect(2) = 3e-68 Identities = 35/77 (45%), Positives = 49/77 (63%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 RLC+P ++ L++ I++EAH T Y+ HPGS KMYQDLK++F M+K+IA FV C CQ Sbjct: 954 RLCVPDDKELKDLILTEAHQTHYSIHPGSTKMYQDLKEKFWWVSMRKEIAEFVALCDVCQ 1013 Query: 269 QVKALHQRPYLNHLGPR 319 + R H P+ Sbjct: 1014 LAELYLSRIMCLHGVPK 1030 >gb|AAR89842.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1789 Score = 207 bits (526), Expect(2) = 3e-67 Identities = 101/174 (58%), Positives = 124/174 (71%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 P I SDR S+FTS W LQ EL T+LNFSTA+HPQTDGQ+ER Q LEDMLR VLD Sbjct: 1516 PKKIVSDRGSQFTSHFWKKLQEELGTRLNFSTAYHPQTDGQTERLNQILEDMLRACVLDF 1575 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 G W+ LP EF+ NNSYQA+I M PYEALYGRKCR+PL WD+VG ++ G D + E Sbjct: 1576 GKTWDKSLPYAEFSYNNSYQASIQMAPYEALYGRKCRTPLMWDQVGESQVFGTDILREAE 1635 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 +R IR+ +K AQ RQKSYADNRR ++F + D +L+V+P +G+ RF KGK Sbjct: 1636 AKVRTIRDNLKVAQSRQKSYADNRRRDLEFAVDDFVYLRVTPLRGVHRFRTKGK 1689 Score = 75.9 bits (185), Expect(2) = 3e-67 Identities = 34/69 (49%), Positives = 
48/69 (69%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 R+C+P N L+ I+ EAH++ Y+ HPGS KMY DL++++ MK++IA F C CQ Sbjct: 1417 RVCVPDNRELKQLILQEAHESPYSIHPGSTKMYLDLREKYWWVSMKREIAEFEALCDVCQ 1476 Query: 269 QVKALHQRP 295 +VKA HQRP Sbjct: 1477 RVKAEHQRP 1485 >emb|CAE01541.2| OSJNBa0033G05.1 [Oryza sativa Japonica Group] gi|38347324|emb|CAE05974.2| OSJNBa0063C18.15 [Oryza sativa Japonica Group] Length = 1764 Score = 197 bits (500), Expect(2) = 7e-67 Identities = 96/174 (55%), Positives = 123/174 (70%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 P I SDR ++FTSR W L L T LNFSTA+HPQTDGQ+ER Q LEDMLR+ LD Sbjct: 1475 PKKIVSDRGTQFTSRFWKQLHEALGTDLNFSTAYHPQTDGQTERVNQILEDMLRSCALDF 1534 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 G W+ L EF+ NN YQA+I M+P EA++GRKCR+PL W+EVG + GPD + Sbjct: 1535 EGTWDRCLLYAEFSYNNGYQASIQMLPNEAMFGRKCRTPLCWNEVGKALVFGPDILKSAE 1594 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E ++ RER+K AQ+RQK+YADNRR ++FE GD +L+VSP +G+ RFG+ GK Sbjct: 1595 EQVKLTRERLKTAQNRQKNYADNRRRDLEFEKGDHVYLRVSPLRGMRRFGMSGK 1648 Score = 84.7 bits (208), Expect(2) = 7e-67 Identities = 37/69 (53%), Positives = 50/69 (72%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 R+C+P + LR+ I+ EAH++ Y+ HPGS KMYQD+K F GMK+D+A +V C CQ Sbjct: 1376 RICVPAKKELRDLILKEAHESAYSIHPGSTKMYQDIKAYFWWTGMKRDVAEYVALCDVCQ 1435 Query: 269 QVKALHQRP 295 +VKA HQRP Sbjct: 1436 RVKAEHQRP 1444 >gb|ABB47446.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1705 Score = 197 bits (501), Expect(2) = 1e-66 Identities = 98/177 (55%), Positives = 125/177 (70%), Gaps = 3/177 (1%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 P I SDR S+FTS+ W LQ EL T+LNFSTA+HPQTDGQ+ER LEDMLR LD Sbjct: 1402 PKKIVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTDGQTERVNLILEDMLRACALDF 1461 Query: 474 
GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRS---PLYWDEVG*RKLLGPDAIG 644 GG W+ LP EF+ NNSYQA++ M P+EALYGRK + L+WD+ G R+L G + + Sbjct: 1462 GGAWDKSLPYAEFSYNNSYQASLQMAPFEALYGRKLYTAICALFWDQTGERQLFGTEVLA 1521 Query: 645 EMVETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E+ E +R IRER++ AQ RQKSYADNRR ++ FE GD +L+V+P +G+ F KGK Sbjct: 1522 EVEEKVRIIRERLRIAQSRQKSYADNRRRELTFEEGDYVYLRVTPLRGVHHFQTKGK 1578 Score = 83.6 bits (205), Expect(2) = 1e-66 Identities = 38/69 (55%), Positives = 52/69 (75%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 RLC+P ++ L++ I++EAH T Y+ HPGS KMYQDLK++F M+++IA FV C CQ Sbjct: 1302 RLCVPDDKELKDLILTEAHQTQYSIHPGSTKMYQDLKEKFWWVSMRREIAEFVALCDVCQ 1361 Query: 269 QVKALHQRP 295 +VKA HQRP Sbjct: 1362 RVKAEHQRP 1370 >gb|ABA97518.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1524 Score = 196 bits (499), Expect(2) = 7e-66 Identities = 94/160 (58%), Positives = 118/160 (73%) Frame = +3 Query: 336 RLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDRGGHWEPILPLIEFA 515 RL LQ EL T+LNFSTA+HPQTDGQ+ER Q LEDMLR LD GG W+ LP EF+ Sbjct: 1238 RLTKKLQEELGTRLNFSTAYHPQTDGQTERVNQILEDMLRACALDFGGAWDKSLPYAEFS 1297 Query: 516 *NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMVETIRQIRERIKEAQ 695 NNSYQA++ M P+EALY RKCR+PL+WD+ G R+L G + + E E +R IRER++ AQ Sbjct: 1298 YNNSYQASLQMAPFEALYDRKCRTPLFWDQTGERQLFGTEVLAEAEEKVRTIRERLRIAQ 1357 Query: 696 DRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 RQKSYADNRR ++ FE GD +L V+P +G+ RF +KGK Sbjct: 1358 SRQKSYADNRRRELTFEAGDYVYLHVTPLRGVHRFQIKGK 1397 Score = 81.6 bits (200), Expect(2) = 7e-66 Identities = 37/69 (53%), Positives = 51/69 (73%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 RLC+P ++ ++ I++EAH T Y+ HPGS KMYQDLK++F M+++IA FV C CQ Sbjct: 1129 RLCVPDDKEQKDLILTEAHQTQYSIHPGSTKMYQDLKEKFWWVSMRREIAEFVALCNVCQ 1188 Query: 269 QVKALHQRP 295 +VKA HQRP Sbjct: 1189 RVKAEHQRP 1197 >gb|ABA97617.1| retrotransposon protein, putative, 
Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1243 Score = 195 bits (495), Expect(2) = 1e-65 Identities = 93/162 (57%), Positives = 117/162 (72%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 P I SDR S+FTS+ W LQ EL T+LNFSTA+HPQTDGQ+ER Q EDMLR LD Sbjct: 1068 PKKIVSDRGSQFTSKFWQKLQEELGTRLNFSTAYHPQTDGQTERVNQIWEDMLRACALDF 1127 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 GG W+ LP EF+ NN+YQA++ M P+EALYGRKC +PL+WD+ G R+L G + + E Sbjct: 1128 GGAWDKNLPYAEFSYNNNYQASLQMAPFEALYGRKCHTPLFWDQTGERQLFGTEVLAEAE 1187 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSP 779 E +R IRER++ AQ RQKSYADN R ++ FE GD +L V+P Sbjct: 1188 EKVRIIRERLRIAQSRQKSYADNPRRELTFEAGDYVYLHVTP 1229 Score = 82.4 bits (202), Expect(2) = 1e-65 Identities = 38/69 (55%), Positives = 51/69 (73%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 RLC+P ++ L++ I++EAH T Y+ HPGS KMYQDLK++F M ++IA FV C CQ Sbjct: 974 RLCVPDDKELKDLILTEAHQTQYSIHPGSTKMYQDLKEKFWWVSMTREIAEFVALCDVCQ 1033 Query: 269 QVKALHQRP 295 +VKA HQRP Sbjct: 1034 RVKAEHQRP 1042 >gb|AAQ56379.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1689 Score = 199 bits (505), Expect(2) = 2e-65 Identities = 98/171 (57%), Positives = 121/171 (70%) Frame = +3 Query: 303 ITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDRGGH 482 I SDR S+FTS W LQ EL T+LNFSTA+HPQTDGQ+ER Q LEDMLR VLD G Sbjct: 1392 IVSDRGSQFTSHFWKKLQEELGTRLNFSTAYHPQTDGQTERLNQILEDMLRACVLDFGKT 1451 Query: 483 WEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMVETI 662 W+ L EF+ NNSYQA+I M PYEALYGRKCR+PL WD+VG ++ G D + E + Sbjct: 1452 WDKSLLYAEFSYNNSYQASIQMAPYEALYGRKCRTPLLWDQVGESQVFGTDILREAEAKV 1511 Query: 663 RQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 R IR+ +K AQ RQKSYAD RR ++F + D +L+V+P +G+ RF KGK Sbjct: 1512 RTIRDNLKVAQSRQKSYADTRRRNLEFAMDDFVYLRVTPLRGVHRFQTKGK 1562 Score = 77.8 bits (190), Expect(2) = 2e-65 Identities = 35/69 (50%), Positives = 50/69 
(72%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 R+C+P ++ L+ I+ EAH++ Y+ HPGS KMY DLK+++ MK++IA FV C CQ Sbjct: 1286 RVCVPNDKELKQLILQEAHESPYSIHPGSTKMYFDLKEKYWWVSMKREIAKFVALCDVCQ 1345 Query: 269 QVKALHQRP 295 +VKA HQRP Sbjct: 1346 RVKAEHQRP 1354 >gb|ABA94145.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 927 Score = 196 bits (497), Expect(2) = 4e-64 Identities = 94/161 (58%), Positives = 120/161 (74%), Gaps = 2/161 (1%) Frame = +3 Query: 339 LWISLQR--ELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDRGGHWEPILPLIEF 512 +W+ + R T+L FSTAFHPQ DGQSERTIQTLEDMLR+ +L G WE LPL+EF Sbjct: 724 IWVVVDRLKAFDTQLKFSTAFHPQADGQSERTIQTLEDMLRSCILSWKGSWEDHLPLVEF 783 Query: 513 A*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMVETIRQIRERIKEA 692 NNS+ A+I + PYEALYGRKCRS L WD +G R +LGPD + + E I +IR+ + A Sbjct: 784 TYNNSFHASIQVAPYEALYGRKCRSLLCWDSIGERAILGPDWVQQTTERIAEIRQHMLAA 843 Query: 693 QDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 Q RQKSYAD +R +++FE+GD+ LKVSP+KG++RFG KGK Sbjct: 844 QSRQKSYADVKRRELEFEVGDQVLLKVSPTKGVVRFGTKGK 884 Score = 76.6 bits (187), Expect(2) = 4e-64 Identities = 37/85 (43%), Positives = 50/85 (58%) Frame = +2 Query: 41 TKKWTMRLQSKEGL*GRLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DG 220 T+ +T+ GRLC+P ++ +I+ EAH T YT HPG KMY DLKK + Sbjct: 605 TRDFTLDSSGAVRFHGRLCVPQKAKVKEEILREAHRTPYTVHPGENKMYHDLKKIYWWKR 664 Query: 221 MKKDIASFVERCLACQQVKALHQRP 295 MK D+A +V C CQ+VKA H+ P Sbjct: 665 MKVDVAKYVASCGVCQRVKAEHKSP 689 >gb|AAF18642.1|AC006228_13 F5J5.15 [Arabidopsis thaliana] Length = 1617 Score = 191 bits (484), Expect(2) = 5e-64 Identities = 92/153 (60%), Positives = 115/153 (75%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 P+SI S RDSKFTS W + Q E+ TK+ STA+HPQTDGQSERTIQTLEDML+ VLD Sbjct: 1117 PVSILSHRDSKFTSAFWRAFQVEMGTKVQMSTAYHPQTDGQSERTIQTLEDMLQMCVLDW 1176 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 GGHW L L++FA NNSYQA+I M P+EALYGR 
CR+ L W +VG + + G D + E Sbjct: 1177 GGHWADHLSLVKFAYNNSYQASIGMAPFEALYGRPCRTLLCWTQVGEKSIYGADYVQETT 1236 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIG 752 E IR ++ +KEAQDRQ+SYAD RR +++FE+G Sbjct: 1237 ERIRVLKLNMKEAQDRQRSYADKRRRELEFEVG 1269 Score = 81.3 bits (199), Expect(2) = 5e-64 Identities = 35/69 (50%), Positives = 52/69 (75%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 R+C+P +E LR +I+SEAH ++++ HPG+ KMY+DLK+ + GMK+D+A++V C CQ Sbjct: 1013 RVCLPKDEELRREILSEAHASMFSIHPGATKMYRDLKRHYQWVGMKRDVANWVTECDVCQ 1072 Query: 269 QVKALHQRP 295 VKA HQ P Sbjct: 1073 LVKAEHQVP 1081 >emb|CAH67760.1| H0124E07.7 [Oryza sativa Indica Group] Length = 1430 Score = 187 bits (474), Expect(2) = 2e-63 Identities = 93/176 (52%), Positives = 122/176 (69%) Frame = +3 Query: 288 NDPISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVL 467 +D I + DR +K + LQ E+ +KLNFSTA+HPQTDGQ+ER Q LEDMLR L Sbjct: 1129 HDSIWVIVDRLTK-VAHFIPKLQEEMGSKLNFSTAYHPQTDGQTERVNQILEDMLRVCAL 1187 Query: 468 DRGGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGE 647 D GG W+ LP EF+ NNSYQA++ M PYEALYGRKCR+PL WD+ G R++ G D + + Sbjct: 1188 DFGGSWDKNLPYAEFSYNNSYQASLQMAPYEALYGRKCRTPLLWDQTGERQVFGTDILRK 1247 Query: 648 MVETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E ++ I+ER++ AQ KSYADNRR + FE GD +L+V+P +G+ RF KGK Sbjct: 1248 AEEKVKIIQERLRVAQSSHKSYADNRRRDLSFEEGDYVYLRVTPLRGVHRFHTKGK 1303 Score = 83.6 bits (205), Expect(2) = 2e-63 Identities = 37/69 (53%), Positives = 49/69 (71%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 R+C+P N+ L++ I+ EAHDTLY+ HP S KMYQDLK+ F MK +I +V C CQ Sbjct: 1029 RICVPDNKDLKDAILKEAHDTLYSIHPSSTKMYQDLKERFWWASMKHEITEYVAVCDVCQ 1088 Query: 269 QVKALHQRP 295 +VKA HQ+P Sbjct: 1089 RVKAEHQKP 1097 >ref|XP_007037177.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508774422|gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 448 Score = 239 bits (610), Expect = 9e-61 
Identities = 117/174 (67%), Positives = 137/174 (78%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 PISI SDR ++FTSR W LQ L TKL+FSTAFHPQTDGQSERTIQTLEDMLR V+D Sbjct: 158 PISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 217 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 G WE LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + + Sbjct: 218 GVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 277 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E I IR+R+ AQ RQKSYADNRR ++F++GD FLKVSP+KGIMRFG KGK Sbjct: 278 EKIHMIRQRMLTAQSRQKSYADNRRRYLEFQVGDHVFLKVSPTKGIMRFGKKGK 331 Score = 89.7 bits (221), Expect = 1e-15 Identities = 38/69 (55%), Positives = 51/69 (73%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 RL +P +GLR +I+ EAH Y HPG+ KMYQDLK+ + +G+K+D+A FV +CL CQ Sbjct: 15 RLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQ 74 Query: 269 QVKALHQRP 295 QVKA HQ+P Sbjct: 75 QVKAEHQKP 83 >ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508779195|gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 679 Score = 237 bits (605), Expect = 3e-60 Identities = 115/174 (66%), Positives = 136/174 (78%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 PISI SDR ++FTSR W LQ L TKL+FSTAFHPQTDGQSERTIQTLEDMLR V+D Sbjct: 389 PISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 448 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 G WE LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + + Sbjct: 449 GVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 508 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E I IR+R+ AQ RQKSYADNRR ++F++GD FLK SP+KG+MRFG KGK Sbjct: 509 EKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKFSPTKGVMRFGKKGK 562 Score = 89.7 bits (221), Expect = 1e-15 Identities = 38/69 (55%), 
Positives = 51/69 (73%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 RL +P +GLR +I+ EAH Y HPG+ KMYQDLK+ + +G+K+D+A FV +CL CQ Sbjct: 246 RLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQ 305 Query: 269 QVKALHQRP 295 QVKA HQ+P Sbjct: 306 QVKAEHQKP 314 >ref|XP_007099735.1| Uncharacterized protein TCM_045699 [Theobroma cacao] gi|508728383|gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao] Length = 415 Score = 237 bits (605), Expect = 3e-60 Identities = 114/174 (65%), Positives = 137/174 (78%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 PISI SDR+++FTSR W LQ L TKL+FSTAFHPQTDGQSERTIQTLEDMLR V+D Sbjct: 125 PISIVSDREAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 184 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 G WE LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + + Sbjct: 185 GVKWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 244 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E I IR+++ Q RQKSYADNRR ++F++GD FLKVSP+KG+MRFG KGK Sbjct: 245 EKIHMIRQKMLTTQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 298 Score = 68.9 bits (167), Expect = 2e-09 Identities = 28/47 (59%), Positives = 37/47 (78%) Frame = +2 Query: 155 YTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQQVKALHQRP 295 Y HPG+ KMYQDLK+ + +G+K+D+A FV +CL CQQVKA HQ+P Sbjct: 4 YVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKP 50 >ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] gi|508716770|gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] Length = 521 Score = 237 bits (605), Expect = 3e-60 Identities = 115/174 (66%), Positives = 136/174 (78%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 PISI SDR ++FTSR W LQ L TKL+FSTAFHPQTDGQSERTIQTLEDMLR V+D Sbjct: 231 PISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 290 Query: 474 
GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 G WE LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + + Sbjct: 291 GVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 350 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E I IR+R+ AQ R KSYADNRR ++F++GD FLKVSP+KG+MRFG KGK Sbjct: 351 EKIHMIRQRMLTAQSRHKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 404 Score = 89.7 bits (221), Expect = 1e-15 Identities = 38/69 (55%), Positives = 51/69 (73%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 RL +P +GLR +I+ EAH Y HPG+ KMYQDLK+ + +G+K+D+A FV +CL CQ Sbjct: 88 RLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQ 147 Query: 269 QVKALHQRP 295 QVKA HQ+P Sbjct: 148 QVKAEHQKP 156 >ref|XP_007099730.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508728378|gb|EOY20275.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 562 Score = 237 bits (604), Expect = 4e-60 Identities = 114/174 (65%), Positives = 137/174 (78%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 PISI SDR+++FTSR W LQ L TKL+FSTAFHPQTDGQSERTIQTLEDMLR V+D Sbjct: 338 PISIVSDREAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 397 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 G WE LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + + Sbjct: 398 GVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 457 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E I I +++ AQ RQKSYADNRR ++F++GD FLKVSP+KG+MRFG KGK Sbjct: 458 EKIHMISQKMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 511 Score = 71.6 bits (174), Expect = 3e-10 Identities = 38/102 (37%), Positives = 50/102 (49%), Gaps = 33/102 (32%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKK---------------------- 202 RL +P +GLR +I+ EAH Y HPG+ KMYQDLK+ Sbjct: 162 RLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVXXXXXXXXXXXXXXXXXXXXX 221 Query: 203 
-----------EFL*DGMKKDIASFVERCLACQQVKALHQRP 295 +G+K+D+A FV +CL CQQVKA HQ+P Sbjct: 222 XXXXXXXXXXXXXWWEGLKRDVAEFVSKCLVCQQVKAEHQKP 263 >ref|XP_007044132.1| Uncharacterized protein TCM_009073 [Theobroma cacao] gi|508708067|gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao] Length = 421 Score = 236 bits (603), Expect = 6e-60 Identities = 115/174 (66%), Positives = 137/174 (78%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 PISI SDR ++FTSR W LQ L TKL+FSTAFHPQTDGQSERTIQTLEDMLR V+D Sbjct: 117 PISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDL 176 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 G WE LPL+EFA NNS+Q +I M P++ALYGR+CRSP+ W EVG RKLLGP+ + + Sbjct: 177 GVRWEQYLPLVEFAYNNSFQTSIQMAPFKALYGRRCRSPIGWLEVGERKLLGPELVQDAT 236 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E I IR+R+ AQ RQKSYADNRR ++F++GD FLKVSP+KG+MRFG KGK Sbjct: 237 EKIHIIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 290 >ref|XP_007032400.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508711429|gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1447 Score = 236 bits (602), Expect = 7e-60 Identities = 115/174 (66%), Positives = 136/174 (78%) Frame = +3 Query: 294 PISITSDRDSKFTSRLWISLQRELRTKLNFSTAFHPQTDGQSERTIQTLEDMLRTIVLDR 473 PISI SDR ++FTSR W LQ L TKL+FSTAFHPQTDGQSERTIQTLE MLR V+D Sbjct: 1157 PISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEAMLRACVIDL 1216 Query: 474 GGHWEPILPLIEFA*NNSYQATIDMVPYEALYGRKCRSPLYWDEVG*RKLLGPDAIGEMV 653 G WE LPL+EFA NNS+Q +I M P+EALYGR+CRSP+ W EVG RKLLGP+ + + Sbjct: 1217 GVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDAT 1276 Query: 654 ETIRQIRERIKEAQDRQKSYADNRRTKIQFEIGDKKFLKVSPSKGIMRFGVKGK 815 E I IR+R+ AQ RQKSYADNRR ++F++GD FLKVSP+KG+MRFG KGK Sbjct: 1277 EKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 1330 Score = 89.7 bits (221), Expect = 1e-15 Identities = 38/69 (55%), Positives = 51/69 
(73%) Frame = +2 Query: 89 RLCIPTNEGLRNDIMSEAHDTLYTAHPGSIKMYQDLKKEFL*DGMKKDIASFVERCLACQ 268 RL +P +GLR +I+ EAH Y HPG+ KMYQDLK+ + +G+K+D+A FV +CL CQ Sbjct: 1014 RLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQ 1073 Query: 269 QVKALHQRP 295 QVKA HQ+P Sbjct: 1074 QVKAEHQKP 1082