BLASTX nr result
ID: Catharanthus23_contig00029551
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00029551 (335 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EOX93994.1| DNA/RNA polymerases superfamily protein [Theobrom... 175 5e-42 gb|EOY14138.1| Uncharacterized protein TCM_033423 [Theobroma cacao] 171 9e-41 gb|EOY32328.1| Uncharacterized protein TCM_040115 [Theobroma cacao] 171 1e-40 gb|EOY26510.1| DNA/RNA polymerases superfamily protein [Theobrom... 170 2e-40 gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobrom... 170 2e-40 gb|EOX94092.1| Gag protease polyprotein [Theobroma cacao] 170 2e-40 gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom... 168 8e-40 gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobrom... 167 1e-39 gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom... 167 1e-39 gb|EOY20325.1| Retrotransposon protein, Ty3-gypsy subclass, puta... 167 1e-39 gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao] 167 1e-39 gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, puta... 167 1e-39 gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobrom... 167 1e-39 gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom... 167 1e-39 gb|EOX94089.1| Uncharacterized protein TCM_003206 [Theobroma cacao] 167 2e-39 gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom... 166 2e-39 gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao] 166 4e-39 gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobrom... 165 5e-39 gb|EOY08653.1| DNA/RNA polymerases superfamily protein [Theobrom... 164 9e-39 emb|CAC44142.1| putative polyprotein [Cicer arietinum] 164 1e-38 >gb|EOX93994.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 811 Score = 175 bits (444), Expect = 5e-42 Identities = 73/111 (65%), Positives = 95/111 (85%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+ +K+ +WW GMKR+IA++VA CLTCQ++KAEHQKS+G LQPLP+PEWKW+H Sbjct: 513 HPGSTKMYRTIKESYWWPGMKRDIAKFVAKCLTCQQIKAEHQKSSGTLQPLPIPEWKWEH 572 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 VTMDFV GLPRT+ KDAIWVIVD LTK AHF+AI+ +Y +++L ++Y+DE Sbjct: 573 VTMDFVLGLPRTQSGKDAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDE 623 >gb|EOY14138.1| Uncharacterized protein TCM_033423 [Theobroma cacao] Length = 809 Score = 171 bits (433), Expect = 9e-41 Identities = 72/111 (64%), Positives = 93/111 (83%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+ +K+ +WW GMKR+IAE+VA CLTCQ++KAEHQK +G LQPL +PEWKW+H Sbjct: 617 HPGSTKMYRTIKESYWWPGMKRDIAEFVAKCLTCQQIKAEHQKPSGTLQPLLIPEWKWEH 676 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 VTMDFV GLPRT+ KDAIWVIVD LTK AHF+AI+ +Y +++L ++Y+DE Sbjct: 677 VTMDFVLGLPRTQSGKDAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDE 727 >gb|EOY32328.1| Uncharacterized protein TCM_040115 [Theobroma cacao] Length = 363 Score = 171 bits (432), Expect = 1e-40 Identities = 72/111 (64%), Positives = 92/111 (82%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+ +++ +WW GMKR++AE+VA CL CQ+VKAEHQ+ AG LQ LPVPEWKW+H Sbjct: 226 HPGSTKMYRTIRENYWWPGMKRDVAEFVAKCLVCQQVKAEHQRPAGTLQSLPVPEWKWEH 285 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 VTMDFV GLPRT+ KDAIWVIVD LTK AHF+A++ +Y ++KL Q+Y+DE Sbjct: 286 VTMDFVLGLPRTQRGKDAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDE 336 >gb|EOY26510.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1290 Score = 170 bits (431), Expect = 2e-40 Identities = 71/111 (63%), Positives = 93/111 (83%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+ +K+ +WW GMKR+IAE+VA CL CQ++KAEHQKS+G LQPLP+PEWKW+H Sbjct: 901 HPGSTKMYQTIKESYWWPGMKRDIAEFVAKCLICQQIKAEHQKSSGTLQPLPIPEWKWEH 960 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 VTMDFV GLPRT+ KDAIWVI+ LTK AHF+AI+ +Y +++L ++Y+DE Sbjct: 961 VTMDFVLGLPRTQSGKDAIWVIMGRLTKSAHFLAIHSTYSIERLARLYIDE 1011 >gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1537 Score = 170 bits (431), Expect = 2e-40 Identities = 71/111 (63%), Positives = 93/111 (83%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+ +K+ +WW GM+R+IAE+VA CLTCQ++KAEHQK +G LQPL +PEWKW+H Sbjct: 1112 HPGSTKMYRTIKESYWWPGMERDIAEFVAKCLTCQQIKAEHQKPSGTLQPLSIPEWKWEH 1171 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 VTMDFV GLPRT+ KDAIWVIVD LTK AHF+AI+ +Y +++L ++Y+DE Sbjct: 1172 VTMDFVLGLPRTQSGKDAIWVIVDRLTKSAHFLAIHSTYSIERLARLYIDE 1222 >gb|EOX94092.1| Gag protease polyprotein [Theobroma cacao] Length = 269 Score = 170 bits (430), Expect = 2e-40 Identities = 72/111 (64%), Positives = 91/111 (81%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+ +K+ +WW GMKR++AE+VA CL CQ+VKAEHQ+ AG LQ LPVPEWKW+H Sbjct: 12 HPGSTKMYRTIKENYWWPGMKRDVAEFVAKCLVCQQVKAEHQRPAGTLQSLPVPEWKWEH 71 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 VTMDFV GLPRT+ DAIWVIVD LTK AHF+A++ +Y ++KL Q+Y+DE Sbjct: 72 VTMDFVLGLPRTQRGNDAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDE 122 >gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 666 Score = 168 bits (425), Expect = 8e-40 Identities = 69/111 (62%), Positives = 90/111 (81%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DLK+ +WW G+KR++AE+V+ CL CQ+VKAEHQK AGLLQPLPVPEWKW+H Sbjct: 365 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 424 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 + MDFVTGLPRT G D+IW++VD LTK AHF+++ +Y + ++YVDE Sbjct: 425 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLSVKTTYGAAQYARVYVDE 475 >gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 448 Score = 167 bits (423), Expect = 1e-39 Identities = 69/111 (62%), Positives = 89/111 (80%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DLK+ +WW G+KR++AE+V+ CL CQ+VKAEHQK AGLLQPLPVPEWKW+H Sbjct: 40 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 99 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 + MDFVTGLPRT G D+IW++VD LTK AHF+ + +Y + ++YVDE Sbjct: 100 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDE 150 >gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 878 Score = 167 bits (423), Expect = 1e-39 Identities = 69/111 (62%), Positives = 89/111 (80%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DLK+ +WW G+KR++AE+V+ CL CQ+VKAEHQK AGLLQPLPVPEWKW+H Sbjct: 648 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 707 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 + MDFVTGLPRT G D+IW++VD LTK AHF+ + +Y + ++YVDE Sbjct: 708 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDE 758 >gb|EOY20325.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] Length = 460 Score = 167 bits (423), Expect = 1e-39 Identities = 69/111 (62%), Positives = 89/111 (80%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DLK+ +WW G+KR++AE+V+ CL CQ+VKAEHQK AGLLQPLPVPEWKW+H Sbjct: 234 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 293 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 + MDFVTGLPRT G D+IW++VD LTK AHF+ + +Y + ++YVDE Sbjct: 294 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDE 344 >gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao] Length = 860 Score = 167 bits (423), Expect = 1e-39 Identities = 69/111 (62%), Positives = 89/111 (80%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DLK+ +WW G+KR++AE+V+ CL CQ+VKAEHQK AGLLQPLPVPEWKW+H Sbjct: 482 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 541 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 + MDFVTGLPRT G D+IW++VD LTK AHF+ + +Y + ++YVDE Sbjct: 542 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDE 592 >gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] Length = 521 Score = 167 bits (423), Expect = 1e-39 Identities = 69/111 (62%), Positives = 89/111 (80%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DLK+ +WW G+KR++AE+V+ CL CQ+VKAEHQK AGLLQPLPVPEWKW+H Sbjct: 113 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 172 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 + MDFVTGLPRT G D+IW++VD LTK AHF+ + +Y + ++YVDE Sbjct: 173 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDE 223 >gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 937 Score = 167 bits (423), Expect = 1e-39 Identities = 69/111 (62%), Positives = 89/111 (80%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DLK+ +WW G+KR++AE+V+ CL CQ+VKAEHQK AGLLQPLPVPEWKW+H Sbjct: 501 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 560 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 + MDFVTGLPRT G D+IW++VD LTK AHF+ + +Y + ++YVDE Sbjct: 561 IAMDFVTGLPRTNGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDE 611 >gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1447 Score = 167 bits (423), Expect = 1e-39 Identities = 69/111 (62%), Positives = 89/111 (80%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DLK+ +WW G+KR++AE+V+ CL CQ+VKAEHQK AGLLQPLPVPEWKW+H Sbjct: 1039 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 1098 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 + MDFVTGLPRT G D+IW++VD LTK AHF+ + +Y + ++YVDE Sbjct: 1099 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYVDE 1149 >gb|EOX94089.1| Uncharacterized protein TCM_003206 [Theobroma cacao] Length = 694 Score = 167 bits (422), Expect = 2e-39 Identities = 70/111 (63%), Positives = 91/111 (81%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 H G K Y+ +++ +WW GMKR++AE+VA C+ CQ+VKAEHQ+ AG LQ LPVPEWKW+H Sbjct: 522 HSGSTKMYRTIRENYWWPGMKRDVAEFVAKCVVCQQVKAEHQRPAGTLQSLPVPEWKWEH 581 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 VTMDFV GLPRT+ KDAIWVIVD LTK AHF+A++ +Y ++KL Q+Y+DE Sbjct: 582 VTMDFVLGLPRTQRGKDAIWVIVDRLTKFAHFLAVHSTYSIEKLAQLYIDE 632 >gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 679 Score = 166 bits (421), Expect = 2e-39 Identities = 69/111 (62%), Positives = 88/111 (79%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DLK+ +WW G+KR++AE+V+ CL CQ+VKAEHQK AGLLQPLPVPEWKW+H Sbjct: 271 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEH 330 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 + MDFVTGLPRT G D+IW++VD LTK AHF+ + +Y ++YVDE Sbjct: 331 IAMDFVTGLPRTSGGYDSIWIVVDQLTKSAHFLPVKTTYGAAHYARVYVDE 381 >gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao] Length = 415 Score = 166 bits (419), Expect = 4e-39 Identities = 68/111 (61%), Positives = 88/111 (79%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DLK+ +WW G+KR++AE+V+ CL CQ+VKAEHQK GLLQPLPVPEWKW+H Sbjct: 7 HPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPTGLLQPLPVPEWKWEH 66 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 + MDFVTGLPRT G D+IW++VD LTK AHF+ + +Y + ++YVDE Sbjct: 67 IAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLLVKTTYGAAQYARVYVDE 117 >gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1515 Score = 165 bits (418), Expect = 5e-39 Identities = 68/111 (61%), Positives = 90/111 (81%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+ +++ +WW GMKR++AE++A CL CQ+VKAEHQ+ LQ LPVPEWKW+H Sbjct: 1099 HPGSTKMYRTIRENYWWPGMKRDVAEFIAKCLVCQQVKAEHQRLVDTLQSLPVPEWKWEH 1158 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 VTMDF+ GLPRT+ KDAIWVIVD LTK AHF+A++ +Y ++KL Q+Y+DE Sbjct: 1159 VTMDFILGLPRTQRGKDAIWVIVDRLTKSAHFLAVHSTYSIEKLAQLYIDE 1209 >gb|EOY08653.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1110 Score = 164 bits (416), Expect = 9e-39 Identities = 71/111 (63%), Positives = 91/111 (81%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 H K Y+ +K+ +WW GMKR+IAE+VA CLTCQ++KAEHQK +G LQPLP+PEWKW+H Sbjct: 807 HLESTKMYRTIKESYWWPGMKRDIAEFVAKCLTCQQIKAEHQKLSGTLQPLPIPEWKWEH 866 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 VTMDFV GL RT+ KDAIWVIVD LTK AHF+AI+ +Y ++KL ++Y+DE Sbjct: 867 VTMDFVLGLLRTQSGKDAIWVIVDRLTKSAHFLAIHNTYSIEKLVKLYIDE 917 >emb|CAC44142.1| putative polyprotein [Cicer arietinum] Length = 655 Score = 164 bits (415), Expect = 1e-38 Identities = 66/111 (59%), Positives = 89/111 (80%) Frame = +1 Query: 1 HPGENKKYKDLKQKFWWNGMKREIAEYVAACLTCQRVKAEHQKSAGLLQPLPVPEWKWDH 180 HPG K Y+DL+Q +WW GMK+ +AEYV+ CLTCQ+ K EHQ+ AG+LQPL +PEWKWD Sbjct: 341 HPGATKMYQDLRQNYWWPGMKKHVAEYVSTCLTCQKAKVEHQRPAGMLQPLDIPEWKWDS 400 Query: 181 VTMDFVTGLPRTKGYKDAIWVIVDLLTKVAHFIAINISYPLDKLTQIYVDE 333 ++MDF+TGLP+T+ D+IWVIVD LTK AHF+ + +Y +D+LT+IY+ E Sbjct: 401 ISMDFITGLPKTRRKNDSIWVIVDRLTKSAHFLPVRTTYKVDQLTEIYIAE 451