BLASTX nr result
ID: Catharanthus23_contig00031959
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00031959
         (665 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao]       117   5e-25
gb|EOY10285.1| Uncharacterized protein TCM_025656 [Theobroma cacao]  118   5e-25
gb|EOY08694.1| Uncharacterized protein TCM_023754 [Theobroma cacao]  112   1e-22
gb|EOY04261.1| Uncharacterized protein TCM_019516, partial [Theo...  109   5e-22
ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutr...  109   9e-22
gb|EOY18660.1| Uncharacterized protein TCM_043155 [Theobroma cacao]  108   1e-21
gb|EMJ04485.1| hypothetical protein PRUPE_ppa025662mg, partial [...  107   3e-21
emb|CAN69087.1| hypothetical protein VITISV_031061 [Vitis vinifera]  101   2e-20
gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum ...  104   3e-20
gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum ...  104   3e-20
gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]    100   3e-19
gb|EOX94204.1| Uncharacterized protein TCM_003699 [Theobroma cacao]  100   5e-19
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                100   5e-19
ref|XP_006366953.1| PREDICTED: uncharacterized protein LOC102594...  100   7e-19
gb|EOY08376.1| DNA/RNA polymerases superfamily protein [Theobrom...  100   7e-19
gb|EOX94044.1| DNA/RNA polymerases superfamily protein [Theobrom...  100   7e-19
gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobrom...   99   1e-18
ref|XP_006290417.1| hypothetical protein CARUB_v10019144mg, part...   99   2e-18
gb|AAM94350.1| gag-pol polyprotein [Zea mays]                         99   2e-18
emb|CAN64780.1| hypothetical protein VITISV_043231 [Vitis vinifera]   98   2e-18

>gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao]
          Length = 794

 Score = 117 bits (294), Expect(2) = 5e-25
 Identities = 53/101 (52%), Positives = 71/101 (70%), Gaps = 1/101 (0%)
 Frame = +1

Query: 94  QWKAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSI 273
           Q K N+IIDSGS EN ++ MV KLK + HP PY++ W GN+V V+ RC V FSI
Sbjct: 386 QGKVCNVIIDSGSCENVIANYMVKKLKLQTEVHPHPYKLQWLRKGNEVKVTKRCCVQFSI 445

Query: 274 GS-FKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           G+ +++ ++ DV+PMDACH+L+ RPWQYD+ HD KNTY
Sbjct: 446 GNKYEDEVWCDVIPMDACHLLLGRPWQYDRRAHHDGYKNTY 486

 Score = 23.1 bits (48), Expect(2) = 5e-25
 Identities = 9/16 (56%), Positives = 11/16 (68%)
 Frame = +2

Query: 59  LRKNIVRTRCMASGKL 106
           LR NI TRC + GK+
Sbjct: 374 LRHNIFHTRCTSQGKV 389

>gb|EOY10285.1| Uncharacterized protein TCM_025656 [Theobroma cacao]
          Length = 505

 Score = 118 bits (296), Expect(2) = 5e-25
 Identities = 52/101 (51%), Positives = 72/101 (71%), Gaps = 1/101 (0%)
 Frame = +1

Query: 94  QWKAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSI 273
           Q K N+IIDSGS EN ++ MV+KLK + HP PY++ W GN+V V+ RC V FSI
Sbjct: 235 QGKVCNVIIDSGSCENVIANYMVEKLKLQTEVHPHPYKLQWLRKGNEVKVTKRCCVQFSI 294

Query: 274 GS-FKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           G+ +++ ++ D++PMDACH+L+ RPWQYD+ HD KNTY
Sbjct: 295 GNKYEDEVWCDIIPMDACHLLLGRPWQYDRRAHHDGYKNTY 335

 Score = 22.3 bits (46), Expect(2) = 5e-25
 Identities = 9/16 (56%), Positives = 11/16 (68%)
 Frame = +2

Query: 59  LRKNIVRTRCMASGKL 106
           LR NI TRC + GK+
Sbjct: 223 LRHNIFYTRCTSQGKV 238

>gb|EOY08694.1| Uncharacterized protein TCM_023754 [Theobroma cacao]
          Length = 440

 Score = 112 bits (280), Expect = 1e-22
 Identities = 51/99 (51%), Positives = 69/99 (69%), Gaps = 1/99 (1%)
 Frame = +1

Query: 94  QWKAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSI 273
           Q K N+IIDSGS EN ++ MV+KLK HP PY++ W GN+V V+ RC V FSI
Sbjct: 170 QGKVCNVIIDSGSCENVIANYMVEKLKLPTEVHPHPYKLQWLRKGNEVKVTKRCCVQFSI 229

Query: 274 GS-FKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKN 387
           GS +++ ++ DV+PMDACH+L+ RPWQYD+ +D KN
Sbjct: 230 GSKYEDEVWCDVIPMDACHLLLGRPWQYDRRAHYDGYKN 268

>gb|EOY04261.1| Uncharacterized protein TCM_019516, partial [Theobroma cacao]
          Length = 215

 Score = 109 bits (273), Expect(2) = 5e-22
 Identities = 50/101 (49%), Positives = 69/101 (68%), Gaps = 1/101 (0%)
 Frame = +1

Query: 94 QWKAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSI 273
          Q K N+IIDSGS EN ++ MV+KLK + P PY++ W GN+V V+ C V FSI
Sbjct: 62 QGKVCNVIIDSGSCENVIANYMVEKLKLQTEVLPHPYKLQWLRKGNEVKVTKHCCVQFSI 121

Query: 274 GS-FKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           G+ +++ ++ DV+PMDAC +L+ RPWQYD+ HD KNTY
Sbjct: 122 GNKYEDEVWCDVIPMDACQLLLGRPWQYDRRAHHDGYKNTY 162

 Score = 21.2 bits (43), Expect(2) = 5e-22
 Identities = 8/16 (50%), Positives = 10/16 (62%)
 Frame = +2

Query: 59 LRKNIVRTRCMASGKL 106
          LR NI RC + GK+
Sbjct: 50 LRHNIFHARCTSQGKV 65

>ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum]
 gi|557089351|gb|ESQ30059.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum]
          Length = 382

 Score = 109 bits (272), Expect = 9e-22
 Identities = 51/99 (51%), Positives = 67/99 (67%), Gaps = 1/99 (1%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIGS 279
           K NL+IDSGSS N VS+ V KL K HP PY + W G DV ++ R LV+FSIG+
Sbjct: 222 KLCNLVIDSGSSRNVVSETAVKKLGLKREDHPAPYALAWITEGTDVKITHRALVSFSIGA 281

Query: 280 F-KESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           F K++IY D+ PMD H+++ RPWQ+D+ T H+ KNTY
Sbjct: 282 FYKDTIYCDIAPMDVSHLILGRPWQFDRDTCHNGKKNTY 320

>gb|EOY18660.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
          Length = 625

 Score = 108 bits (271), Expect = 1e-21
 Identities = 49/101 (48%), Positives = 67/101 (66%), Gaps = 1/101 (0%)
 Frame = +1

Query: 94  QWKAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSI 273
           Q N+IIDSGS EN V+ MV+KLK HP PY++ W GN+V V+ RC + F I
Sbjct: 355 QGNVCNVIIDSGSCENVVANYMVEKLKLPTEVHPHPYKLQWLRKGNEVKVTKRCCIQFFI 414

Query: 274 -GSFKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           +++ ++ DV+PMDACH+L+ RPWQYD+ +D KNTY
Sbjct: 415 RNKYEDEVWCDVIPMDACHLLLGRPWQYDRRAHYDGYKNTY 455

>gb|EMJ04485.1| hypothetical protein PRUPE_ppa025662mg, partial [Prunus persica]
          Length = 363

 Score = 107 bits (267), Expect = 3e-21
 Identities = 50/99 (50%), Positives = 69/99 (69%), Gaps = 1/99 (1%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIG- 276
           K N+I+D GSSEN +S+E V+KLK I KH PY+V WF G+DV ++SRCLV F+IG
Sbjct: 125 KMCNVILDGGSSENIISKEAVEKLKLPIEKHANPYKVAWFRKGSDVPITSRCLVKFTIGN 184

Query: 277 SFKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           + ++ + DVV DACHIL+ RPW +D+ + + NTY
Sbjct: 185 TIEDEAWCDVVLTDACHILLGRPWLFDKDMMRSTKANTY 223

>emb|CAN69087.1| hypothetical protein VITISV_031061 [Vitis vinifera]
          Length = 611

 Score = 101 bits (251), Expect(2) = 2e-20
 Identities = 52/72 (72%), Positives = 58/72 (80%), Gaps = 3/72 (4%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIG- 276
           K N IID GSSEN VSQEMVDKLK K+ KHP+PY +LWFN GN+V+VSS+CLVNFSIG
Sbjct: 267 KLCNFIIDGGSSENLVSQEMVDKLKLKMEKHPQPYCILWFNKGNEVLVSSKCLVNFSIGD 326

Query: 277 SFKESIY--YDV 306
           SFKE Y YDV
Sbjct: 327 SFKEQNYSNYDV 338

 Score = 24.3 bits (51), Expect(2) = 2e-20
 Identities = 10/16 (62%), Positives = 11/16 (68%)
 Frame = +2

Query: 59  LRKNIVRTRCMASGKL 106
           L KNI RT C + GKL
Sbjct: 253 LHKNIFRTSCTSGGKL 268

>gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score = 104 bits (259), Expect = 3e-20
 Identities = 45/98 (45%), Positives = 65/98 (66%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIGS 279
           K Y++IID GS N VS +VDKL K PY++ W N+ +V V+ +C+++F++G
Sbjct: 432 KTYSMIIDGGSCANVVSSYLVDKLGIACMKRSTPYRLQWLNDCGEVQVNKQCMISFNVGR 491

Query: 280 FKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           +++ I DVVPM ACH+L+ RPWQYD+ T H KN Y
Sbjct: 492 YEDEILCDVVPMQACHVLLGRPWQYDRDTTHHGRKNRY 529

>gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score = 104 bits (259), Expect = 3e-20
 Identities = 45/98 (45%), Positives = 65/98 (66%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIGS 279
           K Y++IID GS N VS +VDKL K PY++ W N+ +V V+ +C+++F++G
Sbjct: 432 KTYSMIIDGGSCANVVSSYLVDKLGIACMKRSTPYRLQWLNDCGEVQVNKQCMISFNVGR 491

Query: 280 FKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           +++ I DVVPM ACH+L+ RPWQYD+ T H KN Y
Sbjct: 492 YEDEILCDVVPMQACHVLLGRPWQYDRDTTHHGRKNRY 529

>gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1713

 Score = 100 bits (250), Expect = 3e-19
 Identities = 45/100 (45%), Positives = 61/100 (61%)
 Frame = +1

Query: 94  QWKAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSI 273
           Q K +IID GS N S+EMV+KL K+ KHP PY V W NN + ++ R V F I
Sbjct: 474 QDKVVKVIIDGGSCHNLASKEMVEKLGLKLLKHPHPYHVQWLNNSGSIKIAQRVKVPFKI 533

Query: 274 GSFKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           G + +++ DV PM CH+L+ RPWQYD+ ++H N Y
Sbjct: 534 GEYIDTMECDVAPMTVCHMLLGRPWQYDRSSLHCGRTNQY 573

>gb|EOX94204.1| Uncharacterized protein TCM_003699 [Theobroma cacao]
          Length = 258

 Score = 100 bits (248), Expect = 5e-19
 Identities = 49/101 (48%), Positives = 64/101 (63%), Gaps = 1/101 (0%)
 Frame = +1

Query: 94 QWKAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSI 273
          Q K N+II+SGS EN V+ MV+KLK H PY++ W GN+V V C V F I
Sbjct: 90 QGKVCNVIINSGSCENVVANYMVEKLKLPTKVHLHPYKLQWLRKGNEVKVMKHCCVQFYI 149

Query: 274 GS-FKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           G+ +++ I+ DV+PMDACH+ + RP QYD HD KNTY
Sbjct: 150 GNKYQDEIWCDVIPMDACHLFLGRPCQYDCQAHHDGYKNTY 190

>gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]
          Length = 1475

 Score = 100 bits (248), Expect = 5e-19
 Identities = 44/96 (45%), Positives = 62/96 (64%), Gaps = 1/96 (1%)
 Frame = +1

Query: 109 NLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIG-SFK 285
           NLIID GS N S +++KL HP PY++ W N G +V V +CLV FSIG ++
Sbjct: 401 NLIIDGGSCTNVASSTLIEKLSLPTQDHPSPYKLRWLNKGAEVRVDKQCLVTFSIGKNYS 460

Query: 286 ESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           + DV+PMDACH+L+ RPW++D+ ++H NTY
Sbjct: 461 DEALCDVLPMDACHLLLGRPWEFDRDSVHHGRDNTY 496

>ref|XP_006366953.1| PREDICTED: uncharacterized protein LOC102594328 [Solanum tuberosum]
          Length = 1191

 Score = 99.8 bits (247), Expect = 7e-19
 Identities = 42/92 (45%), Positives = 62/92 (67%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIGS 279
           K Y++IID GS N VS +VDKL K P PY++ W N+ +V V+ +C+++F++G
Sbjct: 458 KTYSMIIDGGSCANVVSSYLVDKLGIACMKRPTPYRLQWLNDCGEVKVNKQCMISFNVGR 517

Query: 280 FKESIYYDVVPMDACHILIERPWQYDQGTIHD 375
           +++ I DVVPM ACH+L+ RPWQYD+ D
Sbjct: 518 YEDEILCDVVPMQACHVLLGRPWQYDRQNFED 549

>gb|EOY08376.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 558

 Score = 99.8 bits (247), Expect = 7e-19
 Identities = 46/99 (46%), Positives = 65/99 (65%), Gaps = 1/99 (1%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIG- 276
           K +L+ID GS EN +S+E V+KLK KHP PY++ W G++V V+++CLV F++G
Sbjct: 340 KVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMGD 399

Query: 277 SFKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           + + DVVPMD HIL+ RPW YD +H + NTY
Sbjct: 400 NLDDEALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTY 438

>gb|EOX94044.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 546

 Score = 99.8 bits (247), Expect = 7e-19
 Identities = 46/99 (46%), Positives = 65/99 (65%), Gaps = 1/99 (1%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIG- 276
           K +L+ID GS EN +S+E V+KLK KHP PY++ W G++V V+++CLV F++G
Sbjct: 344 KVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMGN 403

Query: 277 SFKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           + + DVVPMD HIL+ RPW YD +H + NTY
Sbjct: 404 NLDDEALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTY 442

>gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1452

 Score = 99.0 bits (245), Expect = 1e-18
 Identities = 46/99 (46%), Positives = 65/99 (65%), Gaps = 1/99 (1%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIG- 276
           K +L+ID GS EN +S+E V+KLK KHP PY++ W G++V V+++CLV F++G
Sbjct: 335 KVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMGD 394

Query: 277 SFKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           + + DVVPMD HIL+ RPW YD +H + NTY
Sbjct: 395 NSDDEALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTY 433

>ref|XP_006290417.1| hypothetical protein CARUB_v10019144mg, partial [Capsella rubella]
 gi|482559124|gb|EOA23315.1| hypothetical protein CARUB_v10019144mg, partial [Capsella rubella]
          Length = 110

 Score = 98.6 bits (244), Expect = 2e-18
 Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 1/98 (1%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIG- 276
           K +IIDSGS N +S+E V KL HP PY + W N+ D +S RC V F IG
Sbjct: 8   KVCRMIIDSGSFTNVISEEAVSKLALFTESHPTPYCLAWLNSSTDSRLSKRCRVPFLIGA 67

Query: 277 SFKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNT 390
           ++K+ + D++PMDACH+L+ RPWQYD+ +HD NT
Sbjct: 68  NYKDLVICDILPMDACHLLLGRPWQYDRRIMHDGFANT 105

>gb|AAM94350.1| gag-pol polyprotein [Zea mays]
          Length = 1618

 Score = 98.6 bits (244), Expect = 2e-18
 Identities = 44/98 (44%), Positives = 60/98 (61%)
 Frame = +1

Query: 100 KAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRCLVNFSIGS 279
           ++ LIID GS N S +MV+KL HP PY + W NN V V+ +NF+IGS
Sbjct: 492 RSCRLIIDGGSCNNLASSDMVEKLALTTKPHPHPYHIQWLNNSGKVKVTKLVRINFAIGS 551

Query: 280 FKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           +++ + DVVPMDAC+IL+ RPWQ+D +H N Y
Sbjct: 552 YRDVVDCDVVPMDACNILLGRPWQFDSDCMHHGRSNQY 589

>emb|CAN64780.1| hypothetical protein VITISV_043231 [Vitis vinifera]
          Length = 898

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 51/107 (47%), Positives = 70/107 (65%), Gaps = 1/107 (0%)
 Frame = +1

Query: 76  SYSLHGQWKAYNLIIDSGSSENFVSQEMVDKLKFKI*KHPRPYQVLWFNNGNDVIVSSRC 255
           S + HG K N+IIDSGS N V++EMV KL I +PY++ F GN + V+ R
Sbjct: 482 SCTSHG--KVCNVIIDSGSCTNVVAKEMVTKLNLTIKPLLQPYKIQLFQRGNGLKVTKRF 539

Query: 256 LVNFSIG-SFKESIYYDVVPMDACHILIERPWQYDQGTIHDSVKNTY 393
           LV+FSIG ++K+ ++ D + MDACH+L+ RPWQYDQ H KNT+
Sbjct: 540 LVSFSIGKNYKDEVWCDAMSMDACHLLLRRPWQYDQNVSHYRFKNTH 586