BLASTX nr result
ID: Catharanthus22_contig00022865
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00022865 (1561 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum ... 67 3e-12 gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum ... 67 3e-12 ref|XP_006366953.1| PREDICTED: uncharacterized protein LOC102594... 62 1e-10 ref|XP_006368019.1| PREDICTED: uncharacterized protein LOC102593... 62 2e-10 gb|AAD17351.1| contains similarity to retrovirus-related polypro... 51 9e-08 ref|XP_006490838.1| PREDICTED: uncharacterized protein LOC102624... 50 1e-06 gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana] 47 2e-06 ref|XP_006575965.1| PREDICTED: uncharacterized protein LOC102669... 48 4e-06 >gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1588 Score = 66.6 bits (161), Expect(2) = 3e-12 Identities = 44/129 (34%), Positives = 70/129 (54%), Gaps = 8/129 (6%) Frame = -3 Query: 371 CANLVSSYAIEK*GIRYIKHLKPKELQWMSECGEMKVNDQANLFISIRGYDDEVL*HCAH 192 CAN+VSSY ++K GI +K P LQW+++CGE++VN Q + ++ Y+DE+L C Sbjct: 443 CANVVSSYLVDKLGIACMKRSTPYRLQWLNDCGEVQVNKQCMISFNVGRYEDEIL--CDV 500 Query: 191 AYDSYC---VG*ASTI***RDA*DSH----NTYSCVEL-EENLLPPISPTKVFEDLNTIK 36 C +G D +H N YS + ++ L P+SP++VFED ++ Sbjct: 501 VPMQACHVLLGRPWQY----DRDTTHHGRKNRYSLLHNGKKYTLAPLSPSQVFEDQKRLR 556 Query: 35 ENMALERRE 9 E M ++ E Sbjct: 557 ETMGKQKGE 565 Score = 33.1 bits (74), Expect(2) = 3e-12 Identities = 16/32 (50%), Positives = 20/32 (62%) Frame = -1 Query: 469 DLGEGASQQRENFIYTPHRIIGKLYSLVIDGG 374 +LG QREN +T I GK YS++IDGG Sbjct: 410 NLGSVDEGQRENLFHTRCGIKGKTYSMIIDGG 441 >gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1588 Score = 66.6 bits (161), Expect(2) = 3e-12 Identities = 44/129 (34%), Positives = 70/129 (54%), Gaps = 8/129 (6%) Frame = -3 Query: 371 CANLVSSYAIEK*GIRYIKHLKPKELQWMSECGEMKVNDQANLFISIRGYDDEVL*HCAH 192 CAN+VSSY ++K GI +K P LQW+++CGE++VN Q + ++ Y+DE+L C Sbjct: 443 CANVVSSYLVDKLGIACMKRSTPYRLQWLNDCGEVQVNKQCMISFNVGRYEDEIL--CDV 500 Query: 191 AYDSYC---VG*ASTI***RDA*DSH----NTYSCVEL-EENLLPPISPTKVFEDLNTIK 36 C +G D +H N YS + ++ L P+SP++VFED ++ Sbjct: 501 VPMQACHVLLGRPWQY----DRDTTHHGRKNRYSLLHNGKKYTLAPLSPSQVFEDQKRLR 556 Query: 35 ENMALERRE 9 E M ++ E Sbjct: 557 ETMGKQKGE 565 Score = 33.1 bits (74), Expect(2) = 3e-12 Identities = 16/32 (50%), Positives = 20/32 (62%) Frame = -1 Query: 469 DLGEGASQQRENFIYTPHRIIGKLYSLVIDGG 374 +LG QREN +T I GK YS++IDGG Sbjct: 410 NLGSVDEGQRENLFHTRCGIKGKTYSMIIDGG 441 >ref|XP_006366953.1| PREDICTED: uncharacterized protein LOC102594328 [Solanum tuberosum] Length = 1191 Score = 61.6 bits (148), Expect(2) = 1e-10 Identities = 26/55 (47%), Positives = 39/55 (70%) Frame = -3 Query: 371 CANLVSSYAIEK*GIRYIKHLKPKELQWMSECGEMKVNDQANLFISIRGYDDEVL 207 CAN+VSSY ++K GI +K P LQW+++CGE+KVN Q + ++ Y+DE+L Sbjct: 469 CANVVSSYLVDKLGIACMKRPTPYRLQWLNDCGEVKVNKQCMISFNVGRYEDEIL 523 Score = 33.1 bits (74), Expect(2) = 1e-10 Identities = 16/32 (50%), Positives = 20/32 (62%) Frame = -1 Query: 469 DLGEGASQQRENFIYTPHRIIGKLYSLVIDGG 374 +LG QREN +T I GK YS++IDGG Sbjct: 436 NLGSVDEGQRENLFHTRCGIKGKTYSMIIDGG 467 >ref|XP_006368019.1| PREDICTED: uncharacterized protein LOC102593574 [Solanum tuberosum] Length = 385 Score = 62.0 bits (149), Expect(2) = 2e-10 Identities = 26/55 (47%), Positives = 39/55 (70%) Frame = -3 Query: 371 CANLVSSYAIEK*GIRYIKHLKPKELQWMSECGEMKVNDQANLFISIRGYDDEVL 207 CAN+VSSY ++K GI +K P LQW+++CGE+KVN Q + ++ Y+DE+L Sbjct: 325 CANVVSSYLVDKLGIACMKRSTPYRLQWLNDCGEVKVNKQCMISFNVGRYEDEIL 379 Score = 32.0 bits (71), Expect(2) = 2e-10 Identities = 16/32 (50%), Positives = 20/32 (62%) Frame = -1 Query: 469 DLGEGASQQRENFIYTPHRIIGKLYSLVIDGG 374 +LG QREN +T I GK YS++IDGG Sbjct: 292 NLGIVDEGQRENLFHTRCGIKGKTYSMIIDGG 323 >gb|AAD17351.1| contains similarity to retrovirus-related polyproteins and to CCHC zinc finger protein (Pfam: PF00098, Score=16.3, E=0.051, E= 1) [Arabidopsis thaliana] gi|7267432|emb|CAB77944.1| putative polyprotein [Arabidopsis thaliana] Length = 1138 Score = 50.8 bits (120), Expect(2) = 9e-08 Identities = 22/55 (40%), Positives = 34/55 (61%) Frame = -3 Query: 371 CANLVSSYAIEK*GIRYIKHLKPKELQWMSECGEMKVNDQANLFISIRGYDDEVL 207 C N+ S ++K G+ H KP +LQW++E GEM V Q + ++I Y+DE+L Sbjct: 370 CTNVASETMVQKLGLEEFPHPKPYKLQWLNESGEMAVTRQVQVPLAIGKYEDEIL 424 Score = 33.9 bits (76), Expect(2) = 9e-08 Identities = 19/46 (41%), Positives = 29/46 (63%) Frame = -1 Query: 511 GKMLMARQAISSISDLGEGASQQRENFIYTPHRIIGKLYSLVIDGG 374 G++L+ + +S ++ E A QREN +T I GK+ SL+IDGG Sbjct: 325 GELLVTMRVLSVLNKAEEQA--QRENLFHTRCLIKGKVCSLIIDGG 368 >ref|XP_006490838.1| PREDICTED: uncharacterized protein LOC102624837 [Citrus sinensis] Length = 177 Score = 49.7 bits (117), Expect(2) = 1e-06 Identities = 22/50 (44%), Positives = 34/50 (68%) Frame = -3 Query: 356 SSYAIEK*GIRYIKHLKPKELQWMSECGEMKVNDQANLFISIRGYDDEVL 207 S+ +EK ++ +KH +P +LQW+++CGE+KVN Q + I Y DEVL Sbjct: 29 STSLVEKLNLKPLKHPRPYKLQWLNDCGEVKVNKQVPVSFYIGRYKDEVL 78 Score = 31.6 bits (70), Expect(2) = 1e-06 Identities = 12/25 (48%), Positives = 17/25 (68%) Frame = -1 Query: 208 CDIVHMHMTRIVWGKPQLFDSNVMH 134 CD+V MH RI++G+P +D V H Sbjct: 79 CDVVPMHAGRILFGQPWQYDRRVTH 103 >gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana] Length = 1887 Score = 47.0 bits (110), Expect(2) = 2e-06 Identities = 20/47 (42%), Positives = 32/47 (68%) Frame = -3 Query: 371 CANLVSSYAIEK*GIRYIKHLKPKELQWMSECGEMKVNDQANLFISI 231 C N+ S +EK G++ +KH +P +LQW++E GEM V+ Q + +SI Sbjct: 754 CTNVASETMVEKLGLKVMKHPRPYKLQWLNEDGEMSVDRQVKVPLSI 800 Score = 33.1 bits (74), Expect(2) = 2e-06 Identities = 17/48 (35%), Positives = 30/48 (62%) Frame = -1 Query: 517 NNGKMLMARQAISSISDLGEGASQQRENFIYTPHRIIGKLYSLVIDGG 374 + G++L+ +A+S I+ E +QREN ++ + K+ SL+IDGG Sbjct: 707 SKGELLVTMKALSVIAKTDE--QEQRENLFHSSCMVNDKVCSLIIDGG 752 >ref|XP_006575965.1| PREDICTED: uncharacterized protein LOC102669237, partial [Glycine max] Length = 1520 Score = 48.1 bits (113), Expect(2) = 4e-06 Identities = 24/54 (44%), Positives = 31/54 (57%) Frame = -3 Query: 371 CANLVSSYAIEK*GIRYIKHLKPKELQWMSECGEMKVNDQANLFISIRGYDDEV 210 C N S+ + K + I H KP +LQW++E GEM VN Q + SI Y DEV Sbjct: 849 CCNCCSTRLVSKLNLTIIPHPKPYKLQWLNEQGEMIVNQQVKVPFSIGTYKDEV 902 Score = 30.8 bits (68), Expect(2) = 4e-06 Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 1/47 (2%) Frame = -1 Query: 511 GKMLMARQAISSIS-DLGEGASQQRENFIYTPHRIIGKLYSLVIDGG 374 G +LM R+ + S DL + QREN +T +I+ K SL++D G Sbjct: 804 GDLLMVRRLLGGQSCDLSQS---QRENIFHTRCKILDKTCSLIVDSG 847