BLASTX nr result
ID: Cephaelis21_contig00031902
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00031902
         (555 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]               95   4e-33
ref|XP_003520298.1| PREDICTED: uncharacterized protein LOC100797...    84   4e-32
ref|XP_003530517.1| PREDICTED: uncharacterized protein LOC100800...    87   5e-32
ref|XP_003522113.1| PREDICTED: uncharacterized protein LOC100796...    84   5e-32
gb|AAD17351.1| contains similarity to retrovirus-related polypro...    96   6e-32

>gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]
          Length = 1887

 Score = 94.7 bits (234), Expect(2) = 4e-33
 Identities = 46/73 (63%), Positives = 55/73 (75%)
 Frame = +1

Query: 214  NSVGIGAVLKQEEKSVAYFSKMLGRTTLNYPTYDKELY*AVRAVEM*QHYLMLKEFVIQI 393
            + VGIG VL Q++K +AYFS+ LG  TLNYPTYDKELY  VRA++  QHYL  KEFVI
Sbjct: 1213 SGVGIGVVLMQDKKPIAYFSEKLGGATLNYPTYDKELYALVRALQTGQHYLWPKEFVIHT 1272

Query: 394  GHESLKHLRSQDE 432
             HESLKHL+ Q +
Sbjct: 1273 DHESLKHLKGQQK 1285

 Score = 72.0 bits (175), Expect(2) = 4e-33
 Identities = 31/58 (53%), Positives = 44/58 (75%)
 Frame = +3

Query: 45   KVTAINDCPTSTNITEVRSFYGLTSFYRRFVKDFNTKATLLIQIVKKKIGFRWGEAQQ 218
            KV AI + P+  ++ EVRSF+GL  FYRRFVKDF+T A  L +++KK +GF+W +AQ+
Sbjct: 1124 KVKAIREWPSPKSVGEVRSFHGLAGFYRRFVKDFSTLAAPLTEVIKKNVGFKWEQAQE 1181

>ref|XP_003520298.1| PREDICTED: uncharacterized protein LOC100797944 [Glycine max]
          Length = 893

 Score = 84.3 bits (207), Expect(2) = 4e-32
 Identities = 43/71 (60%), Positives = 52/71 (73%)
 Frame = +1

Query: 214 NSVGIGAVLKQEEKSVAYFSKMLGRTTLNYPTYDKELY*AVRAVEM*QHYLMLKEFVIQI 393
           ++VGIGAVL QE   +AYFS+ L   TLNY TYDKELY  VRA++  QHYL  KEFVI
Sbjct: 601 SNVGIGAVLMQEGHPIAYFSEKLSGPTLNYSTYDKELYALVRALKTWQHYLYPKEFVIHS 660

Query: 394 GHESLKHLRSQ 426
            HESLK+++ Q
Sbjct: 661 DHESLKYIKGQ 671

 Score = 79.3 bits (194), Expect(2) = 4e-32
 Identities = 35/61 (57%), Positives = 47/61 (77%)
 Frame = +3

Query: 45  KVTAINDCPTSTNITEVRSFYGLTSFYRRFVKDFNTKATLLIQIVKKKIGFRWGEAQQRR 224
           KV AI + PT  ++TEVRSF+GL SFYRRFVKDF+T A  L +++KK +GF+WGE Q+
Sbjct: 512 KVRAIQEWPTPKSVTEVRSFHGLASFYRRFVKDFSTLAAPLNEVLKKNVGFKWGEKQEEA 571

Query: 225 Y 227
           +
Sbjct: 572 F 572

>ref|XP_003530517.1| PREDICTED: uncharacterized protein LOC100800881 [Glycine max]
          Length = 1746

 Score = 87.4 bits (215), Expect(2) = 5e-32
 Identities = 44/71 (61%), Positives = 53/71 (74%)
 Frame = +1

Query: 214  NSVGIGAVLKQEEKSVAYFSKMLGRTTLNYPTYDKELY*AVRAVEM*QHYLMLKEFVIQI 393
            ++VGIGAVL QE   +AYFS+ LG   LNY TYDKELY  VRA++  QHYL+ KEFVI
Sbjct: 950  SNVGIGAVLLQEGHPIAYFSEKLGAAALNYSTYDKELYALVRALQTWQHYLLPKEFVIHS 1009

Query: 394  GHESLKHLRSQ 426
             HESLK+L+ Q
Sbjct: 1010 DHESLKYLKGQ 1020

 Score = 75.9 bits (185), Expect(2) = 5e-32
 Identities = 33/61 (54%), Positives = 45/61 (73%)
 Frame = +3

Query: 45  KVTAINDCPTSTNITEVRSFYGLTSFYRRFVKDFNTKATLLIQIVKKKIGFRWGEAQQRR 224
           KV AI + PT   ++EVR F+GL SFYRRFVKDF+T A  L ++VKK +GF+WG+ Q+
Sbjct: 861 KVKAIQEWPTPKTLSEVRGFHGLASFYRRFVKDFSTLAAPLTEVVKKNVGFKWGKKQEEA 920

Query: 225 Y 227
           +
Sbjct: 921 F 921

>ref|XP_003522113.1| PREDICTED: uncharacterized protein LOC100796705 [Glycine max]
          Length = 1010

 Score = 84.0 bits (206), Expect(2) = 5e-32
 Identities = 42/73 (57%), Positives = 53/73 (72%)
 Frame = +1

Query: 214 NSVGIGAVLKQEEKSVAYFSKMLGRTTLNYPTYDKELY*AVRAVEM*QHYLMLKEFVIQI 393
           ++VGIGAVL QE   +AYFS+ L   TLNY TYDKE Y  V+A++  QHYL  KEFVI
Sbjct: 910 SNVGIGAVLMQEGHPIAYFSEKLSGPTLNYSTYDKEFYALVQALKTWQHYLYPKEFVIHS 969

Query: 394 GHESLKHLRSQDE 432
            HESLK+++ QD+
Sbjct: 970 DHESLKYIKGQDK 982

 Score = 79.3 bits (194), Expect(2) = 5e-32
 Identities = 35/61 (57%), Positives = 47/61 (77%)
 Frame = +3

Query: 45  KVTAINDCPTSTNITEVRSFYGLTSFYRRFVKDFNTKATLLIQIVKKKIGFRWGEAQQRR 224
           KV AI + PT  ++TEVRSF+GL SFYRRFVKDF+T A  L +++KK +GF+WGE Q+
Sbjct: 821 KVRAIQEWPTPKSVTEVRSFHGLASFYRRFVKDFSTLAAPLNEVLKKNVGFKWGEKQEEA 880

Query: 225 Y 227
           +
Sbjct: 881 F 881

>gb|AAD17351.1| contains similarity to retrovirus-related polyproteins and to
          CCHC zinc finger protein (Pfam: PF00098, Score=16.3, E=0.051, E= 1)
          [Arabidopsis thaliana]
 gi|7267432|emb|CAB77944.1| putative polyprotein [Arabidopsis thaliana]
          Length = 1138

 Score = 95.9 bits (237), Expect(2) = 6e-32
 Identities = 47/71 (66%), Positives = 55/71 (77%)
 Frame = +1

Query: 220 VGIGAVLKQEEKSVAYFSKMLGRTTLNYPTYDKELY*AVRAVEM*QHYLMLKEFVIQIGH 399
           VGIGAVL Q++K +AYFS+ LG  TLNYPTYDKELY  VRA++  QHYL  KEFVI   H
Sbjct: 634 VGIGAVLMQDKKPIAYFSEKLGGATLNYPTYDKELYALVRALQTWQHYLWPKEFVIHTDH 693

Query: 400 ESLKHLRSQDE 432
           ESLKHL+ Q +
Sbjct: 694 ESLKHLKGQQK 704

 Score = 67.0 bits (162), Expect(2) = 6e-32
 Identities = 28/58 (48%), Positives = 43/58 (74%)
 Frame = +3

Query: 45  KVTAINDCPTSTNITEVRSFYGLTSFYRRFVKDFNTKATLLIQIVKKKIGFRWGEAQQ 218
           KV AI + P+  ++ +VRSF+GL  FYRRFV+DF+T A  L +++KK +GF+W +A +
Sbjct: 543 KVKAIREWPSPKSVGKVRSFHGLAGFYRRFVRDFSTLAAPLTEVIKKNVGFKWEQAPE 600