BLASTX nr result
ID: Astragalus22_contig00009188
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00009188
         (438 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|KYP42564.1| Retrovirus-related Pol polyprotein from transposo...    78   1e-17
ref|XP_021646157.1| uncharacterized protein LOC110639482 [Hevea ...    77   3e-16
gb|KHN17162.1| hypothetical protein glysoja_010621, partial [Gly...    72   2e-15
gb|PNX77860.1| retrovirus-related Pol polyprotein from transposo...    82   2e-15
ref|XP_014617557.1| PREDICTED: uncharacterized protein LOC102665...    72   6e-15
gb|KYP35968.1| hypothetical protein KK1_042948, partial [Cajanus...    75   7e-15
dbj|GAU50539.1| hypothetical protein TSUD_409840 [Trifolium subt...    64   1e-14
dbj|GAU17048.1| hypothetical protein TSUD_105470 [Trifolium subt...    68   1e-14
gb|KYP72603.1| hypothetical protein KK1_005199 [Cajanus cajan]         79   2e-14
ref|XP_017426279.1| PREDICTED: uncharacterized protein LOC108334...    77   2e-14
gb|KYP69419.1| Copia protein, partial [Cajanus cajan]                  76   2e-14
dbj|GAU38852.1| hypothetical protein TSUD_154140 [Trifolium subt...    63   3e-14
ref|XP_019450659.1| PREDICTED: uncharacterized protein LOC109352...    79   3e-14
ref|XP_019430924.1| PREDICTED: uncharacterized protein LOC109338...    79   3e-14
ref|XP_021603009.1| uncharacterized protein LOC110608109 [Maniho...    79   4e-14
gb|AER13167.1| putative retrovirus-like polyprotein [Phaseolus v...    78   6e-14
gb|KYP77253.1| hypothetical protein KK1_044385, partial [Cajanus...
73 6e-14 gb|KYP41411.1| hypothetical protein KK1_037205, partial [Cajanus... 73 6e-14 ref|XP_019423079.1| PREDICTED: uncharacterized protein LOC109332... 78 7e-14 ref|XP_021621503.1| uncharacterized protein LOC110621536 [Maniho... 77 1e-13 >gb|KYP42564.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1427 Score = 77.8 bits (190), Expect(2) = 1e-17 Identities = 33/67 (49%), Positives = 48/67 (71%) Frame = +3 Query: 231 LIHTIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHN 410 L HT WI+DTGATDH+CN+L F YH I PV +++ NG + A+++GTI+ + L++ Sbjct: 345 LEHTDWILDTGATDHVCNSLYFFTKYHPIDPVHVKLPNGNTSTAQFSGTIIFSEKFFLND 404 Query: 411 VLYVPEF 431 VLY+P F Sbjct: 405 VLYIPNF 411 Score = 39.3 bits (90), Expect(2) = 1e-17 Identities = 30/91 (32%), Positives = 50/91 (54%), Gaps = 13/91 (14%) Frame = +1 Query: 1 PGFKFRDKEDST--NSASTEGNNTQDKNAAPQQN-ESSVSITPEVYQKLMALLNK----S 159 P FKF+DK ++T N+ S++ +TQ + +QN ES+ + T E Y L+ +L + S Sbjct: 255 PNFKFKDKGNTTSINTISSKAPSTQASEISRKQNKESTSNFTHEDYDHLIDMLKRAKLQS 314 Query: 160 GDGGIGQV------HNVIASNITQDSSGNLL 234 + I Q+ +V +SN+ Q+ GN L Sbjct: 315 PEHSINQLVHQTTTESVSSSNLQQNQPGNPL 345 >ref|XP_021646157.1| uncharacterized protein LOC110639482 [Hevea brasiliensis] Length = 547 Score = 76.6 bits (187), Expect(2) = 3e-16 Identities = 33/65 (50%), Positives = 47/65 (72%) Frame = +3 Query: 243 IWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLYV 422 +WI+DTGATDHIC +L F Y +I+P+ +++ NGT A Y+GTI D +LH+VLY+ Sbjct: 297 MWILDTGATDHICFSLSSFTLYKKINPIFMKLLNGTQLVANYSGTIHFTKDFLLHDVLYI 356 Query: 423 PEFGF 437 P+F F Sbjct: 357 PQFTF 361 Score = 35.8 bits (81), Expect(2) = 3e-16 Identities = 26/83 (31%), Positives = 39/83 (46%), Gaps = 5/83 (6%) Frame = +1 Query: 1 PGFKFRDKEDSTNSASTEGNNTQDKNAAPQQNES----SVSITPEVYQKLMALLNKSGDG 168 PG+KF++ STN + E + + + P Q S SV T E Q+L+AL+ + Sbjct: 205 PGYKFKNSGSSTNQVTVEDQPSGNTDNTPIQVNSPSIQSVPFTQEQIQQLLALIQRPNLT 264 Query: 169 GIGQVHNVIASN-ITQDSSGNLL 234 I + V N + S GN L Sbjct: 265 
TIHASNQVANGNTLASSSPGNSL 287 >gb|KHN17162.1| hypothetical protein glysoja_010621, partial [Glycine soja] Length = 529 Score = 72.4 bits (176), Expect(2) = 2e-15 Identities = 30/66 (45%), Positives = 48/66 (72%) Frame = +3 Query: 240 TIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLY 419 T WI+D+GATDH+ ++L F+SYH+I+P+++++ NG YA + GT+ + + L +VLY Sbjct: 388 TSWILDSGATDHVSSSLTNFHSYHQINPITVKLPNGHLVYATHLGTVQLSTFITLIDVLY 447 Query: 420 VPEFGF 437 VP F F Sbjct: 448 VPSFTF 453 Score = 37.4 bits (85), Expect(2) = 2e-15 Identities = 18/67 (26%), Positives = 31/67 (46%) Frame = +1 Query: 1 PGFKFRDKEDSTNSASTEGNNTQDKNAAPQQNESSVSITPEVYQKLMALLNKSGDGGIGQ 180 PGFKF + + N+ D PQ+++ V +PE Y+ L+AL+ + Sbjct: 300 PGFKFNNGKTIANNVVAVEEKATDDQILPQESQELVRFSPEQYKALLALIQQPSAENSAS 359 Query: 181 VHNVIAS 201 + +AS Sbjct: 360 IKPQVAS 366 >gb|PNX77860.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 581 Score = 82.0 bits (201), Expect = 2e-15 Identities = 38/90 (42%), Positives = 56/90 (62%), Gaps = 5/90 (5%) Frame = +3 Query: 177 TSAQCDSIQHNTRFL-----R*SLIHTIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMS 341 TS +IQH+ + S+ H WI+D+GA+DHIC+N+ LF YH I P+ ++M Sbjct: 368 TSGTAGNIQHSNVYSCITCSLSSIAHNSWIIDSGASDHICSNIALFDDYHAITPIQVKMP 427 Query: 342 NGTHTYAKYTGTIMCNFDLILHNVLYVPEF 431 NG+ YAK G++ + + +HNVL VPEF Sbjct: 428 NGSVAYAKIAGSVKLSENFSIHNVLLVPEF 457 >ref|XP_014617557.1| PREDICTED: uncharacterized protein LOC102665992 [Glycine max] Length = 898 Score = 72.0 bits (175), Expect(2) = 6e-15 Identities = 29/66 (43%), Positives = 49/66 (74%) Frame = +3 Query: 240 TIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLY 419 T WI+D+GATDH+ ++L F+SYH+I+P+++R+ NG +A ++GT+ + + L +VLY Sbjct: 396 TSWILDSGATDHVSSSLTNFHSYHQINPITVRLPNGHLVHATHSGTVQLSAFITLIDVLY 455 Query: 420 VPEFGF 437 +P F F Sbjct: 456 IPSFTF 461 Score = 36.2 bits (82), Expect(2) = 6e-15 Identities = 26/83 (31%), Positives = 41/83 (49%), Gaps = 1/83 (1%) Frame = +1 Query: 1 
PGFKFRD-KEDSTNSASTEGNNTQDKNAAPQQNESSVSITPEVYQKLMALLNKSGDGGIG 177 PGFKF + K + N + EG T D+ Q+++ V +PE Y+ L+AL+ + G Sbjct: 308 PGFKFNNGKVIANNVVAVEGKATDDQ-IQRQESQELVRFSPEQYKALLALIQQPYAGNSA 366 Query: 178 QVHNVIASNITQDSSGNLLYTLS 246 +AS + S+ TLS Sbjct: 367 STKPQVASISSCSSNDATGITLS 389 >gb|KYP35968.1| hypothetical protein KK1_042948, partial [Cajanus cajan] Length = 98 Score = 75.1 bits (183), Expect = 7e-15 Identities = 30/63 (47%), Positives = 47/63 (74%) Frame = +3 Query: 249 IVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLYVPE 428 I+D+GATDHIC++L F SYH+I+P+ +++ NG A Y+G++ N + ++HNVLY+P Sbjct: 1 ILDSGATDHICSSLTHFTSYHQINPICVKLPNGNQVTANYSGSVFLNQNHVIHNVLYIPC 60 Query: 429 FGF 437 F F Sbjct: 61 FTF 63 >dbj|GAU50539.1| hypothetical protein TSUD_409840 [Trifolium subterraneum] Length = 1245 Score = 63.5 bits (153), Expect(2) = 1e-14 Identities = 25/64 (39%), Positives = 41/64 (64%) Frame = +3 Query: 246 WIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLYVP 425 WI+D+GATDH+C +L LF Y +++P+ +++ NG+ G I+ + L +VLY+P Sbjct: 387 WIIDSGATDHVCASLSLFTEYRKVNPIPVKLPNGSIVTTDIIGNIIITPTITLKHVLYMP 446 Query: 426 EFGF 437 F F Sbjct: 447 HFSF 450 Score = 43.5 bits (101), Expect(2) = 1e-14 Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 8/85 (9%) Frame = +1 Query: 1 PGFKFRDKE--DSTNSASTEGNNTQ-DKNAAPQQNESSVSITPEVYQKLMALL--NKSGD 165 PG++F+D S N + N D N ++ ++ + E YQ LMALL NK Sbjct: 296 PGYRFKDGTVVGSKNQGQSSANCVNADDNVEQSSVDTRMTFSAEDYQALMALLKNNKPAG 355 Query: 166 GGIGQVHNV---IASNITQDSSGNL 231 G QV+NV IAS+ T D GN+ Sbjct: 356 EGSSQVNNVSKFIASSFTNDKQGNV 380 >dbj|GAU17048.1| hypothetical protein TSUD_105470 [Trifolium subterraneum] Length = 769 Score = 67.8 bits (164), Expect(2) = 1e-14 Identities = 27/64 (42%), Positives = 41/64 (64%) Frame = +3 Query: 246 WIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLYVP 425 WI+D+GATDH+C +L LF +Y ++HP+ +++ NG G I+ + L NVLY+P Sbjct: 386 WILDSGATDHVCASLSLFTAYKKVHPIPVKLPNGNIVTTDIIGDILVTPSITLRNVLYMP 445 Query: 426 EFGF 437 F F Sbjct: 446 
HFSF 449 Score = 39.3 bits (90), Expect(2) = 1e-14 Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 9/86 (10%) Frame = +1 Query: 1 PGFKFRDKE---DSTNSASTEGNNTQ-DKNAAPQQNESSVSITPEVYQKLMALLNKSGDG 168 PG++F+D S N + N + N A ++ ++ + E YQ LMALL + + Sbjct: 294 PGYRFKDGTVVGGSKNQGYSSANCIDAEDNEAQSSVDTRMTFSAEDYQALMALLKSTKNA 353 Query: 169 GIG--QVHNV---IASNITQDSSGNL 231 G G QV+NV IAS+ + D GN+ Sbjct: 354 GEGTSQVNNVTKVIASSYSNDKQGNV 379 >gb|KYP72603.1| hypothetical protein KK1_005199 [Cajanus cajan] Length = 375 Score = 79.0 bits (193), Expect = 2e-14 Identities = 35/89 (39%), Positives = 50/89 (56%) Frame = +3 Query: 171 NRTSAQCDSIQHNTRFLR*SLIHTIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGT 350 N+ SA H+ + S + W++D+GATDHIC L F SYH+I P+ + + NG Sbjct: 230 NQVSATLGPFDHSGQIGNTSSSFSSWVLDSGATDHICYCLSHFTSYHKIKPIHVTLPNGN 289 Query: 351 HTYAKYTGTIMCNFDLILHNVLYVPEFGF 437 A Y+G + D +LHN LY+P F F Sbjct: 290 KVVANYSGNVFLTLDHVLHNFLYIPSFSF 318 >ref|XP_017426279.1| PREDICTED: uncharacterized protein LOC108334859 [Vigna angularis] Length = 569 Score = 77.0 bits (188), Expect(2) = 2e-14 Identities = 31/67 (46%), Positives = 48/67 (71%) Frame = +3 Query: 237 HTIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVL 416 H +WI+D+GA+DH+ ++L L+ SY I P+++++ NG T A Y+GTI N L + NVL Sbjct: 413 HVMWIIDSGASDHVSSSLNLYSSYKAIDPITVKLPNGQQTIASYSGTIKINDSLSISNVL 472 Query: 417 YVPEFGF 437 Y+P+F F Sbjct: 473 YLPQFNF 479 Score = 29.3 bits (64), Expect(2) = 2e-14 Identities = 17/52 (32%), Positives = 24/52 (46%) Frame = +1 Query: 1 PGFKFRDKEDSTNSASTEGNNTQDKNAAPQQNESSVSITPEVYQKLMALLNK 156 PG K N+A++ D NE + ITP+ YQ L+ALL + Sbjct: 325 PGHKLHKPVMVNNAATSNNEGPIDHTQVSDSNE--LKITPQQYQHLIALLQQ 374 >gb|KYP69419.1| Copia protein, partial [Cajanus cajan] Length = 195 Score = 76.3 bits (186), Expect = 2e-14 Identities = 30/64 (46%), Positives = 47/64 (73%) Frame = +3 Query: 246 WIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLYVP 425 WI+D+GATDHIC++L F SYH+I+P+ +++ NG A Y+ ++ N + ++HNVLY+P Sbjct: 2 
WILDSGATDHICSSLTHFTSYHQINPICVKLPNGNQVTANYSRSVFLNQNHVIHNVLYIP 61 Query: 426 EFGF 437 F F Sbjct: 62 CFTF 65 >dbj|GAU38852.1| hypothetical protein TSUD_154140 [Trifolium subterraneum] Length = 1494 Score = 62.8 bits (151), Expect(2) = 3e-14 Identities = 25/64 (39%), Positives = 40/64 (62%) Frame = +3 Query: 246 WIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLYVP 425 WI+D+GATDH+C +L LF Y +++P+ +++ NG+ G I + L +VLY+P Sbjct: 387 WIIDSGATDHVCASLSLFTEYRKVNPIPVKLPNGSIVTTDIIGNISITPTITLKHVLYMP 446 Query: 426 EFGF 437 F F Sbjct: 447 HFSF 450 Score = 43.1 bits (100), Expect(2) = 3e-14 Identities = 31/85 (36%), Positives = 43/85 (50%), Gaps = 8/85 (9%) Frame = +1 Query: 1 PGFKFRDKE--DSTNSASTEGNNTQ-DKNAAPQQNESSVSITPEVYQKLMALLNKSGDGG 171 PG++F+D S N + N D N ++ ++ + E YQ LMALL S G Sbjct: 296 PGYRFKDGTVVGSKNQGQSSANCVNADDNMEQSSVDTRMTFSAEDYQALMALLKNSKSAG 355 Query: 172 IG--QVHNV---IASNITQDSSGNL 231 G QV+NV IAS+ T D GN+ Sbjct: 356 EGSSQVNNVSKFIASSFTNDKQGNV 380 >ref|XP_019450659.1| PREDICTED: uncharacterized protein LOC109352929 [Lupinus angustifolius] Length = 683 Score = 79.0 bits (193), Expect = 3e-14 Identities = 34/66 (51%), Positives = 47/66 (71%) Frame = +3 Query: 240 TIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLY 419 ++WI+DTGATDH+C++L F YH I P+ I M NG+ T A+ +G++ + L LHNVLY Sbjct: 409 SLWILDTGATDHVCHDLGKFNIYHSIKPIIINMPNGSTTLARLSGSVSISESLTLHNVLY 468 Query: 420 VPEFGF 437 VP F F Sbjct: 469 VPNFKF 474 >ref|XP_019430924.1| PREDICTED: uncharacterized protein LOC109338182 [Lupinus angustifolius] Length = 433 Score = 78.6 bits (192), Expect = 3e-14 Identities = 38/88 (43%), Positives = 58/88 (65%) Frame = +3 Query: 174 RTSAQCDSIQHNTRFLR*SLIHTIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTH 353 R ++Q + Q + R ++ H WI+DTGA DH+C +L F ++H I P++I + NGT Sbjct: 310 RNTSQGNQNQEGNQITR-NISH--WILDTGAIDHVCPDLSSFTTFHSIRPIAIGLPNGTK 366 Query: 354 TYAKYTGTIMCNFDLILHNVLYVPEFGF 437 +A ++GTIM + LILH+VLY+P F F Sbjct: 367 IFANHSGTIMISEMLILHDVLYIPNFKF 394 >ref|XP_021603009.1| uncharacterized 
protein LOC110608109 [Manihot esculenta] Length = 591 Score = 78.6 bits (192), Expect = 4e-14 Identities = 34/64 (53%), Positives = 44/64 (68%) Frame = +3 Query: 246 WIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLYVP 425 W++DTGATDHIC +L LF SY IHP+ +++ NG + + GTI N DL L NVLY+P Sbjct: 498 WLLDTGATDHICFSLSLFSSYKRIHPIHVKLPNGEQLMSHFLGTISINDDLCLTNVLYIP 557 Query: 426 EFGF 437 F F Sbjct: 558 SFTF 561 >gb|AER13167.1| putative retrovirus-like polyprotein [Phaseolus vulgaris] Length = 1009 Score = 78.2 bits (191), Expect = 6e-14 Identities = 33/64 (51%), Positives = 46/64 (71%) Frame = +3 Query: 246 WIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLYVP 425 WI+D+GATDH+C++L F SY I P+SI + NG H +AKY+GT++ N L +VLYVP Sbjct: 622 WILDSGATDHVCSSLSEFTSYKSIKPISISLPNGHHVFAKYSGTVIFNHKFYLIDVLYVP 681 Query: 426 EFGF 437 + F Sbjct: 682 QLSF 685 >gb|KYP77253.1| hypothetical protein KK1_044385, partial [Cajanus cajan] Length = 99 Score = 72.8 bits (177), Expect = 6e-14 Identities = 32/66 (48%), Positives = 44/66 (66%) Frame = +3 Query: 240 TIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLY 419 T WIVD+GATDHI ++L L +Y +I P I + NG A ++GT+ + ++HNVLY Sbjct: 9 TPWIVDSGATDHIASSLALLQAYTKIKPARINLPNGAFVTAHFSGTVQFSPSFVIHNVLY 68 Query: 420 VPEFGF 437 VPEF F Sbjct: 69 VPEFNF 74 >gb|KYP41411.1| hypothetical protein KK1_037205, partial [Cajanus cajan] Length = 99 Score = 72.8 bits (177), Expect = 6e-14 Identities = 32/66 (48%), Positives = 44/66 (66%) Frame = +3 Query: 240 TIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLY 419 T WIVD+GATDHI ++L L +Y +I P I + NG A ++GT+ + ++HNVLY Sbjct: 9 TPWIVDSGATDHIASSLALLQAYTKIKPARINLPNGAFVTAHFSGTVQFSPSFVIHNVLY 68 Query: 420 VPEFGF 437 VPEF F Sbjct: 69 VPEFNF 74 >ref|XP_019423079.1| PREDICTED: uncharacterized protein LOC109332552 [Lupinus angustifolius] Length = 482 Score = 77.8 bits (190), Expect = 7e-14 Identities = 33/67 (49%), Positives = 46/67 (68%) Frame = +3 Query: 237 
HTIWIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVL 416 HT WI+DTGATDH+C +L LF ++H I P++I + NGT A G I + + LH+VL Sbjct: 413 HTYWILDTGATDHVCPDLSLFVAHHNIRPITIGLPNGTTIQATIAGAIKLSDSITLHHVL 472 Query: 417 YVPEFGF 437 YVP+F + Sbjct: 473 YVPQFHY 479 >ref|XP_021621503.1| uncharacterized protein LOC110621536 [Manihot esculenta] Length = 663 Score = 77.4 bits (189), Expect = 1e-13 Identities = 33/64 (51%), Positives = 45/64 (70%) Frame = +3 Query: 246 WIVDTGATDHICNNLPLFYSYHEIHPVSIRMSNGTHTYAKYTGTIMCNFDLILHNVLYVP 425 W++DTGATDHIC +L LF SY IHP+ +++ NG + ++GTI N +L L NVLY+P Sbjct: 343 WLLDTGATDHICFSLSLFSSYKRIHPIHVKLPNGEQLMSHFSGTISLNDNLCLTNVLYIP 402 Query: 426 EFGF 437 F F Sbjct: 403 SFTF 406