BLASTX nr result
ID: Astragalus22_contig00015380
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00015380 (423 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|GAU42845.1| hypothetical protein TSUD_387380 [Trifolium subt... 54 5e-13 dbj|GAU50483.1| hypothetical protein TSUD_409690 [Trifolium subt... 50 4e-12 gb|OIV93118.1| hypothetical protein TanjilG_20780 [Lupinus angus... 50 6e-11 gb|PNX96991.1| retrotransposon-related protein [Trifolium pratense] 47 7e-11 gb|KYP65802.1| Retrovirus-related Pol polyprotein from transposo... 49 7e-11 gb|PNX78253.1| copia-type polyprotein, partial [Trifolium pratense] 48 1e-10 dbj|GAU33377.1| hypothetical protein TSUD_365030 [Trifolium subt... 47 9e-10 dbj|GAU25674.1| hypothetical protein TSUD_266010 [Trifolium subt... 49 9e-10 dbj|GAU33536.1| hypothetical protein TSUD_143290 [Trifolium subt... 47 1e-09 dbj|GAU46968.1| hypothetical protein TSUD_143100 [Trifolium subt... 47 1e-09 dbj|GAU34493.1| hypothetical protein TSUD_388050 [Trifolium subt... 45 3e-09 dbj|GAU25767.1| hypothetical protein TSUD_222240 [Trifolium subt... 44 8e-09 dbj|GAU48263.1| hypothetical protein TSUD_405090 [Trifolium subt... 45 2e-08 dbj|GAU36721.1| hypothetical protein TSUD_318190 [Trifolium subt... 47 8e-08 gb|OIW00846.1| hypothetical protein TanjilG_12250 [Lupinus angus... 54 3e-06 dbj|GAU42259.1| hypothetical protein TSUD_327370 [Trifolium subt... 44 9e-06 gb|PNX76333.1| putative LRR receptor-like protein kinase [Trifol... 54 1e-05 >dbj|GAU42845.1| hypothetical protein TSUD_387380 [Trifolium subterraneum] Length = 1239 Score = 54.3 bits (129), Expect(2) = 5e-13 Identities = 31/105 (29%), Positives = 48/105 (45%) Frame = +1 Query: 109 QTWDWKLIQPSISAVKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXXXXXPVPVT 288 ++WDWK+ P I+ + EE D + + A + Sbjct: 707 ESWDWKVQNPVINPLLLDEEHSVEDTRPATKASSSRASE-----------------IITD 749 Query: 289 RPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVEE 423 +RNR S+RL D+E D I+ DG+++H L VD+EP+S EE Sbjct: 750 GTRRNRMPSTRLQDFETYNDNAINTDGDLVHFALFVDSEPVSFEE 794 Score = 47.4 bits (111), Expect(2) = 5e-13 Identities = 18/33 (54%), Positives = 28/33 (84%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 SK +IL+GYH TGAY+LY P+ +K+++ RD++V Sbjct: 671 SKQMILVGYHPTGAYRLYDPLNKKVELGRDIIV 703 >dbj|GAU50483.1| hypothetical protein TSUD_409690 [Trifolium subterraneum] Length = 1073 Score = 50.1 bits (118), Expect(2) = 4e-12 Identities = 30/105 (28%), Positives = 47/105 (44%) Frame = +1 Query: 109 QTWDWKLIQPSISAVKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXXXXXPVPVT 288 ++WDWK+ P I+ + EE D + + A + Sbjct: 654 ESWDWKVQNPVINPLLLDEEHSVEDTRPATEASSSRASE-----------------IITD 696 Query: 289 RPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVEE 423 +RN S+RL D+E D I+ DG+++H L VD+EP+S EE Sbjct: 697 GTRRNIMPSTRLQDFETYNDNAINIDGDLVHFALFVDSEPVSFEE 741 Score = 48.5 bits (114), Expect(2) = 4e-12 Identities = 19/33 (57%), Positives = 28/33 (84%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 SK +IL+GYH TGAY+LY P+ +K+K+ RD++V Sbjct: 618 SKQMILVGYHPTGAYRLYDPLNKKVKLGRDIVV 650 >gb|OIV93118.1| hypothetical protein TanjilG_20780 [Lupinus angustifolius] Length = 148 Score = 50.4 bits (119), Expect(2) = 6e-11 Identities = 33/110 (30%), Positives = 50/110 (45%), Gaps = 3/110 (2%) Frame = +1 Query: 103 KKQTWDW-KLIQPSISAVKSPEEDCATDYVD--KAAITENNAGHXXXXXXXXXXXXXXXX 273 + QTWDW + + + EED T+ + ITE +A Sbjct: 32 ESQTWDWGQNSKVKKQTLVQLEEDNITELQPCIEQQITEQSADRR--------------- 76 Query: 274 PVPVTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVEE 423 RP R RQ RL DYE+ D+ I +G+++H+ LL + EP+S +E Sbjct: 77 -----RPSRTRQAPQRLSDYEIFPDSNITTEGDLVHIALLAEMEPVSFDE 121 Score = 44.3 bits (103), Expect(2) = 6e-11 Identities = 19/28 (67%), Positives = 24/28 (85%) Frame = +3 Query: 12 VILLGYHSTGAYKLYCPMTRKIKISRDV 95 +IL+G+HSTGAYK+Y P T+KI SRDV Sbjct: 1 MILIGFHSTGAYKVYDPSTQKIMFSRDV 28 >gb|PNX96991.1| retrotransposon-related protein [Trifolium pratense] Length = 1333 Score = 47.4 bits (111), Expect(2) = 7e-11 Identities = 27/105 (25%), Positives = 43/105 (40%) Frame = +1 Query: 109 QTWDWKLIQPSISAVKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXXXXXPVPVT 288 + WDWK S +D + N G P Sbjct: 722 ENWDWKQKSSSKKTYAVDLDDGLNVNNEPVTFVNQNQGAISDEEMHTSSDDEDANLPP-- 779 Query: 289 RPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVEE 423 RPQR QL RL D E++ D ++++G+I+H +L + EP++V + Sbjct: 780 RPQRQTQLPRRLADCEMLQDNAVNSEGDIVHYAMLANTEPINVSD 824 Score = 47.0 bits (110), Expect(2) = 7e-11 Identities = 20/33 (60%), Positives = 27/33 (81%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 S+ +I +GYH TGAY+LY P+T K++ISRDV V Sbjct: 686 SEAMIFVGYHRTGAYRLYNPITNKVEISRDVKV 718 >gb|KYP65802.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 227 Score = 48.5 bits (114), Expect(2) = 7e-11 Identities = 31/109 (28%), Positives = 49/109 (44%), Gaps = 4/109 (3%) Frame = +1 Query: 109 QTWDWKLIQPSISAVKSP----EEDCATDYVDKAAITENNAGHXXXXXXXXXXXXXXXXP 276 + W+W QPS + K E DC D+ +K + + + Sbjct: 119 EAWEW-FKQPSTTISKVSLTLEESDCGEDHGEKNKVAQTHN------------------- 158 Query: 277 VPVTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVEE 423 TRPQR++ L SR D E+ D I DG+++++ L D EP++ EE Sbjct: 159 ---TRPQRSKHLPSRFDDCEVFSDDLITVDGDLVYMAFLADFEPITFEE 204 Score = 45.8 bits (107), Expect(2) = 7e-11 Identities = 20/31 (64%), Positives = 26/31 (83%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDV 95 S+ IL+GYHSTGAY+LY P+ +KI +SRDV Sbjct: 83 SQPTILVGYHSTGAYELYDPVAKKIMLSRDV 113 >gb|PNX78253.1| copia-type polyprotein, partial [Trifolium pratense] Length = 780 Score = 48.1 bits (113), Expect(2) = 1e-10 Identities = 22/33 (66%), Positives = 27/33 (81%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 SK VIL+GYH TGAY+LY P+ KI+I RDV+V Sbjct: 523 SKQVILVGYHPTGAYRLYDPVKEKIEIGRDVVV 555 Score = 45.4 bits (106), Expect(2) = 1e-10 Identities = 30/112 (26%), Positives = 47/112 (41%), Gaps = 5/112 (4%) Frame = +1 Query: 103 KKQTWDWK-----LIQPSISAVKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXXX 267 + + WDWK +I P +S + D T NN Sbjct: 557 ESENWDWKGKSTTVINPLLSDIDD----------DAEITTANNGASTSSHGNAENGNVVS 606 Query: 268 XXPVPVTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVEE 423 P +R R S+RL+D E+ D ID +G+++H L VD EP+++E+ Sbjct: 607 TIPAG---SKRTRIPSTRLLDCEVYADDAIDAEGDLVHFALFVDTEPVNIED 655 >dbj|GAU33377.1| hypothetical protein TSUD_365030 [Trifolium subterraneum] Length = 1208 Score = 47.4 bits (111), Expect(2) = 9e-10 Identities = 31/112 (27%), Positives = 49/112 (43%), Gaps = 6/112 (5%) Frame = +1 Query: 103 KKQTWDWKLIQPSISA------VKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXX 264 + ++WDWK S V + E + +VD+ EN+ Sbjct: 647 ENESWDWKQKSTSKKTCEVDLDVGTSEANAPVTFVDQHQHQENHNEEDASSDEDNGKH-- 704 Query: 265 XXXPVPVTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVE 420 +PV R R Q+ RL D +++ D +DN+G I+H +L D EPL V+ Sbjct: 705 ----LPV-RTHRTTQIPRRLADCDMVPDNVVDNEGNIVHYAMLADTEPLDVK 751 Score = 43.1 bits (100), Expect(2) = 9e-10 Identities = 19/33 (57%), Positives = 26/33 (78%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 S+ ++ +GYH TGAY+LY P + KI+ISRDV V Sbjct: 613 SEPMVFVGYHRTGAYRLYNPTSDKIEISRDVKV 645 >dbj|GAU25674.1| hypothetical protein TSUD_266010 [Trifolium subterraneum] Length = 1195 Score = 49.3 bits (116), Expect(2) = 9e-10 Identities = 32/112 (28%), Positives = 49/112 (43%), Gaps = 6/112 (5%) Frame = +1 Query: 103 KKQTWDWKLIQPSISA------VKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXX 264 K ++WDWKL S V + E + +VD+ EN Sbjct: 680 KNESWDWKLKSTSKKTCEVDLDVGTSEANAPVTFVDQHQHKEN------YNEEDASSDED 733 Query: 265 XXXPVPVTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVE 420 +P+ R R Q+ RL D +++ D +DN+G I+H +L D EPL V+ Sbjct: 734 NGRHLPL-RTHRTTQIPRRLADCDMVPDNVVDNEGNIVHYAMLADTEPLDVK 784 Score = 41.2 bits (95), Expect(2) = 9e-10 Identities = 19/35 (54%), Positives = 26/35 (74%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLVKK 107 S+ ++ +GYH TGAY+LY + KI+ISRDV V K Sbjct: 646 SEPIVFVGYHRTGAYRLYNLTSDKIEISRDVKVLK 680 >dbj|GAU33536.1| hypothetical protein TSUD_143290 [Trifolium subterraneum] Length = 1007 Score = 47.4 bits (111), Expect(2) = 1e-09 Identities = 31/112 (27%), Positives = 49/112 (43%), Gaps = 6/112 (5%) Frame = +1 Query: 103 KKQTWDWKLIQPSISA------VKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXX 264 + ++WDWK S V + E + +VD+ EN+ Sbjct: 544 ENESWDWKQKSTSKKTSEVDLDVGTSEANAPVTFVDQHQHQENHNEEDASSDEDNGRH-- 601 Query: 265 XXXPVPVTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVE 420 +PV R R Q+ RL D +++ D +DN+G I+H +L D EPL V+ Sbjct: 602 ----LPV-RTHRTTQIPRRLADCDMVPDNVVDNEGNIVHYAMLADTEPLDVK 648 Score = 43.1 bits (100), Expect(2) = 1e-09 Identities = 19/33 (57%), Positives = 26/33 (78%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 S+ ++ +GYH TGAY+LY P + KI+ISRDV V Sbjct: 510 SEPMVFVGYHRTGAYRLYNPTSDKIEISRDVKV 542 >dbj|GAU46968.1| hypothetical protein TSUD_143100 [Trifolium subterraneum] Length = 1293 Score = 47.0 bits (110), Expect(2) = 1e-09 Identities = 33/121 (27%), Positives = 52/121 (42%), Gaps = 15/121 (12%) Frame = +1 Query: 103 KKQTWDWKLIQPSISA------VKSPEEDCATDYVDK---------AAITENNAGHXXXX 237 + ++WDWK S V + E + +VD+ A+ E+N H Sbjct: 723 ENESWDWKQKSTSKKTCEVDLDVGTSEANAPVTFVDQHQENHNEEDASSDEDNGRH---- 778 Query: 238 XXXXXXXXXXXXPVPVTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSV 417 +PV R R Q+ RL D +++ D +DN+G I+H +L D EPL V Sbjct: 779 -------------LPV-RTHRTTQIPRRLADCDMVPDNVVDNEGNIVHYAMLADTEPLDV 824 Query: 418 E 420 + Sbjct: 825 K 825 Score = 43.1 bits (100), Expect(2) = 1e-09 Identities = 19/33 (57%), Positives = 26/33 (78%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 S+ ++ +GYH TGAY+LY P + KI+ISRDV V Sbjct: 689 SEPMVFVGYHRTGAYRLYNPTSDKIEISRDVKV 721 >dbj|GAU34493.1| hypothetical protein TSUD_388050 [Trifolium subterraneum] Length = 1412 Score = 45.4 bits (106), Expect(2) = 3e-09 Identities = 30/109 (27%), Positives = 47/109 (43%), Gaps = 6/109 (5%) Frame = +1 Query: 103 KKQTWDWKLIQPSISA------VKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXX 264 + ++WDWK S V + E + +VD+ EN+ Sbjct: 713 ENESWDWKQKSTSKKTCEVDLDVGTSEANAPVTFVDQHQHQENHNEEDASSDEDNGRH-- 770 Query: 265 XXXPVPVTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPL 411 +PV R R Q+ RL D +++ D +DN+G I+H +L D EPL Sbjct: 771 ----LPV-RTHRTTQIPRRLADCDMVPDNVVDNEGNIVHYAMLADTEPL 814 Score = 43.1 bits (100), Expect(2) = 3e-09 Identities = 19/33 (57%), Positives = 26/33 (78%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 S+ ++ +GYH TGAY+LY P + KI+ISRDV V Sbjct: 679 SEPMVFVGYHRTGAYRLYNPTSDKIEISRDVKV 711 >dbj|GAU25767.1| hypothetical protein TSUD_222240 [Trifolium subterraneum] Length = 1166 Score = 44.3 bits (103), Expect(2) = 8e-09 Identities = 26/106 (24%), Positives = 44/106 (41%) Frame = +1 Query: 103 KKQTWDWKLIQPSISAVKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXXXXXPVP 282 + ++WDWK S + + D T + + H P Sbjct: 646 ENESWDWKHKSTSKKTCEV-DLDVGTSEANAPVTFVDQHQHQKNHNEEDASSDEDNGRHP 704 Query: 283 VTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVE 420 R R + RL +++LD +DN+G I++ +LVD EPL+V+ Sbjct: 705 PVRTHRTTHIPRRLACCDMVLDNVVDNEGNIVYYAMLVDTEPLNVK 750 Score = 43.1 bits (100), Expect(2) = 8e-09 Identities = 19/33 (57%), Positives = 26/33 (78%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 S+ ++ +GYH TGAY+LY P + KI+ISRDV V Sbjct: 612 SEPMVFVGYHRTGAYRLYNPTSDKIEISRDVKV 644 >dbj|GAU48263.1| hypothetical protein TSUD_405090 [Trifolium subterraneum] Length = 1292 Score = 45.1 bits (105), Expect(2) = 2e-08 Identities = 19/36 (52%), Positives = 29/36 (80%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLVKKT 110 S+ +IL+GYH TG YKLY P+++K+ +RDV+V +T Sbjct: 705 SESMILVGYHVTGVYKLYNPISKKMTYNRDVIVDET 740 Score = 41.2 bits (95), Expect(2) = 2e-08 Identities = 29/107 (27%), Positives = 51/107 (47%), Gaps = 5/107 (4%) Frame = +1 Query: 109 QTWDWKLIQPSISAVK-SPEEDCATDYVDK----AAITENNAGHXXXXXXXXXXXXXXXX 273 +TWDW ++ S S ++ S D TD V+ A E AG+ Sbjct: 741 KTWDW-IVGSSTSRLQISQFSDDETDSVESNEEIEAPVEPTAGNDVIN------------ 787 Query: 274 PVPVTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLS 414 + + R RQ+ +R+ D E + D +++ DG+++H LL +EP++ Sbjct: 788 -AEIRKSTRPRQVPARIQDCETVNDNEVNEDGDLVHFALLARSEPIN 833 >dbj|GAU36721.1| hypothetical protein TSUD_318190 [Trifolium subterraneum] Length = 1087 Score = 47.0 bits (110), Expect(2) = 8e-08 Identities = 34/106 (32%), Positives = 49/106 (46%) Frame = +1 Query: 103 KKQTWDWKLIQPSISAVKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXXXXXPVP 282 + ++WDWK Q S S K+ E D D A +N H +P Sbjct: 711 ENESWDWK--QKSTSK-KTCEVDL--DVGTSEANASDNGRH-----------------LP 748 Query: 283 VTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVE 420 V R R Q+ RL D +++ D +DN+G I+H +L D EPL V+ Sbjct: 749 V-RTHRTTQIPRRLADCDMVPDNVVDNEGNIVHYAMLADTEPLDVK 793 Score = 37.0 bits (84), Expect(2) = 8e-08 Identities = 18/33 (54%), Positives = 25/33 (75%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 S+ ++ +GYH TG Y+LY P + KI+ISRDV V Sbjct: 678 SEPMVFVGYHRTG-YRLYNPTSDKIEISRDVKV 709 >gb|OIW00846.1| hypothetical protein TanjilG_12250 [Lupinus angustifolius] Length = 154 Score = 53.9 bits (128), Expect = 3e-06 Identities = 24/45 (53%), Positives = 35/45 (77%) Frame = +1 Query: 289 RPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVEE 423 R QR R L +R DYEL+ D++ID +G++IH+ L+V+AEP+S EE Sbjct: 76 RSQRVRTLPNRFSDYELLHDSQIDEEGDLIHLALMVEAEPISEEE 120 >dbj|GAU42259.1| hypothetical protein TSUD_327370 [Trifolium subterraneum] Length = 1090 Score = 44.3 bits (103), Expect(2) = 9e-06 Identities = 20/33 (60%), Positives = 27/33 (81%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLV 101 S+ +IL+GYH TGAYKLY P ++K+ SRDV+V Sbjct: 544 SEPMILVGYHVTGAYKLYNPTSKKMIYSRDVIV 576 Score = 32.7 bits (73), Expect(2) = 9e-06 Identities = 22/109 (20%), Positives = 43/109 (39%), Gaps = 4/109 (3%) Frame = +1 Query: 109 QTWDW----KLIQPSISAVKSPEEDCATDYVDKAAITENNAGHXXXXXXXXXXXXXXXXP 276 ++WDW +P +S + E D + T+NN G Sbjct: 580 KSWDWIAGSSTSRPHLSQIPDDETDAVE------STTDNNNG------------------ 615 Query: 277 VPVTRPQRNRQLSSRLVDYELILDTKIDNDGEIIHVTLLVDAEPLSVEE 423 + + +++ R+ D D ++D DG+++H L A+P++ E Sbjct: 616 --TRKSNKTKKVPPRIQDCITANDNEVDEDGDLVHFAQLAGAKPINYLE 662 >gb|PNX76333.1| putative LRR receptor-like protein kinase [Trifolium pratense] Length = 511 Score = 54.3 bits (129), Expect = 1e-05 Identities = 23/35 (65%), Positives = 30/35 (85%) Frame = +3 Query: 3 SKVVILLGYHSTGAYKLYCPMTRKIKISRDVLVKK 107 S+++IL+GYH TGAYKLY PMT K+ ISRDV+V + Sbjct: 133 SEIMILIGYHPTGAYKLYNPMTNKVNISRDVIVNE 167