BLASTX nr result
ID: Astragalus24_contig00027802
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00027802
         (403 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|PNX59779.1| retrovirus-related Pol polyprotein from transposo...   140   3e-40
gb|PNY02869.1| copia-type polyprotein [Trifolium pratense]            136   5e-38
dbj|GAU22946.1| hypothetical protein TSUD_326740 [Trifolium subt...   144   4e-37
dbj|GAU10241.1| hypothetical protein TSUD_420250, partial [Trifo...   136   2e-36
gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]   141   2e-36
gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposo...   140   3e-36
dbj|GAU18821.1| hypothetical protein TSUD_228110 [Trifolium subt...   140   8e-36
dbj|GAU12447.1| hypothetical protein TSUD_229810 [Trifolium subt...   139   1e-35
dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subt...   137   5e-35
gb|PNX56727.1| retrovirus-related Pol polyprotein from transposo...   126   1e-34
gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense]   136   1e-34
dbj|GAU16533.1| hypothetical protein TSUD_167640 [Trifolium subt...   135   2e-34
dbj|GAU21581.1| hypothetical protein TSUD_35440 [Trifolium subte...   135   2e-34
gb|PNX67946.1| copia-type polyprotein, partial [Trifolium pratense]   127   3e-34
gb|PNX94698.1| copia-type polyprotein [Trifolium pratense]            135   3e-34
dbj|GAU23238.1| hypothetical protein TSUD_172660 [Trifolium subt...   134   6e-34
dbj|GAU43011.1| hypothetical protein TSUD_28300 [Trifolium subte...   133   7e-34
dbj|GAU29902.1| hypothetical protein TSUD_379930 [Trifolium subt...   134   8e-34
dbj|GAU31928.1| hypothetical protein TSUD_271130, partial [Trifo...   134   1e-33
gb|PNY01730.1| copia-type polyprotein, partial [Trifolium pratense]   134   1e-33

>gb|PNX59779.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense]
          Length = 150

 Score = 140 bits (354), Expect = 3e-40
 Identities = 61/104 (58%), Positives = 79/104 (75%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E++E YKCHKLGHFQYEC S E ANYA FDD EE+LLMA + + WF+DSGC
Sbjct: 41  ESIECYKCHKLGHFQYECQSGEGGYANYAEFDDSEEVLLMAHDEDSSNSNSKLWFIDSGC 100

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           SNHM G+K W +LD+S+RETV+LGD S++N+MG+G+V+L+ G
Sbjct: 101 SNHMSGVKAWFHDLDDSFRETVRLGDDSKMNVMGRGNVKLQLNG 144

>gb|PNY02869.1| copia-type polyprotein [Trifolium pratense]
          Length = 192

 Score = 136 bits (343), Expect = 5e-38
 Identities = 57/104 (54%), Positives = 82/104 (78%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN+E ++CHKLGH+Q ECP+ E+ AN+ FDD+EE+LLMA+G ++ +++ W+LDSGC
Sbjct: 45  ENIECFRCHKLGHYQSECPNWEDANANFVEFDDKEEILLMAQGTDESNDKKAVWYLDSGC 104

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           SNHM G KEWLF+ D +RE+VKLGD S++ +MGKG+++L G
Sbjct: 105 SNHMVGNKEWLFDFDVIFRESVKLGDDSKMAVMGKGNLKLNING 148

>dbj|GAU22946.1| hypothetical protein TSUD_326740 [Trifolium subterraneum]
          Length = 1222

 Score = 144 bits (362), Expect = 4e-37
 Identities = 63/104 (60%), Positives = 82/104 (78%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN+E +KCHKLGHFQ ECPS EE+ ANYA FD+ EE+LLMA+ +G WFLDSGC
Sbjct: 219 ENVECFKCHKLGHFQSECPSWEEENANYAQFDESEEILLMAQETNEGESNNEIWFLDSGC 278

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           SNHM G K+WLF+ D SY+++VKLGD S++ +MGKG+++L+ EG
Sbjct: 279 SNHMIGNKDWLFDFDSSYKDSVKLGDDSKMPVMGKGNLKLQIEG 322

>dbj|GAU10241.1| hypothetical protein TSUD_420250, partial [Trifolium subterraneum]
          Length = 333

 Score = 136 bits (342), Expect = 2e-36
 Identities = 65/105 (61%), Positives = 80/105 (76%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN+E YKCHKLGH+Q ECP E ANYA F DEEE LLMAR + + E AWFLDSGC
Sbjct: 98  ENIECYKCHKLGHYQNECPEWGEGNANYAEFLDEEETLLMARTNSEELKNE-AWFLDSGC 156

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           SNHM G K WL+E DE+YR++VKLGD S++++MGKG+V+L G+
Sbjct: 157 SNHMVGNKNWLYEFDENYRDSVKLGDDSKMSVMGKGNVKLNIGGK 201

>gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 886

 Score = 141 bits (356), Expect = 2e-36
 Identities = 60/104 (57%), Positives = 84/104 (80%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN+E ++CHKLGH+Q ECP+ E+ AN+A FDD+EE+LLMA+G ++ ++ W+LDSGC
Sbjct: 242 ENIECFRCHKLGHYQSECPNWEDANANFAEFDDKEEILLMAQGTDESNNKKVVWYLDSGC 301

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           SNHM G KEWLF+ D+S+RE+VKLGD SR+ +MGKG+++L G
Sbjct: 302 SNHMVGNKEWLFDFDDSFRESVKLGDDSRMAVMGKGNLKLNING 345

>gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
          Length = 582

 Score = 140 bits (352), Expect = 3e-36
 Identities = 61/105 (58%), Positives = 79/105 (75%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E++E YKCHK GH+QYECP+L + NY GFD+EEEMLLMA + ++E WFLDSGC
Sbjct: 239 EHIECYKCHKFGHYQYECPNLATDVVNYTGFDEEEEMLLMALEGSKENDQE-CWFLDSGC 297

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           S HMCG+K W +LDE +RE VKLGD +++MG+G+V+L EGR
Sbjct: 298 STHMCGVKRWFIDLDEQFREVVKLGDGRTLSVMGRGNVKLCVEGR 342

>dbj|GAU18821.1| hypothetical protein TSUD_228110 [Trifolium subterraneum]
          Length = 1013

 Score = 140 bits (352), Expect = 8e-36
 Identities = 66/108 (61%), Positives = 86/108 (79%), Gaps = 4/108 (3%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGV----EQGTEEERAWFL 150
           EN+E++KCHKLGHFQ ECPS EE+ ANYA FD+ EE+LLMA+ EQG+ E WFL
Sbjct: 222 ENVEFFKCHKLGHFQSECPSREEENANYAQFDEGEEILLMAQETKETKEQGSHSE-IWFL 280

Query: 149 DSGCSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           DSGCSNHM G KEWLF+ D+S++++VKLGD S++ +MGKG+++L EG
Sbjct: 281 DSGCSNHMIGNKEWLFDFDDSFKDSVKLGDDSKMAVMGKGNLKLHIEG 328

>dbj|GAU12447.1| hypothetical protein TSUD_229810 [Trifolium subterraneum]
          Length = 1102

 Score = 139 bits (350), Expect = 1e-35
 Identities = 61/105 (58%), Positives = 81/105 (77%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN+E +KCHKLGHFQ ECPS EE+ ANYA FD+ EE+LL+A+ +G WFLDS C
Sbjct: 146 ENVECFKCHKLGHFQSECPSWEEENANYAQFDESEEILLVAQETNEGESNNEIWFLDSSC 205

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           S HM G KEWLF+ D SY+++VKLGD S++ +MGKG+++L+ EG+
Sbjct: 206 SKHMIGNKEWLFDFDSSYKDSVKLGDDSKMPVMGKGNLKLQIEGQ 250

>dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subterraneum]
          Length = 1322

 Score = 137 bits (346), Expect = 5e-35
 Identities = 61/101 (60%), Positives = 82/101 (81%), Gaps = 1/101 (0%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMAR-GVEQGTEEERAWFLDSG 141
           EN+E Y+CHKLGH+Q+ECP+ EEK ANYA +D EE+LLMA+ G+E +E WFLDSG
Sbjct: 238 ENIECYRCHKLGHYQHECPTWEEKDANYAAYDSHEEILLMAKHGIETDARDE-VWFLDSG 296

Query: 140 CSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRL 18
           CSNHM G +EWLF+ D++ RE+VKLGD SR+ ++GKG+++L
Sbjct: 297 CSNHMVGTREWLFDFDDNIRESVKLGDDSRMQILGKGNLKL 337

>gb|PNX56727.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense]
          Length = 153

 Score = 126 bits (317), Expect = 1e-34
 Identities = 59/108 (54%), Positives = 77/108 (71%), Gaps = 10/108 (9%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMA----------RGVEQGTEE 168
           EN+E YKCHKLGHFQ ECP EE NYA FD+ EE+LLMA R E+ +
Sbjct: 46  ENVECYKCHKLGHFQSECPDWEEDNVNYAEFDEAEELLLMAQKGKEIKEIQRSDERFNDN 105

Query: 167 ERAWFLDSGCSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSV 24
           + WFLDSGCSNHM G KEWLF+ D+S++++VKLGD S++ ++GKG++
Sbjct: 106 HQIWFLDSGCSNHMVGNKEWLFDYDDSFKDSVKLGDDSKMAVIGKGNL 153

>gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 1062

 Score = 136 bits (343), Expect = 1e-34
 Identities = 62/104 (59%), Positives = 83/104 (79%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           +N+E Y CHKLGHFQ +CP+ +EK ANYA FD+ EEMLLMA E+G+ +++ WFLDSGC
Sbjct: 269 DNIECYHCHKLGHFQSDCPAWDEK-ANYAEFDEGEEMLLMAHS-EKGSYDKKVWFLDSGC 326

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           NHMCG K+W F LDE +R +VKLGD+SR+ ++GKG+V+L+ G
Sbjct: 327 RNHMCGTKDWFFNLDEQFRISVKLGDNSRMMVVGKGNVKLRIGG 370

>dbj|GAU16533.1| hypothetical protein TSUD_167640 [Trifolium subterraneum]
          Length = 1103

 Score = 135 bits (341), Expect = 2e-34
 Identities = 62/106 (58%), Positives = 79/106 (74%), Gaps = 2/106 (1%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEK-IANYAGFDDEEEMLLMAR-GVEQGTEEERAWFLDS 144
           EN+E YKCHK GHFQYEC + E ANYA FDD EE+LLMA + W++DS
Sbjct: 239 ENVECYKCHKFGHFQYECQNSEGGGYANYADFDDSEEVLLMAHESTSSNNPRAKVWYIDS 298

Query: 143 GCSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           GCSNHMCGIKEW +LDES+RE+V+LGD S++++MGKG+V+L+ G
Sbjct: 299 GCSNHMCGIKEWFHDLDESFRESVRLGDDSQMSVMGKGNVKLQMNG 344

>dbj|GAU21581.1| hypothetical protein TSUD_35440 [Trifolium subterraneum]
          Length = 1283

 Score = 135 bits (341), Expect = 2e-34
 Identities = 66/105 (62%), Positives = 81/105 (77%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E +E +KCHKLGH++ ECP E ANYA F DEEE LLMAR ++ E WFLDSGC
Sbjct: 215 ETIECFKCHKLGHYRSECPDWEGN-ANYAEFLDEEETLLMARTNTNESKHE-TWFLDSGC 272

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           SNHM G K+WL+ELDESYR+TVKLGD S++N+MGKG+V+L +GR
Sbjct: 273 SNHMVGNKDWLYELDESYRDTVKLGDDSKMNVMGKGNVKLSIDGR 317

>gb|PNX67946.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 197

 Score = 127 bits (318), Expect = 3e-34
 Identities = 57/109 (52%), Positives = 82/109 (75%), Gaps = 5/109 (4%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGV-----EQGTEEERAWF 153
           E++E YKCHKLGH+Q +CPS +E ANYA FD+ +E+LLMA+ + +E+ WF
Sbjct: 50  ESVECYKCHKLGHYQSDCPSWDEDNANYAEFDEGQEILLMAQNTMVNESQNSSEKLELWF 109

Query: 152 LDSGCSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           LDSGCSNHM G K WLF+ D+S++++VKLGD S++++ GKG+++L EG
Sbjct: 110 LDSGCSNHMVGNKNWLFDYDDSFKDSVKLGDDSKMSVEGKGNLKLYIEG 158

>gb|PNX94698.1| copia-type polyprotein [Trifolium pratense]
          Length = 1324

 Score = 135 bits (340), Expect = 3e-34
 Identities = 60/101 (59%), Positives = 81/101 (80%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E +E +KCHKLGH++ ECP+ EE ANYA FD+E+E+LLMAR +E WFLDSGC
Sbjct: 245 ELVECFKCHKLGHYRNECPTWEEYDANYAEFDEEQELLLMARDNVSTHAKEEVWFLDSGC 304

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLK 15
           SNHM G KEWLF+ D+S+RE+VKLGD SR+++MG+G+++L+
Sbjct: 305 SNHMIGTKEWLFDYDDSFRESVKLGDDSRMSVMGRGNLKLQ 345

>dbj|GAU23238.1| hypothetical protein TSUD_172660 [Trifolium subterraneum]
          Length = 1132

 Score = 134 bits (338), Expect = 6e-34
 Identities = 61/106 (57%), Positives = 79/106 (74%), Gaps = 2/106 (1%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEK-IANYAGFDDEEEMLLMAR-GVEQGTEEERAWFLDS 144
           EN+E YKCHK GHFQYEC + E ANYA FDD EE+LLMA + W++DS
Sbjct: 239 ENVECYKCHKFGHFQYECQNSEGGGYANYADFDDSEEVLLMAHESTSSNNPRAKVWYIDS 298

Query: 143 GCSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           GCSNHMCGIKEW +LD+S+RE+V+LGD S++++MGKG+V+L+ G
Sbjct: 299 GCSNHMCGIKEWFHDLDDSFRESVRLGDDSQMSVMGKGNVKLQMNG 344

>dbj|GAU43011.1| hypothetical protein TSUD_28300 [Trifolium subterraneum]
          Length = 538

 Score = 133 bits (334), Expect = 7e-34
 Identities = 64/105 (60%), Positives = 82/105 (78%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E +E YKCHKLGH+Q ECP+ E ANYA F+DEEEMLLMA+ + +EE WFLDSGC
Sbjct: 239 ELVECYKCHKLGHYQNECPTWGEN-ANYAEFNDEEEMLLMAKTNCEEMKEE-IWFLDSGC 296

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           SNHM G K+W++E DE+YR++VKLGD S++ +MGKG+V+L GR
Sbjct: 297 SNHMIGNKDWMYEFDETYRDSVKLGDDSKMQVMGKGNVKLSINGR 341

>dbj|GAU29902.1| hypothetical protein TSUD_379930 [Trifolium subterraneum]
          Length = 1277

 Score = 134 bits (337), Expect = 8e-34
 Identities = 65/105 (61%), Positives = 80/105 (76%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E +E +KCHKLGH++ ECP E ANY F DEEE LLMAR ++ E WFLDSGC
Sbjct: 244 ETIECFKCHKLGHYRNECPEWEGN-ANYVEFLDEEETLLMARTNADESKHE-TWFLDSGC 301

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           SNHM G K+WL+ELDESYR+TVKLGD S++N+MGKG+V+L +GR
Sbjct: 302 SNHMVGNKDWLYELDESYRDTVKLGDDSKMNVMGKGNVKLSIDGR 346

>dbj|GAU31928.1| hypothetical protein TSUD_271130, partial [Trifolium subterraneum]
          Length = 747

 Score = 134 bits (336), Expect = 1e-33
 Identities = 65/105 (61%), Positives = 80/105 (76%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E +E +KCHKLGH++ ECP E ANYA F DEEE LLMAR ++ E WFLDSGC
Sbjct: 244 ETIECFKCHKLGHYRNECPDWEGN-ANYAEFLDEEETLLMARTNADESKHE-TWFLDSGC 301

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           SNHM G K+WL+ELDESYR+T KLGD S++N+MGKG+V+L +GR
Sbjct: 302 SNHMVGNKDWLYELDESYRDTFKLGDDSKMNVMGKGNVKLSIDGR 346

>gb|PNY01730.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 861

 Score = 134 bits (336), Expect = 1e-33
 Identities = 63/105 (60%), Positives = 79/105 (75%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN E +KCHKLGH+Q +CP+ E ANYA FDDEEEMLLMA+ + E WFLDSGC
Sbjct: 240 ENAECFKCHKLGHYQSDCPNWGEN-ANYAEFDDEEEMLLMAKTNDDAKSEN--WFLDSGC 296

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           S HM G K+W ++ DE+YR++VKLGD SR+N+MGKG+V+L GR
Sbjct: 297 SYHMAGNKDWFYDFDENYRDSVKLGDDSRMNVMGKGNVKLSINGR 341