BLASTX nr result
ID: Astragalus23_contig00012165
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00012165
         (343 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

dbj|GAU27929.1| hypothetical protein TSUD_160240 [Trifolium subt...   122   4e-30
gb|KYP35468.1| Retrovirus-related Pol polyprotein from transposo...   119   2e-29
gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposo...   120   3e-29
dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subte...   120   3e-29
dbj|GAU26945.1| hypothetical protein TSUD_06170, partial [Trifol...   120   4e-29
gb|KYP58223.1| Retrovirus-related Pol polyprotein from transposo...   112   5e-29
gb|PNX58304.1| retrovirus-related Pol polyprotein from transposo...   109   2e-28
dbj|GAU45313.1| hypothetical protein TSUD_300420 [Trifolium subt...   117   3e-28
gb|PNX77239.1| copia-type polyprotein, partial [Trifolium pratense]   114   3e-27
dbj|GAU15733.1| hypothetical protein TSUD_235460 [Trifolium subt...   114   3e-27
gb|PNX83704.1| copia-type polyprotein, partial [Trifolium pratense]   114   3e-27
gb|PNX77752.1| copia-type polyprotein, partial [Trifolium pratense]   114   5e-27
gb|PNX95763.1| retrotransposon-related protein, partial [Trifoli...   114   5e-27
dbj|GAU31303.1| hypothetical protein TSUD_315120 [Trifolium subt...   113   7e-27
gb|PNX69396.1| ubiquitin carboxyl-terminal hydrolase, partial [T...   105   8e-27
gb|PNY03001.1| copia-type polyprotein, partial [Trifolium pratense]   113   1e-26
dbj|GAU43961.1| hypothetical protein TSUD_283880 [Trifolium subt...   112   1e-26
gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]            112   1e-26
gb|PNX99782.1| copia-type polyprotein [Trifolium pratense]            112   3e-26
gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinen...   110   7e-26

>dbj|GAU27929.1| hypothetical protein TSUD_160240 [Trifolium subterraneum]
          Length = 1197

 Score = 122 bits (307), Expect = 4e-30
 Identities = 60/115 (52%), Positives = 79/115 (68%), Gaps = 2/115 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LI +++M NRM+ +V ++ CL T++D +LWH RYGHLS KGL LLN+K MV+G
Sbjct: 318 LIFSTQMSTNRMYMIVAPVIVLMCLQTTNEDKTQLWHHRYGHLSVKGLKLLNKKDMVKGL 377

Query: 183 PALKEPVKP*TDCLVGTLNFN--HNQKKWRASRKLELIHSDICGPISPESNSHKR 341
           PAL+E + TDCL+G + Q KWRA+ KL+LIH DICGPI+P SN KR
Sbjct: 378 PALRELNEKCTDCLMGKQHREVIPKQAKWRATAKLQLIHYDICGPINPSSNGGKR 432

>gb|KYP35468.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
          Length = 508

 Score = 119 bits (299), Expect = 2e-29
 Identities = 62/117 (52%), Positives = 79/117 (67%), Gaps = 4/117 (3%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVIS--GCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVE 176
           L+MTS M NNRMF + VI+ C+ ++ D E+WH+RYGHLS GL LLNQK MV+
Sbjct: 59  LLMTSYMANNRMFPIKAKLVITEAACIQTSNVDSTEMWHKRYGHLSYGGLKLLNQKAMVK 118

Query: 177 GFPALKEPVKP*TDCLVGTLNFN--HNQKKWRASRKLELIHSDICGPISPESNSHKR 341
           G P LKE K +C VG + + Q WRA+R+LELIHSDICGP +P SNS++R
Sbjct: 119 GLPELKEMDKVCPECAVGKQHRDAISKQSTWRATRRLELIHSDICGPSTPTSNSNRR 175

>gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
          Length = 582

 Score = 120 bits (300), Expect = 3e-29
 Identities = 62/117 (52%), Positives = 79/117 (67%), Gaps = 4/117 (3%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVIS--GCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVE 176
           L+MTS M NNRMF + VI+ C+ ++ D E+WH+RYGHLS GL LLNQK MV+
Sbjct: 388 LLMTSYMANNRMFPIKAKPVITEAACIQTSNVDSTEMWHKRYGHLSYGGLKLLNQKAMVK 447

Query: 177 GFPALKEPVKP*TDCLVGTLNFN--HNQKKWRASRKLELIHSDICGPISPESNSHKR 341
           G P LKE K +C VG + + Q WRA+R+LELIHSDICGP +P SNS++R
Sbjct: 448 GLPELKEMDKVCPECAVGKQHRDAISKQSTWRATRRLELIHSDICGPSTPTSNSNRR 504

>dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subterraneum]
          Length = 1172

 Score = 120 bits (301), Expect = 3e-29
 Identities = 65/117 (55%), Positives = 76/117 (64%), Gaps = 4/117 (3%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVI--SGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVE 176
           LIMTS M NRM+ + VI S C +T +D LWH+RYGHLS KG+ +L QK MV
Sbjct: 376 LIMTSHMSMNRMYVIKAPVVIPQSQCFKITKNDEDTLWHKRYGHLSFKGINVLVQKNMVI 435

Query: 177 GFPALKEPVKP*TDCLVGTLNFNHNQKK--WRASRKLELIHSDICGPISPESNSHKR 341
           G P LKEP + T C+ G + KK WRAS KLELIHSDICGPI+PESN KR
Sbjct: 436 GLPKLKEPTEKCTHCMKGKQQRENVPKKSHWRASHKLELIHSDICGPINPESNGKKR 492

>dbj|GAU26945.1| hypothetical protein TSUD_06170, partial [Trifolium subterraneum]
          Length = 832

 Score = 120 bits (300), Expect = 4e-29
 Identities = 61/115 (53%), Positives = 77/115 (66%), Gaps = 2/115 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LIMT+ M NRM+ + +I CL V+ + A+LWH+RY HLS KG+ +L QK MV G
Sbjct: 139 LIMTTHMSMNRMYIIKAPIIIPTCLKVSQNSEAQLWHQRYDHLSFKGMKILAQKNMVHGL 198

Query: 183 PALKEPVKP*TDCLVGTLNFNHNQKK--WRASRKLELIHSDICGPISPESNSHKR 341
           P +KEP + T C+ G ++ KK WRAS KLELIHSDICGPI+PESN KR
Sbjct: 199 PDVKEPNQSCTHCMKGKQQRDYVPKKSSWRASTKLELIHSDICGPINPESNGKKR 253

>gb|KYP58223.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
          Length = 164

 Score = 112 bits (279), Expect = 5e-29
 Identities = 58/115 (50%), Positives = 74/115 (64%), Gaps = 4/115 (3%)
 Frame = +3

Query: 9   MTSKMENNRMFTV--VGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           M S M +NRMF + + V C ++D+A+LWH R+GHLS KGL L QKGMVEG
Sbjct: 1   MQSNMSSNRMFILHAISLPVAPTCFNTVTEDVAQLWHCRFGHLSFKGLQTLQQKGMVEGL 60

Query: 183 PALKEPVKP*TDCLVGTLNFNH--NQKKWRASRKLELIHSDICGPISPESNSHKR 341
           P LK P K DCL+G + + + WRAS+ L+L+H+DICGPI P SNS KR
Sbjct: 61  PMLKSPSKLCKDCLIGKQHRDSFPMRSSWRASQILQLVHADICGPIKPISNSKKR 115

>gb|PNX58304.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense]
          Length = 132

 Score = 109 bits (272), Expect = 2e-28
 Identities = 56/115 (48%), Positives = 75/115 (65%), Gaps = 4/115 (3%)
 Frame = +3

Query: 9   MTSKMENNRMFTVVGSSVISGCLLVTSDDL--AELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           M + M NRMF + ++ C+ ++D A LWH+RYGHLS KG+ +L QK MV G
Sbjct: 1   MATHMTLNRMFVIKAPVIVPHCMKTSNDSNVNANLWHQRYGHLSFKGMSVLVQKEMVTGL 60

Query: 183 PALKEPVKP*TDCLVGTLNFNHNQKK--WRASRKLELIHSDICGPISPESNSHKR 341
           P LK+P + ++C+ G + + KK WRAS KLEL+HSDICGPI+PESN KR
Sbjct: 61  PKLKQPNEECSNCMKGKQHRKNVPKKSSWRASTKLELVHSDICGPINPESNGKKR 115

>dbj|GAU45313.1| hypothetical protein TSUD_300420 [Trifolium subterraneum]
          Length = 1329

 Score = 117 bits (293), Expect = 3e-28
 Identities = 61/111 (54%), Positives = 75/111 (67%), Gaps = 2/111 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LI+T++M +NR++ V S V+ CL +T +D LWH+RY HLS KGL LL K MV G
Sbjct: 450 LILTTEMTSNRIYIVHASVVMPKCLQMTKEDQFTLWHQRYAHLSSKGLKLLTDKNMVMGL 509

Query: 183 PALKEPVKP*TDCLVGTLNFNHNQK--KWRASRKLELIHSDICGPISPESN 329
           PALKE TDCL G + K WRAS+KLELIHSDICGPI+P+SN
Sbjct: 510 PALKEAEDKCTDCLSGKQHRESIPKLANWRASQKLELIHSDICGPINPKSN 560

>gb|PNX77239.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 803

 Score = 114 bits (286), Expect = 3e-27
 Identities = 58/115 (50%), Positives = 77/115 (66%), Gaps = 2/115 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LI++++M NRM+ V + +I CL VT + ELWH+RY HLS KGL +LN+K MV+G
Sbjct: 153 LILSTEMTMNRMYIVRATVIIPNCLQVTKAEETELWHKRYAHLSIKGLRVLNKKHMVKGL 212

Query: 183 PALKEPVKP*TDCLVGTLNFNH--NQKKWRASRKLELIHSDICGPISPESNSHKR 341
           P L++ + TDCL G + + Q WRAS LELIHSDICGPI+P+SN R
Sbjct: 213 PELRDTEEKCTDCLSGKQHRENMPKQANWRASEILELIHSDICGPITPKSNGGNR 267

>dbj|GAU15733.1| hypothetical protein TSUD_235460 [Trifolium subterraneum]
          Length = 1067

 Score = 114 bits (286), Expect = 3e-27
 Identities = 57/115 (49%), Positives = 76/115 (66%), Gaps = 2/115 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LI +++M NRM+T+ +I CL +++D +LWH RYGHLS GL LL+ K MV+G
Sbjct: 388 LIFSTQMSANRMYTLTAPVMIPMCLQTSNEDQTQLWHNRYGHLSVNGLKLLSSKDMVKGL 447

Query: 183 PALKEPVKP*TDCLVGTLNFN--HNQKKWRASRKLELIHSDICGPISPESNSHKR 341
           PA++E + DCL+G Q KWRA+ KL+LIHSDICGPI+P SN KR
Sbjct: 448 PAIREMSERCIDCLIGKQQREVIPKQAKWRANTKLQLIHSDICGPINPCSNGGKR 502

>gb|PNX83704.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 598

 Score = 114 bits (285), Expect = 3e-27
 Identities = 59/117 (50%), Positives = 77/117 (65%), Gaps = 4/117 (3%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSD--DLAELWHRRYGHLSQKGLMLLNQKGMVE 176
           LIM + M NRMF + ++ C+ +S+ D A LWH+RYGHLS KG+ +L KGMV
Sbjct: 409 LIMATHMTMNRMFVIKAPVIVPHCMNTSSNNHDNANLWHQRYGHLSFKGMNVLAHKGMVI 468

Query: 177 GFPALKEPVKP*TDCLVGTLNFNHNQKK--WRASRKLELIHSDICGPISPESNSHKR 341
           G P LK+P + ++C+ G + KK WRAS KLEL+HSDICGPI+PESN KR
Sbjct: 469 GLPKLKQPDEECSNCMKGKQQRKNVPKKSSWRASTKLELVHSDICGPINPESNGRKR 525

>gb|PNX77752.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 736

 Score = 114 bits (284), Expect = 5e-27
 Identities = 58/115 (50%), Positives = 74/115 (64%), Gaps = 2/115 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LIM + M NRMF + ++ C+ ++ D + LWH+RYGHLS KG+ +L QK MV G
Sbjct: 164 LIMATTMTFNRMFVIKAPVIVPQCMKISGVDDSTLWHQRYGHLSFKGMNVLTQKQMVIGL 223

Query: 183 PALKEPVKP*TDCLVGTLNFNHNQKK--WRASRKLELIHSDICGPISPESNSHKR 341
           P LKE + T+C+ G KK WRAS KLEL+HSDICGPI+PESN KR
Sbjct: 224 PKLKESNEKCTNCMKGKQQKQSAPKKSSWRASTKLELVHSDICGPINPESNGKKR 278

>gb|PNX95763.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1327

 Score = 114 bits (284), Expect = 5e-27
 Identities = 60/116 (51%), Positives = 76/116 (65%), Gaps = 4/116 (3%)
 Frame = +3

Query: 6   IMTSKMENNRMFTVVGS--SVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEG 179
           IM S M NRMF + S + C+ V +DD + LWH R+GHL+ KGL L +GMVEG
Sbjct: 396 IMHSDMSGNRMFYFLAKMVSTLPKCMQVEADDESRLWHSRFGHLNYKGLRTLAYRGMVEG 455

Query: 180 FPALKEPVKP*TDCLVGTLNFNHNQKK--WRASRKLELIHSDICGPISPESNSHKR 341
           P +K P K T CLVG + + K+ WRA+ KL+L+HSDICGPISP SNS+KR
Sbjct: 456 LPTVKTPQKLCTHCLVGKQHRDPIPKRNLWRATHKLQLVHSDICGPISPISNSNKR 511

>dbj|GAU31303.1| hypothetical protein TSUD_315120 [Trifolium subterraneum]
          Length = 1229

 Score = 113 bits (283), Expect = 7e-27
 Identities = 56/115 (48%), Positives = 76/115 (66%), Gaps = 2/115 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LI +++M NRM+ + S ++ CL ++ +D +LWH RYGH+S + L LLN K MV+G
Sbjct: 387 LIFSTQMSTNRMYILTTSVIVPMCLQISKEDKNQLWHNRYGHISMQRLKLLNSKDMVKGL 446

Query: 183 PALKEPVKP*TDCLVGTLNFN--HNQKKWRASRKLELIHSDICGPISPESNSHKR 341
           PAL+E + TDCL+G Q KWRA+ KL+LIHSDIC PI+P SN KR
Sbjct: 447 PALEEMDEKCTDCLIGKQQREAIPKQAKWRATTKLQLIHSDICEPINPCSNGGKR 501

>gb|PNX69396.1| ubiquitin carboxyl-terminal hydrolase, partial [Trifolium pratense]
          Length = 149

 Score = 105 bits (263), Expect = 8e-27
 Identities = 54/111 (48%), Positives = 70/111 (63%), Gaps = 2/111 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           L+ TS M NRM+ + ++ CL + +LWH RYGHLS KGL L++K MV G
Sbjct: 22  LLFTSHMSKNRMYVITTPVIMPMCLKTAKQESTQLWHDRYGHLSFKGLNTLSKKQMVIGL 81

Query: 183 PALKEPVKP*TDCLVGTLNFN--HNQKKWRASRKLELIHSDICGPISPESN 329
           P L++ + +DCL G + + Q WRAS KLELIHSDICGPISP+SN
Sbjct: 82  PELEDSDENCSDCLTGKQHRDIIPKQANWRASVKLELIHSDICGPISPQSN 132

>gb|PNY03001.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 878

 Score = 113 bits (282), Expect = 1e-26
 Identities = 56/115 (48%), Positives = 74/115 (64%), Gaps = 2/115 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LIM + M NRMF + ++ C+ ++ + LWH+RYGHLS KG+ L+QK MV G
Sbjct: 154 LIMATHMTQNRMFVIKAPVIVPHCMKISKANDTALWHQRYGHLSFKGMNALHQKEMVIGL 213

Query: 183 PALKEPVKP*TDCLVGTLNFNHNQKK--WRASRKLELIHSDICGPISPESNSHKR 341
           P LKEP + ++C+ G KK WRAS KL+L+HSDICGPI+PESN KR
Sbjct: 214 PKLKEPDEKCSNCMKGKQQKQSAPKKSSWRASSKLQLVHSDICGPINPESNGKKR 268

>dbj|GAU43961.1| hypothetical protein TSUD_283880 [Trifolium subterraneum]
          Length = 1273

 Score = 112 bits (281), Expect = 1e-26
 Identities = 57/115 (49%), Positives = 74/115 (64%), Gaps = 2/115 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LI +++M N M+ + ++ CL + +D +LWH RYGHLS KGL +LN K MV+G
Sbjct: 388 LIFSTQMSANIMYILAAPVIVPMCLQTSIEDKTQLWHNRYGHLSVKGLKILNSKDMVKGL 447

Query: 183 PALKEPVKP*TDCLVGTLNFN--HNQKKWRASRKLELIHSDICGPISPESNSHKR 341
           PAL E + TDCL+G Q KWRA+ KL+LIHSDICGPI+P SN KR
Sbjct: 448 PALGEMNERCTDCLIGKQQREVIPKQAKWRATTKLQLIHSDICGPINPCSNGGKR 502

>gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]
          Length = 1328

 Score = 112 bits (281), Expect = 1e-26
 Identities = 59/112 (52%), Positives = 75/112 (66%), Gaps = 2/112 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LIM++ M NRMF + + ++ CL T++ ++LWH+RYGHLS KGL L +K MV G
Sbjct: 395 LIMSTPMSANRMFVIKATVLVPMCLQTTNEIDSQLWHKRYGHLSYKGLNTLVKKEMVRGL 454

Query: 183 PALKEPVKP*TDCLVGTLNFNHNQKK--WRASRKLELIHSDICGPISPESNS 332
           PALKE +DCL G + KK WRA+ KLELIHSDICGPI+P SNS
Sbjct: 455 PALKEASDVCSDCLFGKQHREVIPKKVNWRATHKLELIHSDICGPINPASNS 506

>gb|PNX99782.1| copia-type polyprotein [Trifolium pratense]
          Length = 912

 Score = 112 bits (279), Expect = 3e-26
 Identities = 58/115 (50%), Positives = 76/115 (66%), Gaps = 2/115 (1%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSVISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVEGF 182
           LI+T+KM N+M+ V S ++ CL T+ + LWH+RY HLS +GL L K MV+G
Sbjct: 388 LILTTKMTFNKMYIVKASMILPNCLQATALEETTLWHQRYAHLSFQGLKTLITKQMVKGL 447

Query: 183 PALKEPVKP*TDCLVGTLNFNHNQKK--WRASRKLELIHSDICGPISPESNSHKR 341
           P LKE TDCLVG + + K+ WRAS+KLEL+HSDICGPI+P+SN R
Sbjct: 448 PNLKESGDKCTDCLVGKQHRSSIPKEANWRASKKLELVHSDICGPINPQSNGGNR 502

>gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1302

 Score = 110 bits (276), Expect = 7e-26
 Identities = 59/117 (50%), Positives = 72/117 (61%), Gaps = 4/117 (3%)
 Frame = +3

Query: 3   LIMTSKMENNRMFTVVGSSV--ISGCLLVTSDDLAELWHRRYGHLSQKGLMLLNQKGMVE 176
           LIM +KM NRMF ++ + + S C S+D LWH RYGHLS KGL L + MV+
Sbjct: 394 LIMQTKMSANRMFVILANMLPQASACFQTVSEDNTHLWHCRYGHLSFKGLKTLQYRNMVK 453

Query: 177 GFPALKEPVKP*TDCLVGTLNFNHNQKK--WRASRKLELIHSDICGPISPESNSHKR 341
           G P K P K DC+VG + KK WRAS +L+LIHSDICGPI P SN+ KR
Sbjct: 454 GLPDFKMPSKLCKDCMVGKQHRESIPKKSMWRASHRLQLIHSDICGPIKPLSNNRKR 510