BLASTX nr result
ID: Astragalus22_contig00019119
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00019119
         (304 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

gb|KYP58223.1| Retrovirus-related Pol polyprotein from transposo...   132   3e-37
gb|PNX93789.1| copia-type polyprotein [Trifolium pratense]            134   2e-34
gb|KYP35468.1| Retrovirus-related Pol polyprotein from transposo...   131   8e-34
gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposo...   131   1e-33
gb|PNX95763.1| retrotransposon-related protein, partial [Trifoli...   130   7e-33
dbj|GAU26746.1| hypothetical protein TSUD_317440 [Trifolium subt...   129   1e-32
dbj|GAU22946.1| hypothetical protein TSUD_326740 [Trifolium subt...   128   3e-32
gb|KYP73784.1| Retrovirus-related Pol polyprotein from transposo...   123   1e-31
gb|PNX69396.1| ubiquitin carboxyl-terminal hydrolase, partial [T...   117   1e-31
emb|CAN74984.1| hypothetical protein VITISV_035210 [Vitis vinifera]   125   2e-31
gb|PNX58304.1| retrovirus-related Pol polyprotein from transposo...   115   4e-31
dbj|GAU15733.1| hypothetical protein TSUD_235460 [Trifolium subt...   125   4e-31
gb|KYP42102.1| Retrovirus-related Pol polyprotein from transposo...   124   5e-31
dbj|GAU12447.1| hypothetical protein TSUD_229810 [Trifolium subt...   124   6e-31
gb|KYP63625.1| Retrovirus-related Pol polyprotein from transposo...   124   1e-30
gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]            123   2e-30
dbj|GAU27929.1| hypothetical protein TSUD_160240 [Trifolium subt...
122 3e-30 gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinen... 122 3e-30 dbj|GAU42405.1| hypothetical protein TSUD_324600 [Trifolium subt... 122 3e-30 dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subte... 122 4e-30 >gb|KYP58223.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 164 Score = 132 bits (332), Expect = 3e-37 Identities = 58/97 (59%), Positives = 73/97 (75%) Frame = -1 Query: 292 PIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQ 113 P+ PTC V+ED + LWH RFGHLS+ LQ L QKGMV GLP +K P+K C +CLIG+Q Sbjct: 19 PVAPTCFNTVTEDVAQLWHCRFGHLSFKGLQTLQQKGMVEGLPMLKSPSKLCKDCLIGKQ 78 Query: 112 HREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 HR++ RS WRAS+ L+++H+DICGPI P SNS KR Sbjct: 79 HRDSFPMRSSWRASQILQLVHADICGPIKPISNSKKR 115 >gb|PNX93789.1| copia-type polyprotein [Trifolium pratense] Length = 1347 Score = 134 bits (338), Expect = 2e-34 Identities = 62/97 (63%), Positives = 75/97 (77%) Frame = -1 Query: 292 PIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQ 113 P TC Q VSE++S LWH RFGHLSY L+ L K MV GLPS++ P K C ECLIG+Q Sbjct: 417 PNESTCFQNVSENESYLWHCRFGHLSYQGLRTLFYKKMVNGLPSIQIPKKLCTECLIGKQ 476 Query: 112 HREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 HR++MSK+S WRAS KL+++H+DICGPI P SNSNKR Sbjct: 477 HRDSMSKKSLWRASNKLQLVHADICGPIKPESNSNKR 513 >gb|KYP35468.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 508 Score = 131 bits (329), Expect = 8e-34 Identities = 56/92 (60%), Positives = 75/92 (81%) Frame = -1 Query: 277 CLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHREAM 98 C+Q + D + +WH+R+GHLSYG L+LL+QK MV GLP +KE K C EC +G+QHR+A+ Sbjct: 84 CIQTSNVDSTEMWHKRYGHLSYGGLKLLNQKAMVKGLPELKEMDKVCPECAVGKQHRDAI 143 Query: 97 SKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 SK+S WRA+R+LE+IHSDICGP +P SNSN+R Sbjct: 144 SKQSTWRATRRLELIHSDICGPSTPTSNSNRR 175 >gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 582 Score = 131 bits (330), Expect = 
1e-33 Identities = 59/103 (57%), Positives = 79/103 (76%), Gaps = 2/103 (1%) Frame = -1 Query: 304 IMADPI--PPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAE 131 I A P+ C+Q + D + +WH+R+GHLSYG L+LL+QK MV GLP +KE K C E Sbjct: 402 IKAKPVITEAACIQTSNVDSTEMWHKRYGHLSYGGLKLLNQKAMVKGLPELKEMDKVCPE 461 Query: 130 CLIGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 C +G+QHR+A+SK+S WRA+R+LE+IHSDICGP +P SNSN+R Sbjct: 462 CAVGKQHRDAISKQSTWRATRRLELIHSDICGPSTPTSNSNRR 504 >gb|PNX95763.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 1327 Score = 130 bits (326), Expect = 7e-33 Identities = 56/94 (59%), Positives = 73/94 (77%) Frame = -1 Query: 283 PTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHRE 104 P C+Q ++D+S LWH RFGHL+Y L+ L +GMV GLP+VK P K C CL+G+QHR+ Sbjct: 418 PKCMQVEADDESRLWHSRFGHLNYKGLRTLAYRGMVEGLPTVKTPQKLCTHCLVGKQHRD 477 Query: 103 AMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 + KR+ WRA+ KL+++HSDICGPISP SNSNKR Sbjct: 478 PIPKRNLWRATHKLQLVHSDICGPISPISNSNKR 511 >dbj|GAU26746.1| hypothetical protein TSUD_317440 [Trifolium subterraneum] Length = 1608 Score = 129 bits (325), Expect = 1e-32 Identities = 59/94 (62%), Positives = 75/94 (79%), Gaps = 2/94 (2%) Frame = -1 Query: 277 CLQA--VSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHRE 104 CLQA +SE ++ LWH RFGHL+Y L L K MVIGLPS+K P K C CLIG+QHRE Sbjct: 422 CLQAEMMSEKETQLWHSRFGHLNYKGLNTLSNKKMVIGLPSLKSPKKICTTCLIGKQHRE 481 Query: 103 AMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 ++ K+S WRAS+KL+++H+DICGPI+P+SNSNKR Sbjct: 482 SIPKKSSWRASKKLQLVHADICGPITPSSNSNKR 515 >dbj|GAU22946.1| hypothetical protein TSUD_326740 [Trifolium subterraneum] Length = 1222 Score = 128 bits (321), Expect = 3e-32 Identities = 57/100 (57%), Positives = 73/100 (73%) Frame = -1 Query: 301 MADPIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLI 122 +A I P C + +++ LWH+R+GHLSY L +L K MV+GLP VKEPT C+ C+ Sbjct: 340 LAPVIVPKCFKTTHSNENQLWHQRYGHLSYKGLGVLANKKMVLGLPCVKEPTDKCSNCMK 399 Query: 121 GRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 G+QHREA+ KRS 
WRAS KLE++HSDICGPI+P SN KR Sbjct: 400 GKQHREAIPKRSLWRASAKLELVHSDICGPITPESNGKKR 439 >gb|KYP73784.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 395 Score = 123 bits (309), Expect = 1e-31 Identities = 57/99 (57%), Positives = 74/99 (74%), Gaps = 2/99 (2%) Frame = -1 Query: 292 PIPPTCLQA--VSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIG 119 P P CLQA VSE ++ LWH RFGHL+Y L L K MV+G+PS+K P C CL+G Sbjct: 10 PQHPLCLQAEDVSEKKTQLWHSRFGHLNYKGLSTLASKQMVLGIPSLKSPKTICTTCLVG 69 Query: 118 RQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 +QHR+++ +S WRAS KL++IH+DICGPISP+S+SNKR Sbjct: 70 KQHRDSIPMQSSWRASTKLQLIHADICGPISPSSHSNKR 108 >gb|PNX69396.1| ubiquitin carboxyl-terminal hydrolase, partial [Trifolium pratense] Length = 149 Score = 117 bits (293), Expect = 1e-31 Identities = 51/96 (53%), Positives = 70/96 (72%) Frame = -1 Query: 289 IPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQH 110 I P CL+ ++ + LWH R+GHLS+ L L +K MVIGLP +++ +NC++CL G+QH Sbjct: 41 IMPMCLKTAKQESTQLWHDRYGHLSFKGLNTLSKKQMVIGLPELEDSDENCSDCLTGKQH 100 Query: 109 REAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 R+ + K++ WRAS KLE+IHSDICGPISP SN R Sbjct: 101 RDIIPKQANWRASVKLELIHSDICGPISPQSNGGCR 136 >emb|CAN74984.1| hypothetical protein VITISV_035210 [Vitis vinifera] Length = 2408 Score = 125 bits (315), Expect = 2e-31 Identities = 53/94 (56%), Positives = 72/94 (76%) Frame = -1 Query: 283 PTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHRE 104 PTC Q + ED + LWH R+GHLS+ L+ L K MV GLP +K P+K C +C++G+QHR+ Sbjct: 357 PTCFQTILEDNTHLWHCRYGHLSFKGLRTLQYKQMVRGLPQLKAPSKICTDCMVGKQHRD 416 Query: 103 AMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 A+ KRS WRAS++L+++H+DICGPI P SNS KR Sbjct: 417 AIPKRSLWRASQRLQLVHADICGPIKPISNSKKR 450 >gb|PNX58304.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 132 Score = 115 bits (289), Expect = 4e-31 Identities = 54/104 (51%), Positives = 74/104 (71%), Gaps = 3/104 (2%) Frame = -1 Query: 304 
IMADPIPPTCLQAVSEDQSV---LWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCA 134 I A I P C++ S D +V LWH+R+GHLS+ + +L QK MV GLP +K+P + C+ Sbjct: 13 IKAPVIVPHCMKT-SNDSNVNANLWHQRYGHLSFKGMSVLVQKEMVTGLPKLKQPNEECS 71 Query: 133 ECLIGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 C+ G+QHR+ + K+S WRAS KLE++HSDICGPI+P SN KR Sbjct: 72 NCMKGKQHRKNVPKKSSWRASTKLELVHSDICGPINPESNGKKR 115 >dbj|GAU15733.1| hypothetical protein TSUD_235460 [Trifolium subterraneum] Length = 1067 Score = 125 bits (313), Expect = 4e-31 Identities = 54/94 (57%), Positives = 72/94 (76%) Frame = -1 Query: 283 PTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHRE 104 P CLQ +EDQ+ LWH R+GHLS L+LL K MV GLP+++E ++ C +CLIG+Q RE Sbjct: 409 PMCLQTSNEDQTQLWHNRYGHLSVNGLKLLSSKDMVKGLPAIREMSERCIDCLIGKQQRE 468 Query: 103 AMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 + K++KWRA+ KL++IHSDICGPI+P SN KR Sbjct: 469 VIPKQAKWRANTKLQLIHSDICGPINPCSNGGKR 502 >gb|KYP42102.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 538 Score = 124 bits (310), Expect = 5e-31 Identities = 55/97 (56%), Positives = 72/97 (74%) Frame = -1 Query: 292 PIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQ 113 PI TC + +ED + LWH RFGHLS+ LQ L QK MV GLP +K P+K C +C+ G+Q Sbjct: 414 PISSTCFKIATEDVAHLWHCRFGHLSFKGLQTLQQKEMVKGLPLLKSPSKLCKDCIAGKQ 473 Query: 112 HREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 HR++ +S WRAS+ L+++H+DICGPI PASNSNKR Sbjct: 474 HRDSFPIKSSWRASQILQLVHADICGPIKPASNSNKR 510 >dbj|GAU12447.1| hypothetical protein TSUD_229810 [Trifolium subterraneum] Length = 1102 Score = 124 bits (312), Expect = 6e-31 Identities = 57/101 (56%), Positives = 72/101 (71%) Frame = -1 Query: 304 IMADPIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECL 125 I A I P C +A +++ +WH+R+GHLSY L +L K MVIGLP VKEPT C+ C+ Sbjct: 288 IRAPLIVPKCFKASHSNENQIWHQRYGHLSYKGLGVLANKKMVIGLPCVKEPTDKCSNCM 347 Query: 124 IGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 G+QHREA+ K WRAS KLE++HSDICGPI+P SN KR Sbjct: 348 KGKQHREAIPKMRLWRASAKLELVHSDICGPITPESNGKKR 388 
>gb|KYP63625.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 930 Score = 124 bits (310), Expect = 1e-30 Identities = 55/97 (56%), Positives = 71/97 (73%) Frame = -1 Query: 292 PIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQ 113 P+ PTC V+ED + LWH RFGHLS+ LQ L QKGMV GL +K P+K C + LIG+Q Sbjct: 392 PVAPTCFNTVTEDVAQLWHCRFGHLSFKGLQTLQQKGMVEGLAMLKPPSKLCKDYLIGKQ 451 Query: 112 HREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 HR++ +S WRAS+ L+++H+DICGPI P SNS KR Sbjct: 452 HRDSFPMKSSWRASQILQLVHADICGPIKPVSNSKKR 488 >gb|PNX95204.1| copia-type polyprotein [Trifolium pratense] Length = 1328 Score = 123 bits (308), Expect = 2e-30 Identities = 56/101 (55%), Positives = 73/101 (72%) Frame = -1 Query: 304 IMADPIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECL 125 I A + P CLQ +E S LWH+R+GHLSY L L +K MV GLP++KE + C++CL Sbjct: 409 IKATVLVPMCLQTTNEIDSQLWHKRYGHLSYKGLNTLVKKEMVRGLPALKEASDVCSDCL 468 Query: 124 IGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 G+QHRE + K+ WRA+ KLE+IHSDICGPI+PASNS + Sbjct: 469 FGKQHREVIPKKVNWRATHKLELIHSDICGPINPASNSGNK 509 >dbj|GAU27929.1| hypothetical protein TSUD_160240 [Trifolium subterraneum] Length = 1197 Score = 122 bits (307), Expect = 3e-30 Identities = 54/101 (53%), Positives = 77/101 (76%) Frame = -1 Query: 304 IMADPIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECL 125 I+A I CLQ +ED++ LWH R+GHLS L+LL++K MV GLP+++E + C +CL Sbjct: 332 IVAPVIVLMCLQTTNEDKTQLWHHRYGHLSVKGLKLLNKKDMVKGLPALRELNEKCTDCL 391 Query: 124 IGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 +G+QHRE + K++KWRA+ KL++IH DICGPI+P+SN KR Sbjct: 392 MGKQHREVIPKQAKWRATAKLQLIHYDICGPINPSSNGGKR 432 >gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1302 Score = 122 bits (307), Expect = 3e-30 Identities = 55/103 (53%), Positives = 75/103 (72%), Gaps = 2/103 (1%) Frame = -1 Query: 304 IMADPIPPT--CLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAE 131 I+A+ +P C Q VSED + LWH R+GHLS+ L+ L + MV GLP K P+K C + Sbjct: 408 
ILANMLPQASACFQTVSEDNTHLWHCRYGHLSFKGLKTLQYRNMVKGLPDFKMPSKLCKD 467 Query: 130 CLIGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 C++G+QHRE++ K+S WRAS +L++IHSDICGPI P SN+ KR Sbjct: 468 CMVGKQHRESIPKKSMWRASHRLQLIHSDICGPIKPLSNNRKR 510 >dbj|GAU42405.1| hypothetical protein TSUD_324600 [Trifolium subterraneum] Length = 1302 Score = 122 bits (307), Expect = 3e-30 Identities = 56/92 (60%), Positives = 71/92 (77%) Frame = -1 Query: 277 CLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHREAM 98 CLQA +ED + LWH R+GHL+ LQ L QK MVIGLP +E CA+CL G+QHRE++ Sbjct: 413 CLQANTEDITQLWHCRYGHLNIKGLQNLQQKNMVIGLPKFEESNHVCADCLRGKQHRESI 472 Query: 97 SKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 K S W++S++LE+IHSDICGPI+P SNSNKR Sbjct: 473 PKTSNWKSSKRLELIHSDICGPITPVSNSNKR 504 >dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subterraneum] Length = 1172 Score = 122 bits (306), Expect = 4e-30 Identities = 52/92 (56%), Positives = 67/92 (72%) Frame = -1 Query: 277 CLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHREAM 98 C + D+ LWH+R+GHLS+ + +L QK MVIGLP +KEPT+ C C+ G+Q RE + Sbjct: 401 CFKITKNDEDTLWHKRYGHLSFKGINVLVQKNMVIGLPKLKEPTEKCTHCMKGKQQRENV 460 Query: 97 SKRSKWRASRKLEMIHSDICGPISPASNSNKR 2 K+S WRAS KLE+IHSDICGPI+P SN KR Sbjct: 461 PKKSHWRASHKLELIHSDICGPINPESNGKKR 492