BLASTX nr result
ID: Astragalus23_contig00023045
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00023045 (352 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

dbj|GAU13723.1| hypothetical protein TSUD_348270 [Trifolium subt...  145   7e-38
dbj|GAU39416.1| hypothetical protein TSUD_323640 [Trifolium subt...  142   6e-37
dbj|GAU23220.1| hypothetical protein TSUD_172480 [Trifolium subt...  142   6e-37
gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense]  141   2e-36
gb|KYP58223.1| Retrovirus-related Pol polyprotein from transposo...  128   2e-35
dbj|GAU45181.1| hypothetical protein TSUD_178740 [Trifolium subt...  137   4e-35
gb|PNX94698.1| copia-type polyprotein [Trifolium pratense]           136   7e-35
gb|PNX74620.1| putative LRR receptor-like protein kinase, partia...  136   9e-35
dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subt...  135   1e-34
gb|PNX77239.1| copia-type polyprotein, partial [Trifolium pratense]  135   2e-34
gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]  134   3e-34
gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]           134   3e-34
gb|PNX69396.1| ubiquitin carboxyl-terminal hydrolase, partial [T...  124   3e-34
emb|CAN74984.1| hypothetical protein VITISV_035210 [Vitis vinifera]  134   4e-34
gb|PNX61303.1| copia-type polyprotein, partial [Trifolium pratense]  127   1e-33
dbj|GAU36545.1| hypothetical protein TSUD_277510 [Trifolium subt...  132   1e-33
gb|PRQ38021.1| putative RNA-directed DNA polymerase [Rosa chinen...
132 2e-33 gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinen... 132 3e-33 dbj|GAU37486.1| hypothetical protein TSUD_275380 [Trifolium subt... 132 3e-33 gb|PRQ42077.1| putative RNA-directed DNA polymerase [Rosa chinen... 131 4e-33 >dbj|GAU13723.1| hypothetical protein TSUD_348270 [Trifolium subterraneum] Length = 1117 Score = 145 bits (365), Expect = 7e-38 Identities = 63/112 (56%), Positives = 83/112 (74%) Frame = +3 Query: 15 EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194 + F + KGLI+ + M+ NRM+ I V+LP C+ VT +D LWHCRY HL KGL LA Sbjct: 383 KIFHEGKGLIVTTQMTVNRMYIILAPVMLPTCFKVTNKDEGHLWHCRYGHLSFKGLNTLA 442 Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 +++MV+GLP +KD + VCSDC+V KQHR FP+ +SWRAT KL+LVH+DICG Sbjct: 443 KREMVKGLPMVKDNQTVCSDCVVSKQHRDTFPKNASWRATSKLELVHSDICG 494 >dbj|GAU39416.1| hypothetical protein TSUD_323640 [Trifolium subterraneum] Length = 1056 Score = 142 bits (358), Expect = 6e-37 Identities = 60/112 (53%), Positives = 86/112 (76%) Frame = +3 Query: 15 EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194 + F ++ GL++ + M+ NRM+ I V+LP C+ VT +D S LWHCRY++L KGL ALA Sbjct: 329 KIFHEEMGLMVTTQMTVNRMYIILAPVMLPSCFKVTNKDESHLWHCRYSNLSFKGLNALA 388 Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 +++MV+GLP +KD + VCSDC+V KQHR FP+ ++WRAT KL+L+H+DICG Sbjct: 389 KREMVKGLPMVKDNQTVCSDCVVSKQHRDTFPKSTTWRATSKLELIHSDICG 440 >dbj|GAU23220.1| hypothetical protein TSUD_172480 [Trifolium subterraneum] Length = 1323 Score = 142 bits (358), Expect = 6e-37 Identities = 62/112 (55%), Positives = 83/112 (74%) Frame = +3 Query: 15 EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194 + F + KGLI+ + M+ NRM+ I V+LP C+ V+ +D LWHCRY HL KGL LA Sbjct: 387 KIFHEGKGLIVTTQMTVNRMYIILAPVMLPACFKVSNQDEGHLWHCRYGHLSFKGLNTLA 446 Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 +++MV+GLP +KD + VCSDC+V KQHR FP+ +SWRAT KL+LVH+DICG Sbjct: 447 KREMVKGLPMVKDDQTVCSDCVVSKQHRDTFPKIASWRATSKLELVHSDICG 
498 >gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense] Length = 1062 Score = 141 bits (355), Expect = 2e-36 Identities = 61/110 (55%), Positives = 88/110 (80%) Frame = +3 Query: 21 FTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQ 200 + +++G+IMQ M++NRM+ I V VV+P C+ VT ED + LWHCRY +L KGL+ L Q+ Sbjct: 411 YHQERGVIMQCKMTANRMYVIMVDVVIPTCFKVTNEDVTYLWHCRYGYLSQKGLKILEQK 470 Query: 201 DMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 +MVRGLP+++D+ VCSDC++GKQHR+PF + S+ RATK+LQL+HAD+ G Sbjct: 471 NMVRGLPKLQDSSNVCSDCMIGKQHREPFLKVSTRRATKRLQLIHADVFG 520 >gb|KYP58223.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 164 Score = 128 bits (322), Expect = 2e-35 Identities = 60/104 (57%), Positives = 76/104 (73%), Gaps = 2/104 (1%) Frame = +3 Query: 45 MQSLMSSNRMFAI-CVSV-VLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQDMVRGL 218 MQS MSSNRMF + +S+ V P C++ ED ++LWHCR+ HL KGL+ L Q+ MV GL Sbjct: 1 MQSNMSSNRMFILHAISLPVAPTCFNTVTEDVAQLWHCRFGHLSFKGLQTLQQKGMVEGL 60 Query: 219 PEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 P +K +C DCL+GKQHR FP +SSWRA++ LQLVHADICG Sbjct: 61 PMLKSPSKLCKDCLIGKQHRDSFPMRSSWRASQILQLVHADICG 104 >dbj|GAU45181.1| hypothetical protein TSUD_178740 [Trifolium subterraneum] Length = 940 Score = 137 bits (345), Expect = 4e-35 Identities = 61/107 (57%), Positives = 83/107 (77%) Frame = +3 Query: 30 KKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQDMV 209 +KGLI + +++NRM+ + SVVLP+C V G D S LWH RYAHL +KGL+ L++ +MV Sbjct: 235 EKGLIFTTQITANRMYIVFASVVLPKCLQVRGVDESHLWHHRYAHLNIKGLKILSKNNMV 294 Query: 210 RGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 +GL E+KD + C DCL GKQHR FP++SSWRA++KL+LVH+DICG Sbjct: 295 KGLLELKDIEGQCGDCLAGKQHRDNFPKKSSWRASQKLELVHSDICG 341 >gb|PNX94698.1| copia-type polyprotein [Trifolium pratense] Length = 1324 Score = 136 bits (343), Expect = 7e-35 Identities = 57/110 (51%), Positives = 82/110 (74%) Frame = +3 Query: 21 FTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQ 200 F 
+++GLIM + M++NRM+ I V+LP C + S LWHCRY HL KGL L ++ Sbjct: 389 FHEQRGLIMTTRMTANRMYVISAPVILPMCLKTEKQVNSHLWHCRYGHLSFKGLNTLVKR 448 Query: 201 DMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 +MV+GLP++++ + CSDC++GKQHR P+Q++WRATKKL+LVH+DICG Sbjct: 449 NMVKGLPQLQEIETNCSDCMIGKQHRDSIPKQANWRATKKLELVHSDICG 498 >gb|PNX74620.1| putative LRR receptor-like protein kinase, partial [Trifolium pratense] Length = 814 Score = 136 bits (342), Expect = 9e-35 Identities = 58/112 (51%), Positives = 86/112 (76%) Frame = +3 Query: 15 EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194 + F ++KGLI+ + M++N+M+ I V+ P C +T ++ ++LWH RYAHL LKGL+ L Sbjct: 107 QLFHEEKGLIISTAMTTNKMYIINAPVITPNCLQMTKDEETDLWHKRYAHLSLKGLKVLT 166 Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 ++MV+GLPE+KD + CSDCL GKQHR P+Q++WRA++KL+LVH+DICG Sbjct: 167 GKNMVKGLPELKDNEEKCSDCLSGKQHRDNIPKQTNWRASQKLELVHSDICG 218 >dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subterraneum] Length = 1322 Score = 135 bits (341), Expect = 1e-34 Identities = 57/107 (53%), Positives = 80/107 (74%) Frame = +3 Query: 30 KKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQDMV 209 ++GLIM + MS+NRM+ I V++P C D +ELWHCRY HL KGL L ++DMV Sbjct: 385 QRGLIMATKMSANRMYIIYAPVIIPMCLKTVKMDNNELWHCRYGHLSFKGLNTLVKKDMV 444 Query: 210 RGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 RGLP++++T C++C+ GKQHR+ P+ S+WRA+KKL+LVH+DICG Sbjct: 445 RGLPQLQETTENCTNCMTGKQHREAIPKSSNWRASKKLELVHSDICG 491 >gb|PNX77239.1| copia-type polyprotein, partial [Trifolium pratense] Length = 803 Score = 135 bits (339), Expect = 2e-34 Identities = 57/112 (50%), Positives = 86/112 (76%) Frame = +3 Query: 15 EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194 + F + KGLI+ + M+ NRM+ + +V++P C VT + +ELWH RYAHL +KGLR L Sbjct: 145 QLFHEDKGLILSTEMTMNRMYIVRATVIIPNCLQVTKAEETELWHKRYAHLSIKGLRVLN 204 Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 ++ MV+GLPE++DT+ C+DCL GKQHR+ P+Q++WRA++ L+L+H+DICG Sbjct: 205 
KKHMVKGLPELRDTEEKCTDCLSGKQHRENMPKQANWRASEILELIHSDICG 256 >gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense] Length = 886 Score = 134 bits (338), Expect = 3e-34 Identities = 60/110 (54%), Positives = 80/110 (72%) Frame = +3 Query: 21 FTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQ 200 F K GLI+ S MS+NRMF I S++ P C ++ + S LWHCRYAHL KGL L ++ Sbjct: 386 FHDKWGLIITSDMSANRMFIIQASIISPMCLKISKDSQSHLWHCRYAHLSFKGLNTLVKK 445 Query: 201 DMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 DMV+GLP +++T VCSDC GKQ R+ P+ ++WRA++KLQLVH+DICG Sbjct: 446 DMVKGLPTLQETDEVCSDCATGKQSREAIPKSNNWRASEKLQLVHSDICG 495 >gb|PNX95204.1| copia-type polyprotein [Trifolium pratense] Length = 1328 Score = 134 bits (338), Expect = 3e-34 Identities = 60/110 (54%), Positives = 81/110 (73%) Frame = +3 Query: 21 FTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQ 200 F +++GLIM + MS+NRMF I +V++P C T E S+LWH RY HL KGL L ++ Sbjct: 389 FHEERGLIMSTPMSANRMFVIKATVLVPMCLQTTNEIDSQLWHKRYGHLSYKGLNTLVKK 448 Query: 201 DMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 +MVRGLP +K+ VCSDCL GKQHR+ P++ +WRAT KL+L+H+DICG Sbjct: 449 EMVRGLPALKEASDVCSDCLFGKQHREVIPKKVNWRATHKLELIHSDICG 498 >gb|PNX69396.1| ubiquitin carboxyl-terminal hydrolase, partial [Trifolium pratense] Length = 149 Score = 124 bits (312), Expect = 3e-34 Identities = 53/106 (50%), Positives = 76/106 (71%) Frame = +3 Query: 33 KGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQDMVR 212 +GL+ S MS NRM+ I V++P C +++++LWH RY HL KGL L+++ MV Sbjct: 20 RGLLFTSHMSKNRMYVITTPVIMPMCLKTAKQESTQLWHDRYGHLSFKGLNTLSKKQMVI 79 Query: 213 GLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 GLPE++D+ CSDCL GKQHR P+Q++WRA+ KL+L+H+DICG Sbjct: 80 GLPELEDSDENCSDCLTGKQHRDIIPKQANWRASVKLELIHSDICG 125 >emb|CAN74984.1| hypothetical protein VITISV_035210 [Vitis vinifera] Length = 2408 Score = 134 bits (337), Expect = 4e-34 Identities = 60/109 (55%), Positives = 78/109 (71%), Gaps = 2/109 (1%) Frame = +3 Query: 30 
KKGLIMQSLMSSNRMFAICVSVV--LPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQD 203 KKGLIMQ+ MS+ RMF + ++ P C+ ED + LWHCRY HL KGLR L + Sbjct: 331 KKGLIMQTAMSTKRMFILSARILSKAPTCFQTILEDNTHLWHCRYGHLSFKGLRTLQYKQ 390 Query: 204 MVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 MVRGLP++K +C+DC+VGKQHR P++S WRA+++LQLVHADICG Sbjct: 391 MVRGLPQLKAPSKICTDCMVGKQHRDAIPKRSLWRASQRLQLVHADICG 439 >gb|PNX61303.1| copia-type polyprotein, partial [Trifolium pratense] Length = 298 Score = 127 bits (319), Expect = 1e-33 Identities = 52/112 (46%), Positives = 79/112 (70%) Frame = +3 Query: 15 EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194 + F ++KGLI+ + M++NRM+ + V++P+C ED +WHCRY HL KGL LA Sbjct: 54 KIFHEEKGLIISTPMTANRMYVLLAPVMMPQCLVAKHEDIEHIWHCRYGHLNFKGLVTLA 113 Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 ++ MV+GLP +KD+ +C DC++ K HR P+ +SWRA+ KL+L+H+DICG Sbjct: 114 KRTMVKGLPILKDSAELCPDCVISKHHRDSIPKTASWRASSKLELIHSDICG 165 >dbj|GAU36545.1| hypothetical protein TSUD_277510 [Trifolium subterraneum] Length = 1139 Score = 132 bits (333), Expect = 1e-33 Identities = 58/106 (54%), Positives = 77/106 (72%) Frame = +3 Query: 33 KGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQDMVR 212 KGL+ + MS+N+M+ I VV+P+C + EDTS+LWH RY HL +KGL L + DMVR Sbjct: 346 KGLLFATHMSANKMYVIKALVVIPKCLQASKEDTSQLWHMRYGHLSIKGLNTLVKMDMVR 405 Query: 213 GLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 GLP+++D C DCL GKQHR+ P+Q+ WRA+ KL LVH+DICG Sbjct: 406 GLPDLEDFSEKCIDCLTGKQHREVIPKQAKWRASVKLDLVHSDICG 451 >gb|PRQ38021.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 719 Score = 132 bits (332), Expect = 2e-33 Identities = 61/115 (53%), Positives = 82/115 (71%), Gaps = 3/115 (2%) Frame = +3 Query: 15 EFFTKKKGLIMQSLMSSNRMFAICVSVVLPE---CYSVTGEDTSELWHCRYAHLYLKGLR 185 + + KKGLIMQ+ M++NRMF + +VV+ + C + +D S LWHCRY+HL KGL+ Sbjct: 398 KIYHSKKGLIMQTPMTANRMFVLLANVVVTDFSTCMQASSDDLSHLWHCRYSHLNYKGLK 457 Query: 186 ALAQQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 L + MV+GLP+IK + 
VC DCLVGKQ R P+ S WRA+++LQLVHADICG Sbjct: 458 TLHYRKMVKGLPQIKASARVCHDCLVGKQSRDSIPKSSQWRASQRLQLVHADICG 512 >gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1316 Score = 132 bits (331), Expect = 3e-33 Identities = 59/115 (51%), Positives = 83/115 (72%), Gaps = 3/115 (2%) Frame = +3 Query: 15 EFFTKKKGLIMQSLMSSNRMFAICVSVVLPE---CYSVTGEDTSELWHCRYAHLYLKGLR 185 + + +KGLIMQ+ MS+NRMF I ++ LP+ C+ ED + LWHCRY HL KGLR Sbjct: 388 QIYHPRKGLIMQTKMSANRMFVIRANM-LPQASACFQTVSEDNTHLWHCRYGHLSFKGLR 446 Query: 186 ALAQQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 +L + MV+GLP+ K + +C DC+VGKQHR+ P++S WRA+ +LQL+H+DICG Sbjct: 447 SLQYRKMVKGLPDFKMSSKLCKDCMVGKQHRESIPKKSMWRASHRLQLIHSDICG 501 >dbj|GAU37486.1| hypothetical protein TSUD_275380 [Trifolium subterraneum] Length = 1421 Score = 132 bits (331), Expect = 3e-33 Identities = 56/109 (51%), Positives = 77/109 (70%) Frame = +3 Query: 21 FTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQ 200 + ++KGLIM + MSSNRM+ I V++P C+ D +ELWHCRY HL KGL L ++ Sbjct: 386 YHEEKGLIMSTKMSSNRMYVIFAPVIVPMCFKTVKMDNNELWHCRYDHLSFKGLNTLVKK 445 Query: 201 DMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADIC 347 +MV+GLP ++D + C CL GKQHR+ P+ S WRAT+ L+LVH+DIC Sbjct: 446 EMVKGLPHLQDMEDTCVSCLTGKQHREAIPKSSDWRATRPLELVHSDIC 494 >gb|PRQ42077.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 1044 Score = 131 bits (330), Expect = 4e-33 Identities = 59/115 (51%), Positives = 82/115 (71%), Gaps = 3/115 (2%) Frame = +3 Query: 15 EFFTKKKGLIMQSLMSSNRMFAICVSVVLPE---CYSVTGEDTSELWHCRYAHLYLKGLR 185 + + +KGLIMQ+ MS+NRMF I ++ LP+ C+ ED + LWHCRY HL KGLR Sbjct: 107 QIYHPRKGLIMQTKMSANRMFVIRANM-LPQASACFQTVSEDNTHLWHCRYGHLSFKGLR 165 Query: 186 ALAQQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350 L + MV+GLP+ K + +C DC+VGKQHR+ P++S WRA+ +LQL+H+DICG Sbjct: 166 TLQYRKMVKGLPDFKMSSKLCKDCMVGKQHRESIPKKSMWRASHRLQLIHSDICG 220