BLASTX nr result

ID: Astragalus23_contig00023045 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00023045
         (352 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU13723.1| hypothetical protein TSUD_348270 [Trifolium subt...   145   7e-38
dbj|GAU39416.1| hypothetical protein TSUD_323640 [Trifolium subt...   142   6e-37
dbj|GAU23220.1| hypothetical protein TSUD_172480 [Trifolium subt...   142   6e-37
gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense]   141   2e-36
gb|KYP58223.1| Retrovirus-related Pol polyprotein from transposo...   128   2e-35
dbj|GAU45181.1| hypothetical protein TSUD_178740 [Trifolium subt...   137   4e-35
gb|PNX94698.1| copia-type polyprotein [Trifolium pratense]            136   7e-35
gb|PNX74620.1| putative LRR receptor-like protein kinase, partia...   136   9e-35
dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subt...   135   1e-34
gb|PNX77239.1| copia-type polyprotein, partial [Trifolium pratense]   135   2e-34
gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]   134   3e-34
gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]            134   3e-34
gb|PNX69396.1| ubiquitin carboxyl-terminal hydrolase, partial [T...   124   3e-34
emb|CAN74984.1| hypothetical protein VITISV_035210 [Vitis vinifera]   134   4e-34
gb|PNX61303.1| copia-type polyprotein, partial [Trifolium pratense]   127   1e-33
dbj|GAU36545.1| hypothetical protein TSUD_277510 [Trifolium subt...   132   1e-33
gb|PRQ38021.1| putative RNA-directed DNA polymerase [Rosa chinen...   132   2e-33
gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinen...   132   3e-33
dbj|GAU37486.1| hypothetical protein TSUD_275380 [Trifolium subt...   132   3e-33
gb|PRQ42077.1| putative RNA-directed DNA polymerase [Rosa chinen...   131   4e-33

>dbj|GAU13723.1| hypothetical protein TSUD_348270 [Trifolium subterraneum]
          Length = 1117

 Score =  145 bits (365), Expect = 7e-38
 Identities = 63/112 (56%), Positives = 83/112 (74%)
 Frame = +3

Query: 15  EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194
           + F + KGLI+ + M+ NRM+ I   V+LP C+ VT +D   LWHCRY HL  KGL  LA
Sbjct: 383 KIFHEGKGLIVTTQMTVNRMYIILAPVMLPTCFKVTNKDEGHLWHCRYGHLSFKGLNTLA 442

Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           +++MV+GLP +KD + VCSDC+V KQHR  FP+ +SWRAT KL+LVH+DICG
Sbjct: 443 KREMVKGLPMVKDNQTVCSDCVVSKQHRDTFPKNASWRATSKLELVHSDICG 494


>dbj|GAU39416.1| hypothetical protein TSUD_323640 [Trifolium subterraneum]
          Length = 1056

 Score =  142 bits (358), Expect = 6e-37
 Identities = 60/112 (53%), Positives = 86/112 (76%)
 Frame = +3

Query: 15  EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194
           + F ++ GL++ + M+ NRM+ I   V+LP C+ VT +D S LWHCRY++L  KGL ALA
Sbjct: 329 KIFHEEMGLMVTTQMTVNRMYIILAPVMLPSCFKVTNKDESHLWHCRYSNLSFKGLNALA 388

Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           +++MV+GLP +KD + VCSDC+V KQHR  FP+ ++WRAT KL+L+H+DICG
Sbjct: 389 KREMVKGLPMVKDNQTVCSDCVVSKQHRDTFPKSTTWRATSKLELIHSDICG 440


>dbj|GAU23220.1| hypothetical protein TSUD_172480 [Trifolium subterraneum]
          Length = 1323

 Score =  142 bits (358), Expect = 6e-37
 Identities = 62/112 (55%), Positives = 83/112 (74%)
 Frame = +3

Query: 15  EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194
           + F + KGLI+ + M+ NRM+ I   V+LP C+ V+ +D   LWHCRY HL  KGL  LA
Sbjct: 387 KIFHEGKGLIVTTQMTVNRMYIILAPVMLPACFKVSNQDEGHLWHCRYGHLSFKGLNTLA 446

Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           +++MV+GLP +KD + VCSDC+V KQHR  FP+ +SWRAT KL+LVH+DICG
Sbjct: 447 KREMVKGLPMVKDDQTVCSDCVVSKQHRDTFPKIASWRATSKLELVHSDICG 498


>gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 1062

 Score =  141 bits (355), Expect = 2e-36
 Identities = 61/110 (55%), Positives = 88/110 (80%)
 Frame = +3

Query: 21  FTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQ 200
           + +++G+IMQ  M++NRM+ I V VV+P C+ VT ED + LWHCRY +L  KGL+ L Q+
Sbjct: 411 YHQERGVIMQCKMTANRMYVIMVDVVIPTCFKVTNEDVTYLWHCRYGYLSQKGLKILEQK 470

Query: 201 DMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           +MVRGLP+++D+  VCSDC++GKQHR+PF + S+ RATK+LQL+HAD+ G
Sbjct: 471 NMVRGLPKLQDSSNVCSDCMIGKQHREPFLKVSTRRATKRLQLIHADVFG 520


>gb|KYP58223.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 164

 Score =  128 bits (322), Expect = 2e-35
 Identities = 60/104 (57%), Positives = 76/104 (73%), Gaps = 2/104 (1%)
 Frame = +3

Query: 45  MQSLMSSNRMFAI-CVSV-VLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQDMVRGL 218
           MQS MSSNRMF +  +S+ V P C++   ED ++LWHCR+ HL  KGL+ L Q+ MV GL
Sbjct: 1   MQSNMSSNRMFILHAISLPVAPTCFNTVTEDVAQLWHCRFGHLSFKGLQTLQQKGMVEGL 60

Query: 219 PEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           P +K    +C DCL+GKQHR  FP +SSWRA++ LQLVHADICG
Sbjct: 61  PMLKSPSKLCKDCLIGKQHRDSFPMRSSWRASQILQLVHADICG 104


>dbj|GAU45181.1| hypothetical protein TSUD_178740 [Trifolium subterraneum]
          Length = 940

 Score =  137 bits (345), Expect = 4e-35
 Identities = 61/107 (57%), Positives = 83/107 (77%)
 Frame = +3

Query: 30  KKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQDMV 209
           +KGLI  + +++NRM+ +  SVVLP+C  V G D S LWH RYAHL +KGL+ L++ +MV
Sbjct: 235 EKGLIFTTQITANRMYIVFASVVLPKCLQVRGVDESHLWHHRYAHLNIKGLKILSKNNMV 294

Query: 210 RGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           +GL E+KD +  C DCL GKQHR  FP++SSWRA++KL+LVH+DICG
Sbjct: 295 KGLLELKDIEGQCGDCLAGKQHRDNFPKKSSWRASQKLELVHSDICG 341


>gb|PNX94698.1| copia-type polyprotein [Trifolium pratense]
          Length = 1324

 Score =  136 bits (343), Expect = 7e-35
 Identities = 57/110 (51%), Positives = 82/110 (74%)
 Frame = +3

Query: 21  FTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQ 200
           F +++GLIM + M++NRM+ I   V+LP C     +  S LWHCRY HL  KGL  L ++
Sbjct: 389 FHEQRGLIMTTRMTANRMYVISAPVILPMCLKTEKQVNSHLWHCRYGHLSFKGLNTLVKR 448

Query: 201 DMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           +MV+GLP++++ +  CSDC++GKQHR   P+Q++WRATKKL+LVH+DICG
Sbjct: 449 NMVKGLPQLQEIETNCSDCMIGKQHRDSIPKQANWRATKKLELVHSDICG 498


>gb|PNX74620.1| putative LRR receptor-like protein kinase, partial [Trifolium
           pratense]
          Length = 814

 Score =  136 bits (342), Expect = 9e-35
 Identities = 58/112 (51%), Positives = 86/112 (76%)
 Frame = +3

Query: 15  EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194
           + F ++KGLI+ + M++N+M+ I   V+ P C  +T ++ ++LWH RYAHL LKGL+ L 
Sbjct: 107 QLFHEEKGLIISTAMTTNKMYIINAPVITPNCLQMTKDEETDLWHKRYAHLSLKGLKVLT 166

Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
            ++MV+GLPE+KD +  CSDCL GKQHR   P+Q++WRA++KL+LVH+DICG
Sbjct: 167 GKNMVKGLPELKDNEEKCSDCLSGKQHRDNIPKQTNWRASQKLELVHSDICG 218


>dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subterraneum]
          Length = 1322

 Score =  135 bits (341), Expect = 1e-34
 Identities = 57/107 (53%), Positives = 80/107 (74%)
 Frame = +3

Query: 30  KKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQDMV 209
           ++GLIM + MS+NRM+ I   V++P C      D +ELWHCRY HL  KGL  L ++DMV
Sbjct: 385 QRGLIMATKMSANRMYIIYAPVIIPMCLKTVKMDNNELWHCRYGHLSFKGLNTLVKKDMV 444

Query: 210 RGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           RGLP++++T   C++C+ GKQHR+  P+ S+WRA+KKL+LVH+DICG
Sbjct: 445 RGLPQLQETTENCTNCMTGKQHREAIPKSSNWRASKKLELVHSDICG 491


>gb|PNX77239.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 803

 Score =  135 bits (339), Expect = 2e-34
 Identities = 57/112 (50%), Positives = 86/112 (76%)
 Frame = +3

Query: 15  EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194
           + F + KGLI+ + M+ NRM+ +  +V++P C  VT  + +ELWH RYAHL +KGLR L 
Sbjct: 145 QLFHEDKGLILSTEMTMNRMYIVRATVIIPNCLQVTKAEETELWHKRYAHLSIKGLRVLN 204

Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           ++ MV+GLPE++DT+  C+DCL GKQHR+  P+Q++WRA++ L+L+H+DICG
Sbjct: 205 KKHMVKGLPELRDTEEKCTDCLSGKQHRENMPKQANWRASEILELIHSDICG 256


>gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 886

 Score =  134 bits (338), Expect = 3e-34
 Identities = 60/110 (54%), Positives = 80/110 (72%)
 Frame = +3

Query: 21  FTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQ 200
           F  K GLI+ S MS+NRMF I  S++ P C  ++ +  S LWHCRYAHL  KGL  L ++
Sbjct: 386 FHDKWGLIITSDMSANRMFIIQASIISPMCLKISKDSQSHLWHCRYAHLSFKGLNTLVKK 445

Query: 201 DMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           DMV+GLP +++T  VCSDC  GKQ R+  P+ ++WRA++KLQLVH+DICG
Sbjct: 446 DMVKGLPTLQETDEVCSDCATGKQSREAIPKSNNWRASEKLQLVHSDICG 495


>gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]
          Length = 1328

 Score =  134 bits (338), Expect = 3e-34
 Identities = 60/110 (54%), Positives = 81/110 (73%)
 Frame = +3

Query: 21  FTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQ 200
           F +++GLIM + MS+NRMF I  +V++P C   T E  S+LWH RY HL  KGL  L ++
Sbjct: 389 FHEERGLIMSTPMSANRMFVIKATVLVPMCLQTTNEIDSQLWHKRYGHLSYKGLNTLVKK 448

Query: 201 DMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           +MVRGLP +K+   VCSDCL GKQHR+  P++ +WRAT KL+L+H+DICG
Sbjct: 449 EMVRGLPALKEASDVCSDCLFGKQHREVIPKKVNWRATHKLELIHSDICG 498


>gb|PNX69396.1| ubiquitin carboxyl-terminal hydrolase, partial [Trifolium pratense]
          Length = 149

 Score =  124 bits (312), Expect = 3e-34
 Identities = 53/106 (50%), Positives = 76/106 (71%)
 Frame = +3

Query: 33  KGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQDMVR 212
           +GL+  S MS NRM+ I   V++P C     +++++LWH RY HL  KGL  L+++ MV 
Sbjct: 20  RGLLFTSHMSKNRMYVITTPVIMPMCLKTAKQESTQLWHDRYGHLSFKGLNTLSKKQMVI 79

Query: 213 GLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           GLPE++D+   CSDCL GKQHR   P+Q++WRA+ KL+L+H+DICG
Sbjct: 80  GLPELEDSDENCSDCLTGKQHRDIIPKQANWRASVKLELIHSDICG 125


>emb|CAN74984.1| hypothetical protein VITISV_035210 [Vitis vinifera]
          Length = 2408

 Score =  134 bits (337), Expect = 4e-34
 Identities = 60/109 (55%), Positives = 78/109 (71%), Gaps = 2/109 (1%)
 Frame = +3

Query: 30  KKGLIMQSLMSSNRMFAICVSVV--LPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQD 203
           KKGLIMQ+ MS+ RMF +   ++   P C+    ED + LWHCRY HL  KGLR L  + 
Sbjct: 331 KKGLIMQTAMSTKRMFILSARILSKAPTCFQTILEDNTHLWHCRYGHLSFKGLRTLQYKQ 390

Query: 204 MVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           MVRGLP++K    +C+DC+VGKQHR   P++S WRA+++LQLVHADICG
Sbjct: 391 MVRGLPQLKAPSKICTDCMVGKQHRDAIPKRSLWRASQRLQLVHADICG 439


>gb|PNX61303.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 298

 Score =  127 bits (319), Expect = 1e-33
 Identities = 52/112 (46%), Positives = 79/112 (70%)
 Frame = +3

Query: 15  EFFTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALA 194
           + F ++KGLI+ + M++NRM+ +   V++P+C     ED   +WHCRY HL  KGL  LA
Sbjct: 54  KIFHEEKGLIISTPMTANRMYVLLAPVMMPQCLVAKHEDIEHIWHCRYGHLNFKGLVTLA 113

Query: 195 QQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           ++ MV+GLP +KD+  +C DC++ K HR   P+ +SWRA+ KL+L+H+DICG
Sbjct: 114 KRTMVKGLPILKDSAELCPDCVISKHHRDSIPKTASWRASSKLELIHSDICG 165


>dbj|GAU36545.1| hypothetical protein TSUD_277510 [Trifolium subterraneum]
          Length = 1139

 Score =  132 bits (333), Expect = 1e-33
 Identities = 58/106 (54%), Positives = 77/106 (72%)
 Frame = +3

Query: 33  KGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQDMVR 212
           KGL+  + MS+N+M+ I   VV+P+C   + EDTS+LWH RY HL +KGL  L + DMVR
Sbjct: 346 KGLLFATHMSANKMYVIKALVVIPKCLQASKEDTSQLWHMRYGHLSIKGLNTLVKMDMVR 405

Query: 213 GLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           GLP+++D    C DCL GKQHR+  P+Q+ WRA+ KL LVH+DICG
Sbjct: 406 GLPDLEDFSEKCIDCLTGKQHREVIPKQAKWRASVKLDLVHSDICG 451


>gb|PRQ38021.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 719

 Score =  132 bits (332), Expect = 2e-33
 Identities = 61/115 (53%), Positives = 82/115 (71%), Gaps = 3/115 (2%)
 Frame = +3

Query: 15  EFFTKKKGLIMQSLMSSNRMFAICVSVVLPE---CYSVTGEDTSELWHCRYAHLYLKGLR 185
           + +  KKGLIMQ+ M++NRMF +  +VV+ +   C   + +D S LWHCRY+HL  KGL+
Sbjct: 398 KIYHSKKGLIMQTPMTANRMFVLLANVVVTDFSTCMQASSDDLSHLWHCRYSHLNYKGLK 457

Query: 186 ALAQQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
            L  + MV+GLP+IK +  VC DCLVGKQ R   P+ S WRA+++LQLVHADICG
Sbjct: 458 TLHYRKMVKGLPQIKASARVCHDCLVGKQSRDSIPKSSQWRASQRLQLVHADICG 512


>gb|PRQ52345.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1316

 Score =  132 bits (331), Expect = 3e-33
 Identities = 59/115 (51%), Positives = 83/115 (72%), Gaps = 3/115 (2%)
 Frame = +3

Query: 15  EFFTKKKGLIMQSLMSSNRMFAICVSVVLPE---CYSVTGEDTSELWHCRYAHLYLKGLR 185
           + +  +KGLIMQ+ MS+NRMF I  ++ LP+   C+    ED + LWHCRY HL  KGLR
Sbjct: 388 QIYHPRKGLIMQTKMSANRMFVIRANM-LPQASACFQTVSEDNTHLWHCRYGHLSFKGLR 446

Query: 186 ALAQQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
           +L  + MV+GLP+ K +  +C DC+VGKQHR+  P++S WRA+ +LQL+H+DICG
Sbjct: 447 SLQYRKMVKGLPDFKMSSKLCKDCMVGKQHRESIPKKSMWRASHRLQLIHSDICG 501


>dbj|GAU37486.1| hypothetical protein TSUD_275380 [Trifolium subterraneum]
          Length = 1421

 Score =  132 bits (331), Expect = 3e-33
 Identities = 56/109 (51%), Positives = 77/109 (70%)
 Frame = +3

Query: 21  FTKKKGLIMQSLMSSNRMFAICVSVVLPECYSVTGEDTSELWHCRYAHLYLKGLRALAQQ 200
           + ++KGLIM + MSSNRM+ I   V++P C+     D +ELWHCRY HL  KGL  L ++
Sbjct: 386 YHEEKGLIMSTKMSSNRMYVIFAPVIVPMCFKTVKMDNNELWHCRYDHLSFKGLNTLVKK 445

Query: 201 DMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADIC 347
           +MV+GLP ++D +  C  CL GKQHR+  P+ S WRAT+ L+LVH+DIC
Sbjct: 446 EMVKGLPHLQDMEDTCVSCLTGKQHREAIPKSSDWRATRPLELVHSDIC 494


>gb|PRQ42077.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1044

 Score =  131 bits (330), Expect = 4e-33
 Identities = 59/115 (51%), Positives = 82/115 (71%), Gaps = 3/115 (2%)
 Frame = +3

Query: 15  EFFTKKKGLIMQSLMSSNRMFAICVSVVLPE---CYSVTGEDTSELWHCRYAHLYLKGLR 185
           + +  +KGLIMQ+ MS+NRMF I  ++ LP+   C+    ED + LWHCRY HL  KGLR
Sbjct: 107 QIYHPRKGLIMQTKMSANRMFVIRANM-LPQASACFQTVSEDNTHLWHCRYGHLSFKGLR 165

Query: 186 ALAQQDMVRGLPEIKDTKAVCSDCLVGKQHRKPFPQQSSWRATKKLQLVHADICG 350
            L  + MV+GLP+ K +  +C DC+VGKQHR+  P++S WRA+ +LQL+H+DICG
Sbjct: 166 TLQYRKMVKGLPDFKMSSKLCKDCMVGKQHRESIPKKSMWRASHRLQLIHSDICG 220


Top