BLASTX nr result

ID: Astragalus22_contig00038733 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00038733
         (304 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX79664.1| hypothetical protein L195_g035651 [Trifolium prat...   167   4e-47
gb|PNX92469.1| hypothetical protein L195_g015607 [Trifolium prat...   167   6e-47
gb|PNY07310.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   160   2e-43
gb|KYP63967.1| hypothetical protein KK1_018554 [Cajanus cajan]        145   1e-41
dbj|GAU40457.1| hypothetical protein TSUD_141370 [Trifolium subt...   153   4e-41
gb|KYP35193.1| hypothetical protein KK1_043782 [Cajanus cajan]        141   3e-40
dbj|GAU41450.1| hypothetical protein TSUD_98460 [Trifolium subte...   150   5e-40
gb|PNY17781.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   150   5e-40
ref|XP_017423676.1| PREDICTED: uncharacterized protein LOC108332...   147   2e-39
gb|KYP68937.1| Retrotransposon-derived protein PEG10 [Cajanus ca...   144   1e-38
dbj|GAU39167.1| hypothetical protein TSUD_147890 [Trifolium subt...   145   4e-38
gb|KYP42639.1| Retrovirus-related Pol polyprotein from transposo...   144   5e-38
ref|XP_014511429.1| uncharacterized protein LOC106770116 [Vigna ...   139   5e-36
ref|XP_017426291.1| PREDICTED: uncharacterized protein LOC108334...   137   2e-35
dbj|GAU39763.1| hypothetical protein TSUD_220060 [Trifolium subt...   136   3e-35
gb|PNX62323.1| hypothetical protein L195_g061091, partial [Trifo...   123   3e-34
gb|KHN02181.1| hypothetical protein glysoja_002205, partial [Gly...   122   7e-34
ref|XP_014630536.1| PREDICTED: uncharacterized protein LOC106798...   132   1e-33
gb|KYP33764.1| hypothetical protein KK1_045361 [Cajanus cajan]        125   1e-33
ref|XP_020232880.1| uncharacterized protein LOC109813157 [Cajanu...   121   7e-33

>gb|PNX79664.1| hypothetical protein L195_g035651 [Trifolium pratense]
          Length = 536

 Score =  167 bits (422), Expect = 4e-47
 Identities = 81/100 (81%), Positives = 89/100 (89%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           GAALAWYQWM+RN QI +W QFL +LE RFAPTAFDDPRGNLFKLTQ+TTV+AYLTEFEA
Sbjct: 94  GAALAWYQWMYRNRQIASWAQFLEKLETRFAPTAFDDPRGNLFKLTQSTTVSAYLTEFEA 153

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANRL GLS  DLLSCFISG K DVRREV++QQP +ISQA
Sbjct: 154 LANRLEGLSDVDLLSCFISGLKSDVRREVVAQQPTSISQA 193


>gb|PNX92469.1| hypothetical protein L195_g015607 [Trifolium pratense]
          Length = 566

 Score =  167 bits (422), Expect = 6e-47
 Identities = 77/100 (77%), Positives = 92/100 (92%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G AL+WYQWM+RN+Q+V+WNQFL  LE RFAPTA+DDPRGNLFKLTQ+TTVAAYL EFEA
Sbjct: 94  GPALSWYQWMYRNSQLVSWNQFLQALETRFAPTAYDDPRGNLFKLTQSTTVAAYLVEFEA 153

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANR+VGLS ADLLSCFISG K+D+RREVL++QP +++QA
Sbjct: 154 LANRIVGLSSADLLSCFISGLKLDIRREVLARQPTSLTQA 193


>gb|PNY07310.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
 gb|PNY07311.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1494

 Score =  160 bits (404), Expect = 2e-43
 Identities = 75/100 (75%), Positives = 89/100 (89%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G ALAWYQWM+RN+QIV+WNQFL  LE RFAPTA+DDP+GNLFKLTQ+ +V  YLTEFE+
Sbjct: 101 GPALAWYQWMYRNSQIVSWNQFLRALETRFAPTAYDDPKGNLFKLTQSGSVNDYLTEFES 160

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANR+VGLS  DLLSCFISG KV++RREVL+QQP ++SQA
Sbjct: 161 LANRIVGLSPLDLLSCFISGLKVEIRREVLAQQPNSLSQA 200


>gb|KYP63967.1| hypothetical protein KK1_018554 [Cajanus cajan]
          Length = 223

 Score =  145 bits (365), Expect = 1e-41
 Identities = 67/100 (67%), Positives = 85/100 (85%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G ALAW+QWM+RN QI +WNQ L  LENRFAPTAF+DPRG LFKLTQ+++V +YLTEFE+
Sbjct: 96  GPALAWFQWMYRNGQIHSWNQMLQALENRFAPTAFNDPRGKLFKLTQSSSVTSYLTEFES 155

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANR+VGL  + LLSCFISG K ++RR+V++ QP ++SQA
Sbjct: 156 LANRIVGLQPSFLLSCFISGLKPELRRDVIAHQPSSLSQA 195


>dbj|GAU40457.1| hypothetical protein TSUD_141370 [Trifolium subterraneum]
          Length = 1277

 Score =  153 bits (387), Expect = 4e-41
 Identities = 71/99 (71%), Positives = 88/99 (88%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           GAALAWYQWM++N QI++W QFL  LE RFAPTA+DDPRG LFKL QTT+V++YL++FEA
Sbjct: 82  GAALAWYQWMYKNAQILSWAQFLHALELRFAPTAYDDPRGKLFKLHQTTSVSSYLSDFEA 141

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQ 299
           LANR+VGLS +DLLSCF+SG K ++RREVL+QQPR +SQ
Sbjct: 142 LANRIVGLSPSDLLSCFVSGLKTEIRREVLAQQPRDLSQ 180


>gb|KYP35193.1| hypothetical protein KK1_043782 [Cajanus cajan]
          Length = 207

 Score =  141 bits (355), Expect = 3e-40
 Identities = 67/100 (67%), Positives = 81/100 (81%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           GAALAW+QWM+RN QI +W   L  LE RFAPTAFDDPRG LFKLTQTTTV+ +LTEFEA
Sbjct: 75  GAALAWFQWMYRNGQIHSWQHLLQALETRFAPTAFDDPRGRLFKLTQTTTVSPFLTEFEA 134

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           +ANR+ GLS   LLSCFIS  K ++RREV++QQP +++ A
Sbjct: 135 VANRVTGLSPQFLLSCFISELKPEIRREVIAQQPLSLTHA 174


>dbj|GAU41450.1| hypothetical protein TSUD_98460 [Trifolium subterraneum]
          Length = 1385

 Score =  150 bits (379), Expect = 5e-40
 Identities = 72/100 (72%), Positives = 84/100 (84%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G ALAWYQW   N +IV+W QFL  LE RFAPTA+DDPRG LFKL QTT+VA+YL+EFEA
Sbjct: 114 GPALAWYQWKHSNGEIVSWTQFLRALELRFAPTAYDDPRGKLFKLQQTTSVASYLSEFEA 173

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANR+VGLS  DLLSCF+SG KV++RREVL+QQP  +SQA
Sbjct: 174 LANRIVGLSPQDLLSCFVSGLKVEIRREVLAQQPVDLSQA 213


>gb|PNY17781.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1478

 Score =  150 bits (379), Expect = 5e-40
 Identities = 71/100 (71%), Positives = 86/100 (86%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G ALAWYQWM+RN QIV+W Q L  LE RFAPTA+DDPRG LFKL QTTTVA+YL++FE+
Sbjct: 83  GPALAWYQWMYRNGQIVSWPQVLQALELRFAPTAYDDPRGKLFKLHQTTTVASYLSDFES 142

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANR+VGLS  DLLSCFISG + ++RREVL+QQP +++QA
Sbjct: 143 LANRIVGLSPPDLLSCFISGLRSEIRREVLAQQPTSLTQA 182


>ref|XP_017423676.1| PREDICTED: uncharacterized protein LOC108332889 [Vigna angularis]
          Length = 556

 Score =  147 bits (370), Expect = 2e-39
 Identities = 72/100 (72%), Positives = 83/100 (83%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G ALAW+QWM+RN QI +W Q L  LE  FAPTAFDDPRG LFKLTQT++VAAYL+EFE 
Sbjct: 88  GPALAWFQWMYRNGQIHSWPQLLQALEICFAPTAFDDPRGKLFKLTQTSSVAAYLSEFET 147

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANR+VGL    LLSCFISG K ++RREVLSQQP+T+SQA
Sbjct: 148 LANRIVGLQPQFLLSCFISGLKPEIRREVLSQQPQTLSQA 187


>gb|KYP68937.1| Retrotransposon-derived protein PEG10 [Cajanus cajan]
          Length = 507

 Score =  144 bits (362), Expect = 1e-38
 Identities = 68/100 (68%), Positives = 82/100 (82%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           GAALAW+QWM+RN QI +W   L  LE RFAPTAFDDPRG LFKLTQTTTV+A+LTEFEA
Sbjct: 105 GAALAWFQWMYRNGQIHSWQHLLQALETRFAPTAFDDPRGRLFKLTQTTTVSAFLTEFEA 164

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           +ANR+ GLS   LLSCFI G K ++RREV++QQP +++ A
Sbjct: 165 VANRVTGLSPQFLLSCFIFGLKPEIRREVIAQQPPSLTHA 204


>dbj|GAU39167.1| hypothetical protein TSUD_147890 [Trifolium subterraneum]
          Length = 1450

 Score =  145 bits (365), Expect = 4e-38
 Identities = 69/100 (69%), Positives = 85/100 (85%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G ALAWYQW++RN QIV+W QFL  LE RFAPTA+DDPRG LFKL QTTTV+AYL+EFE+
Sbjct: 113 GPALAWYQWLYRNGQIVSWPQFLQALELRFAPTAYDDPRGKLFKLQQTTTVSAYLSEFES 172

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           +ANR+VGLS  DLLS FISG + ++ +EVL+QQP ++SQA
Sbjct: 173 IANRIVGLSPPDLLSFFISGLRSEICQEVLAQQPSSLSQA 212


>gb|KYP42639.1| Retrovirus-related Pol polyprotein from transposon 297 family
           [Cajanus cajan]
          Length = 894

 Score =  144 bits (364), Expect = 5e-38
 Identities = 67/100 (67%), Positives = 85/100 (85%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G+ALAW+QWM+RN QI +WNQ L  LENRFAPTAFD+PRG LFKLTQ+ +V +YLTEFE+
Sbjct: 96  GSALAWFQWMYRNGQIHSWNQMLQALENRFAPTAFDNPRGKLFKLTQSFSVTSYLTEFES 155

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANR+VGL  + LLSCFISG K ++RR+V++ QP ++SQA
Sbjct: 156 LANRIVGLQPSFLLSCFISGLKPELRRDVIAHQPSSLSQA 195


>ref|XP_014511429.1| uncharacterized protein LOC106770116 [Vigna radiata var. radiata]
          Length = 851

 Score =  139 bits (349), Expect = 5e-36
 Identities = 65/100 (65%), Positives = 81/100 (81%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           GAALAW+QWM+RN QI++W   L  LE RFAPTAF+DPRG LFKL+QT++V+AYL EFEA
Sbjct: 128 GAALAWFQWMYRNGQILSWTHLLQALETRFAPTAFEDPRGKLFKLSQTSSVSAYLNEFEA 187

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
            ANR+ G S   LLSCF+SG K + RREV++QQP+T+S A
Sbjct: 188 TANRVTGXSPPFLLSCFLSGLKSEXRREVVAQQPQTLSLA 227


>ref|XP_017426291.1| PREDICTED: uncharacterized protein LOC108334872 [Vigna angularis]
          Length = 756

 Score =  137 bits (345), Expect = 2e-35
 Identities = 65/100 (65%), Positives = 81/100 (81%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           GAALA +QWM+RN Q+ +W Q L  LE RFAPTAFDDP+G LFKL QTTTV+ +LTEFE+
Sbjct: 108 GAALALFQWMYRNGQLHSWQQLLQALETRFAPTAFDDPKGKLFKLAQTTTVSDFLTEFES 167

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           +ANR+ GL  + LLSCFISG K ++RREV++QQP T+S A
Sbjct: 168 IANRVAGLPPSFLLSCFISGLKPEIRREVVAQQPPTLSHA 207


>dbj|GAU39763.1| hypothetical protein TSUD_220060 [Trifolium subterraneum]
          Length = 721

 Score =  136 bits (343), Expect = 3e-35
 Identities = 65/99 (65%), Positives = 79/99 (79%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           GAALAW QWM++N QIV+WN FL  LE RFAPTA+DDPRG LFKL Q+T+V  YL++FEA
Sbjct: 61  GAALAWCQWMYKNGQIVSWNHFLQALEIRFAPTAYDDPRGKLFKLQQSTSVENYLSDFEA 120

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQ 299
           L NR+VGLS  DLLSCFI G K ++RREVL+Q    +S+
Sbjct: 121 LENRIVGLSPTDLLSCFIFGLKYEIRREVLAQHTLDLSK 159


>gb|PNX62323.1| hypothetical protein L195_g061091, partial [Trifolium pratense]
          Length = 125

 Score =  123 bits (309), Expect = 3e-34
 Identities = 58/100 (58%), Positives = 74/100 (74%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G AL W+QWM +N Q++NW  FL  LE RFAP+ ++DP+G LFKLTQT +V  Y T+FE 
Sbjct: 3   GEALTWFQWMHQNGQLMNWGTFLHALEIRFAPSQYEDPKGALFKLTQTASVKDYQTQFEL 62

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANR++GL  A  LSCF+SG K  +RREVL+ QP T+ QA
Sbjct: 63  LANRIIGLPPACYLSCFVSGLKPAIRREVLAFQPTTLIQA 102


>gb|KHN02181.1| hypothetical protein glysoja_002205, partial [Glycine soja]
          Length = 124

 Score =  122 bits (306), Expect = 7e-34
 Identities = 58/100 (58%), Positives = 73/100 (73%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G ALAW+QWM  N Q  +W  FL  L+ RFAP+ ++DP G+LFKL Q TTVA YL+EFE 
Sbjct: 14  GRALAWFQWMSSNGQFTSWPVFLQALQTRFAPSNYEDPSGSLFKLIQKTTVAEYLSEFEE 73

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANR+VGL    LLSCF+SG   ++RREV+  QP T++QA
Sbjct: 74  LANRVVGLPAPFLLSCFVSGLAPEIRREVMINQPLTVAQA 113


>ref|XP_014630536.1| PREDICTED: uncharacterized protein LOC106798462 [Glycine max]
          Length = 1691

 Score =  132 bits (332), Expect = 1e-33
 Identities = 67/100 (67%), Positives = 75/100 (75%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           G AL+WYQWM  N  I +WN FL  LE+RFAPT +DDP+G LFKLTQT TV  YLTEFE 
Sbjct: 113 GPALSWYQWMHSNGLITSWNGFLQALESRFAPTFYDDPKGALFKLTQTGTVNDYLTEFER 172

Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302
           LANR+VGL    LLSCFISG K DVRREVL+ QP +  QA
Sbjct: 173 LANRVVGLPPPFLLSCFISGLKPDVRREVLALQPLSFLQA 212


>gb|KYP33764.1| hypothetical protein KK1_045361 [Cajanus cajan]
          Length = 238

 Score =  125 bits (313), Expect = 1e-33
 Identities = 60/91 (65%), Positives = 75/91 (82%)
 Frame = +3

Query: 30  MFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEALANRLVGLS 209
           M+RN Q+ +WNQFL  LENRFAPTAFDDPRG LFKLTQ+++V  YLTEFE+LANR+ GL 
Sbjct: 1   MYRNGQLHSWNQFLQALENRFAPTAFDDPRGKLFKLTQSSSVTEYLTEFESLANRIDGLQ 60

Query: 210 QADLLSCFISG*KVDVRREVLSQQPRTISQA 302
              LLSCFISG  +++RR+V++ QP +ISQA
Sbjct: 61  PFLLLSCFISGLTLELRRDVIAHQPTSISQA 91


>ref|XP_020232880.1| uncharacterized protein LOC109813157 [Cajanus cajan]
          Length = 164

 Score =  121 bits (303), Expect = 7e-33
 Identities = 57/85 (67%), Positives = 69/85 (81%)
 Frame = +3

Query: 3   GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182
           GAALAW+QWM+RN QI++W QFL  LE RFAP+AFD P+G LFKL QT+TVA YL+EFEA
Sbjct: 80  GAALAWFQWMYRNGQILSWTQFLQALETRFAPSAFDHPKGKLFKLQQTSTVADYLSEFEA 139

Query: 183 LANRLVGLSQADLLSCFISG*KVDV 257
           L+NR+ GL  + LLSCFI G K +V
Sbjct: 140 LSNRIDGLPPSFLLSCFIFGLKTEV 164


Top