BLASTX nr result
ID: Astragalus22_contig00038733
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00038733 (304 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PNX79664.1| hypothetical protein L195_g035651 [Trifolium prat... 167 4e-47 gb|PNX92469.1| hypothetical protein L195_g015607 [Trifolium prat... 167 6e-47 gb|PNY07310.1| Ty3/gypsy retrotransposon protein [Trifolium prat... 160 2e-43 gb|KYP63967.1| hypothetical protein KK1_018554 [Cajanus cajan] 145 1e-41 dbj|GAU40457.1| hypothetical protein TSUD_141370 [Trifolium subt... 153 4e-41 gb|KYP35193.1| hypothetical protein KK1_043782 [Cajanus cajan] 141 3e-40 dbj|GAU41450.1| hypothetical protein TSUD_98460 [Trifolium subte... 150 5e-40 gb|PNY17781.1| Ty3/gypsy retrotransposon protein [Trifolium prat... 150 5e-40 ref|XP_017423676.1| PREDICTED: uncharacterized protein LOC108332... 147 2e-39 gb|KYP68937.1| Retrotransposon-derived protein PEG10 [Cajanus ca... 144 1e-38 dbj|GAU39167.1| hypothetical protein TSUD_147890 [Trifolium subt... 145 4e-38 gb|KYP42639.1| Retrovirus-related Pol polyprotein from transposo... 144 5e-38 ref|XP_014511429.1| uncharacterized protein LOC106770116 [Vigna ... 139 5e-36 ref|XP_017426291.1| PREDICTED: uncharacterized protein LOC108334... 137 2e-35 dbj|GAU39763.1| hypothetical protein TSUD_220060 [Trifolium subt... 136 3e-35 gb|PNX62323.1| hypothetical protein L195_g061091, partial [Trifo... 123 3e-34 gb|KHN02181.1| hypothetical protein glysoja_002205, partial [Gly... 122 7e-34 ref|XP_014630536.1| PREDICTED: uncharacterized protein LOC106798... 132 1e-33 gb|KYP33764.1| hypothetical protein KK1_045361 [Cajanus cajan] 125 1e-33 ref|XP_020232880.1| uncharacterized protein LOC109813157 [Cajanu... 121 7e-33 >gb|PNX79664.1| hypothetical protein L195_g035651 [Trifolium pratense] Length = 536 Score = 167 bits (422), Expect = 4e-47 Identities = 81/100 (81%), Positives = 89/100 (89%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 GAALAWYQWM+RN QI +W QFL +LE RFAPTAFDDPRGNLFKLTQ+TTV+AYLTEFEA Sbjct: 94 GAALAWYQWMYRNRQIASWAQFLEKLETRFAPTAFDDPRGNLFKLTQSTTVSAYLTEFEA 153 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANRL GLS DLLSCFISG K DVRREV++QQP +ISQA Sbjct: 154 LANRLEGLSDVDLLSCFISGLKSDVRREVVAQQPTSISQA 193 >gb|PNX92469.1| hypothetical protein L195_g015607 [Trifolium pratense] Length = 566 Score = 167 bits (422), Expect = 6e-47 Identities = 77/100 (77%), Positives = 92/100 (92%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G AL+WYQWM+RN+Q+V+WNQFL LE RFAPTA+DDPRGNLFKLTQ+TTVAAYL EFEA Sbjct: 94 GPALSWYQWMYRNSQLVSWNQFLQALETRFAPTAYDDPRGNLFKLTQSTTVAAYLVEFEA 153 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANR+VGLS ADLLSCFISG K+D+RREVL++QP +++QA Sbjct: 154 LANRIVGLSSADLLSCFISGLKLDIRREVLARQPTSLTQA 193 >gb|PNY07310.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] gb|PNY07311.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1494 Score = 160 bits (404), Expect = 2e-43 Identities = 75/100 (75%), Positives = 89/100 (89%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G ALAWYQWM+RN+QIV+WNQFL LE RFAPTA+DDP+GNLFKLTQ+ +V YLTEFE+ Sbjct: 101 GPALAWYQWMYRNSQIVSWNQFLRALETRFAPTAYDDPKGNLFKLTQSGSVNDYLTEFES 160 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANR+VGLS DLLSCFISG KV++RREVL+QQP ++SQA Sbjct: 161 LANRIVGLSPLDLLSCFISGLKVEIRREVLAQQPNSLSQA 200 >gb|KYP63967.1| hypothetical protein KK1_018554 [Cajanus cajan] Length = 223 Score = 145 bits (365), Expect = 1e-41 Identities = 67/100 (67%), Positives = 85/100 (85%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G ALAW+QWM+RN QI +WNQ L LENRFAPTAF+DPRG LFKLTQ+++V +YLTEFE+ Sbjct: 96 GPALAWFQWMYRNGQIHSWNQMLQALENRFAPTAFNDPRGKLFKLTQSSSVTSYLTEFES 155 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANR+VGL + LLSCFISG K ++RR+V++ QP ++SQA Sbjct: 156 LANRIVGLQPSFLLSCFISGLKPELRRDVIAHQPSSLSQA 195 >dbj|GAU40457.1| hypothetical protein TSUD_141370 [Trifolium subterraneum] Length = 1277 Score = 153 bits (387), Expect = 4e-41 Identities = 71/99 (71%), Positives = 88/99 (88%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 GAALAWYQWM++N QI++W QFL LE RFAPTA+DDPRG LFKL QTT+V++YL++FEA Sbjct: 82 GAALAWYQWMYKNAQILSWAQFLHALELRFAPTAYDDPRGKLFKLHQTTSVSSYLSDFEA 141 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQ 299 LANR+VGLS +DLLSCF+SG K ++RREVL+QQPR +SQ Sbjct: 142 LANRIVGLSPSDLLSCFVSGLKTEIRREVLAQQPRDLSQ 180 >gb|KYP35193.1| hypothetical protein KK1_043782 [Cajanus cajan] Length = 207 Score = 141 bits (355), Expect = 3e-40 Identities = 67/100 (67%), Positives = 81/100 (81%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 GAALAW+QWM+RN QI +W L LE RFAPTAFDDPRG LFKLTQTTTV+ +LTEFEA Sbjct: 75 GAALAWFQWMYRNGQIHSWQHLLQALETRFAPTAFDDPRGRLFKLTQTTTVSPFLTEFEA 134 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 +ANR+ GLS LLSCFIS K ++RREV++QQP +++ A Sbjct: 135 VANRVTGLSPQFLLSCFISELKPEIRREVIAQQPLSLTHA 174 >dbj|GAU41450.1| hypothetical protein TSUD_98460 [Trifolium subterraneum] Length = 1385 Score = 150 bits (379), Expect = 5e-40 Identities = 72/100 (72%), Positives = 84/100 (84%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G ALAWYQW N +IV+W QFL LE RFAPTA+DDPRG LFKL QTT+VA+YL+EFEA Sbjct: 114 GPALAWYQWKHSNGEIVSWTQFLRALELRFAPTAYDDPRGKLFKLQQTTSVASYLSEFEA 173 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANR+VGLS DLLSCF+SG KV++RREVL+QQP +SQA Sbjct: 174 LANRIVGLSPQDLLSCFVSGLKVEIRREVLAQQPVDLSQA 213 >gb|PNY17781.1| Ty3/gypsy retrotransposon protein [Trifolium pratense] Length = 1478 Score = 150 bits (379), Expect = 5e-40 Identities = 71/100 (71%), Positives = 86/100 (86%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G ALAWYQWM+RN QIV+W Q L LE RFAPTA+DDPRG LFKL QTTTVA+YL++FE+ Sbjct: 83 GPALAWYQWMYRNGQIVSWPQVLQALELRFAPTAYDDPRGKLFKLHQTTTVASYLSDFES 142 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANR+VGLS DLLSCFISG + ++RREVL+QQP +++QA Sbjct: 143 LANRIVGLSPPDLLSCFISGLRSEIRREVLAQQPTSLTQA 182 >ref|XP_017423676.1| PREDICTED: uncharacterized protein LOC108332889 [Vigna angularis] Length = 556 Score = 147 bits (370), Expect = 2e-39 Identities = 72/100 (72%), Positives = 83/100 (83%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G ALAW+QWM+RN QI +W Q L LE FAPTAFDDPRG LFKLTQT++VAAYL+EFE Sbjct: 88 GPALAWFQWMYRNGQIHSWPQLLQALEICFAPTAFDDPRGKLFKLTQTSSVAAYLSEFET 147 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANR+VGL LLSCFISG K ++RREVLSQQP+T+SQA Sbjct: 148 LANRIVGLQPQFLLSCFISGLKPEIRREVLSQQPQTLSQA 187 >gb|KYP68937.1| Retrotransposon-derived protein PEG10 [Cajanus cajan] Length = 507 Score = 144 bits (362), Expect = 1e-38 Identities = 68/100 (68%), Positives = 82/100 (82%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 GAALAW+QWM+RN QI +W L LE RFAPTAFDDPRG LFKLTQTTTV+A+LTEFEA Sbjct: 105 GAALAWFQWMYRNGQIHSWQHLLQALETRFAPTAFDDPRGRLFKLTQTTTVSAFLTEFEA 164 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 +ANR+ GLS LLSCFI G K ++RREV++QQP +++ A Sbjct: 165 VANRVTGLSPQFLLSCFIFGLKPEIRREVIAQQPPSLTHA 204 >dbj|GAU39167.1| hypothetical protein TSUD_147890 [Trifolium subterraneum] Length = 1450 Score = 145 bits (365), Expect = 4e-38 Identities = 69/100 (69%), Positives = 85/100 (85%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G ALAWYQW++RN QIV+W QFL LE RFAPTA+DDPRG LFKL QTTTV+AYL+EFE+ Sbjct: 113 GPALAWYQWLYRNGQIVSWPQFLQALELRFAPTAYDDPRGKLFKLQQTTTVSAYLSEFES 172 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 +ANR+VGLS DLLS FISG + ++ +EVL+QQP ++SQA Sbjct: 173 IANRIVGLSPPDLLSFFISGLRSEICQEVLAQQPSSLSQA 212 >gb|KYP42639.1| Retrovirus-related Pol polyprotein from transposon 297 family [Cajanus cajan] Length = 894 Score = 144 bits (364), Expect = 5e-38 Identities = 67/100 (67%), Positives = 85/100 (85%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G+ALAW+QWM+RN QI +WNQ L LENRFAPTAFD+PRG LFKLTQ+ +V +YLTEFE+ Sbjct: 96 GSALAWFQWMYRNGQIHSWNQMLQALENRFAPTAFDNPRGKLFKLTQSFSVTSYLTEFES 155 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANR+VGL + LLSCFISG K ++RR+V++ QP ++SQA Sbjct: 156 LANRIVGLQPSFLLSCFISGLKPELRRDVIAHQPSSLSQA 195 >ref|XP_014511429.1| uncharacterized protein LOC106770116 [Vigna radiata var. radiata] Length = 851 Score = 139 bits (349), Expect = 5e-36 Identities = 65/100 (65%), Positives = 81/100 (81%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 GAALAW+QWM+RN QI++W L LE RFAPTAF+DPRG LFKL+QT++V+AYL EFEA Sbjct: 128 GAALAWFQWMYRNGQILSWTHLLQALETRFAPTAFEDPRGKLFKLSQTSSVSAYLNEFEA 187 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 ANR+ G S LLSCF+SG K + RREV++QQP+T+S A Sbjct: 188 TANRVTGXSPPFLLSCFLSGLKSEXRREVVAQQPQTLSLA 227 >ref|XP_017426291.1| PREDICTED: uncharacterized protein LOC108334872 [Vigna angularis] Length = 756 Score = 137 bits (345), Expect = 2e-35 Identities = 65/100 (65%), Positives = 81/100 (81%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 GAALA +QWM+RN Q+ +W Q L LE RFAPTAFDDP+G LFKL QTTTV+ +LTEFE+ Sbjct: 108 GAALALFQWMYRNGQLHSWQQLLQALETRFAPTAFDDPKGKLFKLAQTTTVSDFLTEFES 167 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 +ANR+ GL + LLSCFISG K ++RREV++QQP T+S A Sbjct: 168 IANRVAGLPPSFLLSCFISGLKPEIRREVVAQQPPTLSHA 207 >dbj|GAU39763.1| hypothetical protein TSUD_220060 [Trifolium subterraneum] Length = 721 Score = 136 bits (343), Expect = 3e-35 Identities = 65/99 (65%), Positives = 79/99 (79%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 GAALAW QWM++N QIV+WN FL LE RFAPTA+DDPRG LFKL Q+T+V YL++FEA Sbjct: 61 GAALAWCQWMYKNGQIVSWNHFLQALEIRFAPTAYDDPRGKLFKLQQSTSVENYLSDFEA 120 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQ 299 L NR+VGLS DLLSCFI G K ++RREVL+Q +S+ Sbjct: 121 LENRIVGLSPTDLLSCFIFGLKYEIRREVLAQHTLDLSK 159 >gb|PNX62323.1| hypothetical protein L195_g061091, partial [Trifolium pratense] Length = 125 Score = 123 bits (309), Expect = 3e-34 Identities = 58/100 (58%), Positives = 74/100 (74%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G AL W+QWM +N Q++NW FL LE RFAP+ ++DP+G LFKLTQT +V Y T+FE Sbjct: 3 GEALTWFQWMHQNGQLMNWGTFLHALEIRFAPSQYEDPKGALFKLTQTASVKDYQTQFEL 62 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANR++GL A LSCF+SG K +RREVL+ QP T+ QA Sbjct: 63 LANRIIGLPPACYLSCFVSGLKPAIRREVLAFQPTTLIQA 102 >gb|KHN02181.1| hypothetical protein glysoja_002205, partial [Glycine soja] Length = 124 Score = 122 bits (306), Expect = 7e-34 Identities = 58/100 (58%), Positives = 73/100 (73%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G ALAW+QWM N Q +W FL L+ RFAP+ ++DP G+LFKL Q TTVA YL+EFE Sbjct: 14 GRALAWFQWMSSNGQFTSWPVFLQALQTRFAPSNYEDPSGSLFKLIQKTTVAEYLSEFEE 73 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANR+VGL LLSCF+SG ++RREV+ QP T++QA Sbjct: 74 LANRVVGLPAPFLLSCFVSGLAPEIRREVMINQPLTVAQA 113 >ref|XP_014630536.1| PREDICTED: uncharacterized protein LOC106798462 [Glycine max] Length = 1691 Score = 132 bits (332), Expect = 1e-33 Identities = 67/100 (67%), Positives = 75/100 (75%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 G AL+WYQWM N I +WN FL LE+RFAPT +DDP+G LFKLTQT TV YLTEFE Sbjct: 113 GPALSWYQWMHSNGLITSWNGFLQALESRFAPTFYDDPKGALFKLTQTGTVNDYLTEFER 172 Query: 183 LANRLVGLSQADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LANR+VGL LLSCFISG K DVRREVL+ QP + QA Sbjct: 173 LANRVVGLPPPFLLSCFISGLKPDVRREVLALQPLSFLQA 212 >gb|KYP33764.1| hypothetical protein KK1_045361 [Cajanus cajan] Length = 238 Score = 125 bits (313), Expect = 1e-33 Identities = 60/91 (65%), Positives = 75/91 (82%) Frame = +3 Query: 30 MFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEALANRLVGLS 209 M+RN Q+ +WNQFL LENRFAPTAFDDPRG LFKLTQ+++V YLTEFE+LANR+ GL Sbjct: 1 MYRNGQLHSWNQFLQALENRFAPTAFDDPRGKLFKLTQSSSVTEYLTEFESLANRIDGLQ 60 Query: 210 QADLLSCFISG*KVDVRREVLSQQPRTISQA 302 LLSCFISG +++RR+V++ QP +ISQA Sbjct: 61 PFLLLSCFISGLTLELRRDVIAHQPTSISQA 91 >ref|XP_020232880.1| uncharacterized protein LOC109813157 [Cajanus cajan] Length = 164 Score = 121 bits (303), Expect = 7e-33 Identities = 57/85 (67%), Positives = 69/85 (81%) Frame = +3 Query: 3 GAALAWYQWMFRNNQIVNWNQFLTELENRFAPTAFDDPRGNLFKLTQTTTVAAYLTEFEA 182 GAALAW+QWM+RN QI++W QFL LE RFAP+AFD P+G LFKL QT+TVA YL+EFEA Sbjct: 80 GAALAWFQWMYRNGQILSWTQFLQALETRFAPSAFDHPKGKLFKLQQTSTVADYLSEFEA 139 Query: 183 LANRLVGLSQADLLSCFISG*KVDV 257 L+NR+ GL + LLSCFI G K +V Sbjct: 140 LSNRIDGLPPSFLLSCFIFGLKTEV 164