BLASTX nr result
ID: Astragalus22_contig00036717
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00036717 (400 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|GAU33096.1| hypothetical protein TSUD_259470 [Trifolium subt... 108 9e-25 gb|PNX85211.1| gag-pol polyprotein, partial [Trifolium pratense] 106 4e-24 gb|PNX88251.1| gag-pol polyprotein [Trifolium pratense] 103 6e-24 gb|PNX86185.1| retrotransposon-related protein, partial [Trifoli... 103 4e-23 gb|PNX91900.1| gag-pol polyprotein [Trifolium pratense] 101 3e-22 gb|KYP60970.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] 97 4e-22 dbj|GAU32939.1| hypothetical protein TSUD_153560 [Trifolium subt... 100 5e-22 gb|PNY11290.1| gag-pol polyprotein [Trifolium pratense] 99 1e-21 gb|PNY05502.1| gag-pol polyprotein, partial [Trifolium pratense] 99 2e-21 ref|XP_016191765.1| uncharacterized protein LOC107632612 [Arachi... 98 2e-21 gb|KYP39861.1| hypothetical protein KK1_038827 [Cajanus cajan] 94 2e-21 gb|PNX78768.1| gag-pol polyprotein, partial [Trifolium pratense] 98 4e-21 dbj|GAU27362.1| hypothetical protein TSUD_55150 [Trifolium subte... 98 4e-21 ref|XP_020218762.1| uncharacterized protein LOC109801993 [Cajanu... 94 7e-21 dbj|GAU42011.1| hypothetical protein TSUD_236780 [Trifolium subt... 97 1e-20 dbj|GAU51073.1| hypothetical protein TSUD_13180, partial [Trifol... 97 1e-20 dbj|GAU30548.1| hypothetical protein TSUD_65580 [Trifolium subte... 97 1e-20 ref|XP_015949749.1| uncharacterized protein LOC107474627 [Arachi... 95 1e-20 dbj|GAU24562.1| hypothetical protein TSUD_149030 [Trifolium subt... 97 1e-20 dbj|GAU39516.1| hypothetical protein TSUD_68800 [Trifolium subte... 97 1e-20 >dbj|GAU33096.1| hypothetical protein TSUD_259470 [Trifolium subterraneum] Length = 1237 Score = 108 bits (270), Expect = 9e-25 Identities = 56/127 (44%), Positives = 79/127 (62%) Frame = -1 Query: 391 RTVQSKPPQSQDCRFLDLDLRLYDPAREEKRPRPVDDLKEIRIGPADHQKTKIGAGLALD 212 R + K Q C FLDLD R ++ EE RPRP +++KEI+IG Q++K+G L+ Sbjct: 355 RAERRKIVNDQRCNFLDLDPRDFNREEEEWRPRPAEEVKEIQIGAEPGQRSKVGTSLSRI 414 Query: 211 TKRVFIKFLADNVDVFAWSPQDMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIK 32 + L N+D+FAWS +DM GIDPNFI H L++N + + + Q KRK EK +AI+ Sbjct: 415 MEEELKTTLRKNIDLFAWSAKDMSGIDPNFICHHLAVNSNAKVLQQRKRKMSLEKQKAIE 474 Query: 31 EEANKLL 11 EE KL+ Sbjct: 475 EETQKLV 481 >gb|PNX85211.1| gag-pol polyprotein, partial [Trifolium pratense] Length = 509 Score = 106 bits (264), Expect = 4e-24 Identities = 52/105 (49%), Positives = 70/105 (66%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E ++R P++DL++++IG HQ T +G L+L K IK L +N D+FAW P Sbjct: 36 DPREEFQDRRVSPIEDLEQVQIGEHPHQTTSLGTALSLQEKEKIIKILKNNADLFAWKPS 95 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMPGID I H+LS++PSI+ +SQ KRK GEE+ AI EE KL Sbjct: 96 DMPGIDERVITHKLSISPSIKLISQRKRKVGEERRAAIAEEVAKL 140 >gb|PNX88251.1| gag-pol polyprotein [Trifolium pratense] Length = 347 Score = 103 bits (258), Expect = 6e-24 Identities = 51/105 (48%), Positives = 68/105 (64%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E ++R P++DLK+++IG HQ T +G L+ + IK L DN D+FAW P Sbjct: 68 DPREEFQDRRLSPIEDLKQVQIGEHPHQTTSLGTTLSYQEREKIIKILKDNADLFAWKPS 127 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMPGID I H+LS++PS + +SQ KRK GEE+ AI EE KL Sbjct: 128 DMPGIDEGVITHKLSISPSTKPISQRKRKVGEERRVAIAEEVEKL 172 >gb|PNX86185.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 465 Score = 103 bits (256), Expect = 4e-23 Identities = 50/105 (47%), Positives = 68/105 (64%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E ++R P++DLK+++IG HQ T +G L+ + IK L DN D+FAW P Sbjct: 243 DPREEFQDRRVSPIEDLKQVQIGEHPHQTTSLGTTLSYQEREKIIKILKDNADLFAWKPS 302 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMPGID I H+LS++P+ + +SQ KRK GEE+ AI EE KL Sbjct: 303 DMPGIDEGVITHKLSISPNTKPISQRKRKVGEERRVAIAEEVEKL 347 >gb|PNX91900.1| gag-pol polyprotein [Trifolium pratense] Length = 750 Score = 101 bits (251), Expect = 3e-22 Identities = 49/105 (46%), Positives = 69/105 (65%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E +R P+++L++I+IG HQ T +G L+ + IK L DN D+FAW+P Sbjct: 510 DPREELHNRRVSPIEELEQIQIGGQPHQTTNLGTALSAQERERIIKILKDNADLFAWTPS 569 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMPGID + I H+LS++P I+ ++Q KRK GEE+ AI EE KL Sbjct: 570 DMPGIDESVITHKLSISPDIKPIAQRKRKVGEERRAAIAEEVAKL 614 >gb|KYP60970.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan] Length = 258 Score = 97.4 bits (241), Expect = 4e-22 Identities = 48/99 (48%), Positives = 65/99 (65%) Frame = -1 Query: 307 EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQDMPGIDP 128 + RP P +D+KE+ I + + KIG L L+ + I+ L DNV FAW DMPGIDP Sbjct: 5 DHRPAPAEDVKEVEI--MEGRNVKIGTSLTLEDEEKLIRVLKDNVSAFAWHSSDMPGIDP 62 Query: 127 NFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLL 11 +F+ H+L+L+PS + V Q +RK GEEK RAI EE KL+ Sbjct: 63 DFLCHKLALDPSAKAVIQKRRKFGEEKRRAITEETQKLV 101 >dbj|GAU32939.1| hypothetical protein TSUD_153560 [Trifolium subterraneum] Length = 1382 Score = 100 bits (250), Expect = 5e-22 Identities = 53/105 (50%), Positives = 69/105 (65%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E E R P++DL+EI+IG HQ T IG L + K I+ L NVD+FAW P+ Sbjct: 491 DPREEFHEGRVSPIEDLEEIKIGSEPHQVTNIGTTLPAEEKDKVIETLRRNVDLFAWHPK 550 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMPGID + I H+L+++P + VSQ KRK GEE+ AI EE +KL Sbjct: 551 DMPGIDESIITHKLAIHPDAKPVSQRKRKVGEERRAAIDEEVDKL 595 >gb|PNY11290.1| gag-pol polyprotein [Trifolium pratense] Length = 715 Score = 99.4 bits (246), Expect = 1e-21 Identities = 50/105 (47%), Positives = 69/105 (65%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E ++R P+++L++++IG HQ T +G L + +K L NVD+FAW P Sbjct: 141 DPREEFQDRRVSPIEELEQVQIGEEPHQTTNLGTTLLHSEREKIMKILKKNVDLFAWKPS 200 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMPGID + I H+LS++PSI+ VSQ KRK GEE+ AI EE KL Sbjct: 201 DMPGIDESVITHKLSISPSIKPVSQRKRKAGEERRVAIVEEVAKL 245 >gb|PNY05502.1| gag-pol polyprotein, partial [Trifolium pratense] Length = 1734 Score = 99.0 bits (245), Expect = 2e-21 Identities = 49/105 (46%), Positives = 70/105 (66%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E ++R P+++L+ ++IG A+HQ T IG L + K + L NVD+FAW P Sbjct: 656 DPREEFKDQRVSPIENLEPVQIGSAEHQVTYIGTQLNNEEKERIVATLRSNVDLFAWKPS 715 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMPGID + I H+L+++P ++ VSQ KRK GEE+ AI EE +KL Sbjct: 716 DMPGIDESIITHKLAISPKVKPVSQRKRKVGEERRTAIDEEVSKL 760 >ref|XP_016191765.1| uncharacterized protein LOC107632612 [Arachis ipaensis] Length = 403 Score = 97.8 bits (242), Expect = 2e-21 Identities = 51/114 (44%), Positives = 71/114 (62%), Gaps = 1/114 (0%) Frame = -1 Query: 340 LDLRLYDPARE-EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVF 164 L L DP + ++RP+PVDDL+++ + Q T IG L + + IK L DN D+F Sbjct: 123 LSLAELDPRNDFQERPQPVDDLQKVPLTRKADQFTYIGRALEGEEQSKLIKVLQDNADLF 182 Query: 163 AWSPQDMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLLAVD 2 AW+P DMPGIDP I H+L++N +I+ ++Q KR G EK +A EE KLL D Sbjct: 183 AWTPADMPGIDPRVICHKLAINKTIRPIAQKKRNLGAEKVKAALEETKKLLNAD 236 >gb|KYP39861.1| hypothetical protein KK1_038827 [Cajanus cajan] Length = 206 Score = 94.4 bits (233), Expect = 2e-21 Identities = 47/112 (41%), Positives = 73/112 (65%) Frame = -1 Query: 346 LDLDLRLYDPAREEKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDV 167 +DLD R+ +E+R P++D K I++G +D Q T +G L+ K + + DN D+ Sbjct: 1 MDLDPRI----DQEERVEPIEDKKSIQVGASDSQLTYLGTILSEQEKSAIGQVILDNKDL 56 Query: 166 FAWSPQDMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLL 11 FAW P DMPGIDP+F+ H+LS++ + ++Q +RK GEE+ AI+ E +KLL Sbjct: 57 FAWHPSDMPGIDPDFLCHKLSISKEEKPIAQRRRKAGEERKAAIEVEVSKLL 108 >gb|PNX78768.1| gag-pol polyprotein, partial [Trifolium pratense] Length = 713 Score = 98.2 bits (243), Expect = 4e-21 Identities = 50/105 (47%), Positives = 68/105 (64%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E ++R P+++L++I+IG HQ T +G L K +K L +NVD+FAW P Sbjct: 249 DPREEFQDRRFNPIEELEQIQIGVEPHQTTNLGRELLPTDKARIVKILKENVDLFAWKPS 308 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMP ID + I H+LS++P I+ VSQ KRK GEE+ AI EE KL Sbjct: 309 DMPDIDESVITHKLSISPKIKPVSQRKRKVGEERRAAITEEVTKL 353 >dbj|GAU27362.1| hypothetical protein TSUD_55150 [Trifolium subterraneum] Length = 1410 Score = 98.2 bits (243), Expect = 4e-21 Identities = 51/107 (47%), Positives = 68/107 (63%), Gaps = 2/107 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E +KR P++DL+ I+IG A H+ T +G L K I+ L NVD+FAW P Sbjct: 602 DPREEFQDKRVSPIEDLEPIQIGEAPHELTNMGTHLDEGEKEKIIEILRKNVDLFAWKPS 661 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLLA 8 DMPGID I H+L++ P+ + VSQ KRK GEE+ AI EE +K L+ Sbjct: 662 DMPGIDETIITHKLAIAPNSKPVSQRKRKVGEERRTAIDEEVHKFLS 708 >ref|XP_020218762.1| uncharacterized protein LOC109801993 [Cajanus cajan] Length = 259 Score = 94.4 bits (233), Expect = 7e-21 Identities = 49/117 (41%), Positives = 77/117 (65%) Frame = -1 Query: 361 QDCRFLDLDLRLYDPAREEKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLA 182 Q+ +DLD R+ + +++P PV+D+KE+ + + +K KIG L+ + I+ L Sbjct: 80 QEVHSVDLDPRV---SHFDRQPAPVEDVKEVIV--MEGRKVKIGTSLSPEDAVKLIEVLK 134 Query: 181 DNVDVFAWSPQDMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLL 11 N+ FAW +DMPG+DP+F+ H+L+++PS + V Q +RK GEEK +AI EE NKLL Sbjct: 135 VNISAFAWHAKDMPGVDPDFMCHRLAIDPSAKPVIQKRRKFGEEKRKAIAEEINKLL 191 >dbj|GAU42011.1| hypothetical protein TSUD_236780 [Trifolium subterraneum] Length = 806 Score = 97.1 bits (240), Expect = 1e-20 Identities = 49/110 (44%), Positives = 68/110 (61%) Frame = -1 Query: 391 RTVQSKPPQSQDCRFLDLDLRLYDPAREEKRPRPVDDLKEIRIGPADHQKTKIGAGLALD 212 R + K Q C FLDLD R ++ EE RPRP +++KEI+IG Q+TK+G L Sbjct: 630 RAERRKIVNDQRCNFLDLDPRDFNKDEEEWRPRPAEEVKEIQIGAEPGQRTKVGTSLLRT 689 Query: 211 TKRVFIKFLADNVDVFAWSPQDMPGIDPNFIYHQLSLNPSIQHVSQSKRK 62 + L N+D+FAWS +DM GIDPNFI H+L++N + + + Q KRK Sbjct: 690 MEEELKTTLRKNIDLFAWSAKDMLGIDPNFICHRLAVNSNAKVMQQRKRK 739 >dbj|GAU51073.1| hypothetical protein TSUD_13180, partial [Trifolium subterraneum] Length = 1053 Score = 97.1 bits (240), Expect = 1e-20 Identities = 50/105 (47%), Positives = 67/105 (63%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E ++R P++DL+ I+IG A H+ T +G L K I+ L NVD+FAW P Sbjct: 76 DPREEFQDRRVSPIEDLEPIQIGEAPHELTNLGTHLDEGEKEKIIEILRKNVDLFAWKPS 135 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMPGID I H+L++ P+ + VSQ KRK GEE+ AI EE +KL Sbjct: 136 DMPGIDETIITHKLAIAPNSKPVSQRKRKVGEERRTAIDEEVHKL 180 >dbj|GAU30548.1| hypothetical protein TSUD_65580 [Trifolium subterraneum] Length = 1352 Score = 97.1 bits (240), Expect = 1e-20 Identities = 50/105 (47%), Positives = 67/105 (63%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E ++R P++DL+ I+IG A H+ T +G L K I+ L NVD+FAW P Sbjct: 603 DPREEFQDRRVSPIEDLEPIQIGEAPHELTNLGTQLDEGEKEKIIEILRKNVDLFAWKPS 662 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMPGID I H+L++ P+ + VSQ KRK GEE+ AI EE +KL Sbjct: 663 DMPGIDETIITHKLAIAPNSKPVSQRKRKVGEERRTAIDEEVHKL 707 >ref|XP_015949749.1| uncharacterized protein LOC107474627 [Arachis duranensis] Length = 328 Score = 95.1 bits (235), Expect = 1e-20 Identities = 45/104 (43%), Positives = 64/104 (61%) Frame = -1 Query: 322 DPAREEKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQDM 143 DP + RP P DDL+++ + D Q T +G+ K + L N D+FAW+P DM Sbjct: 200 DPRSDNHRPTPTDDLEKVILNQ-DEQFTNVGSAFIAGQKTDLMALLKTNADLFAWTPADM 258 Query: 142 PGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLL 11 PGIDPNFI H+L+++P+ Q + Q KR G+E+ RA + E KLL Sbjct: 259 PGIDPNFICHKLAVHPNAQSIRQKKRNLGDERRRAAEAETKKLL 302 >dbj|GAU24562.1| hypothetical protein TSUD_149030 [Trifolium subterraneum] Length = 1406 Score = 96.7 bits (239), Expect = 1e-20 Identities = 52/105 (49%), Positives = 67/105 (63%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E E R P++DL+EI IG HQ T IG L + K I+ L NVD+FAW P+ Sbjct: 409 DPREEFHEGRVSPIEDLEEITIGSEPHQVTNIGTTLPPEEKDKVIETLRRNVDLFAWHPK 468 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 MPGID + I H+L+++P + VSQ KRK GEE+ AI EE +KL Sbjct: 469 HMPGIDESIITHKLAIHPDAKPVSQQKRKVGEERRSAIDEEVDKL 513 >dbj|GAU39516.1| hypothetical protein TSUD_68800 [Trifolium subterraneum] Length = 1537 Score = 96.7 bits (239), Expect = 1e-20 Identities = 49/105 (46%), Positives = 68/105 (64%), Gaps = 2/105 (1%) Frame = -1 Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149 DP E ++R P++DL+ I+IG A H+ T +G L + K I+ L NVD+FAW P Sbjct: 580 DPREEFQDRRVSPIEDLEPIQIGEAPHELTNLGTHLDEEEKEKIIEILRKNVDLFAWKPS 639 Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14 DMPGID I H+L++ P+ + VSQ KRK GEE+ +I EE +KL Sbjct: 640 DMPGIDETIITHKLAIVPNSKPVSQRKRKVGEERRTSIDEEVHKL 684