BLASTX nr result

ID: Astragalus22_contig00036717

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00036717
         (400 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU33096.1| hypothetical protein TSUD_259470 [Trifolium subt...   108   9e-25
gb|PNX85211.1| gag-pol polyprotein, partial [Trifolium pratense]      106   4e-24
gb|PNX88251.1| gag-pol polyprotein [Trifolium pratense]               103   6e-24
gb|PNX86185.1| retrotransposon-related protein, partial [Trifoli...   103   4e-23
gb|PNX91900.1| gag-pol polyprotein [Trifolium pratense]               101   3e-22
gb|KYP60970.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]    97   4e-22
dbj|GAU32939.1| hypothetical protein TSUD_153560 [Trifolium subt...   100   5e-22
gb|PNY11290.1| gag-pol polyprotein [Trifolium pratense]                99   1e-21
gb|PNY05502.1| gag-pol polyprotein, partial [Trifolium pratense]       99   2e-21
ref|XP_016191765.1| uncharacterized protein LOC107632612 [Arachi...    98   2e-21
gb|KYP39861.1| hypothetical protein KK1_038827 [Cajanus cajan]         94   2e-21
gb|PNX78768.1| gag-pol polyprotein, partial [Trifolium pratense]       98   4e-21
dbj|GAU27362.1| hypothetical protein TSUD_55150 [Trifolium subte...    98   4e-21
ref|XP_020218762.1| uncharacterized protein LOC109801993 [Cajanu...    94   7e-21
dbj|GAU42011.1| hypothetical protein TSUD_236780 [Trifolium subt...    97   1e-20
dbj|GAU51073.1| hypothetical protein TSUD_13180, partial [Trifol...    97   1e-20
dbj|GAU30548.1| hypothetical protein TSUD_65580 [Trifolium subte...    97   1e-20
ref|XP_015949749.1| uncharacterized protein LOC107474627 [Arachi...    95   1e-20
dbj|GAU24562.1| hypothetical protein TSUD_149030 [Trifolium subt...    97   1e-20
dbj|GAU39516.1| hypothetical protein TSUD_68800 [Trifolium subte...    97   1e-20

>dbj|GAU33096.1| hypothetical protein TSUD_259470 [Trifolium subterraneum]
          Length = 1237

 Score =  108 bits (270), Expect = 9e-25
 Identities = 56/127 (44%), Positives = 79/127 (62%)
 Frame = -1

Query: 391 RTVQSKPPQSQDCRFLDLDLRLYDPAREEKRPRPVDDLKEIRIGPADHQKTKIGAGLALD 212
           R  + K    Q C FLDLD R ++   EE RPRP +++KEI+IG    Q++K+G  L+  
Sbjct: 355 RAERRKIVNDQRCNFLDLDPRDFNREEEEWRPRPAEEVKEIQIGAEPGQRSKVGTSLSRI 414

Query: 211 TKRVFIKFLADNVDVFAWSPQDMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIK 32
            +      L  N+D+FAWS +DM GIDPNFI H L++N + + + Q KRK   EK +AI+
Sbjct: 415 MEEELKTTLRKNIDLFAWSAKDMSGIDPNFICHHLAVNSNAKVLQQRKRKMSLEKQKAIE 474

Query: 31  EEANKLL 11
           EE  KL+
Sbjct: 475 EETQKLV 481


>gb|PNX85211.1| gag-pol polyprotein, partial [Trifolium pratense]
          Length = 509

 Score =  106 bits (264), Expect = 4e-24
 Identities = 52/105 (49%), Positives = 70/105 (66%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  ++R  P++DL++++IG   HQ T +G  L+L  K   IK L +N D+FAW P 
Sbjct: 36  DPREEFQDRRVSPIEDLEQVQIGEHPHQTTSLGTALSLQEKEKIIKILKNNADLFAWKPS 95

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMPGID   I H+LS++PSI+ +SQ KRK GEE+  AI EE  KL
Sbjct: 96  DMPGIDERVITHKLSISPSIKLISQRKRKVGEERRAAIAEEVAKL 140


>gb|PNX88251.1| gag-pol polyprotein [Trifolium pratense]
          Length = 347

 Score =  103 bits (258), Expect = 6e-24
 Identities = 51/105 (48%), Positives = 68/105 (64%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  ++R  P++DLK+++IG   HQ T +G  L+   +   IK L DN D+FAW P 
Sbjct: 68  DPREEFQDRRLSPIEDLKQVQIGEHPHQTTSLGTTLSYQEREKIIKILKDNADLFAWKPS 127

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMPGID   I H+LS++PS + +SQ KRK GEE+  AI EE  KL
Sbjct: 128 DMPGIDEGVITHKLSISPSTKPISQRKRKVGEERRVAIAEEVEKL 172


>gb|PNX86185.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 465

 Score =  103 bits (256), Expect = 4e-23
 Identities = 50/105 (47%), Positives = 68/105 (64%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  ++R  P++DLK+++IG   HQ T +G  L+   +   IK L DN D+FAW P 
Sbjct: 243 DPREEFQDRRVSPIEDLKQVQIGEHPHQTTSLGTTLSYQEREKIIKILKDNADLFAWKPS 302

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMPGID   I H+LS++P+ + +SQ KRK GEE+  AI EE  KL
Sbjct: 303 DMPGIDEGVITHKLSISPNTKPISQRKRKVGEERRVAIAEEVEKL 347


>gb|PNX91900.1| gag-pol polyprotein [Trifolium pratense]
          Length = 750

 Score =  101 bits (251), Expect = 3e-22
 Identities = 49/105 (46%), Positives = 69/105 (65%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E   +R  P+++L++I+IG   HQ T +G  L+   +   IK L DN D+FAW+P 
Sbjct: 510 DPREELHNRRVSPIEELEQIQIGGQPHQTTNLGTALSAQERERIIKILKDNADLFAWTPS 569

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMPGID + I H+LS++P I+ ++Q KRK GEE+  AI EE  KL
Sbjct: 570 DMPGIDESVITHKLSISPDIKPIAQRKRKVGEERRAAIAEEVAKL 614


>gb|KYP60970.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 258

 Score = 97.4 bits (241), Expect = 4e-22
 Identities = 48/99 (48%), Positives = 65/99 (65%)
 Frame = -1

Query: 307 EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQDMPGIDP 128
           + RP P +D+KE+ I   + +  KIG  L L+ +   I+ L DNV  FAW   DMPGIDP
Sbjct: 5   DHRPAPAEDVKEVEI--MEGRNVKIGTSLTLEDEEKLIRVLKDNVSAFAWHSSDMPGIDP 62

Query: 127 NFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLL 11
           +F+ H+L+L+PS + V Q +RK GEEK RAI EE  KL+
Sbjct: 63  DFLCHKLALDPSAKAVIQKRRKFGEEKRRAITEETQKLV 101


>dbj|GAU32939.1| hypothetical protein TSUD_153560 [Trifolium subterraneum]
          Length = 1382

 Score =  100 bits (250), Expect = 5e-22
 Identities = 53/105 (50%), Positives = 69/105 (65%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  E R  P++DL+EI+IG   HQ T IG  L  + K   I+ L  NVD+FAW P+
Sbjct: 491 DPREEFHEGRVSPIEDLEEIKIGSEPHQVTNIGTTLPAEEKDKVIETLRRNVDLFAWHPK 550

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMPGID + I H+L+++P  + VSQ KRK GEE+  AI EE +KL
Sbjct: 551 DMPGIDESIITHKLAIHPDAKPVSQRKRKVGEERRAAIDEEVDKL 595


>gb|PNY11290.1| gag-pol polyprotein [Trifolium pratense]
          Length = 715

 Score = 99.4 bits (246), Expect = 1e-21
 Identities = 50/105 (47%), Positives = 69/105 (65%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  ++R  P+++L++++IG   HQ T +G  L    +   +K L  NVD+FAW P 
Sbjct: 141 DPREEFQDRRVSPIEELEQVQIGEEPHQTTNLGTTLLHSEREKIMKILKKNVDLFAWKPS 200

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMPGID + I H+LS++PSI+ VSQ KRK GEE+  AI EE  KL
Sbjct: 201 DMPGIDESVITHKLSISPSIKPVSQRKRKAGEERRVAIVEEVAKL 245


>gb|PNY05502.1| gag-pol polyprotein, partial [Trifolium pratense]
          Length = 1734

 Score = 99.0 bits (245), Expect = 2e-21
 Identities = 49/105 (46%), Positives = 70/105 (66%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  ++R  P+++L+ ++IG A+HQ T IG  L  + K   +  L  NVD+FAW P 
Sbjct: 656 DPREEFKDQRVSPIENLEPVQIGSAEHQVTYIGTQLNNEEKERIVATLRSNVDLFAWKPS 715

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMPGID + I H+L+++P ++ VSQ KRK GEE+  AI EE +KL
Sbjct: 716 DMPGIDESIITHKLAISPKVKPVSQRKRKVGEERRTAIDEEVSKL 760


>ref|XP_016191765.1| uncharacterized protein LOC107632612 [Arachis ipaensis]
          Length = 403

 Score = 97.8 bits (242), Expect = 2e-21
 Identities = 51/114 (44%), Positives = 71/114 (62%), Gaps = 1/114 (0%)
 Frame = -1

Query: 340 LDLRLYDPARE-EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVF 164
           L L   DP  + ++RP+PVDDL+++ +     Q T IG  L  + +   IK L DN D+F
Sbjct: 123 LSLAELDPRNDFQERPQPVDDLQKVPLTRKADQFTYIGRALEGEEQSKLIKVLQDNADLF 182

Query: 163 AWSPQDMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLLAVD 2
           AW+P DMPGIDP  I H+L++N +I+ ++Q KR  G EK +A  EE  KLL  D
Sbjct: 183 AWTPADMPGIDPRVICHKLAINKTIRPIAQKKRNLGAEKVKAALEETKKLLNAD 236


>gb|KYP39861.1| hypothetical protein KK1_038827 [Cajanus cajan]
          Length = 206

 Score = 94.4 bits (233), Expect = 2e-21
 Identities = 47/112 (41%), Positives = 73/112 (65%)
 Frame = -1

Query: 346 LDLDLRLYDPAREEKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDV 167
           +DLD R+     +E+R  P++D K I++G +D Q T +G  L+   K    + + DN D+
Sbjct: 1   MDLDPRI----DQEERVEPIEDKKSIQVGASDSQLTYLGTILSEQEKSAIGQVILDNKDL 56

Query: 166 FAWSPQDMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLL 11
           FAW P DMPGIDP+F+ H+LS++   + ++Q +RK GEE+  AI+ E +KLL
Sbjct: 57  FAWHPSDMPGIDPDFLCHKLSISKEEKPIAQRRRKAGEERKAAIEVEVSKLL 108


>gb|PNX78768.1| gag-pol polyprotein, partial [Trifolium pratense]
          Length = 713

 Score = 98.2 bits (243), Expect = 4e-21
 Identities = 50/105 (47%), Positives = 68/105 (64%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  ++R  P+++L++I+IG   HQ T +G  L    K   +K L +NVD+FAW P 
Sbjct: 249 DPREEFQDRRFNPIEELEQIQIGVEPHQTTNLGRELLPTDKARIVKILKENVDLFAWKPS 308

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMP ID + I H+LS++P I+ VSQ KRK GEE+  AI EE  KL
Sbjct: 309 DMPDIDESVITHKLSISPKIKPVSQRKRKVGEERRAAITEEVTKL 353


>dbj|GAU27362.1| hypothetical protein TSUD_55150 [Trifolium subterraneum]
          Length = 1410

 Score = 98.2 bits (243), Expect = 4e-21
 Identities = 51/107 (47%), Positives = 68/107 (63%), Gaps = 2/107 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  +KR  P++DL+ I+IG A H+ T +G  L    K   I+ L  NVD+FAW P 
Sbjct: 602 DPREEFQDKRVSPIEDLEPIQIGEAPHELTNMGTHLDEGEKEKIIEILRKNVDLFAWKPS 661

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLLA 8
           DMPGID   I H+L++ P+ + VSQ KRK GEE+  AI EE +K L+
Sbjct: 662 DMPGIDETIITHKLAIAPNSKPVSQRKRKVGEERRTAIDEEVHKFLS 708


>ref|XP_020218762.1| uncharacterized protein LOC109801993 [Cajanus cajan]
          Length = 259

 Score = 94.4 bits (233), Expect = 7e-21
 Identities = 49/117 (41%), Positives = 77/117 (65%)
 Frame = -1

Query: 361 QDCRFLDLDLRLYDPAREEKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLA 182
           Q+   +DLD R+   +  +++P PV+D+KE+ +   + +K KIG  L+ +     I+ L 
Sbjct: 80  QEVHSVDLDPRV---SHFDRQPAPVEDVKEVIV--MEGRKVKIGTSLSPEDAVKLIEVLK 134

Query: 181 DNVDVFAWSPQDMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLL 11
            N+  FAW  +DMPG+DP+F+ H+L+++PS + V Q +RK GEEK +AI EE NKLL
Sbjct: 135 VNISAFAWHAKDMPGVDPDFMCHRLAIDPSAKPVIQKRRKFGEEKRKAIAEEINKLL 191


>dbj|GAU42011.1| hypothetical protein TSUD_236780 [Trifolium subterraneum]
          Length = 806

 Score = 97.1 bits (240), Expect = 1e-20
 Identities = 49/110 (44%), Positives = 68/110 (61%)
 Frame = -1

Query: 391 RTVQSKPPQSQDCRFLDLDLRLYDPAREEKRPRPVDDLKEIRIGPADHQKTKIGAGLALD 212
           R  + K    Q C FLDLD R ++   EE RPRP +++KEI+IG    Q+TK+G  L   
Sbjct: 630 RAERRKIVNDQRCNFLDLDPRDFNKDEEEWRPRPAEEVKEIQIGAEPGQRTKVGTSLLRT 689

Query: 211 TKRVFIKFLADNVDVFAWSPQDMPGIDPNFIYHQLSLNPSIQHVSQSKRK 62
            +      L  N+D+FAWS +DM GIDPNFI H+L++N + + + Q KRK
Sbjct: 690 MEEELKTTLRKNIDLFAWSAKDMLGIDPNFICHRLAVNSNAKVMQQRKRK 739


>dbj|GAU51073.1| hypothetical protein TSUD_13180, partial [Trifolium subterraneum]
          Length = 1053

 Score = 97.1 bits (240), Expect = 1e-20
 Identities = 50/105 (47%), Positives = 67/105 (63%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  ++R  P++DL+ I+IG A H+ T +G  L    K   I+ L  NVD+FAW P 
Sbjct: 76  DPREEFQDRRVSPIEDLEPIQIGEAPHELTNLGTHLDEGEKEKIIEILRKNVDLFAWKPS 135

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMPGID   I H+L++ P+ + VSQ KRK GEE+  AI EE +KL
Sbjct: 136 DMPGIDETIITHKLAIAPNSKPVSQRKRKVGEERRTAIDEEVHKL 180


>dbj|GAU30548.1| hypothetical protein TSUD_65580 [Trifolium subterraneum]
          Length = 1352

 Score = 97.1 bits (240), Expect = 1e-20
 Identities = 50/105 (47%), Positives = 67/105 (63%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  ++R  P++DL+ I+IG A H+ T +G  L    K   I+ L  NVD+FAW P 
Sbjct: 603 DPREEFQDRRVSPIEDLEPIQIGEAPHELTNLGTQLDEGEKEKIIEILRKNVDLFAWKPS 662

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMPGID   I H+L++ P+ + VSQ KRK GEE+  AI EE +KL
Sbjct: 663 DMPGIDETIITHKLAIAPNSKPVSQRKRKVGEERRTAIDEEVHKL 707


>ref|XP_015949749.1| uncharacterized protein LOC107474627 [Arachis duranensis]
          Length = 328

 Score = 95.1 bits (235), Expect = 1e-20
 Identities = 45/104 (43%), Positives = 64/104 (61%)
 Frame = -1

Query: 322 DPAREEKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQDM 143
           DP  +  RP P DDL+++ +   D Q T +G+      K   +  L  N D+FAW+P DM
Sbjct: 200 DPRSDNHRPTPTDDLEKVILNQ-DEQFTNVGSAFIAGQKTDLMALLKTNADLFAWTPADM 258

Query: 142 PGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKLL 11
           PGIDPNFI H+L+++P+ Q + Q KR  G+E+ RA + E  KLL
Sbjct: 259 PGIDPNFICHKLAVHPNAQSIRQKKRNLGDERRRAAEAETKKLL 302


>dbj|GAU24562.1| hypothetical protein TSUD_149030 [Trifolium subterraneum]
          Length = 1406

 Score = 96.7 bits (239), Expect = 1e-20
 Identities = 52/105 (49%), Positives = 67/105 (63%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  E R  P++DL+EI IG   HQ T IG  L  + K   I+ L  NVD+FAW P+
Sbjct: 409 DPREEFHEGRVSPIEDLEEITIGSEPHQVTNIGTTLPPEEKDKVIETLRRNVDLFAWHPK 468

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
            MPGID + I H+L+++P  + VSQ KRK GEE+  AI EE +KL
Sbjct: 469 HMPGIDESIITHKLAIHPDAKPVSQQKRKVGEERRSAIDEEVDKL 513


>dbj|GAU39516.1| hypothetical protein TSUD_68800 [Trifolium subterraneum]
          Length = 1537

 Score = 96.7 bits (239), Expect = 1e-20
 Identities = 49/105 (46%), Positives = 68/105 (64%), Gaps = 2/105 (1%)
 Frame = -1

Query: 322 DPARE--EKRPRPVDDLKEIRIGPADHQKTKIGAGLALDTKRVFIKFLADNVDVFAWSPQ 149
           DP  E  ++R  P++DL+ I+IG A H+ T +G  L  + K   I+ L  NVD+FAW P 
Sbjct: 580 DPREEFQDRRVSPIEDLEPIQIGEAPHELTNLGTHLDEEEKEKIIEILRKNVDLFAWKPS 639

Query: 148 DMPGIDPNFIYHQLSLNPSIQHVSQSKRK*GEEKTRAIKEEANKL 14
           DMPGID   I H+L++ P+ + VSQ KRK GEE+  +I EE +KL
Sbjct: 640 DMPGIDETIITHKLAIVPNSKPVSQRKRKVGEERRTSIDEEVHKL 684

