BLASTX nr result

ID: Astragalus22_contig00033480 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00033480
         (330 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006596874.1| PREDICTED: uncharacterized protein LOC102667...   132   2e-36
gb|KHN44086.1| Retrovirus-related Pol polyprotein from transposo...   128   1e-33
gb|KHN42350.1| Retrovirus-related Pol polyprotein from transposo...   128   1e-33
gb|KHN31292.1| Retrovirus-related Pol polyprotein from transposo...   128   1e-33
gb|KHN43564.1| Retrovirus-related Pol polyprotein from transposo...   128   1e-33
ref|XP_006591608.1| PREDICTED: uncharacterized protein LOC102662...   122   3e-33
ref|XP_006593150.1| PREDICTED: uncharacterized protein LOC102663...   123   6e-33
ref|XP_019433835.1| PREDICTED: uncharacterized protein LOC109340...   122   6e-33
ref|XP_006595218.1| PREDICTED: uncharacterized protein LOC102669...   119   3e-32
ref|XP_006580666.1| PREDICTED: uncharacterized protein LOC102668...   125   4e-32
dbj|GAU50483.1| hypothetical protein TSUD_409690 [Trifolium subt...   128   5e-32
ref|XP_006605124.1| PREDICTED: uncharacterized protein LOC102670...   119   7e-32
ref|XP_006593259.1| PREDICTED: uncharacterized protein LOC102670...   119   9e-32
ref|XP_004517112.1| PREDICTED: uncharacterized protein LOC101507...   117   1e-31
gb|KYP67178.1| Retrovirus-related Pol polyprotein from transposo...   126   2e-31
ref|XP_004496824.1| PREDICTED: uncharacterized protein LOC101496...   121   2e-31
gb|KHN19736.1| hypothetical protein glysoja_024837, partial [Gly...   119   2e-31
gb|KYP66838.1| Retrovirus-related Pol polyprotein from transposo...   125   3e-31
ref|XP_019430617.1| PREDICTED: uncharacterized protein LOC109337...   122   4e-31
gb|KHN30402.1| Retrovirus-related Pol polyprotein from transposo...   122   5e-31

>ref|XP_006596874.1| PREDICTED: uncharacterized protein LOC102667115 [Glycine max]
          Length = 227

 Score =  132 bits (332), Expect = 2e-36
 Identities = 65/110 (59%), Positives = 84/110 (76%), Gaps = 1/110 (0%)
 Frame = +3

Query: 3   NIAKG-DSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLA 179
           NIA+  D +DE  M++V TSD     + WY DTGCSNHMTG+K+WL DFD ++R+KI+L 
Sbjct: 80  NIAQDEDLEDEHPMMLVATSDSNPHSKAWYLDTGCSNHMTGHKEWLGDFDENRRSKIKLV 139

Query: 180 DDRYLTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
           D + L+AEGMGNIL+Q  DGK  LI++  YV GM+ NLMSVGQL+EKG+S
Sbjct: 140 DSKTLSAEGMGNILIQRKDGKIKLIKNGLYVPGMRCNLMSVGQLVEKGFS 189


>gb|KHN44086.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 342

 Score =  128 bits (322), Expect = 1e-33
 Identities = 61/109 (55%), Positives = 81/109 (74%)
 Frame = +3

Query: 3   NIAKGDSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLAD 182
           N+A+G  D   V++M TT +D V  E WY D+GCSNHMT +++WL  FD SK+T I+LAD
Sbjct: 233 NVAQG-KDPHTVLMMATTCEDKVQNEEWYLDSGCSNHMTAHREWLTSFDNSKKTSIKLAD 291

Query: 183 DRYLTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
           +R L AEG+GNI+++  DGK V+IE V YV  M  NLMS+GQL+EKG+S
Sbjct: 292 NRKLAAEGIGNIVIRGNDGKRVIIEKVLYVPEMNCNLMSIGQLVEKGFS 340


>gb|KHN42350.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 342

 Score =  128 bits (322), Expect = 1e-33
 Identities = 61/109 (55%), Positives = 81/109 (74%)
 Frame = +3

Query: 3   NIAKGDSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLAD 182
           N+A+G  D   V++M TT +D V  E WY D+GCSNHMT +++WL  FD SK+T I+LAD
Sbjct: 233 NVAQG-KDPHTVLMMATTCEDKVQNEEWYLDSGCSNHMTAHREWLTSFDNSKKTSIKLAD 291

Query: 183 DRYLTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
           +R L AEG+GNI+++  DGK V+IE V YV  M  NLMS+GQL+EKG+S
Sbjct: 292 NRKLAAEGIGNIVIRGNDGKRVIIEKVLYVPEMNCNLMSIGQLVEKGFS 340


>gb|KHN31292.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 342

 Score =  128 bits (322), Expect = 1e-33
 Identities = 61/109 (55%), Positives = 81/109 (74%)
 Frame = +3

Query: 3   NIAKGDSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLAD 182
           N+A+G  D   V++M TT +D V  E WY D+GCSNHMT +++WL  FD SK+T I+LAD
Sbjct: 233 NVAQG-KDPHTVLMMATTCEDKVQNEEWYLDSGCSNHMTAHREWLTSFDNSKKTSIKLAD 291

Query: 183 DRYLTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
           +R L AEG+GNI+++  DGK V+IE V YV  M  NLMS+GQL+EKG+S
Sbjct: 292 NRKLAAEGIGNIVIRGNDGKRVIIEKVLYVPEMNCNLMSIGQLVEKGFS 340


>gb|KHN43564.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 342

 Score =  128 bits (321), Expect = 1e-33
 Identities = 61/109 (55%), Positives = 81/109 (74%)
 Frame = +3

Query: 3   NIAKGDSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLAD 182
           N+A+G  D   V++M TT +D V  E WY D+GCSNHMT +++WL  FD SK+T I+LAD
Sbjct: 233 NVAQG-KDPHTVLMMATTYEDKVQNEEWYLDSGCSNHMTAHREWLTSFDNSKKTSIKLAD 291

Query: 183 DRYLTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
           +R L AEG+GNI+++  DGK V+IE V YV  M  NLMS+GQL+EKG+S
Sbjct: 292 NRKLAAEGIGNIVIRGNDGKRVIIEKVLYVPEMNCNLMSIGQLVEKGFS 340


>ref|XP_006591608.1| PREDICTED: uncharacterized protein LOC102662140 [Glycine max]
          Length = 176

 Score =  122 bits (307), Expect = 3e-33
 Identities = 56/106 (52%), Positives = 83/106 (78%)
 Frame = +3

Query: 12  KGDSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDRY 191
           + DS+++P+MLM++ ++     EIWY ++GCSNHMTG++ WLV+FD  K++K+R AD++ 
Sbjct: 7   EADSEEKPLMLMMSHNN-----EIWYINSGCSNHMTGHRDWLVNFDVMKKSKVRFADNKV 61

Query: 192 LTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
           + AEG GN+ V+  DG+  +I DV YV GMKSNL+S+GQLLEKG+S
Sbjct: 62  IQAEGAGNVAVRRLDGRQAMITDVLYVLGMKSNLISMGQLLEKGFS 107


>ref|XP_006593150.1| PREDICTED: uncharacterized protein LOC102663128 [Glycine max]
          Length = 208

 Score =  123 bits (308), Expect = 6e-33
 Identities = 54/104 (51%), Positives = 77/104 (74%)
 Frame = +3

Query: 18  DSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDRYLT 197
           DSD + V+LM TT+ +   V +WY DTGCSNHMTG+++W V+ D   ++KI+ AD+  +T
Sbjct: 104 DSDSDKVLLMTTTNSEEDNVNLWYLDTGCSNHMTGHREWFVNIDDKVKSKIKFADNNSVT 163

Query: 198 AEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
           AEG+G +++Q  DG+   I DV YV  MK+N++S+GQLLEKGYS
Sbjct: 164 AEGIGKVMIQRKDGQHSFINDVLYVPNMKNNMLSLGQLLEKGYS 207


>ref|XP_019433835.1| PREDICTED: uncharacterized protein LOC109340568, partial [Lupinus
           angustifolius]
          Length = 197

 Score =  122 bits (307), Expect = 6e-33
 Identities = 51/102 (50%), Positives = 79/102 (77%)
 Frame = +3

Query: 24  DDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDRYLTAE 203
           D+ PV+LM+ T     + E+WY D+GCSNHMTG++ WLV+F+ +KR+K+R AD+R + AE
Sbjct: 11  DENPVLLMMITDQKDTSEEVWYIDSGCSNHMTGHRDWLVNFNPNKRSKVRFADNRLIQAE 70

Query: 204 GMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
           G G+++++  DGK  ++ DV +V  MK+NL+S+GQL+EKG+S
Sbjct: 71  GTGDVVIRRNDGKKAMLTDVLFVPNMKNNLISLGQLIEKGFS 112


>ref|XP_006595218.1| PREDICTED: uncharacterized protein LOC102669724 [Glycine max]
          Length = 132

 Score =  119 bits (297), Expect = 3e-32
 Identities = 56/94 (59%), Positives = 68/94 (72%)
 Frame = +3

Query: 45  MVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDRYLTAEGMGNILV 224
           MVT     +  E WY D GCSNHM G K WL+D D SK+T+I+LAD + LT EGMGNI+ 
Sbjct: 1   MVTIGGGDLHSEAWYLDIGCSNHMVGRKDWLIDLDTSKKTQIKLADSKALTVEGMGNIVT 60

Query: 225 QMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGY 326
           +  DGK  LIE+V +V GMK NLMSVGQL+EKG+
Sbjct: 61  KRKDGKVALIENVPFVPGMKCNLMSVGQLIEKGF 94


>ref|XP_006580666.1| PREDICTED: uncharacterized protein LOC102668826 [Glycine max]
          Length = 415

 Score =  125 bits (315), Expect = 4e-32
 Identities = 63/101 (62%), Positives = 77/101 (76%), Gaps = 1/101 (0%)
 Frame = +3

Query: 3   NIAKG-DSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLA 179
           NIA+  DS+DE  +++V TSD     E WY DTGCSNHM  +K+WL DFD ++R+KIRLA
Sbjct: 304 NIAQDEDSEDEHPVMLVATSDSNPHSEAWYLDTGCSNHMIDHKEWLGDFDENRRSKIRLA 363

Query: 180 DDRYLTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSV 302
           D R L+AEGMGNIL+Q  DGKT LIE+V YV GM+ NLMSV
Sbjct: 364 DSRTLSAEGMGNILIQRKDGKTTLIENVLYVPGMRCNLMSV 404


>dbj|GAU50483.1| hypothetical protein TSUD_409690 [Trifolium subterraneum]
          Length = 1073

 Score =  128 bits (321), Expect = 5e-32
 Identities = 59/110 (53%), Positives = 81/110 (73%), Gaps = 4/110 (3%)
 Frame = +3

Query: 9   AKGDSDDEPVMLMVTT----SDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRL 176
           +  DSD+  V LM+ T    SDD    + W+ DTGCSNHMT +K+WL+D + SK++K+R 
Sbjct: 312 SSSDSDENEVKLMMVTLSEVSDDHSHTDYWFLDTGCSNHMTSHKEWLIDINPSKKSKVRF 371

Query: 177 ADDRYLTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGY 326
           ADDR L AEGMG +++   DGK V++EDV YV GMKSNL+S+GQL++KG+
Sbjct: 372 ADDRTLHAEGMGKMVITRDDGKNVIMEDVLYVPGMKSNLLSIGQLIQKGF 421


>ref|XP_006605124.1| PREDICTED: uncharacterized protein LOC102670439 [Glycine max]
          Length = 159

 Score =  119 bits (297), Expect = 7e-32
 Identities = 51/94 (54%), Positives = 74/94 (78%)
 Frame = +3

Query: 45  MVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDRYLTAEGMGNILV 224
           MVTT+D+   ++ WY DTGCS HMTG+K+WLV+FD SK+ KI+ AD R + AEG+GN+++
Sbjct: 1   MVTTADEMCTIDEWYLDTGCSTHMTGHKEWLVNFDASKKNKIKFADGRAMLAEGVGNVMI 60

Query: 225 QMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGY 326
           +M +G    I  V+YV G++SNL+S+GQL+EKG+
Sbjct: 61  KMPNGTQYCISSVFYVPGLESNLLSLGQLVEKGH 94


>ref|XP_006593259.1| PREDICTED: uncharacterized protein LOC102670253 [Glycine max]
          Length = 181

 Score =  119 bits (298), Expect = 9e-32
 Identities = 53/106 (50%), Positives = 76/106 (71%)
 Frame = +3

Query: 12  KGDSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDRY 191
           + DS+++ ++LM+ T  +    EIWY D+GCSNHMTG++ WLV+FD  K++K+R  DDR 
Sbjct: 7   EADSEEQSLLLMMITDSESQNNEIWYIDSGCSNHMTGHRDWLVNFDAMKKSKVRFVDDRV 66

Query: 192 LTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
           + AEG GN+ V+  DG+  +   V YV GMKSNL+S+ QLLE G+S
Sbjct: 67  IQAEGEGNVAVRRLDGRQAMFTYVLYVLGMKSNLISMSQLLENGFS 112


>ref|XP_004517112.1| PREDICTED: uncharacterized protein LOC101507399, partial [Cicer
           arietinum]
          Length = 132

 Score =  117 bits (293), Expect = 1e-31
 Identities = 53/106 (50%), Positives = 73/106 (68%)
 Frame = +3

Query: 9   AKGDSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDR 188
           A  + DDEPV+LMVTT +D V   +WY DTGC  HM+G K W ++ D S ++K++ A+  
Sbjct: 13  AHEEQDDEPVILMVTTKEDEVGTNLWYLDTGCLTHMSGRKDWFINLDESMKSKVKFANSS 72

Query: 189 YLTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGY 326
            L AEG+G +L+Q  +GK   I +  +V G+K NL+SVGQLLEKGY
Sbjct: 73  SLMAEGVGEVLIQNKNGKQSKISEGLFVPGLKCNLLSVGQLLEKGY 118


>gb|KYP67178.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Cajanus cajan]
          Length = 963

 Score =  126 bits (317), Expect = 2e-31
 Identities = 57/106 (53%), Positives = 80/106 (75%)
 Frame = +3

Query: 12  KGDSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDRY 191
           + +S+++P+MLM+ T+ +    E WY D+GCSNHMTG + WLV+FD  K++ +R AD+R 
Sbjct: 45  EANSEEQPLMLMMITNPESHNNETWYIDSGCSNHMTGYRDWLVNFDAKKKSTVRFADNRV 104

Query: 192 LTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
           + AEG GN+LV   DG+  +I DV YV GMKSNL+S+GQLLEKG+S
Sbjct: 105 IQAEGTGNVLVTRQDGRQAVIADVLYVPGMKSNLISMGQLLEKGFS 150


>ref|XP_004496824.1| PREDICTED: uncharacterized protein LOC101496201 [Cicer arietinum]
          Length = 298

 Score =  121 bits (304), Expect = 2e-31
 Identities = 53/99 (53%), Positives = 77/99 (77%)
 Frame = +3

Query: 30  EPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDRYLTAEGM 209
           E  +LM TT+++     +W+ DTGCSNHMT +K+WLVD D S+++KIR  D R L AEG 
Sbjct: 200 ETALLMATTNEEHSPSHVWFLDTGCSNHMTSHKEWLVDIDKSRKSKIRFVDYRTLEAEGA 259

Query: 210 GNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGY 326
           GN++++  +GKT++IE+V YV GMKSNL+S+GQL++KG+
Sbjct: 260 GNMVIKRRNGKTLMIENVLYVPGMKSNLLSIGQLIQKGF 298


>gb|KHN19736.1| hypothetical protein glysoja_024837, partial [Glycine soja]
          Length = 197

 Score =  119 bits (297), Expect = 2e-31
 Identities = 53/104 (50%), Positives = 75/104 (72%)
 Frame = +3

Query: 18  DSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDRYLT 197
           DSD + V+LM TT+ +   V +WY DTGCSNHMTG+++W V+ D   ++KI+ A+   +T
Sbjct: 74  DSDSDKVLLMATTNSEEDNVNLWYLDTGCSNHMTGHREWFVNIDDKVKSKIKFANYNSVT 133

Query: 198 AEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
            EG+G +++Q  DG+   I DV YV  MK+NL+S+GQLLEKGYS
Sbjct: 134 VEGIGKVMIQRKDGQHSFINDVLYVPNMKNNLLSLGQLLEKGYS 177


>gb|KYP66838.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 1317

 Score =  125 bits (315), Expect = 3e-31
 Identities = 56/107 (52%), Positives = 80/107 (74%)
 Frame = +3

Query: 9   AKGDSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDR 188
           ++ +S+++P+MLM+ T+ +    E WY D+GCSNHMTG + WLV+FD  K++ +R AD+R
Sbjct: 279 SEANSEEQPLMLMMITNPESHKNETWYIDSGCSNHMTGYRDWLVNFDAKKKSTVRFADNR 338

Query: 189 YLTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
            + AEG GN+LV   DG+  +I DV YV GMKSN +S+GQLLEKG+S
Sbjct: 339 VIQAEGTGNVLVTRQDGRQAVIADVLYVPGMKSNFISMGQLLEKGFS 385


>ref|XP_019430617.1| PREDICTED: uncharacterized protein LOC109337963 [Lupinus
           angustifolius]
          Length = 380

 Score =  122 bits (306), Expect = 4e-31
 Identities = 55/107 (51%), Positives = 77/107 (71%)
 Frame = +3

Query: 9   AKGDSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDR 188
           A+ DSD EP+ LMVTTS      E WY D+GC NHMT +K+WL +FD SK++K++ ADD 
Sbjct: 222 AQEDSDQEPLNLMVTTSGGDSQTESWYLDSGCLNHMTKHKEWLTNFDSSKKSKVKFADDS 281

Query: 189 YLTAEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGYS 329
            L  EGMGN++++  +G   +I +V YV  MK NL+S+GQL++KG+S
Sbjct: 282 SLEVEGMGNVIIKRQNGSKAMITEVLYVPYMKCNLLSIGQLVDKGFS 328


>gb|KHN30402.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 371

 Score =  122 bits (305), Expect = 5e-31
 Identities = 55/103 (53%), Positives = 75/103 (72%)
 Frame = +3

Query: 18  DSDDEPVMLMVTTSDDGVAVEIWYKDTGCSNHMTGNKQWLVDFDGSKRTKIRLADDRYLT 197
           D D E V+LMVTT  +G +   WY DTGCS HMTG ++W ++ D S +++++ ADDR LT
Sbjct: 208 DDDTEQVLLMVTTQIEGASDNCWYLDTGCSTHMTGRREWFLNLDQSVKSQVKFADDRILT 267

Query: 198 AEGMGNILVQMTDGKTVLIEDVWYVHGMKSNLMSVGQLLEKGY 326
           AEG+G +L++  DG    I DV +V GMKSNL+S+GQLLEKG+
Sbjct: 268 AEGIGKVLIKTKDGGQSCITDVLFVPGMKSNLLSLGQLLEKGF 310

