BLASTX nr result

ID: Astragalus22_contig00035717 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00035717
         (413 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KHN05285.1| Retrovirus-related Pol polyprotein from transposo...    61   5e-08
dbj|BAB10837.1| retroelement pol polyprotein-like [Arabidopsis t...    60   1e-07
gb|KYP37856.1| Retrovirus-related Pol polyprotein from transposo...    59   1e-07
gb|PNX92344.1| retrovirus-related Pol polyprotein from transposo...    59   2e-07
gb|AJY78067.1| putative polyprotein [Glycine max]                      59   3e-07
dbj|GAU31769.1| hypothetical protein TSUD_22150 [Trifolium subte...    59   3e-07
ref|XP_021727452.1| uncharacterized protein LOC110694595 [Chenop...    58   4e-07
gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis tha...    58   6e-07
gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arab...    58   6e-07
ref|XP_018508380.1| PREDICTED: uncharacterized protein LOC108868...    57   7e-07
ref|XP_010256211.1| PREDICTED: uncharacterized protein LOC104596...    57   8e-07
ref|XP_022557786.1| uncharacterized protein LOC111205820 [Brassi...    57   8e-07
ref|XP_022549636.1| uncharacterized protein LOC111201669 [Brassi...    57   8e-07
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi...    57   8e-07
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...    57   8e-07
ref|XP_021749717.1| uncharacterized protein LOC110715444 [Chenop...    57   1e-06
gb|PKI48793.1| hypothetical protein CRG98_030835 [Punica granatum]     57   1e-06
gb|PKI78362.1| hypothetical protein CRG98_001305 [Punica granatum]     57   1e-06
ref|XP_013601587.1| PREDICTED: uncharacterized protein LOC106309...    57   1e-06
ref|XP_016690731.1| PREDICTED: uncharacterized protein LOC107907...    56   2e-06

>gb|KHN05285.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 1346

 Score = 60.8 bits (146), Expect = 5e-08
 Identities = 34/92 (36%), Positives = 51/92 (55%), Gaps = 20/92 (21%)
 Frame = +1

Query: 127 QLQLQNFISLIERQFGESVV--------------------IIQQTSCIDTAEHNGRVERR 246
           +L +QNFI  I+ Q+  SV                     I+ QTSC+D+ + NGRVER+
Sbjct: 535 RLHVQNFIHFIKTQYNHSVKSIRTDNGPEFLMPDFYASKGILHQTSCVDSPQQNGRVERK 594

Query: 247 HRSILNVPSALMIQSFLLKFYEHCCQVNHVIF 342
           H+ ILN+  AL++QS L K +  C  V+H ++
Sbjct: 595 HQQILNIGRALLVQSNLPKSF-WCYAVSHAVY 625


>dbj|BAB10837.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1462

 Score = 60.1 bits (144), Expect = 1e-07
 Identities = 34/85 (40%), Positives = 47/85 (55%), Gaps = 22/85 (25%)
 Frame = +1

Query: 130 LQLQNFISLIERQFGESVV---------------------IIQQTSCIDTAEHNGRVERR 246
           + L+NFISL+ERQF   +                      II +TSC+ T + NGRVER+
Sbjct: 571 MHLKNFISLVERQFSTKIKTIRSDNGTEFVCLSSFFVDHGIIHETSCVGTPQQNGRVERK 630

Query: 247 HRSILNVPSALMIQSFL-LKFYEHC 318
           HR ILNV  AL  Q+ L ++F+ +C
Sbjct: 631 HRHILNVARALRFQARLPIEFWSYC 655


>gb|KYP37856.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 283

 Score = 59.3 bits (142), Expect = 1e-07
 Identities = 35/89 (39%), Positives = 48/89 (53%), Gaps = 20/89 (22%)
 Frame = +1

Query: 136 LQNFISLIERQFGESVV--------------------IIQQTSCIDTAEHNGRVERRHRS 255
           +QNF++LIE QF  ++                     IIQQTSC+ T + NGRVER+H+ 
Sbjct: 145 IQNFVALIENQFETTIKCIRSDNGLEFLLKDFFSSKGIIQQTSCVYTPQQNGRVERKHQH 204

Query: 256 ILNVPSALMIQSFLLKFYEHCCQVNHVIF 342
           ILNV  ALM QS +   +  C  + H +F
Sbjct: 205 ILNVARALMFQSQIPNNF-WCYAIKHAMF 232


>gb|PNX92344.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 1492

 Score = 58.9 bits (141), Expect = 2e-07
 Identities = 33/83 (39%), Positives = 49/83 (59%), Gaps = 22/83 (26%)
 Frame = +1

Query: 136 LQNFISLIERQFGESVVIIQ---------------------QTSCIDTAEHNGRVERRHR 252
           L+NF+++++RQFG+ V II+                     QTSC+ T++ NGRVER+HR
Sbjct: 627 LRNFLAMVQRQFGKLVKIIRSDNGTEFTCLGAHFAENGIIHQTSCVGTSQQNGRVERKHR 686

Query: 253 SILNVPSALMIQSFL-LKFYEHC 318
            ILNV  AL  Q+ L ++F+  C
Sbjct: 687 HILNVARALRFQANLPIEFWGEC 709


>gb|AJY78067.1| putative polyprotein [Glycine max]
          Length = 886

 Score = 58.5 bits (140), Expect = 3e-07
 Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 20/92 (21%)
 Frame = +1

Query: 127 QLQLQNFISLIERQFGESVV--------------------IIQQTSCIDTAEHNGRVERR 246
           +L +QNFI  I+ Q+  SV                     I+ QT C+D+ + NGRVER+
Sbjct: 631 RLHVQNFIHFIKTQYNHSVKSIRTDNGPEFLMPDFYASKGILHQTLCVDSPQQNGRVERK 690

Query: 247 HRSILNVPSALMIQSFLLKFYEHCCQVNHVIF 342
           H+ ILN+  AL++QS L K +  C  V+H ++
Sbjct: 691 HQQILNIGRALLVQSNLPKSF-WCYAVSHAVY 721


>dbj|GAU31769.1| hypothetical protein TSUD_22150 [Trifolium subterraneum]
          Length = 1372

 Score = 58.5 bits (140), Expect = 3e-07
 Identities = 28/60 (46%), Positives = 40/60 (66%)
 Frame = +1

Query: 130 LQLQNFISLIERQFGESVVIIQQTSCIDTAEHNGRVERRHRSILNVPSALMIQSFLLKFY 309
           ++  N +  +  QF  S  II QTSC++T E NGRVER+H+ +LNV  AL+ Q+ L KF+
Sbjct: 580 VRTDNGVEFLITQFYASKGIIHQTSCVETPEQNGRVERKHQHLLNVGRALLFQAHLPKFF 639


>ref|XP_021727452.1| uncharacterized protein LOC110694595 [Chenopodium quinoa]
          Length = 606

 Score = 58.2 bits (139), Expect = 4e-07
 Identities = 30/62 (48%), Positives = 40/62 (64%)
 Frame = +1

Query: 151 SLIERQFGESVVIIQQTSCIDTAEHNGRVERRHRSILNVPSALMIQSFLLKFYEHCCQVN 330
           S + R+F     I+QQTSC+DT + NGRVER+HR ILNV  AL  Q+ L K++   C + 
Sbjct: 475 SQLVREFCAMKGILQQTSCVDTTQQNGRVERKHRHILNVARALRFQAKLPKYFWGECVMT 534

Query: 331 HV 336
            V
Sbjct: 535 AV 536


>gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis thaliana]
          Length = 1468

 Score = 57.8 bits (138), Expect = 6e-07
 Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 22/86 (25%)
 Frame = +1

Query: 127 QLQLQNFISLIERQFGESVVIIQ---------------------QTSCIDTAEHNGRVER 243
           Q  L++FI+L+ERQF   + I++                     +TSC+ T   NGRVER
Sbjct: 616 QKHLKDFIALVERQFDTEIKIVRSDNGTEFLCMREYFLHKGIAHETSCVGTPHQNGRVER 675

Query: 244 RHRSILNVPSALMIQSFL-LKFYEHC 318
           +HR ILN+  AL  QS+L ++F+  C
Sbjct: 676 KHRHILNIARALRFQSYLPIQFWGEC 701


>gb|AAG09097.1|AC009323_8 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score = 57.8 bits (138), Expect = 6e-07
 Identities = 33/83 (39%), Positives = 47/83 (56%), Gaps = 22/83 (26%)
 Frame = +1

Query: 136 LQNFISLIERQFGESVV---------------------IIQQTSCIDTAEHNGRVERRHR 252
           L+NFI+L+ERQ+  ++                      II +TSC+ T + NGRVER+HR
Sbjct: 609 LKNFIALVERQYTTNIKMIRSDNGSEFICLSDFFAQKGIIHETSCVGTPQQNGRVERKHR 668

Query: 253 SILNVPSALMIQSFL-LKFYEHC 318
            ILNV  AL  QS L ++F+ +C
Sbjct: 669 HILNVARALRFQSGLPIEFWSYC 691


>ref|XP_018508380.1| PREDICTED: uncharacterized protein LOC108868977 [Brassica rapa]
          Length = 487

 Score = 57.4 bits (137), Expect = 7e-07
 Identities = 36/91 (39%), Positives = 49/91 (53%), Gaps = 23/91 (25%)
 Frame = +1

Query: 136 LQNFISLIERQFGESVVIIQ---------------------QTSCIDTAEHNGRVERRHR 252
           +Q F +  E+QFG+ V I++                     QTSC+DT + NGRVER+HR
Sbjct: 263 IQRFCAYSEKQFGQVVQIVRSDNGMEFMCLKDFFQDKGIIHQTSCVDTPQQNGRVERKHR 322

Query: 253 SILNVPSALMIQSFL-LKFY-EHCCQVNHVI 339
            ILNV  AL+ QS + +KF+ E      HVI
Sbjct: 323 HILNVARALLFQSHMPVKFWGEAVTTATHVI 353


>ref|XP_010256211.1| PREDICTED: uncharacterized protein LOC104596654 [Nelumbo nucifera]
          Length = 597

 Score = 57.4 bits (137), Expect = 8e-07
 Identities = 30/64 (46%), Positives = 43/64 (67%), Gaps = 3/64 (4%)
 Frame = +1

Query: 136 LQNFISLIERQFGESVV--IIQQTSCIDTAEHNGRVERRHRSILNVPSALMIQSFL-LKF 306
           L+NF +++ERQF + V   I+ Q+SC+ T + NGRVER+H  ILNV  AL  Q+ L + F
Sbjct: 286 LKNFCAMVERQFNKKVKVGILFQSSCVGTPQQNGRVERKHCHILNVARALRFQAHLPISF 345

Query: 307 YEHC 318
           +  C
Sbjct: 346 WGEC 349


>ref|XP_022557786.1| uncharacterized protein LOC111205820 [Brassica napus]
          Length = 856

 Score = 57.4 bits (137), Expect = 8e-07
 Identities = 34/99 (34%), Positives = 52/99 (52%), Gaps = 22/99 (22%)
 Frame = +1

Query: 88  SSLGFICRSGNLNQLQLQNFISLIERQFGESVV---------------------IIQQTS 204
           SS  F+ +  +L   Q++NF+ +IERQF + V                      +I +TS
Sbjct: 543 SSTDFLLKDCDLVSQQIRNFLVMIERQFSKKVKTIRSDNGTEFTCLSRFFREEGVIHETS 602

Query: 205 CIDTAEHNGRVERRHRSILNVPSALMIQSFL-LKFYEHC 318
           C+ T + NGRVER+HR I NV  AL  Q+ L ++++  C
Sbjct: 603 CVYTPQQNGRVERKHRHIFNVARALRFQANLSIEYWGEC 641


>ref|XP_022549636.1| uncharacterized protein LOC111201669 [Brassica napus]
          Length = 1309

 Score = 57.4 bits (137), Expect = 8e-07
 Identities = 37/103 (35%), Positives = 54/103 (52%), Gaps = 23/103 (22%)
 Frame = +1

Query: 100 FICRSGNLNQLQLQNFISLIERQFGESVVIIQ---------------------QTSCIDT 216
           ++ R  +  ++ LQNF  + E+QFG+SV +++                     QTSC+ T
Sbjct: 530 YLMREKSEVRVVLQNFCKMTEKQFGKSVKMVRSDNGTEFMCLSQFFRENGVLHQTSCVGT 589

Query: 217 AEHNGRVERRHRSILNVPSALMIQSFL-LKFY-EHCCQVNHVI 339
            + NGRVER+HR ILNV  AL+ Q  L  KF+ E      H+I
Sbjct: 590 PQQNGRVERKHRHILNVARALLFQGSLPTKFWGEAVMTATHLI 632


>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score = 57.4 bits (137), Expect = 8e-07
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 22/80 (27%)
 Frame = +1

Query: 136 LQNFISLIERQFGESVVIIQ---------------------QTSCIDTAEHNGRVERRHR 252
           L NF++  E+QFG+SV II+                     QTSC+ T + NGRVER+HR
Sbjct: 615 LTNFLAYTEKQFGKSVKIIRSDNGTEFMCLSSYFKEQGIVHQTSCVGTPQQNGRVERKHR 674

Query: 253 SILNVPSALMIQSFL-LKFY 309
            ILNV  AL+ Q+ L +KF+
Sbjct: 675 HILNVSRALLFQASLPIKFW 694


>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1491

 Score = 57.4 bits (137), Expect = 8e-07
 Identities = 34/80 (42%), Positives = 46/80 (57%), Gaps = 22/80 (27%)
 Frame = +1

Query: 136 LQNFISLIERQFGESVVIIQ---------------------QTSCIDTAEHNGRVERRHR 252
           L NF++  E+QFG+SV II+                     QTSC+ T + NGRVER+HR
Sbjct: 615 LTNFLAYTEKQFGKSVKIIRSDNGTEFMCLSSYFKEQGIVHQTSCVGTPQQNGRVERKHR 674

Query: 253 SILNVPSALMIQSFL-LKFY 309
            ILNV  AL+ Q+ L +KF+
Sbjct: 675 HILNVSRALLFQASLPIKFW 694


>ref|XP_021749717.1| uncharacterized protein LOC110715444 [Chenopodium quinoa]
          Length = 544

 Score = 57.0 bits (136), Expect = 1e-06
 Identities = 32/66 (48%), Positives = 41/66 (62%)
 Frame = +1

Query: 139 QNFISLIERQFGESVVIIQQTSCIDTAEHNGRVERRHRSILNVPSALMIQSFLLKFYEHC 318
           Q F+SL    F     I+ QTSC+DTA+ NGRVER+HR ILNV  AL  Q+ L K++   
Sbjct: 322 QEFLSL--GSFFAKKGILHQTSCVDTAQQNGRVERKHRHILNVARALRFQAKLPKYFWGE 379

Query: 319 CQVNHV 336
           C +  V
Sbjct: 380 CVMTAV 385


>gb|PKI48793.1| hypothetical protein CRG98_030835 [Punica granatum]
          Length = 412

 Score = 56.6 bits (135), Expect = 1e-06
 Identities = 29/48 (60%), Positives = 34/48 (70%), Gaps = 1/48 (2%)
 Frame = +1

Query: 187 IIQQTSCIDTAEHNGRVERRHRSILNVPSALMIQSFL-LKFYEHCCQV 327
           II QTSCIDT + NG+VER+HR ILNV  ALM Q+ L L F+  C  V
Sbjct: 11  IIHQTSCIDTPQQNGQVERKHRHILNVARALMFQASLPLNFWGECISV 58


>gb|PKI78362.1| hypothetical protein CRG98_001305 [Punica granatum]
          Length = 628

 Score = 56.6 bits (135), Expect = 1e-06
 Identities = 29/45 (64%), Positives = 33/45 (73%), Gaps = 1/45 (2%)
 Frame = +1

Query: 187 IIQQTSCIDTAEHNGRVERRHRSILNVPSALMIQSFL-LKFYEHC 318
           II QTSCIDT + NGRVER+HR ILNV  ALM Q+ L L F+  C
Sbjct: 14  IIHQTSCIDTPQQNGRVERKHRHILNVARALMFQASLPLAFWGEC 58


>ref|XP_013601587.1| PREDICTED: uncharacterized protein LOC106309050 [Brassica oleracea
           var. oleracea]
          Length = 652

 Score = 56.6 bits (135), Expect = 1e-06
 Identities = 34/99 (34%), Positives = 52/99 (52%), Gaps = 22/99 (22%)
 Frame = +1

Query: 88  SSLGFICRSGNLNQLQLQNFISLIERQFGESVV---------------------IIQQTS 204
           SS  F+ +  +L   Q++NF+ +IERQF + V                      +I +TS
Sbjct: 297 SSNDFLLKDCDLVSQQIRNFLVMIERQFSKKVKTIRSDNGTEFTCLSRFFREEGVIHETS 356

Query: 205 CIDTAEHNGRVERRHRSILNVPSALMIQSFL-LKFYEHC 318
           C+ T + NGRVER+HR I NV  AL  Q+ L ++++  C
Sbjct: 357 CVYTPQQNGRVERKHRHIFNVARALRFQANLSIEYWGEC 395


>ref|XP_016690731.1| PREDICTED: uncharacterized protein LOC107907947 [Gossypium
           hirsutum]
          Length = 667

 Score = 56.2 bits (134), Expect = 2e-06
 Identities = 33/90 (36%), Positives = 48/90 (53%), Gaps = 21/90 (23%)
 Frame = +1

Query: 136 LQNFISLIERQFGESVVIIQ--------------------QTSCIDTAEHNGRVERRHRS 255
           +QNF +L+E QF   + +I+                    QT+C++T + NG VER+H+ 
Sbjct: 486 IQNFFTLVETQFSSKIKVIRSNNGCEFVIPTFYASKGVIHQTTCVETPQQNGLVERKHQH 545

Query: 256 ILNVPSALMIQSFLLK-FYEHCCQVNHVIF 342
           ILNV  AL+  S LLK F+ H   V H +F
Sbjct: 546 ILNVARALLFHSHLLKHFWGHA--VLHAVF 573


Top