BLASTX nr result

ID: Astragalus24_contig00025801 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00025801
         (702 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX89396.1| pentatricopeptide repeat-containing protein, part...    66   8e-09
dbj|GAU50456.1| hypothetical protein TSUD_373220 [Trifolium subt...    63   9e-08
ref|XP_017420235.1| PREDICTED: uncharacterized protein LOC108330...    62   2e-07
dbj|GAU42300.1| hypothetical protein TSUD_136860 [Trifolium subt...    62   2e-07
gb|OTG24956.1| putative retrotransposon gag domain-containing pr...    61   3e-07
gb|KOM49767.1| hypothetical protein LR48_Vigan08g059400 [Vigna a...    60   1e-06
ref|XP_017438495.1| PREDICTED: uncharacterized protein LOC108344...    60   1e-06
dbj|GAU24602.1| hypothetical protein TSUD_289640 [Trifolium subt...    60   1e-06
gb|KOM47233.1| hypothetical protein LR48_Vigan07g093700 [Vigna a...    59   1e-06
gb|KYP56676.1| Retrovirus-related Pol polyprotein from transposo...    60   1e-06
dbj|GAU40717.1| hypothetical protein TSUD_263670 [Trifolium subt...    60   1e-06
ref|XP_017407717.1| PREDICTED: uncharacterized protein LOC108320...    59   2e-06
dbj|GAU22599.1| hypothetical protein TSUD_134990 [Trifolium subt...    59   2e-06
dbj|GAU37691.1| hypothetical protein TSUD_164940 [Trifolium subt...    59   2e-06
dbj|GAU37038.1| hypothetical protein TSUD_207440 [Trifolium subt...    59   2e-06
ref|XP_017438375.1| PREDICTED: uncharacterized protein LOC108344...    59   3e-06
gb|PNX97977.1| hypothetical protein L195_g021217, partial [Trifo...    59   3e-06
gb|PNY15662.1| retrotransposon-related protein [Trifolium pratense]    59   3e-06
gb|PNY17068.1| retrotransposon-related protein [Trifolium pratense]    58   3e-06
gb|ABN06064.1| RNA-directed DNA polymerase (Reverse transcriptas...    58   6e-06

>gb|PNX89396.1| pentatricopeptide repeat-containing protein, partial [Trifolium
           pratense]
          Length = 407

 Score = 65.9 bits (159), Expect = 8e-09
 Identities = 30/72 (41%), Positives = 44/72 (61%), Gaps = 1/72 (1%)
 Frame = -3

Query: 559 GIQFLSKTEQARRWKERLCFSCGAPFAPG-HRCPEGNLRVMVLADNEDITGDGEIVVLEE 383
           G++  +  E   RW++ LCF CG  + P  H+CPE +LRV++L + E +T DGEIV LEE
Sbjct: 331 GVRSFNNEETEERWRKGLCFKCGGKYHPTLHKCPEKSLRVLILGEGETLTEDGEIVSLEE 390

Query: 382 DQSNGAIHSDEE 347
            +  G    + E
Sbjct: 391 VEVEGEEEEEVE 402


>dbj|GAU50456.1| hypothetical protein TSUD_373220 [Trifolium subterraneum]
          Length = 1463

 Score = 63.2 bits (152), Expect = 9e-08
 Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 5/84 (5%)
 Frame = -3

Query: 571 PRKPGIQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVV 392
           PR  G   LS  E   R K+ LCF CG PF P H+CP+ +LRV+V+ D  +  G+ +++ 
Sbjct: 320 PRDKGFTHLSYNELMERKKKGLCFKCGGPFHPTHQCPDKHLRVLVVDDECEEDGEAKVLA 379

Query: 391 LE----EDQSNGAIHS-DEEHYTH 335
           +E    ED+  G I   D  H  H
Sbjct: 380 VEIEEGEDEEQGEISMLDLHHIAH 403


>ref|XP_017420235.1| PREDICTED: uncharacterized protein LOC108330249 [Vigna angularis]
          Length = 684

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 30/60 (50%), Positives = 41/60 (68%)
 Frame = -3

Query: 562 PGIQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVVLEE 383
           P  +FL + E+ R      CF CG PFAPG+RCPE +LRV++LA++E+   D EI+ LEE
Sbjct: 293 PYPKFLQRKEEGR------CFRCGCPFAPGNRCPEKSLRVLLLAEDEEADVDEEIINLEE 346


>dbj|GAU42300.1| hypothetical protein TSUD_136860 [Trifolium subterraneum]
          Length = 1523

 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 29/65 (44%), Positives = 40/65 (61%)
 Frame = -3

Query: 571 PRKPGIQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVV 392
           PR  G   LS  E   R ++ LCF CG PF P H+CPE  LRV+++ D E+   +GEI+ 
Sbjct: 313 PRDRGFNHLSYNELMERKQKGLCFKCGGPFHPMHQCPEKQLRVLIVDDEEE---EGEIIA 369

Query: 391 LEEDQ 377
           +E D+
Sbjct: 370 VEVDE 374


>gb|OTG24956.1| putative retrotransposon gag domain-containing protein [Helianthus
           annuus]
          Length = 379

 Score = 61.2 bits (147), Expect = 3e-07
 Identities = 30/74 (40%), Positives = 45/74 (60%), Gaps = 3/74 (4%)
 Frame = -3

Query: 568 RKPGIQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDG---EI 398
           R  GI+ LS+TE   R K+  C+ CG P++P H CP G LRVM+L D+E    +G   ++
Sbjct: 298 RDRGIRSLSRTEWEERQKKGQCYKCGQPYSPTHTCPNGKLRVMLLGDDEPDEFEGLHFQL 357

Query: 397 VVLEEDQSNGAIHS 356
             L++ QS+   H+
Sbjct: 358 EQLDDGQSDRGSHT 371


>gb|KOM49767.1| hypothetical protein LR48_Vigan08g059400 [Vigna angularis]
          Length = 1563

 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 26/55 (47%), Positives = 37/55 (67%)
 Frame = -3

Query: 556 IQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVV 392
           I+ L   E  +R +E  CF CG PF PGHRCPE  LR+++LA++E+  G+ E+ V
Sbjct: 293 IRDLPYAEYVKRREEGRCFRCGGPFGPGHRCPERGLRMLILAEDEEPGGEEEVEV 347


>ref|XP_017438495.1| PREDICTED: uncharacterized protein LOC108344576 [Vigna angularis]
          Length = 1969

 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 26/55 (47%), Positives = 37/55 (67%)
 Frame = -3

Query: 556 IQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVV 392
           I+ L   E  +R +E  CF CG PF PGHRCPE  LR+++LA++E+  G+ E+ V
Sbjct: 266 IRDLPYAEYVKRREEGRCFRCGGPFGPGHRCPERGLRMLILAEDEEPGGEEEVEV 320


>dbj|GAU24602.1| hypothetical protein TSUD_289640 [Trifolium subterraneum]
          Length = 2246

 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 32/72 (44%), Positives = 43/72 (59%), Gaps = 1/72 (1%)
 Frame = -3

Query: 559 GIQFLSKTEQARRWKERLCFSCGAPFAPG-HRCPEGNLRVMVLADNEDITGDGEIVVLEE 383
           G++ LS  E   R  + LCF CG  + P  H+C E +LRV++L D E +  DGEIV LEE
Sbjct: 679 GVRSLSNEEFEERRTKGLCFKCGGRYHPTLHKCTERSLRVLILGDGESMNEDGEIVCLEE 738

Query: 382 DQSNGAIHSDEE 347
           +     + SDEE
Sbjct: 739 EN----VESDEE 746


>gb|KOM47233.1| hypothetical protein LR48_Vigan07g093700 [Vigna angularis]
          Length = 321

 Score = 59.3 bits (142), Expect = 1e-06
 Identities = 28/59 (47%), Positives = 40/59 (67%)
 Frame = -3

Query: 562 PGIQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVVLE 386
           P  +FL + E+ R      CF CG PF+PGH+CPE +LRV++LA++E+   +GE V LE
Sbjct: 149 PYPEFLKRREEGR------CFQCGGPFSPGHQCPEKSLRVVLLAEDEEEDVEGEEVELE 201


>gb|KYP56676.1| Retrovirus-related Pol polyprotein from transposon 297 family
           [Cajanus cajan]
          Length = 712

 Score = 59.7 bits (143), Expect = 1e-06
 Identities = 28/61 (45%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
 Frame = -3

Query: 574 PPRKP-GIQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEI 398
           PP +P G + L   E  +  +E  CF CG  F PGHRCPE NLRV++LAD E    D  +
Sbjct: 173 PPTRPRGTRNLPYREYIKHREENRCFHCGLAFGPGHRCPEKNLRVIILADEESSDPDSPV 232

Query: 397 V 395
           +
Sbjct: 233 L 233


>dbj|GAU40717.1| hypothetical protein TSUD_263670 [Trifolium subterraneum]
          Length = 1770

 Score = 59.7 bits (143), Expect = 1e-06
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
 Frame = -3

Query: 571 PRKPGIQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVV 392
           PR  G   LS  E   R ++ LCF CG PF P H+CP+  LR++V+ D E+  G+ +++ 
Sbjct: 409 PRDRGFTHLSYNELMERKQKGLCFKCGGPFHPMHQCPDKQLRLLVIEDEEEEEGEAKVLA 468

Query: 391 LE-EDQ 377
           +E ED+
Sbjct: 469 VEIEDE 474


>ref|XP_017407717.1| PREDICTED: uncharacterized protein LOC108320713 [Vigna angularis]
          Length = 845

 Score = 59.3 bits (142), Expect = 2e-06
 Identities = 25/55 (45%), Positives = 37/55 (67%)
 Frame = -3

Query: 556 IQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVV 392
           ++ L   E  +R +E  CF CG PF PGHRCPE  LR+++LA++E+  G+ E+ V
Sbjct: 293 VRDLPYAEYVKRREEGRCFRCGGPFGPGHRCPERGLRMVILAEDEEPGGEEEVEV 347


>dbj|GAU22599.1| hypothetical protein TSUD_134990 [Trifolium subterraneum]
          Length = 1539

 Score = 59.3 bits (142), Expect = 2e-06
 Identities = 30/72 (41%), Positives = 43/72 (59%), Gaps = 1/72 (1%)
 Frame = -3

Query: 586 QPSTPPRKPGIQFLSKTEQARRWKERLCFSCGAPFAPG-HRCPEGNLRVMVLADNEDITG 410
           Q S   R  G++ L   E A R  + LCF CG  F P  H+CPE ++RVM+L + E +  
Sbjct: 300 QGSAMDRWKGMRSLPNDEMAERRAKGLCFKCGGKFHPTLHKCPESSMRVMILGEGEIVNE 359

Query: 409 DGEIVVLEEDQS 374
           +GEIV LE +++
Sbjct: 360 EGEIVSLEIEEN 371


>dbj|GAU37691.1| hypothetical protein TSUD_164940 [Trifolium subterraneum]
          Length = 1542

 Score = 59.3 bits (142), Expect = 2e-06
 Identities = 28/66 (42%), Positives = 41/66 (62%)
 Frame = -3

Query: 568 RKPGIQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVVL 389
           R  G   LS  E   R ++ LCF CG PF P H+CPE  LRV+V+ ++ED   + +I+ +
Sbjct: 326 RDRGFTQLSYNELMERKQKGLCFKCGGPFHPMHQCPEKQLRVLVIDEDEDGEEEAKILAV 385

Query: 388 EEDQSN 371
           E D+S+
Sbjct: 386 EVDESD 391


>dbj|GAU37038.1| hypothetical protein TSUD_207440 [Trifolium subterraneum]
          Length = 1575

 Score = 58.9 bits (141), Expect = 2e-06
 Identities = 28/71 (39%), Positives = 41/71 (57%), Gaps = 1/71 (1%)
 Frame = -3

Query: 586 QPSTPPRKPGIQFLSKTEQARRWKERLCFSCGAPFAPG-HRCPEGNLRVMVLADNEDITG 410
           Q     R  G++ L   E A R  + LCF CG  F P  H+CPE ++RV++L D E +  
Sbjct: 302 QGGATDRWKGVRSLHSDEMAERRAKGLCFKCGGRFHPTLHKCPESSMRVLILGDGERLND 361

Query: 409 DGEIVVLEEDQ 377
           +GEIV +E ++
Sbjct: 362 EGEIVAVEVEE 372


>ref|XP_017438375.1| PREDICTED: uncharacterized protein LOC108344441 [Vigna angularis]
          Length = 621

 Score = 58.5 bits (140), Expect = 3e-06
 Identities = 24/55 (43%), Positives = 36/55 (65%)
 Frame = -3

Query: 556 IQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVV 392
           ++ L   E  +R +E  CF CG PF PGHRCPE  +R+++LA+ E+  G+ E+ V
Sbjct: 293 VRDLPYAEYVKRREEGRCFRCGGPFGPGHRCPERGIRMLILAEEEEPGGEEEVEV 347


>gb|PNX97977.1| hypothetical protein L195_g021217, partial [Trifolium pratense]
          Length = 1299

 Score = 58.5 bits (140), Expect = 3e-06
 Identities = 26/73 (35%), Positives = 42/73 (57%)
 Frame = -3

Query: 571 PRKPGIQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVV 392
           PR  G   LS  E   R K+ LCF CG PF P  +CP+ +LRV+V+ D ++   +G+ + 
Sbjct: 96  PRDRGFTHLSYNELMERRKKGLCFKCGGPFHPMQQCPDKHLRVLVVEDEDESGQEGKCLA 155

Query: 391 LEEDQSNGAIHSD 353
           +E D+ +  +  +
Sbjct: 156 VEVDEEDEEVDGE 168


>gb|PNY15662.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1531

 Score = 58.5 bits (140), Expect = 3e-06
 Identities = 29/65 (44%), Positives = 40/65 (61%), Gaps = 1/65 (1%)
 Frame = -3

Query: 559 GIQFLSKTEQARRWKERLCFSCGAPFAPG-HRCPEGNLRVMVLADNEDITGDGEIVVLEE 383
           G++ +S  E   R  + LCF CG  + P  H+CPE  LRV++L D E I  +GEIVV+E 
Sbjct: 302 GVRSVSNDEVVERRAKGLCFKCGGRWHPTQHKCPEKALRVLILGDGETINEEGEIVVMEG 361

Query: 382 DQSNG 368
           + S G
Sbjct: 362 EVSEG 366


>gb|PNY17068.1| retrotransposon-related protein [Trifolium pratense]
          Length = 463

 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 30/72 (41%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
 Frame = -3

Query: 559 GIQFLSKTEQARRWKERLCFSCGAPFAPG-HRCPEGNLRVMVLADNEDITGDGEIVVLEE 383
           GI+ +   E   R  + LCF CG  + P  H+CPE ++RV++L D E I  +GEI+ +E 
Sbjct: 302 GIRSIHSDEVVERRAKGLCFKCGGKWHPTQHKCPEKSIRVLILGDGETINEEGEIIAME- 360

Query: 382 DQSNGAIHSDEE 347
               GAI  DEE
Sbjct: 361 ----GAISDDEE 368


>gb|ABN06064.1| RNA-directed DNA polymerase (Reverse transcriptase); Chromo; Zinc
           finger, CCHC-type; Peptidase aspartic, active site;
           Polynucleotidyl transferase, Ribonuclease H fold
           [Medicago truncatula]
          Length = 1297

 Score = 57.8 bits (138), Expect = 6e-06
 Identities = 26/64 (40%), Positives = 39/64 (60%)
 Frame = -3

Query: 571 PRKPGIQFLSKTEQARRWKERLCFSCGAPFAPGHRCPEGNLRVMVLADNEDITGDGEIVV 392
           PR      LS  E   R ++ LCF CG PF P H+CP+  LRV+VL ++E+   +G+++ 
Sbjct: 91  PRDRSFTHLSYNELMERKQKGLCFKCGGPFHPMHQCPDKQLRVLVLEEDEEGEPEGKLLA 150

Query: 391 LEED 380
           +E D
Sbjct: 151 VEVD 154


Top