BLASTX nr result

ID: Astragalus24_contig00027802 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00027802
         (403 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX59779.1| retrovirus-related Pol polyprotein from transposo...   140   3e-40
gb|PNY02869.1| copia-type polyprotein [Trifolium pratense]            136   5e-38
dbj|GAU22946.1| hypothetical protein TSUD_326740 [Trifolium subt...   144   4e-37
dbj|GAU10241.1| hypothetical protein TSUD_420250, partial [Trifo...   136   2e-36
gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]   141   2e-36
gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposo...   140   3e-36
dbj|GAU18821.1| hypothetical protein TSUD_228110 [Trifolium subt...   140   8e-36
dbj|GAU12447.1| hypothetical protein TSUD_229810 [Trifolium subt...   139   1e-35
dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subt...   137   5e-35
gb|PNX56727.1| retrovirus-related Pol polyprotein from transposo...   126   1e-34
gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense]   136   1e-34
dbj|GAU16533.1| hypothetical protein TSUD_167640 [Trifolium subt...   135   2e-34
dbj|GAU21581.1| hypothetical protein TSUD_35440 [Trifolium subte...   135   2e-34
gb|PNX67946.1| copia-type polyprotein, partial [Trifolium pratense]   127   3e-34
gb|PNX94698.1| copia-type polyprotein [Trifolium pratense]            135   3e-34
dbj|GAU23238.1| hypothetical protein TSUD_172660 [Trifolium subt...   134   6e-34
dbj|GAU43011.1| hypothetical protein TSUD_28300 [Trifolium subte...   133   7e-34
dbj|GAU29902.1| hypothetical protein TSUD_379930 [Trifolium subt...   134   8e-34
dbj|GAU31928.1| hypothetical protein TSUD_271130, partial [Trifo...   134   1e-33
gb|PNY01730.1| copia-type polyprotein, partial [Trifolium pratense]   134   1e-33

>gb|PNX59779.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 150

 Score =  140 bits (354), Expect = 3e-40
 Identities = 61/104 (58%), Positives = 79/104 (75%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E++E YKCHKLGHFQYEC S E   ANYA FDD EE+LLMA   +      + WF+DSGC
Sbjct: 41  ESIECYKCHKLGHFQYECQSGEGGYANYAEFDDSEEVLLMAHDEDSSNSNSKLWFIDSGC 100

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           SNHM G+K W  +LD+S+RETV+LGD S++N+MG+G+V+L+  G
Sbjct: 101 SNHMSGVKAWFHDLDDSFRETVRLGDDSKMNVMGRGNVKLQLNG 144


>gb|PNY02869.1| copia-type polyprotein [Trifolium pratense]
          Length = 192

 Score =  136 bits (343), Expect = 5e-38
 Identities = 57/104 (54%), Positives = 82/104 (78%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN+E ++CHKLGH+Q ECP+ E+  AN+  FDD+EE+LLMA+G ++  +++  W+LDSGC
Sbjct: 45  ENIECFRCHKLGHYQSECPNWEDANANFVEFDDKEEILLMAQGTDESNDKKAVWYLDSGC 104

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           SNHM G KEWLF+ D  +RE+VKLGD S++ +MGKG+++L   G
Sbjct: 105 SNHMVGNKEWLFDFDVIFRESVKLGDDSKMAVMGKGNLKLNING 148


>dbj|GAU22946.1| hypothetical protein TSUD_326740 [Trifolium subterraneum]
          Length = 1222

 Score =  144 bits (362), Expect = 4e-37
 Identities = 63/104 (60%), Positives = 82/104 (78%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN+E +KCHKLGHFQ ECPS EE+ ANYA FD+ EE+LLMA+   +G      WFLDSGC
Sbjct: 219 ENVECFKCHKLGHFQSECPSWEEENANYAQFDESEEILLMAQETNEGESNNEIWFLDSGC 278

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           SNHM G K+WLF+ D SY+++VKLGD S++ +MGKG+++L+ EG
Sbjct: 279 SNHMIGNKDWLFDFDSSYKDSVKLGDDSKMPVMGKGNLKLQIEG 322


>dbj|GAU10241.1| hypothetical protein TSUD_420250, partial [Trifolium subterraneum]
          Length = 333

 Score =  136 bits (342), Expect = 2e-36
 Identities = 65/105 (61%), Positives = 80/105 (76%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN+E YKCHKLGH+Q ECP   E  ANYA F DEEE LLMAR   +  + E AWFLDSGC
Sbjct: 98  ENIECYKCHKLGHYQNECPEWGEGNANYAEFLDEEETLLMARTNSEELKNE-AWFLDSGC 156

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           SNHM G K WL+E DE+YR++VKLGD S++++MGKG+V+L   G+
Sbjct: 157 SNHMVGNKNWLYEFDENYRDSVKLGDDSKMSVMGKGNVKLNIGGK 201


>gb|PNX72392.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 886

 Score =  141 bits (356), Expect = 2e-36
 Identities = 60/104 (57%), Positives = 84/104 (80%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN+E ++CHKLGH+Q ECP+ E+  AN+A FDD+EE+LLMA+G ++   ++  W+LDSGC
Sbjct: 242 ENIECFRCHKLGHYQSECPNWEDANANFAEFDDKEEILLMAQGTDESNNKKVVWYLDSGC 301

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           SNHM G KEWLF+ D+S+RE+VKLGD SR+ +MGKG+++L   G
Sbjct: 302 SNHMVGNKEWLFDFDDSFRESVKLGDDSRMAVMGKGNLKLNING 345


>gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 582

 Score =  140 bits (352), Expect = 3e-36
 Identities = 61/105 (58%), Positives = 79/105 (75%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E++E YKCHK GH+QYECP+L   + NY GFD+EEEMLLMA    +  ++E  WFLDSGC
Sbjct: 239 EHIECYKCHKFGHYQYECPNLATDVVNYTGFDEEEEMLLMALEGSKENDQE-CWFLDSGC 297

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           S HMCG+K W  +LDE +RE VKLGD   +++MG+G+V+L  EGR
Sbjct: 298 STHMCGVKRWFIDLDEQFREVVKLGDGRTLSVMGRGNVKLCVEGR 342


>dbj|GAU18821.1| hypothetical protein TSUD_228110 [Trifolium subterraneum]
          Length = 1013

 Score =  140 bits (352), Expect = 8e-36
 Identities = 66/108 (61%), Positives = 86/108 (79%), Gaps = 4/108 (3%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGV----EQGTEEERAWFL 150
           EN+E++KCHKLGHFQ ECPS EE+ ANYA FD+ EE+LLMA+      EQG+  E  WFL
Sbjct: 222 ENVEFFKCHKLGHFQSECPSREEENANYAQFDEGEEILLMAQETKETKEQGSHSE-IWFL 280

Query: 149 DSGCSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           DSGCSNHM G KEWLF+ D+S++++VKLGD S++ +MGKG+++L  EG
Sbjct: 281 DSGCSNHMIGNKEWLFDFDDSFKDSVKLGDDSKMAVMGKGNLKLHIEG 328


>dbj|GAU12447.1| hypothetical protein TSUD_229810 [Trifolium subterraneum]
          Length = 1102

 Score =  139 bits (350), Expect = 1e-35
 Identities = 61/105 (58%), Positives = 81/105 (77%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN+E +KCHKLGHFQ ECPS EE+ ANYA FD+ EE+LL+A+   +G      WFLDS C
Sbjct: 146 ENVECFKCHKLGHFQSECPSWEEENANYAQFDESEEILLVAQETNEGESNNEIWFLDSSC 205

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           S HM G KEWLF+ D SY+++VKLGD S++ +MGKG+++L+ EG+
Sbjct: 206 SKHMIGNKEWLFDFDSSYKDSVKLGDDSKMPVMGKGNLKLQIEGQ 250


>dbj|GAU23361.1| hypothetical protein TSUD_334080 [Trifolium subterraneum]
          Length = 1322

 Score =  137 bits (346), Expect = 5e-35
 Identities = 61/101 (60%), Positives = 82/101 (81%), Gaps = 1/101 (0%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMAR-GVEQGTEEERAWFLDSG 141
           EN+E Y+CHKLGH+Q+ECP+ EEK ANYA +D  EE+LLMA+ G+E    +E  WFLDSG
Sbjct: 238 ENIECYRCHKLGHYQHECPTWEEKDANYAAYDSHEEILLMAKHGIETDARDE-VWFLDSG 296

Query: 140 CSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRL 18
           CSNHM G +EWLF+ D++ RE+VKLGD SR+ ++GKG+++L
Sbjct: 297 CSNHMVGTREWLFDFDDNIRESVKLGDDSRMQILGKGNLKL 337


>gb|PNX56727.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 153

 Score =  126 bits (317), Expect = 1e-34
 Identities = 59/108 (54%), Positives = 77/108 (71%), Gaps = 10/108 (9%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMA----------RGVEQGTEE 168
           EN+E YKCHKLGHFQ ECP  EE   NYA FD+ EE+LLMA          R  E+  + 
Sbjct: 46  ENVECYKCHKLGHFQSECPDWEEDNVNYAEFDEAEELLLMAQKGKEIKEIQRSDERFNDN 105

Query: 167 ERAWFLDSGCSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSV 24
            + WFLDSGCSNHM G KEWLF+ D+S++++VKLGD S++ ++GKG++
Sbjct: 106 HQIWFLDSGCSNHMVGNKEWLFDYDDSFKDSVKLGDDSKMAVIGKGNL 153


>gb|PNX96089.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 1062

 Score =  136 bits (343), Expect = 1e-34
 Identities = 62/104 (59%), Positives = 83/104 (79%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           +N+E Y CHKLGHFQ +CP+ +EK ANYA FD+ EEMLLMA   E+G+ +++ WFLDSGC
Sbjct: 269 DNIECYHCHKLGHFQSDCPAWDEK-ANYAEFDEGEEMLLMAHS-EKGSYDKKVWFLDSGC 326

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
            NHMCG K+W F LDE +R +VKLGD+SR+ ++GKG+V+L+  G
Sbjct: 327 RNHMCGTKDWFFNLDEQFRISVKLGDNSRMMVVGKGNVKLRIGG 370


>dbj|GAU16533.1| hypothetical protein TSUD_167640 [Trifolium subterraneum]
          Length = 1103

 Score =  135 bits (341), Expect = 2e-34
 Identities = 62/106 (58%), Positives = 79/106 (74%), Gaps = 2/106 (1%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEK-IANYAGFDDEEEMLLMAR-GVEQGTEEERAWFLDS 144
           EN+E YKCHK GHFQYEC + E    ANYA FDD EE+LLMA           + W++DS
Sbjct: 239 ENVECYKCHKFGHFQYECQNSEGGGYANYADFDDSEEVLLMAHESTSSNNPRAKVWYIDS 298

Query: 143 GCSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           GCSNHMCGIKEW  +LDES+RE+V+LGD S++++MGKG+V+L+  G
Sbjct: 299 GCSNHMCGIKEWFHDLDESFRESVRLGDDSQMSVMGKGNVKLQMNG 344


>dbj|GAU21581.1| hypothetical protein TSUD_35440 [Trifolium subterraneum]
          Length = 1283

 Score =  135 bits (341), Expect = 2e-34
 Identities = 66/105 (62%), Positives = 81/105 (77%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E +E +KCHKLGH++ ECP  E   ANYA F DEEE LLMAR     ++ E  WFLDSGC
Sbjct: 215 ETIECFKCHKLGHYRSECPDWEGN-ANYAEFLDEEETLLMARTNTNESKHE-TWFLDSGC 272

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           SNHM G K+WL+ELDESYR+TVKLGD S++N+MGKG+V+L  +GR
Sbjct: 273 SNHMVGNKDWLYELDESYRDTVKLGDDSKMNVMGKGNVKLSIDGR 317


>gb|PNX67946.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 197

 Score =  127 bits (318), Expect = 3e-34
 Identities = 57/109 (52%), Positives = 82/109 (75%), Gaps = 5/109 (4%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGV-----EQGTEEERAWF 153
           E++E YKCHKLGH+Q +CPS +E  ANYA FD+ +E+LLMA+       +  +E+   WF
Sbjct: 50  ESVECYKCHKLGHYQSDCPSWDEDNANYAEFDEGQEILLMAQNTMVNESQNSSEKLELWF 109

Query: 152 LDSGCSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           LDSGCSNHM G K WLF+ D+S++++VKLGD S++++ GKG+++L  EG
Sbjct: 110 LDSGCSNHMVGNKNWLFDYDDSFKDSVKLGDDSKMSVEGKGNLKLYIEG 158


>gb|PNX94698.1| copia-type polyprotein [Trifolium pratense]
          Length = 1324

 Score =  135 bits (340), Expect = 3e-34
 Identities = 60/101 (59%), Positives = 81/101 (80%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E +E +KCHKLGH++ ECP+ EE  ANYA FD+E+E+LLMAR       +E  WFLDSGC
Sbjct: 245 ELVECFKCHKLGHYRNECPTWEEYDANYAEFDEEQELLLMARDNVSTHAKEEVWFLDSGC 304

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLK 15
           SNHM G KEWLF+ D+S+RE+VKLGD SR+++MG+G+++L+
Sbjct: 305 SNHMIGTKEWLFDYDDSFRESVKLGDDSRMSVMGRGNLKLQ 345


>dbj|GAU23238.1| hypothetical protein TSUD_172660 [Trifolium subterraneum]
          Length = 1132

 Score =  134 bits (338), Expect = 6e-34
 Identities = 61/106 (57%), Positives = 79/106 (74%), Gaps = 2/106 (1%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEK-IANYAGFDDEEEMLLMAR-GVEQGTEEERAWFLDS 144
           EN+E YKCHK GHFQYEC + E    ANYA FDD EE+LLMA           + W++DS
Sbjct: 239 ENVECYKCHKFGHFQYECQNSEGGGYANYADFDDSEEVLLMAHESTSSNNPRAKVWYIDS 298

Query: 143 GCSNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEG 6
           GCSNHMCGIKEW  +LD+S+RE+V+LGD S++++MGKG+V+L+  G
Sbjct: 299 GCSNHMCGIKEWFHDLDDSFRESVRLGDDSQMSVMGKGNVKLQMNG 344


>dbj|GAU43011.1| hypothetical protein TSUD_28300 [Trifolium subterraneum]
          Length = 538

 Score =  133 bits (334), Expect = 7e-34
 Identities = 64/105 (60%), Positives = 82/105 (78%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E +E YKCHKLGH+Q ECP+  E  ANYA F+DEEEMLLMA+   +  +EE  WFLDSGC
Sbjct: 239 ELVECYKCHKLGHYQNECPTWGEN-ANYAEFNDEEEMLLMAKTNCEEMKEE-IWFLDSGC 296

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           SNHM G K+W++E DE+YR++VKLGD S++ +MGKG+V+L   GR
Sbjct: 297 SNHMIGNKDWMYEFDETYRDSVKLGDDSKMQVMGKGNVKLSINGR 341


>dbj|GAU29902.1| hypothetical protein TSUD_379930 [Trifolium subterraneum]
          Length = 1277

 Score =  134 bits (337), Expect = 8e-34
 Identities = 65/105 (61%), Positives = 80/105 (76%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E +E +KCHKLGH++ ECP  E   ANY  F DEEE LLMAR     ++ E  WFLDSGC
Sbjct: 244 ETIECFKCHKLGHYRNECPEWEGN-ANYVEFLDEEETLLMARTNADESKHE-TWFLDSGC 301

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           SNHM G K+WL+ELDESYR+TVKLGD S++N+MGKG+V+L  +GR
Sbjct: 302 SNHMVGNKDWLYELDESYRDTVKLGDDSKMNVMGKGNVKLSIDGR 346


>dbj|GAU31928.1| hypothetical protein TSUD_271130, partial [Trifolium subterraneum]
          Length = 747

 Score =  134 bits (336), Expect = 1e-33
 Identities = 65/105 (61%), Positives = 80/105 (76%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           E +E +KCHKLGH++ ECP  E   ANYA F DEEE LLMAR     ++ E  WFLDSGC
Sbjct: 244 ETIECFKCHKLGHYRNECPDWEGN-ANYAEFLDEEETLLMARTNADESKHE-TWFLDSGC 301

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           SNHM G K+WL+ELDESYR+T KLGD S++N+MGKG+V+L  +GR
Sbjct: 302 SNHMVGNKDWLYELDESYRDTFKLGDDSKMNVMGKGNVKLSIDGR 346


>gb|PNY01730.1| copia-type polyprotein, partial [Trifolium pratense]
          Length = 861

 Score =  134 bits (336), Expect = 1e-33
 Identities = 63/105 (60%), Positives = 79/105 (75%)
 Frame = -3

Query: 317 ENLEYYKCHKLGHFQYECPSLEEKIANYAGFDDEEEMLLMARGVEQGTEEERAWFLDSGC 138
           EN E +KCHKLGH+Q +CP+  E  ANYA FDDEEEMLLMA+  +    E   WFLDSGC
Sbjct: 240 ENAECFKCHKLGHYQSDCPNWGEN-ANYAEFDDEEEMLLMAKTNDDAKSEN--WFLDSGC 296

Query: 137 SNHMCGIKEWLFELDESYRETVKLGDHSRINMMGKGSVRLKSEGR 3
           S HM G K+W ++ DE+YR++VKLGD SR+N+MGKG+V+L   GR
Sbjct: 297 SYHMAGNKDWFYDFDENYRDSVKLGDDSRMNVMGKGNVKLSINGR 341


Top