BLASTX nr result

ID: Astragalus23_contig00031898 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00031898
         (346 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAO23078.1| polyprotein [Glycine max]                               72   2e-12
gb|KYP53387.1| Retrotransposon-derived protein PEG10 [Cajanus ca...    66   4e-10
gb|PNY16560.1| Ty3/gypsy retrotransposon protein [Trifolium prat...    66   4e-10
gb|PNX99071.1| retrotransposon-related protein, partial [Trifoli...    65   1e-09
gb|PNX93204.1| hypothetical protein L195_g016355 [Trifolium prat...    64   1e-09
dbj|GAU30363.1| hypothetical protein TSUD_57740 [Trifolium subte...    64   2e-09
gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]    64   2e-09
dbj|GAU28866.1| hypothetical protein TSUD_293180 [Trifolium subt...    64   3e-09
gb|KOM58233.1| hypothetical protein LR48_Vigan11g126700 [Vigna a...    63   3e-09
dbj|BAT97165.1| hypothetical protein VIGAN_09053500 [Vigna angul...    63   4e-09
dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subte...    63   4e-09
dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subt...    63   5e-09
gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifo...    62   7e-09
dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subt...    62   7e-09
gb|PNX86812.1| hypothetical protein L195_g042894, partial [Trifo...    62   9e-09
gb|PNX97449.1| retrotransposon-related protein, partial [Trifoli...    62   9e-09
dbj|GAU47333.1| hypothetical protein TSUD_101210 [Trifolium subt...    62   9e-09
dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subt...    62   1e-08
gb|KYP34468.1| hypothetical protein KK1_044568 [Cajanus cajan]         58   1e-08
dbj|GAU37387.1| hypothetical protein TSUD_22610 [Trifolium subte...    62   1e-08

>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score = 72.4 bits (176), Expect = 2e-12
 Identities = 37/58 (63%), Positives = 41/58 (70%)
 Frame = +3

Query: 15  NYKNQNPKPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFS 188
           N KN NPKP   P  LP   TKP   N +NQ +K +SPAEIQLRR+K LCYFCDEKFS
Sbjct: 281 NQKNDNPKPNLPPL-LPTPSTKPF--NLRNQNIKKISPAEIQLRREKNLCYFCDEKFS 335


>gb|KYP53387.1| Retrotransposon-derived protein PEG10 [Cajanus cajan]
          Length = 431

 Score = 65.9 bits (159), Expect = 4e-10
 Identities = 34/68 (50%), Positives = 45/68 (66%), Gaps = 8/68 (11%)
 Frame = +3

Query: 15  NYKNQNPKP---YTQPTN-----LPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYF 170
           N  + NPKP   ++ P +     LP   TKP    +KNQ+ K +SPAE+Q+RR+KGLCYF
Sbjct: 124 NRHSSNPKPDIPHSLPKSNLSPLLPNPSTKPFPQTHKNQV-KKISPAEMQIRREKGLCYF 182

Query: 171 CDEKFSYT 194
           CDEKF +T
Sbjct: 183 CDEKFPFT 190


>gb|PNY16560.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1525

 Score = 65.9 bits (159), Expect = 4e-10
 Identities = 32/68 (47%), Positives = 44/68 (64%)
 Frame = +3

Query: 24  NQNPKPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFSYTR** 203
           N NP   TQP  LP   TKP +   KN ++K++S AE+QLRR+KGLCY C++K+S+    
Sbjct: 256 NPNPNRPTQPPLLPTPSTKPSLFTQKNNIVKNISSAEMQLRREKGLCYTCEDKWSFNHKC 315

Query: 204 PSFSVKSL 227
           P+  V  L
Sbjct: 316 PNKHVMLL 323


>gb|PNX99071.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 730

 Score = 64.7 bits (156), Expect = 1e-09
 Identities = 28/51 (54%), Positives = 39/51 (76%)
 Frame = +3

Query: 36  KPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFS 188
           +P  QP  LP   TKP  +  K++ +K++SPAE+QLRR+KG+CY+CDEKFS
Sbjct: 185 QPLKQPPILPTPTTKPFNSPLKSKTIKNISPAEMQLRREKGICYYCDEKFS 235


>gb|PNX93204.1| hypothetical protein L195_g016355 [Trifolium pratense]
          Length = 576

 Score = 64.3 bits (155), Expect = 1e-09
 Identities = 29/59 (49%), Positives = 41/59 (69%)
 Frame = +3

Query: 18  YKNQNPKPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFSYT 194
           + N NP   T P  LP   +KP    +KNQ +K+M+ AE+Q+RR+KGLCY CDEK+S++
Sbjct: 260 FNNPNPNRTTIPPLLPTPNSKPTNTYSKNQNVKNMTRAEMQIRREKGLCYTCDEKWSFS 318


>dbj|GAU30363.1| hypothetical protein TSUD_57740 [Trifolium subterraneum]
          Length = 654

 Score = 63.9 bits (154), Expect = 2e-09
 Identities = 30/60 (50%), Positives = 40/60 (66%)
 Frame = +3

Query: 15  NYKNQNPKPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFSYT 194
           ++ N NP   T P  LP   TKP    +KNQ +K MS AE+Q+RR+KGLCY  DEK+S++
Sbjct: 235 SFNNPNPNRTTTPPLLPTPNTKPTNTYSKNQNVKKMSSAEMQIRREKGLCYTYDEKWSFS 294


>gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1210

 Score = 63.9 bits (154), Expect = 2e-09
 Identities = 33/67 (49%), Positives = 41/67 (61%), Gaps = 8/67 (11%)
 Frame = +3

Query: 15  NYKNQNPKP--------YTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYF 170
           N  + NPKP           P  LP   TKP     +NQ+ K +SPAE+Q+RR+KGLCYF
Sbjct: 268 NRHSSNPKPDIPHSLPKSNLPPLLPNPSTKPFSQTYQNQV-KKISPAEMQIRREKGLCYF 326

Query: 171 CDEKFSY 191
           CDEKFS+
Sbjct: 327 CDEKFSF 333


>dbj|GAU28866.1| hypothetical protein TSUD_293180 [Trifolium subterraneum]
          Length = 527

 Score = 63.5 bits (153), Expect = 3e-09
 Identities = 28/59 (47%), Positives = 41/59 (69%)
 Frame = +3

Query: 18  YKNQNPKPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFSYT 194
           + N NP   T P  LP   +KP    +KNQ +K+M+ AE+Q+RR+KGLCY CD+K+S++
Sbjct: 258 FNNPNPNRTTIPPLLPTPNSKPTNTYSKNQNVKNMTSAEMQIRREKGLCYTCDDKWSFS 316


>gb|KOM58233.1| hypothetical protein LR48_Vigan11g126700 [Vigna angularis]
          Length = 472

 Score = 63.2 bits (152), Expect = 3e-09
 Identities = 27/50 (54%), Positives = 36/50 (72%)
 Frame = +3

Query: 45  TQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFSYT 194
           T P  LP    KP+   NK+  ++ +SPAE+QLRR+K LCYFCDEKFS++
Sbjct: 277 TLPPLLPNPNIKPLSQTNKHHQIRKLSPAEMQLRREKCLCYFCDEKFSFS 326


>dbj|BAT97165.1| hypothetical protein VIGAN_09053500 [Vigna angularis var.
           angularis]
          Length = 651

 Score = 63.2 bits (152), Expect = 4e-09
 Identities = 27/50 (54%), Positives = 36/50 (72%)
 Frame = +3

Query: 45  TQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFSYT 194
           T P  LP    KP+   NK+  ++ +SPAE+QLRR+K LCYFCDEKFS++
Sbjct: 277 TLPPLLPNPNIKPLSQTNKHHQIRKLSPAEMQLRREKCLCYFCDEKFSFS 326


>dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subterraneum]
          Length = 1500

 Score = 63.2 bits (152), Expect = 4e-09
 Identities = 32/64 (50%), Positives = 40/64 (62%)
 Frame = +3

Query: 3   KTDINYKNQNPKPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEK 182
           K D   +N  P   T PT       +P+    KN  +K +SPAE+QLRRDKGLCY+CDEK
Sbjct: 253 KNDTTTRNAAPVLNTPPT-------RPMSQYQKNPNIKRISPAEMQLRRDKGLCYWCDEK 305

Query: 183 FSYT 194
           FS+T
Sbjct: 306 FSFT 309


>dbj|GAU12540.1| hypothetical protein TSUD_182540 [Trifolium subterraneum]
          Length = 1451

 Score = 62.8 bits (151), Expect = 5e-09
 Identities = 35/72 (48%), Positives = 43/72 (59%), Gaps = 7/72 (9%)
 Frame = +3

Query: 15  NYKNQNP---KPYTQPTN----LPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFC 173
           NY    P   KP T   N    L    T+P+    KN  +K +SPAE+Q+RRDKGLCY+C
Sbjct: 231 NYATNKPFTNKPETITRNSAPILNTPPTRPMSQFQKNPNIKRISPAEMQVRRDKGLCYWC 290

Query: 174 DEKFSYTR**PS 209
           D+KFSYT   PS
Sbjct: 291 DDKFSYTHKCPS 302


>gb|PNX92431.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1502

 Score = 62.4 bits (150), Expect = 7e-09
 Identities = 34/73 (46%), Positives = 42/73 (57%), Gaps = 9/73 (12%)
 Frame = +3

Query: 3   KTDINYKNQNPKPYTQPTNLPKTETKPIVNN---------NKNQLLKSMSPAEIQLRRDK 155
           K  IN  + N KP+     +    T PI+N           KN  +K MSPAE Q+RRDK
Sbjct: 239 KATINNHSTN-KPFINKPEIATRNTAPILNTPPTRPMSQFQKNPNIKRMSPAERQVRRDK 297

Query: 156 GLCYFCDEKFSYT 194
           GLCY+CDEKFS+T
Sbjct: 298 GLCYWCDEKFSFT 310


>dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subterraneum]
          Length = 1512

 Score = 62.4 bits (150), Expect = 7e-09
 Identities = 33/67 (49%), Positives = 41/67 (61%), Gaps = 7/67 (10%)
 Frame = +3

Query: 15  NYKNQNP---KPYTQPTN----LPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFC 173
           NY N  P   KP     N    L    T+P+    KN  +K +SPAE+Q+RRDKGLCY+C
Sbjct: 243 NYSNIKPLTTKPENSTRNSAPILNTPPTRPMSQFQKNPNIKRISPAEMQIRRDKGLCYWC 302

Query: 174 DEKFSYT 194
           DEKFS+T
Sbjct: 303 DEKFSFT 309


>gb|PNX86812.1| hypothetical protein L195_g042894, partial [Trifolium pratense]
          Length = 487

 Score = 62.0 bits (149), Expect = 9e-09
 Identities = 35/74 (47%), Positives = 45/74 (60%), Gaps = 2/74 (2%)
 Frame = +3

Query: 12  INYKNQNPKPYTQPTNLPKTETKPIVN--NNKNQLLKSMSPAEIQLRRDKGLCYFCDEKF 185
           I+     P P TQ  N  + +  P++   N K   +K+MS AEIQLRRDKGLCYFCD+KF
Sbjct: 84  ISPNTNKPHPITQQ-NPQRAQLPPLLPTPNQKPMSIKNMSSAEIQLRRDKGLCYFCDDKF 142

Query: 186 SYTR**PSFSVKSL 227
           S+T   P+  V  L
Sbjct: 143 SHTHRCPNRRVMML 156


>gb|PNX97449.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 806

 Score = 62.0 bits (149), Expect = 9e-09
 Identities = 29/53 (54%), Positives = 38/53 (71%)
 Frame = +3

Query: 30  NPKPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFS 188
           +P P  +P  LP   T P + N ++  +K +SPAEIQLRR+KGLCYFCD+KFS
Sbjct: 156 SPSPLKKPPLLPTPTTTPQMPNQRS--IKHISPAEIQLRREKGLCYFCDDKFS 206


>dbj|GAU47333.1| hypothetical protein TSUD_101210 [Trifolium subterraneum]
          Length = 1017

 Score = 62.0 bits (149), Expect = 9e-09
 Identities = 32/62 (51%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
 Frame = +3

Query: 21  KNQNPKPYTQPTN----LPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFS 188
           KN   KP T   N    L    T+P+    KN  +K +SPAE+QLRRDKGLCY+CD+KFS
Sbjct: 164 KNFTNKPETLTRNSTPILNTPPTRPMSQFQKNPNIKRISPAEMQLRRDKGLCYWCDDKFS 223

Query: 189 YT 194
           +T
Sbjct: 224 FT 225


>dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subterraneum]
          Length = 1479

 Score = 62.0 bits (149), Expect = 1e-08
 Identities = 31/64 (48%), Positives = 42/64 (65%)
 Frame = +3

Query: 3   KTDINYKNQNPKPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEK 182
           K++I  +N  P   T PT       +P+    KN  +K +SPAE+Q+RRDKGLCY+CDEK
Sbjct: 252 KSEIATRNSAPILNTPPT-------RPMSQYQKNPNIKRISPAEMQVRRDKGLCYWCDEK 304

Query: 183 FSYT 194
           FS+T
Sbjct: 305 FSFT 308


>gb|KYP34468.1| hypothetical protein KK1_044568 [Cajanus cajan]
          Length = 98

 Score = 58.2 bits (139), Expect = 1e-08
 Identities = 26/52 (50%), Positives = 34/52 (65%)
 Frame = +3

Query: 33  PKPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFS 188
           PK  T P  LP    KP   N K+  +K M+ AE+Q+RR+KGLC+ CDEKF+
Sbjct: 16  PKTNTLPPLLPTPTIKPFSQNTKSATIKRMTSAEMQIRREKGLCFTCDEKFT 67


>dbj|GAU37387.1| hypothetical protein TSUD_22610 [Trifolium subterraneum]
          Length = 1418

 Score = 61.6 bits (148), Expect = 1e-08
 Identities = 30/58 (51%), Positives = 40/58 (68%)
 Frame = +3

Query: 21  KNQNPKPYTQPTNLPKTETKPIVNNNKNQLLKSMSPAEIQLRRDKGLCYFCDEKFSYT 194
           K++N      P  L  + T+P+    KN  +K +SPAEIQ+RRDKGLCY+CDEKFS+T
Sbjct: 228 KSENTTRNAAPI-LNTSPTRPMSQFQKNPNIKRISPAEIQIRRDKGLCYWCDEKFSFT 284
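
Note on reading the coordinates: in the alignments above, Query positions are nucleotide coordinates on the 346-letter contig, while Sbjct positions are amino-acid coordinates on the protein hit. The sketch below is a sanity check, not part of the BLAST output. It uses the numbers from the first alignment (gb|AAO23078.1: Query 15 to 188, Frame = +3, Identities = 37/58, Positives = 41/58). The function names are hypothetical, and the use of truncation rather than rounding for the percentages is an assumption that happens to match the values printed in this report.

```python
def forward_frame(query_start: int) -> int:
    """Forward reading frame (1, 2 or 3) implied by a 1-based nucleotide start."""
    return (query_start - 1) % 3 + 1


def codons_spanned(query_start: int, query_end: int) -> int:
    """Number of codons covered by an inclusive forward-strand nucleotide range."""
    span = query_end - query_start + 1
    if span % 3 != 0:
        raise ValueError("nucleotide span is not a whole number of codons")
    return span // 3


def percent_truncated(count: int, total: int) -> int:
    """Integer percentage, truncated (assumed to mirror the report's rounding)."""
    return 100 * count // total


# Values copied from the first alignment (gb|AAO23078.1).
frame = forward_frame(15)              # start 15 falls in frame +3
aligned = codons_spanned(15, 188)      # 174 nt -> 58 codons, no gaps in this hit
identity = percent_truncated(37, 58)   # 63
positives = percent_truncated(41, 58)  # 70
print(frame, aligned, identity, positives)
```

For gapped hits the alignment length is the number of query codons plus the gap columns that fall in the query row, so the codon count alone will be smaller than the denominator shown under Identities.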

