BLASTX nr result

ID: Astragalus23_contig00030265 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00030265
         (442 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KHN21230.1| Retrovirus-related Pol polyprotein from transposo...    77   2e-13
ref|XP_014632953.1| PREDICTED: uncharacterized protein LOC102666...    76   3e-13
gb|PNY02430.1| retrovirus-related Pol polyprotein from transposo...    66   9e-10
ref|XP_014627013.1| PREDICTED: uncharacterized protein LOC106797...    62   2e-08
ref|XP_020204897.1| uncharacterized protein LOC109790192 [Cajanu...    61   6e-08
ref|XP_016195998.1| uncharacterized protein LOC107637061 [Arachi...    60   1e-07
dbj|GAU10291.1| hypothetical protein TSUD_418880 [Trifolium subt...    60   2e-07
ref|XP_020207034.1| uncharacterized protein LOC109792059 isoform...    57   2e-07
gb|PNX81124.1| histone deacetylase, partial [Trifolium pratense]       59   3e-07
ref|XP_020211686.1| uncharacterized protein LOC109796422 [Cajanu...    58   5e-07
gb|KYP71160.1| Retrovirus-related Pol polyprotein from transposo...    58   6e-07
ref|XP_015966155.1| uncharacterized protein LOC107489903 [Arachi...    57   8e-07
dbj|GAU29238.1| hypothetical protein TSUD_362280 [Trifolium subt...    57   1e-06
gb|PNX93258.1| histone deacetylase, partial [Trifolium pratense]       57   1e-06
gb|PNX81791.1| hypothetical protein L195_g037816, partial [Trifo...    57   1e-06
gb|PNX84343.1| retrovirus-related Pol polyprotein from transposo...    57   1e-06
ref|XP_016163061.1| uncharacterized protein LOC107605630 [Arachi...    57   1e-06
ref|XP_017640210.1| PREDICTED: retrovirus-related Pol polyprotei...    57   2e-06
ref|XP_016192001.1| uncharacterized protein LOC107632877 [Arachi...    55   5e-06
gb|PNY10805.1| histone deacetylase, partial [Trifolium pratense]       55   5e-06

>gb|KHN21230.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 1429

 Score = 77.0 bits (188), Expect = 2e-13
 Identities = 47/105 (44%), Positives = 59/105 (56%), Gaps = 3/105 (2%)
 Frame = -3

Query: 341 VQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVW--TNPQ 168
           VQCQVC++ GHD + CYHRFNA    AY S QP  + GNPYQYVR    ++  W  +NPQ
Sbjct: 282 VQCQVCHRTGHDASYCYHRFNA----AYGSNQP-YVHGNPYQYVRNTTPNNNNWAQSNPQ 336

Query: 167 LMQPQHVAYPQA-FIGYAASGPAQNIP*QQSQNRIVDSGASHHIT 36
             Q    A PQA F GYA            + N  +D+ A+ H+T
Sbjct: 337 WQQ----AAPQANFTGYAPQTNFTGYAMHPTMNNNLDTAATQHVT 377


>ref|XP_014632953.1| PREDICTED: uncharacterized protein LOC102666325 [Glycine max]
          Length = 608

 Score = 76.3 bits (186), Expect = 3e-13
 Identities = 47/105 (44%), Positives = 59/105 (56%), Gaps = 3/105 (2%)
 Frame = -3

Query: 341 VQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVW--TNPQ 168
           VQCQVC+  GHD + CYHRFNA    AY S QP  + GNPYQYVR    ++  W  +NPQ
Sbjct: 249 VQCQVCHHTGHDASYCYHRFNA----AYGSNQP-YVHGNPYQYVRNTTPNNNNWAQSNPQ 303

Query: 167 LMQPQHVAYPQA-FIGYAASGPAQNIP*QQSQNRIVDSGASHHIT 36
             Q    A PQA F GYA      +     + N  +D+ A+ H+T
Sbjct: 304 WQQ----AAPQANFTGYAPQTNFTSYAMHPTMNNNLDTAATQHVT 344


>gb|PNY02430.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 1064

 Score = 66.2 bits (160), Expect = 9e-10
 Identities = 43/123 (34%), Positives = 53/123 (43%), Gaps = 10/123 (8%)
 Frame = -3

Query: 344 NVQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPY--QYVRPAPQS------- 192
           ++QCQVC K GH    C+HRFN  +      P PQ   GNPY   Y    PQ+       
Sbjct: 297 DLQCQVCAKFGHSALNCWHRFNQQFQGNPAPPVPQPRYGNPYGNPYGNAPPQAFGYAPFP 356

Query: 191 -SMVWTNPQLMQPQHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNLA 15
               W  P       +A P AF+  AA           S +   DSGAS H+TG+  NL 
Sbjct: 357 PQNTWMRPPAQAQLTMAPPSAFLTNAAP--------STSNSWFPDSGASFHVTGDSRNLQ 408

Query: 14  NAT 6
             T
Sbjct: 409 QLT 411


>ref|XP_014627013.1| PREDICTED: uncharacterized protein LOC106797270 [Glycine max]
          Length = 329

 Score = 62.0 bits (149), Expect = 2e-08
 Identities = 33/60 (55%), Positives = 39/60 (65%), Gaps = 2/60 (3%)
 Frame = -3

Query: 341 VQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVW--TNPQ 168
           VQCQVC+  GHD + CYHRFNA    AY S QP  + GNPYQYVR    ++  W  +NPQ
Sbjct: 273 VQCQVCHCTGHDASYCYHRFNA----AYGSNQP-YVHGNPYQYVRNTTPNNNNWAQSNPQ 327


>ref|XP_020204897.1| uncharacterized protein LOC109790192 [Cajanus cajan]
          Length = 385

 Score = 60.8 bits (146), Expect = 6e-08
 Identities = 42/119 (35%), Positives = 57/119 (47%), Gaps = 10/119 (8%)
 Frame = -3

Query: 344 NVQCQVCYKIGHDDAVCYHRFNAHYTPA----YYSPQPQQIAGNPYQYVRPAP--QSSMV 183
           N QCQ+C+K GH   +C++R + +Y  A     Y P   Q    P Q V P+   ++S  
Sbjct: 257 NFQCQICFKYGHTANICFYRADINYQTAESLVLYDPTTLQ----PVQ-VTPSSNLKASNT 311

Query: 182 WTNPQLMQPQHVAY----PQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNL 18
           W NP   QP   A+    P A I   +S      P     + I DSGAS H+TG P N+
Sbjct: 312 WVNPNQKQPSQDAHANVTPSAMIDNTSS------PAGTHSSWIPDSGASFHVTGEPQNI 364


>ref|XP_016195998.1| uncharacterized protein LOC107637061 [Arachis ipaensis]
          Length = 1042

 Score = 60.1 bits (144), Expect = 1e-07
 Identities = 45/112 (40%), Positives = 52/112 (46%), Gaps = 5/112 (4%)
 Frame = -3

Query: 338 QCQVCYKIGHDDAVCYHRFNAHYT-----PAYYSPQPQQIAGNPYQYVRPAPQSSMVWTN 174
           QCQVC KIGH    CYHRF+  YT     P   +P P     N  Q     PQSS     
Sbjct: 230 QCQVCGKIGHIALKCYHRFDQSYTNPQLQPLNAAPPPSMAFHNGKQVQHHTPQSS----- 284

Query: 173 PQLMQPQHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNL 18
           PQ  QP     PQA+I   ++ P              DSGASHHIT + +NL
Sbjct: 285 PQ--QPA-APSPQAYIALPSAVP--------DAGWYPDSGASHHITFDQSNL 325


>dbj|GAU10291.1| hypothetical protein TSUD_418880 [Trifolium subterraneum]
          Length = 483

 Score = 59.7 bits (143), Expect = 2e-07
 Identities = 39/118 (33%), Positives = 55/118 (46%), Gaps = 7/118 (5%)
 Frame = -3

Query: 341 VQCQVCYKIGHDDAVCYHRF---NAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVWTN- 174
           +QCQ+CYK GHD + CY+RF   N++    Y +P       N +    P P         
Sbjct: 252 IQCQICYKTGHDASYCYYRFDGPNSYGYGGYGAPNGYGAPSNVWMQNLPRPSQPTFNARP 311

Query: 173 ---PQLMQPQHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNLANA 9
              PQ   P+  A PQA++    +G         S     DSGA+HH+T + NNL +A
Sbjct: 312 AFPPQFGNPKPQA-PQAYL----TGNESTASSSFSNGWYPDSGATHHVTPDANNLMDA 364


>ref|XP_020207034.1| uncharacterized protein LOC109792059 isoform X2 [Cajanus cajan]
          Length = 120

 Score = 56.6 bits (135), Expect = 2e-07
 Identities = 34/78 (43%), Positives = 40/78 (51%), Gaps = 8/78 (10%)
 Frame = -3

Query: 227 NPYQYVRPAPQSSMVWT--NPQLMQPQHVAYPQAFIGYAASGPAQN------IP*QQSQN 72
           NPY+Y+ P+P    V     PQL QP     P A   +  + P          P QQSQN
Sbjct: 7   NPYRYIHPSPSQLPVQPVIQPQLNQPVPTHGPSAQACFTFTYPQPQPFVTNVTPQQQSQN 66

Query: 71  RIVDSGASHHITGNPNNL 18
             VDSGASHH+T NP NL
Sbjct: 67  WFVDSGASHHVTENPGNL 84


>gb|PNX81124.1| histone deacetylase, partial [Trifolium pratense]
          Length = 660

 Score = 58.9 bits (141), Expect = 3e-07
 Identities = 38/110 (34%), Positives = 50/110 (45%), Gaps = 1/110 (0%)
 Frame = -3

Query: 341 VQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQY-VRPAPQSSMVWTNPQL 165
           VQCQ+C K  HD ++C+HR         Y P   +  G  Y     P P       NP  
Sbjct: 293 VQCQICSKYNHDASICWHR---------YDPSSSRPTGRGYNAGNNPRP----XXYNPYP 339

Query: 164 MQPQHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNLA 15
               H+A PQ +       P  ++    + +   DSGASHH+T NPNNLA
Sbjct: 340 RPSAHLALPQYY------NPIPDMDSVSAASWYPDSGASHHLTFNPNNLA 383


>ref|XP_020211686.1| uncharacterized protein LOC109796422 [Cajanus cajan]
          Length = 465

 Score = 58.2 bits (139), Expect = 5e-07
 Identities = 40/118 (33%), Positives = 53/118 (44%), Gaps = 9/118 (7%)
 Frame = -3

Query: 344 NVQCQVCYKIGHDDAVCYHRFNAHYTP----AYYSP---QPQQIAGNPYQYVRPAPQSSM 186
           N QCQ+C K  H   +C++R +A+Y P      Y P   QP Q+   P        ++S 
Sbjct: 175 NFQCQICLKYSHTANICFYRADANYHPHDSLVLYDPSTLQPVQVNSPP-----SITKTSN 229

Query: 185 VWTNPQLMQPQHVAYPQAFIGYAASGPAQNIP*QQSQNR--IVDSGASHHITGNPNNL 18
            W NP   QP       +      S    N P Q + N   I DSGAS H+TG P N+
Sbjct: 230 SWGNPTSKQPSQDTNVNS---VTPSAMLANTPSQGAVNSTWIPDSGASFHVTGEPQNV 284


>gb|KYP71160.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 1268

 Score = 58.2 bits (139), Expect = 6e-07
 Identities = 40/118 (33%), Positives = 53/118 (44%), Gaps = 9/118 (7%)
 Frame = -3

Query: 344 NVQCQVCYKIGHDDAVCYHRFNAHYTP----AYYSP---QPQQIAGNPYQYVRPAPQSSM 186
           N QCQ+C K  H   +C++R +A+Y P      Y P   QP Q+   P        ++S 
Sbjct: 175 NFQCQICLKYSHTANICFYRADANYHPHDSLVLYDPSTLQPVQVNSPP-----SITKTSN 229

Query: 185 VWTNPQLMQPQHVAYPQAFIGYAASGPAQNIP*QQSQNR--IVDSGASHHITGNPNNL 18
            W NP   QP       +      S    N P Q + N   I DSGAS H+TG P N+
Sbjct: 230 SWGNPTSKQPSQDTNVNS---VTPSAMLANTPSQGAVNSTWIPDSGASFHVTGEPQNV 284


>ref|XP_015966155.1| uncharacterized protein LOC107489903 [Arachis duranensis]
          Length = 251

 Score = 57.0 bits (136), Expect = 8e-07
 Identities = 41/117 (35%), Positives = 50/117 (42%), Gaps = 6/117 (5%)
 Frame = -3

Query: 338 QCQVCYKIGHDDAVCYHRF-----NAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVWTN 174
           QCQ+C KIGH    CYHRF     N H  P   +  P     N      P  Q      +
Sbjct: 128 QCQLCGKIGHTVIQCYHRFDQDFMNPHLQPLNTAQPPSLAFHNNSSPNTPQQQQQ----S 183

Query: 173 PQLMQPQHVAYPQAFIGYAASGPAQNIP*QQSQNR-IVDSGASHHITGNPNNLANAT 6
           P  +QP H   PQA +          +P   S     +DSGASHH+T +P NL   T
Sbjct: 184 PTTLQPTH-PNPQALL---------TVPLSVSDTAWYLDSGASHHVTYDPRNLTTGT 230


>dbj|GAU29238.1| hypothetical protein TSUD_362280 [Trifolium subterraneum]
          Length = 1433

 Score = 57.4 bits (137), Expect = 1e-06
 Identities = 39/112 (34%), Positives = 51/112 (45%)
 Frame = -3

Query: 341 VQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVWTNPQLM 162
           VQCQ+C K  HD A+C++R    Y P   S +      N     RP P       NP   
Sbjct: 296 VQCQICGKANHDAAICWYR----YEPP--SSRSNACGHNAGSSSRPPPY------NPYPR 343

Query: 161 QPQHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNLANAT 6
              H+A PQ +       P  ++    + +   DSGASHH+T NPNNL   T
Sbjct: 344 PSAHLALPQYY------NPIADMDSVSNASWYPDSGASHHLTFNPNNLTYRT 389


>gb|PNX93258.1| histone deacetylase, partial [Trifolium pratense]
          Length = 1438

 Score = 57.4 bits (137), Expect = 1e-06
 Identities = 39/112 (34%), Positives = 49/112 (43%)
 Frame = -3

Query: 341 VQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVWTNPQLM 162
           VQCQ+C K  H+   C+HR         Y P PQ    NP  Y  P+      + NP   
Sbjct: 295 VQCQICGKANHEALNCWHR---------YEP-PQSARPNPRGYNAPSGSRPPHY-NPYAR 343

Query: 161 QPQHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNLANAT 6
              H+A PQ F  +  +          S +   DSGASHH+T NPNN    T
Sbjct: 344 PTAHLAIPQYFPSFPDNDSIS------SASWYPDSGASHHLTYNPNNFVYRT 389


>gb|PNX81791.1| hypothetical protein L195_g037816, partial [Trifolium pratense]
          Length = 258

 Score = 56.6 bits (135), Expect = 1e-06
 Identities = 42/112 (37%), Positives = 48/112 (42%), Gaps = 1/112 (0%)
 Frame = -3

Query: 344 NVQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVWTNP-Q 168
           NVQ Q+C K GHD  +CYHR +A   P             P+Q     P S   W NP  
Sbjct: 75  NVQRQICEKFGHDARICYHRNSAVVQP-------------PWQVAPARPPSGNQWLNPWH 121

Query: 167 LMQPQHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNLAN 12
             QP   AYP          PA       SQ    DSGA+HH+T N NN  N
Sbjct: 122 SAQPHPSAYPP---------PA-------SQLWYPDSGATHHVT-NTNNSEN 156


>gb|PNX84343.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 560

 Score = 57.0 bits (136), Expect = 1e-06
 Identities = 35/107 (32%), Positives = 51/107 (47%)
 Frame = -3

Query: 341 VQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVWTNPQLM 162
           VQCQ+C +  HD  +C++R+++       + +PQ    N     RPA        NP   
Sbjct: 293 VQCQICDRPNHDATICWYRYDSS------NSKPQARGYNASSNPRPAH------FNPYAR 340

Query: 161 QPQHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNN 21
              H+A PQ +       P+ +     S +   DSGASHH+T NPNN
Sbjct: 341 PSAHLAIPQYY------APSADFDSMSSASWYPDSGASHHLTYNPNN 381


>ref|XP_016163061.1| uncharacterized protein LOC107605630 [Arachis ipaensis]
          Length = 1595

 Score = 57.0 bits (136), Expect = 1e-06
 Identities = 44/112 (39%), Positives = 51/112 (45%), Gaps = 5/112 (4%)
 Frame = -3

Query: 338 QCQVCYKIGHDDAVCYHRFNAHYT-----PAYYSPQPQQIAGNPYQYVRPAPQSSMVWTN 174
           QCQVC KIGH    CYHRF+  YT     P   +P P     N        PQSS     
Sbjct: 283 QCQVCGKIGHIALQCYHRFDQSYTNPQLQPLNATPPPSMAFHNGGLVQHHTPQSS----- 337

Query: 173 PQLMQPQHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNL 18
           PQ  QP     PQA+I   ++ P              DSGASHHIT + +NL
Sbjct: 338 PQ--QPA-APSPQAYIALPSAVP--------DAGWYPDSGASHHITFDQSNL 378


>ref|XP_017640210.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT
           1-94 [Gossypium arboreum]
          Length = 604

 Score = 56.6 bits (135), Expect = 2e-06
 Identities = 36/120 (30%), Positives = 53/120 (44%), Gaps = 8/120 (6%)
 Frame = -3

Query: 341 VQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQYVRPA----PQSSMVW-- 180
           +QCQ+C K+GH    CYHRF+  Y    Y P   Q+  +P  Y++P     P S M W  
Sbjct: 270 IQCQLCGKMGHLVDRCYHRFDLSYKNTGYRPSSSQVGSSPPPYMQPGWVIPPTSPMSWNA 329

Query: 179 TNPQLMQPQHV--AYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNLANAT 6
             PQ      V  + PQA++    +                DS A+HH+T +   +  +T
Sbjct: 330 NTPQTSYTSTVSTSSPQAYVATPET--------VYDNAWFPDSSATHHLTHSATAIGEST 381


>ref|XP_016192001.1| uncharacterized protein LOC107632877 [Arachis ipaensis]
          Length = 275

 Score = 55.1 bits (131), Expect = 5e-06
 Identities = 35/111 (31%), Positives = 53/111 (47%)
 Frame = -3

Query: 341 VQCQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVWTNPQLM 162
           VQCQ+C ++GH    CYHRF+  + P   S  P      PY    P P            
Sbjct: 57  VQCQLCGRLGHVVWNCYHRFDHSFNP--NSGNPSTTPQIPYINAHPPP------------ 102

Query: 161 QPQHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNLANA 9
            P +   P AF+    + P+ ++      + + DSGASHH+T +P+NL ++
Sbjct: 103 PPANFHQPTAFL----TAPSSSL---SDASWLADSGASHHLTPDPSNLLSS 146


>gb|PNY10805.1| histone deacetylase, partial [Trifolium pratense]
          Length = 720

 Score = 55.5 bits (132), Expect = 5e-06
 Identities = 36/111 (32%), Positives = 49/111 (44%)
 Frame = -3

Query: 335 CQVCYKIGHDDAVCYHRFNAHYTPAYYSPQPQQIAGNPYQYVRPAPQSSMVWTNPQLMQP 156
           CQ+C K GH    C++R++ ++ P   +P PQ     P     PAPQ+            
Sbjct: 269 CQLCNKYGHHVRDCWYRYDENFVPVQANPVPQPPPPPPRDTQAPAPQACTA--------- 319

Query: 155 QHVAYPQAFIGYAASGPAQNIP*QQSQNRIVDSGASHHITGNPNNLANATV 3
                      +AAS     IP    Q+   DSGASHHIT + +NLA   V
Sbjct: 320 ----------NFAASTQELVIP----QSWFPDSGASHHITADASNLAQGKV 356

