BLASTX nr result

ID: Astragalus22_contig00025683 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00025683
         (364 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_019443170.1| PREDICTED: neurofilament heavy polypeptide-l...   116   8e-28
ref|XP_019443169.1| PREDICTED: muscle M-line assembly protein un...   116   8e-28
ref|XP_019443168.1| PREDICTED: muscle M-line assembly protein un...   116   8e-28
ref|XP_019443167.1| PREDICTED: proteoglycan 4-like isoform X1 [L...   116   8e-28
gb|KRH15183.1| hypothetical protein GLYMA_14G073600 [Glycine max]      95   2e-22
gb|PNX81581.1| hypothetical protein L195_g037604 [Trifolium prat...    94   6e-21
ref|XP_002510405.1| PREDICTED: serine/arginine repetitive matrix...    96   2e-20
ref|XP_021613295.1| proteoglycan 4 [Manihot esculenta] >gi|10359...    95   3e-20
ref|XP_006595931.1| PREDICTED: muscle M-line assembly protein un...    95   3e-20
ref|XP_003527688.1| PREDICTED: uncharacterized protein PB18E9.04...    93   4e-20
dbj|GAU34965.1| hypothetical protein TSUD_312960 [Trifolium subt...    92   4e-20
ref|NP_001304413.2| uncharacterized protein LOC100785981 [Glycin...    93   4e-20
gb|KHN22246.1| hypothetical protein glysoja_022002 [Glycine soja]      91   2e-19
ref|XP_012071885.1| proteoglycan 4 [Jatropha curcas] >gi|6437311...    92   2e-19
dbj|GAY43064.1| hypothetical protein CUMW_071720 [Citrus unshiu]       93   2e-19
ref|XP_007136602.1| hypothetical protein PHAVU_009G058700g [Phas...    91   2e-19
ref|XP_015384333.1| PREDICTED: neurofilament heavy polypeptide i...    92   2e-19
gb|KDO84591.1| hypothetical protein CISIN_1g005888mg [Citrus sin...    92   2e-19
ref|XP_006473518.1| PREDICTED: proteoglycan 4 isoform X1 [Citrus...    92   2e-19
ref|XP_006435013.1| proteoglycan 4 [Citrus clementina] >gi|55753...    92   2e-19

>ref|XP_019443170.1| PREDICTED: neurofilament heavy polypeptide-like isoform X4 [Lupinus
           angustifolius]
          Length = 704

 Score =  116 bits (291), Expect = 8e-28
 Identities = 57/94 (60%), Positives = 74/94 (78%), Gaps = 5/94 (5%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPYD-IDTQNHRTEFNLSRVQKSTYQPIVRRRCLRGLFH 179
           S+ ERDPGVQV +P+KP +  K  D +  + H+TEFN+SRV+KS Y+P+VRRRCLRGLF 
Sbjct: 611 SVTERDPGVQVILPQKPADPIKYDDKLSPETHKTEFNISRVEKSNYKPMVRRRCLRGLFV 670

Query: 180 EPNDS--DDPNKPRSHGCKFNCGKNN--EDIENL 269
           EP+DS  D+P+KPR HGCKF+C KN   EDIE++
Sbjct: 671 EPSDSDPDNPDKPRRHGCKFSCDKNEKVEDIEDM 704


>ref|XP_019443169.1| PREDICTED: muscle M-line assembly protein unc-89-like isoform X3
           [Lupinus angustifolius]
          Length = 720

 Score =  116 bits (291), Expect = 8e-28
 Identities = 57/94 (60%), Positives = 74/94 (78%), Gaps = 5/94 (5%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPYD-IDTQNHRTEFNLSRVQKSTYQPIVRRRCLRGLFH 179
           S+ ERDPGVQV +P+KP +  K  D +  + H+TEFN+SRV+KS Y+P+VRRRCLRGLF 
Sbjct: 627 SVTERDPGVQVILPQKPADPIKYDDKLSPETHKTEFNISRVEKSNYKPMVRRRCLRGLFV 686

Query: 180 EPNDS--DDPNKPRSHGCKFNCGKNN--EDIENL 269
           EP+DS  D+P+KPR HGCKF+C KN   EDIE++
Sbjct: 687 EPSDSDPDNPDKPRRHGCKFSCDKNEKVEDIEDM 720


>ref|XP_019443168.1| PREDICTED: muscle M-line assembly protein unc-89-like isoform X2
           [Lupinus angustifolius]
          Length = 723

 Score =  116 bits (291), Expect = 8e-28
 Identities = 57/94 (60%), Positives = 74/94 (78%), Gaps = 5/94 (5%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPYD-IDTQNHRTEFNLSRVQKSTYQPIVRRRCLRGLFH 179
           S+ ERDPGVQV +P+KP +  K  D +  + H+TEFN+SRV+KS Y+P+VRRRCLRGLF 
Sbjct: 630 SVTERDPGVQVILPQKPADPIKYDDKLSPETHKTEFNISRVEKSNYKPMVRRRCLRGLFV 689

Query: 180 EPNDS--DDPNKPRSHGCKFNCGKNN--EDIENL 269
           EP+DS  D+P+KPR HGCKF+C KN   EDIE++
Sbjct: 690 EPSDSDPDNPDKPRRHGCKFSCDKNEKVEDIEDM 723


>ref|XP_019443167.1| PREDICTED: proteoglycan 4-like isoform X1 [Lupinus angustifolius]
 gb|OIW12047.1| hypothetical protein TanjilG_20885 [Lupinus angustifolius]
          Length = 733

 Score =  116 bits (291), Expect = 8e-28
 Identities = 57/94 (60%), Positives = 74/94 (78%), Gaps = 5/94 (5%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPYD-IDTQNHRTEFNLSRVQKSTYQPIVRRRCLRGLFH 179
           S+ ERDPGVQV +P+KP +  K  D +  + H+TEFN+SRV+KS Y+P+VRRRCLRGLF 
Sbjct: 640 SVTERDPGVQVILPQKPADPIKYDDKLSPETHKTEFNISRVEKSNYKPMVRRRCLRGLFV 699

Query: 180 EPNDS--DDPNKPRSHGCKFNCGKNN--EDIENL 269
           EP+DS  D+P+KPR HGCKF+C KN   EDIE++
Sbjct: 700 EPSDSDPDNPDKPRRHGCKFSCDKNEKVEDIEDM 733


>gb|KRH15183.1| hypothetical protein GLYMA_14G073600 [Glycine max]
          Length = 151

 Score = 95.1 bits (235), Expect = 2e-22
 Identities = 54/94 (57%), Positives = 65/94 (69%), Gaps = 5/94 (5%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPYD-----IDTQNHRTEFNLSRVQKSTYQPIVRRRCLR 167
           SI ERDPGVQVT+P+KP E  KP D     ++TQ  RT+FN++R +KSTYQP V  R +R
Sbjct: 66  SITERDPGVQVTLPQKPAEPIKPDDKPNPGLETQ--RTQFNINRAEKSTYQPTV-GRSIR 122

Query: 168 GLFHEPNDSDDPNKPRSHGCKFNCGKNNEDIENL 269
           G F EPND     KPR HGC F+C K+ EDIE L
Sbjct: 123 GPFLEPND-----KPRRHGCNFSCDKDIEDIEIL 151


>gb|PNX81581.1| hypothetical protein L195_g037604 [Trifolium pratense]
          Length = 281

 Score = 94.4 bits (233), Expect = 6e-21
 Identities = 50/88 (56%), Positives = 63/88 (71%), Gaps = 3/88 (3%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPYDIDTQNHRTEFN-LSRVQKSTYQPIVRRRCLRGLFH 179
           SIN RDPGV+V +P++P    K    D +N R E   +SRV+K TYQP+VRRRCLRGL  
Sbjct: 195 SINGRDPGVRVILPQQPEPRVKQ---DLENRRDEVKTVSRVEKLTYQPMVRRRCLRGLMV 251

Query: 180 EPNDS--DDPNKPRSHGCKFNCGKNNED 257
           EP+DS  D+P+KPR HGCKF+CG   +D
Sbjct: 252 EPSDSDPDNPDKPRRHGCKFSCGDVRKD 279


>ref|XP_002510405.1| PREDICTED: serine/arginine repetitive matrix protein 1 [Ricinus
           communis]
 gb|EEF52592.1| oxidoreductase, putative [Ricinus communis]
          Length = 551

 Score = 95.5 bits (236), Expect = 2e-20
 Identities = 45/89 (50%), Positives = 62/89 (69%), Gaps = 4/89 (4%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPY--DIDTQNHRTEFNLSRVQKSTYQPIVRRRCLRGLF 176
           S+NE+DPGV + +     E +KP       + H+TEFN++R QK TY+P +RRRCLRGLF
Sbjct: 457 SVNEQDPGVHLALSHNLAESSKPSAKPEPLETHKTEFNVTRSQKLTYEPTIRRRCLRGLF 516

Query: 177 HEPNDS--DDPNKPRSHGCKFNCGKNNED 257
            EP+DS  D+P KPR HGC + CG+ ++D
Sbjct: 517 LEPSDSDNDNPEKPRRHGCLYACGEKSKD 545


>ref|XP_021613295.1| proteoglycan 4 [Manihot esculenta]
 gb|OAY50708.1| hypothetical protein MANES_05G158000 [Manihot esculenta]
          Length = 552

 Score = 95.1 bits (235), Expect = 3e-20
 Identities = 46/91 (50%), Positives = 66/91 (72%), Gaps = 6/91 (6%)
 Frame = +3

Query: 3   SINERDPGVQVTIPK---KPLEVN-KPYDIDTQNHRTEFNLSRVQKSTYQPIVRRRCLRG 170
           S+NER+PGVQ+ + +   +P+  N KP  ++T  H+ EFN++  QK TY+P +RRRCLRG
Sbjct: 458 SVNERNPGVQLVLSQNLAEPINPNAKPETMET--HKAEFNITPAQKLTYEPTIRRRCLRG 515

Query: 171 LFHEPNDS--DDPNKPRSHGCKFNCGKNNED 257
           LF EP+DS  D+P KPR HGC++NC +  +D
Sbjct: 516 LFLEPSDSDPDNPEKPRRHGCRYNCAEMGKD 546


>ref|XP_006595931.1| PREDICTED: muscle M-line assembly protein unc-89-like [Glycine max]
 gb|KHN20971.1| hypothetical protein glysoja_009279 [Glycine soja]
          Length = 616

 Score = 95.1 bits (235), Expect = 3e-20
 Identities = 54/94 (57%), Positives = 65/94 (69%), Gaps = 5/94 (5%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPYD-----IDTQNHRTEFNLSRVQKSTYQPIVRRRCLR 167
           SI ERDPGVQVT+P+KP E  KP D     ++TQ  RT+FN++R +KSTYQP V  R +R
Sbjct: 531 SITERDPGVQVTLPQKPAEPIKPDDKPNPGLETQ--RTQFNINRAEKSTYQPTV-GRSIR 587

Query: 168 GLFHEPNDSDDPNKPRSHGCKFNCGKNNEDIENL 269
           G F EPND     KPR HGC F+C K+ EDIE L
Sbjct: 588 GPFLEPND-----KPRRHGCNFSCDKDIEDIEIL 616


>ref|XP_003527688.1| PREDICTED: uncharacterized protein PB18E9.04c-like [Glycine max]
          Length = 316

 Score = 92.8 bits (229), Expect = 4e-20
 Identities = 47/92 (51%), Positives = 65/92 (70%), Gaps = 3/92 (3%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPYDIDTQNHRTEFNLSRVQKSTYQPIVRRRCLRGLFHE 182
           SIN RDPGV+V +P++P  + KP        + E +++R +   Y+P+VRRRCLRGLFHE
Sbjct: 232 SINGRDPGVRVILPQQPC-LEKP--------KAEVSINRAEMVPYRPVVRRRCLRGLFHE 282

Query: 183 PNDS--DDPNKPRSHGCKFNCGK-NNEDIENL 269
           P+DS  D+P+KPR HGCKF CG  + +D EN+
Sbjct: 283 PSDSEPDNPDKPRRHGCKFRCGDIDTKDKENV 314


>dbj|GAU34965.1| hypothetical protein TSUD_312960 [Trifolium subterraneum]
          Length = 272

 Score = 92.0 bits (227), Expect = 4e-20
 Identities = 50/90 (55%), Positives = 63/90 (70%), Gaps = 1/90 (1%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPYDIDTQNHRTEFN-LSRVQKSTYQPIVRRRCLRGLFH 179
           SIN RDPGV+V +P+ P    K    D +N R E   +S+V+K TY+P+VRRRCLRGL  
Sbjct: 187 SINGRDPGVRVILPQHPEPQVKR---DLENRRDEVKTVSQVEKLTYRPMVRRRCLRGLMA 243

Query: 180 EPNDSDDPNKPRSHGCKFNCGKNNEDIENL 269
           EP+DS DP+KPR HGCKF+CG   +D E L
Sbjct: 244 EPSDS-DPDKPRRHGCKFSCGDVKKDNEIL 272


>ref|NP_001304413.2| uncharacterized protein LOC100785981 [Glycine max]
 gb|KHN45860.1| hypothetical protein glysoja_023083 [Glycine soja]
 gb|KRH61151.1| hypothetical protein GLYMA_04G031100 [Glycine max]
          Length = 346

 Score = 93.2 bits (230), Expect = 4e-20
 Identities = 44/83 (53%), Positives = 63/83 (75%), Gaps = 3/83 (3%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPL-EVNKPYDIDTQNHRTEFNLSRVQKSTYQPIVRRRCLRGLFH 179
           SIN RDPGV+V +P++P  +V +P     +  + E +++RV++  Y+P+VRRRCLRGLFH
Sbjct: 255 SINGRDPGVRVILPQQPQPDVKQPC---LEKPKAEVSINRVERVPYRPVVRRRCLRGLFH 311

Query: 180 EPNDSD--DPNKPRSHGCKFNCG 242
           EP+DS+  +P+KPR HGCKF CG
Sbjct: 312 EPSDSEPHNPDKPRRHGCKFRCG 334


>gb|KHN22246.1| hypothetical protein glysoja_022002 [Glycine soja]
          Length = 316

 Score = 91.3 bits (225), Expect = 2e-19
 Identities = 44/82 (53%), Positives = 59/82 (71%), Gaps = 2/82 (2%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPYDIDTQNHRTEFNLSRVQKSTYQPIVRRRCLRGLFHE 182
           SIN RDPGV+V +P++P  + KP        + E +++R +   Y+P+VRRRCLRGLFHE
Sbjct: 232 SINGRDPGVRVILPQQPC-LEKP--------KAEVSINRAEMVPYRPVVRRRCLRGLFHE 282

Query: 183 PNDS--DDPNKPRSHGCKFNCG 242
           P+DS  D+P+KPR HGCKF CG
Sbjct: 283 PSDSEPDNPDKPRRHGCKFRCG 304


>ref|XP_012071885.1| proteoglycan 4 [Jatropha curcas]
 gb|KDP38520.1| hypothetical protein JCGZ_04445 [Jatropha curcas]
          Length = 450

 Score = 92.4 bits (228), Expect = 2e-19
 Identities = 42/89 (47%), Positives = 62/89 (69%), Gaps = 4/89 (4%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPLEVNKPY--DIDTQNHRTEFNLSRVQKSTYQPIVRRRCLRGLF 176
           S+NER PG+Q+++    +E  KP       ++H+ EFN++  QK TY+P +RRRCLRGLF
Sbjct: 356 SVNERSPGIQLSLSNNVVEPTKPSAKPETIESHKAEFNVTPAQKLTYEPTIRRRCLRGLF 415

Query: 177 HEPNDS--DDPNKPRSHGCKFNCGKNNED 257
            E +DS  D+P+KPR HGC++ CG  ++D
Sbjct: 416 LESSDSDPDNPDKPRRHGCRYYCGDKSKD 444


>dbj|GAY43064.1| hypothetical protein CUMW_071720 [Citrus unshiu]
          Length = 671

 Score = 92.8 bits (229), Expect = 2e-19
 Identities = 45/95 (47%), Positives = 61/95 (64%), Gaps = 8/95 (8%)
 Frame = +3

Query: 3   SINERDPGVQVTI------PKKPLEVNKPYDIDTQNHRTEFNLSRVQKSTYQPIVRRRCL 164
           S+NER+PGV +        P KP    KP  ++T  H  +  ++  +K TYQP VRRRCL
Sbjct: 575 SVNERNPGVHLVFSHNLAEPTKP--ATKPETLETHGHEAKVTITPSEKLTYQPTVRRRCL 632

Query: 165 RGLFHEPNDS--DDPNKPRSHGCKFNCGKNNEDIE 263
           RGLF EP+DS  D+P KPR HGC +NCG+ ++D +
Sbjct: 633 RGLFMEPSDSDPDNPEKPRRHGCLYNCGEKSKDTD 667


>ref|XP_007136602.1| hypothetical protein PHAVU_009G058700g [Phaseolus vulgaris]
 gb|ESW08596.1| hypothetical protein PHAVU_009G058700g [Phaseolus vulgaris]
          Length = 353

 Score = 91.3 bits (225), Expect = 2e-19
 Identities = 44/88 (50%), Positives = 64/88 (72%), Gaps = 3/88 (3%)
 Frame = +3

Query: 3   SINERDPGVQVTIPKKPL-EVNKPYDIDTQNHRTEFNLSRVQKSTYQPIVRRRCLRGLFH 179
           +IN RDPGV+V +P++P  +V +P     +  + E +++RV++  Y+P+VRRRCLRGLF 
Sbjct: 263 TINGRDPGVRVILPQQPQPDVKEPC---LEKPKAEVSINRVERVPYRPVVRRRCLRGLFL 319

Query: 180 EPNDS--DDPNKPRSHGCKFNCGKNNED 257
           EP+DS  D+P+KPR HGCK  CG N+ D
Sbjct: 320 EPSDSEHDNPDKPRRHGCKVRCGDNSND 347


>ref|XP_015384333.1| PREDICTED: neurofilament heavy polypeptide isoform X2 [Citrus
           sinensis]
          Length = 612

 Score = 92.4 bits (228), Expect = 2e-19
 Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 8/93 (8%)
 Frame = +3

Query: 3   SINERDPGVQVTI------PKKPLEVNKPYDIDTQNHRTEFNLSRVQKSTYQPIVRRRCL 164
           S+NER+PGV +        P KP    KP  ++T  H  +  ++  +K TYQP VRRRCL
Sbjct: 516 SVNERNPGVHLVFSHNLAEPTKP--ATKPETLETHGHEAKVTITPSEKLTYQPTVRRRCL 573

Query: 165 RGLFHEPNDS--DDPNKPRSHGCKFNCGKNNED 257
           RGLF EP+DS  D+P KPR HGC +NCG+ ++D
Sbjct: 574 RGLFMEPSDSDPDNPEKPRRHGCLYNCGEKSKD 606


>gb|KDO84591.1| hypothetical protein CISIN_1g005888mg [Citrus sinensis]
          Length = 671

 Score = 92.4 bits (228), Expect = 2e-19
 Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 8/93 (8%)
 Frame = +3

Query: 3   SINERDPGVQVTI------PKKPLEVNKPYDIDTQNHRTEFNLSRVQKSTYQPIVRRRCL 164
           S+NER+PGV +        P KP    KP  ++T  H  +  ++  +K TYQP VRRRCL
Sbjct: 575 SVNERNPGVHLVFSHNLAEPTKP--ATKPETLETHGHEAKVTITPSEKLTYQPTVRRRCL 632

Query: 165 RGLFHEPNDS--DDPNKPRSHGCKFNCGKNNED 257
           RGLF EP+DS  D+P KPR HGC +NCG+ ++D
Sbjct: 633 RGLFMEPSDSDPDNPEKPRRHGCLYNCGEKSKD 665


>ref|XP_006473518.1| PREDICTED: proteoglycan 4 isoform X1 [Citrus sinensis]
          Length = 687

 Score = 92.4 bits (228), Expect = 2e-19
 Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 8/93 (8%)
 Frame = +3

Query: 3   SINERDPGVQVTI------PKKPLEVNKPYDIDTQNHRTEFNLSRVQKSTYQPIVRRRCL 164
           S+NER+PGV +        P KP    KP  ++T  H  +  ++  +K TYQP VRRRCL
Sbjct: 591 SVNERNPGVHLVFSHNLAEPTKP--ATKPETLETHGHEAKVTITPSEKLTYQPTVRRRCL 648

Query: 165 RGLFHEPNDS--DDPNKPRSHGCKFNCGKNNED 257
           RGLF EP+DS  D+P KPR HGC +NCG+ ++D
Sbjct: 649 RGLFMEPSDSDPDNPEKPRRHGCLYNCGEKSKD 681


>ref|XP_006435013.1| proteoglycan 4 [Citrus clementina]
 gb|ESR48253.1| hypothetical protein CICLE_v10000483mg [Citrus clementina]
          Length = 687

 Score = 92.4 bits (228), Expect = 2e-19
 Identities = 45/93 (48%), Positives = 60/93 (64%), Gaps = 8/93 (8%)
 Frame = +3

Query: 3   SINERDPGVQVTI------PKKPLEVNKPYDIDTQNHRTEFNLSRVQKSTYQPIVRRRCL 164
           S+NER+PGV +        P KP    KP  ++T  H  +  ++  +K TYQP VRRRCL
Sbjct: 591 SVNERNPGVHLVFSHNLAEPTKP--ATKPETLETHGHEAKVTITPSEKLTYQPTVRRRCL 648

Query: 165 RGLFHEPNDS--DDPNKPRSHGCKFNCGKNNED 257
           RGLF EP+DS  D+P KPR HGC +NCG+ ++D
Sbjct: 649 RGLFMEPSDSDPDNPEKPRRHGCLYNCGEKSKD 681

