BLASTX nr result

ID: Astragalus23_contig00024838 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00024838
         (778 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003630936.1| PPR containing plant-like protein [Medicago ...    68   4e-09
gb|PNY11660.1| pentatricopeptide repeat-containing protein at4g2...    64   5e-08
ref|XP_004503357.1| PREDICTED: pentatricopeptide repeat-containi...    64   7e-08
gb|KHN41349.1| Pentatricopeptide repeat-containing protein [Glyc...    64   1e-07
ref|XP_006587119.1| PREDICTED: pentatricopeptide repeat-containi...    64   1e-07
ref|XP_020203036.1| pentatricopeptide repeat-containing protein ...    62   3e-07
ref|XP_007138522.1| hypothetical protein PHAVU_009G216300g [Phas...    61   8e-07
gb|KYP39482.1| Pentatricopeptide repeat-containing protein At4g2...    60   1e-06
gb|KHN05282.1| Pentatricopeptide repeat-containing protein [Glyc...    60   1e-06
gb|KRH12777.1| hypothetical protein GLYMA_15G193700 [Glycine max]      60   1e-06
ref|XP_003547574.1| PREDICTED: pentatricopeptide repeat-containi...    60   1e-06
ref|XP_014501087.1| pentatricopeptide repeat-containing protein ...    59   5e-06

>ref|XP_003630936.1| PPR containing plant-like protein [Medicago truncatula]
 gb|AET05412.1| PPR containing plant-like protein [Medicago truncatula]
          Length = 959

 Score = 67.8 bits (164), Expect = 4e-09
 Identities = 34/81 (41%), Positives = 50/81 (61%)
 Frame = -1

Query: 244 VTSYPLVASTNEVVLLFIAMCFTGVKQDFQHLISYVCFYPPVLEYGCPKYCRKNYNYIGK 65
           +  Y     T+E V LF AM  +GVK D    I++  F P VL+ G  KYC++ ++YI +
Sbjct: 351 IAGYVQNGFTDEAVALFKAMVTSGVKLDS---ITFASFLPSVLKSGSLKYCKEVHSYIVR 407

Query: 64  HGLTFDIYVRSILIDKYFNNG 2
           HG+ FD+Y++S L+D YF  G
Sbjct: 408 HGVPFDVYLKSALVDIYFKGG 428


>gb|PNY11660.1| pentatricopeptide repeat-containing protein at4g21300-like protein
           [Trifolium pratense]
          Length = 768

 Score = 64.3 bits (155), Expect = 5e-08
 Identities = 32/81 (39%), Positives = 49/81 (60%)
 Frame = -1

Query: 244 VTSYPLVASTNEVVLLFIAMCFTGVKQDFQHLISYVCFYPPVLEYGCPKYCRKNYNYIGK 65
           +  Y     T+E V LF AM  +GVK D    I++  F P +LE G  K+C++ ++YI +
Sbjct: 235 IAGYVQNGFTDEAVALFKAMIASGVKLDS---ITFASFLPSILESGTLKHCKEVHSYIVR 291

Query: 64  HGLTFDIYVRSILIDKYFNNG 2
           H + FD+Y++S L+D YF  G
Sbjct: 292 HDVPFDVYLKSALVDIYFKGG 312


>ref|XP_004503357.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300
           [Cicer arietinum]
 ref|XP_012572043.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300
           [Cicer arietinum]
          Length = 875

 Score = 63.9 bits (154), Expect = 7e-08
 Identities = 32/81 (39%), Positives = 48/81 (59%)
 Frame = -1

Query: 244 VTSYPLVASTNEVVLLFIAMCFTGVKQDFQHLISYVCFYPPVLEYGCPKYCRKNYNYIGK 65
           +  Y     T+E V LF AM  +GVK D    I++  F P +LE G    C++ ++YI +
Sbjct: 348 IAGYVQNGFTDEAVTLFKAMIASGVKPDS---ITFASFLPSILESGSLNNCKEVHSYIVR 404

Query: 64  HGLTFDIYVRSILIDKYFNNG 2
           HG+ FD+Y++S L+D YF  G
Sbjct: 405 HGVPFDVYLKSALVDIYFKGG 425


>gb|KHN41349.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 788

 Score = 63.5 bits (153), Expect = 1e-07
 Identities = 56/228 (24%), Positives = 97/228 (42%), Gaps = 26/228 (11%)
 Frame = -1

Query: 607 DMETSADLVSKQNDDGTTSVINYIDDKLHVTDTIIEN-------------NCFFTFRCHD 467
           D+   + L+    D+G       + D+L + DTI+ N             N   TF C  
Sbjct: 121 DLFAGSALIKLYADNGYIRDARRVFDELPLRDTILWNVMLRGYVKSGDFDNAIGTF-CEM 179

Query: 466 HNKYLETINV-------VALDRSTMPPSAHVQGTILGLVNGGTGSKFIAGHVEECSIFIY 308
              Y    +V       +   R        + G ++G     +G +F    V    + +Y
Sbjct: 180 RTSYSMVNSVTYTCILSICATRGNFCAGTQLHGLVIG-----SGFEFDP-QVANTLVAMY 233

Query: 307 GEYLQMAFVALIYHPQGYDVSVTSYPLVAS------TNEVVLLFIAMCFTGVKQDFQHLI 146
            +   + +   +++      +VT   L+A       T+E   LF AM   GVK D    +
Sbjct: 234 SKCGNLLYARKLFNTMPQTDTVTWNGLIAGYVQNGFTDEAAPLFNAMISAGVKPDS---V 290

Query: 145 SYVCFYPPVLEYGCPKYCRKNYNYIGKHGLTFDIYVRSILIDKYFNNG 2
           ++  F P +LE G  ++C++ ++YI +H + FD+Y++S LID YF  G
Sbjct: 291 TFASFLPSILESGSLRHCKEVHSYIVRHRVPFDVYLKSALIDVYFKGG 338


>ref|XP_006587119.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Glycine max]
 ref|XP_014617512.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Glycine max]
 ref|XP_014617513.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Glycine max]
 ref|XP_014617514.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Glycine max]
 ref|XP_014617515.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Glycine max]
 ref|XP_014617516.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Glycine max]
 gb|KRH37767.1| hypothetical protein GLYMA_09G088000 [Glycine max]
 gb|KRH37768.1| hypothetical protein GLYMA_09G088000 [Glycine max]
          Length = 848

 Score = 63.5 bits (153), Expect = 1e-07
 Identities = 56/228 (24%), Positives = 97/228 (42%), Gaps = 26/228 (11%)
 Frame = -1

Query: 607 DMETSADLVSKQNDDGTTSVINYIDDKLHVTDTIIEN-------------NCFFTFRCHD 467
           D+   + L+    D+G       + D+L + DTI+ N             N   TF C  
Sbjct: 181 DLFAGSALIKLYADNGYIRDARRVFDELPLRDTILWNVMLRGYVKSGDFDNAIGTF-CEM 239

Query: 466 HNKYLETINV-------VALDRSTMPPSAHVQGTILGLVNGGTGSKFIAGHVEECSIFIY 308
              Y    +V       +   R        + G ++G     +G +F    V    + +Y
Sbjct: 240 RTSYSMVNSVTYTCILSICATRGNFCAGTQLHGLVIG-----SGFEFDP-QVANTLVAMY 293

Query: 307 GEYLQMAFVALIYHPQGYDVSVTSYPLVAS------TNEVVLLFIAMCFTGVKQDFQHLI 146
            +   + +   +++      +VT   L+A       T+E   LF AM   GVK D    +
Sbjct: 294 SKCGNLLYARKLFNTMPQTDTVTWNGLIAGYVQNGFTDEAAPLFNAMISAGVKPDS---V 350

Query: 145 SYVCFYPPVLEYGCPKYCRKNYNYIGKHGLTFDIYVRSILIDKYFNNG 2
           ++  F P +LE G  ++C++ ++YI +H + FD+Y++S LID YF  G
Sbjct: 351 TFASFLPSILESGSLRHCKEVHSYIVRHRVPFDVYLKSALIDVYFKGG 398


>ref|XP_020203036.1| pentatricopeptide repeat-containing protein At4g21300 [Cajanus
           cajan]
 ref|XP_020203037.1| pentatricopeptide repeat-containing protein At4g21300 [Cajanus
           cajan]
 ref|XP_020203038.1| pentatricopeptide repeat-containing protein At4g21300 [Cajanus
           cajan]
 ref|XP_020203039.1| pentatricopeptide repeat-containing protein At4g21300 [Cajanus
           cajan]
 ref|XP_020203041.1| pentatricopeptide repeat-containing protein At4g21300 [Cajanus
           cajan]
          Length = 849

 Score = 62.0 bits (149), Expect = 3e-07
 Identities = 56/225 (24%), Positives = 96/225 (42%), Gaps = 23/225 (10%)
 Frame = -1

Query: 607 DMETSADLVSKQNDDGTTSVINYIDDKLHVTDTIIENNCFFTF-RCHDHNKYLETINVVA 431
           D+   + L+    D+G  +    + D+L   D I+ N     + +C D    +ET     
Sbjct: 182 DLFVGSALIKLYADNGYINDARLVFDELPQRDNILWNVMLNGYVKCGDFYNTIETFR--E 239

Query: 430 LDRSTMPPSAHVQGTILGLVNGGTGSKFIAG----------------HVEECSIFIYGEY 299
           +  S   PS+     +L +    T  KF  G                 V    + +Y + 
Sbjct: 240 MRTSFCKPSSVTYTCVLSMC--ATRGKFCVGTQLHGLVIGSGFEFDSQVANTLVAMYSKC 297

Query: 298 LQMAFVALIYHPQGYDVSVTSYPLVAS------TNEVVLLFIAMCFTGVKQDFQHLISYV 137
             +     +++      +VT   L+A       T+E   LF AM   GVK D    +++ 
Sbjct: 298 GNLFDARKLFNIMPQTDTVTWNGLIAGYVQNGFTDEAAPLFNAMISVGVKPDS---VTFA 354

Query: 136 CFYPPVLEYGCPKYCRKNYNYIGKHGLTFDIYVRSILIDKYFNNG 2
            F P +L+ G  K+C++ ++YI +H + FD+Y++S LID YF  G
Sbjct: 355 SFLPSILKSGSLKHCKEVHSYIVRHRIPFDVYLKSALIDIYFKGG 399


>ref|XP_007138522.1| hypothetical protein PHAVU_009G216300g [Phaseolus vulgaris]
 gb|ESW10516.1| hypothetical protein PHAVU_009G216300g [Phaseolus vulgaris]
          Length = 848

 Score = 60.8 bits (146), Expect = 8e-07
 Identities = 30/81 (37%), Positives = 48/81 (59%)
 Frame = -1

Query: 244 VTSYPLVASTNEVVLLFIAMCFTGVKQDFQHLISYVCFYPPVLEYGCPKYCRKNYNYIGK 65
           +  Y     T+E   LF AM   GVK D    +++  F P +L+ G  K+C++ ++YI +
Sbjct: 322 IAGYVQNGFTDEAAPLFNAMISAGVKPD---AVTFASFLPSILKSGSLKHCKEVHSYIVR 378

Query: 64  HGLTFDIYVRSILIDKYFNNG 2
           H + FD+Y++S LID YF +G
Sbjct: 379 HRVPFDVYLKSALIDIYFKSG 399


>gb|KYP39482.1| Pentatricopeptide repeat-containing protein At4g21300 family
           [Cajanus cajan]
          Length = 724

 Score = 60.5 bits (145), Expect = 1e-06
 Identities = 30/81 (37%), Positives = 47/81 (58%)
 Frame = -1

Query: 244 VTSYPLVASTNEVVLLFIAMCFTGVKQDFQHLISYVCFYPPVLEYGCPKYCRKNYNYIGK 65
           +  Y     T+E   LF AM   GVK D    +++  F P +L+ G  K+C++ ++YI +
Sbjct: 238 IAGYVQNGFTDEAAPLFNAMISVGVKPDS---VTFASFLPSILKSGSLKHCKEVHSYIVR 294

Query: 64  HGLTFDIYVRSILIDKYFNNG 2
           H + FD+Y++S LID YF  G
Sbjct: 295 HRIPFDVYLKSALIDIYFKGG 315


>gb|KHN05282.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 772

 Score = 60.5 bits (145), Expect = 1e-06
 Identities = 30/81 (37%), Positives = 47/81 (58%)
 Frame = -1

Query: 244 VTSYPLVASTNEVVLLFIAMCFTGVKQDFQHLISYVCFYPPVLEYGCPKYCRKNYNYIGK 65
           +  Y     T+E   LF AM   GVK D    +++  F P +LE G  ++C++ ++YI +
Sbjct: 245 IAGYVQNGFTDEAAPLFNAMISAGVKPDS---VTFASFLPSILESGSLRHCKEVHSYIVR 301

Query: 64  HGLTFDIYVRSILIDKYFNNG 2
           H + FD+Y++S LID YF  G
Sbjct: 302 HRVPFDVYLKSALIDIYFKGG 322


>gb|KRH12777.1| hypothetical protein GLYMA_15G193700 [Glycine max]
          Length = 825

 Score = 60.5 bits (145), Expect = 1e-06
 Identities = 30/81 (37%), Positives = 47/81 (58%)
 Frame = -1

Query: 244 VTSYPLVASTNEVVLLFIAMCFTGVKQDFQHLISYVCFYPPVLEYGCPKYCRKNYNYIGK 65
           +  Y     T+E   LF AM   GVK D    +++  F P +LE G  ++C++ ++YI +
Sbjct: 298 IAGYVQNGFTDEAAPLFNAMISAGVKPDS---VTFASFLPSILESGSLRHCKEVHSYIVR 354

Query: 64  HGLTFDIYVRSILIDKYFNNG 2
           H + FD+Y++S LID YF  G
Sbjct: 355 HRVPFDVYLKSALIDIYFKGG 375


>ref|XP_003547574.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Glycine max]
          Length = 846

 Score = 60.5 bits (145), Expect = 1e-06
 Identities = 30/81 (37%), Positives = 47/81 (58%)
 Frame = -1

Query: 244 VTSYPLVASTNEVVLLFIAMCFTGVKQDFQHLISYVCFYPPVLEYGCPKYCRKNYNYIGK 65
           +  Y     T+E   LF AM   GVK D    +++  F P +LE G  ++C++ ++YI +
Sbjct: 319 IAGYVQNGFTDEAAPLFNAMISAGVKPDS---VTFASFLPSILESGSLRHCKEVHSYIVR 375

Query: 64  HGLTFDIYVRSILIDKYFNNG 2
           H + FD+Y++S LID YF  G
Sbjct: 376 HRVPFDVYLKSALIDIYFKGG 396


>ref|XP_014501087.1| pentatricopeptide repeat-containing protein At4g21300 [Vigna
           radiata var. radiata]
          Length = 848

 Score = 58.5 bits (140), Expect = 5e-06
 Identities = 29/72 (40%), Positives = 44/72 (61%)
 Frame = -1

Query: 217 TNEVVLLFIAMCFTGVKQDFQHLISYVCFYPPVLEYGCPKYCRKNYNYIGKHGLTFDIYV 38
           ++E   LF AM   GVK D    +++  F P VL+ G  K+C++ + YI +H + FD+Y+
Sbjct: 330 SDEAAPLFNAMISAGVKPD---AVTFASFLPSVLKTGSLKHCKEVHGYIVRHRVPFDVYL 386

Query: 37  RSILIDKYFNNG 2
           +S LID YF  G
Sbjct: 387 KSALIDIYFKGG 398


Top