BLASTX nr result

ID: Cheilocostus21_contig00035030 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00035030
         (494 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009393287.1| PREDICTED: pentatricopeptide repeat-containi...   126   1e-30
ref|XP_020103187.1| pentatricopeptide repeat-containing protein ...    83   3e-15
ref|XP_020103186.1| pentatricopeptide repeat-containing protein ...    83   3e-15
ref|XP_010926893.1| PREDICTED: pentatricopeptide repeat-containi...    72   2e-11
gb|OAY74994.1| Pentatricopeptide repeat-containing protein, chlo...    72   2e-11
ref|XP_008807161.1| PREDICTED: pentatricopeptide repeat-containi...    70   5e-11
gb|PKU82244.1| Pentatricopeptide repeat-containing protein [Dend...    67   8e-10
ref|XP_020576653.1| pentatricopeptide repeat-containing protein ...    67   8e-10
ref|XP_020679014.1| pentatricopeptide repeat-containing protein ...    67   8e-10
emb|CDM83971.1| unnamed protein product [Triticum aestivum]            66   2e-09
gb|OVA07351.1| Pentatricopeptide repeat [Macleaya cordata]             63   2e-08
dbj|BAK07775.1| predicted protein [Hordeum vulgare subsp. vulgare]     62   3e-08
gb|ONK65740.1| uncharacterized protein A4U43_C06F440 [Asparagus ...    62   6e-08
ref|XP_020269344.1| LOW QUALITY PROTEIN: pentatricopeptide repea...    62   6e-08
gb|PAN29218.1| hypothetical protein PAHAL_E02151 [Panicum hallii...    60   1e-07
ref|XP_010275031.1| PREDICTED: pentatricopeptide repeat-containi...    60   3e-07
gb|AQK97280.1| Pentatricopeptide repeat-containing protein chlor...    59   5e-07
ref|XP_008656745.1| pentatricopeptide repeat-containing protein ...    59   5e-07
ref|XP_020418206.1| pentatricopeptide repeat-containing protein ...    58   1e-06
ref|XP_021823689.1| pentatricopeptide repeat-containing protein ...    58   1e-06

>ref|XP_009393287.1| PREDICTED: pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Musa acuminata subsp. malaccensis]
          Length = 1105

 Score =  126 bits (317), Expect = 1e-30
 Identities = 72/115 (62%), Positives = 81/115 (70%), Gaps = 1/115 (0%)
 Frame = -2

Query: 343 MILANGSIGALFYGNYEAVHVNSESKLSSFHG-LAVPKKPLLGGALYLRAVKKRWNPGAQ 167
           M+LAN  IG L    Y  +  NS S LSS HG L VP+KPL G AL LR + K WN  A 
Sbjct: 1   MMLANWPIGVLNSVTYWVLPENSHSNLSSSHGRLPVPQKPLFGRALSLR-LAKNWNFDAG 59

Query: 166 GTKCRICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSKGPGG 2
            TKCR+C AL +EE GS SVA+RF+EKE KFSPAF DYVKVLESVRV+RSK  GG
Sbjct: 60  STKCRLCCALASEEDGSSSVASRFIEKELKFSPAFSDYVKVLESVRVDRSKDSGG 114


>ref|XP_020103187.1| pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic isoform X2 [Ananas comosus]
          Length = 928

 Score = 82.8 bits (203), Expect = 3e-15
 Identities = 44/113 (38%), Positives = 62/113 (54%)
 Frame = -2

Query: 340 ILANGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQGT 161
           ++ANG +G L  G       NS       HG  +P++P+LG  +    V+K W    +  
Sbjct: 1   MMANGHLGPLILGKNGIFPQNSPPNPCIPHGFLIPRRPILGITMDCMLVRKSWVFDGRAP 60

Query: 160 KCRICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSKGPGG 2
           K R+  AL + EA + S   R LEKE KFSP F DYVK++ESV+ +RS+   G
Sbjct: 61  KFRVVSALASGEAEAPSADLRSLEKELKFSPTFSDYVKIMESVKSDRSRDSDG 113


>ref|XP_020103186.1| pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic isoform X1 [Ananas comosus]
          Length = 943

 Score = 82.8 bits (203), Expect = 3e-15
 Identities = 44/113 (38%), Positives = 62/113 (54%)
 Frame = -2

Query: 340 ILANGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQGT 161
           ++ANG +G L  G       NS       HG  +P++P+LG  +    V+K W    +  
Sbjct: 1   MMANGHLGPLILGKNGIFPQNSPPNPCIPHGFLIPRRPILGITMDCMLVRKSWVFDGRAP 60

Query: 160 KCRICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSKGPGG 2
           K R+  AL + EA + S   R LEKE KFSP F DYVK++ESV+ +RS+   G
Sbjct: 61  KFRVVSALASGEAEAPSADLRSLEKELKFSPTFSDYVKIMESVKSDRSRDSDG 113


>ref|XP_010926893.1| PREDICTED: pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Elaeis guineensis]
          Length = 865

 Score = 71.6 bits (174), Expect = 2e-11
 Identities = 40/109 (36%), Positives = 54/109 (49%)
 Frame = -2

Query: 343 MILANGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQG 164
           M + NG I              S     S HG  +P +P+    L    V K W+   + 
Sbjct: 1   MTMTNGQIDVFNLKGNRVFFSGSSLNSCSGHGFLIPGRPVSSITLNSLRVTKSWDFDVRK 60

Query: 163 TKCRICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERS 17
            K R+  AL N E  + S   R LE EFKF+P F+DYVKV+ESV+++RS
Sbjct: 61  PKLRVANALANGEIEAHSTPGRLLEDEFKFNPTFNDYVKVMESVKMDRS 109


>gb|OAY74994.1| Pentatricopeptide repeat-containing protein, chloroplastic [Ananas
           comosus]
          Length = 903

 Score = 71.6 bits (174), Expect = 2e-11
 Identities = 42/113 (37%), Positives = 57/113 (50%)
 Frame = -2

Query: 340 ILANGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQGT 161
           ++ANG +G L  G       NS       HG  +P++P+LG  +    V+K W       
Sbjct: 1   MMANGHLGPLILGKNGIFPQNSPPNPCIPHGFLIPRRPILGITMDCMLVRKSWV------ 54

Query: 160 KCRICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSKGPGG 2
                G   N EA + S   R LEKE KFSP F DYVK++ESV+ +RS+   G
Sbjct: 55  ---FDGRAPNGEAEAPSADLRSLEKELKFSPTFSDYVKIMESVKSDRSRDSDG 104


>ref|XP_008807161.1| PREDICTED: pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Phoenix dactylifera]
          Length = 865

 Score = 70.5 bits (171), Expect = 5e-11
 Identities = 39/106 (36%), Positives = 53/106 (50%)
 Frame = -2

Query: 331 NGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQGTKCR 152
           NG IG             S     S HG  +  +P+    L    V K W+   +  K R
Sbjct: 9   NGQIGVFNLKGNRVFFSGSPLNSCSGHGFLISGRPVSSNTLNSLRVTKSWDFDVRKPKLR 68

Query: 151 ICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSK 14
           +  AL N E  + S  +R LE EFKFSP F+DYVKV+ESV++ R++
Sbjct: 69  VANALANGEIEAPSTPSRLLENEFKFSPTFNDYVKVMESVKMGRNQ 114


>gb|PKU82244.1| Pentatricopeptide repeat-containing protein [Dendrobium catenatum]
          Length = 737

 Score = 67.0 bits (162), Expect = 8e-10
 Identities = 40/110 (36%), Positives = 59/110 (53%)
 Frame = -2

Query: 343 MILANGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQG 164
           M++A    G    G      +NS    S  HG+++ +KP+ G        +KR     +G
Sbjct: 1   MVVAYLQFGFSSLGASSFPILNSRLNSSRIHGISIFQKPVPGMHSASAGARKRCVFYVKG 60

Query: 163 TKCRICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSK 14
            K  I  +L + E  + S ++ FLEKEFKF P FD YVKVLES++ +RS+
Sbjct: 61  PKIGIINSLPDGEVETTSTSSEFLEKEFKFIPTFDKYVKVLESIKTDRSR 110


>ref|XP_020576653.1| pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Phalaenopsis equestris]
          Length = 1025

 Score = 67.0 bits (162), Expect = 8e-10
 Identities = 37/88 (42%), Positives = 59/88 (67%), Gaps = 2/88 (2%)
 Frame = -2

Query: 271 SKLSSF--HGLAVPKKPLLGGALYLRAVKKRWNPGAQGTKCRICGALGNEEAGSGSVATR 98
           ++LSSF  HG+++ +KP++G  +     +K     A+ +K  I  AL + E  + S +++
Sbjct: 18  NRLSSFYTHGVSIFQKPVIGMDVGCAGARKSSVFYAKRSKIGIINALPDGEVEAPSSSSQ 77

Query: 97  FLEKEFKFSPAFDDYVKVLESVRVERSK 14
           F EKEFKF+P FD+YV+VLESVR +RS+
Sbjct: 78  FFEKEFKFTPTFDEYVRVLESVRTDRSR 105


>ref|XP_020679014.1| pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Dendrobium catenatum]
          Length = 1055

 Score = 67.0 bits (162), Expect = 8e-10
 Identities = 40/110 (36%), Positives = 59/110 (53%)
 Frame = -2

Query: 343 MILANGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQG 164
           M++A    G    G      +NS    S  HG+++ +KP+ G        +KR     +G
Sbjct: 1   MVVAYLQFGFSSLGASSFPILNSRLNSSRIHGISIFQKPVPGMHSASAGARKRCVFYVKG 60

Query: 163 TKCRICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSK 14
            K  I  +L + E  + S ++ FLEKEFKF P FD YVKVLES++ +RS+
Sbjct: 61  PKIGIINSLPDGEVETTSTSSEFLEKEFKFIPTFDKYVKVLESIKTDRSR 110


>emb|CDM83971.1| unnamed protein product [Triticum aestivum]
          Length = 979

 Score = 65.9 bits (159), Expect = 2e-09
 Identities = 42/110 (38%), Positives = 59/110 (53%)
 Frame = -2

Query: 331 NGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQGTKCR 152
           N S+G L  G    +  +   K S+ HG  VP++ +    L    V++     A     R
Sbjct: 7   NASMGLLNLGGCGVLLPSLPPKSSAGHGFLVPRRDVSASPLSWGLVRRGRFLDAG---FR 63

Query: 151 ICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSKGPGG 2
             GAL + EAG+GS   R +EKE  FSP F DYVK++ESV+++RSK   G
Sbjct: 64  AAGALASGEAGAGSSELRHIEKELTFSPTFTDYVKMMESVKLDRSKSLQG 113


>gb|OVA07351.1| Pentatricopeptide repeat [Macleaya cordata]
          Length = 1055

 Score = 63.2 bits (152), Expect = 2e-08
 Identities = 34/100 (34%), Positives = 52/100 (52%), Gaps = 2/100 (2%)
 Frame = -2

Query: 295 EAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQGTKCRICGAL--GNEEA 122
           E ++  +    +S +G  V  +P+ G +L  R +K+ W    +   CR   AL  G  + 
Sbjct: 18  EILYPKNYQNPASLNGFLVSGRPICGISLNARRMKQSWFFSVRSPNCRTINALSKGEYDD 77

Query: 121 GSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSKGPGG 2
              + + + LEKEFKF P FD+Y+K +ESVR  R   P G
Sbjct: 78  NGSTNSGKVLEKEFKFQPTFDEYLKAMESVRTYRENNPIG 117


>dbj|BAK07775.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 969

 Score = 62.4 bits (150), Expect = 3e-08
 Identities = 41/114 (35%), Positives = 60/114 (52%), Gaps = 1/114 (0%)
 Frame = -2

Query: 340 ILANGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKK-RWNPGAQG 164
           +  N S+G L  G    +  + +   S+ HG  VP++ +    L    V++ R      G
Sbjct: 4   VAPNASMGLLNLGGCGVLLPSLQPNSSAGHGFLVPRRDVSALPLSWGLVRRGRVLDAGFG 63

Query: 163 TKCRICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSKGPGG 2
                 GAL + EAG+GS   R +EKE  FSP F DYVK++ESV+++RSK   G
Sbjct: 64  A----AGALASGEAGAGSSELRHIEKELTFSPTFTDYVKMMESVKLDRSKSLQG 113


>gb|ONK65740.1| uncharacterized protein A4U43_C06F440 [Asparagus officinalis]
          Length = 1015

 Score = 61.6 bits (148), Expect = 6e-08
 Identities = 33/77 (42%), Positives = 45/77 (58%)
 Frame = -2

Query: 238 PKKPLLGGALYLRAVKKRWNPGAQGTKCRICGALGNEEAGSGSVATRFLEKEFKFSPAFD 59
           P K   G AL    +K+ +N   +  K     ALG+ E  + S   +F+EKE KFSP+F 
Sbjct: 63  PSKVGFGTALSCIDIKEIFNFDGEKPKFCTISALGSGEVDARSTTVQFVEKELKFSPSFQ 122

Query: 58  DYVKVLESVRVERSKGP 8
           DY+ V+ESVR +RSK P
Sbjct: 123 DYLNVMESVRTDRSKNP 139


>ref|XP_020269344.1| LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein
           At1g30610, chloroplastic [Asparagus officinalis]
          Length = 1029

 Score = 61.6 bits (148), Expect = 6e-08
 Identities = 33/77 (42%), Positives = 45/77 (58%)
 Frame = -2

Query: 238 PKKPLLGGALYLRAVKKRWNPGAQGTKCRICGALGNEEAGSGSVATRFLEKEFKFSPAFD 59
           P K   G AL    +K+ +N   +  K     ALG+ E  + S   +F+EKE KFSP+F 
Sbjct: 63  PSKVGFGTALSCIDIKEIFNFDGEKPKFCTISALGSGEVDARSTTVQFVEKELKFSPSFQ 122

Query: 58  DYVKVLESVRVERSKGP 8
           DY+ V+ESVR +RSK P
Sbjct: 123 DYLNVMESVRTDRSKNP 139


>gb|PAN29218.1| hypothetical protein PAHAL_E02151 [Panicum hallii]
 gb|PAN29219.1| hypothetical protein PAHAL_E02151 [Panicum hallii]
 gb|PAN29220.1| hypothetical protein PAHAL_E02151 [Panicum hallii]
 gb|PAN29221.1| hypothetical protein PAHAL_E02151 [Panicum hallii]
          Length = 974

 Score = 60.5 bits (145), Expect = 1e-07
 Identities = 40/110 (36%), Positives = 55/110 (50%)
 Frame = -2

Query: 331 NGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQGTKCR 152
           N S+G L  G   A+    +   S   G  VP + +    L     +KR     +    R
Sbjct: 6   NASMGMLNMGGCGALLPTPQPNSSQGRGFLVPGRSVSVLPLRWGLARKR----GRVLDSR 61

Query: 151 ICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSKGPGG 2
             GA+   EAG+GS   R +EKE  FSP F DYVK++ESV+++RSK   G
Sbjct: 62  TDGAVAGGEAGAGSSDLRHIEKELTFSPTFTDYVKIMESVKLDRSKNLHG 111


>ref|XP_010275031.1| PREDICTED: pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Nelumbo nucifera]
          Length = 1042

 Score = 59.7 bits (143), Expect = 3e-07
 Identities = 36/106 (33%), Positives = 54/106 (50%)
 Frame = -2

Query: 346 EMILANGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQ 167
           E+I+ +G +G           +N     SS  G  V +KP+   AL  + V +      +
Sbjct: 2   EVIVTSGFMGVSSLEGSGLFFLNYFQNPSSSLGFPVSRKPISKVALNTKKVSRIELFSLR 61

Query: 166 GTKCRICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVR 29
            + CRI   L   EA + S +  FLEKEF+F P FD+Y+K +ESV+
Sbjct: 62  ASSCRIMNVLSEGEATNRSSSDGFLEKEFRFQPTFDEYLKAMESVK 107


>gb|AQK97280.1| Pentatricopeptide repeat-containing protein chloroplastic [Zea
           mays]
          Length = 512

 Score = 58.9 bits (141), Expect = 5e-07
 Identities = 37/111 (33%), Positives = 57/111 (51%), Gaps = 1/111 (0%)
 Frame = -2

Query: 331 NGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKR-WNPGAQGTKC 155
           N S+G L  G   A+    +       G ++P++ +    L     +KR W   ++    
Sbjct: 6   NASMGPLSLGGCGALLSTPQPNSWHCRGFSIPERSVFMLPLRRGLSRKRGWVLDSRSN-- 63

Query: 154 RICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSKGPGG 2
              GA+  +E G+GS   R +EKE  FSP F DYVK++ESV+++RSK   G
Sbjct: 64  ---GAVARDEVGAGSSELRHIEKELTFSPTFTDYVKIMESVKLDRSKNLHG 111


>ref|XP_008656745.1| pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Zea mays]
 ref|XP_008656746.1| pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Zea mays]
 ref|XP_008656747.1| pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Zea mays]
 gb|AQK97278.1| Pentatricopeptide repeat-containing protein chloroplastic [Zea
           mays]
 gb|AQK97279.1| Pentatricopeptide repeat-containing protein chloroplastic [Zea
           mays]
          Length = 956

 Score = 58.9 bits (141), Expect = 5e-07
 Identities = 37/111 (33%), Positives = 57/111 (51%), Gaps = 1/111 (0%)
 Frame = -2

Query: 331 NGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKR-WNPGAQGTKC 155
           N S+G L  G   A+    +       G ++P++ +    L     +KR W   ++    
Sbjct: 6   NASMGPLSLGGCGALLSTPQPNSWHCRGFSIPERSVFMLPLRRGLSRKRGWVLDSRSN-- 63

Query: 154 RICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSKGPGG 2
              GA+  +E G+GS   R +EKE  FSP F DYVK++ESV+++RSK   G
Sbjct: 64  ---GAVARDEVGAGSSELRHIEKELTFSPTFTDYVKIMESVKLDRSKNLHG 111


>ref|XP_020418206.1| pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Prunus persica]
 gb|ONI11321.1| hypothetical protein PRUPE_4G101500 [Prunus persica]
          Length = 902

 Score = 57.8 bits (138), Expect = 1e-06
 Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 1/111 (0%)
 Frame = -2

Query: 343 MILANGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQG 164
           MI+ N  +G   +   +    N  SK     G ++ ++P+    LY + VKK    G + 
Sbjct: 4   MIMTNAQLGVSNFQRNDIFVANCSSKPGPLSGFSLFRRPIFCVGLYEKNVKKNRGFGIKI 63

Query: 163 TKCR-ICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSK 14
              R +  A+  E + + SV    LEKEF+F P+FD Y+KV+ +VR+   +
Sbjct: 64  PNRRTVISAVSKEGSDNRSVGGEILEKEFEFKPSFDQYLKVMGTVRLRSDR 114


>ref|XP_021823689.1| pentatricopeptide repeat-containing protein At1g30610,
           chloroplastic [Prunus avium]
          Length = 903

 Score = 57.8 bits (138), Expect = 1e-06
 Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 1/111 (0%)
 Frame = -2

Query: 343 MILANGSIGALFYGNYEAVHVNSESKLSSFHGLAVPKKPLLGGALYLRAVKKRWNPGAQG 164
           MI+ N  +G   +   +    N  SK     G ++ ++P+    LY + VKK    G + 
Sbjct: 4   MIVTNAQLGVSNFQRNDIFAANCSSKPGPLSGFSLFRRPIFCVGLYEKNVKKNRGFGIKI 63

Query: 163 TKCR-ICGALGNEEAGSGSVATRFLEKEFKFSPAFDDYVKVLESVRVERSK 14
              R +  A+  E + + SV    LEKEF+F P+FD Y+KV+ +VR+   +
Sbjct: 64  PNRRTVISAVSKEGSDNRSVGGEILEKEFEFKPSFDQYLKVMGTVRLRSDR 114


Top