BLASTX nr result

ID: Mentha27_contig00025886 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00025886
         (1112 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU33203.1| hypothetical protein MIMGU_mgv1a020021mg, partial...   340   6e-91
ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containi...   330   7e-88
ref|XP_007025241.1| Pentatricopeptide repeat superfamily protein...   327   4e-87
emb|CBI30774.3| unnamed protein product [Vitis vinifera]              327   4e-87
ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containi...   326   9e-87
ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containi...   325   3e-86
ref|XP_007211999.1| hypothetical protein PRUPE_ppa011078mg [Prun...   324   4e-86
ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citr...   322   2e-85
ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containi...   322   2e-85
ref|XP_002526919.1| conserved hypothetical protein [Ricinus comm...   317   4e-84
ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containi...   315   2e-83
ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containi...   313   8e-83
gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis]     313   1e-82
ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containi...   301   2e-79
gb|ACU23441.1| unknown [Glycine max]                                  301   3e-79
ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [A...   296   1e-77
ref|XP_007147960.1| hypothetical protein PHAVU_006G168700g [Phas...   296   1e-77
ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arab...   293   7e-77
ref|NP_001031667.1| pentatricopeptide repeat-containing protein ...   291   4e-76
ref|NP_567571.1| pentatricopeptide repeat-containing protein [Ar...   291   4e-76

>gb|EYU33203.1| hypothetical protein MIMGU_mgv1a020021mg, partial [Mimulus
           guttatus]
          Length = 209

 Score =  340 bits (872), Expect = 6e-91
 Identities = 161/203 (79%), Positives = 180/203 (88%)
 Frame = -3

Query: 789 KLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEFPLI 610
           K++RKSGKKEHHLWQ+RD AGSGHKA NLVR IC LPNEK AVY ALDEWIAWETEFPLI
Sbjct: 1   KVIRKSGKKEHHLWQKRDSAGSGHKALNLVRTICRLPNEKEAVYGALDEWIAWETEFPLI 60

Query: 609 AAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMI 430
           AAAKAL ILRKR+ WKRIIQV KWMLSKGQGATMSTYD LLLAFDMD R D+A++LW M+
Sbjct: 61  AAAKALRILRKRNHWKRIIQVGKWMLSKGQGATMSTYDSLLLAFDMDGRLDDAEILWNMV 120

Query: 429 LQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEALGQ 250
           LQ + RS+ K +FSRMISLYDHHNLP+ V+EVFADMEEL VKPDEDTVRR+ARAFEALGQ
Sbjct: 121 LQTYNRSLPKMIFSRMISLYDHHNLPDKVIEVFADMEELEVKPDEDTVRRVARAFEALGQ 180

Query: 249 EDKQTQFLKKYQRRWKYIHFKGE 181
           ++K+   +KKYQ +WKYIHFKGE
Sbjct: 181 KEKERLVMKKYQSKWKYIHFKGE 203


>ref|XP_004232997.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like [Solanum lycopersicum]
          Length = 265

 Score =  330 bits (846), Expect = 7e-88
 Identities = 167/264 (63%), Positives = 203/264 (76%)
 Frame = -3

Query: 972 NLSQKYYASSLLIKASNFSEATHQASGGEKFIQLSKAIASPQHCRGSQKEQPLVAPTAVE 793
           +L   +++ ++L+K  N       ++G    + +S A+   +H +  Q E  L    A +
Sbjct: 4   SLQFHFFSCNILLKGIN-------STGLSDKLNVSSAL---KHSK-KQGELSLTISDAAD 52

Query: 792 KKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEFPL 613
           +K V+K+GK EHHLW++R+ AGSG KA NLVRII GLPNEK +VY ALD+WIAWETEFPL
Sbjct: 53  QKKVQKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWETEFPL 112

Query: 612 IAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEM 433
           IAAAKAL ILR++  WKR+IQVAKWMLSKGQGATM+TYD LLLAFDMD R DEA+ LW M
Sbjct: 113 IAAAKALRILRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNM 172

Query: 432 ILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEALG 253
           IL    RS+SKRLFSRMISLYDHH++P+ +VEVFADMEELGVKPDEDTVRR+ARAF+ LG
Sbjct: 173 ILHTSTRSVSKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVRRVARAFQMLG 232

Query: 252 QEDKQTQFLKKYQRRWKYIHFKGE 181
           QED Q   LKKYQ RWKY+HF GE
Sbjct: 233 QEDNQKLVLKKYQSRWKYVHFNGE 256


>ref|XP_007025241.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao]
           gi|508780607|gb|EOY27863.1| Pentatricopeptide repeat
           superfamily protein [Theobroma cacao]
          Length = 276

 Score =  327 bits (839), Expect = 4e-87
 Identities = 161/221 (72%), Positives = 180/221 (81%)
 Frame = -3

Query: 843 CRGSQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAA 664
           C     EQ L    AVEKK V+K GK EHHLW++RD AGSG KA NLVRII  LPNEK A
Sbjct: 44  CSQKLGEQSLGISEAVEKKPVKKVGKNEHHLWKKRDSAGSGQKALNLVRIISQLPNEKEA 103

Query: 663 VYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLL 484
           VY ALD+W AWETEFPLIAAAKAL ILRKRSQW R+IQVAKWMLSKGQGATM TYD LLL
Sbjct: 104 VYGALDKWTAWETEFPLIAAAKALRILRKRSQWLRVIQVAKWMLSKGQGATMGTYDTLLL 163

Query: 483 AFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVK 304
           AFDMD+R DEA+ LW MIL  H RSISKRLFSRMISLYDHHN+ + ++EVFADMEEL V+
Sbjct: 164 AFDMDKRVDEAESLWNMILHIHTRSISKRLFSRMISLYDHHNMQDKIIEVFADMEELCVR 223

Query: 303 PDEDTVRRIARAFEALGQEDKQTQFLKKYQRRWKYIHFKGE 181
           PDE+TVR++ARAF+ LGQEDKQ   L++Y  +WKYIHF GE
Sbjct: 224 PDENTVRKVARAFQKLGQEDKQKLVLRRYLSKWKYIHFNGE 264


>emb|CBI30774.3| unnamed protein product [Vitis vinifera]
          Length = 277

 Score =  327 bits (839), Expect = 4e-87
 Identities = 159/207 (76%), Positives = 177/207 (85%)
 Frame = -3

Query: 801 AVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETE 622
           AVEK++ +K GKKEHHLW++RD  GSG KA NLVRI+  LPNEK AVY ALD+W AWETE
Sbjct: 58  AVEKEISKKVGKKEHHLWRKRDSIGSGQKALNLVRIVSELPNEKEAVYGALDKWTAWETE 117

Query: 621 FPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKML 442
           FPLIAAAKAL ILRKR+QWKR+IQVAKWMLSKGQGATM TYD LLLAFDMD R DEA+ L
Sbjct: 118 FPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDWRVDEAESL 177

Query: 441 WEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFE 262
           W MIL  H RSISK+LFSRMISLYDHH++ + V+EVFADMEELGVKPDEDTVRR+A AF+
Sbjct: 178 WNMILHTHTRSISKQLFSRMISLYDHHDMRDKVIEVFADMEELGVKPDEDTVRRVACAFQ 237

Query: 261 ALGQEDKQTQFLKKYQRRWKYIHFKGE 181
            LGQEDKQ   LKKYQ +WKYIHF GE
Sbjct: 238 TLGQEDKQKLVLKKYQCKWKYIHFNGE 264


>ref|XP_004164255.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like [Cucumis sativus]
          Length = 270

 Score =  326 bits (836), Expect = 9e-87
 Identities = 155/221 (70%), Positives = 182/221 (82%)
 Frame = -3

Query: 843 CRGSQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAA 664
           C  +Q  QPL + T  E+++V+K GK+ HHLW++RD AGSG KA NLVRI+   PNEK A
Sbjct: 36  CVHNQAAQPLTSFTTPERRVVKKVGKETHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEA 95

Query: 663 VYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLL 484
           VY  L++WIAWETEFPLIAAAKAL ILRKRSQWKR+IQVAKWMLSKGQGATM TYD LLL
Sbjct: 96  VYGELNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLL 155

Query: 483 AFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVK 304
           AFDMD+R DEA+ LW MIL  H RSISKR+FSRMISLY+HH+L + ++E+FADMEELGVK
Sbjct: 156 AFDMDKRVDEAESLWNMILHTHTRSISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVK 215

Query: 303 PDEDTVRRIARAFEALGQEDKQTQFLKKYQRRWKYIHFKGE 181
           PDEDTVRR+ RAF+ LGQED +    K+Y  +WKYIHFKGE
Sbjct: 216 PDEDTVRRVCRAFQKLGQEDNRKMVYKRYSCQWKYIHFKGE 256


>ref|XP_006355563.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X1 [Solanum tuberosum]
           gi|565378234|ref|XP_006355564.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X2 [Solanum tuberosum]
           gi|565378236|ref|XP_006355565.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 265

 Score =  325 bits (832), Expect = 3e-86
 Identities = 163/264 (61%), Positives = 199/264 (75%)
 Frame = -3

Query: 972 NLSQKYYASSLLIKASNFSEATHQASGGEKFIQLSKAIASPQHCRGSQKEQPLVAPTAVE 793
           +L   +++ ++L+K  N       ++G    + ++ A+   +     Q E  L      +
Sbjct: 4   SLQFHFFSCNILLKGIN-------STGLSDKLNVTSALKDSK----KQGELSLTISDTAD 52

Query: 792 KKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEFPL 613
           +K V+K+GK EHHLW++R+ AGSG KA NLVRII GLPNEK +VY ALD+WIAWE EFPL
Sbjct: 53  QKKVQKAGKVEHHLWKKRESAGSGQKALNLVRIISGLPNEKESVYGALDKWIAWEAEFPL 112

Query: 612 IAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEM 433
           IAAAKAL ILR++  WKR+IQVAKWMLSKGQGATM+TYD LLLAFDMD R DEA+ LW M
Sbjct: 113 IAAAKALRILRQQRLWKRVIQVAKWMLSKGQGATMATYDALLLAFDMDNRVDEAETLWNM 172

Query: 432 ILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEALG 253
           IL    RS+SKRLFSRMISLYDHH++P+ +VEVFADMEELGVKPDEDTV R+ARAF+ LG
Sbjct: 173 ILHTSTRSVSKRLFSRMISLYDHHHVPDKIVEVFADMEELGVKPDEDTVGRVARAFQMLG 232

Query: 252 QEDKQTQFLKKYQRRWKYIHFKGE 181
           QEDKQ   LKKYQ RWKY+HF GE
Sbjct: 233 QEDKQKLVLKKYQSRWKYVHFNGE 256


>ref|XP_007211999.1| hypothetical protein PRUPE_ppa011078mg [Prunus persica]
           gi|462407864|gb|EMJ13198.1| hypothetical protein
           PRUPE_ppa011078mg [Prunus persica]
          Length = 224

 Score =  324 bits (831), Expect = 4e-86
 Identities = 154/204 (75%), Positives = 174/204 (85%)
 Frame = -3

Query: 792 KKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEFPL 613
           +K ++K G+KEHHLWQ+RD AGSG KA NLVRI+ GLPNEK  VY ALD+W AWETEFPL
Sbjct: 4   RKTIKKVGRKEHHLWQKRDSAGSGQKALNLVRIVSGLPNEKETVYGALDKWTAWETEFPL 63

Query: 612 IAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEM 433
           IAA KAL ILRKRSQW R+IQVAKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW M
Sbjct: 64  IAAVKALRILRKRSQWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDQRVDEAESLWNM 123

Query: 432 ILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEALG 253
           IL  H RSISKRLFSRMISLYDHH+    ++EVFADMEELGVKPDEDTVRR+ARAF+ LG
Sbjct: 124 ILHTHTRSISKRLFSRMISLYDHHDKQNKIIEVFADMEELGVKPDEDTVRRVARAFKELG 183

Query: 252 QEDKQTQFLKKYQRRWKYIHFKGE 181
           QE+ +T  L++YQ +WKYIHFKGE
Sbjct: 184 QEENKTLVLRRYQCKWKYIHFKGE 207


>ref|XP_006449586.1| hypothetical protein CICLE_v10016169mg [Citrus clementina]
           gi|557552197|gb|ESR62826.1| hypothetical protein
           CICLE_v10016169mg [Citrus clementina]
          Length = 284

 Score =  322 bits (824), Expect = 2e-85
 Identities = 164/246 (66%), Positives = 187/246 (76%), Gaps = 5/246 (2%)
 Frame = -3

Query: 903 QASGGEKFIQLSKAIASPQ-HCRGSQKEQPLVAPT----AVEKKLVRKSGKKEHHLWQRR 739
           Q   G   +    A ++P   C  +Q + P VA +    + + KLV K GKKE HLWQ+R
Sbjct: 23  QTPSGFSLLTTKLATSNPHLKCFLNQNKLPPVANSNANASKKNKLVVKVGKKEQHLWQKR 82

Query: 738 DQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEFPLIAAAKALGILRKRSQWKR 559
           D AGSG KA NLVRI+  LPNEK AVY ALD+W AWETEFPLIAAAKAL ILRKR QW R
Sbjct: 83  DSAGSGQKALNLVRIVSELPNEKHAVYGALDKWTAWETEFPLIAAAKALRILRKRGQWLR 142

Query: 558 IIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMI 379
           +IQVAKWMLSKGQGATM TYD LLLAFD D R DEA+ LW MIL  H RSISKRLFSRMI
Sbjct: 143 VIQVAKWMLSKGQGATMGTYDTLLLAFDKDHRADEAESLWNMILHTHTRSISKRLFSRMI 202

Query: 378 SLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEALGQEDKQTQFLKKYQRRWKY 199
           SLYDHH++P  ++EVFADMEELGV+PDEDTVRRIA AF+ +GQ++KQ   LKKY  +WKY
Sbjct: 203 SLYDHHDMPNKIIEVFADMEELGVRPDEDTVRRIASAFQRVGQDEKQKLVLKKYLSKWKY 262

Query: 198 IHFKGE 181
           IHFKGE
Sbjct: 263 IHFKGE 268


>ref|XP_004294568.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 300

 Score =  322 bits (824), Expect = 2e-85
 Identities = 150/221 (67%), Positives = 180/221 (81%)
 Frame = -3

Query: 843 CRGSQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAA 664
           C   Q  Q ++A  A+EKK+++K+G+ EHHLW+++D AGSG KA NL+RI+  LPNEK A
Sbjct: 63  CCQKQSRQTVMASKAMEKKIIKKAGRNEHHLWKKKDSAGSGQKALNLIRIVSDLPNEKEA 122

Query: 663 VYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLL 484
           ++ ALD+W AWETEFPLIAAAKAL ILR+  QW+R+IQVAKWMLSKGQGATM+TYD LLL
Sbjct: 123 IFGALDKWTAWETEFPLIAAAKALRILRRTCQWRRVIQVAKWMLSKGQGATMATYDTLLL 182

Query: 483 AFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVK 304
           AFDMD R DEA+ LW MIL  H RSISKRLFSRMISLYDHH +   ++EVFADMEEL V+
Sbjct: 183 AFDMDNRLDEAESLWNMILHTHTRSISKRLFSRMISLYDHHEMKTKIIEVFADMEELSVR 242

Query: 303 PDEDTVRRIARAFEALGQEDKQTQFLKKYQRRWKYIHFKGE 181
           PDEDTVRR+ARAF+  GQEDK    L++Y  +WKYIHFKGE
Sbjct: 243 PDEDTVRRVARAFQEFGQEDKSKLVLRRYGCKWKYIHFKGE 283


>ref|XP_002526919.1| conserved hypothetical protein [Ricinus communis]
           gi|223533738|gb|EEF35472.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 224

 Score =  317 bits (813), Expect = 4e-84
 Identities = 149/204 (73%), Positives = 171/204 (83%)
 Frame = -3

Query: 792 KKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEFPL 613
           +K V+K+GK+EHHLW++RD A SG KA +LVRI+C LP+EK  VY ALD+W AWETEFPL
Sbjct: 10  RKPVKKAGKEEHHLWKKRDSARSGEKALSLVRIVCELPDEKECVYGALDKWTAWETEFPL 69

Query: 612 IAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEM 433
           IA AK L ILRK +QW R+IQVAKWMLSKGQG TM TYD LLLAFDMD R DEA  LW M
Sbjct: 70  IAVAKGLRILRKHNQWLRVIQVAKWMLSKGQGTTMGTYDTLLLAFDMDNRVDEAASLWNM 129

Query: 432 ILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEALG 253
           IL  H RSISKRLFSRMISLYDHHN+P+ ++E+FADMEELGV+PDEDTVRR+ARAF+ LG
Sbjct: 130 ILHTHVRSISKRLFSRMISLYDHHNMPDGIIEIFADMEELGVRPDEDTVRRVARAFKELG 189

Query: 252 QEDKQTQFLKKYQRRWKYIHFKGE 181
           QE+KQ   LK+Y  RWKYIHFKGE
Sbjct: 190 QEEKQKLVLKRYMSRWKYIHFKGE 213


>ref|XP_004499754.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X5 [Cicer arietinum]
          Length = 287

 Score =  315 bits (807), Expect = 2e-83
 Identities = 158/251 (62%), Positives = 184/251 (73%), Gaps = 5/251 (1%)
 Frame = -3

Query: 918 SEATHQASGGEKFIQLSKAIASPQHCRGSQ----KEQPLVA-PTAVEKKLVRKSGKKEHH 754
           SEAT     G KF+  S  I+    C   +    K  P V  P   +KK  +  GK EHH
Sbjct: 25  SEATRVFLSGNKFLTTSITISRKTSCTSCRFVQSKSSPNVGRPVEKDKKGNKIKGKVEHH 84

Query: 753 LWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEFPLIAAAKALGILRKR 574
           LW+RR+ A SG KA  LVR IC LPNEK +VY ALD+W AWETEFPL+AAAKAL ILRKR
Sbjct: 85  LWKRRNSAQSGQKALTLVRTICELPNEKESVYGALDKWTAWETEFPLVAAAKALNILRKR 144

Query: 573 SQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRL 394
            QW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW MI+ AH RS+SKRL
Sbjct: 145 GQWVRVIQLAKWMLSKGQGATMGTYDTLLLAFDMDQRIDEAESLWNMIIHAHMRSVSKRL 204

Query: 393 FSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEALGQEDKQTQFLKKYQ 214
           FSRMISLYDHHNL E +VE+FADMEEL +KPDEDTVR++  AF  LGQE+K+   +K+Y 
Sbjct: 205 FSRMISLYDHHNLSEKIVEIFADMEELRIKPDEDTVRKVTNAFRKLGQEEKRKSVIKRYG 264

Query: 213 RRWKYIHFKGE 181
            +WKYIHF GE
Sbjct: 265 LKWKYIHFNGE 275


>ref|XP_006467567.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like [Citrus sinensis]
          Length = 281

 Score =  313 bits (802), Expect = 8e-83
 Identities = 163/246 (66%), Positives = 184/246 (74%), Gaps = 5/246 (2%)
 Frame = -3

Query: 903 QASGGEKFIQLSKAIASPQ-HCRGSQKEQPLV----APTAVEKKLVRKSGKKEHHLWQRR 739
           Q + G   +    A ++P   C  +Q +QP V    A  + + KLV K GKKE HLWQ+R
Sbjct: 23  QTASGFSLLTTKLATSNPHLKCFLNQNKQPPVSNSNANASKKNKLVVKVGKKEQHLWQKR 82

Query: 738 DQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEFPLIAAAKALGILRKRSQWKR 559
           D AGSG KA NLV     LPNEK AVY ALD+W AWETEFPLIAAAKAL ILRKR QW R
Sbjct: 83  DSAGSGQKALNLVS---ELPNEKHAVYGALDKWTAWETEFPLIAAAKALRILRKRGQWLR 139

Query: 558 IIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMI 379
           +IQVAKWMLSKGQGATM TYD LLLAFD D R DEA+ LW MIL    RSISKRLFSRMI
Sbjct: 140 VIQVAKWMLSKGQGATMGTYDTLLLAFDKDHRADEAESLWNMILHTQTRSISKRLFSRMI 199

Query: 378 SLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEALGQEDKQTQFLKKYQRRWKY 199
           SLYDHH++P  ++EVFADMEELGV+PDEDTVRRIA AF+ +GQ+DKQ   LKKY  +WKY
Sbjct: 200 SLYDHHDMPNKIIEVFADMEELGVRPDEDTVRRIASAFQRVGQDDKQKLVLKKYLSKWKY 259

Query: 198 IHFKGE 181
           IHFKGE
Sbjct: 260 IHFKGE 265


>gb|EXB58283.1| hypothetical protein L484_015617 [Morus notabilis]
          Length = 326

 Score =  313 bits (801), Expect = 1e-82
 Identities = 153/239 (64%), Positives = 184/239 (76%), Gaps = 14/239 (5%)
 Frame = -3

Query: 855 SPQHCRGSQKEQPLVAPTAVEK--------------KLVRKSGKKEHHLWQRRDQAGSGH 718
           S  +C      +PL +  A+EK               LV+K+GKKE+HLW+++D AGSG 
Sbjct: 71  SHHNCSIKGNGEPLTSSKAIEKLQRLCIEFLYMEFRNLVKKTGKKEYHLWKKKDSAGSGQ 130

Query: 717 KAQNLVRIICGLPNEKAAVYSALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKW 538
           KA NL+RI+  LPNEK  VY AL++WIAWETEFPLIAAAKAL ILRKRSQWKR+IQVAKW
Sbjct: 131 KALNLIRILSVLPNEKEVVYGALNKWIAWETEFPLIAAAKALRILRKRSQWKRVIQVAKW 190

Query: 537 MLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHN 358
           MLSKGQG TM TYD LLLAFDMD+R DEA+  W MIL  H RSISKRLFSRMI+LYDHH+
Sbjct: 191 MLSKGQGTTMGTYDTLLLAFDMDQRVDEAESFWNMILHTHKRSISKRLFSRMIALYDHHD 250

Query: 357 LPENVVEVFADMEELGVKPDEDTVRRIARAFEALGQEDKQTQFLKKYQRRWKYIHFKGE 181
           + + ++EVFADMEEL V+ DEDTVRR+A AF+ LGQE+K+   L+KYQ +WKY+HFKGE
Sbjct: 251 VKDKIIEVFADMEELSVRLDEDTVRRVAYAFQKLGQEEKKKLLLRKYQCKWKYVHFKGE 309


>ref|XP_003546058.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X1 [Glycine max]
           gi|571517206|ref|XP_006597502.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g18975,
           chloroplastic-like isoform X2 [Glycine max]
          Length = 288

 Score =  301 bits (772), Expect = 2e-79
 Identities = 143/206 (69%), Positives = 168/206 (81%)
 Frame = -3

Query: 798 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEF 619
           +EKK  + +GKKEHHLW+ RD A SG KA  LVR +  LPNEK AVY ALD+W AWETEF
Sbjct: 67  MEKKGKKTTGKKEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVYGALDKWTAWETEF 126

Query: 618 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 439
           P+IA +KAL ILRKR  W R+IQVAKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW
Sbjct: 127 PVIAVSKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLW 186

Query: 438 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEA 259
            MI+ AH RS+SKRLFSRMISLYDHHN+P+ +++VFADMEEL +KPDEDTVRR+ARAF  
Sbjct: 187 NMIIHAHMRSVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPDEDTVRRVARAFRE 246

Query: 258 LGQEDKQTQFLKKYQRRWKYIHFKGE 181
           LG E+K+   +K+Y  +WKYIHF GE
Sbjct: 247 LGDEEKRKLVIKQYGLKWKYIHFNGE 272


>gb|ACU23441.1| unknown [Glycine max]
          Length = 288

 Score =  301 bits (771), Expect = 3e-79
 Identities = 143/206 (69%), Positives = 168/206 (81%)
 Frame = -3

Query: 798 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEF 619
           +EKK  + +GKKEHHLW+ RD A SG KA  LVR +  LPNEK AVY ALD+W AWETEF
Sbjct: 67  MEKKGKKTTGKKEHHLWKSRDSAQSGQKALALVRTVYKLPNEKEAVYGALDKWTAWETEF 126

Query: 618 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 439
           P+IA +KAL ILRKR  W R+IQVAKWMLSKGQGATM TYD LLLAFDMD+R DEA+ LW
Sbjct: 127 PVIAVSKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLW 186

Query: 438 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEA 259
            MI+ AH RS+SKRLFSRMISLYDHHN+P+ +++VFADMEEL +KPDEDTVRR+ARAF  
Sbjct: 187 NMIIHAHLRSVSKRLFSRMISLYDHHNMPDKIIDVFADMEELRLKPDEDTVRRVARAFRE 246

Query: 258 LGQEDKQTQFLKKYQRRWKYIHFKGE 181
           LG E+K+   +K+Y  +WKYIHF GE
Sbjct: 247 LGDEEKRKLVIKQYGLKWKYIHFNGE 272


>ref|XP_006848146.1| hypothetical protein AMTR_s00029p00227910 [Amborella trichopoda]
           gi|548851451|gb|ERN09727.1| hypothetical protein
           AMTR_s00029p00227910 [Amborella trichopoda]
          Length = 287

 Score =  296 bits (758), Expect = 1e-77
 Identities = 145/218 (66%), Positives = 174/218 (79%)
 Frame = -3

Query: 834 SQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYS 655
           S+K  PL+ P   + K   K  KKEHHLW +RD AGS  KA NLVRI+  + NEK A+Y 
Sbjct: 61  SRKIAPLITPVDEKPK---KLFKKEHHLWMKRDSAGSSQKALNLVRIVSRVSNEKEAIYV 117

Query: 654 ALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFD 475
           ALDEW AWETEFP+IAAAKALGILRKR +W R+IQV+KW+LSKGQ  TM TYD LLLAFD
Sbjct: 118 ALDEWAAWETEFPVIAAAKALGILRKRRRWLRVIQVSKWLLSKGQVLTMGTYDTLLLAFD 177

Query: 474 MDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDE 295
           MD R DEA+ +W MIL  + RSISKRLFSRM+SLYDHH++P+ ++EVFADMEELGVKPD+
Sbjct: 178 MDGRVDEAETIWNMILHTYTRSISKRLFSRMMSLYDHHHIPDKLLEVFADMEELGVKPDQ 237

Query: 294 DTVRRIARAFEALGQEDKQTQFLKKYQRRWKYIHFKGE 181
           D+VRR+ARAF+ LG+E+KQ Q L+KY  + KYIHF GE
Sbjct: 238 DSVRRVARAFQQLGEEEKQKQVLQKYGLKLKYIHFNGE 275


>ref|XP_007147960.1| hypothetical protein PHAVU_006G168700g [Phaseolus vulgaris]
           gi|561021183|gb|ESW19954.1| hypothetical protein
           PHAVU_006G168700g [Phaseolus vulgaris]
          Length = 289

 Score =  296 bits (757), Expect = 1e-77
 Identities = 141/205 (68%), Positives = 168/205 (81%)
 Frame = -3

Query: 795 EKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEFP 616
           +KK  + +GKKEHHLW+ RD A SG KA  LVRI+  LPNEK AVY ALD+WIAWETEFP
Sbjct: 69  KKKGKKTTGKKEHHLWKSRDSAQSGQKALTLVRIVSKLPNEKEAVYGALDKWIAWETEFP 128

Query: 615 LIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLWE 436
           +IAAAKAL ILRKR  W R+IQVAKWMLSKGQGATM T+D LLLAFDMD+R DEA+ LW 
Sbjct: 129 VIAAAKALKILRKRGHWVRVIQVAKWMLSKGQGATMGTFDTLLLAFDMDQRVDEAESLWN 188

Query: 435 MILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEAL 256
           MI+  H RS+SKRLFSRMIS+YD+H++P+ ++EVFADMEEL VKPDEDTVRR+ARAF  L
Sbjct: 189 MIIHTHMRSVSKRLFSRMISIYDNHDMPDKIIEVFADMEELRVKPDEDTVRRVARAFTEL 248

Query: 255 GQEDKQTQFLKKYQRRWKYIHFKGE 181
           G+E+K+    ++Y  +WKYIHF  E
Sbjct: 249 GEEEKRKLVARRYGIKWKYIHFNRE 273


>ref|XP_002867967.1| hypothetical protein ARALYDRAFT_914772 [Arabidopsis lyrata subsp.
           lyrata] gi|297313803|gb|EFH44226.1| hypothetical protein
           ARALYDRAFT_914772 [Arabidopsis lyrata subsp. lyrata]
          Length = 284

 Score =  293 bits (751), Expect = 7e-77
 Identities = 140/218 (64%), Positives = 172/218 (78%)
 Frame = -3

Query: 834 SQKEQPLVAPTAVEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYS 655
           S+K+   +    V    ++K GKKEHHLW++ D AGSG KA NLVR++ GLPNEK AVY 
Sbjct: 55  SEKQAGKLDVATVNSNEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYG 114

Query: 654 ALDEWIAWETEFPLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFD 475
           AL++W+AWE EFP+IAAAKAL ILRKRSQW R+IQ+AKWMLSKGQGATM TYD LLLAFD
Sbjct: 115 ALNKWVAWEVEFPIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDTLLLAFD 174

Query: 474 MDRRPDEAKMLWEMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDE 295
           MD+R DEA+ LW MIL  H RSI +RLF+RMI+LY H++L + V+EVFADMEEL V+PDE
Sbjct: 175 MDQRADEAESLWNMILHTHTRSIPRRLFARMIALYAHYDLHDKVIEVFADMEELKVRPDE 234

Query: 294 DTVRRIARAFEALGQEDKQTQFLKKYQRRWKYIHFKGE 181
           DT RR+ARAF  LGQE+ +   L++Y   +KYI+F GE
Sbjct: 235 DTARRVARAFRELGQEENRKLILRRYLSEFKYIYFNGE 272


>ref|NP_001031667.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|332658716|gb|AEE84116.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 260

 Score =  291 bits (744), Expect = 4e-76
 Identities = 138/206 (66%), Positives = 165/206 (80%)
 Frame = -3

Query: 798 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEF 619
           V  K ++K GKKEHHLW++ D AGSG KA NLVR++ GLPNEK AVY AL++W+AWE EF
Sbjct: 43  VNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEF 102

Query: 618 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 439
           P+IAAAKAL ILRKRSQW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD R DEA+ LW
Sbjct: 103 PIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLW 162

Query: 438 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEA 259
            MIL  H RSI +RLF+RMI+LY HH+L + V+EVFADMEEL V PDED+ RR+ARAF  
Sbjct: 163 NMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRE 222

Query: 258 LGQEDKQTQFLKKYQRRWKYIHFKGE 181
           L QE+ +   L++Y   +KYI+F GE
Sbjct: 223 LNQEENRKLILRRYLSEYKYIYFNGE 248


>ref|NP_567571.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|186512032|ref|NP_001119009.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|334186688|ref|NP_001190768.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|223635632|sp|Q2V3H0.2|PP322_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g18975, chloroplastic; Flags: Precursor
           gi|332658715|gb|AEE84115.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|332658717|gb|AEE84117.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|332658718|gb|AEE84118.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 287

 Score =  291 bits (744), Expect = 4e-76
 Identities = 138/206 (66%), Positives = 165/206 (80%)
 Frame = -3

Query: 798 VEKKLVRKSGKKEHHLWQRRDQAGSGHKAQNLVRIICGLPNEKAAVYSALDEWIAWETEF 619
           V  K ++K GKKEHHLW++ D AGSG KA NLVR++ GLPNEK AVY AL++W+AWE EF
Sbjct: 70  VNSKEIKKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEF 129

Query: 618 PLIAAAKALGILRKRSQWKRIIQVAKWMLSKGQGATMSTYDHLLLAFDMDRRPDEAKMLW 439
           P+IAAAKAL ILRKRSQW R+IQ+AKWMLSKGQGATM TYD LLLAFDMD R DEA+ LW
Sbjct: 130 PIIAAAKALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLW 189

Query: 438 EMILQAHGRSISKRLFSRMISLYDHHNLPENVVEVFADMEELGVKPDEDTVRRIARAFEA 259
            MIL  H RSI +RLF+RMI+LY HH+L + V+EVFADMEEL V PDED+ RR+ARAF  
Sbjct: 190 NMILHTHTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRE 249

Query: 258 LGQEDKQTQFLKKYQRRWKYIHFKGE 181
           L QE+ +   L++Y   +KYI+F GE
Sbjct: 250 LNQEENRKLILRRYLSEYKYIYFNGE 275


Top