BLASTX nr result

ID: Cocculus22_contig00021600 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00021600
         (324 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006424118.1| hypothetical protein CICLE_v10028449mg [Citr...   164   2e-38
ref|XP_006481538.1| PREDICTED: pentatricopeptide repeat-containi...   162   3e-38
ref|XP_004293118.1| PREDICTED: pentatricopeptide repeat-containi...   162   3e-38
ref|XP_004139593.1| PREDICTED: pentatricopeptide repeat-containi...   162   3e-38
ref|XP_004159118.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   159   3e-37
ref|XP_007015694.1| Tetratricopeptide repeat (TPR)-like superfam...   158   9e-37
ref|XP_004487456.1| PREDICTED: pentatricopeptide repeat-containi...   155   7e-36
ref|XP_002274209.2| PREDICTED: pentatricopeptide repeat-containi...   155   7e-36
ref|XP_006417992.1| hypothetical protein EUTSA_v10009524mg [Eutr...   152   5e-35
gb|AEP33754.1| chloroplast biogenesis 19, partial [Nasturtium of...   151   8e-35
gb|EXC37761.1| hypothetical protein L484_001219 [Morus notabilis]     150   1e-34
gb|AEP33751.1| chloroplast biogenesis 19, partial [Lepidium virg...   149   4e-34
gb|AEP33749.1| chloroplast biogenesis 19, partial [Crucihimalaya...   149   4e-34
gb|AEP33746.1| chloroplast biogenesis 19, partial [Barbarea verna]    149   4e-34
ref|XP_004251424.1| PREDICTED: pentatricopeptide repeat-containi...   149   5e-34
gb|AEP33747.1| chloroplast biogenesis 19, partial [Brassica oler...   148   7e-34
ref|NP_172066.3| pentatricopeptide repeat protein PDE247 [Arabid...   148   9e-34
ref|XP_002523876.1| pentatricopeptide repeat-containing protein,...   148   9e-34
ref|XP_006303598.1| hypothetical protein CARUB_v10011161mg [Caps...   147   1e-33
gb|AEP33750.1| chloroplast biogenesis 19, partial [Lepidium sati...   147   1e-33

>ref|XP_006424118.1| hypothetical protein CICLE_v10028449mg [Citrus clementina]
           gi|557526052|gb|ESR37358.1| hypothetical protein
           CICLE_v10028449mg [Citrus clementina]
          Length = 445

 Score =  164 bits (414), Expect = 2e-38
 Identities = 73/108 (67%), Positives = 89/108 (82%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF +N+RVCN+LID+YSRCGCI+FA Q F  M+KR LVSWNSII+GFA+NG+  EALE+
Sbjct: 174 QDFKDNVRVCNTLIDLYSRCGCIEFARQVFQRMHKRTLVSWNSIIVGFAVNGFVGEALEY 233

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F  MQ EGF+PDGVSFTGALTACSHAG +E GL  +  MK  Y++SPR
Sbjct: 234 FNSMQKEGFKPDGVSFTGALTACSHAGLIEDGLRYFDIMKKIYRVSPR 281



 Score = 78.6 bits (192), Expect = 9e-13
 Identities = 41/99 (41%), Positives = 61/99 (61%), Gaps = 1/99 (1%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++ID Y R G I+ A + F+ M  RN +SW +++ GFA  GY EEALE F  MQ+ G E
Sbjct: 83  NAMIDGYMRNGDIESAVKMFDEMPVRNAISWTALLNGFAKRGYFEEALECFREMQISGVE 142

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK-TMKNAYKISPR 324
           PD V+    L AC++ G +  GL +++  +K  +K + R
Sbjct: 143 PDYVTIISVLNACANVGMLGIGLWIHRFVLKQDFKDNVR 181


>ref|XP_006481538.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Citrus sinensis]
          Length = 509

 Score =  162 bits (411), Expect = 3e-38
 Identities = 72/108 (66%), Positives = 89/108 (82%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF +N++VCN+LID+YSRCGCI+FA Q F  M+KR LVSWNSII+GFA+NG+  EALE+
Sbjct: 244 QDFKDNVKVCNTLIDLYSRCGCIEFARQVFQRMHKRTLVSWNSIIVGFAVNGFVGEALEY 303

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F  MQ EGF+PDGVSFTGALTACSHAG +E GL  +  MK  Y++SPR
Sbjct: 304 FNSMQKEGFKPDGVSFTGALTACSHAGLIEDGLRYFDIMKKIYRVSPR 351



 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 38/95 (40%), Positives = 58/95 (61%), Gaps = 1/95 (1%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++ID Y R G I+ A + F+ M  R+ +SW +++ GF   GY EEALE F  MQ+ G E
Sbjct: 153 NAMIDGYMRRGDIESAVRMFDEMPVRDAISWTALLNGFVKRGYFEEALECFREMQISGVE 212

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK-TMKNAYK 312
           PD V+    L AC++ G +  GL +++  +K  +K
Sbjct: 213 PDYVTIISVLNACANVGTLGIGLWIHRYVLKQDFK 247


>ref|XP_004293118.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 504

 Score =  162 bits (411), Expect = 3e-38
 Identities = 75/106 (70%), Positives = 88/106 (83%)
 Frame = +1

Query: 7   FSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFY 186
           F  NIR+ NSLIDMYSRCGCI FA Q F NM  R LVSWNS+I+GFA+NG+AEEALE F+
Sbjct: 246 FRHNIRISNSLIDMYSRCGCIDFARQVFGNMPNRTLVSWNSMIVGFAVNGHAEEALEFFH 305

Query: 187 RMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           +MQ EGF+PDGVSFTGALTACSHAG V++GLH +  MK  +KI+PR
Sbjct: 306 QMQKEGFKPDGVSFTGALTACSHAGLVDEGLHFFDKMKRIHKITPR 351



 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 35/83 (42%), Positives = 49/83 (59%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N+LID Y + G ++ A + F+ M KR+ VSW ++I GF      E+ALE F  MQV G E
Sbjct: 153 NTLIDGYMKMGNVRDAVEVFDEMPKRDAVSWTTLIGGFVKKRRYEDALEWFREMQVSGVE 212

Query: 211 PDGVSFTGALTACSHAGFVEKGL 279
           PD V+    + AC+  G +  GL
Sbjct: 213 PDYVTIIAVIAACADLGTLGLGL 235


>ref|XP_004139593.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Cucumis sativus]
          Length = 525

 Score =  162 bits (411), Expect = 3e-38
 Identities = 74/108 (68%), Positives = 91/108 (84%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           ++F +NI++ NSLIDMYSRCGCI+FA Q F  M KR LVSWNSII+GFA+NG+A+E+LE 
Sbjct: 256 QEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVGFAVNGFADESLEF 315

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           FY MQ EGF+PDGVS+TGALTACSHAG V KGL L+  MK+ +KI+PR
Sbjct: 316 FYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKITPR 363



 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 32/87 (36%), Positives = 53/87 (60%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++++ + R G I+ A Q F+ M  R+ +SW ++I G   +GY+E+ALE F++MQ  G  
Sbjct: 165 NTMLNGFMRNGEIELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVA 224

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK 291
            D VS    L AC+  G +  GL +++
Sbjct: 225 ADYVSIIAVLAACADLGALTLGLWVHR 251


>ref|XP_004159118.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At1g05750, chloroplastic-like [Cucumis sativus]
          Length = 525

 Score =  159 bits (403), Expect = 3e-37
 Identities = 73/108 (67%), Positives = 90/108 (83%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           ++F +NI++ NSLIDMYSRCGCI+FA Q F  M KR LVSWNSII+GFA+NG+A+E+LE 
Sbjct: 256 QEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVGFAVNGFADESLEF 315

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F  MQ EGF+PDGVS+TGALTACSHAG V KGL L+  MK+ +KI+PR
Sbjct: 316 FXAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKITPR 363



 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 32/87 (36%), Positives = 53/87 (60%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++++ + R G I+ A Q F+ M  R+ +SW ++I G   +GY+E+ALE F++MQ  G  
Sbjct: 165 NTMLNGFMRNGEIELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVA 224

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK 291
            D VS    L AC+  G +  GL +++
Sbjct: 225 ADYVSIIAVLAACADLGALTLGLWVHR 251


>ref|XP_007015694.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
           cacao] gi|508786057|gb|EOY33313.1| Tetratricopeptide
           repeat (TPR)-like superfamily protein [Theobroma cacao]
          Length = 509

 Score =  158 bits (399), Expect = 9e-37
 Identities = 73/108 (67%), Positives = 89/108 (82%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           + F +N+RV NSLIDMYSRCGCI+ A + F+ M KR LVSWNSII+GFA+NG+AEEAL++
Sbjct: 244 QSFRDNVRVNNSLIDMYSRCGCIELAREVFDKMQKRTLVSWNSIIVGFAVNGFAEEALKY 303

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F  MQ EGF+PDGVSFTGALTACSHAG V++GL  +  MK  Y+ISPR
Sbjct: 304 FDSMQKEGFKPDGVSFTGALTACSHAGLVDEGLRYFGIMKRVYRISPR 351



 Score = 70.1 bits (170), Expect = 3e-10
 Identities = 36/105 (34%), Positives = 65/105 (61%), Gaps = 1/105 (0%)
 Frame = +1

Query: 13  ENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRM 192
           +N+   N+++D Y R G  + A + F+ M +R+++SW ++I GFA  G+ EEAL+ F  M
Sbjct: 147 KNLVSWNTMVDGYMRNGEYEKAVEIFDEMPQRDVISWTALINGFARRGFHEEALDWFREM 206

Query: 193 QVEGFEPDGVSFTGALTACSHAGFVEKGLHLYK-TMKNAYKISPR 324
            + G +PD V     LTAC++ G +  GL +++  +K +++ + R
Sbjct: 207 MIFGVKPDYVVIIAVLTACANLGALGVGLWIHRFVLKQSFRDNVR 251


>ref|XP_004487456.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Cicer arietinum]
          Length = 512

 Score =  155 bits (391), Expect = 7e-36
 Identities = 70/108 (64%), Positives = 89/108 (82%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           K+F +N++V NSLIDMY+RCGCI FA Q F+ M++RNLVSWNSIIIGFA+NG+A+EAL  
Sbjct: 247 KEFRDNVKVSNSLIDMYARCGCIGFARQVFDGMSQRNLVSWNSIIIGFAVNGHADEALSF 306

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           FY M+ EGFEPDGVS+TGALTACSHAG +++GL ++  MK   +  PR
Sbjct: 307 FYSMKKEGFEPDGVSYTGALTACSHAGLIDEGLKIFANMKKVSRNLPR 354



 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 36/100 (36%), Positives = 58/100 (58%), Gaps = 1/100 (1%)
 Frame = +1

Query: 16  NIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQ 195
           N+   N++I  Y + G I+ A + F+ M  +N VSW SII GF      EEA+E F  MQ
Sbjct: 151 NLVSWNTMIGGYMKNGEIEDALKLFDEMPMKNAVSWTSIIGGFVKRDCHEEAVECFREMQ 210

Query: 196 VEGFEPDGVSFTGALTACSHAGFVEKGLHLYK-TMKNAYK 312
           ++G  PD V+    ++AC++ G +  GL +++  MK  ++
Sbjct: 211 LDGVVPDYVTVIAVISACANLGALGLGLWVHRFVMKKEFR 250


>ref|XP_002274209.2| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Vitis vinifera]
          Length = 518

 Score =  155 bits (391), Expect = 7e-36
 Identities = 72/108 (66%), Positives = 87/108 (80%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF +NI++ NSLIDMYSRCGCI+ A Q F  M KR+LVSWNS+I+GFA+NG+AEEALE 
Sbjct: 253 QDFKDNIKISNSLIDMYSRCGCIRLARQVFEQMPKRSLVSWNSMIVGFALNGHAEEALEF 312

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F  M+ EGF PDGVSFTGALTACSH+G V++GL  +  MK   KISPR
Sbjct: 313 FNLMRKEGFRPDGVSFTGALTACSHSGLVDEGLQFFDIMKRTRKISPR 360



 Score = 70.1 bits (170), Expect = 3e-10
 Identities = 45/133 (33%), Positives = 64/133 (48%), Gaps = 32/133 (24%)
 Frame = +1

Query: 10  SENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWN--------------------- 126
           +EN+ V  +L+DMYS+CG +  A+  F+ M+ RN VSWN                     
Sbjct: 124 TENVMVGTALVDMYSKCGQLDLAWLMFDEMHVRNSVSWNTMIDGCMRNGEVGEAIVLFDQ 183

Query: 127 ----------SIIIGFAINGYAEEALEHFYRMQVEGFEPDGVSFTGALTACSHAGFVEKG 276
                     S+I GF   G  E+ALE F  MQ+ G EPD V+    L AC++ G +  G
Sbjct: 184 MSERDAISWTSMIGGFVKKGCFEQALEWFREMQLAGVEPDYVTIISVLAACANLGALGLG 243

Query: 277 LHLYK-TMKNAYK 312
           L + +  MK  +K
Sbjct: 244 LWINRFVMKQDFK 256


>ref|XP_006417992.1| hypothetical protein EUTSA_v10009524mg [Eutrema salsugineum]
           gi|557095763|gb|ESQ36345.1| hypothetical protein
           EUTSA_v10009524mg [Eutrema salsugineum]
          Length = 500

 Score =  152 bits (384), Expect = 5e-35
 Identities = 68/107 (63%), Positives = 86/107 (80%)
 Frame = +1

Query: 4   DFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHF 183
           DF  N+RV NSLID+Y RCGC++FA Q F+ M KR +VSWNS+I+GFA NG A+E+L +F
Sbjct: 242 DFKNNVRVSNSLIDLYCRCGCVEFARQVFDKMEKRTVVSWNSVIVGFAANGNADESLVYF 301

Query: 184 YRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
            +MQ EGF+PD V+FTGALTACSH G VE+GL  ++TMK  Y+ISPR
Sbjct: 302 RKMQEEGFKPDAVTFTGALTACSHVGLVEEGLRYFQTMKRDYRISPR 348



 Score = 71.6 bits (174), Expect = 1e-10
 Identities = 38/99 (38%), Positives = 60/99 (60%), Gaps = 1/99 (1%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++ID Y R G +  A + F+ M  R+L+SW +++ GF   G+ EEAL  F  MQ+ G E
Sbjct: 150 NTMIDGYMRNGQVYDAVKMFDEMPDRDLISWTAMMNGFVKKGFHEEALAWFREMQISGVE 209

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK-TMKNAYKISPR 324
           PD V+   AL AC++ G +  GL +++  M + +K + R
Sbjct: 210 PDYVAIIAALAACTNLGALSFGLWVHRYVMSHDFKNNVR 248


>gb|AEP33754.1| chloroplast biogenesis 19, partial [Nasturtium officinale]
          Length = 447

 Score =  151 bits (382), Expect = 8e-35
 Identities = 68/108 (62%), Positives = 86/108 (79%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF  N+RV NSLID+Y RCGC++FA Q F+ M KR +VSWNS+I+GFA NG A E+L +
Sbjct: 182 QDFKNNVRVSNSLIDLYCRCGCVEFARQVFDKMEKRTVVSWNSVIVGFAANGNAHESLFY 241

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F +MQ EGF+PD V+FTGALTACSH G VE+GL  ++TMK  Y+ISPR
Sbjct: 242 FRKMQEEGFKPDAVTFTGALTACSHVGLVEEGLRYFQTMKRDYRISPR 289



 Score = 72.8 bits (177), Expect = 5e-11
 Identities = 34/87 (39%), Positives = 54/87 (62%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++ID Y R G +  A + F+ M  R+L+SW +++ GF   G+ EEAL  F  MQ+ G +
Sbjct: 91  NTMIDGYMRSGQVNTAVKLFDEMLNRDLISWTAMVNGFVKKGFHEEALSWFREMQISGVK 150

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK 291
           PD V+   AL AC++ G +  GL +++
Sbjct: 151 PDYVAIIAALAACTNLGALSFGLWIHR 177


>gb|EXC37761.1| hypothetical protein L484_001219 [Morus notabilis]
          Length = 508

 Score =  150 bits (380), Expect = 1e-34
 Identities = 71/108 (65%), Positives = 86/108 (79%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           + F +N+++ NSLIDMYSRCGCI+FA Q F  M  R LVSWNSII+GFA+NG+AEEAL+ 
Sbjct: 246 RKFKDNVKISNSLIDMYSRCGCIEFARQVFERMPNRTLVSWNSIIVGFAVNGHAEEALKF 305

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F  MQ EGF+PDGVSFTGALTACSHAG VE+GL L++ MK  + I  R
Sbjct: 306 FNLMQREGFKPDGVSFTGALTACSHAGLVEEGLLLFENMKRVHGIRHR 353



 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 37/91 (40%), Positives = 52/91 (57%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++ID Y R G ++ A + F+ M +R+ VSW ++I GF      EEALE F  MQV   E
Sbjct: 155 NTMIDGYMRNGKVRDAVEVFDEMPERDAVSWTALIGGFVKRRRFEEALEWFREMQVSSVE 214

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYKTMKN 303
           PD V+    L AC+  G V  GL + + + N
Sbjct: 215 PDYVTVIAVLAACADLGTVGLGLWMNRFIMN 245


>gb|AEP33751.1| chloroplast biogenesis 19, partial [Lepidium virginicum]
          Length = 485

 Score =  149 bits (376), Expect = 4e-34
 Identities = 67/108 (62%), Positives = 85/108 (78%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF  N+RV NSLID+Y RCGC++FA Q F+ M KR +VSWNS+I+GFA NG A E+L +
Sbjct: 220 QDFRNNVRVSNSLIDLYCRCGCVEFARQVFDKMEKRTVVSWNSVIVGFAANGNANESLVY 279

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F +MQ EGF PDGV+FTGALTACSH G VE+G   ++ MK+ Y+ISPR
Sbjct: 280 FRKMQREGFTPDGVTFTGALTACSHVGLVEEGFQYFQMMKHDYRISPR 327



 Score = 70.1 bits (170), Expect = 3e-10
 Identities = 34/87 (39%), Positives = 52/87 (59%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++ID Y R G +  A   F+ M  R+L+SW ++I GF   G+ EEAL  F  MQ+ G  
Sbjct: 129 NTMIDGYMRNGQVDNAVDVFDKMPDRDLISWTAMITGFVKKGFHEEALAWFREMQISGVN 188

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK 291
           PD V+   A+ AC++ G +  GL +++
Sbjct: 189 PDYVAIISAVAACTNLGALSFGLWVHR 215


>gb|AEP33749.1| chloroplast biogenesis 19, partial [Crucihimalaya wallichii]
          Length = 491

 Score =  149 bits (376), Expect = 4e-34
 Identities = 67/108 (62%), Positives = 85/108 (78%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF  N+RV NSLID+Y RCGC++FA + F+ M KR +VSWNS+I+GFA NG A E+L +
Sbjct: 226 QDFKNNVRVSNSLIDLYCRCGCVEFAREVFDKMEKRTVVSWNSVIVGFAANGNAHESLVY 285

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F +MQ EGF+PD V+FTGALTACSH G VE+GL  ++TMK  Y ISPR
Sbjct: 286 FRKMQEEGFKPDAVTFTGALTACSHVGLVEEGLRYFQTMKRDYGISPR 333



 Score = 74.7 bits (182), Expect = 1e-11
 Identities = 39/102 (38%), Positives = 61/102 (59%), Gaps = 2/102 (1%)
 Frame = +1

Query: 4   DFSENIR--VCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALE 177
           D+ E+I     N++ID Y R G +  A + F+ M +R+L+SW ++I GF   G+ EEAL 
Sbjct: 124 DYMEDINSVTWNTMIDGYMRSGQVDNAVKMFDKMPERDLISWTAMINGFVKKGFHEEALV 183

Query: 178 HFYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKN 303
            F  MQ+ G  PD V+   AL AC++ G +  GL +++ + N
Sbjct: 184 WFREMQISGVRPDYVAIIAALNACTNLGALSFGLWVHRYVMN 225


>gb|AEP33746.1| chloroplast biogenesis 19, partial [Barbarea verna]
          Length = 494

 Score =  149 bits (376), Expect = 4e-34
 Identities = 67/108 (62%), Positives = 86/108 (79%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF  NIRV NSLID+Y RCGC++FA + F+ M KR +VSWNS+I+GFA NG A E+L +
Sbjct: 229 QDFKNNIRVSNSLIDLYCRCGCVEFAREVFDKMEKRTVVSWNSVIVGFAANGNAHESLVY 288

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F +MQ EGF+PD V+FTGALTACSH G VE+GL  ++TMK  ++ISPR
Sbjct: 289 FRKMQEEGFKPDAVTFTGALTACSHVGLVEEGLRYFQTMKRDHRISPR 336



 Score = 77.8 bits (190), Expect = 1e-12
 Identities = 41/102 (40%), Positives = 61/102 (59%), Gaps = 2/102 (1%)
 Frame = +1

Query: 4   DFSE--NIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALE 177
           DF E  N    N++ID Y R G +  A + F+ M +R+L+SW ++I GF   G+ EEAL 
Sbjct: 127 DFMEDKNSVTWNTMIDGYMRSGQVNNAVKLFDEMPERDLISWTAMINGFVKKGFHEEALA 186

Query: 178 HFYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKN 303
            F  MQ+ G +PD V+   AL AC+H G +  GL +++ + N
Sbjct: 187 WFREMQISGVKPDYVAIIAALAACTHLGALSFGLWVHRYVMN 228


>ref|XP_004251424.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Solanum lycopersicum]
          Length = 507

 Score =  149 bits (375), Expect = 5e-34
 Identities = 67/108 (62%), Positives = 87/108 (80%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           ++F +N+RV NSLIDMY RCGC++ A Q F+ M  R+LVSWNSII+G A+NG+A +AL++
Sbjct: 243 REFKDNVRVNNSLIDMYCRCGCVELACQVFHRMTGRSLVSWNSIIVGLAVNGHAIDALQY 302

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F  MQ EGF+PDGV+FTG LTACSHAG VEKGL  +K MK  ++I+PR
Sbjct: 303 FDLMQNEGFQPDGVTFTGVLTACSHAGLVEKGLKYFKAMKRVHRITPR 350



 Score = 65.5 bits (158), Expect = 8e-09
 Identities = 32/87 (36%), Positives = 52/87 (59%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N+++D Y R G  K A + F+ +  R+++SW +++ GF  NG  EE L  F  MQ+ G E
Sbjct: 152 NTMVDGYMRNGDFKNAVKVFDEIPDRDVISWTALVGGFVKNGLFEEGLVWFREMQLSGVE 211

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK 291
           PD V+    L+AC++ G +   L L++
Sbjct: 212 PDYVTMISVLSACANLGTLGISLWLHR 238


>gb|AEP33747.1| chloroplast biogenesis 19, partial [Brassica oleracea]
          Length = 485

 Score =  148 bits (374), Expect = 7e-34
 Identities = 66/108 (61%), Positives = 85/108 (78%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF  N+RV NSLID+Y RCGC++FA Q F+ M KR +VSWNS+I+GFA NG+A E+L +
Sbjct: 224 QDFKNNVRVSNSLIDLYCRCGCVEFARQVFDEMEKRTVVSWNSVIVGFAANGHAHESLVY 283

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F RMQ E F+PD V+FTGALTACSH G VE+G+  ++ MK  Y+ISPR
Sbjct: 284 FRRMQEERFKPDAVTFTGALTACSHVGLVEEGVRYFEAMKRDYRISPR 331



 Score = 70.9 bits (172), Expect = 2e-10
 Identities = 36/87 (41%), Positives = 54/87 (62%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++ID Y R G +  A + F+ M +R+L+SW ++I GF   G  EEAL  F  MQV G +
Sbjct: 133 NTMIDGYMRSGRVDDAAKVFDEMPERDLISWTAMINGFVKKGLHEEALAWFREMQVSGVK 192

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK 291
           PD V+   AL AC++ G +  GL +++
Sbjct: 193 PDYVAVIAALAACANLGALSFGLWVHR 219


>ref|NP_172066.3| pentatricopeptide repeat protein PDE247 [Arabidopsis thaliana]
           gi|75191933|sp|Q9MA50.1|PPR13_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g05750, chloroplastic; AltName: Full=Protein PIGMENT
           DEFECTIVE 247; Flags: Precursor
           gi|6850304|gb|AAF29381.1|AC009999_1 Contains similarity
           to a hypothetical protein from Arabidopsis thaliana
           gb|AC007109.6, and contains two DUF17 PF|01535 domains
           [Arabidopsis thaliana] gi|62320576|dbj|BAD95203.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332189766|gb|AEE27887.1| pentatricopeptide repeat
           protein PDE247 [Arabidopsis thaliana]
          Length = 500

 Score =  148 bits (373), Expect = 9e-34
 Identities = 67/108 (62%), Positives = 85/108 (78%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF  N+RV NSLID+Y RCGC++FA Q F NM KR +VSWNS+I+GFA NG A E+L +
Sbjct: 235 QDFKNNVRVSNSLIDLYCRCGCVEFARQVFYNMEKRTVVSWNSVIVGFAANGNAHESLVY 294

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F +MQ +GF+PD V+FTGALTACSH G VE+GL  ++ MK  Y+ISPR
Sbjct: 295 FRKMQEKGFKPDAVTFTGALTACSHVGLVEEGLRYFQIMKCDYRISPR 342



 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 36/87 (41%), Positives = 55/87 (63%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++ID Y R G +  A + F+ M +R+L+SW ++I GF   GY EEAL  F  MQ+ G +
Sbjct: 144 NTMIDGYMRSGQVDNAAKMFDKMPERDLISWTAMINGFVKKGYQEEALLWFREMQISGVK 203

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK 291
           PD V+   AL AC++ G +  GL +++
Sbjct: 204 PDYVAIIAALNACTNLGALSFGLWVHR 230


>ref|XP_002523876.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223536964|gb|EEF38602.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 384

 Score =  148 bits (373), Expect = 9e-34
 Identities = 70/103 (67%), Positives = 83/103 (80%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           K+F  N+R+ NSLIDMYSRCGCI+ A Q F+ M KR LVSWNSII+GFA NG+AEEALE+
Sbjct: 258 KEFRNNVRIGNSLIDMYSRCGCIELARQVFHKMLKRTLVSWNSIIVGFAANGFAEEALEY 317

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAY 309
           F  MQ EGF+PDGVSFTGALTACSHAG V++GL  +  MK  +
Sbjct: 318 FGLMQKEGFKPDGVSFTGALTACSHAGMVDEGLKCFDIMKRHF 360



 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 37/123 (30%), Positives = 61/123 (49%), Gaps = 31/123 (25%)
 Frame = +1

Query: 16  NIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAING------------- 156
           N+ V  +L+DMY++CG ++ A   F+++  +N VSWN++I G+  NG             
Sbjct: 131 NVMVGTALVDMYAKCGKVQLARLIFDDLKVKNSVSWNTMIDGYMRNGETGSAMELFDEMP 190

Query: 157 ------------------YAEEALEHFYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLH 282
                             + E+ALE F  MQV   EPD V+    L+AC++ G +  GL 
Sbjct: 191 EKDAISWTVFIDGFIKKGHFEQALEWFREMQVSKVEPDYVTIIAVLSACANLGALGLGLW 250

Query: 283 LYK 291
           +++
Sbjct: 251 IHR 253


>ref|XP_006303598.1| hypothetical protein CARUB_v10011161mg [Capsella rubella]
           gi|482572309|gb|EOA36496.1| hypothetical protein
           CARUB_v10011161mg [Capsella rubella]
          Length = 506

 Score =  147 bits (372), Expect = 1e-33
 Identities = 65/108 (60%), Positives = 86/108 (79%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF  N++V NSLID+Y RCGC++FA + F+ M KR +VSWNS+I+GFA NG A E+L +
Sbjct: 241 QDFKNNVKVSNSLIDLYCRCGCVEFAREVFDKMEKRTVVSWNSVIVGFAANGNAHESLVY 300

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F +MQ EGF+PD V+FTGALTACSH G VE+GL  ++TMK  ++ISPR
Sbjct: 301 FRKMQEEGFKPDAVTFTGALTACSHVGLVEEGLRYFQTMKRNHRISPR 348



 Score = 69.7 bits (169), Expect = 4e-10
 Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 5/101 (4%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++I+ Y R G +  A + F+ M +R+ +SW ++I GF   G+ EEAL  F  MQ+ G +
Sbjct: 150 NTMINGYMRNGQVDNAVKMFDKMPERDFISWTAMINGFVKKGFHEEALAWFREMQISGVK 209

Query: 211 PDGVSFTGALTACSHAGFVEKGL--HLY---KTMKNAYKIS 318
           PD V+   AL AC++ G +  GL  H Y   +  KN  K+S
Sbjct: 210 PDYVAIIAALNACTNLGALSFGLWVHRYVMSQDFKNNVKVS 250


>gb|AEP33750.1| chloroplast biogenesis 19, partial [Lepidium sativum]
          Length = 494

 Score =  147 bits (372), Expect = 1e-33
 Identities = 66/108 (61%), Positives = 84/108 (77%)
 Frame = +1

Query: 1   KDFSENIRVCNSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEH 180
           +DF  N+RV NSLID+Y RCGC++FA Q F+ M KR +VSWNS+I+GFA NG A E+L +
Sbjct: 232 QDFRNNVRVSNSLIDLYCRCGCVEFARQVFDTMEKRTVVSWNSVIVGFAANGNANESLVY 291

Query: 181 FYRMQVEGFEPDGVSFTGALTACSHAGFVEKGLHLYKTMKNAYKISPR 324
           F +MQ EGF+PD V+FTGALTACSH G VE+G   ++ MK  Y+ISPR
Sbjct: 292 FRKMQEEGFKPDAVTFTGALTACSHVGLVEEGFQYFQMMKTDYRISPR 339



 Score = 71.6 bits (174), Expect = 1e-10
 Identities = 35/87 (40%), Positives = 53/87 (60%)
 Frame = +1

Query: 31  NSLIDMYSRCGCIKFAYQEFNNMNKRNLVSWNSIIIGFAINGYAEEALEHFYRMQVEGFE 210
           N++ID Y R G +  A + F+ M +R+L+SW ++I GF   G+ EEAL  F  MQ+ G  
Sbjct: 141 NTMIDGYMRNGQVDNAVKVFDEMPERDLISWTAMITGFVKKGFHEEALAWFREMQISGVN 200

Query: 211 PDGVSFTGALTACSHAGFVEKGLHLYK 291
           PD V+   AL AC++ G +  GL  ++
Sbjct: 201 PDYVAIIAALAACTNLGALSFGLWAHR 227

