BLASTX nr result

ID: Akebia24_contig00020475 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00020475
         (925 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274209.2| PREDICTED: pentatricopeptide repeat-containi...   365   2e-98
emb|CBI24422.3| unnamed protein product [Vitis vinifera]              365   2e-98
ref|XP_006481538.1| PREDICTED: pentatricopeptide repeat-containi...   355   2e-95
ref|XP_002523876.1| pentatricopeptide repeat-containing protein,...   353   4e-95
ref|XP_007015694.1| Tetratricopeptide repeat (TPR)-like superfam...   352   1e-94
gb|EXC37761.1| hypothetical protein L484_001219 [Morus notabilis]     346   7e-93
ref|XP_004293118.1| PREDICTED: pentatricopeptide repeat-containi...   345   2e-92
ref|XP_004251424.1| PREDICTED: pentatricopeptide repeat-containi...   340   6e-91
ref|XP_006340426.1| PREDICTED: pentatricopeptide repeat-containi...   338   1e-90
ref|XP_004159118.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   335   2e-89
ref|XP_004139593.1| PREDICTED: pentatricopeptide repeat-containi...   335   2e-89
ref|XP_006417992.1| hypothetical protein EUTSA_v10009524mg [Eutr...   330   4e-88
ref|XP_002889563.1| PDE247 [Arabidopsis lyrata subsp. lyrata] gi...   327   5e-87
ref|XP_003541961.1| PREDICTED: pentatricopeptide repeat-containi...   326   7e-87
ref|NP_172066.3| pentatricopeptide repeat protein PDE247 [Arabid...   324   3e-86
gb|AEP33749.1| chloroplast biogenesis 19, partial [Crucihimalaya...   322   1e-85
gb|AEP33748.1| chloroplast biogenesis 19, partial [Capsella burs...   322   1e-85
ref|XP_006303598.1| hypothetical protein CARUB_v10011161mg [Caps...   321   3e-85
gb|AEP33755.1| chloroplast biogenesis 19, partial [Olimarabidops...   319   9e-85
gb|AEP33750.1| chloroplast biogenesis 19, partial [Lepidium sati...   319   9e-85

>ref|XP_002274209.2| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Vitis vinifera]
          Length = 518

 Score =  365 bits (936), Expect = 2e-98
 Identities = 173/248 (69%), Positives = 208/248 (83%)
 Frame = -3

Query: 746 RSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFP 567
           RS+    ID  VSWTSSIA  CRNG L +AAAEF RM+++GV PNH+TF+TLLSAC DFP
Sbjct: 44  RSHTHSPIDPIVSWTSSIALHCRNGQLPEAAAEFSRMQIAGVRPNHITFLTLLSACTDFP 103

Query: 566 SKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNT 387
            + LRFG SIH Y+RKLG D  NV +GT+LV MYSKC  +DLA  +FDEM V+NS+SWNT
Sbjct: 104 LEGLRFGGSIHAYVRKLGLDTENVMVGTALVDMYSKCGQLDLAWLMFDEMHVRNSVSWNT 163

Query: 386 MIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPD 207
           MIDG MRNG+V +AI LFDQM +RD +SWT++IGGFVKKG FE+ALEWFREMQL+GVEPD
Sbjct: 164 MIDGCMRNGEVGEAIVLFDQMSERDAISWTSMIGGFVKKGCFEQALEWFREMQLAGVEPD 223

Query: 206 YVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFN 27
           YVTII+VL+A ANLGA+GLG+W++R+V++QDFK+NI++SNSLIDMYSRCGCI  ARQ F 
Sbjct: 224 YVTIISVLAACANLGALGLGLWINRFVMKQDFKDNIKISNSLIDMYSRCGCIRLARQVFE 283

Query: 26  NMQKRSLV 3
            M KRSLV
Sbjct: 284 QMPKRSLV 291



 Score =  114 bits (286), Expect = 4e-23
 Identities = 78/241 (32%), Positives = 121/241 (50%), Gaps = 2/241 (0%)
 Frame = -3

Query: 737 DDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKS 558
           D  S    +SWTS I    + G    A   F  M+L+GVEP++VT +++L+ACA+  +  
Sbjct: 182 DQMSERDAISWTSMIGGFVKKGCFEQALEWFREMQLAGVEPDYVTIISVLAACANLGALG 241

Query: 557 LRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMID 378
           L  G  I+ ++ K  F K+N+ +  SL+ MYS                            
Sbjct: 242 L--GLWINRFVMKQDF-KDNIKISNSLIDMYS---------------------------- 270

Query: 377 GYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVT 198
              R G +  A ++F+QMPKR  VSW ++I GF   GH EEALE+F  M+  G  PD V+
Sbjct: 271 ---RCGCIRLARQVFEQMPKRSLVSWNSMIVGFALNGHAEEALEFFNLMRKEGFRPDGVS 327

Query: 197 IIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSN--SLIDMYSRCGCIEFARQEFNN 24
               L+A ++ G +  G+     ++++  K + R+ +   L+D+YSR G +E A     N
Sbjct: 328 FTGALTACSHSGLVDEGLQFFD-IMKRTRKISPRIEHYGCLVDLYSRAGRLEDALNVIAN 386

Query: 23  M 21
           M
Sbjct: 387 M 387


>emb|CBI24422.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score =  365 bits (936), Expect = 2e-98
 Identities = 173/248 (69%), Positives = 208/248 (83%)
 Frame = -3

Query: 746 RSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFP 567
           RS+    ID  VSWTSSIA  CRNG L +AAAEF RM+++GV PNH+TF+TLLSAC DFP
Sbjct: 44  RSHTHSPIDPIVSWTSSIALHCRNGQLPEAAAEFSRMQIAGVRPNHITFLTLLSACTDFP 103

Query: 566 SKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNT 387
            + LRFG SIH Y+RKLG D  NV +GT+LV MYSKC  +DLA  +FDEM V+NS+SWNT
Sbjct: 104 LEGLRFGGSIHAYVRKLGLDTENVMVGTALVDMYSKCGQLDLAWLMFDEMHVRNSVSWNT 163

Query: 386 MIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPD 207
           MIDG MRNG+V +AI LFDQM +RD +SWT++IGGFVKKG FE+ALEWFREMQL+GVEPD
Sbjct: 164 MIDGCMRNGEVGEAIVLFDQMSERDAISWTSMIGGFVKKGCFEQALEWFREMQLAGVEPD 223

Query: 206 YVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFN 27
           YVTII+VL+A ANLGA+GLG+W++R+V++QDFK+NI++SNSLIDMYSRCGCI  ARQ F 
Sbjct: 224 YVTIISVLAACANLGALGLGLWINRFVMKQDFKDNIKISNSLIDMYSRCGCIRLARQVFE 283

Query: 26  NMQKRSLV 3
            M KRSLV
Sbjct: 284 QMPKRSLV 291



 Score =  110 bits (274), Expect = 1e-21
 Identities = 71/232 (30%), Positives = 121/232 (52%), Gaps = 24/232 (10%)
 Frame = -3

Query: 737 DDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKS 558
           D  S    +SWTS I    + G    A   F  M+L+GVEP++VT +++L+ACA+  +  
Sbjct: 182 DQMSERDAISWTSMIGGFVKKGCFEQALEWFREMQLAGVEPDYVTIISVLAACANLGALG 241

Query: 557 LRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMID 378
           L  G  I+ ++ K  F K+N+ +  SL+ MYS+C  + LAR VF++M  ++ +SWN+MI 
Sbjct: 242 L--GLWINRFVMKQDF-KDNIKISNSLIDMYSRCGCIRLARQVFEQMPKRSLVSWNSMIV 298

Query: 377 GYMRNGDVEDA-------------------IKLFDQMPKRDKVS-----WTALIGGFVKK 270
           G+  NG  E+A                   ++ FD M +  K+S     +  L+  + + 
Sbjct: 299 GFALNGHAEEALEFFNLMRKEGHSGLVDEGLQFFDIMKRTRKISPRIEHYGCLVDLYSRA 358

Query: 269 GHFEEALEWFREMQLSGVEPDYVTIIAVLSAIANLGAIGLGIWVHRYVLQQD 114
           G  E+AL     M +   +P+ V + ++L+A    G +GL   + +Y+ + D
Sbjct: 359 GRLEDALNVIANMPM---KPNEVVLGSLLAACRTHGDVGLAERLMKYLCEVD 407


>ref|XP_006481538.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Citrus sinensis]
          Length = 509

 Score =  355 bits (910), Expect = 2e-95
 Identities = 162/261 (62%), Positives = 213/261 (81%), Gaps = 1/261 (0%)
 Frame = -3

Query: 782 NRNLVNKTQLSVRSNDDQS-IDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHV 606
           N+NL    Q+S+++N+ +S ++ TV WTSSI+R CR+G +++AA EF RM L G  PNH+
Sbjct: 22  NQNLTTTPQISIQTNNSKSTVNPTVQWTSSISRHCRSGRIAEAALEFTRMTLHGTNPNHI 81

Query: 605 TFVTLLSACADFPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVF 426
           TF+TLLS CADFPS+ L  G+ IHG + KLG D+NNV +GT+L+ MY+K   +DLA  VF
Sbjct: 82  TFITLLSGCADFPSQCLFLGAMIHGLVCKLGLDRNNVMVGTALLDMYAKFGRMDLATVVF 141

Query: 425 DEMCVKNSMSWNTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALE 246
           D M VK+S +WN MIDGYMR GD+E A+++FD+MP RD +SWTAL+ GFVK+G+FEEALE
Sbjct: 142 DAMRVKSSFTWNAMIDGYMRRGDIESAVRMFDEMPVRDAISWTALLNGFVKRGYFEEALE 201

Query: 245 WFREMQLSGVEPDYVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYS 66
            FREMQ+SGVEPDYVTII+VL+A AN+G +G+G+W+HRYVL+QDFK+N++V N+LID+YS
Sbjct: 202 CFREMQISGVEPDYVTIISVLNACANVGTLGIGLWIHRYVLKQDFKDNVKVCNTLIDLYS 261

Query: 65  RCGCIEFARQEFNNMQKRSLV 3
           RCGCIEFARQ F  M KR+LV
Sbjct: 262 RCGCIEFARQVFQRMHKRTLV 282



 Score =  112 bits (280), Expect = 2e-22
 Identities = 74/248 (29%), Positives = 128/248 (51%), Gaps = 4/248 (1%)
 Frame = -3

Query: 752 SVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACAD 573
           +VR  D+  +   +SWT+ +    + G   +A   F  M++SGVEP++VT +++L+ACA+
Sbjct: 168 AVRMFDEMPVRDAISWTALLNGFVKRGYFEEALECFREMQISGVEPDYVTIISVLNACAN 227

Query: 572 FPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSW 393
             +  L  G  IH Y+ K  F K+NV +  +L+ +YS+C  ++ AR VF  M  +  +SW
Sbjct: 228 VGT--LGIGLWIHRYVLKQDF-KDNVKVCNTLIDLYSRCGCIEFARQVFQRMHKRTLVSW 284

Query: 392 NTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVE 213
           N++I G+  NG V +A+                               E+F  MQ  G +
Sbjct: 285 NSIIVGFAVNGFVGEAL-------------------------------EYFNSMQKEGFK 313

Query: 212 PDYVTIIAVLSAIANLGAIGLGIWVHRY--VLQQDFKENIRVSN--SLIDMYSRCGCIEF 45
           PD V+    L+A ++ G I  G+   RY  ++++ ++ + R+ +   ++D+YSR G +E 
Sbjct: 314 PDGVSFTGALTACSHAGLIEDGL---RYFDIMKKIYRVSPRIEHYGCIVDLYSRAGRLED 370

Query: 44  ARQEFNNM 21
           A     NM
Sbjct: 371 ALNVVENM 378


>ref|XP_002523876.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223536964|gb|EEF38602.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 384

 Score =  353 bits (907), Expect = 4e-95
 Identities = 162/258 (62%), Positives = 209/258 (81%)
 Frame = -3

Query: 776 NLVNKTQLSVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFV 597
           +L+   + +++   ++SID T++WTSSI+R C NG L +AA+ F +MRL+ VEPNH+TF 
Sbjct: 39  HLIQHPRTNLKHQCNRSIDLTIAWTSSISRHCCNGQLPEAASLFTQMRLAAVEPNHITFA 98

Query: 596 TLLSACADFPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEM 417
           TL+S CADFP +    G SIH Y+RKLG D  NV +GT+LV MY+KC  V LAR +FD++
Sbjct: 99  TLISFCADFPFQGKSIGPSIHAYVRKLGLDTCNVMVGTALVDMYAKCGKVQLARLIFDDL 158

Query: 416 CVKNSMSWNTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFR 237
            VKNS+SWNTMIDGYMRNG+   A++LFD+MP++D +SWT  I GF+KKGHFE+ALEWFR
Sbjct: 159 KVKNSVSWNTMIDGYMRNGETGSAMELFDEMPEKDAISWTVFIDGFIKKGHFEQALEWFR 218

Query: 236 EMQLSGVEPDYVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCG 57
           EMQ+S VEPDYVTIIAVLSA ANLGA+GLG+W+HRYVL+++F+ N+R+ NSLIDMYSRCG
Sbjct: 219 EMQVSKVEPDYVTIIAVLSACANLGALGLGLWIHRYVLEKEFRNNVRIGNSLIDMYSRCG 278

Query: 56  CIEFARQEFNNMQKRSLV 3
           CIE ARQ F+ M KR+LV
Sbjct: 279 CIELARQVFHKMLKRTLV 296



 Score = 94.7 bits (234), Expect = 4e-17
 Identities = 65/198 (32%), Positives = 96/198 (48%)
 Frame = -3

Query: 737 DDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKS 558
           D+      +SWT  I    + G    A   F  M++S VEP++VT + +LSACA+  +  
Sbjct: 187 DEMPEKDAISWTVFIDGFIKKGHFEQALEWFREMQVSKVEPDYVTIIAVLSACANLGALG 246

Query: 557 LRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMID 378
           L  G  IH Y+ +  F +NNV +G                               N++ID
Sbjct: 247 L--GLWIHRYVLEKEF-RNNVRIG-------------------------------NSLID 272

Query: 377 GYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVT 198
            Y R G +E A ++F +M KR  VSW ++I GF   G  EEALE+F  MQ  G +PD V+
Sbjct: 273 MYSRCGCIELARQVFHKMLKRTLVSWNSIIVGFAANGFAEEALEYFGLMQKEGFKPDGVS 332

Query: 197 IIAVLSAIANLGAIGLGI 144
               L+A ++ G +  G+
Sbjct: 333 FTGALTACSHAGMVDEGL 350


>ref|XP_007015694.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
           cacao] gi|508786057|gb|EOY33313.1| Tetratricopeptide
           repeat (TPR)-like superfamily protein [Theobroma cacao]
          Length = 509

 Score =  352 bits (903), Expect = 1e-94
 Identities = 162/241 (67%), Positives = 202/241 (83%)
 Frame = -3

Query: 725 IDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFG 546
           +DH VSWTSSI+R CR G +S+AA+EF RMRLS VEPNH+TFVTLLS CADFP KS   G
Sbjct: 42  LDHIVSWTSSISRHCRAGQISEAASEFTRMRLSEVEPNHITFVTLLSGCADFPLKSGVLG 101

Query: 545 SSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMR 366
             IHGY+ KLG DK NV +GT+LV MY+KC HV +A+ VFD M VKN +SWNTM+DGYMR
Sbjct: 102 VLIHGYVCKLGLDKENVMVGTALVEMYAKCGHVKVAKLVFDVMRVKNLVSWNTMVDGYMR 161

Query: 365 NGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAV 186
           NG+ E A+++FD+MP+RD +SWTALI GF ++G  EEAL+WFREM + GV+PDYV IIAV
Sbjct: 162 NGEYEKAVEIFDEMPQRDVISWTALINGFARRGFHEEALDWFREMMIFGVKPDYVVIIAV 221

Query: 185 LSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFNNMQKRSL 6
           L+A ANLGA+G+G+W+HR+VL+Q F++N+RV+NSLIDMYSRCGCIE AR+ F+ MQKR+L
Sbjct: 222 LTACANLGALGVGLWIHRFVLKQSFRDNVRVNNSLIDMYSRCGCIELAREVFDKMQKRTL 281

Query: 5   V 3
           V
Sbjct: 282 V 282



 Score =  104 bits (260), Expect = 4e-20
 Identities = 74/243 (30%), Positives = 121/243 (49%), Gaps = 4/243 (1%)
 Frame = -3

Query: 737 DDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKS 558
           D+      +SWT+ I    R G   +A   F  M + GV+P++V  + +L+ACA+  +  
Sbjct: 173 DEMPQRDVISWTALINGFARRGFHEEALDWFREMMIFGVKPDYVVIIAVLTACANLGA-- 230

Query: 557 LRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMID 378
           L  G  IH ++ K  F ++NV +  SL+ MYS                            
Sbjct: 231 LGVGLWIHRFVLKQSF-RDNVRVNNSLIDMYS---------------------------- 261

Query: 377 GYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVT 198
              R G +E A ++FD+M KR  VSW ++I GF   G  EEAL++F  MQ  G +PD V+
Sbjct: 262 ---RCGCIELAREVFDKMQKRTLVSWNSIIVGFAVNGFAEEALKYFDSMQKEGFKPDGVS 318

Query: 197 IIAVLSAIANLGAIGLGIWVHRY--VLQQDFKENIRVSN--SLIDMYSRCGCIEFARQEF 30
               L+A ++ G +  G+   RY  ++++ ++ + R+ +   ++D+YSR G +E A    
Sbjct: 319 FTGALTACSHAGLVDEGL---RYFGIMKRVYRISPRIEHFGCIVDLYSRAGKLEEALDVI 375

Query: 29  NNM 21
            NM
Sbjct: 376 ENM 378


>gb|EXC37761.1| hypothetical protein L484_001219 [Morus notabilis]
          Length = 508

 Score =  346 bits (888), Expect = 7e-93
 Identities = 165/241 (68%), Positives = 202/241 (83%)
 Frame = -3

Query: 725 IDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFG 546
           I+  V WTSSIAR C+NG  S+AAAEF RMRLSGVEPNHVTFVTLLS CAD    ++ FG
Sbjct: 47  IEPVVKWTSSIARHCKNGRFSEAAAEFSRMRLSGVEPNHVTFVTLLSGCAD---SNISFG 103

Query: 545 SSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMR 366
           +SIHGY RKL FD +NV +GT+LV MY+K   VD+AR VFD++  KNS+SWNTMIDGYMR
Sbjct: 104 ASIHGYARKLCFDTSNVMVGTALVAMYAKRGLVDVARLVFDDIKEKNSVSWNTMIDGYMR 163

Query: 365 NGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAV 186
           NG V DA+++FD+MP+RD VSWTALIGGFVK+  FEEALEWFREMQ+S VEPDYVT+IAV
Sbjct: 164 NGKVRDAVEVFDEMPERDAVSWTALIGGFVKRRRFEEALEWFREMQVSSVEPDYVTVIAV 223

Query: 185 LSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFNNMQKRSL 6
           L+A A+LG +GLG+W++R+++ + FK+N+++SNSLIDMYSRCGCIEFARQ F  M  R+L
Sbjct: 224 LAACADLGTVGLGLWMNRFIMNRKFKDNVKISNSLIDMYSRCGCIEFARQVFERMPNRTL 283

Query: 5   V 3
           V
Sbjct: 284 V 284



 Score =  104 bits (259), Expect = 6e-20
 Identities = 73/242 (30%), Positives = 117/242 (48%), Gaps = 1/242 (0%)
 Frame = -3

Query: 764 KTQLSVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLS 585
           K + +V   D+      VSWT+ I    +     +A   F  M++S VEP++VT + +L+
Sbjct: 166 KVRDAVEVFDEMPERDAVSWTALIGGFVKRRRFEEALEWFREMQVSSVEPDYVTVIAVLA 225

Query: 584 ACADFPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKN 405
           ACAD  +  L  G  ++ +I    F K+NV +  SL+ MYS                   
Sbjct: 226 ACADLGTVGL--GLWMNRFIMNRKF-KDNVKISNSLIDMYS------------------- 263

Query: 404 SMSWNTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQL 225
                       R G +E A ++F++MP R  VSW ++I GF   GH EEAL++F  MQ 
Sbjct: 264 ------------RCGCIEFARQVFERMPNRTLVSWNSIIVGFAVNGHAEEALKFFNLMQR 311

Query: 224 SGVEPDYVTIIAVLSAIANLGAIGLGIWVHRYVLQ-QDFKENIRVSNSLIDMYSRCGCIE 48
            G +PD V+    L+A ++ G +  G+ +   + +    +  I     ++D+YSR G +E
Sbjct: 312 EGFKPDGVSFTGALTACSHAGLVEEGLLLFENMKRVHGIRHRIEHYGCIVDLYSRAGRLE 371

Query: 47  FA 42
            A
Sbjct: 372 DA 373



 Score = 68.2 bits (165), Expect = 4e-09
 Identities = 38/108 (35%), Positives = 61/108 (56%), Gaps = 1/108 (0%)
 Frame = -3

Query: 323 PKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSAIANLGAIGLGI 144
           P    V WT+ I    K G F EA   F  M+LSGVEP++VT + +LS  A+   I  G 
Sbjct: 46  PIEPVVKWTSSIARHCKNGRFSEAAAEFSRMRLSGVEPNHVTFVTLLSGCAD-SNISFGA 104

Query: 143 WVHRYVLQQDF-KENIRVSNSLIDMYSRCGCIEFARQEFNNMQKRSLV 3
            +H Y  +  F   N+ V  +L+ MY++ G ++ AR  F+++++++ V
Sbjct: 105 SIHGYARKLCFDTSNVMVGTALVAMYAKRGLVDVARLVFDDIKEKNSV 152


>ref|XP_004293118.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 504

 Score =  345 bits (884), Expect = 2e-92
 Identities = 162/255 (63%), Positives = 207/255 (81%)
 Frame = -3

Query: 767 NKTQLSVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLL 588
           NK  + ++S  +Q ID TV WTSSI++RCRNG L+ A ++F++MR + VEPNH+TFVTLL
Sbjct: 29  NKHSVLLKSRKEQ-IDQTVLWTSSISQRCRNGQLAQAVSQFIQMRRARVEPNHITFVTLL 87

Query: 587 SACADFPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVK 408
           S CA FP+K+  FG S+H Y+ KLG D+ NV +GT+L+ MY+K   V+ AR  F  M VK
Sbjct: 88  SGCAHFPAKAAFFGPSLHAYVCKLGLDRTNVIVGTALIDMYAKSGRVEFARLAFGGMEVK 147

Query: 407 NSMSWNTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQ 228
           NSMSWNT+IDGYM+ G+V DA+++FD+MPKRD VSWT LIGGFVKK  +E+ALEWFREMQ
Sbjct: 148 NSMSWNTLIDGYMKMGNVRDAVEVFDEMPKRDAVSWTTLIGGFVKKRRYEDALEWFREMQ 207

Query: 227 LSGVEPDYVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIE 48
           +SGVEPDYVTIIAV++A A+LG +GLG+WV+R+V +Q F+ NIR+SNSLIDMYSRCGCI+
Sbjct: 208 VSGVEPDYVTIIAVIAACADLGTLGLGLWVNRFVTKQHFRHNIRISNSLIDMYSRCGCID 267

Query: 47  FARQEFNNMQKRSLV 3
           FARQ F NM  R+LV
Sbjct: 268 FARQVFGNMPNRTLV 282



 Score =  107 bits (268), Expect = 5e-21
 Identities = 72/240 (30%), Positives = 114/240 (47%), Gaps = 1/240 (0%)
 Frame = -3

Query: 737 DDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKS 558
           D+      VSWT+ I    +     DA   F  M++SGVEP++VT + +++ACAD  +  
Sbjct: 173 DEMPKRDAVSWTTLIGGFVKKRRYEDALEWFREMQVSGVEPDYVTIIAVIAACADLGTLG 232

Query: 557 LRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMID 378
           L  G  ++ ++ K  F ++N+ +  SL+ MYS                            
Sbjct: 233 L--GLWVNRFVTKQHF-RHNIRISNSLIDMYS---------------------------- 261

Query: 377 GYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVT 198
              R G ++ A ++F  MP R  VSW ++I GF   GH EEALE+F +MQ  G +PD V+
Sbjct: 262 ---RCGCIDFARQVFGNMPNRTLVSWNSMIVGFAVNGHAEEALEFFHQMQKEGFKPDGVS 318

Query: 197 IIAVLSAIANLGAIGLGI-WVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFNNM 21
               L+A ++ G +  G+ +  +          I     ++D+YSR G +E A     NM
Sbjct: 319 FTGALTACSHAGLVDEGLHFFDKMKRIHKITPRIEHYGCIVDLYSRAGRLEDALSIIENM 378


>ref|XP_004251424.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Solanum lycopersicum]
          Length = 507

 Score =  340 bits (871), Expect = 6e-91
 Identities = 159/248 (64%), Positives = 198/248 (79%)
 Frame = -3

Query: 746 RSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFP 567
           RSN+D     T SWTS IAR C+NG L +A AEF RMR SGVEPNH+TFVTLLS CA FP
Sbjct: 38  RSNNDS----TASWTSLIARHCKNGRLIEAVAEFTRMRNSGVEPNHITFVTLLSCCAHFP 93

Query: 566 SKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNT 387
            ++L FGS++HGY RKLG D  NV +GT+++ MYSK   V LAR  FD M  KN ++WNT
Sbjct: 94  DQALSFGSALHGYARKLGLDTQNVKVGTAVIDMYSKFGLVGLARLSFDHMGAKNKVTWNT 153

Query: 386 MIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPD 207
           M+DGYMRNGD ++A+K+FD++P RD +SWTAL+GGFVK G FEE L WFREMQLSGVEPD
Sbjct: 154 MVDGYMRNGDFKNAVKVFDEIPDRDVISWTALVGGFVKNGLFEEGLVWFREMQLSGVEPD 213

Query: 206 YVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFN 27
           YVT+I+VLSA ANLG +G+ +W+HR++L+++FK+N+RV+NSLIDMY RCGC+E A Q F+
Sbjct: 214 YVTMISVLSACANLGTLGISLWLHRFILRREFKDNVRVNNSLIDMYCRCGCVELACQVFH 273

Query: 26  NMQKRSLV 3
            M  RSLV
Sbjct: 274 RMTGRSLV 281



 Score =  103 bits (258), Expect = 7e-20
 Identities = 68/225 (30%), Positives = 111/225 (49%), Gaps = 1/225 (0%)
 Frame = -3

Query: 713 VSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFGSSIH 534
           +SWT+ +    +NG+  +    F  M+LSGVEP++VT +++LSACA+  +  +     +H
Sbjct: 180 ISWTALVGGFVKNGLFEEGLVWFREMQLSGVEPDYVTMISVLSACANLGTLGISLW--LH 237

Query: 533 GYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMRNGDV 354
            +I +  F K+NV +  SL+ MY +C  V+LA  VF  M  ++ +SWN++I G   NG  
Sbjct: 238 RFILRREF-KDNVRVNNSLIDMYCRCGCVELACQVFHRMTGRSLVSWNSIIVGLAVNGHA 296

Query: 353 EDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSAI 174
            DA++ FD                                MQ  G +PD VT   VL+A 
Sbjct: 297 IDALQYFDL-------------------------------MQNEGFQPDGVTFTGVLTAC 325

Query: 173 ANLGAIGLGIWVHRYVLQ-QDFKENIRVSNSLIDMYSRCGCIEFA 42
           ++ G +  G+   + + +       I     ++D+YSR G +E A
Sbjct: 326 SHAGLVEKGLKYFKAMKRVHRITPRIEHYGCIVDLYSRAGRLEDA 370


>ref|XP_006340426.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Solanum tuberosum]
          Length = 509

 Score =  338 bits (868), Expect = 1e-90
 Identities = 159/260 (61%), Positives = 205/260 (78%)
 Frame = -3

Query: 782 NRNLVNKTQLSVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVT 603
           N+N  + +  + RSN+D     T SWTS IAR C+NG L +A +EF RMR SGVEPNH+T
Sbjct: 28  NKNSASASAATYRSNNDS----TASWTSLIARHCKNGRLIEAVSEFTRMRNSGVEPNHIT 83

Query: 602 FVTLLSACADFPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFD 423
           FVTLLS CA FP+++L  GS++HGY RKLG D  NV +GT+++ MYSK   V LAR  FD
Sbjct: 84  FVTLLSGCAHFPAQALSLGSALHGYARKLGLDTQNVKVGTAVIDMYSKFGLVGLARLSFD 143

Query: 422 EMCVKNSMSWNTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEW 243
            M VKN ++WNTM+DGYMRNGD ++A+K+FD++ +RD +SWTAL+GGFVK G FEE L W
Sbjct: 144 HMGVKNKVTWNTMVDGYMRNGDFKNAVKVFDEITERDVISWTALVGGFVKNGLFEEGLVW 203

Query: 242 FREMQLSGVEPDYVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSR 63
           FREMQLS VEPDYVT+I+VLSA ANLG +G+ +W+HR++L+++FK+N+RV+NSLIDMY R
Sbjct: 204 FREMQLSEVEPDYVTMISVLSACANLGTLGISLWLHRFILRREFKDNVRVNNSLIDMYCR 263

Query: 62  CGCIEFARQEFNNMQKRSLV 3
           CGCIE A Q F+ M +RSLV
Sbjct: 264 CGCIELACQVFDRMTERSLV 283



 Score =  104 bits (259), Expect = 6e-20
 Identities = 71/245 (28%), Positives = 118/245 (48%), Gaps = 1/245 (0%)
 Frame = -3

Query: 752 SVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACAD 573
           +V+  D+ +    +SWT+ +    +NG+  +    F  M+LS VEP++VT +++LSACA+
Sbjct: 169 AVKVFDEITERDVISWTALVGGFVKNGLFEEGLVWFREMQLSEVEPDYVTMISVLSACAN 228

Query: 572 FPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSW 393
             +  +     +H +I +  F K+NV +  SL+ MY +C  ++LA  VFD M  ++ +SW
Sbjct: 229 LGTLGISLW--LHRFILRREF-KDNVRVNNSLIDMYCRCGCIELACQVFDRMTERSLVSW 285

Query: 392 NTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVE 213
           N++I G   NG   DA                               L++F  MQ  G  
Sbjct: 286 NSIIVGLAVNGHAVDA-------------------------------LQYFELMQNEGFL 314

Query: 212 PDYVTIIAVLSAIANLGAIGLGIWVHRYVLQ-QDFKENIRVSNSLIDMYSRCGCIEFARQ 36
           PD VT   VL+A ++ G +  G+   + + +       I     ++D+YSR G +E A  
Sbjct: 315 PDAVTFTGVLTACSHAGLVEKGLKYFKSMKRVHRITPRIEHYGCIVDLYSRAGRLEDALG 374

Query: 35  EFNNM 21
              NM
Sbjct: 375 IIKNM 379


>ref|XP_004159118.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At1g05750, chloroplastic-like [Cucumis sativus]
          Length = 525

 Score =  335 bits (859), Expect = 2e-89
 Identities = 160/242 (66%), Positives = 199/242 (82%)
 Frame = -3

Query: 728 SIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRF 549
           S+D  V WTSS+AR CRNG LS+AAAEF RMRL+GVEPNH+TF+TLLSACADFPS+S  F
Sbjct: 53  SVDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFF 112

Query: 548 GSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYM 369
            SS+HGY  K G D  +V +GT+L+ MYSKCA +  AR VF  + VKNS+SWNTM++G+M
Sbjct: 113 ASSLHGYACKYGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFM 172

Query: 368 RNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIA 189
           RNG++E AI+LFD+MP RD +SWTALI G +K G+ E+ALE F +MQ SGV  DYV+IIA
Sbjct: 173 RNGEIELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIA 232

Query: 188 VLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFNNMQKRS 9
           VL+A A+LGA+ LG+WVHR+V+ Q+FK+NI++SNSLIDMYSRCGCIEFARQ F  M KR+
Sbjct: 233 VLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRT 292

Query: 8   LV 3
           LV
Sbjct: 293 LV 294



 Score = 96.3 bits (238), Expect = 2e-17
 Identities = 55/181 (30%), Positives = 99/181 (54%), Gaps = 4/181 (2%)
 Frame = -3

Query: 758 QLSVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSAC 579
           +L+++  D+      +SWT+ I    ++G    A   F +M+ SGV  ++V+ + +L+AC
Sbjct: 178 ELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAAC 237

Query: 578 ADFPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSM 399
           AD  + +L  G  +H ++    F K+N+ +  SL+ MYS+C  ++ AR VF +M  +  +
Sbjct: 238 ADLGALTL--GLWVHRFVMPQEF-KDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLV 294

Query: 398 SWNTMIDGYMRNGDVEDAIKLFDQMPKR----DKVSWTALIGGFVKKGHFEEALEWFREM 231
           SWN++I G+  NG  +++++ F  M K     D VS+T  +      G   + LE F  M
Sbjct: 295 SWNSIIVGFAVNGFADESLEFFXAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNM 354

Query: 230 Q 228
           +
Sbjct: 355 K 355


>ref|XP_004139593.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Cucumis sativus]
          Length = 525

 Score =  335 bits (859), Expect = 2e-89
 Identities = 160/242 (66%), Positives = 199/242 (82%)
 Frame = -3

Query: 728 SIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRF 549
           S+D  V WTSS+AR CRNG LS+AAAEF RMRL+GVEPNH+TF+TLLSACADFPS+S  F
Sbjct: 53  SVDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFF 112

Query: 548 GSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYM 369
            SS+HGY  K G D  +V +GT+L+ MYSKCA +  AR VF  + VKNS+SWNTM++G+M
Sbjct: 113 ASSLHGYACKYGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFM 172

Query: 368 RNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIA 189
           RNG++E AI+LFD+MP RD +SWTALI G +K G+ E+ALE F +MQ SGV  DYV+IIA
Sbjct: 173 RNGEIELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIA 232

Query: 188 VLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFNNMQKRS 9
           VL+A A+LGA+ LG+WVHR+V+ Q+FK+NI++SNSLIDMYSRCGCIEFARQ F  M KR+
Sbjct: 233 VLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRT 292

Query: 8   LV 3
           LV
Sbjct: 293 LV 294



 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 55/181 (30%), Positives = 99/181 (54%), Gaps = 4/181 (2%)
 Frame = -3

Query: 758 QLSVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSAC 579
           +L+++  D+      +SWT+ I    ++G    A   F +M+ SGV  ++V+ + +L+AC
Sbjct: 178 ELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAAC 237

Query: 578 ADFPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSM 399
           AD  + +L  G  +H ++    F K+N+ +  SL+ MYS+C  ++ AR VF +M  +  +
Sbjct: 238 ADLGALTL--GLWVHRFVMPQEF-KDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLV 294

Query: 398 SWNTMIDGYMRNGDVEDAIKLFDQMPKR----DKVSWTALIGGFVKKGHFEEALEWFREM 231
           SWN++I G+  NG  +++++ F  M K     D VS+T  +      G   + LE F  M
Sbjct: 295 SWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNM 354

Query: 230 Q 228
           +
Sbjct: 355 K 355


>ref|XP_006417992.1| hypothetical protein EUTSA_v10009524mg [Eutrema salsugineum]
           gi|557095763|gb|ESQ36345.1| hypothetical protein
           EUTSA_v10009524mg [Eutrema salsugineum]
          Length = 500

 Score =  330 bits (847), Expect = 4e-88
 Identities = 156/255 (61%), Positives = 193/255 (75%)
 Frame = -3

Query: 767 NKTQLSVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLL 588
           N+    ++  +  + + TVSWTS I    RNG L+DAA EF  MRL+GVEPNH+TF+ LL
Sbjct: 25  NQANPKIQKLNQSTSETTVSWTSRITLLSRNGRLADAAKEFSDMRLAGVEPNHITFIALL 84

Query: 587 SACADFPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVK 408
           S C DFPS S   G  +HGY  KLG D+N+V +GT+++GMYSK      AR VFD M  K
Sbjct: 85  SGCGDFPSGSEALGDLLHGYACKLGLDRNHVMVGTAILGMYSKRGRFRKARLVFDYMEDK 144

Query: 407 NSMSWNTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQ 228
           NS++WNTMIDGYMRNG V DA+K+FD+MP RD +SWTA++ GFVKKG  EEAL WFREMQ
Sbjct: 145 NSVTWNTMIDGYMRNGQVYDAVKMFDEMPDRDLISWTAMMNGFVKKGFHEEALAWFREMQ 204

Query: 227 LSGVEPDYVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIE 48
           +SGVEPDYV IIA L+A  NLGA+  G+WVHRYV+  DFK N+RVSNSLID+Y RCGC+E
Sbjct: 205 ISGVEPDYVAIIAALAACTNLGALSFGLWVHRYVMSHDFKNNVRVSNSLIDLYCRCGCVE 264

Query: 47  FARQEFNNMQKRSLV 3
           FARQ F+ M+KR++V
Sbjct: 265 FARQVFDKMEKRTVV 279



 Score =  117 bits (292), Expect = 8e-24
 Identities = 75/248 (30%), Positives = 129/248 (52%), Gaps = 4/248 (1%)
 Frame = -3

Query: 752 SVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACAD 573
           +V+  D+      +SWT+ +    + G   +A A F  M++SGVEP++V  +  L+AC +
Sbjct: 165 AVKMFDEMPDRDLISWTAMMNGFVKKGFHEEALAWFREMQISGVEPDYVAIIAALAACTN 224

Query: 572 FPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSW 393
             +  L FG  +H Y+    F KNNV +  SL+ +Y +C  V+ AR VFD+M  +  +SW
Sbjct: 225 LGA--LSFGLWVHRYVMSHDF-KNNVRVSNSLIDLYCRCGCVEFARQVFDKMEKRTVVSW 281

Query: 392 NTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVE 213
           N++I G+  NG+                                +E+L +FR+MQ  G +
Sbjct: 282 NSVIVGFAANGNA-------------------------------DESLVYFRKMQEEGFK 310

Query: 212 PDYVTIIAVLSAIANLGAIGLGIWVHRY--VLQQDFKENIRVSN--SLIDMYSRCGCIEF 45
           PD VT    L+A +++G +  G+   RY   +++D++ + R+ +   L+D+YSR G +E 
Sbjct: 311 PDAVTFTGALTACSHVGLVEEGL---RYFQTMKRDYRISPRIEHYGCLVDLYSRAGRLED 367

Query: 44  ARQEFNNM 21
           A +   +M
Sbjct: 368 ALKVVQSM 375



 Score = 62.8 bits (151), Expect = 2e-07
 Identities = 38/125 (30%), Positives = 64/125 (51%), Gaps = 3/125 (2%)
 Frame = -3

Query: 368 RNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIA 189
           R       I+  +Q      VSWT+ I    + G   +A + F +M+L+GVEP+++T IA
Sbjct: 23  RENQANPKIQKLNQSTSETTVSWTSRITLLSRNGRLADAAKEFSDMRLAGVEPNHITFIA 82

Query: 188 VLSAIANL--GAIGLGIWVHRYVLQQDFKEN-IRVSNSLIDMYSRCGCIEFARQEFNNMQ 18
           +LS   +   G+  LG  +H Y  +     N + V  +++ MYS+ G    AR  F+ M+
Sbjct: 83  LLSGCGDFPSGSEALGDLLHGYACKLGLDRNHVMVGTAILGMYSKRGRFRKARLVFDYME 142

Query: 17  KRSLV 3
            ++ V
Sbjct: 143 DKNSV 147


>ref|XP_002889563.1| PDE247 [Arabidopsis lyrata subsp. lyrata]
           gi|297335405|gb|EFH65822.1| PDE247 [Arabidopsis lyrata
           subsp. lyrata]
          Length = 500

 Score =  327 bits (837), Expect = 5e-87
 Identities = 152/240 (63%), Positives = 190/240 (79%)
 Frame = -3

Query: 722 DHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFGS 543
           ++TVSWTS I    RNG L++AA EF  MRL+GVEPNH+TF+ +LS C DFPS S   G 
Sbjct: 34  ENTVSWTSRITLLTRNGRLAEAAKEFSDMRLAGVEPNHITFIAILSGCGDFPSGSEALGD 93

Query: 542 SIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMRN 363
            +HGY  KLG D+N+V +GT+++GMYSK   V  AR VFD M  KNS++WNTMIDGYMR+
Sbjct: 94  LLHGYACKLGLDRNHVMVGTAIIGMYSKRGRVKKARCVFDYMEDKNSVTWNTMIDGYMRS 153

Query: 362 GDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVL 183
           G V++A K+FD+MP+RD +SWTA+I GFV KG  EEAL WFREMQ+SGV+PDYV IIA L
Sbjct: 154 GQVDNAAKMFDKMPERDLISWTAMINGFVNKGFHEEALAWFREMQISGVKPDYVAIIAAL 213

Query: 182 SAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFNNMQKRSLV 3
           +A  NLGA+  G+WVHRYV+ QDFK N+RVSNSLID+Y RCGC+EFARQ F+ M+KR++V
Sbjct: 214 NACTNLGALSFGLWVHRYVMSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFDKMEKRTVV 273



 Score =  108 bits (269), Expect = 4e-21
 Identities = 75/235 (31%), Positives = 120/235 (51%), Gaps = 4/235 (1%)
 Frame = -3

Query: 713 VSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFGSSIH 534
           +SWT+ I      G   +A A F  M++SGV+P++V  +  L+AC +  +  L FG  +H
Sbjct: 172 ISWTAMINGFVNKGFHEEALAWFREMQISGVKPDYVAIIAALNACTNLGA--LSFGLWVH 229

Query: 533 GYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMRNGDV 354
            Y+    F KNNV +                                N++ID Y R G V
Sbjct: 230 RYVMSQDF-KNNVRVS-------------------------------NSLIDLYCRCGCV 257

Query: 353 EDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSAI 174
           E A ++FD+M KR  VSW ++I GF   G+  E+L +FR+MQ    +PD VT    L+A 
Sbjct: 258 EFARQVFDKMEKRTVVSWNSVIVGFAANGNAHESLVYFRKMQEERFKPDAVTFTGALTAC 317

Query: 173 ANLGAIGLGIWVHRY--VLQQDFKENIRVSN--SLIDMYSRCGCIEFARQEFNNM 21
           +++G +  G+   RY  ++  D++ + R+ +   L+D+YSR G +E A +   +M
Sbjct: 318 SHVGLVEEGL---RYFQIMISDYRISPRIEHYGCLVDLYSRAGRLEDALKLVQSM 369



 Score = 67.0 bits (162), Expect = 1e-08
 Identities = 39/117 (33%), Positives = 66/117 (56%), Gaps = 3/117 (2%)
 Frame = -3

Query: 344 IKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSAIANL 165
           I+  +Q    + VSWT+ I    + G   EA + F +M+L+GVEP+++T IA+LS   + 
Sbjct: 25  IQRLNQSTSENTVSWTSRITLLTRNGRLAEAAKEFSDMRLAGVEPNHITFIAILSGCGDF 84

Query: 164 --GAIGLGIWVHRYVLQQDFKEN-IRVSNSLIDMYSRCGCIEFARQEFNNMQKRSLV 3
             G+  LG  +H Y  +     N + V  ++I MYS+ G ++ AR  F+ M+ ++ V
Sbjct: 85  PSGSEALGDLLHGYACKLGLDRNHVMVGTAIIGMYSKRGRVKKARCVFDYMEDKNSV 141


>ref|XP_003541961.1| PREDICTED: pentatricopeptide repeat-containing protein At1g05750,
           chloroplastic-like [Glycine max]
          Length = 521

 Score =  326 bits (836), Expect = 7e-87
 Identities = 157/261 (60%), Positives = 206/261 (78%), Gaps = 1/261 (0%)
 Frame = -3

Query: 782 NRNLVNKTQLSVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVT 603
           N N      LS+R     + D  VSWT+SIA  C++G L  AA++FV+MR + +EPNH+T
Sbjct: 35  NTNTNTNQGLSLRHTTKYN-DPIVSWTTSIADYCKSGHLVKAASKFVQMREAAIEPNHIT 93

Query: 602 FVTLLSACADFPSKS-LRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVF 426
           F+TLLSACA +PS+S + FG++IH ++RKLG D N+V +GT+L+ MY+KC  V+ AR  F
Sbjct: 94  FITLLSACAHYPSRSSISFGTAIHAHVRKLGLDINDVMVGTALIDMYAKCGRVESARLAF 153

Query: 425 DEMCVKNSMSWNTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALE 246
           D+M V+N +SWNTMIDGYMRNG  EDA+++FD +P ++ +SWTALIGGFVKK + EEALE
Sbjct: 154 DQMGVRNLVSWNTMIDGYMRNGKFEDALQVFDGLPVKNAISWTALIGGFVKKDYHEEALE 213

Query: 245 WFREMQLSGVEPDYVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYS 66
            FREMQLSGV PDYVT+IAV++A ANLG +GLG+WVHR V+ QDF+ N++VSNSLIDMYS
Sbjct: 214 CFREMQLSGVAPDYVTVIAVIAACANLGTLGLGLWVHRLVMTQDFRNNVKVSNSLIDMYS 273

Query: 65  RCGCIEFARQEFNNMQKRSLV 3
           RCGCI+ ARQ F+ M +R+LV
Sbjct: 274 RCGCIDLARQVFDRMPQRTLV 294



 Score =  105 bits (262), Expect = 3e-20
 Identities = 72/240 (30%), Positives = 115/240 (47%), Gaps = 1/240 (0%)
 Frame = -3

Query: 737 DDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKS 558
           D   + + +SWT+ I    +     +A   F  M+LSGV P++VT + +++ACA+  +  
Sbjct: 185 DGLPVKNAISWTALIGGFVKKDYHEEALECFREMQLSGVAPDYVTVIAVIAACANLGTLG 244

Query: 557 LRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMID 378
           L  G  +H  +    F +NNV +  SL+ MYS                            
Sbjct: 245 L--GLWVHRLVMTQDF-RNNVKVSNSLIDMYS---------------------------- 273

Query: 377 GYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVT 198
              R G ++ A ++FD+MP+R  VSW ++I GF   G  +EAL +F  MQ  G +PD V+
Sbjct: 274 ---RCGCIDLARQVFDRMPQRTLVSWNSIIVGFAVNGLADEALSYFNSMQEEGFKPDGVS 330

Query: 197 IIAVLSAIANLGAIGLGIWVHRYVLQ-QDFKENIRVSNSLIDMYSRCGCIEFARQEFNNM 21
               L A ++ G IG G+ +  ++ + +     I     L+D+YSR G +E A     NM
Sbjct: 331 YTGALMACSHAGLIGEGLRIFEHMKRVRRILPRIEHYGCLVDLYSRAGRLEEALNVLKNM 390


>ref|NP_172066.3| pentatricopeptide repeat protein PDE247 [Arabidopsis thaliana]
           gi|75191933|sp|Q9MA50.1|PPR13_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g05750, chloroplastic; AltName: Full=Protein PIGMENT
           DEFECTIVE 247; Flags: Precursor
           gi|6850304|gb|AAF29381.1|AC009999_1 Contains similarity
           to a hypothetical protein from Arabidopsis thaliana
           gb|AC007109.6, and contains two DUF17 PF|01535 domains
           [Arabidopsis thaliana] gi|62320576|dbj|BAD95203.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332189766|gb|AEE27887.1| pentatricopeptide repeat
           protein PDE247 [Arabidopsis thaliana]
          Length = 500

 Score =  324 bits (830), Expect = 3e-86
 Identities = 154/255 (60%), Positives = 194/255 (76%)
 Frame = -3

Query: 767 NKTQLSVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLL 588
           N     ++ ++  + + TVSWTS I    RNG L++AA EF  M L+GVEPNH+TF+ LL
Sbjct: 19  NHANPKIQRHNQSTSETTVSWTSRINLLTRNGRLAEAAKEFSDMTLAGVEPNHITFIALL 78

Query: 587 SACADFPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVK 408
           S C DF S S   G  +HGY  KLG D+N+V +GT+++GMYSK      AR VFD M  K
Sbjct: 79  SGCGDFTSGSEALGDLLHGYACKLGLDRNHVMVGTAIIGMYSKRGRFKKARLVFDYMEDK 138

Query: 407 NSMSWNTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQ 228
           NS++WNTMIDGYMR+G V++A K+FD+MP+RD +SWTA+I GFVKKG+ EEAL WFREMQ
Sbjct: 139 NSVTWNTMIDGYMRSGQVDNAAKMFDKMPERDLISWTAMINGFVKKGYQEEALLWFREMQ 198

Query: 227 LSGVEPDYVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIE 48
           +SGV+PDYV IIA L+A  NLGA+  G+WVHRYVL QDFK N+RVSNSLID+Y RCGC+E
Sbjct: 199 ISGVKPDYVAIIAALNACTNLGALSFGLWVHRYVLSQDFKNNVRVSNSLIDLYCRCGCVE 258

Query: 47  FARQEFNNMQKRSLV 3
           FARQ F NM+KR++V
Sbjct: 259 FARQVFYNMEKRTVV 273



 Score =  108 bits (269), Expect = 4e-21
 Identities = 71/235 (30%), Positives = 120/235 (51%), Gaps = 4/235 (1%)
 Frame = -3

Query: 713 VSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFGSSIH 534
           +SWT+ I    + G   +A   F  M++SGV+P++V  +  L+AC +  +  L FG  +H
Sbjct: 172 ISWTAMINGFVKKGYQEEALLWFREMQISGVKPDYVAIIAALNACTNLGA--LSFGLWVH 229

Query: 533 GYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMRNGDV 354
            Y+    F KNNV +  SL+ +Y +C  V+ AR VF  M  +  +SWN++I G+  NG+ 
Sbjct: 230 RYVLSQDF-KNNVRVSNSLIDLYCRCGCVEFARQVFYNMEKRTVVSWNSVIVGFAANGNA 288

Query: 353 EDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSAI 174
                                           E+L +FR+MQ  G +PD VT    L+A 
Sbjct: 289 -------------------------------HESLVYFRKMQEKGFKPDAVTFTGALTAC 317

Query: 173 ANLGAIGLGIWVHRY--VLQQDFKENIRVSN--SLIDMYSRCGCIEFARQEFNNM 21
           +++G +  G+   RY  +++ D++ + R+ +   L+D+YSR G +E A +   +M
Sbjct: 318 SHVGLVEEGL---RYFQIMKCDYRISPRIEHYGCLVDLYSRAGRLEDALKLVQSM 369


>gb|AEP33749.1| chloroplast biogenesis 19, partial [Crucihimalaya wallichii]
          Length = 491

 Score =  322 bits (826), Expect = 1e-85
 Identities = 151/238 (63%), Positives = 187/238 (78%)
 Frame = -3

Query: 716 TVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFGSSI 537
           TVSWTS I    RNG L++AA  F  MRLSGVEPNH+TF+ LLS C DFPS S    + +
Sbjct: 27  TVSWTSRITLLTRNGRLAEAAKXFSDMRLSGVEPNHITFIALLSGCGDFPSGSETLSNLL 86

Query: 536 HGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMRNGD 357
           HGY  KLG D+ +V +GT+++GMYSK  HV  AR VFD M   NS++WNTMIDGYMR+G 
Sbjct: 87  HGYACKLGLDRTHVMVGTAIIGMYSKRGHVKKARLVFDYMEDINSVTWNTMIDGYMRSGQ 146

Query: 356 VEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSA 177
           V++A+K+FD+MP+RD +SWTA+I GFVKKG  EEAL WFREMQ+SGV PDYV IIA L+A
Sbjct: 147 VDNAVKMFDKMPERDLISWTAMINGFVKKGFHEEALVWFREMQISGVRPDYVAIIAALNA 206

Query: 176 IANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFNNMQKRSLV 3
             NLGA+  G+WVHRYV+ QDFK N+RVSNSLID+Y RCGC+EFAR+ F+ M+KR++V
Sbjct: 207 CTNLGALSFGLWVHRYVMNQDFKNNVRVSNSLIDLYCRCGCVEFAREVFDKMEKRTVV 264



 Score =  112 bits (279), Expect = 3e-22
 Identities = 72/235 (30%), Positives = 120/235 (51%), Gaps = 4/235 (1%)
 Frame = -3

Query: 713 VSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFGSSIH 534
           +SWT+ I    + G   +A   F  M++SGV P++V  +  L+AC +  +  L FG  +H
Sbjct: 163 ISWTAMINGFVKKGFHEEALVWFREMQISGVRPDYVAIIAALNACTNLGA--LSFGLWVH 220

Query: 533 GYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMRNGDV 354
            Y+    F KNNV +  SL+ +Y +C  V+ AR VFD+M  +  +SWN++I G+  NG+ 
Sbjct: 221 RYVMNQDF-KNNVRVSNSLIDLYCRCGCVEFAREVFDKMEKRTVVSWNSVIVGFAANGNA 279

Query: 353 EDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSAI 174
                                           E+L +FR+MQ  G +PD VT    L+A 
Sbjct: 280 -------------------------------HESLVYFRKMQEEGFKPDAVTFTGALTAC 308

Query: 173 ANLGAIGLGIWVHRY--VLQQDFKENIRVSN--SLIDMYSRCGCIEFARQEFNNM 21
           +++G +  G+   RY   +++D+  + R+ +   L+D+YSR G +E A +   +M
Sbjct: 309 SHVGLVEEGL---RYFQTMKRDYGISPRIEHYGCLVDLYSRAGRLEDALKVIESM 360



 Score = 62.0 bits (149), Expect = 3e-07
 Identities = 37/112 (33%), Positives = 62/112 (55%), Gaps = 3/112 (2%)
 Frame = -3

Query: 344 IKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSAIANL 165
           I+  +Q      VSWT+ I    + G   EA + F +M+LSGVEP+++T IA+LS   + 
Sbjct: 16  IQRLNQSTSETTVSWTSRITLLTRNGRLAEAAKXFSDMRLSGVEPNHITFIALLSGCGDF 75

Query: 164 --GAIGLGIWVHRYVLQQDF-KENIRVSNSLIDMYSRCGCIEFARQEFNNMQ 18
             G+  L   +H Y  +    + ++ V  ++I MYS+ G ++ AR  F+ M+
Sbjct: 76  PSGSETLSNLLHGYACKLGLDRTHVMVGTAIIGMYSKRGHVKKARLVFDYME 127


>gb|AEP33748.1| chloroplast biogenesis 19, partial [Capsella bursa-pastoris]
          Length = 489

 Score =  322 bits (825), Expect = 1e-85
 Identities = 153/245 (62%), Positives = 192/245 (78%), Gaps = 1/245 (0%)
 Frame = -3

Query: 734 DQSIDHT-VSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKS 558
           +QS   T VSWTS I    RNG L++AA EF  MRL+GVEPNH+TF+ LLS C DF S S
Sbjct: 18  NQSTSETIVSWTSRITLLTRNGRLAEAAKEFSDMRLAGVEPNHITFIALLSGCGDFSSGS 77

Query: 557 LRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMID 378
              G  +HGY  KLG D+ +V +GT+++GMYSK + V  AR VFD M  KNS++WNTMID
Sbjct: 78  EALGDLLHGYACKLGHDRTHVMVGTAILGMYSKHSRVKKARLVFDYMEDKNSVTWNTMID 137

Query: 377 GYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVT 198
           GYMRNG V++A+K+FD+MP+RD +SWTA+I GFVKKG  EEAL WFREMQ+SGV+PDYV 
Sbjct: 138 GYMRNGQVDNAVKMFDKMPERDLISWTAMINGFVKKGFHEEALAWFREMQISGVKPDYVA 197

Query: 197 IIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFNNMQ 18
           IIA L+A  NLGA+  G+WVHRYV+ QDFK N++VSNSLID+Y RCGC+EFAR+ F+ M+
Sbjct: 198 IIAALNACTNLGALSFGLWVHRYVMSQDFKNNVKVSNSLIDLYCRCGCVEFAREVFDKME 257

Query: 17  KRSLV 3
           KR++V
Sbjct: 258 KRTVV 262



 Score =  114 bits (284), Expect = 7e-23
 Identities = 73/235 (31%), Positives = 122/235 (51%), Gaps = 4/235 (1%)
 Frame = -3

Query: 713 VSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFGSSIH 534
           +SWT+ I    + G   +A A F  M++SGV+P++V  +  L+AC +  +  L FG  +H
Sbjct: 161 ISWTAMINGFVKKGFHEEALAWFREMQISGVKPDYVAIIAALNACTNLGA--LSFGLWVH 218

Query: 533 GYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMRNGDV 354
            Y+    F KNNV +  SL+ +Y +C  V+ AR VFD+M  +  +SWN++I G+  NG+ 
Sbjct: 219 RYVMSQDF-KNNVKVSNSLIDLYCRCGCVEFAREVFDKMEKRTVVSWNSVIVGFAANGNA 277

Query: 353 EDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSAI 174
                                           E+L +FR+MQ  G +PD VT    L+A 
Sbjct: 278 -------------------------------HESLVYFRKMQEEGFKPDAVTFTGALTAC 306

Query: 173 ANLGAIGLGIWVHRY--VLQQDFKENIRVSN--SLIDMYSRCGCIEFARQEFNNM 21
           +++G +  G+   RY   +++D + + R+ +   L+D+YSR G +E A +   +M
Sbjct: 307 SHVGLVEEGL---RYFQTMKRDHRISPRIEHYGCLVDLYSRAGRLEEALKVVQSM 358



 Score = 61.2 bits (147), Expect = 5e-07
 Identities = 37/125 (29%), Positives = 67/125 (53%), Gaps = 3/125 (2%)
 Frame = -3

Query: 368 RNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIA 189
           R    +  I+  +Q      VSWT+ I    + G   EA + F +M+L+GVEP+++T IA
Sbjct: 6   RKHHADPKIQKLNQSTSETIVSWTSRITLLTRNGRLAEAAKEFSDMRLAGVEPNHITFIA 65

Query: 188 VLSAIANL--GAIGLGIWVHRYVLQQDF-KENIRVSNSLIDMYSRCGCIEFARQEFNNMQ 18
           +LS   +   G+  LG  +H Y  +    + ++ V  +++ MYS+   ++ AR  F+ M+
Sbjct: 66  LLSGCGDFSSGSEALGDLLHGYACKLGHDRTHVMVGTAILGMYSKHSRVKKARLVFDYME 125

Query: 17  KRSLV 3
            ++ V
Sbjct: 126 DKNSV 130


>ref|XP_006303598.1| hypothetical protein CARUB_v10011161mg [Capsella rubella]
           gi|482572309|gb|EOA36496.1| hypothetical protein
           CARUB_v10011161mg [Capsella rubella]
          Length = 506

 Score =  321 bits (822), Expect = 3e-85
 Identities = 152/245 (62%), Positives = 192/245 (78%), Gaps = 1/245 (0%)
 Frame = -3

Query: 734 DQSIDHT-VSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKS 558
           +QS   T VSWTS I    RNG L++AA EF  MRL+GVEPNH+TF+ LLS C DF S S
Sbjct: 35  NQSTSETIVSWTSRITLLTRNGRLAEAAKEFSNMRLAGVEPNHITFIALLSGCGDFSSGS 94

Query: 557 LRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMID 378
              G  +HGY  KLG D+ +V +GT+++GMYSK + V  AR VFD M  KNS++WNTMI+
Sbjct: 95  EALGDLLHGYACKLGLDRTHVMVGTAILGMYSKRSRVKKARLVFDYMEDKNSVTWNTMIN 154

Query: 377 GYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVT 198
           GYMRNG V++A+K+FD+MP+RD +SWTA+I GFVKKG  EEAL WFREMQ+SGV+PDYV 
Sbjct: 155 GYMRNGQVDNAVKMFDKMPERDFISWTAMINGFVKKGFHEEALAWFREMQISGVKPDYVA 214

Query: 197 IIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFNNMQ 18
           IIA L+A  NLGA+  G+WVHRYV+ QDFK N++VSNSLID+Y RCGC+EFAR+ F+ M+
Sbjct: 215 IIAALNACTNLGALSFGLWVHRYVMSQDFKNNVKVSNSLIDLYCRCGCVEFAREVFDKME 274

Query: 17  KRSLV 3
           KR++V
Sbjct: 275 KRTVV 279



 Score =  113 bits (283), Expect = 9e-23
 Identities = 79/237 (33%), Positives = 119/237 (50%), Gaps = 6/237 (2%)
 Frame = -3

Query: 713 VSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFGSSIH 534
           +SWT+ I    + G   +A A F  M++SGV+P++V  +  L+AC +  +  L FG  +H
Sbjct: 178 ISWTAMINGFVKKGFHEEALAWFREMQISGVKPDYVAIIAALNACTNLGA--LSFGLWVH 235

Query: 533 GYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMRNGDV 354
            Y+    F KNNV +                                N++ID Y R G V
Sbjct: 236 RYVMSQDF-KNNVKVS-------------------------------NSLIDLYCRCGCV 263

Query: 353 EDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSAI 174
           E A ++FD+M KR  VSW ++I GF   G+  E+L +FR+MQ  G +PD VT    L+A 
Sbjct: 264 EFAREVFDKMEKRTVVSWNSVIVGFAANGNAHESLVYFRKMQEEGFKPDAVTFTGALTAC 323

Query: 173 ANLGAIGLGIWVHRYVLQQDFKENIRVS------NSLIDMYSRCGCIEFARQEFNNM 21
           +++G +  G+   RY   Q  K N R+S        L+D+YSR G +E A +   +M
Sbjct: 324 SHVGLVEEGL---RYF--QTMKRNHRISPRIEHYGCLVDLYSRAGRLEEALKVVQSM 375



 Score = 60.8 bits (146), Expect = 7e-07
 Identities = 37/125 (29%), Positives = 66/125 (52%), Gaps = 3/125 (2%)
 Frame = -3

Query: 368 RNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIA 189
           R    +  I+  +Q      VSWT+ I    + G   EA + F  M+L+GVEP+++T IA
Sbjct: 23  RKHHADPKIQKLNQSTSETIVSWTSRITLLTRNGRLAEAAKEFSNMRLAGVEPNHITFIA 82

Query: 188 VLSAIANL--GAIGLGIWVHRYVLQQDF-KENIRVSNSLIDMYSRCGCIEFARQEFNNMQ 18
           +LS   +   G+  LG  +H Y  +    + ++ V  +++ MYS+   ++ AR  F+ M+
Sbjct: 83  LLSGCGDFSSGSEALGDLLHGYACKLGLDRTHVMVGTAILGMYSKRSRVKKARLVFDYME 142

Query: 17  KRSLV 3
            ++ V
Sbjct: 143 DKNSV 147


>gb|AEP33755.1| chloroplast biogenesis 19, partial [Olimarabidopsis pumila]
          Length = 475

 Score =  319 bits (818), Expect = 9e-85
 Identities = 152/245 (62%), Positives = 190/245 (77%), Gaps = 1/245 (0%)
 Frame = -3

Query: 734 DQSIDHT-VSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKS 558
           +QS   T VSWTS I    R+  L++AA EF  MRL+GVEP H+TF+ LLS C DFPS S
Sbjct: 4   NQSTSETIVSWTSRITLLTRSAXLAEAAKEFADMRLAGVEPTHITFIALLSGCGDFPSGS 63

Query: 557 LRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMID 378
              G  +HGY  KLG D+N+V +GT+++GMYSK   V  AR VFD M  KNS++WNTMID
Sbjct: 64  ETLGDLLHGYACKLGLDRNHVMVGTAILGMYSKRGRVKKARLVFDYMDDKNSVTWNTMID 123

Query: 377 GYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVT 198
           GYMR+G V +A+KLFD+MP+ D +SWTA++ GFVKKG  EEAL WFREMQ+SGV+PDYV 
Sbjct: 124 GYMRSGQVHNAVKLFDKMPEPDLISWTAMVNGFVKKGFHEEALVWFREMQISGVKPDYVA 183

Query: 197 IIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIEFARQEFNNMQ 18
           IIA L+A  NLGA+ LG+WVHRYV+ QDFK N+RVSNSLID+Y RCGC+EFAR+ F+ M+
Sbjct: 184 IIAALNACTNLGALSLGLWVHRYVMSQDFKNNVRVSNSLIDLYCRCGCVEFAREVFDKME 243

Query: 17  KRSLV 3
           KR++V
Sbjct: 244 KRTVV 248



 Score =  106 bits (265), Expect = 1e-20
 Identities = 71/233 (30%), Positives = 120/233 (51%), Gaps = 2/233 (0%)
 Frame = -3

Query: 713 VSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACADFPSKSLRFGSSIH 534
           +SWT+ +    + G   +A   F  M++SGV+P++V  +  L+AC +  + SL  G  +H
Sbjct: 147 ISWTAMVNGFVKKGFHEEALVWFREMQISGVKPDYVAIIAALNACTNLGALSL--GLWVH 204

Query: 533 GYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSWNTMIDGYMRNGDV 354
            Y+    F KNNV +                                N++ID Y R G V
Sbjct: 205 RYVMSQDF-KNNVRVS-------------------------------NSLIDLYCRCGCV 232

Query: 353 EDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIAVLSAI 174
           E A ++FD+M KR  VSW ++I GF   G+  E+L +FR+MQ  G +P+ VT    L+A 
Sbjct: 233 EFAREVFDKMEKRTVVSWNSVIVGFAANGNAHESLVYFRKMQEEGFKPNAVTFTGALTAC 292

Query: 173 ANLGAIGLGIWVHRYVLQQDFKENIRVSN--SLIDMYSRCGCIEFARQEFNNM 21
           +++G +  G+   +  +++D+  + R+ +   L+D+YSR G +E A +   +M
Sbjct: 293 SHVGLVDEGLRFFQ-SMKRDYNISPRIEHYGCLVDLYSRAGRLEDALKVVQSM 344


>gb|AEP33750.1| chloroplast biogenesis 19, partial [Lepidium sativum]
          Length = 494

 Score =  319 bits (818), Expect = 9e-85
 Identities = 149/255 (58%), Positives = 194/255 (76%)
 Frame = -3

Query: 767 NKTQLSVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLL 588
           N     ++  +  + +  VSWTS I    R+  L++AA EF  MRL+G+EPNH+TF++LL
Sbjct: 16  NNVNPKIQKLNQSTSETIVSWTSRITLLSRDDRLAEAAREFSDMRLAGIEPNHITFISLL 75

Query: 587 SACADFPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVK 408
           SAC +FPS S      +HGY  KLG D+++V +GT+++GMYSK  HV  AR VFD M  K
Sbjct: 76  SACGNFPSGSEALSDLLHGYACKLGLDRSHVMVGTAILGMYSKRGHVRKARLVFDYMEDK 135

Query: 407 NSMSWNTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQ 228
           NS++WNTMIDGYMRNG V++A+K+FD+MP+RD +SWTA+I GFVKKG  EEAL WFREMQ
Sbjct: 136 NSVTWNTMIDGYMRNGQVDNAVKVFDEMPERDLISWTAMITGFVKKGFHEEALAWFREMQ 195

Query: 227 LSGVEPDYVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSNSLIDMYSRCGCIE 48
           +SGV PDYV IIA L+A  NLGA+  G+W HRYV+ QDF+ N+RVSNSLID+Y RCGC+E
Sbjct: 196 ISGVNPDYVAIIAALAACTNLGALSFGLWAHRYVVSQDFRNNVRVSNSLIDLYCRCGCVE 255

Query: 47  FARQEFNNMQKRSLV 3
           FARQ F+ M+KR++V
Sbjct: 256 FARQVFDTMEKRTVV 270



 Score =  108 bits (270), Expect = 3e-21
 Identities = 75/246 (30%), Positives = 124/246 (50%), Gaps = 2/246 (0%)
 Frame = -3

Query: 752 SVRSNDDQSIDHTVSWTSSIARRCRNGMLSDAAAEFVRMRLSGVEPNHVTFVTLLSACAD 573
           +V+  D+      +SWT+ I    + G   +A A F  M++SGV P++V  +  L+AC +
Sbjct: 156 AVKVFDEMPERDLISWTAMITGFVKKGFHEEALAWFREMQISGVNPDYVAIIAALAACTN 215

Query: 572 FPSKSLRFGSSIHGYIRKLGFDKNNVSLGTSLVGMYSKCAHVDLARHVFDEMCVKNSMSW 393
             +  L FG   H Y+    F +NNV +                                
Sbjct: 216 LGA--LSFGLWAHRYVVSQDF-RNNVRVS------------------------------- 241

Query: 392 NTMIDGYMRNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVE 213
           N++ID Y R G VE A ++FD M KR  VSW ++I GF   G+  E+L +FR+MQ  G +
Sbjct: 242 NSLIDLYCRCGCVEFARQVFDTMEKRTVVSWNSVIVGFAANGNANESLVYFRKMQEEGFK 301

Query: 212 PDYVTIIAVLSAIANLGAIGLGIWVHRYVLQQDFKENIRVSN--SLIDMYSRCGCIEFAR 39
           PD VT    L+A +++G +  G + +  +++ D++ + R+ +   L+D+YSR G +E A 
Sbjct: 302 PDAVTFTGALTACSHVGLVEEG-FQYFQMMKTDYRISPRIEHFGCLVDLYSRAGRLEDAI 360

Query: 38  QEFNNM 21
           +   +M
Sbjct: 361 KVVESM 366



 Score = 61.6 bits (148), Expect = 4e-07
 Identities = 37/125 (29%), Positives = 66/125 (52%), Gaps = 3/125 (2%)
 Frame = -3

Query: 368 RNGDVEDAIKLFDQMPKRDKVSWTALIGGFVKKGHFEEALEWFREMQLSGVEPDYVTIIA 189
           R  +V   I+  +Q      VSWT+ I    +     EA   F +M+L+G+EP+++T I+
Sbjct: 14  RKNNVNPKIQKLNQSTSETIVSWTSRITLLSRDDRLAEAAREFSDMRLAGIEPNHITFIS 73

Query: 188 VLSAIANL--GAIGLGIWVHRYVLQQDF-KENIRVSNSLIDMYSRCGCIEFARQEFNNMQ 18
           +LSA  N   G+  L   +H Y  +    + ++ V  +++ MYS+ G +  AR  F+ M+
Sbjct: 74  LLSACGNFPSGSEALSDLLHGYACKLGLDRSHVMVGTAILGMYSKRGHVRKARLVFDYME 133

Query: 17  KRSLV 3
            ++ V
Sbjct: 134 DKNSV 138

