BLASTX nr result

ID: Cinnamomum24_contig00013895 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum24_contig00013895
         (826 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267829.1| PREDICTED: pentatricopeptide repeat-containi...   250   7e-64
ref|XP_012076004.1| PREDICTED: pentatricopeptide repeat-containi...   249   1e-63
ref|XP_010269888.1| PREDICTED: pentatricopeptide repeat-containi...   238   5e-60
ref|XP_002523226.1| pentatricopeptide repeat-containing protein,...   231   6e-58
ref|XP_007046316.1| Pentatricopeptide repeat (PPR) superfamily p...   222   2e-55
ref|XP_012438652.1| PREDICTED: pentatricopeptide repeat-containi...   220   8e-55
gb|KHG06568.1| Pentatricopeptide repeat-containing -like protein...   214   4e-53
ref|XP_010908381.1| PREDICTED: pentatricopeptide repeat-containi...   211   6e-52
ref|XP_010450093.1| PREDICTED: pentatricopeptide repeat-containi...   210   8e-52
ref|XP_006283664.1| hypothetical protein CARUB_v10004721mg [Caps...   209   1e-51
ref|NP_193153.3| pentatricopeptide repeat-containing protein [Ar...   209   2e-51
gb|AAU94392.1| At4g14170 [Arabidopsis thaliana] gi|55733761|gb|A...   209   2e-51
ref|XP_008788281.1| PREDICTED: pentatricopeptide repeat-containi...   208   3e-51
ref|XP_002868306.1| pentatricopeptide repeat-containing protein ...   205   3e-50
ref|XP_010440520.1| PREDICTED: pentatricopeptide repeat-containi...   205   3e-50
emb|CDY37924.1| BnaA06g18930D [Brassica napus]                        205   3e-50
emb|CDY56890.1| BnaC03g75290D [Brassica napus]                        203   1e-49
ref|XP_006414782.1| hypothetical protein EUTSA_v10027059mg [Eutr...   203   1e-49
ref|XP_009150190.1| PREDICTED: pentatricopeptide repeat-containi...   201   4e-49
ref|XP_011620751.1| PREDICTED: pentatricopeptide repeat-containi...   185   3e-44

>ref|XP_002267829.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14170
           [Vitis vinifera]
          Length = 455

 Score =  250 bits (639), Expect = 7e-64
 Identities = 132/233 (56%), Positives = 156/233 (66%)
 Frame = -2

Query: 702 PKHRPNXXXXXXXXXXXSPNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKLLPHS 523
           PK + N           SPN +         LR  LY +V+LSSKL+LMYS+  KL PHS
Sbjct: 4   PKPKSNLVSSYFALLHSSPNPTHLRHLHARLLRTSLYDNVILSSKLLLMYSQLGKLSPHS 63

Query: 522 FSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRACAC 343
            SVFLHMPHRN +SWNIIIGEFSRS LPH S+ LF++M R     PD F LPLVLRACA 
Sbjct: 64  LSVFLHMPHRNIYSWNIIIGEFSRSHLPHKSIDLFLQM-RHFNQPPDVFTLPLVLRACAA 122

Query: 342 LGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSM 163
            G  ++G SVHG C+++G EK+                V DAR +FDEM +RDAVLWT+M
Sbjct: 123 SGSVKLGVSVHGLCVEMGMEKSLFVASALVFMYVTFGKVLDARVLFDEMPERDAVLWTAM 182

Query: 162 LGGYAQNGEPVLALEVFREMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVHG 4
           L GYAQ+ EP+LAL VFR+MV AG+ LDGVVM+SLLL C QLGW K GKSVHG
Sbjct: 183 LAGYAQHEEPMLALSVFRQMVSAGVALDGVVMISLLLACGQLGWLKHGKSVHG 235



 Score = 76.6 bits (187), Expect = 2e-11
 Identities = 52/193 (26%), Positives = 93/193 (48%), Gaps = 4/193 (2%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G+   + ++S L+ MY    K+L     +F  MP R+   W  ++  +++   P ++L +
Sbjct: 140 GMEKSLFVASALVFMYVTFGKVLDARV-LFDEMPERDAVLWTAMLAGYAQHEEPMLALSV 198

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGF----CLKLGFEKNXXXXXXXX 253
           F +M+ +G +      + L+L AC  LG  + G SVHG+    CL LG            
Sbjct: 199 FRQMVSAGVALDGVVMISLLL-ACGQLGWLKHGKSVHGWITRRCLALGLNLGNALVYFYV 257

Query: 252 XXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGV 73
                      +  +FD+M +RD + W+S++ GY  +G   +AL++F  M  AG++ + V
Sbjct: 258 KCAALGY----SYNLFDKMPERDVISWSSIILGYGLSGNVDIALDLFDRMRVAGVKPNDV 313

Query: 72  VMVSLLLVCSQLG 34
             +  L  C+  G
Sbjct: 314 TFLGALSACTHTG 326


>ref|XP_012076004.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14170
           [Jatropha curcas]
          Length = 496

 Score =  249 bits (637), Expect = 1e-63
 Identities = 132/260 (50%), Positives = 164/260 (63%)
 Frame = -2

Query: 783 MHVRLISTIHLPKRLYQKLFFLSYSTTPKHRPNXXXXXXXXXXXSPNASXXXXXXXXXLR 604
           MH+ L+        L+ KLFF + ++    + +           SPN +         LR
Sbjct: 1   MHIALLFLARKESLLFNKLFFSTTTSINPSQTSLISHYFSLLHSSPNPTHLRHLHARLLR 60

Query: 603 NGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQ 424
             LY +V+LSSKL+LMYS H KL+PHS SVF HMP+RN FSWNII+GEFSRS  P  S+ 
Sbjct: 61  TSLYDNVILSSKLVLMYSCHGKLIPHSLSVFFHMPYRNIFSWNIIMGEFSRSDFPEKSID 120

Query: 423 LFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXX 244
           LF++M R     PD F+ PLVLRACA  G  E+G SVH  CL++G+  +           
Sbjct: 121 LFLQMRRESDVRPDDFSFPLVLRACAGSGMAELGTSVHALCLRMGWSVSLFVASALVFMY 180

Query: 243 XXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMV 64
                + +AR +FDEMS RDAVLWT+ML GYAQ+GEPVL L+VF EMV  GI+LDGVVMV
Sbjct: 181 VTFGNLFNARMLFDEMSKRDAVLWTAMLAGYAQHGEPVLGLQVFEEMVNLGIKLDGVVMV 240

Query: 63  SLLLVCSQLGWPKPGKSVHG 4
           SLLLV  +LGW K GKSVHG
Sbjct: 241 SLLLVFGRLGWLKQGKSVHG 260



 Score = 77.8 bits (190), Expect = 8e-12
 Identities = 56/193 (29%), Positives = 93/193 (48%), Gaps = 2/193 (1%)
 Frame = -2

Query: 606 RNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSL 427
           R G    + ++S L+ MY     L  ++  +F  M  R+   W  ++  +++ G P + L
Sbjct: 163 RMGWSVSLFVASALVFMYVTFGNLF-NARMLFDEMSKRDAVLWTAMLAGYAQHGEPVLGL 221

Query: 426 QLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLK--LGFEKNXXXXXXXX 253
           Q+F EM+  G        + L+L     LG  + G SVHG+C++  LG E +        
Sbjct: 222 QVFEEMVNLGIKLDGVVMVSLLL-VFGRLGWLKQGKSVHGWCVRNCLGLELSLGNAIVDV 280

Query: 252 XXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGV 73
                      A +VFD MS+RD V W+S++ GY  +G   +ALE+F +M   G+  + V
Sbjct: 281 YVKCAILGY--AHRVFDRMSERDVVSWSSLILGYGLSGNVSVALELFDQMHLRGVRPNDV 338

Query: 72  VMVSLLLVCSQLG 34
             + +L  C+  G
Sbjct: 339 TFLGVLSACAHGG 351


>ref|XP_010269888.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14170
           [Nelumbo nucifera]
          Length = 485

 Score =  238 bits (606), Expect = 5e-60
 Identities = 131/248 (52%), Positives = 159/248 (64%)
 Frame = -2

Query: 747 KRLYQKLFFLSYSTTPKHRPNXXXXXXXXXXXSPNASXXXXXXXXXLRNGLYSDVVLSSK 568
           + +Y K +  + ST  K   +           S NA          LR GLY +V+LSSK
Sbjct: 12  RTMYNKFYTTNISTANKDAYDLVSYYFSLLHSSANARHLGSLHTRLLRTGLYGNVILSSK 71

Query: 567 LMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQLFVEMLRSGASA 388
           L++MYSKHN L   S SVFLHMPHRN +SWNIIIGEFSRSG P  +++LFV ++R+    
Sbjct: 72  LVMMYSKHNNL-SCSLSVFLHMPHRNIYSWNIIIGEFSRSGDPEQAIRLFV-LMRNSDVQ 129

Query: 387 PDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXXXXXXVCDARKV 208
           PD F  PLVLRACA  G    GASVHG C ++G E+N                + +AR++
Sbjct: 130 PDVFTFPLVLRACASSGAICWGASVHGLCARMGMERNVFVASALVFFYVTLARILEARRL 189

Query: 207 FDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVSLLLVCSQLGWP 28
           FDEM  RDAVLWT+ML GYAQ+GEP+L L VFREMVG GI LDGVVMVSLLL+C QLG  
Sbjct: 190 FDEMPQRDAVLWTAMLAGYAQHGEPMLGLAVFREMVGEGIGLDGVVMVSLLLICGQLGLL 249

Query: 27  KPGKSVHG 4
           + GKSVHG
Sbjct: 250 RHGKSVHG 257



 Score = 78.6 bits (192), Expect = 5e-12
 Identities = 55/195 (28%), Positives = 91/195 (46%), Gaps = 4/195 (2%)
 Frame = -2

Query: 606 RNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSL 427
           R G+  +V ++S L+  Y    ++L  +  +F  MP R+   W  ++  +++ G P + L
Sbjct: 160 RMGMERNVFVASALVFFYVTLARIL-EARRLFDEMPQRDAVLWTAMLAGYAQHGEPMLGL 218

Query: 426 QLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGF----CLKLGFEKNXXXXXX 259
            +F EM+  G        + L+L  C  LG    G SVHG+    CL LG          
Sbjct: 219 AVFREMVGEGIGLDGVVMVSLLL-ICGQLGLLRHGKSVHGWIIRRCLCLGLSLGNALMDM 277

Query: 258 XXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELD 79
                        A ++F +M +RD + W+SM+ GY  NG   +A E+F  MV  G++ +
Sbjct: 278 YVKSAVLSY----AHRIFHKMPERDVISWSSMILGYGLNGGVQIAFELFDRMVMEGVKPN 333

Query: 78  GVVMVSLLLVCSQLG 34
            V  + +L  C+  G
Sbjct: 334 DVTFLGVLSACAHSG 348


>ref|XP_002523226.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223537522|gb|EEF39147.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 488

 Score =  231 bits (588), Expect = 6e-58
 Identities = 128/249 (51%), Positives = 158/249 (63%), Gaps = 2/249 (0%)
 Frame = -2

Query: 744 RLYQKLFFLSYSTTPKHRPNXXXXXXXXXXXSPNASXXXXXXXXXLRNGLYSDVVLSSKL 565
           RL+ KLF  +++  P                SP+ +         LR  LY +VVLSSKL
Sbjct: 17  RLFNKLF-TTWTAPPPRATCLISHYFSLLHSSPDLTHLRHLHARLLRTSLYDNVVLSSKL 75

Query: 564 MLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQLFVEMLRSGASAP 385
           +LMYS HN+L PHS S F HMP +N +SWNIIIGEF+RS LP  S+ LF++M R     P
Sbjct: 76  VLMYSHHNRLTPHSLSTFFHMPCKNIYSWNIIIGEFARSNLPEKSVDLFIDMRRDSHFQP 135

Query: 384 DPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXXXXXXVCDARKVF 205
           D F+LPLVLRACA  G G++G+SVHG C+K+G   +                V +AR VF
Sbjct: 136 DDFSLPLVLRACAGSGLGKLGSSVHGLCVKMGLAVSLFVGSALVFMYVTFGNVSNARVVF 195

Query: 204 DEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMV--GAGIELDGVVMVSLLLVCSQLGW 31
           DEM+ RD+VLWT++L GYAQNGEP L L+VFREMV     ++LD VVMVSLLLVC QLG 
Sbjct: 196 DEMAKRDSVLWTALLSGYAQNGEPKLGLQVFREMVDNSTRVKLDWVVMVSLLLVCGQLGS 255

Query: 30  PKPGKSVHG 4
            K GKSVHG
Sbjct: 256 LKHGKSVHG 264



 Score = 75.1 bits (183), Expect = 5e-11
 Identities = 51/197 (25%), Positives = 92/197 (46%), Gaps = 1/197 (0%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           GL   + + S L+ MY     +  ++  VF  M  R++  W  ++  ++++G P + LQ+
Sbjct: 167 GLAVSLFVGSALVFMYVTFGNV-SNARVVFDEMAKRDSVLWTALLSGYAQNGEPKLGLQV 225

Query: 420 FVEMLRSGASAP-DPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXX 244
           F EM+ +      D   +  +L  C  LG  + G SVHG+C++                 
Sbjct: 226 FREMVDNSTRVKLDWVVMVSLLLVCGQLGSLKHGKSVHGWCVRNCLRLELSLGNAIVHMY 285

Query: 243 XXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMV 64
                +  A + FD+M +RD   W+S++ GY  +G   +AL +F +M   GI+ + V  +
Sbjct: 286 IKCGMLAYAHRFFDKMPERDVFSWSSLILGYGLSGNVSVALCLFDQMHMRGIKPNDVTFL 345

Query: 63  SLLLVCSQLGWPKPGKS 13
            +L  C   G  +  +S
Sbjct: 346 GILSACGHGGLVEQARS 362


>ref|XP_007046316.1| Pentatricopeptide repeat (PPR) superfamily protein, putative
           [Theobroma cacao] gi|508710251|gb|EOY02148.1|
           Pentatricopeptide repeat (PPR) superfamily protein,
           putative [Theobroma cacao]
          Length = 478

 Score =  222 bits (566), Expect = 2e-55
 Identities = 122/248 (49%), Positives = 149/248 (60%)
 Frame = -2

Query: 747 KRLYQKLFFLSYSTTPKHRPNXXXXXXXXXXXSPNASXXXXXXXXXLRNGLYSDVVLSSK 568
           K L+ KL+     + P +  N           SPN           LR  LY DV+LSSK
Sbjct: 2   KTLFNKLYSTLTCSNPTNL-NIVSYYFSLVNSSPNCKHLRHLHARLLRTSLYDDVILSSK 60

Query: 567 LMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQLFVEMLRSGASA 388
           L+L YS+HNKL   S SVF HMP +N +SWNIIIGEFSRS  P  ++ LF+ M +S    
Sbjct: 61  LVLAYSQHNKLTSDSLSVFFHMPQKNIYSWNIIIGEFSRSNFPLKAIDLFLRMWQSSDVR 120

Query: 387 PDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXXXXXXVCDARKV 208
           PD F LPLVLR+C   G  E+  S HG C+K+G E +                V DAR +
Sbjct: 121 PDDFTLPLVLRSCVSCGLVELAVSFHGLCVKMGLESSLFVASALVFLYVSSGKVYDARVL 180

Query: 207 FDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVSLLLVCSQLGWP 28
           FD M  +DAVLWT+ML GYA++ EP+L LE+FREMV AG+  D VVM+SLLLVC QLGW 
Sbjct: 181 FDGMPKKDAVLWTAMLDGYAKHEEPMLGLELFREMVDAGVAPDWVVMLSLLLVCGQLGWL 240

Query: 27  KPGKSVHG 4
           K GKSVHG
Sbjct: 241 KQGKSVHG 248



 Score = 83.6 bits (205), Expect = 1e-13
 Identities = 55/191 (28%), Positives = 95/191 (49%), Gaps = 2/191 (1%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           GL S + ++S L+ +Y    K+      +F  MP ++   W  ++  +++   P + L+L
Sbjct: 153 GLESSLFVASALVFLYVSSGKVYDARV-LFDGMPKKDAVLWTAMLDGYAKHEEPMLGLEL 211

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLK--LGFEKNXXXXXXXXXX 247
           F EM+ +G  APD   +  +L  C  LG  + G SVHG+C++  LG E N          
Sbjct: 212 FREMVDAGV-APDWVVMLSLLLVCGQLGWLKQGKSVHGWCVRRCLGMELNLGNAIVDMYL 270

Query: 246 XXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVM 67
                    A +VF+ M+ RD + W+S++ GY  +G   +A  +F  MV  G++ + V  
Sbjct: 271 KCATLAY--AHRVFNMMNQRDVISWSSLILGYGLSGNVSIAFRLFDNMVAKGVKPNQVTF 328

Query: 66  VSLLLVCSQLG 34
           + +L  C+  G
Sbjct: 329 LGILSACAHGG 339


>ref|XP_012438652.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14170
           [Gossypium raimondii] gi|763783695|gb|KJB50766.1|
           hypothetical protein B456_008G186600 [Gossypium
           raimondii]
          Length = 479

 Score =  220 bits (561), Expect = 8e-55
 Identities = 118/248 (47%), Positives = 151/248 (60%)
 Frame = -2

Query: 747 KRLYQKLFFLSYSTTPKHRPNXXXXXXXXXXXSPNASXXXXXXXXXLRNGLYSDVVLSSK 568
           K L +KL++ + + +     N           SPN           LR  LY +VVLSS+
Sbjct: 2   KTLLKKLYYSTLACSGPTNSNLISCYFSLINSSPNCKHLRHLHARLLRTSLYDNVVLSSR 61

Query: 567 LMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQLFVEMLRSGASA 388
           L+L YS+H  L+P+S SVF HMP RN +SWNIIIGEFSRS  P  ++ LF+ M +S    
Sbjct: 62  LVLAYSRHKHLVPYSLSVFFHMPQRNIYSWNIIIGEFSRSNSPCQAIHLFLHMWQSSNVR 121

Query: 387 PDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXXXXXXVCDARKV 208
           PD F LPLVLRAC   G  ++  S HG C+KLGFE++                +  AR +
Sbjct: 122 PDDFTLPLVLRACVGCGLVKLAVSFHGLCVKLGFERSPFVASALVFLYVSFGKIFYARVL 181

Query: 207 FDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVSLLLVCSQLGWP 28
           FD M  RDAV+WT+ML GYA++ EP L LE+FREMV AG+  D VVM+SL+L+C QLGW 
Sbjct: 182 FDGMPKRDAVMWTAMLDGYAKHEEPTLGLELFREMVDAGVTPDWVVMLSLVLMCGQLGWL 241

Query: 27  KPGKSVHG 4
           K GKSVHG
Sbjct: 242 KHGKSVHG 249



 Score = 86.3 bits (212), Expect = 2e-14
 Identities = 56/184 (30%), Positives = 94/184 (51%), Gaps = 2/184 (1%)
 Frame = -2

Query: 579 LSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQLFVEMLRS 400
           ++S L+ +Y    K+  ++  +F  MP R+   W  ++  +++   P + L+LF EM+ +
Sbjct: 161 VASALVFLYVSFGKIF-YARVLFDGMPKRDAVMWTAMLDGYAKHEEPTLGLELFREMVDA 219

Query: 399 GASAPDPFNLPLVLRACACLGDGEMGASVHGFCLK--LGFEKNXXXXXXXXXXXXXXXXV 226
           G +      L LVL  C  LG  + G SVHG+C++  LG E N                 
Sbjct: 220 GVTPDWVVMLSLVLM-CGQLGWLKHGKSVHGWCVRRWLGMELNLGNAIVDMYLKCAMLTY 278

Query: 225 CDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVSLLLVC 46
             A +VFD M+ RD + W+S++ GY  +G    ALE+F +M+  GI+ + V  + +L  C
Sbjct: 279 --AHRVFDMMNQRDVISWSSLILGYGLSGNVTTALELFEDMIAKGIKPNEVTFLGILSAC 336

Query: 45  SQLG 34
           +  G
Sbjct: 337 AHGG 340


>gb|KHG06568.1| Pentatricopeptide repeat-containing -like protein [Gossypium
           arboreum]
          Length = 478

 Score =  214 bits (546), Expect = 4e-53
 Identities = 112/215 (52%), Positives = 139/215 (64%)
 Frame = -2

Query: 648 PNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNII 469
           PN           LR  LY +VVLSS+L+L YS++  L+P+S SVF HMP RN +SWNII
Sbjct: 34  PNCKHLRHLHARLLRTSLYDNVVLSSRLVLAYSRYKHLVPYSLSVFFHMPQRNIYSWNII 93

Query: 468 IGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLG 289
           IGEFSRS  P  ++ LF+ M +S    PD F LPLVLRAC   G  ++  S HG CLKLG
Sbjct: 94  IGEFSRSNSPLQAIHLFLHMWQSSNVRPDDFTLPLVLRACVGCGLLQLAVSFHGLCLKLG 153

Query: 288 FEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFR 109
           FE++                +  AR +FD M  RDAV+WT+ML GYA++ EP L LE+FR
Sbjct: 154 FERSPFVASALVFLYASFGKIFYARVLFDGMPKRDAVMWTAMLDGYAKHEEPTLGLELFR 213

Query: 108 EMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVHG 4
           EMV AG+  D VVM+SL+L+C QLGW K GKSVHG
Sbjct: 214 EMVVAGVTPDWVVMLSLVLMCGQLGWLKHGKSVHG 248



 Score = 84.7 bits (208), Expect = 6e-14
 Identities = 55/184 (29%), Positives = 94/184 (51%), Gaps = 2/184 (1%)
 Frame = -2

Query: 579 LSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQLFVEMLRS 400
           ++S L+ +Y+   K+  ++  +F  MP R+   W  ++  +++   P + L+LF EM+ +
Sbjct: 160 VASALVFLYASFGKIF-YARVLFDGMPKRDAVMWTAMLDGYAKHEEPTLGLELFREMVVA 218

Query: 399 GASAPDPFNLPLVLRACACLGDGEMGASVHGFCLK--LGFEKNXXXXXXXXXXXXXXXXV 226
           G +      L LVL  C  LG  + G SVHG+C++  LG E N                 
Sbjct: 219 GVTPDWVVMLSLVLM-CGQLGWLKHGKSVHGWCVRRWLGMELNLGNAIVDMYLKCAMLTY 277

Query: 225 CDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVSLLLVC 46
             A +VFD M+  D + W+S++ GY  +G    ALE+F +M+  GI+ + V  + +L  C
Sbjct: 278 --AHRVFDMMNQTDVISWSSLILGYGLSGNVTTALELFEDMIAKGIKPNEVTFLGILSAC 335

Query: 45  SQLG 34
           +  G
Sbjct: 336 AHGG 339


>ref|XP_010908381.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14170
           [Elaeis guineensis]
          Length = 488

 Score =  211 bits (536), Expect = 6e-52
 Identities = 119/253 (47%), Positives = 148/253 (58%), Gaps = 2/253 (0%)
 Frame = -2

Query: 756 HLPKRLYQKLFFLSYSTTPKHRPNXXXXXXXXXXXS--PNASXXXXXXXXXLRNGLYSDV 583
           HL KR +  +      TT  H+PN              P+           LR GLY+D 
Sbjct: 14  HLTKRCHNTI------TTTPHQPNSNLHSLYFHFLHSSPSLHHLCHLHARLLRTGLYTDA 67

Query: 582 VLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQLFVEMLR 403
           +LS+KL+L YS+   LLP + S+FLHMP R ++SWNI+I E +R GLPH S+  F+ M +
Sbjct: 68  ILSTKLLLTYSQRYCLLPTALSIFLHMPCRTSYSWNILITELARFGLPHKSMDFFLRM-Q 126

Query: 402 SGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXXXXXXVC 223
           S +   D F LP VLR+CA LG      SVH   +KLG + N                + 
Sbjct: 127 SSSIPVDEFTLPPVLRSCALLGSSPASMSVHALAVKLGLDHNLYVGSALVLCYNGLSEIS 186

Query: 222 DARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVSLLLVCS 43
            ARK+FDEM +RDA LWTSML  YAQ+GEP LAL  FREMV  GI+LDGVVMVSLLL C 
Sbjct: 187 LARKLFDEMPERDAALWTSMLSVYAQSGEPELALGFFREMVSEGIQLDGVVMVSLLLACG 246

Query: 42  QLGWPKPGKSVHG 4
           QLGW + G+SVHG
Sbjct: 247 QLGWLRHGRSVHG 259



 Score = 69.3 bits (168), Expect = 3e-09
 Identities = 50/189 (26%), Positives = 86/189 (45%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           GL  ++ + S L+L Y+  +++   +  +F  MP R+   W  ++  +++SG P ++L  
Sbjct: 164 GLDHNLYVGSALVLCYNGLSEI-SLARKLFDEMPERDAALWTSMLSVYAQSGEPELALGF 222

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM+  G        + L+L AC  LG    G SVHG  ++                  
Sbjct: 223 FREMVSEGIQLDGVVMVSLLL-ACGQLGWLRHGRSVHGCSIRRFLGLPLSLGNALIDMYV 281

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
                  A  VF  M   D + W++++ G+  NG    AL++F EM   G+E + V  + 
Sbjct: 282 KCGAFGYAETVFSMMPATDVISWSALILGHGLNGHASAALKLFDEMSEEGMEPNSVTFLG 341

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 342 VLSACAHAG 350


>ref|XP_010450093.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14170
           [Camelina sativa]
          Length = 479

 Score =  210 bits (535), Expect = 8e-52
 Identities = 117/216 (54%), Positives = 137/216 (63%), Gaps = 1/216 (0%)
 Frame = -2

Query: 648 PNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKL-LPHSFSVFLHMPHRNTFSWNI 472
           PNA          LR   YSDVVLSSKL+L YSK N+L LP S SVF HMP RN FSWNI
Sbjct: 46  PNAKHLRHLHAHLLRTSTYSDVVLSSKLVLAYSKLNRLFLPTSLSVFWHMPCRNIFSWNI 105

Query: 471 IIGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKL 292
           IIGEFSRSG   +S+ LF+ M R  +  PD F LPLVLRAC+   + ++G  VH  CLKL
Sbjct: 106 IIGEFSRSGFASISIGLFLRMWRESSVRPDDFTLPLVLRACSASREAKLGDLVHVLCLKL 165

Query: 291 GFEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVF 112
           GF  +                +  ARK+FD+M  RD+VL+T+M GGY Q GE +L L VF
Sbjct: 166 GFNASLFVSSALVIMYVDMGQILHARKLFDDMPVRDSVLYTAMFGGYVQQGEAILGLVVF 225

Query: 111 REMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVHG 4
           REM+ +G  LD VVMVSLL  C QLG  K GKSVHG
Sbjct: 226 REMMYSGFALDSVVMVSLLTACGQLGALKHGKSVHG 261



 Score = 75.9 bits (185), Expect = 3e-11
 Identities = 54/189 (28%), Positives = 95/189 (50%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G  + + +SS L++MY    ++L H+  +F  MP R++  +  + G + + G   + L +
Sbjct: 166 GFNASLFVSSALVIMYVDMGQIL-HARKLFDDMPVRDSVLYTAMFGGYVQQGEAILGLVV 224

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM+ SG  A D   +  +L AC  LG  + G SVHG+ ++                  
Sbjct: 225 FREMMYSGF-ALDSVVMVSLLTACGQLGALKHGKSVHGWSIRRCSCFGLNLGNAITDMYV 283

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
               +  A  VF +M +RD + W+S++ GY  +G  V+A+++F EM+   IE + V  + 
Sbjct: 284 KCSNLDYAYGVFVKMPNRDVISWSSLILGYGLDGNVVMAIKLFEEMLKERIEPNAVTFLG 343

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 344 VLSACAHGG 352


>ref|XP_006283664.1| hypothetical protein CARUB_v10004721mg [Capsella rubella]
           gi|482552369|gb|EOA16562.1| hypothetical protein
           CARUB_v10004721mg [Capsella rubella]
          Length = 477

 Score =  209 bits (533), Expect = 1e-51
 Identities = 113/214 (52%), Positives = 136/214 (63%)
 Frame = -2

Query: 648 PNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNII 469
           PNA          LR  LYS+VVLSSKL+L YSK N+L P S SVF HMP RN FSWNII
Sbjct: 45  PNAKHLRHLHAHLLRTSLYSNVVLSSKLVLAYSKLNRLFPTSLSVFWHMPCRNIFSWNII 104

Query: 468 IGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLG 289
           IGEFSRSG   +S+ LF+ M R     PD F LPLVLRAC+   + ++G  +H  CLKLG
Sbjct: 105 IGEFSRSGFASVSIDLFLRMWRESYVRPDDFTLPLVLRACSASREAKLGDLIHVLCLKLG 164

Query: 288 FEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFR 109
           F  +                + +ARK+FD+M  RD+VL+T+M GGY Q GE +L L +FR
Sbjct: 165 FNASLFVSSALVIMYVDMGEIINARKLFDDMPLRDSVLYTAMFGGYVQQGEALLGLAMFR 224

Query: 108 EMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVH 7
           EM  +G  LD VVMVSLL+ C QLG  K GKSVH
Sbjct: 225 EMTYSGFVLDSVVMVSLLMACGQLGALKHGKSVH 258



 Score = 67.0 bits (162), Expect = 1e-08
 Identities = 49/189 (25%), Positives = 92/189 (48%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G  + + +SS L++MY    +++ ++  +F  MP R++  +  + G + + G   + L +
Sbjct: 164 GFNASLFVSSALVIMYVDMGEII-NARKLFDDMPLRDSVLYTAMFGGYVQQGEALLGLAM 222

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM  SG    D   +  +L AC  LG  + G SVH + ++                  
Sbjct: 223 FREMTYSGFVL-DSVVMVSLLMACGQLGALKHGKSVHAWSIRRCSCFGLNLGNAIIDMYV 281

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
               +  A++VF  M DRD + W+S++ GY  +G+  +A+++F EM+   I  + V  + 
Sbjct: 282 KCSKLDYAQEVFVNMPDRDVISWSSLILGYGLDGDVFMAIKLFDEMLQERIIPNAVTFLG 341

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 342 VLSACAHGG 350


>ref|NP_193153.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|223635637|sp|Q5XEY7.2|PP309_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g14170 gi|332657989|gb|AEE83389.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 477

 Score =  209 bits (532), Expect = 2e-51
 Identities = 113/215 (52%), Positives = 134/215 (62%)
 Frame = -2

Query: 648 PNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNII 469
           PNA          LR  LYS+VVLSSKL+L YSK N L P S SVF HMP+RN FSWNII
Sbjct: 45  PNAKHLRHLHAHLLRTFLYSNVVLSSKLVLAYSKLNHLFPTSLSVFWHMPYRNIFSWNII 104

Query: 468 IGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLG 289
           IGEFSRSG    S+ LF+ M R     PD F LPL+LRAC+   + + G  +H  CLKLG
Sbjct: 105 IGEFSRSGFASKSIDLFLRMWRESCVRPDDFTLPLILRACSASREAKSGDLIHVLCLKLG 164

Query: 288 FEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFR 109
           F  +                +  ARK+FD+M  RD+VL+T+M GGY Q GE +L L +FR
Sbjct: 165 FSSSLFVSSALVIMYVDMGKLLHARKLFDDMPVRDSVLYTAMFGGYVQQGEAMLGLAMFR 224

Query: 108 EMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVHG 4
           EM  +G  LD VVMVSLL+ C QLG  K GKSVHG
Sbjct: 225 EMGYSGFALDSVVMVSLLMACGQLGALKHGKSVHG 259



 Score = 85.1 bits (209), Expect = 5e-14
 Identities = 59/189 (31%), Positives = 95/189 (50%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G  S + +SS L++MY    KLL H+  +F  MP R++  +  + G + + G   + L +
Sbjct: 164 GFSSSLFVSSALVIMYVDMGKLL-HARKLFDDMPVRDSVLYTAMFGGYVQQGEAMLGLAM 222

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM  SG  A D   +  +L AC  LG  + G SVHG+C++                  
Sbjct: 223 FREMGYSGF-ALDSVVMVSLLMACGQLGALKHGKSVHGWCIRRCSCLGLNLGNAITDMYV 281

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
               +  A  VF  MS RD + W+S++ GY  +G+ V++ ++F EM+  GIE + V  + 
Sbjct: 282 KCSILDYAHTVFVNMSRRDVISWSSLILGYGLDGDVVMSFKLFDEMLKEGIEPNAVTFLG 341

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 342 VLSACAHGG 350


>gb|AAU94392.1| At4g14170 [Arabidopsis thaliana] gi|55733761|gb|AAV59277.1|
           At4g14170 [Arabidopsis thaliana]
          Length = 458

 Score =  209 bits (532), Expect = 2e-51
 Identities = 113/215 (52%), Positives = 134/215 (62%)
 Frame = -2

Query: 648 PNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNII 469
           PNA          LR  LYS+VVLSSKL+L YSK N L P S SVF HMP+RN FSWNII
Sbjct: 26  PNAKHLRHLHAHLLRTFLYSNVVLSSKLVLAYSKLNHLFPTSLSVFWHMPYRNIFSWNII 85

Query: 468 IGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLG 289
           IGEFSRSG    S+ LF+ M R     PD F LPL+LRAC+   + + G  +H  CLKLG
Sbjct: 86  IGEFSRSGFASKSIDLFLRMWRESCVRPDDFTLPLILRACSASREAKSGDLIHVLCLKLG 145

Query: 288 FEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFR 109
           F  +                +  ARK+FD+M  RD+VL+T+M GGY Q GE +L L +FR
Sbjct: 146 FSSSLFVSSALVIMYVDMGKLLHARKLFDDMPVRDSVLYTAMFGGYVQQGEAMLGLAMFR 205

Query: 108 EMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVHG 4
           EM  +G  LD VVMVSLL+ C QLG  K GKSVHG
Sbjct: 206 EMGYSGFALDSVVMVSLLMACGQLGALKHGKSVHG 240



 Score = 85.1 bits (209), Expect = 5e-14
 Identities = 59/189 (31%), Positives = 95/189 (50%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G  S + +SS L++MY    KLL H+  +F  MP R++  +  + G + + G   + L +
Sbjct: 145 GFSSSLFVSSALVIMYVDMGKLL-HARKLFDDMPVRDSVLYTAMFGGYVQQGEAMLGLAM 203

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM  SG  A D   +  +L AC  LG  + G SVHG+C++                  
Sbjct: 204 FREMGYSGF-ALDSVVMVSLLMACGQLGALKHGKSVHGWCIRRCSCLGLNLGNAITDMYV 262

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
               +  A  VF  MS RD + W+S++ GY  +G+ V++ ++F EM+  GIE + V  + 
Sbjct: 263 KCSILDYAHTVFVNMSRRDVISWSSLILGYGLDGDVVMSFKLFDEMLKEGIEPNAVTFLG 322

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 323 VLSACAHGG 331


>ref|XP_008788281.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14170
           [Phoenix dactylifera] gi|672129542|ref|XP_008788282.1|
           PREDICTED: pentatricopeptide repeat-containing protein
           At4g14170 [Phoenix dactylifera]
          Length = 461

 Score =  208 bits (530), Expect = 3e-51
 Identities = 108/201 (53%), Positives = 135/201 (67%)
 Frame = -2

Query: 606 RNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSL 427
           R GLY++ +LS+KL+L YS+  +LLP + SVFLHMP R ++SWNI+I E +RSGLPH S+
Sbjct: 33  RTGLYANAILSTKLLLAYSRRCRLLPSALSVFLHMPRRTSYSWNILITELARSGLPHKSM 92

Query: 426 QLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXX 247
             F+ M +S +   D F L  VLR+CA L       SVHG  +KLG E+N          
Sbjct: 93  DFFLRM-QSSSIPIDEFTLSPVLRSCALLDSSPASMSVHGLSVKLGLERNPYVASALVLC 151

Query: 246 XXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVM 67
                 +  AR++FDEM +RDAVLWTSML  YAQ+GEP  AL  FREMV  GI+LDGVVM
Sbjct: 152 YSGLSEISLARRLFDEMPERDAVLWTSMLSVYAQSGEPESALGFFREMVSEGIQLDGVVM 211

Query: 66  VSLLLVCSQLGWPKPGKSVHG 4
           VSLLL C QLGW + G+SVHG
Sbjct: 212 VSLLLACGQLGWLRHGRSVHG 232



 Score = 78.6 bits (192), Expect = 5e-12
 Identities = 54/189 (28%), Positives = 88/189 (46%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           GL  +  ++S L+L YS  +++   +  +F  MP R+   W  ++  +++SG P  +L  
Sbjct: 137 GLERNPYVASALVLCYSGLSEI-SLARRLFDEMPERDAVLWTSMLSVYAQSGEPESALGF 195

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM+  G        + L+L AC  LG    G SVHG C++                  
Sbjct: 196 FREMVSEGIQLDGVVMVSLLL-ACGQLGWLRHGRSVHGCCVRRCLGLPLSLGNALTDLYV 254

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
                  A  VF  M DRD + W++++ G+  NG    AL++F EM   GI+ + V  + 
Sbjct: 255 KCGAFGYAEMVFRMMPDRDVISWSALILGHGLNGHASAALKLFEEMSAEGIQPNSVTFLG 314

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 315 VLSACAHAG 323


>ref|XP_002868306.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297314142|gb|EFH44565.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 458

 Score =  205 bits (522), Expect = 3e-50
 Identities = 110/201 (54%), Positives = 130/201 (64%)
 Frame = -2

Query: 606 RNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSL 427
           R  LYS+VVLSSKL+L YSK N L P S SVF HMP RN FSWNIIIGEFSRSG   +S+
Sbjct: 40  RTSLYSNVVLSSKLVLAYSKMNHLFPTSLSVFWHMPCRNIFSWNIIIGEFSRSGFASISI 99

Query: 426 QLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXX 247
            +F+ M R     PD F LPLVLRAC+   + + G  +H  CLKLGF  +          
Sbjct: 100 GMFLRMWRESNVRPDDFTLPLVLRACSASREAKFGDLIHVLCLKLGFNASLFVRSALVIM 159

Query: 246 XXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVM 67
                 +  ARK+FD+M  RD+VL+T+M GGY Q GE +L L VFREM  +G  LD VVM
Sbjct: 160 YVDLGEILHARKLFDDMPVRDSVLYTAMFGGYVQQGEALLGLAVFREMRYSGFLLDSVVM 219

Query: 66  VSLLLVCSQLGWPKPGKSVHG 4
           VSLL+ C QLG  K GKSVHG
Sbjct: 220 VSLLMACGQLGALKHGKSVHG 240



 Score = 75.9 bits (185), Expect = 3e-11
 Identities = 53/189 (28%), Positives = 93/189 (49%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G  + + + S L++MY    ++L H+  +F  MP R++  +  + G + + G   + L +
Sbjct: 145 GFNASLFVRSALVIMYVDLGEIL-HARKLFDDMPVRDSVLYTAMFGGYVQQGEALLGLAV 203

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM  SG    D   +  +L AC  LG  + G SVHG+C++                  
Sbjct: 204 FREMRYSGFLL-DSVVMVSLLMACGQLGALKHGKSVHGWCIRRCSCFGLNLGNAITDMYV 262

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
               +  A  VF  M  RD + W+S++ GY  +G+ V+++++F EM+  GIE + V  + 
Sbjct: 263 KCSILDYAHTVFVNMPRRDVISWSSLILGYGLDGDVVVSIKLFDEMLQEGIEPNAVTFLG 322

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 323 VLSACAHGG 331


>ref|XP_010440520.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g14170-like [Camelina sativa]
          Length = 479

 Score =  205 bits (521), Expect = 3e-50
 Identities = 114/216 (52%), Positives = 136/216 (62%), Gaps = 1/216 (0%)
 Frame = -2

Query: 648 PNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKL-LPHSFSVFLHMPHRNTFSWNI 472
           PNA          LR   YSDVVL SKL+L YSK N++ LP S SVF HMP RN FSWNI
Sbjct: 46  PNAKHLRHLHAHLLRTSTYSDVVLCSKLVLAYSKLNRIFLPTSLSVFWHMPCRNIFSWNI 105

Query: 471 IIGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKL 292
           IIGEFSRSG   +S+ LF+ M R  +  PD F LPLVLRAC+   + ++G  +H  CLKL
Sbjct: 106 IIGEFSRSGFASVSIGLFLRMWRESSVRPDDFTLPLVLRACSASREEKLGDLIHVLCLKL 165

Query: 291 GFEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVF 112
           GF  +                +  ARK+FD+M  RD+VL+T+M GGY Q GE +L L VF
Sbjct: 166 GFNASLFVSSALVIMYVDMGKILYARKLFDDMPVRDSVLYTAMFGGYVQLGEAILGLAVF 225

Query: 111 REMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVHG 4
           REM+ +G  LD VVMVSLL  C QLG  K GKSVHG
Sbjct: 226 REMMCSGFALDSVVMVSLLTACGQLGALKHGKSVHG 261



 Score = 74.3 bits (181), Expect = 9e-11
 Identities = 54/189 (28%), Positives = 94/189 (49%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G  + + +SS L++MY    K+L ++  +F  MP R++  +  + G + + G   + L +
Sbjct: 166 GFNASLFVSSALVIMYVDMGKIL-YARKLFDDMPVRDSVLYTAMFGGYVQLGEAILGLAV 224

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM+ SG  A D   +  +L AC  LG  + G SVHG+ ++                  
Sbjct: 225 FREMMCSGF-ALDSVVMVSLLTACGQLGALKHGKSVHGWSIRRCSCFGLNLGNAITDMYV 283

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
               +  A  VF +M +RD + W+S++ GY  +G  V+A+ +F EM+   IE + V  + 
Sbjct: 284 KCSNLDYAHGVFVKMPNRDVISWSSLILGYGLDGNVVMAINLFEEMLKEMIEPNAVTFLG 343

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 344 VLSACAHGG 352


>emb|CDY37924.1| BnaA06g18930D [Brassica napus]
          Length = 477

 Score =  205 bits (521), Expect = 3e-50
 Identities = 112/215 (52%), Positives = 132/215 (61%)
 Frame = -2

Query: 648 PNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNII 469
           PN +         LR   YS+VVLSSKL+L YSK N L P S +VF HMPHRN FSWNII
Sbjct: 45  PNVNHLRHLHAHLLRTSSYSNVVLSSKLVLAYSKLNHLFPTSLAVFWHMPHRNIFSWNII 104

Query: 468 IGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLG 289
           IGEFSRSG P +S+ LF+ M    +  PD F LPLVLRAC+   +  +G  VH    KLG
Sbjct: 105 IGEFSRSGFPSISIDLFLRMWGESSVRPDDFTLPLVLRACSASREARLGGLVHVLSFKLG 164

Query: 288 FEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFR 109
           F  +                +  ARK+FDEM  RD+VL+T+M GGY Q GE VL L +FR
Sbjct: 165 FGVSLFVSSALVIMYVDIGEILYARKLFDEMLVRDSVLYTAMFGGYVQEGEAVLGLALFR 224

Query: 108 EMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVHG 4
           EM+  G  LD VVMVSL+  C QLG  K GKSVHG
Sbjct: 225 EMMCYGFSLDSVVMVSLVTACGQLGTLKHGKSVHG 259



 Score = 75.9 bits (185), Expect = 3e-11
 Identities = 54/189 (28%), Positives = 94/189 (49%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G    + +SS L++MY    ++L ++  +F  M  R++  +  + G + + G   + L L
Sbjct: 164 GFGVSLFVSSALVIMYVDIGEIL-YARKLFDEMLVRDSVLYTAMFGGYVQEGEAVLGLAL 222

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM+  G S      + LV  AC  LG  + G SVHG+C++                  
Sbjct: 223 FREMMCYGFSLDSVVMVSLVT-ACGQLGTLKHGKSVHGWCIRRCPCLGLNLGNAVVDMYV 281

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
               +  A +VF EM  RD + W+S++ GY  +G+ V+++++F EM+  GIE + V  + 
Sbjct: 282 KCSKLVYAHRVFVEMPTRDVISWSSLVLGYGLDGDVVMSVKLFDEMLEEGIEPNAVTFLG 341

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 342 VLSACAHGG 350


>emb|CDY56890.1| BnaC03g75290D [Brassica napus]
          Length = 474

 Score =  203 bits (516), Expect = 1e-49
 Identities = 112/215 (52%), Positives = 131/215 (60%)
 Frame = -2

Query: 648 PNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNII 469
           PN +         LR   YS+VVLSSKL+L YSK   L P S +VF HMPHRN FSWNII
Sbjct: 42  PNVNHLRHLHAHLLRTSNYSNVVLSSKLVLAYSKLKHLFPTSLAVFWHMPHRNIFSWNII 101

Query: 468 IGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLG 289
           IGEFSRSG P  S+ LF+ M    +  PD F LPLVLRAC+   +  +G  VH   LKLG
Sbjct: 102 IGEFSRSGFPSKSIDLFLRMWGDSSVRPDDFTLPLVLRACSASREARLGGLVHVLSLKLG 161

Query: 288 FEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFR 109
           F  +                +  ARK+FDEM  RD+VL+T+M GGY Q GE VL L +FR
Sbjct: 162 FGVSLFVSSALVIMYVDIGEILYARKLFDEMLVRDSVLYTAMFGGYVQEGEAVLGLALFR 221

Query: 108 EMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVHG 4
           EM+  G  LD VVMVSL+  C QLG  K GKSVHG
Sbjct: 222 EMMCYGFSLDSVVMVSLVTACGQLGTLKHGKSVHG 256



 Score = 76.3 bits (186), Expect = 2e-11
 Identities = 54/189 (28%), Positives = 94/189 (49%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G    + +SS L++MY    ++L ++  +F  M  R++  +  + G + + G   + L L
Sbjct: 161 GFGVSLFVSSALVIMYVDIGEIL-YARKLFDEMLVRDSVLYTAMFGGYVQEGEAVLGLAL 219

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM+  G S      + LV  AC  LG  + G SVHG+C++                  
Sbjct: 220 FREMMCYGFSLDSVVMVSLVT-ACGQLGTLKHGKSVHGWCIRRCPCLGLNLGNAVVDMYV 278

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
               +  A +VF EM  RD + W+S++ GY  +G+ V+++++F EM+  GIE + V  + 
Sbjct: 279 KCSKLVYAHRVFVEMPSRDVISWSSLVLGYGLDGDVVMSVKLFDEMLEEGIEPNAVTFLG 338

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 339 VLSACAHGG 347


>ref|XP_006414782.1| hypothetical protein EUTSA_v10027059mg [Eutrema salsugineum]
           gi|557115952|gb|ESQ56235.1| hypothetical protein
           EUTSA_v10027059mg [Eutrema salsugineum]
          Length = 472

 Score =  203 bits (516), Expect = 1e-49
 Identities = 112/216 (51%), Positives = 135/216 (62%), Gaps = 1/216 (0%)
 Frame = -2

Query: 648 PNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNII 469
           PNA          LR   YS+VVLSSKL+L YSK + L P S +VF HMPHRN FSWNII
Sbjct: 39  PNAKHLRHLHAHLLRTSTYSNVVLSSKLVLAYSKLDHLFPTSLAVFWHMPHRNIFSWNII 98

Query: 468 IGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRAC-ACLGDGEMGASVHGFCLKL 292
           IGEFSRSG    S+ LF+ M R  +  PD F LPL+LRAC A   + ++G  +H   LKL
Sbjct: 99  IGEFSRSGFAAESIGLFLRMWRESSVRPDDFTLPLLLRACSASPEEAKLGDLIHSLSLKL 158

Query: 291 GFEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVF 112
           GF  +                + DARK+FD+M  RD+VL+T+M GGY Q GE +L L +F
Sbjct: 159 GFGASLFVSSALVIMYVDIKKISDARKLFDDMHVRDSVLYTAMFGGYVQQGEAILGLALF 218

Query: 111 REMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVHG 4
           REM+ +G  LD VVMVSLL  C QLG  K GKSVHG
Sbjct: 219 REMMCSGFSLDSVVMVSLLTACGQLGALKHGKSVHG 254



 Score = 78.6 bits (192), Expect = 5e-12
 Identities = 54/189 (28%), Positives = 94/189 (49%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G  + + +SS L++MY    K+   +  +F  M  R++  +  + G + + G   + L L
Sbjct: 159 GFGASLFVSSALVIMYVDIKKI-SDARKLFDDMHVRDSVLYTAMFGGYVQQGEAILGLAL 217

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM+ SG S  D   +  +L AC  LG  + G SVHG+C++                  
Sbjct: 218 FREMMCSGFSL-DSVVMVSLLTACGQLGALKHGKSVHGWCIRRCSCLGLNLGNAITDMYV 276

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
               +  ARKVF  M  +D + W+S++ GY  +G+ V+++ +F EM+  GI+ + V  + 
Sbjct: 277 KCSNMACARKVFVNMQRKDVISWSSLILGYGLDGDGVVSINLFEEMLKEGIKPNAVTFLG 336

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 337 VLSACAHGG 345


>ref|XP_009150190.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14170
           [Brassica rapa]
          Length = 475

 Score =  201 bits (512), Expect = 4e-49
 Identities = 113/215 (52%), Positives = 132/215 (61%)
 Frame = -2

Query: 648 PNASXXXXXXXXXLRNGLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNII 469
           PN +         LR   YS+VVLSSKL+L YSK N L P S +VF HMPHRN FSWNII
Sbjct: 45  PNVNHLRHLHAHLLRTSNYSNVVLSSKLVLAYSKLNHLFPTSLAVFWHMPHRNIFSWNII 104

Query: 468 IGEFSRSGLPHMSLQLFVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLG 289
           IGEFSRSG P  S+ LF+ M    +  PD F LPLVLRAC+   +  +G  VH   LKLG
Sbjct: 105 IGEFSRSGFPSQSIDLFLRMWGESSVRPDDFTLPLVLRACSASREARLG--VHVLSLKLG 162

Query: 288 FEKNXXXXXXXXXXXXXXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFR 109
           F  +                +  ARK+FDEM  RD+VL+T+M GGY Q GE VL L +FR
Sbjct: 163 FGVSLFVSSALVIMYVDIGEILYARKLFDEMLVRDSVLYTAMFGGYVQEGEAVLGLALFR 222

Query: 108 EMVGAGIELDGVVMVSLLLVCSQLGWPKPGKSVHG 4
           EM+  G  LD VVMVSL+  C QLG  K GKSVHG
Sbjct: 223 EMMCYGFSLDSVVMVSLVTACGQLGTLKHGKSVHG 257



 Score = 75.9 bits (185), Expect = 3e-11
 Identities = 54/189 (28%), Positives = 94/189 (49%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G    + +SS L++MY    ++L ++  +F  M  R++  +  + G + + G   + L L
Sbjct: 162 GFGVSLFVSSALVIMYVDIGEIL-YARKLFDEMLVRDSVLYTAMFGGYVQEGEAVLGLAL 220

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM+  G S      + LV  AC  LG  + G SVHG+C++                  
Sbjct: 221 FREMMCYGFSLDSVVMVSLVT-ACGQLGTLKHGKSVHGWCIRRCPCLGLNLGNAVVDMYV 279

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
               +  A +VF EM  RD + W+S++ GY  +G+ V+++++F EM+  GIE + V  + 
Sbjct: 280 KCSKLVYAHRVFVEMPTRDVISWSSLVLGYGLDGDVVMSVKLFDEMLEEGIEPNAVTFLG 339

Query: 60  LLLVCSQLG 34
           +L  C+  G
Sbjct: 340 VLSACAHGG 348


>ref|XP_011620751.1| PREDICTED: pentatricopeptide repeat-containing protein At4g14170
           [Amborella trichopoda]
          Length = 420

 Score =  185 bits (470), Expect = 3e-44
 Identities = 98/184 (53%), Positives = 120/184 (65%)
 Frame = -2

Query: 558 MYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQLFVEMLRSGASAPDP 379
           MYSKH  L P+++++FLHMP RN +SWNIIIGE SRSG P  +++ F  M R     PD 
Sbjct: 1   MYSKHKSLFPNAYNLFLHMPERNIYSWNIIIGELSRSGYPLKAIETFKNM-RKSRMEPDL 59

Query: 378 FNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXXXXXXVCDARKVFDE 199
           + LPLVLRACA LG  + G  +H +C K+G  +N                +  A +VFDE
Sbjct: 60  YTLPLVLRACATLGQFKPGQKLHCYCEKVGSSRNVFVASALVFFYAKFGEIDAACQVFDE 119

Query: 198 MSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVSLLLVCSQLGWPKPG 19
           M+ RD VLWT+ML GYAQ G+   AL VFREM+   I LD VVMVSLLLVCSQLGW + G
Sbjct: 120 MTQRDHVLWTTMLSGYAQIGDTSKALRVFREMMREKIMLDHVVMVSLLLVCSQLGWLRHG 179

Query: 18  KSVH 7
           KSVH
Sbjct: 180 KSVH 183



 Score = 71.6 bits (174), Expect = 6e-10
 Identities = 50/189 (26%), Positives = 87/189 (46%)
 Frame = -2

Query: 600 GLYSDVVLSSKLMLMYSKHNKLLPHSFSVFLHMPHRNTFSWNIIIGEFSRSGLPHMSLQL 421
           G   +V ++S L+  Y+K  ++   +  VF  M  R+   W  ++  +++ G    +L++
Sbjct: 89  GSSRNVFVASALVFFYAKFGEI-DAACQVFDEMTQRDHVLWTTMLSGYAQIGDTSKALRV 147

Query: 420 FVEMLRSGASAPDPFNLPLVLRACACLGDGEMGASVHGFCLKLGFEKNXXXXXXXXXXXX 241
           F EM+R          + L+L  C+ LG    G SVH   +K     N            
Sbjct: 148 FREMMREKIMLDHVVMVSLLL-VCSQLGWLRHGKSVHCVMVKGFLGMNLSLANALIDVYS 206

Query: 240 XXXXVCDARKVFDEMSDRDAVLWTSMLGGYAQNGEPVLALEVFREMVGAGIELDGVVMVS 61
                  AR+VFD+M +RD + W+SM+ GY  +G    AL++F+ M+   I+ + +  + 
Sbjct: 207 KCGAPKIARRVFDKMHERDVISWSSMIAGYGLHGHAKKALDLFKTMLDHDIKPNSITFLG 266

Query: 60  LLLVCSQLG 34
            L  C   G
Sbjct: 267 ALSACVHAG 275

