BLASTX nr result

ID: Cheilocostus21_contig00032096 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00032096
         (1554 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009412383.1| PREDICTED: pentatricopeptide repeat-containi...   673   0.0  
gb|ERN08806.1| hypothetical protein AMTR_s00017p00256920 [Ambore...   382   e-124
ref|XP_020524583.1| pentatricopeptide repeat-containing protein ...   382   e-124
ref|XP_023895407.1| pentatricopeptide repeat-containing protein ...   315   3e-97
ref|XP_007212650.2| pentatricopeptide repeat-containing protein ...   315   6e-97
ref|XP_003632994.1| PREDICTED: pentatricopeptide repeat-containi...   315   6e-97
gb|PIA41688.1| hypothetical protein AQUCO_02200249v1 [Aquilegia ...   311   7e-97
ref|XP_010920022.1| PREDICTED: pentatricopeptide repeat-containi...   314   1e-96
ref|XP_008225544.1| PREDICTED: pentatricopeptide repeat-containi...   314   2e-96
ref|XP_010923707.1| PREDICTED: pentatricopeptide repeat-containi...   313   3e-96
ref|XP_006651966.1| PREDICTED: pentatricopeptide repeat-containi...   312   6e-96
ref|XP_020521010.1| LOW QUALITY PROTEIN: pentatricopeptide repea...   311   8e-96
ref|XP_021593410.1| pentatricopeptide repeat-containing protein ...   312   1e-95
ref|XP_021593409.1| pentatricopeptide repeat-containing protein ...   312   1e-95
gb|OVA05584.1| Pentatricopeptide repeat [Macleaya cordata]            309   1e-95
ref|XP_007147940.1| hypothetical protein PHAVU_006G167300g [Phas...   311   2e-95
ref|XP_021593408.1| pentatricopeptide repeat-containing protein ...   312   2e-95
ref|XP_014517439.1| pentatricopeptide repeat-containing protein ...   311   3e-95
ref|XP_008794509.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   311   3e-95
ref|XP_020522997.1| pentatricopeptide repeat-containing protein ...   311   4e-95

>ref|XP_009412383.1| PREDICTED: pentatricopeptide repeat-containing protein At5g56310-like
            [Musa acuminata subsp. malaccensis]
          Length = 470

 Score =  673 bits (1736), Expect = 0.0
 Identities = 347/472 (73%), Positives = 384/472 (81%), Gaps = 7/472 (1%)
 Frame = -3

Query: 1498 MPIAGTCSPLSLPLPRFSAHRTHRDAA-----VIHLRRLHAHALRSDVHEPPFWNSLARS 1334
            M  A   SP  LP P   AHR H D +       HLR  HA A+R+ VH+P FWN+LARS
Sbjct: 1    MQTAEAWSPTPLPGP--PAHRKHLDPSGSFPTFAHLRMRHARAIRTHVHQPSFWNALARS 58

Query: 1333 YASHGFPD-LALAVCLQMPLRDAFTFPFAFKLCSLLSAIPEAASLQAHLLKLGPAAATIY 1157
            YAS+G    LAL VCL MPLRDA TFP AFKLCSLLSA  EA SL AHL+KLG AA +++
Sbjct: 59   YASYGAAHHLALGVCLHMPLRDAHTFPLAFKLCSLLSAFAEAVSLHAHLVKLGLAATSVH 118

Query: 1156 SLNALVSLYSNLGHLDLALQLFDRIPNRTASSWSAMIAGYDRNGRPLEALFTFLGMADSG 977
            SLNALV+LYSN GHLDLA QLFDRIP RT SSWSAMIAGYDRN +P EALFTFLGM+ +G
Sbjct: 119  SLNALVTLYSNFGHLDLARQLFDRIPRRTVSSWSAMIAGYDRNAQPREALFTFLGMSGAG 178

Query: 976  VCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVYGLGLDSVGFATALVDFYAKCGEVDSA 797
            V PDEAA VSTLAACTHGGCLEFG+++HA A VYGLGL+SVGFATALVD YAKCGEVDSA
Sbjct: 179  VSPDEAALVSTLAACTHGGCLEFGKAIHACATVYGLGLESVGFATALVDLYAKCGEVDSA 238

Query: 796  RSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLSACSHA 617
              VF RMA+RNVL+WSAMIGGLAMHG   EAIK+FDEMV AGVRPT VTMTNVLSACSH 
Sbjct: 239  MEVFERMAQRNVLSWSAMIGGLAMHGRAPEAIKLFDEMVEAGVRPTSVTMTNVLSACSHV 298

Query: 616  GLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAPTAAIWRS 437
            GLV++GLR+FKL +EEYG+EPRVEHCGCVVDLL RAG+F EAREFI+TMP   TAAIWRS
Sbjct: 299  GLVDQGLRLFKLMKEEYGMEPRVEHCGCVVDLLGRAGLFHEAREFISTMPTPATAAIWRS 358

Query: 436  LLGAACTQGTLDVGRMAGERLAA-AEEMAAGDYVMLANLYARFALWEEVRKVRVEMNDMG 260
            LLGAACT G L+ GR+AGERLAA  E+M AGDYVMLANLYARF LWEEV +VR EMND+G
Sbjct: 359  LLGAACTHGDLEAGRLAGERLAATGEQMVAGDYVMLANLYARFGLWEEVGRVRTEMNDVG 418

Query: 259  VRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEMLELLNSELMDQ*FRSRH 104
            VRKVAGFSSVE+DGELHRF M D+ H +  RIHE+L LLNSELMD    S H
Sbjct: 419  VRKVAGFSSVEVDGELHRFVMADRSHREAKRIHEVLRLLNSELMDHESSSFH 470


>gb|ERN08806.1| hypothetical protein AMTR_s00017p00256920 [Amborella trichopoda]
          Length = 488

 Score =  382 bits (982), Expect = e-124
 Identities = 208/437 (47%), Positives = 272/437 (62%), Gaps = 7/437 (1%)
 Frame = -3

Query: 1429 RDAAVIHLRRLHAHALRSDVHEPP-FWNSLARSYASHGFPDLALAVCLQM-PLRDAFTFP 1256
            R     H+ ++HA  LR +   P   +N+L R+YA H +P  AL +   + P  D FT P
Sbjct: 44   RSTTTAHITQIHARLLRINPSNPVRLYNTLIRAYALHSYPRSALLLYAHLLPHADPFTLP 103

Query: 1255 FAFKLCSLLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNL-GHLDLALQLFDRIP 1079
            FA K CS  S++   + L AH +KLG  A   + LN L+  Y+     +DLA  +FD + 
Sbjct: 104  FALKACS--SSLLLTSCLHAHAIKLGHPAINTFFLNTLIHNYATTCARVDLAHHVFDCMI 161

Query: 1078 NRTASSWSAMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRS 899
             RTA+SW A+IAGY+R G P  +L  F  M   G  PDEA  VS L AC   GC   G  
Sbjct: 162  GRTAASWGALIAGYERVGEPHHSLHLFQAMRLQGEPPDEATLVSALCACAQLGCPRSGPL 221

Query: 898  MHAHAIVYGLG-LDSVGFATALVDFYAKCGEVDSARSVFGRM--AERNVLTWSAMIGGLA 728
            +HA  I++G    D V   TAL+D YAKCG +  A  +F RM    RNVL WSAMIGG+A
Sbjct: 222  LHACTIIHGFDPADHVNLGTALIDMYAKCGCISYACKLFDRMPLGRRNVLLWSAMIGGMA 281

Query: 727  MHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRV 548
            MHG   EA+ +F+ M   GV P  +T TNVL+ACSH GLV EGLR F+   EEY + PR+
Sbjct: 282  MHGQAHEALILFEHMRSCGVVPNAITFTNVLNACSHRGLVGEGLRCFRRMMEEYRMLPRI 341

Query: 547  EHCGCVVDLLFRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAA 368
            EH GCVVD+L RAG+ EEA  F+  MP+ PT  +WRSLLGAACT G +++G  A + L+ 
Sbjct: 342  EHYGCVVDMLGRAGLLEEALAFMRAMPIKPTVPLWRSLLGAACTHGDVELGEAAMDGLSR 401

Query: 367  AE-EMAAGDYVMLANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGD 191
             E  ++A D V+++NLYA+   WE+V K+R+ MND G++K  GFS +E+DG LHRF MGD
Sbjct: 402  LEAALSASDCVVMSNLYAQVGDWEKVGKLRMAMNDSGLKKTPGFSVIEVDGTLHRFVMGD 461

Query: 190  KLHPQIIRIHEMLELLN 140
            K HPQ   I++ML  LN
Sbjct: 462  KFHPQTQHIYDMLHQLN 478


>ref|XP_020524583.1| pentatricopeptide repeat-containing protein At4g21065 [Amborella
            trichopoda]
 ref|XP_020524584.1| pentatricopeptide repeat-containing protein At4g21065 [Amborella
            trichopoda]
 ref|XP_006847225.2| pentatricopeptide repeat-containing protein At4g21065 [Amborella
            trichopoda]
 ref|XP_020524585.1| pentatricopeptide repeat-containing protein At4g21065 [Amborella
            trichopoda]
          Length = 491

 Score =  382 bits (982), Expect = e-124
 Identities = 208/437 (47%), Positives = 272/437 (62%), Gaps = 7/437 (1%)
 Frame = -3

Query: 1429 RDAAVIHLRRLHAHALRSDVHEPP-FWNSLARSYASHGFPDLALAVCLQM-PLRDAFTFP 1256
            R     H+ ++HA  LR +   P   +N+L R+YA H +P  AL +   + P  D FT P
Sbjct: 47   RSTTTAHITQIHARLLRINPSNPVRLYNTLIRAYALHSYPRSALLLYAHLLPHADPFTLP 106

Query: 1255 FAFKLCSLLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNL-GHLDLALQLFDRIP 1079
            FA K CS  S++   + L AH +KLG  A   + LN L+  Y+     +DLA  +FD + 
Sbjct: 107  FALKACS--SSLLLTSCLHAHAIKLGHPAINTFFLNTLIHNYATTCARVDLAHHVFDCMI 164

Query: 1078 NRTASSWSAMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRS 899
             RTA+SW A+IAGY+R G P  +L  F  M   G  PDEA  VS L AC   GC   G  
Sbjct: 165  GRTAASWGALIAGYERVGEPHHSLHLFQAMRLQGEPPDEATLVSALCACAQLGCPRSGPL 224

Query: 898  MHAHAIVYGLG-LDSVGFATALVDFYAKCGEVDSARSVFGRM--AERNVLTWSAMIGGLA 728
            +HA  I++G    D V   TAL+D YAKCG +  A  +F RM    RNVL WSAMIGG+A
Sbjct: 225  LHACTIIHGFDPADHVNLGTALIDMYAKCGCISYACKLFDRMPLGRRNVLLWSAMIGGMA 284

Query: 727  MHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRV 548
            MHG   EA+ +F+ M   GV P  +T TNVL+ACSH GLV EGLR F+   EEY + PR+
Sbjct: 285  MHGQAHEALILFEHMRSCGVVPNAITFTNVLNACSHRGLVGEGLRCFRRMMEEYRMLPRI 344

Query: 547  EHCGCVVDLLFRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAA 368
            EH GCVVD+L RAG+ EEA  F+  MP+ PT  +WRSLLGAACT G +++G  A + L+ 
Sbjct: 345  EHYGCVVDMLGRAGLLEEALAFMRAMPIKPTVPLWRSLLGAACTHGDVELGEAAMDGLSR 404

Query: 367  AE-EMAAGDYVMLANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGD 191
             E  ++A D V+++NLYA+   WE+V K+R+ MND G++K  GFS +E+DG LHRF MGD
Sbjct: 405  LEAALSASDCVVMSNLYAQVGDWEKVGKLRMAMNDSGLKKTPGFSVIEVDGTLHRFVMGD 464

Query: 190  KLHPQIIRIHEMLELLN 140
            K HPQ   I++ML  LN
Sbjct: 465  KFHPQTQHIYDMLHQLN 481


>ref|XP_023895407.1| pentatricopeptide repeat-containing protein At4g21065-like [Quercus
            suber]
          Length = 578

 Score =  315 bits (806), Expect = 3e-97
 Identities = 175/442 (39%), Positives = 256/442 (57%), Gaps = 8/442 (1%)
 Frame = -3

Query: 1423 AAVIHLRRL-HAHALRSDVHEPPF--WNSLARSYASHGFPDL-ALAVCLQMPLR----DA 1268
            A  + +R++ HAH L S  H+P    WN++ R Y+ +    L A+A+   M L     ++
Sbjct: 46   AITLPIRQIAHAHKLFSWTHQPNLFMWNTIIRGYSINDSSSLKAIALYKDMHLSGISSNS 105

Query: 1267 FTFPFAFKLCSLLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFD 1088
            FTF F  K C  L  + E   + + +LK+G    T + +N L+ LY+  G +D A  LFD
Sbjct: 106  FTFGFVLKACCNLPRLEEGKMVHSQVLKMGLDYET-HVVNGLIKLYTTCGRVDEARDLFD 164

Query: 1087 RIPNRTASSWSAMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEF 908
             +  R   SWS M++GY +NG   EA   F  M    V  DE    S   AC   G L+ 
Sbjct: 165  EMSERDLVSWSTMVSGYVQNGCSNEAFVLFKQMQAQNVIADEFTLASVAGACGDMGALDL 224

Query: 907  GRSMHAHAIVYGLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLA 728
            G+ +H++    G+ +D V   T+LVD Y+KCG +D+A  VF  M++R+V+ WS MIGG A
Sbjct: 225  GKWVHSYIDKEGIDIDIV-LGTSLVDMYSKCGSLDNAIRVFEGMSKRDVMAWSTMIGGCA 283

Query: 727  MHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRV 548
            +HG G +A+KVF  M  A VRP  VT T VL ACSH+GLV+EG + F     +YGI P +
Sbjct: 284  IHGFGEKALKVFHAMKSANVRPNSVTFTCVLCACSHSGLVKEGCQHFNSMSLDYGITPEI 343

Query: 547  EHCGCVVDLLFRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAA 368
            EH GC+VDL  RAG+   A +FI  MP+ P A +WR+LLGA  T G  ++       +  
Sbjct: 344  EHYGCMVDLFCRAGLVLRAHKFIQKMPIKPNAVLWRTLLGACKTHGYKELSESITREVLE 403

Query: 367  AEEMAAGDYVMLANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDK 188
             E  +A +YV+++N+YA    W  V KVR +M     +K  G+SS+E+   +H+F MGD+
Sbjct: 404  LEPRSAENYVLVSNVYASQGRWSSVSKVRSQMKHKKAKKQHGWSSIEMGFAVHQFVMGDE 463

Query: 187  LHPQIIRIHEMLELLNSELMDQ 122
            LHP+I +I++ML+ +  +L  +
Sbjct: 464  LHPEIRQIYQMLDQMAKKLKQE 485


>ref|XP_007212650.2| pentatricopeptide repeat-containing protein At4g21065 [Prunus
            persica]
 gb|ONI11096.1| hypothetical protein PRUPE_4G087400 [Prunus persica]
          Length = 623

 Score =  315 bits (808), Expect = 6e-97
 Identities = 171/421 (40%), Positives = 250/421 (59%), Gaps = 6/421 (1%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +AH + S +  P    WN++ R YA    P   L +  QM +     D  T+PF  K  +
Sbjct: 102  YAHQIFSQIRSPNVFTWNTMIRGYAESENPTPVLQLYHQMHVNSVEPDTHTYPFLLKAVA 161

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWS 1055
             L+ + E   + +  L+ G   + ++  N L+ +Y+  GH++ A ++F+ I  R   +W+
Sbjct: 162  KLTNVREGEKIHSIALRNG-FESLVFVKNTLLHMYACCGHVESAHRVFESISERDLVAWN 220

Query: 1054 AMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVY 875
            ++I G+  NGRP EAL  F  M+  GV PD    VS L+AC   G L  GR +H + +  
Sbjct: 221  SVINGFALNGRPNEALTVFRDMSLEGVQPDGFTMVSLLSACAELGTLALGRRIHVYMLKV 280

Query: 874  GLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKV 695
            GL  +S     AL+D YAKCG +  A+ VF  M ER+V++W+A++ GLA++G G EA++ 
Sbjct: 281  GLTGNSHA-TNALLDLYAKCGNIREAQKVFKTMDERSVVSWTALVVGLAVNGFGNEALEH 339

Query: 694  FDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLF 515
            F E+   G+ PT +T   VL ACSH G+V+EG   F++ +EEYGI PR+EH GC++DLL 
Sbjct: 340  FQELRREGLVPTEITFVGVLYACSHCGMVDEGFNYFRMMKEEYGIVPRIEHYGCMIDLLG 399

Query: 514  RAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVM 335
            RAG+ +EA E+I  MP+ P A IWR+LLGA    G L +G  A   +   E   +GDYV+
Sbjct: 400  RAGLVKEAYEYINNMPMQPNAVIWRTLLGACTIHGHLALGETARAHIRELEPGHSGDYVL 459

Query: 334  LANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEM 155
            L+NLYA    W +V+KVR  M   GVRK  G+S VE+   ++ F MGD+ HPQ  +I+ M
Sbjct: 460  LSNLYASERRWSDVQKVRRTMLSDGVRKTPGYSIVELRNCIYEFTMGDRSHPQSEKIYTM 519

Query: 154  L 152
            L
Sbjct: 520  L 520


>ref|XP_003632994.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065
            [Vitis vinifera]
          Length = 613

 Score =  315 bits (807), Expect = 6e-97
 Identities = 173/431 (40%), Positives = 250/431 (58%), Gaps = 10/431 (2%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +AH + S +  P    WN++ R YA    P  AL +  QM +     D  T+PF  K  +
Sbjct: 92   YAHQIFSQIQNPNIFTWNTMIRGYAESENPMPALELYRQMHVSCIEPDTHTYPFLLKAIA 151

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWS 1055
             L  + E   + +  ++ G   + ++  N LV +Y+  GH + A +LF+ +  R   +W+
Sbjct: 152  KLMDVREGEKVHSIAIRNG-FESLVFVQNTLVHMYAACGHAESAHKLFELMAERNLVTWN 210

Query: 1054 AMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVY 875
            ++I GY  NGRP EAL  F  M   GV PD    VS L+AC   G L  GR  H + +  
Sbjct: 211  SVINGYALNGRPNEALTLFREMGLRGVEPDGFTMVSLLSACAELGALALGRRAHVYMVKV 270

Query: 874  GLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKV 695
            GL   ++    AL+D YAKCG +  A  VF  M E++V++W+++I GLA++G G EA+++
Sbjct: 271  GLD-GNLHAGNALLDLYAKCGSIRQAHKVFDEMEEKSVVSWTSLIVGLAVNGFGKEALEL 329

Query: 694  FDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLF 515
            F E+   G+ P+ +T   VL ACSH G+V+EG   FK  +EEYGI P++EH GC+VDLL 
Sbjct: 330  FKELERKGLMPSEITFVGVLYACSHCGMVDEGFDYFKRMKEEYGIVPKIEHYGCMVDLLG 389

Query: 514  RAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVM 335
            RAG+ ++A EFI  MP+ P A +WR+LLGA    G L +G +A  +L   E   +GDYV+
Sbjct: 390  RAGLVKQAHEFIQNMPMQPNAVVWRTLLGACTIHGHLALGEVARAQLLQLEPKHSGDYVL 449

Query: 334  LANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQ----IIR 167
            L+NLYA    W +V KVR  M   GV+K  G S VE+   LH F MGD+ HPQ     ++
Sbjct: 450  LSNLYASEQRWSDVHKVRRTMLREGVKKTPGHSLVELRNRLHEFVMGDRSHPQTEEIYVK 509

Query: 166  IHEMLELLNSE 134
            + E+ +LL  E
Sbjct: 510  LAEITKLLKLE 520



 Score =  112 bits (281), Expect = 1e-22
 Identities = 74/246 (30%), Positives = 118/246 (47%), Gaps = 1/246 (0%)
 Frame = -3

Query: 1144 LVSLYSNLGHLDLALQLFDRIPNRTASSWSAMIAGYDRNGRPLEALFTFLGMADSGVCPD 965
            + +L S    +  A Q+F +I N    +W+ MI GY  +  P+ AL  +  M  S + PD
Sbjct: 80   IFTLLSFCSPMSYAHQIFSQIQNPNIFTWNTMIRGYAESENPMPALELYRQMHVSCIEPD 139

Query: 964  EAAFVSTLAACTHGGCLEFGRSMHAHAIVYGLGLDSVGFA-TALVDFYAKCGEVDSARSV 788
               +   L A      ++       H+I    G +S+ F    LV  YA CG  +SA  +
Sbjct: 140  THTYPFLLKAIAK--LMDVREGEKVHSIAIRNGFESLVFVQNTLVHMYAACGHAESAHKL 197

Query: 787  FGRMAERNVLTWSAMIGGLAMHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLSACSHAGLV 608
            F  MAERN++TW+++I G A++G   EA+ +F EM L GV P   TM ++LSAC+  G +
Sbjct: 198  FELMAERNLVTWNSVINGYALNGRPNEALTLFREMGLRGVEPDGFTMVSLLSACAELGAL 257

Query: 607  EEGLRVFKLTREEYGIEPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAPTAAIWRSLLG 428
              G R   +   + G++  +     ++DL  + G   +A +    M    +   W SL+ 
Sbjct: 258  ALGRRA-HVYMVKVGLDGNLHAGNALLDLYAKCGSIRQAHKVFDEME-EKSVVSWTSLIV 315

Query: 427  AACTQG 410
                 G
Sbjct: 316  GLAVNG 321


>gb|PIA41688.1| hypothetical protein AQUCO_02200249v1 [Aquilegia coerulea]
          Length = 503

 Score =  311 bits (798), Expect = 7e-97
 Identities = 170/409 (41%), Positives = 241/409 (58%), Gaps = 4/409 (0%)
 Frame = -3

Query: 1345 LARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCSLLSAIPEAASLQAHLLKLG 1178
            + R YA    P  A+++  QM +     D  T+PF  K C+ L A+ E   + + ++K G
Sbjct: 1    MIRGYAESENPVPAISLYAQMHVLNIEPDTHTYPFLLKACAKLMAVKEGEEIHSIVIKNG 60

Query: 1177 PAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWSAMIAGYDRNGRPLEALFTF 998
               + ++  N LV LYS  G  + A ++F+ +P+R   +W+++I G+  NGRP EAL  F
Sbjct: 61   -YESLVFVQNTLVHLYSACGLPENAHKMFELMPDRNLVTWNSVINGFSLNGRPNEALTLF 119

Query: 997  LGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVYGLGLDSVGFATALVDFYAK 818
              M+  GV PD    VS L AC   G L  GR +H + +  GL  D++    AL+D YAK
Sbjct: 120  KKMSVEGVEPDGFTMVSLLTACAELGALALGRRVHLYMVKVGLH-DNLHAGNALIDLYAK 178

Query: 817  CGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKVFDEMVLAGVRPTPVTMTNV 638
            CG +  A  VF  M  R++++W+++I GLA++G G EAIK+F E+   G+ P+ +T   V
Sbjct: 179  CGSIWEAYKVFEEMKSRSIVSWTSLIVGLAVNGFGKEAIKLFGELEKEGLVPSDITFVGV 238

Query: 637  LSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAP 458
            L ACSH G+V EG   FK   EEY I P++EH GC+VDLL RAG  +EA  FI  MP+ P
Sbjct: 239  LYACSHCGMVNEGFNYFKRMTEEYHITPKIEHYGCLVDLLGRAGRVQEAYHFIQNMPLEP 298

Query: 457  TAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVMLANLYARFALWEEVRKVRV 278
             A +WR+LLGA    G + +G +A  RL   E   +GDYV+++NLYA    W +V+KVR 
Sbjct: 299  NAIVWRTLLGACMIHGHMKLGEVARARLLQLEPKHSGDYVLISNLYASERRWSDVQKVRK 358

Query: 277  EMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEMLELLNSEL 131
             M   GV+K  G S VE+   +H F MGDK HPQ   I+ MLE +  +L
Sbjct: 359  TMLREGVKKNPGHSLVELQNCVHEFVMGDKAHPQTREIYNMLEEIMKKL 407


>ref|XP_010920022.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
            [Elaeis guineensis]
          Length = 591

 Score =  314 bits (804), Expect = 1e-96
 Identities = 168/429 (39%), Positives = 256/429 (59%), Gaps = 7/429 (1%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPFW--NSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +AH++   + +P  +  N++ R++     P+ AL +  +M  R    D FTFPFA K C+
Sbjct: 84   YAHSIFLTLDDPGTFDFNTMIRAHVKDNDPEAALLLFKEMQERSVRPDNFTFPFALKACA 143

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRI-PNRTASSW 1058
             LSAI E   +  H+ KLG     ++  N+L+++Y   G + L  ++F ++  +RT +SW
Sbjct: 144  QLSAIEEGMQIHGHVTKLG-FECDVFIQNSLINMYGKCGEIKLCCRVFGQMGSDRTVASW 202

Query: 1057 SAMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIV 878
            SA++A + R G   E L  F  M   G+  DE++ VS L++C H G  + GRS+H   + 
Sbjct: 203  SAILAAHTRMGLWNECLKLFAMMMTEGLKADESSMVSALSSCAHLGTYDLGRSIHCSLLR 262

Query: 877  YGLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIK 698
               GL+ +   T+L+D Y KCG ++   ++F RM E+N  T+SA+I GLAMHG G +A++
Sbjct: 263  NITGLNLI-VQTSLIDTYLKCGSLEKGMAIFDRMPEKNKWTYSAVISGLAMHGDGEKALQ 321

Query: 697  VFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLL 518
            VF  M+  G+ P  V    VLSACSHAGL+E+GL+ F   + E+ I P V+H GC+VDL+
Sbjct: 322  VFSNMLKEGIEPDEVVYVGVLSACSHAGLLEDGLQCFDRMKLEHRIVPNVQHYGCMVDLM 381

Query: 517  FRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYV 338
             RAG   EA E I +MP+ PT   WR LL A    G L++   A + L   +   AGD++
Sbjct: 382  SRAGELNEAYELIRSMPMGPTDVAWRCLLNACKVHGNLELAECASKNLMQLDAHNAGDHI 441

Query: 337  MLANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHE 158
            +L+N+YA+   W++  ++RVEM D GV +V G+S VE+ G +H F   DK HPQ   ++E
Sbjct: 442  ILSNMYAKAQRWDDAARIRVEMVDRGVLQVPGYSRVEVKGRMHTFVSHDKSHPQSDEVYE 501

Query: 157  MLELLNSEL 131
            ML  +  +L
Sbjct: 502  MLYQMEWQL 510



 Score =  102 bits (255), Expect = 2e-19
 Identities = 72/258 (27%), Positives = 119/258 (46%), Gaps = 3/258 (1%)
 Frame = -3

Query: 1231 LSAIPEAASLQAHLLKLGPAAATIYSLNALVSL-YSNLGHLDLALQLFDRIPNRTASSWS 1055
            +  I E   +QA  +KLG      ++ + L +   S+ G +D A  +F  + +     ++
Sbjct: 42   VKTIEEFRKVQAQYIKLGLDRVPRHAGDLLSACALSDWGSMDYAHSIFLTLDDPGTFDFN 101

Query: 1054 AMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVY 875
             MI  + ++  P  AL  F  M +  V PD   F   L AC     +E G  +H H    
Sbjct: 102  TMIRAHVKDNDPEAALLLFKEMQERSVRPDNFTFPFALKACAQLSAIEEGMQIHGHVTKL 161

Query: 874  GLGLDSVGFATALVDFYAKCGEVDSARSVFGRM-AERNVLTWSAMIGGLAMHGLGTEAIK 698
            G   D V    +L++ Y KCGE+     VFG+M ++R V +WSA++      GL  E +K
Sbjct: 162  GFECD-VFIQNSLINMYGKCGEIKLCCRVFGQMGSDRTVASWSAILAAHTRMGLWNECLK 220

Query: 697  VFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVF-KLTREEYGIEPRVEHCGCVVDL 521
            +F  M+  G++    +M + LS+C+H G  + G  +   L R   G+   V+    ++D 
Sbjct: 221  LFAMMMTEGLKADESSMVSALSSCAHLGTYDLGRSIHCSLLRNITGLNLIVQ--TSLIDT 278

Query: 520  LFRAGMFEEAREFIATMP 467
              + G  E+       MP
Sbjct: 279  YLKCGSLEKGMAIFDRMP 296


>ref|XP_008225544.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like
            [Prunus mume]
          Length = 623

 Score =  314 bits (804), Expect = 2e-96
 Identities = 169/421 (40%), Positives = 251/421 (59%), Gaps = 6/421 (1%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +AH + S +  P    WN++ R YA    P   L +  QM +     D  T+PF  K  +
Sbjct: 102  YAHQIFSQIRSPNVFTWNTMIRGYAESENPTPVLQLYHQMHVNSVEPDTHTYPFLLKAVA 161

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWS 1055
             L+ + +   + +  L+ G   + ++  N L+ +Y+  GH++ A ++F+ +  R   +W+
Sbjct: 162  KLTNVRDGEKIHSIALRNG-FESLVFVKNTLLHMYACCGHVESAHRVFESMSERDLVAWN 220

Query: 1054 AMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVY 875
            ++I G+  NGRP EAL  F  M+  GV PD    VS L+AC   G L  GR +H + +  
Sbjct: 221  SVINGFALNGRPNEALTIFRDMSLEGVQPDGFTMVSLLSACAELGTLALGRRIHVYMLKV 280

Query: 874  GLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKV 695
            GL  +S     AL+D YAKCG +  A+ VF  M ER+V++W+A++ GLA++G G EA+++
Sbjct: 281  GLTGNSHA-TNALLDLYAKCGSIREAQKVFTTMDERSVVSWTALVVGLAVNGFGNEALEL 339

Query: 694  FDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLF 515
            F E+   G+ PT +T   VL ACSH G+V+EG   F++ +EEYGI PR+EH GC++DLL 
Sbjct: 340  FKELRREGLVPTEITFVGVLYACSHCGMVDEGFNYFRMMKEEYGIVPRIEHYGCMIDLLG 399

Query: 514  RAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVM 335
            RAG+ +EA E+I  MP+ P A IWR+LLGA    G L +G  A   +   E   +GDYV+
Sbjct: 400  RAGLVKEAYEYINNMPMQPNAVIWRTLLGACTIHGHLALGETARAHIRELEPGHSGDYVL 459

Query: 334  LANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEM 155
            L+NLYA    W +V+KVR  M   GVRK  G+S VE+   ++ F MGD+ HPQ  +I+ M
Sbjct: 460  LSNLYASERRWSDVQKVRRTMLSDGVRKTPGYSIVELRNCIYEFTMGDRSHPQSEKIYTM 519

Query: 154  L 152
            L
Sbjct: 520  L 520


>ref|XP_010923707.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065
            [Elaeis guineensis]
          Length = 626

 Score =  313 bits (803), Expect = 3e-96
 Identities = 174/429 (40%), Positives = 247/429 (57%), Gaps = 8/429 (1%)
 Frame = -3

Query: 1393 AHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCSL 1232
            A A+ S +H P    +N++ R YA    P  AL V  QM       D  T+PF  K C+ 
Sbjct: 104  AAAVFSQIHLPGVFTYNTMIRGYAESDSPGPALLVHRQMLAAAIPPDTHTYPFLLKACAK 163

Query: 1231 LSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWSA 1052
            L A+ E   + +  +K G     ++  N LV  Y   G  + A ++F+ +  R   +W++
Sbjct: 164  LMALREGEKVHSLSVKNG-LETCVFVQNTLVHFYGTCGLFESAYKVFEEMDERNLVTWNS 222

Query: 1051 MIAGYDRNGRPLEALFTF--LGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIV 878
            +I G+  NGRP EAL  F  + + DSGV PD    VS L AC   G L  GR  H + + 
Sbjct: 223  IINGFATNGRPNEALTLFREMNLEDSGVKPDGFTMVSLLCACAELGALALGRRAHLYLVK 282

Query: 877  YGLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIK 698
             GL   +V    AL+D YAKCG ++ A  VF  MA + V++W+++I GLA++G G EA++
Sbjct: 283  VGL-YGNVHVENALIDLYAKCGSIEEAYRVFDEMASKTVVSWTSLIVGLAVNGFGKEALE 341

Query: 697  VFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLL 518
            +F  M    + PT +T+  VL ACSH GLV+EG R F   + EY I P++EH GC+VDLL
Sbjct: 342  LFSAMERERLVPTEITLVGVLYACSHCGLVDEGFRYFNRMKNEYNIVPKIEHYGCMVDLL 401

Query: 517  FRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYV 338
             RAG+ E+A ++I  MP+AP A +WR+LLGA      LD+G +A  RL   +   +GDYV
Sbjct: 402  GRAGLVEQAHDYIMNMPLAPNAVVWRTLLGACAMHKRLDLGNVAWARLVELDPGHSGDYV 461

Query: 337  MLANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHE 158
            +L+NLYA    W EV ++R  M   GVRK+ G S VE+   +H F MGD+ H Q   I+ 
Sbjct: 462  LLSNLYAAVGRWGEVHRLRRSMLKGGVRKMPGHSLVELGNRVHEFVMGDRSHSQSDEIYV 521

Query: 157  MLELLNSEL 131
            MLE + ++L
Sbjct: 522  MLEEIANKL 530



 Score =  111 bits (278), Expect = 3e-22
 Identities = 94/324 (29%), Positives = 147/324 (45%), Gaps = 10/324 (3%)
 Frame = -3

Query: 1258 PFAFKLCSLL----SAIPEAASLQAHLLKLG-PAAATIYSLNALVSLYS-NLGHLDLALQ 1097
            P   K C  L     ++P+   + AH ++ G P +   +  + + +L S +   L  A  
Sbjct: 47   PPTLKRCIALLQTWKSLPKIKQIHAHSIRTGVPLSDRAFGKHLVFALVSLSPSPLPCAAA 106

Query: 1096 LFDRIPNRTASSWSAMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGC 917
            +F +I      +++ MI GY  +  P  AL     M  + + PD   +   L AC     
Sbjct: 107  VFSQIHLPGVFTYNTMIRGYAESDSPGPALLVHRQMLAAAIPPDTHTYPFLLKACAKLMA 166

Query: 916  LEFGRSMHAHAIVYGLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIG 737
            L  G  +H+ ++  GL    V     LV FY  CG  +SA  VF  M ERN++TW+++I 
Sbjct: 167  LREGEKVHSLSVKNGLE-TCVFVQNTLVHFYGTCGLFESAYKVFEEMDERNLVTWNSIIN 225

Query: 736  GLAMHGLGTEAIKVFDEMVL--AGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYG 563
            G A +G   EA+ +F EM L  +GV+P   TM ++L AC+  G +  G R   L   + G
Sbjct: 226  GFATNGRPNEALTLFREMNLEDSGVKPDGFTMVSLLCACAELGALALGRRA-HLYLVKVG 284

Query: 562  IEPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAG 383
            +   V     ++DL  + G  EEA      M  + T   W SL+      G    G+ A 
Sbjct: 285  LYGNVHVENALIDLYAKCGSIEEAYRVFDEM-ASKTVVSWTSLIVGLAVNG---FGKEAL 340

Query: 382  ERLAA--AEEMAAGDYVMLANLYA 317
            E  +A   E +   +  ++  LYA
Sbjct: 341  ELFSAMERERLVPTEITLVGVLYA 364


>ref|XP_006651966.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like
            [Oryza brachyantha]
          Length = 607

 Score =  312 bits (800), Expect = 6e-96
 Identities = 188/443 (42%), Positives = 250/443 (56%), Gaps = 12/443 (2%)
 Frame = -3

Query: 1423 AAVIHLRRLHAHALRSDVHEPPFW-NSLARSYASHGFPDLALAVCLQMPLR-------DA 1268
            AA   L  L A  LR  V    F  N+L R++A+   P + L      PL        + 
Sbjct: 72   AAPALLEPLVAALLRPSVPLDAFLVNTLIRAHATSPIPSVRLRAASFFPLMLRAAVVPNK 131

Query: 1267 FTFPFAFKLCSLLSAIPEAASLQAHL--LKLGPAAATIYSLNALVSLYS--NLGHLDLAL 1100
            FTFPF  K C+ L   P A  LQAH   LK G  AA  Y+ N L+ +YS    G L  A 
Sbjct: 132  FTFPFLLKACAALPGSP-AVGLQAHAAALKFG-FAADQYASNTLIHMYSCFGAGFLGDAR 189

Query: 1099 QLFDRIPNRTASSWSAMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGG 920
             +FDR+P  +A +WSAMI GY R G   +A+  F  M  +GV  DE   +  LAA T  G
Sbjct: 190  NVFDRMPRESAVTWSAMIGGYVRGGLSTDAIELFREMQANGVRADEVTVIGVLAAATDLG 249

Query: 919  CLEFGRSMHAHAIVYGLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMI 740
             LE  R +       G+   SV    AL+D  AKCG+VD A +VF  M  R+V++W+++I
Sbjct: 250  ALELARWVRGFVEREGIE-KSVTLCNALIDTLAKCGDVDGAVAVFEGMERRSVVSWTSVI 308

Query: 739  GGLAMHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGI 560
              LAM G G EA++VF+EM +AGVRP  V    VL+ACSHAG+V+EG   F   + EYGI
Sbjct: 309  DALAMEGRGKEAVQVFEEMKVAGVRPDDVAFIGVLTACSHAGMVDEGCDYFDAMKTEYGI 368

Query: 559  EPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGE 380
            EP++EH GC+VD+  R GM E A EF+ TMP+ P   IWRSL+ A    G L++G     
Sbjct: 369  EPKIEHYGCMVDMFGRVGMVERAMEFVRTMPMQPNPIIWRSLVSACRAHGRLELGERITR 428

Query: 379  RLAAAEEMAAGDYVMLANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFA 200
             L         +YVML+N++A    W+E  ++R EM+  G++KV G S VE+DGE+H F 
Sbjct: 429  SLLNEYPAHEANYVMLSNVFALTQRWKEKSEIRREMSKKGIKKVPGCSVVELDGEIHEFI 488

Query: 199  MGDKLHPQIIRIHEMLELLNSEL 131
             GD+ HPQ   I+ M+E +  EL
Sbjct: 489  AGDESHPQYKEIYRMVEEMAREL 511


>ref|XP_020521010.1| LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein
            At4g21065-like [Amborella trichopoda]
          Length = 582

 Score =  311 bits (797), Expect = 8e-96
 Identities = 177/431 (41%), Positives = 248/431 (57%), Gaps = 4/431 (0%)
 Frame = -3

Query: 1411 HLRRLHAHALRSDVHEPPFWNSLARSYA-SHGFPDLALAVCLQMP---LRDAFTFPFAFK 1244
            + RR+ AHA   ++     WN+L R Y  +H   D  LA         + + +T  FA +
Sbjct: 62   YARRVFAHAPSPNLFT---WNTLIRGYTRAHAARDALLAYRTMRAHGTVPNGYTLGFALQ 118

Query: 1243 LCSLLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTAS 1064
             C+ +    EA  + A  +KLG A  ++    +L+ +YS   ++  A Q+FD +P R + 
Sbjct: 119  ACAHVRVADEAHEVHADAVKLGLAHGSVGL--SLMRVYSVCCNVYCARQVFDEMPQRGSG 176

Query: 1063 SWSAMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHA 884
             W AM+AGY +NG+P EAL  F  M  SG   D     S L AC   G L  GR +HA+ 
Sbjct: 177  VWGAMVAGYVQNGKPSEALGVFREMQKSGQEVDGFTLASVLGACGALGALNLGRWVHAYI 236

Query: 883  IVYGLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEA 704
              YG+ LD V  AT+LVD Y KCG ++ AR VF  M  ++V+ WS+MIGGLA+HG G EA
Sbjct: 237  DKYGVDLDVV-LATSLVDMYCKCGXLEKARLVFEAMPYKDVMAWSSMIGGLAIHGFGEEA 295

Query: 703  IKVFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVD 524
            +++F  M +A VRP  VT T+VL ACSHAGLV EG R F+  R EY I+P  EH GC+VD
Sbjct: 296  MELFSRMKMAKVRPNSVTFTSVLCACSHAGLVSEGHRQFESMRFEYSIKPEPEHYGCIVD 355

Query: 523  LLFRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGD 344
            +L RAG    A EF+ +MPV P A +WR+LL A    G + +  +  + L   E     +
Sbjct: 356  MLCRAGQLHRAHEFVMSMPVKPNAIMWRTLLNACGIHGDIGLSEIITKHLLELEPQRGEN 415

Query: 343  YVMLANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRI 164
            YV+++N+YA    W +V ++R  M     +K  GFSS+E+   LH F MGD+ HPQ   I
Sbjct: 416  YVLVSNVYASLRRWSDVSEIRGLMRHKRAKKFHGFSSIEVGSILHEFVMGDESHPQWKDI 475

Query: 163  HEMLELLNSEL 131
            +EML  + ++L
Sbjct: 476  YEMLGKIGAKL 486


>ref|XP_021593410.1| pentatricopeptide repeat-containing protein At4g21065 isoform X3
            [Manihot esculenta]
          Length = 614

 Score =  312 bits (799), Expect = 1e-95
 Identities = 168/428 (39%), Positives = 251/428 (58%), Gaps = 6/428 (1%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +AH++ + +  P    WN++ R YA    P+ A+ +  +M +     D  T+PF  K  S
Sbjct: 98   YAHSIFAQIQNPNVFTWNTMIRGYAESENPEPAIELYNRMHVNATEPDTHTYPFLLKAVS 157

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWS 1055
             +  +     + + +++ G   + ++  N+LV +Y+  GH + A +LF+ +P R   +W+
Sbjct: 158  KVVNVRVGEGIHSIVVRNG-FESLVFVQNSLVHMYAACGHYENAYKLFELMPERDIIAWN 216

Query: 1054 AMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVY 875
             +I G+  NG+P+EAL  +  M   GV PD    VS L+AC     L  GR +H + +  
Sbjct: 217  TVINGFALNGKPIEALTLYKEMGLEGVEPDGFTVVSLLSACAELDALALGRRVHTYIVKV 276

Query: 874  GLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKV 695
            GL  +++    AL+D YAKCG +  A+ VFG M ERNV++W+++I GLA++G GTEA++ 
Sbjct: 277  GLN-ENMHVNNALLDLYAKCGNIMEAQKVFGEMEERNVVSWTSLIVGLAVNGFGTEALEH 335

Query: 694  FDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLF 515
            F EM      P+ +T   VL ACSH G+V EG   FK  +E+YGI PR+EH GC+VDLL 
Sbjct: 336  FGEMEKQQFVPSEITYVGVLYACSHCGMVNEGFNYFKRMKEKYGIVPRMEHYGCMVDLLG 395

Query: 514  RAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVM 335
            RAG+ +EA E+I  MP+ P A +WR+LLGA    G L +G +A  +L   E    GDYV+
Sbjct: 396  RAGLVKEAYEYIQNMPLQPNAVVWRTLLGACTIHGHLALGEVARVQLLQLEPKHCGDYVL 455

Query: 334  LANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEM 155
            ++NLYA    W +V  VR  M   GVRK  G S VE+   +H F MGD+ HPQ   I+ M
Sbjct: 456  ISNLYASEQRWSDVHNVRKTMLTQGVRKAPGHSLVELGNCVHEFVMGDRTHPQSEAIYAM 515

Query: 154  LELLNSEL 131
            L  ++ +L
Sbjct: 516  LVEISKKL 523



 Score =  101 bits (251), Expect = 7e-19
 Identities = 85/314 (27%), Positives = 138/314 (43%), Gaps = 7/314 (2%)
 Frame = -3

Query: 1330 ASHGFPDLALAVCLQMPLRDAFTFPFAFKLCSLLSAIPEAASL---QAHLLKLGPAAATI 1160
            ASH  P  A   C + P+      PF  K C  L  I  ++     Q H   +    A I
Sbjct: 27   ASHITPRSA---CPETPI------PFIVKKCIALLQICASSKFKLKQIHAFSIRHGVAPI 77

Query: 1159 ---YSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWSAMIAGYDRNGRPLEALFTFLGM 989
                  + + S+ S    +  A  +F +I N    +W+ MI GY  +  P  A+  +  M
Sbjct: 78   NPDMGKHLIYSIVSLSAPMTYAHSIFAQIQNPNVFTWNTMIRGYAESENPEPAIELYNRM 137

Query: 988  ADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVYGLGLDSVGFA-TALVDFYAKCG 812
              +   PD   +   L A +    +  G  +  H+IV   G +S+ F   +LV  YA CG
Sbjct: 138  HVNATEPDTHTYPFLLKAVSKVVNVRVGEGI--HSIVVRNGFESLVFVQNSLVHMYAACG 195

Query: 811  EVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLS 632
              ++A  +F  M ER+++ W+ +I G A++G   EA+ ++ EM L GV P   T+ ++LS
Sbjct: 196  HYENAYKLFELMPERDIIAWNTVINGFALNGKPIEALTLYKEMGLEGVEPDGFTVVSLLS 255

Query: 631  ACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAPTA 452
            AC+    +  G RV      + G+   +     ++DL  + G   EA++    M      
Sbjct: 256  ACAELDALALGRRVHTYI-VKVGLNENMHVNNALLDLYAKCGNIMEAQKVFGEMEERNVV 314

Query: 451  AIWRSLLGAACTQG 410
            + W SL+      G
Sbjct: 315  S-WTSLIVGLAVNG 327


>ref|XP_021593409.1| pentatricopeptide repeat-containing protein At4g21065 isoform X2
            [Manihot esculenta]
          Length = 619

 Score =  312 bits (799), Expect = 1e-95
 Identities = 168/428 (39%), Positives = 251/428 (58%), Gaps = 6/428 (1%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +AH++ + +  P    WN++ R YA    P+ A+ +  +M +     D  T+PF  K  S
Sbjct: 98   YAHSIFAQIQNPNVFTWNTMIRGYAESENPEPAIELYNRMHVNATEPDTHTYPFLLKAVS 157

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWS 1055
             +  +     + + +++ G   + ++  N+LV +Y+  GH + A +LF+ +P R   +W+
Sbjct: 158  KVVNVRVGEGIHSIVVRNG-FESLVFVQNSLVHMYAACGHYENAYKLFELMPERDIIAWN 216

Query: 1054 AMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVY 875
             +I G+  NG+P+EAL  +  M   GV PD    VS L+AC     L  GR +H + +  
Sbjct: 217  TVINGFALNGKPIEALTLYKEMGLEGVEPDGFTVVSLLSACAELDALALGRRVHTYIVKV 276

Query: 874  GLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKV 695
            GL  +++    AL+D YAKCG +  A+ VFG M ERNV++W+++I GLA++G GTEA++ 
Sbjct: 277  GLN-ENMHVNNALLDLYAKCGNIMEAQKVFGEMEERNVVSWTSLIVGLAVNGFGTEALEH 335

Query: 694  FDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLF 515
            F EM      P+ +T   VL ACSH G+V EG   FK  +E+YGI PR+EH GC+VDLL 
Sbjct: 336  FGEMEKQQFVPSEITYVGVLYACSHCGMVNEGFNYFKRMKEKYGIVPRMEHYGCMVDLLG 395

Query: 514  RAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVM 335
            RAG+ +EA E+I  MP+ P A +WR+LLGA    G L +G +A  +L   E    GDYV+
Sbjct: 396  RAGLVKEAYEYIQNMPLQPNAVVWRTLLGACTIHGHLALGEVARVQLLQLEPKHCGDYVL 455

Query: 334  LANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEM 155
            ++NLYA    W +V  VR  M   GVRK  G S VE+   +H F MGD+ HPQ   I+ M
Sbjct: 456  ISNLYASEQRWSDVHNVRKTMLTQGVRKAPGHSLVELGNCVHEFVMGDRTHPQSEAIYAM 515

Query: 154  LELLNSEL 131
            L  ++ +L
Sbjct: 516  LVEISKKL 523



 Score =  101 bits (251), Expect = 7e-19
 Identities = 85/314 (27%), Positives = 138/314 (43%), Gaps = 7/314 (2%)
 Frame = -3

Query: 1330 ASHGFPDLALAVCLQMPLRDAFTFPFAFKLCSLLSAIPEAASL---QAHLLKLGPAAATI 1160
            ASH  P  A   C + P+      PF  K C  L  I  ++     Q H   +    A I
Sbjct: 27   ASHITPRSA---CPETPI------PFIVKKCIALLQICASSKFKLKQIHAFSIRHGVAPI 77

Query: 1159 ---YSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWSAMIAGYDRNGRPLEALFTFLGM 989
                  + + S+ S    +  A  +F +I N    +W+ MI GY  +  P  A+  +  M
Sbjct: 78   NPDMGKHLIYSIVSLSAPMTYAHSIFAQIQNPNVFTWNTMIRGYAESENPEPAIELYNRM 137

Query: 988  ADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVYGLGLDSVGFA-TALVDFYAKCG 812
              +   PD   +   L A +    +  G  +  H+IV   G +S+ F   +LV  YA CG
Sbjct: 138  HVNATEPDTHTYPFLLKAVSKVVNVRVGEGI--HSIVVRNGFESLVFVQNSLVHMYAACG 195

Query: 811  EVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLS 632
              ++A  +F  M ER+++ W+ +I G A++G   EA+ ++ EM L GV P   T+ ++LS
Sbjct: 196  HYENAYKLFELMPERDIIAWNTVINGFALNGKPIEALTLYKEMGLEGVEPDGFTVVSLLS 255

Query: 631  ACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAPTA 452
            AC+    +  G RV      + G+   +     ++DL  + G   EA++    M      
Sbjct: 256  ACAELDALALGRRVHTYI-VKVGLNENMHVNNALLDLYAKCGNIMEAQKVFGEMEERNVV 314

Query: 451  AIWRSLLGAACTQG 410
            + W SL+      G
Sbjct: 315  S-WTSLIVGLAVNG 327


>gb|OVA05584.1| Pentatricopeptide repeat [Macleaya cordata]
          Length = 524

 Score =  309 bits (791), Expect = 1e-95
 Identities = 172/428 (40%), Positives = 246/428 (57%), Gaps = 6/428 (1%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +AH + S +  P    WN++ R YA    P+ A+ +  QM +     D  T+PF  K  +
Sbjct: 3    YAHKIFSQIQNPNIFTWNTMIRGYAESENPNPAIQLHHQMHISSIEPDTHTYPFLLKAIA 62

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWS 1055
             L A+ E   +    ++ G   + ++  N LV +Y+  G  + A +LFD + +R   +W+
Sbjct: 63   KLMAVREGEMIHCVAIRNG-FESLVFVQNTLVHMYAACGLAENAHKLFDLMSDRNLVTWN 121

Query: 1054 AMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVY 875
            ++I G+  NGRP EAL  F  M  +GV PD    VS L AC   G L+ GR  H +    
Sbjct: 122  SVINGFAVNGRPNEALTLFREMNLAGVEPDGFTMVSLLTACAELGALDLGRRAHVYMFKV 181

Query: 874  GLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKV 695
            GL   ++    AL+D YAKCG +  A  VF  M  R+V++W+++I GLA++G G EA+++
Sbjct: 182  GLN-GNLHAGNALIDLYAKCGTIWEAHKVFDEMLLRSVVSWTSLIVGLAVNGFGKEALEL 240

Query: 694  FDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLF 515
            F E+   G+ P+ ++   VL ACSH G+V++G   FK  RE+YGI P++EH GC+VDLL 
Sbjct: 241  FGELEREGLVPSEISFVGVLYACSHCGMVDKGFEYFKRMREDYGILPKIEHYGCLVDLLG 300

Query: 514  RAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVM 335
            RAG+ +EA +FI  MP+ P A IWR LLGA    G L +G +    L   E    GDYV+
Sbjct: 301  RAGLVQEAHKFIQNMPLEPNAVIWRILLGACMIHGNLALGEVTRAELLRLEPKHCGDYVL 360

Query: 334  LANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEM 155
            L+NLYA    W +V+KVR  M   GVRK  G S VE+   +H F MGDK HPQ   I+E 
Sbjct: 361  LSNLYASEGRWSDVQKVRRTMLRKGVRKSPGHSLVELRNCVHEFVMGDKSHPQTEEIYEK 420

Query: 154  LELLNSEL 131
            LE +  +L
Sbjct: 421  LEEMMKKL 428



 Score =  104 bits (260), Expect = 3e-20
 Identities = 70/236 (29%), Positives = 115/236 (48%), Gaps = 1/236 (0%)
 Frame = -3

Query: 1114 LDLALQLFDRIPNRTASSWSAMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAA 935
            +  A ++F +I N    +W+ MI GY  +  P  A+     M  S + PD   +   L A
Sbjct: 1    MSYAHKIFSQIQNPNIFTWNTMIRGYAESENPNPAIQLHHQMHISSIEPDTHTYPFLLKA 60

Query: 934  CTHGGCLEFGRSMHAHAIVYGLGLDSVGFA-TALVDFYAKCGEVDSARSVFGRMAERNVL 758
                  +  G  +H  AI    G +S+ F    LV  YA CG  ++A  +F  M++RN++
Sbjct: 61   IAKLMAVREGEMIHCVAI--RNGFESLVFVQNTLVHMYAACGLAENAHKLFDLMSDRNLV 118

Query: 757  TWSAMIGGLAMHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLT 578
            TW+++I G A++G   EA+ +F EM LAGV P   TM ++L+AC+  G ++ G R   + 
Sbjct: 119  TWNSVINGFAVNGRPNEALTLFREMNLAGVEPDGFTMVSLLTACAELGALDLGRRA-HVY 177

Query: 577  REEYGIEPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQG 410
              + G+   +     ++DL  + G   EA +    M +  +   W SL+      G
Sbjct: 178  MFKVGLNGNLHAGNALIDLYAKCGTIWEAHKVFDEM-LLRSVVSWTSLIVGLAVNG 232


>ref|XP_007147940.1| hypothetical protein PHAVU_006G167300g [Phaseolus vulgaris]
 gb|ESW19934.1| hypothetical protein PHAVU_006G167300g [Phaseolus vulgaris]
          Length = 611

 Score =  311 bits (797), Expect = 2e-95
 Identities = 167/428 (39%), Positives = 254/428 (59%), Gaps = 6/428 (1%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +A+ + + +H P    WN++ R YA    P  AL    QM +     D  T+PF  K  S
Sbjct: 90   YAYNVFTRIHNPNVFTWNTMIRGYAESQNPSPALHFYRQMTVSCVEPDTHTYPFLLKAIS 149

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWS 1055
                + E  ++ +  ++ G   + ++  N+L+ +Y+  G+ + A ++F+ +  R   +W+
Sbjct: 150  KSLNVREGEAIHSVTIRNG-FQSLVFVQNSLLHIYAACGYTESAYKVFELMKERDLVAWN 208

Query: 1054 AMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVY 875
            ++I G+  NGRP EAL  F  M+  GV PD    VS L+AC   G LE GR +H + +  
Sbjct: 209  SVINGFALNGRPNEALTLFREMSVEGVEPDGFTVVSLLSACAELGALELGRRVHVYLLKV 268

Query: 874  GLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKV 695
            GL  +S     +L+D YAKCG +  A+ VFG M+ERN ++W+++I GLA++G G EA+++
Sbjct: 269  GLRENSY-VTNSLLDLYAKCGTIREAQQVFGEMSERNAVSWTSLIVGLAVNGFGEEALEL 327

Query: 694  FDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLF 515
            F EM   G+ P+ +T   VL ACSH G+++EG   FK   EEYGI PR+EH GC+VDLL 
Sbjct: 328  FKEMEGQGLVPSEITFVGVLYACSHCGMLDEGFNYFKRMEEEYGILPRIEHYGCMVDLLS 387

Query: 514  RAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVM 335
            RAG+ ++A ++I  MPV P A IWR+LLGA    G LD+G +A   +   E   +GDYV+
Sbjct: 388  RAGLVKQAYKYIQNMPVQPNAVIWRTLLGACTIHGHLDLGEIARSHILKLEPKHSGDYVL 447

Query: 334  LANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEM 155
            L+NLYA    W +V+ VR  M   GV+K  G+S VE+   ++ F +GD+ HPQ   ++ +
Sbjct: 448  LSNLYASERRWSDVQVVRRSMLKDGVKKTPGYSLVELGNRVYEFTIGDRSHPQSQDVYAL 507

Query: 154  LELLNSEL 131
            LE +   L
Sbjct: 508  LEKITELL 515



 Score =  107 bits (266), Expect = 9e-21
 Identities = 78/253 (30%), Positives = 121/253 (47%), Gaps = 1/253 (0%)
 Frame = -3

Query: 1165 TIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWSAMIAGYDRNGRPLEALFTFLGMA 986
            TI SL+A +S          A  +F RI N    +W+ MI GY  +  P  AL  +  M 
Sbjct: 80   TIVSLSAPMSY---------AYNVFTRIHNPNVFTWNTMIRGYAESQNPSPALHFYRQMT 130

Query: 985  DSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVYGLGLDSVGFA-TALVDFYAKCGE 809
             S V PD   +   L A +    +  G ++  H++    G  S+ F   +L+  YA CG 
Sbjct: 131  VSCVEPDTHTYPFLLKAISKSLNVREGEAI--HSVTIRNGFQSLVFVQNSLLHIYAACGY 188

Query: 808  VDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLSA 629
             +SA  VF  M ER+++ W+++I G A++G   EA+ +F EM + GV P   T+ ++LSA
Sbjct: 189  TESAYKVFELMKERDLVAWNSVINGFALNGRPNEALTLFREMSVEGVEPDGFTVVSLLSA 248

Query: 628  CSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAPTAA 449
            C+  G +E G RV  +   + G+         ++DL  + G   EA++    M     A 
Sbjct: 249  CAELGALELGRRV-HVYLLKVGLRENSYVTNSLLDLYAKCGTIREAQQVFGEMS-ERNAV 306

Query: 448  IWRSLLGAACTQG 410
             W SL+      G
Sbjct: 307  SWTSLIVGLAVNG 319


>ref|XP_021593408.1| pentatricopeptide repeat-containing protein At4g21065 isoform X1
            [Manihot esculenta]
 gb|OAY29204.1| hypothetical protein MANES_15G125800 [Manihot esculenta]
          Length = 640

 Score =  312 bits (799), Expect = 2e-95
 Identities = 168/428 (39%), Positives = 251/428 (58%), Gaps = 6/428 (1%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +AH++ + +  P    WN++ R YA    P+ A+ +  +M +     D  T+PF  K  S
Sbjct: 98   YAHSIFAQIQNPNVFTWNTMIRGYAESENPEPAIELYNRMHVNATEPDTHTYPFLLKAVS 157

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWS 1055
             +  +     + + +++ G   + ++  N+LV +Y+  GH + A +LF+ +P R   +W+
Sbjct: 158  KVVNVRVGEGIHSIVVRNG-FESLVFVQNSLVHMYAACGHYENAYKLFELMPERDIIAWN 216

Query: 1054 AMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVY 875
             +I G+  NG+P+EAL  +  M   GV PD    VS L+AC     L  GR +H + +  
Sbjct: 217  TVINGFALNGKPIEALTLYKEMGLEGVEPDGFTVVSLLSACAELDALALGRRVHTYIVKV 276

Query: 874  GLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKV 695
            GL  +++    AL+D YAKCG +  A+ VFG M ERNV++W+++I GLA++G GTEA++ 
Sbjct: 277  GLN-ENMHVNNALLDLYAKCGNIMEAQKVFGEMEERNVVSWTSLIVGLAVNGFGTEALEH 335

Query: 694  FDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLF 515
            F EM      P+ +T   VL ACSH G+V EG   FK  +E+YGI PR+EH GC+VDLL 
Sbjct: 336  FGEMEKQQFVPSEITYVGVLYACSHCGMVNEGFNYFKRMKEKYGIVPRMEHYGCMVDLLG 395

Query: 514  RAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVM 335
            RAG+ +EA E+I  MP+ P A +WR+LLGA    G L +G +A  +L   E    GDYV+
Sbjct: 396  RAGLVKEAYEYIQNMPLQPNAVVWRTLLGACTIHGHLALGEVARVQLLQLEPKHCGDYVL 455

Query: 334  LANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEM 155
            ++NLYA    W +V  VR  M   GVRK  G S VE+   +H F MGD+ HPQ   I+ M
Sbjct: 456  ISNLYASEQRWSDVHNVRKTMLTQGVRKAPGHSLVELGNCVHEFVMGDRTHPQSEAIYAM 515

Query: 154  LELLNSEL 131
            L  ++ +L
Sbjct: 516  LVEISKKL 523



 Score =  101 bits (251), Expect = 7e-19
 Identities = 85/314 (27%), Positives = 138/314 (43%), Gaps = 7/314 (2%)
 Frame = -3

Query: 1330 ASHGFPDLALAVCLQMPLRDAFTFPFAFKLCSLLSAIPEAASL---QAHLLKLGPAAATI 1160
            ASH  P  A   C + P+      PF  K C  L  I  ++     Q H   +    A I
Sbjct: 27   ASHITPRSA---CPETPI------PFIVKKCIALLQICASSKFKLKQIHAFSIRHGVAPI 77

Query: 1159 ---YSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWSAMIAGYDRNGRPLEALFTFLGM 989
                  + + S+ S    +  A  +F +I N    +W+ MI GY  +  P  A+  +  M
Sbjct: 78   NPDMGKHLIYSIVSLSAPMTYAHSIFAQIQNPNVFTWNTMIRGYAESENPEPAIELYNRM 137

Query: 988  ADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVYGLGLDSVGFA-TALVDFYAKCG 812
              +   PD   +   L A +    +  G  +  H+IV   G +S+ F   +LV  YA CG
Sbjct: 138  HVNATEPDTHTYPFLLKAVSKVVNVRVGEGI--HSIVVRNGFESLVFVQNSLVHMYAACG 195

Query: 811  EVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLS 632
              ++A  +F  M ER+++ W+ +I G A++G   EA+ ++ EM L GV P   T+ ++LS
Sbjct: 196  HYENAYKLFELMPERDIIAWNTVINGFALNGKPIEALTLYKEMGLEGVEPDGFTVVSLLS 255

Query: 631  ACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAPTA 452
            AC+    +  G RV      + G+   +     ++DL  + G   EA++    M      
Sbjct: 256  ACAELDALALGRRVHTYI-VKVGLNENMHVNNALLDLYAKCGNIMEAQKVFGEMEERNVV 314

Query: 451  AIWRSLLGAACTQG 410
            + W SL+      G
Sbjct: 315  S-WTSLIVGLAVNG 327


>ref|XP_014517439.1| pentatricopeptide repeat-containing protein At4g21065 [Vigna radiata
            var. radiata]
          Length = 611

 Score =  311 bits (796), Expect = 3e-95
 Identities = 168/428 (39%), Positives = 253/428 (59%), Gaps = 6/428 (1%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +A+ + + +H P    WN++ R YA    P LAL +  QM +     D  T+PF  K  S
Sbjct: 90   YAYNVFTMIHNPNVFTWNTMIRGYAESQNPSLALHLYRQMIVSCVEPDTHTYPFLLKAIS 149

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWS 1055
                + E  ++ +  ++ G   + ++  N+L+ +Y+  G  + A ++F+ +  R   +W+
Sbjct: 150  KSLNVREGEAIHSVTIRNG-FQSLLFVQNSLLHIYAACGCTESAYKVFELMKERDLVAWN 208

Query: 1054 AMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVY 875
            ++I G+  NGRP EAL  F  M   G+ PD    VS L+AC   G LE GR +H + +  
Sbjct: 209  SVINGFALNGRPNEALTLFRDMCVEGLEPDGFTVVSLLSACAELGALELGRRVHVYLLKV 268

Query: 874  GLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKV 695
            GL  +S     +L+D YAKCG +  A+ VF  M+ERN ++W+++I GLA++G G EA+++
Sbjct: 269  GLRENSY-VTNSLLDLYAKCGTIREAQQVFSEMSERNAVSWTSLIVGLAVNGFGEEALEL 327

Query: 694  FDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLF 515
            F EM   G+ PT +T   VL ACSH G+++EG   F+  +EEYGI PR+EH GCVVDLL 
Sbjct: 328  FKEMEGQGLVPTEITFVGVLYACSHCGMLDEGFNYFRRMKEEYGIMPRIEHYGCVVDLLS 387

Query: 514  RAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVM 335
            RAG+ ++A E+I  MPV P A IWR+LL A    G LD+G +A   +   E   +GDYV+
Sbjct: 388  RAGLVKQAYEYIQNMPVQPNAVIWRTLLAACTKHGYLDLGEIARSHILKLEPKHSGDYVL 447

Query: 334  LANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEM 155
            L+NLYA    W +V+ VR  M   GV+K  G+S VE+   ++ F MGD+ HPQ   ++ +
Sbjct: 448  LSNLYASERRWTDVQVVRRSMLKDGVKKTPGYSLVELGNRVYEFTMGDRSHPQSRDVYAL 507

Query: 154  LELLNSEL 131
            LE +   L
Sbjct: 508  LEKITELL 515



 Score =  101 bits (251), Expect = 7e-19
 Identities = 75/253 (29%), Positives = 121/253 (47%), Gaps = 1/253 (0%)
 Frame = -3

Query: 1165 TIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWSAMIAGYDRNGRPLEALFTFLGMA 986
            TI SL+A +S          A  +F  I N    +W+ MI GY  +  P  AL  +  M 
Sbjct: 80   TIVSLSAPMSY---------AYNVFTMIHNPNVFTWNTMIRGYAESQNPSLALHLYRQMI 130

Query: 985  DSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVYGLGLDSVGFA-TALVDFYAKCGE 809
             S V PD   +   L A +    +  G ++  H++    G  S+ F   +L+  YA CG 
Sbjct: 131  VSCVEPDTHTYPFLLKAISKSLNVREGEAI--HSVTIRNGFQSLLFVQNSLLHIYAACGC 188

Query: 808  VDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKVFDEMVLAGVRPTPVTMTNVLSA 629
             +SA  VF  M ER+++ W+++I G A++G   EA+ +F +M + G+ P   T+ ++LSA
Sbjct: 189  TESAYKVFELMKERDLVAWNSVINGFALNGRPNEALTLFRDMCVEGLEPDGFTVVSLLSA 248

Query: 628  CSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLFRAGMFEEAREFIATMPVAPTAA 449
            C+  G +E G RV  +   + G+         ++DL  + G   EA++  + M     A 
Sbjct: 249  CAELGALELGRRV-HVYLLKVGLRENSYVTNSLLDLYAKCGTIREAQQVFSEMS-ERNAV 306

Query: 448  IWRSLLGAACTQG 410
             W SL+      G
Sbjct: 307  SWTSLIVGLAVNG 319


>ref|XP_008794509.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g21065-like [Phoenix dactylifera]
          Length = 626

 Score =  311 bits (797), Expect = 3e-95
 Identities = 173/430 (40%), Positives = 248/430 (57%), Gaps = 8/430 (1%)
 Frame = -3

Query: 1396 HAHALRSDVHEPPF--WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCS 1235
            +A A+ S +H P    +N++ R YA    P  AL V  QM       D  T+PF  K C+
Sbjct: 103  YAAAVFSQIHLPGVFTYNTMVRGYAESDSPGPALLVHRQMLAAAVTPDTHTYPFLLKACA 162

Query: 1234 LLSAIPEAASLQAHLLKLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWS 1055
             L A  E   + +  +K G    +++  N LV  Y+  G  + A ++F+ +  R   +W+
Sbjct: 163  KLMAFREGEKVHSLSVKNG-LETSVFVQNTLVHFYATCGLFESAYKVFEGMDERNLVTWN 221

Query: 1054 AMIAGYDRNGRPLEALFTF--LGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAI 881
            ++I G+  NGRP EAL  F  + + DSGV PD    VS L AC   G L  GR  H +  
Sbjct: 222  SIINGFAINGRPNEALTLFREMSLEDSGVKPDGFTMVSLLCACAELGALALGRRAHLYLF 281

Query: 880  VYGLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAI 701
              GL   +V    AL+D YAKCG V+ A  VF  MA + V++W+++I G+A++G G EA+
Sbjct: 282  KVGL-CGNVHVENALIDLYAKCGSVEEAYRVFNEMASKTVVSWTSLIVGMAVNGFGKEAL 340

Query: 700  KVFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDL 521
            ++F  M    + PT +T+  VL ACSH GLV+EG R F   + EY I P++EH GC+VDL
Sbjct: 341  ELFSAMERERLVPTEITLVGVLYACSHCGLVDEGFRYFNRMKNEYNIVPKIEHYGCMVDL 400

Query: 520  LFRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDY 341
            L RAG+ E+A ++I  MP+ P A +WR+LLGA      LD+G++A  RLA  +   +GDY
Sbjct: 401  LGRAGLVEQAHDYIMNMPLEPNAVLWRTLLGACAMHKRLDLGKLAWARLAELDPGHSGDY 460

Query: 340  VMLANLYARFALWEEVRKVRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIH 161
            V+L+NLYA    W EV   +  M   GVRK+ G S VE+   +H F MGD+ H Q   I+
Sbjct: 461  VLLSNLYASVGRWGEVHXAKRSMLKGGVRKMPGHSLVELGNRVHEFVMGDRSHSQSDEIY 520

Query: 160  EMLELLNSEL 131
            +MLE + ++L
Sbjct: 521  KMLEEIANKL 530



 Score =  112 bits (280), Expect = 2e-22
 Identities = 91/315 (28%), Positives = 146/315 (46%), Gaps = 6/315 (1%)
 Frame = -3

Query: 1243 LCSLLSAIPEAASLQAHLLKLG-PAAATIYSLNALVSLYS-NLGHLDLALQLFDRIPNRT 1070
            L     ++P+   + AH ++ G P +   +  + + ++ S +   L  A  +F +I    
Sbjct: 56   LLQTCKSLPKIKQIHAHSIRTGVPLSDRAFGKHLVFAIVSLSPSPLPYAAAVFSQIHLPG 115

Query: 1069 ASSWSAMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHA 890
              +++ M+ GY  +  P  AL     M  + V PD   +   L AC        G  +H+
Sbjct: 116  VFTYNTMVRGYAESDSPGPALLVHRQMLAAAVTPDTHTYPFLLKACAKLMAFREGEKVHS 175

Query: 889  HAIVYGLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGT 710
             ++  GL   SV     LV FYA CG  +SA  VF  M ERN++TW+++I G A++G   
Sbjct: 176  LSVKNGLE-TSVFVQNTLVHFYATCGLFESAYKVFEGMDERNLVTWNSIINGFAINGRPN 234

Query: 709  EAIKVFDEMVL--AGVRPTPVTMTNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCG 536
            EA+ +F EM L  +GV+P   TM ++L AC+  G +  G R   L   + G+   V    
Sbjct: 235  EALTLFREMSLEDSGVKPDGFTMVSLLCACAELGALALGRRA-HLYLFKVGLCGNVHVEN 293

Query: 535  CVVDLLFRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQGTLDVGRMAGERLAA--AE 362
             ++DL  + G  EEA      M  + T   W SL+      G    G+ A E  +A   E
Sbjct: 294  ALIDLYAKCGSVEEAYRVFNEM-ASKTVVSWTSLIVGMAVNG---FGKEALELFSAMERE 349

Query: 361  EMAAGDYVMLANLYA 317
             +   +  ++  LYA
Sbjct: 350  RLVPTEITLVGVLYA 364


>ref|XP_020522997.1| pentatricopeptide repeat-containing protein At4g21065 isoform X2
            [Amborella trichopoda]
          Length = 642

 Score =  311 bits (797), Expect = 4e-95
 Identities = 165/405 (40%), Positives = 244/405 (60%), Gaps = 4/405 (0%)
 Frame = -3

Query: 1354 WNSLARSYASHGFPDLALAVCLQMPLR----DAFTFPFAFKLCSLLSAIPEAASLQAHLL 1187
            +N++ R Y+S   P  AL++   M       D +TFPF  K C+ L  + +   +    +
Sbjct: 137  YNTMIRGYSSRDLPFEALSLYNLMKESGVDCDHYTFPFVLKACARLYLLKKGMEVHGFCV 196

Query: 1186 KLGPAAATIYSLNALVSLYSNLGHLDLALQLFDRIPNRTASSWSAMIAGYDRNGRPLEAL 1007
            KLG  ++ I+  NALV +Y N G + +A ++FD +  R   SWS+ I  Y RNG   EAL
Sbjct: 197  KLG-LSSDIFVQNALVHMYGNCGEVLMAQKVFDGMGKRDVVSWSSAIGCYVRNGLCNEAL 255

Query: 1006 FTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIVYGLGLDSVGFATALVDF 827
              F  M    + PDE   VS ++ACT  G LE G+ +H +    G  L +V  ATAL+D 
Sbjct: 256  DLFQAMQIENMRPDEVTMVSVVSACTSLGALELGKWVHHYLSRNGFEL-TVTLATALMDM 314

Query: 826  YAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIKVFDEMVLAGVRPTPVTM 647
            YAKCG ++    +F +M +RN+LTW+++IGGLA++G   EA+  ++ M  AG+RP  +T+
Sbjct: 315  YAKCGCLEGLLKIFHKMPQRNLLTWTSLIGGLAINGRSLEALTAYESMKEAGMRPDDITL 374

Query: 646  TNVLSACSHAGLVEEGLRVFKLTREEYGIEPRVEHCGCVVDLLFRAGMFEEAREFIATMP 467
              VLSACSH GLV+EG   F   ++E+ +EPR+EH GC+VDLL RAG  E+A +FI ++P
Sbjct: 375  IGVLSACSHGGLVQEGWSHFHSIKQEFKMEPRIEHYGCMVDLLGRAGQLEKAYQFIESLP 434

Query: 466  VAPTAAIWRSLLGAACTQGTLDVGRMAGERLAAAEEMAAGDYVMLANLYARFALWEEVRK 287
            + P + +WR+LLGA  + G L++GR+   R+   E    GDYV+L+N+Y     W +   
Sbjct: 435  IKPNSIMWRTLLGACASHGNLELGRLVSNRILEIELDHEGDYVLLSNIYGGLGRWADKAG 494

Query: 286  VRVEMNDMGVRKVAGFSSVEIDGELHRFAMGDKLHPQIIRIHEML 152
            VR  M + G+ K  G S VE+DG +H F  GD+ HP+   I+EM+
Sbjct: 495  VRNLMRERGIEKRPGCSIVEVDGVIHEFVAGDESHPRYKEINEMV 539



 Score =  131 bits (329), Expect = 9e-29
 Identities = 107/337 (31%), Positives = 156/337 (46%), Gaps = 5/337 (1%)
 Frame = -3

Query: 1405 RRLHAHALRSDVHEPPFWNSLARSYASHGFPDLALAVCLQMPLRDAFTFPFAFKLCSLLS 1226
            R+  +  L  + H  P   S   S  SH    LA+ + LQ           +++ C+ + 
Sbjct: 25   RKYASDFLSLNTHFIPNLASHFLSLNSHCMATLAIQI-LQSSQLTVHEPMQSYRKCTTME 83

Query: 1225 AIPEAASLQAHLLKLGPAAATIYSLNAL----VSLYSNLGHLDLALQLFDRIPNRTASSW 1058
               +A  + A L+K G  +  +Y+   L    VS  SN+G    A  +FD+I NR   S+
Sbjct: 84   ---QALQIHAILIKTGLNSNPLYTREILKFSAVSPDSNMG---FARSIFDQIQNRDVISY 137

Query: 1057 SAMIAGYDRNGRPLEALFTFLGMADSGVCPDEAAFVSTLAACTHGGCLEFGRSMHAHAIV 878
            + MI GY     P EAL  +  M +SGV  D   F   L AC     L+ G  +H   + 
Sbjct: 138  NTMIRGYSSRDLPFEALSLYNLMKESGVDCDHYTFPFVLKACARLYLLKKGMEVHGFCVK 197

Query: 877  YGLGLDSVGFATALVDFYAKCGEVDSARSVFGRMAERNVLTWSAMIGGLAMHGLGTEAIK 698
             GL  D +    ALV  Y  CGEV  A+ VF  M +R+V++WS+ IG    +GL  EA+ 
Sbjct: 198  LGLSSD-IFVQNALVHMYGNCGEVLMAQKVFDGMGKRDVVSWSSAIGCYVRNGLCNEALD 256

Query: 697  VFDEMVLAGVRPTPVTMTNVLSACSHAGLVEEGLRVFK-LTREEYGIEPRVEHCGCVVDL 521
            +F  M +  +RP  VTM +V+SAC+  G +E G  V   L+R   G E  V     ++D+
Sbjct: 257  LFQAMQIENMRPDEVTMVSVVSACTSLGALELGKWVHHYLSRN--GFELTVTLATALMDM 314

Query: 520  LFRAGMFEEAREFIATMPVAPTAAIWRSLLGAACTQG 410
              + G  E   +    MP       W SL+G     G
Sbjct: 315  YAKCGCLEGLLKIFHKMP-QRNLLTWTSLIGGLAING 350


Top