BLASTX nr result

ID: Cheilocostus21_contig00014726 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00014726
         (1972 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_018676479.1| PREDICTED: pentatricopeptide repeat-containi...   654   0.0  
ref|XP_008782338.1| PREDICTED: pentatricopeptide repeat-containi...   565   0.0  
ref|XP_010926377.1| PREDICTED: pentatricopeptide repeat-containi...   548   0.0  
gb|OAY66701.1| Pentatricopeptide repeat-containing protein [Anan...   509   e-170
ref|XP_020685221.1| putative pentatricopeptide repeat-containing...   474   e-156
ref|XP_020586611.1| putative pentatricopeptide repeat-containing...   457   e-150
gb|PIA46344.1| hypothetical protein AQUCO_01500103v1 [Aquilegia ...   453   e-148
ref|XP_010043756.1| PREDICTED: pentatricopeptide repeat-containi...   447   e-146
gb|PKA49143.1| Pentatricopeptide repeat-containing protein [Apos...   446   e-145
ref|XP_010263222.1| PREDICTED: pentatricopeptide repeat-containi...   441   e-143
ref|XP_021897472.1| pentatricopeptide repeat-containing protein ...   434   e-141
ref|XP_002277337.2| PREDICTED: pentatricopeptide repeat-containi...   432   e-140
gb|OVA19051.1| Pentatricopeptide repeat [Macleaya cordata]            429   e-139
ref|XP_007047218.1| PREDICTED: pentatricopeptide repeat-containi...   428   e-138
ref|XP_009787964.1| PREDICTED: pentatricopeptide repeat-containi...   427   e-138
ref|XP_021714284.1| pentatricopeptide repeat-containing protein ...   424   e-137
ref|XP_016511607.1| PREDICTED: pentatricopeptide repeat-containi...   424   e-137
gb|ONH96536.1| hypothetical protein PRUPE_7G135300 [Prunus persica]   422   e-136
ref|XP_021751703.1| pentatricopeptide repeat-containing protein ...   422   e-136
ref|XP_017219367.1| PREDICTED: pentatricopeptide repeat-containi...   422   e-136

>ref|XP_018676479.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
            chloroplastic-like [Musa acuminata subsp. malaccensis]
          Length = 629

 Score =  654 bits (1688), Expect = 0.0
 Identities = 335/537 (62%), Positives = 405/537 (75%), Gaps = 2/537 (0%)
 Frame = +3

Query: 366  MLWQTPTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYADAGD 545
            M W+TP LS   +RSLL+ +AA KSLP+IAQLHQHL A GL  D  ++TKL++LYADAGD
Sbjct: 1    MRWRTPILSASSVRSLLREVAATKSLPQIAQLHQHLTAAGLTGDPFLSTKLLELYADAGD 60

Query: 546  LSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRS 725
            L  AL LFA LPSPSVFAWTPILALLSRS +H RCLA+YRSMR ++IAPDGYVFP VLRS
Sbjct: 61   LPSALSLFAVLPSPSVFAWTPILALLSRSGHHPRCLAAYRSMRFAAIAPDGYVFPPVLRS 120

Query: 726  AAADYHQHIS-TLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSW 902
            AA  +H   + +LH                     YSKSGDLAAAR  FD     DLLSW
Sbjct: 121  AAGHHHPTATASLHADAVKFAAATALPVANALINAYSKSGDLAAARHAFDFEDGSDLLSW 180

Query: 903  NSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLAL 1082
            NSIIAAY+ A  ++ ALGLLDSMRSDG DPD+VTWNT+MDGYCRAGRCSEAR+IL  L  
Sbjct: 181  NSIIAAYAIAGCIEPALGLLDSMRSDGYDPDVVTWNTIMDGYCRAGRCSEAREILDALPQ 240

Query: 1083 PNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRA 1262
            PN +SWTTVISGYSR+GNHEAALEIFSRMM AG+VPPD DTL+CVA+S R   A+ A  +
Sbjct: 241  PNAVSWTTVISGYSRSGNHEAALEIFSRMMHAGTVPPDPDTLSCVASSLRHVEALGAALS 300

Query: 1263 VHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXX 1442
            VHAYGL+T+ VD FYR AGAAL+ LYAS GRIS A+ VF+MM P DVVTWNA+I      
Sbjct: 301  VHAYGLKTTAVDAFYRVAGAALVALYASRGRISTAKVVFDMMSPADVVTWNAVILGFGRA 360

Query: 1443 XXXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKH-DAGCRCTVTGMN 1619
                     F  M+SRGI++N TT++ V  L ++KLGKE+HAH ++  DA    TV   N
Sbjct: 361  GLTRVALEYFSAMLSRGIRSNCTTIATVLPLCDMKLGKEVHAHVIRQSDAYASSTV--QN 418

Query: 1620 SLIEMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNGFKP 1799
            +LI+MYSR+GC++ AHQ+F+ I+  DVVTWNT+IGAYG+HG G  A++L ++M+R G KP
Sbjct: 419  TLIDMYSRAGCVDRAHQVFAKIEVDDVVTWNTMIGAYGAHGLGTQALQLAHRMVRRGLKP 478

Query: 1800 NPVTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            + VTLTSTL+ACA CGLV EGL+F E ++RD+G +P++E YACVVD+LARAGRFEEA
Sbjct: 479  DAVTLTSTLVACARCGLVGEGLEFLETVVRDLGLVPSKEQYACVVDLLARAGRFEEA 535


>ref|XP_008782338.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
            chloroplastic [Phoenix dactylifera]
          Length = 620

 Score =  565 bits (1456), Expect = 0.0
 Identities = 297/535 (55%), Positives = 364/535 (68%)
 Frame = +3

Query: 366  MLWQTPTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYADAGD 545
            M W+    +F  L SLLQ L+ + SLP+IAQLHQHL   GL      ATKL+QLYADAGD
Sbjct: 1    MRWRAAA-AFSSLESLLQALSTSGSLPQIAQLHQHLLVRGLAAAPFRATKLLQLYADAGD 59

Query: 546  LSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRS 725
            L  AL+LFA LP PSVFAWTPILALLSRS +H RCLA+Y +MRS+++ PDGYVFP VLRS
Sbjct: 60   LPSALRLFAALPCPSVFAWTPILALLSRSGDHLRCLATYAAMRSAAVPPDGYVFPSVLRS 119

Query: 726  AAADYHQHISTLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSWN 905
            +AA  H   S +H                     YSK GD A+ARR FDL   RDL+SWN
Sbjct: 120  SAAAAHP--SAVHADAVKFGSDAVLPVRNALIDAYSKVGDTASARRVFDLTDGRDLVSWN 177

Query: 906  SIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLALP 1085
            S+I+AY +A  V  AL LL S+ SDGC+PDLVTWN +MDGY RAGRC EA  I   ++ P
Sbjct: 178  SMISAYVNAGSVGLALDLLTSVESDGCEPDLVTWNIVMDGYSRAGRCDEALKIFDQISDP 237

Query: 1086 NTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRAV 1265
            N +SWTT+IS YSR GN EAAL IF RM  A +VPPD DTL+CV + CR    +  GR  
Sbjct: 238  NVVSWTTLISCYSRCGNQEAALAIFRRMTGAATVPPDQDTLSCVISCCRNVARLANGRGA 297

Query: 1266 HAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXXX 1445
            HAYGL+T   D FY SAGAALI +YAS G++S A  VF MMDP DVV WNA+I       
Sbjct: 298  HAYGLKTMAADAFYSSAGAALITMYASCGKLSTAEQVFLMMDPADVVAWNAVILGFAHCG 357

Query: 1446 XXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVTGMNSL 1625
                    FR M SRGI+++ TT++ +    +L LGK++HAHA +  AG    V   N+L
Sbjct: 358  LDHTALEHFRAMQSRGIRSDETTVATILPACDLNLGKQIHAHAARRYAGSSALV--WNAL 415

Query: 1626 IEMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNGFKPNP 1805
            + MYSRSGCI+ A+ +FS + ++DVV+WNT+IGAYGSHG GK A+E ++ M   G KPN 
Sbjct: 416  MNMYSRSGCIKDAYLVFSRMVSRDVVSWNTMIGAYGSHGLGKEALEFMDLMKGLGPKPNA 475

Query: 1806 VTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            +T T+ L+AC+H G+VDEGL+ FE+L R  G  PT E YACVVD+LARAGRFEEA
Sbjct: 476  ITFTNALVACSHGGMVDEGLELFESLTRSCGSAPTMEQYACVVDLLARAGRFEEA 530


>ref|XP_010926377.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
            chloroplastic-like [Elaeis guineensis]
          Length = 620

 Score =  548 bits (1413), Expect = 0.0
 Identities = 286/535 (53%), Positives = 356/535 (66%)
 Frame = +3

Query: 366  MLWQTPTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYADAGD 545
            M W+T   +F  L SL Q L+A +SLP+IAQLHQ L   GL      ATKL+QLYADA D
Sbjct: 1    MRWRTAA-AFSSLESLFQALSATRSLPQIAQLHQQLLVRGLAPAPFRATKLLQLYADASD 59

Query: 546  LSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRS 725
            L  AL+LFA LP PSVFAWTPILAL SRS +H RCLA Y +MR +++APDGYVFP VLRS
Sbjct: 60   LPSALRLFAALPRPSVFAWTPILALFSRSGDHLRCLAGYAAMRYAAVAPDGYVFPSVLRS 119

Query: 726  AAADYHQHISTLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSWN 905
            + A  H   S +H                     YSK GD A+AR  FDL   RDL+SWN
Sbjct: 120  SVAAAHP--SAVHADAVKFRSDAVLPVRNALVDAYSKVGDTASARCVFDLTDGRDLVSWN 177

Query: 906  SIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLALP 1085
            S+I  Y +A  +  A  LL SM SDGC+PDLVTWN +MDGY RAGR  EA  I   ++ P
Sbjct: 178  SMICGYVNAGCIGLARELLRSMESDGCEPDLVTWNIVMDGYSRAGRSDEALKIFDQISDP 237

Query: 1086 NTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRAV 1265
            N +SWTT+IS YSR GNHEAAL IF RM+ A ++PPD DTL+CV + CR    +  GR V
Sbjct: 238  NVVSWTTLISCYSRCGNHEAALAIFRRMLSAATIPPDQDTLSCVISCCRNVARLANGREV 297

Query: 1266 HAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXXX 1445
            HAYGL+T   D FY SAGAAL+ +YAS  ++S A   F MMDP DVV WNA+I       
Sbjct: 298  HAYGLKTMAADAFYSSAGAALVTMYASCNKLSTAEQAFLMMDPADVVAWNAVILGFAHSG 357

Query: 1446 XXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVTGMNSL 1625
                    FRD+ SRG +++ TT++ +  + +L LGK++HAH  +H  G    V   N+L
Sbjct: 358  LDHTALEYFRDLQSRGNRSDETTVATILPVCDLNLGKQIHAHVARHHTGSSPLV--WNAL 415

Query: 1626 IEMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNGFKPNP 1805
            + MY+RSGCI  A+ +FS + ++DVV+WNT+IGAYGSHG GK A+EL++ M   G KPN 
Sbjct: 416  MNMYARSGCIRDAYLVFSRMVSRDVVSWNTMIGAYGSHGLGKEALELMDLMKGLGPKPNA 475

Query: 1806 VTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            +T T+ LMAC+HCG+VDEGL+ FE L +  G +PT E YACVVD+LARAGRF EA
Sbjct: 476  ITFTNALMACSHCGMVDEGLELFENLSQSWGLVPTMEQYACVVDLLARAGRFGEA 530


>gb|OAY66701.1| Pentatricopeptide repeat-containing protein [Ananas comosus]
          Length = 643

 Score =  509 bits (1312), Expect = e-170
 Identities = 288/544 (52%), Positives = 351/544 (64%), Gaps = 21/544 (3%)
 Frame = +3

Query: 402  LRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHS-VATKLIQLYADAGDLSCALQLFATL 578
            LRSLL V A   SLP+IAQLHQHL + GL+     + TKL+ +YA  GDL C L+LF+ L
Sbjct: 13   LRSLLSV-APTLSLPQIAQLHQHLLSRGLLLSSPFLLTKLLTVYASLGDLPCTLRLFSLL 71

Query: 579  PSP-SVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYV-FPLVLRSAAADY---- 740
            P P S+FAWTP+L+LLSRS+ H  CL+SY  +RSS ++P   +  P VLRSAAA      
Sbjct: 72   PHPRSLFAWTPLLSLLSRSSLHLLCLSSYSLLRSSPLSPPPTLPLPPVLRSAAAAPAPAA 131

Query: 741  -HQHISTLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDL--AGCRDLLSWNSI 911
                +  LH                     Y++ GDLAAARR FDL  A  RDLLSWNS+
Sbjct: 132  PRLAVPALHADALKFASDGPLPVANALVSAYARRGDLAAARRAFDLIPARRRDLLSWNSV 191

Query: 912  IAAYSSADRVDAALG----LLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILF-GL 1076
            +AAY++A      L     LL SMRS G +PDLVTWNTLMDGYCRAGRC EARDIL  G 
Sbjct: 192  LAAYAAAAAASGGLPELELLLRSMRSQGLEPDLVTWNTLMDGYCRAGRCDEARDILLRGA 251

Query: 1077 ALPNTISWTTVISGYSRTGNHEAALEIFSRMMFA-----GSVPPDLDTLACVAASCRQAT 1241
              PN +SWTTVISGY+R  NHEA+L +F RM        G V PD D LACV +SCR  +
Sbjct: 252  PDPNAVSWTTVISGYARAANHEASLRLFQRMTTTTTTSRGGVSPDADALACVVSSCRHVS 311

Query: 1242 AVVAGRAVHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAM 1421
            A + GR VHAYGL+T   ++FY SAGAAL+ +YAS  +IS  RSVF  MDPTDVVTWN++
Sbjct: 312  AFLHGREVHAYGLKTMDRESFYGSAGAALVAMYASCSKISAVRSVFRAMDPTDVVTWNSV 371

Query: 1422 IQXXXXXXXXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKH-DAGCR 1598
            I                + M SRGI +N T L+A     NL  GK++H +A +H D    
Sbjct: 372  IHALVHAGLPSAALDHLKQMHSRGIPSNATALAAALPACNLNSGKQIHGYATRHLDKPSS 431

Query: 1599 CTVTGMNSLIEMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQM 1778
              VT  N+LI MYS+SGCI AAH +FS+ DAKDVV+WNT+IGA  +HG  + A+ELV  M
Sbjct: 432  NLVT--NALISMYSKSGCIGAAHLVFSTTDAKDVVSWNTMIGACAAHGLARQALELVRLM 489

Query: 1779 IRNGFKPNPVTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGR 1958
             R+G + + VT TS LMAC+HCG  DEGL+ FE L+RD G  PT E YAC+VDML RA R
Sbjct: 490  RRSGARLDAVTFTSALMACSHCGFADEGLELFERLVRDEGLTPTMEQYACIVDMLGRAAR 549

Query: 1959 FEEA 1970
            FEEA
Sbjct: 550  FEEA 553



 Score = 63.9 bits (154), Expect = 8e-07
 Identities = 62/305 (20%), Positives = 110/305 (36%), Gaps = 8/305 (2%)
 Frame = +3

Query: 501  SVATKLIQLYADAGDLSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSS 680
            S    L+ +YA    +S    +F  +    V  W  ++  L  +   +  L   + M S 
Sbjct: 335  SAGAALVAMYASCSKISAVRSVFRAMDPTDVVTWNSVIHALVHAGLPSAALDHLKQMHSR 394

Query: 681  SIAPDGYVFPLVLRSAAADYHQHI---STLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLA 851
             I  +       L +   +  + I   +T H                     YSKSG + 
Sbjct: 395  GIPSNATALAAALPACNLNSGKQIHGYATRHLDKPSSNLVTNALISM-----YSKSGCIG 449

Query: 852  AARRTFDLAGCRDLLSWNSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYC 1031
            AA   F     +D++SWN++I A ++      AL L+  MR  G   D VT+ + +    
Sbjct: 450  AAHLVFSTTDAKDVVSWNTMIGACAAHGLARQALELVRLMRRSGARLDAVTFTSALMACS 509

Query: 1032 RAGRCSEARDILFGLA-----LPNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPD 1196
              G   E  ++   L       P    +  ++    R    E A+    +M  + +    
Sbjct: 510  HCGFADEGLELFERLVRDEGLTPTMEQYACIVDMLGRAARFEEAVGFIGKMPVSATAV-- 567

Query: 1197 LDTLACVAASCRQATAVVAGRAVHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSV 1376
                  + ++CR    V  G       +R+ P +         +  +YA AGR+  A+ V
Sbjct: 568  --VWGALLSACRMHHNVECGELAFEELVRSEPENP---GNFVTMSNIYAKAGRLEDAKRV 622

Query: 1377 FEMMD 1391
              M++
Sbjct: 623  RRMIE 627


>ref|XP_020685221.1| putative pentatricopeptide repeat-containing protein At1g03510
            [Dendrobium catenatum]
 gb|PKU69491.1| Pentatricopeptide repeat-containing protein [Dendrobium catenatum]
          Length = 632

 Score =  474 bits (1221), Expect = e-156
 Identities = 260/544 (47%), Positives = 341/544 (62%), Gaps = 8/544 (1%)
 Frame = +3

Query: 363  LMLWQTP-TLSFYCLRSLLQVLAAAKSLPKIAQLHQH--LAATGLVRDHSVA----TKLI 521
            L  W+ P  L+ Y   SLL  L ++ S    A +H H  L   GL+   S +    TKL+
Sbjct: 2    LSRWKAPQALAPYAFESLLSSLLSSPSTAASAAVHVHSQLLTNGLLPPSSTSGYFNTKLL 61

Query: 522  QLYADAGDLSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGY 701
            QLYAD GDL  AL LF  LP P++FAWTPI+A LSRS +H RCL++Y  MR++ IAPDGY
Sbjct: 62   QLYADGGDLHSALHLFDVLPHPNIFAWTPIIAFLSRSGDHQRCLSTYSRMRAAGIAPDGY 121

Query: 702  VFPLVLRSAAADYHQHISTLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAG 881
            V P+ LRS+ +      + LH                     Y+ SGD+++A R F    
Sbjct: 122  VLPVALRSSGSSLFA-AAALHSNAVKFAAAANLHVSNALITAYADSGDVSSAGRVFVTMD 180

Query: 882  CRDLLSWNSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARD 1061
             RDLLSWNS+I+A+ S    + AL LL SM S G +PD+VTWNT++DGYCR GRC+EA +
Sbjct: 181  GRDLLSWNSMISAFVSTGSTEPALDLLRSMPSVGYEPDIVTWNTILDGYCRVGRCAEAVE 240

Query: 1062 ILFGLALPNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQAT 1241
            I   +  PN +S+T +I G++R+GNHEAALEIF RM   G+VPPD DTL+CV A CR   
Sbjct: 241  IFNQMTEPNVVSYTIIIVGHARSGNHEAALEIFRRMASGGAVPPDQDTLSCVVACCRHVA 300

Query: 1242 AVVAGRAVHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAM 1421
            AV AGR VHA GL+T     FY SAGAAL+ LYA +  I+ A+ VF+++ PTD++  NA+
Sbjct: 301  AVRAGREVHASGLKTLDPAAFYFSAGAALVALYAGSKLITTAKRVFDLIVPTDLMKLNAL 360

Query: 1422 IQXXXXXXXXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRC 1601
            I               FR+M SRGI  + TTL++V    ++  GK++ AHA+++   C  
Sbjct: 361  IAGLTHAGMTREALHHFREMQSRGIGTDPTTLASVLPACSIIQGKQIQAHAIRN--YCEL 418

Query: 1602 TVTGMNSLIEMYSRSGCIEAAHQLFSSID-AKDVVTWNTIIGAYGSHGRGKPAVELVNQM 1778
                 N+LI  Y+RSGCI AA  +FS+   A DVVTWN +I AYGSHG G+ AVEL  +M
Sbjct: 419  ATEVYNALISAYARSGCIAAARSVFSAASPAGDVVTWNAMITAYGSHGLGELAVELAWEM 478

Query: 1779 IRNGFKPNPVTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGR 1958
            IR G +PN +T T  L AC+H GLVD+GL++F+    D+G  P    Y+CVVDML RAGR
Sbjct: 479  IRAGPRPNIITFTGVLAACSHAGLVDQGLEWFDRFSIDMGLAPEMAQYSCVVDMLGRAGR 538

Query: 1959 FEEA 1970
            FEEA
Sbjct: 539  FEEA 542


>ref|XP_020586611.1| putative pentatricopeptide repeat-containing protein At1g03510
            [Phalaenopsis equestris]
          Length = 632

 Score =  457 bits (1177), Expect = e-150
 Identities = 251/541 (46%), Positives = 335/541 (61%), Gaps = 8/541 (1%)
 Frame = +3

Query: 372  WQTP-TLSFYCLRSLLQVLAAA--KSLPKIAQLHQHLAATGLVRDHSVA----TKLIQLY 530
            W+ P TL  Y   SLL  L+ +  KS    A +H  L   GL+   S +    TKL+QLY
Sbjct: 5    WKAPETLKPYFFESLLSSLSTSPSKSATAAAHIHSQLLTNGLLPPSSPSGYLNTKLLQLY 64

Query: 531  ADAGDLSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFP 710
            AD GDL  AL LF +LP P++FAWTPI+A LSR  +H R L++Y  MR++ I PDGYV P
Sbjct: 65   ADGGDLHSALHLFDSLPHPNIFAWTPIIAFLSRIGDHRRSLSTYSRMRAAGIPPDGYVLP 124

Query: 711  LVLRSAAADYHQHISTLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRD 890
            + LR+A    H   + LH                     Y+ SGDL +A R F     RD
Sbjct: 125  VALRAATFS-HFAAAALHSNAVKFAATANLHVSNALIKAYAYSGDLTSADRVFASMDRRD 183

Query: 891  LLSWNSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILF 1070
            LLSWNS+I+ + SA   D AL L  SM SDG +PD+VTWNT++DGYCRAGRC+EA +I  
Sbjct: 184  LLSWNSMISVFVSAGSTDPALELFRSMASDGYEPDIVTWNTILDGYCRAGRCAEAVEIFN 243

Query: 1071 GLALPNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVV 1250
             +  PN +S+TT+I G++R+GNH+AA+ IF RM+  G+V PD D L+CV A CR   A+ 
Sbjct: 244  RVKDPNVVSYTTIILGHARSGNHDAAIGIFRRMVSGGAVSPDQDMLSCVVACCRHLGALR 303

Query: 1251 AGRAVHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQX 1430
            AG  VHA+GL+T     FY SAGAAL+ LY+    I+ +R VF+ MDPTD++  NA+I  
Sbjct: 304  AGGEVHAFGLKTLNPVAFYCSAGAALVALYSGNKLITYSRRVFDSMDPTDLMKRNALIAG 363

Query: 1431 XXXXXXXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVT 1610
                        QF++     +  + TTL++V    ++  GK++HAHA+++       V 
Sbjct: 364  LTHAGMVSEALDQFKETQLWDVGTDPTTLASVLPACSMIQGKQIHAHAIRNYYDFATEV- 422

Query: 1611 GMNSLIEMYSRSGCIEAAHQLF-SSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRN 1787
              N+LI +Y+  GCI AA  +F ++  A+DVVTWN +I AYGSHG G PA+EL+ +M+R 
Sbjct: 423  -YNALISVYASGGCITAAWSVFLAACMARDVVTWNAMIAAYGSHGLGGPAIELIREMMRA 481

Query: 1788 GFKPNPVTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEE 1967
            G +PN +T TS L AC H GLVD GL++F+    DVG +P   HYACVVDML RAGR EE
Sbjct: 482  GLRPNLITFTSVLSACNHAGLVDVGLEWFDRFRNDVGLVPEMVHYACVVDMLGRAGRLEE 541

Query: 1968 A 1970
            A
Sbjct: 542  A 542


>gb|PIA46344.1| hypothetical protein AQUCO_01500103v1 [Aquilegia coerulea]
          Length = 641

 Score =  453 bits (1166), Expect = e-148
 Identities = 241/540 (44%), Positives = 324/540 (60%), Gaps = 2/540 (0%)
 Frame = +3

Query: 357  SNLMLWQTPTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYAD 536
            SN     T TL    L  LLQ  +  K+L +  Q+HQ +   G   ++ + TKL+Q+YAD
Sbjct: 14   SNTQTTITYTLLPSQLNHLLQCCSETKALKQGKQVHQQIITLGFGSNYFIITKLVQMYAD 73

Query: 537  AGDLSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLV 716
              DL  A  LF  LP P+VFAWT IL   SR+  +  CL +Y  M+   + PD YVFP V
Sbjct: 74   CNDLVFAHDLFDKLPQPNVFAWTAILGYYSRNGMYKECLETYSDMKVHGVRPDRYVFPKV 133

Query: 717  LR--SAAADYHQHISTLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRD 890
            L+  S ++   + I  +H                     YSK GD+ +ARR FD  G RD
Sbjct: 134  LKACSQSSSLKKGIC-IHTDIVRFGAEMNPQVCNSLIDMYSKCGDVGSARRVFDEMGERD 192

Query: 891  LLSWNSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILF 1070
            LLSWN++I+ Y S   VD A+ L+ SMR DG +PDLV+WNT+MD YCR G C EA  I  
Sbjct: 193  LLSWNTLISGYVSNGFVDLAIKLIQSMRLDGVEPDLVSWNTVMDAYCRMGLCEEASRIFE 252

Query: 1071 GLALPNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVV 1250
             +  PN ISWTT+ISGYSR G H+ AL IF  MM +  V PD D L+ +   CR   A +
Sbjct: 253  CIDKPNIISWTTLISGYSRIGKHDIALGIFRNMMMSRDVVPDSDALSNILVCCRLVGAYL 312

Query: 1251 AGRAVHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQX 1430
             GR +H YG++T  +  FY SAGAAL+ +YA   R   A +VFE+MD +DVVTWNAMI  
Sbjct: 313  NGREIHGYGIKTQTLPEFYNSAGAALVTMYARCRRAQCAATVFELMDKSDVVTWNAMILA 372

Query: 1431 XXXXXXXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVT 1610
                        +F  M +RGI+ +  T+SA+    +LK GK++HA+  ++D     T+ 
Sbjct: 373  FVHLGMGDLALTRFSQMQTRGIKNDEVTISALLPACDLKFGKQIHAYIRRNDFDSAITI- 431

Query: 1611 GMNSLIEMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNG 1790
              N+LI MYS+SGC+E A+ +F ++  KD+++WNT+IG YG HG G  A++L+  M   G
Sbjct: 432  -WNALINMYSKSGCVEGAYNVFMNMRMKDIISWNTMIGGYGMHGNGVAALQLLRDMCHAG 490

Query: 1791 FKPNPVTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
             +PN VT TS L AC+H GLVDEGLQ F +L ++ G   T E +ACVVD+LARAG+ E+A
Sbjct: 491  IQPNSVTFTSALSACSHSGLVDEGLQLFCSLSQEYGITLTMEQFACVVDLLARAGQLEDA 550


>ref|XP_010043756.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
            chloroplastic [Eucalyptus grandis]
          Length = 640

 Score =  447 bits (1150), Expect = e-146
 Identities = 239/534 (44%), Positives = 329/534 (61%), Gaps = 1/534 (0%)
 Frame = +3

Query: 372  WQTPTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYADAGDLS 551
            WQT   S     +LLQ+   A SL +  Q+HQ +   GL  D  VATKL Q YAD  D +
Sbjct: 4    WQT--FSPLEATALLQLCRDAGSLAQCKQVHQRIVQHGLQADQFVATKLTQSYADLDDAA 61

Query: 552  CALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRSAA 731
             A +LF  L  P+VFAWT +LA  SRS     C+ +YR M+ + +APDGYVFP VLR+ A
Sbjct: 62   SADRLFRRLLRPNVFAWTAVLAHRSRSGAFGACVGAYRDMKRAGVAPDGYVFPSVLRACA 121

Query: 732  ADYHQHI-STLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSWNS 908
                  + + +H                     Y K G + +ARR FD  G RDLLSWNS
Sbjct: 122  QSGRAEVGAVVHRDVIARACEANVQVCNALIDMYGKCGHVESARRVFDEMGDRDLLSWNS 181

Query: 909  IIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLALPN 1088
            +++ Y       +A+ LL +MRS+G DPD+VTWN +MD YC+ G   EAR++L  +  PN
Sbjct: 182  VMSGYIWNGLHGSAVELLPAMRSEGLDPDIVTWNMVMDAYCQMGLFDEARNVLEQIEEPN 241

Query: 1089 TISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRAVH 1268
            TISWTT+ISGY R G H+ +LE+F  M+ AG V PD+ TL+    SCR   A+V G+ +H
Sbjct: 242  TISWTTLISGYCRIGKHKTSLEVFWDMVNAGKVWPDIRTLSSALTSCRHLGALVIGKEIH 301

Query: 1269 AYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXXXX 1448
            A+G++T    +FY SAGAAL+ +YA  G+I  A  VFE MD +D VTWNAMI        
Sbjct: 302  AHGIKTESRSSFYSSAGAALLTMYAKCGKIQEATRVFEQMDKSDFVTWNAMILGFVDLGY 361

Query: 1449 XXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVTGMNSLI 1628
                   F +M++ GI  +  T+S +  + +L LGK++HA+  K+ +     V   N+LI
Sbjct: 362  GNSAIVCFIEMLNLGIAFDQITISTLLPVCDLTLGKQIHAYIQKNSS--LSGVITWNALI 419

Query: 1629 EMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNGFKPNPV 1808
             MYS+ GCI++A  +FS+++ KD+++WN+IIG +G HG G+ A++L+ +M  +GF+PN V
Sbjct: 420  HMYSKQGCIQSALSIFSNMETKDIISWNSIIGGFGMHGHGEAALQLLQEMKVSGFQPNSV 479

Query: 1809 TLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            TLTS L AC+H GLV EGL+ F ++    G  P  EH+ACVVDMLARAG+ EEA
Sbjct: 480  TLTSALSACSHSGLVHEGLRLFNSM-PSFGLTPAMEHFACVVDMLARAGQLEEA 532


>gb|PKA49143.1| Pentatricopeptide repeat-containing protein [Apostasia shenzhenica]
          Length = 648

 Score =  446 bits (1147), Expect = e-145
 Identities = 246/534 (46%), Positives = 322/534 (60%), Gaps = 5/534 (0%)
 Frame = +3

Query: 384  TLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVA----TKLIQLYADAGDLS 551
            TL+   + S+L+ L+A+ S    A +H  L   GL+R  S +    TKL+QLYADAGDL 
Sbjct: 31   TLTPRAVDSILRSLSASPSTVAAAHVHLKLLVRGLLRPSSSSDYFTTKLLQLYADAGDLP 90

Query: 552  CALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRSAA 731
             AL LF  LP P++FAWT ++A  SRS +  RC A+Y  MR+  +APD YV P+VLRS  
Sbjct: 91   AALHLFDELPQPNIFAWTAVIAYHSRSGDRRRCYATYERMRAVGVAPDSYVLPVVLRSTP 150

Query: 732  ADYHQHISTLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSWNSI 911
                 + + LH                     YS SGD  AARR FD    RDLLSWNS+
Sbjct: 151  C----YFAALHADSIKFAASGNVQVSNALICAYSDSGDAGAARRVFDSMDSRDLLSWNSM 206

Query: 912  IAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLALPNT 1091
            I+AY+ +   D+AL L  SM S+G DPD VTWNTL+DGYCRAGRC+EA ++   +   + 
Sbjct: 207  ISAYAYSGCTDSALALFRSMGSNGYDPDRVTWNTLIDGYCRAGRCAEALELFDQVVDRDV 266

Query: 1092 ISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRAVHA 1271
            ++WTT+I G++   NHEAAL +F RM     VPPD  TL+CV A CR   ++ AGR VHA
Sbjct: 267  VTWTTIIVGHAYCANHEAALGLFRRMTSCSDVPPDQHTLSCVVACCRHVGSLRAGREVHA 326

Query: 1272 YGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXXXXX 1451
             G++T     FY SAGAAL+ LYAS+  I+ A  V  + D  D + WNA+I         
Sbjct: 327  SGIKTLEPAAFYFSAGAALVALYASSKCITAANEVIRLADSMDRMNWNALIVGLTHAGLN 386

Query: 1452 XXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVTGMNSLIE 1631
                 +FR M SRGI  + TTL++V    +L  G+++HAH +K+   C       N+LI 
Sbjct: 387  CQALDRFRVMQSRGIGTDATTLASVLPACSLAHGQQIHAHTIKN--YCHPAAVVCNALIS 444

Query: 1632 MYSRSGCIEAAHQLFSSID-AKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNGFKPNPV 1808
             Y+R GCI AA  +FS+   + DVVTWN +I AYGSHG   P + LV++M  NG +PN  
Sbjct: 445  TYARGGCIAAARSVFSAARFSGDVVTWNAMIAAYGSHGLVGPIMGLVSEMAHNGPRPNLS 504

Query: 1809 TLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            TLTS L AC+H GLVD+GL++F     D+G  P    + CVVDML RAGRFEEA
Sbjct: 505  TLTSLLTACSHTGLVDDGLEWFNRFQADMGLEPEMAQFGCVVDMLGRAGRFEEA 558



 Score = 79.0 bits (193), Expect = 2e-11
 Identities = 79/311 (25%), Positives = 125/311 (40%), Gaps = 9/311 (2%)
 Frame = +3

Query: 501  SVATKLIQLYADAGDLSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSS 680
            S    L+ LYA +  ++ A ++     S     W  ++  L+ +  + + L  +R M+S 
Sbjct: 340  SAGAALVALYASSKCITAANEVIRLADSMDRMNWNALIVGLTHAGLNCQALDRFRVMQSR 399

Query: 681  SIAPDGYVFPLVLRSAAADYHQHISTLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAAR 860
             I  D      VL + +  + Q I   H                     Y++ G +AAAR
Sbjct: 400  GIGTDATTLASVLPACSLAHGQQI---HAHTIKNYCHPAAVVCNALISTYARGGCIAAAR 456

Query: 861  RTFDLAGCR-DLLSWNSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRA 1037
              F  A    D+++WN++IAAY S   V   +GL+  M  +G  P+L T  +L+      
Sbjct: 457  SVFSAARFSGDVVTWNAMIAAYGSHGLVGPIMGLVSEMAHNGPRPNLSTLTSLLTACSHT 516

Query: 1038 GRCSEARDIL------FGLALPNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDL 1199
            G   +  +         GL  P    +  V+    R G  E A+    RM  A    P  
Sbjct: 517  GLVDDGLEWFNRFQADMGLE-PEMAQFGCVVDMLGRAGRFEEAIGFVRRMREA----PAA 571

Query: 1200 DTLACVAASCRQATAVVAGRAVHAYGLRTSPVDTFYRSAGAALIL--LYASAGRISPARS 1373
                 + A+ R    V  G  V    ++  P      +AG  + +  +YASAGR   A+ 
Sbjct: 572  SVWGALMAASRAHRNVEVGEMVFEKLVKIEP-----ENAGNYVTMAEIYASAGRWEDAKM 626

Query: 1374 VFEMMDPTDVV 1406
            V EM+D   VV
Sbjct: 627  VREMIDWKRVV 637


>ref|XP_010263222.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
            chloroplastic-like [Nelumbo nucifera]
          Length = 654

 Score =  441 bits (1135), Expect = e-143
 Identities = 234/525 (44%), Positives = 326/525 (62%), Gaps = 5/525 (0%)
 Frame = +3

Query: 411  LLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYADAGDLSCALQLFATLPSPS 590
            LLQ  + + +L +  Q+HQ +    L  D    TKL+Q+YA   DL  A  LF  LP P+
Sbjct: 29   LLQRCSDSVALKQGRQVHQQIILHELAFDPFTVTKLVQMYAACNDLISARILFDELPRPN 88

Query: 591  VFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRSAAADYHQHIST---- 758
            VFAWT I++  SR+     C+ +Y  M+   I PDGYVFP VLR+      Q +S     
Sbjct: 89   VFAWTSIISFYSRNGMFKECVRTYNEMKLQGIGPDGYVFPKVLRACT----QSLSLAEGI 144

Query: 759  -LHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSWNSIIAAYSSAD 935
             +H                     YSK GD+  A+R F+    +DLL+WNS+I+ +   D
Sbjct: 145  RIHKDIIELGAEHNLQVCNSLIDMYSKCGDVQTAQRIFNGMAEKDLLTWNSMISGFVCND 204

Query: 936  RVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLALPNTISWTTVIS 1115
             +D A+  LD+MRS+G +PDLVTWNT+MD YCR G C++A +I   +  PN IS TT+IS
Sbjct: 205  FLDIAIEQLDAMRSEGFEPDLVTWNTIMDAYCRMGLCNKALEIFEQIREPNIISLTTLIS 264

Query: 1116 GYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRAVHAYGLRTSPV 1295
            GYSR GNHE  L IF  MM    V PD D L+ V  SCR    + +G+ +HAYG++TS  
Sbjct: 265  GYSRIGNHEMPLVIFREMMSKQEVCPDPDALSSVLVSCRHMGVLRSGQEIHAYGIKTSAR 324

Query: 1296 DTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXXXXXXXXXXQFR 1475
              FY S+G AL+ +YA++GR+  AR+VF++MD +DVVTWNAMI                R
Sbjct: 325  IEFYNSSGPALLTVYATSGRLRDARNVFQLMDKSDVVTWNAMILGLVHLGLGDLAIKYVR 384

Query: 1476 DMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVTGMNSLIEMYSRSGCI 1655
            +M SRG+Q + TT+S V  + +L+ GK++HA+  ++      +V   N+LI MYS+ GCI
Sbjct: 385  EMQSRGLQYDETTVSTVLPVCDLRFGKQIHAYIRRNALDSAISV--WNALINMYSKCGCI 442

Query: 1656 EAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNGFKPNPVTLTSTLMAC 1835
             +A+ +FS +D++DVV+WNT+IG YG +G GK A++L+ +M ++GF+PN  T TS L AC
Sbjct: 443  RSAYTVFSKMDSRDVVSWNTMIGGYGMNGCGKAALQLLLEMKQSGFQPNSATFTSLLSAC 502

Query: 1836 AHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            +H GLVD+GLQ F++LI    F P  EH+ACVVD+LARAG+ EEA
Sbjct: 503  SHSGLVDDGLQLFDSLINCFDFSPRMEHFACVVDLLARAGKLEEA 547


>ref|XP_021897472.1| pentatricopeptide repeat-containing protein DOT4, chloroplastic-like
            [Carica papaya]
          Length = 627

 Score =  434 bits (1117), Expect = e-141
 Identities = 230/540 (42%), Positives = 324/540 (60%), Gaps = 4/540 (0%)
 Frame = +3

Query: 363  LMLWQTP-TLSFYC--LRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYA 533
            + +W++P T+   C  +  LLQ  + ++SL    Q+HQ +   G  RD    TKL+QLY 
Sbjct: 1    MSIWRSPRTIISQCDNINYLLQQCSNSRSLQPAKQVHQRVTVCGSSRDPFTLTKLLQLYV 60

Query: 534  DAGDLSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPL 713
            D  DL  A  LF  LP P+VFAW+ ILA  SR  ++  CL SYR M+   ++PD YVFP 
Sbjct: 61   DCDDLDSAQNLFDKLPQPNVFAWSSILAFYSRHGSYEECLHSYRDMKVKGVSPDNYVFPQ 120

Query: 714  VLRSAAADYH-QHISTLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRD 890
            VLR+ A     +    +H                     Y+K GD+ +ARR FD    +D
Sbjct: 121  VLRACAQSSSLEEGIQIHKHVIVYGSELNLQVCNSLIDMYAKCGDVESARRVFDEMVEKD 180

Query: 891  LLSWNSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILF 1070
            LLSWNS+I+ Y     +  A+ L  S+R++GC+PD+VT+NT++D YCR G C EA  I  
Sbjct: 181  LLSWNSMISGYVHNGLLRLAVQLFSSVRANGCEPDIVTFNTVLDAYCRMGLCEEAWKIFG 240

Query: 1071 GLALPNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVV 1250
             +  PN ISWTT+ISGYSRTG HE AL  F  M+  G V PDL +L+ V  SCR   A++
Sbjct: 241  QIKDPNIISWTTLISGYSRTGKHEIALRKFRTMVNMGRVFPDLGSLSSVLVSCRHLGALM 300

Query: 1251 AGRAVHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQX 1430
            +GR +H YG +      FY SAG AL+ +Y    RI  A++VF +MD +DVVTWN+MI  
Sbjct: 301  SGREIHGYGTKMERGTKFYSSAGPALLTMYTKCHRIQDAKTVFGLMDKSDVVTWNSMILG 360

Query: 1431 XXXXXXXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVT 1610
                         F  +   G++ + TT+S +  + ++K GK++HA  M  ++ C   V 
Sbjct: 361  FSDVQLGHMALECFSQLQKTGVKNDQTTISTILPVCDIKSGKQIHAR-MIRNSFCSVVVV 419

Query: 1611 GMNSLIEMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNG 1790
             +N+LI MYS+ GC+  A+ +F ++  +D+V+WN +IG +  HG G+ A+ L+ QM ++G
Sbjct: 420  -LNALIHMYSKCGCVGYAYSVFCNMFIRDLVSWNAMIGGFAMHGLGQAALHLLEQMNQSG 478

Query: 1791 FKPNPVTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            F PN VTLTS L AC H GLVDEG++ F  + ++VG +P+ EH ACVVDMLARAGR E+A
Sbjct: 479  FSPNSVTLTSALSACTHGGLVDEGIELFHRMTKEVGLIPSIEHCACVVDMLARAGRIEDA 538


>ref|XP_002277337.2| PREDICTED: pentatricopeptide repeat-containing protein At5g39350-like
            [Vitis vinifera]
          Length = 634

 Score =  432 bits (1110), Expect = e-140
 Identities = 234/540 (43%), Positives = 316/540 (58%), Gaps = 1/540 (0%)
 Frame = +3

Query: 354  ISNLMLWQTPTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYA 533
            IS+L       LS + L  LLQ+ + +K+L +  QLHQH+   GL     + TKL+Q+YA
Sbjct: 10   ISSLPTSNPNLLSSFQLNHLLQLCSNSKALHQGKQLHQHIILCGLDHHPFMLTKLVQMYA 69

Query: 534  DAGDLSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPL 713
            D GDL  A  LF  L  P+VFAWT IL   SR+     C+ +Y  M+   + PD YVFP 
Sbjct: 70   DCGDLGSAQALFDKLSQPNVFAWTAILGFYSRNGLSDECVRTYSEMKLKGVLPDKYVFPK 129

Query: 714  VLRSAAADYHQHIST-LHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRD 890
            V R+        +   +H                     YSKSGD+ + RR FD    RD
Sbjct: 130  VFRACGQLLWLEVGIQVHKDVVICGCEFDLQVCNSLIDMYSKSGDVGSGRRVFDEMVERD 189

Query: 891  LLSWNSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILF 1070
            +LSWNS+I+ Y     ++ ++ LL SMR  G +PD+VTWNT+MD YCR G C EA +I  
Sbjct: 190  VLSWNSMISGYVCNGFLEFSVELLASMRIRGFEPDMVTWNTVMDAYCRMGLCDEAWEIFE 249

Query: 1071 GLALPNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVV 1250
             +  PN IS TT++SGYSR GNHE +L IF  MM      PDLD+L+ V  SCR   A+V
Sbjct: 250  QIKEPNIISLTTLVSGYSRIGNHEKSLGIFREMMSRRVAFPDLDSLSSVLVSCRHLGALV 309

Query: 1251 AGRAVHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQX 1430
             G+ +H YG+R+    +FY+SAGAAL+ +Y    RI  A +VFE+MD  DVVTWNAMI  
Sbjct: 310  CGQEIHGYGIRSVDSSSFYKSAGAALLTMYVKCKRIQDALNVFELMDRFDVVTWNAMILG 369

Query: 1431 XXXXXXXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVT 1610
                         F  M   GI  N  T+S V    +LK GK++HA+  K+       + 
Sbjct: 370  FVDLEMGHLALECFSKMQRSGIMNNQITISTVLPACDLKSGKQVHAYITKNSFS--SVIP 427

Query: 1611 GMNSLIEMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNG 1790
              N+LI MYS+ GCI  A+ +FS++ ++D+V+WNT+IG +G HG G+ A++L+  M  + 
Sbjct: 428  VWNALIHMYSKCGCIGTAYSIFSNMISRDLVSWNTMIGGFGMHGLGQFALQLLRDMSHSD 487

Query: 1791 FKPNPVTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
              PN VT TS L AC+H GLVDEG++ F  + RD GF P  EH++CVVD+LARA R E+A
Sbjct: 488  VCPNSVTFTSALSACSHSGLVDEGMELFHTMTRDFGFTPGMEHFSCVVDLLARADRLEDA 547


>gb|OVA19051.1| Pentatricopeptide repeat [Macleaya cordata]
          Length = 622

 Score =  429 bits (1104), Expect = e-139
 Identities = 227/525 (43%), Positives = 313/525 (59%), Gaps = 2/525 (0%)
 Frame = +3

Query: 402  LRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYADAGDLSCALQLFATLP 581
            L  LLQ  + +K+L +  Q+HQ +   G   D  + TKLIQ+Y+D  D S    LF  LP
Sbjct: 9    LNHLLQCCSDSKALKQGRQVHQQIIVHGFGLDPFITTKLIQMYSDCNDFSSTHNLFDKLP 68

Query: 582  SPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRSAAADYH-QHIST 758
             P+VFAWT ILA   R+  +  CL +Y  +R     PD Y+FP V+R+       +    
Sbjct: 69   QPNVFAWTAILAFYLRNGMYQDCLRTYHQLRWQGTKPDSYIFPKVIRACTQSLSLEEGIE 128

Query: 759  LHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSWNSIIAAYSSADR 938
            +H                     YSK GD+  ARR FD    +DLLSWNS+I+ Y     
Sbjct: 129  IHTDIIKFDGELNLQVCNSLIDMYSKFGDVQNARRVFDEMLQKDLLSWNSMISGYVCNGF 188

Query: 939  VDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLALPNTISWTTVISG 1118
            +D ++ LLDSMR +G +PDLVTWNT+MD YCR G C EA  I   +  PN ISWT++ISG
Sbjct: 189  IDLSIKLLDSMRLNGFEPDLVTWNTIMDAYCRIGLCEEASKIFEHVKEPNIISWTSLISG 248

Query: 1119 YSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRAVHAYGLRTSPVD 1298
            YSRTG HE ++ IF  MM  G V PD D L+   +SC+    + +GR +H YG++     
Sbjct: 249  YSRTGKHEISMRIFRDMMSRGGVIPDPDALSSALSSCKLLGNLRSGRELHGYGIKIQEGL 308

Query: 1299 TFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXXXXXXXXXXQFRD 1478
             FY SAGA L+ +Y+ +G+   AR+VFE MD +DVVTWNAMI               F +
Sbjct: 309  EFYNSAGAVLLTMYSGSGKAQFARNVFEFMDKSDVVTWNAMILGLTHLGMGDSALKCFGE 368

Query: 1479 MVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVTGMNSLIEMYSRSGCIE 1658
            M   G++ +  TLS +  + +L LGK++HA  +K+  G    V   N+LI MYS+ GCIE
Sbjct: 369  MQLIGVKNSQITLSTILPVCDLNLGKQIHADIVKN--GFNSAVPVWNALINMYSKCGCIE 426

Query: 1659 AAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIR-NGFKPNPVTLTSTLMAC 1835
             A+ +F+++  KDVV+WNT+IG YG HG G+ A+EL+ +M + +   PN VT TS L AC
Sbjct: 427  DAYTVFTNMGTKDVVSWNTMIGGYGIHGHGRAALELLQEMKKQSSILPNSVTFTSALTAC 486

Query: 1836 AHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            +H GLVDEGLQ F++  ++  F+PT +H+ C+VD+LARAGR  +A
Sbjct: 487  SHAGLVDEGLQLFDSWNQEFNFVPTMDHFGCLVDLLARAGRLVDA 531


>ref|XP_007047218.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
            chloroplastic [Theobroma cacao]
 gb|EOX91375.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao]
          Length = 635

 Score =  428 bits (1101), Expect = e-138
 Identities = 224/531 (42%), Positives = 320/531 (60%), Gaps = 1/531 (0%)
 Frame = +3

Query: 381  PTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYADAGDLSCAL 560
            P +S   L +LLQ+ + +KSL +  Q+H  + + G  ++  + TKL+Q+YAD  DL  A 
Sbjct: 16   PRISLSQLNNLLQLCSKSKSLSQGKQIHPQIISNGSHQNTFIITKLVQMYADCDDLVSAN 75

Query: 561  QLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRSAAADY 740
            +LF  LP P+VF+WT IL L SR   + +C+ SY  M+ S + PDG+VFP VLR++    
Sbjct: 76   KLFDRLPQPNVFSWTAILGLYSRHGMYRKCIESYCEMKMSGVLPDGFVFPKVLRASVQGL 135

Query: 741  HQHIST-LHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSWNSIIA 917
                   +H                     Y + GDL +ARR FD    RDL SWN +I+
Sbjct: 136  CLETGICVHKDVIVCGCEFYLEVCNSLIDMYGRCGDLTSARRVFDEMVGRDLFSWNLMIS 195

Query: 918  AYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLALPNTIS 1097
             Y     ++  L +L+ MR DG +PD+VTWN +MDGYCR GRC EA  I   +  PN IS
Sbjct: 196  GYVGNGMLEFGLEILNCMRLDGFEPDVVTWNMVMDGYCRMGRCDEALKIFEYIKEPNIIS 255

Query: 1098 WTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRAVHAYG 1277
            WTT+ISGYSR G HE++L IF  M+  G V PDLD L+    SCR   A+++G+ +H +G
Sbjct: 256  WTTLISGYSRIGQHESSLRIFKDMLNKGVVLPDLDCLSSALVSCRHLGALLSGKEIHGFG 315

Query: 1278 LRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXXXXXXX 1457
            ++     +FY SAG AL+ L++  GR   A ++FE+MD +D VTWNAMI           
Sbjct: 316  IKMMIGRSFYGSAGPALLTLHSKCGRSRDAGNIFELMDKSDTVTWNAMILGFVDRGLGHM 375

Query: 1458 XXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVTGMNSLIEMY 1637
                F +M   GI+ + TT+  V  +  L+ GK+LHA+  +  +   C +   N+L+ MY
Sbjct: 376  AVDCFGEMQRMGIKNDQTTICTVLPVCELRQGKQLHAYIRRQYSDSICPI--WNALVHMY 433

Query: 1638 SRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNGFKPNPVTLT 1817
            S+ G I +A+ +FS++ A+D+V+WNT+IG +  HG G+ A++L+ +M   G  P+PVTLT
Sbjct: 434  SKCGSIGSAYSVFSNMVARDLVSWNTMIGGFALHGLGEAALQLLKEMNYLGVCPSPVTLT 493

Query: 1818 STLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            S L AC H GLVDEGL+ F ++ R     P+ EH+ACVVDML+RAGR E+A
Sbjct: 494  SALSACNHSGLVDEGLKVFSSMTRGFHLSPSMEHFACVVDMLSRAGRLEDA 544


>ref|XP_009787964.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Nicotiana sylvestris]
          Length = 637

 Score =  427 bits (1098), Expect = e-138
 Identities = 230/550 (41%), Positives = 322/550 (58%), Gaps = 3/550 (0%)
 Frame = +3

Query: 330  RVGTRFPVISNLMLWQTPTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVA 509
            RV + +   S+       T   + +  LLQ+ +  K++ +  Q HQ +   G   +  + 
Sbjct: 2    RVWSCYKKFSSAPATNVRTFLTFEINHLLQLCSNFKAIEQGKQTHQQIIVHGHGHNPFII 61

Query: 510  TKLIQLYADAGDLSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIA 689
            TKLI++YA+ G++  A  LF  L   +VFAWT +L+  SR+     C+++YR M+   I 
Sbjct: 62   TKLIRVYAECGNIKSARYLFVELSQRNVFAWTAMLSYFSRNCLIEECVSTYREMKLDGIL 121

Query: 690  PDGYVFPLVLRSAAADYHQHIST---LHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAAR 860
             DGYVFPLVL+  A      ++T   +H                     YSK GD+ +A+
Sbjct: 122  LDGYVFPLVLKVCAK--FSSLATGEQVHKDVVVCGAEWNLQVGHSLIDMYSKCGDIQSAK 179

Query: 861  RTFDLAGCRDLLSWNSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAG 1040
            R FDL   +DLLSWN II+ Y S +  D A+G+   MR +GC PD+VT+NT+MD YCR G
Sbjct: 180  RVFDLMQEKDLLSWNLIISGYVSNELPDLAVGMFGLMRMEGCQPDIVTFNTIMDAYCRMG 239

Query: 1041 RCSEARDILFGLALPNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVA 1220
            RC EAR I   +  P+ ISWTT+ISGYSR G H  AL+IF  M+  G V PDLD L+ V 
Sbjct: 240  RCDEARKIFMLIKDPSIISWTTLISGYSRIGEHNHALDIFREMINRGEVCPDLDCLSSVL 299

Query: 1221 ASCRQATAVVAGRAVHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTD 1400
            ASC+    + + + +HA G++     +FYRS+G AL+ LY   GRI  AR VFE+MD TD
Sbjct: 300  ASCQLIGDLRSAKEIHAQGIKVEWPFSFYRSSGPALLTLYTKCGRIQDARHVFELMDKTD 359

Query: 1401 VVTWNAMIQXXXXXXXXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMK 1580
            VV WN+MI               F++M+  GI+ N TTLS+V  + +LK GK++HA+ ++
Sbjct: 360  VVAWNSMIHGFAELGMKGLALEYFKNMIPMGIKINGTTLSSVLPVFDLKYGKQIHAYILR 419

Query: 1581 HDAGCRCTVTGMNSLIEMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAV 1760
                    +   N+LI MYS+ GCI  A  +FS +  KD+V+WNTIIG  G HG G+ A+
Sbjct: 420  SSLWDVTPI--WNALIYMYSKYGCIGNALSVFSHLAHKDIVSWNTIIGGLGMHGLGQDAL 477

Query: 1761 ELVNQMIRNGFKPNPVTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDM 1940
             L+ +M   G +PN +T TS L AC+H GLVDEGL  F  ++ + G  P  EH+ CVVD+
Sbjct: 478  HLLEKMSHYGIRPNALTFTSVLSACSHAGLVDEGLDIFHRMVEEFGLNPRMEHFTCVVDL 537

Query: 1941 LARAGRFEEA 1970
            L RAGR E+A
Sbjct: 538  LTRAGRLEDA 547


>ref|XP_021714284.1| pentatricopeptide repeat-containing protein At2g03380,
            mitochondrial-like [Chenopodium quinoa]
          Length = 635

 Score =  424 bits (1090), Expect = e-137
 Identities = 225/546 (41%), Positives = 324/546 (59%), Gaps = 6/546 (1%)
 Frame = +3

Query: 351  VISNLMLWQTPTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLY 530
            ++ +   + T  LS   L  LLQV + +++L +  Q H  +   G+  +  V TKL+Q Y
Sbjct: 6    IVRHFSSFSTSNLSSSKLTHLLQVCSNSRALAQAKQTHLQVIQHGMHHEPYVTTKLVQTY 65

Query: 531  ADAGDLSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFP 710
            A+   L  A +LF  LP P+VFAWT +LA  SR+     CL  Y +MR   + PD YVFP
Sbjct: 66   AECDHLGYARKLFDELPEPNVFAWTALLAFYSRNGLFIECLQVYGAMRFMGVLPDQYVFP 125

Query: 711  LVLRSAAADYHQHISTL------HXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFD 872
             +LR+ A      +STL      H                     YSK G++  ARR FD
Sbjct: 126  KILRACA-----QLSTLKTGAMVHKEVIVCGVELNLQVCSSLIDMYSKCGEVDNARRVFD 180

Query: 873  LAGCRDLLSWNSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSE 1052
                RDLLSWN++I+ Y +   ++ AL L  SMR  G +PDLVT N++MD YC+ G C E
Sbjct: 181  SLAVRDLLSWNAMISGYVANGFLELALELYGSMRVKGTEPDLVTLNSVMDAYCQMGLCDE 240

Query: 1053 ARDILFGLALPNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCR 1232
            A ++   +A P+ +SWTT++SGYSR GNHEA+L +F  M+      PDLD+L+    SCR
Sbjct: 241  ALEVFGQIARPSVVSWTTLMSGYSRIGNHEASLGMFRDMVKFMVDIPDLDSLSSAMVSCR 300

Query: 1233 QATAVVAGRAVHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTW 1412
               A+  G+ +H YGL+T     FY+S G AL+ +Y+   R+   R+VFE+MD +DVVTW
Sbjct: 301  HLRALRNGQEIHVYGLKTHSRSLFYKSCGPALLTMYSKCSRVYDMRNVFELMDKSDVVTW 360

Query: 1413 NAMIQXXXXXXXXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAG 1592
            NAMI               F  M   G++ + TT+S +  + +LK GK++HA+ ++++ G
Sbjct: 361  NAMILGLADLSMGHSALETFSMMQIMGVKNDQTTISTILPICDLKPGKQIHAYILRNNLG 420

Query: 1593 CRCTVTGMNSLIEMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVN 1772
                V  +N+LI MY + G I +A  +FS++  KD+VTWNT+IG +  HG+G+ A++++N
Sbjct: 421  S--VVPVLNALISMYCKCGSIHSAKLIFSNMTMKDLVTWNTMIGGFAMHGKGEAALQMLN 478

Query: 1773 QMIRNGFKPNPVTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARA 1952
            +MI +G  PN VTLTS L AC H GL++EGL+ F  + RD G +P  EH+AC+VDMLARA
Sbjct: 479  EMISSGLSPNSVTLTSVLSACNHSGLINEGLEAFYGMSRDFGLVPKMEHFACLVDMLARA 538

Query: 1953 GRFEEA 1970
            G+  EA
Sbjct: 539  GQLNEA 544


>ref|XP_016511607.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
            chloroplastic-like [Nicotiana tabacum]
          Length = 637

 Score =  424 bits (1090), Expect = e-137
 Identities = 229/550 (41%), Positives = 321/550 (58%), Gaps = 3/550 (0%)
 Frame = +3

Query: 330  RVGTRFPVISNLMLWQTPTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVA 509
            RV + +   S+       T   + +  LLQ+ +  K++ +  Q HQ +   G   +  + 
Sbjct: 2    RVWSCYKKFSSAPATNVRTFLTFEINHLLQLCSNFKAIEQGKQTHQQIIVHGHGHNPFII 61

Query: 510  TKLIQLYADAGDLSCALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIA 689
            TKLI++YA+ G++  A  LF  L   +VFAWT +L+  SR+     C+++YR M+   I 
Sbjct: 62   TKLIRVYAECGNIKSARYLFVELSQRNVFAWTAMLSYFSRNCLIEECVSTYREMKLDGIL 121

Query: 690  PDGYVFPLVLRSAAADYHQHIST---LHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAAR 860
             DGYVFPLVL+  A      ++T   +H                     YSK GD+ +A+
Sbjct: 122  LDGYVFPLVLKVCAK--FSSLATGEQVHKDVVVCGAEWNLQVGHSLIDMYSKCGDIQSAK 179

Query: 861  RTFDLAGCRDLLSWNSIIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAG 1040
            R FDL   +DLLSWN II+ Y S +  D A+G+   M  +GC PD+VT+NT+MD YCR G
Sbjct: 180  RVFDLMQEKDLLSWNLIISGYVSNELPDLAVGMFGLMSMEGCQPDIVTFNTVMDAYCRMG 239

Query: 1041 RCSEARDILFGLALPNTISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVA 1220
            RC EAR I   +  P+ ISWTT+ISGYSR G H  AL+IF  M+  G V PDLD L+ V 
Sbjct: 240  RCDEARKIFMLIKDPSIISWTTLISGYSRIGEHNHALDIFREMINRGEVCPDLDCLSSVL 299

Query: 1221 ASCRQATAVVAGRAVHAYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTD 1400
            ASC+    + + + +HA G++     +FYRS+G AL+ LY   GRI  AR VFE+MD TD
Sbjct: 300  ASCQLIGDLRSAKEIHAQGIKVEWPFSFYRSSGPALLTLYTKCGRIPDARHVFELMDKTD 359

Query: 1401 VVTWNAMIQXXXXXXXXXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMK 1580
            VV WN+MI               F++M+  GI+ N TTLS+V  + +LK GK++HA+ ++
Sbjct: 360  VVAWNSMIHGFAELGMKGLALEYFKNMIPMGIKINGTTLSSVLPVFDLKYGKQIHAYILR 419

Query: 1581 HDAGCRCTVTGMNSLIEMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAV 1760
                    +   N+LI MYS+ GCI  A  +FS +  KD+V+WNTIIG  G HG G+ A+
Sbjct: 420  SSLWDVTPI--WNALIYMYSKYGCIGNALSVFSHLAHKDIVSWNTIIGGLGMHGLGQDAL 477

Query: 1761 ELVNQMIRNGFKPNPVTLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDM 1940
             L+ +M   G +PN +T TS L AC+H GLVDEGL  F  ++ + G  P  EH+ CVVD+
Sbjct: 478  HLLEKMSHYGIRPNALTFTSVLSACSHAGLVDEGLDIFHRMVEEFGLNPRMEHFTCVVDL 537

Query: 1941 LARAGRFEEA 1970
            L RAGR E+A
Sbjct: 538  LTRAGRLEDA 547


>gb|ONH96536.1| hypothetical protein PRUPE_7G135300 [Prunus persica]
          Length = 646

 Score =  422 bits (1086), Expect = e-136
 Identities = 221/520 (42%), Positives = 310/520 (59%), Gaps = 1/520 (0%)
 Frame = +3

Query: 414  LQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYADAGDLSCALQLFATLPSPSV 593
            LQ+ + +KSL +   +HQ +   GL ++  + TKL+Q+YAD  DL  + +LF  L  P+V
Sbjct: 30   LQLCSNSKSLNQGKHVHQKIIQCGLDQNPFIVTKLVQMYADCDDLVSSWKLFDNLLKPNV 89

Query: 594  FAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRSAAADYHQHIS-TLHXX 770
            FAWT IL   SR   H  C+ +Y  M  + + PDGYVFP VLR+ A      +   +H  
Sbjct: 90   FAWTAILGFYSRHGMHEECVRAYVEMILNDVLPDGYVFPKVLRACAQLLRLKVGIVVHKD 149

Query: 771  XXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSWNSIIAAYSSADRVDAA 950
                               YSK  D+ +A+R FD    RDL SWNS+I+ Y     +  A
Sbjct: 150  VIICGLNLNLQVCNSLIDMYSKCEDIGSAKRVFDEMVGRDLWSWNSMISGYVCNGLLGLA 209

Query: 951  LGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLALPNTISWTTVISGYSRT 1130
            + L D M   GC+PD+VT NT+MD YCR G C+EA  I   +  PN ISWTT+ISGYSR 
Sbjct: 210  VELFDCMNLGGCEPDIVTLNTVMDAYCRMGHCNEATRIFEQIKEPNIISWTTLISGYSRI 269

Query: 1131 GNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRAVHAYGLRTSPVDTFYR 1310
            G+HEA+L IF  M+ +  V PDLD+L+ V  SCR   +++ G+ +H YG++      FY 
Sbjct: 270  GSHEASLRIFRDMIGSSMVDPDLDSLSTVLVSCRHLGSLLNGKEIHGYGIKRESGIAFYH 329

Query: 1311 SAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXXXXXXXXXXQFRDMVSR 1490
            SAG AL+ +YA+  RI  A +VF++M+P  VV+WNAMI               FR M   
Sbjct: 330  SAGPALLTMYANCRRIHDATNVFKLMNPAHVVSWNAMILGFIDLGLEDLALDSFRRMQRA 389

Query: 1491 GIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVTGMNSLIEMYSRSGCIEAAHQ 1670
             I  + TT+S +    NLK GK++HA   K        V   N+LI MYS+ GCI +A+ 
Sbjct: 390  RINVDQTTISTILPACNLKFGKQIHAFIRK--ISFDLVVPVWNALIHMYSKCGCIGSAYS 447

Query: 1671 LFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNGFKPNPVTLTSTLMACAHCGL 1850
            +FS++  +D+V+WN++IG +G HG G+ A+ L+ +M  +G  PN VT TS L AC+H GL
Sbjct: 448  VFSNMINRDLVSWNSMIGGFGMHGHGRAALHLLKEMNHSGTCPNSVTFTSVLSACSHAGL 507

Query: 1851 VDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            VDEGLQ F +++++ G +P+ EHYAC+VDMLAR G+ E+A
Sbjct: 508  VDEGLQVFHSMMKEYGVIPSMEHYACIVDMLARDGQLEDA 547


>ref|XP_021751703.1| pentatricopeptide repeat-containing protein At2g03380,
            mitochondrial-like [Chenopodium quinoa]
          Length = 635

 Score =  422 bits (1085), Expect = e-136
 Identities = 223/534 (41%), Positives = 320/534 (59%), Gaps = 1/534 (0%)
 Frame = +3

Query: 372  WQTPTLSFYCLRSLLQVLAAAKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYADAGDLS 551
            + TP LS   L  LLQV + +++L +  Q H  +   GL  +  + TKL+Q YA+   L 
Sbjct: 13   FSTPNLSSSKLTHLLQVCSNSRALAQAKQTHLQIIQHGLHHEPYITTKLVQTYAECDHLG 72

Query: 552  CALQLFATLPSPSVFAWTPILALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRSAA 731
             A +LF  LP P+VFAWT +LA  SR+     CL  Y +MR   + PD YV P +LR+ A
Sbjct: 73   YAQKLFDKLPEPNVFAWTALLAFYSRNGLFVECLQVYGAMRFMGVLPDQYVIPKILRACA 132

Query: 732  A-DYHQHISTLHXXXXXXXXXXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSWNS 908
                 +  + +H                     YSK G++ +ARR FD    RDLLSWN+
Sbjct: 133  QLSKLKTGAMIHKEVVVCGVELNLQVCSSLIDMYSKCGEVDSARRVFDNLAVRDLLSWNA 192

Query: 909  IIAAYSSADRVDAALGLLDSMRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLALPN 1088
            +++ Y +   ++ AL L  SMR  G +PDLVT N++MD YCR G C EA ++   +A P+
Sbjct: 193  MLSGYVANGFLELALELYGSMRIMGVEPDLVTLNSVMDAYCRMGLCDEALEVFRQIARPS 252

Query: 1089 TISWTTVISGYSRTGNHEAALEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRAVH 1268
             +SWTT++SGYSR GNHEA+L +F  M+      PDLD+L+    SCR   A+  G+ +H
Sbjct: 253  VVSWTTLMSGYSRIGNHEASLSMFRDMVNFMVDIPDLDSLSSAMVSCRHLRALRNGQEIH 312

Query: 1269 AYGLRTSPVDTFYRSAGAALILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXXXX 1448
            AYG++T     FY S G AL+ +Y+   R+   R+VFE+MD +DVVTWNAMI        
Sbjct: 313  AYGMKTHSRSLFYTSCGPALLTMYSKCSRVHDMRNVFELMDKSDVVTWNAMILGLVDLAM 372

Query: 1449 XXXXXXQFRDMVSRGIQANHTTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVTGMNSLI 1628
                   F  M + G++ + TT+S +  + +LK GK++HA  ++++ G    V  +N+LI
Sbjct: 373  GHSALETFSMMQNVGVKNDQTTISTILPICDLKPGKQIHACILRNNLGS--VVPVLNALI 430

Query: 1629 EMYSRSGCIEAAHQLFSSIDAKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNGFKPNPV 1808
             MY + G I +A  +FS++  KD+VTWNTIIG +  HG G+ A++++N+MI +G  PN V
Sbjct: 431  SMYCKCGSIHSAKLIFSNMTIKDLVTWNTIIGGFAMHGIGEAALQILNEMIFSGLSPNSV 490

Query: 1809 TLTSTLMACAHCGLVDEGLQFFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
            TLTS L AC H GL++EGL+ F  + RD G +P  EH+AC+VDMLARAG+  EA
Sbjct: 491  TLTSVLSACNHSGLINEGLEAFYGMSRDFGLVPKMEHFACLVDMLARAGQLNEA 544


>ref|XP_017219367.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
            chloroplastic-like [Daucus carota subsp. sativus]
          Length = 647

 Score =  422 bits (1085), Expect = e-136
 Identities = 220/514 (42%), Positives = 308/514 (59%), Gaps = 1/514 (0%)
 Frame = +3

Query: 432  AKSLPKIAQLHQHLAATGLVRDHSVATKLIQLYADAGDLSCALQLFATLPSPSVFAWTPI 611
            +++L +  Q HQ +   GL ++  +ATKL+Q+YAD  ++  A  +F  L  P+VFAWT +
Sbjct: 47   SRALQQGRQTHQQIITHGLHQNPFMATKLVQMYADCNNIISAHHVFDQLSHPNVFAWTAM 106

Query: 612  LALLSRSANHTRCLASYRSMRSSSIAPDGYVFPLVLRSAAADYHQHIST-LHXXXXXXXX 788
            LA  SR+     CLA Y  M+ + + PD YV P V+R+            +H        
Sbjct: 107  LAFYSRNGMTKECLACYNEMKKNRVFPDKYVLPNVVRACTKSLCLETGMQIHKEAVVFGV 166

Query: 789  XXXXXXXXXXXXXYSKSGDLAAARRTFDLAGCRDLLSWNSIIAAYSSADRVDAALGLLDS 968
                         YSK GD+A ARR FD    RDLLSWNS+I+AY S+  +  A+ L   
Sbjct: 167  EMNLQICNSLIDMYSKCGDVAGARRVFDAMVERDLLSWNSLISAYVSSGFLVLAIELFGF 226

Query: 969  MRSDGCDPDLVTWNTLMDGYCRAGRCSEARDILFGLALPNTISWTTVISGYSRTGNHEAA 1148
            MR  G +PD VTWNT++D YCR G+C EA ++   +  PN ISWTT+ISGYSR G HE  
Sbjct: 227  MRMGGFEPDTVTWNTIVDAYCRMGQCDEASNVFKKIKEPNIISWTTLISGYSRIGEHEVT 286

Query: 1149 LEIFSRMMFAGSVPPDLDTLACVAASCRQATAVVAGRAVHAYGLRTSPVDTFYRSAGAAL 1328
            L IF  MM  G V PDLD L+ V  SCR       GR +HA+G++T  +  FY+SAG AL
Sbjct: 287  LSIFREMMSIGKVCPDLDCLSSVLVSCRHVEGFNFGREIHAHGIKTINITAFYKSAGPAL 346

Query: 1329 ILLYASAGRISPARSVFEMMDPTDVVTWNAMIQXXXXXXXXXXXXXQFRDMVSRGIQANH 1508
            +++YA+  R+    +VF+ MD +DVVTW+AMI               FR M    IQ + 
Sbjct: 347  LVMYATNRRMPEMGNVFDFMDMSDVVTWSAMIHSLAHLGMAHSALACFRKMHILKIQNDQ 406

Query: 1509 TTLSAVASLSNLKLGKELHAHAMKHDAGCRCTVTGMNSLIEMYSRSGCIEAAHQLFSSID 1688
            TTLS +  + +LK+GKE+HA+  ++  G    +T +N+LI MYS++GC   AH +F +++
Sbjct: 407  TTLSTILPVCDLKIGKEIHAYIWRN--GFNSVITVLNALIHMYSKNGCSTIAHSVFVNME 464

Query: 1689 AKDVVTWNTIIGAYGSHGRGKPAVELVNQMIRNGFKPNPVTLTSTLMACAHCGLVDEGLQ 1868
            ++DVV+WN IIG +G +G G+ A+ L+ +M  +   PN  T TS L AC+H GLVDEGLQ
Sbjct: 465  SRDVVSWNAIIGGFGMNGFGQAALHLLQEMSHSEICPNSSTFTSVLSACSHSGLVDEGLQ 524

Query: 1869 FFEALIRDVGFMPTQEHYACVVDMLARAGRFEEA 1970
             F  + R+ GF P  EH+ACVVD+LAR+G+  +A
Sbjct: 525  VFHKMTREFGFEPKTEHFACVVDLLARSGQLNDA 558


Top