BLASTX nr result

ID: Cheilocostus21_contig00013508 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00013508
         (1400 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009408172.1| PREDICTED: pentatricopeptide repeat-containi...   564   0.0  
ref|XP_010935201.1| PREDICTED: pentatricopeptide repeat-containi...   472   e-161
ref|XP_020582022.1| pentatricopeptide repeat-containing protein ...   407   e-136
ref|XP_020276341.1| pentatricopeptide repeat-containing protein ...   397   e-132
ref|XP_020685912.1| pentatricopeptide repeat-containing protein ...   395   e-131
gb|PKA66355.1| Pentatricopeptide repeat-containing protein [Apos...   389   e-129
gb|ONK64277.1| uncharacterized protein A4U43_C07F24000 [Asparagu...   389   e-128
ref|XP_008781345.2| PREDICTED: pentatricopeptide repeat-containi...   379   e-126
ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containi...   365   e-119
ref|XP_011045590.1| PREDICTED: pentatricopeptide repeat-containi...   348   e-112
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   347   e-112
dbj|GAV86601.1| PPR domain-containing protein [Cephalotus follic...   344   e-111
ref|XP_020092848.1| uncharacterized protein LOC109713258, partia...   350   e-110
gb|OMO91929.1| hypothetical protein COLO4_18015 [Corchorus olito...   337   e-108
ref|NP_001324067.1| pentatricopeptide (PPR) repeat-containing pr...   335   e-107
ref|XP_015893244.1| PREDICTED: pentatricopeptide repeat-containi...   334   e-107
ref|XP_012077696.1| pentatricopeptide repeat-containing protein ...   335   e-107
gb|OAP09950.1| hypothetical protein AXX17_AT2G12140 [Arabidopsis...   335   e-107
ref|NP_001324066.1| pentatricopeptide (PPR) repeat-containing pr...   335   e-107
ref|NP_565402.1| pentatricopeptide (PPR) repeat-containing prote...   335   e-107

>ref|XP_009408172.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Musa acuminata subsp. malaccensis]
          Length = 430

 Score =  564 bits (1453), Expect = 0.0
 Identities = 297/437 (67%), Positives = 337/437 (77%), Gaps = 8/437 (1%)
 Frame = +3

Query: 63   MALLCTTAGSFSPSAATTRCALACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSK 242
            MALL  TA SFSPS+A  RCALA   KHADRL S L   S DDP AADRLIRKFLAASSK
Sbjct: 1    MALLWATAASFSPSSAGLRCALAGRRKHADRLVSDLRGASADDPSAADRLIRKFLAASSK 60

Query: 243  PAALHSLSRFISLSSPFALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMD 422
            PAALH+LS F+SLSSPFA P+YERISEASWF+W P LAA+V+ALLEKQGR AEA TLT+D
Sbjct: 61   PAALHALSSFLSLSSPFAPPLYERISEASWFSWKPKLAATVVALLEKQGRCAEAETLTLD 120

Query: 423  AVARLKSPRDLALFYCDLLESISEHGLKQSALETYARLREMPF--------RXXXXXXXX 578
            AV+R K+ RDLALFYCDL+E  SE GL+Q  LETYARLRE+PF                 
Sbjct: 121  AVSRSKTHRDLALFYCDLIECFSEQGLEQPVLETYARLREVPFAGRRPYESMIKALCLMG 180

Query: 579  XXXXAEDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSN 758
                AE KL EMAS GCKPSPF+FR VIQ YG+ GL SEM+  + SME+ G+PIDTVC N
Sbjct: 181  MPGEAEAKLKEMASSGCKPSPFEFRSVIQSYGRSGLLSEMRRVVGSMEDAGLPIDTVCVN 240

Query: 759  IVLSCYGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSID 938
            +VLSCYG HG+L EMASWM KMR  GI  SIRT N VLNSCP VVSI +S+  SLPLS++
Sbjct: 241  VVLSCYGHHGELPEMASWMTKMREKGIVFSIRTFNCVLNSCPRVVSI-ASDAGSLPLSME 299

Query: 939  DLMKKLEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLI 1118
            +L++KLE E++         EALLV EL  S VLADI EWSPSGSKLDLHG HV AAY+I
Sbjct: 300  ELLQKLENESS------SRTEALLVQELTSSSVLADISEWSPSGSKLDLHGLHVAAAYII 353

Query: 1119 LLNWMQELRLRFCEEKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRK 1298
            LL W+QELR RF EE ++PLEISVICGSG+HSERRG+SPIK+LVSEMMF+ +SPMRID K
Sbjct: 354  LLKWIQELRRRFQEEDVIPLEISVICGSGKHSERRGRSPIKDLVSEMMFRKSSPMRIDSK 413

Query: 1299 NPGRFAASGKAVREWLC 1349
            NPGRF A GKAV EW+C
Sbjct: 414  NPGRFVARGKAVWEWMC 430


>ref|XP_010935201.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Elaeis guineensis]
          Length = 434

 Score =  472 bits (1214), Expect = e-161
 Identities = 244/433 (56%), Positives = 311/433 (71%), Gaps = 16/433 (3%)
 Frame = +3

Query: 99   PSAATTRCAL--------ACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAAL 254
            P+A   RCAL        A  GKH  RL S+L T +  DP AADRL+RKF+AASSK AAL
Sbjct: 10   PAATGPRCALRNSHSSRTAGGGKHIHRLLSSLDTAA--DPSAADRLVRKFVAASSKSAAL 67

Query: 255  HSLSRFISLSSPFALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVAR 434
            H+LS  +SLSS FALPIY R+SEA+WF WNP LAA++ A+L  QGR  EA +L  ++V+R
Sbjct: 68   HTLSHLLSLSSRFALPIYRRVSEANWFKWNPKLAAAMAAVLVNQGRATEAESLISESVSR 127

Query: 435  LKSPRDLALFYCDLLESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX-------- 590
            L S  +++LFYCDL+E+ SE GLK  AL+ Y+RL E+P                      
Sbjct: 128  LNSDLEISLFYCDLIEAFSERGLKDFALDFYSRLHEIPCSVRKPYESMIKALCLMGLPVD 187

Query: 591  AEDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLS 770
            AE+KL EMA LG +PSPF+FRLV+Q YG+ G F+EM   L  ME+ G+ IDTVC+N+VLS
Sbjct: 188  AEEKLKEMAFLGFRPSPFEFRLVMQSYGKSGSFAEMSRVLGIMEDAGLAIDTVCTNVVLS 247

Query: 771  CYGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMK 950
            CYG HG+L++M SW+ KM+  GIG S+RT N VLNSCP ++S+V  + + +PLSI  L+K
Sbjct: 248  CYGDHGELAKMVSWIRKMKKLGIGFSVRTFNVVLNSCPTIISMVQ-DVKHIPLSIAALVK 306

Query: 951  KLEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNW 1130
            K+EE++         +EALLV EL+GS VL DILEWSP   KLDLHGFHV +A++ILL W
Sbjct: 307  KVEEDSLSL------DEALLVRELVGSSVLVDILEWSPDEGKLDLHGFHVASAFVILLQW 360

Query: 1131 MQELRLRFCEEKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGR 1310
            ++ELR+RF  ++ VPLEISV+CGSG+HS++ G+SP+K LVSEMMFQ NS MRIDRKN GR
Sbjct: 361  VEELRIRFRVDEAVPLEISVVCGSGKHSDKIGESPVKMLVSEMMFQLNSSMRIDRKNAGR 420

Query: 1311 FAASGKAVREWLC 1349
            F A GKAVR+WLC
Sbjct: 421  FVARGKAVRDWLC 433


>ref|XP_020582022.1| pentatricopeptide repeat-containing protein At2g17033 [Phalaenopsis
            equestris]
          Length = 426

 Score =  407 bits (1047), Expect = e-136
 Identities = 213/420 (50%), Positives = 291/420 (69%), Gaps = 9/420 (2%)
 Frame = +3

Query: 117  RCAL-ACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSPF 293
            RC++ A +GK + RL ++LS  +T DP  A RL+RKF+A+SSK ++L +LS  IS SSPF
Sbjct: 16   RCSIPAGAGKPSRRLLNSLS--ATSDPSTAVRLVRKFVASSSKSSSLQALSFLISHSSPF 73

Query: 294  ALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCD 473
            +L +Y+ +SE SWF +N  L AS+I+LLE    + +A TL   + + L+SPRDL+LFYCD
Sbjct: 74   SLHLYQFLSETSWFQFNSKLIASLISLLEDHHCSLDALTLISQSTSVLRSPRDLSLFYCD 133

Query: 474  LLESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------AEDKLNEMASLGC 629
            L+++ S  GLK   L++YARL+E+PF                     AE+ L EM S G 
Sbjct: 134  LIDAFSGRGLKTQVLQSYARLKEIPFSGKRPYQSIIKGMCLMEMPEEAEEFLREMGSSGF 193

Query: 630  KPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMAS 809
            KPSPF+FRLV + YG+ G F+EM   L+SMEE G+ +DT+ +N VLSCYG HG+LSEM S
Sbjct: 194  KPSPFEFRLVFRAYGRAGAFAEMTRVLQSMEENGMALDTLSANTVLSCYGDHGKLSEMVS 253

Query: 810  WMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXX 989
            W+ K+R +GIG S RT+NSVLNSCP  +++++ +  SLPLSID L +K+EE +       
Sbjct: 254  WIQKIRESGIGFSFRTVNSVLNSCP-TIAMLTDDVLSLPLSIDALFRKVEESSPCS---- 308

Query: 990  XXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRFCEEKL 1169
              NE+LL+ EL+  P+L  +L+WS S  KLDLHG H+ +AY+I+L WM+E+R      K 
Sbjct: 309  --NESLLLRELVDFPLLCSLLDWSDSEVKLDLHGLHLVSAYVIILQWMREIRSCLVAGKA 366

Query: 1170 VPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWLC 1349
            VPLE S+ICGSG+HS  RG+SP+K LVS MMFQ  SP++IDRKN G+F A GK V++WLC
Sbjct: 367  VPLEFSIICGSGKHSRSRGESPVKKLVSVMMFQLKSPLKIDRKNVGKFVAKGKKVKDWLC 426


>ref|XP_020276341.1| pentatricopeptide repeat-containing protein At2g17033 [Asparagus
            officinalis]
          Length = 430

 Score =  397 bits (1019), Expect = e-132
 Identities = 214/413 (51%), Positives = 275/413 (66%), Gaps = 10/413 (2%)
 Frame = +3

Query: 141  KHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSPFALPIYERIS 320
            KH+DRL S+LS  S+ DP AA  +IR+F+++SSK  AL +LS  +SLSSPF+LP Y RIS
Sbjct: 32   KHSDRLLSSLS--SSCDPSAAAHVIRRFVSSSSKSTALRTLSLLLSLSSPFSLPFYRRIS 89

Query: 321  EASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLESISEHG 500
             + WF W P L A VIALLE  G   EAR L  ++V RL S R++A FYCDL+ + S  G
Sbjct: 90   ASHWFKWTPKLVAEVIALLESDGHPLEARELVSESVLRLSSQREIAHFYCDLVVAASGRG 149

Query: 501  LKQSALETYARLREMPF--------RXXXXXXXXXXXXAEDKLNEMASLGCKPSPFQFRL 656
            LK+  LE    +++  F                     AE  L+EM  LG KPS F++R+
Sbjct: 150  LKEFVLECCGWIKDTGFVGKRVFECMVRGLSLVGMVEDAEKVLDEMGHLGFKPSGFEYRV 209

Query: 657  VIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMASWMAKMRSTG 836
            VIQGYG+LG F EM+  +  ME   I IDTVC+N+VLSCYG +G L+EM +W+  MR  G
Sbjct: 210  VIQGYGRLGSFKEMRRVIGRMENAEIGIDTVCANLVLSCYGDYGNLAEMVTWIRNMRLLG 269

Query: 837  IGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXXXXNEALLVG 1016
            I  S+RT NSVLNSCP VVS+V  +  +LPLS+DDL++ L +E           EALLV 
Sbjct: 270  ISYSVRTCNSVLNSCPSVVSMV-KDLENLPLSMDDLLEMLNDE-----------EALLVK 317

Query: 1017 ELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQEL--RLRFCEEKLVPLEISV 1190
            E+ G+ VL + L WS S  KLDLHGFH+ +AY+ILL WM+E   RLR  EE  +PLEISV
Sbjct: 318  EMAGTSVLLEKLTWSDSEGKLDLHGFHLASAYIILLQWMEEFRRRLRVKEEVPIPLEISV 377

Query: 1191 ICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWLC 1349
            +CG G+HS  RG+SP+K LVS++M +  SPM+IDRKN GRF A GKAV++WLC
Sbjct: 378  VCGLGKHSIMRGESPVKKLVSKLMSRLKSPMKIDRKNVGRFVAKGKAVKDWLC 430


>ref|XP_020685912.1| pentatricopeptide repeat-containing protein At2g17033 [Dendrobium
            catenatum]
 gb|PKU84129.1| Pentatricopeptide repeat-containing protein [Dendrobium catenatum]
          Length = 426

 Score =  395 bits (1014), Expect = e-131
 Identities = 207/413 (50%), Positives = 280/413 (67%), Gaps = 8/413 (1%)
 Frame = +3

Query: 135  SGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSPFALPIYER 314
            + K + RL ++LST S  D  AA RL+RKF+A+SSK  +L +LS  IS SSPF+L +Y+ 
Sbjct: 23   ASKRSHRLLTSLSTAS--DSSAAIRLVRKFVASSSKSTSLQALSFLISHSSPFSLHLYQT 80

Query: 315  ISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLESISE 494
            +SEA WF +NP LAAS+I+LLE Q  T +A TL   + + L+ PR LALFYCDL+++ S+
Sbjct: 81   LSEAPWFQFNPKLAASLISLLEDQHCTVDALTLLSQSASGLRLPRHLALFYCDLIDAFSD 140

Query: 495  HGLKQSALETYARLREMPFRXXXXXXXXXXXX--------AEDKLNEMASLGCKPSPFQF 650
             GLK     +YARL+E+PF                     AE  L EM   G KPSPF+F
Sbjct: 141  RGLKVQVHRSYARLKEIPFSGRRPYESMIKGMCLMKMPEEAEVFLREMGLAGFKPSPFEF 200

Query: 651  RLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMASWMAKMRS 830
            R V+Q YG++G F+EM   L SM+E G+ +DT+ +N VLSCYG HG+LSEM SW+ K R 
Sbjct: 201  RQVLQAYGRVGAFAEMTRVLESMQENGMALDTLSANTVLSCYGDHGKLSEMVSWIQKTRE 260

Query: 831  TGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXXXXNEALL 1010
            +G+G SIRT NSVLNSCP +V +++ N  SLP SI+ L +K++E +         NE+LL
Sbjct: 261  SGVGFSIRTFNSVLNSCPTIV-MITKNVSSLPPSIEALFRKVDESSPCL------NESLL 313

Query: 1011 VGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRFCEEKLVPLEISV 1190
              EL+  P+L+ +L+WS S  KLDLHG H+ +AY+I+L WM+++R      KLVPLE S+
Sbjct: 314  FRELVNFPLLSAMLDWSDSEVKLDLHGLHLVSAYVIILLWMEKIRSCLVAGKLVPLEFSI 373

Query: 1191 ICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWLC 1349
            +CGSG+HS+  G+SP+K LVS MMFQ  SP++IDR N G+F A GK VR+WLC
Sbjct: 374  VCGSGKHSKIIGESPVKKLVSVMMFQLKSPLKIDRNNAGKFVAKGKKVRDWLC 426


>gb|PKA66355.1| Pentatricopeptide repeat-containing protein [Apostasia shenzhenica]
          Length = 426

 Score =  389 bits (999), Expect = e-129
 Identities = 203/410 (49%), Positives = 284/410 (69%), Gaps = 8/410 (1%)
 Frame = +3

Query: 141  KHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSPFALPIYERIS 320
            + + RL S+LS  +  D  +ADRL+RKF+A+SSKP AL SLS FISLSSPF+L +Y+ I+
Sbjct: 25   RRSHRLLSSLSAAA--DFSSADRLLRKFVASSSKPDALQSLSLFISLSSPFSLLLYQAIA 82

Query: 321  EASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLESISEHG 500
            +  WF WNP LAASV++LLE+Q R+ +A  L   + +RL++PRDL  FYC+L+++ S   
Sbjct: 83   DTPWFRWNPKLAASVVSLLEEQQRSTDAEALISRSTSRLRAPRDLPAFYCELIDAFSCRW 142

Query: 501  LKQSALETYARLREMPFRXXXXXXXXXXXX--------AEDKLNEMASLGCKPSPFQFRL 656
            L+  AL ++ARLRE+P+                     AE+ L EMA  G +PS F+FR 
Sbjct: 143  LQLPALRSFARLREIPYSGRKPYESIIKGLCSMGMATDAEELLREMAIAGFRPSAFEFRS 202

Query: 657  VIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMASWMAKMRSTG 836
            V Q YG+ G F++M     SM+E GI +DTV +NI LSCYG H + S M S++ KMR +G
Sbjct: 203  VAQAYGRSGAFADMTRVFESMQEAGIVLDTVSANIALSCYGDHFKFSVMVSFLRKMRESG 262

Query: 837  IGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXXXXNEALLVG 1016
            I  S+RT NSVLNSCP VV+I + + RSLPLS+  L++K+EE +         +EALL+ 
Sbjct: 263  IIFSLRTFNSVLNSCPSVVTI-TKDLRSLPLSMAALLRKVEEASLCL------DEALLIR 315

Query: 1017 ELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRFCEEKLVPLEISVIC 1196
            E+  S ++ D+L+WS S  KLDLHGFH+ +AY+++L W++ +R R  E+ ++PLEIS+IC
Sbjct: 316  EVTDSSLMGDMLQWSDSEGKLDLHGFHLASAYVMILMWIEVVRDRLSEDGIIPLEISIIC 375

Query: 1197 GSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWL 1346
            GSG++S  +G SP+K LVSEMMFQ  SPM++DRKN G+F A GKAV++WL
Sbjct: 376  GSGKNSRMKGDSPLKKLVSEMMFQLYSPMKMDRKNVGKFVAKGKAVKDWL 425


>gb|ONK64277.1| uncharacterized protein A4U43_C07F24000 [Asparagus officinalis]
          Length = 448

 Score =  389 bits (1000), Expect = e-128
 Identities = 214/431 (49%), Positives = 275/431 (63%), Gaps = 28/431 (6%)
 Frame = +3

Query: 141  KHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSPFALPIYERIS 320
            KH+DRL S+LS  S+ DP AA  +IR+F+++SSK  AL +LS  +SLSSPF+LP Y RIS
Sbjct: 32   KHSDRLLSSLS--SSCDPSAAAHVIRRFVSSSSKSTALRTLSLLLSLSSPFSLPFYRRIS 89

Query: 321  EASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLESISEHG 500
             + WF W P L A VIALLE  G   EAR L  ++V RL S R++A FYCDL+ + S  G
Sbjct: 90   ASHWFKWTPKLVAEVIALLESDGHPLEARELVSESVLRLSSQREIAHFYCDLVVAASGRG 149

Query: 501  LKQSALETYARLREMPF--------------------------RXXXXXXXXXXXXAEDK 602
            LK+  LE    +++  F                                       AE  
Sbjct: 150  LKEFVLECCGWIKDTGFVGKRVFECMVRGLSLVGMVEDAEKVLDEMGHLGVGMVEDAEKV 209

Query: 603  LNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQ 782
            L+EM  LG KPS F++R+VIQGYG+LG F EM+  +  ME   I IDTVC+N+VLSCYG 
Sbjct: 210  LDEMGHLGFKPSGFEYRVVIQGYGRLGSFKEMRRVIGRMENAEIGIDTVCANLVLSCYGD 269

Query: 783  HGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEE 962
            +G L+EM +W+  MR  GI  S+RT NSVLNSCP VVS+V  +  +LPLS+DDL++ L +
Sbjct: 270  YGNLAEMVTWIRNMRLLGISYSVRTCNSVLNSCPSVVSMV-KDLENLPLSMDDLLEMLND 328

Query: 963  ETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQEL 1142
            E           EALLV E+ G+ VL + L WS S  KLDLHGFH+ +AY+ILL WM+E 
Sbjct: 329  E-----------EALLVKEMAGTSVLLEKLTWSDSEGKLDLHGFHLASAYIILLQWMEEF 377

Query: 1143 --RLRFCEEKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFA 1316
              RLR  EE  +PLEISV+CG G+HS  RG+SP+K LVS++M +  SPM+IDRKN GRF 
Sbjct: 378  RRRLRVKEEVPIPLEISVVCGLGKHSIMRGESPVKKLVSKLMSRLKSPMKIDRKNVGRFV 437

Query: 1317 ASGKAVREWLC 1349
            A GKAV++WLC
Sbjct: 438  AKGKAVKDWLC 448


>ref|XP_008781345.2| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Phoenix dactylifera]
          Length = 330

 Score =  379 bits (973), Expect = e-126
 Identities = 191/335 (57%), Positives = 246/335 (73%), Gaps = 8/335 (2%)
 Frame = +3

Query: 369  ALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLESISEHGLKQSALETYARLREMP 548
            A+L  QGR AEA +L  ++V+RL S  +++LFYCDL+E+ SE GLK  AL+ Y RLREMP
Sbjct: 3    AVLVNQGRAAEAESLISESVSRLNSDLEISLFYCDLIEAFSERGLKDLALDFYFRLREMP 62

Query: 549  FRXXXXXXXXXXXX--------AEDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKS 704
                                  AE+KL EMA LG +PSPF+FRLV+Q YG+LG F+EM+ 
Sbjct: 63   CSRRKPYESMIKALCLMGLPVDAEEKLKEMALLGFRPSPFEFRLVLQSYGKLGSFAEMRR 122

Query: 705  ALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCP 884
             L  ME+ G+ +DT+C+N+VLSCYG HG+L+EM SW+ KM+  G+G SIRT N VLNSCP
Sbjct: 123  VLGIMEDAGLAVDTICTNVVLSCYGDHGELAEMVSWIRKMKKLGVGFSIRTFNVVLNSCP 182

Query: 885  MVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSP 1064
             ++SIV  + +  PLSI  L+KK+EE++         +EALLV EL+GS VL DILEWSP
Sbjct: 183  TIISIVQ-DAKHFPLSIAALVKKVEEDSPSP------DEALLVRELVGSSVLVDILEWSP 235

Query: 1065 SGSKLDLHGFHVTAAYLILLNWMQELRLRFCEEKLVPLEISVICGSGRHSERRGQSPIKN 1244
            +  KLDLHGFHV++AY+ILL WM+ELR+RF  +++VPLEISV+CGSG+ S++ G+SP+K 
Sbjct: 236  NEGKLDLHGFHVSSAYVILLQWMEELRMRFRVDEVVPLEISVVCGSGKKSDKIGESPVKM 295

Query: 1245 LVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWLC 1349
            LVSEMMFQ NS MRIDRKN GRF A GKAVR+WLC
Sbjct: 296  LVSEMMFQLNSSMRIDRKNAGRFVAQGKAVRDWLC 330


>ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Nelumbo nucifera]
          Length = 451

 Score =  365 bits (937), Expect = e-119
 Identities = 201/431 (46%), Positives = 273/431 (63%), Gaps = 21/431 (4%)
 Frame = +3

Query: 120  CALACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLS----- 284
            CAL+   K   R F++L+  + D   AA+RLIRKF+A+SSK  AL++LS  IS +     
Sbjct: 31   CALS---KKGHRFFTSLAAAAGDSA-AANRLIRKFVASSSKSDALNALSHLISSNTTHFH 86

Query: 285  -SPFALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK-SPRDLA 458
             S   LP+Y RI+E  WFNWNP L ASVIA L+KQG+  EA  L  ++V +L    RD+A
Sbjct: 87   LSSLVLPMYRRIAETPWFNWNPKLVASVIAYLDKQGQPEEAEALISESVQKLGFQERDVA 146

Query: 459  LFYCDLLESISEHGLKQSALETYARLREM-------------PFRXXXXXXXXXXXXAED 599
            LFYCDL++S S+   +    E+YARL+++                            AE+
Sbjct: 147  LFYCDLIDSYSKQRSRIGVFESYARLKQLFSDSSSSLSRRAYETIICSLCSVDLPRDAEN 206

Query: 600  KLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYG 779
             + EM   G KPS F+FR ++ GYG+LGLF++M+  LR ME+ G  +DT+CSN+VLS +G
Sbjct: 207  MVEEMTISGFKPSAFEFRSLVSGYGRLGLFTDMRRVLRKMEDAGYCLDTICSNMVLSSFG 266

Query: 780  QHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLE 959
             H +LSEMASW+ KM+ + I  SIRT NSV+NSCP + S++  + + +PLS++DL  +L+
Sbjct: 267  AHSELSEMASWLRKMKDSNISFSIRTYNSVMNSCPTITSLL-KDLKFVPLSMEDLKGRLQ 325

Query: 960  EETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQE 1139
            ++           E LLV +LIGS VL D L+W PS  KLDLHG H+  AYLI+L W+Q 
Sbjct: 326  KD-----------ETLLVEQLIGSSVLMDALKWCPSEGKLDLHGMHLATAYLIMLQWVQV 374

Query: 1140 LRLRFCEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFA 1316
            LR RF     ++P E  VICGSG+HS  RG+SP+K LV +MM +  SPM+IDR N G F 
Sbjct: 375  LRSRFSAGNWVIPTEFRVICGSGKHSSVRGESPVKALVKQMMVRMKSPMKIDRNNVGCFV 434

Query: 1317 ASGKAVREWLC 1349
              GKAVR+WLC
Sbjct: 435  GRGKAVRDWLC 445


>ref|XP_011045590.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Populus euphratica]
          Length = 473

 Score =  348 bits (893), Expect = e-112
 Identities = 193/432 (44%), Positives = 265/432 (61%), Gaps = 24/432 (5%)
 Frame = +3

Query: 126  LACSGKHADRLFSA-LSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSP---- 290
            LA   K A R FSA L TV+  D  A +RLI+KF+A+S K  AL +LS  +S  S     
Sbjct: 53   LAAISKQAQRFFSAVLPTVAARDTSATNRLIKKFVASSPKSIALDALSHLLSPDSTHHPL 112

Query: 291  ---FALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK-SPRDLA 458
                 LP+Y +ISEASWF+WNP L A V+ LL+KQG   E + L  + V+RL+   R+L 
Sbjct: 113  LYLLTLPLYLKISEASWFSWNPKLVAQVVVLLDKQGLDKELKALMSETVSRLQFKERELV 172

Query: 459  LFYCDLLESISEHGLKQSALETYARLREM--------------PFRXXXXXXXXXXXXAE 596
            LFYC+L+   S+H   +   ++Y+RL +                              AE
Sbjct: 173  LFYCNLIGFNSKHNWVRGFDDSYSRLNQFVSESKSVYVKKQGYKAMISGLCEMGRAREAE 232

Query: 597  DKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCY 776
            D + EM   G KP+ F+FR V+ GYG+LGLF +M+  L  ME   I +DTVC+N+VL+ Y
Sbjct: 233  DLIGEMRERGLKPTLFEFRCVLYGYGRLGLFKDMERILDKMESGEIEVDTVCANMVLASY 292

Query: 777  GQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKL 956
            G H  L EM  W+ KM++ GI +SIRT NSVLNSCP +++++ +   S P+SI +L+K L
Sbjct: 293  GAHNALPEMGLWLRKMKTLGIPLSIRTCNSVLNSCPTIMALMRNLDASYPVSIQELLKIL 352

Query: 957  EEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQ 1136
             E+           EA+LV ELI S VL + +EW  S  KLDLHG H+ +AY+I+L WM+
Sbjct: 353  SED-----------EAMLVKELIESSVLKEAVEWDTSEGKLDLHGMHLGSAYVIMLQWME 401

Query: 1137 ELRLRFCE-EKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRF 1313
            E R R  + E ++P EI+V+CGSG HS  RG+SP+K++++E+M QT SPMRIDRKN G F
Sbjct: 402  ETRNRLSDGEHVIPAEITVVCGSGNHSTVRGESPVKSMITEIMAQTRSPMRIDRKNIGCF 461

Query: 1314 AASGKAVREWLC 1349
             A G  V++WLC
Sbjct: 462  VAKGNVVKKWLC 473


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera]
 emb|CBI37819.3| unnamed protein product, partial [Vitis vinifera]
          Length = 435

 Score =  347 bits (889), Expect = e-112
 Identities = 201/433 (46%), Positives = 273/433 (63%), Gaps = 22/433 (5%)
 Frame = +3

Query: 117  RCALACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLS---- 284
            +CAL+  G+    LF  LS+V+ D P A++RLI KF+A+SSK  AL++LS  +S +    
Sbjct: 22   QCALSKQGQ----LF--LSSVARD-PSASNRLICKFIASSSKSIALNALSHLLSPTTTHP 74

Query: 285  --SPFALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKS-PRDL 455
              S  ALP+Y RISEASWF+WNP L A VIALL KQG+  EA TL  + + +L S  RDL
Sbjct: 75   YLSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDL 134

Query: 456  ALFYCDLLESISEHGLKQSALETYARL--------------REMPFRXXXXXXXXXXXXA 593
              FYC+L++S S+H   Q   +  +RL              R                 A
Sbjct: 135  VSFYCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEA 194

Query: 594  EDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSC 773
            E+ + EM   G KPS F+FR V+ GYG++GL  +M+  L  M   G  +DTV SN+VLS 
Sbjct: 195  ENLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLSS 254

Query: 774  YGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKK 953
            YG + + SEM SW+ +M+++ I  SIRT NSVLNSCPM++SI+  + ++ P +ID+LM+ 
Sbjct: 255  YGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSIL-QDLKTFPPTIDELMET 313

Query: 954  LEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWM 1133
            L+            +EALLV ELIGS VLA+++EW  S  KLDLHG H+ +AYLI+L W 
Sbjct: 314  LK-----------GDEALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWR 362

Query: 1134 QELRLRF-CEEKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGR 1310
            +ELR R    E ++P+EI+V+CGSG+HS  RG+SP+K +V EMM +T SPM+IDRKN G 
Sbjct: 363  EELRYRLNAAEYVMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGC 422

Query: 1311 FAASGKAVREWLC 1349
            F A  K V+ WLC
Sbjct: 423  FVAKAKVVKNWLC 435


>dbj|GAV86601.1| PPR domain-containing protein [Cephalotus follicularis]
          Length = 451

 Score =  344 bits (883), Expect = e-111
 Identities = 193/433 (44%), Positives = 266/433 (61%), Gaps = 22/433 (5%)
 Frame = +3

Query: 117  RCALACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLS---- 284
            +CA + + +   R  S L+  + +  + A RLI KF+A+S K  AL++LS  +SL     
Sbjct: 32   QCAASLTTR-GHRFISTLAAATNEPQVVAHRLISKFVASSPKSVALNALSHLLSLDTSQP 90

Query: 285  --SPFALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK-SPRDL 455
              S  ALP+Y RISEA WFNWNP L A ++ALL+KQG+ ++++ L  + +++L+   RDL
Sbjct: 91   HLSSLALPLYSRISEAPWFNWNPKLVADLVALLDKQGQYSQSQALIFETISKLQFKERDL 150

Query: 456  ALFYCDLLESI----SEHGLKQSAL----------ETYARLREMPFRXXXXXXXXXXXXA 593
            ALFYC+L+ES     SEHG   S +            Y + +                 A
Sbjct: 151  ALFYCNLIESHAKNKSEHGFNDSYICLNEVIRKSCSVYVKSQGYKSMVSALCEMGEPHEA 210

Query: 594  EDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSC 773
            E+ + EM   G K S F+FR V+ GYG+LGLF +M   +  ME  G  +DTV SN+VLS 
Sbjct: 211  ENVVEEMRVNGLKLSLFEFRCVLYGYGRLGLFEDMLRIVEQMESEGFQVDTVSSNMVLSS 270

Query: 774  YGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKK 953
            YG H  LS+M SW+ +++S GI  S+RT NSVLNSCPM++S++  + +SLPLS+ +L   
Sbjct: 271  YGAHNALSDMLSWLQQLKSLGIPFSVRTYNSVLNSCPMMISML-QDLKSLPLSLKELTVT 329

Query: 954  LEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWM 1133
            L  +           EALLV EL  SPVL   +EW    +KLDLHG H+ +AYLI+L WM
Sbjct: 330  LNND-----------EALLVKELTQSPVLDGAIEWGALEAKLDLHGMHLGSAYLIMLQWM 378

Query: 1134 QELRLRFCEEKL-VPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGR 1310
             E+R RF + K  +P EI ++CGSG+HS  RG+SP+K +V E+M +T SPMRIDRKN G 
Sbjct: 379  DEMRNRFKDGKFALPAEIILVCGSGKHSSVRGESPVKGMVREIMVRTRSPMRIDRKNIGC 438

Query: 1311 FAASGKAVREWLC 1349
            F A GK VR+WLC
Sbjct: 439  FIAKGKVVRDWLC 451


>ref|XP_020092848.1| uncharacterized protein LOC109713258, partial [Ananas comosus]
          Length = 720

 Score =  350 bits (899), Expect = e-110
 Identities = 180/357 (50%), Positives = 239/357 (66%), Gaps = 9/357 (2%)
 Frame = +3

Query: 306  YERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLES 485
            Y RI E SWF+WN  L A V ALLE+ G+  +A  L   +++ ++SPRDLALFYC+L+ES
Sbjct: 371  YARIRETSWFSWNSKLTADVAALLEQLGQCFDAEHLVSSSISTIRSPRDLALFYCNLIES 430

Query: 486  ISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------AEDKLNEMASLGCKPSP 641
             S  GL+Q  ++  + LR +PF                     AE  L EMA LG KPS 
Sbjct: 431  YSGRGLRQKVVDICSSLRNLPFSGRKPYKSMIKGFCLLDMPEEAEANLQEMALLGLKPSA 490

Query: 642  FQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMASWMAK 821
            F+FRL+ Q YG+ G FSEMK  +  ME+ G  +DTVC+N+VLSCYG  G+L EM  W+  
Sbjct: 491  FEFRLIAQSYGKPGSFSEMKRVIGLMEDAGFSVDTVCANVVLSCYGDRGELPEMVEWLKW 550

Query: 822  MRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXXXXNE 1001
            MR  GI  S+RT N+VLNSC ++V +V  +  +LPLSI+ L+ KLE E+A         E
Sbjct: 551  MRELGIDFSVRTFNTVLNSCSVIVGMV-RDLDTLPLSIEQLLDKLESESASVV------E 603

Query: 1002 ALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRF-CEEKLVPL 1178
            A L+ +LI S +L ++LEWSP+  KLDLHGFH T+A++I+L ++ ELR R   E  +VP 
Sbjct: 604  AALIRKLIDSALLVEMLEWSPAEGKLDLHGFHATSAFVIMLQFVDELRSRLSAENAVVPA 663

Query: 1179 EISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWLC 1349
            EISV+CGSG+HS+ RG+SP+K +VSEM+F+TNS MR+DRKN GRF   GKAV+EWLC
Sbjct: 664  EISVVCGSGKHSDVRGRSPVKMVVSEMLFRTNSVMRLDRKNSGRFVGRGKAVKEWLC 720


>gb|OMO91929.1| hypothetical protein COLO4_18015 [Corchorus olitorius]
          Length = 467

 Score =  337 bits (864), Expect = e-108
 Identities = 188/427 (44%), Positives = 265/427 (62%), Gaps = 25/427 (5%)
 Frame = +3

Query: 141  KHADRLFSALS-TVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLS------SPFAL 299
            K   R FS+L+ T   +DP AA+R+I+KF+A+S K  AL++LS  +S        S  A 
Sbjct: 50   KQGQRFFSSLAATAGVNDPAAANRIIKKFVASSPKAIALNALSHLLSTRNSHPHLSAIAF 109

Query: 300  PIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK-SPRDLALFYCDL 476
            P+Y +I+E SWF+WNP L A +IALL+KQGR  E   L   AV++LK   RDL  FYC+L
Sbjct: 110  PLYTKITETSWFDWNPKLVADLIALLDKQGRYDETEALISQAVSKLKFRERDLVQFYCNL 169

Query: 477  LESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDKLNEM 614
            +ES S+H  KQ   + Y  L E+                             AE+   EM
Sbjct: 170  IESCSKHDSKQGFNDAYGYLSELVRNSSSLYVKRQGYKSLVSSFCEMGQPNEAENVFEEM 229

Query: 615  ASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQL 794
               G KPS F+FR +I GYG++G F +M+  +  ME  G  +DT+CSN+VLS YG +  L
Sbjct: 230  RKNGVKPSSFEFRFIIYGYGKMGFFEDMERMVSEMEIAGFEVDTICSNMVLSSYGDYNAL 289

Query: 795  SEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAX 974
            ++M SW+ KM++  I  S+RT NSVLNSCP ++S+V  +  +LPLS+ +L+K L+E+   
Sbjct: 290  AKMVSWLQKMKTLQIPFSVRTYNSVLNSCPGIMSLV-QDINNLPLSLGELVKVLKED--- 345

Query: 975  XXXXXXXNEALLVGELI-GSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLR 1151
                    EALLV EL+  S VL + +E   S +KLDLHG H+ +AYLI+L W++E++ R
Sbjct: 346  --------EALLVKELVESSAVLDNAVECDVSEAKLDLHGMHLGSAYLIMLQWIEEMKCR 397

Query: 1152 F-CEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASG 1325
            F  EEK ++P +I+++CGSG+HS  RG+SP+K+L+ +MM Q  SPM+IDRKN G F A G
Sbjct: 398  FKAEEKCVIPAQITIVCGSGKHSSVRGESPVKSLLKKMMVQMKSPMKIDRKNIGCFTAKG 457

Query: 1326 KAVREWL 1346
              V+ WL
Sbjct: 458  HVVKNWL 464


>ref|NP_001324067.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis
            thaliana]
 gb|ANM61875.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis
            thaliana]
          Length = 470

 Score =  335 bits (860), Expect = e-107
 Identities = 199/426 (46%), Positives = 257/426 (60%), Gaps = 23/426 (5%)
 Frame = +3

Query: 141  KHADRLFSALSTVS-TDDPLAADRLIRKFLAASSKPAALHSLSRFIS--LSSP----FAL 299
            KH DR  S+LS+ +   DP A +R I+KF+AAS K  AL+ LS  +S   S P    FAL
Sbjct: 56   KHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFAL 115

Query: 300  PIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKS-PRDLALFYCDL 476
             +Y  I+EASWF+WNP L A +IALL KQ R  E+ TL   AV+RLKS  RD  LF C+L
Sbjct: 116  SLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNL 175

Query: 477  LESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDKLNEM 614
            +ES S+ G  Q   E   RLRE+  R                          AE  + EM
Sbjct: 176  VESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEM 235

Query: 615  ASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQL 794
                 KP  F+++ V+ GYG+LGLF +M   +  M   G  IDTVCSN+VLS YG H  L
Sbjct: 236  RMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDAL 295

Query: 795  SEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAX 974
             +M SW+ K++   +  SIRT NSVLNSCP ++S++  +  S P+S+ +L   L E+   
Sbjct: 296  PQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISML-KDLDSCPVSLSELRTFLNED--- 351

Query: 975  XXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRF 1154
                    EALLV EL  S VL + +EW+    KLDLHG H++++YLILL WM E RLRF
Sbjct: 352  --------EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRF 403

Query: 1155 CEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKA 1331
             EEK ++P EI V+ GSG+HS  RG+SP+K LV ++M +T SPMRIDRKN G F A GK 
Sbjct: 404  SEEKCVIPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKT 463

Query: 1332 VREWLC 1349
            V+EWLC
Sbjct: 464  VKEWLC 469


>ref|XP_015893244.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Ziziphus jujuba]
          Length = 450

 Score =  334 bits (857), Expect = e-107
 Identities = 189/439 (43%), Positives = 270/439 (61%), Gaps = 22/439 (5%)
 Frame = +3

Query: 99   PSAATTRCALACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFIS 278
            PS+++ +CAL+  G    R  S LS V+  DP A+ +LI KF+ +SSK  AL++LS  +S
Sbjct: 28   PSSSSIKCALSKQGL---RFISTLS-VNAGDPSASAKLIGKFVGSSSKSIALNALSHLLS 83

Query: 279  LSSP------FALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK 440
              +        ALP+Y +I EASWF  NP L A++ ALL+KQGR +E+ TL  + +A L 
Sbjct: 84   PDTTHPHLTSLALPLYSKIKEASWFERNPKLVAAMAALLDKQGRHSESETLISETIAELG 143

Query: 441  S-PRDLALFYCDLLESISE--------------HGLKQSALETYARLREMPFRXXXXXXX 575
            +  R+LALFYC L+ES S+              H L  ++   Y + R +          
Sbjct: 144  NRERELALFYCQLVESHSKQNSGHGFERSYTYLHHLLHNSSSVYVKRRALESMVGGLCTM 203

Query: 576  XXXXXAEDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCS 755
                 AE  + EM  +G KPS F+ R V+ GYG+LGL  EM   ++ M+  G+ IDT+ S
Sbjct: 204  DRPIEAESLIEEMRVVGLKPSVFELRSVMYGYGRLGLLKEMLRIVQQMDNGGLAIDTISS 263

Query: 756  NIVLSCYGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSI 935
            N+VLS  G H +LSEM  W+ KM++  I  S RT N+VLNSCP ++ I+  N+  +P SI
Sbjct: 264  NMVLSSLGIHNELSEMVLWLRKMKTFNIPFSTRTYNTVLNSCPTIMEIL-QNSDHIPFSI 322

Query: 936  DDLMKKLEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYL 1115
            ++L   L+            +EALLV EL+GS VL ++++W    +KLDLHG H+ +AYL
Sbjct: 323  EELKGVLK-----------GDEALLVDELVGSGVLKEVMKWDSLEAKLDLHGLHLGSAYL 371

Query: 1116 ILLNWMQELRLRFCEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRID 1292
            I+L WM+E++ RF  EK ++P E++V+CG G+HS  RG SP+K ++ EMM +T SPMRID
Sbjct: 372  IMLEWMEEMKCRFNNEKHVLPAEVTVVCGVGKHSNFRGVSPVKVMIKEMMARTRSPMRID 431

Query: 1293 RKNPGRFAASGKAVREWLC 1349
            RKN G F A G+AV++WLC
Sbjct: 432  RKNAGCFIAKGRAVKDWLC 450


>ref|XP_012077696.1| pentatricopeptide repeat-containing protein At2g17033 [Jatropha
            curcas]
          Length = 473

 Score =  335 bits (859), Expect = e-107
 Identities = 191/432 (44%), Positives = 262/432 (60%), Gaps = 26/432 (6%)
 Frame = +3

Query: 129  ACSGKHADRLFSALSTVSTD-DPLAADRLIRKFLAASSKPAALHSLSRFISLSSPF---- 293
            A   K   R  S+L+T +   D  A + LI+KF+AAS K  AL +LS  +S +S +    
Sbjct: 52   AALSKQGQRFLSSLATATAARDNSATNSLIKKFVAASPKSIALDALSHLLSPNSSYSHLS 111

Query: 294  --ALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK-SPRDLALF 464
              A P+Y +I EA WF+WNP L A V+ALL+KQG+  E+ TL  D++++LK   RDLALF
Sbjct: 112  SLAFPLYLKIQEAHWFDWNPKLVAEVVALLDKQGQYNESGTLISDSISKLKLRERDLALF 171

Query: 465  YCDLLESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDK 602
            YC+L+ES S+    Q   +++ARL ++ F                           A+D 
Sbjct: 172  YCNLVESHSKQNCVQGFEDSFARLNQLVFSSNSVYIKKQAYKSMISGLCEMGRPKEAQDL 231

Query: 603  LNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQ 782
            + EM   G KPS ++FR V+  YG+LGLF EM+  L  ME  G  +DTVCSN+VLS YG 
Sbjct: 232  IEEMRGKGVKPSVYEFRCVLHAYGKLGLFQEMQMILDQMESGGFKVDTVCSNMVLSSYGV 291

Query: 783  HGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIV-SSNTRSLPLSIDDLMKKLE 959
            +  L E+ SW+ KM+  GI  S RT NSVLNSCP ++S V +SN  + P+SI +LMK L 
Sbjct: 292  YNALPEIVSWLKKMKDLGIPFSSRTCNSVLNSCPTMMSTVQNSNANTYPISIQELMKILR 351

Query: 960  EETAXXXXXXXXNEALLVGELI--GSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWM 1133
                        +EA++V ELI   S VL + ++W    SKLDLHG H+ +AYLI+L W 
Sbjct: 352  -----------GDEAMVVNELIIGSSSVLEEAMQWDALESKLDLHGMHLCSAYLIMLLWF 400

Query: 1134 QELRLRF-CEEKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGR 1310
            +E++ RF     ++P EI+V+CGSG HS  RG+SP+K ++  +M QT SPMR+DRKN G 
Sbjct: 401  EEMKNRFNGGNYVIPAEITVVCGSGNHSIVRGESPVKRMIKSIMVQTRSPMRVDRKNLGC 460

Query: 1311 FAASGKAVREWL 1346
            F A GK V+EWL
Sbjct: 461  FIAKGKVVKEWL 472


>gb|OAP09950.1| hypothetical protein AXX17_AT2G12140 [Arabidopsis thaliana]
          Length = 470

 Score =  335 bits (858), Expect = e-107
 Identities = 199/426 (46%), Positives = 257/426 (60%), Gaps = 23/426 (5%)
 Frame = +3

Query: 141  KHADRLFSALSTVS-TDDPLAADRLIRKFLAASSKPAALHSLSRFIS--LSSP----FAL 299
            KH DR  S+LS+ +   DP A +R I+KF+AAS K  AL+ LS  +S   S P    FAL
Sbjct: 56   KHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFAL 115

Query: 300  PIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKS-PRDLALFYCDL 476
             +Y  I+EASWF+WNP L A +IALL KQ R  E+ TL   AV+RLKS  RD  LF C+L
Sbjct: 116  SLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNL 175

Query: 477  LESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDKLNEM 614
            +ES S+ G  Q   E   RLRE+  R                          AE  + EM
Sbjct: 176  VESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEM 235

Query: 615  ASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQL 794
                 KP  F+++ V+ GYG+LGLF +M   +  M   G  IDTVCSN+VLS YG H  L
Sbjct: 236  RMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDAL 295

Query: 795  SEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAX 974
             +M SW+ K++   +  SIRT NSVLNSCP ++S++  +  S P+S+ +L   L E+   
Sbjct: 296  PQMGSWLQKLKGFNVLFSIRTYNSVLNSCPTIISML-KDLDSCPVSLSELRTFLNED--- 351

Query: 975  XXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRF 1154
                    EALLV EL  S VL + +EW+    KLDLHG H++++YLILL WM E RLRF
Sbjct: 352  --------EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRF 403

Query: 1155 CEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKA 1331
             EEK ++P EI V+ GSG+HS  RG+SP+K LV ++M +T SPMRIDRKN G F A GK 
Sbjct: 404  SEEKCVIPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKT 463

Query: 1332 VREWLC 1349
            V+EWLC
Sbjct: 464  VKEWLC 469


>ref|NP_001324066.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis
            thaliana]
 dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
 gb|ANM61874.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis
            thaliana]
          Length = 501

 Score =  335 bits (860), Expect = e-107
 Identities = 199/426 (46%), Positives = 257/426 (60%), Gaps = 23/426 (5%)
 Frame = +3

Query: 141  KHADRLFSALSTVS-TDDPLAADRLIRKFLAASSKPAALHSLSRFIS--LSSP----FAL 299
            KH DR  S+LS+ +   DP A +R I+KF+AAS K  AL+ LS  +S   S P    FAL
Sbjct: 87   KHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFAL 146

Query: 300  PIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKS-PRDLALFYCDL 476
             +Y  I+EASWF+WNP L A +IALL KQ R  E+ TL   AV+RLKS  RD  LF C+L
Sbjct: 147  SLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNL 206

Query: 477  LESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDKLNEM 614
            +ES S+ G  Q   E   RLRE+  R                          AE  + EM
Sbjct: 207  VESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEM 266

Query: 615  ASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQL 794
                 KP  F+++ V+ GYG+LGLF +M   +  M   G  IDTVCSN+VLS YG H  L
Sbjct: 267  RMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDAL 326

Query: 795  SEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAX 974
             +M SW+ K++   +  SIRT NSVLNSCP ++S++  +  S P+S+ +L   L E+   
Sbjct: 327  PQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISML-KDLDSCPVSLSELRTFLNED--- 382

Query: 975  XXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRF 1154
                    EALLV EL  S VL + +EW+    KLDLHG H++++YLILL WM E RLRF
Sbjct: 383  --------EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRF 434

Query: 1155 CEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKA 1331
             EEK ++P EI V+ GSG+HS  RG+SP+K LV ++M +T SPMRIDRKN G F A GK 
Sbjct: 435  SEEKCVIPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKT 494

Query: 1332 VREWLC 1349
            V+EWLC
Sbjct: 495  VKEWLC 500


>ref|NP_565402.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis
            thaliana]
 gb|AAK44016.1|AF370201_1 unknown protein [Arabidopsis thaliana]
 gb|AAM44931.1| unknown protein [Arabidopsis thaliana]
 gb|AEC06575.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis
            thaliana]
          Length = 504

 Score =  335 bits (860), Expect = e-107
 Identities = 199/426 (46%), Positives = 257/426 (60%), Gaps = 23/426 (5%)
 Frame = +3

Query: 141  KHADRLFSALSTVS-TDDPLAADRLIRKFLAASSKPAALHSLSRFIS--LSSP----FAL 299
            KH DR  S+LS+ +   DP A +R I+KF+AAS K  AL+ LS  +S   S P    FAL
Sbjct: 90   KHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFAL 149

Query: 300  PIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKS-PRDLALFYCDL 476
             +Y  I+EASWF+WNP L A +IALL KQ R  E+ TL   AV+RLKS  RD  LF C+L
Sbjct: 150  SLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNL 209

Query: 477  LESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDKLNEM 614
            +ES S+ G  Q   E   RLRE+  R                          AE  + EM
Sbjct: 210  VESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEM 269

Query: 615  ASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQL 794
                 KP  F+++ V+ GYG+LGLF +M   +  M   G  IDTVCSN+VLS YG H  L
Sbjct: 270  RMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDAL 329

Query: 795  SEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAX 974
             +M SW+ K++   +  SIRT NSVLNSCP ++S++  +  S P+S+ +L   L E+   
Sbjct: 330  PQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISML-KDLDSCPVSLSELRTFLNED--- 385

Query: 975  XXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRF 1154
                    EALLV EL  S VL + +EW+    KLDLHG H++++YLILL WM E RLRF
Sbjct: 386  --------EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRF 437

Query: 1155 CEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKA 1331
             EEK ++P EI V+ GSG+HS  RG+SP+K LV ++M +T SPMRIDRKN G F A GK 
Sbjct: 438  SEEKCVIPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKT 497

Query: 1332 VREWLC 1349
            V+EWLC
Sbjct: 498  VKEWLC 503


Top