BLASTX nr result

ID: Magnolia22_contig00016899 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Magnolia22_contig00016899
         (1024 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_017697354.1 PREDICTED: pentatricopeptide repeat-containing pr...   573   0.0  
XP_010932394.2 PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide...   568   0.0  
XP_018845304.1 PREDICTED: pentatricopeptide repeat-containing pr...   538   0.0  
OAY84201.1 Pentatricopeptide repeat-containing protein, chloropl...   548   0.0  
XP_010266404.1 PREDICTED: pentatricopeptide repeat-containing pr...   556   0.0  
XP_009381612.1 PREDICTED: pentatricopeptide repeat-containing pr...   549   0.0  
XP_020090005.1 pentatricopeptide repeat-containing protein At3g1...   548   e-180
CBI26570.3 unnamed protein product, partial [Vitis vinifera]          531   e-178
XP_010662151.1 PREDICTED: pentatricopeptide repeat-containing pr...   538   e-177
ONK61154.1 uncharacterized protein A4U43_C08F26790 [Asparagus of...   530   e-176
JAT59381.1 Pentatricopeptide repeat-containing protein At3g18110...   536   e-176
OAY44607.1 hypothetical protein MANES_08G165200 [Manihot esculenta]   534   e-175
XP_015878584.1 PREDICTED: pentatricopeptide repeat-containing pr...   531   e-174
KMZ57512.1 putative Pentatricopeptide repeat-containing protein ...   528   e-173
XP_006491807.1 PREDICTED: pentatricopeptide repeat-containing pr...   527   e-173
CAN76112.1 hypothetical protein VITISV_005527 [Vitis vinifera]        526   e-172
XP_010103833.1 hypothetical protein L484_024135 [Morus notabilis...   524   e-171
XP_012090946.1 PREDICTED: pentatricopeptide repeat-containing pr...   521   e-171
XP_006372940.1 hypothetical protein POPTR_0017s06420g [Populus t...   520   e-170
XP_011026363.1 PREDICTED: pentatricopeptide repeat-containing pr...   520   e-170

>XP_017697354.1 PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic isoform X1 [Phoenix dactylifera]
            XP_008784335.2 PREDICTED: pentatricopeptide
            repeat-containing protein At3g18110, chloroplastic
            isoform X1 [Phoenix dactylifera] XP_017697355.1
            PREDICTED: pentatricopeptide repeat-containing protein
            At3g18110, chloroplastic isoform X1 [Phoenix dactylifera]
            XP_008784336.2 PREDICTED: pentatricopeptide
            repeat-containing protein At3g18110, chloroplastic
            isoform X1 [Phoenix dactylifera]
          Length = 1463

 Score =  573 bits (1478), Expect = 0.0
 Identities = 281/341 (82%), Positives = 313/341 (91%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E++L+LMK+ G+EPT+ATMHMLMVS+GTAGQPQEAE VLNNL++SGL+LST+PYSSVIDA
Sbjct: 1067 EHLLSLMKKDGIEPTIATMHMLMVSYGTAGQPQEAENVLNNLKSSGLDLSTLPYSSVIDA 1126

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGDYNLGI KL EMK+DGVEPD  IWTCFIRAASLC+  +EA +LLNS+ DTGFDLP
Sbjct: 1127 YLKNGDYNLGIMKLLEMKRDGVEPDHRIWTCFIRAASLCEKTNEAMVLLNSLSDTGFDLP 1186

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EK  SLV EVD LLE+LG  E+N +FNFVN+LEDLLWA+ERRATA+W+FQLAI+K
Sbjct: 1187 IRLLTEKAGSLVMEVDHLLEELGPMEDNASFNFVNALEDLLWAYERRATASWIFQLAIKK 1246

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +YRHDVFRVA+KDWGADFRKLSAGAALVGLTLWLDHMQDASL G PES KSVVLITGTA
Sbjct: 1247 SIYRHDVFRVAEKDWGADFRKLSAGAALVGLTLWLDHMQDASLHGSPESPKSVVLITGTA 1306

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSL+ TLKAYLWEMGSPFLP K RSG+L+AKAHSLRMWLKDS FCMDLELKDALSL
Sbjct: 1307 EYNMVSLNNTLKAYLWEMGSPFLPCKTRSGVLVAKAHSLRMWLKDSSFCMDLELKDALSL 1366

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+SNSM L+EGYFMRAGLVP FKDIHERLG+VRPKKFARLA
Sbjct: 1367 PESNSMKLTEGYFMRAGLVPAFKDIHERLGEVRPKKFARLA 1407


>XP_010932394.2 PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At3g18110, chloroplastic [Elaeis guineensis]
          Length = 1464

 Score =  568 bits (1464), Expect = 0.0
 Identities = 278/341 (81%), Positives = 312/341 (91%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E++L+LMK+ G+EPT+ATMHMLMVS+G+AGQPQEAE VLNNL++SGL+LST+PYSSVIDA
Sbjct: 1068 EHLLSLMKKDGIEPTIATMHMLMVSYGSAGQPQEAENVLNNLKSSGLDLSTLPYSSVIDA 1127

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGDYNLGI KL EMK+DGVEPD  IWTCFIRAASLC+  +EA +LLNS+ D GFDLP
Sbjct: 1128 YLKNGDYNLGIMKLLEMKRDGVEPDHRIWTCFIRAASLCEKTNEAMVLLNSLCDIGFDLP 1187

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EKT SLV +VD LL++LG  E+N  FNFVN+LEDLLWA+ERRATA+W+FQLAI+K
Sbjct: 1188 IRLLTEKTGSLVMKVDCLLDELGPMEDNACFNFVNALEDLLWAYERRATASWIFQLAIKK 1247

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +YRHDVFRVA+KDWGADFRKLSAGAALVGLTLWLDH+QDASLQG PES KSVVLITGTA
Sbjct: 1248 NIYRHDVFRVAEKDWGADFRKLSAGAALVGLTLWLDHLQDASLQGSPESPKSVVLITGTA 1307

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSL+ TLKAYLWEMGSPFLP K RSG+L+AKAHSLRMWLKDS FCMDLELKDA SL
Sbjct: 1308 EYNMVSLNNTLKAYLWEMGSPFLPCKTRSGVLVAKAHSLRMWLKDSSFCMDLELKDASSL 1367

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+SNSM LSEGYFMRAGLVP FKDIHERLG+VRPKKFARLA
Sbjct: 1368 PESNSMKLSEGYFMRAGLVPAFKDIHERLGEVRPKKFARLA 1408


>XP_018845304.1 PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like [Juglans regia]
          Length = 637

 Score =  538 bits (1386), Expect = 0.0
 Identities = 264/341 (77%), Positives = 303/341 (88%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E +L++MKEAGVEPT+ATMH+LMVS+G++GQP EAEKVL+NL+ +GL+L T+PYSSVIDA
Sbjct: 240  EKLLSIMKEAGVEPTIATMHLLMVSYGSSGQPHEAEKVLDNLKATGLSLDTLPYSSVIDA 299

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            Y+KNGDYN GI+KL EMK++GVEPD  IWTCFIRAASLC+  SE  +LLN++RD GFDLP
Sbjct: 300  YVKNGDYNAGIQKLMEMKEEGVEPDHRIWTCFIRAASLCERTSEVLVLLNALRDAGFDLP 359

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLLMEK++SLV EVD  LEKL    +N AFNFVN+LEDLLWAFE RATA+WVFQLAI++
Sbjct: 360  IRLLMEKSESLVSEVDNCLEKLEPMGDNAAFNFVNALEDLLWAFELRATASWVFQLAIKR 419

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +Y H+VFRVADKDWGADFRKLSAG+ALVGLTLWLDHMQDASLQG PES KSVVLITGTA
Sbjct: 420  NIYCHNVFRVADKDWGADFRKLSAGSALVGLTLWLDHMQDASLQGYPESPKSVVLITGTA 479

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSL+ TLKA LWEMGSPFLP + RSG+L+AKAHSLRMWLKDSPFC+DLELKDA SL
Sbjct: 480  EYNMVSLNSTLKACLWEMGSPFLPCRTRSGLLVAKAHSLRMWLKDSPFCLDLELKDATSL 539

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+SNSM L EG F+R GLVP FKDI +RLG VRPKKFARLA
Sbjct: 540  PESNSMKLVEGCFIRRGLVPAFKDITDRLGLVRPKKFARLA 580


>OAY84201.1 Pentatricopeptide repeat-containing protein, chloroplastic [Ananas
            comosus]
          Length = 993

 Score =  548 bits (1412), Expect = 0.0
 Identities = 268/341 (78%), Positives = 306/341 (89%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            + +L LMK+ G+EPT+ATMHMLMVS+GTAGQPQEAE VLNNL+TSGL LS++PYSSVIDA
Sbjct: 598  QRLLVLMKDDGIEPTIATMHMLMVSYGTAGQPQEAENVLNNLKTSGLELSSLPYSSVIDA 657

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGDYNLGI KL EMK DG+EPD  IWTCFIRAASLC+  ++A +LLN++ +TGFDLP
Sbjct: 658  YLKNGDYNLGIAKLLEMKGDGLEPDHRIWTCFIRAASLCEQTNQAVMLLNALGNTGFDLP 717

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EKT  +V EVD LLE+L   E+N  FNFVN+LEDLLWAFERRATA+W+FQLAI++
Sbjct: 718  IRLLTEKTGPMVLEVDRLLEELVLLEDNACFNFVNALEDLLWAFERRATASWIFQLAIKR 777

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +Y HDVFRVA+KDWGADFRKLSAGAALVGLTLWLD+MQDASLQG PES KSVVLITGTA
Sbjct: 778  NIYHHDVFRVAEKDWGADFRKLSAGAALVGLTLWLDNMQDASLQGSPESPKSVVLITGTA 837

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSL  TLKAYLWEMGSPFLP K R+G+L+AKAHSLRMWLKDS FC+DLELKDAL+L
Sbjct: 838  EYNMVSLSNTLKAYLWEMGSPFLPCKTRTGVLVAKAHSLRMWLKDSSFCVDLELKDALAL 897

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P++NSM L+EG+FMRAGLVP FKDI+ERLGQVRPKKFARLA
Sbjct: 898  PETNSMKLTEGFFMRAGLVPAFKDINERLGQVRPKKFARLA 938


>XP_010266404.1 PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic isoform X1 [Nelumbo nucifera]
          Length = 1488

 Score =  556 bits (1434), Expect = 0.0
 Identities = 274/340 (80%), Positives = 310/340 (91%)
 Frame = -2

Query: 1020 NVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDAY 841
            NV A+MKEAG+EP++ATMHML+VS+G+AG+P+EAE VLNNL+ SGLNL+T+PYSSVIDAY
Sbjct: 1082 NVFAMMKEAGLEPSIATMHMLIVSYGSAGEPKEAENVLNNLKASGLNLTTLPYSSVIDAY 1141

Query: 840  LKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLPI 661
            LKNGDYNLGIEKL EMKKDG+EPD  IWTCF RAASLCQ  SEA  LLNS+RD+GFDLPI
Sbjct: 1142 LKNGDYNLGIEKLLEMKKDGLEPDHRIWTCFTRAASLCQQTSEAIFLLNSLRDSGFDLPI 1201

Query: 660  RLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRKG 481
            R+L EK++SLV EVD LLE+L   E+N AFNFVN+LEDLLWAFE RATA+WVFQLAIR+ 
Sbjct: 1202 RILTEKSESLVNEVDHLLEQLEPLEDNAAFNFVNALEDLLWAFECRATASWVFQLAIRRH 1261

Query: 480  VYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTAE 301
            +Y HDVFRV++KDWGADFRKLS GAALVGLTLWLDHMQDASLQG PES KSVVLITGTAE
Sbjct: 1262 IYCHDVFRVSEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVVLITGTAE 1321

Query: 300  YNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSLP 121
            YNMVSL+KTLKAYLWEMGSPFLP K R+G+LIAKAHSLRMWLKDSPFC+DLELK+A SLP
Sbjct: 1322 YNMVSLNKTLKAYLWEMGSPFLPCKTRTGLLIAKAHSLRMWLKDSPFCLDLELKNAPSLP 1381

Query: 120  KSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            +SNSM L EGYFMR+GLVPVFK+IH++LGQV PKKFARLA
Sbjct: 1382 ESNSMQLYEGYFMRSGLVPVFKEIHDQLGQVTPKKFARLA 1421


>XP_009381612.1 PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic isoform X1 [Musa acuminata subsp.
            malaccensis] XP_009381613.1 PREDICTED: pentatricopeptide
            repeat-containing protein At3g18110, chloroplastic
            isoform X1 [Musa acuminata subsp. malaccensis]
            XP_018674760.1 PREDICTED: pentatricopeptide
            repeat-containing protein At3g18110, chloroplastic
            isoform X1 [Musa acuminata subsp. malaccensis]
          Length = 1468

 Score =  549 bits (1414), Expect = 0.0
 Identities = 267/341 (78%), Positives = 303/341 (88%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            EN+L  M+E G++PT+ATMHMLMVS+G+AGQPQEAE VLNNLR+S   L+T+PYSSVIDA
Sbjct: 1073 ENLLFQMEEVGIKPTIATMHMLMVSYGSAGQPQEAENVLNNLRSSSQELTTLPYSSVIDA 1132

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLK GDYN+GI KL EMKKDGVEPD  IWTCFIRAASLC+  +EA LLL ++ + GFD+P
Sbjct: 1133 YLKVGDYNMGITKLMEMKKDGVEPDHRIWTCFIRAASLCEKTNEAMLLLGTLGNNGFDIP 1192

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL  K +SL  EVD LLE+LGS E+N +FNFVN+LEDLLWAFERRATA W+FQLAI +
Sbjct: 1193 IRLLTGKAESLFMEVDHLLEELGSLEDNASFNFVNALEDLLWAFERRATALWIFQLAITR 1252

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +YRHDVFRVA+KDWGADFRK+SAGA+LVGLTLWLDHMQDASLQG PES KSVVLITGTA
Sbjct: 1253 NIYRHDVFRVAEKDWGADFRKMSAGASLVGLTLWLDHMQDASLQGSPESPKSVVLITGTA 1312

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSL+KTLKAYLWEMGSPFLP K RSG+L+AKAHSLRMWLKDS FC+DLELKD  SL
Sbjct: 1313 EYNMVSLEKTLKAYLWEMGSPFLPCKTRSGVLVAKAHSLRMWLKDSSFCLDLELKDTTSL 1372

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P++NSM L+EGYFMRAGLVP FKDIHERLGQ+RPKKFARLA
Sbjct: 1373 PQTNSMKLTEGYFMRAGLVPAFKDIHERLGQIRPKKFARLA 1413


>XP_020090005.1 pentatricopeptide repeat-containing protein At3g18110, chloroplastic
            [Ananas comosus] XP_020090006.1 pentatricopeptide
            repeat-containing protein At3g18110, chloroplastic
            [Ananas comosus] XP_020090007.1 pentatricopeptide
            repeat-containing protein At3g18110, chloroplastic
            [Ananas comosus]
          Length = 1474

 Score =  548 bits (1411), Expect = e-180
 Identities = 268/341 (78%), Positives = 306/341 (89%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            +++L LMK+ G+EPT+ATMHMLMVS+GTAGQPQEAE VLNNL+TSGL LS++PYSSVIDA
Sbjct: 1079 QHLLVLMKDDGIEPTIATMHMLMVSYGTAGQPQEAENVLNNLKTSGLELSSLPYSSVIDA 1138

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGDYNLGI KL EMK DG+EPD  IWTCFIRAASLC+  ++A +LLN++ +TGFDLP
Sbjct: 1139 YLKNGDYNLGIAKLLEMKGDGLEPDHRIWTCFIRAASLCEQTNQAVMLLNALGNTGFDLP 1198

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EKT  +V EVD LLE+L   E+N  FNFVN+LEDLLWAFERRATA+W+FQLAI++
Sbjct: 1199 IRLLTEKTGPMVLEVDRLLEELVLLEDNACFNFVNALEDLLWAFERRATASWIFQLAIKR 1258

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +Y HDVFRVA+KDWGADFRKLSAGAALVGLTLWLD+MQDASLQG PES KSVVLITGTA
Sbjct: 1259 NIYHHDVFRVAEKDWGADFRKLSAGAALVGLTLWLDNMQDASLQGSPESPKSVVLITGTA 1318

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSL  TLKAYLWEMGSPFLP K R+G+L+AKAHSLRMWLKDS FC+DLELKDAL+L
Sbjct: 1319 EYNMVSLSNTLKAYLWEMGSPFLPCKTRTGVLVAKAHSLRMWLKDSSFCVDLELKDALAL 1378

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+ NSM L+EG+FMRAGLVP FKDI+ERLGQVRPKKFARLA
Sbjct: 1379 PEMNSMKLTEGFFMRAGLVPAFKDINERLGQVRPKKFARLA 1419



 Score = 59.3 bits (142), Expect = 6e-06
 Identities = 43/186 (23%), Positives = 80/186 (43%), Gaps = 8/186 (4%)
 Frame = -2

Query: 999 EAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDAYLKNGDYN 820
           + G  P   T + L+  F   G   + E+V   +  +G     + Y+++I  Y K G  +
Sbjct: 388 DKGFTPDAVTYNSLLYGFAKEGNVDKVERVCEEMVKAGFKKDEITYNTIIHMYGKQGRID 447

Query: 819 LGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLPIRLLME-- 646
           + +E   EMK +G  PD + +T  I +      + EA  ++N M + G    +R      
Sbjct: 448 VALELYDEMKSEGCSPDSVTYTVIIDSLGKADRIGEAGKVMNDMVEAGVKPTLRTFSALI 507

Query: 645 ---KTDSLVPEVDLLLE---KLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
                  +  E +   +   +LG   +NLA++    + D+L  F     A  +++  ++ 
Sbjct: 508 CGYAKSGMRVEAERTFDHMIRLGIKPDNLAYSV---MLDILLRFGEIRKAMPLYRAMVKD 564

Query: 483 GVYRHD 466
           G YR D
Sbjct: 565 G-YRPD 569


>CBI26570.3 unnamed protein product, partial [Vitis vinifera]
          Length = 1042

 Score =  531 bits (1368), Expect = e-178
 Identities = 265/349 (75%), Positives = 302/349 (86%), Gaps = 8/349 (2%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E +L +MKEAGVEPT+ATMH+LMVS+  +GQP+EAEKVL+NL+  GL LST+PYSSVIDA
Sbjct: 629  EKLLGVMKEAGVEPTIATMHLLMVSYSGSGQPEEAEKVLDNLKVEGLPLSTLPYSSVIDA 688

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGD+N+ I+KL EMKKDG+EPD  IWTCF+RAASL Q+ SEA +LL ++RDTGFDLP
Sbjct: 689  YLKNGDHNVAIQKLMEMKKDGLEPDHRIWTCFVRAASLSQHTSEAIVLLKALRDTGFDLP 748

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EK+DSLV EVD  LEKLG  E+N AFNFVN+LEDLLWAFE RATA+WVFQLA+++
Sbjct: 749  IRLLTEKSDSLVSEVDNCLEKLGPLEDNAAFNFVNALEDLLWAFELRATASWVFQLAVKR 808

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHM--------QDASLQGLPESAKS 328
             +YRHDVFRVA+KDWGADFRK+SAG+ALVGLTLWLDHM        QDASLQG P S KS
Sbjct: 809  SIYRHDVFRVAEKDWGADFRKMSAGSALVGLTLWLDHMQAKYFYFWQDASLQGYPLSPKS 868

Query: 327  VVLITGTAEYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDL 148
            VVLITGTAEYNMVSL+ TLKA+LWEMGSPFLP K RSG+L+AKAHSLRMWLKDS FC+DL
Sbjct: 869  VVLITGTAEYNMVSLNSTLKAFLWEMGSPFLPCKTRSGLLVAKAHSLRMWLKDSSFCLDL 928

Query: 147  ELKDALSLPKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            ELKDA SLP+SNSM L EG F+R GLVP FKDI ERLG VRPKKFARLA
Sbjct: 929  ELKDAPSLPESNSMQLMEGCFLRRGLVPAFKDITERLGDVRPKKFARLA 977


>XP_010662151.1 PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic isoform X1 [Vitis vinifera]
          Length = 1478

 Score =  538 bits (1387), Expect = e-177
 Identities = 265/341 (77%), Positives = 302/341 (88%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E +L +MKEAGVEPT+ATMH+LMVS+  +GQP+EAEKVL+NL+  GL LST+PYSSVIDA
Sbjct: 1083 EKLLGVMKEAGVEPTIATMHLLMVSYSGSGQPEEAEKVLDNLKVEGLPLSTLPYSSVIDA 1142

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGD+N+ I+KL EMKKDG+EPD  IWTCF+RAASL Q+ SEA +LL ++RDTGFDLP
Sbjct: 1143 YLKNGDHNVAIQKLMEMKKDGLEPDHRIWTCFVRAASLSQHTSEAIVLLKALRDTGFDLP 1202

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EK+DSLV EVD  LEKLG  E+N AFNFVN+LEDLLWAFE RATA+WVFQLA+++
Sbjct: 1203 IRLLTEKSDSLVSEVDNCLEKLGPLEDNAAFNFVNALEDLLWAFELRATASWVFQLAVKR 1262

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +YRHDVFRVA+KDWGADFRK+SAG+ALVGLTLWLDHMQDASLQG P S KSVVLITGTA
Sbjct: 1263 SIYRHDVFRVAEKDWGADFRKMSAGSALVGLTLWLDHMQDASLQGYPLSPKSVVLITGTA 1322

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSL+ TLKA+LWEMGSPFLP K RSG+L+AKAHSLRMWLKDS FC+DLELKDA SL
Sbjct: 1323 EYNMVSLNSTLKAFLWEMGSPFLPCKTRSGLLVAKAHSLRMWLKDSSFCLDLELKDAPSL 1382

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+SNSM L EG F+R GLVP FKDI ERLG VRPKKFARLA
Sbjct: 1383 PESNSMQLMEGCFLRRGLVPAFKDITERLGDVRPKKFARLA 1423


>ONK61154.1 uncharacterized protein A4U43_C08F26790 [Asparagus officinalis]
          Length = 1215

 Score =  530 bits (1365), Expect = e-176
 Identities = 261/341 (76%), Positives = 293/341 (85%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            EN+L LMKE G+EPTVATMHML++S+G  GQPQ+AE VLN L+ SG NL+T+ YS+VIDA
Sbjct: 819  ENLLLLMKEDGIEPTVATMHMLLISYGDGGQPQQAEDVLNTLKVSGQNLTTLVYSAVIDA 878

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            Y KN +Y +GI KL EM +DGV PD  IWTCF+RAAS CQ   +A  LLN + D GFDLP
Sbjct: 879  YFKNKEYKMGITKLFEMNRDGVAPDHRIWTCFVRAASFCQETEDAISLLNCLHDIGFDLP 938

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            +RLL EK +SL  E+D LL+KL   E+N AFNFVN+LEDLLWAFE RATA+WVFQLAIRK
Sbjct: 939  LRLLTEKPESLFTELDNLLDKLSPEEDNAAFNFVNALEDLLWAFEHRATASWVFQLAIRK 998

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
            G+YRHDVFRVADKDWGADFRKLSAGAALVGLTLWLD+MQDASLQG PES KSV LITGTA
Sbjct: 999  GIYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDNMQDASLQGSPESQKSVALITGTA 1058

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSLD T+KAYLWEMGSPFLP K RSG+L+AKAHSLRMWLKDS FCMDLELKDA +L
Sbjct: 1059 EYNMVSLDNTIKAYLWEMGSPFLPCKTRSGVLVAKAHSLRMWLKDSSFCMDLELKDAPNL 1118

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            PKSNSM+L+EGYFMRA LVP FKDI ERLG+VRPKKFARLA
Sbjct: 1119 PKSNSMMLTEGYFMRATLVPAFKDILERLGKVRPKKFARLA 1159



 Score = 63.2 bits (152), Expect = 3e-07
 Identities = 31/112 (27%), Positives = 55/112 (49%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E V   + E G  P   T + L+ ++   G   +  +V +++ +SG     + Y+++I  
Sbjct: 122  ERVFLELGEKGFSPDAVTYNSLLYAYAVEGDVDKVRRVCDDMISSGFGKDEITYNTIIHM 181

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSM 688
            Y K GD N  +E   EMK+ G +PD + +T  I +      +SEA  ++  M
Sbjct: 182  YGKRGDVNFALELYGEMKEAGCKPDAVTYTVLIDSLGKSDRISEAGKVMEEM 233


>JAT59381.1 Pentatricopeptide repeat-containing protein At3g18110, chloroplastic
            [Anthurium amnicola]
          Length = 1493

 Score =  536 bits (1382), Expect = e-176
 Identities = 265/341 (77%), Positives = 305/341 (89%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E+VL LMKEAGVEPT+ATMH+LMVS+GTAG+PQEAE VLN+L++S L+LST+PY+S+IDA
Sbjct: 1096 EHVLFLMKEAGVEPTIATMHILMVSYGTAGRPQEAESVLNSLKSSSLDLSTLPYTSLIDA 1155

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGDYN+GI+K+ EMK DGVEPD  IWTCF+ AASLC   S+  +LLNS+ DTGFDLP
Sbjct: 1156 YLKNGDYNMGIKKMLEMKADGVEPDHRIWTCFVHAASLCHETSQGIMLLNSLCDTGFDLP 1215

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EKT+SL+ E+D  L++LGS +E+ +FNFVN+LEDLLWAFE RA+A+WVFQLA+ K
Sbjct: 1216 IRLLTEKTESLIIELDSFLDQLGS-QEDASFNFVNALEDLLWAFEHRASASWVFQLAVSK 1274

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
            G+YR +VFRVA+KDWGADFRKLSAGAALVGLTLWLDHMQDASL G+PES KSVVLITGTA
Sbjct: 1275 GIYRQNVFRVAEKDWGADFRKLSAGAALVGLTLWLDHMQDASLLGVPESPKSVVLITGTA 1334

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
             YNMVSL+ TLKAYLWEMGSPFLP K RSG+L+AKAHSLRMWLKDS FCMDLELKDA SL
Sbjct: 1335 LYNMVSLNNTLKAYLWEMGSPFLPCKTRSGLLVAKAHSLRMWLKDSSFCMDLELKDASSL 1394

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            PKSNS+ L +GYFMRAGLVP FKDIHERLGQV  KKFARLA
Sbjct: 1395 PKSNSIQLIDGYFMRAGLVPAFKDIHERLGQVGAKKFARLA 1435


>OAY44607.1 hypothetical protein MANES_08G165200 [Manihot esculenta]
          Length = 1480

 Score =  534 bits (1375), Expect = e-175
 Identities = 262/341 (76%), Positives = 303/341 (88%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E +L++MK+AGVEPT+ATMH+LMVS+G++GQPQEAEKVL NL+ SGL+LST+PYSSVIDA
Sbjct: 1075 EKLLSMMKDAGVEPTIATMHLLMVSYGSSGQPQEAEKVLTNLKESGLDLSTLPYSSVIDA 1134

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGDYN+GI+KL EMKK+GVEPD  IWTCF+RAASL Q+  EA +LLN+++D+GFDLP
Sbjct: 1135 YLKNGDYNVGIQKLMEMKKEGVEPDHRIWTCFVRAASLSQHTHEAIILLNALQDSGFDLP 1194

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL E+++SLV EVD  LE L   E+N AFNFVN+LEDLLWAFE RATA+WVFQLA+++
Sbjct: 1195 IRLLKERSESLVSEVDQCLEMLEDMEDNAAFNFVNALEDLLWAFELRATASWVFQLAVKR 1254

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +Y HDVFRVAD+DWGADFRKLS GAALV LTLWLDHMQDASLQG P S KSVVLITGTA
Sbjct: 1255 SIYSHDVFRVADQDWGADFRKLSGGAALVSLTLWLDHMQDASLQGYPASPKSVVLITGTA 1314

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSLDKTLKA LWEMGSPFLP K RSG+LIAKAHSLRMWLKDSPFC+DLELKD+ SL
Sbjct: 1315 EYNMVSLDKTLKACLWEMGSPFLPCKTRSGLLIAKAHSLRMWLKDSPFCLDLELKDSPSL 1374

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+SNSM L EG F+R GLVP FK+I E+LG VRPKKFA+LA
Sbjct: 1375 PESNSMQLIEGCFIRRGLVPAFKEITEKLGFVRPKKFAKLA 1415



 Score = 59.7 bits (143), Expect = 4e-06
 Identities = 30/122 (24%), Positives = 59/122 (48%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E +   ++  G  P   T + L+ +F   G   + ++V   +   G +   + Y+++I  
Sbjct: 377  EQLFKELESKGFYPDAVTYNSLLYAFAREGNVDKVKEVCEEMVNMGFSKDEMTYNTIIHM 436

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            Y K G ++L ++  ++MK  G  PD + +T  I +      M+EA  +++ M DTG    
Sbjct: 437  YGKQGQHDLALQLYNDMKLSGRTPDAITYTVLIDSLGKANKMAEAASVMSGMLDTGVKPT 496

Query: 663  IR 658
            +R
Sbjct: 497  LR 498


>XP_015878584.1 PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic [Ziziphus jujuba] XP_015878585.1 PREDICTED:
            pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic [Ziziphus jujuba] XP_015878586.1 PREDICTED:
            pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic [Ziziphus jujuba]
          Length = 1485

 Score =  531 bits (1367), Expect = e-174
 Identities = 258/341 (75%), Positives = 303/341 (88%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E +L +MKEAG+EP  ATMH+LMVS+G++GQPQEAE+VLNNL+ +GL L+T+PYSSVIDA
Sbjct: 1093 EMLLGVMKEAGIEPNFATMHLLMVSYGSSGQPQEAEEVLNNLKVTGLQLNTLPYSSVIDA 1152

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGDYN+GI+KL EMK+ G+EPD  IWTCF+RAASL Q+ SEA +LLN++RD GFDLP
Sbjct: 1153 YLKNGDYNIGIQKLKEMKQGGLEPDHRIWTCFVRAASLSQHTSEAIILLNALRDAGFDLP 1212

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EK+++L+ EV L LEKL   E+N AFNFVN+L+DLLWAFE RATA+WVFQLAI++
Sbjct: 1213 IRLLTEKSNALISEVGLCLEKLEPLEDNAAFNFVNALDDLLWAFELRATASWVFQLAIKR 1272

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
            G+YRHDVFRVA++DWGADFRKLSAG+ALV LTLWLDHMQDASLQG PES+KSVVLITGTA
Sbjct: 1273 GIYRHDVFRVAERDWGADFRKLSAGSALVALTLWLDHMQDASLQGYPESSKSVVLITGTA 1332

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYN VSL+ TLKA+LWEMGSPFLP   RSG+LIAKAHSLRMWLKDSPFC+DLELKD+ SL
Sbjct: 1333 EYNNVSLNSTLKAFLWEMGSPFLPCSTRSGLLIAKAHSLRMWLKDSPFCLDLELKDSPSL 1392

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+SNSM L +G F+R GLVP FKDI E+LG VRPKKFARLA
Sbjct: 1393 PESNSMQLIDGCFIRTGLVPAFKDITEKLGLVRPKKFARLA 1433


>KMZ57512.1 putative Pentatricopeptide repeat-containing protein [Zostera marina]
          Length = 1458

 Score =  528 bits (1361), Expect = e-173
 Identities = 263/342 (76%), Positives = 302/342 (88%), Gaps = 1/342 (0%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E+VL  MK++G++PT+ATMH+LM S+GTAG+P+EAE VLNNL  SGLNLST+PY SVID 
Sbjct: 1060 ESVLFQMKDSGLQPTIATMHILMDSYGTAGKPEEAENVLNNLIESGLNLSTLPYCSVIDG 1119

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGD N+ I+KL +MK DG EPD  IWTCFIR A LC   +EA LLLNS+ D+GFDLP
Sbjct: 1120 YLKNGDNNMAIKKLLDMKNDGTEPDHRIWTCFIRGARLCYQTNEAMLLLNSLSDSGFDLP 1179

Query: 663  IRLLMEKTD-SLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIR 487
            +RLL +KTD SLV E+D  L+K+GS E+N +FNFVN+LEDLLWAFE RATA+WVFQLAI+
Sbjct: 1180 MRLLTQKTDFSLVKELDNTLDKIGS-EDNGSFNFVNALEDLLWAFECRATASWVFQLAIK 1238

Query: 486  KGVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGT 307
            KG+YRHDV+RV DK+WGADFRKLSAGAALVGLTLWLDHMQDASLQG PES KSVVLITGT
Sbjct: 1239 KGIYRHDVYRVIDKNWGADFRKLSAGAALVGLTLWLDHMQDASLQGFPESPKSVVLITGT 1298

Query: 306  AEYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALS 127
            AEY+MVSL+KTLKAYLWEMGSPFLP K R+GIL+AKAHSLRMWLKDS FCMDLEL+DA S
Sbjct: 1299 AEYHMVSLEKTLKAYLWEMGSPFLPCKTRTGILVAKAHSLRMWLKDSSFCMDLELRDAPS 1358

Query: 126  LPKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            LP+ NS+ L+EGYFMRAGLVP FKDIHERLG+VRPKKFARLA
Sbjct: 1359 LPEFNSVQLNEGYFMRAGLVPAFKDIHERLGEVRPKKFARLA 1400


>XP_006491807.1 PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic isoform X1 [Citrus sinensis] XP_006491808.1
            PREDICTED: pentatricopeptide repeat-containing protein
            At3g18110, chloroplastic isoform X1 [Citrus sinensis]
            XP_006491809.1 PREDICTED: pentatricopeptide
            repeat-containing protein At3g18110, chloroplastic
            isoform X1 [Citrus sinensis] XP_006491810.1 PREDICTED:
            pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic isoform X1 [Citrus sinensis] XP_006491811.1
            PREDICTED: pentatricopeptide repeat-containing protein
            At3g18110, chloroplastic isoform X1 [Citrus sinensis]
            XP_015389910.1 PREDICTED: pentatricopeptide
            repeat-containing protein At3g18110, chloroplastic
            isoform X1 [Citrus sinensis]
          Length = 1459

 Score =  527 bits (1357), Expect = e-173
 Identities = 263/341 (77%), Positives = 299/341 (87%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            EN+L +MKE+GVEPT+ATMH+LMVS+ ++GQPQEAEKVL+NL+ + LNLST+PYSSVI A
Sbjct: 1066 ENLLNMMKESGVEPTIATMHLLMVSYSSSGQPQEAEKVLSNLKGTSLNLSTLPYSSVIAA 1125

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YL+NGD  +GI+KL EMK++G+EPD  IWTCF+RAASL Q  SEA +LLN++RD GFDLP
Sbjct: 1126 YLRNGDSAVGIQKLIEMKEEGIEPDHRIWTCFVRAASLSQCSSEAIILLNAIRDAGFDLP 1185

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EK+++LV EVD  LEKL   E+N AFNFVN+LEDLLWAFE RATA+WVFQLAI+ 
Sbjct: 1186 IRLLTEKSETLVAEVDHCLEKLKPMEDNAAFNFVNALEDLLWAFELRATASWVFQLAIKM 1245

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
            G+Y HDVFRVADKDWGADFRKLS GAALVGLTLWLDHMQDASLQG PES KSVVLITGTA
Sbjct: 1246 GIYHHDVFRVADKDWGADFRKLSGGAALVGLTLWLDHMQDASLQGCPESPKSVVLITGTA 1305

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSL+ TLKA LWEMGSPFLP K RSG+L+AKAHSLRMWLKDSPFC+DLELKDA SL
Sbjct: 1306 EYNMVSLNSTLKACLWEMGSPFLPCKTRSGLLVAKAHSLRMWLKDSPFCLDLELKDAPSL 1365

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+SNSM L  G F+R GLVP FKDI ERLG VRPKKFARLA
Sbjct: 1366 PESNSMQLIGGCFIRRGLVPAFKDITERLGIVRPKKFARLA 1406


>CAN76112.1 hypothetical protein VITISV_005527 [Vitis vinifera]
          Length = 1494

 Score =  526 bits (1356), Expect = e-172
 Identities = 265/361 (73%), Positives = 302/361 (83%), Gaps = 20/361 (5%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E +L +MKEAGVEPT+ATMH+LMVS+  +GQP+EAEKVL+NL+  GL LST+PYSSVIDA
Sbjct: 1079 EKLLGVMKEAGVEPTIATMHLLMVSYSGSGQPEEAEKVLDNLKVEGLPLSTLPYSSVIDA 1138

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGD+N+ I+KL EMKKDG+EPD  IWTCF+RAASL Q+ SEA +LL ++RDTGFDLP
Sbjct: 1139 YLKNGDHNVAIQKLMEMKKDGLEPDHRIWTCFVRAASLSQHTSEAIVLLKALRDTGFDLP 1198

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EK+DSLV EVD  LEKLG  E+N AFNFVN+LEDLLWAFE RATA+WVFQLA+++
Sbjct: 1199 IRLLTEKSDSLVSEVDNCLEKLGPLEDNAAFNFVNALEDLLWAFELRATASWVFQLAVKR 1258

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHM--------------------QD 364
             +YRHDVFRVA+KDWGADFRK+SAG+ALVGLTLWLDHM                    QD
Sbjct: 1259 SIYRHDVFRVAEKDWGADFRKMSAGSALVGLTLWLDHMQASFLITIFVQLMEEYFYFWQD 1318

Query: 363  ASLQGLPESAKSVVLITGTAEYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLR 184
            ASLQG P S KSVVLITGTAEYNMVSL+ TLKA+LWEMGSPFLP K RSG+L+AKAHSLR
Sbjct: 1319 ASLQGYPLSPKSVVLITGTAEYNMVSLNSTLKAFLWEMGSPFLPCKTRSGLLVAKAHSLR 1378

Query: 183  MWLKDSPFCMDLELKDALSLPKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARL 4
            MWLKDS FC+DLELKDA SLP+SNSM L EG F+R GLVP FKDI ERLG VRPKKFARL
Sbjct: 1379 MWLKDSSFCLDLELKDAPSLPESNSMQLMEGCFLRRGLVPAFKDITERLGDVRPKKFARL 1438

Query: 3    A 1
            A
Sbjct: 1439 A 1439


>XP_010103833.1 hypothetical protein L484_024135 [Morus notabilis] EXB97274.1
            hypothetical protein L484_024135 [Morus notabilis]
          Length = 1494

 Score =  524 bits (1349), Expect = e-171
 Identities = 252/341 (73%), Positives = 301/341 (88%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E ++ +MKEAG+EP  ATMH+LMVS+G +GQP EAEKVL +L+ +GLNL+T+PYSSVIDA
Sbjct: 1089 EMLVTMMKEAGMEPNFATMHLLMVSYGGSGQPGEAEKVLEDLKETGLNLNTLPYSSVIDA 1148

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            YLKNGDYN+ I+KL +M+K+G+EPD  IWTCFIRAASLCQ  SEA+ LLN++ DTGFDLP
Sbjct: 1149 YLKNGDYNVAIQKLKDMEKEGLEPDHRIWTCFIRAASLCQRTSEAFTLLNALSDTGFDLP 1208

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IR+L EK++SL+ EVD  LEKLG  E++ AFNFVN+LEDLLWAFE RATA+WV+QLAI++
Sbjct: 1209 IRILTEKSESLISEVDQCLEKLGPLEDDAAFNFVNALEDLLWAFEFRATASWVYQLAIKR 1268

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
            G+YRHD+FRVADKDWGADFRKLSAG+ALVGLTLWLDHMQDASLQG PES KSVVLITGT+
Sbjct: 1269 GIYRHDLFRVADKDWGADFRKLSAGSALVGLTLWLDHMQDASLQGYPESPKSVVLITGTS 1328

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYN +SL+ TLKA LWEMGSPFLP + R+G+L+AKAHSLR+WLKDSPFC+DLELKDA SL
Sbjct: 1329 EYNSISLNSTLKACLWEMGSPFLPCRTRTGLLVAKAHSLRLWLKDSPFCLDLELKDAPSL 1388

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+ NSM L EG F+R GLVP FK++ ERLG VRPKKF+RLA
Sbjct: 1389 PEYNSMQLMEGCFLRRGLVPAFKEVTERLGIVRPKKFSRLA 1429


>XP_012090946.1 PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic [Jatropha curcas] KDP21799.1 hypothetical
            protein JCGZ_00586 [Jatropha curcas]
          Length = 1454

 Score =  521 bits (1343), Expect = e-171
 Identities = 256/341 (75%), Positives = 298/341 (87%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E +L +MK +GVEPT+ATMH+LMVS+G++GQPQEAEKVL NL+ +GLNLST+PYSSVIDA
Sbjct: 1075 EKLLGMMKNSGVEPTIATMHLLMVSYGSSGQPQEAEKVLTNLKGAGLNLSTLPYSSVIDA 1134

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            Y +N DYN+GI+KL EMKK+G+EPD  IWTCFIRAASL Q+  EA  LLN+++D+GFDLP
Sbjct: 1135 YFRNRDYNVGIQKLEEMKKEGLEPDHRIWTCFIRAASLSQHTHEAINLLNALQDSGFDLP 1194

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL E+++SLV EVD  LE L + E+N AFNFVN+LEDLLWAFE RATA+WVF LA+++
Sbjct: 1195 IRLLTERSESLVSEVDHCLEMLETVEDNAAFNFVNALEDLLWAFELRATASWVFHLAVKR 1254

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +YRHDVFRVAD+DWGADFRKLS GAALVGLTLWLDHMQDASLQG P S KSVVLITGTA
Sbjct: 1255 SIYRHDVFRVADQDWGADFRKLSGGAALVGLTLWLDHMQDASLQGYPVSPKSVVLITGTA 1314

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSL+ TLKA LWEMGSPFLP K RSG+L+AKAHSLRMWLKDSPFC+DLELKDA SL
Sbjct: 1315 EYNMVSLNNTLKACLWEMGSPFLPCKTRSGLLVAKAHSLRMWLKDSPFCLDLELKDASSL 1374

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+SNSM L EG F+R GL P FK+I E+LG VRPKKFA+LA
Sbjct: 1375 PESNSMQLIEGCFIRRGLAPAFKEITEKLGFVRPKKFAKLA 1415


>XP_006372940.1 hypothetical protein POPTR_0017s06420g [Populus trichocarpa]
            XP_006372941.1 pentatricopeptide repeat-containing family
            protein [Populus trichocarpa] ERP50737.1 hypothetical
            protein POPTR_0017s06420g [Populus trichocarpa]
            ERP50738.1 pentatricopeptide repeat-containing family
            protein [Populus trichocarpa]
          Length = 1465

 Score =  520 bits (1340), Expect = e-170
 Identities = 255/341 (74%), Positives = 299/341 (87%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            + + ++MK+AGVEPT+ATMH+LMVS+G++GQPQEAEKVL+NL+ +  NLST+PYSSVIDA
Sbjct: 1084 QRLFSMMKDAGVEPTIATMHLLMVSYGSSGQPQEAEKVLSNLKETDANLSTLPYSSVIDA 1143

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            Y++NGDYN GI+KL ++K++G+EPD  IWTCFIRAASL Q+ SEA LLLN++RDTGFDLP
Sbjct: 1144 YVRNGDYNAGIQKLKQVKEEGLEPDHRIWTCFIRAASLSQHTSEAILLLNALRDTGFDLP 1203

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EK + LV  +DL LE L +  +N AFNFVN+LEDLLWAFE RATA+WVF LAI++
Sbjct: 1204 IRLLTEKPEPLVSALDLCLEMLETLGDNAAFNFVNALEDLLWAFELRATASWVFLLAIKR 1263

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +YRHDVFRVADKDWGADFRKLS GAALVGLTLWLDHMQDASLQG PES KSV LITGTA
Sbjct: 1264 KIYRHDVFRVADKDWGADFRKLSGGAALVGLTLWLDHMQDASLQGCPESPKSVALITGTA 1323

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSLD TLKA LWEMGSPFLP K RSG+LIAKAHSL+MWLKDSPFC+DLELK+A SL
Sbjct: 1324 EYNMVSLDSTLKACLWEMGSPFLPCKTRSGLLIAKAHSLKMWLKDSPFCLDLELKNAPSL 1383

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+SNSM L EG F+R GLVP FK+I+E+LG VRPKKFA+ A
Sbjct: 1384 PESNSMQLIEGCFIRRGLVPAFKEINEKLGFVRPKKFAKFA 1424


>XP_011026363.1 PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like isoform X1 [Populus euphratica]
            XP_011026364.1 PREDICTED: pentatricopeptide
            repeat-containing protein At3g18110, chloroplastic-like
            isoform X1 [Populus euphratica] XP_011026365.1 PREDICTED:
            pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like isoform X1 [Populus euphratica]
          Length = 1478

 Score =  520 bits (1340), Expect = e-170
 Identities = 256/341 (75%), Positives = 298/341 (87%)
 Frame = -2

Query: 1023 ENVLALMKEAGVEPTVATMHMLMVSFGTAGQPQEAEKVLNNLRTSGLNLSTVPYSSVIDA 844
            E + ++MK+AGVEPT+ATMH+LMVS+G++GQPQEAEKVL+NL+ +G NLST+PYSSVIDA
Sbjct: 1079 ERLFSMMKDAGVEPTIATMHLLMVSYGSSGQPQEAEKVLSNLKETGSNLSTLPYSSVIDA 1138

Query: 843  YLKNGDYNLGIEKLSEMKKDGVEPDCLIWTCFIRAASLCQNMSEAWLLLNSMRDTGFDLP 664
            Y +NGDYN+GI+KL +MKK+G+EPD  IWTCFIRAASL Q  S+A  LLN++RD  FDLP
Sbjct: 1139 YHRNGDYNIGIQKLIQMKKEGLEPDHRIWTCFIRAASLSQRTSDAIFLLNALRDAEFDLP 1198

Query: 663  IRLLMEKTDSLVPEVDLLLEKLGSFEENLAFNFVNSLEDLLWAFERRATAAWVFQLAIRK 484
            IRLL EK + LV  +D  LE L + E+N AFNFVN+LEDLLWAFE RATA+WVFQLAI+K
Sbjct: 1199 IRLLTEKPELLVSALDRCLEMLETLEDNAAFNFVNALEDLLWAFELRATASWVFQLAIKK 1258

Query: 483  GVYRHDVFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGLPESAKSVVLITGTA 304
             +YRHDVFRVADK+WGADFRKLS GAALVGLT WLDHMQDASLQG PES KSVVLITGTA
Sbjct: 1259 RIYRHDVFRVADKNWGADFRKLSGGAALVGLTFWLDHMQDASLQGCPESPKSVVLITGTA 1318

Query: 303  EYNMVSLDKTLKAYLWEMGSPFLPSKMRSGILIAKAHSLRMWLKDSPFCMDLELKDALSL 124
            EYNMVSLD TLKA LWEMGSPFLP K RSG+LIAKAHSLRMWLKDSPFC+DLELK+A SL
Sbjct: 1319 EYNMVSLDSTLKACLWEMGSPFLPCKSRSGLLIAKAHSLRMWLKDSPFCLDLELKNAPSL 1378

Query: 123  PKSNSMILSEGYFMRAGLVPVFKDIHERLGQVRPKKFARLA 1
            P+SNSM L EG F+R+GLVP FK+I+E++G VRPKKFA+ A
Sbjct: 1379 PESNSMQLIEGCFIRSGLVPAFKEINEKVGFVRPKKFAKFA 1419