BLASTX nr result

ID: Zingiber25_contig00020526 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00020526
         (1730 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006878653.1| hypothetical protein AMTR_s00011p00265800 [A...   528   e-147
ref|XP_002275236.2| PREDICTED: pentatricopeptide repeat-containi...   521   e-145
gb|EMT04799.1| hypothetical protein F775_19518 [Aegilops tauschii]    519   e-144
ref|XP_006656740.1| PREDICTED: pentatricopeptide repeat-containi...   519   e-144
gb|EOY10001.1| Pentatricopeptide repeat (PPR) superfamily protei...   517   e-144
ref|XP_002438011.1| hypothetical protein SORBIDRAFT_10g006490 [S...   516   e-143
ref|XP_004966645.1| PREDICTED: pentatricopeptide repeat-containi...   515   e-143
ref|NP_001057065.1| Os06g0199100 [Oryza sativa Japonica Group] g...   515   e-143
gb|EEE65260.1| hypothetical protein OsJ_20463 [Oryza sativa Japo...   515   e-143
emb|CBI26570.3| unnamed protein product [Vitis vinifera]              514   e-143
emb|CAN76112.1| hypothetical protein VITISV_005527 [Vitis vinifera]   509   e-141
ref|XP_003564143.1| PREDICTED: pentatricopeptide repeat-containi...   509   e-141
gb|AFW85425.1| chloroplast RNA splicing4 [Zea mays]                   507   e-141
ref|XP_004298102.1| PREDICTED: pentatricopeptide repeat-containi...   507   e-141
gb|EXB97274.1| hypothetical protein L484_024135 [Morus notabilis]     506   e-141
ref|XP_006347554.1| PREDICTED: pentatricopeptide repeat-containi...   502   e-139
ref|XP_006372940.1| hypothetical protein POPTR_0017s06420g [Popu...   499   e-138
ref|XP_006491807.1| PREDICTED: pentatricopeptide repeat-containi...   496   e-138
gb|EMJ07903.1| hypothetical protein PRUPE_ppa023974mg [Prunus pe...   496   e-137
ref|XP_002519997.1| pentatricopeptide repeat-containing protein,...   487   e-135

>ref|XP_006878653.1| hypothetical protein AMTR_s00011p00265800 [Amborella trichopoda]
            gi|548831996|gb|ERM94798.1| hypothetical protein
            AMTR_s00011p00265800 [Amborella trichopoda]
          Length = 1522

 Score =  528 bits (1361), Expect = e-147
 Identities = 261/402 (64%), Positives = 320/402 (79%), Gaps = 1/402 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            +H KAE+LL +MK+DG++P++ATMHLLM SYG AG P  AE VL  +K+SG  + T+PY 
Sbjct: 1118 SHEKAENLLVKMKDDGIEPSLATMHLLMDSYGQAGLPDGAENVLKGIKSSGLNVGTVPYV 1177

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVID Y K G+Y +GI K L+MK+DGV+PD+++WTCFIRAAS C+Q              
Sbjct: 1178 SVIDVYLKNGEYELGIEKMLQMKRDGVDPDYRVWTCFIRAASRCRQRNEALKLLNCLSDV 1237

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LPLR+L G  +  I+++++LLE+  SL EDA+F FVNA+EDLLWAFERRA ASW+FQ
Sbjct: 1238 GFDLPLRLLMGKSELLILEMDHLLEQLGSLEEDAAFRFVNALEDLLWAFERRAAASWVFQ 1297

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            +AI+KNIY HD+FRVAEK+WGADFRKLS GAALVGLTLWLDHMQDASLQG PESPKSV+L
Sbjct: 1298 MAIQKNIYPHDVFRVAEKNWGADFRKLSGGAALVGLTLWLDHMQDASLQGLPESPKSVVL 1357

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            ITGTAEYN VS+ KT+KA+LWEMGSPFLPS+TRTGI+VAKAHSLRMWLKDS FC+DLEL+
Sbjct: 1358 ITGTAEYNNVSISKTLKAFLWEMGSPFLPSKTRTGILVAKAHSLRMWLKDSAFCMDLELR 1417

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D SSLPE NSM+L EGYFMR GL+P FK I ERLG +RPK FARLALL EE RE+++  D
Sbjct: 1418 DASSLPELNSMQLNEGYFMRSGLVPVFKEIQERLGDVRPKTFARLALLCEEKRERVITAD 1477

Query: 649  LQGRKEKTEKMKAKG-IPRSRKLNRFQMKYLRRQHKSPAALN 527
            ++GRKEK EKMK +G + RS++  +F+    RR  KS A  +
Sbjct: 1478 IKGRKEKLEKMKRQGRMLRSQRRMKFRK---RRFFKSRATFS 1516


>ref|XP_002275236.2| PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like [Vitis vinifera]
          Length = 1442

 Score =  521 bits (1343), Expect = e-145
 Identities = 260/392 (66%), Positives = 313/392 (79%), Gaps = 1/392 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE LL  MKE GV+PTIATMHLLMVSY  +GQP+EAEKVL +LK  G  LSTLPY 
Sbjct: 1042 NHSKAEKLLGVMKEAGVEPTIATMHLLMVSYSGSGQPEEAEKVLDNLKVEGLPLSTLPYS 1101

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVID Y K GD+N+ I K +EMKKDG+EPDH+IWTCF+RAASL + T             
Sbjct: 1102 SVIDAYLKNGDHNVAIQKLMEMKKDGLEPDHRIWTCFVRAASLSQHTSEAIVLLKALRDT 1161

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S + +++  LE+   L ++A+FNFVNA+EDLLWAFE RATASW+FQ
Sbjct: 1162 GFDLPIRLLTEKSDSLVSEVDNCLEKLGPLEDNAAFNFVNALEDLLWAFELRATASWVFQ 1221

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA++++IYRHD+FRVAEKDWGADFRK+S G+ALVGLTLWLDHMQDASLQG P SPKSV+L
Sbjct: 1222 LAVKRSIYRHDVFRVAEKDWGADFRKMSAGSALVGLTLWLDHMQDASLQGYPLSPKSVVL 1281

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            ITGTAEYN+VSL  T+KA+LWEMGSPFLP +TR+G++VAKAHSLRMWLKDS FCLDLELK
Sbjct: 1282 ITGTAEYNMVSLNSTLKAFLWEMGSPFLPCKTRSGLLVAKAHSLRMWLKDSSFCLDLELK 1341

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  SLPE+NSM+L EG F+R GL+P FK I ERLG +RPKKFARLALL +E R+K++  D
Sbjct: 1342 DAPSLPESNSMQLMEGCFLRRGLVPAFKDITERLGDVRPKKFARLALLPDEKRDKVIRAD 1401

Query: 649  LQGRKEKTEKMKAK-GIPRSRKLNRFQMKYLR 557
            ++G KEK EKMK K G+ R RKL R   K++R
Sbjct: 1402 IEGGKEKLEKMKKKVGVKRRRKLVR--RKFIR 1431


>gb|EMT04799.1| hypothetical protein F775_19518 [Aegilops tauschii]
          Length = 1216

 Score =  519 bits (1337), Expect = e-144
 Identities = 254/395 (64%), Positives = 313/395 (79%), Gaps = 2/395 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE LL  MKEDG++PTIATMH+LM SYG+AG P EAEKVL SLK+S  E+S+LPY 
Sbjct: 815  NHSKAEQLLAAMKEDGIEPTIATMHILMTSYGTAGHPVEAEKVLNSLKSSSLEVSSLPYS 874

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            +V D Y K GDY++GI K LEMK+DG++PDH++WTCFIRAASLC++T             
Sbjct: 875  TVFDAYLKNGDYSLGITKLLEMKRDGIKPDHQVWTCFIRAASLCERTDDAILLLNSLQDC 934

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S + +++  LEE ++L + A+ NFVNA+EDLLWAFERRATAS+IFQ
Sbjct: 935  GFGLPIRLLTERTPSLLTEVDSFLEELEALEDSAALNFVNALEDLLWAFERRATASYIFQ 994

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA+ ++IYRH IFRV EKDWGADFRKLS GAALV LTLWLD MQDASLQGSPESPKS++L
Sbjct: 995  LAVNRSIYRHSIFRVIEKDWGADFRKLSAGAALVALTLWLDQMQDASLQGSPESPKSIVL 1054

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            +TG  EYN+VSL KT++AYL EMGSPFLP R RTG  VAK +SL+MWLKDS FC+DLELK
Sbjct: 1055 VTGEGEYNMVSLRKTIRAYLLEMGSPFLPCRARTGRFVAKFYSLKMWLKDSPFCMDLELK 1114

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  +LP+ NSMKLTEGYFMR GL+ TFK IHE+LG++RPKKF+RLA+LSEESR K +  D
Sbjct: 1115 DAPALPKMNSMKLTEGYFMRAGLVSTFKDIHEQLGEVRPKKFSRLAMLSEESRAKFIKAD 1174

Query: 649  LQGRKEKTEKMKAKG--IPRSRKLNRFQMKYLRRQ 551
            ++GRK+K E++K KG  IPR  K    + K++R Q
Sbjct: 1175 IKGRKDKLERIKEKGLVIPRKSKRGPRRAKFVREQ 1209


>ref|XP_006656740.1| PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like [Oryza brachyantha]
          Length = 1216

 Score =  519 bits (1336), Expect = e-144
 Identities = 256/396 (64%), Positives = 315/396 (79%), Gaps = 3/396 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE LL  MKEDG++PTIATMH+LM SYG++G P EAEKVL SLK+S  E+STLPY 
Sbjct: 815  NHSKAEHLLSAMKEDGIEPTIATMHILMTSYGTSGHPDEAEKVLNSLKSSNLEISTLPYS 874

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            +VID Y +  DYN+GI K LEMK+DGVEPDH++WTCFIRAASLC+QT             
Sbjct: 875  TVIDAYLRNHDYNLGITKLLEMKRDGVEPDHQVWTCFIRAASLCEQTDDAILLLKSLQDC 934

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S   +++  LEE  +L + A  NFVNA+EDLLWAFERRATASWIFQ
Sbjct: 935  GFDLPIRLLTERTSSLFTEVDSFLEELGALEDSAPLNFVNALEDLLWAFERRATASWIFQ 994

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA+ ++IY H+IFRV EKDWGAD RKLS GAALV LTLWLD MQDASLQG+P+SPKS++L
Sbjct: 995  LAVNRSIYHHNIFRVEEKDWGADLRKLSAGAALVALTLWLDQMQDASLQGAPDSPKSIVL 1054

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            +TG  EYN+VSL KT++AYL EMGSPFLP R+R+G  V KA+SL+MWLKDS FCLDLELK
Sbjct: 1055 VTGEGEYNMVSLHKTIRAYLLEMGSPFLPCRSRSGRFVVKAYSLKMWLKDSPFCLDLELK 1114

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  +LP+TNSMKLTEGYFMR GL+  FK IHERLG++ PKKF+RLALLSEESR++++  D
Sbjct: 1115 DAPALPKTNSMKLTEGYFMRAGLVSVFKDIHERLGEVWPKKFSRLALLSEESRDEVIKAD 1174

Query: 649  LQGRKEKTEKMKAKGI---PRSRKLNRFQMKYLRRQ 551
            +QGRKEK EKMK +G+    RS+K +R + K++++Q
Sbjct: 1175 IQGRKEKLEKMKRQGLAIAKRSKKGHR-RGKFIKQQ 1209


>gb|EOY10001.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 1458

 Score =  517 bits (1331), Expect = e-144
 Identities = 259/394 (65%), Positives = 312/394 (79%), Gaps = 2/394 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAESLL  MKE GV+PTIATMHLLMVSYGS+GQPQEAEKVLTSLK +G  L+TLPY 
Sbjct: 1063 NHSKAESLLSMMKEAGVEPTIATMHLLMVSYGSSGQPQEAEKVLTSLKETGLNLTTLPYS 1122

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVI+ Y + GDYN+GI K +EMKK+G+  DH+IWTCFIRAASL   T             
Sbjct: 1123 SVINAYLRNGDYNVGIQKLMEMKKEGLAVDHRIWTCFIRAASLSNHTSEAIILLNALRDA 1182

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+++   +  + ++E  LE+ + +G+DA+FNFVNA+EDLLWAFE RATASW+FQ
Sbjct: 1183 GFDLPIRLMTEKSELLLSEVESCLEKLEPIGDDAAFNFVNALEDLLWAFELRATASWVFQ 1242

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA++K IY H +FRVA+KDWGADFRKLS G+ALV LTLWLD MQDA+LQG PESPKSV+L
Sbjct: 1243 LAVKKTIYHHHVFRVADKDWGADFRKLSAGSALVALTLWLDRMQDAALQGYPESPKSVVL 1302

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            ITGTAEYN+VSL  T+KA LWEMGSPFLP +TR+G++VAKAHSLRMWLKDS FCLDLELK
Sbjct: 1303 ITGTAEYNMVSLNYTLKACLWEMGSPFLPCKTRSGLLVAKAHSLRMWLKDSPFCLDLELK 1362

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  SLPE NSM+L EG FMR GL+P FK I ERLG +RPKKFARLALLS++ REK +  D
Sbjct: 1363 DAPSLPELNSMQLVEGCFMRRGLVPAFKDITERLGLVRPKKFARLALLSDDRREKAIQAD 1422

Query: 649  LQGRKEKTEKMKAK-GIPRSRKLNRF-QMKYLRR 554
            +QG KEK EK+K K G   +R + +  + K++RR
Sbjct: 1423 IQGGKEKLEKLKTKVGYKGARNIKKLRKRKFIRR 1456



 Score = 60.1 bits (144), Expect = 3e-06
 Identities = 29/97 (29%), Positives = 50/97 (51%)
 Frame = -2

Query: 1720 KAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYCSVI 1541
            +A +++ +M + GVKPT+ T   L+  Y  AG   EAE+    ++ SG  L  L Y  ++
Sbjct: 474  EASNVMSEMLDVGVKPTVRTYSALICGYAKAGMAVEAEETFNCMRRSGIRLDFLAYSVML 533

Query: 1540 DGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRA 1430
            D   +       +  + EM +DG  PDH ++   ++A
Sbjct: 534  DILLRCNKTTKALLLYREMVRDGFTPDHTLYEVMLQA 570


>ref|XP_002438011.1| hypothetical protein SORBIDRAFT_10g006490 [Sorghum bicolor]
            gi|241916234|gb|EER89378.1| hypothetical protein
            SORBIDRAFT_10g006490 [Sorghum bicolor]
          Length = 1443

 Score =  516 bits (1328), Expect = e-143
 Identities = 255/398 (64%), Positives = 314/398 (78%), Gaps = 2/398 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE+LL  MKEDG++PTIATMH+LM SYG+AGQP EAE VL SLK+S  E+STLPY 
Sbjct: 1040 NHSKAENLLAVMKEDGIEPTIATMHILMTSYGTAGQPHEAENVLNSLKSSSLEVSTLPYS 1099

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            +V D Y K GDY++GI K LEMK+DGVEPDH++WTCFIRAASLC+QT             
Sbjct: 1100 TVFDAYLKNGDYDLGIKKLLEMKRDGVEPDHQVWTCFIRAASLCEQTADAILLLKSLQDC 1159

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S + ++   LEE ++L + A+ NFVNA+EDLLWAFE RATAS IFQ
Sbjct: 1160 GFDLPIRLLTERTPSLLSEIANYLEELEALEDSAALNFVNAVEDLLWAFECRATASRIFQ 1219

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA+ ++IYR ++FRVA+KDWGADFRKLS GAALVGLTLWLDHMQDASLQGSPESPKSV+L
Sbjct: 1220 LAVERSIYRDNVFRVAQKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGSPESPKSVVL 1279

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            +TG  EYN+VSL KT++AYL EMGSPFLP R R+G  V KA+SL+MWLKDS FC+DLELK
Sbjct: 1280 VTGEGEYNMVSLRKTIRAYLLEMGSPFLPCRARSGRFVVKAYSLKMWLKDSPFCMDLELK 1339

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  +LP+ NSMKL EGYFMR GL+  FK IHE+LG++ PKKF+RLALLSEE R++++  D
Sbjct: 1340 DAPALPKLNSMKLIEGYFMRAGLVSAFKDIHEKLGEVWPKKFSRLALLSEECRDEVIKAD 1399

Query: 649  LQGRKEKTEKMKAKGIPRSRKLNRFQM--KYLRRQHKS 542
            ++GRKEK E+MK KGI  +RK  R     K++R Q ++
Sbjct: 1400 IKGRKEKLERMKKKGIVTARKSKRRAQRGKFVREQEQN 1437


>ref|XP_004966645.1| PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like [Setaria italica]
          Length = 1410

 Score =  515 bits (1326), Expect = e-143
 Identities = 252/403 (62%), Positives = 316/403 (78%), Gaps = 2/403 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE+LL  M+EDG++PTIATMH+LM SYG+AG P+EAE VL SLK+S  E+STLPY 
Sbjct: 1007 NHSKAENLLSVMREDGIEPTIATMHILMTSYGTAGHPREAENVLNSLKSSSLEVSTLPYS 1066

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            +V D Y K GDY +GI K LEMK+DGVEPDH++WTCFIRAASLC+QT             
Sbjct: 1067 TVFDAYLKNGDYELGITKLLEMKRDGVEPDHQVWTCFIRAASLCEQTDDAILLLNSLKDC 1126

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S + ++   LEE ++L + A+ NFVNA+EDLLWAFE RATASWIFQ
Sbjct: 1127 GFELPIRLLTERTPSVLSEVANYLEELEALEDSAALNFVNALEDLLWAFECRATASWIFQ 1186

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA+++NIYR ++FRV EKDWGADFRKLS GAALVGLTLWLDHMQDASLQGSPESPKS+++
Sbjct: 1187 LAVKRNIYRDNVFRVVEKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGSPESPKSIVM 1246

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            +TG  EYN+VSL KT++AYL EMGSPFLP R R+G  V KA+SL+MWLKDS FC+DLELK
Sbjct: 1247 VTGEGEYNMVSLRKTIRAYLLEMGSPFLPCRVRSGRFVVKAYSLKMWLKDSPFCMDLELK 1306

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  +LP+ NSMKL +GYFMR GL+  FK IHERLG++ PKKF++LALLSEESR++ +  D
Sbjct: 1307 DVPALPKLNSMKLIDGYFMRAGLVSAFKDIHERLGEVWPKKFSKLALLSEESRDEAIKAD 1366

Query: 649  LQGRKEKTEKMKAKGIPRSRKLNR--FQMKYLRRQHKSPAALN 527
            +QGRKEK ++MK K +  +R   R   + K++R Q +S  A++
Sbjct: 1367 IQGRKEKLDRMKKKDLVTARNSKRRPQRAKFVREQGQSMKAVS 1409


>ref|NP_001057065.1| Os06g0199100 [Oryza sativa Japonica Group]
            gi|51091829|dbj|BAD36643.1| putative PPR protein [Oryza
            sativa Japonica Group] gi|113595105|dbj|BAF18979.1|
            Os06g0199100 [Oryza sativa Japonica Group]
          Length = 1283

 Score =  515 bits (1326), Expect = e-143
 Identities = 249/384 (64%), Positives = 308/384 (80%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE LL  MKEDG++PTIATMH+LM SYG++G P EAEKVL SLK+S  E+STLPY 
Sbjct: 882  NHSKAEHLLSAMKEDGIEPTIATMHILMTSYGTSGHPDEAEKVLNSLKSSNLEISTLPYS 941

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            +V+D Y +  DY++GI K LEMK+DGVEPDH++WT FIRAASLC+QT             
Sbjct: 942  TVLDAYLRNRDYSLGITKLLEMKRDGVEPDHQVWTSFIRAASLCEQTDDAILLLKSLQDC 1001

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S   +++  LE+  +L + AS NFVNA+EDLLWAFERRATASWIFQ
Sbjct: 1002 GFDLPIRLLTERTSSLFTEVDSFLEKLGTLEDSASLNFVNALEDLLWAFERRATASWIFQ 1061

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA++++IY H+IFRV EKDWGAD RKLS GAALV LTLWLD MQDASLQG+PESPKS++L
Sbjct: 1062 LAVKRSIYHHNIFRVEEKDWGADLRKLSAGAALVALTLWLDQMQDASLQGAPESPKSIVL 1121

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            +TG  EYN+VSL KT++AYL EMGSPFLP R+R+G  V KA+SL+MWLKDS FCLDLELK
Sbjct: 1122 VTGEGEYNMVSLRKTIRAYLLEMGSPFLPCRSRSGRFVVKAYSLKMWLKDSPFCLDLELK 1181

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  +LP+TNSMKLTEGYFMR GL+P FK IHERLG++ PKKF+RLALLSEESR++++  D
Sbjct: 1182 DAPALPKTNSMKLTEGYFMRAGLVPVFKDIHERLGEVWPKKFSRLALLSEESRDEVIKAD 1241

Query: 649  LQGRKEKTEKMKAKGIPRSRKLNR 578
            ++GRKEK EKMK +G+  +++  R
Sbjct: 1242 IKGRKEKLEKMKKQGLAIAKRSKR 1265


>gb|EEE65260.1| hypothetical protein OsJ_20463 [Oryza sativa Japonica Group]
          Length = 1443

 Score =  515 bits (1326), Expect = e-143
 Identities = 249/384 (64%), Positives = 308/384 (80%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE LL  MKEDG++PTIATMH+LM SYG++G P EAEKVL SLK+S  E+STLPY 
Sbjct: 1042 NHSKAEHLLSAMKEDGIEPTIATMHILMTSYGTSGHPDEAEKVLNSLKSSNLEISTLPYS 1101

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            +V+D Y +  DY++GI K LEMK+DGVEPDH++WT FIRAASLC+QT             
Sbjct: 1102 TVLDAYLRNRDYSLGITKLLEMKRDGVEPDHQVWTSFIRAASLCEQTDDAILLLKSLQDC 1161

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S   +++  LE+  +L + AS NFVNA+EDLLWAFERRATASWIFQ
Sbjct: 1162 GFDLPIRLLTERTSSLFTEVDSFLEKLGTLEDSASLNFVNALEDLLWAFERRATASWIFQ 1221

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA++++IY H+IFRV EKDWGAD RKLS GAALV LTLWLD MQDASLQG+PESPKS++L
Sbjct: 1222 LAVKRSIYHHNIFRVEEKDWGADLRKLSAGAALVALTLWLDQMQDASLQGAPESPKSIVL 1281

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            +TG  EYN+VSL KT++AYL EMGSPFLP R+R+G  V KA+SL+MWLKDS FCLDLELK
Sbjct: 1282 VTGEGEYNMVSLRKTIRAYLLEMGSPFLPCRSRSGRFVVKAYSLKMWLKDSPFCLDLELK 1341

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  +LP+TNSMKLTEGYFMR GL+P FK IHERLG++ PKKF+RLALLSEESR++++  D
Sbjct: 1342 DAPALPKTNSMKLTEGYFMRAGLVPVFKDIHERLGEVWPKKFSRLALLSEESRDEVIKAD 1401

Query: 649  LQGRKEKTEKMKAKGIPRSRKLNR 578
            ++GRKEK EKMK +G+  +++  R
Sbjct: 1402 IKGRKEKLEKMKKQGLAIAKRSKR 1425


>emb|CBI26570.3| unnamed protein product [Vitis vinifera]
          Length = 1042

 Score =  514 bits (1324), Expect = e-143
 Identities = 260/400 (65%), Positives = 313/400 (78%), Gaps = 9/400 (2%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE LL  MKE GV+PTIATMHLLMVSY  +GQP+EAEKVL +LK  G  LSTLPY 
Sbjct: 624  NHSKAEKLLGVMKEAGVEPTIATMHLLMVSYSGSGQPEEAEKVLDNLKVEGLPLSTLPYS 683

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVID Y K GD+N+ I K +EMKKDG+EPDH+IWTCF+RAASL + T             
Sbjct: 684  SVIDAYLKNGDHNVAIQKLMEMKKDGLEPDHRIWTCFVRAASLSQHTSEAIVLLKALRDT 743

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S + +++  LE+   L ++A+FNFVNA+EDLLWAFE RATASW+FQ
Sbjct: 744  GFDLPIRLLTEKSDSLVSEVDNCLEKLGPLEDNAAFNFVNALEDLLWAFELRATASWVFQ 803

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQ--------DASLQGSP 1034
            LA++++IYRHD+FRVAEKDWGADFRK+S G+ALVGLTLWLDHMQ        DASLQG P
Sbjct: 804  LAVKRSIYRHDVFRVAEKDWGADFRKMSAGSALVGLTLWLDHMQAKYFYFWQDASLQGYP 863

Query: 1033 ESPKSVLLITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSC 854
             SPKSV+LITGTAEYN+VSL  T+KA+LWEMGSPFLP +TR+G++VAKAHSLRMWLKDS 
Sbjct: 864  LSPKSVVLITGTAEYNMVSLNSTLKAFLWEMGSPFLPCKTRSGLLVAKAHSLRMWLKDSS 923

Query: 853  FCLDLELKDGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEES 674
            FCLDLELKD  SLPE+NSM+L EG F+R GL+P FK I ERLG +RPKKFARLALL +E 
Sbjct: 924  FCLDLELKDAPSLPESNSMQLMEGCFLRRGLVPAFKDITERLGDVRPKKFARLALLPDEK 983

Query: 673  REKIMNTDLQGRKEKTEKMKAK-GIPRSRKLNRFQMKYLR 557
            R+K++  D++G KEK EKMK K G+ R RKL R   K++R
Sbjct: 984  RDKVIRADIEGGKEKLEKMKKKVGVKRRRKLVR--RKFIR 1021


>emb|CAN76112.1| hypothetical protein VITISV_005527 [Vitis vinifera]
          Length = 1494

 Score =  509 bits (1312), Expect = e-141
 Identities = 260/412 (63%), Positives = 313/412 (75%), Gaps = 21/412 (5%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE LL  MKE GV+PTIATMHLLMVSY  +GQP+EAEKVL +LK  G  LSTLPY 
Sbjct: 1074 NHSKAEKLLGVMKEAGVEPTIATMHLLMVSYSGSGQPEEAEKVLDNLKVEGLPLSTLPYS 1133

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVID Y K GD+N+ I K +EMKKDG+EPDH+IWTCF+RAASL + T             
Sbjct: 1134 SVIDAYLKNGDHNVAIQKLMEMKKDGLEPDHRIWTCFVRAASLSQHTSEAIVLLKALRDT 1193

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S + +++  LE+   L ++A+FNFVNA+EDLLWAFE RATASW+FQ
Sbjct: 1194 GFDLPIRLLTEKSDSLVSEVDNCLEKLGPLEDNAAFNFVNALEDLLWAFELRATASWVFQ 1253

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQ---------------- 1058
            LA++++IYRHD+FRVAEKDWGADFRK+S G+ALVGLTLWLDHMQ                
Sbjct: 1254 LAVKRSIYRHDVFRVAEKDWGADFRKMSAGSALVGLTLWLDHMQASFLITIFVQLMEEYF 1313

Query: 1057 ----DASLQGSPESPKSVLLITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAK 890
                DASLQG P SPKSV+LITGTAEYN+VSL  T+KA+LWEMGSPFLP +TR+G++VAK
Sbjct: 1314 YFWQDASLQGYPLSPKSVVLITGTAEYNMVSLNSTLKAFLWEMGSPFLPCKTRSGLLVAK 1373

Query: 889  AHSLRMWLKDSCFCLDLELKDGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPK 710
            AHSLRMWLKDS FCLDLELKD  SLPE+NSM+L EG F+R GL+P FK I ERLG +RPK
Sbjct: 1374 AHSLRMWLKDSSFCLDLELKDAPSLPESNSMQLMEGCFLRRGLVPAFKDITERLGDVRPK 1433

Query: 709  KFARLALLSEESREKIMNTDLQGRKEKTEKMKAK-GIPRSRKLNRFQMKYLR 557
            KFARLALL +E R+K++  D++G KEK EKMK K G+ R RKL R   K++R
Sbjct: 1434 KFARLALLPDEKRDKVIRADIEGGKEKLEKMKKKVGVKRRRKLVR--RKFIR 1483


>ref|XP_003564143.1| PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like [Brachypodium distachyon]
          Length = 1285

 Score =  509 bits (1311), Expect = e-141
 Identities = 249/395 (63%), Positives = 308/395 (77%), Gaps = 2/395 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE LL  MKEDG++PTIATMH+LM SYG+AG P EAEKVL SLK+S  E+STLPY 
Sbjct: 884  NHSKAEQLLASMKEDGIEPTIATMHILMTSYGTAGHPDEAEKVLNSLKSSSLEVSTLPYS 943

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            +V + Y K GDYN+GI K LEMK DGV+PDH++WTCFIRAASLC++T             
Sbjct: 944  TVFNAYLKNGDYNLGITKLLEMKADGVKPDHQVWTCFIRAASLCERTADAILLLNSLRDC 1003

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S + ++   LEE  +L + A+ NFVNA+EDLLWAFE RATAS++FQ
Sbjct: 1004 EFDLPIRLLTERTSSLLTEVSNFLEELDALEDSAALNFVNALEDLLWAFECRATASYVFQ 1063

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA+ K+IYRH++FRV EKDWGADFRKLS GAAL  LTLWLD MQDASLQGSPESPKS++L
Sbjct: 1064 LAVDKSIYRHNVFRVVEKDWGADFRKLSAGAALTALTLWLDQMQDASLQGSPESPKSIVL 1123

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            +TG  EYN+VSL KT++AYL EMGSPFLP +TR+G  VAK++SL+MWLKDS FC+DLELK
Sbjct: 1124 VTGEGEYNMVSLRKTIRAYLLEMGSPFLPCKTRSGRFVAKSYSLKMWLKDSAFCMDLELK 1183

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D   LP+ NSMKLT+GYFMR GL+ TF  IHE+LG++ PKKF+RLALLSEESR  ++  D
Sbjct: 1184 DAPDLPKMNSMKLTDGYFMRAGLVSTFNDIHEQLGEVWPKKFSRLALLSEESRATVIKAD 1243

Query: 649  LQGRKEKTEKMKAKGIPRSRKLNR--FQMKYLRRQ 551
            +QGRKEK  KMK +G+  SR+  +   + K++R Q
Sbjct: 1244 IQGRKEKLAKMKTQGLVISRRSQKRPRRAKFVREQ 1278


>gb|AFW85425.1| chloroplast RNA splicing4 [Zea mays]
          Length = 1435

 Score =  507 bits (1306), Expect = e-141
 Identities = 249/381 (65%), Positives = 302/381 (79%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE+LL  MKEDG++PTIATMH+LM SYG+AGQP+EAE VL +LK+S  E+STLPY 
Sbjct: 1040 NHSKAENLLAVMKEDGIEPTIATMHILMTSYGTAGQPREAENVLNNLKSSSLEVSTLPYS 1099

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            +V D Y K GDYN G  K LEMK+DGVEPDH++WTCFIRAASLC+QT             
Sbjct: 1100 TVFDAYLKNGDYNHGTTKLLEMKRDGVEPDHQVWTCFIRAASLCEQTADAILLLKSLQDC 1159

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+    S + ++   LEE ++L + A+ NFVNA+EDLLWAFE RATAS IFQ
Sbjct: 1160 GFDLPIRLLTERTPSLLSEIANYLEELEALEDSAALNFVNAVEDLLWAFECRATASRIFQ 1219

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA+ ++IYR ++FRVA+KDWGADFRKLS GAALVGLTLWLDHMQDASLQGSPESPKSV+L
Sbjct: 1220 LAVERSIYRDNVFRVAQKDWGADFRKLSAGAALVGLTLWLDHMQDASLQGSPESPKSVVL 1279

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            +TG  EYN+VSL KT++AYL EMGSPFLP R R+G  V K +SL+MWLKDS FC+DLELK
Sbjct: 1280 VTGEGEYNMVSLRKTIRAYLLEMGSPFLPCRARSGRFVVKDYSLKMWLKDSPFCMDLELK 1339

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  + P+ NSMKL EGYFMR G +  FK IHERLG++ PKKF+RLALLSEE R++++  D
Sbjct: 1340 DAPAHPKLNSMKLIEGYFMRAGFVSAFKDIHERLGEVWPKKFSRLALLSEECRDEVIKAD 1399

Query: 649  LQGRKEKTEKMKAKGIPRSRK 587
            +QGRKEK E+MK KGI  +RK
Sbjct: 1400 IQGRKEKLERMKKKGIVTARK 1420


>ref|XP_004298102.1| PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 1496

 Score =  507 bits (1305), Expect = e-141
 Identities = 249/400 (62%), Positives = 312/400 (78%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NH+KAE LL  MKE G++P  ATMHLLMVSYGS+GQP+EAEKVL +LK +   L TLPY 
Sbjct: 1084 NHAKAEMLLSVMKEAGIEPNFATMHLLMVSYGSSGQPEEAEKVLDNLKVTDSYLGTLPYS 1143

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVID Y + GDYN GI K  EMK+DG EPDH+IWTCFIRAASL +QT             
Sbjct: 1144 SVIDAYLRNGDYNTGIQKLNEMKRDGPEPDHRIWTCFIRAASLSQQTSEVFVLLNALRDA 1203

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R++    +S I D++  LE+   L ++A+FNFVNA+ DLLWA+E RATASW+FQ
Sbjct: 1204 GFDLPIRLMKEKSESLIPDVDQCLEKLAPLDDNAAFNFVNALGDLLWAYELRATASWVFQ 1263

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA+++ IY HD+FRVA+KDWGADFRKLS G+ALVGLTLWLD MQDASL+G PESPKSV+L
Sbjct: 1264 LAVKRGIYNHDVFRVADKDWGADFRKLSAGSALVGLTLWLDQMQDASLEGFPESPKSVVL 1323

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            ITGT+EYN+VSL  T+K  LWE+GSPFLP +TR+G++VAKAHSLRMWLKDS FCLDLELK
Sbjct: 1324 ITGTSEYNMVSLNSTLKTCLWEIGSPFLPCKTRSGLLVAKAHSLRMWLKDSPFCLDLELK 1383

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  +LPE+NSM+L +G F+R GL+P FK I+E+L  +RPKKFARLALLS+E RE+++  D
Sbjct: 1384 DAPALPESNSMQLIDGCFLRRGLVPAFKEINEKLELVRPKKFARLALLSDEKRERVIQAD 1443

Query: 649  LQGRKEKTEKMKAKGIPRSRKLNRFQMKYLRRQHKSPAAL 530
            ++GRKEK EKM+ +G    R++NR + K  +R ++ PA L
Sbjct: 1444 IEGRKEKLEKMRKRGNVDPRRVNRIK-KLRKRTYRRPAML 1482


>gb|EXB97274.1| hypothetical protein L484_024135 [Morus notabilis]
          Length = 1494

 Score =  506 bits (1304), Expect = e-141
 Identities = 250/385 (64%), Positives = 304/385 (78%), Gaps = 1/385 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            N SKAE L+  MKE G++P  ATMHLLMVSYG +GQP EAEKVL  LK +G  L+TLPY 
Sbjct: 1084 NPSKAEMLVTMMKEAGMEPNFATMHLLMVSYGGSGQPGEAEKVLEDLKETGLNLNTLPYS 1143

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVID Y K GDYN+ I K  +M+K+G+EPDH+IWTCFIRAASLC++T             
Sbjct: 1144 SVIDAYLKNGDYNVAIQKLKDMEKEGLEPDHRIWTCFIRAASLCQRTSEAFTLLNALSDT 1203

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+RIL+   +S I +++  LE+   L +DA+FNFVNA+EDLLWAFE RATASW++Q
Sbjct: 1204 GFDLPIRILTEKSESLISEVDQCLEKLGPLEDDAAFNFVNALEDLLWAFEFRATASWVYQ 1263

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LAI++ IYRHD+FRVA+KDWGADFRKLS G+ALVGLTLWLDHMQDASLQG PESPKSV+L
Sbjct: 1264 LAIKRGIYRHDLFRVADKDWGADFRKLSAGSALVGLTLWLDHMQDASLQGYPESPKSVVL 1323

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            ITGT+EYN +SL  T+KA LWEMGSPFLP RTRTG++VAKAHSLR+WLKDS FCLDLELK
Sbjct: 1324 ITGTSEYNSISLNSTLKACLWEMGSPFLPCRTRTGLLVAKAHSLRLWLKDSPFCLDLELK 1383

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  SLPE NSM+L EG F+R GL+P FK + ERLG +RPKKF+RLA+LS+E R K +  D
Sbjct: 1384 DAPSLPEYNSMQLMEGCFLRRGLVPAFKEVTERLGIVRPKKFSRLAMLSDEKRTKAIEAD 1443

Query: 649  LQGRKEKTEKMKAK-GIPRSRKLNR 578
            ++GRK+K EK+K   G+ R RK+ +
Sbjct: 1444 IEGRKQKLEKIKKNGGLGRMRKIKK 1468


>ref|XP_006347554.1| PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like isoform X1 [Solanum tuberosum]
          Length = 1476

 Score =  502 bits (1293), Expect = e-139
 Identities = 248/392 (63%), Positives = 306/392 (78%), Gaps = 1/392 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE L+ +MKE G++P+ ATMHLLM SYG++G P EAEKVL SLK++G  LSTL Y 
Sbjct: 1079 NHSKAEKLIEKMKESGIEPSDATMHLLMTSYGTSGHPMEAEKVLNSLKSNGVNLSTLQYG 1138

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVID Y K  DY+ G+ K  EM  +G+EPDH+IWTCFIRAASLC+               
Sbjct: 1139 SVIDAYLKSRDYDTGLLKLKEMIGEGLEPDHRIWTCFIRAASLCEYITEAKTLLNAVADA 1198

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R L+ N +S ++DL+  LE+ ++  + A+ NFVNA+EDLLWAFE RATASW+FQ
Sbjct: 1199 GFNLPIRFLTENSESLVLDLDLYLEQIETAEDKAALNFVNALEDLLWAFELRATASWVFQ 1258

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LAI+++IY +DIFRVA+KDWGADFRKLS GAALVGLTLWLDHMQDASL+G PESPKSV+L
Sbjct: 1259 LAIKRSIYHNDIFRVADKDWGADFRKLSAGAALVGLTLWLDHMQDASLEGFPESPKSVVL 1318

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            ITG ++YN VSL  TVKAYLWEMGSPFLP +TRTGI+VAKAHSLRMWLKDS FCLDLELK
Sbjct: 1319 ITGKSDYNRVSLNSTVKAYLWEMGSPFLPCKTRTGILVAKAHSLRMWLKDSPFCLDLELK 1378

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            +  SLPE NSM+L EG F+R GL+P FK I+ERLG + P+KFARLALLS E REK++  D
Sbjct: 1379 NRPSLPEMNSMQLIEGCFIRRGLVPAFKEINERLGPVNPRKFARLALLSNEKREKVIQAD 1438

Query: 649  LQGRKEKTEKMKAKGIPRSRKLNRFQM-KYLR 557
            ++GR+EK  K+K+  + + R    F+M K++R
Sbjct: 1439 IEGRREKLAKLKSTAVTKRRNTKSFRMNKFVR 1470


>ref|XP_006372940.1| hypothetical protein POPTR_0017s06420g [Populus trichocarpa]
            gi|566211778|ref|XP_006372941.1| pentatricopeptide
            repeat-containing family protein [Populus trichocarpa]
            gi|550319588|gb|ERP50737.1| hypothetical protein
            POPTR_0017s06420g [Populus trichocarpa]
            gi|550319589|gb|ERP50738.1| pentatricopeptide
            repeat-containing family protein [Populus trichocarpa]
          Length = 1465

 Score =  499 bits (1285), Expect = e-138
 Identities = 244/374 (65%), Positives = 301/374 (80%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            +HSKA+ L   MK+ GV+PTIATMHLLMVSYGS+GQPQEAEKVL++LK +   LSTLPY 
Sbjct: 1079 SHSKAQRLFSMMKDAGVEPTIATMHLLMVSYGSSGQPQEAEKVLSNLKETDANLSTLPYS 1138

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVID Y + GDYN GI K  ++K++G+EPDH+IWTCFIRAASL + T             
Sbjct: 1139 SVIDAYVRNGDYNAGIQKLKQVKEEGLEPDHRIWTCFIRAASLSQHTSEAILLLNALRDT 1198

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+L+   +  +  L+  LE  ++LG++A+FNFVNA+EDLLWAFE RATASW+F 
Sbjct: 1199 GFDLPIRLLTEKPEPLVSALDLCLEMLETLGDNAAFNFVNALEDLLWAFELRATASWVFL 1258

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LAI++ IYRHD+FRVA+KDWGADFRKLS GAALVGLTLWLDHMQDASLQG PESPKSV L
Sbjct: 1259 LAIKRKIYRHDVFRVADKDWGADFRKLSGGAALVGLTLWLDHMQDASLQGCPESPKSVAL 1318

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            ITGTAEYN+VSL+ T+KA LWEMGSPFLP +TR+G+++AKAHSL+MWLKDS FCLDLELK
Sbjct: 1319 ITGTAEYNMVSLDSTLKACLWEMGSPFLPCKTRSGLLIAKAHSLKMWLKDSPFCLDLELK 1378

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            +  SLPE+NSM+L EG F+R GL+P FK I+E+LG +RPKKFA+ ALLS++ REK +   
Sbjct: 1379 NAPSLPESNSMQLIEGCFIRRGLVPAFKEINEKLGFVRPKKFAKFALLSDDRREKAIQVF 1438

Query: 649  LQGRKEKTEKMKAK 608
            ++G KEK EKMK +
Sbjct: 1439 IEGGKEKKEKMKKR 1452



 Score = 59.3 bits (142), Expect = 5e-06
 Identities = 28/96 (29%), Positives = 49/96 (51%)
 Frame = -2

Query: 1720 KAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYCSVI 1541
            +A  ++ +M   GVKPT+ T   L+  Y  AG+P EAE+    +  SG     L Y  ++
Sbjct: 490  EAAGMMSEMLNTGVKPTLRTYSALICGYAKAGKPVEAEETFDCMLRSGTRPDQLAYSVML 549

Query: 1540 DGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIR 1433
            D + +  +    +  + EM  DG+ P+H ++   +R
Sbjct: 550  DIHLRFNEPKRAMTFYKEMIHDGIMPEHSLYELMLR 585


>ref|XP_006491807.1| PREDICTED: pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like isoform X1 [Citrus sinensis]
            gi|568877582|ref|XP_006491808.1| PREDICTED:
            pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like isoform X2 [Citrus sinensis]
            gi|568877584|ref|XP_006491809.1| PREDICTED:
            pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like isoform X3 [Citrus sinensis]
            gi|568877586|ref|XP_006491810.1| PREDICTED:
            pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like isoform X4 [Citrus sinensis]
            gi|568877588|ref|XP_006491811.1| PREDICTED:
            pentatricopeptide repeat-containing protein At3g18110,
            chloroplastic-like isoform X5 [Citrus sinensis]
          Length = 1459

 Score =  496 bits (1278), Expect = e-138
 Identities = 247/391 (63%), Positives = 307/391 (78%), Gaps = 1/391 (0%)
 Frame = -2

Query: 1726 HSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYCS 1547
            HSK+E+LL  MKE GV+PTIATMHLLMVSY S+GQPQEAEKVL++LK +   LSTLPY S
Sbjct: 1062 HSKSENLLNMMKESGVEPTIATMHLLMVSYSSSGQPQEAEKVLSNLKGTSLNLSTLPYSS 1121

Query: 1546 VIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXXX 1367
            VI  Y + GD  +GI K +EMK++G+EPDH+IWTCF+RAASL + +              
Sbjct: 1122 VIAAYLRNGDSAVGIQKLIEMKEEGIEPDHRIWTCFVRAASLSQCSSEAIILLNAIRDAG 1181

Query: 1366 XXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQL 1187
              LP+R+L+   ++ + ++++ LE+ K + ++A+FNFVNA+EDLLWAFE RATASW+FQL
Sbjct: 1182 FDLPIRLLTEKSETLVAEVDHCLEKLKPMEDNAAFNFVNALEDLLWAFELRATASWVFQL 1241

Query: 1186 AIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLLI 1007
            AI+  IY HD+FRVA+KDWGADFRKLS GAALVGLTLWLDHMQDASLQG PESPKSV+LI
Sbjct: 1242 AIKMGIYHHDVFRVADKDWGADFRKLSGGAALVGLTLWLDHMQDASLQGCPESPKSVVLI 1301

Query: 1006 TGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELKD 827
            TGTAEYN+VSL  T+KA LWEMGSPFLP +TR+G++VAKAHSLRMWLKDS FCLDLELKD
Sbjct: 1302 TGTAEYNMVSLNSTLKACLWEMGSPFLPCKTRSGLLVAKAHSLRMWLKDSPFCLDLELKD 1361

Query: 826  GSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTDL 647
              SLPE+NSM+L  G F+R GL+P FK I ERLG +RPKKFARLALL ++ R K +  D+
Sbjct: 1362 APSLPESNSMQLIGGCFIRRGLVPAFKDITERLGIVRPKKFARLALLPDDRRVKAIQADI 1421

Query: 646  QGRKEKTEKMKAK-GIPRSRKLNRFQMKYLR 557
            +GRK K EKMK +  +  +R +     +Y+R
Sbjct: 1422 EGRKGKFEKMKKRVQLKSTRNMKLGTRRYVR 1452


>gb|EMJ07903.1| hypothetical protein PRUPE_ppa023974mg [Prunus persica]
          Length = 1353

 Score =  496 bits (1277), Expect = e-137
 Identities = 249/410 (60%), Positives = 317/410 (77%), Gaps = 9/410 (2%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NH+KAE L   MKE G++P  ATMHLLMVSYGS+GQPQEAEKVL +LK +G +L TLPY 
Sbjct: 934  NHAKAEMLFTMMKEAGIEPNFATMHLLMVSYGSSGQPQEAEKVLDNLKVTGLDLDTLPYS 993

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVI  Y K GDYN+GI K  EMK+ G+EPDH+IWTCFIRAASL +               
Sbjct: 994  SVIGAYLKNGDYNIGIQKLNEMKEVGLEPDHRIWTCFIRAASLSQHKSEAIILLNALRDA 1053

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP+R+++   +S I+++++ LE+ + L ++A+FNFVNA+EDLLWA+E RATASW+FQ
Sbjct: 1054 GFDLPIRLVTEKPESLILEVDHCLEKLEPLEDNAAFNFVNALEDLLWAYELRATASWVFQ 1113

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQ---------DASLQGS 1037
            LA+++ IY +D+FRVA+KDW ADFRKLS G+ALVGLTLWLD MQ         DASL+G 
Sbjct: 1114 LAVKRGIYNNDVFRVADKDWAADFRKLSAGSALVGLTLWLDQMQATLFLLHSFDASLEGY 1173

Query: 1036 PESPKSVLLITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDS 857
            PESPKSV+LITGT+EYN+VSL  T+KA LWEMGSPFLP +TR+G++VAKAHSLRMWLKDS
Sbjct: 1174 PESPKSVVLITGTSEYNMVSLNSTLKACLWEMGSPFLPCKTRSGLLVAKAHSLRMWLKDS 1233

Query: 856  CFCLDLELKDGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEE 677
             FCLDLELKD  +LPE+NSM+L +G F+R GL+P FK I ERLG +RPKKFARLALLS+E
Sbjct: 1234 PFCLDLELKDAPALPESNSMQLIDGCFLRRGLVPAFKEITERLGLVRPKKFARLALLSDE 1293

Query: 676  SREKIMNTDLQGRKEKTEKMKAKGIPRSRKLNRFQMKYLRRQHKSPAALN 527
             REK++ +D++GRKEK EKMK    P  R+++R + K  +R++  P+ L+
Sbjct: 1294 KREKVIQSDIEGRKEKLEKMKENDNP--RRVSRIK-KLRKRKYVRPSTLS 1340



 Score = 60.1 bits (144), Expect = 3e-06
 Identities = 30/97 (30%), Positives = 52/97 (53%)
 Frame = -2

Query: 1723 SKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYCSV 1544
            ++A +++ +M + GVKPT+ T   LM +Y  AG+  EA++    +  SG     L Y  +
Sbjct: 343  TEAANVMSEMLDSGVKPTLRTYSALMCAYAKAGKQVEAQETFDCMVKSGIRPDHLAYSVI 402

Query: 1543 IDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIR 1433
            +D + KV +    I  + EM  DG + DH ++   +R
Sbjct: 403  LDIFLKVNETKKAITLYQEMLHDGFKLDHALYGFMLR 439


>ref|XP_002519997.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223540761|gb|EEF42321.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 1429

 Score =  487 bits (1254), Expect = e-135
 Identities = 244/394 (61%), Positives = 308/394 (78%), Gaps = 1/394 (0%)
 Frame = -2

Query: 1729 NHSKAESLLFQMKEDGVKPTIATMHLLMVSYGSAGQPQEAEKVLTSLKNSGQELSTLPYC 1550
            NHSKAE LL  MK+ GV+PTIATMHLLMVSYGS+GQPQEAEKVLT+LK  G  LSTLPY 
Sbjct: 1030 NHSKAEKLLSMMKDAGVEPTIATMHLLMVSYGSSGQPQEAEKVLTNLKEMGLSLSTLPYS 1089

Query: 1549 SVIDGYFKVGDYNMGIAKFLEMKKDGVEPDHKIWTCFIRAASLCKQTXXXXXXXXXXXXX 1370
            SVID Y K  DY++GI K +EMKK+G+EPDH+IWTCFIRAASL + T             
Sbjct: 1090 SVIDAYLKNKDYSVGIQKLVEMKKEGLEPDHRIWTCFIRAASLSEHTHDAILLLQALQDS 1149

Query: 1369 XXXLPLRILSGNDQSAIMDLEYLLEESKSLGEDASFNFVNAMEDLLWAFERRATASWIFQ 1190
               LP R+++    S ++++++ LE  +++ ++A+FNFVNA+EDLLWAFE RATASW+F+
Sbjct: 1150 GFDLPSRLITERSDSLVLEVDHCLEMLETMEDNAAFNFVNALEDLLWAFELRATASWVFR 1209

Query: 1189 LAIRKNIYRHDIFRVAEKDWGADFRKLSPGAALVGLTLWLDHMQDASLQGSPESPKSVLL 1010
            LA++++IY HD+FRVAE+DWGADFRKLS GAAL           DASLQG P SPKSV+L
Sbjct: 1210 LAVKRSIYCHDVFRVAEQDWGADFRKLSGGAAL-----------DASLQGYPASPKSVVL 1258

Query: 1009 ITGTAEYNLVSLEKTVKAYLWEMGSPFLPSRTRTGIMVAKAHSLRMWLKDSCFCLDLELK 830
            ITGTAEYN+VSL+ T+KA LWEMGSPFLP RTR+G++VAKAHSLRMWLKDS FCLDLELK
Sbjct: 1259 ITGTAEYNMVSLDNTLKACLWEMGSPFLPCRTRSGLLVAKAHSLRMWLKDSPFCLDLELK 1318

Query: 829  DGSSLPETNSMKLTEGYFMRVGLLPTFKHIHERLGQIRPKKFARLALLSEESREKIMNTD 650
            D  SLPE+NSM+L EG F+R GL+P FK I+E+LG +RPKKFA+LALLS++ R+K ++ D
Sbjct: 1319 DAPSLPESNSMQLIEGCFIRRGLVPAFKEINEKLGFVRPKKFAKLALLSDDKRQKAIHAD 1378

Query: 649  LQGRKEKTEKMKAK-GIPRSRKLNRFQMKYLRRQ 551
            ++GRKEK EK+K+K  + R  K N+ + +   R+
Sbjct: 1379 IEGRKEKLEKLKSKVDLERKNKTNKLRRRRFIRK 1412

