BLASTX nr result

ID: Ephedra26_contig00011755 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00011755
         (2897 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280557.1| PREDICTED: pentatricopeptide repeat-containi...   810   0.0  
ref|XP_004508810.1| PREDICTED: pentatricopeptide repeat-containi...   805   0.0  
ref|XP_006390383.1| hypothetical protein EUTSA_v10018112mg [Eutr...   797   0.0  
gb|EMJ11568.1| hypothetical protein PRUPE_ppa001337mg [Prunus pe...   797   0.0  
ref|XP_003549648.1| PREDICTED: pentatricopeptide repeat-containi...   796   0.0  
ref|XP_002888995.1| hypothetical protein ARALYDRAFT_476621 [Arab...   795   0.0  
gb|ESW27367.1| hypothetical protein PHAVU_003G195800g [Phaseolus...   793   0.0  
ref|XP_002322139.2| hypothetical protein POPTR_0015s08030g [Popu...   793   0.0  
ref|XP_006600662.1| PREDICTED: pentatricopeptide repeat-containi...   791   0.0  
ref|XP_006300609.1| hypothetical protein CARUB_v10019779mg [Caps...   791   0.0  
ref|NP_177623.1| plastid transcriptionally active 2 [Arabidopsis...   789   0.0  
ref|XP_004301287.1| PREDICTED: pentatricopeptide repeat-containi...   788   0.0  
gb|EOY20555.1| Plastid transcriptionally active 2 isoform 1 [The...   788   0.0  
gb|EXB29767.1| hypothetical protein L484_008930 [Morus notabilis]     786   0.0  
ref|XP_003525484.1| PREDICTED: pentatricopeptide repeat-containi...   784   0.0  
ref|XP_006344988.1| PREDICTED: pentatricopeptide repeat-containi...   783   0.0  
ref|XP_006439718.1| hypothetical protein CICLE_v10018817mg [Citr...   781   0.0  
ref|XP_006476695.1| PREDICTED: pentatricopeptide repeat-containi...   781   0.0  
ref|XP_006843571.1| hypothetical protein AMTR_s00007p00097240 [A...   781   0.0  
ref|XP_004236160.1| PREDICTED: pentatricopeptide repeat-containi...   781   0.0  

>ref|XP_002280557.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic [Vitis vinifera]
          Length = 869

 Score =  810 bits (2091), Expect = 0.0
 Identities = 390/737 (52%), Positives = 533/737 (72%), Gaps = 1/737 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 73   EKGKYSYDVETLINKLSSLPPRGSIARCLDVFKNKLSLNDFALVFKEFAQRGDWQRSLRL 132

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE+IYT++IGVLGREGLL+KC EIF+EMPS+GV  +VF+FTALINAYG
Sbjct: 133  FKYMQRQIWCKPNEHIYTIMIGVLGREGLLEKCQEIFDEMPSHGVAPSVFSFTALINAYG 192

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN QY++SL LL RMK+E ++P+++TYNTVI +C RGGLDWE LLGLFAQMRH+GIQ DI
Sbjct: 193  RNGQYKSSLELLDRMKKERVSPSILTYNTVINSCARGGLDWEELLGLFAQMRHEGIQADI 252

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            +TYNTLLSAC+ RGL +EAEMVFRTMNE GI+P+  TY+ LV+TF  +  LE V+EL +E
Sbjct: 253  VTYNTLLSACARRGLGDEAEMVFRTMNEGGILPDITTYSYLVETFGKLNRLEKVSELLKE 312

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  G+ PD+T+YN+LLEA + SG IK+A GVFRQM+ AGC+PN +TY IL+N YG + +
Sbjct: 313  MESGGSFPDITSYNVLLEAHAQSGSIKEAMGVFRQMQGAGCVPNAATYSILLNLYGRHGR 372

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VRDLFLEMK +N EP+  TYN LI +FG+GGYFKE V+ F DMVE+NV+P M++YEG
Sbjct: 373  YDDVRDLFLEMKVSNTEPNAATYNILINVFGEGGYFKEVVTLFHDMVEENVEPNMETYEG 432

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             +FACG+GGL EDA+KIL HM  + +VPS K + G++ AYG+A LYE+A    + M E+G
Sbjct: 433  LIFACGKGGLHEDAKKILLHMNEKGVVPSSKAYTGVIEAYGQAALYEEALVAFNTMNEVG 492

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
              P    +NS+I M+ + GLYKE+  I   M ++G+    +TFN +IEAF +  +++EA+
Sbjct: 493  SKPTVETYNSLIQMFAKGGLYKESEAILLKMGQSGVARNRDTFNGVIEAFRQGGQFEEAI 552

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M  +   P+++T E +L VYC +   E++   F E+K    LPS+  YCMM+++Y
Sbjct: 553  KAYVEMEKARCDPDEQTLEAVLSVYCFAGLVEESEEQFGEIKALGILPSVMCYCMMLAVY 612

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+ + W +   LL EM  + V +I+  +  +I GDY  +S+W    + F+KL+++G  L 
Sbjct: 613  AKADRWDDAHQLLDEMFTNRVSNIHQVIGQMIRGDYDDDSNWQMVEYVFEKLKSEGCSLG 672

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ L+ALWWLGQ+E +TRVL EA   G+FPE F +  +V  VD+HRM  G A  A+
Sbjct: 673  VRFYNTLLEALWWLGQKERATRVLNEATKRGLFPELFRKNKLVWSVDVHRMWEGAACTAI 732

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKDLESPFKIAD 2368
            S W+  M  +  +G+++ +L S  V+RG +E     ++ P++K+   FL ++ S F    
Sbjct: 733  SVWLNNMHEMFISGDDLPQLASAVVVRGHMEKSSITRDFPVAKSAYAFLNEVSSSFCFPG 792

Query: 2369 WNGGRIVCTNTQLKRWL 2419
            WN GRIVC  +QLKR L
Sbjct: 793  WNKGRIVCQRSQLKRIL 809


>ref|XP_004508810.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Cicer arietinum]
          Length = 861

 Score =  805 bits (2078), Expect = 0.0
 Identities = 391/740 (52%), Positives = 537/740 (72%), Gaps = 2/740 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E GKYSY VE  I  + +L  RGSI RCL++FKNKL+LNDF+++F++    G+WQR+L+L
Sbjct: 65   ESGKYSYDVETLINRLSSLPPRGSIARCLDSFKNKLSLNDFSVVFKEFAQRGDWQRSLRL 124

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE+IYT++I +LGREGLLDKC E+F+EMPS GV  +VF +TA+INAYG
Sbjct: 125  FKYMQRQIWCKPNEHIYTIMITLLGREGLLDKCREVFDEMPSQGVPRSVFAYTAVINAYG 184

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN Q++TS+ LL RMK+E ++P+++TYNTVI AC RGGLDWEGLLGLFA+MRH+GIQPD+
Sbjct: 185  RNGQFQTSVELLDRMKQERVSPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDV 244

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            ITYNTLLSAC+ RGL +EAEMVFRTMNE G+VP+  TY+ LV TF  + +LE V+EL RE
Sbjct: 245  ITYNTLLSACAHRGLGDEAEMVFRTMNEGGVVPDINTYSYLVHTFGKLNKLEKVSELLRE 304

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  GNLPDV++YN+LLEA++ SG IK A GVFRQM+ AGC+PN +TY IL+N YG + +
Sbjct: 305  MESGGNLPDVSSYNVLLEAYAESGSIKDAIGVFRQMQGAGCVPNAATYSILLNLYGKHGR 364

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VRDLFLEMK +N +PD  TYN LI +FG+GGYFKE V+ F DMV++NV+P M++YEG
Sbjct: 365  YDDVRDLFLEMKVSNTDPDAGTYNILIQVFGEGGYFKEVVTLFHDMVDENVEPNMETYEG 424

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             +FACG+GGL EDA+KIL HM  R +VPS K + G++ AYG+A LYE+A    + M E+G
Sbjct: 425  LIFACGKGGLYEDAKKILLHMNERGVVPSSKAYTGVIEAYGQAALYEEALVAFNTMNEVG 484

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
             +P    +NS++  + R GLYKE   I + M E+G+     +FN +IEA  +A +Y+EA+
Sbjct: 485  SNPTVETYNSLVRSFARGGLYKEVEAILFRMGESGLPRDVHSFNGVIEALRQAGQYEEAV 544

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              + +M  +    ++ T E +L +YCA+   +++   F E+K +  LPS+T YCMM+++Y
Sbjct: 545  KAHVEMEKANCDYDESTLEAVLSIYCAAGLVDESEEQFQEIKASGILPSVTCYCMMLALY 604

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+ +   +  +LL EM+ + V DI+  +  +I GD+  ES+W    + F KL +KG  L 
Sbjct: 605  AKNDRSIDAYSLLDEMITTRVSDIHQVIGQMIKGDFDDESNWQIVEYIFDKLNSKGCGLG 664

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ L+ALWW+ QRE + RVL EA   G+FPE F +  +V  VD+HRMS G AL AL
Sbjct: 665  MKFYNALLEALWWMYQRERAARVLNEASKRGLFPELFRKNKLVWSVDVHRMSEGAALTAL 724

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKDL-ESPFKIA 2365
            S W+ ++Q +   GE++ +L ++ V RG++E   +A++ PI+K    FL+D+  S F   
Sbjct: 725  SIWLNDIQEMFMIGESLPELAAVVVARGKMEESIDAQDFPIAKAAFLFLQDIVSSAFTYP 784

Query: 2366 DWNGGRIVCTNTQLKRWLQG 2425
             WN GRIVC  +QL+R L G
Sbjct: 785  GWNKGRIVCQQSQLRRILSG 804


>ref|XP_006390383.1| hypothetical protein EUTSA_v10018112mg [Eutrema salsugineum]
            gi|557086817|gb|ESQ27669.1| hypothetical protein
            EUTSA_v10018112mg [Eutrema salsugineum]
          Length = 863

 Score =  797 bits (2058), Expect = 0.0
 Identities = 384/740 (51%), Positives = 533/740 (72%), Gaps = 2/740 (0%)
 Frame = +2

Query: 206  SEEEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRAL 385
            S E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++  G G+WQR+L
Sbjct: 65   SVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRSL 124

Query: 386  KLFKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINA 565
            +LFKYMQRQ WCKPNE+IYT++I +LGREGLLDKC EIF+EMPS GV  +VF++TALINA
Sbjct: 125  RLFKYMQRQIWCKPNEHIYTIMISLLGREGLLDKCLEIFDEMPSQGVARSVFSYTALINA 184

Query: 566  YGRNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQP 745
            YGRN +YETSL LL RMK E I+P+++TYNTVI AC RGGLDWEGLLGLFA+MRH+GIQP
Sbjct: 185  YGRNGRYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQP 244

Query: 746  DIITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELY 925
            DI+TYNTLLSAC+ RGL +EAEMVFRTMN+ GIVP+  TY+ LV+TF  +  L  V++L 
Sbjct: 245  DIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLSRLVKVSDLL 304

Query: 926  REMELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNN 1105
             EM   G+LPD+T+YN+LLEA++ SG IK+A GVF QM+ AGC PN +TY +L+N +G +
Sbjct: 305  SEMASGGSLPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQS 364

Query: 1106 AQYDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSY 1285
             +YD VR LFLEMK++N +PD  TYN LI +FG+GGYFKE V+ F DMVE+N++P+M++Y
Sbjct: 365  GRYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETY 424

Query: 1286 EGFMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKE 1465
            EG +FACG+GGL EDA K+L++M  +++VPS K + G++ A+G+A LYE+A    + M E
Sbjct: 425  EGIIFACGKGGLHEDARKVLQYMTAKDVVPSSKAYTGVIEAFGQAALYEEALVAFNTMHE 484

Query: 1466 LGRDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKE 1645
            +G +P    ++S++  + R GL+KE+  I   + ++GI    +TFN+ IEA+ +  +++E
Sbjct: 485  VGSNPSIETYHSLLYSFARGGLFKESEVILSRLVDSGIPRNRDTFNAQIEAYRQGGKFEE 544

Query: 1646 ALNIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMS 1825
            A+  Y DM  S   P+++T E +L VY  +   ++    F EMK +  LPSI  YCMM+S
Sbjct: 545  AVKTYVDMEKSRCDPDERTLEAVLSVYSCARLVDECREQFEEMKASDILPSIMCYCMMLS 604

Query: 1826 MYARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVD 2002
            +Y +   W + + LL+EML++ V +I+  +  +I GDY  +S+W    +   KL ++G  
Sbjct: 605  VYGKTERWGDVNELLEEMLSNRVSNIHQVIGQMIKGDYDDDSNWQIVEYVLDKLNSEGCG 664

Query: 2003 LETSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALA 2182
            L   FY++ LDALWWLGQ+E + RVL EA   G+FPE F +  +V  VD+HRMS G    
Sbjct: 665  LGIRFYNALLDALWWLGQKERAARVLNEATKRGLFPELFRKNKLVRSVDVHRMSEGGMYT 724

Query: 2183 ALSTWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFK 2359
            ALS W+ ++  ++  GE++ +L  +  +RG+LE    A+ +PI+K    FL+D + S F 
Sbjct: 725  ALSVWLNDINDMLLKGEDLPQLAVVVSVRGQLEKSSAARESPIAKAAFSFLQDHVSSSFS 784

Query: 2360 IADWNGGRIVCTNTQLKRWL 2419
               WNGGRI+C  +QLK+ L
Sbjct: 785  FTGWNGGRIMCQRSQLKQLL 804


>gb|EMJ11568.1| hypothetical protein PRUPE_ppa001337mg [Prunus persica]
          Length = 850

 Score =  797 bits (2058), Expect = 0.0
 Identities = 383/738 (51%), Positives = 525/738 (71%), Gaps = 2/738 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 53   EKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAARGDWQRSLRL 112

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE+IYT++I +LGREGLLDKCSE+F++MPS GV  +VF++TALINAYG
Sbjct: 113  FKYMQRQIWCKPNEHIYTIMISLLGREGLLDKCSEVFDDMPSQGVVRSVFSYTALINAYG 172

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN QYETSL  L RMK++ ++P+++TYNTV+ AC RGGL+WEGLLGLFA+MRH+GIQPD+
Sbjct: 173  RNGQYETSLQFLDRMKKDKVSPSILTYNTVLNACARGGLEWEGLLGLFAEMRHEGIQPDL 232

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            +TYNTLLSAC+ RGL +EAEMVFRTMNE GIVP+  TY  LV+TF  + +LE V+EL +E
Sbjct: 233  VTYNTLLSACAGRGLGDEAEMVFRTMNEGGIVPDITTYRYLVETFGKLDKLEKVSELLKE 292

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  GNLPD+T+YN+LLEA++  G I+++ GVFRQM+ AGC+PN +TY IL+N YG + +
Sbjct: 293  MESGGNLPDITSYNVLLEAYAQLGSIRESMGVFRQMQAAGCMPNAATYSILLNLYGRHGR 352

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VR+LFLEMK +N EPD  TYN LI +FG+GGYFKE V+ F DMVE+N++P M++YEG
Sbjct: 353  YDDVRELFLEMKISNTEPDPATYNILIQVFGEGGYFKEVVTLFHDMVEENIEPNMETYEG 412

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             ++ACG+GGL EDA+ IL HM  + IVPS K + G++ AYG+A LY++A    + M E+G
Sbjct: 413  LIYACGKGGLHEDAKNILLHMSEKGIVPSSKAYTGVIEAYGQAALYDEALVAFNTMNEVG 472

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
              P    +NS+I  + R GLY+E   +   M E G      TFN +IEAF +  +++EA+
Sbjct: 473  SKPSVESYNSLIYAFARGGLYRETEAVLSIMGEVGAARNVHTFNGMIEAFRQGGQFEEAI 532

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M       ++ T E +L VYC +    +   HF EMK +  LPS+  YCMM+++Y
Sbjct: 533  KAYVEMEKRRCDHDEWTLEAVLSVYCVAGLVNECEEHFQEMKASGILPSVMCYCMMLAVY 592

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            AR + W + + LL EML +   +I+  +  +I GDY  +S+W    + F KL+++G  L 
Sbjct: 593  ARNDRWDDANELLNEMLTNRASNIHQVIGQMIKGDYDDDSNWQMVEYVFDKLKSEGCGLG 652

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ L+ALWWLGQ++ + RVL EA   G+FPE F +  +V  VD+HRM  G A AA+
Sbjct: 653  MRFYNTLLEALWWLGQKQRAVRVLNEATQRGLFPELFRKNKLVGSVDVHRMWQGGAYAAM 712

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFKIA 2365
            S W+  M  +  NGE++  + ++ V+RG++E     ++ PI+K    FL+D + S F   
Sbjct: 713  SVWLNNMYEMFLNGEDLPNIATVVVVRGKMEKSSMTQDLPIAKAAYSFLEDNMPSSFSFP 772

Query: 2366 DWNGGRIVCTNTQLKRWL 2419
             WN GRI+C   QLKR L
Sbjct: 773  KWNKGRILCQRPQLKRIL 790


>ref|XP_003549648.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 859

 Score =  796 bits (2056), Expect = 0.0
 Identities = 396/791 (50%), Positives = 542/791 (68%), Gaps = 7/791 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  I  L  RGSI RCL+ FKNKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 63   EKGKYSYDVETLINRITALPPRGSIARCLDPFKNKLSLNDFALVFKEFAQRGDWQRSLRL 122

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE+IYT++I +LGREGLLDKC E+F+EMPS GV  TV+ +TA+INAYG
Sbjct: 123  FKYMQRQIWCKPNEHIYTIMITLLGREGLLDKCREVFDEMPSNGVARTVYVYTAVINAYG 182

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN Q+  SL LL  MK+E ++P+++TYNTVI AC RGGLDWEGLLGLFA+MRH+GIQPD+
Sbjct: 183  RNGQFHASLELLNGMKQERVSPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDV 242

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            ITYNTLL AC+ RGL +EAEMVFRTMNE+GIVP+  TY+ LV TF  +  LE V+EL RE
Sbjct: 243  ITYNTLLGACAHRGLGDEAEMVFRTMNESGIVPDINTYSYLVQTFGKLNRLEKVSELLRE 302

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  GNLPD+T+YN+LLEA++  G IK+A  VFRQM+ AGC+ N +TY +L+N YG + +
Sbjct: 303  MESGGNLPDITSYNVLLEAYAELGSIKEAMDVFRQMQAAGCVANAATYSVLLNLYGKHGR 362

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VRD+FLEMK +N +PD  TYN LI +FG+GGYFKE V+ F DMVE+NV+P M++YEG
Sbjct: 363  YDDVRDIFLEMKVSNTDPDAGTYNILIQVFGEGGYFKEVVTLFHDMVEENVEPNMETYEG 422

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             +FACG+GGL EDA+KIL HM  + IVPS K + G++ A+G+A LYE+A    + M E+G
Sbjct: 423  LIFACGKGGLYEDAKKILLHMNEKGIVPSSKAYTGVIEAFGQAALYEEALVVFNTMNEVG 482

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
             +P    +NS I  + R GLYKEA  I   M E+G++    +FN +I+AF +  +Y+EA+
Sbjct: 483  SNPTVETYNSFIHAFARGGLYKEAEAILSRMNESGLKRDVHSFNGVIKAFRQGGQYEEAV 542

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M  +   PN+ T E +L VYC++   +++   F E+K +  LPS+  YC+M+++Y
Sbjct: 543  KSYVEMEKANCEPNELTLEVVLSVYCSAGLVDESEEQFQEIKASGILPSVMCYCLMLALY 602

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+ +   +   L+ EM+   V DI+  +  +I GD+  ES+W    + F KL ++G  L 
Sbjct: 603  AKNDRLNDAYNLIDEMITMRVSDIHQGIGQMIKGDFDDESNWQIVEYVFDKLNSEGCGLG 662

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ L+ALWW+ QRE + RVL EA   G+FPE F ++ +V  VD+HRMS G AL AL
Sbjct: 663  MRFYNALLEALWWMFQRERAARVLNEASKRGLFPELFRKSKLVWSVDVHRMSEGGALTAL 722

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFKIA 2365
            S W+  M  + R G ++ +L ++ V+RG +E   EA++ PI+K    FL+D + S F   
Sbjct: 723  SVWLNNMHEMSRTGNDLPELATVVVVRGHMEKSTEAQDFPIAKAAISFLQDNVPSSFTFP 782

Query: 2366 DWNGGRIVCTNTQLKRWLQGXXXXXXXXXXXXXI-----PLPLEDTTNTISDEVDSTKED 2530
             WN GRIVC  +QL+R L G             +     PL       + SD       D
Sbjct: 783  GWNKGRIVCQQSQLRRILSGTESSSSRKKMDKLVSLSNTPLTTAGVITSKSDVQSGKAND 842

Query: 2531 IEDQTLNFKTK 2563
            ++ +T + +T+
Sbjct: 843  VDSRTDSTRTE 853


>ref|XP_002888995.1| hypothetical protein ARALYDRAFT_476621 [Arabidopsis lyrata subsp.
            lyrata] gi|297334836|gb|EFH65254.1| hypothetical protein
            ARALYDRAFT_476621 [Arabidopsis lyrata subsp. lyrata]
          Length = 863

 Score =  795 bits (2053), Expect = 0.0
 Identities = 385/740 (52%), Positives = 532/740 (71%), Gaps = 2/740 (0%)
 Frame = +2

Query: 206  SEEEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRAL 385
            S E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++  G G+WQR+L
Sbjct: 66   SVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRSL 125

Query: 386  KLFKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINA 565
            +LFKYMQRQ WCKPNE+IYT++I +LGREGLLDKC E+F+EMPS GV  +VF++TALINA
Sbjct: 126  RLFKYMQRQIWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINA 185

Query: 566  YGRNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQP 745
            YGRN +YETSL LL RMK + I+P+++TYNTVI AC RGGLDWEGLLGLFA+MRH+GIQP
Sbjct: 186  YGRNGRYETSLELLDRMKNDKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQP 245

Query: 746  DIITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELY 925
            DI+TYNTLLSAC+ RGL +EAEMVFRTMN+ GIVP+  TY+ LV+TF  +  LE V++L 
Sbjct: 246  DIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVSDLL 305

Query: 926  REMELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNN 1105
             EM   G+LPD+T+YN+LLEA++ SG IK+A GVF QM+ AGC PN +TY +L+N +G +
Sbjct: 306  SEMASGGSLPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQS 365

Query: 1106 AQYDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSY 1285
             +YD VR LFLEMK++N +PD  TYN LI +FG+GGYFKE V+ F DMVE+N++P+M++Y
Sbjct: 366  GRYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETY 425

Query: 1286 EGFMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKE 1465
            EG +FACG+GGL EDA KIL++M   +IVPS K + G++ A+G+A LYE+A    + M E
Sbjct: 426  EGIIFACGKGGLHEDARKILQYMTANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHE 485

Query: 1466 LGRDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKE 1645
            +G +P    ++S++  + R GL KE+  I   + ++GI    +TFN+ IEA+ +  +++E
Sbjct: 486  VGSNPSIETYHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGKFEE 545

Query: 1646 ALNIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMS 1825
            A+  Y DM  S   P+++T E +L VY  +   ++    F EMK +  LPSI  YCMM++
Sbjct: 546  AVKTYVDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKASDILPSIMCYCMMLA 605

Query: 1826 MYARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVD 2002
            +Y +   W + + LL+EML++ V +I+  +  +I GDY  +S+W    +   KL ++G  
Sbjct: 606  VYGKTERWDDVNELLEEMLSNRVSNIHQVIGQMIKGDYDDDSNWQIVEYVLDKLNSEGCG 665

Query: 2003 LETSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALA 2182
            L   FY++ LDALWWLGQ+E + RVL EA   G+FPE F +  +V  VD+HRMS G    
Sbjct: 666  LGIRFYNALLDALWWLGQKERAARVLNEATKRGLFPELFRKNKLVWSVDVHRMSEGGMYT 725

Query: 2183 ALSTWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFK 2359
            ALS W+ +M  ++ NGE++ +L  +  +RG+LE    A+ + I+K    FL+D + S F 
Sbjct: 726  ALSVWLNDMNDMLLNGEDLPQLAVVVSVRGQLEKSSAARESSIAKAAFSFLQDHVSSSFS 785

Query: 2360 IADWNGGRIVCTNTQLKRWL 2419
               WNGGRI+C  +QLK+ L
Sbjct: 786  FTGWNGGRIMCQRSQLKQLL 805


>gb|ESW27367.1| hypothetical protein PHAVU_003G195800g [Phaseolus vulgaris]
          Length = 857

 Score =  793 bits (2048), Expect = 0.0
 Identities = 401/804 (49%), Positives = 550/804 (68%), Gaps = 6/804 (0%)
 Frame = +2

Query: 170  FRAQATVETSLTSEEEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFR 349
            F+    +  S+T E+ GKYSY VE  I  +  L  RGSI RCL+ FKNKL+LNDFAL+F+
Sbjct: 50   FKELIPINPSVTVEK-GKYSYDVETLINRLTALPPRGSIARCLDPFKNKLSLNDFALVFK 108

Query: 350  DLGGCGEWQRALKLFKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVH 529
            +    G+WQR+L+LFKYMQRQ WCKPNE+I T++I +LGRE LLDKC E+F+EMPS GV 
Sbjct: 109  EFAQRGDWQRSLRLFKYMQRQLWCKPNEHICTIMITLLGRESLLDKCREVFDEMPSNGVA 168

Query: 530  WTVFTFTALINAYGRNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLG 709
             TV+ +TA+INAYGRN Q++ SL LL  MK+E ++P+++TYNTVI AC RGGLDWEGLLG
Sbjct: 169  RTVYAYTAIINAYGRNGQFQASLELLDAMKQERVSPSILTYNTVINACARGGLDWEGLLG 228

Query: 710  LFAQMRHDGIQPDIITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFK 889
            LFA+MRH+GIQPD+ITYNTLL AC+ RGL +EAEMVFRTMNE+GIVP+  TY+ LV TF 
Sbjct: 229  LFAEMRHEGIQPDVITYNTLLCACAHRGLGDEAEMVFRTMNESGIVPDINTYSYLVQTFG 288

Query: 890  NIGELEMVTELYREMELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVS 1069
             +  LE V++L REME  GNLPD+T+YN+LLEA +  G IK A GVFRQM+ AGC+PN  
Sbjct: 289  KLNRLEKVSDLLREMESGGNLPDITSYNVLLEAHAELGSIKDAMGVFRQMQAAGCVPNAD 348

Query: 1070 TYVILINSYGNNAQYDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDM 1249
            TY IL+N YG + +YD VR+LFLEMK +N +PDV TYN LI +FG+GGYFKE V+ F DM
Sbjct: 349  TYSILLNLYGKHGRYDDVRELFLEMKVSNTDPDVGTYNILIQVFGEGGYFKEVVTLFHDM 408

Query: 1250 VEKNVQPEMDSYEGFMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLY 1429
            VE+N++P M++YEG +FACG+GGL EDA+KIL HM+ + IVP+ K + G++ A+G+A LY
Sbjct: 409  VEENIEPNMETYEGLIFACGKGGLYEDAKKILMHMKEKGIVPTSKAYTGVIEAFGQAALY 468

Query: 1430 EDAHSTLSHMKELGRDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSL 1609
            E+A    + MKE+G +     +NS +  Y R GLYKEA  I   M E+G++    +FN  
Sbjct: 469  EEALVAFNTMKEVGSNATLETYNSFVHAYARGGLYKEAEAILSRMNESGLKRDVNSFNGE 528

Query: 1610 IEAFGRACEYKEALNIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFG 1789
            IEAF +A +Y+EA+  + +M  +   PN+ T E +L VYC +   +++   F E+K +  
Sbjct: 529  IEAFRQAGQYEEAVKAHVEMEKANCEPNELTLEAVLSVYCTAGLVDESEEQFQEIKASGL 588

Query: 1790 LPSITSYCMMMSMYARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAV 1966
            LPS+  YCMM+++YA+ +  K+   L+ EM+   V D++  +  +I GD+  ES+W    
Sbjct: 589  LPSVMCYCMMLALYAKNDRSKDAYNLIDEMIKIRVSDVHQVIGQMIKGDFDDESNWQIVE 648

Query: 1967 HEFKKLRTKGVDLETSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLV 2146
            + F KL ++G  L   FY++ L+ALWW+ QRE + RVL EA   G+FPE F ++ +V  V
Sbjct: 649  YIFDKLTSEGCGLGMRFYNALLEALWWMFQRERAARVLNEASKRGLFPELFRKSKLVWSV 708

Query: 2147 DIHRMSIGTALAALSTWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQ 2326
            D+HRMS G AL ALS W+  MQ +    E++  L S+ V+RGE+E   +A++ PI+K   
Sbjct: 709  DVHRMSEGAALTALSVWLNNMQEMFMISEDLPVLASVVVVRGEMEKTIDAQDFPIAKAAM 768

Query: 2327 RFLKD--LESPFKIADWNGGRIVCTNTQLKRWLQGXXXXXXXXXXXXXIPL---PLEDTT 2491
             FL+D    S F   +WN GRIVC  +QL++ L G             I L   PL  T 
Sbjct: 769  SFLQDNVPSSSFTFPEWNKGRIVCQQSQLRQILSGTESSSSRKKMGKLISLSNSPL-TTA 827

Query: 2492 NTISDEVDSTKEDIEDQTLNFKTK 2563
               + + D    D++ +T + +T+
Sbjct: 828  GAKASKSDRKANDVDSRTDSTRTE 851


>ref|XP_002322139.2| hypothetical protein POPTR_0015s08030g [Populus trichocarpa]
            gi|550322283|gb|EEF06266.2| hypothetical protein
            POPTR_0015s08030g [Populus trichocarpa]
          Length = 866

 Score =  793 bits (2047), Expect = 0.0
 Identities = 382/740 (51%), Positives = 531/740 (71%), Gaps = 2/740 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 70   EKGKYSYDVETLINKLSSLPPRGSIARCLDVFKNKLSLNDFALVFKEFAQRGDWQRSLRL 129

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FK+MQRQ WCKPNE+IYT++I +LGREGLL+KCS+IFEEM ++GV  +VF++TALIN+YG
Sbjct: 130  FKHMQRQIWCKPNEHIYTIMISLLGREGLLEKCSDIFEEMGAHGVSRSVFSYTALINSYG 189

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN +YE SL LL RMK+E ++P+++TYNTVI +C RGGLDWEGLLGLFA+MRH+GIQPDI
Sbjct: 190  RNGKYEVSLELLERMKKERVSPSILTYNTVINSCARGGLDWEGLLGLFAEMRHEGIQPDI 249

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            +TYNTLL ACS+RGL +EAEMVFRTMNE G+VP+  TYT LVDTF  +  L+ V+EL +E
Sbjct: 250  VTYNTLLCACSNRGLGDEAEMVFRTMNEGGVVPDITTYTYLVDTFGKLNRLDKVSELLKE 309

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            M   GN+P++++YN+LLEA++  G I+ A GVFR M+EAGC+PN  TY IL+  YG + +
Sbjct: 310  MASTGNVPEISSYNVLLEAYARIGNIEDATGVFRLMQEAGCVPNAETYSILLGLYGKHGR 369

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VR+LFLEMK +N EPD  TYN LI +FG+GGYFKE V+ F DM E+NV+P M++YEG
Sbjct: 370  YDEVRELFLEMKVSNTEPDAATYNTLIDVFGEGGYFKEVVTLFHDMAEENVEPNMETYEG 429

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             +FACG+GGL +DA+KIL HM  + ++PS K + G++ AYG+A +YE+A  TL+ M E+G
Sbjct: 430  LIFACGKGGLHDDAKKILLHMSEKGMIPSSKAYTGVIEAYGQAAMYEEALVTLNTMNEMG 489

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
              P    +N++I M+ R GLYKE   I   M + G+    ++FN +IE F +  +++EA+
Sbjct: 490  SKPTIETYNTLIYMFARGGLYKETEAILLKMGDFGVARERDSFNGVIEGFRQGGQFEEAI 549

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M  S + P+++T E +L VYC +   +++   F E+K +  LP++  YCMM+++Y
Sbjct: 550  KAYVEMEKSRLVPDERTLEAVLSVYCIAGLVDESVEQFQEIKASGILPNVMCYCMMLAVY 609

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+ + W E   LL EML +   +I+  +  +I GD+  +S+W    + F KL ++G  L 
Sbjct: 610  AKSDRWNEAYELLDEMLTNRASNIHQVIGQMIKGDFDDDSNWQMVEYVFDKLNSEGCGLG 669

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ L+ALWWLGQ+E + RVL EA   G FPE F ++ +V  VDIHRM  G+A  A+
Sbjct: 670  MRFYNTLLEALWWLGQKERAVRVLGEATKRGHFPELFRKSKLVWSVDIHRMWEGSAYTAI 729

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKDL-ESPFKIA 2365
            S W+  M  +  N ++I +L S+ V+RG LE    A++ PI K V  FL+D+  S F  +
Sbjct: 730  SVWLNNMYEIFMNRQDIPQLASVIVVRGLLEKSSVAQDFPIGKAVHSFLQDIVPSSFSYS 789

Query: 2366 DWNGGRIVCTNTQLKRWLQG 2425
             WN GRI C  +QLKR+L G
Sbjct: 790  GWNNGRITCQRSQLKRFLLG 809


>ref|XP_006600662.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like isoform X2 [Glycine max]
          Length = 860

 Score =  791 bits (2044), Expect = 0.0
 Identities = 396/792 (50%), Positives = 542/792 (68%), Gaps = 8/792 (1%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  I  L  RGSI RCL+ FKNKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 63   EKGKYSYDVETLINRITALPPRGSIARCLDPFKNKLSLNDFALVFKEFAQRGDWQRSLRL 122

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE+IYT++I +LGREGLLDKC E+F+EMPS GV  TV+ +TA+INAYG
Sbjct: 123  FKYMQRQIWCKPNEHIYTIMITLLGREGLLDKCREVFDEMPSNGVARTVYVYTAVINAYG 182

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN Q+  SL LL  MK+E ++P+++TYNTVI AC RGGLDWEGLLGLFA+MRH+GIQPD+
Sbjct: 183  RNGQFHASLELLNGMKQERVSPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDV 242

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            ITYNTLL AC+ RGL +EAEMVFRTMNE+GIVP+  TY+ LV TF  +  LE V+EL RE
Sbjct: 243  ITYNTLLGACAHRGLGDEAEMVFRTMNESGIVPDINTYSYLVQTFGKLNRLEKVSELLRE 302

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  GNLPD+T+YN+LLEA++  G IK+A  VFRQM+ AGC+ N +TY +L+N YG + +
Sbjct: 303  MESGGNLPDITSYNVLLEAYAELGSIKEAMDVFRQMQAAGCVANAATYSVLLNLYGKHGR 362

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VRD+FLEMK +N +PD  TYN LI +FG+GGYFKE V+ F DMVE+NV+P M++YEG
Sbjct: 363  YDDVRDIFLEMKVSNTDPDAGTYNILIQVFGEGGYFKEVVTLFHDMVEENVEPNMETYEG 422

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             +FACG+GGL EDA+KIL HM  + IVPS K + G++ A+G+A LYE+A    + M E+G
Sbjct: 423  LIFACGKGGLYEDAKKILLHMNEKGIVPSSKAYTGVIEAFGQAALYEEALVVFNTMNEVG 482

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
             +P    +NS I  + R GLYKEA  I   M E+G++    +FN +I+AF +  +Y+EA+
Sbjct: 483  SNPTVETYNSFIHAFARGGLYKEAEAILSRMNESGLKRDVHSFNGVIKAFRQGGQYEEAV 542

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M  +   PN+ T E +L VYC++   +++   F E+K +  LPS+  YC+M+++Y
Sbjct: 543  KSYVEMEKANCEPNELTLEVVLSVYCSAGLVDESEEQFQEIKASGILPSVMCYCLMLALY 602

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+ +   +   L+ EM+   V DI+  +  +I GD+  ES+W    + F KL ++G  L 
Sbjct: 603  AKNDRLNDAYNLIDEMITMRVSDIHQGIGQMIKGDFDDESNWQIVEYVFDKLNSEGCGLG 662

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ L+ALWW+ QRE + RVL EA   G+FPE F ++ +V  VD+HRMS G AL AL
Sbjct: 663  MRFYNALLEALWWMFQRERAARVLNEASKRGLFPELFRKSKLVWSVDVHRMSEGGALTAL 722

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVM-RGELEHRKEAKNNPISKTVQRFLKD-LESPFKI 2362
            S W+  M  + R G ++ +L ++ V+ RG +E   EA++ PI+K    FL+D + S F  
Sbjct: 723  SVWLNNMHEMSRTGNDLPELATVVVVSRGHMEKSTEAQDFPIAKAAISFLQDNVPSSFTF 782

Query: 2363 ADWNGGRIVCTNTQLKRWLQGXXXXXXXXXXXXXI-----PLPLEDTTNTISDEVDSTKE 2527
              WN GRIVC  +QL+R L G             +     PL       + SD       
Sbjct: 783  PGWNKGRIVCQQSQLRRILSGTESSSSRKKMDKLVSLSNTPLTTAGVITSKSDVQSGKAN 842

Query: 2528 DIEDQTLNFKTK 2563
            D++ +T + +T+
Sbjct: 843  DVDSRTDSTRTE 854


>ref|XP_006300609.1| hypothetical protein CARUB_v10019779mg [Capsella rubella]
            gi|482569319|gb|EOA33507.1| hypothetical protein
            CARUB_v10019779mg [Capsella rubella]
          Length = 865

 Score =  791 bits (2042), Expect = 0.0
 Identities = 383/740 (51%), Positives = 528/740 (71%), Gaps = 2/740 (0%)
 Frame = +2

Query: 206  SEEEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRAL 385
            S E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++  G  +WQR+L
Sbjct: 66   SVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRSDWQRSL 125

Query: 386  KLFKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINA 565
            +LFKYMQRQ WCKPNE+IYT++I +LGREGLLDKC E+F+EMP  GV  +VF++TALINA
Sbjct: 126  RLFKYMQRQIWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPGQGVSRSVFSYTALINA 185

Query: 566  YGRNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQP 745
            YGRN +YETSL LL RMK E I+P+++TYNTVI AC RGGLDWEGLLGLFA+MRH+GIQ 
Sbjct: 186  YGRNGRYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQS 245

Query: 746  DIITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELY 925
            DI+TYNTLLSAC+ RGL +EAEMVFRTMN+ GIVP+  TY+ LV+TF  +G LE V++L 
Sbjct: 246  DIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLGRLEKVSDLL 305

Query: 926  REMELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNN 1105
             EM   G+LPD+T+YN+LLEA++ SG IK++ GVF QM+ AGC PN +TY +L+N +G +
Sbjct: 306  SEMASGGSLPDITSYNVLLEAYAKSGSIKESMGVFHQMQAAGCTPNANTYSVLLNLFGQS 365

Query: 1106 AQYDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSY 1285
             +YD VR LFLEMK++N +PD  TYN LI +FG+GGYFKE V+ F DMVE+N++P+M++Y
Sbjct: 366  GRYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETY 425

Query: 1286 EGFMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKE 1465
            EG +FACG+GGL EDA KIL++M   +IVPS K + G++ A+G+A LYE+A    + M E
Sbjct: 426  EGIIFACGKGGLQEDARKILQYMTANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHE 485

Query: 1466 LGRDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKE 1645
            +G +P    ++S++  + R GL KE+  I   + ++GI    +TFN+ IEA+ +   ++E
Sbjct: 486  VGSNPSIETYHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGRFEE 545

Query: 1646 ALNIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMS 1825
            A+  Y DM  S   P+++T E +L VY  +   ++    F EMK +  LPSI  YCMM++
Sbjct: 546  AVKTYVDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKASDILPSIMCYCMMLA 605

Query: 1826 MYARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVD 2002
            +Y +   W + + LL+EML++ V +I+  +  +I GDY  +S+W    +   KL ++G  
Sbjct: 606  VYGKTERWDDVNELLEEMLSNRVSNIHQVIGQMIKGDYDDDSNWQIVEYVLDKLNSEGCG 665

Query: 2003 LETSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALA 2182
            L   FY++ LDALWWLGQ+E + RVL EA   G+FPE F +  +V  VD+HRMS G    
Sbjct: 666  LGIRFYNALLDALWWLGQKERAARVLNEATKRGLFPELFRKNKLVWSVDVHRMSEGGMYT 725

Query: 2183 ALSTWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFK 2359
            ALS W+ +M  +   GE++ +L  +  +RG+LE    A+ +PI+K    FL+D + S F 
Sbjct: 726  ALSVWLNDMNDMFLTGEDLPQLAVVVSVRGQLEKSSAARESPIAKAAFSFLQDHVSSSFS 785

Query: 2360 IADWNGGRIVCTNTQLKRWL 2419
               WNGGRI+C  +QLK+ L
Sbjct: 786  FTGWNGGRIMCQRSQLKQLL 805


>ref|NP_177623.1| plastid transcriptionally active 2 [Arabidopsis thaliana]
            gi|75194055|sp|Q9S7Q2.1|PP124_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g74850, chloroplastic; AltName: Full=Protein PLASTID
            TRANSCRIPTIONALLY ACTIVE 2; Flags: Precursor
            gi|5882738|gb|AAD55291.1|AC008263_22 Contains 3 PF|01535
            DUF17 domains [Arabidopsis thaliana]
            gi|12323908|gb|AAG51934.1|AC013258_28 hypothetical
            protein; 81052-84129 [Arabidopsis thaliana]
            gi|332197518|gb|AEE35639.1| plastid transcriptionally
            active 2 [Arabidopsis thaliana]
          Length = 862

 Score =  789 bits (2038), Expect = 0.0
 Identities = 385/740 (52%), Positives = 530/740 (71%), Gaps = 2/740 (0%)
 Frame = +2

Query: 206  SEEEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRAL 385
            S E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++  G G+WQR+L
Sbjct: 66   SVEKGKYSYDVESLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAGRGDWQRSL 125

Query: 386  KLFKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINA 565
            +LFKYMQRQ WCKPNE+IYT++I +LGREGLLDKC E+F+EMPS GV  +VF++TALINA
Sbjct: 126  RLFKYMQRQIWCKPNEHIYTIMISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINA 185

Query: 566  YGRNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQP 745
            YGRN +YETSL LL RMK E I+P+++TYNTVI AC RGGLDWEGLLGLFA+MRH+GIQP
Sbjct: 186  YGRNGRYETSLELLDRMKNEKISPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQP 245

Query: 746  DIITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELY 925
            DI+TYNTLLSAC+ RGL +EAEMVFRTMN+ GIVP+  TY+ LV+TF  +  LE V +L 
Sbjct: 246  DIVTYNTLLSACAIRGLGDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLL 305

Query: 926  REMELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNN 1105
             EM   G+LPD+T+YN+LLEA++ SG IK+A GVF QM+ AGC PN +TY +L+N +G +
Sbjct: 306  GEMASGGSLPDITSYNVLLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQS 365

Query: 1106 AQYDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSY 1285
             +YD VR LFLEMK++N +PD  TYN LI +FG+GGYFKE V+ F DMVE+N++P+M++Y
Sbjct: 366  GRYDDVRQLFLEMKSSNTDPDAATYNILIEVFGEGGYFKEVVTLFHDMVEENIEPDMETY 425

Query: 1286 EGFMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKE 1465
            EG +FACG+GGL EDA KIL++M   +IVPS K + G++ A+G+A LYE+A    + M E
Sbjct: 426  EGIIFACGKGGLHEDARKILQYMTANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHE 485

Query: 1466 LGRDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKE 1645
            +G +P    F+S++  + R GL KE+  I   + ++GI    +TFN+ IEA+ +  +++E
Sbjct: 486  VGSNPSIETFHSLLYSFARGGLVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGKFEE 545

Query: 1646 ALNIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMS 1825
            A+  Y DM  S   P+++T E +L VY  +   ++    F EMK +  LPSI  YCMM++
Sbjct: 546  AVKTYVDMEKSRCDPDERTLEAVLSVYSFARLVDECREQFEEMKASDILPSIMCYCMMLA 605

Query: 1826 MYARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVD 2002
            +Y +   W + + LL+EML++ V +I+  +  +I GDY  +S+W    +   KL ++G  
Sbjct: 606  VYGKTERWDDVNELLEEMLSNRVSNIHQVIGQMIKGDYDDDSNWQIVEYVLDKLNSEGCG 665

Query: 2003 LETSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALA 2182
            L   FY++ LDALWWLGQ+E + RVL EA   G+FPE F +  +V  VD+HRMS G    
Sbjct: 666  LGIRFYNALLDALWWLGQKERAARVLNEATKRGLFPELFRKNKLVWSVDVHRMSEGGMYT 725

Query: 2183 ALSTWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFK 2359
            ALS W+ ++  ++  G+ + +L  +  +RG+LE    A+ +PI+K    FL+D + S F 
Sbjct: 726  ALSVWLNDINDMLLKGD-LPQLAVVVSVRGQLEKSSAARESPIAKAAFSFLQDHVSSSFS 784

Query: 2360 IADWNGGRIVCTNTQLKRWL 2419
               WNGGRI+C  +QLK+ L
Sbjct: 785  FTGWNGGRIMCQRSQLKQLL 804


>ref|XP_004301287.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 862

 Score =  788 bits (2036), Expect = 0.0
 Identities = 378/738 (51%), Positives = 523/738 (70%), Gaps = 2/738 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 65   EKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAARGDWQRSLRL 124

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKP+E+IYT++I +LGREGLLDKC+EIF+EMP+ GV  +VF++TALINAYG
Sbjct: 125  FKYMQRQIWCKPSEHIYTIMISLLGREGLLDKCAEIFDEMPTQGVIRSVFSYTALINAYG 184

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN Q+E SL LL RMK++ ++P ++TYNTV+ AC RGGLDWEGLLGLFA+MRH+G+QPD+
Sbjct: 185  RNGQFEMSLQLLDRMKKDKVSPNILTYNTVLNACARGGLDWEGLLGLFAEMRHEGVQPDL 244

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            +TYNTLLSAC+ RGL +EAEMVFRTMNE GIVP+  TY+ LV+TF  +  LE V+EL + 
Sbjct: 245  VTYNTLLSACAGRGLGDEAEMVFRTMNEGGIVPDITTYSYLVETFGKLNNLEKVSELLKG 304

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  GNLPD+T+YN+LLEA++  G IK+A GVFRQM+EAGC+ N +TY IL+N YG   +
Sbjct: 305  MESGGNLPDITSYNVLLEAYAQLGSIKEAMGVFRQMQEAGCMANAATYSILLNLYGRLGR 364

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VR+LFLEMK +N EPD  TYN LI +FG+GGYF+E V+ F DMVE+N++P M++YEG
Sbjct: 365  YDDVRELFLEMKVSNAEPDAATYNILIQVFGEGGYFREVVTLFHDMVEENIEPNMETYEG 424

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             ++ACG+GGL EDA+ IL HM  + IVPS K + G + AYG+A LY++A    + M E+G
Sbjct: 425  LIYACGKGGLHEDAKNILLHMNEKGIVPSSKAYTGAIEAYGQAALYDEALVAFNTMNEVG 484

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
              P    FNS+I  Y R GLYKE   +   M E GI     +FN +IEAF +  +++EA+
Sbjct: 485  SSPSVESFNSLIHAYARGGLYKETEQVLSIMGEFGIAINASSFNGMIEAFRQGGQFEEAI 544

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M      P++ T E +L VY  +    +   HF E+K +  LPS+  YCMM+++Y
Sbjct: 545  KTYVEMEKRRCDPDECTLEAVLSVYSVAGLVNECEEHFEEIKASGILPSVMCYCMMLAVY 604

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+ + W + + LL EML + V +I+  +  +I GDY  ES+W    + F KL+++G  L 
Sbjct: 605  AKTDRWDDANKLLNEMLTNRVSNIHQVMGQMIKGDYDDESNWQMVEYVFDKLKSEGCGLG 664

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ ++ALWWLGQ++ + RVL EA   G+FPE   +  +V  +D+HRM  G A AA+
Sbjct: 665  MRFYNTLIEALWWLGQKQRAVRVLSEATQRGLFPELLRKNKLVWSIDVHRMWEGGAYAAM 724

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFKIA 2365
            S W+ +M  +  NGE++  + ++ V+RG++E     ++ P++K    FL+D +   F   
Sbjct: 725  SVWLNDMYEMFLNGEDLPHVATVVVVRGKMEKSSTTQDLPVAKAAYSFLQDNMSGAFNFP 784

Query: 2366 DWNGGRIVCTNTQLKRWL 2419
             WN GRI+C  +QLK+ L
Sbjct: 785  KWNNGRILCQRSQLKKLL 802


>gb|EOY20555.1| Plastid transcriptionally active 2 isoform 1 [Theobroma cacao]
          Length = 859

 Score =  788 bits (2034), Expect = 0.0
 Identities = 378/738 (51%), Positives = 526/738 (71%), Gaps = 2/738 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ F+NKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 63   EKGKYSYDVETLINKLSSLPPRGSIARCLDVFRNKLSLNDFALVFKEFAHRGDWQRSLRL 122

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE+IYT++I +LGREGLL+KC E+F+EMPS GV  +VF +TALINAYG
Sbjct: 123  FKYMQRQIWCKPNEHIYTIMISLLGREGLLEKCREVFDEMPSQGVTRSVFAYTALINAYG 182

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN  Y  SL LL +MK++ + P+++TYNTVI AC RGGLDWEGLLGLFA+MRH+GIQPDI
Sbjct: 183  RNGAYNISLELLDKMKKDKVLPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDI 242

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            +TYNTLLSAC++RGL NEAEMVFRTMNE GI+P+  TY+ LV++F  +G+LE V+EL +E
Sbjct: 243  VTYNTLLSACANRGLGNEAEMVFRTMNEGGILPDLTTYSYLVESFGKLGKLEKVSELLKE 302

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  GNLPD+ +YN+LLEA++ SG IK+A GVF+QM+ AGC PN +TY IL+N YG N +
Sbjct: 303  MESGGNLPDIMSYNVLLEAYAKSGSIKEAMGVFKQMQVAGCAPNATTYSILLNLYGRNGR 362

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VR+LFLEMK +N EPD  TYN LI +FG+GGYFKE V+ F DMVE+N++P + +Y+G
Sbjct: 363  YDDVRELFLEMKESNTEPDAATYNILIQVFGEGGYFKEVVTLFHDMVEENIEPNVKTYDG 422

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             +FACG+GGL EDA+KIL HM  + IVPS + + G++ AYG+A LYE+     + M E+ 
Sbjct: 423  LIFACGKGGLHEDAKKILLHMNEKCIVPSSRAYTGVIEAYGQAALYEEVLVAFNTMNEVE 482

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
             +P    +NS++  + R GLYKEA  I   M E G+    ++FN+LIEAF +  ++++A+
Sbjct: 483  SNPTIETYNSLLQTFARGGLYKEANAILSRMNETGVAKNRDSFNALIEAFRQGGQFEDAI 542

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M  +   P+++T E +L VYC +   +++N  F E+K    LPS+  YCMM+++Y
Sbjct: 543  KAYVEMEKARCDPDERTLEAVLSVYCFAGLVDESNEQFQEIKALGVLPSVMCYCMMLAVY 602

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+ + W +   L  EML ++V +I+  +  +I GDY  +++W    + F KL ++G    
Sbjct: 603  AKCDRWDDAYQLFDEMLTNKVSNIHQVIGKMIRGDYDDDANWQMVEYVFDKLNSEGCGFG 662

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ L+ALWWL Q+E + RVL EA   G+FPE F +  +V  VD+HRM  G    A+
Sbjct: 663  IRFYNALLEALWWLRQKERAARVLNEATKRGLFPELFRKNKLVWSVDVHRMWEGGTYTAV 722

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKDL-ESPFKIA 2365
            S W+  MQ +  +G+++ +L ++ V RG++E    A++ P +K    FL+D+  S F   
Sbjct: 723  SIWLNSMQKMFLSGDDLPQLATVVVARGQMEKSSIARDIPTAKAAYTFLQDIVSSSFSFP 782

Query: 2366 DWNGGRIVCTNTQLKRWL 2419
             WN GRIVC  +QLKR L
Sbjct: 783  GWNKGRIVCQRSQLKRIL 800


>gb|EXB29767.1| hypothetical protein L484_008930 [Morus notabilis]
          Length = 905

 Score =  786 bits (2031), Expect = 0.0
 Identities = 390/765 (50%), Positives = 530/765 (69%), Gaps = 27/765 (3%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 85   EKGKYSYDVETLINKLSSLPPRGSIARCLDIFKNKLSLNDFALVFKEFAQRGDWQRSLRL 144

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE+IYT++I +LGREGLLDK +EIF+EMPS GV  +VF++TALINAYG
Sbjct: 145  FKYMQRQIWCKPNEHIYTIMISLLGREGLLDKSAEIFDEMPSQGVVRSVFSYTALINAYG 204

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN QYETSL LL RMK++ ++P ++TYNTVI AC RGGLDWEGLLGLFA+MRH+GIQPD+
Sbjct: 205  RNGQYETSLQLLDRMKKDKVSPNILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDL 264

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            +TYNTLL AC++RGL +EAEMVFRTMNE GIVP+  TY+ LV+TF  +G+LE V+EL +E
Sbjct: 265  VTYNTLLGACANRGLGDEAEMVFRTMNEGGIVPDITTYSCLVETFGKLGKLEKVSELLKE 324

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  GNLPD+T+YN+LLEA++ SG I +A GVFRQM+ AGCLPN +TY IL+N YG   +
Sbjct: 325  MESRGNLPDITSYNVLLEAYAESGSISEAVGVFRQMQTAGCLPNANTYSILLNLYGKQGR 384

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            Y+ VR+LFLEMK +N EPD  TYN LI +FG+GGYFKE V+ F DMVE+NV+P M++YEG
Sbjct: 385  YEDVRELFLEMKVSNTEPDAATYNILIQVFGEGGYFKEVVTLFHDMVEENVEPNMETYEG 444

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             + ACG+GGL  DA+ IL HM  + IVPS KV+ G++ AYG+A LYE+A    + M E+G
Sbjct: 445  LIIACGKGGLHGDAKIILNHMNEKGIVPSSKVYTGVIEAYGQAALYEEALVAFNTMNEVG 504

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
              P    +NS+I  + R GLYKEA  I   M  + +    + FNSLIEAF +  + +EA+
Sbjct: 505  SRPSVETYNSLIHAFSRGGLYKEAEAILQRMGNSAVARNVDLFNSLIEAFRQGGQIEEAV 564

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M  S   P+++T E +L VYC +   ++   HF E+K +  LPS+  YC M+++Y
Sbjct: 565  KAYIEMGKSRCDPDERTLEALLSVYCFAGLVDECEEHFKEIKASGILPSVMCYCTMLAVY 624

Query: 1832 ARLN-------------------------SWKEFDTLLQEMLASEVPDIYIAVLTLITGD 1936
            AR +                          W +   LL EML ++  +I+  +  +I GD
Sbjct: 625  ARCDRIDRTLPQTLFYPNPPVPLDRWHRVRWDDAFKLLDEMLKNKASNIHQVIAQMIKGD 684

Query: 1937 YTE-SDWHSAVHEFKKLRTKGVDLETSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPE 2113
            Y + ++W    + F KL ++G  L   FY++ L+ALWW+GQ+E + RVL EA   G+FPE
Sbjct: 685  YDDGTNWQMVEYVFDKLNSEGCGLGIRFYNTLLEALWWMGQKERAVRVLNEATKRGLFPE 744

Query: 2114 AFHRTDIVSLVDIHRMSIGTALAALSTWMLEMQTLVRNGENISKLCSIAVMRGELEHRKE 2293
             F R  +V  +D+HRM  G A  A+S W+ +M  + +NG+++  + ++ V+RG++E    
Sbjct: 745  LFRRNKLVWSIDVHRMWEGGACTAISVWLNDMFGMFKNGDDLPHVATVVVVRGKMERSPS 804

Query: 2294 AKNNPISKTVQRFLKD-LESPFKIADWNGGRIVCTNTQLKRWLQG 2425
            A+  PI+K    FL++ + S F    WN GRIVC  +QLK+ L G
Sbjct: 805  AQETPIAKASYSFLQENMFSSFGFPTWNKGRIVCQRSQLKQVLSG 849


>ref|XP_003525484.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 857

 Score =  784 bits (2024), Expect = 0.0
 Identities = 382/740 (51%), Positives = 525/740 (70%), Gaps = 2/740 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  +  L  RGSI RCL+ FKNKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 61   EKGKYSYDVETLINRLTALPPRGSIARCLDPFKNKLSLNDFALVFKEFAQRGDWQRSLRL 120

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE+I+T++I +LGREGLLDKC E+F+EMPS GV  TV+++TA+INAYG
Sbjct: 121  FKYMQRQIWCKPNEHIHTIMITLLGREGLLDKCREVFDEMPSNGVVRTVYSYTAIINAYG 180

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN Q+  SL LL  MK+E ++P+++TYNTVI AC RGGLDWEGLLGLFA+MRH+GIQPD+
Sbjct: 181  RNGQFHASLELLNGMKQERVSPSILTYNTVINACARGGLDWEGLLGLFAEMRHEGIQPDV 240

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            ITYNTLL AC+ RGL +EAEMVFRTMNE+GIVP+  TY+ LV TF  +  LE V+EL RE
Sbjct: 241  ITYNTLLGACAHRGLGDEAEMVFRTMNESGIVPDINTYSYLVQTFGKLNRLEKVSELLRE 300

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  GNLPD+T+YN+LLEA++  G IK+A GVFRQM+ AGC+ N +TY +L+N YG + +
Sbjct: 301  MECGGNLPDITSYNVLLEAYAELGSIKEAMGVFRQMQAAGCVANAATYSVLLNLYGKHGR 360

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VRDLFLEMK +N +PD  TYN LI +FG+GGYFKE V+ F DM E+NV+P M +YEG
Sbjct: 361  YDDVRDLFLEMKVSNTDPDAGTYNILIQVFGEGGYFKEVVTLFHDMAEENVEPNMQTYEG 420

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             +FACG+GGL EDA+KIL HM  + +VPS K + G++ A+G+A LYE+A    + M E+G
Sbjct: 421  LIFACGKGGLYEDAKKILLHMNEKGVVPSSKAYTGVIEAFGQAALYEEALVMFNTMNEVG 480

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
             +P    +NS+I  + R GLYKEA  I   M E+G++    +FN +IEAF +  +Y+EA+
Sbjct: 481  SNPTVETYNSLIHAFARGGLYKEAEAILSRMNESGLKRDVHSFNGVIEAFRQGGQYEEAV 540

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M  +   PN+ T E +L +YC++   ++    F E+K +  LPS+  YCMM+++Y
Sbjct: 541  KSYVEMEKANCEPNELTLEAVLSIYCSAGLVDEGEEQFQEIKASGILPSVMCYCMMLALY 600

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+ +   +   L+  M+   V DI+  +  +I GD+  ES+W    + F KL ++G  L 
Sbjct: 601  AKNDRLNDAYNLIDAMITMRVSDIHQVIGQMIKGDFDDESNWQIVEYVFDKLNSEGCGLG 660

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ L+ALW + QRE + RVL EA   G+FPE F ++ +V  VD+HRMS G AL AL
Sbjct: 661  MRFYNALLEALWCMFQRERAARVLNEASKRGLFPELFRKSKLVWSVDVHRMSEGGALTAL 720

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFKIA 2365
            S W+  +  +   G+++ ++ ++ V+RG +E   +A++ PI+K    FL+D + S F   
Sbjct: 721  SVWLNNVHEMSMTGDDLPEVATVVVVRGHMEKTTDAQDFPIAKAAISFLQDNVPSSFAFP 780

Query: 2366 DWNGGRIVCTNTQLKRWLQG 2425
             WN GRIVC  +QL+R L G
Sbjct: 781  GWNKGRIVCQQSQLRRILSG 800


>ref|XP_006344988.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Solanum tuberosum]
          Length = 860

 Score =  783 bits (2022), Expect = 0.0
 Identities = 389/787 (49%), Positives = 536/787 (68%), Gaps = 9/787 (1%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+L+DF+L+F++    G+WQR+L+L
Sbjct: 63   EKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLSDFSLVFKEFAARGDWQRSLRL 122

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE+IYTL+IG+LGREGLLDK  EIF+EM ++ V  TVF++TA+INAYG
Sbjct: 123  FKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAFEIFDEMSTHSVARTVFSYTAIINAYG 182

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN QYETSL LL +MK+E I P+++TYNTVI +C RGG +WEGLLGLFA+MRH+GIQPD+
Sbjct: 183  RNGQYETSLQLLEKMKQENIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGIQPDL 242

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            +TYNTLLSACSSR L +EAEMVFRTMNEAG++P+  TY+ LV+TF  +G+LE V+EL  E
Sbjct: 243  VTYNTLLSACSSRELEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELLME 302

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  G  P+VT+YN+LLEA++  G +K+A  VFRQM+ AGC+ N  TY IL+N YG N +
Sbjct: 303  MEAGGTSPEVTSYNVLLEAYAHLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKNGR 362

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VR+LFLEMK +N EPD DTYN LI +FG+GGYFKE V+ F DMVE+ V+P M++YEG
Sbjct: 363  YDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETYEG 422

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             ++ACG+GGL EDA++IL HM  + +VPS KV+  ++ AYG+A LYE+A    + M E+G
Sbjct: 423  LIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYTAVIEAYGQAALYEEAVVAFNTMNEVG 482

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
              P    FNS+I  + + GLYKE+  I++ M E G+    ++FN LIE + +  +++EA+
Sbjct: 483  SRPMVETFNSLIHTFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGLIEGYRQGGQFEEAI 542

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M  +   P+++T E +L VYC +   +++   F E+K     PSI   CMM+++Y
Sbjct: 543  KAYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLAIY 602

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+   W     LL +++ ++  D++  +  +I GD+  E++W    + F KL+++G  L 
Sbjct: 603  AKSERWDMARELLNDVMTNKTSDMHQIIGRMIHGDFDDENNWQMVEYVFDKLKSEGCGLS 662

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ ++ALWWLGQ+E + RVL EA   G+FPE F R  +V  VD+HRM  G A  A+
Sbjct: 663  MRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACTAI 722

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFKIA 2365
            S W+ +M+ L   GE + +L SI V+RG+ E     ++ P++K    FLKD + S F   
Sbjct: 723  SVWLNDMEELFHKGEELPQLASIVVVRGQTEKSSVTRDLPVAKAAYSFLKDTVSSSFSFP 782

Query: 2366 DWNGGRIVCTNTQLKRWLQGXXXXXXXXXXXXXIPL-----PLEDTTNTISD--EVDSTK 2524
             WN GRIVC  TQLKR                 IPL      L  T  ++SD    +S  
Sbjct: 783  GWNKGRIVCQRTQLKRTFSSAEPSAEASKGDRLIPLSNSPISLLGTQTSMSDAKRSESAN 842

Query: 2525 EDIEDQT 2545
             D E  T
Sbjct: 843  ADSERST 849


>ref|XP_006439718.1| hypothetical protein CICLE_v10018817mg [Citrus clementina]
            gi|557541980|gb|ESR52958.1| hypothetical protein
            CICLE_v10018817mg [Citrus clementina]
          Length = 871

 Score =  781 bits (2018), Expect = 0.0
 Identities = 387/740 (52%), Positives = 519/740 (70%), Gaps = 2/740 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 74   EKGKYSYDVETLINKLSSLPPRGSIARCLDMFKNKLSLNDFALVFKEFAQRGDWQRSLRL 133

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKP+E IYT++I +LGRE LLDK SE+FEEMPS GV  +VF++TALINAYG
Sbjct: 134  FKYMQRQIWCKPSEQIYTIMISLLGRENLLDKASEVFEEMPSQGVARSVFSYTALINAYG 193

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            R+ QYETSL LL RMKRE IAP ++TYNTVI ACVRGGLDWE LLGLFA+MRH+GIQPDI
Sbjct: 194  RHGQYETSLELLDRMKREKIAPNILTYNTVINACVRGGLDWEDLLGLFAEMRHEGIQPDI 253

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            +TYNTLLSAC  RGL +EAEMVFRTMNE G++P+  T++ LV+TF  +G+LE V+EL RE
Sbjct: 254  VTYNTLLSACGGRGLGDEAEMVFRTMNEGGVLPDLTTFSYLVETFGKLGKLEKVSELLRE 313

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  GNLPDVT YN+LLEA +  G IK+A  VFRQM+ AG + N +TY IL+N YG N +
Sbjct: 314  MESGGNLPDVTCYNVLLEAHAKMGSIKEAMDVFRQMQAAGSVANATTYSILLNLYGRNGR 373

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VR+LFLEMKA+N EP+  TYN LI +FG+GGYFKE V+ F DMVE+NV+P M++YEG
Sbjct: 374  YDDVRELFLEMKASNTEPNAATYNILIQVFGEGGYFKEVVTLFHDMVEENVEPNMETYEG 433

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             +FACG+GGL ED +KIL +M  R  VPS K + G++ AYG A LYE+A    + M E+ 
Sbjct: 434  LIFACGKGGLHEDVKKILLYMNERGTVPSSKAYTGVIEAYGLAALYEEALVAFNTMNEVE 493

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
              P    +NS++  + R GLYKE   I   M+E+G+   +++FN++IEAF +   ++EA+
Sbjct: 494  SKPTIETYNSLLHTFARGGLYKECQAILSRMSESGVARNSDSFNAVIEAFRQGGRFEEAI 553

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M      PN++T E +L VYC +   +++   F E+K +  LPS+  YCM++++Y
Sbjct: 554  KAYVEMEKVRCDPNERTLEAVLSVYCFAGLVDESKEQFQEIKSSGILPSVMCYCMLLAVY 613

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+ N W +   LL EM  + + +I+     +I G++  ES+W    + F KL  +G  L 
Sbjct: 614  AKSNRWDDAYGLLDEMHTNRISNIHQVTGQMIKGEFDDESNWQMVEYVFDKLNCEGYGLG 673

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ ++ALW LGQRE + RVL EA   G+FPE F    +V  VD+HRM  G A  A+
Sbjct: 674  MRFYNALMEALWCLGQRERAARVLDEATKRGLFPELFRHNKLVWSVDVHRMWEGGAYTAI 733

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFKIA 2365
            S W+ +M  +   GE++ +L ++ V+RG++E     ++ PI+K    FL++   S F   
Sbjct: 734  SVWLNKMYEMFMMGEDLPQLATVVVVRGQMERTSTTEDLPIAKAAYTFLQENASSLFSFP 793

Query: 2366 DWNGGRIVCTNTQLKRWLQG 2425
             WN GRI+C  TQLKR L G
Sbjct: 794  QWNKGRIICQRTQLKRILSG 813


>ref|XP_006476695.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Citrus sinensis]
          Length = 871

 Score =  781 bits (2016), Expect = 0.0
 Identities = 387/740 (52%), Positives = 518/740 (70%), Gaps = 2/740 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+LNDFAL+F++    G+WQR+L+L
Sbjct: 74   EKGKYSYDVETLINKLSSLPPRGSIARCLDMFKNKLSLNDFALVFKEFAQRGDWQRSLRL 133

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKP+E IYT++I +LGRE LLDK SE+FEEMPS GV  +VF++TALINAYG
Sbjct: 134  FKYMQRQIWCKPSEQIYTIMISLLGRENLLDKASEVFEEMPSQGVPRSVFSYTALINAYG 193

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            R+ QYETSL LL RMKRE IAP ++TYNTVI ACVRGGLDWE LLGLFA+MRH+GIQPDI
Sbjct: 194  RHGQYETSLELLDRMKREKIAPNILTYNTVINACVRGGLDWEDLLGLFAEMRHEGIQPDI 253

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            +TYNTLLSAC SRGL +EAEMVFRTMNE G++P+  T++ LV+TF  +G+LE V+EL RE
Sbjct: 254  VTYNTLLSACGSRGLGDEAEMVFRTMNEGGVLPDLTTFSYLVETFGKLGKLEKVSELLRE 313

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  GNLPDVT YN+LLEA +  G IK+A  VFRQM+ AG + N +TY IL+N YG N +
Sbjct: 314  MESGGNLPDVTCYNVLLEAHAKMGSIKEAMDVFRQMQAAGSVANATTYSILLNLYGRNGR 373

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VR+LFLEMKA+N EP+  TYN LI +FG+GGYFKE V+ F DMVE+NV+P M++YEG
Sbjct: 374  YDDVRELFLEMKASNTEPNAATYNILIQVFGEGGYFKEVVTLFHDMVEENVEPNMETYEG 433

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             +FACG+GGL ED +KIL +M  R  VPS K + G++ AYG A LYE+A    + M E+ 
Sbjct: 434  LIFACGKGGLHEDVKKILLYMNERGTVPSSKAYTGVIEAYGLAALYEEALVAFNTMNEVE 493

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
              P    +NS++  + R GLYKE   I   M+E+G+   +++FN++IEAF +   ++EA+
Sbjct: 494  SKPTIETYNSLLHTFSRGGLYKECQAILSRMSESGVARNSDSFNAVIEAFRQGGRFEEAI 553

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M      PN++T E +L VYC +   +++   F E+K +  LPS+  YCM++++Y
Sbjct: 554  KAYVEMEKVRCDPNERTLEAVLSVYCFAGLVDESKEQFQEIKSSGILPSVMCYCMLLAVY 613

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+ N W +   LL EM  + + +I+     +I G++  ES+W    + F KL  +G  L 
Sbjct: 614  AKSNRWDDAYGLLDEMYTNRISNIHQVTGQMIKGEFDDESNWQMVEYVFDKLNCEGYGLG 673

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ L+ALW LG RE + RVL EA   G+FPE F    +V  VD+HRM  G A  A+
Sbjct: 674  MRFYNALLEALWCLGLRERAARVLDEATKRGLFPELFRHNKLVWSVDVHRMWEGGAYTAI 733

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFKIA 2365
            S W+ +M  +   GE++ +L ++ V+RG +E     ++ P++K    FL++   S F   
Sbjct: 734  SVWLNKMYEMFMMGEDLPQLATVVVVRGRMERTSTTEDLPVAKAAYTFLQENASSLFNFP 793

Query: 2366 DWNGGRIVCTNTQLKRWLQG 2425
             WN GRI+C  TQLKR L G
Sbjct: 794  QWNKGRIICQRTQLKRILSG 813


>ref|XP_006843571.1| hypothetical protein AMTR_s00007p00097240 [Amborella trichopoda]
            gi|548845939|gb|ERN05246.1| hypothetical protein
            AMTR_s00007p00097240 [Amborella trichopoda]
          Length = 872

 Score =  781 bits (2016), Expect = 0.0
 Identities = 378/736 (51%), Positives = 522/736 (70%), Gaps = 2/736 (0%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ F+N+L+L DFAL+F++     +WQR+L+L
Sbjct: 87   EKGKYSYDVETLINKLSSLPPRGSIARCLDAFRNRLSLADFALVFKEFALRSDWQRSLRL 146

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE IY L++G+LGREGLLDKCSE+FEEMP+ GV  +  +FTALIN+YG
Sbjct: 147  FKYMQRQLWCKPNEPIYALMLGILGREGLLDKCSEVFEEMPTQGVPRSALSFTALINSYG 206

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN Q+E +LHLL RMKRE +AP ++TYNTV+ AC RGGL+WEGLLGLFAQMRHDG++PDI
Sbjct: 207  RNGQHEVTLHLLGRMKRERVAPTVLTYNTVLAACARGGLEWEGLLGLFAQMRHDGVRPDI 266

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
             TYNTLLSAC+SRGL ++AE VFR MNEAG++P+ AT+  LV  F+ +  LE V+EL  E
Sbjct: 267  ATYNTLLSACASRGLSDQAETVFRAMNEAGVLPDVATHKHLVSAFEKVEHLEKVSELLAE 326

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME +GN PDV +YN+L+EA + SG +K+A  V RQM+ AGC P+ STY +L++ YG + +
Sbjct: 327  MESSGNPPDVPSYNVLVEAHARSGSVKEAVAVLRQMQRAGCAPDASTYGLLLDLYGRHGR 386

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            Y+ VR LFLEMKA   E D  TYN LIG+FG+GGYF+E V+ FDDM+E+ V+P+M++YEG
Sbjct: 387  YEEVRGLFLEMKAGGTEADAATYNVLIGVFGEGGYFREVVTLFDDMIEEKVKPDMETYEG 446

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             ++ACG+GGL  DA +IL HMQ   IVPS K + G++ A+G+A LYE+A    + M+E+G
Sbjct: 447  LIYACGKGGLHGDARRILLHMQGNGIVPSAKAYTGVIEAFGQAALYEEAIVAFNTMQEIG 506

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
              P  + +NS+I+M+ R GLYKEA  +   M EA ++   E+FN+LIEAF +  +Y+EAL
Sbjct: 507  SVPTIDTYNSLINMFSRGGLYKEAQVVCSRMNEADVQRNDESFNALIEAFSQGGQYEEAL 566

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y DM      PN++T E IL+VYC++   E++   F E+K++  +P++ SYC+++S++
Sbjct: 567  KTYVDMQKVRCSPNQRTLEAILYVYCSAGLVEESRETFLEIKESGAMPTVDSYCLLLSVF 626

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            AR N   +   LL EM  +   + +  + T+I G+Y  +S+W    + F K  + G    
Sbjct: 627  ARSNRLDDAHELLDEMRTNRASNAHQVIGTMIKGEYDDDSNWQMVEYVFDKFVSDGCGSG 686

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ LDALWWLGQ+  + RVL EA    +FPE F  + +V   D+HRMS+G AL AL
Sbjct: 687  LRFYNALLDALWWLGQKARAARVLSEATKRALFPELFRHSKLVWSADVHRMSVGGALTAL 746

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKDLESP-FKIA 2365
            S W+ +M   + NG+++ +L SI V+RG +E  KEA   P+++ V  F+K+   P F I 
Sbjct: 747  SIWLNDMHDKLANGDDLPQLASIVVVRGVVEKSKEAGGFPVARAVYSFVKEQVPPSFSIG 806

Query: 2366 DWNGGRIVCTNTQLKR 2413
             WN GRIVC  +QLKR
Sbjct: 807  GWNKGRIVCHRSQLKR 822


>ref|XP_004236160.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74850,
            chloroplastic-like [Solanum lycopersicum]
          Length = 860

 Score =  781 bits (2016), Expect = 0.0
 Identities = 384/787 (48%), Positives = 534/787 (67%), Gaps = 9/787 (1%)
 Frame = +2

Query: 212  EEGKYSYYVERAIANIKNLSSRGSITRCLENFKNKLNLNDFALIFRDLGGCGEWQRALKL 391
            E+GKYSY VE  I  + +L  RGSI RCL+ FKNKL+L DF+L+F++    G+WQR+L+L
Sbjct: 63   EKGKYSYDVETLINKLSSLPPRGSIARCLDTFKNKLSLTDFSLVFKEFAARGDWQRSLRL 122

Query: 392  FKYMQRQQWCKPNENIYTLIIGVLGREGLLDKCSEIFEEMPSYGVHWTVFTFTALINAYG 571
            FKYMQRQ WCKPNE+IYTL+IG+LGREGLLDK  EIF+EM ++ V  TVF++TA+IN+YG
Sbjct: 123  FKYMQRQIWCKPNEHIYTLMIGILGREGLLDKAFEIFDEMSTHNVARTVFSYTAIINSYG 182

Query: 572  RNSQYETSLHLLARMKREGIAPALVTYNTVITACVRGGLDWEGLLGLFAQMRHDGIQPDI 751
            RN QYETSL LL +MK+E I P+++TYNTVI +C RGG +WEGLLGLFA+MRH+GIQPD+
Sbjct: 183  RNGQYETSLQLLEKMKQENIVPSILTYNTVINSCARGGYEWEGLLGLFAEMRHEGIQPDL 242

Query: 752  ITYNTLLSACSSRGLLNEAEMVFRTMNEAGIVPNKATYTLLVDTFKNIGELEMVTELYRE 931
            +TYNTLLSACSSR L +EAEMVFRTMNEAG++P+  TY+ LV+TF  +G+LE V+EL  E
Sbjct: 243  VTYNTLLSACSSRELEDEAEMVFRTMNEAGVLPDVTTYSYLVETFGKLGKLEKVSELLME 302

Query: 932  MELAGNLPDVTAYNLLLEAFSTSGKIKQAEGVFRQMKEAGCLPNVSTYVILINSYGNNAQ 1111
            ME  G  P+VT+YN+LLEA++  G +K+A  VFRQM+ AGC+ N  TY IL+N YG N +
Sbjct: 303  MEAGGTSPEVTSYNVLLEAYAHLGSMKEAMDVFRQMQAAGCVANAETYSILLNLYGKNGR 362

Query: 1112 YDVVRDLFLEMKATNVEPDVDTYNNLIGIFGKGGYFKEAVSFFDDMVEKNVQPEMDSYEG 1291
            YD VR+LFLEMK +N EPD DTYN LI +FG+GGYFKE V+ F DMVE+ V+P M++YEG
Sbjct: 363  YDQVRELFLEMKTSNTEPDADTYNILIQVFGEGGYFKEVVTLFHDMVEEKVEPNMETYEG 422

Query: 1292 FMFACGEGGLIEDAEKILKHMQRREIVPSIKVFNGLLLAYGKAVLYEDAHSTLSHMKELG 1471
             ++ACG+GGL EDA++IL HM  + +VPS KV+  ++ AYG+A LYE+A    + M E+G
Sbjct: 423  LIYACGKGGLHEDAKRILLHMNGQGLVPSSKVYTAVIEAYGQAALYEEAVVAFNTMNEVG 482

Query: 1472 RDPDSNCFNSIISMYGRAGLYKEALHIYWHMTEAGIESTTETFNSLIEAFGRACEYKEAL 1651
              P    FNS+I  + + GLYKE+  I++ M E G+    ++FN +IE + +  +++EA+
Sbjct: 483  SRPVVETFNSLIHTFAKGGLYKESEAIWFRMGEVGVPRNRDSFNGMIEGYRQGGQFEEAI 542

Query: 1652 NIYKDMVMSEIPPNKKTYETILHVYCASNSCEKANLHFFEMKQNFGLPSITSYCMMMSMY 1831
              Y +M  +   P+++T E +L VYC +   +++   F E+K     PSI   CMM+++Y
Sbjct: 543  KAYVEMEKARCDPDERTLEAVLSVYCFAGLVDESEEQFQEIKSLGIQPSIICCCMMLAIY 602

Query: 1832 ARLNSWKEFDTLLQEMLASEVPDIYIAVLTLITGDY-TESDWHSAVHEFKKLRTKGVDLE 2008
            A+   W     LL +++ ++  D++  +  +I GD+  E++W    + F KL+++G  L 
Sbjct: 603  AKSERWDMARELLNDVMTNKTSDMHQIIGRMIHGDFDDENNWQMVEYVFDKLKSEGCGLS 662

Query: 2009 TSFYDSFLDALWWLGQRETSTRVLQEARSLGIFPEAFHRTDIVSLVDIHRMSIGTALAAL 2188
              FY++ ++ALWWLGQ+E + RVL EA   G+FPE F R  +V  VD+HRM  G A  A+
Sbjct: 663  MRFYNTLIEALWWLGQKERAARVLNEATKRGLFPELFRRNKLVWSVDVHRMWPGGACTAI 722

Query: 2189 STWMLEMQTLVRNGENISKLCSIAVMRGELEHRKEAKNNPISKTVQRFLKD-LESPFKIA 2365
            S W+ +M+ L   GE + +L SI V+RG+ E     ++ P++K    FLKD + S F   
Sbjct: 723  SIWLNDMEELFHKGEELPQLASIVVVRGQTEKSSVTRDLPVAKAAYSFLKDTISSSFSFP 782

Query: 2366 DWNGGRIVCTNTQLKRWLQGXXXXXXXXXXXXXIPLPLE-------DTTNTISDEVDSTK 2524
             WN GRIVC  TQLKR                 IPL           T+ +++   +S  
Sbjct: 783  GWNKGRIVCQKTQLKRTFSSAEPSVEASKGDRLIPLSNSLISLLGTQTSMSVAKRSESVN 842

Query: 2525 EDIEDQT 2545
             D E  T
Sbjct: 843  ADSERST 849

