BLASTX nr result

ID: Sinomenium22_contig00023850 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00023850
         (2129 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containi...   756   0.0  
ref|XP_006826767.1| hypothetical protein AMTR_s00136p00085920 [A...   716   0.0  
ref|XP_002529286.1| pentatricopeptide repeat-containing protein,...   715   0.0  
ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containi...   712   0.0  
ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Popu...   709   0.0  
ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citr...   709   0.0  
ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containi...   709   0.0  
ref|XP_007225150.1| hypothetical protein PRUPE_ppa002505mg [Prun...   707   0.0  
ref|XP_007042227.1| Pentatricopeptide repeat superfamily protein...   701   0.0  
ref|XP_007042228.1| Pentatricopeptide repeat (PPR) superfamily p...   694   0.0  
ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containi...   687   0.0  
ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containi...   682   0.0  
ref|XP_006343484.1| PREDICTED: pentatricopeptide repeat-containi...   682   0.0  
ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containi...   682   0.0  
ref|XP_007148512.1| hypothetical protein PHAVU_006G214900g [Phas...   682   0.0  
ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containi...   682   0.0  
ref|XP_002889841.1| hypothetical protein ARALYDRAFT_888388 [Arab...   680   0.0  
ref|NP_172560.2| pentatricopeptide repeat-containing protein [Ar...   672   0.0  
ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutr...   672   0.0  
gb|EYU26539.1| hypothetical protein MIMGU_mgv1a002527mg [Mimulus...   671   0.0  
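
The one-line hit summaries above can be split into fields mechanically. The following is a minimal sketch, not part of the BLAST output; the function name parse_summary_line is hypothetical, and it assumes each row ends with the bit score and E-value as its last two whitespace-separated tokens, as in this report.

```python
def parse_summary_line(line):
    """Split one hit-summary row into (accession, description, bits, evalue)."""
    # The last two whitespace-separated tokens are the bit score and E-value.
    head, bits, evalue = line.rstrip().rsplit(None, 2)
    # The accession is everything up to the first space; the rest is the
    # (possibly truncated) description.
    accession, _, description = head.partition(" ")
    # Legacy BLAST may print very small E-values as "e-100" with no leading
    # digit; prefix "1" so float() accepts them.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return accession, description.strip(), float(bits), float(evalue)


rows = [
    "ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containi...   756   0.0  ",
    "gb|EYU26539.1| hypothetical protein MIMGU_mgv1a002527mg [Mimulus...   671   0.0  ",
]
hits = [parse_summary_line(r) for r in rows]
for accession, description, bits, evalue in hits:
    print(accession, bits, evalue)
```

Descriptions in the summary table are truncated with "..."; the full titles appear in the per-hit alignment sections that follow.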

>ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic [Vitis vinifera]
            gi|298204537|emb|CBI23812.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  756 bits (1951), Expect = 0.0
 Identities = 384/531 (72%), Positives = 438/531 (82%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YS++IKFMGK  NP KALE+Y+SIQD+S+RN+VS+CNS++ CL+RNGK E+S+KLF QMK
Sbjct: 138  YSTYIKFMGKSLNPIKALEIYNSIQDESVRNNVSVCNSVLSCLIRNGKFENSLKLFHQMK 197

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL PDAVTYST+             AGC+K K GY KAL+ VQE++ + L MDSVIYG
Sbjct: 198  QDGLRPDAVTYSTLL------------AGCMKVKHGYSKALELVQEMERSRLPMDSVIYG 245

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            TLLA+CASNNRC+EAE +F QMKDEG  PNVFHYSSLLNAYS DG+Y KAD LV+DMKSA
Sbjct: 246  TLLAVCASNNRCKEAENYFNQMKDEGHLPNVFHYSSLLNAYSADGDYKKADMLVQDMKSA 305

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKVILTTLLKVYVRG LFEKSRELL ELE LGYA DEMPYCLLMDGLAK+ RILEA
Sbjct: 306  GLVPNKVILTTLLKVYVRGGLFEKSRELLAELEDLGYAEDEMPYCLLMDGLAKSRRILEA 365

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            K+IF+EM+ K VKSDGY +SIMISAFCRSGLL+EAKQLARD+E  YDKYDLVMLNTML A
Sbjct: 366  KSIFEEMKKKQVKSDGYCYSIMISAFCRSGLLKEAKQLARDFEATYDKYDLVMLNTMLCA 425

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAGEMESVMQM+ KMDE  ISPD NTFHILIKYFCKEKLY LAYRTM DMH+KGHQ +
Sbjct: 426  YCRAGEMESVMQMMRKMDELAISPDWNTFHILIKYFCKEKLYLLAYRTMEDMHNKGHQPE 485

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI  LGK  A S+AFSVYNMLR+SKRTMCKALHEK+L ILVAG LLKDAYVVVK
Sbjct: 486  EELCSSLISHLGKIRAHSQAFSVYNMLRYSKRTMCKALHEKILHILVAGRLLKDAYVVVK 545

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DN   IS+  +KKFA +FMK GN+NLINDVM A+H SG KIDQE+F +A++RY+ +P   
Sbjct: 546  DNEGLISKPSIKKFATAFMKFGNVNLINDVMKAIHGSGYKIDQELFQMAVTRYIAEPEKK 605

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLL 1595
                    WM GQGYVV+ S+RN++LKNSHLFGR LIAE LSKQ   +K L
Sbjct: 606  ELLLHLLQWMPGQGYVVDSSTRNMILKNSHLFGRQLIAEMLSKQHARAKAL 656
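
The header values of the Vitis vinifera alignment above can be cross-checked from the coordinates alone. This is a small sketch under stated assumptions: the helper names are hypothetical, the alignment is gap-free (no Gaps field is reported for it), only forward-strand frames are handled, and percentages are truncated to whole numbers, which matches the figures printed in this report.

```python
def aligned_columns(q_start, q_end):
    """Amino-acid columns covered by a gap-free BLASTX query range (nt coords)."""
    return (abs(q_end - q_start) + 1) // 3


def forward_frame(q_start):
    """Reading frame (1, 2 or 3) of a forward-strand hit starting at q_start."""
    return (q_start - 1) % 3 + 1


def percent(count, total):
    """Whole-number percentage, truncated rather than rounded."""
    return 100 * count // total


# Query nucleotides 3..1595 from the alignment above.
columns = aligned_columns(3, 1595)
print(columns, forward_frame(3), percent(384, columns), percent(438, columns))
# prints: 531 3 72 82
```

The result agrees with "Identities = 384/531 (72%), Positives = 438/531 (82%)" and "Frame = +3" in the hit header.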


>ref|XP_006826767.1| hypothetical protein AMTR_s00136p00085920 [Amborella trichopoda]
            gi|548831187|gb|ERM94004.1| hypothetical protein
            AMTR_s00136p00085920 [Amborella trichopoda]
          Length = 690

 Score =  716 bits (1849), Expect = 0.0
 Identities = 363/534 (67%), Positives = 434/534 (81%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSSFIK+MG+  N  KAL+VY SI+D+     V++CNSI+GCL RNGK ESSIKLF+QMK
Sbjct: 165  YSSFIKYMGRSGNTVKALQVYQSIKDEPTLYDVTVCNSILGCLARNGKFESSIKLFEQMK 224

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            KGGL PD VTYS++             AGC K+K+GY +AL+ ++ELK +GL MDSVIYG
Sbjct: 225  KGGLTPDTVTYSSLL------------AGCNKNKNGYSQALQLIKELKISGLCMDSVIYG 272

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            +LLAICASNN+CEEAETFF QM+ EG SPN+FHYSSLLNAY+ +GN+ KAD LV+D+KSA
Sbjct: 273  SLLAICASNNQCEEAETFFQQMRAEGFSPNIFHYSSLLNAYAVEGNHKKADKLVEDIKSA 332

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKVILTTLLKVYVRG  F+KSRELL EL+TLG+A DEMPYCLLMDGLAKAG I EA
Sbjct: 333  GLVPNKVILTTLLKVYVRGCFFDKSRELLAELDTLGFARDEMPYCLLMDGLAKAGHIDEA 392

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            KA+F++M+ K VKSDGYSHSI+ISA+CR GLLEEAK LA+D+E+   KYDLVMLNT+LRA
Sbjct: 393  KAVFEDMKQKNVKSDGYSHSIIISAYCREGLLEEAKLLAKDFESTSGKYDLVMLNTLLRA 452

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YC+ GEM+ VMQ + KMDE  ISPD +TF ILIKYF KEKLY LAYRT+ DMH++G Q+D
Sbjct: 453  YCKGGEMQYVMQTMKKMDELAISPDLHTFSILIKYFSKEKLYNLAYRTVEDMHARGLQID 512

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL + LI++LGK GA SEA+SVYN LR++KRT+CKALHEKVLKILVAG LLKDAYV+VK
Sbjct: 513  EELCTSLILELGKAGAASEAYSVYNKLRYTKRTLCKALHEKVLKILVAGRLLKDAYVLVK 572

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DN+E IS++ L KF  SFMK GNINLINDV+ A+H++G  I+Q VF +A+SRYVG+P   
Sbjct: 573  DNSELISKSALDKFVTSFMKFGNINLINDVLRALHNNGYLINQGVFSLAVSRYVGEPEKK 632

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLRTQ 1604
                    WM+GQGYVV+  SRNLLLKN  LFG+ LIAE LSKQ  MSK+ RTQ
Sbjct: 633  ELLLHMLEWMSGQGYVVDSESRNLLLKNCDLFGKQLIAEGLSKQHAMSKIRRTQ 686



 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 65/349 (18%), Positives = 137/349 (39%), Gaps = 4/349 (1%)
 Frame = +3

Query: 387  NNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSAGLVPNKVI 566
            +N+  E    F  M+  G   N+  YSS +      GN  KA  + + +K    + +  +
Sbjct: 141  SNKWREISQLFNWMQKLG-KVNISSYSSFIKYMGRSGNTVKALQVYQSIKDEPTLYDVTV 199

Query: 567  LTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAK-AGRILEAKAIFDEM 743
              ++L    R   FE S +L  +++  G   D + Y  L+ G  K      +A  +  E+
Sbjct: 200  CNSILGCLARNGKFESSIKLFEQMKKGGLTPDTVTYSSLLAGCNKNKNGYSQALQLIKEL 259

Query: 744  EMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRAYCRAGEM 923
            ++ G+  D   +  +++    +   EEA+   +         ++   +++L AY   G  
Sbjct: 260  KISGLCMDSVIYGSLLAICASNNQCEEAETFFQQMRAEGFSPNIFHYSSLLNAYAVEGNH 319

Query: 924  ESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLDEELSSFL 1103
            +   +++  +    + P+      L+K + +   +  +   + ++ + G   DE     L
Sbjct: 320  KKADKLVEDIKSAGLVPNKVILTTLLKVYVRGCFFDKSRELLAELDTLGFARDEMPYCLL 379

Query: 1104 IVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVKD---NAE 1274
            +  L K G   EA +V+  ++          H  ++      GLL++A ++ KD    + 
Sbjct: 380  MDGLAKAGHIDEAKAVFEDMKQKNVKSDGYSHSIIISAYCREGLLEEAKLLAKDFESTSG 439

Query: 1275 RISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRY 1421
            +     L     ++ K G +  +   M  +       D   F I I  +
Sbjct: 440  KYDLVMLNTLLRAYCKGGEMQYVMQTMKKMDELAISPDLHTFSILIKYF 488
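
The E-value of the weaker Amborella trichopoda segment above (68.2 bits, Expect = 1e-08) can be roughly sanity-checked with the standard relation E = m * n * 2^(-S'), where S' is the bit score. This is only an order-of-magnitude sketch: it assumes m is the translated query length (2129 / 3) and n is the raw database size from the report header, whereas BLAST uses length-adjusted effective values, so the estimate is expected to come out somewhat higher than the reported figure. The function name rough_evalue is hypothetical.

```python
QUERY_NT = 2129               # "(2129 letters)" from the report header
DB_LETTERS = 13_856_398_315   # "total letters" from the report header


def rough_evalue(bit_score, query_nt=QUERY_NT, db_letters=DB_LETTERS):
    """Approximate E-value from a bit score, ignoring length adjustment."""
    m = query_nt // 3          # assumed translated query length for BLASTX
    return m * db_letters * 2.0 ** (-bit_score)


estimate = rough_evalue(68.2)
print(f"{estimate:.1e}")
```

The estimate lands within a small factor of the reported 1e-08; the remaining difference is consistent with the effective-length correction that this sketch omits.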


>ref|XP_002529286.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223531275|gb|EEF33118.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 672

 Score =  715 bits (1845), Expect = 0.0
 Identities = 359/535 (67%), Positives = 432/535 (80%), Gaps = 2/535 (0%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            Y+S++KFMGK  NP KALE+Y+SI D+S++N+V ICNS++ CLVR+GK + S+KLF +MK
Sbjct: 149  YTSYMKFMGKSLNPAKALEIYNSIADESVKNNVFICNSVLSCLVRSGKFDISLKLFHKMK 208

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL PD +TYST+             +GCIK KDGY K L FVQELK NGL+MD+VIYG
Sbjct: 209  QNGLTPDTITYSTLL------------SGCIKAKDGYSKTLDFVQELKYNGLQMDTVIYG 256

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            T+LA+CAS+NRCEEAE++F QMK+EG  PNVFHYSSLLNAY+  GNY KA+ LV+DMKS 
Sbjct: 257  TILAVCASHNRCEEAESYFSQMKNEGHLPNVFHYSSLLNAYASSGNYKKAEELVQDMKSL 316

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKVI TTLLKVYVRG LFEKS++LL+ELETLGYA DEMPYCLLMDGL+KAGR+ EA
Sbjct: 317  GLVPNKVIWTTLLKVYVRGGLFEKSQQLLLELETLGYAEDEMPYCLLMDGLSKAGRVDEA 376

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            ++ FDEM+ K VKSDGY++SIMISA+CR  LLEEAKQLA+++E +YDKYD+V+LNTML A
Sbjct: 377  RSFFDEMKEKNVKSDGYAYSIMISAYCRGRLLEEAKQLAKEFEAKYDKYDVVILNTMLCA 436

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAG+MESVMQ + KMDE  ISP   TFHILIKYFCK+KLY LAY+TM DMH KGHQ +
Sbjct: 437  YCRAGDMESVMQTMRKMDELAISPSYCTFHILIKYFCKQKLYLLAYQTMEDMHRKGHQPE 496

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI  LGK  A +EAFSVY ML++ KRTMCKALHEK+L +L+ G LLKDAYVVVK
Sbjct: 497  EELCSMLIFHLGKAKAYTEAFSVYTMLKYGKRTMCKALHEKILHVLLGGQLLKDAYVVVK 556

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQ--EVFDIAISRYVGKPX 1436
            DNAE IS+  +KKFA +FMK GNINLINDVM  +HSSG KIDQ  E+F +AISRY+ +P 
Sbjct: 557  DNAELISQAAIKKFANAFMKLGNINLINDVMKVIHSSGYKIDQASELFQMAISRYIAQPE 616

Query: 1437 XXXXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLRT 1601
                      WM G GYVV+ S+RNL+LK+SHLFGR LIAE LSKQ  +SK L++
Sbjct: 617  KKDLLVQLLQWMPGHGYVVDASTRNLILKSSHLFGRQLIAEILSKQHIISKTLKS 671



 Score = 62.4 bits (150), Expect = 8e-07
 Identities = 53/273 (19%), Positives = 116/273 (42%), Gaps = 1/273 (0%)
 Frame = +3

Query: 450  NVFHYSSLLNAYSCDGNYAKADALVKDMKSAGLVPNKVILTTLLKVYVRGDLFEKSRELL 629
            +V  Y+S +       N AKA  +   +    +  N  I  ++L   VR   F+ S +L 
Sbjct: 145  SVSSYTSYMKFMGKSLNPAKALEIYNSIADESVKNNVFICNSVLSCLVRSGKFDISLKLF 204

Query: 630  VELETLGYAVDEMPYCLLMDGLAKAGRILEAKAIF-DEMEMKGVKSDGYSHSIMISAFCR 806
             +++  G   D + Y  L+ G  KA         F  E++  G++ D   +  +++    
Sbjct: 205  HKMKQNGLTPDTITYSTLLSGCIKAKDGYSKTLDFVQELKYNGLQMDTVIYGTILAVCAS 264

Query: 807  SGLLEEAKQLARDYETRYDKYDLVMLNTMLRAYCRAGEMESVMQMLGKMDEFKISPDCNT 986
                EEA+      +      ++   +++L AY  +G  +   +++  M    + P+   
Sbjct: 265  HNRCEEAESYFSQMKNEGHLPNVFHYSSLLNAYASSGNYKKAEELVQDMKSLGLVPNKVI 324

Query: 987  FHILIKYFCKEKLYQLAYRTMVDMHSKGHQLDEELSSFLIVQLGKTGAPSEAFSVYNMLR 1166
            +  L+K + +  L++ + + ++++ + G+  DE     L+  L K G   EA S ++ ++
Sbjct: 325  WTTLLKVYVRGGLFEKSQQLLLELETLGYAEDEMPYCLLMDGLSKAGRVDEARSFFDEMK 384

Query: 1167 FSKRTMCKALHEKVLKILVAGGLLKDAYVVVKD 1265
                      +  ++     G LL++A  + K+
Sbjct: 385  EKNVKSDGYAYSIMISAYCRGRLLEEAKQLAKE 417


>ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 642

 Score =  712 bits (1838), Expect = 0.0
 Identities = 364/534 (68%), Positives = 424/534 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS+IKFMGK  NP KALE+Y+SIQD+S + +V ICNS++G LVR+GK + SIKLF QMK
Sbjct: 121  YSSYIKFMGKSLNPVKALEIYNSIQDESTKKNVHICNSVLGSLVRSGKFDGSIKLFHQMK 180

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL PDAVTYST+             AGCIK K GY KAL+ VQEL++N L+MDSVIYG
Sbjct: 181  QDGLTPDAVTYSTLL------------AGCIKFKHGYSKALELVQELQNNELQMDSVIYG 228

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            TLLAICASNN+ EEAE++F QMKDEG  PN FHYSSLLNAYS  GNY KAD +V+DMKSA
Sbjct: 229  TLLAICASNNKWEEAESYFKQMKDEGHLPNEFHYSSLLNAYSISGNYKKADDVVQDMKSA 288

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKV LTTLLK YVRG LFEKSRELL ELE LGYA DEMPYC+LMD  AKAGRI +A
Sbjct: 289  GLVPNKVTLTTLLKAYVRGGLFEKSRELLTELEALGYAEDEMPYCILMDAFAKAGRIEDA 348

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            K +FDE++ K V+SDGYS+SIMISAFCR GL+++AKQLA+D+E  YDKYDLVMLNTM+ A
Sbjct: 349  KLVFDEIKEKSVRSDGYSYSIMISAFCRGGLVDDAKQLAKDFERTYDKYDLVMLNTMICA 408

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAGEM+SVM+ML KMDE KI+PD NTFHILIKYFCKEKLY LAY+TM DMH+KG+  D
Sbjct: 409  YCRAGEMDSVMEMLRKMDELKITPDNNTFHILIKYFCKEKLYMLAYKTMEDMHNKGYPPD 468

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S L+  LGK  A SEA+S+YN+LR+SKRTMCKALHEK+L ILVAG LLKDAYVVVK
Sbjct: 469  EELCSSLMFHLGKIRAYSEAYSIYNILRYSKRTMCKALHEKILHILVAGRLLKDAYVVVK 528

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DN   IS+    KFA +FMK GNINLINDV+ A+  SG KIDQ +F +AISRY+  P   
Sbjct: 529  DNPRLISKAATMKFATAFMKLGNINLINDVLKAIDGSGCKIDQGIFQMAISRYISDPDKK 588

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLRTQ 1604
                    WM GQGY V+ S+RNL+LKNSHLF R  IAE LSKQ  +SK  +++
Sbjct: 589  DLLLQLLQWMPGQGYTVDSSTRNLILKNSHLFDRQHIAEMLSKQHMISKASKSK 642


>ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Populus trichocarpa]
            gi|550347847|gb|EEE84472.2| hypothetical protein
            POPTR_0001s21880g [Populus trichocarpa]
          Length = 673

 Score =  709 bits (1831), Expect = 0.0
 Identities = 360/534 (67%), Positives = 425/534 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS+IKFMG   NP KALE+YHSI D+S + +V ICNS++ CLVRN K +SS+K F +MK
Sbjct: 152  YSSYIKFMGTSLNPAKALEIYHSIPDESKKTNVFICNSLLRCLVRNTKFDSSMKFFHKMK 211

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
              GL PDA+TYST+             AGC+K KDGY KAL  VQEL  NGL+MDS++YG
Sbjct: 212  NNGLTPDAITYSTLL------------AGCMKIKDGYSKALDLVQELNYNGLQMDSIMYG 259

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            TLLA+CASNNRCEEA+++F QMKDEG SPN+FHYSSLLNAYS DGNY KA+ LV+DMKS+
Sbjct: 260  TLLAVCASNNRCEEAQSYFNQMKDEGHSPNIFHYSSLLNAYSSDGNYKKAEELVQDMKSS 319

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKVILTTLLKVYVRG LFEKSR+LLVEL+TLG+A +EMPYCLLMDGLAK G + EA
Sbjct: 320  GLVPNKVILTTLLKVYVRGGLFEKSRDLLVELDTLGFAKNEMPYCLLMDGLAKNGLLDEA 379

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            +++F+EM+ K VKS GYS+SIMIS+FCR GL EEAK+LA ++E +YDKYD+V+LNT+L A
Sbjct: 380  RSVFNEMKEKRVKSGGYSYSIMISSFCRGGLFEEAKELAEEFEAKYDKYDVVILNTILCA 439

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCR GE ESVM+ + KMDE  ISPD NTFHILIKYFCKEKLY LAY+TM DMH KGHQ  
Sbjct: 440  YCRTGEKESVMRTMRKMDELAISPDYNTFHILIKYFCKEKLYMLAYQTMEDMHRKGHQPM 499

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI+ LGK  A +EAFSVY+ML+ SKRTM KA HE +L IL+AG LLKDAYVVVK
Sbjct: 500  EELCSSLILHLGKIKAHAEAFSVYSMLKSSKRTMSKAFHEDILHILIAGRLLKDAYVVVK 559

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNAE IS   +KKFA SF+K G+INLINDVM  +H SG KIDQE+F +A+SRY+ +P   
Sbjct: 560  DNAELISPAAIKKFASSFVKLGDINLINDVMKVIHGSGYKIDQELFLMAVSRYIAEPEKK 619

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLRTQ 1604
                    WM GQGYVV+ S+RNL+LKNSHLFGR LIAE LSKQ   SK L+ Q
Sbjct: 620  DLLIQLLQWMPGQGYVVDSSTRNLILKNSHLFGRQLIAEILSKQHMTSKALKAQ 673


>ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citrus clementina]
            gi|557534005|gb|ESR45123.1| hypothetical protein
            CICLE_v10000525mg [Citrus clementina]
          Length = 660

 Score =  709 bits (1830), Expect = 0.0
 Identities = 357/533 (66%), Positives = 428/533 (80%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS+IKF+GK  N  KALE+Y+SI D+S + +V ICNSI+ CLVRNGK ESS+KLFD+MK
Sbjct: 137  YSSYIKFLGKSGNSLKALEIYNSITDESDKVNVFICNSILSCLVRNGKFESSLKLFDKMK 196

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL PDAVTY+T+              GCIKDK+GY KAL+ VQELK NG +MD+V+YG
Sbjct: 197  QSGLTPDAVTYNTLL------------TGCIKDKNGYSKALELVQELKYNGAQMDNVMYG 244

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
             LLAICASNN C +A+++F QMK EG SPNV+HYSSLLNAYS  G+Y KAD L++DMKS+
Sbjct: 245  ILLAICASNNLCAKAQSYFNQMKVEGHSPNVYHYSSLLNAYSSGGDYTKADELIQDMKSS 304

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKVILTTLLKVYVRG LFEKSRELL EL+TLGYA +EMPYCLLMDGL+KAG + EA
Sbjct: 305  GLVPNKVILTTLLKVYVRGGLFEKSRELLAELDTLGYAENEMPYCLLMDGLSKAGCLDEA 364

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            + +F+EM+ K VKSDGY+HSIMISAFCR G  EEAKQLA D+E +YDKYD+V+LN+ML A
Sbjct: 365  RVVFNEMQEKCVKSDGYAHSIMISAFCRGGCFEEAKQLAGDFEAKYDKYDVVLLNSMLCA 424

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCR G+MESVM ++ K+DE  ISPD NTFHILIKYFCKEK+Y LAYRTMVDMH KGHQ +
Sbjct: 425  YCRTGDMESVMHVMRKLDELAISPDYNTFHILIKYFCKEKMYILAYRTMVDMHRKGHQPE 484

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI  LGK  A SEA SVYNMLR+SKR+MCKALHEK+L IL++G LLKDAYVVVK
Sbjct: 485  EELCSSLIFHLGKMRAHSEALSVYNMLRYSKRSMCKALHEKILHILISGKLLKDAYVVVK 544

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DN+E IS   +KKFA +F++ GNINL+NDVM A+H++G +IDQ +F IAI+RY+ +    
Sbjct: 545  DNSESISHPVIKKFASAFVRLGNINLVNDVMKAIHTTGYRIDQGIFHIAIARYIAEREKK 604

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLRT 1601
                    WMTGQGYVV+ S+RNL+LKNSHL GR LIA+ LSKQ   SK  +T
Sbjct: 605  ELLLKLLEWMTGQGYVVDSSTRNLILKNSHLLGRQLIADILSKQHMKSKSSKT 657



 Score = 69.7 bits (169), Expect = 5e-09
 Identities = 61/328 (18%), Positives = 130/328 (39%), Gaps = 4/328 (1%)
 Frame = +3

Query: 450  NVFHYSSLLNAYSCDGNYAKADALVKDMKSAGLVPNKVILTTLLKVYVRGDLFEKSRELL 629
            ++  YSS +      GN  KA  +   +       N  I  ++L   VR   FE S +L 
Sbjct: 133  SISSYSSYIKFLGKSGNSLKALEIYNSITDESDKVNVFICNSILSCLVRNGKFESSLKLF 192

Query: 630  VELETLGYAVDEMPYCLLMDGLAK-AGRILEAKAIFDEMEMKGVKSDGYSHSIMISAFCR 806
             +++  G   D + Y  L+ G  K      +A  +  E++  G + D   + I+++    
Sbjct: 193  DKMKQSGLTPDAVTYNTLLTGCIKDKNGYSKALELVQELKYNGAQMDNVMYGILLAICAS 252

Query: 807  SGLLEEAKQLARDYETRYDKYDLVMLNTMLRAYCRAGEMESVMQMLGKMDEFKISPDCNT 986
            + L  +A+      +      ++   +++L AY   G+     +++  M    + P+   
Sbjct: 253  NNLCAKAQSYFNQMKVEGHSPNVYHYSSLLNAYSSGGDYTKADELIQDMKSSGLVPNKVI 312

Query: 987  FHILIKYFCKEKLYQLAYRTMVDMHSKGHQLDEELSSFLIVQLGKTGAPSEAFSVYNMLR 1166
               L+K + +  L++ +   + ++ + G+  +E     L+  L K G   EA  V+N ++
Sbjct: 313  LTTLLKVYVRGGLFEKSRELLAELDTLGYAENEMPYCLLMDGLSKAGCLDEARVVFNEMQ 372

Query: 1167 FSKRTMCKALHEKVLKILVAGGLLKDAYVVVKDNAERISRN---CLKKFAISFMKSGNIN 1337
                      H  ++     GG  ++A  +  D   +  +     L     ++ ++G++ 
Sbjct: 373  EKCVKSDGYAHSIMISAFCRGGCFEEAKQLAGDFEAKYDKYDVVLLNSMLCAYCRTGDME 432

Query: 1338 LINDVMSAVHSSGQKIDQEVFDIAISRY 1421
             +  VM  +       D   F I I  +
Sbjct: 433  SVMHVMRKLDELAISPDYNTFHILIKYF 460



 Score = 58.9 bits (141), Expect = 9e-06
 Identities = 48/220 (21%), Positives = 98/220 (44%), Gaps = 1/220 (0%)
 Frame = +3

Query: 609  EKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEAKAIFDEMEMKGVKSDGYSHSIM 788
            ++S +L   LE LG  +       ++      GR  +   +F+ M+  G K+   S+S  
Sbjct: 82   QQSSDLTSSLERLGGILKVPDLNAILRHFGDLGRGRDVLQLFEWMQQHG-KTSISSYSSY 140

Query: 789  ISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRAYCRAGEMESVMQMLGKMDEFKI 968
            I    +SG   +A ++        DK ++ + N++L    R G+ ES +++  KM +  +
Sbjct: 141  IKFLGKSGNSLKALEIYNSITDESDKVNVFICNSILSCLVRNGKFESSLKLFDKMKQSGL 200

Query: 969  SPDCNTFHILIKYFCKEKL-YQLAYRTMVDMHSKGHQLDEELSSFLIVQLGKTGAPSEAF 1145
            +PD  T++ L+    K+K  Y  A   + ++   G Q+D  +   L+         ++A 
Sbjct: 201  TPDAVTYNTLLTGCIKDKNGYSKALELVQELKYNGAQMDNVMYGILLAICASNNLCAKAQ 260

Query: 1146 SVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVKD 1265
            S +N ++    +     +  +L    +GG    A  +++D
Sbjct: 261  SYFNQMKVEGHSPNVYHYSSLLNAYSSGGDYTKADELIQD 300


>ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Cucumis sativus]
          Length = 668

 Score =  709 bits (1830), Expect = 0.0
 Identities = 359/535 (67%), Positives = 424/535 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS+IKFMG+G NP KALEVY++I++ SI+N + ICNSI+ CLVRNGK ++S+KLF QMK
Sbjct: 140  YSSYIKFMGRGLNPLKALEVYNNIEEVSIKNSIFICNSILNCLVRNGKFDTSVKLFHQMK 199

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
              GL PD VTYST+              GCI+ K GY KA++ ++EL+DNGL MD V YG
Sbjct: 200  NDGLCPDTVTYSTML------------TGCIRVKHGYAKAMELLKELQDNGLCMDCVSYG 247

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            TL+AICAS+NR E+AE FF QM+ EG SPN+FHY SLLNAYS +G+Y KAD L++DMK  
Sbjct: 248  TLIAICASHNRLEDAERFFNQMRAEGHSPNMFHYGSLLNAYSINGDYKKADELIEDMKLT 307

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKVILTTLLKVYVRG LFEKSR+LL ELE+LGY  +EMPYCLLMDGLAKAG I EA
Sbjct: 308  GLVPNKVILTTLLKVYVRGGLFEKSRKLLSELESLGYGENEMPYCLLMDGLAKAGSIREA 367

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            K +FDEM+ K VK+DGY+HSIMISAFCR GLLEEAK LA+D+E  YD+YD+V+LNTML A
Sbjct: 368  KTVFDEMKAKNVKTDGYAHSIMISAFCRGGLLEEAKLLAKDFEATYDRYDIVILNTMLCA 427

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAGEMESVMQML KMD+  ISPD NTFHILIKYF KEKLY L YRT+ DMH KGHQ +
Sbjct: 428  YCRAGEMESVMQMLRKMDDLAISPDYNTFHILIKYFFKEKLYLLCYRTLEDMHRKGHQPE 487

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI+ LG   A SEAFSVYN+L++SKRTMCKALHEK+L IL+AG LLKDAYVVVK
Sbjct: 488  EELCSSLILSLGNIRAYSEAFSVYNILKYSKRTMCKALHEKILHILIAGRLLKDAYVVVK 547

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNA  IS+  ++KFA  FMK GN+NLINDVM A+H SG KIDQ++F IA SRY+  P   
Sbjct: 548  DNAGVISKPAIRKFAFGFMKFGNVNLINDVMKAIHGSGYKIDQDLFMIATSRYIELPEKK 607

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLRTQE 1607
                    WM GQGYVV+ S+RNL+LKN+HLFGR LIAE LSK   +SK  +++E
Sbjct: 608  DLFIQLLKWMPGQGYVVDSSTRNLILKNAHLFGRQLIAEILSKHSLLSKSTKSRE 662


>ref|XP_007225150.1| hypothetical protein PRUPE_ppa002505mg [Prunus persica]
            gi|462422086|gb|EMJ26349.1| hypothetical protein
            PRUPE_ppa002505mg [Prunus persica]
          Length = 664

 Score =  707 bits (1825), Expect = 0.0
 Identities = 360/535 (67%), Positives = 429/535 (80%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS+IKFMGK  NP KALE+Y++IQD S + +V ICNS++G L+R+GK + S KLF QMK
Sbjct: 137  YSSYIKFMGKSLNPVKALEIYNNIQDASTKKNVHICNSVLGSLIRSGKFDGSFKLFHQMK 196

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL PDAVTYST+             AGC K K GY KAL+ VQEL+ N L+MDSVIYG
Sbjct: 197  QDGLTPDAVTYSTLL------------AGCNKVKHGYSKALELVQELQRNELQMDSVIYG 244

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            TLLA+CASNN+ EEAE +F QMK+EG  PNVFHYS++LNAYS  GNY +AD LV+DMKSA
Sbjct: 245  TLLAVCASNNKLEEAEGYFKQMKNEGYLPNVFHYSAMLNAYSISGNYKEADDLVQDMKSA 304

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKVILTTLLKVYVRG LFEKSRELL ELE LGYA DEMPYCLLMD LAKAGRI EA
Sbjct: 305  GLVPNKVILTTLLKVYVRGGLFEKSRELLAELEALGYAEDEMPYCLLMDALAKAGRIHEA 364

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            K +FDEM+ K ++S+GYS+SIMISAFCR GLLE+AKQL++D E  +DK+DLVMLNTM+ A
Sbjct: 365  KLVFDEMKEKSIRSNGYSYSIMISAFCRGGLLEDAKQLSKDVERTHDKFDLVMLNTMICA 424

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAGEM+SVM+M+ KMDE KI+PD NTFHILIKYFCKEKLY LAY+TM DMH+KGHQ D
Sbjct: 425  YCRAGEMDSVMEMMRKMDEQKITPDYNTFHILIKYFCKEKLYLLAYQTMEDMHNKGHQPD 484

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S L+  LGK  A SEA+SVYN+LR+SKRTMCKALHEK+L IL+AG LLKDAYVVVK
Sbjct: 485  EELCSSLMFLLGKIRAYSEAYSVYNILRYSKRTMCKALHEKILHILLAGQLLKDAYVVVK 544

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNA  IS+  +KKF+ +F+K GNINLINDV+  + +SG KIDQ +F +AISRY+  P   
Sbjct: 545  DNAGLISKPAVKKFSTAFLKLGNINLINDVLKVIDASGCKIDQGLFQMAISRYIALPEKK 604

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLRTQE 1607
                    WM GQGYVV+ ++RNL+LKNSHLFGR  IA+ LSKQ  +SK  ++++
Sbjct: 605  ELLIQMLLWMPGQGYVVDSATRNLILKNSHLFGRQHIADVLSKQHMISKASKSRK 659


>ref|XP_007042227.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao] gi|508706162|gb|EOX98058.1| Pentatricopeptide
            repeat superfamily protein isoform 1 [Theobroma cacao]
          Length = 717

 Score =  701 bits (1810), Expect = 0.0
 Identities = 359/532 (67%), Positives = 424/532 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS+IK MGK  +P KALE+Y+SI D+S R +V ICNS++  LVRNGK ES IKLFD+MK
Sbjct: 132  YSSYIKIMGKKLSPIKALEIYNSIPDESTRINVFICNSLLSSLVRNGKFESGIKLFDKMK 191

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL PD+VTY+T+             AGCIK K G+ KAL+ ++ELK NGL+MDSV+YG
Sbjct: 192  QDGLTPDSVTYNTLL------------AGCIKIKHGHSKALELIKELKYNGLKMDSVMYG 239

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            TLLA+CAS+   EEA+ +F QM++EG SPN++HYSSLLNAYS DGNY KAD LV+ MKS+
Sbjct: 240  TLLAVCASSGLHEEAQNYFNQMREEGHSPNLYHYSSLLNAYSYDGNYCKADELVEQMKSS 299

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKVILTTLLKVYVRG LFEKS +LL ELE LGYA DEMP+CLLMDGL+KAGR+ EA
Sbjct: 300  GLVPNKVILTTLLKVYVRGGLFEKSTKLLAELEALGYAEDEMPFCLLMDGLSKAGRLDEA 359

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            +++F EM+ K VKSDGYSHSIMISA CR+GL EEAK+LA+D+E +Y+KYDLVMLNTML A
Sbjct: 360  RSVFVEMQQKCVKSDGYSHSIMISALCRAGLFEEAKELAQDFEAQYNKYDLVMLNTMLCA 419

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAGEMESVMQ + KMDE  ISPD NTFHILIKYFCKEKLY LAY+TM DMH KG+  +
Sbjct: 420  YCRAGEMESVMQTMKKMDELAISPDYNTFHILIKYFCKEKLYLLAYKTMEDMHGKGYHPE 479

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI QLGK  A  EAFSVYNMLR+SKRTMCKALHEK+L IL+AG LLKDAYVVVK
Sbjct: 480  EELCSSLIFQLGKMKAHLEAFSVYNMLRYSKRTMCKALHEKILHILIAGQLLKDAYVVVK 539

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNAE IS+  + KFA +FMK GNIN+INDV+  +H SG KIDQ +F +AISRY+G+P   
Sbjct: 540  DNAELISQPAITKFATAFMKLGNINMINDVLKVLHGSGYKIDQGLFQMAISRYLGQPEKK 599

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLR 1598
                    WM G GYVV+ S+RN++LKNS L GR L AE LSKQ  MSK+ R
Sbjct: 600  ELLLQLLQWMPGHGYVVDSSTRNMILKNSQLLGRQLTAEILSKQHMMSKVSR 651


>ref|XP_007042228.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2
            [Theobroma cacao] gi|508706163|gb|EOX98059.1|
            Pentatricopeptide repeat (PPR) superfamily protein
            isoform 2 [Theobroma cacao]
          Length = 649

 Score =  694 bits (1791), Expect = 0.0
 Identities = 358/533 (67%), Positives = 423/533 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS+IK MGK  +P KALE+Y+SI D+S R +V ICNS++  LVRNGK ES IKLFD+MK
Sbjct: 132  YSSYIKIMGKKLSPIKALEIYNSIPDESTRINVFICNSLLSSLVRNGKFESGIKLFDKMK 191

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL PD+VTY+T+             AGCIK K G+ KAL+ ++ELK NGL+MDSV+YG
Sbjct: 192  QDGLTPDSVTYNTLL------------AGCIKIKHGHSKALELIKELKYNGLKMDSVMYG 239

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            TLLA+CAS+   EEA+ +F QM++EG SPN++HYSSLLNAYS DGNY KAD LV+ MKS+
Sbjct: 240  TLLAVCASSGLHEEAQNYFNQMREEGHSPNLYHYSSLLNAYSYDGNYCKADELVEQMKSS 299

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKVILTTLLKVYVRG LFEKS +LL ELE LGYA DEMP+CLLMDGL+KAGR+ EA
Sbjct: 300  GLVPNKVILTTLLKVYVRGGLFEKSTKLLAELEALGYAEDEMPFCLLMDGLSKAGRLDEA 359

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            +++F EM+ K VKSDGYSHSIMISA CR+GL EEAK+LA+D+E +Y+KYDLVMLNTML A
Sbjct: 360  RSVFVEMQQKCVKSDGYSHSIMISALCRAGLFEEAKELAQDFEAQYNKYDLVMLNTMLCA 419

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAGEMESVMQ + KMDE  ISPD NTFHILIKYFCKEKLY LAY+TM DMH KG+  +
Sbjct: 420  YCRAGEMESVMQTMKKMDELAISPDYNTFHILIKYFCKEKLYLLAYKTMEDMHGKGYHPE 479

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI QLGK  A  EAFSVYNMLR+SKRTMCKALHEK+L IL+AG LLKDAYVVVK
Sbjct: 480  EELCSSLIFQLGKMKAHLEAFSVYNMLRYSKRTMCKALHEKILHILIAGQLLKDAYVVVK 539

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNAE IS+  + KFA +FMK GNIN+INDV+  +H SG KIDQ    +AISRY+G+P   
Sbjct: 540  DNAELISQPAITKFATAFMKLGNINMINDVLKVLHGSGYKIDQ----MAISRYLGQPEKK 595

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLRT 1601
                    WM G GYVV+ S+RN++LKNS L GR L AE LSKQ  MSK+ R+
Sbjct: 596  ELLLQLLQWMPGHGYVVDSSTRNMILKNSQLLGRQLTAEILSKQHMMSKVSRS 648


>ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X2 [Solanum tuberosum]
          Length = 651

 Score =  687 bits (1772), Expect = 0.0
 Identities = 343/532 (64%), Positives = 424/532 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS++KFMGK  +   A+E+Y  I+D+SI+ +VS+CN+ +  L++NGK ESS+KLF QMK
Sbjct: 125  YSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLFTQMK 184

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL+PD  TYST+             AGC K   GY KAL+ VQEL  NGL+MDSV YG
Sbjct: 185  RDGLVPDVFTYSTLL------------AGCAKVNGGYYKALELVQELMSNGLQMDSVTYG 232

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            +LL++CAS+  C EA  +F +MKDEG SPNV+HYSSLLNAYS D NY KA+ L+++M+SA
Sbjct: 233  SLLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSA 292

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLV NKVI TTLLKVYV+G LFEKS+ELL ELE LGYA DEMP+CLLMDGLAK+G +LEA
Sbjct: 293  GLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEA 352

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            K++FDEM  K VK+DGYS+SIMISAFCRSGLLE+AK++A ++E +YDKYD+V+LN ML A
Sbjct: 353  KSVFDEMMEKHVKTDGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSA 412

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAG+ME+VM M+ KMD+  ISPD NTF+ILI+YFCKEKLY LAYRTM DMHSKGHQ +
Sbjct: 413  YCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPE 472

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            E L S LI  LGKTGA SEAFSVYNMLR+SKRT+  ALHE +L IL+AG LLKDAYVVVK
Sbjct: 473  EGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVVK 532

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNA  IS+  +KKF+++FM+SGN+NLINDVM+A+HSSG KIDQE+FD+AI+RY+ KP   
Sbjct: 533  DNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKK 592

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLR 1598
                    WM G+GY ++ S+RNL+LKNSHLFG  LIAE+LSK   MSK ++
Sbjct: 593  ELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSKKVK 644



 Score = 68.9 bits (167), Expect = 9e-09
 Identities = 76/384 (19%), Positives = 151/384 (39%), Gaps = 4/384 (1%)
 Frame = +3

Query: 450  NVFHYSSLLNAYSCDGNYAKADALVKDMKSAGLVPNKVILTTLLKVYVRGDLFEKSRELL 629
            NV  YSS +       +   A  + +D+K   +  N  +    L   ++    E S +L 
Sbjct: 121  NVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLF 180

Query: 630  VELETLGYAVDEMPYCLLMDGLAKA-GRILEAKAIFDEMEMKGVKSDGYSHSIMISAFCR 806
             +++  G   D   Y  L+ G AK  G   +A  +  E+   G++ D  ++  ++S    
Sbjct: 181  TQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGSLLSVCAS 240

Query: 807  SGLLEEAKQLARDYETRYDKYDLVMLNTMLRAYCRAGEMESVMQMLGKMDEFKISPDCNT 986
                 EA +  +  +      ++   +++L AY      E    ++ +M    +  +   
Sbjct: 241  HKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAGLVLNKVI 300

Query: 987  FHILIKYFCKEKLYQLAYRTMVDMHSKGHQLDEELSSFLIVQLGKTGAPSEAFSVYNMLR 1166
            +  L+K + K  L++ +   + ++ + G+  DE     L+  L K+G   EA SV++ + 
Sbjct: 301  YTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMM 360

Query: 1167 FSKRTMCKALHEKVLKILVAGGLLKDAYVVVKDNAERISRN---CLKKFAISFMKSGNIN 1337
                      +  ++      GLL+DA  V  +  E+  +     L     ++ ++G + 
Sbjct: 361  EKHVKTDGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSAYCRAGKME 420

Query: 1338 LINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXXXXXXXXXXWMTGQGYVVEPSSRNLL 1517
             +  +M  +  S    D   F+I I RY  K             M  +G+  E    + L
Sbjct: 421  NVMSMMKKMDDSAISPDWNTFNILI-RYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSL 479

Query: 1518 LKNSHLFGRHLIAETLSKQQRMSK 1589
            + +    G H  A ++    R SK
Sbjct: 480  IYHLGKTGAHSEAFSVYNMLRYSK 503


>ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Solanum lycopersicum]
          Length = 642

 Score =  682 bits (1761), Expect = 0.0
 Identities = 342/529 (64%), Positives = 420/529 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS++KFMGK  +   A+E+Y  I+D+SI+ +VS+CN+ +  L++NGK ESS+KLF QMK
Sbjct: 125  YSSYVKFMGKSLSCVDAVEMYRDIKDRSIKFNVSVCNAFLSSLIKNGKSESSLKLFTQMK 184

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL+PD  TYST+             AGC K   GY KAL+ VQE+  NGL MDSV YG
Sbjct: 185  RDGLVPDVFTYSTLL------------AGCAKVNGGYYKALELVQEMMSNGLEMDSVTYG 232

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            +LL++CAS+  C EA  +F +MKDEG SPNV+HYSSLLNAYS D NY KA+AL+++M+SA
Sbjct: 233  SLLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEALIEEMRSA 292

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLV NKVI TTLLKVYV+G LFEKS+ELL ELE LGYA DEMP+CLLMDGLAK+G +LEA
Sbjct: 293  GLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEA 352

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            K++FDEM  K VK+DGYS+SIMISAFCR GLLE+AK+LA ++E +YDKYD+V+LN ML A
Sbjct: 353  KSVFDEMMEKQVKTDGYSYSIMISAFCRRGLLEDAKKLASEFEEKYDKYDIVILNAMLSA 412

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAG+ME+VM M+ KMD+  ISPD NTF+ILI+YFCKEKLY LAYRTM DMHSKGHQ +
Sbjct: 413  YCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPE 472

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            E L S LI  LGKTGA SEAFSVYNMLR+SKRT+  ALHE +L IL+AG LLKDAYVVVK
Sbjct: 473  EGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHENILHILIAGRLLKDAYVVVK 532

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNA  IS+  +KKF+++FM+SGN+NLINDVM+A+HSSG KIDQE+FD+AI+RY+ KP   
Sbjct: 533  DNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKK 592

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSK 1589
                    WM  +GY ++ S+RNL+LKNSHLFG  LIAE+LSK   MSK
Sbjct: 593  ELLLWLLKWMPVKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSK 641



 Score = 71.6 bits (174), Expect = 1e-09
 Identities = 88/431 (20%), Positives = 169/431 (39%), Gaps = 5/431 (1%)
 Frame = +3

Query: 312  VQELKDNGLRMDSVIYGTLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSC 491
            V E      +++   Y + +     +  C +A   +  +KD  +  NV   ++ L++   
Sbjct: 110  VFEWMQQNQKINVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKFNVSVCNAFLSSLIK 169

Query: 492  DGNYAKADALVKDMKSAGLVPNKVILTTLLK--VYVRGDLFEKSRELLVELETLGYAVDE 665
            +G    +  L   MK  GLVP+    +TLL     V G  + K+ EL+ E+ + G  +D 
Sbjct: 170  NGKSESSLKLFTQMKRDGLVPDVFTYSTLLAGCAKVNGGYY-KALELVQEMMSNGLEMDS 228

Query: 666  MPYCLLMDGLAKAGRILEAKAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARD 845
            + Y  L+   A      EA   F +M+ +G   + Y +S +++A+      E+A+ L  +
Sbjct: 229  VTYGSLLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEALIEE 288

Query: 846  YETRYDKYDLVMLNTMLRAYCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKL 1025
              +     + V+  T+L+ Y + G  E   ++L +++                       
Sbjct: 289  MRSAGLVLNKVIYTTLLKVYVKGGLFEKSKELLKELE----------------------- 325

Query: 1026 YQLAYRTMVDMHSKGHQLDEELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEK 1205
                        + G+  DE     L+  L K+G   EA SV++ +   +       +  
Sbjct: 326  ------------ALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMMEKQVKTDGYSYSI 373

Query: 1206 VLKILVAGGLLKDAYVVVKDNAERISRN---CLKKFAISFMKSGNINLINDVMSAVHSSG 1376
            ++      GLL+DA  +  +  E+  +     L     ++ ++G +  +  +M  +  S 
Sbjct: 374  MISAFCRRGLLEDAKKLASEFEEKYDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSA 433

Query: 1377 QKIDQEVFDIAISRYVGKPXXXXXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIA 1556
               D   F+I I RY  K             M  +G+  E    + L+ +    G H  A
Sbjct: 434  ISPDWNTFNILI-RYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEA 492

Query: 1557 ETLSKQQRMSK 1589
             ++    R SK
Sbjct: 493  FSVYNMLRYSK 503


>ref|XP_006343484.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X4 [Solanum tuberosum]
          Length = 539

 Score =  682 bits (1760), Expect = 0.0
 Identities = 343/533 (64%), Positives = 424/533 (79%), Gaps = 1/533 (0%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS++KFMGK  +   A+E+Y  I+D+SI+ +VS+CN+ +  L++NGK ESS+KLF QMK
Sbjct: 12   YSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLFTQMK 71

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL+PD  TYST+             AGC K   GY KAL+ VQEL  NGL+MDSV YG
Sbjct: 72   RDGLVPDVFTYSTLL------------AGCAKVNGGYYKALELVQELMSNGLQMDSVTYG 119

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            +LL++CAS+  C EA  +F +MKDEG SPNV+HYSSLLNAYS D NY KA+ L+++M+SA
Sbjct: 120  SLLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSA 179

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLV NKVI TTLLKVYV+G LFEKS+ELL ELE LGYA DEMP+CLLMDGLAK+G +LEA
Sbjct: 180  GLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEA 239

Query: 723  KAIFDEMEMKGVKS-DGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLR 899
            K++FDEM  K VK+ DGYS+SIMISAFCRSGLLE+AK++A ++E +YDKYD+V+LN ML 
Sbjct: 240  KSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLS 299

Query: 900  AYCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQL 1079
            AYCRAG+ME+VM M+ KMD+  ISPD NTF+ILI+YFCKEKLY LAYRTM DMHSKGHQ 
Sbjct: 300  AYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQP 359

Query: 1080 DEELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVV 1259
            +E L S LI  LGKTGA SEAFSVYNMLR+SKRT+  ALHE +L IL+AG LLKDAYVVV
Sbjct: 360  EEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVV 419

Query: 1260 KDNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXX 1439
            KDNA  IS+  +KKF+++FM+SGN+NLINDVM+A+HSSG KIDQE+FD+AI+RY+ KP  
Sbjct: 420  KDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEK 479

Query: 1440 XXXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLR 1598
                     WM G+GY ++ S+RNL+LKNSHLFG  LIAE+LSK   MSK ++
Sbjct: 480  KELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSKKVK 532



 Score = 70.9 bits (172), Expect = 2e-09
 Identities = 78/385 (20%), Positives = 154/385 (40%), Gaps = 5/385 (1%)
 Frame = +3

Query: 450  NVFHYSSLLNAYSCDGNYAKADALVKDMKSAGLVPNKVILTTLLKVYVRGDLFEKSRELL 629
            NV  YSS +       +   A  + +D+K   +  N  +    L   ++    E S +L 
Sbjct: 8    NVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLF 67

Query: 630  VELETLGYAVDEMPYCLLMDGLAKA-GRILEAKAIFDEMEMKGVKSDGYSHSIMISAFCR 806
             +++  G   D   Y  L+ G AK  G   +A  +  E+   G++ D  ++  ++S    
Sbjct: 68   TQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGSLLSVCAS 127

Query: 807  SGLLEEAKQLARDYETRYDKYDLVMLNTMLRAYCRAGEMESVMQMLGKMDEFKISPDCNT 986
                 EA +  +  +      ++   +++L AY      E    ++ +M    +  +   
Sbjct: 128  HKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAGLVLNKVI 187

Query: 987  FHILIKYFCKEKLYQLAYRTMVDMHSKGHQLDEELSSFLIVQLGKTGAPSEAFSVYN-ML 1163
            +  L+K + K  L++ +   + ++ + G+  DE     L+  L K+G   EA SV++ M+
Sbjct: 188  YTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMM 247

Query: 1164 RFSKRTMCKALHEKVLKILVAGGLLKDAYVVVKDNAERISRN---CLKKFAISFMKSGNI 1334
                +T     +  ++      GLL+DA  V  +  E+  +     L     ++ ++G +
Sbjct: 248  EKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSAYCRAGKM 307

Query: 1335 NLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXXXXXXXXXXWMTGQGYVVEPSSRNL 1514
              +  +M  +  S    D   F+I I RY  K             M  +G+  E    + 
Sbjct: 308  ENVMSMMKKMDDSAISPDWNTFNILI-RYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSS 366

Query: 1515 LLKNSHLFGRHLIAETLSKQQRMSK 1589
            L+ +    G H  A ++    R SK
Sbjct: 367  LIYHLGKTGAHSEAFSVYNMLRYSK 391


>ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X1 [Solanum tuberosum]
          Length = 652

 Score =  682 bits (1760), Expect = 0.0
 Identities = 343/533 (64%), Positives = 424/533 (79%), Gaps = 1/533 (0%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS++KFMGK  +   A+E+Y  I+D+SI+ +VS+CN+ +  L++NGK ESS+KLF QMK
Sbjct: 125  YSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLFTQMK 184

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL+PD  TYST+             AGC K   GY KAL+ VQEL  NGL+MDSV YG
Sbjct: 185  RDGLVPDVFTYSTLL------------AGCAKVNGGYYKALELVQELMSNGLQMDSVTYG 232

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            +LL++CAS+  C EA  +F +MKDEG SPNV+HYSSLLNAYS D NY KA+ L+++M+SA
Sbjct: 233  SLLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSA 292

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLV NKVI TTLLKVYV+G LFEKS+ELL ELE LGYA DEMP+CLLMDGLAK+G +LEA
Sbjct: 293  GLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEA 352

Query: 723  KAIFDEMEMKGVKS-DGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLR 899
            K++FDEM  K VK+ DGYS+SIMISAFCRSGLLE+AK++A ++E +YDKYD+V+LN ML 
Sbjct: 353  KSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLS 412

Query: 900  AYCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQL 1079
            AYCRAG+ME+VM M+ KMD+  ISPD NTF+ILI+YFCKEKLY LAYRTM DMHSKGHQ 
Sbjct: 413  AYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQP 472

Query: 1080 DEELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVV 1259
            +E L S LI  LGKTGA SEAFSVYNMLR+SKRT+  ALHE +L IL+AG LLKDAYVVV
Sbjct: 473  EEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVV 532

Query: 1260 KDNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXX 1439
            KDNA  IS+  +KKF+++FM+SGN+NLINDVM+A+HSSG KIDQE+FD+AI+RY+ KP  
Sbjct: 533  KDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEK 592

Query: 1440 XXXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLLR 1598
                     WM G+GY ++ S+RNL+LKNSHLFG  LIAE+LSK   MSK ++
Sbjct: 593  KELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSKKVK 645



 Score = 70.9 bits (172), Expect = 2e-09
 Identities = 78/385 (20%), Positives = 154/385 (40%), Gaps = 5/385 (1%)
 Frame = +3

Query: 450  NVFHYSSLLNAYSCDGNYAKADALVKDMKSAGLVPNKVILTTLLKVYVRGDLFEKSRELL 629
            NV  YSS +       +   A  + +D+K   +  N  +    L   ++    E S +L 
Sbjct: 121  NVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLF 180

Query: 630  VELETLGYAVDEMPYCLLMDGLAKA-GRILEAKAIFDEMEMKGVKSDGYSHSIMISAFCR 806
             +++  G   D   Y  L+ G AK  G   +A  +  E+   G++ D  ++  ++S    
Sbjct: 181  TQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGSLLSVCAS 240

Query: 807  SGLLEEAKQLARDYETRYDKYDLVMLNTMLRAYCRAGEMESVMQMLGKMDEFKISPDCNT 986
                 EA +  +  +      ++   +++L AY      E    ++ +M    +  +   
Sbjct: 241  HKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAGLVLNKVI 300

Query: 987  FHILIKYFCKEKLYQLAYRTMVDMHSKGHQLDEELSSFLIVQLGKTGAPSEAFSVYN-ML 1163
            +  L+K + K  L++ +   + ++ + G+  DE     L+  L K+G   EA SV++ M+
Sbjct: 301  YTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMM 360

Query: 1164 RFSKRTMCKALHEKVLKILVAGGLLKDAYVVVKDNAERISRN---CLKKFAISFMKSGNI 1334
                +T     +  ++      GLL+DA  V  +  E+  +     L     ++ ++G +
Sbjct: 361  EKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSAYCRAGKM 420

Query: 1335 NLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXXXXXXXXXXWMTGQGYVVEPSSRNL 1514
              +  +M  +  S    D   F+I I RY  K             M  +G+  E    + 
Sbjct: 421  ENVMSMMKKMDDSAISPDWNTFNILI-RYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSS 479

Query: 1515 LLKNSHLFGRHLIAETLSKQQRMSK 1589
            L+ +    G H  A ++    R SK
Sbjct: 480  LIYHLGKTGAHSEAFSVYNMLRYSK 504


>ref|XP_007148512.1| hypothetical protein PHAVU_006G214900g [Phaseolus vulgaris]
            gi|561021735|gb|ESW20506.1| hypothetical protein
            PHAVU_006G214900g [Phaseolus vulgaris]
          Length = 639

 Score =  682 bits (1760), Expect = 0.0
 Identities = 343/525 (65%), Positives = 416/525 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YS +++FM    +  + L++YHSIQD+S R ++ +CNS++GCL++ GK +S +KLF QM+
Sbjct: 117  YSHYMRFMANNLDAAEMLQLYHSIQDESARKNILVCNSVLGCLIKKGKFDSGMKLFRQMQ 176

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
              GL+PD VTYST+             AGCIK ++GYPKAL+ +QEL+ + L+MD VIYG
Sbjct: 177  LDGLVPDPVTYSTLL------------AGCIKIENGYPKALELIQELQHSKLQMDGVIYG 224

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            T+LA+CASN + EEAE +F QMKDEG S NV+HYSSLLNAYS  GNY KAD L +DMKS 
Sbjct: 225  TILAVCASNGKWEEAEKYFNQMKDEGHSRNVYHYSSLLNAYSTCGNYKKADILFQDMKSE 284

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKVILTTLLKVYV+G LF+KSRELL EL++LGYA DEMPYC+LMDGLAKAG+I EA
Sbjct: 285  GLVPNKVILTTLLKVYVKGGLFDKSRELLAELKSLGYAEDEMPYCILMDGLAKAGQIHEA 344

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            K IFDEM    V+SDGY+HSIMISA CRS L  EAKQLA+D+ET  +KYD+V+LN+ML A
Sbjct: 345  KLIFDEMMKNHVRSDGYAHSIMISALCRSKLFREAKQLAKDFETTSNKYDIVILNSMLCA 404

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            +CR GEMESVM+ L KMDE  ISP  NTFHILIKYFC+EK+Y LAYRTM DMHSKGHQ  
Sbjct: 405  FCRVGEMESVMETLKKMDELAISPSYNTFHILIKYFCREKMYLLAYRTMKDMHSKGHQPG 464

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI  LG+  A SEAFSVYNMLR+ KRTMCK+LHEK+L IL+AG LLKDAYVVVK
Sbjct: 465  EELCSTLISHLGQVNAYSEAFSVYNMLRYGKRTMCKSLHEKILYILLAGHLLKDAYVVVK 524

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNA+ ISR   KKFAI+FMKSGNIN INDV+  +H SG K+DQ++F +A+SRY+G+P   
Sbjct: 525  DNAKYISRPPTKKFAIAFMKSGNINYINDVLKTLHDSGYKLDQDLFAMAVSRYLGEPEKK 584

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQ 1577
                    WM+GQGY+V+ S+RNL+LK+SHLFGR LIAE LSKQQ
Sbjct: 585  DLLLHLLQWMSGQGYMVDSSTRNLILKHSHLFGRQLIAEVLSKQQ 629



 Score = 72.0 bits (175), Expect = 1e-09
 Identities = 53/227 (23%), Positives = 109/227 (48%), Gaps = 1/227 (0%)
 Frame = +3

Query: 339  RMDSVIYGTLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADA 518
            ++D   Y   +   A+N    E    +  ++DE    N+   +S+L      G +     
Sbjct: 111  KLDVSSYSHYMRFMANNLDAAEMLQLYHSIQDESARKNILVCNSVLGCLIKKGKFDSGMK 170

Query: 519  LVKDMKSAGLVPNKVILTTLLKVYVR-GDLFEKSRELLVELETLGYAVDEMPYCLLMDGL 695
            L + M+  GLVP+ V  +TLL   ++  + + K+ EL+ EL+     +D + Y  ++   
Sbjct: 171  LFRQMQLDGLVPDPVTYSTLLAGCIKIENGYPKALELIQELQHSKLQMDGVIYGTILAVC 230

Query: 696  AKAGRILEAKAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDL 875
            A  G+  EA+  F++M+ +G   + Y +S +++A+   G  ++A  L +D ++     + 
Sbjct: 231  ASNGKWEEAEKYFNQMKDEGHSRNVYHYSSLLNAYSTCGNYKKADILFQDMKSEGLVPNK 290

Query: 876  VMLNTMLRAYCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCK 1016
            V+L T+L+ Y + G  +   ++L ++     + D   + IL+    K
Sbjct: 291  VILTTLLKVYVKGGLFDKSRELLAELKSLGYAEDEMPYCILMDGLAK 337


>ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 646

 Score =  682 bits (1759), Expect = 0.0
 Identities = 343/530 (64%), Positives = 422/530 (79%), Gaps = 1/530 (0%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS++KFMGK  +   A+E+Y  I+D+SI+ +VS+CN+ +  L++NGK ESS+KLF QMK
Sbjct: 125  YSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLFTQMK 184

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL+PD  TYST+             AGC K   GY KAL+ VQEL  NGL+MDSV YG
Sbjct: 185  RDGLVPDVFTYSTLL------------AGCAKVNGGYYKALELVQELMSNGLQMDSVTYG 232

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            +LL++CAS+  C EA  +F +MKDEG SPNV+HYSSLLNAYS D NY KA+ L+++M+SA
Sbjct: 233  SLLSVCASHKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSA 292

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLV NKVI TTLLKVYV+G LFEKS+ELL ELE LGYA DEMP+CLLMDGLAK+G +LEA
Sbjct: 293  GLVLNKVIYTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEA 352

Query: 723  KAIFDEMEMKGVKS-DGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLR 899
            K++FDEM  K VK+ DGYS+SIMISAFCRSGLLE+AK++A ++E +YDKYD+V+LN ML 
Sbjct: 353  KSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLS 412

Query: 900  AYCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQL 1079
            AYCRAG+ME+VM M+ KMD+  ISPD NTF+ILI+YFCKEKLY LAYRTM DMHSKGHQ 
Sbjct: 413  AYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQP 472

Query: 1080 DEELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVV 1259
            +E L S LI  LGKTGA SEAFSVYNMLR+SKRT+  ALHE +L IL+AG LLKDAYVVV
Sbjct: 473  EEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVV 532

Query: 1260 KDNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXX 1439
            KDNA  IS+  +KKF+++FM+SGN+NLINDVM+A+HSSG KIDQE+FD+AI+RY+ KP  
Sbjct: 533  KDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEK 592

Query: 1440 XXXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSK 1589
                     WM G+GY ++ S+RNL+LKNSHLFG  LIAE+LSK   MSK
Sbjct: 593  KELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSK 642



 Score = 70.9 bits (172), Expect = 2e-09
 Identities = 78/385 (20%), Positives = 154/385 (40%), Gaps = 5/385 (1%)
 Frame = +3

Query: 450  NVFHYSSLLNAYSCDGNYAKADALVKDMKSAGLVPNKVILTTLLKVYVRGDLFEKSRELL 629
            NV  YSS +       +   A  + +D+K   +  N  +    L   ++    E S +L 
Sbjct: 121  NVASYSSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLF 180

Query: 630  VELETLGYAVDEMPYCLLMDGLAKA-GRILEAKAIFDEMEMKGVKSDGYSHSIMISAFCR 806
             +++  G   D   Y  L+ G AK  G   +A  +  E+   G++ D  ++  ++S    
Sbjct: 181  TQMKRDGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGSLLSVCAS 240

Query: 807  SGLLEEAKQLARDYETRYDKYDLVMLNTMLRAYCRAGEMESVMQMLGKMDEFKISPDCNT 986
                 EA +  +  +      ++   +++L AY      E    ++ +M    +  +   
Sbjct: 241  HKECNEAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAGLVLNKVI 300

Query: 987  FHILIKYFCKEKLYQLAYRTMVDMHSKGHQLDEELSSFLIVQLGKTGAPSEAFSVYN-ML 1163
            +  L+K + K  L++ +   + ++ + G+  DE     L+  L K+G   EA SV++ M+
Sbjct: 301  YTTLLKVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMM 360

Query: 1164 RFSKRTMCKALHEKVLKILVAGGLLKDAYVVVKDNAERISRN---CLKKFAISFMKSGNI 1334
                +T     +  ++      GLL+DA  V  +  E+  +     L     ++ ++G +
Sbjct: 361  EKHVKTADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSAYCRAGKM 420

Query: 1335 NLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXXXXXXXXXXWMTGQGYVVEPSSRNL 1514
              +  +M  +  S    D   F+I I RY  K             M  +G+  E    + 
Sbjct: 421  ENVMSMMKKMDDSAISPDWNTFNILI-RYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSS 479

Query: 1515 LLKNSHLFGRHLIAETLSKQQRMSK 1589
            L+ +    G H  A ++    R SK
Sbjct: 480  LIYHLGKTGAHSEAFSVYNMLRYSK 504


>ref|XP_002889841.1| hypothetical protein ARALYDRAFT_888388 [Arabidopsis lyrata subsp.
            lyrata] gi|297335683|gb|EFH66100.1| hypothetical protein
            ARALYDRAFT_888388 [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  680 bits (1754), Expect = 0.0
 Identities = 346/529 (65%), Positives = 419/529 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS IKF+G   N  KALE+Y SI D+S + +V ICNSI+ CLV+NGKL+S IKLFDQMK
Sbjct: 136  YSSCIKFVG-AKNVSKALEIYQSIPDESTKINVYICNSILSCLVKNGKLDSCIKLFDQMK 194

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            +GGL PD +TY+T+             AGCIK K+GYPKA++ + EL  NG++MDSV+YG
Sbjct: 195  RGGLKPDVITYNTLL------------AGCIKVKNGYPKAVELIGELPHNGIQMDSVMYG 242

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            T+LAICASN RCEEAE F  QMK EG SPN++HYSSLLN+YS  G+Y KAD L+ +MKS 
Sbjct: 243  TVLAICASNGRCEEAENFIQQMKAEGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSI 302

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKV++TTLLKVY++G LF++SRELL ELE+ GYA +EMPYC+LMDGL+KAG++ EA
Sbjct: 303  GLVPNKVMMTTLLKVYIKGGLFDRSRELLSELESAGYAENEMPYCMLMDGLSKAGKLEEA 362

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            ++IFD+M+ KGVKSDGY++SIMISA CRS   EEAK+L+RD ET Y+K DLVMLNTML A
Sbjct: 363  RSIFDDMKGKGVKSDGYANSIMISALCRSKRFEEAKELSRDSETTYEKCDLVMLNTMLCA 422

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAGEMESVM+M+ KMDE  I PD NTFHILIKYF KEKL+ LAY+T +DMHSKGH+L+
Sbjct: 423  YCRAGEMESVMRMMKKMDEQAIIPDYNTFHILIKYFIKEKLHLLAYQTTLDMHSKGHRLE 482

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI  LGK  APSEAFSVYNMLR+SKRT+CK LHEK+L IL+ G LLKDAY+VVK
Sbjct: 483  EELCSSLIYHLGKIRAPSEAFSVYNMLRYSKRTICKELHEKILHILIHGDLLKDAYIVVK 542

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNA+ IS+  LKKF  +FM SGNINL+NDV+  +H SG KIDQ  F+IAISRY+  P   
Sbjct: 543  DNAKMISQPTLKKFGRAFMISGNINLVNDVLKVLHGSGHKIDQVQFEIAISRYILLPDKK 602

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSK 1589
                    WM GQGY+V+ S+RNL+LKNSH+FGR LIAE LSK    S+
Sbjct: 603  ELLLQLLQWMPGQGYIVDSSTRNLILKNSHMFGRLLIAEILSKHHVASR 651


>ref|NP_172560.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|122242678|sp|Q0WVV0.1|PPR31_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g10910, chloroplastic; Flags: Precursor
            gi|110741600|dbj|BAE98748.1| membrane-associated
            salt-inducible protein isolog [Arabidopsis thaliana]
            gi|332190541|gb|AEE28662.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 664

 Score =  672 bits (1735), Expect = 0.0
 Identities = 342/529 (64%), Positives = 418/529 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS IKF+G   N  KALE+Y SI D+S + +V ICNSI+ CLV+NGKL+S IKLFDQMK
Sbjct: 135  YSSCIKFVG-AKNVSKALEIYQSIPDESTKINVYICNSILSCLVKNGKLDSCIKLFDQMK 193

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL PD VTY+T+             AGCIK K+GYPKA++ + EL  NG++MDSV+YG
Sbjct: 194  RDGLKPDVVTYNTLL------------AGCIKVKNGYPKAIELIGELPHNGIQMDSVMYG 241

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            T+LAICASN R EEAE F  QMK EG SPN++HYSSLLN+YS  G+Y KAD L+ +MKS 
Sbjct: 242  TVLAICASNGRSEEAENFIQQMKVEGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSI 301

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            GLVPNKV++TTLLKVY++G LF++SRELL ELE+ GYA +EMPYC+LMDGL+KAG++ EA
Sbjct: 302  GLVPNKVMMTTLLKVYIKGGLFDRSRELLSELESAGYAENEMPYCMLMDGLSKAGKLEEA 361

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            ++IFD+M+ KGV+SDGY++SIMISA CRS   +EAK+L+RD ET Y+K DLVMLNTML A
Sbjct: 362  RSIFDDMKGKGVRSDGYANSIMISALCRSKRFKEAKELSRDSETTYEKCDLVMLNTMLCA 421

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAGEMESVM+M+ KMDE  +SPD NTFHILIKYF KEKL+ LAY+T +DMHSKGH+L+
Sbjct: 422  YCRAGEMESVMRMMKKMDEQAVSPDYNTFHILIKYFIKEKLHLLAYQTTLDMHSKGHRLE 481

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI  LGK  A +EAFSVYNMLR+SKRT+CK LHEK+L IL+ G LLKDAY+VVK
Sbjct: 482  EELCSSLIYHLGKIRAQAEAFSVYNMLRYSKRTICKELHEKILHILIQGNLLKDAYIVVK 541

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNA+ IS+  LKKF  +FM SGNINL+NDV+  +H SG KIDQ  F+IAISRY+ +P   
Sbjct: 542  DNAKMISQPTLKKFGRAFMISGNINLVNDVLKVLHGSGHKIDQVQFEIAISRYISQPDKK 601

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSK 1589
                    WM GQGYVV+ S+RNL+LKNSH+FGR LIAE LSK    S+
Sbjct: 602  ELLLQLLQWMPGQGYVVDSSTRNLILKNSHMFGRLLIAEILSKHHVASR 650


>ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutrema salsugineum]
            gi|557095175|gb|ESQ35757.1| hypothetical protein
            EUTSA_v10007006mg [Eutrema salsugineum]
          Length = 666

 Score =  672 bits (1734), Expect = 0.0
 Identities = 343/531 (64%), Positives = 418/531 (78%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS IKF+G   +  KALE+Y SI D+S + +V ICNSI+ CLV+NGKLES  KLFDQMK
Sbjct: 136  YSSCIKFVG-AKSVSKALEIYQSIPDESTKINVYICNSILSCLVKNGKLESCFKLFDQMK 194

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL PD +TY+T+             AGCIK K+GY KA++ V EL  NG++MD V+YG
Sbjct: 195  RDGLKPDVITYNTLL------------AGCIKVKNGYSKAMELVGELPHNGIQMDGVMYG 242

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            T+LAICASN RCEEAE+F  QMK +G SPN++HYSSLLN+YS  G+Y KAD L+ +MKS 
Sbjct: 243  TVLAICASNGRCEEAESFIQQMKVKGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSV 302

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            G+VPNKV++TTLLKVY+RG LFE+SRELL ELE+ GYA +EMPYC+LMDGL+KAG+  EA
Sbjct: 303  GIVPNKVMMTTLLKVYIRGGLFERSRELLSELESAGYAENEMPYCMLMDGLSKAGKFEEA 362

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            ++IFDEM+ KGVKSDGY++SIMISA CRS   EEAKQLARD E+ Y+K DLVMLNTML A
Sbjct: 363  RSIFDEMKGKGVKSDGYANSIMISALCRSKRFEEAKQLARDSESTYEKCDLVMLNTMLCA 422

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCRAGEMESVM+M+ KMDE  +SPD NTFHILIKYF KEKL+ LAY+T++DMHSKGH+L+
Sbjct: 423  YCRAGEMESVMRMMKKMDEQAVSPDYNTFHILIKYFIKEKLHLLAYQTLLDMHSKGHRLE 482

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            EEL S LI  LGK  A SEAFSVY+MLR+SKRT+CK LHEK+L IL+ G LLKDAYVVVK
Sbjct: 483  EELCSSLIYHLGKIRAHSEAFSVYSMLRYSKRTICKDLHEKILHILIHGKLLKDAYVVVK 542

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNA+ IS+  LK+F  +FM SGN+NL+NDV+  +H SG KIDQ  F+IAISRY+ +P   
Sbjct: 543  DNAKMISQPTLKRFGRAFMNSGNVNLVNDVLKVLHGSGHKIDQVQFEIAISRYISQPDKK 602

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSKLL 1595
                    WM GQGYVV+ S+RNL+LKNS+LFGR LIAE LSK    S+ +
Sbjct: 603  ELLLQLLQWMPGQGYVVDSSTRNLILKNSNLFGRQLIAEILSKHHIASRTM 653


>gb|EYU26539.1| hypothetical protein MIMGU_mgv1a002527mg [Mimulus guttatus]
          Length = 663

 Score =  671 bits (1732), Expect = 0.0
 Identities = 331/529 (62%), Positives = 422/529 (79%)
 Frame = +3

Query: 3    YSSFIKFMGKGHNPKKALEVYHSIQDKSIRNHVSICNSIIGCLVRNGKLESSIKLFDQMK 182
            YSS+IKF+G+  N  KA+E+Y+SI+D S + +VS+CNS + CL+++GK ES +KLF+QMK
Sbjct: 145  YSSYIKFVGRDSNATKAVEIYNSIKDDSTKTNVSVCNSTLYCLIKSGKFESGLKLFNQMK 204

Query: 183  KGGLLPDAVTYSTVXXXXXXXXXXXXXAGCIKDKDGYPKALKFVQELKDNGLRMDSVIYG 362
            + GL PD VTYST+             +GC K K GY KA++ VQE+K   L+MD+VIYG
Sbjct: 205  QAGLEPDIVTYSTLL------------SGCTKVKGGYIKAMELVQEIKCRKLQMDTVIYG 252

Query: 363  TLLAICASNNRCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSA 542
            TL+++CASNN+ EEAE +F +MK EG SPNVFHYSSLLNAY+ DG+Y KADAL+++M+SA
Sbjct: 253  TLISVCASNNQREEAEKYFNEMKSEGHSPNVFHYSSLLNAYAIDGSYKKADALIEEMRSA 312

Query: 543  GLVPNKVILTTLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKAGRILEA 722
            G+  NK+ILTT LKVYV+G LF+KSRELL +L+ LGYA DEMPYCLLMDGLAK+G++ EA
Sbjct: 313  GIELNKIILTTQLKVYVKGGLFDKSRELLDQLQALGYAEDEMPYCLLMDGLAKSGKVPEA 372

Query: 723  KAIFDEMEMKGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRA 902
            K++FDEM  K VK+DG+S+SIMISA CRSGL+EEAK LA ++ET+YDKYD+V+LN+ML A
Sbjct: 373  KSLFDEMRQKEVKNDGFSYSIMISALCRSGLIEEAKMLACEFETKYDKYDVVILNSMLCA 432

Query: 903  YCRAGEMESVMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLD 1082
            YCR+GEME+VM+ + KMDE  ISPD NTFHILIKYFCKEKLY LAYRTMVDMH KGHQL+
Sbjct: 433  YCRSGEMENVMKTMKKMDESSISPDWNTFHILIKYFCKEKLYLLAYRTMVDMHKKGHQLE 492

Query: 1083 EELSSFLIVQLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVVK 1262
            E+L  FLI  LGKTGA +EAFSVY+ML++SKRT+ K LHEK+L  L+AGGL KDAYV+VK
Sbjct: 493  EDLCVFLIHHLGKTGAHAEAFSVYSMLKYSKRTINKTLHEKILHTLLAGGLFKDAYVLVK 552

Query: 1263 DNAERISRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXX 1442
            DNA+ IS + ++KF  +FM+ GNINLINDV+ ++HSS  KIDQ++F +AISRY+ +P   
Sbjct: 553  DNAKYISESAIRKFTTTFMRKGNINLINDVIKSIHSSSYKIDQDIFHMAISRYIEQPEKK 612

Query: 1443 XXXXXXXXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSK 1589
                    WM GQGY V+ S+RNL+L+N+ LFGR+ I E LSK    SK
Sbjct: 613  ELLLHLLQWMRGQGYPVDSSTRNLILENAELFGRNSITEILSKHYAASK 661



 Score = 73.9 bits (180), Expect = 3e-10
 Identities = 77/403 (19%), Positives = 162/403 (40%), Gaps = 4/403 (0%)
 Frame = +3

Query: 393  RCEEAETFFLQMKDEGVSPNVFHYSSLLNAYSCDGNYAKADALVKDMKSAGLVPNKVILT 572
            R ++    F  M+  G + N+  YSS +     D N  KA  +   +K      N  +  
Sbjct: 123  RWKDLSQLFNWMRQHGKT-NIASYSSYIKFVGRDSNATKAVEIYNSIKDDSTKTNVSVCN 181

Query: 573  TLLKVYVRGDLFEKSRELLVELETLGYAVDEMPYCLLMDGLAKA-GRILEAKAIFDEMEM 749
            + L   ++   FE   +L  +++  G   D + Y  L+ G  K  G  ++A  +  E++ 
Sbjct: 182  STLYCLIKSGKFESGLKLFNQMKQAGLEPDIVTYSTLLSGCTKVKGGYIKAMELVQEIKC 241

Query: 750  KGVKSDGYSHSIMISAFCRSGLLEEAKQLARDYETRYDKYDLVMLNTMLRAYCRAGEMES 929
            + ++ D   +  +IS    +   EEA++   + ++     ++   +++L AY   G  + 
Sbjct: 242  RKLQMDTVIYGTLISVCASNNQREEAEKYFNEMKSEGHSPNVFHYSSLLNAYAIDGSYKK 301

Query: 930  VMQMLGKMDEFKISPDCNTFHILIKYFCKEKLYQLAYRTMVDMHSKGHQLDEELSSFLIV 1109
               ++ +M    I  +       +K + K  L+  +   +  + + G+  DE     L+ 
Sbjct: 302  ADALIEEMRSAGIELNKIILTTQLKVYVKGGLFDKSRELLDQLQALGYAEDEMPYCLLMD 361

Query: 1110 QLGKTGAPSEAFSVYNMLRFSKRTMCKALHEKVLKILVAGGLLKDAYVVV---KDNAERI 1280
             L K+G   EA S+++ +R  +       +  ++  L   GL+++A ++    +   ++ 
Sbjct: 362  GLAKSGKVPEAKSLFDEMRQKEVKNDGFSYSIMISALCRSGLIEEAKMLACEFETKYDKY 421

Query: 1281 SRNCLKKFAISFMKSGNINLINDVMSAVHSSGQKIDQEVFDIAISRYVGKPXXXXXXXXX 1460
                L     ++ +SG +  +   M  +  S    D   F I I +Y  K          
Sbjct: 422  DVVILNSMLCAYCRSGEMENVMKTMKKMDESSISPDWNTFHILI-KYFCKEKLYLLAYRT 480

Query: 1461 XXWMTGQGYVVEPSSRNLLLKNSHLFGRHLIAETLSKQQRMSK 1589
               M  +G+ +E      L+ +    G H  A ++    + SK
Sbjct: 481  MVDMHKKGHQLEEDLCVFLIHHLGKTGAHAEAFSVYSMLKYSK 523

