BLASTX nr result

ID: Akebia23_contig00012021 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00012021
         (2407 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279360.1| PREDICTED: pentatricopeptide repeat-containi...  1124   0.0  
ref|XP_007206426.1| hypothetical protein PRUPE_ppa001946mg [Prun...  1065   0.0  
gb|EXC01449.1| hypothetical protein L484_022020 [Morus notabilis]    1041   0.0  
ref|XP_004295750.1| PREDICTED: pentatricopeptide repeat-containi...  1036   0.0  
ref|XP_007016122.1| Tetratricopeptide repeat (TPR)-like superfam...  1023   0.0  
ref|XP_006480615.1| PREDICTED: pentatricopeptide repeat-containi...  1014   0.0  
ref|XP_004145320.1| PREDICTED: pentatricopeptide repeat-containi...  1013   0.0  
ref|XP_006428806.1| hypothetical protein CICLE_v10011151mg [Citr...  1011   0.0  
ref|XP_002314110.1| pentatricopeptide repeat-containing family p...  1003   0.0  
ref|XP_004241167.1| PREDICTED: pentatricopeptide repeat-containi...   999   0.0  
ref|XP_006350917.1| PREDICTED: pentatricopeptide repeat-containi...   994   0.0  
ref|NP_180537.1| RNA editing factor OTP81 [Arabidopsis thaliana]...   936   0.0  
ref|XP_006410036.1| hypothetical protein EUTSA_v10016305mg [Eutr...   935   0.0  
ref|XP_006293749.1| hypothetical protein CARUB_v10022711mg [Caps...   933   0.0  
ref|XP_002879234.1| predicted protein [Arabidopsis lyrata subsp....   933   0.0  
gb|EYU45383.1| hypothetical protein MIMGU_mgv1a023657mg [Mimulus...   918   0.0  
ref|XP_003520267.1| PREDICTED: pentatricopeptide repeat-containi...   917   0.0  
gb|AFN53666.1| hypothetical protein [Linum usitatissimum]             908   0.0  
ref|XP_006575137.1| PREDICTED: pentatricopeptide repeat-containi...   905   0.0  
gb|EPS69071.1| hypothetical protein M569_05691, partial [Genlise...   896   0.0  

>ref|XP_002279360.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
            chloroplastic-like [Vitis vinifera]
          Length = 743

 Score = 1124 bits (2908), Expect = 0.0
 Identities = 548/745 (73%), Positives = 628/745 (84%), Gaps = 6/745 (0%)
 Frame = +1

Query: 46   MTALNPPLISLPHHDHHITKPN------NNNRYFADHPIISLIDQCSDITQVKQIHAHML 207
            M   NP L+SLP   H +  PN      NN+RYFA+HP +SLIDQCS+  Q+KQIHA ML
Sbjct: 1    MAIPNPCLVSLPR-SHSLPTPNPNSITLNNDRYFANHPTLSLIDQCSETKQLKQIHAQML 59

Query: 208  RIGLFFDPFSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTE 387
            R GLFFDPFSASRLIT +ALSPF S LDYA+QVFDQIPHPN+YTWNTLIRAYASS +P +
Sbjct: 60   RTGLFFDPFSASRLITAAALSPFPS-LDYAQQVFDQIPHPNLYTWNTLIRAYASSSNPHQ 118

Query: 388  SFIIFLQMLHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLV 567
            S +IFL+MLHQ PD P+KFTFPFLIKAASEL  L  G+ FHGM IK  L SDVFILNSL+
Sbjct: 119  SLLIFLRMLHQSPDFPDKFTFPFLIKAASELEELFTGKAFHGMVIKVLLGSDVFILNSLI 178

Query: 568  HFYASCGNLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDV 747
            HFYA CG L   Y+VFV IP+RDVVSWNSMITAFVQ  C EEALELFQ ME +N+KPN +
Sbjct: 179  HFYAKCGELGLGYRVFVNIPRRDVVSWNSMITAFVQGGCPEEALELFQEMETQNVKPNGI 238

Query: 748  TMVSVLSACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKM 927
            TMV VLSACAK S+ E G+WVHSYI++N I  +L LSNA LDMYTKCG++E+AKR+FDKM
Sbjct: 239  TMVGVLSACAKKSDFEFGRWVHSYIERNRIGESLTLSNAMLDMYTKCGSVEDAKRLFDKM 298

Query: 928  CEKDTVSWTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFH 1107
             EKD VSWTTM+VGYAK+GE+  A+ +F+ MP+QDIAAWN+LISAYEQ G+PKEAL LFH
Sbjct: 299  PEKDIVSWTTMLVGYAKIGEYDAAQGIFDAMPNQDIAAWNALISAYEQCGKPKEALELFH 358

Query: 1108 ELQLDKNTKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKC 1287
            ELQL K  KPDEVTLVS LSACAQLGA++LGGWIHVYIKKQ  K+NCH+T+SLIDMY KC
Sbjct: 359  ELQLSKTAKPDEVTLVSTLSACAQLGAMDLGGWIHVYIKKQGMKLNCHLTTSLIDMYCKC 418

Query: 1288 GDLEKALEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVL 1467
            GDL+KAL VF S E+KDV+VWSAMIAGL+MHGHG DAI LF  MQE  VKPN VTFTN+L
Sbjct: 419  GDLQKALMVFHSVERKDVFVWSAMIAGLAMHGHGKDAIALFSKMQEDKVKPNAVTFTNIL 478

Query: 1468 CACSHAGLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPG 1647
            CACSH GLVEEGR  F+QM LV+GV P +KHY CMVD+LGRAGLLEEA+ LIEKMP+ P 
Sbjct: 479  CACSHVGLVEEGRTFFNQMELVYGVLPGVKHYACMVDILGRAGLLEEAVELIEKMPMAPA 538

Query: 1648 ASIWGALLGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKL 1827
            AS+WGALLGAC IHENV LAE AC++L+ELEP NHGAYVLLSNIYAK G+WD VS LRKL
Sbjct: 539  ASVWGALLGACTIHENVVLAEQACSQLIELEPGNHGAYVLLSNIYAKAGKWDRVSGLRKL 598

Query: 1828 MRDSGINKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQV 2007
            MRD G+ KEPGCSSIEVDG+VHEFLVGDNSHP + KIY+KL EI +RL+++GYVPNKS +
Sbjct: 599  MRDVGLKKEPGCSSIEVDGIVHEFLVGDNSHPSAKKIYAKLDEIVARLETIGYVPNKSHL 658

Query: 2008 LQDIEDDDVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYD 2187
            LQ +E++DVKEQAL+LHSEKLA+A+GLI T    P+RIVKNLRVCGDCHSVAKL+SK+YD
Sbjct: 659  LQLVEEEDVKEQALFLHSEKLAIAFGLISTGQSQPIRIVKNLRVCGDCHSVAKLVSKLYD 718

Query: 2188 REILLRDRYRFHHFKGGVCSCKDYW 2262
            REILLRDRYRFHHF+ G CSC DYW
Sbjct: 719  REILLRDRYRFHHFREGHCSCMDYW 743


>ref|XP_007206426.1| hypothetical protein PRUPE_ppa001946mg [Prunus persica]
            gi|462402068|gb|EMJ07625.1| hypothetical protein
            PRUPE_ppa001946mg [Prunus persica]
          Length = 738

 Score = 1065 bits (2753), Expect = 0.0
 Identities = 517/739 (69%), Positives = 616/739 (83%)
 Frame = +1

Query: 46   MTALNPPLISLPHHDHHITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFF 225
            M +L+ PLISLP H +  +   + +  F+ HP +SLIDQC+ I Q+KQ+HA MLR G+ F
Sbjct: 1    MASLSTPLISLPRHPNSSSPTFSTDLRFSSHPALSLIDQCTSIKQLKQVHAQMLRTGVLF 60

Query: 226  DPFSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFL 405
            DP+SAS+LIT SALS FSS LDYARQVFDQIP PN+YTWNTLIRAYASS DP ES ++FL
Sbjct: 61   DPYSASKLITASALSSFSS-LDYARQVFDQIPQPNVYTWNTLIRAYASSSDPAESILVFL 119

Query: 406  QMLHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASC 585
             ML  C + P+K+T+PF IKAASEL AL  G  FHGMAIK+SL SD++ILNSLVHFY SC
Sbjct: 120  DMLDHCSECPDKYTYPFAIKAASELRALQVGRGFHGMAIKASLGSDIYILNSLVHFYGSC 179

Query: 586  GNLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVL 765
            G+L+ A +VF+K PK+DVVSWNSMIT F Q +C +EALELF+ ME EN+KPNDVTMVSVL
Sbjct: 180  GDLDLARRVFMKTPKKDVVSWNSMITVFAQGNCPQEALELFKEMEAENVKPNDVTMVSVL 239

Query: 766  SACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTV 945
            SACAK  +LE G+WV S+I++NEI+ NL L+NA LDMY KCG++++AKR+FD+M EKD V
Sbjct: 240  SACAKKVDLEFGRWVCSHIQRNEIKENLTLNNAMLDMYVKCGSVDDAKRLFDRMPEKDIV 299

Query: 946  SWTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDK 1125
            SWTTM+ GYA++G +  A  +F  MPSQDIAAWN LIS+YEQSG+PKEAL++F+ELQ  K
Sbjct: 300  SWTTMLDGYAQLGNYEEAWRVFAAMPSQDIAAWNVLISSYEQSGKPKEALAVFNELQKSK 359

Query: 1126 NTKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKA 1305
            + KPDEVTLVS L+ACAQLGAI+LGGWIHVYIKKQ  K+NCH+T+SLIDMYAKCGDL+KA
Sbjct: 360  SPKPDEVTLVSTLAACAQLGAIDLGGWIHVYIKKQVMKLNCHLTTSLIDMYAKCGDLDKA 419

Query: 1306 LEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHA 1485
            LEVF S E++DV+VWSAMIAGL+MHG G DA++ F  M EA VKPN VTFTNVLCACSH 
Sbjct: 420  LEVFNSVERRDVFVWSAMIAGLAMHGQGRDALEFFSKMLEAKVKPNAVTFTNVLCACSHT 479

Query: 1486 GLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGA 1665
            GLV+EGR  F QM  V+GV P +KHY CMVD+LGR+G L+EA+ LIEKMPI P AS+WGA
Sbjct: 480  GLVDEGRTFFYQMEPVYGVVPGIKHYACMVDILGRSGNLDEAVELIEKMPIPPTASVWGA 539

Query: 1666 LLGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGI 1845
            LLGAC +H NV LAE AC+ LLEL+PRNHGAYVLLSNIYA+ G+WD VS LRK MRD+GI
Sbjct: 540  LLGACKLHGNVVLAEKACSHLLELDPRNHGAYVLLSNIYAETGKWDEVSGLRKHMRDAGI 599

Query: 1846 NKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIED 2025
             KEPGCSSIEV+G VHEFLVGDNSHPL  +IYSKL E+A RLKS GYVPNKS +LQ +E+
Sbjct: 600  KKEPGCSSIEVNGSVHEFLVGDNSHPLCKEIYSKLDEMALRLKSNGYVPNKSHLLQFVEE 659

Query: 2026 DDVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLR 2205
            +D+K+ AL LHSEKLA+A+GLI  SP  P+++VKNLRVCGDCHSVAKLISK+YDREILLR
Sbjct: 660  EDMKDHALILHSEKLAIAFGLISLSPSQPIQVVKNLRVCGDCHSVAKLISKLYDREILLR 719

Query: 2206 DRYRFHHFKGGVCSCKDYW 2262
            DRYRFHHF+ G CSC DYW
Sbjct: 720  DRYRFHHFRDGHCSCNDYW 738


>gb|EXC01449.1| hypothetical protein L484_022020 [Morus notabilis]
          Length = 739

 Score = 1041 bits (2693), Expect = 0.0
 Identities = 499/740 (67%), Positives = 610/740 (82%), Gaps = 1/740 (0%)
 Frame = +1

Query: 46   MTALNPPLISLPHHDHHITKPN-NNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLF 222
            M AL+ P++S PH     T    NN+  F ++P++SLI+QC+ + ++KQIHA MLR GLF
Sbjct: 1    MAALSVPVLSFPHQRKLPTSSTVNNDLRFPNYPLLSLIEQCTSLKELKQIHAQMLRTGLF 60

Query: 223  FDPFSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIF 402
            FDPFSAS+LIT+ A+S FSS LDYA QVFDQIP PN+YTWNT+IRAYASS DP +S ++F
Sbjct: 61   FDPFSASKLITVCAMSSFSS-LDYAHQVFDQIPKPNLYTWNTIIRAYASSSDPIQSIVVF 119

Query: 403  LQMLHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYAS 582
            L+ML QC + PNK+T+PF++KAASEL A   G  FHGM +KSSL SDVFILNSLVHFY S
Sbjct: 120  LRMLDQCCESPNKYTYPFVLKAASELKASRVGRGFHGMVMKSSLASDVFILNSLVHFYGS 179

Query: 583  CGNLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSV 762
            C +L+ AY+VF+ IP +DVVSWNSMI AFV+ DC +EA +LF+ ME EN+KPND+TMV V
Sbjct: 180  CDDLDSAYRVFLNIPSKDVVSWNSMIKAFVEGDCPDEAFQLFREMEMENLKPNDITMVGV 239

Query: 763  LSACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDT 942
            L AC K +++E G+W+ SYI++N I +NL L+NA LDMY KCG++E+AK +FDKM E+D 
Sbjct: 240  LCACGKKADIEFGRWLCSYIQRNGIAVNLTLNNAMLDMYVKCGSVEDAKELFDKMPERDV 299

Query: 943  VSWTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLD 1122
            VSWTTM+ GY +MG++  A  +F  MP+QDIAAWN LIS+YEQ+G PKEALS+FH+LQ+ 
Sbjct: 300  VSWTTMLDGYTRMGKYDEALRVFEAMPNQDIAAWNVLISSYEQNGMPKEALSVFHKLQVS 359

Query: 1123 KNTKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEK 1302
            K+ KPDEVTLVS LSAC+QLG+I+ G WIH+YIK+Q  K+NCH+T+SLIDMYAKCGDLEK
Sbjct: 360  KSAKPDEVTLVSSLSACSQLGSIDPGRWIHIYIKRQGIKLNCHLTTSLIDMYAKCGDLEK 419

Query: 1303 ALEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSH 1482
            ALEVF S E+KDVYVWSAMIAGL+MHG G  AIDLF  M +A VKPN VTFTN+LCACSH
Sbjct: 420  ALEVFDSVERKDVYVWSAMIAGLAMHGCGRAAIDLFYEMLKAKVKPNAVTFTNILCACSH 479

Query: 1483 AGLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWG 1662
             GL+EEG   F QM  V+ V P +KHY CMVDMLGR+G L++++  IEKMPI P ASIWG
Sbjct: 480  TGLLEEGTSLFYQMEPVYKVVPGVKHYACMVDMLGRSGRLKDSLEFIEKMPIPPTASIWG 539

Query: 1663 ALLGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSG 1842
            ALLGAC +H NV+LAEHAC +LLEL+PRNHGAYVLLSNIYA+  +WD VSRLRK MRDSG
Sbjct: 540  ALLGACRLHGNVELAEHACGQLLELDPRNHGAYVLLSNIYARTDKWDRVSRLRKAMRDSG 599

Query: 1843 INKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIE 2022
            I KEPGCSSIE++G+VHEFLVGDNSHPL   IY KL EIA+ LK++GYVPNKS +LQ +E
Sbjct: 600  IKKEPGCSSIEINGIVHEFLVGDNSHPLCKDIYEKLDEIAATLKAIGYVPNKSHLLQLVE 659

Query: 2023 DDDVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILL 2202
            ++D+KEQAL LHSEKLA+A+GLI T+P  P+R+VKNLRVCGDCH+VAKL+SK+Y REILL
Sbjct: 660  EEDMKEQALNLHSEKLAIAFGLISTAPSQPIRVVKNLRVCGDCHAVAKLVSKVYKREILL 719

Query: 2203 RDRYRFHHFKGGVCSCKDYW 2262
            RDRYRFHHFK G CSC +YW
Sbjct: 720  RDRYRFHHFKDGHCSCGEYW 739


>ref|XP_004295750.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 1049

 Score = 1036 bits (2678), Expect = 0.0
 Identities = 503/740 (67%), Positives = 604/740 (81%)
 Frame = +1

Query: 28   NVTKQIMTALNPPLISLPHHDHHITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHML 207
            NVT + M  L  P+ISLP H   +    NN+  F    ++ LIDQC+ +  +KQ+HA ML
Sbjct: 55   NVTAEDMATLGTPVISLPRHPPTV----NNDLRFPTQLLLPLIDQCTTLNHLKQVHAQML 110

Query: 208  RIGLFFDPFSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTE 387
            +  LFFDP+SAS+LIT +ALSPFSS LDYARQVFD+IP PN++TWN LIRAYASSPDP E
Sbjct: 111  KTSLFFDPYSASKLITAAALSPFSS-LDYARQVFDEIPEPNLFTWNALIRAYASSPDPVE 169

Query: 388  SFIIFLQMLHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLV 567
            S  IFLQML +C + PNKFTFPFL+KAASEL A   G  FHGM +K+ L SDV+I+NSL+
Sbjct: 170  SIRIFLQMLDECNECPNKFTFPFLLKAASELRASKIGRGFHGMVVKAELGSDVYIVNSLI 229

Query: 568  HFYASCGNLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDV 747
            HFY SCG L+ A  VF+K  K+DVVSWNS+ITAF Q +C E ALELF+ ME EN+KPNDV
Sbjct: 230  HFYGSCGELDLARLVFLKSYKKDVVSWNSVITAFAQGNCPEVALELFKEMEAENMKPNDV 289

Query: 748  TMVSVLSACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKM 927
            T+VSVLSACAK ++LE G+WV S+++++ +E NL L+NA LDMY KCG++E+A+R+F +M
Sbjct: 290  TLVSVLSACAKMADLEFGRWVCSHVERHGVEENLTLNNAMLDMYAKCGSVEDAERLFGRM 349

Query: 928  CEKDTVSWTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFH 1107
             EKD VSWTTM+ GYA+MG +  AR +F  MPSQDIA WN LIS+YEQ+G+PKEAL++FH
Sbjct: 350  PEKDVVSWTTMLDGYARMGNYDEARRVFGTMPSQDIATWNVLISSYEQNGKPKEALAVFH 409

Query: 1108 ELQLDKNTKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKC 1287
            ELQ +K  KPDEVTLVS L+AC+QLGAI+LGGWIHVY+KKQ  K+NCH+T+SLIDMYAKC
Sbjct: 410  ELQKNKGPKPDEVTLVSTLAACSQLGAIDLGGWIHVYVKKQGMKLNCHLTTSLIDMYAKC 469

Query: 1288 GDLEKALEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVL 1467
            G+LEKALEVF S+E +DV+VWSAMIA L+MHG G DA+  F  M EA VKPN VTFTN+L
Sbjct: 470  GNLEKALEVFNSAETRDVFVWSAMIAALAMHGQGRDALHFFSKMLEAKVKPNAVTFTNIL 529

Query: 1468 CACSHAGLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPG 1647
            CACSHAGLV EGR  F+QM  V+GV P +KHY CMVD+LGR+G LEEA  LIEKMPI P 
Sbjct: 530  CACSHAGLVNEGRTVFNQMEQVYGVVPGIKHYACMVDILGRSGNLEEAAELIEKMPISPT 589

Query: 1648 ASIWGALLGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKL 1827
             S+WGALLGAC  HENV LAE AC+ LL+L+PRNHGAYVLLSN+YAK G+W+ VS LRKL
Sbjct: 590  PSVWGALLGACTRHENVALAEKACSHLLDLDPRNHGAYVLLSNVYAKTGKWEAVSGLRKL 649

Query: 1828 MRDSGINKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQV 2007
            MRDSGI KEPGCSSIE+DG VHEFLVGDN+HPLS  IYSKL EIA RLKS+GYVPNKS +
Sbjct: 650  MRDSGIKKEPGCSSIEIDGSVHEFLVGDNTHPLSKDIYSKLDEIAGRLKSIGYVPNKSHL 709

Query: 2008 LQDIEDDDVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYD 2187
            LQ +E++D+KE AL LHSEKLA+A+GLI + P  P+R+VKNLRVCGDCHSVAKLISK+Y+
Sbjct: 710  LQFVEEEDMKEHALILHSEKLAIAFGLISSKPSQPIRVVKNLRVCGDCHSVAKLISKLYN 769

Query: 2188 REILLRDRYRFHHFKGGVCS 2247
            REI LRDRYRFHHF+ G CS
Sbjct: 770  REIFLRDRYRFHHFREGHCS 789


>ref|XP_007016122.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
            cacao] gi|508786485|gb|EOY33741.1| Tetratricopeptide
            repeat (TPR)-like superfamily protein [Theobroma cacao]
          Length = 733

 Score = 1023 bits (2646), Expect = 0.0
 Identities = 487/739 (65%), Positives = 605/739 (81%)
 Frame = +1

Query: 46   MTALNPPLISLPHHDHHITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFF 225
            M  L   L+SLP H    T  N++     D P++S I+QC+++ Q+KQIHA MLR GLFF
Sbjct: 1    METLGTRLLSLPRHPTSKTVNNDH-----DDPVLSRINQCTNLNQLKQIHAQMLRTGLFF 55

Query: 226  DPFSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFL 405
            +P+SAS+L   SALSPFSS LDYAR+VFDQIP PN+YTWNTLIR YAS P+P +  +IFL
Sbjct: 56   NPYSASKLFAASALSPFSS-LDYARKVFDQIPKPNLYTWNTLIRVYASGPEPLQGILIFL 114

Query: 406  QMLHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASC 585
            +M+ + P  PNKFTFPF+IKAA+E++++  G+  HGM IK+SL +DVFI NSL+H Y SC
Sbjct: 115  RMVDESPYYPNKFTFPFVIKAAAEIVSVCVGQALHGMVIKASLGADVFISNSLIHLYLSC 174

Query: 586  GNLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVL 765
            G+L+ AY+VF+ I ++DVVSWNS+IT   Q  C E+ALELF+ M+ E++KPNDVTMV VL
Sbjct: 175  GDLDSAYRVFMMIGEKDVVSWNSLITGLAQKGCAEKALELFRRMDAESVKPNDVTMVGVL 234

Query: 766  SACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTV 945
            SAC K  +LE G+WV SYI++N I +NL LSNA LDMY KCG+LE+AKR+FD M EKD V
Sbjct: 235  SACTKKLDLEFGRWVCSYIERNGISVNLTLSNAMLDMYAKCGSLEDAKRLFDMMEEKDIV 294

Query: 946  SWTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDK 1125
            +WTTM+ GYAK+GE+  AR + ++MP QDIAAWN+LIS YEQ+G+PKEAL+++HEL+L K
Sbjct: 295  TWTTMLDGYAKLGEYEAARRVLDIMPRQDIAAWNALISGYEQNGKPKEALAIYHELKLSK 354

Query: 1126 NTKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKA 1305
              KPDE+TLVS LSACAQLGA+++G  IH Y+K+Q  ++NCH+T+SLIDMY+KCGD+ KA
Sbjct: 355  IAKPDEITLVSTLSACAQLGAMDIGRGIHAYVKEQGIQLNCHLTTSLIDMYSKCGDVNKA 414

Query: 1306 LEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHA 1485
            LEVF S E++DV+VWSAMIAGL+MHGHG  AIDLF  MQEA +KPN VTFTNVLCACSHA
Sbjct: 415  LEVFYSVERRDVFVWSAMIAGLAMHGHGRAAIDLFSRMQEATMKPNSVTFTNVLCACSHA 474

Query: 1486 GLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGA 1665
            GLV+EG+   +QM  V+G+ P+++HY CMVD+LGRAG  EEA+  IEKMPIVP  S+WGA
Sbjct: 475  GLVKEGKTFLNQMEPVYGIPPEVQHYSCMVDILGRAGRFEEAVEFIEKMPIVPSDSVWGA 534

Query: 1666 LLGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGI 1845
            LLGAC IH NV+LAE AC+RLLEL+P NHGAYVLLSN+YAK G+WD VSRLRK MR +G+
Sbjct: 535  LLGACQIHGNVELAEKACSRLLELDPGNHGAYVLLSNVYAKTGKWDSVSRLRKHMRVTGL 594

Query: 1846 NKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIED 2025
             KE GCS+IEV+GVVHEFL GDN HPLS +IYSKL EI +RLKSVGYVP KS +LQ IE+
Sbjct: 595  KKEQGCSTIEVNGVVHEFLAGDNRHPLSKEIYSKLDEIVARLKSVGYVPKKSHLLQLIEE 654

Query: 2026 DDVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLR 2205
            DD++E AL LHSEKLA+A+GL+Y   P P+RI+KNLRVCGDCHSVAKL+S++Y+REI+LR
Sbjct: 655  DDLQEHALKLHSEKLAIAFGLLYMEAPQPIRIIKNLRVCGDCHSVAKLVSRLYNREIILR 714

Query: 2206 DRYRFHHFKGGVCSCKDYW 2262
            DRYRFHHF GG CSCKDYW
Sbjct: 715  DRYRFHHFGGGHCSCKDYW 733


>ref|XP_006480615.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
            chloroplastic-like [Citrus sinensis]
          Length = 746

 Score = 1014 bits (2622), Expect = 0.0
 Identities = 495/741 (66%), Positives = 597/741 (80%)
 Frame = +1

Query: 40   QIMTALNPPLISLPHHDHHITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGL 219
            Q M  L+ P+ISLP H +  T   NN      HP+ SLI QC +I Q+KQIH  MLR GL
Sbjct: 8    QHMETLSTPVISLPRHPNPTTLTVNNGHQHHPHPVFSLIKQCKNIKQLKQIHTQMLRTGL 67

Query: 220  FFDPFSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFII 399
            FFDP+SAS+L T  AL  FSS L+YAR++FDQIP PN+YTWNTLIRAY+SS +P +SF+I
Sbjct: 68   FFDPYSASKLFTPCALGTFSS-LEYAREMFDQIPQPNLYTWNTLIRAYSSSAEPIQSFMI 126

Query: 400  FLQMLHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYA 579
            FLQ+++  P  PN+FTFPF+IKAA+ L+    G+  HGM IKSS + D+FI NSL+HFYA
Sbjct: 127  FLQLVYNSPYFPNEFTFPFVIKAAARLVQFRVGQAIHGMVIKSSFEDDLFISNSLIHFYA 186

Query: 580  SCGNLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVS 759
             CG+L  AY VFV I K+DVVSWNSMI+ FVQ    E+A+EL++ ME EN+KP++VTMV+
Sbjct: 187  ICGDLAMAYCVFVMIGKKDVVSWNSMISGFVQGGFFEKAIELYREMEMENVKPDEVTMVA 246

Query: 760  VLSACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKD 939
            VLSACAK  +LE G+WV SYI+KN I+M+L LSNA LDMY KCG+LE+AK +FDKM EKD
Sbjct: 247  VLSACAKKRDLEFGRWVCSYIEKNGIKMDLTLSNAMLDMYVKCGSLEDAKSLFDKMEEKD 306

Query: 940  TVSWTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQL 1119
             VSWTTMI GYAK+GEF  A  +   +P Q IA WN+LISAYEQ+G+P EALS+FHE QL
Sbjct: 307  IVSWTTMIDGYAKLGEFDAAMSVLAAVPIQQIATWNALISAYEQNGKPNEALSIFHE-QL 365

Query: 1120 DKNTKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLE 1299
             KN  PDE T VSVLSACAQLGA+++G  IH  +KKQ  K+NC++T+SLIDMY KCG+L+
Sbjct: 366  SKNVNPDEFTFVSVLSACAQLGAMDIGVQIHAKMKKQGIKLNCYLTTSLIDMYTKCGNLD 425

Query: 1300 KALEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACS 1479
            KALEVF + + +DV+VWS MIAG +M+G G DA+DLF  MQEA VKPN VTFTNVLCACS
Sbjct: 426  KALEVFHTVKSRDVFVWSTMIAGFAMYGRGRDALDLFSRMQEAKVKPNAVTFTNVLCACS 485

Query: 1480 HAGLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIW 1659
            H+GLV+EGR+ F+QM  V+GV P +KHY CMVDMLGRAGLL+EA+  IEKMPIVPGAS+W
Sbjct: 486  HSGLVDEGRMFFNQMEPVYGVVPGVKHYTCMVDMLGRAGLLDEAVEFIEKMPIVPGASVW 545

Query: 1660 GALLGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDS 1839
            GALLGAC IHENV+LAE+AC+ LLELEP NHGA VLLSNIYAK G+WD VS LRK MR S
Sbjct: 546  GALLGACKIHENVELAEYACSHLLELEPENHGALVLLSNIYAKTGKWDNVSELRKHMRVS 605

Query: 1840 GINKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDI 2019
            G+ KEPGCSSIEV+G +H+FL G++SHPL  +IYSKL EI +RLKS GYVPN+S +LQ +
Sbjct: 606  GLKKEPGCSSIEVNGEIHKFLAGESSHPLCKEIYSKLDEIVARLKSFGYVPNRSHLLQLV 665

Query: 2020 EDDDVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREIL 2199
            E++DV+EQAL LHSE+LA+AYGLI   P  P+RIVKNLRVCGDCHSVAKLISK+Y+REIL
Sbjct: 666  EEEDVQEQALNLHSERLAIAYGLISVEPSQPIRIVKNLRVCGDCHSVAKLISKLYNREIL 725

Query: 2200 LRDRYRFHHFKGGVCSCKDYW 2262
            LRDRYRFHHF GG CSC DYW
Sbjct: 726  LRDRYRFHHFSGGNCSCMDYW 746


>ref|XP_004145320.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
            chloroplastic-like [Cucumis sativus]
            gi|449470513|ref|XP_004152961.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g29760,
            chloroplastic-like [Cucumis sativus]
            gi|449523079|ref|XP_004168552.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g29760,
            chloroplastic-like [Cucumis sativus]
          Length = 733

 Score = 1013 bits (2618), Expect = 0.0
 Identities = 495/739 (66%), Positives = 597/739 (80%)
 Frame = +1

Query: 46   MTALNPPLISLPHHDHHITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFF 225
            M AL+ P ISL +         NNN  F +H I+S ID+CS   Q+K++HA MLR GLFF
Sbjct: 1    MEALSVPSISLQNFS-----TLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFF 55

Query: 226  DPFSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFL 405
            DPFSAS+L T SALS FS+ LDYAR +FDQIP PN+YTWNTLIRAYASS DP +SF+IFL
Sbjct: 56   DPFSASKLFTASALSSFST-LDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFL 114

Query: 406  QMLHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASC 585
             +L +C D PNKFTFPF+IKAASEL A   G   HGMAIK S   D++ILNSLV FY +C
Sbjct: 115  DLLDKCEDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGAC 174

Query: 586  GNLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVL 765
            G+L+ A ++F  I  +DVVSWNSMI+AF Q +C E+ALELF  ME EN+ PN VTMV VL
Sbjct: 175  GDLSMAERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVL 234

Query: 766  SACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTV 945
            SACAK  +LE G+WV SYI++  I+++L L NA LDMYTKCG++++A+++FD+M E+D  
Sbjct: 235  SACAKKLDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVF 294

Query: 946  SWTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDK 1125
            SWT M+ GYAKMG++  AR +FN MP ++IAAWN LISAYEQ+G+PKEAL++F+ELQL K
Sbjct: 295  SWTIMLDGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSK 354

Query: 1126 NTKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKA 1305
              KPDEVTLVS LSACAQLGAI+LGGWIHVYIK++   +NCH+ SSL+DMYAKCG LEKA
Sbjct: 355  IAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKA 414

Query: 1306 LEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHA 1485
            LEVF S E++DVYVWSAMIAGL MHG G  AIDLF  MQEA VKPN VTFTNVLCACSHA
Sbjct: 415  LEVFYSVEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHA 474

Query: 1486 GLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGA 1665
            GLV+EGR+ F +M  V+GV P++KHY CMVD+LGRAG LEEAM LI +M   P AS+WGA
Sbjct: 475  GLVDEGRVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGA 534

Query: 1666 LLGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGI 1845
            LLGAC +H NV+L E A ++LL+LEPRNHGA VLLSNIYAK GRW+ VS LRKLMRD+ +
Sbjct: 535  LLGACSLHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTEL 594

Query: 1846 NKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIED 2025
             KEPGCSSIE +G VHEFLVGDN+HPLS+ IYSKL EIA++LKSVGY PNKS +LQ IE+
Sbjct: 595  KKEPGCSSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEE 654

Query: 2026 DDVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLR 2205
            DD+KEQAL LHSEKLA+A+GL+  +P  P+R+VKNLR+CGDCH+ AKL+S++YDR+ILLR
Sbjct: 655  DDLKEQALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLR 714

Query: 2206 DRYRFHHFKGGVCSCKDYW 2262
            DRYRFHHF+ G CSC DYW
Sbjct: 715  DRYRFHHFRDGHCSCMDYW 733


>ref|XP_006428806.1| hypothetical protein CICLE_v10011151mg [Citrus clementina]
            gi|557530863|gb|ESR42046.1| hypothetical protein
            CICLE_v10011151mg [Citrus clementina]
          Length = 737

 Score = 1011 bits (2614), Expect = 0.0
 Identities = 493/739 (66%), Positives = 594/739 (80%)
 Frame = +1

Query: 46   MTALNPPLISLPHHDHHITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFF 225
            M  L+ P+ISLP H +  T   NN      HP+ SLI QC +I Q+KQIH  MLR GLFF
Sbjct: 1    METLSTPVISLPRHPNTTTLTVNNGHQHHPHPVFSLIKQCKNIKQLKQIHTQMLRTGLFF 60

Query: 226  DPFSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFL 405
            DP+SAS+L T  AL  FSS L+YAR++FDQIP PN+YTWNTLIRAY+SS +P +SF+IFL
Sbjct: 61   DPYSASKLFTPCALGTFSS-LEYAREMFDQIPQPNLYTWNTLIRAYSSSAEPIQSFMIFL 119

Query: 406  QMLHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASC 585
            Q+++  P  PN+FTFPF+IKAA+ L+    G+  HGM IKSS + D+FI NSL+HFYA C
Sbjct: 120  QLVYNSPYFPNEFTFPFVIKAAARLVQFRVGQAIHGMVIKSSFEDDLFISNSLIHFYAIC 179

Query: 586  GNLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVL 765
            G+L  AY VFV I K+DVVSWNSMI+ FVQ    E+A+EL++ ME EN+KP++VTMV+VL
Sbjct: 180  GDLAMAYCVFVMIGKKDVVSWNSMISGFVQGGFFEKAIELYREMEMENVKPDEVTMVAVL 239

Query: 766  SACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTV 945
            SACAK  +LE G+WV SYI+KN I+M+L LSNA LDMY KCG+LE+AK +FDKM EKD V
Sbjct: 240  SACAKKRDLEFGRWVCSYIEKNGIKMDLTLSNAMLDMYVKCGSLEDAKSLFDKMEEKDIV 299

Query: 946  SWTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDK 1125
            SWTTMI GYAK+GEF  A  +   +P Q IA WN+LISAYEQ+G+P EALS+FHE QL K
Sbjct: 300  SWTTMIDGYAKLGEFDAAMSVLAAVPIQQIATWNALISAYEQNGKPNEALSIFHE-QLSK 358

Query: 1126 NTKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKA 1305
            N  PDE T VSVLSACAQLGA+++G  IH  +KKQ  K+NC++T+SLIDMY KCG+L+KA
Sbjct: 359  NVNPDEFTFVSVLSACAQLGAMDIGVQIHAKMKKQGIKLNCYLTTSLIDMYTKCGNLDKA 418

Query: 1306 LEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHA 1485
            LEVF + + +DV+VWS MIAG +M+G G DA+DLF  MQEA VKPN VTFTNVLCACSH+
Sbjct: 419  LEVFHTVKSRDVFVWSTMIAGFAMYGRGRDALDLFSRMQEAKVKPNAVTFTNVLCACSHS 478

Query: 1486 GLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGA 1665
            GLV+EGR+ F+QM  V+G  P +KHY CMVDMLGRAGLL EA+  IEKMPIVPGAS+WGA
Sbjct: 479  GLVDEGRMFFNQMEPVYGAVPGVKHYTCMVDMLGRAGLLNEAVEFIEKMPIVPGASVWGA 538

Query: 1666 LLGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGI 1845
            LLGAC IHENV+LAE+AC+ LLELEP NHGA VLLSNIYAK G+WD VS LRK MR SG+
Sbjct: 539  LLGACKIHENVELAEYACSHLLELEPENHGALVLLSNIYAKTGKWDNVSELRKHMRVSGL 598

Query: 1846 NKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIED 2025
             KEPGCSSIEV+G +H+FL G++SHPL  +IYSKL EI +RLKS GYVPN+S +LQ +E+
Sbjct: 599  KKEPGCSSIEVNGEIHKFLAGESSHPLCKEIYSKLDEIVARLKSFGYVPNRSHLLQLVEE 658

Query: 2026 DDVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLR 2205
            +DV+EQAL LHSE+LA+AYGLI   P  P+RIVKNLRVCGDCHSVAKLISK+Y+REILLR
Sbjct: 659  EDVQEQALNLHSERLAIAYGLISVEPSQPIRIVKNLRVCGDCHSVAKLISKLYNREILLR 718

Query: 2206 DRYRFHHFKGGVCSCKDYW 2262
            DRYRFHHF GG CSC DYW
Sbjct: 719  DRYRFHHFSGGNCSCMDYW 737


>ref|XP_002314110.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222850518|gb|EEE88065.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 738

 Score = 1003 bits (2593), Expect = 0.0
 Identities = 489/739 (66%), Positives = 599/739 (81%)
 Frame = +1

Query: 46   MTALNPPLISLPHHDHHITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFF 225
            M  L  PL S+P   +      NN +      +  LID+C++   +KQ+HAHMLR GLFF
Sbjct: 1    MATLGNPLASVPISSNPTILTANNEQKSNPSTVPILIDKCANKKHLKQLHAHMLRTGLFF 60

Query: 226  DPFSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFL 405
            DP SA++L T  ALS   SSLDYA +VFDQIP PN+YTWNTLIRA+ASSP P +  ++F+
Sbjct: 61   DPPSATKLFTACALSS-PSSLDYACKVFDQIPRPNLYTWNTLIRAFASSPKPIQGLLVFI 119

Query: 406  QMLHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASC 585
            QMLH+    PN +TFPF+IKAA+E+ +L  G+  HGM +K+S  SD+FI NSL+HFY+S 
Sbjct: 120  QMLHESQRFPNSYTFPFVIKAATEVSSLLAGQAIHGMVMKASFGSDLFISNSLIHFYSSL 179

Query: 586  GNLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVL 765
            G+L+ AY VF KI ++D+VSWNSMI+ FVQ    EEAL+LF+ M+ EN +PN VTMV VL
Sbjct: 180  GDLDSAYLVFSKIVEKDIVSWNSMISGFVQGGSPEEALQLFKRMKMENARPNRVTMVGVL 239

Query: 766  SACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTV 945
            SACAK  +LE G+W   YI++N I++NLILSNA LDMY KCG+LE+A+R+FDKM EKD V
Sbjct: 240  SACAKRIDLEFGRWACDYIERNGIDINLILSNAMLDMYVKCGSLEDARRLFDKMEEKDIV 299

Query: 946  SWTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDK 1125
            SWTTMI GYAK+G++  AR +F++MP +DI AWN+LIS+Y+Q+G+PKEAL++F ELQL+K
Sbjct: 300  SWTTMIDGYAKVGDYDAARRVFDVMPREDITAWNALISSYQQNGKPKEALAIFRELQLNK 359

Query: 1126 NTKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKA 1305
            NTKP+EVTL S L+ACAQLGA++LGGWIHVYIKKQ  K+N HIT+SLIDMY+KCG LEKA
Sbjct: 360  NTKPNEVTLASTLAACAQLGAMDLGGWIHVYIKKQGIKLNFHITTSLIDMYSKCGHLEKA 419

Query: 1306 LEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHA 1485
            LEVF S E++DV+VWSAMIAGL+MHGHG  AIDLF  MQE  VKPN VTFTN+LCACSH+
Sbjct: 420  LEVFYSVERRDVFVWSAMIAGLAMHGHGRAAIDLFSKMQETKVKPNAVTFTNLLCACSHS 479

Query: 1486 GLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGA 1665
            GLV+EGRL F+QM  V+GV P  KHY CMVD+LGRAG LEEA+ LIEKMPIVP AS+WGA
Sbjct: 480  GLVDEGRLFFNQMRPVYGVVPGSKHYACMVDILGRAGCLEEAVELIEKMPIVPSASVWGA 539

Query: 1666 LLGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGI 1845
            LLGAC I+ NV+LAE AC+RLLE +  NHGAYVLLSNIYAK G+WD VSRLR+ M+ SG+
Sbjct: 540  LLGACRIYGNVELAEMACSRLLETDSNNHGAYVLLSNIYAKAGKWDCVSRLRQHMKVSGL 599

Query: 1846 NKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIED 2025
             KEPGCSSIEV+G++HEFLVGDNSHPLST+IYSKL EI +R+KS GYV ++S +LQ +E+
Sbjct: 600  EKEPGCSSIEVNGIIHEFLVGDNSHPLSTEIYSKLDEIVARIKSTGYVSDESHLLQFVEE 659

Query: 2026 DDVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLR 2205
            + +KE AL LHSEKLA+AYGLI   P  P+RIVKNLRVCGDCHSVAKLISK+Y+R+ILLR
Sbjct: 660  EYMKEHALNLHSEKLAIAYGLIRMEPSQPIRIVKNLRVCGDCHSVAKLISKLYNRDILLR 719

Query: 2206 DRYRFHHFKGGVCSCKDYW 2262
            DRYRFHHF GG CSC DYW
Sbjct: 720  DRYRFHHFSGGNCSCMDYW 738


>ref|XP_004241167.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
            chloroplastic-like [Solanum lycopersicum]
          Length = 744

 Score =  999 bits (2582), Expect = 0.0
 Identities = 480/738 (65%), Positives = 593/738 (80%), Gaps = 6/738 (0%)
 Frame = +1

Query: 67   LISLPHHDHH-----ITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFFDP 231
            ++ LP H H      I+K   N+RYF +HP++ LID+   I Q+KQIHA+MLRIGLFFDP
Sbjct: 8    VLPLPRHQHFPKPNPISKTVINDRYFENHPLVLLIDKSQSINQLKQIHAYMLRIGLFFDP 67

Query: 232  FSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFLQM 411
            FSAS+LI  S+LS FSS LDYA +VFD+IP PN+++WN LIRAY+SS DP +S ++F+ M
Sbjct: 68   FSASKLIEASSLSHFSS-LDYAHKVFDEIPQPNLFSWNALIRAYSSSQDPIQSILMFVNM 126

Query: 412  LHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKS-SLDSDVFILNSLVHFYASCG 588
            L +  + P+KFT+PF+ KA++++ A+  G   HGM +K   +  D+F+LNSL+HFYA CG
Sbjct: 127  LCEGREFPSKFTYPFVFKASAKMKAIRFGRGLHGMVVKGRDVGLDIFVLNSLIHFYADCG 186

Query: 589  NLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVLS 768
             L++AY +F  +  RDVVSWN+MI  F +    +EAL++F  M EEN++PNDVTM++VLS
Sbjct: 187  CLDEAYLIFENMQTRDVVSWNTMILGFAEGGYADEALKIFHRMGEENVRPNDVTMMAVLS 246

Query: 769  ACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTVS 948
            ACAK  +LE G+WVH++IK+N I  +LIL NA LDMY KCG++E+A+R+F KM EKD VS
Sbjct: 247  ACAKKLDLEFGRWVHAFIKRNGIRESLILDNAILDMYMKCGSIEDAERLFRKMGEKDIVS 306

Query: 949  WTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDKN 1128
            WTTM+VGYA+ G F  AR + N MPSQDI AWN+LISAYEQSG+PKEALS+F+ELQL K 
Sbjct: 307  WTTMLVGYARAGNFNAARSILNTMPSQDIVAWNALISAYEQSGKPKEALSVFNELQLIKK 366

Query: 1129 TKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKAL 1308
             +PDEVTLV  LSACAQLGAI+LGGWIHVYIKKQ  K NCH+T++LIDMY+KCGD+EKAL
Sbjct: 367  AEPDEVTLVCALSACAQLGAIDLGGWIHVYIKKQGIKFNCHLTTALIDMYSKCGDVEKAL 426

Query: 1309 EVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHAG 1488
            E+F S   +DV+VWSAMIAGL+MHG G +AI LFL MQE  VKPN VT  NVLCACSH+G
Sbjct: 427  EMFDSVNIRDVFVWSAMIAGLAMHGRGKEAISLFLKMQEHKVKPNSVTLINVLCACSHSG 486

Query: 1489 LVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGAL 1668
            LVEEGR  F+QM  V+G+ P +KHY C+VD+LGRAG LE A  LI  MP+ PG S+WGAL
Sbjct: 487  LVEEGRAIFNQMEYVYGIVPGVKHYACLVDILGRAGELEVAEKLINNMPVTPGPSVWGAL 546

Query: 1669 LGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGIN 1848
            LGAC +H N++LAE ACNRL+ELEP NHGAYVLLSNIYAK G+WD VS LRK MR+ G+ 
Sbjct: 547  LGACRLHGNLELAEQACNRLVELEPENHGAYVLLSNIYAKSGKWDEVSMLRKRMRECGLK 606

Query: 1849 KEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIEDD 2028
            KEPGCSSIEV  +VHEFLVGDN+HP S KIY+KL EIA+RLK VGYV NKSQ+LQ +E++
Sbjct: 607  KEPGCSSIEVHSIVHEFLVGDNTHPQSQKIYAKLDEIAARLKHVGYVSNKSQILQLVEEE 666

Query: 2029 DVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLRD 2208
            D++EQAL LHSEKLAMA+GLI  +P  P+RIVKNLRVC DCH+VAKL+SK+YDREI+LRD
Sbjct: 667  DMQEQALNLHSEKLAMAFGLISVAPSQPIRIVKNLRVCADCHAVAKLLSKLYDREIILRD 726

Query: 2209 RYRFHHFKGGVCSCKDYW 2262
            RYRFHHFK G CSCKDYW
Sbjct: 727  RYRFHHFKEGNCSCKDYW 744


>ref|XP_006350917.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
            chloroplastic-like [Solanum tuberosum]
          Length = 744

 Score =  994 bits (2570), Expect = 0.0
 Identities = 477/738 (64%), Positives = 593/738 (80%), Gaps = 6/738 (0%)
 Frame = +1

Query: 67   LISLPHHDHH-----ITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFFDP 231
            ++ LP H H      I+K   N+RYF +HP++ LID+C  I Q+KQIHA+MLRIGLF DP
Sbjct: 8    VLPLPRHQHFPKPNPISKTVINDRYFENHPLVLLIDKCQSIKQLKQIHAYMLRIGLFSDP 67

Query: 232  FSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFLQM 411
            FSAS+LI  S+LS FSS LDYA +VFD+IP PN+++WN LIRAY+SS DP +S ++F+ M
Sbjct: 68   FSASKLIEASSLSHFSS-LDYAHKVFDEIPQPNLFSWNALIRAYSSSQDPIQSILMFVNM 126

Query: 412  LHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKS-SLDSDVFILNSLVHFYASCG 588
            + +  + P+KFT+PF+ KA++++ AL  G   HGM +K   +  D+F+LNSL+HFYA CG
Sbjct: 127  ICEGREFPSKFTYPFVFKASAKMKALRFGRGLHGMVVKGRDVGLDIFVLNSLIHFYADCG 186

Query: 589  NLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVLS 768
             L++AY VF  +  RDVVSWN+MI  F +    +EAL++F  M EEN++PN VTM++VLS
Sbjct: 187  CLDEAYLVFENMQTRDVVSWNTMILGFAEGGYADEALKMFHRMGEENVRPNGVTMMAVLS 246

Query: 769  ACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTVS 948
            AC K  +LE G+WVH +IK+N I  +LIL NA LDMY KCG++E+A+R+F KM EKD VS
Sbjct: 247  ACGKKLDLEFGRWVHVFIKRNGIRESLILDNAILDMYMKCGSIEDAERLFHKMGEKDIVS 306

Query: 949  WTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDKN 1128
            WTTM+VGYA+ G F  AR + N MPSQDIAAWN+LISAYEQSG+PKEALS+F+ELQL K 
Sbjct: 307  WTTMLVGYARAGNFNAARSILNTMPSQDIAAWNALISAYEQSGKPKEALSVFNELQLIKK 366

Query: 1129 TKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKAL 1308
             +PDEVTLV  LSACAQLGAI+LGGWIHVYIKKQ  K+NCH+T++LIDMY+KCGD+EKAL
Sbjct: 367  AEPDEVTLVCALSACAQLGAIDLGGWIHVYIKKQGIKLNCHLTTALIDMYSKCGDVEKAL 426

Query: 1309 EVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHAG 1488
            E+F S   +DV+VWSAM+AGL+MHG G +AI LFL MQE  VKPN VT  NVLCACSH+G
Sbjct: 427  EMFDSVNIRDVFVWSAMVAGLAMHGRGKEAISLFLKMQEHKVKPNSVTLINVLCACSHSG 486

Query: 1489 LVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGAL 1668
            LVEEGR  F+QM  ++G+ P +KHY C+VD+LGRAG LEEA  LI  MP+ PG S+WGAL
Sbjct: 487  LVEEGREIFNQMENIYGIVPGVKHYACLVDILGRAGELEEAEELINNMPVTPGPSVWGAL 546

Query: 1669 LGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGIN 1848
            LGAC +H N++LAE ACNRL+ELEP NHGAYVLLSNIYAK G+WD VS LRK M++ G+ 
Sbjct: 547  LGACKLHGNLELAEQACNRLVELEPENHGAYVLLSNIYAKSGKWDEVSLLRKHMKECGLK 606

Query: 1849 KEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIEDD 2028
            KEPGCSSIEV  +VHEFLVGDNSHP S KIY+KL EIA+RLK VGYV NKSQ+LQ +E++
Sbjct: 607  KEPGCSSIEVHSIVHEFLVGDNSHPQSQKIYAKLDEIAARLKHVGYVSNKSQILQLVEEE 666

Query: 2029 DVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLRD 2208
            D++EQAL LHSEKLAMA+GLI  +P  P+R+VKNLRVC DCH+VAKL+SK+Y+REI+LRD
Sbjct: 667  DMQEQALNLHSEKLAMAFGLISVAPSQPIRVVKNLRVCADCHAVAKLLSKLYNREIILRD 726

Query: 2209 RYRFHHFKGGVCSCKDYW 2262
            RYRFHHFK G CSCKDYW
Sbjct: 727  RYRFHHFKEGNCSCKDYW 744


>ref|NP_180537.1| RNA editing factor OTP81 [Arabidopsis thaliana]
            gi|75100656|sp|O82380.1|PP175_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g29760, chloroplastic; Flags: Precursor
            gi|3582328|gb|AAC35225.1| hypothetical protein
            [Arabidopsis thaliana] gi|330253207|gb|AEC08301.1| RNA
            editing factor OTP81 [Arabidopsis thaliana]
          Length = 738

 Score =  936 bits (2420), Expect = 0.0
 Identities = 453/731 (61%), Positives = 568/731 (77%)
 Frame = +1

Query: 70   ISLPHHDHHITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFFDPFSASRL 249
            +SLP H +  + PN           ISLI++C  + Q+KQ H HM+R G F DP+SAS+L
Sbjct: 10   LSLPRHPN-FSNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKL 68

Query: 250  ITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFLQMLHQCPD 429
              ++ALS F+S L+YAR+VFD+IP PN + WNTLIRAYAS PDP  S   FL M+ +   
Sbjct: 69   FAMAALSSFAS-LEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQC 127

Query: 430  PPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASCGNLNQAYK 609
             PNK+TFPFLIKAA+E+ +LS G+  HGMA+KS++ SDVF+ NSL+H Y SCG+L+ A K
Sbjct: 128  YPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACK 187

Query: 610  VFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVLSACAKSSN 789
            VF  I ++DVVSWNSMI  FVQ    ++ALELF+ ME E++K + VTMV VLSACAK  N
Sbjct: 188  VFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRN 247

Query: 790  LELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTVSWTTMIVG 969
            LE G+ V SYI++N + +NL L+NA LDMYTKCG++E+AKR+FD M EKD V+WTTM+ G
Sbjct: 248  LEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDG 307

Query: 970  YAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDKNTKPDEVT 1149
            YA   ++  AR + N MP +DI AWN+LISAYEQ+G+P EAL +FHELQL KN K +++T
Sbjct: 308  YAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQIT 367

Query: 1150 LVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKALEVFRSSE 1329
            LVS LSACAQ+GA+ELG WIH YIKK   +MN H+TS+LI MY+KCGDLEK+ EVF S E
Sbjct: 368  LVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVE 427

Query: 1330 KKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHAGLVEEGRL 1509
            K+DV+VWSAMI GL+MHG GN+A+D+F  MQEANVKPN VTFTNV CACSH GLV+E   
Sbjct: 428  KRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAES 487

Query: 1510 HFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGALLGACMIH 1689
             F QM   +G+ P+ KHY C+VD+LGR+G LE+A+  IE MPI P  S+WGALLGAC IH
Sbjct: 488  LFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIH 547

Query: 1690 ENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGINKEPGCSS 1869
             N+ LAE AC RLLELEPRN GA+VLLSNIYAK+G+W+ VS LRK MR +G+ KEPGCSS
Sbjct: 548  ANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSS 607

Query: 1870 IEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIEDDDVKEQAL 2049
            IE+DG++HEFL GDN+HP+S K+Y KL E+  +LKS GY P  SQVLQ IE++++KEQ+L
Sbjct: 608  IEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSL 667

Query: 2050 YLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLRDRYRFHHF 2229
             LHSEKLA+ YGLI T  P  +R++KNLRVCGDCHSVAKLIS++YDREI++RDRYRFHHF
Sbjct: 668  NLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHF 727

Query: 2230 KGGVCSCKDYW 2262
            + G CSC D+W
Sbjct: 728  RNGQCSCNDFW 738


>ref|XP_006410036.1| hypothetical protein EUTSA_v10016305mg [Eutrema salsugineum]
            gi|557111205|gb|ESQ51489.1| hypothetical protein
            EUTSA_v10016305mg [Eutrema salsugineum]
          Length = 739

 Score =  935 bits (2417), Expect = 0.0
 Identities = 455/733 (62%), Positives = 569/733 (77%), Gaps = 2/733 (0%)
 Frame = +1

Query: 70   ISLPHHD--HHITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFFDPFSAS 243
            +SLP H    +  +P  NN        +SLID+C+D+ Q+KQIHA M+R GLF D +SAS
Sbjct: 10   LSLPRHPTFSNPNQPTTNNE--RSRHTLSLIDRCADLRQLKQIHAQMVRTGLFNDHYSAS 67

Query: 244  RLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFLQMLHQC 423
            +L  I+ALSPF+S LDYA +VFDQIP PN +TWNTLIRAYAS PDP  S  +FL M+ + 
Sbjct: 68   KLFAIAALSPFAS-LDYACKVFDQIPQPNSFTWNTLIRAYASGPDPLRSICVFLDMVSES 126

Query: 424  PDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASCGNLNQA 603
               PN +TFPFLIKAA+E+ +LS G+  HGMA+KSS+  DVF+ NSL+H Y SCG+L+ A
Sbjct: 127  QCYPNTYTFPFLIKAAAEVSSLSLGQSLHGMAVKSSVGCDVFVANSLIHCYFSCGDLDSA 186

Query: 604  YKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVLSACAKS 783
             KVF  I ++DVVSWNSMIT FVQ    ++ALELF+ ME E +K + VTMV VLSACAK 
Sbjct: 187  CKVFTTIQEKDVVSWNSMITGFVQKGSPDKALELFKKMESEEVKASHVTMVGVLSACAKL 246

Query: 784  SNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTVSWTTMI 963
             NLE G+ V SYI++N ++MNL L+NA LDMYTKCG++E+AKR+FD M  KD V+WTTM+
Sbjct: 247  RNLEFGRQVCSYIEENGVKMNLTLANAMLDMYTKCGSIEDAKRLFDTMEVKDNVTWTTML 306

Query: 964  VGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDKNTKPDE 1143
             G+A + ++  AR + N MP +DI AWN+LISAYEQ+G+P EAL +FHELQL KN K ++
Sbjct: 307  DGFAILEDYEAARDVLNSMPKKDIVAWNALISAYEQNGKPNEALLVFHELQLQKNIKLNQ 366

Query: 1144 VTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKALEVFRS 1323
            +TLVS LSACAQ+GA+ELG WIH YIKK   + N ++TS+LI MY+KCGDL KA EVF +
Sbjct: 367  ITLVSTLSACAQVGALELGRWIHSYIKKHGIRSNFYVTSALIHMYSKCGDLVKAREVFNT 426

Query: 1324 SEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHAGLVEEG 1503
             EK+DV+VWSAMI GL+MHG GNDA+D+F  MQEANVKPN VTFTNV CACSH+GLV+E 
Sbjct: 427  VEKRDVFVWSAMIGGLAMHGCGNDALDMFYKMQEANVKPNGVTFTNVFCACSHSGLVDEA 486

Query: 1504 RLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGALLGACM 1683
               F +M   +G+ P+ KHY C+VD+LGR+G LE+A+  IE MPI P AS+WGALLGAC 
Sbjct: 487  ESLFSKMESNYGIVPEDKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSASVWGALLGACK 546

Query: 1684 IHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGINKEPGC 1863
            IH N+ LAE AC RLLELEPRN GA+VLLSNIYAK G+W+ VS LRK MR +G+ KEPGC
Sbjct: 547  IHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKSGKWESVSELRKHMRVTGLKKEPGC 606

Query: 1864 SSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIEDDDVKEQ 2043
            SSIE++G +HEFL GDN HP++ K+Y KL E+  RLKS GY P  SQVLQ I++++ KEQ
Sbjct: 607  SSIEINGTIHEFLSGDNEHPMADKVYGKLNEVMERLKSSGYEPEMSQVLQLIDEEETKEQ 666

Query: 2044 ALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLRDRYRFH 2223
            +L LHSEKLA+ YGLI T  P  +R++KNLRVCGDCHSVAKLIS++YDREI++RDRYRFH
Sbjct: 667  SLNLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQVYDREIIVRDRYRFH 726

Query: 2224 HFKGGVCSCKDYW 2262
            HF+ G CSC D+W
Sbjct: 727  HFRNGQCSCNDFW 739


>ref|XP_006293749.1| hypothetical protein CARUB_v10022711mg [Capsella rubella]
            gi|482562457|gb|EOA26647.1| hypothetical protein
            CARUB_v10022711mg [Capsella rubella]
          Length = 739

 Score =  933 bits (2411), Expect = 0.0
 Identities = 453/733 (61%), Positives = 568/733 (77%), Gaps = 2/733 (0%)
 Frame = +1

Query: 70   ISLPHHDHHI--TKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFFDPFSAS 243
            +SLP H       +P  NN        ISLID+CS++ Q+KQ HAHM+R G F DP+SAS
Sbjct: 10   LSLPRHPTFSGPNQPTTNNE--RSRHTISLIDRCSNLRQLKQTHAHMIRTGTFSDPYSAS 67

Query: 244  RLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFLQMLHQC 423
            +L  I+ALS F+S L+YAR+VFD+IP PN +TWNTLIRAYAS PDP  S  IFL M+ + 
Sbjct: 68   KLFAIAALSSFAS-LEYARKVFDEIPQPNSFTWNTLIRAYASGPDPVRSIWIFLDMVSES 126

Query: 424  PDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASCGNLNQA 603
               PNK+TFPFL+KAA+E+ +LS G+  HGMAIKS++  D+F+ NSL+H Y SCG+L+ A
Sbjct: 127  QCYPNKYTFPFLVKAAAEVSSLSLGQSLHGMAIKSAVGCDLFVANSLIHCYFSCGDLDSA 186

Query: 604  YKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVLSACAKS 783
             KVF  I ++DVVSWNSMI  FVQ    ++ALELF+ ME E++K + VTMV VLSAC K 
Sbjct: 187  CKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACTKL 246

Query: 784  SNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTVSWTTMI 963
             NLE G+ V S+I++N + +N+ L+NA LDMYTKCG++EEAKR+FD M EKD V++TTM+
Sbjct: 247  RNLEFGRQVCSFIEENRVNVNMTLANAMLDMYTKCGSIEEAKRLFDTMEEKDNVTFTTML 306

Query: 964  VGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDKNTKPDE 1143
             GYA   ++  AR + N MP +DI AWN+LISAYEQ+G+P EAL +FHELQL KN K ++
Sbjct: 307  DGYAISEDYEAAREVLNSMPKKDIVAWNALISAYEQNGKPNEALLVFHELQLQKNIKLNQ 366

Query: 1144 VTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKALEVFRS 1323
            +TLVS LSACAQ+GA+ELG WIH YIKK   +MN +ITS+LI MY+KCGDLEKA EVF  
Sbjct: 367  ITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFYITSALIHMYSKCGDLEKAREVFNC 426

Query: 1324 SEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHAGLVEEG 1503
             EK+DV+VWSAMI GL+MHG GN+A+D+F  MQE NVKPN VTFTN+ CACSH GLV+E 
Sbjct: 427  VEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEENVKPNGVTFTNLFCACSHTGLVDEA 486

Query: 1504 RLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGALLGACM 1683
               F +M   +G+ P+ KHY C+VD+LGR+G LE+A+  IE MPI P  S+WGALLGAC 
Sbjct: 487  ESLFHKMGSSYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACK 546

Query: 1684 IHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGINKEPGC 1863
            IH N+ LAE AC RLLELEPRN GA+VLLSNIYAK G+W+ VS LRK MR +G+ KEPGC
Sbjct: 547  IHANLSLAEMACTRLLELEPRNDGAHVLLSNIYAKSGKWENVSELRKHMRVTGLKKEPGC 606

Query: 1864 SSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIEDDDVKEQ 2043
            SSIE+DG++HEFL GDN+HP+S K+Y KL E+  +LKS GY P  SQVLQ IED+++KEQ
Sbjct: 607  SSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEMSQVLQIIEDEEMKEQ 666

Query: 2044 ALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLRDRYRFH 2223
            +L LHSEKLA+ YGLI T  P  +R++KNLRVCGDCHSVAKLIS++YDREI++RDRYRFH
Sbjct: 667  SLNLHSEKLAICYGLISTEAPKTIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFH 726

Query: 2224 HFKGGVCSCKDYW 2262
            HF+ G CSC D+W
Sbjct: 727  HFRNGQCSCNDFW 739


>ref|XP_002879234.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297325073|gb|EFH55493.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 740

 Score =  933 bits (2411), Expect = 0.0
 Identities = 457/736 (62%), Positives = 573/736 (77%), Gaps = 5/736 (0%)
 Frame = +1

Query: 70   ISLPHHDH--HITKPNNNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFFDPFSAS 243
            +SLP H +  +  +P  NN        ISLID+CS + Q+KQ HAHM+R G+F DP+SAS
Sbjct: 10   LSLPRHPNFSNPNQPTTNNE--RSRHTISLIDRCSSLRQLKQTHAHMIRTGMFSDPYSAS 67

Query: 244  RLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFLQMLH-- 417
            +L  I+ALS F+S L+YAR+VFD+IP PN +TWNTLIRAYAS PDP  S   FL M+   
Sbjct: 68   KLFAIAALSSFAS-LEYARKVFDEIPQPNSFTWNTLIRAYASGPDPVCSIWAFLDMVSSE 126

Query: 418  -QCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASCGNL 594
             QC   PNK+TFPFLIKAA+E+ +LS G+  HGMAIKS++ SDVF+ NSL+H Y SCG+L
Sbjct: 127  SQCY--PNKYTFPFLIKAAAEVSSLSLGQSLHGMAIKSAVGSDVFVANSLIHCYFSCGDL 184

Query: 595  NQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVLSAC 774
            + A KVF  I ++DVVSWNSMI  FVQ    ++ALELF+ ME E++K + VTMV VLSAC
Sbjct: 185  DSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSAC 244

Query: 775  AKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTVSWT 954
            AK  +LE G+ V SYI++N + +NL L+NA LDMYTKCG++E+AKR+FD M EKD V+WT
Sbjct: 245  AKIRDLEFGRRVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWT 304

Query: 955  TMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDKNTK 1134
            TM+ GYA   ++  AR + N MP +DI AWN+LISAYEQ+G+P EAL +FHELQL KN K
Sbjct: 305  TMLDGYAISEDYEAAREVLNAMPKKDIVAWNALISAYEQNGKPNEALLVFHELQLQKNIK 364

Query: 1135 PDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKALEV 1314
             +++TLVS LSACAQ+GA+ELG WIH YIKK   KMN ++TS+LI MY+KCGDLEKA EV
Sbjct: 365  LNQITLVSTLSACAQVGALELGRWIHSYIKKNGIKMNFYVTSALIHMYSKCGDLEKAREV 424

Query: 1315 FRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHAGLV 1494
            F S EK+DV+VWSAMI GL+MHG G++A+D+F  MQEANVKPN VTFTNV CACSH GLV
Sbjct: 425  FNSVEKRDVFVWSAMIGGLAMHGCGSEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLV 484

Query: 1495 EEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGALLG 1674
            +E    F +M   +G+ P+ KHY C+VD+LGR+G LE+A+  IE MPI P  S+WGALLG
Sbjct: 485  DEAESLFYKMESSYGIVPEDKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLG 544

Query: 1675 ACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGINKE 1854
            AC IH N+ LAE AC RLLELEPRN GA+VLLSNIYAK G+WD VS LRK MR +G+ KE
Sbjct: 545  ACKIHANLSLAEMACTRLLELEPRNDGAHVLLSNIYAKSGKWDNVSELRKHMRVTGLKKE 604

Query: 1855 PGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIEDDDV 2034
            PGCSSIE+DG++HEFL GDN+HP+S K+Y KL E+  +LKS GY P  S VLQ IE++++
Sbjct: 605  PGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEMSHVLQIIEEEEM 664

Query: 2035 KEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLRDRY 2214
            KEQ+L LHSEKLA+ YGLI T  P  +R++KNLR+CGDCH+VAKLIS++Y+REI++RDRY
Sbjct: 665  KEQSLNLHSEKLAICYGLISTEAPKAIRVIKNLRMCGDCHAVAKLISQLYNREIIVRDRY 724

Query: 2215 RFHHFKGGVCSCKDYW 2262
            RFHHF+ G CSC D+W
Sbjct: 725  RFHHFRNGQCSCNDFW 740


>gb|EYU45383.1| hypothetical protein MIMGU_mgv1a023657mg [Mimulus guttatus]
          Length = 701

 Score =  918 bits (2372), Expect = 0.0
 Identities = 439/717 (61%), Positives = 556/717 (77%)
 Frame = +1

Query: 112  NNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFFDPFSASRLITISALSPFSSSLD 291
            NN+RYFA+HP ++LI++CS+  Q+KQIHA MLR GLFFDPFSAS+L+   ALS  SS L 
Sbjct: 15   NNDRYFANHPTVTLIEKCSNSRQLKQIHAQMLRCGLFFDPFSASKLVQSYALSELSS-LH 73

Query: 292  YARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFLQMLHQCPDPPNKFTFPFLIKAA 471
            YA +VFDQIP PN+Y+WN LIRA ASSP P    ++F+++LH   + PNKFT+PF+IKA+
Sbjct: 74   YAYKVFDQIPQPNLYSWNILIRASASSPQPINCLLMFIRLLHVGGEKPNKFTYPFVIKAS 133

Query: 472  SELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASCGNLNQAYKVFVKIPKRDVVSWN 651
            ++L     G+  HGM IK    SD+F+ N L++FY+ CG L+ A +VF  + +RDVVSWN
Sbjct: 134  AKLDDFQLGKGIHGMVIKEGFSSDLFVSNCLIYFYSECGCLDMARRVFSSMSERDVVSWN 193

Query: 652  SMITAFVQADCLEEALELFQGMEEENIKPNDVTMVSVLSACAKSSNLELGKWVHSYIKKN 831
            +M+    Q   ++EA+E F  MEEE +KPNDVTMV VLSAC K S+++ G+WVHSYI+ N
Sbjct: 194  TMVNGLAQNGYVDEAVECFHRMEEEGLKPNDVTMVGVLSACGKKSDVKFGRWVHSYIETN 253

Query: 832  EIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTVSWTTMIVGYAKMGEFVVARHLF 1011
             I ++LIL NA LDMYTKCG++++AK++FDKM                            
Sbjct: 254  RIRLSLILCNAILDMYTKCGSMKDAKKLFDKM---------------------------- 285

Query: 1012 NLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDKNTKPDEVTLVSVLSACAQLGAI 1191
             L   +DIA WN+LISAYEQSG P EA+++F+ELQL K +KPDEVTLVS LSAC+QLGA 
Sbjct: 286  -LPSKEDIATWNALISAYEQSGNPNEAIAIFNELQLSKASKPDEVTLVSTLSACSQLGAT 344

Query: 1192 ELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKALEVFRSSEKKDVYVWSAMIAGL 1371
            E G WIHVY+KK+  ++N H+ ++L+DMY+KCGDL KALE+F S +K+DV+VWSAMIAGL
Sbjct: 345  EFGSWIHVYMKKEGMRLNRHLVTALVDMYSKCGDLHKALEIFNSVDKRDVFVWSAMIAGL 404

Query: 1372 SMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHAGLVEEGRLHFDQMLLVHGVEPQ 1551
             MHG G DAI LFL MQEA V+P+ VTFTN+L ACSH+GLVEEGR  F QM   + + P 
Sbjct: 405  GMHGRGGDAIKLFLKMQEAKVRPSSVTFTNLLAACSHSGLVEEGREFFVQMDQNYKIAPG 464

Query: 1552 LKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGALLGACMIHENVKLAEHACNRLL 1731
            ++HY CMVD+LGRAGLLE+A+ LI+ MP+ PGAS+WGALLGAC +H NV LAE+ACN LL
Sbjct: 465  VEHYACMVDILGRAGLLEDAVDLIKNMPMAPGASVWGALLGACKLHRNVDLAEYACNSLL 524

Query: 1732 ELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGINKEPGCSSIEVDGVVHEFLVGD 1911
            E+EP+NHGAYVLLSNIYA +G+WD VS LRK MRD G+ KEPGCSS+EV+GVVHEFLVGD
Sbjct: 525  EIEPQNHGAYVLLSNIYANLGKWDKVSELRKRMRDVGLKKEPGCSSVEVNGVVHEFLVGD 584

Query: 1912 NSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIEDDDVKEQALYLHSEKLAMAYGLI 2091
            + HPL  KIYSKL E+A+RLK VGYV +KSQ+LQ ++++D++E+ALYLHSE+LA+A+GLI
Sbjct: 585  SRHPLCKKIYSKLDEVAARLKHVGYVSDKSQLLQLVKEEDMQEKALYLHSERLALAFGLI 644

Query: 2092 YTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLRDRYRFHHFKGGVCSCKDYW 2262
               P  P+RIVKNLRVC DCHSV KL+SK+YDREI+LRDRYRFHHF+GG CSC DYW
Sbjct: 645  TLGPSQPIRIVKNLRVCEDCHSVIKLVSKLYDREIVLRDRYRFHHFRGGSCSCMDYW 701


>ref|XP_003520267.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
            chloroplastic-like isoform X1 [Glycine max]
          Length = 780

 Score =  917 bits (2369), Expect = 0.0
 Identities = 442/707 (62%), Positives = 552/707 (78%)
 Frame = +1

Query: 142  IISLIDQCSDITQVKQIHAHMLRIGLFFDPFSASRLITISALSPFSSSLDYARQVFDQIP 321
            I+  IDQC++  Q+KQIHAHMLR   F DP++AS+L+T  A+S   S L YA+ VF+QIP
Sbjct: 75   ILEFIDQCTNTMQLKQIHAHMLRTSRFCDPYTASKLLTAYAISS-CSCLIYAKNVFNQIP 133

Query: 322  HPNIYTWNTLIRAYASSPDPTESFIIFLQMLHQCPDPPNKFTFPFLIKAASELLALSHGE 501
             PN+Y WNTLIR YASS DPT+SF+IFL MLH C + PNKFTFPFL KAAS L  L  G 
Sbjct: 134  QPNLYCWNTLIRGYASSSDPTQSFLIFLHMLHSCSEFPNKFTFPFLFKAASRLKVLHLGS 193

Query: 502  VFHGMAIKSSLDSDVFILNSLVHFYASCGNLNQAYKVFVKIPKRDVVSWNSMITAFVQAD 681
            V HGM IK+SL SD+FILNSL++FY S G  + A++VF  +P +DVVSWN+MI AF    
Sbjct: 194  VLHGMVIKASLSSDLFILNSLINFYGSSGAPDLAHRVFTNMPGKDVVSWNAMINAFALGG 253

Query: 682  CLEEALELFQGMEEENIKPNDVTMVSVLSACAKSSNLELGKWVHSYIKKNEIEMNLILSN 861
              ++AL LFQ ME +++KPN +TMVSVLSACAK  +LE G+W+ SYI+ N    +LIL+N
Sbjct: 254  LPDKALLLFQEMEMKDVKPNVITMVSVLSACAKKIDLEFGRWICSYIENNGFTEHLILNN 313

Query: 862  ATLDMYTKCGNLEEAKRIFDKMCEKDTVSWTTMIVGYAKMGEFVVARHLFNLMPSQDIAA 1041
            A LDMY KCG + +AK +F+KM EKD VSWTTM+ G+AK+G +  A  +F+ MP +  AA
Sbjct: 314  AMLDMYVKCGCINDAKDLFNKMSEKDIVSWTTMLDGHAKLGNYDEAHCIFDAMPHKWTAA 373

Query: 1042 WNSLISAYEQSGRPKEALSLFHELQLDKNTKPDEVTLVSVLSACAQLGAIELGGWIHVYI 1221
            WN+LISAYEQ+G+P+ ALSLFHE+QL K+ KPDEVTL+  L A AQLGAI+ G WIHVYI
Sbjct: 374  WNALISAYEQNGKPRVALSLFHEMQLSKDAKPDEVTLICALCASAQLGAIDFGHWIHVYI 433

Query: 1222 KKQSFKMNCHITSSLIDMYAKCGDLEKALEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAI 1401
            KK    +NCH+ +SL+DMYAKCG+L KA+EVF + E+KDVYVWSAMI  L+M+G G  A+
Sbjct: 434  KKHDINLNCHLATSLLDMYAKCGNLNKAMEVFHAVERKDVYVWSAMIGALAMYGQGKAAL 493

Query: 1402 DLFLGMQEANVKPNDVTFTNVLCACSHAGLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDM 1581
            DLF  M EA +KPN VTFTN+LCAC+HAGLV EG   F+QM  ++G+ PQ++HY C+VD+
Sbjct: 494  DLFSSMLEAYIKPNAVTFTNILCACNHAGLVNEGEQLFEQMEPLYGIVPQIQHYVCVVDI 553

Query: 1582 LGRAGLLEEAMGLIEKMPIVPGASIWGALLGACMIHENVKLAEHACNRLLELEPRNHGAY 1761
             GRAGLLE+A   IEKMPI P A++WGALLGAC  H NV+LAE A   LLELEP NHGA+
Sbjct: 554  FGRAGLLEKAASFIEKMPIPPTAAVWGALLGACSRHGNVELAELAYQNLLELEPCNHGAF 613

Query: 1762 VLLSNIYAKVGRWDGVSRLRKLMRDSGINKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIY 1941
            VLLSNIYAK G W+ VS LRKLMRDS + KEP CSSI+V+G+VHEFLVGDNSHP S KIY
Sbjct: 614  VLLSNIYAKAGDWEKVSNLRKLMRDSDVKKEPWCSSIDVNGIVHEFLVGDNSHPFSQKIY 673

Query: 1942 SKLGEIASRLKSVGYVPNKSQVLQDIEDDDVKEQALYLHSEKLAMAYGLIYTSPPSPVRI 2121
            SKL EI+ + K +GY P+ S +LQ  E+D++ EQ+L +HSEKLA+A+GLI T+   P+RI
Sbjct: 674  SKLDEISEKFKPIGYKPDMSNLLQLSEEDNLMEQSLNVHSEKLAIAFGLISTASSQPIRI 733

Query: 2122 VKNLRVCGDCHSVAKLISKIYDREILLRDRYRFHHFKGGVCSCKDYW 2262
            VKN+R+CGDCH+ AKL+S++YDR+ILLRDRYRFHHF+GG CSC DYW
Sbjct: 734  VKNIRICGDCHAFAKLVSQLYDRDILLRDRYRFHHFRGGKCSCLDYW 780


>gb|AFN53666.1| hypothetical protein [Linum usitatissimum]
          Length = 850

 Score =  908 bits (2346), Expect = 0.0
 Identities = 441/708 (62%), Positives = 554/708 (78%), Gaps = 3/708 (0%)
 Frame = +1

Query: 148  SLIDQCSDITQVKQIHAHMLRIGLFFDPFSASRLITISALSPFSSSLDYARQVFDQIPHP 327
            +L  QC+   Q+KQIHA MLR     DP++AS L T +A S FS+ LDYAR+VFDQIP P
Sbjct: 144  ALFQQCTSFKQLKQIHAQMLRTNKLHDPYAASELFTAAAFSSFSA-LDYARKVFDQIPQP 202

Query: 328  NIYTWNTLIRAYASSPDPTESFIIFLQMLHQCPDPPNKFTFPFLIKAASELLALSHGEVF 507
            N+Y+WN LIRA A+S DP +S ++F++MLH  P  PNKFTFP LIKA +E      G+  
Sbjct: 203  NLYSWNILIRALATSSDPIQSVLVFIRMLHDSPFGPNKFTFPVLIKAVAERRCFLVGKAV 262

Query: 508  HGMAIKSSLDSDVFILNSLVHFYASCGNLNQAYKVFVKIP--KRDVVSWNSMITAFVQAD 681
            HGMAIK+S   DVF+LNSL+HFYASCG+L+ AY VF  I    +D+VSWNSM+T FVQ  
Sbjct: 263  HGMAIKTSFGDDVFVLNSLIHFYASCGHLDLAYLVFEMIEGNNKDIVSWNSMVTGFVQGG 322

Query: 682  CLEEALELFQGMEEENIKPNDVTMVSVLSACAKSSNLELGKWVHSYIKKNEIEMNLILSN 861
              ++AL+LF+ M  E + PN VTMVSV+SACAK+ NL LG+ V  YI +NE+ MNL + N
Sbjct: 323  YPDKALDLFERMRNEGVHPNAVTMVSVMSACAKTMNLTLGRKVCDYIDRNEMMMNLNVCN 382

Query: 862  ATLDMYTKCGNLEEAKRIFDKMCEKDTVSWTTMIVGYAKMGEFVVARHLFNLMPSQDIAA 1041
            AT+DM+ KCG +E A+ +FD M ++D VSWTT+I GYAKM E  +AR +F+ MP +DI A
Sbjct: 383  ATIDMFVKCGEVEIARGLFDNMEKRDVVSWTTIIDGYAKMSEHGIARDIFDSMPRKDIPA 442

Query: 1042 WNSLISAYEQSGRPKEALSLFHELQLDKN-TKPDEVTLVSVLSACAQLGAIELGGWIHVY 1218
            WN LIS YEQSGRPKEAL++F ELQL K+  +PD+VTL+S LSACAQLGA+++G WIH Y
Sbjct: 443  WNVLISGYEQSGRPKEALAIFRELQLTKSGARPDQVTLLSTLSACAQLGAMDIGEWIHGY 502

Query: 1219 IKKQSFKMNCHITSSLIDMYAKCGDLEKALEVFRSSEKKDVYVWSAMIAGLSMHGHGNDA 1398
            IKK+  ++N ++ +SLIDMY+K GD+EKA+EVF S   KDV+VWSAMIAGL+MHG G  A
Sbjct: 503  IKKERIQLNRNLATSLIDMYSKSGDVEKAIEVFHSIGNKDVFVWSAMIAGLAMHGRGEAA 562

Query: 1399 IDLFLGMQEANVKPNDVTFTNVLCACSHAGLVEEGRLHFDQMLLVHGVEPQLKHYGCMVD 1578
            I+LFL MQE  VKPN VTFTN+LCACSH+GLV+EG+  FD+M  V+GV P+ KHY CMVD
Sbjct: 563  IELFLDMQETQVKPNSVTFTNLLCACSHSGLVDEGKRLFDEMERVYGVVPKTKHYSCMVD 622

Query: 1579 MLGRAGLLEEAMGLIEKMPIVPGASIWGALLGACMIHENVKLAEHACNRLLELEPRNHGA 1758
            +LGRAG LEEA+  IE MP+ P AS+WGALLGAC IH N++LAE AC+RLLE+EP NHGA
Sbjct: 623  VLGRAGHLEEALKFIEGMPLAPSASVWGALLGACCIHGNLELAEKACSRLLEIEPGNHGA 682

Query: 1759 YVLLSNIYAKVGRWDGVSRLRKLMRDSGINKEPGCSSIEVDGVVHEFLVGDNSHPLSTKI 1938
            YVLLSN+YAK G W+GVS LR+ MRDSG+ KE GCSSIE+DG VHEF+VGDN+HPLS  I
Sbjct: 683  YVLLSNLYAKTGDWEGVSELRQQMRDSGLKKETGCSSIEIDGTVHEFIVGDNAHPLSRDI 742

Query: 1939 YSKLGEIASRLKSVGYVPNKSQVLQDIEDDDVKEQALYLHSEKLAMAYGLIYTSPPSPVR 2118
            Y+KL EI +RL+S GYV N   +LQ +E++++KE+AL LHSEK+A+A+GLI       +R
Sbjct: 743  YAKLDEIMARLRSHGYVANTLCMLQFVEEEEMKEKALKLHSEKMAIAFGLIRADSQQAIR 802

Query: 2119 IVKNLRVCGDCHSVAKLISKIYDREILLRDRYRFHHFKGGVCSCKDYW 2262
            IVKNLRVC DCH+VAK++SK+Y R+I+LRDRYRFHHF GG CSC+DYW
Sbjct: 803  IVKNLRVCRDCHTVAKMVSKVYGRDIVLRDRYRFHHFSGGHCSCQDYW 850


>ref|XP_006575137.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
            chloroplastic-like isoform X2 [Glycine max]
          Length = 695

 Score =  905 bits (2339), Expect = 0.0
 Identities = 437/695 (62%), Positives = 544/695 (78%)
 Frame = +1

Query: 178  QVKQIHAHMLRIGLFFDPFSASRLITISALSPFSSSLDYARQVFDQIPHPNIYTWNTLIR 357
            Q+KQIHAHMLR   F DP++AS+L+T  A+S   S L YA+ VF+QIP PN+Y WNTLIR
Sbjct: 2    QLKQIHAHMLRTSRFCDPYTASKLLTAYAISS-CSCLIYAKNVFNQIPQPNLYCWNTLIR 60

Query: 358  AYASSPDPTESFIIFLQMLHQCPDPPNKFTFPFLIKAASELLALSHGEVFHGMAIKSSLD 537
             YASS DPT+SF+IFL MLH C + PNKFTFPFL KAAS L  L  G V HGM IK+SL 
Sbjct: 61   GYASSSDPTQSFLIFLHMLHSCSEFPNKFTFPFLFKAASRLKVLHLGSVLHGMVIKASLS 120

Query: 538  SDVFILNSLVHFYASCGNLNQAYKVFVKIPKRDVVSWNSMITAFVQADCLEEALELFQGM 717
            SD+FILNSL++FY S G  + A++VF  +P +DVVSWN+MI AF      ++AL LFQ M
Sbjct: 121  SDLFILNSLINFYGSSGAPDLAHRVFTNMPGKDVVSWNAMINAFALGGLPDKALLLFQEM 180

Query: 718  EEENIKPNDVTMVSVLSACAKSSNLELGKWVHSYIKKNEIEMNLILSNATLDMYTKCGNL 897
            E +++KPN +TMVSVLSACAK  +LE G+W+ SYI+ N    +LIL+NA LDMY KCG +
Sbjct: 181  EMKDVKPNVITMVSVLSACAKKIDLEFGRWICSYIENNGFTEHLILNNAMLDMYVKCGCI 240

Query: 898  EEAKRIFDKMCEKDTVSWTTMIVGYAKMGEFVVARHLFNLMPSQDIAAWNSLISAYEQSG 1077
             +AK +F+KM EKD VSWTTM+ G+AK+G +  A  +F+ MP +  AAWN+LISAYEQ+G
Sbjct: 241  NDAKDLFNKMSEKDIVSWTTMLDGHAKLGNYDEAHCIFDAMPHKWTAAWNALISAYEQNG 300

Query: 1078 RPKEALSLFHELQLDKNTKPDEVTLVSVLSACAQLGAIELGGWIHVYIKKQSFKMNCHIT 1257
            +P+ ALSLFHE+QL K+ KPDEVTL+  L A AQLGAI+ G WIHVYIKK    +NCH+ 
Sbjct: 301  KPRVALSLFHEMQLSKDAKPDEVTLICALCASAQLGAIDFGHWIHVYIKKHDINLNCHLA 360

Query: 1258 SSLIDMYAKCGDLEKALEVFRSSEKKDVYVWSAMIAGLSMHGHGNDAIDLFLGMQEANVK 1437
            +SL+DMYAKCG+L KA+EVF + E+KDVYVWSAMI  L+M+G G  A+DLF  M EA +K
Sbjct: 361  TSLLDMYAKCGNLNKAMEVFHAVERKDVYVWSAMIGALAMYGQGKAALDLFSSMLEAYIK 420

Query: 1438 PNDVTFTNVLCACSHAGLVEEGRLHFDQMLLVHGVEPQLKHYGCMVDMLGRAGLLEEAMG 1617
            PN VTFTN+LCAC+HAGLV EG   F+QM  ++G+ PQ++HY C+VD+ GRAGLLE+A  
Sbjct: 421  PNAVTFTNILCACNHAGLVNEGEQLFEQMEPLYGIVPQIQHYVCVVDIFGRAGLLEKAAS 480

Query: 1618 LIEKMPIVPGASIWGALLGACMIHENVKLAEHACNRLLELEPRNHGAYVLLSNIYAKVGR 1797
             IEKMPI P A++WGALLGAC  H NV+LAE A   LLELEP NHGA+VLLSNIYAK G 
Sbjct: 481  FIEKMPIPPTAAVWGALLGACSRHGNVELAELAYQNLLELEPCNHGAFVLLSNIYAKAGD 540

Query: 1798 WDGVSRLRKLMRDSGINKEPGCSSIEVDGVVHEFLVGDNSHPLSTKIYSKLGEIASRLKS 1977
            W+ VS LRKLMRDS + KEP CSSI+V+G+VHEFLVGDNSHP S KIYSKL EI+ + K 
Sbjct: 541  WEKVSNLRKLMRDSDVKKEPWCSSIDVNGIVHEFLVGDNSHPFSQKIYSKLDEISEKFKP 600

Query: 1978 VGYVPNKSQVLQDIEDDDVKEQALYLHSEKLAMAYGLIYTSPPSPVRIVKNLRVCGDCHS 2157
            +GY P+ S +LQ  E+D++ EQ+L +HSEKLA+A+GLI T+   P+RIVKN+R+CGDCH+
Sbjct: 601  IGYKPDMSNLLQLSEEDNLMEQSLNVHSEKLAIAFGLISTASSQPIRIVKNIRICGDCHA 660

Query: 2158 VAKLISKIYDREILLRDRYRFHHFKGGVCSCKDYW 2262
             AKL+S++YDR+ILLRDRYRFHHF+GG CSC DYW
Sbjct: 661  FAKLVSQLYDRDILLRDRYRFHHFRGGKCSCLDYW 695


>gb|EPS69071.1| hypothetical protein M569_05691, partial [Genlisea aurea]
          Length = 726

 Score =  896 bits (2316), Expect = 0.0
 Identities = 431/725 (59%), Positives = 563/725 (77%), Gaps = 8/725 (1%)
 Frame = +1

Query: 112  NNNRYFADHPIISLIDQCSDITQVKQIHAHMLRIGLFFDPFSASRLITISALSPFSSSLD 291
            +  R+   HP ++LID+C+   Q+KQIH  MLR GL  DPF+AS+LI++SALS FSS L 
Sbjct: 3    DKERFLEKHPTVTLIDRCTSQKQLKQIHCQMLRSGLLDDPFAASKLISLSALSDFSS-LA 61

Query: 292  YARQVFDQIPHPNIYTWNTLIRAYASSPDPTESFIIFLQMLHQCPDPPNKFTFPFLIKAA 471
            YA++VFDQ+P PN+++WN L+RAYAS+  P  S  +F+++LH  PDPP+KFT+PF IKA 
Sbjct: 62   YAQKVFDQMPRPNLFSWNILVRAYASASRPLHSLSLFIRLLHHSPDPPDKFTYPFAIKAC 121

Query: 472  SELLALSHGEVFHGMAIKSSLDSDVFILNSLVHFYASCGNLNQAYKVFVKIPK--RDVVS 645
            ++L  L  G   HGMA+K +  SDVF+ NSL+ FY+ C  L  AY++F  +P+  RDVVS
Sbjct: 122  ADLSDLRLGRGIHGMAVKGNHASDVFVSNSLIRFYSECRCLVAAYRIFETMPRTRRDVVS 181

Query: 646  WNSMITAFVQADCLEEALELFQGM----EEENIKPNDVTMVSVLSACAKSSNLELGKWVH 813
            WNSMI   VQ    ++A+ELF  M    EEE ++PN VTM+SVL  C   S+LELGKW H
Sbjct: 182  WNSMINGLVQNKWHDDAMELFHRMVAEEEEEGVEPNGVTMLSVLGICGTKSDLELGKWAH 241

Query: 814  SYIKKNEIEMNLILSNATLDMYTKCGNLEEAKRIFDKMCEKDTVSWTTMIVGYAKMGEFV 993
            SY+ KN +E +LIL NA LDMYTKCG ++EA+ +FDKM ++D ++WTTM+ GYAK G+F 
Sbjct: 242  SYVNKNGMEGSLILDNAILDMYTKCGGMKEAREVFDKMEDRDVITWTTMLTGYAKTGDFK 301

Query: 994  VARHLFNLMPSQDIAAWNSLISAYEQSGRPKEALSLFHELQLDKN-TKPDEVTLVSVLSA 1170
             AR LF+ +P++DI +WN+LISAYEQ G  KEA+++F+ELQ   N T+PD VTLVS LSA
Sbjct: 302  AARDLFDALPTKDITSWNALISAYEQRGNAKEAIAIFNELQQSNNDTEPDGVTLVSTLSA 361

Query: 1171 CAQLGAIELGGWIHVYIKKQSFKMNCHITSSLIDMYAKCGDLEKALEVFRSSE-KKDVYV 1347
            C+QLGAIELG  IH Y+KK+   +NCH+ +SLIDMY+KCGDLEKA +VFRSS  ++DV+V
Sbjct: 362  CSQLGAIELGTRIHNYVKKRGMSLNCHLVTSLIDMYSKCGDLEKAAQVFRSSSHERDVFV 421

Query: 1348 WSAMIAGLSMHGHGNDAIDLFLGMQEANVKPNDVTFTNVLCACSHAGLVEEGRLHFDQML 1527
            WSAMIA   MHG G+DA++LF  MQEA VKP+ VTFTN+L ACSH+GLVEEG   F+QM 
Sbjct: 422  WSAMIAAYGMHGCGHDAVELFKKMQEAKVKPSFVTFTNLLSACSHSGLVEEGVELFNQME 481

Query: 1528 LVHGVEPQLKHYGCMVDMLGRAGLLEEAMGLIEKMPIVPGASIWGALLGACMIHENVKLA 1707
             V+G+ P+++HY C+VD+LGRAG LE A+  I  MP+ PG+S+WGALLGAC +H+NV+LA
Sbjct: 482  NVYGIVPRMEHYACLVDILGRAGRLERAVEFIRSMPMTPGSSVWGALLGACKLHKNVELA 541

Query: 1708 EHACNRLLELEPRNHGAYVLLSNIYAKVGRWDGVSRLRKLMRDSGINKEPGCSSIEVDGV 1887
            + ACN LLE+EP N GA V+LSN+YA +G+W+ VS LRK MR++G+ K+ GCSS+E++G 
Sbjct: 542  QLACNNLLEIEPLNDGAMVVLSNLYADLGKWEEVSNLRKRMRETGLKKQTGCSSVEINGT 601

Query: 1888 VHEFLVGDNSHPLSTKIYSKLGEIASRLKSVGYVPNKSQVLQDIEDDDVKEQALYLHSEK 2067
             HEFLVGD +HPLS KIY KL EIA+ LKS GYVP+KSQVLQ +E++D++E++LY HSE+
Sbjct: 602  NHEFLVGDTTHPLSKKIYLKLEEIAAELKSAGYVPDKSQVLQQVEEEDIQEKSLYHHSER 661

Query: 2068 LAMAYGLIYTSPPSPVRIVKNLRVCGDCHSVAKLISKIYDREILLRDRYRFHHFKGGVCS 2247
            LA+A GLI  +P  P+RIVKNLRVC DCH V KL+S+IYDREI+LRDRYRFH F+ G CS
Sbjct: 662  LALALGLISLAPSQPIRIVKNLRVCEDCHCVFKLVSRIYDREIVLRDRYRFHLFRKGCCS 721

Query: 2248 CKDYW 2262
            CK+YW
Sbjct: 722  CKEYW 726


Top