BLASTX nr result

ID: Mentha29_contig00010732 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00010732
         (2639 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004250507.1| PREDICTED: pentatricopeptide repeat-containi...   994   0.0  
ref|XP_006353247.1| PREDICTED: pentatricopeptide repeat-containi...   992   0.0  
gb|EYU39754.1| hypothetical protein MIMGU_mgv1a001778mg [Mimulus...   984   0.0  
ref|XP_002272556.1| PREDICTED: pentatricopeptide repeat-containi...   912   0.0  
ref|XP_007051367.1| Pentatricopeptide repeat-containing protein,...   901   0.0  
gb|EPS65453.1| hypothetical protein M569_09325, partial [Genlise...   887   0.0  
ref|XP_002515124.1| pentatricopeptide repeat-containing protein,...   872   0.0  
ref|XP_002302689.2| hypothetical protein POPTR_0002s18390g [Popu...   871   0.0  
gb|EXC13626.1| hypothetical protein L484_019583 [Morus notabilis]     870   0.0  
ref|XP_006386676.1| pentatricopeptide repeat-containing family p...   869   0.0  
ref|XP_004308750.1| PREDICTED: pentatricopeptide repeat-containi...   864   0.0  
ref|XP_006491416.1| PREDICTED: pentatricopeptide repeat-containi...   858   0.0  
ref|XP_004140525.1| PREDICTED: pentatricopeptide repeat-containi...   853   0.0  
ref|XP_006289934.1| hypothetical protein CARUB_v10003556mg [Caps...   852   0.0  
ref|NP_192066.2| pentatricopeptide repeat-containing protein [Ar...   848   0.0  
ref|XP_006396354.1| hypothetical protein EUTSA_v10028437mg [Eutr...   846   0.0  
ref|XP_002874971.1| pentatricopeptide repeat-containing protein ...   845   0.0  
ref|XP_003539071.1| PREDICTED: pentatricopeptide repeat-containi...   816   0.0  
ref|XP_003621545.1| Pentatricopeptide repeat-containing protein ...   777   0.0  
ref|XP_004491942.1| PREDICTED: pentatricopeptide repeat-containi...   775   0.0  

>ref|XP_004250507.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Solanum lycopersicum]
          Length = 819

 Score =  994 bits (2569), Expect = 0.0
 Identities = 490/812 (60%), Positives = 626/812 (77%), Gaps = 10/812 (1%)
 Frame = +1

Query: 52   RAMSVLRDSSAFLLPNTARFRFTTAAEKSAESAPISELGDLLVVAAIAKTLSKPGGIHAL 231
            R ++VL +   F +   A +  TT+A K+A +   S++G+L+VVA+IAK L K GG   L
Sbjct: 7    RNLAVLYNKRQFSVAG-ASYTGTTSAAKTAAA---SKVGNLIVVASIAKALIKRGGTRNL 62

Query: 232  EKNGDAIPXXXXXXXXXXXXXXXXXXXXXGFFRWCSLRQNYKHSARAYSQMFKVLCFLTH 411
            EK GD IP                      FF+WCSLR N+KHS   YSQMFK +C+ + 
Sbjct: 63   EKYGDLIPLSESLVLQVLRRNNLDAEKKLDFFKWCSLRPNFKHSTETYSQMFKCICY-SR 121

Query: 412  QHHDDVLELLAAMRHDGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSA 591
             H +DV  LL +M+ D + L+S+T K++LD F + G +DSALE+L++ E +L  +SC S 
Sbjct: 122  NHREDVFVLLNSMKDDEVLLNSATFKLLLDSFTRTGNFDSALEILEFVEGDLANSSCLSP 181

Query: 592  DVYSPVLVALVMKNQISIALSVFSKLLDSALFAKNGENIVIPDAIACNEVLVGLKKADMR 771
            DVY+ VL+ALV KNQ+++ALS+F KLL++     +G +I +  AIACNE+LVGLK+ +MR
Sbjct: 182  DVYNSVLIALVQKNQVNLALSIFLKLLET----NDGNSIGVSSAIACNELLVGLKRGNMR 237

Query: 772  DEFRQLYSNLRKTKLYPMDRWGYNICIHALGCWGDLSTALALFKEMKERNGPFEPDLCTY 951
             EF+Q++  LR   ++P DRWGYNICIHA GCWGDLS +L+LFKEMKER   F PDLCTY
Sbjct: 238  AEFKQVFDKLRGGNVFPFDRWGYNICIHAFGCWGDLSRSLSLFKEMKERGSCFSPDLCTY 297

Query: 952  NSLINVLCLLGKVKDALIVWEELKASSGYEPDEFTYRIMIQGCAKSYRMNDALRIFSEMQ 1131
            NSLI+VLCLLGKVKDA +VWEELK SSG EPD +TYRI+IQGC+K+Y +NDA+++F+EMQ
Sbjct: 298  NSLIHVLCLLGKVKDAFVVWEELKGSSGLEPDAYTYRIVIQGCSKAYLINDAIKVFTEMQ 357

Query: 1132 YNGIRAGTVVYNSLLDGLMKSKKLMEACDLFEKMVDDDGVRASCWTYNILIDGLYKNRRE 1311
            YNGIR  T+VYNSLLDGL+K +KL +AC+LF+KM++DDGVRASCWTYNILIDGL+KN R 
Sbjct: 358  YNGIRPDTIVYNSLLDGLLKVRKLTDACNLFQKMIEDDGVRASCWTYNILIDGLFKNGRA 417

Query: 1312 EAAYTLFCDLKKKGNNFVDGVTYSIVTLHLCRENQVEEALQLVEEMEGRGFVVDLVTITS 1491
             AAYTLFCDLKKK NNFVDGV+YSIV LHLCRE++++EAL+LVEEME RGF VDLVTITS
Sbjct: 418  LAAYTLFCDLKKKSNNFVDGVSYSIVILHLCREDRLDEALKLVEEMEARGFTVDLVTITS 477

Query: 1492 LVIALYKRGRWDWIEKLLRHIREGNLVPSLIKWKSAMEAAMTKPQSKKRDFTPLFPS--- 1662
            L+IA+Y+ G WD+ E+L++HIR+ NLVP +I+WK +MEA M  PQS+++DFTP+FPS   
Sbjct: 478  LLIAIYREGHWDYTERLMKHIRDSNLVPIIIRWKDSMEATMKAPQSREKDFTPIFPSNRN 537

Query: 1663 LSDVVEILNLKKADKD---GEDNTDKFNDEKDEWSSSPYM----DKLANSGQPSLTYAIS 1821
              D++ + NL  A+ D   G +  +    E D WSSSPYM    DK+++    S T++++
Sbjct: 538  FGDILGLENLTDAETDIALGAEEAEIHYQESDPWSSSPYMDLLADKVSSQSNSSRTFSLT 597

Query: 1822 KGVRVLGKGDDSFDIDIMNTYLSIFLAKGKLSLACKLFEIFTNLGVNPLSYTYNSIMSSF 2001
             G R+  K  DSFDID++NT+LSIFLAKGKLS+ACKLFEIFT++G +P+SYTYNS+MSSF
Sbjct: 598  GGKRIDTKSADSFDIDMVNTFLSIFLAKGKLSMACKLFEIFTDMGADPVSYTYNSMMSSF 657

Query: 2002 IKKGYFKEAWGVLHAMGEEEISPSDIATYNVVIQGLGKMGRADLAKVVLEKLKDEGGYLD 2181
            +KKGYF EAWGVL  MG E++ PSD+ATYNV+IQGLGKMGRADLA  VL+KL  +GGYLD
Sbjct: 658  VKKGYFNEAWGVLQEMG-EKVCPSDVATYNVIIQGLGKMGRADLADAVLDKLMKQGGYLD 716

Query: 2182 IVMYNTLINALGKGGRFGEAVELFEQMKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLK 2361
            IVMYNTLINALGK GR  E  +LF+QMK SG+NPDV TYNTLIEVHAK G++K++Y FL+
Sbjct: 717  IVMYNTLINALGKAGRIEEVNKLFQQMKDSGINPDVVTYNTLIEVHAKAGQLKQSYKFLR 776

Query: 2362 MMLDAGCVPNHVTDTCLDYLEKEIERRRYDMA 2457
            MML+AGC PN VTDT LD+LEKEIE+ RY  A
Sbjct: 777  MMLEAGCAPNQVTDTTLDFLEKEIEKLRYQKA 808


>ref|XP_006353247.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            isoform X1 [Solanum tuberosum]
          Length = 816

 Score =  992 bits (2565), Expect = 0.0
 Identities = 479/783 (61%), Positives = 614/783 (78%), Gaps = 10/783 (1%)
 Frame = +1

Query: 139  AESAPISELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXX 318
            + +A  S++G+LLVVA+IAK L KPGG   LE+ GD+IP                     
Sbjct: 29   SSTAAASKVGNLLVVASIAKALIKPGGTRNLEQYGDSIPLSESLVLQVLRRNNLDAEKKL 88

Query: 319  GFFRWCSLRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMIL 498
             FF+WCSLR ++KHS   YSQMFK +C+ +H H + +  LL +M+ D + L+++T K++L
Sbjct: 89   DFFKWCSLRPSFKHSTETYSQMFKSICY-SHNHREAIFVLLNSMKDDKVLLNAATFKLLL 147

Query: 499  DGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLDS 678
            D F + G +DSALE+L++ E +L  +SC S DVY+ VL+ALV KNQ+++ALS+F KLL++
Sbjct: 148  DSFTRTGNFDSALEILEFVEGDLDNSSCLSPDVYNSVLIALVQKNQVNLALSIFLKLLET 207

Query: 679  ALFAKNGENIVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNICIHA 858
                 +G +I +  A+ACNE+LVGLK+ +MR EF+Q++  LR   ++P DRWGYNICIH 
Sbjct: 208  ----NDGNSIGVSSAVACNELLVGLKRGNMRAEFKQVFDKLRGGNVFPFDRWGYNICIHT 263

Query: 859  LGCWGDLSTALALFKEMKERNGPFEPDLCTYNSLINVLCLLGKVKDALIVWEELKASSGY 1038
             GCWGDLS++L+LFKEMKER   F PDLCTYNSLI+VLCLLGKVKDA +VWEELK SSG 
Sbjct: 264  FGCWGDLSSSLSLFKEMKERGSWFSPDLCTYNSLIHVLCLLGKVKDAFVVWEELKGSSGL 323

Query: 1039 EPDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKSKKLMEACD 1218
            EPD +TYRI+IQGC+K+Y +NDA+++F+EMQYNGIR  T+VYN+LLDGL+K++KL +AC+
Sbjct: 324  EPDAYTYRIVIQGCSKAYLINDAIKVFTEMQYNGIRPDTIVYNTLLDGLLKARKLTDACN 383

Query: 1219 LFEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGVTYSIVTLH 1398
            LF+KM++DDGVRASCWTYNILIDGL+KN R  AAYTLFCDLKKK NNFVDGVTYSIV LH
Sbjct: 384  LFQKMIEDDGVRASCWTYNILIDGLFKNGRALAAYTLFCDLKKKSNNFVDGVTYSIVILH 443

Query: 1399 LCRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHIREGNLVPS 1578
            LCRE++++EAL+LVEEME RGF VDLVTITSL+IA+YK G WD+ E+L++HIR+ NLVP 
Sbjct: 444  LCREDRLDEALKLVEEMEARGFTVDLVTITSLLIAIYKEGHWDYTERLMKHIRDSNLVPI 503

Query: 1579 LIKWKSAMEAAMTKPQSKKRDFTPLFPS---LSDVVEILNLKKADKD---GEDNTDKFND 1740
            +I+WK +MEA M  PQS+++DFTP+FPS     D++ + NL  A+ D   G ++ +    
Sbjct: 504  IIRWKDSMEATMKAPQSREKDFTPIFPSNRNFGDILGLENLTDAETDTALGAEDAEIHYQ 563

Query: 1741 EKDEWSSSPYMDKLAN----SGQPSLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLAKG 1908
            E D WSSSPYMD LAN        S T++++ G R+  K  DSFDID++NT+LSIFLAKG
Sbjct: 564  ESDPWSSSPYMDMLANKVSSQSNSSRTFSLTGGKRIDTKSADSFDIDMVNTFLSIFLAKG 623

Query: 1909 KLSLACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDIATY 2088
            KLS+ACKLFEIFT++G +P+SYTYNS+MSSF+KKGYF EAWG+L  MG E++ PSD+ATY
Sbjct: 624  KLSMACKLFEIFTDMGADPVSYTYNSMMSSFVKKGYFNEAWGILQEMG-EKVCPSDVATY 682

Query: 2089 NVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQMKR 2268
            NV+IQGLGKMGRADLA  VL+KL  +GGYLDIVMYNTLINALGK GR  E  +LF+QMK 
Sbjct: 683  NVIIQGLGKMGRADLADAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEVNKLFQQMKN 742

Query: 2269 SGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIERRRY 2448
            SG+NPDV TYNTLIEVHAK G++K++Y FL+MML+AGC PN VTDT LD+LEKEIE+ RY
Sbjct: 743  SGINPDVVTYNTLIEVHAKAGQLKQSYKFLRMMLEAGCAPNQVTDTTLDFLEKEIEKLRY 802

Query: 2449 DMA 2457
              A
Sbjct: 803  QKA 805


>gb|EYU39754.1| hypothetical protein MIMGU_mgv1a001778mg [Mimulus guttatus]
          Length = 760

 Score =  984 bits (2545), Expect = 0.0
 Identities = 513/790 (64%), Positives = 614/790 (77%), Gaps = 10/790 (1%)
 Frame = +1

Query: 58   MSVLRDSSAFLL-PNTARFRFTTAAEKS--AESAPISELGDLLVVAAIAKTLSKPGGIHA 228
            M++   S++FL  P + + RFTTAA+ +  A S   SELG+LL+VAAIAKTLS PGGIH+
Sbjct: 1    MALFHHSASFLRRPLSPKSRFTTAAKSTNGAVSGTASELGNLLIVAAIAKTLSNPGGIHS 60

Query: 229  LEKNGDAIPXXXXXXXXXXXXXXXXXXXXXGFFRWCSLRQNYKHSARAYSQMFKVLCFLT 408
            LEK+ D+IP                      FFR                          
Sbjct: 61   LEKDADSIPLSENLVLQVLRRGSLDAARKLDFFRC------------------------- 95

Query: 409  HQHHDDVLELLAAMRH--DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSC 582
                 D+LEL+A+M    D  ALDS TLK+IL+ FI++GKYDSALEVLD  ER+LI T+ 
Sbjct: 96   -----DILELVASMASGGDAAALDSPTLKLILNSFIRSGKYDSALEVLDCVERDLIQTTS 150

Query: 583  FSADVYSPVLVALVMKNQISIALSVFSKLLDSALFAKNGENIVIPDAIACNEVLVGLKKA 762
             S D+YSPV+VAL+ KNQISIALS+F KLLDS+       +  IPDAIACNE+LV LKK+
Sbjct: 151  LSPDIYSPVIVALIRKNQISIALSIFLKLLDSS-------SSEIPDAIACNELLVALKKS 203

Query: 763  DMRDEFRQLYSNLRKTKLYPMDRWGYNICIHALGCWGDLSTALALFKEMK-ERNGPFEPD 939
            DM+DEF+Q+++ LRKTKLYP+DR GYNICIH LGCWGDLST+L LFKEMK E N    PD
Sbjct: 204  DMKDEFKQVFAKLRKTKLYPLDRCGYNICIHTLGCWGDLSTSLNLFKEMKRETNIRLNPD 263

Query: 940  LCTYNSLINVLCLLGKVKDALIVWEELKASSGYEPDEFTYRIMIQGCAKSYRMNDALRIF 1119
            LCTYNSLI+VLCLLGKVKDALIVWEELKASSG+EPD FTYRI+IQGC KSYR+N+A++IF
Sbjct: 264  LCTYNSLIHVLCLLGKVKDALIVWEELKASSGHEPDAFTYRILIQGCCKSYRINEAVKIF 323

Query: 1120 SEMQYNGIRAGTVVYNSLLDGLMKSKKLMEACDLFEKMVDDDGVRASCWTYNILIDGLYK 1299
            SEMQYNGI+  TVVYNSLLDGL+KS+KL+EAC+LFEKM DDDG RA+CWTYNILIDGLYK
Sbjct: 324  SEMQYNGIKTETVVYNSLLDGLLKSRKLVEACNLFEKMADDDGARATCWTYNILIDGLYK 383

Query: 1300 NRREEAAYTLFCDLKKKGNNFVDGVTYSIVTLHLCRENQVEEALQLVEEMEGRGFVVDLV 1479
            N R EAAYT+FCDLK+KGNNF+DGV+YSIV L LCRE+Q+EEA++LVEEME RGFVVDLV
Sbjct: 384  NGRAEAAYTMFCDLKRKGNNFIDGVSYSIVVLQLCREDQLEEAVRLVEEMEARGFVVDLV 443

Query: 1480 TITSLVIALYKRGRWDWIEKLLRHIREGNLVPSLIKWKSAMEAAMTKPQSKKRDFTPLFP 1659
            TITSL+ ALY+RG+WD  E L+++IR+ NLV SL+KWKS+MEA++  PQSKKRDFTP FP
Sbjct: 444  TITSLLSALYRRGQWDSTEGLMKYIRDRNLVSSLLKWKSSMEASLRSPQSKKRDFTPFFP 503

Query: 1660 SLSDVVEILNLKKADKDGEDNTDKFNDEKDEWSSSPYMDKLANS----GQPSLTYAISKG 1827
             +S++ EILNL K+    E + +    EKDEWSSSPYMD+LAN       PS ++++S+G
Sbjct: 504  PISNIAEILNLAKS---SETHCEGVEVEKDEWSSSPYMDELANKFVSRDTPSQSFSMSRG 560

Query: 1828 VRVLGKGDDSFDIDIMNTYLSIFLAKGKLSLACKLFEIFTNLGVNPLSYTYNSIMSSFIK 2007
            VRV+ KG+DSFDID+           GKLSLACKLFEIFT++GV+P SYTYNSIMSSF+K
Sbjct: 561  VRVMAKGEDSFDIDM-----------GKLSLACKLFEIFTDMGVDPTSYTYNSIMSSFVK 609

Query: 2008 KGYFKEAWGVLHAMGEEEISPSDIATYNVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIV 2187
            KGYFKEAWGVLHAMG E ++P+D+ATYNV+IQGLGKMGRADLA  VLEKL++EGGYLDIV
Sbjct: 610  KGYFKEAWGVLHAMG-ETVNPTDVATYNVIIQGLGKMGRADLANSVLEKLREEGGYLDIV 668

Query: 2188 MYNTLINALGKGGRFGEAVELFEQMKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMM 2367
            MYNTLINALGK GR  EA ELF QMK SG+NPDV TYNTLIEVH+K G++K+AY FL+ M
Sbjct: 669  MYNTLINALGKDGRLDEANELFGQMKSSGINPDVVTYNTLIEVHSKAGRLKDAYKFLRKM 728

Query: 2368 LDAGCVPNHV 2397
            LD GC PNHV
Sbjct: 729  LDDGCAPNHV 738


>ref|XP_002272556.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570
            [Vitis vinifera]
          Length = 792

 Score =  912 bits (2356), Expect = 0.0
 Identities = 465/790 (58%), Positives = 596/790 (75%), Gaps = 11/790 (1%)
 Frame = +1

Query: 121  TAAEKSAESAPISELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXX 300
            T +  +A  A + +LGD+L+VA+I+KTLS+ G       + ++IP               
Sbjct: 6    TLSSSAAAGAGV-KLGDMLLVASISKTLSERG---TRSPDLESIPISESLVVQILGRNSI 61

Query: 301  XXXXXXGFFRWCSLRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSS 480
                   FFRWCS R NYKHS  AYS +F+++C    +  D V  L+++M+ DG+ +   
Sbjct: 62   DVFRKVEFFRWCSFRHNYKHSVGAYSHIFRIVCRAGAEFLDQVPLLMSSMKDDGVVVGQE 121

Query: 481  TLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVF 660
            T K++LD  I+AGK+DSALE+LD+ E   +GT   ++ VY  VLVAL+ KNQ+ +AL +F
Sbjct: 122  TFKLLLDSLIRAGKFDSALEILDHIEE--LGTG-LNSYVYDSVLVALIRKNQLGLALPLF 178

Query: 661  SKLLDSALFAKNGENIVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGY 840
             KLL      +    + +P++ ACN++LV L+KADM+ EFR ++  LR  K + +D  GY
Sbjct: 179  FKLLGGD---EGQGGVPVPESNACNQLLVALRKADMKIEFRNVFEKLRAKKDFDLDTQGY 235

Query: 841  NICIHALGCWGDLSTALALFKEMKERN---GPFEPDLCTYNSLINVLCLLGKVKDALIVW 1011
            NICIHA GCWGDL TAL LFKEMK+++     F PDLCTYNSLI VLCL+GKVKDALIVW
Sbjct: 236  NICIHAFGCWGDLGTALNLFKEMKDKSLNSSSFGPDLCTYNSLIRVLCLVGKVKDALIVW 295

Query: 1012 EELKASSGYEPDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMK 1191
            EELK S G+EPD FTYRI+IQGC+KSYRM+DA+RIF+EMQYNG    T+VYN+LLDGL K
Sbjct: 296  EELKGS-GHEPDAFTYRILIQGCSKSYRMDDAMRIFNEMQYNGFCPDTIVYNTLLDGLFK 354

Query: 1192 SKKLMEACDLFEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDG 1371
            ++K+MEAC +FEKMV+D GVRASCWT+NI+I GL++N R  A YTLFCDLKKKG  FVDG
Sbjct: 355  ARKVMEACQVFEKMVED-GVRASCWTHNIVICGLFRNGRAAAGYTLFCDLKKKGK-FVDG 412

Query: 1372 VTYSIVTLHLCRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRH 1551
            +TYSIV L LCRE Q+EEALQLVEEME RGFVVDLVTITSL+I  +K+GRWDW E+L++H
Sbjct: 413  ITYSIVVLQLCREGQLEEALQLVEEMEARGFVVDLVTITSLLIGFHKQGRWDWTERLMKH 472

Query: 1552 IREGNLVPSLIKWKSAMEAAMTKPQSKKRDFTPLFPS---LSDVVEILNLKKADKDGEDN 1722
            IR+GNLVP+++ WK+ MEA M  PQS+++D+TP+FPS   LS+++ +++    + DG   
Sbjct: 473  IRDGNLVPNVLNWKANMEAYMKAPQSRRKDYTPMFPSEGNLSEIMSLISSADTEMDGSPG 532

Query: 1723 TDK-FNDEKDEWSSSPYMDKLANSGQP----SLTYAISKGVRVLGKGDDSFDIDIMNTYL 1887
            +++     +D+WSSSPYMD+LA+  +     S   ++S+G RV  KG DSFDID++NTYL
Sbjct: 533  SEEDVAQHEDQWSSSPYMDQLASQLKSIDVSSQLLSLSRGQRVQAKGIDSFDIDMVNTYL 592

Query: 1888 SIFLAKGKLSLACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEIS 2067
            SIFLAKGKLSLACKLFEIF+N+GV+P+ YTYNS+M++F+KKGYF EAWGV H MGE+ + 
Sbjct: 593  SIFLAKGKLSLACKLFEIFSNMGVDPVIYTYNSMMTAFVKKGYFNEAWGVFHEMGEK-VC 651

Query: 2068 PSDIATYNVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVE 2247
            P DIATYNV+IQGLGKMGRADLA  VL+ L  +GGYLDIVMYNTLINALGK GR  EA +
Sbjct: 652  PPDIATYNVIIQGLGKMGRADLASAVLDMLMKQGGYLDIVMYNTLINALGKAGRIDEATK 711

Query: 2248 LFEQMKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEK 2427
            LFEQM+ SG+NPDV T+NTLIE+HAK G++K AY FLK+MLDAGC PNHVTDT LD+L K
Sbjct: 712  LFEQMRSSGINPDVVTFNTLIEIHAKAGQLKAAYKFLKLMLDAGCSPNHVTDTTLDFLGK 771

Query: 2428 EIERRRYDMA 2457
            EIE+ RY  A
Sbjct: 772  EIEKLRYKKA 781


>ref|XP_007051367.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao] gi|508703628|gb|EOX95524.1| Pentatricopeptide
            repeat-containing protein, putative [Theobroma cacao]
          Length = 807

 Score =  901 bits (2329), Expect = 0.0
 Identities = 466/791 (58%), Positives = 592/791 (74%), Gaps = 21/791 (2%)
 Frame = +1

Query: 148  APISELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXXGFF 327
            +P   LG++L++A++ KTLS+  G   L+ N  +IP                      FF
Sbjct: 18   SPSIHLGNILLIASLTKTLSE-SGTRNLDPN--SIPISEPLVIQILRKHSLEPSKKLDFF 74

Query: 328  RWC-SLRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMILDG 504
             WC S++ N+KHSA  YS +F+ LC       ++V  LL AM+ DG+ +DS T K +LD 
Sbjct: 75   NWCRSVKPNFKHSAVTYSHIFRTLC--RSGFVEEVPNLLFAMKEDGVLVDSDTFKFLLDA 132

Query: 505  FIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLDSAL 684
            FI++GK+DSALE+LD+ E    G    +  VY  VLVAL+ K+Q+ +ALS+F KLL++  
Sbjct: 133  FIRSGKFDSALEILDFMEELGAG---LNLRVYDSVLVALIRKDQVGLALSLFFKLLEACN 189

Query: 685  FAKNGENI--VIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNICIHA 858
               +G ++   +P +IA NE+LV L+KA MR EF+Q++  LR+ + +  D  GYNICIH+
Sbjct: 190  GNDDGNSVDSSLPGSIAINELLVALRKAHMRREFKQVFDILREKREFEFDTCGYNICIHS 249

Query: 859  LGCWGDLSTALALFKEMKERN---GPFEPDLCTYNSLINVLCLLGKVKDALIVWEELKAS 1029
             GCWGDL  +L LFKEMKE+    G F PDLCTYNSLI+VLCL+GKVKDAL+VWEELK S
Sbjct: 250  FGCWGDLGASLKLFKEMKEKEKSFGSFGPDLCTYNSLIDVLCLVGKVKDALVVWEELKVS 309

Query: 1030 SGYEPDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKSKKLME 1209
             G+EPD FTYRI+IQGC+KSYRM+DA +IFSEMQYNG    TVVYNSLL+GL K++K+ME
Sbjct: 310  -GHEPDAFTYRILIQGCSKSYRMDDATKIFSEMQYNGFAMDTVVYNSLLNGLFKARKVME 368

Query: 1210 ACDLFEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGVTYSIV 1389
            AC  FEKMV D GVRASCWTYNILIDGL++N R EAAYTLFCDLKKKG  FVDG+TYSIV
Sbjct: 369  ACQFFEKMVQD-GVRASCWTYNILIDGLFRNGRAEAAYTLFCDLKKKGQ-FVDGITYSIV 426

Query: 1390 TLHLCRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHIREGNL 1569
             L LCRE Q+E AL+LVEEME RGF+VDLVTITSL+I  +K+GRWDW E+L++HIR+GNL
Sbjct: 427  VLQLCREGQLEGALRLVEEMEARGFIVDLVTITSLLIGFHKQGRWDWTERLMKHIRDGNL 486

Query: 1570 VPSLIKWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLKKA-------DKDGEDNTD 1728
            VP+++KWK+ MEA+M  P   ++D+TPLFPS  D  EI+NL  +       + D ED  +
Sbjct: 487  VPNVLKWKANMEASMKNPPKNRKDYTPLFPSKGDFREIMNLLGSVGQAMGTNLDSEDCDE 546

Query: 1729 KFND----EKDEWSSSPYMDKLANSGQP----SLTYAISKGVRVLGKGDDSFDIDIMNTY 1884
            K  +    + D+WSSSPYMD+LAN G+     S  +++ +G RV  KG  SFD+D++NT+
Sbjct: 547  KDQEKPSIDTDQWSSSPYMDQLANQGKSTERSSQLFSLIRGQRVQEKGIGSFDVDMVNTF 606

Query: 1885 LSIFLAKGKLSLACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEI 2064
            LSIFLAKGKLSLACKLFE+FT++GV+P+SYTYNSIMSSF+KKGYF EAWGVL+ M +E++
Sbjct: 607  LSIFLAKGKLSLACKLFEVFTDMGVDPVSYTYNSIMSSFVKKGYFNEAWGVLNEM-DEKV 665

Query: 2065 SPSDIATYNVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAV 2244
             P+DIATYN++IQGLGKMGRAD+A  VL+KL  +GGYLD+VMYNTL+NALGK GR  EA 
Sbjct: 666  CPADIATYNLIIQGLGKMGRADIASSVLDKLMKQGGYLDVVMYNTLVNALGKAGRVDEAS 725

Query: 2245 ELFEQMKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLE 2424
            +LFEQM+ SG+NPDV TYNTLIEVH K G++++AY FLKMMLDAGC PNHVTDT LD L 
Sbjct: 726  KLFEQMRTSGINPDVITYNTLIEVHTKAGQLQDAYKFLKMMLDAGCSPNHVTDTILDNLG 785

Query: 2425 KEIERRRYDMA 2457
            KEIE+ R   A
Sbjct: 786  KEIEKMRLQKA 796


>gb|EPS65453.1| hypothetical protein M569_09325, partial [Genlisea aurea]
          Length = 770

 Score =  887 bits (2291), Expect = 0.0
 Identities = 452/779 (58%), Positives = 570/779 (73%), Gaps = 20/779 (2%)
 Frame = +1

Query: 169  DLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXXGFFRWCSLRQ 348
            ++LVVA+I K LSK G +  LEKN D+IP                      FFRWCS R 
Sbjct: 1    NILVVASITKILSKFGALQYLEKNADSIPLSEDVVLQIVHHRSLVISKKLEFFRWCSSRP 60

Query: 349  NYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMILDGFIKAGKYD 528
            +Y H+A AYS+M + +    +QHH++V+ELLA M+ DG+ LDS TLK IL+G I+A K+D
Sbjct: 61   DYNHTANAYSEMLRAIFRFPNQHHNNVIELLALMKRDGVILDSDTLKRILNGLIRAQKFD 120

Query: 529  SALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLDSALFAKNGENI 708
             AL+VLDY E++ +     S DVYSPVLVALV K+QISIAL VF KLL S          
Sbjct: 121  YALDVLDYIEKDSVIAGNLSPDVYSPVLVALVRKDQISIALPVFFKLLHSQF------ED 174

Query: 709  VIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNICIHALGCWGDLSTA 888
             IPDA ACNE+L GLKK  M++EFR++++ LR+T  YP DRWGYNICIH+ GCWGDLSTA
Sbjct: 175  YIPDAFACNELLAGLKKKKMKNEFREVFAKLRETARYPSDRWGYNICIHSFGCWGDLSTA 234

Query: 889  LALFKEMKERNGPFEPDLCTYNSLINVLCLLGKVKDALIVWEELKASSGYEPDEFTYRIM 1068
            L+LFKEMK+R G   PDLCTYNSLI V C LG++ DAL++W+ELK SSGYEPD FTYRI+
Sbjct: 235  LSLFKEMKDRGGSVYPDLCTYNSLIQVFCSLGRLNDALVIWKELKNSSGYEPDRFTYRIL 294

Query: 1069 IQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKSKKLMEACDLFEKMVDDDG 1248
            IQGC+KSYR+NDA+ IF+EMQYNGIRA TV YNSL+DGL KS+KL  AC  FE+MV D+ 
Sbjct: 295  IQGCSKSYRINDAMTIFNEMQYNGIRAETVTYNSLMDGLFKSRKLTTACSFFERMV-DNR 353

Query: 1249 VRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGVTYSIVTLHLCRENQVEEA 1428
            VRASC TYNI+IDGLY+N R EAAY LF DLK+KGN FVD +++SIV LHLC+E +++EA
Sbjct: 354  VRASCSTYNIIIDGLYRNGRPEAAYALFSDLKRKGNQFVDVISFSIVVLHLCKEERLDEA 413

Query: 1429 LQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHIREGNLVPSLIKWKSAMEA 1608
            L+LVEEME RGFVVDLVT+TSL++ALY+ G  D+ EKL++H+R GNL+PS+ KWKSA+E+
Sbjct: 414  LRLVEEMESRGFVVDLVTVTSLLMALYRAGHSDFTEKLMKHVRNGNLIPSVFKWKSALES 473

Query: 1609 AMTKPQSKKRDFTPLFPSLSDVVEILNLKK--ADKDGEDNTDKFNDE----KDEWSSSPY 1770
            ++  PQ K+RDFTP+FP +  + EIL   K  A    ED T K  DE     DEWSSSPY
Sbjct: 474  SLMSPQGKERDFTPMFPEVRSIDEILEATKSVASTRSEDGTVKNGDEGEERADEWSSSPY 533

Query: 1771 MDKLAN--SGQ---PSLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLAKGKLSLACKLF 1935
            MD+LA   SG     S  + + + VR +G+G++SFD+D+ NTYLS+    GKLS ACK+ 
Sbjct: 534  MDELARNLSGDHRYSSHFFTMFRAVRAVGRGEESFDVDMANTYLSLLSGTGKLSSACKVL 593

Query: 1936 EIFTNLGVNPLS---------YTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDIATY 2088
            E+ +  GV P S         Y YNS+ SSFIKKGY KEAWG+L  +   +  P+D+ATY
Sbjct: 594  ELLSRGGVGPNSESSLANVFCYGYNSLTSSFIKKGYVKEAWGIL--LRHFDAGPADVATY 651

Query: 2089 NVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQMKR 2268
            +++++GLGKMGRADLA+ V +KL  +GGYLD VMYNTLI+ LGK GR  +A  +F +M+ 
Sbjct: 652  SLIVRGLGKMGRADLARSVRDKLTRDGGYLDAVMYNTLIHTLGKAGRLEDARNVFGEMRA 711

Query: 2269 SGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIERRR 2445
            SG+ PDV TYNTLIEVH+K G V+EA  +LK MLD GC PNHVTDT LDYLEKEI +++
Sbjct: 712  SGIIPDVVTYNTLIEVHSKAGDVEEANRWLKTMLDNGCAPNHVTDTTLDYLEKEIRKQK 770


>ref|XP_002515124.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545604|gb|EEF47108.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 898

 Score =  872 bits (2252), Expect = 0.0
 Identities = 457/799 (57%), Positives = 584/799 (73%), Gaps = 15/799 (1%)
 Frame = +1

Query: 94   PNTARFRFTTAAEKSAESAPISELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXX 273
            P     RF+  +  S+ S+  ++L  +L+VA + K LS+  G+  L+   D IP      
Sbjct: 28   PRPLGIRFSLCSSLSSSSS--NQLESILLVAFLNKALSE-SGVRNLDP--DFIPLSEPLI 82

Query: 274  XXXXXXXXXXXXXXXGFFRWCSLRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMR 453
                            FF+WCS   NYKHSA  YS MF+ +C     + ++V  LL +M+
Sbjct: 83   LQILRQNSLDASKKIEFFKWCSFSHNYKHSACVYSHMFRTVC--NAGYFEEVRSLLNSMK 140

Query: 454  HDGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKN 633
             D   + + T K +LD FI  G +D ALE+LD  E   +GT+  +  +Y  VLVAL  KN
Sbjct: 141  DDCAIVGTGTFKFLLDTFINLGNFDFALELLDVMEE--LGTN-LNPHMYDSVLVALTRKN 197

Query: 634  QISIALSVFSKLLDSALFAKNGENIVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTK 813
            QI +ALS+F KLL+++     G  + +P ++ACN +LV L+KADMR EF++++  L K  
Sbjct: 198  QIGLALSIFFKLLETSNDIDIG--VSVPGSVACNTLLVALRKADMRVEFKKVFDKL-KGM 254

Query: 814  LYPMDRWGYNICIHALGCWGDLSTALALFKEMKERNGPFE---PDLCTYNSLINVLCLLG 984
             + +D WGYNICIHA GCW DL TAL LFKEMKE++  F    PDLCTYNSLI +LC  G
Sbjct: 255  GFELDTWGYNICIHAFGCWSDLGTALRLFKEMKEKSKGFGSCCPDLCTYNSLIRLLCFSG 314

Query: 985  KVKDALIVWEELKASSGYEPDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVY 1164
            KVKDAL+V+EELK S G+EPD FTYRI+I+GC+KSYRMNDA +IFSEMQYNG    T VY
Sbjct: 315  KVKDALVVYEELKIS-GHEPDAFTYRIIIEGCSKSYRMNDATKIFSEMQYNGFVPDTTVY 373

Query: 1165 NSLLDGLMKSKKLMEACDLFEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLK 1344
            NSLLDG+ K++K+ EAC LFEKMV D GVRAS WTYNILIDGL KN R  A Y+LFCDLK
Sbjct: 374  NSLLDGMFKARKVTEACQLFEKMVQD-GVRASSWTYNILIDGLCKNGRSAAGYSLFCDLK 432

Query: 1345 KKGNNFVDGVTYSIVTLHLCRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRW 1524
            KKG  FVD +TYSI+ L LCRE Q++EAL LVEEME RGFVVDLVTITSL+IA +K+GRW
Sbjct: 433  KKGK-FVDAITYSIIVLLLCREGQLKEALSLVEEMEERGFVVDLVTITSLLIAFHKQGRW 491

Query: 1525 DWIEKLLRHIREGNLVPSLIKWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLKK-- 1698
            DW EKL++H+R+GNLVP+++ W++ MEA++  P+S+++D+TP+F S   + EI+N+ +  
Sbjct: 492  DWTEKLMKHVRDGNLVPNVLNWQADMEASLKNPRSRRKDYTPMFLSNGSLSEIINIIRYP 551

Query: 1699 ------ADKDGEDNTDKFNDEKDEWSSSPYMDKLANSGQP----SLTYAISKGVRVLGKG 1848
                   D +  ++ D  + E D+WSSSPYMD LAN  +     S ++++++G RV  KG
Sbjct: 552  DLKNHGLDDNAVEHGDNISAETDQWSSSPYMDHLANQVKSTDNCSQSFSLARGQRVQAKG 611

Query: 1849 DDSFDIDIMNTYLSIFLAKGKLSLACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEA 2028
             +SFDID++NT+LSIFLAKGKLS+ACKLFEIF+++GVNP+SYTYNSIMSSF+KKGYF EA
Sbjct: 612  VESFDIDMVNTFLSIFLAKGKLSVACKLFEIFSDMGVNPVSYTYNSIMSSFVKKGYFSEA 671

Query: 2029 WGVLHAMGEEEISPSDIATYNVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLIN 2208
            W VL+ MGE+ + PSDIATYN++IQGLGKMGRADLA  VL+KL  +GGYLDIVMYNTLIN
Sbjct: 672  WDVLNQMGEK-VCPSDIATYNLIIQGLGKMGRADLASSVLDKLMKQGGYLDIVMYNTLIN 730

Query: 2209 ALGKGGRFGEAVELFEQMKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVP 2388
            ALGK GR  E  +LFEQMK SG+NPDV TYNTLIEVH K G++K+AY FLKMMLDAGC+P
Sbjct: 731  ALGKAGRIDEVRKLFEQMKTSGINPDVVTYNTLIEVHTKAGRLKDAYKFLKMMLDAGCLP 790

Query: 2389 NHVTDTCLDYLEKEIERRR 2445
            NHVTDT LD+L KEIE++R
Sbjct: 791  NHVTDTTLDFLAKEIEKQR 809


>ref|XP_002302689.2| hypothetical protein POPTR_0002s18390g [Populus trichocarpa]
            gi|550345304|gb|EEE81962.2| hypothetical protein
            POPTR_0002s18390g [Populus trichocarpa]
          Length = 776

 Score =  871 bits (2250), Expect = 0.0
 Identities = 451/780 (57%), Positives = 578/780 (74%), Gaps = 15/780 (1%)
 Frame = +1

Query: 163  LGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXXGFFRWCSL 342
            +G++L+VA + KTLS+ G       + D+IP                      FF+WCS+
Sbjct: 1    MGNILLVAYLTKTLSESG---TRSLDPDSIPLSESLVLQILRRNSLDSSKKMEFFKWCSV 57

Query: 343  RQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMILDGFIKAGK 522
            R  YKHS   YSQMF  LC     + D+V +LL +M++DG+ + S T K++LD FI++GK
Sbjct: 58   RHIYKHSVSTYSQMFSTLC--RSGYLDEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGK 115

Query: 523  YDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLDSALFAKNGE 702
            +DSAL++LD+ E   +G++  +  +Y  ++VAL  KNQ+ +ALS+  KLL+++    N E
Sbjct: 116  FDSALDILDHMEE--LGSNP-NPHMYDSIIVALAKKNQVGLALSIMFKLLEAS--DGNEE 170

Query: 703  NIV---IPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNICIHALGCWG 873
            N V   +P ++ACN +LV L+  +M+ EF+ +++ LR    + ++ WGYNICIHA GCWG
Sbjct: 171  NAVGVSLPGSVACNALLVALRNGEMKVEFKTVFAKLRGKGGFELNTWGYNICIHAFGCWG 230

Query: 874  DLSTALALFKEMKERN---GPFEPDLCTYNSLINVLCLLGKVKDALIVWEELKASSGYEP 1044
            DL+T+L LFKEMKE++   G  +PDLCTYNSLI+VLCL GKVKDA+IV+EELK S G+EP
Sbjct: 231  DLTTSLRLFKEMKEKSLASGSLDPDLCTYNSLIHVLCLAGKVKDAVIVYEELKVS-GHEP 289

Query: 1045 DEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKSKKLMEACDLF 1224
            D FTYRI+IQGC KSY+M DA +IFSEMQYNG    TVVYNSLLDG+ K++K+MEAC LF
Sbjct: 290  DAFTYRILIQGCCKSYQMEDATKIFSEMQYNGFLPDTVVYNSLLDGMFKARKVMEACQLF 349

Query: 1225 EKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGVTYSIVTLHLC 1404
            EKMV D GVRASCWTYNILIDGL KN R EA Y LFC LKKKG  FVD VTYSIV L LC
Sbjct: 350  EKMVQD-GVRASCWTYNILIDGLCKNGRAEAGYNLFCGLKKKGQ-FVDAVTYSIVVLLLC 407

Query: 1405 RENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHIREGNLVPSLI 1584
            R+  +EEAL LVEEME RGFVVDL+TITSL+IA +K+GRWD  E+L++HIR+ NL+P+++
Sbjct: 408  RKGHLEEALHLVEEMEERGFVVDLITITSLLIAFHKQGRWDCTERLMKHIRDVNLLPNVL 467

Query: 1585 KWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNL----KKADKDGEDNTDKFND-EKD 1749
            KW++ MEA++  P   + D+TP+FPS   + EI++     K    DG    +K +  + D
Sbjct: 468  KWRADMEASLKNPPRSREDYTPMFPSTGGLQEIMSSISSPKSRSDDGATEDEKSSSADTD 527

Query: 1750 EWSSSPYMDKLANSGQP----SLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLAKGKLS 1917
            +WSSSPYMD LAN  +     S  +++++G RV  KG  SFDID++NT+LSIFLAKGKLS
Sbjct: 528  QWSSSPYMDHLANQAKSTDLSSQLFSLARGQRVQAKGAGSFDIDMVNTFLSIFLAKGKLS 587

Query: 1918 LACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDIATYNVV 2097
            LACKLFEIFT++GV+P+SYTYNSIMSSF+KKGYF  AW V + MGE+ + P DIATYN+V
Sbjct: 588  LACKLFEIFTDMGVDPVSYTYNSIMSSFVKKGYFNRAWDVFNEMGEK-VCPPDIATYNLV 646

Query: 2098 IQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQMKRSGM 2277
            IQGLGKMGRADLA  VL+KL  +GGYLDIVMYNTLI+ALGK GR  EA  LFEQMK SG+
Sbjct: 647  IQGLGKMGRADLASSVLDKLMKQGGYLDIVMYNTLIDALGKAGRIDEANNLFEQMKISGL 706

Query: 2278 NPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIERRRYDMA 2457
            NPDV TYN +IEVH+K G++K+AY FLKMMLDAGC+PNHVTDT LD+L KEIE+ RY  A
Sbjct: 707  NPDVVTYNIMIEVHSKTGRLKDAYKFLKMMLDAGCLPNHVTDTTLDFLAKEIEKLRYQKA 766


>gb|EXC13626.1| hypothetical protein L484_019583 [Morus notabilis]
          Length = 788

 Score =  870 bits (2248), Expect = 0.0
 Identities = 448/783 (57%), Positives = 582/783 (74%), Gaps = 16/783 (2%)
 Frame = +1

Query: 157  SELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXXGFFRWC 336
            S+L D+L+VA++ KTLS+    +  +    +IP                      FF W 
Sbjct: 18   SQLADVLLVASLTKTLSESSTRYLPDPR--SIPLSEPILLQILRNNSLHISKKLDFFTWF 75

Query: 337  SLRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMILDGFIKA 516
            SL  + K SA +YSQ+ + LC   H H  +   LL +MR +G+ +DS T K +LD FI++
Sbjct: 76   SLNSDLKPSAHSYSQVLRALCREGHLH--EASNLLGSMRQNGVIIDSWTFKTLLDTFIRS 133

Query: 517  GKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLDSALFAKN 696
            GK+D ALE+LD  E   +G +  ++ +Y  VL+ALV K+Q+S ALS+F K+L+ +     
Sbjct: 134  GKFDFALEILDTMEE--LGVT-LNSHMYDSVLIALVRKDQLSFALSIFFKILEDSSH--- 187

Query: 697  GENIVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNICIHALGCWGD 876
                 +P +I CNE+LV LKK+DMR EF+Q++  +R+ K + M+ WGYNICIHA G WGD
Sbjct: 188  -----VPSSIGCNELLVALKKSDMRVEFKQVFDGIREKKGFGMNVWGYNICIHAFGFWGD 242

Query: 877  LSTALALFKEMKERNGPFEPDLCTYNSLINVLCLLGKVKDALIVWEELKASSGYEPDEFT 1056
            L T+L+L++EMK   GP   DLCTYNSLI+VLC  GKVKDAL+V+EELK S G++PD FT
Sbjct: 243  LGTSLSLYREMKVSVGP---DLCTYNSLIHVLCFFGKVKDALVVYEELKGS-GHQPDRFT 298

Query: 1057 YRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKSKKLMEACDLFEKMV 1236
            YRI+IQGC KSYR+++A +IF+EM+YNG  A TVVYNSL+DGL+K++K+ EAC+LFEKM 
Sbjct: 299  YRILIQGCCKSYRIDNAEKIFNEMEYNGHCADTVVYNSLIDGLLKARKVSEACELFEKMT 358

Query: 1237 DDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGVTYSIVTLHLCRENQ 1416
             D GVRAS WTYN LIDGL+KN R EA YT+FCDLKKKG  FVDG+TYSIV L LCRE  
Sbjct: 359  QD-GVRASSWTYNTLIDGLFKNERAEAGYTMFCDLKKKGQ-FVDGITYSIVVLQLCREGL 416

Query: 1417 VEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHIREGN-LVPSLIKWK 1593
            +EEAL LVEEMEGRGFVVDLVTITSL++ LYK+GRWDW ++L++HIR+GN L+P++++WK
Sbjct: 417  LEEALGLVEEMEGRGFVVDLVTITSLLVGLYKQGRWDWTDRLMKHIRDGNNLLPNVLRWK 476

Query: 1594 SAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLKKADKDG------EDNTDKFNDEK--- 1746
              +EA++  PQSK++D+TP+FPS  +  EI++L ++           DN D  +DE    
Sbjct: 477  IDLEASLKNPQSKRKDYTPMFPSKDEFSEIMSLIRSANATMKAQLVPDNVDVKDDESVSS 536

Query: 1747 --DEWSSSPYMDKLAN----SGQPSLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLAKG 1908
              D+WSSSPYMD+L N    +G+ S  +++S+G RV  KG DSFDID++NT+LSIFLAKG
Sbjct: 537  DIDQWSSSPYMDQLTNQVLSNGRSSQLFSLSRGRRVQAKGGDSFDIDMVNTFLSIFLAKG 596

Query: 1909 KLSLACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDIATY 2088
            KLSLACKLFEIFT++GVNP+SYTYNS+M+SF+KKGYF EAW +L  MGE+ + P+DIATY
Sbjct: 597  KLSLACKLFEIFTDMGVNPVSYTYNSMMTSFVKKGYFDEAWNILGEMGEK-VCPADIATY 655

Query: 2089 NVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQMKR 2268
            NV+IQ LGKMGRADLA  VL+KL ++GGYLD+VMYNTLINALGK GR  E  + F+QM+ 
Sbjct: 656  NVIIQSLGKMGRADLASAVLDKLIEQGGYLDLVMYNTLINALGKAGRIDEVNKFFDQMRA 715

Query: 2269 SGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIERRRY 2448
            SG+NPDV TYNTLIEVH K G++K+AY FLKMMLDAGC+PNHVTDT LD+L KEIE+  Y
Sbjct: 716  SGINPDVITYNTLIEVHTKAGQLKDAYKFLKMMLDAGCIPNHVTDTTLDFLGKEIEKESY 775

Query: 2449 DMA 2457
              A
Sbjct: 776  QKA 778


>ref|XP_006386676.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550345301|gb|ERP64473.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 776

 Score =  869 bits (2246), Expect = 0.0
 Identities = 450/780 (57%), Positives = 578/780 (74%), Gaps = 15/780 (1%)
 Frame = +1

Query: 163  LGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXXGFFRWCSL 342
            +G++L+VA + KTLS+ G       + D+IP                      FF+WCS+
Sbjct: 1    MGNILLVAYLTKTLSESG---TRSLDPDSIPLSEYLVLQILRRNSLDSSKKMEFFKWCSV 57

Query: 343  RQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMILDGFIKAGK 522
            R  YKHS   YSQMF  LC     + ++V +LL +M++DG+ + S T K++LD FI++GK
Sbjct: 58   RHIYKHSVSTYSQMFSTLC--RSGYLEEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGK 115

Query: 523  YDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLDSALFAKNGE 702
            +DSAL++LD+ E   +G++  +  +Y  ++VAL  KNQ+ +ALS+  KLL+++    N E
Sbjct: 116  FDSALDILDHMEE--LGSNP-NPHMYDSIIVALAKKNQVGLALSIMFKLLEAS--DGNEE 170

Query: 703  NIV---IPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNICIHALGCWG 873
            N V   +P ++ACN +LV L+  +M+ EF+ +++ LR    + ++ WGYNICIHA GCWG
Sbjct: 171  NAVRVSLPGSVACNALLVALRNGEMKVEFKTVFAKLRGKVGFKLNTWGYNICIHAFGCWG 230

Query: 874  DLSTALALFKEMKERN---GPFEPDLCTYNSLINVLCLLGKVKDALIVWEELKASSGYEP 1044
            DL+T+L LFKEMKE++   G  +PDLCTYNSLI+VLCL GKVKDA+IV+EELK S G+EP
Sbjct: 231  DLTTSLRLFKEMKEKSLASGSLDPDLCTYNSLIHVLCLAGKVKDAVIVYEELKVS-GHEP 289

Query: 1045 DEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKSKKLMEACDLF 1224
            D FTYRI+IQGC KSY+M DA +IFSEMQYNG    TVVYNSLLDG+ K++K+MEAC LF
Sbjct: 290  DAFTYRILIQGCCKSYQMEDATKIFSEMQYNGFLPDTVVYNSLLDGMFKARKVMEACQLF 349

Query: 1225 EKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGVTYSIVTLHLC 1404
            EKMV D GVRASCWTYNILIDGL KN R EA Y LFC LKKKG  FVD VTYSIV L LC
Sbjct: 350  EKMVQD-GVRASCWTYNILIDGLCKNGRAEAGYNLFCGLKKKGQ-FVDAVTYSIVVLLLC 407

Query: 1405 RENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHIREGNLVPSLI 1584
            R+  +EEAL LVEEME RGFVVDL+TITSL+IA +K+GRWD  E+L++HIR+ NL+P+++
Sbjct: 408  RKGHLEEALHLVEEMEERGFVVDLITITSLLIAFHKQGRWDCTERLMKHIRDVNLLPNVL 467

Query: 1585 KWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNL----KKADKDGEDNTDKFND-EKD 1749
            KW++ MEA++  P   + D+TP+FPS   + EI++     K    DG    +K +  + D
Sbjct: 468  KWRADMEASLKNPPRSREDYTPMFPSTGGLQEIMSSISSPKSRSDDGATEDEKSSSADTD 527

Query: 1750 EWSSSPYMDKLANSGQP----SLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLAKGKLS 1917
            +WSSSPYMD LAN  +     S  +++++G RV  KG  SFDID++NT+LSIFLAKGKLS
Sbjct: 528  QWSSSPYMDHLANQAKSTDLSSQLFSLARGQRVQAKGAGSFDIDMVNTFLSIFLAKGKLS 587

Query: 1918 LACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDIATYNVV 2097
            LACKLFEIFT++GV+P+SYTYNSIMSSF+KKGYF  AW V + MGE+ + P DIATYN+V
Sbjct: 588  LACKLFEIFTDMGVDPVSYTYNSIMSSFVKKGYFNRAWDVFNEMGEK-VCPPDIATYNLV 646

Query: 2098 IQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQMKRSGM 2277
            IQGLGKMGRADLA  VL+KL  +GGYLDIVMYNTLI+ALGK GR  EA  LFEQMK SG+
Sbjct: 647  IQGLGKMGRADLASSVLDKLMKQGGYLDIVMYNTLIDALGKAGRIDEANNLFEQMKISGL 706

Query: 2278 NPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIERRRYDMA 2457
            NPDV TYN +IEVH+K G++K+AY FLKMMLDAGC+PNHVTDT LD+L KEIE+ RY  A
Sbjct: 707  NPDVVTYNIMIEVHSKTGRLKDAYKFLKMMLDAGCLPNHVTDTTLDFLAKEIEKLRYQKA 766


>ref|XP_004308750.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Fragaria vesca subsp. vesca]
          Length = 789

 Score =  864 bits (2232), Expect = 0.0
 Identities = 451/786 (57%), Positives = 567/786 (72%), Gaps = 19/786 (2%)
 Frame = +1

Query: 157  SELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXXGFFRWC 336
            +ELGD+L+VA+I KTLS+ G  +  +     +P                      FF+WC
Sbjct: 17   AELGDILLVASITKTLSQSGTRNLPQP----LPLTEPLLLQILRTQSLHPSKKLDFFKWC 72

Query: 337  SLRQNYKHSARAYSQMFKVLC---FLTHQHHDDVLELLAAMRHDGLALDSSTLKMILDGF 507
            SL  +   S RA+S +    C   FL      ++ ELL  MR D LA+DS T K +LD F
Sbjct: 73   SLTHSIPPSPRAFSHVLHTACRAGFLA-----EIPELLTIMRRDSLAVDSGTFKSLLDAF 127

Query: 508  IKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLDSALF 687
            I+ GK+D A+E+LD  +      +  +AD+Y+ VLVALV K Q+ +A+S+  +LL+    
Sbjct: 128  IREGKFDMAIEILDTMQEV---NAELNADMYNSVLVALVRKGQLRLAMSILVRLLEG--- 181

Query: 688  AKNGENIVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNICIHALGC 867
               G    +P  IACNE+LVGL+K DMR EF+Q+Y  LR  + + MD WGYNICIHA GC
Sbjct: 182  ---GSCDQVPSCIACNELLVGLRKGDMRVEFKQVYDKLRGNEWFEMDTWGYNICIHAFGC 238

Query: 868  WGDLSTALALFKEMKERNGPFE-PDLCTYNSLINVLCLLGKVKDALIVWEELKASSGYEP 1044
            WGDL T+L+LFKEMK+ N     PDL TYNSLI+VLCL+GKV DA+ VWEELK S G+EP
Sbjct: 239  WGDLGTSLSLFKEMKDLNSDSVFPDLSTYNSLIHVLCLVGKVDDAITVWEELKCS-GHEP 297

Query: 1045 DEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKSKKLMEACDLF 1224
            D  TYRI+IQGC K YR+ +A RIFSEMQ NG    TVVYNSL+DGL K++K+ E C +F
Sbjct: 298  DAITYRILIQGCCKCYRIEEATRIFSEMQNNGYNPDTVVYNSLIDGLFKARKVNEGCQMF 357

Query: 1225 EKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGVTYSIVTLHLC 1404
            E+M+   GVRAS WTYNILIDGL++N R EAAYTLFCDLKKKG  FVDGVTYSIV L LC
Sbjct: 358  ERMIQY-GVRASTWTYNILIDGLFRNARAEAAYTLFCDLKKKGQ-FVDGVTYSIVVLQLC 415

Query: 1405 RENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHIREGNLVPSLI 1584
            RE  +EEAL L EEME RGF VDLVTI++L+I+LYK  RWDW +KL++ IR+GNL+PS++
Sbjct: 416  REGLLEEALGLAEEMEMRGFTVDLVTISTLIISLYKHSRWDWTDKLMKRIRDGNLLPSVL 475

Query: 1585 KWKSAMEAAMTKPQSKKRDFTPLFPS---LSDVVEILNLKKADKDGEDNTDK--FNDEK- 1746
            KWK  MEA +  PQ  K+D TPLFPS    SDV+ +++   +  DG   TD     D+K 
Sbjct: 476  KWKVDMEATLKSPQKNKKDHTPLFPSNGDFSDVLSLISSVASTMDGGFETDDAGVKDDKN 535

Query: 1747 -----DEWSSSPYMDKLAN----SGQPSLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFL 1899
                 D+WSSSP+MD+LAN    + Q S  +++S+G RV  KGDD+FDID++NT+LS+FL
Sbjct: 536  SSTPIDQWSSSPHMDQLANQITSTDQSSQQFSLSRGQRVQAKGDDTFDIDMVNTFLSLFL 595

Query: 1900 AKGKLSLACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDI 2079
            AKGKLS+ACKLFEIF++ G NP+SYTYNSI+SSF+KKGYF EAWGVL  MGE+ + P+DI
Sbjct: 596  AKGKLSMACKLFEIFSDTGANPVSYTYNSILSSFVKKGYFNEAWGVLSEMGEK-VCPTDI 654

Query: 2080 ATYNVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQ 2259
            ATYN++IQGLGKMGRADLA  VL+KL  +GGYLD+VMYNTLINALGK  R  E  +LF+Q
Sbjct: 655  ATYNMIIQGLGKMGRADLASSVLDKLMKQGGYLDVVMYNTLINALGKANRIDEVNKLFKQ 714

Query: 2260 MKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIER 2439
            MK SG+NPDV T+NTLIEVH+K G++K+AY FLKMMLD+GC+PNHVTDT LD+L KEIE+
Sbjct: 715  MKSSGINPDVVTFNTLIEVHSKAGRLKDAYKFLKMMLDSGCIPNHVTDTTLDFLGKEIEK 774

Query: 2440 RRYDMA 2457
             RY  A
Sbjct: 775  SRYQKA 780


>ref|XP_006491416.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Citrus sinensis]
          Length = 790

 Score =  858 bits (2216), Expect = 0.0
 Identities = 452/783 (57%), Positives = 582/783 (74%), Gaps = 23/783 (2%)
 Frame = +1

Query: 160  ELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXXGFFRWCS 339
            +LG +L++A + KTL K  G   L+    +IP                      FFRWCS
Sbjct: 18   QLGSILLLAFVTKTL-KESGTRNLDPR--SIPISEPLVLQVLGKNSLDSSKKLDFFRWCS 74

Query: 340  -LRQNYKHSARAYSQMFKVLC---FLTHQHHDDVLELLAAMRHDGLALDSSTLKMILDGF 507
             LR  YKH+A  YS +F+ +C   FL     ++V  LL +M+ D + +DS T K++L+  
Sbjct: 75   SLRPIYKHTACTYSHIFRTVCRAGFL-----EEVPSLLNSMQEDDVVVDSETFKLLLEPC 129

Query: 508  IKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLDSALF 687
            IK+GK D A+E+LDY E   +GTS  S +VY  VLV+LV K Q+ +A+S+  KLL++   
Sbjct: 130  IKSGKIDFAIEILDYMEE--LGTS-LSPNVYDSVLVSLVRKKQLGLAMSILFKLLEACND 186

Query: 688  AKNGENIV--IPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNICIHAL 861
                 ++V  +P  +ACNE+LV L+K+D R EF+Q++  L++ K +  D +GYNICIHA 
Sbjct: 187  NTADNSVVESLPGCVACNELLVALRKSDRRSEFKQVFERLKEQKEFEFDIYGYNICIHAF 246

Query: 862  GCWGDLSTALALFKEMKERNGPFEPDLCTYNSLINVLCLLGKVKDALIVWEELKASSGYE 1041
            GCWGDL T+L LFKEMKE+     PDL TYNSLI VLC++GKVKDALIVWEELK S G+E
Sbjct: 247  GCWGDLHTSLRLFKEMKEKG--LVPDLHTYNSLIQVLCVVGKVKDALIVWEELKGS-GHE 303

Query: 1042 PDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKSKKLMEACDL 1221
            P+EFT+RI+IQGC KSYRM+DA++IFSEMQYNG+   TVVYNSLL+ + KS+K+MEAC L
Sbjct: 304  PNEFTHRIIIQGCCKSYRMDDAMKIFSEMQYNGLIPDTVVYNSLLNRMFKSRKVMEACQL 363

Query: 1222 FEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGVTYSIVTLHL 1401
            FEKMV D GVR SCWT+NILIDGL++N R EAAYTLFCDLKKKG  FVDG+T+SIV L L
Sbjct: 364  FEKMVQD-GVRTSCWTHNILIDGLFRNGRAEAAYTLFCDLKKKGK-FVDGITFSIVVLQL 421

Query: 1402 CRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHIREGNLVPSL 1581
            CRE Q+EEAL+LVEEMEGRGFVVDLVTI+SL+I  +K GRWD+ E+L++HIR+GNLV  +
Sbjct: 422  CREGQIEEALRLVEEMEGRGFVVDLVTISSLLIGFHKYGRWDFTERLMKHIRDGNLVLDV 481

Query: 1582 IKWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEIL------------NLKKADKDGEDNT 1725
            +KWK+ +EA M   +SK++D+TP+FP   D+ EI+            NL   + D +D  
Sbjct: 482  LKWKADVEATMKSRKSKRKDYTPMFPYKGDLSEIMSLIGSTNLETDANLGSGEGDAKDEG 541

Query: 1726 DKFNDEKDEWSSSPYMDKLANSGQ----PSLTYAISKGVRVLGKGDDSFDIDIMNTYLSI 1893
             +  +  DEWSSSPYMDKLA+  +     S  +++++G+RV GKG  +FDID++NT+LSI
Sbjct: 542  SQLTN-SDEWSSSPYMDKLADQVKSDCHSSQLFSLARGLRVQGKGMGTFDIDMVNTFLSI 600

Query: 1894 FLAKGKLSLACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPS 2073
            FLAKGKL+LACKLFEIFT++GV+P++YTYNS+MSSF+KKGYF +AWGVL+ MG E+  P+
Sbjct: 601  FLAKGKLNLACKLFEIFTDMGVHPVNYTYNSMMSSFVKKGYFNQAWGVLNEMG-EKFCPT 659

Query: 2074 DIATYNVVIQGLGKMGRADLAKVVLEKL-KDEGGYLDIVMYNTLINALGKGGRFGEAVEL 2250
            DIATYNVVIQGLGKMGRADLA  +L+KL K  GGYLD+VMYNTLIN LGK GRF EA  L
Sbjct: 660  DIATYNVVIQGLGKMGRADLASTILDKLMKQGGGYLDVVMYNTLINVLGKAGRFDEANML 719

Query: 2251 FEQMKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKE 2430
            FEQM+ SG+NPDV T+NTLIEV+ K G++KEA+ FLKMMLD+GC PNHVTDT LD+L +E
Sbjct: 720  FEQMRTSGINPDVVTFNTLIEVNGKAGRLKEAHYFLKMMLDSGCTPNHVTDTTLDFLGRE 779

Query: 2431 IER 2439
            I+R
Sbjct: 780  IDR 782


>ref|XP_004140525.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Cucumis sativus] gi|449523383|ref|XP_004168703.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g01570-like [Cucumis sativus]
          Length = 803

 Score =  853 bits (2205), Expect = 0.0
 Identities = 442/800 (55%), Positives = 592/800 (74%), Gaps = 19/800 (2%)
 Frame = +1

Query: 115  FTTAAEKSAESAPISELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXX 294
            F +    S  ++ +S L  LL++A+I KTLS+  G   L+ +  ++P             
Sbjct: 10   FLSIESHSRTASTLSHLSHLLLLASITKTLSE-SGTRTLQHH--SLPISHPLLLQILHSR 66

Query: 295  XXXXXXXXGFFRWCSLRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALD 474
                     FF+WCSL  N+ HS   YSQ+F +LC   + H  +V  LL +M+ DG+++D
Sbjct: 67   SLNPSHKLDFFKWCSLAPNFNHSPSTYSQIFHILCRSGYLH--EVPPLLDSMKRDGVSVD 124

Query: 475  SSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALS 654
            S T K++LD FI++GKYD+ALE+LD+ E   +GTS    + Y+ VLVAL+ KNQ+ +ALS
Sbjct: 125  SHTFKVLLDAFIRSGKYDAALEILDHMED--LGTS-LELNTYNSVLVALLRKNQVGLALS 181

Query: 655  VFSKLLDSALFAKNGENI--------VIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKT 810
            +F KLLD      NG  +         +P+++ACNE+LV L+K DMR EF++++  LR  
Sbjct: 182  IFFKLLDGF---NNGGQVDSAATTFHFLPNSLACNELLVALRKLDMRVEFKKVFDKLRAI 238

Query: 811  KLYPMDRWGYNICIHALGCWGDLSTALALFKEMKERN---GPFEPDLCTYNSLINVLCLL 981
            + +    +GYNICI+A GCWG L TAL+LFKEMKE++     F PDLCTYNS+I+VLCL+
Sbjct: 239  ESFEFSVYGYNICIYAFGCWGYLDTALSLFKEMKEKSLVSESFSPDLCTYNSIIHVLCLV 298

Query: 982  GKVKDALIVWEELKASSGYEPDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVV 1161
            GKVKDALIVWEELK S G+EPD FTYRI+IQGC KS RM+DA  IF+EM+YNG+   T+V
Sbjct: 299  GKVKDALIVWEELKGS-GHEPDAFTYRIIIQGCCKSCRMDDATMIFNEMEYNGLIPDTIV 357

Query: 1162 YNSLLDGLMKSKKLMEACDLFEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDL 1341
            YNSLL+GL K++K+ EAC LF+KMV +D VRAS WTYNILIDGL++N R EA YTLFCDL
Sbjct: 358  YNSLLNGLFKARKVTEACQLFDKMVQED-VRASPWTYNILIDGLFRNGRAEAGYTLFCDL 416

Query: 1342 KKKGNNFVDGVTYSIVTLHLCRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGR 1521
            KKKG   VD VTYSI+ L LC+E  +EEALQLVEEME RGFVVDL+TITSL+IA++K+G+
Sbjct: 417  KKKGQ-IVDAVTYSIIILQLCKERLLEEALQLVEEMEARGFVVDLITITSLLIAMHKQGQ 475

Query: 1522 WDWIEKLLRHIREGNLVPSLIKWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLK-- 1695
            WD +E+L++HIREG+LVP+++KWK  ME ++   ++K++DF+ LF    D+ E+++ +  
Sbjct: 476  WDGLERLMKHIREGDLVPNVLKWKINMEYSIKYQKNKRKDFSSLFSPKEDLSEVISSRAS 535

Query: 1696 KADKDGEDNTDKFNDEKD--EWSSSPYMDKLANSGQPSLT----YAISKGVRVLGKGDDS 1857
             A K   DN+ +  +E+D   WSSSPY+++LAN    +      ++I +G R+  K D+S
Sbjct: 536  SAAKVNIDNSFENTEERDMDSWSSSPYVNRLANLANSTSDILQPFSIRQGRRIQEKQDNS 595

Query: 1858 FDIDIMNTYLSIFLAKGKLSLACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGV 2037
            FDI+++NT+LSIFLAKGKL+LACKLFEIF+++GVNP+ YTYNS++SSF+KKGYF +AWG+
Sbjct: 596  FDINMVNTFLSIFLAKGKLNLACKLFEIFSDMGVNPVKYTYNSMLSSFVKKGYFHQAWGI 655

Query: 2038 LHAMGEEEISPSDIATYNVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALG 2217
             + MGE  + P+DIATYNV+IQGLGKMGRADLA  VLEKL ++GGYLDIVMYNTLINALG
Sbjct: 656  FNEMGEN-VCPADIATYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALG 714

Query: 2218 KGGRFGEAVELFEQMKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHV 2397
            K GR  +  +LF QM+ SG+NPDV T+NTLIEVH+K G++K+AY FLKMMLD+GC PNHV
Sbjct: 715  KAGRMDDVNKLFGQMRNSGINPDVVTFNTLIEVHSKAGRLKDAYKFLKMMLDSGCSPNHV 774

Query: 2398 TDTCLDYLEKEIERRRYDMA 2457
            TDT LD+L +E+E+ RY+ A
Sbjct: 775  TDTTLDFLGREMEKARYEKA 794


>ref|XP_006289934.1| hypothetical protein CARUB_v10003556mg [Capsella rubella]
            gi|482558640|gb|EOA22832.1| hypothetical protein
            CARUB_v10003556mg [Capsella rubella]
          Length = 802

 Score =  852 bits (2201), Expect = 0.0
 Identities = 442/784 (56%), Positives = 578/784 (73%), Gaps = 11/784 (1%)
 Frame = +1

Query: 139  AESAPISELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXX 318
            A+++P  +L ++L+VA+++KTLS+  G  +L+ N  +IP                     
Sbjct: 19   AKNSPFPQLCNVLLVASLSKTLSQ-SGTRSLDAN--SIPISESVVLQILRRSSIDSSKKL 75

Query: 319  GFFRWC-SLRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMI 495
             FFRWC SLR  YKHSA AYSQ+F+ +C        +V +LL +M+ DG+ LD +  K++
Sbjct: 76   DFFRWCFSLRPGYKHSASAYSQIFRTVC--RTGLIGEVPDLLGSMKDDGVNLDQTMAKVL 133

Query: 496  LDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLD 675
            LD  I++GK+DSAL VLDY E   +G  C +  +Y  VLVALV KN++ +ALS+F KLL+
Sbjct: 134  LDSLIRSGKFDSALGVLDYMEE--LG-DCLNPGLYDSVLVALVKKNEMRLALSIFFKLLE 190

Query: 676  SALFAKNGENIVI----PDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYN 843
            ++    +G   VI    P  +A NE+LVGL++A MR EF++++  LR+ K +  D WGYN
Sbjct: 191  ASDNHSDGTGGVIVSYLPGTVAVNELLVGLRRAGMRSEFKRVFEKLREVKRFKFDTWGYN 250

Query: 844  ICIHALGCWGDLSTALALFKEMKERNG----PFEPDLCTYNSLINVLCLLGKVKDALIVW 1011
            ICIH  GCWGDL  AL+LFKEMK ++      F PD+CTYNSLI+VLCL GK KDALIVW
Sbjct: 251  ICIHGFGCWGDLDAALSLFKEMKVQSSVSGSSFGPDICTYNSLIHVLCLFGKAKDALIVW 310

Query: 1012 EELKASSGYEPDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMK 1191
            +ELK S G+EPD  TYRI+IQGC KSYRM+DA+RIF EMQYNG    T+VYN LLDG +K
Sbjct: 311  DELKVS-GHEPDNSTYRILIQGCCKSYRMDDAMRIFGEMQYNGFVPDTIVYNCLLDGTLK 369

Query: 1192 SKKLMEACDLFEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDG 1371
            ++K+ EAC LFEKMV + GVRASCWTYNILIDGL+++ R EA +TLFCDLKKKG  FVD 
Sbjct: 370  ARKVTEACQLFEKMVQE-GVRASCWTYNILIDGLFRSGRAEAGFTLFCDLKKKGQ-FVDA 427

Query: 1372 VTYSIVTLHLCRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRH 1551
            +T+SIV L LC+E  +E A++LVEEME RGF VDLVTI+SL+I  +K+GRWDW EKL++H
Sbjct: 428  ITFSIVVLQLCKEGDLEAAVKLVEEMETRGFTVDLVTISSLLIGFHKQGRWDWKEKLIKH 487

Query: 1552 IREGNLVPSLIKWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLKKADKDGEDNTDK 1731
            IREGNLV ++++W + +EA++ +PQ+K +D+T +FPS    ++I+N+  ++ DG  + + 
Sbjct: 488  IREGNLVSNVLRWNAGVEASLKRPQNKDKDYTSMFPSKGSFLDIMNMVSSEDDGARDEEV 547

Query: 1732 FNDEKDEWSSSPYMDKLAN-SGQPSLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLAKG 1908
               E D WSSSP MD+LA+ S +P+  + +++G RV  K  DSFD+D+MNT+LSI+L+KG
Sbjct: 548  SPMEDDPWSSSPCMDQLAHQSSRPNPLFGLARGQRVEAK-PDSFDVDMMNTFLSIYLSKG 606

Query: 1909 KLSLACKLFEIFTNLGVNPL-SYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDIAT 2085
             LSLACKLFEIF  +GV  L SYTYNS+MSSF+KKGYF+ A GVL  MGE     SDIAT
Sbjct: 607  DLSLACKLFEIFEGMGVTDLTSYTYNSMMSSFVKKGYFETARGVLDQMGEN-FCASDIAT 665

Query: 2086 YNVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQMK 2265
            YNV+I GLGKMGRADLA  VL++L  +GGYLDIVMYNTLIN+LGK  R  EA  LFE MK
Sbjct: 666  YNVIIHGLGKMGRADLASAVLDRLTKQGGYLDIVMYNTLINSLGKANRLDEATRLFEHMK 725

Query: 2266 RSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIERRR 2445
             +G+NPDV +YNT+IEV++K GK+KEAY +LKMMLDAGC+PNHVTDT LDYL KEIE+ R
Sbjct: 726  SNGINPDVVSYNTMIEVNSKAGKLKEAYKYLKMMLDAGCLPNHVTDTILDYLGKEIEKAR 785

Query: 2446 YDMA 2457
            ++ A
Sbjct: 786  FEKA 789


>ref|NP_192066.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75161629|sp|Q8VZE4.1|PP299_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g01570 gi|18086402|gb|AAL57659.1| AT4g01570/T15B16_21
            [Arabidopsis thaliana] gi|24797024|gb|AAN64524.1|
            At4g01570/T15B16_21 [Arabidopsis thaliana]
            gi|332656643|gb|AEE82043.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 805

 Score =  848 bits (2190), Expect = 0.0
 Identities = 441/786 (56%), Positives = 576/786 (73%), Gaps = 13/786 (1%)
 Frame = +1

Query: 139  AESAPISELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXX 318
            A+++P  +L ++L+VA+++KTLS+  G  +L+ N  +IP                     
Sbjct: 19   AKNSPFPQLCNVLLVASLSKTLSQ-SGTRSLDAN--SIPISEPVVLQILRRNSIDPSKKL 75

Query: 319  GFFRWC-SLRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMI 495
             FFRWC SLR  YKHSA AYSQ+F+ +C        +V +LL +M+ DG+ LD +  K++
Sbjct: 76   DFFRWCYSLRPGYKHSATAYSQIFRTVCRTGLL--GEVPDLLGSMKEDGVNLDQTMAKIL 133

Query: 496  LDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLL- 672
            LD  I++GK++SAL VLDY E   +G  C +  VY  VL+ALV K+++ +ALS+  KLL 
Sbjct: 134  LDSLIRSGKFESALGVLDYMEE--LG-DCLNPSVYDSVLIALVKKHELRLALSILFKLLE 190

Query: 673  --DSALFAKNGENIVI---PDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWG 837
              D+      G  I++   P  +A NE+LVGL++ADMR EF++++  L+  K +  D W 
Sbjct: 191  ASDNHSDDDTGRVIIVSYLPGTVAVNELLVGLRRADMRSEFKRVFEKLKGMKRFKFDTWS 250

Query: 838  YNICIHALGCWGDLSTALALFKEMKERNG----PFEPDLCTYNSLINVLCLLGKVKDALI 1005
            YNICIH  GCWGDL  AL+LFKEMKER+      F PD+CTYNSLI+VLCL GK KDALI
Sbjct: 251  YNICIHGFGCWGDLDAALSLFKEMKERSSVYGSSFGPDICTYNSLIHVLCLFGKAKDALI 310

Query: 1006 VWEELKASSGYEPDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGL 1185
            VW+ELK S G+EPD  TYRI+IQGC KSYRM+DA+RI+ EMQYNG    T+VYN LLDG 
Sbjct: 311  VWDELKVS-GHEPDNSTYRILIQGCCKSYRMDDAMRIYGEMQYNGFVPDTIVYNCLLDGT 369

Query: 1186 MKSKKLMEACDLFEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFV 1365
            +K++K+ EAC LFEKMV + GVRASCWTYNILIDGL++N R EA +TLFCDLKKKG  FV
Sbjct: 370  LKARKVTEACQLFEKMVQE-GVRASCWTYNILIDGLFRNGRAEAGFTLFCDLKKKGQ-FV 427

Query: 1366 DGVTYSIVTLHLCRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLL 1545
            D +T+SIV L LCRE ++E A++LVEEME RGF VDLVTI+SL+I  +K+GRWDW EKL+
Sbjct: 428  DAITFSIVGLQLCREGKLEGAVKLVEEMETRGFSVDLVTISSLLIGFHKQGRWDWKEKLM 487

Query: 1546 RHIREGNLVPSLIKWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLKKADKDGEDNT 1725
            +HIREGNLVP++++W + +EA++ +PQSK +D+TP+FPS    ++I+++  ++ DG    
Sbjct: 488  KHIREGNLVPNVLRWNAGVEASLKRPQSKDKDYTPMFPSKGSFLDIMSMVGSEDDGASAE 547

Query: 1726 DKFNDEKDEWSSSPYMDKLANS-GQPSLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLA 1902
            +    E D WSSSPYMD+LA+   QP   + +++G RV  K  DSFD+D+MNT+LSI+L+
Sbjct: 548  EVSPMEDDPWSSSPYMDQLAHQRNQPKPLFGLARGQRVEAK-PDSFDVDMMNTFLSIYLS 606

Query: 1903 KGKLSLACKLFEIFTNLGVNPL-SYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDI 2079
            KG LSLACKLFEIF  +GV  L SYTYNS+MSSF+KKGYF+ A GVL  M E     +DI
Sbjct: 607  KGDLSLACKLFEIFNGMGVTDLTSYTYNSMMSSFVKKGYFQTARGVLDQMFEN-FCAADI 665

Query: 2080 ATYNVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQ 2259
            ATYNV+IQGLGKMGRADLA  VL++L  +GGYLDIVMYNTLINALGK  R  EA +LF+ 
Sbjct: 666  ATYNVIIQGLGKMGRADLASAVLDRLTKQGGYLDIVMYNTLINALGKATRLDEATQLFDH 725

Query: 2260 MKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIER 2439
            MK +G+NPDV +YNT+IEV++K GK+KEAY +LK MLDAGC+PNHVTDT LDYL KE+E+
Sbjct: 726  MKSNGINPDVVSYNTMIEVNSKAGKLKEAYKYLKAMLDAGCLPNHVTDTILDYLGKEMEK 785

Query: 2440 RRYDMA 2457
             R+  A
Sbjct: 786  ARFKKA 791


>ref|XP_006396354.1| hypothetical protein EUTSA_v10028437mg [Eutrema salsugineum]
            gi|557097371|gb|ESQ37807.1| hypothetical protein
            EUTSA_v10028437mg [Eutrema salsugineum]
          Length = 801

 Score =  846 bits (2186), Expect = 0.0
 Identities = 443/783 (56%), Positives = 571/783 (72%), Gaps = 10/783 (1%)
 Frame = +1

Query: 139  AESAPISELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXX 318
            A+  P  +L ++LVVA+++KTLS  G       + ++ P                     
Sbjct: 19   AKIPPFPQLCNVLVVASLSKTLSHSG---TRNLDANSTPISEPIVLQILRRNSLDPSKKL 75

Query: 319  GFFRWC-SLRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMI 495
             FFRWC SLR  YKHSA AYSQ+F+ +C        ++  LL +M+ DG+ LD +T K++
Sbjct: 76   DFFRWCFSLRPGYKHSASAYSQIFRTVCRTGLL--GEIPNLLGSMKEDGVNLDQTTSKLL 133

Query: 496  LDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLD 675
            LD  I++GKYDSAL VLDY E EL G  C +  +Y  VL+ALV KN++ +ALS+F KLL+
Sbjct: 134  LDSLIRSGKYDSALGVLDYME-ELGG--CLNPRLYDSVLIALVKKNELRLALSIFFKLLE 190

Query: 676  SALFAKNGENIVI---PDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNI 846
            ++        + +   P  +A NE+LVGL+KA+M+ EF+ ++  L+  + +  D WGYNI
Sbjct: 191  ASDNPSETGGVSVSYLPGTVAVNELLVGLRKANMKLEFKGVFDKLKGMERFKFDTWGYNI 250

Query: 847  CIHALGCWGDLSTALALFKEMKERNGPFE----PDLCTYNSLINVLCLLGKVKDALIVWE 1014
            CIH  GCWGDL  AL+LFKEMKE++        PD+CTYNSLI+VLCL+GK KDALIVW+
Sbjct: 251  CIHGFGCWGDLDAALSLFKEMKEQSSISGSCAGPDICTYNSLIHVLCLVGKAKDALIVWD 310

Query: 1015 ELKASSGYEPDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKS 1194
            ELK S G+EPD  TYRI+IQGC KSY M+DA+RIF EMQYNG    TV+YNSLLDG +K+
Sbjct: 311  ELKVS-GHEPDNSTYRILIQGCCKSYLMDDAMRIFGEMQYNGFVPDTVLYNSLLDGTLKA 369

Query: 1195 KKLMEACDLFEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGV 1374
            +K++EAC LFEKMV + GVRASCWT NILIDGL++N R EA +TLFCDLKKKG  FVD +
Sbjct: 370  RKVVEACQLFEKMVQE-GVRASCWTNNILIDGLFRNGRAEAGFTLFCDLKKKGQ-FVDAI 427

Query: 1375 TYSIVTLHLCRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHI 1554
            T+SIV L LCRE ++E A++LVEEME RGF VDLVTI+SL+I  +K+GRWDW EKL++H+
Sbjct: 428  TFSIVVLQLCREGKLEGAVKLVEEMETRGFSVDLVTISSLLIGFHKQGRWDWKEKLMKHV 487

Query: 1555 REGNLVPSLIKWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLKKADKDGEDNTDKF 1734
            R GNLVP++++W + +EA++ +PQSK +D+TP+FPS    V+I++L  +  DG    +  
Sbjct: 488  RGGNLVPNVLRWNAGVEASLKRPQSKDKDYTPMFPSKGSFVDIMSLVGSKDDGAKAEELT 547

Query: 1735 NDEKDEWSSSPYMDKLAN-SGQPSLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLAKGK 1911
              E D WSSSPYMD+LA+ S QP   +A+++G RV  K  DSFD+D+MNT+LSI+L+KG 
Sbjct: 548  PVEDDPWSSSPYMDQLAHQSNQPKPLFALARGQRVEAK-PDSFDVDMMNTFLSIYLSKGD 606

Query: 1912 LSLACKLFEIFTNLGVNPL-SYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDIATY 2088
            LSLACKLFEIF  +GV  L SYTYNS+MSSF+KKGYFK A GVL  MGE     +DIATY
Sbjct: 607  LSLACKLFEIFNEMGVTDLTSYTYNSMMSSFVKKGYFKTARGVLDQMGEN-FCAADIATY 665

Query: 2089 NVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQMKR 2268
            NV+IQGLGKMGRADLA  VL++L ++GGYLDIVMYNTLINALGK  R  EA  LFE MK 
Sbjct: 666  NVIIQGLGKMGRADLASAVLDRLTEQGGYLDIVMYNTLINALGKANRLDEATRLFEHMKS 725

Query: 2269 SGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIERRRY 2448
            SG+NPDV +YNT+IEV++K GK+KEAY +LK MLDA C+PNHVTDT LDYL KE+E+ R+
Sbjct: 726  SGINPDVVSYNTMIEVNSKAGKLKEAYKYLKAMLDANCLPNHVTDTILDYLGKEMEKARF 785

Query: 2449 DMA 2457
              A
Sbjct: 786  KKA 788


>ref|XP_002874971.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297320808|gb|EFH51230.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 802

 Score =  845 bits (2184), Expect = 0.0
 Identities = 437/786 (55%), Positives = 572/786 (72%), Gaps = 13/786 (1%)
 Frame = +1

Query: 139  AESAPISELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXX 318
            A ++P  +L ++L+VA+++KTLS+  G   L+ N  +IP                     
Sbjct: 19   ATNSPFPQLCNVLLVASLSKTLSQ-SGTRGLDAN--SIPISEPVVLQILRRNSIDPSKKL 75

Query: 319  GFFRWC-SLRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMI 495
             FFRWC SLR  YKHS  AYSQ+F+ +C        +V +LL +M+ DG+ LD +  K++
Sbjct: 76   DFFRWCYSLRTGYKHSVSAYSQIFRTVCRTGLL--GEVPDLLCSMKEDGVNLDQTMAKIL 133

Query: 496  LDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLD 675
            LD  I++GK++SAL VLDY E   +G  C +  +Y  VL+AL  KN++ +ALS+F KLL+
Sbjct: 134  LDSLIRSGKFESALGVLDYMEE--LG-DCLNPSLYDSVLIALAKKNELRLALSIFFKLLE 190

Query: 676  SALFAKNGENI------VIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWG 837
            ++    +G++        +P  +A NE+LVGL++ADMR EF+ ++  L+    +  D W 
Sbjct: 191  AS--DNHGDDTSGVTVSYLPGRVAVNELLVGLRRADMRSEFKTVFEKLKGMNRFKFDTWS 248

Query: 838  YNICIHALGCWGDLSTALALFKEMKERNG----PFEPDLCTYNSLINVLCLLGKVKDALI 1005
            YNICIH  GCWGDL  AL+LFKEMKER+      F PD+CTYNSLI+VLCL GK KDALI
Sbjct: 249  YNICIHGFGCWGDLDAALSLFKEMKERSSVSGSSFAPDICTYNSLIHVLCLFGKAKDALI 308

Query: 1006 VWEELKASSGYEPDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGL 1185
            VW+ELK S G+EPD  TYRI+IQGC KSYRM+DA+RIF EMQYNG    TVVYN LLDG 
Sbjct: 309  VWDELKVS-GHEPDNSTYRILIQGCCKSYRMDDAMRIFGEMQYNGFVPDTVVYNCLLDGT 367

Query: 1186 MKSKKLMEACDLFEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFV 1365
            +K++K+ EAC LFEKMV + GVRASCWTYNILIDGL++N R EA +TLFCDLKKKG  FV
Sbjct: 368  LKARKVTEACQLFEKMVQE-GVRASCWTYNILIDGLFRNGRAEAGFTLFCDLKKKGQ-FV 425

Query: 1366 DGVTYSIVTLHLCRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLL 1545
            D +T+SIV L LCRE ++EEA++LVEEME RGF VDLVTI+SL+I  +K+GRWDW EKL+
Sbjct: 426  DAITFSIVVLQLCREGKLEEAVKLVEEMETRGFTVDLVTISSLLIGFHKQGRWDWKEKLM 485

Query: 1546 RHIREGNLVPSLIKWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLKKADKDGEDNT 1725
            +H+REGNLVP++++W + +EA++ +PQ K +D+TP+FPS    ++I+++   + DG    
Sbjct: 486  KHVREGNLVPNVLRWNAGVEASLKRPQRKDKDYTPMFPSKGSFLDIMSMVGLEDDGARAE 545

Query: 1726 DKFNDEKDEWSSSPYMDKLAN-SGQPSLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLA 1902
            +    E D WSSSPYMD+LA+ S +P   + +++G RV  K  DSFD+D+MNT+LSI+L+
Sbjct: 546  EVPPMEDDPWSSSPYMDQLAHQSNRPKPLFGLARGQRVEAK-PDSFDVDMMNTFLSIYLS 604

Query: 1903 KGKLSLACKLFEIFTNLGVNPL-SYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDI 2079
            KG LSLACKLFEIF  +GV  L SYTYNS+MSSF+KKGYFK   GVL  MGE     +DI
Sbjct: 605  KGDLSLACKLFEIFNGMGVTDLTSYTYNSMMSSFVKKGYFKTVRGVLDQMGEN-FCAADI 663

Query: 2080 ATYNVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQ 2259
            ATYNV+IQGLGKMGRADLA  VL++L  +GGYLDIVMYNTLINA+GK  R   A +LF+ 
Sbjct: 664  ATYNVIIQGLGKMGRADLAGAVLDRLTKQGGYLDIVMYNTLINAIGKANRLDAATQLFDH 723

Query: 2260 MKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIER 2439
            MK +G+NPDV +YNT+IEV++K GK+KEAY +LK MLDAGC+PNHVTDT LDYL KE+E+
Sbjct: 724  MKSNGINPDVVSYNTMIEVNSKAGKLKEAYKYLKAMLDAGCLPNHVTDTILDYLGKEMEK 783

Query: 2440 RRYDMA 2457
             R+  A
Sbjct: 784  ARFKKA 789


>ref|XP_003539071.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Glycine max]
          Length = 768

 Score =  816 bits (2108), Expect = 0.0
 Identities = 430/774 (55%), Positives = 548/774 (70%), Gaps = 8/774 (1%)
 Frame = +1

Query: 160  ELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXXGFFRWCS 339
            +LG++LV A+I  TLS             A+                       FF W  
Sbjct: 6    QLGEVLVAASITNTLSHSHSATINLPPNLALGLTQPLILKILSNPAHHASHKLRFFEWS- 64

Query: 340  LRQNYKHSARAYSQMFKVLCFLTHQHHDDVLELLAAMRHDGLALDSSTLKMILDGFIKAG 519
             R ++  S  AYS + + L       + D+  LL +M   G+ LD  +L  +L  FI + 
Sbjct: 65   -RSHHCPSPAAYSVILRTLS--REGFYSDIPSLLHSMTQAGVVLDPHSLNHLLRSFIISS 121

Query: 520  KYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLDSALFAKNG 699
             ++ AL++LDY +   +  S     +Y+ +LVAL+ KNQ+++ALS+F KLL     A + 
Sbjct: 122  NFNLALQLLDYVQHLHLDPS----PIYNSLLVALLEKNQLTLALSIFFKLLG----AVDS 173

Query: 700  ENIVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNICIHALGCWGDL 879
            ++I      ACN++LV L+KADMR EF Q++  LR+ + +  D WGYN+CIHA GCWGDL
Sbjct: 174  KSIT-----ACNQLLVALRKADMRVEFEQVFQRLREKRGFSFDTWGYNVCIHAFGCWGDL 228

Query: 880  STALALFKEMKERNGPF-EPDLCTYNSLINVLCLLGKVKDALIVWEELKASSGYEPDEFT 1056
            +T  ALFKEMK  N  F  PDLCTYNSLI  LC LGKV DA+ V+EEL  S+ ++PD FT
Sbjct: 229  ATCFALFKEMKGGNKGFVAPDLCTYNSLITALCRLGKVDDAITVYEELNGSA-HQPDRFT 287

Query: 1057 YRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKSKKLMEACDLFEKMV 1236
            Y  +IQ C+K+YRM DA+RIF++MQ NG R  T+ YNSLLDG  K+ K+MEAC LFEKMV
Sbjct: 288  YTNLIQACSKTYRMEDAIRIFNQMQSNGFRPDTLAYNSLLDGHFKATKVMEACQLFEKMV 347

Query: 1237 DDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGVTYSIVTLHLCRENQ 1416
             + GVR SCWTYNILI GL++N R EAAYT+FCDLKKKG  FVDG+TYSIV L LC+E Q
Sbjct: 348  QE-GVRPSCWTYNILIHGLFRNGRAEAAYTMFCDLKKKGQ-FVDGITYSIVVLQLCKEGQ 405

Query: 1417 VEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHIREGNLVPSLIKWKS 1596
            +EEALQLVEEME RGFVVDLVTITSL+I++++ GRWDW ++L++HIREG+L  S++KWK+
Sbjct: 406  LEEALQLVEEMESRGFVVDLVTITSLLISIHRHGRWDWTDRLMKHIREGDLALSVLKWKA 465

Query: 1597 AMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLKKADKDG---EDNTDKFNDEKDEWSSSP 1767
             MEA+M  P  KK+D++PLFPS  D ++I+N     +D     D  +   +E DEWSSSP
Sbjct: 466  GMEASMKNPPGKKKDYSPLFPSKGDFIDIINFMTCAQDTTNINDGEENSCNEIDEWSSSP 525

Query: 1768 YMDKLAN----SGQPSLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLAKGKLSLACKLF 1935
            +MDKLAN    +G  S  +  S+G RV  KG DSFD+D++NT+LSIFLAKGKLSLACKLF
Sbjct: 526  HMDKLANQVSSTGYSSQMFTPSRGQRVQEKGPDSFDVDMVNTFLSIFLAKGKLSLACKLF 585

Query: 1936 EIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDIATYNVVIQGLGK 2115
            EIF++ GV+P+SYTYNSIMSSF+KKGYF EAW +L  MGE+   P+DIATYN++IQGLGK
Sbjct: 586  EIFSDAGVDPVSYTYNSIMSSFVKKGYFAEAWAILTEMGEK-FCPTDIATYNMIIQGLGK 644

Query: 2116 MGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQMKRSGMNPDVFT 2295
            MGRADLA  VL++L  +GGYLDIVMYNTLINALGK  R  E  +LFEQM+ SG+NPDV T
Sbjct: 645  MGRADLASAVLDRLLRQGGYLDIVMYNTLINALGKASRIDEVNKLFEQMRSSGINPDVVT 704

Query: 2296 YNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIERRRYDMA 2457
            YNTLIEVH+K G++K+AY FLKMMLDAGC PNHVTDT LDYL +EI++ RY  A
Sbjct: 705  YNTLIEVHSKAGRLKDAYKFLKMMLDAGCSPNHVTDTTLDYLGREIDKLRYQRA 758


>ref|XP_003621545.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|87241489|gb|ABD33347.1| Pentatricopeptide repeat
            [Medicago truncatula] gi|355496560|gb|AES77763.1|
            Pentatricopeptide repeat-containing protein [Medicago
            truncatula]
          Length = 791

 Score =  777 bits (2006), Expect = 0.0
 Identities = 403/796 (50%), Positives = 545/796 (68%), Gaps = 23/796 (2%)
 Frame = +1

Query: 139  AESAPISELGDLLVVAAIAKTLSKPGGIHALEKNGDAIPXXXXXXXXXXXXXXXXXXXXX 318
            + S+   ++ +LL VA+I KTLSK              P                     
Sbjct: 7    SSSSTWKQVSELLTVASITKTLSK--------NPTQTPPQTNLTQTLIHKILSNPSLHIS 58

Query: 319  GFFRWCSLRQNYKHSARAYSQMFKVLC------FLTHQHHDDVLELLAAMRHDGLALDSS 480
                + +   N  HS+ +YS +F  LC       L HQH   +  LL +M+ +G+  DS+
Sbjct: 59   HKLNFFNSNNNIHHSSLSYSLIFNNLCNPKTPFSLLHQH---LPHLLHSMKQNGIVFDSN 115

Query: 481  TLKMILDGFIKAG--------KYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 636
            +   +L+  IK G         +   +++LDY + + +     +  +Y+ +L+A +  NQ
Sbjct: 116  SFNTLLNFLIKFGVSHNNNSKNFHFVIDILDYIQTQNLHPVDTTPFIYNSLLIASIKNNQ 175

Query: 637  ISIALSVFSKLL----DSALFAKNGENIVIPDAIACNEVLVGLKKADMRDEFRQLYSNLR 804
            I +ALS+F+ ++    D  L   N +++++  +   N +L  L+KA M+ EF  +++ LR
Sbjct: 176  IPLALSIFNNIMTLGDDDCL---NLDSVIVGSS---NYLLSVLRKARMKKEFENVFNRLR 229

Query: 805  KTKLYPMDRWGYNICIHALGCWGDLSTALALFKEMKERNGPFEPDLCTYNSLINVLCLLG 984
            + K +  D WGYNICIHA G WGDL T++ LF EMKE    F PD+CTYNS+++VLC +G
Sbjct: 230  ERKSFDFDLWGYNICIHAFGSWGDLVTSMKLFNEMKEDKNLFGPDMCTYNSVLSVLCKVG 289

Query: 985  KVKDALIVWEELKASSGYEPDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVY 1164
            K+ DALIVW+ELK   GYEPDEFTY I+++GC ++YRM+ ALRIF+EM+ NG R G +VY
Sbjct: 290  KINDALIVWDELKGC-GYEPDEFTYTILVRGCCRTYRMDVALRIFNEMKDNGFRPGVLVY 348

Query: 1165 NSLLDGLMKSKKLMEACDLFEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLK 1344
            N +LDGL K+ K+ E C +FEKM  + GV+ASC TYNILI GL KN R EA Y LFCDLK
Sbjct: 349  NCVLDGLFKAAKVNEGCQMFEKMAQE-GVKASCSTYNILIHGLIKNGRSEAGYMLFCDLK 407

Query: 1345 KKGNNFVDGVTYSIVTLHLCRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRW 1524
            KKG  FVDG+TYSIV L LC+E  +EEAL+LVEEME RGF VDLVTITSL+I ++K GRW
Sbjct: 408  KKGQ-FVDGITYSIVVLQLCKEGLLEEALELVEEMEARGFSVDLVTITSLLIGIHKYGRW 466

Query: 1525 DWIEKLLRHIREGNLVPSLIKWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLKKAD 1704
            +W ++L++H+REG+L+P +++WK+ MEA++    SK++D++ +FPS     EI++     
Sbjct: 467  EWTDRLIKHVREGDLLPGVLRWKAGMEASINNFHSKEKDYSSMFPSKGGFCEIMSFITRS 526

Query: 1705 KDGEDNTDKFNDEKDEWSSSPYMDKLA-----NSGQPSLTYAISKGVRVLGKGDDSFDID 1869
            +D +D  +  +++ DEWSSSP+MDKLA     ++G  S  +   +G RV  KG DSFDID
Sbjct: 527  RDEDDEVETSSEQIDEWSSSPHMDKLAKRVVNSTGNASRMFTPDRGQRVQQKGSDSFDID 586

Query: 1870 IMNTYLSIFLAKGKLSLACKLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAM 2049
            ++NT+LSIFL+KGKLSLACKLFEIFT+ GV+P+SYTYNSIMSSF+KKGYF EAW +L  M
Sbjct: 587  MVNTFLSIFLSKGKLSLACKLFEIFTDAGVDPVSYTYNSIMSSFVKKGYFNEAWAILSEM 646

Query: 2050 GEEEISPSDIATYNVVIQGLGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGR 2229
            GE+ + P+DIATYN++IQGLGKMGRADLA  VL+ L  +GGYLDIVMYNTLINALGK GR
Sbjct: 647  GEK-LCPTDIATYNMIIQGLGKMGRADLASAVLDGLLKQGGYLDIVMYNTLINALGKAGR 705

Query: 2230 FGEAVELFEQMKRSGMNPDVFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTC 2409
              E  + FEQMK SG+NPDV TYNTLIE+H+K G++K+AY FLKMM+DAGC PNHVTDT 
Sbjct: 706  IDEVNKFFEQMKSSGINPDVVTYNTLIEIHSKAGRLKDAYKFLKMMIDAGCTPNHVTDTT 765

Query: 2410 LDYLEKEIERRRYDMA 2457
            LDYL +EI++ RY  A
Sbjct: 766  LDYLVREIDKLRYQKA 781


>ref|XP_004491942.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Cicer arietinum]
          Length = 793

 Score =  775 bits (2002), Expect = 0.0
 Identities = 390/717 (54%), Positives = 525/717 (73%), Gaps = 17/717 (2%)
 Frame = +1

Query: 358  HSARAYSQMFKVLC------FLTHQHHDDVLELLAAMRHDGLALDSSTLKMILDGFI--- 510
            H++  YS +FK LC       L HQH   + +LL +M+ + +  DS + K +L+  I   
Sbjct: 79   HNSITYSLIFKTLCNPTTPISLLHQH---LPQLLHSMKQNDVVFDSYSFKNLLNFLINLS 135

Query: 511  ---KAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQISIALSVFSKLLDSA 681
               K       +++LDY + + +  S  +  +Y+ +L+A +  NQ+++ALS+F  ++ S 
Sbjct: 136  HNNKKNNLHFVIDILDYIQSQNLQPSGTTPFIYNSLLIASIKNNQLNLALSIFKNVI-SI 194

Query: 682  LFAKNGENIVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLYPMDRWGYNICIHAL 861
              + N +++++  +   N +L  L+KA M+ EF  +++ LR+ K +  D WGYNICIHA 
Sbjct: 195  DDSSNFDHVIVGSS---NYLLSALRKAQMKKEFINVFNTLRERKSFDFDLWGYNICIHAF 251

Query: 862  GCWGDLSTALALFKEMKERNGPFEPDLCTYNSLINVLCLLGKVKDALIVWEELKASSGYE 1041
            G WGDL T++ LF EMKE    F PD+CTYNS++++LC +GKV DAL+VWEELK   GYE
Sbjct: 252  GSWGDLVTSMMLFNEMKEDKNLFGPDMCTYNSVLSILCKVGKVNDALVVWEELKGC-GYE 310

Query: 1042 PDEFTYRIMIQGCAKSYRMNDALRIFSEMQYNGIRAGTVVYNSLLDGLMKSKKLMEACDL 1221
            PDEFTY I+++G +++ RM++A+RIF+EM+ NG R G +VYN +LDGL K+ K+ EAC +
Sbjct: 311  PDEFTYTILVRGFSRTCRMDEAIRIFNEMKDNGFRPGILVYNCVLDGLFKAAKVNEACQM 370

Query: 1222 FEKMVDDDGVRASCWTYNILIDGLYKNRREEAAYTLFCDLKKKGNNFVDGVTYSIVTLHL 1401
            FEKM  + GV+ASCWTYNILI GL KN R EA YTLFCDLKKKG  FVD +TYSIV L L
Sbjct: 371  FEKMAQE-GVKASCWTYNILIHGLIKNGRSEAGYTLFCDLKKKGQ-FVDEITYSIVVLQL 428

Query: 1402 CRENQVEEALQLVEEMEGRGFVVDLVTITSLVIALYKRGRWDWIEKLLRHIREGNLVPSL 1581
            C+E Q+EEAL+LVEEME RGF VDLVTITSL+I ++K GRWDW ++L++H+REG+L+P +
Sbjct: 429  CKEGQLEEALELVEEMEARGFSVDLVTITSLLIGIHKYGRWDWTDRLIKHVREGDLLPGV 488

Query: 1582 IKWKSAMEAAMTKPQSKKRDFTPLFPSLSDVVEILNLKKADKDGEDNTDKFNDEKDEWSS 1761
            ++WK+ MEA++    S K+D++P+F S  D  EI++     +D ED  +  +++ DEWSS
Sbjct: 489  LRWKAGMEASINNLPSGKKDYSPMFSSKGDFSEIMSFITRARD-EDEVETLSEQIDEWSS 547

Query: 1762 SPYMDKLA-----NSGQPSLTYAISKGVRVLGKGDDSFDIDIMNTYLSIFLAKGKLSLAC 1926
            SP+MDKLA     ++G  S  +   +G RV  KG DSFD+D++NT+LSIFLAKGKLSLAC
Sbjct: 548  SPHMDKLAKHVVRSTGNASRLFTPDRGQRVQQKGPDSFDVDMVNTFLSIFLAKGKLSLAC 607

Query: 1927 KLFEIFTNLGVNPLSYTYNSIMSSFIKKGYFKEAWGVLHAMGEEEISPSDIATYNVVIQG 2106
            KLFEIFT+ GV+P+SYTYNSIMSSF+KKGYF EAW +L  MGE+   P+DIATYN++IQG
Sbjct: 608  KLFEIFTDAGVDPVSYTYNSIMSSFVKKGYFNEAWAILTEMGEK-FCPTDIATYNMIIQG 666

Query: 2107 LGKMGRADLAKVVLEKLKDEGGYLDIVMYNTLINALGKGGRFGEAVELFEQMKRSGMNPD 2286
            LGKMGRADLA  VL+ L  +GGYLDIVMYNTLINALGK GR  E  + F+QM+ SG++PD
Sbjct: 667  LGKMGRADLASAVLDGLLKQGGYLDIVMYNTLINALGKAGRIDEVSKFFDQMRNSGISPD 726

Query: 2287 VFTYNTLIEVHAKGGKVKEAYGFLKMMLDAGCVPNHVTDTCLDYLEKEIERRRYDMA 2457
            V TYNTLIE+H+K G+VK+AY FLKMMLDAGC PNHVTDT LDYL +EI++ RY  A
Sbjct: 727  VVTYNTLIEIHSKAGRVKDAYKFLKMMLDAGCTPNHVTDTTLDYLVREIDKLRYQKA 783

