BLASTX nr result

ID: Catharanthus22_contig00018004 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00018004
         (2679 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006353247.1| PREDICTED: pentatricopeptide repeat-containi...  1067   0.0  
ref|XP_004250507.1| PREDICTED: pentatricopeptide repeat-containi...  1062   0.0  
ref|XP_002272556.1| PREDICTED: pentatricopeptide repeat-containi...   963   0.0  
gb|EOX95524.1| Pentatricopeptide repeat-containing protein, puta...   960   0.0  
ref|XP_002302689.2| hypothetical protein POPTR_0002s18390g [Popu...   946   0.0  
ref|XP_006386676.1| pentatricopeptide repeat-containing family p...   939   0.0  
ref|XP_004308750.1| PREDICTED: pentatricopeptide repeat-containi...   931   0.0  
ref|XP_006491416.1| PREDICTED: pentatricopeptide repeat-containi...   925   0.0  
ref|XP_002515124.1| pentatricopeptide repeat-containing protein,...   923   0.0  
ref|NP_192066.2| pentatricopeptide repeat-containing protein [Ar...   907   0.0  
ref|XP_006396354.1| hypothetical protein EUTSA_v10028437mg [Eutr...   904   0.0  
ref|XP_002874971.1| pentatricopeptide repeat-containing protein ...   902   0.0  
gb|EXC13626.1| hypothetical protein L484_019583 [Morus notabilis]     901   0.0  
ref|XP_006289934.1| hypothetical protein CARUB_v10003556mg [Caps...   901   0.0  
ref|XP_004140525.1| PREDICTED: pentatricopeptide repeat-containi...   894   0.0  
ref|XP_003539071.1| PREDICTED: pentatricopeptide repeat-containi...   845   0.0  
gb|EPS65453.1| hypothetical protein M569_09325, partial [Genlise...   820   0.0  
ref|XP_003621545.1| Pentatricopeptide repeat-containing protein ...   794   0.0  
ref|XP_004491942.1| PREDICTED: pentatricopeptide repeat-containi...   793   0.0  
ref|XP_006827884.1| hypothetical protein AMTR_s00008p00117710 [A...   746   0.0  

>ref|XP_006353247.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            isoform X1 [Solanum tuberosum]
          Length = 816

 Score = 1067 bits (2760), Expect = 0.0
 Identities = 533/783 (68%), Positives = 636/783 (81%), Gaps = 4/783 (0%)
 Frame = -2

Query: 2372 SEVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC 2193
            S+VGN+L+VASIAK+L +PGG RNLE+   SI LSE LVLQVL R +LDA +KLDFF+WC
Sbjct: 35   SKVGNLLVVASIAKALIKPGGTRNLEQYGDSIPLSESLVLQVLRRNNLDAEKKLDFFKWC 94

Query: 2192 TLKPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSG 2013
            +L+P++ HST TYSQ+F +IC      + I  LLNS+  D + L++ATFKL+LD+F  +G
Sbjct: 95   SLRPSFKHSTETYSQMFKSICYSHNHREAIFVLLNSMKDDKVLLNAATFKLLLDSFTRTG 154

Query: 2012 KFVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKG 1833
             F SALEIL+ +E DL   SCL PD+Y++VLIAL++KNQ+ +ALSIFLK L+ +  D   
Sbjct: 155  NFDSALEILEFVEGDLDNSSCLSPDVYNSVLIALVQKNQVNLALSIFLKLLETN--DGNS 212

Query: 1832 SGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLST 1653
             G   A+ACNELLVGL++ NM  EF  VF KLR    FP DRWGYNICIH FGC GDLS+
Sbjct: 213  IGVSSAVACNELLVGLKRGNMRAEFKQVFDKLRGGNVFPFDRWGYNICIHTFGCWGDLSS 272

Query: 1652 SLTLFKEMKERGDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLFTYRV 1473
            SL+LFKEMKERG  FSPDLCTYNSLI VLCL GKV DA  VWEELKGSSG EPD +TYR+
Sbjct: 273  SLSLFKEMKERGSWFSPDLCTYNSLIHVLCLLGKVKDAFVVWEELKGSSGLEPDAYTYRI 332

Query: 1472 LVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKMAEED 1293
            ++QGCSKAY I DA+ +F++MQ +G+RPDT +YN+LL+GL+KA+KLT+ACNLF+KM E+D
Sbjct: 333  VIQGCSKAYLINDAIKVFTEMQYNGIRPDTIVYNTLLDGLLKARKLTDACNLFQKMIEDD 392

Query: 1292 GVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREGQXXX 1113
            GVRAS WTYNILIDGLF+NGRALAAYTLF DLKKK +NFVDG+T+SIV LHLCRE +   
Sbjct: 393  GVRASCWTYNILIDGLFKNGRALAAYTLFCDLKKKSNNFVDGVTYSIVILHLCREDRLDE 452

Query: 1112 XXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWKATME 933
                     ARGF VDLVTI+SLLI+ Y+ G  D TERLMK+IRD NLVP ++ WK +ME
Sbjct: 453  ALKLVEEMEARGFTVDLVTITSLLIAIYKEGHWDYTERLMKHIRDSNLVPIIIRWKDSME 512

Query: 932  DSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNIDPWSASP 753
             +MKA QS+EKD TP+FPS  +F DIL + NL + +TD+ LG ED E      DPWS+SP
Sbjct: 513  ATMKAPQSREKDFTPIFPSNRNFGDILGLENLTDAETDTALGAEDAEIHYQESDPWSSSP 572

Query: 752  YLDLLANQLSPRS----LFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLACKLF 585
            Y+D+LAN++S +S     FSL+ G+RI  K  DSFDIDMVNT+LSIFLAKGKLS+ACKLF
Sbjct: 573  YMDMLANKVSSQSNSSRTFSLTGGKRIDTKSADSFDIDMVNTFLSIFLAKGKLSMACKLF 632

Query: 584  EIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVIIQGLGKM 405
            EIFT+MG DPVSYT+NSMMSSFVKKGY NEAWG+LQ MGE++CP+D+ATYNVIIQGLGKM
Sbjct: 633  EIFTDMGADPVSYTYNSMMSSFVKKGYFNEAWGILQEMGEKVCPSDVATYNVIIQGLGKM 692

Query: 404  GRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGINPDVVTY 225
            GRADLA AVLDKLMKQGGYLDIVMYNTLINALGKAGRIEE N LFQQM+ SGINPDVVTY
Sbjct: 693  GRADLADAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEVNKLFQQMKNSGINPDVVTY 752

Query: 224  NTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKATMKPINA 45
            NTLIE+H KAG+LK +YKFL+MML+AGCAPN VTDTTLDFLEKEIEK RYQKA+MK  N 
Sbjct: 753  NTLIEVHAKAGQLKQSYKFLRMMLEAGCAPNQVTDTTLDFLEKEIEKLRYQKASMKRPNV 812

Query: 44   EDP 36
            ++P
Sbjct: 813  DNP 815


>ref|XP_004250507.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Solanum lycopersicum]
          Length = 819

 Score = 1062 bits (2746), Expect = 0.0
 Identities = 533/783 (68%), Positives = 631/783 (80%), Gaps = 4/783 (0%)
 Frame = -2

Query: 2372 SEVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC 2193
            S+VGN+++VASIAK+L + GG RNLEK    I LSE LVLQVL R +LDA +KLDFF+WC
Sbjct: 38   SKVGNLIVVASIAKALIKRGGTRNLEKYGDLIPLSESLVLQVLRRNNLDAEKKLDFFKWC 97

Query: 2192 TLKPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSG 2013
            +L+PN+ HST TYSQ+F  IC      +++  LLNS+  D + L+SATFKL+LD+F  +G
Sbjct: 98   SLRPNFKHSTETYSQMFKCICYSRNHREDVFVLLNSMKDDEVLLNSATFKLLLDSFTRTG 157

Query: 2012 KFVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKG 1833
             F SALEIL+ +E DL   SCL PD+Y++VLIAL++KNQ+ +ALSIFLK L+ +  D   
Sbjct: 158  NFDSALEILEFVEGDLANSSCLSPDVYNSVLIALVQKNQVNLALSIFLKLLETN--DGNS 215

Query: 1832 SGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLST 1653
             G   AIACNELLVGL++ NM  EF  VF KLR    FP DRWGYNICIH FGC GDLS 
Sbjct: 216  IGVSSAIACNELLVGLKRGNMRAEFKQVFDKLRGGNVFPFDRWGYNICIHAFGCWGDLSR 275

Query: 1652 SLTLFKEMKERGDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLFTYRV 1473
            SL+LFKEMKERG  FSPDLCTYNSLI VLCL GKV DA  VWEELKGSSG EPD +TYR+
Sbjct: 276  SLSLFKEMKERGSCFSPDLCTYNSLIHVLCLLGKVKDAFVVWEELKGSSGLEPDAYTYRI 335

Query: 1472 LVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKMAEED 1293
            ++QGCSKAY I DA+ +F++MQ +G+RPDT +YNSLL+GL+K +KLT+ACNLF+KM E+D
Sbjct: 336  VIQGCSKAYLINDAIKVFTEMQYNGIRPDTIVYNSLLDGLLKVRKLTDACNLFQKMIEDD 395

Query: 1292 GVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREGQXXX 1113
            GVRAS WTYNILIDGLF+NGRALAAYTLF DLKKK +NFVDG+++SIV LHLCRE +   
Sbjct: 396  GVRASCWTYNILIDGLFKNGRALAAYTLFCDLKKKSNNFVDGVSYSIVILHLCREDRLDE 455

Query: 1112 XXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWKATME 933
                     ARGF VDLVTI+SLLI+ YR G  D TERLMK+IRD NLVP ++ WK +ME
Sbjct: 456  ALKLVEEMEARGFTVDLVTITSLLIAIYREGHWDYTERLMKHIRDSNLVPIIIRWKDSME 515

Query: 932  DSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNIDPWSASP 753
             +MKA QS+EKD TP+FPS  +F DIL + NL + +TD  LG E+ E      DPWS+SP
Sbjct: 516  ATMKAPQSREKDFTPIFPSNRNFGDILGLENLTDAETDIALGAEEAEIHYQESDPWSSSP 575

Query: 752  YLDLLANQLSPRS----LFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLACKLF 585
            Y+DLLA+++S +S     FSL+ G+RI  K  DSFDIDMVNT+LSIFLAKGKLS+ACKLF
Sbjct: 576  YMDLLADKVSSQSNSSRTFSLTGGKRIDTKSADSFDIDMVNTFLSIFLAKGKLSMACKLF 635

Query: 584  EIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVIIQGLGKM 405
            EIFT+MG DPVSYT+NSMMSSFVKKGY NEAWGVLQ MGE++CP+D+ATYNVIIQGLGKM
Sbjct: 636  EIFTDMGADPVSYTYNSMMSSFVKKGYFNEAWGVLQEMGEKVCPSDVATYNVIIQGLGKM 695

Query: 404  GRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGINPDVVTY 225
            GRADLA AVLDKLMKQGGYLDIVMYNTLINALGKAGRIEE N LFQQM+ SGINPDVVTY
Sbjct: 696  GRADLADAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEVNKLFQQMKDSGINPDVVTY 755

Query: 224  NTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKATMKPINA 45
            NTLIE+H KAG+LK +YKFL+MML+AGCAPN VTDTTLDFLEKEIEK RYQKA+MK  N 
Sbjct: 756  NTLIEVHAKAGQLKQSYKFLRMMLEAGCAPNQVTDTTLDFLEKEIEKLRYQKASMKRPNV 815

Query: 44   EDP 36
            ++P
Sbjct: 816  DNP 818


>ref|XP_002272556.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570
            [Vitis vinifera]
          Length = 792

 Score =  963 bits (2490), Expect = 0.0
 Identities = 508/787 (64%), Positives = 610/787 (77%), Gaps = 8/787 (1%)
 Frame = -2

Query: 2369 EVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWCT 2190
            ++G++LLVASI+K+LSE G       D  SI +SE LV+Q+L R S+D  +K++FFRWC+
Sbjct: 18   KLGDMLLVASISKTLSERG---TRSPDLESIPISESLVVQILGRNSIDVFRKVEFFRWCS 74

Query: 2189 LKPNYIHSTRTYSQIFHTICRC-PQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSG 2013
             + NY HS   YS IF  +CR   +F D++P L++S+  DG+ +   TFKL+LD+ I +G
Sbjct: 75   FRHNYKHSVGAYSHIFRIVCRAGAEFLDQVPLLMSSMKDDGVVVGQETFKLLLDSLIRAG 134

Query: 2012 KFVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKG 1833
            KF SALEILDH+E +LG  + L   +Y +VL+ALIRKNQL +AL +F K L      + G
Sbjct: 135  KFDSALEILDHIE-ELG--TGLNSYVYDSVLVALIRKNQLGLALPLFFKLLGGDE-GQGG 190

Query: 1832 SGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLST 1653
               P++ ACN+LLV LRKA+M  EF  VF KLR K  F LD  GYNICIH FGC GDL T
Sbjct: 191  VPVPESNACNQLLVALRKADMKIEFRNVFEKLRAKKDFDLDTQGYNICIHAFGCWGDLGT 250

Query: 1652 SLTLFKEMKERG---DPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLFT 1482
            +L LFKEMK++      F PDLCTYNSLI+VLCL GKV DAL VWEELKGS GHEPD FT
Sbjct: 251  ALNLFKEMKDKSLNSSSFGPDLCTYNSLIRVLCLVGKVKDALIVWEELKGS-GHEPDAFT 309

Query: 1481 YRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKMA 1302
            YR+L+QGCSK+YR+ DAM IF++MQ +G  PDT +YN+LL+GL KA+K+ EAC +FEKM 
Sbjct: 310  YRILIQGCSKSYRMDDAMRIFNEMQYNGFCPDTIVYNTLLDGLFKARKVMEACQVFEKMV 369

Query: 1301 EEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREGQ 1122
            E DGVRAS WT+NI+I GLFRNGRA A YTLF DLKKKG  FVDGIT+SIV L LCREGQ
Sbjct: 370  E-DGVRASCWTHNIVICGLFRNGRAAAGYTLFCDLKKKGK-FVDGITYSIVVLQLCREGQ 427

Query: 1121 XXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWKA 942
                        ARGF+VDLVTI+SLLI F++ G  D TERLMK+IRDGNLVPNVL+WKA
Sbjct: 428  LEEALQLVEEMEARGFVVDLVTITSLLIGFHKQGRWDWTERLMKHIRDGNLVPNVLNWKA 487

Query: 941  TMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNIDPWS 762
             ME  MKA QS+ KD TPMFPS G+ ++I+S+I+ A+ + D   G+E  E  + + D WS
Sbjct: 488  NMEAYMKAPQSRRKDYTPMFPSEGNLSEIMSLISSADTEMDGSPGSE--EDVAQHEDQWS 545

Query: 761  ASPYLDLLANQLSP----RSLFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLAC 594
            +SPY+D LA+QL        L SLSRG+R+ AKGIDSFDIDMVNTYLSIFLAKGKLSLAC
Sbjct: 546  SSPYMDQLASQLKSIDVSSQLLSLSRGQRVQAKGIDSFDIDMVNTYLSIFLAKGKLSLAC 605

Query: 593  KLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVIIQGL 414
            KLFEIF+NMGVDPV YT+NSMM++FVKKGY NEAWGV   MGE++CP DIATYNVIIQGL
Sbjct: 606  KLFEIFSNMGVDPVIYTYNSMMTAFVKKGYFNEAWGVFHEMGEKVCPPDIATYNVIIQGL 665

Query: 413  GKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGINPDV 234
            GKMGRADLASAVLD LMKQGGYLDIVMYNTLINALGKAGRI+EA  LF+QM++SGINPDV
Sbjct: 666  GKMGRADLASAVLDMLMKQGGYLDIVMYNTLINALGKAGRIDEATKLFEQMRSSGINPDV 725

Query: 233  VTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKATMKP 54
            VT+NTLIEIH KAG+LK AYKFLK+MLDAGC+PNHVTDTTLDFL KEIEK RY+KA++  
Sbjct: 726  VTFNTLIEIHAKAGQLKAAYKFLKLMLDAGCSPNHVTDTTLDFLGKEIEKLRYKKASIIR 785

Query: 53   INAEDPS 33
             + +D S
Sbjct: 786  TSKDDSS 792


>gb|EOX95524.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao]
          Length = 807

 Score =  960 bits (2481), Expect = 0.0
 Identities = 514/786 (65%), Positives = 606/786 (77%), Gaps = 17/786 (2%)
 Frame = -2

Query: 2366 VGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC-T 2190
            +GN+LL+AS+ K+LSE  G RNL  D  SI +SE LV+Q+L + SL+ S+KLDFF WC +
Sbjct: 23   LGNILLIASLTKTLSE-SGTRNL--DPNSIPISEPLVIQILRKHSLEPSKKLDFFNWCRS 79

Query: 2189 LKPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSGK 2010
            +KPN+ HS  TYS IF T+CR   F +E+P+LL ++  DG+ +DS TFK +LDAFI SGK
Sbjct: 80   VKPNFKHSAVTYSHIFRTLCRSG-FVEEVPNLLFAMKEDGVLVDSDTFKFLLDAFIRSGK 138

Query: 2009 FVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKG- 1833
            F SALEILD ME +LGA   L+  +Y +VL+ALIRK+Q+ +ALS+F K L+    ++ G 
Sbjct: 139  FDSALEILDFME-ELGAGLNLR--VYDSVLVALIRKDQVGLALSLFFKLLEACNGNDDGN 195

Query: 1832 ---SGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGD 1662
               S  P +IA NELLV LRKA+M  EF  VF  LREK  F  D  GYNICIH FGC GD
Sbjct: 196  SVDSSLPGSIAINELLVALRKAHMRREFKQVFDILREKREFEFDTCGYNICIHSFGCWGD 255

Query: 1661 LSTSLTLFKEMKERGDPFS---PDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPD 1491
            L  SL LFKEMKE+   F    PDLCTYNSLI VLCL GKV DAL VWEELK  SGHEPD
Sbjct: 256  LGASLKLFKEMKEKEKSFGSFGPDLCTYNSLIDVLCLVGKVKDALVVWEELK-VSGHEPD 314

Query: 1490 LFTYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFE 1311
             FTYR+L+QGCSK+YR+ DA  IFS+MQ +G   DT +YNSLLNGL KA+K+ EAC  FE
Sbjct: 315  AFTYRILIQGCSKSYRMDDATKIFSEMQYNGFAMDTVVYNSLLNGLFKARKVMEACQFFE 374

Query: 1310 KMAEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCR 1131
            KM + DGVRAS WTYNILIDGLFRNGRA AAYTLF DLKKKG  FVDGIT+SIV L LCR
Sbjct: 375  KMVQ-DGVRASCWTYNILIDGLFRNGRAEAAYTLFCDLKKKGQ-FVDGITYSIVVLQLCR 432

Query: 1130 EGQXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLS 951
            EGQ            ARGFIVDLVTI+SLLI F++ G  D TERLMK+IRDGNLVPNVL 
Sbjct: 433  EGQLEGALRLVEEMEARGFIVDLVTITSLLIGFHKQGRWDWTERLMKHIRDGNLVPNVLK 492

Query: 950  WKATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDI-----EPE 786
            WKA ME SMK      KD TP+FPS+GDF +I++++        + L +ED      E  
Sbjct: 493  WKANMEASMKNPPKNRKDYTPLFPSKGDFREIMNLLGSVGQAMGTNLDSEDCDEKDQEKP 552

Query: 785  SDNIDPWSASPYLDLLANQ--LSPRS--LFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLA 618
            S + D WS+SPY+D LANQ   + RS  LFSL RG+R+  KGI SFD+DMVNT+LSIFLA
Sbjct: 553  SIDTDQWSSSPYMDQLANQGKSTERSSQLFSLIRGQRVQEKGIGSFDVDMVNTFLSIFLA 612

Query: 617  KGKLSLACKLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIAT 438
            KGKLSLACKLFE+FT+MGVDPVSYT+NS+MSSFVKKGY NEAWGVL  M E++CPADIAT
Sbjct: 613  KGKLSLACKLFEVFTDMGVDPVSYTYNSIMSSFVKKGYFNEAWGVLNEMDEKVCPADIAT 672

Query: 437  YNVIIQGLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQ 258
            YN+IIQGLGKMGRAD+AS+VLDKLMKQGGYLD+VMYNTL+NALGKAGR++EA+ LF+QM+
Sbjct: 673  YNLIIQGLGKMGRADIASSVLDKLMKQGGYLDVVMYNTLVNALGKAGRVDEASKLFEQMR 732

Query: 257  TSGINPDVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHR 78
            TSGINPDV+TYNTLIE+H KAG+L+DAYKFLKMMLDAGC+PNHVTDT LD L KEIEK R
Sbjct: 733  TSGINPDVITYNTLIEVHTKAGQLQDAYKFLKMMLDAGCSPNHVTDTILDNLGKEIEKMR 792

Query: 77   YQKATM 60
             QKA+M
Sbjct: 793  LQKASM 798


>ref|XP_002302689.2| hypothetical protein POPTR_0002s18390g [Populus trichocarpa]
            gi|550345304|gb|EEE81962.2| hypothetical protein
            POPTR_0002s18390g [Populus trichocarpa]
          Length = 776

 Score =  946 bits (2445), Expect = 0.0
 Identities = 496/787 (63%), Positives = 600/787 (76%), Gaps = 10/787 (1%)
 Frame = -2

Query: 2366 VGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWCTL 2187
            +GN+LLVA + K+LSE  G R+L+ D  SI LSE LVLQ+L R SLD+S+K++FF+WC++
Sbjct: 1    MGNILLVAYLTKTLSE-SGTRSLDPD--SIPLSESLVLQILRRNSLDSSKKMEFFKWCSV 57

Query: 2186 KPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSGKF 2007
            +  Y HS  TYSQ+F T+CR   + DE+PDLLNS+ +DG+ + S TFKL+LDAFI SGKF
Sbjct: 58   RHIYKHSVSTYSQMFSTLCRSG-YLDEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGKF 116

Query: 2006 VSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEK--- 1836
             SAL+ILDHME +LG  S   P +Y ++++AL +KNQ+ +ALSI  K L+ S  +E+   
Sbjct: 117  DSALDILDHME-ELG--SNPNPHMYDSIIVALAKKNQVGLALSIMFKLLEASDGNEENAV 173

Query: 1835 GSGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLS 1656
            G   P ++ACN LLV LR   M  EF  VF KLR KG F L+ WGYNICIH FGC GDL+
Sbjct: 174  GVSLPGSVACNALLVALRNGEMKVEFKTVFAKLRGKGGFELNTWGYNICIHAFGCWGDLT 233

Query: 1655 TSLTLFKEMKERG---DPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLF 1485
            TSL LFKEMKE+        PDLCTYNSLI VLCLAGKV DA+ V+EELK  SGHEPD F
Sbjct: 234  TSLRLFKEMKEKSLASGSLDPDLCTYNSLIHVLCLAGKVKDAVIVYEELK-VSGHEPDAF 292

Query: 1484 TYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKM 1305
            TYR+L+QGC K+Y++ DA  IFS+MQ +G  PDT +YNSLL+G+ KA+K+ EAC LFEKM
Sbjct: 293  TYRILIQGCCKSYQMEDATKIFSEMQYNGFLPDTVVYNSLLDGMFKARKVMEACQLFEKM 352

Query: 1304 AEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREG 1125
             + DGVRAS WTYNILIDGL +NGRA A Y LF  LKKKG  FVD +T+SIV L LCR+G
Sbjct: 353  VQ-DGVRASCWTYNILIDGLCKNGRAEAGYNLFCGLKKKGQ-FVDAVTYSIVVLLLCRKG 410

Query: 1124 QXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWK 945
                          RGF+VDL+TI+SLLI+F++ G  D TERLMK+IRD NL+PNVL W+
Sbjct: 411  HLEEALHLVEEMEERGFVVDLITITSLLIAFHKQGRWDCTERLMKHIRDVNLLPNVLKWR 470

Query: 944  ATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNIDPW 765
            A ME S+K      +D TPMFPS G   +I+S I+    ++D G  TED +  S + D W
Sbjct: 471  ADMEASLKNPPRSREDYTPMFPSTGGLQEIMSSISSPKSRSDDG-ATEDEKSSSADTDQW 529

Query: 764  SASPYLDLLANQLSPRSL----FSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLA 597
            S+SPY+D LANQ     L    FSL+RG+R+ AKG  SFDIDMVNT+LSIFLAKGKLSLA
Sbjct: 530  SSSPYMDHLANQAKSTDLSSQLFSLARGQRVQAKGAGSFDIDMVNTFLSIFLAKGKLSLA 589

Query: 596  CKLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVIIQG 417
            CKLFEIFT+MGVDPVSYT+NS+MSSFVKKGY N AW V   MGE++CP DIATYN++IQG
Sbjct: 590  CKLFEIFTDMGVDPVSYTYNSIMSSFVKKGYFNRAWDVFNEMGEKVCPPDIATYNLVIQG 649

Query: 416  LGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGINPD 237
            LGKMGRADLAS+VLDKLMKQGGYLDIVMYNTLI+ALGKAGRI+EAN LF+QM+ SG+NPD
Sbjct: 650  LGKMGRADLASSVLDKLMKQGGYLDIVMYNTLIDALGKAGRIDEANNLFEQMKISGLNPD 709

Query: 236  VVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKATMK 57
            VVTYN +IE+H+K GRLKDAYKFLKMMLDAGC PNHVTDTTLDFL KEIEK RYQKA++ 
Sbjct: 710  VVTYNIMIEVHSKTGRLKDAYKFLKMMLDAGCLPNHVTDTTLDFLAKEIEKLRYQKASIM 769

Query: 56   PINAEDP 36
                + P
Sbjct: 770  RQKDDSP 776


>ref|XP_006386676.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550345301|gb|ERP64473.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 776

 Score =  939 bits (2426), Expect = 0.0
 Identities = 493/787 (62%), Positives = 599/787 (76%), Gaps = 10/787 (1%)
 Frame = -2

Query: 2366 VGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWCTL 2187
            +GN+LLVA + K+LSE  G R+L+ D  SI LSE LVLQ+L R SLD+S+K++FF+WC++
Sbjct: 1    MGNILLVAYLTKTLSE-SGTRSLDPD--SIPLSEYLVLQILRRNSLDSSKKMEFFKWCSV 57

Query: 2186 KPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSGKF 2007
            +  Y HS  TYSQ+F T+CR   + +E+PDLLNS+ +DG+ + S TFKL+LDAFI SGKF
Sbjct: 58   RHIYKHSVSTYSQMFSTLCRSG-YLEEVPDLLNSMKNDGVVVGSETFKLLLDAFIRSGKF 116

Query: 2006 VSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKGS- 1830
             SAL+ILDHME +LG  S   P +Y ++++AL +KNQ+ +ALSI  K L+ S  +E+ + 
Sbjct: 117  DSALDILDHME-ELG--SNPNPHMYDSIIVALAKKNQVGLALSIMFKLLEASDGNEENAV 173

Query: 1829 --GTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLS 1656
                P ++ACN LLV LR   M  EF  VF KLR K  F L+ WGYNICIH FGC GDL+
Sbjct: 174  RVSLPGSVACNALLVALRNGEMKVEFKTVFAKLRGKVGFKLNTWGYNICIHAFGCWGDLT 233

Query: 1655 TSLTLFKEMKERG---DPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLF 1485
            TSL LFKEMKE+        PDLCTYNSLI VLCLAGKV DA+ V+EELK  SGHEPD F
Sbjct: 234  TSLRLFKEMKEKSLASGSLDPDLCTYNSLIHVLCLAGKVKDAVIVYEELK-VSGHEPDAF 292

Query: 1484 TYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKM 1305
            TYR+L+QGC K+Y++ DA  IFS+MQ +G  PDT +YNSLL+G+ KA+K+ EAC LFEKM
Sbjct: 293  TYRILIQGCCKSYQMEDATKIFSEMQYNGFLPDTVVYNSLLDGMFKARKVMEACQLFEKM 352

Query: 1304 AEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREG 1125
             + DGVRAS WTYNILIDGL +NGRA A Y LF  LKKKG  FVD +T+SIV L LCR+G
Sbjct: 353  VQ-DGVRASCWTYNILIDGLCKNGRAEAGYNLFCGLKKKGQ-FVDAVTYSIVVLLLCRKG 410

Query: 1124 QXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWK 945
                          RGF+VDL+TI+SLLI+F++ G  D TERLMK+IRD NL+PNVL W+
Sbjct: 411  HLEEALHLVEEMEERGFVVDLITITSLLIAFHKQGRWDCTERLMKHIRDVNLLPNVLKWR 470

Query: 944  ATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNIDPW 765
            A ME S+K      +D TPMFPS G   +I+S I+    ++D G  TED +  S + D W
Sbjct: 471  ADMEASLKNPPRSREDYTPMFPSTGGLQEIMSSISSPKSRSDDG-ATEDEKSSSADTDQW 529

Query: 764  SASPYLDLLANQLSPRSL----FSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLA 597
            S+SPY+D LANQ     L    FSL+RG+R+ AKG  SFDIDMVNT+LSIFLAKGKLSLA
Sbjct: 530  SSSPYMDHLANQAKSTDLSSQLFSLARGQRVQAKGAGSFDIDMVNTFLSIFLAKGKLSLA 589

Query: 596  CKLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVIIQG 417
            CKLFEIFT+MGVDPVSYT+NS+MSSFVKKGY N AW V   MGE++CP DIATYN++IQG
Sbjct: 590  CKLFEIFTDMGVDPVSYTYNSIMSSFVKKGYFNRAWDVFNEMGEKVCPPDIATYNLVIQG 649

Query: 416  LGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGINPD 237
            LGKMGRADLAS+VLDKLMKQGGYLDIVMYNTLI+ALGKAGRI+EAN LF+QM+ SG+NPD
Sbjct: 650  LGKMGRADLASSVLDKLMKQGGYLDIVMYNTLIDALGKAGRIDEANNLFEQMKISGLNPD 709

Query: 236  VVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKATMK 57
            VVTYN +IE+H+K GRLKDAYKFLKMMLDAGC PNHVTDTTLDFL KEIEK RYQKA++ 
Sbjct: 710  VVTYNIMIEVHSKTGRLKDAYKFLKMMLDAGCLPNHVTDTTLDFLAKEIEKLRYQKASIM 769

Query: 56   PINAEDP 36
                + P
Sbjct: 770  RQKDDSP 776


>ref|XP_004308750.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Fragaria vesca subsp. vesca]
          Length = 789

 Score =  931 bits (2407), Expect = 0.0
 Identities = 482/780 (61%), Positives = 591/780 (75%), Gaps = 10/780 (1%)
 Frame = -2

Query: 2372 SEVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC 2193
            +E+G++LLVASI K+LS+  G RNL +    + L+E L+LQ+L  +SL  S+KLDFF+WC
Sbjct: 17   AELGDILLVASITKTLSQ-SGTRNLPQP---LPLTEPLLLQILRTQSLHPSKKLDFFKWC 72

Query: 2192 TLKPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSG 2013
            +L  +   S R +S + HT CR   F  EIP+LL  +  D LA+DS TFK +LDAFI  G
Sbjct: 73   SLTHSIPPSPRAFSHVLHTACRAG-FLAEIPELLTIMRRDSLAVDSGTFKSLLDAFIREG 131

Query: 2012 KFVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKG 1833
            KF  A+EILD M++     + L  D+Y++VL+AL+RK QLR+A+SI ++ L+    D+  
Sbjct: 132  KFDMAIEILDTMQE---VNAELNADMYNSVLVALVRKGQLRLAMSILVRLLEGGSCDQ-- 186

Query: 1832 SGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLST 1653
               P  IACNELLVGLRK +M  EF  V+ KLR   +F +D WGYNICIH FGC GDL T
Sbjct: 187  --VPSCIACNELLVGLRKGDMRVEFKQVYDKLRGNEWFEMDTWGYNICIHAFGCWGDLGT 244

Query: 1652 SLTLFKEMKE-RGDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLFTYR 1476
            SL+LFKEMK+   D   PDL TYNSLI VLCL GKV DA+ VWEELK  SGHEPD  TYR
Sbjct: 245  SLSLFKEMKDLNSDSVFPDLSTYNSLIHVLCLVGKVDDAITVWEELK-CSGHEPDAITYR 303

Query: 1475 VLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKMAEE 1296
            +L+QGC K YRI +A  IFS+MQ +G  PDT +YNSL++GL KA+K+ E C +FE+M + 
Sbjct: 304  ILIQGCCKCYRIEEATRIFSEMQNNGYNPDTVVYNSLIDGLFKARKVNEGCQMFERMIQY 363

Query: 1295 DGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREGQXX 1116
             GVRAS+WTYNILIDGLFRN RA AAYTLF DLKKKG  FVDG+T+SIV L LCREG   
Sbjct: 364  -GVRASTWTYNILIDGLFRNARAEAAYTLFCDLKKKGQ-FVDGVTYSIVVLQLCREGLLE 421

Query: 1115 XXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWKATM 936
                       RGF VDLVTIS+L+IS Y+H   D T++LMK IRDGNL+P+VL WK  M
Sbjct: 422  EALGLAEEMEMRGFTVDLVTISTLIISLYKHSRWDWTDKLMKRIRDGNLLPSVLKWKVDM 481

Query: 935  EDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDN-----ID 771
            E ++K+ Q  +KD TP+FPS GDF+D+LS+I+      D G  T+D   + D      ID
Sbjct: 482  EATLKSPQKNKKDHTPLFPSNGDFSDVLSLISSVASTMDGGFETDDAGVKDDKNSSTPID 541

Query: 770  PWSASPYLDLLANQLSPRSL----FSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLS 603
             WS+SP++D LANQ++        FSLSRG+R+ AKG D+FDIDMVNT+LS+FLAKGKLS
Sbjct: 542  QWSSSPHMDQLANQITSTDQSSQQFSLSRGQRVQAKGDDTFDIDMVNTFLSLFLAKGKLS 601

Query: 602  LACKLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVII 423
            +ACKLFEIF++ G +PVSYT+NS++SSFVKKGY NEAWGVL  MGE++CP DIATYN+II
Sbjct: 602  MACKLFEIFSDTGANPVSYTYNSILSSFVKKGYFNEAWGVLSEMGEKVCPTDIATYNMII 661

Query: 422  QGLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGIN 243
            QGLGKMGRADLAS+VLDKLMKQGGYLD+VMYNTLINALGKA RI+E N LF+QM++SGIN
Sbjct: 662  QGLGKMGRADLASSVLDKLMKQGGYLDVVMYNTLINALGKANRIDEVNKLFKQMKSSGIN 721

Query: 242  PDVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKAT 63
            PDVVT+NTLIE+H+KAGRLKDAYKFLKMMLD+GC PNHVTDTTLDFL KEIEK RYQKA+
Sbjct: 722  PDVVTFNTLIEVHSKAGRLKDAYKFLKMMLDSGCIPNHVTDTTLDFLGKEIEKSRYQKAS 781


>ref|XP_006491416.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Citrus sinensis]
          Length = 790

 Score =  925 bits (2391), Expect = 0.0
 Identities = 486/781 (62%), Positives = 602/781 (77%), Gaps = 15/781 (1%)
 Frame = -2

Query: 2369 EVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWCT 2190
            ++G++LL+A + K+L E  G RNL  D  SI +SE LVLQVL + SLD+S+KLDFFRWC+
Sbjct: 18   QLGSILLLAFVTKTLKE-SGTRNL--DPRSIPISEPLVLQVLGKNSLDSSKKLDFFRWCS 74

Query: 2189 -LKPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSG 2013
             L+P Y H+  TYS IF T+CR   F +E+P LLNS+  D + +DS TFKL+L+  I SG
Sbjct: 75   SLRPIYKHTACTYSHIFRTVCRAG-FLEEVPSLLNSMQEDDVVVDSETFKLLLEPCIKSG 133

Query: 2012 KFVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFL----DNSRT 1845
            K   A+EILD+ME +LG  + L P++Y +VL++L+RK QL +A+SI  K L    DN+  
Sbjct: 134  KIDFAIEILDYME-ELG--TSLSPNVYDSVLVSLVRKKQLGLAMSILFKLLEACNDNTAD 190

Query: 1844 DEKGSGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKG 1665
            +      P  +ACNELLV LRK++   EF  VF +L+E+  F  D +GYNICIH FGC G
Sbjct: 191  NSVVESLPGCVACNELLVALRKSDRRSEFKQVFERLKEQKEFEFDIYGYNICIHAFGCWG 250

Query: 1664 DLSTSLTLFKEMKERGDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLF 1485
            DL TSL LFKEMKE+G    PDL TYNSLIQVLC+ GKV DAL VWEELKGS GHEP+ F
Sbjct: 251  DLHTSLRLFKEMKEKG--LVPDLHTYNSLIQVLCVVGKVKDALIVWEELKGS-GHEPNEF 307

Query: 1484 TYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKM 1305
            T+R+++QGC K+YR+ DAM IFS+MQ +G+ PDT +YNSLLN + K++K+ EAC LFEKM
Sbjct: 308  THRIIIQGCCKSYRMDDAMKIFSEMQYNGLIPDTVVYNSLLNRMFKSRKVMEACQLFEKM 367

Query: 1304 AEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREG 1125
             + DGVR S WT+NILIDGLFRNGRA AAYTLF DLKKKG  FVDGITFSIV L LCREG
Sbjct: 368  VQ-DGVRTSCWTHNILIDGLFRNGRAEAAYTLFCDLKKKGK-FVDGITFSIVVLQLCREG 425

Query: 1124 QXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWK 945
            Q             RGF+VDLVTISSLLI F+++G  D TERLMK+IRDGNLV +VL WK
Sbjct: 426  QIEEALRLVEEMEGRGFVVDLVTISSLLIGFHKYGRWDFTERLMKHIRDGNLVLDVLKWK 485

Query: 944  ATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESD----- 780
            A +E +MK+++SK KD TPMFP +GD ++I+S+I   N +TD+ LG+ + + + +     
Sbjct: 486  ADVEATMKSRKSKRKDYTPMFPYKGDLSEIMSLIGSTNLETDANLGSGEGDAKDEGSQLT 545

Query: 779  NIDPWSASPYLDLLANQLSP----RSLFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKG 612
            N D WS+SPY+D LA+Q+        LFSL+RG R+  KG+ +FDIDMVNT+LSIFLAKG
Sbjct: 546  NSDEWSSSPYMDKLADQVKSDCHSSQLFSLARGLRVQGKGMGTFDIDMVNTFLSIFLAKG 605

Query: 611  KLSLACKLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYN 432
            KL+LACKLFEIFT+MGV PV+YT+NSMMSSFVKKGY N+AWGVL  MGE+ CP DIATYN
Sbjct: 606  KLNLACKLFEIFTDMGVHPVNYTYNSMMSSFVKKGYFNQAWGVLNEMGEKFCPTDIATYN 665

Query: 431  VIIQGLGKMGRADLASAVLDKLMKQ-GGYLDIVMYNTLINALGKAGRIEEANTLFQQMQT 255
            V+IQGLGKMGRADLAS +LDKLMKQ GGYLD+VMYNTLIN LGKAGR +EAN LF+QM+T
Sbjct: 666  VVIQGLGKMGRADLASTILDKLMKQGGGYLDVVMYNTLINVLGKAGRFDEANMLFEQMRT 725

Query: 254  SGINPDVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRY 75
            SGINPDVVT+NTLIE++ KAGRLK+A+ FLKMMLD+GC PNHVTDTTLDFL +EI++ + 
Sbjct: 726  SGINPDVVTFNTLIEVNGKAGRLKEAHYFLKMMLDSGCTPNHVTDTTLDFLGREIDRLKD 785

Query: 74   Q 72
            Q
Sbjct: 786  Q 786


>ref|XP_002515124.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545604|gb|EEF47108.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 898

 Score =  923 bits (2386), Expect = 0.0
 Identities = 485/776 (62%), Positives = 595/776 (76%), Gaps = 11/776 (1%)
 Frame = -2

Query: 2372 SEVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC 2193
            +++ ++LLVA + K+LSE  G RNL+ D   I LSE L+LQ+L + SLDAS+K++FF+WC
Sbjct: 47   NQLESILLVAFLNKALSE-SGVRNLDPDF--IPLSEPLILQILRQNSLDASKKIEFFKWC 103

Query: 2192 TLKPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSG 2013
            +   NY HS   YS +F T+C    F +E+  LLNS+  D   + + TFK +LD FI+ G
Sbjct: 104  SFSHNYKHSACVYSHMFRTVCNAGYF-EEVRSLLNSMKDDCAIVGTGTFKFLLDTFINLG 162

Query: 2012 KFVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKG 1833
             F  ALE+LD ME +LG  + L P +Y +VL+AL RKNQ+ +ALSIF K L+ S   + G
Sbjct: 163  NFDFALELLDVME-ELG--TNLNPHMYDSVLVALTRKNQIGLALSIFFKLLETSNDIDIG 219

Query: 1832 SGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLST 1653
               P ++ACN LLV LRKA+M  EF  VF KL+  GF  LD WGYNICIH FGC  DL T
Sbjct: 220  VSVPGSVACNTLLVALRKADMRVEFKKVFDKLKGMGF-ELDTWGYNICIHAFGCWSDLGT 278

Query: 1652 SLTLFKEMKERGDPFS---PDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLFT 1482
            +L LFKEMKE+   F    PDLCTYNSLI++LC +GKV DAL V+EELK  SGHEPD FT
Sbjct: 279  ALRLFKEMKEKSKGFGSCCPDLCTYNSLIRLLCFSGKVKDALVVYEELK-ISGHEPDAFT 337

Query: 1481 YRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKMA 1302
            YR++++GCSK+YR+ DA  IFS+MQ +G  PDT +YNSLL+G+ KA+K+TEAC LFEKM 
Sbjct: 338  YRIIIEGCSKSYRMNDATKIFSEMQYNGFVPDTTVYNSLLDGMFKARKVTEACQLFEKMV 397

Query: 1301 EEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREGQ 1122
            + DGVRASSWTYNILIDGL +NGR+ A Y+LF DLKKKG  FVD IT+SI+ L LCREGQ
Sbjct: 398  Q-DGVRASSWTYNILIDGLCKNGRSAAGYSLFCDLKKKGK-FVDAITYSIIVLLLCREGQ 455

Query: 1121 XXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWKA 942
                         RGF+VDLVTI+SLLI+F++ G  D TE+LMK++RDGNLVPNVL+W+A
Sbjct: 456  LKEALSLVEEMEERGFVVDLVTITSLLIAFHKQGRWDWTEKLMKHVRDGNLVPNVLNWQA 515

Query: 941  TMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNI---- 774
             ME S+K  +S+ KD TPMF S G  ++I+++I   + K + GL    +E   DNI    
Sbjct: 516  DMEASLKNPRSRRKDYTPMFLSNGSLSEIINIIRYPDLK-NHGLDDNAVE-HGDNISAET 573

Query: 773  DPWSASPYLDLLANQLSPRS----LFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKL 606
            D WS+SPY+D LANQ+         FSL+RG+R+ AKG++SFDIDMVNT+LSIFLAKGKL
Sbjct: 574  DQWSSSPYMDHLANQVKSTDNCSQSFSLARGQRVQAKGVESFDIDMVNTFLSIFLAKGKL 633

Query: 605  SLACKLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVI 426
            S+ACKLFEIF++MGV+PVSYT+NS+MSSFVKKGY +EAW VL  MGE++CP+DIATYN+I
Sbjct: 634  SVACKLFEIFSDMGVNPVSYTYNSIMSSFVKKGYFSEAWDVLNQMGEKVCPSDIATYNLI 693

Query: 425  IQGLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGI 246
            IQGLGKMGRADLAS+VLDKLMKQGGYLDIVMYNTLINALGKAGRI+E   LF+QM+TSGI
Sbjct: 694  IQGLGKMGRADLASSVLDKLMKQGGYLDIVMYNTLINALGKAGRIDEVRKLFEQMKTSGI 753

Query: 245  NPDVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHR 78
            NPDVVTYNTLIE+H KAGRLKDAYKFLKMMLDAGC PNHVTDTTLDFL KEIEK R
Sbjct: 754  NPDVVTYNTLIEVHTKAGRLKDAYKFLKMMLDAGCLPNHVTDTTLDFLAKEIEKQR 809


>ref|NP_192066.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75161629|sp|Q8VZE4.1|PP299_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g01570 gi|18086402|gb|AAL57659.1| AT4g01570/T15B16_21
            [Arabidopsis thaliana] gi|24797024|gb|AAN64524.1|
            At4g01570/T15B16_21 [Arabidopsis thaliana]
            gi|332656643|gb|AEE82043.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 805

 Score =  907 bits (2343), Expect = 0.0
 Identities = 488/789 (61%), Positives = 588/789 (74%), Gaps = 18/789 (2%)
 Frame = -2

Query: 2360 NVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC-TLK 2184
            NVLLVAS++K+LS+  G R+L  DA SI +SE +VLQ+L R S+D S+KLDFFRWC +L+
Sbjct: 29   NVLLVASLSKTLSQ-SGTRSL--DANSIPISEPVVLQILRRNSIDPSKKLDFFRWCYSLR 85

Query: 2183 PNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSGKFV 2004
            P Y HS   YSQIF T+CR      E+PDLL S+  DG+ LD    K++LD+ I SGKF 
Sbjct: 86   PGYKHSATAYSQIFRTVCRTGLL-GEVPDLLGSMKEDGVNLDQTMAKILLDSLIRSGKFE 144

Query: 2003 SALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFL---DNSRTDEKG 1833
            SAL +LD+ME +LG   CL P +Y +VLIAL++K++LR+ALSI  K L   DN   D+ G
Sbjct: 145  SALGVLDYME-ELG--DCLNPSVYDSVLIALVKKHELRLALSILFKLLEASDNHSDDDTG 201

Query: 1832 -----SGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCK 1668
                 S  P  +A NELLVGLR+A+M  EF  VF KL+    F  D W YNICIHGFGC 
Sbjct: 202  RVIIVSYLPGTVAVNELLVGLRRADMRSEFKRVFEKLKGMKRFKFDTWSYNICIHGFGCW 261

Query: 1667 GDLSTSLTLFKEMKER----GDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGH 1500
            GDL  +L+LFKEMKER    G  F PD+CTYNSLI VLCL GK  DAL VW+ELK  SGH
Sbjct: 262  GDLDAALSLFKEMKERSSVYGSSFGPDICTYNSLIHVLCLFGKAKDALIVWDELK-VSGH 320

Query: 1499 EPDLFTYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACN 1320
            EPD  TYR+L+QGC K+YR+ DAM I+ +MQ +G  PDT +YN LL+G +KA+K+TEAC 
Sbjct: 321  EPDNSTYRILIQGCCKSYRMDDAMRIYGEMQYNGFVPDTIVYNCLLDGTLKARKVTEACQ 380

Query: 1319 LFEKMAEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALH 1140
            LFEKM +E GVRAS WTYNILIDGLFRNGRA A +TLF DLKKKG  FVD ITFSIV L 
Sbjct: 381  LFEKMVQE-GVRASCWTYNILIDGLFRNGRAEAGFTLFCDLKKKGQ-FVDAITFSIVGLQ 438

Query: 1139 LCREGQXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPN 960
            LCREG+             RGF VDLVTISSLLI F++ G  D  E+LMK+IR+GNLVPN
Sbjct: 439  LCREGKLEGAVKLVEEMETRGFSVDLVTISSLLIGFHKQGRWDWKEKLMKHIREGNLVPN 498

Query: 959  VLSWKATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESD 780
            VL W A +E S+K  QSK+KD TPMFPS+G F DI+S++    G  D G   E++ P  D
Sbjct: 499  VLRWNAGVEASLKRPQSKDKDYTPMFPSKGSFLDIMSMV----GSEDDGASAEEVSPMED 554

Query: 779  NIDPWSASPYLDLLANQLS-PRSLFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLS 603
              DPWS+SPY+D LA+Q + P+ LF L+RG+R+ AK  DSFD+DM+NT+LSI+L+KG LS
Sbjct: 555  --DPWSSSPYMDQLAHQRNQPKPLFGLARGQRVEAKP-DSFDVDMMNTFLSIYLSKGDLS 611

Query: 602  LACKLFEIFTNMGV-DPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVI 426
            LACKLFEIF  MGV D  SYT+NSMMSSFVKKGY   A GVL  M E  C ADIATYNVI
Sbjct: 612  LACKLFEIFNGMGVTDLTSYTYNSMMSSFVKKGYFQTARGVLDQMFENFCAADIATYNVI 671

Query: 425  IQGLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGI 246
            IQGLGKMGRADLASAVLD+L KQGGYLDIVMYNTLINALGKA R++EA  LF  M+++GI
Sbjct: 672  IQGLGKMGRADLASAVLDRLTKQGGYLDIVMYNTLINALGKATRLDEATQLFDHMKSNGI 731

Query: 245  NPDVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKA 66
            NPDVV+YNT+IE+++KAG+LK+AYK+LK MLDAGC PNHVTDT LD+L KE+EK R++KA
Sbjct: 732  NPDVVSYNTMIEVNSKAGKLKEAYKYLKAMLDAGCLPNHVTDTILDYLGKEMEKARFKKA 791

Query: 65   TM---KPIN 48
            +    KP N
Sbjct: 792  SFVRNKPNN 800


>ref|XP_006396354.1| hypothetical protein EUTSA_v10028437mg [Eutrema salsugineum]
            gi|557097371|gb|ESQ37807.1| hypothetical protein
            EUTSA_v10028437mg [Eutrema salsugineum]
          Length = 801

 Score =  904 bits (2336), Expect = 0.0
 Identities = 486/778 (62%), Positives = 582/778 (74%), Gaps = 12/778 (1%)
 Frame = -2

Query: 2360 NVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC-TLK 2184
            NVL+VAS++K+LS   G RNL  DA S  +SE +VLQ+L R SLD S+KLDFFRWC +L+
Sbjct: 29   NVLVVASLSKTLSH-SGTRNL--DANSTPISEPIVLQILRRNSLDPSKKLDFFRWCFSLR 85

Query: 2183 PNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSGKFV 2004
            P Y HS   YSQIF T+CR      EIP+LL S+  DG+ LD  T KL+LD+ I SGK+ 
Sbjct: 86   PGYKHSASAYSQIFRTVCRTGLL-GEIPNLLGSMKEDGVNLDQTTSKLLLDSLIRSGKYD 144

Query: 2003 SALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKGSGT 1824
            SAL +LD+ME +LG   CL P LY +VLIAL++KN+LR+ALSIF K L+ S    +  G 
Sbjct: 145  SALGVLDYME-ELGG--CLNPRLYDSVLIALVKKNELRLALSIFFKLLEASDNPSETGGV 201

Query: 1823 -----PDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDL 1659
                 P  +A NELLVGLRKANM  EF  VF KL+    F  D WGYNICIHGFGC GDL
Sbjct: 202  SVSYLPGTVAVNELLVGLRKANMKLEFKGVFDKLKGMERFKFDTWGYNICIHGFGCWGDL 261

Query: 1658 STSLTLFKEMKER----GDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPD 1491
              +L+LFKEMKE+    G    PD+CTYNSLI VLCL GK  DAL VW+ELK  SGHEPD
Sbjct: 262  DAALSLFKEMKEQSSISGSCAGPDICTYNSLIHVLCLVGKAKDALIVWDELK-VSGHEPD 320

Query: 1490 LFTYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFE 1311
              TYR+L+QGC K+Y + DAM IF +MQ +G  PDT LYNSLL+G +KA+K+ EAC LFE
Sbjct: 321  NSTYRILIQGCCKSYLMDDAMRIFGEMQYNGFVPDTVLYNSLLDGTLKARKVVEACQLFE 380

Query: 1310 KMAEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCR 1131
            KM +E GVRAS WT NILIDGLFRNGRA A +TLF DLKKKG  FVD ITFSIV L LCR
Sbjct: 381  KMVQE-GVRASCWTNNILIDGLFRNGRAEAGFTLFCDLKKKGQ-FVDAITFSIVVLQLCR 438

Query: 1130 EGQXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLS 951
            EG+             RGF VDLVTISSLLI F++ G  D  E+LMK++R GNLVPNVL 
Sbjct: 439  EGKLEGAVKLVEEMETRGFSVDLVTISSLLIGFHKQGRWDWKEKLMKHVRGGNLVPNVLR 498

Query: 950  WKATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNID 771
            W A +E S+K  QSK+KD TPMFPS+G F DI+S++    G  D G   E++ P  D  D
Sbjct: 499  WNAGVEASLKRPQSKDKDYTPMFPSKGSFVDIMSLV----GSKDDGAKAEELTPVED--D 552

Query: 770  PWSASPYLDLLANQLS-PRSLFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLAC 594
            PWS+SPY+D LA+Q + P+ LF+L+RG+R+ AK  DSFD+DM+NT+LSI+L+KG LSLAC
Sbjct: 553  PWSSSPYMDQLAHQSNQPKPLFALARGQRVEAKP-DSFDVDMMNTFLSIYLSKGDLSLAC 611

Query: 593  KLFEIFTNMGV-DPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVIIQG 417
            KLFEIF  MGV D  SYT+NSMMSSFVKKGY   A GVL  MGE  C ADIATYNVIIQG
Sbjct: 612  KLFEIFNEMGVTDLTSYTYNSMMSSFVKKGYFKTARGVLDQMGENFCAADIATYNVIIQG 671

Query: 416  LGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGINPD 237
            LGKMGRADLASAVLD+L +QGGYLDIVMYNTLINALGKA R++EA  LF+ M++SGINPD
Sbjct: 672  LGKMGRADLASAVLDRLTEQGGYLDIVMYNTLINALGKANRLDEATRLFEHMKSSGINPD 731

Query: 236  VVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKAT 63
            VV+YNT+IE+++KAG+LK+AYK+LK MLDA C PNHVTDT LD+L KE+EK R++KA+
Sbjct: 732  VVSYNTMIEVNSKAGKLKEAYKYLKAMLDANCLPNHVTDTILDYLGKEMEKARFKKAS 789


>ref|XP_002874971.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297320808|gb|EFH51230.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 802

 Score =  902 bits (2331), Expect = 0.0
 Identities = 483/779 (62%), Positives = 580/779 (74%), Gaps = 13/779 (1%)
 Frame = -2

Query: 2360 NVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC-TLK 2184
            NVLLVAS++K+LS+  G R L  DA SI +SE +VLQ+L R S+D S+KLDFFRWC +L+
Sbjct: 29   NVLLVASLSKTLSQ-SGTRGL--DANSIPISEPVVLQILRRNSIDPSKKLDFFRWCYSLR 85

Query: 2183 PNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSGKFV 2004
              Y HS   YSQIF T+CR      E+PDLL S+  DG+ LD    K++LD+ I SGKF 
Sbjct: 86   TGYKHSVSAYSQIFRTVCRTGLL-GEVPDLLCSMKEDGVNLDQTMAKILLDSLIRSGKFE 144

Query: 2003 SALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFL---DNSRTDEKG 1833
            SAL +LD+ME +LG   CL P LY +VLIAL +KN+LR+ALSIF K L   DN   D  G
Sbjct: 145  SALGVLDYME-ELG--DCLNPSLYDSVLIALAKKNELRLALSIFFKLLEASDNHGDDTSG 201

Query: 1832 ---SGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGD 1662
               S  P  +A NELLVGLR+A+M  EF  VF KL+    F  D W YNICIHGFGC GD
Sbjct: 202  VTVSYLPGRVAVNELLVGLRRADMRSEFKTVFEKLKGMNRFKFDTWSYNICIHGFGCWGD 261

Query: 1661 LSTSLTLFKEMKER----GDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEP 1494
            L  +L+LFKEMKER    G  F+PD+CTYNSLI VLCL GK  DAL VW+ELK  SGHEP
Sbjct: 262  LDAALSLFKEMKERSSVSGSSFAPDICTYNSLIHVLCLFGKAKDALIVWDELK-VSGHEP 320

Query: 1493 DLFTYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLF 1314
            D  TYR+L+QGC K+YR+ DAM IF +MQ +G  PDT +YN LL+G +KA+K+TEAC LF
Sbjct: 321  DNSTYRILIQGCCKSYRMDDAMRIFGEMQYNGFVPDTVVYNCLLDGTLKARKVTEACQLF 380

Query: 1313 EKMAEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLC 1134
            EKM +E GVRAS WTYNILIDGLFRNGRA A +TLF DLKKKG  FVD ITFSIV L LC
Sbjct: 381  EKMVQE-GVRASCWTYNILIDGLFRNGRAEAGFTLFCDLKKKGQ-FVDAITFSIVVLQLC 438

Query: 1133 REGQXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVL 954
            REG+             RGF VDLVTISSLLI F++ G  D  E+LMK++R+GNLVPNVL
Sbjct: 439  REGKLEEAVKLVEEMETRGFTVDLVTISSLLIGFHKQGRWDWKEKLMKHVREGNLVPNVL 498

Query: 953  SWKATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNI 774
             W A +E S+K  Q K+KD TPMFPS+G F DI+S++ L     D G   E++ P  D  
Sbjct: 499  RWNAGVEASLKRPQRKDKDYTPMFPSKGSFLDIMSMVGLE----DDGARAEEVPPMED-- 552

Query: 773  DPWSASPYLDLLANQLS-PRSLFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLA 597
            DPWS+SPY+D LA+Q + P+ LF L+RG+R+ AK  DSFD+DM+NT+LSI+L+KG LSLA
Sbjct: 553  DPWSSSPYMDQLAHQSNRPKPLFGLARGQRVEAKP-DSFDVDMMNTFLSIYLSKGDLSLA 611

Query: 596  CKLFEIFTNMGV-DPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVIIQ 420
            CKLFEIF  MGV D  SYT+NSMMSSFVKKGY     GVL  MGE  C ADIATYNVIIQ
Sbjct: 612  CKLFEIFNGMGVTDLTSYTYNSMMSSFVKKGYFKTVRGVLDQMGENFCAADIATYNVIIQ 671

Query: 419  GLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGINP 240
            GLGKMGRADLA AVLD+L KQGGYLDIVMYNTLINA+GKA R++ A  LF  M+++GINP
Sbjct: 672  GLGKMGRADLAGAVLDRLTKQGGYLDIVMYNTLINAIGKANRLDAATQLFDHMKSNGINP 731

Query: 239  DVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKAT 63
            DVV+YNT+IE+++KAG+LK+AYK+LK MLDAGC PNHVTDT LD+L KE+EK R++KA+
Sbjct: 732  DVVSYNTMIEVNSKAGKLKEAYKYLKAMLDAGCLPNHVTDTILDYLGKEMEKARFKKAS 790


>gb|EXC13626.1| hypothetical protein L484_019583 [Morus notabilis]
          Length = 788

 Score =  901 bits (2328), Expect = 0.0
 Identities = 484/790 (61%), Positives = 589/790 (74%), Gaps = 10/790 (1%)
 Frame = -2

Query: 2372 SEVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC 2193
            S++ +VLLVAS+ K+LSE   +     D  SI LSE ++LQ+L   SL  S+KLDFF W 
Sbjct: 18   SQLADVLLVASLTKTLSE--SSTRYLPDPRSIPLSEPILLQILRNNSLHISKKLDFFTWF 75

Query: 2192 TLKPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSG 2013
            +L  +   S  +YSQ+   +CR    H E  +LL S+  +G+ +DS TFK +LD FI SG
Sbjct: 76   SLNSDLKPSAHSYSQVLRALCREGHLH-EASNLLGSMRQNGVIIDSWTFKTLLDTFIRSG 134

Query: 2012 KFVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKG 1833
            KF  ALEILD ME +LG    L   +Y +VLIAL+RK+QL  ALSIF K L++S      
Sbjct: 135  KFDFALEILDTME-ELGVT--LNSHMYDSVLIALVRKDQLSFALSIFFKILEDS------ 185

Query: 1832 SGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLST 1653
            S  P +I CNELLV L+K++M  EF  VF  +REK  F ++ WGYNICIH FG  GDL T
Sbjct: 186  SHVPSSIGCNELLVALKKSDMRVEFKQVFDGIREKKGFGMNVWGYNICIHAFGFWGDLGT 245

Query: 1652 SLTLFKEMKERGDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLFTYRV 1473
            SL+L++EMK       PDLCTYNSLI VLC  GKV DAL V+EELKGS GH+PD FTYR+
Sbjct: 246  SLSLYREMKVS---VGPDLCTYNSLIHVLCFFGKVKDALVVYEELKGS-GHQPDRFTYRI 301

Query: 1472 LVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKMAEED 1293
            L+QGC K+YRI +A  IF++M+ +G   DT +YNSL++GL+KA+K++EAC LFEKM + D
Sbjct: 302  LIQGCCKSYRIDNAEKIFNEMEYNGHCADTVVYNSLIDGLLKARKVSEACELFEKMTQ-D 360

Query: 1292 GVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREGQXXX 1113
            GVRASSWTYN LIDGLF+N RA A YT+F DLKKKG  FVDGIT+SIV L LCREG    
Sbjct: 361  GVRASSWTYNTLIDGLFKNERAEAGYTMFCDLKKKGQ-FVDGITYSIVVLQLCREGLLEE 419

Query: 1112 XXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGN-LVPNVLSWKATM 936
                      RGF+VDLVTI+SLL+  Y+ G  D T+RLMK+IRDGN L+PNVL WK  +
Sbjct: 420  ALGLVEEMEGRGFVVDLVTITSLLVGLYKQGRWDWTDRLMKHIRDGNNLLPNVLRWKIDL 479

Query: 935  EDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESD-----NID 771
            E S+K  QSK KD TPMFPS+ +F++I+S+I  AN    + L  ++++ + D     +ID
Sbjct: 480  EASLKNPQSKRKDYTPMFPSKDEFSEIMSLIRSANATMKAQLVPDNVDVKDDESVSSDID 539

Query: 770  PWSASPYLDLLANQLSPRS----LFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLS 603
             WS+SPY+D L NQ+        LFSLSRGRR+ AKG DSFDIDMVNT+LSIFLAKGKLS
Sbjct: 540  QWSSSPYMDQLTNQVLSNGRSSQLFSLSRGRRVQAKGGDSFDIDMVNTFLSIFLAKGKLS 599

Query: 602  LACKLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVII 423
            LACKLFEIFT+MGV+PVSYT+NSMM+SFVKKGY +EAW +L  MGE++CPADIATYNVII
Sbjct: 600  LACKLFEIFTDMGVNPVSYTYNSMMTSFVKKGYFDEAWNILGEMGEKVCPADIATYNVII 659

Query: 422  QGLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGIN 243
            Q LGKMGRADLASAVLDKL++QGGYLD+VMYNTLINALGKAGRI+E N  F QM+ SGIN
Sbjct: 660  QSLGKMGRADLASAVLDKLIEQGGYLDLVMYNTLINALGKAGRIDEVNKFFDQMRASGIN 719

Query: 242  PDVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKAT 63
            PDV+TYNTLIE+H KAG+LKDAYKFLKMMLDAGC PNHVTDTTLDFL KEIEK  YQKA+
Sbjct: 720  PDVITYNTLIEVHTKAGQLKDAYKFLKMMLDAGCIPNHVTDTTLDFLGKEIEKESYQKAS 779

Query: 62   MKPINAEDPS 33
            +   N +D S
Sbjct: 780  IMR-NKDDDS 788


>ref|XP_006289934.1| hypothetical protein CARUB_v10003556mg [Capsella rubella]
            gi|482558640|gb|EOA22832.1| hypothetical protein
            CARUB_v10003556mg [Capsella rubella]
          Length = 802

 Score =  901 bits (2328), Expect = 0.0
 Identities = 484/792 (61%), Positives = 589/792 (74%), Gaps = 16/792 (2%)
 Frame = -2

Query: 2360 NVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC-TLK 2184
            NVLLVAS++K+LS+  G R+L  DA SI +SE +VLQ+L R S+D+S+KLDFFRWC +L+
Sbjct: 29   NVLLVASLSKTLSQ-SGTRSL--DANSIPISESVVLQILRRSSIDSSKKLDFFRWCFSLR 85

Query: 2183 PNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSGKFV 2004
            P Y HS   YSQIF T+CR      E+PDLL S+  DG+ LD    K++LD+ I SGKF 
Sbjct: 86   PGYKHSASAYSQIFRTVCRTGLI-GEVPDLLGSMKDDGVNLDQTMAKVLLDSLIRSGKFD 144

Query: 2003 SALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKGSG- 1827
            SAL +LD+ME +LG   CL P LY +VL+AL++KN++R+ALSIF K L+ S     G+G 
Sbjct: 145  SALGVLDYME-ELG--DCLNPGLYDSVLVALVKKNEMRLALSIFFKLLEASDNHSDGTGG 201

Query: 1826 -----TPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGD 1662
                  P  +A NELLVGLR+A M  EF  VF KLRE   F  D WGYNICIHGFGC GD
Sbjct: 202  VIVSYLPGTVAVNELLVGLRRAGMRSEFKRVFEKLREVKRFKFDTWGYNICIHGFGCWGD 261

Query: 1661 LSTSLTLFKEMKER----GDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEP 1494
            L  +L+LFKEMK +    G  F PD+CTYNSLI VLCL GK  DAL VW+ELK  SGHEP
Sbjct: 262  LDAALSLFKEMKVQSSVSGSSFGPDICTYNSLIHVLCLFGKAKDALIVWDELK-VSGHEP 320

Query: 1493 DLFTYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLF 1314
            D  TYR+L+QGC K+YR+ DAM IF +MQ +G  PDT +YN LL+G +KA+K+TEAC LF
Sbjct: 321  DNSTYRILIQGCCKSYRMDDAMRIFGEMQYNGFVPDTIVYNCLLDGTLKARKVTEACQLF 380

Query: 1313 EKMAEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLC 1134
            EKM +E GVRAS WTYNILIDGLFR+GRA A +TLF DLKKKG  FVD ITFSIV L LC
Sbjct: 381  EKMVQE-GVRASCWTYNILIDGLFRSGRAEAGFTLFCDLKKKGQ-FVDAITFSIVVLQLC 438

Query: 1133 REGQXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVL 954
            +EG              RGF VDLVTISSLLI F++ G  D  E+L+K+IR+GNLV NVL
Sbjct: 439  KEGDLEAAVKLVEEMETRGFTVDLVTISSLLIGFHKQGRWDWKEKLIKHIREGNLVSNVL 498

Query: 953  SWKATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNI 774
             W A +E S+K  Q+K+KD T MFPS+G F DI+++++      D G   E++ P  D  
Sbjct: 499  RWNAGVEASLKRPQNKDKDYTSMFPSKGSFLDIMNMVS----SEDDGARDEEVSPMED-- 552

Query: 773  DPWSASPYLDLLANQLS-PRSLFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLA 597
            DPWS+SP +D LA+Q S P  LF L+RG+R+ AK  DSFD+DM+NT+LSI+L+KG LSLA
Sbjct: 553  DPWSSSPCMDQLAHQSSRPNPLFGLARGQRVEAKP-DSFDVDMMNTFLSIYLSKGDLSLA 611

Query: 596  CKLFEIFTNMGV-DPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVIIQ 420
            CKLFEIF  MGV D  SYT+NSMMSSFVKKGY   A GVL  MGE  C +DIATYNVII 
Sbjct: 612  CKLFEIFEGMGVTDLTSYTYNSMMSSFVKKGYFETARGVLDQMGENFCASDIATYNVIIH 671

Query: 419  GLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGINP 240
            GLGKMGRADLASAVLD+L KQGGYLDIVMYNTLIN+LGKA R++EA  LF+ M+++GINP
Sbjct: 672  GLGKMGRADLASAVLDRLTKQGGYLDIVMYNTLINSLGKANRLDEATRLFEHMKSNGINP 731

Query: 239  DVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKATM 60
            DVV+YNT+IE+++KAG+LK+AYK+LKMMLDAGC PNHVTDT LD+L KEIEK R++KA+ 
Sbjct: 732  DVVSYNTMIEVNSKAGKLKEAYKYLKMMLDAGCLPNHVTDTILDYLGKEIEKARFEKASF 791

Query: 59   ---KPINAEDPS 33
               KP N  DPS
Sbjct: 792  VRNKPNN--DPS 801


>ref|XP_004140525.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Cucumis sativus] gi|449523383|ref|XP_004168703.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g01570-like [Cucumis sativus]
          Length = 803

 Score =  894 bits (2311), Expect = 0.0
 Identities = 471/785 (60%), Positives = 599/785 (76%), Gaps = 14/785 (1%)
 Frame = -2

Query: 2372 SEVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC 2193
            S + ++LL+ASI K+LSE  G R L+    S+ +S  L+LQ+L  +SL+ S KLDFF+WC
Sbjct: 24   SHLSHLLLLASITKTLSE-SGTRTLQHH--SLPISHPLLLQILHSRSLNPSHKLDFFKWC 80

Query: 2192 TLKPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSG 2013
            +L PN+ HS  TYSQIFH +CR    H E+P LL+S+  DG+++DS TFK++LDAFI SG
Sbjct: 81   SLAPNFNHSPSTYSQIFHILCRSGYLH-EVPPLLDSMKRDGVSVDSHTFKVLLDAFIRSG 139

Query: 2012 KFVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLD---NSRTD 1842
            K+ +ALEILDHME DLG  + L+ + Y++VL+AL+RKNQ+ +ALSIF K LD   N    
Sbjct: 140  KYDAALEILDHME-DLG--TSLELNTYNSVLVALLRKNQVGLALSIFFKLLDGFNNGGQV 196

Query: 1841 EKGSGT----PDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFG 1674
            +  + T    P+++ACNELLV LRK +M  EF  VF KLR    F    +GYNICI+ FG
Sbjct: 197  DSAATTFHFLPNSLACNELLVALRKLDMRVEFKKVFDKLRAIESFEFSVYGYNICIYAFG 256

Query: 1673 CKGDLSTSLTLFKEMKERG---DPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSG 1503
            C G L T+L+LFKEMKE+    + FSPDLCTYNS+I VLCL GKV DAL VWEELKGS G
Sbjct: 257  CWGYLDTALSLFKEMKEKSLVSESFSPDLCTYNSIIHVLCLVGKVKDALIVWEELKGS-G 315

Query: 1502 HEPDLFTYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEAC 1323
            HEPD FTYR+++QGC K+ R+ DA  IF++M+ +G+ PDT +YNSLLNGL KA+K+TEAC
Sbjct: 316  HEPDAFTYRIIIQGCCKSCRMDDATMIFNEMEYNGLIPDTIVYNSLLNGLFKARKVTEAC 375

Query: 1322 NLFEKMAEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVAL 1143
             LF+KM +ED VRAS WTYNILIDGLFRNGRA A YTLF DLKKKG   VD +T+SI+ L
Sbjct: 376  QLFDKMVQED-VRASPWTYNILIDGLFRNGRAEAGYTLFCDLKKKGQ-IVDAVTYSIIIL 433

Query: 1142 HLCREGQXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVP 963
             LC+E              ARGF+VDL+TI+SLLI+ ++ G  D  ERLMK+IR+G+LVP
Sbjct: 434  QLCKERLLEEALQLVEEMEARGFVVDLITITSLLIAMHKQGQWDGLERLMKHIREGDLVP 493

Query: 962  NVLSWKATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPES 783
            NVL WK  ME S+K +++K KD + +F  + D ++++S    +  K +     E+ E   
Sbjct: 494  NVLKWKINMEYSIKYQKNKRKDFSSLFSPKEDLSEVISSRASSAAKVNIDNSFENTEER- 552

Query: 782  DNIDPWSASPYLDLLANQLSPRS----LFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAK 615
             ++D WS+SPY++ LAN  +  S     FS+ +GRRI  K  +SFDI+MVNT+LSIFLAK
Sbjct: 553  -DMDSWSSSPYVNRLANLANSTSDILQPFSIRQGRRIQEKQDNSFDINMVNTFLSIFLAK 611

Query: 614  GKLSLACKLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATY 435
            GKL+LACKLFEIF++MGV+PV YT+NSM+SSFVKKGY ++AWG+   MGE +CPADIATY
Sbjct: 612  GKLNLACKLFEIFSDMGVNPVKYTYNSMLSSFVKKGYFHQAWGIFNEMGENVCPADIATY 671

Query: 434  NVIIQGLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQT 255
            NVIIQGLGKMGRADLAS+VL+KLM+QGGYLDIVMYNTLINALGKAGR+++ N LF QM+ 
Sbjct: 672  NVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFGQMRN 731

Query: 254  SGINPDVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRY 75
            SGINPDVVT+NTLIE+H+KAGRLKDAYKFLKMMLD+GC+PNHVTDTTLDFL +E+EK RY
Sbjct: 732  SGINPDVVTFNTLIEVHSKAGRLKDAYKFLKMMLDSGCSPNHVTDTTLDFLGREMEKARY 791

Query: 74   QKATM 60
            +KA++
Sbjct: 792  EKASI 796


>ref|XP_003539071.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Glycine max]
          Length = 768

 Score =  845 bits (2182), Expect = 0.0
 Identities = 458/786 (58%), Positives = 570/786 (72%), Gaps = 7/786 (0%)
 Frame = -2

Query: 2369 EVGNVLLVASIAKSLSEPGGAR-NLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC 2193
            ++G VL+ ASI  +LS    A  NL  +  ++ L++ L+L++L   +  AS KL FF W 
Sbjct: 6    QLGEVLVAASITNTLSHSHSATINLPPNL-ALGLTQPLILKILSNPAHHASHKLRFFEWS 64

Query: 2192 TLKPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSG 2013
              + ++  S   YS I  T+ R   F+ +IP LL+S++  G+ LD  +   +L +FI S 
Sbjct: 65   --RSHHCPSPAAYSVILRTLSR-EGFYSDIPSLLHSMTQAGVVLDPHSLNHLLRSFIISS 121

Query: 2012 KFVSALEILDHMEKDLGAVSCLKPD-LYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEK 1836
             F  AL++LD+++        L P  +Y+++L+AL+ KNQL +ALSIF K L     D K
Sbjct: 122  NFNLALQLLDYVQH-----LHLDPSPIYNSLLVALLEKNQLTLALSIFFKLL--GAVDSK 174

Query: 1835 GSGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLS 1656
                    ACN+LLV LRKA+M  EF  VF +LREK  F  D WGYN+CIH FGC GDL+
Sbjct: 175  S-----ITACNQLLVALRKADMRVEFEQVFQRLREKRGFSFDTWGYNVCIHAFGCWGDLA 229

Query: 1655 TSLTLFKEMKERGDPF-SPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLFTY 1479
            T   LFKEMK     F +PDLCTYNSLI  LC  GKV DA+ V+EEL GS+ H+PD FTY
Sbjct: 230  TCFALFKEMKGGNKGFVAPDLCTYNSLITALCRLGKVDDAITVYEELNGSA-HQPDRFTY 288

Query: 1478 RVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKMAE 1299
              L+Q CSK YR+ DA+ IF+QMQ +G RPDT  YNSLL+G  KA K+ EAC LFEKM +
Sbjct: 289  TNLIQACSKTYRMEDAIRIFNQMQSNGFRPDTLAYNSLLDGHFKATKVMEACQLFEKMVQ 348

Query: 1298 EDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREGQX 1119
            E GVR S WTYNILI GLFRNGRA AAYT+F DLKKKG  FVDGIT+SIV L LC+EGQ 
Sbjct: 349  E-GVRPSCWTYNILIHGLFRNGRAEAAYTMFCDLKKKGQ-FVDGITYSIVVLQLCKEGQL 406

Query: 1118 XXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWKAT 939
                       +RGF+VDLVTI+SLLIS +RHG  D T+RLMK+IR+G+L  +VL WKA 
Sbjct: 407  EEALQLVEEMESRGFVVDLVTITSLLISIHRHGRWDWTDRLMKHIREGDLALSVLKWKAG 466

Query: 938  MEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNIDPWSA 759
            ME SMK    K+KD +P+FPS+GDF DI++ +  A   T+   G E+     + ID WS+
Sbjct: 467  MEASMKNPPGKKKDYSPLFPSKGDFIDIINFMTCAQDTTNINDGEEN---SCNEIDEWSS 523

Query: 758  SPYLDLLANQLSPRS----LFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLACK 591
            SP++D LANQ+S       +F+ SRG+R+  KG DSFD+DMVNT+LSIFLAKGKLSLACK
Sbjct: 524  SPHMDKLANQVSSTGYSSQMFTPSRGQRVQEKGPDSFDVDMVNTFLSIFLAKGKLSLACK 583

Query: 590  LFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVIIQGLG 411
            LFEIF++ GVDPVSYT+NS+MSSFVKKGY  EAW +L  MGE+ CP DIATYN+IIQGLG
Sbjct: 584  LFEIFSDAGVDPVSYTYNSIMSSFVKKGYFAEAWAILTEMGEKFCPTDIATYNMIIQGLG 643

Query: 410  KMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGINPDVV 231
            KMGRADLASAVLD+L++QGGYLDIVMYNTLINALGKA RI+E N LF+QM++SGINPDVV
Sbjct: 644  KMGRADLASAVLDRLLRQGGYLDIVMYNTLINALGKASRIDEVNKLFEQMRSSGINPDVV 703

Query: 230  TYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKATMKPI 51
            TYNTLIE+H+KAGRLKDAYKFLKMMLDAGC+PNHVTDTTLD+L +EI+K RYQ+A++   
Sbjct: 704  TYNTLIEVHSKAGRLKDAYKFLKMMLDAGCSPNHVTDTTLDYLGREIDKLRYQRASILS- 762

Query: 50   NAEDPS 33
              +DPS
Sbjct: 763  EKDDPS 768


>gb|EPS65453.1| hypothetical protein M569_09325, partial [Genlisea aurea]
          Length = 770

 Score =  820 bits (2117), Expect = 0.0
 Identities = 422/777 (54%), Positives = 556/777 (71%), Gaps = 16/777 (2%)
 Frame = -2

Query: 2360 NVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWCTLKP 2181
            N+L+VASI K LS+ G  + LEK+A SI LSED+VLQ++  +SL  S+KL+FFRWC+ +P
Sbjct: 1    NILVVASITKILSKFGALQYLEKNADSIPLSEDVVLQIVHHRSLVISKKLEFFRWCSSRP 60

Query: 2180 NYIHSTRTYSQIFHTICRCP-QFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSGKFV 2004
            +Y H+   YS++   I R P Q H+ + +LL  +  DG+ LDS T K IL+  I + KF 
Sbjct: 61   DYNHTANAYSEMLRAIFRFPNQHHNNVIELLALMKRDGVILDSDTLKRILNGLIRAQKFD 120

Query: 2003 SALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKGSGT 1824
             AL++LD++EKD      L PD+YS VL+AL+RK+Q+ +AL +F K L +   D      
Sbjct: 121  YALDVLDYIEKDSVIAGNLSPDVYSPVLVALVRKDQISIALPVFFKLLHSQFEDY----I 176

Query: 1823 PDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLSTSLT 1644
            PDA ACNELL GL+K  M +EF  VF KLRE   +P DRWGYNICIH FGC GDLST+L+
Sbjct: 177  PDAFACNELLAGLKKKKMKNEFREVFAKLRETARYPSDRWGYNICIHSFGCWGDLSTALS 236

Query: 1643 LFKEMKERGDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLFTYRVLVQ 1464
            LFKEMK+RG    PDLCTYNSLIQV C  G++ DAL +W+ELK SSG+EPD FTYR+L+Q
Sbjct: 237  LFKEMKDRGGSVYPDLCTYNSLIQVFCSLGRLNDALVIWKELKNSSGYEPDRFTYRILIQ 296

Query: 1463 GCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKMAEEDGVR 1284
            GCSK+YRI DAM IF++MQ +G+R +T  YNSL++GL K++KLT AC+ FE+M  ++ VR
Sbjct: 297  GCSKSYRINDAMTIFNEMQYNGIRAETVTYNSLMDGLFKSRKLTTACSFFERMV-DNRVR 355

Query: 1283 ASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREGQXXXXXX 1104
            AS  TYNI+IDGL+RNGR  AAY LFSDLK+KG+ FVD I+FSIV LHLC+E +      
Sbjct: 356  ASCSTYNIIIDGLYRNGRPEAAYALFSDLKRKGNQFVDVISFSIVVLHLCKEERLDEALR 415

Query: 1103 XXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWKATMEDSM 924
                  +RGF+VDLVT++SLL++ YR G SD TE+LMK++R+GNL+P+V  WK+ +E S+
Sbjct: 416  LVEEMESRGFVVDLVTVTSLLMALYRAGHSDFTEKLMKHVRNGNLIPSVFKWKSALESSL 475

Query: 923  KAKQSKEKDSTPMFPSRGDFADILSVI-NLANGKTDSGLGTEDIEPESDNIDPWSASPYL 747
             + Q KE+D TPMFP      +IL    ++A+ +++ G      E E +  D WS+SPY+
Sbjct: 476  MSPQGKERDFTPMFPEVRSIDEILEATKSVASTRSEDGTVKNGDEGE-ERADEWSSSPYM 534

Query: 746  DLLANQLS-----PRSLFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLACKLFE 582
            D LA  LS         F++ R  R V +G +SFD+DM NTYLS+    GKLS ACK+ E
Sbjct: 535  DELARNLSGDHRYSSHFFTMFRAVRAVGRGEESFDVDMANTYLSLLSGTGKLSSACKVLE 594

Query: 581  IFTNMGVDPVS---------YTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNV 429
            + +  GV P S         Y +NS+ SSF+KKGY+ EAWG+L    +   PAD+ATY++
Sbjct: 595  LLSRGGVGPNSESSLANVFCYGYNSLTSSFIKKGYVKEAWGILLRHFD-AGPADVATYSL 653

Query: 428  IIQGLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSG 249
            I++GLGKMGRADLA +V DKL + GGYLD VMYNTLI+ LGKAGR+E+A  +F +M+ SG
Sbjct: 654  IVRGLGKMGRADLARSVRDKLTRDGGYLDAVMYNTLIHTLGKAGRLEDARNVFGEMRASG 713

Query: 248  INPDVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHR 78
            I PDVVTYNTLIE+H+KAG +++A ++LK MLD GCAPNHVTDTTLD+LEKEI K +
Sbjct: 714  IIPDVVTYNTLIEVHSKAGDVEEANRWLKTMLDNGCAPNHVTDTTLDYLEKEIRKQK 770


>ref|XP_003621545.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|87241489|gb|ABD33347.1| Pentatricopeptide repeat
            [Medicago truncatula] gi|355496560|gb|AES77763.1|
            Pentatricopeptide repeat-containing protein [Medicago
            truncatula]
          Length = 791

 Score =  794 bits (2050), Expect = 0.0
 Identities = 429/796 (53%), Positives = 552/796 (69%), Gaps = 17/796 (2%)
 Frame = -2

Query: 2369 EVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWCT 2190
            +V  +L VASI K+LS     +N  +     +L++ L+ ++L   SL  S KL+FF    
Sbjct: 14   QVSELLTVASITKTLS-----KNPTQTPPQTNLTQTLIHKILSNPSLHISHKLNFFN--- 65

Query: 2189 LKPNYIHSTRTYSQIFHTICRCPQ----FHDEIPDLLNSVSSDGLALDSATFKLILDAFI 2022
               N  HS+ +YS IF+ +C         H  +P LL+S+  +G+  DS +F  +L+  I
Sbjct: 66   SNNNIHHSSLSYSLIFNNLCNPKTPFSLLHQHLPHLLHSMKQNGIVFDSNSFNTLLNFLI 125

Query: 2021 HSG--------KFVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLK 1866
              G         F   ++ILD+++          P +Y+++LIA I+ NQ+ +ALSIF  
Sbjct: 126  KFGVSHNNNSKNFHFVIDILDYIQTQNLHPVDTTPFIYNSLLIASIKNNQIPLALSIFNN 185

Query: 1865 FLDNSRTDEKGSGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICI 1686
             +     D     +    + N LL  LRKA M  EF  VF++LRE+  F  D WGYNICI
Sbjct: 186  IMTLGDDDCLNLDSVIVGSSNYLLSVLRKARMKKEFENVFNRLRERKSFDFDLWGYNICI 245

Query: 1685 HGFGCKGDLSTSLTLFKEMKERGDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSS 1506
            H FG  GDL TS+ LF EMKE  + F PD+CTYNS++ VLC  GK+ DAL VW+ELKG  
Sbjct: 246  HAFGSWGDLVTSMKLFNEMKEDKNLFGPDMCTYNSVLSVLCKVGKINDALIVWDELKGC- 304

Query: 1505 GHEPDLFTYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEA 1326
            G+EPD FTY +LV+GC + YR+  A+ IF++M+ +G RP   +YN +L+GL KA K+ E 
Sbjct: 305  GYEPDEFTYTILVRGCCRTYRMDVALRIFNEMKDNGFRPGVLVYNCVLDGLFKAAKVNEG 364

Query: 1325 CNLFEKMAEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVA 1146
            C +FEKMA+E GV+AS  TYNILI GL +NGR+ A Y LF DLKKKG  FVDGIT+SIV 
Sbjct: 365  CQMFEKMAQE-GVKASCSTYNILIHGLIKNGRSEAGYMLFCDLKKKGQ-FVDGITYSIVV 422

Query: 1145 LHLCREGQXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLV 966
            L LC+EG             ARGF VDLVTI+SLLI  +++G  + T+RL+K++R+G+L+
Sbjct: 423  LQLCKEGLLEEALELVEEMEARGFSVDLVTITSLLIGIHKYGRWEWTDRLIKHVREGDLL 482

Query: 965  PNVLSWKATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPE 786
            P VL WKA ME S+    SKEKD + MFPS+G F +I+S I  +  + D      ++E  
Sbjct: 483  PGVLRWKAGMEASINNFHSKEKDYSSMFPSKGGFCEIMSFITRSRDEDD------EVETS 536

Query: 785  SDNIDPWSASPYLDLLANQL-----SPRSLFSLSRGRRIVAKGIDSFDIDMVNTYLSIFL 621
            S+ ID WS+SP++D LA ++     +   +F+  RG+R+  KG DSFDIDMVNT+LSIFL
Sbjct: 537  SEQIDEWSSSPHMDKLAKRVVNSTGNASRMFTPDRGQRVQQKGSDSFDIDMVNTFLSIFL 596

Query: 620  AKGKLSLACKLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIA 441
            +KGKLSLACKLFEIFT+ GVDPVSYT+NS+MSSFVKKGY NEAW +L  MGE+LCP DIA
Sbjct: 597  SKGKLSLACKLFEIFTDAGVDPVSYTYNSIMSSFVKKGYFNEAWAILSEMGEKLCPTDIA 656

Query: 440  TYNVIIQGLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQM 261
            TYN+IIQGLGKMGRADLASAVLD L+KQGGYLDIVMYNTLINALGKAGRI+E N  F+QM
Sbjct: 657  TYNMIIQGLGKMGRADLASAVLDGLLKQGGYLDIVMYNTLINALGKAGRIDEVNKFFEQM 716

Query: 260  QTSGINPDVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKH 81
            ++SGINPDVVTYNTLIEIH+KAGRLKDAYKFLKMM+DAGC PNHVTDTTLD+L +EI+K 
Sbjct: 717  KSSGINPDVVTYNTLIEIHSKAGRLKDAYKFLKMMIDAGCTPNHVTDTTLDYLVREIDKL 776

Query: 80   RYQKATMKPINAEDPS 33
            RYQKA++     +DPS
Sbjct: 777  RYQKASILS-KKDDPS 791


>ref|XP_004491942.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like
            [Cicer arietinum]
          Length = 793

 Score =  793 bits (2049), Expect = 0.0
 Identities = 432/788 (54%), Positives = 553/788 (70%), Gaps = 18/788 (2%)
 Frame = -2

Query: 2369 EVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWCT 2190
            +VG +L VASI  +LS+     N    +    +++ L+ ++L   SL  S KL+FF    
Sbjct: 14   QVGELLTVASITNTLSKSPTPPNPTLFSPKF-ITQTLIHKILSNPSLHISHKLNFFNSFN 72

Query: 2189 LKPNYIHSTRTYSQIFHTICR----CPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFI 2022
                 IH++ TYS IF T+C         H  +P LL+S+  + +  DS +FK +L+  I
Sbjct: 73   SHNINIHNSITYSLIFKTLCNPTTPISLLHQHLPQLLHSMKQNDVVFDSYSFKNLLNFLI 132

Query: 2021 ---HSGKFVS---ALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFL 1860
               H+ K  +    ++ILD+++      S   P +Y+++LIA I+ NQL +ALSIF   +
Sbjct: 133  NLSHNNKKNNLHFVIDILDYIQSQNLQPSGTTPFIYNSLLIASIKNNQLNLALSIFKNVI 192

Query: 1859 ---DNSRTDEKGSGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNIC 1689
               D+S  D    G+      N LL  LRKA M  EF+ VF+ LRE+  F  D WGYNIC
Sbjct: 193  SIDDSSNFDHVIVGSS-----NYLLSALRKAQMKKEFINVFNTLRERKSFDFDLWGYNIC 247

Query: 1688 IHGFGCKGDLSTSLTLFKEMKERGDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGS 1509
            IH FG  GDL TS+ LF EMKE  + F PD+CTYNS++ +LC  GKV DAL VWEELKG 
Sbjct: 248  IHAFGSWGDLVTSMMLFNEMKEDKNLFGPDMCTYNSVLSILCKVGKVNDALVVWEELKGC 307

Query: 1508 SGHEPDLFTYRVLVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTE 1329
             G+EPD FTY +LV+G S+  R+ +A+ IF++M+ +G RP   +YN +L+GL KA K+ E
Sbjct: 308  -GYEPDEFTYTILVRGFSRTCRMDEAIRIFNEMKDNGFRPGILVYNCVLDGLFKAAKVNE 366

Query: 1328 ACNLFEKMAEEDGVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIV 1149
            AC +FEKMA+E GV+AS WTYNILI GL +NGR+ A YTLF DLKKKG  FVD IT+SIV
Sbjct: 367  ACQMFEKMAQE-GVKASCWTYNILIHGLIKNGRSEAGYTLFCDLKKKGQ-FVDEITYSIV 424

Query: 1148 ALHLCREGQXXXXXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNL 969
             L LC+EGQ            ARGF VDLVTI+SLLI  +++G  D T+RL+K++R+G+L
Sbjct: 425  VLQLCKEGQLEEALELVEEMEARGFSVDLVTITSLLIGIHKYGRWDWTDRLIKHVREGDL 484

Query: 968  VPNVLSWKATMEDSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEP 789
            +P VL WKA ME S+    S +KD +PMF S+GDF++I+S I  A  +       +++E 
Sbjct: 485  LPGVLRWKAGMEASINNLPSGKKDYSPMFSSKGDFSEIMSFITRARDE-------DEVET 537

Query: 788  ESDNIDPWSASPYLDLLANQL-----SPRSLFSLSRGRRIVAKGIDSFDIDMVNTYLSIF 624
             S+ ID WS+SP++D LA  +     +   LF+  RG+R+  KG DSFD+DMVNT+LSIF
Sbjct: 538  LSEQIDEWSSSPHMDKLAKHVVRSTGNASRLFTPDRGQRVQQKGPDSFDVDMVNTFLSIF 597

Query: 623  LAKGKLSLACKLFEIFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADI 444
            LAKGKLSLACKLFEIFT+ GVDPVSYT+NS+MSSFVKKGY NEAW +L  MGE+ CP DI
Sbjct: 598  LAKGKLSLACKLFEIFTDAGVDPVSYTYNSIMSSFVKKGYFNEAWAILTEMGEKFCPTDI 657

Query: 443  ATYNVIIQGLGKMGRADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQ 264
            ATYN+IIQGLGKMGRADLASAVLD L+KQGGYLDIVMYNTLINALGKAGRI+E +  F Q
Sbjct: 658  ATYNMIIQGLGKMGRADLASAVLDGLLKQGGYLDIVMYNTLINALGKAGRIDEVSKFFDQ 717

Query: 263  MQTSGINPDVVTYNTLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEK 84
            M+ SGI+PDVVTYNTLIEIH+KAGR+KDAYKFLKMMLDAGC PNHVTDTTLD+L +EI+K
Sbjct: 718  MRNSGISPDVVTYNTLIEIHSKAGRVKDAYKFLKMMLDAGCTPNHVTDTTLDYLVREIDK 777

Query: 83   HRYQKATM 60
             RYQKA++
Sbjct: 778  LRYQKASI 785


>ref|XP_006827884.1| hypothetical protein AMTR_s00008p00117710 [Amborella trichopoda]
            gi|548832519|gb|ERM95300.1| hypothetical protein
            AMTR_s00008p00117710 [Amborella trichopoda]
          Length = 788

 Score =  746 bits (1925), Expect = 0.0
 Identities = 402/777 (51%), Positives = 542/777 (69%), Gaps = 3/777 (0%)
 Frame = -2

Query: 2372 SEVGNVLLVASIAKSLSEPGGARNLEKDAGSIDLSEDLVLQVLCRKSLDASQKLDFFRWC 2193
            S +  +LLV SI K+L   GG   L+K    I LS  LVLQVL +K L+  +K++FFRW 
Sbjct: 33   SHIPTLLLVVSICKALIN-GGTTELQKLP--IVLSHSLVLQVL-KKDLNPHRKMEFFRWV 88

Query: 2192 TLKPNYIHSTRTYSQIFHTICRCPQFHDEIPDLLNSVSSDGLALDSATFKLILDAFIHSG 2013
            + +  Y  S   YS +   + R     D +  L++S+ ++ + LDS +FKL+L++F+ SG
Sbjct: 89   SSQTGYKPSNDAYSLMVQIVSRNKDI-DSLRTLMHSMKTEKMVLDSRSFKLMLNSFVSSG 147

Query: 2012 KFVSALEILDHMEKDLGAVSCLKPDLYSTVLIALIRKNQLRVALSIFLKFLDNSRTDEKG 1833
             F  ALE+L  ME ++G  S L P +YS+VL+ALI+K ++ +AL++F   L        G
Sbjct: 148  NFDQALELLQDME-EIG--SSLSPQIYSSVLLALIKKERVDLALTLFHSVLKG------G 198

Query: 1832 SGTPDAIACNELLVGLRKANMNDEFMLVFHKLREKGFFPLDRWGYNICIHGFGCKGDLST 1653
                 ++ACN+L+V LRK  M  EF  V  +LR  G+   D WGYNICIH FG  GDL  
Sbjct: 199  HVLLSSVACNQLMVFLRKRGMVVEFKRVISELRNLGY-QFDIWGYNICIHAFGSFGDLGF 257

Query: 1652 SLTLFKEMKERGDPFSPDLCTYNSLIQVLCLAGKVTDALCVWEELKGSSGHEPDLFTYRV 1473
            SL LF+EMKE+   ++PDLCTYN+L+++LC + ++ DAL + EELK +SGH+PD +TYR+
Sbjct: 258  SLELFREMKEKS--WNPDLCTYNTLLRILCNSSRLNDALAIAEELK-NSGHDPDGYTYRI 314

Query: 1472 LVQGCSKAYRIGDAMNIFSQMQQDGVRPDTALYNSLLNGLMKAQKLTEACNLFEKMAEED 1293
            L+ GC KAYRI +A+ +F +M+ +    DT +YN +++GL KA K++EACN FE M +E 
Sbjct: 315  LIHGCCKAYRINEALKLFREMEVNTRNTDTVVYNCMMDGLFKAGKVSEACNFFENMVQE- 373

Query: 1292 GVRASSWTYNILIDGLFRNGRALAAYTLFSDLKKKGSNFVDGITFSIVALHLCREGQXXX 1113
            G+R + W+YNILIDGLFRNGRA AAYTLF DLKKKG  FVD IT+SIV  +LC++ +   
Sbjct: 374  GIRPTCWSYNILIDGLFRNGRAEAAYTLFCDLKKKGQ-FVDSITYSIVIWYLCKDDKTEA 432

Query: 1112 XXXXXXXXXARGFIVDLVTISSLLISFYRHGMSDQTERLMKYIRDGNLVPNVLSWKATME 933
                     ARG +VDL  I++LL+  +R G  D  E+LMK++RD +LVP+++ W   ME
Sbjct: 433  SLELVEEMEARGLVVDLTAITTLLMGLHRTGRWDWAEKLMKHVRDSSLVPSLIRWTTEME 492

Query: 932  DSMKAKQSKEKDSTPMFPSRGDFADILSVINLANGKTDSGLGTEDIEPESDNIDPWSASP 753
              ++A Q + KD  P+F   G   +I+++I+  +G  D       I  E ++ D WS S 
Sbjct: 493  SCLRAPQDRAKDFEPIFQFEGGEREIVNLISYDSGSEDK----TQIRDEKES-DIWSPSV 547

Query: 752  YLDLLANQ---LSPRSLFSLSRGRRIVAKGIDSFDIDMVNTYLSIFLAKGKLSLACKLFE 582
            +LD L ++   L     FSL RG R+  KG +SFD DMVNTY+S+FLAKGKLS+ACKLFE
Sbjct: 548  HLDRLTDKPSALHGTRQFSLYRGVRVHGKGFESFDTDMVNTYMSVFLAKGKLSIACKLFE 607

Query: 581  IFTNMGVDPVSYTFNSMMSSFVKKGYLNEAWGVLQAMGEQLCPADIATYNVIIQGLGKMG 402
            IF  MG  PVSYT+NS++SSFVK+GY NEAWGVL  M E  CPADIATYN +IQGLGKMG
Sbjct: 608  IFNAMGHKPVSYTYNSLVSSFVKRGYFNEAWGVLCEMREN-CPADIATYNAVIQGLGKMG 666

Query: 401  RADLASAVLDKLMKQGGYLDIVMYNTLINALGKAGRIEEANTLFQQMQTSGINPDVVTYN 222
            R DL  AVLD+L++ GGYLD+ MYNTLI+ LG+ GR++EAN LF+QM++SGINPDVVTYN
Sbjct: 667  RVDLVCAVLDQLLQTGGYLDVFMYNTLIHVLGRGGRLDEANKLFEQMKSSGINPDVVTYN 726

Query: 221  TLIEIHNKAGRLKDAYKFLKMMLDAGCAPNHVTDTTLDFLEKEIEKHRYQKATMKPI 51
            TLIE+H+KAGR+K+AY++LK MLDAGC PNH+TDT LDFLE+EIEK RY+KA+MK +
Sbjct: 727  TLIEVHSKAGRVKEAYEYLKAMLDAGCPPNHITDTILDFLEREIEKLRYEKASMKRV 783

