BLASTX nr result

ID: Catharanthus22_contig00011419 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00011419
         (3290 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006351831.1| PREDICTED: pentatricopeptide repeat-containi...   637   e-179
ref|XP_004230611.1| PREDICTED: pentatricopeptide repeat-containi...   630   e-178
gb|EOY30252.1| Pentatricopeptide repeat superfamily protein, put...   522   e-145
ref|XP_002278014.2| PREDICTED: pentatricopeptide repeat-containi...   522   e-145
ref|XP_006381622.1| pentatricopeptide repeat-containing family p...   519   e-144
ref|XP_002326124.1| predicted protein [Populus trichocarpa]           519   e-144
ref|XP_002512275.1| pentatricopeptide repeat-containing protein,...   514   e-143
ref|XP_004141071.1| PREDICTED: pentatricopeptide repeat-containi...   508   e-141
ref|XP_004157939.1| PREDICTED: pentatricopeptide repeat-containi...   506   e-140
gb|EXB38956.1| hypothetical protein L484_027391 [Morus notabilis]     506   e-140
ref|XP_006474728.1| PREDICTED: pentatricopeptide repeat-containi...   503   e-139
ref|XP_004301429.1| PREDICTED: pentatricopeptide repeat-containi...   503   e-139
ref|XP_006452806.1| hypothetical protein CICLE_v10007804mg [Citr...   499   e-138
ref|XP_003550612.1| PREDICTED: pentatricopeptide repeat-containi...   470   e-129
ref|XP_002885810.1| pentatricopeptide repeat-containing protein ...   464   e-128
dbj|BAH19478.1| AT2G06000 [Arabidopsis thaliana]                      462   e-127
ref|NP_178657.1| pentatricopeptide repeat-containing protein [Ar...   462   e-127
ref|XP_006297396.1| hypothetical protein CARUB_v10013421mg [Caps...   460   e-126
ref|XP_006577946.1| PREDICTED: pentatricopeptide repeat-containi...   458   e-126
ref|XP_006396122.1| hypothetical protein EUTSA_v10002477mg [Eutr...   447   e-122

>ref|XP_006351831.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            isoform X1 [Solanum tuberosum]
            gi|565370447|ref|XP_006351832.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X2 [Solanum tuberosum]
          Length = 550

 Score =  637 bits (1643), Expect = e-179
 Identities = 318/554 (57%), Positives = 408/554 (73%)
 Frame = +2

Query: 236  MPLWIKRAPRNITFKSIARCFHGISNIESSPPTHNKIESLWFIKFVCTLCIRNAENLAIF 415
            MPLW++RA   +    IAR FHG+++ +S P      E++WF K VC LC  ++++L +F
Sbjct: 1    MPLWVQRASNILL---IAR-FHGLTSSKSIPSYGPGPEAVWFTKVVCLLCFHHSQSLDVF 56

Query: 416  GSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQ 595
            GSDYFR+NL P IAF VI HIN N   PRLAF F Q TR+NLNL+H I +FNLLLRSL Q
Sbjct: 57   GSDYFRQNLDPHIAFTVIHHINTNLNNPRLAFRFLQCTRINLNLVHCIGSFNLLLRSLSQ 116

Query: 596  MGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEII 775
            MG  D A LV ++M ADG  L+  +LE +V ++A+AGKF I +EILISQA+L   +  I+
Sbjct: 117  MGFHDSAMLVFKFMKADGYLLENSILESVVLALANAGKFEIAKEILISQAELGREEGRIV 176

Query: 776  NSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFE 955
              FVHN  LSLL+K++RVDEA  FF+ H+L+      DTC+FN VI GLC+ G +D+AFE
Sbjct: 177  RPFVHNSLLSLLMKRSRVDEAVDFFKHHILRSERLFPDTCTFNTVIRGLCRVGGVDKAFE 236

Query: 956  VFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGF 1135
             FNDMGSFGCF D  TYN++INGLC +G  +RA  LL  ++ Q GLSPDV TYT++I+G+
Sbjct: 237  FFNDMGSFGCFPDTVTYNTLINGLCSVGQVNRARGLLGNLELQDGLSPDVVTYTSVIAGY 296

Query: 1136 FKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDV 1315
             KLGR  EA++L D+MT  GI PN+ TFN+LI+GFG+ GD+ SA++++  M   G  PDV
Sbjct: 297  CKLGRMDEAINLMDEMTTYGISPNLVTFNILINGFGKIGDMFSAIQMYGRMCAVGYPPDV 356

Query: 1316 ITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQ 1495
            +TFT+L+ GYC+ GE+DQGLKLWDEMN R L PN YTFSI+I++L K NRLNEAR+LL Q
Sbjct: 357  VTFTSLIDGYCRTGELDQGLKLWDEMNTRNLSPNLYTFSILISALSKENRLNEARELLRQ 416

Query: 1496 LRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKG 1675
            L+ R+DI+PQ F+YNPV+DGFCKAGNL  AN I AEME + C  DK TFTILI+GHCMKG
Sbjct: 417  LKSRDDIVPQPFVYNPVLDGFCKAGNLSKANVIAAEMESRGCCHDKITFTILILGHCMKG 476

Query: 1676 RMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGG 1855
            RM +A+ +F+KM  +GC PD ITV+CL SCLLKAGMV EAY+++    + L    SS+  
Sbjct: 477  RMLEAMAIFDKMLSLGCVPDDITVSCLTSCLLKAGMVKEAYKVRLTPSKDLNPDLSSSKQ 536

Query: 1856 PKPFRASMDITVAV 1897
              PFR S+DI VAV
Sbjct: 537  SVPFRTSLDIPVAV 550


>ref|XP_004230611.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Solanum lycopersicum]
          Length = 550

 Score =  630 bits (1626), Expect = e-178
 Identities = 319/554 (57%), Positives = 407/554 (73%)
 Frame = +2

Query: 236  MPLWIKRAPRNITFKSIARCFHGISNIESSPPTHNKIESLWFIKFVCTLCIRNAENLAIF 415
            MPLW++RA  NI+   IAR FHG+++ +S P      E++WF K VC LC  ++++L +F
Sbjct: 1    MPLWVQRAS-NISL--IAR-FHGLTSSKSIPSYGPGPEAVWFTKVVCLLCFHHSQSLDVF 56

Query: 416  GSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQ 595
            GSDYFR+NL P IAF VI HIN N   PRLAF F Q TR+NLNLIH I +FNLLLRSL Q
Sbjct: 57   GSDYFRQNLDPHIAFTVIHHINTNLNNPRLAFRFLQCTRINLNLIHCIGSFNLLLRSLSQ 116

Query: 596  MGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEII 775
            MG  D A LV +YM ADG  L+  +LE +V ++A+AGKF I +EILISQA+L   +  I+
Sbjct: 117  MGFHDSAMLVFKYMKADGYLLENSILESVVLALANAGKFEIAKEILISQAELGREEGSIV 176

Query: 776  NSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFE 955
              FVHN  LSLL+K++RVDEA  FF+ H+L+      DTC+FN VI GLC+ G +D+AFE
Sbjct: 177  RPFVHNSLLSLLMKRSRVDEAVDFFKHHILRSERLFPDTCTFNTVIRGLCRVGGVDKAFE 236

Query: 956  VFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGF 1135
             FNDMGSFGC  D  TYN++INGLC +G  +RA  LL  +Q Q GLSPDV TYT++ISG+
Sbjct: 237  FFNDMGSFGCSPDTVTYNTLINGLCAVGQVNRAQGLLGNLQLQDGLSPDVVTYTSLISGY 296

Query: 1136 FKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDV 1315
             KL R  EA++L D+M   GI PN+ TFN+LI+GFG+ GD+ SA+K++  M   G  PDV
Sbjct: 297  CKLSRMDEAINLMDEMITYGISPNLVTFNILINGFGKIGDMFSAIKMYGKMCAVGYPPDV 356

Query: 1316 ITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQ 1495
            +TFT+L+ GYC+ GE+DQGLKLWD+MN+R L PN YTFS++I++L K NRLNEAR+LL Q
Sbjct: 357  VTFTSLIDGYCRTGELDQGLKLWDDMNSRNLSPNLYTFSVLISALSKENRLNEARELLRQ 416

Query: 1496 LRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKG 1675
            L+ R+DI+PQ F+YNPV+DGFCKAGNL  AN I AEME K C  DK TFTILI+GHCMKG
Sbjct: 417  LKSRDDIVPQPFVYNPVLDGFCKAGNLSEANVIAAEMESKGCCHDKITFTILILGHCMKG 476

Query: 1676 RMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGG 1855
            RM +A+ +F+KM  +GC PD IT++CL SCLLKAGMV EAY+++    + L    S +  
Sbjct: 477  RMLEALAIFDKMLSLGCVPDDITISCLTSCLLKAGMVKEAYKVRLIPSKDLNPDLSPSKL 536

Query: 1856 PKPFRASMDITVAV 1897
              PFR S+DI VAV
Sbjct: 537  FIPFRTSLDIPVAV 550


>gb|EOY30252.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao]
          Length = 592

 Score =  522 bits (1345), Expect = e-145
 Identities = 265/513 (51%), Positives = 356/513 (69%)
 Frame = +2

Query: 356  WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535
            WF+K VCTL + + + L      Y  KNL P I F V++ +NN    P L  +F +F+R+
Sbjct: 91   WFVKVVCTLFVYS-QPLDDSCLSYLSKNLTPLIEFEVVKWLNN----PALGLKFLEFSRV 145

Query: 536  NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715
            N N+ HS  T+NLL+RS C MGL D A LV +YM  DG   D  +L F++SS   AG+F 
Sbjct: 146  NFNIAHSFWTYNLLMRSFCHMGLHDSAKLVFDYMRIDGHLPDTTILGFMISSFGRAGEFG 205

Query: 716  IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895
            + +++L          E +I+ F  N  L+++VKQN+++EA   ++++L    +F  D  
Sbjct: 206  MAKKLLADVQS----DEVVISIFALNNLLNMMVKQNKLEEAVSLYKENLGS--NFYPDAW 259

Query: 896  SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075
            +FNI+I GLC+ G++D+AFE+FNDMGSFGCF DI TYN++INGLCK+ + DR  +LL ++
Sbjct: 260  TFNILIRGLCRVGKVDQAFELFNDMGSFGCFPDIVTYNTIINGLCKVNEVDRGHKLLNQV 319

Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255
            QS+   SPDV TYT++ISG+ KLG+  EA  L+ +M   G  P V TFNVLI GFG+ GD
Sbjct: 320  QSRDDCSPDVVTYTSVISGYCKLGKMDEASALFHEMISSGTVPTVVTFNVLIDGFGKVGD 379

Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435
            + SA  ++E M+ FGC  DV+TFT+L+ GYC+IG+++Q L+LW+ M  R L PN YTF+I
Sbjct: 380  MVSAKSMYEQMASFGCIADVVTFTSLIDGYCRIGDVNQSLQLWNTMKGRDLSPNVYTFAI 439

Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615
             IN+LCK NRL+EAR  L +L+ RN I+P+ FI+NPVIDGFCKAGNLD AN IVAEMEEK
Sbjct: 440  TINALCKENRLHEARGFLRELQCRN-IVPKPFIFNPVIDGFCKAGNLDEANLIVAEMEEK 498

Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795
            +C+PDK TFTILIIGHCMKGRM +AI +FNKM  VGC PD +TVN L+SCLLKAGM +EA
Sbjct: 499  QCHPDKVTFTILIIGHCMKGRMFEAISIFNKMLSVGCTPDDVTVNSLISCLLKAGMPSEA 558

Query: 1796 YRIKKDVLQGLQSGFSSAGGPKPFRASMDITVA 1894
             RI K   + ++ G S      P R +  + VA
Sbjct: 559  SRITKMASEDMKLGSSLLENNSPLRINRGVPVA 591


>ref|XP_002278014.2| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Vitis vinifera]
          Length = 641

 Score =  522 bits (1345), Expect = e-145
 Identities = 266/513 (51%), Positives = 354/513 (69%)
 Frame = +2

Query: 356  WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535
            W +K +CTLC+R     A    DYF K L P IAF V++ +NN    P LA +FFQ +R+
Sbjct: 76   WIVKVICTLCVRTHSLDACL--DYFSKTLTPSIAFEVVRGLNN----PELALKFFQLSRV 129

Query: 536  NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715
            NLNL HS  T++ LLRSL +MG  + A  V + MN DG S D  +L FLVSS   AGKF+
Sbjct: 130  NLNLCHSFRTYSFLLRSLSEMGFHESAKAVYDCMNIDGHSPDASVLGFLVSSATDAGKFN 189

Query: 716  IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895
            I R   +   +  L         V+NK L+ LV+ N+VDEA  FFR+ +     F  D+C
Sbjct: 190  IART-WVDGVEFSL--------VVYNKLLNQLVRGNQVDEAVCFFREQMGLHGPF--DSC 238

Query: 896  SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075
            SFNI+I GLC+ G++D+AFE+FN+M  FGC  D+ TYN++ING C++ + DR  +LL+E+
Sbjct: 239  SFNILIRGLCRIGKVDKAFELFNEMRGFGCSPDVITYNTLINGFCRVNEVDRGHDLLKEL 298

Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255
             S+  LSPDV TYT+IISG+ KLG+  +A  L+++M   GI+PN +TFN+LI+GFG+ GD
Sbjct: 299  LSKNDLSPDVVTYTSIISGYCKLGKMEKASILFNNMISSGIKPNAFTFNILINGFGKVGD 358

Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435
            + SA  ++E M   GC PD+ITFT+L+ G+C+ G++++ LKLW E+NAR L PN YTF+I
Sbjct: 359  MVSAENMYEEMLLLGCPPDIITFTSLIDGHCRTGKVERSLKLWHELNARNLSPNEYTFAI 418

Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615
            + N+LCK NRL+EAR  L  L+WR+ I+ Q F+YNPVIDGFCKAGN+D AN I+AEMEEK
Sbjct: 419  LTNALCKENRLHEARGFLRDLKWRH-IVAQPFMYNPVIDGFCKAGNVDEANVILAEMEEK 477

Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795
            +C PDK T+TILIIGHCMKGR+ +AI +FN+M   GCAPD IT+  L+SCLLKAGM  EA
Sbjct: 478  RCKPDKITYTILIIGHCMKGRLSEAISIFNRMLGTGCAPDSITMTSLISCLLKAGMPNEA 537

Query: 1796 YRIKKDVLQGLQSGFSSAGGPKPFRASMDITVA 1894
            YRI +   +    G  S     P R + DI VA
Sbjct: 538  YRIMQIASEDFNLGLKSLKRNVPLRTNTDIPVA 570


>ref|XP_006381622.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550336330|gb|ERP59419.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 511

 Score =  519 bits (1336), Expect = e-144
 Identities = 262/491 (53%), Positives = 348/491 (70%)
 Frame = +2

Query: 425  YFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGL 604
            Y  + L P IAF VI+  NN    P++ F+F +F+RLNLN+ H  +T+NLL+RSLCQMG 
Sbjct: 33   YPDRQLTPLIAFEVIKRFNN----PKVGFKFLEFSRLNLNVNHCYSTYNLLMRSLCQMGH 88

Query: 605  LDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSF 784
             DL N+V +YM +DG   D  LL FLV+ +A A  F + +++L ++ Q   GKE  INSF
Sbjct: 89   HDLVNIVFDYMGSDGHLPDSKLLGFLVTWMAQASDFDMVKKLL-AEVQ---GKEVRINSF 144

Query: 785  VHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFN 964
            V+N  LS+LVKQN+V EA   F+++L        DT +FNI+I GLC+ G +DRAFEVF 
Sbjct: 145  VYNNLLSVLVKQNQVHEAIYLFKEYLAMQSP---DTWTFNILIRGLCRVGGVDRAFEVFK 201

Query: 965  DMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKL 1144
            DM SFGC  D+ TYN++INGLCK  +  R  EL +EIQS+   SPD+ TYT+IISGF K 
Sbjct: 202  DMESFGCLPDVVTYNTLINGLCKANEVQRGCELFKEIQSRSDCSPDIVTYTSIISGFCKS 261

Query: 1145 GRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITF 1324
            G+  EA +L+++M   GI+PNV TFNVLI GFG+ G++  A  ++  M+ F C+ DV+TF
Sbjct: 262  GKMKEASNLFEEMMRSGIQPNVITFNVLIDGFGKIGNIAEAEAMYRKMAYFDCSADVVTF 321

Query: 1325 TNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRW 1504
            T+L+ GYC+ G+++ GLK W+ M  R + P  YT++++IN+LCK NRLNEARD L Q++ 
Sbjct: 322  TSLIDGYCRAGQVNHGLKFWNVMKTRNVSPTVYTYAVLINALCKENRLNEARDFLGQIK- 380

Query: 1505 RNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRMH 1684
             + I+P+ F+YNPVIDGFCKAGN+D  N I+ EMEEK+C+PDK TFTILIIGHC+KGRM 
Sbjct: 381  NSSIIPKPFMYNPVIDGFCKAGNVDEGNVILKEMEEKRCDPDKVTFTILIIGHCVKGRMF 440

Query: 1685 DAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGGPKP 1864
            +AI++FN+M    CAPD ITVN L+SCLLKAGM  EAYRI+K  L+    G SS     P
Sbjct: 441  EAINIFNRMLATRCAPDNITVNSLISCLLKAGMPNEAYRIRKMALEDRNLGLSSFEKAIP 500

Query: 1865 FRASMDITVAV 1897
             R + DI VAV
Sbjct: 501  LRTNTDIPVAV 511


>ref|XP_002326124.1| predicted protein [Populus trichocarpa]
          Length = 512

 Score =  519 bits (1336), Expect = e-144
 Identities = 262/491 (53%), Positives = 349/491 (71%)
 Frame = +2

Query: 425  YFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGL 604
            Y  + L P IAF VI+  NN    P++ F+F +F+RLNLN+ H  +T+NLL+RSLCQMG 
Sbjct: 33   YPDRQLTPLIAFEVIKRFNN----PKVGFKFLEFSRLNLNVNHCYSTYNLLMRSLCQMGH 88

Query: 605  LDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSF 784
             DL N+V +YM +DG   D  LL FLV+ +A A  F + +++L ++ Q   GKE  INSF
Sbjct: 89   HDLVNIVFDYMGSDGHLPDSKLLGFLVTWMAQASDFDMVKKLL-AEVQ---GKEVRINSF 144

Query: 785  VHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFN 964
            V+N  LS+LVKQN+V EA   F+++L+       DT +FNI+I GLC+ G +DRAFEVF 
Sbjct: 145  VYNNLLSVLVKQNQVHEAIYLFKEYLVMQSP--PDTWTFNILIRGLCRVGGVDRAFEVFK 202

Query: 965  DMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKL 1144
            DM SFGC  D+ TYN++INGLCK  +  R  EL +EIQS+   SPD+ TYT+IISGF K 
Sbjct: 203  DMESFGCLPDVVTYNTLINGLCKANEVQRGCELFKEIQSRSDCSPDIVTYTSIISGFCKS 262

Query: 1145 GRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITF 1324
            G+  EA +L+++M   GI+PNV TFNVLI GFG+ G++  A  ++  M+ F C+ DV+TF
Sbjct: 263  GKMKEASNLFEEMMRSGIQPNVITFNVLIDGFGKIGNIAEAEAMYRKMAYFDCSADVVTF 322

Query: 1325 TNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRW 1504
            T+L+ GYC+ G+++ GLK W+ M  R + P  YT++++IN+LCK NRLNEARD L Q++ 
Sbjct: 323  TSLIDGYCRAGQVNHGLKFWNVMKTRNVSPTVYTYAVLINALCKENRLNEARDFLGQIK- 381

Query: 1505 RNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRMH 1684
             + I+P+ F+YNPVIDGFCKAGN+D  N I+ EMEEK+C+PDK TFTILIIGHC+KGRM 
Sbjct: 382  NSSIIPKPFMYNPVIDGFCKAGNVDEGNVILKEMEEKRCDPDKVTFTILIIGHCVKGRMF 441

Query: 1685 DAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGGPKP 1864
            +AI++FN+M    CAPD ITVN L+SCLLKAGM  EAYRI+K  L+    G SS     P
Sbjct: 442  EAINIFNRMLATRCAPDNITVNSLISCLLKAGMPNEAYRIRKMALEDRNLGLSSFEKAIP 501

Query: 1865 FRASMDITVAV 1897
             R + DI VAV
Sbjct: 502  LRTNTDIPVAV 512


>ref|XP_002512275.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548236|gb|EEF49727.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 532

 Score =  514 bits (1324), Expect = e-143
 Identities = 261/518 (50%), Positives = 358/518 (69%)
 Frame = +2

Query: 341  KIESLWFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFF 520
            K +  WF+K +  L +R+  + A        K   P +AF VI+ +NNN   P++  +F 
Sbjct: 24   KNQEAWFVKVIAILFVRSHCSDATSLGYLSEKLNDPLVAFEVIKRLNNN---PQVGLKFM 80

Query: 521  QFTRLNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAH 700
            +F RLN +LIH  +T+ LL+RSLCQMGL DL  +V+ YM +DG  +D  +L FLV+S A 
Sbjct: 81   EFCRLNFSLIHCFSTYELLIRSLCQMGLHDLVEMVIGYMRSDGHLIDSRVLGFLVTSFAQ 140

Query: 701  AGKFSIGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSF 880
            AGKF + ++++I       G+E  I+SFV+N  L+ LVK  +V EA   F+++L      
Sbjct: 141  AGKFDLAKKLIIEVQ----GEEARISSFVYNYLLNELVKGGKVHEAIFLFKENLAFHSP- 195

Query: 881  CLDTCSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALE 1060
              +T +FNI+I GLC+ G++++ FE+FN M SFGC  D+ TYN++I+GLCK  + DRA +
Sbjct: 196  -PNTWTFNILIRGLCRVGEVEKGFELFNAMQSFGCLPDVVTYNTLISGLCKANELDRACD 254

Query: 1061 LLREIQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGF 1240
            LL+E+QS+   SPDV TYT+IISGF KLG+   A  L+++M   GI P V TFNVLI GF
Sbjct: 255  LLKEVQSRNDCSPDVMTYTSIISGFRKLGKLEAASVLFEEMIRSGIEPTVVTFNVLIDGF 314

Query: 1241 GQSGDLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNA 1420
            G+ G++ +A  + E M+ + C PDV+TFT+L+ GYC+ G+I  GLK+WD M AR + PN 
Sbjct: 315  GKIGNMVAAEAMHEKMASYSCIPDVVTFTSLIDGYCRTGDIRLGLKVWDVMKARNVSPNI 374

Query: 1421 YTFSIVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVA 1600
            YT+S++IN+LCK NR++EARDLL QL+  +D+ P+ FIYNPVIDGFCKAGN+D AN IV 
Sbjct: 375  YTYSVIINALCKDNRIHEARDLLRQLKC-SDVFPKPFIYNPVIDGFCKAGNVDEANVIVT 433

Query: 1601 EMEEKKCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAG 1780
            EMEEK+C PDK TFTILIIGHCMKGRM +A+D+F KM  +GCAPD IT++ LV+CLLKAG
Sbjct: 434  EMEEKRCRPDKVTFTILIIGHCMKGRMVEALDIFKKMLAIGCAPDNITISSLVACLLKAG 493

Query: 1781 MVTEAYRIKKDVLQGLQSGFSSAGGPKPFRASMDITVA 1894
              +EA+ I +   + L   FSS     P R   DI+VA
Sbjct: 494  KPSEAFHIVQTASEDLNLSFSSLRKTFPMRVKTDISVA 531


>ref|XP_004141071.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Cucumis sativus]
          Length = 548

 Score =  508 bits (1307), Expect = e-141
 Identities = 264/514 (51%), Positives = 353/514 (68%)
 Frame = +2

Query: 356  WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535
            W +K VCTL  R+    A FG  Y  +NL P IAF VI+     F  P L  +FF+F+R 
Sbjct: 48   WLVKVVCTLFFRSHSLNACFG--YLSRNLNPSIAFEVIKR----FSDPLLGLKFFEFSRT 101

Query: 536  NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715
            +L++ H+  T++LL+R+LC++GL D A +V + M +DG+  D  +LE LVSS A  GK  
Sbjct: 102  HLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLD 161

Query: 716  IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895
              +  L      C G +  ++ FV+N  L++LVKQN VDEA + FR+HL     F  D  
Sbjct: 162  SAKNFL--NEVHCYGIK--VSPFVYNNLLNMLVKQNLVDEAVLLFREHLEPY--FVPDVY 215

Query: 896  SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075
            SFNI+I GLC+ G+ID+AFE F +MG+FGCF DI +YN++ING C++ +  +  +LL+E 
Sbjct: 216  SFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKED 275

Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255
                G+SPDV TYT+IISG+ KLG    A  L+D+M   GI+PN +TFNVLI GFG+ G+
Sbjct: 276  MLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGN 335

Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435
            + SA+ ++E M   GC PDV+TFT+L+ GYC+ GE++QGLKLW+EM  R L PN YT+++
Sbjct: 336  MRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAV 395

Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615
            +IN+LCK NR+ EAR+ L  L+  ++++P+ FIYNPVIDGFCKAG +D AN IVAEM+EK
Sbjct: 396  LINALCKENRIREARNFLRHLK-SSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEK 454

Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795
            KC PDK TFTILIIG+CMKGRM +AI  F KM  + C PD IT+N L+SCLLKAGM  EA
Sbjct: 455  KCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKAGMPNEA 514

Query: 1796 YRIKKDVLQGLQSGFSSAGGPKPFRASMDITVAV 1897
             +IK+  LQ L  G SS G P   R S  + VAV
Sbjct: 515  SQIKQAALQKLNLGLSSLGSPLT-RKSSRVPVAV 547


>ref|XP_004157939.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Cucumis sativus]
          Length = 548

 Score =  506 bits (1304), Expect = e-140
 Identities = 260/507 (51%), Positives = 350/507 (69%)
 Frame = +2

Query: 356  WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535
            W +K VCTL  R+    A FG  Y  +NL P IAF VI+     F  P L  +FF+F+R 
Sbjct: 48   WLVKVVCTLFFRSHSLNACFG--YLSRNLNPSIAFEVIKR----FSDPLLGLKFFEFSRT 101

Query: 536  NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715
            +L++ H+  T++LL+R+LC++GL D A +V + M +DG+  D  +LE LVSS A  GK  
Sbjct: 102  HLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLD 161

Query: 716  IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895
              +  L      C G +  ++ FV+N  L++LVKQN VDEA + FR+HL     F  D  
Sbjct: 162  SAKNFL--NEVHCYGIK--VSPFVYNNLLNMLVKQNLVDEAVLLFREHLEPY--FVPDVY 215

Query: 896  SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075
            SFNI+I GLC+ G+ID+AFE F +MG+FGCF DI +YN++ING C++ +  +  +LL+E 
Sbjct: 216  SFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKED 275

Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255
                G+SPDV TYT+IISG+ KLG    A  L+D+M   GI+PN +TFNVLI GFG+ G+
Sbjct: 276  MLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGN 335

Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435
            + SA+ ++E M   GC PDV+TFT+L+ GYC+ GE++QGLKLW+EM  R L PN YT+++
Sbjct: 336  MRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAV 395

Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615
            +IN+LCK NR+ EAR+ L  L+  ++++P+ FIYNPVIDGFCKAG +D AN IVAEM+EK
Sbjct: 396  LINALCKENRIREARNFLRHLK-SSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEK 454

Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795
            KC PDK TFTILIIG+CMKGRM +AI  F KM  + C PD IT+N L+SCLLKAGM  EA
Sbjct: 455  KCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKAGMPNEA 514

Query: 1796 YRIKKDVLQGLQSGFSSAGGPKPFRAS 1876
             +IK+  LQ L  G SS G P   ++S
Sbjct: 515  SQIKQAALQKLNLGLSSLGSPLTRKSS 541


>gb|EXB38956.1| hypothetical protein L484_027391 [Morus notabilis]
          Length = 570

 Score =  506 bits (1303), Expect = e-140
 Identities = 266/515 (51%), Positives = 349/515 (67%), Gaps = 1/515 (0%)
 Frame = +2

Query: 356  WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535
            WF+K V TL +R+      FG  Y  K L P I+F VI+ +NNN   P L  +FF+ +R 
Sbjct: 75   WFVKVVSTLFVRSQSLNTFFG--YLSKKLTPSISFEVIKRLNNN---PNLGLKFFELSRA 129

Query: 536  NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715
            NL++ HS +T+NLL+RSLCQMG  D A  V + M  DG S D   +EFLV   A  GK  
Sbjct: 130  NLSVNHSFSTYNLLIRSLCQMGFHDSAKFVFDCMRIDGHSPDNSTIEFLVCVFAKVGKLD 189

Query: 716  IGREILISQAQLCLGKEEI-INSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDT 892
               ++L          EEI  + FV++   ++LVK N+V EA   FR  +     F  DT
Sbjct: 190  SCEKLL----------EEIRASKFVYSSLFNVLVKNNKVYEAVCLFRKQIGS--HFVPDT 237

Query: 893  CSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLRE 1072
             +FNI+I GLC  G++  AFE FNDMG F C  D+ TYN++I+GLC+  + DR  +LLRE
Sbjct: 238  WTFNILIGGLCGVGEVHSAFEFFNDMGKFRCSPDVVTYNTLISGLCRTNEVDRGCDLLRE 297

Query: 1073 IQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSG 1252
            +Q +G  SP+V+T+T++I G+ KLGR  EA  L+D+M   G RP   TFNVLI  F + G
Sbjct: 298  VQLRGDFSPNVRTFTSVILGYCKLGRMEEASALFDEMMDSGTRPTTVTFNVLIDAFSKVG 357

Query: 1253 DLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFS 1432
            D+ SA+ ++E M   G  PDV+TFT+L+ GYC++G++++GLKLW EM+ R + PN YT+S
Sbjct: 358  DMASAIALYEKMLFHGYRPDVVTFTSLIDGYCRVGQLNRGLKLWCEMSVRNVSPNGYTYS 417

Query: 1433 IVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEE 1612
            +VI++LCK NRL+EARDLL QL   N I+P+ F+YNPVIDGFCKAGN+D AN IVAEMEE
Sbjct: 418  VVIHALCKVNRLHEARDLLRQLNCTN-IVPKPFMYNPVIDGFCKAGNVDEANMIVAEMEE 476

Query: 1613 KKCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTE 1792
            K+CNPDK TFTILI+G+CMKGRM DAI VF KM  VGCAPD+ITV+CL+SCLLKAGM  E
Sbjct: 477  KRCNPDKMTFTILILGNCMKGRMVDAIGVFYKMLAVGCAPDKITVHCLMSCLLKAGMPNE 536

Query: 1793 AYRIKKDVLQGLQSGFSSAGGPKPFRASMDITVAV 1897
            A+ IK+ V++ L  G SS       RA  +I +AV
Sbjct: 537  AFHIKETVMKSLNVGMSSLRS-NHMRAIAEIPMAV 570


>ref|XP_006474728.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            isoform X1 [Citrus sinensis]
            gi|568841566|ref|XP_006474729.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X2 [Citrus sinensis]
          Length = 595

 Score =  503 bits (1295), Expect = e-139
 Identities = 266/549 (48%), Positives = 369/549 (67%), Gaps = 5/549 (0%)
 Frame = +2

Query: 263  RNITFKSIARCFHGISNIESSPPTHNKIE-----SLWFIKFVCTLCIRNAENLAIFGSDY 427
            R  T  +IA  FHG++N  S P    ++        WF+K VCTL +R++  L+   + Y
Sbjct: 59   RASTIAAIAH-FHGLANGGSRPFDEKEVNYRCSNEFWFVKVVCTLLLRSSY-LSDTCARY 116

Query: 428  FRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLL 607
              + L P  +  VI+ ++N    P+L  +F +F+R+NL+L HS  T+NL++RSLC+MGL 
Sbjct: 117  LCEKLSPLNSLEVIKRLDN----PKLGLKFLEFSRVNLSLNHSFKTYNLVMRSLCEMGLH 172

Query: 608  DLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSFV 787
            D   +V +YM +DG   + P++EF VSS   AGK    + +L   +Q   G E  +++F+
Sbjct: 173  DSVQVVFDYMRSDGHLPNSPMIEFFVSSCIRAGKCDAAKGLL---SQFRPG-EVTMSTFM 228

Query: 788  HNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFND 967
            +N  L+ LVKQN  DEA   F+++         DT +FNI+I GL + G++ +AFE F D
Sbjct: 229  YNSLLNALVKQNNADEAVYMFKEYFRLYSQ--PDTWTFNILIQGLSRIGEVKKAFEFFYD 286

Query: 968  MGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKLG 1147
            MGSFGC  DI TYN++I+GLC++ +  R  ELL+E++ +   SPDV TYT++ISG+ KLG
Sbjct: 287  MGSFGCSPDIVTYNTLISGLCRVNEVARGHELLKEVKFKSEFSPDVVTYTSVISGYCKLG 346

Query: 1148 RNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITFT 1327
            +  +A  ++++M   GI+P+  TFNVLI GFG+ G++ SA  + E M  FG  PDV+TF+
Sbjct: 347  KMDKATGIYNEMNSCGIKPSAVTFNVLIDGFGKVGNMVSAEYMRERMLSFGYLPDVVTFS 406

Query: 1328 NLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRWR 1507
            +L+ GYC+ G+++QGLKL DEM  + L PN YTF+I+IN+LCK NRLN+AR  L QL+W 
Sbjct: 407  SLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFTILINALCKENRLNDARRFLKQLKW- 465

Query: 1508 NDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRMHD 1687
            ND++P+ F+YNPVIDGFCKAGN+D AN IVAEMEEK+C PDK TFTILIIGHCMKGRM +
Sbjct: 466  NDLVPKPFMYNPVIDGFCKAGNVDEANVIVAEMEEKRCKPDKVTFTILIIGHCMKGRMVE 525

Query: 1688 AIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGGPKPF 1867
            AI +FNKM  +GCAPD ITVN L+SCLLK GM  EA+RI +   + L     S     P 
Sbjct: 526  AISIFNKMLTIGCAPDDITVNSLISCLLKGGMPNEAFRIMQRASEDLNLQLPSWKKAVPL 585

Query: 1868 RASMDITVA 1894
            R + DI VA
Sbjct: 586  RTNTDIPVA 594


>ref|XP_004301429.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Fragaria vesca subsp. vesca]
          Length = 583

 Score =  503 bits (1294), Expect = e-139
 Identities = 262/515 (50%), Positives = 359/515 (69%), Gaps = 1/515 (0%)
 Frame = +2

Query: 356  WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535
            WF+K V TL +R+    +  G  Y  KNL P +AF VI+ +NN    P+L   FF+ ++ 
Sbjct: 84   WFVKVVYTLFLRSHSLDSYVG--YLSKNLTPSLAFEVIKRLNN----PKLGLRFFELSKF 137

Query: 536  NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715
            +LN+ H + T++ LLRSLCQMGL D A LV +YM  DGLS +  +LEFLVSS A  G+  
Sbjct: 138  SLNVNHGVWTYHYLLRSLCQMGLQDSAKLVFDYMRTDGLSPNESVLEFLVSSCAQMGRSD 197

Query: 716  IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCL-DT 892
            +  +IL       +G    ++SFV+N   ++LVK NRVDEA   FR ++    S+C  D+
Sbjct: 198  LAEKILDEVHCSVVG----LSSFVYNNLFNVLVKLNRVDEAVCLFRKYV---GSYCCPDS 250

Query: 893  CSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLRE 1072
             +FNI+I GLC+ G +D+  E F+DM SFGC  ++ TYN++I+GLC+  + DR  +LLRE
Sbjct: 251  WTFNILIRGLCRTGAVDKGLEFFSDMRSFGCSPNVVTYNTLISGLCRAHEVDRGCDLLRE 310

Query: 1073 IQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSG 1252
            +Q +  LSPDV T+T++ISG+ KLGR  EA  ++D+M   G++P   TFN LI G+G++G
Sbjct: 311  VQFRSELSPDVITFTSVISGYCKLGRMEEASAIFDEMIGCGLKPTAVTFNALIDGYGKAG 370

Query: 1253 DLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFS 1432
            D+ SA  ++ESM   G   DVITFT+L+ GYC+ G ++ GL+LW EMNA+ + P+AYTFS
Sbjct: 371  DMSSAFSLYESMLFHGHCADVITFTSLIDGYCRAGHLNHGLQLWHEMNAKNVSPSAYTFS 430

Query: 1433 IVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEE 1612
            ++IN+LCK NRL EARDLL +L+  N ++P++F+YNPVIDG CKAGN+D AN IVAEMEE
Sbjct: 431  VLINALCKGNRLCEARDLLRELKGSN-VVPKSFLYNPVIDGLCKAGNIDEANLIVAEMEE 489

Query: 1613 KKCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTE 1792
            KKC PD+ TFTILI+G+ MKGRM +AI  F+KM  +GCAPD+IT++ L+SCL KAGM +E
Sbjct: 490  KKCTPDRVTFTILILGNSMKGRMSEAIGNFSKMLSIGCAPDKITIDSLISCLSKAGMPSE 549

Query: 1793 AYRIKKDVLQGLQSGFSSAGGPKPFRASMDITVAV 1897
            A RIKK   + L  G  S G P   RA+ +I VAV
Sbjct: 550  AGRIKKIAYEDLNMGAPSMGRPPHLRAN-EIPVAV 583


>ref|XP_006452806.1| hypothetical protein CICLE_v10007804mg [Citrus clementina]
            gi|557556032|gb|ESR66046.1| hypothetical protein
            CICLE_v10007804mg [Citrus clementina]
          Length = 595

 Score =  499 bits (1285), Expect = e-138
 Identities = 264/549 (48%), Positives = 367/549 (66%), Gaps = 5/549 (0%)
 Frame = +2

Query: 263  RNITFKSIARCFHGISNIESSPPTHNKIE-----SLWFIKFVCTLCIRNAENLAIFGSDY 427
            R  T  +IA  FHG++N  S P    ++        WF+K VCTL +R++  L+   + Y
Sbjct: 59   RASTIAAIAH-FHGLANGGSRPFDEKEVNYRCSNEFWFVKVVCTLLLRSSY-LSDTCARY 116

Query: 428  FRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLL 607
              + L P  +  VI+ ++N    P+L  +F +F+R+NL+L HS  T+NL++RSLC+MGL 
Sbjct: 117  LCEKLSPLNSLEVIKRLDN----PKLGLKFLEFSRVNLSLNHSFKTYNLVMRSLCEMGLH 172

Query: 608  DLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSFV 787
            D   +V +YM +DG   + P++EF VSS   AGK    + +L   +Q   G E  +++F+
Sbjct: 173  DSVQVVFDYMRSDGHLPNSPMIEFFVSSCIRAGKCDAAKGLL---SQFRPG-EVTMSTFM 228

Query: 788  HNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFND 967
            +N  L+ LVKQN  DEA   F+++         DT +FNI+I GLC+ G++ +AFE F D
Sbjct: 229  YNSLLNALVKQNNADEAVYMFKEYFRLYSQ--PDTWTFNILIRGLCRIGEVKKAFEFFYD 286

Query: 968  MGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKLG 1147
            MGSFGC  DI TYN++I+GLC++ +  R  ELL+E++ +    PDV TYT++ISG+ KLG
Sbjct: 287  MGSFGCSPDIVTYNTLISGLCRVNEVARGHELLKEVKFKSEFLPDVVTYTSVISGYCKLG 346

Query: 1148 RNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITFT 1327
            +  +A  ++++M   GI+P+  TFNVLI GFG+ G++ SA  + E M   G  PDV+TF+
Sbjct: 347  KMDKATSIYNEMNSCGIKPSAVTFNVLIDGFGKVGNMVSAEYMRERMLSLGYLPDVVTFS 406

Query: 1328 NLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRWR 1507
            +L+ GYC+ G+++QGLKL DEM  + L PN YTF+I+IN+LCK NRLN+AR  L QL+W 
Sbjct: 407  SLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFAILINALCKENRLNDARRFLKQLKW- 465

Query: 1508 NDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRMHD 1687
            ND++P+ F+YNPVIDGFCKAGN+D AN IVAEMEEK+C PDK TFTILIIGHCMKGRM +
Sbjct: 466  NDLVPKPFMYNPVIDGFCKAGNVDEANVIVAEMEEKRCKPDKVTFTILIIGHCMKGRMVE 525

Query: 1688 AIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGGPKPF 1867
            AI +FNKM  +GCAPD ITVN L+SCLLK GM  EA+RI +   +       S     P 
Sbjct: 526  AISIFNKMLRIGCAPDDITVNSLISCLLKGGMPNEAFRIMQRASEDQNLQLPSWKKAVPL 585

Query: 1868 RASMDITVA 1894
            R + DI VA
Sbjct: 586  RTNTDIPVA 594


>ref|XP_003550612.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Glycine max]
          Length = 544

 Score =  470 bits (1210), Expect = e-129
 Identities = 249/494 (50%), Positives = 346/494 (70%), Gaps = 1/494 (0%)
 Frame = +2

Query: 356  WFIKFVCTLCI-RNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTR 532
            WF+K V TL +  N+ +    G  YFR++L P     V++  NN    P L F+FF+FTR
Sbjct: 46   WFVKIVSTLFLCSNSLDDRFLG--YFREHLTPSHVLEVVKRFNN----PNLGFKFFRFTR 99

Query: 533  LNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKF 712
              L++ HS  T+N+LLRSLCQ GL + A L+ + M +DG   D  LL FLVSS A A +F
Sbjct: 100  ERLSMSHSFWTYNMLLRSLCQAGLHNSAKLLYDSMRSDGQLPDSRLLGFLVSSFALADRF 159

Query: 713  SIGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDT 892
             + +E+L ++AQ C G +  ++  V+N FL++L+K NR+D+A   FR+ L++  S CLD 
Sbjct: 160  DVSKELL-AEAQ-CSGVQ--VDVIVYNNFLNILIKHNRLDDAICLFRE-LMRSHS-CLDA 213

Query: 893  CSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLRE 1072
             +FNI+I GLC AG +D AFE+  DMGSFGC  DI TYN +++GLC++   DRA +LL E
Sbjct: 214  FTFNILIRGLCTAGDVDEAFELLGDMGSFGCSPDIVTYNILLHGLCRIDQVDRARDLLEE 273

Query: 1073 IQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSG 1252
            +  +   +P+V +YTT+ISG+ +L +  EA  L+ +M   G +PNV+TF+ L+ GF ++G
Sbjct: 274  VCLKCEFAPNVVSYTTVISGYCRLSKMDEASSLFYEMVRSGTKPNVFTFSALVDGFVKAG 333

Query: 1253 DLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFS 1432
            D+ SAL + + +   GCAP+VIT T+L++GYC+ G ++ GL LW EMNAR +  N YT+S
Sbjct: 334  DMASALGMHKKILFHGCAPNVITLTSLINGYCRAGWVNHGLDLWREMNARNIPANLYTYS 393

Query: 1433 IVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEE 1612
            ++I++LCKSNRL EAR+LL  L+ ++DI+P AF+YNPVIDG+CK+GN+D ANAIVAEMEE
Sbjct: 394  VLISALCKSNRLQEARNLLRILK-QSDIVPLAFVYNPVIDGYCKSGNIDEANAIVAEMEE 452

Query: 1613 KKCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTE 1792
             KC PDK TFTILIIGHCMKGR  +AI +F KM   GC PD IT+  L SCLLK+GM  E
Sbjct: 453  -KCKPDKLTFTILIIGHCMKGRTPEAIGIFYKMLASGCTPDDITIRTLSSCLLKSGMPGE 511

Query: 1793 AYRIKKDVLQGLQS 1834
            A RIK+ + +  +S
Sbjct: 512  AARIKETLFENQES 525


>ref|XP_002885810.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297331650|gb|EFH62069.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 536

 Score =  464 bits (1195), Expect = e-128
 Identities = 246/528 (46%), Positives = 346/528 (65%), Gaps = 10/528 (1%)
 Frame = +2

Query: 272  TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 430
            TF +    FH  S+   ++ P  +N  E +     W +K V TL +    +  +    Y 
Sbjct: 5    TFATAIAHFHTHSHGGAQARPIQNNTREKIHCPEAWLVKIVSTLFVYRVPDSDLCFC-YL 63

Query: 431  RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 610
             KNL P I+F V++ ++NN   P + F F++F+R  LN+ HS  T+NLL RSLC+ G+ D
Sbjct: 64   SKNLNPFISFEVVKKLDNN---PHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGMHD 120

Query: 611  LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 781
            LA  + E M +DG+S +  LL FLVSS A  GK      +L+   ++   C+        
Sbjct: 121  LAGQMFECMKSDGISPNSRLLGFLVSSFAEKGKLHCATALLLQSYEVEGCCM-------- 172

Query: 782  FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 961
             V N  L+ LVK +RV++A   F +HL + +S C DT +FNI+I GLC  G+ ++A E+ 
Sbjct: 173  -VVNSLLNTLVKLDRVEDAMKLFEEHL-RFQS-CNDTKTFNILIRGLCGVGKAEKAVELL 229

Query: 962  NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 1141
              M  FGC  DI TYN++I G CK  +  +A E+  +++S  G SPDV TYT++ISG+ K
Sbjct: 230  GGMSGFGCLPDIVTYNTLIKGFCKSNELKKANEMFDDVKSSSGCSPDVVTYTSMISGYCK 289

Query: 1142 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 1321
             G+  EA  L DDM   GI P   TFNVL+ G+ ++G++ +A +I   M  FGC PDV+T
Sbjct: 290  AGKMQEASVLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMHTAEEIRGKMISFGCFPDVVT 349

Query: 1322 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 1501
            FT+L+ GYC++G+++QG +LW+EMNAR ++PNA+T+SI+IN+LCK NRL +AR+LL QL 
Sbjct: 350  FTSLIDGYCRVGQVNQGFRLWEEMNARGMFPNAFTYSILINALCKENRLLKARELLGQLA 409

Query: 1502 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRM 1681
               DI+PQ F+YNPVIDGFCKAG ++ A  IV EME+KKC PDK TFTILIIGHCMKGRM
Sbjct: 410  -SKDIIPQPFMYNPVIDGFCKAGKVNEAIVIVEEMEKKKCKPDKITFTILIIGHCMKGRM 468

Query: 1682 HDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQG 1825
             +A+ +F+KM  +GC+PD+ITV+ L+SCLLKAGM  EAY + +   +G
Sbjct: 469  FEAVSIFHKMVAIGCSPDKITVSSLLSCLLKAGMAKEAYHLNQIAHKG 516


>dbj|BAH19478.1| AT2G06000 [Arabidopsis thaliana]
          Length = 536

 Score =  462 bits (1189), Expect = e-127
 Identities = 246/528 (46%), Positives = 346/528 (65%), Gaps = 10/528 (1%)
 Frame = +2

Query: 272  TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 430
            TF +    FH  S+   ++ P  +N  E +     W +K V TL +    +  +    Y 
Sbjct: 5    TFATAIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDLCFC-YL 63

Query: 431  RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 610
             KNL P I+F V++ ++NN   P + F F++F+R  LN+ HS  T+NLL RSLC+ GL D
Sbjct: 64   SKNLNPFISFEVVKKLDNN---PHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGLHD 120

Query: 611  LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 781
            LA  + E M +DG+S +  LL FLVSS A  GK      +L+   ++   C+        
Sbjct: 121  LAGQMFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSFEVEGCCM-------- 172

Query: 782  FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 961
             V N  L+ LVK +RV++A   F +HL + +S C DT +FNI+I GLC  G+ ++A E+ 
Sbjct: 173  -VVNSLLNTLVKLDRVEDAMKLFDEHL-RFQS-CNDTKTFNILIRGLCGVGKAEKALELL 229

Query: 962  NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 1141
              M  FGC  DI TYN++I G CK  + ++A E+ ++++S    SPDV TYT++ISG+ K
Sbjct: 230  GVMSGFGCEPDIVTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCK 289

Query: 1142 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 1321
             G+  EA  L DDM   GI P   TFNVL+ G+ ++G++ +A +I   M  FGC PDV+T
Sbjct: 290  AGKMREASSLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVT 349

Query: 1322 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 1501
            FT+L+ GYC++G++ QG +LW+EMNAR ++PNA+T+SI+IN+LC  NRL +AR+LL QL 
Sbjct: 350  FTSLIDGYCRVGQVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLA 409

Query: 1502 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRM 1681
               DI+PQ F+YNPVIDGFCKAG ++ AN IV EME+KKC PDK TFTILIIGHCMKGRM
Sbjct: 410  -SKDIIPQPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRM 468

Query: 1682 HDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQG 1825
             +A+ +F+KM  +GC+PD+ITV+ L+SCLLKAGM  EAY + +   +G
Sbjct: 469  FEAVSIFHKMVAIGCSPDKITVSSLLSCLLKAGMAKEAYHLNQIARKG 516


>ref|NP_178657.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|42570711|ref|NP_973429.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|75216767|sp|Q9ZUE9.1|PP149_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g06000 gi|4006835|gb|AAC95177.1| hypothetical protein
            [Arabidopsis thaliana] gi|110736272|dbj|BAF00106.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|330250896|gb|AEC05990.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|330250897|gb|AEC05991.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 536

 Score =  462 bits (1189), Expect = e-127
 Identities = 246/528 (46%), Positives = 346/528 (65%), Gaps = 10/528 (1%)
 Frame = +2

Query: 272  TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 430
            TF +    FH  S+   ++ P  +N  E +     W +K V TL +    +  +    Y 
Sbjct: 5    TFATAIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDLCFC-YL 63

Query: 431  RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 610
             KNL P I+F V++ ++NN   P + F F++F+R  LN+ HS  T+NLL RSLC+ GL D
Sbjct: 64   SKNLNPFISFEVVKKLDNN---PHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGLHD 120

Query: 611  LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 781
            LA  + E M +DG+S +  LL FLVSS A  GK      +L+   ++   C+        
Sbjct: 121  LAGQMFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSFEVEGCCM-------- 172

Query: 782  FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 961
             V N  L+ LVK +RV++A   F +HL + +S C DT +FNI+I GLC  G+ ++A E+ 
Sbjct: 173  -VVNSLLNTLVKLDRVEDAMKLFDEHL-RFQS-CNDTKTFNILIRGLCGVGKAEKALELL 229

Query: 962  NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 1141
              M  FGC  DI TYN++I G CK  + ++A E+ ++++S    SPDV TYT++ISG+ K
Sbjct: 230  GVMSGFGCEPDIVTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCK 289

Query: 1142 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 1321
             G+  EA  L DDM   GI P   TFNVL+ G+ ++G++ +A +I   M  FGC PDV+T
Sbjct: 290  AGKMREASSLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVT 349

Query: 1322 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 1501
            FT+L+ GYC++G++ QG +LW+EMNAR ++PNA+T+SI+IN+LC  NRL +AR+LL QL 
Sbjct: 350  FTSLIDGYCRVGQVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLA 409

Query: 1502 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRM 1681
               DI+PQ F+YNPVIDGFCKAG ++ AN IV EME+KKC PDK TFTILIIGHCMKGRM
Sbjct: 410  -SKDIIPQPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRM 468

Query: 1682 HDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQG 1825
             +A+ +F+KM  +GC+PD+ITV+ L+SCLLKAGM  EAY + +   +G
Sbjct: 469  FEAVSIFHKMVAIGCSPDKITVSSLLSCLLKAGMAKEAYHLNQIARKG 516


>ref|XP_006297396.1| hypothetical protein CARUB_v10013421mg [Capsella rubella]
            gi|565479514|ref|XP_006297397.1| hypothetical protein
            CARUB_v10013421mg [Capsella rubella]
            gi|482566105|gb|EOA30294.1| hypothetical protein
            CARUB_v10013421mg [Capsella rubella]
            gi|482566106|gb|EOA30295.1| hypothetical protein
            CARUB_v10013421mg [Capsella rubella]
          Length = 535

 Score =  460 bits (1184), Expect = e-126
 Identities = 247/523 (47%), Positives = 337/523 (64%), Gaps = 10/523 (1%)
 Frame = +2

Query: 272  TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 430
            TF +    FH  S+   ++ P   NK E +     W IK V TL +    +  +    Y 
Sbjct: 5    TFATAIAHFHTHSHGGAQARPLHSNKREVMHCPEAWLIKIVSTLFVYRVPDSDLCFC-YL 63

Query: 431  RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 610
             KNL P IAF V++ ++NN   P L F F++F+R  LN+ HS  T+N+L RSLC+ G+ D
Sbjct: 64   SKNLNPFIAFEVVKKLDNNH--PHLGFRFWEFSRFKLNIRHSFWTYNVLTRSLCKAGMHD 121

Query: 611  LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 781
            LA  + E M +DG+S +  LL FLVSS A  GK      +L+   ++   C+        
Sbjct: 122  LAGQMFECMRSDGVSPNSRLLGFLVSSFAEKGKLQFATALLLQSYEVERCCM-------- 173

Query: 782  FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 961
             V N  L+ LVK +RVD+A   F  HL      C DT +FNI+I GLC  G+ ++A E+ 
Sbjct: 174  -VVNSLLNTLVKLDRVDDAMKLFDKHLRF--QCCNDTKTFNILIRGLCSVGKGEKALELL 230

Query: 962  NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 1141
             +M  FGC  DI TYN++I G CK  +  +A E+L +++S  G SPDV TYT++ISG+ K
Sbjct: 231  GEMSGFGCSPDIVTYNTLIKGFCKSNELAKANEMLNDVKSSSGCSPDVVTYTSMISGYCK 290

Query: 1142 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 1321
             G+  EA  L DDM   GI P   TFNVL+ G+ ++G++ SA  I   M  FGC PDV+T
Sbjct: 291  AGKMQEAYLLLDDMLGLGIYPTTITFNVLVDGYAKAGEMTSAEDIRGKMISFGCFPDVVT 350

Query: 1322 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 1501
            FT+L+ GYC+ G+++QG +LW+EMNA+ + PN +T+SI+IN+LCK N L +AR+LL QL 
Sbjct: 351  FTSLIDGYCRAGQVNQGFRLWEEMNAKGMLPNEFTYSILINALCKENSLLKARELLGQLA 410

Query: 1502 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRM 1681
               DI+ + F+YNPVIDGFCKAG ++ AN IV EME+KKC PDK TFTILIIGHCMKGRM
Sbjct: 411  -SKDIITKPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRM 469

Query: 1682 HDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKK 1810
             +A+ +F+KM  +GC+PD+ITVN L+SCLLKAGM  EAY + +
Sbjct: 470  FEAVSIFHKMVAIGCSPDKITVNSLLSCLLKAGMAEEAYHLNQ 512



 Score =  131 bits (330), Expect = 2e-27
 Identities = 77/272 (28%), Positives = 136/272 (50%)
 Frame = +2

Query: 1001 TYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDD 1180
            TYN +   LCK G  D A ++   ++S G +SP+ +    ++S F + G+   A  L   
Sbjct: 106  TYNVLTRSLCKAGMHDLAGQMFECMRSDG-VSPNSRLLGFLVSSFAEKGKLQFATALL-- 162

Query: 1181 MTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGE 1360
            +    +       N L++   +   +  A+K+F+   +F C  D  TF  L+ G C +G+
Sbjct: 163  LQSYEVERCCMVVNSLLNTLVKLDRVDDAMKLFDKHLRFQCCNDTKTFNILIRGLCSVGK 222

Query: 1361 IDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYN 1540
             ++ L+L  EM+     P+  T++ +I   CKSN L +A ++L+ ++  +   P    Y 
Sbjct: 223  GEKALELLGEMSGFGCSPDIVTYNTLIKGFCKSNELAKANEMLNDVKSSSGCSPDVVTYT 282

Query: 1541 PVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLV 1720
             +I G+CKAG +  A  ++ +M      P   TF +L+ G+   G M  A D+  KM   
Sbjct: 283  SMISGYCKAGKMQEAYLLLDDMLGLGIYPTTITFNVLVDGYAKAGEMTSAEDIRGKMISF 342

Query: 1721 GCAPDRITVNCLVSCLLKAGMVTEAYRIKKDV 1816
            GC PD +T   L+    +AG V + +R+ +++
Sbjct: 343  GCFPDVVTFTSLIDGYCRAGQVNQGFRLWEEM 374



 Score = 84.0 bits (206), Expect = 4e-13
 Identities = 63/231 (27%), Positives = 110/231 (47%), Gaps = 4/231 (1%)
 Frame = +2

Query: 1139 KLGRNHEAL--HLWDDMTHR-GIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAP 1309
            KL  NH  L    W+    +  IR + +T+NVL     ++G    A ++FE M   G +P
Sbjct: 78   KLDNNHPHLGFRFWEFSRFKLNIRHSFWTYNVLTRSLCKAGMHDLAGQMFECMRSDGVSP 137

Query: 1310 DVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLL 1489
            +      L+S + + G++     L   + + ++       + ++N+L K +R+++A  L 
Sbjct: 138  NSRLLGFLVSSFAEKGKLQFATALL--LQSYEVERCCMVVNSLLNTLVKLDRVDDAMKLF 195

Query: 1490 SQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCM 1669
             +   R         +N +I G C  G  + A  ++ EM    C+PD  T+  LI G C 
Sbjct: 196  DK-HLRFQCCNDTKTFNILIRGLCSVGKGEKALELLGEMSGFGCSPDIVTYNTLIKGFCK 254

Query: 1670 KGRMHDAIDVFNKM-SLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVL 1819
               +  A ++ N + S  GC+PD +T   ++S   KAG + EAY +  D+L
Sbjct: 255  SNELAKANEMLNDVKSSSGCSPDVVTYTSMISGYCKAGKMQEAYLLLDDML 305


>ref|XP_006577946.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            isoform X1 [Glycine max] gi|571448762|ref|XP_006577947.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At2g06000-like isoform X2 [Glycine max]
            gi|571448764|ref|XP_006577948.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X3 [Glycine max]
            gi|571448766|ref|XP_006577949.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X4 [Glycine max]
          Length = 510

 Score =  458 bits (1179), Expect = e-126
 Identities = 239/514 (46%), Positives = 338/514 (65%)
 Frame = +2

Query: 356  WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535
            WF+K  CT+ +R+       G  YF K+L P + + V+  ++     P L F+F +F R 
Sbjct: 10   WFVKIACTVFVRSNSLDPFVG--YFSKHLTPSLVYEVVNRLHI----PNLGFKFVEFCRH 63

Query: 536  NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715
             L++ HS  T++LLLRSLC+  L   A +V ++M  DG   D  LL FLV S A  G+  
Sbjct: 64   KLHMSHSYLTYSLLLRSLCRSNLHHTAKVVYDWMRCDGQIPDNRLLGFLVWSYAIVGRLD 123

Query: 716  IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895
            + RE+L       +G    +N+ V+N   ++L++QN+V +A + FR+ L++LR +   T 
Sbjct: 124  VSRELLADVQCNNVG----VNAVVYNDLFNVLIRQNKVVDAVVLFRE-LIRLR-YKPVTY 177

Query: 896  SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075
            + NI++ GLC+AG+ID AF + ND+ SFGC  D+ TYN++I+GLC++ + DRA  LL+E+
Sbjct: 178  TVNILMRGLCRAGEIDEAFRLLNDLRSFGCLPDVITYNTLIHGLCRINEVDRARSLLKEV 237

Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255
               G  +PDV +YTTIISG+ K  +  E   L+ +M   G  PN +TFN LI GFG+ GD
Sbjct: 238  CLNGEFAPDVVSYTTIISGYCKFSKMEEGNLLFGEMIRSGTAPNTFTFNALIGGFGKLGD 297

Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435
            + SAL ++E M   GC PDV TFT+L++GY ++G++ Q + +W +MN + +    YTFS+
Sbjct: 298  MASALALYEKMLVQGCVPDVATFTSLINGYFRLGQVHQAMDMWHKMNDKNIGATLYTFSV 357

Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615
            +++ LC +NRL++ARD+L  L   +DI+PQ FIYNPVIDG+CK+GN+D AN IVAEME  
Sbjct: 358  LVSGLCNNNRLHKARDILRLLN-ESDIVPQPFIYNPVIDGYCKSGNVDEANKIVAEMEVN 416

Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795
            +C PDK TFTILIIGHCMKGRM +AI +F+KM  VGCAPD ITVN L SCLLKAGM  EA
Sbjct: 417  RCKPDKLTFTILIIGHCMKGRMPEAIGIFHKMLAVGCAPDEITVNNLRSCLLKAGMPGEA 476

Query: 1796 YRIKKDVLQGLQSGFSSAGGPKPFRASMDITVAV 1897
             R+KK + Q L  G +S+        +  I VAV
Sbjct: 477  ARVKKVLAQNLTLGITSSKKSYHETTNESIPVAV 510


>ref|XP_006396122.1| hypothetical protein EUTSA_v10002477mg [Eutrema salsugineum]
            gi|557096393|gb|ESQ36901.1| hypothetical protein
            EUTSA_v10002477mg [Eutrema salsugineum]
          Length = 535

 Score =  447 bits (1150), Expect = e-122
 Identities = 229/490 (46%), Positives = 327/490 (66%)
 Frame = +2

Query: 356  WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535
            W +K V TL +    +  +    Y  KNL P IAF V++ ++N    P + F F++F+R 
Sbjct: 40   WLVKIVSTLFVYQVPDSDLCFC-YLSKNLNPFIAFEVVKKLDN----PHIGFRFWEFSRF 94

Query: 536  NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715
             LN+ HS  T+NLL RSLC+ GL DLA  + E M +DG+S +  LL FLVSS A  GK  
Sbjct: 95   KLNIRHSFWTYNLLTRSLCKAGLHDLAGKMFECMKSDGVSPNSRLLGFLVSSFAEKGKLH 154

Query: 716  IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895
                +L+   ++  G   ++NS +H      LV+ +RV++A   F  HL      C DT 
Sbjct: 155  FATALLLQSYEV-EGSSMVVNSLLHT-----LVRLDRVEDAMKLFDTHLRS--QSCNDTR 206

Query: 896  SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075
            +FNI+I GLC  G+   A ++  +M SFG   DI TYN++I G CK  + ++A E+  E+
Sbjct: 207  TFNILIQGLCGIGKAHEALKLLGEMSSFGSSPDIVTYNTLIKGFCKSNELNKANEIFNEV 266

Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255
            +S+ G   DV TYT+++SG+ K G+  EA  L D+M   G+ P   TFNVL++G+ ++G+
Sbjct: 267  KSRNGCFRDVVTYTSMMSGYCKAGKMREASLLLDEMVGLGMYPTNITFNVLVYGYVKAGE 326

Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435
            + SA  I   M  FGC PDV+TFT L+ GYC++G++++G  LW+EM+A+ ++PNA+T+SI
Sbjct: 327  MSSAEAIRRKMDSFGCFPDVVTFTTLIDGYCRVGQVNKGFSLWEEMSAKGMFPNAFTYSI 386

Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615
            +IN+LCK NRL +AR+LL QL    DI+P+ F+YNP+IDGFCKAG ++ AN IVAEME+ 
Sbjct: 387  LINALCKENRLLKARELLGQLACM-DIVPKPFLYNPIIDGFCKAGKVNEANVIVAEMEKF 445

Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795
            +C PDK TFTILIIGHCMKGRM +AI +F+KM  +GC+PD+ITV+ L SCLLKAGM  EA
Sbjct: 446  RCKPDKITFTILIIGHCMKGRMCEAISIFHKMVAIGCSPDKITVSSLSSCLLKAGMAKEA 505

Query: 1796 YRIKKDVLQG 1825
            Y++ +  ++G
Sbjct: 506  YQLNQFAVKG 515


Top