BLASTX nr result

ID: Cocculus23_contig00014629 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00014629
         (2134 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278014.2| PREDICTED: pentatricopeptide repeat-containi...   600   e-169
ref|XP_004141071.1| PREDICTED: pentatricopeptide repeat-containi...   565   e-158
ref|XP_004157939.1| PREDICTED: pentatricopeptide repeat-containi...   564   e-158
ref|XP_007012633.1| Pentatricopeptide repeat superfamily protein...   560   e-156
ref|XP_006452806.1| hypothetical protein CICLE_v10007804mg [Citr...   556   e-155
ref|XP_006474728.1| PREDICTED: pentatricopeptide repeat-containi...   555   e-155
ref|XP_004301429.1| PREDICTED: pentatricopeptide repeat-containi...   532   e-148
ref|XP_002512275.1| pentatricopeptide repeat-containing protein,...   527   e-147
gb|EXB38956.1| hypothetical protein L484_027391 [Morus notabilis]     521   e-145
gb|EYU18527.1| hypothetical protein MIMGU_mgv1a003955mg [Mimulus...   512   e-142
ref|XP_006351831.1| PREDICTED: pentatricopeptide repeat-containi...   497   e-137
ref|XP_006381622.1| pentatricopeptide repeat-containing family p...   497   e-137
ref|XP_004230611.1| PREDICTED: pentatricopeptide repeat-containi...   493   e-136
ref|XP_007204825.1| hypothetical protein PRUPE_ppa004064mg [Prun...   489   e-135
ref|XP_006396122.1| hypothetical protein EUTSA_v10002477mg [Eutr...   488   e-135
ref|XP_006577946.1| PREDICTED: pentatricopeptide repeat-containi...   487   e-134
ref|XP_006297396.1| hypothetical protein CARUB_v10013421mg [Caps...   487   e-134
dbj|BAH19478.1| AT2G06000 [Arabidopsis thaliana]                      486   e-134
ref|NP_178657.1| pentatricopeptide repeat-containing protein [Ar...   486   e-134
ref|XP_002885810.1| pentatricopeptide repeat-containing protein ...   484   e-134

>ref|XP_002278014.2| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Vitis vinifera]
          Length = 641

 Score =  600 bits (1547), Expect = e-169
 Identities = 315/568 (55%), Positives = 395/568 (69%), Gaps = 5/568 (0%)
 Frame = -2

Query: 1887 NGVRFQMTLFLLPTSRSRVLRAASTTFFISSFHDHAVQVSSQPDFRISDREVIRNSEAWI 1708
            +GV  QMTL L  T  SRV    ++   I+ FH+HAV +S          EVI+N E WI
Sbjct: 27   DGVALQMTLLLFITRPSRV---RASKIAIAQFHEHAVGISRNRP------EVIQNPENWI 77

Query: 1707 VKVVSTLCV-VHGLKSSTDFACLEYFSKSFNPSIVYGVVRRLGNPKLALKFFEFCFCRLN 1531
            VKV+ TLCV  H L      ACL+YFSK+  PSI + VVR L NP+LALKFF+     LN
Sbjct: 78   VKVICTLCVRTHSLD-----ACLDYFSKTLTPSIAFEVVRGLNNPELALKFFQLSRVNLN 132

Query: 1530 GDELNHLVKSYNFLVKSLCQAGLHESASKLV----FDGHFPDGPVFDFLVSSCAQSGKLD 1363
               L H  ++Y+FL++SL + G HESA  +      DGH PD  V  FLVSS   +GK +
Sbjct: 133  ---LCHSFRTYSFLLRSLSEMGFHESAKAVYDCMNIDGHSPDASVLGFLVSSATDAGKFN 189

Query: 1362 IAKRLLVQACHGKIKXXXXXXXXXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIV 1183
            IA+  +       ++                + VDEA+CFF+ E + L    D+CSFNI+
Sbjct: 190  IARTWV-----DGVEFSLVVYNKLLNQLVRGNQVDEAVCFFR-EQMGLHGPFDSCSFNIL 243

Query: 1182 TRGLCRLGKIDLAFELFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTG 1003
             RGLCR+GK+D AFELFN+M  F CSPDV+TYNTLI+GFCR  +VD+G++LL ++  K  
Sbjct: 244  IRGLCRIGKVDKAFELFNEMRGFGCSPDVITYNTLINGFCRVNEVDRGHDLLKELLSKND 303

Query: 1002 LSPDVVTYTSVISGYCKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAV 823
            LSPDVVTYTS+ISGYCKLGKME+AS L + MIS  I PN+FTFN+LINGFGK+G+M SA 
Sbjct: 304  LSPDVVTYTSIISGYCKLGKMEKASILFNNMISSGIKPNAFTFNILINGFGKVGDMVSAE 363

Query: 822  ATYENMLSRGCVPDVVTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINAL 643
              YE ML  GC PD++TFTSLIDGHCR G+ E  +KLW+E+  +N+SPN YTF++L NAL
Sbjct: 364  NMYEEMLLLGCPPDIITFTSLIDGHCRTGKVERSLKLWHELNARNLSPNEYTFAILTNAL 423

Query: 642  CKENRLNEARDLLRQLRWRNIVPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDK 463
            CKENRL+EAR  LR L+WR+IV +PF+YNPVIDGFCKAGNVDEANVI AEMEEKRC PDK
Sbjct: 424  CKENRLHEARGFLRDLKWRHIVAQPFMYNPVIDGFCKAGNVDEANVILAEMEEKRCKPDK 483

Query: 462  FTFTSLIIGHSMKGRMAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPSEANRIMVT 283
             T+T LIIGH MKGR++EAIS+F++M   GCAPD+IT+++ ISCLLKAGMP+EA RIM  
Sbjct: 484  ITYTILIIGHCMKGRLSEAISIFNRMLGTGCAPDSITMTSLISCLLKAGMPNEAYRIM-Q 542

Query: 282  FSKRDLGPALSSPRKSFSSTKNMDIPVA 199
             +  D    L S +++     N DIPVA
Sbjct: 543  IASEDFNLGLKSLKRNVPLRTNTDIPVA 570


>ref|XP_004141071.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Cucumis sativus]
          Length = 548

 Score =  565 bits (1456), Expect = e-158
 Identities = 302/555 (54%), Positives = 384/555 (69%), Gaps = 8/555 (1%)
 Frame = -2

Query: 1839 SRVLRAASTTFFISSFHDHAVQVSSQPDFRISDREVIRNSEAWIVKVVSTLCVVHGLKSS 1660
            SR  R  ++ F I+ F+  A  VS    F   DRE+IR+SEAW+VKVV TL      +S 
Sbjct: 8    SRAYRLRTSNFSIAQFYSLADSVSRARPF--CDREIIRHSEAWLVKVVCTLF----FRSH 61

Query: 1659 TDFACLEYFSKSFNPSIVYGVVRRLGNPKLALKFFEFCFCRLNGDELNHLVKSYNFLVKS 1480
            +  AC  Y S++ NPSI + V++R  +P L LKFFEF    L+   +NH   +Y+ L+++
Sbjct: 62   SLNACFGYLSRNLNPSIAFEVIKRFSDPLLGLKFFEFSRTHLS---INHTFNTYDLLMRN 118

Query: 1479 LCQAGLHESASKLVFD-----GHFPDGPVFDFLVSSCAQSGKLDIAKRLLVQA-CHGKIK 1318
            LC+ GL++SA K+VFD     G  PD  + + LVSS A+ GKLD AK  L +  C+G IK
Sbjct: 119  LCKVGLNDSA-KIVFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYG-IK 176

Query: 1317 XXXXXXXXXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIVTRGLCRLGKIDLAFE 1138
                            + VDEA+  F+ EHL   F PD  SFNI+ RGLCR+G+ID AFE
Sbjct: 177  VSPFVYNNLLNMLVKQNLVDEAVLLFR-EHLEPYFVPDVYSFNILIRGLCRIGEIDKAFE 235

Query: 1137 LFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYTSVISGY 958
             F +MG+F C PD+V+YNTLI+GFCR  ++ KG++LL +     G+SPDV+TYTS+ISGY
Sbjct: 236  FFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGY 295

Query: 957  CKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSRGCVPDV 778
            CKLG M+ AS L DEM+S  I PN FTFNVLI+GFGK+GNMRSA+  YE ML  GC+PDV
Sbjct: 296  CKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDV 355

Query: 777  VTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEARDLLRQ 598
            VTFTSLIDG+CR GE  +G+KLW EM  +N+SPN YT++VLINALCKENR+ EAR+ LR 
Sbjct: 356  VTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRH 415

Query: 597  LRWRNIVPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLIIGHSMKGR 418
            L+   +VP+PFIYNPVIDGFCKAG VDEAN I AEM+EK+C PDK TFT LIIG+ MKGR
Sbjct: 416  LKSSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGR 475

Query: 417  MAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPSEANRI-MVTFSKRDLG-PALSSP 244
            M EAIS F+KM  I C PD ITI++ ISCLLKAGMP+EA++I      K +LG  +L SP
Sbjct: 476  MVEAISTFYKMIEINCVPDEITINSLISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSP 535

Query: 243  RKSFSSTKNMDIPVA 199
                 + K+  +PVA
Sbjct: 536  ----LTRKSSRVPVA 546


>ref|XP_004157939.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Cucumis sativus]
          Length = 548

 Score =  564 bits (1453), Expect = e-158
 Identities = 301/545 (55%), Positives = 380/545 (69%), Gaps = 10/545 (1%)
 Frame = -2

Query: 1839 SRVLRAASTTFFISSFHDHAVQVSSQPDFRISDREVIRNSEAWIVKVVSTLCVVHGLKSS 1660
            SR  R  ++ F I+ F+  A  VS    F   DRE+IR+SEAW+VKVV TL      +S 
Sbjct: 8    SRAYRLRTSNFSIAQFYSLADSVSRARPF--CDREIIRHSEAWLVKVVCTLF----FRSH 61

Query: 1659 TDFACLEYFSKSFNPSIVYGVVRRLGNPKLALKFFEFCFCRLNGDELNHLVKSYNFLVKS 1480
            +  AC  Y S++ NPSI + V++R  +P L LKFFEF    L+   +NH   +Y+ L+++
Sbjct: 62   SLNACFGYLSRNLNPSIAFEVIKRFSDPLLGLKFFEFSRTHLS---INHTFNTYDLLMRN 118

Query: 1479 LCQAGLHESASKLVFD-----GHFPDGPVFDFLVSSCAQSGKLDIAKRLLVQA-CHGKIK 1318
            LC+ GL++SA K+VFD     G  PD  + + LVSS A+ GKLD AK  L +  C+G IK
Sbjct: 119  LCKVGLNDSA-KIVFDCMRSDGILPDSSILELLVSSYARMGKLDSAKNFLNEVHCYG-IK 176

Query: 1317 XXXXXXXXXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIVTRGLCRLGKIDLAFE 1138
                            + VDEA+  F+ EHL   F PD  SFNI+ RGLCR+G+ID AFE
Sbjct: 177  VSPFVYNNLLNMLVKQNLVDEAVLLFR-EHLEPYFVPDVYSFNILIRGLCRIGEIDKAFE 235

Query: 1137 LFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYTSVISGY 958
             F +MG+F C PD+V+YNTLI+GFCR  ++ KG++LL +     G+SPDV+TYTS+ISGY
Sbjct: 236  FFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKEDMLIKGVSPDVITYTSIISGY 295

Query: 957  CKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSRGCVPDV 778
            CKLG M+ AS L DEM+S  I PN FTFNVLI+GFGK+GNMRSA+  YE ML  GC+PDV
Sbjct: 296  CKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGNMRSAMVMYEKMLLLGCLPDV 355

Query: 777  VTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEARDLLRQ 598
            VTFTSLIDG+CR GE  +G+KLW EM  +N+SPN YT++VLINALCKENR+ EAR+ LR 
Sbjct: 356  VTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAVLINALCKENRIREARNFLRH 415

Query: 597  LRWRNIVPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLIIGHSMKGR 418
            L+   +VP+PFIYNPVIDGFCKAG VDEAN I AEM+EK+C PDK TFT LIIG+ MKGR
Sbjct: 416  LKSSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEKKCRPDKITFTILIIGNCMKGR 475

Query: 417  MAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPSEANRI-MVTFSKRDLG-PALSSP 244
            M EAIS F+KM  I C PD ITI++ ISCLLKAGMP+EA++I      K +LG  +L SP
Sbjct: 476  MVEAISTFYKMIEINCVPDEITINSLISCLLKAGMPNEASQIKQAALQKLNLGLSSLGSP 535

Query: 243  --RKS 235
              RKS
Sbjct: 536  LTRKS 540


>ref|XP_007012633.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508782996|gb|EOY30252.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 592

 Score =  560 bits (1442), Expect = e-156
 Identities = 303/571 (53%), Positives = 387/571 (67%), Gaps = 7/571 (1%)
 Frame = -2

Query: 1887 NGVRFQMTLFLLPTSRSRVLRAASTTFFISSFHDHAVQVSSQPDFRISDREV--IRNSEA 1714
            NG+  QMTLF   T  SRV RAAS  F I  FH   +Q    P  +  ++EV  I+  EA
Sbjct: 37   NGLGLQMTLFSFTTRASRV-RAASKVF-IPHFH---IQFHGGPHPQ-GNKEVKAIQKHEA 90

Query: 1713 WIVKVVSTLCVVHGLKSSTDFACLEYFSKSFNPSIVYGVVRRLGNPKLALKFFEFCFCRL 1534
            W VKVV TL V        D +CL Y SK+  P I + VV+ L NP L LKF EF     
Sbjct: 91   WFVKVVCTLFVY---SQPLDDSCLSYLSKNLTPLIEFEVVKWLNNPALGLKFLEFSRVNF 147

Query: 1533 NGDELNHLVKSYNFLVKSLCQAGLHESASKLVFD-----GHFPDGPVFDFLVSSCAQSGK 1369
            N   + H   +YN L++S C  GLH+SA KLVFD     GH PD  +  F++SS  ++G+
Sbjct: 148  N---IAHSFWTYNLLMRSFCHMGLHDSA-KLVFDYMRIDGHLPDTTILGFMISSFGRAGE 203

Query: 1368 LDIAKRLLVQACHGKIKXXXXXXXXXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFN 1189
              +AK+LL      ++                 + ++EA+  +K E+L   F PD  +FN
Sbjct: 204  FGMAKKLLADVQSDEVVISIFALNNLLNMMVKQNKLEEAVSLYK-ENLGSNFYPDAWTFN 262

Query: 1188 IVTRGLCRLGKIDLAFELFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPK 1009
            I+ RGLCR+GK+D AFELFNDMGSF C PD+VTYNT+I+G C+  +VD+G++LL Q++ +
Sbjct: 263  ILIRGLCRVGKVDQAFELFNDMGSFGCFPDIVTYNTIINGLCKVNEVDRGHKLLNQVQSR 322

Query: 1008 TGLSPDVVTYTSVISGYCKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRS 829
               SPDVVTYTSVISGYCKLGKM+EASAL  EMIS   +P   TFNVLI+GFGK+G+M S
Sbjct: 323  DDCSPDVVTYTSVISGYCKLGKMDEASALFHEMISSGTVPTVVTFNVLIDGFGKVGDMVS 382

Query: 828  AVATYENMLSRGCVPDVVTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLIN 649
            A + YE M S GC+ DVVTFTSLIDG+CRIG+  + ++LWN M  +++SPN YTF++ IN
Sbjct: 383  AKSMYEQMASFGCIADVVTFTSLIDGYCRIGDVNQSLQLWNTMKGRDLSPNVYTFAITIN 442

Query: 648  ALCKENRLNEARDLLRQLRWRNIVPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNP 469
            ALCKENRL+EAR  LR+L+ RNIVP+PFI+NPVIDGFCKAGN+DEAN+I AEMEEK+C+P
Sbjct: 443  ALCKENRLHEARGFLRELQCRNIVPKPFIFNPVIDGFCKAGNLDEANLIVAEMEEKQCHP 502

Query: 468  DKFTFTSLIIGHSMKGRMAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPSEANRIM 289
            DK TFT LIIGH MKGRM EAIS+F+KM S+GC PD +T+++ ISCLLKAGMPSEA+RI 
Sbjct: 503  DKVTFTILIIGHCMKGRMFEAISIFNKMLSVGCTPDDVTVNSLISCLLKAGMPSEASRI- 561

Query: 288  VTFSKRDLGPALSSPRKSFSSTKNMDIPVAA 196
               +  D+    S    +     N  +PVAA
Sbjct: 562  TKMASEDMKLGSSLLENNSPLRINRGVPVAA 592


>ref|XP_006452806.1| hypothetical protein CICLE_v10007804mg [Citrus clementina]
            gi|557556032|gb|ESR66046.1| hypothetical protein
            CICLE_v10007804mg [Citrus clementina]
          Length = 595

 Score =  556 bits (1433), Expect = e-155
 Identities = 308/571 (53%), Positives = 387/571 (67%), Gaps = 7/571 (1%)
 Frame = -2

Query: 1887 NGVRFQMTLFLLPTSRSRVLRAASTTFFISSFHDHAVQVSSQPDFRISDREVIR--NSEA 1714
            NGV   MTL       SRV   AST   I+ FH  A    S+P     ++EV    ++E 
Sbjct: 40   NGVGLPMTLLFFTVRPSRV--RASTIAAIAHFHGLA-NGGSRP---FDEKEVNYRCSNEF 93

Query: 1713 WIVKVVSTLCVVHGLKSSTDFACLEYFSKSFNPSIVYGVVRRLGNPKLALKFFEFCFCRL 1534
            W VKVV TL +     S T   C  Y  +  +P     V++RL NPKL LKF EF    L
Sbjct: 94   WFVKVVCTLLLRSSYLSDT---CARYLCEKLSPLNSLEVIKRLDNPKLGLKFLEFSRVNL 150

Query: 1533 NGDELNHLVKSYNFLVKSLCQAGLHESASKLVFD-----GHFPDGPVFDFLVSSCAQSGK 1369
            +   LNH  K+YN +++SLC+ GLH+S  ++VFD     GH P+ P+ +F VSSC ++GK
Sbjct: 151  S---LNHSFKTYNLVMRSLCEMGLHDSV-QVVFDYMRSDGHLPNSPMIEFFVSSCIRAGK 206

Query: 1368 LDIAKRLLVQACHGKIKXXXXXXXXXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFN 1189
             D AK LL Q   G++                 +  DEA+  FK E+ RL   PDT +FN
Sbjct: 207  CDAAKGLLSQFRPGEVTMSTFMYNSLLNALVKQNNADEAVYMFK-EYFRLYSQPDTWTFN 265

Query: 1188 IVTRGLCRLGKIDLAFELFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPK 1009
            I+ RGLCR+G++  AFE F DMGSF CSPD+VTYNTLI G CR  +V +G+ELL +++ K
Sbjct: 266  ILIRGLCRIGEVKKAFEFFYDMGSFGCSPDIVTYNTLISGLCRVNEVARGHELLKEVKFK 325

Query: 1008 TGLSPDVVTYTSVISGYCKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRS 829
            +   PDVVTYTSVISGYCKLGKM++A+++ +EM S  I P++ TFNVLI+GFGK+GNM S
Sbjct: 326  SEFLPDVVTYTSVISGYCKLGKMDKATSIYNEMNSCGIKPSAVTFNVLIDGFGKVGNMVS 385

Query: 828  AVATYENMLSRGCVPDVVTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLIN 649
            A    E MLS G +PDVVTF+SLIDG+CR G+  +G+KL +EM  KN+SPN YTF++LIN
Sbjct: 386  AEYMRERMLSLGYLPDVVTFSSLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFAILIN 445

Query: 648  ALCKENRLNEARDLLRQLRWRNIVPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNP 469
            ALCKENRLN+AR  L+QL+W ++VP+PF+YNPVIDGFCKAGNVDEANVI AEMEEKRC P
Sbjct: 446  ALCKENRLNDARRFLKQLKWNDLVPKPFMYNPVIDGFCKAGNVDEANVIVAEMEEKRCKP 505

Query: 468  DKFTFTSLIIGHSMKGRMAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPSEANRIM 289
            DK TFT LIIGH MKGRM EAIS+F+KM  IGCAPD IT+++ ISCLLK GMP+EA RIM
Sbjct: 506  DKVTFTILIIGHCMKGRMVEAISIFNKMLRIGCAPDDITVNSLISCLLKGGMPNEAFRIM 565

Query: 288  VTFSKRDLGPALSSPRKSFSSTKNMDIPVAA 196
               S+ D    L S +K+     N DIPVAA
Sbjct: 566  QRASE-DQNLQLPSWKKAVPLRTNTDIPVAA 595


>ref|XP_006474728.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            isoform X1 [Citrus sinensis]
            gi|568841566|ref|XP_006474729.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X2 [Citrus sinensis]
          Length = 595

 Score =  555 bits (1431), Expect = e-155
 Identities = 308/571 (53%), Positives = 388/571 (67%), Gaps = 7/571 (1%)
 Frame = -2

Query: 1887 NGVRFQMTLFLLPTSRSRVLRAASTTFFISSFHDHAVQVSSQPDFRISDREVIR--NSEA 1714
            NGV   MTL       SRV   AST   I+ FH  A    S+P     ++EV    ++E 
Sbjct: 40   NGVGLPMTLLFFTVRPSRV--RASTIAAIAHFHGLA-NGGSRP---FDEKEVNYRCSNEF 93

Query: 1713 WIVKVVSTLCVVHGLKSSTDFACLEYFSKSFNPSIVYGVVRRLGNPKLALKFFEFCFCRL 1534
            W VKVV TL +     S T   C  Y  +  +P     V++RL NPKL LKF EF    L
Sbjct: 94   WFVKVVCTLLLRSSYLSDT---CARYLCEKLSPLNSLEVIKRLDNPKLGLKFLEFSRVNL 150

Query: 1533 NGDELNHLVKSYNFLVKSLCQAGLHESASKLVFD-----GHFPDGPVFDFLVSSCAQSGK 1369
            +   LNH  K+YN +++SLC+ GLH+S  ++VFD     GH P+ P+ +F VSSC ++GK
Sbjct: 151  S---LNHSFKTYNLVMRSLCEMGLHDSV-QVVFDYMRSDGHLPNSPMIEFFVSSCIRAGK 206

Query: 1368 LDIAKRLLVQACHGKIKXXXXXXXXXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFN 1189
             D AK LL Q   G++                 +  DEA+  FK E+ RL   PDT +FN
Sbjct: 207  CDAAKGLLSQFRPGEVTMSTFMYNSLLNALVKQNNADEAVYMFK-EYFRLYSQPDTWTFN 265

Query: 1188 IVTRGLCRLGKIDLAFELFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPK 1009
            I+ +GL R+G++  AFE F DMGSF CSPD+VTYNTLI G CR  +V +G+ELL +++ K
Sbjct: 266  ILIQGLSRIGEVKKAFEFFYDMGSFGCSPDIVTYNTLISGLCRVNEVARGHELLKEVKFK 325

Query: 1008 TGLSPDVVTYTSVISGYCKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRS 829
            +  SPDVVTYTSVISGYCKLGKM++A+ + +EM S  I P++ TFNVLI+GFGK+GNM S
Sbjct: 326  SEFSPDVVTYTSVISGYCKLGKMDKATGIYNEMNSCGIKPSAVTFNVLIDGFGKVGNMVS 385

Query: 828  AVATYENMLSRGCVPDVVTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLIN 649
            A    E MLS G +PDVVTF+SLIDG+CR G+  +G+KL +EM  KN+SPN YTF++LIN
Sbjct: 386  AEYMRERMLSFGYLPDVVTFSSLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFTILIN 445

Query: 648  ALCKENRLNEARDLLRQLRWRNIVPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNP 469
            ALCKENRLN+AR  L+QL+W ++VP+PF+YNPVIDGFCKAGNVDEANVI AEMEEKRC P
Sbjct: 446  ALCKENRLNDARRFLKQLKWNDLVPKPFMYNPVIDGFCKAGNVDEANVIVAEMEEKRCKP 505

Query: 468  DKFTFTSLIIGHSMKGRMAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPSEANRIM 289
            DK TFT LIIGH MKGRM EAIS+F+KM +IGCAPD IT+++ ISCLLK GMP+EA RIM
Sbjct: 506  DKVTFTILIIGHCMKGRMVEAISIFNKMLTIGCAPDDITVNSLISCLLKGGMPNEAFRIM 565

Query: 288  VTFSKRDLGPALSSPRKSFSSTKNMDIPVAA 196
               S+ DL   L S +K+     N DIPVAA
Sbjct: 566  QRASE-DLNLQLPSWKKAVPLRTNTDIPVAA 595


>ref|XP_004301429.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Fragaria vesca subsp. vesca]
          Length = 583

 Score =  532 bits (1370), Expect = e-148
 Identities = 292/570 (51%), Positives = 388/570 (68%), Gaps = 7/570 (1%)
 Frame = -2

Query: 1887 NGVRFQMTLFLLPTSRSRVLRAASTTFFISSFHDHAVQVSSQPDFRISDREVIRNSEAWI 1708
            +G+  QM+L       S   RA+     I++ H H +   ++P     +REV+ N EAW 
Sbjct: 34   DGMAVQMSLLFFTARPSFWGRASK----IAASHLHTL-AGARPR---PEREVLLNPEAWF 85

Query: 1707 VKVVSTLCVVHGLKSSTDFACLEYFSKSFNPSIVYGVVRRLGNPKLALKFFEFCFCRLNG 1528
            VKVV TL     L+S +  + + Y SK+  PS+ + V++RL NPKL L+FFE     LN 
Sbjct: 86   VKVVYTLF----LRSHSLDSYVGYLSKNLTPSLAFEVIKRLNNPKLGLRFFELSKFSLN- 140

Query: 1527 DELNHLVKSYNFLVKSLCQAGLHESASKLVFD-----GHFPDGPVFDFLVSSCAQSGKLD 1363
              +NH V +Y++L++SLCQ GL +SA KLVFD     G  P+  V +FLVSSCAQ G+ D
Sbjct: 141  --VNHGVWTYHYLLRSLCQMGLQDSA-KLVFDYMRTDGLSPNESVLEFLVSSCAQMGRSD 197

Query: 1362 IAKRLLVQACHGKIKXXXXXXXXXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIV 1183
            +A+++L +     +                 + VDEA+C F+ +++    CPD+ +FNI+
Sbjct: 198  LAEKILDEVHCSVVGLSSFVYNNLFNVLVKLNRVDEAVCLFR-KYVGSYCCPDSWTFNIL 256

Query: 1182 TRGLCRLGKIDLAFELFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTG 1003
             RGLCR G +D   E F+DM SF CSP+VVTYNTLI G CRA +VD+G +LL +++ ++ 
Sbjct: 257  IRGLCRTGAVDKGLEFFSDMRSFGCSPNVVTYNTLISGLCRAHEVDRGCDLLREVQFRSE 316

Query: 1002 LSPDVVTYTSVISGYCKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAV 823
            LSPDV+T+TSVISGYCKLG+MEEASA+ DEMI   + P + TFN LI+G+GK G+M SA 
Sbjct: 317  LSPDVITFTSVISGYCKLGRMEEASAIFDEMIGCGLKPTAVTFNALIDGYGKAGDMSSAF 376

Query: 822  ATYENMLSRGCVPDVVTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINAL 643
            + YE+ML  G   DV+TFTSLIDG+CR G    G++LW+EM  KN+SP+AYTFSVLINAL
Sbjct: 377  SLYESMLFHGHCADVITFTSLIDGYCRAGHLNHGLQLWHEMNAKNVSPSAYTFSVLINAL 436

Query: 642  CKENRLNEARDLLRQLRWRNIVPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDK 463
            CK NRL EARDLLR+L+  N+VP+ F+YNPVIDG CKAGN+DEAN+I AEMEEK+C PD+
Sbjct: 437  CKGNRLCEARDLLRELKGSNVVPKSFLYNPVIDGLCKAGNIDEANLIVAEMEEKKCTPDR 496

Query: 462  FTFTSLIIGHSMKGRMAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPSEANRI-MV 286
             TFT LI+G+SMKGRM+EAI  F KM SIGCAPD ITI + ISCL KAGMPSEA RI  +
Sbjct: 497  VTFTILILGNSMKGRMSEAIGNFSKMLSIGCAPDKITIDSLISCLSKAGMPSEAGRIKKI 556

Query: 285  TFSKRDLG-PALSSPRKSFSSTKNMDIPVA 199
             +   ++G P++  P       +  +IPVA
Sbjct: 557  AYEDLNMGAPSMGRP----PHLRANEIPVA 582


>ref|XP_002512275.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223548236|gb|EEF49727.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 532

 Score =  527 bits (1358), Expect = e-147
 Identities = 285/543 (52%), Positives = 367/543 (67%), Gaps = 7/543 (1%)
 Frame = -2

Query: 1803 ISSFHDHAVQVSSQPDFRISDREVI-RNSEAWIVKVVSTLCVVHGLKSSTDFACLEYFSK 1627
            ++ FHD+       P    SD+EVI +N EAW VKV++ L V       +D   L Y S+
Sbjct: 1    MAHFHDYTKGGGFHP---FSDKEVIVKNQEAWFVKVIAILFV---RSHCSDATSLGYLSE 54

Query: 1626 SFN-PSIVYGVVRRLGN-PKLALKFFEFCFCRLNGDELNHLVKSYNFLVKSLCQAGLHES 1453
              N P + + V++RL N P++ LKF EFC  RLN   L H   +Y  L++SLCQ GLH+ 
Sbjct: 55   KLNDPLVAFEVIKRLNNNPQVGLKFMEFC--RLNFS-LIHCFSTYELLIRSLCQMGLHDL 111

Query: 1452 ASKLV----FDGHFPDGPVFDFLVSSCAQSGKLDIAKRLLVQACHGKIKXXXXXXXXXXX 1285
               ++     DGH  D  V  FLV+S AQ+GK D+AK+L+++    + +           
Sbjct: 112  VEMVIGYMRSDGHLIDSRVLGFLVTSFAQAGKFDLAKKLIIEVQGEEARISSFVYNYLLN 171

Query: 1284 XXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIVTRGLCRLGKIDLAFELFNDMGSFDCS 1105
                   V EAI  FK E+L     P+T +FNI+ RGLCR+G+++  FELFN M SF C 
Sbjct: 172  ELVKGGKVHEAIFLFK-ENLAFHSPPNTWTFNILIRGLCRVGEVEKGFELFNAMQSFGCL 230

Query: 1104 PDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYTSVISGYCKLGKMEEASA 925
            PDVVTYNTLI G C+A ++D+  +LL +++ +   SPDV+TYTS+ISG+ KLGK+E AS 
Sbjct: 231  PDVVTYNTLISGLCKANELDRACDLLKEVQSRNDCSPDVMTYTSIISGFRKLGKLEAASV 290

Query: 924  LLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSRGCVPDVVTFTSLIDGHC 745
            L +EMI   I P   TFNVLI+GFGKIGNM +A A +E M S  C+PDVVTFTSLIDG+C
Sbjct: 291  LFEEMIRSGIEPTVVTFNVLIDGFGKIGNMVAAEAMHEKMASYSCIPDVVTFTSLIDGYC 350

Query: 744  RIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEARDLLRQLRWRNIVPRPF 565
            R G+   G+K+W+ M  +N+SPN YT+SV+INALCK+NR++EARDLLRQL+  ++ P+PF
Sbjct: 351  RTGDIRLGLKVWDVMKARNVSPNIYTYSVIINALCKDNRIHEARDLLRQLKCSDVFPKPF 410

Query: 564  IYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLIIGHSMKGRMAEAISLFHKM 385
            IYNPVIDGFCKAGNVDEANVI  EMEEKRC PDK TFT LIIGH MKGRM EA+ +F KM
Sbjct: 411  IYNPVIDGFCKAGNVDEANVIVTEMEEKRCRPDKVTFTILIIGHCMKGRMVEALDIFKKM 470

Query: 384  SSIGCAPDTITISTFISCLLKAGMPSEANRIMVTFSKRDLGPALSSPRKSFSSTKNMDIP 205
             +IGCAPD ITIS+ ++CLLKAG PSEA  I+ T S+ DL  + SS RK+F      DI 
Sbjct: 471  LAIGCAPDNITISSLVACLLKAGKPSEAFHIVQTASE-DLNLSFSSLRKTFPMRVKTDIS 529

Query: 204  VAA 196
            VAA
Sbjct: 530  VAA 532


>gb|EXB38956.1| hypothetical protein L484_027391 [Morus notabilis]
          Length = 570

 Score =  521 bits (1341), Expect = e-145
 Identities = 280/513 (54%), Positives = 349/513 (68%), Gaps = 6/513 (1%)
 Frame = -2

Query: 1755 FRISDREVIRNSEAWIVKVVSTLCVVHGLKSSTDFACLEYFSKSFNPSIVYGVVRRLGN- 1579
            F    +EVI  SEAW VKVVSTL V    +S +      Y SK   PSI + V++RL N 
Sbjct: 61   FHHKRKEVISYSEAWFVKVVSTLFV----RSQSLNTFFGYLSKKLTPSISFEVIKRLNNN 116

Query: 1578 PKLALKFFEFCFCRLNGDELNHLVKSYNFLVKSLCQAGLHESASKLVFD-----GHFPDG 1414
            P L LKFFE     L+   +NH   +YN L++SLCQ G H+SA K VFD     GH PD 
Sbjct: 117  PNLGLKFFELSRANLS---VNHSFSTYNLLIRSLCQMGFHDSA-KFVFDCMRIDGHSPDN 172

Query: 1413 PVFDFLVSSCAQSGKLDIAKRLLVQACHGKIKXXXXXXXXXXXXXXXNSWVDEAICFFKD 1234
               +FLV   A+ GKLD  ++LL +     I+               N+ V EA+C F+ 
Sbjct: 173  STIEFLVCVFAKVGKLDSCEKLLEE-----IRASKFVYSSLFNVLVKNNKVYEAVCLFRK 227

Query: 1233 EHLRLGFCPDTCSFNIVTRGLCRLGKIDLAFELFNDMGSFDCSPDVVTYNTLIDGFCRAK 1054
            + +   F PDT +FNI+  GLC +G++  AFE FNDMG F CSPDVVTYNTLI G CR  
Sbjct: 228  Q-IGSHFVPDTWTFNILIGGLCGVGEVHSAFEFFNDMGKFRCSPDVVTYNTLISGLCRTN 286

Query: 1053 DVDKGYELLTQIRPKTGLSPDVVTYTSVISGYCKLGKMEEASALLDEMISKRIMPNSFTF 874
            +VD+G +LL +++ +   SP+V T+TSVI GYCKLG+MEEASAL DEM+     P + TF
Sbjct: 287  EVDRGCDLLREVQLRGDFSPNVRTFTSVILGYCKLGRMEEASALFDEMMDSGTRPTTVTF 346

Query: 873  NVLINGFGKIGNMRSAVATYENMLSRGCVPDVVTFTSLIDGHCRIGETEEGMKLWNEMGT 694
            NVLI+ F K+G+M SA+A YE ML  G  PDVVTFTSLIDG+CR+G+   G+KLW EM  
Sbjct: 347  NVLIDAFSKVGDMASAIALYEKMLFHGYRPDVVTFTSLIDGYCRVGQLNRGLKLWCEMSV 406

Query: 693  KNISPNAYTFSVLINALCKENRLNEARDLLRQLRWRNIVPRPFIYNPVIDGFCKAGNVDE 514
            +N+SPN YT+SV+I+ALCK NRL+EARDLLRQL   NIVP+PF+YNPVIDGFCKAGNVDE
Sbjct: 407  RNVSPNGYTYSVVIHALCKVNRLHEARDLLRQLNCTNIVPKPFMYNPVIDGFCKAGNVDE 466

Query: 513  ANVIFAEMEEKRCNPDKFTFTSLIIGHSMKGRMAEAISLFHKMSSIGCAPDTITISTFIS 334
            AN+I AEMEEKRCNPDK TFT LI+G+ MKGRM +AI +F+KM ++GCAPD IT+   +S
Sbjct: 467  ANMIVAEMEEKRCNPDKMTFTILILGNCMKGRMVDAIGVFYKMLAVGCAPDKITVHCLMS 526

Query: 333  CLLKAGMPSEANRIMVTFSKRDLGPALSSPRKS 235
            CLLKAGMP+EA  I  T  K  L   +SS R +
Sbjct: 527  CLLKAGMPNEAFHIKETVMK-SLNVGMSSLRSN 558


>gb|EYU18527.1| hypothetical protein MIMGU_mgv1a003955mg [Mimulus guttatus]
          Length = 552

 Score =  512 bits (1318), Expect = e-142
 Identities = 293/556 (52%), Positives = 360/556 (64%), Gaps = 16/556 (2%)
 Frame = -2

Query: 1818 STTFFISSFHDHAVQV-SSQPDFRISDREVIRNSEAWIVKVVSTLCVVHGLKSSTDFACL 1642
            S  F  S FH  +  V SS P    S      +S  W VKVV TLC+      S  F   
Sbjct: 11   SKVFLASLFHGRSSLVESSSPPSPSSP-----SSTFWFVKVVCTLCIRRS--PSLAFVET 63

Query: 1641 EYFSKSFNPSIVYGVV----RRLGNPKLALKFFEFCFCRLNGDELNHLVKSYNFLVKSLC 1474
            +YF  + NPS+ + VV     RL NP LA  FF     RLN   L HL  +++ L++SLC
Sbjct: 64   DYFRVNLNPSVAFAVVYHINSRLNNPDLAFTFFRCSRLRLN---LIHLEPTFDLLLRSLC 120

Query: 1473 QAGLHESASKLVF-----DGHFPDGPVFDFLVSSCAQSGKLDIAKRLLV---QACHGKIK 1318
            Q G H+SA +LV+     DG  PD  V DF+VSS A +GK  IA+ +L+   + C+ K +
Sbjct: 121  QMGRHDSA-ELVYQYMKSDGFLPDSSVLDFVVSSFANAGKFRIAEEILIARAEYCNEKDE 179

Query: 1317 XXXXXXXXXXXXXXXN-SWVDEAICFFKDEHLRL-GFCPDTCSFNIVTRGLCRLGKIDLA 1144
                           N + +D+A+ FFK   LRL  FCPDTCSFNIV RGLCR  K+D A
Sbjct: 180  LVSSFVYNNFLSMLTNKNRIDDAVLFFKSHILRLKSFCPDTCSFNIVMRGLCRASKVDKA 239

Query: 1143 FELFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYTSVIS 964
            FE F+ M SF CSPD+VTYNTLI+G CR   VD+  ELL +I+ ++  S DVVTYTSVIS
Sbjct: 240  FEFFDVMRSFSCSPDLVTYNTLINGLCRVGKVDRAEELLREIKVQSEFSADVVTYTSVIS 299

Query: 963  GYCKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSRGCVP 784
            GYCKLGK + A+ L +EMI+  I PN FTFN +I+GFGK G + SA   YE M + G  P
Sbjct: 300  GYCKLGKTDAAAFLFEEMINNGIRPNLFTFNAIIDGFGKKGEVASASKMYERMTATGFRP 359

Query: 783  DVVTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEARDLL 604
            DVVTFTSLIDGHCR G+  +G+ L NEM  K +SPN +TFSVLI+ALCKENRLNEARDLL
Sbjct: 360  DVVTFTSLIDGHCRCGDLGQGIHLLNEMNEKRVSPNVFTFSVLISALCKENRLNEARDLL 419

Query: 603  RQLRWR-NIVPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLIIGHSM 427
             QL+WR +IVP PF+YNPVIDG+CKAGNVDEAN I AEME K C  DK TFT LI+GH M
Sbjct: 420  NQLKWREDIVPPPFVYNPVIDGYCKAGNVDEANAIVAEMEAKGCVHDKMTFTILILGHCM 479

Query: 426  KGRMAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPSEANRIMVTFSKRDLGPALSS 247
            KGRM EAI +++KM S+GC PD IT+S+ ISCL KAGM  EAN I     ++ L    SS
Sbjct: 480  KGRMFEAIGMYNKMLSVGCVPDNITMSSLISCLRKAGMAREANEI----EQKALFSVSSS 535

Query: 246  PRKSFSSTKNMDIPVA 199
             ++S     NM++ VA
Sbjct: 536  SKRSNPVRNNMNVTVA 551


>ref|XP_006351831.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            isoform X1 [Solanum tuberosum]
            gi|565370447|ref|XP_006351832.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X2 [Solanum tuberosum]
          Length = 550

 Score =  497 bits (1279), Expect = e-137
 Identities = 281/560 (50%), Positives = 359/560 (64%), Gaps = 16/560 (2%)
 Frame = -2

Query: 1830 LRAASTTFFISSFHDHAVQVSSQPDFRISDREVIRNSEAWIVKVVSTLCVVHGLKSSTDF 1651
            ++ AS    I+ FH       S P +      V      W  KVV  LC  H    S D 
Sbjct: 5    VQRASNILLIARFHG-LTSSKSIPSYGPGPEAV------WFTKVVCLLCFHHS--QSLDV 55

Query: 1650 ACLEYFSKSFNPSIVYGVVRR----LGNPKLALKFFEFCFCRLNGDELNHLVKSYNFLVK 1483
               +YF ++ +P I + V+      L NP+LA +F +     LN   L H + S+N L++
Sbjct: 56   FGSDYFRQNLDPHIAFTVIHHINTNLNNPRLAFRFLQCTRINLN---LVHCIGSFNLLLR 112

Query: 1482 SLCQAGLHESASKLVF-----DGHFPDGPVFDFLVSSCAQSGKLDIAKRLLVQACH---- 1330
            SL Q G H+SA  LVF     DG+  +  + + +V + A +GK +IAK +L+        
Sbjct: 113  SLSQMGFHDSAM-LVFKFMKADGYLLENSILESVVLALANAGKFEIAKEILISQAELGRE 171

Query: 1329 -GKIKXXXXXXXXXXXXXXXNSWVDEAICFFKDEHLRLG-FCPDTCSFNIVTRGLCRLGK 1156
             G+I                 S VDEA+ FFK   LR     PDTC+FN V RGLCR+G 
Sbjct: 172  EGRI-VRPFVHNSLLSLLMKRSRVDEAVDFFKHHILRSERLFPDTCTFNTVIRGLCRVGG 230

Query: 1155 IDLAFELFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYT 976
            +D AFE FNDMGSF C PD VTYNTLI+G C    V++   LL  +  + GLSPDVVTYT
Sbjct: 231  VDKAFEFFNDMGSFGCFPDTVTYNTLINGLCSVGQVNRARGLLGNLELQDGLSPDVVTYT 290

Query: 975  SVISGYCKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSR 796
            SVI+GYCKLG+M+EA  L+DEM +  I PN  TFN+LINGFGKIG+M SA+  Y  M + 
Sbjct: 291  SVIAGYCKLGRMDEAINLMDEMTTYGISPNLVTFNILINGFGKIGDMFSAIQMYGRMCAV 350

Query: 795  GCVPDVVTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEA 616
            G  PDVVTFTSLIDG+CR GE ++G+KLW+EM T+N+SPN YTFS+LI+AL KENRLNEA
Sbjct: 351  GYPPDVVTFTSLIDGYCRTGELDQGLKLWDEMNTRNLSPNLYTFSILISALSKENRLNEA 410

Query: 615  RDLLRQLRWR-NIVPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLII 439
            R+LLRQL+ R +IVP+PF+YNPV+DGFCKAGN+ +ANVI AEME + C  DK TFT LI+
Sbjct: 411  RELLRQLKSRDDIVPQPFVYNPVLDGFCKAGNLSKANVIAAEMESRGCCHDKITFTILIL 470

Query: 438  GHSMKGRMAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPSEANRIMVTFSKRDLGP 259
            GH MKGRM EA+++F KM S+GC PD IT+S   SCLLKAGM  EA ++ +T SK DL P
Sbjct: 471  GHCMKGRMLEAMAIFDKMLSLGCVPDDITVSCLTSCLLKAGMVKEAYKVRLTPSK-DLNP 529

Query: 258  ALSSPRKSFSSTKNMDIPVA 199
             LSS ++S     ++DIPVA
Sbjct: 530  DLSSSKQSVPFRTSLDIPVA 549


>ref|XP_006381622.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550336330|gb|ERP59419.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 511

 Score =  497 bits (1279), Expect = e-137
 Identities = 259/486 (53%), Positives = 331/486 (68%), Gaps = 6/486 (1%)
 Frame = -2

Query: 1638 YFSKSFNPSIVYGVVRRLGNPKLALKFFEFCFCRLNGDELNHLVKSYNFLVKSLCQAGLH 1459
            Y  +   P I + V++R  NPK+  KF EF    LN   +NH   +YN L++SLCQ G H
Sbjct: 33   YPDRQLTPLIAFEVIKRFNNPKVGFKFLEFSRLNLN---VNHCYSTYNLLMRSLCQMGHH 89

Query: 1458 ESASKLVFD-----GHFPDGPVFDFLVSSCAQSGKLDIAKRLLVQACHGKIKXXXXXXXX 1294
            +  + +VFD     GH PD  +  FLV+  AQ+   D+ K+LL +    +++        
Sbjct: 90   DLVN-IVFDYMGSDGHLPDSKLLGFLVTWMAQASDFDMVKKLLAEVQGKEVRINSFVYNN 148

Query: 1293 XXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIVTRGLCRLGKIDLAFELFNDMGSF 1114
                    + V EAI  FK E+L +   PDT +FNI+ RGLCR+G +D AFE+F DM SF
Sbjct: 149  LLSVLVKQNQVHEAIYLFK-EYLAMQ-SPDTWTFNILIRGLCRVGGVDRAFEVFKDMESF 206

Query: 1113 DCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYTSVISGYCKLGKMEE 934
             C PDVVTYNTLI+G C+A +V +G EL  +I+ ++  SPD+VTYTS+ISG+CK GKM+E
Sbjct: 207  GCLPDVVTYNTLINGLCKANEVQRGCELFKEIQSRSDCSPDIVTYTSIISGFCKSGKMKE 266

Query: 933  ASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSRGCVPDVVTFTSLID 754
            AS L +EM+   I PN  TFNVLI+GFGKIGN+  A A Y  M    C  DVVTFTSLID
Sbjct: 267  ASNLFEEMMRSGIQPNVITFNVLIDGFGKIGNIAEAEAMYRKMAYFDCSADVVTFTSLID 326

Query: 753  GHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEARDLLRQLRWRNIVP 574
            G+CR G+   G+K WN M T+N+SP  YT++VLINALCKENRLNEARD L Q++  +I+P
Sbjct: 327  GYCRAGQVNHGLKFWNVMKTRNVSPTVYTYAVLINALCKENRLNEARDFLGQIKNSSIIP 386

Query: 573  RPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLIIGHSMKGRMAEAISLF 394
            +PF+YNPVIDGFCKAGNVDE NVI  EMEEKRC+PDK TFT LIIGH +KGRM EAI++F
Sbjct: 387  KPFMYNPVIDGFCKAGNVDEGNVILKEMEEKRCDPDKVTFTILIIGHCVKGRMFEAINIF 446

Query: 393  HKMSSIGCAPDTITISTFISCLLKAGMPSEANRI-MVTFSKRDLGPALSSPRKSFSSTKN 217
            ++M +  CAPD IT+++ ISCLLKAGMP+EA RI  +    R+LG  LSS  K+     N
Sbjct: 447  NRMLATRCAPDNITVNSLISCLLKAGMPNEAYRIRKMALEDRNLG--LSSFEKAIPLRTN 504

Query: 216  MDIPVA 199
             DIPVA
Sbjct: 505  TDIPVA 510


>ref|XP_004230611.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            [Solanum lycopersicum]
          Length = 550

 Score =  493 bits (1268), Expect = e-136
 Identities = 274/520 (52%), Positives = 347/520 (66%), Gaps = 15/520 (2%)
 Frame = -2

Query: 1713 WIVKVVSTLCVVHGLKSSTDFACLEYFSKSFNPSIVYGVVRR----LGNPKLALKFFEFC 1546
            W  KVV  LC  H    S D    +YF ++ +P I + V+      L NP+LA +F +  
Sbjct: 37   WFTKVVCLLCFHHS--QSLDVFGSDYFRQNLDPHIAFTVIHHINTNLNNPRLAFRFLQCT 94

Query: 1545 FCRLNGDELNHLVKSYNFLVKSLCQAGLHESASKLVF-----DGHFPDGPVFDFLVSSCA 1381
               LN   L H + S+N L++SL Q G H+SA  LVF     DG+  +  + + +V + A
Sbjct: 95   RINLN---LIHCIGSFNLLLRSLSQMGFHDSAM-LVFKYMKADGYLLENSILESVVLALA 150

Query: 1380 QSGKLDIAKRLLV-QACHGKIKXXXXXXXXXXXXXXXN---SWVDEAICFFKDEHLRLG- 1216
             +GK +IAK +L+ QA  G+ +                   S VDEA+ FFK   LR   
Sbjct: 151  NAGKFEIAKEILISQAELGREEGSIVRPFVHNSLLSLLMKRSRVDEAVDFFKHHILRSER 210

Query: 1215 FCPDTCSFNIVTRGLCRLGKIDLAFELFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGY 1036
              PDTC+FN V RGLCR+G +D AFE FNDMGSF CSPD VTYNTLI+G C    V++  
Sbjct: 211  LFPDTCTFNTVIRGLCRVGGVDKAFEFFNDMGSFGCSPDTVTYNTLINGLCAVGQVNRAQ 270

Query: 1035 ELLTQIRPKTGLSPDVVTYTSVISGYCKLGKMEEASALLDEMISKRIMPNSFTFNVLING 856
             LL  ++ + GLSPDVVTYTS+ISGYCKL +M+EA  L+DEMI+  I PN  TFN+LING
Sbjct: 271  GLLGNLQLQDGLSPDVVTYTSLISGYCKLSRMDEAINLMDEMITYGISPNLVTFNILING 330

Query: 855  FGKIGNMRSAVATYENMLSRGCVPDVVTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPN 676
            FGKIG+M SA+  Y  M + G  PDVVTFTSLIDG+CR GE ++G+KLW++M ++N+SPN
Sbjct: 331  FGKIGDMFSAIKMYGKMCAVGYPPDVVTFTSLIDGYCRTGELDQGLKLWDDMNSRNLSPN 390

Query: 675  AYTFSVLINALCKENRLNEARDLLRQLRWR-NIVPRPFIYNPVIDGFCKAGNVDEANVIF 499
             YTFSVLI+AL KENRLNEAR+LLRQL+ R +IVP+PF+YNPV+DGFCKAGN+ EANVI 
Sbjct: 391  LYTFSVLISALSKENRLNEARELLRQLKSRDDIVPQPFVYNPVLDGFCKAGNLSEANVIA 450

Query: 498  AEMEEKRCNPDKFTFTSLIIGHSMKGRMAEAISLFHKMSSIGCAPDTITISTFISCLLKA 319
            AEME K C  DK TFT LI+GH MKGRM EA+++F KM S+GC PD ITIS   SCLLKA
Sbjct: 451  AEMESKGCCHDKITFTILILGHCMKGRMLEALAIFDKMLSLGCVPDDITISCLTSCLLKA 510

Query: 318  GMPSEANRIMVTFSKRDLGPALSSPRKSFSSTKNMDIPVA 199
            GM  EA ++ +  SK DL P LS  +       ++DIPVA
Sbjct: 511  GMVKEAYKVRLIPSK-DLNPDLSPSKLFIPFRTSLDIPVA 549


>ref|XP_007204825.1| hypothetical protein PRUPE_ppa004064mg [Prunus persica]
            gi|462400356|gb|EMJ06024.1| hypothetical protein
            PRUPE_ppa004064mg [Prunus persica]
          Length = 532

 Score =  489 bits (1258), Expect = e-135
 Identities = 290/569 (50%), Positives = 352/569 (61%), Gaps = 6/569 (1%)
 Frame = -2

Query: 1887 NGVRFQMTLFLLPTSRSRVLRAASTTFFISSFHDHAVQVSSQPDFRISDREVIRNSEAWI 1708
            +GV  QMTL       +  +RA+     IS FH  A    ++P       EVI N EAW 
Sbjct: 45   DGVAVQMTLLFFTARPTFWVRASKIA--ISHFHSLA-HGGARPQI-----EVISNPEAWF 96

Query: 1707 VKVVSTLCV-VHGLKSSTDFACLEYFSKSFNPSIVYGVVRRLGNPKLALKFFEFCFCRLN 1531
            VKVV TL V  H L S      L Y SK+  PSI + V+RRL +PKL LKFFE     L+
Sbjct: 97   VKVVCTLFVRSHALDSY-----LGYLSKNLTPSIAFEVIRRLNHPKLGLKFFELSRLSLS 151

Query: 1530 GDELNHLVKSYNFLVKSLCQAGLHESASKLVFD-----GHFPDGPVFDFLVSSCAQSGKL 1366
               +NH V +YNFL++SLCQ GL +SA KLVFD     GH PD  + + LVSS AQ GKL
Sbjct: 152  ---VNHSVWTYNFLLRSLCQIGLQDSA-KLVFDYMRSDGHTPDDSIAELLVSSYAQMGKL 207

Query: 1365 DIAKRLLVQACHGKIKXXXXXXXXXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNI 1186
            + A++LL                            DE  C             D+ +FNI
Sbjct: 208  NNAEKLL----------------------------DEVHC-------------DSWTFNI 226

Query: 1185 VTRGLCRLGKIDLAFELFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKT 1006
            + RGLCR+G+ID AFE F+DM SF C PD+VTYNTLI G CRA +VD+G  LL +++ + 
Sbjct: 227  LIRGLCRIGEIDKAFEFFSDMESFGCYPDIVTYNTLISGLCRANEVDRGCHLLKEVQSRI 286

Query: 1005 GLSPDVVTYTSVISGYCKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSA 826
             LSPDV+T+TSVISGYCKLGKMEEAS L DEM +  + P S TFN LI+GFGK GNM SA
Sbjct: 287  ELSPDVITFTSVISGYCKLGKMEEASVLFDEMNNSGVGPTSVTFNALIDGFGKSGNMISA 346

Query: 825  VATYENMLSRGCVPDVVTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINA 646
             A YE ML  G  PDV+TFTSLIDG+CR G+  +G+KLW EM  KN+SP+AYTFSVLINA
Sbjct: 347  RAMYEKMLFHGYRPDVITFTSLIDGYCRAGKLSQGLKLWQEMNAKNVSPSAYTFSVLINA 406

Query: 645  LCKENRLNEARDLLRQLRWRNIVPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPD 466
            LC+ENRL EA                        GFCKAGNVDEAN+I AEMEEKRC+PD
Sbjct: 407  LCRENRLQEAH-----------------------GFCKAGNVDEANLIVAEMEEKRCSPD 443

Query: 465  KFTFTSLIIGHSMKGRMAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPSEANRIMV 286
            K TFT LI+G+ MKGRM+EAIS F KM +IGCAPD IT+ + ISCL+KAGMP+EA+ I  
Sbjct: 444  KVTFTILILGNCMKGRMSEAISNFKKMLAIGCAPDNITVDSLISCLMKAGMPNEAHHIK- 502

Query: 285  TFSKRDLGPALSSPRKSFSSTKNMDIPVA 199
              +  DL   +S  R++     N  I VA
Sbjct: 503  KIACEDLNLGMSPSRRADHLRANAKITVA 531


>ref|XP_006396122.1| hypothetical protein EUTSA_v10002477mg [Eutrema salsugineum]
            gi|557096393|gb|ESQ36901.1| hypothetical protein
            EUTSA_v10002477mg [Eutrema salsugineum]
          Length = 535

 Score =  488 bits (1257), Expect = e-135
 Identities = 262/528 (49%), Positives = 341/528 (64%), Gaps = 7/528 (1%)
 Frame = -2

Query: 1821 ASTTFFISSFHDHAVQVSSQPDFRISDREVIRNSEAWIVKVVSTLCVVHGLKSSTDFACL 1642
            AS    I  FH H    +     + + REVI+  EAW+VK+VSTL V     S     C 
Sbjct: 4    ASFATTIGLFHSHTHGGAQARPLQSNTREVIQCPEAWLVKIVSTLFVYQVPDSDL---CF 60

Query: 1641 EYFSKSFNPSIVYGVVRRLGNPKLALKFFEFCFCRLNGDELNHLVKSYNFLVKSLCQAGL 1462
             Y SK+ NP I + VV++L NP +  +F+EF   +LN   + H   +YN L +SLC+AGL
Sbjct: 61   CYLSKNLNPFIAFEVVKKLDNPHIGFRFWEFSRFKLN---IRHSFWTYNLLTRSLCKAGL 117

Query: 1461 HESASKLV----FDGHFPDGPVFDFLVSSCAQSGKLDIAKRLLVQACHGKIKXXXXXXXX 1294
            H+ A K+      DG  P+  +  FLVSS A+ GKL  A  LL+Q+   +++        
Sbjct: 118  HDLAGKMFECMKSDGVSPNSRLLGFLVSSFAEKGKLHFATALLLQSY--EVEGSSMVVNS 175

Query: 1293 XXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIVTRGLCRLGKIDLAFELFNDMGSF 1114
                      V++A+  F D HLR   C DT +FNI+ +GLC +GK   A +L  +M SF
Sbjct: 176  LLHTLVRLDRVEDAMKLF-DTHLRSQSCNDTRTFNILIQGLCGIGKAHEALKLLGEMSSF 234

Query: 1113 DCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYTSVISGYCKLGKMEE 934
              SPD+VTYNTLI GFC++ +++K  E+  +++ + G   DVVTYTS++SGYCK GKM E
Sbjct: 235  GSSPDIVTYNTLIKGFCKSNELNKANEIFNEVKSRNGCFRDVVTYTSMMSGYCKAGKMRE 294

Query: 933  ASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSRGCVPDVVTFTSLID 754
            AS LLDEM+   + P + TFNVL+ G+ K G M SA A    M S GC PDVVTFT+LID
Sbjct: 295  ASLLLDEMVGLGMYPTNITFNVLVYGYVKAGEMSSAEAIRRKMDSFGCFPDVVTFTTLID 354

Query: 753  GHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEARDLLRQLRWRNIVP 574
            G+CR+G+  +G  LW EM  K + PNA+T+S+LINALCKENRL +AR+LL QL   +IVP
Sbjct: 355  GYCRVGQVNKGFSLWEEMSAKGMFPNAFTYSILINALCKENRLLKARELLGQLACMDIVP 414

Query: 573  RPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLIIGHSMKGRMAEAISLF 394
            +PF+YNP+IDGFCKAG V+EANVI AEME+ RC PDK TFT LIIGH MKGRM EAIS+F
Sbjct: 415  KPFLYNPIIDGFCKAGKVNEANVIVAEMEKFRCKPDKITFTILIIGHCMKGRMCEAISIF 474

Query: 393  HKMSSIGCAPDTITISTFISCLLKAGMPSEA---NRIMVTFSKRDLGP 259
            HKM +IGC+PD IT+S+  SCLLKAGM  EA   N+  V     D+ P
Sbjct: 475  HKMVAIGCSPDKITVSSLSSCLLKAGMAKEAYQLNQFAVKGQSNDVAP 522


>ref|XP_006577946.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like
            isoform X1 [Glycine max] gi|571448762|ref|XP_006577947.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At2g06000-like isoform X2 [Glycine max]
            gi|571448764|ref|XP_006577948.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X3 [Glycine max]
            gi|571448766|ref|XP_006577949.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g06000-like isoform X4 [Glycine max]
          Length = 510

 Score =  487 bits (1253), Expect = e-134
 Identities = 257/516 (49%), Positives = 339/516 (65%), Gaps = 5/516 (0%)
 Frame = -2

Query: 1731 IRNSEAWIVKVVSTLCVVHGLKSSTDFACLEYFSKSFNPSIVYGVVRRLGNPKLALKFFE 1552
            IR +EAW VK+  T+ V    +S++    + YFSK   PS+VY VV RL  P L  KF E
Sbjct: 4    IRRAEAWFVKIACTVFV----RSNSLDPFVGYFSKHLTPSLVYEVVNRLHIPNLGFKFVE 59

Query: 1551 FCFCRLNGDELNHLVKSYNFLVKSLCQAGLHESASKLVFD-----GHFPDGPVFDFLVSS 1387
            FC  +L+   ++H   +Y+ L++SLC++ LH +A K+V+D     G  PD  +  FLV S
Sbjct: 60   FCRHKLH---MSHSYLTYSLLLRSLCRSNLHHTA-KVVYDWMRCDGQIPDNRLLGFLVWS 115

Query: 1386 CAQSGKLDIAKRLLVQACHGKIKXXXXXXXXXXXXXXXNSWVDEAICFFKDEHLRLGFCP 1207
             A  G+LD+++ LL       +                 + V +A+  F+ E +RL + P
Sbjct: 116  YAIVGRLDVSRELLADVQCNNVGVNAVVYNDLFNVLIRQNKVVDAVVLFR-ELIRLRYKP 174

Query: 1206 DTCSFNIVTRGLCRLGKIDLAFELFNDMGSFDCSPDVVTYNTLIDGFCRAKDVDKGYELL 1027
             T + NI+ RGLCR G+ID AF L ND+ SF C PDV+TYNTLI G CR  +VD+   LL
Sbjct: 175  VTYTVNILMRGLCRAGEIDEAFRLLNDLRSFGCLPDVITYNTLIHGLCRINEVDRARSLL 234

Query: 1026 TQIRPKTGLSPDVVTYTSVISGYCKLGKMEEASALLDEMISKRIMPNSFTFNVLINGFGK 847
             ++      +PDVV+YT++ISGYCK  KMEE + L  EMI     PN+FTFN LI GFGK
Sbjct: 235  KEVCLNGEFAPDVVSYTTIISGYCKFSKMEEGNLLFGEMIRSGTAPNTFTFNALIGGFGK 294

Query: 846  IGNMRSAVATYENMLSRGCVPDVVTFTSLIDGHCRIGETEEGMKLWNEMGTKNISPNAYT 667
            +G+M SA+A YE ML +GCVPDV TFTSLI+G+ R+G+  + M +W++M  KNI    YT
Sbjct: 295  LGDMASALALYEKMLVQGCVPDVATFTSLINGYFRLGQVHQAMDMWHKMNDKNIGATLYT 354

Query: 666  FSVLINALCKENRLNEARDLLRQLRWRNIVPRPFIYNPVIDGFCKAGNVDEANVIFAEME 487
            FSVL++ LC  NRL++ARD+LR L   +IVP+PFIYNPVIDG+CK+GNVDEAN I AEME
Sbjct: 355  FSVLVSGLCNNNRLHKARDILRLLNESDIVPQPFIYNPVIDGYCKSGNVDEANKIVAEME 414

Query: 486  EKRCNPDKFTFTSLIIGHSMKGRMAEAISLFHKMSSIGCAPDTITISTFISCLLKAGMPS 307
              RC PDK TFT LIIGH MKGRM EAI +FHKM ++GCAPD IT++   SCLLKAGMP 
Sbjct: 415  VNRCKPDKLTFTILIIGHCMKGRMPEAIGIFHKMLAVGCAPDEITVNNLRSCLLKAGMPG 474

Query: 306  EANRIMVTFSKRDLGPALSSPRKSFSSTKNMDIPVA 199
            EA R+    + ++L   ++S +KS+  T N  IPVA
Sbjct: 475  EAARVKKVLA-QNLTLGITSSKKSYHETTNESIPVA 509


>ref|XP_006297396.1| hypothetical protein CARUB_v10013421mg [Capsella rubella]
            gi|565479514|ref|XP_006297397.1| hypothetical protein
            CARUB_v10013421mg [Capsella rubella]
            gi|482566105|gb|EOA30294.1| hypothetical protein
            CARUB_v10013421mg [Capsella rubella]
            gi|482566106|gb|EOA30295.1| hypothetical protein
            CARUB_v10013421mg [Capsella rubella]
          Length = 535

 Score =  487 bits (1253), Expect = e-134
 Identities = 260/530 (49%), Positives = 345/530 (65%), Gaps = 11/530 (2%)
 Frame = -2

Query: 1815 TTFF--ISSFHDHAVQVSSQPDFRISDREVIRNSEAWIVKVVSTLCVVHGLKSSTDFACL 1642
            TTF   I+ FH H+   +       + REV+   EAW++K+VSTL V     S     C 
Sbjct: 4    TTFATAIAHFHTHSHGGAQARPLHSNKREVMHCPEAWLIKIVSTLFVYRVPDSDL---CF 60

Query: 1641 EYFSKSFNPSIVYGVVRRLGN--PKLALKFFEFCFCRLNGDELNHLVKSYNFLVKSLCQA 1468
             Y SK+ NP I + VV++L N  P L  +F+EF   +LN   + H   +YN L +SLC+A
Sbjct: 61   CYLSKNLNPFIAFEVVKKLDNNHPHLGFRFWEFSRFKLN---IRHSFWTYNVLTRSLCKA 117

Query: 1467 GLHESASKLV----FDGHFPDGPVFDFLVSSCAQSGKLDIAKRLLVQACHGKIKXXXXXX 1300
            G+H+ A ++      DG  P+  +  FLVSS A+ GKL  A  LL+Q+   +++      
Sbjct: 118  GMHDLAGQMFECMRSDGVSPNSRLLGFLVSSFAEKGKLQFATALLLQSY--EVERCCMVV 175

Query: 1299 XXXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIVTRGLCRLGKIDLAFELFNDMG 1120
                        VD+A+  F D+HLR   C DT +FNI+ RGLC +GK + A EL  +M 
Sbjct: 176  NSLLNTLVKLDRVDDAMKLF-DKHLRFQCCNDTKTFNILIRGLCSVGKGEKALELLGEMS 234

Query: 1119 SFDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYTSVISGYCKLGKM 940
             F CSPD+VTYNTLI GFC++ ++ K  E+L  ++  +G SPDVVTYTS+ISGYCK GKM
Sbjct: 235  GFGCSPDIVTYNTLIKGFCKSNELAKANEMLNDVKSSSGCSPDVVTYTSMISGYCKAGKM 294

Query: 939  EEASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSRGCVPDVVTFTSL 760
            +EA  LLD+M+   I P + TFNVL++G+ K G M SA      M+S GC PDVVTFTSL
Sbjct: 295  QEAYLLLDDMLGLGIYPTTITFNVLVDGYAKAGEMTSAEDIRGKMISFGCFPDVVTFTSL 354

Query: 759  IDGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEARDLLRQLRWRNI 580
            IDG+CR G+  +G +LW EM  K + PN +T+S+LINALCKEN L +AR+LL QL  ++I
Sbjct: 355  IDGYCRAGQVNQGFRLWEEMNAKGMLPNEFTYSILINALCKENSLLKARELLGQLASKDI 414

Query: 579  VPRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLIIGHSMKGRMAEAIS 400
            + +PF+YNPVIDGFCKAG V+EANVI  EME+K+C PDK TFT LIIGH MKGRM EA+S
Sbjct: 415  ITKPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVS 474

Query: 399  LFHKMSSIGCAPDTITISTFISCLLKAGMPSEA---NRIMVTFSKRDLGP 259
            +FHKM +IGC+PD IT+++ +SCLLKAGM  EA   N+I       D+ P
Sbjct: 475  IFHKMVAIGCSPDKITVNSLLSCLLKAGMAEEAYHLNQIARKAQSNDVAP 524


>dbj|BAH19478.1| AT2G06000 [Arabidopsis thaliana]
          Length = 536

 Score =  486 bits (1252), Expect = e-134
 Identities = 257/512 (50%), Positives = 341/512 (66%), Gaps = 7/512 (1%)
 Frame = -2

Query: 1815 TTFF--ISSFHDHAVQVSSQPDFRISDREVIRNSEAWIVKVVSTLCVVHGLKSSTDFACL 1642
            TTF   I+ FH H+   +     + + REVI   EAW+VK+VSTL V     S     C 
Sbjct: 4    TTFATAIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDL---CF 60

Query: 1641 EYFSKSFNPSIVYGVVRRL-GNPKLALKFFEFCFCRLNGDELNHLVKSYNFLVKSLCQAG 1465
             Y SK+ NP I + VV++L  NP +  +F+EF   +LN   + H   +YN L +SLC+AG
Sbjct: 61   CYLSKNLNPFISFEVVKKLDNNPHIGFRFWEFSRFKLN---IRHSFWTYNLLTRSLCKAG 117

Query: 1464 LHESASKLV----FDGHFPDGPVFDFLVSSCAQSGKLDIAKRLLVQACHGKIKXXXXXXX 1297
            LH+ A ++      DG  P+  +  FLVSS A+ GKL  A  LL+Q+   +++       
Sbjct: 118  LHDLAGQMFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSF--EVEGCCMVVN 175

Query: 1296 XXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIVTRGLCRLGKIDLAFELFNDMGS 1117
                       V++A+  F DEHLR   C DT +FNI+ RGLC +GK + A EL   M  
Sbjct: 176  SLLNTLVKLDRVEDAMKLF-DEHLRFQSCNDTKTFNILIRGLCGVGKAEKALELLGVMSG 234

Query: 1116 FDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYTSVISGYCKLGKME 937
            F C PD+VTYNTLI GFC++ +++K  E+   ++  +  SPDVVTYTS+ISGYCK GKM 
Sbjct: 235  FGCEPDIVTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCKAGKMR 294

Query: 936  EASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSRGCVPDVVTFTSLI 757
            EAS+LLD+M+   I P + TFNVL++G+ K G M +A      M+S GC PDVVTFTSLI
Sbjct: 295  EASSLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVTFTSLI 354

Query: 756  DGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEARDLLRQLRWRNIV 577
            DG+CR+G+  +G +LW EM  + + PNA+T+S+LINALC ENRL +AR+LL QL  ++I+
Sbjct: 355  DGYCRVGQVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLASKDII 414

Query: 576  PRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLIIGHSMKGRMAEAISL 397
            P+PF+YNPVIDGFCKAG V+EANVI  EME+K+C PDK TFT LIIGH MKGRM EA+S+
Sbjct: 415  PQPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVSI 474

Query: 396  FHKMSSIGCAPDTITISTFISCLLKAGMPSEA 301
            FHKM +IGC+PD IT+S+ +SCLLKAGM  EA
Sbjct: 475  FHKMVAIGCSPDKITVSSLLSCLLKAGMAKEA 506


>ref|NP_178657.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|42570711|ref|NP_973429.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|75216767|sp|Q9ZUE9.1|PP149_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g06000 gi|4006835|gb|AAC95177.1| hypothetical protein
            [Arabidopsis thaliana] gi|110736272|dbj|BAF00106.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|330250896|gb|AEC05990.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|330250897|gb|AEC05991.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 536

 Score =  486 bits (1252), Expect = e-134
 Identities = 257/512 (50%), Positives = 341/512 (66%), Gaps = 7/512 (1%)
 Frame = -2

Query: 1815 TTFF--ISSFHDHAVQVSSQPDFRISDREVIRNSEAWIVKVVSTLCVVHGLKSSTDFACL 1642
            TTF   I+ FH H+   +     + + REVI   EAW+VK+VSTL V     S     C 
Sbjct: 4    TTFATAIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDL---CF 60

Query: 1641 EYFSKSFNPSIVYGVVRRL-GNPKLALKFFEFCFCRLNGDELNHLVKSYNFLVKSLCQAG 1465
             Y SK+ NP I + VV++L  NP +  +F+EF   +LN   + H   +YN L +SLC+AG
Sbjct: 61   CYLSKNLNPFISFEVVKKLDNNPHIGFRFWEFSRFKLN---IRHSFWTYNLLTRSLCKAG 117

Query: 1464 LHESASKLV----FDGHFPDGPVFDFLVSSCAQSGKLDIAKRLLVQACHGKIKXXXXXXX 1297
            LH+ A ++      DG  P+  +  FLVSS A+ GKL  A  LL+Q+   +++       
Sbjct: 118  LHDLAGQMFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSF--EVEGCCMVVN 175

Query: 1296 XXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIVTRGLCRLGKIDLAFELFNDMGS 1117
                       V++A+  F DEHLR   C DT +FNI+ RGLC +GK + A EL   M  
Sbjct: 176  SLLNTLVKLDRVEDAMKLF-DEHLRFQSCNDTKTFNILIRGLCGVGKAEKALELLGVMSG 234

Query: 1116 FDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYTSVISGYCKLGKME 937
            F C PD+VTYNTLI GFC++ +++K  E+   ++  +  SPDVVTYTS+ISGYCK GKM 
Sbjct: 235  FGCEPDIVTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCKAGKMR 294

Query: 936  EASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSRGCVPDVVTFTSLI 757
            EAS+LLD+M+   I P + TFNVL++G+ K G M +A      M+S GC PDVVTFTSLI
Sbjct: 295  EASSLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVTFTSLI 354

Query: 756  DGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEARDLLRQLRWRNIV 577
            DG+CR+G+  +G +LW EM  + + PNA+T+S+LINALC ENRL +AR+LL QL  ++I+
Sbjct: 355  DGYCRVGQVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLASKDII 414

Query: 576  PRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLIIGHSMKGRMAEAISL 397
            P+PF+YNPVIDGFCKAG V+EANVI  EME+K+C PDK TFT LIIGH MKGRM EA+S+
Sbjct: 415  PQPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVSI 474

Query: 396  FHKMSSIGCAPDTITISTFISCLLKAGMPSEA 301
            FHKM +IGC+PD IT+S+ +SCLLKAGM  EA
Sbjct: 475  FHKMVAIGCSPDKITVSSLLSCLLKAGMAKEA 506


>ref|XP_002885810.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297331650|gb|EFH62069.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 536

 Score =  484 bits (1245), Expect = e-134
 Identities = 255/512 (49%), Positives = 340/512 (66%), Gaps = 7/512 (1%)
 Frame = -2

Query: 1815 TTFF--ISSFHDHAVQVSSQPDFRISDREVIRNSEAWIVKVVSTLCVVHGLKSSTDFACL 1642
            TTF   I+ FH H+   +     + + RE I   EAW+VK+VSTL V     S     C 
Sbjct: 4    TTFATAIAHFHTHSHGGAQARPIQNNTREKIHCPEAWLVKIVSTLFVYRVPDSDL---CF 60

Query: 1641 EYFSKSFNPSIVYGVVRRL-GNPKLALKFFEFCFCRLNGDELNHLVKSYNFLVKSLCQAG 1465
             Y SK+ NP I + VV++L  NP +  +F+EF   +LN   + H   +YN L +SLC+AG
Sbjct: 61   CYLSKNLNPFISFEVVKKLDNNPHIGFRFWEFSRFKLN---IRHSFWTYNLLTRSLCKAG 117

Query: 1464 LHESASKLV----FDGHFPDGPVFDFLVSSCAQSGKLDIAKRLLVQACHGKIKXXXXXXX 1297
            +H+ A ++      DG  P+  +  FLVSS A+ GKL  A  LL+Q+   +++       
Sbjct: 118  MHDLAGQMFECMKSDGISPNSRLLGFLVSSFAEKGKLHCATALLLQSY--EVEGCCMVVN 175

Query: 1296 XXXXXXXXNSWVDEAICFFKDEHLRLGFCPDTCSFNIVTRGLCRLGKIDLAFELFNDMGS 1117
                       V++A+  F +EHLR   C DT +FNI+ RGLC +GK + A EL   M  
Sbjct: 176  SLLNTLVKLDRVEDAMKLF-EEHLRFQSCNDTKTFNILIRGLCGVGKAEKAVELLGGMSG 234

Query: 1116 FDCSPDVVTYNTLIDGFCRAKDVDKGYELLTQIRPKTGLSPDVVTYTSVISGYCKLGKME 937
            F C PD+VTYNTLI GFC++ ++ K  E+   ++  +G SPDVVTYTS+ISGYCK GKM+
Sbjct: 235  FGCLPDIVTYNTLIKGFCKSNELKKANEMFDDVKSSSGCSPDVVTYTSMISGYCKAGKMQ 294

Query: 936  EASALLDEMISKRIMPNSFTFNVLINGFGKIGNMRSAVATYENMLSRGCVPDVVTFTSLI 757
            EAS LLD+M+   I P + TFNVL++G+ K G M +A      M+S GC PDVVTFTSLI
Sbjct: 295  EASVLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMHTAEEIRGKMISFGCFPDVVTFTSLI 354

Query: 756  DGHCRIGETEEGMKLWNEMGTKNISPNAYTFSVLINALCKENRLNEARDLLRQLRWRNIV 577
            DG+CR+G+  +G +LW EM  + + PNA+T+S+LINALCKENRL +AR+LL QL  ++I+
Sbjct: 355  DGYCRVGQVNQGFRLWEEMNARGMFPNAFTYSILINALCKENRLLKARELLGQLASKDII 414

Query: 576  PRPFIYNPVIDGFCKAGNVDEANVIFAEMEEKRCNPDKFTFTSLIIGHSMKGRMAEAISL 397
            P+PF+YNPVIDGFCKAG V+EA VI  EME+K+C PDK TFT LIIGH MKGRM EA+S+
Sbjct: 415  PQPFMYNPVIDGFCKAGKVNEAIVIVEEMEKKKCKPDKITFTILIIGHCMKGRMFEAVSI 474

Query: 396  FHKMSSIGCAPDTITISTFISCLLKAGMPSEA 301
            FHKM +IGC+PD IT+S+ +SCLLKAGM  EA
Sbjct: 475  FHKMVAIGCSPDKITVSSLLSCLLKAGMAKEA 506

