BLASTX nr result

ID: Cocculus23_contig00028830 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00028830
         (1982 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI25851.3| unnamed protein product [Vitis vinifera]              507   e-141
ref|XP_003631455.1| PREDICTED: pentatricopeptide repeat-containi...   503   e-139
ref|XP_007039757.1| Tetratricopeptide repeat-like superfamily pr...   489   e-135
ref|XP_006477135.1| PREDICTED: pentatricopeptide repeat-containi...   468   e-129
ref|XP_006440247.1| hypothetical protein CICLE_v10019985mg [Citr...   467   e-129
ref|XP_002531466.1| pentatricopeptide repeat-containing protein,...   457   e-126
gb|EXC02094.1| hypothetical protein L484_024059 [Morus notabilis]     454   e-125
ref|XP_004139002.1| PREDICTED: pentatricopeptide repeat-containi...   447   e-123
ref|XP_006368989.1| hypothetical protein POPTR_0001s15470g [Popu...   441   e-121
ref|XP_004300367.1| PREDICTED: pentatricopeptide repeat-containi...   436   e-119
ref|XP_007212439.1| hypothetical protein PRUPE_ppa016777mg, part...   432   e-118
ref|XP_006359252.1| PREDICTED: pentatricopeptide repeat-containi...   427   e-117
ref|XP_006359251.1| PREDICTED: pentatricopeptide repeat-containi...   427   e-117
ref|XP_004245793.1| PREDICTED: pentatricopeptide repeat-containi...   421   e-115
ref|XP_004515635.1| PREDICTED: pentatricopeptide repeat-containi...   416   e-113
emb|CAN68810.1| hypothetical protein VITISV_001082 [Vitis vinifera]   389   e-105
ref|XP_007131288.1| hypothetical protein PHAVU_011G001300g [Phas...   387   e-104
ref|XP_002863348.1| pentatricopeptide repeat-containing protein ...   386   e-104
ref|XP_006398426.1| hypothetical protein EUTSA_v10000870mg [Eutr...   385   e-104
ref|NP_199547.1| pentatricopeptide repeat-containing protein [Ar...   383   e-103

>emb|CBI25851.3| unnamed protein product [Vitis vinifera]
          Length = 528

 Score =  507 bits (1305), Expect = e-141
 Identities = 271/487 (55%), Positives = 333/487 (68%), Gaps = 3/487 (0%)
 Frame = +2

Query: 92   CKFMAINSFFRCICTPYRLKNPKFFSNARRHKSNPASENYWNLLRRIESDPNLEKTLTKT 271
            CK MA +S  R +    R KNP F +      +  ++E Y+  L++     N+EKTL   
Sbjct: 4    CKSMAFSSVSRLLPYSIRHKNPNFST------ALSSAEKYYTHLQKYGD--NIEKTLPAV 55

Query: 272  RGKLDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETL 451
            R KLDSS V  VL RCS ++  +GLRFFIWAG+Q  YRHS+++Y +AC L  INQ    +
Sbjct: 56   RAKLDSSCVNEVLNRCSLTQSQLGLRFFIWAGVQSYYRHSSYLYSKACELFRINQNPRAI 115

Query: 452  IGVLEAYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRL 631
            I V+EAYR EG +VS+KTF V+L+L REAKL DEAL +L+KM EFN R DT  +N VIRL
Sbjct: 116  IDVIEAYRVEGTVVSVKTFNVVLHLLREAKLADEALWILKKMAEFNIRADTVAYNSVIRL 175

Query: 632  FSEKGDMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPN 811
            F EKGDMD+A  LM EM LIDLYP+MITYVTMIKGFCNV RLED   LF+ M+ HGC PN
Sbjct: 176  FCEKGDMDLAAGLMKEMGLIDLYPNMITYVTMIKGFCNVGRLEDACKLFKVMKGHGCSPN 235

Query: 812  VVTYSTLLDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALA 991
            VV Y+ +LDGVC+ G+L+RA+ELLGEMEKES  C +PNVVTYTS+IQ+ CE  + MEAL 
Sbjct: 236  VVVYTVILDGVCRFGSLERALELLGEMEKESGDC-SPNVVTYTSMIQSCCEKGKLMEALE 294

Query: 992  ILHRMEERGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLL 1171
            IL RM   GC PNRVTVS L+KG C  G VEEA+KLIDKVV  G V+   CYSSLIV L+
Sbjct: 295  ILDRMRACGCAPNRVTVSILMKGFCAEGRVEEAFKLIDKVVAGGNVSYGECYSSLIVSLV 354

Query: 1172 QNNNMEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDA---FXXXXXXXXXXXXX 1342
             N N++EAEKLFRRMLA+A+KPDGL+C   IK LC +GR LD    F             
Sbjct: 355  GNKNLQEAEKLFRRMLANAVKPDGLACGTLIKALCLEGRVLDGFHLFDEFENMEGLSYLD 414

Query: 1343 XXXYGVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLALRLTS 1522
               Y +LL GL QK H  EA KLA +M +RGI+LK PY D I+E+L  SG+ ++     +
Sbjct: 415  SDIYSILLVGLSQKRHSVEAVKLARLMVDRGIQLKTPYFDSIVEHLKESGDKEICTHFCT 474

Query: 1523 *AC*YMI 1543
             A  Y +
Sbjct: 475  LALLYNV 481


>ref|XP_003631455.1| PREDICTED: pentatricopeptide repeat-containing protein At5g47360-like
            [Vitis vinifera]
          Length = 638

 Score =  503 bits (1296), Expect = e-139
 Identities = 268/476 (56%), Positives = 330/476 (69%), Gaps = 3/476 (0%)
 Frame = +2

Query: 101  MAINSFFRCICTPYRLKNPKFFSNARRHKSNPASENYWNLLRRIESDPNLEKTLTKTRGK 280
            MA +S  R +    R KNP F +      +  ++E Y+  L++     N+EKTL   R K
Sbjct: 1    MAFSSVSRLLPYSIRHKNPNFST------ALSSAEKYYTHLQKYGD--NIEKTLPAVRAK 52

Query: 281  LDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGV 460
            LDSS V  VL RCS ++  +GLRFFIWAG+Q  YRHS+++Y +AC L  INQ    +I V
Sbjct: 53   LDSSCVNEVLNRCSLTQSQLGLRFFIWAGVQSYYRHSSYLYSKACELFRINQNPRAIIDV 112

Query: 461  LEAYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSE 640
            +EAYR EG +VS+KTF V+L+L REAKL DEAL +L+KM EFN R DT  +N VIRLF E
Sbjct: 113  IEAYRVEGTVVSVKTFNVVLHLLREAKLADEALWILKKMAEFNIRADTVAYNSVIRLFCE 172

Query: 641  KGDMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVT 820
            KGDMD+A  LM EM LIDLYP+MITYVTMIKGFCNV RLED   LF+ M+ HGC PNVV 
Sbjct: 173  KGDMDLAAGLMKEMGLIDLYPNMITYVTMIKGFCNVGRLEDACKLFKVMKGHGCSPNVVV 232

Query: 821  YSTLLDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALAILH 1000
            Y+ +LDGVC+ G+L+RA+ELLGEMEKES  C +PNVVTYTS+IQ+ CE  + MEAL IL 
Sbjct: 233  YTVILDGVCRFGSLERALELLGEMEKESGDC-SPNVVTYTSMIQSCCEKGKLMEALEILD 291

Query: 1001 RMEERGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNN 1180
            RM   GC PNRVTVS L+KG C  G VEEA+KLIDKVV  G V+   CYSSLIV L+ N 
Sbjct: 292  RMRACGCAPNRVTVSILMKGFCAEGRVEEAFKLIDKVVAGGNVSYGECYSSLIVSLVGNK 351

Query: 1181 NMEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDA---FXXXXXXXXXXXXXXXX 1351
            N++EAEKLFRRMLA+A+KPDGL+C   IK LC +GR LD    F                
Sbjct: 352  NLQEAEKLFRRMLANAVKPDGLACGTLIKALCLEGRVLDGFHLFDEFENMEGLSYLDSDI 411

Query: 1352 YGVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLALRLT 1519
            Y +LL GL QK H  EA KLA +M +RGI+LK PY D I+E+L  SG+ ++ + L+
Sbjct: 412  YSILLVGLSQKRHSVEAVKLARLMVDRGIQLKTPYFDSIVEHLKESGDKEIVMYLS 467


>ref|XP_007039757.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1
            [Theobroma cacao] gi|590676515|ref|XP_007039758.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative isoform 1 [Theobroma cacao]
            gi|590676519|ref|XP_007039759.1| Tetratricopeptide
            repeat-like superfamily protein, putative isoform 1
            [Theobroma cacao] gi|590676523|ref|XP_007039760.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative isoform 1 [Theobroma cacao]
            gi|508777002|gb|EOY24258.1| Tetratricopeptide repeat-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
            gi|508777003|gb|EOY24259.1| Tetratricopeptide repeat-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
            gi|508777004|gb|EOY24260.1| Tetratricopeptide repeat-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
            gi|508777005|gb|EOY24261.1| Tetratricopeptide repeat-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 483

 Score =  489 bits (1258), Expect = e-135
 Identities = 248/443 (55%), Positives = 317/443 (71%), Gaps = 3/443 (0%)
 Frame = +2

Query: 197  ASENYWNLLRRIESDPNLEKTLTKTRGKLDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQP 376
            +++ ++  L++ +S  N+EKTL     KLDS+ V  VL+RC   +  +GLRFFIWAGLQ 
Sbjct: 31   SADKFFTHLQKKQS--NIEKTLALVNSKLDSNCVCEVLERCCFDKSQMGLRFFIWAGLQS 88

Query: 377  GYRHSAWIYGRACRLLGINQRRETLIGVLEAYRNEGCLVSIKTFKVILNLCREAKLPDEA 556
             YRHS+++Y +AC  L I Q    ++ V+EAY+ E CLV++K FKV+LNLCREA++ DEA
Sbjct: 89   NYRHSSYMYSKACEFLKIKQNPFLVLDVIEAYKVEKCLVNVKMFKVVLNLCREARITDEA 148

Query: 557  LEVLRKMGEFNCRPDTTNFNLVIRLFSEKGDMDVALELMNEMALIDLYPDMITYVTMIKG 736
            L VLRKM EFN RPDTT +N+VIRL  EKGDMD+A +LM +M LIDLYPDMITY+ MIKG
Sbjct: 149  LLVLRKMPEFNLRPDTTTYNVVIRLICEKGDMDMADKLMKDMGLIDLYPDMITYLAMIKG 208

Query: 737  FCNVDRLEDGYGLFRFMRSHGCFPNVVTYSTLLDGVCKSGNLDRAMELLGEMEKESDSCV 916
            FCN  RLED  GLF+ MR HGCFPN V YS LL+G+C+ G++++A+ELLGEMEKE D C 
Sbjct: 209  FCNAGRLEDACGLFQVMREHGCFPNAVAYSALLEGICRYGSVEKALELLGEMEKEGDGC- 267

Query: 917  APNVVTYTSVIQNLCESSRTMEALAILHRMEERGCLPNRVTVSTLIKGLCNAGCVEEAYK 1096
            +PNV+TYTSVIQ+ CE  +T +AL +L RM   GC PNRVTVSTLIK LC  G VEEAYK
Sbjct: 268  SPNVITYTSVIQSFCEKGQTTKALRVLDRMGTCGCAPNRVTVSTLIKRLCAEGHVEEAYK 327

Query: 1097 LIDKVVGLGTVTTDGCYSSLIVCLLQNNNMEEAEKLFRRMLASAMKPDGLSCSLFIKQLC 1276
            LIDKVV  G V+   CYSSL+V L++   ++EAEKLFR+MLA+  KPD ++CS+ I+++C
Sbjct: 328  LIDKVVPGGGVSDGDCYSSLVVSLIRIKRLDEAEKLFRKMLATGAKPDSIACSIMIREIC 387

Query: 1277 SDGRFLDAF---XXXXXXXXXXXXXXXXYGVLLAGLCQKSHLTEAAKLANVMFNRGIRLK 1447
             +GR LD F                   Y +LL GLC++SH  EAAKLA  M  + IRLK
Sbjct: 388  QEGRVLDGFYLYEEIERMRYLSSIDADIYSILLVGLCRQSHSVEAAKLARSMLEKRIRLK 447

Query: 1448 APYVDGILEYLNISGEGDLALRL 1516
            APYVD I+E+L   G+  L   L
Sbjct: 448  APYVDKIIEHLKNCGDKQLVTEL 470


>ref|XP_006477135.1| PREDICTED: pentatricopeptide repeat-containing protein At5g47360-like
            isoform X1 [Citrus sinensis]
            gi|568846596|ref|XP_006477136.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At5g47360-like isoform X2 [Citrus sinensis]
            gi|568846598|ref|XP_006477137.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At5g47360-like isoform X3 [Citrus sinensis]
          Length = 475

 Score =  468 bits (1204), Expect = e-129
 Identities = 248/471 (52%), Positives = 319/471 (67%), Gaps = 3/471 (0%)
 Frame = +2

Query: 113  SFFRCICTPYRLKNPKFFSNARRHKSNPASENYWNLLRRIESDPNLEKTLTKTRGKLDSS 292
            S  R + +   +KN K F+      ++PA   Y +L +   +  N+EKTL   + KLDS+
Sbjct: 5    SLSRILSSSVNIKNSKIFA-LHFTTASPAERFYTHLQK---NPNNIEKTLATVKAKLDST 60

Query: 293  IVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGVLEAY 472
             V  VL RC  S+  +G+RFFIWA LQ  YRHS+++Y RAC +  I Q    +I V+EAY
Sbjct: 61   CVIEVLHRCFPSQSQMGIRFFIWAALQSSYRHSSFMYNRACEMSRIKQNPSIIIDVVEAY 120

Query: 473  RNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSEKGDM 652
            + EGC+VS+K  KVI NLC +A+L +EA+ VLRKM EF+ RPDT  +N VIRLF EKGDM
Sbjct: 121  KEEGCVVSVKMMKVIFNLCEKARLANEAMWVLRKMPEFDLRPDTIIYNNVIRLFCEKGDM 180

Query: 653  DVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVTYSTL 832
              A ELM  M LIDLYPD+ITYV+MIKGFCN  RLED  GLF+ M+ HGC  N+V YS L
Sbjct: 181  IAADELMKGMGLIDLYPDIITYVSMIKGFCNAGRLEDACGLFKVMKRHGCAANLVAYSAL 240

Query: 833  LDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALAILHRMEE 1012
            LDG+C+ G+++RA+ELLGEMEKE   C +PNVVTYTSVIQ  C      EAL IL RME 
Sbjct: 241  LDGICRLGSMERALELLGEMEKEGGDC-SPNVVTYTSVIQIFCGKGMMKEALGILDRMEA 299

Query: 1013 RGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNNNMEE 1192
             GC PNRVT+STLIKG C  G ++EAY+LIDKVV  G+V++ GCYSSL+V L++   ++E
Sbjct: 300  LGCAPNRVTISTLIKGFCVEGNLDEAYQLIDKVVAGGSVSSGGCYSSLVVELVRTKRLKE 359

Query: 1193 AEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAF---XXXXXXXXXXXXXXXXYGVL 1363
            AEKLF +MLAS +KPDGL+CS+ I++LC  G+ L+ F                   + VL
Sbjct: 360  AEKLFSKMLASGVKPDGLACSVMIRELCLGGQVLEGFCLYEDIEKIGFLSSVDSDIHSVL 419

Query: 1364 LAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLALRL 1516
            L GLC+K+H  EAAKLA  M  + I L+ PYVD I+E+L  SG+ +L   L
Sbjct: 420  LLGLCRKNHSVEAAKLARFMLKKRIWLQGPYVDKIVEHLKKSGDEELITNL 470


>ref|XP_006440247.1| hypothetical protein CICLE_v10019985mg [Citrus clementina]
            gi|567895520|ref|XP_006440248.1| hypothetical protein
            CICLE_v10019985mg [Citrus clementina]
            gi|567895522|ref|XP_006440249.1| hypothetical protein
            CICLE_v10019985mg [Citrus clementina]
            gi|557542509|gb|ESR53487.1| hypothetical protein
            CICLE_v10019985mg [Citrus clementina]
            gi|557542510|gb|ESR53488.1| hypothetical protein
            CICLE_v10019985mg [Citrus clementina]
            gi|557542511|gb|ESR53489.1| hypothetical protein
            CICLE_v10019985mg [Citrus clementina]
          Length = 475

 Score =  467 bits (1202), Expect = e-129
 Identities = 248/471 (52%), Positives = 319/471 (67%), Gaps = 3/471 (0%)
 Frame = +2

Query: 113  SFFRCICTPYRLKNPKFFSNARRHKSNPASENYWNLLRRIESDPNLEKTLTKTRGKLDSS 292
            S  R + +   +KN K F+      ++PA   Y +L +   +  N+EKTL   + KLDS+
Sbjct: 5    SLSRILSSSVNIKNSKIFA-LHFTTASPAERFYTHLQK---NPNNIEKTLATVKAKLDST 60

Query: 293  IVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGVLEAY 472
             V  VL RC  S+  +G+RFFIWA LQ  YRHS+++Y RAC +  I Q    +I V+EAY
Sbjct: 61   CVIEVLHRCFPSQSQMGIRFFIWAALQSSYRHSSFMYNRACEMSRIKQNPSIIIDVVEAY 120

Query: 473  RNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSEKGDM 652
            + EGC+VS+K  KVI NLC +A+L +EA+ VLRKM EF+ RPDT  +N VIRLF EKGDM
Sbjct: 121  KEEGCVVSVKMMKVIFNLCEKARLANEAMWVLRKMPEFDLRPDTIIYNNVIRLFCEKGDM 180

Query: 653  DVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVTYSTL 832
              A ELM  M LIDLYPD+ITYV+MIKGFCN  RLED  GLF+ M+ HGC  N+V YS L
Sbjct: 181  IAADELMKGMGLIDLYPDIITYVSMIKGFCNAGRLEDACGLFKVMKRHGCAANLVAYSAL 240

Query: 833  LDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALAILHRMEE 1012
            LDG+C+ G+++RA+ELLGEMEKE   C +PNVVTYTSVIQ  C      EAL IL RME 
Sbjct: 241  LDGICRLGSMERALELLGEMEKEGGDC-SPNVVTYTSVIQIFCGKGMMKEALGILDRMEA 299

Query: 1013 RGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNNNMEE 1192
             GC PNRVT+STLIKG C  G ++EAY+LIDKVV  G+V++ GCYSSL+V L++   ++E
Sbjct: 300  FGCAPNRVTISTLIKGFCVEGNLDEAYQLIDKVVAGGSVSSGGCYSSLVVELVRTKRLKE 359

Query: 1193 AEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAF---XXXXXXXXXXXXXXXXYGVL 1363
            AEKLF +MLAS +KPDGL+CS+ I++LC  G+ L+ F                   + VL
Sbjct: 360  AEKLFSKMLASGVKPDGLACSVMIRELCLRGQVLEGFCLYEDIEKIGFLSSVDSDIHSVL 419

Query: 1364 LAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLALRL 1516
            L GLC+K+H  EAAKLA  M  + I L+ PYVD I+E+L  SG+ +L   L
Sbjct: 420  LLGLCRKNHSVEAAKLARFMLKKRIWLQGPYVDKIVEHLKKSGDEELITNL 470


>ref|XP_002531466.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223528920|gb|EEF30916.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 518

 Score =  457 bits (1176), Expect = e-126
 Identities = 235/466 (50%), Positives = 317/466 (68%), Gaps = 10/466 (2%)
 Frame = +2

Query: 155  PKFFSNARRHKSNPASENYWN------LLRRIESDPN-LEKTLTKTRGKLDSSIVEGVLQ 313
            P+F S +   K++  S  ++       L   ++++PN +EK+L   + KLD+  V  VL 
Sbjct: 7    PRFLSLSIAPKTSKISTLHFTTSLADKLYTHLQNNPNNVEKSLNSIKPKLDTRCVTEVLH 66

Query: 314  RCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGVLEAYRNEGCLV 493
            +CS +   +GLRFF+WAG Q  YRHS+++Y +AC+L  I Q  + ++ + E YR E C+V
Sbjct: 67   KCSLNNSQIGLRFFVWAGYQSNYRHSSFLYSKACKLFNIKQNPQAVLDLFEFYRAEKCVV 126

Query: 494  SIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSEKGDMDVALELM 673
            ++KTFKV+LNLC+E  L +EA  VLRKM EF+ + DT  + +VIRLF +KGDMD+A +LM
Sbjct: 127  NLKTFKVVLNLCKEGTLANEAFLVLRKMQEFDIQADTKAYTIVIRLFCDKGDMDMAQKLM 186

Query: 674  NEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVTYSTLLDGVCKS 853
             EM+  DLYPDM+TYV++IKGFC++ RLE+   L + MR+HGC PNVV YSTL+DG+C+ 
Sbjct: 187  GEMSFNDLYPDMVTYVSIIKGFCDIGRLEEACRLVKEMRAHGCVPNVVVYSTLVDGICRF 246

Query: 854  GNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALAILHRMEERGCLPNR 1033
            G+++RA+ELLG MEKE   C  PNV+TYTSVIQ LCE  RTM+A A+L RME  GC PNR
Sbjct: 247  GSVERALELLGGMEKEGGDC-NPNVLTYTSVIQGLCEKGRTMDAFAVLDRMEACGCAPNR 305

Query: 1034 VTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNNNMEEAEKLFRR 1213
            VTVSTL+K LC  G +EEAYKLID+VV  G+V++  CYS ++VCL++   +EEAEKLFRR
Sbjct: 306  VTVSTLLKRLCMDGHLEEAYKLIDRVVAGGSVSSCDCYSPIVVCLIRIKKVEEAEKLFRR 365

Query: 1214 MLASAMKPDGLSCSLFIKQLCSDGRFLDAF---XXXXXXXXXXXXXXXXYGVLLAGLCQK 1384
             + S +KPDGL+CSL IK+LC   R LD +                   Y VLL GLCQ+
Sbjct: 366  AVVSGVKPDGLACSLMIKELCFVNRVLDGYCLHDEIEKIGSLSTIDSDTYSVLLVGLCQQ 425

Query: 1385 SHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLALRLTS 1522
             +  EAAKLA  +  + I LK PYVD ++EY+   G  DL   L S
Sbjct: 426  GYSLEAAKLARSLIEKRIHLKHPYVDKVVEYMKKFGVTDLVTELAS 471


>gb|EXC02094.1| hypothetical protein L484_024059 [Morus notabilis]
          Length = 474

 Score =  454 bits (1168), Expect = e-125
 Identities = 243/476 (51%), Positives = 316/476 (66%), Gaps = 3/476 (0%)
 Frame = +2

Query: 101  MAINSFFRCICTPYRLKNPKFFSNARRHKSNPASENYWNLLRRIESDPNLEKTLTKTRGK 280
            M++ S  R + +P R  NP+F  +  R     +++  ++ L +  +  N+EKTL   + K
Sbjct: 1    MSLRSISRILSSPNRFLNPQF--STIRFAITSSADKIFDHLNK--NGGNIEKTLATIKPK 56

Query: 281  LDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGV 460
            LD   V  VL +C  S+  +G+RFFIWAGLQ  YRHS ++YG+AC+L  I+Q  + +  +
Sbjct: 57   LDPKFVSDVLFKCHPSQSQMGIRFFIWAGLQSDYRHSYFMYGKACKLFEISQNPKLISDI 116

Query: 461  LEAYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSE 640
            +EAYR+E C V++KTFKV+LNLC+EAKL DEAL VLRKM EFN  PDTT +N VIRLF  
Sbjct: 117  IEAYRDEKCFVTVKTFKVVLNLCKEAKLADEALWVLRKMPEFNLFPDTTMYNSVIRLFCL 176

Query: 641  KGDMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVT 820
            KGDM+ A  LM EM L+DLYPDMITYV M+KGFCNV RL+D +GLF+ ++   C  N V 
Sbjct: 177  KGDMNTAESLMKEMGLVDLYPDMITYVEMVKGFCNVGRLDDAFGLFKVVKELDCGNNTVL 236

Query: 821  YSTLLDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALAILH 1000
             S LLDGVCKSG+++RA+ELL EMEK     V+PNVV YTSVIQ  CE  RT EAL +L 
Sbjct: 237  CSALLDGVCKSGDMERALELLEEMEKGGGE-VSPNVVAYTSVIQRFCEKGRTSEALEVLD 295

Query: 1001 RMEERGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNN 1180
            RME  GC PNRVTVS LI+  C  G VEE  KLID+VV  G V+ D C SS +V L +  
Sbjct: 296  RMEAWGCFPNRVTVSCLIERFCAEGRVEEVSKLIDRVV-KGGVSYDECCSSFVVSLKRTG 354

Query: 1181 NMEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAF---XXXXXXXXXXXXXXXX 1351
              EEAEK+FR+M+ + +KPD L+C++ IK+LC  GR LD +                   
Sbjct: 355  QFEEAEKVFRKMINNGLKPDSLACTIVIKELCLIGRVLDGYQLCDEIEKIGFWSSIDSDV 414

Query: 1352 YGVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLALRLT 1519
            Y +L+ GLCQ+ HL EAA L ++M  +GI+L APYVD I+E L  SG+ +L   LT
Sbjct: 415  YSLLIVGLCQQGHLVEAANLVSLMLKKGIQLSAPYVDRIVEILKKSGDEELIHHLT 470


>ref|XP_004139002.1| PREDICTED: pentatricopeptide repeat-containing protein At5g47360-like
            [Cucumis sativus] gi|449505643|ref|XP_004162530.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At5g47360-like [Cucumis sativus]
          Length = 475

 Score =  447 bits (1151), Expect = e-123
 Identities = 228/431 (52%), Positives = 297/431 (68%), Gaps = 3/431 (0%)
 Frame = +2

Query: 233  ESDPNLEKTLTKTRGKLDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRA 412
            +S+ NL+KTL   + KLDS  V  VL +CS     +GLRFFIWAG QP YRHS+++Y RA
Sbjct: 41   KSNGNLDKTLATLKTKLDSRCVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRA 100

Query: 413  CRLLGINQRRETLIGVLEAYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNC 592
            C L+GIN     L  V+E YR EGCLV I+ FK+ILNLC+EAKL  EAL +LRKM EF+ 
Sbjct: 101  CELIGINVSPCLLFNVIEDYRREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHL 160

Query: 593  RPDTTNFNLVIRLFSEKGDMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYG 772
            R DTT +NLVIRLF+EKG+MD A+ELM EM  +D++P+MITY++M+KGFC+V R ED YG
Sbjct: 161  RADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYISMLKGFCDVGRWEDAYG 220

Query: 773  LFRFMRSHGCFPNVVTYSTLLDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQ 952
            LF+ M+ +GC PN V YS L++G  +   +DR ME+L EMEK+  +C +PN VTYTS+IQ
Sbjct: 221  LFKDMKENGCAPNTVVYSVLVNGAIRLRIMDRLMEMLKEMEKQGGTC-SPNTVTYTSIIQ 279

Query: 953  NLCESSRTMEALAILHRMEERGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVT 1132
            +LCE    +EAL +L RMEE G  PNRV VS L+K  C  G VEEAYKLID+VV  G V+
Sbjct: 280  SLCEEGHPLEALKVLDRMEEYGYAPNRVAVSFLVKEFCKDGHVEEAYKLIDRVVARGGVS 339

Query: 1133 TDGCYSSLIVCLLQNNNMEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAF--- 1303
               CYSSL+V L++   + EAEKLFR MLA+ +KPDG++CSL I++LC + R LD F   
Sbjct: 340  YGDCYSSLVVTLVKMKKIAEAEKLFRNMLANGVKPDGVACSLMIRELCLEERVLDGFNLC 399

Query: 1304 XXXXXXXXXXXXXXXXYGVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLN 1483
                            Y +LL GLC+  H  +AAKLA +M  +GIRLK  Y + I+++L 
Sbjct: 400  YEVDRNGYLCSIDADIYSLLLVGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLK 459

Query: 1484 ISGEGDLALRL 1516
               + +L + L
Sbjct: 460  KFEDRELVMHL 470


>ref|XP_006368989.1| hypothetical protein POPTR_0001s15470g [Populus trichocarpa]
            gi|550347348|gb|ERP65558.1| hypothetical protein
            POPTR_0001s15470g [Populus trichocarpa]
          Length = 476

 Score =  441 bits (1135), Expect = e-121
 Identities = 222/433 (51%), Positives = 293/433 (67%), Gaps = 4/433 (0%)
 Frame = +2

Query: 236  SDPNLEKTLTKTRG-KLDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRA 412
            S  N+EKTL      KLD+  V  ++ R S +   +GLRFFIWAG QP YRH+ +IY +A
Sbjct: 42   SPNNVEKTLNSLAPIKLDTKYVNDIIHRWSLNNLQLGLRFFIWAGDQPNYRHNLYIYNKA 101

Query: 413  CRLLGINQRRETLIGVLEAYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNC 592
            C L  I Q  + ++ ++E Y+ E C+V + TFKV+L LC+   L DEAL VL+KM EFN 
Sbjct: 102  CSLFKIKQNPQVILDLIETYKLEKCVVCVDTFKVVLRLCKAGGLADEALMVLKKMPEFNI 161

Query: 593  RPDTTNFNLVIRLFSEKGDMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYG 772
            RPDTT +N+VIR   EKGD+D+A +LM EM LIDLYPDMITYV+MIKGFC+V RLE+ + 
Sbjct: 162  RPDTTAYNVVIRSLCEKGDVDMAKKLMGEMGLIDLYPDMITYVSMIKGFCDVGRLEEAFA 221

Query: 773  LFRFMRSHGCFPNVVTYSTLLDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQ 952
            LF  M  HGC+PNVV YS LLDG+C+ G ++RA ELL EMEK+ + C  PNV+TYTSVIQ
Sbjct: 222  LFPVMSVHGCYPNVVAYSALLDGICRFGIVERAFELLAEMEKQGEGC-CPNVITYTSVIQ 280

Query: 953  NLCESSRTMEALAILHRMEERGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVT 1132
            + CE  RT +AL++L  ME RGC PNRVT S  I G+C  G +++ Y  I+++V  G+V+
Sbjct: 281  SFCEQGRTKDALSVLELMEVRGCAPNRVTASAWINGICTNGQLQDVYNFIERIVAGGSVS 340

Query: 1133 TDGCYSSLIVCLLQNNNMEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAF--- 1303
               CYSSL+VCL++   +EEAEK FRR L+S MKPD L+CS+ I+++CS+ R LD F   
Sbjct: 341  IGDCYSSLVVCLIKIKKVEEAEKTFRRALSSGMKPDSLACSMMIREICSEKRVLDGFCLY 400

Query: 1304 XXXXXXXXXXXXXXXXYGVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLN 1483
                            Y +LLAGLCQ+ H  EAA+LA  M  + I L+AP+V+ I+E+L 
Sbjct: 401  EEVEKTGCLSSIDIDIYSILLAGLCQQGHSAEAARLARSMLEKRIPLRAPHVEKIVEHLK 460

Query: 1484 ISGEGDLALRLTS 1522
              G  +L   L S
Sbjct: 461  NFGGKELVAELVS 473


>ref|XP_004300367.1| PREDICTED: pentatricopeptide repeat-containing protein At5g47360-like
            isoform 1 [Fragaria vesca subsp. vesca]
            gi|470128894|ref|XP_004300368.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At5g47360-like isoform 2 [Fragaria vesca subsp. vesca]
          Length = 421

 Score =  436 bits (1120), Expect = e-119
 Identities = 226/419 (53%), Positives = 286/419 (68%), Gaps = 3/419 (0%)
 Frame = +2

Query: 272  RGKLDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETL 451
            R  LD+  V  VLQRC  ++  +GLRFFIWAG+   YRHS +++ +AC L  I +    +
Sbjct: 2    RLNLDAKCVSQVLQRCYPTQSQLGLRFFIWAGVHSSYRHSYFMFSKACDLYKIREYPSLI 61

Query: 452  IGVLEAYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRL 631
              VLEAY  EGC VS+K FKV+ N+C+EAKL DEAL VLRKM EF  R D   +N+VIR 
Sbjct: 62   FDVLEAYSAEGCSVSVKMFKVLFNVCKEAKLADEALRVLRKMPEFGLRGDNVVYNVVIRQ 121

Query: 632  FSEKGDMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPN 811
            F EKGDMD+A  L+ EM+ ++LYPD+ITY+ MIKGFCNV RL+D  GLF FM+ +GC PN
Sbjct: 122  FCEKGDMDMAESLVKEMSEVELYPDLITYMVMIKGFCNVGRLDDACGLFMFMKENGCVPN 181

Query: 812  VVTYSTLLDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALA 991
            VV YS LLDG C+ G+++RA+ LL EMEKE   C  PNVVTYT+VIQ LC   R++EAL 
Sbjct: 182  VVVYSALLDGFCRFGDMERALTLLEEMEKEGGDC-GPNVVTYTTVIQCLCNKHRSVEALL 240

Query: 992  ILHRMEERGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLL 1171
            +L RME RGCLPNRVTVSTLI GL     VE AYKL+D+VV  G+VT   CYS+ +V L 
Sbjct: 241  VLDRMEARGCLPNRVTVSTLITGLVKEDQVEHAYKLVDRVVKSGSVTKTDCYSTFVVSLE 300

Query: 1172 QNNNMEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDA---FXXXXXXXXXXXXX 1342
            +    EEAEK+ R ML S +KP+ L C++ +K+ C +GR +DA   F             
Sbjct: 301  RVGRPEEAEKVLRMMLNSGVKPNSLVCTIMLKKCCLEGRMVDAYCLFGELEKMECLSSIE 360

Query: 1343 XXXYGVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLALRLT 1519
               Y +LL GLCQ+ HL EAA+LA VM ++GI+LK PYVD I E L  SG+ +L  +LT
Sbjct: 361  SDTYSILLLGLCQQRHLVEAAELARVMLSKGIKLKGPYVDIISEVLVKSGDEELVKQLT 419


>ref|XP_007212439.1| hypothetical protein PRUPE_ppa016777mg, partial [Prunus persica]
            gi|462408304|gb|EMJ13638.1| hypothetical protein
            PRUPE_ppa016777mg, partial [Prunus persica]
          Length = 394

 Score =  432 bits (1110), Expect = e-118
 Identities = 221/392 (56%), Positives = 273/392 (69%), Gaps = 3/392 (0%)
 Frame = +2

Query: 338  VGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGVLEAYRNEGCLVSIKTFKVI 517
            +GLRFFIWAGL   YRHS ++Y +AC L  I      +  VLEAYR EG +VS+K FKV+
Sbjct: 1    MGLRFFIWAGLHSSYRHSYFMYSQACELCEIKLNPSVIFDVLEAYRIEGRVVSLKAFKVV 60

Query: 518  LNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSEKGDMDVALELMNEMALIDL 697
             NLC+EAKL DEAL VLRK+ +F  RPDTT +N+VIRLF +KG+M+VA  L+ EM L+DL
Sbjct: 61   FNLCKEAKLADEALRVLRKIPDFGLRPDTTVYNVVIRLFCDKGNMNVAERLVKEMGLVDL 120

Query: 698  YPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVTYSTLLDGVCKSGNLDRAME 877
             PD+ITYV MI GFC V RL+D  GLF+ M+ HGC PN V YS LLDG C+S N++RA+E
Sbjct: 121  LPDLITYVVMINGFCKVGRLDDACGLFKVMKGHGCLPNAVVYSALLDGFCRSENMERALE 180

Query: 878  LLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALAILHRMEERGCLPNRVTVSTLIK 1057
            LL EMEKE   C +PNVVTYTSVIQ LC+  R+ EAL IL RME  GC P+RVTVS LIK
Sbjct: 181  LLTEMEKEGGDC-SPNVVTYTSVIQKLCDKGRSKEALVILDRMEACGCAPSRVTVSILIK 239

Query: 1058 GLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNNNMEEAEKLFRRMLASAMKP 1237
              C    VEEAYKLID+VV   +VT   CYSSL+V L +    EEAEK+ R ML S +KP
Sbjct: 240  SFCVEDQVEEAYKLIDRVVVGRSVTYSDCYSSLVVSLARGRKPEEAEKVLRMMLDSGLKP 299

Query: 1238 DGLSCSLFIKQLCSDGRFLDA---FXXXXXXXXXXXXXXXXYGVLLAGLCQKSHLTEAAK 1408
            + L+CS+ +K++C +GR +D    F                Y +LL GLC++ HL EAAK
Sbjct: 300  NSLACSIMLKKVCLEGRVIDGFCLFDELEKMECLSSIDSDTYSILLVGLCEQRHLLEAAK 359

Query: 1409 LANVMFNRGIRLKAPYVDGILEYLNISGEGDL 1504
            LA +M N+GI+LKAPYVD I E L  SG+ +L
Sbjct: 360  LARLMLNKGIKLKAPYVDSIAEILKKSGDEEL 391


>ref|XP_006359252.1| PREDICTED: pentatricopeptide repeat-containing protein At5g47360-like
            isoform X2 [Solanum tuberosum]
          Length = 487

 Score =  427 bits (1099), Expect = e-117
 Identities = 224/455 (49%), Positives = 297/455 (65%), Gaps = 3/455 (0%)
 Frame = +2

Query: 152  NPKFFSNARRHKSNPASENYWNLLRRIESDPN---LEKTLTKTRGKLDSSIVEGVLQRCS 322
            N   FS    H  + +S +    L  + ++ N   +E+TL+  R KLD+  V+ VL++C+
Sbjct: 17   NKPIFSLKLVHLLSTSSSSAGEFLSHLLNNKNVSGMERTLSSVRSKLDARCVDEVLEKCA 76

Query: 323  KSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGVLEAYRNEGCLVSIK 502
              +P + LRFFIWAGLQ  YRHS+++Y RA +LLG++ + + +   +EAYR +  + S K
Sbjct: 77   VDDPQMCLRFFIWAGLQSSYRHSSYMYSRAYKLLGVDSKPQIIRDAIEAYRLQKYVTSAK 136

Query: 503  TFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSEKGDMDVALELMNEM 682
             FKV+LNLCRE K     L VLRKM E NCRPDT  +N+VIRL  EKGDMD A+ LM EM
Sbjct: 137  MFKVVLNLCREGKDATLGLWVLRKMKESNCRPDTIMYNVVIRLLCEKGDMDEAMGLMREM 196

Query: 683  ALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVTYSTLLDGVCKSGNL 862
             LID++PDMITYV MIKG   V RLE+  GL + MR HGC PN VTYS LLDG+C+ G+L
Sbjct: 197  DLIDVHPDMITYVVMIKGLSEVGRLEEACGLTKAMRGHGCIPNTVTYSALLDGICRFGSL 256

Query: 863  DRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALAILHRMEERGCLPNRVTV 1042
            +RA+ELL EMEK+   C  PNVVTYT+V+QN  E  + +EAL+IL +M + GC PNRV +
Sbjct: 257  ERALELLREMEKDGGQC-EPNVVTYTTVVQNFVEKCQAIEALSILDQMRDFGCKPNRVLI 315

Query: 1043 STLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNNNMEEAEKLFRRMLA 1222
            STLI GLC  G VEEA+K+ID+V   G ++ D CYSSL++ L +   +EEAE  FRRML 
Sbjct: 316  STLIHGLCKEGHVEEAHKVIDRVAKSG-ISYDSCYSSLVLSLFRIGKVEEAEMFFRRMLT 374

Query: 1223 SAMKPDGLSCSLFIKQLCSDGRFLDAFXXXXXXXXXXXXXXXXYGVLLAGLCQKSHLTEA 1402
              +KPD  + S  I+ LC   R LD +                Y +L+AGLC+ +HL EA
Sbjct: 375  GGLKPDSFTSSTIIRWLCQQNRILDGYHLIEQSASVSSIDSDIYSILMAGLCEANHLAEA 434

Query: 1403 AKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLA 1507
            AKLA++M  + I+LK P V  + E L   G+ DLA
Sbjct: 435  AKLAHLMVEKRIQLKGPCVKNVTECLRHCGKEDLA 469


>ref|XP_006359251.1| PREDICTED: pentatricopeptide repeat-containing protein At5g47360-like
            isoform X1 [Solanum tuberosum]
          Length = 488

 Score =  427 bits (1099), Expect = e-117
 Identities = 224/455 (49%), Positives = 297/455 (65%), Gaps = 3/455 (0%)
 Frame = +2

Query: 152  NPKFFSNARRHKSNPASENYWNLLRRIESDPN---LEKTLTKTRGKLDSSIVEGVLQRCS 322
            N   FS    H  + +S +    L  + ++ N   +E+TL+  R KLD+  V+ VL++C+
Sbjct: 17   NKPIFSLKLVHLLSTSSSSAGEFLSHLLNNKNVSGMERTLSSVRSKLDARCVDEVLEKCA 76

Query: 323  KSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGVLEAYRNEGCLVSIK 502
              +P + LRFFIWAGLQ  YRHS+++Y RA +LLG++ + + +   +EAYR +  + S K
Sbjct: 77   VDDPQMCLRFFIWAGLQSSYRHSSYMYSRAYKLLGVDSKPQIIRDAIEAYRLQKYVTSAK 136

Query: 503  TFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSEKGDMDVALELMNEM 682
             FKV+LNLCRE K     L VLRKM E NCRPDT  +N+VIRL  EKGDMD A+ LM EM
Sbjct: 137  MFKVVLNLCREGKDATLGLWVLRKMKESNCRPDTIMYNVVIRLLCEKGDMDEAMGLMREM 196

Query: 683  ALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVTYSTLLDGVCKSGNL 862
             LID++PDMITYV MIKG   V RLE+  GL + MR HGC PN VTYS LLDG+C+ G+L
Sbjct: 197  DLIDVHPDMITYVVMIKGLSEVGRLEEACGLTKAMRGHGCIPNTVTYSALLDGICRFGSL 256

Query: 863  DRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALAILHRMEERGCLPNRVTV 1042
            +RA+ELL EMEK+   C  PNVVTYT+V+QN  E  + +EAL+IL +M + GC PNRV +
Sbjct: 257  ERALELLREMEKDGGQC-EPNVVTYTTVVQNFVEKCQAIEALSILDQMRDFGCKPNRVLI 315

Query: 1043 STLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNNNMEEAEKLFRRMLA 1222
            STLI GLC  G VEEA+K+ID+V   G ++ D CYSSL++ L +   +EEAE  FRRML 
Sbjct: 316  STLIHGLCKEGHVEEAHKVIDRVAKSG-ISYDSCYSSLVLSLFRIGKVEEAEMFFRRMLT 374

Query: 1223 SAMKPDGLSCSLFIKQLCSDGRFLDAFXXXXXXXXXXXXXXXXYGVLLAGLCQKSHLTEA 1402
              +KPD  + S  I+ LC   R LD +                Y +L+AGLC+ +HL EA
Sbjct: 375  GGLKPDSFTSSTIIRWLCQQNRILDGYHLIEQSASVSSIDSDIYSILMAGLCEANHLAEA 434

Query: 1403 AKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLA 1507
            AKLA++M  + I+LK P V  + E L   G+ DLA
Sbjct: 435  AKLAHLMVEKRIQLKGPCVKNVTECLRHCGKEDLA 469


>ref|XP_004245793.1| PREDICTED: pentatricopeptide repeat-containing protein At5g47360-like
            [Solanum lycopersicum]
          Length = 480

 Score =  421 bits (1081), Expect = e-115
 Identities = 227/474 (47%), Positives = 302/474 (63%), Gaps = 3/474 (0%)
 Frame = +2

Query: 95   KFMAINSFFRCICTPYRLKNPKFFSNARRHKSNPASENYWNLLRRIESDPN---LEKTLT 265
            K M + S  R       L  P  FS    H  + +S +    L  +  + N   +E+TL+
Sbjct: 3    KIMFLPSISRLFADTKSLNKP-IFSLKLVHLLSTSSSSAGEYLSHLLKNKNVSGMERTLS 61

Query: 266  KTRGKLDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRE 445
              R KLD+  V+ VL++C+  +P + LRFFIWAG Q  YRHS+++Y RA +LLG++++ +
Sbjct: 62   SVRSKLDARCVDEVLEKCAVDDPQMCLRFFIWAGFQSSYRHSSYMYSRAYKLLGVDRKPQ 121

Query: 446  TLIGVLEAYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVI 625
             +  ++EAYR    + S K FKV+LNLCRE K     L VLRKM E NCRPDTT +N+VI
Sbjct: 122  IIRDIIEAYRMHKYVTSAKMFKVVLNLCREGKDAILGLWVLRKMKELNCRPDTTMYNVVI 181

Query: 626  RLFSEKGDMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCF 805
            RL  EKGDMD A+ LM EM LID++PDMITYV MIKG   V RLE+  GL + MR HGC 
Sbjct: 182  RLLCEKGDMDEAMGLMREMDLIDVHPDMITYVVMIKGLSEVGRLEEACGLTKAMREHGCI 241

Query: 806  PNVVTYSTLLDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEA 985
            PN VTYS LLDG+C+ G+L+RA+ELL EMEK+   C  PNVVTYT+V+QN  E  +++EA
Sbjct: 242  PNTVTYSALLDGICRFGSLERALELLREMEKDGGQC-KPNVVTYTTVVQNFVEKCQSIEA 300

Query: 986  LAILHRMEERGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVC 1165
            L+IL +M + GC PNRV +STLI GLC  G VEEA+K+ID+V   G ++   CYSSL++ 
Sbjct: 301  LSILDQMMDFGCKPNRVLISTLIHGLCKEGHVEEAHKVIDRVAKSG-ISYGSCYSSLVLS 359

Query: 1166 LLQNNNMEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAFXXXXXXXXXXXXXX 1345
            L +   +E+AE  FRRML   +KPD  + S  I+ LC   R LD +              
Sbjct: 360  LFRIGKVEDAEMFFRRMLTGGLKPDSYTSSTIIRWLCQQNRILDGYHLIEQSASVSSIDS 419

Query: 1346 XXYGVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLA 1507
              Y VL+AGLC  +HL EAA LA++M  + I+LK P V  ++E L   G+ DLA
Sbjct: 420  DIYSVLMAGLCDANHLAEAANLAHLMVEKRIQLKGP-VKNVIECLRRCGKEDLA 472


>ref|XP_004515635.1| PREDICTED: pentatricopeptide repeat-containing protein At5g47360-like
            [Cicer arietinum]
          Length = 477

 Score =  416 bits (1070), Expect = e-113
 Identities = 208/428 (48%), Positives = 286/428 (66%), Gaps = 3/428 (0%)
 Frame = +2

Query: 245  NLEKTLTKTRGKLDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLL 424
            N+E +L+K + KLDS  V  VL +C   +  +G+RFFIWAG Q GYRHS ++Y +AC LL
Sbjct: 46   NIENSLSKKKPKLDSQCVIQVLSKCCPKQSQLGVRFFIWAGFQSGYRHSGFVYKKACNLL 105

Query: 425  GINQRRETLIGVLEAYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDT 604
            GI++  E +  ++++Y +EGC+V++  F+ +L LC+EA+L D  L VLRKM +FN +PDT
Sbjct: 106  GIDKNPEVICNLIKSYESEGCVVNVNMFREVLKLCKEAQLADLGLWVLRKMVDFNLQPDT 165

Query: 605  TNFNLVIRLFSEKGDMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRF 784
              +N+VIRLFS+KGD+++A +LM EM+L D+ PD+ITY+TMI+GFCN  RLED Y + + 
Sbjct: 166  VMYNIVIRLFSQKGDVEMAEKLMREMSLNDICPDLITYMTMIEGFCNAGRLEDAYNMLKV 225

Query: 785  MRSHGCFPNVVTYSTLLDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCE 964
            MR HGC PN+V  S +LDG C+ G++++A+ELL EMEK  D C  PNVVTYTS+IQ  C+
Sbjct: 226  MRVHGCSPNLVVLSAILDGFCRCGSMEKALELLDEMEKGGDCC--PNVVTYTSLIQGFCK 283

Query: 965  SSRTMEALAILHRMEERGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGC 1144
              +  EAL IL RM   GC  N VTV TLI+ LC  G VEEAYKL+DK V    V+    
Sbjct: 284  RGKWTEALGILDRMRAFGCFANHVTVFTLIESLCIEGRVEEAYKLVDKFVVEHGVSRGDS 343

Query: 1145 YSSLIVCLLQNNNMEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAF---XXXX 1315
            YSSL++ L++   +EEAEKLF+ ML   +KPD L+ SL +K+ C   R LD F       
Sbjct: 344  YSSLVISLIRIKKLEEAEKLFKEMLDGEIKPDTLASSLLLKEFCLKDRVLDGFYLLDAIE 403

Query: 1316 XXXXXXXXXXXXYGVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGE 1495
                        Y +LL GLC+++HL EA KLA +M  +G+ L+ PY D  ++ LN  GE
Sbjct: 404  NKGFLSSIDSDIYSILLVGLCRENHLMEATKLATIMLKKGVSLRPPYRDSAIDVLNKYGE 463

Query: 1496 GDLALRLT 1519
              +  +LT
Sbjct: 464  KGIVNQLT 471


>emb|CAN68810.1| hypothetical protein VITISV_001082 [Vitis vinifera]
          Length = 577

 Score =  389 bits (999), Expect = e-105
 Identities = 213/373 (57%), Positives = 258/373 (69%)
 Frame = +2

Query: 101  MAINSFFRCICTPYRLKNPKFFSNARRHKSNPASENYWNLLRRIESDPNLEKTLTKTRGK 280
            MA +S  R +    R KNP F +       +PA + Y +L +  +   N+EKTL   R K
Sbjct: 1    MAFSSVSRLLPYSIRHKNPNFSTAL-----SPAEKYYTHLQKYGD---NIEKTLPAVRAK 52

Query: 281  LDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGV 460
            LDSS V  VL RCS ++  +GLRFFIWAG+Q  YRHS+++Y +AC L  INQ    +I V
Sbjct: 53   LDSSCVNEVLNRCSLTQSQLGLRFFIWAGVQSYYRHSSYLYSKACELFRINQNPRAIIDV 112

Query: 461  LEAYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSE 640
            +EAYR EG +VS+KTF V+L+L REAKL DEAL +L+KM EFN R DT  +N VIRLF E
Sbjct: 113  IEAYRVEGTVVSVKTFNVVLHLLREAKLADEALWILKKMAEFNIRADTVAYNSVIRLFCE 172

Query: 641  KGDMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVT 820
            KGDMD+A  LM EM LIDLYP+MITYVTMIKGFCNV RLED   LF+ M+ HGC PNVV 
Sbjct: 173  KGDMDLAAGLMKEMGLIDLYPNMITYVTMIKGFCNVGRLEDACKLFKVMKGHGCSPNVVV 232

Query: 821  YSTLLDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVIQNLCESSRTMEALAILH 1000
            Y+ +LDGVC+ G+L+RA+ELLGEMEKES  C +PNVVTYTS+IQ+ CE  + MEAL IL 
Sbjct: 233  YTVILDGVCRFGSLERALELLGEMEKESGDC-SPNVVTYTSMIQSCCEKGKLMEALEILD 291

Query: 1001 RMEERGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNN 1180
            RM   GC PNRVTVS L+KG C  G VEEA+KLIDKVV  G V+  G        L Q  
Sbjct: 292  RMRACGCAPNRVTVSILMKGFCAEGRVEEAFKLIDKVVAGGNVSYVG--------LSQKR 343

Query: 1181 NMEEAEKLFRRML 1219
            +  EA KL R M+
Sbjct: 344  HSVEAVKLARLMV 356


>ref|XP_007131288.1| hypothetical protein PHAVU_011G001300g [Phaseolus vulgaris]
            gi|561004288|gb|ESW03282.1| hypothetical protein
            PHAVU_011G001300g [Phaseolus vulgaris]
          Length = 474

 Score =  387 bits (993), Expect = e-104
 Identities = 203/433 (46%), Positives = 281/433 (64%), Gaps = 4/433 (0%)
 Frame = +2

Query: 233  ESDPNLEKTLTKTRGKLDSSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRA 412
            +S+  +E +L+K + KLDS  V  VL  C   + ++G+RFF+WAG Q GYRHSA+ Y +A
Sbjct: 38   QSNGGVENSLSKVKPKLDSRCVIQVLNSCHPKQLLLGVRFFVWAGFQSGYRHSAYTYSKA 97

Query: 413  CRLLGINQRRETLIGVLEAYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGE-FN 589
            C+LLGI Q  + +  V+ +Y  EGC V++  F+ +L LC+EA+L D AL VLRKM + FN
Sbjct: 98   CKLLGIQQNPQIIRDVVLSYEAEGCSVTVNMFREVLKLCKEAQLADVALWVLRKMEQSFN 157

Query: 590  CRPDTTNFNLVIRLFSEKGDMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGY 769
             R DT  +N+VIRL  + GD++ A +L  EM+L  LYPD+ITY+ +++GFCN  R E  Y
Sbjct: 158  IRADTVMYNVVIRLCCKNGDIETAEKLTGEMSLNGLYPDLITYMAIVEGFCNAGRPEHAY 217

Query: 770  GLFRFMRSHGCFPNVVTYSTLLDGVCKSGNLDRAMELLGEMEKESDSCVAPNVVTYTSVI 949
             + + MR H C PN+V  S +LDG+C+SG+++ A+ELL EMEK  D   +PNVVTYTSVI
Sbjct: 218  SVLKVMRVHKCSPNLVLLSAILDGLCRSGSMEMALELLDEMEKGGDC--SPNVVTYTSVI 275

Query: 950  QNLCESSRTMEALAILHRMEERGCLPNRVTVSTLIKGLCNAGCVEEAYKLIDKVVGLGTV 1129
            Q+ C+  + +EAL IL RM+  GC  N VTV TL+  LC  G V EAYKLIDK V    V
Sbjct: 276  QSFCKRGQWVEALDILDRMKALGCHANHVTVFTLVDRLCVEGRVGEAYKLIDKFVVEHGV 335

Query: 1130 TTDGCYSSLIVCLLQNNNMEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAF-- 1303
            +   C SSL++ L++   ++EAEKLF  ML+  ++PD L+ SL +K+LC   + LD F  
Sbjct: 336  SYGNCCSSLVISLIRIKKLDEAEKLFMEMLSGDVRPDSLASSLLLKELCMKDQVLDGFHL 395

Query: 1304 -XXXXXXXXXXXXXXXXYGVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYL 1480
                             Y +LL GLCQ++HLTEA KLA +M  + + L+ PY DG ++ L
Sbjct: 396  LEAMENKGCLSTIDNGIYSILLVGLCQRNHLTEATKLAKIMLKKSVPLQPPYKDGAIDIL 455

Query: 1481 NISGEGDLALRLT 1519
              SGE DL  +LT
Sbjct: 456  IKSGEKDLVNQLT 468


>ref|XP_002863348.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297309183|gb|EFH39607.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 477

 Score =  386 bits (991), Expect = e-104
 Identities = 212/477 (44%), Positives = 293/477 (61%), Gaps = 5/477 (1%)
 Frame = +2

Query: 107  INSFFRCICTPYRLKNPKFFSNARRHKSNPASENYWNLLRRIESDPNLEKTLTKTRGKLD 286
            +N     +  P  L +P   S  R   +  A++  +  L+   S+P  EK L      LD
Sbjct: 2    LNHLISRLLPPSLLSHPSKISALRFSTTVSAADRLYGHLQGGTSNP--EKDLASANVNLD 59

Query: 287  SSIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGVLE 466
            SS +  V++RC  ++  +GLRFFIWAG Q  +RHS ++Y +AC  L I    + +  V+E
Sbjct: 60   SSSINEVIRRCDPNQFQLGLRFFIWAGTQSSHRHSPYMYTKACDFLKIRANPDLIKDVVE 119

Query: 467  AYRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSEKG 646
            AY+ E C VS+KT  ++L LC +AKL DEAL VLRK  EF+   DT  +NLVIRLF++KG
Sbjct: 120  AYKKEECFVSVKTMWIVLTLCNQAKLADEALWVLRKFPEFDLCADTVAYNLVIRLFADKG 179

Query: 647  DMDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVTYS 826
            D+ +A  LM EM  +DLYPD+ITY  MI G+CN  ++++ + L + M  H C  N VTYS
Sbjct: 180  DLSMADMLMKEMDCVDLYPDVITYTAMINGYCNAGKIDEAWKLAKEMSKHDCVLNTVTYS 239

Query: 827  TLLDGVCKSGNLDRAMELLGEMEKE-SDSCVAPNVVTYTSVIQNLCESSRTMEALAILHR 1003
             +L+GVCKSG+++ A+ELL EMEKE     ++PN VTYT VIQ+ CE  R  EAL +L R
Sbjct: 240  RILEGVCKSGDMETALELLAEMEKEDGGGLISPNAVTYTLVIQSFCEKKRIREALLVLDR 299

Query: 1004 MEERGCLPNRVTVSTLIKG-LCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNN 1180
            M +RGC PNRVT S LI+G L N   V++  KLIDK+V LG V+   C+SS  V L++  
Sbjct: 300  MGDRGCTPNRVTASVLIQGVLENDEDVKDLSKLIDKLVKLGGVSLSECFSSATVSLIRMK 359

Query: 1181 NMEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAF---XXXXXXXXXXXXXXXX 1351
              EEAEK+FR ML   ++PDGL+C+   ++LC   R+LD F                   
Sbjct: 360  RWEEAEKIFRLMLVRGIRPDGLACTHVFRELCLSERYLDCFVLYQEIEKEDVKSTMDSDI 419

Query: 1352 YGVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLALRLTS 1522
            Y VLL GLCQ+ +  EAAKLA  M ++ +RLK  +V+ I+E L  +G+ DL  R ++
Sbjct: 420  YAVLLLGLCQQGNSWEAAKLAKSMLDKKMRLKVSHVEKIIEALKKTGDEDLMSRFST 476


>ref|XP_006398426.1| hypothetical protein EUTSA_v10000870mg [Eutrema salsugineum]
            gi|557099515|gb|ESQ39879.1| hypothetical protein
            EUTSA_v10000870mg [Eutrema salsugineum]
          Length = 478

 Score =  385 bits (989), Expect = e-104
 Identities = 211/469 (44%), Positives = 293/469 (62%), Gaps = 6/469 (1%)
 Frame = +2

Query: 134  TPYRLKNPKFFSNARRHKSNPASENYWNLLRRIESDPNLEKTLTKTRGKLDSSIVEGVLQ 313
            +P     P   S  R   +  A+E  ++ L+  +++P  EK L   + KLD+S +  V++
Sbjct: 11   SPSFRSQPSKLSALRFSTTVSAAERLYDHLQGCKNNP--EKELASAKVKLDASTINEVIK 68

Query: 314  RCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGVLEAYRNEGCLV 493
            RCS ++  +GLRFFIWAG Q G+RHS ++Y +AC  L I    + +  V+EAY  E C V
Sbjct: 69   RCSPNQFQLGLRFFIWAGTQSGHRHSPYMYSKACEFLEIRANPDLIKDVVEAYGKEECFV 128

Query: 494  SIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSEKGDMDVALELM 673
            SIKT +++L+LC +AKL DEAL VLRK  +F    DT  +NLVIRLF++KGD+ +A  LM
Sbjct: 129  SIKTMRIVLSLCNQAKLADEALWVLRKYPDFGLSADTIAYNLVIRLFADKGDLSMAETLM 188

Query: 674  NEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVTYSTLLDGVCKS 853
             EM  IDL PD++TY ++I GFCN  ++++ + L + M  HGC  N VT+S +L+GVCKS
Sbjct: 189  KEMDCIDLCPDVMTYTSVINGFCNAGKIDEAWNLSKAMSKHGCVLNTVTFSRILEGVCKS 248

Query: 854  GNLDRAMELLGEMEKE-SDSCVAPNVVTYTSVIQNLCESSRTMEALAILHRMEERGCLPN 1030
            G+++RA+E LGEMEKE     ++PN VTYT VIQ  CE  R  EAL IL RM +RGCLPN
Sbjct: 249  GDMERALEFLGEMEKEDGGGFISPNAVTYTLVIQAFCEKKRVQEALMILDRMGDRGCLPN 308

Query: 1031 RVTVSTLIKGLC--NAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNNNMEEAEKL 1204
            RVT S LI+G+   N   V +  KLIDK+V LG V+   C+SS  V L++    EEAEK+
Sbjct: 309  RVTASVLIQGVVEENDEDVMDLSKLIDKLVKLGGVSLSECFSSATVSLIRLKKWEEAEKI 368

Query: 1205 FRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAF---XXXXXXXXXXXXXXXXYGVLLAGL 1375
            FR ML    +PDGL+CSL +++LCS  R+ D F                   + VLL GL
Sbjct: 369  FRLMLVQGNRPDGLACSLVLRELCSLERYQDCFFLYEEIEKANVISTIDSDIHSVLLLGL 428

Query: 1376 CQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLALRLTS 1522
            C+     EA KLA  M ++ +RL   +V  I++ L  +G+ DL  RL++
Sbjct: 429  CEHGCSWEATKLAKWMLDKKMRLNVSHVKRIIQALKKTGDEDLMRRLST 477


>ref|NP_199547.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75180684|sp|Q9LVS3.1|PP422_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g47360 gi|8809619|dbj|BAA97170.1| unnamed protein
            product [Arabidopsis thaliana]
            gi|332008119|gb|AED95502.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 477

 Score =  383 bits (983), Expect = e-103
 Identities = 213/475 (44%), Positives = 292/475 (61%), Gaps = 5/475 (1%)
 Frame = +2

Query: 110  NSFFRCICTPYRLKNPKFFSNARRHKSNPASENYWNLLRRIESDPNLEKTLTKTRGKLDS 289
            NS    + +P     P   S  R   +  A+E  +  L+   S  NLEK L     +LDS
Sbjct: 3    NSLISRLVSPSLRSQPSKISALRFLTTVSAAERLYGQLQGCTS--NLEKELASANVQLDS 60

Query: 290  SIVEGVLQRCSKSEPIVGLRFFIWAGLQPGYRHSAWIYGRACRLLGINQRRETLIGVLEA 469
            S +  VL+RC  ++   GLRFFIWAG    +RHSA++Y +AC +L I  + + +  V+E+
Sbjct: 61   SCINEVLRRCDPNQFQSGLRFFIWAGTLSSHRHSAYMYTKACDILKIRAKPDLIKYVIES 120

Query: 470  YRNEGCLVSIKTFKVILNLCREAKLPDEALEVLRKMGEFNCRPDTTNFNLVIRLFSEKGD 649
            YR E C V++KT +++L LC +A L DEAL VLRK  EFN   DT  +NLVIRLF++KGD
Sbjct: 121  YRKEECFVNVKTMRIVLTLCNQANLADEALWVLRKFPEFNVCADTVAYNLVIRLFADKGD 180

Query: 650  MDVALELMNEMALIDLYPDMITYVTMIKGFCNVDRLEDGYGLFRFMRSHGCFPNVVTYST 829
            +++A  L+ EM  + LYPD+ITY +MI G+CN  +++D + L + M  H C  N VTYS 
Sbjct: 181  LNIADMLIKEMDCVGLYPDVITYTSMINGYCNAGKIDDAWRLAKEMSKHDCVLNSVTYSR 240

Query: 830  LLDGVCKSGNLDRAMELLGEMEKE-SDSCVAPNVVTYTSVIQNLCESSRTMEALAILHRM 1006
            +L+GVCKSG+++RA+ELL EMEKE     ++PN VTYT VIQ  CE  R  EAL +L RM
Sbjct: 241  ILEGVCKSGDMERALELLAEMEKEDGGGLISPNAVTYTLVIQAFCEKRRVEEALLVLDRM 300

Query: 1007 EERGCLPNRVTVSTLIKG-LCNAGCVEEAYKLIDKVVGLGTVTTDGCYSSLIVCLLQNNN 1183
              RGC+PNRVT   LI+G L N   V+   KLIDK+V LG V+   C+SS  V L++   
Sbjct: 301  GNRGCMPNRVTACVLIQGVLENDEDVKALSKLIDKLVKLGGVSLSECFSSATVSLIRMKR 360

Query: 1184 MEEAEKLFRRMLASAMKPDGLSCSLFIKQLCSDGRFLDAF---XXXXXXXXXXXXXXXXY 1354
             EEAEK+FR ML   ++PDGL+CS   ++LC   R+LD F                   +
Sbjct: 361  WEEAEKIFRLMLVRGVRPDGLACSHVFRELCLLERYLDCFLLYQEIEKKDVKSTIDSDIH 420

Query: 1355 GVLLAGLCQKSHLTEAAKLANVMFNRGIRLKAPYVDGILEYLNISGEGDLALRLT 1519
             VLL GLCQ+ +  EAAKLA  M ++ +RLK  +V+ I+E L  +G+ DL  R +
Sbjct: 421  AVLLLGLCQQGNSWEAAKLAKSMLDKKMRLKVSHVEKIIEALKKTGDEDLMSRFS 475

