BLASTX nr result

ID: Mentha28_contig00008011 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00008011
         (1566 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21693.1| hypothetical protein MIMGU_mgv1a024440mg, partial...   654   0.0  
ref|XP_006344442.1| PREDICTED: pentatricopeptide repeat-containi...   617   e-174
ref|XP_004236239.1| PREDICTED: pentatricopeptide repeat-containi...   608   e-171
ref|XP_002274114.2| PREDICTED: pentatricopeptide repeat-containi...   590   e-166
ref|XP_007023347.1| Pentatricopeptide repeat superfamily protein...   583   e-164
ref|XP_006465271.1| PREDICTED: pentatricopeptide repeat-containi...   579   e-162
gb|EPS66144.1| hypothetical protein M569_08630, partial [Genlise...   573   e-161
ref|XP_007226356.1| hypothetical protein PRUPE_ppa022331mg [Prun...   563   e-158
ref|XP_004158824.1| PREDICTED: pentatricopeptide repeat-containi...   557   e-156
ref|XP_002306508.1| pentatricopeptide repeat-containing family p...   557   e-156
gb|EXB67206.1| hypothetical protein L484_025684 [Morus notabilis]     553   e-155
ref|XP_004136096.1| PREDICTED: uncharacterized protein LOC101205...   531   e-148
ref|XP_004297847.1| PREDICTED: pentatricopeptide repeat-containi...   528   e-147
ref|XP_007023349.1| Pentatricopeptide repeat (PPR) superfamily p...   524   e-146
ref|NP_178170.1| pentatricopeptide repeat-containing protein [Ar...   518   e-144
ref|XP_002887822.1| pentatricopeptide repeat-containing protein ...   516   e-144
ref|XP_006302258.1| hypothetical protein CARUB_v10020295mg [Caps...   514   e-143
ref|XP_003597983.1| Pentatricopeptide repeat-containing protein ...   514   e-143
ref|XP_007150807.1| hypothetical protein PHAVU_005G182300g [Phas...   513   e-142
ref|XP_006389809.1| hypothetical protein EUTSA_v10018541mg [Eutr...   510   e-142

>gb|EYU21693.1| hypothetical protein MIMGU_mgv1a024440mg, partial [Mimulus guttatus]
          Length = 412

 Score =  654 bits (1686), Expect = 0.0
 Identities = 307/396 (77%), Positives = 348/396 (87%)
 Frame = +3

Query: 378  SDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKFFEFEMAWV 557
            SDF+P TVLETLN YANDWKLALEFF+W E + G++HT  TLNRI+DILGKFFEF+ AW 
Sbjct: 2    SDFQPDTVLETLNCYANDWKLALEFFNWAETQTGYQHTTQTLNRIIDILGKFFEFDAAWK 61

Query: 558  LIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFSNLIDALC 737
            LIER+ KN C  PDHTTFR+LFKRYVSAHLV+EAI+ F + ++YNLRDE S SNLIDALC
Sbjct: 62   LIERMSKNPCSSPDHTTFRILFKRYVSAHLVEEAIEIFGKLDEYNLRDETSLSNLIDALC 121

Query: 738  EYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKCREFWEEMD 917
            EYKHV+EAEEL FK   GD  D ++F F   +TKI+NMILRGWFKM+WW KCREFWE MD
Sbjct: 122  EYKHVVEAEELCFKSKNGDNVDGNVFGFSVESTKIHNMILRGWFKMKWWSKCREFWEAMD 181

Query: 918  KRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRAIGISEGVD 1097
            KRGV KDL+SYSIYMDIQCKS KPWKAVKLYKEMKKKG+ LD VAYNTVIRAIGISEG D
Sbjct: 182  KRGVTKDLFSYSIYMDIQCKSGKPWKAVKLYKEMKKKGVRLDVVAYNTVIRAIGISEGAD 241

Query: 1098 VAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPNVVTYHCFF 1277
            VAV LYKEM ELGC+PNVVTFNTILKLLC+NGRY EAH++LDLM KKG EP+V+TYH FF
Sbjct: 242  VAVSLYKEMIELGCEPNVVTFNTILKLLCQNGRYREAHQLLDLMPKKGCEPDVITYHSFF 301

Query: 1278 VCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKMEEHGLSPDE 1457
             CLEKP+EILK F+RM++SG++P+MDTYVMLMRKFGRWGFLRPV+ +WKKMEEHGLSPDE
Sbjct: 302  TCLEKPKEILKMFNRMVESGVQPRMDTYVMLMRKFGRWGFLRPVIDVWKKMEEHGLSPDE 361

Query: 1458 FAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
             AYNALID LLQKGLVD+AR+YD+EML KG+SAKPR
Sbjct: 362  SAYNALIDALLQKGLVDIARRYDEEMLSKGISAKPR 397


>ref|XP_006344442.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Solanum tuberosum]
          Length = 467

 Score =  617 bits (1590), Expect = e-174
 Identities = 303/464 (65%), Positives = 368/464 (79%), Gaps = 1/464 (0%)
 Frame = +3

Query: 177  MLPSILSKFHHFRSHRSILLSKFKISLFYHTSPRSNPQFPPRTTSPPTQTHRSTIFAVNP 356
            ML S++S++  F    S+LL++F+ +L +H    S P   P TTS P     ++    +P
Sbjct: 1    MLSSVISRYSTFLPS-SLLLAQFE-TLLHHHGYHSTPSNAPNTTSFPNYDDPNS----SP 54

Query: 357  APTLIERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKFF 536
            + T        P  +LETL+ Y NDW+ ALEFF+W E +CGF HT+ T N+++DILGKFF
Sbjct: 55   SSTSASSDPLNPTIMLETLSCYNNDWRRALEFFNWAETQCGFHHTSQTCNQLIDILGKFF 114

Query: 537  EFEMAWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFS 716
            EF+ AW LIE++R     +PDHTTFRVLFKRYVSAH+VKEAID F +  ++NL+D++SFS
Sbjct: 115  EFDAAWSLIEKMRSVSS-MPDHTTFRVLFKRYVSAHMVKEAIDMFDKMEEFNLKDQVSFS 173

Query: 717  NLIDALCEYKHVIEAEELFFKKNWGD-QYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKC 893
            NLIDALCEYKHVIEAE+L F KN  D +Y     CF + +TKI NM+LRGWFKM WW KC
Sbjct: 174  NLIDALCEYKHVIEAEDLCFPKNKNDVKYS----CF-KVDTKICNMLLRGWFKMSWWGKC 228

Query: 894  REFWEEMDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRA 1073
            R+FWEEMD RGV+KDLYSYSIYMD+QCKS KPWKAVKLYKEMKKKGI LD +AYNTVIRA
Sbjct: 229  RQFWEEMDTRGVQKDLYSYSIYMDVQCKSGKPWKAVKLYKEMKKKGINLDVIAYNTVIRA 288

Query: 1074 IGISEGVDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPN 1253
            IGIS+GVDVA +L +EM ELGC+PNV T+NT++KL+CENGRY +A+KVL  M  KG EPN
Sbjct: 289  IGISDGVDVAAKLCQEMIELGCKPNVSTYNTLIKLMCENGRYRDAYKVLSQMPHKGCEPN 348

Query: 1254 VVTYHCFFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKME 1433
            V+TY+ FF CLEKPREILK FDRM++SG+RP+MDTYVMLMRKFGRWGFLRPV +LW+KME
Sbjct: 349  VITYNSFFGCLEKPREILKLFDRMIESGVRPRMDTYVMLMRKFGRWGFLRPVFILWEKME 408

Query: 1434 EHGLSPDEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            + GLSPD  AYNALID L+QKG+VDMARKYD+EML KGLSAKPR
Sbjct: 409  KQGLSPDASAYNALIDALVQKGMVDMARKYDEEMLAKGLSAKPR 452


>ref|XP_004236239.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Solanum lycopersicum]
          Length = 467

 Score =  608 bits (1568), Expect = e-171
 Identities = 300/464 (64%), Positives = 367/464 (79%), Gaps = 1/464 (0%)
 Frame = +3

Query: 177  MLPSILSKFHHFRSHRSILLSKFKISLFYHTSPRSNPQFPPRTTSPPTQTHRSTIFAVNP 356
            ML S++S+   F    S+LL +F+ +L +H    S P   P+ TS P     ++    +P
Sbjct: 1    MLSSVISRSSTFLPS-SLLLVQFE-TLLHHHGYHSTPSKAPKPTSFPNYDDPNS----SP 54

Query: 357  APTLIERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKFF 536
            + T        P  VLETL+ Y NDW+ ALEFF+W E +CGF HT+ T N+++DILGKFF
Sbjct: 55   SSTSASSDPLNPTIVLETLSCYNNDWRRALEFFNWAETQCGFHHTSQTSNQLIDILGKFF 114

Query: 537  EFEMAWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFS 716
            EF+ AW LIE++R     +PDHTTFRVLFKRYVSAH+VKEAID F +  ++NL+D++SFS
Sbjct: 115  EFDAAWSLIEKMRSVSS-MPDHTTFRVLFKRYVSAHMVKEAIDMFDKMEEFNLKDQVSFS 173

Query: 717  NLIDALCEYKHVIEAEELFFKKNWGD-QYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKC 893
            NLIDALCEYKHVIEAE+L F KN  D +Y     CF + +TKI NM+LRGWFKM WW KC
Sbjct: 174  NLIDALCEYKHVIEAEDLCFPKNKNDVKYS----CF-KVDTKICNMLLRGWFKMSWWGKC 228

Query: 894  REFWEEMDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRA 1073
            R+FWEEMD RGV+KDLYSYSIYMD+QCKS KPWKAVKLYKEMKKKGI+LD +AYNTVIRA
Sbjct: 229  RQFWEEMDTRGVQKDLYSYSIYMDVQCKSGKPWKAVKLYKEMKKKGIDLDVIAYNTVIRA 288

Query: 1074 IGISEGVDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPN 1253
            IGI++GVDVA +L +EM ELGC+PNV T+NT++KL+CENGRY +A+KVL+ M +KG EPN
Sbjct: 289  IGIADGVDVAAKLCQEMIELGCKPNVSTYNTLIKLMCENGRYRDAYKVLNQMPQKGCEPN 348

Query: 1254 VVTYHCFFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKME 1433
            V+TY+ FF CLEKPREIL  FDRM++SG+RP+MDTYVMLMRKFGRW FLRPV +LW+KME
Sbjct: 349  VITYNSFFGCLEKPREILTLFDRMIESGVRPRMDTYVMLMRKFGRWEFLRPVFILWEKME 408

Query: 1434 EHGLSPDEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            + GLSPD  AYNALID L+QKG+VDMARKYD+EML KGLSAKPR
Sbjct: 409  KQGLSPDASAYNALIDALVQKGMVDMARKYDEEMLAKGLSAKPR 452


>ref|XP_002274114.2| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Vitis vinifera]
          Length = 571

 Score =  590 bits (1522), Expect = e-166
 Identities = 271/396 (68%), Positives = 335/396 (84%)
 Frame = +3

Query: 378  SDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKFFEFEMAWV 557
            + F+ +TV +TL+ YANDWK ALEFFDWV+ +CGF HT  T N ++DILGKFFEF++ WV
Sbjct: 174  TSFDHSTVRQTLSCYANDWKRALEFFDWVQTQCGFNHTTDTYNGMIDILGKFFEFDLIWV 233

Query: 558  LIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFSNLIDALC 737
            LI+R++ +    P+H TFR +FKRY +AHLV+EA++A+ R+ ++NLRDE S+SNLIDALC
Sbjct: 234  LIQRMKADPVAYPNHVTFRFVFKRYAAAHLVEEAMNAYYRTEEFNLRDETSYSNLIDALC 293

Query: 738  EYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKCREFWEEMD 917
            EYKHVIEAEELF K++    ++DD+        KIYN+ILRGWFKM WW+KCREFWEEMD
Sbjct: 294  EYKHVIEAEELFLKESKDLVFNDDV--------KIYNIILRGWFKMGWWKKCREFWEEMD 345

Query: 918  KRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRAIGISEGVD 1097
            +RGV K LYSYSIYMDIQCKS KPW+AVKLYKEMKKKGI LD VAYNTVIRAIG+SEGVD
Sbjct: 346  RRGVCKSLYSYSIYMDIQCKSGKPWRAVKLYKEMKKKGIRLDVVAYNTVIRAIGLSEGVD 405

Query: 1098 VAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPNVVTYHCFF 1277
             ++R+++EM E+GC+PNVVT+NTI+KLLCENGR  EA+ V D M +KG  PNV+TYHCFF
Sbjct: 406  FSIRVFREMKEVGCEPNVVTYNTIIKLLCENGRIREAYGVFDQMREKGYAPNVITYHCFF 465

Query: 1278 VCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKMEEHGLSPDE 1457
             C+EKP++IL+ FDRM++SG+RP+MDTYVMLM+KFGRWGFLRPV ++WKKMEE G SPD 
Sbjct: 466  GCIEKPKQILRTFDRMINSGVRPRMDTYVMLMKKFGRWGFLRPVFIVWKKMEEQGCSPDA 525

Query: 1458 FAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
             AYNALID L+QKG+VD+ARKY++EML KGLSAKPR
Sbjct: 526  CAYNALIDALVQKGMVDLARKYEEEMLAKGLSAKPR 561


>ref|XP_007023347.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao] gi|508778713|gb|EOY25969.1| Pentatricopeptide
            repeat superfamily protein isoform 1 [Theobroma cacao]
          Length = 481

 Score =  583 bits (1504), Expect = e-164
 Identities = 277/408 (67%), Positives = 334/408 (81%)
 Frame = +3

Query: 342  FAVNPAPTLIERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDI 521
            F   P   L ++ +F+  TV ETL+ Y+NDWK ALEFF+WVE +C F HT  T N+++DI
Sbjct: 38   FHSQPPNPLPDQPNFDHQTVRETLSCYSNDWKRALEFFNWVETQCQFPHTTETFNKMLDI 97

Query: 522  LGKFFEFEMAWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRD 701
            LGK FEF+++W LI+R++   C +PDH TFR+LFKRY++AHLVKEAI  F R  ++NL+D
Sbjct: 98   LGKSFEFDLSWDLIDRMKNKPCSIPDHATFRILFKRYITAHLVKEAISTFDRLEEFNLKD 157

Query: 702  EISFSNLIDALCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEW 881
            EISF NL+DALCEYKHVIEA+EL F   +G   +  L   D   TKI+NMILRGWFKM W
Sbjct: 158  EISFCNLVDALCEYKHVIEAQELCF---FGKIKEIGLSVND---TKIHNMILRGWFKMGW 211

Query: 882  WRKCREFWEEMDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNT 1061
            W KCREFW+EMDK+GVKKDL+SYSIYMDI CKS KPWKAVKLYKEMKKKG++LD VAYNT
Sbjct: 212  WSKCREFWQEMDKKGVKKDLHSYSIYMDIMCKSGKPWKAVKLYKEMKKKGMKLDVVAYNT 271

Query: 1062 VIRAIGISEGVDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKG 1241
            VIRAIGISEG D  V +++EM +LGC+PNVVT+NT++KLLCENGR  +A+ VLD M KK 
Sbjct: 272  VIRAIGISEGADFGVGVFREMRDLGCEPNVVTYNTVIKLLCENGRVRQAYAVLDQMLKKD 331

Query: 1242 VEPNVVTYHCFFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLW 1421
              P+V+TYHCFF CLEKPREILK FD M+ +GI+P+MDTYVMLMRKFGRWGFLRPV M+W
Sbjct: 332  CAPDVITYHCFFGCLEKPREILKLFDLMITNGIQPRMDTYVMLMRKFGRWGFLRPVFMVW 391

Query: 1422 KKMEEHGLSPDEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            KKMEE G SP+EFAYNALID L+QKG++DMARKYD+EML KGLS+KPR
Sbjct: 392  KKMEELGSSPNEFAYNALIDALIQKGMLDMARKYDEEMLEKGLSSKPR 439


>ref|XP_006465271.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Citrus sinensis]
          Length = 455

 Score =  579 bits (1492), Expect = e-162
 Identities = 274/405 (67%), Positives = 327/405 (80%)
 Frame = +3

Query: 351  NPAPTLIERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGK 530
            NP P      +F  +TV ETL+ YANDWK ALEFF+WVE +C F HT  T N ++DILGK
Sbjct: 37   NPKPQS-NPHNFHQSTVRETLSCYANDWKRALEFFNWVETDCHFTHTTDTYNSVIDILGK 95

Query: 531  FFEFEMAWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEIS 710
            FFEF+++W LI R++ N   +P+H TFR++FKRYV+AHLV EA+  F + +++ L+DE+S
Sbjct: 96   FFEFDLSWNLIHRMKDNPSSIPNHATFRIMFKRYVTAHLVNEAMGTFNKLDEFGLKDEVS 155

Query: 711  FSNLIDALCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRK 890
            + NL+DALCEYKHVIEA+EL F +N    +       +   TKIYNMILRGWFKM WW K
Sbjct: 156  YCNLVDALCEYKHVIEAQELCFGENKNVGFSG---LVEMNKTKIYNMILRGWFKMSWWGK 212

Query: 891  CREFWEEMDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIR 1070
            CREFWEEMDKRGV KDL+SYSIYMDI CKS KPWKAVKLYKEMKKK I++D VAYNTVIR
Sbjct: 213  CREFWEEMDKRGVVKDLHSYSIYMDIMCKSGKPWKAVKLYKEMKKKRIKMDVVAYNTVIR 272

Query: 1071 AIGISEGVDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEP 1250
            A+GISEGVD A+R+Y EM E+GCQP+VVT NT++KLLCENGR  EA+ VL  M KKG  P
Sbjct: 273  AVGISEGVDFAMRVYCEMREMGCQPSVVTCNTVIKLLCENGRVKEAYAVLAEMPKKGCVP 332

Query: 1251 NVVTYHCFFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKM 1430
            +V+TYHCFF CLEKPREIL  FDRM++SGIRPKMDTYVML+RKFGRWGFLRPV ++WKKM
Sbjct: 333  DVITYHCFFRCLEKPREILGLFDRMIESGIRPKMDTYVMLLRKFGRWGFLRPVFVVWKKM 392

Query: 1431 EEHGLSPDEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            EE G SPDEFAYNAL+D L+ KG++DMARKYD+EM  KGLSAKPR
Sbjct: 393  EELGCSPDEFAYNALVDALIDKGMLDMARKYDEEMFAKGLSAKPR 437


>gb|EPS66144.1| hypothetical protein M569_08630, partial [Genlisea aurea]
          Length = 403

 Score =  573 bits (1478), Expect = e-161
 Identities = 278/402 (69%), Positives = 325/402 (80%)
 Frame = +3

Query: 354  PAPTLIERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKF 533
            PAP     SDF PATVLETLN YANDWKLALEFF+W E + GF HTA T NR+VD LGKF
Sbjct: 1    PAPFQFRDSDFNPATVLETLNCYANDWKLALEFFNWSETQSGFVHTAETFNRMVDTLGKF 60

Query: 534  FEFEMAWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISF 713
            FEFE+AW LI+R+ ++    P+HTTFRVL KRYVSA LVKEAIDAF R ++YNLRDE SF
Sbjct: 61   FEFELAWSLIQRMNESPSSPPNHTTFRVLCKRYVSARLVKEAIDAFRRLDEYNLRDETSF 120

Query: 714  SNLIDALCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKC 893
            S LID+LCEY+HVI+AE+L FK+N   +YD     FD   TKIYNMILRG+FK++WW KC
Sbjct: 121  SILIDSLCEYRHVIDAEDLCFKRNRDTEYDGVFAGFDVETTKIYNMILRGFFKIQWWGKC 180

Query: 894  REFWEEMDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRA 1073
            R FWE MD++G++KDL+SYSIYMDIQCKS KP KA+KL+KEMK+KGI+ D VAYNTVIRA
Sbjct: 181  RAFWEAMDRKGIQKDLFSYSIYMDIQCKSGKPCKAMKLFKEMKRKGIKPDAVAYNTVIRA 240

Query: 1074 IGISEGVDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPN 1253
             G   GVD ++R+YK+M E GC P++VTFNTILKLLCENGRY EA ++L  M +KG  PN
Sbjct: 241  AGEQRGVDDSLRIYKQMVEAGCSPSLVTFNTILKLLCENGRYGEAREMLSWMRRKGCPPN 300

Query: 1254 VVTYHCFFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKME 1433
            VVTY+CFF  LEKP EILK FD M+  GIRP+MDTYVMLM KFGRWGFLRPV+ +W+KME
Sbjct: 301  VVTYNCFFGSLEKPGEILKLFDEMVGRGIRPRMDTYVMLMSKFGRWGFLRPVVYVWEKME 360

Query: 1434 EHGLSPDEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAK 1559
            E G SPDEFAYNALID  + KGLV+ ARKYDDEM  KG+SAK
Sbjct: 361  ELGDSPDEFAYNALIDAFMNKGLVEEARKYDDEMYRKGISAK 402


>ref|XP_007226356.1| hypothetical protein PRUPE_ppa022331mg [Prunus persica]
            gi|462423292|gb|EMJ27555.1| hypothetical protein
            PRUPE_ppa022331mg [Prunus persica]
          Length = 455

 Score =  563 bits (1451), Expect = e-158
 Identities = 270/424 (63%), Positives = 333/424 (78%)
 Frame = +3

Query: 294  PPRTTSPPTQTHRSTIFAVNPAPTLIERSDFEPATVLETLNSYANDWKLALEFFDWVEIE 473
            P   ++ P   H  T    NP P     + ++  TV ETL+SY NDWK AL+FF+W+E E
Sbjct: 30   PHSVSTKPISIHNPT----NPEPQS-SSTIYDHTTVRETLSSYCNDWKKALDFFNWLETE 84

Query: 474  CGFEHTAHTLNRIVDILGKFFEFEMAWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVK 653
            C F HT  T NR++DILGKFFEFE+ W LI+++++N   +PDHTTFR+LFKRYVSAHLVK
Sbjct: 85   CHFLHTTVTYNRMLDILGKFFEFELCWNLIQKMKQNPVSVPDHTTFRILFKRYVSAHLVK 144

Query: 654  EAIDAFVRSNDYNLRDEISFSNLIDALCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGN 833
            EAID + R  ++ L+DE S+ NLIDALCEYKHVIEA+EL F KN    +D         +
Sbjct: 145  EAIDTYNRLEEFGLKDETSYCNLIDALCEYKHVIEAQELCFWKNKDLGFDK--------S 196

Query: 834  TKIYNMILRGWFKMEWWRKCREFWEEMDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYK 1013
            TK+YN++LRGW KM WW KCR+FWEEMD+RGV+KDL+SYSIYMDI CKS KPWKAVKLYK
Sbjct: 197  TKLYNLLLRGWLKMGWWGKCRDFWEEMDRRGVRKDLHSYSIYMDILCKSGKPWKAVKLYK 256

Query: 1014 EMKKKGIELDTVAYNTVIRAIGISEGVDVAVRLYKEMAELGCQPNVVTFNTILKLLCENG 1193
            EMK KGI+LD VAYNTVIRAIG+S+GVD ++RL +EM ELGCQPNV T+NTI+KLLCENG
Sbjct: 257  EMKNKGIKLDVVAYNTVIRAIGLSDGVDFSMRLLREMKELGCQPNVGTYNTIIKLLCENG 316

Query: 1194 RYIEAHKVLDLMSKKGVEPNVVTYHCFFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLM 1373
            R  EA  +L  M + G+ P+V+TYHC F  LEKP EIL+ FDRM +SG++PKMDT+VMLM
Sbjct: 317  RCKEAFSLLHQMPRMGLLPDVITYHCIFKHLEKPNEILRLFDRMTESGVQPKMDTFVMLM 376

Query: 1374 RKFGRWGFLRPVLMLWKKMEEHGLSPDEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLS 1553
            RKFGRWGFLRP+ ++W +ME+ G SPDE AYNALID L++KG++DMAR+YD+EML KGLS
Sbjct: 377  RKFGRWGFLRPMFLVWNRMEKLGCSPDESAYNALIDALVEKGMLDMARQYDEEMLAKGLS 436

Query: 1554 AKPR 1565
            AKPR
Sbjct: 437  AKPR 440


>ref|XP_004158824.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Cucumis sativus]
          Length = 450

 Score =  557 bits (1435), Expect = e-156
 Identities = 264/408 (64%), Positives = 322/408 (78%)
 Frame = +3

Query: 342  FAVNPAPTLIERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDI 521
            F  N +      ++F+P TV E L+SY NDWK + EFF+WVE EC F+HT  T NR++DI
Sbjct: 31   FKFNTSERRPTHTNFDPFTVREALDSYCNDWKRSYEFFNWVESECKFDHTTETYNRMLDI 90

Query: 522  LGKFFEFEMAWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRD 701
            LGKFFEF+++WVLI R+R++    PDH TFR+LFKRY  AHLV EAI A+ R  ++ LRD
Sbjct: 91   LGKFFEFDLSWVLINRMRQSPSASPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRD 150

Query: 702  EISFSNLIDALCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEW 881
            E SF NLIDALCE +HV EA+EL F KN        L C  + +TKI+N+ILRGW KM W
Sbjct: 151  ETSFCNLIDALCESRHVDEAQELCFGKN------RKLDC--DSSTKIHNLILRGWLKMGW 202

Query: 882  WRKCREFWEEMDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNT 1061
            W KCR+FWEEMDK+GV+KDL+SYSIYMDIQCKS KPWKAVKLYKEMKKKG++LD VAYNT
Sbjct: 203  WSKCRDFWEEMDKKGVRKDLHSYSIYMDIQCKSGKPWKAVKLYKEMKKKGMKLDVVAYNT 262

Query: 1062 VIRAIGISEGVDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKG 1241
            VI A+GISEGVD A R++ EM E+GC+PNVVT NT++KL CENGR+ +AH +LD M K+ 
Sbjct: 263  VIHAVGISEGVDFASRVFHEMKEMGCKPNVVTCNTVIKLFCENGRFKDAHMMLDQMLKRD 322

Query: 1242 VEPNVVTYHCFFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLW 1421
             +PNV+TYHCFF  LEKP+EIL  FDRM+  G+ PKMDTYVML+RKFGRWGFLRPV ++W
Sbjct: 323  CQPNVITYHCFFRSLEKPKEILVLFDRMIKYGVHPKMDTYVMLLRKFGRWGFLRPVFLVW 382

Query: 1422 KKMEEHGLSPDEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
             KMEE G SP+E AYNALID L++KG++DMARKYD+EM+ KGLS K R
Sbjct: 383  NKMEELGCSPNECAYNALIDALVEKGMIDMARKYDEEMVAKGLSPKLR 430


>ref|XP_002306508.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222855957|gb|EEE93504.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 439

 Score =  557 bits (1435), Expect = e-156
 Identities = 271/433 (62%), Positives = 329/433 (75%), Gaps = 1/433 (0%)
 Frame = +3

Query: 270  SPRSNPQFPPRTTSPPTQTHRSTIFAVNPAPTLIERSDFEPATVLETLNSYANDWKLALE 449
            +P      P  T +P    H  T     P P   +  + + +TV +TL+ Y NDWK AL+
Sbjct: 11   NPAKTLLLPYSTNTPTFHFHSRT-----PNPPQSDPLNLDSSTVFQTLSCYNNDWKRALD 65

Query: 450  FFDWVEIECGFEHTAHTLNRIVDILGKFFEFEMAWVLIERLRKNGCCLPDHTTFRVLFKR 629
            FF+WVE E  F+HT  T NR++DILGKFFEF+++W LI+R+R N    P+HTTFRVLF R
Sbjct: 66   FFNWVETESQFQHTTETYNRMIDILGKFFEFDLSWDLIQRMRNNPFSTPNHTTFRVLFHR 125

Query: 630  YVSAHLVKEAIDAFV-RSNDYNLRDEISFSNLIDALCEYKHVIEAEELFFKKNWGDQYDD 806
            Y+SAHLV EA+  +  R  ++ L+DE S+  L+DALCEYKHVIEA EL F  N       
Sbjct: 126  YISAHLVNEAVSVYEDRLKEFGLKDETSYCILVDALCEYKHVIEAHELCFGNNNNSINVR 185

Query: 807  DLFCFDEGNTKIYNMILRGWFKMEWWRKCREFWEEMDKRGVKKDLYSYSIYMDIQCKSKK 986
            ++       TKIYNMILRGWFKM WW KCREFWEEMD++ V KDL+SYSIYMDI CKS K
Sbjct: 186  NI-------TKIYNMILRGWFKMGWWGKCREFWEEMDRKEVCKDLHSYSIYMDILCKSGK 238

Query: 987  PWKAVKLYKEMKKKGIELDTVAYNTVIRAIGISEGVDVAVRLYKEMAELGCQPNVVTFNT 1166
            PWKAVKLYKEMK KGI+LD VAYNTVI AIG+SEGVD  +R+Y+EM ELGCQPNVVT NT
Sbjct: 239  PWKAVKLYKEMKSKGIKLDVVAYNTVINAIGLSEGVDFVLRVYREMRELGCQPNVVTCNT 298

Query: 1167 ILKLLCENGRYIEAHKVLDLMSKKGVEPNVVTYHCFFVCLEKPREILKYFDRMMDSGIRP 1346
            ++KLLCENGR  EA+K+LD M +  + P+V TYHCFF CLEKP+EIL  FD+M+++G+ P
Sbjct: 299  VIKLLCENGRIKEAYKMLDEMPQSYIAPDVFTYHCFFRCLEKPKEILCLFDQMIENGVCP 358

Query: 1347 KMDTYVMLMRKFGRWGFLRPVLMLWKKMEEHGLSPDEFAYNALIDVLLQKGLVDMARKYD 1526
            +MDTYVMLMRKFGRWGFLRPV ++WKKME+ G SPDEFAYNALID L+QKG+VDMARKYD
Sbjct: 359  RMDTYVMLMRKFGRWGFLRPVFLVWKKMEKLGCSPDEFAYNALIDALIQKGMVDMARKYD 418

Query: 1527 DEMLLKGLSAKPR 1565
            +EM+ KGLSAKPR
Sbjct: 419  EEMMAKGLSAKPR 431


>gb|EXB67206.1| hypothetical protein L484_025684 [Morus notabilis]
          Length = 442

 Score =  553 bits (1425), Expect = e-155
 Identities = 266/423 (62%), Positives = 329/423 (77%), Gaps = 5/423 (1%)
 Frame = +3

Query: 312  PPTQT----HRSTIFAVNPA-PTLIERSDFEPATVLETLNSYANDWKLALEFFDWVEIEC 476
            P TQT      +  F+ NP  P   +   F   TV ETL SY NDW+ A EFF WVE  C
Sbjct: 17   PRTQTLLVSSATQSFSTNPQKPPQSDSLQFNSDTVTETLTSYCNDWQRAFEFFTWVETNC 76

Query: 477  GFEHTAHTLNRIVDILGKFFEFEMAWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKE 656
             F HT  T NR++DILGKFFEF+++W LI R+ +N   +P H TFRV+F RY +AHLVKE
Sbjct: 77   RFLHTTDTYNRMLDILGKFFEFDLSWDLIHRMNQNPVSVPSHATFRVMFHRYAAAHLVKE 136

Query: 657  AIDAFVRSNDYNLRDEISFSNLIDALCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNT 836
            A++A+ RS ++ L+DE ++SNLIDALC+ KHVIEA++L F   W  +         E +T
Sbjct: 137  AVEAYNRSEEFGLKDETTYSNLIDALCDQKHVIEAQDLCF---WNGKE-----LGFEKST 188

Query: 837  KIYNMILRGWFKMEWWRKCREFWEEMDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKE 1016
            KIYNMILRGW ++ WW KC +FWEEMD+RG++KDL++YSIYMDI CKS KPWKAVKLYKE
Sbjct: 189  KIYNMILRGWSRVGWWSKCGDFWEEMDRRGLEKDLHTYSIYMDILCKSGKPWKAVKLYKE 248

Query: 1017 MKKKGIELDTVAYNTVIRAIGISEGVDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGR 1196
            MKKK I+LD VAYNT++RA+G+SEGVD ++R+ +EM ELGCQPNVVT+NT++KLLCENGR
Sbjct: 249  MKKKRIKLDVVAYNTIVRAVGLSEGVDFSMRVLREMRELGCQPNVVTYNTLIKLLCENGR 308

Query: 1197 YIEAHKVLDLMSKKGVEPNVVTYHCFFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMR 1376
            Y EA KVLD M + G  P+V+TYHCFF  +EKP+EIL+ FDRM+DSGIRP+ DTYVMLMR
Sbjct: 309  YREASKVLDKMPEWGCSPDVITYHCFFGSMEKPKEILRLFDRMIDSGIRPRTDTYVMLMR 368

Query: 1377 KFGRWGFLRPVLMLWKKMEEHGLSPDEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSA 1556
            KFGRWGFLRPVL++WKKMEE G SP++ AYNALID L+ KG++DMARKYD+EML KGLS 
Sbjct: 369  KFGRWGFLRPVLVVWKKMEELGCSPNDAAYNALIDALIDKGMLDMARKYDEEMLAKGLSP 428

Query: 1557 KPR 1565
            KPR
Sbjct: 429  KPR 431


>ref|XP_004136096.1| PREDICTED: uncharacterized protein LOC101205322 [Cucumis sativus]
          Length = 1559

 Score =  531 bits (1367), Expect = e-148
 Identities = 254/393 (64%), Positives = 310/393 (78%)
 Frame = +3

Query: 387  EPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKFFEFEMAWVLIE 566
            +P T+LE   +       + EFF+WVE EC F+HT  T NR++DILGKFFEF+++WVLI 
Sbjct: 598  KPVTILEP-GTLHRRLPRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLIN 656

Query: 567  RLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFSNLIDALCEYK 746
            R+R++    PDH TFR+LFKRY  AHLV EAI A+ R  ++ LRDE SF NLIDALCE +
Sbjct: 657  RMRQSPSASPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESR 716

Query: 747  HVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKCREFWEEMDKRG 926
            HV EA+EL F KN        L C  + +TKI+N+ILRGW KM WW KCR+FWEEMDK+G
Sbjct: 717  HVDEAQELCFGKN------RKLDC--DSSTKIHNLILRGWLKMGWWSKCRDFWEEMDKKG 768

Query: 927  VKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRAIGISEGVDVAV 1106
            V+KDL+SYSIYMDIQCKS KPWKAVKLYKEMKKKG++LD VAYNTVI A+GISEGVD A 
Sbjct: 769  VRKDLHSYSIYMDIQCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIHAVGISEGVDFAS 828

Query: 1107 RLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPNVVTYHCFFVCL 1286
            R++ EM E+GC+PNVVT NT++KL CENGR+ +AH +LD M K+  +PNV+TYHCFF  L
Sbjct: 829  RVFHEMKEMGCKPNVVTCNTVIKLFCENGRFKDAHMMLDQMLKRDCQPNVITYHCFFRSL 888

Query: 1287 EKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKMEEHGLSPDEFAY 1466
            EKP+EIL  FDRM+  G+ PKMDTYVML+RKFGRWGFLRPV ++W KMEE G SP+E AY
Sbjct: 889  EKPKEILVLFDRMIKYGVHPKMDTYVMLLRKFGRWGFLRPVFLVWNKMEELGCSPNECAY 948

Query: 1467 NALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            NALID L++KG++DMARKYD+EM+ KGLS K R
Sbjct: 949  NALIDALVEKGMIDMARKYDEEMVAKGLSPKLR 981


>ref|XP_004297847.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 465

 Score =  528 bits (1360), Expect = e-147
 Identities = 257/455 (56%), Positives = 332/455 (72%)
 Frame = +3

Query: 201  FHHFRSHRSILLSKFKISLFYHTSPRSNPQFPPRTTSPPTQTHRSTIFAVNPAPTLIERS 380
            FH +    S    + +  L  HT        P +  S P  T         P P+     
Sbjct: 13   FHFYTMLSSRTAQRLRPFLLPHTQTLVFSSLPTKPISIPNPTDPY------PQPS---SP 63

Query: 381  DFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKFFEFEMAWVL 560
             ++  TV ETL+SY NDWK AL+FF WVE +  F+HT  T NR++DILGK+FEFE+ W L
Sbjct: 64   IYDHTTVRETLSSYCNDWKKALDFFIWVESQPHFQHTTETYNRLLDILGKYFEFELCWDL 123

Query: 561  IERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFSNLIDALCE 740
            + ++++N  C+PDHTTFR++FKRYVSAHLVKEAID + + +++ L+DE S+ NL+DALCE
Sbjct: 124  VHKMKQNPLCVPDHTTFRIMFKRYVSAHLVKEAIDTYNKLDEFGLKDETSYCNLVDALCE 183

Query: 741  YKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKCREFWEEMDK 920
            +KHVIEA+EL   KN    +D         +TK++N+ILRGW KM WW KCR+FWEEMD+
Sbjct: 184  HKHVIEAQELCSWKNKELGFDR--------STKLHNIILRGWSKMGWWGKCRDFWEEMDR 235

Query: 921  RGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRAIGISEGVDV 1100
            RGV KDL+SYSIYMDI CKS K WKAVKLYKE+K+K I+LD VAYNTVI A+G SEGVD 
Sbjct: 236  RGVCKDLHSYSIYMDIMCKSGKAWKAVKLYKEVKRKRIKLDVVAYNTVIGAVGASEGVDF 295

Query: 1101 AVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPNVVTYHCFFV 1280
            A+R+ +EM ELGC PN+VT+NTI+KLLCEN R  EA  +L +MSK    P+V+TY   F 
Sbjct: 296  AIRILREMKELGCDPNIVTYNTIIKLLCENMRVREAFSMLRVMSKNSCGPDVITYQIIFK 355

Query: 1281 CLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKMEEHGLSPDEF 1460
             LEKP EIL+ FDRM++SG++P+MDTYVM+MRKFGRWGFLRP+ ++W+KME+ G SP+E 
Sbjct: 356  YLEKPNEILRLFDRMIESGVQPRMDTYVMIMRKFGRWGFLRPMFIVWQKMEKLGCSPNES 415

Query: 1461 AYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            AYNALID L++KG++DMARKYD+EM+ KGL  +PR
Sbjct: 416  AYNALIDALVEKGMLDMARKYDEEMIAKGLPTRPR 450


>ref|XP_007023349.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 3
            [Theobroma cacao] gi|508778715|gb|EOY25971.1|
            Pentatricopeptide repeat (PPR) superfamily protein
            isoform 3 [Theobroma cacao]
          Length = 360

 Score =  524 bits (1350), Expect = e-146
 Identities = 249/352 (70%), Positives = 297/352 (84%)
 Frame = +3

Query: 510  IVDILGKFFEFEMAWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDY 689
            ++DILGK FEF+++W LI+R++   C +PDH TFR+LFKRY++AHLVKEAI  F R  ++
Sbjct: 1    MLDILGKSFEFDLSWDLIDRMKNKPCSIPDHATFRILFKRYITAHLVKEAISTFDRLEEF 60

Query: 690  NLRDEISFSNLIDALCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWF 869
            NL+DEISF NL+DALCEYKHVIEA+EL F   +G   +  L   D   TKI+NMILRGWF
Sbjct: 61   NLKDEISFCNLVDALCEYKHVIEAQELCF---FGKIKEIGLSVND---TKIHNMILRGWF 114

Query: 870  KMEWWRKCREFWEEMDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTV 1049
            KM WW KCREFW+EMDK+GVKKDL+SYSIYMDI CKS KPWKAVKLYKEMKKKG++LD V
Sbjct: 115  KMGWWSKCREFWQEMDKKGVKKDLHSYSIYMDIMCKSGKPWKAVKLYKEMKKKGMKLDVV 174

Query: 1050 AYNTVIRAIGISEGVDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLM 1229
            AYNTVIRAIGISEG D  V +++EM +LGC+PNVVT+NT++KLLCENGR  +A+ VLD M
Sbjct: 175  AYNTVIRAIGISEGADFGVGVFREMRDLGCEPNVVTYNTVIKLLCENGRVRQAYAVLDQM 234

Query: 1230 SKKGVEPNVVTYHCFFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPV 1409
             KK   P+V+TYHCFF CLEKPREILK FD M+ +GI+P+MDTYVMLMRKFGRWGFLRPV
Sbjct: 235  LKKDCAPDVITYHCFFGCLEKPREILKLFDLMITNGIQPRMDTYVMLMRKFGRWGFLRPV 294

Query: 1410 LMLWKKMEEHGLSPDEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
             M+WKKMEE G SP+EFAYNALID L+QKG++DMARKYD+EML KGLS+KPR
Sbjct: 295  FMVWKKMEELGSSPNEFAYNALIDALIQKGMLDMARKYDEEMLEKGLSSKPR 346


>ref|NP_178170.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75264854|sp|Q9M8M3.1|PP136_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g80550, mitochondrial; Flags: Precursor
            gi|6730729|gb|AAF27119.1|AC018849_7 unknown protein;
            31926-33272 [Arabidopsis thaliana]
            gi|332198297|gb|AEE36418.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 448

 Score =  518 bits (1333), Expect = e-144
 Identities = 242/398 (60%), Positives = 312/398 (78%)
 Frame = +3

Query: 372  ERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKFFEFEMA 551
            ++S ++  TV E L  Y+NDW+ ALEFF+WVE E GF HT  T NR++DILGK+FEFE++
Sbjct: 41   DQSSYDQKTVCEALTCYSNDWQKALEFFNWVERESGFRHTTETFNRVIDILGKYFEFEIS 100

Query: 552  WVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFSNLIDA 731
            W LI R+  N   +P+H TFR++FKRYV+AHLV+EAIDA+ + +D+NLRDE SF NL+DA
Sbjct: 101  WALINRMIGNTESVPNHVTFRIVFKRYVTAHLVQEAIDAYDKLDDFNLRDETSFYNLVDA 160

Query: 732  LCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKCREFWEE 911
            LCE+KHV+EAEEL F KN           F   NTKI+N+ILRGW K+ WW KC+E+W++
Sbjct: 161  LCEHKHVVEAEELCFGKNVIGNG------FSVSNTKIHNLILRGWSKLGWWGKCKEYWKK 214

Query: 912  MDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRAIGISEG 1091
            MD  GV KDL+SYSIYMDI CKS KPWKAVKLYKEMK + ++LD VAYNTVIRAIG S+G
Sbjct: 215  MDTEGVTKDLFSYSIYMDIMCKSGKPWKAVKLYKEMKSRRMKLDVVAYNTVIRAIGASQG 274

Query: 1092 VDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPNVVTYHC 1271
            V+  +R+++EM E GC+PNV T NTI+KLLCE+GR  +A+++LD M K+G +P+ +TY C
Sbjct: 275  VEFGIRVFREMRERGCEPNVATHNTIIKLLCEDGRMRDAYRMLDEMPKRGCQPDSITYMC 334

Query: 1272 FFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKMEEHGLSP 1451
             F  LEKP EIL  F RM+ SG+RPKMDTYVMLMRKF RWGFL+PVL +WK M+E G +P
Sbjct: 335  LFSRLEKPSEILSLFGRMIRSGVRPKMDTYVMLMRKFERWGFLQPVLYVWKTMKESGDTP 394

Query: 1452 DEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            D  AYNA+ID L+QKG++DMAR+Y++EM+ +GLS + R
Sbjct: 395  DSAAYNAVIDALIQKGMLDMAREYEEEMIERGLSPRRR 432


>ref|XP_002887822.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297333663|gb|EFH64081.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 407

 Score =  516 bits (1330), Expect = e-144
 Identities = 243/398 (61%), Positives = 310/398 (77%)
 Frame = +3

Query: 372  ERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKFFEFEMA 551
            ++S ++  TV E L+ Y NDW+ ALEFF+WVE E GF HT  T NR++DILGK+FEFE  
Sbjct: 1    DQSSYDQKTVCEALSCYINDWQKALEFFNWVEKESGFRHTTETFNRMIDILGKYFEFETC 60

Query: 552  WVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFSNLIDA 731
            W LI R+  N   LP+H TFR++FKRYV+AHLV+EAIDA+ + +D+NLRD+ SF NL+DA
Sbjct: 61   WALINRMIGNPESLPNHVTFRIVFKRYVTAHLVQEAIDAYDKLDDFNLRDDTSFYNLVDA 120

Query: 732  LCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKCREFWEE 911
            LCE+KHV+EAEEL F KN           F   NTKI+N+ILRGW K+ WW KC+E+W++
Sbjct: 121  LCEHKHVVEAEELCFGKNV------IAHGFSVSNTKIHNLILRGWSKLGWWGKCKEYWDK 174

Query: 912  MDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRAIGISEG 1091
            MD  GV KDL+SYSIYMDI CKS KPWKAVKLYKEMK + I+LD VAYNTVIRAIG S+G
Sbjct: 175  MDTEGVPKDLFSYSIYMDIMCKSGKPWKAVKLYKEMKSRRIKLDVVAYNTVIRAIGASQG 234

Query: 1092 VDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPNVVTYHC 1271
            V+  +R+++EM E GC+PNV T NTI+KLLCE+GR  +A+++LD M KKG +P+ ++Y C
Sbjct: 235  VEFGIRVFREMRERGCEPNVATHNTIIKLLCEDGRMRDAYRMLDEMPKKGCQPDSISYMC 294

Query: 1272 FFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKMEEHGLSP 1451
             F  LEKP EIL  F RM+ SG+RPKMDTYVMLMRKF RWGFL+PVL +WK M+E G +P
Sbjct: 295  LFSRLEKPSEILSLFGRMIRSGVRPKMDTYVMLMRKFERWGFLQPVLYVWKTMKESGDTP 354

Query: 1452 DEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            D  AYNA+ID L+QKG++DMAR+Y++EM+ +GLS + R
Sbjct: 355  DSAAYNAVIDALIQKGMLDMAREYEEEMIERGLSPRRR 392


>ref|XP_006302258.1| hypothetical protein CARUB_v10020295mg [Capsella rubella]
            gi|482570968|gb|EOA35156.1| hypothetical protein
            CARUB_v10020295mg [Capsella rubella]
          Length = 447

 Score =  514 bits (1325), Expect = e-143
 Identities = 242/398 (60%), Positives = 310/398 (77%)
 Frame = +3

Query: 372  ERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKFFEFEMA 551
            ++S ++   V E L+ Y+NDW+ ALEFF+WVE E GF HT  T NR++DILGK+FEF+ +
Sbjct: 40   DQSSYDQKAVCEALSCYSNDWQKALEFFNWVEKESGFRHTTETFNRMIDILGKYFEFDAS 99

Query: 552  WVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFSNLIDA 731
            W LI R+      LP+H TFR++FKRYV+AHLV+EAIDA+ + +D+NLRDE SF NL+DA
Sbjct: 100  WGLINRMIGMPQSLPNHVTFRIVFKRYVTAHLVQEAIDAYDKLDDFNLRDETSFYNLVDA 159

Query: 732  LCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKCREFWEE 911
            LCE+KHV+EAEEL F KN           F   NTKI+N+ILRGW K+ WW KC+EFWE+
Sbjct: 160  LCEHKHVVEAEELCFGKNVIAN------AFSLSNTKIHNLILRGWSKLGWWGKCKEFWEK 213

Query: 912  MDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRAIGISEG 1091
            MD  GV KDL+SYSIYMDI CKS KPWKAV+LYKEM+ +GI+LD VAYNTVIRAIG S+G
Sbjct: 214  MDTEGVAKDLFSYSIYMDIMCKSGKPWKAVRLYKEMRSRGIKLDVVAYNTVIRAIGASQG 273

Query: 1092 VDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPNVVTYHC 1271
            V+  +R+++EM + GC+PNV T NTI+KLLCENGR  +A+++LD M KKG + + +TY C
Sbjct: 274  VEFGIRVFREMRDRGCEPNVATHNTIIKLLCENGRMRDAYQMLDEMPKKGCQADSITYMC 333

Query: 1272 FFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKMEEHGLSP 1451
             F  LEKP EIL  F RM+ SG+RPKMDTYVMLMRKF RWGFL+PVL +WK M+E G +P
Sbjct: 334  LFARLEKPSEILSLFGRMIRSGVRPKMDTYVMLMRKFERWGFLQPVLYVWKTMKESGDTP 393

Query: 1452 DEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            D  AYNA+ID L+QKG++DMAR+Y++EM+ +GLS + R
Sbjct: 394  DAAAYNAVIDALIQKGMLDMAREYEEEMIERGLSPRRR 431


>ref|XP_003597983.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355487031|gb|AES68234.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 520

 Score =  514 bits (1324), Expect = e-143
 Identities = 249/429 (58%), Positives = 319/429 (74%), Gaps = 12/429 (2%)
 Frame = +3

Query: 315  PTQTHRST---IFAVNPAPTLIERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFE 485
            P  TH S    I   NP P        +  TV  TL S+ ND+K ALEFF+WVE +  F+
Sbjct: 48   PFHTHSSFHTFITQQNPNPNPSPIPFVDHTTVRATLTSFNNDYKRALEFFNWVETKFKFQ 107

Query: 486  HTAHTLNRIVDILGKFFEFEMAWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAID 665
            H+  T N ++DILGKFFEF+  W LI R+R+N   LP+HTTFRV+FKRYVSAH V++A++
Sbjct: 108  HSTETYNLVLDILGKFFEFQQCWNLIHRMRQNPHSLPNHTTFRVMFKRYVSAHCVQDAVN 167

Query: 666  AFVRSNDYNLRDEISFSNLIDALCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEG----- 830
             F R N++NL+DE SFSNLIDALCEYKHV+EA++L F    GD+ +  L    +G     
Sbjct: 168  TFQRLNEFNLKDETSFSNLIDALCEYKHVLEAQDLVF----GDKKNQTLTWIVDGVDGFV 223

Query: 831  ----NTKIYNMILRGWFKMEWWRKCREFWEEMDKRGVKKDLYSYSIYMDIQCKSKKPWKA 998
                NTKI+N++LRGW+K+ WW KC EFW+EMD+RGV+KDL+SYSIYMDI  K  KPWKA
Sbjct: 224  ASSKNTKIFNIVLRGWYKLGWWSKCWEFWDEMDRRGVEKDLHSYSIYMDILSKGGKPWKA 283

Query: 999  VKLYKEMKKKGIELDTVAYNTVIRAIGISEGVDVAVRLYKEMAELGCQPNVVTFNTILKL 1178
            VKL+KEMK+KGI+LD V YN VIRAIG+S+GVD ++R++ EM +LG  P VVT+NTI++L
Sbjct: 284  VKLFKEMKRKGIQLDVVVYNIVIRAIGVSQGVDFSIRMFCEMKDLGLNPTVVTYNTIIRL 343

Query: 1179 LCENGRYIEAHKVLDLMSKKGVEPNVVTYHCFFVCLEKPREILKYFDRMMDSGIRPKMDT 1358
            LC++ RY EA  ++  M + G  PN V+Y CFF CLEKP+ I++ FD M++SG+RP MDT
Sbjct: 344  LCDSYRYKEALTLIRTMRRDGCSPNAVSYQCFFACLEKPKFIIELFDGMIESGVRPTMDT 403

Query: 1359 YVMLMRKFGRWGFLRPVLMLWKKMEEHGLSPDEFAYNALIDVLLQKGLVDMARKYDDEML 1538
            YVML++KF RWGFLR V ++W +MEE G SPD  AYNALID L++KGL+DMARKYD+EML
Sbjct: 404  YVMLLKKFARWGFLRLVFLVWNRMEELGCSPDASAYNALIDALVEKGLIDMARKYDEEML 463

Query: 1539 LKGLSAKPR 1565
             KGLS KPR
Sbjct: 464  AKGLSPKPR 472


>ref|XP_007150807.1| hypothetical protein PHAVU_005G182300g [Phaseolus vulgaris]
            gi|593700758|ref|XP_007150808.1| hypothetical protein
            PHAVU_005G182300g [Phaseolus vulgaris]
            gi|561024071|gb|ESW22801.1| hypothetical protein
            PHAVU_005G182300g [Phaseolus vulgaris]
            gi|561024072|gb|ESW22802.1| hypothetical protein
            PHAVU_005G182300g [Phaseolus vulgaris]
          Length = 464

 Score =  513 bits (1320), Expect = e-142
 Identities = 254/453 (56%), Positives = 318/453 (70%), Gaps = 3/453 (0%)
 Frame = +3

Query: 216  SHRSILLSKFKISLFYHTSPRSNPQFPPRTTSPPTQTHRSTIFAVNPAPTLIERSDFEPA 395
            S RS L   F+      T+   +P  PP    PP            P P        + A
Sbjct: 8    SSRSNLPQPFQFQTLSTTTVTESPLPPPPPPPPP------------PPP--------DEA 47

Query: 396  TVLETLNSYANDWKLALEFFDWVE---IECGFEHTAHTLNRIVDILGKFFEFEMAWVLIE 566
             V +TL S+ NDWK A+EFFDWVE     C F H+  T N ++DIL KFFEF++ W LI 
Sbjct: 48   DVRQTLLSFNNDWKRAMEFFDWVEESHSHCNFRHSTDTFNLMLDILAKFFEFDLCWHLIR 107

Query: 567  RLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFSNLIDALCEYK 746
            R+       P+HTTFRVLFKRYVSAHLV++AI AF R  ++NL D  SFS+LIDALCEYK
Sbjct: 108  RMHSRASSPPNHTTFRVLFKRYVSAHLVQDAIHAFHRLGEFNLNDHTSFSHLIDALCEYK 167

Query: 747  HVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKCREFWEEMDKRG 926
            HVIEA++L F K   D   D +     GNTKI+NM+LRGWFK+ WW KC EFWEEMD++G
Sbjct: 168  HVIEAQDLVFSK---DAPVDAI-----GNTKIHNMVLRGWFKLGWWSKCNEFWEEMDRKG 219

Query: 927  VKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRAIGISEGVDVAV 1106
            V+KDL+SYSIYMDI CK  KPWKAVKL+KE+K+KG +LD V YN +IRAIG+SEGVD ++
Sbjct: 220  VQKDLHSYSIYMDILCKGGKPWKAVKLFKEVKRKGFQLDVVVYNILIRAIGLSEGVDFSI 279

Query: 1107 RLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPNVVTYHCFFVCL 1286
             +++EM +LG  P VVT+NT+++LLC+  R+ EA  +L  M++ G  P  ++YHCFF  L
Sbjct: 280  GVFREMKDLGINPTVVTYNTLIRLLCDCYRHKEALALLQTMARDGCHPTAISYHCFFASL 339

Query: 1287 EKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKMEEHGLSPDEFAY 1466
            EKP+EIL  FD M++SG+RP MDTYVML+ KFGRWGFLRPV M+W +ME+ G SPD  AY
Sbjct: 340  EKPKEILVMFDNMIESGVRPSMDTYVMLLNKFGRWGFLRPVFMVWNRMEQLGCSPDAAAY 399

Query: 1467 NALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            NALID L+ KGL++MARKYD+EML KGLS KPR
Sbjct: 400  NALIDALVDKGLIEMARKYDEEMLAKGLSPKPR 432


>ref|XP_006389809.1| hypothetical protein EUTSA_v10018541mg [Eutrema salsugineum]
            gi|557086243|gb|ESQ27095.1| hypothetical protein
            EUTSA_v10018541mg [Eutrema salsugineum]
          Length = 448

 Score =  510 bits (1313), Expect = e-142
 Identities = 238/399 (59%), Positives = 308/399 (77%)
 Frame = +3

Query: 369  IERSDFEPATVLETLNSYANDWKLALEFFDWVEIECGFEHTAHTLNRIVDILGKFFEFEM 548
            +++S ++  TV E L  Y NDW+ ALEFF+WV+ E GF HT  T NR++DILGK+FEF+ 
Sbjct: 41   MDQSSYDQKTVCEALTCYGNDWQKALEFFNWVDKESGFSHTTDTFNRMIDILGKYFEFQT 100

Query: 549  AWVLIERLRKNGCCLPDHTTFRVLFKRYVSAHLVKEAIDAFVRSNDYNLRDEISFSNLID 728
             WVLI R+ +N   +P+H TFR++FKRY  AHLV+EA+D + + +D+NLRDE SF NL+D
Sbjct: 101  CWVLINRMAENPLSVPNHVTFRIIFKRYAMAHLVQEALDTYDKLDDFNLRDETSFYNLVD 160

Query: 729  ALCEYKHVIEAEELFFKKNWGDQYDDDLFCFDEGNTKIYNMILRGWFKMEWWRKCREFWE 908
            +LCE+KHV+EAEEL F KN           F   NTKI+N+ILRGW K+ WW KC+E+WE
Sbjct: 161  SLCEHKHVVEAEELCFGKNVIGNG------FSVSNTKIHNLILRGWSKLGWWGKCKEYWE 214

Query: 909  EMDKRGVKKDLYSYSIYMDIQCKSKKPWKAVKLYKEMKKKGIELDTVAYNTVIRAIGISE 1088
            +MD  GV KDL+SYSIYMDI CKS KPWKAVKLYKEMK K ++LD VAYNTVIRAIG S+
Sbjct: 215  KMDTEGVAKDLFSYSIYMDIMCKSGKPWKAVKLYKEMKSKRMKLDVVAYNTVIRAIGASQ 274

Query: 1089 GVDVAVRLYKEMAELGCQPNVVTFNTILKLLCENGRYIEAHKVLDLMSKKGVEPNVVTYH 1268
            GV+  +R+++EM E GC+PNV T NTI+KLLCE+GR  +A+ +L+ M KKG +P+ VTY 
Sbjct: 275  GVEFGMRMFREMRERGCEPNVATHNTIIKLLCEDGRMKDAYGMLNEMPKKGCQPDSVTYM 334

Query: 1269 CFFVCLEKPREILKYFDRMMDSGIRPKMDTYVMLMRKFGRWGFLRPVLMLWKKMEEHGLS 1448
            C F  LEKP EIL  F +M+ SG+RP+MDTYVML+RKF RWGFL+PVL +WK M+E G +
Sbjct: 335  CLFARLEKPSEILSLFGKMIRSGVRPRMDTYVMLIRKFERWGFLQPVLHVWKTMKESGDT 394

Query: 1449 PDEFAYNALIDVLLQKGLVDMARKYDDEMLLKGLSAKPR 1565
            PD  AYNA+ID L+QKG++DMAR+Y+DEM+ +GLS + R
Sbjct: 395  PDSAAYNAVIDALVQKGMLDMAREYEDEMVERGLSPRRR 433


Top