BLASTX nr result

ID: Akebia25_contig00024320 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00024320
         (1662 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274321.2| PREDICTED: pentatricopeptide repeat-containi...   901   0.0  
ref|XP_006837119.1| hypothetical protein AMTR_s00110p00121530 [A...   876   0.0  
ref|XP_007045889.1| Tetratricopeptide repeat (TPR)-like superfam...   860   0.0  
ref|XP_002512090.1| pentatricopeptide repeat-containing protein,...   843   0.0  
ref|XP_006484190.1| PREDICTED: pentatricopeptide repeat-containi...   832   0.0  
ref|XP_006437942.1| hypothetical protein CICLE_v10031249mg [Citr...   829   0.0  
ref|XP_004144175.1| PREDICTED: pentatricopeptide repeat-containi...   827   0.0  
gb|EXB44833.1| hypothetical protein L484_026413 [Morus notabilis]     820   0.0  
ref|XP_006303169.1| hypothetical protein CARUB_v10008572mg [Caps...   813   0.0  
ref|XP_002893403.1| pentatricopeptide repeat-containing protein ...   809   0.0  
ref|NP_564247.1| pentatricopeptide repeat-containing protein [Ar...   806   0.0  
ref|XP_006415925.1| hypothetical protein EUTSA_v10007071mg [Eutr...   805   0.0  
gb|AAM61409.1| unknown [Arabidopsis thaliana]                         803   0.0  
gb|EYU22373.1| hypothetical protein MIMGU_mgv1a001842mg [Mimulus...   799   0.0  
ref|XP_004297246.1| PREDICTED: pentatricopeptide repeat-containi...   787   0.0  
ref|XP_002315005.2| pentatricopeptide repeat-containing family p...   780   0.0  
ref|XP_003516357.1| PREDICTED: pentatricopeptide repeat-containi...   777   0.0  
ref|XP_007222915.1| hypothetical protein PRUPE_ppa003665mg [Prun...   773   0.0  
ref|XP_003518766.1| PREDICTED: pentatricopeptide repeat-containi...   769   0.0  
ref|XP_004490167.1| PREDICTED: pentatricopeptide repeat-containi...   763   0.0  

>ref|XP_002274321.2| PREDICTED: pentatricopeptide repeat-containing protein At1g26460,
            mitochondrial-like [Vitis vinifera]
            gi|297738080|emb|CBI27281.3| unnamed protein product
            [Vitis vinifera]
          Length = 616

 Score =  901 bits (2329), Expect = 0.0
 Identities = 447/551 (81%), Positives = 494/551 (89%), Gaps = 1/551 (0%)
 Frame = +1

Query: 4    ENWRNPIPNPSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADMKQ 183
            ENWR+PIP    AQS++PLGF+ Q+ S R+ ALSQTLD P+LMNVFADWMTSQRWADM Q
Sbjct: 68   ENWRSPIPQ---AQSIIPLGFLHQSSSSRLQALSQTLDVPSLMNVFADWMTSQRWADMNQ 124

Query: 184  LFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTASFN 363
            LFEFWI+ LD+NGK NKPDVNLYNHYLRA LM GASAGELLDLVA MEDY I PNTASFN
Sbjct: 125  LFEFWIRSLDKNGKPNKPDVNLYNHYLRAKLMIGASAGELLDLVAQMEDYAIIPNTASFN 184

Query: 364  LVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDLTL 543
            LVLKAMY  +ETEAAEKL +RMLQTGKESMPDDESY+LV+GMLFL +QIDAAL+Y+DLTL
Sbjct: 185  LVLKAMYQARETEAAEKLFQRMLQTGKESMPDDESYELVVGMLFLTDQIDAALRYVDLTL 244

Query: 544  KSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVALQE 723
            KSGYMLSMRVF +CVRSCVN GRLD L SIIERCKTMDQNKAL PSWN+C +IAD+A+QE
Sbjct: 245  KSGYMLSMRVFAECVRSCVNKGRLDALVSIIERCKTMDQNKALSPSWNMCIFIADIAMQE 304

Query: 724  DNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAILR 903
            DN KLAFYALEF+ARWIARGE +R PVLLSVDEGLVVSALGTAGRTY+ TLLDASWAILR
Sbjct: 305  DNSKLAFYALEFMARWIARGENARGPVLLSVDEGLVVSALGTAGRTYSSTLLDASWAILR 364

Query: 904  RSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEA-EDLFSPFTSLY 1080
            RSLRQK+AP PESYL KIYA A+LGNLQRAFSTL+E E+AY  S  EA ED+FSPFTSL+
Sbjct: 365  RSLRQKKAPQPESYLAKIYADAALGNLQRAFSTLHEFETAYRNSPKEAEEDIFSPFTSLH 424

Query: 1081 PLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFEAI 1260
            PLVMA SKKGFETLD+VYFQLENLS+ADPPYKSVAALNC+ILGCAN WDLDRAYQTFEAI
Sbjct: 425  PLVMASSKKGFETLDTVYFQLENLSRADPPYKSVAALNCVILGCANIWDLDRAYQTFEAI 484

Query: 1261 SGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLINRD 1440
              +FGLTP+IHSYNAL+ AFGKLKKTFEASRVFEHL SLG+KPN +S+SLLVDAHLINRD
Sbjct: 485  GSSFGLTPDIHSYNALMYAFGKLKKTFEASRVFEHLTSLGIKPNAMSFSLLVDAHLINRD 544

Query: 1441 AKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENRRD 1620
             K ALSVI+EM+NAGF PSKE LKKVRRRCIRE DYE++D+VE L +KF+IRMG ENRRD
Sbjct: 545  QKAALSVIEEMINAGFAPSKEILKKVRRRCIREMDYESNDQVESLARKFKIRMGTENRRD 604

Query: 1621 MLFNLNYSTNY 1653
            MLFNL Y+T Y
Sbjct: 605  MLFNLAYNTEY 615


>ref|XP_006837119.1| hypothetical protein AMTR_s00110p00121530 [Amborella trichopoda]
            gi|548839712|gb|ERM99972.1| hypothetical protein
            AMTR_s00110p00121530 [Amborella trichopoda]
          Length = 617

 Score =  876 bits (2263), Expect = 0.0
 Identities = 434/554 (78%), Positives = 481/554 (86%), Gaps = 2/554 (0%)
 Frame = +1

Query: 1    QENWRNP--IPNPSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWAD 174
            QENWRNP  I    ++QSLVPLGF+ Q PS RI A SQTLD P+L+N FADWMTSQRW+D
Sbjct: 64   QENWRNPGSIQETVISQSLVPLGFLHQTPSTRIQAFSQTLDLPSLLNAFADWMTSQRWSD 123

Query: 175  MKQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTA 354
            +KQLFEFWI+ LD NGK NKPDVNLYNHYLRANLM G SAGELLDLVA MEDY ISPNTA
Sbjct: 124  LKQLFEFWIRSLDTNGKPNKPDVNLYNHYLRANLMIGGSAGELLDLVAQMEDYGISPNTA 183

Query: 355  SFNLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLD 534
            S+NLVLKAMY  +E+EAAEKL++RM+Q GK++ PD+ESYDLVIG+ FLVN+ID ALKYLD
Sbjct: 184  SYNLVLKAMYQARESEAAEKLLDRMIQAGKDASPDEESYDLVIGLQFLVNKIDDALKYLD 243

Query: 535  LTLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVA 714
            LTLKSGYMLSMRVFTDCVRSCV AG LDTL SIIERCK MDQNKALCPSWNLC Y+ADVA
Sbjct: 244  LTLKSGYMLSMRVFTDCVRSCVKAGTLDTLTSIIERCKRMDQNKALCPSWNLCTYLADVA 303

Query: 715  LQEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWA 894
            LQ DNGKLAF++LEF ARWIARGE  RPPVLLSVDEGLVVSALGTA RT N  LLDASWA
Sbjct: 304  LQADNGKLAFHSLEFFARWIARGENVRPPVLLSVDEGLVVSALGTAARTCNSNLLDASWA 363

Query: 895  ILRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEAEDLFSPFTS 1074
            ILRRSLRQKRAPNPESYLGKIYA++ LG LQRAF+TLNE E+AY   T   E+LFSPFTS
Sbjct: 364  ILRRSLRQKRAPNPESYLGKIYAYSMLGTLQRAFATLNEFETAYANPTAVDEELFSPFTS 423

Query: 1075 LYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFE 1254
            L PLV+ACSK GF TLDSVYFQLENLS+ADPPYKSVAA+NC+ILGCAN WDLDRAYQTFE
Sbjct: 424  LNPLVVACSKNGFATLDSVYFQLENLSRADPPYKSVAAVNCVILGCANIWDLDRAYQTFE 483

Query: 1255 AISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLIN 1434
            AIS TFGLTP+IHSYNAL+ AFGK+KKTFEASRVFEHL SLGV+PN  +YSLL+DAHLIN
Sbjct: 484  AISNTFGLTPDIHSYNALLFAFGKMKKTFEASRVFEHLTSLGVRPNETTYSLLIDAHLIN 543

Query: 1435 RDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENR 1614
            +D K ALS+IDEM+NAGF PSKE+LKKVRRR +REFD E++++V  L  K + RMG E R
Sbjct: 544  KDQKSALSIIDEMMNAGFAPSKESLKKVRRRSVREFDDESNEQVRILASKLKYRMGGEAR 603

Query: 1615 RDMLFNLNYSTNYP 1656
            RDMLF LNYST  P
Sbjct: 604  RDMLFGLNYSTELP 617


>ref|XP_007045889.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1
            [Theobroma cacao] gi|508709824|gb|EOY01721.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein
            isoform 1 [Theobroma cacao]
          Length = 616

 Score =  860 bits (2221), Expect = 0.0
 Identities = 423/553 (76%), Positives = 480/553 (86%), Gaps = 3/553 (0%)
 Frame = +1

Query: 4    ENWRNPIP---NPSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWAD 174
            ENWRNP     + SLAQSL+PLGF+ QAP  RI  LS+ LDAPALMN FA  MT QRWAD
Sbjct: 60   ENWRNPNAAQNSTSLAQSLIPLGFLVQAPGHRIQYLSENLDAPALMNHFAGLMTQQRWAD 119

Query: 175  MKQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTA 354
            +K+LFEFW++ LD+NGK NKPDVNLYNHYLRANLM GASAG+LLDLVA M+D+ I PNTA
Sbjct: 120  VKELFEFWVRSLDKNGKPNKPDVNLYNHYLRANLMIGASAGDLLDLVAQMDDFAIVPNTA 179

Query: 355  SFNLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLD 534
            SFN +LKAM   +ETEAA+KL+ERMLQ G ES+PDD+SYDLVIGMLF   QIDAALKY+D
Sbjct: 180  SFNFILKAMNQAKETEAAKKLLERMLQGGAESLPDDDSYDLVIGMLFEAEQIDAALKYVD 239

Query: 535  LTLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVA 714
            + LKSGY+LSMRVFT+CV SCV  GRLDTLA++IERCKT DQN+AL P+WNLCNY+A+VA
Sbjct: 240  MALKSGYLLSMRVFTECVGSCVRQGRLDTLATVIERCKTKDQNRALYPNWNLCNYLAEVA 299

Query: 715  LQEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWA 894
            +Q DN KLAFYALEF+A+WIARGE ++PP LLSVDEGL+VSAL TAGRTY+  LLDASWA
Sbjct: 300  MQADNSKLAFYALEFMAKWIARGEIAKPPFLLSVDEGLIVSALATAGRTYSSNLLDASWA 359

Query: 895  ILRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEAEDLFSPFTS 1074
            ILRRSLRQK+ P+PESYLGK+YAHASLGNLQ+AF TL+E E+A+  S  EAEDLFSPFTS
Sbjct: 360  ILRRSLRQKKVPSPESYLGKMYAHASLGNLQKAFGTLHEFEAAHRNSINEAEDLFSPFTS 419

Query: 1075 LYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFE 1254
            LYPLV+ACSKKGFETLDSVY+QLE LS ADPPYKSVAALNCIILGC N WD++RAYQTF+
Sbjct: 420  LYPLVVACSKKGFETLDSVYYQLEKLSSADPPYKSVAALNCIILGCGNIWDIERAYQTFD 479

Query: 1255 AISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLIN 1434
            AI  +FGLTP+IHSYNAL+ AFGKLKKTFEASRVFEH++SLGVKPN  SYSLLVDAHLIN
Sbjct: 480  AIGSSFGLTPDIHSYNALMYAFGKLKKTFEASRVFEHMLSLGVKPNAKSYSLLVDAHLIN 539

Query: 1435 RDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENR 1614
            RD K ALS IDEMV A F PSKET+KKVRRRCIRE DYE+DDRVE L KKF I+MG+ENR
Sbjct: 540  RDQKSALSAIDEMVTAEFVPSKETVKKVRRRCIREMDYESDDRVESLAKKFNIQMGSENR 599

Query: 1615 RDMLFNLNYSTNY 1653
            R MLFNL+Y T Y
Sbjct: 600  RGMLFNLDYGTEY 612


>ref|XP_002512090.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223549270|gb|EEF50759.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 619

 Score =  843 bits (2179), Expect = 0.0
 Identities = 415/553 (75%), Positives = 482/553 (87%), Gaps = 3/553 (0%)
 Frame = +1

Query: 4    ENWRNP--IPNPSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADM 177
            ENWRNP  I NP    +L+P G + Q P+ R  ++SQTLD  +L+N+FADWMTSQRW+DM
Sbjct: 68   ENWRNPTLIQNPD---ALIPFGILHQPPTARFQSMSQTLDLNSLLNLFADWMTSQRWSDM 124

Query: 178  KQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTAS 357
            KQLFEFWI+ LD+NGK NKPDVNL+NHYLRANLM+ A+A +LLDL+A MEDY + PNTAS
Sbjct: 125  KQLFEFWIRSLDQNGKPNKPDVNLFNHYLRANLMTNATAVDLLDLLAQMEDYAVLPNTAS 184

Query: 358  FNLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDL 537
            FNLVLKAM+  +ET AAEKL++RM  TG ES PDDESYDLVIGMLF  NQIDAALKY+D 
Sbjct: 185  FNLVLKAMFQARETAAAEKLLQRMELTGNESQPDDESYDLVIGMLFSTNQIDAALKYIDK 244

Query: 538  TLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVAL 717
            TLK+G+ LSMRVFT+CV+SCVN GRLDTL SIIE+CK +DQNKAL P+WN+C YIA+VA+
Sbjct: 245  TLKNGHTLSMRVFTECVKSCVNKGRLDTLVSIIEKCKKVDQNKALSPTWNMCYYIAEVAM 304

Query: 718  QEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAI 897
            QEDN KLA+YALEF+A+WIARGE +RP +LLSVDEGLVVSALGTAGRTY+ TLLDASWAI
Sbjct: 305  QEDNSKLAYYALEFMAKWIARGENARPAILLSVDEGLVVSALGTAGRTYSSTLLDASWAI 364

Query: 898  LRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEA-EDLFSPFTS 1074
            LRRSLR K+AP+PESYLG+IYA+ASLGNLQ+AF+TL E ESAY+ S  EA E+LFSPFTS
Sbjct: 365  LRRSLRDKKAPSPESYLGRIYAYASLGNLQKAFTTLREYESAYDSSEKEAEEELFSPFTS 424

Query: 1075 LYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFE 1254
            L PLV+ACSKKGFETLD+VYFQLENLS+A+ PYKSVAALNCIILGCAN WD+DRAYQTFE
Sbjct: 425  LNPLVVACSKKGFETLDTVYFQLENLSRAERPYKSVAALNCIILGCANIWDIDRAYQTFE 484

Query: 1255 AISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLIN 1434
            AI  +F LTPNIHSYNALI AFG+LKKTFEA+RVFEHL+SLG+KPN  +Y LLVDAHLIN
Sbjct: 485  AIGSSFELTPNIHSYNALIYAFGRLKKTFEAARVFEHLVSLGIKPNATTYLLLVDAHLIN 544

Query: 1435 RDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENR 1614
            RD K ALSVI+EM++AGF PSKETLKKVRRRC+RE DY++DDRV  + K  +IRMG ENR
Sbjct: 545  RDVKTALSVIEEMMSAGFTPSKETLKKVRRRCVREMDYDSDDRVGSVAKNCKIRMGTENR 604

Query: 1615 RDMLFNLNYSTNY 1653
            RDMLFNL YST+Y
Sbjct: 605  RDMLFNLEYSTDY 617


>ref|XP_006484190.1| PREDICTED: pentatricopeptide repeat-containing protein At1g26460,
            mitochondrial-like isoform X1 [Citrus sinensis]
          Length = 613

 Score =  832 bits (2148), Expect = 0.0
 Identities = 404/551 (73%), Positives = 475/551 (86%), Gaps = 1/551 (0%)
 Frame = +1

Query: 4    ENWRNPIPNP-SLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADMK 180
            ENWRN  PN  +LAQSL+PLG ++ A + R+ A+SQT+DA  LM++FA+WMTS+RW+DMK
Sbjct: 62   ENWRNQAPNSETLAQSLIPLGLLKNASTQRVQAISQTIDAQTLMDLFANWMTSKRWSDMK 121

Query: 181  QLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTASF 360
            ++FE+WI+ LD +GK NKPDV LYNHY+RAN M  AS  EL+DLVA  ED+ I PNTAS+
Sbjct: 122  EMFEYWIRSLDVHGKPNKPDVGLYNHYIRANFMLEASTAELIDLVAQTEDFAIVPNTASY 181

Query: 361  NLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDLT 540
            NLVLKAM+   E+EAA+K +ERMLQ GK+S+PDDE+YDL+I +LF   +ID+ALKY+D+ 
Sbjct: 182  NLVLKAMHQAGESEAAQKWLERMLQGGKDSLPDDETYDLLISLLFSKGEIDSALKYIDMA 241

Query: 541  LKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVALQ 720
            LKSGYMLSM VFT+CVR CV+  R D L SII++CKTMDQNKALCP+WNLC  IA+VA+Q
Sbjct: 242  LKSGYMLSMNVFTECVRGCVDERRPDALVSIIKKCKTMDQNKALCPTWNLCIIIAEVAVQ 301

Query: 721  EDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAIL 900
            EDN +LAFYALEFLARW+ARGE +RPPVLLS DEGLVVS LGTAGRT+N  LLDASWA+L
Sbjct: 302  EDNSELAFYALEFLARWMARGENARPPVLLSADEGLVVSVLGTAGRTFNQKLLDASWAVL 361

Query: 901  RRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEAEDLFSPFTSLY 1080
            RRSLRQKR P PESYLGKIYAHAS+G+LQRAF TLNE E+AY  S  + E++FSPFTSLY
Sbjct: 362  RRSLRQKRVPKPESYLGKIYAHASMGDLQRAFITLNEFETAYGDSIIDMEEIFSPFTSLY 421

Query: 1081 PLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFEAI 1260
            PLV+ACS+KGFETLDSVYFQLENLS+A+PPYKSVAA+NC+ILGCAN WDLDRAYQTFEA+
Sbjct: 422  PLVVACSRKGFETLDSVYFQLENLSRAEPPYKSVAAINCVILGCANIWDLDRAYQTFEAM 481

Query: 1261 SGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLINRD 1440
              +FGLTP+IHSYNALI AFGKLKKTFEASRVFEHL+SLGVKPN +SYSLLVDAHL NRD
Sbjct: 482  GSSFGLTPDIHSYNALIYAFGKLKKTFEASRVFEHLVSLGVKPNAMSYSLLVDAHLTNRD 541

Query: 1441 AKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENRRD 1620
             K ALSVIDEMVNAGF PSKETLKKVRRRC+RE D E++DRVE L KKF IRM  ENR++
Sbjct: 542  QKAALSVIDEMVNAGFAPSKETLKKVRRRCVREMDEESNDRVEALAKKFDIRMNTENRKN 601

Query: 1621 MLFNLNYSTNY 1653
            +LFNL YS +Y
Sbjct: 602  ILFNLEYSASY 612


>ref|XP_006437942.1| hypothetical protein CICLE_v10031249mg [Citrus clementina]
            gi|567890849|ref|XP_006437945.1| hypothetical protein
            CICLE_v10031249mg [Citrus clementina]
            gi|557540138|gb|ESR51182.1| hypothetical protein
            CICLE_v10031249mg [Citrus clementina]
            gi|557540141|gb|ESR51185.1| hypothetical protein
            CICLE_v10031249mg [Citrus clementina]
          Length = 613

 Score =  829 bits (2141), Expect = 0.0
 Identities = 402/551 (72%), Positives = 474/551 (86%), Gaps = 1/551 (0%)
 Frame = +1

Query: 4    ENWRNPIPNP-SLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADMK 180
            ENWRN  PN  +LAQSL+PLG ++ A + R+ A+SQT+DA  LM++FA+WMTS+RW+DMK
Sbjct: 62   ENWRNQAPNSETLAQSLIPLGLLKNASTQRVQAISQTIDAQTLMDLFANWMTSKRWSDMK 121

Query: 181  QLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTASF 360
            ++FE+WI+ LD +GK NKPDV LYNHY+RAN M  AS  EL+DLVA  ED+ I PNTAS+
Sbjct: 122  EMFEYWIRSLDVHGKPNKPDVGLYNHYIRANFMLEASTAELIDLVAQTEDFAIVPNTASY 181

Query: 361  NLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDLT 540
            NLVLKAM+   E+EAA+K +ERMLQ GK+S+PDDE+YDL+I +LF   +ID++LKY+D+ 
Sbjct: 182  NLVLKAMHQAGESEAAQKWLERMLQGGKDSLPDDETYDLLISLLFSKGEIDSSLKYIDMA 241

Query: 541  LKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVALQ 720
            LKSGYMLSM VFT+CVR CV+  R D L SII++CKTMDQNKALCP+WNLC  IA+VA+Q
Sbjct: 242  LKSGYMLSMNVFTECVRGCVDERRPDALVSIIKKCKTMDQNKALCPTWNLCIIIAEVAVQ 301

Query: 721  EDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAIL 900
            EDN +LAFYALEFLARW+ARGE +RPPVLLS DEGLVVS LGTAGRT+N  LLDASWA+L
Sbjct: 302  EDNSELAFYALEFLARWMARGENARPPVLLSADEGLVVSVLGTAGRTFNQKLLDASWAVL 361

Query: 901  RRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEAEDLFSPFTSLY 1080
            RRSLRQKR P PESYLGKIYAHAS+G+LQRAF TLNE E+AY  S  + E++FSPFTSLY
Sbjct: 362  RRSLRQKRVPKPESYLGKIYAHASMGDLQRAFITLNEFETAYGDSIIDMEEIFSPFTSLY 421

Query: 1081 PLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFEAI 1260
            PLV+ACS+KGFETLDSVYFQLENLS+A+PPYKSVAA+NC+ILGCAN WDLDRAYQTFEA+
Sbjct: 422  PLVVACSRKGFETLDSVYFQLENLSRAEPPYKSVAAINCVILGCANIWDLDRAYQTFEAM 481

Query: 1261 SGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLINRD 1440
              +FGLTP+IHSYNALI AFGKLKKTFEASRVFEHL+SLGVKPN +SYSLLVDAHL NRD
Sbjct: 482  GSSFGLTPDIHSYNALIYAFGKLKKTFEASRVFEHLVSLGVKPNAMSYSLLVDAHLTNRD 541

Query: 1441 AKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENRRD 1620
             K ALSVIDEMVNAGF PSKETLKKVRRRC+RE D E++DRVE L KKF IRM  ENR++
Sbjct: 542  QKAALSVIDEMVNAGFAPSKETLKKVRRRCVREMDEESNDRVEALAKKFDIRMNTENRKN 601

Query: 1621 MLFNLNYSTNY 1653
            +LFNL Y  +Y
Sbjct: 602  ILFNLEYGASY 612


>ref|XP_004144175.1| PREDICTED: pentatricopeptide repeat-containing protein At1g26460,
            mitochondrial-like [Cucumis sativus]
            gi|449489211|ref|XP_004158247.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g26460,
            mitochondrial-like [Cucumis sativus]
          Length = 618

 Score =  827 bits (2137), Expect = 0.0
 Identities = 405/551 (73%), Positives = 470/551 (85%), Gaps = 1/551 (0%)
 Frame = +1

Query: 4    ENWRNPIPNPSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADMKQ 183
            ENWRNP+ N S+AQS++P GF+ Q+P+ RI ALSQTLD   L++VFADWM SQRW DMKQ
Sbjct: 66   ENWRNPLNNYSMAQSMIPDGFLSQSPNYRIQALSQTLDVQGLLSVFADWMASQRWEDMKQ 125

Query: 184  LFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTASFN 363
            LFEFWI+ LD++GK NKPDVNLYN+YLRANLMS A+ G LLDL+  MEDY ISPNTASFN
Sbjct: 126  LFEFWIRSLDKDGKPNKPDVNLYNNYLRANLMSDATPGVLLDLLTRMEDYAISPNTASFN 185

Query: 364  LVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDLTL 543
            LVLKAMY  +ETEAAEKL+ERMLQTG+ESMPDDESYDLVI ML    QIDAALKY+DLT 
Sbjct: 186  LVLKAMYQARETEAAEKLLERMLQTGEESMPDDESYDLVIRMLLSTYQIDAALKYIDLTS 245

Query: 544  KSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVALQE 723
            K G+MLS++ F++CVRSCV  GRLDTL S+I++CK   +NKAL P+WN C  IA  A Q+
Sbjct: 246  KPGHMLSLKAFSECVRSCVRKGRLDTLVSVIDKCKATVENKALSPTWNSCYDIAIAATQQ 305

Query: 724  DNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAILR 903
            DN KLA+YALEF+A+WIARGE +RPPV LSVDEGLVVS LGTAGRTY+ +LLDA+W++L+
Sbjct: 306  DNSKLAYYALEFMAQWIARGENARPPVHLSVDEGLVVSTLGTAGRTYSSSLLDAAWSVLK 365

Query: 904  RSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTT-EAEDLFSPFTSLY 1080
            RSLRQK+ PNPES+LGKIY  ASLGNLQRAFSTL E E AY  S     ED+FSPFTSL+
Sbjct: 366  RSLRQKKVPNPESFLGKIYTLASLGNLQRAFSTLREFEEAYRNSDDGSCEDMFSPFTSLH 425

Query: 1081 PLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFEAI 1260
            PLV+ACSKKGFETLD VYFQLENLS+ADPPYKSVAALNC+ILGCAN WDLDRAYQTFEAI
Sbjct: 426  PLVVACSKKGFETLDLVYFQLENLSRADPPYKSVAALNCVILGCANIWDLDRAYQTFEAI 485

Query: 1261 SGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLINRD 1440
              +FGLTPNIHSYNAL+ AFG+LKKTFEA+RVFEHL+ LG+KPN  SYSLL DAHLINRD
Sbjct: 486  GSSFGLTPNIHSYNALMYAFGRLKKTFEAARVFEHLVGLGIKPNATSYSLLADAHLINRD 545

Query: 1441 AKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENRRD 1620
             K AL+ ID MV AGF PSKE LKKVRRRCIRE DY+++D+V +L + F+IRMG+E+RRD
Sbjct: 546  PKSALAAIDNMVTAGFAPSKELLKKVRRRCIREQDYDSNDKVGNLAQNFKIRMGSESRRD 605

Query: 1621 MLFNLNYSTNY 1653
            +LFNLNY +NY
Sbjct: 606  ILFNLNYGSNY 616


>gb|EXB44833.1| hypothetical protein L484_026413 [Morus notabilis]
          Length = 665

 Score =  820 bits (2118), Expect = 0.0
 Identities = 402/551 (72%), Positives = 473/551 (85%), Gaps = 1/551 (0%)
 Frame = +1

Query: 4    ENWRNPIPNPSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADMKQ 183
            ENWR+PI N S  QS++PL F++Q+P+      S+ +DA +LM+VFAD   SQ W+++K+
Sbjct: 119  ENWRSPIANSSTNQSVIPLDFLRQSPA------SRAMDAKSLMDVFADCTASQNWSEVKR 172

Query: 184  LFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTASFN 363
            LFE W++ LD+NGK NKPDV+L+N+YLRANLM GA+AG+LLD+VA MEDY I PNTASFN
Sbjct: 173  LFEAWVQSLDKNGKPNKPDVSLFNYYLRANLMIGATAGDLLDVVAQMEDYAIKPNTASFN 232

Query: 364  LVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDLTL 543
            LVLKAM   +ETEAA KL++RML TGKES+PDDESY+LV+G+LF  ++ID A KY+DLTL
Sbjct: 233  LVLKAMCQARETEAAVKLLDRMLLTGKESLPDDESYNLVLGLLFQSDKIDEAFKYIDLTL 292

Query: 544  KSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVALQE 723
            KSGY LS+RVF+DCVR CVN GRLD L SII+RCK MDQNKALCP+W LCN+IA VALQE
Sbjct: 293  KSGYTLSLRVFSDCVRVCVNKGRLDALVSIIDRCKAMDQNKALCPTWGLCNFIAGVALQE 352

Query: 724  DNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAILR 903
            DN KLA++ALEF+A+WIARGE +RPPV LSV+EGL+VSA+GTA RTY+  LLDASW ILR
Sbjct: 353  DNSKLAYHALEFMAKWIARGEQTRPPVWLSVEEGLLVSAIGTAARTYDSDLLDASWVILR 412

Query: 904  RSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEA-EDLFSPFTSLY 1080
            RSLRQK+APNPE+YL KI+A A LGNL+RAFSTL E ES Y  S TEA E+LFSPFTSLY
Sbjct: 413  RSLRQKKAPNPEAYLAKIHALALLGNLRRAFSTLQEFESVYGNSQTEAEEELFSPFTSLY 472

Query: 1081 PLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFEAI 1260
            PLV+ACSKKGFETLDSVYFQLENLS+ADPPYKSVAALNCIILGCAN WDLDRAYQTFEAI
Sbjct: 473  PLVVACSKKGFETLDSVYFQLENLSRADPPYKSVAALNCIILGCANNWDLDRAYQTFEAI 532

Query: 1261 SGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLINRD 1440
              +FGL P+IHSYNAL+ AFG LKKTFEAS+VF+H++SLG+KPN  SYSLLVD HLINRD
Sbjct: 533  GSSFGLNPDIHSYNALVKAFGNLKKTFEASKVFDHMLSLGIKPNATSYSLLVDTHLINRD 592

Query: 1441 AKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENRRD 1620
             K ALS++D+MVNAGF PSKETLKKVRRRCIRE DYE++DRV+    K++IRMG ENRRD
Sbjct: 593  QKAALSMLDDMVNAGFEPSKETLKKVRRRCIREMDYESNDRVQSFAVKYKIRMGTENRRD 652

Query: 1621 MLFNLNYSTNY 1653
            MLFNL YST Y
Sbjct: 653  MLFNLRYSTEY 663


>ref|XP_006303169.1| hypothetical protein CARUB_v10008572mg [Capsella rubella]
            gi|482571880|gb|EOA36067.1| hypothetical protein
            CARUB_v10008572mg [Capsella rubella]
          Length = 630

 Score =  813 bits (2101), Expect = 0.0
 Identities = 397/550 (72%), Positives = 471/550 (85%), Gaps = 2/550 (0%)
 Frame = +1

Query: 1    QENWRNPIPN-PSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADM 177
            QENWR+PIPN PS  QSLVPLGF+ QAP+ RI ALS+TLD  AL+N+FADW  SQRW+DM
Sbjct: 74   QENWRSPIPNTPSFNQSLVPLGFLNQAPAARIRALSETLDMNALLNMFADWTASQRWSDM 133

Query: 178  KQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTAS 357
            KQLFEFW++ LD+NGK NKPDVNLYNHYLRANLM GASA ++LDLVA ME++ ++PNTAS
Sbjct: 134  KQLFEFWVRSLDKNGKPNKPDVNLYNHYLRANLMMGASAADMLDLVAPMEEFSVAPNTAS 193

Query: 358  FNLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDL 537
            +NLVLKAMY  +ET+AA KL+ERML  GKES+PDDESYDLVIGMLF   + D A+K +D+
Sbjct: 194  YNLVLKAMYQARETDAAMKLLERMLLLGKESLPDDESYDLVIGMLFGTGKNDEAMKIMDM 253

Query: 538  TLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVAL 717
             LKSGYMLS  VFT+CVRSCV  GR DTL SIIERCK +D+NK+LCPSW LCNYIA+VA+
Sbjct: 254  ALKSGYMLSTTVFTECVRSCVAKGRTDTLVSIIERCKAVDRNKSLCPSWILCNYIAEVAI 313

Query: 718  QEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAI 897
            QEDN KLAFYA EF+ +WI RGE +RP V+LSVDEGLVV+ L TA RT + +L++ SW I
Sbjct: 314  QEDNSKLAFYAFEFMFKWITRGEMARPSVILSVDEGLVVAGLATAARTCSSSLVEGSWTI 373

Query: 898  LRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTE-AEDLFSPFTS 1074
            L++SLR ++A NP SY+ KI A+ASLGNLQ+AF++L+ELESAY  S  E  E++ SPFTS
Sbjct: 374  LKQSLRGRKAANPASYIAKINAYASLGNLQKAFTSLHELESAYADSEKEVVEEMLSPFTS 433

Query: 1075 LYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFE 1254
            LYPLV+ACSKKGFETLD VYFQLE+LSQ D PYKSVAALNCIILGCANTWDLDRAYQTFE
Sbjct: 434  LYPLVVACSKKGFETLDEVYFQLESLSQGDTPYKSVAALNCIILGCANTWDLDRAYQTFE 493

Query: 1255 AISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLIN 1434
            AIS +FGLTPNI SYNAL+ AFGK+KKTFEA+ VFEHL+S+GVKP+  +YSLLVDAHLIN
Sbjct: 494  AISASFGLTPNIDSYNALLYAFGKVKKTFEATNVFEHLVSIGVKPDSRTYSLLVDAHLIN 553

Query: 1435 RDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENR 1614
            RD K AL+V+D+M+ AGF PS+ETLKK+RRRC+RE D ENDD+VE L KKF+IRMG ENR
Sbjct: 554  RDPKSALTVVDDMIKAGFEPSRETLKKLRRRCVREMDNENDDQVEALAKKFQIRMGTENR 613

Query: 1615 RDMLFNLNYS 1644
            R+MLFN++YS
Sbjct: 614  RNMLFNIDYS 623


>ref|XP_002893403.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297339245|gb|EFH69662.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 630

 Score =  809 bits (2089), Expect = 0.0
 Identities = 395/550 (71%), Positives = 470/550 (85%), Gaps = 2/550 (0%)
 Frame = +1

Query: 1    QENWRNPIPN-PSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADM 177
            QENWR+PIPN PS  QSLVPLGF+ QAP+ RI ALS+TLD  +L+N+FADW  SQRW+DM
Sbjct: 74   QENWRSPIPNTPSFNQSLVPLGFLNQAPAARIRALSETLDMNSLLNMFADWTASQRWSDM 133

Query: 178  KQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTAS 357
            KQLFE W++ LD+NGK NKPDVNLYNHYLRANLM GASAG++LDLVA ME++ ++PNTAS
Sbjct: 134  KQLFEVWVRSLDKNGKPNKPDVNLYNHYLRANLMMGASAGDMLDLVAPMEEFSVAPNTAS 193

Query: 358  FNLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDL 537
            +NLVLKAMY  +ET+AA KL+ERML  GKES PDDESYDLVIGM F V + D A+K +D 
Sbjct: 194  YNLVLKAMYQARETDAAMKLLERMLLLGKESPPDDESYDLVIGMHFGVGKNDEAMKVMDT 253

Query: 538  TLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVAL 717
             LKSGYMLS  VFT+CVRSCV  GR DTL SIIERCK +D+NK+LCPSW LCNYIA+VA+
Sbjct: 254  ALKSGYMLSTTVFTECVRSCVAKGRTDTLVSIIERCKAVDRNKSLCPSWILCNYIAEVAI 313

Query: 718  QEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAI 897
            QEDN KLAFYA EF+ +WI RGE +RP V+LSVDEGLVV+ L TA RT + +L++ SW I
Sbjct: 314  QEDNSKLAFYAFEFMFKWITRGEMARPSVILSVDEGLVVAGLATAARTCSSSLVEGSWTI 373

Query: 898  LRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTE-AEDLFSPFTS 1074
            L++SLR ++A NP SY+ KI A+ASLGNLQ+AF++L+ELE+AY  S  E  E++ SPFTS
Sbjct: 374  LKQSLRGRKAANPASYIAKINAYASLGNLQKAFTSLHELETAYADSEKEVVEEMLSPFTS 433

Query: 1075 LYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFE 1254
            LYPLV+ACSKKGFETLD VYFQLE+LS+ D PYKSVAALNCIILGCANTWDLDRAYQTFE
Sbjct: 434  LYPLVVACSKKGFETLDEVYFQLESLSRGDTPYKSVAALNCIILGCANTWDLDRAYQTFE 493

Query: 1255 AISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLIN 1434
            AIS +FGLTPNI SYNAL+ AFGK+KKTFEA+ VFEHL+S+GVKP+  +YSLLVDAHLIN
Sbjct: 494  AISASFGLTPNIDSYNALLYAFGKVKKTFEATNVFEHLVSIGVKPDSRTYSLLVDAHLIN 553

Query: 1435 RDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENR 1614
            RD K AL+V+D+M+ AGF PS+ETLKK+RRRC+RE DYENDD+VE L KKF+IRMG ENR
Sbjct: 554  RDPKSALTVVDDMIKAGFEPSRETLKKLRRRCVREMDYENDDQVEALAKKFQIRMGTENR 613

Query: 1615 RDMLFNLNYS 1644
            R+MLFN++YS
Sbjct: 614  RNMLFNIDYS 623


>ref|NP_564247.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75173405|sp|Q9FZD1.1|PPR58_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g26460, mitochondrial; Flags: Precursor
            gi|9797754|gb|AAF98572.1|AC013427_15 Contains similarity
            to a hypothetical protein F21B7.16 gi|7485908 from
            Arabidopsis thaliana BAC F21B7 gb|AC002560 and contains
            multiple PPR PF|01535 repeats and a domain of unknown
            function PF|00668. ESTs gb|T45755, gb|AI993167,
            gb|AV554476, gb|T46823, gb|T41981, gb|AV546597,
            gb|AI099868 come from this gene [Arabidopsis thaliana]
            gi|19698979|gb|AAL91225.1| unknown protein [Arabidopsis
            thaliana] gi|22136300|gb|AAM91228.1| unknown protein
            [Arabidopsis thaliana] gi|332192573|gb|AEE30694.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 630

 Score =  806 bits (2081), Expect = 0.0
 Identities = 394/550 (71%), Positives = 469/550 (85%), Gaps = 2/550 (0%)
 Frame = +1

Query: 1    QENWRNPIPN-PSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADM 177
            QENWR+PIPN PS  QSLVPLGF+ QAP+ RI ALS+TLD  +L+N+FADW  SQRW+DM
Sbjct: 74   QENWRSPIPNTPSFNQSLVPLGFLNQAPAPRIRALSETLDMNSLLNMFADWTASQRWSDM 133

Query: 178  KQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTAS 357
            KQLFE W++ LD+NGK NKPDVNLYNHYLRANLM GASAG++LDLVA ME++ + PNTAS
Sbjct: 134  KQLFEVWVRSLDKNGKPNKPDVNLYNHYLRANLMMGASAGDMLDLVAPMEEFSVEPNTAS 193

Query: 358  FNLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDL 537
            +NLVLKAMY  +ETEAA KL+ERML  GK+S+PDDESYDLVIGM F V + D A+K +D 
Sbjct: 194  YNLVLKAMYQARETEAAMKLLERMLLLGKDSLPDDESYDLVIGMHFGVGKNDEAMKVMDT 253

Query: 538  TLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVAL 717
             LKSGYMLS  VFT+CVRSCV  GR DTL SIIERCK +D+NK+LCPSW LCNYIA+VA+
Sbjct: 254  ALKSGYMLSTSVFTECVRSCVAKGRTDTLVSIIERCKAVDRNKSLCPSWILCNYIAEVAI 313

Query: 718  QEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAI 897
            QEDN KLAFYA EF+ +WI RGE +RP V+ SVDEGLVV+ L +A RT + +L++ SW I
Sbjct: 314  QEDNSKLAFYAFEFMFKWITRGEMARPSVIFSVDEGLVVAGLASAARTCSSSLVEGSWTI 373

Query: 898  LRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTE-AEDLFSPFTS 1074
            L++SLR ++A NP SY+ KI A+ASLGNLQ+AF++L+ELESAY  S  E  E++ SPFTS
Sbjct: 374  LKQSLRGRKAANPASYIAKINAYASLGNLQKAFTSLHELESAYADSEKEVVEEMLSPFTS 433

Query: 1075 LYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFE 1254
            LYPLV+ACSKKGFETLD VYFQLE+LSQ D PYKSVAALNCIILGCANTWDLDRAYQTFE
Sbjct: 434  LYPLVVACSKKGFETLDEVYFQLESLSQGDTPYKSVAALNCIILGCANTWDLDRAYQTFE 493

Query: 1255 AISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLIN 1434
            AIS +FGLTPNI SYNAL+ AFGK+KKTFEA+ VFEHL+S+GVKP+  +YSLLVDAHLIN
Sbjct: 494  AISASFGLTPNIDSYNALLYAFGKVKKTFEATNVFEHLVSIGVKPDSRTYSLLVDAHLIN 553

Query: 1435 RDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENR 1614
            RD K AL+V+D+M+ AGF PS+ETLKK+RRRC+RE D ENDD+VE L KKF+IRMG+ENR
Sbjct: 554  RDPKSALTVVDDMIKAGFEPSRETLKKLRRRCVREMDDENDDQVEALAKKFQIRMGSENR 613

Query: 1615 RDMLFNLNYS 1644
            R+MLFN++YS
Sbjct: 614  RNMLFNIDYS 623


>ref|XP_006415925.1| hypothetical protein EUTSA_v10007071mg [Eutrema salsugineum]
            gi|557093696|gb|ESQ34278.1| hypothetical protein
            EUTSA_v10007071mg [Eutrema salsugineum]
          Length = 625

 Score =  805 bits (2078), Expect = 0.0
 Identities = 394/551 (71%), Positives = 469/551 (85%), Gaps = 3/551 (0%)
 Frame = +1

Query: 1    QENWRNPIPN-PSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADM 177
            QENWR+PIP+ PS +QSLVP+GF+ QAP+ RI A+S+TLD  +L+N+FADW  SQRW++M
Sbjct: 71   QENWRSPIPSSPSSSQSLVPMGFLNQAPAPRIRAISETLDMNSLLNMFADWTASQRWSEM 130

Query: 178  KQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTAS 357
            KQLFEFW++ LD+NGK NKPDVNLYNHYLRANLM GAS  ++LDLVA M+D+ ++PNTAS
Sbjct: 131  KQLFEFWVRSLDKNGKPNKPDVNLYNHYLRANLMMGASPSDMLDLVALMDDFSLAPNTAS 190

Query: 358  FNLVLKAMYLQQETEAAEKLIERMLQTGKE-SMPDDESYDLVIGMLFLVNQIDAALKYLD 534
            +N+VLKAM+  +ETEAA+KL+ RML +GKE S+PDDESYD+VI +LFL    D A+K +D
Sbjct: 191  YNIVLKAMHQARETEAAQKLLNRMLMSGKEESVPDDESYDVVIELLFLTGNNDEAMKLMD 250

Query: 535  LTLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVA 714
            L LKSGYMLS   F++CVRSCV  GR DTL SIIERCK +D+NK+LCPSW LC Y+A+VA
Sbjct: 251  LALKSGYMLSTTAFSECVRSCVAKGRTDTLVSIIERCKALDRNKSLCPSWILCTYMAEVA 310

Query: 715  LQEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWA 894
            +QEDN KLAFYA EF+ +WI RGE + P VLLSVDEGLVVSAL TA RT NPTL+D SW 
Sbjct: 311  VQEDNSKLAFYAFEFMFKWITRGEMAWPLVLLSVDEGLVVSALATAARTCNPTLVDGSWM 370

Query: 895  ILRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEA-EDLFSPFT 1071
            IL+RSLR K+APNP SY+ KI A+ASLGNLQ+AF  L+E E+AY  S  E  E++ SPFT
Sbjct: 371  ILKRSLRGKKAPNPASYIAKINAYASLGNLQKAFIALHEFENAYADSEKEVVEEILSPFT 430

Query: 1072 SLYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTF 1251
            SLYPLV+ACSKKGFETLD VYFQLE LSQ + PYKSVAALNCIILGCANTWDLDRAYQTF
Sbjct: 431  SLYPLVVACSKKGFETLDEVYFQLETLSQGETPYKSVAALNCIILGCANTWDLDRAYQTF 490

Query: 1252 EAISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLI 1431
            +AIS +FGLTPNI SYNALI AFGK+KKTFEA+RV+EHL+S+GVKP+  +YSLLVDAHLI
Sbjct: 491  DAISASFGLTPNIDSYNALIYAFGKVKKTFEATRVYEHLVSVGVKPDARTYSLLVDAHLI 550

Query: 1432 NRDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNEN 1611
            NRD K AL+VID+M+NAGF P+KETLKKVRRRC+RE DYEN+D VE L KKF IRMG+EN
Sbjct: 551  NRDPKSALTVIDDMINAGFEPTKETLKKVRRRCVREIDYENNDHVESLAKKFEIRMGSEN 610

Query: 1612 RRDMLFNLNYS 1644
            RR+MLFNL+YS
Sbjct: 611  RRNMLFNLDYS 621


>gb|AAM61409.1| unknown [Arabidopsis thaliana]
          Length = 630

 Score =  803 bits (2075), Expect = 0.0
 Identities = 393/550 (71%), Positives = 468/550 (85%), Gaps = 2/550 (0%)
 Frame = +1

Query: 1    QENWRNPIPN-PSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADM 177
            QENWR+PIPN PS  QSLVPLGF+ QAP+ RI ALS+TLD  +L+N+FADW  SQRW+DM
Sbjct: 74   QENWRSPIPNTPSFNQSLVPLGFLNQAPAPRIRALSETLDMNSLLNMFADWTASQRWSDM 133

Query: 178  KQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTAS 357
            KQLFE W++ LD+NGK NKPDVNLYNHYLRANLM GASAG++LDLVA ME++ + PNTAS
Sbjct: 134  KQLFEVWVRSLDKNGKPNKPDVNLYNHYLRANLMMGASAGDMLDLVAPMEEFSVEPNTAS 193

Query: 358  FNLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDL 537
            +NLVLKAMY  +ETEAA KL+ERML  GK+S+PDDESYDLVIGM F V + D A+K +D 
Sbjct: 194  YNLVLKAMYQARETEAAMKLLERMLLLGKDSLPDDESYDLVIGMHFGVGKNDEAMKVMDT 253

Query: 538  TLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVAL 717
             LKSGYMLS  VFT+CVRSCV  GR DTL SIIERCK +D+NK+LCPSW LCNYIA+VA+
Sbjct: 254  ALKSGYMLSTSVFTECVRSCVAKGRTDTLVSIIERCKAVDRNKSLCPSWILCNYIAEVAI 313

Query: 718  QEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAI 897
            QEDN KLAFYA EF+ +WI RGE +RP V+ SVDEGLVV+ L +A RT + +L++ SW I
Sbjct: 314  QEDNSKLAFYAFEFMFKWITRGEMARPSVIFSVDEGLVVAGLASAARTCSSSLVEGSWTI 373

Query: 898  LRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTE-AEDLFSPFTS 1074
            L++SLR ++A  P SY+ KI A+ASLGNLQ+AF++L+ELESAY  S  E  E++ SPFTS
Sbjct: 374  LKQSLRGRKAAKPASYIAKINAYASLGNLQKAFTSLHELESAYADSEKEVVEEMLSPFTS 433

Query: 1075 LYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFE 1254
            LYPLV+ACSKKGFETLD VYFQLE+LSQ D PYKSVAALNCIILGCANTWDLDRAYQTFE
Sbjct: 434  LYPLVVACSKKGFETLDEVYFQLESLSQGDTPYKSVAALNCIILGCANTWDLDRAYQTFE 493

Query: 1255 AISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLIN 1434
            AIS +FGLTPNI SYNAL+ AFGK+KKTFEA+ VFEHL+S+GVKP+  +YSLLVDAHLIN
Sbjct: 494  AISASFGLTPNIDSYNALLYAFGKVKKTFEATNVFEHLVSIGVKPDSRTYSLLVDAHLIN 553

Query: 1435 RDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENR 1614
            RD K AL+V+D+M+ AGF PS+ETLKK+RRRC+RE D ENDD+VE L KKF+IRMG+ENR
Sbjct: 554  RDPKSALTVVDDMIKAGFEPSRETLKKLRRRCVREMDDENDDQVEALAKKFQIRMGSENR 613

Query: 1615 RDMLFNLNYS 1644
            R+MLFN++YS
Sbjct: 614  RNMLFNIDYS 623


>gb|EYU22373.1| hypothetical protein MIMGU_mgv1a001842mg [Mimulus guttatus]
          Length = 751

 Score =  799 bits (2064), Expect = 0.0
 Identities = 393/557 (70%), Positives = 464/557 (83%), Gaps = 7/557 (1%)
 Frame = +1

Query: 4    ENWRNPIPNPSLAQ-SLVP--LGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWAD 174
            ENWRNP    S    +L+P  LGF+Q     RI  LSQ+LD+ +LMN FADWMTSQRW D
Sbjct: 194  ENWRNPTSAYSAGDGALIPVGLGFLQHTQGARIQMLSQSLDSQSLMNQFADWMTSQRWED 253

Query: 175  MKQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTA 354
            MKQLFEFWI+ LD NGK NKPDVNLYNHYLRANLM G+SAGELLD+VA M+DY I PNTA
Sbjct: 254  MKQLFEFWIRSLDVNGKPNKPDVNLYNHYLRANLMIGSSAGELLDVVAQMDDYGILPNTA 313

Query: 355  SFNLVLKAMYLQQETEAAEKLIERMLQTGKE---SMPDDESYDLVIGMLFLVNQIDAALK 525
            S+NLVLKAM    ET AAEKLIERM+QTGKE   S+PD+ESYDL+I ML   NQIDAALK
Sbjct: 314  SYNLVLKAMQKAGETVAAEKLIERMIQTGKEYKESLPDEESYDLIIVMLLSKNQIDAALK 373

Query: 526  YLDLTLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIA 705
            Y+DLTLKSGYMLSM+ F DCV+SCV+ GRL+TL SIIERCK MDQNK+LCP W +CNYIA
Sbjct: 374  YIDLTLKSGYMLSMKAFVDCVQSCVSNGRLETLVSIIERCKKMDQNKSLCPPWRVCNYIA 433

Query: 706  DVALQEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDA 885
            D+A+Q DNG+L +YALEF+A+WIA+GE +RPPVLL+ DEGLVVSA+GTAGR  +  LLD 
Sbjct: 434  DIAMQSDNGELTYYALEFMAKWIAQGERARPPVLLAADEGLVVSAIGTAGRIGHSKLLDG 493

Query: 886  SWAILRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTE-AEDLFS 1062
            SWA+L+RSLRQK+ PNPESYLGKIYAHA+LGNLQ+AFSTL+E E+AY  S+ E A+DLFS
Sbjct: 494  SWAVLKRSLRQKKLPNPESYLGKIYAHANLGNLQKAFSTLHEFETAYGNSSQEDADDLFS 553

Query: 1063 PFTSLYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAY 1242
            PF SL PLVMACSK GF TLDSVY+QLENLS A+PPYKSVAALNC+ILGCAN WD+DRAY
Sbjct: 554  PFYSLNPLVMACSKNGFATLDSVYYQLENLSHANPPYKSVAALNCVILGCANIWDVDRAY 613

Query: 1243 QTFEAISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDA 1422
            QTF AI  +FGLTPN+HSYNALI AFGKL K  EA +VFEH + LG+KPN  +Y+LL+DA
Sbjct: 614  QTFNAIDSSFGLTPNVHSYNALIYAFGKLSKRDEAVKVFEHFVGLGLKPNSTTYTLLIDA 673

Query: 1423 HLINRDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMG 1602
            HLI RD K ALSV+DEM++AG+ PSK+ LKK+RRRC+RE DY++D +VE   ++ +IR+G
Sbjct: 674  HLIKRDPKAALSVVDEMIHAGYEPSKKLLKKIRRRCVREMDYDSDAKVESFARQLKIRLG 733

Query: 1603 NENRRDMLFNLNYSTNY 1653
             E+RRD+LFNL Y T+Y
Sbjct: 734  TESRRDVLFNLQYVTDY 750


>ref|XP_004297246.1| PREDICTED: pentatricopeptide repeat-containing protein At1g26460,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 618

 Score =  787 bits (2033), Expect = 0.0
 Identities = 389/552 (70%), Positives = 465/552 (84%), Gaps = 2/552 (0%)
 Frame = +1

Query: 4    ENWRNPIPNPSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADMKQ 183
            ENWR+P  N S A SL+P GF++Q+PS +I +L+Q LD+P L+NVFADWMTSQRW +MKQ
Sbjct: 66   ENWRSPPINFSAAASLLPSGFLEQSPSYKIQSLAQDLDSPGLLNVFADWMTSQRWTEMKQ 125

Query: 184  LFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTASFN 363
            LFE WI+ +D++G+ N+PDV+ YNHYL ANLMSGA+A ++LDLVA MED  ++PNTASFN
Sbjct: 126  LFEVWIRSMDKSGRPNRPDVSSYNHYLMANLMSGATAADMLDLVARMEDCGVAPNTASFN 185

Query: 364  LVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDLTL 543
            LVLKAM+  QE+EAAEKL++RMLQTGK S PDDESY++V+ +L   ++ DAA KY++LTL
Sbjct: 186  LVLKAMHAGQESEAAEKLLQRMLQTGKASPPDDESYNIVVSLLLQSHRNDAAFKYIELTL 245

Query: 544  KSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVALQE 723
            KSGYM++M VF DCV++CVN G+LDTL SII++CK+MDQNKALCP WNLCN+IADVALQ 
Sbjct: 246  KSGYMVTMTVFQDCVQACVNTGKLDTLVSIIDKCKSMDQNKALCPPWNLCNFIADVALQV 305

Query: 724  DNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAILR 903
            DN KLA +AL+F+A+WIARGE +RP V L VDEGL+VSAL TAGRTY+ TLLD SWAIL+
Sbjct: 306  DNSKLAVHALQFMAKWIARGEAARPAVFLPVDEGLLVSALQTAGRTYSTTLLDVSWAILQ 365

Query: 904  RSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTE-AEDLFSPFTSLY 1080
            RSLR K+ PNPES+L KI A ASLG LQRAF TL+E ESAY  S  E  E+LF PFTSLY
Sbjct: 366  RSLRGKKVPNPESFLAKISALASLGELQRAFRTLSEFESAYANSGKEIEEELFCPFTSLY 425

Query: 1081 PLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFEAI 1260
            PLV+ACSK GFETLD+VYFQLE+LS ADPPYKSVAALNCIILGCAN WDLDRAYQTFEAI
Sbjct: 426  PLVVACSKNGFETLDTVYFQLESLSCADPPYKSVAALNCIILGCANIWDLDRAYQTFEAI 485

Query: 1261 SGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLINRD 1440
            S TFGLTP+IHSYNAL+ AFGKLKKT EA RVFEHL+SLGV+PN +SYSLLVDAHLINRD
Sbjct: 486  SSTFGLTPDIHSYNALMQAFGKLKKTSEAVRVFEHLVSLGVRPNAMSYSLLVDAHLINRD 545

Query: 1441 AKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENRRD 1620
             K A+SV+ +MV +GF PSKETLKK++RRC+RE DYE+DD VE+  K F +RM  E RR+
Sbjct: 546  PKVAVSVVKDMVKSGFKPSKETLKKIKRRCMREMDYESDDLVEEYAKTFDLRMDGEARRN 605

Query: 1621 MLFNLNY-STNY 1653
             LF L + ST Y
Sbjct: 606  RLFELKFNSTTY 617


>ref|XP_002315005.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550329962|gb|EEF01176.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 623

 Score =  780 bits (2015), Expect = 0.0
 Identities = 400/556 (71%), Positives = 457/556 (82%), Gaps = 7/556 (1%)
 Frame = +1

Query: 7    NWRNPIP---NPSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADM 177
            NWRNPIP   NP+    ++PLG      S RI ++   +D  +L N+ ADW TSQRW D+
Sbjct: 72   NWRNPIPIYQNPN--SPMIPLGPFHNQTS-RIQSMPPNMDLNSLSNMLADWTTSQRWEDI 128

Query: 178  KQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTAS 357
            K  FE WIK LDRNGK NKPDV+LYNHYLRANLM  ASAG LLDLVA MED+ +SPNT S
Sbjct: 129  KGYFEAWIKSLDRNGKPNKPDVSLYNHYLRANLMMKASAGYLLDLVAQMEDFNLSPNTVS 188

Query: 358  FNLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDA---ALKY 528
            FN VLKAMY   ETEAAEKL++R+   G    PD+ESYDLV+ ML     IDA   ALKY
Sbjct: 189  FNFVLKAMYEGLETEAAEKLLQRLANFGL--FPDEESYDLVVTMLLNKGNIDAFDTALKY 246

Query: 529  LDLTLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIAD 708
            +D  LK  Y+LSM+VF  CVRSC N GRLD L SIIE+CK MDQNKALCP+WNLCN+IA+
Sbjct: 247  IDKILKGDYVLSMKVFDACVRSCCNFGRLDVLLSIIEKCKKMDQNKALCPNWNLCNHIAE 306

Query: 709  VALQEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDAS 888
            +AL+EDN KL F ALEF+ARWIARGE +RP VLLSVDEGL+V+ALGTAGRTYN TLLDAS
Sbjct: 307  IALKEDNSKLLFCALEFMARWIARGEKARPIVLLSVDEGLIVAALGTAGRTYNSTLLDAS 366

Query: 889  WAILRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEAED-LFSP 1065
            WAILRRSLRQK+APNPESY+GKIYAHASLG+LQ+AF+TL ELES Y  S  EAE+ LFSP
Sbjct: 367  WAILRRSLRQKKAPNPESYIGKIYAHASLGSLQKAFATLRELESCYGSSDKEAEEELFSP 426

Query: 1066 FTSLYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQ 1245
            F+SL PLV+ACSKKGFETLDSVYFQLENLS+A+ PYKSVAALNCIILGCAN WDLDRAYQ
Sbjct: 427  FSSLNPLVLACSKKGFETLDSVYFQLENLSRAESPYKSVAALNCIILGCANIWDLDRAYQ 486

Query: 1246 TFEAISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAH 1425
            TFEAIS +FGLTPNIHSYNALI AFG+LKKTFEAS VFEHL+SLGVKPN +SYSLLVDAH
Sbjct: 487  TFEAISSSFGLTPNIHSYNALIFAFGRLKKTFEASNVFEHLVSLGVKPNAMSYSLLVDAH 546

Query: 1426 LINRDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGN 1605
            LINRD K A+ VID+M +AGF PSKE LKKV+RRCIRE +YE+DDRVE   +KF  R+G+
Sbjct: 547  LINRDTKAAVLVIDKMDSAGFVPSKEILKKVKRRCIREMEYESDDRVEFWARKFDYRLGS 606

Query: 1606 ENRRDMLFNLNYSTNY 1653
            +NRRD+LFNL YST++
Sbjct: 607  QNRRDLLFNLEYSTDF 622


>ref|XP_003516357.1| PREDICTED: pentatricopeptide repeat-containing protein At1g26460,
            mitochondrial-like isoform X1 [Glycine max]
          Length = 616

 Score =  777 bits (2007), Expect = 0.0
 Identities = 383/556 (68%), Positives = 454/556 (81%), Gaps = 6/556 (1%)
 Frame = +1

Query: 4    ENWRNPIPNPSLAQS------LVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQR 165
            ENWR+PIP P  A S      L P+GF  +A S       +T D  AL+++F DWM SQ+
Sbjct: 67   ENWRSPIPPPPPASSAATSHALAPVGFYNRATS-------ETYDPRALLDLFGDWMASQQ 119

Query: 166  WADMKQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISP 345
            W D+K LFE W++ LD+ GK NKPDVNL+NHYLRANLM GASA ELLDLVA M +++++P
Sbjct: 120  WHDVKFLFEAWVRSLDKTGKPNKPDVNLFNHYLRANLMLGASAAELLDLVAQMAEFDVAP 179

Query: 346  NTASFNLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALK 525
            NTASFNLVLKAM   +ET AA+KL++RMLQ+G +++PDDESYDLVIGMLF ++QID A K
Sbjct: 180  NTASFNLVLKAMCQAKETLAADKLLQRMLQSGNDALPDDESYDLVIGMLFSMDQIDTAFK 239

Query: 526  YLDLTLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIA 705
            Y+DL LKSG +LSM+VF +C  SCVN GRLDTL +IIERC+  DQNKALCP+W+LCN+I 
Sbjct: 240  YIDLILKSGNVLSMKVFMNCAGSCVNKGRLDTLVTIIERCRASDQNKALCPNWDLCNFIV 299

Query: 706  DVALQEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDA 885
            ++A +EDN KLAFY LEF+A+WI +GE  RPP+ +SVDEGLV+SAL TAGRTYN  LL A
Sbjct: 300  EIATREDNSKLAFYGLEFMAKWIVKGERQRPPIYISVDEGLVLSALLTAGRTYNTDLLVA 359

Query: 886  SWAILRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEAEDLFSP 1065
            SWA+L RSLR+K+APNPESYLGKIYAHASLGNLQ+AF TLNE ESAY  S  EAEDLF P
Sbjct: 360  SWAVLDRSLRKKKAPNPESYLGKIYAHASLGNLQKAFGTLNEYESAYGDSGQEAEDLFCP 419

Query: 1066 FTSLYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQ 1245
            FTSL+PLV+ACSKKGFETLD+VYFQLENL++A+PPYKSVAALNC+ILGCAN WDLDRAYQ
Sbjct: 420  FTSLHPLVVACSKKGFETLDNVYFQLENLNRAEPPYKSVAALNCVILGCANIWDLDRAYQ 479

Query: 1246 TFEAISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAH 1425
            TFE+I  TFGL P+IHSYN LI AFGKLKKT EA+RVFEHL+SLG+K N  SYSLLVDAH
Sbjct: 480  TFESIGSTFGLIPDIHSYNGLIYAFGKLKKTHEATRVFEHLVSLGLKSNAKSYSLLVDAH 539

Query: 1426 LINRDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGN 1605
            LINRD K AL+VID+M  AG+ PSKE LKKVRRRC RE D E+D RV+ L      ++G+
Sbjct: 540  LINRDVKSALAVIDDMRAAGYEPSKEMLKKVRRRCTREMDNESDARVQSLANSLNYQLGS 599

Query: 1606 ENRRDMLFNLNYSTNY 1653
            ENRRD+LFNLNYS  Y
Sbjct: 600  ENRRDILFNLNYSMGY 615


>ref|XP_007222915.1| hypothetical protein PRUPE_ppa003665mg [Prunus persica]
            gi|462419851|gb|EMJ24114.1| hypothetical protein
            PRUPE_ppa003665mg [Prunus persica]
          Length = 558

 Score =  773 bits (1995), Expect = 0.0
 Identities = 380/495 (76%), Positives = 435/495 (87%)
 Frame = +1

Query: 4    ENWRNPIPNPSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADMKQ 183
            ENWRNPIPN S+ QSL+PLGF+ Q+PS RIHALSQTLD  +LMNVFADWMTSQRWADMKQ
Sbjct: 64   ENWRNPIPNSSMTQSLLPLGFLNQSPSSRIHALSQTLDVQSLMNVFADWMTSQRWADMKQ 123

Query: 184  LFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTASFN 363
            LFEFWI+ LD+NGK NKPDVNLYNHYLRANLM+GAS G++LDLV HMEDY ++PNTASFN
Sbjct: 124  LFEFWIRSLDKNGKPNKPDVNLYNHYLRANLMTGASPGQMLDLVGHMEDYGVTPNTASFN 183

Query: 364  LVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDLTL 543
            LVLK+M+  +E +AAEKL+ERMLQTG ES PDDESYDLV+GMLF  ++IDAALKY+DLTL
Sbjct: 184  LVLKSMHQAREIDAAEKLLERMLQTGNESPPDDESYDLVVGMLFQTDRIDAALKYIDLTL 243

Query: 544  KSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVALQE 723
            KSGYMLSM VF DCV+ C + G+LD L SII++CK+MDQNKALCP WN+CNYIA+VALQ 
Sbjct: 244  KSGYMLSMAVFRDCVQGCADKGKLDILVSIIDKCKSMDQNKALCPPWNMCNYIAEVALQV 303

Query: 724  DNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAILR 903
            DN KLAF+ALEF+A+WIARGE +RP V LSVDEGL+VSAL TA RT++ TLLDASWAILR
Sbjct: 304  DNSKLAFHALEFMAKWIARGEQARPAVFLSVDEGLLVSALATAARTHSTTLLDASWAILR 363

Query: 904  RSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEAEDLFSPFTSLYP 1083
            RSLRQK+APNPESY GKI A ASLG+LQRAFSTL+E ESAY  S  E E+LFSPFTSL+P
Sbjct: 364  RSLRQKKAPNPESYRGKICALASLGSLQRAFSTLHEYESAYGNSDKE-EELFSPFTSLHP 422

Query: 1084 LVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFEAIS 1263
            LV+ACSK GFETLDSVY+QLENLS+ADPPYKSVAALNCIILGCAN W ++RAYQTF+AIS
Sbjct: 423  LVVACSKGGFETLDSVYYQLENLSRADPPYKSVAALNCIILGCANIWHIERAYQTFDAIS 482

Query: 1264 GTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLINRDA 1443
             +F LTP+IHSYN L+ AFGK K+T EAS VF HL+SLGVKPN  SYSLLVDAHL+N+D 
Sbjct: 483  SSFELTPDIHSYNCLMYAFGKFKQTVEASNVFGHLVSLGVKPNAKSYSLLVDAHLVNKDP 542

Query: 1444 KGALSVIDEMVNAGF 1488
            K ALSVID+MV A F
Sbjct: 543  KSALSVIDDMVTAFF 557


>ref|XP_003518766.1| PREDICTED: pentatricopeptide repeat-containing protein At1g26460,
            mitochondrial-like isoform X1 [Glycine max]
          Length = 615

 Score =  769 bits (1985), Expect = 0.0
 Identities = 380/553 (68%), Positives = 454/553 (82%), Gaps = 5/553 (0%)
 Frame = +1

Query: 4    ENWRNPIPNP----SLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWA 171
            ENWR+PIP P      + +L P+GF  +A S        T D  AL+++F DWM SQ+W 
Sbjct: 66   ENWRSPIPPPPSSAGTSHALSPVGFYNRATS-------DTYDPRALLDLFGDWMASQQWH 118

Query: 172  DMKQLFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYE-ISPN 348
            D+K LFE W++ LD+ GK NKPDVNL+NHYLRANLM GASA ELLDLVA M +++ ++PN
Sbjct: 119  DVKFLFESWVRSLDKTGKPNKPDVNLFNHYLRANLMLGASAAELLDLVAQMAEFDNVAPN 178

Query: 349  TASFNLVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKY 528
            TASFNLVLKAM   +ET AA+KL++RMLQ+G +++PDDESYDLVIGMLF + QID A KY
Sbjct: 179  TASFNLVLKAMCQAKETLAADKLLQRMLQSGNDALPDDESYDLVIGMLFSMGQIDTAFKY 238

Query: 529  LDLTLKSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIAD 708
            +DL LKSG +LSM+VF +CV SCVN GRLDTL +IIERC+  DQNKALCP+W+LCN+I +
Sbjct: 239  IDLILKSGNVLSMKVFMNCVGSCVNKGRLDTLVTIIERCRASDQNKALCPNWDLCNFIVE 298

Query: 709  VALQEDNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDAS 888
            +A +EDN KL+FY LEF+A+WI +GE  RPP+ LSVDEGLV+SAL TAGRTYN  LL AS
Sbjct: 299  IATREDNSKLSFYGLEFMAKWIVKGERQRPPIYLSVDEGLVLSALLTAGRTYNSDLLVAS 358

Query: 889  WAILRRSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEAEDLFSPF 1068
            WA+L RSLR+K+ PNPESYLGKIYA ASLGNLQ+AF TLNE E+AY  S  EAEDLF PF
Sbjct: 359  WAVLDRSLRKKKVPNPESYLGKIYALASLGNLQKAFGTLNEYEAAYGDSGQEAEDLFCPF 418

Query: 1069 TSLYPLVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQT 1248
            TSL+PLV+ACSKKGFETLD+VYFQLENL++A+PPYKSVAALNC+ILGCAN WDLDRAYQT
Sbjct: 419  TSLHPLVVACSKKGFETLDNVYFQLENLNRAEPPYKSVAALNCVILGCANIWDLDRAYQT 478

Query: 1249 FEAISGTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHL 1428
            FE+I  TFGL P+IHSYN L+ AFGKLKKT EA+RVFEHL+SLG+KPN  SYSLLVDAHL
Sbjct: 479  FESIGSTFGLIPDIHSYNGLMYAFGKLKKTHEATRVFEHLVSLGLKPNAKSYSLLVDAHL 538

Query: 1429 INRDAKGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNE 1608
            INRD K AL+VID+M  AG+ PSKE LKKVRRRC+RE D E+D RV+ LV     R+G+E
Sbjct: 539  INRDVKSALAVIDDMRAAGYEPSKEVLKKVRRRCMREMDNESDARVQSLVNSLNYRLGSE 598

Query: 1609 NRRDMLFNLNYST 1647
            NRRD+LFNLNYS+
Sbjct: 599  NRRDILFNLNYSS 611


>ref|XP_004490167.1| PREDICTED: pentatricopeptide repeat-containing protein At1g26460,
            mitochondrial-like isoform X1 [Cicer arietinum]
            gi|502094224|ref|XP_004490168.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g26460,
            mitochondrial-like isoform X2 [Cicer arietinum]
            gi|502094228|ref|XP_004490169.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g26460,
            mitochondrial-like isoform X3 [Cicer arietinum]
          Length = 609

 Score =  763 bits (1970), Expect = 0.0
 Identities = 373/547 (68%), Positives = 445/547 (81%)
 Frame = +1

Query: 4    ENWRNPIPNPSLAQSLVPLGFIQQAPSMRIHALSQTLDAPALMNVFADWMTSQRWADMKQ 183
            ENWRNPIP  S   ++ P G   +A      ++  T D+ AL+++F DWM SQRW D+K 
Sbjct: 65   ENWRNPIPKSSSNNAVTPFGLFTRA------SVKDTYDSHALLDMFGDWMASQRWQDVKD 118

Query: 184  LFEFWIKCLDRNGKLNKPDVNLYNHYLRANLMSGASAGELLDLVAHMEDYEISPNTASFN 363
            LFE W++ LD+NGK NKPDVNL+NHY+RANLM G SA +LLDL+A ME + +SPNTASFN
Sbjct: 119  LFEDWVRSLDKNGKPNKPDVNLFNHYIRANLMIGGSAADLLDLLAQMEHFNVSPNTASFN 178

Query: 364  LVLKAMYLQQETEAAEKLIERMLQTGKESMPDDESYDLVIGMLFLVNQIDAALKYLDLTL 543
            LVLKAM+   ET AAEKL+ERMLQ+G E++PDDESYDLVIGM F  +QIDAA KY+DLTL
Sbjct: 179  LVLKAMHQAGETLAAEKLVERMLQSGNEALPDDESYDLVIGMFFSTDQIDAAFKYIDLTL 238

Query: 544  KSGYMLSMRVFTDCVRSCVNAGRLDTLASIIERCKTMDQNKALCPSWNLCNYIADVALQE 723
            K G +LSM  F +CVRSCV   RLDTL +IIE+C+  D+NK+LCPSWNLCN+IA+VA++E
Sbjct: 239  KHGNVLSMNTFMNCVRSCVKQRRLDTLVAIIEKCRETDKNKSLCPSWNLCNFIAEVAIRE 298

Query: 724  DNGKLAFYALEFLARWIARGETSRPPVLLSVDEGLVVSALGTAGRTYNPTLLDASWAILR 903
            DN KLA+Y LEF+ARW+  GE +RPPVLLSVDEGLVVSA+ TAGRTYN  LL A+W++L 
Sbjct: 299  DNSKLAYYGLEFMARWMVNGERARPPVLLSVDEGLVVSAMLTAGRTYNSELLGAAWSVLG 358

Query: 904  RSLRQKRAPNPESYLGKIYAHASLGNLQRAFSTLNELESAYEISTTEAEDLFSPFTSLYP 1083
            RSLR+K+ PNPESYLGKI A ASLGNLQ+AF TL+E E +Y  S  EA DLF PFTSL+P
Sbjct: 359  RSLRKKKVPNPESYLGKISALASLGNLQKAFGTLHEYEISYGDSNQEANDLFCPFTSLHP 418

Query: 1084 LVMACSKKGFETLDSVYFQLENLSQADPPYKSVAALNCIILGCANTWDLDRAYQTFEAIS 1263
            LV+ACSKKGFETLDSVYFQLE+LS+A+ PYKSVAALNCIILGCAN WDLDRAYQTFE+I 
Sbjct: 419  LVVACSKKGFETLDSVYFQLESLSRAERPYKSVAALNCIILGCANIWDLDRAYQTFESIG 478

Query: 1264 GTFGLTPNIHSYNALICAFGKLKKTFEASRVFEHLMSLGVKPNPISYSLLVDAHLINRDA 1443
              FGLTP+IHSYN L+ AFGKLKKT EAS+VFEHL+SLGVKPN  SYSLLVDAHLINRD 
Sbjct: 479  SAFGLTPDIHSYNGLMYAFGKLKKTHEASKVFEHLVSLGVKPNAKSYSLLVDAHLINRDV 538

Query: 1444 KGALSVIDEMVNAGFNPSKETLKKVRRRCIREFDYENDDRVEDLVKKFRIRMGNENRRDM 1623
            K AL+VID+M+ AGF P K TL+ VRRRC+RE D ++D RV+ L +   IRM ++ RR+M
Sbjct: 539  KSALAVIDDMIAAGFKPLKATLQNVRRRCVREIDNDSDQRVDSLAQSLNIRMASDARRNM 598

Query: 1624 LFNLNYS 1644
            LFNLNYS
Sbjct: 599  LFNLNYS 605

