BLASTX nr result

ID: Akebia22_contig00018631 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00018631
         (2517 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007222034.1| hypothetical protein PRUPE_ppa003040mg [Prun...   724   0.0  
ref|XP_002282419.1| PREDICTED: pentatricopeptide repeat-containi...   711   0.0  
gb|EXC20787.1| hypothetical protein L484_007369 [Morus notabilis]     704   0.0  
ref|XP_004292965.1| PREDICTED: pentatricopeptide repeat-containi...   690   0.0  
ref|XP_006353639.1| PREDICTED: pentatricopeptide repeat-containi...   675   0.0  
ref|XP_002308636.1| hypothetical protein POPTR_0006s26360g [Popu...   669   0.0  
ref|XP_002516618.1| pentatricopeptide repeat-containing protein,...   667   0.0  
ref|XP_007013754.1| Pentatricopeptide repeat superfamily protein...   667   0.0  
ref|XP_007013753.1| Pentatricopeptide repeat superfamily protein...   667   0.0  
ref|XP_004241813.1| PREDICTED: pentatricopeptide repeat-containi...   665   0.0  
ref|XP_004136469.1| PREDICTED: pentatricopeptide repeat-containi...   664   0.0  
ref|XP_007013756.1| Pentatricopeptide repeat (PPR) superfamily p...   656   0.0  
ref|XP_007161257.1| hypothetical protein PHAVU_001G055200g [Phas...   646   0.0  
ref|XP_004498635.1| PREDICTED: pentatricopeptide repeat-containi...   646   0.0  
ref|XP_003588687.1| Pentatricopeptide repeat-containing protein ...   637   e-179
ref|XP_003549241.1| PREDICTED: pentatricopeptide repeat-containi...   634   e-179
ref|XP_006476197.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   619   e-174
ref|XP_006450554.1| hypothetical protein CICLE_v10008018mg [Citr...   603   e-170
ref|XP_006399632.1| hypothetical protein EUTSA_v10013015mg [Eutr...   591   e-166
ref|XP_002871469.1| pentatricopeptide repeat-containing protein ...   582   e-163

>ref|XP_007222034.1| hypothetical protein PRUPE_ppa003040mg [Prunus persica]
            gi|462418970|gb|EMJ23233.1| hypothetical protein
            PRUPE_ppa003040mg [Prunus persica]
          Length = 609

 Score =  724 bits (1868), Expect = 0.0
 Identities = 366/588 (62%), Positives = 447/588 (76%), Gaps = 3/588 (0%)
 Frame = -3

Query: 2392 KRNPNLLLPIQ--RLFSNQSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2219
            K NPNL L     R FSNQSWLSV GNP+IKW                            
Sbjct: 18   KPNPNLNLRALSLRFFSNQSWLSVRGNPIIKWPSPPDIPCSLPHPNPAPNPNPNPNSSGP 77

Query: 2218 XXQR-DCATICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQ 2042
               + D +TI N+L D ++S G +L++ALD+TGI+PGP LLQ++F HFDSSPK L TLF 
Sbjct: 78   NFSQNDFSTIANVLADPSISPGSSLQSALDRTGIEPGPCLLQAVFDHFDSSPKLLHTLFL 137

Query: 2041 WAEKQQPSYSSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRR 1862
            WAEK+ P + S+A +F  MI +LAKSR+F+SAWSL+L+R+  DE P ++S  TF I+IRR
Sbjct: 138  WAEKR-PGFRSSATLFGCMINVLAKSREFESAWSLILNRIGGDEEPGLVSVDTFVIMIRR 196

Query: 1861 YARAGMPLPAIRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELE 1682
            Y+RAGM   AIR+ EFA NLD    + S + LFE+LLDSLCKEGL   AS+Y D KR+L 
Sbjct: 197  YSRAGMSQSAIRTFEFASNLDSFLNSESEMSLFEVLLDSLCKEGLVRVASEYFDMKRKLH 256

Query: 1681 PQWIPSIRVYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVER 1502
            P WIPS+RVYNILLNGWFRSRKLK+ ERLW +MK++NV P+VVTYGTL+EGYCR+ R E 
Sbjct: 257  PDWIPSVRVYNILLNGWFRSRKLKRAERLWAEMKRDNVKPSVVTYGTLIEGYCRMRRAEI 316

Query: 1501 AMELVGEMRRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVK 1322
            A+ELV EMR  GIEPNAIVYN I+DALGE+ +FKEALGM+E  LV ESGPT+STYNSL K
Sbjct: 317  AIELVSEMRSEGIEPNAIVYNAIIDALGEAGKFKEALGMMEHFLVLESGPTISTYNSLAK 376

Query: 1321 GFCKAGDLVGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVP 1142
            GFCKAGDLVGASK+LKMMI +G VPTPTTYNYFFRYFS+FGKIEEGMNLYTKMIESGY P
Sbjct: 377  GFCKAGDLVGASKILKMMISKGCVPTPTTYNYFFRYFSKFGKIEEGMNLYTKMIESGYTP 436

Query: 1141 DRLTYHLLIKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEF 962
            DRLT+HLL+KMLC++ RL LAVQVS+EMR+RG D+DL T TML+HLLC +H+F EA  EF
Sbjct: 437  DRLTFHLLLKMLCDEGRLGLAVQVSKEMRSRGLDMDLATSTMLIHLLCNVHKFKEAFAEF 496

Query: 961  EDMIRRGIAPQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESR 782
            EDMIRRG+ PQYLT++++  EL+K GM E+A+K+   M+S PHST LPNTY RE  + S 
Sbjct: 497  EDMIRRGLVPQYLTFQRMNVELRKQGMTEMAHKMCNMMSSVPHSTNLPNTYVRER-DASH 555

Query: 781  EFRSSIMRKAQAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHIK 638
              R SI++KA+AMSD+LKT  +PR+L+K R   EN V+ A  L++ IK
Sbjct: 556  ARRKSIIQKAEAMSDLLKTCSDPRELVKYRSLPENVVSRANQLVEDIK 603


>ref|XP_002282419.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial [Vitis vinifera]
            gi|296081989|emb|CBI20994.3| unnamed protein product
            [Vitis vinifera]
          Length = 597

 Score =  711 bits (1835), Expect = 0.0
 Identities = 360/529 (68%), Positives = 428/529 (80%)
 Frame = -3

Query: 2206 DCATICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQ 2027
            D +TIC LL D  LSSG  LE AL++TGIKP   LLQ+IF+HFD+SPKPLFTLF+WA KQ
Sbjct: 71   DFSTICALLTDPALSSGAPLEDALNRTGIKPCSGLLQAIFSHFDASPKPLFTLFRWAMKQ 130

Query: 2026 QPSYSSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAG 1847
             P + S+  +FNSMI++LAKSR FDSAW L+LDR++  E P ++S  TFA+LIRRYARAG
Sbjct: 131  -PGFESSMTLFNSMIDVLAKSRAFDSAWLLVLDRIEGGEEPELVSSNTFAVLIRRYARAG 189

Query: 1846 MPLPAIRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIP 1667
            M L AIR+ EFA +LD  R   S   LF+ILLDSLCKEG    AS+Y D++R L+P W+P
Sbjct: 190  MTLSAIRTFEFAFSLDSIRDRDSEWSLFKILLDSLCKEGHVRVASEYFDQQRGLDPSWVP 249

Query: 1666 SIRVYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELV 1487
            SIRVYN+LLNGWFRSRKLK+ E+LW  MK+ENV PTVVTYGTLVEGYCR+ R E+A+ELV
Sbjct: 250  SIRVYNVLLNGWFRSRKLKRAEQLWRTMKRENVKPTVVTYGTLVEGYCRMRRSEKAIELV 309

Query: 1486 GEMRRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKA 1307
            GEMR  GIEPN IVYNPI+D+L E+ RFKEA+GM+ER LVSE+GPT+STYNSLVKGFCKA
Sbjct: 310  GEMRGKGIEPNVIVYNPIIDSLAEAGRFKEAMGMMERCLVSETGPTISTYNSLVKGFCKA 369

Query: 1306 GDLVGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTY 1127
            GDLVGASKVLKMMI RGF PT TTYNYFFRYFSR GK EEGMNLYTKMIESG+ PDRLTY
Sbjct: 370  GDLVGASKVLKMMISRGFDPTLTTYNYFFRYFSRCGKTEEGMNLYTKMIESGHTPDRLTY 429

Query: 1126 HLLIKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIR 947
            HLLIKM+CE++RLDLAVQVS+EMR RG DLDL T TMLVHLLCK+HR +EA  EFEDMIR
Sbjct: 430  HLLIKMMCEEERLDLAVQVSKEMRARGCDLDLATSTMLVHLLCKMHRLEEAFAEFEDMIR 489

Query: 946  RGIAPQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSS 767
            RGI PQYLT+ ++   L+K G+ E+A KL   MAS PHS+KLPNTY  + G+ SR  ++S
Sbjct: 490  RGIVPQYLTFERMNNALRKRGLTEMARKLCDMMASVPHSSKLPNTYSGD-GDASRARKTS 548

Query: 766  IMRKAQAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHIK*SFSKT 620
            I+++A+AMSD+LKT  +PR+L+K R S EN V  A  LI+ IK   +KT
Sbjct: 549  IIQRAEAMSDILKTCNDPRELVKRRSSFENTVLVADQLIEDIKRRANKT 597


>gb|EXC20787.1| hypothetical protein L484_007369 [Morus notabilis]
          Length = 612

 Score =  704 bits (1818), Expect = 0.0
 Identities = 358/588 (60%), Positives = 440/588 (74%), Gaps = 5/588 (0%)
 Frame = -3

Query: 2386 NPNLLLPIQRLFSNQS-----WLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXX 2222
            NPNL     R FS+ S     WLSVPG PLI+W                           
Sbjct: 29   NPNLQFHTLRFFSSSSASGLSWLSVPGKPLIRWPHEPCSVPNPQPDPNPSPNPGAEFSQ- 87

Query: 2221 XXXQRDCATICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQ 2042
                 + A I  +L + N+S G +L TALD+TGI+P P+LLQ++F HFDSSPK L++LF 
Sbjct: 88   ----NEFAAISEVLTNPNISGGFSLHTALDRTGIEPSPSLLQAVFDHFDSSPKLLYSLFL 143

Query: 2041 WAEKQQPSYSSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRR 1862
            WAEKQ P Y S+A +F S+I +LAKSR+FDSAWSL+L R+  +E P ++   TF I+IRR
Sbjct: 144  WAEKQ-PGYRSSASLFASVINVLAKSREFDSAWSLILHRIGKEEEPRLVCEDTFVIMIRR 202

Query: 1861 YARAGMPLPAIRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELE 1682
            YAR GMP  A+R+ EFA N  P    +S + LF ILLD+LCKEG   AAS Y + K++L+
Sbjct: 203  YAREGMPQSAVRTFEFASNSVPICSYISEISLFGILLDALCKEGHVRAASDYFNEKKKLD 262

Query: 1681 PQWIPSIRVYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVER 1502
            P WIPSIR YNILLNGWFRSRKLK+ ERLW +MK++NV  TVVTYGTLVEGYCR+ R E 
Sbjct: 263  PSWIPSIRAYNILLNGWFRSRKLKRAERLWMEMKRDNVRSTVVTYGTLVEGYCRMRRAEI 322

Query: 1501 AMELVGEMRRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVK 1322
            A+ELV EMR  GIEPNAIVYNPI+DALGE+ RFKEALGM+ER LV ESGPT+STYNSLVK
Sbjct: 323  AVELVKEMRTEGIEPNAIVYNPIIDALGEAGRFKEALGMMERFLVLESGPTISTYNSLVK 382

Query: 1321 GFCKAGDLVGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVP 1142
            GFCKAG+L GASK++KMMIGRG +PTPTTYNYFF+YFS+FGKIEEGMNLYTKMI SG+ P
Sbjct: 383  GFCKAGNLAGASKIIKMMIGRGIIPTPTTYNYFFKYFSKFGKIEEGMNLYTKMIGSGHSP 442

Query: 1141 DRLTYHLLIKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEF 962
            DRLTYHLL+KMLCE+ +LDLAVQV +EMR+RG+D+DL T TML+HL C + RF+EA +EF
Sbjct: 443  DRLTYHLLLKMLCEEGKLDLAVQVGKEMRSRGFDMDLATSTMLIHLFCNMRRFEEAYLEF 502

Query: 961  EDMIRRGIAPQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESR 782
             DMIRRGI PQYLTY ++ +ELKK GM E+ +KL   M+S PHSTKLPNTY R+ G+ S 
Sbjct: 503  GDMIRRGIVPQYLTYHRMKDELKKRGMTEMVSKLRDLMSSVPHSTKLPNTYTRD-GDASS 561

Query: 781  EFRSSIMRKAQAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHIK 638
            + R+S+MRKA+A+SD+LKT +  R+L+  R   ENAV+ A  LI+ I+
Sbjct: 562  DRRNSVMRKAEAISDMLKTCKESRELVNYRGPFENAVSLANRLIEDIQ 609


>ref|XP_004292965.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 582

 Score =  690 bits (1781), Expect = 0.0
 Identities = 349/569 (61%), Positives = 428/569 (75%)
 Frame = -3

Query: 2344 QSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRDCATICNLLRDHNL 2165
            QSWLSVPGNPLIKW                                D +TI  LL D ++
Sbjct: 13   QSWLSVPGNPLIKWPSLSPPPSPPPTLPPPNPNPNFDPKFSE---NDFSTITKLLTDPSI 69

Query: 2164 SSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQPSYSSTAPIFNSM 1985
              G +L +ALD+ GI P P+L+Q++F HFDSSPK L TLF WAE +QP +  +  +F S+
Sbjct: 70   FPGASLRSALDRVGIDPSPSLVQAVFDHFDSSPKLLHTLFVWAE-EQPGFRCSVKLFTSV 128

Query: 1984 IELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAGMPLPAIRSLEFAQN 1805
            I +LAK+R+F+SAWS++LDR+  D+   ++S   F I+IRRYARAG P  AIR+ EFA N
Sbjct: 129  INVLAKAREFESAWSMILDRIGGDKEAGLVSVDAFVIMIRRYARAGQPQSAIRAFEFATN 188

Query: 1804 LDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSIRVYNILLNGWFR 1625
            LD    + S + LFEILLDSLCKEGL   A++Y D KR+    WIPS+RVYNILLNGWFR
Sbjct: 189  LDSFLSSESEMSLFEILLDSLCKEGLVRVATEYFDGKRKSHRDWIPSVRVYNILLNGWFR 248

Query: 1624 SRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGEMRRGGIEPNAIV 1445
            SRKLK+ ERLW +MK + V P+VVTYGTLVEGYCR+ R E AMELVGEMRR G+EPNAIV
Sbjct: 249  SRKLKKAERLWVEMKSDGVKPSVVTYGTLVEGYCRMRRPEIAMELVGEMRREGVEPNAIV 308

Query: 1444 YNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGDLVGASKVLKMMI 1265
            +NPI+DALGE+ RFKEA GM+ER  V ESGPT+STYNSLVKG+CKAG+LV AS++LKMMI
Sbjct: 309  FNPIIDALGEAGRFKEAWGMMERFSVLESGPTISTYNSLVKGYCKAGNLVEASRILKMMI 368

Query: 1264 GRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHLLIKMLCEQDRLD 1085
             RG VPTP TYNYFFRYFS+ GKIEEGMNLYTKMIESGY PDRLT+HLL+KMLCE+ RLD
Sbjct: 369  SRGIVPTPATYNYFFRYFSKSGKIEEGMNLYTKMIESGYTPDRLTFHLLLKMLCEEGRLD 428

Query: 1084 LAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRGIAPQYLTYRKLI 905
            LAVQVS+EMR RG D+DL T TML+HLLCK+++F EA  EFEDMIR+G+ PQYLT++ + 
Sbjct: 429  LAVQVSKEMRTRGCDMDLATSTMLIHLLCKMNKFKEALSEFEDMIRKGLVPQYLTFQNMN 488

Query: 904  EELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIMRKAQAMSDVLKT 725
            +EL+K GM E+A KL A M+S PHSTKLPNTY ++  +ES E R SI++KA+AMS VLKT
Sbjct: 489  DELRKQGMTEMARKLCALMSSVPHSTKLPNTYVKDR-DESHERRKSIIKKAEAMSKVLKT 547

Query: 724  TRNPRQLIKLRHSSENAVASATPLIDHIK 638
              +PR+L+K R S E+  + A  LI+ IK
Sbjct: 548  CSDPRELVKHRSSPESVESRANRLIEDIK 576


>ref|XP_006353639.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Solanum tuberosum]
          Length = 604

 Score =  675 bits (1741), Expect = 0.0
 Identities = 343/577 (59%), Positives = 427/577 (74%), Gaps = 1/577 (0%)
 Frame = -3

Query: 2368 PIQRLFSNQSWL-SVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRDCATI 2192
            P  +LFS QSWL S  G PL+K                                 D  T+
Sbjct: 28   PTSKLFSTQSWLKSTRGKPLMK-LPKLNHPPKQPIPLFSPPPSTQQPETPNYCPTDFTTL 86

Query: 2191 CNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQPSYS 2012
            C +LRD  + +GP LE ALD+ G++    +   +F HFDSSPKPLFTL+ WAEK++  + 
Sbjct: 87   CEILRDPIIPAGPVLENALDRAGVEVNECMFLQLFNHFDSSPKPLFTLYLWAEKKE-WFK 145

Query: 2011 STAPIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAGMPLPA 1832
             + P+FN+++  L K R+FDSAW+L+LDR+ + E P++    TFAI+IRRYARAGM LPA
Sbjct: 146  FSLPVFNAVVNALGKEREFDSAWNLILDRLNSTERPNL---DTFAIMIRRYARAGMLLPA 202

Query: 1831 IRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSIRVY 1652
            +R+ EF+ NL+ +   + + +LFEILLDSLCKEGL   AS Y  R++  +  W PSIRVY
Sbjct: 203  VRTYEFSSNLEIHALGLED-NLFEILLDSLCKEGLIREASDYFYRRKGQDSNWSPSIRVY 261

Query: 1651 NILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGEMRR 1472
            NILLNGWFRSRKLK+ ERLW +MKKE + P+VVTYGTLVEG CR+ RVE A+EL+ EM+ 
Sbjct: 262  NILLNGWFRSRKLKKAERLWTEMKKEGIKPSVVTYGTLVEGLCRMRRVEMAIELIDEMKE 321

Query: 1471 GGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGDLVG 1292
             GI PNA+VYNP++DALGE+ RFKEA GM+ERLLV ESGPT+STYNSLVKGFCKAGD+VG
Sbjct: 322  EGIPPNAVVYNPVIDALGEAGRFKEASGMMERLLVLESGPTLSTYNSLVKGFCKAGDIVG 381

Query: 1291 ASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHLLIK 1112
            ASK+LKMMI RG +PTPTTYNYFFRYFS+FGKIEEG+NLYTK+IESGYV DRLTYHLL+K
Sbjct: 382  ASKILKMMINRGLMPTPTTYNYFFRYFSKFGKIEEGLNLYTKLIESGYVADRLTYHLLVK 441

Query: 1111 MLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRGIAP 932
            MLCEQDRL+LA+Q+ QEMR +G+DLDL T TML+HL CK+H+FDEA   F DMIRRG+ P
Sbjct: 442  MLCEQDRLNLALQIIQEMRTKGFDLDLATSTMLIHLFCKMHQFDEAVEWFHDMIRRGLVP 501

Query: 931  QYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIMRKA 752
            QYLTY++L  +L K GM + A KL   M STP+S KLPNTY R+ G+ S   R SI+ KA
Sbjct: 502  QYLTYQRLCNDLAKQGMNDKAEKLRNTMVSTPYSEKLPNTYIRD-GDTSHSKRKSIIAKA 560

Query: 751  QAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHI 641
            + MS++L+T R+PRQLIK R   ENAV SA  LI++I
Sbjct: 561  EEMSNILQTCRSPRQLIKRRTPPENAVLSANQLIENI 597


>ref|XP_002308636.1| hypothetical protein POPTR_0006s26360g [Populus trichocarpa]
            gi|222854612|gb|EEE92159.1| hypothetical protein
            POPTR_0006s26360g [Populus trichocarpa]
          Length = 607

 Score =  669 bits (1727), Expect = 0.0
 Identities = 338/571 (59%), Positives = 415/571 (72%)
 Frame = -3

Query: 2350 SNQSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRDCATICNLLRDH 2171
            S +SWL+V GNPLIKW                                D  T+CN+L+D 
Sbjct: 35   SAESWLAVQGNPLIKWPHNPNLAPSPSADQQNSSPTSNSNPNYHQ--NDFFTLCNILKDP 92

Query: 2170 NLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQPSYSSTAPIFN 1991
             +  GP+L TALD+TGI+P   L+QS+F HFDSSPK L ++F WAEK+ P + S+A +FN
Sbjct: 93   KIQLGPSLRTALDRTGIEPELGLIQSVFDHFDSSPKLLHSVFLWAEKK-PGFQSSAALFN 151

Query: 1990 SMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAGMPLPAIRSLEFA 1811
            SM+  L K+R+F SAW LLLDR+  +E   ++S  TFAILIRRY RAGM   AIR+ E+A
Sbjct: 152  SMVNFLGKAREFGSAWCLLLDRIGGNEGGDLVSSDTFAILIRRYTRAGMSEAAIRTFEYA 211

Query: 1810 QNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSIRVYNILLNGW 1631
             +LD    + +   LFEILLDSLCKEG    A+ Y DRK E +P W+PS+R+YNILLNGW
Sbjct: 212  SSLDLIHNSEAGTSLFEILLDSLCKEGHVRVATDYFDRKVEKDPCWVPSVRIYNILLNGW 271

Query: 1630 FRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGEMRRGGIEPNA 1451
            FRSRKLK  ERLW +MKK+NV P+VVTYGTLVEGY R+ RVERA+ELV EM+R GI+ NA
Sbjct: 272  FRSRKLKHAERLWLEMKKKNVKPSVVTYGTLVEGYSRMRRVERAIELVDEMKREGIKSNA 331

Query: 1450 IVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGDLVGASKVLKM 1271
            IVYNPI+DAL E+ RFKE LGM+E   + E GPT+STYNSLVKG+CKAGDLVGASK+LKM
Sbjct: 332  IVYNPIIDALAEAGRFKEVLGMMEHFFLCEEGPTISTYNSLVKGYCKAGDLVGASKILKM 391

Query: 1270 MIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHLLIKMLCEQDR 1091
            MI R   PTPTTYNYFFR+FS+  KIEEGMNLYTKMIESGY PDRLTYHLL+KMLCE++R
Sbjct: 392  MISREVFPTPTTYNYFFRHFSKCRKIEEGMNLYTKMIESGYTPDRLTYHLLLKMLCEEER 451

Query: 1090 LDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRGIAPQYLTYRK 911
            LDLAVQ+S+EMR RG D+DL T TM  HLLCK+ RF+EA  EFEDM+RRGI PQYLT+ +
Sbjct: 452  LDLAVQISKEMRARGCDMDLATSTMFTHLLCKMQRFEEAFAEFEDMLRRGIVPQYLTFHR 511

Query: 910  LIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIMRKAQAMSDVL 731
            L +E +K G+ ELA +L   M+S  HS  LPNTY  +        R SI++KA  MS++L
Sbjct: 512  LNDEFRKQGLTELARRLCKLMSSVSHSKNLPNTYNVDRDASRHARRKSILQKAGVMSEIL 571

Query: 730  KTTRNPRQLIKLRHSSENAVASATPLIDHIK 638
            KT  +PR+L+K R SS+N  +SA  LI+ IK
Sbjct: 572  KTCNDPRELVKHRSSSQNPESSANQLIEDIK 602


>ref|XP_002516618.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223544438|gb|EEF45959.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 577

 Score =  667 bits (1721), Expect = 0.0
 Identities = 339/525 (64%), Positives = 413/525 (78%), Gaps = 2/525 (0%)
 Frame = -3

Query: 2206 DCATICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQ 2027
            D +T+CNLL D NL     LETALDQTGIKP  +LL ++F HF+SSPK L +LF WA+KQ
Sbjct: 60   DFSTLCNLLSDPNLKP---LETALDQTGIKPETSLLNAVFDHFNSSPKLLHSLFVWADKQ 116

Query: 2026 QPSYSSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAG 1847
             P + S+  +FNS+I  L K ++FDSAW L+LDR        ++S  TFAILIRRY RAG
Sbjct: 117  -PEFESSTTLFNSVINALGKMKEFDSAWCLVLDRT------GLVSSDTFAILIRRYTRAG 169

Query: 1846 MPLPAIRSLEFAQNLDPNRFTVS-NLD-LFEILLDSLCKEGLASAASKYCDRKRELEPQW 1673
            MP  AIR+ E+A +LD   F    N D L EILLDSLCKEG    A +Y D +++L+  W
Sbjct: 170  MPQSAIRTFEYAISLD---FICDYNCDALLEILLDSLCKEGHVRVAKEYFDSRKQLDSCW 226

Query: 1672 IPSIRVYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAME 1493
            IP +R+YNI+LNGWFRSRKLK  ERLW +MKK NVSP+VVTYGTLVEGYCR+ RVERA+E
Sbjct: 227  IPHVRIYNIMLNGWFRSRKLKHAERLWLEMKKNNVSPSVVTYGTLVEGYCRMRRVERAIE 286

Query: 1492 LVGEMRRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFC 1313
            LV  MR+ GIEPNA+VYNPI+DAL E  RFKE  GM+E  L SESGPT+STYNSLVKG+C
Sbjct: 287  LVDVMRKEGIEPNALVYNPIIDALAEEGRFKEVSGMMEYFLQSESGPTISTYNSLVKGYC 346

Query: 1312 KAGDLVGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRL 1133
            KA D VGASKVLKMMI RGFVPTPTTYNYFFR+FS+FG IEEGMNLYTKMIESGY PDRL
Sbjct: 347  KAKDPVGASKVLKMMISRGFVPTPTTYNYFFRHFSKFGMIEEGMNLYTKMIESGYTPDRL 406

Query: 1132 TYHLLIKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDM 953
            T+HLL+KMLCE++RLDLAVQ+S+EMR+RG D+DL T TML+HL C++HRF+EA +EFEDM
Sbjct: 407  TFHLLLKMLCEEERLDLAVQISKEMRSRGCDMDLATSTMLIHLFCRMHRFEEAFMEFEDM 466

Query: 952  IRRGIAPQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFR 773
            I++GI PQYLT+++L +EL+K GMVE A KLS  M+S PHST LPNTY  E     R  R
Sbjct: 467  IQKGIVPQYLTFQRLNDELRKRGMVERARKLSDMMSSVPHSTNLPNTYSVEGDALRRARR 526

Query: 772  SSIMRKAQAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHIK 638
            SSI++KA+AMS +LKT  +PR+L+KL+ SS+N + SA  LI++I+
Sbjct: 527  SSILQKAEAMSKILKTCNDPRELVKLKSSSQNPITSAIQLIENIR 571


>ref|XP_007013754.1| Pentatricopeptide repeat superfamily protein isoform 2, partial
            [Theobroma cacao] gi|508784117|gb|EOY31373.1|
            Pentatricopeptide repeat superfamily protein isoform 2,
            partial [Theobroma cacao]
          Length = 584

 Score =  667 bits (1720), Expect = 0.0
 Identities = 349/579 (60%), Positives = 433/579 (74%), Gaps = 1/579 (0%)
 Frame = -3

Query: 2353 FSNQSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRDCATICNLLRD 2174
            FS+QSWLS   NPLIKW                                + + I NLL++
Sbjct: 14   FSDQSWLSKKRNPLIKWPPPSSSPCNQPHPIPNRTFSQS----------NFSIISNLLKN 63

Query: 2173 HNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQPSYSSTAPIF 1994
              ++SG +LE+ALDQT I P P LLQ+IF  FDSSPK L  LF WAEK+ P + S+A +F
Sbjct: 64   STITSGSSLESALDQTEIDPDPGLLQAIFECFDSSPKLLHHLFLWAEKK-PGFKSSATLF 122

Query: 1993 NSMIELLAKSRDFDSAWSLLLDRMKTD-EAPSMISGQTFAILIRRYARAGMPLPAIRSLE 1817
            +SM+ +L K+R F+ AWSL+LDR+    E  +++S  TF ILIRRYARAGMP PAIR+ E
Sbjct: 123  DSMVNVLGKARGFEDAWSLVLDRIGDGMEGSTLVSVNTFVILIRRYARAGMPQPAIRTFE 182

Query: 1816 FAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSIRVYNILLN 1637
            FA++L+    +    +LFEI+LDSLCKEG     S+Y  RKRE +  W+PSI+VYNILLN
Sbjct: 183  FAKSLEQICNSDEETNLFEIMLDSLCKEGHVRVVSEYLTRKRETDLGWVPSIKVYNILLN 242

Query: 1636 GWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGEMRRGGIEP 1457
            GWFRSRKLK  ERLW  MKKE V P+VVTYGTLVEGYC + RVERA++LV EM+  GIEP
Sbjct: 243  GWFRSRKLKHAERLWLDMKKEGVLPSVVTYGTLVEGYCTMRRVERAIQLVDEMKGVGIEP 302

Query: 1456 NAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGDLVGASKVL 1277
            NA VYNPI+DALGE+ R KEALGM+ER+ + ESGP +S Y+SLVKG+CKA DLVGASK+L
Sbjct: 303  NAKVYNPIIDALGEAGRLKEALGMMERVFLCESGPNISMYSSLVKGYCKARDLVGASKIL 362

Query: 1276 KMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHLLIKMLCEQ 1097
            KMMI RGF+PTPTTYNYFFRYFS+F KIEE MNLYTKMIESG+ PDRLTYHLL+KML E+
Sbjct: 363  KMMISRGFIPTPTTYNYFFRYFSQFRKIEEAMNLYTKMIESGHTPDRLTYHLLLKMLFEE 422

Query: 1096 DRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRGIAPQYLTY 917
            +RLDLAVQ+S+EMR RGYD DL T TML+HLLCK+HRF++A  EFEDMIRRG+APQYLT+
Sbjct: 423  ERLDLAVQISKEMRARGYDRDLATSTMLIHLLCKMHRFEDAFGEFEDMIRRGMAPQYLTF 482

Query: 916  RKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIMRKAQAMSD 737
            +++ +ELKK GM ++A+KL   M+S   S KLPNTY  +  + SR  R+SIMRKA+AMSD
Sbjct: 483  QRMNDELKKRGMTDMASKLCDMMSSVRSSKKLPNTYGGDE-DSSRARRTSIMRKAEAMSD 541

Query: 736  VLKTTRNPRQLIKLRHSSENAVASATPLIDHIK*SFSKT 620
            +LKT ++PR+ +K R  SENAV+SA  LI+ IK   ++T
Sbjct: 542  MLKTCKDPREFVKHRTLSENAVSSAGRLIEIIKEGATET 580


>ref|XP_007013753.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao] gi|590579326|ref|XP_007013755.1| Pentatricopeptide
            repeat superfamily protein isoform 1 [Theobroma cacao]
            gi|590579333|ref|XP_007013757.1| Pentatricopeptide repeat
            superfamily protein isoform 1 [Theobroma cacao]
            gi|508784116|gb|EOY31372.1| Pentatricopeptide repeat
            superfamily protein isoform 1 [Theobroma cacao]
            gi|508784118|gb|EOY31374.1| Pentatricopeptide repeat
            superfamily protein isoform 1 [Theobroma cacao]
            gi|508784120|gb|EOY31376.1| Pentatricopeptide repeat
            superfamily protein isoform 1 [Theobroma cacao]
          Length = 595

 Score =  667 bits (1720), Expect = 0.0
 Identities = 349/579 (60%), Positives = 433/579 (74%), Gaps = 1/579 (0%)
 Frame = -3

Query: 2353 FSNQSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRDCATICNLLRD 2174
            FS+QSWLS   NPLIKW                                + + I NLL++
Sbjct: 25   FSDQSWLSKKRNPLIKWPPPSSSPCNQPHPIPNRTFSQS----------NFSIISNLLKN 74

Query: 2173 HNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQPSYSSTAPIF 1994
              ++SG +LE+ALDQT I P P LLQ+IF  FDSSPK L  LF WAEK+ P + S+A +F
Sbjct: 75   STITSGSSLESALDQTEIDPDPGLLQAIFECFDSSPKLLHHLFLWAEKK-PGFKSSATLF 133

Query: 1993 NSMIELLAKSRDFDSAWSLLLDRMKTD-EAPSMISGQTFAILIRRYARAGMPLPAIRSLE 1817
            +SM+ +L K+R F+ AWSL+LDR+    E  +++S  TF ILIRRYARAGMP PAIR+ E
Sbjct: 134  DSMVNVLGKARGFEDAWSLVLDRIGDGMEGSTLVSVNTFVILIRRYARAGMPQPAIRTFE 193

Query: 1816 FAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSIRVYNILLN 1637
            FA++L+    +    +LFEI+LDSLCKEG     S+Y  RKRE +  W+PSI+VYNILLN
Sbjct: 194  FAKSLEQICNSDEETNLFEIMLDSLCKEGHVRVVSEYLTRKRETDLGWVPSIKVYNILLN 253

Query: 1636 GWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGEMRRGGIEP 1457
            GWFRSRKLK  ERLW  MKKE V P+VVTYGTLVEGYC + RVERA++LV EM+  GIEP
Sbjct: 254  GWFRSRKLKHAERLWLDMKKEGVLPSVVTYGTLVEGYCTMRRVERAIQLVDEMKGVGIEP 313

Query: 1456 NAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGDLVGASKVL 1277
            NA VYNPI+DALGE+ R KEALGM+ER+ + ESGP +S Y+SLVKG+CKA DLVGASK+L
Sbjct: 314  NAKVYNPIIDALGEAGRLKEALGMMERVFLCESGPNISMYSSLVKGYCKARDLVGASKIL 373

Query: 1276 KMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHLLIKMLCEQ 1097
            KMMI RGF+PTPTTYNYFFRYFS+F KIEE MNLYTKMIESG+ PDRLTYHLL+KML E+
Sbjct: 374  KMMISRGFIPTPTTYNYFFRYFSQFRKIEEAMNLYTKMIESGHTPDRLTYHLLLKMLFEE 433

Query: 1096 DRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRGIAPQYLTY 917
            +RLDLAVQ+S+EMR RGYD DL T TML+HLLCK+HRF++A  EFEDMIRRG+APQYLT+
Sbjct: 434  ERLDLAVQISKEMRARGYDRDLATSTMLIHLLCKMHRFEDAFGEFEDMIRRGMAPQYLTF 493

Query: 916  RKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIMRKAQAMSD 737
            +++ +ELKK GM ++A+KL   M+S   S KLPNTY  +  + SR  R+SIMRKA+AMSD
Sbjct: 494  QRMNDELKKRGMTDMASKLCDMMSSVRSSKKLPNTYGGDE-DSSRARRTSIMRKAEAMSD 552

Query: 736  VLKTTRNPRQLIKLRHSSENAVASATPLIDHIK*SFSKT 620
            +LKT ++PR+ +K R  SENAV+SA  LI+ IK   ++T
Sbjct: 553  MLKTCKDPREFVKHRTLSENAVSSAGRLIEIIKEGATET 591


>ref|XP_004241813.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Solanum lycopersicum]
          Length = 602

 Score =  665 bits (1717), Expect = 0.0
 Identities = 341/577 (59%), Positives = 424/577 (73%), Gaps = 1/577 (0%)
 Frame = -3

Query: 2368 PIQRLFSNQSWL-SVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRDCATI 2192
            P  +LFS QSWL S  G PL+K                                 D  T+
Sbjct: 28   PTFKLFSTQSWLKSTRGKPLMK-LPKLNYPPKQPTPLFSPPPSTHQPETPNYCPTDFTTL 86

Query: 2191 CNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQPSYS 2012
              +LRD  +  GPALE ALD+ GI+    +   +F HFDSSPKPLFTL+ WAEK++  + 
Sbjct: 87   SEILRDPTIPPGPALENALDRAGIEVNECMFLQLFNHFDSSPKPLFTLYLWAEKKE-WFK 145

Query: 2011 STAPIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAGMPLPA 1832
             + P+FN+++  L K R+FDSAW+L+LDR+ + E P++    TFAI+IRRY+RAGM LPA
Sbjct: 146  FSLPVFNAVVNALGKEREFDSAWNLILDRLNSTERPNL---GTFAIMIRRYSRAGMLLPA 202

Query: 1831 IRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSIRVY 1652
            IR+ EF+ NL+ +   + + +LFEILLDSLCKEG    AS Y  R++  +  W PSIRVY
Sbjct: 203  IRTYEFSTNLEIHGLGLED-NLFEILLDSLCKEGHIREASDYFYRRKGKDLNWSPSIRVY 261

Query: 1651 NILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGEMRR 1472
            NILLNGWFRSRKLK+ ERLW +MKKE + P+VVTYGTLVEG CR+ RVE A+EL+ EM+ 
Sbjct: 262  NILLNGWFRSRKLKKAERLWTEMKKEGIKPSVVTYGTLVEGLCRMRRVEMAIELIDEMKE 321

Query: 1471 GGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGDLVG 1292
             GI PN +VYNP++DALGE+ RFKEA GM+ERLLV ESGPT+STYNSLVKGFCKAGD+ G
Sbjct: 322  EGIHPNVVVYNPVIDALGEAGRFKEASGMMERLLVLESGPTLSTYNSLVKGFCKAGDIAG 381

Query: 1291 ASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHLLIK 1112
            ASK+LKMMI RGF+PTPTTYNYFFRYFS+FGKIEEG+NLYTK+IESGYV DRLTYHLL+K
Sbjct: 382  ASKILKMMIDRGFMPTPTTYNYFFRYFSKFGKIEEGLNLYTKLIESGYVADRLTYHLLVK 441

Query: 1111 MLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRGIAP 932
            MLCEQDRLDLA+Q+ QEMR +G+DLDL T TML+HL CK+H+FDEA   F DMIRRG+ P
Sbjct: 442  MLCEQDRLDLALQIIQEMRTKGFDLDLATSTMLIHLFCKMHQFDEAVEWFHDMIRRGVVP 501

Query: 931  QYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIMRKA 752
            QYLTY++L  +L K GM + A KL   M STP++ KLPNTY R+ G+ S   R SI+ KA
Sbjct: 502  QYLTYQRLCNDLAKQGMNDNAEKLRNMMVSTPYAEKLPNTYIRD-GDTSHSRRKSIIAKA 560

Query: 751  QAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHI 641
            + MS++++T R+PRQLIK R   ENAV SA  LI++I
Sbjct: 561  EEMSNIIQTCRSPRQLIKRRTPPENAVLSANQLIENI 597


>ref|XP_004136469.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Cucumis sativus]
            gi|449503560|ref|XP_004162063.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Cucumis sativus]
          Length = 615

 Score =  664 bits (1714), Expect = 0.0
 Identities = 342/575 (59%), Positives = 422/575 (73%), Gaps = 4/575 (0%)
 Frame = -3

Query: 2350 SNQSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ----RDCATICNL 2183
            S  SWLS PG PL+KW                                    D +TI ++
Sbjct: 36   SPHSWLSTPGKPLVKWPSLPDQPANPLPSNSAVISNPNSAIDVKFEASYSPNDLSTISSI 95

Query: 2182 LRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQPSYSSTA 2003
            L D ++  G ALE ALD+TGI P  +LL+++F HFDSSPK L +LF WA K+   +  +A
Sbjct: 96   LSDRSVRPGAALEDALDRTGIVPSSSLLEAVFDHFDSSPKFLHSLFLWAAKKS-GFRPSA 154

Query: 2002 PIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAGMPLPAIRS 1823
             +FN +I +LAKSR+FDSAWSL+  R++  E   ++S + F ILIRRYARAGM  PAIR+
Sbjct: 155  ALFNRLINVLAKSREFDSAWSLITSRLRGGEESFLVSVEVFVILIRRYARAGMVQPAIRT 214

Query: 1822 LEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSIRVYNIL 1643
             EFA NL+    T S   LFEILLDSLCKEG    AS+Y +RKRE+   + PSIR YNIL
Sbjct: 215  YEFACNLETISGTGSE-GLFEILLDSLCKEGHVRVASEYFNRKREMGSSFEPSIRAYNIL 273

Query: 1642 LNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGEMRRGGI 1463
            +NGWFRSRKLK  +RLW +MKK  +SPTVVTYGTL+EGYCR+  VE A+ELV EMRR GI
Sbjct: 274  INGWFRSRKLKHAQRLWFEMKKNKISPTVVTYGTLIEGYCRMRSVEIAIELVDEMRREGI 333

Query: 1462 EPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGDLVGASK 1283
            EPNAIVYNPIVDALGE+ RFKEALGM+ER +V E GPT+STYNSLVKG+CKAGDL GASK
Sbjct: 334  EPNAIVYNPIVDALGEAGRFKEALGMMERFMVLEQGPTISTYNSLVKGYCKAGDLSGASK 393

Query: 1282 VLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHLLIKMLC 1103
            +LKMMIGRGF PTPTTYNYFFR+FS++GKIEE M+LY KMIESGY PD+LTYHLL+KMLC
Sbjct: 394  ILKMMIGRGFTPTPTTYNYFFRFFSKYGKIEESMSLYNKMIESGYAPDKLTYHLLLKMLC 453

Query: 1102 EQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRGIAPQYL 923
            E++RL+LAVQV  EM+ RG+D+DL T TML+HLLCK+H+F+EA  EFE MI RGI PQYL
Sbjct: 454  EEERLNLAVQVCNEMKARGFDMDLATSTMLMHLLCKMHKFEEAFAEFEHMIHRGIVPQYL 513

Query: 922  TYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIMRKAQAM 743
            T+ +L +E  K G+ ++A+KL   M+S PHS KLP+TY  +  +  R  R+SIMRKA+AM
Sbjct: 514  TFCRLHDEFMKRGLTKMASKLQEMMSSVPHSEKLPDTY-NQTPDSIRARRTSIMRKAEAM 572

Query: 742  SDVLKTTRNPRQLIKLRHSSENAVASATPLIDHIK 638
            S++LK  ++PR+L+K R  SE+AV SA  LID IK
Sbjct: 573  SEMLKVCKDPRELVKRRSPSEDAVFSANKLIDDIK 607


>ref|XP_007013756.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform
            4, partial [Theobroma cacao]
            gi|590579336|ref|XP_007013758.1| Pentatricopeptide repeat
            (PPR) superfamily protein, putative isoform 4, partial
            [Theobroma cacao] gi|508784119|gb|EOY31375.1|
            Pentatricopeptide repeat (PPR) superfamily protein,
            putative isoform 4, partial [Theobroma cacao]
            gi|508784121|gb|EOY31377.1| Pentatricopeptide repeat
            (PPR) superfamily protein, putative isoform 4, partial
            [Theobroma cacao]
          Length = 560

 Score =  656 bits (1693), Expect = 0.0
 Identities = 336/526 (63%), Positives = 417/526 (79%), Gaps = 1/526 (0%)
 Frame = -3

Query: 2194 ICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQPSY 2015
            I NLL++  ++SG +LE+ALDQT I P P LLQ+IF  FDSSPK L  LF WAEK+ P +
Sbjct: 33   ISNLLKNSTITSGSSLESALDQTEIDPDPGLLQAIFECFDSSPKLLHHLFLWAEKK-PGF 91

Query: 2014 SSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTD-EAPSMISGQTFAILIRRYARAGMPL 1838
             S+A +F+SM+ +L K+R F+ AWSL+LDR+    E  +++S  TF ILIRRYARAGMP 
Sbjct: 92   KSSATLFDSMVNVLGKARGFEDAWSLVLDRIGDGMEGSTLVSVNTFVILIRRYARAGMPQ 151

Query: 1837 PAIRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSIR 1658
            PAIR+ EFA++L+    +    +LFEI+LDSLCKEG     S+Y  RKRE +  W+PSI+
Sbjct: 152  PAIRTFEFAKSLEQICNSDEETNLFEIMLDSLCKEGHVRVVSEYLTRKRETDLGWVPSIK 211

Query: 1657 VYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGEM 1478
            VYNILLNGWFRSRKLK  ERLW  MKKE V P+VVTYGTLVEGYC + RVERA++LV EM
Sbjct: 212  VYNILLNGWFRSRKLKHAERLWLDMKKEGVLPSVVTYGTLVEGYCTMRRVERAIQLVDEM 271

Query: 1477 RRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGDL 1298
            +  GIEPNA VYNPI+DALGE+ R KEALGM+ER+ + ESGP +S Y+SLVKG+CKA DL
Sbjct: 272  KGVGIEPNAKVYNPIIDALGEAGRLKEALGMMERVFLCESGPNISMYSSLVKGYCKARDL 331

Query: 1297 VGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHLL 1118
            VGASK+LKMMI RGF+PTPTTYNYFFRYFS+F KIEE MNLYTKMIESG+ PDRLTYHLL
Sbjct: 332  VGASKILKMMISRGFIPTPTTYNYFFRYFSQFRKIEEAMNLYTKMIESGHTPDRLTYHLL 391

Query: 1117 IKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRGI 938
            +KML E++RLDLAVQ+S+EMR RGYD DL T TML+HLLCK+HRF++A  EFEDMIRRG+
Sbjct: 392  LKMLFEEERLDLAVQISKEMRARGYDRDLATSTMLIHLLCKMHRFEDAFGEFEDMIRRGM 451

Query: 937  APQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIMR 758
            APQYLT++++ +ELKK GM ++A+KL   M+S   S KLPNTY  +  + SR  R+SIMR
Sbjct: 452  APQYLTFQRMNDELKKRGMTDMASKLCDMMSSVRSSKKLPNTYGGDE-DSSRARRTSIMR 510

Query: 757  KAQAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHIK*SFSKT 620
            KA+AMSD+LKT ++PR+ +K R  SENAV+SA  LI+ IK   ++T
Sbjct: 511  KAEAMSDMLKTCKDPREFVKHRTLSENAVSSAGRLIEIIKEGATET 556


>ref|XP_007161257.1| hypothetical protein PHAVU_001G055200g [Phaseolus vulgaris]
            gi|561034721|gb|ESW33251.1| hypothetical protein
            PHAVU_001G055200g [Phaseolus vulgaris]
          Length = 606

 Score =  646 bits (1666), Expect = 0.0
 Identities = 334/586 (56%), Positives = 424/586 (72%), Gaps = 3/586 (0%)
 Frame = -3

Query: 2386 NPNLLLPIQRLFSNQSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQR 2207
            NP++     RLFS+ SWLS PGNP+I+W                                
Sbjct: 28   NPSIRPSRHRLFSS-SWLSEPGNPIIQWPSLPTPNPPPHPNPKPTPDPTSPNLNAF---- 82

Query: 2206 DCATICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQ 2027
              + I NL  D ++S GP L   LD++ I+P P LL ++F  F SSPK L +LF WA+ +
Sbjct: 83   --SLISNLFTDPSVSPGPVLHAKLDRSAIEPDPALLLALFDRFGSSPKLLHSLFLWAQTR 140

Query: 2026 QPSYSSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTD---EAPSMISGQTFAILIRRYA 1856
             P +     +F++++  LAK+++FD+AW L+LD +  D   E  S++S  TFAI+IRRYA
Sbjct: 141  -PGFRPGPKLFDAVVNALAKAKEFDAAWKLVLDNVDGDGEEENESLVSVGTFAIMIRRYA 199

Query: 1855 RAGMPLPAIRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQ 1676
            RAGM   AIR+ EFA+N      + S + LFEIL+DSLCKEG    AS+Y   ++EL+  
Sbjct: 200  RAGMSKLAIRTYEFARNNKSIVDSGSEMSLFEILMDSLCKEGSVREASEYFLWRKELDLS 259

Query: 1675 WIPSIRVYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAM 1496
            W+PSIRVYNI+LNGWFRSRKLKQ ERLWE+MKKENV P+VVTYGTLVEGYCR+ RVE+A+
Sbjct: 260  WVPSIRVYNIMLNGWFRSRKLKQGERLWEEMKKENVRPSVVTYGTLVEGYCRMRRVEKAL 319

Query: 1495 ELVGEMRRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGF 1316
            E+VG+M + GI PN IVYNPI+DAL E+ RFKEALGM+ER  + E GPT STYNSL+KG+
Sbjct: 320  EMVGDMTKEGIAPNVIVYNPIIDALAEAGRFKEALGMLERFHILEIGPTDSTYNSLIKGY 379

Query: 1315 CKAGDLVGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDR 1136
            CKA DL GASK+LKMMI RGF+P+PTTYNYFFRYFSR GKIEEGMNLY KMIESGY PDR
Sbjct: 380  CKAADLAGASKILKMMISRGFIPSPTTYNYFFRYFSRCGKIEEGMNLYRKMIESGYTPDR 439

Query: 1135 LTYHLLIKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFED 956
            LTYHLL+KMLCE+ +LDLAVQVS+EMR+ GYD+DL T TML+HLLCK+HR +EA  EFED
Sbjct: 440  LTYHLLVKMLCEEGKLDLAVQVSKEMRHNGYDMDLATSTMLIHLLCKMHRLEEAFAEFED 499

Query: 955  MIRRGIAPQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREF 776
            MIRRGI PQYLT++ +  ELKK GM E+A KL   M+S P+S  LPNTY  +   ++   
Sbjct: 500  MIRRGIVPQYLTFQGMKAELKKQGMTEMAQKLCKLMSSVPYSDNLPNTYGGDR-EDALTR 558

Query: 775  RSSIMRKAQAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHIK 638
            R SI+RKA+A SD+LK  ++P +L + ++SSENAV+SA  +I+ I+
Sbjct: 559  RKSIIRKAKAFSDMLKDCKDPSELRQWKNSSENAVSSANSMIEDIE 604


>ref|XP_004498635.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Cicer arietinum]
          Length = 596

 Score =  646 bits (1666), Expect = 0.0
 Identities = 338/575 (58%), Positives = 417/575 (72%), Gaps = 3/575 (0%)
 Frame = -3

Query: 2353 FSNQSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRDCATICNLLRD 2174
            F   SWLS PGNPLI W                                + + I  L  +
Sbjct: 26   FPFSSWLSQPGNPLIHWPSLPPIQPKPTLNPNPTPNPNTPTPDS-----NFSLISTLFTN 80

Query: 2173 HNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQPSYSSTAPIF 1994
             ++S G  L   L++TGIKP   LL+++F HF SSPK L +LF WA+KQ P +     +F
Sbjct: 81   PSISPGSQLHAQLNRTGIKPDSPLLRAVFDHFASSPKLLHSLFLWADKQ-PGFKPDPTLF 139

Query: 1993 NSMIELLAKSRDFDSAWSLLLDRMKTDEAPS---MISGQTFAILIRRYARAGMPLPAIRS 1823
            +SM+  LAK ++FDSAW+L+LDR+  +E      ++S  TFAILIRRYARAGM   AIR+
Sbjct: 140  DSMVNALAKIKEFDSAWTLVLDRIHREEEEKEDKLVSIGTFAILIRRYARAGMHEAAIRT 199

Query: 1822 LEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSIRVYNIL 1643
             EFA++      ++S + LF IL+DSLCKEG    AS+Y  R++E +  W+PS RVYNI+
Sbjct: 200  FEFAKDKKSIVDSMSEMSLFGILIDSLCKEGSVREASEYFLRRKETDLGWVPSTRVYNIM 259

Query: 1642 LNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGEMRRGGI 1463
            LNGWFR+RKLK  ERLWE+MKKENV P+VVTYGTLVEGYCR+ RVE+A+E+VGEM + GI
Sbjct: 260  LNGWFRARKLKHAERLWEEMKKENVKPSVVTYGTLVEGYCRMRRVEKALEMVGEMTKEGI 319

Query: 1462 EPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGDLVGASK 1283
            E NAIVYNPI+DAL E+ RFKEALGM+ER  V + GPT+STYNSLVKGFCKAGDL GASK
Sbjct: 320  EANAIVYNPIIDALAEAGRFKEALGMMERFHVLQIGPTLSTYNSLVKGFCKAGDLEGASK 379

Query: 1282 VLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHLLIKMLC 1103
            +LK MI RGF+P PTTYNYFFRYFSR GKIEEGMNLYTKMIESG+ PDRLTYHL++KMLC
Sbjct: 380  ILKKMISRGFLPIPTTYNYFFRYFSRCGKIEEGMNLYTKMIESGHTPDRLTYHLVLKMLC 439

Query: 1102 EQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRGIAPQYL 923
            E++RLDLAVQVS+EMR+ GYD+DL T TML+HLLCK+HR +EA  EFEDMIRRGI PQYL
Sbjct: 440  EEERLDLAVQVSKEMRHNGYDMDLATSTMLIHLLCKMHRLEEAFAEFEDMIRRGIVPQYL 499

Query: 922  TYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIMRKAQAM 743
            T++KL  ELKK GM E++ KL   M++ PHST LPNTY     N +   R SI++KAQA+
Sbjct: 500  TFQKLNVELKKQGMTEMSQKLCHLMSNVPHSTNLPNTYGEVRDN-AHAHRKSIIQKAQAV 558

Query: 742  SDVLKTTRNPRQLIKLRHSSENAVASATPLIDHIK 638
            SD+LK   +P++L K R SSEN V+ A  LI+ IK
Sbjct: 559  SDLLK---DPKELDKFRSSSENDVSIANCLIEDIK 590


>ref|XP_003588687.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355477735|gb|AES58938.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 587

 Score =  637 bits (1642), Expect = e-179
 Identities = 330/561 (58%), Positives = 415/561 (73%), Gaps = 2/561 (0%)
 Frame = -3

Query: 2341 SWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRDCATICNLLRDHNLS 2162
            SWLS PGNPLI W                                D   I  L  + ++S
Sbjct: 28   SWLSQPGNPLINWPSLPSKPTPIPTLNPNPNSKPESSQPTFSP-NDFTLISTLFTNPSIS 86

Query: 2161 SGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQPSYSSTAPIFNSMI 1982
             G +L T L QTGIKP P LL ++F HF SSPK L +L+ WA   QP +   + +F+S+I
Sbjct: 87   PGSSLLTNLTQTGIKPTPPLLHAVFDHFASSPKLLHSLYLWA-LNQPGFKPDSSLFDSVI 145

Query: 1981 ELLAKSRDFDSAWSLLLDRMKTDEAPS--MISGQTFAILIRRYARAGMPLPAIRSLEFAQ 1808
              LAK ++FD AWSL+LDR++ D+     ++S  TFAI+IRRYARAGM   AIR+ EFA+
Sbjct: 146  NALAKMKEFDDAWSLVLDRIRRDDDDDEKLVSVGTFAIIIRRYARAGMHKAAIRTFEFAK 205

Query: 1807 NLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSIRVYNILLNGWF 1628
            +      +VS + LFEIL+DSLCKEG A  AS+Y  R++E +  W+PSIRVYNI+LNGWF
Sbjct: 206  DKKSIVDSVSEMSLFEILIDSLCKEGSAREASEYLLRRKETDLGWVPSIRVYNIMLNGWF 265

Query: 1627 RSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGEMRRGGIEPNAI 1448
            R+RKLK  ERLWE+MK ENV P+VVTYGTLVEGYCR+ RVE+A+E+VGEM + GI+PNAI
Sbjct: 266  RARKLKHAERLWEEMKNENVRPSVVTYGTLVEGYCRMRRVEKALEMVGEMTKEGIKPNAI 325

Query: 1447 VYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGDLVGASKVLKMM 1268
            VYNPI+DAL E+ RFKEALGM+ER  V + GPT+STYNSLVKGFCKAGD+ GASK+LK M
Sbjct: 326  VYNPIIDALAEAGRFKEALGMMERFHVLQIGPTLSTYNSLVKGFCKAGDIEGASKILKKM 385

Query: 1267 IGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHLLIKMLCEQDRL 1088
            I RGF+P PTTYNYFFRYFSR GK++EGMNLYTKMIESG+ PDRLTYHL++KMLCE+++L
Sbjct: 386  ISRGFLPIPTTYNYFFRYFSRCGKVDEGMNLYTKMIESGHNPDRLTYHLVLKMLCEEEKL 445

Query: 1087 DLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRGIAPQYLTYRKL 908
            +LAVQVS EMR++GYD+DL T TML HLLCK+H+ +EA  EFEDMIRRGI PQYLT++KL
Sbjct: 446  ELAVQVSMEMRHKGYDMDLATSTMLTHLLCKMHKLEEAFAEFEDMIRRGIIPQYLTFQKL 505

Query: 907  IEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIMRKAQAMSDVLK 728
              ELKK GM E+A KL   M+S P+S KLPNTY  E  +++   R SI++KA+A+S++LK
Sbjct: 506  NVELKKQGMNEMARKLCHLMSSVPYSDKLPNTY-GEVRDDAHARRKSIIQKAKAVSELLK 564

Query: 727  TTRNPRQLIKLRHSSENAVAS 665
               +P++L K R SSE+AV+S
Sbjct: 565  ---DPKELDKFRSSSEDAVSS 582


>ref|XP_003549241.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like isoform X1 [Glycine max]
          Length = 622

 Score =  634 bits (1634), Expect = e-179
 Identities = 340/592 (57%), Positives = 421/592 (71%), Gaps = 9/592 (1%)
 Frame = -3

Query: 2386 NPNLLLPIQ----RLFSNQSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2219
            NPN   PI+    R FS+ SWLS PGNP+I+W                            
Sbjct: 31   NPNRTPPIRASTHRPFSS-SWLSEPGNPIIQWPSLPPPKPTPNPNPNPNLNPTPDPPSPN 89

Query: 2218 XXQRDCATICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQW 2039
                  + I NL  D +LS GPAL   LD+ GI+P P LL ++F  F SSPK L +LF W
Sbjct: 90   PNA--LSVISNLFADPSLSPGPALHAELDRAGIEPDPALLLAVFDRFGSSPKLLHSLFLW 147

Query: 2038 AEKQQPSYSSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTD-----EAPSMISGQTFAI 1874
            A+ + P++     +F++++  LAK+R+FD+AW L+L   + D     E   ++S  TFAI
Sbjct: 148  AQTR-PAFRPGPKLFDAVVNALAKAREFDAAWKLVLHHAEKDGEEEGEKERLVSVGTFAI 206

Query: 1873 LIRRYARAGMPLPAIRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRK 1694
            +IRRYARAGM   AIR+ EFA N      + S + L EIL+DSLCKEG    AS+Y   K
Sbjct: 207  MIRRYARAGMSKLAIRTYEFATNNKSIVDSGSEMSLLEILMDSLCKEGSVREASEYFLWK 266

Query: 1693 RELEPQWIPSIRVYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLC 1514
            +EL+  W+PSIRVYNI+LNGWFR RKLKQ ERLW +MK EN+ PTVVTYGTLVEGYCR+ 
Sbjct: 267  KELDLSWVPSIRVYNIMLNGWFRLRKLKQGERLWAEMK-ENMRPTVVTYGTLVEGYCRMR 325

Query: 1513 RVERAMELVGEMRRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYN 1334
            RVE+A+E+VG+M + GI PNAIVYNPI+DAL E+ RFKEALGM+ER  V E GPT STYN
Sbjct: 326  RVEKALEMVGDMTKEGIAPNAIVYNPIIDALAEAGRFKEALGMLERFHVLEIGPTDSTYN 385

Query: 1333 SLVKGFCKAGDLVGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIES 1154
            SLVKGFCKAGDLVGASK+LKMMI RGF+P+ TTYNYFFRYFSR  KIEEGMNLYTK+I+S
Sbjct: 386  SLVKGFCKAGDLVGASKILKMMISRGFLPSATTYNYFFRYFSRCRKIEEGMNLYTKLIQS 445

Query: 1153 GYVPDRLTYHLLIKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEA 974
            GY PDRLTYHLL+KMLCE+++LDLAVQVS+EMR+ GYD+DL T TMLVHLLCK+ R +EA
Sbjct: 446  GYTPDRLTYHLLVKMLCEEEKLDLAVQVSKEMRHNGYDMDLATSTMLVHLLCKVRRLEEA 505

Query: 973  CVEFEDMIRRGIAPQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENG 794
             VEFEDMIRRGI PQYLT++++  +LKK GM E+A KL   M+S P+S  LPNTY  E  
Sbjct: 506  FVEFEDMIRRGIVPQYLTFQRMKADLKKQGMTEMAQKLCKLMSSVPYSPNLPNTY-GEVR 564

Query: 793  NESREFRSSIMRKAQAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHIK 638
             ++   R SI+RKA+A SD+LK  ++P +L K R SSEN V+S   LI+ I+
Sbjct: 565  EDAYARRKSIIRKAKAFSDMLKDCKDPSELRKHRSSSENTVSSTNSLIEDIE 616


>ref|XP_006476197.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At5g11310, mitochondrial-like [Citrus sinensis]
          Length = 551

 Score =  619 bits (1597), Expect = e-174
 Identities = 314/561 (55%), Positives = 407/561 (72%)
 Frame = -3

Query: 2380 NLLLPIQRLFSNQSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRDC 2201
            +L  P ++LFS+Q+ LSVP N LI                                Q D 
Sbjct: 14   SLYSPFRKLFSDQACLSVPVNTLIP----------------------PSTPPHNFSQTDF 51

Query: 2200 ATICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQP 2021
            + I  LLR+  +SSGP+LE+ L+QTG++P P LL ++F HFD SPK L TLF+WAE + P
Sbjct: 52   SVISGLLRNPAISSGPSLESELNQTGVEPEPALLLAVFEHFDHSPKLLHTLFRWAESK-P 110

Query: 2020 SYSSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAGMP 1841
             +  +A +FN MI++LAK+++FDSAW LLLD++   E P  +S  TF ILIRRYARAGM 
Sbjct: 111  EFKCSAALFNCMIKVLAKAKEFDSAWCLLLDKIGGHEVPDFVSKDTFVILIRRYARAGMV 170

Query: 1840 LPAIRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSI 1661
              AI + EFA NLD  +   S   LFEILLDSLCK+G   AAS+Y  +++EL+  W P++
Sbjct: 171  EAAIWTFEFANNLDMVKNFDSGASLFEILLDSLCKQGRVKAASEYFHKRKELDQSWAPTV 230

Query: 1660 RVYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGE 1481
            RVYNILLNGWFRS+ +K  ER W +M+KENV+P VVTYGTLVEGYCRL RV+RA+ LV E
Sbjct: 231  RVYNILLNGWFRSKNVKDAERFWLEMRKENVTPNVVTYGTLVEGYCRLRRVDRAIRLVKE 290

Query: 1480 MRRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGD 1301
            MR+ GIEPNAIVYN ++D L E+ RF+E  GM+ER LV E GPT+ TY SLVKG+CKAGD
Sbjct: 291  MRKEGIEPNAIVYNTVIDGLVEAGRFEEVSGMMERFLVCEPGPTMVTYTSLVKGYCKAGD 350

Query: 1300 LVGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHL 1121
            L GASK+LKMMI R F+P+PTTYNYFFRYFS+FGK+++ MNLY KMIESGY PDRLTYH+
Sbjct: 351  LEGASKILKMMISRDFLPSPTTYNYFFRYFSKFGKVDDAMNLYRKMIESGYTPDRLTYHI 410

Query: 1120 LIKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRG 941
            L+KMLC++D+LDLA+QVS+EM+ RG D+DL T TML+HLLC++++FDEA  EFEDMIRRG
Sbjct: 411  LLKMLCKEDKLDLAIQVSKEMKCRGCDIDLDTSTMLIHLLCRMYKFDEASAEFEDMIRRG 470

Query: 940  IAPQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRENGNESREFRSSIM 761
            + P YLT+++L +E KK GM  LA KL   M+S P S +L ++ + ++ N S   R   M
Sbjct: 471  LVPHYLTFKRLNDEFKKRGMTALAQKLCNVMSSVPRSMELLDS-RCKDXNASDARRRPTM 529

Query: 760  RKAQAMSDVLKTTRNPRQLIK 698
            +KA+ MS +LK  ++PR+L+K
Sbjct: 530  QKAETMSHILKACKDPRELVK 550


>ref|XP_006450554.1| hypothetical protein CICLE_v10008018mg [Citrus clementina]
            gi|557553780|gb|ESR63794.1| hypothetical protein
            CICLE_v10008018mg [Citrus clementina]
          Length = 517

 Score =  603 bits (1556), Expect = e-170
 Identities = 303/520 (58%), Positives = 383/520 (73%)
 Frame = -3

Query: 2380 NLLLPIQRLFSNQSWLSVPGNPLIKWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRDC 2201
            +L  P + LFS+Q+ LSVP N LI                                Q D 
Sbjct: 14   SLYSPFRNLFSDQAGLSVPVNTLIP----------------------PSTPPHNFSQTDF 51

Query: 2200 ATICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQQP 2021
            + I  LLR+  +SSGP+LE+ L+QTG++P P LL ++F HFD SPK L TLF+WAE + P
Sbjct: 52   SVISGLLRNTAISSGPSLESELNQTGVEPEPALLLAVFEHFDHSPKLLHTLFRWAESK-P 110

Query: 2020 SYSSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAGMP 1841
             +  +A +FN +I++LAK+++FDSAW LLLD++   EAP  +S  TF ILIRRYARAGM 
Sbjct: 111  EFKCSAALFNCVIKVLAKAKEFDSAWCLLLDKIGGHEAPDFVSKDTFVILIRRYARAGMV 170

Query: 1840 LPAIRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIPSI 1661
              AIR+ EFA NLD  +   S   LFEILLDSLCK+G   AAS+Y  +++EL+  W P++
Sbjct: 171  EAAIRTFEFANNLDMVKNFDSGASLFEILLDSLCKQGRVKAASEYFHKRKELDQSWAPTV 230

Query: 1660 RVYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELVGE 1481
            RVYNILLNGWFRS+ +K  ER W +M+KENV+P VVTYGTLVEGYCRL RV+RA+ LV E
Sbjct: 231  RVYNILLNGWFRSKNVKDAERFWLEMRKENVTPNVVTYGTLVEGYCRLRRVDRAIRLVKE 290

Query: 1480 MRRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKAGD 1301
            MR+ GIEPNAIVYN ++D L E+ RF+E  GM+ER LV E GPT+ TY SLVKG+CKAGD
Sbjct: 291  MRKEGIEPNAIVYNTVIDGLVEAGRFEEVSGMMERFLVCEPGPTMVTYTSLVKGYCKAGD 350

Query: 1300 LVGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTYHL 1121
            L GASK+LKMMI RGF+P+PTTYNYFFRYFS+FGK+E+ MNLY KMIESGY PDRLTYH+
Sbjct: 351  LEGASKILKMMISRGFLPSPTTYNYFFRYFSKFGKVEDAMNLYRKMIESGYTPDRLTYHI 410

Query: 1120 LIKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIRRG 941
            L+KMLC++D+LDLA+QVS+EM+ RG D+DL T TML+HLLC++++FDEA  EFEDMIRRG
Sbjct: 411  LLKMLCKEDKLDLAIQVSKEMKCRGCDIDLDTSTMLIHLLCRMYKFDEASAEFEDMIRRG 470

Query: 940  IAPQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKL 821
            + P YLT+++L +E KK GM  LA KL   M+S P S +L
Sbjct: 471  LVPHYLTFKRLNDEFKKRGMTALAQKLCNVMSSVPRSMEL 510


>ref|XP_006399632.1| hypothetical protein EUTSA_v10013015mg [Eutrema salsugineum]
            gi|557100722|gb|ESQ41085.1| hypothetical protein
            EUTSA_v10013015mg [Eutrema salsugineum]
          Length = 603

 Score =  591 bits (1523), Expect = e-166
 Identities = 286/523 (54%), Positives = 389/523 (74%), Gaps = 1/523 (0%)
 Frame = -3

Query: 2206 DCATICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQ 2027
            D +TI NLL + ++  G +LE+ALD+TGI+P   L+Q++F    SSP  L +LF+WAE  
Sbjct: 71   DFSTISNLLENPDVDLGSSLESALDETGIEPSIQLIQALFDRLRSSPMLLHSLFKWAE-M 129

Query: 2026 QPSYSSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAG 1847
            +P ++ +  +F+S+I  L K+R+F+ AWSL+ DR+++D    ++S  TF +LIRRYARAG
Sbjct: 130  KPGFTPSPSMFDSVINALCKAREFEIAWSLIFDRVRSDGGSDLVSADTFVVLIRRYARAG 189

Query: 1846 MPLPAIRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKRELEPQWIP 1667
            M   AIR+ EFA++ DP   + S L L E+LLD+LCKEG    AS Y +R+R ++  W+P
Sbjct: 190  MVQQAIRAFEFARSYDPVCKSASELKLLEVLLDALCKEGHVREASMYLERRRRIDSNWVP 249

Query: 1666 SIRVYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMELV 1487
            S+R++NILLNGWFRSRKLKQ E LW +MK  NV PTVVTYGTL+EG+CR+ RVE AME++
Sbjct: 250  SVRIFNILLNGWFRSRKLKQAENLWAEMKVMNVKPTVVTYGTLIEGFCRMRRVEIAMEVL 309

Query: 1486 GEMRRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCKA 1307
             EM+   +E N +V+NPI+D LGES R +EALGM+ER  VSESGPT+ TYNSLVK FCKA
Sbjct: 310  EEMKMAEMELNFMVFNPIIDGLGESGRLQEALGMMERFFVSESGPTIVTYNSLVKSFCKA 369

Query: 1306 GDLVGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLTY 1127
            GDL GASK+LKMM+ RG  PTPTTYN+FF++FS+  K E+GMNLY K+IE+G+ PDR TY
Sbjct: 370  GDLTGASKILKMMMNRGVDPTPTTYNHFFKFFSKHNKTEQGMNLYFKLIEAGHSPDRFTY 429

Query: 1126 HLLIKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMIR 947
            HL++KMLCE  +L LA+QV++EM+NRG D DL T TM++HLLC+L   +EA  EFE  +R
Sbjct: 430  HLILKMLCEDGKLSLAMQVNKEMKNRGIDPDLLTTTMMIHLLCRLDMLEEAFGEFEKAVR 489

Query: 946  RGIAPQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKR-ENGNESREFRS 770
            RGI PQY+T++ +   L+  GM+++A +LS+ M+S PHS KLPNTY+   +    R+ + 
Sbjct: 490  RGIVPQYITFKMIDNGLRSKGMIDMAKRLSSVMSSLPHSKKLPNTYREVVDAPPDRDRKK 549

Query: 769  SIMRKAQAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHI 641
            SI+ KA+AMSDVLK  RNPR+L+K+R S +  V     L+D +
Sbjct: 550  SILHKAEAMSDVLKGCRNPRKLVKMRGSHQRTVGEDKKLVDDL 592


>ref|XP_002871469.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297317306|gb|EFH47728.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 602

 Score =  582 bits (1499), Expect = e-163
 Identities = 285/524 (54%), Positives = 389/524 (74%), Gaps = 2/524 (0%)
 Frame = -3

Query: 2206 DCATICNLLRDHNLSSGPALETALDQTGIKPGPTLLQSIFTHFDSSPKPLFTLFQWAEKQ 2027
            D +TI NLL + N+  G +LE+ALD+TGI+P   L+Q++F    SSP  L ++F+WAE  
Sbjct: 69   DLSTISNLLENTNVVPGSSLESALDETGIEPSLQLVQALFDRLSSSPMLLHSVFKWAE-M 127

Query: 2026 QPSYSSTAPIFNSMIELLAKSRDFDSAWSLLLDRMKTDEAPSMISGQTFAILIRRYARAG 1847
            +P ++ +  +F+S+I  L K+R+F+ AWSL+ DR+++DE  +++S  TF +LIRRYARAG
Sbjct: 128  KPGFTLSPSLFDSVINSLCKAREFEIAWSLVFDRVRSDEGSNLVSADTFIVLIRRYARAG 187

Query: 1846 MPLPAIRSLEFAQNLDPNRFTVSNLDLFEILLDSLCKEGLASAASKYCDRKREL-EPQWI 1670
            M   AIR+ EFA++ +P   + S L L E+LLD+LCKEG    AS Y +R+R + +  W+
Sbjct: 188  MVQQAIRAFEFARSYEPVCKSASELKLLEVLLDALCKEGYVREASVYLERRRGMMDSNWV 247

Query: 1669 PSIRVYNILLNGWFRSRKLKQVERLWEQMKKENVSPTVVTYGTLVEGYCRLCRVERAMEL 1490
            PS+R++NILLNGWFRSRKLKQ E+LWE+MK  NV PTVVTYGTL+EGYCR+ RVE AME+
Sbjct: 248  PSVRIFNILLNGWFRSRKLKQAEKLWEEMKAMNVKPTVVTYGTLIEGYCRMRRVEIAMEI 307

Query: 1489 VGEMRRGGIEPNAIVYNPIVDALGESKRFKEALGMIERLLVSESGPTVSTYNSLVKGFCK 1310
            + EM+   +E   +V+NPI+D LGE+ R  EALGM+ER  V ESGPT+ TYNSLVK FCK
Sbjct: 308  LEEMKMAEMELTFMVFNPIIDGLGEAGRLSEALGMMERFFVCESGPTIVTYNSLVKNFCK 367

Query: 1309 AGDLVGASKVLKMMIGRGFVPTPTTYNYFFRYFSRFGKIEEGMNLYTKMIESGYVPDRLT 1130
            AGDL GASK+LKMM+ RG  PT +TYN+FF+YFS+  K EEGMNLY K+IE+G+ PDRLT
Sbjct: 368  AGDLPGASKILKMMMTRGVEPTTSTYNHFFKYFSKHNKTEEGMNLYFKLIEAGHSPDRLT 427

Query: 1129 YHLLIKMLCEQDRLDLAVQVSQEMRNRGYDLDLPTCTMLVHLLCKLHRFDEACVEFEDMI 950
            YHL++KMLCE  +L LA+QV++EM+NRG D DL T TML+HLLC+L   +EA  EF++ +
Sbjct: 428  YHLILKMLCEDGKLSLAIQVNKEMKNRGIDPDLLTTTMLMHLLCRLDMLEEAFEEFDNAV 487

Query: 949  RRGIAPQYLTYRKLIEELKKSGMVELANKLSAKMASTPHSTKLPNTYKRE-NGNESREFR 773
            RRGI PQY+T++ +   L+  GM ++A +LS+ M+S PHS KLPNTY+   +    ++ R
Sbjct: 488  RRGIIPQYITFKMIDNGLRSKGMTDMAKRLSSLMSSLPHSKKLPNTYREAVDAPPDKDRR 547

Query: 772  SSIMRKAQAMSDVLKTTRNPRQLIKLRHSSENAVASATPLIDHI 641
             SI+ +A+AMSDVLK  RNPR+L+K+R S +  V     L D +
Sbjct: 548  KSILHRAEAMSDVLKGCRNPRKLVKMRGSHKKGVREDESLTDDL 591

