BLASTX nr result

ID: Papaver27_contig00029020 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver27_contig00029020
         (2370 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285611.1| PREDICTED: pentatricopeptide repeat-containi...   880   0.0  
emb|CAN61988.1| hypothetical protein VITISV_026694 [Vitis vinifera]   862   0.0  
gb|EXC01179.1| hypothetical protein L484_025557 [Morus notabilis]     860   0.0  
ref|XP_007018302.1| Pentatricopeptide repeat (PPR-like) superfam...   848   0.0  
ref|XP_006364562.1| PREDICTED: pentatricopeptide repeat-containi...   848   0.0  
ref|XP_002302359.2| hypothetical protein POPTR_0002s11020g [Popu...   848   0.0  
ref|XP_004240633.1| PREDICTED: pentatricopeptide repeat-containi...   845   0.0  
ref|XP_006433766.1| hypothetical protein CICLE_v10000605mg [Citr...   840   0.0  
ref|XP_006472405.1| PREDICTED: pentatricopeptide repeat-containi...   838   0.0  
ref|XP_004170776.1| PREDICTED: pentatricopeptide repeat-containi...   835   0.0  
ref|XP_004288876.1| PREDICTED: pentatricopeptide repeat-containi...   833   0.0  
ref|XP_003523769.1| PREDICTED: pentatricopeptide repeat-containi...   828   0.0  
ref|XP_007137661.1| hypothetical protein PHAVU_009G145100g [Phas...   822   0.0  
ref|XP_003527866.1| PREDICTED: pentatricopeptide repeat-containi...   822   0.0  
ref|XP_002889775.1| pentatricopeptide repeat-containing protein ...   815   0.0  
ref|NP_172461.1| pentatricopeptide repeat-containing protein [Ar...   811   0.0  
ref|XP_006306156.1| hypothetical protein CARUB_v10011677mg, part...   774   0.0  
ref|XP_004501057.1| PREDICTED: pentatricopeptide repeat-containi...   760   0.0  
ref|XP_006856585.1| hypothetical protein AMTR_s00046p00202770 [A...   757   0.0  
gb|EPS61251.1| hypothetical protein M569_13548, partial [Genlise...   691   0.0  

>ref|XP_002285611.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            isoform 1 [Vitis vinifera]
          Length = 610

 Score =  880 bits (2275), Expect = 0.0
 Identities = 458/613 (74%), Positives = 503/613 (82%), Gaps = 18/613 (2%)
 Frame = -3

Query: 2056 MDSVLPIEQTYVGFSSVNNCVHKDTFRVSSLGFN-----------NACFCVLSSDCDVR- 1913
            MD ++P+ QT+ G  S+ + VH++    + LG              A F  LS       
Sbjct: 1    MDLIVPVSQTHEGLYSLQH-VHRENTTKTCLGTRARFRSNLVLGYKARFLALSDGTSNEC 59

Query: 1912 KSNYPSRKYRRNPNPVLAM-----YGSNGKLP-TPSNTHLSDNGHSNSMYSKNSYEDFQS 1751
            K    SR  RRN   V A      + SN KLP    N H+  +G + +  S +S E+ +S
Sbjct: 60   KKIGGSRNQRRNQ--VFAALRADTFSSNDKLPYAEKNQHVHLSGGNYTSNSSSSIEEHES 117

Query: 1750 NNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGFCRVGKTKKATRVLEILEN 1571
            NNHLRRLVRNGELE+G  FLE+MV  GDIPDIIPCTSLIRGFCR+GKTKKAT V+EILE 
Sbjct: 118  NNHLRRLVRNGELEDGFKFLESMVYRGDIPDIIPCTSLIRGFCRIGKTKKATWVMEILEQ 177

Query: 1570 SGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTYNTILRSLCDSGKLRQAME 1391
            SGA+PDVITYNVLISGYCK GEIDNAL VLDRM+V PDVVTYNTILR+LCDSGKL+QAME
Sbjct: 178  SGAVPDVITYNVLISGYCKSGEIDNALQVLDRMNVAPDVVTYNTILRTLCDSGKLKQAME 237

Query: 1390 VLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMRSKGCKPDVVTYNVLINGIC 1211
            VLD QLQKECYPDVITYTILIEATC+ESGVGQAMKL+DEMR+KG KPDVVTYNVLINGIC
Sbjct: 238  VLDRQLQKECYPDVITYTILIEATCKESGVGQAMKLLDEMRNKGSKPDVVTYNVLINGIC 297

Query: 1210 KEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVV 1031
            KEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVV
Sbjct: 298  KEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVV 357

Query: 1030 TFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIM 851
            TFNILINFLCR+GLLGRAI+IL+KMP HGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIM
Sbjct: 358  TFNILINFLCRQGLLGRAIDILEKMPMHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIM 417

Query: 850  VSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGCAPVLITYNTVIDGLSKMGNT 671
            VSRGCYPDIVTYNTLLTALCKDGKV+VAVE+LNQL SKGC+PVLITYNTVIDGLSK+G T
Sbjct: 418  VSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSSKGCSPVLITYNTVIDGLSKVGKT 477

Query: 670  ERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKFFHDLEGIGIRPNAITYNSV 491
            ERA         KGL+PDIITYSSLV+GLSREGKVDE+IKFFHDLEG+GIRPNAITYNS+
Sbjct: 478  ERAIKLLDEMRRKGLKPDIITYSSLVSGLSREGKVDEAIKFFHDLEGLGIRPNAITYNSI 537

Query: 490  MLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIXXXXXXXXXXXXXXXLCRRGV 311
            MLGLCK+RQT RAIDFLAYM+SK CKPTEATYTILIEGI               LC RG+
Sbjct: 538  MLGLCKSRQTDRAIDFLAYMISKRCKPTEATYTILIEGIAYEGLAKEALDLLNELCSRGL 597

Query: 310  VKRSSAQNVAVKM 272
            VK+SSA+ VAVKM
Sbjct: 598  VKKSSAEQVAVKM 610


>emb|CAN61988.1| hypothetical protein VITISV_026694 [Vitis vinifera]
          Length = 553

 Score =  862 bits (2227), Expect = 0.0
 Identities = 428/510 (83%), Positives = 461/510 (90%)
 Frame = -3

Query: 1801 GHSNSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGFC 1622
            G + +  S +S E+ +SNNHLRRLVRNGELE+G  FLE+MV  GDIPDIIPCTSLIRGFC
Sbjct: 44   GGNYTSNSSSSIEEHESNNHLRRLVRNGELEDGFKFLESMVYRGDIPDIIPCTSLIRGFC 103

Query: 1621 RVGKTKKATRVLEILENSGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTYN 1442
            R+GKTKKAT V+EILE SGA+PDVITYNVLISGYCK GEIDNAL VLDRM+V PDVVTYN
Sbjct: 104  RIGKTKKATWVMEILEQSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMNVAPDVVTYN 163

Query: 1441 TILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMRSK 1262
            TILR+LCDSGKL+QAMEVLD QLQKECYPDVITYTILIEATC+ESGVGQAMKL+DEMR+K
Sbjct: 164  TILRTLCDSGKLKQAMEVLDRQLQKECYPDVITYTILIEATCKESGVGQAMKLLDEMRNK 223

Query: 1261 GCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDA 1082
            G KPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDA
Sbjct: 224  GSKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDA 283

Query: 1081 EKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLHG 902
            EKLLSDMLRKGCSPSVVTFNILINFLCR+GLLGRAI+IL+KMP HGCTPNSLSYNPLLHG
Sbjct: 284  EKLLSDMLRKGCSPSVVTFNILINFLCRQGLLGRAIDILEKMPMHGCTPNSLSYNPLLHG 343

Query: 901  FCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGCAPV 722
            FCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+LNQL SKGC+PV
Sbjct: 344  FCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSSKGCSPV 403

Query: 721  LITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKFFH 542
            LITYNTVIDGLSK+G TERA         KGL+PDIITYSSLV+GLSREGKVDE+IKFFH
Sbjct: 404  LITYNTVIDGLSKVGKTERAIKLLDEMRRKGLKPDIITYSSLVSGLSREGKVDEAIKFFH 463

Query: 541  DLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIXXXX 362
            DLEG+GIRPNAITYNS+MLGLCK+RQT RAIDFLAYM+SK CKPTEATYTILIEGI    
Sbjct: 464  DLEGLGIRPNAITYNSIMLGLCKSRQTDRAIDFLAYMISKRCKPTEATYTILIEGIAYEG 523

Query: 361  XXXXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
                       LC RG+VK+SSA+ VAVKM
Sbjct: 524  LAKEALDLLNELCSRGLVKKSSAEQVAVKM 553


>gb|EXC01179.1| hypothetical protein L484_025557 [Morus notabilis]
          Length = 610

 Score =  860 bits (2221), Expect = 0.0
 Identities = 442/610 (72%), Positives = 493/610 (80%), Gaps = 15/610 (2%)
 Frame = -3

Query: 2056 MDSVLPIEQTYVGFSSVNNCVHKDTFRVSSLGFN--------NACFCVLSSDCDVRKSNY 1901
            MDSV+PI QT  GF S +    + T   S+ G +         A   V+S +   R +  
Sbjct: 1    MDSVVPIRQTPDGFCSFHKTSLESTRNTSTFGGSLVGVAVGCKARLLVMSHNIQCRINEG 60

Query: 1900 PSRKYRRNPNPV--LAMYGSNGKLP-----TPSNTHLSDNGHSNSMYSKNSYEDFQSNNH 1742
             S+  R++   V  +    SNG+L       P +     N  +NS +S  ++E+F+ N  
Sbjct: 61   SSKHRRKHVLAVSKIEALSSNGRLQENFEKNPFDHLNGTNSLANSGHSTRNFEEFEGNKR 120

Query: 1741 LRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGFCRVGKTKKATRVLEILENSGA 1562
            LRR VRNGELEEG   LE MV HGDIPDII CTSLIRGFC++GKTKKA+RV+EILE SGA
Sbjct: 121  LRRFVRNGELEEGFKVLERMVYHGDIPDIIACTSLIRGFCKIGKTKKASRVMEILEESGA 180

Query: 1561 IPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTYNTILRSLCDSGKLRQAMEVLD 1382
             PDVITYNVLISGYCK GEIDNAL VLDRMSV PDVVTYNTILR+LCDSGKL++AMEVLD
Sbjct: 181  APDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRTLCDSGKLKEAMEVLD 240

Query: 1381 LQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMRSKGCKPDVVTYNVLINGICKEG 1202
             QL++ECYPDVITYTILIEATC+ESGVGQAMKL+DEMRSKGCKPDVVTYNVLINGICKEG
Sbjct: 241  RQLRRECYPDVITYTILIEATCKESGVGQAMKLLDEMRSKGCKPDVVTYNVLINGICKEG 300

Query: 1201 RLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVVTFN 1022
            RLDEAIKFLNNMPSYGC  NVITHNIILRSMCSTGRWMDAEKLL++M+RKGCSPSVVTFN
Sbjct: 301  RLDEAIKFLNNMPSYGCHSNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFN 360

Query: 1021 ILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSR 842
            ILINFLCRKGLLGRAI+IL+KMP+HGCTPNSLSYNPLLHGFCKEKKM RAIEYLD+MVSR
Sbjct: 361  ILINFLCRKGLLGRAIDILEKMPKHGCTPNSLSYNPLLHGFCKEKKMARAIEYLDVMVSR 420

Query: 841  GCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGCAPVLITYNTVIDGLSKMGNTERA 662
            GCYPDIVTYNTLLTALCKDGKV++AV +LNQL SKGC+PVLITYNTVIDGLSK G TERA
Sbjct: 421  GCYPDIVTYNTLLTALCKDGKVDIAVVILNQLSSKGCSPVLITYNTVIDGLSKAGETERA 480

Query: 661  XXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKFFHDLEGIGIRPNAITYNSVMLG 482
                     KGL+PDIITYSSLV GLSREGKVDE+IKFFHDLEG GI+PNAIT+NS+MLG
Sbjct: 481  IKLLYEMQRKGLKPDIITYSSLVGGLSREGKVDEAIKFFHDLEGFGIKPNAITFNSIMLG 540

Query: 481  LCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIXXXXXXXXXXXXXXXLCRRGVVKR 302
            LCKARQT RAIDFLA+MVSKGCKPTEATYTILIEG+               LC RGVVK+
Sbjct: 541  LCKARQTSRAIDFLAHMVSKGCKPTEATYTILIEGLAYEGLAKEALELLSELCARGVVKK 600

Query: 301  SSAQNVAVKM 272
            SSA  VAV+M
Sbjct: 601  SSADQVAVRM 610


>ref|XP_007018302.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao] gi|508723630|gb|EOY15527.1| Pentatricopeptide
            repeat (PPR-like) superfamily protein [Theobroma cacao]
          Length = 606

 Score =  848 bits (2192), Expect = 0.0
 Identities = 422/532 (79%), Positives = 469/532 (88%), Gaps = 5/532 (0%)
 Frame = -3

Query: 1852 GSNGK---LPTPSNTHLSDNGH--SNSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVFLE 1688
            G NG+   L + S  HL  NGH  S+ + S +++E+  SNN LR+ VRNGELEEG   LE
Sbjct: 76   GVNGRFQNLDSSSQGHLG-NGHVSSSPLKSLHNFEESGSNNQLRKFVRNGELEEGFKLLE 134

Query: 1687 NMVSHGDIPDIIPCTSLIRGFCRVGKTKKATRVLEILENSGAIPDVITYNVLISGYCKLG 1508
             MV HG+IPDII CTSLIRGFC+ GKT+KATRV+EI+E+SGA+PDVITYNVLISGYCK G
Sbjct: 135  GMVYHGEIPDIIACTSLIRGFCKKGKTRKATRVMEIIEDSGAVPDVITYNVLISGYCKAG 194

Query: 1507 EIDNALNVLDRMSVPPDVVTYNTILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILI 1328
            EIDNAL VLDRMSV PDVVTYNTILRSLCDSGKL+QAMEV+D QLQ+ECYPDVITYTILI
Sbjct: 195  EIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKLKQAMEVMDRQLQRECYPDVITYTILI 254

Query: 1327 EATCRESGVGQAMKLIDEMRSKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQ 1148
            EATC+ESGVGQAMKL+DEMRS+GCKPDVVTYNVL+NGICKEGRLDEAIKFLNNMPSYGCQ
Sbjct: 255  EATCKESGVGQAMKLLDEMRSRGCKPDVVTYNVLVNGICKEGRLDEAIKFLNNMPSYGCQ 314

Query: 1147 PNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINI 968
            PNVITHNIILRSMCSTGRWMDAE+LL+DMLRKGCSPSVVTFNILINFLCRKGLLGRAI+I
Sbjct: 315  PNVITHNIILRSMCSTGRWMDAERLLADMLRKGCSPSVVTFNILINFLCRKGLLGRAIDI 374

Query: 967  LDKMPEHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCK 788
            L+KMP+HGCTPNSLSYNPLLHGFCKEKKM+RAIEYL+IMVSRGCYPDIVTYNTLLTALCK
Sbjct: 375  LEKMPKHGCTPNSLSYNPLLHGFCKEKKMERAIEYLEIMVSRGCYPDIVTYNTLLTALCK 434

Query: 787  DGKVEVAVELLNQLGSKGCAPVLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIIT 608
            DGKV+VAVE+LNQL +KGC+PVLITYNTVIDGLSK+G T++A         KGL+PDIIT
Sbjct: 435  DGKVDVAVEILNQLSTKGCSPVLITYNTVIDGLSKVGKTDQAIKLLEEMRAKGLKPDIIT 494

Query: 607  YSSLVAGLSREGKVDESIKFFHDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMV 428
            YSSLV GLSREGKVD++IKFFHD E +GIRPNAITYNS+MLGLCKARQT RAIDFLAYMV
Sbjct: 495  YSSLVGGLSREGKVDDAIKFFHDFERMGIRPNAITYNSIMLGLCKARQTDRAIDFLAYMV 554

Query: 427  SKGCKPTEATYTILIEGIXXXXXXXXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
             +GCKPTE+TYTILIEG+               LC RGVVK+SSA+ VAVKM
Sbjct: 555  MRGCKPTESTYTILIEGLAYEGFANEALELLNELCSRGVVKKSSAEQVAVKM 606


>ref|XP_006364562.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Solanum tuberosum]
          Length = 623

 Score =  848 bits (2190), Expect = 0.0
 Identities = 445/627 (70%), Positives = 496/627 (79%), Gaps = 32/627 (5%)
 Frame = -3

Query: 2056 MDSVLPIEQTYVGFSSVN--------NCVHKDTFRVSSL------------GFNNACFCV 1937
            M+ ++P +QT+ GF S +        NC +   F+ SSL             F+ +    
Sbjct: 1    MELIVPTKQTHEGFCSFHSTRKEITVNCCNNRRFKNSSLLVGQLRKQRQDMVFSISKIET 60

Query: 1936 LSSDCDVRKSNYPSRKYR------------RNPNPVLAMYGSNGKLPTPSNTHLSDNGHS 1793
            LSS   V K  + S   R            +  N V A+        T +    S    +
Sbjct: 61   LSS---VEKRGFGSSNRRCKNSLLGGQLRKQRQNKVFAISRIEILGGTTNGRLSSVEKKT 117

Query: 1792 NSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGFCRVG 1613
            N   S+N  E+F+SNN+LRRLVRNGELEE    LE+MV  GDIPDIIPCTSLIRGFCR+G
Sbjct: 118  NGSISEN-IEEFESNNYLRRLVRNGELEESFKHLESMVYRGDIPDIIPCTSLIRGFCRIG 176

Query: 1612 KTKKATRVLEILENSGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTYNTIL 1433
            +TKKATRVLEILE+SGA+PDVITYNVLISGYCK GEIDNAL VLDRMSV PDVVTYNTIL
Sbjct: 177  QTKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALKVLDRMSVAPDVVTYNTIL 236

Query: 1432 RSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMRSKGCK 1253
            RSLCDSGKL+QAM VLD  LQKECYPDVITYTILIEATC+ESGVGQAMKL+DEMRSKGC 
Sbjct: 237  RSLCDSGKLKQAMHVLDRMLQKECYPDVITYTILIEATCKESGVGQAMKLLDEMRSKGCV 296

Query: 1252 PDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKL 1073
            PDVVTYNVLINGICKEGRL+EAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKL
Sbjct: 297  PDVVTYNVLINGICKEGRLNEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKL 356

Query: 1072 LSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLHGFCK 893
            L+DM+RKGCSPSVVTFNILINFLCRKGLLGRAI++L+KMP++GCTPNSLSYNPLLH FCK
Sbjct: 357  LADMVRKGCSPSVVTFNILINFLCRKGLLGRAIDLLEKMPKYGCTPNSLSYNPLLHAFCK 416

Query: 892  EKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGCAPVLIT 713
            EKKMDRAIEYL++MVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+LNQL  KGC+PVLIT
Sbjct: 417  EKKMDRAIEYLEVMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSDKGCSPVLIT 476

Query: 712  YNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKFFHDLE 533
            YNTVIDGLSK+G TE A        EKGLQPDIITYSS VAGLSREGKVDE+IKFFHD+E
Sbjct: 477  YNTVIDGLSKVGKTELAIELLNEMREKGLQPDIITYSSFVAGLSREGKVDEAIKFFHDIE 536

Query: 532  GIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIXXXXXXX 353
            G+ +RPNAITYN++MLGLCKARQT RAIDFLAYM+SKGCKPTE+TYTILIEGI       
Sbjct: 537  GLDVRPNAITYNAIMLGLCKARQTDRAIDFLAYMISKGCKPTESTYTILIEGIAYEGLAE 596

Query: 352  XXXXXXXXLCRRGVVKRSSAQNVAVKM 272
                    LC RGVVK+SSA+ V VKM
Sbjct: 597  EALELLNELCSRGVVKKSSAEQVVVKM 623


>ref|XP_002302359.2| hypothetical protein POPTR_0002s11020g [Populus trichocarpa]
            gi|550344756|gb|EEE81632.2| hypothetical protein
            POPTR_0002s11020g [Populus trichocarpa]
          Length = 637

 Score =  848 bits (2190), Expect = 0.0
 Identities = 432/605 (71%), Positives = 494/605 (81%), Gaps = 8/605 (1%)
 Frame = -3

Query: 2062 KTMDSVLPIEQTYVGFSSVNNCVHKDTFRVSSLGFNNACFCVLSSDCDVRKSNYPSRKYR 1883
            + MD ++P   T+ G  S        T R S  G       +  +D   RK +   RK R
Sbjct: 45   QVMDLIVPTSHTHEGLRSFQYFNRYTTRRSSFAGAR-----IRGNDGSSRKVHVGFRKLR 99

Query: 1882 RNPNPVLAMYG-----SNGKLPT---PSNTHLSDNGHSNSMYSKNSYEDFQSNNHLRRLV 1727
            ++   V A+ G     SNGKL     P N H+  NGH +S    +S E+F+SNNHLR+LV
Sbjct: 100  KSR--VFAVSGVETFRSNGKLQNLDKPLNGHMG-NGHVSS----SSIEEFESNNHLRKLV 152

Query: 1726 RNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGFCRVGKTKKATRVLEILENSGAIPDVI 1547
            RNGELEEG  FLENMV  G+IPDII  TSLIRGFC++GKT+KATR++EI+E+SGA+PDVI
Sbjct: 153  RNGELEEGFRFLENMVYRGEIPDIIASTSLIRGFCKIGKTRKATRIMEIIEDSGAVPDVI 212

Query: 1546 TYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTYNTILRSLCDSGKLRQAMEVLDLQLQK 1367
            TYNVLISGYCK GEIDNAL VLDRMSV PDVVTYNTILR+LCDSGKL+QAMEVLD QL+K
Sbjct: 213  TYNVLISGYCKAGEIDNALRVLDRMSVAPDVVTYNTILRTLCDSGKLKQAMEVLDRQLEK 272

Query: 1366 ECYPDVITYTILIEATCRESGVGQAMKLIDEMRSKGCKPDVVTYNVLINGICKEGRLDEA 1187
            ECYPDVITYTILIEATC ESGVGQAMKL+DEM S+GCKPDVVTYNVL+NG+CKEGRLDEA
Sbjct: 273  ECYPDVITYTILIEATCAESGVGQAMKLLDEMGSRGCKPDVVTYNVLVNGMCKEGRLDEA 332

Query: 1186 IKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVVTFNILINF 1007
            IKFLN+MPSYG QPNVITHNIILRSMCSTGRWMDAEKLL++M+RKGCSPSVVTFNILINF
Sbjct: 333  IKFLNSMPSYGSQPNVITHNIILRSMCSTGRWMDAEKLLTEMVRKGCSPSVVTFNILINF 392

Query: 1006 LCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPD 827
            LCRKGLLGRAI+IL+KMP HGCTPNSLSYNPLLHGFCKEKKMDRAI+YL+IMVSRGCYPD
Sbjct: 393  LCRKGLLGRAIDILEKMPTHGCTPNSLSYNPLLHGFCKEKKMDRAIQYLEIMVSRGCYPD 452

Query: 826  IVTYNTLLTALCKDGKVEVAVELLNQLGSKGCAPVLITYNTVIDGLSKMGNTERAXXXXX 647
            IVTYNT+LTALCKDGKV+ AVELLNQL SKGC+PVLITYNTVIDGLSK+G T++A     
Sbjct: 453  IVTYNTMLTALCKDGKVDAAVELLNQLSSKGCSPVLITYNTVIDGLSKVGKTDQAVELLH 512

Query: 646  XXXEKGLQPDIITYSSLVAGLSREGKVDESIKFFHDLEGIGIRPNAITYNSVMLGLCKAR 467
                KGL+PD+ITYSSL+AGLSREGKV+E+IKFFHD+EG G++PNA TYNS+M GLCKA+
Sbjct: 513  EMRGKGLKPDVITYSSLIAGLSREGKVEEAIKFFHDVEGFGVKPNAFTYNSIMFGLCKAQ 572

Query: 466  QTVRAIDFLAYMVSKGCKPTEATYTILIEGIXXXXXXXXXXXXXXXLCRRGVVKRSSAQN 287
            QT RAIDFLAYM+SKGCKPTE +YTILIEGI               LC RGVVK+SSA+ 
Sbjct: 573  QTDRAIDFLAYMISKGCKPTEVSYTILIEGIANEGLAKEALELLNELCSRGVVKKSSAEQ 632

Query: 286  VAVKM 272
            V V++
Sbjct: 633  VVVRL 637


>ref|XP_004240633.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Solanum lycopersicum]
          Length = 739

 Score =  845 bits (2182), Expect = 0.0
 Identities = 441/621 (71%), Positives = 495/621 (79%), Gaps = 32/621 (5%)
 Frame = -3

Query: 2056 MDSVLPIEQTYVGFSSVN--------NCVHKDTFRVSSLGFN-------NACFCV--LSS 1928
            M+ ++P +QT+ GF S +        NC +   F+ SSL          +  F V  + +
Sbjct: 1    MELIVPTKQTHEGFCSFHSTRKDITVNCCNNRRFKNSSLLVGQLRKQRQDKVFPVSKIET 60

Query: 1927 DCDVRKSNYPSRKYRR-------------NPNPVLAMYG--SNGKLPTPSNTHLSDNGHS 1793
               V K  + S   RR               N V A+ G  +NG+L +           +
Sbjct: 61   LSSVEKRGFGSSSNRRCKNSLLVGQLRKQRQNKVFAILGGTTNGRLSSVEK-------RT 113

Query: 1792 NSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGFCRVG 1613
            N   S+N  E+F+SNN+LRRLVRNGELEE    LE+MV  GDIPDIIPCTSLIRGFCR+G
Sbjct: 114  NGSVSEN-IEEFESNNYLRRLVRNGELEESFKHLESMVYRGDIPDIIPCTSLIRGFCRIG 172

Query: 1612 KTKKATRVLEILENSGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTYNTIL 1433
            +TKKATRVLEILE+SGA+PDVITYNVLISGYCK GEIDNAL VLDRMSV PDVVTYNTIL
Sbjct: 173  QTKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALKVLDRMSVAPDVVTYNTIL 232

Query: 1432 RSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMRSKGCK 1253
            RSLCDSGKL+QAM VLD  LQKECYPDVITYTILIEATC+ESGVGQAMKL+DEMRSKGC 
Sbjct: 233  RSLCDSGKLKQAMHVLDRMLQKECYPDVITYTILIEATCKESGVGQAMKLLDEMRSKGCV 292

Query: 1252 PDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKL 1073
            PDVVTYNVLINGICKEGRL+EAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKL
Sbjct: 293  PDVVTYNVLINGICKEGRLNEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKL 352

Query: 1072 LSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLHGFCK 893
            L+DM+RKGCSPSVVTFNILINFLCRKGLLGRAI++L+KMP++GCTPNSLSYNPLLH FCK
Sbjct: 353  LADMVRKGCSPSVVTFNILINFLCRKGLLGRAIDLLEKMPKYGCTPNSLSYNPLLHAFCK 412

Query: 892  EKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGCAPVLIT 713
            EKKMDRAI+YL++MVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+LNQL  KGC+PVLIT
Sbjct: 413  EKKMDRAIQYLEVMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSDKGCSPVLIT 472

Query: 712  YNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKFFHDLE 533
            YNTVIDGLSK+G TE A        EKGLQPDIITYSS VAGLSREGKVDE+IKFFHD+E
Sbjct: 473  YNTVIDGLSKVGKTELAIELLNEMREKGLQPDIITYSSFVAGLSREGKVDEAIKFFHDIE 532

Query: 532  GIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIXXXXXXX 353
            G+ +RPNAITYN++MLGLCKARQT RAIDFLAYM+SKGCKPTE+TYTILIEGI       
Sbjct: 533  GLDVRPNAITYNAIMLGLCKARQTDRAIDFLAYMISKGCKPTESTYTILIEGIAYEGLAE 592

Query: 352  XXXXXXXXLCRRGVVKRSSAQ 290
                    LC RGVVK+SSA+
Sbjct: 593  EALELLNELCSRGVVKKSSAE 613


>ref|XP_006433766.1| hypothetical protein CICLE_v10000605mg [Citrus clementina]
            gi|557535888|gb|ESR47006.1| hypothetical protein
            CICLE_v10000605mg [Citrus clementina]
          Length = 619

 Score =  840 bits (2171), Expect = 0.0
 Identities = 436/620 (70%), Positives = 496/620 (80%), Gaps = 25/620 (4%)
 Frame = -3

Query: 2056 MDSVLPIEQTYVGFSSVNN-----CVHKDTFRVSSLGFNNA------------CFCVLSS 1928
            MD ++P    + GF S  +     C +   F V +     A             F V S+
Sbjct: 1    MDIIVPANHAHEGFCSFQHFTRESCRNTGPFSVGAGDIARARAIRKVHVGCKVSFSVQSA 60

Query: 1927 DCDVRKSNYPSRKYRRNPNPVLAMYGS---NGKLPTPS---NTHLSDNGHSNSMYSKNS- 1769
            D    K +   +K R+N    ++   +   NGK+         HL+ NGH +S    +S 
Sbjct: 61   DSVDFKKHKGFQKQRQNRVFAISKVETLSFNGKMKHGEAFVQGHLN-NGHISSGMENSSL 119

Query: 1768 -YEDFQSNNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGFCRVGKTKKATR 1592
             +EDF+SNNHLRRLVRNGELEEG  FLE+MV HGDIPDIIPCTSLIRGFC+VGKT+KATR
Sbjct: 120  NFEDFESNNHLRRLVRNGELEEGFKFLESMVYHGDIPDIIPCTSLIRGFCKVGKTRKATR 179

Query: 1591 VLEILENSGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTYNTILRSLCDSG 1412
            V+EI+E+SGA+PDVITYNVLISGYC+LGEIDNAL VL+RMSV PDVVTYNTILR+LCDSG
Sbjct: 180  VMEIVEDSGAVPDVITYNVLISGYCRLGEIDNALQVLERMSVAPDVVTYNTILRTLCDSG 239

Query: 1411 KLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMRSKGCKPDVVTYN 1232
            KL  AMEVL  QL+KECYPDVITYTILIEATC+ESGVGQAMKL+DEMR+KGC PDVVTYN
Sbjct: 240  KLNLAMEVLHKQLEKECYPDVITYTILIEATCKESGVGQAMKLLDEMRNKGCIPDVVTYN 299

Query: 1231 VLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRK 1052
            VL+NGICKEGRLDEAIKFLN+MPSYGCQPNVITHNIILRSMCSTGRWMDAE+LL++M+ K
Sbjct: 300  VLVNGICKEGRLDEAIKFLNDMPSYGCQPNVITHNIILRSMCSTGRWMDAERLLAEMVLK 359

Query: 1051 GCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLHGFCKEKKMDRA 872
            GCSPSVVTFNILINFLCRKGLLGRAI+IL+KMP+HGCTPNSLSYNP+LHGFCKEKKMDRA
Sbjct: 360  GCSPSVVTFNILINFLCRKGLLGRAIDILEKMPKHGCTPNSLSYNPVLHGFCKEKKMDRA 419

Query: 871  IEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGCAPVLITYNTVIDG 692
            IEYL+IMVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+LNQL +K C+PVLITYNTVIDG
Sbjct: 420  IEYLEIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSNKHCSPVLITYNTVIDG 479

Query: 691  LSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKFFHDLEGIGIRPN 512
            LSK+G TE+A         KGL+PD ITYSSLV GLSREGKVDE+IK FHDLE +G+RPN
Sbjct: 480  LSKVGKTEQAMKLLEEMRTKGLKPDTITYSSLVGGLSREGKVDEAIKLFHDLERLGVRPN 539

Query: 511  AITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIXXXXXXXXXXXXXX 332
             ITYNS++LGLCKARQT RAID LA MV++GCKPTEATYTILIEGI              
Sbjct: 540  VITYNSIILGLCKARQTYRAIDILADMVTRGCKPTEATYTILIEGIAYEGLAKEALDLLN 599

Query: 331  XLCRRGVVKRSSAQNVAVKM 272
             LC RGVVK+SSA+ VAVKM
Sbjct: 600  QLCSRGVVKKSSAEQVAVKM 619


>ref|XP_006472405.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Citrus sinensis]
          Length = 619

 Score =  838 bits (2166), Expect = 0.0
 Identities = 413/514 (80%), Positives = 458/514 (89%), Gaps = 2/514 (0%)
 Frame = -3

Query: 1807 DNGHSNSMYSKNS--YEDFQSNNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLI 1634
            +NGH +S    +S  +EDF+SNNHLRRLVRNGELEEG  FLE+MV HGDIPDIIPCTSLI
Sbjct: 106  NNGHISSGMENSSLNFEDFESNNHLRRLVRNGELEEGFKFLESMVYHGDIPDIIPCTSLI 165

Query: 1633 RGFCRVGKTKKATRVLEILENSGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDV 1454
            RGFC+VGKT+KATRV+EI+E+SGA+PDVITYNVLISGYC+LGEIDNAL VL+RMSV PDV
Sbjct: 166  RGFCKVGKTRKATRVMEIVEDSGAVPDVITYNVLISGYCRLGEIDNALQVLERMSVAPDV 225

Query: 1453 VTYNTILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDE 1274
            VTYNTILR+LCDSGKL  AMEVL  QL+KECYPDVITYTILIEATC+ESGVGQAMKL+DE
Sbjct: 226  VTYNTILRTLCDSGKLNLAMEVLHKQLEKECYPDVITYTILIEATCKESGVGQAMKLLDE 285

Query: 1273 MRSKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGR 1094
            MR+KGC PDVVTYNVL+NGICKEGRLDEAIKFLN+MPSYGCQPNVITHNIILRSMCSTGR
Sbjct: 286  MRNKGCIPDVVTYNVLVNGICKEGRLDEAIKFLNDMPSYGCQPNVITHNIILRSMCSTGR 345

Query: 1093 WMDAEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNP 914
            WMDAE+LL++M+ KGCSPSVVTFNILINFLCRKGLLGRAI+IL+KMP+HGCTPNSLSYNP
Sbjct: 346  WMDAERLLAEMVHKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPKHGCTPNSLSYNP 405

Query: 913  LLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKG 734
            +LHGFCKEKKMDRAIEYL+IMVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+LNQL +K 
Sbjct: 406  VLHGFCKEKKMDRAIEYLEIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSNKH 465

Query: 733  CAPVLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESI 554
            C+PVLITYNTVIDGLSK+G TE+A         KGL+PD ITYSSLV GLSREGKVDE+I
Sbjct: 466  CSPVLITYNTVIDGLSKVGKTEQAMKLLEEMRTKGLKPDTITYSSLVGGLSREGKVDEAI 525

Query: 553  KFFHDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGI 374
            K FHDLE +G+RPN ITYNS+MLGLCKARQT RAID LA MV++ CKPTEATYTILIEGI
Sbjct: 526  KLFHDLERLGVRPNVITYNSIMLGLCKARQTYRAIDILADMVTRSCKPTEATYTILIEGI 585

Query: 373  XXXXXXXXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
                           LC RGVVK+SSA+ VAVKM
Sbjct: 586  AYEGLAKEALDLLNQLCSRGVVKKSSAEQVAVKM 619


>ref|XP_004170776.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Cucumis sativus]
          Length = 665

 Score =  835 bits (2157), Expect = 0.0
 Identities = 413/550 (75%), Positives = 471/550 (85%), Gaps = 5/550 (0%)
 Frame = -3

Query: 1906 NYPSRKYRRNPNPVLAMYGSNGKLPTPS---NTHLSDNGHSNSMYSKNSY--EDFQSNNH 1742
            +Y S +      P +  + SNG+L       +THL+ +  S+S YS +S   E+ ++NNH
Sbjct: 57   SYGSEEQLVRAVPRVDTFSSNGRLSHGEKNLHTHLNGSSSSSSSYSNHSQSSEEVENNNH 116

Query: 1741 LRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGFCRVGKTKKATRVLEILENSGA 1562
            LRRLVRNGELEEG  FLE+MV  GDIPDII CTSLIRG C+ GKT KATRV+EILE+SGA
Sbjct: 117  LRRLVRNGELEEGFKFLEDMVCRGDIPDIIACTSLIRGLCKTGKTWKATRVMEILEDSGA 176

Query: 1561 IPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTYNTILRSLCDSGKLRQAMEVLD 1382
            +PDVITYNVLISGYCK GEI +AL +LDRMSV PDVVTYNTILR+LCDSGKL++AMEVLD
Sbjct: 177  VPDVITYNVLISGYCKTGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKEAMEVLD 236

Query: 1381 LQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMRSKGCKPDVVTYNVLINGICKEG 1202
             Q+Q+ECYPDVITYTILIEATC+ESGVGQAMKL+DEMR KGCKPDVVTYNVLINGICKEG
Sbjct: 237  RQMQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEG 296

Query: 1201 RLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVVTFN 1022
            RLDEAI+FLN+MPSYGCQPNVITHNIILRSMCSTGRWMDAEK L++M+RKGCSPSVVTFN
Sbjct: 297  RLDEAIRFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKFLAEMIRKGCSPSVVTFN 356

Query: 1021 ILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSR 842
            ILINFLCRKGL+GRAI++L+KMP+HGCTPNSLSYNPLLH  CK+KKM+RAIEYLDIMVSR
Sbjct: 357  ILINFLCRKGLIGRAIDVLEKMPQHGCTPNSLSYNPLLHALCKDKKMERAIEYLDIMVSR 416

Query: 841  GCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGCAPVLITYNTVIDGLSKMGNTERA 662
            GCYPDIVTYNTLLTALCKDGKV+VAVE+LNQLGSKGC+PVLITYNTVIDGLSK+G T+ A
Sbjct: 417  GCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTDDA 476

Query: 661  XXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKFFHDLEGIGIRPNAITYNSVMLG 482
                     KGL+PDIITYS+LV GLSREGKVDE+I FFHDLE +G++PNAITYNS+MLG
Sbjct: 477  IKLLDEMKGKGLKPDIITYSTLVGGLSREGKVDEAIAFFHDLEEMGVKPNAITYNSIMLG 536

Query: 481  LCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIXXXXXXXXXXXXXXXLCRRGVVKR 302
            LCKARQTVRAIDFLAYMV++GCKPTE +Y ILIEG+               LC RGVVK+
Sbjct: 537  LCKARQTVRAIDFLAYMVARGCKPTETSYMILIEGLAYEGLAKEALELLNELCSRGVVKK 596

Query: 301  SSAQNVAVKM 272
            SSA+ V VK+
Sbjct: 597  SSAEQVVVKI 606



 Score =  163 bits (413), Expect = 3e-37
 Identities = 96/336 (28%), Positives = 163/336 (48%), Gaps = 46/336 (13%)
 Frame = -3

Query: 1168 MPSYGC-----------------------QPNVITH-----------------------N 1127
            +PSYG                        + N+ TH                       N
Sbjct: 55   IPSYGSEEQLVRAVPRVDTFSSNGRLSHGEKNLHTHLNGSSSSSSSYSNHSQSSEEVENN 114

Query: 1126 IILRSMCSTGRWMDAEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEH 947
              LR +   G   +  K L DM+ +G  P ++    LI  LC+ G   +A  +++ + + 
Sbjct: 115  NHLRRLVRNGELEEGFKFLEDMVCRGDIPDIIACTSLIRGLCKTGKTWKATRVMEILEDS 174

Query: 946  GCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVA 767
            G  P+ ++YN L+ G+CK  ++  A++ LD M      PD+VTYNT+L  LC  GK++ A
Sbjct: 175  GAVPDVITYNVLISGYCKTGEIGSALQLLDRM---SVSPDVVTYNTILRTLCDSGKLKEA 231

Query: 766  VELLNQLGSKGCAPVLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAG 587
            +E+L++   + C P +ITY  +I+   K     +A        +KG +PD++TY+ L+ G
Sbjct: 232  MEVLDRQMQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLING 291

Query: 586  LSREGKVDESIKFFHDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPT 407
            + +EG++DE+I+F + +   G +PN IT+N ++  +C   + + A  FLA M+ KGC P+
Sbjct: 292  ICKEGRLDEAIRFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKFLAEMIRKGCSPS 351

Query: 406  EATYTILIEGIXXXXXXXXXXXXXXXLCRRGVVKRS 299
              T+ ILI  +                CR+G++ R+
Sbjct: 352  VVTFNILINFL----------------CRKGLIGRA 371


>ref|XP_004288876.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Fragaria vesca subsp. vesca]
          Length = 605

 Score =  833 bits (2153), Expect = 0.0
 Identities = 417/526 (79%), Positives = 458/526 (87%)
 Frame = -3

Query: 1849 SNGKLPTPSNTHLSDNGHSNSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVFLENMVSHG 1670
            SNG+L    N   +  G+ N   S N  E+F+SNN LRRLVRNGELEEG   LE+MV  G
Sbjct: 83   SNGRL---KNVEKTPYGNLNGGDSSNGLEEFESNNQLRRLVRNGELEEGFRLLESMVYQG 139

Query: 1669 DIPDIIPCTSLIRGFCRVGKTKKATRVLEILENSGAIPDVITYNVLISGYCKLGEIDNAL 1490
            DIPDII CTSLIRGFC+ GKT+KATR++ ILE SGA+ DVITYNVLISGYC+ GEIDNAL
Sbjct: 140  DIPDIIACTSLIRGFCKSGKTRKATRIMNILEESGAVLDVITYNVLISGYCRAGEIDNAL 199

Query: 1489 NVLDRMSVPPDVVTYNTILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILIEATCRE 1310
             VLDRMSV PDVVTYNTILR+LCDSGKL+QAMEVLD QLQ+ECYPDVITYTILIEATC+E
Sbjct: 200  RVLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQLQRECYPDVITYTILIEATCKE 259

Query: 1309 SGVGQAMKLIDEMRSKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITH 1130
            SGV QAMKL+DEM+SKGCKPDVVTYNVLINGICKEGRLDEAI+FLNNMP   CQPNVITH
Sbjct: 260  SGVEQAMKLLDEMKSKGCKPDVVTYNVLINGICKEGRLDEAIEFLNNMPPSDCQPNVITH 319

Query: 1129 NIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPE 950
            NIILRSMCSTGRWMDAE+LL++M+ KGCSPSVVTFNILINFLCRKGLLGRAI+IL+KMP+
Sbjct: 320  NIILRSMCSTGRWMDAERLLAEMVGKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPK 379

Query: 949  HGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEV 770
            HGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKV+V
Sbjct: 380  HGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVDV 439

Query: 769  AVELLNQLGSKGCAPVLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVA 590
            AVE+LNQL SKGC+PVLITYNTVIDGLSK+G TERA        +KGL+PDIITYSSLV 
Sbjct: 440  AVEILNQLSSKGCSPVLITYNTVIDGLSKVGKTERAIELLEEMRKKGLKPDIITYSSLVG 499

Query: 589  GLSREGKVDESIKFFHDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKP 410
            GLSREGKVDE+IKF  DLEG+G+RPNAIT+N +MLGLCKARQT RAIDFLA+M+SKGCKP
Sbjct: 500  GLSREGKVDEAIKFVRDLEGMGVRPNAITFNCIMLGLCKARQTSRAIDFLAHMISKGCKP 559

Query: 409  TEATYTILIEGIXXXXXXXXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
            TEATYTILIEGI               LC RGVVKRSSA+ VAVK+
Sbjct: 560  TEATYTILIEGIAYEGLAEEALELLNELCYRGVVKRSSAEQVAVKI 605


>ref|XP_003523769.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Glycine max]
          Length = 602

 Score =  828 bits (2140), Expect = 0.0
 Identities = 414/534 (77%), Positives = 458/534 (85%), Gaps = 7/534 (1%)
 Frame = -3

Query: 1852 GSNGKLP----TPSNTHLSDNGHSNSMYSKN---SYEDFQSNNHLRRLVRNGELEEGLVF 1694
            G NG+L     TP N  L+  G  +S    N   S+E+F SN HLR+LVRNGELEEGL F
Sbjct: 70   GLNGRLQQIVSTP-NGDLNVIGMESSPIGVNGSRSFEEFASNIHLRKLVRNGELEEGLKF 128

Query: 1693 LENMVSHGDIPDIIPCTSLIRGFCRVGKTKKATRVLEILENSGAIPDVITYNVLISGYCK 1514
            LE M+  GDIPD+I CTSLIRGFCR GKTKKATR++EILENSGA+PDVITYNVLI GYCK
Sbjct: 129  LERMIYQGDIPDVIACTSLIRGFCRSGKTKKATRIMEILENSGAVPDVITYNVLIGGYCK 188

Query: 1513 LGEIDNALNVLDRMSVPPDVVTYNTILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTI 1334
             GEID AL VL+RMSV PDVVTYNTILRSLCDSGKL++AMEVLD QLQ+ECYPDVITYTI
Sbjct: 189  SGEIDKALEVLERMSVAPDVVTYNTILRSLCDSGKLKEAMEVLDRQLQRECYPDVITYTI 248

Query: 1333 LIEATCRESGVGQAMKLIDEMRSKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYG 1154
            LIEATC +SGVGQAMKL+DEMR KGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYG
Sbjct: 249  LIEATCNDSGVGQAMKLLDEMRKKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYG 308

Query: 1153 CQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAI 974
            C+PNVITHNIILRSMCSTGRWMDAE+LLSDMLRKGCSPSVVTFNILINFLCRK LLGRAI
Sbjct: 309  CKPNVITHNIILRSMCSTGRWMDAERLLSDMLRKGCSPSVVTFNILINFLCRKRLLGRAI 368

Query: 973  NILDKMPEHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTAL 794
            ++L+KMP+HGC PNSLSYNPLLHGFC+EKKMDRAIEYL+IMVSRGCYPDIVTYNTLLTAL
Sbjct: 369  DVLEKMPKHGCVPNSLSYNPLLHGFCQEKKMDRAIEYLEIMVSRGCYPDIVTYNTLLTAL 428

Query: 793  CKDGKVEVAVELLNQLGSKGCAPVLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDI 614
            CKDGKV+ AVE+LNQL SKGC+PVLITYNTVIDGL+K+G TE A         KGL+PDI
Sbjct: 429  CKDGKVDAAVEILNQLSSKGCSPVLITYNTVIDGLTKVGKTEYAVELLEEMRRKGLKPDI 488

Query: 613  ITYSSLVAGLSREGKVDESIKFFHDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAY 434
            ITYS+L+ GL REGKVDE+IK FHD+EG+ I+P+A+TYN++MLGLCKA+QT RAIDFLAY
Sbjct: 489  ITYSTLLRGLGREGKVDEAIKIFHDMEGLSIKPSAVTYNAIMLGLCKAQQTSRAIDFLAY 548

Query: 433  MVSKGCKPTEATYTILIEGIXXXXXXXXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
            MV KGCKPTEATYTILIEGI               LC RG VK+SSA+ V VKM
Sbjct: 549  MVEKGCKPTEATYTILIEGIADEGLAEEALELLNELCSRGFVKKSSAEQVVVKM 602


>ref|XP_007137661.1| hypothetical protein PHAVU_009G145100g [Phaseolus vulgaris]
            gi|561010748|gb|ESW09655.1| hypothetical protein
            PHAVU_009G145100g [Phaseolus vulgaris]
          Length = 600

 Score =  822 bits (2124), Expect = 0.0
 Identities = 414/568 (72%), Positives = 470/568 (82%), Gaps = 12/568 (2%)
 Frame = -3

Query: 1939 VLSSDCDVRKSNYPSRKYR-RNPNPVLAMY-----GSNGKLPTPSNTHLSD-NG-----H 1796
            +LS+D     +   + ++R R+ + V A+      G NG+L     T   D NG      
Sbjct: 33   ILSADTVTHFTKLKATRFRKRSESRVFAVSKSETSGLNGRLQQIVRTPNGDLNGIAMESS 92

Query: 1795 SNSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGFCRV 1616
             N +    ++E+F SN HLR+LVRNGELEEGL FLE M+  GDIPD+I CTSLIRGFC+ 
Sbjct: 93   GNGVNCSRNFEEFASNIHLRKLVRNGELEEGLKFLERMIYQGDIPDVIACTSLIRGFCKG 152

Query: 1615 GKTKKATRVLEILENSGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTYNTI 1436
            GKTKKATRV+EILENSGA+PDVITYNVLISGYCK G+ID AL VL+RMSV PDVVTYNTI
Sbjct: 153  GKTKKATRVMEILENSGAVPDVITYNVLISGYCKSGDIDRALQVLERMSVAPDVVTYNTI 212

Query: 1435 LRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMRSKGC 1256
            LRSLC SGKL++AMEVLD QLQ+ECYPDVITYTILIEATC ESGVGQAMKL+DEMR+KGC
Sbjct: 213  LRSLCSSGKLKEAMEVLDRQLQRECYPDVITYTILIEATCNESGVGQAMKLLDEMRNKGC 272

Query: 1255 KPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEK 1076
            KPDVVTYNVLINGICKEGRLDEAIKFLN+MPSYGCQPNVITHNIILRSMCSTGRWMDAE+
Sbjct: 273  KPDVVTYNVLINGICKEGRLDEAIKFLNSMPSYGCQPNVITHNIILRSMCSTGRWMDAER 332

Query: 1075 LLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLHGFC 896
            LL+DMLRKGCSPSVVTFNILINFLCRK LLGRAI++L+KMP+HGC PNSLSYNPLLHGFC
Sbjct: 333  LLADMLRKGCSPSVVTFNILINFLCRKRLLGRAIDVLEKMPKHGCVPNSLSYNPLLHGFC 392

Query: 895  KEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGCAPVLI 716
            +EKKMDRAIEYL+IMVSRGCYPDIVTYNTLLTALCKDGKV+ A+E+LNQL SKGC+PVL+
Sbjct: 393  QEKKMDRAIEYLEIMVSRGCYPDIVTYNTLLTALCKDGKVDAAIEILNQLSSKGCSPVLV 452

Query: 715  TYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKFFHDL 536
            TYNTVIDGL+K+G TE A         KGL+PDIITYSSL+ GL REGKVD++IK F D+
Sbjct: 453  TYNTVIDGLAKVGKTESAVELLEEMRRKGLKPDIITYSSLLRGLGREGKVDKAIKIFRDM 512

Query: 535  EGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIXXXXXX 356
            EG+ I+PNAITYNS+M GLCKA+QT RAIDFLAYMV +GC+PTE TYTILIEGI      
Sbjct: 513  EGLSIKPNAITYNSIMFGLCKAQQTSRAIDFLAYMVEQGCRPTEVTYTILIEGIADEGLA 572

Query: 355  XXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
                     LC RG VK+SSA+ VAVKM
Sbjct: 573  EEALELLNVLCSRGFVKKSSAEQVAVKM 600


>ref|XP_003527866.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Glycine max]
          Length = 603

 Score =  822 bits (2122), Expect = 0.0
 Identities = 405/534 (75%), Positives = 453/534 (84%), Gaps = 7/534 (1%)
 Frame = -3

Query: 1852 GSNGKLPTPSNTHLSD-------NGHSNSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVF 1694
            G NG+L    +T   D       +   N +    S+E+F SN HLR+LVRNGELEEGL F
Sbjct: 70   GMNGRLQQIVSTPNGDLNGIGMESSSPNGVNGSRSFEEFASNIHLRKLVRNGELEEGLKF 129

Query: 1693 LENMVSHGDIPDIIPCTSLIRGFCRVGKTKKATRVLEILENSGAIPDVITYNVLISGYCK 1514
            LE M+  GDIPD+I CTSLIRGFCR GKT+KATR++EILENSGA+PDVITYNVLI GYCK
Sbjct: 130  LERMIYQGDIPDVIACTSLIRGFCRSGKTRKATRIMEILENSGAVPDVITYNVLIGGYCK 189

Query: 1513 LGEIDNALNVLDRMSVPPDVVTYNTILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTI 1334
             GEID AL VL+RMSV PDVVTYNTILRSLCDSGKL++AMEVLD Q+Q+ECYPDVITYTI
Sbjct: 190  SGEIDKALQVLERMSVAPDVVTYNTILRSLCDSGKLKEAMEVLDRQMQRECYPDVITYTI 249

Query: 1333 LIEATCRESGVGQAMKLIDEMRSKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYG 1154
            LIEATC +SGVGQAMKL+DEMR KGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMP YG
Sbjct: 250  LIEATCNDSGVGQAMKLLDEMRKKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPLYG 309

Query: 1153 CQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAI 974
            CQPNVITHNIILRSMCSTGRWMDAE+LL+DMLRKGCSPSVVTFNILINFLCRK LLGRAI
Sbjct: 310  CQPNVITHNIILRSMCSTGRWMDAERLLADMLRKGCSPSVVTFNILINFLCRKRLLGRAI 369

Query: 973  NILDKMPEHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTAL 794
            ++L+KMP+HGC PNSLSYNPLLHGFC+EKKMDRAIEYL+IMVSRGCYPDIVTYNTLLTAL
Sbjct: 370  DVLEKMPKHGCMPNSLSYNPLLHGFCQEKKMDRAIEYLEIMVSRGCYPDIVTYNTLLTAL 429

Query: 793  CKDGKVEVAVELLNQLGSKGCAPVLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDI 614
            CKDGK + AVE+LNQL SKGC+PVLITYNTVIDGL+K+G TE A         KGL+PDI
Sbjct: 430  CKDGKADAAVEILNQLSSKGCSPVLITYNTVIDGLTKVGKTEYAAELLEEMRRKGLKPDI 489

Query: 613  ITYSSLVAGLSREGKVDESIKFFHDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAY 434
            ITYS+L+ GL  EGKVDE+IK FHD+EG+ I+P+A+TYN++MLGLCKA+QT RAIDFLAY
Sbjct: 490  ITYSTLLRGLGCEGKVDEAIKIFHDMEGLSIKPSAVTYNAIMLGLCKAQQTSRAIDFLAY 549

Query: 433  MVSKGCKPTEATYTILIEGIXXXXXXXXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
            MV KGCKPT+ATYTILIEGI               LC RG VK+SSA+ VAVKM
Sbjct: 550  MVEKGCKPTKATYTILIEGIADEGLAEEALELLNELCSRGFVKKSSAEQVAVKM 603


>ref|XP_002889775.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297335617|gb|EFH66034.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 598

 Score =  815 bits (2106), Expect = 0.0
 Identities = 400/532 (75%), Positives = 457/532 (85%), Gaps = 5/532 (0%)
 Frame = -3

Query: 1852 GSNG---KLPTPSNTHLSDNGHSNSMYSKNSY--EDFQSNNHLRRLVRNGELEEGLVFLE 1688
            G NG   K  T ++ H + NG+ +   + +S+  ED +SNNHLR+LVR GELEEG  FLE
Sbjct: 67   GLNGRAQKFDTLASGHSNSNGNGHFSSANSSFVLEDVESNNHLRQLVRTGELEEGFKFLE 126

Query: 1687 NMVSHGDIPDIIPCTSLIRGFCRVGKTKKATRVLEILENSGAIPDVITYNVLISGYCKLG 1508
            NMV HG++PDIIPCT+LIRGFCR+GKT+KA ++LE+LE SGA+PDVITYNV+ISGYCK G
Sbjct: 127  NMVYHGNVPDIIPCTTLIRGFCRMGKTRKAAKILEVLEGSGAVPDVITYNVMISGYCKAG 186

Query: 1507 EIDNALNVLDRMSVPPDVVTYNTILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILI 1328
            EI+NAL+VLDRMSV PDVVTYNTILRSLCDSGKL+QAMEVLD  LQ++CYPDVITYTILI
Sbjct: 187  EINNALSVLDRMSVSPDVVTYNTILRSLCDSGKLKQAMEVLDRMLQRDCYPDVITYTILI 246

Query: 1327 EATCRESGVGQAMKLIDEMRSKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQ 1148
            EATCR+SGVGQAMKL+DEMR +GC PDVVTYNVL+NGICKEGRLDEAIKFLN+MPS GCQ
Sbjct: 247  EATCRDSGVGQAMKLLDEMRDRGCTPDVVTYNVLVNGICKEGRLDEAIKFLNDMPSSGCQ 306

Query: 1147 PNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINI 968
            PNVITHNIILRSMCSTGRWMDAEKLL+DMLRKG SPSVVTFNILINFLCRKGLLGRAI+I
Sbjct: 307  PNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTFNILINFLCRKGLLGRAIDI 366

Query: 967  LDKMPEHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCK 788
            L+KMP+HGC PNSLSYNPLLHGFCKEKKMDRAIEYL+ MVSRGCYPDIVTYNT+LTALCK
Sbjct: 367  LEKMPKHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRGCYPDIVTYNTMLTALCK 426

Query: 787  DGKVEVAVELLNQLGSKGCAPVLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIIT 608
            DGKVE AVE+LNQL SKGC+PVLITYNTVIDGL+K G T +A         K L+PD IT
Sbjct: 427  DGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTIT 486

Query: 607  YSSLVAGLSREGKVDESIKFFHDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMV 428
            YSSLV GLSREGKVDE+IKFFH+ E +G+RPNA+T+NS+MLGLCK RQT RAIDFL YM+
Sbjct: 487  YSSLVGGLSREGKVDEAIKFFHEFERMGVRPNAVTFNSIMLGLCKTRQTDRAIDFLVYMI 546

Query: 427  SKGCKPTEATYTILIEGIXXXXXXXXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
            ++GCKPTE +YTILIEG+               LC +G++KRSSA+ VA KM
Sbjct: 547  NRGCKPTETSYTILIEGLAYEGMAKEALELLNELCNKGLMKRSSAEQVAGKM 598


>ref|NP_172461.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|122215618|sp|Q3EDF8.1|PPR28_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g09900 gi|332190391|gb|AEE28512.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 598

 Score =  811 bits (2094), Expect = 0.0
 Identities = 393/511 (76%), Positives = 447/511 (87%)
 Frame = -3

Query: 1804 NGHSNSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGF 1625
            NGH +S+ S  + ED +SNNHLR++VR GELEEG  FLENMV HG++PDIIPCT+LIRGF
Sbjct: 88   NGHYSSVNSSFALEDVESNNHLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGF 147

Query: 1624 CRVGKTKKATRVLEILENSGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTY 1445
            CR+GKT+KA ++LEILE SGA+PDVITYNV+ISGYCK GEI+NAL+VLDRMSV PDVVTY
Sbjct: 148  CRLGKTRKAAKILEILEGSGAVPDVITYNVMISGYCKAGEINNALSVLDRMSVSPDVVTY 207

Query: 1444 NTILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMRS 1265
            NTILRSLCDSGKL+QAMEVLD  LQ++CYPDVITYTILIEATCR+SGVG AMKL+DEMR 
Sbjct: 208  NTILRSLCDSGKLKQAMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRD 267

Query: 1264 KGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMD 1085
            +GC PDVVTYNVL+NGICKEGRLDEAIKFLN+MPS GCQPNVITHNIILRSMCSTGRWMD
Sbjct: 268  RGCTPDVVTYNVLVNGICKEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMD 327

Query: 1084 AEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLH 905
            AEKLL+DMLRKG SPSVVTFNILINFLCRKGLLGRAI+IL+KMP+HGC PNSLSYNPLLH
Sbjct: 328  AEKLLADMLRKGFSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLH 387

Query: 904  GFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGCAP 725
            GFCKEKKMDRAIEYL+ MVSRGCYPDIVTYNT+LTALCKDGKVE AVE+LNQL SKGC+P
Sbjct: 388  GFCKEKKMDRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSP 447

Query: 724  VLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKFF 545
            VLITYNTVIDGL+K G T +A         K L+PD ITYSSLV GLSREGKVDE+IKFF
Sbjct: 448  VLITYNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFF 507

Query: 544  HDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIXXX 365
            H+ E +GIRPNA+T+NS+MLGLCK+RQT RAIDFL +M+++GCKP E +YTILIEG+   
Sbjct: 508  HEFERMGIRPNAVTFNSIMLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYE 567

Query: 364  XXXXXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
                        LC +G++K+SSA+ VA KM
Sbjct: 568  GMAKEALELLNELCNKGLMKKSSAEQVAGKM 598


>ref|XP_006306156.1| hypothetical protein CARUB_v10011677mg, partial [Capsella rubella]
            gi|482574867|gb|EOA39054.1| hypothetical protein
            CARUB_v10011677mg, partial [Capsella rubella]
          Length = 609

 Score =  774 bits (1998), Expect = 0.0
 Identities = 381/517 (73%), Positives = 436/517 (84%)
 Frame = -3

Query: 1822 NTHLSDNGHSNSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCT 1643
            +T+ + NGH ++  S  + ED +SNNHLR+LVR GELEEG  FLENMV HG++PDIIPCT
Sbjct: 84   HTNSNGNGHYSTANSSFALEDVESNNHLRQLVRTGELEEGFRFLENMVYHGNVPDIIPCT 143

Query: 1642 SLIRGFCRVGKTKKATRVLEILENSGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVP 1463
            +LIRGFCR+GKT+KA ++LEILE SGA+PDVITYNV+ISGYCK GEI NAL+VLDRMSV 
Sbjct: 144  TLIRGFCRMGKTRKAAKILEILEGSGAVPDVITYNVMISGYCKAGEISNALSVLDRMSVS 203

Query: 1462 PDVVTYNTILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKL 1283
            PDVVTYNTILRSLCDSGKL+QAMEVLD  LQ++             +TCR+SGVGQAMKL
Sbjct: 204  PDVVTYNTILRSLCDSGKLKQAMEVLDRMLQRD-------------STCRDSGVGQAMKL 250

Query: 1282 IDEMRSKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCS 1103
            +DEMR +GC PDVVTYNVL+NGICKEGRL+EAIKFLN+MPS GCQPNVITHNIILRSMCS
Sbjct: 251  LDEMRDRGCTPDVVTYNVLVNGICKEGRLNEAIKFLNDMPSSGCQPNVITHNIILRSMCS 310

Query: 1102 TGRWMDAEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLS 923
            TGRWMDAEKLL+DMLRKG SPSVVTFNILINFLCRKGLLGRAI+IL+KMP HGC PNSLS
Sbjct: 311  TGRWMDAEKLLADMLRKGFSPSVVTFNILINFLCRKGLLGRAIDILEKMPNHGCQPNSLS 370

Query: 922  YNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLG 743
            YNPLLHGFCKEKKMDRAIEYL+ MVSRGCYPDIVTYNT+LTALCKDGKVE AVE+LNQL 
Sbjct: 371  YNPLLHGFCKEKKMDRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLS 430

Query: 742  SKGCAPVLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVD 563
            SKGC+PVLITYNTVIDGL+K G T +A         K L+PD ITYSSLV GLSREGKVD
Sbjct: 431  SKGCSPVLITYNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVD 490

Query: 562  ESIKFFHDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILI 383
            E+IKFFH+ E +GIRPNA+T+NS+MLGLCK RQT RAIDFL YM+++GCKPTE +YTILI
Sbjct: 491  EAIKFFHEFERMGIRPNAVTFNSIMLGLCKTRQTDRAIDFLVYMINRGCKPTETSYTILI 550

Query: 382  EGIXXXXXXXXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
            EG+               LC +G++K+SSA+ VA K+
Sbjct: 551  EGLAYEGMAKEALELLNELCNKGLMKKSSAEQVAGKI 587


>ref|XP_004501057.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Cicer arietinum]
          Length = 591

 Score =  760 bits (1963), Expect = 0.0
 Identities = 376/518 (72%), Positives = 428/518 (82%)
 Frame = -3

Query: 1810 SDNGHSNSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIR 1631
            + NG  +S     + E+  +N++L +LVR G+LE+G  FLE M   GD+PD+I CT+LIR
Sbjct: 74   NSNGIQSSSIDSQNLEEIDNNSYLVKLVRIGKLEQGFRFLERMSYQGDMPDVIACTNLIR 133

Query: 1630 GFCRVGKTKKATRVLEILENSGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVV 1451
             FC+ GKTKKATRVL+ILE+SGA+PDVITYNVLISGYCK GE++ AL VL+RMSV PDVV
Sbjct: 134  QFCKTGKTKKATRVLQILEDSGAVPDVITYNVLISGYCKSGEVEEALQVLERMSVSPDVV 193

Query: 1450 TYNTILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEM 1271
            TYNTILRSLCDSGKL+QAMEVLD QL++ CYPDVITYTILIEA C+ESGVG+AMKL D M
Sbjct: 194  TYNTILRSLCDSGKLKQAMEVLDRQLERVCYPDVITYTILIEAICKESGVGEAMKLFDAM 253

Query: 1270 RSKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRW 1091
            R KGCKPDV T+NVLING CKEGRLD+AIKFLN+M SYGC+PNVITHNIILRS+C TGRW
Sbjct: 254  RIKGCKPDVFTFNVLINGFCKEGRLDKAIKFLNDMSSYGCEPNVITHNIILRSLCGTGRW 313

Query: 1090 MDAEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPL 911
             DAE LLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAI+IL+KM  HGCTPNSLSYNPL
Sbjct: 314  RDAESLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMKNHGCTPNSLSYNPL 373

Query: 910  LHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGC 731
            LHGFC+EKKMDRAIEYL++MVSRGCYPDIVTYNTLLTALCKDGKV+VA+ELLNQL SKGC
Sbjct: 374  LHGFCQEKKMDRAIEYLEVMVSRGCYPDIVTYNTLLTALCKDGKVDVALELLNQLSSKGC 433

Query: 730  APVLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIK 551
            +PV ITYNTVI GLSK+G TERA         KGL+PD++TYSSL+AG  REGKVD +IK
Sbjct: 434  SPVAITYNTVIGGLSKVGATERAMKLLDEMCRKGLKPDVVTYSSLIAGFIREGKVDVAIK 493

Query: 550  FFHDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIX 371
             FH+LE +GIR NA+TYNS+M GLCKAR+T  AID LA M++KGCKPTEATYTILIEGI 
Sbjct: 494  IFHELERLGIRANAVTYNSIMSGLCKARRTSHAIDLLARMIAKGCKPTEATYTILIEGIA 553

Query: 370  XXXXXXXXXXXXXXLCRRGVVKRSSAQNVAVKM*GFPS 257
                          L  RG VK+SSA  VA      PS
Sbjct: 554  YEGLAEEALGLLNELSSRGFVKKSSADKVAELKYNIPS 591


>ref|XP_006856585.1| hypothetical protein AMTR_s00046p00202770 [Amborella trichopoda]
            gi|548860466|gb|ERN18052.1| hypothetical protein
            AMTR_s00046p00202770 [Amborella trichopoda]
          Length = 585

 Score =  757 bits (1954), Expect = 0.0
 Identities = 371/512 (72%), Positives = 429/512 (83%)
 Frame = -3

Query: 1807 DNGHSNSMYSKNSYEDFQSNNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRG 1628
            ++G  ++  +   ++DF+SN+ L+R VRNGELEE LVFLENM  +G+IPDIIPCTSLIRG
Sbjct: 94   NSGSFSTSVTSPEFDDFESNDLLKRHVRNGELEEALVFLENMARNGEIPDIIPCTSLIRG 153

Query: 1627 FCRVGKTKKATRVLEILENSGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVT 1448
            FC++GKTKK TRV+EI+  SGA+PDVITYNVLISGYCK GE+DNAL VL+RMS  PDVVT
Sbjct: 154  FCKIGKTKKGTRVMEIIHESGAVPDVITYNVLISGYCKSGEVDNALLVLERMSCSPDVVT 213

Query: 1447 YNTILRSLCDSGKLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMR 1268
            YNTILRSLCD GKL+QAMEVLD  + + C+PDVITYTILIEATC+ESGVGQAMKL+DEMR
Sbjct: 214  YNTILRSLCDEGKLKQAMEVLDRMMNRGCFPDVITYTILIEATCKESGVGQAMKLLDEMR 273

Query: 1267 SKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWM 1088
            SKGCKPDVVTYNVLINGICKEG+L+EAIKFLN+MPSYGC+PNVITHNIILRSMCSTGRWM
Sbjct: 274  SKGCKPDVVTYNVLINGICKEGKLNEAIKFLNSMPSYGCRPNVITHNIILRSMCSTGRWM 333

Query: 1087 DAEKLLSDMLRKGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLL 908
            DAEKLLS+M+  GCSPSVVTFNILINFLCRKGL+ RAI++L++MPEHGCTPNSLSYNP+L
Sbjct: 334  DAEKLLSEMIENGCSPSVVTFNILINFLCRKGLMRRAIDVLERMPEHGCTPNSLSYNPIL 393

Query: 907  HGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKGCA 728
            HGFCKEK MDR IEYL++MV RGC+PDIVTYNTLLTALCKDGKV+ A+E+L QL SKGC+
Sbjct: 394  HGFCKEKNMDRVIEYLEVMVLRGCFPDIVTYNTLLTALCKDGKVDAALEILRQLRSKGCS 453

Query: 727  PVLITYNTVIDGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKF 548
            PVLITYNTVIDGLSKMG TE A                           REGKVD++I+F
Sbjct: 454  PVLITYNTVIDGLSKMGKTEEAIELQMR--------------------CREGKVDKAIEF 493

Query: 547  FHDLEGIGIRPNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGIXX 368
            F ++EG GI PNAITYN+++LGLCKA++T +AIDFLA+MVSKGCKPTE+TYTILIEG+  
Sbjct: 494  FFEMEGKGIGPNAITYNALILGLCKAQRTGQAIDFLAHMVSKGCKPTESTYTILIEGVAN 553

Query: 367  XXXXXXXXXXXXXLCRRGVVKRSSAQNVAVKM 272
                         LC RGVVKRSSAQNVAV M
Sbjct: 554  EGRPKEALNLLNELCERGVVKRSSAQNVAVNM 585


>gb|EPS61251.1| hypothetical protein M569_13548, partial [Genlisea aurea]
          Length = 488

 Score =  691 bits (1783), Expect = 0.0
 Identities = 338/468 (72%), Positives = 402/468 (85%), Gaps = 9/468 (1%)
 Frame = -3

Query: 1750 NNHLRRLVRNGELEEGLVFLENMVSHGDIPDIIPCTSLIRGFCRVGKTKKATRVLEILEN 1571
            N+ LRRLVR+G+LE+ L  ++ MVS  +IPDIIPCTSLIRGFCR GKT KAT V++ILE 
Sbjct: 1    NSSLRRLVRHGQLEKALRHIQGMVSQREIPDIIPCTSLIRGFCRAGKTNKATVVMQILEE 60

Query: 1570 SGAIPDVITYNVLISGYCKLGEIDNALNVLDRMSVPPDVVTYNTILRSLCDSG------- 1412
            SGA PD+ITYNVLISG+CKLGE+ NAL +L+ M+V PDVVTYNTILR+LC+ G       
Sbjct: 61   SGAAPDLITYNVLISGFCKLGEVGNALQLLESMTVAPDVVTYNTILRALCNGGGGGGGGR 120

Query: 1411 -KLRQAMEVLDLQLQKECYPDVITYTILIEATCRESGVGQAMKLIDEMRSKGCKPDVVTY 1235
             +L +AMEV+D  L KEC+PDVITYTILIEAT +E+GV QAM+L+D+M+ +GCKPD+VTY
Sbjct: 121  GRLSEAMEVIDRMLLKECHPDVITYTILIEATLKENGVDQAMELLDDMKRRGCKPDIVTY 180

Query: 1234 NVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLSDMLR 1055
            NVLI+GICKEG+LDEAIKFL+ M SYGC+PNVITHNIILRSMCSTGRWMDAEKLLS+ML 
Sbjct: 181  NVLIDGICKEGKLDEAIKFLDTMSSYGCRPNVITHNIILRSMCSTGRWMDAEKLLSEMLV 240

Query: 1054 KGCSPSVVTFNILINFLCRKGLLGRAINILDKMPEHGCTPNSLSYNPLLHGFCKEKKMDR 875
            KGCSPSVVTFNILINFLCRKGLL RA+++L++MPE+GCTPNSLSYN LLH FCKEKKMD 
Sbjct: 241  KGCSPSVVTFNILINFLCRKGLLLRAVDVLERMPENGCTPNSLSYNSLLHTFCKEKKMDS 300

Query: 874  AIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLNQLGSKG-CAPVLITYNTVI 698
            A+EYL++MVSRGCYPDIVTYNT+LTALC+DGKV+ AV +LN+L SKG C+PVLITYNTVI
Sbjct: 301  ALEYLELMVSRGCYPDIVTYNTMLTALCRDGKVDAAVAILNRLRSKGRCSPVLITYNTVI 360

Query: 697  DGLSKMGNTERAXXXXXXXXEKGLQPDIITYSSLVAGLSREGKVDESIKFFHDLEGIGIR 518
            DGLSKMG T+ A         +GL+PD+IT SS++ GLS+EGKV+ES++FF  LEG GIR
Sbjct: 361  DGLSKMGRTDEAMELLVEMRGRGLRPDVITCSSIMMGLSKEGKVEESVEFFESLEGSGIR 420

Query: 517  PNAITYNSVMLGLCKARQTVRAIDFLAYMVSKGCKPTEATYTILIEGI 374
            PNA  YNS+MLG+CKAR+T RAIDFL  MV  GCKPTE+TYTILIEG+
Sbjct: 421  PNANIYNSMMLGMCKARRTDRAIDFLDRMVDGGCKPTESTYTILIEGL 468



 Score =  229 bits (583), Expect = 6e-57
 Identities = 128/368 (34%), Positives = 204/368 (55%), Gaps = 39/368 (10%)
 Frame = -3

Query: 1720 GELEEGLVFLENMVSHGDIPDIIPCTSLIRGFCRVGKTKKATRVLEILENSGAIPDVITY 1541
            G L E +  ++ M+     PD+I  T LI    +     +A  +L+ ++  G  PD++TY
Sbjct: 121  GRLSEAMEVIDRMLLKECHPDVITYTILIEATLKENGVDQAMELLDDMKRRGCKPDIVTY 180

Query: 1540 NVLISGYCKLGEIDNALNVLDRMS---VPPDVVTYNTILRSLCDSGKLRQAMEVLDLQLQ 1370
            NVLI G CK G++D A+  LD MS     P+V+T+N ILRS+C +G+   A ++L   L 
Sbjct: 181  NVLIDGICKEGKLDEAIKFLDTMSSYGCRPNVITHNIILRSMCSTGRWMDAEKLLSEMLV 240

Query: 1369 KECYPDVITYTILIEATCRESGVGQAMKLIDEMRSKGCKPDVVTYNVLINGICKEGRLDE 1190
            K C P V+T+ ILI   CR+  + +A+ +++ M   GC P+ ++YN L++  CKE ++D 
Sbjct: 241  KGCSPSVVTFNILINFLCRKGLLLRAVDVLERMPENGCTPNSLSYNSLLHTFCKEKKMDS 300

Query: 1189 AIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRKG-CSPSVVTFNILI 1013
            A+++L  M S GC P+++T+N +L ++C  G+   A  +L+ +  KG CSP ++T+N +I
Sbjct: 301  ALEYLELMVSRGCYPDIVTYNTMLTALCRDGKVDAAVAILNRLRSKGRCSPVLITYNTVI 360

Query: 1012 NFLCRKGLLGRAINILDKMPEHG-------CT---------------------------- 938
            + L + G    A+ +L +M   G       C+                            
Sbjct: 361  DGLSKMGRTDEAMELLVEMRGRGLRPDVITCSSIMMGLSKEGKVEESVEFFESLEGSGIR 420

Query: 937  PNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVEVAVEL 758
            PN+  YN ++ G CK ++ DRAI++LD MV  GC P   TY  L+  L K+G  E A+EL
Sbjct: 421  PNANIYNSMMLGMCKARRTDRAIDFLDRMVDGGCKPTESTYTILIEGLSKEGLSEEALEL 480

Query: 757  LNQLGSKG 734
            LN+L S+G
Sbjct: 481  LNELRSRG 488

