BLASTX nr result

ID: Catharanthus23_contig00000364 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00000364
         (3200 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006355855.1| PREDICTED: pentatricopeptide repeat-containi...  1166   0.0  
ref|XP_004240565.1| PREDICTED: pentatricopeptide repeat-containi...  1159   0.0  
ref|XP_004240564.1| PREDICTED: pentatricopeptide repeat-containi...  1159   0.0  
ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containi...  1119   0.0  
gb|EOX95298.1| S uncoupled 1 [Theobroma cacao]                       1065   0.0  
ref|XP_006492356.1| PREDICTED: pentatricopeptide repeat-containi...  1057   0.0  
ref|XP_002515260.1| pentatricopeptide repeat-containing protein,...  1057   0.0  
ref|XP_006444533.1| hypothetical protein CICLE_v10018807mg [Citr...  1056   0.0  
gb|EXB28566.1| hypothetical protein L484_009725 [Morus notabilis]    1054   0.0  
ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containi...  1050   0.0  
ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containi...  1049   0.0  
ref|XP_002320970.2| hypothetical protein POPTR_0014s11380g [Popu...  1048   0.0  
ref|XP_006386713.1| hypothetical protein POPTR_0002s19470g [Popu...  1046   0.0  
gb|EMJ22752.1| hypothetical protein PRUPE_ppa001263mg [Prunus pe...  1034   0.0  
ref|XP_004288538.1| PREDICTED: pentatricopeptide repeat-containi...  1029   0.0  
ref|XP_002301519.2| hypothetical protein POPTR_0002s19470g [Popu...  1019   0.0  
ref|XP_006417966.1| hypothetical protein EUTSA_v10006755mg [Eutr...  1001   0.0  
ref|XP_006293642.1| hypothetical protein CARUB_v10022597mg [Caps...   969   0.0  
ref|NP_180698.1| pentatricopeptide-repeat protein GUN1 [Arabidop...   962   0.0  
ref|XP_002881173.1| pentatricopeptide repeat-containing protein ...   962   0.0  

>ref|XP_006355855.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Solanum tuberosum]
          Length = 848

 Score = 1166 bits (3017), Expect = 0.0
 Identities = 606/853 (71%), Positives = 686/853 (80%)
 Frame = -2

Query: 2914 MASSTPPPHYTLSSSKPYXXXXXXXXXXXXXXXXXXXXXXXSQKVSLNNRSSHXXXXXXX 2735
            MASSTPPPH  L++SKPY                       SQKVSLN  +         
Sbjct: 1    MASSTPPPHCALTTSKPYHPHPLTQTHSHPNHRNNHQRHWSSQKVSLNRPAPPRNATHPP 60

Query: 2734 XXXXXXXXXXXXXXXXPTTFPSLSKSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTAT 2555
                                 S SKS+ +ADF+GRRSTRFVSKMHFGR K +   RH++ 
Sbjct: 61   PSQTPNFLSL-----------SSSKSDFSADFSGRRSTRFVSKMHFGRAKISGNGRHSSF 109

Query: 2554 AEEALQLAIRSRGDDSCMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVS 2375
            AEEAL+ AIR   +++ +D++LL F  KL G DDYTFL RELGNRGE    MRCF+FAV 
Sbjct: 110  AEEALEEAIRCCKNEAGLDQVLLTFGSKLLGSDDYTFLFRELGNRGEWLAAMRCFEFAVG 169

Query: 2374 RERKRNEQGKLTSSMISILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYC 2195
            RERKRNEQGKL SSMISILGR GKVDLA KVFE AV++GYGNTVYAYSALISAYAKSGYC
Sbjct: 170  RERKRNEQGKLASSMISILGRSGKVDLAEKVFENAVSDGYGNTVYAYSALISAYAKSGYC 229

Query: 2194 DDAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNS 2015
            ++AIRVFETMKDSGLKPNLVTYNALIDACGKGGADF+RASEIF+EMLRNGVQPDRIT+NS
Sbjct: 230  NEAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFKRASEIFDEMLRNGVQPDRITFNS 289

Query: 2014 LLAVCSGASLWETAKCLFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKN 1835
            LLAVCSGA LWETA+ LFNEM+Y+GIDQDIYTYNT LDAACNGG +D AF+IMSEM  KN
Sbjct: 290  LLAVCSGAGLWETARGLFNEMIYRGIDQDIYTYNTFLDAACNGGQIDVAFDIMSEMHAKN 349

Query: 1834 IMPNEVTYSTMIRGCAKAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEAL 1655
            I+PN+VTYST+IRGCAKAG+LDRAL+LFNEMK AGI LDRVSYNTLLAIYASLG+F EAL
Sbjct: 350  ILPNQVTYSTVIRGCAKAGRLDRALSLFNEMKCAGITLDRVSYNTLLAIYASLGKFEEAL 409

Query: 1654 AVGEEMESIGIKKDVVTYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVF 1475
             V +EMES+GIKKDVVTYNALLDGFGKQGMY KV +LF  MKAE LSPNLLTYSTLISV+
Sbjct: 410  NVSKEMESMGIKKDVVTYNALLDGFGKQGMYIKVKQLFAEMKAEKLSPNLLTYSTLISVY 469

Query: 1474 SKGGLYHDAMQVYKEFKHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNV 1295
             KG LYHDA++VYKEFK QGLKADVVFYSKL+D+LCK GLVE S LLL+EM K+GIQPNV
Sbjct: 470  LKGALYHDAVEVYKEFKKQGLKADVVFYSKLIDALCKKGLVEYSSLLLNEMTKEGIQPNV 529

Query: 1294 VTYNSIINAFGWSVATDHPLDSEEHMKSSALVTADVTESNSEDKDKDTIITIFEQLAAGK 1115
            VTYNSIINAFG S + +   D+   + S+      +++S  E+ ++D I+ IFEQLAA K
Sbjct: 530  VTYNSIINAFGESASNECGSDNVTQIVST------ISQSKWENTEEDNIVKIFEQLAAQK 583

Query: 1114 SASFEKDNSGKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXX 935
            SAS +K N+ +QD LC+LGVF KMHE++IKPNVVTFSAILNACSRC+SF           
Sbjct: 584  SASGKKTNAERQDILCILGVFHKMHELQIKPNVVTFSAILNACSRCSSFDEASLLLEELR 643

Query: 934  LFDNQVYGVAHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGA 755
            +FDNQVYGVAHGLLMG  E VW QALSLF+EVKQMDSSTASAFYNALTDMLWHF Q++GA
Sbjct: 644  IFDNQVYGVAHGLLMGQREGVWAQALSLFNEVKQMDSSTASAFYNALTDMLWHFDQKQGA 703

Query: 754  QMVVLEGKRRQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIM 575
            Q+VVLEGKR +VWE+  S S LDLHLMSSGAA AMVHAWLLSIRSIVF+G ELPK+LSI+
Sbjct: 704  QLVVLEGKRSEVWENTWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVFEGHELPKMLSIL 763

Query: 574  TGWGKHSKVVGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVL 395
            TGWGKHSK+ GDGALKRAIE LLTSI APF VAKCNIGRFISTG+VVTAW+RESGTL+VL
Sbjct: 764  TGWGKHSKITGDGALKRAIEGLLTSIGAPFQVAKCNIGRFISTGAVVTAWLRESGTLEVL 823

Query: 394  VLQDDRIHPESSR 356
            VLQDD  H  ++R
Sbjct: 824  VLQDDTSHLRATR 836


>ref|XP_004240565.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like isoform 2 [Solanum lycopersicum]
          Length = 829

 Score = 1159 bits (2999), Expect = 0.0
 Identities = 603/855 (70%), Positives = 684/855 (80%), Gaps = 2/855 (0%)
 Frame = -2

Query: 2914 MASSTPPPHYTLSSSKPYXXXXXXXXXXXXXXXXXXXXXXXSQKVSLN--NRSSHXXXXX 2741
            MASSTPPPH  L++SKPY                        QKVSLN     +H     
Sbjct: 1    MASSTPPPHCALTTSKPYQPQTHSHPHPNHRNNHQRHWSS--QKVSLNPPRNPNHPSQTP 58

Query: 2740 XXXXXXXXXXXXXXXXXXPTTFPSLSKSELAADFTGRRSTRFVSKMHFGRPKTAAASRHT 2561
                                   S SKS+ +ADF+GRRSTRFVSKMHFGR K +   RH+
Sbjct: 59   NFLSL------------------SSSKSDFSADFSGRRSTRFVSKMHFGRAKISGNGRHS 100

Query: 2560 ATAEEALQLAIRSRGDDSCMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFA 2381
            + A+EAL+ AIR   +++ +D++LL F  KL G DDYTFL RELGNRGE    MRCF FA
Sbjct: 101  SFAQEALEEAIRCCNNEAGLDQVLLTFGSKLVGSDDYTFLFRELGNRGEWLAAMRCFQFA 160

Query: 2380 VSRERKRNEQGKLTSSMISILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSG 2201
            V RERKRNEQGKL SSMISILGR GKVDLA KVFE AV++GYG+TVYAYSALISAYAKSG
Sbjct: 161  VGRERKRNEQGKLASSMISILGRSGKVDLAEKVFENAVSDGYGSTVYAYSALISAYAKSG 220

Query: 2200 YCDDAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITY 2021
            YC++AIRVFETMKDSGLKPNLVTYNALIDACGKGGADF+RASEIF+EMLRNGVQPDRIT+
Sbjct: 221  YCNEAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFKRASEIFDEMLRNGVQPDRITF 280

Query: 2020 NSLLAVCSGASLWETAKCLFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSL 1841
            NSLLAVCSGA LWETA+ LFNEM+Y+GIDQDIYTYNT LD ACNGG +D AF+IMSEM  
Sbjct: 281  NSLLAVCSGAGLWETARGLFNEMIYRGIDQDIYTYNTFLDVACNGGQIDVAFDIMSEMHA 340

Query: 1840 KNIMPNEVTYSTMIRGCAKAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHE 1661
            KNI+PN+VTYST+IRGCAKAG+LD+AL+LFNEMK AGIKLDRVSYNTLLAIYASLG+F E
Sbjct: 341  KNILPNQVTYSTVIRGCAKAGRLDKALSLFNEMKCAGIKLDRVSYNTLLAIYASLGKFEE 400

Query: 1660 ALAVGEEMESIGIKKDVVTYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLIS 1481
            AL V +EME +GIKKDVVTYNALLDGFGKQGMY KV +LF  MKAE LSPNLLTYSTLIS
Sbjct: 401  ALNVSKEMEGMGIKKDVVTYNALLDGFGKQGMYTKVKQLFAEMKAEKLSPNLLTYSTLIS 460

Query: 1480 VFSKGGLYHDAMQVYKEFKHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQP 1301
            V+ KG LYHDA++VYKEFK QGLKADVVFYSKL+D+LCK GLVE S LLL+EM K+GIQP
Sbjct: 461  VYLKGALYHDAVEVYKEFKKQGLKADVVFYSKLIDALCKKGLVEYSSLLLNEMTKEGIQP 520

Query: 1300 NVVTYNSIINAFGWSVATDHPLDSEEHMKSSALVTADVTESNSEDKDKDTIITIFEQLAA 1121
            NVVTYNSIINAFG S   +   D+  H+ S+      +++S  E+ ++D I+ IFEQLAA
Sbjct: 521  NVVTYNSIINAFGESANNECGSDNVTHIVSA------ISQSKWENTEEDNIVKIFEQLAA 574

Query: 1120 GKSASFEKDNSGKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXX 941
             KSAS +K N+ +QD LC+LGVF KMHE++IKPNVVTFSAILNACSRC+SF         
Sbjct: 575  QKSASGKKTNAERQDMLCILGVFHKMHELQIKPNVVTFSAILNACSRCSSFDEASLLLEE 634

Query: 940  XXLFDNQVYGVAHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRR 761
              LFDNQVYGVAHGLLMG  E VW QALSLF+EVKQMDSSTASAFYNALTDMLWHF Q++
Sbjct: 635  LRLFDNQVYGVAHGLLMGQREGVWSQALSLFNEVKQMDSSTASAFYNALTDMLWHFDQKQ 694

Query: 760  GAQMVVLEGKRRQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLS 581
            GAQ+VVLEGKR +VWE+  S S LDLHLMSSGAA AMVHAWLLSIRSIVF+G ELPK+LS
Sbjct: 695  GAQLVVLEGKRSEVWENTWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVFEGHELPKMLS 754

Query: 580  IMTGWGKHSKVVGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLK 401
            I+TGWGKHSK+ GDGALKRAIE LLTSI APF +AKCNIGRFISTG+VVTAW+RESGTL+
Sbjct: 755  ILTGWGKHSKITGDGALKRAIEGLLTSIGAPFQIAKCNIGRFISTGAVVTAWLRESGTLE 814

Query: 400  VLVLQDDRIHPESSR 356
            VLVLQDD  H  ++R
Sbjct: 815  VLVLQDDTSHLRATR 829


>ref|XP_004240564.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like isoform 1 [Solanum lycopersicum]
          Length = 841

 Score = 1159 bits (2999), Expect = 0.0
 Identities = 603/855 (70%), Positives = 684/855 (80%), Gaps = 2/855 (0%)
 Frame = -2

Query: 2914 MASSTPPPHYTLSSSKPYXXXXXXXXXXXXXXXXXXXXXXXSQKVSLN--NRSSHXXXXX 2741
            MASSTPPPH  L++SKPY                        QKVSLN     +H     
Sbjct: 1    MASSTPPPHCALTTSKPYQPQTHSHPHPNHRNNHQRHWSS--QKVSLNPPRNPNHPSQTP 58

Query: 2740 XXXXXXXXXXXXXXXXXXPTTFPSLSKSELAADFTGRRSTRFVSKMHFGRPKTAAASRHT 2561
                                   S SKS+ +ADF+GRRSTRFVSKMHFGR K +   RH+
Sbjct: 59   NFLSL------------------SSSKSDFSADFSGRRSTRFVSKMHFGRAKISGNGRHS 100

Query: 2560 ATAEEALQLAIRSRGDDSCMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFA 2381
            + A+EAL+ AIR   +++ +D++LL F  KL G DDYTFL RELGNRGE    MRCF FA
Sbjct: 101  SFAQEALEEAIRCCNNEAGLDQVLLTFGSKLVGSDDYTFLFRELGNRGEWLAAMRCFQFA 160

Query: 2380 VSRERKRNEQGKLTSSMISILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSG 2201
            V RERKRNEQGKL SSMISILGR GKVDLA KVFE AV++GYG+TVYAYSALISAYAKSG
Sbjct: 161  VGRERKRNEQGKLASSMISILGRSGKVDLAEKVFENAVSDGYGSTVYAYSALISAYAKSG 220

Query: 2200 YCDDAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITY 2021
            YC++AIRVFETMKDSGLKPNLVTYNALIDACGKGGADF+RASEIF+EMLRNGVQPDRIT+
Sbjct: 221  YCNEAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFKRASEIFDEMLRNGVQPDRITF 280

Query: 2020 NSLLAVCSGASLWETAKCLFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSL 1841
            NSLLAVCSGA LWETA+ LFNEM+Y+GIDQDIYTYNT LD ACNGG +D AF+IMSEM  
Sbjct: 281  NSLLAVCSGAGLWETARGLFNEMIYRGIDQDIYTYNTFLDVACNGGQIDVAFDIMSEMHA 340

Query: 1840 KNIMPNEVTYSTMIRGCAKAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHE 1661
            KNI+PN+VTYST+IRGCAKAG+LD+AL+LFNEMK AGIKLDRVSYNTLLAIYASLG+F E
Sbjct: 341  KNILPNQVTYSTVIRGCAKAGRLDKALSLFNEMKCAGIKLDRVSYNTLLAIYASLGKFEE 400

Query: 1660 ALAVGEEMESIGIKKDVVTYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLIS 1481
            AL V +EME +GIKKDVVTYNALLDGFGKQGMY KV +LF  MKAE LSPNLLTYSTLIS
Sbjct: 401  ALNVSKEMEGMGIKKDVVTYNALLDGFGKQGMYTKVKQLFAEMKAEKLSPNLLTYSTLIS 460

Query: 1480 VFSKGGLYHDAMQVYKEFKHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQP 1301
            V+ KG LYHDA++VYKEFK QGLKADVVFYSKL+D+LCK GLVE S LLL+EM K+GIQP
Sbjct: 461  VYLKGALYHDAVEVYKEFKKQGLKADVVFYSKLIDALCKKGLVEYSSLLLNEMTKEGIQP 520

Query: 1300 NVVTYNSIINAFGWSVATDHPLDSEEHMKSSALVTADVTESNSEDKDKDTIITIFEQLAA 1121
            NVVTYNSIINAFG S   +   D+  H+ S+      +++S  E+ ++D I+ IFEQLAA
Sbjct: 521  NVVTYNSIINAFGESANNECGSDNVTHIVSA------ISQSKWENTEEDNIVKIFEQLAA 574

Query: 1120 GKSASFEKDNSGKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXX 941
             KSAS +K N+ +QD LC+LGVF KMHE++IKPNVVTFSAILNACSRC+SF         
Sbjct: 575  QKSASGKKTNAERQDMLCILGVFHKMHELQIKPNVVTFSAILNACSRCSSFDEASLLLEE 634

Query: 940  XXLFDNQVYGVAHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRR 761
              LFDNQVYGVAHGLLMG  E VW QALSLF+EVKQMDSSTASAFYNALTDMLWHF Q++
Sbjct: 635  LRLFDNQVYGVAHGLLMGQREGVWSQALSLFNEVKQMDSSTASAFYNALTDMLWHFDQKQ 694

Query: 760  GAQMVVLEGKRRQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLS 581
            GAQ+VVLEGKR +VWE+  S S LDLHLMSSGAA AMVHAWLLSIRSIVF+G ELPK+LS
Sbjct: 695  GAQLVVLEGKRSEVWENTWSTSCLDLHLMSSGAACAMVHAWLLSIRSIVFEGHELPKMLS 754

Query: 580  IMTGWGKHSKVVGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLK 401
            I+TGWGKHSK+ GDGALKRAIE LLTSI APF +AKCNIGRFISTG+VVTAW+RESGTL+
Sbjct: 755  ILTGWGKHSKITGDGALKRAIEGLLTSIGAPFQIAKCNIGRFISTGAVVTAWLRESGTLE 814

Query: 400  VLVLQDDRIHPESSR 356
            VLVLQDD  H  ++R
Sbjct: 815  VLVLQDDTSHLRATR 829


>ref|XP_002271180.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic [Vitis vinifera]
          Length = 867

 Score = 1119 bits (2895), Expect = 0.0
 Identities = 570/787 (72%), Positives = 658/787 (83%), Gaps = 11/787 (1%)
 Frame = -2

Query: 2683 TTFPSLS-----KSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSR 2519
            + FPSLS     KSEL ADF+GRRSTRFVSKMHFGRPKTAAA+RHT+TAEEAL+ AIR  
Sbjct: 69   SNFPSLSPLPPSKSELTADFSGRRSTRFVSKMHFGRPKTAAAARHTSTAEEALRHAIRFA 128

Query: 2518 GDDSCMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLT 2339
             DD  +D +LL+FE +L G DDYTFLLRELGNRGE    +RCF+FAV RE++RNEQGKL 
Sbjct: 129  SDDKGIDSVLLNFESRLCGSDDYTFLLRELGNRGEWAKAIRCFEFAVRREQRRNEQGKLA 188

Query: 2338 SSMISILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKD 2159
            S+MISILGRLG+V+LA+ VFETA+NEGYGNTVYA+SALISAY +SGYCD+AI+VFETMK 
Sbjct: 189  SAMISILGRLGQVELAKNVFETALNEGYGNTVYAFSALISAYGRSGYCDEAIKVFETMKS 248

Query: 2158 SGLKPNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWE 1979
            SGLKPNLVTYNA+IDACGKGG DF RA+EIF+EMLRNGVQPDRIT+NSLLAVC    LWE
Sbjct: 249  SGLKPNLVTYNAVIDACGKGGVDFNRAAEIFDEMLRNGVQPDRITFNSLLAVCGRGGLWE 308

Query: 1978 TAKCLFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMI 1799
             A+ LF+EM+Y+GI+QDI+TYNTLLDA C GG MD AF+IMSEM  K+IMPN VTYST+I
Sbjct: 309  AARNLFSEMLYRGIEQDIFTYNTLLDAVCKGGQMDLAFQIMSEMPRKHIMPNVVTYSTVI 368

Query: 1798 RGCAKAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIK 1619
             G AKAG+LD AL LFNEMK+A I LDRVSYNTLL+IYA LGRF EAL V +EMES GIK
Sbjct: 369  DGYAKAGRLDEALNLFNEMKFASIGLDRVSYNTLLSIYAKLGRFEEALNVCKEMESSGIK 428

Query: 1618 KDVVTYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQV 1439
            KD VTYNALL G+GKQG Y +V  +F+ MKAE + PNLLTYSTLI V+SKGGLY +AM+V
Sbjct: 429  KDAVTYNALLGGYGKQGKYEEVKRVFEEMKAERIFPNLLTYSTLIDVYSKGGLYQEAMEV 488

Query: 1438 YKEFKHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGW 1259
            ++EFK  GLKADVV YS L+D+LCKNGLVES+V  LDEM K+GI+PNVVTYNSII+AFG 
Sbjct: 489  FREFKKAGLKADVVLYSALIDALCKNGLVESAVSFLDEMTKEGIRPNVVTYNSIIDAFGR 548

Query: 1258 SVATDHPLD-----SEEHMKSSAL-VTADVTESNSEDKDKDTIITIFEQLAAGKSASFEK 1097
            S + +  +D     +   M SS+L V  D TES   DK+ + II IF QLAA K+   +K
Sbjct: 549  SGSAECVIDPPYETNVSKMSSSSLKVVEDATESEVGDKEDNQIIKIFGQLAAEKTCHAKK 608

Query: 1096 DNSGKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQV 917
            +N G+Q+ LC+L VF KMHE++IKPNVVTFSAILNACSRCNSF           LFDNQV
Sbjct: 609  ENRGRQEILCILAVFHKMHELDIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQV 668

Query: 916  YGVAHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLE 737
            YGVAHGLLMG+ + VW+QA SLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQ+VVLE
Sbjct: 669  YGVAHGLLMGYGDNVWVQAQSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQLVVLE 728

Query: 736  GKRRQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKH 557
            GKRR VWE+  S+S LDLHLMSSGAARAMVHAWLL+IRSIVF+G ELP+LLSI+TGWGKH
Sbjct: 729  GKRRHVWENMWSNSCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHELPQLLSILTGWGKH 788

Query: 556  SKVVGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDR 377
            SKVVGDGAL+RAIEALLT + APF VAKCN+GRFISTG+VV AW+RESGTLKVLVL DDR
Sbjct: 789  SKVVGDGALRRAIEALLTGMGAPFRVAKCNLGRFISTGAVVAAWLRESGTLKVLVLHDDR 848

Query: 376  IHPESSR 356
             +P+ +R
Sbjct: 849  TNPDRAR 855


>gb|EOX95298.1| S uncoupled 1 [Theobroma cacao]
          Length = 866

 Score = 1065 bits (2755), Expect = 0.0
 Identities = 537/778 (69%), Positives = 635/778 (81%), Gaps = 5/778 (0%)
 Frame = -2

Query: 2677 FPSLSKSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGDDSCMD 2498
            FPSL     A DF+GRRSTRFVSKMH GRPKT+  +RHT+ AEE LQLA+ +    S ++
Sbjct: 83   FPSL-----APDFSGRRSTRFVSKMHLGRPKTSTNTRHTSIAEEVLQLALHN--GHSGLE 135

Query: 2497 RILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSSMISIL 2318
            R+L+ FE KL G DDYTFLLRELGNRGE    ++CF FAV RER++ EQGKL S+MISIL
Sbjct: 136  RVLVSFESKLCGSDDYTFLLRELGNRGEYEKAIKCFQFAVRRERRKTEQGKLASAMISIL 195

Query: 2317 GRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSGLKPNL 2138
            GRLGKV+LA+ +FETA+ EGYGNTVYA+SALISA+ +SGY D+AI+VF++MK++GLKPNL
Sbjct: 196  GRLGKVELAKGIFETALTEGYGNTVYAFSALISAFGRSGYSDEAIKVFDSMKNNGLKPNL 255

Query: 2137 VTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETAKCLFN 1958
            VTYNA+IDACGKGG +F+R  EIF+EMLR+GVQPDRIT+NSLLAVCS   LWE A+ LF+
Sbjct: 256  VTYNAVIDACGKGGVEFKRVVEIFDEMLRSGVQPDRITFNSLLAVCSRGGLWEAARNLFS 315

Query: 1957 EMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRGCAKAG 1778
            EMV++GIDQDI+TYNTLLDA C GG MD AFEIM+EM  KNI+PN VTYSTMI G AKAG
Sbjct: 316  EMVHRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMAEMPTKNILPNVVTYSTMIDGYAKAG 375

Query: 1777 KLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKDVVTYN 1598
            + D AL LFNEMK+ GI LDRVSYNT+L+IYA LGRF EAL +  EME  GI+KDVVTYN
Sbjct: 376  RFDDALNLFNEMKFLGIGLDRVSYNTVLSIYAKLGRFEEALDICREMEGSGIRKDVVTYN 435

Query: 1597 ALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYKEFKHQ 1418
            ALL G+GKQG Y++V  LF+ MK + +SPNLLTYST+I V+SKGGLY +AM V++EFK  
Sbjct: 436  ALLGGYGKQGKYDEVRRLFEEMKTQKVSPNLLTYSTVIDVYSKGGLYEEAMDVFREFKRV 495

Query: 1417 GLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSVATDHP 1238
            GLKADVV YS L+D+LCKNGLVES+V LLDEM K+GI+PNVVTYNSII+AFG S  ++  
Sbjct: 496  GLKADVVLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSATSECA 555

Query: 1237 LD-----SEEHMKSSALVTADVTESNSEDKDKDTIITIFEQLAAGKSASFEKDNSGKQDF 1073
             D     S    +SS+LV     E  + D + + +I  F QLAA K    +KD  GKQ+ 
Sbjct: 556  FDAGGEISALQTESSSLVIGHSIEGKARDGEDNQVIKFFGQLAAEKGGQAKKDCRGKQEI 615

Query: 1072 LCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYGVAHGLL 893
            LC+LGVFQKMHE+EIKPNVVTFSAILNACSRC+SF           LFDNQVYGVAHGLL
Sbjct: 616  LCILGVFQKMHELEIKPNVVTFSAILNACSRCDSFEDASMLLEELRLFDNQVYGVAHGLL 675

Query: 892  MGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGKRRQVWE 713
            MG+ E VW+QA SLFDEVK MDSSTASAFYNALTDMLWHFGQ+RGAQ+VVLEGKRRQVWE
Sbjct: 676  MGYRENVWIQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWE 735

Query: 712  SAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSKVVGDGA 533
            +  S+S LDLHLMSSGAARAMVHAWLL+IRSI+F+G ELPKLLSI+TGWGKHSKVVGDGA
Sbjct: 736  NVWSNSCLDLHLMSSGAARAMVHAWLLNIRSIIFEGHELPKLLSILTGWGKHSKVVGDGA 795

Query: 532  LKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDRIHPESS 359
            L+R +E+L T + APF +AKCN+GRF+STG VVTAW+RESGTLK+LVL DDR  PE++
Sbjct: 796  LRRTVESLFTGMGAPFRLAKCNLGRFVSTGPVVTAWLRESGTLKLLVLHDDRTQPENT 853


>ref|XP_006492356.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Citrus sinensis]
          Length = 877

 Score = 1057 bits (2734), Expect = 0.0
 Identities = 537/778 (69%), Positives = 627/778 (80%), Gaps = 9/778 (1%)
 Frame = -2

Query: 2665 SKSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGDDSCMDRILL 2486
            SKSELA DF+GRRSTRFVSKMHFGRPK A ++RH+  AEEAL        DD  +  IL 
Sbjct: 88   SKSELAPDFSGRRSTRFVSKMHFGRPKIAMSTRHSVVAEEALHHVTAFARDDVSLGDILK 147

Query: 2485 DFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSSMISILGRLG 2306
             FE KL G DDYTFLLRELGNRGE    ++CF FAV RE ++N+QGKL S+MISILGRLG
Sbjct: 148  KFEFKLCGADDYTFLLRELGNRGEWSKAIQCFAFAVKREERKNDQGKLASAMISILGRLG 207

Query: 2305 KVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSGLKPNLVTYN 2126
            KVDLA+ +FETA+NEGYGNTVYA+SALISAY +SGYC +AI VF +MK   LKPNLVTYN
Sbjct: 208  KVDLAKNIFETALNEGYGNTVYAFSALISAYGRSGYCQEAISVFNSMKRYNLKPNLVTYN 267

Query: 2125 ALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETAKCLFNEMVY 1946
            A+IDACGKGG DF+   EIF++MLRNGVQPDRIT+NSLLAVCS   LWE A+ LFNEMV+
Sbjct: 268  AVIDACGKGGVDFKHVVEIFDDMLRNGVQPDRITFNSLLAVCSRGGLWEAARNLFNEMVH 327

Query: 1945 KGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRGCAKAGKLDR 1766
            +GIDQDI+TYNTLLDA C G  MD AFEIM+EM  KNI PN VTYSTMI G AKAG+LD 
Sbjct: 328  RGIDQDIFTYNTLLDAICKGAQMDLAFEIMAEMPAKNISPNVVTYSTMIDGYAKAGRLDD 387

Query: 1765 ALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKDVVTYNALLD 1586
            AL +F+EMK+ GI LDRVSYNT+L+IYA LGRF EAL V +EMES GI+KD VTYNALL 
Sbjct: 388  ALNMFSEMKFLGIGLDRVSYNTVLSIYAKLGRFEEALLVCKEMESSGIRKDAVTYNALLG 447

Query: 1585 GFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYKEFKHQGLKA 1406
            G+GKQG Y++V  +F+ MKA+ +SPNLLTYSTLI V+SKGGLY +AMQ+++EFK  GLKA
Sbjct: 448  GYGKQGKYDEVRRMFEQMKADCVSPNLLTYSTLIDVYSKGGLYKEAMQIFREFKQAGLKA 507

Query: 1405 DVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSVATDHPLDSE 1226
            DVV YS L+D+LCKNGLVES+V LLDEM K+GI+PNVVTYNSII+AFG S  T+  +D  
Sbjct: 508  DVVLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSATTECTVDDV 567

Query: 1225 EHMKSSALVTADVTESNSEDKDKDT---------IITIFEQLAAGKSASFEKDNSGKQDF 1073
            E        +A++    S+D DKD          II +F QL A K+   +K+N  +Q+ 
Sbjct: 568  ERDLGKQKESANLDAMCSQD-DKDVQEAGRTDNQIIKVFGQLVAEKAGQGKKENRCRQEI 626

Query: 1072 LCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYGVAHGLL 893
            LC+LGVFQKMH+++IKPNVVTFSAILNACSRCNSF           LFDNQVYGVAHGLL
Sbjct: 627  LCILGVFQKMHKLKIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLL 686

Query: 892  MGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGKRRQVWE 713
            MG+ + +W+QALSLFDEVK MDSSTASAFYNALTDMLWHFGQ+RGAQ+VVLEGKRRQVWE
Sbjct: 687  MGYRDNIWVQALSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWE 746

Query: 712  SAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSKVVGDGA 533
            +  S+S LDLHLMSSGAARAMVHAWLL+I SIVF+G ELPKLLSI+TGWGKHSKVVGDGA
Sbjct: 747  NVWSESCLDLHLMSSGAARAMVHAWLLNIHSIVFEGHELPKLLSILTGWGKHSKVVGDGA 806

Query: 532  LKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDRIHPESS 359
            L+RA+E LLT + APF VA CN+GRFISTG +V +W+RESGTLKVLVL DDR H E++
Sbjct: 807  LRRAVEVLLTGMGAPFWVANCNLGRFISTGPMVASWLRESGTLKVLVLHDDRTHSENA 864


>ref|XP_002515260.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545740|gb|EEF47244.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 878

 Score = 1057 bits (2733), Expect = 0.0
 Identities = 550/859 (64%), Positives = 655/859 (76%), Gaps = 11/859 (1%)
 Frame = -2

Query: 2905 STPPPHYTLSSSKPYXXXXXXXXXXXXXXXXXXXXXXXSQKVSLNNRSSHXXXXXXXXXX 2726
            ++ PPH +++++KPY                        QKVSL                
Sbjct: 2    ASTPPHCSITATKPYQNHQYPQNHLKNHRQTHHHRWTN-QKVSLTKPPLAPSPCNAPKAA 60

Query: 2725 XXXXXXXXXXXXXPTTFPSLS-----KSELAADFTGRRSTRFVSKMHFGRPKTAAASRHT 2561
                           TF SLS     KS+L+ADF+GRRSTRFVSK+HFGRPKT   +RHT
Sbjct: 61   AAAAAATTTHHTPNPTFHSLSPLQSQKSDLSADFSGRRSTRFVSKLHFGRPKTNM-NRHT 119

Query: 2560 ATAEEALQLAIRSRGDDSCMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFA 2381
            + A EALQ  I+   DD  ++ +LL+FE +L G DDYTFLLRELGNRG+    +RCF+FA
Sbjct: 120  SVALEALQQVIQYGKDDKALENVLLNFESRLCGPDDYTFLLRELGNRGDSAKAVRCFEFA 179

Query: 2380 VSRERKRNEQGKLTSSMISILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSG 2201
            V RE  +NEQGKL S+MIS LGRLGKV+LA+ VF+TA+ EGYG TVYA+SALISAY +SG
Sbjct: 180  VRRESGKNEQGKLASAMISTLGRLGKVELAKAVFDTALKEGYGKTVYAFSALISAYGRSG 239

Query: 2200 YCDDAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITY 2021
            YC++AI+VF++MK +GL PNLVTYNA+IDACGKGG +F++  EIF+ ML NGVQPDRIT+
Sbjct: 240  YCNEAIKVFDSMKSNGLMPNLVTYNAVIDACGKGGVEFKKVVEIFDGMLSNGVQPDRITF 299

Query: 2020 NSLLAVCSGASLWETAKCLFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSL 1841
            NSLLAVCS   LWE A+ LF+ MV KGIDQDI+TYNTLLDA C GG MD AFEIMSEM  
Sbjct: 300  NSLLAVCSRGGLWEAARRLFSAMVDKGIDQDIFTYNTLLDAVCKGGQMDLAFEIMSEMPT 359

Query: 1840 KNIMPNEVTYSTMIRGCAKAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHE 1661
            KNI+PN VTYSTMI G AK G+LD AL +FNEMK+ G+ LDRVSYNTLL++YA LGRF +
Sbjct: 360  KNILPNVVTYSTMIDGYAKVGRLDDALNMFNEMKFLGVGLDRVSYNTLLSVYAKLGRFEQ 419

Query: 1660 ALAVGEEMESIGIKKDVVTYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLIS 1481
            AL V +EME+ GI+KDVVTYNALL G+GKQ  Y++V  +F+ MK   +SPNLLTYSTLI 
Sbjct: 420  ALDVCKEMENAGIRKDVVTYNALLAGYGKQYRYDEVRRVFEEMKRGRVSPNLLTYSTLID 479

Query: 1480 VFSKGGLYHDAMQVYKEFKHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQP 1301
            V+SKGGLY +AM+V++EFK  GLKADVV YS L+D+LCKNGLVESSV LLDEM K+GI+P
Sbjct: 480  VYSKGGLYKEAMEVFREFKQAGLKADVVLYSALIDALCKNGLVESSVTLLDEMTKEGIRP 539

Query: 1300 NVVTYNSIINAFGWSVATDHPLDSEEHMKS------SALVTADVTESNSEDKDKDTIITI 1139
            NVVTYNSII+AFG S +    +D      +      S++V  +  ES + DK+ + II I
Sbjct: 540  NVVTYNSIIDAFGRSASAQCVVDDSGETTALQVESLSSIVVQEAIESQAADKEDNRIIEI 599

Query: 1138 FEQLAAGKSASFEKDNSGKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXX 959
            F +LAA K+   E  NSGKQ+ LC+LGVFQKMHE++IKPNVVTFSAILNACSRC+SF   
Sbjct: 600  FGKLAAEKAC--EAKNSGKQEILCILGVFQKMHELKIKPNVVTFSAILNACSRCDSFEDA 657

Query: 958  XXXXXXXXLFDNQVYGVAHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLW 779
                    LFDNQVYGVAHGLLMG+ E VW+QA SLFDEVK MDSSTASAFYNALTDMLW
Sbjct: 658  SMLLEELRLFDNQVYGVAHGLLMGYRENVWLQAQSLFDEVKLMDSSTASAFYNALTDMLW 717

Query: 778  HFGQRRGAQMVVLEGKRRQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRE 599
            HFGQ+RGAQ+VVLEGKRRQVWE+  SDS LDLHLMSSGAARAMVHAWLL+IRSIVF+G E
Sbjct: 718  HFGQKRGAQLVVLEGKRRQVWENIWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHE 777

Query: 598  LPKLLSIMTGWGKHSKVVGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMR 419
            LPKLLSI+TGWGKHSKVVGD AL+RA+EALL  + APF +AKCN+GRFISTGSVV AW++
Sbjct: 778  LPKLLSILTGWGKHSKVVGDSALRRAVEALLIGMGAPFRLAKCNLGRFISTGSVVAAWLK 837

Query: 418  ESGTLKVLVLQDDRIHPES 362
            ESGTL+VLVL DDR HPE+
Sbjct: 838  ESGTLEVLVLHDDRTHPEN 856


>ref|XP_006444533.1| hypothetical protein CICLE_v10018807mg [Citrus clementina]
            gi|557546795|gb|ESR57773.1| hypothetical protein
            CICLE_v10018807mg [Citrus clementina]
          Length = 877

 Score = 1056 bits (2732), Expect = 0.0
 Identities = 537/778 (69%), Positives = 627/778 (80%), Gaps = 9/778 (1%)
 Frame = -2

Query: 2665 SKSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGDDSCMDRILL 2486
            SKSELA DF+GRRSTRFVSKMHFGRPK A ++RH+  AEEAL        DD  +  IL 
Sbjct: 88   SKSELAPDFSGRRSTRFVSKMHFGRPKIAMSTRHSVVAEEALHHVTAFARDDVSLGDILK 147

Query: 2485 DFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSSMISILGRLG 2306
             FE KL G DDYTFLLRELGNRGE    ++CF FAV RE ++N+QGKL S+MISILGRLG
Sbjct: 148  KFEFKLCGADDYTFLLRELGNRGEWSKAIQCFAFAVKREERKNDQGKLASAMISILGRLG 207

Query: 2305 KVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSGLKPNLVTYN 2126
            KVDLA+ +FETA+NEGYGNTVYA+SALISAY +SGYC +AI VF +MK   LKPNLVTYN
Sbjct: 208  KVDLAKNIFETALNEGYGNTVYAFSALISAYGRSGYCQEAISVFNSMKRYHLKPNLVTYN 267

Query: 2125 ALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETAKCLFNEMVY 1946
            A+IDACGKGG DF+   EIF++MLRNGVQPDRIT+NSLLAVCS   LWE A+ LFNEMV+
Sbjct: 268  AVIDACGKGGVDFKHVVEIFDDMLRNGVQPDRITFNSLLAVCSRGGLWEAARNLFNEMVH 327

Query: 1945 KGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRGCAKAGKLDR 1766
            +GIDQDI+TYNTLLDA C G  MD AFEIM+EM  KNI PN VTYSTMI G AKAG+LD 
Sbjct: 328  RGIDQDIFTYNTLLDAICKGAQMDLAFEIMAEMPAKNISPNVVTYSTMIDGYAKAGRLDD 387

Query: 1765 ALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKDVVTYNALLD 1586
            AL +F+EMK+ GI LDRVSYNT+L+IYA LGRF EAL V +EMES GI+KD VTYNALL 
Sbjct: 388  ALNMFSEMKFLGIGLDRVSYNTVLSIYAKLGRFEEALLVCKEMESSGIRKDAVTYNALLG 447

Query: 1585 GFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYKEFKHQGLKA 1406
            G+GKQG Y++V  +F+ MKA+ +SPNLLTYSTLI V+SKGGLY +AMQ+++EFK  GLKA
Sbjct: 448  GYGKQGKYDEVRRMFEQMKADCVSPNLLTYSTLIDVYSKGGLYKEAMQIFREFKQAGLKA 507

Query: 1405 DVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSVATDHPLDSE 1226
            DVV YS L+D+LCKNGLVES+V LLDEM K+GI+PNVVTYNSII+AFG S  T+  +D  
Sbjct: 508  DVVLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSATTECTVDDV 567

Query: 1225 EHMKSSALVTADVTESNSEDKDKDT---------IITIFEQLAAGKSASFEKDNSGKQDF 1073
            E        +A++    S+D DKD          II +F QL A K+   +K+N  +Q+ 
Sbjct: 568  ERDLGKQKESANLDAMCSQD-DKDVQEAGRTDNQIIKVFGQLVAEKAGQGKKENRCRQEI 626

Query: 1072 LCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYGVAHGLL 893
            LC+LGVFQKMH+++IKPNVVTFSAILNACSRCNSF           LFDNQVYGVAHGLL
Sbjct: 627  LCILGVFQKMHKLKIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGVAHGLL 686

Query: 892  MGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGKRRQVWE 713
            MG+ + +W+QALSLFDEVK MDSSTASAFYNALTDMLWHFGQ+RGAQ+VVLEGKRRQVWE
Sbjct: 687  MGYRDNIWVQALSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRQVWE 746

Query: 712  SAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSKVVGDGA 533
            +  S+S LDLHLMSSGAARAMVHAWLL+I SIVF+G ELPKLLSI+TGWGKHSKVVGDGA
Sbjct: 747  NVWSESCLDLHLMSSGAARAMVHAWLLNIHSIVFEGHELPKLLSILTGWGKHSKVVGDGA 806

Query: 532  LKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDRIHPESS 359
            L+RA+E LLT + APF VA CN+GRFISTG +V +W+RESGTLKVLVL DDR H E++
Sbjct: 807  LRRAVEVLLTGMGAPFWVANCNLGRFISTGPMVASWLRESGTLKVLVLHDDRTHSENA 864


>gb|EXB28566.1| hypothetical protein L484_009725 [Morus notabilis]
          Length = 871

 Score = 1054 bits (2726), Expect = 0.0
 Identities = 548/860 (63%), Positives = 649/860 (75%), Gaps = 10/860 (1%)
 Frame = -2

Query: 2905 STPPPHYTLSSSKPYXXXXXXXXXXXXXXXXXXXXXXXSQ----KVSLNNRSSHXXXXXX 2738
            ++ PPH ++++SKPY                        Q    KVSL   S        
Sbjct: 2    ASTPPHCSITASKPYQSHQYAQNPNLKSHHRHSNHRQGHQWTTQKVSLTKPSPSPPPARN 61

Query: 2737 XXXXXXXXXXXXXXXXXPTTFPSLSKSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTA 2558
                               + P+  KS+LAA F+GRRSTRFVSKMH GRPKT   SRHTA
Sbjct: 62   AAATPAQHASQNPAFHSLCSLPA-PKSDLAAVFSGRRSTRFVSKMHLGRPKTTVGSRHTA 120

Query: 2557 TAEEALQLAIRSRGDDSCMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAV 2378
             AEE LQ AI+   DD  +D +LL FEPKL G DDYTFLLRELGNRGE    +RCF+FAV
Sbjct: 121  VAEEVLQQAIQFGKDDLGIDNVLLSFEPKLCGSDDYTFLLRELGNRGECRKAIRCFEFAV 180

Query: 2377 SRERKRNEQGKLTSSMISILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGY 2198
            +RER++ EQGKLTS+MIS LGRLGKV+LAR VFETA+  GYGNTVY YSALISAY +SGY
Sbjct: 181  ARERRKTEQGKLTSAMISTLGRLGKVELARDVFETALFAGYGNTVYTYSALISAYGRSGY 240

Query: 2197 CDDAIRVFETMKDSGLKPNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYN 2018
             ++A RV E+MKDSGLKPNLVTYNA+IDACGKGGA+F+R  EIF+EMLRNGVQPDRITYN
Sbjct: 241  WEEARRVVESMKDSGLKPNLVTYNAVIDACGKGGAEFKRVVEIFDEMLRNGVQPDRITYN 300

Query: 2017 SLLAVCSGASLWETAKCLFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLK 1838
            SLLAVCS   LWE A+ LF+EMV + IDQDIYTYNTLLDA C GG MD A +IMSEM  K
Sbjct: 301  SLLAVCSRGGLWEAARSLFSEMVERQIDQDIYTYNTLLDAICKGGQMDLARQIMSEMPSK 360

Query: 1837 NIMPNEVTYSTMIRGCAKAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEA 1658
             I+PN VTYSTMI G AKAG+L+ AL LFNEMKY  I LDRV YNTLL+IYA LGRF EA
Sbjct: 361  KILPNVVTYSTMIDGYAKAGRLEDALNLFNEMKYLAIGLDRVLYNTLLSIYAKLGRFEEA 420

Query: 1657 LAVGEEMESIGIKKDVVTYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISV 1478
            L V +EMES GI +DVV+YNALL G+GKQG Y++V  ++Q MKA+++SPNLLTYSTLI V
Sbjct: 421  LKVCKEMESSGIVRDVVSYNALLGGYGKQGKYDEVKRMYQDMKADHVSPNLLTYSTLIDV 480

Query: 1477 FSKGGLYHDAMQVYKEFKHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPN 1298
            +SKGGLY +AM+V++EFK  GLKADVV YS+L+++LCKNG+VES+V LLDEM K+GI PN
Sbjct: 481  YSKGGLYREAMEVFREFKQAGLKADVVLYSELINALCKNGMVESAVSLLDEMTKEGIMPN 540

Query: 1297 VVTYNSIINAFGWSVATDHPLDSE------EHMKSSALVTADVTESNSEDKDKDTIITIF 1136
            V+TYNSII+AFG     D  L +       E   SS++   +  ++ + +K    II +F
Sbjct: 541  VITYNSIIDAFGRPATADSALGAAIGGNELETELSSSISNENANKNKAVNKGDHQIIKMF 600

Query: 1135 EQLAAGKSASFEKDNSGKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXX 956
             QLAA +    +KD   +Q+ LC+LGVFQKMHE+ IKPNVVTFSAILNACSRCNSF    
Sbjct: 601  GQLAAEQEGHTKKDKKIRQEILCILGVFQKMHELNIKPNVVTFSAILNACSRCNSFEDAS 660

Query: 955  XXXXXXXLFDNQVYGVAHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWH 776
                   LFDNQVYGVAHGLLMGH E VW++A SLFDEVKQMDSSTASAFYNALTDMLWH
Sbjct: 661  MLLEELRLFDNQVYGVAHGLLMGHRENVWLEAQSLFDEVKQMDSSTASAFYNALTDMLWH 720

Query: 775  FGQRRGAQMVVLEGKRRQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGREL 596
            FGQ+RGAQ+VVLEGKRR VWES  S+S LDLHLMSSGAARA++HAWLL+IRS+VF+G+EL
Sbjct: 721  FGQKRGAQLVVLEGKRRNVWESVWSNSFLDLHLMSSGAARALLHAWLLNIRSVVFEGQEL 780

Query: 595  PKLLSIMTGWGKHSKVVGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRE 416
            P+LLSI+TGWGKHSKVVGD AL+RAIE+LL S+ APF  AKCN+GRF S G +V  W++E
Sbjct: 781  PRLLSILTGWGKHSKVVGDSALRRAIESLLISMGAPFEAAKCNLGRFTSPGPMVAGWLKE 840

Query: 415  SGTLKVLVLQDDRIHPESSR 356
            SGTLKVLVL DDR H ++++
Sbjct: 841  SGTLKVLVLHDDRSHSQNAK 860


>ref|XP_004135985.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Cucumis sativus]
          Length = 868

 Score = 1050 bits (2716), Expect = 0.0
 Identities = 528/782 (67%), Positives = 634/782 (81%), Gaps = 10/782 (1%)
 Frame = -2

Query: 2677 FPSL-----SKSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGD 2513
            FPSL     SKSELA++F+GRRSTRFVSK HFGRPK++  +RH+A AEE L   ++   D
Sbjct: 73   FPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGRPKSSMTTRHSAIAEEVLHQVLQFGKD 132

Query: 2512 DSCMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSS 2333
            D+ +D ILL+FE KL G +DYTFLLRELGNRGE +  +RCFDFA+ RE ++NE+GKL S+
Sbjct: 133  DASLDNILLNFESKLCGSEDYTFLLRELGNRGECWKAIRCFDFALVREGRKNERGKLASA 192

Query: 2332 MISILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSG 2153
            MIS LGRLGKV+LA+ VFETA++EGYGNTV+A+SALISAY KSGY D+AI+VFE+MK SG
Sbjct: 193  MISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDEAIKVFESMKVSG 252

Query: 2152 LKPNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETA 1973
            LKPNLVTYNA+IDACGKGG +F+R  EIFEEMLRNGVQPDRITYNSLLAVCS   LWE A
Sbjct: 253  LKPNLVTYNAVIDACGKGGVEFKRVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAA 312

Query: 1972 KCLFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRG 1793
            + LFNEM+ +GIDQD++TYNTLLDA C GG MD A+EIM EM  K I+PN VTYSTM  G
Sbjct: 313  RNLFNEMIDRGIDQDVFTYNTLLDAVCKGGQMDLAYEIMLEMPGKKILPNVVTYSTMADG 372

Query: 1792 CAKAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKD 1613
             AKAG+L+ AL L+NEMK+ GI LDRVSYNTLL+IYA LGRF +AL V +EM S G+KKD
Sbjct: 373  YAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALKVCKEMGSSGVKKD 432

Query: 1612 VVTYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYK 1433
            VVTYNALLDG+GKQG +N+V  +F+ MK + + PNLLTYSTLI V+SKG LY +AM+V++
Sbjct: 433  VVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVFPNLLTYSTLIDVYSKGSLYEEAMEVFR 492

Query: 1432 EFKHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSV 1253
            EFK  GLKADVV YS+L+++LCKNGLV+S+VLLLDEM K+GI+PNVVTYNSII+AFG S 
Sbjct: 493  EFKQAGLKADVVLYSELINALCKNGLVDSAVLLLDEMTKEGIRPNVVTYNSIIDAFGRST 552

Query: 1252 ATDHPLD-----SEEHMKSSALVTADVTESNSEDKDKDTIITIFEQLAAGKSASFEKDNS 1088
              +  +D     +E   +S + +  +  + +  + D   +   ++QL + K    +K+  
Sbjct: 553  TAEFLVDGVGASNERQSESPSFMLIEGVDESEINWDDGHVFKFYQQLVSEKEGPAKKERL 612

Query: 1087 GKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYGV 908
            GK++   +L VF+KMHE+EIKPNVVTFSAILNACSRC S            LFDNQVYGV
Sbjct: 613  GKEEIRSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEDASMLLEELRLFDNQVYGV 672

Query: 907  AHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGKR 728
            AHGLLMG  E VW+QA  LFDEVKQMDSSTASAFYNALTDMLWHFGQ+RGAQ+VVLEGKR
Sbjct: 673  AHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKR 732

Query: 727  RQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSKV 548
            R+VWE+  SDS LDLHLMSSGAARAMVHAWLL I S+VF+G +LPKLLSI+TGWGKHSKV
Sbjct: 733  RKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFEGHQLPKLLSILTGWGKHSKV 792

Query: 547  VGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDRIHP 368
            VGDGAL+RAIEALLTS+ APF VAKCNIGR++STGSVV AW++ESGTLK+LVL DDR HP
Sbjct: 793  VGDGALRRAIEALLTSMGAPFRVAKCNIGRYVSTGSVVAAWLKESGTLKLLVLHDDRTHP 852

Query: 367  ES 362
            +S
Sbjct: 853  DS 854


>ref|XP_004166285.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Cucumis sativus]
          Length = 868

 Score = 1049 bits (2712), Expect = 0.0
 Identities = 527/782 (67%), Positives = 633/782 (80%), Gaps = 10/782 (1%)
 Frame = -2

Query: 2677 FPSL-----SKSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGD 2513
            FPSL     SKSELA++F+GRRSTRFVSK HFGRPK++  +RH+A AEE L   ++   D
Sbjct: 73   FPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGRPKSSMTTRHSAIAEEVLHQVLQFGKD 132

Query: 2512 DSCMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSS 2333
            D+ +D ILL+FE KL G +DYTFLLRELGNRGE +  +RCFDFA+ RE ++NE+GKL S+
Sbjct: 133  DASLDNILLNFESKLCGSEDYTFLLRELGNRGECWKAIRCFDFALVREGRKNERGKLASA 192

Query: 2332 MISILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSG 2153
            MIS LGRLGKV+LA+ VFETA++EGYGNTV+A+SALISAY KSGY D+AI+VFE+MK SG
Sbjct: 193  MISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDEAIKVFESMKVSG 252

Query: 2152 LKPNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETA 1973
            LKPNLVTYNA+IDACGKGG +F+R  EIFEEMLRNGVQPDRITYNSLLAVCS   LWE A
Sbjct: 253  LKPNLVTYNAVIDACGKGGVEFKRVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAA 312

Query: 1972 KCLFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRG 1793
            + LFNEM+ +GIDQD++TYNTLLDA C GG MD A+EIM EM  K I+PN VTYSTM  G
Sbjct: 313  RNLFNEMIDRGIDQDVFTYNTLLDAVCKGGQMDLAYEIMLEMPGKKILPNVVTYSTMADG 372

Query: 1792 CAKAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKD 1613
             AKAG+L+ AL L+NEMK+ GI LDRVSYNTLL+IYA LGRF +AL V +EM S G+KKD
Sbjct: 373  YAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALKVCKEMGSSGVKKD 432

Query: 1612 VVTYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYK 1433
            VVTYNALLDG+GKQG +N+V  +F+ MK + + PNLLTYSTLI V+SKG LY +AM+V++
Sbjct: 433  VVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVFPNLLTYSTLIDVYSKGSLYEEAMEVFR 492

Query: 1432 EFKHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSV 1253
            EFK  GLKADVV YS+L+++LCKNGLV+S+VLLLDEM K+GI+PNVVTYNSII+AFG S 
Sbjct: 493  EFKQAGLKADVVLYSELINALCKNGLVDSAVLLLDEMTKEGIRPNVVTYNSIIDAFGRST 552

Query: 1252 ATDHPLD-----SEEHMKSSALVTADVTESNSEDKDKDTIITIFEQLAAGKSASFEKDNS 1088
              +  +D     +E   +S   +  +  + +  + D   +   ++QL + K    +K+  
Sbjct: 553  TAEFLVDGVGASNERQSESPTFMLIEGVDESEINWDDGHVFKFYQQLVSEKEGPAKKERL 612

Query: 1087 GKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYGV 908
            GK++   +L VF+KMHE+EIKPNVVTFSAILNACSRC S            LFDNQVYGV
Sbjct: 613  GKEEIRSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEDASMLLEELRLFDNQVYGV 672

Query: 907  AHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGKR 728
            AHGLLMG  E VW+QA  LFDEVKQMDSSTASAFYNALTDMLWHFGQ+RGAQ+VVLEGKR
Sbjct: 673  AHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKR 732

Query: 727  RQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSKV 548
            R+VWE+  SDS LDLHLMSSGAARAMVHAWLL I S+VF+G +LPKLLSI+TGWGKHSKV
Sbjct: 733  RKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFEGHQLPKLLSILTGWGKHSKV 792

Query: 547  VGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDRIHP 368
            VGDGAL+RAIEALLTS+ APF VAKCNIGR++STGSVV AW++ESGTLK+LVL DDR HP
Sbjct: 793  VGDGALRRAIEALLTSMGAPFRVAKCNIGRYVSTGSVVAAWLKESGTLKLLVLHDDRTHP 852

Query: 367  ES 362
            ++
Sbjct: 853  DT 854


>ref|XP_002320970.2| hypothetical protein POPTR_0014s11380g [Populus trichocarpa]
            gi|550323986|gb|EEE99285.2| hypothetical protein
            POPTR_0014s11380g [Populus trichocarpa]
          Length = 875

 Score = 1048 bits (2710), Expect = 0.0
 Identities = 540/787 (68%), Positives = 632/787 (80%), Gaps = 8/787 (1%)
 Frame = -2

Query: 2680 TFPSLS--KSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGDDS 2507
            TFPSL   KSELA+DF+GRRSTRFVSK++FGRP+T   +RHT+ AEEALQ  I    D+ 
Sbjct: 83   TFPSLQSPKSELASDFSGRRSTRFVSKLNFGRPRTTMGTRHTSVAEEALQNVIEYGKDEG 142

Query: 2506 CMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSSMI 2327
             ++ +LL+FE +LSG DDY FLLRELGNRG+    + CF+FAV RERK+NEQGKL S+MI
Sbjct: 143  ALENVLLNFESRLSGSDDYIFLLRELGNRGDCKKAICCFEFAVKRERKKNEQGKLASAMI 202

Query: 2326 SILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSGLK 2147
            S LGRLGKV++A+ VFE A+ EGYGNTVYA+SA+ISAY +SGYCD+AI+VF++MK  GLK
Sbjct: 203  STLGRLGKVEIAKSVFEAALIEGYGNTVYAFSAIISAYGRSGYCDEAIKVFDSMKHYGLK 262

Query: 2146 PNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETAKC 1967
            PNLVTYNA+IDACGKGG +F+R  EIF+EMLRNGVQPDRIT+NSLLAVCS   LWE A+ 
Sbjct: 263  PNLVTYNAVIDACGKGGVEFKRVVEIFDEMLRNGVQPDRITFNSLLAVCSRGGLWEAARS 322

Query: 1966 LFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRGCA 1787
            L +EM+ +GIDQDI+TYNTLLDA C GG MD AFEIMSEM  KNI+PN VTYSTMI G A
Sbjct: 323  LSSEMLNRGIDQDIFTYNTLLDAVCKGGQMDMAFEIMSEMPAKNILPNVVTYSTMIDGYA 382

Query: 1786 KAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKDVV 1607
            KAG+ D AL LFNEMK+  I LDRVSYNTLL+IYA LGRF EAL V  EME+ GI+KDVV
Sbjct: 383  KAGRFDDALNLFNEMKFLCISLDRVSYNTLLSIYAKLGRFQEALDVCREMENCGIRKDVV 442

Query: 1606 TYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYKEF 1427
            TYNALL G+GKQ  Y++V  +F  MKA  +SPNLLTYSTLI V+SKGGLY +AM V++EF
Sbjct: 443  TYNALLGGYGKQCKYDEVRRVFGEMKAGRVSPNLLTYSTLIDVYSKGGLYREAMDVFREF 502

Query: 1426 KHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSVAT 1247
            K  GLKADVV YS ++D+LCKNGLVES+V LLDEM K+GI+PNVVTYNSII+AFG S  T
Sbjct: 503  KKAGLKADVVLYSAVIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRSAIT 562

Query: 1246 DHPLD-----SEEHMKS-SALVTADVTESNSEDKDKDTIITIFEQLAAGKSASFEKDNSG 1085
            +  +D     S+  ++S S+ V  + T+S   D++ + II IF QLA  K+   +  N  
Sbjct: 563  ESVVDDNVQTSQLQIESLSSGVVEEATKSLLADREGNRIIKIFGQLAVEKAG--QAKNCS 620

Query: 1084 KQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYGVA 905
             Q+ +C+L VF KMHE+EIKPNVVTFSAILNACSRCNSF           LFDNQVYGVA
Sbjct: 621  GQEMMCILAVFHKMHELEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNQVYGVA 680

Query: 904  HGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGKRR 725
            HGLLMG+ E VW QA SLFDEVK MDSSTASAFYNALTDMLWHFGQ+RGAQ+VVLEGKRR
Sbjct: 681  HGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRR 740

Query: 724  QVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSKVV 545
            QVWE+  S+S LDLHLMSSGAARAMVHAWLL+IRSIVF+G ELPKLLSI+TGWGKHSKVV
Sbjct: 741  QVWENVWSESCLDLHLMSSGAARAMVHAWLLNIRSIVFEGHELPKLLSILTGWGKHSKVV 800

Query: 544  GDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDRIHPE 365
            GD  L+RAIEALL  + APF +AKCN+GRFISTGSVV AW+RESGTLKVLVL D R   E
Sbjct: 801  GDSTLRRAIEALLMGMGAPFRLAKCNLGRFISTGSVVAAWLRESGTLKVLVLHDHRTEQE 860

Query: 364  SSRSAAA 344
            + R   A
Sbjct: 861  NLRFGQA 867


>ref|XP_006386713.1| hypothetical protein POPTR_0002s19470g [Populus trichocarpa]
            gi|550345388|gb|ERP64510.1| hypothetical protein
            POPTR_0002s19470g [Populus trichocarpa]
          Length = 873

 Score = 1046 bits (2705), Expect = 0.0
 Identities = 533/785 (67%), Positives = 630/785 (80%), Gaps = 10/785 (1%)
 Frame = -2

Query: 2680 TFPSLS--KSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGDDS 2507
            TF S    KSEL +DF GRRSTRFVSK+HFGRP+T   +RHT+ A+EALQ  I    D+ 
Sbjct: 81   TFSSFQPPKSELVSDFPGRRSTRFVSKLHFGRPRTTMGTRHTSVAQEALQNVIEYGKDER 140

Query: 2506 CMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSSMI 2327
             ++ +LL+FE +LSG DDY FLLRELGNRG+    + CF+FAV RERK+NEQGKL S+MI
Sbjct: 141  ALENVLLNFESRLSGSDDYVFLLRELGNRGDCKKAICCFEFAVKRERKKNEQGKLASAMI 200

Query: 2326 SILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSGLK 2147
            S LGRLGKV++A+ VF+ A+ EGYGNTVYA+SA+ISAY +SGYC++AI++F +MKD GLK
Sbjct: 201  STLGRLGKVEMAKTVFKAALTEGYGNTVYAFSAIISAYGRSGYCNEAIKIFYSMKDYGLK 260

Query: 2146 PNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETAKC 1967
            PNLVTYNA+IDACGKGG +F+R  EIF+EMLRNG+QPDRIT+NSLLAVCS   LWE A+ 
Sbjct: 261  PNLVTYNAVIDACGKGGVEFKRVLEIFDEMLRNGMQPDRITFNSLLAVCSKGGLWEAARS 320

Query: 1966 LFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRGCA 1787
            L  EMV +GIDQDI+TYNTLLDA C GG +D AFEIMSEM  KNI+PN VTYSTMI G A
Sbjct: 321  LSCEMVNRGIDQDIFTYNTLLDAVCKGGQLDMAFEIMSEMPAKNILPNVVTYSTMIDGYA 380

Query: 1786 KAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKDVV 1607
            KAG+LD A  LFNEMK+ GI LDRVSYNTLL+IYA LGRF EA+ V  EME+ GI+KDVV
Sbjct: 381  KAGRLDDARNLFNEMKFLGISLDRVSYNTLLSIYAKLGRFEEAMDVCREMENSGIRKDVV 440

Query: 1606 TYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYKEF 1427
            TYNALL G+GKQ  Y+ V ++F+ MKA ++SPNLLTYSTLI V+SKGGLY +AM V++EF
Sbjct: 441  TYNALLGGYGKQYKYDVVRKVFEEMKARHVSPNLLTYSTLIDVYSKGGLYREAMDVFREF 500

Query: 1426 KHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSVAT 1247
            K  GLKADVV YS L+D+LCKNGLVES+V LLDEM K+GI+PNVVTYNSII+AFG    T
Sbjct: 501  KKAGLKADVVLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRPATT 560

Query: 1246 DHPLDSE--------EHMKSSALVTADVTESNSEDKDKDTIITIFEQLAAGKSASFEKDN 1091
            +  +D          + + SSA+  A  T+S   D++ + II IF QLAA K+   +  N
Sbjct: 561  ESVVDDAGQTSELQIDSLSSSAVEKA--TKSLVADREDNRIIKIFGQLAAEKAG--QAKN 616

Query: 1090 SGKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYG 911
            SG Q+ +C+LGVF KMHE+EIKPNVVTFSAILNACSRCNSF           LFDNQVYG
Sbjct: 617  SGGQEMMCILGVFHKMHELEIKPNVVTFSAILNACSRCNSFEEASMLLEELRLFDNQVYG 676

Query: 910  VAHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGK 731
            VAHGLLMG+ E VW QA SLFDEVK MDSSTASAFYNALTDMLWHFGQ+RGAQ+VVLEGK
Sbjct: 677  VAHGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGK 736

Query: 730  RRQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSK 551
            RRQVWE+  S+S LDLHLMSSGAARAMVHAWLL++R+IVF+G E+PKLLSI+TGWGKHSK
Sbjct: 737  RRQVWENVWSESCLDLHLMSSGAARAMVHAWLLNVRAIVFEGHEVPKLLSILTGWGKHSK 796

Query: 550  VVGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDRIH 371
            VVGD  L+RA+EALL  + APF  AKCN+GR ISTGSVV +W+RESGTLKVLVL DDR H
Sbjct: 797  VVGDSTLRRAVEALLMGMGAPFRSAKCNLGRLISTGSVVASWLRESGTLKVLVLHDDRTH 856

Query: 370  PESSR 356
             E+ R
Sbjct: 857  QENLR 861


>gb|EMJ22752.1| hypothetical protein PRUPE_ppa001263mg [Prunus persica]
          Length = 868

 Score = 1034 bits (2673), Expect = 0.0
 Identities = 520/774 (67%), Positives = 621/774 (80%), Gaps = 6/774 (0%)
 Frame = -2

Query: 2662 KSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGDDSCMDRILLD 2483
            KS+L   F+GRRSTRFVSKMH GRPKT   S  +  AEEAL  A++   DD  +D ILL 
Sbjct: 82   KSDLVTAFSGRRSTRFVSKMHLGRPKTTMGSYRSPLAEEALHQAVQFGNDDLALDDILLS 141

Query: 2482 FEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSSMISILGRLGK 2303
            F  +L G DDYTFL RELGNRGE +  +RCF+FAV RE++R EQGKL SSMIS LGRLGK
Sbjct: 142  FHSRLCGSDDYTFLFRELGNRGECWKAIRCFEFAVRREKRRTEQGKLASSMISTLGRLGK 201

Query: 2302 VDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSGLKPNLVTYNA 2123
            V+LA+ VF+TAVNEGYG TVY YSALI+AY ++GYC++AIRVFE+MKDSGLKPNLVTYNA
Sbjct: 202  VELAKNVFQTAVNEGYGKTVYTYSALITAYGRNGYCEEAIRVFESMKDSGLKPNLVTYNA 261

Query: 2122 LIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETAKCLFNEMVYK 1943
            +IDA GKGG +F+R  EIF EMLRNG QPDRITYNSLLAVCS   LWE A+ LF+EMV +
Sbjct: 262  VIDAYGKGGVEFKRVVEIFNEMLRNGEQPDRITYNSLLAVCSRGGLWEMARNLFSEMVDR 321

Query: 1942 GIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRGCAKAGKLDRA 1763
            GIDQDIYTYNTL+DA C GG MD A++IMSEM  KNI+PN VTYST+I G AKAG+L+ A
Sbjct: 322  GIDQDIYTYNTLIDAICKGGQMDLAYQIMSEMPSKNILPNVVTYSTIIDGYAKAGRLEDA 381

Query: 1762 LALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKDVVTYNALLDG 1583
            L+LFNEMK+  I LDRV YNTLL++Y  LGRF +AL V +EMES+GI KDVV+YNALL G
Sbjct: 382  LSLFNEMKFLAIGLDRVLYNTLLSLYGKLGRFEDALKVCKEMESVGIAKDVVSYNALLGG 441

Query: 1582 FGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYKEFKHQGLKAD 1403
            +GKQG Y+    ++  MK E +SPN+LTYSTLI V+SKGGLY +AM+V++EFK  GLKAD
Sbjct: 442  YGKQGKYDDAKRMYNQMKEERVSPNILTYSTLIDVYSKGGLYMEAMKVFREFKQAGLKAD 501

Query: 1402 VVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSVATDHPLDSE- 1226
            VV YS+L+++LCKNGLVES+VLLLDEM K+GI+PNVVTYNSII+AFG S  T+   D+  
Sbjct: 502  VVLYSELVNALCKNGLVESAVLLLDEMTKEGIRPNVVTYNSIIDAFGRSATTECAADAAG 561

Query: 1225 -----EHMKSSALVTADVTESNSEDKDKDTIITIFEQLAAGKSASFEKDNSGKQDFLCVL 1061
                 +   SS++   D       D+  +  + +F QLAA K+   + D   +Q+ LC+L
Sbjct: 562  GGIVLQTESSSSVSEGDAIGIQVGDRGDNRFMKMFGQLAAEKAGYAKTDRKVRQEILCIL 621

Query: 1060 GVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYGVAHGLLMGHD 881
            G+FQKMHE++IKPNVVTFSAILNACSRCNSF           LFDN+VYGVAHGLLMG+ 
Sbjct: 622  GIFQKMHELDIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVAHGLLMGYR 681

Query: 880  ERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGKRRQVWESAIS 701
            + VW++A SLFDEVKQMDSSTASAFYNALTDMLWH+GQ++GAQ+VVLEGKRR VWES  S
Sbjct: 682  DNVWVKAESLFDEVKQMDSSTASAFYNALTDMLWHYGQKQGAQLVVLEGKRRNVWESVWS 741

Query: 700  DSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSKVVGDGALKRA 521
            +S LDLHLMSSGAARAMVHAWLL+IRSIVF+G++LP LLSI+TGWGKHSKVVGD  L+RA
Sbjct: 742  NSCLDLHLMSSGAARAMVHAWLLNIRSIVFEGQQLPNLLSILTGWGKHSKVVGDSTLRRA 801

Query: 520  IEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDRIHPESS 359
            IEALLTS+ APF VAKCN+GRFISTGS+  AW+RESGTL+VLVL DDR  P+S+
Sbjct: 802  IEALLTSMGAPFRVAKCNLGRFISTGSMAAAWLRESGTLEVLVLHDDRTCPKSA 855


>ref|XP_004288538.1| PREDICTED: pentatricopeptide repeat-containing protein At2g31400,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 870

 Score = 1029 bits (2661), Expect = 0.0
 Identities = 528/790 (66%), Positives = 633/790 (80%), Gaps = 15/790 (1%)
 Frame = -2

Query: 2683 TTFPSL---SKSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGD 2513
            ++F SL   +KS+L + F+GRRSTR VSKMH GRPKT   SRH+  AEEAL+ AIR   D
Sbjct: 69   SSFSSLCPPAKSDLVSAFSGRRSTRMVSKMHLGRPKTTVGSRHSPLAEEALETAIRFGKD 128

Query: 2512 DSCMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSS 2333
            D  +D +L  FE +L   DD+TFLLRELGNRGE +  +RCF+FAV RERKR EQGKL SS
Sbjct: 129  DFALDDVLHSFESRLVS-DDFTFLLRELGNRGECWKAIRCFEFAVRRERKRTEQGKLASS 187

Query: 2332 MISILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSG 2153
            MIS LGRLGKV+LA+ VF+TAVNEGYG TVY YSALISAY +SGYCD+AIRV E+MKDSG
Sbjct: 188  MISTLGRLGKVELAKNVFQTAVNEGYGRTVYTYSALISAYGRSGYCDEAIRVLESMKDSG 247

Query: 2152 LKPNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETA 1973
            +KPNLVTYNA+IDACGKGG +F++  EIF+EML+ GVQPDRITYNSLLAVCS   LWE A
Sbjct: 248  VKPNLVTYNAVIDACGKGGVEFKKVVEIFDEMLKVGVQPDRITYNSLLAVCSRGGLWEAA 307

Query: 1972 KCLFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRG 1793
            + LF+EMV +GIDQDIYTYNTLLDA   GG MD A++IMSEM  KNI+PN VTYSTMI G
Sbjct: 308  RNLFSEMVDRGIDQDIYTYNTLLDAISKGGQMDLAYKIMSEMPSKNILPNVVTYSTMIDG 367

Query: 1792 CAKAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKD 1613
             AKAG+L+ AL LFNEMK+  I LDRV YNTLL++Y  LGRF EAL V +EMES+GI KD
Sbjct: 368  YAKAGRLEDALNLFNEMKFLAIGLDRVLYNTLLSLYGKLGRFEEALNVCKEMESVGIAKD 427

Query: 1612 VVTYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYK 1433
            VV+YNALL G+GKQG Y++V  L+  MK E +SPNLLTYSTLI V+SKGGLY +A++V++
Sbjct: 428  VVSYNALLGGYGKQGKYDEVKGLYNEMKVERVSPNLLTYSTLIDVYSKGGLYAEAVKVFR 487

Query: 1432 EFKHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSV 1253
            EFK  GLKADVV YS+L+++LCKNGLVES+V LLDEM K+GI+PNVVTYNSII+AFG   
Sbjct: 488  EFKQAGLKADVVLYSELINALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRPA 547

Query: 1252 ATDHPLDSEE-----HMKSSALVTA---DVTESNSE----DKDKDTIITIFEQLAAGKSA 1109
             T   +D+         +SS+ ++A   D+++ N +    D++   I+ +F QL A K+ 
Sbjct: 548  TTVCAVDAGACGIVLRSESSSSISARDFDISDKNVQNEMRDREDTRIMKMFGQLTADKAG 607

Query: 1108 SFEKDNSGKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLF 929
              +KD   +Q+ LC+LGVFQKMHE++IKPNVVTFSAILNACSRCNSF           LF
Sbjct: 608  YAKKDRKVRQEILCILGVFQKMHELDIKPNVVTFSAILNACSRCNSFEDASMLLEELRLF 667

Query: 928  DNQVYGVAHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQM 749
            DNQVYGVAHGLLMG    VW++A SLFDEVKQMD STASAFYNALTDMLWHFGQ++GAQ+
Sbjct: 668  DNQVYGVAHGLLMGCRGNVWVKAQSLFDEVKQMDCSTASAFYNALTDMLWHFGQKKGAQL 727

Query: 748  VVLEGKRRQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTG 569
            VVLEG+RR VWE+A S+S LDLHLMSSGAARAMVHAWLL+I SIV++G++LP LLSI+TG
Sbjct: 728  VVLEGERRNVWENAWSNSRLDLHLMSSGAARAMVHAWLLNIHSIVYQGQQLPNLLSILTG 787

Query: 568  WGKHSKVVGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVL 389
            WGKHSKVVGD AL+RA+EALLTS+ APF V +CNIGRFISTGSV  AW++ESGTL+VL+L
Sbjct: 788  WGKHSKVVGDSALRRAVEALLTSMGAPFRVHECNIGRFISTGSVAAAWLKESGTLEVLML 847

Query: 388  QDDRIHPESS 359
             DDR  P S+
Sbjct: 848  HDDRAEPNSA 857


>ref|XP_002301519.2| hypothetical protein POPTR_0002s19470g [Populus trichocarpa]
            gi|550345387|gb|EEE80792.2| hypothetical protein
            POPTR_0002s19470g [Populus trichocarpa]
          Length = 864

 Score = 1019 bits (2634), Expect = 0.0
 Identities = 525/785 (66%), Positives = 621/785 (79%), Gaps = 10/785 (1%)
 Frame = -2

Query: 2680 TFPSLS--KSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGDDS 2507
            TF S    KSEL +DF GRRSTRFVSK+HFGRP+T   +RHT+ A+EALQ  I    D+ 
Sbjct: 81   TFSSFQPPKSELVSDFPGRRSTRFVSKLHFGRPRTTMGTRHTSVAQEALQNVIEYGKDER 140

Query: 2506 CMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSSMI 2327
             ++ +LL+FE +LSG DDY FLLRELGNRG+    + CF+FAV RERK+NEQGKL S+MI
Sbjct: 141  ALENVLLNFESRLSGSDDYVFLLRELGNRGDCKKAICCFEFAVKRERKKNEQGKLASAMI 200

Query: 2326 SILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSGLK 2147
            S LGRLGKV++A+ VF+ A+ EGYGNTVYA+SA+ISAY +SGYC++AI++F +MKD GLK
Sbjct: 201  STLGRLGKVEMAKTVFKAALTEGYGNTVYAFSAIISAYGRSGYCNEAIKIFYSMKDYGLK 260

Query: 2146 PNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETAKC 1967
            PNLVTYNA+IDACGKGG +F+R  EIF+EMLRNG+QPDRIT+NSLLAVCS   LWE A+ 
Sbjct: 261  PNLVTYNAVIDACGKGGVEFKRVLEIFDEMLRNGMQPDRITFNSLLAVCSKGGLWEAARS 320

Query: 1966 LFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRGCA 1787
            L  EMV +GIDQDI+TYNTLLDA C GG +D AFEIMSEM  KNI+PN VTYSTMI G A
Sbjct: 321  LSCEMVNRGIDQDIFTYNTLLDAVCKGGQLDMAFEIMSEMPAKNILPNVVTYSTMIDGYA 380

Query: 1786 KAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKDVV 1607
            KAG+LD A  LFNEMK+ GI LDRVSYNTLL+IYA LGRF EA+ V  EME+ GI+KDVV
Sbjct: 381  KAGRLDDARNLFNEMKFLGISLDRVSYNTLLSIYAKLGRFEEAMDVCREMENSGIRKDVV 440

Query: 1606 TYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYKEF 1427
            TYNALL G+GKQ  Y+ V ++F+ MKA ++SPNLLTYSTLI V+SKGGLY +AM V++EF
Sbjct: 441  TYNALLGGYGKQYKYDVVRKVFEEMKARHVSPNLLTYSTLIDVYSKGGLYREAMDVFREF 500

Query: 1426 KHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSVAT 1247
            K  GLKADVV YS L+D+LCKNGLVES+V LLDEM K+GI+PNVVTYNSII+AFG    T
Sbjct: 501  KKAGLKADVVLYSALIDALCKNGLVESAVSLLDEMTKEGIRPNVVTYNSIIDAFGRPATT 560

Query: 1246 DHPLDSE--------EHMKSSALVTADVTESNSEDKDKDTIITIFEQLAAGKSASFEKDN 1091
            +  +D          + + SSA+  A  T+S   D++ + II IF QLAA K+   +  N
Sbjct: 561  ESVVDDAGQTSELQIDSLSSSAVEKA--TKSLVADREDNRIIKIFGQLAAEKAG--QAKN 616

Query: 1090 SGKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYG 911
            SG Q+ +C+LGVF KMHE+EIKPNVVTFSAILNACSRCNSF           LFDNQVYG
Sbjct: 617  SGGQEMMCILGVFHKMHELEIKPNVVTFSAILNACSRCNSFEEASMLLEELRLFDNQVYG 676

Query: 910  VAHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGK 731
            VAHGLLMG+ E VW QA SLFDEVK MDSSTASAFYNALTDMLWHFGQ+RGAQ+VVLEGK
Sbjct: 677  VAHGLLMGYRENVWEQAQSLFDEVKLMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGK 736

Query: 730  RRQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSK 551
            RRQVWE+  S+S LDLHLMSSGAARAMVHAWLL++R+IVF+G E+PKLL         SK
Sbjct: 737  RRQVWENVWSESCLDLHLMSSGAARAMVHAWLLNVRAIVFEGHEVPKLL---------SK 787

Query: 550  VVGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDRIH 371
            VVGD  L+RA+EALL  + APF  AKCN+GR ISTGSVV +W+RESGTLKVLVL DDR H
Sbjct: 788  VVGDSTLRRAVEALLMGMGAPFRSAKCNLGRLISTGSVVASWLRESGTLKVLVLHDDRTH 847

Query: 370  PESSR 356
             E+ R
Sbjct: 848  QENLR 852


>ref|XP_006417966.1| hypothetical protein EUTSA_v10006755mg [Eutrema salsugineum]
            gi|557095737|gb|ESQ36319.1| hypothetical protein
            EUTSA_v10006755mg [Eutrema salsugineum]
          Length = 895

 Score = 1001 bits (2587), Expect = 0.0
 Identities = 513/800 (64%), Positives = 625/800 (78%), Gaps = 27/800 (3%)
 Frame = -2

Query: 2677 FPSLS-----KSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGD 2513
            FP+LS     KS+L+ DF GRRSTRFVSKMHFGRPKTA ASRH+  AE+AL  AI+  G+
Sbjct: 92   FPALSPLQTPKSDLSPDFAGRRSTRFVSKMHFGRPKTAMASRHSLVAEDALHHAIQFSGN 151

Query: 2512 DSCMDRILLDFEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSS 2333
            D  +  +LL FE KL G DDYT++LRELGNRGE    +R ++FAV RER++NEQGKL S+
Sbjct: 152  DEGLQNLLLSFESKLCGSDDYTYILRELGNRGEFEKAVRFYEFAVKRERRKNEQGKLASA 211

Query: 2332 MISILGRLGKVDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSG 2153
            MIS LGRLGKV +A++VFETA+ +GYGNTVYA+SA+ISAY +SGY +DAI+VF +MK  G
Sbjct: 212  MISTLGRLGKVGIAKRVFETALADGYGNTVYAFSAIISAYGRSGYHEDAIKVFSSMKGHG 271

Query: 2152 LKPNLVTYNALIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETA 1973
            L+PNLVTYNA+IDACGKGG +F++ +E F+EM RN VQPDRIT+NSLLAVCS    WE A
Sbjct: 272  LRPNLVTYNAVIDACGKGGMEFKQVAEFFDEMQRNRVQPDRITFNSLLAVCSRGGSWEAA 331

Query: 1972 KCLFNEMVYKGIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRG 1793
            + LF+EM+ +GI+QDI+TYNTLLDA C GG MD AFEI+++M  KNIMPN VTYST+I G
Sbjct: 332  RNLFDEMLNRGIEQDIFTYNTLLDAICKGGQMDLAFEILAQMPAKNIMPNVVTYSTVIDG 391

Query: 1792 CAKAGKLDRALALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKD 1613
             AKAG+ + AL LF EMKY GI LDRVSYNTL++IYA LGRF EAL + +EM + GI+KD
Sbjct: 392  YAKAGRFNDALTLFGEMKYLGIPLDRVSYNTLVSIYAKLGRFEEALDIVKEMAAAGIRKD 451

Query: 1612 VVTYNALLDGFGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYK 1433
             VTYNALL G+GK   Y++V  +F  MK E + PNLLTYSTLI V+SKGGLY +AM++++
Sbjct: 452  AVTYNALLGGYGKHEKYDEVKSVFAEMKQERVLPNLLTYSTLIDVYSKGGLYKEAMEIFR 511

Query: 1432 EFKHQGLKADVVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSV 1253
            EFK  GL+ADVV YS L+D+LCKNGLVES+V LLDEM K+GI PNVVTYNS+I+AFG S 
Sbjct: 512  EFKSVGLRADVVLYSALIDALCKNGLVESAVSLLDEMTKEGISPNVVTYNSMIDAFGRSA 571

Query: 1252 ATD----------HPLDSEEHMKSSAL-------VTADVTESNSEDK----DKDTIITIF 1136
             T+          + L+ +E   SS+        ++  V E++S  K    +   I+ IF
Sbjct: 572  TTECLADINEGGANGLEEDESFSSSSASLSHTDSLSLAVGEADSLSKLTKTEDHRIVEIF 631

Query: 1135 EQLAAGKSASFEKD-NSGKQDFLCVLGVFQKMHEMEIKPNVVTFSAILNACSRCNSFXXX 959
             QL    +   ++D   G Q+  C+L V  KMHE+EIKPNVVTFSAILNACSRCNSF   
Sbjct: 632  GQLVTEGNNQIKRDCKQGVQELSCILEVCHKMHELEIKPNVVTFSAILNACSRCNSFEEA 691

Query: 958  XXXXXXXXLFDNQVYGVAHGLLMGHDERVWMQALSLFDEVKQMDSSTASAFYNALTDMLW 779
                    LFDN+VYGVAHGLLMG++E VW+QA SLFDEVK MD STASAFYNALTDMLW
Sbjct: 692  SMLLEELRLFDNKVYGVAHGLLMGYNENVWIQAQSLFDEVKAMDGSTASAFYNALTDMLW 751

Query: 778  HFGQRRGAQMVVLEGKRRQVWESAISDSELDLHLMSSGAARAMVHAWLLSIRSIVFKGRE 599
            HFGQ+RGAQ VVLEG+RR+VWE+  SDS LDLHLMSSGAARAMVHAWLL+IRSIV++G E
Sbjct: 752  HFGQKRGAQSVVLEGRRRKVWENVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHE 811

Query: 598  LPKLLSIMTGWGKHSKVVGDGALKRAIEALLTSIDAPFHVAKCNIGRFISTGSVVTAWMR 419
            LPKLLSI+TGWGKHSKV+GDG L+RA+EALL  + APFHVAKCN+GRF+S+GSVV AW+R
Sbjct: 812  LPKLLSILTGWGKHSKVMGDGTLRRAVEALLRGMGAPFHVAKCNVGRFVSSGSVVAAWLR 871

Query: 418  ESGTLKVLVLQDDRIHPESS 359
            ESGTLKVLVL+D + H E+S
Sbjct: 872  ESGTLKVLVLEDHK-HEEAS 890


>ref|XP_006293642.1| hypothetical protein CARUB_v10022597mg [Capsella rubella]
            gi|482562350|gb|EOA26540.1| hypothetical protein
            CARUB_v10022597mg [Capsella rubella]
          Length = 932

 Score =  969 bits (2506), Expect = 0.0
 Identities = 482/767 (62%), Positives = 600/767 (78%), Gaps = 5/767 (0%)
 Frame = -2

Query: 2662 KSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGDDSCMDRILLD 2483
            KS+L++DF+GRRSTRFVSKMHFGRPKTA A+RH++ AE+ALQ AI   GD      ++L 
Sbjct: 138  KSDLSSDFSGRRSTRFVSKMHFGRPKTAMATRHSSAAEDALQNAIDFSGDSEMFHSLMLS 197

Query: 2482 FEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSSMISILGRLGK 2303
            FE KL G DD T+++RELGNRGE    +  ++FAV RER++NEQGKL S+MIS LGR GK
Sbjct: 198  FESKLCGSDDCTYIIRELGNRGECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGK 257

Query: 2302 VDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSGLKPNLVTYNA 2123
            V +A+++FETA   GYGNTVYA+SALISAY +SG  ++AI VF +MKD GL+PNLVTYNA
Sbjct: 258  VTIAKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFSSMKDHGLRPNLVTYNA 317

Query: 2122 LIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETAKCLFNEMVYK 1943
            +IDACGKGG +F++ ++ F+EM +NGVQPDRIT+NSLLAVCS   LWE A+ LF+EM  +
Sbjct: 318  VIDACGKGGMEFKQVAKFFDEMQKNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMSNR 377

Query: 1942 GIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRGCAKAGKLDRA 1763
             I+QD+++YNTLLDA C GG MD AFEI+++M  K IMPN V+YST+I G AKAG+ D A
Sbjct: 378  RIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPAKRIMPNVVSYSTVIDGFAKAGRFDEA 437

Query: 1762 LALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKDVVTYNALLDG 1583
            L LF EM+Y GI LDRVSYNTLL+IY  +GR  EAL +  EM S+GIKKDVVTYNALL G
Sbjct: 438  LNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGG 497

Query: 1582 FGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYKEFKHQGLKAD 1403
            +GKQG Y++V ++F  MK E++ PNLLTYSTLI  +SKGGLY +AM++++EFK  GL+AD
Sbjct: 498  YGKQGKYDEVKKVFAEMKREHVVPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRAD 557

Query: 1402 VVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSVATDHPLD--- 1232
            VV YS L+D+LCKNGLV S+V L+DEM K+GI PNVVTYNSII+AFG S   +   D   
Sbjct: 558  VVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSATMERSADYSN 617

Query: 1231 -SEEHMKSSALVTADVTESNSEDKDKDTIITIFEQLAAGKSASFEKD-NSGKQDFLCVLG 1058
                +++  +L  +    S   + + + +I +F QL A  +    KD   G Q+  C+L 
Sbjct: 618  GEANNLEVGSLALSSSALSKLTETEGNRVIQLFGQLTAESNNRMTKDCKEGMQELSCILE 677

Query: 1057 VFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYGVAHGLLMGHDE 878
            VF+KMH++EIKPNVVTFSAILNACSRCNSF           LFDN+VYGV HGLLMG  E
Sbjct: 678  VFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMGERE 737

Query: 877  RVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGKRRQVWESAISD 698
             VW+QA SLFD+V +MD STASAFYNALTDMLWHFGQ+RGA++V LEG+ RQVWE+  SD
Sbjct: 738  NVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWENVWSD 797

Query: 697  SELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSKVVGDGALKRAI 518
            S LDLHLMSSGAARAMVHAWLL+IRSIV++G ELPK+LSI+TGWGKHSKVVGDGAL+RA+
Sbjct: 798  SCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGALRRAV 857

Query: 517  EALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQDDR 377
            E LL  +DAPFH++KCN+GRFIS+GSVV  W+RES TLK+L+L D +
Sbjct: 858  EVLLRGMDAPFHLSKCNMGRFISSGSVVATWLRESATLKLLILHDHK 904


>ref|NP_180698.1| pentatricopeptide-repeat protein GUN1 [Arabidopsis thaliana]
            gi|75206083|sp|Q9SIC9.1|PP178_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g31400, chloroplastic; Flags: Precursor
            gi|4589961|gb|AAD26479.1| unknown protein [Arabidopsis
            thaliana] gi|330253448|gb|AEC08542.1| genomes uncoupled 1
            protein [Arabidopsis thaliana]
          Length = 918

 Score =  962 bits (2488), Expect = 0.0
 Identities = 484/765 (63%), Positives = 593/765 (77%), Gaps = 5/765 (0%)
 Frame = -2

Query: 2662 KSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGDDSCMDRILLD 2483
            KS+L++DF+GRRSTRFVSKMHFGR KT  A+RH++ AE+ALQ AI   GDD     ++L 
Sbjct: 129  KSDLSSDFSGRRSTRFVSKMHFGRQKTTMATRHSSAAEDALQNAIDFSGDDEMFHSLMLS 188

Query: 2482 FEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSSMISILGRLGK 2303
            FE KL G DD T+++RELGNR E    +  ++FAV RER++NEQGKL S+MIS LGR GK
Sbjct: 189  FESKLCGSDDCTYIIRELGNRNECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGK 248

Query: 2302 VDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSGLKPNLVTYNA 2123
            V +A+++FETA   GYGNTVYA+SALISAY +SG  ++AI VF +MK+ GL+PNLVTYNA
Sbjct: 249  VTIAKRIFETAFAGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNA 308

Query: 2122 LIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETAKCLFNEMVYK 1943
            +IDACGKGG +F++ ++ F+EM RNGVQPDRIT+NSLLAVCS   LWE A+ LF+EM  +
Sbjct: 309  VIDACGKGGMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNR 368

Query: 1942 GIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRGCAKAGKLDRA 1763
             I+QD+++YNTLLDA C GG MD AFEI+++M +K IMPN V+YST+I G AKAG+ D A
Sbjct: 369  RIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEA 428

Query: 1762 LALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKDVVTYNALLDG 1583
            L LF EM+Y GI LDRVSYNTLL+IY  +GR  EAL +  EM S+GIKKDVVTYNALL G
Sbjct: 429  LNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGG 488

Query: 1582 FGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYKEFKHQGLKAD 1403
            +GKQG Y++V ++F  MK E++ PNLLTYSTLI  +SKGGLY +AM++++EFK  GL+AD
Sbjct: 489  YGKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRAD 548

Query: 1402 VVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSVATDHPLDSEE 1223
            VV YS L+D+LCKNGLV S+V L+DEM K+GI PNVVTYNSII+AFG S   D   D   
Sbjct: 549  VVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSATMDRSADYSN 608

Query: 1222 ----HMKSSALVTADVTESNSEDKDKDTIITIFEQLAAGKSASFEKD-NSGKQDFLCVLG 1058
                   SSAL     TE N        +I +F QL    +    KD   G Q+  C+L 
Sbjct: 609  GGSLPFSSSALSALTETEGN-------RVIQLFGQLTTESNNRTTKDCEEGMQELSCILE 661

Query: 1057 VFQKMHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYGVAHGLLMGHDE 878
            VF+KMH++EIKPNVVTFSAILNACSRCNSF           LFDN+VYGV HGLLMG  E
Sbjct: 662  VFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMGQRE 721

Query: 877  RVWMQALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGKRRQVWESAISD 698
             VW+QA SLFD+V +MD STASAFYNALTDMLWHFGQ+RGA++V LEG+ RQVWE+  SD
Sbjct: 722  NVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWENVWSD 781

Query: 697  SELDLHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSKVVGDGALKRAI 518
            S LDLHLMSSGAARAMVHAWLL+IRSIV++G ELPK+LSI+TGWGKHSKVVGDGAL+RA+
Sbjct: 782  SCLDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGALRRAV 841

Query: 517  EALLTSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQD 383
            E LL  +DAPFH++KCN+GRF S+GSVV  W+RES TLK+L+L D
Sbjct: 842  EVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHD 886


>ref|XP_002881173.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297327012|gb|EFH57432.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 917

 Score =  962 bits (2486), Expect = 0.0
 Identities = 480/761 (63%), Positives = 596/761 (78%), Gaps = 1/761 (0%)
 Frame = -2

Query: 2662 KSELAADFTGRRSTRFVSKMHFGRPKTAAASRHTATAEEALQLAIRSRGDDSCMDRILLD 2483
            KS+L++DF+GRRSTRFVSKMHFGRPKT  A+RH++ AE+ALQ AI   GDD     ++L 
Sbjct: 129  KSDLSSDFSGRRSTRFVSKMHFGRPKTTMATRHSSAAEDALQNAIDFSGDDEMFHSLMLS 188

Query: 2482 FEPKLSGVDDYTFLLRELGNRGELYMTMRCFDFAVSRERKRNEQGKLTSSMISILGRLGK 2303
            FE KL G DD T+++RELGNRGE    +  ++FAV RER++NEQGKL S+MIS LGR GK
Sbjct: 189  FESKLCGSDDCTYIIRELGNRGECDKAVGFYEFAVKRERRKNEQGKLASAMISTLGRYGK 248

Query: 2302 VDLARKVFETAVNEGYGNTVYAYSALISAYAKSGYCDDAIRVFETMKDSGLKPNLVTYNA 2123
            V +A+++FETA + GYGNTVYA+SALISAY +SG  ++AI VF +MK+ GL+PNLVTYNA
Sbjct: 249  VTIAKRIFETAFSGGYGNTVYAFSALISAYGRSGLHEEAISVFNSMKEYGLRPNLVTYNA 308

Query: 2122 LIDACGKGGADFRRASEIFEEMLRNGVQPDRITYNSLLAVCSGASLWETAKCLFNEMVYK 1943
            +IDACGKGG +F++ ++ F+EM RN VQPDRIT+NSLLAVCS   LWE A+ LF+EM  +
Sbjct: 309  VIDACGKGGMEFKQVAKFFDEMQRNCVQPDRITFNSLLAVCSRGGLWEAARNLFDEMSNR 368

Query: 1942 GIDQDIYTYNTLLDAACNGGHMDAAFEIMSEMSLKNIMPNEVTYSTMIRGCAKAGKLDRA 1763
             I+QD+++YNTLLDA C GG MD AFEI+++M  K IMPN V+YST+I G AKAG+ D A
Sbjct: 369  RIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPAKRIMPNVVSYSTVIDGFAKAGRFDEA 428

Query: 1762 LALFNEMKYAGIKLDRVSYNTLLAIYASLGRFHEALAVGEEMESIGIKKDVVTYNALLDG 1583
            L LF EM+Y  I LDRVSYNTLL+IY  +GR  EAL +  EM S+GIKKDVVTYNALL G
Sbjct: 429  LNLFGEMRYLNIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGG 488

Query: 1582 FGKQGMYNKVNELFQAMKAENLSPNLLTYSTLISVFSKGGLYHDAMQVYKEFKHQGLKAD 1403
            +GKQG Y++V ++F  MK E++ PNLLTYSTLI  +SKGGLY +AM+V++EFK  GL+AD
Sbjct: 489  YGKQGKYDEVKKVFAEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEVFREFKSAGLRAD 548

Query: 1402 VVFYSKLMDSLCKNGLVESSVLLLDEMMKKGIQPNVVTYNSIINAFGWSVATDHPLDSEE 1223
            VV YS L+D+LCKNGLV S+V L+DEM K+GI PNVVTYNSII+AFG S   +    S +
Sbjct: 549  VVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSATMER---SAD 605

Query: 1222 HMKSSALVTADVTESNSEDKDKDTIITIFEQLAAGKSASFEKD-NSGKQDFLCVLGVFQK 1046
            +    +L  +    S   + + + +I +F QL +  +    KD   G Q+  C+L VF+K
Sbjct: 606  YSNGGSLPFSSSALSELTETEGNRVIQLFGQLTSEGNNRMTKDCKEGMQELSCILEVFRK 665

Query: 1045 MHEMEIKPNVVTFSAILNACSRCNSFXXXXXXXXXXXLFDNQVYGVAHGLLMGHDERVWM 866
            MH++EIKPNVVTFSAILNACSRCNSF           LFDN+VYGV HGLLMG  E VW+
Sbjct: 666  MHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMGQRENVWL 725

Query: 865  QALSLFDEVKQMDSSTASAFYNALTDMLWHFGQRRGAQMVVLEGKRRQVWESAISDSELD 686
            QA SLFD+V +MD STASAFYNALTDMLWHFGQ+RGA++V LEG+ RQVWE+  SDS LD
Sbjct: 726  QAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWENVWSDSCLD 785

Query: 685  LHLMSSGAARAMVHAWLLSIRSIVFKGRELPKLLSIMTGWGKHSKVVGDGALKRAIEALL 506
            LHLMSSGAARAMVHAWLL+IRSIV++G ELPK+LSI+TGWGKHSKVVGDGALKRA+E LL
Sbjct: 786  LHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGALKRAVEVLL 845

Query: 505  TSIDAPFHVAKCNIGRFISTGSVVTAWMRESGTLKVLVLQD 383
              +DAPFH++KCN+GRF S+GSVV  W+RES TLK+L+L D
Sbjct: 846  RGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLILHD 886

