BLASTX nr result

ID: Sinomenium21_contig00010818 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00010818
         (2911 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citr...   588   e-165
ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containi...   586   e-164
ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containi...   585   e-164
ref|XP_004305832.1| PREDICTED: pentatricopeptide repeat-containi...   571   e-160
ref|XP_007048864.1| Pentatricopeptide repeat (PPR-like) superfam...   566   e-158
ref|XP_002534070.1| pentatricopeptide repeat-containing protein,...   561   e-157
ref|XP_002306741.1| pentatricopeptide repeat-containing family p...   560   e-156
gb|EXB44509.1| hypothetical protein L484_000760 [Morus notabilis...   539   e-150
ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containi...   527   e-146
ref|XP_004250888.1| PREDICTED: pentatricopeptide repeat-containi...   523   e-145
ref|XP_007141857.1| hypothetical protein PHAVU_008G231600g [Phas...   516   e-143
ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containi...   514   e-142
ref|XP_004491336.1| PREDICTED: pentatricopeptide repeat-containi...   502   e-139
ref|XP_004151347.1| PREDICTED: pentatricopeptide repeat-containi...   494   e-137
gb|EYU29595.1| hypothetical protein MIMGU_mgv1a025435mg [Mimulus...   491   e-136
ref|XP_003617444.1| Pentatricopeptide repeat-containing protein ...   485   e-134
ref|XP_002880012.1| pentatricopeptide repeat-containing protein ...   477   e-131
ref|NP_181820.1| pentatricopeptide repeat-containing protein [Ar...   467   e-128
ref|XP_006411565.1| hypothetical protein EUTSA_v10017572mg [Eutr...   465   e-128
ref|XP_006293939.1| hypothetical protein CARUB_v10022931mg [Caps...   459   e-126

>ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citrus clementina]
            gi|557539373|gb|ESR50417.1| hypothetical protein
            CICLE_v10031197mg [Citrus clementina]
          Length = 534

 Score =  588 bits (1515), Expect = e-165
 Identities = 311/523 (59%), Positives = 385/523 (73%), Gaps = 13/523 (2%)
 Frame = -2

Query: 2760 VSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNY 2581
            +SKFISD P LS+L+  CT+M +L K+HA+LIK GL +D IAASR+LAFC TSPAGD+NY
Sbjct: 13   MSKFISDQPLLSLLDKQCTSMKDLKKIHAHLIKTGLPKDPIAASRILAFC-TSPAGDINY 71

Query: 2580 ALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAYA 2401
            A  VF++I+ PNLF WNTIIRGFSQSS P+N I LFIDML TSPIQPQRLTYPSLFKAYA
Sbjct: 72   AYLVFTQIKKPNLFIWNTIIRGFSQSSTPRNAILLFIDMLVTSPIQPQRLTYPSLFKAYA 131

Query: 2400 RLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------FDE-DSSFDAVAWN 2242
            +LGLARDGA LHGRV+K GLE D  + NTII+MYANCGFL      FDE D+ FD VAWN
Sbjct: 132  QLGLARDGAQLHGRVVKQGLEFDQFIHNTIIYMYANCGFLSEARLIFDEVDTEFDVVAWN 191

Query: 2241 SMIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHP 2062
            SMI+GLAK G++DESRRLFDKM S++TV+WNSMISGYVRN + KEA +LF +MQ Q I P
Sbjct: 192  SMIIGLAKCGEIDESRRLFDKMVSRNTVSWNSMISGYVRNVKFKEALELFREMQEQNIKP 251

Query: 2061 TEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVF 1882
            +EF + SLL AC  LGA+ QGEWI  +   +  E+N+IV+TAI++ YCKCG  ++A QVF
Sbjct: 252  SEFTMVSLLNACAKLGAIRQGEWIHNFLVTNCFELNTIVVTAIIDMYCKCGCPERALQVF 311

Query: 1881 NDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVG 1702
            N  PKKGLS WNSM+ GLA+NG   EAI+LFS LQ S  KPD +SFI VLT+ NH G V 
Sbjct: 312  NTVPKKGLSCWNSMVFGLAMNGYENEAIKLFSGLQSSNLKPDYISFIAVLTACNHSGKVN 371

Query: 1701 EARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANM---TQTLLHGH--PC 1537
            +A+ YF +MT+  KIKP+IKHYSCMVDALGRAG LEEAE  I +M      ++ G     
Sbjct: 372  QAKDYFTLMTETYKIKPSIKHYSCMVDALGRAGLLEEAEKLIRSMPSDPDAIIWGSLLSA 431

Query: 1536 FQLV*NMEILRWRNKQQSNCSNWNQVKAAAMYFCHAYRSSARFEDTMNLRLLTQKTGLRK 1357
             +   N+E+ +   KQ              +   + Y +S +FE+ M  RLL ++  + K
Sbjct: 432  CRKHGNIEMAKQAAKQIIELD--KNESCGYVLMSNLYAASYQFEEAMEERLLMKEVKIEK 489

Query: 1356 EPGCSLIEVN-EVHEFVAGGVLHPQVVEIHALLGELSLMLKEV 1231
            EPGCSLIEV+ EVHEFVAGG LHP+  E++ LL +L L+++E+
Sbjct: 490  EPGCSLIEVDGEVHEFVAGGRLHPKAPEVYLLLNDLGLLIQEM 532


>ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic [Vitis vinifera]
            gi|302143555|emb|CBI22116.3| unnamed protein product
            [Vitis vinifera]
          Length = 533

 Score =  586 bits (1511), Expect = e-164
 Identities = 307/530 (57%), Positives = 380/530 (71%), Gaps = 24/530 (4%)
 Frame = -2

Query: 2760 VSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNY 2581
            +SKFISDHP LS+LE  CTTM +L K+HA+L+K GLA+  +A S +LAFCATSP GD+NY
Sbjct: 17   ISKFISDHPHLSILEKHCTTMKDLQKIHAHLLKTGLAKHPLAVSPVLAFCATSPGGDINY 76

Query: 2580 ALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAYA 2401
            A  VF++I  PNLF+WNTIIRGFSQSS P + ISLFIDML  S +QP RLTYPS+FKAYA
Sbjct: 77   AYLVFTQIHSPNLFSWNTIIRGFSQSSTPHHAISLFIDMLIVSSVQPHRLTYPSVFKAYA 136

Query: 2400 RLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------FDEDSSFDAVAWNS 2239
            +LGLA  GA LHGRV+KLGL+ DP +RNTII+MYANCGFL      F E   FD VAWNS
Sbjct: 137  QLGLAHYGAQLHGRVIKLGLQFDPFIRNTIIYMYANCGFLSEMWKAFYERMDFDIVAWNS 196

Query: 2238 MIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHPT 2059
            MIMGLAK G+VDESR+LFD+M  ++TV+WNSMISGYVRNGRL+EA DLF QMQ + I P+
Sbjct: 197  MIMGLAKCGEVDESRKLFDEMPLRNTVSWNSMISGYVRNGRLREALDLFGQMQEERIKPS 256

Query: 2058 EFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFN 1879
            EF + SLL A   LGAL+QGEWI  Y RK+  E+N IV  +I++ YCKCGS+ +AFQVF 
Sbjct: 257  EFTMVSLLNASARLGALKQGEWIHDYIRKNNFELNVIVTASIIDMYCKCGSIGEAFQVFE 316

Query: 1878 DAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGE 1699
             AP KGLS+WN+MI+GLA+NG   EAI+LFSRL+ S  +PDDV+F+GVLT+ N+ G+V +
Sbjct: 317  MAPLKGLSSWNTMILGLAMNGCENEAIQLFSRLECSNLRPDDVTFVGVLTACNYSGLVDK 376

Query: 1698 ARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANMTQTLLHGHPCFQLV*N 1519
            A++YF +M+K  KI+P+IKHYSCMVD LGRAG LEEAE  I NM               N
Sbjct: 377  AKEYFSLMSKTYKIEPSIKHYSCMVDTLGRAGLLEEAEELIRNMPV-------------N 423

Query: 1518 MEILRWRNKQQSNCSNWNQVKAAAMYFCHA-----------------YRSSARFEDTMNL 1390
             + + W +   S C     V+ A     H                  Y +S +FE+ M  
Sbjct: 424  PDAIIW-SSLLSACRKHGNVELAKRAAKHIVDLDGNDSCGYVLLSNIYAASDQFEEAMEQ 482

Query: 1389 RLLTQKTGLRKEPGCSLIEVN-EVHEFVAGGVLHPQVVEIHALLGELSLM 1243
            RL  ++  + KEPGCSLIEVN E+HEFVAGG LHPQ  E+++LL EL +M
Sbjct: 483  RLSMKEKQIEKEPGCSLIEVNGEIHEFVAGGRLHPQAQEVYSLLNELGMM 532


>ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Citrus sinensis]
          Length = 534

 Score =  585 bits (1507), Expect = e-164
 Identities = 310/523 (59%), Positives = 383/523 (73%), Gaps = 13/523 (2%)
 Frame = -2

Query: 2760 VSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNY 2581
            +SKFISD P LS+L+  CT+M +L K+HA+LIK GLA+D IAASR+L FC TSPAGD+NY
Sbjct: 13   MSKFISDQPLLSLLDKQCTSMKDLKKIHAHLIKTGLAKDPIAASRILTFC-TSPAGDINY 71

Query: 2580 ALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAYA 2401
            A  VF++I+ PNLF WNTIIRGFSQSS P+N I LFIDML TSPIQPQRLTYPSLFKAYA
Sbjct: 72   AYLVFTQIKKPNLFIWNTIIRGFSQSSTPRNAILLFIDMLVTSPIQPQRLTYPSLFKAYA 131

Query: 2400 RLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------FDE-DSSFDAVAWN 2242
            +LGLARDGA LHGRV+K GLE D  + NTII+MYANCGFL      FDE D+ FD VAWN
Sbjct: 132  QLGLARDGAQLHGRVVKQGLEFDQFIHNTIIYMYANCGFLSEARLMFDEVDTEFDVVAWN 191

Query: 2241 SMIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHP 2062
            SMI+GLAK G++DESRRLFDKM S++TV+WNSMISGYVRN + KEA +LF +MQ Q I P
Sbjct: 192  SMIIGLAKCGEIDESRRLFDKMVSRNTVSWNSMISGYVRNVKFKEALELFREMQEQNIKP 251

Query: 2061 TEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVF 1882
            +EF + SLL AC  LGA+ QGEWI  +   +  E+N+IV+TAI++ YCKCG  ++A QVF
Sbjct: 252  SEFTMVSLLNACAKLGAIRQGEWIHNFLVTNCFELNTIVVTAIIDMYCKCGCPERALQVF 311

Query: 1881 NDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVG 1702
            N  PKKGLS WNSM+ GLA+NG   EAI+LFS LQ S   PD  SFI VLT+ NH G V 
Sbjct: 312  NTVPKKGLSCWNSMVFGLAMNGYENEAIKLFSGLQSSNLTPDYTSFIAVLTACNHSGKVN 371

Query: 1701 EARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANM---TQTLLHGH--PC 1537
            +A+ YF +MT+  KIKP+IKHYSCMVDALGRAG LEEAE  I +M      ++ G     
Sbjct: 372  QAKDYFTLMTETYKIKPSIKHYSCMVDALGRAGLLEEAEKLIRSMPSDPDAIIWGSLLSA 431

Query: 1536 FQLV*NMEILRWRNKQQSNCSNWNQVKAAAMYFCHAYRSSARFEDTMNLRLLTQKTGLRK 1357
             +   N+E+ +   KQ              +   + Y +S +FE+ M  RLL ++  + K
Sbjct: 432  CRKHGNIEMAKQAAKQIIELD--KNESCGYVLMSNLYAASYQFEEAMEERLLMKEVKIEK 489

Query: 1356 EPGCSLIEVN-EVHEFVAGGVLHPQVVEIHALLGELSLMLKEV 1231
            EPGCSLIEV+ EVHEFVAGG LHP+  E++ LL +L L+++E+
Sbjct: 490  EPGCSLIEVDGEVHEFVAGGRLHPKAPEVYLLLNDLGLLIQEM 532


>ref|XP_004305832.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 550

 Score =  571 bits (1471), Expect = e-160
 Identities = 313/548 (57%), Positives = 374/548 (68%), Gaps = 24/548 (4%)
 Frame = -2

Query: 2808 MHPCFCXXXXXXXXXSVSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAAS 2629
            M PC C          +SKFISD P L MLEN CT M +L K+HA+LIK GLA DT+AAS
Sbjct: 1    MTPCCCSFTSSTS---ISKFISDKPHLFMLENQCTNMKDLQKIHAHLIKTGLANDTVAAS 57

Query: 2628 RLLAFCATSPAGDVNYALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSP 2449
            R+LAFCA SPAGD+NYA  VF  I +PNLF WNTIIRGFS SSNP+  ISLFIDML TS 
Sbjct: 58   RVLAFCA-SPAGDINYAYMVFRHIHNPNLFIWNTIIRGFSNSSNPEAAISLFIDMLVTST 116

Query: 2448 IQPQRLTYPSLFKAYARLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL---- 2281
            +QPQRLTYPS+FKAYA+LGLA DGA LHGRV+KLGLESD  VRNTII MY+NCG L    
Sbjct: 117  VQPQRLTYPSVFKAYAQLGLAHDGAQLHGRVVKLGLESDQFVRNTIIHMYSNCGLLSEAR 176

Query: 2280 --FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKE 2107
              FDED  FD VAWNSMIMGL+K G+V ESRRLFDKM  +++++WNSMI G VRNG   E
Sbjct: 177  RVFDEDLEFDIVAWNSMIMGLSKCGEVGESRRLFDKMPQRNSISWNSMIGGSVRNGMYTE 236

Query: 2106 AFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVE 1927
            A DLF +MQ Q I P+EF + SLL A   LGA+ QGEWI  Y RK+ I++N IV+TAI+ 
Sbjct: 237  ALDLFGEMQKQKIKPSEFTMVSLLNASAQLGAIRQGEWIHEYIRKNHIQLNPIVVTAIIN 296

Query: 1926 KYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVS 1747
             Y KCGS++KA  VF  AP+ GLS WNS+I+GLA NG  EEAIELFSRL+ S   PDDVS
Sbjct: 297  MYSKCGSIEKAVHVFEAAPRTGLSCWNSIIMGLATNGCEEEAIELFSRLKSSSFVPDDVS 356

Query: 1746 FIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANM 1567
            F+GVLT+ +H GMV +ARKYF VM +  +I P+IKHYSCMVD LGRAG LEEAE      
Sbjct: 357  FLGVLTACSHSGMVEKARKYFSVMRETYRIAPSIKHYSCMVDVLGRAGLLEEAE------ 410

Query: 1566 TQTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMYFCHA-------------- 1429
               L+ G P        + + W     S+C     ++ A     H               
Sbjct: 411  --KLIDGMPL-----KADAIIW-GSLLSSCRKHRDIEMAKRAAKHVIELDPSDCCGYVLM 462

Query: 1428 ---YRSSARFEDTMNLRLLTQKTGLRKEPGCSLIEVN-EVHEFVAGGVLHPQVVEIHALL 1261
               Y +S++FE+ M  RL  +   + KEPGCSLIEV+ EVHEF+AGG LH +  EI++LL
Sbjct: 463  SNVYAASSQFEEAMRERLSMKGQKIEKEPGCSLIEVDGEVHEFIAGGRLHQKAPEIYSLL 522

Query: 1260 GELSLMLK 1237
              L  ML+
Sbjct: 523  NGLGFMLE 530


>ref|XP_007048864.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao] gi|508701125|gb|EOX93021.1| Pentatricopeptide
            repeat (PPR-like) superfamily protein [Theobroma cacao]
          Length = 538

 Score =  567 bits (1460), Expect = e-158
 Identities = 311/549 (56%), Positives = 381/549 (69%), Gaps = 25/549 (4%)
 Frame = -2

Query: 2808 MHPCFCXXXXXXXXXSVSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAAS 2629
            M  CFC          ++KFISD P LS+LEN+CT+M +L KLHA LIK GL  D IAAS
Sbjct: 1    MVQCFCSLTPSPAS--ITKFISDQPYLSLLENNCTSMKDLKKLHAQLIKTGLVNDIIAAS 58

Query: 2628 RLLAFCATSPAGDVNYALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSP 2449
            R+LAFC  SPAGD+NYA  VF++I++PNLFTWNTIIRGFSQSSNPQ  ISLFIDML  S 
Sbjct: 59   RVLAFCV-SPAGDMNYAYLVFTQIKNPNLFTWNTIIRGFSQSSNPQIAISLFIDMLVGSS 117

Query: 2448 IQPQRLTYPSLFKAYARLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL---- 2281
            IQP+RLTYPS+FKAYA+LGLA DG  LHGRV+KLGL+ D  +RNTII+MYANCG L    
Sbjct: 118  IQPERLTYPSVFKAYAQLGLACDGRQLHGRVIKLGLDYDQFIRNTIIYMYANCGLLSEAW 177

Query: 2280 --FDEDS-SFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLK 2110
              FDE+    D VAWNSMI+GLAK G+VDESRRLF+KM S++TV+WNSMISGYVRNGR  
Sbjct: 178  RMFDEEHMELDIVAWNSMIIGLAKCGEVDESRRLFNKMVSRNTVSWNSMISGYVRNGRFL 237

Query: 2109 EAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIV 1930
            EA +LF +MQ + I P+EF + SLL AC  LGA+ QG+WI  Y  K   E+N IV+TAI+
Sbjct: 238  EALELFQEMQEEHIRPSEFTMVSLLNACACLGAITQGKWIHDYILKQNFELNGIVVTAII 297

Query: 1929 EKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDV 1750
            + YCKCG+ +KA QVF  +PK+GLS WNSMI+GLA NG   EA +LFS+L+    KPD V
Sbjct: 298  DMYCKCGNAEKALQVFTTSPKEGLSCWNSMILGLATNGCENEARQLFSKLESLSLKPDHV 357

Query: 1749 SFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIAN 1570
            +FIGVL + N  GMV +A+ YF +MT+  KIKPTIKHYSCMVD LG AG LEEAE  I +
Sbjct: 358  TFIGVLMACNSAGMVDKAKYYFSLMTEKYKIKPTIKHYSCMVDVLGNAGLLEEAEQLIRS 417

Query: 1569 MTQTLLHGHPCFQLV*NMEILRWRNKQQSNC---SNWNQVKAAA--------------MY 1441
            M               N + + W     S C    N    K AA              + 
Sbjct: 418  MPV-------------NEDAIIW-GSLLSACRKHGNVGMAKRAAKLVIELDPAERSGYVL 463

Query: 1440 FCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIEVNE-VHEFVAGGVLHPQVVEIHAL 1264
              + Y ++ +FE+ +  RL  ++  L+KEPGCSLIEVN+ VHEFV+GG LHPQ  EI+++
Sbjct: 464  MSNVYAATRQFEEAIKQRLSMKEKQLQKEPGCSLIEVNDVVHEFVSGGRLHPQAKEIYSV 523

Query: 1263 LGELSLMLK 1237
            L EL LML+
Sbjct: 524  LNELKLMLQ 532


>ref|XP_002534070.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223525897|gb|EEF28314.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 533

 Score =  561 bits (1447), Expect = e-157
 Identities = 301/531 (56%), Positives = 375/531 (70%), Gaps = 23/531 (4%)
 Frame = -2

Query: 2760 VSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNY 2581
            +SK ISD   LSML+ +CTTM +L K+H+ LIK GLA+DT AASR+LAFCA SPAGD+NY
Sbjct: 15   ISKLISDQTYLSMLDKNCTTMKDLKKIHSQLIKTGLAKDTNAASRILAFCA-SPAGDINY 73

Query: 2580 ALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAYA 2401
            A  VF +IQ+PN+F WNTIIRGFS+SS PQN ISL+IDML TSP+QPQRLTYPS+FKA+A
Sbjct: 74   AYLVFVQIQNPNIFAWNTIIRGFSRSSVPQNSISLYIDMLLTSPVQPQRLTYPSVFKAFA 133

Query: 2400 RLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGF------LFDEDSSFDAVAWNS 2239
            +L LA +GA LHG+++KLGLE+D  +RNTI+FMY NCGF      +FD    FD VAWN+
Sbjct: 134  QLDLASEGAQLHGKMIKLGLENDSFIRNTILFMYVNCGFTSEARKVFDRGMDFDIVAWNT 193

Query: 2238 MIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHPT 2059
            MIMG+AK G VDESRRLFDKM  ++ V+WNSMISGYVRNGR  +A +LF +MQ + I P+
Sbjct: 194  MIMGVAKCGLVDESRRLFDKMSLRNAVSWNSMISGYVRNGRFFDALELFQKMQVERIEPS 253

Query: 2058 EFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFN 1879
            EF + SLL AC  LGA+ QGEWI  Y  K K E+N IV+TAI++ Y KCGS+DKA QVF 
Sbjct: 254  EFTMVSLLNACACLGAIRQGEWIHDYMVKKKFELNPIVVTAIIDMYSKCGSIDKAVQVFQ 313

Query: 1878 DAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGE 1699
             AP++GLS WNSMI+GLA+NGQ  EA++LFS LQ S  +PDDVSFI VLT+ +H GMV +
Sbjct: 314  SAPRRGLSCWNSMILGLAMNGQENEALQLFSVLQSSDLRPDDVSFIAVLTACDHTGMVDK 373

Query: 1698 ARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANMTQTLLHGHPCFQLV*N 1519
            A+ YFL+M    KIKP IKH+SCMVD LGRAG LEEAE  I +M     H  P       
Sbjct: 374  AKDYFLLMRDKYKIKPGIKHFSCMVDVLGRAGLLEEAEELIRSM-----HVDP------- 421

Query: 1518 MEILRWRNKQQSNC--SNWNQVKAAAMYF--------------CHAYRSSARFEDTMNLR 1387
             + + W +   S C   N    K AA +                +AY ++  FE+ +  R
Sbjct: 422  -DAIIWGSLLWSCCKYGNIKMAKRAANHLIELNPSESSSFVLVANAYAAANNFEEALKER 480

Query: 1386 LLTQKTGLRKEPGCSLIEV-NEVHEFVAGGVLHPQVVEIHALLGELSLMLK 1237
            L  ++  + KEPGCS IEV  EVHEFVAGG  HP++ EI+ +L  L+L  K
Sbjct: 481  LTLKENHIGKEPGCSCIEVGGEVHEFVAGGRAHPEIKEIYRVLDVLTLTNK 531


>ref|XP_002306741.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222856190|gb|EEE93737.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 509

 Score =  560 bits (1443), Expect = e-156
 Identities = 296/512 (57%), Positives = 376/512 (73%), Gaps = 14/512 (2%)
 Frame = -2

Query: 2724 MLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNYALSVFSKIQHPN 2545
            ML+ +CT+M +L K+HA LIK GLA+DTIAASR+LAFC TSPAGD+NYA  VF++I++PN
Sbjct: 1    MLDKNCTSMKDLQKIHAQLIKTGLAKDTIAASRVLAFC-TSPAGDINYAYLVFTQIRNPN 59

Query: 2544 LFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPI-QPQRLTYPSLFKAYARLGLARDGA*L 2368
            LF WNTIIRGFSQSS P N ISLFIDM+ TSP  QPQRLTYPS+FKAYA+LGLA +GA L
Sbjct: 60   LFVWNTIIRGFSQSSTPHNAISLFIDMMFTSPTTQPQRLTYPSVFKAYAQLGLAHEGAQL 119

Query: 2367 HGRVLKLGLESDPLVRNTIIFMYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQV 2206
            HGRV+KLGLE+D  ++NTI+ MY NCGFL      FD  + FD V WN+MI+GLAK G++
Sbjct: 120  HGRVIKLGLENDQFIQNTILNMYVNCGFLGEAQRIFDGATGFDVVTWNTMIIGLAKCGEI 179

Query: 2205 DESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTAC 2026
            D+SRRLFDKM  ++TV+WNSMISGYVR GR  EA +LF +MQ + I P+EF + SLL AC
Sbjct: 180  DKSRRLFDKMLLRNTVSWNSMISGYVRKGRFFEAMELFSRMQEEGIKPSEFTMVSLLNAC 239

Query: 2025 GGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWN 1846
              LGAL QGEWI  Y  K+   +NSIV+TAI++ Y KCGS+DKA QVF  APKKGLS WN
Sbjct: 240  ACLGALRQGEWIHDYIVKNNFALNSIVITAIIDMYSKCGSIDKALQVFKSAPKKGLSCWN 299

Query: 1845 SMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKI 1666
            S+I+GLA++G+G EA+ LFS+L+ S  KPD VSFIGVLT+ NH GMV  A+ YFL+M++ 
Sbjct: 300  SLILGLAMSGRGNEAVRLFSKLESSNLKPDHVSFIGVLTACNHAGMVDRAKDYFLLMSET 359

Query: 1665 CKIKPTIKHYSCMVDALGRAGFLEEAEGPIANM---TQTLLHG---HPCFQLV*NMEILR 1504
             KI+P+IKHYSCMVD LGRAG LEEAE  I +M      ++ G     C +   N+E+ +
Sbjct: 360  YKIEPSIKHYSCMVDVLGRAGLLEEAEELIKSMPVNPDAIIWGSLLSSCREYG-NIEMAK 418

Query: 1503 WRNKQQSNCSNWNQVKAAAMYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIEVN- 1327
               K+ +         ++ +   + Y +   FE+ +  RL  ++  + KEPGCSLIEVN 
Sbjct: 419  QAAKRVNELD--PNESSSFILLSNVYAAHNHFEEAIEQRLSLKEKQMDKEPGCSLIEVNG 476

Query: 1326 EVHEFVAGGVLHPQVVEIHALLGELSLMLKEV 1231
            EVHEFVAGG LHP+  +I+  L +L L LKE+
Sbjct: 477  EVHEFVAGGRLHPRSKDIYHALDDLGLTLKEM 508


>gb|EXB44509.1| hypothetical protein L484_000760 [Morus notabilis]
            gi|587904202|gb|EXB92403.1| hypothetical protein
            L484_021387 [Morus notabilis]
          Length = 530

 Score =  539 bits (1389), Expect = e-150
 Identities = 292/513 (56%), Positives = 366/513 (71%), Gaps = 11/513 (2%)
 Frame = -2

Query: 2760 VSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNY 2581
            ++KFISD P LSMLE  C TM++L K+HA+LIK GL   TIA+SRLLAFCA SPAG++NY
Sbjct: 19   IAKFISDQPHLSMLEKRCATMSDLRKIHAHLIKTGLISHTIASSRLLAFCA-SPAGNINY 77

Query: 2580 ALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAYA 2401
            AL VFS+IQ+PNLF WNTIIRGFS+SS PQ  I LFIDML  SP++PQRLTYPS+FKAYA
Sbjct: 78   ALMVFSQIQNPNLFIWNTIIRGFSRSSTPQTAIFLFIDMLVGSPLEPQRLTYPSVFKAYA 137

Query: 2400 RLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------FDEDSSFDAVAWNS 2239
            +LGLA  GA LHGRV+KLGL+ D  VRNTII MY NCGFL      FDE S  D VAWNS
Sbjct: 138  QLGLACFGAQLHGRVIKLGLDCDRFVRNTIIHMYINCGFLSEARQLFDESSELDLVAWNS 197

Query: 2238 MIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHPT 2059
            MIMGL+K G+V ESRRLFD+M  +++V+WNSMISGYVRNG+  EA +LF +MQ + I  +
Sbjct: 198  MIMGLSKCGEVGESRRLFDRMPLRNSVSWNSMISGYVRNGKCVEALELFGKMQGEGIKAS 257

Query: 2058 EFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFN 1879
            EF + SLL A G LGA+ QGEWI  Y  K+ IE+N IV+TAI++ YCKCGSV+KA  VF 
Sbjct: 258  EFTMVSLLNASGRLGAIRQGEWIHEYITKNGIELNVIVVTAIIDMYCKCGSVNKALSVFK 317

Query: 1878 DAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLS-GSKPDDVSFIGVLTSGNHCGMVG 1702
             APK GLS WNSM++GLA+NG  EEA+ELFSRL+ S   +PD VSF+ VLT+ NH GMV 
Sbjct: 318  TAPKLGLSCWNSMVMGLAMNGCEEEALELFSRLESSIDLRPDGVSFLAVLTACNHSGMVD 377

Query: 1701 EARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANM---TQTLLHGHPCFQ 1531
            +AR YF +M     I+P+ +HYSCMVD LG+AG LEEAE  I +M      ++ G     
Sbjct: 378  KARDYFSLMRGKYNIEPSTRHYSCMVDVLGKAGHLEEAEKLILSMPINPDAIIWGSLLSA 437

Query: 1530 LV*NMEILRWRNKQQSNCSNWNQVKAAAMYFCHAYRSSARFEDTMNLRLLTQKTGLRKEP 1351
               +  I   +   +          +A +   + Y SS+ +++ +  R+  ++  + KEP
Sbjct: 438  CRKHGNIEMAQRALERVIELDPSESSAYVLMSNVYGSSSHYDEAVKQRINMKEKRIEKEP 497

Query: 1350 GCSLIEVN-EVHEFVAGGVLHPQVVEIHALLGE 1255
            GCSLIEV+ EVHEFVA G  HP+  EI++LL E
Sbjct: 498  GCSLIEVDGEVHEFVAFGRTHPRAKEIYSLLSE 530


>ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Solanum tuberosum]
          Length = 522

 Score =  527 bits (1358), Expect = e-146
 Identities = 279/521 (53%), Positives = 364/521 (69%), Gaps = 21/521 (4%)
 Frame = -2

Query: 2760 VSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPA-GDVN 2584
            +SKFISD P L MLE  CTTM +L K+HA+LIK+GL +D IA+SR+LAF A SP  GD+N
Sbjct: 11   ISKFISDQPYLHMLETKCTTMTDLKKIHAHLIKSGLIKDKIASSRVLAFSAKSPPIGDIN 70

Query: 2583 YALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAY 2404
            YA  VF+ I++PNLFTWNTIIRGFS+SS PQ  I LFI+ML  S +QP  LTYPS+FKAY
Sbjct: 71   YANLVFTHIENPNLFTWNTIIRGFSESSTPQYAIHLFIEMLNNSQVQPHLLTYPSVFKAY 130

Query: 2403 ARLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------FDEDSSFDAVAWN 2242
            AR GL ++GA LHGR++KLGLE D  +RNT+++MYA+CGFL      FDED   D V+WN
Sbjct: 131  ARGGLVKNGAQLHGRIIKLGLEFDTFIRNTMLYMYASCGFLVEARKLFDEDEIEDVVSWN 190

Query: 2241 SMIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHP 2062
            SMIMGLAKSG++D+S RLF KM +++ V+WNSMISG+VRNG+  EA +LF  MQ + I P
Sbjct: 191  SMIMGLAKSGEIDDSWRLFSKMSTRNDVSWNSMISGFVRNGKWNEALELFSTMQEENIKP 250

Query: 2061 TEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVF 1882
            +EF L SLL ACG LGALEQG WI  Y +K+ +E+N IV+TAI++ YCKCG+V+ A+ VF
Sbjct: 251  SEFTLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCKCGNVEMAWHVF 310

Query: 1881 NDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVG 1702
                 KGLS+WNSMI+GLA NG  ++AI+LF+RLQ S  KPD VSFIGVLT+ NH G+V 
Sbjct: 311  ISISNKGLSSWNSMILGLATNGFEDDAIKLFARLQCSILKPDSVSFIGVLTACNHSGLVD 370

Query: 1701 EARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANMTQ--------TLL-- 1552
            +A+ YF +M K   I+P+IKHY CMVD LGRAG +EEA+  I +M          +LL  
Sbjct: 371  KAKDYFQLMKKEYGIEPSIKHYGCMVDILGRAGLVEEADEVIRSMKMEPDAVIWCSLLSA 430

Query: 1551 ---HGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMYFCHAYRSSARFEDTMNLRLL 1381
               HG        NME+ RW  +            +  +   + Y +S +F + ++ R+ 
Sbjct: 431  CRSHG--------NMELARWSAENLLELD--PNESSGYVLMANMYAASGQFAEAIDERIS 480

Query: 1380 TQKTGLRKEPGCSLIEVN-EVHEFVAGGVLHPQVVEIHALL 1261
             +   + KEPGCS +EVN EVHEF +G  L  +  +I++L+
Sbjct: 481  MKDKHIAKEPGCSSVEVNGEVHEFASGRKLDSEFHDIYSLM 521


>ref|XP_004250888.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Solanum lycopersicum]
          Length = 522

 Score =  523 bits (1348), Expect = e-145
 Identities = 275/521 (52%), Positives = 365/521 (70%), Gaps = 21/521 (4%)
 Frame = -2

Query: 2760 VSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPA-GDVN 2584
            +SKFI D P L MLE  CTTM +L K+HA+LIK+GL +D IAASR+LAF A SP  GD+N
Sbjct: 11   ISKFILDQPYLHMLETKCTTMTDLKKIHAHLIKSGLIKDKIAASRVLAFSAKSPPIGDIN 70

Query: 2583 YALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAY 2404
            YA  VF+ I++PN FTWNTIIRGFS+SS PQ  I LFI+ML  S +QP  LTYPS+FKAY
Sbjct: 71   YANLVFTHIENPNPFTWNTIIRGFSESSTPQYAIHLFIEMLNNSQVQPHLLTYPSVFKAY 130

Query: 2403 ARLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------FDEDSSFDAVAWN 2242
            AR G+A++GA LHGR++KLGLE D  +RNT+++MYA+CGFL      FDED   D V+WN
Sbjct: 131  ARGGIAKNGAQLHGRIMKLGLEFDTFIRNTLLYMYASCGFLVEARKLFDEDEIEDVVSWN 190

Query: 2241 SMIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHP 2062
            SMI+GLAKSG++D+S RLF KM +++ V+WNSMISG+VRNG+  EA +LF  MQ + + P
Sbjct: 191  SMIIGLAKSGEIDDSWRLFSKMPTRNDVSWNSMISGFVRNGKWNEALELFSTMQEENVKP 250

Query: 2061 TEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVF 1882
            +EF L SLL ACG LGALEQG WI  Y +K+ +E+N IV+TAI++ YCKC +V+ A+ VF
Sbjct: 251  SEFTLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCKCANVEMAWHVF 310

Query: 1881 NDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVG 1702
              +  KGLS+WNSMI+GLA NG  ++AI+LF+RLQ S  KPD VSFIGVLT+ NH G+V 
Sbjct: 311  VSSSNKGLSSWNSMILGLATNGFEDDAIKLFARLQCSILKPDSVSFIGVLTACNHSGLVE 370

Query: 1701 EARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANMTQ--------TLL-- 1552
            +A+ YF +M     I+P+IKHY CMVD LGRAG +EEAE  I +M          +LL  
Sbjct: 371  KAKDYFQLMKMEYGIEPSIKHYGCMVDILGRAGLVEEAEEVIRSMKMEPDAVIWGSLLSA 430

Query: 1551 ---HGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMYFCHAYRSSARFEDTMNLRLL 1381
               HG        N+E+ RW  +            +  +   + Y +S  F++ MN R+ 
Sbjct: 431  CRSHG--------NVELARWSAENLLELD--PNESSGYVLMANMYAASGLFDEAMNERIS 480

Query: 1380 TQKTGLRKEPGCSLIEVN-EVHEFVAGGVLHPQVVEIHALL 1261
             ++  + KEPGCS +E+N EVHEF +G  L+ ++ +I++L+
Sbjct: 481  MKEKHIAKEPGCSSVEINGEVHEFASGRKLYSELHDIYSLM 521


>ref|XP_007141857.1| hypothetical protein PHAVU_008G231600g [Phaseolus vulgaris]
            gi|561014990|gb|ESW13851.1| hypothetical protein
            PHAVU_008G231600g [Phaseolus vulgaris]
          Length = 525

 Score =  516 bits (1328), Expect = e-143
 Identities = 278/537 (51%), Positives = 367/537 (68%), Gaps = 21/537 (3%)
 Frame = -2

Query: 2802 PCFCXXXXXXXXXSVSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRL 2623
            P  C         S++ FISDHPCL+ML+N CT M +L K+H ++IK GLA D IAASR+
Sbjct: 2    PILCSALPPSSSPSIANFISDHPCLTMLQNQCTNMKDLQKIHPHIIKTGLALDHIAASRV 61

Query: 2622 LAFCATSPAGDVNYALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQ 2443
            L FCA+S +GD+NYA  VF+ I +PNL+ WNTIIRGFS+SS PQ  ISLF+DML  S ++
Sbjct: 62   LTFCASS-SGDINYAYLVFTGIPNPNLYCWNTIIRGFSRSSTPQFAISLFVDMLY-SAVE 119

Query: 2442 PQRLTYPSLFKAYARLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------ 2281
            PQRLTYPS+FKAYA+LG   DGA LHGRV+KLGLE D  + NTI++MYAN G +      
Sbjct: 120  PQRLTYPSVFKAYAQLGAGHDGAQLHGRVVKLGLEKDQFISNTILYMYANSGLMSEARRV 179

Query: 2280 FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAF 2101
            FDE    D VA NSMIMGLAK G+VD+SRRLFD M +++ V+WNSMISGYVRNGRL E  
Sbjct: 180  FDEPLELDVVACNSMIMGLAKCGEVDKSRRLFDNMPTRTAVSWNSMISGYVRNGRLTEGL 239

Query: 2100 DLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKY 1921
            +LF +MQ + + P+EF + SLL+AC  LGAL+ GEW+  Y ++   ++N IVLTAI++ Y
Sbjct: 240  ELFRKMQEEGVEPSEFTMVSLLSACAHLGALQHGEWVHDYIKRGNFKLNVIVLTAIIDMY 299

Query: 1920 CKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFI 1741
            CKCGS++KA +VF  +P +GL  WNS+IIGLA+NG   EAIE FS+L+ S  KPD VSFI
Sbjct: 300  CKCGSIEKAVEVFAASPTRGLPCWNSIIIGLALNGHEREAIEYFSKLESSNIKPDCVSFI 359

Query: 1740 GVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANMT- 1564
            GVLT+  + G V EAR YF +M    +I+P+IKHY+C+V+ LG A  LEEAE  I  M+ 
Sbjct: 360  GVLTACKYLGAVREARDYFALMMDKYEIEPSIKHYTCLVEVLGHAALLEEAEEVIKGMSI 419

Query: 1563 -------QTLL-----HGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAA-MYFCHAYR 1423
                    +LL     HG        N+EI +   +        N  +A+  +   +   
Sbjct: 420  EADFIIWGSLLSSCRKHG--------NVEIAK---RAAQRVFELNPREASGYLLMSNVQA 468

Query: 1422 SSARFEDTMNLRLLTQKTGLRKEPGCSLIEVN-EVHEFVAGGVLHPQVVEIHALLGE 1255
            +S +FE+ +  R+L ++  + KEPGCS IE++ EVHEF+AGG LHP+  EI++LL +
Sbjct: 469  ASNQFEEALEHRILMKERLVEKEPGCSSIELHGEVHEFLAGGRLHPKDREIYSLLND 525


>ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Glycine max]
          Length = 534

 Score =  514 bits (1323), Expect = e-142
 Identities = 280/533 (52%), Positives = 360/533 (67%), Gaps = 24/533 (4%)
 Frame = -2

Query: 2760 VSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNY 2581
            ++KFISD PCL+ML+  CT M +L K+HA++IK GLA  T+AASR+L FCA+S +GD+NY
Sbjct: 18   IAKFISDQPCLTMLQTQCTNMKDLQKIHAHIIKTGLAHHTVAASRVLTFCASS-SGDINY 76

Query: 2580 ALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAYA 2401
            A  +F+ I  PNL+ WNTIIRGFS+SS P   ISLF+DML +S + PQRLTYPS+FKAYA
Sbjct: 77   AYLLFTTIPSPNLYCWNTIIRGFSRSSTPHLAISLFVDMLCSS-VLPQRLTYPSVFKAYA 135

Query: 2400 RLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------FDEDSSFDAVAWNS 2239
            +LG   DGA LHGRV+KLGLE D  ++NTII+MYAN G L      FDE    D VA NS
Sbjct: 136  QLGAGYDGAQLHGRVVKLGLEKDQFIQNTIIYMYANSGLLSEARRVFDELVDLDVVACNS 195

Query: 2238 MIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHPT 2059
            MIMGLAK G+VD+SRRLFD M +++ VTWNSMISGYVRN RL EA +LF +MQ + + P+
Sbjct: 196  MIMGLAKCGEVDKSRRLFDNMPTRTRVTWNSMISGYVRNKRLMEALELFRKMQGERVEPS 255

Query: 2058 EFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFN 1879
            EF + SLL+AC  LGAL+ GEW+  Y ++   E+N IVLTAI++ YCKCG + KA +VF 
Sbjct: 256  EFTMVSLLSACAHLGALKHGEWVHDYVKRGHFELNVIVLTAIIDMYCKCGVIVKAIEVFE 315

Query: 1878 DAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGE 1699
             +P +GLS WNS+IIGLA+NG   +AIE FS+L+ S  KPD VSFIGVLT+  + G VG+
Sbjct: 316  ASPTRGLSCWNSIIIGLALNGYERKAIEYFSKLEASDLKPDHVSFIGVLTACKYIGAVGK 375

Query: 1698 ARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANMTQTLLHGHPCFQLV*N 1519
            AR YF +M    +I+P+IKHY+CMV+ LG+A  LEEAE         L+ G P       
Sbjct: 376  ARDYFSLMMNKYEIEPSIKHYTCMVEVLGQAALLEEAE--------QLIKGMPL-----K 422

Query: 1518 MEILRWRNKQQSNCSNWNQV---KAAAMYFCHAYRSSA--------------RFEDTMNL 1390
             + + W     S+C     V   K AA   C    S A              +FE+ M  
Sbjct: 423  ADFIIW-GSLLSSCRKHGNVEIAKRAAQRVCELNPSDASGYLLMSNVQAASNQFEEAMEQ 481

Query: 1389 RLLTQKTGLRKEPGCSLIEV-NEVHEFVAGGVLHPQVVEIHALLGELSLMLKE 1234
            R+L ++    KEPGCS IE+  EVHEF+AGG LHP+  EI+ LL + S  L++
Sbjct: 482  RILMRERLAEKEPGCSSIELYGEVHEFLAGGRLHPKAREIYYLLNDSSFALQD 534


>ref|XP_004491336.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Cicer arietinum]
          Length = 536

 Score =  502 bits (1292), Expect = e-139
 Identities = 267/535 (49%), Positives = 362/535 (67%), Gaps = 12/535 (2%)
 Frame = -2

Query: 2802 PCFCXXXXXXXXXSVSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRL 2623
            PC           S+SKFISD PCL+ML+N CTT+ + H ++ ++IK GL  + IA++R+
Sbjct: 4    PCSLFSQSPPPPPSISKFISDQPCLTMLQNHCTTLKHFHMIYPHIIKTGLTHNPIASTRV 63

Query: 2622 LAFCATSPAGDVNYALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQ 2443
            L FCA SP+G++NYA  +F+++ +PNL++WNTIIR FS+SS PQ  ISLF+DML  S IQ
Sbjct: 64   LTFCA-SPSGNINYAYKLFARMPNPNLYSWNTIIRAFSRSSTPQFAISLFVDMLY-SQIQ 121

Query: 2442 PQRLTYPSLFKAYARLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------ 2281
            PQ LTYPS+FKAYA+L     G+ LHG V+KLGL+ D  + NTII+MYAN G L      
Sbjct: 122  PQHLTYPSVFKAYAQLSAGDYGSQLHGMVVKLGLQRDQFIHNTIIYMYANSGLLSEAKRV 181

Query: 2280 FDEDSSF-DAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEA 2104
            FDE     D VA+NSMIMG AK G++DE+R+LFD+M ++++VTWNSMISGYVRNG+L EA
Sbjct: 182  FDEKLELGDVVAFNSMIMGFAKCGEIDEARKLFDEMFTRTSVTWNSMISGYVRNGKLMEA 241

Query: 2103 FDLFFQMQ-NQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVE 1927
             +LF +MQ  + + P+EF + SLL AC  LGAL+ G+W+  Y +++  E+N IVLTAI++
Sbjct: 242  LELFHKMQLEERVEPSEFTMVSLLNACAHLGALQHGKWVHDYIKRNDFELNVIVLTAIID 301

Query: 1926 KYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVS 1747
             YCKCGSV+ A QVF+  P +GLS WNS+IIGLA+NG   EA E FS L+LS  KPD VS
Sbjct: 302  MYCKCGSVENAIQVFDTYPGRGLSCWNSIIIGLAMNGHEREAFEFFSELELSKFKPDSVS 361

Query: 1746 FIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANM 1567
            FIGVLT+  H G V +A+ YF +M    KI+P+IKHY+CMV+ LG+A FLEEAE  I  M
Sbjct: 362  FIGVLTACKHLGAVDKAKDYFALMMNEYKIEPSIKHYTCMVEVLGQAAFLEEAEELIQGM 421

Query: 1566 ---TQTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMYFCHAYRSSARFEDTM 1396
                  ++ G        +  + R +   Q          +  +   + Y +S +FE+ +
Sbjct: 422  PIKPDAIIWGSLLSSCRKHGNVQRAKRAAQRVYELNPSDASGYVLMSNVYAASNKFEEAV 481

Query: 1395 NLRLLTQKTGLRKEPGCSLIEV-NEVHEFVAGGVLHPQVVEIHALLGELSLMLKE 1234
              R+L ++    KEPGCS IE+  EVHEF+AGG LHP+  EI+ LL + S  L++
Sbjct: 482  EQRVLMKENLTEKEPGCSSIELYGEVHEFLAGGRLHPKTQEIYHLLNDSSFSLQD 536


>ref|XP_004151347.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Cucumis sativus]
            gi|449530724|ref|XP_004172343.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Cucumis sativus]
          Length = 543

 Score =  494 bits (1272), Expect = e-137
 Identities = 262/518 (50%), Positives = 360/518 (69%), Gaps = 13/518 (2%)
 Frame = -2

Query: 2748 ISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNYALSV 2569
            IS+ P LSM++  CTTM +L + HA+LIK+G A ++ AASR+LAFCA SP G+++YA  V
Sbjct: 21   ISNQPYLSMVDKYCTTMRDLQQFHAHLIKSGQAIESFAASRILAFCA-SPLGNMDYAYLV 79

Query: 2568 FSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAYARLGL 2389
            F ++Q+PNLF+WNT+IRGFSQSSNPQ  + LFIDML +S ++PQRLTYPS+FKAY++LGL
Sbjct: 80   FLQMQNPNLFSWNTVIRGFSQSSNPQIALYLFIDMLVSSQVEPQRLTYPSIFKAYSQLGL 139

Query: 2388 ARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------FDEDSSFDAVAWNSMIMG 2227
            A DGA LHGR++KLGL+ DP +RNTI++MYA  GFL      F+++  FD V+WNSMI+G
Sbjct: 140  AHDGAQLHGRIIKLGLQFDPFIRNTILYMYATGGFLSEARRIFNQEMEFDVVSWNSMILG 199

Query: 2226 LAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHPTEFAL 2047
            LAK G++DESR+LFDKM  K+ ++WNSMI GYVRNG  KEA  LF +MQ + I P+EF +
Sbjct: 200  LAKCGEIDESRKLFDKMPVKNPISWNSMIGGYVRNGMFKEALKLFIKMQEERIQPSEFTM 259

Query: 2046 ASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPK 1867
             SLL A   +GAL QG WI  Y +K+ +++N+IV+TAI++ YCKCGS+  A QVF   P 
Sbjct: 260  VSLLNASAQIGALRQGVWIHEYIKKNNLQLNAIVVTAIIDMYCKCGSIGNALQVFEKIPC 319

Query: 1866 KGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKY 1687
            + LS+WNSMI GLA+NG  +EAI +F  L+ S  KPD +SF+ VLT+ NH  MV E  ++
Sbjct: 320  RSLSSWNSMIFGLAVNGCEKEAILVFKMLESSSLKPDCISFMAVLTACNHGAMVDEGMEF 379

Query: 1686 FLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANM---TQTLLHG--HPCFQLV* 1522
            F  M    +I+P+IKHY+ MVD + RAGFLEEAE  I  M      ++ G      ++  
Sbjct: 380  FSRMKNTYRIEPSIKHYNLMVDMISRAGFLEEAEQFIKTMPIEKDAIIWGCLLSACRIYG 439

Query: 1521 NMEILRWRNKQQSNCSNWNQVKAAAMYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCS 1342
            N E+ +   ++ +       +    M   HA+ ++  F   M  R+  +   + KEPG S
Sbjct: 440  NTEMAKRAAEKVNELDPEETMGYVLMANIHAWGNN--FVGAMEKRVAMRMKKVEKEPGGS 497

Query: 1341 LIEVN-EVHEFVA-GGVLHPQVVEIHALLGELSLMLKE 1234
             IEV+ EVHEF+A GG LH +  EI+ +LG+L +ML++
Sbjct: 498  FIEVDEEVHEFIAGGGRLHRKAQEIYIVLGQLGVMLQD 535


>gb|EYU29595.1| hypothetical protein MIMGU_mgv1a025435mg [Mimulus guttatus]
          Length = 505

 Score =  491 bits (1265), Expect = e-136
 Identities = 267/506 (52%), Positives = 343/506 (67%), Gaps = 21/506 (4%)
 Frame = -2

Query: 2748 ISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCAT-SPAGDVNYALS 2572
            I+D P LS+LE +C T+ +L K+HA LIK GLA+DTIA SR+LAFCA   PA D++YA S
Sbjct: 2    IADQPFLSLLETNCHTIKDLTKIHAQLIKTGLAKDTIAVSRILAFCAAPGPARDLDYAFS 61

Query: 2571 VFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAYARLG 2392
            VFS I+ PNLFTWNTIIRGF QSS+P   ISLF+DML  S ++P+ LTYPS+FKAY +LG
Sbjct: 62   VFSHIEKPNLFTWNTIIRGFCQSSHPHVAISLFVDMLTNSTLEPENLTYPSVFKAYTQLG 121

Query: 2391 LARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGF------LFDEDSSFDAVAWNSMIM 2230
            LA DGA LHGR++KLG E DP +RN+II MYA+CG       LFDED   D VAWNSM+M
Sbjct: 122  LAGDGAQLHGRIIKLGFEHDPFIRNSIIHMYADCGLFGSARKLFDEDEDTDVVAWNSMVM 181

Query: 2229 GLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHPTEFA 2050
            GLAK G+VDES RLF K+  ++ ++WN+MISGYVRNG+  +A  LF +MQ + I P+EF 
Sbjct: 182  GLAKCGEVDESWRLFCKIPCRNDISWNTMISGYVRNGKWVDALSLFAEMQQRQIRPSEFT 241

Query: 2049 LASLLTACGGLGALEQGEWICAYNRKS---KIEVNSIVLTAIVEKYCKCGSVDKAFQVFN 1879
            L S+L AC  LGALEQG+WI  Y +KS    I+ N+IV+TAI++ YCKCG +  A +VF 
Sbjct: 242  LVSMLNACAKLGALEQGKWIHRYIKKSDINNIDRNTIVVTAIIDMYCKCGDIKTAREVFE 301

Query: 1878 DAPKKGLSTWNSMIIGLAINGQGEEAIELFSRL-QLSGSKPDDVSFIGVLTSGNHCGMVG 1702
              P+K LS WNSMI+GLA NG  EEA +LF+ L Q S   PD VSFIGVLT+ NH   V 
Sbjct: 302  STPQKALSGWNSMILGLATNGFEEEAFQLFTELEQSSNLNPDSVSFIGVLTASNHSVRVD 361

Query: 1701 EARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANM---TQTLLHGHPCFQ 1531
            +AR+YF VM +   I+PTIKHY C+VD LGRAG +E+A   I +M      ++ G     
Sbjct: 362  KAREYFKVMKETYGIEPTIKHYGCLVDVLGRAGLIEQAAEVIKSMPMKPDAIIWGSLL-- 419

Query: 1530 LV*NMEILRWRNKQQSNCSNWNQVKA------AAMYFCHAYRSSARFEDTMNLRLLTQKT 1369
                    R R+   +  +  N + A      A +   + Y +S  F+  +N R   +K 
Sbjct: 420  ----SACRRCRDVGVAELAARNLLLAGPDETSAHVLMSNVYAASGDFKKAVNERTKMKKK 475

Query: 1368 GLRKEPGCSLIEVN-EVHEFVAGGVL 1294
             + K+PGCS IEV+ EVHEF++GG L
Sbjct: 476  KMEKQPGCSFIEVDGEVHEFLSGGDL 501


>ref|XP_003617444.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355518779|gb|AET00403.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 542

 Score =  485 bits (1248), Expect = e-134
 Identities = 259/524 (49%), Positives = 355/524 (67%), Gaps = 15/524 (2%)
 Frame = -2

Query: 2760 VSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNY 2581
            +SKFIS+HPCL+ML+N CTT+ + H+++ ++IK GL  + IA++R L FCA SP+G++NY
Sbjct: 21   ISKFISNHPCLTMLQNHCTTINHFHQIYPHIIKTGLTLNPIASTRALTFCA-SPSGNINY 79

Query: 2580 ALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAYA 2401
            A  +F ++ +PNL++WNTIIR FS+SS PQ  ISLF+DML  S IQPQ LTYPS+FKAYA
Sbjct: 80   AYKLFVRMPNPNLYSWNTIIRAFSRSSTPQFAISLFVDMLY-SQIQPQYLTYPSVFKAYA 138

Query: 2400 RLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFLFDEDSSFDA----------V 2251
            +LG A  GA LHGRV+KLGL++D  + NTII+MYAN G + +    FD           V
Sbjct: 139  QLGHAHYGAQLHGRVVKLGLQNDQFICNTIIYMYANGGLMSEARRVFDGKKLELYDHDVV 198

Query: 2250 AWNSMIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ* 2071
            A NSMIMG AK G++DESR LFD M ++++V+WNSMISGYVRNG+L EA +LF +MQ + 
Sbjct: 199  AINSMIMGYAKCGEIDESRNLFDDMITRTSVSWNSMISGYVRNGKLMEALELFNKMQVEG 258

Query: 2070 IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAF 1891
               +EF + SLL AC  LGAL+ G+W+  Y +++  E+N IV+TAI++ YCKCGSV+ A 
Sbjct: 259  FEVSEFTMVSLLNACAHLGALQHGKWVHDYIKRNHFELNVIVVTAIIDMYCKCGSVENAV 318

Query: 1890 QVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSG-SKPDDVSFIGVLTSGNHC 1714
            +VF   P++GLS WNS+IIGLA+NG   EA E FS+L+ S   KPD VSFIGVLT+  H 
Sbjct: 319  EVFETCPRRGLSCWNSIIIGLAMNGHEREAFEFFSKLESSKLLKPDSVSFIGVLTACKHL 378

Query: 1713 GMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANM---TQTLLHGH 1543
            G + +AR YF +M    +I+P+IKHY+C+VD LG+AG LEEAE  I  M      ++ G 
Sbjct: 379  GAINKARDYFELMMNKYEIEPSIKHYTCIVDVLGQAGLLEEAEELIKGMPLKPDAIIWGS 438

Query: 1542 PCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMYFCHAYRSSARFEDTMNLRLLTQKTGL 1363
                   +  +   R   Q          +  +   + + +S +FE+ +  RLL ++   
Sbjct: 439  LLSSCRKHRNVQIARRAAQRVYELNPSDASGYVLMSNVHAASNKFEEAIEQRLLMKENLT 498

Query: 1362 RKEPGCSLIEV-NEVHEFVAGGVLHPQVVEIHALLGELSLMLKE 1234
             KEPGCS IE+  EVHEF+AGG LHP+  EI+ LL + S   ++
Sbjct: 499  EKEPGCSSIELYGEVHEFIAGGRLHPKTQEIYHLLNDSSFAFQD 542


>ref|XP_002880012.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297325851|gb|EFH56271.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 542

 Score =  477 bits (1228), Expect = e-131
 Identities = 260/518 (50%), Positives = 352/518 (67%), Gaps = 15/518 (2%)
 Frame = -2

Query: 2757 SKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNYA 2578
            S F+S + CL +++  C+TM  L ++HANLIK GL  DT+AASR+LAFC  SP+ D NYA
Sbjct: 19   SGFVSGNTCLRLIDTRCSTMRELKQIHANLIKTGLISDTVAASRVLAFCCASPS-DRNYA 77

Query: 2577 LSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSP-IQPQRLTYPSLFKAYA 2401
              VF++I H N F WNTIIRGFS+SS P+  IS+FIDML +SP ++PQRLTYPS+FKAYA
Sbjct: 78   YLVFTRINHKNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKAYA 137

Query: 2400 RLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------FDEDSSFDAVAWNS 2239
             LGLARDG  LHGRV+K GLE D  +RNT++ MY  CG L      F     FD VAWNS
Sbjct: 138  SLGLARDGRQLHGRVIKEGLEDDSFIRNTMLHMYVTCGCLVEAWRLFVGMMGFDVVAWNS 197

Query: 2238 MIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHPT 2059
            +IMGLAK G +D++++LFD+M  ++ V+WNSMISG+VRNGR K+A ++F +MQ + + P 
Sbjct: 198  IIMGLAKCGLIDQAQKLFDEMPQRNGVSWNSMISGFVRNGRFKDALEMFREMQERDVKPD 257

Query: 2058 EFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFN 1879
             F + SLL AC  LGA EQG WI  Y  +++ E+NSIV+TA+++ YCKCG  ++  +VF 
Sbjct: 258  GFTMVSLLNACAYLGASEQGRWIHKYIVRNRFELNSIVITALIDMYCKCGCFEEGLKVFE 317

Query: 1878 DAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGE 1699
             AP K LS WNSMI+GLA NG  E A++LF  L+ +G +PD VSFIGVLT+  H G V +
Sbjct: 318  CAPTKQLSCWNSMILGLANNGCEERAMDLFLELERTGLEPDSVSFIGVLTACAHSGEVHK 377

Query: 1698 ARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANMT---QTLLHGH--PCF 1534
            A ++F +M +   I+P+IKHY+CMV+ LG AG L+EAE  I  M     T++        
Sbjct: 378  AGEFFRLMREKYMIEPSIKHYTCMVNVLGGAGLLDEAEALIKKMPVEGDTIIWSSLLAAC 437

Query: 1533 QLV*NMEILRWRNKQQSNC-SNWNQVKAAA-MYFCHAYRSSARFEDTMNLRLLTQKTGLR 1360
            +   N+E+     K+ +NC  N +  +    +   +AY S   FE+ +  RLL ++  + 
Sbjct: 438  RKNGNVEMA----KRAANCLKNLDPDETCGYVLMSNAYASYGLFEEAVEQRLLMKERQME 493

Query: 1359 KEPGCSLIEVN-EVHEFVAGGVLHPQVVEIHALLGELS 1249
            KE GCS IEV+ EVHEFV+ G  HP+  EI++LLG L+
Sbjct: 494  KEVGCSSIEVDFEVHEFVSCGKKHPKSTEIYSLLGILN 531


>ref|NP_181820.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75206274|sp|Q9SJG6.1|PP200_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g42920, chloroplastic; Flags: Precursor
            gi|4512663|gb|AAD21717.1| hypothetical protein
            [Arabidopsis thaliana] gi|20197867|gb|AAM15291.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|110738441|dbj|BAF01146.1| hypothetical protein
            [Arabidopsis thaliana] gi|330255093|gb|AEC10187.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 559

 Score =  467 bits (1202), Expect = e-128
 Identities = 254/515 (49%), Positives = 342/515 (66%), Gaps = 15/515 (2%)
 Frame = -2

Query: 2748 ISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNYALSV 2569
            +S +  L +++  C+TM  L ++HA+LIK GL  DT+ ASR+LAFC  SP+ D+NYA  V
Sbjct: 22   LSGNTYLRLIDTQCSTMRELKQIHASLIKTGLISDTVTASRVLAFCCASPS-DMNYAYLV 80

Query: 2568 FSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSP-IQPQRLTYPSLFKAYARLG 2392
            F++I H N F WNTIIRGFS+SS P+  IS+FIDML +SP ++PQRLTYPS+FKAY RLG
Sbjct: 81   FTRINHKNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKAYGRLG 140

Query: 2391 LARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFLFDEDS------SFDAVAWNSMIM 2230
             ARDG  LHG V+K GLE D  +RNT++ MY  CG L +          FD VAWNSMIM
Sbjct: 141  QARDGRQLHGMVIKEGLEDDSFIRNTMLHMYVTCGCLIEAWRIFLGMIGFDVVAWNSMIM 200

Query: 2229 GLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHPTEFA 2050
            G AK G +D+++ LFD+M  ++ V+WNSMISG+VRNGR K+A D+F +MQ + + P  F 
Sbjct: 201  GFAKCGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGFT 260

Query: 2049 LASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAP 1870
            + SLL AC  LGA EQG WI  Y  +++ E+NSIV+TA+++ YCKCG +++   VF  AP
Sbjct: 261  MVSLLNACAYLGASEQGRWIHEYIVRNRFELNSIVVTALIDMYCKCGCIEEGLNVFECAP 320

Query: 1869 KKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARK 1690
            KK LS WNSMI+GLA NG  E A++LFS L+ SG +PD VSFIGVLT+  H G V  A +
Sbjct: 321  KKQLSCWNSMILGLANNGFEERAMDLFSELERSGLEPDSVSFIGVLTACAHSGEVHRADE 380

Query: 1689 YFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANM---TQTLLHGH--PCFQLV 1525
            +F +M +   I+P+IKHY+ MV+ LG AG LEEAE  I NM     T++        + +
Sbjct: 381  FFRLMKEKYMIEPSIKHYTLMVNVLGGAGLLEEAEALIKNMPVEEDTVIWSSLLSACRKI 440

Query: 1524 *NMEILRWRNKQQSNCSNWNQVKAAAMY--FCHAYRSSARFEDTMNLRLLTQKTGLRKEP 1351
             N+E+     K+ + C           Y    +AY S   FE+ +  RLL ++  + KE 
Sbjct: 441  GNVEMA----KRAAKCLKKLDPDETCGYVLLSNAYASYGLFEEAVEQRLLMKERQMEKEV 496

Query: 1350 GCSLIEVN-EVHEFVAGGVLHPQVVEIHALLGELS 1249
            GCS IEV+ EVHEF++ G  HP+  EI++LL  L+
Sbjct: 497  GCSSIEVDFEVHEFISCGGTHPKSAEIYSLLDILN 531


>ref|XP_006411565.1| hypothetical protein EUTSA_v10017572mg [Eutrema salsugineum]
            gi|557112734|gb|ESQ53018.1| hypothetical protein
            EUTSA_v10017572mg [Eutrema salsugineum]
          Length = 546

 Score =  465 bits (1196), Expect = e-128
 Identities = 246/523 (47%), Positives = 341/523 (65%), Gaps = 23/523 (4%)
 Frame = -2

Query: 2748 ISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNYALSV 2569
            +S +  + +++  C+TM  L ++HANLIK GL  DTIAASR+LAFC TSP+ D++YA  +
Sbjct: 22   VSGNSHIRLIDTQCSTMRELKQIHANLIKTGLISDTIAASRVLAFCCTSPS-DMSYAYLL 80

Query: 2568 FSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSPIQPQRLTYPSLFKAYARLGL 2389
            F++I H N F WNTIIRGFS+SS P+  I++FIDM  ++  +PQRLTYPS+FKAYA LG 
Sbjct: 81   FTRINHKNPFVWNTIIRGFSRSSFPEMSITIFIDMFSSASAKPQRLTYPSVFKAYASLGK 140

Query: 2388 ARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGF------LFDEDSSFDAVAWNSMIMG 2227
            ARDG  LHG V+K GLE D  +RNT++ MYA CG       +F     FD VAWNSM+MG
Sbjct: 141  ARDGMQLHGMVIKEGLEDDSFIRNTMLHMYATCGCFVEAWRIFMAMKHFDVVAWNSMMMG 200

Query: 2226 LAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHPTEFAL 2047
            LA+ G ++++++LFD+M  ++ ++WNSMISG+V+NGR K+A ++F +MQ + + P  F +
Sbjct: 201  LARYGLIEQAQKLFDEMPQRNEISWNSMISGFVKNGRFKDALEMFRKMQERNVKPDGFTM 260

Query: 2046 ASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPK 1867
             SLL AC  LGA EQG WI  Y  K++ E+NSIV+TA+++ YCKCG +++  +VF  AP 
Sbjct: 261  VSLLNACAYLGASEQGRWIHEYIVKNRFELNSIVITALIDMYCKCGCIEEGLRVFESAPN 320

Query: 1866 KGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKY 1687
            K LS WNSM++GLA NG  E A++LFS L+ S  +PD VSFIGVLT+  + G V EA ++
Sbjct: 321  KQLSCWNSMVLGLANNGYEERAMDLFSELESSDLEPDSVSFIGVLTACAYSGKVDEAGEF 380

Query: 1686 FLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANMTQTLLHGHPCFQLV*NMEIL 1507
            F +M +   I+P+IKHY+CMV+ LG AG LEEAE  I NM                 + +
Sbjct: 381  FRLMREKYLIEPSIKHYTCMVNVLGGAGLLEEAEAMIKNMPM-------------EQDAI 427

Query: 1506 RWRNKQQSNCSNWNQVKAAAMYFC----------------HAYRSSARFEDTMNLRLLTQ 1375
             W +   +   N N   A     C                +AY S   FE+ +  R+L +
Sbjct: 428  IWSSLLSACRKNGNVEMAERAAKCLKKLDPDDTCGYVLMSNAYASYGLFEEAVEQRVLMK 487

Query: 1374 KTGLRKEPGCSLIEVN-EVHEFVAGGVLHPQVVEIHALLGELS 1249
            +  + KE GCS IEV+ EVHEFV+ G  HP+  EI++LLG L+
Sbjct: 488  ERQMEKEIGCSSIEVDFEVHEFVSCGKRHPKSSEIYSLLGVLN 530


>ref|XP_006293939.1| hypothetical protein CARUB_v10022931mg [Capsella rubella]
            gi|565472276|ref|XP_006293940.1| hypothetical protein
            CARUB_v10022931mg [Capsella rubella]
            gi|482562647|gb|EOA26837.1| hypothetical protein
            CARUB_v10022931mg [Capsella rubella]
            gi|482562648|gb|EOA26838.1| hypothetical protein
            CARUB_v10022931mg [Capsella rubella]
          Length = 555

 Score =  459 bits (1180), Expect = e-126
 Identities = 260/549 (47%), Positives = 353/549 (64%), Gaps = 20/549 (3%)
 Frame = -2

Query: 2760 VSKFISDHPCLSMLENSCTTMANLHKLHANLIKAGLARDTIAASRLLAFCATSPAGDVNY 2581
            V+ F S    L +++  C+TM  L ++H NLIK GL  DT+AASR+LAFC  SP+ D+NY
Sbjct: 13   VAAFPSPASYLRLIDTQCSTMRELKQIHGNLIKTGLISDTVAASRVLAFCCASPS-DMNY 71

Query: 2580 ALSVFSKIQHPNLFTWNTIIRGFSQSSNPQNVISLFIDMLQTSP-IQPQRLTYPSLFKAY 2404
            A  VF++I H N F WNTIIRGFSQSS P+  IS+FIDML +SP ++PQ LTYPS+FKAY
Sbjct: 72   AYLVFTRINHKNPFVWNTIIRGFSQSSFPEMAISIFIDMLCSSPSVKPQNLTYPSVFKAY 131

Query: 2403 ARLGLARDGA*LHGRVLKLGLESDPLVRNTIIFMYANCGFL------FDEDSSFDAVAWN 2242
             RLG A DG  LHGRVLK GLE D  +RNT++ MY   G L      F   + FD VAWN
Sbjct: 132  GRLGQAIDGRQLHGRVLKEGLEDDSFIRNTMLQMYVTSGCLVEAWRIFVGMTDFDVVAWN 191

Query: 2241 SMIMGLAKSGQVDESRRLFDKMESKSTVTWNSMISGYVRNGRLKEAFDLFFQMQNQ*IHP 2062
            SMIMGLAK G + ++++LFD+M  ++ V+WNSMISG+VRNGR K+A ++F +MQ + + P
Sbjct: 192  SMIMGLAKCGLISQAQQLFDEMPHRNEVSWNSMISGFVRNGRFKDALEMFREMQERNVKP 251

Query: 2061 TEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVF 1882
              F + SLL AC  LGA EQG WI  Y  +++ E+NSIV+TA++E YCKCG +++  +VF
Sbjct: 252  DGFTMVSLLNACAYLGANEQGRWIHEYIARNRFELNSIVITALIEMYCKCGCIEEGLKVF 311

Query: 1881 NDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVG 1702
              APKK LS WNSMI+GLA NG  E A++LF  L+  G +PD VSFIGVLT+  + G V 
Sbjct: 312  ECAPKKQLSCWNSMILGLANNGCEERAMDLFLELERFGLEPDSVSFIGVLTACAYSGEVH 371

Query: 1701 EARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANMT--------QTLLHG 1546
            +A  +F +M +   ++P+IKHY+CMV+ LG AG L+EAE  I  M          +LL  
Sbjct: 372  KAGGFFRLMREKYMVEPSIKHYTCMVNVLGGAGLLDEAESLIKKMPVEEDAIIWSSLLAA 431

Query: 1545 HPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMY--FCHAYRSSARFEDTMNLRLLTQK 1372
               +    N+E+     K+ + C           Y    +AY     FE+ +  R+L ++
Sbjct: 432  CRKYS---NVEMA----KRAAKCLKKLDPDETCGYVLMSNAYAHYGLFEEAVEQRILMKE 484

Query: 1371 TGLRKEPGCSLIEVN-EVHEFVAGGVLHPQVVEIHALLGELSLMLKEVERA--QHGTLNF 1201
              + KE GCS IEV+ EVHEFV+ G  HP+  EI++LLG L+  +  ++      G+LN 
Sbjct: 485  RKMEKEVGCSSIEVDFEVHEFVSCGKRHPKSTEIYSLLGILNWDVTAIKSGLLLLGSLNC 544

Query: 1200 LDSEPGEMS 1174
            L  + G +S
Sbjct: 545  LILQLGSVS 553

