BLASTX nr result

ID: Sinomenium22_contig00018620 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00018620
         (981 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containi...   337   3e-90
ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citr...   337   6e-90
ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containi...   333   6e-89
ref|XP_002306741.1| pentatricopeptide repeat-containing family p...   327   5e-87
ref|XP_002534070.1| pentatricopeptide repeat-containing protein,...   326   1e-86
ref|XP_004305832.1| PREDICTED: pentatricopeptide repeat-containi...   323   5e-86
ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containi...   316   1e-83
ref|XP_007048864.1| Pentatricopeptide repeat (PPR-like) superfam...   316   1e-83
ref|XP_004250888.1| PREDICTED: pentatricopeptide repeat-containi...   313   7e-83
gb|EXB44509.1| hypothetical protein L484_000760 [Morus notabilis...   306   8e-81
ref|XP_004491336.1| PREDICTED: pentatricopeptide repeat-containi...   295   2e-77
ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containi...   294   3e-77
ref|XP_007141857.1| hypothetical protein PHAVU_008G231600g [Phas...   293   9e-77
gb|EYU29595.1| hypothetical protein MIMGU_mgv1a025435mg [Mimulus...   285   2e-74
ref|XP_004151347.1| PREDICTED: pentatricopeptide repeat-containi...   279   1e-72
ref|XP_003617444.1| Pentatricopeptide repeat-containing protein ...   277   5e-72
ref|NP_181820.1| pentatricopeptide repeat-containing protein [Ar...   275   3e-71
ref|XP_006411565.1| hypothetical protein EUTSA_v10017572mg [Eutr...   272   2e-70
ref|XP_002880012.1| pentatricopeptide repeat-containing protein ...   268   2e-69
ref|XP_006293939.1| hypothetical protein CARUB_v10022931mg [Caps...   260   7e-67

>ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic [Vitis vinifera]
            gi|302143555|emb|CBI22116.3| unnamed protein product
            [Vitis vinifera]
          Length = 533

 Score =  337 bits (865), Expect = 3e-90
 Identities = 183/349 (52%), Positives = 231/349 (66%), Gaps = 23/349 (6%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            I+MYANCGFL      F E   FD VAWNSMIMGLAK G+VDESR+LFD+M  ++TV+WN
Sbjct: 167  IYMYANCGFLSEMWKAFYERMDFDIVAWNSMIMGLAKCGEVDESRKLFDEMPLRNTVSWN 226

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMI GYVRNGRL+EA DLF QMQ + I P+EF + SLL A   LGAL+QGEWI  Y RK+
Sbjct: 227  SMISGYVRNGRLREALDLFGQMQEERIKPSEFTMVSLLNASARLGALKQGEWIHDYIRKN 286

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
              E+N IV  +I++ YCKCGS+ +AFQVF  AP KGLS+WN+MI+GLA+NG   EAI+LF
Sbjct: 287  NFELNVIVTASIIDMYCKCGSIGEAFQVFEMAPLKGLSSWNTMILGLAMNGCENEAIQLF 346

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
            SRL+ S  +PDDV+F+GVLT+ N+ G+V +A++YF +M+K  KI+P+IKHYSCMVD LGR
Sbjct: 347  SRLECSNLRPDDVTFVGVLTACNYSGLVDKAKEYFSLMSKTYKIEPSIKHYSCMVDTLGR 406

Query: 704  AGFLEEAEGPIANMTQTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMYFCHA 883
            AG LEEAE  I NM               N + + W +   S C     V+ A     H 
Sbjct: 407  AGLLEEAEELIRNMPV-------------NPDAIIW-SSLLSACRKHGNVELAKRAAKHI 452

Query: 884  -----------------YRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
                             Y +S +FE+ M  RL  ++  + KEPGCSLIE
Sbjct: 453  VDLDGNDSCGYVLLSNIYAASDQFEEAMEQRLSMKEKQIEKEPGCSLIE 501



 Score = 67.4 bits (163), Expect = 8e-09
 Identities = 51/211 (24%), Positives = 98/211 (46%), Gaps = 1/211 (0%)
 Frame = +2

Query: 98  GQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQMQN-Q*IHPTEFALASL 274
           G ++ +  +F ++ S +  +WN++I G+ ++     A  LF  M     + P      S+
Sbjct: 72  GDINYAYLVFTQIHSPNLFSWNTIIRGFSQSSTPHHAISLFIDMLIVSSVQPHRLTYPSV 131

Query: 275 LTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGL 454
             A   LG    G  +     K  ++ +  +   I+  Y  CG + + ++ F +     +
Sbjct: 132 FKAYAQLGLAHYGAQLHGRVIKLGLQFDPFIRNTIIYMYANCGFLSEMWKAFYERMDFDI 191

Query: 455 STWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLV 634
             WNSMI+GLA  G+ +E+ +LF  + L  +    VS+  +++     G + EA   F  
Sbjct: 192 VAWNSMIMGLAKCGEVDESRKLFDEMPLRNT----VSWNSMISGYVRNGRLREALDLFGQ 247

Query: 635 MTKICKIKPTIKHYSCMVDALGRAGFLEEAE 727
           M +  +IKP+      +++A  R G L++ E
Sbjct: 248 MQEE-RIKPSEFTMVSLLNASARLGALKQGE 277


>ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citrus clementina]
            gi|557539373|gb|ESR50417.1| hypothetical protein
            CICLE_v10031197mg [Citrus clementina]
          Length = 534

 Score =  337 bits (863), Expect = 6e-90
 Identities = 182/338 (53%), Positives = 230/338 (68%), Gaps = 12/338 (3%)
 Frame = +2

Query: 2    IFMYANCGFL------FDE-DSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNW 160
            I+MYANCGFL      FDE D+ FD VAWNSMI+GLAK G++DESRRLFDKM S++TV+W
Sbjct: 162  IYMYANCGFLSEARLIFDEVDTEFDVVAWNSMIIGLAKCGEIDESRRLFDKMVSRNTVSW 221

Query: 161  NSMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRK 340
            NSMI GYVRN + KEA +LF +MQ Q I P+EF + SLL AC  LGA+ QGEWI  +   
Sbjct: 222  NSMISGYVRNVKFKEALELFREMQEQNIKPSEFTMVSLLNACAKLGAIRQGEWIHNFLVT 281

Query: 341  SKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIEL 520
            +  E+N+IV+TAI++ YCKCG  ++A QVFN  PKKGLS WNSM+ GLA+NG   EAI+L
Sbjct: 282  NCFELNTIVVTAIIDMYCKCGCPERALQVFNTVPKKGLSCWNSMVFGLAMNGYENEAIKL 341

Query: 521  FSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALG 700
            FS LQ S  KPD +SFI VLT+ NH G V +A+ YF +MT+  KIKP+IKHYSCMVDALG
Sbjct: 342  FSGLQSSNLKPDYISFIAVLTACNHSGKVNQAKDYFTLMTETYKIKPSIKHYSCMVDALG 401

Query: 701  RAGFLEEAEGPIANM---TQTLLHGH--PCFQLV*NMEILRWRNKQQSNCSNWNQVKAAA 865
            RAG LEEAE  I +M      ++ G      +   N+E+ +   KQ              
Sbjct: 402  RAGLLEEAEKLIRSMPSDPDAIIWGSLLSACRKHGNIEMAKQAAKQIIELD--KNESCGY 459

Query: 866  MYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
            +   + Y +S +FE+ M  RLL ++  + KEPGCSLIE
Sbjct: 460  VLMSNLYAASYQFEEAMEERLLMKEVKIEKEPGCSLIE 497


>ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Citrus sinensis]
          Length = 534

 Score =  333 bits (854), Expect = 6e-89
 Identities = 181/338 (53%), Positives = 228/338 (67%), Gaps = 12/338 (3%)
 Frame = +2

Query: 2    IFMYANCGFL------FDE-DSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNW 160
            I+MYANCGFL      FDE D+ FD VAWNSMI+GLAK G++DESRRLFDKM S++TV+W
Sbjct: 162  IYMYANCGFLSEARLMFDEVDTEFDVVAWNSMIIGLAKCGEIDESRRLFDKMVSRNTVSW 221

Query: 161  NSMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRK 340
            NSMI GYVRN + KEA +LF +MQ Q I P+EF + SLL AC  LGA+ QGEWI  +   
Sbjct: 222  NSMISGYVRNVKFKEALELFREMQEQNIKPSEFTMVSLLNACAKLGAIRQGEWIHNFLVT 281

Query: 341  SKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIEL 520
            +  E+N+IV+TAI++ YCKCG  ++A QVFN  PKKGLS WNSM+ GLA+NG   EAI+L
Sbjct: 282  NCFELNTIVVTAIIDMYCKCGCPERALQVFNTVPKKGLSCWNSMVFGLAMNGYENEAIKL 341

Query: 521  FSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALG 700
            FS LQ S   PD  SFI VLT+ NH G V +A+ YF +MT+  KIKP+IKHYSCMVDALG
Sbjct: 342  FSGLQSSNLTPDYTSFIAVLTACNHSGKVNQAKDYFTLMTETYKIKPSIKHYSCMVDALG 401

Query: 701  RAGFLEEAEGPIANM---TQTLLHGH--PCFQLV*NMEILRWRNKQQSNCSNWNQVKAAA 865
            RAG LEEAE  I +M      ++ G      +   N+E+ +   KQ              
Sbjct: 402  RAGLLEEAEKLIRSMPSDPDAIIWGSLLSACRKHGNIEMAKQAAKQIIELD--KNESCGY 459

Query: 866  MYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
            +   + Y +S +FE+ M  RLL ++  + KEPGCSLIE
Sbjct: 460  VLMSNLYAASYQFEEAMEERLLMKEVKIEKEPGCSLIE 497


>ref|XP_002306741.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222856190|gb|EEE93737.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 509

 Score =  327 bits (838), Expect = 5e-87
 Identities = 174/336 (51%), Positives = 228/336 (67%), Gaps = 12/336 (3%)
 Frame = +2

Query: 8    MYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWNSM 169
            MY NCGFL      FD  + FD V WN+MI+GLAK G++D+SRRLFDKM  ++TV+WNSM
Sbjct: 141  MYVNCGFLGEAQRIFDGATGFDVVTWNTMIIGLAKCGEIDKSRRLFDKMLLRNTVSWNSM 200

Query: 170  IGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKI 349
            I GYVR GR  EA +LF +MQ + I P+EF + SLL AC  LGAL QGEWI  Y  K+  
Sbjct: 201  ISGYVRKGRFFEAMELFSRMQEEGIKPSEFTMVSLLNACACLGALRQGEWIHDYIVKNNF 260

Query: 350  EVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSR 529
             +NSIV+TAI++ Y KCGS+DKA QVF  APKKGLS WNS+I+GLA++G+G EA+ LFS+
Sbjct: 261  ALNSIVITAIIDMYSKCGSIDKALQVFKSAPKKGLSCWNSLILGLAMSGRGNEAVRLFSK 320

Query: 530  LQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAG 709
            L+ S  KPD VSFIGVLT+ NH GMV  A+ YFL+M++  KI+P+IKHYSCMVD LGRAG
Sbjct: 321  LESSNLKPDHVSFIGVLTACNHAGMVDRAKDYFLLMSETYKIEPSIKHYSCMVDVLGRAG 380

Query: 710  FLEEAEGPIANM---TQTLLHG---HPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMY 871
             LEEAE  I +M      ++ G     C +   N+E+ +   K+ +         ++ + 
Sbjct: 381  LLEEAEELIKSMPVNPDAIIWGSLLSSCREYG-NIEMAKQAAKRVNELD--PNESSSFIL 437

Query: 872  FCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
              + Y +   FE+ +  RL  ++  + KEPGCSLIE
Sbjct: 438  LSNVYAAHNHFEEAIEQRLSLKEKQMDKEPGCSLIE 473



 Score = 69.7 bits (169), Expect = 2e-09
 Identities = 56/245 (22%), Positives = 104/245 (42%), Gaps = 35/245 (14%)
 Frame = +2

Query: 95  SGQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQMQ--NQ*IHPTEFALA 268
           +G ++ +  +F ++ + +   WN++I G+ ++     A  LF  M   +    P      
Sbjct: 42  AGDINYAYLVFTQIRNPNLFVWNTIIRGFSQSSTPHNAISLFIDMMFTSPTTQPQRLTYP 101

Query: 269 SLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKK 448
           S+  A   LG   +G  +     K  +E +  +   I+  Y  CG + +A ++F+ A   
Sbjct: 102 SVFKAYAQLGLAHEGAQLHGRVIKLGLENDQFIQNTILNMYVNCGFLGEAQRIFDGATGF 161

Query: 449 GLSTWNSMIIGLAINGQGE-------------------------------EAIELFSRLQ 535
            + TWN+MIIGLA  G+ +                               EA+ELFSR+Q
Sbjct: 162 DVVTWNTMIIGLAKCGEIDKSRRLFDKMLLRNTVSWNSMISGYVRKGRFFEAMELFSRMQ 221

Query: 536 LSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKH--YSCMVDALGRAG 709
             G KP + + + +L   N C  +G  R+   +   I K    +     + ++D   + G
Sbjct: 222 EEGIKPSEFTMVSLL---NACACLGALRQGEWIHDYIVKNNFALNSIVITAIIDMYSKCG 278

Query: 710 FLEEA 724
            +++A
Sbjct: 279 SIDKA 283


>ref|XP_002534070.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223525897|gb|EEF28314.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 533

 Score =  326 bits (835), Expect = 1e-86
 Identities = 179/348 (51%), Positives = 224/348 (64%), Gaps = 22/348 (6%)
 Frame = +2

Query: 2    IFMYANCGF------LFDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            +FMY NCGF      +FD    FD VAWN+MIMG+AK G VDESRRLFDKM  ++ V+WN
Sbjct: 164  LFMYVNCGFTSEARKVFDRGMDFDIVAWNTMIMGVAKCGLVDESRRLFDKMSLRNAVSWN 223

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMI GYVRNGR  +A +LF +MQ + I P+EF + SLL AC  LGA+ QGEWI  Y  K 
Sbjct: 224  SMISGYVRNGRFFDALELFQKMQVERIEPSEFTMVSLLNACACLGAIRQGEWIHDYMVKK 283

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
            K E+N IV+TAI++ Y KCGS+DKA QVF  AP++GLS WNSMI+GLA+NGQ  EA++LF
Sbjct: 284  KFELNPIVVTAIIDMYSKCGSIDKAVQVFQSAPRRGLSCWNSMILGLAMNGQENEALQLF 343

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
            S LQ S  +PDDVSFI VLT+ +H GMV +A+ YFL+M    KIKP IKH+SCMVD LGR
Sbjct: 344  SVLQSSDLRPDDVSFIAVLTACDHTGMVDKAKDYFLLMRDKYKIKPGIKHFSCMVDVLGR 403

Query: 704  AGFLEEAEGPIANMTQTLLHGHPCFQLV*NMEILRWRNKQQSNC--SNWNQVKAAAMYF- 874
            AG LEEAE  I +M     H  P        + + W +   S C   N    K AA +  
Sbjct: 404  AGLLEEAEELIRSM-----HVDP--------DAIIWGSLLWSCCKYGNIKMAKRAANHLI 450

Query: 875  -------------CHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
                          +AY ++  FE+ +  RL  ++  + KEPGCS IE
Sbjct: 451  ELNPSESSSFVLVANAYAAANNFEEALKERLTLKENHIGKEPGCSCIE 498



 Score = 59.7 bits (143), Expect = 2e-06
 Identities = 48/246 (19%), Positives = 106/246 (43%), Gaps = 36/246 (14%)
 Frame = +2

Query: 95  SGQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQMQ-NQ*IHPTEFALAS 271
           +G ++ +  +F ++++ +   WN++I G+ R+   + +  L+  M     + P      S
Sbjct: 68  AGDINYAYLVFVQIQNPNIFAWNTIIRGFSRSSVPQNSISLYIDMLLTSPVQPQRLTYPS 127

Query: 272 LLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKG 451
           +  A   L    +G  +     K  +E +S +   I+  Y  CG   +A +VF+      
Sbjct: 128 VFKAFAQLDLASEGAQLHGKMIKLGLENDSFIRNTILFMYVNCGFTSEARKVFDRGMDFD 187

Query: 452 LSTWNSMIIGLA-------------------------------INGQGEEAIELFSRLQL 538
           +  WN+MI+G+A                                NG+  +A+ELF ++Q+
Sbjct: 188 IVAWNTMIMGVAKCGLVDESRRLFDKMSLRNAVSWNSMISGYVRNGRFFDALELFQKMQV 247

Query: 539 SGSKPDDVSFIGVLTSGNHCGMVGEARK----YFLVMTKICKIKPTIKHYSCMVDALGRA 706
              +P + + + +L   N C  +G  R+    +  ++ K  ++ P +   + ++D   + 
Sbjct: 248 ERIEPSEFTMVSLL---NACACLGAIRQGEWIHDYMVKKKFELNPIV--VTAIIDMYSKC 302

Query: 707 GFLEEA 724
           G +++A
Sbjct: 303 GSIDKA 308


>ref|XP_004305832.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 550

 Score =  323 bits (829), Expect = 5e-86
 Identities = 180/349 (51%), Positives = 222/349 (63%), Gaps = 23/349 (6%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            I MY+NCG L      FDED  FD VAWNSMIMGL+K G+V ESRRLFDKM  +++++WN
Sbjct: 163  IHMYSNCGLLSEARRVFDEDLEFDIVAWNSMIMGLSKCGEVGESRRLFDKMPQRNSISWN 222

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMIGG VRNG   EA DLF +MQ Q I P+EF + SLL A   LGA+ QGEWI  Y RK+
Sbjct: 223  SMIGGSVRNGMYTEALDLFGEMQKQKIKPSEFTMVSLLNASAQLGAIRQGEWIHEYIRKN 282

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
             I++N IV+TAI+  Y KCGS++KA  VF  AP+ GLS WNS+I+GLA NG  EEAIELF
Sbjct: 283  HIQLNPIVVTAIINMYSKCGSIEKAVHVFEAAPRTGLSCWNSIIMGLATNGCEEEAIELF 342

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
            SRL+ S   PDDVSF+GVLT+ +H GMV +ARKYF VM +  +I P+IKHYSCMVD LGR
Sbjct: 343  SRLKSSSFVPDDVSFLGVLTACSHSGMVEKARKYFSVMRETYRIAPSIKHYSCMVDVLGR 402

Query: 704  AGFLEEAEGPIANMTQTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMYFCHA 883
            AG LEEAE         L+ G P        + + W     S+C     ++ A     H 
Sbjct: 403  AGLLEEAE--------KLIDGMPL-----KADAIIW-GSLLSSCRKHRDIEMAKRAAKHV 448

Query: 884  -----------------YRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
                             Y +S++FE+ M  RL  +   + KEPGCSLIE
Sbjct: 449  IELDPSDCCGYVLMSNVYAASSQFEEAMRERLSMKGQKIEKEPGCSLIE 497



 Score = 69.7 bits (169), Expect = 2e-09
 Identities = 57/227 (25%), Positives = 103/227 (45%), Gaps = 2/227 (0%)
 Frame = +2

Query: 53  DAVAWNSMIMGLAK-SGQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQM 229
           D VA + ++   A  +G ++ +  +F  + + +   WN++I G+  +   + A  LF  M
Sbjct: 52  DTVAASRVLAFCASPAGDINYAYMVFRHIHNPNLFIWNTIIRGFSNSSNPEAAISLFIDM 111

Query: 230 Q-NQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGS 406
                + P      S+  A   LG    G  +     K  +E +  V   I+  Y  CG 
Sbjct: 112 LVTSTVQPQRLTYPSVFKAYAQLGLAHDGAQLHGRVVKLGLESDQFVRNTIIHMYSNCGL 171

Query: 407 VDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTS 586
           + +A +VF++  +  +  WNSMI+GL+  G+  E+  LF ++    S    +S+  ++  
Sbjct: 172 LSEARRVFDEDLEFDIVAWNSMIMGLSKCGEVGESRRLFDKMPQRNS----ISWNSMIGG 227

Query: 587 GNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAE 727
               GM  EA   F  M K  KIKP+      +++A  + G + + E
Sbjct: 228 SVRNGMYTEALDLFGEMQK-QKIKPSEFTMVSLLNASAQLGAIRQGE 273


>ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Solanum tuberosum]
          Length = 522

 Score =  316 bits (809), Expect = 1e-83
 Identities = 169/345 (48%), Positives = 226/345 (65%), Gaps = 19/345 (5%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            ++MYA+CGFL      FDED   D V+WNSMIMGLAKSG++D+S RLF KM +++ V+WN
Sbjct: 162  LYMYASCGFLVEARKLFDEDEIEDVVSWNSMIMGLAKSGEIDDSWRLFSKMSTRNDVSWN 221

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMI G+VRNG+  EA +LF  MQ + I P+EF L SLL ACG LGALEQG WI  Y +K+
Sbjct: 222  SMISGFVRNGKWNEALELFSTMQEENIKPSEFTLVSLLNACGHLGALEQGNWIYKYVKKN 281

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
             +E+N IV+TAI++ YCKCG+V+ A+ VF     KGLS+WNSMI+GLA NG  ++AI+LF
Sbjct: 282  NVELNVIVVTAIIDMYCKCGNVEMAWHVFISISNKGLSSWNSMILGLATNGFEDDAIKLF 341

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
            +RLQ S  KPD VSFIGVLT+ NH G+V +A+ YF +M K   I+P+IKHY CMVD LGR
Sbjct: 342  ARLQCSILKPDSVSFIGVLTACNHSGLVDKAKDYFQLMKKEYGIEPSIKHYGCMVDILGR 401

Query: 704  AGFLEEAEGPIANMTQ--------TLL-----HGHPCFQLV*NMEILRWRNKQQSNCSNW 844
            AG +EEA+  I +M          +LL     HG        NME+ RW  +        
Sbjct: 402  AGLVEEADEVIRSMKMEPDAVIWCSLLSACRSHG--------NMELARWSAENLLELD-- 451

Query: 845  NQVKAAAMYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
                +  +   + Y +S +F + ++ R+  +   + KEPGCS +E
Sbjct: 452  PNESSGYVLMANMYAASGQFAEAIDERISMKDKHIAKEPGCSSVE 496



 Score = 77.8 bits (190), Expect = 6e-12
 Identities = 58/227 (25%), Positives = 115/227 (50%), Gaps = 4/227 (1%)
 Frame = +2

Query: 53  DAVAWNSMIMGLAKS---GQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFF 223
           D +A + ++   AKS   G ++ +  +F  +E+ +   WN++I G+  +   + A  LF 
Sbjct: 49  DKIASSRVLAFSAKSPPIGDINYANLVFTHIENPNLFTWNTIIRGFSESSTPQYAIHLFI 108

Query: 224 QM-QNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKC 400
           +M  N  + P      S+  A    G ++ G  +     K  +E ++ +   ++  Y  C
Sbjct: 109 EMLNNSQVQPHLLTYPSVFKAYARGGLVKNGAQLHGRIIKLGLEFDTFIRNTMLYMYASC 168

Query: 401 GSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVL 580
           G + +A ++F++   + + +WNSMI+GLA +G+ +++  LFS++    S  +DVS+  ++
Sbjct: 169 GFLVEARKLFDEDEIEDVVSWNSMIMGLAKSGEIDDSWRLFSKM----STRNDVSWNSMI 224

Query: 581 TSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEE 721
           +     G   EA + F  M +   IKP+      +++A G  G LE+
Sbjct: 225 SGFVRNGKWNEALELFSTMQEE-NIKPSEFTLVSLLNACGHLGALEQ 270


>ref|XP_007048864.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao] gi|508701125|gb|EOX93021.1| Pentatricopeptide
            repeat (PPR-like) superfamily protein [Theobroma cacao]
          Length = 538

 Score =  316 bits (809), Expect = 1e-83
 Identities = 177/350 (50%), Positives = 223/350 (63%), Gaps = 24/350 (6%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDS-SFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNW 160
            I+MYANCG L      FDE+    D VAWNSMI+GLAK G+VDESRRLF+KM S++TV+W
Sbjct: 164  IYMYANCGLLSEAWRMFDEEHMELDIVAWNSMIIGLAKCGEVDESRRLFNKMVSRNTVSW 223

Query: 161  NSMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRK 340
            NSMI GYVRNGR  EA +LF +MQ + I P+EF + SLL AC  LGA+ QG+WI  Y  K
Sbjct: 224  NSMISGYVRNGRFLEALELFQEMQEEHIRPSEFTMVSLLNACACLGAITQGKWIHDYILK 283

Query: 341  SKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIEL 520
               E+N IV+TAI++ YCKCG+ +KA QVF  +PK+GLS WNSMI+GLA NG   EA +L
Sbjct: 284  QNFELNGIVVTAIIDMYCKCGNAEKALQVFTTSPKEGLSCWNSMILGLATNGCENEARQL 343

Query: 521  FSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALG 700
            FS+L+    KPD V+FIGVL + N  GMV +A+ YF +MT+  KIKPTIKHYSCMVD LG
Sbjct: 344  FSKLESLSLKPDHVTFIGVLMACNSAGMVDKAKYYFSLMTEKYKIKPTIKHYSCMVDVLG 403

Query: 701  RAGFLEEAEGPIANMTQTLLHGHPCFQLV*NMEILRWRNKQQSNC---SNWNQVKAAA-- 865
             AG LEEAE  I +M               N + + W     S C    N    K AA  
Sbjct: 404  NAGLLEEAEQLIRSMPV-------------NEDAIIW-GSLLSACRKHGNVGMAKRAAKL 449

Query: 866  ------------MYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
                        +   + Y ++ +FE+ +  RL  ++  L+KEPGCSLIE
Sbjct: 450  VIELDPAERSGYVLMSNVYAATRQFEEAIKQRLSMKEKQLQKEPGCSLIE 499


>ref|XP_004250888.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Solanum lycopersicum]
          Length = 522

 Score =  313 bits (802), Expect = 7e-83
 Identities = 167/345 (48%), Positives = 226/345 (65%), Gaps = 19/345 (5%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            ++MYA+CGFL      FDED   D V+WNSMI+GLAKSG++D+S RLF KM +++ V+WN
Sbjct: 162  LYMYASCGFLVEARKLFDEDEIEDVVSWNSMIIGLAKSGEIDDSWRLFSKMPTRNDVSWN 221

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMI G+VRNG+  EA +LF  MQ + + P+EF L SLL ACG LGALEQG WI  Y +K+
Sbjct: 222  SMISGFVRNGKWNEALELFSTMQEENVKPSEFTLVSLLNACGHLGALEQGNWIYKYVKKN 281

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
             +E+N IV+TAI++ YCKC +V+ A+ VF  +  KGLS+WNSMI+GLA NG  ++AI+LF
Sbjct: 282  NVELNVIVVTAIIDMYCKCANVEMAWHVFVSSSNKGLSSWNSMILGLATNGFEDDAIKLF 341

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
            +RLQ S  KPD VSFIGVLT+ NH G+V +A+ YF +M     I+P+IKHY CMVD LGR
Sbjct: 342  ARLQCSILKPDSVSFIGVLTACNHSGLVEKAKDYFQLMKMEYGIEPSIKHYGCMVDILGR 401

Query: 704  AGFLEEAEGPIANMTQ--------TLL-----HGHPCFQLV*NMEILRWRNKQQSNCSNW 844
            AG +EEAE  I +M          +LL     HG        N+E+ RW  +        
Sbjct: 402  AGLVEEAEEVIRSMKMEPDAVIWGSLLSACRSHG--------NVELARWSAENLLELD-- 451

Query: 845  NQVKAAAMYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
                +  +   + Y +S  F++ MN R+  ++  + KEPGCS +E
Sbjct: 452  PNESSGYVLMANMYAASGLFDEAMNERISMKEKHIAKEPGCSSVE 496



 Score = 77.4 bits (189), Expect = 8e-12
 Identities = 57/227 (25%), Positives = 113/227 (49%), Gaps = 4/227 (1%)
 Frame = +2

Query: 53  DAVAWNSMIMGLAKS---GQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFF 223
           D +A + ++   AKS   G ++ +  +F  +E+ +   WN++I G+  +   + A  LF 
Sbjct: 49  DKIAASRVLAFSAKSPPIGDINYANLVFTHIENPNPFTWNTIIRGFSESSTPQYAIHLFI 108

Query: 224 QM-QNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKC 400
           +M  N  + P      S+  A    G  + G  +     K  +E ++ +   ++  Y  C
Sbjct: 109 EMLNNSQVQPHLLTYPSVFKAYARGGIAKNGAQLHGRIMKLGLEFDTFIRNTLLYMYASC 168

Query: 401 GSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVL 580
           G + +A ++F++   + + +WNSMIIGLA +G+ +++  LFS++       +DVS+  ++
Sbjct: 169 GFLVEARKLFDEDEIEDVVSWNSMIIGLAKSGEIDDSWRLFSKMPTR----NDVSWNSMI 224

Query: 581 TSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEE 721
           +     G   EA + F  M +   +KP+      +++A G  G LE+
Sbjct: 225 SGFVRNGKWNEALELFSTMQEE-NVKPSEFTLVSLLNACGHLGALEQ 270


>gb|EXB44509.1| hypothetical protein L484_000760 [Morus notabilis]
            gi|587904202|gb|EXB92403.1| hypothetical protein
            L484_021387 [Morus notabilis]
          Length = 530

 Score =  306 bits (784), Expect = 8e-81
 Identities = 168/336 (50%), Positives = 220/336 (65%), Gaps = 10/336 (2%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            I MY NCGFL      FDE S  D VAWNSMIMGL+K G+V ESRRLFD+M  +++V+WN
Sbjct: 168  IHMYINCGFLSEARQLFDESSELDLVAWNSMIMGLSKCGEVGESRRLFDRMPLRNSVSWN 227

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMI GYVRNG+  EA +LF +MQ + I  +EF + SLL A G LGA+ QGEWI  Y  K+
Sbjct: 228  SMISGYVRNGKCVEALELFGKMQGEGIKASEFTMVSLLNASGRLGAIRQGEWIHEYITKN 287

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
             IE+N IV+TAI++ YCKCGSV+KA  VF  APK GLS WNSM++GLA+NG  EEA+ELF
Sbjct: 288  GIELNVIVVTAIIDMYCKCGSVNKALSVFKTAPKLGLSCWNSMVMGLAMNGCEEEALELF 347

Query: 524  SRLQLS-GSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALG 700
            SRL+ S   +PD VSF+ VLT+ NH GMV +AR YF +M     I+P+ +HYSCMVD LG
Sbjct: 348  SRLESSIDLRPDGVSFLAVLTACNHSGMVDKARDYFSLMRGKYNIEPSTRHYSCMVDVLG 407

Query: 701  RAGFLEEAEGPIANM---TQTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMY 871
            +AG LEEAE  I +M      ++ G        +  I   +   +          +A + 
Sbjct: 408  KAGHLEEAEKLILSMPINPDAIIWGSLLSACRKHGNIEMAQRALERVIELDPSESSAYVL 467

Query: 872  FCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
              + Y SS+ +++ +  R+  ++  + KEPGCSLIE
Sbjct: 468  MSNVYGSSSHYDEAVKQRINMKEKRIEKEPGCSLIE 503



 Score = 68.6 bits (166), Expect = 4e-09
 Identities = 55/212 (25%), Positives = 100/212 (47%), Gaps = 1/212 (0%)
 Frame = +2

Query: 95  SGQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQMQ-NQ*IHPTEFALAS 271
           +G ++ +  +F ++++ +   WN++I G+ R+   + A  LF  M     + P      S
Sbjct: 72  AGNINYALMVFSQIQNPNLFIWNTIIRGFSRSSTPQTAIFLFIDMLVGSPLEPQRLTYPS 131

Query: 272 LLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKG 451
           +  A   LG    G  +     K  ++ +  V   I+  Y  CG + +A Q+F+++ +  
Sbjct: 132 VFKAYAQLGLACFGAQLHGRVIKLGLDCDRFVRNTIIHMYINCGFLSEARQLFDESSELD 191

Query: 452 LSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFL 631
           L  WNSMI+GL+  G+  E+  LF R+ L  S    VS+  +++     G   EA + F 
Sbjct: 192 LVAWNSMIMGLSKCGEVGESRRLFDRMPLRNS----VSWNSMISGYVRNGKCVEALELFG 247

Query: 632 VMTKICKIKPTIKHYSCMVDALGRAGFLEEAE 727
            M     IK +      +++A GR G + + E
Sbjct: 248 KMQGE-GIKASEFTMVSLLNASGRLGAIRQGE 278


>ref|XP_004491336.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Cicer arietinum]
          Length = 536

 Score =  295 bits (754), Expect = 2e-77
 Identities = 160/337 (47%), Positives = 217/337 (64%), Gaps = 11/337 (3%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDSSF-DAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNW 160
            I+MYAN G L      FDE     D VA+NSMIMG AK G++DE+R+LFD+M ++++V W
Sbjct: 166  IYMYANSGLLSEAKRVFDEKLELGDVVAFNSMIMGFAKCGEIDEARKLFDEMFTRTSVTW 225

Query: 161  NSMIGGYVRNGRLKEAFDLFFQMQ-NQ*IHPTEFALASLLTACGGLGALEQGEWICAYNR 337
            NSMI GYVRNG+L EA +LF +MQ  + + P+EF + SLL AC  LGAL+ G+W+  Y +
Sbjct: 226  NSMISGYVRNGKLMEALELFHKMQLEERVEPSEFTMVSLLNACAHLGALQHGKWVHDYIK 285

Query: 338  KSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIE 517
            ++  E+N IVLTAI++ YCKCGSV+ A QVF+  P +GLS WNS+IIGLA+NG   EA E
Sbjct: 286  RNDFELNVIVLTAIIDMYCKCGSVENAIQVFDTYPGRGLSCWNSIIIGLAMNGHEREAFE 345

Query: 518  LFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDAL 697
             FS L+LS  KPD VSFIGVLT+  H G V +A+ YF +M    KI+P+IKHY+CMV+ L
Sbjct: 346  FFSELELSKFKPDSVSFIGVLTACKHLGAVDKAKDYFALMMNEYKIEPSIKHYTCMVEVL 405

Query: 698  GRAGFLEEAEGPIANM---TQTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAM 868
            G+A FLEEAE  I  M      ++ G        +  + R +   Q          +  +
Sbjct: 406  GQAAFLEEAEELIQGMPIKPDAIIWGSLLSSCRKHGNVQRAKRAAQRVYELNPSDASGYV 465

Query: 869  YFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
               + Y +S +FE+ +  R+L ++    KEPGCS IE
Sbjct: 466  LMSNVYAASNKFEEAVEQRVLMKENLTEKEPGCSSIE 502



 Score = 70.5 bits (171), Expect = 1e-09
 Identities = 50/202 (24%), Positives = 90/202 (44%), Gaps = 33/202 (16%)
 Frame = +2

Query: 95  SGQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASL 274
           SG ++ + +LF +M + +  +WN++I  + R+   + A  LF  M    I P      S+
Sbjct: 71  SGNINYAYKLFARMPNPNLYSWNTIIRAFSRSSTPQFAISLFVDMLYSQIQPQHLTYPSV 130

Query: 275 LTACGGLGALEQGEWICAYNRK-------------------------------SKIEVNS 361
             A   L A + G  +     K                                K+E+  
Sbjct: 131 FKAYAQLSAGDYGSQLHGMVVKLGLQRDQFIHNTIIYMYANSGLLSEAKRVFDEKLELGD 190

Query: 362 IV-LTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQL 538
           +V   +++  + KCG +D+A ++F++   +   TWNSMI G   NG+  EA+ELF ++QL
Sbjct: 191 VVAFNSMIMGFAKCGEIDEARKLFDEMFTRTSVTWNSMISGYVRNGKLMEALELFHKMQL 250

Query: 539 SGS-KPDDVSFIGVLTSGNHCG 601
               +P + + + +L +  H G
Sbjct: 251 EERVEPSEFTMVSLLNACAHLG 272


>ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Glycine max]
          Length = 534

 Score =  294 bits (753), Expect = 3e-77
 Identities = 167/349 (47%), Positives = 215/349 (61%), Gaps = 23/349 (6%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            I+MYAN G L      FDE    D VA NSMIMGLAK G+VD+SRRLFD M +++ V WN
Sbjct: 166  IYMYANSGLLSEARRVFDELVDLDVVACNSMIMGLAKCGEVDKSRRLFDNMPTRTRVTWN 225

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMI GYVRN RL EA +LF +MQ + + P+EF + SLL+AC  LGAL+ GEW+  Y ++ 
Sbjct: 226  SMISGYVRNKRLMEALELFRKMQGERVEPSEFTMVSLLSACAHLGALKHGEWVHDYVKRG 285

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
              E+N IVLTAI++ YCKCG + KA +VF  +P +GLS WNS+IIGLA+NG   +AIE F
Sbjct: 286  HFELNVIVLTAIIDMYCKCGVIVKAIEVFEASPTRGLSCWNSIIIGLALNGYERKAIEYF 345

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
            S+L+ S  KPD VSFIGVLT+  + G VG+AR YF +M    +I+P+IKHY+CMV+ LG+
Sbjct: 346  SKLEASDLKPDHVSFIGVLTACKYIGAVGKARDYFSLMMNKYEIEPSIKHYTCMVEVLGQ 405

Query: 704  AGFLEEAEGPIANMTQTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQV---KAAAMYF 874
            A  LEEAE         L+ G P        + + W     S+C     V   K AA   
Sbjct: 406  AALLEEAE--------QLIKGMPL-----KADFIIW-GSLLSSCRKHGNVEIAKRAAQRV 451

Query: 875  CHAYRSSA--------------RFEDTMNLRLLTQKTGLRKEPGCSLIE 979
            C    S A              +FE+ M  R+L ++    KEPGCS IE
Sbjct: 452  CELNPSDASGYLLMSNVQAASNQFEEAMEQRILMRERLAEKEPGCSSIE 500



 Score = 70.5 bits (171), Expect = 1e-09
 Identities = 51/202 (25%), Positives = 85/202 (42%), Gaps = 31/202 (15%)
 Frame = +2

Query: 89  AKSGQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALA 268
           + SG ++ +  LF  + S +   WN++I G+ R+     A  LF  M    + P      
Sbjct: 69  SSSGDINYAYLLFTTIPSPNLYCWNTIIRGFSRSSTPHLAISLFVDMLCSSVLPQRLTYP 128

Query: 269 SLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKY------------------- 391
           S+  A   LGA   G  +     K  +E +  +   I+  Y                   
Sbjct: 129 SVFKAYAQLGAGYDGAQLHGRVVKLGLEKDQFIQNTIIYMYANSGLLSEARRVFDELVDL 188

Query: 392 ------------CKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQ 535
                        KCG VDK+ ++F++ P +   TWNSMI G   N +  EA+ELF ++Q
Sbjct: 189 DVVACNSMIMGLAKCGEVDKSRRLFDNMPTRTRVTWNSMISGYVRNKRLMEALELFRKMQ 248

Query: 536 LSGSKPDDVSFIGVLTSGNHCG 601
               +P + + + +L++  H G
Sbjct: 249 GERVEPSEFTMVSLLSACAHLG 270


>ref|XP_007141857.1| hypothetical protein PHAVU_008G231600g [Phaseolus vulgaris]
            gi|561014990|gb|ESW13851.1| hypothetical protein
            PHAVU_008G231600g [Phaseolus vulgaris]
          Length = 525

 Score =  293 bits (749), Expect = 9e-77
 Identities = 162/346 (46%), Positives = 221/346 (63%), Gaps = 20/346 (5%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            ++MYAN G +      FDE    D VA NSMIMGLAK G+VD+SRRLFD M +++ V+WN
Sbjct: 164  LYMYANSGLMSEARRVFDEPLELDVVACNSMIMGLAKCGEVDKSRRLFDNMPTRTAVSWN 223

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMI GYVRNGRL E  +LF +MQ + + P+EF + SLL+AC  LGAL+ GEW+  Y ++ 
Sbjct: 224  SMISGYVRNGRLTEGLELFRKMQEEGVEPSEFTMVSLLSACAHLGALQHGEWVHDYIKRG 283

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
              ++N IVLTAI++ YCKCGS++KA +VF  +P +GL  WNS+IIGLA+NG   EAIE F
Sbjct: 284  NFKLNVIVLTAIIDMYCKCGSIEKAVEVFAASPTRGLPCWNSIIIGLALNGHEREAIEYF 343

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
            S+L+ S  KPD VSFIGVLT+  + G V EAR YF +M    +I+P+IKHY+C+V+ LG 
Sbjct: 344  SKLESSNIKPDCVSFIGVLTACKYLGAVREARDYFALMMDKYEIEPSIKHYTCLVEVLGH 403

Query: 704  AGFLEEAEGPIANMT--------QTLL-----HGHPCFQLV*NMEILRWRNKQQSNCSNW 844
            A  LEEAE  I  M+         +LL     HG        N+EI +   +        
Sbjct: 404  AALLEEAEEVIKGMSIEADFIIWGSLLSSCRKHG--------NVEIAK---RAAQRVFEL 452

Query: 845  NQVKAAA-MYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
            N  +A+  +   +   +S +FE+ +  R+L ++  + KEPGCS IE
Sbjct: 453  NPREASGYLLMSNVQAASNQFEEALEHRILMKERLVEKEPGCSSIE 498



 Score = 75.5 bits (184), Expect = 3e-11
 Identities = 58/264 (21%), Positives = 111/264 (42%), Gaps = 31/264 (11%)
 Frame = +2

Query: 89  AKSGQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALA 268
           + SG ++ +  +F  + + +   WN++I G+ R+   + A  LF  M    + P      
Sbjct: 67  SSSGDINYAYLVFTGIPNPNLYCWNTIIRGFSRSSTPQFAISLFVDMLYSAVEPQRLTYP 126

Query: 269 SLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKY------------------- 391
           S+  A   LGA   G  +     K  +E +  +   I+  Y                   
Sbjct: 127 SVFKAYAQLGAGHDGAQLHGRVVKLGLEKDQFISNTILYMYANSGLMSEARRVFDEPLEL 186

Query: 392 ------------CKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQ 535
                        KCG VDK+ ++F++ P +   +WNSMI G   NG+  E +ELF ++Q
Sbjct: 187 DVVACNSMIMGLAKCGEVDKSRRLFDNMPTRTAVSWNSMISGYVRNGRLTEGLELFRKMQ 246

Query: 536 LSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFL 715
             G +P + + + +L++  H G + +  ++     K    K  +   + ++D   + G +
Sbjct: 247 EEGVEPSEFTMVSLLSACAHLGAL-QHGEWVHDYIKRGNFKLNVIVLTAIIDMYCKCGSI 305

Query: 716 EEAEGPIANMTQTLLHGHPCFQLV 787
           E+A   +     +   G PC+  +
Sbjct: 306 EKA---VEVFAASPTRGLPCWNSI 326


>gb|EYU29595.1| hypothetical protein MIMGU_mgv1a025435mg [Mimulus guttatus]
          Length = 505

 Score =  285 bits (730), Expect = 2e-74
 Identities = 162/345 (46%), Positives = 213/345 (61%), Gaps = 19/345 (5%)
 Frame = +2

Query: 2    IFMYANCGF------LFDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            I MYA+CG       LFDED   D VAWNSM+MGLAK G+VDES RLF K+  ++ ++WN
Sbjct: 149  IHMYADCGLFGSARKLFDEDEDTDVVAWNSMVMGLAKCGEVDESWRLFCKIPCRNDISWN 208

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            +MI GYVRNG+  +A  LF +MQ + I P+EF L S+L AC  LGALEQG+WI  Y +KS
Sbjct: 209  TMISGYVRNGKWVDALSLFAEMQQRQIRPSEFTLVSMLNACAKLGALEQGKWIHRYIKKS 268

Query: 344  ---KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAI 514
                I+ N+IV+TAI++ YCKCG +  A +VF   P+K LS WNSMI+GLA NG  EEA 
Sbjct: 269  DINNIDRNTIVVTAIIDMYCKCGDIKTAREVFESTPQKALSGWNSMILGLATNGFEEEAF 328

Query: 515  ELFSRL-QLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVD 691
            +LF+ L Q S   PD VSFIGVLT+ NH   V +AR+YF VM +   I+PTIKHY C+VD
Sbjct: 329  QLFTELEQSSNLNPDSVSFIGVLTASNHSVRVDKAREYFKVMKETYGIEPTIKHYGCLVD 388

Query: 692  ALGRAGFLEEAEGPIANM---TQTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKA- 859
             LGRAG +E+A   I +M      ++ G             R R+   +  +  N + A 
Sbjct: 389  VLGRAGLIEQAAEVIKSMPMKPDAIIWGSLL------SACRRCRDVGVAELAARNLLLAG 442

Query: 860  -----AAMYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
                 A +   + Y +S  F+  +N R   +K  + K+PGCS IE
Sbjct: 443  PDETSAHVLMSNVYAASGDFKKAVNERTKMKKKKMEKQPGCSFIE 487



 Score = 66.2 bits (160), Expect = 2e-08
 Identities = 54/229 (23%), Positives = 103/229 (44%), Gaps = 4/229 (1%)
 Frame = +2

Query: 53  DAVAWNSMIMGLAKSG---QVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFF 223
           D +A + ++   A  G    +D +  +F  +E  +   WN++I G+ ++     A  LF 
Sbjct: 36  DTIAVSRILAFCAAPGPARDLDYAFSVFSHIEKPNLFTWNTIIRGFCQSSHPHVAISLFV 95

Query: 224 QM-QNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKC 400
            M  N  + P      S+  A   LG    G  +     K   E +  +  +I+  Y  C
Sbjct: 96  DMLTNSTLEPENLTYPSVFKAYTQLGLAGDGAQLHGRIIKLGFEHDPFIRNSIIHMYADC 155

Query: 401 GSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVL 580
           G    A ++F++     +  WNSM++GLA  G+ +E+  LF ++       +D+S+  ++
Sbjct: 156 GLFGSARKLFDEDEDTDVVAWNSMVMGLAKCGEVDESWRLFCKIPCR----NDISWNTMI 211

Query: 581 TSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAE 727
           +     G   +A   F  M +  +I+P+      M++A  + G LE+ +
Sbjct: 212 SGYVRNGKWVDALSLFAEMQQ-RQIRPSEFTLVSMLNACAKLGALEQGK 259


>ref|XP_004151347.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Cucumis sativus]
            gi|449530724|ref|XP_004172343.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Cucumis sativus]
          Length = 543

 Score =  279 bits (714), Expect = 1e-72
 Identities = 152/337 (45%), Positives = 212/337 (62%), Gaps = 11/337 (3%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            ++MYA  GFL      F+++  FD V+WNSMI+GLAK G++DESR+LFDKM  K+ ++WN
Sbjct: 166  LYMYATGGFLSEARRIFNQEMEFDVVSWNSMILGLAKCGEIDESRKLFDKMPVKNPISWN 225

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMIGGYVRNG  KEA  LF +MQ + I P+EF + SLL A   +GAL QG WI  Y +K+
Sbjct: 226  SMIGGYVRNGMFKEALKLFIKMQEERIQPSEFTMVSLLNASAQIGALRQGVWIHEYIKKN 285

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
             +++N+IV+TAI++ YCKCGS+  A QVF   P + LS+WNSMI GLA+NG  +EAI +F
Sbjct: 286  NLQLNAIVVTAIIDMYCKCGSIGNALQVFEKIPCRSLSSWNSMIFGLAVNGCEKEAILVF 345

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
              L+ S  KPD +SF+ VLT+ NH  MV E  ++F  M    +I+P+IKHY+ MVD + R
Sbjct: 346  KMLESSSLKPDCISFMAVLTACNHGAMVDEGMEFFSRMKNTYRIEPSIKHYNLMVDMISR 405

Query: 704  AGFLEEAEGPIANM---TQTLLHG--HPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAM 868
            AGFLEEAE  I  M      ++ G      ++  N E+ +   ++ +       +    M
Sbjct: 406  AGFLEEAEQFIKTMPIEKDAIIWGCLLSACRIYGNTEMAKRAAEKVNELDPEETMGYVLM 465

Query: 869  YFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
               HA+ ++  F   M  R+  +   + KEPG S IE
Sbjct: 466  ANIHAWGNN--FVGAMEKRVAMRMKKVEKEPGGSFIE 500



 Score = 67.8 bits (164), Expect = 6e-09
 Identities = 50/209 (23%), Positives = 103/209 (49%), Gaps = 1/209 (0%)
 Frame = +2

Query: 98  GQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQMQ-NQ*IHPTEFALASL 274
           G +D +  +F +M++ +  +WN++I G+ ++   + A  LF  M  +  + P      S+
Sbjct: 71  GNMDYAYLVFLQMQNPNLFSWNTVIRGFSQSSNPQIALYLFIDMLVSSQVEPQRLTYPSI 130

Query: 275 LTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGL 454
             A   LG    G  +     K  ++ +  +   I+  Y   G + +A ++FN   +  +
Sbjct: 131 FKAYSQLGLAHDGAQLHGRIIKLGLQFDPFIRNTILYMYATGGFLSEARRIFNQEMEFDV 190

Query: 455 STWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLV 634
            +WNSMI+GLA  G+ +E+ +LF ++ +     + +S+  ++      GM  EA K F+ 
Sbjct: 191 VSWNSMILGLAKCGEIDESRKLFDKMPVK----NPISWNSMIGGYVRNGMFKEALKLFIK 246

Query: 635 MTKICKIKPTIKHYSCMVDALGRAGFLEE 721
           M +  +I+P+      +++A  + G L +
Sbjct: 247 MQEE-RIQPSEFTMVSLLNASAQIGALRQ 274


>ref|XP_003617444.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355518779|gb|AET00403.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 542

 Score =  277 bits (708), Expect = 5e-72
 Identities = 153/340 (45%), Positives = 211/340 (62%), Gaps = 14/340 (4%)
 Frame = +2

Query: 2    IFMYANCGFLFDEDSSFDA----------VAWNSMIMGLAKSGQVDESRRLFDKMESKST 151
            I+MYAN G + +    FD           VA NSMIMG AK G++DESR LFD M ++++
Sbjct: 169  IYMYANGGLMSEARRVFDGKKLELYDHDVVAINSMIMGYAKCGEIDESRNLFDDMITRTS 228

Query: 152  VNWNSMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAY 331
            V+WNSMI GYVRNG+L EA +LF +MQ +    +EF + SLL AC  LGAL+ G+W+  Y
Sbjct: 229  VSWNSMISGYVRNGKLMEALELFNKMQVEGFEVSEFTMVSLLNACAHLGALQHGKWVHDY 288

Query: 332  NRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEA 511
             +++  E+N IV+TAI++ YCKCGSV+ A +VF   P++GLS WNS+IIGLA+NG   EA
Sbjct: 289  IKRNHFELNVIVVTAIIDMYCKCGSVENAVEVFETCPRRGLSCWNSIIIGLAMNGHEREA 348

Query: 512  IELFSRLQLSG-SKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMV 688
             E FS+L+ S   KPD VSFIGVLT+  H G + +AR YF +M    +I+P+IKHY+C+V
Sbjct: 349  FEFFSKLESSKLLKPDSVSFIGVLTACKHLGAINKARDYFELMMNKYEIEPSIKHYTCIV 408

Query: 689  DALGRAGFLEEAEGPIANM---TQTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKA 859
            D LG+AG LEEAE  I  M      ++ G        +  +   R   Q          +
Sbjct: 409  DVLGQAGLLEEAEELIKGMPLKPDAIIWGSLLSSCRKHRNVQIARRAAQRVYELNPSDAS 468

Query: 860  AAMYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
              +   + + +S +FE+ +  RLL ++    KEPGCS IE
Sbjct: 469  GYVLMSNVHAASNKFEEAIEQRLLMKENLTEKEPGCSSIE 508


>ref|NP_181820.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75206274|sp|Q9SJG6.1|PP200_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g42920, chloroplastic; Flags: Precursor
            gi|4512663|gb|AAD21717.1| hypothetical protein
            [Arabidopsis thaliana] gi|20197867|gb|AAM15291.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|110738441|dbj|BAF01146.1| hypothetical protein
            [Arabidopsis thaliana] gi|330255093|gb|AEC10187.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 559

 Score =  275 bits (702), Expect = 3e-71
 Identities = 153/339 (45%), Positives = 209/339 (61%), Gaps = 13/339 (3%)
 Frame = +2

Query: 2    IFMYANCGFLFDEDS------SFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            + MY  CG L +          FD VAWNSMIMG AK G +D+++ LFD+M  ++ V+WN
Sbjct: 168  LHMYVTCGCLIEAWRIFLGMIGFDVVAWNSMIMGFAKCGLIDQAQNLFDEMPQRNGVSWN 227

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMI G+VRNGR K+A D+F +MQ + + P  F + SLL AC  LGA EQG WI  Y  ++
Sbjct: 228  SMISGFVRNGRFKDALDMFREMQEKDVKPDGFTMVSLLNACAYLGASEQGRWIHEYIVRN 287

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
            + E+NSIV+TA+++ YCKCG +++   VF  APKK LS WNSMI+GLA NG  E A++LF
Sbjct: 288  RFELNSIVVTALIDMYCKCGCIEEGLNVFECAPKKQLSCWNSMILGLANNGFEERAMDLF 347

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
            S L+ SG +PD VSFIGVLT+  H G V  A ++F +M +   I+P+IKHY+ MV+ LG 
Sbjct: 348  SELERSGLEPDSVSFIGVLTACAHSGEVHRADEFFRLMKEKYMIEPSIKHYTLMVNVLGG 407

Query: 704  AGFLEEAEGPIANM---TQTLLHGH--PCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAM 868
            AG LEEAE  I NM     T++        + + N+E+     K+ + C           
Sbjct: 408  AGLLEEAEALIKNMPVEEDTVIWSSLLSACRKIGNVEMA----KRAAKCLKKLDPDETCG 463

Query: 869  Y--FCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
            Y    +AY S   FE+ +  RLL ++  + KE GCS IE
Sbjct: 464  YVLLSNAYASYGLFEEAVEQRLLMKERQMEKEVGCSSIE 502



 Score = 60.5 bits (145), Expect = 1e-06
 Identities = 49/235 (20%), Positives = 97/235 (41%), Gaps = 35/235 (14%)
 Frame = +2

Query: 122 LFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQM--QNQ*IHPTEFALASLLTACGGL 295
           +F ++  K+   WN++I G+ R+   + A  +F  M   +  + P      S+  A G L
Sbjct: 80  VFTRINHKNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKAYGRL 139

Query: 296 GALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMI 475
           G    G  +     K  +E +S +   ++  Y  CG + +A+++F       +  WNSMI
Sbjct: 140 GQARDGRQLHGMVIKEGLEDDSFIRNTMLHMYVTCGCLIEAWRIFLGMIGFDVVAWNSMI 199

Query: 476 IGLA-------------------------------INGQGEEAIELFSRLQLSGSKPDDV 562
           +G A                                NG+ ++A+++F  +Q    KPD  
Sbjct: 200 MGFAKCGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGF 259

Query: 563 SFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKH--YSCMVDALGRAGFLEE 721
           + + +L   N C  +G + +   +   I + +  +     + ++D   + G +EE
Sbjct: 260 TMVSLL---NACAYLGASEQGRWIHEYIVRNRFELNSIVVTALIDMYCKCGCIEE 311


>ref|XP_006411565.1| hypothetical protein EUTSA_v10017572mg [Eutrema salsugineum]
            gi|557112734|gb|ESQ53018.1| hypothetical protein
            EUTSA_v10017572mg [Eutrema salsugineum]
          Length = 546

 Score =  272 bits (695), Expect = 2e-70
 Identities = 146/348 (41%), Positives = 208/348 (59%), Gaps = 22/348 (6%)
 Frame = +2

Query: 2    IFMYANCGF------LFDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            + MYA CG       +F     FD VAWNSM+MGLA+ G ++++++LFD+M  ++ ++WN
Sbjct: 167  LHMYATCGCFVEAWRIFMAMKHFDVVAWNSMMMGLARYGLIEQAQKLFDEMPQRNEISWN 226

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMI G+V+NGR K+A ++F +MQ + + P  F + SLL AC  LGA EQG WI  Y  K+
Sbjct: 227  SMISGFVKNGRFKDALEMFRKMQERNVKPDGFTMVSLLNACAYLGASEQGRWIHEYIVKN 286

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
            + E+NSIV+TA+++ YCKCG +++  +VF  AP K LS WNSM++GLA NG  E A++LF
Sbjct: 287  RFELNSIVITALIDMYCKCGCIEEGLRVFESAPNKQLSCWNSMVLGLANNGYEERAMDLF 346

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
            S L+ S  +PD VSFIGVLT+  + G V EA ++F +M +   I+P+IKHY+CMV+ LG 
Sbjct: 347  SELESSDLEPDSVSFIGVLTACAYSGKVDEAGEFFRLMREKYLIEPSIKHYTCMVNVLGG 406

Query: 704  AGFLEEAEGPIANMTQTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMYFC-- 877
            AG LEEAE  I NM                 + + W +   +   N N   A     C  
Sbjct: 407  AGLLEEAEAMIKNMPM-------------EQDAIIWSSLLSACRKNGNVEMAERAAKCLK 453

Query: 878  --------------HAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
                          +AY S   FE+ +  R+L ++  + KE GCS IE
Sbjct: 454  KLDPDDTCGYVLMSNAYASYGLFEEAVEQRVLMKERQMEKEIGCSSIE 501



 Score = 60.8 bits (146), Expect = 8e-07
 Identities = 50/234 (21%), Positives = 96/234 (41%), Gaps = 34/234 (14%)
 Frame = +2

Query: 122 LFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFFQM-QNQ*IHPTEFALASLLTACGGLG 298
           LF ++  K+   WN++I G+ R+   + +  +F  M  +    P      S+  A   LG
Sbjct: 80  LFTRINHKNPFVWNTIIRGFSRSSFPEMSITIFIDMFSSASAKPQRLTYPSVFKAYASLG 139

Query: 299 ALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMII 478
               G  +     K  +E +S +   ++  Y  CG   +A+++F       +  WNSM++
Sbjct: 140 KARDGMQLHGMVIKEGLEDDSFIRNTMLHMYATCGCFVEAWRIFMAMKHFDVVAWNSMMM 199

Query: 479 GLA-------------------------------INGQGEEAIELFSRLQLSGSKPDDVS 565
           GLA                                NG+ ++A+E+F ++Q    KPD  +
Sbjct: 200 GLARYGLIEQAQKLFDEMPQRNEISWNSMISGFVKNGRFKDALEMFRKMQERNVKPDGFT 259

Query: 566 FIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKH--YSCMVDALGRAGFLEE 721
            + +L   N C  +G + +   +   I K +  +     + ++D   + G +EE
Sbjct: 260 MVSLL---NACAYLGASEQGRWIHEYIVKNRFELNSIVITALIDMYCKCGCIEE 310


>ref|XP_002880012.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297325851|gb|EFH56271.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 542

 Score =  268 bits (686), Expect = 2e-69
 Identities = 150/339 (44%), Positives = 213/339 (62%), Gaps = 13/339 (3%)
 Frame = +2

Query: 2    IFMYANCGFL------FDEDSSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWN 163
            + MY  CG L      F     FD VAWNS+IMGLAK G +D++++LFD+M  ++ V+WN
Sbjct: 168  LHMYVTCGCLVEAWRLFVGMMGFDVVAWNSIIMGLAKCGLIDQAQKLFDEMPQRNGVSWN 227

Query: 164  SMIGGYVRNGRLKEAFDLFFQMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKS 343
            SMI G+VRNGR K+A ++F +MQ + + P  F + SLL AC  LGA EQG WI  Y  ++
Sbjct: 228  SMISGFVRNGRFKDALEMFREMQERDVKPDGFTMVSLLNACAYLGASEQGRWIHKYIVRN 287

Query: 344  KIEVNSIVLTAIVEKYCKCGSVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELF 523
            + E+NSIV+TA+++ YCKCG  ++  +VF  AP K LS WNSMI+GLA NG  E A++LF
Sbjct: 288  RFELNSIVITALIDMYCKCGCFEEGLKVFECAPTKQLSCWNSMILGLANNGCEERAMDLF 347

Query: 524  SRLQLSGSKPDDVSFIGVLTSGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGR 703
              L+ +G +PD VSFIGVLT+  H G V +A ++F +M +   I+P+IKHY+CMV+ LG 
Sbjct: 348  LELERTGLEPDSVSFIGVLTACAHSGEVHKAGEFFRLMREKYMIEPSIKHYTCMVNVLGG 407

Query: 704  AGFLEEAEGPIANMT---QTLLHGH--PCFQLV*NMEILRWRNKQQSNC-SNWNQVKAAA 865
            AG L+EAE  I  M     T++        +   N+E+     K+ +NC  N +  +   
Sbjct: 408  AGLLDEAEALIKKMPVEGDTIIWSSLLAACRKNGNVEMA----KRAANCLKNLDPDETCG 463

Query: 866  -MYFCHAYRSSARFEDTMNLRLLTQKTGLRKEPGCSLIE 979
             +   +AY S   FE+ +  RLL ++  + KE GCS IE
Sbjct: 464  YVLMSNAYASYGLFEEAVEQRLLMKERQMEKEVGCSSIE 502


>ref|XP_006293939.1| hypothetical protein CARUB_v10022931mg [Capsella rubella]
            gi|565472276|ref|XP_006293940.1| hypothetical protein
            CARUB_v10022931mg [Capsella rubella]
            gi|482562647|gb|EOA26837.1| hypothetical protein
            CARUB_v10022931mg [Capsella rubella]
            gi|482562648|gb|EOA26838.1| hypothetical protein
            CARUB_v10022931mg [Capsella rubella]
          Length = 555

 Score =  260 bits (664), Expect = 7e-67
 Identities = 142/322 (44%), Positives = 201/322 (62%), Gaps = 10/322 (3%)
 Frame = +2

Query: 44   SSFDAVAWNSMIMGLAKSGQVDESRRLFDKMESKSTVNWNSMIGGYVRNGRLKEAFDLFF 223
            + FD VAWNSMIMGLAK G + ++++LFD+M  ++ V+WNSMI G+VRNGR K+A ++F 
Sbjct: 183  TDFDVVAWNSMIMGLAKCGLISQAQQLFDEMPHRNEVSWNSMISGFVRNGRFKDALEMFR 242

Query: 224  QMQNQ*IHPTEFALASLLTACGGLGALEQGEWICAYNRKSKIEVNSIVLTAIVEKYCKCG 403
            +MQ + + P  F + SLL AC  LGA EQG WI  Y  +++ E+NSIV+TA++E YCKCG
Sbjct: 243  EMQERNVKPDGFTMVSLLNACAYLGANEQGRWIHEYIARNRFELNSIVITALIEMYCKCG 302

Query: 404  SVDKAFQVFNDAPKKGLSTWNSMIIGLAINGQGEEAIELFSRLQLSGSKPDDVSFIGVLT 583
             +++  +VF  APKK LS WNSMI+GLA NG  E A++LF  L+  G +PD VSFIGVLT
Sbjct: 303  CIEEGLKVFECAPKKQLSCWNSMILGLANNGCEERAMDLFLELERFGLEPDSVSFIGVLT 362

Query: 584  SGNHCGMVGEARKYFLVMTKICKIKPTIKHYSCMVDALGRAGFLEEAEGPIANMT----- 748
            +  + G V +A  +F +M +   ++P+IKHY+CMV+ LG AG L+EAE  I  M      
Sbjct: 363  ACAYSGEVHKAGGFFRLMREKYMVEPSIKHYTCMVNVLGGAGLLDEAESLIKKMPVEEDA 422

Query: 749  ---QTLLHGHPCFQLV*NMEILRWRNKQQSNCSNWNQVKAAAMY--FCHAYRSSARFEDT 913
                +LL     +    N+E+     K+ + C           Y    +AY     FE+ 
Sbjct: 423  IIWSSLLAACRKYS---NVEMA----KRAAKCLKKLDPDETCGYVLMSNAYAHYGLFEEA 475

Query: 914  MNLRLLTQKTGLRKEPGCSLIE 979
            +  R+L ++  + KE GCS IE
Sbjct: 476  VEQRILMKERKMEKEVGCSSIE 497

