BLASTX nr result

ID: Mentha29_contig00010400 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00010400
         (1214 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU29595.1| hypothetical protein MIMGU_mgv1a025435mg [Mimulus...   590   e-166
ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containi...   511   e-142
ref|XP_004250888.1| PREDICTED: pentatricopeptide repeat-containi...   508   e-141
ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containi...   506   e-141
ref|XP_004305832.1| PREDICTED: pentatricopeptide repeat-containi...   503   e-140
ref|XP_002306741.1| pentatricopeptide repeat-containing family p...   501   e-139
ref|XP_007048864.1| Pentatricopeptide repeat (PPR-like) superfam...   499   e-139
gb|EXB44509.1| hypothetical protein L484_000760 [Morus notabilis...   498   e-138
ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citr...   496   e-138
ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containi...   494   e-137
ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containi...   482   e-133
ref|XP_007141857.1| hypothetical protein PHAVU_008G231600g [Phas...   476   e-132
ref|XP_003617444.1| Pentatricopeptide repeat-containing protein ...   475   e-131
ref|XP_002534070.1| pentatricopeptide repeat-containing protein,...   475   e-131
ref|XP_004151347.1| PREDICTED: pentatricopeptide repeat-containi...   471   e-130
ref|XP_006411565.1| hypothetical protein EUTSA_v10017572mg [Eutr...   471   e-130
ref|XP_004491336.1| PREDICTED: pentatricopeptide repeat-containi...   469   e-130
ref|NP_181820.1| pentatricopeptide repeat-containing protein [Ar...   461   e-127
ref|XP_002880012.1| pentatricopeptide repeat-containing protein ...   458   e-126
ref|XP_006293939.1| hypothetical protein CARUB_v10022931mg [Caps...   452   e-124

>gb|EYU29595.1| hypothetical protein MIMGU_mgv1a025435mg [Mimulus guttatus]
          Length = 505

 Score =  590 bits (1521), Expect = e-166
 Identities = 289/407 (71%), Positives = 341/407 (83%), Gaps = 4/407 (0%)
 Frame = +1

Query: 4    ATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGKL 183
            A   A DL YA S+F  I++PNLFTWNTIIR F  SS PHVAISLF++MLT S + P  L
Sbjct: 49   APGPARDLDYAFSVFSHIEKPNLFTWNTIIRGFCQSSHPHVAISLFVDMLTNSTLEPENL 108

Query: 184  TYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDEN 363
            TYPSVFKAYTQLGLA DGAQLHGRI+KLG E DPFIRNSIIHMYA CGL G+A  LFDE+
Sbjct: 109  TYPSVFKAYTQLGLAGDGAQLHGRIIKLGFEHDPFIRNSIIHMYADCGLFGSARKLFDED 168

Query: 364  RDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFH 543
             D DVVAWNSM+MG AKCGE++ESWRLFCKIP RN+ISWNTMISGYVRNG+W++AL+LF 
Sbjct: 169  EDTDVVAWNSMVMGLAKCGEVDESWRLFCKIPCRNDISWNTMISGYVRNGKWVDALSLFA 228

Query: 544  EMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKR---NDFELNVIVITAIIDMYC 714
            EMQ++QIRP+ FTLVS+LNAC KLGALEQGKWIH YIK+   N+ + N IV+TAIIDMYC
Sbjct: 229  EMQQRQIRPSEFTLVSMLNACAKLGALEQGKWIHRYIKKSDINNIDRNTIVVTAIIDMYC 288

Query: 715  KCGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLE-SSNLRPDAVSFV 891
            KCG+I  AREVF+++P+K L+ WNSM+LGLA NG  E+ F+LF++LE SSNL PD+VSF+
Sbjct: 289  KCGDIKTAREVFESTPQKALSGWNSMILGLATNGFEEEAFQLFTELEQSSNLNPDSVSFI 348

Query: 892  AVLTASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPM 1071
             VLTASNHSVRVD AR+YF++MKETYGIEP I+HYGC+VDVLGRAGLIE+AAE +KSMPM
Sbjct: 349  GVLTASNHSVRVDKAREYFKVMKETYGIEPTIKHYGCLVDVLGRAGLIEQAAEVIKSMPM 408

Query: 1072 EADDVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            + D +IWGSLL+ C +    +V VAE AARNL L   ++TSAHVLMS
Sbjct: 409  KPDAIIWGSLLSACRRCR--DVGVAELAARNLLLAGPDETSAHVLMS 453


>ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic [Vitis vinifera]
            gi|302143555|emb|CBI22116.3| unnamed protein product
            [Vitis vinifera]
          Length = 533

 Score =  511 bits (1315), Expect = e-142
 Identities = 249/404 (61%), Positives = 318/404 (78%)
 Frame = +1

Query: 1    CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGK 180
            CATS   D++YA  +F +I  PNLF+WNTIIR FS SS PH AISLF++ML  S V P +
Sbjct: 66   CATSPGGDINYAYLVFTQIHSPNLFSWNTIIRGFSQSSTPHHAISLFIDMLIVSSVQPHR 125

Query: 181  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 360
            LTYPSVFKAY QLGLA  GAQLHGR++KLGL+ DPFIRN+II+MYA+CG L      F E
Sbjct: 126  LTYPSVFKAYAQLGLAHYGAQLHGRVIKLGLQFDPFIRNTIIYMYANCGFLSEMWKAFYE 185

Query: 361  NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 540
              D D+VAWNSMIMG AKCGE++ES +LF ++PLRN +SWN+MISGYVRNGR  EAL+LF
Sbjct: 186  RMDFDIVAWNSMIMGLAKCGEVDESRKLFDEMPLRNTVSWNSMISGYVRNGRLREALDLF 245

Query: 541  HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 720
             +MQE++I+P+ FT+VS+LNA  +LGAL+QG+WIHDYI++N+FELNVIV  +IIDMYCKC
Sbjct: 246  GQMQEERIKPSEFTMVSLLNASARLGALKQGEWIHDYIRKNNFELNVIVTASIIDMYCKC 305

Query: 721  GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVL 900
            G IG A +VF+ +P KGL+ WN+M+LGLA NG   +  +LFS+LE SNLRPD V+FV VL
Sbjct: 306  GSIGEAFQVFEMAPLKGLSSWNTMILGLAMNGCENEAIQLFSRLECSNLRPDDVTFVGVL 365

Query: 901  TASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEAD 1080
            TA N+S  VD A++YF LM +TY IEP I+HY C+VD LGRAGL+EEA E +++MP+  D
Sbjct: 366  TACNYSGLVDKAKEYFSLMSKTYKIEPSIKHYSCMVDTLGRAGLLEEAEELIRNMPVNPD 425

Query: 1081 DVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
             +IW SLL+ C +KH  NVE+A+ AA+++  LD  D+  +VL+S
Sbjct: 426  AIIWSSLLSAC-RKHG-NVELAKRAAKHIVDLDGNDSCGYVLLS 467


>ref|XP_004250888.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Solanum lycopersicum]
          Length = 522

 Score =  508 bits (1308), Expect = e-141
 Identities = 247/397 (62%), Positives = 310/397 (78%)
 Frame = +1

Query: 22   DLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGKLTYPSVF 201
            D++YA  +F  I+ PN FTWNTIIR FS SS P  AI LF+EML  SQV P  LTYPSVF
Sbjct: 68   DINYANLVFTHIENPNPFTWNTIIRGFSESSTPQYAIHLFIEMLNNSQVQPHLLTYPSVF 127

Query: 202  KAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENRDLDVV 381
            KAY + G+A +GAQLHGRI+KLGLE D FIRN++++MYASCG L  A  LFDE+   DVV
Sbjct: 128  KAYARGGIAKNGAQLHGRIMKLGLEFDTFIRNTLLYMYASCGFLVEARKLFDEDEIEDVV 187

Query: 382  AWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQEKQ 561
            +WNSMI+G AK GEI++SWRLF K+P RN++SWN+MISG+VRNG+W EAL LF  MQE+ 
Sbjct: 188  SWNSMIIGLAKSGEIDDSWRLFSKMPTRNDVSWNSMISGFVRNGKWNEALELFSTMQEEN 247

Query: 562  IRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGMAR 741
            ++P+ FTLVS+LNACG LGALEQG WI+ Y+K+N+ ELNVIV+TAIIDMYCKC  + MA 
Sbjct: 248  VKPSEFTLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCKCANVEMAW 307

Query: 742  EVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNHSV 921
             VF +S  KGL+ WNSM+LGLA NG  +   +LF++L+ S L+PD+VSF+ VLTA NHS 
Sbjct: 308  HVFVSSSNKGLSSWNSMILGLATNGFEDDAIKLFARLQCSILKPDSVSFIGVLTACNHSG 367

Query: 922  RVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEADDVIWGSL 1101
             V+ A+ YF+LMK  YGIEP I+HYGC+VD+LGRAGL+EEA E ++SM ME D VIWGSL
Sbjct: 368  LVEKAKDYFQLMKMEYGIEPSIKHYGCMVDILGRAGLVEEAEEVIRSMKMEPDAVIWGSL 427

Query: 1102 LANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            L+ C + H  NVE+A W+A NL  LD  ++S +VLM+
Sbjct: 428  LSAC-RSHG-NVELARWSAENLLELDPNESSGYVLMA 462



 Score = 88.2 bits (217), Expect = 6e-15
 Identities = 63/247 (25%), Positives = 123/247 (49%), Gaps = 4/247 (1%)
 Frame = +1

Query: 418  GEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEM-QEKQIRPTHFTLVSI 594
            G+I  +  +F  I   N  +WNT+I G+  +     A++LF EM    Q++P   T  S+
Sbjct: 67   GDINYANLVFTHIENPNPFTWNTIIRGFSESSTPQYAIHLFIEMLNNSQVQPHLLTYPSV 126

Query: 595  LNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGMAREVFKTSPRKGL 774
              A  + G  + G  +H  I +   E +  +   ++ MY  CG +  AR++F     + +
Sbjct: 127  FKAYARGGIAKNGAQLHGRIMKLGLEFDTFIRNTLLYMYASCGFLVEARKLFDEDEIEDV 186

Query: 775  ACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNHSVRVDDARKYFRL 954
              WNSM++GLA +G  +  + LFSK+ + N     VS+ ++++    + + ++A + F  
Sbjct: 187  VSWNSMIIGLAKSGEIDDSWRLFSKMPTRN----DVSWNSMISGFVRNGKWNEALELFST 242

Query: 955  MKETYGIEPKIEHYGCVVDVLGRAGLIEEA---AEFVKSMPMEADDVIWGSLLANCSKKH 1125
            M+E   ++P       +++  G  G +E+     ++VK   +E + ++  +++    K  
Sbjct: 243  MQEE-NVKPSEFTLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCK-- 299

Query: 1126 PCNVEVA 1146
              NVE+A
Sbjct: 300  CANVEMA 306


>ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Solanum tuberosum]
          Length = 522

 Score =  506 bits (1304), Expect = e-141
 Identities = 248/397 (62%), Positives = 309/397 (77%)
 Frame = +1

Query: 22   DLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGKLTYPSVF 201
            D++YA  +F  I+ PNLFTWNTIIR FS SS P  AI LF+EML  SQV P  LTYPSVF
Sbjct: 68   DINYANLVFTHIENPNLFTWNTIIRGFSESSTPQYAIHLFIEMLNNSQVQPHLLTYPSVF 127

Query: 202  KAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENRDLDVV 381
            KAY + GL  +GAQLHGRI+KLGLE D FIRN++++MYASCG L  A  LFDE+   DVV
Sbjct: 128  KAYARGGLVKNGAQLHGRIIKLGLEFDTFIRNTMLYMYASCGFLVEARKLFDEDEIEDVV 187

Query: 382  AWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQEKQ 561
            +WNSMIMG AK GEI++SWRLF K+  RN++SWN+MISG+VRNG+W EAL LF  MQE+ 
Sbjct: 188  SWNSMIMGLAKSGEIDDSWRLFSKMSTRNDVSWNSMISGFVRNGKWNEALELFSTMQEEN 247

Query: 562  IRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGMAR 741
            I+P+ FTLVS+LNACG LGALEQG WI+ Y+K+N+ ELNVIV+TAIIDMYCKCG + MA 
Sbjct: 248  IKPSEFTLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCKCGNVEMAW 307

Query: 742  EVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNHSV 921
             VF +   KGL+ WNSM+LGLA NG  +   +LF++L+ S L+PD+VSF+ VLTA NHS 
Sbjct: 308  HVFISISNKGLSSWNSMILGLATNGFEDDAIKLFARLQCSILKPDSVSFIGVLTACNHSG 367

Query: 922  RVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEADDVIWGSL 1101
             VD A+ YF+LMK+ YGIEP I+HYGC+VD+LGRAGL+EEA E ++SM ME D VIW SL
Sbjct: 368  LVDKAKDYFQLMKKEYGIEPSIKHYGCMVDILGRAGLVEEADEVIRSMKMEPDAVIWCSL 427

Query: 1102 LANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            L+ C + H  N+E+A W+A NL  LD  ++S +VLM+
Sbjct: 428  LSAC-RSHG-NMELARWSAENLLELDPNESSGYVLMA 462



 Score = 87.4 bits (215), Expect = 1e-14
 Identities = 65/248 (26%), Positives = 125/248 (50%), Gaps = 5/248 (2%)
 Frame = +1

Query: 418  GEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEM-QEKQIRPTHFTLVSI 594
            G+I  +  +F  I   N  +WNT+I G+  +     A++LF EM    Q++P   T  S+
Sbjct: 67   GDINYANLVFTHIENPNLFTWNTIIRGFSESSTPQYAIHLFIEMLNNSQVQPHLLTYPSV 126

Query: 595  LNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGMAREVFKTSPRKGL 774
              A  + G ++ G  +H  I +   E +  +   ++ MY  CG +  AR++F     + +
Sbjct: 127  FKAYARGGLVKNGAQLHGRIIKLGLEFDTFIRNTMLYMYASCGFLVEARKLFDEDEIEDV 186

Query: 775  ACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNHSVRVDDARKYFRL 954
              WNSM++GLA +G  +  + LFSK+ + N     VS+ ++++    + + ++A + F  
Sbjct: 187  VSWNSMIMGLAKSGEIDDSWRLFSKMSTRN----DVSWNSMISGFVRNGKWNEALELFST 242

Query: 955  MKETYGIEPKIEHYGCVVDVLGRAGLIEEA---AEFVKSMPMEADDVIWGSLLANCSKKH 1125
            M+E   I+P       +++  G  G +E+     ++VK   +E + ++  +++    K  
Sbjct: 243  MQEE-NIKPSEFTLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCK-- 299

Query: 1126 PC-NVEVA 1146
             C NVE+A
Sbjct: 300  -CGNVEMA 306


>ref|XP_004305832.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 550

 Score =  503 bits (1296), Expect = e-140
 Identities = 245/404 (60%), Positives = 317/404 (78%)
 Frame = +1

Query: 1    CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGK 180
            CA S A D++YA  +FR I  PNLF WNTIIR FS+SS+P  AISLF++ML TS V P +
Sbjct: 63   CA-SPAGDINYAYMVFRHIHNPNLFIWNTIIRGFSNSSNPEAAISLFIDMLVTSTVQPQR 121

Query: 181  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 360
            LTYPSVFKAY QLGLA DGAQLHGR+VKLGLESD F+RN+IIHMY++CGLL  A  +FDE
Sbjct: 122  LTYPSVFKAYAQLGLAHDGAQLHGRVVKLGLESDQFVRNTIIHMYSNCGLLSEARRVFDE 181

Query: 361  NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 540
            + + D+VAWNSMIMG +KCGE+ ES RLF K+P RN ISWN+MI G VRNG + EAL+LF
Sbjct: 182  DLEFDIVAWNSMIMGLSKCGEVGESRRLFDKMPQRNSISWNSMIGGSVRNGMYTEALDLF 241

Query: 541  HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 720
             EMQ+++I+P+ FT+VS+LNA  +LGA+ QG+WIH+YI++N  +LN IV+TAII+MY KC
Sbjct: 242  GEMQKQKIKPSEFTMVSLLNASAQLGAIRQGEWIHEYIRKNHIQLNPIVVTAIINMYSKC 301

Query: 721  GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVL 900
            G I  A  VF+ +PR GL+CWNS+++GLA NG  E+  ELFS+L+SS+  PD VSF+ VL
Sbjct: 302  GSIEKAVHVFEAAPRTGLSCWNSIIMGLATNGCEEEAIELFSRLKSSSFVPDDVSFLGVL 361

Query: 901  TASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEAD 1080
            TA +HS  V+ ARKYF +M+ETY I P I+HY C+VDVLGRAGL+EEA + +  MP++AD
Sbjct: 362  TACSHSGMVEKARKYFSVMRETYRIAPSIKHYSCMVDVLGRAGLLEEAEKLIDGMPLKAD 421

Query: 1081 DVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
             +IWGSLL++C K    ++E+A+ AA+++  LD  D   +VLMS
Sbjct: 422  AIIWGSLLSSCRKHR--DIEMAKRAAKHVIELDPSDCCGYVLMS 463


>ref|XP_002306741.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222856190|gb|EEE93737.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 509

 Score =  501 bits (1290), Expect = e-139
 Identities = 245/403 (60%), Positives = 313/403 (77%), Gaps = 1/403 (0%)
 Frame = +1

Query: 7    TSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVL-PGKL 183
            TS A D++YA  +F +I  PNLF WNTIIR FS SS PH AISLF++M+ TS    P +L
Sbjct: 39   TSPAGDINYAYLVFTQIRNPNLFVWNTIIRGFSQSSTPHNAISLFIDMMFTSPTTQPQRL 98

Query: 184  TYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDEN 363
            TYPSVFKAY QLGLA +GAQLHGR++KLGLE+D FI+N+I++MY +CG LG A  +FD  
Sbjct: 99   TYPSVFKAYAQLGLAHEGAQLHGRVIKLGLENDQFIQNTILNMYVNCGFLGEAQRIFDGA 158

Query: 364  RDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFH 543
               DVV WN+MI+G AKCGEI++S RLF K+ LRN +SWN+MISGYVR GR+ EA+ LF 
Sbjct: 159  TGFDVVTWNTMIIGLAKCGEIDKSRRLFDKMLLRNTVSWNSMISGYVRKGRFFEAMELFS 218

Query: 544  EMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCG 723
             MQE+ I+P+ FT+VS+LNAC  LGAL QG+WIHDYI +N+F LN IVITAIIDMY KCG
Sbjct: 219  RMQEEGIKPSEFTMVSLLNACACLGALRQGEWIHDYIVKNNFALNSIVITAIIDMYSKCG 278

Query: 724  EIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLT 903
             I  A +VFK++P+KGL+CWNS++LGLA +G   +   LFSKLESSNL+PD VSF+ VLT
Sbjct: 279  SIDKALQVFKSAPKKGLSCWNSLILGLAMSGRGNEAVRLFSKLESSNLKPDHVSFIGVLT 338

Query: 904  ASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEADD 1083
            A NH+  VD A+ YF LM ETY IEP I+HY C+VDVLGRAGL+EEA E +KSMP+  D 
Sbjct: 339  ACNHAGMVDRAKDYFLLMSETYKIEPSIKHYSCMVDVLGRAGLLEEAEELIKSMPVNPDA 398

Query: 1084 VIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            +IWGSLL++C  +   N+E+A+ AA+ ++ LD  ++S+ +L+S
Sbjct: 399  IIWGSLLSSC--REYGNIEMAKQAAKRVNELDPNESSSFILLS 439


>ref|XP_007048864.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao] gi|508701125|gb|EOX93021.1| Pentatricopeptide
            repeat (PPR-like) superfamily protein [Theobroma cacao]
          Length = 538

 Score =  499 bits (1286), Expect = e-139
 Identities = 247/402 (61%), Positives = 312/402 (77%), Gaps = 1/402 (0%)
 Frame = +1

Query: 10   SAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGKLTY 189
            S A D++YA  +F +I  PNLFTWNTIIR FS SS+P +AISLF++ML  S + P +LTY
Sbjct: 66   SPAGDMNYAYLVFTQIKNPNLFTWNTIIRGFSQSSNPQIAISLFIDMLVGSSIQPERLTY 125

Query: 190  PSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENR- 366
            PSVFKAY QLGLA DG QLHGR++KLGL+ D FIRN+II+MYA+CGLL  A  +FDE   
Sbjct: 126  PSVFKAYAQLGLACDGRQLHGRVIKLGLDYDQFIRNTIIYMYANCGLLSEAWRMFDEEHM 185

Query: 367  DLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHE 546
            +LD+VAWNSMI+G AKCGE++ES RLF K+  RN +SWN+MISGYVRNGR++EAL LF E
Sbjct: 186  ELDIVAWNSMIIGLAKCGEVDESRRLFNKMVSRNTVSWNSMISGYVRNGRFLEALELFQE 245

Query: 547  MQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGE 726
            MQE+ IRP+ FT+VS+LNAC  LGA+ QGKWIHDYI + +FELN IV+TAIIDMYCKCG 
Sbjct: 246  MQEEHIRPSEFTMVSLLNACACLGAITQGKWIHDYILKQNFELNGIVVTAIIDMYCKCGN 305

Query: 727  IGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTA 906
               A +VF TSP++GL+CWNSM+LGLA NG   +  +LFSKLES +L+PD V+F+ VL A
Sbjct: 306  AEKALQVFTTSPKEGLSCWNSMILGLATNGCENEARQLFSKLESLSLKPDHVTFIGVLMA 365

Query: 907  SNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEADDV 1086
             N +  VD A+ YF LM E Y I+P I+HY C+VDVLG AGL+EEA + ++SMP+  D +
Sbjct: 366  CNSAGMVDKAKYYFSLMTEKYKIKPTIKHYSCMVDVLGNAGLLEEAEQLIRSMPVNEDAI 425

Query: 1087 IWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            IWGSLL+ C +KH  NV +A+ AA+ +  LD  + S +VLMS
Sbjct: 426  IWGSLLSAC-RKHG-NVGMAKRAAKLVIELDPAERSGYVLMS 465


>gb|EXB44509.1| hypothetical protein L484_000760 [Morus notabilis]
            gi|587904202|gb|EXB92403.1| hypothetical protein
            L484_021387 [Morus notabilis]
          Length = 530

 Score =  498 bits (1282), Expect = e-138
 Identities = 247/405 (60%), Positives = 312/405 (77%), Gaps = 1/405 (0%)
 Frame = +1

Query: 1    CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGK 180
            CA S A +++YAL +F +I  PNLF WNTIIR FS SS P  AI LF++ML  S + P +
Sbjct: 68   CA-SPAGNINYALMVFSQIQNPNLFIWNTIIRGFSRSSTPQTAIFLFIDMLVGSPLEPQR 126

Query: 181  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 360
            LTYPSVFKAY QLGLA  GAQLHGR++KLGL+ D F+RN+IIHMY +CG L  A  LFDE
Sbjct: 127  LTYPSVFKAYAQLGLACFGAQLHGRVIKLGLDCDRFVRNTIIHMYINCGFLSEARQLFDE 186

Query: 361  NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 540
            + +LD+VAWNSMIMG +KCGE+ ES RLF ++PLRN +SWN+MISGYVRNG+ +EAL LF
Sbjct: 187  SSELDLVAWNSMIMGLSKCGEVGESRRLFDRMPLRNSVSWNSMISGYVRNGKCVEALELF 246

Query: 541  HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 720
             +MQ + I+ + FT+VS+LNA G+LGA+ QG+WIH+YI +N  ELNVIV+TAIIDMYCKC
Sbjct: 247  GKMQGEGIKASEFTMVSLLNASGRLGAIRQGEWIHEYITKNGIELNVIVVTAIIDMYCKC 306

Query: 721  GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESS-NLRPDAVSFVAV 897
            G +  A  VFKT+P+ GL+CWNSM++GLA NG  E+  ELFS+LESS +LRPD VSF+AV
Sbjct: 307  GSVNKALSVFKTAPKLGLSCWNSMVMGLAMNGCEEEALELFSRLESSIDLRPDGVSFLAV 366

Query: 898  LTASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEA 1077
            LTA NHS  VD AR YF LM+  Y IEP   HY C+VDVLG+AG +EEA + + SMP+  
Sbjct: 367  LTACNHSGMVDKARDYFSLMRGKYNIEPSTRHYSCMVDVLGKAGHLEEAEKLILSMPINP 426

Query: 1078 DDVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            D +IWGSLL+ C +KH  N+E+A+ A   +  LD  ++SA+VLMS
Sbjct: 427  DAIIWGSLLSAC-RKHG-NIEMAQRALERVIELDPSESSAYVLMS 469


>ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citrus clementina]
            gi|557539373|gb|ESR50417.1| hypothetical protein
            CICLE_v10031197mg [Citrus clementina]
          Length = 534

 Score =  496 bits (1277), Expect = e-138
 Identities = 244/403 (60%), Positives = 312/403 (77%), Gaps = 1/403 (0%)
 Frame = +1

Query: 7    TSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGKLT 186
            TS A D++YA  +F +I +PNLF WNTIIR FS SS P  AI LF++ML TS + P +LT
Sbjct: 63   TSPAGDINYAYLVFTQIKKPNLFIWNTIIRGFSQSSTPRNAILLFIDMLVTSPIQPQRLT 122

Query: 187  YPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE-N 363
            YPS+FKAY QLGLA DGAQLHGR+VK GLE D FI N+II+MYA+CG L  A  +FDE +
Sbjct: 123  YPSLFKAYAQLGLARDGAQLHGRVVKQGLEFDQFIHNTIIYMYANCGFLSEARLIFDEVD 182

Query: 364  RDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFH 543
             + DVVAWNSMI+G AKCGEI+ES RLF K+  RN +SWN+MISGYVRN ++ EAL LF 
Sbjct: 183  TEFDVVAWNSMIIGLAKCGEIDESRRLFDKMVSRNTVSWNSMISGYVRNVKFKEALELFR 242

Query: 544  EMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCG 723
            EMQE+ I+P+ FT+VS+LNAC KLGA+ QG+WIH+++  N FELN IV+TAIIDMYCKCG
Sbjct: 243  EMQEQNIKPSEFTMVSLLNACAKLGAIRQGEWIHNFLVTNCFELNTIVVTAIIDMYCKCG 302

Query: 724  EIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLT 903
                A +VF T P+KGL+CWNSM+ GLA NG+  +  +LFS L+SSNL+PD +SF+AVLT
Sbjct: 303  CPERALQVFNTVPKKGLSCWNSMVFGLAMNGYENEAIKLFSGLQSSNLKPDYISFIAVLT 362

Query: 904  ASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEADD 1083
            A NHS +V+ A+ YF LM ETY I+P I+HY C+VD LGRAGL+EEA + ++SMP + D 
Sbjct: 363  ACNHSGKVNQAKDYFTLMTETYKIKPSIKHYSCMVDALGRAGLLEEAEKLIRSMPSDPDA 422

Query: 1084 VIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            +IWGSLL+ C +KH  N+E+A+ AA+ +  LD  ++  +VLMS
Sbjct: 423  IIWGSLLSAC-RKHG-NIEMAKQAAKQIIELDKNESCGYVLMS 463


>ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Citrus sinensis]
          Length = 534

 Score =  494 bits (1271), Expect = e-137
 Identities = 244/403 (60%), Positives = 310/403 (76%), Gaps = 1/403 (0%)
 Frame = +1

Query: 7    TSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGKLT 186
            TS A D++YA  +F +I +PNLF WNTIIR FS SS P  AI LF++ML TS + P +LT
Sbjct: 63   TSPAGDINYAYLVFTQIKKPNLFIWNTIIRGFSQSSTPRNAILLFIDMLVTSPIQPQRLT 122

Query: 187  YPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE-N 363
            YPS+FKAY QLGLA DGAQLHGR+VK GLE D FI N+II+MYA+CG L  A  +FDE +
Sbjct: 123  YPSLFKAYAQLGLARDGAQLHGRVVKQGLEFDQFIHNTIIYMYANCGFLSEARLMFDEVD 182

Query: 364  RDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFH 543
             + DVVAWNSMI+G AKCGEI+ES RLF K+  RN +SWN+MISGYVRN ++ EAL LF 
Sbjct: 183  TEFDVVAWNSMIIGLAKCGEIDESRRLFDKMVSRNTVSWNSMISGYVRNVKFKEALELFR 242

Query: 544  EMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCG 723
            EMQE+ I+P+ FT+VS+LNAC KLGA+ QG+WIH+++  N FELN IV+TAIIDMYCKCG
Sbjct: 243  EMQEQNIKPSEFTMVSLLNACAKLGAIRQGEWIHNFLVTNCFELNTIVVTAIIDMYCKCG 302

Query: 724  EIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLT 903
                A +VF T P+KGL+CWNSM+ GLA NG+  +  +LFS L+SSNL PD  SF+AVLT
Sbjct: 303  CPERALQVFNTVPKKGLSCWNSMVFGLAMNGYENEAIKLFSGLQSSNLTPDYTSFIAVLT 362

Query: 904  ASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEADD 1083
            A NHS +V+ A+ YF LM ETY I+P I+HY C+VD LGRAGL+EEA + ++SMP + D 
Sbjct: 363  ACNHSGKVNQAKDYFTLMTETYKIKPSIKHYSCMVDALGRAGLLEEAEKLIRSMPSDPDA 422

Query: 1084 VIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            +IWGSLL+ C +KH  N+E+A+ AA+ +  LD  ++  +VLMS
Sbjct: 423  IIWGSLLSAC-RKHG-NIEMAKQAAKQIIELDKNESCGYVLMS 463


>ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Glycine max]
          Length = 534

 Score =  482 bits (1240), Expect = e-133
 Identities = 241/404 (59%), Positives = 311/404 (76%)
 Frame = +1

Query: 1    CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGK 180
            CA+S+  D++YA  LF  I  PNL+ WNTIIR FS SS PH+AISLF++ML +S VLP +
Sbjct: 67   CASSSG-DINYAYLLFTTIPSPNLYCWNTIIRGFSRSSTPHLAISLFVDMLCSS-VLPQR 124

Query: 181  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 360
            LTYPSVFKAY QLG   DGAQLHGR+VKLGLE D FI+N+II+MYA+ GLL  A  +FDE
Sbjct: 125  LTYPSVFKAYAQLGAGYDGAQLHGRVVKLGLEKDQFIQNTIIYMYANSGLLSEARRVFDE 184

Query: 361  NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 540
              DLDVVA NSMIMG AKCGE+++S RLF  +P R  ++WN+MISGYVRN R +EAL LF
Sbjct: 185  LVDLDVVACNSMIMGLAKCGEVDKSRRLFDNMPTRTRVTWNSMISGYVRNKRLMEALELF 244

Query: 541  HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 720
             +MQ +++ P+ FT+VS+L+AC  LGAL+ G+W+HDY+KR  FELNVIV+TAIIDMYCKC
Sbjct: 245  RKMQGERVEPSEFTMVSLLSACAHLGALKHGEWVHDYVKRGHFELNVIVLTAIIDMYCKC 304

Query: 721  GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVL 900
            G I  A EVF+ SP +GL+CWNS+++GLA NG+  +  E FSKLE+S+L+PD VSF+ VL
Sbjct: 305  GVIVKAIEVFEASPTRGLSCWNSIIIGLALNGYERKAIEYFSKLEASDLKPDHVSFIGVL 364

Query: 901  TASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEAD 1080
            TA  +   V  AR YF LM   Y IEP I+HY C+V+VLG+A L+EEA + +K MP++AD
Sbjct: 365  TACKYIGAVGKARDYFSLMMNKYEIEPSIKHYTCMVEVLGQAALLEEAEQLIKGMPLKAD 424

Query: 1081 DVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
             +IWGSLL++C +KH  NVE+A+ AA+ +  L+  D S ++LMS
Sbjct: 425  FIIWGSLLSSC-RKHG-NVEIAKRAAQRVCELNPSDASGYLLMS 466


>ref|XP_007141857.1| hypothetical protein PHAVU_008G231600g [Phaseolus vulgaris]
            gi|561014990|gb|ESW13851.1| hypothetical protein
            PHAVU_008G231600g [Phaseolus vulgaris]
          Length = 525

 Score =  476 bits (1225), Expect = e-132
 Identities = 239/404 (59%), Positives = 304/404 (75%)
 Frame = +1

Query: 1    CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGK 180
            CA+S+  D++YA  +F  I  PNL+ WNTIIR FS SS P  AISLF++ML  S V P +
Sbjct: 65   CASSSG-DINYAYLVFTGIPNPNLYCWNTIIRGFSRSSTPQFAISLFVDMLY-SAVEPQR 122

Query: 181  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 360
            LTYPSVFKAY QLG   DGAQLHGR+VKLGLE D FI N+I++MYA+ GL+  A  +FDE
Sbjct: 123  LTYPSVFKAYAQLGAGHDGAQLHGRVVKLGLEKDQFISNTILYMYANSGLMSEARRVFDE 182

Query: 361  NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 540
              +LDVVA NSMIMG AKCGE+++S RLF  +P R  +SWN+MISGYVRNGR  E L LF
Sbjct: 183  PLELDVVACNSMIMGLAKCGEVDKSRRLFDNMPTRTAVSWNSMISGYVRNGRLTEGLELF 242

Query: 541  HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 720
             +MQE+ + P+ FT+VS+L+AC  LGAL+ G+W+HDYIKR +F+LNVIV+TAIIDMYCKC
Sbjct: 243  RKMQEEGVEPSEFTMVSLLSACAHLGALQHGEWVHDYIKRGNFKLNVIVLTAIIDMYCKC 302

Query: 721  GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVL 900
            G I  A EVF  SP +GL CWNS+++GLA NGH  +  E FSKLESSN++PD VSF+ VL
Sbjct: 303  GSIEKAVEVFAASPTRGLPCWNSIIIGLALNGHEREAIEYFSKLESSNIKPDCVSFIGVL 362

Query: 901  TASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEAD 1080
            TA  +   V +AR YF LM + Y IEP I+HY C+V+VLG A L+EEA E +K M +EAD
Sbjct: 363  TACKYLGAVREARDYFALMMDKYEIEPSIKHYTCLVEVLGHAALLEEAEEVIKGMSIEAD 422

Query: 1081 DVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
             +IWGSLL++C +KH  NVE+A+ AA+ +  L+  + S ++LMS
Sbjct: 423  FIIWGSLLSSC-RKHG-NVEIAKRAAQRVFELNPREASGYLLMS 464


>ref|XP_003617444.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355518779|gb|AET00403.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 542

 Score =  475 bits (1222), Expect = e-131
 Identities = 243/409 (59%), Positives = 307/409 (75%), Gaps = 5/409 (1%)
 Frame = +1

Query: 1    CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGK 180
            CA S + +++YA  LF R+  PNL++WNTIIRAFS SS P  AISLF++ML  SQ+ P  
Sbjct: 70   CA-SPSGNINYAYKLFVRMPNPNLYSWNTIIRAFSRSSTPQFAISLFVDMLY-SQIQPQY 127

Query: 181  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 360
            LTYPSVFKAY QLG A  GAQLHGR+VKLGL++D FI N+II+MYA+ GL+  A  +FD 
Sbjct: 128  LTYPSVFKAYAQLGHAHYGAQLHGRVVKLGLQNDQFICNTIIYMYANGGLMSEARRVFDG 187

Query: 361  NR----DLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEA 528
             +    D DVVA NSMIMG+AKCGEI+ES  LF  +  R  +SWN+MISGYVRNG+ +EA
Sbjct: 188  KKLELYDHDVVAINSMIMGYAKCGEIDESRNLFDDMITRTSVSWNSMISGYVRNGKLMEA 247

Query: 529  LNLFHEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDM 708
            L LF++MQ +    + FT+VS+LNAC  LGAL+ GKW+HDYIKRN FELNVIV+TAIIDM
Sbjct: 248  LELFNKMQVEGFEVSEFTMVSLLNACAHLGALQHGKWVHDYIKRNHFELNVIVVTAIIDM 307

Query: 709  YCKCGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSN-LRPDAVS 885
            YCKCG +  A EVF+T PR+GL+CWNS+++GLA NGH  + FE FSKLESS  L+PD+VS
Sbjct: 308  YCKCGSVENAVEVFETCPRRGLSCWNSIIIGLAMNGHEREAFEFFSKLESSKLLKPDSVS 367

Query: 886  FVAVLTASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSM 1065
            F+ VLTA  H   ++ AR YF LM   Y IEP I+HY C+VDVLG+AGL+EEA E +K M
Sbjct: 368  FIGVLTACKHLGAINKARDYFELMMNKYEIEPSIKHYTCIVDVLGQAGLLEEAEELIKGM 427

Query: 1066 PMEADDVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            P++ D +IWGSLL++C K    NV++A  AA+ +  L+  D S +VLMS
Sbjct: 428  PLKPDAIIWGSLLSSCRKHR--NVQIARRAAQRVYELNPSDASGYVLMS 474


>ref|XP_002534070.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223525897|gb|EEF28314.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 533

 Score =  475 bits (1222), Expect = e-131
 Identities = 231/404 (57%), Positives = 310/404 (76%)
 Frame = +1

Query: 1    CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGK 180
            CA S A D++YA  +F +I  PN+F WNTIIR FS SS P  +ISL+++ML TS V P +
Sbjct: 64   CA-SPAGDINYAYLVFVQIQNPNIFAWNTIIRGFSRSSVPQNSISLYIDMLLTSPVQPQR 122

Query: 181  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 360
            LTYPSVFKA+ QL LA +GAQLHG+++KLGLE+D FIRN+I+ MY +CG    A  +FD 
Sbjct: 123  LTYPSVFKAFAQLDLASEGAQLHGKMIKLGLENDSFIRNTILFMYVNCGFTSEARKVFDR 182

Query: 361  NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 540
              D D+VAWN+MIMG AKCG ++ES RLF K+ LRN +SWN+MISGYVRNGR+ +AL LF
Sbjct: 183  GMDFDIVAWNTMIMGVAKCGLVDESRRLFDKMSLRNAVSWNSMISGYVRNGRFFDALELF 242

Query: 541  HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 720
             +MQ ++I P+ FT+VS+LNAC  LGA+ QG+WIHDY+ +  FELN IV+TAIIDMY KC
Sbjct: 243  QKMQVERIEPSEFTMVSLLNACACLGAIRQGEWIHDYMVKKKFELNPIVVTAIIDMYSKC 302

Query: 721  GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVL 900
            G I  A +VF+++PR+GL+CWNSM+LGLA NG   +  +LFS L+SS+LRPD VSF+AVL
Sbjct: 303  GSIDKAVQVFQSAPRRGLSCWNSMILGLAMNGQENEALQLFSVLQSSDLRPDDVSFIAVL 362

Query: 901  TASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEAD 1080
            TA +H+  VD A+ YF LM++ Y I+P I+H+ C+VDVLGRAGL+EEA E ++SM ++ D
Sbjct: 363  TACDHTGMVDKAKDYFLLMRDKYKIKPGIKHFSCMVDVLGRAGLLEEAEELIRSMHVDPD 422

Query: 1081 DVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
             +IWGSLL +C K    N+++A+ AA +L  L+  ++S+ VL++
Sbjct: 423  AIIWGSLLWSCCKYG--NIKMAKRAANHLIELNPSESSSFVLVA 464


>ref|XP_004151347.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Cucumis sativus]
            gi|449530724|ref|XP_004172343.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Cucumis sativus]
          Length = 543

 Score =  471 bits (1213), Expect = e-130
 Identities = 226/404 (55%), Positives = 309/404 (76%)
 Frame = +1

Query: 1    CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGK 180
            CA S   ++ YA  +F ++  PNLF+WNT+IR FS SS+P +A+ LF++ML +SQV P +
Sbjct: 66   CA-SPLGNMDYAYLVFLQMQNPNLFSWNTVIRGFSQSSNPQIALYLFIDMLVSSQVEPQR 124

Query: 181  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 360
            LTYPS+FKAY+QLGLA DGAQLHGRI+KLGL+ DPFIRN+I++MYA+ G L  A  +F++
Sbjct: 125  LTYPSIFKAYSQLGLAHDGAQLHGRIIKLGLQFDPFIRNTILYMYATGGFLSEARRIFNQ 184

Query: 361  NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 540
              + DVV+WNSMI+G AKCGEI+ES +LF K+P++N ISWN+MI GYVRNG + EAL LF
Sbjct: 185  EMEFDVVSWNSMILGLAKCGEIDESRKLFDKMPVKNPISWNSMIGGYVRNGMFKEALKLF 244

Query: 541  HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 720
             +MQE++I+P+ FT+VS+LNA  ++GAL QG WIH+YIK+N+ +LN IV+TAIIDMYCKC
Sbjct: 245  IKMQEERIQPSEFTMVSLLNASAQIGALRQGVWIHEYIKKNNLQLNAIVVTAIIDMYCKC 304

Query: 721  GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVL 900
            G IG A +VF+  P + L+ WNSM+ GLA NG  ++   +F  LESS+L+PD +SF+AVL
Sbjct: 305  GSIGNALQVFEKIPCRSLSSWNSMIFGLAVNGCEKEAILVFKMLESSSLKPDCISFMAVL 364

Query: 901  TASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEAD 1080
            TA NH   VD+  ++F  MK TY IEP I+HY  +VD++ RAG +EEA +F+K+MP+E D
Sbjct: 365  TACNHGAMVDEGMEFFSRMKNTYRIEPSIKHYNLMVDMISRAGFLEEAEQFIKTMPIEKD 424

Query: 1081 DVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
             +IWG LL+ C  +   N E+A+ AA  ++ LD E+T  +VLM+
Sbjct: 425  AIIWGCLLSAC--RIYGNTEMAKRAAEKVNELDPEETMGYVLMA 466


>ref|XP_006411565.1| hypothetical protein EUTSA_v10017572mg [Eutrema salsugineum]
            gi|557112734|gb|ESQ53018.1| hypothetical protein
            EUTSA_v10017572mg [Eutrema salsugineum]
          Length = 546

 Score =  471 bits (1211), Expect = e-130
 Identities = 231/404 (57%), Positives = 305/404 (75%)
 Frame = +1

Query: 1    CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGK 180
            C TS + D+ YA  LF RI+  N F WNTIIR FS SS P ++I++F++M +++   P +
Sbjct: 67   CCTSPS-DMSYAYLLFTRINHKNPFVWNTIIRGFSRSSFPEMSITIFIDMFSSASAKPQR 125

Query: 181  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 360
            LTYPSVFKAY  LG A DG QLHG ++K GLE D FIRN+++HMYA+CG    A  +F  
Sbjct: 126  LTYPSVFKAYASLGKARDGMQLHGMVIKEGLEDDSFIRNTMLHMYATCGCFVEAWRIFMA 185

Query: 361  NRDLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLF 540
             +  DVVAWNSM+MG A+ G IE++ +LF ++P RNEISWN+MISG+V+NGR+ +AL +F
Sbjct: 186  MKHFDVVAWNSMMMGLARYGLIEQAQKLFDEMPQRNEISWNSMISGFVKNGRFKDALEMF 245

Query: 541  HEMQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKC 720
             +MQE+ ++P  FT+VS+LNAC  LGA EQG+WIH+YI +N FELN IVITA+IDMYCKC
Sbjct: 246  RKMQERNVKPDGFTMVSLLNACAYLGASEQGRWIHEYIVKNRFELNSIVITALIDMYCKC 305

Query: 721  GEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVL 900
            G I     VF+++P K L+CWNSM+LGLANNG+ E+  +LFS+LESS+L PD+VSF+ VL
Sbjct: 306  GCIEEGLRVFESAPNKQLSCWNSMVLGLANNGYEERAMDLFSELESSDLEPDSVSFIGVL 365

Query: 901  TASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEAD 1080
            TA  +S +VD+A ++FRLM+E Y IEP I+HY C+V+VLG AGL+EEA   +K+MPME D
Sbjct: 366  TACAYSGKVDEAGEFFRLMREKYLIEPSIKHYTCMVNVLGGAGLLEEAEAMIKNMPMEQD 425

Query: 1081 DVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
             +IW SLL+ C K    NVE+AE AA+ L  LD +DT  +VLMS
Sbjct: 426  AIIWSSLLSACRKNG--NVEMAERAAKCLKKLDPDDTCGYVLMS 467


>ref|XP_004491336.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
            chloroplastic-like [Cicer arietinum]
          Length = 536

 Score =  469 bits (1208), Expect = e-130
 Identities = 237/406 (58%), Positives = 308/406 (75%), Gaps = 2/406 (0%)
 Frame = +1

Query: 1    CATSAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGK 180
            CA S + +++YA  LF R+  PNL++WNTIIRAFS SS P  AISLF++ML  SQ+ P  
Sbjct: 67   CA-SPSGNINYAYKLFARMPNPNLYSWNTIIRAFSRSSTPQFAISLFVDMLY-SQIQPQH 124

Query: 181  LTYPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDE 360
            LTYPSVFKAY QL     G+QLHG +VKLGL+ D FI N+II+MYA+ GLL  A  +FDE
Sbjct: 125  LTYPSVFKAYAQLSAGDYGSQLHGMVVKLGLQRDQFIHNTIIYMYANSGLLSEAKRVFDE 184

Query: 361  NRDL-DVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNL 537
              +L DVVA+NSMIMGFAKCGEI+E+ +LF ++  R  ++WN+MISGYVRNG+ +EAL L
Sbjct: 185  KLELGDVVAFNSMIMGFAKCGEIDEARKLFDEMFTRTSVTWNSMISGYVRNGKLMEALEL 244

Query: 538  FHEMQ-EKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYC 714
            FH+MQ E+++ P+ FT+VS+LNAC  LGAL+ GKW+HDYIKRNDFELNVIV+TAIIDMYC
Sbjct: 245  FHKMQLEERVEPSEFTMVSLLNACAHLGALQHGKWVHDYIKRNDFELNVIVLTAIIDMYC 304

Query: 715  KCGEIGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVA 894
            KCG +  A +VF T P +GL+CWNS+++GLA NGH  + FE FS+LE S  +PD+VSF+ 
Sbjct: 305  KCGSVENAIQVFDTYPGRGLSCWNSIIIGLAMNGHEREAFEFFSELELSKFKPDSVSFIG 364

Query: 895  VLTASNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPME 1074
            VLTA  H   VD A+ YF LM   Y IEP I+HY C+V+VLG+A  +EEA E ++ MP++
Sbjct: 365  VLTACKHLGAVDKAKDYFALMMNEYKIEPSIKHYTCMVEVLGQAAFLEEAEELIQGMPIK 424

Query: 1075 ADDVIWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
             D +IWGSLL++C +KH  NV+ A+ AA+ +  L+  D S +VLMS
Sbjct: 425  PDAIIWGSLLSSC-RKHG-NVQRAKRAAQRVYELNPSDASGYVLMS 468


>ref|NP_181820.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75206274|sp|Q9SJG6.1|PP200_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g42920, chloroplastic; Flags: Precursor
            gi|4512663|gb|AAD21717.1| hypothetical protein
            [Arabidopsis thaliana] gi|20197867|gb|AAM15291.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|110738441|dbj|BAF01146.1| hypothetical protein
            [Arabidopsis thaliana] gi|330255093|gb|AEC10187.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 559

 Score =  461 bits (1185), Expect = e-127
 Identities = 232/402 (57%), Positives = 300/402 (74%), Gaps = 1/402 (0%)
 Frame = +1

Query: 10   SAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTS-QVLPGKLT 186
            ++ +D++YA  +F RI+  N F WNTIIR FS SS P +AIS+F++ML +S  V P +LT
Sbjct: 69   ASPSDMNYAYLVFTRINHKNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLT 128

Query: 187  YPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENR 366
            YPSVFKAY +LG A DG QLHG ++K GLE D FIRN+++HMY +CG L  A  +F    
Sbjct: 129  YPSVFKAYGRLGQARDGRQLHGMVIKEGLEDDSFIRNTMLHMYVTCGCLIEAWRIFLGMI 188

Query: 367  DLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHE 546
              DVVAWNSMIMGFAKCG I+++  LF ++P RN +SWN+MISG+VRNGR+ +AL++F E
Sbjct: 189  GFDVVAWNSMIMGFAKCGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFRE 248

Query: 547  MQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGE 726
            MQEK ++P  FT+VS+LNAC  LGA EQG+WIH+YI RN FELN IV+TA+IDMYCKCG 
Sbjct: 249  MQEKDVKPDGFTMVSLLNACAYLGASEQGRWIHEYIVRNRFELNSIVVTALIDMYCKCGC 308

Query: 727  IGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTA 906
            I     VF+ +P+K L+CWNSM+LGLANNG  E+  +LFS+LE S L PD+VSF+ VLTA
Sbjct: 309  IEEGLNVFECAPKKQLSCWNSMILGLANNGFEERAMDLFSELERSGLEPDSVSFIGVLTA 368

Query: 907  SNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEADDV 1086
              HS  V  A ++FRLMKE Y IEP I+HY  +V+VLG AGL+EEA   +K+MP+E D V
Sbjct: 369  CAHSGEVHRADEFFRLMKEKYMIEPSIKHYTLMVNVLGGAGLLEEAEALIKNMPVEEDTV 428

Query: 1087 IWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            IW SLL+ C K    NVE+A+ AA+ L  LD ++T  +VL+S
Sbjct: 429  IWSSLLSACRKIG--NVEMAKRAAKCLKKLDPDETCGYVLLS 468


>ref|XP_002880012.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297325851|gb|EFH56271.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 542

 Score =  458 bits (1179), Expect = e-126
 Identities = 230/402 (57%), Positives = 295/402 (73%), Gaps = 1/402 (0%)
 Frame = +1

Query: 10   SAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTS-QVLPGKLT 186
            ++ +D +YA  +F RI+  N F WNTIIR FS SS P +AIS+F++ML +S  V P +LT
Sbjct: 69   ASPSDRNYAYLVFTRINHKNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLT 128

Query: 187  YPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENR 366
            YPSVFKAY  LGLA DG QLHGR++K GLE D FIRN+++HMY +CG L  A  LF    
Sbjct: 129  YPSVFKAYASLGLARDGRQLHGRVIKEGLEDDSFIRNTMLHMYVTCGCLVEAWRLFVGMM 188

Query: 367  DLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHE 546
              DVVAWNS+IMG AKCG I+++ +LF ++P RN +SWN+MISG+VRNGR+ +AL +F E
Sbjct: 189  GFDVVAWNSIIMGLAKCGLIDQAQKLFDEMPQRNGVSWNSMISGFVRNGRFKDALEMFRE 248

Query: 547  MQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGE 726
            MQE+ ++P  FT+VS+LNAC  LGA EQG+WIH YI RN FELN IVITA+IDMYCKCG 
Sbjct: 249  MQERDVKPDGFTMVSLLNACAYLGASEQGRWIHKYIVRNRFELNSIVITALIDMYCKCGC 308

Query: 727  IGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTA 906
                 +VF+ +P K L+CWNSM+LGLANNG  E+  +LF +LE + L PD+VSF+ VLTA
Sbjct: 309  FEEGLKVFECAPTKQLSCWNSMILGLANNGCEERAMDLFLELERTGLEPDSVSFIGVLTA 368

Query: 907  SNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEADDV 1086
              HS  V  A ++FRLM+E Y IEP I+HY C+V+VLG AGL++EA   +K MP+E D +
Sbjct: 369  CAHSGEVHKAGEFFRLMREKYMIEPSIKHYTCMVNVLGGAGLLDEAEALIKKMPVEGDTI 428

Query: 1087 IWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            IW SLLA C K    NVE+A+ AA  L  LD ++T  +VLMS
Sbjct: 429  IWSSLLAACRKNG--NVEMAKRAANCLKNLDPDETCGYVLMS 468



 Score = 85.1 bits (209), Expect = 5e-14
 Identities = 84/372 (22%), Positives = 147/372 (39%), Gaps = 2/372 (0%)
 Frame = +1

Query: 34   ALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTSQVLPGKLTYPSVFKAYT 213
            A  LF  + + N  +WN++I  F  +     A+ +F EM     V P   T  S+  A  
Sbjct: 211  AQKLFDEMPQRNGVSWNSMISGFVRNGRFKDALEMFREM-QERDVKPDGFTMVSLLNACA 269

Query: 214  QLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENRDLDVVAWNS 393
             LG +  G  +H  IV+   E +  +  ++I MY                          
Sbjct: 270  YLGASEQGRWIHKYIVRNRFELNSIVITALIDMYC------------------------- 304

Query: 394  MIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHEMQEKQIRPT 573
                  KCG  EE  ++F   P +    WN+MI G   NG    A++LF E++   + P 
Sbjct: 305  ------KCGCFEEGLKVFECAPTKQLSCWNSMILGLANNGCEERAMDLFLELERTGLEPD 358

Query: 574  HFTLVSILNACGKLGALEQ-GKWIHDYIKRNDFELNVIVITAIIDMYCKCGEIGMAREVF 750
              + + +L AC   G + + G++     ++   E ++   T ++++    G +  A  + 
Sbjct: 359  SVSFIGVLTACAHSGEVHKAGEFFRLMREKYMIEPSIKHYTCMVNVLGGAGLLDEAEALI 418

Query: 751  KTSPRKG-LACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTASNHSVRV 927
            K  P +G    W+S++     NG+ E      + L+  NL PD      +++ +  S  +
Sbjct: 419  KKMPVEGDTIIWSSLLAACRKNGNVEMAKRAANCLK--NLDPDETCGYVLMSNAYASYGL 476

Query: 928  DDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEADDVIWGSLLA 1107
             +     RL+ +   +E ++      VD         E  EFV                 
Sbjct: 477  FEEAVEQRLLMKERQMEKEVGCSSIEVDF--------EVHEFV----------------- 511

Query: 1108 NCSKKHPCNVEV 1143
            +C KKHP + E+
Sbjct: 512  SCGKKHPKSTEI 523


>ref|XP_006293939.1| hypothetical protein CARUB_v10022931mg [Capsella rubella]
            gi|565472276|ref|XP_006293940.1| hypothetical protein
            CARUB_v10022931mg [Capsella rubella]
            gi|482562647|gb|EOA26837.1| hypothetical protein
            CARUB_v10022931mg [Capsella rubella]
            gi|482562648|gb|EOA26838.1| hypothetical protein
            CARUB_v10022931mg [Capsella rubella]
          Length = 555

 Score =  452 bits (1162), Expect = e-124
 Identities = 227/402 (56%), Positives = 297/402 (73%), Gaps = 1/402 (0%)
 Frame = +1

Query: 10   SAAADLHYALSLFRRIDRPNLFTWNTIIRAFSHSSDPHVAISLFLEMLTTS-QVLPGKLT 186
            ++ +D++YA  +F RI+  N F WNTIIR FS SS P +AIS+F++ML +S  V P  LT
Sbjct: 64   ASPSDMNYAYLVFTRINHKNPFVWNTIIRGFSQSSFPEMAISIFIDMLCSSPSVKPQNLT 123

Query: 187  YPSVFKAYTQLGLAMDGAQLHGRIVKLGLESDPFIRNSIIHMYASCGLLGNAGNLFDENR 366
            YPSVFKAY +LG A+DG QLHGR++K GLE D FIRN+++ MY + G L  A  +F    
Sbjct: 124  YPSVFKAYGRLGQAIDGRQLHGRVLKEGLEDDSFIRNTMLQMYVTSGCLVEAWRIFVGMT 183

Query: 367  DLDVVAWNSMIMGFAKCGEIEESWRLFCKIPLRNEISWNTMISGYVRNGRWIEALNLFHE 546
            D DVVAWNSMIMG AKCG I ++ +LF ++P RNE+SWN+MISG+VRNGR+ +AL +F E
Sbjct: 184  DFDVVAWNSMIMGLAKCGLISQAQQLFDEMPHRNEVSWNSMISGFVRNGRFKDALEMFRE 243

Query: 547  MQEKQIRPTHFTLVSILNACGKLGALEQGKWIHDYIKRNDFELNVIVITAIIDMYCKCGE 726
            MQE+ ++P  FT+VS+LNAC  LGA EQG+WIH+YI RN FELN IVITA+I+MYCKCG 
Sbjct: 244  MQERNVKPDGFTMVSLLNACAYLGANEQGRWIHEYIARNRFELNSIVITALIEMYCKCGC 303

Query: 727  IGMAREVFKTSPRKGLACWNSMMLGLANNGHYEQVFELFSKLESSNLRPDAVSFVAVLTA 906
            I    +VF+ +P+K L+CWNSM+LGLANNG  E+  +LF +LE   L PD+VSF+ VLTA
Sbjct: 304  IEEGLKVFECAPKKQLSCWNSMILGLANNGCEERAMDLFLELERFGLEPDSVSFIGVLTA 363

Query: 907  SNHSVRVDDARKYFRLMKETYGIEPKIEHYGCVVDVLGRAGLIEEAAEFVKSMPMEADDV 1086
              +S  V  A  +FRLM+E Y +EP I+HY C+V+VLG AGL++EA   +K MP+E D +
Sbjct: 364  CAYSGEVHKAGGFFRLMREKYMVEPSIKHYTCMVNVLGGAGLLDEAESLIKKMPVEEDAI 423

Query: 1087 IWGSLLANCSKKHPCNVEVAEWAARNLSLLDSEDTSAHVLMS 1212
            IW SLLA C K    NVE+A+ AA+ L  LD ++T  +VLMS
Sbjct: 424  IWSSLLAACRKY--SNVEMAKRAAKCLKKLDPDETCGYVLMS 463


Top