BLASTX nr result

ID: Akebia24_contig00021346 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00021346
         (938 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containi...   406   e-111
ref|XP_002306741.1| pentatricopeptide repeat-containing family p...   384   e-104
ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containi...   382   e-104
ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citr...   382   e-103
ref|XP_007048864.1| Pentatricopeptide repeat (PPR-like) superfam...   378   e-102
ref|XP_004305832.1| PREDICTED: pentatricopeptide repeat-containi...   377   e-102
ref|XP_002534070.1| pentatricopeptide repeat-containing protein,...   373   e-101
ref|XP_007141857.1| hypothetical protein PHAVU_008G231600g [Phas...   370   e-100
ref|XP_004151347.1| PREDICTED: pentatricopeptide repeat-containi...   370   e-100
gb|EXB44509.1| hypothetical protein L484_000760 [Morus notabilis...   369   e-99 
ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containi...   363   5e-98
ref|XP_004250888.1| PREDICTED: pentatricopeptide repeat-containi...   362   2e-97
ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containi...   359   8e-97
gb|EYU29595.1| hypothetical protein MIMGU_mgv1a025435mg [Mimulus...   349   8e-94
ref|XP_002880012.1| pentatricopeptide repeat-containing protein ...   339   8e-91
ref|NP_181820.1| pentatricopeptide repeat-containing protein [Ar...   335   1e-89
ref|XP_007216544.1| hypothetical protein PRUPE_ppb007734mg [Prun...   334   3e-89
ref|XP_003617444.1| Pentatricopeptide repeat-containing protein ...   334   3e-89
ref|XP_006293939.1| hypothetical protein CARUB_v10022931mg [Caps...   331   2e-88
ref|XP_006411565.1| hypothetical protein EUTSA_v10017572mg [Eutr...   329   1e-87

>ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
           chloroplastic [Vitis vinifera]
           gi|302143555|emb|CBI22116.3| unnamed protein product
           [Vitis vinifera]
          Length = 533

 Score =  406 bits (1043), Expect = e-111
 Identities = 198/269 (73%), Positives = 227/269 (84%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFY 630
           +HP LS+LEK CTTMKDL K+HA+L+KTGL K  +A S VL+FCATS  GDINYAYLVF 
Sbjct: 23  DHPHLSILEKHCTTMKDLQKIHAHLLKTGLAKHPLAVSPVLAFCATSPGGDINYAYLVFT 82

Query: 629 QIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAH 450
           QI  PNLF+WNTIIRGFSQSSTPH AISLFIDML  S +QPHRLTYPS+FKAYAQLGLAH
Sbjct: 83  QIHSPNLFSWNTIIRGFSQSSTPHHAISLFIDMLIVSSVQPHRLTYPSVFKAYAQLGLAH 142

Query: 449 DGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHA 270
            G+QLHGR+IKLGL+ DPFIRNTII+MY NCG+L E    F E  DFD+VAWNSMI+G A
Sbjct: 143 YGAQLHGRVIKLGLQFDPFIRNTIIYMYANCGFLSEMWKAFYERMDFDIVAWNSMIMGLA 202

Query: 269 KNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVS 90
           K G++DESR+LFD MP R+T+SWNSMISGYVRNGRL++A DLF +MQ + I+PSEFT VS
Sbjct: 203 KCGEVDESRKLFDEMPLRNTVSWNSMISGYVRNGRLREALDLFGQMQEERIKPSEFTMVS 262

Query: 89  LLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           LL A A LGALKQGEWIH YI+KNN E+N
Sbjct: 263 LLNASARLGALKQGEWIHDYIRKNNFELN 291



 Score = 91.7 bits (226), Expect = 4e-16
 Identities = 64/253 (25%), Positives = 121/253 (47%), Gaps = 5/253 (1%)
 Frame = -1

Query: 752 KLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQHPNLFTWNTIIRGFSQ 573
           +LH  +IK GL  D    + ++   A    G ++  +  FY+    ++  WN++I G ++
Sbjct: 146 QLHGRVIKLGLQFDPFIRNTIIYMYANC--GFLSEMWKAFYERMDFDIVAWNSMIMGLAK 203

Query: 572 SSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAHDGSQLHGRIIKLGLESDPF 393
                 +  LF +M    P++ + +++ S+   Y + G   +   L G++ +  ++   F
Sbjct: 204 CGEVDESRKLFDEM----PLR-NTVSWNSMISGYVRNGRLREALDLFGQMQEERIKPSEF 258

Query: 392 IRNTIIFMYTNCGYLIEAGWLFN----ENSDFDVVAWNSMIIGHAKNGQIDESRRLFDMM 225
              +++      G L +  W+ +     N + +V+   S+I  + K G I E+ ++F+M 
Sbjct: 259 TMVSLLNASARLGALKQGEWIHDYIRKNNFELNVIVTASIIDMYCKCGSIGEAFQVFEMA 318

Query: 224 PSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLTACAHLGAL-KQG 48
           P +   SWN+MI G   NG   +A  LF R++   + P + T V +LTAC + G + K  
Sbjct: 319 PLKGLSSWNTMILGLAMNGCENEAIQLFSRLECSNLRPDDVTFVGVLTACNYSGLVDKAK 378

Query: 47  EWIHAYIKKNNIE 9
           E+     K   IE
Sbjct: 379 EYFSLMSKTYKIE 391


>ref|XP_002306741.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|222856190|gb|EEE93737.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 509

 Score =  384 bits (986), Expect = e-104
 Identities = 192/264 (72%), Positives = 224/264 (84%), Gaps = 1/264 (0%)
 Frame = -1

Query: 791 MLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQHPN 612
           ML+K+CT+MKDL K+HA LIKTGL KDTIAASRVL+FC TS  GDINYAYLVF QI++PN
Sbjct: 1   MLDKNCTSMKDLQKIHAQLIKTGLAKDTIAASRVLAFC-TSPAGDINYAYLVFTQIRNPN 59

Query: 611 LFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPI-QPHRLTYPSLFKAYAQLGLAHDGSQL 435
           LF WNTIIRGFSQSSTPH AISLFIDM+++SP  QP RLTYPS+FKAYAQLGLAH+G+QL
Sbjct: 60  LFVWNTIIRGFSQSSTPHNAISLFIDMMFTSPTTQPQRLTYPSVFKAYAQLGLAHEGAQL 119

Query: 434 HGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHAKNGQI 255
           HGR+IKLGLE+D FI+NTI+ MY NCG+L EA  +F+  + FDVV WN+MIIG AK G+I
Sbjct: 120 HGRVIKLGLENDQFIQNTILNMYVNCGFLGEAQRIFDGATGFDVVTWNTMIIGLAKCGEI 179

Query: 254 DESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLTAC 75
           D+SRRLFD M  R+T+SWNSMISGYVR GR  +A +LF RMQ +GI+PSEFT VSLL AC
Sbjct: 180 DKSRRLFDKMLLRNTVSWNSMISGYVRKGRFFEAMELFSRMQEEGIKPSEFTMVSLLNAC 239

Query: 74  AHLGALKQGEWIHAYIKKNNIEVN 3
           A LGAL+QGEWIH YI KNN  +N
Sbjct: 240 ACLGALRQGEWIHDYIVKNNFALN 263



 Score = 75.1 bits (183), Expect = 4e-11
 Identities = 58/211 (27%), Positives = 99/211 (46%), Gaps = 5/211 (2%)
 Frame = -1

Query: 662 GDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSL 483
           G+I+ +  +F ++   N  +WN++I G+ +      A+ LF  M     I+P   T  SL
Sbjct: 177 GEIDKSRRLFDKMLLRNTVSWNSMISGYVRKGRFFEAMELFSRM-QEEGIKPSEFTMVSL 235

Query: 482 FKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDV 303
             A A LG    G  +H  I+K     +  +   II MY+ CG + +A  +F       +
Sbjct: 236 LNACACLGALRQGEWIHDYIVKNNFALNSIVITAIIDMYSKCGSIDKALQVFKSAPKKGL 295

Query: 302 VAWNSMIIGHAKNGQIDESRRLFDMMPSRST----ISWNSMISGYVRNGRLKDAFDLFHR 135
             WNS+I+G A +G+ +E+ RLF  + S +     +S+  +++     G +  A D F  
Sbjct: 296 SCWNSLILGLAMSGRGNEAVRLFSKLESSNLKPDHVSFIGVLTACNHAGMVDRAKDYFLL 355

Query: 134 M-QNKGIEPSEFTSVSLLTACAHLGALKQGE 45
           M +   IEPS      ++      G L++ E
Sbjct: 356 MSETYKIEPSIKHYSCMVDVLGRAGLLEEAE 386


>ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
           chloroplastic-like [Citrus sinensis]
          Length = 534

 Score =  382 bits (982), Expect = e-104
 Identities = 191/270 (70%), Positives = 224/270 (82%), Gaps = 1/270 (0%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFY 630
           + P LS+L+K CT+MKDL K+HA+LIKTGL KD IAASR+L+FC TS  GDINYAYLVF 
Sbjct: 19  DQPLLSLLDKQCTSMKDLKKIHAHLIKTGLAKDPIAASRILTFC-TSPAGDINYAYLVFT 77

Query: 629 QIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAH 450
           QI+ PNLF WNTIIRGFSQSSTP  AI LFIDML +SPIQP RLTYPSLFKAYAQLGLA 
Sbjct: 78  QIKKPNLFIWNTIIRGFSQSSTPRNAILLFIDMLVTSPIQPQRLTYPSLFKAYAQLGLAR 137

Query: 449 DGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNE-NSDFDVVAWNSMIIGH 273
           DG+QLHGR++K GLE D FI NTII+MY NCG+L EA  +F+E +++FDVVAWNSMIIG 
Sbjct: 138 DGAQLHGRVVKQGLEFDQFIHNTIIYMYANCGFLSEARLMFDEVDTEFDVVAWNSMIIGL 197

Query: 272 AKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSV 93
           AK G+IDESRRLFD M SR+T+SWNSMISGYVRN + K+A +LF  MQ + I+PSEFT V
Sbjct: 198 AKCGEIDESRRLFDKMVSRNTVSWNSMISGYVRNVKFKEALELFREMQEQNIKPSEFTMV 257

Query: 92  SLLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           SLL ACA LGA++QGEWIH ++  N  E+N
Sbjct: 258 SLLNACAKLGAIRQGEWIHNFLVTNCFELN 287



 Score = 77.4 bits (189), Expect = 8e-12
 Identities = 58/211 (27%), Positives = 102/211 (48%), Gaps = 5/211 (2%)
 Frame = -1

Query: 662 GDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSL 483
           G+I+ +  +F ++   N  +WN++I G+ ++     A+ LF +M   + I+P   T  SL
Sbjct: 201 GEIDESRRLFDKMVSRNTVSWNSMISGYVRNVKFKEALELFREMQEQN-IKPSEFTMVSL 259

Query: 482 FKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDV 303
             A A+LG    G  +H  ++    E +  +   II MY  CG    A  +FN      +
Sbjct: 260 LNACAKLGAIRQGEWIHNFLVTNCFELNTIVVTAIIDMYCKCGCPERALQVFNTVPKKGL 319

Query: 302 VAWNSMIIGHAKNGQIDESRRLFDMMPSRST----ISWNSMISGYVRNGRLKDAFDLFHR 135
             WNSM+ G A NG  +E+ +LF  + S +      S+ ++++    +G++  A D F  
Sbjct: 320 SCWNSMVFGLAMNGYENEAIKLFSGLQSSNLTPDYTSFIAVLTACNHSGKVNQAKDYFTL 379

Query: 134 M-QNKGIEPSEFTSVSLLTACAHLGALKQGE 45
           M +   I+PS      ++ A    G L++ E
Sbjct: 380 MTETYKIKPSIKHYSCMVDALGRAGLLEEAE 410


>ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citrus clementina]
           gi|557539373|gb|ESR50417.1| hypothetical protein
           CICLE_v10031197mg [Citrus clementina]
          Length = 534

 Score =  382 bits (980), Expect = e-103
 Identities = 191/270 (70%), Positives = 224/270 (82%), Gaps = 1/270 (0%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFY 630
           + P LS+L+K CT+MKDL K+HA+LIKTGL KD IAASR+L+FC TS  GDINYAYLVF 
Sbjct: 19  DQPLLSLLDKQCTSMKDLKKIHAHLIKTGLPKDPIAASRILAFC-TSPAGDINYAYLVFT 77

Query: 629 QIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAH 450
           QI+ PNLF WNTIIRGFSQSSTP  AI LFIDML +SPIQP RLTYPSLFKAYAQLGLA 
Sbjct: 78  QIKKPNLFIWNTIIRGFSQSSTPRNAILLFIDMLVTSPIQPQRLTYPSLFKAYAQLGLAR 137

Query: 449 DGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNE-NSDFDVVAWNSMIIGH 273
           DG+QLHGR++K GLE D FI NTII+MY NCG+L EA  +F+E +++FDVVAWNSMIIG 
Sbjct: 138 DGAQLHGRVVKQGLEFDQFIHNTIIYMYANCGFLSEARLIFDEVDTEFDVVAWNSMIIGL 197

Query: 272 AKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSV 93
           AK G+IDESRRLFD M SR+T+SWNSMISGYVRN + K+A +LF  MQ + I+PSEFT V
Sbjct: 198 AKCGEIDESRRLFDKMVSRNTVSWNSMISGYVRNVKFKEALELFREMQEQNIKPSEFTMV 257

Query: 92  SLLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           SLL ACA LGA++QGEWIH ++  N  E+N
Sbjct: 258 SLLNACAKLGAIRQGEWIHNFLVTNCFELN 287



 Score = 79.3 bits (194), Expect = 2e-12
 Identities = 59/211 (27%), Positives = 103/211 (48%), Gaps = 5/211 (2%)
 Frame = -1

Query: 662 GDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSL 483
           G+I+ +  +F ++   N  +WN++I G+ ++     A+ LF +M   + I+P   T  SL
Sbjct: 201 GEIDESRRLFDKMVSRNTVSWNSMISGYVRNVKFKEALELFREMQEQN-IKPSEFTMVSL 259

Query: 482 FKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDV 303
             A A+LG    G  +H  ++    E +  +   II MY  CG    A  +FN      +
Sbjct: 260 LNACAKLGAIRQGEWIHNFLVTNCFELNTIVVTAIIDMYCKCGCPERALQVFNTVPKKGL 319

Query: 302 VAWNSMIIGHAKNGQIDESRRLFDMMPSRST----ISWNSMISGYVRNGRLKDAFDLFHR 135
             WNSM+ G A NG  +E+ +LF  + S +     IS+ ++++    +G++  A D F  
Sbjct: 320 SCWNSMVFGLAMNGYENEAIKLFSGLQSSNLKPDYISFIAVLTACNHSGKVNQAKDYFTL 379

Query: 134 M-QNKGIEPSEFTSVSLLTACAHLGALKQGE 45
           M +   I+PS      ++ A    G L++ E
Sbjct: 380 MTETYKIKPSIKHYSCMVDALGRAGLLEEAE 410


>ref|XP_007048864.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
           cacao] gi|508701125|gb|EOX93021.1| Pentatricopeptide
           repeat (PPR-like) superfamily protein [Theobroma cacao]
          Length = 538

 Score =  378 bits (970), Expect = e-102
 Identities = 191/270 (70%), Positives = 220/270 (81%), Gaps = 1/270 (0%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFY 630
           + P LS+LE +CT+MKDL KLHA LIKTGL+ D IAASRVL+FC  S  GD+NYAYLVF 
Sbjct: 21  DQPYLSLLENNCTSMKDLKKLHAQLIKTGLVNDIIAASRVLAFCV-SPAGDMNYAYLVFT 79

Query: 629 QIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAH 450
           QI++PNLFTWNTIIRGFSQSS P IAISLFIDML  S IQP RLTYPS+FKAYAQLGLA 
Sbjct: 80  QIKNPNLFTWNTIIRGFSQSSNPQIAISLFIDMLVGSSIQPERLTYPSVFKAYAQLGLAC 139

Query: 449 DGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFN-ENSDFDVVAWNSMIIGH 273
           DG QLHGR+IKLGL+ D FIRNTII+MY NCG L EA  +F+ E+ + D+VAWNSMIIG 
Sbjct: 140 DGRQLHGRVIKLGLDYDQFIRNTIIYMYANCGLLSEAWRMFDEEHMELDIVAWNSMIIGL 199

Query: 272 AKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSV 93
           AK G++DESRRLF+ M SR+T+SWNSMISGYVRNGR  +A +LF  MQ + I PSEFT V
Sbjct: 200 AKCGEVDESRRLFNKMVSRNTVSWNSMISGYVRNGRFLEALELFQEMQEEHIRPSEFTMV 259

Query: 92  SLLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           SLL ACA LGA+ QG+WIH YI K N E+N
Sbjct: 260 SLLNACACLGAITQGKWIHDYILKQNFELN 289



 Score = 77.0 bits (188), Expect = 1e-11
 Identities = 48/150 (32%), Positives = 77/150 (51%)
 Frame = -1

Query: 662 GDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSL 483
           G+++ +  +F ++   N  +WN++I G+ ++     A+ LF +M     I+P   T  SL
Sbjct: 203 GEVDESRRLFNKMVSRNTVSWNSMISGYVRNGRFLEALELFQEM-QEEHIRPSEFTMVSL 261

Query: 482 FKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDV 303
             A A LG    G  +H  I+K   E +  +   II MY  CG   +A  +F  +    +
Sbjct: 262 LNACACLGAITQGKWIHDYILKQNFELNGIVVTAIIDMYCKCGNAEKALQVFTTSPKEGL 321

Query: 302 VAWNSMIIGHAKNGQIDESRRLFDMMPSRS 213
             WNSMI+G A NG  +E+R+LF  + S S
Sbjct: 322 SCWNSMILGLATNGCENEARQLFSKLESLS 351


>ref|XP_004305832.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 550

 Score =  377 bits (967), Expect = e-102
 Identities = 184/267 (68%), Positives = 218/267 (81%)
 Frame = -1

Query: 803 PSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQI 624
           P L MLE  CT MKDL K+HA+LIKTGL  DT+AASRVL+FCA S  GDINYAY+VF  I
Sbjct: 22  PHLFMLENQCTNMKDLQKIHAHLIKTGLANDTVAASRVLAFCA-SPAGDINYAYMVFRHI 80

Query: 623 QHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAHDG 444
            +PNLF WNTIIRGFS SS P  AISLFIDML +S +QP RLTYPS+FKAYAQLGLAHDG
Sbjct: 81  HNPNLFIWNTIIRGFSNSSNPEAAISLFIDMLVTSTVQPQRLTYPSVFKAYAQLGLAHDG 140

Query: 443 SQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHAKN 264
           +QLHGR++KLGLESD F+RNTII MY+NCG L EA  +F+E+ +FD+VAWNSMI+G +K 
Sbjct: 141 AQLHGRVVKLGLESDQFVRNTIIHMYSNCGLLSEARRVFDEDLEFDIVAWNSMIMGLSKC 200

Query: 263 GQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLL 84
           G++ ESRRLFD MP R++ISWNSMI G VRNG   +A DLF  MQ + I+PSEFT VSLL
Sbjct: 201 GEVGESRRLFDKMPQRNSISWNSMIGGSVRNGMYTEALDLFGEMQKQKIKPSEFTMVSLL 260

Query: 83  TACAHLGALKQGEWIHAYIKKNNIEVN 3
            A A LGA++QGEWIH YI+KN+I++N
Sbjct: 261 NASAQLGAIRQGEWIHEYIRKNHIQLN 287



 Score = 80.1 bits (196), Expect = 1e-12
 Identities = 55/207 (26%), Positives = 94/207 (45%)
 Frame = -1

Query: 671 SLVGDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTY 492
           S  G++  +  +F ++   N  +WN++I G  ++     A+ LF +M     I+P   T 
Sbjct: 198 SKCGEVGESRRLFDKMPQRNSISWNSMIGGSVRNGMYTEALDLFGEM-QKQKIKPSEFTM 256

Query: 491 PSLFKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSD 312
            SL  A AQLG    G  +H  I K  ++ +P +   II MY+ CG + +A  +F     
Sbjct: 257 VSLLNASAQLGAIRQGEWIHEYIRKNHIQLNPIVVTAIINMYSKCGSIEKAVHVFEAAPR 316

Query: 311 FDVVAWNSMIIGHAKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRM 132
             +  WNS+I+G A NG  +E                               A +LF R+
Sbjct: 317 TGLSCWNSIIMGLATNGCEEE-------------------------------AIELFSRL 345

Query: 131 QNKGIEPSEFTSVSLLTACAHLGALKQ 51
           ++    P + + + +LTAC+H G +++
Sbjct: 346 KSSSFVPDDVSFLGVLTACSHSGMVEK 372


>ref|XP_002534070.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223525897|gb|EEF28314.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 533

 Score =  373 bits (958), Expect = e-101
 Identities = 182/265 (68%), Positives = 219/265 (82%)
 Frame = -1

Query: 797 LSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQH 618
           LSML+K+CTTMKDL K+H+ LIKTGL KDT AASR+L+FCA S  GDINYAYLVF QIQ+
Sbjct: 25  LSMLDKNCTTMKDLKKIHSQLIKTGLAKDTNAASRILAFCA-SPAGDINYAYLVFVQIQN 83

Query: 617 PNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAHDGSQ 438
           PN+F WNTIIRGFS+SS P  +ISL+IDML +SP+QP RLTYPS+FKA+AQL LA +G+Q
Sbjct: 84  PNIFAWNTIIRGFSRSSVPQNSISLYIDMLLTSPVQPQRLTYPSVFKAFAQLDLASEGAQ 143

Query: 437 LHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHAKNGQ 258
           LHG++IKLGLE+D FIRNTI+FMY NCG+  EA  +F+   DFD+VAWN+MI+G AK G 
Sbjct: 144 LHGKMIKLGLENDSFIRNTILFMYVNCGFTSEARKVFDRGMDFDIVAWNTMIMGVAKCGL 203

Query: 257 IDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLTA 78
           +DESRRLFD M  R+ +SWNSMISGYVRNGR  DA +LF +MQ + IEPSEFT VSLL A
Sbjct: 204 VDESRRLFDKMSLRNAVSWNSMISGYVRNGRFFDALELFQKMQVERIEPSEFTMVSLLNA 263

Query: 77  CAHLGALKQGEWIHAYIKKNNIEVN 3
           CA LGA++QGEWIH Y+ K   E+N
Sbjct: 264 CACLGAIRQGEWIHDYMVKKKFELN 288



 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 59/211 (27%), Positives = 103/211 (48%), Gaps = 5/211 (2%)
 Frame = -1

Query: 662 GDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSL 483
           G ++ +  +F ++   N  +WN++I G+ ++     A+ LF  M     I+P   T  SL
Sbjct: 202 GLVDESRRLFDKMSLRNAVSWNSMISGYVRNGRFFDALELFQKMQVER-IEPSEFTMVSL 260

Query: 482 FKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDV 303
             A A LG    G  +H  ++K   E +P +   II MY+ CG + +A  +F       +
Sbjct: 261 LNACACLGAIRQGEWIHDYMVKKKFELNPIVVTAIIDMYSKCGSIDKAVQVFQSAPRRGL 320

Query: 302 VAWNSMIIGHAKNGQIDESRRLFDMMPSR----STISWNSMISGYVRNGRLKDAFDLFHR 135
             WNSMI+G A NGQ +E+ +LF ++ S       +S+ ++++     G +  A D F  
Sbjct: 321 SCWNSMILGLAMNGQENEALQLFSVLQSSDLRPDDVSFIAVLTACDHTGMVDKAKDYFLL 380

Query: 134 MQNK-GIEPSEFTSVSLLTACAHLGALKQGE 45
           M++K  I+P       ++      G L++ E
Sbjct: 381 MRDKYKIKPGIKHFSCMVDVLGRAGLLEEAE 411


>ref|XP_007141857.1| hypothetical protein PHAVU_008G231600g [Phaseolus vulgaris]
           gi|561014990|gb|ESW13851.1| hypothetical protein
           PHAVU_008G231600g [Phaseolus vulgaris]
          Length = 525

 Score =  370 bits (951), Expect = e-100
 Identities = 178/269 (66%), Positives = 221/269 (82%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFY 630
           +HP L+ML+  CT MKDL K+H ++IKTGL  D IAASRVL+FCA+S  GDINYAYLVF 
Sbjct: 22  DHPCLTMLQNQCTNMKDLQKIHPHIIKTGLALDHIAASRVLTFCASSS-GDINYAYLVFT 80

Query: 629 QIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAH 450
            I +PNL+ WNTIIRGFS+SSTP  AISLF+DMLYS+ ++P RLTYPS+FKAYAQLG  H
Sbjct: 81  GIPNPNLYCWNTIIRGFSRSSTPQFAISLFVDMLYSA-VEPQRLTYPSVFKAYAQLGAGH 139

Query: 449 DGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHA 270
           DG+QLHGR++KLGLE D FI NTI++MY N G + EA  +F+E  + DVVA NSMI+G A
Sbjct: 140 DGAQLHGRVVKLGLEKDQFISNTILYMYANSGLMSEARRVFDEPLELDVVACNSMIMGLA 199

Query: 269 KNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVS 90
           K G++D+SRRLFD MP+R+ +SWNSMISGYVRNGRL +  +LF +MQ +G+EPSEFT VS
Sbjct: 200 KCGEVDKSRRLFDNMPTRTAVSWNSMISGYVRNGRLTEGLELFRKMQEEGVEPSEFTMVS 259

Query: 89  LLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           LL+ACAHLGAL+ GEW+H YIK+ N ++N
Sbjct: 260 LLSACAHLGALQHGEWVHDYIKRGNFKLN 288



 Score = 86.7 bits (213), Expect = 1e-14
 Identities = 58/238 (24%), Positives = 119/238 (50%), Gaps = 4/238 (1%)
 Frame = -1

Query: 752 KLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQHPNLFTWNTIIRGFSQ 573
           +LH  ++K GL KD   ++ +L   A S  G ++ A  VF +    ++   N++I G ++
Sbjct: 143 QLHGRVVKLGLEKDQFISNTILYMYANS--GLMSEARRVFDEPLELDVVACNSMIMGLAK 200

Query: 572 SSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAHDGSQLHGRIIKLGLESDPF 393
                 +  LF +M   + +     ++ S+   Y + G   +G +L  ++ + G+E   F
Sbjct: 201 CGEVDKSRRLFDNMPTRTAV-----SWNSMISGYVRNGRLTEGLELFRKMQEEGVEPSEF 255

Query: 392 IRNTIIFMYTNCGYLIEAGWLFNE----NSDFDVVAWNSMIIGHAKNGQIDESRRLFDMM 225
              +++    + G L    W+ +     N   +V+   ++I  + K G I+++  +F   
Sbjct: 256 TMVSLLSACAHLGALQHGEWVHDYIKRGNFKLNVIVLTAIIDMYCKCGSIEKAVEVFAAS 315

Query: 224 PSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLTACAHLGALKQ 51
           P+R    WNS+I G   NG  ++A + F ++++  I+P   + + +LTAC +LGA+++
Sbjct: 316 PTRGLPCWNSIIIGLALNGHEREAIEYFSKLESSNIKPDCVSFIGVLTACKYLGAVRE 373


>ref|XP_004151347.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
           chloroplastic-like [Cucumis sativus]
           gi|449530724|ref|XP_004172343.1| PREDICTED:
           pentatricopeptide repeat-containing protein At2g42920,
           chloroplastic-like [Cucumis sativus]
          Length = 543

 Score =  370 bits (949), Expect = e-100
 Identities = 177/269 (65%), Positives = 223/269 (82%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFY 630
           N P LSM++K CTTM+DL + HA+LIK+G   ++ AASR+L+FCA+ L G+++YAYLVF 
Sbjct: 23  NQPYLSMVDKYCTTMRDLQQFHAHLIKSGQAIESFAASRILAFCASPL-GNMDYAYLVFL 81

Query: 629 QIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAH 450
           Q+Q+PNLF+WNT+IRGFSQSS P IA+ LFIDML SS ++P RLTYPS+FKAY+QLGLAH
Sbjct: 82  QMQNPNLFSWNTVIRGFSQSSNPQIALYLFIDMLVSSQVEPQRLTYPSIFKAYSQLGLAH 141

Query: 449 DGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHA 270
           DG+QLHGRIIKLGL+ DPFIRNTI++MY   G+L EA  +FN+  +FDVV+WNSMI+G A
Sbjct: 142 DGAQLHGRIIKLGLQFDPFIRNTILYMYATGGFLSEARRIFNQEMEFDVVSWNSMILGLA 201

Query: 269 KNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVS 90
           K G+IDESR+LFD MP ++ ISWNSMI GYVRNG  K+A  LF +MQ + I+PSEFT VS
Sbjct: 202 KCGEIDESRKLFDKMPVKNPISWNSMIGGYVRNGMFKEALKLFIKMQEERIQPSEFTMVS 261

Query: 89  LLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           LL A A +GAL+QG WIH YIKKNN+++N
Sbjct: 262 LLNASAQIGALRQGVWIHEYIKKNNLQLN 290



 Score = 85.1 bits (209), Expect = 4e-14
 Identities = 80/300 (26%), Positives = 128/300 (42%), Gaps = 34/300 (11%)
 Frame = -1

Query: 806  HPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCAT--------------- 672
            +PS+           D  +LH  +IK GL  D    + +L   AT               
Sbjct: 127  YPSIFKAYSQLGLAHDGAQLHGRIIKLGLQFDPFIRNTILYMYATGGFLSEARRIFNQEM 186

Query: 671  --------------SLVGDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFID 534
                          +  G+I+ +  +F ++   N  +WN++I G+ ++     A+ LFI 
Sbjct: 187  EFDVVSWNSMILGLAKCGEIDESRKLFDKMPVKNPISWNSMIGGYVRNGMFKEALKLFIK 246

Query: 533  MLYSSPIQPHRLTYPSLFKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCG 354
            M     IQP   T  SL  A AQ+G    G  +H  I K  L+ +  +   II MY  CG
Sbjct: 247  M-QEERIQPSEFTMVSLLNASAQIGALRQGVWIHEYIKKNNLQLNAIVVTAIIDMYCKCG 305

Query: 353  YLIEAGWLFNENSDFDVVAWNSMIIGHAKNGQIDESRRLFDMMPSRS----TISWNSMIS 186
             +  A  +F +     + +WNSMI G A NG   E+  +F M+ S S     IS+ ++++
Sbjct: 306  SIGNALQVFEKIPCRSLSSWNSMIFGLAVNGCEKEAILVFKMLESSSLKPDCISFMAVLT 365

Query: 185  GYVRNGRLKDAFDLFHRMQNK-GIEPSEFTSVSLLTACAHLGALKQGEWIHAYIKKNNIE 9
                   + +  + F RM+N   IEPS      ++   +  G L++ E    +IK   IE
Sbjct: 366  ACNHGAMVDEGMEFFSRMKNTYRIEPSIKHYNLMVDMISRAGFLEEAE---QFIKTMPIE 422


>gb|EXB44509.1| hypothetical protein L484_000760 [Morus notabilis]
           gi|587904202|gb|EXB92403.1| hypothetical protein
           L484_021387 [Morus notabilis]
          Length = 530

 Score =  369 bits (947), Expect = e-99
 Identities = 182/269 (67%), Positives = 220/269 (81%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFY 630
           + P LSMLEK C TM DL K+HA+LIKTGLI  TIA+SR+L+FCA S  G+INYA +VF 
Sbjct: 25  DQPHLSMLEKRCATMSDLRKIHAHLIKTGLISHTIASSRLLAFCA-SPAGNINYALMVFS 83

Query: 629 QIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAH 450
           QIQ+PNLF WNTIIRGFS+SSTP  AI LFIDML  SP++P RLTYPS+FKAYAQLGLA 
Sbjct: 84  QIQNPNLFIWNTIIRGFSRSSTPQTAIFLFIDMLVGSPLEPQRLTYPSVFKAYAQLGLAC 143

Query: 449 DGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHA 270
            G+QLHGR+IKLGL+ D F+RNTII MY NCG+L EA  LF+E+S+ D+VAWNSMI+G +
Sbjct: 144 FGAQLHGRVIKLGLDCDRFVRNTIIHMYINCGFLSEARQLFDESSELDLVAWNSMIMGLS 203

Query: 269 KNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVS 90
           K G++ ESRRLFD MP R+++SWNSMISGYVRNG+  +A +LF +MQ +GI+ SEFT VS
Sbjct: 204 KCGEVGESRRLFDRMPLRNSVSWNSMISGYVRNGKCVEALELFGKMQGEGIKASEFTMVS 263

Query: 89  LLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           LL A   LGA++QGEWIH YI KN IE+N
Sbjct: 264 LLNASGRLGAIRQGEWIHEYITKNGIELN 292



 Score = 78.6 bits (192), Expect = 3e-12
 Identities = 59/215 (27%), Positives = 100/215 (46%), Gaps = 6/215 (2%)
 Frame = -1

Query: 671 SLVGDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTY 492
           S  G++  +  +F ++   N  +WN++I G+ ++     A+ LF  M     I+    T 
Sbjct: 203 SKCGEVGESRRLFDRMPLRNSVSWNSMISGYVRNGKCVEALELFGKM-QGEGIKASEFTM 261

Query: 491 PSLFKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSD 312
            SL  A  +LG    G  +H  I K G+E +  +   II MY  CG + +A  +F     
Sbjct: 262 VSLLNASGRLGAIRQGEWIHEYITKNGIELNVIVVTAIIDMYCKCGSVNKALSVFKTAPK 321

Query: 311 FDVVAWNSMIIGHAKNGQIDESRRLFDMMPSR-----STISWNSMISGYVRNGRLKDAFD 147
             +  WNSM++G A NG  +E+  LF  + S        +S+ ++++    +G +  A D
Sbjct: 322 LGLSCWNSMVMGLAMNGCEEEALELFSRLESSIDLRPDGVSFLAVLTACNHSGMVDKARD 381

Query: 146 LFHRMQNK-GIEPSEFTSVSLLTACAHLGALKQGE 45
            F  M+ K  IEPS      ++      G L++ E
Sbjct: 382 YFSLMRGKYNIEPSTRHYSCMVDVLGKAGHLEEAE 416


>ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
           chloroplastic-like [Glycine max]
          Length = 534

 Score =  363 bits (932), Expect = 5e-98
 Identities = 177/269 (65%), Positives = 220/269 (81%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFY 630
           + P L+ML+  CT MKDL K+HA++IKTGL   T+AASRVL+FCA+S  GDINYAYL+F 
Sbjct: 24  DQPCLTMLQTQCTNMKDLQKIHAHIIKTGLAHHTVAASRVLTFCASSS-GDINYAYLLFT 82

Query: 629 QIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAH 450
            I  PNL+ WNTIIRGFS+SSTPH+AISLF+DML SS + P RLTYPS+FKAYAQLG  +
Sbjct: 83  TIPSPNLYCWNTIIRGFSRSSTPHLAISLFVDMLCSS-VLPQRLTYPSVFKAYAQLGAGY 141

Query: 449 DGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHA 270
           DG+QLHGR++KLGLE D FI+NTII+MY N G L EA  +F+E  D DVVA NSMI+G A
Sbjct: 142 DGAQLHGRVVKLGLEKDQFIQNTIIYMYANSGLLSEARRVFDELVDLDVVACNSMIMGLA 201

Query: 269 KNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVS 90
           K G++D+SRRLFD MP+R+ ++WNSMISGYVRN RL +A +LF +MQ + +EPSEFT VS
Sbjct: 202 KCGEVDKSRRLFDNMPTRTRVTWNSMISGYVRNKRLMEALELFRKMQGERVEPSEFTMVS 261

Query: 89  LLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           LL+ACAHLGALK GEW+H Y+K+ + E+N
Sbjct: 262 LLSACAHLGALKHGEWVHDYVKRGHFELN 290



 Score = 77.0 bits (188), Expect = 1e-11
 Identities = 61/278 (21%), Positives = 110/278 (39%), Gaps = 30/278 (10%)
 Frame = -1

Query: 752 KLHANLIKTGLIKDTIAASRVLSFCATSLV-----------------------------G 660
           +LH  ++K GL KD    + ++   A S +                             G
Sbjct: 145 QLHGRVVKLGLEKDQFIQNTIIYMYANSGLLSEARRVFDELVDLDVVACNSMIMGLAKCG 204

Query: 659 DINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLF 480
           +++ +  +F  +      TWN++I G+ ++     A+ LF  M     ++P   T  SL 
Sbjct: 205 EVDKSRRLFDNMPTRTRVTWNSMISGYVRNKRLMEALELFRKM-QGERVEPSEFTMVSLL 263

Query: 479 KAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVV 300
            A A LG    G  +H  + +   E +  +   II MY  CG +++A             
Sbjct: 264 SACAHLGALKHGEWVHDYVKRGHFELNVIVLTAIIDMYCKCGVIVKA------------- 310

Query: 299 AWNSMIIGHAKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKG 120
                               +F+  P+R    WNS+I G   NG  + A + F +++   
Sbjct: 311 ------------------IEVFEASPTRGLSCWNSIIIGLALNGYERKAIEYFSKLEASD 352

Query: 119 IEPSEFTSVSLLTACAHLGAL-KQGEWIHAYIKKNNIE 9
           ++P   + + +LTAC ++GA+ K  ++    + K  IE
Sbjct: 353 LKPDHVSFIGVLTACKYIGAVGKARDYFSLMMNKYEIE 390


>ref|XP_004250888.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
           chloroplastic-like [Solanum lycopersicum]
          Length = 522

 Score =  362 bits (928), Expect = 2e-97
 Identities = 176/270 (65%), Positives = 221/270 (81%), Gaps = 1/270 (0%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSL-VGDINYAYLVF 633
           + P L MLE  CTTM DL K+HA+LIK+GLIKD IAASRVL+F A S  +GDINYA LVF
Sbjct: 17  DQPYLHMLETKCTTMTDLKKIHAHLIKSGLIKDKIAASRVLAFSAKSPPIGDINYANLVF 76

Query: 632 YQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLA 453
             I++PN FTWNTIIRGFS+SSTP  AI LFI+ML +S +QPH LTYPS+FKAYA+ G+A
Sbjct: 77  THIENPNPFTWNTIIRGFSESSTPQYAIHLFIEMLNNSQVQPHLLTYPSVFKAYARGGIA 136

Query: 452 HDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGH 273
            +G+QLHGRI+KLGLE D FIRNT+++MY +CG+L+EA  LF+E+   DVV+WNSMIIG 
Sbjct: 137 KNGAQLHGRIMKLGLEFDTFIRNTLLYMYASCGFLVEARKLFDEDEIEDVVSWNSMIIGL 196

Query: 272 AKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSV 93
           AK+G+ID+S RLF  MP+R+ +SWNSMISG+VRNG+  +A +LF  MQ + ++PSEFT V
Sbjct: 197 AKSGEIDDSWRLFSKMPTRNDVSWNSMISGFVRNGKWNEALELFSTMQEENVKPSEFTLV 256

Query: 92  SLLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           SLL AC HLGAL+QG WI+ Y+KKNN+E+N
Sbjct: 257 SLLNACGHLGALEQGNWIYKYVKKNNVELN 286



 Score = 80.1 bits (196), Expect = 1e-12
 Identities = 58/209 (27%), Positives = 110/209 (52%), Gaps = 5/209 (2%)
 Frame = -1

Query: 719 IKDTIAASRVLSFCATSLVGDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLF 540
           I+D ++ + ++   A S  G+I+ ++ +F ++   N  +WN++I GF ++   + A+ LF
Sbjct: 183 IEDVVSWNSMIIGLAKS--GEIDDSWRLFSKMPTRNDVSWNSMISGFVRNGKWNEALELF 240

Query: 539 IDMLYSSPIQPHRLTYPSLFKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTN 360
             M   + ++P   T  SL  A   LG    G+ ++  + K  +E +  +   II MY  
Sbjct: 241 STMQEEN-VKPSEFTLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCK 299

Query: 359 CGYLIEAGWLFNENSDFDVVAWNSMIIGHAKNGQIDESRRLFDMMP----SRSTISWNSM 192
           C  +  A  +F  +S+  + +WNSMI+G A NG  D++ +LF  +        ++S+  +
Sbjct: 300 CANVEMAWHVFVSSSNKGLSSWNSMILGLATNGFEDDAIKLFARLQCSILKPDSVSFIGV 359

Query: 191 ISGYVRNGRLKDAFDLFHRMQNK-GIEPS 108
           ++    +G ++ A D F  M+ + GIEPS
Sbjct: 360 LTACNHSGLVEKAKDYFQLMKMEYGIEPS 388


>ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920,
           chloroplastic-like [Solanum tuberosum]
          Length = 522

 Score =  359 bits (922), Expect = 8e-97
 Identities = 176/270 (65%), Positives = 220/270 (81%), Gaps = 1/270 (0%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSL-VGDINYAYLVF 633
           + P L MLE  CTTM DL K+HA+LIK+GLIKD IA+SRVL+F A S  +GDINYA LVF
Sbjct: 17  DQPYLHMLETKCTTMTDLKKIHAHLIKSGLIKDKIASSRVLAFSAKSPPIGDINYANLVF 76

Query: 632 YQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLA 453
             I++PNLFTWNTIIRGFS+SSTP  AI LFI+ML +S +QPH LTYPS+FKAYA+ GL 
Sbjct: 77  THIENPNLFTWNTIIRGFSESSTPQYAIHLFIEMLNNSQVQPHLLTYPSVFKAYARGGLV 136

Query: 452 HDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGH 273
            +G+QLHGRIIKLGLE D FIRNT+++MY +CG+L+EA  LF+E+   DVV+WNSMI+G 
Sbjct: 137 KNGAQLHGRIIKLGLEFDTFIRNTMLYMYASCGFLVEARKLFDEDEIEDVVSWNSMIMGL 196

Query: 272 AKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSV 93
           AK+G+ID+S RLF  M +R+ +SWNSMISG+VRNG+  +A +LF  MQ + I+PSEFT V
Sbjct: 197 AKSGEIDDSWRLFSKMSTRNDVSWNSMISGFVRNGKWNEALELFSTMQEENIKPSEFTLV 256

Query: 92  SLLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           SLL AC HLGAL+QG WI+ Y+KKNN+E+N
Sbjct: 257 SLLNACGHLGALEQGNWIYKYVKKNNVELN 286



 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 63/256 (24%), Positives = 126/256 (49%), Gaps = 7/256 (2%)
 Frame = -1

Query: 767 MKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQHPNLFTWNTII 588
           +K+  +LH  +IK GL  DT   + +L   A+   G +  A  +F + +  ++ +WN++I
Sbjct: 136 VKNGAQLHGRIIKLGLEFDTFIRNTMLYMYASC--GFLVEARKLFDEDEIEDVVSWNSMI 193

Query: 587 RGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAHDGSQLHGRIIKLGL 408
            G ++S     +  LF  M   + +     ++ S+   + + G  ++  +L   + +  +
Sbjct: 194 MGLAKSGEIDDSWRLFSKMSTRNDV-----SWNSMISGFVRNGKWNEALELFSTMQEENI 248

Query: 407 ESDPFIRNTIIFMYTNCGYL--IEAG-WLFN----ENSDFDVVAWNSMIIGHAKNGQIDE 249
           +   F   T++ +   CG+L  +E G W++      N + +V+   ++I  + K G ++ 
Sbjct: 249 KPSEF---TLVSLLNACGHLGALEQGNWIYKYVKKNNVELNVIVVTAIIDMYCKCGNVEM 305

Query: 248 SRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLTACAH 69
           +  +F  + ++   SWNSMI G   NG   DA  LF R+Q   ++P   + + +LTAC H
Sbjct: 306 AWHVFISISNKGLSSWNSMILGLATNGFEDDAIKLFARLQCSILKPDSVSFIGVLTACNH 365

Query: 68  LGALKQGEWIHAYIKK 21
            G + + +     +KK
Sbjct: 366 SGLVDKAKDYFQLMKK 381


>gb|EYU29595.1| hypothetical protein MIMGU_mgv1a025435mg [Mimulus guttatus]
          Length = 505

 Score =  349 bits (896), Expect = 8e-94
 Identities = 176/273 (64%), Positives = 211/273 (77%), Gaps = 4/273 (1%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCAT-SLVGDINYAYLVF 633
           + P LS+LE +C T+KDL K+HA LIKTGL KDTIA SR+L+FCA      D++YA+ VF
Sbjct: 4   DQPFLSLLETNCHTIKDLTKIHAQLIKTGLAKDTIAVSRILAFCAAPGPARDLDYAFSVF 63

Query: 632 YQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLA 453
             I+ PNLFTWNTIIRGF QSS PH+AISLF+DML +S ++P  LTYPS+FKAY QLGLA
Sbjct: 64  SHIEKPNLFTWNTIIRGFCQSSHPHVAISLFVDMLTNSTLEPENLTYPSVFKAYTQLGLA 123

Query: 452 HDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGH 273
            DG+QLHGRIIKLG E DPFIRN+II MY +CG    A  LF+E+ D DVVAWNSM++G 
Sbjct: 124 GDGAQLHGRIIKLGFEHDPFIRNSIIHMYADCGLFGSARKLFDEDEDTDVVAWNSMVMGL 183

Query: 272 AKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSV 93
           AK G++DES RLF  +P R+ ISWN+MISGYVRNG+  DA  LF  MQ + I PSEFT V
Sbjct: 184 AKCGEVDESWRLFCKIPCRNDISWNTMISGYVRNGKWVDALSLFAEMQQRQIRPSEFTLV 243

Query: 92  SLLTACAHLGALKQGEWIHAYIKK---NNIEVN 3
           S+L ACA LGAL+QG+WIH YIKK   NNI+ N
Sbjct: 244 SMLNACAKLGALEQGKWIHRYIKKSDINNIDRN 276



 Score = 77.0 bits (188), Expect = 1e-11
 Identities = 55/199 (27%), Positives = 92/199 (46%), Gaps = 1/199 (0%)
 Frame = -1

Query: 662 GDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSL 483
           G+++ ++ +F +I   N  +WNT+I G+ ++     A+SLF +M     I+P   T  S+
Sbjct: 187 GEVDESWRLFCKIPCRNDISWNTMISGYVRNGKWVDALSLFAEM-QQRQIRPSEFTLVSM 245

Query: 482 FKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDV 303
             A A+LG    G  +H  I K  + +    RNTI+                        
Sbjct: 246 LNACAKLGALEQGKWIHRYIKKSDINN--IDRNTIVV----------------------- 280

Query: 302 VAWNSMIIGHAKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRM-QN 126
               ++I  + K G I  +R +F+  P ++   WNSMI G   NG  ++AF LF  + Q+
Sbjct: 281 ---TAIIDMYCKCGDIKTAREVFESTPQKALSGWNSMILGLATNGFEEEAFQLFTELEQS 337

Query: 125 KGIEPSEFTSVSLLTACAH 69
             + P   + + +LTA  H
Sbjct: 338 SNLNPDSVSFIGVLTASNH 356


>ref|XP_002880012.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297325851|gb|EFH56271.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 542

 Score =  339 bits (870), Expect = 8e-91
 Identities = 167/266 (62%), Positives = 206/266 (77%), Gaps = 1/266 (0%)
 Frame = -1

Query: 797 LSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQH 618
           L +++  C+TM++L ++HANLIKTGLI DT+AASRVL+FC  S   D NYAYLVF +I H
Sbjct: 28  LRLIDTRCSTMRELKQIHANLIKTGLISDTVAASRVLAFCCAS-PSDRNYAYLVFTRINH 86

Query: 617 PNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSP-IQPHRLTYPSLFKAYAQLGLAHDGS 441
            N F WNTIIRGFS+SS P +AIS+FIDML SSP ++P RLTYPS+FKAYA LGLA DG 
Sbjct: 87  KNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKAYASLGLARDGR 146

Query: 440 QLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHAKNG 261
           QLHGR+IK GLE D FIRNT++ MY  CG L+EA  LF     FDVVAWNS+I+G AK G
Sbjct: 147 QLHGRVIKEGLEDDSFIRNTMLHMYVTCGCLVEAWRLFVGMMGFDVVAWNSIIMGLAKCG 206

Query: 260 QIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLT 81
            ID++++LFD MP R+ +SWNSMISG+VRNGR KDA ++F  MQ + ++P  FT VSLL 
Sbjct: 207 LIDQAQKLFDEMPQRNGVSWNSMISGFVRNGRFKDALEMFREMQERDVKPDGFTMVSLLN 266

Query: 80  ACAHLGALKQGEWIHAYIKKNNIEVN 3
           ACA+LGA +QG WIH YI +N  E+N
Sbjct: 267 ACAYLGASEQGRWIHKYIVRNRFELN 292



 Score = 94.4 bits (233), Expect = 6e-17
 Identities = 71/274 (25%), Positives = 129/274 (47%), Gaps = 8/274 (2%)
 Frame = -1

Query: 806 HPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQ 627
           +PS+     S    +D  +LH  +IK GL  D+   + +L    T   G +  A+ +F  
Sbjct: 129 YPSVFKAYASLGLARDGRQLHGRVIKEGLEDDSFIRNTMLHMYVTC--GCLVEAWRLFVG 186

Query: 626 IQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAHD 447
           +   ++  WN+II G ++      A  LF +M      Q + +++ S+   + + G   D
Sbjct: 187 MMGFDVVAWNSIIMGLAKCGLIDQAQKLFDEMP-----QRNGVSWNSMISGFVRNGRFKD 241

Query: 446 GSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYL--IEAG-----WLFNENSDFDVVAWNS 288
             ++   + +  ++ D F   T++ +   C YL   E G     ++     + + +   +
Sbjct: 242 ALEMFREMQERDVKPDGF---TMVSLLNACAYLGASEQGRWIHKYIVRNRFELNSIVITA 298

Query: 287 MIIGHAKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPS 108
           +I  + K G  +E  ++F+  P++    WNSMI G   NG  + A DLF  ++  G+EP 
Sbjct: 299 LIDMYCKCGCFEEGLKVFECAPTKQLSCWNSMILGLANNGCEERAMDLFLELERTGLEPD 358

Query: 107 EFTSVSLLTACAHLGAL-KQGEWIHAYIKKNNIE 9
             + + +LTACAH G + K GE+     +K  IE
Sbjct: 359 SVSFIGVLTACAHSGEVHKAGEFFRLMREKYMIE 392


>ref|NP_181820.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75206274|sp|Q9SJG6.1|PP200_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At2g42920, chloroplastic; Flags: Precursor
           gi|4512663|gb|AAD21717.1| hypothetical protein
           [Arabidopsis thaliana] gi|20197867|gb|AAM15291.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|110738441|dbj|BAF01146.1| hypothetical protein
           [Arabidopsis thaliana] gi|330255093|gb|AEC10187.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 559

 Score =  335 bits (860), Expect = 1e-89
 Identities = 165/266 (62%), Positives = 203/266 (76%), Gaps = 1/266 (0%)
 Frame = -1

Query: 797 LSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQH 618
           L +++  C+TM++L ++HA+LIKTGLI DT+ ASRVL+FC  S   D+NYAYLVF +I H
Sbjct: 28  LRLIDTQCSTMRELKQIHASLIKTGLISDTVTASRVLAFCCAS-PSDMNYAYLVFTRINH 86

Query: 617 PNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSP-IQPHRLTYPSLFKAYAQLGLAHDGS 441
            N F WNTIIRGFS+SS P +AIS+FIDML SSP ++P RLTYPS+FKAY +LG A DG 
Sbjct: 87  KNPFVWNTIIRGFSRSSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKAYGRLGQARDGR 146

Query: 440 QLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHAKNG 261
           QLHG +IK GLE D FIRNT++ MY  CG LIEA  +F     FDVVAWNSMI+G AK G
Sbjct: 147 QLHGMVIKEGLEDDSFIRNTMLHMYVTCGCLIEAWRIFLGMIGFDVVAWNSMIMGFAKCG 206

Query: 260 QIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLT 81
            ID+++ LFD MP R+ +SWNSMISG+VRNGR KDA D+F  MQ K ++P  FT VSLL 
Sbjct: 207 LIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGFTMVSLLN 266

Query: 80  ACAHLGALKQGEWIHAYIKKNNIEVN 3
           ACA+LGA +QG WIH YI +N  E+N
Sbjct: 267 ACAYLGASEQGRWIHEYIVRNRFELN 292



 Score = 90.5 bits (223), Expect = 9e-16
 Identities = 64/255 (25%), Positives = 121/255 (47%), Gaps = 7/255 (2%)
 Frame = -1

Query: 764 KDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQHPNLFTWNTIIR 585
           +D  +LH  +IK GL  D+   + +L    T   G +  A+ +F  +   ++  WN++I 
Sbjct: 143 RDGRQLHGMVIKEGLEDDSFIRNTMLHMYVTC--GCLIEAWRIFLGMIGFDVVAWNSMIM 200

Query: 584 GFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAHDGSQLHGRIIKLGLE 405
           GF++      A +LF +M      Q + +++ S+   + + G   D   +   + +  ++
Sbjct: 201 GFAKCGLIDQAQNLFDEMP-----QRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVK 255

Query: 404 SDPFIRNTIIFMYTNCGYL--IEAG-----WLFNENSDFDVVAWNSMIIGHAKNGQIDES 246
            D F   T++ +   C YL   E G     ++     + + +   ++I  + K G I+E 
Sbjct: 256 PDGF---TMVSLLNACAYLGASEQGRWIHEYIVRNRFELNSIVVTALIDMYCKCGCIEEG 312

Query: 245 RRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLTACAHL 66
             +F+  P +    WNSMI G   NG  + A DLF  ++  G+EP   + + +LTACAH 
Sbjct: 313 LNVFECAPKKQLSCWNSMILGLANNGFEERAMDLFSELERSGLEPDSVSFIGVLTACAHS 372

Query: 65  GALKQGEWIHAYIKK 21
           G + + +     +K+
Sbjct: 373 GEVHRADEFFRLMKE 387


>ref|XP_007216544.1| hypothetical protein PRUPE_ppb007734mg [Prunus persica]
           gi|462412694|gb|EMJ17743.1| hypothetical protein
           PRUPE_ppb007734mg [Prunus persica]
          Length = 297

 Score =  334 bits (856), Expect = 3e-89
 Identities = 162/237 (68%), Positives = 198/237 (83%)
 Frame = -1

Query: 803 PSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQI 624
           P LSMLEK CT MKDL K+HA+LIKTGL+ DT+AASRVL+FCA S  G+INYAY+VF  I
Sbjct: 22  PHLSMLEKQCTNMKDLQKIHAHLIKTGLVSDTVAASRVLAFCA-SPAGNINYAYMVFRNI 80

Query: 623 QHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAHDG 444
           Q+PNLF WNTIIRGFS+S +P IAISLFIDML +S I+PHRLTYPS+FKAYAQLGLA DG
Sbjct: 81  QNPNLFIWNTIIRGFSESPSPEIAISLFIDMLVTSAIEPHRLTYPSVFKAYAQLGLAQDG 140

Query: 443 SQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHAKN 264
           +QLHGRI+KLGLESD FIRNTII MY NCG+LIEA  +F+E+ + D VAWNSMI+G +K 
Sbjct: 141 AQLHGRILKLGLESDQFIRNTIIHMYANCGFLIEARRMFDEDLECDTVAWNSMIMGLSKW 200

Query: 263 GQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSV 93
           G++ E++RLFD    R+++SWNSMISG+VRNG+  +A +LF  MQ + ++PSEFT V
Sbjct: 201 GEVSEAKRLFDKFSLRNSVSWNSMISGFVRNGKYTEALELFSEMQEERVKPSEFTMV 257


>ref|XP_003617444.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355518779|gb|AET00403.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 542

 Score =  334 bits (856), Expect = 3e-89
 Identities = 167/273 (61%), Positives = 217/273 (79%), Gaps = 4/273 (1%)
 Frame = -1

Query: 809 NHPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFY 630
           NHP L+ML+  CTT+   H+++ ++IKTGL  + IA++R L+FCA S  G+INYAY +F 
Sbjct: 27  NHPCLTMLQNHCTTINHFHQIYPHIIKTGLTLNPIASTRALTFCA-SPSGNINYAYKLFV 85

Query: 629 QIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAH 450
           ++ +PNL++WNTIIR FS+SSTP  AISLF+DMLYS  IQP  LTYPS+FKAYAQLG AH
Sbjct: 86  RMPNPNLYSWNTIIRAFSRSSTPQFAISLFVDMLYSQ-IQPQYLTYPSVFKAYAQLGHAH 144

Query: 449 DGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFN----ENSDFDVVAWNSMI 282
            G+QLHGR++KLGL++D FI NTII+MY N G + EA  +F+    E  D DVVA NSMI
Sbjct: 145 YGAQLHGRVVKLGLQNDQFICNTIIYMYANGGLMSEARRVFDGKKLELYDHDVVAINSMI 204

Query: 281 IGHAKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEF 102
           +G+AK G+IDESR LFD M +R+++SWNSMISGYVRNG+L +A +LF++MQ +G E SEF
Sbjct: 205 MGYAKCGEIDESRNLFDDMITRTSVSWNSMISGYVRNGKLMEALELFNKMQVEGFEVSEF 264

Query: 101 TSVSLLTACAHLGALKQGEWIHAYIKKNNIEVN 3
           T VSLL ACAHLGAL+ G+W+H YIK+N+ E+N
Sbjct: 265 TMVSLLNACAHLGALQHGKWVHDYIKRNHFELN 297



 Score = 68.2 bits (165), Expect = 5e-09
 Identities = 57/212 (26%), Positives = 91/212 (42%), Gaps = 6/212 (2%)
 Frame = -1

Query: 662 GDINYAYLVFYQIQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSL 483
           G+I+ +  +F  +      +WN++I G+ ++     A+ LF  M      +    T  SL
Sbjct: 211 GEIDESRNLFDDMITRTSVSWNSMISGYVRNGKLMEALELFNKMQVEG-FEVSEFTMVSL 269

Query: 482 FKAYAQLGLAHDGSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDV 303
             A A LG    G  +H  I +   E +  +   II MY  CG +  A  +F       +
Sbjct: 270 LNACAHLGALQHGKWVHDYIKRNHFELNVIVVTAIIDMYCKCGSVENAVEVFETCPRRGL 329

Query: 302 VAWNSMIIGHAKNGQIDESRRLFDMMPSR-----STISWNSMISGYVRNGRLKDAFDLFH 138
             WNS+IIG A NG   E+   F  + S       ++S+  +++     G +  A D F 
Sbjct: 330 SCWNSIIIGLAMNGHEREAFEFFSKLESSKLLKPDSVSFIGVLTACKHLGAINKARDYFE 389

Query: 137 RMQNK-GIEPSEFTSVSLLTACAHLGALKQGE 45
            M NK  IEPS      ++      G L++ E
Sbjct: 390 LMMNKYEIEPSIKHYTCIVDVLGQAGLLEEAE 421


>ref|XP_006293939.1| hypothetical protein CARUB_v10022931mg [Capsella rubella]
           gi|565472276|ref|XP_006293940.1| hypothetical protein
           CARUB_v10022931mg [Capsella rubella]
           gi|482562647|gb|EOA26837.1| hypothetical protein
           CARUB_v10022931mg [Capsella rubella]
           gi|482562648|gb|EOA26838.1| hypothetical protein
           CARUB_v10022931mg [Capsella rubella]
          Length = 555

 Score =  331 bits (849), Expect = 2e-88
 Identities = 162/266 (60%), Positives = 204/266 (76%), Gaps = 1/266 (0%)
 Frame = -1

Query: 797 LSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQH 618
           L +++  C+TM++L ++H NLIKTGLI DT+AASRVL+FC  S   D+NYAYLVF +I H
Sbjct: 23  LRLIDTQCSTMRELKQIHGNLIKTGLISDTVAASRVLAFCCAS-PSDMNYAYLVFTRINH 81

Query: 617 PNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSP-IQPHRLTYPSLFKAYAQLGLAHDGS 441
            N F WNTIIRGFSQSS P +AIS+FIDML SSP ++P  LTYPS+FKAY +LG A DG 
Sbjct: 82  KNPFVWNTIIRGFSQSSFPEMAISIFIDMLCSSPSVKPQNLTYPSVFKAYGRLGQAIDGR 141

Query: 440 QLHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHAKNG 261
           QLHGR++K GLE D FIRNT++ MY   G L+EA  +F   +DFDVVAWNSMI+G AK G
Sbjct: 142 QLHGRVLKEGLEDDSFIRNTMLQMYVTSGCLVEAWRIFVGMTDFDVVAWNSMIMGLAKCG 201

Query: 260 QIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLT 81
            I ++++LFD MP R+ +SWNSMISG+VRNGR KDA ++F  MQ + ++P  FT VSLL 
Sbjct: 202 LISQAQQLFDEMPHRNEVSWNSMISGFVRNGRFKDALEMFREMQERNVKPDGFTMVSLLN 261

Query: 80  ACAHLGALKQGEWIHAYIKKNNIEVN 3
           ACA+LGA +QG WIH YI +N  E+N
Sbjct: 262 ACAYLGANEQGRWIHEYIARNRFELN 287



 Score = 89.4 bits (220), Expect = 2e-15
 Identities = 66/261 (25%), Positives = 123/261 (47%), Gaps = 10/261 (3%)
 Frame = -1

Query: 761 DLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQHPNLFTWNTIIRG 582
           D  +LH  ++K GL  D+   + +L    TS  G +  A+ +F  +   ++  WN++I G
Sbjct: 139 DGRQLHGRVLKEGLEDDSFIRNTMLQMYVTS--GCLVEAWRIFVGMTDFDVVAWNSMIMG 196

Query: 581 FSQSSTPHIAISLFIDMLYSSPIQPHR--LTYPSLFKAYAQLGLAHDGSQLHGRIIKLGL 408
            ++      A  LF +M       PHR  +++ S+   + + G   D  ++   + +  +
Sbjct: 197 LAKCGLISQAQQLFDEM-------PHRNEVSWNSMISGFVRNGRFKDALEMFREMQERNV 249

Query: 407 ESDPFIRNTIIFMYTNCGYL---IEAGWLFN----ENSDFDVVAWNSMIIGHAKNGQIDE 249
           + D F   T++ +   C YL    +  W+         + + +   ++I  + K G I+E
Sbjct: 250 KPDGF---TMVSLLNACAYLGANEQGRWIHEYIARNRFELNSIVITALIEMYCKCGCIEE 306

Query: 248 SRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLTACAH 69
             ++F+  P +    WNSMI G   NG  + A DLF  ++  G+EP   + + +LTACA+
Sbjct: 307 GLKVFECAPKKQLSCWNSMILGLANNGCEERAMDLFLELERFGLEPDSVSFIGVLTACAY 366

Query: 68  LGAL-KQGEWIHAYIKKNNIE 9
            G + K G +     +K  +E
Sbjct: 367 SGEVHKAGGFFRLMREKYMVE 387


>ref|XP_006411565.1| hypothetical protein EUTSA_v10017572mg [Eutrema salsugineum]
           gi|557112734|gb|ESQ53018.1| hypothetical protein
           EUTSA_v10017572mg [Eutrema salsugineum]
          Length = 546

 Score =  329 bits (843), Expect = 1e-87
 Identities = 156/265 (58%), Positives = 203/265 (76%)
 Frame = -1

Query: 797 LSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQIQH 618
           + +++  C+TM++L ++HANLIKTGLI DTIAASRVL+FC TS   D++YAYL+F +I H
Sbjct: 28  IRLIDTQCSTMRELKQIHANLIKTGLISDTIAASRVLAFCCTS-PSDMSYAYLLFTRINH 86

Query: 617 PNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAHDGSQ 438
            N F WNTIIRGFS+SS P ++I++FIDM  S+  +P RLTYPS+FKAYA LG A DG Q
Sbjct: 87  KNPFVWNTIIRGFSRSSFPEMSITIFIDMFSSASAKPQRLTYPSVFKAYASLGKARDGMQ 146

Query: 437 LHGRIIKLGLESDPFIRNTIIFMYTNCGYLIEAGWLFNENSDFDVVAWNSMIIGHAKNGQ 258
           LHG +IK GLE D FIRNT++ MY  CG  +EA  +F     FDVVAWNSM++G A+ G 
Sbjct: 147 LHGMVIKEGLEDDSFIRNTMLHMYATCGCFVEAWRIFMAMKHFDVVAWNSMMMGLARYGL 206

Query: 257 IDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPSEFTSVSLLTA 78
           I+++++LFD MP R+ ISWNSMISG+V+NGR KDA ++F +MQ + ++P  FT VSLL A
Sbjct: 207 IEQAQKLFDEMPQRNEISWNSMISGFVKNGRFKDALEMFRKMQERNVKPDGFTMVSLLNA 266

Query: 77  CAHLGALKQGEWIHAYIKKNNIEVN 3
           CA+LGA +QG WIH YI KN  E+N
Sbjct: 267 CAYLGASEQGRWIHEYIVKNRFELN 291



 Score = 96.7 bits (239), Expect = 1e-17
 Identities = 69/274 (25%), Positives = 133/274 (48%), Gaps = 8/274 (2%)
 Frame = -1

Query: 806 HPSLSMLEKSCTTMKDLHKLHANLIKTGLIKDTIAASRVLSFCATSLVGDINYAYLVFYQ 627
           +PS+     S    +D  +LH  +IK GL  D+   + +L   AT   G    A+ +F  
Sbjct: 128 YPSVFKAYASLGKARDGMQLHGMVIKEGLEDDSFIRNTMLHMYATC--GCFVEAWRIFMA 185

Query: 626 IQHPNLFTWNTIIRGFSQSSTPHIAISLFIDMLYSSPIQPHRLTYPSLFKAYAQLGLAHD 447
           ++H ++  WN+++ G ++      A  LF +M      Q + +++ S+   + + G   D
Sbjct: 186 MKHFDVVAWNSMMMGLARYGLIEQAQKLFDEMP-----QRNEISWNSMISGFVKNGRFKD 240

Query: 446 GSQLHGRIIKLGLESDPFIRNTIIFMYTNCGYL--IEAG-----WLFNENSDFDVVAWNS 288
             ++  ++ +  ++ D F   T++ +   C YL   E G     ++     + + +   +
Sbjct: 241 ALEMFRKMQERNVKPDGF---TMVSLLNACAYLGASEQGRWIHEYIVKNRFELNSIVITA 297

Query: 287 MIIGHAKNGQIDESRRLFDMMPSRSTISWNSMISGYVRNGRLKDAFDLFHRMQNKGIEPS 108
           +I  + K G I+E  R+F+  P++    WNSM+ G   NG  + A DLF  +++  +EP 
Sbjct: 298 LIDMYCKCGCIEEGLRVFESAPNKQLSCWNSMVLGLANNGYEERAMDLFSELESSDLEPD 357

Query: 107 EFTSVSLLTACAHLGALKQ-GEWIHAYIKKNNIE 9
             + + +LTACA+ G + + GE+     +K  IE
Sbjct: 358 SVSFIGVLTACAYSGKVDEAGEFFRLMREKYLIE 391

