BLASTX nr result

ID: Ephedra26_contig00020828 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00020828
         (942 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006484205.1| PREDICTED: pentatricopeptide repeat-containi...   112   2e-22
ref|XP_002887217.1| pentatricopeptide repeat-containing protein ...   112   3e-22
emb|CBI27289.3| unnamed protein product [Vitis vinifera]              111   3e-22
ref|XP_002274151.1| PREDICTED: pentatricopeptide repeat-containi...   111   3e-22
ref|XP_006391068.1| hypothetical protein EUTSA_v10018264mg [Eutr...   111   5e-22
gb|AAG52501.1|AC018364_19 unknown protein; 45065-49536 [Arabidop...   110   6e-22
ref|NP_177089.2| pentatricopeptide repeat-containing protein [Ar...   110   6e-22
ref|XP_006437925.1| hypothetical protein CICLE_v10033305mg [Citr...   110   8e-22
ref|XP_002888712.1| hypothetical protein ARALYDRAFT_339164 [Arab...   107   7e-21
ref|XP_006391035.1| hypothetical protein EUTSA_v10018238mg [Eutr...   106   1e-20
ref|XP_006301643.1| hypothetical protein CARUB_v10022087mg [Caps...   106   1e-20
gb|EOY01697.1| Pentatricopeptide repeat (PPR) superfamily protei...   104   4e-20
ref|NP_177062.1| pentatricopeptide repeat-containing protein [Ar...   104   6e-20
gb|EPS64873.1| hypothetical protein M569_09905 [Genlisea aurea]       103   7e-20
ref|XP_006301564.1| hypothetical protein CARUB_v10022000mg [Caps...   103   7e-20
ref|XP_004297237.1| PREDICTED: pentatricopeptide repeat-containi...   103   7e-20
gb|EMJ26324.1| hypothetical protein PRUPE_ppa002589mg [Prunus pe...   102   2e-19
ref|XP_002311339.1| pentatricopeptide repeat-containing family p...   101   4e-19
ref|XP_006354656.1| PREDICTED: pentatricopeptide repeat-containi...   100   6e-19
gb|ESW25515.1| hypothetical protein PHAVU_003G042400g [Phaseolus...    98   5e-18

>ref|XP_006484205.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g69290-like [Citrus sinensis]
          Length = 666

 Score =  112 bits (281), Expect = 2e-22
 Identities = 71/237 (29%), Positives = 114/237 (48%), Gaps = 1/237 (0%)
 Frame = +2

Query: 230 QSNGVPSTSENDPRASWITFKSLINEGHLPNKVLVNALLIRVLSESALHDLKKAFADVLI 409
           ++N   S   N+   +W +FKSL      P+K + N+L+  + S    H+LK+AFA V+ 
Sbjct: 76  ETNLHKSLLTNNTDEAWKSFKSLTANSLFPSKPVTNSLIAHLSSLQDNHNLKRAFASVVY 135

Query: 410 ILEKYPHHNLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAE 589
           ++EK P   LL+F T+ T+L ++                       P  +WG  L  I  
Sbjct: 136 VIEKNP--KLLDFQTVHTLLGSMRNANTAAPAFALVKCMFKNRYFMPFELWGGFLVDICR 193

Query: 590 EKSTAFYAIEICLEICRKNMENLSECEHARRVAFNSFL-CACLNLDMVSQAEEMIQRSYA 766
           + S     +++  E CR  ++   +       A N+ L   C  L  VS AE++IQ    
Sbjct: 194 KNSNFVAFLKVFEECCRIALDEKLDFMKPNIYACNAALEGCCYGLQSVSDAEKVIQTMSV 253

Query: 767 LGINPNVGSFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
           LG+ PN  SFG LA LYA  GL + +  LE +M  +G   +  +Y+ L+ G++  G+
Sbjct: 254 LGVRPNESSFGFLAYLYALKGLQEKIVELESLMNEFGFSSQMVFYSSLISGYVKLGN 310


>ref|XP_002887217.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297333058|gb|EFH63476.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 623

 Score =  112 bits (279), Expect = 3e-22
 Identities = 76/270 (28%), Positives = 130/270 (48%), Gaps = 13/270 (4%)
 Frame = +2

Query: 167 SLLKLIATVNTKPSISPFSNHQSNGVPSTSEN-----DPRASWITFKSLINEGHLPNKVL 331
           SL +  ++++ KPS    + HQ +   ST  +     D   +W  F+S      LP+K L
Sbjct: 10  SLRRPFSSISRKPSPKTLTPHQKSSFESTLHHSLITHDTDQAWKVFRSFAAASSLPDKRL 69

Query: 332 VNALLIRVLS-------ESALHDLKKAFADVLIILEKYPHHNLLEFDTLETVLQNLTXXX 490
           +N+L+  + S        S  H LK+AF     ++EK P   LLEF+T+ TVL+++    
Sbjct: 70  LNSLITHLSSLHHADQNTSLRHRLKRAFVSTTYVIEKDPI--LLEFETIRTVLESMKLAK 127

Query: 491 XXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYAIEICLEICRKNMENLSECE 670
                              P  +WG L+  I  E  +    +++  E CR  +    +  
Sbjct: 128 TSGPALALVECMFKNRYFVPFDLWGRLIIDICSETGSLAAFLKVFRESCRIAVYEKLDFM 187

Query: 671 HARRVAFNSFLCACL-NLDMVSQAEEMIQRSYALGINPNVGSFGLLAQLYAKLGLHKNVA 847
               VA N+ L AC   L+ ++ AE++I+    LG+ P+  SFG LA LYA+ GL + ++
Sbjct: 188 KPDLVASNAALEACCWQLESLADAEDVIESMAVLGVKPDESSFGFLAYLYARKGLREKIS 247

Query: 848 SLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
            +E+ M  +G +  +  Y+ ++ G++  GD
Sbjct: 248 EIENSMDGFGFVSRRILYSNVISGYVKSGD 277


>emb|CBI27289.3| unnamed protein product [Vitis vinifera]
          Length = 967

 Score =  111 bits (278), Expect = 3e-22
 Identities = 84/289 (29%), Positives = 136/289 (47%), Gaps = 2/289 (0%)
 Frame = +2

Query: 77  RASSFACCEDSQAYRLLSNAYFSSHETPRYSLLKLIATVNTKPSISPFSNHQSNGV-PST 253
           R SS +  E    Y  L  + FS    P  S  +     + KP  SP      + +  S 
Sbjct: 15  RFSSTSESEFPTLYSFLQPSLFSLKPIP--SAPRSPHPTSPKPLQSPAPEDLESALHTSL 72

Query: 254 SENDPRASWITFKSLINEGHLPNKVLVNALLIRVLSESALHDLKKAFADVLIILEKYPHH 433
           S N+   +W +FK+L      P+K L N+L+  + S   L++LK+AFA  + +LEK P  
Sbjct: 73  STNNTDEAWKSFKALTTNSTFPSKSLANSLIAHLASLHDLYNLKRAFASAVFLLEKNP-- 130

Query: 434 NLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYA 613
           +LL+F T+ T+L ++                       P SMWG ++ +I     +    
Sbjct: 131 SLLDFGTVRTLLGSMNSANTAAPAFALINCMFKNRYFMPFSMWGGVIVEITRRNRSFVAF 190

Query: 614 IEICLEICRKNMENLSECEHARRVAFNSFLCACL-NLDMVSQAEEMIQRSYALGINPNVG 790
           + +  E CR  ++   E       A N  L  C  +L+ VS+AE++++    LGI P+  
Sbjct: 191 LRVFNETCRIAIDEKLESMKPDLDACNVALEGCSQDLESVSEAEKVVEMMSVLGIQPDES 250

Query: 791 SFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
           SFG LA LYA  GL + +  LE +MR +G   ++  Y+ L+  ++  G+
Sbjct: 251 SFGFLAYLYALKGLEEKIVELEGLMRGFGFSSKKVIYSYLINAYVKSGN 299


>ref|XP_002274151.1| PREDICTED: pentatricopeptide repeat-containing protein At1g69290
           [Vitis vinifera]
          Length = 655

 Score =  111 bits (278), Expect = 3e-22
 Identities = 84/289 (29%), Positives = 136/289 (47%), Gaps = 2/289 (0%)
 Frame = +2

Query: 77  RASSFACCEDSQAYRLLSNAYFSSHETPRYSLLKLIATVNTKPSISPFSNHQSNGV-PST 253
           R SS +  E    Y  L  + FS    P  S  +     + KP  SP      + +  S 
Sbjct: 15  RFSSTSESEFPTLYSFLQPSLFSLKPIP--SAPRSPHPTSPKPLQSPAPEDLESALHTSL 72

Query: 254 SENDPRASWITFKSLINEGHLPNKVLVNALLIRVLSESALHDLKKAFADVLIILEKYPHH 433
           S N+   +W +FK+L      P+K L N+L+  + S   L++LK+AFA  + +LEK P  
Sbjct: 73  STNNTDEAWKSFKALTTNSTFPSKSLANSLIAHLASLHDLYNLKRAFASAVFLLEKNP-- 130

Query: 434 NLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYA 613
           +LL+F T+ T+L ++                       P SMWG ++ +I     +    
Sbjct: 131 SLLDFGTVRTLLGSMNSANTAAPAFALINCMFKNRYFMPFSMWGGVIVEITRRNRSFVAF 190

Query: 614 IEICLEICRKNMENLSECEHARRVAFNSFLCACL-NLDMVSQAEEMIQRSYALGINPNVG 790
           + +  E CR  ++   E       A N  L  C  +L+ VS+AE++++    LGI P+  
Sbjct: 191 LRVFNETCRIAIDEKLESMKPDLDACNVALEGCSQDLESVSEAEKVVEMMSVLGIQPDES 250

Query: 791 SFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
           SFG LA LYA  GL + +  LE +MR +G   ++  Y+ L+  ++  G+
Sbjct: 251 SFGFLAYLYALKGLEEKIVELEGLMRGFGFSSKKVIYSYLINAYVKSGN 299


>ref|XP_006391068.1| hypothetical protein EUTSA_v10018264mg [Eutrema salsugineum]
           gi|557087502|gb|ESQ28354.1| hypothetical protein
           EUTSA_v10018264mg [Eutrema salsugineum]
          Length = 632

 Score =  111 bits (277), Expect = 5e-22
 Identities = 72/245 (29%), Positives = 122/245 (49%), Gaps = 6/245 (2%)
 Frame = +2

Query: 221 SNHQSNGVPSTSENDPRASWITFKSLINEGHLPNKVLVNALLIRVLS-----ESALHDLK 385
           S+ +S+   S + +D   +W  F+S      LP+K+L+N+L+  + S      S  H LK
Sbjct: 43  SSFESSLRHSLTAHDTDQAWKAFRSFAAASSLPDKLLLNSLITHMSSFHAGDTSLRHRLK 102

Query: 386 KAFADVLIILEKYPHHNLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWG 565
           +AF     ++EK P   LLEF+TL T+L+++                       P  +WG
Sbjct: 103 RAFVSAAYVIEKDPI--LLEFETLRTLLESMKLAKAAAPALALVECMFKNRYFVPFDLWG 160

Query: 566 ALLEKIAEEKSTAFYAIEICLEICRKNMENLSECEHARRVAFNSFLCACL-NLDMVSQAE 742
            L+  I  E  T    +++  E CR +++   +      VA N+ L AC   ++ V+ AE
Sbjct: 161 HLIIDICRENGTLAAFLKVFRESCRISVDEKLDFMKPDLVASNAALEACCWQMESVADAE 220

Query: 743 EMIQRSYALGINPNVGSFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGH 922
            +++    LG+ P+  SFG LA LYA+ GL + ++ LED M  +G    +  Y+ ++ G+
Sbjct: 221 NVMESMAVLGVKPDESSFGFLAYLYARKGLREKISELEDAMDGFGFASRRILYSNMISGY 280

Query: 923 LT*GD 937
           +  GD
Sbjct: 281 VKMGD 285


>gb|AAG52501.1|AC018364_19 unknown protein; 45065-49536 [Arabidopsis thaliana]
          Length = 860

 Score =  110 bits (276), Expect = 6e-22
 Identities = 87/295 (29%), Positives = 140/295 (47%), Gaps = 24/295 (8%)
 Frame = +2

Query: 125  LSNAYFSSH--ETPR-YSLLKLIA----TVNTKPSISPFSN------HQSNGVPSTSEND 265
            +S  +FSS   E+P  YS LK        +   PS+SP  N       Q +   ST  + 
Sbjct: 211  ISRRHFSSSSPESPSLYSFLKPSLFSHKPITLSPSLSPPQNPKTLTPDQKSSFESTLHDS 270

Query: 266  PRA-----SWITFKSLINEGHLPNKVLVNALLIRVLS-----ESALHDLKKAFADVLIIL 415
              A     +W  F+SL     LP K L+N+L+  +       ES  H LK+AFA    ++
Sbjct: 271  LNAHYTDEAWKAFRSLTAASSLPEKRLINSLITHLSGVEGSGESISHRLKRAFASAAYVI 330

Query: 416  EKYPHHNLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEK 595
            EK P   LLEF+T+ T+L+++                       P  +WG L+  I  E 
Sbjct: 331  EKDPI--LLEFETVRTLLESMKLAKAAGPALALVKCMFKNRYFVPFDLWGHLVIDICREN 388

Query: 596  STAFYAIEICLEICRKNMENLSECEHARRVAFNSFLCACLN-LDMVSQAEEMIQRSYALG 772
             +    +++  E CR +++   E      VA N+ L AC   ++ ++ AE +I+    LG
Sbjct: 389  GSLAPFLKVFKESCRISVDEKLEFMKPDLVASNAALEACCRQMESLADAENVIESMAVLG 448

Query: 773  INPNVGSFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
            + P+  SFG LA LYA+ GL + ++ LE++M  +G    +  Y+ ++ G++  GD
Sbjct: 449  VKPDELSFGFLAYLYARKGLREKISELENLMDGFGFASRRILYSNMISGYVKSGD 503


>ref|NP_177089.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|193806277|sp|P0C7R4.1|PP110_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g69290 gi|332196785|gb|AEE34906.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 658

 Score =  110 bits (276), Expect = 6e-22
 Identities = 87/295 (29%), Positives = 140/295 (47%), Gaps = 24/295 (8%)
 Frame = +2

Query: 125 LSNAYFSSH--ETPR-YSLLKLIA----TVNTKPSISPFSN------HQSNGVPSTSEND 265
           +S  +FSS   E+P  YS LK        +   PS+SP  N       Q +   ST  + 
Sbjct: 9   ISRRHFSSSSPESPSLYSFLKPSLFSHKPITLSPSLSPPQNPKTLTPDQKSSFESTLHDS 68

Query: 266 PRA-----SWITFKSLINEGHLPNKVLVNALLIRVLS-----ESALHDLKKAFADVLIIL 415
             A     +W  F+SL     LP K L+N+L+  +       ES  H LK+AFA    ++
Sbjct: 69  LNAHYTDEAWKAFRSLTAASSLPEKRLINSLITHLSGVEGSGESISHRLKRAFASAAYVI 128

Query: 416 EKYPHHNLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEK 595
           EK P   LLEF+T+ T+L+++                       P  +WG L+  I  E 
Sbjct: 129 EKDPI--LLEFETVRTLLESMKLAKAAGPALALVKCMFKNRYFVPFDLWGHLVIDICREN 186

Query: 596 STAFYAIEICLEICRKNMENLSECEHARRVAFNSFLCACLN-LDMVSQAEEMIQRSYALG 772
            +    +++  E CR +++   E      VA N+ L AC   ++ ++ AE +I+    LG
Sbjct: 187 GSLAPFLKVFKESCRISVDEKLEFMKPDLVASNAALEACCRQMESLADAENVIESMAVLG 246

Query: 773 INPNVGSFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
           + P+  SFG LA LYA+ GL + ++ LE++M  +G    +  Y+ ++ G++  GD
Sbjct: 247 VKPDELSFGFLAYLYARKGLREKISELENLMDGFGFASRRILYSNMISGYVKSGD 301


>ref|XP_006437925.1| hypothetical protein CICLE_v10033305mg [Citrus clementina]
           gi|557540121|gb|ESR51165.1| hypothetical protein
           CICLE_v10033305mg [Citrus clementina]
          Length = 948

 Score =  110 bits (275), Expect = 8e-22
 Identities = 70/237 (29%), Positives = 114/237 (48%), Gaps = 1/237 (0%)
 Frame = +2

Query: 230 QSNGVPSTSENDPRASWITFKSLINEGHLPNKVLVNALLIRVLSESALHDLKKAFADVLI 409
           ++N   S   N+   +W +FKSL      P+K + N+L+  + S    H+LK+AFA V+ 
Sbjct: 76  ETNLHKSLLTNNTDEAWKSFKSLTANSLFPSKPVTNSLIAHLSSLQDNHNLKRAFASVVY 135

Query: 410 ILEKYPHHNLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAE 589
           ++EK P   LL+F T+ T+L ++                       P  +WG  L  I  
Sbjct: 136 VIEKNP--KLLDFQTVHTLLGSMRNANTAAPAFALVKCMFKNRYFMPFELWGGFLVDICR 193

Query: 590 EKSTAFYAIEICLEICRKNMENLSECEHARRVAFNSFL-CACLNLDMVSQAEEMIQRSYA 766
           + S     +++  E CR  ++   +       A N+ L   C  L  VS AE++I+    
Sbjct: 194 KNSNFVAFLKVFEECCRIALDEKLDFMKPNIYACNAALEGCCYGLQSVSDAEKVIETMSV 253

Query: 767 LGINPNVGSFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
           LG+ PN  SFG LA LYA  GL + V  LE ++  +G   +  +Y+ L+ G++  G+
Sbjct: 254 LGVRPNESSFGFLAYLYALKGLQEKVVELESLINEFGFSSQMVFYSSLISGYVKLGN 310


>ref|XP_002888712.1| hypothetical protein ARALYDRAFT_339164 [Arabidopsis lyrata subsp.
           lyrata] gi|297334553|gb|EFH64971.1| hypothetical protein
           ARALYDRAFT_339164 [Arabidopsis lyrata subsp. lyrata]
          Length = 1042

 Score =  107 bits (267), Expect = 7e-21
 Identities = 83/296 (28%), Positives = 144/296 (48%), Gaps = 24/296 (8%)
 Frame = +2

Query: 122 LLSNAYFSSH--ETPR-YSLLK--LIAT--VNTKPSISPFSN------HQSNGVPST--- 253
           ++S  +FSS   E+P  YS LK  L +   +   PS+SP  N       Q +   ST   
Sbjct: 8   IISRRHFSSSSPESPSLYSFLKPSLFSNKPITLTPSLSPPQNLKTLTQEQKSSFESTLHD 67

Query: 254 --SENDPRASWITFKSLINEGHLPNKVLVNALLIRVLS-----ESALHDLKKAFADVLII 412
             + ++   +W  F+SL     LP K L+N+L+  + +     E+  H LK+AFA    +
Sbjct: 68  SLTTHNTDEAWKAFRSLTAASSLPEKRLINSLITHLSNTEESGENTAHRLKRAFASAAYV 127

Query: 413 LEKYPHHNLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEE 592
           ++K P   LLEF+T+ T+++++                       P  +WG L+  I  E
Sbjct: 128 IQKDPI--LLEFETVRTLMESMKLAKAAGPALALVKCMFKNRYFVPFDLWGHLIIDICRE 185

Query: 593 KSTAFYAIEICLEICRKNMENLSECEHARRVAFNSFLCACLN-LDMVSQAEEMIQRSYAL 769
             +    +++  E CR  ++   +      VA N+ L AC   L+ ++ A+ +I+    L
Sbjct: 186 NGSLAAFLKVFKESCRIAVDEKLDFMKPDLVASNAALEACCRQLESLADADNVIESMAVL 245

Query: 770 GINPNVGSFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
           G+ P+  SFG LA LYA+ GL + ++ LE++M  +G    +  Y+ ++ G++  GD
Sbjct: 246 GVKPDELSFGFLAYLYARKGLREKISELENLMDGFGFASRRILYSNMISGYVKSGD 301


>ref|XP_006391035.1| hypothetical protein EUTSA_v10018238mg [Eutrema salsugineum]
           gi|557087469|gb|ESQ28321.1| hypothetical protein
           EUTSA_v10018238mg [Eutrema salsugineum]
          Length = 661

 Score =  106 bits (264), Expect = 1e-20
 Identities = 83/287 (28%), Positives = 137/287 (47%), Gaps = 22/287 (7%)
 Frame = +2

Query: 143 SSHETPR-YSLLK---LIATVNT-KPSISP------FSNHQSNGVPST-----SENDPRA 274
           SS E+P  YS LK        NT  PS+SP       S  Q + + S      + ++   
Sbjct: 17  SSPESPSLYSFLKPSLFSHKPNTLTPSLSPPQTPKTLSQDQRSSIESALHDSLASHNTDE 76

Query: 275 SWITFKSLINEGHLPNKVLVNALLIRVLS-----ESALHDLKKAFADVLIILEKYPHHNL 439
           +W  F+SL     LP K LVN+L+  +       E++ H LK+AFA    ++EK P   L
Sbjct: 77  AWKAFRSLTAASSLPEKRLVNSLITHLSGSCGDGENSSHRLKRAFASAAYVIEKDPI--L 134

Query: 440 LEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYAIE 619
           LEF+T+ T+++++                       P  +WG L+     E  T    ++
Sbjct: 135 LEFETVRTLMESMKVAKAAAPALALVKCMFQNRYFVPFDLWGHLIIDSCRENGTLAAFLK 194

Query: 620 ICLEICRKNMENLSECEHARRVAFNSFLCACLN-LDMVSQAEEMIQRSYALGINPNVGSF 796
           +  E CR  ++   +      VA N+ L AC   ++ ++ AE +I+    LG+ P+  SF
Sbjct: 195 VFRESCRIAVDEKLDFMKPDLVASNAALEACCRQMESLADAENVIESMAILGVKPDESSF 254

Query: 797 GLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
           G LA LYA+ GL + ++ LE++M  +G    +  Y+ ++ G++  GD
Sbjct: 255 GFLAYLYARKGLKEKISELENLMDGFGFESRRVLYSNMISGYVKMGD 301


>ref|XP_006301643.1| hypothetical protein CARUB_v10022087mg [Capsella rubella]
           gi|482570353|gb|EOA34541.1| hypothetical protein
           CARUB_v10022087mg [Capsella rubella]
          Length = 658

 Score =  106 bits (264), Expect = 1e-20
 Identities = 81/289 (28%), Positives = 133/289 (46%), Gaps = 24/289 (8%)
 Frame = +2

Query: 143 SSHETPR-YSLLKLIATVNTKPSISPFSNHQSNGVPSTSENDPRAS-------------- 277
           SS E+P  YS LK     N   +++P  +   N  P T   D +AS              
Sbjct: 17  SSPESPSLYSFLKPSLFSNKPITLTPSLSPPQN--PKTLTQDQKASFESALHDSLTAQNT 74

Query: 278 ---WITFKSLINEGHLPNKVLVNALLIRVLS-----ESALHDLKKAFADVLIILEKYPHH 433
              W  F+SL     LP K L+N+L+  + +     E+  H LK+AFA    ++EK P  
Sbjct: 75  DEAWKAFRSLTAASSLPEKRLINSLITHLSNTEGSGENTSHRLKRAFASAAYVIEKDPI- 133

Query: 434 NLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYA 613
            LLEF+T+ +VL+++                       P  +WG L+  I  E  +    
Sbjct: 134 -LLEFETVRSVLESMKLAKASGPALALVKCMFKNRYFVPFDLWGHLIIDICRENGSLAAF 192

Query: 614 IEICLEICRKNMENLSECEHARRVAFNSFLCACLN-LDMVSQAEEMIQRSYALGINPNVG 790
           +++  E CR  ++   +      VA N+ L AC   L+ ++ A+ +I+    LG+ P+  
Sbjct: 193 LKVFKESCRIAVDEKLDFMKPDLVASNAALEACCRQLESLADADNVIESMAVLGVKPDES 252

Query: 791 SFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
           SFG LA LYA+ G  + ++ LE++M  +G       Y+ ++ G++  GD
Sbjct: 253 SFGFLAYLYARKGFREKISELENLMDGFGFASRGILYSNMISGYVKNGD 301


>gb|EOY01697.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma
           cacao]
          Length = 655

 Score =  104 bits (260), Expect = 4e-20
 Identities = 62/217 (28%), Positives = 109/217 (50%), Gaps = 1/217 (0%)
 Frame = +2

Query: 275 SWITFKSLINEGHLPNKVLVNALLIRVLSESALHDLKKAFADVLIILEKYPHHNLLEFDT 454
           +W +FK+L      PNK L N+L+  + S    H+LK+AFA V+ ++EK P    L F+T
Sbjct: 80  AWKSFKALTTNSIFPNKPLTNSLITYLSSLKDTHNLKRAFASVVFVIEKNPKS--LSFET 137

Query: 455 LETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYAIEICLEI 634
           + +VL+++                       P  +WG +L  I+ +  +    + +  E 
Sbjct: 138 VTSVLRSMKIANTAAPAFALIKCMLKNRYFMPFVLWGDMLVDISRKNGSFVAFLRVFEEC 197

Query: 635 CRKNMENLSECEHARRVAFNSFL-CACLNLDMVSQAEEMIQRSYALGINPNVGSFGLLAQ 811
           CR  ++   +       A N+ L C C  L  VS AE++++    LG+ P+  SFG L+ 
Sbjct: 198 CRIAIDEKLDYMKPDLAACNAALECCCYELKSVSDAEKVVETMSVLGVRPDESSFGFLSY 257

Query: 812 LYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGH 922
           LYA  GL + +  L+++M  +G+  ++  Y+ L+ G+
Sbjct: 258 LYALKGLEEKIDELKNLMLEFGLSNKKMVYSSLIGGY 294


>ref|NP_177062.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75333630|sp|Q9CAA5.1|PP109_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g68980, mitochondrial; Flags: Precursor
           gi|12323218|gb|AAG51590.1|AC011665_11 unknown protein
           [Arabidopsis thaliana] gi|110740675|dbj|BAE98440.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332196751|gb|AEE34872.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 619

 Score =  104 bits (259), Expect = 6e-20
 Identities = 76/270 (28%), Positives = 127/270 (47%), Gaps = 13/270 (4%)
 Frame = +2

Query: 167 SLLKLIATVNTKPSISPFSNHQSNGVPSTSEN-----DPRASWITFKSLINEGHLPNKVL 331
           +L+ L    ++ PS    + HQ +   ST  +     D   +W  F+S      LP+K L
Sbjct: 7   TLISLRRPFSSIPS-KTLTPHQKSSFESTLHHSLITHDTDQAWKVFRSFAAASSLPDKRL 65

Query: 332 VNALLIRVLS-------ESALHDLKKAFADVLIILEKYPHHNLLEFDTLETVLQNLTXXX 490
           +N+L+  + S        S  H LK+AF     ++EK P   LLEF+T+ TVL+++    
Sbjct: 66  LNSLITHLSSFHNTDQNTSLRHRLKRAFVSTTYVIEKDPI--LLEFETVRTVLESMKLAK 123

Query: 491 XXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYAIEICLEICRKNMENLSECE 670
                              P  +WG LL  +  E  +    +++  E CR  ++   +  
Sbjct: 124 ASGPALALVECMFKNRYFVPFDLWGDLLIDVCRENGSLAAFLKVFRESCRIAVDEKLDFM 183

Query: 671 HARRVAFNSFLCACLN-LDMVSQAEEMIQRSYALGINPNVGSFGLLAQLYAKLGLHKNVA 847
               VA N+ L AC   ++ ++ AE +I+    LG+ P+  SFG LA LYA+ GL + ++
Sbjct: 184 KPDLVASNAALEACCRQMESLADAENLIESMDVLGVKPDELSFGFLAYLYARKGLREKIS 243

Query: 848 SLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
            LED+M   G    +  Y+ ++ G++  GD
Sbjct: 244 ELEDLMDGLGFASRRILYSSMISGYVKSGD 273


>gb|EPS64873.1| hypothetical protein M569_09905 [Genlisea aurea]
          Length = 667

 Score =  103 bits (258), Expect = 7e-20
 Identities = 83/300 (27%), Positives = 137/300 (45%), Gaps = 27/300 (9%)
 Frame = +2

Query: 122 LLSNAYFSSHE----TPRYSLLK--LIATVNTKPSISPFSNHQSNGVPSTSENDPRAS-- 277
           LLS   FSS      TP YS LK  L +   ++    P S   SN   S+S + PR S  
Sbjct: 9   LLSRRLFSSETEKKPTPLYSFLKPSLFSLTRSQQEPPPKSKRDSN---SSSSSPPRKSDL 65

Query: 278 ----------------WITFKSLINEG--HLPNKVLVNALLIRVLSESALHDLKKAFADV 403
                           W +F++L N G    P+K L+N+++  + S +  H+LK+A+A V
Sbjct: 66  EASIQQSLFNGHTDQAWKSFRALANGGPSSFPDKPLINSMITHLSSTNDAHNLKRAYASV 125

Query: 404 LIILEKYPHHNLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKI 583
           +  LEK P  + LEF T++++L ++                       P  MWG  +  +
Sbjct: 126 IFALEKNP--SSLEFSTVKSLLDSVKTAAPALALVKSMLSHRFF---MPFPMWGGAVLDL 180

Query: 584 AEEKSTAFYAIEICLEICRKNMENLSECEHARRVAFNSFLC-ACLNLDMVSQAEEMIQRS 760
             +  +    + +  ++CR ++    +       A N+ L   C  +  + +AE++I+  
Sbjct: 181 CRKNGSLSCFLGVFRQVCRISLVEKLDFMKPDLAACNAALHHCCREVGSIVEAEKVIESM 240

Query: 761 YALGINPNVGSFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GDF 940
             LGI P+  ++G LA LYA  GLH  +  LE +M   G+  E+  +  L+ G +  GDF
Sbjct: 241 SILGIKPDESTYGSLAYLYAFRGLHDKITDLEYLMDKLGVSNERPLFHNLICGFINCGDF 300


>ref|XP_006301564.1| hypothetical protein CARUB_v10022000mg [Capsella rubella]
           gi|482570274|gb|EOA34462.1| hypothetical protein
           CARUB_v10022000mg [Capsella rubella]
          Length = 623

 Score =  103 bits (258), Expect = 7e-20
 Identities = 75/269 (27%), Positives = 125/269 (46%), Gaps = 13/269 (4%)
 Frame = +2

Query: 170 LLKLIATVNTKPSISPFSNHQSNGVPSTSEN-----DPRASWITFKSLINEGHLPNKVLV 334
           L +  +++  KPS    + HQ +   ST  +     D   +W  F+S      LP K L+
Sbjct: 11  LRRPFSSIPPKPSPKTLTPHQKSSFESTLHHSLIAHDTDQAWKVFRSFAAASSLPEKSLL 70

Query: 335 NALLIRVLSE-------SALHDLKKAFADVLIILEKYPHHNLLEFDTLETVLQNLTXXXX 493
           N+L+  + S        S  H LK+AF     ++EK P   LL+F T+ TVL+++     
Sbjct: 71  NSLITHLSSFNHADQNISRRHRLKRAFVSATYVIEKDPI--LLDFGTVLTVLESMKLAKA 128

Query: 494 XXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYAIEICLEICRKNMENLSECEH 673
                             P  +WG L+  I  E  T    +++  E CR  ++   +   
Sbjct: 129 SGPALALVECMFKNRYFVPFDLWGHLIIDICRENGTLAAFLKVFKESCRIAVDENLDFMK 188

Query: 674 ARRVAFNSFLCACL-NLDMVSQAEEMIQRSYALGINPNVGSFGLLAQLYAKLGLHKNVAS 850
              VA N+ L AC   L+ ++ AE +I+    LG+ P+  SFG LA L+A+ GL + ++ 
Sbjct: 189 PDLVASNAALEACCWQLESLADAEYVIESMAVLGVKPDESSFGFLAYLFARKGLQEKISE 248

Query: 851 LEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
           LE+ M  +G    +  Y+ ++ G++  GD
Sbjct: 249 LENSMDGFGFASRRILYSNMISGYVKSGD 277


>ref|XP_004297237.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g69290-like [Fragaria vesca subsp. vesca]
          Length = 651

 Score =  103 bits (258), Expect = 7e-20
 Identities = 67/222 (30%), Positives = 111/222 (50%), Gaps = 2/222 (0%)
 Frame = +2

Query: 275 SWITFKSLINEGHLPNKVLVNALLIRVLSESALHDLKKAFADVLIILEKYPHHNLLEFDT 454
           +W +FKSL      P+K L N+++  + S   +H+LK+AFA V+ ++EK P   LLEF+T
Sbjct: 76  AWKSFKSLTGSSVFPSKSLTNSMITHLASLGEIHNLKRAFASVVYVVEKSPE--LLEFET 133

Query: 455 LETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYAIEICLEI 634
           + +VL  +                       P S+WG+++ +I+         + +  E 
Sbjct: 134 VGSVLGAMNCANTAAPAFALIQCMFKNRFFLPFSVWGSVVVEISRRNGNFGAFLRVFEEN 193

Query: 635 CRKNMENLSECEHARRVAFNSFL--CACLNLDMVSQAEEMIQRSYALGINPNVGSFGLLA 808
           CR  +E   E       A N+ L  C C  L+ VS AE++++    LG+ P+  SFG LA
Sbjct: 194 CRVALEEKMEVMKPDLAACNAALEGCCC-ELESVSGAEKVVETMVGLGVRPDECSFGFLA 252

Query: 809 QLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*G 934
            LYA  GL + ++ LE +M  +G    + +   L+ G++  G
Sbjct: 253 YLYALKGLGEKISELEGLMGGFGFSDRRVFRNNLINGYVKSG 294


>gb|EMJ26324.1| hypothetical protein PRUPE_ppa002589mg [Prunus persica]
          Length = 655

 Score =  102 bits (255), Expect = 2e-19
 Identities = 64/221 (28%), Positives = 110/221 (49%), Gaps = 1/221 (0%)
 Frame = +2

Query: 275 SWITFKSLINEGHLPNKVLVNALLIRVLSESALHDLKKAFADVLIILEKYPHHNLLEFDT 454
           +W +FK+L      P+K L N+L+  + S   +H+LK+AFA V+ ++EK P    L+F+T
Sbjct: 80  AWKSFKTLTGSSAFPSKSLTNSLITHLSSLGDIHNLKRAFATVVYVVEKNP--GFLDFET 137

Query: 455 LETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYAIEICLEI 634
           + T+L  +                       P S+WG +L +I+ +       + +  E 
Sbjct: 138 VGTLLDAMKCANTAAPAFALIKSVFKNRFFLPFSVWGNVLIEISRKNGNFVAFLRVFEEN 197

Query: 635 CRKNMENLSECEHARRVAFNSFLCACLN-LDMVSQAEEMIQRSYALGINPNVGSFGLLAQ 811
           CR  ++   E       A N+ L  C   L+ VS AE++++    LG+ P+  SFG LA 
Sbjct: 198 CRIALDEKLESMKPDLAACNAALEGCCRELESVSDAEKVVETMAVLGVRPDESSFGFLAY 257

Query: 812 LYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*G 934
           LYA  GL + +  LE +M  +G   ++ + + L+ G++  G
Sbjct: 258 LYALKGLEEKITELEGLMGGFGFSNKRVFQSNLINGYVKSG 298


>ref|XP_002311339.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|222851159|gb|EEE88706.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 654

 Score =  101 bits (252), Expect = 4e-19
 Identities = 77/295 (26%), Positives = 132/295 (44%), Gaps = 5/295 (1%)
 Frame = +2

Query: 71  MTRASSFACCEDSQAYRLLSNAYFSSHETPRYSLLKLIATVNTKPSI---SPFSNHQSNG 241
           + R S     E    Y  L    F+  +TP  +      T    P I      +N +S  
Sbjct: 9   LRRRSFSTTPEIPNLYSFLQPTIFALKKTPPSTTNPATTTNRQTPKILTQDHITNLESTL 68

Query: 242 VPSTSENDPRASWITFKSLINEGHLPNKVLVNALLIRVLSESALHDLKKAFADVLIILEK 421
             S   N+   +W +FKSL +    P+K L N+L+  + S +   +LK+AFA ++ ++EK
Sbjct: 69  HKSLITNNTNEAWASFKSLTSNSAFPSKSLTNSLITHLSSLNDTINLKRAFASIVYVIEK 128

Query: 422 YPHHNLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKST 601
            P    L+F+T++  L ++                       P  +WG +L +I+ +   
Sbjct: 129 NPKS--LDFETVQLFLGSMVRANTAAPAFALIKCMFKNRFFMPFRLWGDILIEISRKNDK 186

Query: 602 AFYAIEICLEICRKNMENLSECEHARRVAFNSFL--CACLNLDMVSQAEEMIQRSYALGI 775
               +++  E CR  ++   +       A N  L  C C  L+ VS+AE++I+    LGI
Sbjct: 187 VIAFLKVFEESCRIAIDEKLDFMKPDMDACNVALEGCCC-ELESVSEAEKVIETMSVLGI 245

Query: 776 NPNVGSFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GDF 940
            P+  SFG LA LYA  G    +  L  +M  +G   ++ +++ L+ G++  G F
Sbjct: 246 KPDELSFGFLAYLYALKGFQDKIIELNGLMSGFGFSNKKLFFSYLIRGYVKSGSF 300


>ref|XP_006354656.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g69290-like isoform X1 [Solanum tuberosum]
           gi|565376327|ref|XP_006354657.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At1g69290-like isoform X2 [Solanum tuberosum]
          Length = 654

 Score =  100 bits (250), Expect = 6e-19
 Identities = 65/240 (27%), Positives = 118/240 (49%), Gaps = 1/240 (0%)
 Frame = +2

Query: 221 SNHQSNGVPSTSENDPRASWITFKSLINEGHLPNKVLVNALLIRVLSESALHDLKKAFAD 400
           SN +S    S   N+   +W +FK+L N    P+K L N+++  + S +  H++K+AFA 
Sbjct: 61  SNLESTLQDSIKSNNTDEAWKSFKTLSNYSAFPSKSLTNSVITHLSSLNDTHNIKRAFAS 120

Query: 401 VLIILEKYPHHNLLEFDTLETVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEK 580
           V+ +LEK     LL+ +T+  +L ++                       P S+WG +L +
Sbjct: 121 VVFLLEK--KQELLKPETVHVLLNSMREANSAAPAFALVKCMFKNRFFIPFSLWGDVLVE 178

Query: 581 IAEEKSTAFYAIEICLEICRKNMENLSECEHARRVAFNSFL-CACLNLDMVSQAEEMIQR 757
           I  +       +++  E CR  ++           A N+ L C C  ++ ++ AE++++ 
Sbjct: 179 ICRKNGNFGGFLQVFNENCRVAIDEKLNFLKPSLAACNAALECCCREVESITDAEKVVET 238

Query: 758 SYALGINPNVGSFGLLAQLYAKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHLT*GD 937
              LG+ P+  SFGLLA LYA  GL + +A LE ++  +G   +  + + L+ G +  G+
Sbjct: 239 MSVLGVRPDECSFGLLAYLYALKGLKEKIAELEGLISGFGFPDKGVFLSNLISGFVKCGN 298


>gb|ESW25515.1| hypothetical protein PHAVU_003G042400g [Phaseolus vulgaris]
          Length = 655

 Score = 97.8 bits (242), Expect = 5e-18
 Identities = 75/276 (27%), Positives = 122/276 (44%), Gaps = 1/276 (0%)
 Frame = +2

Query: 101 EDSQAYRLLSNAYFSSHETPRYSLLKLIATVNTKPSISPFSNHQSNGVPSTSENDPRASW 280
           E    Y  L  + F+  +  +  + +      T  S S  S  Q+    S   ++   +W
Sbjct: 22  ETPTLYSFLQPSIFALTKNKQQPISEASPKAPTCLSTSQLSTLQTTLHKSLISSNTDEAW 81

Query: 281 ITFKSLINEGHLPNKVLVNALLIRVLSESALHDLKKAFADVLIILEKYPHHNLLEFDTLE 460
            +FK+L      P K L N+LL  + S     +LK+AFA  L ++EK P   LL+ DTL 
Sbjct: 82  KSFKALTTHQAFPPKPLTNSLLSHLSSLGDTLNLKRAFASALFLMEKNPL--LLQHDTLH 139

Query: 461 TVLQNLTXXXXXXXXXXXXXXXXXXGSCPPISMWGALLEKIAEEKSTAFYAIEICLEICR 640
            +L ++                       P  +WG +L +I+ +       + +  E CR
Sbjct: 140 HMLLSMKGANTAAPAFALVRSMLRFRFFVPFHIWGPVLVEISRDCGNLAAFLRLFEENCR 199

Query: 641 KNMENLSECEHARRVAFNSFL-CACLNLDMVSQAEEMIQRSYALGINPNVGSFGLLAQLY 817
             +E   E      VA N+ L   C  L+ VS AE ++     LG+ P+  SFG L  LY
Sbjct: 200 VALEERVEFMKPDVVACNAALEGCCFELESVSDAERVVGTMSNLGVRPDESSFGFLGYLY 259

Query: 818 AKLGLHKNVASLEDIMRLYGIIPEQQYYAGLVLGHL 925
           A  GL + +  LE +M  +G + ++ +Y  L+ G++
Sbjct: 260 ALKGLEEKIRELEVLMGGFGCLNKKGFYCNLIRGYV 295


Top