BLASTX nr result

ID: Akebia24_contig00006226 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00006226
         (822 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007204122.1| hypothetical protein PRUPE_ppa024755mg [Prun...   374   e-101
ref|XP_006466829.1| PREDICTED: pentatricopeptide repeat-containi...   371   e-100
ref|XP_006425640.1| hypothetical protein CICLE_v10025352mg [Citr...   369   e-100
emb|CBI32639.3| unnamed protein product [Vitis vinifera]              363   5e-98
ref|XP_002273763.1| PREDICTED: pentatricopeptide repeat-containi...   363   5e-98
ref|XP_007029218.1| Tetratricopeptide repeat-like superfamily pr...   352   7e-95
ref|XP_004486409.1| PREDICTED: pentatricopeptide repeat-containi...   350   4e-94
ref|XP_003594404.1| Pentatricopeptide repeat-containing protein ...   346   7e-93
ref|XP_004146551.1| PREDICTED: pentatricopeptide repeat-containi...   333   6e-89
ref|XP_007159287.1| hypothetical protein PHAVU_002G225300g [Phas...   332   1e-88
ref|XP_003539183.1| PREDICTED: pentatricopeptide repeat-containi...   332   1e-88
ref|XP_006604975.1| PREDICTED: pentatricopeptide repeat-containi...   322   8e-86
ref|XP_002310252.2| hypothetical protein POPTR_0007s13130g [Popu...   317   4e-84
gb|EAY91060.1| hypothetical protein OsI_12669 [Oryza sativa Indi...   291   3e-76
ref|NP_001050692.1| Os03g0624800 [Oryza sativa Japonica Group] g...   291   3e-76
ref|NP_201451.1| pentatricopeptide repeat-containing protein [Ar...   289   1e-75
ref|XP_002865063.1| pentatricopeptide repeat-containing protein ...   288   2e-75
gb|ADQ43215.1| unknown [Eutrema parvulum]                             286   8e-75
ref|XP_006281458.1| hypothetical protein CARUB_v10027537mg [Caps...   283   5e-74
ref|XP_006393844.1| hypothetical protein EUTSA_v10003984mg [Eutr...   278   1e-72

>ref|XP_007204122.1| hypothetical protein PRUPE_ppa024755mg [Prunus persica]
            gi|462399653|gb|EMJ05321.1| hypothetical protein
            PRUPE_ppa024755mg [Prunus persica]
          Length = 497

 Score =  374 bits (961), Expect = e-101
 Identities = 187/274 (68%), Positives = 219/274 (79%), Gaps = 1/274 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+LV+GCV+N+KY  A+SI+S MKPNVIALTS L ACS NS+L +G+QIHCVA+R+GF  
Sbjct: 212  NSLVAGCVRNKKYKEALSIMSTMKPNVIALTSALAACSENSNLWIGKQIHCVAMRHGFTS 271

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            DTQ+ N LLDMYAKCGKIS  R LF  I  +NVV+WT++IDAYGS G G+EAL+LFK M 
Sbjct: 272  DTQMCNVLLDMYAKCGKISNARSLFNGISNKNVVSWTSMIDAYGSHGNGLEALDLFKRMG 331

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            EE S V PN VTFLAVLSACGHSGLVEQGREC  S  EKYG+D GPEHY CFID+LGRAG
Sbjct: 332  EERSEVLPNSVTFLAVLSACGHSGLVEQGRECFNSAPEKYGLDLGPEHYGCFIDMLGRAG 391

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             IDEVW +F DM   G  PT +V +A+LNAC  NLD+ RGE  AK L ELEP+KPGN+V 
Sbjct: 392  QIDEVWCVFQDMVEHGIRPTAAVWSALLNACSHNLDVTRGEVAAKHLLELEPNKPGNFVL 451

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
            VSNFYA++GRWD V++LRS+M  KGL KE GSSW
Sbjct: 452  VSNFYATIGRWDSVDQLRSIMEMKGLVKEVGSSW 485



 Score = 68.2 bits (165), Expect = 4e-09
 Identities = 56/200 (28%), Positives = 90/200 (45%), Gaps = 9/200 (4%)
 Frame = +3

Query: 3   NALVSGCVQNQKYGVAISILS---RMKP--NVIALTSGLTACSANSDLLMGRQIHCVAIR 167
           NAL++   +N ++    ++ S   R KP  N    T  L AC A      GRQ+H + I+
Sbjct: 10  NALLASYNRNGQFSTTWALFSCIHRAKPDLNAYTFTRVLGACRALPRPERGRQVHGLMIK 69

Query: 168 NGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALEL 347
            G E  T    A++DMY+K G +  +   F  +  ++VVTW  ++ ++   G   EAL  
Sbjct: 70  TGAESGTVAKTAVIDMYSKYGYLEDSVRAFEEMEFKDVVTWNALLSSFLRHGLAREALGA 129

Query: 348 FKMMLEESGVSPNPVTFLAVLSACGHSGLVEQGRE----CIISMREKYGIDPGPEHYACF 515
           F+ M EE  V  +  T  ++L AC       QG++     ++  R+   +          
Sbjct: 130 FEAMREER-VEISEFTLCSLLKACASLRSSPQGKQVHGMVVVMGRDMLILG------TAL 182

Query: 516 IDLLGRAGLIDEVWGLFSDM 575
           ID     G I E   +FS +
Sbjct: 183 IDFYSAVGCISEAMKVFSGL 202


>ref|XP_006466829.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66500,
            mitochondrial-like isoform X1 [Citrus sinensis]
            gi|568824902|ref|XP_006466830.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g66500,
            mitochondrial-like isoform X2 [Citrus sinensis]
          Length = 527

 Score =  371 bits (952), Expect = e-100
 Identities = 181/274 (66%), Positives = 220/274 (80%), Gaps = 1/274 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L+SGCVQN+KY  A SI+S M+PNV+ALTS L ACS NS+L +G+QIHCVA+R GF +
Sbjct: 247  NSLISGCVQNKKYKEAFSIMSTMRPNVVALTSALAACSENSNLWIGKQIHCVALRFGFIY 306

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            +TQ+ N +LDMYAKCGK+   R LF  + Q++VV+WT++I AYGS G G+ ALELFK + 
Sbjct: 307  ETQMCNVMLDMYAKCGKLLNARSLFDGVFQKDVVSWTSMIAAYGSHGHGLGALELFKKLG 366

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            EE SGV PN VTFLAVLSAC HSGLVE+GR C  SMREKYG+DPGPEHYACFID LGRAG
Sbjct: 367  EEGSGVLPNSVTFLAVLSACAHSGLVEEGRACFNSMREKYGLDPGPEHYACFIDTLGRAG 426

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             I+EVW LF+DM   GT  T +V A ++ AC +NLD++RGEF AK L ELEPDKPGNYV 
Sbjct: 427  QIEEVWCLFNDMVKNGTKTTAAVWATLVKACNLNLDVKRGEFAAKQLLELEPDKPGNYVL 486

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
            +SNFYA+VG+WD V+ LRS+M +KGL KE GSSW
Sbjct: 487  LSNFYAAVGKWDSVDNLRSIMRKKGLAKEIGSSW 520



 Score = 59.3 bits (142), Expect = 2e-06
 Identities = 36/109 (33%), Positives = 57/109 (52%)
 Frame = +3

Query: 93  TSGLTACSANSDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQ 272
           T  L ACSA      G+Q+H + I+ G + +  +  AL+DMY+K G +  +   F  I  
Sbjct: 80  TPVLGACSALPAPERGKQVHALMIKGGTDSEPVVKTALMDMYSKYGLLGESVEAFKEIEF 139

Query: 273 RNVVTWTTIIDAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSAC 419
           ++VVTW  ++ ++   G   EA  +F+ M  E  V  +  T  +VL AC
Sbjct: 140 KDVVTWNALLSSFLRHGLAKEAFGVFQAMTRER-VEFSEFTLSSVLKAC 187


>ref|XP_006425640.1| hypothetical protein CICLE_v10025352mg [Citrus clementina]
            gi|557527630|gb|ESR38880.1| hypothetical protein
            CICLE_v10025352mg [Citrus clementina]
          Length = 527

 Score =  369 bits (947), Expect = e-100
 Identities = 181/274 (66%), Positives = 220/274 (80%), Gaps = 1/274 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L+SGCVQN+KY  A SI+S M+PNV+ALTS L ACS NS+L +G+QIHCVA+R GF +
Sbjct: 247  NSLISGCVQNKKYKEAFSIMSTMRPNVVALTSALAACSENSNLWIGKQIHCVALRFGFIY 306

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            +TQ+ N +LDMYAKCGK+   R LF  + Q++VV+WT++I AYGS G G+ ALELFK + 
Sbjct: 307  ETQMCNVMLDMYAKCGKLLNARSLFDGVFQKDVVSWTSMIAAYGSHGHGLGALELFKKLG 366

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            EE SGV PN VTFLAVLSAC HSGLVE+GR C  SMREKYG+DPGPEHYACFID L RAG
Sbjct: 367  EEGSGVLPNSVTFLAVLSACAHSGLVEEGRACFNSMREKYGLDPGPEHYACFIDTLCRAG 426

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             I+EVW LF+DM   GT  T +V AA++ AC +NLD++RGEF AK L ELEPDKPGNYV 
Sbjct: 427  QIEEVWCLFNDMVKNGTKTTAAVWAALVKACNLNLDVKRGEFAAKQLLELEPDKPGNYVL 486

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
            +SNFYA+VG+WD V+ LRS+M +KGL KE GSSW
Sbjct: 487  LSNFYAAVGKWDSVDNLRSIMRKKGLAKEIGSSW 520



 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 37/120 (30%), Positives = 60/120 (50%)
 Frame = +3

Query: 93  TSGLTACSANSDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQ 272
           T  L ACSA      G+Q+H + I+ G + +  +  AL+DMY+K G +  +   F  I  
Sbjct: 80  TPVLGACSALPAPERGKQVHALMIKGGTDSEPVVKTALMDMYSKYGLLGESVEAFKEIEF 139

Query: 273 RNVVTWTTIIDAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSACGHSGLVEQGRE 452
           ++VVTW  ++ ++   G   EA  +F+ M  E  V  +  T  +VL AC        G++
Sbjct: 140 KDVVTWNALLSSFLRHGLAKEAFGVFQAMTRER-VEFSEFTLSSVLKACAQLKAFRLGKQ 198


>emb|CBI32639.3| unnamed protein product [Vitis vinifera]
          Length = 459

 Score =  363 bits (931), Expect = 5e-98
 Identities = 177/274 (64%), Positives = 220/274 (80%), Gaps = 1/274 (0%)
 Frame = +3

Query: 3   NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
           N+L+SGCV+N++Y  A  I+S M+PNV+A+TS L ACS NSDL +G+QIHCVA+R GF +
Sbjct: 172 NSLISGCVRNRRYKEAFLIMSAMRPNVVAVTSALAACSKNSDLWVGKQIHCVAMRFGFTF 231

Query: 183 DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
           DTQL N LLDMYAKCGKI   R LF R+ +++VV+WT++IDAYG+ G G+EAL+LFK M 
Sbjct: 232 DTQLCNVLLDMYAKCGKILNARSLFDRMDKKDVVSWTSMIDAYGNHGHGLEALKLFKKME 291

Query: 363 EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            E + + PN VTFLAVLSAC HSG+VEQG+EC   +++KY +DPGPEHYACFID+LGRAG
Sbjct: 292 GEGNSILPNLVTFLAVLSACAHSGMVEQGQECFNLIQKKYSLDPGPEHYACFIDILGRAG 351

Query: 540 LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
            I+EVW LF++M    T PT +V AAILNAC  NLD+ RGEF AK L ELEP+KPGNYV 
Sbjct: 352 QIEEVWRLFNNMIKNQTKPTAAVWAAILNACSHNLDVSRGEFAAKNLLELEPNKPGNYVL 411

Query: 720 VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
           +SNFYA+VGRWD V ELRS+M +KGL KE G+SW
Sbjct: 412 LSNFYAAVGRWDSVNELRSIMRKKGLVKETGNSW 445



 Score = 62.8 bits (151), Expect = 2e-07
 Identities = 35/110 (31%), Positives = 62/110 (56%)
 Frame = +3

Query: 123 SDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTII 302
           S +  G  +H +AI+ G +  T    AL+DMY+K G++  +  +F  +  ++VVTW T++
Sbjct: 15  SHVRSGNAVHALAIKTGSDTPTVTKTALMDMYSKYGQLGSSVRVFEEVGFKDVVTWNTML 74

Query: 303 DAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSACGHSGLVEQGRE 452
            ++   G   EAL +F+ M +E GV  +  T  ++L AC      +QG++
Sbjct: 75  SSFVRHGRPEEALAVFREMQKE-GVWLSEFTLCSLLKACTLLKAFQQGKQ 123


>ref|XP_002273763.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66500,
            mitochondrial-like, partial [Vitis vinifera]
          Length = 498

 Score =  363 bits (931), Expect = 5e-98
 Identities = 177/274 (64%), Positives = 220/274 (80%), Gaps = 1/274 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L+SGCV+N++Y  A  I+S M+PNV+A+TS L ACS NSDL +G+QIHCVA+R GF +
Sbjct: 211  NSLISGCVRNRRYKEAFLIMSAMRPNVVAVTSALAACSKNSDLWVGKQIHCVAMRFGFTF 270

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            DTQL N LLDMYAKCGKI   R LF R+ +++VV+WT++IDAYG+ G G+EAL+LFK M 
Sbjct: 271  DTQLCNVLLDMYAKCGKILNARSLFDRMDKKDVVSWTSMIDAYGNHGHGLEALKLFKKME 330

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
             E + + PN VTFLAVLSAC HSG+VEQG+EC   +++KY +DPGPEHYACFID+LGRAG
Sbjct: 331  GEGNSILPNLVTFLAVLSACAHSGMVEQGQECFNLIQKKYSLDPGPEHYACFIDILGRAG 390

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             I+EVW LF++M    T PT +V AAILNAC  NLD+ RGEF AK L ELEP+KPGNYV 
Sbjct: 391  QIEEVWRLFNNMIKNQTKPTAAVWAAILNACSHNLDVSRGEFAAKNLLELEPNKPGNYVL 450

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
            +SNFYA+VGRWD V ELRS+M +KGL KE G+SW
Sbjct: 451  LSNFYAAVGRWDSVNELRSIMRKKGLVKETGNSW 484



 Score = 72.8 bits (177), Expect = 1e-10
 Identities = 47/151 (31%), Positives = 83/151 (54%), Gaps = 1/151 (0%)
 Frame = +3

Query: 3   NALVSGCVQNQKYGVAISILSRMKPNVIALTS-GLTACSANSDLLMGRQIHCVAIRNGFE 179
           N+L++  V++     A+S+  R+      L S  LT   A  D + G+Q+H +AI+ G +
Sbjct: 13  NSLIASHVRSGNAVAALSVFRRIHRAGSHLNSYTLTPVLAVLDPICGKQVHALAIKTGSD 72

Query: 180 WDTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMM 359
             T    AL+DMY+K G++  +  +F  +  ++VVTW T++ ++   G   EAL +F+ M
Sbjct: 73  TPTVTKTALMDMYSKYGQLGSSVRVFEEVGFKDVVTWNTMLSSFVRHGRPEEALAVFREM 132

Query: 360 LEESGVSPNPVTFLAVLSACGHSGLVEQGRE 452
            +E GV  +  T  ++L AC      +QG++
Sbjct: 133 QKE-GVWLSEFTLCSLLKACTLLKAFQQGKQ 162


>ref|XP_007029218.1| Tetratricopeptide repeat-like superfamily protein, putative
            [Theobroma cacao] gi|508717823|gb|EOY09720.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative [Theobroma cacao]
          Length = 551

 Score =  352 bits (904), Expect = 7e-95
 Identities = 176/273 (64%), Positives = 208/273 (76%), Gaps = 1/273 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L+ GC +N+K+  A SI+S+M+PNV+ALTS L ACS N DL +G+Q+HCVA+R GF  
Sbjct: 256  NSLIRGCFKNRKFREAFSIMSKMRPNVVALTSALGACSENVDLWIGKQVHCVALRYGFTD 315

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMM- 359
            DTQL N +LDMYAKCGKI   R LF  I  + VV+WT++IDAYGS G G+ ALELFK M 
Sbjct: 316  DTQLCNVILDMYAKCGKILNARSLFDGILHKCVVSWTSMIDAYGSHGHGLAALELFKQMR 375

Query: 360  LEESGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            +E  GV PN VTFLAVLSACGHSG VE+GREC  SMREKYG++P  EHYACFID+LGRAG
Sbjct: 376  VEGKGVVPNSVTFLAVLSACGHSGQVEEGRECFNSMREKYGLNPDQEHYACFIDVLGRAG 435

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             I E W L  DM   G  PT    +A+LNAC +N DI RGEF AK L ELEPDKPGNYV 
Sbjct: 436  QIGEAWSLLDDMIKNGIKPTALTWSALLNACSLNQDIARGEFAAKHLLELEPDKPGNYVL 495

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSS 818
            +SNFYA+VGRWD V+ LR +M +KGL KEAGSS
Sbjct: 496  LSNFYAAVGRWDSVDNLRDIMRKKGLSKEAGSS 528



 Score = 71.6 bits (174), Expect = 3e-10
 Identities = 52/169 (30%), Positives = 83/169 (49%)
 Frame = +3

Query: 78  NVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISITRLLF 257
           N  + +S L+ACS+      G+Q+H + I+ G +  T    AL+++Y+K G +  +   F
Sbjct: 84  NAYSFSSVLSACSSLPGTKHGKQVHGLMIKTGVDAGTVAKTALMNLYSKYGCLGDSVRAF 143

Query: 258 YRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSACGHSGLV 437
             I  ++VVTW  +I ++  QG   EAL++F  M  E  V  +  T  +VL +C      
Sbjct: 144 EEIELKDVVTWNALISSFLRQGLAKEALDVFATMRRER-VQLSEFTLCSVLKSCASLKAF 202

Query: 438 EQGRECIISMREKYGIDPGPEHYACFIDLLGRAGLIDEVWGLFSDMGNM 584
           EQG++ I  +   +G D      A  ID       I E   +FS + NM
Sbjct: 203 EQGKQ-IHGLVVVFGRDLVILSTA-LIDFYSDVECISEALKVFSSLNNM 249


>ref|XP_004486409.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66500,
            mitochondrial-like [Cicer arietinum]
          Length = 528

 Score =  350 bits (898), Expect = 4e-94
 Identities = 168/274 (61%), Positives = 210/274 (76%), Gaps = 1/274 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+LVSGC++N++YG A  ++S +  N +ALTS L  CS   D L G+Q+HCVA+R GF +
Sbjct: 235  NSLVSGCIRNRRYGEAFKVMSLVNLNAVALTSVLVCCSEELDSLTGKQVHCVAVRRGFTF 294

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            DTQL N LLDMYAKCGKIS+ R +F  I Q++V++WT +IDAYG  GCG EA+ELF+ M 
Sbjct: 295  DTQLCNVLLDMYAKCGKISLARSVFDGIFQKDVISWTCMIDAYGRNGCGHEAIELFRKMR 354

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            EE S V PN VTFL+VLSAC HSGLVE+G++C   +REKYGI+P PEHYACFID+LGRAG
Sbjct: 355  EEGSEVLPNSVTFLSVLSACDHSGLVEEGKQCFNLLREKYGIEPDPEHYACFIDILGRAG 414

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             I++VW  + +M  MG+ PT  V  A+LNAC +N D ERGEF AK L +LEPDK  N V 
Sbjct: 415  NIEDVWSTYHNMIEMGSRPTAGVWIALLNACSLNQDFERGEFAAKHLLQLEPDKASNIVL 474

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
            VSNFYA++GRWD V+ELRS+M  KGL KEAG+SW
Sbjct: 475  VSNFYAAIGRWDCVDELRSIMRTKGLVKEAGNSW 508


>ref|XP_003594404.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355483452|gb|AES64655.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 516

 Score =  346 bits (887), Expect = 7e-93
 Identities = 167/274 (60%), Positives = 208/274 (75%), Gaps = 1/274 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+LVSGC++N +Y  A  ++S +KPN +ALTS L  CS  SDLL G+Q+HCVA+R GF +
Sbjct: 229  NSLVSGCIKNGRYREAFKVMSLVKPNAVALTSVLVCCSEESDLLTGKQVHCVAVRQGFTF 288

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELF-KMM 359
            +TQL N LLDMYAKCGKI     +F  I Q++V++WT +ID YG  GCG EA+ELF KMM
Sbjct: 289  ETQLCNVLLDMYAKCGKILQAWSVFDGIFQKDVISWTCMIDGYGRNGCGYEAVELFWKMM 348

Query: 360  LEESGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
             + S V PN VTFL+VLSACGHSGLVE+G++C   M+EKYGIDP PEHYACFID+LGRAG
Sbjct: 349  EDGSEVLPNSVTFLSVLSACGHSGLVEEGKQCFNIMKEKYGIDPEPEHYACFIDILGRAG 408

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             I+EVW  + +M + GT+PT  V  ++LNAC +  D ERGEF AK L +LEP+K  N V 
Sbjct: 409  KIEEVWSAYQNMIDQGTSPTAGVWISLLNACSLGQDFERGEFAAKSLLQLEPNKASNIVL 468

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
             SNFYA++GRWD V ELRS+M EKGL KEAG+SW
Sbjct: 469  ASNFYAAIGRWDCVGELRSMMREKGLVKEAGNSW 502


>ref|XP_004146551.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66500,
            mitochondrial-like [Cucumis sativus]
            gi|449529896|ref|XP_004171934.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g66500,
            mitochondrial-like [Cucumis sativus]
          Length = 544

 Score =  333 bits (853), Expect = 6e-89
 Identities = 161/268 (60%), Positives = 204/268 (76%), Gaps = 1/268 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L+SGCV+N++Y  A S++S+M+PN IALTS L ACS NSDL +G+QIHCV++R+G   
Sbjct: 263  NSLISGCVRNKRYEEAFSLMSKMRPNAIALTSALHACSENSDLWIGKQIHCVSVRHGLTS 322

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            +TQL N LLDMYAKCGK+   R +F  +  +NVV+W+++I  YGS G G++A ELFK+M+
Sbjct: 323  NTQLCNILLDMYAKCGKVLNARAVFDGMCHKNVVSWSSMIQTYGSHGDGLKAFELFKIMV 382

Query: 363  E-ESGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            E  +GV PN VTFL+VLSACGHSGLV+QG+EC    +EKY    GPEHYACFID+LGRAG
Sbjct: 383  EGRTGVLPNSVTFLSVLSACGHSGLVQQGQECFYLAKEKYSSCLGPEHYACFIDVLGRAG 442

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             IDEVW LF DM   G   T  + AA+LNAC  N D+ RGEF AK L +L+P+K GNYV 
Sbjct: 443  KIDEVWSLFHDMEMCGVKITSKIWAAVLNACNHNQDVSRGEFAAKKLLQLDPNKAGNYVL 502

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRK 803
             SNFYAS+G+WD V+ELR +M  KGLRK
Sbjct: 503  ASNFYASIGKWDSVDELRRLMKAKGLRK 530



 Score = 68.2 bits (165), Expect = 4e-09
 Identities = 45/144 (31%), Positives = 74/144 (51%), Gaps = 5/144 (3%)
 Frame = +3

Query: 3   NALVSGCVQNQKYGVAISILSRMKPNVIALTSG-----LTACSANSDLLMGRQIHCVAIR 167
           N+L++  V+  +   A S+ SRM  +   LT+      L ACSA      G+ +H + I+
Sbjct: 61  NSLLTSYVRGCRQSDAWSLFSRMHRSFSPLTAHTLTAVLAACSALPTSQYGQLVHGLIIK 120

Query: 168 NGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALEL 347
            G         A+LDMY+KCG +  +  +F  +  R+VV W +++ ++  +G   EAL +
Sbjct: 121 TGAYSGIVTKTAILDMYSKCGLLDDSVKVFEEMEMRDVVAWNSLLSSFLREGLAEEALNV 180

Query: 348 FKMMLEESGVSPNPVTFLAVLSAC 419
           F+ M  E  V  +  T  +VL AC
Sbjct: 181 FEEMKREK-VEFSEFTLCSVLKAC 203


>ref|XP_007159287.1| hypothetical protein PHAVU_002G225300g [Phaseolus vulgaris]
            gi|561032702|gb|ESW31281.1| hypothetical protein
            PHAVU_002G225300g [Phaseolus vulgaris]
          Length = 551

 Score =  332 bits (850), Expect = 1e-88
 Identities = 159/276 (57%), Positives = 205/276 (74%), Gaps = 3/276 (1%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+LVSGCV+N++YG A  ++  ++PN +ALTS L  CS N DL  G+Q+HCVA+R GF  
Sbjct: 261  NSLVSGCVRNRRYGEAFRVMGSVRPNAVALTSALVGCSENLDLWAGKQMHCVAVRQGFTR 320

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELF---K 353
            +TQL NALLDMY KCGK+S  +LLF  I +++V++WT +IDAYG  G G EA+ LF   +
Sbjct: 321  ETQLCNALLDMYGKCGKVSHAQLLFDGICEKDVISWTCMIDAYGRNGRGPEAVLLFQEMR 380

Query: 354  MMLEESGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGR 533
            M  E   V PN VTFL+VLSACGHSGLVE+G++C   +REKYG++P PEHYAC+ID+LGR
Sbjct: 381  MRKEGRKVLPNSVTFLSVLSACGHSGLVEEGKKCFKLLREKYGLEPDPEHYACYIDILGR 440

Query: 534  AGLIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNY 713
            AG I+ VW  + +M   GT  T  +  A+LNAC +N D+ERGE  AK L +LEP+K    
Sbjct: 441  AGNIEGVWSAYHNMVEQGTRLTAGIWVALLNACSLNQDVERGELAAKHLLQLEPNKSSYI 500

Query: 714  VSVSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
            V VSNFYA++GRWDRV+ELRS+M  KGL KEAG+SW
Sbjct: 501  VLVSNFYAAIGRWDRVDELRSIMRSKGLLKEAGNSW 536



 Score = 59.7 bits (143), Expect = 1e-06
 Identities = 36/130 (27%), Positives = 62/130 (47%)
 Frame = +3

Query: 63  SRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISI 242
           +R   +    TS L AC+      +G Q+H   ++ G +  T    +L+DMY+KCG +  
Sbjct: 84  ARAVVDAYTFTSVLRACTLLHVSQLGIQVHAQMLKTGADSGTVAKTSLVDMYSKCGSLDE 143

Query: 243 TRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSACG 422
              +F  + QR+VV W  ++  +      VEA+ + + M  E+ V  +  T  + L  C 
Sbjct: 144 AVKVFDEMSQRDVVAWNALLSCFLRCDLPVEAVGVLRAMGREN-VEVSEFTLCSALKCCA 202

Query: 423 HSGLVEQGRE 452
               +E GR+
Sbjct: 203 SLRALELGRQ 212


>ref|XP_003539183.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66500,
            mitochondrial-like isoform X1 [Glycine max]
            gi|571489039|ref|XP_006591097.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g66500,
            mitochondrial-like isoform X2 [Glycine max]
          Length = 547

 Score =  332 bits (850), Expect = 1e-88
 Identities = 162/274 (59%), Positives = 205/274 (74%), Gaps = 1/274 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N++VSGCV++++Y  A  ++  ++PN IALTS L  CS N DL  G+QIHCVA+R GF +
Sbjct: 259  NSMVSGCVRSRRYDEAFRVMGFVRPNAIALTSALVGCSENLDLWAGKQIHCVAVRWGFTF 318

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            DTQL NALLDMYAKCG+IS    +F  I +++V++WT +IDAYG  G G EA+E+F+ M 
Sbjct: 319  DTQLCNALLDMYAKCGRISQALSVFDGICEKDVISWTCMIDAYGRNGQGREAVEVFREMR 378

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            E  S V PN VTFL+VLSACGHSGLVE+G+ C   +REKYG+ P PEHYAC+ID+LGRAG
Sbjct: 379  EVGSKVLPNSVTFLSVLSACGHSGLVEEGKNCFKLLREKYGLQPDPEHYACYIDILGRAG 438

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             I+EVW  + +M   GT PT  V  A+LNAC +N D+ERGE  AK L +LEP+K  N V 
Sbjct: 439  NIEEVWSAYHNMVVQGTRPTAGVWVALLNACSLNQDVERGELAAKHLLQLEPNKASNIVL 498

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
            VSNFYA++ RWD VEELRS+M  KGL KEAG+SW
Sbjct: 499  VSNFYAAIDRWDCVEELRSIMRTKGLAKEAGNSW 532


>ref|XP_006604975.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66500,
            mitochondrial-like, partial [Glycine max]
          Length = 547

 Score =  322 bits (826), Expect = 8e-86
 Identities = 158/274 (57%), Positives = 202/274 (73%), Gaps = 1/274 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N++VSGCV++++Y  A  ++  ++PN +ALTS L  CS N DL  G+QIHCVA R  F +
Sbjct: 259  NSMVSGCVRSRRYDEAFRVMGFVRPNAVALTSALVGCSENLDLWAGKQIHCVAFRWAFTF 318

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            DTQL NALLDMYAKCG+IS    +F+ I +++V++WT +IDAYG  G G EA+E+F+ M 
Sbjct: 319  DTQLCNALLDMYAKCGRISQALSVFHGICEKDVISWTCMIDAYGRNGQGREAVEVFREMR 378

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            E  S V PN VTFL+VLSA GHSGLVE+G+ C   +REKYG+ P PEHYAC+ID+LGRAG
Sbjct: 379  EVGSKVLPNSVTFLSVLSASGHSGLVEEGKNCFKLLREKYGLQPDPEHYACYIDILGRAG 438

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             I+EVW  + +M   GT PT  V  A+LNAC +N D+ER E  AK L +LEP+K  N V 
Sbjct: 439  NIEEVWYAYHNMVVQGTRPTAGVWVALLNACSLNQDVERSELAAKHLLQLEPNKASNIVL 498

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
            VSNFYA++ RWD VEELRS+M  KGL KEAG+SW
Sbjct: 499  VSNFYAAIDRWDCVEELRSIMRTKGLAKEAGNSW 532


>ref|XP_002310252.2| hypothetical protein POPTR_0007s13130g [Populus trichocarpa]
            gi|550334777|gb|EEE90702.2| hypothetical protein
            POPTR_0007s13130g [Populus trichocarpa]
          Length = 540

 Score =  317 bits (811), Expect = 4e-84
 Identities = 159/274 (58%), Positives = 203/274 (74%), Gaps = 1/274 (0%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L++GCV++++Y  A  ++S M+PN +ALT+ L ACS NSDL +G QIHCVA+R GF  
Sbjct: 263  NSLIAGCVKHRRYEEAFLVMSTMRPNAVALTTALAACSDNSDLWIGMQIHCVALRFGFIA 322

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            +TQ+ N LLDMYAKCG+I  +R +F  I  + VV+WT++IDAYG  G G EAL+LFK M 
Sbjct: 323  NTQVCNVLLDMYAKCGRILKSRSIFDGICHKTVVSWTSMIDAYGRHGHGDEALKLFKEMG 382

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            +E S V PN +T LAVLSACGHSGLV++G+E   S REKYG+DP  EHY+C ID+LGRAG
Sbjct: 383  QEGSRVLPNSLTLLAVLSACGHSGLVKEGQELFNSAREKYGLDPSQEHYSCVIDILGRAG 442

Query: 540  LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
             I++ W LF DM   G  PT +V AA++NAC +NLD+ RGEF AK L ELEP+  G +V 
Sbjct: 443  QIEDAWCLFHDMVKKGIGPTAAVWAALVNACCLNLDVSRGEFAAKHLLELEPNNDGIHVL 502

Query: 720  VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSSW 821
            VS FYAS+ RWD VE LR+ M +KGL K  GSSW
Sbjct: 503  VSKFYASIDRWDVVESLRNNMRKKGLTKVLGSSW 536



 Score = 73.6 bits (179), Expect = 9e-11
 Identities = 51/163 (31%), Positives = 76/163 (46%), Gaps = 4/163 (2%)
 Frame = +3

Query: 93  TSGLTACSANSDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQ 272
           T  L ACSA  D   GRQ+H + I+ G +  T    A++DMY+K G +  +  +F  +  
Sbjct: 96  TPVLRACSALPDTKCGRQVHALMIKTGTDLGTITKTAVMDMYSKYGCLGESVKVFEEMEF 155

Query: 273 RNVVTWTTIIDAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSACGHSGLVEQGRE 452
           R+VVTW  ++ ++   G   EAL +F+ M  ES V     T  +VL AC       QG++
Sbjct: 156 RDVVTWNALVSSFLRHGLAKEALGVFRAMRRES-VEITEFTLCSVLKACAFIKAFRQGKQ 214

Query: 453 ----CIISMREKYGIDPGPEHYACFIDLLGRAGLIDEVWGLFS 569
                I+  R+   +          ID     G I E   +FS
Sbjct: 215 VHGLVIVMGRDLVVLG------TALIDFYSNVGYISEAMKVFS 251


>gb|EAY91060.1| hypothetical protein OsI_12669 [Oryza sativa Indica Group]
          Length = 449

 Score =  291 bits (744), Expect = 3e-76
 Identities = 144/273 (52%), Positives = 194/273 (71%), Gaps = 1/273 (0%)
 Frame = +3

Query: 3   NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
           NA++SGCV+N ++  A  IL R++ N I LT  LTACSA ++L+ G Q+HC A+R GF  
Sbjct: 171 NAVISGCVENGRFREAFFILGRIELNGITLTCALTACSATANLMYGMQVHCKALRGGFTL 230

Query: 183 DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
           +T L NAL+DMYAKCG+ +  R++F R+  RNVV+W+++IDAY   G G  AL+LFK M 
Sbjct: 231 ETILCNALIDMYAKCGRTTAARMVFDRMACRNVVSWSSMIDAYSHHGHGEAALDLFKRMD 290

Query: 363 EESGVS-PNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
           E   V  PN +TFLAVLSACG SGLV++GR     M+ +YGI+PGPEHYACFIDLLGRAG
Sbjct: 291 ETVPVVLPNAITFLAVLSACGQSGLVDEGRAMFHLMKRQYGINPGPEHYACFIDLLGRAG 350

Query: 540 LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
            IDE W L+       +  +GS+  A+LNACR N+D+ RG  VA  L E++P+ PG++V 
Sbjct: 351 QIDEAWDLYCSFSTTRSELSGSICVAMLNACRANMDVVRGNKVALHLLEVDPENPGSHVL 410

Query: 720 VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSS 818
           +SNF+A+  +W   +E R ++ +KGLRKEA SS
Sbjct: 411 ISNFHAAARQWFESDESRRIIIDKGLRKEAASS 443


>ref|NP_001050692.1| Os03g0624800 [Oryza sativa Japonica Group]
           gi|40737026|gb|AAR89039.1| putative pentatricopeptide
           repeat protein [Oryza sativa Japonica Group]
           gi|108709901|gb|ABF97696.1| pentatricopeptide, putative,
           expressed [Oryza sativa Japonica Group]
           gi|108709902|gb|ABF97697.1| pentatricopeptide, putative,
           expressed [Oryza sativa Japonica Group]
           gi|113549163|dbj|BAF12606.1| Os03g0624800 [Oryza sativa
           Japonica Group] gi|215712410|dbj|BAG94537.1| unnamed
           protein product [Oryza sativa Japonica Group]
          Length = 449

 Score =  291 bits (744), Expect = 3e-76
 Identities = 144/273 (52%), Positives = 194/273 (71%), Gaps = 1/273 (0%)
 Frame = +3

Query: 3   NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
           NA++SGCV+N ++  A  IL R++ N I LT  LTACSA ++L+ G Q+HC A+R GF  
Sbjct: 171 NAVISGCVENGRFREAFFILGRIELNGITLTCALTACSATANLMYGMQVHCKALRGGFTL 230

Query: 183 DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
           +T L NAL+DMYAKCG+ +  R++F R+  RNVV+W+++IDAY   G G  AL+LFK M 
Sbjct: 231 ETILCNALIDMYAKCGRTTAARMVFDRMACRNVVSWSSMIDAYSHHGHGEAALDLFKRMD 290

Query: 363 EESGVS-PNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
           E   V  PN +TFLAVLSACG SGLV++GR     M+ +YGI+PGPEHYACFIDLLGRAG
Sbjct: 291 ETVPVVLPNAITFLAVLSACGQSGLVDEGRAMFHLMKRQYGINPGPEHYACFIDLLGRAG 350

Query: 540 LIDEVWGLFSDMGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFELEPDKPGNYVS 719
            IDE W L+       +  +GS+  A+LNACR N+D+ RG  VA  L E++P+ PG++V 
Sbjct: 351 QIDEAWDLYCSFSTTRSELSGSICVAMLNACRANMDVVRGNKVALHLLEVDPENPGSHVL 410

Query: 720 VSNFYASVGRWDRVEELRSVMNEKGLRKEAGSS 818
           +SNF+A+  +W   +E R ++ +KGLRKEA SS
Sbjct: 411 ISNFHAAARQWFESDESRRIIIDKGLRKEAASS 443


>ref|NP_201451.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75171135|sp|Q9FJY9.1|PP448_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g66500, mitochondrial; Flags: Precursor
            gi|10177531|dbj|BAB10926.1| unnamed protein product
            [Arabidopsis thaliana] gi|332010838|gb|AED98221.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 532

 Score =  289 bits (739), Expect = 1e-75
 Identities = 144/275 (52%), Positives = 195/275 (70%), Gaps = 3/275 (1%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L+SGC++N+ Y  A  ++SR +PNV  L+S L  CS NSDL +G+QIHCVA+RNGF  
Sbjct: 255  NSLISGCIRNRNYKEAFLLMSRQRPNVRVLSSSLAGCSDNSDLWIGKQIHCVALRNGFVS 314

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            D++L N L+DMY KCG+I   R +F  IP ++VV+WT++IDAY   G GV+ALE+F+ M 
Sbjct: 315  DSKLCNGLMDMYGKCGQIVQARTIFRAIPSKSVVSWTSMIDAYAVNGDGVKALEIFREMC 374

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            EE SGV PN VTFL V+SAC H+GLV++G+EC   M+EKY + PG EHY CFID+L +AG
Sbjct: 375  EEGSGVLPNSVTFLVVISACAHAGLVKEGKECFGMMKEKYRLVPGTEHYVCFIDILSKAG 434

Query: 540  LIDEVWGLFSD-MGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLF-ELEPDKPGNY 713
              +E+W L    M N   +   ++  A+L+AC +N+D+ RGE+VA+ L  E  P+    Y
Sbjct: 435  ETEEIWRLVERMMENDNQSIPCAIWVAVLSACSLNMDLTRGEYVARRLMEETGPENASIY 494

Query: 714  VSVSNFYASVGRWDRVEELRSVMNEKGLRKEAGSS 818
            V VSNFYA++G+WD VEELR  +  KGL K AG S
Sbjct: 495  VLVSNFYAAMGKWDVVEELRGKLKNKGLVKTAGHS 529



 Score = 65.9 bits (159), Expect = 2e-08
 Identities = 38/120 (31%), Positives = 64/120 (53%)
 Frame = +3

Query: 93  TSGLTACSANSDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQ 272
           T  L ACS  S    GRQ+H + I+ G E  T    AL+DMY+K G +  +  +F  + +
Sbjct: 88  TPVLGACSLLSYPETGRQVHALMIKQGAETGTISKTALIDMYSKYGHLVDSVRVFESVEE 147

Query: 273 RNVVTWTTIIDAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSACGHSGLVEQGRE 452
           +++V+W  ++  +   G G EAL +F  M  E  V  +  T  +V+  C    +++QG++
Sbjct: 148 KDLVSWNALLSGFLRNGKGKEALGVFAAMYRER-VEISEFTLSSVVKTCASLKILQQGKQ 206


>ref|XP_002865063.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297310898|gb|EFH41322.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 532

 Score =  288 bits (736), Expect = 2e-75
 Identities = 147/276 (53%), Positives = 193/276 (69%), Gaps = 4/276 (1%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L+SGC++N+ Y  A  ++SR +PNV  L+S L  CS NSDL +G+QIHCVA+RNGF  
Sbjct: 255  NSLISGCIRNRNYKEAFLLMSRKRPNVRVLSSCLAGCSDNSDLWIGKQIHCVALRNGFVS 314

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            D +L N L+DMY KCG+I   R LF  I  ++VV+WT++IDAY   G GV+ALE+F+ M 
Sbjct: 315  DIKLCNGLMDMYGKCGQIVQARTLFRAISSKSVVSWTSMIDAYAVNGDGVKALEIFREMC 374

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            EE SGV PN VTFL VLSAC H+GLVE+G+EC   M+EKY + PG EHY CFID+L +AG
Sbjct: 375  EEGSGVLPNSVTFLVVLSACAHAGLVEEGKECFGMMKEKYRLVPGTEHYVCFIDILSKAG 434

Query: 540  LIDEVWGLFSDM--GNMGTTPTGSVLAAILNACRVNLDIERGEFVA-KCLFELEPDKPGN 710
              +E+W L   M   N    P  ++  A+L+AC +N+D+ RGE+VA K + E  P+    
Sbjct: 435  DTEEIWRLVERMMENNKRNIPC-AIWVAVLSACSLNMDVTRGEYVARKLMEETGPENASI 493

Query: 711  YVSVSNFYASVGRWDRVEELRSVMNEKGLRKEAGSS 818
            YV VSNFYA++G+WD VEELR  +  KGL K AG S
Sbjct: 494  YVLVSNFYAAIGKWDVVEELRGKLKNKGLVKAAGHS 529



 Score = 63.9 bits (154), Expect = 7e-08
 Identities = 37/120 (30%), Positives = 64/120 (53%)
 Frame = +3

Query: 93  TSGLTACSANSDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQ 272
           T  L AC+  S    GRQ+H + I+ G E  T    AL++MY+K G +  +  +F  + +
Sbjct: 88  TPVLGACALLSYPETGRQVHALMIKQGAETGTISKTALINMYSKYGHLVDSVRVFESVEE 147

Query: 273 RNVVTWTTIIDAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSACGHSGLVEQGRE 452
           ++VV+W  ++  +   G G EAL +F  M  E  V  +  T  +V+  C    +++QG++
Sbjct: 148 KDVVSWNALLSGFLRNGKGKEALGVFAAMCRER-VEISEFTLSSVVKTCASLKILQQGKQ 206


>gb|ADQ43215.1| unknown [Eutrema parvulum]
          Length = 527

 Score =  286 bits (731), Expect = 8e-75
 Identities = 142/276 (51%), Positives = 191/276 (69%), Gaps = 4/276 (1%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L+SGC++N+ Y  A  ++SR +PNV  L S L  CS NSD+ +G+QIHC A+RNGF  
Sbjct: 250  NSLISGCIRNRNYREAFLLMSRQRPNVRVLVSSLAGCSDNSDMWIGKQIHCAALRNGFVS 309

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            DT+L N L+DMY KCG+I   R LF  I  + VV+WT++IDAY   G GV+AL++F+ M 
Sbjct: 310  DTRLCNGLMDMYGKCGRIVQARTLFRAIQSKTVVSWTSMIDAYAVNGDGVKALQIFREMC 369

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            EE +GV PN VTFL VLSAC H+GLVE+G+EC   M+EKYG+ PG EHY CFID+L +AG
Sbjct: 370  EEGTGVLPNSVTFLVVLSACAHAGLVEEGKECFGMMKEKYGLAPGTEHYVCFIDILSKAG 429

Query: 540  LIDEVWGLFSDM--GNMGTTPTGSVLAAILNACRVNLDIERGEFVA-KCLFELEPDKPGN 710
              +E+W L   M   N    P  ++  A+L+AC +N+D+ RGEF A + + E  P+    
Sbjct: 430  DTEEIWRLVERMMVNNNRNLPC-AIWVAVLSACSLNMDVTRGEFAARRVMEETGPENASI 488

Query: 711  YVSVSNFYASVGRWDRVEELRSVMNEKGLRKEAGSS 818
            YV VSNFYA++G+W  VEE+R  + +KGL K AG S
Sbjct: 489  YVLVSNFYAAIGKWHMVEEMREKLKKKGLVKAAGRS 524



 Score = 68.9 bits (167), Expect = 2e-09
 Identities = 39/120 (32%), Positives = 64/120 (53%)
 Frame = +3

Query: 93  TSGLTACSANSDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQ 272
           T  L AC+  S   MGRQ+H +  + G E  T    AL+DMY+K G++  +  +F  +  
Sbjct: 83  TPVLGACALLSYPEMGRQVHALMFKQGAETGTISKTALIDMYSKYGQLVDSVTVFESVED 142

Query: 273 RNVVTWTTIIDAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSACGHSGLVEQGRE 452
           ++VV+W  ++  +   G G EAL LF  M  E  V  +  T  +V+  C    +++QG++
Sbjct: 143 KDVVSWNALLSGFLRNGKGKEALGLFAAMYREK-VEISEFTLCSVVKTCASLKILQQGKQ 201


>ref|XP_006281458.1| hypothetical protein CARUB_v10027537mg [Capsella rubella]
            gi|482550162|gb|EOA14356.1| hypothetical protein
            CARUB_v10027537mg [Capsella rubella]
          Length = 530

 Score =  283 bits (724), Expect = 5e-74
 Identities = 145/275 (52%), Positives = 189/275 (68%), Gaps = 3/275 (1%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L+SGC++N+ Y  A  ++SR +PNV  L S L  CS N+DL +G+QIHCVA+RNGF  
Sbjct: 253  NSLISGCIRNRDYKEAFLLMSRQRPNVRVLISSLAGCSDNADLWIGKQIHCVALRNGFVS 312

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            D++L N L+DMY KCG+I     +F  I  ++VV+WT++IDAY   G GV+ALE+F+ M 
Sbjct: 313  DSKLCNGLMDMYGKCGQIVQAHTIFRSISYKSVVSWTSMIDAYAVNGDGVKALEIFREMC 372

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            EE SGV PN VTFL VLSAC H+GLVE+G+EC   M+EKY + PG EHY CFID+L +AG
Sbjct: 373  EEGSGVLPNSVTFLVVLSACAHAGLVEEGKECFGMMKEKYRLVPGTEHYVCFIDILSKAG 432

Query: 540  LIDEVWGLFSD-MGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFE-LEPDKPGNY 713
              +E+W L    M N       +V  A+L+AC +N+D  RGE+VA  L E   P+    Y
Sbjct: 433  DTEEIWRLVERMMENNNRNIPCAVWVAVLSACSLNMDATRGEYVASRLMEDSGPENASIY 492

Query: 714  VSVSNFYASVGRWDRVEELRSVMNEKGLRKEAGSS 818
            V VSNFYA++G+WD VEELR  M  KGL K AG S
Sbjct: 493  VLVSNFYAAIGKWDVVEELRGKMKNKGLVKAAGHS 527



 Score = 64.7 bits (156), Expect = 4e-08
 Identities = 53/185 (28%), Positives = 91/185 (49%)
 Frame = +3

Query: 93  TSGLTACSANSDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQ 272
           T  L AC+  S    GRQ+H + I+ G E  T    AL++MY+K G +  +  +F  + +
Sbjct: 86  TPVLGACALLSYPETGRQVHALMIKEGAETGTISKTALINMYSKYGHLVDSITVFESVEE 145

Query: 273 RNVVTWTTIIDAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSACGHSGLVEQGRE 452
           ++VV+W  ++  +   G G EAL +F  M  E  V  +  T  +V+  C    +++QG++
Sbjct: 146 KDVVSWNALLSGFLRNGKGQEALGVFAAMYRER-VEISEFTLSSVVKTCASLKILQQGKQ 204

Query: 453 CIISMREKYGIDPGPEHYACFIDLLGRAGLIDEVWGLFSDMGNMGTTPTGSVLAAILNAC 632
            + SM    G D      A  I+     GLI E   ++  + N+ T     +L ++++ C
Sbjct: 205 -VHSMVLVTGRDLVVLGTA-MINFYSSVGLISEAMKVYLSL-NVHTDEV--MLNSLISGC 259

Query: 633 RVNLD 647
             N D
Sbjct: 260 IRNRD 264


>ref|XP_006393844.1| hypothetical protein EUTSA_v10003984mg [Eutrema salsugineum]
            gi|78499702|gb|ABB45856.1| hypothetical protein [Eutrema
            halophilum] gi|557090483|gb|ESQ31130.1| hypothetical
            protein EUTSA_v10003984mg [Eutrema salsugineum]
          Length = 527

 Score =  278 bits (712), Expect = 1e-72
 Identities = 138/275 (50%), Positives = 190/275 (69%), Gaps = 3/275 (1%)
 Frame = +3

Query: 3    NALVSGCVQNQKYGVAISILSRMKPNVIALTSGLTACSANSDLLMGRQIHCVAIRNGFEW 182
            N+L+SGC++N+KY  A  ++ R +PNV  L S L  CS NSDL +G+QIHCVA+RNGF  
Sbjct: 250  NSLISGCIRNRKYKEAFLLMRRQRPNVRVLISSLAGCSDNSDLWIGKQIHCVALRNGFVS 309

Query: 183  DTQLGNALLDMYAKCGKISITRLLFYRIPQRNVVTWTTIIDAYGSQGCGVEALELFKMML 362
            D++L N L+DMY KCG+I     +F  +P ++VV+WT++I AY   G GV+A+E+F+ M 
Sbjct: 310  DSKLCNGLMDMYGKCGQIVQACTMFSAMPSKSVVSWTSMIGAYAVNGDGVKAIEIFREMC 369

Query: 363  EE-SGVSPNPVTFLAVLSACGHSGLVEQGRECIISMREKYGIDPGPEHYACFIDLLGRAG 539
            +E S V PN VTFL VLSAC H+GLVE+G+EC   M+E+Y + PG EHY CFID+L +AG
Sbjct: 370  KEGSEVLPNLVTFLVVLSACAHAGLVEEGKECFGMMKERYRLVPGTEHYVCFIDILSKAG 429

Query: 540  LIDEVWGLFSD-MGNMGTTPTGSVLAAILNACRVNLDIERGEFVAKCLFE-LEPDKPGNY 713
              +E+W L    M N       ++  A+LNAC +N+D+ RGE+ A  L E   P+    Y
Sbjct: 430  DTEEIWRLVGRLMENNNRNLPCAIWVAVLNACSLNMDVSRGEYAAMMLMEGTGPENASIY 489

Query: 714  VSVSNFYASVGRWDRVEELRSVMNEKGLRKEAGSS 818
            V +SNFYA++G+WD+VEELR  + +KGL K AG S
Sbjct: 490  VLLSNFYAAIGKWDKVEELRGKLKKKGLVKAAGHS 524



 Score = 68.2 bits (165), Expect = 4e-09
 Identities = 37/120 (30%), Positives = 65/120 (54%)
 Frame = +3

Query: 93  TSGLTACSANSDLLMGRQIHCVAIRNGFEWDTQLGNALLDMYAKCGKISITRLLFYRIPQ 272
           T  L AC+  S    GRQ+H + I+ G E  T    AL+DMY+K G +  +  +F  +  
Sbjct: 83  TPVLGACALLSSPKTGRQVHALMIKQGAETGTISKTALIDMYSKYGHLVDSVTVFESVED 142

Query: 273 RNVVTWTTIIDAYGSQGCGVEALELFKMMLEESGVSPNPVTFLAVLSACGHSGLVEQGRE 452
           ++VV+W +++  +   G G EAL +F  M  E+ +  +  T  +V+  C    +++QG++
Sbjct: 143 KDVVSWNSLLSGFLRNGKGKEALGVFAAMYRET-IEISEFTLCSVVKTCASLKILQQGKQ 201

