BLASTX nr result

ID: Akebia25_contig00049329 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00049329
         (780 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI20053.3| unnamed protein product [Vitis vinifera]              229   8e-58
ref|XP_002269015.1| PREDICTED: pentatricopeptide repeat-containi...   229   8e-58
ref|XP_007041365.1| Pentatricopeptide repeat superfamily protein...   225   2e-56
ref|XP_007041360.1| Pentatricopeptide repeat (PPR) superfamily p...   225   2e-56
ref|XP_004137128.1| PREDICTED: pentatricopeptide repeat-containi...   222   1e-55
ref|XP_004168722.1| PREDICTED: pentatricopeptide repeat-containi...   221   2e-55
ref|XP_002519129.1| pentatricopeptide repeat-containing protein,...   217   3e-54
ref|XP_006346504.1| PREDICTED: pentatricopeptide repeat-containi...   207   4e-51
gb|EXC05161.1| hypothetical protein L484_003967 [Morus notabilis]     206   8e-51
ref|XP_007201524.1| hypothetical protein PRUPE_ppa015625mg, part...   205   1e-50
ref|XP_004230838.1| PREDICTED: pentatricopeptide repeat-containi...   205   1e-50
ref|XP_006849567.1| hypothetical protein AMTR_s00024p00183850 [A...   203   6e-50
ref|XP_003540687.1| PREDICTED: pentatricopeptide repeat-containi...   202   1e-49
ref|XP_004292464.1| PREDICTED: pentatricopeptide repeat-containi...   200   4e-49
ref|NP_179165.1| pentatricopeptide repeat-containing protein [Ar...   199   1e-48
ref|XP_006423135.1| hypothetical protein CICLE_v10030406mg [Citr...   198   2e-48
ref|XP_007161634.1| hypothetical protein PHAVU_001G085800g [Phas...   194   3e-47
ref|XP_006297214.1| hypothetical protein CARUB_v10013223mg [Caps...   192   9e-47
ref|XP_002883912.1| pentatricopeptide repeat-containing protein ...   186   1e-44
ref|XP_006409538.1| hypothetical protein EUTSA_v10022592mg [Eutr...   179   1e-42

>emb|CBI20053.3| unnamed protein product [Vitis vinifera]
          Length = 634

 Score =  229 bits (584), Expect = 8e-58
 Identities = 117/227 (51%), Positives = 160/227 (70%), Gaps = 6/227 (2%)
 Frame = -1

Query: 663 SSTFDFS------SISHTEAPPVQNKDTTEPLLEKSILESNWNSIQEIYPTLSPYLIQNV 502
           SSTF FS      S++ +E+ P      TE ++ KS+L S W+ I+++ P L+P LI NV
Sbjct: 24  SSTFHFSFNRNFNSLASSESTP----PITEEVISKSVLSSQWHFIEQVSPNLTPALISNV 79

Query: 501 LFKLHKSPNVILGFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNK 322
           L+ L   P ++  FI  L     D K +CLAVV+++ LPSP+  L LLK+ +  +  TN+
Sbjct: 80  LYNLCSKPQLVSDFIHHLHPHCLDTKSYCLAVVLLARLPSPKLALQLLKQVMGTRIATNR 139

Query: 321 QIFDELVIARGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVL 142
           ++FDEL ++R +L+  +S VFDLL+  CC+L+ ADEA + FY MKE  I+P+IETCN++L
Sbjct: 140 ELFDELTLSRDRLSVKSSIVFDLLVRVCCELRRADEAFKCFYMMKEKGIVPKIETCNDML 199

Query: 141 SLFLKQNRTETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           SLFLK NR E AWVL+AEMFRL++ STV TFNIM+NVLCKEGKLKKA
Sbjct: 200 SLFLKLNRMEMAWVLYAEMFRLRISSTVYTFNIMVNVLCKEGKLKKA 246


>ref|XP_002269015.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15630,
           mitochondrial-like [Vitis vinifera]
          Length = 656

 Score =  229 bits (584), Expect = 8e-58
 Identities = 117/227 (51%), Positives = 160/227 (70%), Gaps = 6/227 (2%)
 Frame = -1

Query: 663 SSTFDFS------SISHTEAPPVQNKDTTEPLLEKSILESNWNSIQEIYPTLSPYLIQNV 502
           SSTF FS      S++ +E+ P      TE ++ KS+L S W+ I+++ P L+P LI NV
Sbjct: 46  SSTFHFSFNRNFNSLASSESTP----PITEEVISKSVLSSQWHFIEQVSPNLTPALISNV 101

Query: 501 LFKLHKSPNVILGFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNK 322
           L+ L   P ++  FI  L     D K +CLAVV+++ LPSP+  L LLK+ +  +  TN+
Sbjct: 102 LYNLCSKPQLVSDFIHHLHPHCLDTKSYCLAVVLLARLPSPKLALQLLKQVMGTRIATNR 161

Query: 321 QIFDELVIARGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVL 142
           ++FDEL ++R +L+  +S VFDLL+  CC+L+ ADEA + FY MKE  I+P+IETCN++L
Sbjct: 162 ELFDELTLSRDRLSVKSSIVFDLLVRVCCELRRADEAFKCFYMMKEKGIVPKIETCNDML 221

Query: 141 SLFLKQNRTETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           SLFLK NR E AWVL+AEMFRL++ STV TFNIM+NVLCKEGKLKKA
Sbjct: 222 SLFLKLNRMEMAWVLYAEMFRLRISSTVYTFNIMVNVLCKEGKLKKA 268


>ref|XP_007041365.1| Pentatricopeptide repeat superfamily protein isoform 6 [Theobroma
           cacao] gi|508705300|gb|EOX97196.1| Pentatricopeptide
           repeat superfamily protein isoform 6 [Theobroma cacao]
          Length = 494

 Score =  225 bits (573), Expect = 2e-56
 Identities = 116/251 (46%), Positives = 161/251 (64%)
 Frame = -1

Query: 753 NKIKIHKAFDLGTFKQIKFKDLYYINPGQCSSTFDFSSISHTEAPPVQNKDTTEPLLEKS 574
           N +K HK       K+ K K +   +   CSST   S +  ++     +   +  LL +S
Sbjct: 14  NNMKAHKVLSSQILKRKKLKTVIPHSSALCSST---SQLVTSDQSQTASSQISPELLIES 70

Query: 573 ILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPNVILGFIEELGFVRFDLKCFCLAVVIIS 394
           +  S W+ I+     L+P +I  VL  LHK+P + L F   + F R D+K  CLA+ + S
Sbjct: 71  VRSSQWHFIKHQSSDLNPSVISTVLLNLHKTPELALQFTSHIEFQRLDVKTRCLAIAVAS 130

Query: 393 HLPSPQPTLHLLKRAIDGKYVTNKQIFDELVIARGQLNSSNSRVFDLLISSCCKLKMADE 214
            LPSP+PTL LLK+ I     +   IFDEL +AR +L  S + +FDLLI +CC++K  DE
Sbjct: 131 RLPSPKPTLQLLKQTIYSDIASVTVIFDELALARDRLGISTTILFDLLIRACCEMKRVDE 190

Query: 213 ALEIFYFMKEISILPQIETCNEVLSLFLKQNRTETAWVLFAEMFRLKMKSTVVTFNIMIN 34
            LE FY MK+  ++P+IETCN++LS FLK NRTE+AWVL+AEMF++++KS++ TFNIMIN
Sbjct: 191 GLECFYMMKDKGLIPKIETCNDMLSTFLKLNRTESAWVLYAEMFKMRIKSSIYTFNIMIN 250

Query: 33  VLCKEGKLKKA 1
           VLCKEGKLKKA
Sbjct: 251 VLCKEGKLKKA 261


>ref|XP_007041360.1| Pentatricopeptide repeat (PPR) superfamily protein, putative
           isoform 1 [Theobroma cacao]
           gi|590682507|ref|XP_007041361.1| Pentatricopeptide
           repeat (PPR) superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590682510|ref|XP_007041362.1|
           Pentatricopeptide repeat (PPR) superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|590682513|ref|XP_007041363.1| Pentatricopeptide
           repeat (PPR) superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590682516|ref|XP_007041364.1|
           Pentatricopeptide repeat (PPR) superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|590682524|ref|XP_007041366.1| Pentatricopeptide
           repeat (PPR) superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508705295|gb|EOX97191.1|
           Pentatricopeptide repeat (PPR) superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508705296|gb|EOX97192.1| Pentatricopeptide repeat
           (PPR) superfamily protein, putative isoform 1 [Theobroma
           cacao] gi|508705297|gb|EOX97193.1| Pentatricopeptide
           repeat (PPR) superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508705298|gb|EOX97194.1|
           Pentatricopeptide repeat (PPR) superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508705299|gb|EOX97195.1| Pentatricopeptide repeat
           (PPR) superfamily protein, putative isoform 1 [Theobroma
           cacao] gi|508705301|gb|EOX97197.1| Pentatricopeptide
           repeat (PPR) superfamily protein, putative isoform 1
           [Theobroma cacao]
          Length = 650

 Score =  225 bits (573), Expect = 2e-56
 Identities = 116/251 (46%), Positives = 161/251 (64%)
 Frame = -1

Query: 753 NKIKIHKAFDLGTFKQIKFKDLYYINPGQCSSTFDFSSISHTEAPPVQNKDTTEPLLEKS 574
           N +K HK       K+ K K +   +   CSST   S +  ++     +   +  LL +S
Sbjct: 14  NNMKAHKVLSSQILKRKKLKTVIPHSSALCSST---SQLVTSDQSQTASSQISPELLIES 70

Query: 573 ILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPNVILGFIEELGFVRFDLKCFCLAVVIIS 394
           +  S W+ I+     L+P +I  VL  LHK+P + L F   + F R D+K  CLA+ + S
Sbjct: 71  VRSSQWHFIKHQSSDLNPSVISTVLLNLHKTPELALQFTSHIEFQRLDVKTRCLAIAVAS 130

Query: 393 HLPSPQPTLHLLKRAIDGKYVTNKQIFDELVIARGQLNSSNSRVFDLLISSCCKLKMADE 214
            LPSP+PTL LLK+ I     +   IFDEL +AR +L  S + +FDLLI +CC++K  DE
Sbjct: 131 RLPSPKPTLQLLKQTIYSDIASVTVIFDELALARDRLGISTTILFDLLIRACCEMKRVDE 190

Query: 213 ALEIFYFMKEISILPQIETCNEVLSLFLKQNRTETAWVLFAEMFRLKMKSTVVTFNIMIN 34
            LE FY MK+  ++P+IETCN++LS FLK NRTE+AWVL+AEMF++++KS++ TFNIMIN
Sbjct: 191 GLECFYMMKDKGLIPKIETCNDMLSTFLKLNRTESAWVLYAEMFKMRIKSSIYTFNIMIN 250

Query: 33  VLCKEGKLKKA 1
           VLCKEGKLKKA
Sbjct: 251 VLCKEGKLKKA 261


>ref|XP_004137128.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15630,
           mitochondrial-like [Cucumis sativus]
          Length = 628

 Score =  222 bits (565), Expect = 1e-55
 Identities = 118/224 (52%), Positives = 151/224 (67%), Gaps = 8/224 (3%)
 Frame = -1

Query: 648 FSSISHTEAP---PVQNKDTTEPL----LEKSILESNWNSIQEIYPTLSPYLIQNVLFKL 490
           FSSIS  + P   PV   +   PL    LE+S   S W+ I+++  +L+P LI   L  L
Sbjct: 17  FSSISLQKTPLESPVSTTNLASPLTPHFLEQSARSSQWHFIKQVESSLTPSLISQTLLNL 76

Query: 489 HKSPNVILGFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTN-KQIF 313
           H+SP V+L F+        D +  CLA+VI++ LPSP+P LHLLK+A+ G    + ++IF
Sbjct: 77  HESPQVVLDFLNHFHHKLSDARTLCLAIVIVARLPSPKPALHLLKQALGGGTTNSIREIF 136

Query: 312 DELVIARGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLF 133
           + L  +R +L   +S VFD LI SCC +  ADEA E FY MKE  +LP IETCN +LSLF
Sbjct: 137 EFLAASRDRLGFKSSIVFDYLIKSCCDMNRADEAFECFYTMKEKGVLPTIETCNSLLSLF 196

Query: 132 LKQNRTETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           LK NRTE AWVL+AEMFRL++KS+V TFNIMINVLCKEGKLKKA
Sbjct: 197 LKLNRTEAAWVLYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKA 240


>ref|XP_004168722.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15630,
           mitochondrial-like [Cucumis sativus]
          Length = 628

 Score =  221 bits (563), Expect = 2e-55
 Identities = 117/224 (52%), Positives = 151/224 (67%), Gaps = 8/224 (3%)
 Frame = -1

Query: 648 FSSISHTEAP---PVQNKDTTEPL----LEKSILESNWNSIQEIYPTLSPYLIQNVLFKL 490
           FSSIS  + P   PV   +   PL    LE+S   S W+ I+++  +L+P LI   L  L
Sbjct: 17  FSSISLQQTPLESPVSTTNLASPLTPHFLEQSARSSQWHFIKQVESSLTPSLISQTLLNL 76

Query: 489 HKSPNVILGFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTN-KQIF 313
           H+SP V+L F+        D +  CLA+VI++ LPSP+P LHLL++A+ G    + ++IF
Sbjct: 77  HESPQVVLDFLNHFHHKLSDARTLCLAIVIVARLPSPKPALHLLRQALGGGTTNSIREIF 136

Query: 312 DELVIARGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLF 133
           + L  +R +L   +S VFD LI SCC +  ADEA E FY MKE  +LP IETCN +LSLF
Sbjct: 137 EFLAASRDRLGFKSSIVFDYLIKSCCDMNRADEAFECFYTMKEKGVLPTIETCNSLLSLF 196

Query: 132 LKQNRTETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           LK NRTE AWVL+AEMFRL++KS+V TFNIMINVLCKEGKLKKA
Sbjct: 197 LKLNRTEAAWVLYAEMFRLRIKSSVYTFNIMINVLCKEGKLKKA 240


>ref|XP_002519129.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223541792|gb|EEF43340.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 643

 Score =  217 bits (553), Expect = 3e-54
 Identities = 121/241 (50%), Positives = 152/241 (63%), Gaps = 7/241 (2%)
 Frame = -1

Query: 702 KFKDLYYINPGQCSSTFDFSSISHTEAPPVQ-NKDTTEPL------LEKSILESNWNSIQ 544
           K K    I+    SST     I H  A  +  N  T  PL      L  SI  S W+ I+
Sbjct: 16  KLKPSILISYAHFSST-PIPIIDHLHAETLHPNASTDSPLVITHQSLLDSIQSSQWHLIK 74

Query: 543 EIYPTLSPYLIQNVLFKLHKSPNVILGFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLH 364
            + P LSP LI   L  LHK  ++ L F+  +GF   D+K  CLAV ++S  PSP+ TLH
Sbjct: 75  HLAPNLSPSLISATLLSLHKKSDLALQFVTHIGFKGLDIKTKCLAVAVVSRSPSPKSTLH 134

Query: 363 LLKRAIDGKYVTNKQIFDELVIARGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKE 184
           LLK+ I+ +    K +F EL I R +L + +S VFD+LI +CC+LK  D+A E F  MKE
Sbjct: 135 LLKQTIESRVAGVKDVFHELAITRDRLGTKSSIVFDMLIRACCELKRGDDAFECFDMMKE 194

Query: 183 ISILPQIETCNEVLSLFLKQNRTETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKK 4
             ++P+IET N +LSLFLK N+TET WVL+AEMFRLK+KSTV TFNIMINVLCKEGKLKK
Sbjct: 195 KGVVPKIETFNAMLSLFLKLNQTETVWVLYAEMFRLKIKSTVYTFNIMINVLCKEGKLKK 254

Query: 3   A 1
           A
Sbjct: 255 A 255


>ref|XP_006346504.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15630,
           mitochondrial-like [Solanum tuberosum]
          Length = 618

 Score =  207 bits (526), Expect = 4e-51
 Identities = 106/218 (48%), Positives = 147/218 (67%)
 Frame = -1

Query: 654 FDFSSISHTEAPPVQNKDTTEPLLEKSILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPN 475
           F   S+S + +P +     T  +L +S+  S W+ I+ +   L+P LI   L  L  SP+
Sbjct: 15  FSTLSLSKSTSPTIP---ITAEVLRESVTSSQWHFIKHVTGELNPTLISATLPDLRSSPD 71

Query: 474 VILGFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNKQIFDELVIA 295
            +L FIE L     D+ C+CLA+ I+S LPSP+   HLLK+ I  +  ++ +IFD LV A
Sbjct: 72  RVLTFIENLSPNCLDISCYCLAISILSRLPSPKQATHLLKQVISYRLASHNEIFDGLVSA 131

Query: 294 RGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLFLKQNRT 115
           R +L   +S V DLL+ + C+LK  ++AL+ FY MK+  ILP++ETCN++LSLFLK NRT
Sbjct: 132 REKLEIKSSIVLDLLVRAYCELKKGEDALKCFYLMKQKGILPKVETCNDLLSLFLKLNRT 191

Query: 114 ETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
             AW+++AEMFR+KM STV TFNIMINVLC+EGKLKKA
Sbjct: 192 HLAWIVYAEMFRMKMSSTVCTFNIMINVLCREGKLKKA 229


>gb|EXC05161.1| hypothetical protein L484_003967 [Morus notabilis]
          Length = 634

 Score =  206 bits (524), Expect = 8e-51
 Identities = 109/219 (49%), Positives = 150/219 (68%), Gaps = 3/219 (1%)
 Frame = -1

Query: 648 FSSI-SHTEAPPVQNKDT--TEPLLEKSILESNWNSIQEIYPTLSPYLIQNVLFKLHKSP 478
           FSSI   T+ P   N     T+  L K I  S W+ I++    LS  LI + L  LH++P
Sbjct: 28  FSSIPQQTKNPEKHNPQDIITQKSLLKLIHSSQWHFIKQHSRNLSTSLISDTLCTLHQTP 87

Query: 477 NVILGFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNKQIFDELVI 298
            ++L FI ++ F R +++  CLA  I++ +PSP+  LHLLKRA++G   + ++IF+EL  
Sbjct: 88  QLVLKFINQIEFDRLNVESLCLATTILAPIPSPKTALHLLKRAVNGGIASTREIFEELER 147

Query: 297 ARGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLFLKQNR 118
           AR +L+  ++ VFD LI +CC+L  A EA E F+ MKE  ++P+IETCN++LSLF K N 
Sbjct: 148 ARERLSIESAVVFDFLIRACCELNKAKEAFECFWIMKEKGVVPKIETCNDMLSLFSKSNM 207

Query: 117 TETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
            E AW+L+AEMFRLK+KS+V TFNIMINVLCKEGKLKKA
Sbjct: 208 LEMAWLLYAEMFRLKIKSSVYTFNIMINVLCKEGKLKKA 246


>ref|XP_007201524.1| hypothetical protein PRUPE_ppa015625mg, partial [Prunus persica]
           gi|462396924|gb|EMJ02723.1| hypothetical protein
           PRUPE_ppa015625mg, partial [Prunus persica]
          Length = 545

 Score =  205 bits (522), Expect = 1e-50
 Identities = 96/186 (51%), Positives = 136/186 (73%)
 Frame = -1

Query: 558 WNSIQEIYPTLSPYLIQNVLFKLHKSPNVILGFIEELGFVRFDLKCFCLAVVIISHLPSP 379
           W+ I+ + P LSP LI   LF+L KSP ++L FI  + F R D++  CLA+ I++   SP
Sbjct: 1   WHFIKHLSPNLSPSLISEALFELQKSPQLVLEFISNVDFHRLDIQTRCLAIAIVARQSSP 60

Query: 378 QPTLHLLKRAIDGKYVTNKQIFDELVIARGQLNSSNSRVFDLLISSCCKLKMADEALEIF 199
           QP L LLK+ +     T +++F+ L ++R +L+ ++S +FDLL+ +CC++K ADEA++ F
Sbjct: 61  QPALELLKQVVGSGIATIREVFNPLALSRDRLSVNSSIIFDLLLRACCEMKKADEAVDCF 120

Query: 198 YFMKEISILPQIETCNEVLSLFLKQNRTETAWVLFAEMFRLKMKSTVVTFNIMINVLCKE 19
           Y M +   +P+ ETCN++LSLFLK N+TE  WVL+AEMFRLK+ S+V TFNIMINVLCKE
Sbjct: 121 YLMVDKGFMPKTETCNDMLSLFLKLNQTERVWVLYAEMFRLKINSSVCTFNIMINVLCKE 180

Query: 18  GKLKKA 1
           GKLKKA
Sbjct: 181 GKLKKA 186


>ref|XP_004230838.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15630,
           mitochondrial-like [Solanum lycopersicum]
          Length = 618

 Score =  205 bits (522), Expect = 1e-50
 Identities = 105/218 (48%), Positives = 147/218 (67%)
 Frame = -1

Query: 654 FDFSSISHTEAPPVQNKDTTEPLLEKSILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPN 475
           F    +S + +P +     T  +L +S+  S W+ I+ +   L+P LI   L +L  SP+
Sbjct: 15  FSTLGLSKSTSPTIP---ITAEVLRESVTSSQWHFIKHVTGELNPTLISATLPELRSSPD 71

Query: 474 VILGFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNKQIFDELVIA 295
            +L FIE LG    D+ C+CLA+ I+S LPSP+   HLLK+ I  ++ +  +IF  LV A
Sbjct: 72  RVLTFIENLGPDCLDISCYCLAISILSRLPSPKQATHLLKQVISSRFASPNEIFYGLVSA 131

Query: 294 RGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLFLKQNRT 115
           R +L   +S V DLL+ + C+LK  ++AL+ FY MK+  ILP++ETCN++LSLFLK NRT
Sbjct: 132 REKLVVKSSIVLDLLVRAYCELKKGEDALKCFYLMKQKGILPKVETCNDLLSLFLKLNRT 191

Query: 114 ETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
             AW+++AEMFR+KM STV TFNIMINVLC+EGKLKKA
Sbjct: 192 HLAWIVYAEMFRMKMSSTVCTFNIMINVLCREGKLKKA 229


>ref|XP_006849567.1| hypothetical protein AMTR_s00024p00183850 [Amborella trichopoda]
           gi|548853142|gb|ERN11148.1| hypothetical protein
           AMTR_s00024p00183850 [Amborella trichopoda]
          Length = 633

 Score =  203 bits (516), Expect = 6e-50
 Identities = 115/254 (45%), Positives = 161/254 (63%), Gaps = 5/254 (1%)
 Frame = -1

Query: 747 IKIHKAFDLGTFKQIKFKDLYYI--NPGQCSSTFDFSSISHTEAPPVQNKDTTEPLLEKS 574
           +KI K F     K  +  DL +   +PG C     F S S+ E    QN   T   LEK+
Sbjct: 1   MKILKTFYFQELKSCRIIDLLFNGGSPG-CLGESRFLSTSNLEI--AQNVSGT---LEKA 54

Query: 573 ILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPNVILGFIEELG---FVRFDLKCFCLAVV 403
           IL+S W S++ I   ++  LI + + KL   P+ ILGF++ L        DL+C C+A+ 
Sbjct: 55  ILKSQWTSLERISRVMTMDLIADTMVKLRSRPHKILGFVKHLESDLVFHLDLRCLCIAIH 114

Query: 402 IISHLPSPQPTLHLLKRAIDGKYVTNKQIFDELVIARGQLNSSNSRVFDLLISSCCKLKM 223
           II+ L +PQP L LL+R ++G +  N  IFD L+ A+    + N+ VF+LLI +CC L+ 
Sbjct: 115 IIAGLENPQPALQLLQRIVNGGFGPNTLIFDALMKAKEVCETKNTLVFNLLIKACCHLQK 174

Query: 222 ADEALEIFYFMKEISILPQIETCNEVLSLFLKQNRTETAWVLFAEMFRLKMKSTVVTFNI 43
           +DEA++IFY MK   + P IE+CN +LS   KQN+TETAWV++AE+FRLK+ S++VTFNI
Sbjct: 175 SDEAVQIFYLMKGHKLSPSIESCNFLLSTLSKQNKTETAWVIYAEIFRLKIPSSIVTFNI 234

Query: 42  MINVLCKEGKLKKA 1
           MIN+LCKEGKL KA
Sbjct: 235 MINILCKEGKLNKA 248


>ref|XP_003540687.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15630,
           mitochondrial-like [Glycine max]
          Length = 623

 Score =  202 bits (513), Expect = 1e-49
 Identities = 103/219 (47%), Positives = 150/219 (68%), Gaps = 3/219 (1%)
 Frame = -1

Query: 648 FSSISHTEAPPVQNKDT-TEPLLEKSILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPNV 472
           F+S+SH++ PP     T TE  L  SI  S W+ I+++ P L+P L+ + L  L  +P +
Sbjct: 17  FNSLSHSQTPPFSIPTTLTESTLLHSIESSQWHFIEQVAPHLTPSLLSSTLTTLRHNPQL 76

Query: 471 ILGFIEELGFV--RFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNKQIFDELVI 298
           +L  +  L       DL    LA+ ++  LPSP+P+++L++R I     TN+ IFDEL +
Sbjct: 77  VLHLLSHLQNHPHSLDLATSSLAICVLYRLPSPKPSINLIQRLILSPTCTNRTIFDELAL 136

Query: 297 ARGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLFLKQNR 118
           AR ++++  + +FDLL+ + C+LK  +EALE FY +KE   +P IETCN++LSLFLK NR
Sbjct: 137 ARDRVDAKTTLIFDLLVRAYCELKKPNEALECFYLIKEKGFVPNIETCNQMLSLFLKLNR 196

Query: 117 TETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           T+ AWVL+AEMFR+ ++S++ TFNIMINVLCKEGKLKKA
Sbjct: 197 TQMAWVLYAEMFRMNIRSSLYTFNIMINVLCKEGKLKKA 235


>ref|XP_004292464.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15630,
           mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 623

 Score =  200 bits (509), Expect = 4e-49
 Identities = 104/216 (48%), Positives = 139/216 (64%)
 Frame = -1

Query: 648 FSSISHTEAPPVQNKDTTEPLLEKSILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPNVI 469
           FSS++    PP     T   LL  SI  S+W+ IQ + P L   L+   LF LH++P+++
Sbjct: 21  FSSLTQPSPPPPPPPITHLSLLN-SIQTSHWHFIQHLPPNLPSSLVSQTLFSLHQTPHLV 79

Query: 468 LGFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNKQIFDELVIARG 289
             F       R D    CLA  I++ LPSP+P++ LLK+ +       + +FD L  AR 
Sbjct: 80  HRFTSHFDLRRLDTDTQCLAAAILAALPSPKPSIALLKQLLGSGIAPIRDVFDSLAKART 139

Query: 288 QLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLFLKQNRTET 109
           +L + +  V DLL+S+CC+LK ADE  + F  M   +++P+  TCNE+LSLF K NRTET
Sbjct: 140 RLGAQSGVVLDLLVSACCELKRADEGFQCFRSMTSANVMPKTRTCNELLSLFSKMNRTET 199

Query: 108 AWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           AWVL+AEMFRLK+KS+V TFNIMINVLCKEGKLKKA
Sbjct: 200 AWVLYAEMFRLKIKSSVCTFNIMINVLCKEGKLKKA 235


>ref|NP_179165.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75216226|sp|Q9ZQF1.1|PP152_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At2g15630, mitochondrial; Flags: Precursor
           gi|4335729|gb|AAD17407.1| putative salt-inducible
           protein [Arabidopsis thaliana]
           gi|330251331|gb|AEC06425.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 627

 Score =  199 bits (505), Expect = 1e-48
 Identities = 94/215 (43%), Positives = 137/215 (63%)
 Frame = -1

Query: 645 SSISHTEAPPVQNKDTTEPLLEKSILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPNVIL 466
           SS++ T  P       T  +L +SI  S W+ ++ +   L+P L+   L  L K+PN+  
Sbjct: 30  SSLAQTSTPESVLPPITSEILLESIRSSQWHIVEHVADKLTPSLVSTTLLSLVKTPNLAF 89

Query: 465 GFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNKQIFDELVIARGQ 286
            F+  +   R D +  CLA+ +IS L SP+P   LLK  +  +  + + +FDELV+A  +
Sbjct: 90  NFVNHIDLYRLDFQTQCLAIAVISKLSSPKPVTQLLKEVVTSRKNSIRNLFDELVLAHDR 149

Query: 285 LNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLFLKQNRTETA 106
           L + ++ +FDLL+  CC+L+M DEA+E FY MKE    P+ ETCN +L+L  + NR E A
Sbjct: 150 LETKSTILFDLLVRCCCQLRMVDEAIECFYLMKEKGFYPKTETCNHILTLLSRLNRIENA 209

Query: 105 WVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           WV +A+M+R+++KS V TFNIMINVLCKEGKLKKA
Sbjct: 210 WVFYADMYRMEIKSNVYTFNIMINVLCKEGKLKKA 244


>ref|XP_006423135.1| hypothetical protein CICLE_v10030406mg [Citrus clementina]
           gi|568851376|ref|XP_006479369.1| PREDICTED:
           pentatricopeptide repeat-containing protein At2g15630,
           mitochondrial-like [Citrus sinensis]
           gi|557525069|gb|ESR36375.1| hypothetical protein
           CICLE_v10030406mg [Citrus clementina]
          Length = 645

 Score =  198 bits (504), Expect = 2e-48
 Identities = 97/202 (48%), Positives = 139/202 (68%), Gaps = 1/202 (0%)
 Frame = -1

Query: 603 DTTEPLLEKSILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPNVILGFIEELGFVRF-DL 427
           + T  LL   I  S W+ I+++ P ++P LI + L  LHK+P++   FI  LGF R  D+
Sbjct: 56  EITSELLNSYIHSSQWHFIKQLAPKITPSLITSALLDLHKNPDLAFQFINHLGFRRIRDI 115

Query: 426 KCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNKQIFDELVIARGQLNSSNSRVFDLLI 247
           K  C A+ +IS L + +PTL LLK  ++    T + +F+EL +AR +L   +S VFD L+
Sbjct: 116 KTRCFAIAVISRLSTSKPTLQLLKETLNSGIATIQVVFNELAVARDELRIRSSTVFDFLL 175

Query: 246 SSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLFLKQNRTETAWVLFAEMFRLKMK 67
             CC+LK  D+A + FY MKE   +P+IE+CN++LS+F+K NR   AWVL+AEMFR+++K
Sbjct: 176 RVCCELKRDDDAFKCFYMMKEKGFVPKIESCNDMLSMFVKLNRPYKAWVLYAEMFRMRIK 235

Query: 66  STVVTFNIMINVLCKEGKLKKA 1
           S+V TFNIMIN+LCKEGKL+KA
Sbjct: 236 SSVCTFNIMINLLCKEGKLQKA 257


>ref|XP_007161634.1| hypothetical protein PHAVU_001G085800g [Phaseolus vulgaris]
           gi|561035098|gb|ESW33628.1| hypothetical protein
           PHAVU_001G085800g [Phaseolus vulgaris]
          Length = 618

 Score =  194 bits (493), Expect = 3e-47
 Identities = 102/218 (46%), Positives = 145/218 (66%), Gaps = 2/218 (0%)
 Frame = -1

Query: 648 FSSISHTEAPPVQNKDTTEPLLEKSILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPNVI 469
           F+S+S     P+     TE  L  SI  SNW+ I++  P  +P ++ + L  L  +P ++
Sbjct: 17  FNSLSQ----PITVPPITESELLNSIESSNWHFIKQAAPHYTPSILSSTLTSLRNNPQLV 72

Query: 468 LGFIEELGFV--RFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNKQIFDELVIA 295
           L F+  L       DL    LA  I+  LPSP+P+++LL+R I    +TN  IF EL ++
Sbjct: 73  LQFLSHLNTHPHSLDLTTSSLAACILCRLPSPKPSINLLQRLILSSTLTNTTIFHELALS 132

Query: 294 RGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLFLKQNRT 115
           R ++++ +S +FDLL+ + C+LK  +EALE FY MKE  + P IETCN++LSLFLK NRT
Sbjct: 133 RDRVDAKSSLIFDLLVRAYCELKKPNEALECFYLMKEKGVEPNIETCNQMLSLFLKLNRT 192

Query: 114 ETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           + AWVL+AEMFR+ ++S+V TFNIM+NVLCKEGKLKKA
Sbjct: 193 QMAWVLYAEMFRMNIRSSVYTFNIMVNVLCKEGKLKKA 230


>ref|XP_006297214.1| hypothetical protein CARUB_v10013223mg [Capsella rubella]
           gi|482565923|gb|EOA30112.1| hypothetical protein
           CARUB_v10013223mg [Capsella rubella]
          Length = 623

 Score =  192 bits (489), Expect = 9e-47
 Identities = 93/209 (44%), Positives = 136/209 (65%)
 Frame = -1

Query: 627 EAPPVQNKDTTEPLLEKSILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPNVILGFIEEL 448
           E PP+     T  +L  SI  S W+ ++ +    +P L+   L  L K+P++ LGF+  +
Sbjct: 37  ELPPI-----TSDILLDSIKSSQWHIVEHLSDKFTPSLLSTTLLNLVKTPDLALGFVNHI 91

Query: 447 GFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTNKQIFDELVIARGQLNSSNS 268
                D +  CLA+ ++S L SP+P   LLK  +  +  + + +FDELV+AR +L + ++
Sbjct: 92  DLRCLDFQTQCLAIAVVSKLSSPKPVTQLLKEVVSSRKNSIRDLFDELVLARDRLETKST 151

Query: 267 RVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLFLKQNRTETAWVLFAE 88
            +FD L+  CC+L+M DEA+E FY MKE    P+ ETCN +LSL  + NRTE+AWV +A+
Sbjct: 152 ILFDFLVRCCCQLRMVDEAIECFYLMKEKGFDPKTETCNCILSLLSRLNRTESAWVFYAD 211

Query: 87  MFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           M+R+++KS V TFNIMINVLCKEGKLKKA
Sbjct: 212 MYRMEIKSNVYTFNIMINVLCKEGKLKKA 240


>ref|XP_002883912.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297329752|gb|EFH60171.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 623

 Score =  186 bits (471), Expect = 1e-44
 Identities = 95/229 (41%), Positives = 141/229 (61%)
 Frame = -1

Query: 687 YYINPGQCSSTFDFSSISHTEAPPVQNKDTTEPLLEKSILESNWNSIQEIYPTLSPYLIQ 508
           YY    + SS    S+ + ++ PP+    T+E LLE SI  S W+ I+ +   L P L+ 
Sbjct: 22  YYPTAARLSSFAQTSTTTESQLPPI----TSEVLLE-SIKSSQWHFIEHVTDKLIPSLVS 76

Query: 507 NVLFKLHKSPNVILGFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVT 328
             L  L K+P++   F+  +     D +  CLA+ ++S L SP+    LLK  +  +  +
Sbjct: 77  TTLLSLVKTPDLAFNFVNHIDLRCLDFQTQCLAIAVVSKLSSPKSVTQLLKEVVSTRKNS 136

Query: 327 NKQIFDELVIARGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNE 148
            + +FDELV+A  +L + ++ +FD ++   C+L+M DEA+E FY MKE    P+ ETCN 
Sbjct: 137 VRDLFDELVLAHDRLQTKSTILFDFMVRFYCQLRMVDEAIECFYLMKEKGFDPKTETCNH 196

Query: 147 VLSLFLKQNRTETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           +LSL  + NR E AWV +A+M+R+++KS V TFNIMINVLCKEGKLKKA
Sbjct: 197 ILSLLSRLNRIENAWVFYADMYRMEIKSNVYTFNIMINVLCKEGKLKKA 245


>ref|XP_006409538.1| hypothetical protein EUTSA_v10022592mg [Eutrema salsugineum]
           gi|557110700|gb|ESQ50991.1| hypothetical protein
           EUTSA_v10022592mg [Eutrema salsugineum]
          Length = 633

 Score =  179 bits (453), Expect = 1e-42
 Identities = 84/218 (38%), Positives = 139/218 (63%), Gaps = 3/218 (1%)
 Frame = -1

Query: 645 SSISHTEAPPVQNKDTTEPLLEKSILESNWNSIQEIYPTLSPYLIQNVLFKLHKSPNVIL 466
           S+   +  PP+ +      +L +S+  S W+ ++++   ++P ++   L  L K+P++ L
Sbjct: 36  STTEESSQPPISSD-----ILLESVRSSQWHFVEQLSDKITPSVVSTTLLNLVKTPDLAL 90

Query: 465 GFIEELGFVRFDLKCFCLAVVIISHLPSPQPTLHLLKRAIDGKYVTN---KQIFDELVIA 295
            F++ +     D    CLA+ ++S L SP+P L LL   +  +  ++   + +FDELV+A
Sbjct: 91  SFVKHIDLRFLDFSTQCLAIAVVSKLSSPKPALQLLNEVVVSRSKSSFSVRDVFDELVLA 150

Query: 294 RGQLNSSNSRVFDLLISSCCKLKMADEALEIFYFMKEISILPQIETCNEVLSLFLKQNRT 115
           R +L + ++ +FD L+  CC+L M +E++E FY MKE   +P+ ETCN +LS   + NRT
Sbjct: 151 RDKLETKSTILFDFLVRCCCQLNMVEESIECFYLMKEKGFVPKTETCNCILSSLSRLNRT 210

Query: 114 ETAWVLFAEMFRLKMKSTVVTFNIMINVLCKEGKLKKA 1
           E+AWV +A+M+R+ +KS + T+NIMINVLCKEGKLKKA
Sbjct: 211 ESAWVFYADMYRMDIKSNLYTYNIMINVLCKEGKLKKA 248


Top