BLASTX nr result

ID: Anemarrhena21_contig00014996 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00014996
         (1231 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008781345.1| PREDICTED: pentatricopeptide repeat-containi...   402   e-109
ref|XP_010935201.1| PREDICTED: pentatricopeptide repeat-containi...   398   e-108
ref|XP_009408172.1| PREDICTED: pentatricopeptide repeat-containi...   369   3e-99
ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containi...   357   1e-95
ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing pr...   335   4e-89
gb|KHG25256.1| hypothetical protein F383_08951 [Gossypium arboreum]   325   3e-86
ref|XP_012480490.1| PREDICTED: pentatricopeptide repeat-containi...   323   1e-85
ref|XP_008358363.1| PREDICTED: pentatricopeptide repeat-containi...   319   2e-84
ref|XP_008358358.1| PREDICTED: pentatricopeptide repeat-containi...   319   2e-84
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   319   2e-84
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   319   2e-84
ref|XP_010028778.1| PREDICTED: pentatricopeptide repeat-containi...   318   4e-84
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...   315   4e-83
ref|XP_012077696.1| PREDICTED: pentatricopeptide repeat-containi...   313   2e-82
ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prun...   312   3e-82
ref|XP_011045590.1| PREDICTED: pentatricopeptide repeat-containi...   309   3e-81
ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Popu...   308   7e-81
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   307   9e-81
gb|ERM96287.1| hypothetical protein AMTR_s00001p00173820 [Ambore...   306   2e-80
ref|XP_009629638.1| PREDICTED: pentatricopeptide repeat-containi...   305   5e-80

>ref|XP_008781345.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Phoenix dactylifera]
          Length = 427

 Score =  402 bits (1032), Expect = e-109
 Identities = 209/381 (54%), Positives = 270/381 (70%), Gaps = 5/381 (1%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQSSPLALPIYRRISTTHWFNWTPKLVAELISVLESNGX 1051
            IRKFVA+SSK+TAL TL+ LLS SSP +LPIYRRIS   WF W PKL A + +VL + G 
Sbjct: 48   IRKFVAASSKSTALHTLSRLLSLSSPFSLPIYRRISEAIWFKWNPKLAAAMAAVLVNQGR 107

Query: 1050 XXXXXXXXXXSVAKLASQRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMGFLGKRPFQ 871
                      SV++L S  +I+ FYCDLI+A S+R LK+  L+ Y R++EM    ++P++
Sbjct: 108  AAEAESLISESVSRLNSDLEISLFYCDLIEAFSERGLKDLALDFYFRLREMPCSRRKPYE 167

Query: 870  SMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSFDEMIRVLRKMEDA 691
            SM+K L  MG+P DAE+ L EM   GF+ S FE+R+V Q YG+LGSF EM RVL  MEDA
Sbjct: 168  SMIKALCLMGLPVDAEEKLKEMALLGFRPSPFEFRLVLQSYGKLGSFAEMRRVLGIMEDA 227

Query: 690  GFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSVLNSCPTVVLMVKD 511
            G  VDT+C NVVLSCYGD+G+  EMV+WIRKM+ LG+G+S+RT N VLNSCPT++ +V+D
Sbjct: 228  GLAVDTICTNVVLSCYGDHGELAEMVSWIRKMKKLGVGFSIRTFNVVLNSCPTIISIVQD 287

Query: 510  LVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXEN-----LKWSDSEVELDLHGFHLTSA 346
                PL +  L+KK+  D                +     L+WS +E +LDLHGFH++SA
Sbjct: 288  AKHFPLSIAALVKKVEEDSPSPDEALLVRELVGSSVLVDILEWSPNEGKLDLHGFHVSSA 347

Query: 345  YVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQLMFKLNSPM 166
            YVILLQWMEELR RF+V E  VVPL+ISVVCGSGK S   G+SPVK LVS++MF+LNS M
Sbjct: 348  YVILLQWMEELRMRFRVDE--VVPLEISVVCGSGKKSDKIGESPVKMLVSEMMFQLNSSM 405

Query: 165  KLDKKNVGKLLAKGSVVRDWL 103
            ++D+KN G+ +A+G  VRDWL
Sbjct: 406  RIDRKNAGRFVAQGKAVRDWL 426


>ref|XP_010935201.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Elaeis guineensis]
          Length = 434

 Score =  398 bits (1023), Expect = e-108
 Identities = 205/381 (53%), Positives = 268/381 (70%), Gaps = 5/381 (1%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQSSPLALPIYRRISTTHWFNWTPKLVAELISVLESNGX 1051
            +RKFVA+SSK+ AL TL+ LLS SS  ALPIYRR+S  +WF W PKL A + +VL + G 
Sbjct: 54   VRKFVAASSKSAALHTLSHLLSLSSRFALPIYRRVSEANWFKWNPKLAAAMAAVLVNQGR 113

Query: 1050 XXXXXXXXXXSVAKLASQRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMGFLGKRPFQ 871
                      SV++L S  +I+ FYCDLI+A S+R LK+F L+ Y R+ E+    ++P++
Sbjct: 114  ATEAESLISESVSRLNSDLEISLFYCDLIEAFSERGLKDFALDFYSRLHEIPCSVRKPYE 173

Query: 870  SMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSFDEMIRVLRKMEDA 691
            SM+K L  MG+P DAE+ L EM   GF+ S FE+R+V Q YG+ GSF EM RVL  MEDA
Sbjct: 174  SMIKALCLMGLPVDAEEKLKEMAFLGFRPSPFEFRLVMQSYGKSGSFAEMSRVLGIMEDA 233

Query: 690  GFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSVLNSCPTVVLMVKD 511
            G  +DTVC NVVLSCYGD+G+  +MV+WIRKM+ LGIG+SVRT N VLNSCPT++ MV+D
Sbjct: 234  GLAIDTVCTNVVLSCYGDHGELAKMVSWIRKMKKLGIGFSVRTFNVVLNSCPTIISMVQD 293

Query: 510  LVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXEN-----LKWSDSEVELDLHGFHLTSA 346
            +  +PL +  L+KK+  D                +     L+WS  E +LDLHGFH+ SA
Sbjct: 294  VKHIPLSIAALVKKVEEDSLSLDEALLVRELVGSSVLVDILEWSPDEGKLDLHGFHVASA 353

Query: 345  YVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQLMFKLNSPM 166
            +VILLQW+EELR RF+V E   VPL+ISVVCGSGKHS   G+SPVK LVS++MF+LNS M
Sbjct: 354  FVILLQWVEELRIRFRVDE--AVPLEISVVCGSGKHSDKIGESPVKMLVSEMMFQLNSSM 411

Query: 165  KLDKKNVGKLLAKGSVVRDWL 103
            ++D+KN G+ +A+G  VRDWL
Sbjct: 412  RIDRKNAGRFVARGKAVRDWL 432


>ref|XP_009408172.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Musa acuminata subsp. malaccensis]
          Length = 430

 Score =  369 bits (946), Expect = 3e-99
 Identities = 185/381 (48%), Positives = 257/381 (67%), Gaps = 5/381 (1%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQSSPLALPIYRRISTTHWFNWTPKLVAELISVLESNGX 1051
            IRKF+A+SSK  AL  L+  LS SSP A P+Y RIS   WF+W PKL A ++++LE  G 
Sbjct: 51   IRKFLAASSKPAALHALSSFLSLSSPFAPPLYERISEASWFSWKPKLAATVVALLEKQGR 110

Query: 1050 XXXXXXXXXXSVAKLASQRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMGFLGKRPFQ 871
                      +V++  + RD+A FYCDLI+  S++ L+  VLE Y R++E+ F G+RP++
Sbjct: 111  CAEAETLTLDAVSRSKTHRDLALFYCDLIECFSEQGLEQPVLETYARLREVPFAGRRPYE 170

Query: 870  SMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSFDEMIRVLRKMEDA 691
            SM+K L  MGMP +AE  L EM +SG K S FE+R V Q YGR G   EM RV+  MEDA
Sbjct: 171  SMIKALCLMGMPGEAEAKLKEMASSGCKPSPFEFRSVIQSYGRSGLLSEMRRVVGSMEDA 230

Query: 690  GFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSVLNSCPTVVLMVKD 511
            G  +DTVC NVVLSCYG +G+  EM +W+ KMR  GI +S+RT N VLNSCP VV +  D
Sbjct: 231  GLPIDTVCVNVVLSCYGHHGELPEMASWMTKMREKGIVFSIRTFNCVLNSCPRVVSIASD 290

Query: 510  LVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENL-----KWSDSEVELDLHGFHLTSA 346
              ++PL +++L++KL N+                ++     +WS S  +LDLHG H+ +A
Sbjct: 291  AGSLPLSMEELLQKLENESSSRTEALLVQELTSSSVLADISEWSPSGSKLDLHGLHVAAA 350

Query: 345  YVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQLMFKLNSPM 166
            Y+ILL+W++ELR RF  +EE V+PL+ISV+CGSGKHS  RG SP+K+LVS++MF+ +SPM
Sbjct: 351  YIILLKWIQELRRRF--QEEDVIPLEISVICGSGKHSERRGRSPIKDLVSEMMFRKSSPM 408

Query: 165  KLDKKNVGKLLAKGSVVRDWL 103
            ++D KN G+ +A+G  V +W+
Sbjct: 409  RIDSKNPGRFVARGKAVWEWM 429


>ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Nelumbo nucifera]
          Length = 451

 Score =  357 bits (916), Expect = 1e-95
 Identities = 186/388 (47%), Positives = 256/388 (65%), Gaps = 12/388 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQS------SPLALPIYRRISTTHWFNWTPKLVAELISV 1069
            IRKFVASSSK+ AL  L+ L+S +      S L LP+YRRI+ T WFNW PKLVA +I+ 
Sbjct: 58   IRKFVASSSKSDALNALSHLISSNTTHFHLSSLVLPMYRRIAETPWFNWNPKLVASVIAY 117

Query: 1068 LESNGXXXXXXXXXXXSVAKLASQ-RDIAHFYCDLIDAASDRKLKNFVLECYGRIKEM-- 898
            L+  G           SV KL  Q RD+A FYCDLID+ S ++ +  V E Y R+K++  
Sbjct: 118  LDKQGQPEEAEALISESVQKLGFQERDVALFYCDLIDSYSKQRSRIGVFESYARLKQLFS 177

Query: 897  ---GFLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSFD 727
                 L +R +++++  L S+ +P DAE +++EM  SGFK S FE+R +  GYGRLG F 
Sbjct: 178  DSSSSLSRRAYETIICSLCSVDLPRDAENMVEEMTISGFKPSAFEFRSLVSGYGRLGLFT 237

Query: 726  EMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSVL 547
            +M RVLRKMEDAG+ +DT+C+N+VLS +G + +  EM +W+RKM+   I +S+RT NSV+
Sbjct: 238  DMRRVLRKMEDAGYCLDTICSNMVLSSFGAHSELSEMASWLRKMKDSNISFSIRTYNSVM 297

Query: 546  NSCPTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVELDLH 367
            NSCPT+  ++KDL  VPL ++DL  +L  D               + LKW  SE +LDLH
Sbjct: 298  NSCPTITSLLKDLKFVPLSMEDLKGRLQKDETLLVEQLIGSSVLMDALKWCPSEGKLDLH 357

Query: 366  GFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQLM 187
            G HL +AY+I+LQW++ LR+RF      V+P +  V+CGSGKHS VRG+SPVK LV Q+M
Sbjct: 358  GMHLATAYLIMLQWVQVLRSRFSA-GNWVIPTEFRVICGSGKHSSVRGESPVKALVKQMM 416

Query: 186  FKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
             ++ SPMK+D+ NVG  + +G  VRDWL
Sbjct: 417  VRMKSPMKIDRNNVGCFVGRGKAVRDWL 444


>ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing protein, putative
            [Theobroma cacao] gi|508705664|gb|EOX97560.1|
            Pentatricopeptide (PPR) repeat-containing protein,
            putative [Theobroma cacao]
          Length = 456

 Score =  335 bits (859), Expect = 4e-89
 Identities = 176/391 (45%), Positives = 256/391 (65%), Gaps = 14/391 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQS------SPLALPIYRRISTTHWFNWTPKLVAELISV 1069
            I+KFVASS K+ AL  L+ LLS        S LA P+Y +IS T W+NW PKLVAELI++
Sbjct: 61   IKKFVASSPKSIALNALSHLLSPRNSHPHLSALAFPLYTKISETSWYNWNPKLVAELIAL 120

Query: 1068 LESNGXXXXXXXXXXXSVAKLA-SQRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEM-- 898
            L   G           +V+KL   +RD+  FYC+ I++ S    K    + Y  + E+  
Sbjct: 121  LVKQGRYDESEALISQAVSKLKFRERDLVQFYCNWIESCSKHNSKEGFNDAYCYLSELIC 180

Query: 897  ----GFLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 ++ ++ ++SMV  L  M  P +AE +++EM  +G   + FE+R ++ GYG+LG F
Sbjct: 181  NSSSVYVKRQGYKSMVSSLCEMDRPNEAENLVEEMRKNGLTPTLFEFRFISYGYGQLGLF 240

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
            ++M R++ +ME  GF VDT+C+N+VLS YG Y  F +MV W++KM+ L I +S+RT NSV
Sbjct: 241  EDMERMVCEMEIEGFEVDTICSNMVLSSYGAYNAFSKMVPWLQKMKTLQIPFSIRTYNSV 300

Query: 549  LNSCPTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXEN-LKWSDSEVELD 373
            LNSCP ++ +V+ L +VPL + +L K LN D               +  ++W+ SE +LD
Sbjct: 301  LNSCPEIMSLVQGLDSVPLSLGELAKILNEDEALLVQELVKSSSVLDEAMEWNGSEGKLD 360

Query: 372  LHGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQ 193
            LHG HL SAY+I+LQW+EE++ RFKV EE V+P +I++VCGSGKHS VRG+SPVK L+ +
Sbjct: 361  LHGMHLGSAYLIMLQWIEEMKCRFKV-EECVIPAQITIVCGSGKHSSVRGESPVKTLMRK 419

Query: 192  LMFKLNSPMKLDKKNVGKLLAKGSVVRDWLI 100
            +M K+ SPMK+D+KN+G  +AKG VV++WLI
Sbjct: 420  MMVKMKSPMKIDRKNIGCFIAKGQVVKNWLI 450


>gb|KHG25256.1| hypothetical protein F383_08951 [Gossypium arboreum]
          Length = 458

 Score =  325 bits (834), Expect = 3e-86
 Identities = 170/391 (43%), Positives = 251/391 (64%), Gaps = 14/391 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQS------SPLALPIYRRISTTHWFNWTPKLVAELISV 1069
            I+KFVASS K+ AL  L+ LLS        S +A P+Y +IS   W+NW PKLVA+L+ +
Sbjct: 65   IKKFVASSPKSIALNALSHLLSPRNSHPHLSAIAFPLYTKISEASWYNWNPKLVADLVPL 124

Query: 1068 LESNGXXXXXXXXXXXSVAKLA-SQRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMG- 895
            L+  G            V+KL   +RD+  FYC+LI++ S  + K    + YG + E+  
Sbjct: 125  LDIQGKHDESQALNSQVVSKLKFKERDLVQFYCNLIESCSKHESKQGFNDAYGFLSELVN 184

Query: 894  -----FLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 ++ K+ F+SMV  L  MG P +AE V+++M  +G K S FE R V  GYG++G F
Sbjct: 185  NSSSMYVKKQGFKSMVSSLCEMGQPNEAENVVEDMIKNGVKPSLFELRFVLYGYGKMGFF 244

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
            ++M R+++KME  GF VDT+ +N++LS YG Y    +MV W++KM+ L I +S+RT N V
Sbjct: 245  EDMERMVKKMEIEGFGVDTISSNMILSSYGAYNALPKMVPWLQKMKALEIPFSIRTYNCV 304

Query: 549  LNSCPTVVLMVKDLVTVPLCVDDLIKKLNN-DXXXXXXXXXXXXXXXENLKWSDSEVELD 373
            LNSCP ++  V+     P+ V +L+  L+  +               E ++W D E++LD
Sbjct: 305  LNSCPMIMSFVRGSGGFPVSVSELVNVLDEAEALLVKELVESSSVLDEAMEWDDLELKLD 364

Query: 372  LHGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQ 193
            LHG H  SAY+I+LQW+EE+++RF+V EE VVP +I+VVCG+GKHS VRG+SPVK L+  
Sbjct: 365  LHGMHSGSAYLIMLQWIEEMKSRFRV-EECVVPAQITVVCGTGKHSSVRGESPVKTLIKA 423

Query: 192  LMFKLNSPMKLDKKNVGKLLAKGSVVRDWLI 100
            +M ++ SPM++D+KN+G+ +AKG VVR+WLI
Sbjct: 424  MMVQMKSPMRIDRKNIGRFIAKGQVVRNWLI 454


>ref|XP_012480490.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Gossypium raimondii] gi|763765430|gb|KJB32684.1|
            hypothetical protein B456_005G255600 [Gossypium
            raimondii]
          Length = 458

 Score =  323 bits (829), Expect = 1e-85
 Identities = 168/391 (42%), Positives = 250/391 (63%), Gaps = 14/391 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQS------SPLALPIYRRISTTHWFNWTPKLVAELISV 1069
            I+KFVASS K+ AL  L+ LLS        S +A P+Y +IS   W+NW PKLVA+L+ +
Sbjct: 65   IKKFVASSPKSIALNALSHLLSPRNSHPHLSAIAFPLYTKISEASWYNWNPKLVADLVPL 124

Query: 1068 LESNGXXXXXXXXXXXSVAKLA-SQRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMG- 895
            L+  G            V+KL   +RD+  FYC+LI++ S  + K    + YG + E+  
Sbjct: 125  LDIQGKHDESQALISQVVSKLKFKERDLVQFYCNLIESCSKHESKQGFNDAYGYLSELVN 184

Query: 894  -----FLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 ++ K+ ++SMV  L  MG P +AE V+++M  +G K S FE R V  GYG++G F
Sbjct: 185  NSSSMYVKKQGYKSMVSSLCEMGQPNEAENVVEDMIKNGVKPSLFELRFVLYGYGKMGFF 244

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
            ++M R+++KME  GF VDT+ +N++LS YG Y    +MV W++KM+ L I +S+RT N V
Sbjct: 245  EDMERMVKKMEIEGFGVDTISSNMILSSYGAYNALPKMVPWLQKMKALEIPFSIRTYNCV 304

Query: 549  LNSCPTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXEN-LKWSDSEVELD 373
            LNSCP ++  V+     P+ V +L+  L+ D               +  ++W D E++LD
Sbjct: 305  LNSCPMIMSFVRGSGGFPVSVSELVNVLDEDEALLVKELVESSSVLDEAMEWDDLELKLD 364

Query: 372  LHGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQ 193
            LHG H  SAY+I+LQW++E+++RF+VK E VVP +I+VVCG+GKHS VRG+SPVK L+  
Sbjct: 365  LHGMHSGSAYLIMLQWIKEMKSRFRVK-ECVVPAQITVVCGTGKHSSVRGESPVKTLIKA 423

Query: 192  LMFKLNSPMKLDKKNVGKLLAKGSVVRDWLI 100
            +M ++ SPM++D+KN+G  +AKG VVR+WLI
Sbjct: 424  MMVQMKSPMRIDRKNIGCFIAKGQVVRNWLI 454


>ref|XP_008358363.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Malus domestica]
          Length = 461

 Score =  319 bits (818), Expect = 2e-84
 Identities = 167/389 (42%), Positives = 244/389 (62%), Gaps = 13/389 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQSSP------LALPIYRRISTTHWFNWTPKLVAELISV 1069
            I KF++SS K+ AL TL+ LLS  S       LA P+Y +I+   WF W PKLVA L+++
Sbjct: 70   ISKFLSSSPKSIALSTLSYLLSPDSTPPHLSSLAFPLYSKITEESWFEWNPKLVASLVAL 129

Query: 1068 LESNGXXXXXXXXXXXSVAKLAS-QRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEM-- 898
            L++ G           +++KL S +R++A F+C L+++ S    K+     Y  + ++  
Sbjct: 130  LDNQGLYSQSEALISETISKLGSRERELALFHCQLLESHSKLSSKHGFDSTYSYLHQLLH 189

Query: 897  ----GFLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 ++ +R F+SMV GL +M  P++A+ +++EM   G K S FE+R V  GYGRLG F
Sbjct: 190  NSSSVYVKRRAFESMVGGLCAMDRPQEADILIEEMMVKGLKPSVFEFRSVVYGYGRLGLF 249

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
            +EM++V+ KME  G  VDT+C+N+VLS YG Y +   MV W+RKM++L + +S+RT NSV
Sbjct: 250  EEMLKVVEKMEGQGLAVDTICSNMVLSSYGAYSELAAMVLWLRKMKILRLPFSIRTYNSV 309

Query: 549  LNSCPTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVELDL 370
            LNSCPT++ M++D   VP  ++ L   LN D               E + W   E +LDL
Sbjct: 310  LNSCPTIMAMLQDPKDVPCSIEQLNGVLNGDEGLVVKELVGSTVLEEVMVWESLEAKLDL 369

Query: 369  HGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQL 190
            HG HL SAY+I+L+W E +R RF    E V+P ++ +VCG GKHS VRG+SPVK LV  +
Sbjct: 370  HGLHLGSAYLIMLEWFEAMRHRFNC-GECVIPAEVVIVCGLGKHSSVRGESPVKGLVKVM 428

Query: 189  MFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            M ++ SPM++D+KNVG  +AKG  V+DWL
Sbjct: 429  MHRMGSPMRIDRKNVGCFIAKGRAVKDWL 457


>ref|XP_008358358.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Malus domestica]
          Length = 461

 Score =  319 bits (818), Expect = 2e-84
 Identities = 167/389 (42%), Positives = 244/389 (62%), Gaps = 13/389 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQSSP------LALPIYRRISTTHWFNWTPKLVAELISV 1069
            I KF++SS K+ AL TL+ LLS  S       LA P+Y +I+   WF W PKLVA L+++
Sbjct: 70   ISKFLSSSPKSIALSTLSYLLSPDSTPPHLSSLAFPLYSKITEESWFEWNPKLVASLVAL 129

Query: 1068 LESNGXXXXXXXXXXXSVAKLAS-QRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEM-- 898
            L++ G           +++KL S +R++A F+C L+++ S    K+     Y  + ++  
Sbjct: 130  LDNQGLYSQSEALISETISKLGSRERELALFHCQLLESHSKLSSKHGFDSTYSYLHQLLH 189

Query: 897  ----GFLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 ++ +R F+SMV GL +M  P++A+ +++EM   G K S FE+R V  GYGRLG F
Sbjct: 190  NSSSVYVKRRAFESMVGGLCAMDRPQEADILIEEMMVKGLKPSVFEFRSVVYGYGRLGLF 249

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
            +EM++V+ KME  G  VDT+C+N+VLS YG Y +   MV W+RKM++L + +S+RT NSV
Sbjct: 250  EEMLKVVEKMEGQGLAVDTICSNMVLSSYGAYSELAAMVLWLRKMKILRLPFSIRTYNSV 309

Query: 549  LNSCPTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVELDL 370
            LNSCPT++ M++D   VP  ++ L   LN D               E + W   E +LDL
Sbjct: 310  LNSCPTIMAMLQDPKDVPCSIEQLNGVLNGDEGLVVKELVGSTVLEEVMVWESLEAKLDL 369

Query: 369  HGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQL 190
            HG HL SAY+I+L+W E +R RF    E V+P ++ +VCG GKHS VRG+SPVK LV  +
Sbjct: 370  HGLHLGSAYLIMLEWFEAMRHRFNC-GECVIPAEVVIVCGLGKHSSVRGESPVKGLVKVM 428

Query: 189  MFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            M ++ SPM++D+KNVG  +AKG  V+DWL
Sbjct: 429  MHRMGSPMRIDRKNVGCFIAKGRAVKDWL 457


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  319 bits (818), Expect = 2e-84
 Identities = 170/389 (43%), Positives = 249/389 (64%), Gaps = 13/389 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQS------SPLALPIYRRISTTHWFNWTPKLVAELISV 1069
            I KF+ASSSK+ AL  L+ LLS +      S LALP+Y RIS   WF+W PKL+A++I++
Sbjct: 47   ICKFIASSSKSIALNALSHLLSPTTTHPYLSSLALPLYSRISEASWFSWNPKLIADVIAL 106

Query: 1068 LESNGXXXXXXXXXXXSVAKLAS-QRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMG- 895
            L   G           ++ KL S +RD+  FYC+LID+ S       V +   R+  +  
Sbjct: 107  LYKQGQLKEAETLVSETLIKLGSRERDLVSFYCNLIDSHSKHSSNQGVFDVISRLSRIVS 166

Query: 894  -----FLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 ++ +R ++SM+  L ++G+P +AE +++EM   G K S FE+R V  GYGR+G  
Sbjct: 167  ESSSVYVKERAYKSMISSLCAVGLPLEAENLIEEMRVKGLKPSVFEFRSVVYGYGRVGLS 226

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
            ++M R+L +M + GF +DTV +N+VLS YG Y K  EMV+W+++M+   I +S+RT NSV
Sbjct: 227  EDMQRILLQMGNEGFELDTVVSNMVLSSYGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSV 286

Query: 549  LNSCPTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVELDL 370
            LNSCP ++ +++DL T P  +D+L++ L  D               E ++W  SE +LDL
Sbjct: 287  LNSCPMIMSILQDLKTFPPTIDELMETLKGDEALLVKELIGSMVLAELMEWDCSEGKLDL 346

Query: 369  HGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQL 190
            HG HL SAY+I+LQW EELR R     E V+P++I+VVCGSGKHS VRG+SPVK +V ++
Sbjct: 347  HGMHLGSAYLIMLQWREELRYRLNA-AEYVMPVEITVVCGSGKHSSVRGESPVKRMVREM 405

Query: 189  MFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            M +  SPMK+D+KN+G  +AK  VV++WL
Sbjct: 406  MTRTRSPMKIDRKNIGCFVAKAKVVKNWL 434


>ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina]
            gi|568866680|ref|XP_006486677.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g17033-like [Citrus sinensis]
            gi|557524456|gb|ESR35762.1| hypothetical protein
            CICLE_v10028424mg [Citrus clementina]
          Length = 451

 Score =  319 bits (818), Expect = 2e-84
 Identities = 170/391 (43%), Positives = 245/391 (62%), Gaps = 15/391 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQS------SPLALPIYRRISTTHWFNWTPKLVAELISV 1069
            I KFVASS +  AL  L+ LLS        S LA P+Y RI+   WF W PKLVAE+I+ 
Sbjct: 61   ISKFVASSPQFIALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEIIAF 120

Query: 1068 LESNGXXXXXXXXXXXSVAKLAS-QRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMG- 895
            L+  G           +++KL S +R++  FYC+LID+      K    + Y R+ ++  
Sbjct: 121  LDKQGQREEAETLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQLVN 180

Query: 894  -----FLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 ++ ++  +SM+ GL  MG P +AE +++EM   G + SGFEY+ +  GYGRLG  
Sbjct: 181  SSSSVYVKRQALKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLGLL 240

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
            ++M R++ +ME  G RVDTVC+N+VLS YGD+ +   MV W++KM+  GI +SVRT NSV
Sbjct: 241  EDMERIVNQMESDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYNSV 300

Query: 549  LNSCPTVVLMVKDLVT--VPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVEL 376
            LNSC T++ M++DL +   PL + +L + LN +               E +KW   E +L
Sbjct: 301  LNSCSTIMSMLQDLNSNDFPLSILELTEVLNEEEVSVVKELEDSSVLDEAMKWDSGETKL 360

Query: 375  DLHGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVS 196
            DLHG HL SAY I+LQWM+E+R RF   E+ V+P +I+VVCGSGKHS VRG+S VK +V 
Sbjct: 361  DLHGMHLGSAYFIILQWMDEMRNRFN-NEKHVIPAEITVVCGSGKHSTVRGESSVKAMVK 419

Query: 195  QLMFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            ++M + +SPM++ + N+G  +AKG VV+DWL
Sbjct: 420  KMMVRTSSPMRVHRNNIGCFIAKGHVVKDWL 450


>ref|XP_010028778.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Eucalyptus grandis] gi|629089337|gb|KCW55590.1|
            hypothetical protein EUGRSUZ_I01461 [Eucalyptus grandis]
          Length = 469

 Score =  318 bits (816), Expect = 4e-84
 Identities = 166/385 (43%), Positives = 244/385 (63%), Gaps = 9/385 (2%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQS--SPLALPIYRRISTTHWFNWTPKLVAELISVLESN 1057
            +RKFVASS K+ +L  L+ LL     S LALP Y +I    WF W  KL A+LI+ LE  
Sbjct: 59   MRKFVASSPKSASLDALSHLLPHPHLSSLALPFYAKIREAPWFRWNAKLAADLIASLEKQ 118

Query: 1056 GXXXXXXXXXXXSVAKLASQ-RDIAHFYCDLIDAASDRKLKNFVLECYGRIKEM------ 898
            G           ++A+L  + RD+A  YC LID+  + K +        R+K +      
Sbjct: 119  GLTAESESLAAEAIARLGQRDRDVALLYCHLIDSLVELKSETGFDTYLTRLKHIVDGSSS 178

Query: 897  GFLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSFDEMI 718
             ++ +R + SM++GLS MG P +AE ++DEM   G +LS FE   V  GYG+LG FD+MI
Sbjct: 179  AYVKRRGYGSMIRGLSEMGRPGEAESLMDEMRLKGTELSNFEVNAVVYGYGKLGMFDDMI 238

Query: 717  RVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSVLNSC 538
            R + KM+  GF +DTVCAN+VLS YG+     +M++W+R+M+   I +S+RT N+VLNSC
Sbjct: 239  RNIDKMDAQGFEIDTVCANMVLSSYGNSRDLSQMLSWLRRMKATAIPFSIRTYNTVLNSC 298

Query: 537  PTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVELDLHGFH 358
            PT++ +++D    PL +++LI+ L+++               + ++W  SE +LDLHG H
Sbjct: 299  PTIMSLLRDPSGFPLSLEELIEALDSEEGALVKELVDSPVLSKAMEWDPSEAKLDLHGMH 358

Query: 357  LTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQLMFKL 178
            L SAY+I+LQW EE+R R     E V+P +++VVCGSGKHS VRG+SPVK++V  ++ +L
Sbjct: 359  LGSAYLIMLQWTEEMRCRLN-GGEYVIPAEVTVVCGSGKHSAVRGESPVKQMVKAMLGRL 417

Query: 177  NSPMKLDKKNVGKLLAKGSVVRDWL 103
             SPM++D+KNVG  +AKG V+ +WL
Sbjct: 418  GSPMRIDRKNVGCFVAKGRVLNNWL 442


>ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539507|gb|EEF41095.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 460

 Score =  315 bits (807), Expect = 4e-83
 Identities = 168/389 (43%), Positives = 243/389 (62%), Gaps = 13/389 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILL------SQSSPLALPIYRRISTTHWFNWTPKLVAELISV 1069
            I+KFVA+S K+ AL  L+ LL      S  S LA  +Y +I+   WF W PKLVA++++ 
Sbjct: 72   IKKFVAASPKSIALDALSHLLNPHSSHSHLSSLAFTLYLKIAEARWFQWNPKLVADVVAF 131

Query: 1068 LESNGXXXXXXXXXXXSVAKL-ASQRDIAHFYCDLIDAASD----RKLKNFVLECYGRIK 904
            L+  G           S++KL   +RD+A FYC+L+++ S     R   N V      + 
Sbjct: 132  LDKQGRYDESATLVSDSISKLQVKERDLARFYCNLVESQSKQNSIRGFDNSVASLMQLVC 191

Query: 903  EMG--FLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 ++ ++ ++SMV GL  MG P +AE +++EMG  G + S FE++ V   YG LGSF
Sbjct: 192  NSNSVYVKRQGYKSMVNGLCEMGRPREAETLIEEMGKEGVRPSMFEFKCVVYAYGSLGSF 251

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
            +EM + L +ME AGFRVDTVC+N++L+ YG +    EMV W++KM+ LGI +S+RT NS 
Sbjct: 252  EEMNKCLHQMERAGFRVDTVCSNMILASYGAHNALPEMVLWLQKMKDLGIPFSLRTCNSA 311

Query: 549  LNSCPTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVELDL 370
            LNSCPT++ M+++    P+ + DL+K L+ D               E +KW  +E +LDL
Sbjct: 312  LNSCPTIMSMMQNSNDFPISIHDLMKILSEDEALLVKEIVTSSVLDEAMKWDVAEAKLDL 371

Query: 369  HGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQL 190
            HG HL SAY+I+L W+EE+R RFK     V P +I+VVCGSG HSIVRG+SPVK +V   
Sbjct: 372  HGTHLCSAYLIILLWIEEMRKRFK-SVNYVNPTEITVVCGSGNHSIVRGESPVKCMVKDF 430

Query: 189  MFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            M +  SPM++D++N+G  +AKG VV +WL
Sbjct: 431  MVRARSPMRIDRRNIGCFIAKGKVVEEWL 459


>ref|XP_012077696.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Jatropha curcas]
          Length = 473

 Score =  313 bits (801), Expect = 2e-82
 Identities = 165/393 (41%), Positives = 247/393 (62%), Gaps = 17/393 (4%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQSSP------LALPIYRRISTTHWFNWTPKLVAELISV 1069
            I+KFVA+S K+ AL  L+ LLS +S       LA P+Y +I   HWF+W PKLVAE++++
Sbjct: 81   IKKFVAASPKSIALDALSHLLSPNSSYSHLSSLAFPLYLKIQEAHWFDWNPKLVAEVVAL 140

Query: 1068 LESNGXXXXXXXXXXXSVAKLA-SQRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMGF 892
            L+  G           S++KL   +RD+A FYC+L+++ S +       + + R+ ++ F
Sbjct: 141  LDKQGQYNESGTLISDSISKLKLRERDLALFYCNLVESHSKQNCVQGFEDSFARLNQLVF 200

Query: 891  ------LGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                  + K+ ++SM+ GL  MG P++A+ +++EM   G K S +E+R V   YG+LG F
Sbjct: 201  SSNSVYIKKQAYKSMISGLCEMGRPKEAQDLIEEMRGKGVKPSVYEFRCVLHAYGKLGLF 260

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
             EM  +L +ME  GF+VDTVC+N+VLS YG Y    E+V+W++KM+ LGI +S RT NSV
Sbjct: 261  QEMQMILDQMESGGFKVDTVCSNMVLSSYGVYNALPEIVSWLKKMKDLGIPFSSRTCNSV 320

Query: 549  LNSCPTVVLMVK--DLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXEN--LKWSDSEV 382
            LNSCPT++  V+  +  T P+ + +L+K L  D                   ++W   E 
Sbjct: 321  LNSCPTMMSTVQNSNANTYPISIQELMKILRGDEAMVVNELIIGSSSVLEEAMQWDALES 380

Query: 381  ELDLHGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKEL 202
            +LDLHG HL SAY+I+L W EE++ RF      V+P +I+VVCGSG HSIVRG+SPVK +
Sbjct: 381  KLDLHGMHLCSAYLIMLLWFEEMKNRFN-GGNYVIPAEITVVCGSGNHSIVRGESPVKRM 439

Query: 201  VSQLMFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            +  +M +  SPM++D+KN+G  +AKG VV++WL
Sbjct: 440  IKSIMVQTRSPMRVDRKNLGCFIAKGKVVKEWL 472


>ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica]
            gi|462396130|gb|EMJ01929.1| hypothetical protein
            PRUPE_ppa021547mg [Prunus persica]
          Length = 447

 Score =  312 bits (800), Expect = 3e-82
 Identities = 162/389 (41%), Positives = 242/389 (62%), Gaps = 13/389 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQS------SPLALPIYRRISTTHWFNWTPKLVAELISV 1069
            I KF+ SS+K+ AL TL+ LLS        S LALP Y +I+   WF W PKLVA L+++
Sbjct: 59   IAKFLTSSTKSIALNTLSYLLSPDTTLPHLSSLALPFYSKITEASWFEWNPKLVAALVAL 118

Query: 1068 LESNGXXXXXXXXXXXSVAKLAS-QRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEM-- 898
            L+  G           +++KL S +R++A F+C L+++ S    K+     Y  + ++  
Sbjct: 119  LDKQGQHNEAEVLISETISKLGSRERELALFHCQLVESHSKLSSKHGFDSSYSYLYQLLH 178

Query: 897  ----GFLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 ++  R F+SMV GL  M  P +A+ +++EM   G K S FE+R V  GYGRLG F
Sbjct: 179  NSSSVYVKNRAFESMVSGLCEMDRPREADNLIEEMRVRGLKPSVFEFRSVVYGYGRLGLF 238

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
            ++M++V+ +ME+ G  +DT+C+N+VLS YG + +   M+ W+RKM+ L + +S+RT NSV
Sbjct: 239  EDMLKVVEQMENQGIAIDTICSNMVLSSYGAHSELAAMLVWLRKMKSLSLPFSIRTYNSV 298

Query: 549  LNSCPTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVELDL 370
            LNSC T++ M+++    P  +++L   LN D               E + W   E +LDL
Sbjct: 299  LNSCLTIMAMLQEPKDFPCSIEELNGVLNGDEALLVKELVESTVLDEVMVWEPLEAKLDL 358

Query: 369  HGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQL 190
            HG HL SAY+ILL+W E +R RF   ++ V+P ++ V+CGSGKHS VRG+SPVK LV Q+
Sbjct: 359  HGMHLGSAYLILLEWFEAMRCRFNSGKD-VIPAEVVVICGSGKHSSVRGESPVKGLVKQM 417

Query: 189  MFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            M ++ SPM++D+KNVG  +AKG  V+DWL
Sbjct: 418  MLRMESPMRIDRKNVGCFVAKGRAVKDWL 446


>ref|XP_011045590.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Populus euphratica]
          Length = 473

 Score =  309 bits (791), Expect = 3e-81
 Identities = 167/391 (42%), Positives = 241/391 (61%), Gaps = 15/391 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQSSP-------LALPIYRRISTTHWFNWTPKLVAELIS 1072
            I+KFVASS K+ AL  L+ LLS  S        L LP+Y +IS   WF+W PKLVA+++ 
Sbjct: 83   IKKFVASSPKSIALDALSHLLSPDSTHHPLLYLLTLPLYLKISEASWFSWNPKLVAQVVV 142

Query: 1071 VLESNGXXXXXXXXXXXSVAKLA-SQRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMG 895
            +L+  G           +V++L   +R++  FYC+LI   S         + Y R+ +  
Sbjct: 143  LLDKQGLDKELKALMSETVSRLQFKERELVLFYCNLIGFNSKHNWVRGFDDSYSRLNQFV 202

Query: 894  ------FLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGS 733
                  ++ K+ +++M+ GL  MG   +AE ++ EM   G K + FE+R V  GYGRLG 
Sbjct: 203  SESKSVYVKKQGYKAMISGLCEMGRAREAEDLIGEMRERGLKPTLFEFRCVLYGYGRLGL 262

Query: 732  FDEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNS 553
            F +M R+L KME     VDTVCAN+VL+ YG +    EM  W+RKM+ LGI  S+RT NS
Sbjct: 263  FKDMERILDKMESGEIEVDTVCANMVLASYGAHNALPEMGLWLRKMKTLGIPLSIRTCNS 322

Query: 552  VLNSCPTVVLMVKDL-VTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVEL 376
            VLNSCPT++ ++++L  + P+ + +L+K L+ D               E ++W  SE +L
Sbjct: 323  VLNSCPTIMALMRNLDASYPVSIQELLKILSEDEAMLVKELIESSVLKEAVEWDTSEGKL 382

Query: 375  DLHGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVS 196
            DLHG HL SAYVI+LQWMEE R R     E V+P +I+VVCGSG HS VRG+SPVK +++
Sbjct: 383  DLHGMHLGSAYVIMLQWMEETRNRLS-DGEHVIPAEITVVCGSGNHSTVRGESPVKSMIT 441

Query: 195  QLMFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            ++M +  SPM++D+KN+G  +AKG+VV+ WL
Sbjct: 442  EIMAQTRSPMRIDRKNIGCFVAKGNVVKKWL 472


>ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa]
            gi|550331693|gb|EEE86893.2| hypothetical protein
            POPTR_0009s14120g [Populus trichocarpa]
          Length = 473

 Score =  308 bits (788), Expect = 7e-81
 Identities = 167/391 (42%), Positives = 239/391 (61%), Gaps = 15/391 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQSSP-------LALPIYRRISTTHWFNWTPKLVAELIS 1072
            I+KFVASS K+ AL  L+ LLS  S        L LP+Y +IS   WF+W PKLVA+++ 
Sbjct: 83   IKKFVASSPKSIALDALSNLLSPDSTHHPLLYLLTLPLYLKISEASWFSWNPKLVAQVVV 142

Query: 1071 VLESNGXXXXXXXXXXXSVAKLA-SQRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMG 895
            +L+  G           +V++L   +R++  FYC+LI   S         + Y R+ +  
Sbjct: 143  LLDKQGLDKELKALMSETVSRLQFKERELVLFYCNLIGFNSKHNWVRGFDDSYSRLNQFV 202

Query: 894  ------FLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGS 733
                  ++ K+ +++M+ GL  MG   +AE ++ EM   G K   FE+R V  GYGRLG 
Sbjct: 203  SDSNSVYVKKQGYKAMISGLCEMGRAREAEDLIGEMRERGLKPKLFEFRCVLYGYGRLGL 262

Query: 732  FDEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNS 553
            F +M R+L KME     VDTVCAN+VL+ YG +    EM  W+RKM+ LGI  S+RT NS
Sbjct: 263  FKDMERILDKMESGEIEVDTVCANMVLASYGAHNALPEMGLWLRKMKTLGIPLSIRTCNS 322

Query: 552  VLNSCPTVVLMVKDL-VTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVEL 376
            VLNSCPT++ ++++L  + P+ + +L+K L+ +               E  KW  SE +L
Sbjct: 323  VLNSCPTIMALMRNLDASYPVSIQELLKILSEEEAMLVKELIESSVLKEATKWDTSEGKL 382

Query: 375  DLHGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVS 196
            DLHG HL SAYVI+LQWMEE R R     E V+P +I+VVCGSG HS VRG+SPVK +++
Sbjct: 383  DLHGMHLGSAYVIMLQWMEETRNRLS-DGEHVIPAEITVVCGSGNHSTVRGESPVKSMIT 441

Query: 195  QLMFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            ++M +  SPM++D+KN+G  +AKG+VV+ WL
Sbjct: 442  EIMAQTRSPMRIDRKNIGCFVAKGNVVKKWL 472


>ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Fragaria vesca subsp. vesca]
          Length = 448

 Score =  307 bits (787), Expect = 9e-81
 Identities = 162/389 (41%), Positives = 240/389 (61%), Gaps = 13/389 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQS------SPLALPIYRRISTTHWFNWTPKLVAELISV 1069
            I KF+++S K+TAL TL+ LLS        S LALP+Y +I+   WF W PKLVA L+++
Sbjct: 60   ISKFLSTSPKSTALTTLSYLLSPHTAHPHLSSLALPMYSKITEASWFEWNPKLVAALVAL 119

Query: 1068 LESNGXXXXXXXXXXXSVAKLAS-QRDIAHFYCDLIDAASDRKLK-NFVLECYG-----R 910
            L   G           +++KL + +R++  F+C L+++ S    K  F   C       +
Sbjct: 120  LAKQGQQSQSEALISETISKLGNKERELVQFHCQLVESHSKMSSKCGFDRACTYLHQLLQ 179

Query: 909  IKEMGFLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 ++ +R F+SMV GL +M  P +A+++++EM   G K S FE+R V  GYGRLG F
Sbjct: 180  NSSSVYVKRRAFESMVGGLCAMDRPGEADELIEEMRVKGLKASVFEFRSVVYGYGRLGMF 239

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
            +EM++++ +ME  GF  DT+C N+VLS YG + +   M  W+RKM+   + +SVRT NSV
Sbjct: 240  EEMLKIVDQMEKQGFGDDTICCNMVLSSYGAHNELAAMANWLRKMKESSVPFSVRTYNSV 299

Query: 549  LNSCPTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVELDL 370
            LNSCPT++ M+++   VP  V +L   L+ D               E + W  +E +LDL
Sbjct: 300  LNSCPTIMAMLQEPKAVPCSVGELSGVLDGDEALVVKELVGSAVVDEAMVWDSAEAKLDL 359

Query: 369  HGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQL 190
            HG HL SAY+++L+W E +  RFK   E VVP ++ +VCG GKHS VRG+SPVK+LV ++
Sbjct: 360  HGMHLGSAYLVMLEWFEAMGNRFK-SAECVVPAEVVIVCGLGKHSSVRGESPVKDLVKEM 418

Query: 189  MFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            M ++ SPM++D+KNVG  +AKG  V+DWL
Sbjct: 419  MHQMESPMRIDRKNVGCFIAKGRAVKDWL 447


>gb|ERM96287.1| hypothetical protein AMTR_s00001p00173820 [Amborella trichopoda]
          Length = 432

 Score =  306 bits (784), Expect = 2e-80
 Identities = 162/380 (42%), Positives = 234/380 (61%), Gaps = 6/380 (1%)
 Frame = -2

Query: 1224 KFVASSSKATALQTLAILLSQSS-PLALPIYRRISTTHWFNWTPKLVAELISVLESNGXX 1048
            KF  SSSKA  L+    LLS     +ALP+Y+ IS   W+ W  KL+A LI +LE     
Sbjct: 54   KFSGSSSKALQLEAFHHLLSSHQWAVALPMYKIISEAPWYKWNGKLIASLIGLLEKYEQR 113

Query: 1047 XXXXXXXXXSVAKLASQRDIAHFYCDLIDAASDRKLKNFVLECYGRIKEMGFLG--KRPF 874
                        K    RD+A FYC+LID+ S+  LKN VLE Y  +K++ +       +
Sbjct: 114  EEALSLLGSETIKKLGPRDLATFYCNLIDSYSEHGLKNQVLETYSSLKKLPYRSGDALAY 173

Query: 873  QSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSFDEMIRVLRKMED 694
            +S++ G S M +P  A ++LDEM  SGFK S FE+R +   YGR G F EM + L  ME+
Sbjct: 174  KSIINGFSLMNLPHHANEMLDEMMASGFKASPFEFRSLILAYGRCGFFPEMEKTLELMEN 233

Query: 693  AGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSVLNSCPTVVLMVK 514
             G+ +DTV AN +LS YG + ++ +MV+W++KM  L + +S+RT NSV+NSCPT++ +V+
Sbjct: 234  LGYSIDTVTANTILSSYGSFMEYSKMVSWLKKMASLNVDFSIRTYNSVVNSCPTILSIVQ 293

Query: 513  DLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENL---KWSDSEVELDLHGFHLTSAY 343
               TVPL +++L++ LN                   L    W+  E  LDLHG H  +AY
Sbjct: 294  QPQTVPLSINELLELLNLSTPKELFVIQEFLSSVIQLGSVNWASEEWRLDLHGMHTGAAY 353

Query: 342  VILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQLMFKLNSPMK 163
            VILLQW EE++ R  + +  V+PL++SVV G+GKHS VRG+S VK+LVS++MF++ SP+K
Sbjct: 354  VILLQWFEEMKER--LYDGTVIPLEVSVVTGTGKHSSVRGESTVKKLVSEMMFQMGSPLK 411

Query: 162  LDKKNVGKLLAKGSVVRDWL 103
            +D+ NVG+ +A+G  V+ WL
Sbjct: 412  VDRLNVGRFVARGKAVKTWL 431


>ref|XP_009629638.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Nicotiana tomentosiformis]
          Length = 459

 Score =  305 bits (781), Expect = 5e-80
 Identities = 160/389 (41%), Positives = 245/389 (62%), Gaps = 13/389 (3%)
 Frame = -2

Query: 1230 IRKFVASSSKATALQTLAILLSQS------SPLALPIYRRISTTHWFNWTPKLVAELISV 1069
            +RKFVASSSK  AL TL+ LLS +      S LALP+Y  IS   WF+W  KLVA+L+++
Sbjct: 60   LRKFVASSSKHVALSTLSHLLSPTTSHLRLSSLALPLYLEISEASWFDWNSKLVADLVAL 119

Query: 1068 LESNGXXXXXXXXXXXSVAKLAS-QRDIAHFYCDLIDAASDRKLKNFVLECYGRIK---- 904
            L               +V+KL   +RD+  FY  LI + S  K +  VL+   ++K    
Sbjct: 120  LYKLERFDEAETLVTETVSKLGGRERDLCSFYSQLIHSQSKHKSEKGVLDFCTKLKLFLS 179

Query: 903  --EMGFLGKRPFQSMVKGLSSMGMPEDAEKVLDEMGNSGFKLSGFEYRMVTQGYGRLGSF 730
                 +L ++ + SMV    S+G+P DAE++++EM   G KLS FE+R +   YG+ G F
Sbjct: 180  CSSSVYLKQQGYASMVDAFCSIGLPRDAEELIEEMKELGLKLSKFEFRALVYSYGKSGFF 239

Query: 729  DEMIRVLRKMEDAGFRVDTVCANVVLSCYGDYGKFEEMVTWIRKMRVLGIGYSVRTLNSV 550
             +M R++ +ME  G ++DTV AN+VL+ +G   +  EMV+W++KM V G+ +S+RT NSV
Sbjct: 240  SDMKRIVGQMESMGLQLDTVGANMVLNSFGSQYELSEMVSWLQKMDVSGVPFSIRTYNSV 299

Query: 549  LNSCPTVVLMVKDLVTVPLCVDDLIKKLNNDXXXXXXXXXXXXXXXENLKWSDSEVELDL 370
            LNSCPT+ L+++D  +VPL +++L+  LN +               E ++W+ SE++LDL
Sbjct: 300  LNSCPTISLLLQDPKSVPLSLEELLANLNENEASLVKILVGSSVLEETMQWNPSELKLDL 359

Query: 369  HGFHLTSAYVILLQWMEELRARFKVKEEAVVPLKISVVCGSGKHSIVRGDSPVKELVSQL 190
            HG H +SAYVI+LQW  +L+ +    E  V+P +I+VVCG+GKHS+VRG+SPVK L+ +L
Sbjct: 360  HGMHFSSAYVIILQWFHQLQCKLDA-ENRVLPAEITVVCGAGKHSVVRGESPVKGLIKEL 418

Query: 189  MFKLNSPMKLDKKNVGKLLAKGSVVRDWL 103
            + ++  P+++D+KN+G  + KG    +WL
Sbjct: 419  LLRVGCPLRIDRKNIGCFIGKGKSFMEWL 447


Top