BLASTX nr result

ID: Akebia23_contig00001455 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00001455
         (1617 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citr...   454   e-125
ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containi...   449   e-123
ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phas...   447   e-123
ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phas...   443   e-122
ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containi...   441   e-121
ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containi...   432   e-118
ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein...   430   e-118
gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]     429   e-117
ref|XP_003623530.1| Pentatricopeptide repeat-containing protein ...   428   e-117
ref|XP_002309173.2| pentatricopeptide repeat-containing family p...   425   e-116
gb|AHB18409.1| pentatricopeptide repeat-containing protein [Goss...   418   e-114
ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containi...   395   e-107
ref|XP_002533822.1| pentatricopeptide repeat-containing protein,...   379   e-102
ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containi...   363   2e-97
ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutr...   355   4e-95
ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Caps...   338   3e-90
ref|XP_002879744.1| pentatricopeptide repeat-containing protein ...   335   3e-89
ref|NP_181376.3| pentatricopeptide repeat-containing protein [Ar...   335   3e-89
gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|3137637...   335   3e-89
ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [A...   298   5e-78

>ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citrus clementina]
            gi|557531581|gb|ESR42764.1| hypothetical protein
            CICLE_v10013613mg [Citrus clementina]
          Length = 506

 Score =  454 bits (1169), Expect = e-125
 Identities = 225/397 (56%), Positives = 299/397 (75%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+IK L +NSQF  +  +LD +EK E F  PE IF++LI+ Y  A++ QD++ +F++I
Sbjct: 83   YHFVIKTLAENSQFCDISSVLDHIEKRENFETPEFIFIDLIKTYADAHRFQDSVNLFYKI 142

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            PKFRC PSV SL+A+LSVLC+ KE ++MV Q+LLKS + MNIR+EE+SFRILI+ LC+IN
Sbjct: 143  PKFRCVPSVYSLNALLSVLCRNKEWVKMVPQILLKS-QLMNIRIEESSFRILISTLCRIN 201

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            +  +AIEILN M + G+  D K  S ILSS+C+ RDLSS E+LGF++E++K+GF     D
Sbjct: 202  RVGFAIEILNCMINDGFCVDGKTCSWILSSVCEQRDLSSDELLGFVQEMKKLGFCFGMVD 261

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y+NVI  LVK  +  DAL ILNQMK  GIKP++VCYT VL+GV+    + KAEE+FDE+L
Sbjct: 262  YTNVIRSLVKKEKVFDALGILNQMKSDGIKPDIVCYTMVLNGVIVQEDYVKAEELFDELL 321

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            VLG++PD+YTYN+Y+ GLC Q+N E G KM+ CMEELG KP+V TYNTL+ ALCK  EL 
Sbjct: 322  VLGLVPDVYTYNVYINGLCKQNNVEAGIKMIACMEELGSKPDVITYNTLLQALCKVRELN 381

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            + RE++K+M  KG+  N  TY I+ID L  +G+++EA GLLE  L KGL   S +FDE I
Sbjct: 382  RLRELVKEMKWKGIVLNLQTYSIMIDGLASKGDIIEACGLLEEALNKGLCTQSSMFDETI 441

Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLS 1191
            CGLC+RGLV +A ++L +M  +DV+PGA  WEALLLS
Sbjct: 442  CGLCQRGLVRKALELLKQMADKDVSPGARVWEALLLS 478


>ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 491

 Score =  449 bits (1156), Expect = e-123
 Identities = 218/397 (54%), Positives = 297/397 (74%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y+F++K L + SQ   +P +LDRLE IEKF+PPE IF NLIR YG AN+++DAI++F RI
Sbjct: 74   YNFVLKTLFKTSQLSHIPSVLDRLESIEKFHPPESIFANLIRFYGSANRVEDAIDVFCRI 133

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            PKFRC PS  SL+++L VLC   EGL+MV QVL+ S  AM IRLEE+SFRILI+ LC+I 
Sbjct: 134  PKFRCDPSAVSLNSLLYVLCGSSEGLKMVPQVLMNSR-AMGIRLEESSFRILISALCRIG 192

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
               YAIEI+  M   GYD D K+ SL+LSSLC+ + +  +EV+GF+EE++KVGF P   D
Sbjct: 193  SVGYAIEIMKCMISNGYDLDVKICSLVLSSLCEQKGVGGLEVVGFVEEMKKVGFCPGMLD 252

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            YSNVI  LVK G+ LDAL +L +MK+ G+KP++VCYT VL GV++ G +  A++VFDE+L
Sbjct: 253  YSNVIRCLVKQGKGLDALRVLCKMKVEGMKPDIVCYTMVLYGVIANGDYKNADKVFDELL 312

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            VLG++PD+YTYN+Y+ GLC+Q+N E G KM+ CM+ELGC+PN+ TYN L+ ALCK  EL 
Sbjct: 313  VLGLVPDVYTYNVYINGLCNQNNVEAGIKMITCMDELGCRPNLITYNLLLKALCKNEELS 372

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            +ARE++ +M L GV  N  T++I++D L C+G+V EA   +E ML K +      +D++I
Sbjct: 373  RARELVSEMTLNGVGVNLQTHIIMLDGLFCKGDVDEACIFMEEMLDKFMCRRCSAYDDVI 432

Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLS 1191
             GLC+RGLV +A  +L +M+ ++V PGA +WEALLLS
Sbjct: 433  YGLCQRGLVCKAMDLLLKMVDKNVVPGARAWEALLLS 469


>ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phaseolus vulgaris]
            gi|561011455|gb|ESW10362.1| hypothetical protein
            PHAVU_009G202600g [Phaseolus vulgaris]
          Length = 513

 Score =  447 bits (1150), Expect = e-123
 Identities = 220/427 (51%), Positives = 306/427 (71%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+IK LT  SQFQ +PP+LD LE +EKF  PE   V LIR YG ++K+QDA+++F RI
Sbjct: 82   YYFLIKTLTCTSQFQDIPPVLDHLEHLEKFETPEFNLVYLIRFYGLSDKVQDAVDLFLRI 141

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P+FRCTP+V SL+ +LS+LC+K+E L+MV ++LLKS   MNIR+EE++F++LI  LC+I 
Sbjct: 142  PRFRCTPTVCSLNLVLSLLCRKRECLKMVPEILLKSQH-MNIRVEESTFQVLIKALCRIK 200

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            +  YAI++LN M   GY  D  + SLI+SSLC+  D++SVE L    ++RK+GF P   D
Sbjct: 201  RVGYAIKMLNYMIEGGYGLDETMCSLIISSLCEQEDMTSVEALVIWRDMRKLGFCPGIMD 260

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y+N+I FLVK G+ +DALDILNQ K  GIKP+VVCYT VL G+++ G++ K EE+FDE+L
Sbjct: 261  YTNMIRFLVKEGKGMDALDILNQQKKDGIKPDVVCYTMVLSGIIAEGEYVKLEELFDEIL 320

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            V G++PD+YTYN+Y+ GLC Q+N +   K++  MEEL CKPNV T N L+GALC AG+LR
Sbjct: 321  VFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECKPNVVTCNILLGALCVAGDLR 380

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            KAR V+K+MG KGV+ + H+Y I++D L+ +GE+ EA  LLE ML K   P S  FD +I
Sbjct: 381  KARGVMKEMGWKGVRLDLHSYRIMLDGLVGKGEIGEACFLLEEMLEKSFFPRSSTFDHII 440

Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLSRFELSFKEAISPDLDGLLSNLPH 1260
              +C++GL+ EA ++  +++ +   PGA +WEALL S  +L F E     L G  +NL +
Sbjct: 441  FQMCQKGLIVEAIELTKKIVAKSFVPGARAWEALLKSGSKLGFSETTFSGLLGQKNNLSY 500

Query: 1261 HVN*NGN 1281
                +GN
Sbjct: 501  QTG-SGN 506


>ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phaseolus vulgaris]
            gi|561013301|gb|ESW12162.1| hypothetical protein
            PHAVU_008G089500g [Phaseolus vulgaris]
          Length = 514

 Score =  443 bits (1140), Expect = e-122
 Identities = 218/419 (52%), Positives = 303/419 (72%), Gaps = 1/419 (0%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+IK LT  S  Q +PP+LD LE++E F  PE I V LIR YG ++++QDA+++F RI
Sbjct: 82   YYFVIKTLTSTSHLQDIPPVLDHLEQLETFETPEFILVYLIRFYGLSDRVQDAVDLFLRI 141

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P+FRCTP+V SL+ +LS+LC+K+E L+MV ++LLKS   MNIR+EE++F++LI  LC+I 
Sbjct: 142  PRFRCTPTVWSLNLVLSLLCRKRECLKMVPEILLKSQH-MNIRVEESTFQVLIEALCRIK 200

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            +  YAI++LN M   GY  D  + SLI+SSLC+  D++SVE L    ++RK+GF P   D
Sbjct: 201  RVGYAIKMLNYMIEGGYGLDETICSLIISSLCEQEDMTSVEALVIWRDMRKLGFCPGVMD 260

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y+N+I FLVK G+  DALDILNQ K  GIKP+VVCYT VL G+V+ G++ K EE+FDE+L
Sbjct: 261  YTNMIRFLVKEGKGTDALDILNQQKKDGIKPDVVCYTMVLSGIVAEGEYVKLEELFDEIL 320

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            V G++PD+YTYN+Y+ GLC Q+N +   K++  MEEL C+PNV T NTL+GALC AG+LR
Sbjct: 321  VFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECRPNVVTCNTLLGALCVAGDLR 380

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            KAR V+K+MG KGV  N H+Y I++D L+ +GE+ EA  LLE ML K   P S  FD +I
Sbjct: 381  KARGVMKEMGWKGVGLNLHSYRIMLDGLVGKGEIGEACFLLEEMLEKCFFPRSSTFDHII 440

Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLL-SRFELSFKEAISPDLDGLLSNL 1254
              +C++GL++EA ++  +++ +   PGA +WEALLL S  +L F E     L G ++NL
Sbjct: 441  FQMCQKGLIAEAIELTKKIVAKSFVPGARAWEALLLKSGSKLGFSETTFSGLLGQINNL 499


>ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Vitis vinifera]
          Length = 505

 Score =  441 bits (1134), Expect = e-121
 Identities = 225/425 (52%), Positives = 300/425 (70%), Gaps = 4/425 (0%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+I  LT+  QF  LPP+L RLEK+EKF  PE IF NLI++YG AN  +DA+++FFRI
Sbjct: 80   YRFVISTLTRCRQFHHLPPLLHRLEKVEKFETPEFIFTNLIKVYGNANMFEDAVDLFFRI 139

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P FRC PSV SL+A+L VLCK++EGL MV Q+LLKS +AMNIRLEE+SFRIL+  LC+I 
Sbjct: 140  PNFRCVPSVYSLNALLYVLCKRREGLVMVPQILLKS-QAMNIRLEESSFRILVAALCRIK 198

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            K +YAI ILN M + GY  D+K+ S+ILSSLC+ + LS  EVL F+EE+RK+GF P   D
Sbjct: 199  KHNYAIRILNYMLNDGYAVDAKMCSIILSSLCEQKGLSGDEVLRFMEEMRKLGFYPGRVD 258

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
             +NVI FLVK G  +DAL + +QMK  GIKP+ V YT +L+GV + G + KA+++FDEML
Sbjct: 259  CNNVIRFLVKEGMVMDALGVFDQMKTDGIKPDTVSYTMILNGVTADGDYEKADDLFDEML 318

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            VLGV+PDI+ YN+Y+  LC Q+N E G +ML  M ELGCKP+  TYN L+  + K  +L 
Sbjct: 319  VLGVVPDIHAYNVYINSLCKQNNIEEGVRMLASMRELGCKPDYVTYNMLLEGMSKVRDLG 378

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
              RE+ ++M L+GV+ N  TY I++D L+ +GE+ E+  LLE ML K        FDE+I
Sbjct: 379  GMRELAREMELEGVQWNWETYRIMLDGLVGKGEIDESCSLLEEMLDKYFSCWCSTFDEII 438

Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLSRFELSFKEAISPDLDGLL----S 1248
            C LC+RGLV +A Q++ +M+ + +APGA +WEALLL   E SF E    +L   +    +
Sbjct: 439  CELCQRGLVCKALQLVNKMVRKTIAPGARAWEALLLGSVEFSFAETSLTELVNPIQIHPA 498

Query: 1249 NLPHH 1263
             LP H
Sbjct: 499  RLPEH 503


>ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Glycine max]
          Length = 499

 Score =  432 bits (1112), Expect = e-118
 Identities = 214/414 (51%), Positives = 299/414 (72%), Gaps = 1/414 (0%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F++K LT  SQ Q +PP+L  LE +EKF  PE I V LIR YG ++++QDA+++FFRI
Sbjct: 85   YFFVLKTLTSTSQLQDIPPVLYHLEHLEKFETPESILVYLIRFYGLSDRVQDAVDLFFRI 144

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P+FRCTP+V SL+ +LS+LC+K++ L+MV ++LLKS   MNIR+EE++FR+LI  LC+I 
Sbjct: 145  PRFRCTPTVCSLNLVLSLLCRKRDCLEMVPEILLKSQH-MNIRVEESTFRVLIRALCRIK 203

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            +  YAI++LN M   GY  D K+ SL++S+LC+ +DL+S E L    ++RK+GF P   D
Sbjct: 204  RVGYAIKMLNFMVEDGYGLDEKICSLVISALCEQKDLTSAEALVVWRDMRKLGFCPGVMD 263

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y+N+I FLVK GR +DALDILNQ K  GIK +VV YT VL G+V+ G++   +E+FDEML
Sbjct: 264  YTNMIRFLVKEGRGMDALDILNQQKQDGIKLDVVSYTMVLSGIVAEGEYVMLDELFDEML 323

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            V+G+IPD YTYN+Y+ GLC Q+N     +++  MEELGCKPNV TYNTL+GAL  AG+  
Sbjct: 324  VIGLIPDAYTYNVYINGLCKQNNVAEALQIVASMEELGCKPNVVTYNTLLGALSVAGDFV 383

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            KARE++K+MG KGV  N HTY I++D L+ +GE+ E+  LLE ML K L P S  FD +I
Sbjct: 384  KARELMKEMGWKGVGLNLHTYRIVLDGLVGKGEIGESCLLLEEMLEKCLFPRSSTFDNII 443

Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLL-SRFELSFKEAISPDLDG 1239
              +C++ L +EA ++  +++ +   PGAS+WEALLL S  +L + +A    L G
Sbjct: 444  FQMCQKDLFTEAMELTKKVVAKSFLPGASTWEALLLNSGSKLGYSKATFSGLLG 497


>ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508715129|gb|EOY07026.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 542

 Score =  430 bits (1106), Expect = e-118
 Identities = 209/397 (52%), Positives = 288/397 (72%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+IK L QN  F  +P +L  LE +EKF  PE+IF +LI  YG AN+IQDA++IF+RI
Sbjct: 121  YHFLIKTLIQNLHFNHIPSVLHHLEHVEKFQTPEYIFADLITTYGIANRIQDAVDIFYRI 180

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            PKFRC PS  SL+++L++LC+ +  L++V QVLLKS   MNIR+EE++ RIL++ LC++N
Sbjct: 181  PKFRCVPSAYSLNSLLALLCRNQYSLKLVPQVLLKSL-LMNIRVEESTLRILVSALCRMN 239

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            K SYAI+IL  M   G   + K+ S ILSS+C   DL   +V+G   E+ K+GF P   D
Sbjct: 240  KVSYAIDILQRMIDEGLGVNDKVCSFILSSICAKADLDGEDVMGLWRELGKLGFCPAMSD 299

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y+ +I FLVK GR LDALD LNQMK  GIKP +V YT  L+GV++ G +  A+E+FDE+L
Sbjct: 300  YNCLIRFLVKKGRGLDALDFLNQMKSVGIKPGIVSYTMALNGVIAEGDYMLADELFDELL 359

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            +LG++PD+YTYN Y+  LC Q+  E G KM+ CMEEL CKPNV TYN L+ A+CK GE+ 
Sbjct: 360  MLGLVPDVYTYNAYIDALCKQNKVEEGIKMVACMEELRCKPNVLTYNMLLEAICKVGEIS 419

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            +A E++K+M  KG++ N  +Y ++ID L+ +GE++EA GL+E +L K     S+ FDE+I
Sbjct: 420  RAMELVKEMKYKGIEMNLVSYTVIIDGLVSKGEILEAHGLVEEVLHKCFCHQSLAFDEVI 479

Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLS 1191
            CGLC+RGLV EA ++L +M+ ++V+PGA  WEALLLS
Sbjct: 480  CGLCQRGLVCEALELLRKMVAKNVSPGARGWEALLLS 516


>gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]
          Length = 494

 Score =  429 bits (1103), Expect = e-117
 Identities = 219/411 (53%), Positives = 301/411 (73%), Gaps = 4/411 (0%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F++K L + SQF  +  +LDR+E +EKF  PE+ F  +I  YGF ++I+DAI+IF+RI
Sbjct: 72   YHFVLKTLIKTSQFDHIHSVLDRIEFVEKFETPEYFFAQIIGFYGFLDRIEDAIDIFWRI 131

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            PKFRC PS  SL+++L VLC++ EGL+ V +VL+KS + MNIRLEEASFRILIT LCKI 
Sbjct: 132  PKFRCVPSSYSLNSLLYVLCRRNEGLRFVPEVLIKSRD-MNIRLEEASFRILITALCKIG 190

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLC---KSRDLSSVEVLGFLEEIRKVGFSPN 531
            K  YAIEIL+ M   GYD D+++ SLILS LC   K  DL+  +VL  L+++ K+GF P 
Sbjct: 191  KVGYAIEILDCMISDGYDIDARICSLILSFLCGKNKELDLAGFDVLELLQKMEKMGFCPR 250

Query: 532  GFDYSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFD 711
              DYS VI  LV+  R L+ALDIL QMK  G+KP+VVCYT VL G+V+ G++ KA+E+FD
Sbjct: 251  MGDYSKVIRILVREKRGLEALDILGQMKADGMKPDVVCYTMVLHGIVAEGEYSKADEMFD 310

Query: 712  EMLVLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAG 891
            EMLVLG++PD+YTYN Y+ GLC Q++ +     ++ MEELGCKPN+ TYN ++ ALCK G
Sbjct: 311  EMLVLGLVPDVYTYNAYINGLCKQNDVDGALDTILRMEELGCKPNLITYNLILRALCKNG 370

Query: 892  ELRKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFD 1071
            E  +A+E++ +M LKG +    TY+I++D L+ +GE+VEA GL+E ML K L     ++D
Sbjct: 371  EFGRAKELVAEMSLKGFEDYLQTYIIMLDVLLGKGEIVEACGLMEEMLDKLLCRRCSMYD 430

Query: 1072 EMICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLSR-FELSFKEAI 1221
            E+I GLC+RGL  +A ++L +M+ ++VAPGA +W+ALLLS   EL+  EAI
Sbjct: 431  EIIFGLCRRGLDCKASEMLGKMVGKNVAPGARAWDALLLSSGSELTLPEAI 481


>ref|XP_003623530.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355498545|gb|AES79748.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 653

 Score =  428 bits (1101), Expect = e-117
 Identities = 216/419 (51%), Positives = 302/419 (72%), Gaps = 3/419 (0%)
 Frame = +1

Query: 1    YSFIIKILTQ--NSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFF 174
            Y F+IK +T    S   ++P IL+ LE  EKF  PE IF+ LIR YGF +++QDA+++FF
Sbjct: 77   YFFLIKTITNINTSHLHEIPHILNHLEHNEKFETPEFIFMYLIRFYGFNDRVQDAVDLFF 136

Query: 175  RIPKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCK 354
            RIP+FRCTP+V SL+ +LS+LC K+E L+MV  +LLKS + M IRLEE+SF +LI  LC+
Sbjct: 137  RIPRFRCTPTVCSLNLLLSLLCGKRECLRMVPDILLKSRD-MKIRLEESSFWVLIKALCR 195

Query: 355  INKFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNG 534
            I +  YAI+++N M   GY  D K+ SLI+SSLC+  DL+SVE L     +RK+GF P  
Sbjct: 196  IKRVDYAIKMMNCMVEDGYCLDDKICSLIISSLCEQNDLTSVEALVVWGNMRKLGFCPGV 255

Query: 535  FDYSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDE 714
             D +N+I FLVK G+ +DAL+ILNQ+K  GIKP++VCYT VL G+V  G + K +E+FDE
Sbjct: 256  MDCTNMIRFLVKEGKGMDALEILNQLKEDGIKPDIVCYTIVLSGIVKEGDYVKLDELFDE 315

Query: 715  MLVLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGE 894
            +LVLG++PD+YTYN+Y+ GLC Q+NF+   K++V ME+LGCKPNV TYNTL+GALC +G+
Sbjct: 316  ILVLGLVPDVYTYNVYINGLCKQNNFDEALKIVVSMEKLGCKPNVVTYNTLLGALCMSGD 375

Query: 895  LRKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDE 1074
            L KA+ V+K+M LKGV+ N HTY I++D L+ +GE+ EA  LLE ML K   P S  FD 
Sbjct: 376  LGKAKRVMKEMRLKGVELNLHTYRIMLDGLVGKGEIGEACVLLEEMLEKCFYPRSSTFDS 435

Query: 1075 MICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLL-SRFELSFKEAISPDLDGLLS 1248
            ++  +C++GL+S+A  ++ +++ +   PGA  WEALLL S  ++++ E       GLLS
Sbjct: 436  IVHQMCQKGLISDALVLMNKIVAKSFDPGAKVWEALLLNSESKVTYSET---TFAGLLS 491


>ref|XP_002309173.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550335936|gb|EEE92696.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 490

 Score =  425 bits (1092), Expect = e-116
 Identities = 204/379 (53%), Positives = 277/379 (73%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            + FI K L + SQF  +P +LD LEK+E F PPE  F  LI +YG  NK  +AIE+F+RI
Sbjct: 80   FDFIFKTLVKTSQFHHIPSVLDHLEKVESFEPPESTFAYLIEVYGRTNKTHEAIELFYRI 139

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            PKFRC PSV SL+ ++SVLC+  +GL++V ++LLKS + MNIR+EE++F++LIT LC+I 
Sbjct: 140  PKFRCVPSVYSLNTLISVLCRNSKGLKLVPEILLKS-QVMNIRVEESTFQVLITALCRIR 198

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            K  +AIE+LN M + G+  ++++YSL+LS LC+ +D +  EV+GFLE++RK+GF P   D
Sbjct: 199  KVGFAIEMLNCMVNDGFIVNAEIYSLLLSCLCEQKDATKFEVIGFLEQLRKLGFFPGMVD 258

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            YSNVI FLVK  R LDAL +LN MK   IKP++ CYT VL GV+    + KA+E+FDE+L
Sbjct: 259  YSNVIRFLVKGKRGLDALHVLNHMKSDRIKPDIFCYTMVLHGVIEDKDYLKADELFDELL 318

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            V G++PD YTYN+Y+ GLC Q+N + G KM+  MEELGCKPN+ TYN L+  LCK GEL 
Sbjct: 319  VFGLVPDAYTYNVYINGLCKQNNVQAGIKMVASMEELGCKPNLITYNMLVKQLCKVGELS 378

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            KA E++++MGLKG+  N  TY I+ID L   G++VEA GL E  L KGL   S++FDE+I
Sbjct: 379  KAGELVREMGLKGIGLNMQTYRIMIDGLASNGKIVEACGLFEEALDKGLCTQSLMFDEII 438

Query: 1081 CGLCKRGLVSEAQQVLTEM 1137
            CGLC R L  +A ++L +M
Sbjct: 439  CGLCHRDLSCKALKLLEKM 457


>gb|AHB18409.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 480

 Score =  418 bits (1075), Expect = e-114
 Identities = 202/397 (50%), Positives = 288/397 (72%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+IK L  N QF  +P +L  L+ ++ F  PE+IF +L++ YG AN+IQDA++IF+RI
Sbjct: 73   YHFLIKTLLHNRQFHHIPSLLHHLQ-LQHFQTPEYIFTHLVKFYGKANRIQDAVDIFYRI 131

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P+FRC PS  SL+A+L++LC+ + GL+++ QVLL S   MNIRLEE++FR+L+  LC++N
Sbjct: 132  PQFRCFPSAYSLNALLALLCRSQRGLKLLPQVLLNSLH-MNIRLEESTFRLLVCTLCRMN 190

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            K +YAIEIL  M   G   + K++S +LSS+C   DL   +V+GF   +RK+GFSP   D
Sbjct: 191  KVAYAIEILQRMLDDGLGVNDKVFSFVLSSVCAEGDLDGEDVIGFWRGLRKLGFSPAMGD 250

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y  V+ FLVK GR LDA D+LNQMK  GI P ++ YT VL+GV + G +  A+E+FDE+L
Sbjct: 251  YDGVVRFLVKKGRGLDAWDVLNQMKSDGIMPGIISYTMVLNGVTAEGDYILADELFDELL 310

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            +LG++P++YTY  Y+  LC Q+  E G KM+ CMEELGCKPNV  YNTL+  + KAGE+ 
Sbjct: 311  MLGLVPNVYTYKAYIDALCKQNKVEEGIKMVACMEELGCKPNVLIYNTLLRTISKAGEIS 370

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            +ARE++K+M  KG++ N  +Y I+ID L+  GE++EA  L+E +L K +   S+ FDE+I
Sbjct: 371  RARELVKEMKYKGIEMNWVSYTIIIDGLVSNGEILEACALVEEVLHKCIFIKSLTFDEVI 430

Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLS 1191
            CGLC+RGLV +A+++L +M+ R ++PGA  WEALLLS
Sbjct: 431  CGLCQRGLVCKARELLGKMVERSISPGARVWEALLLS 467


>ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Cucumis sativus]
            gi|449483740|ref|XP_004156675.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Cucumis sativus]
          Length = 491

 Score =  395 bits (1016), Expect = e-107
 Identities = 199/416 (47%), Positives = 285/416 (68%), Gaps = 1/416 (0%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F++K L + SQF  +PP+L RL+ +E F  PE+IFV+LI++YG  N+IQDA+ +F RI
Sbjct: 75   YYFVLKTLARTSQFHHIPPVLHRLQFLENFQTPEYIFVDLIKLYGRMNRIQDAVTLFRRI 134

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P FRC PS  SL+++LS L +  +GL ++  ++L SH +M IRLE ++F+ILIT LCK+N
Sbjct: 135  PMFRCVPSTLSLNSLLSQLSRNAQGLPIIPDIILNSH-SMGIRLEHSTFQILITALCKVN 193

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            K  +A+E+ N M   GY  + ++ SLIL+SLC+ +  S   VLGFLEE+R+ GF P   D
Sbjct: 194  KVGHAMELFNYMITEGYGLNPQICSLILASLCQQKKSSGDVVLGFLEEMRQKGFCPAVVD 253

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            YSNVI F V  G   DA+D+LN+MK  G KP++VCYT VL+GV++ G +  A+E+FDE+L
Sbjct: 254  YSNVIKFFVTRGMGSDAVDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMADELFDELL 313

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            + G++PDIYTYN+Y+ GLC Q +   G +M+  ME LGC+PNV TYN ++ +LCK GEL 
Sbjct: 314  LFGLVPDIYTYNVYIHGLCKQGDSVAGLQMIPHMEALGCQPNVITYNVILKSLCKTGELD 373

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            +AR++  KM LKG+  N  T+ I+ID L   GEV+EA  LLE ML     P    F E++
Sbjct: 374  EARKLRSKMQLKGLAENLRTFRIMIDGLFHNGEVIEACVLLEEMLGSRFPPQISTFSEIL 433

Query: 1081 CGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLL-SRFELSFKEAISPDLDGLL 1245
              LCKR +V +A ++L  M+ ++ +PG  +WE LLL S  EL+  +++   L  L+
Sbjct: 434  SWLCKRHMVGKAVELLALMVGKNFSPGPKAWEILLLSSESELTSVKSLETTLKDLV 489


>ref|XP_002533822.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223526239|gb|EEF28557.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 373

 Score =  379 bits (972), Expect = e-102
 Identities = 189/374 (50%), Positives = 263/374 (70%)
 Frame = +1

Query: 148  IQDAIEIFFRIPKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASF 327
            +Q+AI +F+R P FRC PSV  L+ +LSVLC+  EGL  V +VLLKS + MNIR+EE+SF
Sbjct: 1    MQNAIHLFYRTPNFRCVPSVYLLNTLLSVLCRTNEGLNFVPEVLLKSQD-MNIRMEESSF 59

Query: 328  RILITVLCKINKFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEI 507
            R+LI  LC INK  YA+E+ N M + G+  DSK+ SL+LSSLC   D+SS EV+ FL E+
Sbjct: 60   RLLINALCSINKVGYAVEMFNCMINDGFSVDSKICSLLLSSLCYQADISSSEVMRFLGEL 119

Query: 508  RKVGFSPNGFDYSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQF 687
            RK GF P   DYS VI FLV+ G  ++AL++LNQMK+ GIKP++VCYT VL+GV++ G +
Sbjct: 120  RKFGFCPGIKDYSKVINFLVRRGMGMEALNVLNQMKLDGIKPDIVCYTTVLNGVIANGVY 179

Query: 688  HKAEEVFDEMLVLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTL 867
             KA+E+FDE+LV G++PD+YTYN+Y+ GLC Q+N E G +M+  MEELGCKPN+ TYN L
Sbjct: 180  SKADELFDELLVFGLVPDVYTYNVYIYGLCKQNNVEAGIEMVTSMEELGCKPNLITYNIL 239

Query: 868  MGALCKAGELRKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGL 1047
            +  LCK GE  +AR++++ MG KG+     TY ++I  L   G++V+A  LLE  L KGL
Sbjct: 240  LEDLCKNGEDSRARDLVRDMGSKGIGLGMQTYKVMIHGLTSGGKIVKACSLLEEALDKGL 299

Query: 1048 VPTSIIFDEMICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLSRFELSFKEAISP 1227
             P  + FDE+I GLC+ G + +A ++L +++ ++V+PG   WE LLL +  ++F E    
Sbjct: 300  CPRGLRFDEVIYGLCQTGSICKALELLEKVVNKNVSPGVRVWETLLL-KSNINFVEDTFI 358

Query: 1228 DLDGLLSNLPHHVN 1269
            DL  +    PH  N
Sbjct: 359  DLVWVWETHPHCQN 372


>ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Solanum lycopersicum]
          Length = 496

 Score =  363 bits (931), Expect = 2e-97
 Identities = 183/396 (46%), Positives = 268/396 (67%), Gaps = 1/396 (0%)
 Frame = +1

Query: 1    YSFIIKILTQN-SQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFR 177
            Y FI+K LTQN S + ++P ILD + K E F  PE+IF  LI+ YG +N    A E+FF 
Sbjct: 94   YYFILKTLTQNPSTWDEIPLILDYIRKFENFETPEYIFTYLIKFYGDSNMTHLAYEMFFT 153

Query: 178  IPKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKI 357
            +P +RC PSV SL+ ++ VLCK    L++V QVL+KS + +NI +EE++F+ILI  LC+I
Sbjct: 154  MPAYRCNPSVKSLNCLIWVLCKNNYDLRIVLQVLVKS-QLLNIWVEESTFKILIRALCRI 212

Query: 358  NKFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGF 537
             K + A+++L LM   G++ D+ + SLILS++   +D   VE+ G LEE+RK+G+SP   
Sbjct: 213  GKTNNAVDLLKLMVDSGFNLDANICSLILSTMPDVKDCVGVEIWGVLEEMRKLGYSPKRV 272

Query: 538  DYSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEM 717
            D  NVI F V  G+ +DAL++LN+MKM G+ P+VVCY  VL+G++  G++  A+E+FDE+
Sbjct: 273  DLCNVIRFYVNNGKGIDALEVLNKMKMCGMVPDVVCYNLVLNGLIFEGEYSNADELFDEL 332

Query: 718  LVLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGEL 897
            LVLG+ PDI TYN+Y+ GLC Q       ++L CME+LGCKP + TY+T++  LC+ G L
Sbjct: 333  LVLGLNPDIVTYNVYINGLCKQDKMVEALRVLGCMEDLGCKPEMNTYHTILDGLCRCGML 392

Query: 898  RKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEM 1077
               +EVL +M  KG++ +SH Y ++I+ +I  GEV EA+ LL  M+  G VP SI FD +
Sbjct: 393  SSVKEVLGQMKSKGLQLSSHIYGVIINCMIRNGEVDEAYNLLHEMVDMGFVPQSITFDGL 452

Query: 1078 ICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALL 1185
            I  LC +G   E  ++L+ M  +++ PG  SWEA +
Sbjct: 453  IGLLCNKGSFYEVMELLSIMSTKNLVPGIRSWEAFV 488


>ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum]
            gi|557112223|gb|ESQ52507.1| hypothetical protein
            EUTSA_v10017948mg [Eutrema salsugineum]
          Length = 456

 Score =  355 bits (910), Expect = 4e-95
 Identities = 174/380 (45%), Positives = 268/380 (70%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+IK L + SQ + +  +L+ +E  EKF+ PE IF ++I  YGF+ +I++AI++FF+I
Sbjct: 78   YKFVIKTLAKTSQLENIASVLNHIEISEKFDTPESIFRDVIFAYGFSGRIEEAIDVFFKI 137

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P FRC PS  +L+A+LSVL +K++GL+MV +VLLK+ + + +RLEE++  ILI  LC+I 
Sbjct: 138  PNFRCVPSAYTLNALLSVLVRKRQGLKMVPEVLLKASK-LGVRLEESTLGILIDALCRIG 196

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            +   A +++  M    Y  D +LYSL+LSS+CK +D S  +V+G+LE +RK  FSP+  D
Sbjct: 197  EVDCATDLVKDMSDDCYIVDPRLYSLLLSSVCKHKDSSCFDVIGYLEGLRKTRFSPDLRD 256

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y+ V+ FLV+ GR  + + +LNQMK   I+P++VCYT +L GV++   + KA+++FDE+L
Sbjct: 257  YTAVMRFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIILQGVIADEDYKKADKLFDELL 316

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            +LG++PD+YTYN+Y+ GLC QS+ E G KM+ CME+LG +PNV TYN L+ AL KAG++ 
Sbjct: 317  LLGLVPDVYTYNVYINGLCKQSDIECGIKMMSCMEKLGSEPNVVTYNILIKALVKAGDMS 376

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            +A+ + ++M   GV  NSH+Y I+++  I   EVV A GLLE    + LV  S   +E+I
Sbjct: 377  RAKIIWEEMETNGVDRNSHSYDIMVNASIEADEVVCAHGLLEEAFSRSLVVKSSRTEEVI 436

Query: 1081 CGLCKRGLVSEAQQVLTEMI 1140
            C LC +GL+ +A ++L  ++
Sbjct: 437  CRLCDKGLMDKAVELLVHLV 456


>ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Capsella rubella]
            gi|482562854|gb|EOA27044.1| hypothetical protein
            CARUB_v10023139mg [Capsella rubella]
          Length = 470

 Score =  338 bits (868), Expect = 3e-90
 Identities = 170/380 (44%), Positives = 255/380 (67%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+IK L + SQ + +  +L  LE  EKF+ PE IF ++I  YGFA +I +AI++FF+I
Sbjct: 92   YRFVIKTLAKTSQLENIASVLSHLEVSEKFDTPESIFRDVIAAYGFAGRIGEAIDVFFKI 151

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P FRC PS  +L+A+L VL +K+E L++V ++L+K+   M +RLEE++F ILI  LCKI 
Sbjct: 152  PNFRCVPSAYTLNALLLVLVRKRESLELVPEILVKASR-MGVRLEESTFGILIDALCKIG 210

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            +   A E++  M       D +LYS +LSS+CK +D S  +V+G+LE++RK  FSP   D
Sbjct: 211  EVDCATELVRYMSIDCVIVDPRLYSQLLSSVCKHKDSSCFDVVGYLEDLRKTRFSPGLRD 270

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y+ V+ FLV+ GR  + + +LNQMK   I+P++VCYT VL GV++  ++ KA++ FDE+L
Sbjct: 271  YTVVMSFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIVLQGVIADAEYSKADKFFDELL 330

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            +LG+ PD+YTYN+Y+ GLC Q++ E   KM+  M +LG +PNV TYN L+ AL  AG+L 
Sbjct: 331  LLGLAPDVYTYNVYMNGLCKQNDIEGALKMMSSMNKLGSEPNVITYNILIKALVNAGDLS 390

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            +A+ + ++MG+ GV  NSHTY I+I   I  G+VV A G LE      +   S   +E+I
Sbjct: 391  QAKTLWEEMGINGVNRNSHTYDIMISAFIEVGDVVSAQGFLEEAFNMNVFAKSSRTEEVI 450

Query: 1081 CGLCKRGLVSEAQQVLTEMI 1140
              LC +GL+ +A ++L  ++
Sbjct: 451  SRLCDKGLMDKAVELLAHLV 470


>ref|XP_002879744.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297325583|gb|EFH56003.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 444

 Score =  335 bits (860), Expect = 3e-89
 Identities = 169/380 (44%), Positives = 256/380 (67%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+I+ L + SQ + +  +LD LE  EKF+ PE IF ++I  YGF+ +I++AI++FF+I
Sbjct: 66   YRFVIETLAKTSQLENIASVLDHLEVSEKFDTPESIFRDVIAAYGFSGRIEEAIDVFFKI 125

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P FRC PS  +L+A+L VL +K++ L++V ++L+K+   M +RLEE++F ILI  LC+I 
Sbjct: 126  PNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKASR-MGVRLEESTFGILINALCRIG 184

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            +   A E++  M       D +LYSL+LSS+CK +D S  +V+G+LE++RK  F P   D
Sbjct: 185  EVDCATELVRYMSEDSVIVDPRLYSLLLSSVCKHKDSSCFDVIGYLEDLRKTRFLPGLRD 244

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y+ V+ FLV+ GR  + + +LNQMK   I P+VVCYT VL GV++   + KA+++FDE+L
Sbjct: 245  YTVVMRFLVEGGRGKEVVSVLNQMKCDRIDPDVVCYTIVLLGVIADEDYPKADKLFDELL 304

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            +LG+ PD+YTYN+Y+ GLC Q++ E   KM+  M +LG +PNV TYN ++  L KAG+L 
Sbjct: 305  LLGLDPDVYTYNVYINGLCKQNDIEGAIKMMSSMNKLGSEPNVVTYNIVIKGLVKAGDLS 364

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            +A+ + K+M + GV  NSHTY I+I   I   EVV A GLLE      L   S   +E+I
Sbjct: 365  RAKTLWKEMEMNGVNRNSHTYDIMISAYIEVDEVVCAQGLLEEAFNMNLFVKSSKIEEVI 424

Query: 1081 CGLCKRGLVSEAQQVLTEMI 1140
              LC++GL+ +A ++L  ++
Sbjct: 425  SRLCEKGLMDKAVELLAHLV 444


>ref|NP_181376.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|218546769|sp|Q8L6Y7.2|PP193_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g38420, mitochondrial; Flags: Precursor
            gi|3395430|gb|AAC28762.1| hypothetical protein
            [Arabidopsis thaliana] gi|330254441|gb|AEC09535.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 453

 Score =  335 bits (860), Expect = 3e-89
 Identities = 168/380 (44%), Positives = 257/380 (67%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+IK L ++SQ + +  +L  LE  EKF+ PE IF ++I  YGF+ +I++AIE+FF+I
Sbjct: 75   YRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGRIEEAIEVFFKI 134

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P FRC PS  +L+A+L VL +K++ L++V ++L+K+   M +RLEE++F ILI  LC+I 
Sbjct: 135  PNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACR-MGVRLEESTFGILIDALCRIG 193

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            +   A E++  M       D +LYS +LSS+CK +D S  +V+G+LE++RK  FSP   D
Sbjct: 194  EVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGYLEDLRKTRFSPGLRD 253

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y+ V+ FLV+ GR  + + +LNQMK   ++P++VCYT VL GV++   + KA+++FDE+L
Sbjct: 254  YTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIADEDYPKADKLFDELL 313

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            +LG+ PD+YTYN+Y+ GLC Q++ E   KM+  M +LG +PNV TYN L+ AL KAG+L 
Sbjct: 314  LLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVTYNILIKALVKAGDLS 373

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            +A+ + K+M   GV  NSHT+ I+I   I   EVV A GLLE      +   S   +E+I
Sbjct: 374  RAKTLWKEMETNGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEAFNMNVFVKSSRIEEVI 433

Query: 1081 CGLCKRGLVSEAQQVLTEMI 1140
              LC++GL+ +A ++L  ++
Sbjct: 434  SRLCEKGLMDQAVELLAHLV 453


>gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|31376375|gb|AAP49514.1|
            At2g38420 [Arabidopsis thaliana]
          Length = 444

 Score =  335 bits (860), Expect = 3e-89
 Identities = 168/380 (44%), Positives = 257/380 (67%)
 Frame = +1

Query: 1    YSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRI 180
            Y F+IK L ++SQ + +  +L  LE  EKF+ PE IF ++I  YGF+ +I++AIE+FF+I
Sbjct: 66   YRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGRIEEAIEVFFKI 125

Query: 181  PKFRCTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKIN 360
            P FRC PS  +L+A+L VL +K++ L++V ++L+K+   M +RLEE++F ILI  LC+I 
Sbjct: 126  PNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACR-MGVRLEESTFGILIDALCRIG 184

Query: 361  KFSYAIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFD 540
            +   A E++  M       D +LYS +LSS+CK +D S  +V+G+LE++RK  FSP   D
Sbjct: 185  EVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGYLEDLRKTRFSPGLRD 244

Query: 541  YSNVIGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEML 720
            Y+ V+ FLV+ GR  + + +LNQMK   ++P++VCYT VL GV++   + KA+++FDE+L
Sbjct: 245  YTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIADEDYPKADKLFDELL 304

Query: 721  VLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELR 900
            +LG+ PD+YTYN+Y+ GLC Q++ E   KM+  M +LG +PNV TYN L+ AL KAG+L 
Sbjct: 305  LLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVTYNILIKALVKAGDLS 364

Query: 901  KAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMI 1080
            +A+ + K+M   GV  NSHT+ I+I   I   EVV A GLLE      +   S   +E+I
Sbjct: 365  RAKTLWKEMETNGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEAFNMNVFVKSSRIEEVI 424

Query: 1081 CGLCKRGLVSEAQQVLTEMI 1140
              LC++GL+ +A ++L  ++
Sbjct: 425  SRLCEKGLMDQAVELLAHLV 444


>ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [Amborella trichopoda]
            gi|548857785|gb|ERN15583.1| hypothetical protein
            AMTR_s00048p00149840 [Amborella trichopoda]
          Length = 464

 Score =  298 bits (763), Expect = 5e-78
 Identities = 161/376 (42%), Positives = 235/376 (62%)
 Frame = +1

Query: 13   IKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKIQDAIEIFFRIPKFR 192
            I IL QN QF  L  +L  L+   KF+ PE   + LI+    +  +++A+++FF +P  R
Sbjct: 88   IVILAQNPQFSGLKTLLRCLQSNRKFSTPETRIIGLIQSCASSKMVKEALDLFFAMPHLR 147

Query: 193  CTPSVSSLHAILSVLCKKKEGLQMVHQVLLKSHEAMNIRLEEASFRILITVLCKINKFSY 372
            C PS +SL+A+LSVLC   +   +V ++L+K+ E MNIRL+ +SFRILI  LC+I K  +
Sbjct: 148  CQPSTTSLNALLSVLCDT-DSFHLVPELLIKTLE-MNIRLDASSFRILIGSLCRIGKLGF 205

Query: 373  AIEILNLMPHYGYDPDSKLYSLILSSLCKSRDLSSVEVLGFLEEIRKVGFSPNGFDYSNV 552
            AIE+L LMP  G  PDS  Y+ IL  LC+  + S  E+ GFL+E++  GF P+   Y+ V
Sbjct: 206  AIELLRLMPDQGCWPDSGFYAEILCKLCEFGEFS--EIYGFLDEMKDAGFFPDKIAYAIV 263

Query: 553  IGFLVKVGRWLDALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEMLVLGV 732
            I  L K GR  +A  ILN+MK+ G KP+ + YT ++DG    G+F +A EVFDEML +G+
Sbjct: 264  IDSLAKGGRLNEARAILNRMKLEGAKPDTITYTSMMDGFYKIGEFKQAGEVFDEMLAMGL 323

Query: 733  IPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALCKAGELRKARE 912
            +PD++TY++Y+ GLC +   E   ++L  M E+GC+PNV TYNTL+   C  G LR+A E
Sbjct: 324  VPDVFTYSVYINGLCRERKLEEAKEVLCVMREMGCRPNVITYNTLIRTFCSDGNLRRADE 383

Query: 913  VLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGLLEVMLRKGLVPTSIIFDEMICGLC 1092
            ++ +MG  GV  NS TY  LI+  + EG VVEA  LL  M+ KG  P    ++ ++    
Sbjct: 384  LVAEMGSNGVCGNSVTYRTLINAYLREGMVVEANELLVQMVGKGFFPHFSTWEALLSSTV 443

Query: 1093 KRGLVSEAQQVLTEMI 1140
             +  + +A   L E+I
Sbjct: 444  FKWDILQALNALEELI 459



 Score = 74.3 bits (181), Expect = 1e-10
 Identities = 55/235 (23%), Positives = 93/235 (39%), Gaps = 35/235 (14%)
 Frame = +1

Query: 586  DALDILNQMKMSGIKPNVVCYTRVLDGVVSAGQFHKAEEVFDEMLVLGVIPDIYTYNIYV 765
            +ALD+   M     +P+      +L  +     FH   E+  + L + +  D  ++ I +
Sbjct: 135  EALDLFFAMPHLRCQPSTTSLNALLSVLCDTDSFHLVPELLIKTLEMNIRLDASSFRILI 194

Query: 766  KGLCSQSNFETGFKMLVCMEELGCKPNVTTYNTLMGALC--------------------- 882
              LC         ++L  M + GC P+   Y  ++  LC                     
Sbjct: 195  GSLCRIGKLGFAIELLRLMPDQGCWPDSGFYAEILCKLCEFGEFSEIYGFLDEMKDAGFF 254

Query: 883  --------------KAGELRKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAFGL 1020
                          K G L +AR +L +M L+G K ++ TY  ++D     GE  +A  +
Sbjct: 255  PDKIAYAIVIDSLAKGGRLNEARAILNRMKLEGAKPDTITYTSMMDGFYKIGEFKQAGEV 314

Query: 1021 LEVMLRKGLVPTSIIFDEMICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALL 1185
             + ML  GLVP    +   I GLC+   + EA++VL  M      P   ++  L+
Sbjct: 315  FDEMLAMGLVPDVFTYSVYINGLCRERKLEEAKEVLCVMREMGCRPNVITYNTLI 369



 Score = 67.8 bits (164), Expect = 1e-08
 Identities = 47/189 (24%), Positives = 93/189 (49%), Gaps = 1/189 (0%)
 Frame = +1

Query: 655  VLDGVVSAGQFHKAEEVFDEMLVLGVIPDIYTYNIYVKGLCSQSNFETGFKMLVCMEELG 834
            ++    S+    +A ++F  M  L   P   + N  +  LC   +F    ++L+   E+ 
Sbjct: 123  LIQSCASSKMVKEALDLFFAMPHLRCQPSTTSLNALLSVLCDTDSFHLVPELLIKTLEMN 182

Query: 835  CKPNVTTYNTLMGALCKAGELRKAREVLKKMGLKGVKANSHTYMILIDRLICEGEVVEAF 1014
             + + +++  L+G+LC+ G+L  A E+L+ M  +G   +S  Y  ++ +L   GE  E +
Sbjct: 183  IRLDASSFRILIGSLCRIGKLGFAIELLRLMPDQGCWPDSGFYAEILCKLCEFGEFSEIY 242

Query: 1015 GLLEVMLRKGLVPTSIIFDEMICGLCKRGLVSEAQQVLTEMIPRDVAPGASSWEALLLSR 1194
            G L+ M   G  P  I +  +I  L K G ++EA+ +L  M      P   ++ +++   
Sbjct: 243  GFLDEMKDAGFFPDKIAYAIVIDSLAKGGRLNEARAILNRMKLEGAKPDTITYTSMMDGF 302

Query: 1195 FEL-SFKEA 1218
            +++  FK+A
Sbjct: 303  YKIGEFKQA 311

