BLASTX nr result

ID: Akebia22_contig00005566 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00005566
         (1138 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containi...   141   3e-48
ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citr...   127   7e-44
ref|XP_002309173.2| pentatricopeptide repeat-containing family p...   135   5e-43
ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containi...   135   4e-42
ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containi...   125   5e-42
ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phas...   131   8e-41
gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]     124   8e-41
ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phas...   127   1e-39
ref|XP_003623530.1| Pentatricopeptide repeat-containing protein ...   115   1e-37
ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containi...   126   8e-37
ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein...   120   1e-35
ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containi...   115   4e-33
gb|AHB18409.1| pentatricopeptide repeat-containing protein [Goss...    95   3e-30
ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Caps...   111   1e-28
ref|XP_002879744.1| pentatricopeptide repeat-containing protein ...   107   3e-28
ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutr...   105   6e-27
ref|NP_181376.3| pentatricopeptide repeat-containing protein [Ar...   106   6e-27
gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|3137637...   106   6e-27
emb|CAN63706.1| hypothetical protein VITISV_013107 [Vitis vinifera]   108   3e-21
ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [A...    59   6e-16

>ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Vitis vinifera]
          Length = 505

 Score =  141 bits (356), Expect(3) = 3e-48
 Identities = 68/136 (50%), Positives = 87/136 (63%)
 Frame = +1

Query: 478 RKKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSF 657
           R+KWPL PYK  W +TF+ + AM+ L+    +             S      LS+LI+SF
Sbjct: 16  RRKWPLSPYKATWHETFHHRQAMQTLKNTIANQSPSPQ-------SPSNSQFLSILIDSF 68

Query: 658 SIYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANK 837
            IY  DPTP+AY F+I  LT+  QF  LPP+L RLEK+EKF  PE IF NLI++YG AN 
Sbjct: 69  RIYNSDPTPNAYRFVISTLTRCRQFHHLPPLLHRLEKVEKFETPEFIFTNLIKVYGNANM 128

Query: 838 I*DAIEIFFRIPNFRC 885
             DA+++FFRIPNFRC
Sbjct: 129 FEDAVDLFFRIPNFRC 144



 Score = 70.9 bits (172), Expect(3) = 3e-48
 Identities = 40/68 (58%), Positives = 50/68 (73%)
 Frame = +2

Query: 935  KECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDSK 1114
            +E L MV Q+LLKS +AMNIRLEE+ FRIL+  LC+I K +YAI ILN M + GY  D+K
Sbjct: 162  REGLVMVPQILLKS-QAMNIRLEESSFRILVAALCRIKKHNYAIRILNYMLNDGYAVDAK 220

Query: 1115 *YSLILLS 1138
              S+IL S
Sbjct: 221  MCSIILSS 228



 Score = 28.5 bits (62), Expect(3) = 3e-48
 Identities = 12/23 (52%), Positives = 17/23 (73%)
 Frame = +3

Query: 870 PQLQVYPSVSSLHAILSVLCKKK 938
           P  +  PSV SL+A+L VLCK++
Sbjct: 140 PNFRCVPSVYSLNALLYVLCKRR 162


>ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citrus clementina]
           gi|557531581|gb|ESR42764.1| hypothetical protein
           CICLE_v10013613mg [Citrus clementina]
          Length = 506

 Score =  127 bits (320), Expect(3) = 7e-44
 Identities = 59/135 (43%), Positives = 87/135 (64%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +KWPL PYK KW QT +QQ A + ++++ T+               K+ H+LS L++SFS
Sbjct: 18  RKWPLSPYKAKWHQTLDQQQAKQNVKQSLTTPPTKQQQQIP-----KQPHILSSLLHSFS 72

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
           IY C+P P AY F+IK L +NSQF  +  +LD +EK E F  PE IF++LI+ Y  A++ 
Sbjct: 73  IYNCEPPPEAYHFVIKTLAENSQFCDISSVLDHIEKRENFETPEFIFIDLIKTYADAHRF 132

Query: 841 *DAIEIFFRIPNFRC 885
            D++ +F++IP FRC
Sbjct: 133 QDSVNLFYKIPKFRC 147



 Score = 67.8 bits (164), Expect(3) = 7e-44
 Identities = 38/68 (55%), Positives = 50/68 (73%)
 Frame = +2

Query: 935  KECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDSK 1114
            KE ++MV Q+LLKS + MNIR+EE+ FRILI+ LC+IN+  +AIEILN M + G+  D K
Sbjct: 165  KEWVKMVPQILLKS-QLMNIRIEESSFRILISTLCRINRVGFAIEILNCMINDGFCVDGK 223

Query: 1115 *YSLILLS 1138
              S IL S
Sbjct: 224  TCSWILSS 231



 Score = 30.8 bits (68), Expect(3) = 7e-44
 Identities = 14/27 (51%), Positives = 19/27 (70%)
 Frame = +3

Query: 870 PQLQVYPSVSSLHAILSVLCKKKNVFK 950
           P+ +  PSV SL+A+LSVLC+ K   K
Sbjct: 143 PKFRCVPSVYSLNALLSVLCRNKEWVK 169


>ref|XP_002309173.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550335936|gb|EEE92696.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 490

 Score =  135 bits (339), Expect(3) = 5e-43
 Identities = 65/135 (48%), Positives = 85/135 (62%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +KWP  PYK +W + FNQQ AM++L+++A               S  + HLLS LI+SFS
Sbjct: 18  RKWPYSPYKARWHRIFNQQQAMQSLKQSALKPPQQE--------SPNKPHLLSSLIHSFS 69

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
           IY  +P P A+ FI K L + SQF  +P +LD LEK+E F PPE  F  LI +YG  NK 
Sbjct: 70  IYDVEPAPKAFDFIFKTLVKTSQFHHIPSVLDHLEKVESFEPPESTFAYLIEVYGRTNKT 129

Query: 841 *DAIEIFFRIPNFRC 885
            +AIE+F+RIP FRC
Sbjct: 130 HEAIELFYRIPKFRC 144



 Score = 61.6 bits (148), Expect(3) = 5e-43
 Identities = 31/63 (49%), Positives = 49/63 (77%)
 Frame = +2

Query: 944  LQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDSK*YS 1123
            L++V ++LLKS + MNIR+EE+ F++LIT LC+I K  +AIE+LN M + G+  +++ YS
Sbjct: 165  LKLVPEILLKS-QVMNIRVEESTFQVLITALCRIRKVGFAIEMLNCMVNDGFIVNAEIYS 223

Query: 1124 LIL 1132
            L+L
Sbjct: 224  LLL 226



 Score = 26.9 bits (58), Expect(3) = 5e-43
 Identities = 11/27 (40%), Positives = 17/27 (62%)
 Frame = +3

Query: 870 PQLQVYPSVSSLHAILSVLCKKKNVFK 950
           P+ +  PSV SL+ ++SVLC+     K
Sbjct: 140 PKFRCVPSVYSLNTLISVLCRNSKGLK 166


>ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 491

 Score =  135 bits (339), Expect(2) = 4e-42
 Identities = 64/135 (47%), Positives = 87/135 (64%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +KWP+ PY  KW + FNQ  A++ L+ +  +                 + LLS LI+SF+
Sbjct: 19  RKWPVSPYNTKWHKLFNQHQALQTLKHSPLNPP---------------QTLLSTLIHSFN 63

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
            + CDPTP AY+F++K L + SQ   +P +LDRLE IEKF+PPE IF NLIR YG AN++
Sbjct: 64  TFNCDPTPEAYNFVLKTLFKTSQLSHIPSVLDRLESIEKFHPPESIFANLIRFYGSANRV 123

Query: 841 *DAIEIFFRIPNFRC 885
            DAI++F RIP FRC
Sbjct: 124 EDAIDVFCRIPKFRC 138



 Score = 64.7 bits (156), Expect(2) = 4e-42
 Identities = 38/67 (56%), Positives = 45/67 (67%)
 Frame = +2

Query: 938  ECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDSK* 1117
            E L+MV QVL+ S  AM IRLEE+ FRILI+ LC+I    YAIEI+  M   GYD D K 
Sbjct: 157  EGLKMVPQVLMNSR-AMGIRLEESSFRILISALCRIGSVGYAIEIMKCMISNGYDLDVKI 215

Query: 1118 YSLILLS 1138
             SL+L S
Sbjct: 216  CSLVLSS 222


>ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Glycine max]
          Length = 499

 Score =  125 bits (314), Expect(3) = 5e-42
 Identities = 61/136 (44%), Positives = 81/136 (59%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           KKWP  PYK  W   F ++ AM+ L++A                      LLS L++SF 
Sbjct: 18  KKWPHSPYKTSWHHNFGEEQAMKNLKQATLEMDSSQHPQRPNLPCP---FLLSTLLDSFK 74

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
            Y+ DPTP AY F++K LT  SQ Q +PP+L  LE +EKF  PE I V LIR YG ++++
Sbjct: 75  AYSIDPTPKAYFFVLKTLTSTSQLQDIPPVLYHLEHLEKFETPESILVYLIRFYGLSDRV 134

Query: 841 *DAIEIFFRIPNFRCT 888
            DA+++FFRIP FRCT
Sbjct: 135 QDAVDLFFRIPRFRCT 150



 Score = 67.0 bits (162), Expect(3) = 5e-42
 Identities = 33/67 (49%), Positives = 48/67 (71%)
 Frame = +2

Query: 932  EKECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDS 1111
            +++CL+MV ++LLKS   MNIR+EE+ FR+LI  LC+I +  YAI++LN M   GY  D 
Sbjct: 166  KRDCLEMVPEILLKSQH-MNIRVEESTFRVLIRALCRIKRVGYAIKMLNFMVEDGYGLDE 224

Query: 1112 K*YSLIL 1132
            K  SL++
Sbjct: 225  KICSLVI 231



 Score = 27.7 bits (60), Expect(3) = 5e-42
 Identities = 10/24 (41%), Positives = 19/24 (79%)
 Frame = +3

Query: 870 PQLQVYPSVSSLHAILSVLCKKKN 941
           P+ +  P+V SL+ +LS+LC+K++
Sbjct: 145 PRFRCTPTVCSLNLVLSLLCRKRD 168


>ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phaseolus vulgaris]
           gi|561011455|gb|ESW10362.1| hypothetical protein
           PHAVU_009G202600g [Phaseolus vulgaris]
          Length = 513

 Score =  131 bits (329), Expect(2) = 8e-41
 Identities = 65/136 (47%), Positives = 82/136 (60%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +KWP  PYK  W   F +Q AM  L++A                +     LLS LI+SF 
Sbjct: 18  RKWPHSPYKTSWHHNFGEQQAMHKLKQATLEMGCPQTP------NLPHPFLLSTLIDSFK 71

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
            Y+CDPTP AY F+IK LT  SQFQ +PP+LD LE +EKF  PE   V LIR YG ++K+
Sbjct: 72  SYSCDPTPKAYYFLIKTLTCTSQFQDIPPVLDHLEHLEKFETPEFNLVYLIRFYGLSDKV 131

Query: 841 *DAIEIFFRIPNFRCT 888
            DA+++F RIP FRCT
Sbjct: 132 QDAVDLFLRIPRFRCT 147



 Score = 64.3 bits (155), Expect(2) = 8e-41
 Identities = 34/69 (49%), Positives = 48/69 (69%)
 Frame = +2

Query: 932  EKECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDS 1111
            ++ECL+MV ++LLKS   MNIR+EE+ F++LI  LC+I +  YAI++LN M   GY  D 
Sbjct: 163  KRECLKMVPEILLKSQH-MNIRVEESTFQVLIKALCRIKRVGYAIKMLNYMIEGGYGLDE 221

Query: 1112 K*YSLILLS 1138
               SLI+ S
Sbjct: 222  TMCSLIISS 230


>gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]
          Length = 494

 Score =  124 bits (311), Expect(2) = 8e-41
 Identities = 58/135 (42%), Positives = 85/135 (62%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +++P+ PYK KW +TFNQ  A++ L++                 +     LLS+L+NSF+
Sbjct: 17  REFPISPYKTKWHETFNQTQALQTLKR---------------HQNENPNRLLSLLLNSFN 61

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
            Y C+PTP AY F++K L + SQF  +  +LDR+E +EKF  PE+ F  +I  YGF ++I
Sbjct: 62  SYDCNPTPEAYHFVLKTLIKTSQFDHIHSVLDRIEFVEKFETPEYFFAQIIGFYGFLDRI 121

Query: 841 *DAIEIFFRIPNFRC 885
            DAI+IF+RIP FRC
Sbjct: 122 EDAIDIFWRIPKFRC 136



 Score = 71.2 bits (173), Expect(2) = 8e-41
 Identities = 41/65 (63%), Positives = 48/65 (73%)
 Frame = +2

Query: 938  ECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDSK* 1117
            E L+ V +VL+KS + MNIRLEEA FRILIT LCKI K  YAIEIL+ M   GYD D++ 
Sbjct: 155  EGLRFVPEVLIKSRD-MNIRLEEASFRILITALCKIGKVGYAIEILDCMISDGYDIDARI 213

Query: 1118 YSLIL 1132
             SLIL
Sbjct: 214  CSLIL 218


>ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phaseolus vulgaris]
           gi|561013301|gb|ESW12162.1| hypothetical protein
           PHAVU_008G089500g [Phaseolus vulgaris]
          Length = 514

 Score =  127 bits (318), Expect(2) = 1e-39
 Identities = 60/136 (44%), Positives = 81/136 (59%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +KWP  PYK  W   F +Q AM  L++A                +     LLS L+++F 
Sbjct: 18  RKWPHSPYKTSWHHNFGEQQAMHKLKQATLEMGCPQTP------NLPHPFLLSTLLDAFK 71

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
            Y+CDPTP AY F+IK LT  S  Q +PP+LD LE++E F  PE I V LIR YG ++++
Sbjct: 72  AYSCDPTPKAYYFVIKTLTSTSHLQDIPPVLDHLEQLETFETPEFILVYLIRFYGLSDRV 131

Query: 841 *DAIEIFFRIPNFRCT 888
            DA+++F RIP FRCT
Sbjct: 132 QDAVDLFLRIPRFRCT 147



 Score = 64.3 bits (155), Expect(2) = 1e-39
 Identities = 34/69 (49%), Positives = 48/69 (69%)
 Frame = +2

Query: 932  EKECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDS 1111
            ++ECL+MV ++LLKS   MNIR+EE+ F++LI  LC+I +  YAI++LN M   GY  D 
Sbjct: 163  KRECLKMVPEILLKSQH-MNIRVEESTFQVLIEALCRIKRVGYAIKMLNYMIEGGYGLDE 221

Query: 1112 K*YSLILLS 1138
               SLI+ S
Sbjct: 222  TICSLIISS 230


>ref|XP_003623530.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355498545|gb|AES79748.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 653

 Score =  115 bits (289), Expect(3) = 1e-37
 Identities = 60/138 (43%), Positives = 81/138 (58%), Gaps = 2/138 (1%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +KWP  PYK  W   F +Q A++ L  A T              ++ +  LLS LI+SF 
Sbjct: 18  RKWPHSPYKTSWHHNFGEQQAIQILINAKTQTQ-----------NNNDPFLLSTLIHSFK 66

Query: 661 IYACDPTPSAYSFIIKILTQ--NSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFAN 834
            Y  DP+P AY F+IK +T    S   ++P IL+ LE  EKF  PE IF+ LIR YGF +
Sbjct: 67  AYHTDPSPKAYFFLIKTITNINTSHLHEIPHILNHLEHNEKFETPEFIFMYLIRFYGFND 126

Query: 835 KI*DAIEIFFRIPNFRCT 888
           ++ DA+++FFRIP FRCT
Sbjct: 127 RVQDAVDLFFRIPRFRCT 144



 Score = 63.5 bits (153), Expect(3) = 1e-37
 Identities = 34/69 (49%), Positives = 47/69 (68%)
 Frame = +2

Query: 932  EKECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDS 1111
            ++ECL+MV  +LLKS + M IRLEE+ F +LI  LC+I +  YAI+++N M   GY  D 
Sbjct: 160  KRECLRMVPDILLKSRD-MKIRLEESSFWVLIKALCRIKRVDYAIKMMNCMVEDGYCLDD 218

Query: 1112 K*YSLILLS 1138
            K  SLI+ S
Sbjct: 219  KICSLIISS 227



 Score = 25.8 bits (55), Expect(3) = 1e-37
 Identities = 10/27 (37%), Positives = 18/27 (66%)
 Frame = +3

Query: 870 PQLQVYPSVSSLHAILSVLCKKKNVFK 950
           P+ +  P+V SL+ +LS+LC K+   +
Sbjct: 139 PRFRCTPTVCSLNLLLSLLCGKRECLR 165


>ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Cucumis sativus]
           gi|449483740|ref|XP_004156675.1| PREDICTED:
           pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Cucumis sativus]
          Length = 491

 Score =  126 bits (317), Expect(2) = 8e-37
 Identities = 64/143 (44%), Positives = 89/143 (62%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +KWPL  +K KW QTF+Q  A+  L++AA                 +   LLS L+ SF+
Sbjct: 19  RKWPLSSHKTKWHQTFDQDEALRILKQAANP--------------DQPHLLLSALVTSFT 64

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
            Y+C PTP+AY F++K L + SQF  +PP+L RL+ +E F  PE+IFV+LI++YG  N+I
Sbjct: 65  AYSCHPTPNAYYFVLKTLARTSQFHHIPPVLHRLQFLENFQTPEYIFVDLIKLYGRMNRI 124

Query: 841 *DAIEIFFRIPNFRCTLLCPPST 909
            DA+ +F RIP FRC     PST
Sbjct: 125 QDAVTLFRRIPMFRCV----PST 143



 Score = 55.5 bits (132), Expect(2) = 8e-37
 Identities = 30/65 (46%), Positives = 43/65 (66%)
 Frame = +2

Query: 944  LQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDSK*YS 1123
            L ++  ++L SH +M IRLE + F+ILIT LCK+NK  +A+E+ N M   GY  + +  S
Sbjct: 160  LPIIPDIILNSH-SMGIRLEHSTFQILITALCKVNKVGHAMELFNYMITEGYGLNPQICS 218

Query: 1124 LILLS 1138
            LIL S
Sbjct: 219  LILAS 223


>ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
           cacao] gi|508715129|gb|EOY07026.1| Pentatricopeptide
           repeat superfamily protein, putative [Theobroma cacao]
          Length = 542

 Score =  120 bits (301), Expect(2) = 1e-35
 Identities = 60/135 (44%), Positives = 78/135 (57%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           ++WP   YK KW QTF Q+ AM + ++                       LLS L+ SFS
Sbjct: 61  RRWPHFAYKTKWNQTFTQKQAMLSFKQLVAVAQDNLPPPI----------LLSTLVRSFS 110

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
           +Y   PTP AY F+IK L QN  F  +P +L  LE +EKF  PE+IF +LI  YG AN+I
Sbjct: 111 LYNVHPTPQAYHFLIKTLIQNLHFNHIPSVLHHLEHVEKFQTPEYIFADLITTYGIANRI 170

Query: 841 *DAIEIFFRIPNFRC 885
            DA++IF+RIP FRC
Sbjct: 171 QDAVDIFYRIPKFRC 185



 Score = 57.4 bits (137), Expect(2) = 1e-35
 Identities = 40/88 (45%), Positives = 52/88 (59%), Gaps = 5/88 (5%)
 Frame = +2

Query: 890  FCVLPPRNSIRSL*EKEC-----LQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKF 1054
            F  +P   S+ SL    C     L++V QVLLKS   MNIR+EE+  RIL++ LC++NK 
Sbjct: 183  FRCVPSAYSLNSLLALLCRNQYSLKLVPQVLLKSL-LMNIRVEESTLRILVSALCRMNKV 241

Query: 1055 SYAIEILNLMPHYGYDPDSK*YSLILLS 1138
            SYAI+IL  M   G   + K  S IL S
Sbjct: 242  SYAIDILQRMIDEGLGVNDKVCSFILSS 269


>ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Solanum lycopersicum]
          Length = 496

 Score =  115 bits (287), Expect(3) = 4e-33
 Identities = 62/138 (44%), Positives = 85/138 (61%), Gaps = 2/138 (1%)
 Frame = +1

Query: 478 RKKWPLLPYKGKWQQT-FNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINS 654
           R+KWPL  YK KWQ+     QL+M+ L ++  +              S + HLLS+L++S
Sbjct: 35  RRKWPLSLYKTKWQEEKLTHQLSMQKLVESTPNR-------------SPKTHLLSILLDS 81

Query: 655 FSIYACDPTPSAYSFIIKILTQN-SQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFA 831
           FS Y CDPTP+AY FI+K LTQN S + ++P ILD + K E F  PE+IF  LI+ YG +
Sbjct: 82  FSAYECDPTPNAYYFILKTLTQNPSTWDEIPLILDYIRKFENFETPEYIFTYLIKFYGDS 141

Query: 832 NKI*DAIEIFFRIPNFRC 885
           N    A E+FF +P +RC
Sbjct: 142 NMTHLAYEMFFTMPAYRC 159



 Score = 51.2 bits (121), Expect(3) = 4e-33
 Identities = 29/63 (46%), Positives = 45/63 (71%)
 Frame = +2

Query: 944  LQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDSK*YS 1123
            L++V QVL+KS + +NI +EE+ F+ILI  LC+I K + A+++L LM   G++ D+   S
Sbjct: 180  LRIVLQVLVKS-QLLNIWVEESTFKILIRALCRIGKTNNAVDLLKLMVDSGFNLDANICS 238

Query: 1124 LIL 1132
            LIL
Sbjct: 239  LIL 241



 Score = 23.9 bits (50), Expect(3) = 4e-33
 Identities = 10/21 (47%), Positives = 14/21 (66%)
 Frame = +3

Query: 870 PQLQVYPSVSSLHAILSVLCK 932
           P  +  PSV SL+ ++ VLCK
Sbjct: 155 PAYRCNPSVKSLNCLIWVLCK 175


>gb|AHB18409.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 480

 Score = 94.7 bits (234), Expect(3) = 3e-30
 Identities = 49/136 (36%), Positives = 76/136 (55%), Gaps = 1/136 (0%)
 Frame = +1

Query: 481 KKWPLLP-YKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSF 657
           +KWPL+  +K KW+Q F Q   M + ++                 +  +   +  L+ S 
Sbjct: 14  RKWPLISSHKTKWRQAFTQNQPMVSFKQLVARH------------NPLQPDFVPSLLQSL 61

Query: 658 SIYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANK 837
           S+Y    +P AY F+IK L  N QF  +P +L  L+ ++ F  PE+IF +L++ YG AN+
Sbjct: 62  SLYNLHQSPQAYHFLIKTLLHNRQFHHIPSLLHHLQ-LQHFQTPEYIFTHLVKFYGKANR 120

Query: 838 I*DAIEIFFRIPNFRC 885
           I DA++IF+RIP FRC
Sbjct: 121 IQDAVDIFYRIPQFRC 136



 Score = 56.6 bits (135), Expect(3) = 3e-30
 Identities = 32/65 (49%), Positives = 44/65 (67%)
 Frame = +2

Query: 944  LQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDSK*YS 1123
            L+++ QVLL S   MNIRLEE+ FR+L+  LC++NK +YAIEIL  M   G   + K +S
Sbjct: 157  LKLLPQVLLNSLH-MNIRLEESTFRLLVCTLCRMNKVAYAIEILQRMLDDGLGVNDKVFS 215

Query: 1124 LILLS 1138
             +L S
Sbjct: 216  FVLSS 220



 Score = 28.9 bits (63), Expect(3) = 3e-30
 Identities = 11/27 (40%), Positives = 19/27 (70%)
 Frame = +3

Query: 870 PQLQVYPSVSSLHAILSVLCKKKNVFK 950
           PQ + +PS  SL+A+L++LC+ +   K
Sbjct: 132 PQFRCFPSAYSLNALLALLCRSQRGLK 158


>ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Capsella rubella]
           gi|482562854|gb|EOA27044.1| hypothetical protein
           CARUB_v10023139mg [Capsella rubella]
          Length = 470

 Score =  111 bits (277), Expect(2) = 1e-28
 Identities = 58/155 (37%), Positives = 87/155 (56%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +K P  P+K KW +   Q+ AME L+ +  +              S++  ++  L++SF 
Sbjct: 35  RKIPHSPFKTKWNENLKQKYAMEELRSSPVA-------------DSEDGGVIRTLVSSFR 81

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
           ++ C+PTP AY F+IK L + SQ + +  +L  LE  EKF+ PE IF ++I  YGFA +I
Sbjct: 82  LHNCEPTPQAYRFVIKTLAKTSQLENIASVLSHLEVSEKFDTPESIFRDVIAAYGFAGRI 141

Query: 841 *DAIEIFFRIPNFRCTLLCPPSTQFYPFSVRKRMS 945
            +AI++FF+IPNFRC              VRKR S
Sbjct: 142 GEAIDVFFKIPNFRCVPSAYTLNALLLVLVRKRES 176



 Score = 43.1 bits (100), Expect(2) = 1e-28
 Identities = 26/69 (37%), Positives = 41/69 (59%)
 Frame = +2

Query: 932  EKECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDS 1111
            ++E L++V ++L+K+   M +RLEE+ F ILI  LCKI +   A E++  M       D 
Sbjct: 173  KRESLELVPEILVKASR-MGVRLEESTFGILIDALCKIGEVDCATELVRYMSIDCVIVDP 231

Query: 1112 K*YSLILLS 1138
            + YS +L S
Sbjct: 232  RLYSQLLSS 240


>ref|XP_002879744.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297325583|gb|EFH56003.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 444

 Score =  107 bits (267), Expect(2) = 3e-28
 Identities = 56/155 (36%), Positives = 85/155 (54%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +K P   +K KW +   Q+ AME L+    +              S+   ++  L++SF 
Sbjct: 9   RKIPQSSFKTKWNENLKQKYAMEELRSNLLA-------------DSENGSVMRTLVSSFQ 55

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
           ++ C+PTP AY F+I+ L + SQ + +  +LD LE  EKF+ PE IF ++I  YGF+ +I
Sbjct: 56  LHNCEPTPQAYRFVIETLAKTSQLENIASVLDHLEVSEKFDTPESIFRDVIAAYGFSGRI 115

Query: 841 *DAIEIFFRIPNFRCTLLCPPSTQFYPFSVRKRMS 945
            +AI++FF+IPNFRC              VRKR S
Sbjct: 116 EEAIDVFFKIPNFRCVPSAYTLNALLLVLVRKRQS 150



 Score = 45.8 bits (107), Expect(2) = 3e-28
 Identities = 25/69 (36%), Positives = 42/69 (60%)
 Frame = +2

Query: 932  EKECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDS 1111
            +++ L++V ++L+K+   M +RLEE+ F ILI  LC+I +   A E++  M       D 
Sbjct: 147  KRQSLELVPEILVKASR-MGVRLEESTFGILINALCRIGEVDCATELVRYMSEDSVIVDP 205

Query: 1112 K*YSLILLS 1138
            + YSL+L S
Sbjct: 206  RLYSLLLSS 214


>ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum]
           gi|557112223|gb|ESQ52507.1| hypothetical protein
           EUTSA_v10017948mg [Eutrema salsugineum]
          Length = 456

 Score =  105 bits (263), Expect(2) = 6e-27
 Identities = 56/153 (36%), Positives = 85/153 (55%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +K P   +K KW +   Q+ AME L+    +             S++   +L  LI+SF 
Sbjct: 18  RKIPHSSFKTKWNENLKQKYAMEELRSGLIADSG----------SNENDGVLRTLISSFR 67

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
           ++ C+PTP AY F+IK L + SQ + +  +L+ +E  EKF+ PE IF ++I  YGF+ +I
Sbjct: 68  LHNCEPTPQAYKFVIKTLAKTSQLENIASVLNHIEISEKFDTPESIFRDVIFAYGFSGRI 127

Query: 841 *DAIEIFFRIPNFRCTLLCPPSTQFYPFSVRKR 939
            +AI++FF+IPNFRC              VRKR
Sbjct: 128 EEAIDVFFKIPNFRCVPSAYTLNALLSVLVRKR 160



 Score = 43.1 bits (100), Expect(2) = 6e-27
 Identities = 26/69 (37%), Positives = 43/69 (62%)
 Frame = +2

Query: 932  EKECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDS 1111
            +++ L+MV +VLLK+ + + +RLEE+   ILI  LC+I +   A +++  M    Y  D 
Sbjct: 159  KRQGLKMVPEVLLKASK-LGVRLEESTLGILIDALCRIGEVDCATDLVKDMSDDCYIVDP 217

Query: 1112 K*YSLILLS 1138
            + YSL+L S
Sbjct: 218  RLYSLLLSS 226


>ref|NP_181376.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|218546769|sp|Q8L6Y7.2|PP193_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At2g38420, mitochondrial; Flags: Precursor
           gi|3395430|gb|AAC28762.1| hypothetical protein
           [Arabidopsis thaliana] gi|330254441|gb|AEC09535.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 453

 Score =  106 bits (265), Expect(2) = 6e-27
 Identities = 57/155 (36%), Positives = 85/155 (54%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +K P   +K KW +   Q+ AME L+    +              S+   ++  L++SF 
Sbjct: 18  RKIPHSSFKTKWNENLKQKYAMEELRSNLLT-------------DSENASVMRTLLSSFQ 64

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
           ++ C+PTP AY F+IK L ++SQ + +  +L  LE  EKF+ PE IF ++I  YGF+ +I
Sbjct: 65  LHNCEPTPQAYRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGRI 124

Query: 841 *DAIEIFFRIPNFRCTLLCPPSTQFYPFSVRKRMS 945
            +AIE+FF+IPNFRC              VRKR S
Sbjct: 125 EEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQS 159



 Score = 42.4 bits (98), Expect(2) = 6e-27
 Identities = 24/69 (34%), Positives = 41/69 (59%)
 Frame = +2

Query: 932  EKECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDS 1111
            +++ L++V ++L+K+   M +RLEE+ F ILI  LC+I +   A E++  M       D 
Sbjct: 156  KRQSLELVPEILVKACR-MGVRLEESTFGILIDALCRIGEVDCATELVRYMSQDSVIVDP 214

Query: 1112 K*YSLILLS 1138
            + YS +L S
Sbjct: 215  RLYSRLLSS 223


>gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|31376375|gb|AAP49514.1|
           At2g38420 [Arabidopsis thaliana]
          Length = 444

 Score =  106 bits (265), Expect(2) = 6e-27
 Identities = 57/155 (36%), Positives = 85/155 (54%)
 Frame = +1

Query: 481 KKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSFS 660
           +K P   +K KW +   Q+ AME L+    +              S+   ++  L++SF 
Sbjct: 9   RKIPHSSFKTKWNENLKQKYAMEELRSNLLT-------------DSENASVMRTLLSSFQ 55

Query: 661 IYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI 840
           ++ C+PTP AY F+IK L ++SQ + +  +L  LE  EKF+ PE IF ++I  YGF+ +I
Sbjct: 56  LHNCEPTPQAYRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGRI 115

Query: 841 *DAIEIFFRIPNFRCTLLCPPSTQFYPFSVRKRMS 945
            +AIE+FF+IPNFRC              VRKR S
Sbjct: 116 EEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQS 150



 Score = 42.4 bits (98), Expect(2) = 6e-27
 Identities = 24/69 (34%), Positives = 41/69 (59%)
 Frame = +2

Query: 932  EKECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDS 1111
            +++ L++V ++L+K+   M +RLEE+ F ILI  LC+I +   A E++  M       D 
Sbjct: 147  KRQSLELVPEILVKACR-MGVRLEESTFGILIDALCRIGEVDCATELVRYMSQDSVIVDP 205

Query: 1112 K*YSLILLS 1138
            + YS +L S
Sbjct: 206  RLYSRLLSS 214


>emb|CAN63706.1| hypothetical protein VITISV_013107 [Vitis vinifera]
          Length = 390

 Score =  108 bits (271), Expect = 3e-21
 Identities = 54/114 (47%), Positives = 69/114 (60%)
 Frame = +1

Query: 478 RKKWPLLPYKGKWQQTFNQQLAMEALQKAATSXXXXXXXXXXXXXSSKERHLLSVLINSF 657
           R+KWPL PYK  W +TF+ + AM+ L+    +             S      LS+LI+SF
Sbjct: 18  RRKWPLSPYKATWHETFHHRQAMQTLKNTIANQSPSPQ-------SPSNSQFLSILIDSF 70

Query: 658 SIYACDPTPSAYSFIIKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRI 819
            IY  DPTP+AY F+I  LT+  QF  LPP+L RLEK+EKF  PE IF NLI+I
Sbjct: 71  RIYNSDPTPNAYRFVISTLTRCRQFHHLPPLLHRLEKVEKFETPEFIFTNLIKI 124



 Score = 64.3 bits (155), Expect = 9e-08
 Identities = 35/60 (58%), Positives = 45/60 (75%)
 Frame = +2

Query: 959  QVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDSK*YSLILLS 1138
            ++LLKS +AMNIRLEE+ FRIL+  LC+I K +YAI ILN M + GY  D+K  S+IL S
Sbjct: 123  KILLKS-QAMNIRLEESSFRILVAALCRIKKHNYAIRILNYMLNDGYAVDAKMCSIILSS 181


>ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [Amborella trichopoda]
            gi|548857785|gb|ERN15583.1| hypothetical protein
            AMTR_s00048p00149840 [Amborella trichopoda]
          Length = 464

 Score = 58.5 bits (140), Expect(3) = 6e-16
 Identities = 32/67 (47%), Positives = 45/67 (67%)
 Frame = +2

Query: 932  EKECLQMVHQVLLKSHEAMNIRLEEAGFRILITVLCKINKFSYAIEILNLMPHYGYDPDS 1111
            + +   +V ++L+K+ E MNIRL+ + FRILI  LC+I K  +AIE+L LMP  G  PDS
Sbjct: 164  DTDSFHLVPELLIKTLE-MNIRLDASSFRILIGSLCRIGKLGFAIELLRLMPDQGCWPDS 222

Query: 1112 K*YSLIL 1132
              Y+ IL
Sbjct: 223  GFYAEIL 229



 Score = 44.7 bits (104), Expect(3) = 6e-16
 Identities = 22/61 (36%), Positives = 35/61 (57%)
 Frame = +1

Query: 703 IKILTQNSQFQQLPPILDRLEKIEKFNPPEHIFVNLIRIYGFANKI*DAIEIFFRIPNFR 882
           I IL QN QF  L  +L  L+   KF+ PE   + LI+    +  + +A+++FF +P+ R
Sbjct: 88  IVILAQNPQFSGLKTLLRCLQSNRKFSTPETRIIGLIQSCASSKMVKEALDLFFAMPHLR 147

Query: 883 C 885
           C
Sbjct: 148 C 148



 Score = 28.5 bits (62), Expect(3) = 6e-16
 Identities = 12/20 (60%), Positives = 16/20 (80%)
 Frame = +3

Query: 870 PQLQVYPSVSSLHAILSVLC 929
           P L+  PS +SL+A+LSVLC
Sbjct: 144 PHLRCQPSTTSLNALLSVLC 163


Top