BLASTX nr result

ID: Catharanthus22_contig00002774 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00002774
         (476 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006355831.1| PREDICTED: pentatricopeptide repeat-containi...   197   1e-48
ref|XP_004240647.1| PREDICTED: pentatricopeptide repeat-containi...   195   6e-48
gb|EPS64899.1| hypothetical protein M569_09880, partial [Genlise...   174   8e-42
emb|CBI36977.3| unnamed protein product [Vitis vinifera]              155   4e-36
gb|ESW35233.1| hypothetical protein PHAVU_001G217700g [Phaseolus...   151   1e-34
ref|XP_006604760.1| PREDICTED: pentatricopeptide repeat-containi...   142   5e-32
ref|XP_006604759.1| PREDICTED: pentatricopeptide repeat-containi...   142   5e-32
ref|XP_004494443.1| PREDICTED: pentatricopeptide repeat-containi...   142   5e-32
ref|XP_003626000.1| Pentatricopeptide repeat protein [Medicago t...   130   2e-28
ref|XP_004289549.1| PREDICTED: pentatricopeptide repeat-containi...   129   4e-28
gb|EXB55980.1| hypothetical protein L484_018766 [Morus notabilis]     127   2e-27
ref|XP_006394465.1| hypothetical protein EUTSA_v10005291mg [Eutr...   119   3e-25
gb|EMJ02790.1| hypothetical protein PRUPE_ppa020837mg [Prunus pe...   110   1e-22
ref|XP_002528283.1| pentatricopeptide repeat-containing protein,...   109   4e-22
ref|XP_006296137.1| hypothetical protein CARUB_v10025289mg [Caps...   105   6e-21
ref|XP_002880355.1| pentatricopeptide repeat-containing protein ...   104   1e-20
gb|EOX97730.1| Pentatricopeptide repeat (PPR-like) superfamily p...   103   3e-20
ref|NP_179705.1| pentatricopeptide repeat-containing protein [Ar...   102   4e-20
ref|XP_006479936.1| PREDICTED: pentatricopeptide repeat-containi...   102   5e-20
ref|XP_006479937.1| PREDICTED: pentatricopeptide repeat-containi...   101   9e-20

>ref|XP_006355831.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g02750-like [Solanum tuberosum]
          Length = 529

 Score =  197 bits (501), Expect = 1e-48
 Identities = 96/158 (60%), Positives = 120/158 (75%), Gaps = 1/158 (0%)
 Frame = +2

Query: 5   RHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGD-RPNDLTFA 181
           +HN +QQAE LFD+MP RDV+SWNTM SG+R AN  +K ++ FL M+R GD RPN+LTFA
Sbjct: 62  KHNLIQQAEYLFDKMPHRDVVSWNTMLSGYRNANNPEKVYRCFLDMNRCGDMRPNELTFA 121

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
           + ISSFL   +  L PQLHG V+C G++LNV +GSALMRGY+DL + +   +VFDEIL K
Sbjct: 122 VSISSFLHLYYKHLIPQLHGIVLCLGISLNVFVGSALMRGYVDLDDYRGLARVFDEILDK 181

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
           DV P NVLILGYM+FG  +EAQ  F  MP +N F+WST
Sbjct: 182 DVTPWNVLILGYMKFGCTSEAQRAFDMMPMRNYFTWST 219



 Score = 75.1 bits (183), Expect = 9e-12
 Identities = 42/148 (28%), Positives = 76/148 (51%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           I +  L +A  +FD+M ++DV+SW  M  G+ +  +  K+ + F  M   G RPN  TF+
Sbjct: 225 IENKKLNEARFVFDKMSEKDVVSWTAMIRGYVQYGEFMKALKLFKLMLNSGSRPNHFTFS 284

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++ +       ++  Q+H  ++ SG  L+VVL ++L+  Y    + +    +F+ I  +
Sbjct: 285 TVLDACAGYSAVLVGNQVHACILKSGFPLDVVLLTSLLDMYAKCGDIEVAFCIFESIPAR 344

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCM 445
           ++   N +I GY   GL   A + F  M
Sbjct: 345 NLVAWNSIIGGYARHGLAERAMQEFERM 372


>ref|XP_004240647.1| PREDICTED: pentatricopeptide repeat-containing protein At1g56690,
           mitochondrial-like [Solanum lycopersicum]
          Length = 630

 Score =  195 bits (495), Expect = 6e-48
 Identities = 95/158 (60%), Positives = 119/158 (75%), Gaps = 1/158 (0%)
 Frame = +2

Query: 5   RHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGD-RPNDLTFA 181
           +HN +Q+AE LFD+MP RDV+SWNTM SG+R AN   K ++ FL M+R G+ RPN+LTFA
Sbjct: 62  KHNLIQKAEYLFDKMPHRDVVSWNTMLSGYRNANNPGKVYRCFLDMNRCGEMRPNELTFA 121

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
           + ISSFL   +  L PQLHG V+CSG++LNV +GSALMRGY+DL + +   +VFDEIL K
Sbjct: 122 VSISSFLHLHYKHLIPQLHGLVLCSGISLNVFVGSALMRGYVDLDDYRGLVRVFDEILDK 181

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
           DV P NVLILGYM FG   EAQ  F  MP +N+F+WST
Sbjct: 182 DVTPWNVLILGYMRFGCTIEAQRAFDMMPMRNSFTWST 219



 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 42/148 (28%), Positives = 75/148 (50%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           I +  L +A  +FD M ++DV+SW  M  G+ +  +  K+ + F  M   G RPN  TF+
Sbjct: 225 IENKKLNEARFVFDEMSEKDVVSWTAMIRGYVQYGEFMKALKLFKLMLNSGSRPNHFTFS 284

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++ +       ++  Q+H  ++ SG  L+VVL ++L+  Y    + +    +F+ I  +
Sbjct: 285 TVLDACAGYSAVLVGNQVHVCILKSGFPLDVVLLTSLLDMYAKCGDIEVAFCIFESIPAR 344

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCM 445
           ++   N +I GY   GL   A + F  M
Sbjct: 345 NLVAWNSIIGGYARHGLAERAMQVFERM 372


>gb|EPS64899.1| hypothetical protein M569_09880, partial [Genlisea aurea]
          Length = 455

 Score =  174 bits (442), Expect = 8e-42
 Identities = 83/158 (52%), Positives = 114/158 (72%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           ++ N + +A+ LFD MP RDV+SWNTM SGFR      KSH+ FL M R G  P++LTF+
Sbjct: 37  VKRNEISKAQKLFDEMPLRDVVSWNTMLSGFRDIRSPDKSHRCFLTMMRYGPTPDELTFS 96

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
           ILI +FL ++  +L PQLH  VI  G++LN+ LGSALMR Y+ L   +++ +VFDEI +K
Sbjct: 97  ILIGTFLSSQHKILIPQLHAIVIRLGISLNIYLGSALMRAYVALGHSESYIRVFDEIPVK 156

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
           DVAP NVLILG++EFG +++A++ F  MP +N  SWST
Sbjct: 157 DVAPYNVLILGHIEFGSISQAKKVFDEMPVRNPHSWST 194



 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 40/148 (27%), Positives = 75/148 (50%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           +++  +++A + FDR+  RDV+S   M  GF    +  ++ + F  M   G RPN  T +
Sbjct: 200 MKNGMIKEAVEEFDRIEVRDVVSTTAMIRGFADVRRFAEALKIFRSMMNDGIRPNRFTLS 259

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++++       V+  Q+HG +   G+  +VV  +AL+  Y    +  +  KVF+ +  +
Sbjct: 260 TVLNACAGDSSLVMGIQVHGVMSKLGIPADVVTETALVDMYGKCGDMDSASKVFESMPDR 319

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCM 445
           ++A  N +I GY   G  + A + F  M
Sbjct: 320 NLASWNSMIGGYARNGSTHIAFQVFDEM 347


>emb|CBI36977.3| unnamed protein product [Vitis vinifera]
          Length = 372

 Score =  155 bits (393), Expect = 4e-36
 Identities = 75/158 (47%), Positives = 107/158 (67%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           I+++    A++LFD MP RD++SWNT  SG +K    +  +  FL+M R+G +PN+ T +
Sbjct: 63  IQNDEPHNAQNLFDEMPDRDIVSWNTALSGLKKIKNPEGVYLCFLKMRRVGLKPNEFTLS 122

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
           I+IS+ L+T F+VLTPQ+H  V+  G N +V +GSALMRGY  + +R    +VFDEI IK
Sbjct: 123 IMISAILDTVFNVLTPQIHAVVVSLGFNSSVFVGSALMRGYAHVGDRVALGRVFDEISIK 182

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
           +VA  N L+LGYM+ G  +EAQ  F  MP +N  SW+T
Sbjct: 183 NVASWNALVLGYMDLGDTDEAQRVFGLMPERNVVSWTT 220



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 30/93 (32%), Positives = 52/93 (55%)
 Frame = +2

Query: 26  AEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFAILISSFLE 205
           A  +FDRM +R+V+SW  M SG+ +  K   + + FL M R G + N  T + ++ +   
Sbjct: 234 ARSVFDRMTERNVVSWTVMISGYVQNGKFVDALRLFLLMLRSGTQGNHFTLSSVLEACAG 293

Query: 206 TKFSVLTPQLHGQVICSGLNLNVVLGSALMRGY 304
               +L  Q+H  V+ SG+  +V+L ++L+  Y
Sbjct: 294 CSSLLLGKQVHLNVLKSGIPDDVILSTSLVDMY 326


>gb|ESW35233.1| hypothetical protein PHAVU_001G217700g [Phaseolus vulgaris]
          Length = 561

 Score =  151 bits (381), Expect = 1e-34
 Identities = 69/158 (43%), Positives = 104/158 (65%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           ++H  ++ A +LFD+MP +D +SWN M SGF +    +  +++FLQM R G  P+D T +
Sbjct: 99  VKHYQIEHAHNLFDQMPLKDTVSWNIMLSGFSRITDSEGLYRHFLQMGRSGVAPDDYTIS 158

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            L+ + + TK  VL PQ+H   +   LNL+V +GS+L+R Y +LRE + F +VFD+IL+K
Sbjct: 159 TLLRTVISTKLDVLVPQVHALALHLALNLSVFVGSSLIRAYANLREEEAFKRVFDDILVK 218

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
           DV   N L+ GYME G +++AQ  F  MP +N  SW+T
Sbjct: 219 DVTSWNALVSGYMEVGRMDDAQTNFDAMPQRNIISWTT 256



 Score = 76.3 bits (186), Expect = 4e-12
 Identities = 46/148 (31%), Positives = 79/148 (53%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           IR+  + +A  +FD+M +R+V+SW  M SG+ +  +   + + FL M + G RPN  TF+
Sbjct: 262 IRNKKINKARSVFDKMSERNVVSWTVMISGYVQNKRFMDALKLFLLMFKSGTRPNHFTFS 321

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++ +       V+  Q+H  VI SG+  +V+  ++L+  Y    +      VF+ IL K
Sbjct: 322 SVLDACAGCSSLVVGMQVHLCVIKSGIPDDVISLTSLVDMYAKCGDTDAAFLVFESILKK 381

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCM 445
           ++   N +I G+   GL N A + F  M
Sbjct: 382 NLVSWNSIIGGFARNGLGNRALKEFDRM 409


>ref|XP_006604760.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g02750-like isoform X2 [Glycine max]
          Length = 299

 Score =  142 bits (358), Expect = 5e-32
 Identities = 70/159 (44%), Positives = 102/159 (64%), Gaps = 1/159 (0%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           ++H+ +Q A+ LFD+MP +D +SWN M SGF +       ++ FLQM R G  P+D T +
Sbjct: 67  VKHHQIQYAQYLFDQMPFKDTVSWNIMLSGFHRITNSDGLYRCFLQMGRAGVPPDDYTVS 126

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            L+ + + T+  VL PQLH + +   LNL+V +GS+L+R Y  LR+ + F + FD+IL K
Sbjct: 127 TLLRAVISTELDVLIPQLHARALHLALNLSVFVGSSLIRAYASLRDEEAFKQAFDDILGK 186

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFS-CMPGKNAFSWST 475
           DV   N L+ GYME G +++AQ TF   MP KN  SW+T
Sbjct: 187 DVTSWNALVSGYMEVGSMDDAQTTFDMMMPEKNIISWTT 225


>ref|XP_006604759.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g02750-like isoform X1 [Glycine max]
          Length = 531

 Score =  142 bits (358), Expect = 5e-32
 Identities = 70/159 (44%), Positives = 102/159 (64%), Gaps = 1/159 (0%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           ++H+ +Q A+ LFD+MP +D +SWN M SGF +       ++ FLQM R G  P+D T +
Sbjct: 67  VKHHQIQYAQYLFDQMPFKDTVSWNIMLSGFHRITNSDGLYRCFLQMGRAGVPPDDYTVS 126

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            L+ + + T+  VL PQLH + +   LNL+V +GS+L+R Y  LR+ + F + FD+IL K
Sbjct: 127 TLLRAVISTELDVLIPQLHARALHLALNLSVFVGSSLIRAYASLRDEEAFKQAFDDILGK 186

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFS-CMPGKNAFSWST 475
           DV   N L+ GYME G +++AQ TF   MP KN  SW+T
Sbjct: 187 DVTSWNALVSGYMEVGSMDDAQTTFDMMMPEKNIISWTT 225



 Score = 65.5 bits (158), Expect = 7e-09
 Identities = 41/148 (27%), Positives = 73/148 (49%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           IR+  + +A  +F++M +R+V+SW  M SG+ +  +   +   FL M   G  PN  TF+
Sbjct: 231 IRNKRINKARSVFNKMSERNVVSWTAMISGYVQNKRFMDALNLFLLMFNSGTCPNHFTFS 290

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++ +       +   Q+H  VI SG+  +V+  ++L+  Y    +     +VF+ I  K
Sbjct: 291 SVLDACAGCSSLLTGMQVHLCVIKSGIPEDVISLTSLVDMYAKCGDMDAAFRVFESIPNK 350

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCM 445
           ++   N +I G    G+   A E F  M
Sbjct: 351 NLVSWNSIIGGCARNGIATRALEEFDRM 378


>ref|XP_004494443.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g02750-like [Cicer arietinum]
          Length = 527

 Score =  142 bits (358), Expect = 5e-32
 Identities = 69/158 (43%), Positives = 97/158 (61%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           ++H+ +  A  LFD+MP +D +SWN M SGF K    +  +Q FLQM R G  PND T +
Sbjct: 65  VQHHQIGLAHQLFDKMPLKDAVSWNIMLSGFMKTRNTEGLYQCFLQMGRAGVVPNDYTIS 124

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            L+ + + T+  VL  Q+H      GLNLNV +GS+L+R Y  LRE +   + FD+I +K
Sbjct: 125 KLLRAVICTELEVLVYQVHAMASHLGLNLNVFVGSSLIRAYAALREEEALSRAFDDISMK 184

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
           DV   N L+ GYME G++ +AQ  F  MP +N  SW+T
Sbjct: 185 DVTSWNALVSGYMELGMMVDAQTAFDLMPQRNVISWTT 222



 Score = 76.3 bits (186), Expect = 4e-12
 Identities = 42/148 (28%), Positives = 78/148 (52%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           +++N + +A  +FD+M +R+V+SW  M SG+ +  +   + + FL M   G RPN  TF+
Sbjct: 228 VKNNRVNKARSVFDKMSERNVVSWTVMISGYVQNKRFMAALKLFLLMFESGTRPNHFTFS 287

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++ +       ++  Q+H  ++ SG+  +V+  ++L+  Y    +      VF+ I+ K
Sbjct: 288 SVLDACAGCSSLLMGLQVHLCIVKSGIPNDVIWLTSLVDMYAKCGDMDAAFCVFESIMDK 347

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCM 445
           ++   N +I GY   GL   A E F  M
Sbjct: 348 NLVSWNAIIGGYASHGLAARALEEFDRM 375


>ref|XP_003626000.1| Pentatricopeptide repeat protein [Medicago truncatula]
           gi|355501015|gb|AES82218.1| Pentatricopeptide repeat
           protein [Medicago truncatula]
          Length = 607

 Score =  130 bits (327), Expect = 2e-28
 Identities = 63/158 (39%), Positives = 93/158 (58%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           ++HN +    DLFD+MP +D +SWN M SGF++    +  ++ FLQM R G  PND T +
Sbjct: 51  LQHNQIGPVHDLFDKMPLKDAVSWNIMLSGFQRTRNSEGLYRCFLQMGRAGVVPNDYTIS 110

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            L+ + + T+  VL  Q+H      G  LNV +GS+L+R Y  L+E +   + F++I +K
Sbjct: 111 TLLRAVISTELDVLVRQVHALAFHLGHYLNVFVGSSLIRAYAGLKEEEALGRAFNDISMK 170

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
           DV   N L+  YME G   +AQ  F  MP +N  SW+T
Sbjct: 171 DVTSWNALVSSYMELGKFVDAQTAFDQMPQRNIISWTT 208



 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 41/148 (27%), Positives = 76/148 (51%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           +++  + +A  +FD M +R+V+SW  M SG+ +  +   + + F+ M +   RPN  TF+
Sbjct: 214 VKNKQVNKARSVFDDMSERNVVSWTAMISGYVQNKRFVDALKLFVLMFKTETRPNHFTFS 273

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++ +   +   ++  QLH  +I SG+  +V+  ++L+  Y    +      VF+ I  K
Sbjct: 274 SVLDACAGSSSLIMGLQLHPCIIKSGIANDVIWLTSLVDMYAKCGDMDAAFGVFESIRDK 333

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCM 445
           ++   N +I GY   GL   A E F  M
Sbjct: 334 NLVSWNAIIGGYASHGLATRALEEFDRM 361


>ref|XP_004289549.1| PREDICTED: pentatricopeptide repeat-containing protein At4g16835,
           mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 521

 Score =  129 bits (324), Expect = 4e-28
 Identities = 68/158 (43%), Positives = 94/158 (59%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           +R+   Q A +LFD M  +DV+SWNT+ SG   A      HQ  LQM R G  PN+ T +
Sbjct: 53  VRNGQTQIAHNLFDEMRVKDVVSWNTILSGLHNAKDPHGIHQLLLQMRRDGFGPNEYTIS 112

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
           I++ +FL T  +VL  Q+H   I   LN +V +GSALM+GY ++ +R     VFDEI  K
Sbjct: 113 IVLRAFLGTVLNVLVSQIHAFAIVLALNSSVFVGSALMKGYANIGDRVAMACVFDEISAK 172

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
           DV+  N LI  Y+E G ++EAQ  F  M  +N  SW++
Sbjct: 173 DVSSWNALISSYVELGCLDEAQRVFDGMLERNVVSWTS 210



 Score = 68.6 bits (166), Expect = 9e-10
 Identities = 38/148 (25%), Positives = 72/148 (48%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           IR+  + +A  +FD+M  ++V+SW  M SG+ K  +   + +  L M + G  PN  TF+
Sbjct: 216 IRNRRVDKARSVFDKMSGKNVVSWTVMISGYVKNQRFLDALELVLLMLKSGTLPNQFTFS 275

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++ +       +   Q+H  ++ +G+  +V L ++L+  Y    +      +F  +  K
Sbjct: 276 SVLDASAGCSSLIFGQQVHSSILKTGIPEDVTLSTSLVDMYAKCGDIDAAYCIFGSMQKK 335

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCM 445
           ++   N +I GY   G+   A E F  M
Sbjct: 336 NLISWNSIIGGYARHGIATRALEEFERM 363


>gb|EXB55980.1| hypothetical protein L484_018766 [Morus notabilis]
          Length = 522

 Score =  127 bits (319), Expect = 2e-27
 Identities = 68/157 (43%), Positives = 91/157 (57%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           IR   LQ A  LFD MP RD++SWNTM SG RK+      +   L M R G  PN+ T +
Sbjct: 54  IRSGRLQHAHKLFDEMPLRDLVSWNTMLSGLRKSKDPHGVYGCLLGMRRTGLSPNEYTIS 113

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++ +   T+F+VL  QLH  VI   LN  + +GSALMRGY D+ +     +VFDE+  K
Sbjct: 114 TVLKAVFGTEFNVLVFQLHAFVIRVALNSCLFVGSALMRGYADVGDPVVLRRVFDELFEK 173

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWS 472
           DV+    L+  YM  G ++EA+  F  MP KN  SW+
Sbjct: 174 DVSSWTALVSSYMRIGWLHEAERVFDTMPEKNIVSWT 210



 Score = 76.3 bits (186), Expect = 4e-12
 Identities = 46/148 (31%), Positives = 76/148 (51%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           I +  +++A  +FDRM +R+VISW  M +G+ + +    + Q F  M R G RPND TF+
Sbjct: 217 IDNKKIKKARTVFDRMGERNVISWTAMINGYVQNHYFADALQLFAWMLRSGTRPNDFTFS 276

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++ +       +   Q+H  ++ SG+   +V+ S+L+  Y    +      VFD +  K
Sbjct: 277 SVLGACAGCCSLLTGQQVHSSILKSGVPDGIVMSSSLVDMYAKCGDVDVALCVFDAMPHK 336

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCM 445
           +V   N +I GY   GL   A + F  M
Sbjct: 337 NVVSWNSIIGGYARHGLAMRALDEFERM 364


>ref|XP_006394465.1| hypothetical protein EUTSA_v10005291mg [Eutrema salsugineum]
           gi|557091104|gb|ESQ31751.1| hypothetical protein
           EUTSA_v10005291mg [Eutrema salsugineum]
          Length = 514

 Score =  119 bits (299), Expect = 3e-25
 Identities = 58/158 (36%), Positives = 90/158 (56%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           ++   + +A  +FD+MP RD++SWNTM  G +K    +    +F+ M R G  P++LT  
Sbjct: 67  VQRGLMLEAHQVFDQMPVRDMVSWNTMLMGLKKTRDPESVVSFFIAMRRSGLNPDELTLP 126

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            ++ + L + F V   Q+H   I  GL+L+   GSAL+R Y   R+ +   +VF++IL K
Sbjct: 127 AIMDAVLGSTFKVSVLQIHTLAIRLGLSLSPYTGSALLRAYTSFRDFRGLERVFEDILFK 186

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
           DV   NV I GYM+ G V + +  F  MP +N  +W T
Sbjct: 187 DVVSWNVFITGYMKLGFVEDGERAFGEMPERNIITWHT 224


>gb|EMJ02790.1| hypothetical protein PRUPE_ppa020837mg [Prunus persica]
          Length = 610

 Score =  110 bits (276), Expect = 1e-22
 Identities = 55/153 (35%), Positives = 94/153 (61%)
 Frame = +2

Query: 14  HLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFAILIS 193
           +L++A  LFD+MP++DV+SWNTM  G+ ++    ++ +Y+ ++ RL    N+ +FA +++
Sbjct: 139 NLKEARSLFDKMPEKDVVSWNTMVIGYAQSGVCDEALRYYRELRRLSIGYNEFSFAGVLT 198

Query: 194 SFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIKDVAP 373
             ++ K   LT Q+HGQV+ +G   NVVL S+L+  Y    E  +  ++FD + ++DV  
Sbjct: 199 VCVKLKELELTRQVHGQVLVAGFLSNVVLSSSLVDAYTKCGEMGDARRLFDNMPVRDVLA 258

Query: 374 CNVLILGYMEFGLVNEAQETFSCMPGKNAFSWS 472
              L+ GY ++G +    E FS MP KN  SW+
Sbjct: 259 WTTLVSGYAKWGDMESGSELFSQMPEKNPVSWT 291


>ref|XP_002528283.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223532320|gb|EEF34121.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 602

 Score =  109 bits (272), Expect = 4e-22
 Identities = 52/153 (33%), Positives = 94/153 (61%)
 Frame = +2

Query: 17  LQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFAILISS 196
           ++ A  LFD+MP++DV+SWNTM   + K+     + +++ ++ RLG   N+ +FA L++ 
Sbjct: 133 IKPARKLFDKMPEKDVVSWNTMVIAYAKSGFCNDALRFYRELRRLGIGYNEYSFAGLLNI 192

Query: 197 FLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIKDVAPC 376
            ++ K   L+ Q HGQV+ +G   N+V+ S+++  Y    E  +  ++FDE++I+DV   
Sbjct: 193 CVKVKELELSKQAHGQVLVAGFLSNLVISSSVLDAYAKCSEMGDARRLFDEMIIRDVLAW 252

Query: 377 NVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
             ++ GY ++G V  A+E F  MP KN  +W++
Sbjct: 253 TTMVSGYAQWGDVEAARELFDLMPEKNPVAWTS 285


>ref|XP_006296137.1| hypothetical protein CARUB_v10025289mg [Capsella rubella]
           gi|482564845|gb|EOA29035.1| hypothetical protein
           CARUB_v10025289mg [Capsella rubella]
          Length = 597

 Score =  105 bits (262), Expect = 6e-21
 Identities = 53/157 (33%), Positives = 93/157 (59%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           ++   L +A  +FD MP+RDV+SWNTM  G+ +   L ++  ++ ++ R G + N+ +FA
Sbjct: 124 VKSGMLVRARVVFDSMPERDVVSWNTMVIGYAQNGNLHEALWFYKELRRSGIKYNEFSFA 183

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            L+++ ++++   L  Q HGQV+ +GL  NVVL  +++  Y    + ++  + FDE+ +K
Sbjct: 184 GLLTACVKSRHLQLNRQAHGQVLIAGLLSNVVLSCSIIDAYAKCGQMESAKRCFDEMAVK 243

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWS 472
           D+     LI GY + G +  A + F  MP KN  SW+
Sbjct: 244 DIHIWTTLISGYAKLGDMEAADKLFCEMPEKNPVSWT 280


>ref|XP_002880355.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297326194|gb|EFH56614.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 599

 Score =  104 bits (259), Expect = 1e-20
 Identities = 53/157 (33%), Positives = 92/157 (58%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           ++   L +A  +FD MP+RDV+SWNTM  G+ +   L ++  +F ++ R G + N+ +FA
Sbjct: 124 VKSGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALWFFKELRRSGIKFNEFSFA 183

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            L+++ ++++   L  Q HGQV+ +G   NVVL  +++  Y    + ++  + FDE+ +K
Sbjct: 184 GLLTACVKSRQLQLNQQAHGQVLVAGFLSNVVLSCSIIDAYAKCGQMESAKRCFDEMTVK 243

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWS 472
           D+     LI GY + G +  A + F  MP KN  SW+
Sbjct: 244 DIHIWTTLISGYAKLGDMEAADKLFREMPEKNPVSWT 280


>gb|EOX97730.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
           cacao]
          Length = 610

 Score =  103 bits (256), Expect = 3e-20
 Identities = 50/152 (32%), Positives = 92/152 (60%)
 Frame = +2

Query: 17  LQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFAILISS 196
           ++ A  LFD+MP+RDV+SWNTM   + ++   +++ +++ ++  L    N+ +FA +++ 
Sbjct: 138 IKPARKLFDQMPERDVVSWNTMVIAYAQSGFFEEALRFYKELRNLCIGYNEFSFAGVLTV 197

Query: 197 FLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIKDVAPC 376
            ++ +   LT Q+HGQV+ SG   N+V+ S+++ GY+         ++FDE+ +KDV   
Sbjct: 198 CVKLRELQLTRQVHGQVLVSGFLSNLVISSSVVDGYVKCGMMGESRRLFDEMKVKDVLAW 257

Query: 377 NVLILGYMEFGLVNEAQETFSCMPGKNAFSWS 472
             L+ GY ++G +  A + F  MP KN  SW+
Sbjct: 258 TTLVSGYSQWGDMESANDLFDKMPQKNPVSWT 289



 Score = 57.4 bits (137), Expect = 2e-06
 Identities = 38/144 (26%), Positives = 67/144 (46%), Gaps = 1/144 (0%)
 Frame = +2

Query: 17  LQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFAILISS 196
           ++ A DLFD+MPQ++ +SW  + SG+ +     K+ + F +M     RP+  TF+  + +
Sbjct: 270 MESANDLFDKMPQKNPVSWTALISGYARNGMGNKALELFTRMMVCRVRPDQFTFSSCLCA 329

Query: 197 FLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK-DVAP 373
                      Q+H  +I +    N+++ S+L+  Y      K   ++FD    K D   
Sbjct: 330 CASVASLTHGKQIHACLIRTNFRPNMIVISSLIDMYSKCGSLKASKRIFDLTDNKQDPVL 389

Query: 374 CNVLILGYMEFGLVNEAQETFSCM 445
            N +I    + G   EA + F  M
Sbjct: 390 WNTMISALAQHGYGEEAVKMFDDM 413


>ref|NP_179705.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75206523|sp|Q9SKQ4.1|PP167_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At2g21090 gi|4803934|gb|AAD29807.1| unknown protein
           [Arabidopsis thaliana] gi|330252028|gb|AEC07122.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 597

 Score =  102 bits (255), Expect = 4e-20
 Identities = 52/157 (33%), Positives = 92/157 (58%)
 Frame = +2

Query: 2   IRHNHLQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFA 181
           ++   L +A  +FD MP+RDV+SWNTM  G+ +   L ++  ++ +  R G + N+ +FA
Sbjct: 124 VKSGMLVRARVVFDSMPERDVVSWNTMVIGYAQDGNLHEALWFYKEFRRSGIKFNEFSFA 183

Query: 182 ILISSFLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIK 361
            L+++ ++++   L  Q HGQV+ +G   NVVL  +++  Y    + ++  + FDE+ +K
Sbjct: 184 GLLTACVKSRQLQLNRQAHGQVLVAGFLSNVVLSCSIIDAYAKCGQMESAKRCFDEMTVK 243

Query: 362 DVAPCNVLILGYMEFGLVNEAQETFSCMPGKNAFSWS 472
           D+     LI GY + G +  A++ F  MP KN  SW+
Sbjct: 244 DIHIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWT 280


>ref|XP_006479936.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g21090-like isoform X1 [Citrus sinensis]
          Length = 605

 Score =  102 bits (254), Expect = 5e-20
 Identities = 50/153 (32%), Positives = 90/153 (58%)
 Frame = +2

Query: 17  LQQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFAILISS 196
           ++ A +LFD M +RDV+SWNTM  G+ K+  +++  +++  + R     N+ +FA +++ 
Sbjct: 135 MKHARNLFDNMAERDVVSWNTMIIGYAKSGAVEEGLKFYKVLRRFSISCNEFSFAGILTI 194

Query: 197 FLETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIKDVAPC 376
            ++ +   LT Q+HGQV+ +G   NVV+ S+++  Y    E  +  ++FDE   +DV   
Sbjct: 195 CVKLEELKLTRQVHGQVLVTGFLSNVVISSSIVDAYAKCGELSDARRLFDETEARDVLTW 254

Query: 377 NVLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
             ++ GY + G +  A + F+ MP KN  SW+T
Sbjct: 255 TTMVSGYAKLGDMESASKLFNEMPEKNPVSWTT 287


>ref|XP_006479937.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g21090-like isoform X2 [Citrus sinensis]
          Length = 578

 Score =  101 bits (252), Expect = 9e-20
 Identities = 50/152 (32%), Positives = 89/152 (58%)
 Frame = +2

Query: 20  QQAEDLFDRMPQRDVISWNTMFSGFRKANKLKKSHQYFLQMSRLGDRPNDLTFAILISSF 199
           + A +LFD M +RDV+SWNTM  G+ K+  +++  +++  + R     N+ +FA +++  
Sbjct: 109 KHARNLFDNMAERDVVSWNTMIIGYAKSGAVEEGLKFYKVLRRFSISCNEFSFAGILTIC 168

Query: 200 LETKFSVLTPQLHGQVICSGLNLNVVLGSALMRGYIDLRERKNFCKVFDEILIKDVAPCN 379
           ++ +   LT Q+HGQV+ +G   NVV+ S+++  Y    E  +  ++FDE   +DV    
Sbjct: 169 VKLEELKLTRQVHGQVLVTGFLSNVVISSSIVDAYAKCGELSDARRLFDETEARDVLTWT 228

Query: 380 VLILGYMEFGLVNEAQETFSCMPGKNAFSWST 475
            ++ GY + G +  A + F+ MP KN  SW+T
Sbjct: 229 TMVSGYAKLGDMESASKLFNEMPEKNPVSWTT 260

