BLASTX nr result

ID: Catharanthus22_contig00032801 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00032801
         (381 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004236153.1| PREDICTED: pentatricopeptide repeat-containi...   218   5e-55
ref|XP_006344998.1| PREDICTED: pentatricopeptide repeat-containi...   218   7e-55
ref|XP_004301284.1| PREDICTED: pentatricopeptide repeat-containi...   207   2e-51
ref|XP_004152457.1| PREDICTED: pentatricopeptide repeat-containi...   204   1e-50
ref|XP_006476683.1| PREDICTED: pentatricopeptide repeat-containi...   200   1e-49
ref|XP_006439730.1| hypothetical protein CICLE_v10019863mg [Citr...   200   1e-49
sp|Q9S7R4.1|PP125_ARATH RecName: Full=Pentatricopeptide repeat-c...   198   5e-49
ref|NP_177628.2| pentatricopeptide repeat-containing protein [Ar...   198   5e-49
ref|XP_002887564.1| predicted protein [Arabidopsis lyrata subsp....   198   7e-49
ref|XP_006301673.1| hypothetical protein CARUB_v10022126mg [Caps...   197   9e-49
dbj|BAD44503.1| hypothetical protein [Arabidopsis thaliana]           197   2e-48
ref|XP_002321560.2| pentatricopeptide repeat-containing family p...   196   2e-48
gb|EOY20582.1| Pentatricopeptide repeat (PPR) superfamily protei...   196   2e-48
gb|EOY20581.1| Pentatricopeptide repeat superfamily protein isof...   196   2e-48
ref|XP_006390373.1| hypothetical protein EUTSA_v10018527mg [Eutr...   196   4e-48
ref|XP_002511467.1| pentatricopeptide repeat-containing protein,...   196   4e-48
gb|EXC32244.1| hypothetical protein L484_004747 [Morus notabilis]     195   6e-48
ref|XP_003601293.1| Pentatricopeptide repeat-containing protein,...   192   4e-47
ref|XP_004501962.1| PREDICTED: pentatricopeptide repeat-containi...   189   3e-46
gb|EPS67440.1| hypothetical protein M569_07334, partial [Genlise...   185   6e-45

>ref|XP_004236153.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74900,
           mitochondrial-like [Solanum lycopersicum]
          Length = 492

 Score =  218 bits (556), Expect = 5e-55
 Identities = 101/127 (79%), Positives = 117/127 (92%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           +PKTFAIITERYVS GK+DKAV +FLSMHKHGCPQDL+SFN+FLDVLCK+KR EMA KL 
Sbjct: 136 NPKTFAIITERYVSAGKADKAVNVFLSMHKHGCPQDLSSFNAFLDVLCKSKRAEMALKLF 195

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           K+FR +F+ADT+SY+ +ANGFCLVKRTPKA EILKEMVERGL+PT+TTYNIML GFFRAG
Sbjct: 196 KMFRSRFKADTISYNTLANGFCLVKRTPKAQEILKEMVERGLNPTITTYNIMLNGFFRAG 255

Query: 21  QLNEAWK 1
           Q+ EAW+
Sbjct: 256 QIKEAWE 262



 Score = 60.1 bits (144), Expect = 3e-07
 Identities = 30/123 (24%), Positives = 63/123 (51%), Gaps = 1/123 (0%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL-KI 196
           T+  +   +   G+ +KA K+F  M   G    + ++N+ + V+CK    E A  +  ++
Sbjct: 278 TYTTLVHGFGVAGEVEKAQKLFNEMVGAGILPSIATYNALIQVMCKKDSTENAILVFNEM 337

Query: 195 FRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
            R  +  +  +Y+ I  G C V +   A+E + +M E G  P + TYN++++ +   G++
Sbjct: 338 LRKGYLPNATTYNAIIRGLCHVGKMDNAMEYMDKMNEDGCEPNVQTYNVVIRYYCDEGEI 397

Query: 15  NEA 7
            ++
Sbjct: 398 EKS 400


>ref|XP_006344998.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74900,
           mitochondrial-like isoform X1 [Solanum tuberosum]
           gi|565356286|ref|XP_006344999.1| PREDICTED:
           pentatricopeptide repeat-containing protein At1g74900,
           mitochondrial-like isoform X2 [Solanum tuberosum]
           gi|565356288|ref|XP_006345000.1| PREDICTED:
           pentatricopeptide repeat-containing protein At1g74900,
           mitochondrial-like isoform X3 [Solanum tuberosum]
          Length = 486

 Score =  218 bits (555), Expect = 7e-55
 Identities = 101/127 (79%), Positives = 116/127 (91%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           +PKTFAIITERYVS GK+DKAV +FLSMHKHGCPQDL SFN+FLDVLCK+KR EMA KL 
Sbjct: 130 NPKTFAIITERYVSAGKADKAVNVFLSMHKHGCPQDLNSFNAFLDVLCKSKRAEMALKLF 189

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           K+FR +F+ADT+SY+ +ANGFCLVKRTPKA EILKEMVERGL+PT+TTYNIML GFFRAG
Sbjct: 190 KMFRSRFKADTISYNTLANGFCLVKRTPKAQEILKEMVERGLNPTITTYNIMLNGFFRAG 249

Query: 21  QLNEAWK 1
           Q+ EAW+
Sbjct: 250 QIKEAWE 256



 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 32/123 (26%), Positives = 64/123 (52%), Gaps = 1/123 (0%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL-KI 196
           T+  I   +   G+ +KA K+F  M   G    + ++N+ + V+CK   VE A  +  ++
Sbjct: 272 TYTTIVHGFGVAGEVEKAQKLFNEMVGAGILPSVATYNALIQVMCKKDSVENAILIFNEM 331

Query: 195 FRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
            R  +  +  +Y+ I  G C V +   A+E + +M E G  P + TYN++++ +   G++
Sbjct: 332 LRKGYLPNATTYNAIIRGLCHVGKMDNAMEYMDKMNEDGCEPNVQTYNVVIRYYCDEGEI 391

Query: 15  NEA 7
            ++
Sbjct: 392 EKS 394


>ref|XP_004301284.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74900,
           mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 489

 Score =  207 bits (526), Expect = 2e-51
 Identities = 93/127 (73%), Positives = 113/127 (88%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           +P+TFAII ERYV+ GK D+AVK+FLSMH+HGCPQDL SFN+ LDVLCKAKRVE AY L 
Sbjct: 133 APRTFAIIAERYVAAGKPDRAVKVFLSMHEHGCPQDLNSFNTVLDVLCKAKRVEKAYNLF 192

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           K+FRG+FRAD VSY++I NG+CL+KRTPKALE+L+EMVERG+ P+L TYNIMLKG+ RAG
Sbjct: 193 KVFRGRFRADCVSYNVIVNGWCLIKRTPKALEVLREMVERGIEPSLVTYNIMLKGYLRAG 252

Query: 21  QLNEAWK 1
           Q+ EAW+
Sbjct: 253 QVKEAWE 259



 Score = 58.2 bits (139), Expect = 1e-06
 Identities = 30/123 (24%), Positives = 61/123 (49%), Gaps = 1/123 (0%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLKIF 193
           T+  +   +   G+  K  KIF  M + G    + ++N+ + VLCK   VE A  + +  
Sbjct: 275 TYTTLVHGFGVLGEIKKVRKIFDGMVEEGVLPSVATYNALIQVLCKKDSVENAVVVFEEM 334

Query: 192 RGK-FRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
             K +  +  +Y+++  G C        +E+++ M +    P + TYN++++ F   GQ+
Sbjct: 335 VSKGYVPNVTTYNVLVRGLCHAGNMDSGMELMERMKDDDCEPNVQTYNVVIRYFCDDGQI 394

Query: 15  NEA 7
           ++A
Sbjct: 395 DKA 397


>ref|XP_004152457.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74900,
           mitochondrial-like [Cucumis sativus]
           gi|449487784|ref|XP_004157799.1| PREDICTED:
           pentatricopeptide repeat-containing protein At1g74900,
           mitochondrial-like [Cucumis sativus]
          Length = 502

 Score =  204 bits (519), Expect = 1e-50
 Identities = 95/128 (74%), Positives = 115/128 (89%), Gaps = 1/128 (0%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYK-L 205
           S KTFAII ER+V+ GK D+A+K+FLSM +HGCPQDL SFN+ LD+LCK+KRVEMAY  L
Sbjct: 146 SSKTFAIIAERFVAAGKPDRAIKVFLSMREHGCPQDLHSFNTILDILCKSKRVEMAYNNL 205

Query: 204 LKIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRA 25
            K+ RGKF+AD VSY+IIANG+CL+KRTPKALE+LKEMVERGL+PT+TTYNI+LKG+FRA
Sbjct: 206 FKVLRGKFKADVVSYNIIANGWCLIKRTPKALEVLKEMVERGLTPTITTYNILLKGYFRA 265

Query: 24  GQLNEAWK 1
           GQL EAW+
Sbjct: 266 GQLKEAWE 273


>ref|XP_006476683.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74900,
           mitochondrial-like isoform X1 [Citrus sinensis]
           gi|568845657|ref|XP_006476684.1| PREDICTED:
           pentatricopeptide repeat-containing protein At1g74900,
           mitochondrial-like isoform X2 [Citrus sinensis]
          Length = 493

 Score =  200 bits (509), Expect = 1e-49
 Identities = 92/125 (73%), Positives = 111/125 (88%)
 Frame = -1

Query: 375 KTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLKI 196
           KTFAII ERYVS GK+D+AVKIFLSMH+HGC Q L SFN+ LD+LCK K+VE AY L K+
Sbjct: 139 KTFAIIAERYVSAGKADRAVKIFLSMHEHGCRQSLNSFNTILDLLCKEKKVEKAYNLFKV 198

Query: 195 FRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
           FRGKF+AD +SY++IANG+CLVKRT KALE+LKEMV+RGL+P LTTYNI+LKG+FRAGQ+
Sbjct: 199 FRGKFKADVISYNVIANGWCLVKRTNKALEVLKEMVDRGLNPNLTTYNIVLKGYFRAGQI 258

Query: 15  NEAWK 1
            EAW+
Sbjct: 259 EEAWR 263



 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 32/120 (26%), Positives = 60/120 (50%), Gaps = 1/120 (0%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLKIF 193
           T+  I   +   G+  +A  +F  M   G    + ++N+ + VLCK   VE A  + +  
Sbjct: 279 TYTTIVHGFGVVGEIKRARNVFDGMVNGGVLPSVATYNAMIQVLCKKDSVENAILVFEEM 338

Query: 192 RGK-FRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
            GK +  ++ +Y+++  G C      +ALE +  M +    P + TYNI+++ F  AG++
Sbjct: 339 VGKGYMPNSTTYNVVIRGLCHTGEMERALEFVGRMKDDECEPNVQTYNILIRYFCDAGEI 398


>ref|XP_006439730.1| hypothetical protein CICLE_v10019863mg [Citrus clementina]
           gi|557541992|gb|ESR52970.1| hypothetical protein
           CICLE_v10019863mg [Citrus clementina]
          Length = 493

 Score =  200 bits (509), Expect = 1e-49
 Identities = 92/125 (73%), Positives = 111/125 (88%)
 Frame = -1

Query: 375 KTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLKI 196
           KTFAII ERYVS GK+D+AVKIFLSMH+HGC Q L SFN+ LD+LCK K+VE AY L K+
Sbjct: 139 KTFAIIAERYVSAGKADRAVKIFLSMHEHGCRQSLNSFNTILDLLCKEKKVEKAYNLFKV 198

Query: 195 FRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
           FRGKF+AD +SY++IANG+CLVKRT KALE+LKEMV+RGL+P LTTYNI+LKG+FRAGQ+
Sbjct: 199 FRGKFKADVISYNVIANGWCLVKRTNKALEVLKEMVDRGLNPNLTTYNIVLKGYFRAGQI 258

Query: 15  NEAWK 1
            EAW+
Sbjct: 259 EEAWR 263



 Score = 57.8 bits (138), Expect = 2e-06
 Identities = 31/120 (25%), Positives = 60/120 (50%), Gaps = 1/120 (0%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLK-I 196
           T+  I   +   G+  +A  +F  M   G    + ++N+ + VLCK   VE A  + + +
Sbjct: 279 TYTTIVHGFGIVGEIKRARNVFDGMVNGGVLPSVATYNAMIQVLCKKDSVENAILVFEEM 338

Query: 195 FRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
            R  +  ++ +Y+++  G C      +ALE +  M +    P + TYNI+++ F  AG++
Sbjct: 339 VRKGYMPNSTTYNVVIRGLCHAGEMERALEFVGRMKDDECEPNVQTYNILIRYFCDAGEI 398


>sp|Q9S7R4.1|PP125_ARATH RecName: Full=Pentatricopeptide repeat-containing protein
           At1g74900, mitochondrial; AltName: Full=Protein
           ORGANELLE TRANSCRIPT PROCESSING DEFECT 43; Flags:
           Precursor gi|5882733|gb|AAD55286.1|AC008263_17 Contains
           a PF|01535 DUF17 domain [Arabidopsis thaliana]
           gi|12323885|gb|AAG51911.1|AC013258_5 hypothetical
           protein; 69434-67986 [Arabidopsis thaliana]
          Length = 482

 Score =  198 bits (504), Expect = 5e-49
 Identities = 90/127 (70%), Positives = 109/127 (85%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           SPKTFAI+ ERY S GK DKAVK+FL+MH+HGC QDL SFN+ LDVLCK+KRVE AY+L 
Sbjct: 125 SPKTFAIVAERYASAGKPDKAVKLFLNMHEHGCFQDLASFNTILDVLCKSKRVEKAYELF 184

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           +  RG+F  DTV+Y++I NG+CL+KRTPKALE+LKEMVERG++P LTTYN MLKGFFRAG
Sbjct: 185 RALRGRFSVDTVTYNVILNGWCLIKRTPKALEVLKEMVERGINPNLTTYNTMLKGFFRAG 244

Query: 21  QLNEAWK 1
           Q+  AW+
Sbjct: 245 QIRHAWE 251


>ref|NP_177628.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|133778904|gb|ABO38792.1| At1g74900 [Arabidopsis
           thaliana] gi|332197524|gb|AEE35645.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 453

 Score =  198 bits (504), Expect = 5e-49
 Identities = 90/127 (70%), Positives = 109/127 (85%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           SPKTFAI+ ERY S GK DKAVK+FL+MH+HGC QDL SFN+ LDVLCK+KRVE AY+L 
Sbjct: 125 SPKTFAIVAERYASAGKPDKAVKLFLNMHEHGCFQDLASFNTILDVLCKSKRVEKAYELF 184

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           +  RG+F  DTV+Y++I NG+CL+KRTPKALE+LKEMVERG++P LTTYN MLKGFFRAG
Sbjct: 185 RALRGRFSVDTVTYNVILNGWCLIKRTPKALEVLKEMVERGINPNLTTYNTMLKGFFRAG 244

Query: 21  QLNEAWK 1
           Q+  AW+
Sbjct: 245 QIRHAWE 251


>ref|XP_002887564.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297333405|gb|EFH63823.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 451

 Score =  198 bits (503), Expect = 7e-49
 Identities = 89/127 (70%), Positives = 111/127 (87%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           SPKTFAI+ ERY S GK DKAVK+FL+MH+HGC QDL SFN+ LDVLCK+KRVE AY+L 
Sbjct: 123 SPKTFAIVAERYASSGKPDKAVKLFLNMHEHGCFQDLASFNTILDVLCKSKRVEKAYELF 182

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           +  RG+F ADTV+Y++I NG+CL+KRTPKALE+LKEMV+RG++P LTTYN ML+GFFRAG
Sbjct: 183 RALRGRFSADTVTYNVIVNGWCLIKRTPKALEVLKEMVDRGINPNLTTYNTMLQGFFRAG 242

Query: 21  QLNEAWK 1
           Q+ +AW+
Sbjct: 243 QIRQAWE 249


>ref|XP_006301673.1| hypothetical protein CARUB_v10022126mg [Capsella rubella]
           gi|482570383|gb|EOA34571.1| hypothetical protein
           CARUB_v10022126mg [Capsella rubella]
          Length = 451

 Score =  197 bits (502), Expect = 9e-49
 Identities = 90/127 (70%), Positives = 109/127 (85%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           SPKTFAI+ ER+ S GK DKAVK+FL+MH+HGC QDL SFN+ LDVLCK+KRVE AY+L 
Sbjct: 123 SPKTFAIVAERFASSGKPDKAVKLFLNMHEHGCFQDLASFNTILDVLCKSKRVEKAYELF 182

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           +  RG+F ADTV+Y++I NG+CL+KRTPKALE+LKEMVERG+ P LTTYN MLKGFFRAG
Sbjct: 183 RALRGRFGADTVTYNVIVNGWCLIKRTPKALEVLKEMVERGIDPNLTTYNTMLKGFFRAG 242

Query: 21  QLNEAWK 1
           Q+  AW+
Sbjct: 243 QIRHAWE 249


>dbj|BAD44503.1| hypothetical protein [Arabidopsis thaliana]
          Length = 447

 Score =  197 bits (500), Expect = 2e-48
 Identities = 89/127 (70%), Positives = 108/127 (85%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           SPKTFAI+ ERY S GK DKAVK+FL+MH+HGC QDL SFN+ LDVLCK+KRVE AY+L 
Sbjct: 119 SPKTFAIVAERYASAGKPDKAVKLFLNMHEHGCFQDLASFNTILDVLCKSKRVEKAYELF 178

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           +  RG+F  DTV+Y++I NG+CL+KRTPK LE+LKEMVERG++P LTTYN MLKGFFRAG
Sbjct: 179 RALRGRFSVDTVTYNVILNGWCLIKRTPKTLEVLKEMVERGINPNLTTYNTMLKGFFRAG 238

Query: 21  QLNEAWK 1
           Q+  AW+
Sbjct: 239 QIRHAWE 245


>ref|XP_002321560.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550322291|gb|EEF05687.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 491

 Score =  196 bits (499), Expect = 2e-48
 Identities = 91/126 (72%), Positives = 110/126 (87%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           +PKTFAII ERY S GK  +AVK+FLSMH+ GC QDL SFN+ LDVLCK+KRVEMAY L 
Sbjct: 135 TPKTFAIIAERYASAGKPHRAVKVFLSMHQFGCFQDLQSFNTILDVLCKSKRVEMAYNLF 194

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           K+F+GKFRAD VSY+++ NG+CL+KRT KALE+LKEMV+RGL+P LT+YN MLKG+FRAG
Sbjct: 195 KVFKGKFRADCVSYNVMVNGWCLIKRTNKALEMLKEMVKRGLTPNLTSYNTMLKGYFRAG 254

Query: 21  QLNEAW 4
           Q+NEAW
Sbjct: 255 QINEAW 260



 Score = 62.8 bits (151), Expect = 5e-08
 Identities = 31/124 (25%), Positives = 69/124 (55%), Gaps = 2/124 (1%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLK-- 199
           T+  +   +   G+  +A K+F +M K G    + ++N+F+ VLCK   V+ A  + +  
Sbjct: 277 TYTTVIHGFGVAGEIKRARKVFDTMVKKGVLPSVATYNAFIQVLCKKDNVDNAIVIFEEM 336

Query: 198 IFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQ 19
           + +G +  ++++Y+++  G C      +A+E +  M + G  P + TYN++++ F   G+
Sbjct: 337 VVKG-YVPNSITYNLVIRGLCHRGEMERAMEFMGRMRDDGCEPNVQTYNLVIRYFCDEGE 395

Query: 18  LNEA 7
           +++A
Sbjct: 396 IDKA 399


>gb|EOY20582.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2
           [Theobroma cacao] gi|508773327|gb|EOY20583.1|
           Pentatricopeptide repeat (PPR) superfamily protein
           isoform 2 [Theobroma cacao]
          Length = 413

 Score =  196 bits (499), Expect = 2e-48
 Identities = 93/127 (73%), Positives = 108/127 (85%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           +PKTFAII ERYV+ GK DKA+KIFLSMH+HGC QDL SFN+ LDVLCKAKRVE A    
Sbjct: 135 TPKTFAIIAERYVAAGKPDKALKIFLSMHEHGCFQDLHSFNTILDVLCKAKRVEKACNFF 194

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           K+ RGKF+AD +SY+IIANG+CL+KRT  ALE LKEMVE+GL+P LTTYNIMLKG+FRAG
Sbjct: 195 KVLRGKFKADVISYNIIANGWCLIKRTNMALETLKEMVEKGLTPNLTTYNIMLKGYFRAG 254

Query: 21  QLNEAWK 1
           Q+ E WK
Sbjct: 255 QIEEGWK 261



 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 30/122 (24%), Positives = 65/122 (53%), Gaps = 1/122 (0%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLK-I 196
           T+  +       G+  +A K+F  M + G    + ++N+ + VLCK   VE A  + + +
Sbjct: 277 TYTTVVHGLGVAGEIKRARKVFDEMVREGVLPSVATYNALIQVLCKKDCVENAILVFEEM 336

Query: 195 FRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
            R  +  ++ +Y+++  G C  ++  +A+E + +M +    P + TYNI+++ F  AG++
Sbjct: 337 LRKGYVPNSTTYNVVIRGLCHKEQMDRAIEFMDKMRDDECGPNVQTYNIVIRYFCDAGEI 396

Query: 15  NE 10
            +
Sbjct: 397 EK 398


>gb|EOY20581.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
           cacao] gi|508773328|gb|EOY20584.1| Pentatricopeptide
           repeat superfamily protein isoform 1 [Theobroma cacao]
          Length = 491

 Score =  196 bits (499), Expect = 2e-48
 Identities = 93/127 (73%), Positives = 108/127 (85%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           +PKTFAII ERYV+ GK DKA+KIFLSMH+HGC QDL SFN+ LDVLCKAKRVE A    
Sbjct: 135 TPKTFAIIAERYVAAGKPDKALKIFLSMHEHGCFQDLHSFNTILDVLCKAKRVEKACNFF 194

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           K+ RGKF+AD +SY+IIANG+CL+KRT  ALE LKEMVE+GL+P LTTYNIMLKG+FRAG
Sbjct: 195 KVLRGKFKADVISYNIIANGWCLIKRTNMALETLKEMVEKGLTPNLTTYNIMLKGYFRAG 254

Query: 21  QLNEAWK 1
           Q+ E WK
Sbjct: 255 QIEEGWK 261



 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 30/122 (24%), Positives = 65/122 (53%), Gaps = 1/122 (0%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLK-I 196
           T+  +       G+  +A K+F  M + G    + ++N+ + VLCK   VE A  + + +
Sbjct: 277 TYTTVVHGLGVAGEIKRARKVFDEMVREGVLPSVATYNALIQVLCKKDCVENAILVFEEM 336

Query: 195 FRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
            R  +  ++ +Y+++  G C  ++  +A+E + +M +    P + TYNI+++ F  AG++
Sbjct: 337 LRKGYVPNSTTYNVVIRGLCHKEQMDRAIEFMDKMRDDECGPNVQTYNIVIRYFCDAGEI 396

Query: 15  NE 10
            +
Sbjct: 397 EK 398


>ref|XP_006390373.1| hypothetical protein EUTSA_v10018527mg [Eutrema salsugineum]
           gi|557086807|gb|ESQ27659.1| hypothetical protein
           EUTSA_v10018527mg [Eutrema salsugineum]
          Length = 454

 Score =  196 bits (497), Expect = 4e-48
 Identities = 88/127 (69%), Positives = 109/127 (85%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           SPKTFAI+ ERY S GK DKAV +FL+MH+HGC QDL SFN+ LDVLCK+KRVE A++L 
Sbjct: 126 SPKTFAIVAERYASAGKPDKAVNLFLNMHEHGCFQDLASFNTILDVLCKSKRVEKAHELF 185

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           +  RG+F  DTV+Y++I NG+CL+KRTPKALE+LKEMVERG++P LTTYN MLKGFFRAG
Sbjct: 186 RALRGRFSVDTVTYNVIVNGWCLIKRTPKALEVLKEMVERGINPNLTTYNTMLKGFFRAG 245

Query: 21  QLNEAWK 1
           Q+ +AW+
Sbjct: 246 QIKQAWE 252


>ref|XP_002511467.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223550582|gb|EEF52069.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 482

 Score =  196 bits (497), Expect = 4e-48
 Identities = 91/126 (72%), Positives = 109/126 (86%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           SP+TFAII ERY + GK  +AV +F+SMH++GC QDL+SFN+ LDVLCK+KRVEMAY L 
Sbjct: 126 SPRTFAIIAERYAAMGKPHRAVTVFMSMHEYGCFQDLSSFNTILDVLCKSKRVEMAYNLF 185

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           K  +GKF+AD VSY+II NG+CL+KRTPKALE+LKEMVERGL+P LTTYNIML G+FRAG
Sbjct: 186 KALKGKFKADCVSYNIIVNGWCLIKRTPKALEMLKEMVERGLTPNLTTYNIMLNGYFRAG 245

Query: 21  QLNEAW 4
           Q NEAW
Sbjct: 246 QTNEAW 251



 Score = 62.0 bits (149), Expect = 8e-08
 Identities = 31/111 (27%), Positives = 63/111 (56%), Gaps = 2/111 (1%)
 Frame = -1

Query: 336 GKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLK--IFRGKFRADTVS 163
           G+  +A  +F  M K G    + +FN+ + +LCK   VE A  + +  + RG +  ++++
Sbjct: 280 GEIKRARNVFNQMVKDGVLPSVATFNALIQILCKKDSVENAILIFEEMVKRG-YVPNSIT 338

Query: 162 YSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQLNE 10
           Y+++  G C V    +A+E+++ M +    P + TYNI+++ F  AG++ +
Sbjct: 339 YNLVIRGLCHVGEMQRAMELMERMEDDDCEPNVQTYNILIRYFCDAGEIEK 389


>gb|EXC32244.1| hypothetical protein L484_004747 [Morus notabilis]
          Length = 521

 Score =  195 bits (495), Expect = 6e-48
 Identities = 89/127 (70%), Positives = 110/127 (86%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           SPKTFAII ERYVS GKSD+A+K+FLSM +HGC QDL SFNS LDVLCK+ RVEMA+   
Sbjct: 126 SPKTFAIIAERYVSAGKSDRAIKVFLSMREHGCSQDLNSFNSVLDVLCKSGRVEMAHNFF 185

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           + +R  FR DTVSY++IANG+CL+K+TPKALE+L++MV+RG SP+L TYNIMLKG+FRAG
Sbjct: 186 RAYRRNFRVDTVSYNVIANGWCLIKKTPKALEVLEDMVKRGFSPSLITYNIMLKGYFRAG 245

Query: 21  QLNEAWK 1
           Q+ EAW+
Sbjct: 246 QVKEAWE 252



 Score = 57.4 bits (137), Expect = 2e-06
 Identities = 33/112 (29%), Positives = 60/112 (53%), Gaps = 1/112 (0%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL-KI 196
           ++ +I   +    K+ KA+++   M K G    L ++N  L    +A +V+ A++   ++
Sbjct: 198 SYNVIANGWCLIKKTPKALEVLEDMVKRGFSPSLITYNIMLKGYFRAGQVKEAWEFFGEM 257

Query: 195 FRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLK 40
            R K   D V+Y+ + +GF +V    KA  I  EMV  G+ PT+ TYN +++
Sbjct: 258 KRRKVEIDVVTYTTLVHGFGVVGEIKKARRIFDEMVGEGVVPTVATYNALIQ 309



 Score = 56.6 bits (135), Expect = 3e-06
 Identities = 32/123 (26%), Positives = 61/123 (49%), Gaps = 1/123 (0%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLKIF 193
           T+  +   +   G+  KA +IF  M   G    + ++N+ + VLCK   VE A  + +  
Sbjct: 268 TYTTLVHGFGVVGEIKKARRIFDEMVGEGVVPTVATYNALIQVLCKKDSVENAVVVFEEM 327

Query: 192 RGKFRADTVS-YSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
            GK     V+ Y+++  G C   +  +++E ++ M   G  P +  YNI+++ F   G++
Sbjct: 328 VGKGCVPNVTTYTVLVRGLCHAGQMERSMEFVERMKGDGCEPNVQIYNIVIRYFCDDGEI 387

Query: 15  NEA 7
            +A
Sbjct: 388 EKA 390


>ref|XP_003601293.1| Pentatricopeptide repeat-containing protein, partial [Medicago
           truncatula] gi|355490341|gb|AES71544.1|
           Pentatricopeptide repeat-containing protein, partial
           [Medicago truncatula]
          Length = 317

 Score =  192 bits (488), Expect = 4e-47
 Identities = 87/127 (68%), Positives = 108/127 (85%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           +PKTFAI+ ERY + GK+ KAVK+FLSMH+HGC QDL SFN+ LDVLCK KRVEMA  L 
Sbjct: 134 TPKTFAILAERYATGGKAHKAVKVFLSMHEHGCHQDLNSFNTILDVLCKTKRVEMANNLF 193

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           K  RG+F+ D+VSY+I+ANG+CL+KRTP AL++LKEMVERG+ PT+ TYN +LKG+FR G
Sbjct: 194 KTLRGRFKCDSVSYNIMANGWCLIKRTPMALQVLKEMVERGVDPTMVTYNTLLKGYFRCG 253

Query: 21  QLNEAWK 1
           QLNEAW+
Sbjct: 254 QLNEAWE 260


>ref|XP_004501962.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74900,
           mitochondrial-like [Cicer arietinum]
          Length = 498

 Score =  189 bits (480), Expect = 3e-46
 Identities = 83/126 (65%), Positives = 111/126 (88%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           +P+TFAI++ERY + GK+ +AVK+FLSMH+HGC QDL SFN+ LDVLCK KRVEMA+ L 
Sbjct: 142 TPRTFAILSERYATGGKAHRAVKVFLSMHEHGCNQDLNSFNTILDVLCKTKRVEMAHNLF 201

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
           K F+G+F+ D+VSY+I+ANG+CL+KRTP AL+++KEMVERG++PT+ TYN +LKG+FR+ 
Sbjct: 202 KTFKGRFKCDSVSYNIMANGWCLMKRTPMALQVMKEMVERGITPTMVTYNTLLKGYFRSH 261

Query: 21  QLNEAW 4
           QLNEAW
Sbjct: 262 QLNEAW 267



 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 32/124 (25%), Positives = 66/124 (53%), Gaps = 1/124 (0%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLKIF 193
           T+  +   +   G+  ++ ++F +M K G    + ++N+ + VLCK   V+ A  + +  
Sbjct: 284 TYTTMVHGFGVAGEVKRSKRVFDAMVKEGLIPSVATYNALIQVLCKKDNVQNALLVFEEM 343

Query: 192 RGK-FRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAGQL 16
            GK +  +  +Y+++  G C      KALE ++ M E G  P++ TYN++++ F   G+L
Sbjct: 344 VGKGYVPNLTTYNVVIRGLCHSGEMEKALEFMERMEEHGCRPSVQTYNVVIRYFCDDGEL 403

Query: 15  NEAW 4
            + +
Sbjct: 404 EKGF 407


>gb|EPS67440.1| hypothetical protein M569_07334, partial [Genlisea aurea]
          Length = 451

 Score =  185 bits (469), Expect = 6e-45
 Identities = 83/124 (66%), Positives = 107/124 (86%)
 Frame = -1

Query: 381 SPKTFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLL 202
           SPKTF+I+ ERY S G++DKAV +FL+MH+HGCPQDL+SFN+ LDVLCK+KR E A+KL 
Sbjct: 94  SPKTFSIVIERYASAGRADKAVNVFLTMHRHGCPQDLSSFNAMLDVLCKSKRAEKAHKLF 153

Query: 201 KIFRGKFRADTVSYSIIANGFCLVKRTPKALEILKEMVERGLSPTLTTYNIMLKGFFRAG 22
            I RG+FRAD ++Y+IIA GFCL K+T +A+E++KEMVERGL PTLTTYNI+LKG+F AG
Sbjct: 154 TILRGRFRADAITYNIIAYGFCLKKQTSRAVEVMKEMVERGLIPTLTTYNILLKGYFGAG 213

Query: 21  QLNE 10
           Q+ +
Sbjct: 214 QIKQ 217



 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 34/124 (27%), Positives = 65/124 (52%), Gaps = 2/124 (1%)
 Frame = -1

Query: 372 TFAIITERYVSCGKSDKAVKIFLSMHKHGCPQDLTSFNSFLDVLCKAKRVEMAYKLLKIF 193
           +F  +       G+ +KA K+F  M + G    + S+N+ + +LCK   VE A K+    
Sbjct: 236 SFTTVVHGLGVAGEIEKARKVFREMSEAGVLPTVASYNAMIQILCKKDSVENAMKVFDEM 295

Query: 192 RGK-FRADTVSYSIIANGFCLVKRTPKALEILKEMVERGL-SPTLTTYNIMLKGFFRAGQ 19
           + K  + +  +Y+++  G C V +   A+E +  M E G  +PT  T+NI+++ +   G+
Sbjct: 296 QQKGTKPNATTYNLVIRGLCHVGKFDMAIEYMDRMKENGCCTPTFQTFNIIIRYYCDEGE 355

Query: 18  LNEA 7
           + +A
Sbjct: 356 IEKA 359


Top