BLASTX nr result

ID: Catharanthus22_contig00015214 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00015214
         (719 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS68541.1| hypothetical protein M569_06226 [Genlisea aurea]       261   1e-67
ref|XP_004234452.1| PREDICTED: pentatricopeptide repeat-containi...   261   2e-67
ref|XP_002263650.1| PREDICTED: pentatricopeptide repeat-containi...   259   8e-67
gb|EMJ20518.1| hypothetical protein PRUPE_ppa019039mg [Prunus pe...   256   5e-66
ref|XP_004500471.1| PREDICTED: pentatricopeptide repeat-containi...   250   4e-64
gb|AHB18407.1| pentatricopeptide repeat-containing protein [Goss...   247   3e-63
gb|EOY22925.1| Tetratricopeptide repeat-like superfamily protein...   247   3e-63
ref|XP_002322407.2| hypothetical protein POPTR_0015s14630g [Popu...   244   2e-62
ref|XP_004307818.1| PREDICTED: pentatricopeptide repeat-containi...   241   1e-61
ref|XP_006490089.1| PREDICTED: pentatricopeptide repeat-containi...   240   4e-61
ref|XP_006421716.1| hypothetical protein CICLE_v10004726mg [Citr...   240   4e-61
ref|XP_004160887.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   239   6e-61
ref|XP_004148162.1| PREDICTED: pentatricopeptide repeat-containi...   239   6e-61
ref|XP_002510931.1| pentatricopeptide repeat-containing protein,...   237   3e-60
ref|XP_003533519.1| PREDICTED: pentatricopeptide repeat-containi...   234   2e-59
gb|ACU21163.1| unknown [Glycine max]                                  234   2e-59
ref|NP_194257.1| pentatricopeptide repeat protein OTP70 [Arabido...   231   1e-58
ref|XP_006413323.1| hypothetical protein EUTSA_v10024921mg [Eutr...   230   3e-58
ref|XP_006282784.1| hypothetical protein CARUB_v10006372mg, part...   229   5e-58
gb|ESW03326.1| hypothetical protein PHAVU_011G004900g [Phaseolus...   228   2e-57

>gb|EPS68541.1| hypothetical protein M569_06226 [Genlisea aurea]
          Length = 531

 Score =  261 bits (668), Expect = 1e-67
 Identities = 128/215 (59%), Positives = 159/215 (73%), Gaps = 3/215 (1%)
 Frame = -2

Query: 637 NPYSFSKKNPGKRKQIR---PSRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVISKL 467
           +P+  S     K+K      P  +  KL +PK  P PLL D    P+TK+QALESVI  L
Sbjct: 24  SPFDCSASGRKKKKNYEKFYPKIKPTKLPYPKYRPAPLLPDPKLRPQTKIQALESVICDL 83

Query: 466 ETSIKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYAS 287
           E S+K+ + ++D ++FA+LLETCF L ++DH  R+H LIP K+L +N G+ SKL+RLYAS
Sbjct: 84  EASLKNGVTIDDPQVFASLLETCFRLSAFDHGFRVHALIPAKLLRRNTGILSKLLRLYAS 143

Query: 286 DGHLELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFP 107
            G LE AH+LFDEMP RN+SAFPWNSLI+GY E G +EDALAL+FQMVEE V PD+HTFP
Sbjct: 144 RGDLEHAHKLFDEMPDRNSSAFPWNSLISGYAEAGFHEDALALFFQMVEEGVIPDEHTFP 203

Query: 106 RVLKACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           RVLKACGG+G I VG+E+HRHVIR G G  GFVLN
Sbjct: 204 RVLKACGGVGMIHVGEEVHRHVIRFGFGGSGFVLN 238


>ref|XP_004234452.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270,
           chloroplastic-like [Solanum lycopersicum]
          Length = 539

 Score =  261 bits (666), Expect = 2e-67
 Identities = 128/224 (57%), Positives = 167/224 (74%)
 Frame = -2

Query: 673 HSSGLLRPHFSFNPYSFSKKNPGKRKQIRPSRRKPKLRFPKSFPTPLLIDQTSYPRTKLQ 494
           HS+    P+FSF     + +N  K+K    + +  ++   +S    L+ +Q   P+TKLQ
Sbjct: 21  HSAKSTHPNFSF-----TNENTKKQKLHMENPKSRRISLQRSGQNQLVQNQKLPPKTKLQ 75

Query: 493 ALESVISKLETSIKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVS 314
           ALE+VI  LE ++K+   + D +IFA+LLETCF+LQ+ DH +R+H LIPEK+L KNVG+S
Sbjct: 76  ALETVIRNLEMTVKNGTDIYDPQIFASLLETCFQLQAIDHGVRVHELIPEKLLRKNVGIS 135

Query: 313 SKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEED 134
           SKLIRLYA  G  + AHQLFD+MP+RN SAFPWNS+I+GY EKGL+EDALA+YFQMVEE 
Sbjct: 136 SKLIRLYACSGQTQKAHQLFDKMPKRNTSAFPWNSIISGYAEKGLFEDALAMYFQMVEEG 195

Query: 133 VEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           VEPD +TFPR LKACGG+G I VG+E+HRHVIR G G +GF+LN
Sbjct: 196 VEPDCYTFPRALKACGGVGLIHVGEEVHRHVIRRGFGSNGFILN 239


>ref|XP_002263650.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270,
           chloroplastic [Vitis vinifera]
           gi|296084180|emb|CBI24568.3| unnamed protein product
           [Vitis vinifera]
          Length = 516

 Score =  259 bits (661), Expect = 8e-67
 Identities = 131/201 (65%), Positives = 158/201 (78%)
 Frame = -2

Query: 604 KRKQIRPSRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIKDEIYVNDTR 425
           K+KQ      KP L FPKS PTPLLI+      TKLQALE+++  L+ SI+D I V D +
Sbjct: 23  KQKQFHRDT-KPNLVFPKSSPTPLLINHKPRNHTKLQALEALLRDLQASIQDGITV-DAQ 80

Query: 424 IFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEM 245
           IF++LLETCF+LQ++DH IRIHRLIP  +L K+V +SSKL+RLYAS G +E AH+LFD+M
Sbjct: 81  IFSSLLETCFQLQAFDHGIRIHRLIPTSLLRKSVALSSKLLRLYASIGRIEEAHRLFDQM 140

Query: 244 PQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGV 65
            +RN SAF WNSLI+GY E GLYEDA+ALYFQM EE V PD+ TFPRVLKACGGIG I V
Sbjct: 141 SRRNRSAFAWNSLISGYAELGLYEDAMALYFQMEEEGVVPDRFTFPRVLKACGGIGSISV 200

Query: 64  GKEIHRHVIRCGLGDDGFVLN 2
           G+E+HRHV+RCG  DDGFVLN
Sbjct: 201 GEEVHRHVVRCGFADDGFVLN 221



 Score = 57.4 bits (137), Expect = 5e-06
 Identities = 35/103 (33%), Positives = 60/103 (58%)
 Frame = -2

Query: 367 RIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTE 188
           +IH  +  + +  N+ +++ LI LY++ G L+ A  LFD MP+R+  +  WNS+I+ +  
Sbjct: 301 QIHGWVLRRGVQWNLSIANSLIVLYSNHGKLDQACWLFDHMPERDVVS--WNSIISAH-R 357

Query: 187 KGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGK 59
           K L   A+  + +M + DV PD  TF  +L AC  +G +  G+
Sbjct: 358 KDL--KAITYFSRMQKADVLPDVVTFVSLLSACAHLGLVKDGE 398



 Score = 56.6 bits (135), Expect = 8e-06
 Identities = 39/144 (27%), Positives = 74/144 (51%)
 Frame = -2

Query: 457 IKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGH 278
           +++E  V D   F  +L+ C  + S      +HR +     A +  V + L+ +YA  G 
Sbjct: 173 MEEEGVVPDRFTFPRVLKACGGIGSISVGEEVHRHVVRCGFADDGFVLNALVDMYAKCGD 232

Query: 277 LELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVL 98
           +  A ++FD++  R++ +  WNS++ GY   GL   AL+++ +M++   EPD      V+
Sbjct: 233 IVKARKVFDKIVCRDSVS--WNSMLTGYIRHGLPLQALSIFRRMLQYGFEPDAVAISTVV 290

Query: 97  KACGGIGFIGVGKEIHRHVIRCGL 26
               G+  + +  +IH  V+R G+
Sbjct: 291 T---GVPSLKLAGQIHGWVLRRGV 311


>gb|EMJ20518.1| hypothetical protein PRUPE_ppa019039mg [Prunus persica]
          Length = 519

 Score =  256 bits (654), Expect = 5e-66
 Identities = 126/209 (60%), Positives = 164/209 (78%)
 Frame = -2

Query: 628 SFSKKNPGKRKQIRPSRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIKD 449
           S SKKN  K+KQ+  ++    L FPK+ PTPL+I    + +TKLQAL++V++ LE +I  
Sbjct: 20  SKSKKNK-KQKQVSQNQSNNSLSFPKTIPTPLIICHKPHSQTKLQALDAVVNDLEAAIGK 78

Query: 448 EIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLEL 269
            I V DT  FA+LLETC++ Q+ D+ +R+HRLIP  +L +NVG+SSKL+RLYAS G++E 
Sbjct: 79  GINV-DTETFASLLETCYQFQAMDYGLRVHRLIPRSVLRRNVGISSKLLRLYASHGYIEE 137

Query: 268 AHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKAC 89
           AHQ+FDEMP+R+ SAF WNSLI+GY E GLYEDA+ALYFQM EE VEPD+ TFPRVLKAC
Sbjct: 138 AHQVFDEMPKRDVSAFAWNSLISGYAELGLYEDAMALYFQMEEEGVEPDRFTFPRVLKAC 197

Query: 88  GGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           GGIGFI +G+ +HRH++R GL +D FVLN
Sbjct: 198 GGIGFIQIGEAVHRHIVRLGLLNDRFVLN 226


>ref|XP_004500471.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270,
           chloroplastic-like [Cicer arietinum]
          Length = 520

 Score =  250 bits (638), Expect = 4e-64
 Identities = 124/213 (58%), Positives = 160/213 (75%), Gaps = 4/213 (1%)
 Frame = -2

Query: 628 SFSKKNPGKR-KQIRP---SRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVISKLET 461
           S  K N  K+ K++R     RRK    +P+  PTPLLI Q  +P+TK Q LE V++ LE 
Sbjct: 16  SAKKSNDNKKLKKLRKWETQRRKNTFSYPQPNPTPLLIQQQPFPQTKFQVLEQVLNDLEG 75

Query: 460 SIKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDG 281
           SI+  I + DT I+A+LLETC+  Q+ +H IR+HRLIP  +L +NVG+SSKL+RLYAS G
Sbjct: 76  SIEKGITI-DTEIYASLLETCYRFQAINHGIRLHRLIPPTLLHRNVGISSKLVRLYASFG 134

Query: 280 HLELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRV 101
           H++ AH LFD+M +R+  AFPWNSLI+GY + GLY+DA+ALYFQMVEE VEPD  TFPRV
Sbjct: 135 HMDDAHDLFDQMTKRDMYAFPWNSLISGYAQLGLYDDAIALYFQMVEEGVEPDLFTFPRV 194

Query: 100 LKACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           LK CGGIG + VG+E+HRH++R G G+DGFVLN
Sbjct: 195 LKVCGGIGSVQVGEEVHRHIVRSGFGNDGFVLN 227



 Score = 60.1 bits (144), Expect = 7e-07
 Identities = 39/136 (28%), Positives = 70/136 (51%)
 Frame = -2

Query: 433 DTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLF 254
           D   F  +L+ C  + S      +HR I       +  V + L+ +Y+  G +  A ++F
Sbjct: 187 DLFTFPRVLKVCGGIGSVQVGEEVHRHIVRSGFGNDGFVLNALVDMYSKCGDIVKARKVF 246

Query: 253 DEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGF 74
           +++P R++ +  WNS++A Y   GL  +A+ ++ QM+ E   PD  +   +L    G+  
Sbjct: 247 NKIPFRDSVS--WNSMLAAYVHHGLEVEAINIFRQMLLEGKRPDFFSISVILT---GVSS 301

Query: 73  IGVGKEIHRHVIRCGL 26
           + VG +IH  VIR G+
Sbjct: 302 LDVGVQIHGWVIRRGV 317



 Score = 58.5 bits (140), Expect = 2e-06
 Identities = 31/113 (27%), Positives = 65/113 (57%)
 Frame = -2

Query: 391 LQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWN 212
           + S D  ++IH  +  + +  N+ +++ LI +Y++ G L+ A  +F+ MP+R+  +  WN
Sbjct: 299 VSSLDVGVQIHGWVIRRGVEWNLSIANSLIVVYSNHGRLDKARSIFNLMPERDVVS--WN 356

Query: 211 SLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEI 53
           S+I+ + +   + +A+  + +M E    PD+ TF  +L AC  +G +  G+ +
Sbjct: 357 SIISAHCK---HPEAIGYFEKMEEAGEVPDKITFVSLLSACAHLGLVKDGERL 406


>gb|AHB18407.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 522

 Score =  247 bits (630), Expect = 3e-63
 Identities = 127/226 (56%), Positives = 164/226 (72%), Gaps = 7/226 (3%)
 Frame = -2

Query: 658 LRPH-----FSFNPYSFSKKNPG-KRKQIRPSRRK-PKLRFPKSFPTPLLIDQTSYPRTK 500
           L+PH       F+  S  KKN   KR+Q   ++   P L FP+S PTPL I+   +P+TK
Sbjct: 7   LQPHTIPLTLHFHCSSKGKKNQKQKRRQFEHNKNTAPALPFPRSSPTPLFINNKPFPQTK 66

Query: 499 LQALESVISKLETSIKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVG 320
           LQA++S++  LE S+K  I + D+ IF++LLETC++L+S DH I IHRL+P+ +L KN G
Sbjct: 67  LQAVDSIVKDLEASVKKGIII-DSEIFSSLLETCYQLKSIDHGIAIHRLVPQNLLRKNTG 125

Query: 319 VSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVE 140
           +SSKL+RLYA+ G +E AHQ+FD+M +RN  AFPWNSLI+GY E G YEDALALYFQM E
Sbjct: 126 ISSKLLRLYATAGRMESAHQVFDQMSKRNEYAFPWNSLISGYAELGQYEDALALYFQMEE 185

Query: 139 EDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           E VEPD+ TFPR LKAC GIG I VG+ +HR V+R G G+D FVLN
Sbjct: 186 EGVEPDRFTFPRALKACAGIGSIHVGQAVHRDVVRKGFGNDVFVLN 231



 Score = 61.2 bits (147), Expect = 3e-07
 Identities = 43/145 (29%), Positives = 70/145 (48%), Gaps = 1/145 (0%)
 Frame = -2

Query: 433 DTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLF 254
           D   F   L+ C  + S      +HR +  K    +V V + LI +YA  G +  A ++F
Sbjct: 191 DRFTFPRALKACAGIGSIHVGQAVHRDVVRKGFGNDVFVLNALIDMYAKCGDIVKARRVF 250

Query: 253 DEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKA-CGGIG 77
           D +  ++  +  WNS++ GY   GL   AL ++  M++E  EPD  T   +L + C    
Sbjct: 251 DSIACKDNIS--WNSMLTGYIRHGLLAGALQVFRGMIQEGFEPDSVTISTILSSFCS--- 305

Query: 76  FIGVGKEIHRHVIRCGLGDDGFVLN 2
            +    +IH  V+R G+  D  V+N
Sbjct: 306 -LKTAAQIHGWVLRRGIEWDTSVVN 329



 Score = 57.0 bits (136), Expect = 6e-06
 Identities = 36/127 (28%), Positives = 68/127 (53%)
 Frame = -2

Query: 433 DTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLF 254
           D+   +T+L +   L++     +IH  +  + +  +  V + +I +Y++ G L+ A  LF
Sbjct: 292 DSVTISTILSSFCSLKT---AAQIHGWVLRRGIEWDTSVVNAMIVVYSNLGKLDGASWLF 348

Query: 253 DEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGF 74
             MP+R+  +  WNS+I+G+++     +AL  + QMV     PD  TF  +L AC  +G 
Sbjct: 349 QRMPERDIVS--WNSIISGHSKN---PEALLYFEQMVRSCTSPDSITFVAILSACAHLGL 403

Query: 73  IGVGKEI 53
           +  G+ +
Sbjct: 404 VKDGERL 410


>gb|EOY22925.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao]
          Length = 773

 Score =  247 bits (630), Expect = 3e-63
 Identities = 133/242 (54%), Positives = 173/242 (71%), Gaps = 10/242 (4%)
 Frame = -2

Query: 697 SALMNSSLHSSGLLRP-HFSFNPYSFS---------KKNPGKRKQIRPSRRKPKLRFPKS 548
           SAL+  SL    LL+P  F F  ++ S         K+   KRKQI  S+    L F KS
Sbjct: 245 SALL--SLTMVALLQPPSFHFVSWTLSCSSKSKKSEKQKQLKRKQIHQSK-STALPFRKS 301

Query: 547 FPTPLLIDQTSYPRTKLQALESVISKLETSIKDEIYVNDTRIFATLLETCFELQSYDHVI 368
            PTPLLI+   + +TKLQAL++V+  LE S+K+ + +  + IF++LLETC++L+S D  I
Sbjct: 302 SPTPLLINHKPFTQTKLQALDAVVKDLEASVKNGMNIT-SEIFSSLLETCYQLKSIDQGI 360

Query: 367 RIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTE 188
           +IH L+P+ +L KN G+SSKL+RLYAS GH+E AHQ+FDEM +RN SAFPWNSLI+GY E
Sbjct: 361 KIHNLVPKTLLRKNTGISSKLLRLYASCGHIESAHQVFDEMSKRNESAFPWNSLISGYAE 420

Query: 187 KGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCGLGDDGFV 8
            G YEDALA+YFQM EE VEPD++TFPR LKAC GIG I +G+ +HR V+R G G+DGFV
Sbjct: 421 LGQYEDALAIYFQMEEEGVEPDRYTFPRALKACAGIGLIQIGEAVHRDVVRKGFGNDGFV 480

Query: 7   LN 2
           LN
Sbjct: 481 LN 482



 Score = 60.5 bits (145), Expect = 5e-07
 Identities = 40/132 (30%), Positives = 71/132 (53%)
 Frame = -2

Query: 448 EIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLEL 269
           E Y  D    +T+L   + L+     ++IH  I  +    N+ V + LI +Y++ G L+ 
Sbjct: 538 EGYEPDPVAMSTILSGVWSLKI---ALQIHGWILRRGNEWNLSVVNALIVVYSNHGKLDR 594

Query: 268 AHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKAC 89
           A  LF  +P+ +  +  WNS+I+G++++    +AL  + QMV     PD  TF  +L AC
Sbjct: 595 ASWLFHRIPEPDVVS--WNSIISGHSKR---PEALVYFEQMVSGGTLPDSITFVAILSAC 649

Query: 88  GGIGFIGVGKEI 53
             +GF+  G+++
Sbjct: 650 AHLGFVRDGEQL 661


>ref|XP_002322407.2| hypothetical protein POPTR_0015s14630g [Populus trichocarpa]
           gi|550322722|gb|EEF06534.2| hypothetical protein
           POPTR_0015s14630g [Populus trichocarpa]
          Length = 529

 Score =  244 bits (624), Expect = 2e-62
 Identities = 121/205 (59%), Positives = 158/205 (77%)
 Frame = -2

Query: 616 KNPGKRKQIRPSRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIKDEIYV 437
           K   K+ Q + +  K  L FPKS PTPLLI+    P+T+L+ALE+VI  L++S++  I +
Sbjct: 35  KEKHKQTQFQKNLNKRTLSFPKSSPTPLLINHRPPPQTQLEALENVIKDLQSSMEKGIRI 94

Query: 436 NDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQL 257
            DT+IF++LLETC+ L + +  ++IHRLIP  +L +N G+SSKL+RLY+S G +E+AHQ+
Sbjct: 95  -DTQIFSSLLETCYRLNAIELGVKIHRLIPINLLRRNAGISSKLVRLYSSCGDVEVAHQV 153

Query: 256 FDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIG 77
           FDEM +R  SAFPWNSLIAGYTE GLYEDA+ALYFQM EE VEPDQ TFPRVLKACGGIG
Sbjct: 154 FDEMFKRGESAFPWNSLIAGYTESGLYEDAMALYFQMEEEGVEPDQFTFPRVLKACGGIG 213

Query: 76  FIGVGKEIHRHVIRCGLGDDGFVLN 2
            I +G+ +HR ++R G  +DGFVLN
Sbjct: 214 LIRIGEAVHRDLVRLGFVNDGFVLN 238



 Score = 56.6 bits (135), Expect = 8e-06
 Identities = 34/115 (29%), Positives = 66/115 (57%), Gaps = 2/115 (1%)
 Frame = -2

Query: 391 LQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWN 212
           + S++  ++IH  I  + +  +  +++ LI +Y++   L+ A  LFD MP+++  +  WN
Sbjct: 310 VSSFEVAVQIHGWIVRRGMEWDFSIANSLIAVYSNGRKLDRARWLFDHMPKKDIVS--WN 367

Query: 211 SLIAGYTEKGLYEDALAL-YFQMVEED-VEPDQHTFPRVLKACGGIGFIGVGKEI 53
           S+I+ +      +D  AL YF+++E D   PD+ TF  +L AC  +G +  G+ +
Sbjct: 368 SIISAHC-----KDLKALTYFELMERDGALPDKITFVSLLSACAHLGLVKDGERL 417


>ref|XP_004307818.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 522

 Score =  241 bits (616), Expect = 1e-61
 Identities = 123/210 (58%), Positives = 158/210 (75%), Gaps = 1/210 (0%)
 Frame = -2

Query: 628 SFSKKNPGKRKQIRPSRRKPKL-RFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIK 452
           S SK++  K KQ    ++  KL  F    PTPL++      +TKLQALE++I +LETS +
Sbjct: 21  SNSKRSKNKPKQQLHQKQSTKLLSFSNPTPTPLIVYHKPQTQTKLQALEAIIKELETSSE 80

Query: 451 DEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLE 272
           + I V DT  FA+LLETC++L + D+ +R+HRLIP  +L +NVG+SSKL+RLYAS G +E
Sbjct: 81  NGIDV-DTETFASLLETCYKLDAMDYCLRVHRLIPRNLLRRNVGLSSKLLRLYASCGFVE 139

Query: 271 LAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKA 92
            AHQ+FDEMP+R+ SAF WNSLI+GY E GLYEDA+ALYFQM EE VEPD+ TFPRVLKA
Sbjct: 140 EAHQVFDEMPKRDVSAFAWNSLISGYAELGLYEDAMALYFQMEEEGVEPDRFTFPRVLKA 199

Query: 91  CGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           CGGIGF+ VG+ +HRH++R G   D FVLN
Sbjct: 200 CGGIGFVQVGEAVHRHLVRLGFVGDRFVLN 229


>ref|XP_006490089.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270,
           chloroplastic-like [Citrus sinensis]
          Length = 526

 Score =  240 bits (612), Expect = 4e-61
 Identities = 125/227 (55%), Positives = 166/227 (73%)
 Frame = -2

Query: 682 SSLHSSGLLRPHFSFNPYSFSKKNPGKRKQIRPSRRKPKLRFPKSFPTPLLIDQTSYPRT 503
           SS H+S ++    S N  S  K+   K++QI  +R      +PKS PTPLL +Q ++P+T
Sbjct: 9   SSFHTSLVIIHCGSKNKRS-RKQRRQKQQQISRNRITTFSSYPKSSPTPLLTNQKAFPKT 67

Query: 502 KLQALESVISKLETSIKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNV 323
           KLQAL+S+I  LE+S+++ I V  T  FA+LLETC++L++ +H I++HRLIP  +L KN 
Sbjct: 68  KLQALDSIIQDLESSVQNGITVQ-TETFASLLETCYQLKAVEHGIKLHRLIPTNLLRKNK 126

Query: 322 GVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMV 143
           G+SSKL+RLYA+ G ++ AHQ+FD+M  R A AFPWNSLI+GY E G YEDA+ALYFQM 
Sbjct: 127 GISSKLLRLYATFGLIDEAHQVFDQMSNRTAFAFPWNSLISGYAELGEYEDAIALYFQME 186

Query: 142 EEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           EE VEPDQ TFPRVLKAC G+G I VG+++H   +R G G DGFVLN
Sbjct: 187 EEGVEPDQFTFPRVLKACAGLGLIRVGEKVHLDAVRFGFGFDGFVLN 233



 Score = 59.7 bits (143), Expect = 9e-07
 Identities = 33/105 (31%), Positives = 61/105 (58%)
 Frame = -2

Query: 367 RIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTE 188
           ++H  +  + +  ++ +++ LI +Y+ DG L+ A  LFD MPQ++  +  WNS+I  +++
Sbjct: 313 QVHGWVLRRGVEWDLCIANSLIVVYSKDGKLDQACWLFDHMPQKDVVS--WNSIIHAHSK 370

Query: 187 KGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEI 53
                +AL  + QM  + V PD  TF  +L AC  +G + VG+ +
Sbjct: 371 D---HEALIYFEQMERDGVLPDHLTFVSLLSACAHLGSVKVGERL 412


>ref|XP_006421716.1| hypothetical protein CICLE_v10004726mg [Citrus clementina]
           gi|557523589|gb|ESR34956.1| hypothetical protein
           CICLE_v10004726mg [Citrus clementina]
          Length = 526

 Score =  240 bits (612), Expect = 4e-61
 Identities = 125/227 (55%), Positives = 166/227 (73%)
 Frame = -2

Query: 682 SSLHSSGLLRPHFSFNPYSFSKKNPGKRKQIRPSRRKPKLRFPKSFPTPLLIDQTSYPRT 503
           SS H+S ++    S N  S  K+   K++QI  +R      +PKS PTPLL +Q ++P+T
Sbjct: 9   SSFHTSLVIIHCGSKNKRS-RKQRRQKQQQISRNRITTFSSYPKSSPTPLLTNQKAFPKT 67

Query: 502 KLQALESVISKLETSIKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNV 323
           KLQAL+S+I  LE+S+++ I V  T  FA+LLETC++L++ +H I++HRLIP  +L KN 
Sbjct: 68  KLQALDSIIQDLESSVQNGITVQ-TETFASLLETCYQLKAVEHGIKLHRLIPTNLLRKNK 126

Query: 322 GVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMV 143
           G+SSKL+RLYA+ G ++ AHQ+FD+M  R A AFPWNSLI+GY E G YEDA+ALYFQM 
Sbjct: 127 GISSKLLRLYATFGLIDEAHQVFDQMSNRTAFAFPWNSLISGYAELGEYEDAIALYFQME 186

Query: 142 EEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           EE VEPDQ TFPRVLKAC G+G I VG+++H   +R G G DGFVLN
Sbjct: 187 EEGVEPDQFTFPRVLKACAGLGLIRVGEKVHLDAVRFGFGFDGFVLN 233



 Score = 57.0 bits (136), Expect = 6e-06
 Identities = 32/106 (30%), Positives = 62/106 (58%), Gaps = 1/106 (0%)
 Frame = -2

Query: 367 RIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTE 188
           ++H  +  + +  ++ +++ LI +Y+ DG L+ A  LFD MPQ++  +  WNS+I  +++
Sbjct: 313 QVHGWVLRRGVEWDLCIANSLIVVYSKDGKLDQACWLFDHMPQKDVVS--WNSIIHAHSK 370

Query: 187 KGLYEDALALYFQMVEED-VEPDQHTFPRVLKACGGIGFIGVGKEI 53
               +  + +YF+ +E D V PD  TF  +L AC  +G +  G+ +
Sbjct: 371 ----DHEVLIYFEQMERDGVLPDHITFVSLLSACAHLGSVKDGERL 412


>ref|XP_004160887.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At4g25270, chloroplastic-like [Cucumis sativus]
          Length = 489

 Score =  239 bits (610), Expect = 6e-61
 Identities = 118/194 (60%), Positives = 150/194 (77%)
 Frame = -2

Query: 583 SRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIKDEIYVNDTRIFATLLE 404
           +++   L FPKS PTPLLI    + ++K+QAL++V++ LE SI + +++ D  IF++LLE
Sbjct: 4   AKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFI-DPEIFSSLLE 62

Query: 403 TCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASA 224
            C++LQ+  H IRIHRLIP  +L +NVG+SSKL+RLYAS G++E AHQ+FDEM  RN SA
Sbjct: 63  LCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSA 122

Query: 223 FPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRH 44
           F WNSLI+GY E GLYEDALALYFQM EE VEPD  TFPRVLKACGGIG I +G+ +HRH
Sbjct: 123 FAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRH 182

Query: 43  VIRCGLGDDGFVLN 2
           V+R G   D FVLN
Sbjct: 183 VVRSGFAGDVFVLN 196



 Score = 62.8 bits (151), Expect = 1e-07
 Identities = 40/136 (29%), Positives = 69/136 (50%)
 Frame = -2

Query: 433 DTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLF 254
           D   F  +L+ C  + S      +HR +     A +V V + L+ +Y+  G +  A ++F
Sbjct: 156 DNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVF 215

Query: 253 DEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGF 74
           D++  ++  +  WNS++ GYT  GL+ +AL ++ QM++E  EPD      +L     + F
Sbjct: 216 DQIEYKDIVS--WNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKF 273

Query: 73  IGVGKEIHRHVIRCGL 26
                 IH  VIR G+
Sbjct: 274 ---KLHIHGWVIRHGV 286



 Score = 57.8 bits (138), Expect = 3e-06
 Identities = 42/135 (31%), Positives = 69/135 (51%), Gaps = 2/135 (1%)
 Frame = -2

Query: 448 EIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLEL 269
           E Y  D+   +TLL     + S    + IH  +    +  N+ +++ LI +YA  G L  
Sbjct: 252 EGYEPDSVALSTLLSN---ISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNR 308

Query: 268 AHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALAL-YFQMVEE-DVEPDQHTFPRVLK 95
           A  LF +MPQ++  +  WNS+I+ +     +  A AL YF+++E   V PD  TF  +L 
Sbjct: 309 AKWLFQQMPQKDMVS--WNSIISAH-----FNSAEALTYFEVMESLGVSPDGVTFVSLLS 361

Query: 94  ACGGIGFIGVGKEIH 50
            C  +G +  G E++
Sbjct: 362 TCAHLGLVKEGXELY 376


>ref|XP_004148162.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270,
           chloroplastic-like [Cucumis sativus]
          Length = 489

 Score =  239 bits (610), Expect = 6e-61
 Identities = 118/194 (60%), Positives = 150/194 (77%)
 Frame = -2

Query: 583 SRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIKDEIYVNDTRIFATLLE 404
           +++   L FPKS PTPLLI    + ++K+QAL++V++ LE SI + +++ D  IF++LLE
Sbjct: 4   AKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFI-DPEIFSSLLE 62

Query: 403 TCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASA 224
            C++LQ+  H IRIHRLIP  +L +NVG+SSKL+RLYAS G++E AHQ+FDEM  RN SA
Sbjct: 63  LCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSA 122

Query: 223 FPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRH 44
           F WNSLI+GY E GLYEDALALYFQM EE VEPD  TFPRVLKACGGIG I +G+ +HRH
Sbjct: 123 FAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRH 182

Query: 43  VIRCGLGDDGFVLN 2
           V+R G   D FVLN
Sbjct: 183 VVRSGFAGDVFVLN 196



 Score = 62.8 bits (151), Expect = 1e-07
 Identities = 40/136 (29%), Positives = 69/136 (50%)
 Frame = -2

Query: 433 DTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLF 254
           D   F  +L+ C  + S      +HR +     A +V V + L+ +Y+  G +  A ++F
Sbjct: 156 DNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVF 215

Query: 253 DEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGF 74
           D++  ++  +  WNS++ GYT  GL+ +AL ++ QM++E  EPD      +L     + F
Sbjct: 216 DQIEYKDIVS--WNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKF 273

Query: 73  IGVGKEIHRHVIRCGL 26
                 IH  VIR G+
Sbjct: 274 ---KLHIHGWVIRHGV 286


>ref|XP_002510931.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223550046|gb|EEF51533.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 461

 Score =  237 bits (604), Expect = 3e-60
 Identities = 125/218 (57%), Positives = 155/218 (71%), Gaps = 8/218 (3%)
 Frame = -2

Query: 631 YSFSKKNPGKRKQIR--------PSRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVI 476
           Y+ S K   K++Q+          +R    L FP   PTPLLI+  +Y +TKLQAL+ VI
Sbjct: 19  YASSSKKRRKQRQLNRIQKNDFYQNRNANGLSFPVPSPTPLLINLNTYTQTKLQALDDVI 78

Query: 475 SKLETSIKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRL 296
             LE+SI   I + DT+I ++LLETC+ L S DH +RIHRLIP  IL KN GVSSKL+RL
Sbjct: 79  KDLESSIGKGIKI-DTQIISSLLETCYRLNSIDHGMRIHRLIPTSILRKNTGVSSKLLRL 137

Query: 295 YASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQH 116
           YAS G+++ AHQ+FDEM  R+ SAF WNSLIAGY+E GLYEDA+ALYFQM EE VEPD+ 
Sbjct: 138 YASCGYMDEAHQMFDEMSNRDESAFAWNSLIAGYSELGLYEDAIALYFQMDEEYVEPDEF 197

Query: 115 TFPRVLKACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           TFPRVLKACGG+G I VG+ +HR +IR G  +D F  N
Sbjct: 198 TFPRVLKACGGLGLIQVGEAVHRDLIRLGFANDRFASN 235


>ref|XP_003533519.1| PREDICTED: pentatricopeptide repeat-containing protein At4g25270,
           chloroplastic-like [Glycine max]
          Length = 526

 Score =  234 bits (597), Expect = 2e-59
 Identities = 114/201 (56%), Positives = 148/201 (73%)
 Frame = -2

Query: 604 KRKQIRPSRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIKDEIYVNDTR 425
           K++++    R+  L FPK   TPLLI    +P+TKL+ALE V+  LE S++  I + D  
Sbjct: 33  KQRRLESQERRNGLSFPKPKSTPLLIHHRPHPKTKLEALEQVVKDLEASVEKGIKI-DPE 91

Query: 424 IFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEM 245
           I+A+LLETC+  Q+  H IR+HRLIP  +L KNVG+SSKL+RLYAS G+L+ AH LFD+M
Sbjct: 92  IYASLLETCYRFQAILHGIRVHRLIPTSLLHKNVGISSKLLRLYASCGYLDDAHDLFDQM 151

Query: 244 PQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGV 65
            +R+ SAFPWNSLI+GY + G Y++A+ALYFQMVEE VE D  TFPRVLK C GIG + V
Sbjct: 152 AKRDTSAFPWNSLISGYAQVGHYDEAIALYFQMVEEGVEADLFTFPRVLKVCAGIGSVQV 211

Query: 64  GKEIHRHVIRCGLGDDGFVLN 2
           G+E+HRH IR G   DGF+LN
Sbjct: 212 GEEVHRHAIRAGFAADGFILN 232



 Score = 63.2 bits (152), Expect = 8e-08
 Identities = 46/178 (25%), Positives = 83/178 (46%)
 Frame = -2

Query: 562 RFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIKDEIYVNDTRIFATLLETCFELQS 383
           R   +FP   LI   +      +A+      +E  ++ +++      F  +L+ C  + S
Sbjct: 154 RDTSAFPWNSLISGYAQVGHYDEAIALYFQMVEEGVEADLFT-----FPRVLKVCAGIGS 208

Query: 382 YDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLI 203
                 +HR       A +  + + L+ +Y+  G +  A ++FD+MP R+  +  WNS++
Sbjct: 209 VQVGEEVHRHAIRAGFAADGFILNALVDMYSKCGDIVKARKVFDKMPHRDPVS--WNSML 266

Query: 202 AGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCG 29
             Y   GL   A+ ++ QM+ E  EPD  +   VL    G+  +G+G +IH  VI  G
Sbjct: 267 TAYVHHGLEVQAMNIFRQMLLEGCEPDSVSISTVLT---GVSSLGLGVQIHGWVISQG 321



 Score = 62.8 bits (151), Expect = 1e-07
 Identities = 35/114 (30%), Positives = 66/114 (57%)
 Frame = -2

Query: 370 IRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYT 191
           ++IH  +  +    N+ +++ LI +Y++ G LE A  +F+ MP+R+  +  WNS+I+ + 
Sbjct: 311 VQIHGWVISQGHEWNLSIANSLIMMYSNHGRLEKARWVFNLMPERDVVS--WNSIISAHC 368

Query: 190 EKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCG 29
           ++    +ALA + QM    V+PD+ TF  +L AC  +G +  G+ +    + CG
Sbjct: 369 KR---REALAFFEQMEGAGVQPDKITFVSILSACAYLGLLKDGERL--FALMCG 417


>gb|ACU21163.1| unknown [Glycine max]
          Length = 481

 Score =  234 bits (597), Expect = 2e-59
 Identities = 114/201 (56%), Positives = 148/201 (73%)
 Frame = -2

Query: 604 KRKQIRPSRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIKDEIYVNDTR 425
           K++++    R+  L FPK   TPLLI    +P+TKL+ALE V+  LE S++  I + D  
Sbjct: 33  KQRRLESQERRNGLSFPKPKSTPLLIHHRPHPKTKLEALEQVVKDLEASVEKGIKI-DPE 91

Query: 424 IFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEM 245
           I+A+LLETC+  Q+  H IR+HRLIP  +L KNVG+SSKL+RLYAS G+L+ AH LFD+M
Sbjct: 92  IYASLLETCYRFQAILHGIRVHRLIPTSLLHKNVGISSKLLRLYASCGYLDDAHDLFDQM 151

Query: 244 PQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGV 65
            +R+ SAFPWNSLI+GY + G Y++A+ALYFQMVEE VE D  TFPRVLK C GIG + V
Sbjct: 152 AKRDTSAFPWNSLISGYAQVGHYDEAIALYFQMVEEGVEADLFTFPRVLKVCAGIGSVQV 211

Query: 64  GKEIHRHVIRCGLGDDGFVLN 2
           G+E+HRH IR G   DGF+LN
Sbjct: 212 GEEVHRHAIRAGFAADGFILN 232



 Score = 63.2 bits (152), Expect = 8e-08
 Identities = 46/178 (25%), Positives = 83/178 (46%)
 Frame = -2

Query: 562 RFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIKDEIYVNDTRIFATLLETCFELQS 383
           R   +FP   LI   +      +A+      +E  ++ +++      F  +L+ C  + S
Sbjct: 154 RDTSAFPWNSLISGYAQVGHYDEAIALYFQMVEEGVEADLFT-----FPRVLKVCAGIGS 208

Query: 382 YDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLI 203
                 +HR       A +  + + L+ +Y+  G +  A ++FD+MP R+  +  WNS++
Sbjct: 209 VQVGEEVHRHAIRAGFAADGFILNALVDMYSKCGDIVKARKVFDKMPHRDPVS--WNSML 266

Query: 202 AGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCG 29
             Y   GL   A+ ++ QM+ E  EPD  +   VL    G+  +G+G +IH  VI  G
Sbjct: 267 TAYVHHGLEVQAMNIFRQMLLEGCEPDSVSISTVLT---GVSSLGLGVQIHGWVISQG 321



 Score = 62.8 bits (151), Expect = 1e-07
 Identities = 35/114 (30%), Positives = 66/114 (57%)
 Frame = -2

Query: 370 IRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYT 191
           ++IH  +  +    N+ +++ LI +Y++ G LE A  +F+ MP+R+  +  WNS+I+ + 
Sbjct: 311 VQIHGWVISQGHEWNLSIANSLIMMYSNHGRLEKARWVFNLMPERDVVS--WNSIISAHC 368

Query: 190 EKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCG 29
           ++    +ALA + QM    V+PD+ TF  +L AC  +G +  G+ +    + CG
Sbjct: 369 KR---REALAFFEQMEGAGVQPDKITFVSILSACAYLGLLKDGERL--FALMCG 417


>ref|NP_194257.1| pentatricopeptide repeat protein OTP70 [Arabidopsis thaliana]
           gi|75265547|sp|Q9SB36.1|PP337_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g25270, chloroplastic; Flags: Precursor
           gi|4454015|emb|CAA23068.1| putative protein [Arabidopsis
           thaliana] gi|7269378|emb|CAB81338.1| putative protein
           [Arabidopsis thaliana] gi|332659633|gb|AEE85033.1|
           pentatricopeptide repeat protein OTP70 [Arabidopsis
           thaliana]
          Length = 527

 Score =  231 bits (590), Expect = 1e-58
 Identities = 117/227 (51%), Positives = 161/227 (70%), Gaps = 9/227 (3%)
 Frame = -2

Query: 655 RPHFSFNPYSFS--KKNPGKRKQIRPSRRKP-------KLRFPKSFPTPLLIDQTSYPRT 503
           +P FS+   S S  KK P   +Q++  R+          L F K  PTPLLI++ S  RT
Sbjct: 8   KPSFSYPSVSSSSMKKKPRHHQQLKQHRQNQYNNNGFTSLSFTKPSPTPLLIEKQSIHRT 67

Query: 502 KLQALESVISKLETSIKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNV 323
           +L+AL+SVI+ LETS +  I + +  IFA+LLETC+ L++ DH +R+H LIP  +L  N+
Sbjct: 68  QLEALDSVITDLETSAQKGISLTEPEIFASLLETCYSLRAIDHGVRVHHLIPPYLLRNNL 127

Query: 322 GVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMV 143
           G+SSKL+RLYAS G+ E+AH++FD M +R++S F WNSLI+GY E G YEDA+ALYFQM 
Sbjct: 128 GISSKLVRLYASCGYAEVAHEVFDRMSKRDSSPFAWNSLISGYAELGQYEDAMALYFQMA 187

Query: 142 EEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           E+ V+PD+ TFPRVLKACGGIG + +G+ IHR +++ G G D +VLN
Sbjct: 188 EDGVKPDRFTFPRVLKACGGIGSVQIGEAIHRDLVKEGFGYDVYVLN 234



 Score = 60.8 bits (146), Expect = 4e-07
 Identities = 40/136 (29%), Positives = 69/136 (50%)
 Frame = -2

Query: 433 DTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLF 254
           D   F  +L+ C  + S      IHR + ++    +V V + L+ +YA  G +  A  +F
Sbjct: 194 DRFTFPRVLKACGGIGSVQIGEAIHRDLVKEGFGYDVYVLNALVVMYAKCGDIVKARNVF 253

Query: 253 DEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGF 74
           D +P ++  +  WNS++ GY   GL  +AL ++  MV+  +EPD+     VL     +  
Sbjct: 254 DMIPHKDYVS--WNSMLTGYLHHGLLHEALDIFRLMVQNGIEPDKVAISSVL---ARVLS 308

Query: 73  IGVGKEIHRHVIRCGL 26
              G+++H  VIR G+
Sbjct: 309 FKHGRQLHGWVIRRGM 324


>ref|XP_006413323.1| hypothetical protein EUTSA_v10024921mg [Eutrema salsugineum]
           gi|557114493|gb|ESQ54776.1| hypothetical protein
           EUTSA_v10024921mg [Eutrema salsugineum]
          Length = 523

 Score =  230 bits (587), Expect = 3e-58
 Identities = 119/227 (52%), Positives = 161/227 (70%), Gaps = 7/227 (3%)
 Frame = -2

Query: 661 LLRPHFSFNPYSFS--KKNPG-----KRKQIRPSRRKPKLRFPKSFPTPLLIDQTSYPRT 503
           +++P F +   S S  KK P      K+KQI+ +     L F K  P P+LI + S  RT
Sbjct: 4   IVQPSFCYPSVSSSSMKKKPRHYEQLKQKQIQDNNGFTSLSFTKPSPIPILIGKQSIHRT 63

Query: 502 KLQALESVISKLETSIKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNV 323
           +L+AL SVI+ LETS +  I +++  IFA+LLETC+ L++ D  +R+HRLIP  +L  N+
Sbjct: 64  QLEALSSVITDLETSARKGITISEPEIFASLLETCYSLRAIDLGVRVHRLIPVHLLRNNL 123

Query: 322 GVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMV 143
           G+SSKL+RLYAS G+ E+AH++FD M +R +SAF WNSLI+GY E G YEDA+ALYFQM 
Sbjct: 124 GISSKLVRLYASCGYAEVAHEVFDRMSKRKSSAFAWNSLISGYAESGQYEDAMALYFQMA 183

Query: 142 EEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           EE V+PD+ TFPRVLKACGGIG I +G+ IHR +++ G G D +VLN
Sbjct: 184 EEGVKPDRFTFPRVLKACGGIGSIQIGEAIHRDLVKQGYGYDVYVLN 230



 Score = 57.4 bits (137), Expect = 5e-06
 Identities = 39/144 (27%), Positives = 70/144 (48%)
 Frame = -2

Query: 433 DTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLF 254
           D   F  +L+ C  + S      IHR + ++    +V V + L+ +YA  G +     +F
Sbjct: 190 DRFTFPRVLKACGGIGSIQIGEAIHRDLVKQGYGYDVYVLNALVDMYAKCGDIVKGRNVF 249

Query: 253 DEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGF 74
           D +P  N     WNS++  Y   GL ++A+ ++  MV++ +EPD+     VL     +  
Sbjct: 250 DMIP--NKDYVSWNSMLTSYLHHGLLQEAMHIFRLMVQDGIEPDKVAISSVL---ARVLS 304

Query: 73  IGVGKEIHRHVIRCGLGDDGFVLN 2
              G+++H   IR G+  +  V+N
Sbjct: 305 FKHGRQLHGWAIRRGMECELSVVN 328


>ref|XP_006282784.1| hypothetical protein CARUB_v10006372mg, partial [Capsella rubella]
           gi|482551489|gb|EOA15682.1| hypothetical protein
           CARUB_v10006372mg, partial [Capsella rubella]
          Length = 533

 Score =  229 bits (585), Expect = 5e-58
 Identities = 118/227 (51%), Positives = 156/227 (68%), Gaps = 9/227 (3%)
 Frame = -2

Query: 655 RPHFSFNPYSFS--KKNPGKRKQIRPSRRKP-------KLRFPKSFPTPLLIDQTSYPRT 503
           +P FS+   S S  KK P   +Q++  R+          L F K  PTP+LI + S  RT
Sbjct: 14  KPSFSYPSVSSSSMKKKPRHHQQLKQQRQNQYDNNGFTSLSFTKPSPTPILIGKQSIHRT 73

Query: 502 KLQALESVISKLETSIKDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNV 323
           KL+AL+SVI+ LETS +  I  ++  IFA+LLETC+ L++ DH +R+H LIP  +L  N+
Sbjct: 74  KLEALDSVITDLETSAQKGISFSEPEIFASLLETCYSLRAIDHGVRVHHLIPPYLLRNNL 133

Query: 322 GVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMV 143
           G+SSKL+RLYAS G+ E+AH++FD M +R  S F WNSLI+GY E G YEDALALYFQM 
Sbjct: 134 GISSKLVRLYASCGYTEVAHEVFDRMSKRELSPFAWNSLISGYAELGQYEDALALYFQMA 193

Query: 142 EEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
           E+ V+PD+ TFPRVLKAC GIG I +G  IHR +++ G G D +VLN
Sbjct: 194 EDGVKPDRFTFPRVLKACAGIGSIQIGDAIHRDLVKAGFGYDVYVLN 240



 Score = 58.9 bits (141), Expect = 2e-06
 Identities = 39/136 (28%), Positives = 68/136 (50%)
 Frame = -2

Query: 433 DTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLF 254
           D   F  +L+ C  + S      IHR + +     +V V + L+ +YA  G +     +F
Sbjct: 200 DRFTFPRVLKACAGIGSIQIGDAIHRDLVKAGFGYDVYVLNALVDMYAKCGDIVKGRNVF 259

Query: 253 DEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGF 74
           D +P ++  +  WNS++ GY   GL  +AL ++  MV++ +EPD+     VL     +  
Sbjct: 260 DMIPHKDYVS--WNSMLTGYLHHGLLLEALDIFRLMVQDGIEPDKVAISSVL---ARVLS 314

Query: 73  IGVGKEIHRHVIRCGL 26
              G+++H  VIR G+
Sbjct: 315 FKHGRQLHGWVIRRGI 330


>gb|ESW03326.1| hypothetical protein PHAVU_011G004900g [Phaseolus vulgaris]
          Length = 522

 Score =  228 bits (580), Expect = 2e-57
 Identities = 116/211 (54%), Positives = 151/211 (71%), Gaps = 5/211 (2%)
 Frame = -2

Query: 619 KKNPGKRKQ-----IRPSRRKPKLRFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSI 455
           KKN  K K      +    R+  LRFPK   TPLLI +   P+T+ +ALE VI+ LE S+
Sbjct: 19  KKNKNKNKTWKERCLERQERRNVLRFPKPKSTPLLIHRRPPPQTQSEALEQVITDLEDSL 78

Query: 454 KDEIYVNDTRIFATLLETCFELQSYDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHL 275
           +  I + D  I+A+LLE C+ LQ+    IR+HRLIP  +L +N G+SSKL+RLYA+ G +
Sbjct: 79  EKGIRI-DPEIYASLLEICYRLQAIRPGIRLHRLIPTSLLHRNFGISSKLLRLYAACGLV 137

Query: 274 ELAHQLFDEMPQRNASAFPWNSLIAGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLK 95
           + AH+LFD+M +R+ SAFPWNSLI+GY + GLY+DA+ALYFQMVEE VEPD  TFPRVLK
Sbjct: 138 DDAHELFDQMAKRDTSAFPWNSLISGYAQMGLYDDAIALYFQMVEEGVEPDLFTFPRVLK 197

Query: 94  ACGGIGFIGVGKEIHRHVIRCGLGDDGFVLN 2
            C GIG + VG+E+HRH++R G   DGFVLN
Sbjct: 198 VCAGIGSVRVGEEVHRHLVRAGFATDGFVLN 228



 Score = 67.8 bits (164), Expect = 3e-09
 Identities = 36/106 (33%), Positives = 64/106 (60%)
 Frame = -2

Query: 370 IRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLIAGYT 191
           ++IH  +  + L  N+ +++ L+ +Y+S G LE A  +F+ MP+R+  +  WNS+I+ + 
Sbjct: 307 VQIHGWVIRRGLDWNLSIANSLMVMYSSHGRLEKARWIFNLMPERDVVS--WNSIISAHC 364

Query: 190 EKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEI 53
           ++    +AL  + QM E  VEPD+ TF  VL AC  +G +  G+ +
Sbjct: 365 KR---REALEFFEQMEEAGVEPDKITFVSVLSACAYLGLVKEGERV 407



 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 46/179 (25%), Positives = 84/179 (46%)
 Frame = -2

Query: 562 RFPKSFPTPLLIDQTSYPRTKLQALESVISKLETSIKDEIYVNDTRIFATLLETCFELQS 383
           R   +FP   LI   +       A+      +E  ++ +++      F  +L+ C  + S
Sbjct: 150 RDTSAFPWNSLISGYAQMGLYDDAIALYFQMVEEGVEPDLFT-----FPRVLKVCAGIGS 204

Query: 382 YDHVIRIHRLIPEKILAKNVGVSSKLIRLYASDGHLELAHQLFDEMPQRNASAFPWNSLI 203
                 +HR +     A +  V + L+ +Y+  G +  A ++FD+MP R++ +  WNS++
Sbjct: 205 VRVGEEVHRHLVRAGFATDGFVLNALVDMYSKCGDIVKAQKIFDKMPHRDSIS--WNSML 262

Query: 202 AGYTEKGLYEDALALYFQMVEEDVEPDQHTFPRVLKACGGIGFIGVGKEIHRHVIRCGL 26
             Y   GL   A+ ++ QM+ +  EPD  +   +L    G+    +G +IH  VIR GL
Sbjct: 263 TAYVHHGLEVGAVNIFRQMILDGCEPDSVSVSTILT---GVSSPCLGVQIHGWVIRRGL 318

