BLASTX nr result

ID: Akebia24_contig00015815 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00015815
         (752 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278719.1| PREDICTED: pentatricopeptide repeat-containi...   278   1e-72
gb|EXB31933.1| Pentatricopeptide repeat-containing protein [Moru...   261   2e-67
ref|XP_007032614.1| Pentatricopeptide repeat (PPR) superfamily p...   260   4e-67
ref|XP_004306124.1| PREDICTED: pentatricopeptide repeat-containi...   260   4e-67
ref|XP_002530010.1| pentatricopeptide repeat-containing protein,...   257   3e-66
ref|XP_007217570.1| hypothetical protein PRUPE_ppa020933mg [Prun...   255   1e-65
ref|XP_004242297.1| PREDICTED: pentatricopeptide repeat-containi...   247   3e-63
ref|XP_002324074.2| pentatricopeptide repeat-containing family p...   243   5e-62
gb|EYU44138.1| hypothetical protein MIMGU_mgv1a003617mg [Mimulus...   225   1e-56
ref|XP_006400058.1| hypothetical protein EUTSA_v10012971mg [Eutr...   224   2e-56
ref|XP_004489421.1| PREDICTED: pentatricopeptide repeat-containi...   223   6e-56
ref|XP_006290195.1| hypothetical protein CARUB_v10003880mg [Caps...   223   6e-56
ref|NP_197038.1| pentatricopeptide repeat-containing protein [Ar...   221   2e-55
ref|XP_002873720.1| pentatricopeptide repeat-containing protein ...   218   2e-54
ref|XP_007151258.1| hypothetical protein PHAVU_004G031500g [Phas...   217   4e-54
gb|EPS65080.1| hypothetical protein M569_09698 [Genlisea aurea]       214   3e-53
ref|XP_006603878.1| PREDICTED: pentatricopeptide repeat-containi...   205   1e-50
ref|XP_003618546.1| Pentatricopeptide repeat-containing protein ...   205   1e-50
gb|EPS72347.1| hypothetical protein M569_02407, partial [Genlise...   149   8e-34
ref|XP_002282912.2| PREDICTED: pentatricopeptide repeat-containi...   146   7e-33

>ref|XP_002278719.1| PREDICTED: pentatricopeptide repeat-containing protein At5g15340,
           mitochondrial-like [Vitis vinifera]
          Length = 632

 Score =  278 bits (711), Expect = 1e-72
 Identities = 136/249 (54%), Positives = 175/249 (70%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G++ HAT++  G+A +P  F+HNALL  YA CG +  A KVFD+IP++HKDTVDWTTLM 
Sbjct: 32  GERLHATIITTGIAGAPETFLHNALLQFYASCGCAWQARKVFDEIPHSHKDTVDWTTLMG 91

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
           C+ R+   + AL +F  M+  G+ PDEVTLVC    C+ L DVV GAQGH C++K GL  
Sbjct: 92  CFVRHNVSDEALLIFVEMRRCGVKPDEVTLVCLFGGCARLGDVVVGAQGHGCMVKMGLGG 151

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
              A NA MDMYAK G MG+ARRVF E+   +VVSWTVIL GVI+ EGV+NGR  FDEM 
Sbjct: 152 VEKACNAVMDMYAKSGLMGEARRVFYEMKGQSVVSWTVILDGVIRSEGVRNGRVVFDEMP 211

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
           +RNEVAWT+M+AGY++SG + E+F L+  M+   L++ LNYV            GD++MG
Sbjct: 212 ERNEVAWTIMIAGYLDSGLTQESFALVREMIFD-LEMELNYVTLCSILTACSQSGDLMMG 270

Query: 721 RWLHVYALK 747
           RW+H YALK
Sbjct: 271 RWVHAYALK 279



 Score = 71.2 bits (173), Expect = 4e-10
 Identities = 52/179 (29%), Positives = 88/179 (49%), Gaps = 5/179 (2%)
 Frame = +1

Query: 121 VFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQGE-GLCPDEVTLVCFLTACSW 297
           VFD++P   ++ V WT ++A Y  +G    +  L R M  +  +  + VTL   LTACS 
Sbjct: 206 VFDEMPE--RNEVAWTIMIAGYLDSGLTQESFALVREMIFDLEMELNYVTLCSILTACSQ 263

Query: 298 LRDVVSGAQGHLCLIK-RGLLSTVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTV 474
             D++ G   H   +K +     +    A +DMYAKCGR+  A + F ++ + NVVSW  
Sbjct: 264 SGDLMMGRWVHAYALKTKEKELNIMVGTAMVDMYAKCGRIHIAFKFFKKMPQRNVVSWNA 323

Query: 475 ILTGVIKWEGVQNGRFFFDEM---SDRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSY 642
           +L+G+      +     F +M   +  ++V +T +++    SG   +     GN+   Y
Sbjct: 324 MLSGLAMHGLGRAALDIFPQMFKEAKPDDVTFTSVLSACSHSGLVDQGCFYFGNLESVY 382


>gb|EXB31933.1| Pentatricopeptide repeat-containing protein [Morus notabilis]
          Length = 753

 Score =  261 bits (667), Expect = 2e-67
 Identities = 130/249 (52%), Positives = 165/249 (66%)
 Frame = +1

Query: 1    GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
            G+K HA L+ +GL ASP+AF+ NALLH YA CG    A K+FD+IPN+HKD  DWT L+ 
Sbjct: 260  GKKLHAVLITSGLVASPDAFLRNALLHFYAACGSISFARKLFDEIPNSHKDAADWTALVG 319

Query: 181  CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
            C+AR+G P N L LF  M  EGL  D+V LVCF  AC+ L +   G QGH  L K GL +
Sbjct: 320  CFARHGIPKNGLRLFVEMIREGLRADDVALVCFFNACARLGNAEVGLQGHGVLEKMGLGA 379

Query: 361  TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
            +V   NA MDMY KCG + +ARRVF+ + E +VVSWTVIL G +  EG+++GR  FD M 
Sbjct: 380  SVKVCNAVMDMYVKCGMLREARRVFERMEERSVVSWTVILDGAVNLEGMKSGRVVFDGMP 439

Query: 541  DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
            +RNEVAWT+MV GY+ +GF+ E   LL  MV     + LN+V            GD+LMG
Sbjct: 440  ERNEVAWTIMVVGYVANGFNREGLSLLCEMVFG-CGLKLNHVTLCSVLCASAQSGDLLMG 498

Query: 721  RWLHVYALK 747
            RW+H+YALK
Sbjct: 499  RWVHIYALK 507



 Score = 74.7 bits (182), Expect = 3e-11
 Identities = 57/194 (29%), Positives = 86/194 (44%), Gaps = 31/194 (15%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNT------------ 144
           G + H  L K GL AS    + NA++ MY  CG    A +VF+++               
Sbjct: 365 GLQGHGVLEKMGLGASVK--VCNAVMDMYVKCGMLREARRVFERMEERSVVSWTVILDGA 422

Query: 145 -----------------HKDTVDWTTLMACYARNGFPNNALHLFRSMQ-GEGLCPDEVTL 270
                             ++ V WT ++  Y  NGF    L L   M  G GL  + VTL
Sbjct: 423 VNLEGMKSGRVVFDGMPERNEVAWTIMVVGYVANGFNREGLSLLCEMVFGCGLKLNHVTL 482

Query: 271 VCFLTACSWLRDVVSGAQGHLCLIKRGLLST-VTASNAAMDMYAKCGRMGDARRVFDEVS 447
              L A +   D++ G   H+  +K       +    A +DMYAKCGR+  A +VF+++ 
Sbjct: 483 CSVLCASAQSGDLLMGRWVHIYALKMTERKIDIMVDTALVDMYAKCGRIDTAMKVFEQMP 542

Query: 448 EPNVVSWTVILTGV 489
             NVV+W  +L+G+
Sbjct: 543 LRNVVAWNAMLSGL 556


>ref|XP_007032614.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma
           cacao] gi|508711643|gb|EOY03540.1| Pentatricopeptide
           repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 633

 Score =  260 bits (664), Expect = 4e-67
 Identities = 124/249 (49%), Positives = 167/249 (67%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G+K HA +L  G+    N+F+ NALLH+YA CG + +A K+FD+IP + KDT DWT LM+
Sbjct: 33  GKKLHALVLTTGVYRIRNSFLLNALLHLYASCGDTPAAHKLFDEIPPSSKDTADWTALMS 92

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
            ++R+  P +ALHLF  M+G  +  D+V +VC   AC+WLRDV  G+Q H C++K G   
Sbjct: 93  SFSRDNMPLDALHLFAQMRGNSMEIDDVVMVCLFCACAWLRDVGVGSQVHGCVVKTGFQG 152

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
            V   NA MDMY KCG +G+ R+VF ++ E +VVSWTV+L GV+KWEGV++GR  FDEM 
Sbjct: 153 RVKVCNAVMDMYGKCGMVGEMRKVFGDMKEKSVVSWTVLLDGVLKWEGVRSGRVVFDEMP 212

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
           +RNEVAWT+M+ GY+ SGF  E F LL  M+  +    LN+V            GD+LMG
Sbjct: 213 ERNEVAWTIMIVGYMGSGFCREGFSLLSEMM-FHWGFKLNHVTLCSLLSACAQSGDVLMG 271

Query: 721 RWLHVYALK 747
            W+HVY LK
Sbjct: 272 GWVHVYGLK 280



 Score = 69.3 bits (168), Expect = 1e-09
 Identities = 41/125 (32%), Positives = 67/125 (53%), Gaps = 2/125 (1%)
 Frame = +1

Query: 121 VFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQGE-GLCPDEVTLVCFLTACSW 297
           VFD++P   ++ V WT ++  Y  +GF      L   M    G   + VTL   L+AC+ 
Sbjct: 207 VFDEMPE--RNEVAWTIMIVGYMGSGFCREGFSLLSEMMFHWGFKLNHVTLCSLLSACAQ 264

Query: 298 LRDVVSGAQGHLCLIKR-GLLSTVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTV 474
             DV+ G   H+  +K  G+   +    A +DMY+KCGR+  A +VF+ +   N+V+W  
Sbjct: 265 SGDVLMGGWVHVYGLKMMGMEMDIMVGTALVDMYSKCGRVDTAVKVFECMPRRNLVAWNA 324

Query: 475 ILTGV 489
           +L+G+
Sbjct: 325 MLSGL 329


>ref|XP_004306124.1| PREDICTED: pentatricopeptide repeat-containing protein At5g15340,
           mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 634

 Score =  260 bits (664), Expect = 4e-67
 Identities = 129/249 (51%), Positives = 165/249 (66%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G+K HA ++  GLAA  ++FI+NALLH +A CG  + A KVFD+IP +HKDTVDWT LM 
Sbjct: 33  GKKLHAAIITTGLAALADSFIYNALLHFHAACGSPVHARKVFDEIPKSHKDTVDWTILMG 92

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
           C AR+G   N L +F  M+ EG+  D++ +VC    C+ L +V  G QGH  ++K GL S
Sbjct: 93  CLARHGMHRNGLDVFVEMRREGVRVDDIAVVCLFGGCARLGEVEIGVQGHGLMVKVGLSS 152

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
           +V A NAAMDMY KCG +G ARRVF+E+ E +VVSWTVIL GV++WEGV +GR  FD M 
Sbjct: 153 SVKACNAAMDMYVKCGELGMARRVFEEMGERSVVSWTVILDGVVRWEGVGSGRVVFDAMP 212

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
           +RNEVAWTVM+ GY+  G   E F LL  MV     + LNYV            GD++ G
Sbjct: 213 ERNEVAWTVMIVGYVSVGLVREGFALLKEMVFG-CALGLNYVTLCSFLSACAQSGDVVTG 271

Query: 721 RWLHVYALK 747
            W+HVYA K
Sbjct: 272 SWVHVYACK 280



 Score = 78.2 bits (191), Expect = 3e-12
 Identities = 58/194 (29%), Positives = 88/194 (45%), Gaps = 31/194 (15%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNT------------ 144
           G + H  ++K GL++S  A   NA + MY  CG    A +VF+++               
Sbjct: 138 GVQGHGLMVKVGLSSSVKAC--NAAMDMYVKCGELGMARRVFEEMGERSVVSWTVILDGV 195

Query: 145 -----------------HKDTVDWTTLMACYARNGFPNNALHLFRSMQ-GEGLCPDEVTL 270
                             ++ V WT ++  Y   G       L + M  G  L  + VTL
Sbjct: 196 VRWEGVGSGRVVFDAMPERNEVAWTVMIVGYVSVGLVREGFALLKEMVFGCALGLNYVTL 255

Query: 271 VCFLTACSWLRDVVSGAQGHLCLIKR-GLLSTVTASNAAMDMYAKCGRMGDARRVFDEVS 447
             FL+AC+   DVV+G+  H+   K  G    V    A +DMYAKCGR+  A +VF ++ 
Sbjct: 256 CSFLSACAQSGDVVTGSWVHVYACKTMGSEMDVMVGTALVDMYAKCGRVDTALKVFQQMK 315

Query: 448 EPNVVSWTVILTGV 489
             NVV+W  +L+G+
Sbjct: 316 YRNVVTWNAVLSGL 329


>ref|XP_002530010.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223530489|gb|EEF32372.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 487

 Score =  257 bits (656), Expect = 3e-66
 Identities = 122/251 (48%), Positives = 168/251 (66%), Gaps = 1/251 (0%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G+K HA LL  G+A SPNAF+ NALLH+Y+ CG +  A  +FDQIPN+HKDT DWT+L++
Sbjct: 43  GKKLHAILLTTGVATSPNAFLLNALLHLYSQCGITRYAHHLFDQIPNSHKDTADWTSLLS 102

Query: 181 CYARN-GFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLL 357
           C A++   P NA  LF  M+  G+  D+V  VC  + C+ + ++  G Q H C++K G  
Sbjct: 103 CLAKHTSTPRNAFSLFEEMRKRGVILDDVAFVCVFSLCARVGNLEMGRQAHGCVVKMGFG 162

Query: 358 STVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEM 537
             V   NA M++Y KC  MG+A+ VF E+ E ++VSWT +L GV+ WEGV+NG+  FD+M
Sbjct: 163 INVKVCNAVMNVYVKCRLMGEAKGVFSEMGERDIVSWTALLEGVVNWEGVENGKVVFDQM 222

Query: 538 SDRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLM 717
            +RNEV WT+M++GY+ SGF  E FLLL  MV   L++ LNYV            GD++M
Sbjct: 223 PERNEVGWTIMISGYVGSGFCKEGFLLLSEMVLG-LRLELNYVTLCSILSACAQSGDVVM 281

Query: 718 GRWLHVYALKR 750
           GRW+HVYALK+
Sbjct: 282 GRWVHVYALKK 292



 Score = 70.5 bits (171), Expect = 6e-10
 Identities = 47/125 (37%), Positives = 66/125 (52%), Gaps = 2/125 (1%)
 Frame = +1

Query: 121 VFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQ-GEGLCPDEVTLVCFLTACSW 297
           VFDQ+P   ++ V WT +++ Y  +GF      L   M  G  L  + VTL   L+AC+ 
Sbjct: 218 VFDQMPE--RNEVGWTIMISGYVGSGFCKEGFLLLSEMVLGLRLELNYVTLCSILSACAQ 275

Query: 298 LRDVVSGAQGHLCLIKR-GLLSTVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTV 474
             DVV G   H+  +K+ G    +    A +DMYAKCGR+  A  VF  +   NVV+W  
Sbjct: 276 SGDVVMGRWVHVYALKKMGREIDMMVGTALIDMYAKCGRIKMAYEVFKYLPRRNVVAWNA 335

Query: 475 ILTGV 489
           IL G+
Sbjct: 336 ILGGL 340


>ref|XP_007217570.1| hypothetical protein PRUPE_ppa020933mg [Prunus persica]
           gi|462413720|gb|EMJ18769.1| hypothetical protein
           PRUPE_ppa020933mg [Prunus persica]
          Length = 633

 Score =  255 bits (651), Expect = 1e-65
 Identities = 127/249 (51%), Positives = 164/249 (65%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G+K HA ++  GLAA P++F+HNALLH+YA  G   SA K+FD+IPN+HKD VDWT LM 
Sbjct: 33  GKKLHAAIITGGLAAMPDSFLHNALLHLYAAHGSVCSARKLFDEIPNSHKDAVDWTVLMG 92

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
           C++R+G P + L LF  M+ E +  D+V + C   AC+ L +V  G QGH  ++K GL S
Sbjct: 93  CFSRHGMPQSGLRLFVEMRRENVRVDDVAMACLFNACARLGNVEIGEQGHGFVMKVGLGS 152

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
           +V A N  MDMY KCG +G ARRVF+E+ E +VVSWTVIL GV+K EGV +GR  FD M 
Sbjct: 153 SVKACNGVMDMYVKCGLLGMARRVFEEMGERSVVSWTVILDGVVKLEGVGSGRRVFDNMP 212

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
           +RNEVAWT+M+ GY+  G   E F LL  MV     + LNYV            GD + G
Sbjct: 213 ERNEVAWTIMIVGYVSVGLIREGFSLLEEMVFG-CGLGLNYVTLCSFLSASAQSGDTMTG 271

Query: 721 RWLHVYALK 747
           RW+H YA+K
Sbjct: 272 RWVHAYAVK 280



 Score = 70.9 bits (172), Expect = 5e-10
 Identities = 64/250 (25%), Positives = 107/250 (42%), Gaps = 36/250 (14%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICG--------------HSL---------- 108
           G++ H  ++K GL +S  A   N ++ MY  CG               S+          
Sbjct: 138 GEQGHGFVMKVGLGSSVKAC--NGVMDMYVKCGLLGMARRVFEEMGERSVVSWTVILDGV 195

Query: 109 -------SAFKVFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQ-GEGLCPDEV 264
                  S  +VFD +P   ++ V WT ++  Y   G       L   M  G GL  + V
Sbjct: 196 VKLEGVGSGRRVFDNMPE--RNEVAWTIMIVGYVSVGLIREGFSLLEEMVFGCGLGLNYV 253

Query: 265 TLVCFLTACSWLRDVVSGAQGHLCLIKR-GLLSTVTASNAAMDMYAKCGRMGDARRVFDE 441
           TL  FL+A +   D ++G   H   +K  G    +    A +DMYAKCGR+  A +VF+ 
Sbjct: 254 TLCSFLSASAQSGDTMTGRWVHAYAVKAVGNEIDIMVGTAVVDMYAKCGRVDTALKVFEH 313

Query: 442 VSEPNVVSWTVILTGVIKWEGVQNGRFFFDEM---SDRNEVAWTVMVAGYIESGFSSEAF 612
           + + N V+W  +L+G+      +     F +M   +  +++ +T +++    SG   +  
Sbjct: 314 MHQRNEVTWNALLSGLAMHGRGKLVLNMFPQMLKEAKPDDLTFTALLSACSHSGLVEQGR 373

Query: 613 LLLGNMVPSY 642
               N+  SY
Sbjct: 374 HYFDNLEASY 383


>ref|XP_004242297.1| PREDICTED: pentatricopeptide repeat-containing protein At5g15340,
           mitochondrial-like [Solanum lycopersicum]
          Length = 632

 Score =  247 bits (631), Expect = 3e-63
 Identities = 125/249 (50%), Positives = 164/249 (65%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G+K HA+++  GL    N F+ NA+LHMYA CG+ L A KVFD+IP ++KDTVDWTTLM 
Sbjct: 32  GKKLHASIVTTGLVNFRNTFLRNAILHMYAACGYVLYARKVFDEIPLSYKDTVDWTTLMG 91

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
           CYAR GFP +AL LF  M+   +  DE T+V    A +       G QGH C++K G  S
Sbjct: 92  CYARGGFPLDALKLFVHMRKSDVLIDEYTMVVVFLASTKTGCEQFGIQGHGCMVKMGFNS 151

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
           ++ A NA MDMY KCG +   +R+F E+ E +VVSWTV+L GV+K EG +N RF FD+M 
Sbjct: 152 SIKACNAVMDMYVKCGLIDKTKRIFREMGEKSVVSWTVVLKGVVKSEGFENARFLFDKMP 211

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
           +RNEVAWTVM+A YIE+G + EAF LL  M+       LN+V            G++L+G
Sbjct: 212 ERNEVAWTVMIAAYIENGLTKEAFGLLREML-FESGFELNFVTLSSLLSACAQSGNVLVG 270

Query: 721 RWLHVYALK 747
           +W+HVYALK
Sbjct: 271 KWVHVYALK 279



 Score = 69.3 bits (168), Expect = 1e-09
 Identities = 43/125 (34%), Positives = 68/125 (54%), Gaps = 2/125 (1%)
 Frame = +1

Query: 121 VFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQGE-GLCPDEVTLVCFLTACSW 297
           +FD++P   ++ V WT ++A Y  NG    A  L R M  E G   + VTL   L+AC+ 
Sbjct: 206 LFDKMPE--RNEVAWTVMIAAYIENGLTKEAFGLLREMLFESGFELNFVTLSSLLSACAQ 263

Query: 298 LRDVVSGAQGHLCLIKRGLLST-VTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTV 474
             +V+ G   H+  +K       +  +   ++MYAKCGR+ DA RVF  +   NV++W  
Sbjct: 264 SGNVLVGKWVHVYALKMIEHEIDIVVATTLINMYAKCGRIDDAFRVFLVMRRRNVITWNA 323

Query: 475 ILTGV 489
           +L+G+
Sbjct: 324 MLSGL 328


>ref|XP_002324074.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550320120|gb|EEF04207.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 636

 Score =  243 bits (620), Expect = 5e-62
 Identities = 122/251 (48%), Positives = 165/251 (65%), Gaps = 2/251 (0%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAAS-PNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLM 177
           G+K HA +L +GLA+S PN F+ NAL H+YA CG + SA  +F QIP +HKD  DWTTL+
Sbjct: 34  GKKLHAVILTSGLASSSPNTFLLNALHHLYASCGVTSSARHLFYQIPRSHKDVTDWTTLL 93

Query: 178 ACYARNGF-PNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGL 354
               ++G  P+     F+ M+ EG+  D+V ++     C+ + D+  G Q   CL+K GL
Sbjct: 94  TSLVQHGTKPSEGFFFFKEMRKEGVVLDDVAMISVFVLCTRVEDLGMGRQAQGCLVKMGL 153

Query: 355 LSTVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDE 534
              V   NA M+MY KCG + + RRVF E++E NVVSW+ +L GV+KWEGV+NGR  FDE
Sbjct: 154 GLGVKVCNAIMNMYVKCGLVEEVRRVFCEMNERNVVSWSTLLEGVVKWEGVENGRVVFDE 213

Query: 535 MSDRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDML 714
           M +RNEV WT+M+AGY+ +GFS E FLLL  MV  + ++ LN+V            GD+L
Sbjct: 214 MPERNEVGWTIMIAGYVGNGFSREGFLLLDEMVLRF-RLGLNFVTLSSILSACAQSGDVL 272

Query: 715 MGRWLHVYALK 747
           MGRW+HVYALK
Sbjct: 273 MGRWVHVYALK 283



 Score = 65.5 bits (158), Expect = 2e-08
 Identities = 44/125 (35%), Positives = 65/125 (52%), Gaps = 2/125 (1%)
 Frame = +1

Query: 121 VFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQGE-GLCPDEVTLVCFLTACSW 297
           VFD++P   ++ V WT ++A Y  NGF      L   M     L  + VTL   L+AC+ 
Sbjct: 210 VFDEMPE--RNEVGWTIMIAGYVGNGFSREGFLLLDEMVLRFRLGLNFVTLSSILSACAQ 267

Query: 298 LRDVVSGAQGHLCLIK-RGLLSTVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTV 474
             DV+ G   H+  +K  G    +    A +DMYAKCG +  A +VF  + + NVV+W  
Sbjct: 268 SGDVLMGRWVHVYALKGMGREMHIMVGTALVDMYAKCGPIDMAFKVFKYLPKRNVVAWNA 327

Query: 475 ILTGV 489
           +L G+
Sbjct: 328 MLGGL 332


>gb|EYU44138.1| hypothetical protein MIMGU_mgv1a003617mg [Mimulus guttatus]
          Length = 574

 Score =  225 bits (573), Expect = 1e-56
 Identities = 111/222 (50%), Positives = 149/222 (67%)
 Frame = +1

Query: 82  MYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQGEGLCPDE 261
           MYA CG   SA K+FD+IP   KDTVDWT LM C+ R G P +AL+LF  M+ EG+  DE
Sbjct: 1   MYAACGDVGSARKLFDEIPVPDKDTVDWTALMDCHGRFGSPMDALNLFVIMRREGVSVDE 60

Query: 262 VTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLSTVTASNAAMDMYAKCGRMGDARRVFDE 441
           +T+V     C+ + + V G QGH C+IK GL   + A NAAMDMY KCG M DA+R+FDE
Sbjct: 61  ITVVSLFGTCARVGNSVFGIQGHTCMIKLGLDFCIKARNAAMDMYVKCGLMIDAKRLFDE 120

Query: 442 VSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMSDRNEVAWTVMVAGYIESGFSSEAFLLL 621
            +  N+VSWTV+L GV+KWEG+  G+  FDEM +RNE+AWT+M++ Y+E+GFS EAF LL
Sbjct: 121 TTVRNIVSWTVLLWGVVKWEGLVKGKKLFDEMPERNEIAWTIMISRYVENGFSMEAFRLL 180

Query: 622 GNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMGRWLHVYALK 747
             M+    ++ LN              GD++MG+W+H +AL+
Sbjct: 181 REMILG-SELQLNSTSLCSLLSACTQSGDVVMGKWVHSHALR 221



 Score = 79.7 bits (195), Expect = 1e-12
 Identities = 53/169 (31%), Positives = 84/169 (49%), Gaps = 6/169 (3%)
 Frame = +1

Query: 118 KVFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSM-QGEGLCPDEVTLVCFLTACS 294
           K+FD++P   ++ + WT +++ Y  NGF   A  L R M  G  L  +  +L   L+AC+
Sbjct: 147 KLFDEMPE--RNEIAWTIMISRYVENGFSMEAFRLLREMILGSELQLNSTSLCSLLSACT 204

Query: 295 WLRDVVSGAQGHLCLIKRGLLST--VTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSW 468
              DVV G   H   ++  + +T  +    A +DMYAKCGR+  A +VF  +   NVV+W
Sbjct: 205 QSGDVVMGKWVHSHALRATIDATTNIKFETALLDMYAKCGRLNSALQVFKSMPAKNVVTW 264

Query: 469 TVILTGVIKWEGVQNGRFFFDEMSDR---NEVAWTVMVAGYIESGFSSE 606
             +L G+            F EM +    N+V +T +++    SG   E
Sbjct: 265 NAMLGGLAMHGKGAMVLDMFKEMVEEIKPNDVTFTAVLSACSHSGLVDE 313


>ref|XP_006400058.1| hypothetical protein EUTSA_v10012971mg [Eutrema salsugineum]
           gi|557101148|gb|ESQ41511.1| hypothetical protein
           EUTSA_v10012971mg [Eutrema salsugineum]
          Length = 623

 Score =  224 bits (572), Expect = 2e-56
 Identities = 112/250 (44%), Positives = 161/250 (64%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G++ HA L  +GL   P +++ N L   YA  G   +A K+FD+IP   KD VDWTTL++
Sbjct: 25  GKELHAVLTTSGLTKVPRSYLSNKLFQFYAASGDMTTARKLFDEIPLLEKDNVDWTTLLS 84

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
            ++R GF  +++ LF  M  + +  D+V++VC    CS L D+  G QGH   +K GLL+
Sbjct: 85  SFSRYGFLVDSMKLFVDMTRKRVEIDDVSVVCLFGVCSKLEDLGFGVQGHGFALKMGLLT 144

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
           +V   NA MDMY KCG +G+ +R+FD + E +VVSWTV++  V+KWEG++ GR  FD+M 
Sbjct: 145 SVKVCNALMDMYGKCGFVGEVKRIFDVLEEKSVVSWTVVMDTVVKWEGLERGREVFDKMP 204

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
           +RN VAWTVMVAGY+ +GF+ EA  LL  MV      +LN+V            G++++G
Sbjct: 205 ERNAVAWTVMVAGYLGAGFTGEALELLAEMV-FKCGHDLNFVSLCSMLSACAQSGNLVIG 263

Query: 721 RWLHVYALKR 750
           RW+HVYALK+
Sbjct: 264 RWVHVYALKK 273



 Score = 68.2 bits (165), Expect = 3e-09
 Identities = 61/247 (24%), Positives = 101/247 (40%), Gaps = 43/247 (17%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNT------------ 144
           G + H   LK GL  S    + NAL+ MY  CG      ++FD +               
Sbjct: 130 GVQGHGFALKMGLLTSVK--VCNALMDMYGKCGFVGEVKRIFDVLEEKSVVSWTVVMDTV 187

Query: 145 -----------------HKDTVDWTTLMACYARNGFPNNALHLFRSMQGE-GLCPDEVTL 270
                             ++ V WT ++A Y   GF   AL L   M  + G   + V+L
Sbjct: 188 VKWEGLERGREVFDKMPERNAVAWTVMVAGYLGAGFTGEALELLAEMVFKCGHDLNFVSL 247

Query: 271 VCFLTACSWLRDVVSGAQGHLCLIKRGLL-------STVTASNAAMDMYAKCGRMGDARR 429
              L+AC+   ++V G   H+  +K+ ++         V    A +DMYAKCG +  + +
Sbjct: 248 CSMLSACAQSGNLVIGRWVHVYALKKAMIMEEEESYDPVMVGTALVDMYAKCGNIDSSIK 307

Query: 430 VFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMSDR------NEVAWTVMVAGYIES 591
           VF  + + NVV+W  + +G+        GR   D   +       +E+ +T +++    S
Sbjct: 308 VFRLMRKRNVVTWNAMFSGLAMH---GKGRMVIDMFPEMVREVKPDELTFTAVLSACSHS 364

Query: 592 GFSSEAF 612
           G   E +
Sbjct: 365 GMVGEGW 371


>ref|XP_004489421.1| PREDICTED: pentatricopeptide repeat-containing protein At5g15340,
           mitochondrial-like [Cicer arietinum]
          Length = 635

 Score =  223 bits (568), Expect = 6e-56
 Identities = 114/249 (45%), Positives = 160/249 (64%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G++ HA +   GL +SPN F+ NA+LH+YA C     A K+FD+IP++HKD+VD+TTL+ 
Sbjct: 39  GKQLHAAVTVTGLISSPNLFLRNAVLHLYASCSLPSHARKLFDEIPHSHKDSVDYTTLIR 98

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
           C      P  +L LF  M+   L  D V +VC L AC+   D+  G Q H+C++K G   
Sbjct: 99  CSP----PFESLKLFVQMRQLCLPLDGVAMVCSLNACARHGDLNLGPQMHVCVVKFGFEK 154

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
            V   NA M++Y K G +GDA++VF+E+  P+VVSWT++L G++KWEG+++GR  FDEM 
Sbjct: 155 FVKVCNALMNVYVKFGLLGDAKKVFEEIEVPSVVSWTIVLEGLVKWEGLESGRVVFDEMP 214

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
           +RNEVAWTVM+ GY+ +GF+ EAF LL  MV       LN V            GD+ +G
Sbjct: 215 ERNEVAWTVMIVGYVGNGFTKEAFCLLKEMVFG-CGFVLNCVTLCSVLSACSQSGDVCLG 273

Query: 721 RWLHVYALK 747
           RW+H YA+K
Sbjct: 274 RWVHGYAVK 282



 Score = 71.6 bits (174), Expect = 3e-10
 Identities = 47/125 (37%), Positives = 65/125 (52%), Gaps = 2/125 (1%)
 Frame = +1

Query: 121 VFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQ-GEGLCPDEVTLVCFLTACSW 297
           VFD++P   ++ V WT ++  Y  NGF   A  L + M  G G   + VTL   L+ACS 
Sbjct: 209 VFDEMPE--RNEVAWTVMIVGYVGNGFTKEAFCLLKEMVFGCGFVLNCVTLCSVLSACSQ 266

Query: 298 LRDVVSGAQGHLCLIKR-GLLSTVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTV 474
             DV  G   H   +K  GL   V    + +DMYAKCGR+  +  VF  +   NVV+W  
Sbjct: 267 SGDVCLGRWVHGYAVKEMGLDFGVMVGTSLVDMYAKCGRISASLMVFRHMWRRNVVAWNA 326

Query: 475 ILTGV 489
           +L G+
Sbjct: 327 MLGGL 331


>ref|XP_006290195.1| hypothetical protein CARUB_v10003880mg [Capsella rubella]
           gi|482558901|gb|EOA23093.1| hypothetical protein
           CARUB_v10003880mg [Capsella rubella]
          Length = 624

 Score =  223 bits (568), Expect = 6e-56
 Identities = 110/250 (44%), Positives = 160/250 (64%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G++ HA L  +GL  +P +++ NAL   YA  G  ++A K+FD+IP + KD VDWTTL++
Sbjct: 26  GKELHAVLTTSGLKKAPRSYLTNALFQFYAASGELVTAHKLFDEIPLSEKDNVDWTTLLS 85

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
            ++R G+  +++ LF  M  E +  D+V+LVC    CS L D+  G QGH   +K G L+
Sbjct: 86  SFSRFGWLVDSMKLFVEMTRESVEIDDVSLVCLFCVCSKLEDLKFGEQGHGVAVKMGFLT 145

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
           +V   NA MDMY KCG + + +R+F+ + E +VVSWTV+L  V+KWEG+  GR  FD+M 
Sbjct: 146 SVKVCNALMDMYGKCGFVSEVKRIFEVLEEKSVVSWTVVLDTVVKWEGLDRGREVFDQMP 205

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
           +RN VAWTV+VAGY+ +GF+ E   LL  MV       LN+V            G++++G
Sbjct: 206 ERNAVAWTVLVAGYLGAGFTREVLELLAEMV-FRCGHGLNFVTLCSMLSACAQSGNLVIG 264

Query: 721 RWLHVYALKR 750
           RW+HVYALK+
Sbjct: 265 RWVHVYALKK 274



 Score = 71.6 bits (174), Expect = 3e-10
 Identities = 57/204 (27%), Positives = 87/204 (42%), Gaps = 41/204 (20%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFK-------------------- 120
           G++ H   +K G   S    + NAL+ MY  CG      +                    
Sbjct: 131 GEQGHGVAVKMGFLTSVK--VCNALMDMYGKCGFVSEVKRIFEVLEEKSVVSWTVVLDTV 188

Query: 121 -----------VFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQ---GEGLCPD 258
                      VFDQ+P   ++ V WT L+A Y   GF    L L   M    G GL  +
Sbjct: 189 VKWEGLDRGREVFDQMPE--RNAVAWTVLVAGYLGAGFTREVLELLAEMVFRCGHGL--N 244

Query: 259 EVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLL-------STVTASNAAMDMYAKCGRMG 417
            VTL   L+AC+   ++V G   H+  +K+ ++         V    A +DMYAKCG + 
Sbjct: 245 FVTLCSMLSACAQSGNLVIGRWVHVYALKKAMMMGEEVTYDDVMVGTAMVDMYAKCGNID 304

Query: 418 DARRVFDEVSEPNVVSWTVILTGV 489
            + +VF  + + NVV+W  + +G+
Sbjct: 305 SSMKVFRLMPKRNVVTWNAMFSGL 328


>ref|NP_197038.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75180838|sp|Q9LXE8.1|PP386_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g15340, mitochondrial; Flags: Precursor
           gi|7671503|emb|CAB89344.1| putative protein [Arabidopsis
           thaliana] gi|332004768|gb|AED92151.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 623

 Score =  221 bits (564), Expect = 2e-55
 Identities = 109/250 (43%), Positives = 161/250 (64%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G++ HA L  +GL  +P +++ NAL   YA  G  ++A K+FD+IP + KD VDWTTL++
Sbjct: 25  GKELHAVLTTSGLKKAPRSYLSNALFQFYASSGEMVTAQKLFDEIPLSEKDNVDWTTLLS 84

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
            ++R G   N++ LF  M+ + +  D+V++VC    C+ L D+    QGH   +K G+L+
Sbjct: 85  SFSRYGLLVNSMKLFVEMRRKRVEIDDVSVVCLFGVCAKLEDLGFAQQGHGVAVKMGVLT 144

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
           +V   NA MDMY KCG + + +R+F+E+ E +VVSWTV+L  V+KWEG++ GR  F EM 
Sbjct: 145 SVKVCNALMDMYGKCGLVSEVKRIFEELEEKSVVSWTVVLDTVVKWEGLERGREVFHEMP 204

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
           +RN VAWTVMVAGY+ +GF+ E   LL  MV       LN+V            G++++G
Sbjct: 205 ERNAVAWTVMVAGYLGAGFTREVLELLAEMV-FRCGHGLNFVTLCSMLSACAQSGNLVVG 263

Query: 721 RWLHVYALKR 750
           RW+HVYALK+
Sbjct: 264 RWVHVYALKK 273



 Score = 67.8 bits (164), Expect = 4e-09
 Identities = 53/201 (26%), Positives = 86/201 (42%), Gaps = 39/201 (19%)
 Frame = +1

Query: 4   QKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNT------------- 144
           Q+ H   +K G+  S    + NAL+ MY  CG      ++F+++                
Sbjct: 131 QQGHGVAVKMGVLTSVK--VCNALMDMYGKCGLVSEVKRIFEELEEKSVVSWTVVLDTVV 188

Query: 145 ----------------HKDTVDWTTLMACYARNGFPNNALHLFRSMQ---GEGLCPDEVT 267
                            ++ V WT ++A Y   GF    L L   M    G GL  + VT
Sbjct: 189 KWEGLERGREVFHEMPERNAVAWTVMVAGYLGAGFTREVLELLAEMVFRCGHGL--NFVT 246

Query: 268 LVCFLTACSWLRDVVSGAQGHLCLIKRGLLSTVTAS-------NAAMDMYAKCGRMGDAR 426
           L   L+AC+   ++V G   H+  +K+ ++    AS        A +DMYAKCG +  + 
Sbjct: 247 LCSMLSACAQSGNLVVGRWVHVYALKKEMMMGEEASYDDVMVGTALVDMYAKCGNIDSSM 306

Query: 427 RVFDEVSEPNVVSWTVILTGV 489
            VF  + + NVV+W  + +G+
Sbjct: 307 NVFRLMRKRNVVTWNALFSGL 327


>ref|XP_002873720.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297319557|gb|EFH49979.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 623

 Score =  218 bits (555), Expect = 2e-54
 Identities = 109/250 (43%), Positives = 158/250 (63%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G++ HA L  +GL  +P +++ NAL   YA  G   +A K+FD+IP + KD VDWTTL++
Sbjct: 25  GRELHAVLTTSGLKKAPRSYLSNALFQFYASSGEIATAQKLFDEIPLSDKDNVDWTTLLS 84

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
            ++R G   N++ LF  M+ + +  D V+LVC    C+ L D+  G QGH   +K G L+
Sbjct: 85  SFSRFGLLVNSMKLFVEMRRKRVEIDHVSLVCLFGVCAKLEDLRFGEQGHGVAVKMGFLT 144

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
           +V   NA MDMY KCG + + +R+F  + E +VVSWTV+L  ++KWEG++ GR  FDEM 
Sbjct: 145 SVKVCNALMDMYGKCGFVSEVKRIFQALEEKSVVSWTVVLDTLVKWEGLKRGREVFDEMP 204

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMG 720
           +RN VAWT+MVAGY+ +GF+ E   LL  MV       LN+V            G++++G
Sbjct: 205 ERNVVAWTLMVAGYLGAGFTREVLELLAEMV-FRCGHGLNFVTLCSMLSACAQSGNLVIG 263

Query: 721 RWLHVYALKR 750
           RW+HVYALK+
Sbjct: 264 RWVHVYALKK 273



 Score = 68.6 bits (166), Expect = 2e-09
 Identities = 55/204 (26%), Positives = 87/204 (42%), Gaps = 41/204 (20%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFK-------------------- 120
           G++ H   +K G   S    + NAL+ MY  CG      +                    
Sbjct: 130 GEQGHGVAVKMGFLTSVK--VCNALMDMYGKCGFVSEVKRIFQALEEKSVVSWTVVLDTL 187

Query: 121 -----------VFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQ---GEGLCPD 258
                      VFD++P   ++ V WT ++A Y   GF    L L   M    G GL  +
Sbjct: 188 VKWEGLKRGREVFDEMPE--RNVVAWTLMVAGYLGAGFTREVLELLAEMVFRCGHGL--N 243

Query: 259 EVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLL-------STVTASNAAMDMYAKCGRMG 417
            VTL   L+AC+   ++V G   H+  +K+ ++         V    A +DMYAKCG + 
Sbjct: 244 FVTLCSMLSACAQSGNLVIGRWVHVYALKKAMMMGEEETYDGVMVGTALVDMYAKCGNID 303

Query: 418 DARRVFDEVSEPNVVSWTVILTGV 489
            + +VF  + + NVV+W  + +G+
Sbjct: 304 SSIKVFRLMRKRNVVTWNALFSGL 327


>ref|XP_007151258.1| hypothetical protein PHAVU_004G031500g [Phaseolus vulgaris]
           gi|561024567|gb|ESW23252.1| hypothetical protein
           PHAVU_004G031500g [Phaseolus vulgaris]
          Length = 734

 Score =  217 bits (552), Expect = 4e-54
 Identities = 109/211 (51%), Positives = 144/211 (68%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G++ HA     GL +SP+ F+ NAL+H+YA C   L A K+FD+IP+THKD+VD+T L+ 
Sbjct: 33  GEQLHAAATVAGLLSSPSHFLLNALVHLYAACSLPLHAHKLFDRIPHTHKDSVDYTALIR 92

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
           C      P ++L  F  M+   L  D VTL+C L AC+ L D     Q H+ ++K GLLS
Sbjct: 93  C----SHPLDSLRFFLQMRQRALPLDGVTLICALGACARLEDHSLVPQMHVGVVKFGLLS 148

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
                NA MD Y KCG +G+ARRVF E+ EP+VVSWTV+L GV+KWEGV++GR  FD M 
Sbjct: 149 HTKVCNAVMDGYVKCGLLGEARRVFKEIEEPSVVSWTVVLEGVVKWEGVESGRAVFDLMP 208

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMV 633
           +RNEVAWTVM+ GY+ +GF+ EAF+LL  MV
Sbjct: 209 ERNEVAWTVMIKGYVGNGFTEEAFMLLREMV 239


>gb|EPS65080.1| hypothetical protein M569_09698 [Genlisea aurea]
          Length = 792

 Score =  214 bits (544), Expect = 3e-53
 Identities = 113/253 (44%), Positives = 157/253 (62%), Gaps = 4/253 (1%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G+K HA ++ +GL   P AF+ N +LHMYA  G    A KVFD IP   KDTVDWT L+ 
Sbjct: 195 GKKLHAAVVTSGLITLPGAFLRNVILHMYAASGDLACARKVFDDIPVGLKDTVDWTRLID 254

Query: 181 CYARNGFPNN--ALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGL 354
           CY R G+ ++   L LF  M+ +G+  DE+T++  L  CS + + V G QGH C+IK GL
Sbjct: 255 CYNRYGWSSSLDGLSLFVDMRRQGVPMDEITIMAVLGICSKIGNPVFGIQGHACMIKMGL 314

Query: 355 --LSTVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFF 528
              S + A NAAMDMY KCG + +A+++FD     +VVSWTV+L GV+KWEG++ G+  F
Sbjct: 315 GLSSGLKAWNAAMDMYVKCGMLAEAKKLFDGFDGKDVVSWTVLLWGVLKWEGLEKGKKLF 374

Query: 529 DEMSDRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGD 708
           DEM +RNE+AW+++V+ YIE+ F  EAF LL  M         + +            GD
Sbjct: 375 DEMPERNEIAWSILVSRYIENCFIREAFHLLQEMAAESTSFTSSSL--CLLLSACTQSGD 432

Query: 709 MLMGRWLHVYALK 747
           +  GRW+H +AL+
Sbjct: 433 VSTGRWVHSFALR 445



 Score = 87.0 bits (214), Expect = 6e-15
 Identities = 68/235 (28%), Positives = 97/235 (41%), Gaps = 37/235 (15%)
 Frame = +1

Query: 1    GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLM- 177
            G + HA ++K GL  S      NA + MY  CG    A K+FD      KD V WT L+ 
Sbjct: 302  GIQGHACMIKMGLGLSSGLKAWNAAMDMYVKCGMLAEAKKLFDGFDG--KDVVSWTVLLW 359

Query: 178  ------------------------------ACYARNGFPNNALHLFRSMQGEGLCPDEVT 267
                                          + Y  N F   A HL + M  E       +
Sbjct: 360  GVLKWEGLEKGKKLFDEMPERNEIAWSILVSRYIENCFIREAFHLLQEMAAESTSFTSSS 419

Query: 268  LVCFLTACSWLRDVVSGAQGHLCLIKR--GLLSTVTASNAAMDMYAKCGRMGDARRVFDE 441
            L   L+AC+   DV +G   H   ++      + V  S A +DMYAKCGR+  A RVF+ 
Sbjct: 420  LCLLLSACTQSGDVSTGRWVHSFALRTTADASTDVRFSTALLDMYAKCGRINSAIRVFEA 479

Query: 442  VSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMSD----RNEVAWTVMVAGYIESG 594
            +   NVV+W  +L G+            FD M+D     ++V +T +++    SG
Sbjct: 480  MPSKNVVTWNAMLGGLAMHGRGSVALDMFDSMADGGWKPDDVTFTALLSACSHSG 534



 Score = 67.0 bits (162), Expect = 7e-09
 Identities = 52/162 (32%), Positives = 78/162 (48%), Gaps = 8/162 (4%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G+  H+  L+    AS +     ALL MYA CG   SA +VF+ +P+  K+ V W  ++ 
Sbjct: 436 GRWVHSFALRTTADASTDVRFSTALLDMYAKCGRINSAIRVFEAMPS--KNVVTWNAMLG 493

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRG--L 354
             A +G  + AL +F SM   G  PD+VT    L+ACS           H  L+ RG  L
Sbjct: 494 GLAMHGRGSVALDMFDSMADGGWKPDDVTFTALLSACS-----------HSGLVDRGREL 542

Query: 355 LSTVTASN-----AAMDMYAKCGRMGDARRVFDEV-SEPNVV 462
              V + +     AA+D+  + G + +A  V   +  +PN V
Sbjct: 543 FRAVKSPSMENCAAAVDLLGRAGHLEEAEAVIRGMPMQPNEV 584


>ref|XP_006603878.1| PREDICTED: pentatricopeptide repeat-containing protein At5g15340,
           mitochondrial-like [Glycine max]
          Length = 706

 Score =  205 bits (522), Expect = 1e-50
 Identities = 105/211 (49%), Positives = 141/211 (66%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G++ HA    +GL  SP++F+ NALLH+YA C     A K+FD+IP++HKD+VD+T L+ 
Sbjct: 31  GEQLHAAATVSGLLFSPSSFLLNALLHLYASCPLPSHARKLFDRIPHSHKDSVDYTALIR 90

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLS 360
           C      P +AL  +  M+   L  D V L+C L ACS L D     Q H+ ++K G L 
Sbjct: 91  C----SHPLDALRFYLQMRQRALPLDGVALICALGACSKLGDSNLVPQMHVGVVKFGFLR 146

Query: 361 TVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMS 540
                N  MD Y KCG +G+ARRVF+E+ EP+VVSWTV+L GV+K EGV++G+  FDEM 
Sbjct: 147 HTKVLNGVMDGYVKCGLVGEARRVFEEIEEPSVVSWTVVLEGVVKCEGVESGKVVFDEMP 206

Query: 541 DRNEVAWTVMVAGYIESGFSSEAFLLLGNMV 633
           +RNEVAWTV++ GY+ SGF+ EAFLLL  MV
Sbjct: 207 ERNEVAWTVLIKGYVGSGFTKEAFLLLKEMV 237


>ref|XP_003618546.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355493561|gb|AES74764.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 637

 Score =  205 bits (522), Expect = 1e-50
 Identities = 110/251 (43%), Positives = 156/251 (62%), Gaps = 2/251 (0%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           GQ+ HAT +  GL +SPN F+ NALLH+Y  C     A K+FD+IP +HKD+VD+T L+ 
Sbjct: 39  GQQLHATAIVTGLISSPNHFLRNALLHLYGSCSLPSHARKLFDEIPQSHKDSVDYTALI- 97

Query: 181 CYARNGFPNNALHLFRSMQGEGLCPDEVTLVCFLTACSWLR--DVVSGAQGHLCLIKRGL 354
              R+  P  +L LF  M+   L  D V +VC L AC+ L   D   G+Q H+ ++K G 
Sbjct: 98  ---RHCPPFESLKLFIQMRQFDLPLDGVVMVCALNACARLGGGDTKVGSQMHVGVVKFGF 154

Query: 355 LSTVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDE 534
           +      NA M++Y K G +G+AR++F+ +   +VVSW+  L G++KWE V++GR  FDE
Sbjct: 155 VKFDKVCNALMNVYVKFGLVGEARKMFEGIEVRSVVSWSCFLEGLVKWESVESGRVLFDE 214

Query: 535 MSDRNEVAWTVMVAGYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDML 714
           M +RNEVAWTVM+ GY+ +GF+ EAFLLL  MV       L++V            GD+ 
Sbjct: 215 MPERNEVAWTVMIVGYVGNGFTKEAFLLLKEMVFG-CGFRLSFVTLCSVLSACSQSGDVC 273

Query: 715 MGRWLHVYALK 747
           +GRW+H YA+K
Sbjct: 274 VGRWVHCYAVK 284



 Score = 72.8 bits (177), Expect = 1e-10
 Identities = 47/125 (37%), Positives = 65/125 (52%), Gaps = 2/125 (1%)
 Frame = +1

Query: 121 VFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRSMQ-GEGLCPDEVTLVCFLTACSW 297
           +FD++P   ++ V WT ++  Y  NGF   A  L + M  G G     VTL   L+ACS 
Sbjct: 211 LFDEMPE--RNEVAWTVMIVGYVGNGFTKEAFLLLKEMVFGCGFRLSFVTLCSVLSACSQ 268

Query: 298 LRDVVSGAQGHLCLIKR-GLLSTVTASNAAMDMYAKCGRMGDARRVFDEVSEPNVVSWTV 474
             DV  G   H   +K  GL   V    + +DMYAKCGR+  A  VF  + + NVV+W  
Sbjct: 269 SGDVCVGRWVHCYAVKEMGLDFGVMVGTSLVDMYAKCGRINAALSVFRSMLKRNVVAWNA 328

Query: 475 ILTGV 489
           +L G+
Sbjct: 329 MLGGL 333


>gb|EPS72347.1| hypothetical protein M569_02407, partial [Genlisea aurea]
          Length = 646

 Score =  149 bits (377), Expect = 8e-34
 Identities = 85/234 (36%), Positives = 128/234 (54%), Gaps = 1/234 (0%)
 Frame = +1

Query: 52  NAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRS 231
           N F+HNAL+H     G   SA KVFD+ P   +D V W +L+  YAR+G P+ AL ++R+
Sbjct: 174 NVFVHNALIHFLITSGELNSARKVFDESPV--RDLVSWNSLINGYARSGKPDEALRIYRA 231

Query: 232 MQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLSTVTASNAAMDMYAKCGR 411
           M+     PD+VT++  +TAC+ LRD+  G++ H  ++  GL  T    NA +DMY KCG 
Sbjct: 232 MEDR---PDDVTMIGVVTACTQLRDLKLGSETHHHIVNHGLKLTAPLVNALLDMYMKCGA 288

Query: 412 MGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMSDRNE-VAWTVMVAGYIE 588
              A R+F  +   N VSWT ++ G  +   +   R  FDEM D+++ V W  +++GY+E
Sbjct: 289 ADRAERLFRRMEARNAVSWTTMVVGYARQGRLDVARRVFDEMPDKDDAVPWNALMSGYVE 348

Query: 589 SGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMGRWLHVYALKR 750
           +    EA  L   M+ S   I+ + V            G + +G W+H Y  KR
Sbjct: 349 TQSHREALSLFNEMIAS--GIDPDEVTMTSCLSACTHLGALDVGVWIHRYVEKR 400



 Score = 89.4 bits (220), Expect = 1e-15
 Identities = 53/145 (36%), Positives = 75/145 (51%)
 Frame = +1

Query: 52  NAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRS 231
           NA     ++  YA  G    A +VFD++P+   D V W  LM+ Y        AL LF  
Sbjct: 303 NAVSWTTMVVGYARQGRLDVARRVFDEMPDKD-DAVPWNALMSGYVETQSHREALSLFNE 361

Query: 232 MQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLSTVTASNAAMDMYAKCGR 411
           M   G+ PDEVT+   L+AC+ L  +  G   H  + KR +   V    A +DMYAKCG 
Sbjct: 362 MIASGIDPDEVTMTSCLSACTHLGALDVGVWIHRYVEKRRIPVNVVLGTALVDMYAKCGN 421

Query: 412 MGDARRVFDEVSEPNVVSWTVILTG 486
           +  A RVFDE+   N +++T ++ G
Sbjct: 422 ISKALRVFDEIPARNALTYTAVICG 446



 Score = 65.9 bits (159), Expect = 1e-08
 Identities = 44/141 (31%), Positives = 74/141 (52%), Gaps = 2/141 (1%)
 Frame = +1

Query: 52  NAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMACYARNGFPNNALHLFRS 231
           N  +  AL+ MYA CG+   A +VFD+IP   ++ + +T ++   A +G  ++AL LFR 
Sbjct: 405 NVVLGTALVDMYAKCGNISKALRVFDEIPA--RNALTYTAVICGSALHGDASDALSLFRR 462

Query: 232 MQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCL-IKRGLLSTVTASNAAMDMYAKCG 408
           M  +GL PDEVTL+  LTAC     V  G      +  K  +   +   +  +D+  + G
Sbjct: 463 MLRDGLQPDEVTLLGVLTACCHGGLVEEGKAAFSDMRSKYNIPPQIKHYSVMVDLLGRAG 522

Query: 409 RMGDARRVFDEV-SEPNVVSW 468
            + +A  + + + +EP+   W
Sbjct: 523 LLDEAAELLESMPAEPDSAVW 543


>ref|XP_002282912.2| PREDICTED: pentatricopeptide repeat-containing protein At2g22410,
           mitochondrial-like [Vitis vinifera]
          Length = 642

 Score =  146 bits (369), Expect = 7e-33
 Identities = 78/237 (32%), Positives = 126/237 (53%)
 Frame = +1

Query: 37  LAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMACYARNGFPNNAL 216
           L    + F+ NA++H+   CG    A K+FD+  +  +D V W +++  Y R G+   AL
Sbjct: 136 LGFDSDIFVSNAVIHLLVSCGDLDGARKMFDK--SCVRDLVSWNSMINGYVRRGWAYEAL 193

Query: 217 HLFRSMQGEGLCPDEVTLVCFLTACSWLRDVVSGAQGHLCLIKRGLLSTVTASNAAMDMY 396
           + +R M+ EG+ PDEVT++  +++C+ L D+  G + H  + + GL  TV  +NA MDMY
Sbjct: 194 NFYREMKVEGIKPDEVTMIGVVSSCAQLEDLDLGRESHCYIEENGLKLTVPLANALMDMY 253

Query: 397 AKCGRMGDARRVFDEVSEPNVVSWTVILTGVIKWEGVQNGRFFFDEMSDRNEVAWTVMVA 576
            KCG +  AR++FD ++   +VSWT ++ G  +   +      FDEM D++ V W  M+ 
Sbjct: 254 MKCGNLESARKLFDSMTNKTMVSWTTMVVGYAQSGLLDMAWKLFDEMPDKDVVPWNAMIG 313

Query: 577 GYIESGFSSEAFLLLGNMVPSYLQINLNYVXXXXXXXXXXXXGDMLMGRWLHVYALK 747
           GY+ +    EA  L   M    + IN + V            G + +G W+H Y  K
Sbjct: 314 GYVHANRGKEALALFNEM--QAMNINPDEVTMVSCLSACSQLGALDVGIWIHHYIEK 368



 Score =  100 bits (250), Expect = 4e-19
 Identities = 71/228 (31%), Positives = 107/228 (46%), Gaps = 31/228 (13%)
 Frame = +1

Query: 1   GQKPHATLLKNGLAASPNAFIHNALLHMYAICGHSLSAFKVFDQIPNTHKDTVDWTTLMA 180
           G++ H  + +NGL  +    + NAL+ MY  CG+  SA K+FD +  T+K  V WTT++ 
Sbjct: 227 GRESHCYIEENGLKLTVP--LANALMDMYMKCGNLESARKLFDSM--TNKTMVSWTTMVV 282

Query: 181 CYARNGFPN-------------------------------NALHLFRSMQGEGLCPDEVT 267
            YA++G  +                                AL LF  MQ   + PDEVT
Sbjct: 283 GYAQSGLLDMAWKLFDEMPDKDVVPWNAMIGGYVHANRGKEALALFNEMQAMNINPDEVT 342

Query: 268 LVCFLTACSWLRDVVSGAQGHLCLIKRGLLSTVTASNAAMDMYAKCGRMGDARRVFDEVS 447
           +V  L+ACS L  +  G   H  + K  L   V    A +DMYAKCG++  A +VF E+ 
Sbjct: 343 MVSCLSACSQLGALDVGIWIHHYIEKHELSLNVALGTALIDMYAKCGKITKAIQVFQELP 402

Query: 448 EPNVVSWTVILTGVIKWEGVQNGRFFFDEMSDRNEVAWTVMVAGYIES 591
             N ++WT I++G+           +F EM D + +   V   G + +
Sbjct: 403 GRNSLTWTAIISGLALHGNAHGAIAYFSEMIDNSVMPDEVTFLGLLSA 450


Top