BLASTX nr result

ID: Stemona21_contig00011051 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Stemona21_contig00011051
         (1248 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ14918.1| hypothetical protein PRUPE_ppa000485mg [Prunus pe...   330   1e-87
ref|XP_002273767.1| PREDICTED: peroxisome biogenesis protein 1-l...   326   1e-86
ref|XP_002517570.1| peroxisome biogenesis factor, putative [Rici...   322   2e-85
gb|EXC24769.1| Peroxisome biogenesis protein 1 [Morus notabilis]      322   3e-85
emb|CBI20540.3| unnamed protein product [Vitis vinifera]              320   6e-85
ref|XP_006448771.1| hypothetical protein CICLE_v10014090mg [Citr...   319   1e-84
ref|XP_006468418.1| PREDICTED: peroxisome biogenesis protein 1-l...   315   3e-83
ref|XP_002298113.2| hypothetical protein POPTR_0001s17400g [Popu...   308   3e-81
ref|XP_004293758.1| PREDICTED: peroxisome biogenesis protein 1-l...   305   2e-80
ref|XP_006365432.1| PREDICTED: peroxisome biogenesis protein 1-l...   299   1e-78
ref|XP_004237362.1| PREDICTED: peroxisome biogenesis protein 1-l...   291   4e-76
ref|XP_006399345.1| hypothetical protein EUTSA_v10012497mg [Eutr...   288   4e-75
dbj|BAB09996.1| unnamed protein product [Arabidopsis thaliana]        285   2e-74
gb|AAG44817.1| peroxisome biogenesis protein PEX1 [Arabidopsis t...   285   2e-74
ref|NP_196464.2| peroxisome biogenesis protein 1 [Arabidopsis th...   285   2e-74
ref|XP_003529444.1| PREDICTED: peroxisome biogenesis protein 1-l...   284   5e-74
ref|XP_006853404.1| hypothetical protein AMTR_s00032p00152530 [A...   284   6e-74
gb|EOY27465.1| Peroxisome biogenesis protein 1 [Theobroma cacao]      281   4e-73
ref|XP_002871329.1| peroxisome biogenesis protein PEX1 [Arabidop...   281   5e-73
gb|ESW29810.1| hypothetical protein PHAVU_002G100600g [Phaseolus...   280   9e-73

>gb|EMJ14918.1| hypothetical protein PRUPE_ppa000485mg [Prunus persica]
          Length = 1135

 Score =  330 bits (845), Expect = 1e-87
 Identities = 194/417 (46%), Positives = 268/417 (64%), Gaps = 4/417 (0%)
 Frame = -3

Query: 1240 EENYGLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQK 1061
            E N G+S+  K+  R T+VR++ SDSVAKGH+M+ QSLR++LR  LHSWVY+K  +   K
Sbjct: 274  ESNNGISNDKKDN-RETIVRLLISDSVAKGHVMVAQSLRLYLRARLHSWVYLKGCNGILK 332

Query: 1060 KDIPSLTLSPCRLKSMGKNFALERNGLDS-DIHTNFKSKNMISRKDLFLGVNVPDWSLHE 884
             DIP L+LSPC  K  GK+ A+ERNG++  D H   K KNM+        ++V DWS H+
Sbjct: 333  TDIPLLSLSPCHFKIFGKDKAVERNGIEVLDRHKIRKKKNMLLTTGSSTYIDVTDWSTHD 392

Query: 883  KLFSSLSCGFFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKE 704
            K+  + S        E A+ ++   +  + L+  W+  QL AIAS   G E+ S+ L  E
Sbjct: 393  KVVDAFSYESSCKEDEGASQKSEEGKGVESLVKAWILAQLDAIAS-NAGEEINSLVLGNE 451

Query: 703  TLLHFELKILNLKSKNEQPLFDRS---LERGSSTGESVVELLYILTAKFKESSQDDLGNI 533
            T+LHFE+K    KS  E+ + + S   LE  +   E  VE+LY+LT  F + SQ   GN 
Sbjct: 452  TILHFEVK--GQKSGIEEKVHESSSGGLENKNENAELPVEILYVLT--FSKESQH-AGNA 506

Query: 532  CELALNTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAIS 353
             EL  + R+K  NN+   +++  KL+ G+P+S  SV E   +    + +SSLSWM    S
Sbjct: 507  YELVFDERNKDNNNLGGLETIV-KLKEGDPLSFYSVRERMSEKDVPADVSSLSWMGTIAS 565

Query: 352  DVTNRLTVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHV 173
            DV NR+ VLL+P SG   S+  LPLPGHVLIHGP GSGKT LA  VAKCLEE +++LAHV
Sbjct: 566  DVLNRMLVLLTPASGAWFSSHDLPLPGHVLIHGPPGSGKTLLARTVAKCLEEDKDLLAHV 625

Query: 172  IFISCSKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPSS 2
            +F+SCS+L++EK+  IRQA+++ +SEAL H+PS++I DDLD IV  SS+S GSQ S+
Sbjct: 626  VFVSCSQLAMEKALTIRQALSSYMSEALDHAPSLVILDDLDSIVSSSSDSEGSQTST 682


>ref|XP_002273767.1| PREDICTED: peroxisome biogenesis protein 1-like [Vitis vinifera]
          Length = 1134

 Score =  326 bits (835), Expect = 1e-86
 Identities = 189/407 (46%), Positives = 257/407 (63%), Gaps = 2/407 (0%)
 Frame = -3

Query: 1216 ASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIPSLTL 1037
            A K+   + VVR++ S+SVAKGH+M+ QSLR +LRT LHSWVY+K+  IN KK+I  L+L
Sbjct: 278  ADKKEPCQVVVRLLISESVAKGHVMMAQSLRHYLRTGLHSWVYMKRCDINLKKEISLLSL 337

Query: 1036 SPCRLKSMGKNFALERNGLDS-DIHTNFKSKNMISRKDLFLGVNVPDWSLHEKLFSSLSC 860
            SPC+ K   KN ALE NGL+  D  TN K+K+M+   +    +N+ DWS HE+  ++LS 
Sbjct: 338  SPCQFKMFEKNKALEENGLEVLDSLTNHKTKSMLLETNSDTYMNISDWSTHEEFAAALSF 397

Query: 859  GFFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLLHFELK 680
                +  EK + ++ + +  + LL  W    L AI S   G E+ S+ +  ETLLHF + 
Sbjct: 398  ESPGSEDEKTSSQSGSRKGLQSLLQAWFLAHLDAINS-NAGTEIDSLVVGNETLLHFNVT 456

Query: 679  ILNLKSKNE-QPLFDRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICELALNTRSK 503
                 +  + Q   + S +  SS G+  VE+LYIL    +ES      N  EL+   R+K
Sbjct: 457  SDKFGTLGKFQASSNGSSKNRSSYGDLSVEILYILAIS-EESQHSGKFNAYELSFPERNK 515

Query: 502  VGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNRLTVLL 323
              NN+   + L G L LGEPVS   + E      F  T SSLSW+  A SD+ NRLT LL
Sbjct: 516  RNNNLGNLELLVGNLRLGEPVSFYCMKERTSAKGFSLTASSLSWIGTAASDIINRLTTLL 575

Query: 322  SPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISCSKLSL 143
            SP SG   ST+ LPLPGHVLI+GP GSGKT LA  VAK LEE E++L H++F+SCS+L+L
Sbjct: 576  SPASGMWFSTYNLPLPGHVLIYGPPGSGKTLLARTVAKALEEQEDLLTHIVFVSCSQLAL 635

Query: 142  EKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPSS 2
            EK+  IRQA+++ +S+AL H PS++IFDDLD I+  SS+  GSQPS+
Sbjct: 636  EKAVTIRQALSSYLSDALDHVPSLVIFDDLDLIISSSSDLEGSQPST 682


>ref|XP_002517570.1| peroxisome biogenesis factor, putative [Ricinus communis]
            gi|223543202|gb|EEF44734.1| peroxisome biogenesis factor,
            putative [Ricinus communis]
          Length = 1137

 Score =  322 bits (825), Expect = 2e-85
 Identities = 186/411 (45%), Positives = 264/411 (64%), Gaps = 8/411 (1%)
 Frame = -3

Query: 1210 KEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIPSLTLSP 1031
            K+ YR+ +VRI+FSDSVAKGH+M+ +SLR++L   LHSWVY+K  +++ K+DI SL+LSP
Sbjct: 282  KKEYRQAIVRIVFSDSVAKGHLMIARSLRLYLMASLHSWVYLKICTMDLKEDITSLSLSP 341

Query: 1030 CRLKSMGKNFALERNGLDS-DIHTNFKSKNMISR-KDLFLGVNVPDWSLHEKLFSSLSCG 857
            C  K  G++ A+E+N L+  D     K +N++S     ++G    DWS+H+++ ++LS  
Sbjct: 342  CHFKMPGQDNAIEKNSLEVLDQRIIQKPRNLVSGGSGSYMGT--VDWSVHDRILAALSND 399

Query: 856  FFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLLHFELKI 677
            F   GG++   +++  +  + LL  W   QL AIASF  G E  SV L KET+LHFE+K 
Sbjct: 400  FPCEGGQETIYQSNNRKGLRRLLQAWFLAQLDAIASFA-GSEANSVILGKETILHFEVKG 458

Query: 676  LNLKSKNEQPLFDRS-----LERGSSTGESVVELLYILTAKFKESSQDDLGNICELALNT 512
             +++S  +  +   S     +E+  + GE  +E L++LT   +ES         +L+ + 
Sbjct: 459  CDVESDRKDEILATSNSNGLIEKRKNNGELPLEFLFVLTIS-EESMHGRQACSYKLSFDE 517

Query: 511  RSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNRLT 332
            R K    V E   L GKL+LG PVS+ ++ E        + +SSLSWM    +DV NR  
Sbjct: 518  RKKDNLGVME---LFGKLKLGGPVSMYALKERNSHKGISANLSSLSWMGTTAADVINRTM 574

Query: 331  VLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISCSK 152
             LLSP SG L ST+ LP PGHVLI+GP GSGKT LA AVAK LEEHE++LAH++F+ CS 
Sbjct: 575  ALLSPTSGMLFSTYNLPFPGHVLIYGPHGSGKTILARAVAKSLEEHEDLLAHIVFVGCSA 634

Query: 151  LSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVG-SQPSS 2
            L+LEK+  IRQA++  ISEAL H+PS+IIFDDLD I+  SS+  G  QPS+
Sbjct: 635  LALEKASIIRQALSAYISEALDHAPSLIIFDDLDTIISSSSDGEGPPQPST 685


>gb|EXC24769.1| Peroxisome biogenesis protein 1 [Morus notabilis]
          Length = 1225

 Score =  322 bits (824), Expect = 3e-85
 Identities = 191/418 (45%), Positives = 267/418 (63%), Gaps = 12/418 (2%)
 Frame = -3

Query: 1219 SASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIPSLT 1040
            +ASK   R+ +VRI+FSDSVAKGH+M+ QSLR +L   LHSWVY+K  +I  +KDIPS++
Sbjct: 372  TASKLENRQAIVRILFSDSVAKGHVMISQSLRFYLGAGLHSWVYLKGRNI-LRKDIPSVS 430

Query: 1039 LSPCRLKSMGKNFALERNGLDS-DIHTNFKSKNMISRKDLFLGVNVPDWSLHEKLFSSLS 863
            LSPC  K + K+  LE+NGL+  D H N +  NM+ ++     V+V DWS H+++ ++LS
Sbjct: 431  LSPCHFKMIEKSKNLEKNGLEVFDNHKNGRRINMLLKRSSANYVDVVDWSTHDEVIAALS 490

Query: 862  CGFFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLLHFEL 683
                     K+  ++   +  + L+  W   Q+ AI+S T G+EV S+FL  ETL+H E+
Sbjct: 491  HESHYKEDGKSAFKDDNGRGLQNLMKVWFLAQVGAISS-TSGLEVNSLFLGSETLVHIEV 549

Query: 682  KILNLKSKNE-QPLFDRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICELALNTRS 506
            K  NL S+ + Q   +  LE    T +   E+LY+LT   +  S    G + EL  +  +
Sbjct: 550  KSHNLGSQEDVQASSNGFLENIKKTSKLTAEILYVLTIPVESHSG---GIVYELVFDELN 606

Query: 505  KVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNRLT-- 332
            K  N +    +L  KLE+G+PVS + V E  ID    + +SSLSWM   +SD+ NRL   
Sbjct: 607  KGHNTLQG--ALFEKLEMGDPVSFSCVRERIIDDDLSTNVSSLSWMGTTVSDIINRLNNN 664

Query: 331  --------VLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAH 176
                    VLLSP SG   S++ LPLPGHVLI+GP+GSGKT LA AVAK L+E E+ILAH
Sbjct: 665  LDEVRGMMVLLSPASGVWFSSYNLPLPGHVLIYGPTGSGKTLLAKAVAKFLQEREDILAH 724

Query: 175  VIFISCSKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPSS 2
            ++F+ CSKLSLEK+ +IRQA++  ISEAL ++PS++I DDLD I+  SS+S GSQ SS
Sbjct: 725  IVFVCCSKLSLEKAPSIRQALSGHISEALDNAPSLVILDDLDCIIASSSDSEGSQASS 782


>emb|CBI20540.3| unnamed protein product [Vitis vinifera]
          Length = 1114

 Score =  320 bits (821), Expect = 6e-85
 Identities = 188/406 (46%), Positives = 252/406 (62%), Gaps = 1/406 (0%)
 Frame = -3

Query: 1216 ASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIPSLTL 1037
            A K+   + VVR++ S+SVAKGH+M+ QSLR +LRT LHSWVY+K+  IN KK+I  L+L
Sbjct: 278  ADKKEPCQVVVRLLISESVAKGHVMMAQSLRHYLRTGLHSWVYMKRCDINLKKEISLLSL 337

Query: 1036 SPCRLKSMGKNFALERNGLDS-DIHTNFKSKNMISRKDLFLGVNVPDWSLHEKLFSSLSC 860
            SPC+ K   KN ALE NGL+  D  TN K+K+M+   +    +N+ DWS HE+  ++LS 
Sbjct: 338  SPCQFKMFEKNKALEENGLEVLDSLTNHKTKSMLLETNSDTYMNISDWSTHEEFAAALSF 397

Query: 859  GFFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLLHFELK 680
                +  EK + ++ + +  + LL  W    L AI S   G E+ S+ +  ETLLHF   
Sbjct: 398  ESPGSEDEKTSSQSGSRKGLQSLLQAWFLAHLDAINS-NAGTEIDSLVVGNETLLHF--- 453

Query: 679  ILNLKSKNEQPLFDRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICELALNTRSKV 500
              N+ S N               G+  VE+LYIL    +ES      N  EL+   R+K 
Sbjct: 454  --NVTSDNY--------------GDLSVEILYILAIS-EESQHSGKFNAYELSFPERNKR 496

Query: 499  GNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNRLTVLLS 320
             NN+   + L G L LGEPVS   + E      F  T SSLSW+  A SD+ NRLT LLS
Sbjct: 497  NNNLGNLELLVGNLRLGEPVSFYCMKERTSAKGFSLTASSLSWIGTAASDIINRLTTLLS 556

Query: 319  PYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISCSKLSLE 140
            P SG   ST+ LPLPGHVLI+GP GSGKT LA  VAK LEE E++L H++F+SCS+L+LE
Sbjct: 557  PASGMWFSTYNLPLPGHVLIYGPPGSGKTLLARTVAKALEEQEDLLTHIVFVSCSQLALE 616

Query: 139  KSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPSS 2
            K+  IRQA+++ +S+AL H PS++IFDDLD I+  SS+  GSQPS+
Sbjct: 617  KAVTIRQALSSYLSDALDHVPSLVIFDDLDLIISSSSDLEGSQPST 662


>ref|XP_006448771.1| hypothetical protein CICLE_v10014090mg [Citrus clementina]
            gi|557551382|gb|ESR62011.1| hypothetical protein
            CICLE_v10014090mg [Citrus clementina]
          Length = 1134

 Score =  319 bits (818), Expect = 1e-84
 Identities = 185/415 (44%), Positives = 259/415 (62%), Gaps = 1/415 (0%)
 Frame = -3

Query: 1243 NEENYGLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQ 1064
            ++E  G +S  K+  R+ VVR++FS+SVAKGH+ + ++LR++L   LHSWVY+KK ++N 
Sbjct: 271  SKEISGGASTDKKECRQAVVRLLFSNSVAKGHVKIARALRLYLNAGLHSWVYLKKCTVNL 330

Query: 1063 KKDIPSLTLSPCRLKSMGKNFALERNGLDSDIHTNFKSKNMISRKDLFLGVNVPDWSLHE 884
            KK+IP ++LSPC  K + K+ A    GL+ D + N K+K M+      + ++  D S  +
Sbjct: 331  KKEIPMVSLSPCHFKMLEKDKAFGI-GLELD-NKNHKTKKMLENTSSGIYMDDGDLSAED 388

Query: 883  KLFSSLSCGFFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKE 704
            ++ ++LS    +   E+A  +    +  + LLH WL  QL A+AS   G E  ++ LS E
Sbjct: 389  EVIAALSSEPSLKEDEEAVYQFENKKGLECLLHTWLLAQLNAVAS-NIGSEFNTLVLSNE 447

Query: 703  TLLHFELKILNLKSKNEQPLF-DRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICE 527
            TLLHFE+K     +  + P   + +LE  +   E   E+  +LT   +ES      N  E
Sbjct: 448  TLLHFEVKGYKSGTYGKVPASCNGALENKTKARELRTEIFCVLTFS-EESLHGGKNNAYE 506

Query: 526  LALNTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDV 347
            L L  R +  NN      L GKL  G+PVS  +V E      F S +SSLSWM    SDV
Sbjct: 507  LTLEARGQQNNNTEAVCQLFGKLNSGDPVSFYTVKERGSTQGFDSNVSSLSWMGTTASDV 566

Query: 346  TNRLTVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIF 167
             NR+ VLLSP SG   ST+ LPLPGH+LIHGP GSGKTSLA AVAK LE H++++AH++F
Sbjct: 567  INRIKVLLSPDSGLWFSTYHLPLPGHILIHGPPGSGKTSLAKAVAKSLEHHKDLVAHIVF 626

Query: 166  ISCSKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPSS 2
            + CS+LSLEK   IRQA++N ISEAL H+PS++IFDDLD I+  SS+  GSQPS+
Sbjct: 627  VCCSRLSLEKGPIIRQALSNFISEALDHAPSIVIFDDLDSIISSSSDPEGSQPST 681


>ref|XP_006468418.1| PREDICTED: peroxisome biogenesis protein 1-like [Citrus sinensis]
          Length = 1134

 Score =  315 bits (806), Expect = 3e-83
 Identities = 183/415 (44%), Positives = 257/415 (61%), Gaps = 1/415 (0%)
 Frame = -3

Query: 1243 NEENYGLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQ 1064
            ++E  G +S  K+  R+ VV ++FSDSVAKGH+ + ++LR++L   LHSWVY+KK ++N 
Sbjct: 271  SKEISGGASTDKKECRQAVVHLLFSDSVAKGHVKIARALRLYLNAGLHSWVYLKKCTVNL 330

Query: 1063 KKDIPSLTLSPCRLKSMGKNFALERNGLDSDIHTNFKSKNMISRKDLFLGVNVPDWSLHE 884
            KK+IP ++LSPC  K + K+ A    GL+ D + N K+K M+ +    + ++  D S  +
Sbjct: 331  KKEIPMVSLSPCHFKMLEKDKAFGI-GLELD-NKNHKTKKMLEKTSSGIYMDDGDLSAED 388

Query: 883  KLFSSLSCGFFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKE 704
             + ++LS        E+A  +    +  + LLH WL  QL A+AS   G E  ++ LS E
Sbjct: 389  DIIAALSSEPSSKEDEEAVYQFENKKGLECLLHTWLLAQLTAVAS-NIGSEFNTLVLSNE 447

Query: 703  TLLHFELKILNLKSKNEQPLF-DRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICE 527
            TLLHFE+K     +  + P   + +LE  +   E   E+  +LT   +ES      N  E
Sbjct: 448  TLLHFEVKGYKSGTYGKVPASCNGALENKTKARELRTEIFCVLTFS-EESLHGGKNNAYE 506

Query: 526  LALNTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDV 347
            L L  R +  NN    + L GKL  G+ VS  +V E      F S +SSLSWM    SDV
Sbjct: 507  LTLEARGQQNNNTEAVRQLFGKLNSGDSVSFYTVKERGSTQGFDSNVSSLSWMGTTASDV 566

Query: 346  TNRLTVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIF 167
             NR+ VLLSP SG   ST+ LPLPGH+LIHGP GSGKTSLA AVAK LE H++++AH++F
Sbjct: 567  INRIKVLLSPDSGLWFSTYHLPLPGHILIHGPPGSGKTSLAKAVAKSLEHHKDLVAHIVF 626

Query: 166  ISCSKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPSS 2
            + CS+LSLEK   IRQA++N ISEAL H+PS++IFD+LD I+  SS+  GSQPS+
Sbjct: 627  VCCSRLSLEKGPIIRQALSNFISEALDHAPSIVIFDNLDSIISSSSDPEGSQPST 681


>ref|XP_002298113.2| hypothetical protein POPTR_0001s17400g [Populus trichocarpa]
            gi|550347541|gb|EEE82918.2| hypothetical protein
            POPTR_0001s17400g [Populus trichocarpa]
          Length = 1133

 Score =  308 bits (789), Expect = 3e-81
 Identities = 190/418 (45%), Positives = 261/418 (62%), Gaps = 5/418 (1%)
 Frame = -3

Query: 1240 EENYGLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQK 1061
            E N G  +  KE + + +VR++FSDSVAKGH+M+ +SLR++LR  LHSW+Y+K + I   
Sbjct: 274  EANNGTLTDKKE-FHQAIVRLLFSDSVAKGHVMIARSLRLYLRAGLHSWIYLKGW-ITDL 331

Query: 1060 KDIPSLTLSPCRLKSMGKNFALERNGLDSDIHTNFKSKNMISRKDLFLGVNVPDWSLHEK 881
            KDI SL+LSPC  K  G++  +E+ GL+  I  +   K   +  D ++  +  DWS+H+K
Sbjct: 332  KDIASLSLSPCYFKMPGQDKPVEKPGLEL-IDIDKLQKPRKTSLDTYM--DAVDWSIHDK 388

Query: 880  LFSSLSCGFFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKET 701
            +F+SLS  F     E+        +  + LL  W   QL AIAS T GVEV S+ + KET
Sbjct: 389  IFASLSQDFPSKQEEETGYLPDNKKGLRRLLQAWYRAQLDAIAS-TSGVEVNSLIVGKET 447

Query: 700  LLHFELKI----LNLKSKNEQPLFDR-SLERGSSTGESVVELLYILTAKFKESSQDDLGN 536
            LLHFE+K     ++ K++ +   +   SL+  + TG + +E LY+L+   +ES      N
Sbjct: 448  LLHFEVKGYDFGIDRKTREKASSYSNGSLKNRNKTGGTQLEFLYVLSIP-EESVHGIKVN 506

Query: 535  ICELALNTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAI 356
               LA N R K    V     L  +L+LG PVS  S+ ES     F S  SSLSWM    
Sbjct: 507  AYSLAFNERKKDNLGV----GLFERLKLGGPVSFYSLKESNSFTGFSSNASSLSWMGTTA 562

Query: 355  SDVTNRLTVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAH 176
            SDV NRL VLL P      +T+ LPLPGH+LI+GP GSGKT+LA AVAK LEE E++ AH
Sbjct: 563  SDVINRLMVLLYPPYSTWFNTYNLPLPGHILIYGPHGSGKTTLARAVAKSLEEREDLFAH 622

Query: 175  VIFISCSKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPSS 2
            ++F+SCS L+L+K+ AIRQ ++  ISEAL H+PS++IFDDLD IV  SS+S GSQPS+
Sbjct: 623  IVFVSCSGLTLDKASAIRQTLSASISEALDHAPSLVIFDDLDTIVSASSDSEGSQPST 680


>ref|XP_004293758.1| PREDICTED: peroxisome biogenesis protein 1-like [Fragaria vesca
            subsp. vesca]
          Length = 1129

 Score =  305 bits (782), Expect = 2e-80
 Identities = 191/431 (44%), Positives = 251/431 (58%), Gaps = 16/431 (3%)
 Frame = -3

Query: 1246 RNEENYGL---SSASKEAYRRT----------VVRIIFSDSVAKGHIMLPQSLRVFLRTD 1106
            +N E+ GL   SS  KE+  R           VVR++ SDSVAKGH+M+ QSLR++LR  
Sbjct: 255  KNSESDGLRIGSSTPKESSVRVPNDKKDNHQAVVRLLISDSVAKGHLMIAQSLRLYLRAG 314

Query: 1105 LHSWVYVKKYSINQKKDIPSLTLSPCRLKSMGKNFALERNGLDS-DIHTNFKSKNMISRK 929
            LHSWVY+K      K ++P  +LSPC  K   K  A+ERNGL   D H   K  +M+   
Sbjct: 315  LHSWVYLKGCGGILKNNMPMCSLSPCHFKISPKEKAVERNGLQVLDRHKTRKKNDMLLTP 374

Query: 928  DLFLGVNVPDWSLHEKLFSSLSCGFFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIAS 749
                 ++V DWS H+K+ +  S        E+           + LL  W+  QL AI S
Sbjct: 375  GSSTYIDVVDWSTHDKVVAEFSSKSSCEEDEEPAHHYDKGNGVESLLKAWILAQLDAITS 434

Query: 748  FTEGVEVMSVFLSKETLLHFELK--ILNLKSKNEQPLFDRSLERGSSTGESVVELLYILT 575
               GVEV S+ L  ETLLHFE+K     +K K+++   D  L   +   E  VE+LY+LT
Sbjct: 435  -KAGVEVNSLILGNETLLHFEVKGNQSGIKGKDQESSND-ILANNNMNPEVPVEILYVLT 492

Query: 574  AKFKESSQDDLGNICELALNTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFK 395
               KES +   GN  EL  + R+K  NN  E    H    +GEPVS  SV E   D    
Sbjct: 493  IS-KESQRG--GNAYELVFDERNKDNNNTLESLEKH----MGEPVSFYSVRERMYDKNIT 545

Query: 394  STISSLSWMEAAISDVTNRLTVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAV 215
            S ISSLSWM    S+V NR+ VLL+P  G   S+  LPLPGHVLIHGP GSGKT LA  V
Sbjct: 546  SDISSLSWMGTTASEVLNRMLVLLTPAYGVWFSSQNLPLPGHVLIHGPPGSGKTLLARTV 605

Query: 214  AKCLEEHEEILAHVIFISCSKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMF 35
             +CLEEH  +LAH++++ CS+L++EK+  +RQA+++ ISEAL H+PS++I DDLD IV  
Sbjct: 606  GRCLEEHGGLLAHIVYVCCSQLAMEKALTVRQALSSYISEALDHAPSLVILDDLDSIVSS 665

Query: 34   SSESVGSQPSS 2
            SS+  GSQPS+
Sbjct: 666  SSDLEGSQPST 676


>ref|XP_006365432.1| PREDICTED: peroxisome biogenesis protein 1-like [Solanum tuberosum]
          Length = 1128

 Score =  299 bits (766), Expect = 1e-78
 Identities = 173/407 (42%), Positives = 247/407 (60%), Gaps = 4/407 (0%)
 Frame = -3

Query: 1210 KEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIPSLTLSP 1031
            K    + +VR+IFS+SVAKGHIMLP+S+R++LR +LHS VYVK++++  KK+IP ++LSP
Sbjct: 284  KHNIHQAMVRLIFSESVAKGHIMLPRSIRLYLRAELHSRVYVKRFNVKLKKEIPLVSLSP 343

Query: 1030 CRLKSMGKNFALERNGLDSDIHTNF-KSKNMISRKDLFLGVNVPDWSLHEKLFSSLSCGF 854
            C  K   +    E N  ++    N+ K+   + R +  + +   DWS+HEK+ ++ SC  
Sbjct: 344  CEFKIFQETGVSEENSSEALGKNNYNKTLTTLFRTNSDIEMGTSDWSIHEKIAAAFSCES 403

Query: 853  FVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLLHFELKIL 674
                 E +  ++   +D   +LH W   QL A+ +   GVEV S+ L   TLLHF+ K  
Sbjct: 404  SKEDKETSI-KSDLKKDIAAILHRWCLAQLHAV-TIKAGVEVKSLILGNTTLLHFKAKD- 460

Query: 673  NLKSKNEQPLFDRSLERGSST---GESVVELLYILTAKFKESSQDDLGNICELALNTRSK 503
                        RS++ G  T   GE+ ++ +Y+L+    +S +D+  +  E+A +  SK
Sbjct: 461  -----------SRSIKHGGQTMNGGETSLDAMYVLSTT-DDSLRDETIDAYEVAFDEGSK 508

Query: 502  VGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNRLTVLL 323
            +  +   ++   GKL+LG  +S+ +V E         T SSL WM  A  DV NRL VLL
Sbjct: 509  LTTSPKNFEPWLGKLQLGNGLSIRTVREKLFAKSTSLTTSSLDWMGTAAPDVINRLVVLL 568

Query: 322  SPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISCSKLSL 143
            S  S  L S +  PLPGH+LIHGPSGSGKT LA   AK  EE E+ILAH+IF+SCSKL+L
Sbjct: 569  SSASWMLSSAYDFPLPGHILIHGPSGSGKTLLATVAAKFAEESEDILAHIIFLSCSKLAL 628

Query: 142  EKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPSS 2
            EK  AIRQ + + +++AL H+PSV++FDDLD IV  SSES  SQPSS
Sbjct: 629  EKPSAIRQTLLSYVADALDHAPSVVVFDDLDSIVAASSESEASQPSS 675


>ref|XP_004237362.1| PREDICTED: peroxisome biogenesis protein 1-like [Solanum
            lycopersicum]
          Length = 1128

 Score =  291 bits (745), Expect = 4e-76
 Identities = 170/407 (41%), Positives = 244/407 (59%), Gaps = 4/407 (0%)
 Frame = -3

Query: 1210 KEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIPSLTLSP 1031
            K    + +VR+IFS+SVAKGHIMLP+S+R++L+ +LHS VYVK++++  KK+IP + LSP
Sbjct: 284  KHDIHQAMVRLIFSESVAKGHIMLPRSIRLYLKAELHSCVYVKRFNVKLKKEIPPVLLSP 343

Query: 1030 CRLKSMGKNFALERNGLDS-DIHTNFKSKNMISRKDLFLGVNVPDWSLHEKLFSSLSCGF 854
            C  K   +    E N  ++   + N K+   + R +  + +   DWS+HE++ ++ S   
Sbjct: 344  CEFKIFQETGVSEENNAEALGKNNNNKTLTTVLRTNSDIEMGSSDWSIHEEIAAAFSYES 403

Query: 853  FVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLLHFELKIL 674
                 E +  ++   +D   +LH W   QL A+     GVEV S+ L   TLLHF+ K  
Sbjct: 404  SKEDKEMSI-KSDIKKDIAAILHRWCLAQLHAV-KIKAGVEVKSLILGNTTLLHFKAKD- 460

Query: 673  NLKSKNEQPLFDRSLERGSST---GESVVELLYILTAKFKESSQDDLGNICELALNTRSK 503
                        RS++ G  T   GE+ ++ +Y+L+     S +D+  +  E+A +  SK
Sbjct: 461  -----------SRSIKHGVQTMNGGETSLDAMYVLSTT-DGSLRDEAIDAYEVAFDEGSK 508

Query: 502  VGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNRLTVLL 323
            +  +   ++   GKL+LG  +S+ +V E         T SSL WM  A  DV NRL VLL
Sbjct: 509  LTTSPKSFEPWLGKLQLGNGISIRTVREKLFAKSTSLTTSSLDWMGTAAPDVINRLVVLL 568

Query: 322  SPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISCSKLSL 143
            S  S  L S +  PLPGH+LIHGPSGSGKT LA   AK  EE E+ILAH+IF+SCSK++L
Sbjct: 569  SSASWMLSSAYDFPLPGHILIHGPSGSGKTLLATVAAKFAEESEDILAHIIFLSCSKIAL 628

Query: 142  EKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPSS 2
            EK  AIRQA+ + +++AL H+PSV++FDDLD IV  SSES  SQPSS
Sbjct: 629  EKPSAIRQALLSYVADALDHAPSVVVFDDLDSIVAASSESEASQPSS 675


>ref|XP_006399345.1| hypothetical protein EUTSA_v10012497mg [Eutrema salsugineum]
            gi|557100435|gb|ESQ40798.1| hypothetical protein
            EUTSA_v10012497mg [Eutrema salsugineum]
          Length = 1127

 Score =  288 bits (736), Expect = 4e-75
 Identities = 170/411 (41%), Positives = 250/411 (60%), Gaps = 2/411 (0%)
 Frame = -3

Query: 1228 GLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIP 1049
            G  SA KE  RR ++R++FSD  AKGH+M+ +SLR++L   LHSWVY++  ++N  K+IP
Sbjct: 276  GTPSAKKEP-RRAILRLVFSDLAAKGHLMMVESLRLYLGAGLHSWVYLRGCNVNVNKEIP 334

Query: 1048 SLTLSPCRLKSMGKNFALERNGLDSDIHTNFKSKNMISRKDLFLGVNVPDWSLHEKLFSS 869
            +L+LS C  K   K   L+R G D   + +F  K+   R  L   V+V DWS+H+K+ ++
Sbjct: 335  ALSLSSCVFKISEKEKVLDR-GTDMLGNHSFNRKSSHPRSGLTTNVDVLDWSVHDKVLTA 393

Query: 868  LSCG-FFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLLH 692
            LS     +   +    +    +  + L   W   QL AIAS T GV+V S+ + +ETL H
Sbjct: 394  LSSEELHIKEEQDNAYQLKNRKGLERLTRLWSLAQLDAIASLT-GVDVSSLIVGRETLFH 452

Query: 691  FELKIL-NLKSKNEQPLFDRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICELALN 515
            FE++ L + K ++ QPL +  LE         +E+LY++     E S  D   + EL L+
Sbjct: 453  FEVRGLESYKPRDGQPLVNDRLENRKKDKNVPLEILYVMKVS-DEPSLGDKFAVYELTLD 511

Query: 514  TRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNRL 335
             RS+  +NV   + +  K+ LGEP+  +S  E + +    + +SSL+WM + + DV  R+
Sbjct: 512  -RSEKRDNVGHIEPVLEKMNLGEPIFFSSAKERHCNKGVSTDLSSLAWMGSIVLDVIKRM 570

Query: 334  TVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISCS 155
            TVLLSP +G   S F +P PGH+LI+GP GSGKT LA A AK  EE +++LAHVI +SCS
Sbjct: 571  TVLLSPEAGMWFSKFSIPSPGHILIYGPPGSGKTILARAAAKYFEEQKDLLAHVILVSCS 630

Query: 154  KLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPSS 2
             L+LEK Q I Q ++  I+E L H+PSVII DDLD I+  SS++ G+Q S+
Sbjct: 631  ALALEKVQHIHQVLSGVIAEGLEHAPSVIILDDLDSIISSSSDTEGTQASN 681


>dbj|BAB09996.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1125

 Score =  285 bits (730), Expect = 2e-74
 Identities = 171/411 (41%), Positives = 247/411 (60%), Gaps = 3/411 (0%)
 Frame = -3

Query: 1228 GLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIP 1049
            G SSA KE  R+ ++R++FSD  AKGH+M+ +SLR++L   LHSWVY++  ++N+ K+IP
Sbjct: 284  GTSSAKKEP-RQAILRLVFSDLAAKGHLMMVESLRLYLGAGLHSWVYLRGCNVNEDKEIP 342

Query: 1048 SLTLSPCRLKSMGKNFALERNGLDSDIHTNFKSKNMISRKDLFLGVNVPDWSLHEKLFSS 869
            +L+LSPC  K       L++ G D   + N   K+      L   V+V DWS+H+K+ ++
Sbjct: 343  ALSLSPCVFKISENEKVLDK-GTDRLGNNNSVRKSSHPPSGLSTYVDVVDWSVHDKVVTA 401

Query: 868  LSCGFFVNGGEKATPENHTAQDK--KFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLL 695
            LS     + G      NH    K  ++L   W   QL A+AS T GV+V S+ + +ET  
Sbjct: 402  LSSEGLHDEG------NHDKNKKGLEYLTRLWSLAQLDAMASVT-GVDVSSLIVGRETFF 454

Query: 694  HFELKIL-NLKSKNEQPLFDRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICELAL 518
            HFE++ L + KS + QP  +   E G     + +E+LY++T   +    D      +L+L
Sbjct: 455  HFEVRGLESYKSIDGQPSVNDRWESGKKDKHTPLEILYVMTVSDESLLGDKFAGY-DLSL 513

Query: 517  NTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNR 338
            +   K  N VH    L  K+ LGEP+ L S  E++ +      ISSL+WM   +SDV  R
Sbjct: 514  DRSEKSDNVVHIEPVLE-KMNLGEPIYLKSAKETHCNKGVSPDISSLTWMGPIVSDVIKR 572

Query: 337  LTVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISC 158
            +TVLLSP +G   S F +P PGH+LI+GP GSGKT LA A AK  EE +++LAHVI +SC
Sbjct: 573  MTVLLSPAAGMWFSKFKIPSPGHILIYGPPGSGKTILARAAAKYFEEQKDLLAHVILVSC 632

Query: 157  SKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPS 5
            S L+LEK Q I   +++ I+E L H+PSVII DDLD I+  SS++ G+Q S
Sbjct: 633  STLALEKVQHIHHVLSSVIAEGLEHAPSVIILDDLDSIISSSSDTEGTQAS 683


>gb|AAG44817.1| peroxisome biogenesis protein PEX1 [Arabidopsis thaliana]
          Length = 1119

 Score =  285 bits (730), Expect = 2e-74
 Identities = 171/411 (41%), Positives = 247/411 (60%), Gaps = 3/411 (0%)
 Frame = -3

Query: 1228 GLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIP 1049
            G SSA KE  R+ ++R++FSD  AKGH+M+ +SLR++L   LHSWVY++  ++N+ K+IP
Sbjct: 273  GTSSAKKEP-RQAILRLVFSDLAAKGHLMMVESLRLYLGAGLHSWVYLRGCNVNEDKEIP 331

Query: 1048 SLTLSPCRLKSMGKNFALERNGLDSDIHTNFKSKNMISRKDLFLGVNVPDWSLHEKLFSS 869
            +L+LSPC  K       L++ G D   + N   K+      L   V+V DWS+H+K+ ++
Sbjct: 332  ALSLSPCVFKISENEKVLDK-GTDRLGNNNSVRKSSHPPSGLSTYVDVVDWSVHDKVVTA 390

Query: 868  LSCGFFVNGGEKATPENHTAQDK--KFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLL 695
            LS     + G      NH    K  ++L   W   QL A+AS T GV+V S+ + +ET  
Sbjct: 391  LSSEGLHDEG------NHDKNKKGLEYLTRLWSLAQLDAMASVT-GVDVSSLIVGRETFF 443

Query: 694  HFELKIL-NLKSKNEQPLFDRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICELAL 518
            HFE++ L + KS + QP  +   E G     + +E+LY++T   +    D      +L+L
Sbjct: 444  HFEVRGLESYKSIDGQPSVNDRWESGKKDKHTPLEILYVMTVSDESLLGDKFAGY-DLSL 502

Query: 517  NTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNR 338
            +   K  N VH    L  K+ LGEP+ L S  E++ +      ISSL+WM   +SDV  R
Sbjct: 503  DRSEKSDNVVHIEPVLE-KMNLGEPIYLKSAKETHCNKGVSPDISSLTWMGPIVSDVIKR 561

Query: 337  LTVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISC 158
            +TVLLSP +G   S F +P PGH+LI+GP GSGKT LA A AK  EE +++LAHVI +SC
Sbjct: 562  MTVLLSPAAGMWFSKFKIPSPGHILIYGPPGSGKTILARAAAKYFEEQKDLLAHVILVSC 621

Query: 157  SKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPS 5
            S L+LEK Q I   +++ I+E L H+PSVII DDLD I+  SS++ G+Q S
Sbjct: 622  STLALEKVQHIHHVLSSVIAEGLEHAPSVIILDDLDSIISSSSDTEGTQAS 672


>ref|NP_196464.2| peroxisome biogenesis protein 1 [Arabidopsis thaliana]
            gi|322967561|sp|Q9FNP1.2|PEX1_ARATH RecName:
            Full=Peroxisome biogenesis protein 1; AltName:
            Full=Peroxin-1; Short=AtPEX1 gi|332003924|gb|AED91307.1|
            peroxisome biogenesis protein 1 [Arabidopsis thaliana]
          Length = 1130

 Score =  285 bits (730), Expect = 2e-74
 Identities = 171/411 (41%), Positives = 247/411 (60%), Gaps = 3/411 (0%)
 Frame = -3

Query: 1228 GLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIP 1049
            G SSA KE  R+ ++R++FSD  AKGH+M+ +SLR++L   LHSWVY++  ++N+ K+IP
Sbjct: 284  GTSSAKKEP-RQAILRLVFSDLAAKGHLMMVESLRLYLGAGLHSWVYLRGCNVNEDKEIP 342

Query: 1048 SLTLSPCRLKSMGKNFALERNGLDSDIHTNFKSKNMISRKDLFLGVNVPDWSLHEKLFSS 869
            +L+LSPC  K       L++ G D   + N   K+      L   V+V DWS+H+K+ ++
Sbjct: 343  ALSLSPCVFKISENEKVLDK-GTDRLGNNNSVRKSSHPPSGLSTYVDVVDWSVHDKVVTA 401

Query: 868  LSCGFFVNGGEKATPENHTAQDK--KFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLL 695
            LS     + G      NH    K  ++L   W   QL A+AS T GV+V S+ + +ET  
Sbjct: 402  LSSEGLHDEG------NHDKNKKGLEYLTRLWSLAQLDAMASVT-GVDVSSLIVGRETFF 454

Query: 694  HFELKIL-NLKSKNEQPLFDRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICELAL 518
            HFE++ L + KS + QP  +   E G     + +E+LY++T   +    D      +L+L
Sbjct: 455  HFEVRGLESYKSIDGQPSVNDRWESGKKDKHTPLEILYVMTVSDESLLGDKFAGY-DLSL 513

Query: 517  NTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNR 338
            +   K  N VH    L  K+ LGEP+ L S  E++ +      ISSL+WM   +SDV  R
Sbjct: 514  DRSEKSDNVVHIEPVLE-KMNLGEPIYLKSAKETHCNKGVSPDISSLTWMGPIVSDVIKR 572

Query: 337  LTVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISC 158
            +TVLLSP +G   S F +P PGH+LI+GP GSGKT LA A AK  EE +++LAHVI +SC
Sbjct: 573  MTVLLSPAAGMWFSKFKIPSPGHILIYGPPGSGKTILARAAAKYFEEQKDLLAHVILVSC 632

Query: 157  SKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPS 5
            S L+LEK Q I   +++ I+E L H+PSVII DDLD I+  SS++ G+Q S
Sbjct: 633  STLALEKVQHIHHVLSSVIAEGLEHAPSVIILDDLDSIISSSSDTEGTQAS 683


>ref|XP_003529444.1| PREDICTED: peroxisome biogenesis protein 1-like isoform X1 [Glycine
            max]
          Length = 1130

 Score =  284 bits (727), Expect = 5e-74
 Identities = 167/402 (41%), Positives = 247/402 (61%), Gaps = 2/402 (0%)
 Frame = -3

Query: 1210 KEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIPSLTLSP 1031
            K  YR+T+V+++ S+SVA+GH+M+ +SLR++LR  LHSWVY+K   I  +K IPS +L P
Sbjct: 283  KTEYRQTIVQLLISESVAEGHVMVAKSLRLYLRASLHSWVYLKACDIILEKSIPSTSLFP 342

Query: 1030 CRLKSMGKNFALERNGLDS-DIHTNFKSKNMISRKDLFLGVNVPDWSLHEKLFSSLSCGF 854
            C+ K + +  A+E++GL+    H N   +N+ ++    + V+  DWS+  ++ ++LS   
Sbjct: 343  CQFKLLKQENAVEKDGLEVFHGHKNHIDENLHAKPTSGVFVDTIDWSIQNEVAAALSDES 402

Query: 853  FVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLLHFELKIL 674
                 E+AT ++   +  + L+  W   QLKAI S + G+EV S+ +  +TLLHFE+   
Sbjct: 403  SYKAEEEATNQSQNQRGLQSLVRLWYIMQLKAITSIS-GMEVSSLIIGNKTLLHFEVSCY 461

Query: 673  NLKSKNEQPLFDRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICELALNTRSKVGN 494
             L++  +  L   S E      E    +L++LT   +      L N  E+AL  R    N
Sbjct: 462  KLRNNGKVQLAYNSSENSGKAAE----MLFLLTFGEEYLHHGKL-NAYEVALGGRL---N 513

Query: 493  NVHEWQ-SLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNRLTVLLSP 317
            N++     L  +++L +PVS++S+ E   +    S +SSL WME A  DV NR+ +LL  
Sbjct: 514  NINIGDLKLFERMKLCDPVSIHSIEERASEDHISSNVSSLGWMEKAADDVINRMLILLCS 573

Query: 316  YSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISCSKLSLEK 137
             SG    +  LPLPGHVLI+GPSGSGKT LA  VAK LE  E+ILAH+IF+SCSKL+LEK
Sbjct: 574  ASGLWFGSHNLPLPGHVLIYGPSGSGKTILARTVAKSLENREDILAHIIFVSCSKLALEK 633

Query: 136  SQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQ 11
               IRQ + N ++EAL H+PSV+IFDDLD I+  + +S GSQ
Sbjct: 634  VPVIRQELANHVTEALNHAPSVVIFDDLDSIIS-TPDSEGSQ 674


>ref|XP_006853404.1| hypothetical protein AMTR_s00032p00152530 [Amborella trichopoda]
            gi|548857057|gb|ERN14871.1| hypothetical protein
            AMTR_s00032p00152530 [Amborella trichopoda]
          Length = 1113

 Score =  284 bits (726), Expect = 6e-74
 Identities = 173/417 (41%), Positives = 238/417 (57%), Gaps = 5/417 (1%)
 Frame = -3

Query: 1240 EENYGLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQK 1061
            E+N G   +     R   V I  SDSVA+GH+ML +SLR++++ DLH+WV+V + S + K
Sbjct: 263  EKNNGWLRSGTMVPRHATVCISLSDSVARGHVMLQRSLRLYIKADLHTWVHVWRCSSHVK 322

Query: 1060 KDIPSLTLSPCRLKSMGKNFALERNGLDSDIHTNFKSKNMISRKDLFLGVNVPDWSLHEK 881
            KD  SL LSPC  K +  +  LE N    +   + K+ +M    D      V DWS HE+
Sbjct: 323  KDA-SLILSPCHFK-LETDKLLEDNANLFEFRNSLKTNSMHQNIDSIFNEEVMDWSTHEE 380

Query: 880  LFSSLSCGFFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKET 701
               +L  G   +G  +   E    + K+ L+  W  GQL  +A+     +V S+ L +ET
Sbjct: 381  FIEALPSGCHGHGENEHDCETCAVKQKERLVQIWTMGQLNIMATLNGVDDVKSLVLGRET 440

Query: 700  LLHFELKILNLKSKNEQPLFDRSLERGSS-----TGESVVELLYILTAKFKESSQDDLGN 536
            +LHFE+         +  L   S + GS      + +S +ELL++LT    ES   +   
Sbjct: 441  ILHFEV---------DMGLTFGSCKTGSKGTINMSDKSPLELLFLLTVTSDESDLGEQYE 491

Query: 535  ICELALNTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAI 356
              ELA +T +         +    KL+ G PV  +   E      F S++SSLSWM  A+
Sbjct: 492  SYELAFSTVNSSSEKHGGLELQFEKLDFGGPVCFDCPNEKCFGRSFSSSVSSLSWMAVAL 551

Query: 355  SDVTNRLTVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAH 176
            +D+ NRLTVLLSP SGKL S   LPLPGHVL+HGP GSGKT LAMAVAK LE  ++ILAH
Sbjct: 552  TDIINRLTVLLSPSSGKLFSNLDLPLPGHVLVHGPPGSGKTLLAMAVAKHLEGSKDILAH 611

Query: 175  VIFISCSKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPS 5
            ++FI+CSKL+LE    IR+ +   ISEAL H P+++IFDDLD ++  SSES GSQ S
Sbjct: 612  IVFINCSKLALENVNTIRETLNGYISEALDHPPALVIFDDLDALIS-SSESDGSQSS 667


>gb|EOY27465.1| Peroxisome biogenesis protein 1 [Theobroma cacao]
          Length = 1153

 Score =  281 bits (719), Expect = 4e-73
 Identities = 190/447 (42%), Positives = 257/447 (57%), Gaps = 34/447 (7%)
 Frame = -3

Query: 1240 EENYGLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSW----------- 1094
            E N G+S+ +KE +R+ +V ++ SDSVA+GH+M+ +SLR++LR  LHS            
Sbjct: 271  EANSGISTDNKE-FRQVIVHLLISDSVAEGHVMITRSLRLYLRAGLHSCMLNLSKNQLLI 329

Query: 1093 --------VYVKKYSINQKKDIPSLTLSPCRLKSMGKNFALERNGLDS-DIHTNFKSKNM 941
                    VY+K Y++  KK+I  L+LSPC  K +  +   + NGL+  D H   + KN 
Sbjct: 330  LLYLPRKGVYLKGYNVALKKEISVLSLSPCHFKVVAND---KENGLEVLDGHKTRRMKNS 386

Query: 940  ISRKDLFLGVNVPDWSLHEKLFSSLSCGFFVNGGEKATPENHTAQDKKFLLHYWLTGQLK 761
             S   L     V +WS H+ + + LS  F     E ++ E+ T +  + LL  W   QL 
Sbjct: 387  GSGTSL----EVVNWSTHDDVVAVLSSEFPFQEAEDSSQED-TKKGLECLLRAWFLAQLD 441

Query: 760  AIASFTEGVEVMSVFLSKETLLHFELKILNLKSKNEQPLFDRS--LERGSSTGESVVELL 587
            AIAS   G EV ++ L  E LLHFE+   N        L   +   E+ + T +  VE+ 
Sbjct: 442  AIAS-NAGTEVKTLVLGNENLLHFEV---NRYDSGTYGLVSSNGFSEKRNKTKDLPVEIS 497

Query: 586  YILTAKFKESSQDDLGNICELALNTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYID 407
            YILT   +E       N  ELAL+ R+K  N+V     L GKL LG P+SL SV +    
Sbjct: 498  YILTIS-EELLHSGNVNAYELALDDRNK-RNDVQGGFELFGKLNLGNPMSLYSVKDRTSV 555

Query: 406  GKFKSTISSLSWMEAAISDVTNR------------LTVLLSPYSGKLLSTFGLPLPGHVL 263
              F +  SSLSWM    SDV N             + VLL+P SG   ST+ LPLPGHVL
Sbjct: 556  KGFSTNASSLSWMGVTASDVINSRCFKGLLKIVIGMMVLLAPASGIWFSTYNLPLPGHVL 615

Query: 262  IHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISCSKLSLEKSQAIRQAITNCISEALAH 83
            I+GP+GSGKT LA AVAK LEEH+++LAHVIFI CS L+LEK   IRQA+++ +SEAL H
Sbjct: 616  IYGPAGSGKTLLARAVAKSLEEHKDLLAHVIFICCSGLALEKPPTIRQALSSFVSEALDH 675

Query: 82   SPSVIIFDDLDHIVMFSSESVGSQPSS 2
            +PSV++FDDLD I+  SS+S GSQPS+
Sbjct: 676  APSVVVFDDLDSIIQSSSDSEGSQPST 702


>ref|XP_002871329.1| peroxisome biogenesis protein PEX1 [Arabidopsis lyrata subsp. lyrata]
            gi|297317166|gb|EFH47588.1| peroxisome biogenesis protein
            PEX1 [Arabidopsis lyrata subsp. lyrata]
          Length = 1122

 Score =  281 bits (718), Expect = 5e-73
 Identities = 170/412 (41%), Positives = 246/412 (59%), Gaps = 4/412 (0%)
 Frame = -3

Query: 1228 GLSSASKEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIP 1049
            G SSA KE  R+T++R++FSD VAKGH+M+ +SLR++L   LHSWVY++  ++N+ K+IP
Sbjct: 271  GTSSAKKEP-RQTILRLVFSDLVAKGHLMMVESLRLYLGAGLHSWVYLRGCNVNEDKEIP 329

Query: 1048 SLTLSPCRLKSMGKNFALERNGLDSDIHTNFKSKNMISRKDLFLG--VNVPDWSLHEKLF 875
            +L+LSPC  K       L+R    +D   N  S    S     L   ++V DWS+H+K+ 
Sbjct: 330  ALSLSPCVFKISENEKVLDRG---TDTLGNHNSIRNCSHPPSGLSTYMDVVDWSVHDKVV 386

Query: 874  SSLSC-GFFVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETL 698
            ++LS  G    G +    +    +  + L   W   QL AIAS T GV+V S+ + +ET 
Sbjct: 387  TALSSEGLHDEGNQVNAYQVKNKKKLECLTRLWSLAQLDAIASVT-GVDVSSLIVGRETF 445

Query: 697  LHFELK-ILNLKSKNEQPLFDRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICELA 521
             HFE++   + K ++ QP  +   E G     + +E+LY++T    ES   D     +L+
Sbjct: 446  FHFEVRGPESYKFRDGQPSVNDRWESGKKDKNTPLEILYVMTVS-DESLLGDKFTGYDLS 504

Query: 520  LNTRSKVGNNVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTN 341
            L+   K  N VH    L  K+ LG+P+   S  E++ +      ISSL+WM   +SDV  
Sbjct: 505  LDRSEKSDNVVHIEPVLE-KMNLGDPIYFTSAKETHCNKGVSPDISSLTWMGPIVSDVIK 563

Query: 340  RLTVLLSPYSGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFIS 161
            R+ VLLSP +G   S F +P PGH+LI+GP GSGKT LA A AK  EE +++LAHVI +S
Sbjct: 564  RMAVLLSPAAGMWFSKFKIPSPGHILIYGPPGSGKTILARAAAKYFEEQKDLLAHVILVS 623

Query: 160  CSKLSLEKSQAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPS 5
            CS L+LEK Q I Q +++ I+E L H+PSVII DDLD I+  SS++ G+Q S
Sbjct: 624  CSTLALEKVQHIHQVLSSVIAEGLEHAPSVIILDDLDSIISSSSDTEGTQAS 675


>gb|ESW29810.1| hypothetical protein PHAVU_002G100600g [Phaseolus vulgaris]
          Length = 1126

 Score =  280 bits (716), Expect = 9e-73
 Identities = 169/403 (41%), Positives = 246/403 (61%), Gaps = 1/403 (0%)
 Frame = -3

Query: 1210 KEAYRRTVVRIIFSDSVAKGHIMLPQSLRVFLRTDLHSWVYVKKYSINQKKDIPSLTLSP 1031
            K  YR+ +V+++ S+SVA+GH+M+ +SLR++LR  L SWVY+K  +I  +K+IPS +L P
Sbjct: 279  KTEYRQAIVQLMISESVAEGHVMVAKSLRLYLRASLRSWVYLKACNIILEKNIPSTSLFP 338

Query: 1030 CRLKSMGKNFALERNGLD-SDIHTNFKSKNMISRKDLFLGVNVPDWSLHEKLFSSLSCGF 854
            C+ K + +  ++E++G + S  H N   KN+ ++    + V+  DWS+  K+  ++S   
Sbjct: 339  CQFKLLRQENSVEKDGPEVSHGHNNHIDKNVQAKATSGVFVDSIDWSIQNKVLEAVSDES 398

Query: 853  FVNGGEKATPENHTAQDKKFLLHYWLTGQLKAIASFTEGVEVMSVFLSKETLLHFELKIL 674
                 E+AT ++H  +  + L+  W   QLKAI S + GVEV S+ +  +TLLHFE+   
Sbjct: 399  NYKAEEEATNQSHNQRGLQSLVRLWYITQLKAITSIS-GVEVSSLIMGDKTLLHFEVSCH 457

Query: 673  NLKSKNEQPLFDRSLERGSSTGESVVELLYILTAKFKESSQDDLGNICELALNTRSKVGN 494
             L+S N +  F  SL   S       E+L++LT   +E   +   N  ++AL    ++ N
Sbjct: 458  KLES-NGKAKFAYSLSENSG---KAAEMLFLLTFG-EEYLHNGKLNAYDVALG--GELDN 510

Query: 493  NVHEWQSLHGKLELGEPVSLNSVTESYIDGKFKSTISSLSWMEAAISDVTNRLTVLLSPY 314
                      +++L +PVSL S+ E   + +  S +SSL WME    DV NR+ VLL   
Sbjct: 511  ISIVDLKFFERMKLCDPVSLLSIVERASEDRISSNLSSLGWMEKTADDVINRMLVLLCSA 570

Query: 313  SGKLLSTFGLPLPGHVLIHGPSGSGKTSLAMAVAKCLEEHEEILAHVIFISCSKLSLEKS 134
            SG    +  LPLPGHVLI+GP GSGKT LA  VAK LE  E+I AH+IFISCSKL+LEK 
Sbjct: 571  SGLWFGSHNLPLPGHVLIYGPPGSGKTLLARTVAKSLENREDIFAHIIFISCSKLALEKV 630

Query: 133  QAIRQAITNCISEALAHSPSVIIFDDLDHIVMFSSESVGSQPS 5
              IRQ + N ++EAL H+PSV+IFDDLD I+  S +S GSQPS
Sbjct: 631  PVIRQELANHVTEALNHAPSVVIFDDLDSIIS-SPDSEGSQPS 672


Top