BLASTX nr result

ID: Ephedra25_contig00000863 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00000863
         (1353 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006853404.1| hypothetical protein AMTR_s00032p00152530 [A...   264   7e-68
emb|CBI20540.3| unnamed protein product [Vitis vinifera]              255   3e-65
ref|XP_002273767.1| PREDICTED: peroxisome biogenesis protein 1-l...   248   4e-63
ref|XP_002298113.2| hypothetical protein POPTR_0001s17400g [Popu...   224   5e-56
ref|XP_004237362.1| PREDICTED: peroxisome biogenesis protein 1-l...   224   8e-56
ref|XP_006365432.1| PREDICTED: peroxisome biogenesis protein 1-l...   223   1e-55
dbj|BAB09996.1| unnamed protein product [Arabidopsis thaliana]        223   2e-55
gb|AAG44817.1| peroxisome biogenesis protein PEX1 [Arabidopsis t...   223   2e-55
ref|NP_196464.2| peroxisome biogenesis protein 1 [Arabidopsis th...   223   2e-55
gb|EOY27465.1| Peroxisome biogenesis protein 1 [Theobroma cacao]      222   3e-55
ref|XP_002517570.1| peroxisome biogenesis factor, putative [Rici...   221   7e-55
ref|XP_002871329.1| peroxisome biogenesis protein PEX1 [Arabidop...   218   5e-54
ref|XP_006399345.1| hypothetical protein EUTSA_v10012497mg [Eutr...   214   7e-53
ref|XP_004293758.1| PREDICTED: peroxisome biogenesis protein 1-l...   211   6e-52
ref|XP_006286937.1| hypothetical protein CARUB_v10000082mg [Caps...   210   1e-51
ref|XP_006448771.1| hypothetical protein CICLE_v10014090mg [Citr...   208   5e-51
ref|XP_003529444.1| PREDICTED: peroxisome biogenesis protein 1-l...   207   6e-51
gb|EMJ14918.1| hypothetical protein PRUPE_ppa000485mg [Prunus pe...   206   2e-50
ref|XP_006468418.1| PREDICTED: peroxisome biogenesis protein 1-l...   204   5e-50
ref|XP_003574917.1| PREDICTED: peroxisome biogenesis protein 1-l...   203   2e-49

>ref|XP_006853404.1| hypothetical protein AMTR_s00032p00152530 [Amborella trichopoda]
            gi|548857057|gb|ERN14871.1| hypothetical protein
            AMTR_s00032p00152530 [Amborella trichopoda]
          Length = 1113

 Score =  264 bits (674), Expect = 7e-68
 Identities = 167/452 (36%), Positives = 240/452 (53%), Gaps = 21/452 (4%)
 Frame = +1

Query: 7    IRSCFVGLPAYLIEALQETHDGFLPPMLVLELRS---KGSVWHVAWIGMPSSSHSIEISA 177
            + SCFV LP  LI +LQ T  GFLPP+L LEL+S       WH+AW G  S SH+IE++ 
Sbjct: 1    MESCFVALPLALIHSLQSTCPGFLPPVLALELQSVTDSKEPWHLAWSGAASRSHAIEVAK 60

Query: 178  KLASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEIL-------------XXXXXQ 318
            +LA CIG+P+  +V VR   ++PKA    +EP +EDDWE++                   
Sbjct: 61   QLAECIGMPNRTKVQVRAAANLPKATFAMIEPISEDDWEVMELNSEFAEETILKQVGIVH 120

Query: 319  EGLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSE 498
            EG+ FPLWL GHTV  F V ST P K VVQLVP TE+AVAPK+RK V  A    +  G  
Sbjct: 121  EGMKFPLWLHGHTVATFVVVSTTPKKPVVQLVPETEVAVAPKRRKNVGGA---QQGVGYV 177

Query: 499  KKDFTVKALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTT 678
            K+  T KALLRVQ L+R+ + T K   V   V+ TS  F+ P+T ++F   NGQ V +++
Sbjct: 178  KEHITTKALLRVQELNRNYVHTYKQEGVKLGVVLTSVVFLHPETARHFMFDNGQLVSISS 237

Query: 679  ASGGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVML 858
             + G+   Q+ +    +KK                        VC I     VARGHVML
Sbjct: 238  RASGNGSLQNQKWGASRKKANLTTAEKNNGWLRSGTMVPRHATVC-ISLSDSVARGHVML 296

Query: 859  ARSLCRYIHTNLHERVIVRECLSRANEVSALLLSPISLQSFQRESINRVNSETIDVKENG 1038
             RSL  YI  +LH  V V  C S   + ++L+LSP   +  + + +   N+   + + N 
Sbjct: 297  QRSLRLYIKADLHTWVHVWRCSSHVKKDASLILSPCHFK-LETDKLLEDNANLFEFR-NS 354

Query: 1039 LENGDDNRESDAFVTEKEIVDWNRHQNFLSACVSS-DKEGCDDSEKATC----KNIVLEA 1203
            L+    ++  D+   E E++DW+ H+ F+ A  S     G ++ +  TC    K  +++ 
Sbjct: 355  LKTNSMHQNIDSIFNE-EVMDWSTHEEFIEALPSGCHGHGENEHDCETCAVKQKERLVQI 413

Query: 1204 WLSAATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
            W     +I++ L   +   SL++G  T+LHFE
Sbjct: 414  WTMGQLNIMATLNGVDDVKSLVLGRETILHFE 445


>emb|CBI20540.3| unnamed protein product [Vitis vinifera]
          Length = 1114

 Score =  255 bits (651), Expect = 3e-65
 Identities = 170/479 (35%), Positives = 249/479 (51%), Gaps = 30/479 (6%)
 Frame = +1

Query: 7    IRSCFVGLPAYLIEALQETHDGFLPPMLVLELRSKGS-VWHVAWIGMPSSSHSIEISAKL 183
            I SCFV LP  LI+ LQ T  G LPP+L LELRS  + VW VAW G  S+S SIE++ + 
Sbjct: 11   IESCFVSLPLPLIQTLQSTSSGLLPPVLALELRSSNNDVWVVAWSGSASTSSSIEVARQF 70

Query: 184  ASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXXQ-------------EG 324
            A CI LPD   V VR   ++PKA  + +EP TEDDWE+L                   E 
Sbjct: 71   AECISLPDHTAVQVRAVANLPKATLVTIEPHTEDDWEVLELNAEHAEAAILKQIGIVHEA 130

Query: 325  LTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSEKK 504
            + FPLWL G T I F V ST P K+VVQLVPGTE+AVAPK+RKK   ++       S K 
Sbjct: 131  MRFPLWLHGRTTITFLVVSTFPKKAVVQLVPGTEVAVAPKRRKKYLDSHKNALVQSSNKD 190

Query: 505  DFTVKALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTTAS 684
                KALLRVQ   + LI   +   V   V+ T+  ++ P+T + +   + Q VIL   S
Sbjct: 191  HPIAKALLRVQDSGQKLIHKSEVKGVELGVVLTNVVYIHPETARNYSFDSLQLVILVPRS 250

Query: 685  GGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVMLAR 864
                   D +M   K                    +   +VV +++  + VA+GHVM+A+
Sbjct: 251  PSKGNYNDTDMFRKKS-----ISTAKEFSDGLADKKEPCQVVVRLLISESVAKGHVMMAQ 305

Query: 865  SLCRYIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETIDVKENGL 1041
            SL  Y+ T LH  V ++ C ++   E+S L LSP   + F++            ++ENGL
Sbjct: 306  SLRHYLRTGLHSWVYMKRCDINLKKEISLLSLSPCQFKMFEKNKA---------LEENGL 356

Query: 1042 ENGDD--NRESDAFVTE------KEIVDWNRHQNFLSACVSSDKEGCDD----SEKATCK 1185
            E  D   N ++ + + E        I DW+ H+ F +A +S +  G +D    S+  + K
Sbjct: 357  EVLDSLTNHKTKSMLLETNSDTYMNISDWSTHEEF-AAALSFESPGSEDEKTSSQSGSRK 415

Query: 1186 NI--VLEAWLSAATSILSKLVTGEGAGSLLIGSNTLLHFEAYSE-HGTVTYDVLFLLSV 1353
             +  +L+AW  A    ++    G    SL++G+ TLLHF   S+ +G ++ ++L++L++
Sbjct: 416  GLQSLLQAWFLAHLDAINS-NAGTEIDSLVVGNETLLHFNVTSDNYGDLSVEILYILAI 473


>ref|XP_002273767.1| PREDICTED: peroxisome biogenesis protein 1-like [Vitis vinifera]
          Length = 1134

 Score =  248 bits (633), Expect = 4e-63
 Identities = 167/464 (35%), Positives = 237/464 (51%), Gaps = 29/464 (6%)
 Frame = +1

Query: 7    IRSCFVGLPAYLIEALQETHDGFLPPMLVLELRSKGS-VWHVAWIGMPSSSHSIEISAKL 183
            I SCFV LP  LI+ LQ T  G LPP+L LELRS  + VW VAW G  S+S SIE++ + 
Sbjct: 11   IESCFVSLPLPLIQTLQSTSSGLLPPVLALELRSSNNDVWVVAWSGSASTSSSIEVARQF 70

Query: 184  ASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXXQ-------------EG 324
            A CI LPD   V VR   ++PKA  + +EP TEDDWE+L                   E 
Sbjct: 71   AECISLPDHTAVQVRAVANLPKATLVTIEPHTEDDWEVLELNAEHAEAAILKQIGIVHEA 130

Query: 325  LTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSEKK 504
            + FPLWL G T I F V ST P K+VVQLVPGTE+AVAPK+RKK   ++       S K 
Sbjct: 131  MRFPLWLHGRTTITFLVVSTFPKKAVVQLVPGTEVAVAPKRRKKYLDSHKNALVQSSNKD 190

Query: 505  DFTVKALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTTAS 684
                KALLRVQ   + LI   +   V   V+ T+  ++ P+T + +   + Q VIL   S
Sbjct: 191  HPIAKALLRVQDSGQKLIHKSEVKGVELGVVLTNVVYIHPETARNYSFDSLQLVILVPRS 250

Query: 685  GGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVMLAR 864
                   D +M   K                    +   +VV +++  + VA+GHVM+A+
Sbjct: 251  PSKGNYNDTDMFRKKS-----ISTAKEFSDGLADKKEPCQVVVRLLISESVAKGHVMMAQ 305

Query: 865  SLCRYIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETIDVKENGL 1041
            SL  Y+ T LH  V ++ C ++   E+S L LSP   + F++            ++ENGL
Sbjct: 306  SLRHYLRTGLHSWVYMKRCDINLKKEISLLSLSPCQFKMFEKNKA---------LEENGL 356

Query: 1042 ENGDD--NRESDAFVTE------KEIVDWNRHQNFLSACVSSDKEGCDD----SEKATCK 1185
            E  D   N ++ + + E        I DW+ H+ F +A +S +  G +D    S+  + K
Sbjct: 357  EVLDSLTNHKTKSMLLETNSDTYMNISDWSTHEEF-AAALSFESPGSEDEKTSSQSGSRK 415

Query: 1186 NI--VLEAWLSAATSILSKLVTGEGAGSLLIGSNTLLHFEAYSE 1311
             +  +L+AW  A    ++    G    SL++G+ TLLHF   S+
Sbjct: 416  GLQSLLQAWFLAHLDAINS-NAGTEIDSLVVGNETLLHFNVTSD 458


>ref|XP_002298113.2| hypothetical protein POPTR_0001s17400g [Populus trichocarpa]
            gi|550347541|gb|EEE82918.2| hypothetical protein
            POPTR_0001s17400g [Populus trichocarpa]
          Length = 1133

 Score =  224 bits (572), Expect = 5e-56
 Identities = 159/460 (34%), Positives = 228/460 (49%), Gaps = 29/460 (6%)
 Frame = +1

Query: 7    IRSCFVGLPAYLIEALQETH-DGFLPPMLVLELRSKGSV--WHVAWIGMPSSSHSIEISA 177
            I +CFV LP  LI+ L+ T     LPP+L LELRS  +   W VAW G  SSS SIE++ 
Sbjct: 11   IENCFVSLPINLIQILESTRRPAPLPPLLTLELRSPSANRHWTVAWSGATSSSSSIEVAQ 70

Query: 178  KLASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXXQ------------- 318
            + A CI LPD I V VR  ++V  A  + +EP +EDDWE+L     Q             
Sbjct: 71   QFAECISLPDHISVQVRAVSNVVNATLVTIEPHSEDDWEVLELNAEQAEASILKQVRIVN 130

Query: 319  EGLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSE 498
            EG+ FPLWL G  VI F V ST P ++VVQLVPG E+AVAPK+R+KV   N +D    S 
Sbjct: 131  EGMRFPLWLHGGAVITFLVVSTSPKRAVVQLVPGAEVAVAPKRREKV--VNKQDATVQSY 188

Query: 499  KKDFTV-KALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILT 675
             K+  + KALLR+Q LDR L   C    V     PT  A++ P+T + F L + Q V L 
Sbjct: 189  NKESNMAKALLRLQDLDRRLFHNCDVKGVELATAPTCVAYMHPETAQMFSLDSLQLVTLV 248

Query: 676  ---TASGGSKPKQDDEMH--HGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVA 840
               ++  G K    D +       KE N                   + + +++F   VA
Sbjct: 249  PRLSSKDGVKTPDSDALRVKSASPKEANNGTLTDKKEFH--------QAIVRLLFSDSVA 300

Query: 841  RGHVMLARSLCRYIHTNLHERVIVRECLSRANEVSALLLSPISLQ-SFQRESINRVNSET 1017
            +GHVM+ARSL  Y+   LH  + ++  ++   ++++L LSP   +   Q + + +   E 
Sbjct: 301  KGHVMIARSLRLYLRAGLHSWIYLKGWITDLKDIASLSLSPCYFKMPGQDKPVEKPGLEL 360

Query: 1018 IDVKENGLENGDDNRESDAFVTEKEIVDWNRHQNFLSACVSSDKEGCDDSEKATCKN--- 1188
            ID+ +             +  T  + VDW+ H   + A +S D     + E     +   
Sbjct: 361  IDIDKL------QKPRKTSLDTYMDAVDWSIHDK-IFASLSQDFPSKQEEETGYLPDNKK 413

Query: 1189 ---IVLEAWLSAATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
                +L+AW  A    ++   +G    SL++G  TLLHFE
Sbjct: 414  GLRRLLQAWYRAQLDAIAS-TSGVEVNSLIVGKETLLHFE 452


>ref|XP_004237362.1| PREDICTED: peroxisome biogenesis protein 1-like [Solanum
            lycopersicum]
          Length = 1128

 Score =  224 bits (570), Expect = 8e-56
 Identities = 161/486 (33%), Positives = 241/486 (49%), Gaps = 36/486 (7%)
 Frame = +1

Query: 1    ATIRSCFVGLPAYLIEALQETH-DGFLPPMLVLELRSKGSVWHVAWIGMPSSS---HSIE 168
            A I SCFV LP  L++ L+ T   G+LPP+L LELRS  ++W +AW G  SS+   +SI+
Sbjct: 9    AGIESCFVSLPVTLLQTLESTTASGYLPPVLALELRSGNNLWRLAWSGSASSNPFPNSIQ 68

Query: 169  ISAKLASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXXQE--------- 321
            I+ + A CIGL D   V V+V +++PKA  + +EPDTEDDWE+L                
Sbjct: 69   IAKQYAECIGLLDRTVVQVKVVSNLPKATMVTIEPDTEDDWEVLELNAEHAEQAILKQVA 128

Query: 322  ----GLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKD 489
                 + FPLWL G T+I F V ST P   VVQLVPGTE+AVAPK+RK+    NI   ++
Sbjct: 129  IVYGAMRFPLWLHGQTIITFKVVSTFPLTPVVQLVPGTEVAVAPKRRKR----NISSGEE 184

Query: 490  GSEKKD--FTVKALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQP 663
               + D     KALLRVQ  D   I   +   V   V+ TS  F+ P+T   +  +  Q 
Sbjct: 185  SMMQDDELSVSKALLRVQDTDDQCIHKYEAEGVEMSVVLTSAIFIHPETASIYSFEPLQT 244

Query: 664  VILTTASGGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDV-EVVCQIIFQKGVA 840
            V++        P++  + H    +                  + D+ + + ++IF + VA
Sbjct: 245  VVIIPR---LLPRETKKNHETYSRRGKSSVTSKEGSVGVLPDKHDIHQAMVRLIFSESVA 301

Query: 841  RGHVMLARSLCRYIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSET 1017
            +GH+ML RS+  Y+   LH  V V+   +    E+  +LLSP   + FQ   ++  N+  
Sbjct: 302  KGHIMLPRSIRLYLKAELHSCVYVKRFNVKLKKEIPPVLLSPCEFKIFQETGVSEENNAE 361

Query: 1018 IDVKENGLENGDDNRESDAFVTEKEIVDWNRHQNFLSA-CVSSDKEGCDDSEKATCKN-- 1188
               K N  +       +++ + E    DW+ H+   +A    S KE  + S K+  K   
Sbjct: 362  ALGKNNNNKTLTTVLRTNSDI-EMGSSDWSIHEEIAAAFSYESSKEDKEMSIKSDIKKDI 420

Query: 1189 -IVLEAWLSAATSILSKLVTGEGAGSLLIGSNTLLHFEAYS----EHGTVT-------YD 1332
              +L  W  A    + K+  G    SL++G+ TLLHF+A      +HG  T        D
Sbjct: 421  AAILHRWCLAQLHAV-KIKAGVEVKSLILGNTTLLHFKAKDSRSIKHGVQTMNGGETSLD 479

Query: 1333 VLFLLS 1350
             +++LS
Sbjct: 480  AMYVLS 485


>ref|XP_006365432.1| PREDICTED: peroxisome biogenesis protein 1-like [Solanum tuberosum]
          Length = 1128

 Score =  223 bits (568), Expect = 1e-55
 Identities = 160/490 (32%), Positives = 240/490 (48%), Gaps = 40/490 (8%)
 Frame = +1

Query: 1    ATIRSCFVGLPAYLIEALQETH-DGFLPPMLVLELRSKGSVWHVAWIGMPSSS---HSIE 168
            A I SCFV LP  L++ L+ T   G+LPP+L LELRS  ++W +AW G  SS+   +SI+
Sbjct: 9    AGIESCFVSLPVTLLQTLESTTASGYLPPVLALELRSGNNLWRLAWSGSASSNPFPNSIQ 68

Query: 169  ISAKLASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXXQE--------- 321
            I+ + A CIGL D   V V+V +++PKA  + +EPDTEDDWE+L                
Sbjct: 69   IAKQYAECIGLSDRTVVQVKVVSNLPKATMVTIEPDTEDDWEVLELNAEHAEQAILKQVA 128

Query: 322  ----GLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKD 489
                 + FPLWL G T+I F V ST P   VVQLVPGTE+AVAPK+RK+    NI   ++
Sbjct: 129  IVYGAMRFPLWLHGQTIITFKVVSTFPLTPVVQLVPGTEVAVAPKRRKR----NISSGEE 184

Query: 490  GSEKKD--FTVKALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQP 663
               + D     KALLRVQ  D   I   +   V   V+ TS  F+ P+T   +  +  Q 
Sbjct: 185  SMMQDDELSVSKALLRVQDTDDQCIHKYEADGVEMRVVLTSAIFIHPETASIYSFEPLQT 244

Query: 664  VILTTASGGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDV-EVVCQIIFQKGVA 840
            V++        P++  + H    +                  + ++ + + ++IF + VA
Sbjct: 245  VVIIPR---LLPRETKKNHETDSRTGKSSVTSKEGNVGVLPDKHNIHQAMVRLIFSESVA 301

Query: 841  RGHVMLARSLCRYIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSET 1017
            +GH+ML RS+  Y+   LH RV V+   +    E+  + LSP   + FQ   ++  NS  
Sbjct: 302  KGHIMLPRSIRLYLRAELHSRVYVKRFNVKLKKEIPLVSLSPCEFKIFQETGVSEENS-- 359

Query: 1018 IDVKENGLENGDDNRESDAFVTEKEI----VDWNRHQNFLSA-CVSSDKEGCDDSEKATC 1182
                E   +N  +   +  F T  +I     DW+ H+   +A    S KE  + S K+  
Sbjct: 360  ---SEALGKNNYNKTLTTLFRTNSDIEMGTSDWSIHEKIAAAFSCESSKEDKETSIKSDL 416

Query: 1183 KN---IVLEAWLSAATSILSKLVTGEGAGSLLIGSNTLLHFEAYSEH-----------GT 1320
            K     +L  W  A    ++ +  G    SL++G+ TLLHF+A               G 
Sbjct: 417  KKDIAAILHRWCLAQLHAVT-IKAGVEVKSLILGNTTLLHFKAKDSRSIKHGGQTMNGGE 475

Query: 1321 VTYDVLFLLS 1350
             + D +++LS
Sbjct: 476  TSLDAMYVLS 485


>dbj|BAB09996.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1125

 Score =  223 bits (567), Expect = 2e-55
 Identities = 158/448 (35%), Positives = 218/448 (48%), Gaps = 20/448 (4%)
 Frame = +1

Query: 16   CFVGLPAYLIEALQETHDGFLPPMLVLELRSKGSVWHVAWIGMPSSSHSIEISAKLASCI 195
            CFV LP  L+ ALQ T    LPP+L +ELRS    W VAW G  SSS +IEI+   A  I
Sbjct: 28   CFVSLPRQLLHALQSTSSSPLPPLLPVELRSGDRRWSVAWSGSSSSSSAIEIARVFAESI 87

Query: 196  GLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEIL-------------XXXXXQEGLTFP 336
             LPD   V VRV  +VPKA  + +EP+TEDDWE+L                   E + FP
Sbjct: 88   SLPDGTVVKVRVLPNVPKATLVTVEPETEDDWEVLELNAELAEAAILSQVRILHETMKFP 147

Query: 337  LWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSEKKDFTV 516
            LWL   TVI F V ST P+K VVQLVPGTE+AVAPK+R +    N+K +K   EK+   V
Sbjct: 148  LWLHDRTVIRFSVVSTFPSKGVVQLVPGTEVAVAPKRRDR----NLKAKK-SQEKECNNV 202

Query: 517  KALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTTASGGSK 696
            KALLRVQ  DRS             V  TS A++ P+T K   L++ Q + ++       
Sbjct: 203  KALLRVQETDRSAFHEADVKGFELRVALTSIAYIHPETAKKHSLESLQLISVSPRIPLKG 262

Query: 697  PKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVMLARSLCR 876
              + DE  + K  E +                   + + +++F    A+GH+M+  SL  
Sbjct: 263  SAKKDEALNMKNSEASKVAENGTSSAKKEPR----QAILRLVFSDLAAKGHLMMVESLRL 318

Query: 877  YIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETIDVKENGLENGD 1053
            Y+   LH  V +R C ++   E+ AL LSP   +  + E +       +D   + L N +
Sbjct: 319  YLGAGLHSWVYLRGCNVNEDKEIPALSLSPCVFKISENEKV-------LDKGTDRLGNNN 371

Query: 1054 DNRES----DAFVTEKEIVDWNRHQNFLSACVSS--DKEGCDDSEKATCKNIVLEAWLSA 1215
              R+S        T  ++VDW+ H   ++A  S     EG  D  K   + +    W  A
Sbjct: 372  SVRKSSHPPSGLSTYVDVVDWSVHDKVVTALSSEGLHDEGNHDKNKKGLEYLT-RLWSLA 430

Query: 1216 ATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
                ++  VTG    SL++G  T  HFE
Sbjct: 431  QLDAMAS-VTGVDVSSLIVGRETFFHFE 457


>gb|AAG44817.1| peroxisome biogenesis protein PEX1 [Arabidopsis thaliana]
          Length = 1119

 Score =  223 bits (567), Expect = 2e-55
 Identities = 158/448 (35%), Positives = 218/448 (48%), Gaps = 20/448 (4%)
 Frame = +1

Query: 16   CFVGLPAYLIEALQETHDGFLPPMLVLELRSKGSVWHVAWIGMPSSSHSIEISAKLASCI 195
            CFV LP  L+ ALQ T    LPP+L +ELRS    W VAW G  SSS +IEI+   A  I
Sbjct: 17   CFVSLPRQLLHALQSTSSSPLPPLLPVELRSGDRRWSVAWSGSSSSSSAIEIARVFAESI 76

Query: 196  GLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEIL-------------XXXXXQEGLTFP 336
             LPD   V VRV  +VPKA  + +EP+TEDDWE+L                   E + FP
Sbjct: 77   SLPDGTVVKVRVLPNVPKATLVTVEPETEDDWEVLELNAELAEAAILSQVRILHETMKFP 136

Query: 337  LWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSEKKDFTV 516
            LWL   TVI F V ST P+K VVQLVPGTE+AVAPK+R +    N+K +K   EK+   V
Sbjct: 137  LWLHDRTVIRFSVVSTFPSKGVVQLVPGTEVAVAPKRRDR----NLKAKK-SQEKECNNV 191

Query: 517  KALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTTASGGSK 696
            KALLRVQ  DRS             V  TS A++ P+T K   L++ Q + ++       
Sbjct: 192  KALLRVQETDRSAFHEADVKGFELRVALTSIAYIHPETAKKHSLESLQLISVSPRIPLKG 251

Query: 697  PKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVMLARSLCR 876
              + DE  + K  E +                   + + +++F    A+GH+M+  SL  
Sbjct: 252  SAKKDEALNMKNSEASKVAENGTSSAKKEPR----QAILRLVFSDLAAKGHLMMVESLRL 307

Query: 877  YIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETIDVKENGLENGD 1053
            Y+   LH  V +R C ++   E+ AL LSP   +  + E +       +D   + L N +
Sbjct: 308  YLGAGLHSWVYLRGCNVNEDKEIPALSLSPCVFKISENEKV-------LDKGTDRLGNNN 360

Query: 1054 DNRES----DAFVTEKEIVDWNRHQNFLSACVSS--DKEGCDDSEKATCKNIVLEAWLSA 1215
              R+S        T  ++VDW+ H   ++A  S     EG  D  K   + +    W  A
Sbjct: 361  SVRKSSHPPSGLSTYVDVVDWSVHDKVVTALSSEGLHDEGNHDKNKKGLEYLT-RLWSLA 419

Query: 1216 ATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
                ++  VTG    SL++G  T  HFE
Sbjct: 420  QLDAMAS-VTGVDVSSLIVGRETFFHFE 446


>ref|NP_196464.2| peroxisome biogenesis protein 1 [Arabidopsis thaliana]
            gi|322967561|sp|Q9FNP1.2|PEX1_ARATH RecName:
            Full=Peroxisome biogenesis protein 1; AltName:
            Full=Peroxin-1; Short=AtPEX1 gi|332003924|gb|AED91307.1|
            peroxisome biogenesis protein 1 [Arabidopsis thaliana]
          Length = 1130

 Score =  223 bits (567), Expect = 2e-55
 Identities = 158/448 (35%), Positives = 218/448 (48%), Gaps = 20/448 (4%)
 Frame = +1

Query: 16   CFVGLPAYLIEALQETHDGFLPPMLVLELRSKGSVWHVAWIGMPSSSHSIEISAKLASCI 195
            CFV LP  L+ ALQ T    LPP+L +ELRS    W VAW G  SSS +IEI+   A  I
Sbjct: 28   CFVSLPRQLLHALQSTSSSPLPPLLPVELRSGDRRWSVAWSGSSSSSSAIEIARVFAESI 87

Query: 196  GLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEIL-------------XXXXXQEGLTFP 336
             LPD   V VRV  +VPKA  + +EP+TEDDWE+L                   E + FP
Sbjct: 88   SLPDGTVVKVRVLPNVPKATLVTVEPETEDDWEVLELNAELAEAAILSQVRILHETMKFP 147

Query: 337  LWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSEKKDFTV 516
            LWL   TVI F V ST P+K VVQLVPGTE+AVAPK+R +    N+K +K   EK+   V
Sbjct: 148  LWLHDRTVIRFSVVSTFPSKGVVQLVPGTEVAVAPKRRDR----NLKAKK-SQEKECNNV 202

Query: 517  KALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTTASGGSK 696
            KALLRVQ  DRS             V  TS A++ P+T K   L++ Q + ++       
Sbjct: 203  KALLRVQETDRSAFHEADVKGFELRVALTSIAYIHPETAKKHSLESLQLISVSPRIPLKG 262

Query: 697  PKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVMLARSLCR 876
              + DE  + K  E +                   + + +++F    A+GH+M+  SL  
Sbjct: 263  SAKKDEALNMKNSEASKVAENGTSSAKKEPR----QAILRLVFSDLAAKGHLMMVESLRL 318

Query: 877  YIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETIDVKENGLENGD 1053
            Y+   LH  V +R C ++   E+ AL LSP   +  + E +       +D   + L N +
Sbjct: 319  YLGAGLHSWVYLRGCNVNEDKEIPALSLSPCVFKISENEKV-------LDKGTDRLGNNN 371

Query: 1054 DNRES----DAFVTEKEIVDWNRHQNFLSACVSS--DKEGCDDSEKATCKNIVLEAWLSA 1215
              R+S        T  ++VDW+ H   ++A  S     EG  D  K   + +    W  A
Sbjct: 372  SVRKSSHPPSGLSTYVDVVDWSVHDKVVTALSSEGLHDEGNHDKNKKGLEYLT-RLWSLA 430

Query: 1216 ATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
                ++  VTG    SL++G  T  HFE
Sbjct: 431  QLDAMAS-VTGVDVSSLIVGRETFFHFE 457


>gb|EOY27465.1| Peroxisome biogenesis protein 1 [Theobroma cacao]
          Length = 1153

 Score =  222 bits (565), Expect = 3e-55
 Identities = 165/491 (33%), Positives = 241/491 (49%), Gaps = 45/491 (9%)
 Frame = +1

Query: 1    ATIRSCFVGLPAYLIEALQETHDGFLPPMLVLELR---SKGSVWHVAWIGMPSSSHSIEI 171
            A I  CFV LP  LI+ LQ T    LPP+L LELR   S    W VAW G  SSS +IE+
Sbjct: 9    AGIEDCFVSLPLLLIQTLQSTRSSLLPPLLALELRLPRSSDHPWIVAWSGAASSSTAIEV 68

Query: 172  SAKLASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXXQ----------- 318
            S + A CI LP+   V VR  +++ KA  + +EP TEDDWE+L                 
Sbjct: 69   SQQFAECISLPNHTTVQVRAASNMAKATLVTIEPHTEDDWEVLELNSEHAEAAILKQVRI 128

Query: 319  --EGLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDG 492
              EG+ FPLWL G T++ F V ST P K+VVQLVPGTE+AVAPK+R+K    N+K+  + 
Sbjct: 129  VHEGMRFPLWLHGRTIVTFLVVSTFPKKAVVQLVPGTEVAVAPKRREK----NLKN-MES 183

Query: 493  SEKKDFTVKALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVIL 672
            S ++    KALLR+Q  DR L        V   V  TS AF+   T K F L++ Q V++
Sbjct: 184  STRESHGAKALLRLQDSDRRLFHKSNVKGVELGVALTSVAFIHQVTAKRFSLESLQLVVI 243

Query: 673  T---TASGGSKPKQDDEMHHG---KKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKG 834
                ++ G  K  ++D +        KE N                   +V+  ++    
Sbjct: 244  VPRLSSKGSVKNLENDALRMKGSLTSKEANSGISTDNKEFR--------QVIVHLLISDS 295

Query: 835  VARGHVMLARSLCRYIHTNLHERVIVRECLSRANEVSALLLSPISLQSFQRESINRVNSE 1014
            VA GHVM+ RSL  Y+   LH       C+   ++   L+L  +  +    +  N    +
Sbjct: 296  VAEGHVMITRSLRLYLRAGLH------SCMLNLSKNQLLILLYLPRKGVYLKGYNVALKK 349

Query: 1015 TIDV--------------KENGLENGDDNR----ESDAFVTEKEIVDWNRHQNFLSACVS 1140
             I V              KENGLE  D ++    ++    T  E+V+W+ H + + A +S
Sbjct: 350  EISVLSLSPCHFKVVANDKENGLEVLDGHKTRRMKNSGSGTSLEVVNWSTHDDVV-AVLS 408

Query: 1141 SD---KEGCDDSEKATCKNI--VLEAWLSAATSILSKLVTGEGAGSLLIGSNTLLHFEAY 1305
            S+   +E  D S++ T K +  +L AW  A    ++    G    +L++G+  LLHFE  
Sbjct: 409  SEFPFQEAEDSSQEDTKKGLECLLRAWFLAQLDAIAS-NAGTEVKTLVLGNENLLHFEV- 466

Query: 1306 SEHGTVTYDVL 1338
            + + + TY ++
Sbjct: 467  NRYDSGTYGLV 477


>ref|XP_002517570.1| peroxisome biogenesis factor, putative [Ricinus communis]
            gi|223543202|gb|EEF44734.1| peroxisome biogenesis factor,
            putative [Ricinus communis]
          Length = 1137

 Score =  221 bits (562), Expect = 7e-55
 Identities = 152/455 (33%), Positives = 230/455 (50%), Gaps = 24/455 (5%)
 Frame = +1

Query: 7    IRSCFVGLPAYLIEALQETHDG-FLPPMLVLELRSKGS--VWHVAWIGMPSSSHSIEISA 177
            I +CF+ LP  LI+ L+ T  G F   +L LELRS  +   W VAW G  SSS +IE++ 
Sbjct: 11   IENCFISLPIQLIQTLESTRPGDFHSQILTLELRSSTTDHQWVVAWSGATSSSSAIEVAR 70

Query: 178  KLASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXX-------------Q 318
            + A CI LPD I V VR  ++V  A  + +EP +EDDWE+L                   
Sbjct: 71   QFADCISLPDRISVKVRAVSNVASATLVTIEPSSEDDWEVLELNADLAEAAILNQVRIVH 130

Query: 319  EGLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSE 498
            E + FPLWL G T+I FHV ST P K+VVQLVPGTE+AVAPK+RK     ++  +   S 
Sbjct: 131  ETMKFPLWLHGRTIITFHVVSTLPKKAVVQLVPGTEVAVAPKRRK----TDLNKQDLQSS 186

Query: 499  KKDFTV-KALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILT 675
             K+F + KALLR+Q  DR L+   +   V   V+ TS A++ P+T   F L + Q V + 
Sbjct: 187  SKEFKITKALLRLQDSDRRLLHRREVEGVELGVVLTSVAYIHPETATRFSLDSLQLVTIV 246

Query: 676  TASGGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVM 855
                  +  +  E    + K  N               +   + + +I+F   VA+GH+M
Sbjct: 247  PRLSSKETIRTPESDVSRTK--NSSALKEIKNDILTDKKEYRQAIVRIVFSDSVAKGHLM 304

Query: 856  LARSLCRYIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETIDVKE 1032
            +ARSL  Y+  +LH  V ++ C +    ++++L LSP   +   ++  N +   +++V +
Sbjct: 305  IARSLRLYLMASLHSWVYLKICTMDLKEDITSLSLSPCHFKMPGQD--NAIEKNSLEVLD 362

Query: 1033 NGLENGDDNRESDAFVTEKEIVDWNRHQNFLSACVSSD------KEGCDDSEKATCKNIV 1194
              +     N  S    +    VDW+ H   L+A +S+D      +E    S        +
Sbjct: 363  QRIIQKPRNLVSGGSGSYMGTVDWSVHDRILAA-LSNDFPCEGGQETIYQSNNRKGLRRL 421

Query: 1195 LEAWLSAATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
            L+AW  A    ++    G  A S+++G  T+LHFE
Sbjct: 422  LQAWFLAQLDAIASF-AGSEANSVILGKETILHFE 455


>ref|XP_002871329.1| peroxisome biogenesis protein PEX1 [Arabidopsis lyrata subsp. lyrata]
            gi|297317166|gb|EFH47588.1| peroxisome biogenesis protein
            PEX1 [Arabidopsis lyrata subsp. lyrata]
          Length = 1122

 Score =  218 bits (555), Expect = 5e-54
 Identities = 152/449 (33%), Positives = 218/449 (48%), Gaps = 21/449 (4%)
 Frame = +1

Query: 16   CFVGLPAYLIEALQETHDGFLPPMLVLELRSKGSVWHVAWIGMPSSSHSIEISAKLASCI 195
            CFV LP  L+ ALQ T    LPP+L +ELRS    W VAW G  SSS +IE++   A  I
Sbjct: 15   CFVSLPRQLLHALQSTSSSPLPPLLPVELRSGDRRWSVAWSGSSSSSSAIEVARVFAETI 74

Query: 196  GLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEIL-------------XXXXXQEGLTFP 336
             LPD   V VRV  +VPKA  + +EP+TEDDWE+L                   E + FP
Sbjct: 75   SLPDATVVQVRVLPNVPKATLVTVEPETEDDWEVLELNAELAEAAILSQVRILHETMKFP 134

Query: 337  LWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSEKKDFTV 516
            LWL   TVI+F V ST P+K VVQLVPGTE+AVAPK+R +    N+K +K   EK+   V
Sbjct: 135  LWLHDRTVISFAVVSTFPSKGVVQLVPGTEVAVAPKRRDR----NLKAKK-SQEKECTNV 189

Query: 517  KALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTTASGGSK 696
            KALLRVQ   RS             V  TS A++ P+T K + +++ Q + ++       
Sbjct: 190  KALLRVQDTGRSAFREADVKGFELRVALTSVAYIHPETAKKYSIESLQLISVSPRIPLKG 249

Query: 697  PKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVMLARSLCR 876
              + DE  + K    N               +   + + +++F   VA+GH+M+  SL  
Sbjct: 250  TAKKDEALNIK----NSGASKVAENGTSSAKKEPRQTILRLVFSDLVAKGHLMMVESLRL 305

Query: 877  YIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETIDVKENGLENGD 1053
            Y+   LH  V +R C ++   E+ AL LSP   +  + E +    ++T+    N + N  
Sbjct: 306  YLGAGLHSWVYLRGCNVNEDKEIPALSLSPCVFKISENEKVLDRGTDTLG-NHNSIRN-- 362

Query: 1054 DNRESDAFVTEKEIVDWNRHQNFLSACVSSDKEGCDDSEKATCKNIVLEAWLSAATSILS 1233
             +       T  ++VDW+ H   ++A  S      D+  +     +  +  L   T + S
Sbjct: 363  CSHPPSGLSTYMDVVDWSVHDKVVTALSSEGLH--DEGNQVNAYQVKNKKKLECLTRLWS 420

Query: 1234 -------KLVTGEGAGSLLIGSNTLLHFE 1299
                     VTG    SL++G  T  HFE
Sbjct: 421  LAQLDAIASVTGVDVSSLIVGRETFFHFE 449


>ref|XP_006399345.1| hypothetical protein EUTSA_v10012497mg [Eutrema salsugineum]
            gi|557100435|gb|ESQ40798.1| hypothetical protein
            EUTSA_v10012497mg [Eutrema salsugineum]
          Length = 1127

 Score =  214 bits (545), Expect = 7e-53
 Identities = 150/448 (33%), Positives = 213/448 (47%), Gaps = 20/448 (4%)
 Frame = +1

Query: 16   CFVGLPAYLIEALQETHDGFLPPMLVLELRSKGSVWHVAWIGMPSSSHSIEISAKLASCI 195
            CFV LP ++++ LQ T    LPP+L  ELRS    W VAW G  SSS +IE++   A  I
Sbjct: 15   CFVSLPHHILQTLQSTSSAPLPPLLPFELRSGDRRWPVAWSGSSSSSSAIEVARVFAESI 74

Query: 196  GLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEIL-------------XXXXXQEGLTFP 336
             LPD   VHVRV ++VPKA  + +EP+TEDDWEIL                   E + FP
Sbjct: 75   SLPDGTVVHVRVLSNVPKATLVTVEPETEDDWEILELNAELAESAILSQVRILHETMKFP 134

Query: 337  LWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSEKKDFTV 516
            LWL   TVI F V ST P K VVQLV GTE+AVAPK+R++   A    +   S+K+    
Sbjct: 135  LWLHDRTVIRFAVVSTFPPKGVVQLVTGTEVAVAPKRRERNLNAKNGSDAFASDKECNNE 194

Query: 517  KALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTTASGGSK 696
            K LLRVQ   RS             V  TS A++ P+T K + L++ Q + ++       
Sbjct: 195  KILLRVQNTTRSAFHEADVKGFDVRVALTSIAYIHPETAKKYSLESLQMISVSPRIPLKG 254

Query: 697  PKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVMLARSLCR 876
              + DE  + K  E +                     + +++F    A+GH+M+  SL  
Sbjct: 255  SAKKDEALNMKSSEASKVVENGTPSAKKEPR----RAILRLVFSDLAAKGHLMMVESLRL 310

Query: 877  YIHTNLHERVIVRECLSRAN-EVSALLLSPISLQSFQRESINRVNSETIDVKENGLENGD 1053
            Y+   LH  V +R C    N E+ AL LS    +  ++E   +V     D+  N   N  
Sbjct: 311  YLGAGLHSWVYLRGCNVNVNKEIPALSLSSCVFKISEKE---KVLDRGTDMLGNHSFNRK 367

Query: 1054 DNRESDAFVTEKEIVDWNRHQNFLSACVSSDKEGCDDSEKA-TCKN-----IVLEAWLSA 1215
             +       T  +++DW+ H   L+A  S +    ++ + A   KN      +   W  A
Sbjct: 368  SSHPRSGLTTNVDVLDWSVHDKVLTALSSEELHIKEEQDNAYQLKNRKGLERLTRLWSLA 427

Query: 1216 ATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
                ++ L TG    SL++G  TL HFE
Sbjct: 428  QLDAIASL-TGVDVSSLIVGRETLFHFE 454


>ref|XP_004293758.1| PREDICTED: peroxisome biogenesis protein 1-like [Fragaria vesca
            subsp. vesca]
          Length = 1129

 Score =  211 bits (537), Expect = 6e-52
 Identities = 151/456 (33%), Positives = 219/456 (48%), Gaps = 24/456 (5%)
 Frame = +1

Query: 4    TIRSCFVGLPAYLIEALQETHDGFLPPMLVLELRSKGSV--WHVAWIGMPSSSHSIEISA 177
            TI  C+V LP  LI+ L  +    LPP+L L+LRS  +   W VAW G  SSS +IE++ 
Sbjct: 10   TIEDCYVSLPLALIQTLHSSSPS-LPPVLALDLRSSSTDHHWTVAWSGATSSSPAIEVAQ 68

Query: 178  KLASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXX-------------Q 318
            +   CI LPD   V VR  + V +A  + +EP TEDDWE++                   
Sbjct: 69   QFGECISLPDRSRVQVRALSSVDRATLVTIEPSTEDDWEVMELNSELAEAAILNQVRIVH 128

Query: 319  EGLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSE 498
            EG+ FPLWL G T + F V ST P KSVVQLVPGTE+AVAPK+RK V   +  DE   S 
Sbjct: 129  EGMKFPLWLHGRTTVTFLVVSTFPKKSVVQLVPGTEVAVAPKRRKNV--NSNGDEMLASG 186

Query: 499  KKDFTVKALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTT 678
                  KALLRVQ  D+ L+       V   V+ TS   V P+T + F LK   P+ L  
Sbjct: 187  GGHHFSKALLRVQDADKRLVHQSNVKGVELGVVLTSVGIVHPETAERFSLK---PLELVA 243

Query: 679  ASGGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVML 858
                  PK+  +                         + + + V +++    VA+GH+M+
Sbjct: 244  VVPRLIPKESMKNSESDGLRIGSSTPKESSVRVPNDKKDNHQAVVRLLISDSVAKGHLMI 303

Query: 859  ARSLCRYIHTNLHERVIVRECLS-RANEVSALLLSPISLQ-SFQRESINRVNSETIDVKE 1032
            A+SL  Y+   LH  V ++ C     N +    LSP   + S + +++ R   + +D  +
Sbjct: 304  AQSLRLYLRAGLHSWVYLKGCGGILKNNMPMCSLSPCHFKISPKEKAVERNGLQVLDRHK 363

Query: 1033 NGLENGDDNRESDAFVTEKEIVDWNRHQNFLSACVSSDKEGCDDSE-------KATCKNI 1191
               +N  D   +    T  ++VDW+ H   ++    S K  C++ E       K      
Sbjct: 364  TRKKN--DMLLTPGSSTYIDVVDWSTHDKVVAE--FSSKSSCEEDEEPAHHYDKGNGVES 419

Query: 1192 VLEAWLSAATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
            +L+AW+ A    ++    G    SL++G+ TLLHFE
Sbjct: 420  LLKAWILAQLDAITS-KAGVEVNSLILGNETLLHFE 454


>ref|XP_006286937.1| hypothetical protein CARUB_v10000082mg [Capsella rubella]
            gi|482555643|gb|EOA19835.1| hypothetical protein
            CARUB_v10000082mg [Capsella rubella]
          Length = 1128

 Score =  210 bits (535), Expect = 1e-51
 Identities = 148/448 (33%), Positives = 213/448 (47%), Gaps = 20/448 (4%)
 Frame = +1

Query: 16   CFVGLPAYLIEALQETHDGFLPPMLVLELRSKGSVWHVAWIGMPSSSHSIEISAKLASCI 195
            CFV LP  L+ ALQ T    LPP+L +ELRS    W VAW G  SSS +IE++   A  I
Sbjct: 15   CFVSLPRQLLHALQSTSSSPLPPLLPVELRSGDRRWSVAWSGSTSSSTAIEVARVFAESI 74

Query: 196  GLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEIL-------------XXXXXQEGLTFP 336
             LPD   V VRV  +VPKA  + +EPDTEDDWE+L                   E + FP
Sbjct: 75   SLPDGTVVQVRVLPNVPKATLVTVEPDTEDDWEVLELNAELAEAAILSQVRLLHETMKFP 134

Query: 337  LWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSEKKDFTV 516
            LWL   TVI F V ST P+K VVQLVPGTE+AVAPK+R +   A    +     K+   +
Sbjct: 135  LWLHDRTVIRFSVVSTFPSKGVVQLVPGTEVAVAPKRRDRNLNAKKSPDAFSPGKECSNL 194

Query: 517  KALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTTASGGSK 696
            K LLRVQ  D S+            V  TS A++ P+T K + L++ Q + ++       
Sbjct: 195  KVLLRVQDTDESVFHQADVKGFELRVALTSIAYIHPETAKKYFLESLQLISVSPRIPLQG 254

Query: 697  PKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVMLARSLCR 876
              + DE  + K  E +                     + +++F    A+GH+M++ SL  
Sbjct: 255  SAKKDEALNMKNSEASKVAENGTPSEKKEPR----RAILRLVFSDLAAKGHLMMSESLRL 310

Query: 877  YIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETIDVKENGLENGD 1053
            Y+   LH  V +R C ++   E+ AL LSP   +  ++E +   +++ +    N +  G 
Sbjct: 311  YLGAGLHSWVYLRGCNVNVDKEIPALALSPCVFKIPEKEKVLNRSADMLG-NHNSVRKG- 368

Query: 1054 DNRESDAFVTEKEIVDWNRHQNFLSACVSS--DKEGCDDSEKATCKNIVLEA----WLSA 1215
             +       T  ++ DW+ H   ++A  S    ++G  D+         LE     W  A
Sbjct: 369  -SHPPSGLSTSMDVFDWSVHDKVVTALSSEGVHEKGNQDNVYQVKNKKGLECLTRLWSLA 427

Query: 1216 ATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
                +S  V G    SL++G  T  HFE
Sbjct: 428  QLDAISS-VAGVDVSSLVVGRETFFHFE 454


>ref|XP_006448771.1| hypothetical protein CICLE_v10014090mg [Citrus clementina]
            gi|557551382|gb|ESR62011.1| hypothetical protein
            CICLE_v10014090mg [Citrus clementina]
          Length = 1134

 Score =  208 bits (529), Expect = 5e-51
 Identities = 148/450 (32%), Positives = 225/450 (50%), Gaps = 19/450 (4%)
 Frame = +1

Query: 7    IRSCFVGLPAYLIEALQETHDG-FLPPMLVLELRSKGSV-WHVAWIGMPSSSHSIEISAK 180
            + +CFV LP  LIE L+ T     LP +L LELRS+ +  W VAW G  SSS  IE++ +
Sbjct: 11   VENCFVSLPLKLIETLESTRSAHLLPQVLSLELRSRSNQRWVVAWSGATSSSSFIEVARQ 70

Query: 181  LASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXXQ-------------E 321
             A CI L D   V VRV ++VPKA  + +EP TEDDWE+L                   E
Sbjct: 71   FAECISLADHTIVQVRVVSNVPKATLVTIEPLTEDDWEVLELNSEHAEAAILNQVRIVHE 130

Query: 322  GLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSEK 501
             + FPLWL G T+I FHV ST P K VVQLVPGTE+AVAPK+RK     +         +
Sbjct: 131  AMIFPLWLHGRTIITFHVVSTFPKKPVVQLVPGTEVAVAPKRRKNDGKKHEDSYMQAFNE 190

Query: 502  KDFTVKALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPV-ILTT 678
                 KALLRVQ  D  L   C    V   V  +S AF++P+T +   L + + V IL  
Sbjct: 191  STSIAKALLRVQDSDEGLSHKCNVKGVELGVALSSVAFINPETAENVSLCSLELVAILPR 250

Query: 679  ASGGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVML 858
             S     K+++  ++  + + N               E   + V +++F   VA+GHV +
Sbjct: 251  LSS----KENNPENNAPRIKSNLTSKEISGGASTDKKECR-QAVVRLLFSNSVAKGHVKI 305

Query: 859  ARSLCRYIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETIDVKEN 1035
            AR+L  Y++  LH  V +++C ++   E+  + LSP   +  +++    +  E +D K +
Sbjct: 306  ARALRLYLNAGLHSWVYLKKCTVNLKKEIPMVSLSPCHFKMLEKDKAFGIGLE-LDNKNH 364

Query: 1036 GLENGDDNRESDAFVTEKEIVDWNRHQNFLSA--CVSSDKEGCDDSEKATCKNIVLEAWL 1209
              +   +N  S  ++ + ++   +     LS+   +  D+E     E       +L  WL
Sbjct: 365  KTKKMLENTSSGIYMDDGDLSAEDEVIAALSSEPSLKEDEEAVYQFENKKGLECLLHTWL 424

Query: 1210 SAATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
             A  + ++  + G    +L++ + TLLHFE
Sbjct: 425  LAQLNAVASNI-GSEFNTLVLSNETLLHFE 453


>ref|XP_003529444.1| PREDICTED: peroxisome biogenesis protein 1-like isoform X1 [Glycine
            max]
          Length = 1130

 Score =  207 bits (528), Expect = 6e-51
 Identities = 162/464 (34%), Positives = 224/464 (48%), Gaps = 33/464 (7%)
 Frame = +1

Query: 7    IRSCFVGLPAYLIEALQETHDGFLPPMLVLELRSKGS---VWHVAWIGMPSSSHS-IEIS 174
            I SCFV LP  LI+ LQ T    +P +L LELRS       W VAW G  SSS S IE+S
Sbjct: 11   IDSCFVSLPLSLIQTLQSTRSSPIPQILALELRSPTHPPHTWFVAWSGATSSSSSAIEVS 70

Query: 175  AKLASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXXQ------------ 318
             + A C+ LP+   V VR   +VP A  + +EP TEDDWEIL     Q            
Sbjct: 71   PQFAECVSLPNHATVQVRAAPNVPHASLVTIEPHTEDDWEILELNADQAEAQILSQVRIV 130

Query: 319  -EGLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGS 495
             EG+ FPLWL GHTVI F VAS  P   VVQL+PGTE+AVAPK+RKK    +  D    S
Sbjct: 131  HEGMRFPLWLHGHTVITFQVASVFPKNVVVQLMPGTEVAVAPKRRKK-SSDSAGDSHLDS 189

Query: 496  EKKDFTVKALLRVQWLDRSLIETCKH-GNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVIL 672
              K+ T K LLR+Q  D  L  T  H   V   V  TS AFV P+T K +     Q V +
Sbjct: 190  SNKEHTAKMLLRLQDPD-GLCSTSTHVKGVELHVGLTSVAFVHPETAKKYSFNMLQLVSI 248

Query: 673  TTASGGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHV 852
                     K++  +      +                     + + Q++  + VA GHV
Sbjct: 249  VP----RVTKENVNISRSNIMKAKSGPATNEVENGYTDKTEYRQTIVQLLISESVAEGHV 304

Query: 853  MLARSLCRYIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETIDVK 1029
            M+A+SL  Y+  +LH  V ++ C +     + +  L P   +  ++E  N V  + ++V 
Sbjct: 305  MVAKSLRLYLRASLHSWVYLKACDIILEKSIPSTSLFPCQFKLLKQE--NAVEKDGLEVF 362

Query: 1030 ENGLENGDDNRE----SDAFVTEKEIVDWNRHQNFLSACVS------SDKEGCDDSEKAT 1179
                 + D+N      S  FV   + +DW+  QN ++A +S      +++E  + S+   
Sbjct: 363  HGHKNHIDENLHAKPTSGVFV---DTIDWS-IQNEVAAALSDESSYKAEEEATNQSQNQR 418

Query: 1180 CKNIVLEAW----LSAATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
                ++  W    L A TSI     +G    SL+IG+ TLLHFE
Sbjct: 419  GLQSLVRLWYIMQLKAITSI-----SGMEVSSLIIGNKTLLHFE 457


>gb|EMJ14918.1| hypothetical protein PRUPE_ppa000485mg [Prunus persica]
          Length = 1135

 Score =  206 bits (523), Expect = 2e-50
 Identities = 146/456 (32%), Positives = 226/456 (49%), Gaps = 25/456 (5%)
 Frame = +1

Query: 7    IRSCFVGLPAYLIEALQETHDGFLPPMLVLELRSKG--SVWHVAWIGMPSSSHSIEISAK 180
            I +C+V LP  LI+ LQ +    LP +L LEL S    S W+VAW G  S+S +IE++ +
Sbjct: 11   IENCYVSLPLALIQTLQSSSSS-LPHVLALELLSSSNDSRWNVAWSGATSTSQAIEVAQQ 69

Query: 181  LASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEIL-------------XXXXXQE 321
               CI LPD   V VR  ++V KA  + +EP TEDDWE+L                   E
Sbjct: 70   FGDCISLPDHARVQVRALSNVTKATLVTIEPSTEDDWEVLELNSELAEAAILNQVRIVHE 129

Query: 322  GLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKDGSEK 501
             + FPLWL G T I F V ST P K VVQLVPGTE+AVAPK+RK V           + +
Sbjct: 130  AMRFPLWLHGRTTITFLVVSTFPRKLVVQLVPGTEVAVAPKRRKTVNSHGDSSTLASNGE 189

Query: 502  KDFTVKALLRVQWLDRSLIETCKH-GNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTT 678
            +  + KALLR+Q  DR L+    +   V   V+ TS A + P+T K F L + Q V +  
Sbjct: 190  RHIS-KALLRIQDPDRRLVHKSGYVKGVELGVVLTSVAMIHPETAKMFSLNSLQLVAVVP 248

Query: 679  ASGGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVML 858
                 +  ++ E  +   + ++               + + E + +++    VA+GHVM+
Sbjct: 249  RLSPKESMKNSE--NDGLRTRSSSTPKESNNGISNDKKDNRETIVRLLISDSVAKGHVMV 306

Query: 859  ARSLCRYIHTNLHERVIVRECLS-RANEVSALLLSPISLQSFQRE-SINRVNSETIDVKE 1032
            A+SL  Y+   LH  V ++ C      ++  L LSP   + F ++ ++ R   E +D   
Sbjct: 307  AQSLRLYLRARLHSWVYLKGCNGILKTDIPLLSLSPCHFKIFGKDKAVERNGIEVLD--R 364

Query: 1033 NGLENGDDNRESDAFVTEKEIVDWNRHQNFLSACVSSDKEGCDDSEKATCKN-------I 1191
            + +    +   +    T  ++ DW+ H   + A   S +  C + E A+ K+        
Sbjct: 365  HKIRKKKNMLLTTGSSTYIDVTDWSTHDKVVDA--FSYESSCKEDEGASQKSEEGKGVES 422

Query: 1192 VLEAWLSAATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
            +++AW+ A    ++    GE   SL++G+ T+LHFE
Sbjct: 423  LVKAWILAQLDAIAS-NAGEEINSLVLGNETILHFE 457


>ref|XP_006468418.1| PREDICTED: peroxisome biogenesis protein 1-like [Citrus sinensis]
          Length = 1134

 Score =  204 bits (520), Expect = 5e-50
 Identities = 154/455 (33%), Positives = 228/455 (50%), Gaps = 24/455 (5%)
 Frame = +1

Query: 7    IRSCFVGLPAYLIEALQETHDG-FLPPMLVLELRSKGSV-WHVAWIGMPSSSHSIEISAK 180
            + +CFV LP  LIE L+ T     LP +L LELRS+ +  W VAW G  SSS  IE++ +
Sbjct: 11   VENCFVSLPLKLIETLESTRSAHLLPQVLSLELRSRSNQRWVVAWSGATSSSSFIEVARQ 70

Query: 181  LASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXXQ-------------E 321
             A CI L D   V VRV ++V KA  + +EP TEDDWE+L                   E
Sbjct: 71   FAECISLADHTIVQVRVVSNVLKATLVTIEPLTEDDWEVLELNSEHAEAAILNQVRIVHE 130

Query: 322  GLTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKDEKD---- 489
             + FPLWL G T+I FHV ST P K VVQLVPGTE+AVAPK+RK     N+K  +D    
Sbjct: 131  AMRFPLWLHGRTIITFHVVSTFPKKPVVQLVPGTEVAVAPKRRKN----NVKKHEDSYMQ 186

Query: 490  GSEKKDFTVKALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPV- 666
               +     KALLRVQ  D  L   C    V   V  TS AF++P+T +   L + + V 
Sbjct: 187  AFNESTSIAKALLRVQDSDEGLSHKCNVKGVELGVALTSVAFINPETAENVSLCSLELVA 246

Query: 667  ILTTASGGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARG 846
            IL   S     K+++  ++  + + N               E   + V  ++F   VA+G
Sbjct: 247  ILPRLSS----KENNPENNAPRIKSNLTSKEISGGASTDKKECR-QAVVHLLFSDSVAKG 301

Query: 847  HVMLARSLCRYIHTNLHERVIVREC-LSRANEVSALLLSPISLQSFQRESINRVNSETID 1023
            HV +AR+L  Y++  LH  V +++C ++   E+  + LSP   +  +++    +  E +D
Sbjct: 302  HVKIARALRLYLNAGLHSWVYLKKCTVNLKKEIPMVSLSPCHFKMLEKDKAFGIGLE-LD 360

Query: 1024 VKENGLENGDDNRESDAFVTEKEIVDWNRHQNFLSACVSS--DKEGCDDSEKATCKNIVL 1197
             K +  +   +   S  ++ + ++   +     LS+  SS  D+E     E       +L
Sbjct: 361  NKNHKTKKMLEKTSSGIYMDDGDLSAEDDIIAALSSEPSSKEDEEAVYQFENKKGLECLL 420

Query: 1198 EAWLSA-ATSILSKLVTGEGAGSLLIGSNTLLHFE 1299
              WL A  T++ S +  G    +L++ + TLLHFE
Sbjct: 421  HTWLLAQLTAVASNI--GSEFNTLVLSNETLLHFE 453


>ref|XP_003574917.1| PREDICTED: peroxisome biogenesis protein 1-like [Brachypodium
            distachyon]
          Length = 1091

 Score =  203 bits (516), Expect = 2e-49
 Identities = 148/470 (31%), Positives = 219/470 (46%), Gaps = 22/470 (4%)
 Frame = +1

Query: 10   RSCFVGLPAYLIEALQETH-DGFLPPMLVLELRSKGSV-WHVAWIGMPSSSHSIEISAKL 183
            RSCFV LP +LI+AL  T   G LPP+L L+LRS     W +AW G  S S +IE++ +L
Sbjct: 18   RSCFVALPLHLIQALSRTSATGDLPPVLALDLRSPARARWSLAWSGAASRSRAIEVAQEL 77

Query: 184  ASCIGLPDLIEVHVRVRTDVPKALTIELEPDTEDDWEILXXXXX-------------QEG 324
            A CI LPD     + V   + +A ++ +EP +EDDWEIL                   EG
Sbjct: 78   AECISLPDGTIAQLSVARSLTRADSVSIEPFSEDDWEILESRADLAEETILQQVGIVYEG 137

Query: 325  LTFPLWLRGHTVINFHVASTKPAKSVVQLVPGTELAVAPKQRKKVPIANIKD-EKDGSEK 501
            + FPLWL GH ++ F V S+ P KSVVQLVPGTE+AVAPK+RK+      KD +K  S  
Sbjct: 138  MKFPLWLDGHNIVKFVVVSSTPKKSVVQLVPGTEVAVAPKKRKE----KYKDVQKQSSLN 193

Query: 502  KDFTVKALLRVQWLDRSLIETCKHGNVSFDVLPTSFAFVSPQTGKYFKLKNGQPVILTTA 681
            +    KALLRVQ  D       K+  +   V+ +    + P T     L N Q   L T 
Sbjct: 194  EQVQTKALLRVQAADNKYAHKFKYKGIELGVVLSCAVLIHPDTAARTSLGNLQ---LVTI 250

Query: 682  SGGSKPKQDDEMHHGKKKEQNXXXXXXXXXXXXXXXEGDVEVVCQIIFQKGVARGHVMLA 861
            S  S PK   +   G +K+                 E D E+   ++F   VA+GHVML 
Sbjct: 251  SSKSSPKGIQKGKEGAQKK-----------GVLAPKERDQEMAVYVLFSDTVAKGHVMLP 299

Query: 862  RSLCRYIHTNLHERVIVRECLSRA-NEVSALLLSPISLQSFQRESINR--VNSETIDVKE 1032
             SL  +I  + H  V V+ C +    +   + +SP+      ++  +   + S+ +D   
Sbjct: 300  PSLRHFISADTHSWVYVKTCSANVKKDEPVITISPLRFNKHGKDEHDNSDLGSQEMDTWR 359

Query: 1033 N---GLENGDDNRESDAFVTEKEIVDWNRHQNFLSACVSSDKEGCDDSEKATCKNIVLEA 1203
                  ENGD               D    ++ LSA V+S  E   +      + ++++ 
Sbjct: 360  KTRIHSENGD------------SFQDARNSEDILSAAVNSTSESMSE------QKVLIKH 401

Query: 1204 WLSAATSILSKLVTGEGAGSLLIGSNTLLHFEAYSEHGTVTYDVLFLLSV 1353
            WL      +          S+++ +  L+HFE   +      + L+LL++
Sbjct: 402  WLIGQLKEMGLHAETSEMSSVVLPAKVLIHFEVVDQKQNRGVEFLYLLTI 451


Top