BLASTX nr result

ID: Achyranthes22_contig00015638 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00015638
         (1359 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002515156.1| monoxygenase, putative [Ricinus communis] gi...   372   e-100
gb|EMJ01313.1| hypothetical protein PRUPE_ppa018848mg [Prunus pe...   360   8e-97
gb|EXC30730.1| 3-hydroxybenzoate 6-hydroxylase 1 [Morus notabilis]    356   1e-95
ref|XP_006283848.1| hypothetical protein CARUB_v10004954mg [Caps...   345   3e-92
ref|XP_006283836.1| hypothetical protein CARUB_v10004937mg, part...   344   6e-92
ref|XP_004237255.1| PREDICTED: FAD-dependent urate hydroxylase-l...   343   1e-91
ref|XP_006414451.1| hypothetical protein EUTSA_v10025403mg [Eutr...   341   5e-91
ref|XP_006414450.1| hypothetical protein EUTSA_v10025376mg [Eutr...   335   2e-89
ref|XP_002868183.1| monooxygenase [Arabidopsis lyrata subsp. lyr...   332   2e-88
ref|NP_193311.6| monooxygenase 1 [Arabidopsis thaliana] gi|33265...   325   2e-86
emb|CAA07574.1| monooxygenase [Arabidopsis thaliana] gi|51968448...   324   5e-86
dbj|BAD43227.1| unnamed protein product [Arabidopsis thaliana]        323   1e-85
ref|XP_002868182.1| hypothetical protein ARALYDRAFT_355191 [Arab...   323   1e-85
dbj|BAD43328.1| unnamed protein product [Arabidopsis thaliana]        323   1e-85
dbj|BAD44358.1| unnamed protein product [Arabidopsis thaliana]        322   3e-85
dbj|BAD44627.1| unnamed protein product [Arabidopsis thaliana] g...   321   4e-85
gb|EOY03035.1| Monooxygenase, putative isoform 1 [Theobroma cacao]    318   3e-84
ref|NP_001190738.1| monooxygenase 1 [Arabidopsis thaliana] gi|33...   318   4e-84
gb|EOY17302.1| Monooxygenase, putative [Theobroma cacao]              317   7e-84
ref|XP_003533524.1| PREDICTED: zeaxanthin epoxidase, chloroplast...   310   7e-82

>ref|XP_002515156.1| monoxygenase, putative [Ricinus communis] gi|223545636|gb|EEF47140.1|
            monoxygenase, putative [Ricinus communis]
          Length = 397

 Score =  372 bits (954), Expect = e-100
 Identities = 193/389 (49%), Positives = 266/389 (68%), Gaps = 1/389 (0%)
 Frame = +2

Query: 125  LATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGLDSTLKSTAIPILRAI 304
            LATALALHRKGI SVVLERS+ LRA+GA I +  NGWR L +LG+ S ++ TA+P+ R  
Sbjct: 19   LATALALHRKGIRSVVLERSETLRAAGAGIAVLTNGWRALDELGVGSKIRPTALPLQRYH 78

Query: 305  DVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRFGTQVISVDVEKSSSY 484
             ++    V       +I  GEARC++RSDL+E LA+ LPL TIRFG  ++SV+++   S+
Sbjct: 79   PILIAPIV-------MIEIGEARCVKRSDLIEALADDLPLGTIRFGCDILSVNLDPEISF 131

Query: 485  ALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRGMTTYPNGHGYAPEFL 664
             +LQL  GSSIKAK LIGCDGANSV+++++ LKP  LFS  +VRG T YPNGHG APE +
Sbjct: 132  PILQLSNGSSIKAKALIGCDGANSVVSDFLELKPKKLFSLCAVRGFTHYPNGHGLAPELI 191

Query: 665  RVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMILEKSGGLSKDSVEML 844
            R+ K     GR+P+D+  V+WF    +  K++  P+DP  ++Q  LE       + +EM+
Sbjct: 192  RMVKGNVLCGRVPVDDNLVFWFIIQNFFPKDTNIPKDPELMRQFSLESIKDFPTERLEMV 251

Query: 845  EKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFIGQGGSVALEDAIVLA 1024
            +  +++SLS+T LR R PW +  G FR+GT  VAGDA H+MGPFIGQGGS A+EDA+VLA
Sbjct: 252  KNCEVTSLSLTHLRYRTPWEIYLGKFRRGTATVAGDAMHIMGPFIGQGGSAAIEDAVVLA 311

Query: 1025 RCLSNKIGN-GDPNSERRMNLSAHKTMEALDEFFKERRRRVVLISTLSYLTGILIVEDPS 1201
            RCLS K+   G   S    ++ + K  EA D++ KERR R+V +ST +YL G L+ ++ S
Sbjct: 312  RCLSAKMQEVGQLKSS--SHIMSQKIGEAFDDYVKERRMRLVWLSTQTYLYGSLL-QNSS 368

Query: 1202 FLLRFMCIVIILVFFGNPLNYTKYDCGVL 1288
             L++    V ++V FGNP+ +T+YDCG L
Sbjct: 369  RLVKVSIAVAMIVLFGNPIYHTRYDCGPL 397


>gb|EMJ01313.1| hypothetical protein PRUPE_ppa018848mg [Prunus persica]
          Length = 387

 Score =  360 bits (924), Expect = 8e-97
 Identities = 187/390 (47%), Positives = 256/390 (65%), Gaps = 2/390 (0%)
 Frame = +2

Query: 125  LATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGLDSTLKSTAIPILRAI 304
            LATALALHRKG+ SVVLERS+ LRA+GA ITI  NGWR L +LG+ S L+ TA+P     
Sbjct: 19   LATALALHRKGLRSVVLERSESLRATGAGITIRTNGWRALDELGVASKLRQTAMP----- 73

Query: 305  DVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRFGTQVISVDVEKSSSY 484
                           L   GE RC++R DL+  LA SLP  TIR G Q +SV ++ S+S 
Sbjct: 74   ---------------LQGGGETRCLKRMDLITALAESLPRGTIRLGCQALSVRLDSSTSS 118

Query: 485  ALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRGMTTYPNGHGYAPEFL 664
              L L  GSSIKAKVLIGCDG NSV+A+++ LKP+ LFS   VRG T YP+GH +  +F+
Sbjct: 119  PSLHLQNGSSIKAKVLIGCDGTNSVVADFLDLKPSKLFSLSEVRGFTMYPSGHNFGNQFV 178

Query: 665  RVRKDENFLGRIPIDEKTVYWFFS--IRYTQKESKKPEDPAFIKQMILEKSGGLSKDSVE 838
            +V+ D+  +GRIPI  K VYWF +  + Y +   + P+DP  I+Q+ LE       + ++
Sbjct: 179  QVKGDKCTVGRIPIHNKLVYWFVTQKVMYGRGGLEVPKDPELIRQLTLEAIKDFPSEMID 238

Query: 839  MLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFIGQGGSVALEDAIV 1018
            M+ KSD  SLS T+LR R+PW+++  NFRKG++ VAGDA H MGPF+GQGGS  +ED+IV
Sbjct: 239  MISKSDTKSLSNTRLRYRSPWDILVRNFRKGSVTVAGDAMHTMGPFLGQGGSAGIEDSIV 298

Query: 1019 LARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLISTLSYLTGILIVEDP 1198
            +ARCL+ ++        R  N+   K  EALD++ KERR R+VL+ST +YL G L+ +D 
Sbjct: 299  IARCLAQELAENYDKKSRARNIMMMKVEEALDKYVKERRMRLVLLSTQTYLAG-LLQQDS 357

Query: 1199 SFLLRFMCIVIILVFFGNPLNYTKYDCGVL 1288
              +++F+CI ++   F +   +T+YDCG L
Sbjct: 358  GLIVKFVCIFLMTALFSDMTRHTRYDCGCL 387


>gb|EXC30730.1| 3-hydroxybenzoate 6-hydroxylase 1 [Morus notabilis]
          Length = 404

 Score =  356 bits (913), Expect = 1e-95
 Identities = 188/406 (46%), Positives = 265/406 (65%), Gaps = 2/406 (0%)
 Frame = +2

Query: 77   AMEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLG 256
            A EE            LATALALHRKGI+SVVLERS+ LRA G+AI I  NGWR L QLG
Sbjct: 3    AAEEIDIVIVGAGICGLATALALHRKGIKSVVLERSETLRAFGSAIAILTNGWRALDQLG 62

Query: 257  LDSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIR 436
            +   L+ TA+P+    D+  D    R      +S+GEARC++RSDL+  LA  LP  TIR
Sbjct: 63   IGPKLRQTALPLQGVRDIWLDGNKQR---RGPLSKGEARCVKRSDLINMLAQDLPHGTIR 119

Query: 437  FGTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVR 616
            FG  ++ V+++  +++ +LQL  G +IKAK+LIGCDGA+SV+A Y+ +KP   F +F +R
Sbjct: 120  FGCHILFVELDPLTNFPILQLRDGRAIKAKILIGCDGASSVVAEYLKVKPKKSFPAFGIR 179

Query: 617  GMTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQM 796
            G+T YP+ HG+ PEF+R   +    GR  I++  V+WF  +    K+S+  +DP  IKQM
Sbjct: 180  GLTYYPSPHGFDPEFVRTHGNNVVCGRSTINQNLVFWFLLLPGYLKDSEIFKDPELIKQM 239

Query: 797  ILEKSG-GLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGP 973
             LEK+     K+++EM++  D++SLS+T L  R  W+++ G FRKG + +AGD+ HVMGP
Sbjct: 240  ALEKTNDAFPKETIEMIKDCDITSLSLTHLWYRPAWDILLGTFRKGMVTLAGDSMHVMGP 299

Query: 974  FIGQGGSVALEDAIVLARCLSNKI-GNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVL 1150
            F+GQGGS A+EDA+VLARCL+NKI G      E    L   K  EA+D + KERR R+V 
Sbjct: 300  FLGQGGSAAMEDAVVLARCLANKIHGESINGFEGNNGLFRKKMEEAMDLYVKERRMRLVR 359

Query: 1151 ISTLSYLTGILIVEDPSFLLRFMCIVIILVFFGNPLNYTKYDCGVL 1288
            +S  SY+TG+L     S + + + + +I+V F +P+ +T+YDCG L
Sbjct: 360  LSAQSYVTGLLF-SSASMIGKILLLALIIVLFQDPIRHTRYDCGHL 404


>ref|XP_006283848.1| hypothetical protein CARUB_v10004954mg [Capsella rubella]
            gi|482552553|gb|EOA16746.1| hypothetical protein
            CARUB_v10004954mg [Capsella rubella]
          Length = 404

 Score =  345 bits (885), Expect = 3e-92
 Identities = 182/408 (44%), Positives = 267/408 (65%), Gaps = 5/408 (1%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT+LALHRKGI+SVVLERS+ +R+ GAA  I  NGW  L QLG+
Sbjct: 1    MEELDIVIVGGGIAGLATSLALHRKGIKSVVLERSESVRSQGAAFGIQTNGWLALEQLGV 60

Query: 260  DSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRF 439
               L+  ++PI +  DV+ +KG+ R     L S GE R + R+DLV  LA++LPL T+R 
Sbjct: 61   ADKLRLNSLPIPQIRDVMFEKGIKRRESVGLASYGEVRGVIRNDLVRALAHALPLGTLRL 120

Query: 440  GTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRG 619
            G Q++SV +++++S+ ++ +  G  IKAKVLIGCDG+NS+++ ++GL PT    + +VRG
Sbjct: 121  GCQIVSVQLDETTSFPIVHVQNGEPIKAKVLIGCDGSNSIVSRFLGLNPTKALGARAVRG 180

Query: 620  MTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKE-----SKKPEDPAF 784
             T YP+GH +  EF+R++ D    GR+PI  K V+WF  +    +E      KK ED   
Sbjct: 181  FTNYPDGHEFPNEFIRIKMDNVVCGRLPITHKLVFWFVVLLNCPQELDSNLVKKQED--- 237

Query: 785  IKQMILEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHV 964
            I ++ L   G  S+D  EM++  D+ SL +++LR RAPW+++SG FR+GT+ VAGD+ H+
Sbjct: 238  ITRLTLTSIGEFSEDWKEMVKNCDMDSLYISRLRYRAPWDVMSGKFRRGTVTVAGDSMHL 297

Query: 965  MGPFIGQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRV 1144
            MGPF+GQG S ALED +VLARCL  K+G    NS    + S  +  EA+DE+ +ERR R+
Sbjct: 298  MGPFLGQGTSAALEDGVVLARCLWRKLGQNSVNSNVSYSASRTQFEEAIDEYIRERRGRL 357

Query: 1145 VLISTLSYLTGILIVEDPSFLLRFMCIVIILVFFGNPLNYTKYDCGVL 1288
            V +ST +YLTG LI E  S + + + +V++++ F + + +T+YDCG L
Sbjct: 358  VGLSTQTYLTGCLI-EASSPVRKILFVVLLMILFRDRIGHTRYDCGRL 404


>ref|XP_006283836.1| hypothetical protein CARUB_v10004937mg, partial [Capsella rubella]
            gi|482552541|gb|EOA16734.1| hypothetical protein
            CARUB_v10004937mg, partial [Capsella rubella]
          Length = 410

 Score =  344 bits (882), Expect = 6e-92
 Identities = 185/412 (44%), Positives = 273/412 (66%), Gaps = 1/412 (0%)
 Frame = +2

Query: 56   REKQVLVAMEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGW 235
            R + ++  MEE            LAT+LALHRKGI+SVVLER++++R+ GA I    NGW
Sbjct: 6    RSENIISQMEEVGILIVGGGIAGLATSLALHRKGIKSVVLERAEQVRSEGAGIGTLTNGW 65

Query: 236  RVLGQLGLDSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANS 415
            R L QLG+   L+ T++ I +A  ++ + G  +     L    EARCI+R+DLVE LA++
Sbjct: 66   RALDQLGVGHRLRLTSLLIHKARTMLIENG--KTQEFVLTIADEARCIKRNDLVEALADA 123

Query: 416  LPLKTIRFGTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCL 595
            LP  TIRFG+Q++S++ ++++S+ ++QL  G +IKAK+LIGCDGANSV+++Y+ L P   
Sbjct: 124  LPQGTIRFGSQIVSINEDQTTSFPVVQLSNGKTIKAKILIGCDGANSVVSDYLQLGPRKA 183

Query: 596  FSSFSVRGMTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPED 775
            FS  +VRG T YPNGHG+  E LR++K    +GR+P+ E  V+WF  + + Q    K ED
Sbjct: 184  FSCRAVRGFTNYPNGHGFPQELLRIKKGNILVGRLPLTENQVFWF--LVHMQDNHYKVED 241

Query: 776  PAFIKQMILEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDA 955
               I  + L+    +S++  EM++  ++ SLS+T LR RAP  ++ G FR+GT+ VAGDA
Sbjct: 242  QESIANLCLKWVDEMSQEWKEMVKICNVESLSLTHLRYRAPSEIMLGKFRRGTVTVAGDA 301

Query: 956  WHVMGPFIGQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERR 1135
             HVMGPF+GQGGS ALEDA+VLARCL+ K+  G    +   + S     E +DE+ KERR
Sbjct: 302  MHVMGPFLGQGGSAALEDAVVLARCLARKV--GPDQGDLLKDCSMRSIEEGIDEYVKERR 359

Query: 1136 RRVVLISTLSYLTGILIVEDPSFLLRFMCIVIILVFFG-NPLNYTKYDCGVL 1288
             R++ +S  +YLTG   ++ PS ++R M IV++++ FG + + +TKYDCG L
Sbjct: 360  MRLLGLSVQTYLTG-RSLQTPSKVVRLMFIVLLVLLFGRDQIRHTKYDCGRL 410


>ref|XP_004237255.1| PREDICTED: FAD-dependent urate hydroxylase-like [Solanum
            lycopersicum]
          Length = 394

 Score =  343 bits (879), Expect = 1e-91
 Identities = 179/388 (46%), Positives = 261/388 (67%)
 Frame = +2

Query: 125  LATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGLDSTLKSTAIPILRAI 304
            LATALALHRKG++SVVLE+S+ LR+ GAAI + PNGW+ L QLG+   L++TA+P+    
Sbjct: 22   LATALALHRKGVKSVVLEKSESLRSEGAAIGVLPNGWKALDQLGVAPYLRTTALPLQGMR 81

Query: 305  DVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRFGTQVISVDVEKSSSY 484
                DKG  +      I  GE RC++RSD+VET A++LP +TIRFG  ++SV+++  +S 
Sbjct: 82   ITWMDKGNEKFTPYKNI--GEVRCLKRSDIVETFADALPPRTIRFGCDIVSVEMDPITSL 139

Query: 485  ALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRGMTTYPNGHGYAPEFL 664
              + L  G+ I AKVLIGCDG+ S++A+++GLKP   F + ++RG+T+YPNGH +  EF+
Sbjct: 140  PSILLSNGNRIGAKVLIGCDGSRSIVASFLGLKPAKTFRTCAIRGLTSYPNGHSFPLEFV 199

Query: 665  RVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMILEKSGGLSKDSVEML 844
            R+   +  +GR+PI +K V+WF S++    ++K P+D   IKQ  +E   G   D  EM+
Sbjct: 200  RLIVGQTAVGRLPITDKLVHWFVSVQ-QGTDAKFPQDTQVIKQRAMEAVIGHPADVQEMI 258

Query: 845  EKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFIGQGGSVALEDAIVLA 1024
            +K DL SL  + LR RAPW+L+ GNFR+ T+ VAGDA HVMGPF+GQGGS  +EDA+VL 
Sbjct: 259  KKCDLDSLWFSHLRYRAPWDLMFGNFREKTVTVAGDAMHVMGPFLGQGGSSGIEDAVVLG 318

Query: 1025 RCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLISTLSYLTGILIVEDPSF 1204
            R           N  + +N S     EA++++ KER+ RVV ++T SYLTG+L  E+   
Sbjct: 319  R-----------NLAKTINGSCFDHEEAVNQYIKERKMRVVKLATQSYLTGLLF-ENRPM 366

Query: 1205 LLRFMCIVIILVFFGNPLNYTKYDCGVL 1288
            L + + + ++ +FF NP  +T+YDCG+L
Sbjct: 367  LTKIVIVAVMAIFFRNPSAHTQYDCGLL 394


>ref|XP_006414451.1| hypothetical protein EUTSA_v10025403mg [Eutrema salsugineum]
            gi|557115621|gb|ESQ55904.1| hypothetical protein
            EUTSA_v10025403mg [Eutrema salsugineum]
          Length = 394

 Score =  341 bits (874), Expect = 5e-91
 Identities = 184/404 (45%), Positives = 270/404 (66%), Gaps = 1/404 (0%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT+LALHRKGI+SVVLER++++R+ GA I    NGWR L QLG+
Sbjct: 1    MEEIGIVIVGGGIAGLATSLALHRKGIKSVVLERAEKVRSEGAGIGTLTNGWRALDQLGV 60

Query: 260  DSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRF 439
               L+ T+  I +A  ++ + G  R    ++  E EARCIRR+DLVE LA++LP +TIRF
Sbjct: 61   SHRLRLTSNLIRKARTMLIENGKKREFVLNI--EDEARCIRRNDLVEALADALPEETIRF 118

Query: 440  GTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRG 619
            G+Q++S++ ++++S+ ++ L  G++IKAKVLIGCDGANSV+++Y+ L P   F+  +VRG
Sbjct: 119  GSQIVSIEEDETTSFPVVHLTNGNTIKAKVLIGCDGANSVVSDYLRLSPKKAFACRAVRG 178

Query: 620  MTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMI 799
             T YPNGHG+  E LR++     +GR+P+ +  V+WF  + + Q       D   I  + 
Sbjct: 179  FTNYPNGHGFPQELLRMKTGNVLVGRLPLTDNLVFWF--VVHMQDNHHNGTDQESIANVT 236

Query: 800  LEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFI 979
            L+    LS+D  EM++K D+ SL++T LR R+PW ++   FR+GT+ VAGDA HVMGPF+
Sbjct: 237  LKWVDKLSEDWQEMVQKCDVESLTITHLRYRSPWEIMFRKFRRGTVTVAGDAMHVMGPFL 296

Query: 980  GQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLIST 1159
            GQGGS ALEDA+VLARCL+ K+G      +   + S     EA+DE+ ++RR R+V +ST
Sbjct: 297  GQGGSAALEDAVVLARCLAKKVG-----PDHGEDCSMKNIEEAIDEYVEKRRMRLVGLST 351

Query: 1160 LSYLTGILIVEDPSFLLRFMCIVIILVFFG-NPLNYTKYDCGVL 1288
             +YLTG   ++  S ++R M IV+++V FG + + +TKYDCG L
Sbjct: 352  QTYLTG-RSLQTQSNVVRLMFIVLLVVLFGRDQIRHTKYDCGRL 394


>ref|XP_006414450.1| hypothetical protein EUTSA_v10025376mg [Eutrema salsugineum]
            gi|557115620|gb|ESQ55903.1| hypothetical protein
            EUTSA_v10025376mg [Eutrema salsugineum]
          Length = 398

 Score =  335 bits (860), Expect = 2e-89
 Identities = 175/403 (43%), Positives = 261/403 (64%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT+LALHRKGI+S+VLERS+ +R+ GAA  I  NGW  L QLGL
Sbjct: 1    MEELDIVILGGGIAGLATSLALHRKGIKSIVLERSETVRSEGAAFGIQTNGWLALQQLGL 60

Query: 260  DSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRF 439
               L+  ++PI +  DV+ ++G+ R       S GE R + R+DLV  LA+ LPL T+R 
Sbjct: 61   ADKLRPNSLPIHQIRDVLIEEGIKRRESVGPASYGEVRGVIRNDLVRALAHELPLGTLRL 120

Query: 440  GTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRG 619
            G Q++SV ++++ S+ ++ +  G  IK+KVLIGCDG+NSV++ ++GLKPT   SS +VRG
Sbjct: 121  GCQIVSVKLDETLSFPIVHVKNGQDIKSKVLIGCDGSNSVVSEFLGLKPTKSLSSRAVRG 180

Query: 620  MTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMI 799
             T YP+GHG+  EF+R++ D    GR+PI  K V+WF  +    ++S    +   I +  
Sbjct: 181  FTNYPDGHGFRQEFIRIKMDNVVSGRLPITPKLVFWFVVLLKCPQDSNFLRNQEDIARFT 240

Query: 800  LEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFI 979
            L      S++  EM++  D++SL + +LR RAPW+++SG FR+GT+ VAGD+ H+MGPF+
Sbjct: 241  LSSVNDFSQEWKEMVKNCDINSLYINRLRYRAPWDVMSGKFRRGTVTVAGDSMHLMGPFL 300

Query: 980  GQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLIST 1159
            GQG S ALED +VLARCL  K+G    N+      S  +  EA+D++ +ERR R+V +ST
Sbjct: 301  GQGCSAALEDGVVLARCLWRKLGQDGMNNV----FSRKRIEEAIDDYVRERRGRLVRLST 356

Query: 1160 LSYLTGILIVEDPSFLLRFMCIVIILVFFGNPLNYTKYDCGVL 1288
             +YLT  LI E  S + + + +V++++ F + + +T+YDCG L
Sbjct: 357  QTYLTSRLI-EASSPVTKLLVVVLLMIMFRDQIGHTRYDCGRL 398


>ref|XP_002868183.1| monooxygenase [Arabidopsis lyrata subsp. lyrata]
            gi|297314019|gb|EFH44442.1| monooxygenase [Arabidopsis
            lyrata subsp. lyrata]
          Length = 397

 Score =  332 bits (852), Expect = 2e-88
 Identities = 178/404 (44%), Positives = 269/404 (66%), Gaps = 1/404 (0%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT+LALHRKGI+SVVLER++++R+ GA I    NGWR L QLG+
Sbjct: 1    MEEIGIVIVGGGIAGLATSLALHRKGIKSVVLERAEKVRSEGAGIGTLTNGWRALDQLGV 60

Query: 260  DSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRF 439
               L+ T+  I +A  ++ + G  +    +L+ E  ARCI+R+DLVE LA++LP  TIRF
Sbjct: 61   GDRLRLTSRLIHKARTMLIENGKKQEFVSTLVDE--ARCIKRNDLVEALADALPEGTIRF 118

Query: 440  GTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRG 619
            G+Q++S++ +KS+S+ ++ L  G++I+AKVLIGCDGANS+++ Y+ L P   F+  +VRG
Sbjct: 119  GSQIVSIEEDKSTSFPVVHLTNGNTIEAKVLIGCDGANSIVSEYLQLNPKKAFACRAVRG 178

Query: 620  MTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMI 799
             T YPNGHG+  E LR+++    +GR+P+ +  V+WF  + + Q  +   +D   I  + 
Sbjct: 179  FTNYPNGHGFPQEVLRIKQGNILIGRLPLTDNLVFWF--LVHMQDNNHNGKDQESIANLC 236

Query: 800  LEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFI 979
            L+ +  LS+D  EM++  D+ SL++T LR RAP  ++ G FR+GT+ VAGDA HVMGPF+
Sbjct: 237  LKWAEDLSEDWKEMVKICDVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFL 296

Query: 980  GQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLIST 1159
             QGGS ALEDA+VLARCL+ K+  G  + +   + S     EA+DE+ +ERR R++ +S 
Sbjct: 297  AQGGSAALEDAVVLARCLARKV--GPDHGDLLKDCSMKNIEEAIDEYVEERRMRLLGLSV 354

Query: 1160 LSYLTGILIVEDPSFLLRFMCIVIILVFFG-NPLNYTKYDCGVL 1288
             +YLTG   ++  S +LR M I ++L+ FG + + +++YDCG L
Sbjct: 355  QTYLTG-RSLQTSSKVLRLMFIALLLLLFGRDQIRHSRYDCGRL 397


>ref|NP_193311.6| monooxygenase 1 [Arabidopsis thaliana] gi|332658247|gb|AEE83647.1|
            monooxygenase 1 [Arabidopsis thaliana]
          Length = 422

 Score =  325 bits (834), Expect = 2e-86
 Identities = 174/412 (42%), Positives = 268/412 (65%), Gaps = 1/412 (0%)
 Frame = +2

Query: 56   REKQVLVAMEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGW 235
            R + +   MEE            LAT++ALHRKGI+SVVLER++++R+ GA I    NGW
Sbjct: 18   RSENLSFEMEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGW 77

Query: 236  RVLGQLGLDSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANS 415
            R L QLG+   L+  +  I +A  ++ + G  R    +++ E  ARCI+R+DLVE L+++
Sbjct: 78   RALDQLGVGDRLRLNSSLIHKARTMLIENGKKREFVSNIVDE--ARCIKRNDLVEALSDA 135

Query: 416  LPLKTIRFGTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCL 595
            LP  TIRFG+ ++S++ +K++ + ++ L  G+SIKAKVLIGCDGANS++++Y+ L P   
Sbjct: 136  LPKGTIRFGSHIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKA 195

Query: 596  FSSFSVRGMTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPED 775
            F+  +VRG T YPNGHG+  E LR+++    +GR+P+ +  V+WF  + + Q  +   +D
Sbjct: 196  FACRAVRGFTKYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWF--LVHMQDNNHNGKD 253

Query: 776  PAFIKQMILEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDA 955
               I  +  + +  LS+D  EM++  ++ SL++T LR RAP  ++ G FR+GT+ VAGDA
Sbjct: 254  QESIANLCRKWADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDA 313

Query: 956  WHVMGPFIGQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERR 1135
             HVMGPF+ QGGS ALEDA+VLARCL+ K+  G  + +   + S     EA+DE+  ERR
Sbjct: 314  MHVMGPFLAQGGSAALEDAVVLARCLARKV--GPDHGDLLKDCSMKNIEEAIDEYVDERR 371

Query: 1136 RRVVLISTLSYLTGILIVEDPSFLLRFMCIVIILVFFG-NPLNYTKYDCGVL 1288
             R++ +S  +YLTG   ++  S +LR M I ++L+ FG + + +T+YDCG L
Sbjct: 372  MRLLGLSVQTYLTG-RSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 422


>emb|CAA07574.1| monooxygenase [Arabidopsis thaliana] gi|51968448|dbj|BAD42916.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|51968540|dbj|BAD42962.1| unnamed protein product
            [Arabidopsis thaliana] gi|51968730|dbj|BAD43057.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|51968814|dbj|BAD43099.1| unnamed protein product
            [Arabidopsis thaliana] gi|51968850|dbj|BAD43117.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|51968966|dbj|BAD43175.1| unnamed protein product
            [Arabidopsis thaliana] gi|51969074|dbj|BAD43229.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|51969116|dbj|BAD43250.1| unnamed protein product
            [Arabidopsis thaliana] gi|51970812|dbj|BAD44098.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|51971010|dbj|BAD44197.1| unnamed protein product
            [Arabidopsis thaliana] gi|51971188|dbj|BAD44286.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|51971399|dbj|BAD44364.1| unnamed protein product
            [Arabidopsis thaliana] gi|51971599|dbj|BAD44464.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|51971627|dbj|BAD44478.1| unnamed protein product
            [Arabidopsis thaliana] gi|51971681|dbj|BAD44505.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|51971689|dbj|BAD44509.1| unnamed protein product
            [Arabidopsis thaliana]
          Length = 397

 Score =  324 bits (831), Expect = 5e-86
 Identities = 173/404 (42%), Positives = 265/404 (65%), Gaps = 1/404 (0%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT++ALHRKGI+SVVLER++++R+ GA I    NGWR L QLG+
Sbjct: 1    MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60

Query: 260  DSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRF 439
               L+  +  I +A  ++ + G  R    +++ E  ARCI+R+DLVE L+++LP  TIRF
Sbjct: 61   GDRLRLNSSLIHKARTMLIENGKKREFVSNIVDE--ARCIKRNDLVEALSDALPKGTIRF 118

Query: 440  GTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRG 619
            G+ ++S++ +K++ + ++ L  G+SIKAKVLIGCDGANS++++Y+ L P   F+  +VRG
Sbjct: 119  GSHIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRG 178

Query: 620  MTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMI 799
             T YPNGHG+  E LR+++    +GR+P+ +  V+WF  + + Q  +   +D   I  + 
Sbjct: 179  FTKYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWF--LVHMQDNNHNGKDQESIANLC 236

Query: 800  LEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFI 979
             + +  LS+D  EM++  ++ SL++T LR RAP  ++ G FR+GT+ VAGDA HVMGPF+
Sbjct: 237  RKWADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFL 296

Query: 980  GQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLIST 1159
             QGGS ALEDA+VLARCL+ K+  G  + +   + S     EA+DE+  ERR R++ +S 
Sbjct: 297  AQGGSAALEDAVVLARCLARKV--GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSV 354

Query: 1160 LSYLTGILIVEDPSFLLRFMCIVIILVFFG-NPLNYTKYDCGVL 1288
             +YLTG   ++  S +LR M I ++L+ FG + + +T+YDCG L
Sbjct: 355  QTYLTG-RSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 397


>dbj|BAD43227.1| unnamed protein product [Arabidopsis thaliana]
          Length = 397

 Score =  323 bits (828), Expect = 1e-85
 Identities = 173/404 (42%), Positives = 264/404 (65%), Gaps = 1/404 (0%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT++ALHRKGI+SVVLER++++R+ GA I    NGWR L QLG+
Sbjct: 1    MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60

Query: 260  DSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRF 439
               L   +  I +A  ++ + G  R    +++ E  ARCI+R+DLVE L+++LP  TIRF
Sbjct: 61   GDRLHLNSSLIHKARTMLIENGKKREFVSNIVDE--ARCIKRNDLVEALSDALPKGTIRF 118

Query: 440  GTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRG 619
            G+ ++S++ +K++ + ++ L  G+SIKAKVLIGCDGANS++++Y+ L P   F+  +VRG
Sbjct: 119  GSHIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRG 178

Query: 620  MTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMI 799
             T YPNGHG+  E LR+++    +GR+P+ +  V+WF  + + Q  +   +D   I  + 
Sbjct: 179  FTKYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWF--LVHMQDNNHNGKDQESIANLC 236

Query: 800  LEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFI 979
             + +  LS+D  EM++  ++ SL++T LR RAP  ++ G FR+GT+ VAGDA HVMGPF+
Sbjct: 237  RKWADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFL 296

Query: 980  GQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLIST 1159
             QGGS ALEDA+VLARCL+ K+  G  + +   + S     EA+DE+  ERR R++ +S 
Sbjct: 297  AQGGSAALEDAVVLARCLARKV--GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSV 354

Query: 1160 LSYLTGILIVEDPSFLLRFMCIVIILVFFG-NPLNYTKYDCGVL 1288
             +YLTG   ++  S +LR M I ++L+ FG + + +T+YDCG L
Sbjct: 355  QTYLTG-RSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 397


>ref|XP_002868182.1| hypothetical protein ARALYDRAFT_355191 [Arabidopsis lyrata subsp.
            lyrata] gi|297314018|gb|EFH44441.1| hypothetical protein
            ARALYDRAFT_355191 [Arabidopsis lyrata subsp. lyrata]
          Length = 408

 Score =  323 bits (828), Expect = 1e-85
 Identities = 175/409 (42%), Positives = 263/409 (64%), Gaps = 6/409 (1%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT+LALHRKGI+S+VLER++ +R+ GAA  I  NGW  L QLG+
Sbjct: 1    MEELDIVIVGGGIAGLATSLALHRKGIKSIVLERAESVRSEGAAFGIQTNGWLALQQLGV 60

Query: 260  DSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRF 439
               L+  ++PI +  DV+ +KG+ +       S GE R + R+DLV  LA++LPL T+R 
Sbjct: 61   ADKLRLNSLPIHQIRDVLIEKGIKQRESVGPASYGEVRGVLRNDLVRALAHALPLGTLRL 120

Query: 440  GTQVISVDVEKSSSYALLQLHCGSSIKAK-----VLIGCDGANSVIANYVGLKPTCLFSS 604
            G  ++SV +++++S+ ++ +  G +IKAK     VLIGCDG+NSV++ ++GL PT    S
Sbjct: 121  GCHILSVKLDETTSFPIVHVKNGEAIKAKARLATVLIGCDGSNSVVSRFLGLNPTKDLGS 180

Query: 605  FSVRGMTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAF 784
             +VRG T YP+ HG+  EF+R++ D    GRIPI  K V+WF  +    ++S    + A 
Sbjct: 181  RAVRGFTNYPDDHGFRQEFIRIKMDNVVSGRIPITHKLVFWFVVLLNCPQDSSFLRNQAD 240

Query: 785  IKQMILEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHV 964
            I ++ L      S++  EM++  D+ SL + +LR RAPW+++SG FR GT+ VAGD+ H+
Sbjct: 241  IARLTLASVHEFSEEWKEMVKNCDMDSLYINRLRYRAPWDVLSGKFRCGTVTVAGDSMHL 300

Query: 965  MGPFIGQGGSVALEDAIVLARCLSNKIGNG-DPNSERRMNLSAHKTMEALDEFFKERRRR 1141
            MGPFIGQG S ALED +VLARCL  K+  G D  +    + S  +  EA+DE+ +ERR R
Sbjct: 301  MGPFIGQGCSAALEDGVVLARCLWRKLSLGQDGMNNVSYSSSRMQIEEAIDEYIRERRGR 360

Query: 1142 VVLISTLSYLTGILIVEDPSFLLRFMCIVIILVFFGNPLNYTKYDCGVL 1288
            +V +ST +YLTG LI +  S + +F+ +V++++ F + + +T+YDCG L
Sbjct: 361  LVGLSTQTYLTGNLI-KASSPVTKFLLVVLLMILFRDQIGHTRYDCGRL 408


>dbj|BAD43328.1| unnamed protein product [Arabidopsis thaliana]
          Length = 397

 Score =  323 bits (827), Expect = 1e-85
 Identities = 172/404 (42%), Positives = 265/404 (65%), Gaps = 1/404 (0%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT++ALHR+GI+SVVLER++++R+ GA I    NGWR L QLG+
Sbjct: 1    MEEIGIVIVGGGIAGLATSIALHREGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60

Query: 260  DSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRF 439
               L+  +  I +A  ++ + G  R    +++ E  ARCI+R+DLVE L+++LP  TIRF
Sbjct: 61   GDRLRLNSSLIHKARTMLIENGKKREFVSNIVDE--ARCIKRNDLVEALSDALPKGTIRF 118

Query: 440  GTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRG 619
            G+ ++S++ +K++ + ++ L  G+SIKAKVLIGCDGANS++++Y+ L P   F+  +VRG
Sbjct: 119  GSHIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRG 178

Query: 620  MTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMI 799
             T YPNGHG+  E LR+++    +GR+P+ +  V+WF  + + Q  +   +D   I  + 
Sbjct: 179  FTKYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWF--LVHMQDNNHNGKDQESIANLC 236

Query: 800  LEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFI 979
             + +  LS+D  EM++  ++ SL++T LR RAP  ++ G FR+GT+ VAGDA HVMGPF+
Sbjct: 237  RKWADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFL 296

Query: 980  GQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLIST 1159
             QGGS ALEDA+VLARCL+ K+  G  + +   + S     EA+DE+  ERR R++ +S 
Sbjct: 297  AQGGSAALEDAVVLARCLARKV--GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSV 354

Query: 1160 LSYLTGILIVEDPSFLLRFMCIVIILVFFG-NPLNYTKYDCGVL 1288
             +YLTG   ++  S +LR M I ++L+ FG + + +T+YDCG L
Sbjct: 355  QTYLTG-RSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 397


>dbj|BAD44358.1| unnamed protein product [Arabidopsis thaliana]
          Length = 397

 Score =  322 bits (824), Expect = 3e-85
 Identities = 172/404 (42%), Positives = 264/404 (65%), Gaps = 1/404 (0%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT++ALHRKGI+SVVLER++++R+ GA I    NGWR L QLG+
Sbjct: 1    MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60

Query: 260  DSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRF 439
               L+  +  I +A  ++ + G  R    +++ E  ARCI+R+DLV  L+++LP  TIRF
Sbjct: 61   GDRLRLNSSLIHKARTMLIENGKKREFVSNIVDE--ARCIKRNDLVGALSDALPKGTIRF 118

Query: 440  GTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRG 619
            G+ ++S++ +K++ + ++ L  G+SIKAKVLIGCDGANS++++Y+ L P   F+  +VRG
Sbjct: 119  GSHIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRG 178

Query: 620  MTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMI 799
             T YPNGHG+  E LR+++    +GR+P+ +  V+WF  + + Q  +   +D   I  + 
Sbjct: 179  FTKYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWF--LVHMQDNNHNGKDQESIANLC 236

Query: 800  LEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFI 979
             + +  LS+D  EM++  ++ SL++T LR RAP  ++ G FR+GT+ VAGDA HVMGPF+
Sbjct: 237  RKWADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFL 296

Query: 980  GQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLIST 1159
             QGGS ALEDA+VLARCL+ K+  G  + +   + S     EA+DE+  ERR R++ +S 
Sbjct: 297  AQGGSAALEDAVVLARCLARKV--GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSV 354

Query: 1160 LSYLTGILIVEDPSFLLRFMCIVIILVFFG-NPLNYTKYDCGVL 1288
             +YLTG   ++  S +LR M I ++L+ FG + + +T+YDCG L
Sbjct: 355  QTYLTG-RSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 397


>dbj|BAD44627.1| unnamed protein product [Arabidopsis thaliana]
            gi|62318646|dbj|BAD95117.1| hypothetical protein
            [Arabidopsis thaliana]
          Length = 397

 Score =  321 bits (823), Expect = 4e-85
 Identities = 172/404 (42%), Positives = 264/404 (65%), Gaps = 1/404 (0%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT++ALHRKGI+SVVLER++++R+ GA I    NGWR L QLG+
Sbjct: 1    MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60

Query: 260  DSTLKSTAIPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRF 439
               L+  +  I +A  ++ +    R    +++ E  ARCI+R+DLVE L+++LP  TIRF
Sbjct: 61   GDRLRLNSSLIHKARTMLIENEKKREFVSNIVDE--ARCIKRNDLVEALSDALPKGTIRF 118

Query: 440  GTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRG 619
            G+ ++S++ +K++ + ++ L  G+SIKAKVLIGCDGANS++++Y+ L P   F+  +VRG
Sbjct: 119  GSHIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRG 178

Query: 620  MTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMI 799
             T YPNGHG+  E LR+++    +GR+P+ +  V+WF  + + Q  +   +D   I  + 
Sbjct: 179  FTKYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWF--LVHMQDNNHNGKDQESIANLC 236

Query: 800  LEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFI 979
             + +  LS+D  EM++  ++ SL++T LR RAP  ++ G FR+GT+ VAGDA HVMGPF+
Sbjct: 237  RKWADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFL 296

Query: 980  GQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLIST 1159
             QGGS ALEDA+VLARCL+ K+  G  + +   + S     EA+DE+  ERR R++ +S 
Sbjct: 297  AQGGSAALEDAVVLARCLARKV--GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSV 354

Query: 1160 LSYLTGILIVEDPSFLLRFMCIVIILVFFG-NPLNYTKYDCGVL 1288
             +YLTG   ++  S +LR M I ++L+ FG + + +T+YDCG L
Sbjct: 355  QTYLTG-RSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 397


>gb|EOY03035.1| Monooxygenase, putative isoform 1 [Theobroma cacao]
          Length = 413

 Score =  318 bits (816), Expect = 3e-84
 Identities = 169/383 (44%), Positives = 243/383 (63%), Gaps = 4/383 (1%)
 Frame = +2

Query: 152  KGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGLDSTLKSTAIPILRAIDVVADKGVH 331
            KGIE++VLERS+ LRA+GAAI + PNGWR L QLG+ S L+ TA+ I     +    G  
Sbjct: 41   KGIETIVLERSENLRATGAAIIVQPNGWRALDQLGIASKLRQTAVSIQSGRYITVKDGKQ 100

Query: 332  RVVRESLISEGEARCIRRSDLVETLANSLPLKTIRFGTQVISVDVEKSSSYALLQLHCGS 511
            + +    +  GE RC++R+DL+  LA +LP  T+R G +V+S+ ++ S+SY +LQL  GS
Sbjct: 101  KDLPVGDV--GELRCLKRTDLLNALAENLPADTVRLGCKVVSITLDPSTSYPILQLQDGS 158

Query: 512  SIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRGMTTYPNGHGYAPEFLRVRKDENFL 691
             + AKV+IGCDG NS IAN +GL  T LFS+  +RG T Y  GH +   FL   KD+  L
Sbjct: 159  VLMAKVVIGCDGVNSTIANILGLNSTRLFSTSVIRGFTNYETGHEFGSAFLVFSKDDVQL 218

Query: 692  GRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMILEKSGGLSKDSVEMLEKSDLSSLS 871
            G +P+ EK VYWF + + T ++SK  +    IK+  +E   G     +EM++ SDL SL 
Sbjct: 219  GLLPVTEKLVYWFVTRKQTSQDSKVSKSQTLIKESTVEAMKGFPIHIMEMVKDSDLDSLH 278

Query: 872  MTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFIGQGGSVALEDAIVLARCLSNKIGN 1051
            +T LR  APW+L+  N R+GT+ VAGDA H M PF+ QGGS +LEDA+VLARCLS     
Sbjct: 279  LTDLRFLAPWDLLGTNLRRGTVTVAGDAMHAMAPFLAQGGSASLEDAVVLARCLSQ---- 334

Query: 1052 GDPNSERRMNLSAHKTM----EALDEFFKERRRRVVLISTLSYLTGILIVEDPSFLLRFM 1219
               N   R++    KTM     ALD++ KER+ RV  +S  ++L G ++ +  + L++ +
Sbjct: 335  ---NQTMRVDEKQAKTMMDMEAALDQYVKERKMRVFWLSLETFLIGTML-DTSTLLVKCL 390

Query: 1220 CIVIILVFFGNPLNYTKYDCGVL 1288
            CI+ ++V F + + +T+YDCG L
Sbjct: 391  CIISLMVLFRDKIAHTRYDCGRL 413


>ref|NP_001190738.1| monooxygenase 1 [Arabidopsis thaliana] gi|332658248|gb|AEE83648.1|
            monooxygenase 1 [Arabidopsis thaliana]
          Length = 409

 Score =  318 bits (814), Expect = 4e-84
 Identities = 173/416 (41%), Positives = 266/416 (63%), Gaps = 13/416 (3%)
 Frame = +2

Query: 80   MEEXXXXXXXXXXXXLATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGL 259
            MEE            LAT++ALHRKGI+SVVLER++++R+ GA I    NGWR L QLG+
Sbjct: 1    MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60

Query: 260  DSTLKSTA------------IPILRAIDVVADKGVHRVVRESLISEGEARCIRRSDLVET 403
               L+  +            + + RA  ++ + G  R    +++ E  ARCI+R+DLVE 
Sbjct: 61   GDRLRLNSSLIHKILIYGPFLDMNRARTMLIENGKKREFVSNIVDE--ARCIKRNDLVEA 118

Query: 404  LANSLPLKTIRFGTQVISVDVEKSSSYALLQLHCGSSIKAKVLIGCDGANSVIANYVGLK 583
            L+++LP  TIRFG+ ++S++ +K++ + ++ L  G+SIKAKVLIGCDGANS++++Y+ L 
Sbjct: 119  LSDALPKGTIRFGSHIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLN 178

Query: 584  PTCLFSSFSVRGMTTYPNGHGYAPEFLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESK 763
            P   F+  +VRG T YPNGHG+  E LR+++    +GR+P+ +  V+WF  + + Q  + 
Sbjct: 179  PKKAFACRAVRGFTKYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWF--LVHMQDNNH 236

Query: 764  KPEDPAFIKQMILEKSGGLSKDSVEMLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMV 943
              +D   I  +  + +  LS+D  EM++  ++ SL++T LR RAP  ++ G FR+GT+ V
Sbjct: 237  NGKDQESIANLCRKWADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTV 296

Query: 944  AGDAWHVMGPFIGQGGSVALEDAIVLARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFF 1123
            AGDA HVMGPF+ QGGS ALEDA+VLARCL+ K+  G  + +   + S     EA+DE+ 
Sbjct: 297  AGDAMHVMGPFLAQGGSAALEDAVVLARCLARKV--GPDHGDLLKDCSMKNIEEAIDEYV 354

Query: 1124 KERRRRVVLISTLSYLTGILIVEDPSFLLRFMCIVIILVFFG-NPLNYTKYDCGVL 1288
             ERR R++ +S  +YLTG   ++  S +LR M I ++L+ FG + + +T+YDCG L
Sbjct: 355  DERRMRLLGLSVQTYLTG-RSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 409


>gb|EOY17302.1| Monooxygenase, putative [Theobroma cacao]
          Length = 414

 Score =  317 bits (812), Expect = 7e-84
 Identities = 173/390 (44%), Positives = 251/390 (64%), Gaps = 2/390 (0%)
 Frame = +2

Query: 125  LATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGLDSTLKSTAIPILRAI 304
            LATALALHRKGI+SVVLE+S+ LR +G  I + PNGWR L QLG+ S L+ TA+ I    
Sbjct: 33   LATALALHRKGIKSVVLEKSETLRTTGVGIIMQPNGWRALDQLGVASKLRETAMDISSRQ 92

Query: 305  DVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRFGTQVISVDVEKSSSY 484
             ++ D G      E  + +GE RC++R DLVE LA  LP+ T+ FG +V+S+ ++  +SY
Sbjct: 93   LIMVDDGKRL---ELPLGKGELRCLKRLDLVEVLAEPLPVNTVHFGCKVLSIVLDPVTSY 149

Query: 485  ALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPTCLFSSFSVRGMTTYPNGHGYAPEFL 664
             +LQLH GS I+AK++IGCDG NSVI+ ++G+ P  LFS  + RG T Y  GH ++  F 
Sbjct: 150  PVLQLHDGSIIRAKIVIGCDGVNSVISKFLGMNPPKLFSRCATRGFTWYERGHDFSGVF- 208

Query: 665  RVRKDENF-LGRIPIDEKTVYWFFSIRYTQKESK-KPEDPAFIKQMILEKSGGLSKDSVE 838
            R+ K +N  LG++P+ +K VYWF +   T ++S    +DPA+ K+  +E   G   ++VE
Sbjct: 209  RIHKTDNVQLGQLPVTDKLVYWFLTRSLTPQDSNASKKDPAYTKEASMEAMKGFPHETVE 268

Query: 839  MLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFIGQGGSVALEDAIV 1018
            M++ S+  SL +T+LR   PW L+   FR GT++VAGDA H M PFI QGG  +LEDA+V
Sbjct: 269  MIKNSEDKSLYLTELRYLPPWELLRAKFRLGTVVVAGDAMHAMCPFISQGGGASLEDAVV 328

Query: 1019 LARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLISTLSYLTGILIVEDP 1198
            LARCLS KI      S +   +   K   ALD + +ERR R+  +S  +YL G + +++ 
Sbjct: 329  LARCLSEKIKIKMQTSRQEQKMMLEK---ALDLYVRERRMRLFWLSLQTYLIG-MTLDNT 384

Query: 1199 SFLLRFMCIVIILVFFGNPLNYTKYDCGVL 1288
            S + + + IV +++ F +  ++T YDCG L
Sbjct: 385  SKVKKVLGIVSLILIFRDQRSHTDYDCGRL 414


>ref|XP_003533524.1| PREDICTED: zeaxanthin epoxidase, chloroplastic-like [Glycine max]
          Length = 399

 Score =  310 bits (795), Expect = 7e-82
 Identities = 170/390 (43%), Positives = 248/390 (63%), Gaps = 2/390 (0%)
 Frame = +2

Query: 125  LATALALHRKGIESVVLERSDRLRASGAAITIFPNGWRVLGQLGLDSTLKSTAIPILRAI 304
            LATALALHRK I+S+VLERS+ LRA+GAAI +  NGWR L QLG+ STL+ TAI I    
Sbjct: 19   LATALALHRKRIKSLVLERSENLRATGAAIIVHANGWRALDQLGIGSTLRQTAIQIQGGR 78

Query: 305  DVVADKGVHRVVRESLISEGEARCIRRSDLVETLANSLPLKTIRFGTQVISVDVEKSSSY 484
             +  ++     +      + E RC++R+DL++ +A++LP  TIR   QV+S++++  +  
Sbjct: 79   FISLNEA--EPMEFPFGVDQELRCLKRTDLMKAMADNLPAGTIRTNCQVLSIELDPLTRS 136

Query: 485  ALLQLHCGSSIKAKVLIGCDGANSVIANYVGLKPT--CLFSSFSVRGMTTYPNGHGYAPE 658
              L L  GS ++AKV+IGCDG NS IAN  GL  T   LFS+   RG T +PNGH +  E
Sbjct: 137  PQLLLSNGSILQAKVVIGCDGVNSAIANMFGLHRTKLLLFSTCVARGFTNFPNGHEFGSE 196

Query: 659  FLRVRKDENFLGRIPIDEKTVYWFFSIRYTQKESKKPEDPAFIKQMILEKSGGLSKDSVE 838
            F  + +D+  LGRIP+ +K VYWF +   T K+S   +DP  I+Q ++E   G  + +VE
Sbjct: 197  FAMMSRDQVQLGRIPVSDKLVYWFVTRPRTSKDSTIWKDPVLIRQSLIESMKGFPEGAVE 256

Query: 839  MLEKSDLSSLSMTQLRSRAPWNLVSGNFRKGTIMVAGDAWHVMGPFIGQGGSVALEDAIV 1018
            ++    LS L +T+L+ RAPW+LV   FRKGT+ +AGDA H  GPFI QGGS ++EDA+V
Sbjct: 257  IIRNCKLSFLHLTELKYRAPWDLVFNKFRKGTVTIAGDAMHATGPFIAQGGSASIEDALV 316

Query: 1019 LARCLSNKIGNGDPNSERRMNLSAHKTMEALDEFFKERRRRVVLISTLSYLTGILIVEDP 1198
            LARCL+ K       +E    ++  +  EA D++ KER+ R   +S  S+L G  + +  
Sbjct: 317  LARCLAQK------KAEETAEINIAEAEEAFDQYVKERKMRNFWLSLHSFLVGKKL-DTK 369

Query: 1199 SFLLRFMCIVIILVFFGNPLNYTKYDCGVL 1288
            S ++RF+ + I+ + F +P  +++Y CGVL
Sbjct: 370  SSIVRFIILAIMGILFRDPDWHSRYHCGVL 399

