BLASTX nr result
ID: Achyranthes23_contig00012837
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00012837
         (1776 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002515156.1| monoxygenase, putative [Ricinus communis] gi...   373   e-100
gb|EMJ01313.1| hypothetical protein PRUPE_ppa018848mg [Prunus pe...   363   2e-97
gb|EXC30730.1| 3-hydroxybenzoate 6-hydroxylase 1 [Morus notabilis]    350   1e-93
ref|XP_006414451.1| hypothetical protein EUTSA_v10025403mg [Eutr...   347   7e-93
ref|XP_006414450.1| hypothetical protein EUTSA_v10025376mg [Eutr...   345   5e-92
ref|XP_004237255.1| PREDICTED: FAD-dependent urate hydroxylase-l...   341   5e-91
ref|XP_006283848.1| hypothetical protein CARUB_v10004954mg [Caps...   338   3e-90
ref|XP_006283836.1| hypothetical protein CARUB_v10004937mg, part...   338   3e-90
ref|XP_002868183.1| monooxygenase [Arabidopsis lyrata subsp. lyr...   328   5e-87
ref|XP_002868182.1| hypothetical protein ARALYDRAFT_355191 [Arab...   325   4e-86
emb|CAA07574.1| monooxygenase [Arabidopsis thaliana] gi|51968448...   323   2e-85
ref|NP_193311.6| monooxygenase 1 [Arabidopsis thaliana] gi|33265...   323   2e-85
gb|EOY17302.1| Monooxygenase, putative [Theobroma cacao]              322   3e-85
dbj|BAD43328.1| unnamed protein product [Arabidopsis thaliana]        321   6e-85
dbj|BAD43227.1| unnamed protein product [Arabidopsis thaliana]        321   7e-85
dbj|BAD44358.1| unnamed protein product [Arabidopsis thaliana]        320   1e-84
gb|ESW03318.1| hypothetical protein PHAVU_011G004100g [Phaseolus...   320   2e-84
dbj|BAD44627.1| unnamed protein product [Arabidopsis thaliana] g...
320 2e-84 gb|EOY03035.1| Monooxygenase, putative isoform 1 [Theobroma cacao] 318 6e-84 ref|NP_001190738.1| monooxygenase 1 [Arabidopsis thaliana] gi|33... 317 1e-83 >ref|XP_002515156.1| monoxygenase, putative [Ricinus communis] gi|223545636|gb|EEF47140.1| monoxygenase, putative [Ricinus communis] Length = 397 Score = 373 bits (957), Expect = e-100 Identities = 196/402 (48%), Positives = 259/402 (64%) Frame = +2 Query: 251 MGAMEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQ 430 M A EE LATALALHRKGI+SVVLERS+TLRA GA I V NGWRAL + Sbjct: 1 MDANEEVELVIVGGGICGLATALALHRKGIRSVVLERSETLRAAGAGIAVLTNGWRALDE 60 Query: 431 LGLDSTLRPTATQLQRVVDDLADKGVVRETPMSEGEARCLKRSDLVEALANALPLETIRF 610 LG+ S +RPTA LQR L V+ E GEARC+KRSDL+EALA+ LPL TIRF Sbjct: 61 LGVGSKIRPTALPLQRYHPILIAPIVMIEI----GEARCVKRSDLIEALADDLPLGTIRF 116 Query: 611 GSQIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRG 790 G I+SVN+D S ++QL +GS IKAK LIGCDGA+S++ D++ LKP ++FS CAVRG Sbjct: 117 GCDILSVNLDPEISFPILQLSNGSSIKAKALIGCDGANSVVSDFLELKPKKLFSLCAVRG 176 Query: 791 LTIYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMT 970 T YPNGH A E R+ K L GR+P+D N V+WF++ + D +PKDP +RQ + Sbjct: 177 FTHYPNGHGLAPELIRMVKGNVLCGRVPVDDNLVFWFIIQNFFPKDTNIPKDPELMRQFS 236 Query: 971 QDAVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFL 1150 +++ F + + M++N ++TSLSLT LRYR PW++ FR+ TVAGDA H+MGPF+ Sbjct: 237 LESIKDFPTERLEMVKNCEVTSLSLTHLRYRTPWEIYLGKFRRGTATVAGDAMHIMGPFI 296 Query: 1151 GQGGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXX 1330 GQGGS A+EDA+VLARCLS ++ G ++ ++ QK EA D+Y+ E Sbjct: 297 GQGGSAAIEDAVVLARCLSAKMQEVGQLKSSSHIMSQKIGEAFDDYVKE-RRMRLVWLST 355 Query: 1331 XXXXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +++ S L+K + + V F +P+ H RYDCG L Sbjct: 356 QTYLYGSLLQNSSRLVKVSIAVAMIVLFGNPIYHTRYDCGPL 397 >gb|EMJ01313.1| hypothetical protein PRUPE_ppa018848mg [Prunus persica] Length = 387 Score = 363 bits (931), Expect = 2e-97 Identities = 191/385 (49%), Positives = 247/385 (64%), Gaps = 3/385 (0%) Frame = +2 Query: 305 
LATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGLDSTLRPTATQLQRVV 484 LATALALHRKG++SVVLERS++LRATGA I + NGWRAL +LG+ S LR TA LQ Sbjct: 19 LATALALHRKGLRSVVLERSESLRATGAGITIRTNGWRALDELGVASKLRQTAMPLQ--- 75 Query: 485 DDLADKGVVRETPMSEGEARCLKRSDLVEALANALPLETIRFGSQIVSVNVDKSSSPAVI 664 GE RCLKR DL+ ALA +LP TIR G Q +SV +D S+S + Sbjct: 76 --------------GGGETRCLKRMDLITALAESLPRGTIRLGCQALSVRLDSSTSSPSL 121 Query: 665 QLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLTIYPNGHAYASEFYRIK 844 L +GS IKAKVLIGCDG +S++ D++ LKP+++FS VRG T+YP+GH + ++F ++K Sbjct: 122 HLQNGSSIKAKVLIGCDGTNSVVADFLDLKPSKLFSLSEVRGFTMYPSGHNFGNQFVQVK 181 Query: 845 KDKHLVGRLPIDKNTVYWFVV--LPWNQGDAEMPKDPASIRQMTQDAVAGFSEDFVGMIE 1018 DK VGR+PI VYWFV + + +G E+PKDP IRQ+T +A+ F + + MI Sbjct: 182 GDKCTVGRIPIHNKLVYWFVTQKVMYGRGGLEVPKDPELIRQLTLEAIKDFPSEMIDMIS 241 Query: 1019 NSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQGGSLALEDAIVLAR 1198 SD SLS TRLRYR+PWD+L NFRK +VTVAGDA H MGPFLGQGGS +ED+IV+AR Sbjct: 242 KSDTKSLSNTRLRYRSPWDILVRNFRKGSVTVAGDAMHTMGPFLGQGGSAGIEDSIVIAR 301 Query: 1199 CLSKRIC-NAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXXXXXXXXVEDPSLL 1375 CL++ + N N+ K EALD+Y+ E +D L+ Sbjct: 302 CLAQELAENYDKKSRARNIMMMKVEEALDKYVKERRMRLVLLSTQTYLAGLLQ-QDSGLI 360 Query: 1376 LKFMCIILLTVFFRDPLNHIRYDCG 1450 +KF+CI L+T F D H RYDCG Sbjct: 361 VKFVCIFLMTALFSDMTRHTRYDCG 385 >gb|EXC30730.1| 3-hydroxybenzoate 6-hydroxylase 1 [Morus notabilis] Length = 404 Score = 350 bits (898), Expect = 1e-93 Identities = 195/408 (47%), Positives = 253/408 (62%), Gaps = 6/408 (1%) Frame = +2 Query: 251 MGAMEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQ 430 M A EE LATALALHRKGIKSVVLERS+TLRA G+AI + NGWRAL Q Sbjct: 1 MEAAEEIDIVIVGAGICGLATALALHRKGIKSVVLERSETLRAFGSAIAILTNGWRALDQ 60 Query: 431 LGLDSTLRPTATQLQRVVDDLADKGVVRETPMSEGEARCLKRSDLVEALANALPLETIRF 610 LG+ LR TA LQ V D D R P+S+GEARC+KRSDL+ LA LP TIRF Sbjct: 61 LGIGPKLRQTALPLQGVRDIWLDGNKQRRGPLSKGEARCVKRSDLINMLAQDLPHGTIRF 120 Query: 611 GSQIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRG 790 G I+ V +D ++ ++QL 
DG IKAK+LIGCDGA S++ +Y+ +KP + F +RG Sbjct: 121 GCHILFVELDPLTNFPILQLRDGRAIKAKILIGCDGASSVVAEYLKVKPKKSFPAFGIRG 180 Query: 791 LTIYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQM- 967 LT YP+ H + EF R + + GR I++N V+WF++LP D+E+ KDP I+QM Sbjct: 181 LTYYPSPHGFDPEFVRTHGNNVVCGRSTINQNLVFWFLLLPGYLKDSEIFKDPELIKQMA 240 Query: 968 ---TQDAVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVM 1138 T DA F ++ + MI++ DITSLSLT L YR WD+L FRK VT+AGD+ HVM Sbjct: 241 LEKTNDA---FPKETIEMIKDCDITSLSLTHLWYRPAWDILLGTFRKGMVTLAGDSMHVM 297 Query: 1139 GPFLGQGGSLALEDAIVLARCLSKRICNAGVN--ENRMNLSQQKAMEALDEYLMEXXXXX 1312 GPFLGQGGS A+EDA+VLARCL+ +I +N E L ++K EA+D Y+ E Sbjct: 298 GPFLGQGGSAAMEDAVVLARCLANKIHGESINGFEGNNGLFRKKMEEAMDLYVKE-RRMR 356 Query: 1313 XXXXXXXXXXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 S++ K + + L+ V F+DP+ H RYDCG L Sbjct: 357 LVRLSAQSYVTGLLFSSASMIGKILLLALIIVLFQDPIRHTRYDCGHL 404 >ref|XP_006414451.1| hypothetical protein EUTSA_v10025403mg [Eutrema salsugineum] gi|557115621|gb|ESQ55904.1| hypothetical protein EUTSA_v10025403mg [Eutrema salsugineum] Length = 394 Score = 347 bits (891), Expect = 7e-93 Identities = 193/401 (48%), Positives = 258/401 (64%), Gaps = 2/401 (0%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT+LALHRKGIKSVVLER++ +R+ GA IG NGWRAL QLG+ Sbjct: 1 MEEIGIVIVGGGIAGLATSLALHRKGIKSVVLERAEKVRSEGAGIGTLTNGWRALDQLGV 60 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRETPMS-EGEARCLKRSDLVEALANALPLETIRFGS 616 LR T+ +++ L + G RE ++ E EARC++R+DLVEALA+ALP ETIRFGS Sbjct: 61 SHRLRLTSNLIRKARTMLIENGKKREFVLNIEDEARCIRRNDLVEALADALPEETIRFGS 120 Query: 617 QIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLT 796 QIVS+ D+++S V+ L +G+ IKAKVLIGCDGA+S++ DY+ L P + F+ AVRG T Sbjct: 121 QIVSIEEDETTSFPVVHLTNGNTIKAKVLIGCDGANSVVSDYLRLSPKKAFACRAVRGFT 180 Query: 797 IYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMTQD 976 YPNGH + E R+K LVGRLP+ N V+WFVV Q + D SI +T Sbjct: 181 NYPNGHGFPQELLRMKTGNVLVGRLPLTDNLVFWFVV--HMQDNHHNGTDQESIANVTLK 238 
Query: 977 AVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQ 1156 V SED+ M++ D+ SL++T LRYR+PW+++F FR+ VTVAGDA HVMGPFLGQ Sbjct: 239 WVDKLSEDWQEMVQKCDVESLTITHLRYRSPWEIMFRKFRRGTVTVAGDAMHVMGPFLGQ 298 Query: 1157 GGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXX 1336 GGS ALEDA+VLARCL+K++ ++ + S + EA+DEY +E Sbjct: 299 GGSAALEDAVVLARCLAKKV----GPDHGEDCSMKNIEEAIDEY-VEKRRMRLVGLSTQT 353 Query: 1337 XXXXXXVEDPSLLLKFMCIILLTVFF-RDPLNHIRYDCGEL 1456 ++ S +++ M I+LL V F RD + H +YDCG L Sbjct: 354 YLTGRSLQTQSNVVRLMFIVLLVVLFGRDQIRHTKYDCGRL 394 >ref|XP_006414450.1| hypothetical protein EUTSA_v10025376mg [Eutrema salsugineum] gi|557115620|gb|ESQ55903.1| hypothetical protein EUTSA_v10025376mg [Eutrema salsugineum] Length = 398 Score = 345 bits (884), Expect = 5e-92 Identities = 184/402 (45%), Positives = 255/402 (63%), Gaps = 3/402 (0%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT+LALHRKGIKS+VLERS+T+R+ GAA G+ NGW AL QLGL Sbjct: 1 MEELDIVILGGGIAGLATSLALHRKGIKSIVLERSETVRSEGAAFGIQTNGWLALQQLGL 60 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRET---PMSEGEARCLKRSDLVEALANALPLETIRF 610 LRP + + ++ D L ++G+ R P S GE R + R+DLV ALA+ LPL T+R Sbjct: 61 ADKLRPNSLPIHQIRDVLIEEGIKRRESVGPASYGEVRGVIRNDLVRALAHELPLGTLRL 120 Query: 611 GSQIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRG 790 G QIVSV +D++ S ++ + +G IK+KVLIGCDG++S++ +++GLKPT+ S AVRG Sbjct: 121 GCQIVSVKLDETLSFPIVHVKNGQDIKSKVLIGCDGSNSVVSEFLGLKPTKSLSSRAVRG 180 Query: 791 LTIYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMT 970 T YP+GH + EF RIK D + GRLPI V+WFVVL D+ ++ I + T Sbjct: 181 FTNYPDGHGFRQEFIRIKMDNVVSGRLPITPKLVFWFVVLLKCPQDSNFLRNQEDIARFT 240 Query: 971 QDAVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFL 1150 +V FS+++ M++N DI SL + RLRYRAPWD++ FR+ VTVAGD+ H+MGPFL Sbjct: 241 LSSVNDFSQEWKEMVKNCDINSLYINRLRYRAPWDVMSGKFRRGTVTVAGDSMHLMGPFL 300 Query: 1151 GQGGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXX 1330 GQG S ALED +VLARCL +++ G+N S+++ EA+D+Y+ E 
Sbjct: 301 GQGCSAALEDGVVLARCLWRKLGQDGMNN---VFSRKRIEEAIDDYVRE-RRGRLVRLST 356 Query: 1331 XXXXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +E S + K + ++LL + FRD + H RYDCG L Sbjct: 357 QTYLTSRLIEASSPVTKLLVVVLLMIMFRDQIGHTRYDCGRL 398 >ref|XP_004237255.1| PREDICTED: FAD-dependent urate hydroxylase-like [Solanum lycopersicum] Length = 394 Score = 341 bits (875), Expect = 5e-91 Identities = 193/386 (50%), Positives = 245/386 (63%), Gaps = 2/386 (0%) Frame = +2 Query: 305 LATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGLDSTLRPTATQLQRVV 484 LATALALHRKG+KSVVLE+S++LR+ GAAIGV PNGW+AL QLG+ LR TA LQ + Sbjct: 22 LATALALHRKGVKSVVLEKSESLRSEGAAIGVLPNGWKALDQLGVAPYLRTTALPLQGMR 81 Query: 485 DDLADKGVVRETPMSE-GEARCLKRSDLVEALANALPLETIRFGSQIVSVNVDKSSSPAV 661 DKG + TP GE RCLKRSD+VE A+ALP TIRFG IVSV +D +S Sbjct: 82 ITWMDKGNEKFTPYKNIGEVRCLKRSDIVETFADALPPRTIRFGCDIVSVEMDPITSLPS 141 Query: 662 IQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLTIYPNGHAYASEFYRI 841 I L +G+ I AKVLIGCDG+ SI+ ++GLKP + F CA+RGLT YPNGH++ EF R+ Sbjct: 142 ILLSNGNRIGAKVLIGCDGSRSIVASFLGLKPAKTFRTCAIRGLTSYPNGHSFPLEFVRL 201 Query: 842 KKDKHLVGRLPIDKNTVYWFVVLPWNQG-DAEMPKDPASIRQMTQDAVAGFSEDFVGMIE 1018 + VGRLPI V+WFV + QG DA+ P+D I+Q +AV G D MI+ Sbjct: 202 IVGQTAVGRLPITDKLVHWFVSV--QQGTDAKFPQDTQVIKQRAMEAVIGHPADVQEMIK 259 Query: 1019 NSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQGGSLALEDAIVLAR 1198 D+ SL + LRYRAPWDL+F NFR+ VTVAGDA HVMGPFLGQGGS +EDA+VL R Sbjct: 260 KCDLDSLWFSHLRYRAPWDLMFGNFREKTVTVAGDAMHVMGPFLGQGGSSGIEDAVVLGR 319 Query: 1199 CLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXXXXXXXXVEDPSLLL 1378 L+K I N S EA+++Y+ E E+ +L Sbjct: 320 NLAKTI----------NGSCFDHEEAVNQYIKE-RKMRVVKLATQSYLTGLLFENRPMLT 368 Query: 1379 KFMCIILLTVFFRDPLNHIRYDCGEL 1456 K + + ++ +FFR+P H +YDCG L Sbjct: 369 KIVIVAVMAIFFRNPSAHTQYDCGLL 394 >ref|XP_006283848.1| hypothetical protein CARUB_v10004954mg [Capsella rubella] gi|482552553|gb|EOA16746.1| hypothetical protein CARUB_v10004954mg [Capsella rubella] Length = 404 Score = 338 bits (868), Expect = 3e-90 Identities = 
186/406 (45%), Positives = 258/406 (63%), Gaps = 7/406 (1%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT+LALHRKGIKSVVLERS+++R+ GAA G+ NGW AL QLG+ Sbjct: 1 MEELDIVIVGGGIAGLATSLALHRKGIKSVVLERSESVRSQGAAFGIQTNGWLALEQLGV 60 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRETPM---SEGEARCLKRSDLVEALANALPLETIRF 610 LR + + ++ D + +KG+ R + S GE R + R+DLV ALA+ALPL T+R Sbjct: 61 ADKLRLNSLPIPQIRDVMFEKGIKRRESVGLASYGEVRGVIRNDLVRALAHALPLGTLRL 120 Query: 611 GSQIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRG 790 G QIVSV +D+++S ++ + +G IKAKVLIGCDG++SI+ ++GL PT+ AVRG Sbjct: 121 GCQIVSVQLDETTSFPIVHVQNGEPIKAKVLIGCDGSNSIVSRFLGLNPTKALGARAVRG 180 Query: 791 LTIYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVL---PWNQGDAEMPKDPASIR 961 T YP+GH + +EF RIK D + GRLPI V+WFVVL P + D+ + K I Sbjct: 181 FTNYPDGHEFPNEFIRIKMDNVVCGRLPITHKLVFWFVVLLNCP-QELDSNLVKKQEDIT 239 Query: 962 QMTQDAVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMG 1141 ++T ++ FSED+ M++N D+ SL ++RLRYRAPWD++ FR+ VTVAGD+ H+MG Sbjct: 240 RLTLTSIGEFSEDWKEMVKNCDMDSLYISRLRYRAPWDVMSGKFRRGTVTVAGDSMHLMG 299 Query: 1142 PFLGQGGSLALEDAIVLARCLSKRICNAGVNEN-RMNLSQQKAMEALDEYLMEXXXXXXX 1318 PFLGQG S ALED +VLARCL +++ VN N + S+ + EA+DEY+ E Sbjct: 300 PFLGQGTSAALEDGVVLARCLWRKLGQNSVNSNVSYSASRTQFEEAIDEYIRE-RRGRLV 358 Query: 1319 XXXXXXXXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +E S + K + ++LL + FRD + H RYDCG L Sbjct: 359 GLSTQTYLTGCLIEASSPVRKILFVVLLMILFRDRIGHTRYDCGRL 404 >ref|XP_006283836.1| hypothetical protein CARUB_v10004937mg, partial [Capsella rubella] gi|482552541|gb|EOA16734.1| hypothetical protein CARUB_v10004937mg, partial [Capsella rubella] Length = 410 Score = 338 bits (868), Expect = 3e-90 Identities = 189/405 (46%), Positives = 257/405 (63%), Gaps = 2/405 (0%) Frame = +2 Query: 248 VMGAMEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALH 427 ++ MEE LAT+LALHRKGIKSVVLER++ +R+ GA IG NGWRAL Sbjct: 10 IISQMEEVGILIVGGGIAGLATSLALHRKGIKSVVLERAEQVRSEGAGIGTLTNGWRALD 69 Query: 428 
QLGLDSTLRPTATQLQRVVDDLADKGVVRETPMS-EGEARCLKRSDLVEALANALPLETI 604 QLG+ LR T+ + + L + G +E ++ EARC+KR+DLVEALA+ALP TI Sbjct: 70 QLGVGHRLRLTSLLIHKARTMLIENGKTQEFVLTIADEARCIKRNDLVEALADALPQGTI 129 Query: 605 RFGSQIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAV 784 RFGSQIVS+N D+++S V+QL +G IKAK+LIGCDGA+S++ DY+ L P + FS AV Sbjct: 130 RFGSQIVSINEDQTTSFPVVQLSNGKTIKAKILIGCDGANSVVSDYLQLGPRKAFSCRAV 189 Query: 785 RGLTIYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQ 964 RG T YPNGH + E RIKK LVGRLP+ +N V+WF+V Q + +D SI Sbjct: 190 RGFTNYPNGHGFPQELLRIKKGNILVGRLPLTENQVFWFLV--HMQDNHYKVEDQESIAN 247 Query: 965 MTQDAVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGP 1144 + V S+++ M++ ++ SLSLT LRYRAP +++ FR+ VTVAGDA HVMGP Sbjct: 248 LCLKWVDEMSQEWKEMVKICNVESLSLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGP 307 Query: 1145 FLGQGGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXX 1324 FLGQGGS ALEDA+VLARCL++++ + + S + E +DEY+ E Sbjct: 308 FLGQGGSAALEDAVVLARCLARKV-GPDQGDLLKDCSMRSIEEGIDEYVKE-RRMRLLGL 365 Query: 1325 XXXXXXXXXXVEDPSLLLKFMCIILLTVFF-RDPLNHIRYDCGEL 1456 ++ PS +++ M I+LL + F RD + H +YDCG L Sbjct: 366 SVQTYLTGRSLQTPSKVVRLMFIVLLVLLFGRDQIRHTKYDCGRL 410 >ref|XP_002868183.1| monooxygenase [Arabidopsis lyrata subsp. lyrata] gi|297314019|gb|EFH44442.1| monooxygenase [Arabidopsis lyrata subsp. 
lyrata] Length = 397 Score = 328 bits (841), Expect = 5e-87 Identities = 186/400 (46%), Positives = 244/400 (61%), Gaps = 1/400 (0%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT+LALHRKGIKSVVLER++ +R+ GA IG NGWRAL QLG+ Sbjct: 1 MEEIGIVIVGGGIAGLATSLALHRKGIKSVVLERAEKVRSEGAGIGTLTNGWRALDQLGV 60 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRETPMS-EGEARCLKRSDLVEALANALPLETIRFGS 616 LR T+ + + L + G +E + EARC+KR+DLVEALA+ALP TIRFGS Sbjct: 61 GDRLRLTSRLIHKARTMLIENGKKQEFVSTLVDEARCIKRNDLVEALADALPEGTIRFGS 120 Query: 617 QIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLT 796 QIVS+ DKS+S V+ L +G+ I+AKVLIGCDGA+SI+ +Y+ L P + F+ AVRG T Sbjct: 121 QIVSIEEDKSTSFPVVHLTNGNTIEAKVLIGCDGANSIVSEYLQLNPKKAFACRAVRGFT 180 Query: 797 IYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMTQD 976 YPNGH + E RIK+ L+GRLP+ N V+WF+V Q + KD SI + Sbjct: 181 NYPNGHGFPQEVLRIKQGNILIGRLPLTDNLVFWFLV--HMQDNNHNGKDQESIANLCLK 238 Query: 977 AVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQ 1156 SED+ M++ D+ SL+LT LRYRAP +++ FR+ VTVAGDA HVMGPFL Q Sbjct: 239 WAEDLSEDWKEMVKICDVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQ 298 Query: 1157 GGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXX 1336 GGS ALEDA+VLARCL++++ + + S + EA+DEY+ E Sbjct: 299 GGSAALEDAVVLARCLARKV-GPDHGDLLKDCSMKNIEEAIDEYVEERRMRLLGLSVQTY 357 Query: 1337 XXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +L +LL +F RD + H RYDCG L Sbjct: 358 LTGRSLQTSSKVLRLMFIALLLLLFGRDQIRHSRYDCGRL 397 >ref|XP_002868182.1| hypothetical protein ARALYDRAFT_355191 [Arabidopsis lyrata subsp. lyrata] gi|297314018|gb|EFH44441.1| hypothetical protein ARALYDRAFT_355191 [Arabidopsis lyrata subsp. 
lyrata] Length = 408 Score = 325 bits (833), Expect = 4e-86 Identities = 178/409 (43%), Positives = 253/409 (61%), Gaps = 10/409 (2%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT+LALHRKGIKS+VLER++++R+ GAA G+ NGW AL QLG+ Sbjct: 1 MEELDIVIVGGGIAGLATSLALHRKGIKSIVLERAESVRSEGAAFGIQTNGWLALQQLGV 60 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRET---PMSEGEARCLKRSDLVEALANALPLETIRF 610 LR + + ++ D L +KG+ + P S GE R + R+DLV ALA+ALPL T+R Sbjct: 61 ADKLRLNSLPIHQIRDVLIEKGIKQRESVGPASYGEVRGVLRNDLVRALAHALPLGTLRL 120 Query: 611 GSQIVSVNVDKSSSPAVIQLHDGSVIKAK-----VLIGCDGAHSIIGDYIGLKPTRIFSK 775 G I+SV +D+++S ++ + +G IKAK VLIGCDG++S++ ++GL PT+ Sbjct: 121 GCHILSVKLDETTSFPIVHVKNGEAIKAKARLATVLIGCDGSNSVVSRFLGLNPTKDLGS 180 Query: 776 CAVRGLTIYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPAS 955 AVRG T YP+ H + EF RIK D + GR+PI V+WFVVL D+ ++ A Sbjct: 181 RAVRGFTNYPDDHGFRQEFIRIKMDNVVSGRIPITHKLVFWFVVLLNCPQDSSFLRNQAD 240 Query: 956 IRQMTQDAVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHV 1135 I ++T +V FSE++ M++N D+ SL + RLRYRAPWD+L FR VTVAGD+ H+ Sbjct: 241 IARLTLASVHEFSEEWKEMVKNCDMDSLYINRLRYRAPWDVLSGKFRCGTVTVAGDSMHL 300 Query: 1136 MGPFLGQGGSLALEDAIVLARCLSKRIC--NAGVNENRMNLSQQKAMEALDEYLMEXXXX 1309 MGPF+GQG S ALED +VLARCL +++ G+N + S+ + EA+DEY+ E Sbjct: 301 MGPFIGQGCSAALEDGVVLARCLWRKLSLGQDGMNNVSYSSSRMQIEEAIDEYIRE-RRG 359 Query: 1310 XXXXXXXXXXXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 ++ S + KF+ ++LL + FRD + H RYDCG L Sbjct: 360 RLVGLSTQTYLTGNLIKASSPVTKFLLVVLLMILFRDQIGHTRYDCGRL 408 >emb|CAA07574.1| monooxygenase [Arabidopsis thaliana] gi|51968448|dbj|BAD42916.1| unnamed protein product [Arabidopsis thaliana] gi|51968540|dbj|BAD42962.1| unnamed protein product [Arabidopsis thaliana] gi|51968730|dbj|BAD43057.1| unnamed protein product [Arabidopsis thaliana] gi|51968814|dbj|BAD43099.1| unnamed protein product [Arabidopsis thaliana] gi|51968850|dbj|BAD43117.1| unnamed protein product [Arabidopsis thaliana] gi|51968966|dbj|BAD43175.1| unnamed protein 
product [Arabidopsis thaliana] gi|51969074|dbj|BAD43229.1| unnamed protein product [Arabidopsis thaliana] gi|51969116|dbj|BAD43250.1| unnamed protein product [Arabidopsis thaliana] gi|51970812|dbj|BAD44098.1| unnamed protein product [Arabidopsis thaliana] gi|51971010|dbj|BAD44197.1| unnamed protein product [Arabidopsis thaliana] gi|51971188|dbj|BAD44286.1| unnamed protein product [Arabidopsis thaliana] gi|51971399|dbj|BAD44364.1| unnamed protein product [Arabidopsis thaliana] gi|51971599|dbj|BAD44464.1| unnamed protein product [Arabidopsis thaliana] gi|51971627|dbj|BAD44478.1| unnamed protein product [Arabidopsis thaliana] gi|51971681|dbj|BAD44505.1| unnamed protein product [Arabidopsis thaliana] gi|51971689|dbj|BAD44509.1| unnamed protein product [Arabidopsis thaliana] Length = 397 Score = 323 bits (827), Expect = 2e-85 Identities = 182/400 (45%), Positives = 243/400 (60%), Gaps = 1/400 (0%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT++ALHRKGIKSVVLER++ +R+ GA IG NGWRAL QLG+ Sbjct: 1 MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRETPMS-EGEARCLKRSDLVEALANALPLETIRFGS 616 LR ++ + + L + G RE + EARC+KR+DLVEAL++ALP TIRFGS Sbjct: 61 GDRLRLNSSLIHKARTMLIENGKKREFVSNIVDEARCIKRNDLVEALSDALPKGTIRFGS 120 Query: 617 QIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLT 796 IVS+ DK++ V+ L +G+ IKAKVLIGCDGA+SI+ DY+ L P + F+ AVRG T Sbjct: 121 HIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFT 180 Query: 797 IYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMTQD 976 YPNGH + E RIK+ L+GRLP+ N V+WF+V Q + KD SI + + Sbjct: 181 KYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWFLV--HMQDNNHNGKDQESIANLCRK 238 Query: 977 AVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQ 1156 SED+ M++ ++ SL+LT LRYRAP +++ FR+ VTVAGDA HVMGPFL Q Sbjct: 239 WADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQ 298 Query: 1157 GGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXX 
1336 GGS ALEDA+VLARCL++++ + + S + EA+DEY+ E Sbjct: 299 GGSAALEDAVVLARCLARKV-GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTY 357 Query: 1337 XXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +L +LL +F RD + H RYDCG L Sbjct: 358 LTGRSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 397 >ref|NP_193311.6| monooxygenase 1 [Arabidopsis thaliana] gi|332658247|gb|AEE83647.1| monooxygenase 1 [Arabidopsis thaliana] Length = 422 Score = 323 bits (827), Expect = 2e-85 Identities = 182/400 (45%), Positives = 243/400 (60%), Gaps = 1/400 (0%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT++ALHRKGIKSVVLER++ +R+ GA IG NGWRAL QLG+ Sbjct: 26 MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 85 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRETPMS-EGEARCLKRSDLVEALANALPLETIRFGS 616 LR ++ + + L + G RE + EARC+KR+DLVEAL++ALP TIRFGS Sbjct: 86 GDRLRLNSSLIHKARTMLIENGKKREFVSNIVDEARCIKRNDLVEALSDALPKGTIRFGS 145 Query: 617 QIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLT 796 IVS+ DK++ V+ L +G+ IKAKVLIGCDGA+SI+ DY+ L P + F+ AVRG T Sbjct: 146 HIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFT 205 Query: 797 IYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMTQD 976 YPNGH + E RIK+ L+GRLP+ N V+WF+V Q + KD SI + + Sbjct: 206 KYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWFLV--HMQDNNHNGKDQESIANLCRK 263 Query: 977 AVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQ 1156 SED+ M++ ++ SL+LT LRYRAP +++ FR+ VTVAGDA HVMGPFL Q Sbjct: 264 WADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQ 323 Query: 1157 GGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXX 1336 GGS ALEDA+VLARCL++++ + + S + EA+DEY+ E Sbjct: 324 GGSAALEDAVVLARCLARKV-GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTY 382 Query: 1337 XXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +L +LL +F RD + H RYDCG L Sbjct: 383 LTGRSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 422 >gb|EOY17302.1| Monooxygenase, putative [Theobroma cacao] Length = 414 Score = 322 bits (825), Expect = 3e-85 Identities = 177/389 (45%), 
Positives = 241/389 (61%), Gaps = 5/389 (1%) Frame = +2 Query: 305 LATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGLDSTLRPTATQLQRVV 484 LATALALHRKGIKSVVLE+S+TLR TG I + PNGWRAL QLG+ S LR TA + Sbjct: 33 LATALALHRKGIKSVVLEKSETLRTTGVGIIMQPNGWRALDQLGVASKLRETAMDISSRQ 92 Query: 485 DDLADKGVVRETPMSEGEARCLKRSDLVEALANALPLETIRFGSQIVSVNVDKSSSPAVI 664 + D G E P+ +GE RCLKR DLVE LA LP+ T+ FG +++S+ +D +S V+ Sbjct: 93 LIMVDDGKRLELPLGKGELRCLKRLDLVEVLAEPLPVNTVHFGCKVLSIVLDPVTSYPVL 152 Query: 665 QLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLTIYPNGHAYASEFYRIK 844 QLHDGS+I+AK++IGCDG +S+I ++G+ P ++FS+CA RG T Y GH ++ F K Sbjct: 153 QLHDGSIIRAKIVIGCDGVNSVISKFLGMNPPKLFSRCATRGFTWYERGHDFSGVFRIHK 212 Query: 845 KDKHLVGRLPIDKNTVYWFVVLPWNQGDAE-MPKDPASIRQMTQDAVAGFSEDFVGMIEN 1021 D +G+LP+ VYWF+ D+ KDPA ++ + +A+ GF + V MI+N Sbjct: 213 TDNVQLGQLPVTDKLVYWFLTRSLTPQDSNASKKDPAYTKEASMEAMKGFPHETVEMIKN 272 Query: 1022 SDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQGGSLALEDAIVLARC 1201 S+ SL LT LRY PW+LL A FR V VAGDA H M PF+ QGG +LEDA+VLARC Sbjct: 273 SEDKSLYLTELRYLPPWELLRAKFRLGTVVVAGDAMHAMCPFISQGGGASLEDAVVLARC 332 Query: 1202 LSKRICNAGVNENRMNLSQQKAM----EALDEYLMEXXXXXXXXXXXXXXXXXXXVEDPS 1369 LS++I + +M S+Q+ +ALD Y+ E +++ S Sbjct: 333 LSEKI------KIKMQTSRQEQKMMLEKALDLYVRE-RRMRLFWLSLQTYLIGMTLDNTS 385 Query: 1370 LLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 + K + I+ L + FRD +H YDCG L Sbjct: 386 KVKKVLGIVSLILIFRDQRSHTDYDCGRL 414 >dbj|BAD43328.1| unnamed protein product [Arabidopsis thaliana] Length = 397 Score = 321 bits (823), Expect = 6e-85 Identities = 181/400 (45%), Positives = 243/400 (60%), Gaps = 1/400 (0%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT++ALHR+GIKSVVLER++ +R+ GA IG NGWRAL QLG+ Sbjct: 1 MEEIGIVIVGGGIAGLATSIALHREGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRETPMS-EGEARCLKRSDLVEALANALPLETIRFGS 616 LR ++ + + L + G RE + EARC+KR+DLVEAL++ALP TIRFGS Sbjct: 61 GDRLRLNSSLIHKARTMLIENGKKREFVSNIVDEARCIKRNDLVEALSDALPKGTIRFGS 120 Query: 617 
QIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLT 796 IVS+ DK++ V+ L +G+ IKAKVLIGCDGA+SI+ DY+ L P + F+ AVRG T Sbjct: 121 HIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFT 180 Query: 797 IYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMTQD 976 YPNGH + E RIK+ L+GRLP+ N V+WF+V Q + KD SI + + Sbjct: 181 KYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWFLV--HMQDNNHNGKDQESIANLCRK 238 Query: 977 AVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQ 1156 SED+ M++ ++ SL+LT LRYRAP +++ FR+ VTVAGDA HVMGPFL Q Sbjct: 239 WADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQ 298 Query: 1157 GGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXX 1336 GGS ALEDA+VLARCL++++ + + S + EA+DEY+ E Sbjct: 299 GGSAALEDAVVLARCLARKV-GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTY 357 Query: 1337 XXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +L +LL +F RD + H RYDCG L Sbjct: 358 LTGRSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 397 >dbj|BAD43227.1| unnamed protein product [Arabidopsis thaliana] Length = 397 Score = 321 bits (822), Expect = 7e-85 Identities = 181/400 (45%), Positives = 242/400 (60%), Gaps = 1/400 (0%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT++ALHRKGIKSVVLER++ +R+ GA IG NGWRAL QLG+ Sbjct: 1 MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRETPMS-EGEARCLKRSDLVEALANALPLETIRFGS 616 L ++ + + L + G RE + EARC+KR+DLVEAL++ALP TIRFGS Sbjct: 61 GDRLHLNSSLIHKARTMLIENGKKREFVSNIVDEARCIKRNDLVEALSDALPKGTIRFGS 120 Query: 617 QIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLT 796 IVS+ DK++ V+ L +G+ IKAKVLIGCDGA+SI+ DY+ L P + F+ AVRG T Sbjct: 121 HIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFT 180 Query: 797 IYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMTQD 976 YPNGH + E RIK+ L+GRLP+ N V+WF+V Q + KD SI + + Sbjct: 181 KYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWFLV--HMQDNNHNGKDQESIANLCRK 238 Query: 977 
AVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQ 1156 SED+ M++ ++ SL+LT LRYRAP +++ FR+ VTVAGDA HVMGPFL Q Sbjct: 239 WADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQ 298 Query: 1157 GGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXX 1336 GGS ALEDA+VLARCL++++ + + S + EA+DEY+ E Sbjct: 299 GGSAALEDAVVLARCLARKV-GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTY 357 Query: 1337 XXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +L +LL +F RD + H RYDCG L Sbjct: 358 LTGRSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 397 >dbj|BAD44358.1| unnamed protein product [Arabidopsis thaliana] Length = 397 Score = 320 bits (820), Expect = 1e-84 Identities = 181/400 (45%), Positives = 242/400 (60%), Gaps = 1/400 (0%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT++ALHRKGIKSVVLER++ +R+ GA IG NGWRAL QLG+ Sbjct: 1 MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRETPMS-EGEARCLKRSDLVEALANALPLETIRFGS 616 LR ++ + + L + G RE + EARC+KR+DLV AL++ALP TIRFGS Sbjct: 61 GDRLRLNSSLIHKARTMLIENGKKREFVSNIVDEARCIKRNDLVGALSDALPKGTIRFGS 120 Query: 617 QIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLT 796 IVS+ DK++ V+ L +G+ IKAKVLIGCDGA+SI+ DY+ L P + F+ AVRG T Sbjct: 121 HIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFT 180 Query: 797 IYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMTQD 976 YPNGH + E RIK+ L+GRLP+ N V+WF+V Q + KD SI + + Sbjct: 181 KYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWFLV--HMQDNNHNGKDQESIANLCRK 238 Query: 977 AVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQ 1156 SED+ M++ ++ SL+LT LRYRAP +++ FR+ VTVAGDA HVMGPFL Q Sbjct: 239 WADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQ 298 Query: 1157 GGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXX 1336 GGS ALEDA+VLARCL++++ + + S + EA+DEY+ E Sbjct: 299 GGSAALEDAVVLARCLARKV-GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTY 357 Query: 1337 XXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +L +LL +F RD + H 
RYDCG L Sbjct: 358 LTGRSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 397 >gb|ESW03318.1| hypothetical protein PHAVU_011G004100g [Phaseolus vulgaris] Length = 404 Score = 320 bits (819), Expect = 2e-84 Identities = 178/387 (45%), Positives = 239/387 (61%), Gaps = 3/387 (0%) Frame = +2 Query: 305 LATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGLDSTLRPTATQLQRVV 484 LATALALHRK IKSVVLERS+T+RATGAAI V NGW ALHQLG+ STLR TA +QR Sbjct: 19 LATALALHRKRIKSVVLERSETVRATGAAIIVQANGWHALHQLGIASTLRQTAIPIQRGR 78 Query: 485 DDLADKGVVRETPMSEG-EARCLKRSDLVEALANALPLETIRFGSQIVSVNVDKSSSPAV 661 ++ E P E RCLKRSDLV+ +A+ LP TIR Q++S+++D ++ Sbjct: 79 FISLNEAEPMEFPFGVNQEFRCLKRSDLVKVMADNLPKGTIRTNCQVLSIDLDPVTNFPH 138 Query: 662 IQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRI--FSKCAVRGLTIYPNGHAYASEFY 835 + L +G+VI AKV+IGCDG +S IG GL T + FS C RG T YPNGH +ASEF Sbjct: 139 LMLSNGTVIHAKVVIGCDGVNSAIGSMFGLYRTTLSLFSTCVARGFTNYPNGHQFASEFV 198 Query: 836 RIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMTQDAVAGFSEDFVGMI 1015 + + + +GR+P+ VYWFV D+ + KDP IRQ +++ GF E MI Sbjct: 199 MMSRGQVQLGRIPVTDKLVYWFVTRLRTSRDSTIWKDPVLIRQSLMESMKGFPEGPTEMI 258 Query: 1016 ENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQGGSLALEDAIVLA 1195 +N +++ L LT L+YRAPW+LLF +FRK VT+AGDA H GPF+ QGGS ++ED IVLA Sbjct: 259 KNCNLSFLHLTELKYRAPWELLFNSFRKGTVTIAGDAMHATGPFVAQGGSASIEDGIVLA 318 Query: 1196 RCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXXXXXXXXVEDPSLL 1375 RCL+++ N ++ A EA DEY+ E ++ S + Sbjct: 319 RCLAQKKFNNAKKTEETEINIAVAEEAFDEYVRE-RKMRNFWLSFHSFLVGKKLDTKSSI 377 Query: 1376 LKFMCIILLTVFFRDPLNHIRYDCGEL 1456 ++F+ + +++ FRDP H RY CG L Sbjct: 378 IRFIILAIMSTLFRDPDWHSRYHCGNL 404 >dbj|BAD44627.1| unnamed protein product [Arabidopsis thaliana] gi|62318646|dbj|BAD95117.1| hypothetical protein [Arabidopsis thaliana] Length = 397 Score = 320 bits (819), Expect = 2e-84 Identities = 181/400 (45%), Positives = 242/400 (60%), Gaps = 1/400 (0%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT++ALHRKGIKSVVLER++ +R+ GA IG NGWRAL QLG+ Sbjct: 1 
MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60 Query: 440 DSTLRPTATQLQRVVDDLADKGVVRETPMS-EGEARCLKRSDLVEALANALPLETIRFGS 616 LR ++ + + L + RE + EARC+KR+DLVEAL++ALP TIRFGS Sbjct: 61 GDRLRLNSSLIHKARTMLIENEKKREFVSNIVDEARCIKRNDLVEALSDALPKGTIRFGS 120 Query: 617 QIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLT 796 IVS+ DK++ V+ L +G+ IKAKVLIGCDGA+SI+ DY+ L P + F+ AVRG T Sbjct: 121 HIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFT 180 Query: 797 IYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMTQD 976 YPNGH + E RIK+ L+GRLP+ N V+WF+V Q + KD SI + + Sbjct: 181 KYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWFLV--HMQDNNHNGKDQESIANLCRK 238 Query: 977 AVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQ 1156 SED+ M++ ++ SL+LT LRYRAP +++ FR+ VTVAGDA HVMGPFL Q Sbjct: 239 WADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQ 298 Query: 1157 GGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEXXXXXXXXXXXXX 1336 GGS ALEDA+VLARCL++++ + + S + EA+DEY+ E Sbjct: 299 GGSAALEDAVVLARCLARKV-GPDHGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTY 357 Query: 1337 XXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +L +LL +F RD + H RYDCG L Sbjct: 358 LTGRSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 397 >gb|EOY03035.1| Monooxygenase, putative isoform 1 [Theobroma cacao] Length = 413 Score = 318 bits (814), Expect = 6e-84 Identities = 175/380 (46%), Positives = 234/380 (61%), Gaps = 5/380 (1%) Frame = +2 Query: 332 KGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGLDSTLRPTATQLQRVVDDLADKGVV 511 KGI+++VLERS+ LRATGAAI V PNGWRAL QLG+ S LR TA +Q G Sbjct: 41 KGIETIVLERSENLRATGAAIIVQPNGWRALDQLGIASKLRQTAVSIQSGRYITVKDGKQ 100 Query: 512 RETPMSE-GEARCLKRSDLVEALANALPLETIRFGSQIVSVNVDKSSSPAVIQLHDGSVI 688 ++ P+ + GE RCLKR+DL+ ALA LP +T+R G ++VS+ +D S+S ++QL DGSV+ Sbjct: 101 KDLPVGDVGELRCLKRTDLLNALAENLPADTVRLGCKVVSITLDPSTSYPILQLQDGSVL 160 Query: 689 KAKVLIGCDGAHSIIGDYIGLKPTRIFSKCAVRGLTIYPNGHAYASEFYRIKKDKHLVGR 868 AKV+IGCDG +S I + +GL TR+FS +RG T Y GH + S F KD +G Sbjct: 161 
MAKVVIGCDGVNSTIANILGLNSTRLFSTSVIRGFTNYETGHEFGSAFLVFSKDDVQLGL 220 Query: 869 LPIDKNTVYWFVVLPWNQGDAEMPKDPASIRQMTQDAVAGFSEDFVGMIENSDITSLSLT 1048 LP+ + VYWFV D+++ K I++ T +A+ GF + M+++SD+ SL LT Sbjct: 221 LPVTEKLVYWFVTRKQTSQDSKVSKSQTLIKESTVEAMKGFPIHIMEMVKDSDLDSLHLT 280 Query: 1049 RLRYRAPWDLLFANFRKDNVTVAGDAWHVMGPFLGQGGSLALEDAIVLARCLSKRICNAG 1228 LR+ APWDLL N R+ VTVAGDA H M PFL QGGS +LEDA+VLARCLS+ Sbjct: 281 DLRFLAPWDLLGTNLRRGTVTVAGDAMHAMAPFLAQGGSASLEDAVVLARCLSQN----- 335 Query: 1229 VNENRMNLSQQKAM----EALDEYLMEXXXXXXXXXXXXXXXXXXXVEDPSLLLKFMCII 1396 R++ Q K M ALD+Y+ E ++ +LL+K +CII Sbjct: 336 -QTMRVDEKQAKTMMDMEAALDQYVKE-RKMRVFWLSLETFLIGTMLDTSTLLVKCLCII 393 Query: 1397 LLTVFFRDPLNHIRYDCGEL 1456 L V FRD + H RYDCG L Sbjct: 394 SLMVLFRDKIAHTRYDCGRL 413 >ref|NP_001190738.1| monooxygenase 1 [Arabidopsis thaliana] gi|332658248|gb|AEE83648.1| monooxygenase 1 [Arabidopsis thaliana] Length = 409 Score = 317 bits (811), Expect = 1e-83 Identities = 182/412 (44%), Positives = 245/412 (59%), Gaps = 13/412 (3%) Frame = +2 Query: 260 MEEXXXXXXXXXXXXLATALALHRKGIKSVVLERSDTLRATGAAIGVFPNGWRALHQLGL 439 MEE LAT++ALHRKGIKSVVLER++ +R+ GA IG NGWRAL QLG+ Sbjct: 1 MEEIGIVIVGGGIAGLATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGV 60 Query: 440 DSTLRPTATQLQRVV------------DDLADKGVVRETPMS-EGEARCLKRSDLVEALA 580 LR ++ + +++ L + G RE + EARC+KR+DLVEAL+ Sbjct: 61 GDRLRLNSSLIHKILIYGPFLDMNRARTMLIENGKKREFVSNIVDEARCIKRNDLVEALS 120 Query: 581 NALPLETIRFGSQIVSVNVDKSSSPAVIQLHDGSVIKAKVLIGCDGAHSIIGDYIGLKPT 760 +ALP TIRFGS IVS+ DK++ V+ L +G+ IKAKVLIGCDGA+SI+ DY+ L P Sbjct: 121 DALPKGTIRFGSHIVSIEQDKTTLFPVVHLANGNSIKAKVLIGCDGANSIVSDYLQLNPK 180 Query: 761 RIFSKCAVRGLTIYPNGHAYASEFYRIKKDKHLVGRLPIDKNTVYWFVVLPWNQGDAEMP 940 + F+ AVRG T YPNGH + E RIK+ L+GRLP+ N V+WF+V Q + Sbjct: 181 KAFACRAVRGFTKYPNGHGFPQEVLRIKQGNVLIGRLPLTDNQVFWFLV--HMQDNNHNG 238 Query: 941 KDPASIRQMTQDAVAGFSEDFVGMIENSDITSLSLTRLRYRAPWDLLFANFRKDNVTVAG 1120 KD SI + + SED+ M++ ++ SL+LT LRYRAP +++ FR+ VTVAG Sbjct: 239 
KDQESIANLCRKWADDLSEDWKEMVKICNVESLTLTHLRYRAPSEIMLGKFRRGTVTVAG 298 Query: 1121 DAWHVMGPFLGQGGSLALEDAIVLARCLSKRICNAGVNENRMNLSQQKAMEALDEYLMEX 1300 DA HVMGPFL QGGS ALEDA+VLARCL++++ + + S + EA+DEY+ E Sbjct: 299 DAMHVMGPFLAQGGSAALEDAVVLARCLARKV-GPDHGDLLKDCSMKNIEEAIDEYVDER 357 Query: 1301 XXXXXXXXXXXXXXXXXXVEDPSLLLKFMCIILLTVFFRDPLNHIRYDCGEL 1456 +L +LL +F RD + H RYDCG L Sbjct: 358 RMRLLGLSVQTYLTGRSLQTSSKVLRLMFIALLLLLFGRDQIRHTRYDCGRL 409