BLASTX nr result
ID: Rauwolfia21_contig00006824
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00006824
         (1439 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004237255.1| PREDICTED: FAD-dependent urate hydroxylase-l...   390   e-106
ref|XP_002515156.1| monoxygenase, putative [Ricinus communis] gi...   381   e-103
gb|EXC30730.1| 3-hydroxybenzoate 6-hydroxylase 1 [Morus notabilis]    378   e-102
gb|EMJ01313.1| hypothetical protein PRUPE_ppa018848mg [Prunus pe...   367   7e-99
ref|XP_006414450.1| hypothetical protein EUTSA_v10025376mg [Eutr...   353   8e-95
ref|XP_006283848.1| hypothetical protein CARUB_v10004954mg [Caps...   349   1e-93
ref|XP_006414451.1| hypothetical protein EUTSA_v10025403mg [Eutr...   349   2e-93
ref|XP_002868183.1| monooxygenase [Arabidopsis lyrata subsp. lyr...   347   9e-93
gb|EOY17302.1| Monooxygenase, putative [Theobroma cacao]              344   5e-92
ref|XP_006283836.1| hypothetical protein CARUB_v10004937mg, part...   344   5e-92
ref|XP_003540567.1| PREDICTED: zeaxanthin epoxidase, chloroplast...   343   1e-91
emb|CAA07574.1| monooxygenase [Arabidopsis thaliana] gi|51968448...   340   9e-91
ref|NP_193311.6| monooxygenase 1 [Arabidopsis thaliana] gi|33265...   340   9e-91
dbj|BAD44358.1| unnamed protein product [Arabidopsis thaliana]        339   2e-90
dbj|BAD44627.1| unnamed protein product [Arabidopsis thaliana] g...   339   2e-90
ref|XP_003533524.1| PREDICTED: zeaxanthin epoxidase, chloroplast...   339   2e-90
dbj|BAD43328.1| unnamed protein product [Arabidopsis thaliana]        338   3e-90
dbj|BAD43227.1| unnamed protein product [Arabidopsis thaliana]        338   3e-90
gb|EOY03035.1| Monooxygenase, putative isoform 1 [Theobroma cacao]    336   1e-89
ref|XP_002868182.1| hypothetical protein ARALYDRAFT_355191 [Arab...   332   2e-88

>ref|XP_004237255.1| PREDICTED: FAD-dependent urate hydroxylase-like [Solanum lycopersicum] Length = 394 Score = 390 bits (1002), Expect = e-106 Identities = 203/372 (54%), Positives = 259/372 (69%), Gaps = 2/372 (0%) Frame = +1 Query: 145 RKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQDIWIDEGK 324 RKG+K VVLEKS+ LR GAAIGVL NGW+ALDQLGV LR A+ +QG + W+D+G Sbjct: 30 RKGVKSVVLEKSESLRSEGAAIGVLPNGWKALDQLGVAPYLRTTALPLQGMRITWMDKGN 89 Query: 325 REDLQFL-AGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTLHSYDGKC 501 + + GE RCLKR D+++ ADALPP T+RF C IV+V+MDP P++ +G Sbjct: 90 EKFTPYKNIGEVRCLKRSDIVETFADALPPRTIRFGCDIVSVEMDPITSLPSILLSNGNR 149 Query: 502 IRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLRKNKILFG 681 I K LIGCDGSRSIVA FLGLKP+K+F CA RG T+Y +GHSF EFVRL + G Sbjct: 150 IGAKVLIGCDGSRSIVASFLGLKPAKTFRTCAIRGLTSYPNGHSFPLEFVRLIVGQTAVG 209 Query: 682 RIPINDNLVYWFVA-QPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGCDLDSLS 858 R+PI D LV+WFV+ Q T DAK P D ++IKQ A++A IG P D+ EMI+ CDLDSL Sbjct: 210 RLPITDKLVHWFVSVQQGT--DAKFPQDTQVIKQRAMEAVIGHPADVQEMIKKCDLDSLW 267 Query: 859 FAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNVVEKVGL 1038 F+HLRYR PW+++ G F VTVAGDAMHVMGPFLGQGGS+GIEDAVVL RN+ + + Sbjct: 268 FSHLRYRAPWDLMFGNFREKTVTVAGDAMHVMGPFLGQGGSSGIEDAVVLGRNLAKTING 327 Query: 1039 DGYGREQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFIAVILMNILFSD 1218 + E+ A Y++ER+MRV++L+ Q+YL GLL E ++K + V +M I F + Sbjct: 328 SCFDHEE-----AVNQYIKERKMRVVKLATQSYLTGLLFENRPMLTKIVIVAVMAIFFRN 382 Query: 1219 KGGHTKYDCGHL 1254 HT+YDCG L Sbjct: 383 PSAHTQYDCGLL 394
>ref|XP_002515156.1| monoxygenase, putative [Ricinus communis] gi|223545636|gb|EEF47140.1| monoxygenase, putative [Ricinus communis] Length = 397 Score = 381
bits (978), Expect = e-103 Identities = 193/375 (51%), Positives = 260/375 (69%), Gaps = 5/375 (1%) Frame = +1 Query: 145 RKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQDIWIDEGK 324 RKG++ VVLE+S+ LR GA I VL NGWRALD+LGVG+++R A+ +Q I I Sbjct: 27 RKGIRSVVLERSETLRAAGAGIAVLTNGWRALDELGVGSKIRPTALPLQRYHPILI---- 82 Query: 325 REDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTLHSYDGKCI 504 + GEARC+KR DLI+ALAD LP GT+RF C I++V +DP P L +G I Sbjct: 83 APIVMIEIGEARCVKRSDLIEALADDLPLGTIRFGCDILSVNLDPEISFPILQLSNGSSI 142 Query: 505 RTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLRKNKILFGR 684 + K LIGCDG+ S+V+DFL LKP K F+LCA RG+T+Y +GH + E +R+ K +L GR Sbjct: 143 KAKALIGCDGANSVVSDFLELKPKKLFSLCAVRGFTHYPNGHGLAPELIRMVKGNVLCGR 202 Query: 685 IPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGCDLDSLSFA 864 +P++DNLV+WF+ Q P D +P DPEL++Q +L++ FPT+ +EM++ C++ SLS Sbjct: 203 VPVDDNLVFWFIIQNFFPKDTNIPKDPELMRQFSLESIKDFPTERLEMVKNCEVTSLSLT 262 Query: 865 HLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNVVEKVGLDG 1044 HLRYR PWEI +G+F RG TVAGDAMH+MGPF+GQGGSA IEDAVVLAR + K+ G Sbjct: 263 HLRYRTPWEIYLGKFRRGTATVAGDAMHIMGPFIGQGGSAAIEDAVVLARCLSAKMQEVG 322 Query: 1045 YGRE-----QREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFIAVILMNIL 1209 + ++IG+AF DYV+ERRMR++ LS QTYL G LL+ +S + K + M +L Sbjct: 323 QLKSSSHIMSQKIGEAFDDYVKERRMRLVWLSTQTYLYGSLLQNSSRLVKVSIAVAMIVL 382 Query: 1210 FSDKGGHTKYDCGHL 1254 F + HT+YDCG L Sbjct: 383 FGNPIYHTRYDCGPL 397 >gb|EXC30730.1| 3-hydroxybenzoate 6-hydroxylase 1 [Morus notabilis] Length = 404 Score = 378 bits (971), Expect = e-102 Identities = 196/378 (51%), Positives = 259/378 (68%), Gaps = 8/378 (2%) Frame = +1 Query: 145 RKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQDIWIDEGK 324 RKG+K VVLE+S+ LR G+AI +L NGWRALDQLG+G +LRQ A+ +QG +DIW+D K Sbjct: 27 RKGIKSVVLERSETLRAFGSAIAILTNGWRALDQLGIGPKLRQTALPLQGVRDIWLDGNK 86 Query: 325 REDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTLHSYDGKCI 504 + GEARC+KR DLI LA LP GT+RF C I+ V++DP P L DG+ I Sbjct: 87 
QRRGPLSKGEARCVKRSDLINMLAQDLPHGTIRFGCHILFVELDPLTNFPILQLRDGRAI 146 Query: 505 RTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLRKNKILFGR 684 + K LIGCDG+ S+VA++L +KP KSF RG T Y H F EFVR N ++ GR Sbjct: 147 KAKILIGCDGASSVVAEYLKVKPKKSFPAFGIRGLTYYPSPHGFDPEFVRTHGNNVVCGR 206 Query: 685 IPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQAT-IGFPTDMVEMIEGCDLDSLSF 861 IN NLV+WF+ P D+++ DPELIKQ+AL+ T FP + +EMI+ CD+ SLS Sbjct: 207 STINQNLVFWFLLLPGYLKDSEIFKDPELIKQMALEKTNDAFPKETIEMIKDCDITSLSL 266 Query: 862 AHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNVVEKV--- 1032 HL YRP W+IL+G F +G VT+AGD+MHVMGPFLGQGGSA +EDAVVLAR + K+ Sbjct: 267 THLWYRPAWDILLGTFRKGMVTLAGDSMHVMGPFLGQGGSAAMEDAVVLARCLANKIHGE 326 Query: 1033 ---GLDG-YGREQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFIAVILM 1200 G +G G ++++ +A YV+ERRMR++RLSAQ+Y+ GLL + S + K + + L+ Sbjct: 327 SINGFEGNNGLFRKKMEEAMDLYVKERRMRLVRLSAQSYVTGLLFSSASMIGKILLLALI 386 Query: 1201 NILFSDKGGHTKYDCGHL 1254 +LF D HT+YDCGHL Sbjct: 387 IVLFQDPIRHTRYDCGHL 404 >gb|EMJ01313.1| hypothetical protein PRUPE_ppa018848mg [Prunus persica] Length = 387 Score = 367 bits (942), Expect = 7e-99 Identities = 192/379 (50%), Positives = 257/379 (67%), Gaps = 9/379 (2%) Frame = +1 Query: 145 RKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQDIWIDEGK 324 RKGL+ VVLE+S+ LR TGA I + NGWRALD+LGV ++LRQ A+ +QGG Sbjct: 27 RKGLRSVVLERSESLRATGAGITIRTNGWRALDELGVASKLRQTAMPLQGG--------- 77 Query: 325 REDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTLHSYDGKCI 504 GE RCLKR+DLI ALA++LP GT+R CQ ++V++D P+LH +G I Sbjct: 78 --------GETRCLKRMDLITALAESLPRGTIRLGCQALSVRLDSSTSSPSLHLQNGSSI 129 Query: 505 RTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLRKNKILFGR 684 + K LIGCDG+ S+VADFL LKPSK F+L RG+T Y GH+F ++FV+++ +K GR Sbjct: 130 KAKVLIGCDGTNSVVADFLDLKPSKLFSLSEVRGFTMYPSGHNFGNQFVQVKGDKCTVGR 189 Query: 685 IPINDNLVYWFVAQPST--PLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGCDLDSLS 858 IPI++ LVYWFV Q ++P DPELI+QL L+A FP++M++MI D SLS Sbjct: 190 IPIHNKLVYWFVTQKVMYGRGGLEVPKDPELIRQLTLEAIKDFPSEMIDMISKSDTKSLS 249 Query: 859 
FAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNVVEKVGL 1038 LRYR PW+ILV F +G VTVAGDAMH MGPFLGQGGSAGIED++V+AR + +++ Sbjct: 250 NTRLRYRSPWDILVRNFRKGSVTVAGDAMHTMGPFLGQGGSAGIEDSIVIARCLAQELA- 308 Query: 1039 DGYGREQR-------EIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFIAVIL 1197 + Y ++ R ++ +A YV+ERRMR++ LS QTYL GLL + + + KF+ + L Sbjct: 309 ENYDKKSRARNIMMMKVEEALDKYVKERRMRLVLLSTQTYLAGLLQQDSGLIVKFVCIFL 368 Query: 1198 MNILFSDKGGHTKYDCGHL 1254 M LFSD HT+YDCG L Sbjct: 369 MTALFSDMTRHTRYDCGCL 387 >ref|XP_006414450.1| hypothetical protein EUTSA_v10025376mg [Eutrema salsugineum] gi|557115620|gb|ESQ55903.1| hypothetical protein EUTSA_v10025376mg [Eutrema salsugineum] Length = 398 Score = 353 bits (907), Expect = 8e-95 Identities = 181/382 (47%), Positives = 259/382 (67%), Gaps = 5/382 (1%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + ++ + RKG+K +VLE+S+ +R GAA G+ NGW AL QLG+ +LR ++ I +D Sbjct: 17 ATSLALHRKGIKSIVLERSETVRSEGAAFGIQTNGWLALQQLGLADKLRPNSLPIHQIRD 76 Query: 304 IWIDEG--KREDLQFLA-GEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQP 474 + I+EG +RE + + GE R + R DL++ALA LP GT+R CQIV+VK+D P Sbjct: 77 VLIEEGIKRRESVGPASYGEVRGVIRNDLVRALAHELPLGTLRLGCQIVSVKLDETLSFP 136 Query: 475 TLHSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVR 654 +H +G+ I++K LIGCDGS S+V++FLGLKP+KS + A RG+TNY DGH F EF+R Sbjct: 137 IVHVKNGQDIKSKVLIGCDGSNSVVSEFLGLKPTKSLSSRAVRGFTNYPDGHGFRQEFIR 196 Query: 655 LRKNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIE 834 ++ + ++ GR+PI LV+WFV P D+ + E I + L + F + EM++ Sbjct: 197 IKMDNVVSGRLPITPKLVFWFVVLLKCPQDSNFLRNQEDIARFTLSSVNDFSQEWKEMVK 256 Query: 835 GCDLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLAR 1014 CD++SL LRYR PW+++ G+F RG VTVAGD+MH+MGPFLGQG SA +ED VVLAR Sbjct: 257 NCDINSLYINRLRYRAPWDVMSGKFRRGTVTVAGDSMHLMGPFLGQGCSAALEDGVVLAR 316 Query: 1015 NVVEKVGLDGYGR--EQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFIA 1188 + K+G DG ++ I +A DYV ERR R++RLS QTYL L+EA+SP++K + Sbjct: 317 
CLWRKLGQDGMNNVFSRKRIEEAIDDYVRERRGRLVRLSTQTYLTSRLIEASSPVTKLLV 376 Query: 1189 VILMNILFSDKGGHTKYDCGHL 1254 V+L+ I+F D+ GHT+YDCG L Sbjct: 377 VVLLMIMFRDQIGHTRYDCGRL 398 >ref|XP_006283848.1| hypothetical protein CARUB_v10004954mg [Capsella rubella] gi|482552553|gb|EOA16746.1| hypothetical protein CARUB_v10004954mg [Capsella rubella] Length = 404 Score = 349 bits (896), Expect = 1e-93 Identities = 184/388 (47%), Positives = 259/388 (66%), Gaps = 11/388 (2%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + ++ + RKG+K VVLE+S+ +R GAA G+ NGW AL+QLGV +LR ++ I +D Sbjct: 17 ATSLALHRKGIKSVVLERSESVRSQGAAFGIQTNGWLALEQLGVADKLRLNSLPIPQIRD 76 Query: 304 IWIDEG-KREDLQFLA--GEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQP 474 + ++G KR + LA GE R + R DL++ALA ALP GT+R CQIV+V++D P Sbjct: 77 VMFEKGIKRRESVGLASYGEVRGVIRNDLVRALAHALPLGTLRLGCQIVSVQLDETTSFP 136 Query: 475 TLHSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVR 654 +H +G+ I+ K LIGCDGS SIV+ FLGL P+K+ A RG+TNY DGH F +EF+R Sbjct: 137 IVHVQNGEPIKAKVLIGCDGSNSIVSRFLGLNPTKALGARAVRGFTNYPDGHEFPNEFIR 196 Query: 655 LRKNKILFGRIPINDNLVYWFVAQPSTP--LDAKLPNDPELIKQLALQATIGFPTDMVEM 828 ++ + ++ GR+PI LV+WFV + P LD+ L E I +L L + F D EM Sbjct: 197 IKMDNVVCGRLPITHKLVFWFVVLLNCPQELDSNLVKKQEDITRLTLTSIGEFSEDWKEM 256 Query: 829 IEGCDLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVL 1008 ++ CD+DSL + LRYR PW+++ G+F RG VTVAGD+MH+MGPFLGQG SA +ED VVL Sbjct: 257 VKNCDMDSLYISRLRYRAPWDVMSGKFRRGTVTVAGDSMHLMGPFLGQGTSAALEDGVVL 316 Query: 1009 ARNVVEKVGLD------GYGREQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSP 1170 AR + K+G + Y + + +A +Y+ ERR R++ LS QTYL G L+EA+SP Sbjct: 317 ARCLWRKLGQNSVNSNVSYSASRTQFEEAIDEYIRERRGRLVGLSTQTYLTGCLIEASSP 376 Query: 1171 MSKFIAVILMNILFSDKGGHTKYDCGHL 1254 + K + V+L+ ILF D+ GHT+YDCG L Sbjct: 377 VRKILFVVLLMILFRDRIGHTRYDCGRL 404 >ref|XP_006414451.1| hypothetical protein EUTSA_v10025403mg [Eutrema salsugineum] gi|557115621|gb|ESQ55904.1| hypothetical protein EUTSA_v10025403mg [Eutrema salsugineum] Length 
= 394 Score = 349 bits (895), Expect = 2e-93 Identities = 184/380 (48%), Positives = 253/380 (66%), Gaps = 3/380 (0%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + ++ + RKG+K VVLE+++ +R GA IG L NGWRALDQLGV LR + LI+ + Sbjct: 17 ATSLALHRKGIKSVVLERAEKVRSEGAGIGTLTNGWRALDQLGVSHRLRLTSNLIRKART 76 Query: 304 IWIDEGK-REDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTL 480 + I+ GK RE + + EARC++R DL++ALADALP T+RF QIV+++ D P + Sbjct: 77 MLIENGKKREFVLNIEDEARCIRRNDLVEALADALPEETIRFGSQIVSIEEDETTSFPVV 136 Query: 481 HSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLR 660 H +G I+ K LIGCDG+ S+V+D+L L P K+FA A RG+TNY +GH F E +R++ Sbjct: 137 HLTNGNTIKAKVLIGCDGANSVVSDYLRLSPKKAFACRAVRGFTNYPNGHGFPQELLRMK 196 Query: 661 KNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGC 840 +L GR+P+ DNLV+WFV + D E I + L+ D EM++ C Sbjct: 197 TGNVLVGRLPLTDNLVFWFVVHMQD--NHHNGTDQESIANVTLKWVDKLSEDWQEMVQKC 254 Query: 841 DLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNV 1020 D++SL+ HLRYR PWEI+ +F RG VTVAGDAMHVMGPFLGQGGSA +EDAVVLAR + Sbjct: 255 DVESLTITHLRYRSPWEIMFRKFRRGTVTVAGDAMHVMGPFLGQGGSAALEDAVVLARCL 314 Query: 1021 VEKVGLD-GYGREQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFIAVIL 1197 +KVG D G + I +A +YVE+RRMR++ LS QTYL G L+ S + + + ++L Sbjct: 315 AKKVGPDHGEDCSMKNIEEAIDEYVEKRRMRLVGLSTQTYLTGRSLQTQSNVVRLMFIVL 374 Query: 1198 MNILFS-DKGGHTKYDCGHL 1254 + +LF D+ HTKYDCG L Sbjct: 375 LVVLFGRDQIRHTKYDCGRL 394 >ref|XP_002868183.1| monooxygenase [Arabidopsis lyrata subsp. lyrata] gi|297314019|gb|EFH44442.1| monooxygenase [Arabidopsis lyrata subsp. 
lyrata] Length = 397 Score = 347 bits (889), Expect = 9e-93 Identities = 186/384 (48%), Positives = 255/384 (66%), Gaps = 7/384 (1%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + ++ + RKG+K VVLE+++ +R GA IG L NGWRALDQLGVG LR + LI + Sbjct: 17 ATSLALHRKGIKSVVLERAEKVRSEGAGIGTLTNGWRALDQLGVGDRLRLTSRLIHKART 76 Query: 304 IWIDEGKRED-LQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTL 480 + I+ GK+++ + L EARC+KR DL++ALADALP GT+RF QIV+++ D P + Sbjct: 77 MLIENGKKQEFVSTLVDEARCIKRNDLVEALADALPEGTIRFGSQIVSIEEDKSTSFPVV 136 Query: 481 HSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLR 660 H +G I K LIGCDG+ SIV+++L L P K+FA A RG+TNY +GH F E +R++ Sbjct: 137 HLTNGNTIEAKVLIGCDGANSIVSEYLQLNPKKAFACRAVRGFTNYPNGHGFPQEVLRIK 196 Query: 661 KNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGC 840 + IL GR+P+ DNLV+WF+ + D E I L L+ D EM++ C Sbjct: 197 QGNILIGRLPLTDNLVFWFLVHMQD--NNHNGKDQESIANLCLKWAEDLSEDWKEMVKIC 254 Query: 841 DLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNV 1020 D++SL+ HLRYR P EI++G+F RG VTVAGDAMHVMGPFL QGGSA +EDAVVLAR + Sbjct: 255 DVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQGGSAALEDAVVLARCL 314 Query: 1021 VEKVGLDGYGR-----EQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFI 1185 KVG D +G + I +A +YVEERRMR+L LS QTYL G L+ +S + + + Sbjct: 315 ARKVGPD-HGDLLKDCSMKNIEEAIDEYVEERRMRLLGLSVQTYLTGRSLQTSSKVLRLM 373 Query: 1186 AVILMNILFS-DKGGHTKYDCGHL 1254 + L+ +LF D+ H++YDCG L Sbjct: 374 FIALLLLLFGRDQIRHSRYDCGRL 397 >gb|EOY17302.1| Monooxygenase, putative [Theobroma cacao] Length = 414 Score = 344 bits (883), Expect = 5e-92 Identities = 177/374 (47%), Positives = 241/374 (64%), Gaps = 4/374 (1%) Frame = +1 Query: 145 RKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQDIWIDEGK 324 RKG+K VVLEKS+ LR TG I + NGWRALDQLGV ++LR+ A+ I Q I +D+GK Sbjct: 41 RKGIKSVVLEKSETLRTTGVGIIMQPNGWRALDQLGVASKLRETAMDISSRQLIMVDDGK 100 Query: 325 REDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTLHSYDGKCI 504 R +L GE RCLKR+DL++ LA+ LP TV F C+++++ +DP P 
L +DG I Sbjct: 101 RLELPLGKGELRCLKRLDLVEVLAEPLPVNTVHFGCKVLSIVLDPVTSYPVLQLHDGSII 160 Query: 505 RTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLRKNKILFGR 684 R K +IGCDG S+++ FLG+ P K F+ CATRG+T Y GH FS F + + + G+ Sbjct: 161 RAKIVIGCDGVNSVISKFLGMNPPKLFSRCATRGFTWYERGHDFSGVFRIHKTDNVQLGQ 220 Query: 685 IPINDNLVYWFVAQPSTPLDAKL-PNDPELIKQLALQATIGFPTDMVEMIEGCDLDSLSF 861 +P+ D LVYWF+ + TP D+ DP K+ +++A GFP + VEMI+ + SL Sbjct: 221 LPVTDKLVYWFLTRSLTPQDSNASKKDPAYTKEASMEAMKGFPHETVEMIKNSEDKSLYL 280 Query: 862 AHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNVVEKVGLD 1041 LRY PPWE+L +F G V VAGDAMH M PF+ QGG A +EDAVVLAR + EK+ + Sbjct: 281 TELRYLPPWELLRAKFRLGTVVVAGDAMHAMCPFISQGGGASLEDAVVLARCLSEKIKIK 340 Query: 1042 GYGREQRE---IGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFIAVILMNILF 1212 Q + + KA YV ERRMR+ LS QTYLIG+ L+ TS + K + ++ + ++F Sbjct: 341 MQTSRQEQKMMLEKALDLYVRERRMRLFWLSLQTYLIGMTLDNTSKVKKVLGIVSLILIF 400 Query: 1213 SDKGGHTKYDCGHL 1254 D+ HT YDCG L Sbjct: 401 RDQRSHTDYDCGRL 414 >ref|XP_006283836.1| hypothetical protein CARUB_v10004937mg, partial [Capsella rubella] gi|482552541|gb|EOA16734.1| hypothetical protein CARUB_v10004937mg, partial [Capsella rubella] Length = 410 Score = 344 bits (883), Expect = 5e-92 Identities = 184/383 (48%), Positives = 252/383 (65%), Gaps = 6/383 (1%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + ++ + RKG+K VVLE+++ +R GA IG L NGWRALDQLGVG LR ++LI + Sbjct: 30 ATSLALHRKGIKSVVLERAEQVRSEGAGIGTLTNGWRALDQLGVGHRLRLTSLLIHKART 89 Query: 304 IWIDEGKREDLQF-LAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTL 480 + I+ GK ++ +A EARC+KR DL++ALADALP GT+RF QIV++ D P + Sbjct: 90 MLIENGKTQEFVLTIADEARCIKRNDLVEALADALPQGTIRFGSQIVSINEDQTTSFPVV 149 Query: 481 HSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLR 660 +GK I+ K LIGCDG+ S+V+D+L L P K+F+ A RG+TNY +GH F E +R++ Sbjct: 150 QLSNGKTIKAKILIGCDGANSVVSDYLQLGPRKAFSCRAVRGFTNYPNGHGFPQELLRIK 209 Query: 661 KNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGC 840 K IL 
GR+P+ +N V+WF+ + D E I L L+ + EM++ C Sbjct: 210 KGNILVGRLPLTENQVFWFLVHMQD--NHYKVEDQESIANLCLKWVDEMSQEWKEMVKIC 267 Query: 841 DLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNV 1020 +++SLS HLRYR P EI++G+F RG VTVAGDAMHVMGPFLGQGGSA +EDAVVLAR + Sbjct: 268 NVESLSLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLGQGGSAALEDAVVLARCL 327 Query: 1021 VEKVGLDG----YGREQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFIA 1188 KVG D R I + +YV+ERRMR+L LS QTYL G L+ S + + + Sbjct: 328 ARKVGPDQGDLLKDCSMRSIEEGIDEYVKERRMRLLGLSVQTYLTGRSLQTPSKVVRLMF 387 Query: 1189 VILMNILFS-DKGGHTKYDCGHL 1254 ++L+ +LF D+ HTKYDCG L Sbjct: 388 IVLLVLLFGRDQIRHTKYDCGRL 410 >ref|XP_003540567.1| PREDICTED: zeaxanthin epoxidase, chloroplastic-like [Glycine max] Length = 397 Score = 343 bits (879), Expect = 1e-91 Identities = 172/373 (46%), Positives = 250/373 (67%), Gaps = 3/373 (0%) Frame = +1 Query: 145 RKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQDIWIDEGK 324 RK +K +VLE+S++LR TGAAI V ANGWRALDQLG+G+ LRQ A+ I+GG+ I ++E + Sbjct: 27 RKRIKSLVLERSENLRATGAAIIVQANGWRALDQLGIGSTLRQTAIQIEGGRFISLNEAE 86 Query: 325 REDLQF-LAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTLHSYDGKC 501 + F + E RCLKR DL+KA+AD LP GT+R CQ+V++++DP P L +G Sbjct: 87 PMEFPFGVNQELRCLKRTDLVKAMADNLPVGTIRTNCQVVSIELDPLTHSPQLLLSNGSI 146 Query: 502 IRTKNLIGCDGSRSIVADFLGLKPSKS--FALCATRGWTNYSDGHSFSHEFVRLRKNKIL 675 ++ K +IGCDG S +A+ GL +K F+ C RG+TN+ +GH F+ EFV + + ++ Sbjct: 147 LQAKVVIGCDGVNSAIANMFGLHRTKLLLFSTCVARGFTNFPNGHQFASEFVVMSRGQVQ 206 Query: 676 FGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGCDLDSL 855 GRIP++D LVYWFV +P T D+ + +P LI+Q +++ GFP VEMI+ C L L Sbjct: 207 LGRIPVSDQLVYWFVTRPRTSKDSTIWKEPVLIRQSLIESMKGFPEGAVEMIQNCKLSFL 266 Query: 856 SFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNVVEKVG 1035 L+YR PW++++ +F +G VT+AGDAMH GPF+ QGGSA IEDA+VLAR + +K Sbjct: 267 HLTELKYRAPWDLVLNKFRKGTVTIAGDAMHATGPFIAQGGSASIEDALVLARCLAQKKF 326 Query: 1036 LDGYGREQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFIAVILMNILFS 1215 +G E +AF Y++ER+MR+ LS ++L+G L+ S + +FI + 
+M ILF Sbjct: 327 AEGMNIADAE--EAFDQYLKERKMRIFWLSLHSFLVGKKLDTKSSIVRFIILAIMAILFR 384 Query: 1216 DKGGHTKYDCGHL 1254 D H++Y CG L Sbjct: 385 DPDWHSRYHCGLL 397 >emb|CAA07574.1| monooxygenase [Arabidopsis thaliana] gi|51968448|dbj|BAD42916.1| unnamed protein product [Arabidopsis thaliana] gi|51968540|dbj|BAD42962.1| unnamed protein product [Arabidopsis thaliana] gi|51968730|dbj|BAD43057.1| unnamed protein product [Arabidopsis thaliana] gi|51968814|dbj|BAD43099.1| unnamed protein product [Arabidopsis thaliana] gi|51968850|dbj|BAD43117.1| unnamed protein product [Arabidopsis thaliana] gi|51968966|dbj|BAD43175.1| unnamed protein product [Arabidopsis thaliana] gi|51969074|dbj|BAD43229.1| unnamed protein product [Arabidopsis thaliana] gi|51969116|dbj|BAD43250.1| unnamed protein product [Arabidopsis thaliana] gi|51970812|dbj|BAD44098.1| unnamed protein product [Arabidopsis thaliana] gi|51971010|dbj|BAD44197.1| unnamed protein product [Arabidopsis thaliana] gi|51971188|dbj|BAD44286.1| unnamed protein product [Arabidopsis thaliana] gi|51971399|dbj|BAD44364.1| unnamed protein product [Arabidopsis thaliana] gi|51971599|dbj|BAD44464.1| unnamed protein product [Arabidopsis thaliana] gi|51971627|dbj|BAD44478.1| unnamed protein product [Arabidopsis thaliana] gi|51971681|dbj|BAD44505.1| unnamed protein product [Arabidopsis thaliana] gi|51971689|dbj|BAD44509.1| unnamed protein product [Arabidopsis thaliana] Length = 397 Score = 340 bits (872), Expect = 9e-91 Identities = 182/384 (47%), Positives = 253/384 (65%), Gaps = 7/384 (1%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + +I + RKG+K VVLE+++ +R GA IG L+NGWRALDQLGVG LR + LI + Sbjct: 17 ATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGVGDRLRLNSSLIHKART 76 Query: 304 IWIDEGK-REDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTL 480 + I+ GK RE + + EARC+KR DL++AL+DALP GT+RF IV+++ D + P + Sbjct: 77 MLIENGKKREFVSNIVDEARCIKRNDLVEALSDALPKGTIRFGSHIVSIEQDKTTLFPVV 136 Query: 481 
HSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLR 660 H +G I+ K LIGCDG+ SIV+D+L L P K+FA A RG+T Y +GH F E +R++ Sbjct: 137 HLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFTKYPNGHGFPQEVLRIK 196 Query: 661 KNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGC 840 + +L GR+P+ DN V+WF+ + D E I L + D EM++ C Sbjct: 197 QGNVLIGRLPLTDNQVFWFLVHMQD--NNHNGKDQESIANLCRKWADDLSEDWKEMVKIC 254 Query: 841 DLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNV 1020 +++SL+ HLRYR P EI++G+F RG VTVAGDAMHVMGPFL QGGSA +EDAVVLAR + Sbjct: 255 NVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQGGSAALEDAVVLARCL 314 Query: 1021 VEKVGLDGYGR-----EQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFI 1185 KVG D +G + I +A +YV+ERRMR+L LS QTYL G L+ +S + + + Sbjct: 315 ARKVGPD-HGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTYLTGRSLQTSSKVLRLM 373 Query: 1186 AVILMNILFS-DKGGHTKYDCGHL 1254 + L+ +LF D+ HT+YDCG L Sbjct: 374 FIALLLLLFGRDQIRHTRYDCGRL 397 >ref|NP_193311.6| monooxygenase 1 [Arabidopsis thaliana] gi|332658247|gb|AEE83647.1| monooxygenase 1 [Arabidopsis thaliana] Length = 422 Score = 340 bits (872), Expect = 9e-91 Identities = 182/384 (47%), Positives = 253/384 (65%), Gaps = 7/384 (1%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + +I + RKG+K VVLE+++ +R GA IG L+NGWRALDQLGVG LR + LI + Sbjct: 42 ATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGVGDRLRLNSSLIHKART 101 Query: 304 IWIDEGK-REDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTL 480 + I+ GK RE + + EARC+KR DL++AL+DALP GT+RF IV+++ D + P + Sbjct: 102 MLIENGKKREFVSNIVDEARCIKRNDLVEALSDALPKGTIRFGSHIVSIEQDKTTLFPVV 161 Query: 481 HSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLR 660 H +G I+ K LIGCDG+ SIV+D+L L P K+FA A RG+T Y +GH F E +R++ Sbjct: 162 HLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFTKYPNGHGFPQEVLRIK 221 Query: 661 KNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGC 840 + +L GR+P+ DN V+WF+ + D E I L + D EM++ C Sbjct: 222 QGNVLIGRLPLTDNQVFWFLVHMQD--NNHNGKDQESIANLCRKWADDLSEDWKEMVKIC 279 Query: 841 
DLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNV 1020 +++SL+ HLRYR P EI++G+F RG VTVAGDAMHVMGPFL QGGSA +EDAVVLAR + Sbjct: 280 NVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQGGSAALEDAVVLARCL 339 Query: 1021 VEKVGLDGYGR-----EQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFI 1185 KVG D +G + I +A +YV+ERRMR+L LS QTYL G L+ +S + + + Sbjct: 340 ARKVGPD-HGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTYLTGRSLQTSSKVLRLM 398 Query: 1186 AVILMNILFS-DKGGHTKYDCGHL 1254 + L+ +LF D+ HT+YDCG L Sbjct: 399 FIALLLLLFGRDQIRHTRYDCGRL 422 >dbj|BAD44358.1| unnamed protein product [Arabidopsis thaliana] Length = 397 Score = 339 bits (869), Expect = 2e-90 Identities = 182/384 (47%), Positives = 252/384 (65%), Gaps = 7/384 (1%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + +I + RKG+K VVLE+++ +R GA IG L+NGWRALDQLGVG LR + LI + Sbjct: 17 ATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGVGDRLRLNSSLIHKART 76 Query: 304 IWIDEGK-REDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTL 480 + I+ GK RE + + EARC+KR DL+ AL+DALP GT+RF IV+++ D + P + Sbjct: 77 MLIENGKKREFVSNIVDEARCIKRNDLVGALSDALPKGTIRFGSHIVSIEQDKTTLFPVV 136 Query: 481 HSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLR 660 H +G I+ K LIGCDG+ SIV+D+L L P K+FA A RG+T Y +GH F E +R++ Sbjct: 137 HLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFTKYPNGHGFPQEVLRIK 196 Query: 661 KNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGC 840 + +L GR+P+ DN V+WF+ + D E I L + D EM++ C Sbjct: 197 QGNVLIGRLPLTDNQVFWFLVHMQD--NNHNGKDQESIANLCRKWADDLSEDWKEMVKIC 254 Query: 841 DLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNV 1020 +++SL+ HLRYR P EI++G+F RG VTVAGDAMHVMGPFL QGGSA +EDAVVLAR + Sbjct: 255 NVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQGGSAALEDAVVLARCL 314 Query: 1021 VEKVGLDGYGR-----EQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFI 1185 KVG D +G + I +A +YV+ERRMR+L LS QTYL G L+ +S + + + Sbjct: 315 ARKVGPD-HGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTYLTGRSLQTSSKVLRLM 373 Query: 1186 AVILMNILFS-DKGGHTKYDCGHL 1254 + L+ +LF D+ 
HT+YDCG L Sbjct: 374 FIALLLLLFGRDQIRHTRYDCGRL 397 >dbj|BAD44627.1| unnamed protein product [Arabidopsis thaliana] gi|62318646|dbj|BAD95117.1| hypothetical protein [Arabidopsis thaliana] Length = 397 Score = 339 bits (869), Expect = 2e-90 Identities = 182/384 (47%), Positives = 253/384 (65%), Gaps = 7/384 (1%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + +I + RKG+K VVLE+++ +R GA IG L+NGWRALDQLGVG LR + LI + Sbjct: 17 ATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGVGDRLRLNSSLIHKART 76 Query: 304 IWID-EGKREDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTL 480 + I+ E KRE + + EARC+KR DL++AL+DALP GT+RF IV+++ D + P + Sbjct: 77 MLIENEKKREFVSNIVDEARCIKRNDLVEALSDALPKGTIRFGSHIVSIEQDKTTLFPVV 136 Query: 481 HSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLR 660 H +G I+ K LIGCDG+ SIV+D+L L P K+FA A RG+T Y +GH F E +R++ Sbjct: 137 HLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFTKYPNGHGFPQEVLRIK 196 Query: 661 KNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGC 840 + +L GR+P+ DN V+WF+ + D E I L + D EM++ C Sbjct: 197 QGNVLIGRLPLTDNQVFWFLVHMQD--NNHNGKDQESIANLCRKWADDLSEDWKEMVKIC 254 Query: 841 DLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNV 1020 +++SL+ HLRYR P EI++G+F RG VTVAGDAMHVMGPFL QGGSA +EDAVVLAR + Sbjct: 255 NVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQGGSAALEDAVVLARCL 314 Query: 1021 VEKVGLDGYGR-----EQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFI 1185 KVG D +G + I +A +YV+ERRMR+L LS QTYL G L+ +S + + + Sbjct: 315 ARKVGPD-HGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTYLTGRSLQTSSKVLRLM 373 Query: 1186 AVILMNILFS-DKGGHTKYDCGHL 1254 + L+ +LF D+ HT+YDCG L Sbjct: 374 FIALLLLLFGRDQIRHTRYDCGRL 397 >ref|XP_003533524.1| PREDICTED: zeaxanthin epoxidase, chloroplastic-like [Glycine max] Length = 399 Score = 339 bits (869), Expect = 2e-90 Identities = 171/373 (45%), Positives = 245/373 (65%), Gaps = 3/373 (0%) Frame = +1 Query: 145 RKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQDIWIDEGK 324 RK +K +VLE+S++LR TGAAI V ANGWRALDQLG+G+ LRQ 
A+ IQGG+ I ++E + Sbjct: 27 RKRIKSLVLERSENLRATGAAIIVHANGWRALDQLGIGSTLRQTAIQIQGGRFISLNEAE 86 Query: 325 REDLQF-LAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTLHSYDGKC 501 + F + E RCLKR DL+KA+AD LP GT+R CQ++++++DP P L +G Sbjct: 87 PMEFPFGVDQELRCLKRTDLMKAMADNLPAGTIRTNCQVLSIELDPLTRSPQLLLSNGSI 146 Query: 502 IRTKNLIGCDGSRSIVADFLGLKPSKS--FALCATRGWTNYSDGHSFSHEFVRLRKNKIL 675 ++ K +IGCDG S +A+ GL +K F+ C RG+TN+ +GH F EF + ++++ Sbjct: 147 LQAKVVIGCDGVNSAIANMFGLHRTKLLLFSTCVARGFTNFPNGHEFGSEFAMMSRDQVQ 206 Query: 676 FGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGCDLDSL 855 GRIP++D LVYWFV +P T D+ + DP LI+Q +++ GFP VE+I C L L Sbjct: 207 LGRIPVSDKLVYWFVTRPRTSKDSTIWKDPVLIRQSLIESMKGFPEGAVEIIRNCKLSFL 266 Query: 856 SFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNVVEKVG 1035 L+YR PW+++ +F +G VT+AGDAMH GPF+ QGGSA IEDA+VLAR + +K Sbjct: 267 HLTELKYRAPWDLVFNKFRKGTVTIAGDAMHATGPFIAQGGSASIEDALVLARCLAQKKA 326 Query: 1036 LDGYGREQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFIAVILMNILFS 1215 + E +AF YV+ER+MR LS ++L+G L+ S + +FI + +M ILF Sbjct: 327 EETAEINIAEAEEAFDQYVKERKMRNFWLSLHSFLVGKKLDTKSSIVRFIILAIMGILFR 386 Query: 1216 DKGGHTKYDCGHL 1254 D H++Y CG L Sbjct: 387 DPDWHSRYHCGVL 399 >dbj|BAD43328.1| unnamed protein product [Arabidopsis thaliana] Length = 397 Score = 338 bits (868), Expect = 3e-90 Identities = 181/384 (47%), Positives = 253/384 (65%), Gaps = 7/384 (1%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + +I + R+G+K VVLE+++ +R GA IG L+NGWRALDQLGVG LR + LI + Sbjct: 17 ATSIALHREGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGVGDRLRLNSSLIHKART 76 Query: 304 IWIDEGK-REDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTL 480 + I+ GK RE + + EARC+KR DL++AL+DALP GT+RF IV+++ D + P + Sbjct: 77 MLIENGKKREFVSNIVDEARCIKRNDLVEALSDALPKGTIRFGSHIVSIEQDKTTLFPVV 136 Query: 481 HSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLR 660 H +G I+ K LIGCDG+ SIV+D+L L P K+FA A RG+T Y +GH F E +R++ Sbjct: 137 HLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFTKYPNGHGFPQEVLRIK 196 
Query: 661 KNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGC 840 + +L GR+P+ DN V+WF+ + D E I L + D EM++ C Sbjct: 197 QGNVLIGRLPLTDNQVFWFLVHMQD--NNHNGKDQESIANLCRKWADDLSEDWKEMVKIC 254 Query: 841 DLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNV 1020 +++SL+ HLRYR P EI++G+F RG VTVAGDAMHVMGPFL QGGSA +EDAVVLAR + Sbjct: 255 NVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQGGSAALEDAVVLARCL 314 Query: 1021 VEKVGLDGYGR-----EQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFI 1185 KVG D +G + I +A +YV+ERRMR+L LS QTYL G L+ +S + + + Sbjct: 315 ARKVGPD-HGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTYLTGRSLQTSSKVLRLM 373 Query: 1186 AVILMNILFS-DKGGHTKYDCGHL 1254 + L+ +LF D+ HT+YDCG L Sbjct: 374 FIALLLLLFGRDQIRHTRYDCGRL 397 >dbj|BAD43227.1| unnamed protein product [Arabidopsis thaliana] Length = 397 Score = 338 bits (867), Expect = 3e-90 Identities = 181/384 (47%), Positives = 252/384 (65%), Gaps = 7/384 (1%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + +I + RKG+K VVLE+++ +R GA IG L+NGWRALDQLGVG L + LI + Sbjct: 17 ATSIALHRKGIKSVVLERAEKVRSEGAGIGTLSNGWRALDQLGVGDRLHLNSSLIHKART 76 Query: 304 IWIDEGK-REDLQFLAGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQPTL 480 + I+ GK RE + + EARC+KR DL++AL+DALP GT+RF IV+++ D + P + Sbjct: 77 MLIENGKKREFVSNIVDEARCIKRNDLVEALSDALPKGTIRFGSHIVSIEQDKTTLFPVV 136 Query: 481 HSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSHEFVRLR 660 H +G I+ K LIGCDG+ SIV+D+L L P K+FA A RG+T Y +GH F E +R++ Sbjct: 137 HLANGNSIKAKVLIGCDGANSIVSDYLQLNPKKAFACRAVRGFTKYPNGHGFPQEVLRIK 196 Query: 661 KNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMVEMIEGC 840 + +L GR+P+ DN V+WF+ + D E I L + D EM++ C Sbjct: 197 QGNVLIGRLPLTDNQVFWFLVHMQD--NNHNGKDQESIANLCRKWADDLSEDWKEMVKIC 254 Query: 841 DLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAVVLARNV 1020 +++SL+ HLRYR P EI++G+F RG VTVAGDAMHVMGPFL QGGSA +EDAVVLAR + Sbjct: 255 NVESLTLTHLRYRAPSEIMLGKFRRGTVTVAGDAMHVMGPFLAQGGSAALEDAVVLARCL 314 Query: 1021 
VEKVGLDGYGR-----EQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPMSKFI 1185 KVG D +G + I +A +YV+ERRMR+L LS QTYL G L+ +S + + + Sbjct: 315 ARKVGPD-HGDLLKDCSMKNIEEAIDEYVDERRMRLLGLSVQTYLTGRSLQTSSKVLRLM 373 Query: 1186 AVILMNILFS-DKGGHTKYDCGHL 1254 + L+ +LF D+ HT+YDCG L Sbjct: 374 FIALLLLLFGRDQIRHTRYDCGRL 397 >gb|EOY03035.1| Monooxygenase, putative isoform 1 [Theobroma cacao] Length = 413 Score = 336 bits (862), Expect = 1e-89 Identities = 174/387 (44%), Positives = 247/387 (63%), Gaps = 4/387 (1%) Frame = +1 Query: 106 FEIDSSSANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVL 285 ++ +S S IL KG++ +VLE+S++LR TGAAI V NGWRALDQLG+ ++LRQ AV Sbjct: 27 WQCNSCSGCILGGGKGIETIVLERSENLRATGAAIIVQPNGWRALDQLGIASKLRQTAVS 86 Query: 286 IQGGQDIWIDEGKREDLQFL-AGEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPP 462 IQ G+ I + +GK++DL GE RCLKR DL+ ALA+ LP TVR C++V++ +DP Sbjct: 87 IQSGRYITVKDGKQKDLPVGDVGELRCLKRTDLLNALAENLPADTVRLGCKVVSITLDPS 146 Query: 463 NMQPTLHSYDGKCIRTKNLIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFSH 642 P L DG + K +IGCDG S +A+ LGL ++ F+ RG+TNY GH F Sbjct: 147 TSYPILQLQDGSVLMAKVVIGCDGVNSTIANILGLNSTRLFSTSVIRGFTNYETGHEFGS 206 Query: 643 EFVRLRKNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDMV 822 F+ K+ + G +P+ + LVYWFV + T D+K+ LIK+ ++A GFP ++ Sbjct: 207 AFLVFSKDDVQLGLLPVTEKLVYWFVTRKQTSQDSKVSKSQTLIKESTVEAMKGFPIHIM 266 Query: 823 EMIEGCDLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDAV 1002 EM++ DLDSL LR+ PW++L RG VTVAGDAMH M PFL QGGSA +EDAV Sbjct: 267 EMVKDSDLDSLHLTDLRFLAPWDLLGTNLRRGTVTVAGDAMHAMAPFLAQGGSASLEDAV 326 Query: 1003 VLARNVVEKVGL---DGYGREQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLEATSPM 1173 VLAR + + + + + ++ A YV+ER+MRV LS +T+LIG +L+ ++ + Sbjct: 327 VLARCLSQNQTMRVDEKQAKTMMDMEAALDQYVKERKMRVFWLSLETFLIGTMLDTSTLL 386 Query: 1174 SKFIAVILMNILFSDKGGHTKYDCGHL 1254 K + +I + +LF DK HT+YDCG L Sbjct: 387 VKCLCIISLMVLFRDKIAHTRYDCGRL 413 >ref|XP_002868182.1| hypothetical protein ARALYDRAFT_355191 [Arabidopsis lyrata subsp. 
lyrata] gi|297314018|gb|EFH44441.1| hypothetical protein ARALYDRAFT_355191 [Arabidopsis lyrata subsp. lyrata] Length = 408 Score = 332 bits (851), Expect = 2e-88 Identities = 177/392 (45%), Positives = 255/392 (65%), Gaps = 15/392 (3%) Frame = +1 Query: 124 SANILICRKGLKCVVLEKSDDLRQTGAAIGVLANGWRALDQLGVGAELRQKAVLIQGGQD 303 + ++ + RKG+K +VLE+++ +R GAA G+ NGW AL QLGV +LR ++ I +D Sbjct: 17 ATSLALHRKGIKSIVLERAESVRSEGAAFGIQTNGWLALQQLGVADKLRLNSLPIHQIRD 76 Query: 304 IWIDEG--KREDLQFLA-GEARCLKRVDLIKALADALPPGTVRFRCQIVTVKMDPPNMQP 474 + I++G +RE + + GE R + R DL++ALA ALP GT+R C I++VK+D P Sbjct: 77 VLIEKGIKQRESVGPASYGEVRGVLRNDLVRALAHALPLGTLRLGCHILSVKLDETTSFP 136 Query: 475 TLHSYDGKCIRTKN-----LIGCDGSRSIVADFLGLKPSKSFALCATRGWTNYSDGHSFS 639 +H +G+ I+ K LIGCDGS S+V+ FLGL P+K A RG+TNY D H F Sbjct: 137 IVHVKNGEAIKAKARLATVLIGCDGSNSVVSRFLGLNPTKDLGSRAVRGFTNYPDDHGFR 196 Query: 640 HEFVRLRKNKILFGRIPINDNLVYWFVAQPSTPLDAKLPNDPELIKQLALQATIGFPTDM 819 EF+R++ + ++ GRIPI LV+WFV + P D+ + I +L L + F + Sbjct: 197 QEFIRIKMDNVVSGRIPITHKLVFWFVVLLNCPQDSSFLRNQADIARLTLASVHEFSEEW 256 Query: 820 VEMIEGCDLDSLSFAHLRYRPPWEILVGRFCRGPVTVAGDAMHVMGPFLGQGGSAGIEDA 999 EM++ CD+DSL LRYR PW++L G+F G VTVAGD+MH+MGPF+GQG SA +ED Sbjct: 257 KEMVKNCDMDSLYINRLRYRAPWDVLSGKFRCGTVTVAGDSMHLMGPFIGQGCSAALEDG 316 Query: 1000 VVLARNVVEK--VGLDG-----YGREQREIGKAFRDYVEERRMRVLRLSAQTYLIGLLLE 1158 VVLAR + K +G DG Y + +I +A +Y+ ERR R++ LS QTYL G L++ Sbjct: 317 VVLARCLWRKLSLGQDGMNNVSYSSSRMQIEEAIDEYIRERRGRLVGLSTQTYLTGNLIK 376 Query: 1159 ATSPMSKFIAVILMNILFSDKGGHTKYDCGHL 1254 A+SP++KF+ V+L+ ILF D+ GHT+YDCG L Sbjct: 377 ASSPVTKFLLVVLLMILFRDQIGHTRYDCGRL 408