BLASTX nr result

ID: Cocculus23_contig00002001 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00002001
         (1559 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006494848.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   834   0.0  
ref|XP_002285298.1| PREDICTED: homogentisate 1,2-dioxygenase [Vi...   828   0.0  
gb|EXB75014.1| Homogentisate 1,2-dioxygenase [Morus notabilis]        826   0.0  
ref|XP_004251883.1| PREDICTED: LOW QUALITY PROTEIN: homogentisat...   811   0.0  
gb|AAF73132.1|AF149017_1 homogentisate 1,2-dioxygenase [Solanum ...   811   0.0  
ref|XP_007021635.1| Homogentisate 1,2-dioxygenase isoform 1 [The...   811   0.0  
ref|XP_006358956.1| PREDICTED: LOW QUALITY PROTEIN: homogentisat...   810   0.0  
ref|XP_007213873.1| hypothetical protein PRUPE_ppa005219mg [Prun...   810   0.0  
gb|EYU44990.1| hypothetical protein MIMGU_mgv1a006018mg [Mimulus...   809   0.0  
ref|XP_002518387.1| homogentisate 1,2-dioxygenase, putative [Ric...   809   0.0  
dbj|BAO57290.1| homogentisate 1,2-dioxygenase [Ipomoea nil]           806   0.0  
ref|XP_004137214.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   800   0.0  
ref|XP_006858313.1| hypothetical protein AMTR_s00064p00100410 [A...   794   0.0  
ref|XP_004510037.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   778   0.0  
gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]   778   0.0  
ref|XP_002864301.1| homogentisate 1,2-dioxygenase [Arabidopsis l...   777   0.0  
ref|XP_007133067.1| hypothetical protein PHAVU_011G148800g [Phas...   776   0.0  
ref|XP_006280403.1| hypothetical protein CARUB_v10026329mg [Caps...   775   0.0  
ref|NP_200219.1| homogentisate 1,2-dioxygenase [Arabidopsis thal...   774   0.0  
ref|XP_007021636.1| Homogentisate 1,2-dioxygenase isoform 2, par...   773   0.0  

>ref|XP_006494848.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Citrus sinensis]
          Length = 478

 Score =  834 bits (2155), Expect = 0.0
 Identities = 385/453 (84%), Positives = 421/453 (92%), Gaps = 1/453 (0%)
 Frame = +1

Query: 70   KTNGSDSPAELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKL 249
            KT+G D  ++L Y SGFGN FSSEAI GALPRGQN+P+ CP+GLYAEQISGTSFTSPRKL
Sbjct: 15   KTDGEDF-SDLNYESGFGNSFSSEAIDGALPRGQNSPLVCPFGLYAEQISGTSFTSPRKL 73

Query: 250  NQRSWLYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFID 429
            NQRSWLYRIKPS THEPFKP  P+H KLVSEF+++NS+ TPTQLRW+P ++PDSPTDFID
Sbjct: 74   NQRSWLYRIKPSATHEPFKPRVPAHGKLVSEFDKSNSYTTPTQLRWKPVDIPDSPTDFID 133

Query: 430  GLYTICGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQV 609
            GLYTICGAGSSFLRHGYAIHMY ANKSM+NCA+CNADGDFLVVPQKGRLWI TECGKL+V
Sbjct: 134  GLYTICGAGSSFLRHGYAIHMYTANKSMDNCAFCNADGDFLVVPQKGRLWIATECGKLEV 193

Query: 610  SPGEVVVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAW 789
            SPGE+ VLPQGFRF V+LPDGPS GY+AEIFGTHFQLPDLGPIGANGLAA RDFLVPTAW
Sbjct: 194  SPGEIAVLPQGFRFAVSLPDGPSRGYIAEIFGTHFQLPDLGPIGANGLAAPRDFLVPTAW 253

Query: 790  FEQGPRPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGD 969
            FE+G R GYTIVQKFGGELFTA QDFSPFNVV+WHGNYVPYKYDLSKFCPFNTVL+DHGD
Sbjct: 254  FEEGSRLGYTIVQKFGGELFTARQDFSPFNVVAWHGNYVPYKYDLSKFCPFNTVLVDHGD 313

Query: 970  PSVNTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKS 1149
            PS+NTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI G YEAK+
Sbjct: 314  PSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIRGGYEAKA 373

Query: 1150 DGFLPGGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWAL 1329
            DGFLPGGASLHSCMTPHGPDTKT+EAT+ +G+  GP++ITDTMAFMFES LIPRICPWAL
Sbjct: 374  DGFLPGGASLHSCMTPHGPDTKTYEATIARGSEAGPYKITDTMAFMFESCLIPRICPWAL 433

Query: 1330 ESPYLDHNYYQCWIGLRSHFSYEEA-NCNTGEL 1425
            ESP++DH+YY+CWIGLRSHFSYEEA N + GEL
Sbjct: 434  ESPFMDHDYYRCWIGLRSHFSYEEADNESDGEL 466


>ref|XP_002285298.1| PREDICTED: homogentisate 1,2-dioxygenase [Vitis vinifera]
            gi|302142933|emb|CBI20228.3| unnamed protein product
            [Vitis vinifera]
          Length = 463

 Score =  828 bits (2140), Expect = 0.0
 Identities = 385/461 (83%), Positives = 416/461 (90%)
 Frame = +1

Query: 49   MEIQSTWKTNGSDSPAELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTS 228
            M+ QS  KTN SD P++L+Y  GFGN  SSEAIAGALPRGQNNP+ CP+GLYAEQISGT 
Sbjct: 1    MDGQSVSKTNDSDPPSDLQYQFGFGNHLSSEAIAGALPRGQNNPLTCPFGLYAEQISGTP 60

Query: 229  FTSPRKLNQRSWLYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPD 408
            FT+PRK NQ SWLYRIKPSVTHEPFKP  PSH KLVSEFNQ+NS   PTQLRW+P E+PD
Sbjct: 61   FTAPRKQNQFSWLYRIKPSVTHEPFKPRVPSHGKLVSEFNQSNSSTNPTQLRWKPVEIPD 120

Query: 409  SPTDFIDGLYTICGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITT 588
            SPTDFIDGLYT+CGAGSSFLRHGYAIHMY ANKSM+NCA+CNADGDFL+VPQKGRL ITT
Sbjct: 121  SPTDFIDGLYTVCGAGSSFLRHGYAIHMYTANKSMDNCAFCNADGDFLIVPQKGRLSITT 180

Query: 589  ECGKLQVSPGEVVVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRD 768
            ECGKLQVSPGE+VVLP GFRF V+LPDGPS GYVAEIFG HFQLPDLGPIGANGLAASRD
Sbjct: 181  ECGKLQVSPGEIVVLPHGFRFVVDLPDGPSRGYVAEIFGAHFQLPDLGPIGANGLAASRD 240

Query: 769  FLVPTAWFEQGPRPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNT 948
            FLVP AW+E+  RPGYTIVQKFGGELFTA QDFSPFNVV+WHGNYVPYKYDLSKFCP NT
Sbjct: 241  FLVPVAWYEECSRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPVNT 300

Query: 949  VLIDHGDPSVNTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY 1128
            VL DH DPS+NTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY
Sbjct: 301  VLKDHADPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY 360

Query: 1129 GSYEAKSDGFLPGGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIP 1308
            G YEAK+DGFLPGGASLHSCMTPHGPDTKTFEATV  G + GPFRIT+TMAFMFES LIP
Sbjct: 361  GGYEAKADGFLPGGASLHSCMTPHGPDTKTFEATVAHGKDAGPFRITNTMAFMFESCLIP 420

Query: 1309 RICPWALESPYLDHNYYQCWIGLRSHFSYEEANCNTGELQD 1431
            RICPWAL+SP +DH+YYQCW+GLRSHFS EEA+  +  +Q+
Sbjct: 421  RICPWALDSPSIDHDYYQCWVGLRSHFSREEASDESQTIQN 461


>gb|EXB75014.1| Homogentisate 1,2-dioxygenase [Morus notabilis]
          Length = 460

 Score =  826 bits (2134), Expect = 0.0
 Identities = 372/452 (82%), Positives = 414/452 (91%)
 Frame = +1

Query: 85   DSPAELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRSW 264
            +S  EL Y SG+GN FSSEA+AGALP GQN+P+ CPY LYAEQISGTSFTSPRKLN RSW
Sbjct: 5    NSLEELSYQSGYGNSFSSEALAGALPHGQNSPLLCPYSLYAEQISGTSFTSPRKLNLRSW 64

Query: 265  LYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYTI 444
            LYRIKPSVTHEPFKP  PSH KL+SEF+++NS ATPTQLRW+P E+PDSPTDF+DGL+T+
Sbjct: 65   LYRIKPSVTHEPFKPRVPSHGKLLSEFDRSNSSATPTQLRWKPVEIPDSPTDFVDGLFTV 124

Query: 445  CGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGEV 624
            CGAGSSFLRHG+A+HMY ANKSM+NCA+CNADGDFL+VPQKGRLWITTECGKLQVSPGEV
Sbjct: 125  CGAGSSFLRHGFAVHMYTANKSMDNCAFCNADGDFLIVPQKGRLWITTECGKLQVSPGEV 184

Query: 625  VVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQGP 804
             +LPQGFRF V+LPDGPS GYVAEIFG HFQLPDLGPIGANGLAA RDFL PTAWFE G 
Sbjct: 185  AILPQGFRFAVDLPDGPSRGYVAEIFGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEDGR 244

Query: 805  RPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVNT 984
            RPGYTIVQKFGGELFTA QDFSPFNVV+WHGN+VPYKYDLSKFCP+NTVL+DH DPS+NT
Sbjct: 245  RPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNHVPYKYDLSKFCPYNTVLVDHSDPSINT 304

Query: 985  VLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFLP 1164
            VLTAPTDKPGVALLDFV+FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAK+DGFLP
Sbjct: 305  VLTAPTDKPGVALLDFVVFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLP 364

Query: 1165 GGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPYL 1344
            GG+SLHSCMTPHGPDTKT+EAT+ +GN PGPFRI DTMAFMFES L+PR+C WALESP++
Sbjct: 365  GGSSLHSCMTPHGPDTKTYEATIARGNEPGPFRIKDTMAFMFESCLMPRVCAWALESPFM 424

Query: 1345 DHNYYQCWIGLRSHFSYEEANCNTGELQDVDG 1440
            DH+YYQCWIGLRSHF++E  N  + +  +VDG
Sbjct: 425  DHDYYQCWIGLRSHFTWESRNATSKDDNEVDG 456


>ref|XP_004251883.1| PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase
            [Solanum lycopersicum]
          Length = 480

 Score =  811 bits (2096), Expect = 0.0
 Identities = 369/449 (82%), Positives = 409/449 (91%)
 Frame = +1

Query: 82   SDSPAELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRS 261
            S+ P++LEY +GFGN FSSEAI GALP+GQN+P+ CP+GLYAEQISGTSFTSPRKLNQRS
Sbjct: 8    SNFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRS 67

Query: 262  WLYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYT 441
            WLYRIKPSVTHEPF+P  P H+KLVSEFNQ+NS ATPTQLRW+P E+P++PTDFIDGLYT
Sbjct: 68   WLYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYT 127

Query: 442  ICGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGE 621
            ICGAGSS+LRHG+AIHMY ANKSMEN A+CNADGDFL+VPQKGRLWITTECG+LQV PGE
Sbjct: 128  ICGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGE 187

Query: 622  VVVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQG 801
            +V+LPQG+RF V+LPDGPS GYVAE FGTH QLPDLGPIGANGLAA RDFLVP AW+  G
Sbjct: 188  IVILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYGDG 247

Query: 802  PRPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVN 981
             RPGYTIVQK+GGELFTA QDFSPFNVV+WHGNYVPYKYDLSKFCP+NTVL+DH DPS+N
Sbjct: 248  SRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSIN 307

Query: 982  TVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFL 1161
            TVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAK+DGF 
Sbjct: 308  TVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFH 367

Query: 1162 PGGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPY 1341
            PGGASLHSCMTPHGPDTKTFEAT+  GN  GP RI DTMAFMFES L+PR+CPWALESP+
Sbjct: 368  PGGASLHSCMTPHGPDTKTFEATIALGNEAGPHRIADTMAFMFESCLVPRVCPWALESPF 427

Query: 1342 LDHNYYQCWIGLRSHFSYEEANCNTGELQ 1428
            +DH+YYQCWIGL+SHFS    N +  +LQ
Sbjct: 428  MDHDYYQCWIGLKSHFSGLSMNEDNVDLQ 456


>gb|AAF73132.1|AF149017_1 homogentisate 1,2-dioxygenase [Solanum lycopersicum]
          Length = 477

 Score =  811 bits (2096), Expect = 0.0
 Identities = 369/449 (82%), Positives = 409/449 (91%)
 Frame = +1

Query: 82   SDSPAELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRS 261
            S+ P++LEY +GFGN FSSEAI GALP+GQN+P+ CP+GLYAEQISGTSFTSPRKLNQRS
Sbjct: 5    SNFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRS 64

Query: 262  WLYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYT 441
            WLYRIKPSVTHEPF+P  P H+KLVSEFNQ+NS ATPTQLRW+P E+P++PTDFIDGLYT
Sbjct: 65   WLYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYT 124

Query: 442  ICGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGE 621
            ICGAGSS+LRHG+AIHMY ANKSMEN A+CNADGDFL+VPQKGRLWITTECG+LQV PGE
Sbjct: 125  ICGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGE 184

Query: 622  VVVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQG 801
            +V+LPQG+RF V+LPDGPS GYVAE FGTH QLPDLGPIGANGLAA RDFLVP AW+  G
Sbjct: 185  IVILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYGDG 244

Query: 802  PRPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVN 981
             RPGYTIVQK+GGELFTA QDFSPFNVV+WHGNYVPYKYDLSKFCP+NTVL+DH DPS+N
Sbjct: 245  SRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSIN 304

Query: 982  TVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFL 1161
            TVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAK+DGF 
Sbjct: 305  TVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFH 364

Query: 1162 PGGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPY 1341
            PGGASLHSCMTPHGPDTKTFEAT+  GN  GP RI DTMAFMFES L+PR+CPWALESP+
Sbjct: 365  PGGASLHSCMTPHGPDTKTFEATIALGNEAGPHRIADTMAFMFESCLVPRVCPWALESPF 424

Query: 1342 LDHNYYQCWIGLRSHFSYEEANCNTGELQ 1428
            +DH+YYQCWIGL+SHFS    N +  +LQ
Sbjct: 425  MDHDYYQCWIGLKSHFSGLSMNEDNVDLQ 453


>ref|XP_007021635.1| Homogentisate 1,2-dioxygenase isoform 1 [Theobroma cacao]
            gi|508721263|gb|EOY13160.1| Homogentisate 1,2-dioxygenase
            isoform 1 [Theobroma cacao]
          Length = 451

 Score =  811 bits (2095), Expect = 0.0
 Identities = 372/444 (83%), Positives = 407/444 (91%), Gaps = 1/444 (0%)
 Frame = +1

Query: 64   TWKTNGSDS-PAELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSP 240
            T K NG    P +LEY SGFGN FSSEAIAGALPRGQN+P+ CP+GLYAEQISGTSFTSP
Sbjct: 8    TTKGNGLGVFPEDLEYQSGFGNHFSSEAIAGALPRGQNSPLICPFGLYAEQISGTSFTSP 67

Query: 241  RKLNQRSWLYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTD 420
            RKLNQRSWLYRIKPSVTHEPF P   SHKKLVSEF+ +N+ A PTQLRW+P ++PD+PTD
Sbjct: 68   RKLNQRSWLYRIKPSVTHEPFWPRDSSHKKLVSEFDGSNTVANPTQLRWKPVDIPDTPTD 127

Query: 421  FIDGLYTICGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGK 600
            FIDGL+TICGAGSSFLRHGYAIHMY ANKSM+NCA+CNADGDFLVVPQ+GRLWITTECG+
Sbjct: 128  FIDGLFTICGAGSSFLRHGYAIHMYTANKSMDNCAFCNADGDFLVVPQQGRLWITTECGR 187

Query: 601  LQVSPGEVVVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVP 780
            LQVSPGE+ VLPQGFRF V+LPDGPS GYVAE+FGTHFQLPDLGPIGANGLAASRDFL P
Sbjct: 188  LQVSPGEIAVLPQGFRFVVDLPDGPSRGYVAEVFGTHFQLPDLGPIGANGLAASRDFLAP 247

Query: 781  TAWFEQGPRPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLID 960
            TAWFE+ PRPG+TIVQKFGGELF A QDFSPFNVV+WHGNYVPYKYDLSKFCP+NTVL+D
Sbjct: 248  TAWFEEHPRPGFTIVQKFGGELFNARQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLVD 307

Query: 961  HGDPSVNTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYE 1140
            HGDPS+NTVLTAPTDKPGVALLDFVIFP RWLVAEHTFRPPYYHRNCMSEFMGLIYG YE
Sbjct: 308  HGDPSINTVLTAPTDKPGVALLDFVIFPSRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYE 367

Query: 1141 AKSDGFLPGGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICP 1320
            AK+DGFLPGGASLHSCMTPHGPDTKT+EAT+ +G   GP +ITDTMAFMFES L+PR CP
Sbjct: 368  AKADGFLPGGASLHSCMTPHGPDTKTYEATIARGYEAGPHKITDTMAFMFESFLMPRTCP 427

Query: 1321 WALESPYLDHNYYQCWIGLRSHFS 1392
            W LESP+ DH+YYQCW+GL+SHFS
Sbjct: 428  WVLESPFRDHDYYQCWVGLKSHFS 451


>ref|XP_006358956.1| PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase-like
            [Solanum tuberosum]
          Length = 492

 Score =  810 bits (2093), Expect = 0.0
 Identities = 369/452 (81%), Positives = 410/452 (90%)
 Frame = +1

Query: 73   TNGSDSPAELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLN 252
            +  S+ P++LEY +GFGN FSSEAI GALP+GQN+P+ CP+GLYAEQISGTSFTSPRKLN
Sbjct: 5    SRSSNFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLN 64

Query: 253  QRSWLYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDG 432
            QRSWLYRIKPSVTHEPF+P  P H+KLVSEFNQ+NS ATPTQLRW+P E+P++PTDFIDG
Sbjct: 65   QRSWLYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDG 124

Query: 433  LYTICGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVS 612
            LYTICGAGSS+LRHG+AIHMY ANKSMEN A+CNADGDFL+VPQKGRLWITTECG+LQV 
Sbjct: 125  LYTICGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVC 184

Query: 613  PGEVVVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWF 792
            PGE+V+LPQG+RF V+LPDGPS GYVAE FGTH QLPDLGPIGANGLAA RDFLVP AW+
Sbjct: 185  PGEIVILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWY 244

Query: 793  EQGPRPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDP 972
            E G RPGYTIVQK+GGELFTA QDFSPFNVV+WHGNYVPYKYDLSKFCP+NTVL+DH DP
Sbjct: 245  EDGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDP 304

Query: 973  SVNTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSD 1152
            S+NTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI G YEAK+D
Sbjct: 305  SINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLINGGYEAKAD 364

Query: 1153 GFLPGGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALE 1332
            GF PGGASLHSCMTPHGPDTKT+EAT+  GN  GP RI DTMAFMFES LIPR+CPWALE
Sbjct: 365  GFHPGGASLHSCMTPHGPDTKTYEATIALGNEAGPHRIADTMAFMFESCLIPRVCPWALE 424

Query: 1333 SPYLDHNYYQCWIGLRSHFSYEEANCNTGELQ 1428
            SP++DH+YYQCWIGL+SHFS    N +  +LQ
Sbjct: 425  SPFMDHDYYQCWIGLKSHFSGLSMNEDNVDLQ 456


>ref|XP_007213873.1| hypothetical protein PRUPE_ppa005219mg [Prunus persica]
            gi|462409738|gb|EMJ15072.1| hypothetical protein
            PRUPE_ppa005219mg [Prunus persica]
          Length = 472

 Score =  810 bits (2092), Expect = 0.0
 Identities = 366/462 (79%), Positives = 412/462 (89%), Gaps = 4/462 (0%)
 Frame = +1

Query: 58   QSTWKTNGSDSPA----ELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGT 225
            Q   KTNG DS +    +L+Y SGF N FSSEA+ G LP GQ++P+ CPYGLYAEQISGT
Sbjct: 6    QPVIKTNGPDSSSSSFSDLQYQSGFHNHFSSEALPGTLPHGQSSPLLCPYGLYAEQISGT 65

Query: 226  SFTSPRKLNQRSWLYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVP 405
            SFTSPRKLN R+WLYR+KPSVTHEPFKPL  SH+KLVSEF  +NS  TPTQLRW+P ++P
Sbjct: 66   SFTSPRKLNHRTWLYRVKPSVTHEPFKPLESSHRKLVSEFTDSNSSTTPTQLRWKPVDIP 125

Query: 406  DSPTDFIDGLYTICGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWIT 585
            ++PTDF++GLYT+CGAGSSFLRHG+AIHMY ANKSM+NCA+CNADGDFL+VPQ GRLWIT
Sbjct: 126  ETPTDFVEGLYTVCGAGSSFLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPQTGRLWIT 185

Query: 586  TECGKLQVSPGEVVVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASR 765
            TECGKLQ+SPGE+ VLPQGFRF V+LPDGPS GYVAE+FGTHFQLPDLGPIGANGLAA R
Sbjct: 186  TECGKLQISPGEIAVLPQGFRFAVDLPDGPSRGYVAEVFGTHFQLPDLGPIGANGLAAPR 245

Query: 766  DFLVPTAWFEQGPRPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFN 945
            DFLVPTAWFE   RPGY I+QKFGGELFTA Q+FSPFNVV+WHGNY PYKYDL+ FCPFN
Sbjct: 246  DFLVPTAWFEDSYRPGYVIIQKFGGELFTAKQEFSPFNVVAWHGNYAPYKYDLTTFCPFN 305

Query: 946  TVLIDHGDPSVNTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI 1125
            TVL DHGDPS+NTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI
Sbjct: 306  TVLFDHGDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI 365

Query: 1126 YGSYEAKSDGFLPGGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLI 1305
            YG YEAK+DGFLPGGASLHSCMTPHGPDTKT+EAT+ +GN  GP RI+DT+AFMFES LI
Sbjct: 366  YGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIARGNEAGPSRISDTLAFMFESCLI 425

Query: 1306 PRICPWALESPYLDHNYYQCWIGLRSHFSYEEANCNTGELQD 1431
            PRICPWALESP++D +YYQCWIGLRSHF+ E A+   G++Q+
Sbjct: 426  PRICPWALESPFIDRDYYQCWIGLRSHFTREGASAKDGDIQN 467


>gb|EYU44990.1| hypothetical protein MIMGU_mgv1a006018mg [Mimulus guttatus]
          Length = 461

 Score =  809 bits (2089), Expect = 0.0
 Identities = 366/437 (83%), Positives = 409/437 (93%)
 Frame = +1

Query: 91   PAELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRSWLY 270
            PA L+Y SGFGNDFS+EAI GALPRGQN+P+ CPYGLYAEQISGTSFTSPRK NQRSWLY
Sbjct: 13   PAVLDYQSGFGNDFSTEAIPGALPRGQNSPLVCPYGLYAEQISGTSFTSPRKHNQRSWLY 72

Query: 271  RIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYTICG 450
            R+KPSVTHEPF+P  P+H KLVSEFNQ+NS ATPTQLRWRPAE+P++PTDF+DGLYT+CG
Sbjct: 73   RVKPSVTHEPFRPRIPNHVKLVSEFNQSNSAATPTQLRWRPAEIPETPTDFVDGLYTVCG 132

Query: 451  AGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGEVVV 630
            AGSS+LRHG+AIHMY+ANKSM++CA+C+ADGDFL+VPQ+GRL ITTECG+L+VSPGE+VV
Sbjct: 133  AGSSYLRHGFAIHMYSANKSMKDCAFCSADGDFLIVPQEGRLRITTECGRLEVSPGEIVV 192

Query: 631  LPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQGPRP 810
            +PQG RF V+LPDGPS GYVAEIFGTHFQLPDLGPIGANGLAASRDFLVP AWFE    P
Sbjct: 193  IPQGLRFAVDLPDGPSRGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPVAWFEDISIP 252

Query: 811  GYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVNTVL 990
            GYT+VQKFGGELFTA QDFSPFNVV+WHGNY PYKYDLSKFCP+NTVLIDHGDPS+NTVL
Sbjct: 253  GYTVVQKFGGELFTAKQDFSPFNVVAWHGNYAPYKYDLSKFCPYNTVLIDHGDPSINTVL 312

Query: 991  TAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFLPGG 1170
            TAPTD+PGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAK+DGFLPGG
Sbjct: 313  TAPTDRPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGG 372

Query: 1171 ASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPYLDH 1350
            ASLHSCMTPHGPDTKT+EAT+  GN  GP RI+DTMAFMFES L+PR+CPWALESPY+DH
Sbjct: 373  ASLHSCMTPHGPDTKTYEATIALGNEAGPKRISDTMAFMFESCLMPRVCPWALESPYMDH 432

Query: 1351 NYYQCWIGLRSHFSYEE 1401
            +YYQCWIGL+SHF+ +E
Sbjct: 433  DYYQCWIGLKSHFTGDE 449


>ref|XP_002518387.1| homogentisate 1,2-dioxygenase, putative [Ricinus communis]
            gi|223542482|gb|EEF44023.1| homogentisate
            1,2-dioxygenase, putative [Ricinus communis]
          Length = 457

 Score =  809 bits (2089), Expect = 0.0
 Identities = 375/452 (82%), Positives = 414/452 (91%), Gaps = 2/452 (0%)
 Frame = +1

Query: 70   KTNGSDSPAE-LEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRK 246
            K+NG D P++  +YLSGFGN F SEAI GALPRGQN+P+ CPYGLYAEQISG+SFTSPRK
Sbjct: 5    KSNGRDFPSDDHDYLSGFGNTFESEAIHGALPRGQNSPLICPYGLYAEQISGSSFTSPRK 64

Query: 247  LNQRSWLYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFAT-PTQLRWRPAEVPDSPTDF 423
            L+QRSWLYRIKPSVTHEPFKP  PSH K+VSEF++T+S  T PTQLRW+P ++PDSPTDF
Sbjct: 65   LSQRSWLYRIKPSVTHEPFKPRVPSHGKIVSEFDKTDSCTTTPTQLRWKPVDIPDSPTDF 124

Query: 424  IDGLYTICGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKL 603
            IDGL+TICGAGSSFLRHG+AIHMY ANKSM NCA CNADGDFLVVPQ+GRLWITTECGKL
Sbjct: 125  IDGLFTICGAGSSFLRHGFAIHMYTANKSMGNCALCNADGDFLVVPQEGRLWITTECGKL 184

Query: 604  QVSPGEVVVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPT 783
            QVSPGEVVVLPQGFRF V+LPDGPS GYVAEIFGTHFQLPDLGPIGANGLAA RDFLVP 
Sbjct: 185  QVSPGEVVVLPQGFRFAVDLPDGPSRGYVAEIFGTHFQLPDLGPIGANGLAAPRDFLVPK 244

Query: 784  AWFEQGPRPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDH 963
            AW+E+GP PGYTI+QKFGGELFTA QDFSPFNVV+WHGN+VPYKYDL KFCP+NTVLIDH
Sbjct: 245  AWYEEGPCPGYTIIQKFGGELFTAKQDFSPFNVVAWHGNFVPYKYDLKKFCPYNTVLIDH 304

Query: 964  GDPSVNTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEA 1143
             DPS+NTVLTA TDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEA
Sbjct: 305  SDPSINTVLTASTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEA 364

Query: 1144 KSDGFLPGGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPW 1323
            K+DGF+PGGASLHSCMTPHGPDTKT+EAT+ +GN+ GP RITDTMAFMFES LIPRIC W
Sbjct: 365  KADGFVPGGASLHSCMTPHGPDTKTYEATIARGNDAGPSRITDTMAFMFESCLIPRICLW 424

Query: 1324 ALESPYLDHNYYQCWIGLRSHFSYEEANCNTG 1419
            A+ESP++DH+YYQCWIGL+SHFS+   + N G
Sbjct: 425  AVESPFIDHDYYQCWIGLKSHFSHGADSKNGG 456


>dbj|BAO57290.1| homogentisate 1,2-dioxygenase [Ipomoea nil]
          Length = 464

 Score =  806 bits (2082), Expect = 0.0
 Identities = 366/463 (79%), Positives = 410/463 (88%)
 Frame = +1

Query: 49   MEIQSTWKTNGSDSPAELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTS 228
            ME +    T     PA+LEY SGFGN FSSEAIAGALPRGQN+P+ CP GLYAEQISGTS
Sbjct: 1    MESKDDNVTGSPKFPADLEYQSGFGNHFSSEAIAGALPRGQNSPLVCPLGLYAEQISGTS 60

Query: 229  FTSPRKLNQRSWLYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPD 408
            FTSPRKLNQ SW YR+KPSVTHEPFKP  P+H++LVSEFNQ+NS ATPTQLRW+P ++P+
Sbjct: 61   FTSPRKLNQLSWFYRVKPSVTHEPFKPRVPTHERLVSEFNQSNSSATPTQLRWKPVDIPE 120

Query: 409  SPTDFIDGLYTICGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITT 588
            +PTDFIDGLYT+CGAGSS+LRHG+AIHMY ANKSM+NCA+CNADGDFL+VPQKGRLWITT
Sbjct: 121  TPTDFIDGLYTVCGAGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPQKGRLWITT 180

Query: 589  ECGKLQVSPGEVVVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRD 768
            ECG+LQV+PGE+VV+PQGFRF V+LPDG S GYV EIFGTHFQLPDLGPIGANGLAA RD
Sbjct: 181  ECGRLQVNPGEIVVIPQGFRFAVDLPDGESRGYVGEIFGTHFQLPDLGPIGANGLAAPRD 240

Query: 769  FLVPTAWFEQGPRPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNT 948
            FLVP AWF+    PGYTIVQKFGGELFTA Q+FSPFNVV+WHGNY PYKYDLSKFCP+NT
Sbjct: 241  FLVPVAWFDDSCHPGYTIVQKFGGELFTAKQEFSPFNVVAWHGNYAPYKYDLSKFCPYNT 300

Query: 949  VLIDHGDPSVNTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY 1128
            VL DH DPS+NTVLTAPTDKPGVAL+DFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY
Sbjct: 301  VLFDHSDPSINTVLTAPTDKPGVALMDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY 360

Query: 1129 GSYEAKSDGFLPGGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIP 1308
            G YEAK+DGFLPGGASLHSCMTPHGPDTKT+E T+  GN  GP RIT TMAFMFES L+P
Sbjct: 361  GGYEAKADGFLPGGASLHSCMTPHGPDTKTYEKTIALGNEAGPHRITGTMAFMFESCLVP 420

Query: 1309 RICPWALESPYLDHNYYQCWIGLRSHFSYEEANCNTGELQDVD 1437
            R+CPWALESP++DH+YYQCWIGL+SHF+    +    EL++ D
Sbjct: 421  RVCPWALESPFIDHDYYQCWIGLKSHFTGGSTDEGNKELENGD 463


>ref|XP_004137214.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Cucumis sativus]
            gi|449524824|ref|XP_004169421.1| PREDICTED: homogentisate
            1,2-dioxygenase-like [Cucumis sativus]
          Length = 471

 Score =  800 bits (2065), Expect = 0.0
 Identities = 361/450 (80%), Positives = 405/450 (90%)
 Frame = +1

Query: 49   MEIQSTWKTNGSDSPAELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTS 228
            M  QS  +T+G+D P++L YLSGF N FSSEAI GALP+ QN+P+ CP+GLYAEQISGTS
Sbjct: 1    MAAQSVGETDGTDFPSDLPYLSGFNNHFSSEAIPGALPQSQNSPLICPFGLYAEQISGTS 60

Query: 229  FTSPRKLNQRSWLYRIKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPD 408
            FTSPRK N  SWLYRIKPSVTHEPF+   P ++KL+SEFN +N  +TPTQLRW+PA+ PD
Sbjct: 61   FTSPRKANLCSWLYRIKPSVTHEPFRQRLPKNEKLISEFNASNCSSTPTQLRWKPADFPD 120

Query: 409  SPTDFIDGLYTICGAGSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITT 588
            SP DF+DGLYT+CGAGSSFLRHG+AIHMY ANKSMENCA+CNADGDFL+VPQ G+LWI T
Sbjct: 121  SPVDFVDGLYTVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGKLWIIT 180

Query: 589  ECGKLQVSPGEVVVLPQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRD 768
            ECG+L+VSPGEVVVLPQGFRF V LPDGPS GYVAEIFG+HFQLPDLGPIGANGLAA RD
Sbjct: 181  ECGRLEVSPGEVVVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRD 240

Query: 769  FLVPTAWFEQGPRPGYTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNT 948
            FL P AWFE  PRPGYTI+QKFGGELFTA+QDFSPFNVV+WHGNYVPYKYDL KFCP+NT
Sbjct: 241  FLAPVAWFENSPRPGYTIIQKFGGELFTAIQDFSPFNVVAWHGNYVPYKYDLCKFCPYNT 300

Query: 949  VLIDHGDPSVNTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY 1128
            VL DH DPS+NTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY
Sbjct: 301  VLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIY 360

Query: 1129 GSYEAKSDGFLPGGASLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIP 1308
            G YEAK+DGF+PGGASLHSCMTPHGPDTKT+EAT+ +GN+ GP +I+ TMAFMFESSLIP
Sbjct: 361  GGYEAKADGFVPGGASLHSCMTPHGPDTKTYEATIARGNDAGPHKISGTMAFMFESSLIP 420

Query: 1309 RICPWALESPYLDHNYYQCWIGLRSHFSYE 1398
            R+C WALESP++DH+YYQCWIGL+SHF  E
Sbjct: 421  RVCSWALESPFIDHDYYQCWIGLKSHFKNE 450


>ref|XP_006858313.1| hypothetical protein AMTR_s00064p00100410 [Amborella trichopoda]
            gi|548862420|gb|ERN19780.1| hypothetical protein
            AMTR_s00064p00100410 [Amborella trichopoda]
          Length = 471

 Score =  794 bits (2051), Expect = 0.0
 Identities = 368/456 (80%), Positives = 409/456 (89%)
 Frame = +1

Query: 94   AELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRSWLYR 273
            + LEY SGFGN FSSEA+ GALPR QN+P+ CP+GLYAEQISGT+FT+PRKLNQRSWLYR
Sbjct: 10   SSLEYQSGFGNVFSSEAMGGALPRDQNSPLLCPFGLYAEQISGTAFTAPRKLNQRSWLYR 69

Query: 274  IKPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYTICGA 453
            IKPSVTHEPF P  P+H  LVSEFNQ++S ATPTQLRW+PA+VP+SPTDFIDGLYTICGA
Sbjct: 70   IKPSVTHEPFHPRVPTHAHLVSEFNQSSSSATPTQLRWKPADVPESPTDFIDGLYTICGA 129

Query: 454  GSSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGEVVVL 633
            GSSFLRHGYA+HMYAANKSM++CA+C+ADGDFL+VPQKGRLW+TTECG+LQ+ PGE+VVL
Sbjct: 130  GSSFLRHGYAVHMYAANKSMDSCAFCSADGDFLIVPQKGRLWLTTECGRLQICPGEIVVL 189

Query: 634  PQGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQGPRPG 813
            PQGFRF V+LPDGPS GYVAE+FGTHFQLP+LGPIGANGLAASRDFLVPTA+FE+   PG
Sbjct: 190  PQGFRFSVDLPDGPSRGYVAEVFGTHFQLPELGPIGANGLAASRDFLVPTAFFEEEHHPG 249

Query: 814  YTIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVNTVLT 993
            YTIVQKFGGELFTA QDFSPFNVV+WHGNYVPYKYDLSKFCPFNTVL DHGDPSVNTVLT
Sbjct: 250  YTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPFNTVLFDHGDPSVNTVLT 309

Query: 994  APTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFLPGGA 1173
            AP++KPGVAL+DFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAK DGFLPGGA
Sbjct: 310  APSEKPGVALVDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKKDGFLPGGA 369

Query: 1174 SLHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPYLDHN 1353
            SLHSCMTPHGPDTKTFEATV    +  PFRI DTMAFMFES LIPRICPWALESP LD +
Sbjct: 370  SLHSCMTPHGPDTKTFEATVSCEKSSEPFRIADTMAFMFESCLIPRICPWALESPDLDPD 429

Query: 1354 YYQCWIGLRSHFSYEEANCNTGELQDVDGAEHESSH 1461
            YY+CW+GL+SHF  +E      ++   DG    SS+
Sbjct: 430  YYKCWVGLKSHFLRKEVTQYVQKINLSDGRNAFSSN 465


>ref|XP_004510037.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Cicer arietinum]
          Length = 455

 Score =  778 bits (2009), Expect = 0.0
 Identities = 356/446 (79%), Positives = 402/446 (90%)
 Frame = +1

Query: 97   ELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 276
            +  YLSGFGN FSSEAIAGALP GQN+P+ CP+GLYAEQISGTSFT+PR LN  SWLYRI
Sbjct: 9    DFNYLSGFGNHFSSEAIAGALPVGQNSPLICPFGLYAEQISGTSFTTPRTLNLFSWLYRI 68

Query: 277  KPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYTICGAG 456
            KPSVTHEPFK   PS+ K++SEFN +NS A PTQLRW+P ++PDSPTDFIDGL T+CG+G
Sbjct: 69   KPSVTHEPFKARVPSNGKILSEFNDSNSSANPTQLRWKPEDIPDSPTDFIDGLSTVCGSG 128

Query: 457  SSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGEVVVLP 636
            SSF+RHGYAIHMY ANKSM+NCA+CNADGDFL+VPQ+GRL ITTECG+L+VSPG++ ++P
Sbjct: 129  SSFMRHGYAIHMYTANKSMDNCAFCNADGDFLIVPQQGRLLITTECGRLKVSPGDIAIIP 188

Query: 637  QGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQGPRPGY 816
            QGFRF+VNLPDGPS GYVAEIFGTHFQLPDLGPIGANGLAA RDFLVPTAWFE    PGY
Sbjct: 189  QGFRFNVNLPDGPSRGYVAEIFGTHFQLPDLGPIGANGLAAPRDFLVPTAWFEDKSYPGY 248

Query: 817  TIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVNTVLTA 996
            TIVQKFGGELFTAVQDFSPFNVV+WHGNYVPYKYDLSKFCP+NT L DH DPS+NTVLTA
Sbjct: 249  TIVQKFGGELFTAVQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTALYDHSDPSINTVLTA 308

Query: 997  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFLPGGAS 1176
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI+G+YEAK DGFLPGGAS
Sbjct: 309  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIHGNYEAKVDGFLPGGAS 368

Query: 1177 LHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPYLDHNY 1356
            LH+CMTPHGPDTK++EAT+ +G+N GP +ITDT+AFMFESSLIPRI   ALESP+LDH+Y
Sbjct: 369  LHNCMTPHGPDTKSYEATIARGDNVGPHKITDTLAFMFESSLIPRISRSALESPFLDHDY 428

Query: 1357 YQCWIGLRSHFSYEEANCNTGELQDV 1434
            YQCWIGLRSHF+  E +  +  L ++
Sbjct: 429  YQCWIGLRSHFTVSETSLGSLALNNI 454


>gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
          Length = 461

 Score =  778 bits (2009), Expect = 0.0
 Identities = 355/432 (82%), Positives = 392/432 (90%)
 Frame = +1

Query: 97   ELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 276
            EL+Y SGFGN FSSEAIAGALP  QN+P+ CPYGLYAEQISGTSFTSPRKLNQRSWLYRI
Sbjct: 10   ELKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 69

Query: 277  KPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYTICGAG 456
            KPSVTHEPFKP  P+HKKLVSEF+ +NS   PTQLRWRP ++PDS TDF+DGL+TICGAG
Sbjct: 70   KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPDSETDFVDGLFTICGAG 129

Query: 457  SSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGEVVVLP 636
            SSFLRHG+AIHMY ANK M++ A+CNADGDFL+VPQ GRLWI TECG+L VSPGE+ V+P
Sbjct: 130  SSFLRHGFAIHMYVANKGMKDSAFCNADGDFLLVPQTGRLWIETECGRLLVSPGEIAVIP 189

Query: 637  QGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQGPRPGY 816
            QGFRF ++LPDG S GYVAEI+G HFQLPDLGPIGANGLAA RDFL PTAWFE G RP Y
Sbjct: 190  QGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEDGLRPEY 249

Query: 817  TIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVNTVLTA 996
            TIVQKFGGELFTA QDFSPFNVV+WHGNYVPYKYDL KFCP+NTVL+DHGDPS+NTVLTA
Sbjct: 250  TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDHGDPSINTVLTA 309

Query: 997  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFLPGGAS 1176
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG+YEAK+DGFLPGGAS
Sbjct: 310  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 369

Query: 1177 LHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPYLDHNY 1356
            LHSCMTPHGPDT T+EAT+ + N   P ++T TMAFMFES+LIPR+C WALESP+LDH+Y
Sbjct: 370  LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDY 429

Query: 1357 YQCWIGLRSHFS 1392
            YQCWIGL+SHFS
Sbjct: 430  YQCWIGLKSHFS 441


>ref|XP_002864301.1| homogentisate 1,2-dioxygenase [Arabidopsis lyrata subsp. lyrata]
            gi|297310136|gb|EFH40560.1| homogentisate 1,2-dioxygenase
            [Arabidopsis lyrata subsp. lyrata]
          Length = 461

 Score =  777 bits (2007), Expect = 0.0
 Identities = 355/437 (81%), Positives = 394/437 (90%)
 Frame = +1

Query: 97   ELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 276
            +L+Y SGFGN FSSEAIAGALP  QN+P+ CPYGLYAEQISGTSFTSPRKLNQRSWLYRI
Sbjct: 10   DLKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 69

Query: 277  KPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYTICGAG 456
            KPSVTHEPFKP  P+HKKLVSEF+ +NS   PTQLRWRP ++P+S TDF+DGLYTICGAG
Sbjct: 70   KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPESETDFVDGLYTICGAG 129

Query: 457  SSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGEVVVLP 636
            SSFLRHG+AIHMY ANK M+N A+CNADGDFL+VPQ GRLWI TECG+L V+PGE+ V+P
Sbjct: 130  SSFLRHGFAIHMYVANKGMKNSAFCNADGDFLLVPQTGRLWIETECGRLLVTPGEIAVIP 189

Query: 637  QGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQGPRPGY 816
            QGFRF V+LPDG S GYVAEI+G HFQLPDLGPIGANGLAA RDFL PTAWFE+G RP Y
Sbjct: 190  QGFRFSVDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEEGLRPEY 249

Query: 817  TIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVNTVLTA 996
            TIVQKFG ELFTA QDFSPFNVV+WHGNYVPYKYDL KFCP+NTVL+DHGDPS+NTVLTA
Sbjct: 250  TIVQKFGAELFTAKQDFSPFNVVAWHGNYVPYKYDLQKFCPYNTVLLDHGDPSINTVLTA 309

Query: 997  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFLPGGAS 1176
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG+YEAK+DGFLPGGAS
Sbjct: 310  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 369

Query: 1177 LHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPYLDHNY 1356
            LHSCMTPHGPDT T+EAT+ + N   P ++T TMAFMFES+LIPR+C WALESP+LDH+Y
Sbjct: 370  LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDY 429

Query: 1357 YQCWIGLRSHFSYEEAN 1407
            YQCWIGL+SHFS  + N
Sbjct: 430  YQCWIGLKSHFSRIDLN 446


>ref|XP_007133067.1| hypothetical protein PHAVU_011G148800g [Phaseolus vulgaris]
            gi|561006067|gb|ESW05061.1| hypothetical protein
            PHAVU_011G148800g [Phaseolus vulgaris]
          Length = 458

 Score =  776 bits (2005), Expect = 0.0
 Identities = 355/438 (81%), Positives = 393/438 (89%)
 Frame = +1

Query: 97   ELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 276
            +  YLSGFGN  SSEA+ GALP GQN+P+ CP+GLYAEQISGTSFTSPR  N+ SW YRI
Sbjct: 15   DFTYLSGFGNHLSSEALPGALPEGQNSPLICPFGLYAEQISGTSFTSPRNRNRCSWFYRI 74

Query: 277  KPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYTICGAG 456
            KPSVTHEPFKP  PS+ K+ SEFN +NS A PTQLRW+P + PDSPTDFIDGL TICG+G
Sbjct: 75   KPSVTHEPFKPRVPSNWKIFSEFNSSNSSANPTQLRWKPMDAPDSPTDFIDGLSTICGSG 134

Query: 457  SSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGEVVVLP 636
            SSF+RHGYAIHMYAANKSM+NCA+CNADGDFL+VPQ+GRL ITTECG+L+VSPGE+ +LP
Sbjct: 135  SSFMRHGYAIHMYAANKSMDNCAFCNADGDFLIVPQQGRLLITTECGRLKVSPGEIAILP 194

Query: 637  QGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQGPRPGY 816
            QGFRF VNLPDGPS GYVAEIFGTHF+LPDLGPIGANGLAA RDFLVPTAWFE    PGY
Sbjct: 195  QGFRFSVNLPDGPSRGYVAEIFGTHFELPDLGPIGANGLAAPRDFLVPTAWFEDKSYPGY 254

Query: 817  TIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVNTVLTA 996
            TIVQKFGGELF AVQDFSPFNVV+WHGNY PYKYDLSKFCP+NTVL DH DPS+NTVLTA
Sbjct: 255  TIVQKFGGELFAAVQDFSPFNVVAWHGNYFPYKYDLSKFCPYNTVLFDHSDPSINTVLTA 314

Query: 997  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFLPGGAS 1176
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI+G YEAK+DGFLPGGAS
Sbjct: 315  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIHGGYEAKADGFLPGGAS 374

Query: 1177 LHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPYLDHNY 1356
            LH+CMTPHGPDTK++EAT+ +GN+ GP +ITDTMAFMFESSLIPRI  WALESP+LD +Y
Sbjct: 375  LHNCMTPHGPDTKSYEATIARGNDIGPSKITDTMAFMFESSLIPRISQWALESPFLDQDY 434

Query: 1357 YQCWIGLRSHFSYEEANC 1410
            YQCWIGLRSHF+ +   C
Sbjct: 435  YQCWIGLRSHFTVKTRAC 452


>ref|XP_006280403.1| hypothetical protein CARUB_v10026329mg [Capsella rubella]
            gi|482549107|gb|EOA13301.1| hypothetical protein
            CARUB_v10026329mg [Capsella rubella]
          Length = 476

 Score =  775 bits (2001), Expect = 0.0
 Identities = 354/437 (81%), Positives = 392/437 (89%)
 Frame = +1

Query: 97   ELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 276
            EL+Y SGFGN FSSEAIAGALP  QN+P+ CPYGLYAEQISGTSFTSPRKLNQRSWLYRI
Sbjct: 25   ELKYQSGFGNHFSSEAIAGALPLDQNSPLICPYGLYAEQISGTSFTSPRKLNQRSWLYRI 84

Query: 277  KPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYTICGAG 456
            KPSVTHEPFKP  P+HKKLVSEF+ +NS   PTQLRWRP ++P+S TDF+DGLYTICGAG
Sbjct: 85   KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPESATDFVDGLYTICGAG 144

Query: 457  SSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGEVVVLP 636
            SSFLRHG+AIHMY ANK M++ A+CNADGDFL+VPQ GRLWI TECG+L VSPGE+ V+P
Sbjct: 145  SSFLRHGFAIHMYVANKGMKDSAFCNADGDFLLVPQAGRLWIETECGRLLVSPGEIAVIP 204

Query: 637  QGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQGPRPGY 816
            QGFRF ++LPDG S GYVAEI+G HFQLPDLGPIGANGLAA RDFL PTAWFE   RP Y
Sbjct: 205  QGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEDAVRPDY 264

Query: 817  TIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVNTVLTA 996
            TI+QKFGGELFTA QDFSPFNVV+WHGNYVPYKYDL KFCP+N VL+DHGDPSVNTVLTA
Sbjct: 265  TIIQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLQKFCPYNAVLLDHGDPSVNTVLTA 324

Query: 997  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFLPGGAS 1176
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG+YEAK+DGFLPGGAS
Sbjct: 325  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 384

Query: 1177 LHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPYLDHNY 1356
            LHSCMTPHGPDT T+EAT+ + N   P ++T TMAFMFES+LIPR+C WALESP+LDH+Y
Sbjct: 385  LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDY 444

Query: 1357 YQCWIGLRSHFSYEEAN 1407
            YQCWIGL+SHFS  + N
Sbjct: 445  YQCWIGLKSHFSRIDLN 461


>ref|NP_200219.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
            gi|30696407|ref|NP_851187.1| homogentisate
            1,2-dioxygenase [Arabidopsis thaliana]
            gi|13432134|sp|Q9ZRA2.2|HGD_ARATH RecName:
            Full=Homogentisate 1,2-dioxygenase; AltName:
            Full=Homogentisate oxygenase; AltName: Full=Homogentisic
            acid oxidase; AltName: Full=Homogentisicase
            gi|7108615|gb|AAF36499.1|AF130845_1 homogentisate
            1,2-dioxygenase [Arabidopsis thaliana]
            gi|8809579|dbj|BAA97130.1| homogentisate 1,2-dioxygenase
            [Arabidopsis thaliana] gi|22655252|gb|AAM98216.1|
            homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
            gi|33942055|gb|AAQ55280.1| At5g54080 [Arabidopsis
            thaliana] gi|332009064|gb|AED96447.1| homogentisate
            1,2-dioxygenase [Arabidopsis thaliana]
            gi|332009065|gb|AED96448.1| homogentisate 1,2-dioxygenase
            [Arabidopsis thaliana]
          Length = 461

 Score =  774 bits (1998), Expect = 0.0
 Identities = 352/432 (81%), Positives = 391/432 (90%)
 Frame = +1

Query: 97   ELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 276
            EL+Y SGFGN FSSEAIAGALP  QN+P+ CPYGLYAEQISGTSFTSPRKLNQRSWLYR+
Sbjct: 10   ELKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRV 69

Query: 277  KPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYTICGAG 456
            KPSVTHEPFKP  P+HKKLVSEF+ +NS   PTQLRWRP ++PDS  DF+DGL+TICGAG
Sbjct: 70   KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPDSEIDFVDGLFTICGAG 129

Query: 457  SSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGEVVVLP 636
            SSFLRHG+AIHMY AN  M++ A+CNADGDFL+VPQ GRLWI TECG+L V+PGE+ V+P
Sbjct: 130  SSFLRHGFAIHMYVANTGMKDSAFCNADGDFLLVPQTGRLWIETECGRLLVTPGEIAVIP 189

Query: 637  QGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQGPRPGY 816
            QGFRF ++LPDG S GYVAEI+G HFQLPDLGPIGANGLAASRDFL PTAWFE G RP Y
Sbjct: 190  QGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAASRDFLAPTAWFEDGLRPEY 249

Query: 817  TIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVNTVLTA 996
            TIVQKFGGELFTA QDFSPFNVV+WHGNYVPYKYDL KFCP+NTVL+DHGDPS+NTVLTA
Sbjct: 250  TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDHGDPSINTVLTA 309

Query: 997  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFLPGGAS 1176
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG+YEAK+DGFLPGGAS
Sbjct: 310  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 369

Query: 1177 LHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPYLDHNY 1356
            LHSCMTPHGPDT T+EAT+ + N   P ++T TMAFMFES+LIPR+C WALESP+LDH+Y
Sbjct: 370  LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDY 429

Query: 1357 YQCWIGLRSHFS 1392
            YQCWIGL+SHFS
Sbjct: 430  YQCWIGLKSHFS 441


>ref|XP_007021636.1| Homogentisate 1,2-dioxygenase isoform 2, partial [Theobroma cacao]
            gi|508721264|gb|EOY13161.1| Homogentisate 1,2-dioxygenase
            isoform 2, partial [Theobroma cacao]
          Length = 421

 Score =  773 bits (1997), Expect = 0.0
 Identities = 355/432 (82%), Positives = 390/432 (90%)
 Frame = +1

Query: 97   ELEYLSGFGNDFSSEAIAGALPRGQNNPIRCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 276
            +LEY SGFGN FSSEAIAGALPRGQN+P+ CP+GLYAEQISGTSFTSPRKLNQRSWLYRI
Sbjct: 2    DLEYQSGFGNHFSSEAIAGALPRGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 61

Query: 277  KPSVTHEPFKPLSPSHKKLVSEFNQTNSFATPTQLRWRPAEVPDSPTDFIDGLYTICGAG 456
            KPSVTHEPF P   SHKKLVSEF+ +N+ A PTQLRW+P ++PD+PTDFIDGL+TICGAG
Sbjct: 62   KPSVTHEPFWPRDSSHKKLVSEFDGSNTVANPTQLRWKPVDIPDTPTDFIDGLFTICGAG 121

Query: 457  SSFLRHGYAIHMYAANKSMENCAYCNADGDFLVVPQKGRLWITTECGKLQVSPGEVVVLP 636
            SSFLRHGYAIHMY ANKSM+NCA+CNADGDFLVVPQ+GRLWITTECG+LQVSPGE+ VLP
Sbjct: 122  SSFLRHGYAIHMYTANKSMDNCAFCNADGDFLVVPQQGRLWITTECGRLQVSPGEIAVLP 181

Query: 637  QGFRFDVNLPDGPSLGYVAEIFGTHFQLPDLGPIGANGLAASRDFLVPTAWFEQGPRPGY 816
            QGFRF V+LPDGPS GYVAE+FG            ANGLAASRDFL PTAWFE+ PRPG+
Sbjct: 182  QGFRFVVDLPDGPSRGYVAEVFG------------ANGLAASRDFLAPTAWFEEHPRPGF 229

Query: 817  TIVQKFGGELFTAVQDFSPFNVVSWHGNYVPYKYDLSKFCPFNTVLIDHGDPSVNTVLTA 996
            TIVQKFGGELF A QDFSPFNVV+WHGNYVPYKYDLSKFCP+NTVL+DHGDPS+NTVLTA
Sbjct: 230  TIVQKFGGELFNARQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLVDHGDPSINTVLTA 289

Query: 997  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKSDGFLPGGAS 1176
            PTDKPGVALLDFVIFP RWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAK+DGFLPGGAS
Sbjct: 290  PTDKPGVALLDFVIFPSRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 349

Query: 1177 LHSCMTPHGPDTKTFEATVVQGNNPGPFRITDTMAFMFESSLIPRICPWALESPYLDHNY 1356
            LHSCMTPHGPDTKT+EAT+ +G   GP +ITDTMAFMFES L+PR CPW LESP+ DH+Y
Sbjct: 350  LHSCMTPHGPDTKTYEATIARGYEAGPHKITDTMAFMFESFLMPRTCPWVLESPFRDHDY 409

Query: 1357 YQCWIGLRSHFS 1392
            YQCW+GL+SHFS
Sbjct: 410  YQCWVGLKSHFS 421


Top