BLASTX nr result

ID: Atropa21_contig00015578 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00015578
         (1859 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006358956.1| PREDICTED: LOW QUALITY PROTEIN: homogentisat...   936   0.0  
ref|XP_004251883.1| PREDICTED: LOW QUALITY PROTEIN: homogentisat...   936   0.0  
gb|AAF73132.1|AF149017_1 homogentisate 1,2-dioxygenase [Solanum ...   926   0.0  
ref|XP_006494848.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   840   0.0  
gb|EXB75014.1| Homogentisate 1,2-dioxygenase [Morus notabilis]        838   0.0  
gb|EOY13160.1| Homogentisate 1,2-dioxygenase isoform 1 [Theobrom...   830   0.0  
ref|XP_002285298.1| PREDICTED: homogentisate 1,2-dioxygenase [Vi...   830   0.0  
gb|EMJ15072.1| hypothetical protein PRUPE_ppa005219mg [Prunus pe...   827   0.0  
ref|XP_004137214.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   823   0.0  
ref|XP_002518387.1| homogentisate 1,2-dioxygenase, putative [Ric...   818   0.0  
ref|XP_002864301.1| homogentisate 1,2-dioxygenase [Arabidopsis l...   800   0.0  
ref|XP_006858313.1| hypothetical protein AMTR_s00064p00100410 [A...   797   0.0  
gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]   796   0.0  
ref|XP_006280403.1| hypothetical protein CARUB_v10026329mg [Caps...   796   0.0  
ref|XP_002298900.1| hypothetical protein POPTR_0001s38310g [Popu...   795   0.0  
gb|EOY13161.1| Homogentisate 1,2-dioxygenase isoform 2, partial ...   794   0.0  
ref|NP_200219.1| homogentisate 1,2-dioxygenase [Arabidopsis thal...   789   0.0  
ref|XP_004510037.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   788   0.0  
gb|ESW05061.1| hypothetical protein PHAVU_011G148800g [Phaseolus...   788   0.0  
gb|AAD00360.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]   787   0.0  

>ref|XP_006358956.1| PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase-like
            [Solanum tuberosum]
          Length = 492

 Score =  936 bits (2419), Expect(2) = 0.0
 Identities = 442/465 (95%), Positives = 449/465 (96%)
 Frame = -1

Query: 1670 MECKDSSKTCNFPCDLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFT 1491
            MEC  SS++ NFP DLEYQTGFGNHFSSEAI GALPQGQNSPLICPFGLYAEQISGTSFT
Sbjct: 1    MEC--SSRSSNFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFT 58

Query: 1490 SPRKLNQRSWLYRIKPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETP 1311
            SPRKLNQRSWLYRIKPSVTHEPFR R+PRHEKLVSEFN SNSSATPTQLRWKPVEIPETP
Sbjct: 59   SPRKLNQRSWLYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETP 118

Query: 1310 TDFIDGLYTICGAGSSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTEC 1131
            TDFIDGLYTICGAGSSYLRHGFAIHMYTANKSMEN AFCNADGDFLIVPQKGRLWITTEC
Sbjct: 119  TDFIDGLYTICGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTEC 178

Query: 1130 GRLQVCLGEIVILPQGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFL 951
            GRLQVC GEIVILPQG+RFAVDLPDGPSRGYVAE FGTHLQLPDLGPIGANGLAAPRDFL
Sbjct: 179  GRLQVCPGEIVILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFL 238

Query: 950  VPVAWYEDGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL 771
            VPVAWYEDGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL
Sbjct: 239  VPVAWYEDGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL 298

Query: 770  MDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGG 591
            MDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI GG
Sbjct: 299  MDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLINGG 358

Query: 590  YEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRV 411
            YEAKADGF PGGASLHSCMTPHGPDTKTYEATIALGNEAGPH+I DTMAFMFESCLIPRV
Sbjct: 359  YEAKADGFHPGGASLHSCMTPHGPDTKTYEATIALGNEAGPHRIADTMAFMFESCLIPRV 418

Query: 410  CPWALESPFMDHDYYQCWIGLKSHFSGLSMNEDNRDLQNGQPIER 276
            CPWALESPFMDHDYYQCWIGLKSHFSGLSMNEDN DLQ G+PI +
Sbjct: 419  CPWALESPFMDHDYYQCWIGLKSHFSGLSMNEDNVDLQKGKPIXK 463



 Score = 49.7 bits (117), Expect(2) = 0.0
 Identities = 25/29 (86%), Positives = 26/29 (89%)
 Frame = -2

Query: 280 KGEMLIVKLASVVPLKVT*LVHKASRACK 194
           KGEMLIV+LA VVPLKVT LVHK SRACK
Sbjct: 463 KGEMLIVQLAPVVPLKVTWLVHKESRACK 491


>ref|XP_004251883.1| PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase
            [Solanum lycopersicum]
          Length = 480

 Score =  936 bits (2420), Expect(2) = 0.0
 Identities = 441/465 (94%), Positives = 449/465 (96%)
 Frame = -1

Query: 1670 MECKDSSKTCNFPCDLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFT 1491
            MEC  SS+T NFP DLEYQTGFGNHFSSEAI GALPQGQNSPLICPFGLYAEQISGTSFT
Sbjct: 1    MEC--SSRTSNFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFT 58

Query: 1490 SPRKLNQRSWLYRIKPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETP 1311
            SPRKLNQRSWLYRIKPSVTHEPFR R+PRHEKLVSEFN SNSSATPTQLRWKPVEIPETP
Sbjct: 59   SPRKLNQRSWLYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETP 118

Query: 1310 TDFIDGLYTICGAGSSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTEC 1131
            TDFIDGLYTICGAGSSYLRHGFAIHMYTANKSMEN AFCNADGDFLIVPQKGRLWITTEC
Sbjct: 119  TDFIDGLYTICGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTEC 178

Query: 1130 GRLQVCLGEIVILPQGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFL 951
            GRLQVC GEIVILPQG+RFAVDLPDGPSRGYVAE FGTHLQLPDLGPIGANGLAAPRDFL
Sbjct: 179  GRLQVCPGEIVILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFL 238

Query: 950  VPVAWYEDGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL 771
            VPVAWY DGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL
Sbjct: 239  VPVAWYGDGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL 298

Query: 770  MDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGG 591
            MDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGG
Sbjct: 299  MDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGG 358

Query: 590  YEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRV 411
            YEAKADGF PGGASLHSCMTPHGPDTKT+EATIALGNEAGPH+I DTMAFMFESCL+PRV
Sbjct: 359  YEAKADGFHPGGASLHSCMTPHGPDTKTFEATIALGNEAGPHRIADTMAFMFESCLVPRV 418

Query: 410  CPWALESPFMDHDYYQCWIGLKSHFSGLSMNEDNRDLQNGQPIER 276
            CPWALESPFMDHDYYQCWIGLKSHFSGLSMNEDN DLQ G+PI +
Sbjct: 419  CPWALESPFMDHDYYQCWIGLKSHFSGLSMNEDNVDLQKGKPIXK 463



 Score = 30.4 bits (67), Expect(2) = 0.0
 Identities = 15/18 (83%), Positives = 16/18 (88%)
 Frame = -2

Query: 280 KGEMLIVKLASVVPLKVT 227
           KGEMLIV+LA VVP KVT
Sbjct: 463 KGEMLIVQLAPVVPPKVT 480


>gb|AAF73132.1|AF149017_1 homogentisate 1,2-dioxygenase [Solanum lycopersicum]
          Length = 477

 Score =  926 bits (2394), Expect(2) = 0.0
 Identities = 434/454 (95%), Positives = 441/454 (97%)
 Frame = -1

Query: 1649 KTCNFPCDLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQ 1470
            +T NFP DLEYQTGFGNHFSSEAI GALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQ
Sbjct: 3    RTSNFPSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQ 62

Query: 1469 RSWLYRIKPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGL 1290
            RSWLYRIKPSVTHEPFR R+PRHEKLVSEFN SNSSATPTQLRWKPVEIPETPTDFIDGL
Sbjct: 63   RSWLYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGL 122

Query: 1289 YTICGAGSSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCL 1110
            YTICGAGSSYLRHGFAIHMYTANKSMEN AFCNADGDFLIVPQKGRLWITTECGRLQVC 
Sbjct: 123  YTICGAGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCP 182

Query: 1109 GEIVILPQGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYE 930
            GEIVILPQG+RFAVDLPDGPSRGYVAE FGTHLQLPDLGPIGANGLAAPRDFLVPVAWY 
Sbjct: 183  GEIVILPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYG 242

Query: 929  DGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPS 750
            DGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPS
Sbjct: 243  DGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPS 302

Query: 749  INTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADG 570
            INTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADG
Sbjct: 303  INTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADG 362

Query: 569  FLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALES 390
            F PGGASLHSCMTPHGPDTKT+EATIALGNEAGPH+I DTMAFMFESCL+PRVCPWALES
Sbjct: 363  FHPGGASLHSCMTPHGPDTKTFEATIALGNEAGPHRIADTMAFMFESCLVPRVCPWALES 422

Query: 389  PFMDHDYYQCWIGLKSHFSGLSMNEDNRDLQNGQ 288
            PFMDHDYYQCWIGLKSHFSGLSMNEDN DLQ G+
Sbjct: 423  PFMDHDYYQCWIGLKSHFSGLSMNEDNVDLQKGK 456



 Score = 30.4 bits (67), Expect(2) = 0.0
 Identities = 15/18 (83%), Positives = 16/18 (88%)
 Frame = -2

Query: 280 KGEMLIVKLASVVPLKVT 227
           KGEMLIV+LA VVP KVT
Sbjct: 460 KGEMLIVQLAPVVPPKVT 477


>ref|XP_006494848.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Citrus sinensis]
          Length = 478

 Score =  840 bits (2170), Expect = 0.0
 Identities = 383/432 (88%), Positives = 412/432 (95%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            DL Y++GFGN FSSEAIDGALP+GQNSPL+CPFGLYAEQISGTSFTSPRKLNQRSWLYRI
Sbjct: 23   DLNYESGFGNSFSSEAIDGALPRGQNSPLVCPFGLYAEQISGTSFTSPRKLNQRSWLYRI 82

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPS THEPF+ R+P H KLVSEF+ SNS  TPTQLRWKPV+IP++PTDFIDGLYTICGAG
Sbjct: 83   KPSATHEPFKPRVPAHGKLVSEFDKSNSYTTPTQLRWKPVDIPDSPTDFIDGLYTICGAG 142

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS+LRHG+AIHMYTANKSM+NCAFCNADGDFL+VPQKGRLWI TECG+L+V  GEI +LP
Sbjct: 143  SSFLRHGYAIHMYTANKSMDNCAFCNADGDFLVVPQKGRLWIATECGKLEVSPGEIAVLP 202

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRFAV LPDGPSRGY+AEIFGTH QLPDLGPIGANGLAAPRDFLVP AW+E+GSR GY
Sbjct: 203  QGFRFAVSLPDGPSRGYIAEIFGTHFQLPDLGPIGANGLAAPRDFLVPTAWFEEGSRLGY 262

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
            TIVQK+GGELFTA+QDFSPFNVVAWHGNYVPYKYDLSKFCP+NTVL+DH DPSINTVLTA
Sbjct: 263  TIVQKFGGELFTARQDFSPFNVVAWHGNYVPYKYDLSKFCPFNTVLVDHGDPSINTVLTA 322

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI GGYEAKADGFLPGGAS
Sbjct: 323  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIRGGYEAKADGFLPGGAS 382

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LHSCMTPHGPDTKTYEATIA G+EAGP+KITDTMAFMFESCLIPR+CPWALESPFMDHDY
Sbjct: 383  LHSCMTPHGPDTKTYEATIARGSEAGPYKITDTMAFMFESCLIPRICPWALESPFMDHDY 442

Query: 368  YQCWIGLKSHFS 333
            Y+CWIGL+SHFS
Sbjct: 443  YRCWIGLRSHFS 454


>gb|EXB75014.1| Homogentisate 1,2-dioxygenase [Morus notabilis]
          Length = 460

 Score =  838 bits (2166), Expect = 0.0
 Identities = 382/442 (86%), Positives = 414/442 (93%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            +L YQ+G+GN FSSEA+ GALP GQNSPL+CP+ LYAEQISGTSFTSPRKLN RSWLYRI
Sbjct: 9    ELSYQSGYGNSFSSEALAGALPHGQNSPLLCPYSLYAEQISGTSFTSPRKLNLRSWLYRI 68

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPSVTHEPF+ R+P H KL+SEF+ SNSSATPTQLRWKPVEIP++PTDF+DGL+T+CGAG
Sbjct: 69   KPSVTHEPFKPRVPSHGKLLSEFDRSNSSATPTQLRWKPVEIPDSPTDFVDGLFTVCGAG 128

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS+LRHGFA+HMYTANKSM+NCAFCNADGDFLIVPQKGRLWITTECG+LQV  GE+ ILP
Sbjct: 129  SSFLRHGFAVHMYTANKSMDNCAFCNADGDFLIVPQKGRLWITTECGKLQVSPGEVAILP 188

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRFAVDLPDGPSRGYVAEIFG H QLPDLGPIGANGLAAPRDFL P AW+EDG RPGY
Sbjct: 189  QGFRFAVDLPDGPSRGYVAEIFGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEDGRRPGY 248

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
            TIVQK+GGELFTAKQDFSPFNVVAWHGN+VPYKYDLSKFCPYNTVL+DHSDPSINTVLTA
Sbjct: 249  TIVQKFGGELFTAKQDFSPFNVVAWHGNHVPYKYDLSKFCPYNTVLVDHSDPSINTVLTA 308

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFV+FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGG+S
Sbjct: 309  PTDKPGVALLDFVVFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGSS 368

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LHSCMTPHGPDTKTYEATIA GNE GP +I DTMAFMFESCL+PRVC WALESPFMDHDY
Sbjct: 369  LHSCMTPHGPDTKTYEATIARGNEPGPFRIKDTMAFMFESCLMPRVCAWALESPFMDHDY 428

Query: 368  YQCWIGLKSHFSGLSMNEDNRD 303
            YQCWIGL+SHF+  S N  ++D
Sbjct: 429  YQCWIGLRSHFTWESRNATSKD 450


>gb|EOY13160.1| Homogentisate 1,2-dioxygenase isoform 1 [Theobroma cacao]
          Length = 451

 Score =  830 bits (2144), Expect = 0.0
 Identities = 383/435 (88%), Positives = 406/435 (93%)
 Frame = -1

Query: 1637 FPCDLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWL 1458
            FP DLEYQ+GFGNHFSSEAI GALP+GQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWL
Sbjct: 17   FPEDLEYQSGFGNHFSSEAIAGALPRGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWL 76

Query: 1457 YRIKPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTIC 1278
            YRIKPSVTHEPF  R   H+KLVSEF+ SN+ A PTQLRWKPV+IP+TPTDFIDGL+TIC
Sbjct: 77   YRIKPSVTHEPFWPRDSSHKKLVSEFDGSNTVANPTQLRWKPVDIPDTPTDFIDGLFTIC 136

Query: 1277 GAGSSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIV 1098
            GAGSS+LRHG+AIHMYTANKSM+NCAFCNADGDFL+VPQ+GRLWITTECGRLQV  GEI 
Sbjct: 137  GAGSSFLRHGYAIHMYTANKSMDNCAFCNADGDFLVVPQQGRLWITTECGRLQVSPGEIA 196

Query: 1097 ILPQGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSR 918
            +LPQGFRF VDLPDGPSRGYVAE+FGTH QLPDLGPIGANGLAA RDFL P AW+E+  R
Sbjct: 197  VLPQGFRFVVDLPDGPSRGYVAEVFGTHFQLPDLGPIGANGLAASRDFLAPTAWFEEHPR 256

Query: 917  PGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTV 738
            PG+TIVQK+GGELF A+QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL+DH DPSINTV
Sbjct: 257  PGFTIVQKFGGELFNARQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLVDHGDPSINTV 316

Query: 737  LTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPG 558
            LTAPTDKPGVALLDFVIFP RWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPG
Sbjct: 317  LTAPTDKPGVALLDFVIFPSRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPG 376

Query: 557  GASLHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMD 378
            GASLHSCMTPHGPDTKTYEATIA G EAGPHKITDTMAFMFES L+PR CPW LESPF D
Sbjct: 377  GASLHSCMTPHGPDTKTYEATIARGYEAGPHKITDTMAFMFESFLMPRTCPWVLESPFRD 436

Query: 377  HDYYQCWIGLKSHFS 333
            HDYYQCW+GLKSHFS
Sbjct: 437  HDYYQCWVGLKSHFS 451


>ref|XP_002285298.1| PREDICTED: homogentisate 1,2-dioxygenase [Vitis vinifera]
            gi|302142933|emb|CBI20228.3| unnamed protein product
            [Vitis vinifera]
          Length = 463

 Score =  830 bits (2143), Expect = 0.0
 Identities = 381/448 (85%), Positives = 414/448 (92%)
 Frame = -1

Query: 1634 PCDLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLY 1455
            P DL+YQ GFGNH SSEAI GALP+GQN+PL CPFGLYAEQISGT FT+PRK NQ SWLY
Sbjct: 15   PSDLQYQFGFGNHLSSEAIAGALPRGQNNPLTCPFGLYAEQISGTPFTAPRKQNQFSWLY 74

Query: 1454 RIKPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICG 1275
            RIKPSVTHEPF+ R+P H KLVSEFN SNSS  PTQLRWKPVEIP++PTDFIDGLYT+CG
Sbjct: 75   RIKPSVTHEPFKPRVPSHGKLVSEFNQSNSSTNPTQLRWKPVEIPDSPTDFIDGLYTVCG 134

Query: 1274 AGSSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVI 1095
            AGSS+LRHG+AIHMYTANKSM+NCAFCNADGDFLIVPQKGRL ITTECG+LQV  GEIV+
Sbjct: 135  AGSSFLRHGYAIHMYTANKSMDNCAFCNADGDFLIVPQKGRLSITTECGKLQVSPGEIVV 194

Query: 1094 LPQGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRP 915
            LP GFRF VDLPDGPSRGYVAEIFG H QLPDLGPIGANGLAA RDFLVPVAWYE+ SRP
Sbjct: 195  LPHGFRFVVDLPDGPSRGYVAEIFGAHFQLPDLGPIGANGLAASRDFLVPVAWYEECSRP 254

Query: 914  GYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVL 735
            GYTIVQK+GGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCP NTVL DH+DPSINTVL
Sbjct: 255  GYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPVNTVLKDHADPSINTVL 314

Query: 734  TAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGG 555
            TAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGG
Sbjct: 315  TAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGG 374

Query: 554  ASLHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDH 375
            ASLHSCMTPHGPDTKT+EAT+A G +AGP +IT+TMAFMFESCLIPR+CPWAL+SP +DH
Sbjct: 375  ASLHSCMTPHGPDTKTFEATVAHGKDAGPFRITNTMAFMFESCLIPRICPWALDSPSIDH 434

Query: 374  DYYQCWIGLKSHFSGLSMNEDNRDLQNG 291
            DYYQCW+GL+SHFS    +++++ +QNG
Sbjct: 435  DYYQCWVGLRSHFSREEASDESQTIQNG 462


>gb|EMJ15072.1| hypothetical protein PRUPE_ppa005219mg [Prunus persica]
          Length = 472

 Score =  827 bits (2136), Expect = 0.0
 Identities = 374/447 (83%), Positives = 410/447 (91%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            DL+YQ+GF NHFSSEA+ G LP GQ+SPL+CP+GLYAEQISGTSFTSPRKLN R+WLYR+
Sbjct: 23   DLQYQSGFHNHFSSEALPGTLPHGQSSPLLCPYGLYAEQISGTSFTSPRKLNHRTWLYRV 82

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPSVTHEPF+     H KLVSEF  SNSS TPTQLRWKPV+IPETPTDF++GLYT+CGAG
Sbjct: 83   KPSVTHEPFKPLESSHRKLVSEFTDSNSSTTPTQLRWKPVDIPETPTDFVEGLYTVCGAG 142

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS+LRHGFAIHMYTANKSM+NCAFCNADGDFLIVPQ GRLWITTECG+LQ+  GEI +LP
Sbjct: 143  SSFLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPQTGRLWITTECGKLQISPGEIAVLP 202

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRFAVDLPDGPSRGYVAE+FGTH QLPDLGPIGANGLAAPRDFLVP AW+ED  RPGY
Sbjct: 203  QGFRFAVDLPDGPSRGYVAEVFGTHFQLPDLGPIGANGLAAPRDFLVPTAWFEDSYRPGY 262

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
             I+QK+GGELFTAKQ+FSPFNVVAWHGNY PYKYDL+ FCP+NTVL DH DPSINTVLTA
Sbjct: 263  VIIQKFGGELFTAKQEFSPFNVVAWHGNYAPYKYDLTTFCPFNTVLFDHGDPSINTVLTA 322

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS
Sbjct: 323  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 382

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LHSCMTPHGPDTKTYEATIA GNEAGP +I+DT+AFMFESCLIPR+CPWALESPF+D DY
Sbjct: 383  LHSCMTPHGPDTKTYEATIARGNEAGPSRISDTLAFMFESCLIPRICPWALESPFIDRDY 442

Query: 368  YQCWIGLKSHFSGLSMNEDNRDLQNGQ 288
            YQCWIGL+SHF+    +  + D+QNG+
Sbjct: 443  YQCWIGLRSHFTREGASAKDGDIQNGE 469


>ref|XP_004137214.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Cucumis sativus]
            gi|449524824|ref|XP_004169421.1| PREDICTED: homogentisate
            1,2-dioxygenase-like [Cucumis sativus]
          Length = 471

 Score =  823 bits (2127), Expect = 0.0
 Identities = 377/442 (85%), Positives = 406/442 (91%)
 Frame = -1

Query: 1640 NFPCDLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSW 1461
            +FP DL Y +GF NHFSSEAI GALPQ QNSPLICPFGLYAEQISGTSFTSPRK N  SW
Sbjct: 13   DFPSDLPYLSGFNNHFSSEAIPGALPQSQNSPLICPFGLYAEQISGTSFTSPRKANLCSW 72

Query: 1460 LYRIKPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTI 1281
            LYRIKPSVTHEPFR R+P++EKL+SEFN SN S+TPTQLRWKP + P++P DF+DGLYT+
Sbjct: 73   LYRIKPSVTHEPFRQRLPKNEKLISEFNASNCSSTPTQLRWKPADFPDSPVDFVDGLYTV 132

Query: 1280 CGAGSSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEI 1101
            CGAGSS+LRHGFAIHMYTANKSMENCAFCNADGDFLIVPQ G+LWI TECGRL+V  GE+
Sbjct: 133  CGAGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGKLWIITECGRLEVSPGEV 192

Query: 1100 VILPQGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGS 921
            V+LPQGFRF V LPDGPSRGYVAEIFG+H QLPDLGPIGANGLAAPRDFL PVAW+E+  
Sbjct: 193  VVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENSP 252

Query: 920  RPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINT 741
            RPGYTI+QK+GGELFTA QDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL DHSDPSINT
Sbjct: 253  RPGYTIIQKFGGELFTAIQDFSPFNVVAWHGNYVPYKYDLCKFCPYNTVLFDHSDPSINT 312

Query: 740  VLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLP 561
            VLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF+P
Sbjct: 313  VLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFVP 372

Query: 560  GGASLHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFM 381
            GGASLHSCMTPHGPDTKTYEATIA GN+AGPHKI+ TMAFMFES LIPRVC WALESPF+
Sbjct: 373  GGASLHSCMTPHGPDTKTYEATIARGNDAGPHKISGTMAFMFESSLIPRVCSWALESPFI 432

Query: 380  DHDYYQCWIGLKSHFSGLSMNE 315
            DHDYYQCWIGLKSHF   ++ +
Sbjct: 433  DHDYYQCWIGLKSHFKNEAIGD 454


>ref|XP_002518387.1| homogentisate 1,2-dioxygenase, putative [Ricinus communis]
            gi|223542482|gb|EEF44023.1| homogentisate
            1,2-dioxygenase, putative [Ricinus communis]
          Length = 457

 Score =  818 bits (2112), Expect = 0.0
 Identities = 377/438 (86%), Positives = 412/438 (94%), Gaps = 2/438 (0%)
 Frame = -1

Query: 1640 NFPCD-LEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRS 1464
            +FP D  +Y +GFGN F SEAI GALP+GQNSPLICP+GLYAEQISG+SFTSPRKL+QRS
Sbjct: 10   DFPSDDHDYLSGFGNTFESEAIHGALPRGQNSPLICPYGLYAEQISGSSFTSPRKLSQRS 69

Query: 1463 WLYRIKPSVTHEPFRSRIPRHEKLVSEFNHSNS-SATPTQLRWKPVEIPETPTDFIDGLY 1287
            WLYRIKPSVTHEPF+ R+P H K+VSEF+ ++S + TPTQLRWKPV+IP++PTDFIDGL+
Sbjct: 70   WLYRIKPSVTHEPFKPRVPSHGKIVSEFDKTDSCTTTPTQLRWKPVDIPDSPTDFIDGLF 129

Query: 1286 TICGAGSSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLG 1107
            TICGAGSS+LRHGFAIHMYTANKSM NCA CNADGDFL+VPQ+GRLWITTECG+LQV  G
Sbjct: 130  TICGAGSSFLRHGFAIHMYTANKSMGNCALCNADGDFLVVPQEGRLWITTECGKLQVSPG 189

Query: 1106 EIVILPQGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYED 927
            E+V+LPQGFRFAVDLPDGPSRGYVAEIFGTH QLPDLGPIGANGLAAPRDFLVP AWYE+
Sbjct: 190  EVVVLPQGFRFAVDLPDGPSRGYVAEIFGTHFQLPDLGPIGANGLAAPRDFLVPKAWYEE 249

Query: 926  GSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSI 747
            G  PGYTI+QK+GGELFTAKQDFSPFNVVAWHGN+VPYKYDL KFCPYNTVL+DHSDPSI
Sbjct: 250  GPCPGYTIIQKFGGELFTAKQDFSPFNVVAWHGNFVPYKYDLKKFCPYNTVLIDHSDPSI 309

Query: 746  NTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF 567
            NTVLTA TDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF
Sbjct: 310  NTVLTASTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGF 369

Query: 566  LPGGASLHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESP 387
            +PGGASLHSCMTPHGPDTKTYEATIA GN+AGP +ITDTMAFMFESCLIPR+C WA+ESP
Sbjct: 370  VPGGASLHSCMTPHGPDTKTYEATIARGNDAGPSRITDTMAFMFESCLIPRICLWAVESP 429

Query: 386  FMDHDYYQCWIGLKSHFS 333
            F+DHDYYQCWIGLKSHFS
Sbjct: 430  FIDHDYYQCWIGLKSHFS 447


>ref|XP_002864301.1| homogentisate 1,2-dioxygenase [Arabidopsis lyrata subsp. lyrata]
            gi|297310136|gb|EFH40560.1| homogentisate 1,2-dioxygenase
            [Arabidopsis lyrata subsp. lyrata]
          Length = 461

 Score =  800 bits (2065), Expect = 0.0
 Identities = 369/448 (82%), Positives = 399/448 (89%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            DL+YQ+GFGNHFSSEAI GALP  QNSPL+CP+GLYAEQISGTSFTSPRKLNQRSWLYRI
Sbjct: 10   DLKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 69

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPSVTHEPF+ R+P H+KLVSEF+ SNS   PTQLRW+P +IPE+ TDF+DGLYTICGAG
Sbjct: 70   KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPESETDFVDGLYTICGAG 129

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS+LRHGFAIHMY ANK M+N AFCNADGDFL+VPQ GRLWI TECGRL V  GEI ++P
Sbjct: 130  SSFLRHGFAIHMYVANKGMKNSAFCNADGDFLLVPQTGRLWIETECGRLLVTPGEIAVIP 189

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRF+VDLPDG SRGYVAEI+G H QLPDLGPIGANGLAAPRDFL P AW+E+G RP Y
Sbjct: 190  QGFRFSVDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEEGLRPEY 249

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
            TIVQK+G ELFTAKQDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL+DH DPSINTVLTA
Sbjct: 250  TIVQKFGAELFTAKQDFSPFNVVAWHGNYVPYKYDLQKFCPYNTVLLDHGDPSINTVLTA 309

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPGGAS
Sbjct: 310  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 369

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LHSCMTPHGPDT TYEATIA  N   P K+T TMAFMFES LIPRVC WALESPF+DHDY
Sbjct: 370  LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDY 429

Query: 368  YQCWIGLKSHFSGLSMNEDNRDLQNGQP 285
            YQCWIGLKSHFS + +N+ N +    +P
Sbjct: 430  YQCWIGLKSHFSRIDLNKTNVEPTEKEP 457


>ref|XP_006858313.1| hypothetical protein AMTR_s00064p00100410 [Amborella trichopoda]
            gi|548862420|gb|ERN19780.1| hypothetical protein
            AMTR_s00064p00100410 [Amborella trichopoda]
          Length = 471

 Score =  797 bits (2059), Expect = 0.0
 Identities = 359/430 (83%), Positives = 400/430 (93%)
 Frame = -1

Query: 1625 LEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRIK 1446
            LEYQ+GFGN FSSEA+ GALP+ QNSPL+CPFGLYAEQISGT+FT+PRKLNQRSWLYRIK
Sbjct: 12   LEYQSGFGNVFSSEAMGGALPRDQNSPLLCPFGLYAEQISGTAFTAPRKLNQRSWLYRIK 71

Query: 1445 PSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAGS 1266
            PSVTHEPF  R+P H  LVSEFN S+SSATPTQLRWKP ++PE+PTDFIDGLYTICGAGS
Sbjct: 72   PSVTHEPFHPRVPTHAHLVSEFNQSSSSATPTQLRWKPADVPESPTDFIDGLYTICGAGS 131

Query: 1265 SYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILPQ 1086
            S+LRHG+A+HMY ANKSM++CAFC+ADGDFLIVPQKGRLW+TTECGRLQ+C GEIV+LPQ
Sbjct: 132  SFLRHGYAVHMYAANKSMDSCAFCSADGDFLIVPQKGRLWLTTECGRLQICPGEIVVLPQ 191

Query: 1085 GFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGYT 906
            GFRF+VDLPDGPSRGYVAE+FGTH QLP+LGPIGANGLAA RDFLVP A++E+   PGYT
Sbjct: 192  GFRFSVDLPDGPSRGYVAEVFGTHFQLPELGPIGANGLAASRDFLVPTAFFEEEHHPGYT 251

Query: 905  IVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTAP 726
            IVQK+GGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCP+NTVL DH DPS+NTVLTAP
Sbjct: 252  IVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPFNTVLFDHGDPSVNTVLTAP 311

Query: 725  TDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGASL 546
            ++KPGVAL+DFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAK DGFLPGGASL
Sbjct: 312  SEKPGVALVDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKKDGFLPGGASL 371

Query: 545  HSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDYY 366
            HSCMTPHGPDTKT+EAT++    + P +I DTMAFMFESCLIPR+CPWALESP +D DYY
Sbjct: 372  HSCMTPHGPDTKTFEATVSCEKSSEPFRIADTMAFMFESCLIPRICPWALESPDLDPDYY 431

Query: 365  QCWIGLKSHF 336
            +CW+GLKSHF
Sbjct: 432  KCWVGLKSHF 441


>gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
          Length = 461

 Score =  796 bits (2057), Expect = 0.0
 Identities = 366/448 (81%), Positives = 401/448 (89%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            +L+YQ+GFGNHFSSEAI GALP  QNSPL+CP+GLYAEQISGTSFTSPRKLNQRSWLYRI
Sbjct: 10   ELKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 69

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPSVTHEPF+ R+P H+KLVSEF+ SNS   PTQLRW+P +IP++ TDF+DGL+TICGAG
Sbjct: 70   KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPDSETDFVDGLFTICGAG 129

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS+LRHGFAIHMY ANK M++ AFCNADGDFL+VPQ GRLWI TECGRL V  GEI ++P
Sbjct: 130  SSFLRHGFAIHMYVANKGMKDSAFCNADGDFLLVPQTGRLWIETECGRLLVSPGEIAVIP 189

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRF++DLPDG SRGYVAEI+G H QLPDLGPIGANGLAAPRDFL P AW+EDG RP Y
Sbjct: 190  QGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEDGLRPEY 249

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
            TIVQK+GGELFTAKQDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL+DH DPSINTVLTA
Sbjct: 250  TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDHGDPSINTVLTA 309

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPGGAS
Sbjct: 310  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 369

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LHSCMTPHGPDT TYEATIA  N   P K+T TMAFMFES LIPRVC WALESPF+DHDY
Sbjct: 370  LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDY 429

Query: 368  YQCWIGLKSHFSGLSMNEDNRDLQNGQP 285
            YQCWIGLKSHFS +S+++ N +    +P
Sbjct: 430  YQCWIGLKSHFSRISLDKTNVEPTEKEP 457


>ref|XP_006280403.1| hypothetical protein CARUB_v10026329mg [Capsella rubella]
            gi|482549107|gb|EOA13301.1| hypothetical protein
            CARUB_v10026329mg [Capsella rubella]
          Length = 476

 Score =  796 bits (2055), Expect = 0.0
 Identities = 365/448 (81%), Positives = 398/448 (88%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            +L+YQ+GFGNHFSSEAI GALP  QNSPLICP+GLYAEQISGTSFTSPRKLNQRSWLYRI
Sbjct: 25   ELKYQSGFGNHFSSEAIAGALPLDQNSPLICPYGLYAEQISGTSFTSPRKLNQRSWLYRI 84

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPSVTHEPF+ R+P H+KLVSEF+ SNS   PTQLRW+P +IPE+ TDF+DGLYTICGAG
Sbjct: 85   KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPESATDFVDGLYTICGAG 144

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS+LRHGFAIHMY ANK M++ AFCNADGDFL+VPQ GRLWI TECGRL V  GEI ++P
Sbjct: 145  SSFLRHGFAIHMYVANKGMKDSAFCNADGDFLLVPQAGRLWIETECGRLLVSPGEIAVIP 204

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRF++DLPDG SRGYVAEI+G H QLPDLGPIGANGLAAPRDFL P AW+ED  RP Y
Sbjct: 205  QGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEDAVRPDY 264

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
            TI+QK+GGELFTAKQDFSPFNVVAWHGNYVPYKYDL KFCPYN VL+DH DPS+NTVLTA
Sbjct: 265  TIIQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLQKFCPYNAVLLDHGDPSVNTVLTA 324

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPGGAS
Sbjct: 325  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 384

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LHSCMTPHGPDT TYEATIA  N   P K+T TMAFMFES LIPRVC WALESPF+DHDY
Sbjct: 385  LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDY 444

Query: 368  YQCWIGLKSHFSGLSMNEDNRDLQNGQP 285
            YQCWIGLKSHFS + +N+ N +    +P
Sbjct: 445  YQCWIGLKSHFSRIDLNKTNIEPTEKEP 472


>ref|XP_002298900.1| hypothetical protein POPTR_0001s38310g [Populus trichocarpa]
            gi|222846158|gb|EEE83705.1| hypothetical protein
            POPTR_0001s38310g [Populus trichocarpa]
          Length = 464

 Score =  795 bits (2054), Expect = 0.0
 Identities = 371/444 (83%), Positives = 404/444 (90%), Gaps = 5/444 (1%)
 Frame = -1

Query: 1646 TCNFPCD-LEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQ 1470
            T +FP D  +Y +GFGN F SE+I G+LP+ QNSPL+CP+GLYAEQISGTSFTSP KLNQ
Sbjct: 11   TNDFPSDDHDYLSGFGNTFESESIPGSLPRRQNSPLLCPYGLYAEQISGTSFTSPHKLNQ 70

Query: 1469 RSWLYRIKPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIP----ETPTDF 1302
            RSWLYRIKPSVTHEPF++R PRH+KLVSEF+ SNS  TPTQLRWKP  +       P DF
Sbjct: 71   RSWLYRIKPSVTHEPFQARFPRHDKLVSEFDKSNSYTTPTQLRWKPKPVDTVEESAPIDF 130

Query: 1301 IDGLYTICGAGSSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRL 1122
            ++GLYT+CGAGSS+LRHGFAIHMYTANKSM++ AFCNADGDFLIVPQKGRLWI TECG+L
Sbjct: 131  VEGLYTVCGAGSSFLRHGFAIHMYTANKSMDDRAFCNADGDFLIVPQKGRLWIATECGKL 190

Query: 1121 QVCLGEIVILPQGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPV 942
            QV  GEIV++PQGFRFAVDLPDGPSRGYV+EIFGTH QLPDLGPIGANGLAAPRDFLVP 
Sbjct: 191  QVSPGEIVVIPQGFRFAVDLPDGPSRGYVSEIFGTHFQLPDLGPIGANGLAAPRDFLVPK 250

Query: 941  AWYEDGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDH 762
            AW+EDGSRPGYT+VQKYGGELF AKQDFSPFNVVAWHGNYVPYKYDL+KFCPYNTVL DH
Sbjct: 251  AWFEDGSRPGYTVVQKYGGELFVAKQDFSPFNVVAWHGNYVPYKYDLNKFCPYNTVLFDH 310

Query: 761  SDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEA 582
            SDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEA
Sbjct: 311  SDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEA 370

Query: 581  KADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPW 402
            KADGFLPGGASLHSCMTPHGPDTKTYEATI  G++AGP KIT+T+AFMFESCLIPR+   
Sbjct: 371  KADGFLPGGASLHSCMTPHGPDTKTYEATIESGHDAGPSKITNTLAFMFESCLIPRISLC 430

Query: 401  ALESPFMDHDYYQCWIGLKSHFSG 330
            AL+SP MD+DYYQCW GLKSHFSG
Sbjct: 431  ALKSPLMDNDYYQCWTGLKSHFSG 454


>gb|EOY13161.1| Homogentisate 1,2-dioxygenase isoform 2, partial [Theobroma cacao]
          Length = 421

 Score =  794 bits (2050), Expect = 0.0
 Identities = 370/432 (85%), Positives = 393/432 (90%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            DLEYQ+GFGNHFSSEAI GALP+GQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI
Sbjct: 2    DLEYQSGFGNHFSSEAIAGALPRGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 61

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPSVTHEPF  R   H+KLVSEF+ SN+ A PTQLRWKPV+IP+TPTDFIDGL+TICGAG
Sbjct: 62   KPSVTHEPFWPRDSSHKKLVSEFDGSNTVANPTQLRWKPVDIPDTPTDFIDGLFTICGAG 121

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS+LRHG+AIHMYTANKSM+NCAFCNADGDFL+VPQ+GRLWITTECGRLQV  GEI +LP
Sbjct: 122  SSFLRHGYAIHMYTANKSMDNCAFCNADGDFLVVPQQGRLWITTECGRLQVSPGEIAVLP 181

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRF VDLPDGPSRGYVAE+FG            ANGLAA RDFL P AW+E+  RPG+
Sbjct: 182  QGFRFVVDLPDGPSRGYVAEVFG------------ANGLAASRDFLAPTAWFEEHPRPGF 229

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
            TIVQK+GGELF A+QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL+DH DPSINTVLTA
Sbjct: 230  TIVQKFGGELFNARQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLVDHGDPSINTVLTA 289

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFVIFP RWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS
Sbjct: 290  PTDKPGVALLDFVIFPSRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 349

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LHSCMTPHGPDTKTYEATIA G EAGPHKITDTMAFMFES L+PR CPW LESPF DHDY
Sbjct: 350  LHSCMTPHGPDTKTYEATIARGYEAGPHKITDTMAFMFESFLMPRTCPWVLESPFRDHDY 409

Query: 368  YQCWIGLKSHFS 333
            YQCW+GLKSHFS
Sbjct: 410  YQCWVGLKSHFS 421


>ref|NP_200219.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
            gi|30696407|ref|NP_851187.1| homogentisate
            1,2-dioxygenase [Arabidopsis thaliana]
            gi|13432134|sp|Q9ZRA2.2|HGD_ARATH RecName:
            Full=Homogentisate 1,2-dioxygenase; AltName:
            Full=Homogentisate oxygenase; AltName: Full=Homogentisic
            acid oxidase; AltName: Full=Homogentisicase
            gi|7108615|gb|AAF36499.1|AF130845_1 homogentisate
            1,2-dioxygenase [Arabidopsis thaliana]
            gi|8809579|dbj|BAA97130.1| homogentisate 1,2-dioxygenase
            [Arabidopsis thaliana] gi|22655252|gb|AAM98216.1|
            homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
            gi|33942055|gb|AAQ55280.1| At5g54080 [Arabidopsis
            thaliana] gi|332009064|gb|AED96447.1| homogentisate
            1,2-dioxygenase [Arabidopsis thaliana]
            gi|332009065|gb|AED96448.1| homogentisate 1,2-dioxygenase
            [Arabidopsis thaliana]
          Length = 461

 Score =  789 bits (2037), Expect = 0.0
 Identities = 362/448 (80%), Positives = 398/448 (88%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            +L+YQ+GFGNHFSSEAI GALP  QNSPL+CP+GLYAEQISGTSFTSPRKLNQRSWLYR+
Sbjct: 10   ELKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRV 69

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPSVTHEPF+ R+P H+KLVSEF+ SNS   PTQLRW+P +IP++  DF+DGL+TICGAG
Sbjct: 70   KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPDSEIDFVDGLFTICGAG 129

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS+LRHGFAIHMY AN  M++ AFCNADGDFL+VPQ GRLWI TECGRL V  GEI ++P
Sbjct: 130  SSFLRHGFAIHMYVANTGMKDSAFCNADGDFLLVPQTGRLWIETECGRLLVTPGEIAVIP 189

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRF++DLPDG SRGYVAEI+G H QLPDLGPIGANGLAA RDFL P AW+EDG RP Y
Sbjct: 190  QGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAASRDFLAPTAWFEDGLRPEY 249

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
            TIVQK+GGELFTAKQDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL+DH DPSINTVLTA
Sbjct: 250  TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDHGDPSINTVLTA 309

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPGGAS
Sbjct: 310  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 369

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LHSCMTPHGPDT TYEATIA  N   P K+T TMAFMFES LIPRVC WALESPF+DHDY
Sbjct: 370  LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDY 429

Query: 368  YQCWIGLKSHFSGLSMNEDNRDLQNGQP 285
            YQCWIGLKSHFS +S+++ N +    +P
Sbjct: 430  YQCWIGLKSHFSRISLDKTNVESTEKEP 457


>ref|XP_004510037.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Cicer arietinum]
          Length = 455

 Score =  788 bits (2036), Expect = 0.0
 Identities = 364/432 (84%), Positives = 397/432 (91%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            D  Y +GFGNHFSSEAI GALP GQNSPLICPFGLYAEQISGTSFT+PR LN  SWLYRI
Sbjct: 9    DFNYLSGFGNHFSSEAIAGALPVGQNSPLICPFGLYAEQISGTSFTTPRTLNLFSWLYRI 68

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPSVTHEPF++R+P + K++SEFN SNSSA PTQLRWKP +IP++PTDFIDGL T+CG+G
Sbjct: 69   KPSVTHEPFKARVPSNGKILSEFNDSNSSANPTQLRWKPEDIPDSPTDFIDGLSTVCGSG 128

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS++RHG+AIHMYTANKSM+NCAFCNADGDFLIVPQ+GRL ITTECGRL+V  G+I I+P
Sbjct: 129  SSFMRHGYAIHMYTANKSMDNCAFCNADGDFLIVPQQGRLLITTECGRLKVSPGDIAIIP 188

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRF V+LPDGPSRGYVAEIFGTH QLPDLGPIGANGLAAPRDFLVP AW+ED S PGY
Sbjct: 189  QGFRFNVNLPDGPSRGYVAEIFGTHFQLPDLGPIGANGLAAPRDFLVPTAWFEDKSYPGY 248

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
            TIVQK+GGELFTA QDFSPFNVVAWHGNYVPYKYDLSKFCPYNT L DHSDPSINTVLTA
Sbjct: 249  TIVQKFGGELFTAVQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTALYDHSDPSINTVLTA 308

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI+G YEAK DGFLPGGAS
Sbjct: 309  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIHGNYEAKVDGFLPGGAS 368

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LH+CMTPHGPDTK+YEATIA G+  GPHKITDT+AFMFES LIPR+   ALESPF+DHDY
Sbjct: 369  LHNCMTPHGPDTKSYEATIARGDNVGPHKITDTLAFMFESSLIPRISRSALESPFLDHDY 428

Query: 368  YQCWIGLKSHFS 333
            YQCWIGL+SHF+
Sbjct: 429  YQCWIGLRSHFT 440


>gb|ESW05061.1| hypothetical protein PHAVU_011G148800g [Phaseolus vulgaris]
          Length = 458

 Score =  788 bits (2035), Expect = 0.0
 Identities = 363/432 (84%), Positives = 395/432 (91%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            D  Y +GFGNH SSEA+ GALP+GQNSPLICPFGLYAEQISGTSFTSPR  N+ SW YRI
Sbjct: 15   DFTYLSGFGNHLSSEALPGALPEGQNSPLICPFGLYAEQISGTSFTSPRNRNRCSWFYRI 74

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPSVTHEPF+ R+P + K+ SEFN SNSSA PTQLRWKP++ P++PTDFIDGL TICG+G
Sbjct: 75   KPSVTHEPFKPRVPSNWKIFSEFNSSNSSANPTQLRWKPMDAPDSPTDFIDGLSTICGSG 134

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS++RHG+AIHMY ANKSM+NCAFCNADGDFLIVPQ+GRL ITTECGRL+V  GEI ILP
Sbjct: 135  SSFMRHGYAIHMYAANKSMDNCAFCNADGDFLIVPQQGRLLITTECGRLKVSPGEIAILP 194

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRF+V+LPDGPSRGYVAEIFGTH +LPDLGPIGANGLAAPRDFLVP AW+ED S PGY
Sbjct: 195  QGFRFSVNLPDGPSRGYVAEIFGTHFELPDLGPIGANGLAAPRDFLVPTAWFEDKSYPGY 254

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
            TIVQK+GGELF A QDFSPFNVVAWHGNY PYKYDLSKFCPYNTVL DHSDPSINTVLTA
Sbjct: 255  TIVQKFGGELFAAVQDFSPFNVVAWHGNYFPYKYDLSKFCPYNTVLFDHSDPSINTVLTA 314

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLI+GGYEAKADGFLPGGAS
Sbjct: 315  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIHGGYEAKADGFLPGGAS 374

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LH+CMTPHGPDTK+YEATIA GN+ GP KITDTMAFMFES LIPR+  WALESPF+D DY
Sbjct: 375  LHNCMTPHGPDTKSYEATIARGNDIGPSKITDTMAFMFESSLIPRISQWALESPFLDQDY 434

Query: 368  YQCWIGLKSHFS 333
            YQCWIGL+SHF+
Sbjct: 435  YQCWIGLRSHFT 446


>gb|AAD00360.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
          Length = 461

 Score =  787 bits (2033), Expect = 0.0
 Identities = 361/448 (80%), Positives = 398/448 (88%)
 Frame = -1

Query: 1628 DLEYQTGFGNHFSSEAIDGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRI 1449
            +L+YQ+GFGNHFSSEAI GALP  QNSPL+CP+GLYAEQISGTSFTSPRKLNQRSWLYR+
Sbjct: 10   ELKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRV 69

Query: 1448 KPSVTHEPFRSRIPRHEKLVSEFNHSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAG 1269
            KPSVTHEPF+ R+P H+KLVSEF+ SNS   PTQLRW+P +IP++  DF+DGL+TICGAG
Sbjct: 70   KPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPDSEIDFVDGLFTICGAG 129

Query: 1268 SSYLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQKGRLWITTECGRLQVCLGEIVILP 1089
            SS+LRHGFAIHMY AN  M++ AFCNADGDFL+VPQ GRLWI TECGRL V  GEI ++P
Sbjct: 130  SSFLRHGFAIHMYVANTGMKDSAFCNADGDFLLVPQTGRLWIETECGRLLVTPGEIAVIP 189

Query: 1088 QGFRFAVDLPDGPSRGYVAEIFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGY 909
            QGFRF++DLPDG SRGYVAEI+G H QLPDLGPIGANGLAA RDFL P AW+EDG RP Y
Sbjct: 190  QGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAASRDFLAPTAWFEDGLRPEY 249

Query: 908  TIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTA 729
            TIVQK+GGELFTAKQDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL+DH DPSINTVLTA
Sbjct: 250  TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDHGDPSINTVLTA 309

Query: 728  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 549
            PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPGGAS
Sbjct: 310  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGAS 369

Query: 548  LHSCMTPHGPDTKTYEATIALGNEAGPHKITDTMAFMFESCLIPRVCPWALESPFMDHDY 369
            LHSCMTPHGPDT TYEATIA  N   P K+T TMAFMFES LIPRVC WALESPF+DH+Y
Sbjct: 370  LHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHEY 429

Query: 368  YQCWIGLKSHFSGLSMNEDNRDLQNGQP 285
            YQCWIGLKSHFS +S+++ N +    +P
Sbjct: 430  YQCWIGLKSHFSRISLDKTNVESTEKEP 457


Top