BLASTX nr result
ID: Rehmannia23_contig00013230
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia23_contig00013230 (1571 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006358956.1| PREDICTED: LOW QUALITY PROTEIN: homogentisat... 798 0.0 gb|AAF73132.1|AF149017_1 homogentisate 1,2-dioxygenase [Solanum ... 798 0.0 ref|XP_004251883.1| PREDICTED: LOW QUALITY PROTEIN: homogentisat... 797 0.0 ref|XP_002285298.1| PREDICTED: homogentisate 1,2-dioxygenase [Vi... 787 0.0 gb|EXB75014.1| Homogentisate 1,2-dioxygenase [Morus notabilis] 786 0.0 gb|EMJ15072.1| hypothetical protein PRUPE_ppa005219mg [Prunus pe... 780 0.0 ref|XP_006494848.1| PREDICTED: homogentisate 1,2-dioxygenase-lik... 778 0.0 ref|XP_006858313.1| hypothetical protein AMTR_s00064p00100410 [A... 767 0.0 gb|EOY13160.1| Homogentisate 1,2-dioxygenase isoform 1 [Theobrom... 766 0.0 ref|XP_002518387.1| homogentisate 1,2-dioxygenase, putative [Ric... 766 0.0 ref|XP_004137214.1| PREDICTED: homogentisate 1,2-dioxygenase-lik... 766 0.0 ref|XP_002298900.1| hypothetical protein POPTR_0001s38310g [Popu... 744 0.0 ref|XP_006280403.1| hypothetical protein CARUB_v10026329mg [Caps... 743 0.0 gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] 741 0.0 ref|XP_003540068.1| PREDICTED: homogentisate 1,2-dioxygenase-lik... 739 0.0 ref|NP_200219.1| homogentisate 1,2-dioxygenase [Arabidopsis thal... 738 0.0 ref|XP_002864301.1| homogentisate 1,2-dioxygenase [Arabidopsis l... 738 0.0 gb|AAD00360.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] 737 0.0 gb|ESW05061.1| hypothetical protein PHAVU_011G148800g [Phaseolus... 736 0.0 ref|XP_003527216.1| PREDICTED: homogentisate 1,2-dioxygenase-lik... 734 0.0 >ref|XP_006358956.1| PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase-like [Solanum tuberosum] Length = 492 Score = 798 bits (2061), Expect = 0.0 Identities = 365/415 (87%), Positives = 394/415 (94%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CP+GLYAEQISGTSFTSPRKLNQRSWLYR+KPSVTHEPFRPR+P+H KLVSEFN+SNSS Sbjct: 42 ICPFGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSS 101 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 ATPTQLRWKP E+PE+PTDF+DGLYT+CGAGSSYLRHGFAIHMY ANKSM++ AFC+ADG Sbjct: 102 ATPTQLRWKPVEIPETPTDFIDGLYTICGAGSSYLRHGFAIHMYTANKSMENSAFCNADG 161 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFLIVPQ+GRLWITTECGRLQV PGE+V+LPQG+RF VDLPDGPSRGYVAE FGTH QLP Sbjct: 162 DFLIVPQKGRLWITTECGRLQVCPGEIVILPQGYRFAVDLPDGPSRGYVAETFGTHLQLP 221 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFLVPVAW+E SRPGYTIVQK+GGELFTAKQDFSPFNVVAWHGNY Sbjct: 222 DLGPIGANGLAAPRDFLVPVAWYEDGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNY 281 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDLSKFCPYNTVLMDH DPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 282 VPYKYDLSKFCPYNTVLMDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 341 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLI GGYEAKADGF PGGASLHSCMTPHGPDTKTYEATIALGNEAGP R Sbjct: 342 YYHRNCMSEFMGLINGGYEAKADGFHPGGASLHSCMTPHGPDTKTYEATIALGNEAGPHR 401 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVE 1245 I++TMAFMFESCL+PRVCPWALESP+MD DYYQCWIGLKSHF+ ++ED D++ Sbjct: 402 IADTMAFMFESCLIPRVCPWALESPFMDHDYYQCWIGLKSHFSGLSMNEDNVDLQ 456 >gb|AAF73132.1|AF149017_1 homogentisate 1,2-dioxygenase [Solanum lycopersicum] Length = 477 Score = 798 bits (2061), Expect = 0.0 Identities = 365/422 (86%), Positives = 396/422 (93%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CP+GLYAEQISGTSFTSPRKLNQRSWLYR+KPSVTHEPFRPR+P+H KLVSEFN+SNSS Sbjct: 39 ICPFGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSS 98 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 ATPTQLRWKP E+PE+PTDF+DGLYT+CGAGSSYLRHGFAIHMY ANKSM++ AFC+ADG Sbjct: 99 ATPTQLRWKPVEIPETPTDFIDGLYTICGAGSSYLRHGFAIHMYTANKSMENSAFCNADG 158 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFLIVPQ+GRLWITTECGRLQV PGE+V+LPQG+RF VDLPDGPSRGYVAE FGTH QLP Sbjct: 159 DFLIVPQKGRLWITTECGRLQVCPGEIVILPQGYRFAVDLPDGPSRGYVAETFGTHLQLP 218 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFLVPVAW+ SRPGYTIVQK+GGELFTAKQDFSPFNVVAWHGNY Sbjct: 219 DLGPIGANGLAAPRDFLVPVAWYGDGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNY 278 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDLSKFCPYNTVLMDH DPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 279 VPYKYDLSKFCPYNTVLMDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 338 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYGGYEAKADGF PGGASLHSCMTPHGPDTKT+EATIALGNEAGP R Sbjct: 339 YYHRNCMSEFMGLIYGGYEAKADGFHPGGASLHSCMTPHGPDTKTFEATIALGNEAGPHR 398 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVENEHGE 1260 I++TMAFMFESCL+PRVCPWALESP+MD DYYQCWIGLKSHF+ ++ED D++ Sbjct: 399 IADTMAFMFESCLVPRVCPWALESPFMDHDYYQCWIGLKSHFSGLSMNEDNVDLQKGKTH 458 Query: 1261 KK 1266 +K Sbjct: 459 RK 460 >ref|XP_004251883.1| PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase [Solanum lycopersicum] Length = 480 Score = 797 bits (2059), Expect = 0.0 Identities = 364/415 (87%), Positives = 394/415 (94%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CP+GLYAEQISGTSFTSPRKLNQRSWLYR+KPSVTHEPFRPR+P+H KLVSEFN+SNSS Sbjct: 42 ICPFGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFRPRMPRHEKLVSEFNQSNSS 101 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 ATPTQLRWKP E+PE+PTDF+DGLYT+CGAGSSYLRHGFAIHMY ANKSM++ AFC+ADG Sbjct: 102 ATPTQLRWKPVEIPETPTDFIDGLYTICGAGSSYLRHGFAIHMYTANKSMENSAFCNADG 161 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFLIVPQ+GRLWITTECGRLQV PGE+V+LPQG+RF VDLPDGPSRGYVAE FGTH QLP Sbjct: 162 DFLIVPQKGRLWITTECGRLQVCPGEIVILPQGYRFAVDLPDGPSRGYVAETFGTHLQLP 221 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFLVPVAW+ SRPGYTIVQK+GGELFTAKQDFSPFNVVAWHGNY Sbjct: 222 DLGPIGANGLAAPRDFLVPVAWYGDGSRPGYTIVQKYGGELFTAKQDFSPFNVVAWHGNY 281 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDLSKFCPYNTVLMDH DPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 282 VPYKYDLSKFCPYNTVLMDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 341 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYGGYEAKADGF PGGASLHSCMTPHGPDTKT+EATIALGNEAGP R Sbjct: 342 YYHRNCMSEFMGLIYGGYEAKADGFHPGGASLHSCMTPHGPDTKTFEATIALGNEAGPHR 401 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVE 1245 I++TMAFMFESCL+PRVCPWALESP+MD DYYQCWIGLKSHF+ ++ED D++ Sbjct: 402 IADTMAFMFESCLVPRVCPWALESPFMDHDYYQCWIGLKSHFSGLSMNEDNVDLQ 456 >ref|XP_002285298.1| PREDICTED: homogentisate 1,2-dioxygenase [Vitis vinifera] gi|302142933|emb|CBI20228.3| unnamed protein product [Vitis vinifera] Length = 463 Score = 787 bits (2032), Expect = 0.0 Identities = 358/417 (85%), Positives = 390/417 (93%) Frame = +1 Query: 4 CPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSSA 183 CP+GLYAEQISGT FT+PRK NQ SWLYR+KPSVTHEPF+PRVP HGKLVSEFN+SNSS Sbjct: 47 CPFGLYAEQISGTPFTAPRKQNQFSWLYRIKPSVTHEPFKPRVPSHGKLVSEFNQSNSST 106 Query: 184 TPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADGD 363 PTQLRWKP E+P+SPTDF+DGLYTVCGAGSS+LRHG+AIHMY ANKSMD+CAFC+ADGD Sbjct: 107 NPTQLRWKPVEIPDSPTDFIDGLYTVCGAGSSFLRHGYAIHMYTANKSMDNCAFCNADGD 166 Query: 364 FLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLPD 543 FLIVPQ+GRL ITTECG+LQVSPGE+VVLP GFRFVVDLPDGPSRGYVAEIFG HFQLPD Sbjct: 167 FLIVPQKGRLSITTECGKLQVSPGEIVVLPHGFRFVVDLPDGPSRGYVAEIFGAHFQLPD 226 Query: 544 LGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYV 723 LGPIGANGLAASRDFLVPVAW+E SRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYV Sbjct: 227 LGPIGANGLAASRDFLVPVAWYEECSRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYV 286 Query: 724 PYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPPY 903 PYKYDLSKFCP NTVL DH DPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPPY Sbjct: 287 PYKYDLSKFCPVNTVLKDHADPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPY 346 Query: 904 YHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRRI 1083 YHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKT+EAT+A G +AGP RI Sbjct: 347 YHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTFEATVAHGKDAGPFRI 406 Query: 1084 SNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVENEH 1254 +NTMAFMFESCL+PR+CPWAL+SP +D DYYQCW+GL+SHF+ E S++ Q ++N H Sbjct: 407 TNTMAFMFESCLIPRICPWALDSPSIDHDYYQCWVGLRSHFSREEASDESQTIQNGH 463 >gb|EXB75014.1| Homogentisate 1,2-dioxygenase [Morus notabilis] Length = 460 Score = 786 bits (2029), Expect = 0.0 Identities = 361/420 (85%), Positives = 388/420 (92%), Gaps = 3/420 (0%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CPY LYAEQISGTSFTSPRKLN RSWLYR+KPSVTHEPF+PRVP HGKL+SEF+ SNSS Sbjct: 38 LCPYSLYAEQISGTSFTSPRKLNLRSWLYRIKPSVTHEPFKPRVPSHGKLLSEFDRSNSS 97 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 ATPTQLRWKP E+P+SPTDFVDGL+TVCGAGSS+LRHGFA+HMY ANKSMD+CAFC+ADG Sbjct: 98 ATPTQLRWKPVEIPDSPTDFVDGLFTVCGAGSSFLRHGFAVHMYTANKSMDNCAFCNADG 157 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFLIVPQ+GRLWITTECG+LQVSPGEV +LPQGFRF VDLPDGPSRGYVAEIFG HFQLP Sbjct: 158 DFLIVPQKGRLWITTECGKLQVSPGEVAILPQGFRFAVDLPDGPSRGYVAEIFGAHFQLP 217 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFL P AWFE RPGYTIVQKFGGELFTAKQDFSPFNVVAWHGN+ Sbjct: 218 DLGPIGANGLAAPRDFLAPTAWFEDGRRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNH 277 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDLSKFCPYNTVL+DH DPSINTVLTAPTD+PGVALLDFV+FPPRWLVAEHTFRPP Sbjct: 278 VPYKYDLSKFCPYNTVLVDHSDPSINTVLTAPTDKPGVALLDFVVFPPRWLVAEHTFRPP 337 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYGGYEAKADGFLPGG+SLHSCMTPHGPDTKTYEATIA GNE GP R Sbjct: 338 YYHRNCMSEFMGLIYGGYEAKADGFLPGGSSLHSCMTPHGPDTKTYEATIARGNEPGPFR 397 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACE---IVSEDGQDVENE 1251 I +TMAFMFESCLMPRVC WALESP+MD DYYQCWIGL+SHF E S+D +V+ + Sbjct: 398 IKDTMAFMFESCLMPRVCAWALESPFMDHDYYQCWIGLRSHFTWESRNATSKDDNEVDGK 457 >gb|EMJ15072.1| hypothetical protein PRUPE_ppa005219mg [Prunus persica] Length = 472 Score = 780 bits (2015), Expect = 0.0 Identities = 357/422 (84%), Positives = 387/422 (91%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CPYGLYAEQISGTSFTSPRKLN R+WLYRVKPSVTHEPF+P H KLVSEF +SNSS Sbjct: 52 LCPYGLYAEQISGTSFTSPRKLNHRTWLYRVKPSVTHEPFKPLESSHRKLVSEFTDSNSS 111 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 TPTQLRWKP ++PE+PTDFV+GLYTVCGAGSS+LRHGFAIHMY ANKSMD+CAFC+ADG Sbjct: 112 TTPTQLRWKPVDIPETPTDFVEGLYTVCGAGSSFLRHGFAIHMYTANKSMDNCAFCNADG 171 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFLIVPQ GRLWITTECG+LQ+SPGE+ VLPQGFRF VDLPDGPSRGYVAE+FGTHFQLP Sbjct: 172 DFLIVPQTGRLWITTECGKLQISPGEIAVLPQGFRFAVDLPDGPSRGYVAEVFGTHFQLP 231 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFLVP AWFE RPGY I+QKFGGELFTAKQ+FSPFNVVAWHGNY Sbjct: 232 DLGPIGANGLAAPRDFLVPTAWFEDSYRPGYVIIQKFGGELFTAKQEFSPFNVVAWHGNY 291 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 PYKYDL+ FCP+NTVL DHGDPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 292 APYKYDLTTFCPFNTVLFDHGDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 351 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIA GNEAGP R Sbjct: 352 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIARGNEAGPSR 411 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVENEHGE 1260 IS+T+AFMFESCL+PR+CPWALESP++D DYYQCWIGL+SHF E S D++N GE Sbjct: 412 ISDTLAFMFESCLIPRICPWALESPFIDRDYYQCWIGLRSHFTREGASAKDGDIQN--GE 469 Query: 1261 KK 1266 K+ Sbjct: 470 KE 471 >ref|XP_006494848.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Citrus sinensis] Length = 478 Score = 778 bits (2010), Expect = 0.0 Identities = 351/405 (86%), Positives = 385/405 (95%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 VCP+GLYAEQISGTSFTSPRKLNQRSWLYR+KPS THEPF+PRVP HGKLVSEF++SNS Sbjct: 52 VCPFGLYAEQISGTSFTSPRKLNQRSWLYRIKPSATHEPFKPRVPAHGKLVSEFDKSNSY 111 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 TPTQLRWKP ++P+SPTDF+DGLYT+CGAGSS+LRHG+AIHMY ANKSMD+CAFC+ADG Sbjct: 112 TTPTQLRWKPVDIPDSPTDFIDGLYTICGAGSSFLRHGYAIHMYTANKSMDNCAFCNADG 171 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFL+VPQ+GRLWI TECG+L+VSPGE+ VLPQGFRF V LPDGPSRGY+AEIFGTHFQLP Sbjct: 172 DFLVVPQKGRLWIATECGKLEVSPGEIAVLPQGFRFAVSLPDGPSRGYIAEIFGTHFQLP 231 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFLVP AWFE SR GYTIVQKFGGELFTA+QDFSPFNVVAWHGNY Sbjct: 232 DLGPIGANGLAAPRDFLVPTAWFEEGSRLGYTIVQKFGGELFTARQDFSPFNVVAWHGNY 291 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDLSKFCP+NTVL+DHGDPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 292 VPYKYDLSKFCPFNTVLVDHGDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 351 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLI GGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIA G+EAGP + Sbjct: 352 YYHRNCMSEFMGLIRGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIARGSEAGPYK 411 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACE 1215 I++TMAFMFESCL+PR+CPWALESP+MD DYY+CWIGL+SHF+ E Sbjct: 412 ITDTMAFMFESCLIPRICPWALESPFMDHDYYRCWIGLRSHFSYE 456 >ref|XP_006858313.1| hypothetical protein AMTR_s00064p00100410 [Amborella trichopoda] gi|548862420|gb|ERN19780.1| hypothetical protein AMTR_s00064p00100410 [Amborella trichopoda] Length = 471 Score = 767 bits (1981), Expect = 0.0 Identities = 346/419 (82%), Positives = 387/419 (92%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CP+GLYAEQISGT+FT+PRKLNQRSWLYR+KPSVTHEPF PRVP H LVSEFN+S+SS Sbjct: 40 LCPFGLYAEQISGTAFTAPRKLNQRSWLYRIKPSVTHEPFHPRVPTHAHLVSEFNQSSSS 99 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 ATPTQLRWKPA+VPESPTDF+DGLYT+CGAGSS+LRHG+A+HMY ANKSMD CAFCSADG Sbjct: 100 ATPTQLRWKPADVPESPTDFIDGLYTICGAGSSFLRHGYAVHMYAANKSMDSCAFCSADG 159 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFLIVPQ+GRLW+TTECGRLQ+ PGE+VVLPQGFRF VDLPDGPSRGYVAE+FGTHFQLP Sbjct: 160 DFLIVPQKGRLWLTTECGRLQICPGEIVVLPQGFRFSVDLPDGPSRGYVAEVFGTHFQLP 219 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 +LGPIGANGLAASRDFLVP A+FE PGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY Sbjct: 220 ELGPIGANGLAASRDFLVPTAFFEEEHHPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 279 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDLSKFCP+NTVL DHGDPS+NTVLTAP+++PGVAL+DFVIFPPRWLVAEHTFRPP Sbjct: 280 VPYKYDLSKFCPFNTVLFDHGDPSVNTVLTAPSEKPGVALVDFVIFPPRWLVAEHTFRPP 339 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYGGYEAK DGFLPGGASLHSCMTPHGPDTKT+EAT++ + P R Sbjct: 340 YYHRNCMSEFMGLIYGGYEAKKDGFLPGGASLHSCMTPHGPDTKTFEATVSCEKSSEPFR 399 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVENEHG 1257 I++TMAFMFESCL+PR+CPWALESP +DPDYY+CW+GLKSHF + V++ Q + G Sbjct: 400 IADTMAFMFESCLIPRICPWALESPDLDPDYYKCWVGLKSHFLRKEVTQYVQKINLSDG 458 >gb|EOY13160.1| Homogentisate 1,2-dioxygenase isoform 1 [Theobroma cacao] Length = 451 Score = 766 bits (1979), Expect = 0.0 Identities = 347/403 (86%), Positives = 377/403 (93%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CP+GLYAEQISGTSFTSPRKLNQRSWLYR+KPSVTHEPF PR H KLVSEF+ SN+ Sbjct: 49 ICPFGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFWPRDSSHKKLVSEFDGSNTV 108 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 A PTQLRWKP ++P++PTDF+DGL+T+CGAGSS+LRHG+AIHMY ANKSMD+CAFC+ADG Sbjct: 109 ANPTQLRWKPVDIPDTPTDFIDGLFTICGAGSSFLRHGYAIHMYTANKSMDNCAFCNADG 168 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFL+VPQ+GRLWITTECGRLQVSPGE+ VLPQGFRFVVDLPDGPSRGYVAE+FGTHFQLP Sbjct: 169 DFLVVPQQGRLWITTECGRLQVSPGEIAVLPQGFRFVVDLPDGPSRGYVAEVFGTHFQLP 228 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAASRDFL P AWFE RPG+TIVQKFGGELF A+QDFSPFNVVAWHGNY Sbjct: 229 DLGPIGANGLAASRDFLAPTAWFEEHPRPGFTIVQKFGGELFNARQDFSPFNVVAWHGNY 288 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDLSKFCPYNTVL+DHGDPSINTVLTAPTD+PGVALLDFVIFP RWLVAEHTFRPP Sbjct: 289 VPYKYDLSKFCPYNTVLVDHGDPSINTVLTAPTDKPGVALLDFVIFPSRWLVAEHTFRPP 348 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIA G EAGP + Sbjct: 349 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIARGYEAGPHK 408 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFA 1209 I++TMAFMFES LMPR CPW LESP+ D DYYQCW+GLKSHF+ Sbjct: 409 ITDTMAFMFESFLMPRTCPWVLESPFRDHDYYQCWVGLKSHFS 451 >ref|XP_002518387.1| homogentisate 1,2-dioxygenase, putative [Ricinus communis] gi|223542482|gb|EEF44023.1| homogentisate 1,2-dioxygenase, putative [Ricinus communis] Length = 457 Score = 766 bits (1978), Expect = 0.0 Identities = 348/412 (84%), Positives = 387/412 (93%), Gaps = 1/412 (0%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNS- 177 +CPYGLYAEQISG+SFTSPRKL+QRSWLYR+KPSVTHEPF+PRVP HGK+VSEF++++S Sbjct: 44 ICPYGLYAEQISGSSFTSPRKLSQRSWLYRIKPSVTHEPFKPRVPSHGKIVSEFDKTDSC 103 Query: 178 SATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSAD 357 + TPTQLRWKP ++P+SPTDF+DGL+T+CGAGSS+LRHGFAIHMY ANKSM +CA C+AD Sbjct: 104 TTTPTQLRWKPVDIPDSPTDFIDGLFTICGAGSSFLRHGFAIHMYTANKSMGNCALCNAD 163 Query: 358 GDFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQL 537 GDFL+VPQEGRLWITTECG+LQVSPGEVVVLPQGFRF VDLPDGPSRGYVAEIFGTHFQL Sbjct: 164 GDFLVVPQEGRLWITTECGKLQVSPGEVVVLPQGFRFAVDLPDGPSRGYVAEIFGTHFQL 223 Query: 538 PDLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGN 717 PDLGPIGANGLAA RDFLVP AW+E PGYTI+QKFGGELFTAKQDFSPFNVVAWHGN Sbjct: 224 PDLGPIGANGLAAPRDFLVPKAWYEEGPCPGYTIIQKFGGELFTAKQDFSPFNVVAWHGN 283 Query: 718 YVPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRP 897 +VPYKYDL KFCPYNTVL+DH DPSINTVLTA TD+PGVALLDFVIFPPRWLVAEHTFRP Sbjct: 284 FVPYKYDLKKFCPYNTVLIDHSDPSINTVLTASTDKPGVALLDFVIFPPRWLVAEHTFRP 343 Query: 898 PYYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPR 1077 PYYHRNCMSEFMGLIYGGYEAKADGF+PGGASLHSCMTPHGPDTKTYEATIA GN+AGP Sbjct: 344 PYYHRNCMSEFMGLIYGGYEAKADGFVPGGASLHSCMTPHGPDTKTYEATIARGNDAGPS 403 Query: 1078 RISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDG 1233 RI++TMAFMFESCL+PR+C WA+ESP++D DYYQCWIGLKSHF+ S++G Sbjct: 404 RITDTMAFMFESCLIPRICLWAVESPFIDHDYYQCWIGLKSHFSHGADSKNG 455 >ref|XP_004137214.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Cucumis sativus] gi|449524824|ref|XP_004169421.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Cucumis sativus] Length = 471 Score = 766 bits (1977), Expect = 0.0 Identities = 357/424 (84%), Positives = 387/424 (91%), Gaps = 3/424 (0%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CP+GLYAEQISGTSFTSPRK N SWLYR+KPSVTHEPFR R+PK+ KL+SEFN SN S Sbjct: 46 ICPFGLYAEQISGTSFTSPRKANLCSWLYRIKPSVTHEPFRQRLPKNEKLISEFNASNCS 105 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 +TPTQLRWKPA+ P+SP DFVDGLYTVCGAGSS+LRHGFAIHMY ANKSM++CAFC+ADG Sbjct: 106 STPTQLRWKPADFPDSPVDFVDGLYTVCGAGSSFLRHGFAIHMYTANKSMENCAFCNADG 165 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFLIVPQ G+LWI TECGRL+VSPGEVVVLPQGFRFVV LPDGPSRGYVAEIFG+HFQLP Sbjct: 166 DFLIVPQSGKLWIITECGRLEVSPGEVVVLPQGFRFVVYLPDGPSRGYVAEIFGSHFQLP 225 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFL PVAWFE+ RPGYTI+QKFGGELFTA QDFSPFNVVAWHGNY Sbjct: 226 DLGPIGANGLAAPRDFLAPVAWFENSPRPGYTIIQKFGGELFTAIQDFSPFNVVAWHGNY 285 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDL KFCPYNTVL DH DPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 286 VPYKYDLCKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 345 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYGGYEAKADGF+PGGASLHSCMTPHGPDTKTYEATIA GN+AGP + Sbjct: 346 YYHRNCMSEFMGLIYGGYEAKADGFVPGGASLHSCMTPHGPDTKTYEATIARGNDAGPHK 405 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSE-DGQDV--ENE 1251 IS TMAFMFES L+PRVC WALESP++D DYYQCWIGLKSHF E + + D Q V E+E Sbjct: 406 ISGTMAFMFESSLIPRVCSWALESPFIDHDYYQCWIGLKSHFKNEAIGDTDPQKVRIESE 465 Query: 1252 HGEK 1263 +G + Sbjct: 466 NGRQ 469 >ref|XP_002298900.1| hypothetical protein POPTR_0001s38310g [Populus trichocarpa] gi|222846158|gb|EEE83705.1| hypothetical protein POPTR_0001s38310g [Populus trichocarpa] Length = 464 Score = 744 bits (1922), Expect = 0.0 Identities = 347/418 (83%), Positives = 375/418 (89%), Gaps = 4/418 (0%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CPYGLYAEQISGTSFTSP KLNQRSWLYR+KPSVTHEPF+ R P+H KLVSEF++SNS Sbjct: 47 LCPYGLYAEQISGTSFTSPHKLNQRSWLYRIKPSVTHEPFQARFPRHDKLVSEFDKSNSY 106 Query: 181 ATPTQLRWKPAEVP----ESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFC 348 TPTQLRWKP V +P DFV+GLYTVCGAGSS+LRHGFAIHMY ANKSMDD AFC Sbjct: 107 TTPTQLRWKPKPVDTVEESAPIDFVEGLYTVCGAGSSFLRHGFAIHMYTANKSMDDRAFC 166 Query: 349 SADGDFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTH 528 +ADGDFLIVPQ+GRLWI TECG+LQVSPGE+VV+PQGFRF VDLPDGPSRGYV+EIFGTH Sbjct: 167 NADGDFLIVPQKGRLWIATECGKLQVSPGEIVVIPQGFRFAVDLPDGPSRGYVSEIFGTH 226 Query: 529 FQLPDLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAW 708 FQLPDLGPIGANGLAA RDFLVP AWFE SRPGYT+VQK+GGELF AKQDFSPFNVVAW Sbjct: 227 FQLPDLGPIGANGLAAPRDFLVPKAWFEDGSRPGYTVVQKYGGELFVAKQDFSPFNVVAW 286 Query: 709 HGNYVPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHT 888 HGNYVPYKYDL+KFCPYNTVL DH DPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHT Sbjct: 287 HGNYVPYKYDLNKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHT 346 Query: 889 FRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEA 1068 FRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATI G++A Sbjct: 347 FRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIESGHDA 406 Query: 1069 GPRRISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDV 1242 GP +I+NT+AFMFESCL+PR+ AL+SP MD DYYQCW GLKSHF+ E G V Sbjct: 407 GPSKITNTLAFMFESCLIPRISLCALKSPLMDNDYYQCWTGLKSHFSGEGADSKGNGV 464 >ref|XP_006280403.1| hypothetical protein CARUB_v10026329mg [Capsella rubella] gi|482549107|gb|EOA13301.1| hypothetical protein CARUB_v10026329mg [Capsella rubella] Length = 476 Score = 743 bits (1917), Expect = 0.0 Identities = 337/403 (83%), Positives = 364/403 (90%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CPYGLYAEQISGTSFTSPRKLNQRSWLYR+KPSVTHEPF+PRVP H KLVSEF+ SNS Sbjct: 54 ICPYGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFKPRVPAHKKLVSEFDASNSR 113 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 PTQLRW+P ++PES TDFVDGLYT+CGAGSS+LRHGFAIHMY+ANK M D AFC+ADG Sbjct: 114 TNPTQLRWRPEDIPESATDFVDGLYTICGAGSSFLRHGFAIHMYVANKGMKDSAFCNADG 173 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFL+VPQ GRLWI TECGRL VSPGE+ V+PQGFRF +DLPDG SRGYVAEI+G HFQLP Sbjct: 174 DFLLVPQAGRLWIETECGRLLVSPGEIAVIPQGFRFSIDLPDGKSRGYVAEIYGAHFQLP 233 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFL P AWFE RP YTI+QKFGGELFTAKQDFSPFNVVAWHGNY Sbjct: 234 DLGPIGANGLAAPRDFLAPTAWFEDAVRPDYTIIQKFGGELFTAKQDFSPFNVVAWHGNY 293 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDL KFCPYN VL+DHGDPS+NTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 294 VPYKYDLQKFCPYNAVLLDHGDPSVNTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 353 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYG YEAKADGFLPGGASLHSCMTPHGPDT TYEATIA N P + Sbjct: 354 YYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTTYEATIARVNAMAPSK 413 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFA 1209 ++ TMAFMFES L+PRVC WALESP++D DYYQCWIGLKSHF+ Sbjct: 414 LTGTMAFMFESALIPRVCHWALESPFLDHDYYQCWIGLKSHFS 456 >gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] Length = 461 Score = 741 bits (1912), Expect = 0.0 Identities = 342/415 (82%), Positives = 371/415 (89%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CPYGLYAEQISGTSFTSPRKLNQRSWLYR+KPSVTHEPF+PRVP H KLVSEF+ SNS Sbjct: 39 LCPYGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFKPRVPAHKKLVSEFDASNSR 98 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 PTQLRW+P ++P+S TDFVDGL+T+CGAGSS+LRHGFAIHMY+ANK M D AFC+ADG Sbjct: 99 TNPTQLRWRPEDIPDSETDFVDGLFTICGAGSSFLRHGFAIHMYVANKGMKDSAFCNADG 158 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFL+VPQ GRLWI TECGRL VSPGE+ V+PQGFRF +DLPDG SRGYVAEI+G HFQLP Sbjct: 159 DFLLVPQTGRLWIETECGRLLVSPGEIAVIPQGFRFSIDLPDGKSRGYVAEIYGAHFQLP 218 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFL P AWFE RP YTIVQKFGGELFTAKQDFSPFNVVAWHGNY Sbjct: 219 DLGPIGANGLAAPRDFLAPTAWFEDGLRPEYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 278 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDL KFCPYNTVL+DHGDPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 279 VPYKYDLKKFCPYNTVLLDHGDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 338 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYG YEAKADGFLPGGASLHSCMTPHGPDT TYEATIA N P + Sbjct: 339 YYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTTYEATIARVNAMAPSK 398 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVE 1245 ++ TMAFMFES L+PRVC WALESP++D DYYQCWIGLKSHF+ +S D +VE Sbjct: 399 LTGTMAFMFESALIPRVCHWALESPFLDHDYYQCWIGLKSHFS--RISLDKTNVE 451 >ref|XP_003540068.1| PREDICTED: homogentisate 1,2-dioxygenase-like isoform X1 [Glycine max] gi|571493465|ref|XP_006592560.1| PREDICTED: homogentisate 1,2-dioxygenase-like isoform X2 [Glycine max] Length = 455 Score = 739 bits (1908), Expect = 0.0 Identities = 338/416 (81%), Positives = 374/416 (89%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 VCPYGLYAEQISGTSFTSPR N SW YR+KPSVTHEPF+PRVP +G+++SEFN SNSS Sbjct: 38 VCPYGLYAEQISGTSFTSPRNRNLFSWFYRIKPSVTHEPFKPRVPGNGRILSEFNNSNSS 97 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 A PTQLRWKP + P+SPTDF+DGL TVCG+GSS++RHG+AIHMY ANKSMD+CAFC+ADG Sbjct: 98 ANPTQLRWKPLDAPDSPTDFIDGLSTVCGSGSSFMRHGYAIHMYTANKSMDNCAFCNADG 157 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFLIVPQ+GRL +TTECGRL+VSPGE+ +LPQGFRF V+LPDGPSRGYVAEIFGTHFQLP Sbjct: 158 DFLIVPQQGRLLVTTECGRLKVSPGEIAILPQGFRFSVNLPDGPSRGYVAEIFGTHFQLP 217 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLA+ RDFLVP AWFE S PGYTIVQKFGGELF A QDFSPFNVVAWHGNY Sbjct: 218 DLGPIGANGLASPRDFLVPTAWFEDKSYPGYTIVQKFGGELFDAVQDFSPFNVVAWHGNY 277 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPY YDL+KFCPYNTVL DH DPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 278 VPYMYDLNKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 337 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLI+GGYEAKADGFLPGGASLHSCMTPHGPDTK+YEATIA GN+ GP + Sbjct: 338 YYHRNCMSEFMGLIHGGYEAKADGFLPGGASLHSCMTPHGPDTKSYEATIARGNDVGPCK 397 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVEN 1248 I++TMAFMFES L+PR+ WA ESP++D DYYQCWIGLKSHFA S + + N Sbjct: 398 ITDTMAFMFESSLIPRISQWASESPFLDQDYYQCWIGLKSHFAVTKTSPENPSLGN 453 >ref|NP_200219.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|30696407|ref|NP_851187.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|13432134|sp|Q9ZRA2.2|HGD_ARATH RecName: Full=Homogentisate 1,2-dioxygenase; AltName: Full=Homogentisate oxygenase; AltName: Full=Homogentisic acid oxidase; AltName: Full=Homogentisicase gi|7108615|gb|AAF36499.1|AF130845_1 homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|8809579|dbj|BAA97130.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|22655252|gb|AAM98216.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|33942055|gb|AAQ55280.1| At5g54080 [Arabidopsis thaliana] gi|332009064|gb|AED96447.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|332009065|gb|AED96448.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] Length = 461 Score = 738 bits (1906), Expect = 0.0 Identities = 342/420 (81%), Positives = 372/420 (88%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPF+PRVP H KLVSEF+ SNS Sbjct: 39 LCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFKPRVPAHKKLVSEFDASNSR 98 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 PTQLRW+P ++P+S DFVDGL+T+CGAGSS+LRHGFAIHMY+AN M D AFC+ADG Sbjct: 99 TNPTQLRWRPEDIPDSEIDFVDGLFTICGAGSSFLRHGFAIHMYVANTGMKDSAFCNADG 158 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFL+VPQ GRLWI TECGRL V+PGE+ V+PQGFRF +DLPDG SRGYVAEI+G HFQLP Sbjct: 159 DFLLVPQTGRLWIETECGRLLVTPGEIAVIPQGFRFSIDLPDGKSRGYVAEIYGAHFQLP 218 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAASRDFL P AWFE RP YTIVQKFGGELFTAKQDFSPFNVVAWHGNY Sbjct: 219 DLGPIGANGLAASRDFLAPTAWFEDGLRPEYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 278 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDL KFCPYNTVL+DHGDPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 279 VPYKYDLKKFCPYNTVLLDHGDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 338 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYG YEAKADGFLPGGASLHSCMTPHGPDT TYEATIA N P + Sbjct: 339 YYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTTYEATIARVNAMAPSK 398 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVENEHGE 1260 ++ TMAFMFES L+PRVC WALESP++D DYYQCWIGLKSHF+ +S D +VE+ E Sbjct: 399 LTGTMAFMFESALIPRVCHWALESPFLDHDYYQCWIGLKSHFS--RISLDKTNVESTEKE 456 >ref|XP_002864301.1| homogentisate 1,2-dioxygenase [Arabidopsis lyrata subsp. lyrata] gi|297310136|gb|EFH40560.1| homogentisate 1,2-dioxygenase [Arabidopsis lyrata subsp. lyrata] Length = 461 Score = 738 bits (1906), Expect = 0.0 Identities = 338/403 (83%), Positives = 364/403 (90%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CPYGLYAEQISGTSFTSPRKLNQRSWLYR+KPSVTHEPF+PRVP H KLVSEF+ SNS Sbjct: 39 LCPYGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFKPRVPAHKKLVSEFDASNSR 98 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 PTQLRW+P ++PES TDFVDGLYT+CGAGSS+LRHGFAIHMY+ANK M + AFC+ADG Sbjct: 99 TNPTQLRWRPEDIPESETDFVDGLYTICGAGSSFLRHGFAIHMYVANKGMKNSAFCNADG 158 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFL+VPQ GRLWI TECGRL V+PGE+ V+PQGFRF VDLPDG SRGYVAEI+G HFQLP Sbjct: 159 DFLLVPQTGRLWIETECGRLLVTPGEIAVIPQGFRFSVDLPDGKSRGYVAEIYGAHFQLP 218 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFL P AWFE RP YTIVQKFG ELFTAKQDFSPFNVVAWHGNY Sbjct: 219 DLGPIGANGLAAPRDFLAPTAWFEEGLRPEYTIVQKFGAELFTAKQDFSPFNVVAWHGNY 278 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDL KFCPYNTVL+DHGDPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 279 VPYKYDLQKFCPYNTVLLDHGDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 338 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYG YEAKADGFLPGGASLHSCMTPHGPDT TYEATIA N P + Sbjct: 339 YYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTTYEATIARVNAMAPSK 398 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFA 1209 ++ TMAFMFES L+PRVC WALESP++D DYYQCWIGLKSHF+ Sbjct: 399 LTGTMAFMFESALIPRVCHWALESPFLDHDYYQCWIGLKSHFS 441 >gb|AAD00360.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] Length = 461 Score = 737 bits (1902), Expect = 0.0 Identities = 341/420 (81%), Positives = 372/420 (88%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPF+PRVP H KLVSEF+ SNS Sbjct: 39 LCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFKPRVPAHKKLVSEFDASNSR 98 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 PTQLRW+P ++P+S DFVDGL+T+CGAGSS+LRHGFAIHMY+AN M D AFC+ADG Sbjct: 99 TNPTQLRWRPEDIPDSEIDFVDGLFTICGAGSSFLRHGFAIHMYVANTGMKDSAFCNADG 158 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFL+VPQ GRLWI TECGRL V+PGE+ V+PQGFRF +DLPDG SRGYVAEI+G HFQLP Sbjct: 159 DFLLVPQTGRLWIETECGRLLVTPGEIAVIPQGFRFSIDLPDGKSRGYVAEIYGAHFQLP 218 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAASRDFL P AWFE RP YTIVQKFGGELFTAKQDFSPFNVVAWHGNY Sbjct: 219 DLGPIGANGLAASRDFLAPTAWFEDGLRPEYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 278 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDL KFCPYNTVL+DHGDPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 279 VPYKYDLKKFCPYNTVLLDHGDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 338 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLIYG YEAKADGFLPGGASLHSCMTPHGPDT TYEATIA N P + Sbjct: 339 YYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTTYEATIARVNAMAPSK 398 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVENEHGE 1260 ++ TMAFMFES L+PRVC WALESP++D +YYQCWIGLKSHF+ +S D +VE+ E Sbjct: 399 LTGTMAFMFESALIPRVCHWALESPFLDHEYYQCWIGLKSHFS--RISLDKTNVESTEKE 456 >gb|ESW05061.1| hypothetical protein PHAVU_011G148800g [Phaseolus vulgaris] Length = 458 Score = 736 bits (1899), Expect = 0.0 Identities = 333/402 (82%), Positives = 369/402 (91%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 +CP+GLYAEQISGTSFTSPR N+ SW YR+KPSVTHEPF+PRVP + K+ SEFN SNSS Sbjct: 44 ICPFGLYAEQISGTSFTSPRNRNRCSWFYRIKPSVTHEPFKPRVPSNWKIFSEFNSSNSS 103 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 A PTQLRWKP + P+SPTDF+DGL T+CG+GSS++RHG+AIHMY ANKSMD+CAFC+ADG Sbjct: 104 ANPTQLRWKPMDAPDSPTDFIDGLSTICGSGSSFMRHGYAIHMYAANKSMDNCAFCNADG 163 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFLIVPQ+GRL ITTECGRL+VSPGE+ +LPQGFRF V+LPDGPSRGYVAEIFGTHF+LP Sbjct: 164 DFLIVPQQGRLLITTECGRLKVSPGEIAILPQGFRFSVNLPDGPSRGYVAEIFGTHFELP 223 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLAA RDFLVP AWFE S PGYTIVQKFGGELF A QDFSPFNVVAWHGNY Sbjct: 224 DLGPIGANGLAAPRDFLVPTAWFEDKSYPGYTIVQKFGGELFAAVQDFSPFNVVAWHGNY 283 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 PYKYDLSKFCPYNTVL DH DPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 284 FPYKYDLSKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 343 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLI+GGYEAKADGFLPGGASLH+CMTPHGPDTK+YEATIA GN+ GP + Sbjct: 344 YYHRNCMSEFMGLIHGGYEAKADGFLPGGASLHNCMTPHGPDTKSYEATIARGNDIGPSK 403 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHF 1206 I++TMAFMFES L+PR+ WALESP++D DYYQCWIGL+SHF Sbjct: 404 ITDTMAFMFESSLIPRISQWALESPFLDQDYYQCWIGLRSHF 445 >ref|XP_003527216.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Glycine max] Length = 455 Score = 734 bits (1895), Expect = 0.0 Identities = 335/416 (80%), Positives = 374/416 (89%) Frame = +1 Query: 1 VCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFRPRVPKHGKLVSEFNESNSS 180 VCPYGLYAEQISGTSFTSPR N SW YR+KPSVTHEPF+PRVP +G+++SEFN S+SS Sbjct: 38 VCPYGLYAEQISGTSFTSPRNRNLFSWFYRIKPSVTHEPFKPRVPGNGRILSEFNNSSSS 97 Query: 181 ATPTQLRWKPAEVPESPTDFVDGLYTVCGAGSSYLRHGFAIHMYIANKSMDDCAFCSADG 360 A PTQLRWKP + P+SP DF+DGL T+CG+GSS++RHG+AIHMY ANKSMD+CAFC+ADG Sbjct: 98 ANPTQLRWKPMDAPDSPMDFIDGLSTMCGSGSSFMRHGYAIHMYNANKSMDNCAFCNADG 157 Query: 361 DFLIVPQEGRLWITTECGRLQVSPGEVVVLPQGFRFVVDLPDGPSRGYVAEIFGTHFQLP 540 DFLIVPQ+GRL ITTECGRL+VSPGE+ ++P GFRF V+LPDGPSRGYVAEIFGTHFQLP Sbjct: 158 DFLIVPQQGRLLITTECGRLKVSPGEIAIIPHGFRFSVNLPDGPSRGYVAEIFGTHFQLP 217 Query: 541 DLGPIGANGLAASRDFLVPVAWFEHISRPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNY 720 DLGPIGANGLA+ RDFLVP AWFE S PGYTIVQKFGGELF A QDFSPFNVVAWHGNY Sbjct: 218 DLGPIGANGLASPRDFLVPSAWFEDKSYPGYTIVQKFGGELFDAVQDFSPFNVVAWHGNY 277 Query: 721 VPYKYDLSKFCPYNTVLMDHGDPSINTVLTAPTDRPGVALLDFVIFPPRWLVAEHTFRPP 900 VPYKYDLSKFCPYNTVL DH DPSINTVLTAPTD+PGVALLDFVIFPPRWLVAEHTFRPP Sbjct: 278 VPYKYDLSKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPP 337 Query: 901 YYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKTYEATIALGNEAGPRR 1080 YYHRNCMSEFMGLI+GGYEAKADGFLPGGASLH+CMTPHGPDTK+YEATIA GN+ GP + Sbjct: 338 YYHRNCMSEFMGLIHGGYEAKADGFLPGGASLHNCMTPHGPDTKSYEATIARGNDGGPCK 397 Query: 1081 ISNTMAFMFESCLMPRVCPWALESPYMDPDYYQCWIGLKSHFACEIVSEDGQDVEN 1248 I++TMAFMFES L+PR+ WALESP++D DYYQCWIGLKSHF S + ++ N Sbjct: 398 ITDTMAFMFESSLIPRISQWALESPFLDQDYYQCWIGLKSHFTVTETSPENTNLRN 453