BLASTX nr result
ID: Ephedra26_contig00004656
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ephedra26_contig00004656 (1279 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|ABR16047.1| unknown [Picea sitchensis] 722 0.0 ref|XP_006858313.1| hypothetical protein AMTR_s00064p00100410 [A... 675 0.0 ref|XP_002518387.1| homogentisate 1,2-dioxygenase, putative [Ric... 673 0.0 gb|EXB75014.1| Homogentisate 1,2-dioxygenase [Morus notabilis] 672 0.0 gb|EMJ15072.1| hypothetical protein PRUPE_ppa005219mg [Prunus pe... 669 0.0 ref|XP_006494848.1| PREDICTED: homogentisate 1,2-dioxygenase-lik... 667 0.0 ref|XP_003527216.1| PREDICTED: homogentisate 1,2-dioxygenase-lik... 667 0.0 gb|ESW05061.1| hypothetical protein PHAVU_011G148800g [Phaseolus... 666 0.0 ref|XP_003540068.1| PREDICTED: homogentisate 1,2-dioxygenase-lik... 665 0.0 gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] 659 0.0 ref|XP_002437662.1| hypothetical protein SORBIDRAFT_10g000360 [S... 659 0.0 ref|XP_004137214.1| PREDICTED: homogentisate 1,2-dioxygenase-lik... 658 0.0 gb|EOY13160.1| Homogentisate 1,2-dioxygenase isoform 1 [Theobrom... 657 0.0 ref|XP_006280403.1| hypothetical protein CARUB_v10026329mg [Caps... 657 0.0 ref|XP_002864301.1| homogentisate 1,2-dioxygenase [Arabidopsis l... 655 0.0 ref|NP_200219.1| homogentisate 1,2-dioxygenase [Arabidopsis thal... 654 0.0 ref|XP_004964265.1| PREDICTED: homogentisate 1,2-dioxygenase-lik... 654 0.0 ref|XP_002285298.1| PREDICTED: homogentisate 1,2-dioxygenase [Vi... 654 0.0 ref|XP_006358956.1| PREDICTED: LOW QUALITY PROTEIN: homogentisat... 654 0.0 gb|AAD00360.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] 653 0.0 >gb|ABR16047.1| unknown [Picea sitchensis] Length = 463 Score = 722 bits (1864), Expect = 0.0 Identities = 329/426 (77%), Positives = 369/426 (86%), Gaps = 1/426 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EAK GALPV QNNP+ CPYGLYAEQ+SGTAFTVPRK N+RSWLYRIKPSVTHEPF+PR P Sbjct: 34 EAKPGALPVGQNNPLKCPYGLYAEQVSGTAFTVPRKLNQRSWLYRIKPSVTHEPFHPRVP 93 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 ATPTQLRW PF P+ K NF+DGL TICGAGSSF+RHG+AVHMY Sbjct: 94 SHDYLISEFNQSSSSATPTQLRWSPFGIPDTKTNFIDGLFTICGAGSSFLRHGFAVHMYA 153 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 ANASM CAF NADGDFLI+PQQGRLWITTELG++Q+SPGE+VVL QGFRYS+DLPDGPS Sbjct: 154 ANASMEGCAFANADGDFLIVPQQGRLWITTELGRLQVSPGEIVVLQQGFRYSIDLPDGPS 213 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY++EV+ HF+LPDLG IGANGLASP DFL+P+AWFE+++ PGY I+HKFGGSLFTAK Sbjct: 214 RGYVVEVFSGHFQLPDLGPIGANGLASPPDFLTPTAWFEDKAYPGYTIVHKFGGSLFTAK 273 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 Q+FSPFNVVAW GNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVA+VDFVI Sbjct: 274 QNFSPFNVVAWHGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAVVDFVI 333 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GF PGG SLH+CMTPHGPD+ T Sbjct: 334 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGNYEAKADGFQPGGASLHSCMTPHGPDTTT 393 Query: 1081 YEKTI-QGGNEEPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHFCHD 1257 +EKTI + + +P KI+DTMAFMFESS+IP++T W L S LD DYYKCW GLKSHF H+ Sbjct: 394 FEKTIAEEDDAKPAKIRDTMAFMFESSLIPRITPWVLKSPHLDNDYYKCWTGLKSHFHHE 453 Query: 1258 KVEENG 1275 + ENG Sbjct: 454 HLPENG 459 >ref|XP_006858313.1| hypothetical protein AMTR_s00064p00100410 [Amborella trichopoda] gi|548862420|gb|ERN19780.1| hypothetical protein AMTR_s00064p00100410 [Amborella trichopoda] Length = 471 Score = 675 bits (1741), Expect = 0.0 Identities = 310/424 (73%), Positives = 352/424 (83%), Gaps = 1/424 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP QN+P+ CP+GLYAEQISGTAFT PRK N+RSWLYRIKPSVTHEPF+PR P Sbjct: 25 EAMGGALPRDQNSPLLCPFGLYAEQISGTAFTAPRKLNQRSWLYRIKPSVTHEPFHPRVP 84 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 ATPTQLRWKP D PE+ +F+DGL TICGAGSSF+RHGYAVHMY Sbjct: 85 THAHLVSEFNQSSSSATPTQLRWKPADVPESPTDFIDGLYTICGAGSSFLRHGYAVHMYA 144 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM CAFC+ADGDFLI+PQ+GRLW+TTE G++QI PGE+VVLPQGFR+SVDLPDGPS Sbjct: 145 ANKSMDSCAFCSADGDFLIVPQKGRLWLTTECGRLQICPGEIVVLPQGFRFSVDLPDGPS 204 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ EV+G+HF+LP+LG IGANGLA+ DFL P+A+FEE PGY I+ KFGG LFTAK Sbjct: 205 RGYVAEVFGTHFQLPELGPIGANGLAASRDFLVPTAFFEEEHHPGYTIVQKFGGELFTAK 264 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDL KFCPFNTVLFDHGDPS+NTVLT PSEKPGVA+VDFVI Sbjct: 265 QDFSPFNVVAWHGNYVPYKYDLSKFCPFNTVLFDHGDPSVNTVLTAPSEKPGVALVDFVI 324 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAK +GFLPGG SLH+CMTPHGPD+ T Sbjct: 325 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKKDGFLPGGASLHSCMTPHGPDTKT 384 Query: 1081 YEKTIQ-GGNEEPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHFCHD 1257 +E T+ + EP +I DTMAFMFES +IP++ WAL S LD DYYKCW+GLKSHF Sbjct: 385 FEATVSCEKSSEPFRIADTMAFMFESCLIPRICPWALESPDLDPDYYKCWVGLKSHFLRK 444 Query: 1258 KVEE 1269 +V + Sbjct: 445 EVTQ 448 >ref|XP_002518387.1| homogentisate 1,2-dioxygenase, putative [Ricinus communis] gi|223542482|gb|EEF44023.1| homogentisate 1,2-dioxygenase, putative [Ricinus communis] Length = 457 Score = 673 bits (1737), Expect = 0.0 Identities = 306/427 (71%), Positives = 354/427 (82%), Gaps = 2/427 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP QN+P+ CPYGLYAEQISG++FT PRK ++RSWLYRIKPSVTHEPF PR P Sbjct: 29 EAIHGALPRGQNSPLICPYGLYAEQISGSSFTSPRKLSQRSWLYRIKPSVTHEPFKPRVP 88 Query: 181 XXXXXXXXXXXXXX-KATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMY 357 TPTQLRWKP D P++ +F+DGL TICGAGSSF+RHG+A+HMY Sbjct: 89 SHGKIVSEFDKTDSCTTTPTQLRWKPVDIPDSPTDFIDGLFTICGAGSSFLRHGFAIHMY 148 Query: 358 VANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGP 537 AN SMG+CA CNADGDFL++PQ+GRLWITTE GK+Q+SPGEVVVLPQGFR++VDLPDGP Sbjct: 149 TANKSMGNCALCNADGDFLVVPQEGRLWITTECGKLQVSPGEVVVLPQGFRFAVDLPDGP 208 Query: 538 SRGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTA 717 SRGY+ E++G+HF+LPDLG IGANGLA+P DFL P AW+EE CPGY II KFGG LFTA Sbjct: 209 SRGYVAEIFGTHFQLPDLGPIGANGLAAPRDFLVPKAWYEEGPCPGYTIIQKFGGELFTA 268 Query: 718 KQDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFV 897 KQDFSPFNVVAW GN+ PYKYDLKKFCP+NTVL DH DPSINTVLT ++KPGVA++DFV Sbjct: 269 KQDFSPFNVVAWHGNFVPYKYDLKKFCPYNTVLIDHSDPSINTVLTASTDKPGVALLDFV 328 Query: 898 IFPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSA 1077 IFPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GF+PGG SLH+CMTPHGPD+ Sbjct: 329 IFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFVPGGASLHSCMTPHGPDTK 388 Query: 1078 TYEKTIQGGNEE-PVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHFCH 1254 TYE TI GN+ P +I DTMAFMFES +IP++ WA+ S +D DYY+CWIGLKSHF H Sbjct: 389 TYEATIARGNDAGPSRITDTMAFMFESCLIPRICLWAVESPFIDHDYYQCWIGLKSHFSH 448 Query: 1255 DKVEENG 1275 +NG Sbjct: 449 GADSKNG 455 >gb|EXB75014.1| Homogentisate 1,2-dioxygenase [Morus notabilis] Length = 460 Score = 672 bits (1733), Expect = 0.0 Identities = 304/417 (72%), Positives = 350/417 (83%), Gaps = 1/417 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP QN+P+ CPY LYAEQISGT+FT PRK N RSWLYRIKPSVTHEPF PR P Sbjct: 23 EALAGALPHGQNSPLLCPYSLYAEQISGTSFTSPRKLNLRSWLYRIKPSVTHEPFKPRVP 82 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 ATPTQLRWKP + P++ +FVDGL T+CGAGSSF+RHG+AVHMY Sbjct: 83 SHGKLLSEFDRSNSSATPTQLRWKPVEIPDSPTDFVDGLFTVCGAGSSFLRHGFAVHMYT 142 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM +CAFCNADGDFLI+PQ+GRLWITTE GK+Q+SPGEV +LPQGFR++VDLPDGPS Sbjct: 143 ANKSMDNCAFCNADGDFLIVPQKGRLWITTECGKLQVSPGEVAILPQGFRFAVDLPDGPS 202 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E++G+HF+LPDLG IGANGLA+P DFL+P+AWFE+ PGY I+ KFGG LFTAK Sbjct: 203 RGYVAEIFGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEDGRRPGYTIVQKFGGELFTAK 262 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GN+ PYKYDL KFCP+NTVL DH DPSINTVLT P++KPGVA++DFV+ Sbjct: 263 QDFSPFNVVAWHGNHVPYKYDLSKFCPYNTVLVDHSDPSINTVLTAPTDKPGVALLDFVV 322 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ T Sbjct: 323 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGSSLHSCMTPHGPDTKT 382 Query: 1081 YEKTIQGGNEE-PVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHF 1248 YE TI GNE P +IKDTMAFMFES ++P+V WAL S +D DYY+CWIGL+SHF Sbjct: 383 YEATIARGNEPGPFRIKDTMAFMFESCLMPRVCAWALESPFMDHDYYQCWIGLRSHF 439 >gb|EMJ15072.1| hypothetical protein PRUPE_ppa005219mg [Prunus persica] Length = 472 Score = 669 bits (1727), Expect = 0.0 Identities = 304/420 (72%), Positives = 349/420 (83%), Gaps = 1/420 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA G LP Q++P+ CPYGLYAEQISGT+FT PRK N R+WLYR+KPSVTHEPF P + Sbjct: 37 EALPGTLPHGQSSPLLCPYGLYAEQISGTSFTSPRKLNHRTWLYRVKPSVTHEPFKPLES 96 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 TPTQLRWKP D PE +FV+GL T+CGAGSSF+RHG+A+HMY Sbjct: 97 SHRKLVSEFTDSNSSTTPTQLRWKPVDIPETPTDFVEGLYTVCGAGSSFLRHGFAIHMYT 156 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM +CAFCNADGDFLI+PQ GRLWITTE GK+QISPGE+ VLPQGFR++VDLPDGPS Sbjct: 157 ANKSMDNCAFCNADGDFLIVPQTGRLWITTECGKLQISPGEIAVLPQGFRFAVDLPDGPS 216 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ EV+G+HF+LPDLG IGANGLA+P DFL P+AWFE+ PGYVII KFGG LFTAK Sbjct: 217 RGYVAEVFGTHFQLPDLGPIGANGLAAPRDFLVPTAWFEDSYRPGYVIIQKFGGELFTAK 276 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 Q+FSPFNVVAW GNYAPYKYDL FCPFNTVLFDHGDPSINTVLT P++KPGVA++DFVI Sbjct: 277 QEFSPFNVVAWHGNYAPYKYDLTTFCPFNTVLFDHGDPSINTVLTAPTDKPGVALLDFVI 336 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ T Sbjct: 337 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKT 396 Query: 1081 YEKTIQGGNEE-PVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHFCHD 1257 YE TI GNE P +I DT+AFMFES +IP++ WAL S +D+DYY+CWIGL+SHF + Sbjct: 397 YEATIARGNEAGPSRISDTLAFMFESCLIPRICPWALESPFIDRDYYQCWIGLRSHFTRE 456 >ref|XP_006494848.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Citrus sinensis] Length = 478 Score = 667 bits (1721), Expect = 0.0 Identities = 302/423 (71%), Positives = 351/423 (82%), Gaps = 1/423 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA +GALP QN+P+ CP+GLYAEQISGT+FT PRK N+RSWLYRIKPS THEPF PR P Sbjct: 37 EAIDGALPRGQNSPLVCPFGLYAEQISGTSFTSPRKLNQRSWLYRIKPSATHEPFKPRVP 96 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 TPTQLRWKP D P++ +F+DGL TICGAGSSF+RHGYA+HMY Sbjct: 97 AHGKLVSEFDKSNSYTTPTQLRWKPVDIPDSPTDFIDGLYTICGAGSSFLRHGYAIHMYT 156 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM +CAFCNADGDFL++PQ+GRLWI TE GK+++SPGE+ VLPQGFR++V LPDGPS Sbjct: 157 ANKSMDNCAFCNADGDFLVVPQKGRLWIATECGKLEVSPGEIAVLPQGFRFAVSLPDGPS 216 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGYI E++G+HF+LPDLG IGANGLA+P DFL P+AWFEE S GY I+ KFGG LFTA+ Sbjct: 217 RGYIAEIFGTHFQLPDLGPIGANGLAAPRDFLVPTAWFEEGSRLGYTIVQKFGGELFTAR 276 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDL KFCPFNTVL DHGDPSINTVLT P++KPGVA++DFVI Sbjct: 277 QDFSPFNVVAWHGNYVPYKYDLSKFCPFNTVLVDHGDPSINTVLTAPTDKPGVALLDFVI 336 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLI G YEAKA+GFLPGG SLH+CMTPHGPD+ T Sbjct: 337 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIRGGYEAKADGFLPGGASLHSCMTPHGPDTKT 396 Query: 1081 YEKTIQGGNEE-PVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHFCHD 1257 YE TI G+E P KI DTMAFMFES +IP++ WAL S +D DYY+CWIGL+SHF ++ Sbjct: 397 YEATIARGSEAGPYKITDTMAFMFESCLIPRICPWALESPFMDHDYYRCWIGLRSHFSYE 456 Query: 1258 KVE 1266 + + Sbjct: 457 EAD 459 >ref|XP_003527216.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Glycine max] Length = 455 Score = 667 bits (1720), Expect = 0.0 Identities = 302/417 (72%), Positives = 349/417 (83%), Gaps = 1/417 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP AQN+P+ CPYGLYAEQISGT+FT PR N SW YRIKPSVTHEPF PR P Sbjct: 23 EALAGALPAAQNSPLVCPYGLYAEQISGTSFTSPRNRNLFSWFYRIKPSVTHEPFKPRVP 82 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 A PTQLRWKP D P++ +F+DGL T+CG+GSSF+RHGYA+HMY Sbjct: 83 GNGRILSEFNNSSSSANPTQLRWKPMDAPDSPMDFIDGLSTMCGSGSSFMRHGYAIHMYN 142 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM +CAFCNADGDFLI+PQQGRL ITTE G++++SPGE+ ++P GFR+SV+LPDGPS Sbjct: 143 ANKSMDNCAFCNADGDFLIVPQQGRLLITTECGRLKVSPGEIAIIPHGFRFSVNLPDGPS 202 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E++G+HF+LPDLG IGANGLASP DFL PSAWFE++S PGY I+ KFGG LF A Sbjct: 203 RGYVAEIFGTHFQLPDLGPIGANGLASPRDFLVPSAWFEDKSYPGYTIVQKFGGELFDAV 262 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDL KFCP+NTVLFDH DPSINTVLT P++KPGVA++DFVI Sbjct: 263 QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVI 322 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLI+G YEAKA+GFLPGG SLHNCMTPHGPD+ + Sbjct: 323 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIHGGYEAKADGFLPGGASLHNCMTPHGPDTKS 382 Query: 1081 YEKTIQGGNE-EPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHF 1248 YE TI GN+ P KI DTMAFMFESS+IP+++QWAL S LD+DYY+CWIGLKSHF Sbjct: 383 YEATIARGNDGGPCKITDTMAFMFESSLIPRISQWALESPFLDQDYYQCWIGLKSHF 439 >gb|ESW05061.1| hypothetical protein PHAVU_011G148800g [Phaseolus vulgaris] Length = 458 Score = 666 bits (1718), Expect = 0.0 Identities = 301/417 (72%), Positives = 350/417 (83%), Gaps = 1/417 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP QN+P+ CP+GLYAEQISGT+FT PR N+ SW YRIKPSVTHEPF PR P Sbjct: 29 EALPGALPEGQNSPLICPFGLYAEQISGTSFTSPRNRNRCSWFYRIKPSVTHEPFKPRVP 88 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 A PTQLRWKP D P++ +F+DGL TICG+GSSF+RHGYA+HMY Sbjct: 89 SNWKIFSEFNSSNSSANPTQLRWKPMDAPDSPTDFIDGLSTICGSGSSFMRHGYAIHMYA 148 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM +CAFCNADGDFLI+PQQGRL ITTE G++++SPGE+ +LPQGFR+SV+LPDGPS Sbjct: 149 ANKSMDNCAFCNADGDFLIVPQQGRLLITTECGRLKVSPGEIAILPQGFRFSVNLPDGPS 208 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E++G+HFELPDLG IGANGLA+P DFL P+AWFE++S PGY I+ KFGG LF A Sbjct: 209 RGYVAEIFGTHFELPDLGPIGANGLAAPRDFLVPTAWFEDKSYPGYTIVQKFGGELFAAV 268 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDL KFCP+NTVLFDH DPSINTVLT P++KPGVA++DFVI Sbjct: 269 QDFSPFNVVAWHGNYFPYKYDLSKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVI 328 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLI+G YEAKA+GFLPGG SLHNCMTPHGPD+ + Sbjct: 329 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIHGGYEAKADGFLPGGASLHNCMTPHGPDTKS 388 Query: 1081 YEKTIQGGNE-EPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHF 1248 YE TI GN+ P KI DTMAFMFESS+IP+++QWAL S LD+DYY+CWIGL+SHF Sbjct: 389 YEATIARGNDIGPSKITDTMAFMFESSLIPRISQWALESPFLDQDYYQCWIGLRSHF 445 >ref|XP_003540068.1| PREDICTED: homogentisate 1,2-dioxygenase-like isoform X1 [Glycine max] gi|571493465|ref|XP_006592560.1| PREDICTED: homogentisate 1,2-dioxygenase-like isoform X2 [Glycine max] Length = 455 Score = 665 bits (1715), Expect = 0.0 Identities = 301/421 (71%), Positives = 350/421 (83%), Gaps = 1/421 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALPVAQN+P+ CPYGLYAEQISGT+FT PR N SW YRIKPSVTHEPF PR P Sbjct: 23 EALAGALPVAQNSPLVCPYGLYAEQISGTSFTSPRNRNLFSWFYRIKPSVTHEPFKPRVP 82 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 A PTQLRWKP D P++ +F+DGL T+CG+GSSF+RHGYA+HMY Sbjct: 83 GNGRILSEFNNSNSSANPTQLRWKPLDAPDSPTDFIDGLSTVCGSGSSFMRHGYAIHMYT 142 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM +CAFCNADGDFLI+PQQGRL +TTE G++++SPGE+ +LPQGFR+SV+LPDGPS Sbjct: 143 ANKSMDNCAFCNADGDFLIVPQQGRLLVTTECGRLKVSPGEIAILPQGFRFSVNLPDGPS 202 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E++G+HF+LPDLG IGANGLASP DFL P+AWFE++S PGY I+ KFGG LF A Sbjct: 203 RGYVAEIFGTHFQLPDLGPIGANGLASPRDFLVPTAWFEDKSYPGYTIVQKFGGELFDAV 262 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PY YDL KFCP+NTVLFDH DPSINTVLT P++KPGVA++DFVI Sbjct: 263 QDFSPFNVVAWHGNYVPYMYDLNKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVI 322 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLI+G YEAKA+GFLPGG SLH+CMTPHGPD+ + Sbjct: 323 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIHGGYEAKADGFLPGGASLHSCMTPHGPDTKS 382 Query: 1081 YEKTIQGGNE-EPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHFCHD 1257 YE TI GN+ P KI DTMAFMFESS+IP+++QWA S LD+DYY+CWIGLKSHF Sbjct: 383 YEATIARGNDVGPCKITDTMAFMFESSLIPRISQWASESPFLDQDYYQCWIGLKSHFAVT 442 Query: 1258 K 1260 K Sbjct: 443 K 443 >gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] Length = 461 Score = 659 bits (1701), Expect = 0.0 Identities = 300/417 (71%), Positives = 346/417 (82%), Gaps = 1/417 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP+ QN+P+ CPYGLYAEQISGT+FT PRK N+RSWLYRIKPSVTHEPF PR P Sbjct: 24 EAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFKPRVP 83 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 + PTQLRW+P D P+++ +FVDGL TICGAGSSF+RHG+A+HMYV Sbjct: 84 AHKKLVSEFDASNSRTNPTQLRWRPEDIPDSETDFVDGLFTICGAGSSFLRHGFAIHMYV 143 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN M D AFCNADGDFL++PQ GRLWI TE G++ +SPGE+ V+PQGFR+S+DLPDG S Sbjct: 144 ANKGMKDSAFCNADGDFLLVPQTGRLWIETECGRLLVSPGEIAVIPQGFRFSIDLPDGKS 203 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E+YG+HF+LPDLG IGANGLA+P DFL+P+AWFE+ P Y I+ KFGG LFTAK Sbjct: 204 RGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEDGLRPEYTIVQKFGGELFTAK 263 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDLKKFCP+NTVL DHGDPSINTVLT P++KPGVA++DFVI Sbjct: 264 QDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDHGDPSINTVLTAPTDKPGVALLDFVI 323 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ T Sbjct: 324 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTT 383 Query: 1081 YEKTIQGGNE-EPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHF 1248 YE TI N P K+ TMAFMFES++IP+V WAL S LD DYY+CWIGLKSHF Sbjct: 384 YEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDYYQCWIGLKSHF 440 >ref|XP_002437662.1| hypothetical protein SORBIDRAFT_10g000360 [Sorghum bicolor] gi|241915885|gb|EER89029.1| hypothetical protein SORBIDRAFT_10g000360 [Sorghum bicolor] Length = 469 Score = 659 bits (1700), Expect = 0.0 Identities = 301/421 (71%), Positives = 349/421 (82%), Gaps = 2/421 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA G+LPV QN+P+ CP GLYAEQ+SGT+FT PR N R+WLYRIKPSVTHEPF PR+P Sbjct: 32 EAVPGSLPVGQNSPLVCPLGLYAEQLSGTSFTTPRARNLRTWLYRIKPSVTHEPFYPRNP 91 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFP-EAKQNFVDGLITICGAGSSFIRHGYAVHMY 357 ATPTQLRW+P D P +F+DGL T+CGAGSS +RHGYA+HMY Sbjct: 92 TNERLVGEFHRATTVATPTQLRWRPADVPLHPDLDFIDGLYTVCGAGSSCLRHGYAIHMY 151 Query: 358 VANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGP 537 AN SM CAFCNADGDFLI+PQQGRL ITTE GK+ +SPGE+VV+PQGFR++VDLPDGP Sbjct: 152 AANKSMDGCAFCNADGDFLIVPQQGRLLITTECGKLLVSPGEIVVIPQGFRFAVDLPDGP 211 Query: 538 SRGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTA 717 SRGY+ E++G+HF+LPDLG IGANGLASP DFLSP+AWFE+ PGY I+ K+GG LFTA Sbjct: 212 SRGYVSEIFGTHFQLPDLGPIGANGLASPRDFLSPTAWFEQDHHPGYTIVQKYGGELFTA 271 Query: 718 KQDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFV 897 QDFSPFNVVAW GNY PYKYDL KFCPFNTVLFDHGDPS+NTVLT P++KPGVA++DFV Sbjct: 272 TQDFSPFNVVAWHGNYVPYKYDLSKFCPFNTVLFDHGDPSVNTVLTAPTDKPGVALLDFV 331 Query: 898 IFPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSA 1077 IFPPRWLVAE+TFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ Sbjct: 332 IFPPRWLVAENTFRPPYYHRNCMSEFMGLIYGIYEAKADGFLPGGASLHSCMTPHGPDTK 391 Query: 1078 TYEKTI-QGGNEEPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHFCH 1254 TYE TI + G EP ++ T+AFMFESS+IP+V +WAL+S D DYY+CWIGLKSHF H Sbjct: 392 TYEATISRAGANEPFRLSGTLAFMFESSLIPRVCRWALDSPCRDLDYYQCWIGLKSHFSH 451 Query: 1255 D 1257 D Sbjct: 452 D 452 >ref|XP_004137214.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Cucumis sativus] gi|449524824|ref|XP_004169421.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Cucumis sativus] Length = 471 Score = 658 bits (1698), Expect = 0.0 Identities = 303/422 (71%), Positives = 348/422 (82%), Gaps = 1/422 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP +QN+P+ CP+GLYAEQISGT+FT PRK N SWLYRIKPSVTHEPF R P Sbjct: 31 EAIPGALPQSQNSPLICPFGLYAEQISGTSFTSPRKANLCSWLYRIKPSVTHEPFRQRLP 90 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 +TPTQLRWKP DFP++ +FVDGL T+CGAGSSF+RHG+A+HMY Sbjct: 91 KNEKLISEFNASNCSSTPTQLRWKPADFPDSPVDFVDGLYTVCGAGSSFLRHGFAIHMYT 150 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM +CAFCNADGDFLI+PQ G+LWI TE G++++SPGEVVVLPQGFR+ V LPDGPS Sbjct: 151 ANKSMENCAFCNADGDFLIVPQSGKLWIITECGRLEVSPGEVVVLPQGFRFVVYLPDGPS 210 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E++GSHF+LPDLG IGANGLA+P DFL+P AWFE PGY II KFGG LFTA Sbjct: 211 RGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENSPRPGYTIIQKFGGELFTAI 270 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDL KFCP+NTVLFDH DPSINTVLT P++KPGVA++DFVI Sbjct: 271 QDFSPFNVVAWHGNYVPYKYDLCKFCPYNTVLFDHSDPSINTVLTAPTDKPGVALLDFVI 330 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GF+PGG SLH+CMTPHGPD+ T Sbjct: 331 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFVPGGASLHSCMTPHGPDTKT 390 Query: 1081 YEKTIQGGNEE-PVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHFCHD 1257 YE TI GN+ P KI TMAFMFESS+IP+V WAL S +D DYY+CWIGLKSHF ++ Sbjct: 391 YEATIARGNDAGPHKISGTMAFMFESSLIPRVCSWALESPFIDHDYYQCWIGLKSHFKNE 450 Query: 1258 KV 1263 + Sbjct: 451 AI 452 >gb|EOY13160.1| Homogentisate 1,2-dioxygenase isoform 1 [Theobroma cacao] Length = 451 Score = 657 bits (1696), Expect = 0.0 Identities = 300/417 (71%), Positives = 342/417 (82%), Gaps = 1/417 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP QN+P+ CP+GLYAEQISGT+FT PRK N+RSWLYRIKPSVTHEPF PRD Sbjct: 34 EAIAGALPRGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFWPRDS 93 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 A PTQLRWKP D P+ +F+DGL TICGAGSSF+RHGYA+HMY Sbjct: 94 SHKKLVSEFDGSNTVANPTQLRWKPVDIPDTPTDFIDGLFTICGAGSSFLRHGYAIHMYT 153 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM +CAFCNADGDFL++PQQGRLWITTE G++Q+SPGE+ VLPQGFR+ VDLPDGPS Sbjct: 154 ANKSMDNCAFCNADGDFLVVPQQGRLWITTECGRLQVSPGEIAVLPQGFRFVVDLPDGPS 213 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ EV+G+HF+LPDLG IGANGLA+ DFL+P+AWFEE PG+ I+ KFGG LF A+ Sbjct: 214 RGYVAEVFGTHFQLPDLGPIGANGLAASRDFLAPTAWFEEHPRPGFTIVQKFGGELFNAR 273 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDL KFCP+NTVL DHGDPSINTVLT P++KPGVA++DFVI Sbjct: 274 QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLVDHGDPSINTVLTAPTDKPGVALLDFVI 333 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FP RWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ T Sbjct: 334 FPSRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKT 393 Query: 1081 YEKTIQGGNEE-PVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHF 1248 YE TI G E P KI DTMAFMFES ++P+ W L S D DYY+CW+GLKSHF Sbjct: 394 YEATIARGYEAGPHKITDTMAFMFESFLMPRTCPWVLESPFRDHDYYQCWVGLKSHF 450 >ref|XP_006280403.1| hypothetical protein CARUB_v10026329mg [Capsella rubella] gi|482549107|gb|EOA13301.1| hypothetical protein CARUB_v10026329mg [Capsella rubella] Length = 476 Score = 657 bits (1694), Expect = 0.0 Identities = 299/417 (71%), Positives = 344/417 (82%), Gaps = 1/417 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP+ QN+P+ CPYGLYAEQISGT+FT PRK N+RSWLYRIKPSVTHEPF PR P Sbjct: 39 EAIAGALPLDQNSPLICPYGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFKPRVP 98 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 + PTQLRW+P D PE+ +FVDGL TICGAGSSF+RHG+A+HMYV Sbjct: 99 AHKKLVSEFDASNSRTNPTQLRWRPEDIPESATDFVDGLYTICGAGSSFLRHGFAIHMYV 158 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN M D AFCNADGDFL++PQ GRLWI TE G++ +SPGE+ V+PQGFR+S+DLPDG S Sbjct: 159 ANKGMKDSAFCNADGDFLLVPQAGRLWIETECGRLLVSPGEIAVIPQGFRFSIDLPDGKS 218 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E+YG+HF+LPDLG IGANGLA+P DFL+P+AWFE+ P Y II KFGG LFTAK Sbjct: 219 RGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEDAVRPDYTIIQKFGGELFTAK 278 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDL+KFCP+N VL DHGDPS+NTVLT P++KPGVA++DFVI Sbjct: 279 QDFSPFNVVAWHGNYVPYKYDLQKFCPYNAVLLDHGDPSVNTVLTAPTDKPGVALLDFVI 338 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ T Sbjct: 339 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTT 398 Query: 1081 YEKTIQGGNE-EPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHF 1248 YE TI N P K+ TMAFMFES++IP+V WAL S LD DYY+CWIGLKSHF Sbjct: 399 YEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDYYQCWIGLKSHF 455 >ref|XP_002864301.1| homogentisate 1,2-dioxygenase [Arabidopsis lyrata subsp. lyrata] gi|297310136|gb|EFH40560.1| homogentisate 1,2-dioxygenase [Arabidopsis lyrata subsp. lyrata] Length = 461 Score = 655 bits (1689), Expect = 0.0 Identities = 299/417 (71%), Positives = 345/417 (82%), Gaps = 1/417 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP+ QN+P+ CPYGLYAEQISGT+FT PRK N+RSWLYRIKPSVTHEPF PR P Sbjct: 24 EAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFKPRVP 83 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 + PTQLRW+P D PE++ +FVDGL TICGAGSSF+RHG+A+HMYV Sbjct: 84 AHKKLVSEFDASNSRTNPTQLRWRPEDIPESETDFVDGLYTICGAGSSFLRHGFAIHMYV 143 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN M + AFCNADGDFL++PQ GRLWI TE G++ ++PGE+ V+PQGFR+SVDLPDG S Sbjct: 144 ANKGMKNSAFCNADGDFLLVPQTGRLWIETECGRLLVTPGEIAVIPQGFRFSVDLPDGKS 203 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E+YG+HF+LPDLG IGANGLA+P DFL+P+AWFEE P Y I+ KFG LFTAK Sbjct: 204 RGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEEGLRPEYTIVQKFGAELFTAK 263 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDL+KFCP+NTVL DHGDPSINTVLT P++KPGVA++DFVI Sbjct: 264 QDFSPFNVVAWHGNYVPYKYDLQKFCPYNTVLLDHGDPSINTVLTAPTDKPGVALLDFVI 323 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ T Sbjct: 324 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTT 383 Query: 1081 YEKTIQGGNE-EPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHF 1248 YE TI N P K+ TMAFMFES++IP+V WAL S LD DYY+CWIGLKSHF Sbjct: 384 YEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDYYQCWIGLKSHF 440 >ref|NP_200219.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|30696407|ref|NP_851187.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|13432134|sp|Q9ZRA2.2|HGD_ARATH RecName: Full=Homogentisate 1,2-dioxygenase; AltName: Full=Homogentisate oxygenase; AltName: Full=Homogentisic acid oxidase; AltName: Full=Homogentisicase gi|7108615|gb|AAF36499.1|AF130845_1 homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|8809579|dbj|BAA97130.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|22655252|gb|AAM98216.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|33942055|gb|AAQ55280.1| At5g54080 [Arabidopsis thaliana] gi|332009064|gb|AED96447.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] gi|332009065|gb|AED96448.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] Length = 461 Score = 654 bits (1688), Expect = 0.0 Identities = 297/417 (71%), Positives = 345/417 (82%), Gaps = 1/417 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP+ QN+P+ CPYGLYAEQISGT+FT PRK N+RSWLYR+KPSVTHEPF PR P Sbjct: 24 EAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFKPRVP 83 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 + PTQLRW+P D P+++ +FVDGL TICGAGSSF+RHG+A+HMYV Sbjct: 84 AHKKLVSEFDASNSRTNPTQLRWRPEDIPDSEIDFVDGLFTICGAGSSFLRHGFAIHMYV 143 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN M D AFCNADGDFL++PQ GRLWI TE G++ ++PGE+ V+PQGFR+S+DLPDG S Sbjct: 144 ANTGMKDSAFCNADGDFLLVPQTGRLWIETECGRLLVTPGEIAVIPQGFRFSIDLPDGKS 203 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E+YG+HF+LPDLG IGANGLA+ DFL+P+AWFE+ P Y I+ KFGG LFTAK Sbjct: 204 RGYVAEIYGAHFQLPDLGPIGANGLAASRDFLAPTAWFEDGLRPEYTIVQKFGGELFTAK 263 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDLKKFCP+NTVL DHGDPSINTVLT P++KPGVA++DFVI Sbjct: 264 QDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDHGDPSINTVLTAPTDKPGVALLDFVI 323 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ T Sbjct: 324 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTT 383 Query: 1081 YEKTIQGGNE-EPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHF 1248 YE TI N P K+ TMAFMFES++IP+V WAL S LD DYY+CWIGLKSHF Sbjct: 384 YEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDYYQCWIGLKSHF 440 >ref|XP_004964265.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Setaria italica] Length = 454 Score = 654 bits (1687), Expect = 0.0 Identities = 297/422 (70%), Positives = 353/422 (83%), Gaps = 3/422 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA G+LPV QN+P+ CP GLYAEQ+SGT+FT PR +N R+WLYRIKPSVTHEPF+PR+ Sbjct: 25 EAVAGSLPVGQNSPLVCPLGLYAEQLSGTSFTTPRASNLRTWLYRIKPSVTHEPFHPRED 84 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFP-EAKQNFVDGLITICGAGSSFIRHGYAVHMY 357 ATPTQLRW+P + P + +F+DGL T+CGAGS+F+RHGYA+HMY Sbjct: 85 KGRLVGEFDRATTV-ATPTQLRWRPTEVPLDRPLDFIDGLYTVCGAGSAFLRHGYAIHMY 143 Query: 358 VANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGP 537 AN SM CAFCNADGDFLI+PQQGRL+ITTE GK+ +SPGE+VV+PQGFR++VDLPDGP Sbjct: 144 AANKSMDGCAFCNADGDFLIVPQQGRLFITTECGKMLVSPGEIVVIPQGFRFAVDLPDGP 203 Query: 538 SRGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTA 717 SRGY+ E++G+HF+LPDLG IGANGLASP DFLSP+AWFE+ PGY+I+ K+GG LFTA Sbjct: 204 SRGYVSEIFGAHFQLPDLGPIGANGLASPRDFLSPTAWFEQAHRPGYMIVQKYGGELFTA 263 Query: 718 KQDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFV 897 QDFSPFNVVAW GNY PYKYDL +FCPFNTVLFDHGDPS+NTVLT P++KPGVA++DFV Sbjct: 264 TQDFSPFNVVAWHGNYVPYKYDLSRFCPFNTVLFDHGDPSVNTVLTAPTDKPGVALLDFV 323 Query: 898 IFPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSA 1077 IFPPRWLVAE+TFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ Sbjct: 324 IFPPRWLVAENTFRPPYYHRNCMSEFMGLIYGMYEAKADGFLPGGASLHSCMTPHGPDTK 383 Query: 1078 TYEKTIQ--GGNEEPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHFC 1251 TYE TI + EP ++ T+AFMFESS+IP+V +WAL+S D DYY+CWIGLKSHF Sbjct: 384 TYEATISRADASTEPFRLSGTLAFMFESSLIPRVCRWALDSPCRDLDYYQCWIGLKSHFS 443 Query: 1252 HD 1257 HD Sbjct: 444 HD 445 >ref|XP_002285298.1| PREDICTED: homogentisate 1,2-dioxygenase [Vitis vinifera] gi|302142933|emb|CBI20228.3| unnamed protein product [Vitis vinifera] Length = 463 Score = 654 bits (1687), Expect = 0.0 Identities = 295/427 (69%), Positives = 347/427 (81%), Gaps = 1/427 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP QNNP+ CP+GLYAEQISGT FT PRK N+ SWLYRIKPSVTHEPF PR P Sbjct: 31 EAIAGALPRGQNNPLTCPFGLYAEQISGTPFTAPRKQNQFSWLYRIKPSVTHEPFKPRVP 90 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 PTQLRWKP + P++ +F+DGL T+CGAGSSF+RHGYA+HMY Sbjct: 91 SHGKLVSEFNQSNSSTNPTQLRWKPVEIPDSPTDFIDGLYTVCGAGSSFLRHGYAIHMYT 150 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM +CAFCNADGDFLI+PQ+GRL ITTE GK+Q+SPGE+VVLP GFR+ VDLPDGPS Sbjct: 151 ANKSMDNCAFCNADGDFLIVPQKGRLSITTECGKLQVSPGEIVVLPHGFRFVVDLPDGPS 210 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E++G+HF+LPDLG IGANGLA+ DFL P AW+EE S PGY I+ KFGG LFTAK Sbjct: 211 RGYVAEIFGAHFQLPDLGPIGANGLAASRDFLVPVAWYEECSRPGYTIVQKFGGELFTAK 270 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDL KFCP NTVL DH DPSINTVLT P++KPGVA++DFVI Sbjct: 271 QDFSPFNVVAWHGNYVPYKYDLSKFCPVNTVLKDHADPSINTVLTAPTDKPGVALLDFVI 330 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ T Sbjct: 331 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGASLHSCMTPHGPDTKT 390 Query: 1081 YEKTIQGGNEE-PVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHFCHD 1257 +E T+ G + P +I +TMAFMFES +IP++ WAL+S ++D DYY+CW+GL+SHF + Sbjct: 391 FEATVAHGKDAGPFRITNTMAFMFESCLIPRICPWALDSPSIDHDYYQCWVGLRSHFSRE 450 Query: 1258 KVEENGR 1278 + + + Sbjct: 451 EASDESQ 457 >ref|XP_006358956.1| PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase-like [Solanum tuberosum] Length = 492 Score = 654 bits (1686), Expect = 0.0 Identities = 298/417 (71%), Positives = 344/417 (82%), Gaps = 1/417 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP QN+P+ CP+GLYAEQISGT+FT PRK N+RSWLYRIKPSVTHEPF PR P Sbjct: 27 EAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVTHEPFRPRMP 86 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 ATPTQLRWKP + PE +F+DGL TICGAGSS++RHG+A+HMY Sbjct: 87 RHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYTICGAGSSYLRHGFAIHMYT 146 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN SM + AFCNADGDFLI+PQ+GRLWITTE G++Q+ PGE+V+LPQG+R++VDLPDGPS Sbjct: 147 ANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGEIVILPQGYRFAVDLPDGPS 206 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E +G+H +LPDLG IGANGLA+P DFL P AW+E+ S PGY I+ K+GG LFTAK Sbjct: 207 RGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRPGYTIVQKYGGELFTAK 266 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDL KFCP+NTVL DH DPSINTVLT P++KPGVA++DFVI Sbjct: 267 QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVLTAPTDKPGVALLDFVI 326 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLI G YEAKA+GF PGG SLH+CMTPHGPD+ T Sbjct: 327 FPPRWLVAEHTFRPPYYHRNCMSEFMGLINGGYEAKADGFHPGGASLHSCMTPHGPDTKT 386 Query: 1081 YEKTIQGGNEE-PVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHF 1248 YE TI GNE P +I DTMAFMFES +IP+V WAL S +D DYY+CWIGLKSHF Sbjct: 387 YEATIALGNEAGPHRIADTMAFMFESCLIPRVCPWALESPFMDHDYYQCWIGLKSHF 443 >gb|AAD00360.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] Length = 461 Score = 653 bits (1684), Expect = 0.0 Identities = 296/417 (70%), Positives = 345/417 (82%), Gaps = 1/417 (0%) Frame = +1 Query: 1 EAKEGALPVAQNNPIHCPYGLYAEQISGTAFTVPRKNNKRSWLYRIKPSVTHEPFNPRDP 180 EA GALP+ QN+P+ CPYGLYAEQISGT+FT PRK N+RSWLYR+KPSVTHEPF PR P Sbjct: 24 EAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPRKLNQRSWLYRVKPSVTHEPFKPRVP 83 Query: 181 XXXXXXXXXXXXXXKATPTQLRWKPFDFPEAKQNFVDGLITICGAGSSFIRHGYAVHMYV 360 + PTQLRW+P D P+++ +FVDGL TICGAGSSF+RHG+A+HMYV Sbjct: 84 AHKKLVSEFDASNSRTNPTQLRWRPEDIPDSEIDFVDGLFTICGAGSSFLRHGFAIHMYV 143 Query: 361 ANASMGDCAFCNADGDFLIIPQQGRLWITTELGKIQISPGEVVVLPQGFRYSVDLPDGPS 540 AN M D AFCNADGDFL++PQ GRLWI TE G++ ++PGE+ V+PQGFR+S+DLPDG S Sbjct: 144 ANTGMKDSAFCNADGDFLLVPQTGRLWIETECGRLLVTPGEIAVIPQGFRFSIDLPDGKS 203 Query: 541 RGYILEVYGSHFELPDLGLIGANGLASPSDFLSPSAWFEERSCPGYVIIHKFGGSLFTAK 720 RGY+ E+YG+HF+LPDLG IGANGLA+ DFL+P+AWFE+ P Y I+ KFGG LFTAK Sbjct: 204 RGYVAEIYGAHFQLPDLGPIGANGLAASRDFLAPTAWFEDGLRPEYTIVQKFGGELFTAK 263 Query: 721 QDFSPFNVVAWQGNYAPYKYDLKKFCPFNTVLFDHGDPSINTVLTVPSEKPGVAIVDFVI 900 QDFSPFNVVAW GNY PYKYDLKKFCP+NTVL DHGDPSINTVLT P++KPGVA++DFVI Sbjct: 264 QDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDHGDPSINTVLTAPTDKPGVALLDFVI 323 Query: 901 FPPRWLVAEHTFRPPYFHRNCMSEFMGLIYGQYEAKANGFLPGGGSLHNCMTPHGPDSAT 1080 FPPRWLVAEHTFRPPY+HRNCMSEFMGLIYG YEAKA+GFLPGG SLH+CMTPHGPD+ T Sbjct: 324 FPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTT 383 Query: 1081 YEKTIQGGNE-EPVKIKDTMAFMFESSMIPKVTQWALNSHTLDKDYYKCWIGLKSHF 1248 YE TI N P K+ TMAFMFES++IP+V WAL S LD +YY+CWIGLKSHF Sbjct: 384 YEATIARVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHEYYQCWIGLKSHF 440