BLASTX nr result

ID: Catharanthus23_contig00005742 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00005742
         (1929 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006358956.1| PREDICTED: LOW QUALITY PROTEIN: homogentisat...   835   0.0  
ref|XP_004251883.1| PREDICTED: LOW QUALITY PROTEIN: homogentisat...   834   0.0  
gb|AAF73132.1|AF149017_1 homogentisate 1,2-dioxygenase [Solanum ...   834   0.0  
gb|EXB75014.1| Homogentisate 1,2-dioxygenase [Morus notabilis]        830   0.0  
ref|XP_006494848.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   823   0.0  
ref|XP_002285298.1| PREDICTED: homogentisate 1,2-dioxygenase [Vi...   815   0.0  
ref|XP_002518387.1| homogentisate 1,2-dioxygenase, putative [Ric...   810   0.0  
gb|EOY13160.1| Homogentisate 1,2-dioxygenase isoform 1 [Theobrom...   807   0.0  
gb|EMJ15072.1| hypothetical protein PRUPE_ppa005219mg [Prunus pe...   805   0.0  
ref|XP_002864301.1| homogentisate 1,2-dioxygenase [Arabidopsis l...   795   0.0  
ref|XP_004137214.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   794   0.0  
gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]   791   0.0  
ref|XP_006280403.1| hypothetical protein CARUB_v10026329mg [Caps...   789   0.0  
ref|NP_200219.1| homogentisate 1,2-dioxygenase [Arabidopsis thal...   783   0.0  
ref|XP_003540068.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   781   0.0  
gb|AAD00360.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]   781   0.0  
ref|XP_006858313.1| hypothetical protein AMTR_s00064p00100410 [A...   779   0.0  
ref|XP_003527216.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   778   0.0  
ref|XP_004510037.1| PREDICTED: homogentisate 1,2-dioxygenase-lik...   778   0.0  
ref|XP_002298900.1| hypothetical protein POPTR_0001s38310g [Popu...   775   0.0  

>ref|XP_006358956.1| PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase-like
            [Solanum tuberosum]
          Length = 492

 Score =  835 bits (2157), Expect = 0.0
 Identities = 384/445 (86%), Positives = 416/445 (93%)
 Frame = +3

Query: 180  PENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLY 359
            P +L+YQ+GFGN+FSSEAI GALP+ QNSPL+CP+GLYAEQISGTSFTSPRKLNQRSWLY
Sbjct: 11   PSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLY 70

Query: 360  RIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVCG 539
            RIKPSVTHEPF+PR+P H KLVSEFNQSNS+ATPTQLRWKPVEIP+ PTDFIDGLYT+CG
Sbjct: 71   RIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYTICG 130

Query: 540  AGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVV 719
            AGSSYLRHGFAIHMYTANKSM+N AFCNADGDFLIVP+ GRLW+TTE G+LQV PGEIV+
Sbjct: 131  AGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGEIVI 190

Query: 720  IPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNP 899
            +PQG+RFA+DLPDGPSRGYVAE FG H QLPDLGPIGANGLA+PRDFLVPVAW+E  S P
Sbjct: 191  LPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYEDGSRP 250

Query: 900  GYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVL 1079
            GYTIVQK+GGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL+DHSDPSINTVL
Sbjct: 251  GYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVL 310

Query: 1080 TAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGG 1259
            TAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLI G YEAKADGF PGG
Sbjct: 311  TAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLINGGYEAKADGFHPGG 370

Query: 1260 ASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDH 1439
            ASLHSCMTPHGPDTK+YEATI+LGNEAGP +I  TMAFMFESCLIPRVC WALES FMDH
Sbjct: 371  ASLHSCMTPHGPDTKTYEATIALGNEAGPHRIADTMAFMFESCLIPRVCPWALESPFMDH 430

Query: 1440 DYYQCWIGLKSHFTHETTNEETGDV 1514
            DYYQCWIGLKSHF+  + NE+  D+
Sbjct: 431  DYYQCWIGLKSHFSGLSMNEDNVDL 455


>ref|XP_004251883.1| PREDICTED: LOW QUALITY PROTEIN: homogentisate 1,2-dioxygenase
            [Solanum lycopersicum]
          Length = 480

 Score =  834 bits (2154), Expect = 0.0
 Identities = 382/445 (85%), Positives = 416/445 (93%)
 Frame = +3

Query: 180  PENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLY 359
            P +L+YQ+GFGN+FSSEAI GALP+ QNSPL+CP+GLYAEQISGTSFTSPRKLNQRSWLY
Sbjct: 11   PSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLY 70

Query: 360  RIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVCG 539
            RIKPSVTHEPF+PR+P H KLVSEFNQSNS+ATPTQLRWKPVEIP+ PTDFIDGLYT+CG
Sbjct: 71   RIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYTICG 130

Query: 540  AGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVV 719
            AGSSYLRHGFAIHMYTANKSM+N AFCNADGDFLIVP+ GRLW+TTE G+LQV PGEIV+
Sbjct: 131  AGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGEIVI 190

Query: 720  IPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNP 899
            +PQG+RFA+DLPDGPSRGYVAE FG H QLPDLGPIGANGLA+PRDFLVPVAW+   S P
Sbjct: 191  LPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYGDGSRP 250

Query: 900  GYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVL 1079
            GYTIVQK+GGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL+DHSDPSINTVL
Sbjct: 251  GYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVL 310

Query: 1080 TAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGG 1259
            TAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGF PGG
Sbjct: 311  TAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFHPGG 370

Query: 1260 ASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDH 1439
            ASLHSCMTPHGPDTK++EATI+LGNEAGP +I  TMAFMFESCL+PRVC WALES FMDH
Sbjct: 371  ASLHSCMTPHGPDTKTFEATIALGNEAGPHRIADTMAFMFESCLVPRVCPWALESPFMDH 430

Query: 1440 DYYQCWIGLKSHFTHETTNEETGDV 1514
            DYYQCWIGLKSHF+  + NE+  D+
Sbjct: 431  DYYQCWIGLKSHFSGLSMNEDNVDL 455


>gb|AAF73132.1|AF149017_1 homogentisate 1,2-dioxygenase [Solanum lycopersicum]
          Length = 477

 Score =  834 bits (2154), Expect = 0.0
 Identities = 382/445 (85%), Positives = 416/445 (93%)
 Frame = +3

Query: 180  PENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLY 359
            P +L+YQ+GFGN+FSSEAI GALP+ QNSPL+CP+GLYAEQISGTSFTSPRKLNQRSWLY
Sbjct: 8    PSDLEYQTGFGNHFSSEAIVGALPQGQNSPLICPFGLYAEQISGTSFTSPRKLNQRSWLY 67

Query: 360  RIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVCG 539
            RIKPSVTHEPF+PR+P H KLVSEFNQSNS+ATPTQLRWKPVEIP+ PTDFIDGLYT+CG
Sbjct: 68   RIKPSVTHEPFRPRMPRHEKLVSEFNQSNSSATPTQLRWKPVEIPETPTDFIDGLYTICG 127

Query: 540  AGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVV 719
            AGSSYLRHGFAIHMYTANKSM+N AFCNADGDFLIVP+ GRLW+TTE G+LQV PGEIV+
Sbjct: 128  AGSSYLRHGFAIHMYTANKSMENSAFCNADGDFLIVPQKGRLWITTECGRLQVCPGEIVI 187

Query: 720  IPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNP 899
            +PQG+RFA+DLPDGPSRGYVAE FG H QLPDLGPIGANGLA+PRDFLVPVAW+   S P
Sbjct: 188  LPQGYRFAVDLPDGPSRGYVAETFGTHLQLPDLGPIGANGLAAPRDFLVPVAWYGDGSRP 247

Query: 900  GYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVL 1079
            GYTIVQK+GGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL+DHSDPSINTVL
Sbjct: 248  GYTIVQKYGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLMDHSDPSINTVL 307

Query: 1080 TAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGG 1259
            TAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGF PGG
Sbjct: 308  TAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFHPGG 367

Query: 1260 ASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDH 1439
            ASLHSCMTPHGPDTK++EATI+LGNEAGP +I  TMAFMFESCL+PRVC WALES FMDH
Sbjct: 368  ASLHSCMTPHGPDTKTFEATIALGNEAGPHRIADTMAFMFESCLVPRVCPWALESPFMDH 427

Query: 1440 DYYQCWIGLKSHFTHETTNEETGDV 1514
            DYYQCWIGLKSHF+  + NE+  D+
Sbjct: 428  DYYQCWIGLKSHFSGLSMNEDNVDL 452


>gb|EXB75014.1| Homogentisate 1,2-dioxygenase [Morus notabilis]
          Length = 460

 Score =  830 bits (2145), Expect = 0.0
 Identities = 379/445 (85%), Positives = 411/445 (92%)
 Frame = +3

Query: 177  SPENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWL 356
            S E L YQSG+GN+FSSEA+AGALP  QNSPL+CPY LYAEQISGTSFTSPRKLN RSWL
Sbjct: 6    SLEELSYQSGYGNSFSSEALAGALPHGQNSPLLCPYSLYAEQISGTSFTSPRKLNLRSWL 65

Query: 357  YRIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVC 536
            YRIKPSVTHEPFKPR+P HGKL+SEF++SNS+ATPTQLRWKPVEIP  PTDF+DGL+TVC
Sbjct: 66   YRIKPSVTHEPFKPRVPSHGKLLSEFDRSNSSATPTQLRWKPVEIPDSPTDFVDGLFTVC 125

Query: 537  GAGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIV 716
            GAGSS+LRHGFA+HMYTANKSMDNCAFCNADGDFLIVP+ GRLW+TTE GKLQV PGE+ 
Sbjct: 126  GAGSSFLRHGFAVHMYTANKSMDNCAFCNADGDFLIVPQKGRLWITTECGKLQVSPGEVA 185

Query: 717  VIPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSN 896
            ++PQGFRFA+DLPDGPSRGYVAEIFG HFQLPDLGPIGANGLA+PRDFL P AWFE    
Sbjct: 186  ILPQGFRFAVDLPDGPSRGYVAEIFGAHFQLPDLGPIGANGLAAPRDFLAPTAWFEDGRR 245

Query: 897  PGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTV 1076
            PGYTIVQKFGGELFTAKQDFSPFNVVAWHGN+VPYKYDLSKFCPYNTVL+DHSDPSINTV
Sbjct: 246  PGYTIVQKFGGELFTAKQDFSPFNVVAWHGNHVPYKYDLSKFCPYNTVLVDHSDPSINTV 305

Query: 1077 LTAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPG 1256
            LTAPTDKPGVALLDFV+FPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPG
Sbjct: 306  LTAPTDKPGVALLDFVVFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPG 365

Query: 1257 GASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMD 1436
            G+SLHSCMTPHGPDTK+YEATI+ GNE GP +I  TMAFMFESCL+PRVC WALES FMD
Sbjct: 366  GSSLHSCMTPHGPDTKTYEATIARGNEPGPFRIKDTMAFMFESCLMPRVCAWALESPFMD 425

Query: 1437 HDYYQCWIGLKSHFTHETTNEETGD 1511
            HDYYQCWIGL+SHFT E+ N  + D
Sbjct: 426  HDYYQCWIGLRSHFTWESRNATSKD 450


>ref|XP_006494848.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Citrus sinensis]
          Length = 478

 Score =  823 bits (2126), Expect = 0.0
 Identities = 374/440 (85%), Positives = 410/440 (93%)
 Frame = +3

Query: 186  NLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 365
            +L+Y+SGFGN+FSSEAI GALP+ QNSPLVCP+GLYAEQISGTSFTSPRKLNQRSWLYRI
Sbjct: 23   DLNYESGFGNSFSSEAIDGALPRGQNSPLVCPFGLYAEQISGTSFTSPRKLNQRSWLYRI 82

Query: 366  KPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVCGAG 545
            KPS THEPFKPR+P HGKLVSEF++SNS  TPTQLRWKPV+IP  PTDFIDGLYT+CGAG
Sbjct: 83   KPSATHEPFKPRVPAHGKLVSEFDKSNSYTTPTQLRWKPVDIPDSPTDFIDGLYTICGAG 142

Query: 546  SSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVVIP 725
            SS+LRHG+AIHMYTANKSMDNCAFCNADGDFL+VP+ GRLW+ TE GKL+V PGEI V+P
Sbjct: 143  SSFLRHGYAIHMYTANKSMDNCAFCNADGDFLVVPQKGRLWIATECGKLEVSPGEIAVLP 202

Query: 726  QGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNPGY 905
            QGFRFA+ LPDGPSRGY+AEIFG HFQLPDLGPIGANGLA+PRDFLVP AWFE+ S  GY
Sbjct: 203  QGFRFAVSLPDGPSRGYIAEIFGTHFQLPDLGPIGANGLAAPRDFLVPTAWFEEGSRLGY 262

Query: 906  TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVLTA 1085
            TIVQKFGGELFTA+QDFSPFNVVAWHGNYVPYKYDLSKFCP+NTVL+DH DPSINTVLTA
Sbjct: 263  TIVQKFGGELFTARQDFSPFNVVAWHGNYVPYKYDLSKFCPFNTVLVDHGDPSINTVLTA 322

Query: 1086 PTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGGAS 1265
            PTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLI G YEAKADGFLPGGAS
Sbjct: 323  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIRGGYEAKADGFLPGGAS 382

Query: 1266 LHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDHDY 1445
            LHSCMTPHGPDTK+YEATI+ G+EAGP KIT TMAFMFESCLIPR+C WALES FMDHDY
Sbjct: 383  LHSCMTPHGPDTKTYEATIARGSEAGPYKITDTMAFMFESCLIPRICPWALESPFMDHDY 442

Query: 1446 YQCWIGLKSHFTHETTNEET 1505
            Y+CWIGL+SHF++E  + E+
Sbjct: 443  YRCWIGLRSHFSYEEADNES 462


>ref|XP_002285298.1| PREDICTED: homogentisate 1,2-dioxygenase [Vitis vinifera]
            gi|302142933|emb|CBI20228.3| unnamed protein product
            [Vitis vinifera]
          Length = 463

 Score =  815 bits (2104), Expect = 0.0
 Identities = 373/449 (83%), Positives = 408/449 (90%)
 Frame = +3

Query: 180  PENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLY 359
            P +L YQ GFGN+ SSEAIAGALP+ QN+PL CP+GLYAEQISGT FT+PRK NQ SWLY
Sbjct: 15   PSDLQYQFGFGNHLSSEAIAGALPRGQNNPLTCPFGLYAEQISGTPFTAPRKQNQFSWLY 74

Query: 360  RIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVCG 539
            RIKPSVTHEPFKPR+P HGKLVSEFNQSNS+  PTQLRWKPVEIP  PTDFIDGLYTVCG
Sbjct: 75   RIKPSVTHEPFKPRVPSHGKLVSEFNQSNSSTNPTQLRWKPVEIPDSPTDFIDGLYTVCG 134

Query: 540  AGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVV 719
            AGSS+LRHG+AIHMYTANKSMDNCAFCNADGDFLIVP+ GRL +TTE GKLQV PGEIVV
Sbjct: 135  AGSSFLRHGYAIHMYTANKSMDNCAFCNADGDFLIVPQKGRLSITTECGKLQVSPGEIVV 194

Query: 720  IPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNP 899
            +P GFRF +DLPDGPSRGYVAEIFG HFQLPDLGPIGANGLA+ RDFLVPVAW+E+ S P
Sbjct: 195  LPHGFRFVVDLPDGPSRGYVAEIFGAHFQLPDLGPIGANGLAASRDFLVPVAWYEECSRP 254

Query: 900  GYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVL 1079
            GYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCP NTVL DH+DPSINTVL
Sbjct: 255  GYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPVNTVLKDHADPSINTVL 314

Query: 1080 TAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGG 1259
            TAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPGG
Sbjct: 315  TAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGG 374

Query: 1260 ASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDH 1439
            ASLHSCMTPHGPDTK++EAT++ G +AGP +IT TMAFMFESCLIPR+C WAL+S  +DH
Sbjct: 375  ASLHSCMTPHGPDTKTFEATVAHGKDAGPFRITNTMAFMFESCLIPRICPWALDSPSIDH 434

Query: 1440 DYYQCWIGLKSHFTHETTNEETGDVHNSH 1526
            DYYQCW+GL+SHF+ E  ++E+  + N H
Sbjct: 435  DYYQCWVGLRSHFSREEASDESQTIQNGH 463


>ref|XP_002518387.1| homogentisate 1,2-dioxygenase, putative [Ricinus communis]
            gi|223542482|gb|EEF44023.1| homogentisate
            1,2-dioxygenase, putative [Ricinus communis]
          Length = 457

 Score =  810 bits (2093), Expect = 0.0
 Identities = 369/440 (83%), Positives = 406/440 (92%), Gaps = 1/440 (0%)
 Frame = +3

Query: 192  DYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLYRIKP 371
            DY SGFGN F SEAI GALP+ QNSPL+CPYGLYAEQISG+SFTSPRKL+QRSWLYRIKP
Sbjct: 17   DYLSGFGNTFESEAIHGALPRGQNSPLICPYGLYAEQISGSSFTSPRKLSQRSWLYRIKP 76

Query: 372  SVTHEPFKPRIPCHGKLVSEFNQSNSAAT-PTQLRWKPVEIPKQPTDFIDGLYTVCGAGS 548
            SVTHEPFKPR+P HGK+VSEF++++S  T PTQLRWKPV+IP  PTDFIDGL+T+CGAGS
Sbjct: 77   SVTHEPFKPRVPSHGKIVSEFDKTDSCTTTPTQLRWKPVDIPDSPTDFIDGLFTICGAGS 136

Query: 549  SYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVVIPQ 728
            S+LRHGFAIHMYTANKSM NCA CNADGDFL+VP+ GRLW+TTE GKLQV PGE+VV+PQ
Sbjct: 137  SFLRHGFAIHMYTANKSMGNCALCNADGDFLVVPQEGRLWITTECGKLQVSPGEVVVLPQ 196

Query: 729  GFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNPGYT 908
            GFRFA+DLPDGPSRGYVAEIFG HFQLPDLGPIGANGLA+PRDFLVP AW+E+   PGYT
Sbjct: 197  GFRFAVDLPDGPSRGYVAEIFGTHFQLPDLGPIGANGLAAPRDFLVPKAWYEEGPCPGYT 256

Query: 909  IVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVLTAP 1088
            I+QKFGGELFTAKQDFSPFNVVAWHGN+VPYKYDL KFCPYNTVLIDHSDPSINTVLTA 
Sbjct: 257  IIQKFGGELFTAKQDFSPFNVVAWHGNFVPYKYDLKKFCPYNTVLIDHSDPSINTVLTAS 316

Query: 1089 TDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGGASL 1268
            TDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGF+PGGASL
Sbjct: 317  TDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFVPGGASL 376

Query: 1269 HSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDHDYY 1448
            HSCMTPHGPDTK+YEATI+ GN+AGP +IT TMAFMFESCLIPR+C WA+ES F+DHDYY
Sbjct: 377  HSCMTPHGPDTKTYEATIARGNDAGPSRITDTMAFMFESCLIPRICLWAVESPFIDHDYY 436

Query: 1449 QCWIGLKSHFTHETTNEETG 1508
            QCWIGLKSHF+H   ++  G
Sbjct: 437  QCWIGLKSHFSHGADSKNGG 456


>gb|EOY13160.1| Homogentisate 1,2-dioxygenase isoform 1 [Theobroma cacao]
          Length = 451

 Score =  807 bits (2084), Expect = 0.0
 Identities = 368/442 (83%), Positives = 402/442 (90%)
 Frame = +3

Query: 156  ENNKIKLSPENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRK 335
            + N + + PE+L+YQSGFGN+FSSEAIAGALP+ QNSPL+CP+GLYAEQISGTSFTSPRK
Sbjct: 10   KGNGLGVFPEDLEYQSGFGNHFSSEAIAGALPRGQNSPLICPFGLYAEQISGTSFTSPRK 69

Query: 336  LNQRSWLYRIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFI 515
            LNQRSWLYRIKPSVTHEPF PR   H KLVSEF+ SN+ A PTQLRWKPV+IP  PTDFI
Sbjct: 70   LNQRSWLYRIKPSVTHEPFWPRDSSHKKLVSEFDGSNTVANPTQLRWKPVDIPDTPTDFI 129

Query: 516  DGLYTVCGAGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQ 695
            DGL+T+CGAGSS+LRHG+AIHMYTANKSMDNCAFCNADGDFL+VP+ GRLW+TTE G+LQ
Sbjct: 130  DGLFTICGAGSSFLRHGYAIHMYTANKSMDNCAFCNADGDFLVVPQQGRLWITTECGRLQ 189

Query: 696  VVPGEIVVIPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVA 875
            V PGEI V+PQGFRF +DLPDGPSRGYVAE+FG HFQLPDLGPIGANGLA+ RDFL P A
Sbjct: 190  VSPGEIAVLPQGFRFVVDLPDGPSRGYVAEVFGTHFQLPDLGPIGANGLAASRDFLAPTA 249

Query: 876  WFEQHSNPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHS 1055
            WFE+H  PG+TIVQKFGGELF A+QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL+DH 
Sbjct: 250  WFEEHPRPGFTIVQKFGGELFNARQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLVDHG 309

Query: 1056 DPSINTVLTAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAK 1235
            DPSINTVLTAPTDKPGVALLDFVIFP RW+VAEHTFRPPYYHRNCMSEFMGLIYG YEAK
Sbjct: 310  DPSINTVLTAPTDKPGVALLDFVIFPSRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAK 369

Query: 1236 ADGFLPGGASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWA 1415
            ADGFLPGGASLHSCMTPHGPDTK+YEATI+ G EAGP KIT TMAFMFES L+PR C W 
Sbjct: 370  ADGFLPGGASLHSCMTPHGPDTKTYEATIARGYEAGPHKITDTMAFMFESFLMPRTCPWV 429

Query: 1416 LESSFMDHDYYQCWIGLKSHFT 1481
            LES F DHDYYQCW+GLKSHF+
Sbjct: 430  LESPFRDHDYYQCWVGLKSHFS 451


>gb|EMJ15072.1| hypothetical protein PRUPE_ppa005219mg [Prunus persica]
          Length = 472

 Score =  805 bits (2080), Expect = 0.0
 Identities = 365/445 (82%), Positives = 401/445 (90%)
 Frame = +3

Query: 186  NLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 365
            +L YQSGF N+FSSEA+ G LP  Q+SPL+CPYGLYAEQISGTSFTSPRKLN R+WLYR+
Sbjct: 23   DLQYQSGFHNHFSSEALPGTLPHGQSSPLLCPYGLYAEQISGTSFTSPRKLNHRTWLYRV 82

Query: 366  KPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVCGAG 545
            KPSVTHEPFKP    H KLVSEF  SNS+ TPTQLRWKPV+IP+ PTDF++GLYTVCGAG
Sbjct: 83   KPSVTHEPFKPLESSHRKLVSEFTDSNSSTTPTQLRWKPVDIPETPTDFVEGLYTVCGAG 142

Query: 546  SSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVVIP 725
            SS+LRHGFAIHMYTANKSMDNCAFCNADGDFLIVP+ GRLW+TTE GKLQ+ PGEI V+P
Sbjct: 143  SSFLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPQTGRLWITTECGKLQISPGEIAVLP 202

Query: 726  QGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNPGY 905
            QGFRFA+DLPDGPSRGYVAE+FG HFQLPDLGPIGANGLA+PRDFLVP AWFE    PGY
Sbjct: 203  QGFRFAVDLPDGPSRGYVAEVFGTHFQLPDLGPIGANGLAAPRDFLVPTAWFEDSYRPGY 262

Query: 906  TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVLTA 1085
             I+QKFGGELFTAKQ+FSPFNVVAWHGNY PYKYDL+ FCP+NTVL DH DPSINTVLTA
Sbjct: 263  VIIQKFGGELFTAKQEFSPFNVVAWHGNYAPYKYDLTTFCPFNTVLFDHGDPSINTVLTA 322

Query: 1086 PTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGGAS 1265
            PTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPGGAS
Sbjct: 323  PTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGGAS 382

Query: 1266 LHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDHDY 1445
            LHSCMTPHGPDTK+YEATI+ GNEAGP +I+ T+AFMFESCLIPR+C WALES F+D DY
Sbjct: 383  LHSCMTPHGPDTKTYEATIARGNEAGPSRISDTLAFMFESCLIPRICPWALESPFIDRDY 442

Query: 1446 YQCWIGLKSHFTHETTNEETGDVHN 1520
            YQCWIGL+SHFT E  + + GD+ N
Sbjct: 443  YQCWIGLRSHFTREGASAKDGDIQN 467


>ref|XP_002864301.1| homogentisate 1,2-dioxygenase [Arabidopsis lyrata subsp. lyrata]
            gi|297310136|gb|EFH40560.1| homogentisate 1,2-dioxygenase
            [Arabidopsis lyrata subsp. lyrata]
          Length = 461

 Score =  795 bits (2052), Expect = 0.0
 Identities = 369/449 (82%), Positives = 398/449 (88%)
 Frame = +3

Query: 153  MENNKIKLSPENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPR 332
            ME  K KL  E+L YQSGFGN+FSSEAIAGALP  QNSPL+CPYGLYAEQISGTSFTSPR
Sbjct: 1    MEEKKKKL--EDLKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPR 58

Query: 333  KLNQRSWLYRIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDF 512
            KLNQRSWLYRIKPSVTHEPFKPR+P H KLVSEF+ SNS   PTQLRW+P +IP+  TDF
Sbjct: 59   KLNQRSWLYRIKPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPESETDF 118

Query: 513  IDGLYTVCGAGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKL 692
            +DGLYT+CGAGSS+LRHGFAIHMY ANK M N AFCNADGDFL+VP+ GRLW+ TE G+L
Sbjct: 119  VDGLYTICGAGSSFLRHGFAIHMYVANKGMKNSAFCNADGDFLLVPQTGRLWIETECGRL 178

Query: 693  QVVPGEIVVIPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPV 872
             V PGEI VIPQGFRF++DLPDG SRGYVAEI+G HFQLPDLGPIGANGLA+PRDFL P 
Sbjct: 179  LVTPGEIAVIPQGFRFSVDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPT 238

Query: 873  AWFEQHSNPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDH 1052
            AWFE+   P YTIVQKFG ELFTAKQDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL+DH
Sbjct: 239  AWFEEGLRPEYTIVQKFGAELFTAKQDFSPFNVVAWHGNYVPYKYDLQKFCPYNTVLLDH 298

Query: 1053 SDPSINTVLTAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEA 1232
             DPSINTVLTAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG+YEA
Sbjct: 299  GDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEA 358

Query: 1233 KADGFLPGGASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHW 1412
            KADGFLPGGASLHSCMTPHGPDT +YEATI+  N   P K+TGTMAFMFES LIPRVCHW
Sbjct: 359  KADGFLPGGASLHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHW 418

Query: 1413 ALESSFMDHDYYQCWIGLKSHFTHETTNE 1499
            ALES F+DHDYYQCWIGLKSHF+    N+
Sbjct: 419  ALESPFLDHDYYQCWIGLKSHFSRIDLNK 447


>ref|XP_004137214.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Cucumis sativus]
            gi|449524824|ref|XP_004169421.1| PREDICTED: homogentisate
            1,2-dioxygenase-like [Cucumis sativus]
          Length = 471

 Score =  794 bits (2051), Expect = 0.0
 Identities = 362/440 (82%), Positives = 398/440 (90%)
 Frame = +3

Query: 180  PENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLY 359
            P +L Y SGF N+FSSEAI GALP++QNSPL+CP+GLYAEQISGTSFTSPRK N  SWLY
Sbjct: 15   PSDLPYLSGFNNHFSSEAIPGALPQSQNSPLICPFGLYAEQISGTSFTSPRKANLCSWLY 74

Query: 360  RIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVCG 539
            RIKPSVTHEPF+ R+P + KL+SEFN SN ++TPTQLRWKP + P  P DF+DGLYTVCG
Sbjct: 75   RIKPSVTHEPFRQRLPKNEKLISEFNASNCSSTPTQLRWKPADFPDSPVDFVDGLYTVCG 134

Query: 540  AGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVV 719
            AGSS+LRHGFAIHMYTANKSM+NCAFCNADGDFLIVP++G+LW+ TE G+L+V PGE+VV
Sbjct: 135  AGSSFLRHGFAIHMYTANKSMENCAFCNADGDFLIVPQSGKLWIITECGRLEVSPGEVVV 194

Query: 720  IPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNP 899
            +PQGFRF + LPDGPSRGYVAEIFG HFQLPDLGPIGANGLA+PRDFL PVAWFE    P
Sbjct: 195  LPQGFRFVVYLPDGPSRGYVAEIFGSHFQLPDLGPIGANGLAAPRDFLAPVAWFENSPRP 254

Query: 900  GYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVL 1079
            GYTI+QKFGGELFTA QDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL DHSDPSINTVL
Sbjct: 255  GYTIIQKFGGELFTAIQDFSPFNVVAWHGNYVPYKYDLCKFCPYNTVLFDHSDPSINTVL 314

Query: 1080 TAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGG 1259
            TAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGF+PGG
Sbjct: 315  TAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFVPGG 374

Query: 1260 ASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDH 1439
            ASLHSCMTPHGPDTK+YEATI+ GN+AGP KI+GTMAFMFES LIPRVC WALES F+DH
Sbjct: 375  ASLHSCMTPHGPDTKTYEATIARGNDAGPHKISGTMAFMFESSLIPRVCSWALESPFIDH 434

Query: 1440 DYYQCWIGLKSHFTHETTNE 1499
            DYYQCWIGLKSHF +E   +
Sbjct: 435  DYYQCWIGLKSHFKNEAIGD 454


>gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
          Length = 461

 Score =  791 bits (2042), Expect = 0.0
 Identities = 371/453 (81%), Positives = 398/453 (87%), Gaps = 3/453 (0%)
 Frame = +3

Query: 153  MENNKIKLSPENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPR 332
            ME  K KL  E L YQSGFGN+FSSEAIAGALP  QNSPL+CPYGLYAEQISGTSFTSPR
Sbjct: 1    MEEKKKKL--EELKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPR 58

Query: 333  KLNQRSWLYRIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDF 512
            KLNQRSWLYRIKPSVTHEPFKPR+P H KLVSEF+ SNS   PTQLRW+P +IP   TDF
Sbjct: 59   KLNQRSWLYRIKPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPDSETDF 118

Query: 513  IDGLYTVCGAGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKL 692
            +DGL+T+CGAGSS+LRHGFAIHMY ANK M + AFCNADGDFL+VP+ GRLW+ TE G+L
Sbjct: 119  VDGLFTICGAGSSFLRHGFAIHMYVANKGMKDSAFCNADGDFLLVPQTGRLWIETECGRL 178

Query: 693  QVVPGEIVVIPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPV 872
             V PGEI VIPQGFRF+IDLPDG SRGYVAEI+G HFQLPDLGPIGANGLA+PRDFL P 
Sbjct: 179  LVSPGEIAVIPQGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLAPT 238

Query: 873  AWFEQHSNPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDH 1052
            AWFE    P YTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL+DH
Sbjct: 239  AWFEDGLRPEYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDH 298

Query: 1053 SDPSINTVLTAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEA 1232
             DPSINTVLTAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG+YEA
Sbjct: 299  GDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEA 358

Query: 1233 KADGFLPGGASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHW 1412
            KADGFLPGGASLHSCMTPHGPDT +YEATI+  N   P K+TGTMAFMFES LIPRVCHW
Sbjct: 359  KADGFLPGGASLHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHW 418

Query: 1413 ALESSFMDHDYYQCWIGLKSHFTH---ETTNEE 1502
            ALES F+DHDYYQCWIGLKSHF+    + TN E
Sbjct: 419  ALESPFLDHDYYQCWIGLKSHFSRISLDKTNVE 451


>ref|XP_006280403.1| hypothetical protein CARUB_v10026329mg [Capsella rubella]
            gi|482549107|gb|EOA13301.1| hypothetical protein
            CARUB_v10026329mg [Capsella rubella]
          Length = 476

 Score =  789 bits (2037), Expect = 0.0
 Identities = 366/451 (81%), Positives = 396/451 (87%)
 Frame = +3

Query: 147  ILMENNKIKLSPENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTS 326
            + ME  K K   E L YQSGFGN+FSSEAIAGALP  QNSPL+CPYGLYAEQISGTSFTS
Sbjct: 13   VAMEEVK-KKKLEELKYQSGFGNHFSSEAIAGALPLDQNSPLICPYGLYAEQISGTSFTS 71

Query: 327  PRKLNQRSWLYRIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPT 506
            PRKLNQRSWLYRIKPSVTHEPFKPR+P H KLVSEF+ SNS   PTQLRW+P +IP+  T
Sbjct: 72   PRKLNQRSWLYRIKPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPESAT 131

Query: 507  DFIDGLYTVCGAGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFG 686
            DF+DGLYT+CGAGSS+LRHGFAIHMY ANK M + AFCNADGDFL+VP+ GRLW+ TE G
Sbjct: 132  DFVDGLYTICGAGSSFLRHGFAIHMYVANKGMKDSAFCNADGDFLLVPQAGRLWIETECG 191

Query: 687  KLQVVPGEIVVIPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLV 866
            +L V PGEI VIPQGFRF+IDLPDG SRGYVAEI+G HFQLPDLGPIGANGLA+PRDFL 
Sbjct: 192  RLLVSPGEIAVIPQGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAAPRDFLA 251

Query: 867  PVAWFEQHSNPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLI 1046
            P AWFE    P YTI+QKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDL KFCPYN VL+
Sbjct: 252  PTAWFEDAVRPDYTIIQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLQKFCPYNAVLL 311

Query: 1047 DHSDPSINTVLTAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSY 1226
            DH DPS+NTVLTAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG+Y
Sbjct: 312  DHGDPSVNTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAY 371

Query: 1227 EAKADGFLPGGASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVC 1406
            EAKADGFLPGGASLHSCMTPHGPDT +YEATI+  N   P K+TGTMAFMFES LIPRVC
Sbjct: 372  EAKADGFLPGGASLHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVC 431

Query: 1407 HWALESSFMDHDYYQCWIGLKSHFTHETTNE 1499
            HWALES F+DHDYYQCWIGLKSHF+    N+
Sbjct: 432  HWALESPFLDHDYYQCWIGLKSHFSRIDLNK 462


>ref|NP_200219.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
            gi|30696407|ref|NP_851187.1| homogentisate
            1,2-dioxygenase [Arabidopsis thaliana]
            gi|13432134|sp|Q9ZRA2.2|HGD_ARATH RecName:
            Full=Homogentisate 1,2-dioxygenase; AltName:
            Full=Homogentisate oxygenase; AltName: Full=Homogentisic
            acid oxidase; AltName: Full=Homogentisicase
            gi|7108615|gb|AAF36499.1|AF130845_1 homogentisate
            1,2-dioxygenase [Arabidopsis thaliana]
            gi|8809579|dbj|BAA97130.1| homogentisate 1,2-dioxygenase
            [Arabidopsis thaliana] gi|22655252|gb|AAM98216.1|
            homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
            gi|33942055|gb|AAQ55280.1| At5g54080 [Arabidopsis
            thaliana] gi|332009064|gb|AED96447.1| homogentisate
            1,2-dioxygenase [Arabidopsis thaliana]
            gi|332009065|gb|AED96448.1| homogentisate 1,2-dioxygenase
            [Arabidopsis thaliana]
          Length = 461

 Score =  783 bits (2021), Expect = 0.0
 Identities = 367/460 (79%), Positives = 397/460 (86%), Gaps = 8/460 (1%)
 Frame = +3

Query: 153  MENNKIKLSPENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPR 332
            ME  K +L  E L YQSGFGN+FSSEAIAGALP  QNSPL+CPYGLYAEQISGTSFTSPR
Sbjct: 1    MEEKKKEL--EELKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPR 58

Query: 333  KLNQRSWLYRIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDF 512
            KLNQRSWLYR+KPSVTHEPFKPR+P H KLVSEF+ SNS   PTQLRW+P +IP    DF
Sbjct: 59   KLNQRSWLYRVKPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPDSEIDF 118

Query: 513  IDGLYTVCGAGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKL 692
            +DGL+T+CGAGSS+LRHGFAIHMY AN  M + AFCNADGDFL+VP+ GRLW+ TE G+L
Sbjct: 119  VDGLFTICGAGSSFLRHGFAIHMYVANTGMKDSAFCNADGDFLLVPQTGRLWIETECGRL 178

Query: 693  QVVPGEIVVIPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPV 872
             V PGEI VIPQGFRF+IDLPDG SRGYVAEI+G HFQLPDLGPIGANGLA+ RDFL P 
Sbjct: 179  LVTPGEIAVIPQGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAASRDFLAPT 238

Query: 873  AWFEQHSNPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDH 1052
            AWFE    P YTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL+DH
Sbjct: 239  AWFEDGLRPEYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDH 298

Query: 1053 SDPSINTVLTAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEA 1232
             DPSINTVLTAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG+YEA
Sbjct: 299  GDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEA 358

Query: 1233 KADGFLPGGASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHW 1412
            KADGFLPGGASLHSCMTPHGPDT +YEATI+  N   P K+TGTMAFMFES LIPRVCHW
Sbjct: 359  KADGFLPGGASLHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHW 418

Query: 1413 ALESSFMDHDYYQCWIGLKSHFTH--------ETTNEETG 1508
            ALES F+DHDYYQCWIGLKSHF+         E+T +E G
Sbjct: 419  ALESPFLDHDYYQCWIGLKSHFSRISLDKTNVESTEKEPG 458


>ref|XP_003540068.1| PREDICTED: homogentisate 1,2-dioxygenase-like isoform X1 [Glycine
            max] gi|571493465|ref|XP_006592560.1| PREDICTED:
            homogentisate 1,2-dioxygenase-like isoform X2 [Glycine
            max]
          Length = 455

 Score =  781 bits (2018), Expect = 0.0
 Identities = 363/442 (82%), Positives = 394/442 (89%)
 Frame = +3

Query: 195  YQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLYRIKPS 374
            Y SGFGN+FSSEA+AGALP AQNSPLVCPYGLYAEQISGTSFTSPR  N  SW YRIKPS
Sbjct: 12   YLSGFGNHFSSEALAGALPVAQNSPLVCPYGLYAEQISGTSFTSPRNRNLFSWFYRIKPS 71

Query: 375  VTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVCGAGSSY 554
            VTHEPFKPR+P +G+++SEFN SNS+A PTQLRWKP++ P  PTDFIDGL TVCG+GSS+
Sbjct: 72   VTHEPFKPRVPGNGRILSEFNNSNSSANPTQLRWKPLDAPDSPTDFIDGLSTVCGSGSSF 131

Query: 555  LRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVVIPQGF 734
            +RHG+AIHMYTANKSMDNCAFCNADGDFLIVP+ GRL VTTE G+L+V PGEI ++PQGF
Sbjct: 132  MRHGYAIHMYTANKSMDNCAFCNADGDFLIVPQQGRLLVTTECGRLKVSPGEIAILPQGF 191

Query: 735  RFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNPGYTIV 914
            RF+++LPDGPSRGYVAEIFG HFQLPDLGPIGANGLASPRDFLVP AWFE  S PGYTIV
Sbjct: 192  RFSVNLPDGPSRGYVAEIFGTHFQLPDLGPIGANGLASPRDFLVPTAWFEDKSYPGYTIV 251

Query: 915  QKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVLTAPTD 1094
            QKFGGELF A QDFSPFNVVAWHGNYVPY YDL+KFCPYNTVL DHSDPSINTVLTAPTD
Sbjct: 252  QKFGGELFDAVQDFSPFNVVAWHGNYVPYMYDLNKFCPYNTVLFDHSDPSINTVLTAPTD 311

Query: 1095 KPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGGASLHS 1274
            KPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLI+G YEAKADGFLPGGASLHS
Sbjct: 312  KPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIHGGYEAKADGFLPGGASLHS 371

Query: 1275 CMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDHDYYQC 1454
            CMTPHGPDTKSYEATI+ GN+ GP KIT TMAFMFES LIPR+  WA ES F+D DYYQC
Sbjct: 372  CMTPHGPDTKSYEATIARGNDVGPCKITDTMAFMFESSLIPRISQWASESPFLDQDYYQC 431

Query: 1455 WIGLKSHFTHETTNEETGDVHN 1520
            WIGLKSHF    T+ E   + N
Sbjct: 432  WIGLKSHFAVTKTSPENPSLGN 453


>gb|AAD00360.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
          Length = 461

 Score =  781 bits (2017), Expect = 0.0
 Identities = 366/460 (79%), Positives = 397/460 (86%), Gaps = 8/460 (1%)
 Frame = +3

Query: 153  MENNKIKLSPENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPR 332
            ME  K +L  E L YQSGFGN+FSSEAIAGALP  QNSPL+CPYGLYAEQISGTSFTSPR
Sbjct: 1    MEEKKKEL--EELKYQSGFGNHFSSEAIAGALPLDQNSPLLCPYGLYAEQISGTSFTSPR 58

Query: 333  KLNQRSWLYRIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDF 512
            KLNQRSWLYR+KPSVTHEPFKPR+P H KLVSEF+ SNS   PTQLRW+P +IP    DF
Sbjct: 59   KLNQRSWLYRVKPSVTHEPFKPRVPAHKKLVSEFDASNSRTNPTQLRWRPEDIPDSEIDF 118

Query: 513  IDGLYTVCGAGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKL 692
            +DGL+T+CGAGSS+LRHGFAIHMY AN  M + AFCNADGDFL+VP+ GRLW+ TE G+L
Sbjct: 119  VDGLFTICGAGSSFLRHGFAIHMYVANTGMKDSAFCNADGDFLLVPQTGRLWIETECGRL 178

Query: 693  QVVPGEIVVIPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPV 872
             V PGEI VIPQGFRF+IDLPDG SRGYVAEI+G HFQLPDLGPIGANGLA+ RDFL P 
Sbjct: 179  LVTPGEIAVIPQGFRFSIDLPDGKSRGYVAEIYGAHFQLPDLGPIGANGLAASRDFLAPT 238

Query: 873  AWFEQHSNPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDH 1052
            AWFE    P YTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDL KFCPYNTVL+DH
Sbjct: 239  AWFEDGLRPEYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLKKFCPYNTVLLDH 298

Query: 1053 SDPSINTVLTAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEA 1232
             DPSINTVLTAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG+YEA
Sbjct: 299  GDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGAYEA 358

Query: 1233 KADGFLPGGASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHW 1412
            KADGFLPGGASLHSCMTPHGPDT +YEATI+  N   P K+TGTMAFMFES LIPRVCHW
Sbjct: 359  KADGFLPGGASLHSCMTPHGPDTTTYEATIARVNAMAPSKLTGTMAFMFESALIPRVCHW 418

Query: 1413 ALESSFMDHDYYQCWIGLKSHFTH--------ETTNEETG 1508
            ALES F+DH+YYQCWIGLKSHF+         E+T +E G
Sbjct: 419  ALESPFLDHEYYQCWIGLKSHFSRISLDKTNVESTEKEPG 458


>ref|XP_006858313.1| hypothetical protein AMTR_s00064p00100410 [Amborella trichopoda]
            gi|548862420|gb|ERN19780.1| hypothetical protein
            AMTR_s00064p00100410 [Amborella trichopoda]
          Length = 471

 Score =  779 bits (2011), Expect = 0.0
 Identities = 349/438 (79%), Positives = 399/438 (91%)
 Frame = +3

Query: 186  NLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLYRI 365
            +L+YQSGFGN FSSEA+ GALP+ QNSPL+CP+GLYAEQISGT+FT+PRKLNQRSWLYRI
Sbjct: 11   SLEYQSGFGNVFSSEAMGGALPRDQNSPLLCPFGLYAEQISGTAFTAPRKLNQRSWLYRI 70

Query: 366  KPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVCGAG 545
            KPSVTHEPF PR+P H  LVSEFNQS+S+ATPTQLRWKP ++P+ PTDFIDGLYT+CGAG
Sbjct: 71   KPSVTHEPFHPRVPTHAHLVSEFNQSSSSATPTQLRWKPADVPESPTDFIDGLYTICGAG 130

Query: 546  SSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVVIP 725
            SS+LRHG+A+HMY ANKSMD+CAFC+ADGDFLIVP+ GRLW+TTE G+LQ+ PGEIVV+P
Sbjct: 131  SSFLRHGYAVHMYAANKSMDSCAFCSADGDFLIVPQKGRLWLTTECGRLQICPGEIVVLP 190

Query: 726  QGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNPGY 905
            QGFRF++DLPDGPSRGYVAE+FG HFQLP+LGPIGANGLA+ RDFLVP A+FE+  +PGY
Sbjct: 191  QGFRFSVDLPDGPSRGYVAEVFGTHFQLPELGPIGANGLAASRDFLVPTAFFEEEHHPGY 250

Query: 906  TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVLTA 1085
            TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCP+NTVL DH DPS+NTVLTA
Sbjct: 251  TIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPFNTVLFDHGDPSVNTVLTA 310

Query: 1086 PTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGGAS 1265
            P++KPGVAL+DFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG YEAK DGFLPGGAS
Sbjct: 311  PSEKPGVALVDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKKDGFLPGGAS 370

Query: 1266 LHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDHDY 1445
            LHSCMTPHGPDTK++EAT+S    + P +I  TMAFMFESCLIPR+C WALES  +D DY
Sbjct: 371  LHSCMTPHGPDTKTFEATVSCEKSSEPFRIADTMAFMFESCLIPRICPWALESPDLDPDY 430

Query: 1446 YQCWIGLKSHFTHETTNE 1499
            Y+CW+GLKSHF  +   +
Sbjct: 431  YKCWVGLKSHFLRKEVTQ 448


>ref|XP_003527216.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Glycine max]
          Length = 455

 Score =  778 bits (2009), Expect = 0.0
 Identities = 360/440 (81%), Positives = 394/440 (89%)
 Frame = +3

Query: 201  SGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLYRIKPSVT 380
            SGFGN+FSSEA+AGALP AQNSPLVCPYGLYAEQISGTSFTSPR  N  SW YRIKPSVT
Sbjct: 14   SGFGNHFSSEALAGALPAAQNSPLVCPYGLYAEQISGTSFTSPRNRNLFSWFYRIKPSVT 73

Query: 381  HEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDFIDGLYTVCGAGSSYLR 560
            HEPFKPR+P +G+++SEFN S+S+A PTQLRWKP++ P  P DFIDGL T+CG+GSS++R
Sbjct: 74   HEPFKPRVPGNGRILSEFNNSSSSANPTQLRWKPMDAPDSPMDFIDGLSTMCGSGSSFMR 133

Query: 561  HGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVVIPQGFRF 740
            HG+AIHMY ANKSMDNCAFCNADGDFLIVP+ GRL +TTE G+L+V PGEI +IP GFRF
Sbjct: 134  HGYAIHMYNANKSMDNCAFCNADGDFLIVPQQGRLLITTECGRLKVSPGEIAIIPHGFRF 193

Query: 741  AIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNPGYTIVQK 920
            +++LPDGPSRGYVAEIFG HFQLPDLGPIGANGLASPRDFLVP AWFE  S PGYTIVQK
Sbjct: 194  SVNLPDGPSRGYVAEIFGTHFQLPDLGPIGANGLASPRDFLVPSAWFEDKSYPGYTIVQK 253

Query: 921  FGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVLTAPTDKP 1100
            FGGELF A QDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVL DHSDPSINTVLTAPTDKP
Sbjct: 254  FGGELFDAVQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLFDHSDPSINTVLTAPTDKP 313

Query: 1101 GVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGGASLHSCM 1280
            GVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLI+G YEAKADGFLPGGASLH+CM
Sbjct: 314  GVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIHGGYEAKADGFLPGGASLHNCM 373

Query: 1281 TPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDHDYYQCWI 1460
            TPHGPDTKSYEATI+ GN+ GP KIT TMAFMFES LIPR+  WALES F+D DYYQCWI
Sbjct: 374  TPHGPDTKSYEATIARGNDGGPCKITDTMAFMFESSLIPRISQWALESPFLDQDYYQCWI 433

Query: 1461 GLKSHFTHETTNEETGDVHN 1520
            GLKSHFT   T+ E  ++ N
Sbjct: 434  GLKSHFTVTETSPENTNLRN 453


>ref|XP_004510037.1| PREDICTED: homogentisate 1,2-dioxygenase-like [Cicer arietinum]
          Length = 455

 Score =  778 bits (2008), Expect = 0.0
 Identities = 361/448 (80%), Positives = 400/448 (89%)
 Frame = +3

Query: 153  MENNKIKLSPENLDYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPR 332
            MEN    ++ ++ +Y SGFGN+FSSEAIAGALP  QNSPL+CP+GLYAEQISGTSFT+PR
Sbjct: 1    MEN---PIAADDFNYLSGFGNHFSSEAIAGALPVGQNSPLICPFGLYAEQISGTSFTTPR 57

Query: 333  KLNQRSWLYRIKPSVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIPKQPTDF 512
             LN  SWLYRIKPSVTHEPFK R+P +GK++SEFN SNS+A PTQLRWKP +IP  PTDF
Sbjct: 58   TLNLFSWLYRIKPSVTHEPFKARVPSNGKILSEFNDSNSSANPTQLRWKPEDIPDSPTDF 117

Query: 513  IDGLYTVCGAGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKL 692
            IDGL TVCG+GSS++RHG+AIHMYTANKSMDNCAFCNADGDFLIVP+ GRL +TTE G+L
Sbjct: 118  IDGLSTVCGSGSSFMRHGYAIHMYTANKSMDNCAFCNADGDFLIVPQQGRLLITTECGRL 177

Query: 693  QVVPGEIVVIPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPV 872
            +V PG+I +IPQGFRF ++LPDGPSRGYVAEIFG HFQLPDLGPIGANGLA+PRDFLVP 
Sbjct: 178  KVSPGDIAIIPQGFRFNVNLPDGPSRGYVAEIFGTHFQLPDLGPIGANGLAAPRDFLVPT 237

Query: 873  AWFEQHSNPGYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDH 1052
            AWFE  S PGYTIVQKFGGELFTA QDFSPFNVVAWHGNYVPYKYDLSKFCPYNT L DH
Sbjct: 238  AWFEDKSYPGYTIVQKFGGELFTAVQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTALYDH 297

Query: 1053 SDPSINTVLTAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEA 1232
            SDPSINTVLTAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLI+G+YEA
Sbjct: 298  SDPSINTVLTAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIHGNYEA 357

Query: 1233 KADGFLPGGASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHW 1412
            K DGFLPGGASLH+CMTPHGPDTKSYEATI+ G+  GP KIT T+AFMFES LIPR+   
Sbjct: 358  KVDGFLPGGASLHNCMTPHGPDTKSYEATIARGDNVGPHKITDTLAFMFESSLIPRISRS 417

Query: 1413 ALESSFMDHDYYQCWIGLKSHFTHETTN 1496
            ALES F+DHDYYQCWIGL+SHFT   T+
Sbjct: 418  ALESPFLDHDYYQCWIGLRSHFTVSETS 445


>ref|XP_002298900.1| hypothetical protein POPTR_0001s38310g [Populus trichocarpa]
            gi|222846158|gb|EEE83705.1| hypothetical protein
            POPTR_0001s38310g [Populus trichocarpa]
          Length = 464

 Score =  775 bits (2001), Expect = 0.0
 Identities = 362/445 (81%), Positives = 393/445 (88%), Gaps = 4/445 (0%)
 Frame = +3

Query: 192  DYQSGFGNNFSSEAIAGALPKAQNSPLVCPYGLYAEQISGTSFTSPRKLNQRSWLYRIKP 371
            DY SGFGN F SE+I G+LP+ QNSPL+CPYGLYAEQISGTSFTSP KLNQRSWLYRIKP
Sbjct: 20   DYLSGFGNTFESESIPGSLPRRQNSPLLCPYGLYAEQISGTSFTSPHKLNQRSWLYRIKP 79

Query: 372  SVTHEPFKPRIPCHGKLVSEFNQSNSAATPTQLRWKPVEIP----KQPTDFIDGLYTVCG 539
            SVTHEPF+ R P H KLVSEF++SNS  TPTQLRWKP  +       P DF++GLYTVCG
Sbjct: 80   SVTHEPFQARFPRHDKLVSEFDKSNSYTTPTQLRWKPKPVDTVEESAPIDFVEGLYTVCG 139

Query: 540  AGSSYLRHGFAIHMYTANKSMDNCAFCNADGDFLIVPENGRLWVTTEFGKLQVVPGEIVV 719
            AGSS+LRHGFAIHMYTANKSMD+ AFCNADGDFLIVP+ GRLW+ TE GKLQV PGEIVV
Sbjct: 140  AGSSFLRHGFAIHMYTANKSMDDRAFCNADGDFLIVPQKGRLWIATECGKLQVSPGEIVV 199

Query: 720  IPQGFRFAIDLPDGPSRGYVAEIFGMHFQLPDLGPIGANGLASPRDFLVPVAWFEQHSNP 899
            IPQGFRFA+DLPDGPSRGYV+EIFG HFQLPDLGPIGANGLA+PRDFLVP AWFE  S P
Sbjct: 200  IPQGFRFAVDLPDGPSRGYVSEIFGTHFQLPDLGPIGANGLAAPRDFLVPKAWFEDGSRP 259

Query: 900  GYTIVQKFGGELFTAKQDFSPFNVVAWHGNYVPYKYDLSKFCPYNTVLIDHSDPSINTVL 1079
            GYT+VQK+GGELF AKQDFSPFNVVAWHGNYVPYKYDL+KFCPYNTVL DHSDPSINTVL
Sbjct: 260  GYTVVQKYGGELFVAKQDFSPFNVVAWHGNYVPYKYDLNKFCPYNTVLFDHSDPSINTVL 319

Query: 1080 TAPTDKPGVALLDFVIFPPRWVVAEHTFRPPYYHRNCMSEFMGLIYGSYEAKADGFLPGG 1259
            TAPTDKPGVALLDFVIFPPRW+VAEHTFRPPYYHRNCMSEFMGLIYG YEAKADGFLPGG
Sbjct: 320  TAPTDKPGVALLDFVIFPPRWLVAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFLPGG 379

Query: 1260 ASLHSCMTPHGPDTKSYEATISLGNEAGPQKITGTMAFMFESCLIPRVCHWALESSFMDH 1439
            ASLHSCMTPHGPDTK+YEATI  G++AGP KIT T+AFMFESCLIPR+   AL+S  MD+
Sbjct: 380  ASLHSCMTPHGPDTKTYEATIESGHDAGPSKITNTLAFMFESCLIPRISLCALKSPLMDN 439

Query: 1440 DYYQCWIGLKSHFTHETTNEETGDV 1514
            DYYQCW GLKSHF+ E  + +   V
Sbjct: 440  DYYQCWTGLKSHFSGEGADSKGNGV 464


Top