BLASTX nr result

ID: Catharanthus22_contig00007320 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00007320
         (1373 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004232237.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   356   1e-95
ref|XP_006357120.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   355   3e-95
ref|XP_006338477.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   353   9e-95
ref|XP_004233351.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   350   6e-94
gb|EMJ16977.1| hypothetical protein PRUPE_ppa009667mg [Prunus pe...   334   5e-89
gb|EOY03391.1| Uncharacterized protein isoform 1 [Theobroma cacao]    331   5e-88
gb|EOY03392.1| Uncharacterized protein isoform 2 [Theobroma cacao]    329   1e-87
gb|ACV71019.1| UPA19 [Capsicum annuum]                                325   4e-86
ref|XP_004147465.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   323   1e-85
ref|XP_002282633.1| PREDICTED: 2-aminoethanethiol dioxygenase is...   323   1e-85
gb|EOY03393.1| Uncharacterized protein isoform 3 [Theobroma cacao]    322   2e-85
ref|XP_002282644.1| PREDICTED: 2-aminoethanethiol dioxygenase is...   318   3e-84
ref|XP_003554358.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   318   4e-84
ref|XP_002517479.1| Protein C10orf22, putative [Ricinus communis...   318   4e-84
ref|XP_003543744.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   315   4e-83
ref|XP_002305324.2| hypothetical protein POPTR_0004s13450g [Popu...   312   2e-82
ref|NP_198805.1| uncharacterized protein [Arabidopsis thaliana] ...   310   1e-81
dbj|BAB10214.1| unnamed protein product [Arabidopsis thaliana]        310   1e-81
ref|XP_002323838.1| hypothetical protein POPTR_0017s11560g [Popu...   309   2e-81
ref|XP_006405584.1| hypothetical protein EUTSA_v10027872mg [Eutr...   308   3e-81

>ref|XP_004232237.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Solanum
           lycopersicum]
          Length = 269

 Score =  356 bits (914), Expect = 1e-95
 Identities = 172/274 (62%), Positives = 199/274 (72%), Gaps = 7/274 (2%)
 Frame = -3

Query: 921 MGIDQNVSKPKGKVYXXXXXXXXXXXXXXXXXRLYETCKEVFADCGPGVVPSPDKVELLK 742
           M I++NVS+P+G+ Y                 RLYETCKE FA+CGPGVVPS +K+E LK
Sbjct: 1   MRIEKNVSEPRGREYSDSKKNRRRQRMISPVQRLYETCKETFANCGPGVVPSAEKIERLK 60

Query: 741 AALDNMSEVDVGLNPNMPFFMADATDRPPRITYIHIYDCDKFSIGIFCLPPSGVIPLHNH 562
             LD M+  DVGL PNMP+F +   DRPP ITY+H+++CDKFSIGIFCLPPS VIPLH+H
Sbjct: 61  EVLDTMAGADVGLRPNMPYFKSTRYDRPPTITYLHLHECDKFSIGIFCLPPSAVIPLHDH 120

Query: 561 PQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ-------GGDMGGLRLAQVKVNSEFTA 403
           P MTVFSKLLFG MHIKSYDWVDN  A  +          G    G+RLA+VK+NS F A
Sbjct: 121 PGMTVFSKLLFGEMHIKSYDWVDNLPADPTPVAKPLDNGLGESTTGIRLAKVKINSAFRA 180

Query: 402 PCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGRHCQYYLDFPLDSYSVEGVDE 223
           PC TSILYPADGGNMHCF A T CAVLDVLGPPYCD EGRHCQYY DFP  S SV     
Sbjct: 181 PCKTSILYPADGGNMHCFKAKTACAVLDVLGPPYCDPEGRHCQYYCDFPFSSISV----- 235

Query: 222 AMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
             EE+K  YAWL+ERE+P+D+T+VGALY GPK+V
Sbjct: 236 -TEEQKGGYAWLKEREKPDDLTLVGALYKGPKMV 268


>ref|XP_006357120.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Solanum tuberosum]
           gi|565404275|ref|XP_006367569.1| PREDICTED:
           2-aminoethanethiol dioxygenase-like [Solanum tuberosum]
          Length = 270

 Score =  355 bits (910), Expect = 3e-95
 Identities = 163/238 (68%), Positives = 191/238 (80%), Gaps = 4/238 (1%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           LY TCK+VFA+C PGVVPSP+ VEL++A LD M+E DVGL PNMP+F +  +DRPP+ITY
Sbjct: 34  LYRTCKQVFANCKPGVVPSPENVELVRAVLDKMTEADVGLRPNMPYFKSKVSDRPPKITY 93

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
           +H+++CDKFSIGIFCLPP  VIPLHNHP MTVFSKLLFG MHIKSYDW DN    ++   
Sbjct: 94  LHLHECDKFSIGIFCLPPKAVIPLHNHPGMTVFSKLLFGKMHIKSYDWADNLLPESTTPN 153

Query: 462 GG----DMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCD 295
                 D  GLRLA++KVNS+F APC TSILYPADGGNMHCFTA T CAVLDVLGPPYCD
Sbjct: 154 ANISDRDCTGLRLAKLKVNSKFKAPCKTSILYPADGGNMHCFTAKTACAVLDVLGPPYCD 213

Query: 294 SEGRHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
            EGRHCQYY DFP  + SV G+    EE++S YAWL+ERE+PED+TV GALY+GP +V
Sbjct: 214 PEGRHCQYYCDFPFANISVNGL-SVPEEQQSEYAWLKEREKPEDLTVAGALYSGPNLV 270


>ref|XP_006338477.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Solanum tuberosum]
          Length = 269

 Score =  353 bits (906), Expect = 9e-95
 Identities = 172/274 (62%), Positives = 198/274 (72%), Gaps = 7/274 (2%)
 Frame = -3

Query: 921 MGIDQNVSKPKGKVYXXXXXXXXXXXXXXXXXRLYETCKEVFADCGPGVVPSPDKVELLK 742
           M ID+NV + +G+ Y                 RLYETCKE FA+CGPGVVPS +K+E LK
Sbjct: 1   MRIDKNVCERRGREYSDSKKNRRRQRMISPVQRLYETCKETFANCGPGVVPSAEKIERLK 60

Query: 741 AALDNMSEVDVGLNPNMPFFMADATDRPPRITYIHIYDCDKFSIGIFCLPPSGVIPLHNH 562
             LD M+  DVGL PNMP+F +   DRPP ITY+H+++CDKFSIGIFCLPPS VIPLH+H
Sbjct: 61  EVLDTMAGADVGLRPNMPYFKSIRYDRPPTITYLHLHECDKFSIGIFCLPPSAVIPLHDH 120

Query: 561 PQMTVFSKLLFGTMHIKSYDWVDNAHASASA-------EQGGDMGGLRLAQVKVNSEFTA 403
           P MTVFSKLLFG MHIKSYDWVDN  A  +          G    G+RLA+VK+NS F A
Sbjct: 121 PGMTVFSKLLFGEMHIKSYDWVDNLPAEPTPLAKPLDNGLGDSTTGIRLAKVKMNSAFRA 180

Query: 402 PCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGRHCQYYLDFPLDSYSVEGVDE 223
           PC TSILYPADGGNMHCFTA T CAVLDVLGPPYCD EGRHCQYY DFP  + SV     
Sbjct: 181 PCKTSILYPADGGNMHCFTAKTACAVLDVLGPPYCDPEGRHCQYYYDFPFSNISVP---- 236

Query: 222 AMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
             EE+K  YAWL+ERE+P+D+TVVGALY GPK+V
Sbjct: 237 --EEQKGDYAWLKEREKPDDLTVVGALYKGPKMV 268


>ref|XP_004233351.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Solanum
           lycopersicum]
          Length = 263

 Score =  350 bits (899), Expect = 6e-94
 Identities = 161/234 (68%), Positives = 189/234 (80%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           LY TCK+VF +C PGVVPSP+ VEL+K+ LD M+E DVGL PNMP+F +  +D+PP+ITY
Sbjct: 33  LYRTCKQVFTNCKPGVVPSPENVELVKSVLDKMTEADVGLRPNMPYFKSTVSDKPPKITY 92

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
           +H+++CDKFSIGIFCLPP  VIPLHNHP MTVFSKLLFG MHIKSYDW DN    ++   
Sbjct: 93  LHLHECDKFSIGIFCLPPKAVIPLHNHPGMTVFSKLLFGKMHIKSYDWADNLLPKSTTP- 151

Query: 462 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 283
            GD  GLRLA++KVNS+F APC TSILYPADGGNMHCFTA T CAVLDVLGPPYCD +GR
Sbjct: 152 -GDCTGLRLAKLKVNSKFKAPCKTSILYPADGGNMHCFTAKTACAVLDVLGPPYCDPDGR 210

Query: 282 HCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
           HCQYY DFP  + SV       EEE+S YAWL+ERE+PED+TV GALY+GP +V
Sbjct: 211 HCQYYYDFPFANMSVNDF-LVPEEEQSEYAWLKEREKPEDLTVAGALYSGPNLV 263


>gb|EMJ16977.1| hypothetical protein PRUPE_ppa009667mg [Prunus persica]
          Length = 282

 Score =  334 bits (857), Expect = 5e-89
 Identities = 155/237 (65%), Positives = 185/237 (78%), Gaps = 3/237 (1%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           LY+TCK+VF+ CG G+VPSP+ ++ L++ LD M   DVGL P +P+F      R P ITY
Sbjct: 45  LYQTCKDVFSFCGAGIVPSPEDIQRLRSVLDTMKPADVGLTPELPYFRMTVARRTPAITY 104

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
           +H+++C+KFS+GIFCLPPSGV+PLHNHP MTVFSKLLFGTMHIKSYDWV +A    S   
Sbjct: 105 LHLHECEKFSMGIFCLPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVADATEDKSTSA 164

Query: 462 GGDMG---GLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDS 292
                   G+RLA+VKV+++FTAPCNTSILYPADGGNMHCFTA+T CAVLDVLGPPY D 
Sbjct: 165 NPSPATPPGVRLAKVKVDADFTAPCNTSILYPADGGNMHCFTAVTACAVLDVLGPPYSDP 224

Query: 291 EGRHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
           +GRHCQYYLDFP   +SV+GV  A EEEK  YAWLQE E+PED+ V GA Y GPKIV
Sbjct: 225 DGRHCQYYLDFPFSHFSVDGVSVA-EEEKEGYAWLQEIEKPEDLAVDGAKYRGPKIV 280


>gb|EOY03391.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 310

 Score =  331 bits (848), Expect = 5e-88
 Identities = 159/238 (66%), Positives = 183/238 (76%), Gaps = 4/238 (1%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           L++TCK+VFA  G G+VP+PDK+E L+A LD +   DVGL P MPFF    T R P ITY
Sbjct: 72  LFDTCKDVFALAGTGIVPTPDKIEQLRAVLDQIQPADVGLTPQMPFFSLPVTRRAPPITY 131

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWV----DNAHASA 475
            HI++C+KFS+GIFCLPPSGV+PLHNHP MTVFSKLLFGTMHIKSYDWV     NA A  
Sbjct: 132 QHIHECEKFSMGIFCLPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPSNASAVV 191

Query: 474 SAEQGGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCD 295
           +  Q      +RLA+VKV+S+FTAPC+ SILYPADGGNMHCFTA+T CAVLDVLGPPY D
Sbjct: 192 APSQTVQHREVRLAKVKVDSDFTAPCSASILYPADGGNMHCFTAVTACAVLDVLGPPYSD 251

Query: 294 SEGRHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
            EGRHC YY D+P    SV+GV  A EEEK  YAWLQERE PED+ VVGA Y GP+IV
Sbjct: 252 PEGRHCTYYFDYPFTKLSVDGVTVA-EEEKDKYAWLQEREEPEDLAVVGAPYTGPEIV 308


>gb|EOY03392.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 309

 Score =  329 bits (844), Expect = 1e-87
 Identities = 158/237 (66%), Positives = 184/237 (77%), Gaps = 3/237 (1%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           L++TCK+VFA  G G+VP+PDK+E L+A LD +   DVGL P MPFF    T R P ITY
Sbjct: 72  LFDTCKDVFALAGTGIVPTPDKIEQLRAVLDQIQPADVGLTPQMPFFSLPVTRRAPPITY 131

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
            HI++C+KFS+GIFCLPPSGV+PLHNHP MTVFSKLLFGTMHIKSYDWV +  ++ASA  
Sbjct: 132 QHIHECEKFSMGIFCLPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPSNASAVV 191

Query: 462 GGDM---GGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDS 292
                    +RLA+VKV+S+FTAPC+ SILYPADGGNMHCFTA+T CAVLDVLGPPY D 
Sbjct: 192 APSQMQHREVRLAKVKVDSDFTAPCSASILYPADGGNMHCFTAVTACAVLDVLGPPYSDP 251

Query: 291 EGRHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
           EGRHC YY D+P    SV+GV  A EEEK  YAWLQERE PED+ VVGA Y GP+IV
Sbjct: 252 EGRHCTYYFDYPFTKLSVDGVTVA-EEEKDKYAWLQEREEPEDLAVVGAPYTGPEIV 307


>gb|ACV71019.1| UPA19 [Capsicum annuum]
          Length = 276

 Score =  325 bits (832), Expect = 4e-86
 Identities = 158/242 (65%), Positives = 187/242 (77%), Gaps = 9/242 (3%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMAD-ATDRPPRIT 646
           LY+TCK VFA+C PGVVPS + VE +KA LD M+  DVGL  NMP+F +  ++DRPP+IT
Sbjct: 35  LYKTCKLVFANCRPGVVPSMENVERVKAVLDKMTLADVGLRRNMPYFKSTVSSDRPPKIT 94

Query: 645 YIHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASA- 469
           Y+H+++CDKFS+GIFCLPP  VIPLHNHP MTVFSKLLFG MHIKSYDW DN    ++  
Sbjct: 95  YLHLHECDKFSMGIFCLPPKAVIPLHNHPGMTVFSKLLFGKMHIKSYDWADNLLPESTPN 154

Query: 468 ----EQGGDMG--GLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGP 307
               + G   G  G RLA++KVNS+F APC TSILYPADGGNMHCFTA T CAVLDVLGP
Sbjct: 155 ANNFDNGAGYGNTGPRLAKLKVNSKFRAPCKTSILYPADGGNMHCFTAKTACAVLDVLGP 214

Query: 306 PYCDSEGRHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERER-PEDVTVVGALYNGP 130
           PYCD EGRHCQYY DFP    SV+G+    EE++S Y WL ERE+ PED+TV GALY+GP
Sbjct: 215 PYCDPEGRHCQYYYDFPFADLSVDGL-SVPEEQQSEYXWLIEREKLPEDLTVAGALYSGP 273

Query: 129 KI 124
           K+
Sbjct: 274 KL 275


>ref|XP_004147465.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Cucumis sativus]
          Length = 288

 Score =  323 bits (828), Expect = 1e-85
 Identities = 153/240 (63%), Positives = 186/240 (77%), Gaps = 6/240 (2%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           LYETCK+VFA  G G+VPS + +E L+A LD M  VDVGL+P+MP+F   ++ R P ITY
Sbjct: 47  LYETCKKVFASSGTGIVPSSEDIERLQAVLDKMKPVDVGLSPDMPYFWTTSSQRTPPITY 106

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDW-----VDNAHAS 478
           +H+Y+ +KFS+GIFCLPPSGVIPLHNHP MTVFSKLLFGTMHIK+YDW     V+ A A 
Sbjct: 107 LHLYENNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEAGAVNGASAC 166

Query: 477 ASAEQG-GDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPY 301
                G      +RLA+VKV+++FTAPC++SILYPADGGNMHCFTA+T CAVLDVLGPPY
Sbjct: 167 VDTSSGTAPSRSVRLAKVKVDADFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPY 226

Query: 300 CDSEGRHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
            D +GRHC YYLDFP   +SV+ +    E E+ SYAWL+ERE+PED+  VGALY GPKIV
Sbjct: 227 SDPDGRHCSYYLDFPFTEFSVDRI-SVPEAERESYAWLEEREQPEDLAAVGALYEGPKIV 285


>ref|XP_002282633.1| PREDICTED: 2-aminoethanethiol dioxygenase isoform 2 [Vitis
           vinifera] gi|296082863|emb|CBI22164.3| unnamed protein
           product [Vitis vinifera]
          Length = 279

 Score =  323 bits (828), Expect = 1e-85
 Identities = 154/237 (64%), Positives = 182/237 (76%), Gaps = 3/237 (1%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           LYETCKEVF+ CG G+VP P  VE L + L++M   DVGLNP M  F  +A D  P+ITY
Sbjct: 42  LYETCKEVFSSCGAGIVPPPGDVEKLASVLNSMKLEDVGLNPEMSCFRTEAPDEAPKITY 101

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
           +H+Y+C+KFSIGIFCLPPSGVIPLHNHP MTVFSKLLFG+MHIKSYDW   +  + SA  
Sbjct: 102 LHLYECEKFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGSMHIKSYDWAVGSPCNPSANA 161

Query: 462 GGDM---GGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDS 292
                   G++LA+VKV+++FTAPCN+SILYPADGGNMH FTA+T CAVLDVLGPPY D 
Sbjct: 162 NPSQIQHPGVQLAKVKVDADFTAPCNSSILYPADGGNMHRFTALTACAVLDVLGPPYSDP 221

Query: 291 EGRHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
           EGR C YY DFP  ++SV+GV    EEE+  YAWLQERE+ ED  VVGA+YNGP IV
Sbjct: 222 EGRDCTYYFDFPFTNFSVDGV-SVPEEEREGYAWLQEREKLEDFAVVGAVYNGPMIV 277


>gb|EOY03393.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 304

 Score =  322 bits (825), Expect = 2e-85
 Identities = 155/237 (65%), Positives = 180/237 (75%), Gaps = 3/237 (1%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           L++TCK+VFA  G G+VP+PDK+E L+A LD +   DVGL P MPFF    T R P ITY
Sbjct: 72  LFDTCKDVFALAGTGIVPTPDKIEQLRAVLDQIQPADVGLTPQMPFFSLPVTRRAPPITY 131

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
            HI++C+KFS+GIFCLPPSGV+PLHNHP MTVFSKLLFGTMHIKSYDWV +  ++ASA  
Sbjct: 132 QHIHECEKFSMGIFCLPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPSNASAVV 191

Query: 462 GGDM---GGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDS 292
                    +RLA+VKV+S+FTAPC+ SILYPADGGNMHCFTA+T CAVLDVLGPPY D 
Sbjct: 192 APSQMQHREVRLAKVKVDSDFTAPCSASILYPADGGNMHCFTAVTACAVLDVLGPPYSDP 251

Query: 291 EGRHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
           EGRHC YY D+P    SV       EEEK  YAWLQERE PED+ VVGA Y GP+IV
Sbjct: 252 EGRHCTYYFDYPFTKLSV------AEEEKDKYAWLQEREEPEDLAVVGAPYTGPEIV 302


>ref|XP_002282644.1| PREDICTED: 2-aminoethanethiol dioxygenase isoform 3 [Vitis
           vinifera]
          Length = 268

 Score =  318 bits (816), Expect = 3e-84
 Identities = 155/234 (66%), Positives = 181/234 (77%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           LYETCKEVF+ CG G+VP P  VE L + L++M   DVGLNP M  F  +A D  P+ITY
Sbjct: 42  LYETCKEVFSSCGAGIVPPPGDVEKLASVLNSMKLEDVGLNPEMSCFRTEAPDEAPKITY 101

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
           +H+Y+C+KFSIGIFCLPPSGVIPLHNHP MTVFSKLLFG+MHIKSYDW     A  S  Q
Sbjct: 102 LHLYECEKFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGSMHIKSYDW-----AVGSPFQ 156

Query: 462 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 283
                G++LA+VKV+++FTAPCN+SILYPADGGNMH FTA+T CAVLDVLGPPY D EGR
Sbjct: 157 ---HPGVQLAKVKVDADFTAPCNSSILYPADGGNMHRFTALTACAVLDVLGPPYSDPEGR 213

Query: 282 HCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
            C YY DFP  ++SV+GV    EEE+  YAWLQERE+ ED  VVGA+YNGP IV
Sbjct: 214 DCTYYFDFPFTNFSVDGV-SVPEEEREGYAWLQEREKLEDFAVVGAVYNGPMIV 266


>ref|XP_003554358.1| PREDICTED: 2-aminoethanethiol dioxygenase-like isoform 1 [Glycine
           max]
          Length = 281

 Score =  318 bits (814), Expect = 4e-84
 Identities = 152/240 (63%), Positives = 185/240 (77%), Gaps = 6/240 (2%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           L+ETCK VFA  G G VP  + ++ L++ LD +   DVGL P+MP+F   AT R PRITY
Sbjct: 44  LFETCKVVFASAGTGFVPPHEDIDELQSVLDGIKPEDVGLRPDMPYFRTSATQRVPRITY 103

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASA---- 475
           +HIY+C+KFS+GIFCLPPSGVIPLHNHP MTVFSKLLFGTMHIKSYDWV ++   +    
Sbjct: 104 LHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDSPPESPTTL 163

Query: 474 --SAEQGGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPY 301
             S  QG +M   RLA+VKV+++FTAPCN SILYP DGGN+HCFTA+T CAVLDVLGPPY
Sbjct: 164 KPSENQGPEM---RLAKVKVDADFTAPCNPSILYPEDGGNLHCFTAVTACAVLDVLGPPY 220

Query: 300 CDSEGRHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
            D+EGRHC YY DFP  ++SV+G+    EEEK++Y WLQER+  ED+ V G +YNGPKIV
Sbjct: 221 SDAEGRHCTYYHDFPFSNFSVDGL-SIPEEEKNAYEWLQERDELEDLEVNGKMYNGPKIV 279


>ref|XP_002517479.1| Protein C10orf22, putative [Ricinus communis]
           gi|223543490|gb|EEF45021.1| Protein C10orf22, putative
           [Ricinus communis]
          Length = 288

 Score =  318 bits (814), Expect = 4e-84
 Identities = 153/237 (64%), Positives = 182/237 (76%), Gaps = 1/237 (0%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           LY+TCK+VF+  GPGVVP+PDK+E L+A LD ++  DVGL+P MP+F      R P I Y
Sbjct: 55  LYDTCKDVFSIGGPGVVPAPDKIEKLRAVLDVITPEDVGLHPEMPYFRLPVAGRAPPIRY 114

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
           +HI++C+KFSIGIFC PPSGVIPLHNHP MTVFSKLLFG MHIKSYDWVD    + SA  
Sbjct: 115 LHIHECNKFSIGIFCFPPSGVIPLHNHPGMTVFSKLLFGKMHIKSYDWVDEDSVNGSAVV 174

Query: 462 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 283
             +   +RLA+VK++S+FTAPCN  ILYP DGGNMHCFTA T CAVLDVLGPPY D EGR
Sbjct: 175 --NPSEVRLAKVKIDSDFTAPCNPCILYPVDGGNMHCFTAATACAVLDVLGPPYSDPEGR 232

Query: 282 HCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQER-ERPEDVTVVGALYNGPKIVIK 115
           HC YY DFP  ++SV+GV    EEE+  YAWLQER ++P+D  +VG LY GPKIV K
Sbjct: 233 HCTYYNDFPFANFSVDGV-SLPEEEREGYAWLQERTKQPDDFKMVGELYRGPKIVKK 288


>ref|XP_003543744.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Glycine max]
          Length = 281

 Score =  315 bits (806), Expect = 4e-83
 Identities = 151/240 (62%), Positives = 183/240 (76%), Gaps = 6/240 (2%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           L+ETCK VFA  G G VP  + ++ L++ LD +   DVGL P+MP+F   AT R PRITY
Sbjct: 44  LFETCKVVFASAGTGFVPPHEDIDELQSVLDGIKPEDVGLRPDMPYFRTSATQRVPRITY 103

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASA---- 475
           +HIY+C+KFS+GIFCLPPSGVIPLHNHP MTVFSKLLFGTMHIKSYDWV +    +    
Sbjct: 104 LHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDLPPESPTTI 163

Query: 474 --SAEQGGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPY 301
             S  QG +M   RLA+VKV+++FTAPCN SILYP DGGN+HCFTA+T CAVLDVLGPPY
Sbjct: 164 KPSENQGPEM---RLAKVKVDADFTAPCNPSILYPEDGGNLHCFTAVTACAVLDVLGPPY 220

Query: 300 CDSEGRHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
            D+EGRHC YY +FP  ++S +G+    EEEK++Y WLQERE  ED+ V G +YNGPKIV
Sbjct: 221 SDAEGRHCTYYHNFPFSNFSADGL-SIPEEEKNAYEWLQEREELEDLEVNGKMYNGPKIV 279


>ref|XP_002305324.2| hypothetical protein POPTR_0004s13450g [Populus trichocarpa]
           gi|550340933|gb|EEE85835.2| hypothetical protein
           POPTR_0004s13450g [Populus trichocarpa]
          Length = 287

 Score =  312 bits (799), Expect = 2e-82
 Identities = 151/236 (63%), Positives = 173/236 (73%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           L+ TC EVF  C  G++PS D ++ LKA LDN    DVGL P MP F A    R P I Y
Sbjct: 60  LFNTCNEVFDSCSTGIIPSSDNIQKLKAVLDNFKPADVGLFPEMPHFQASVAGRTPVIRY 119

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
           +H+++CDKFS+GIFCLPPSGVIPLHNHP MTVFSKLLFGTMHIKSYDWV +  AS S + 
Sbjct: 120 LHLHECDKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVADVPASKSKQT 179

Query: 462 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 283
                 +RLA+VKVNS+ TAPCNTSILYP DGGNMHCFTA+T CAVLDVLGPPY   +GR
Sbjct: 180 -----EVRLAKVKVNSKLTAPCNTSILYPTDGGNMHCFTAVTACAVLDVLGPPYSAPDGR 234

Query: 282 HCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIVIK 115
           HCQYYLDFP  ++S   V      +K  +AWLQERE PED+T VG LY GP IV K
Sbjct: 235 HCQYYLDFPFANFSGTMVH---LHKKEGHAWLQERETPEDLTFVGELYGGPVIVEK 287


>ref|NP_198805.1| uncharacterized protein [Arabidopsis thaliana]
           gi|21536502|gb|AAM60834.1| unknown [Arabidopsis
           thaliana] gi|27808558|gb|AAO24559.1| At5g39890
           [Arabidopsis thaliana] gi|110736241|dbj|BAF00091.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332007105|gb|AED94488.1| uncharacterized protein
           AT5G39890 [Arabidopsis thaliana]
          Length = 276

 Score =  310 bits (793), Expect = 1e-81
 Identities = 148/234 (63%), Positives = 179/234 (76%), Gaps = 1/234 (0%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           L++TCK+VFAD   G VPS + +E+L+A LD +   DVG+NP M +F +  T R P +TY
Sbjct: 50  LFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYFRSTVTGRSPLVTY 109

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
           +HIY C +FSI IFCLPPSGVIPLHNHP+MTVFSKLLFGTMHIKSYDWV ++   +S   
Sbjct: 110 LHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDWVPDSPQPSS--- 166

Query: 462 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 283
                  RLA+VKV+S+FTAPC+TSILYPADGGNMHCFTA T CAVLDV+GPPY D  GR
Sbjct: 167 -----DTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPPYSDPAGR 221

Query: 282 HCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQER-ERPEDVTVVGALYNGPKI 124
           HC YY D+P  S+SV+GV  A EEEK  YAWL+ER E+PED+TV   +Y+GP I
Sbjct: 222 HCTYYFDYPFSSFSVDGVVVA-EEEKEGYAWLKEREEKPEDLTVTALMYSGPTI 274


>dbj|BAB10214.1| unnamed protein product [Arabidopsis thaliana]
          Length = 270

 Score =  310 bits (793), Expect = 1e-81
 Identities = 148/234 (63%), Positives = 179/234 (76%), Gaps = 1/234 (0%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           L++TCK+VFAD   G VPS + +E+L+A LD +   DVG+NP M +F +  T R P +TY
Sbjct: 44  LFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYFRSTVTGRSPLVTY 103

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
           +HIY C +FSI IFCLPPSGVIPLHNHP+MTVFSKLLFGTMHIKSYDWV ++   +S   
Sbjct: 104 LHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDWVPDSPQPSS--- 160

Query: 462 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 283
                  RLA+VKV+S+FTAPC+TSILYPADGGNMHCFTA T CAVLDV+GPPY D  GR
Sbjct: 161 -----DTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPPYSDPAGR 215

Query: 282 HCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQER-ERPEDVTVVGALYNGPKI 124
           HC YY D+P  S+SV+GV  A EEEK  YAWL+ER E+PED+TV   +Y+GP I
Sbjct: 216 HCTYYFDYPFSSFSVDGVVVA-EEEKEGYAWLKEREEKPEDLTVTALMYSGPTI 268


>ref|XP_002323838.1| hypothetical protein POPTR_0017s11560g [Populus trichocarpa]
           gi|222866840|gb|EEF03971.1| hypothetical protein
           POPTR_0017s11560g [Populus trichocarpa]
          Length = 241

 Score =  309 bits (792), Expect = 2e-81
 Identities = 150/235 (63%), Positives = 172/235 (73%), Gaps = 1/235 (0%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 643
           LY TC EVF  C  G++PSPD ++ LKA LD+    DVGL+P MP F A      P I Y
Sbjct: 10  LYNTCNEVFDSCSAGIIPSPDNIQKLKAVLDDFKPADVGLSPEMPHFRASVAGETPVIRY 69

Query: 642 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 463
           I+I++C+KFSIGIFCLPPS  IPLHNHP MTVFSKLLFGTMHIKSYDWV +   S SA Q
Sbjct: 70  IYIHECEKFSIGIFCLPPSSAIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPPSTSAVQ 129

Query: 462 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPY-CDSEG 286
                  RLA+VKVNS FTAPCNTSILYP DGGNMHCFTA+T CAVLDVLGPPY  DS+G
Sbjct: 130 ----PEARLAEVKVNSNFTAPCNTSILYPTDGGNMHCFTAVTACAVLDVLGPPYGSDSDG 185

Query: 285 RHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 121
           RHCQ+Y DFP  + SV+G+    E  K  +AWLQER++PED+ VVG LY  P  V
Sbjct: 186 RHCQFYFDFPFSNISVDGL-SLPEGGKEGFAWLQERKKPEDLIVVGELYGDPTTV 239


>ref|XP_006405584.1| hypothetical protein EUTSA_v10027872mg [Eutrema salsugineum]
           gi|557106722|gb|ESQ47037.1| hypothetical protein
           EUTSA_v10027872mg [Eutrema salsugineum]
          Length = 290

 Score =  308 bits (789), Expect = 3e-81
 Identities = 149/235 (63%), Positives = 180/235 (76%), Gaps = 2/235 (0%)
 Frame = -3

Query: 822 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADAT-DRPPRIT 646
           L+ETC +VFAD   G+VPS + +E+L+A LD +   DV ++PNMP+F + +  DR P +T
Sbjct: 63  LFETCNKVFADGKSGIVPSQENIEMLRAVLDKIKPEDVDVSPNMPYFRSKSVGDRSPIVT 122

Query: 645 YIHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAE 466
           Y+HIY C KFS+GIFCLPPSGVIPLHNHP+MTVFSKLLFGTMHIKSYDWV ++   +S  
Sbjct: 123 YLHIYKCHKFSMGIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDWVADSPQPSS-- 180

Query: 465 QGGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEG 286
                   RLA+VKV+SEFTAPC+TSILYPADGGNMHCFTA T CAVLDVLGPPY D  G
Sbjct: 181 ------DTRLAKVKVDSEFTAPCDTSILYPADGGNMHCFTAKTACAVLDVLGPPYSDPAG 234

Query: 285 RHCQYYLDFPLDSYSVEGVDEAMEEEKSSYAWLQER-ERPEDVTVVGALYNGPKI 124
           RHC YY D+P   +SV+GV  A EEEK  Y WL+ER E PED+TV+  +Y+GP I
Sbjct: 235 RHCTYYFDYPFSRFSVDGVAVA-EEEKERYEWLKEREEEPEDLTVMAMMYSGPTI 288

