BLASTX nr result
ID: Catharanthus23_contig00017019
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00017019 (1320 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006357120.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 357 8e-96 ref|XP_004232237.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 356 1e-95 ref|XP_006338477.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 353 9e-95 ref|XP_004233351.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 352 2e-94 gb|EMJ16977.1| hypothetical protein PRUPE_ppa009667mg [Prunus pe... 337 9e-90 gb|EOY03391.1| Uncharacterized protein isoform 1 [Theobroma cacao] 333 1e-88 gb|EOY03392.1| Uncharacterized protein isoform 2 [Theobroma cacao] 332 3e-88 gb|ACV71019.1| UPA19 [Capsicum annuum] 327 9e-87 ref|XP_004147465.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 325 3e-86 ref|XP_002282633.1| PREDICTED: 2-aminoethanethiol dioxygenase is... 325 3e-86 gb|EOY03393.1| Uncharacterized protein isoform 3 [Theobroma cacao] 322 2e-85 ref|XP_002282644.1| PREDICTED: 2-aminoethanethiol dioxygenase is... 320 6e-85 ref|XP_003554358.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 320 1e-84 ref|XP_002517479.1| Protein C10orf22, putative [Ricinus communis... 319 2e-84 ref|XP_003543744.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 317 9e-84 ref|XP_002305324.2| hypothetical protein POPTR_0004s13450g [Popu... 313 8e-83 ref|NP_198805.1| uncharacterized protein [Arabidopsis thaliana] ... 312 2e-82 dbj|BAB10214.1| unnamed protein product [Arabidopsis thaliana] 312 2e-82 ref|XP_006405584.1| hypothetical protein EUTSA_v10027872mg [Eutr... 310 7e-82 ref|XP_002323838.1| hypothetical protein POPTR_0017s11560g [Popu... 310 7e-82 >ref|XP_006357120.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Solanum tuberosum] gi|565404275|ref|XP_006367569.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Solanum tuberosum] Length = 270 Score = 357 bits (915), Expect = 8e-96 Identities = 164/238 (68%), Positives = 192/238 (80%), Gaps = 4/238 (1%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 LY TCK+VFA+C PGVVPSP+ VEL++A LD M+E DVGL PNMP+F + +DRPP+ITY Sbjct: 34 LYRTCKQVFANCKPGVVPSPENVELVRAVLDKMTEADVGLRPNMPYFKSKVSDRPPKITY 93 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 +H+++CDKFSIGIFCLPP VIPLHNHP MTVFSKLLFG MHIKSYDW DN ++ Sbjct: 94 LHLHECDKFSIGIFCLPPKAVIPLHNHPGMTVFSKLLFGKMHIKSYDWADNLLPESTTPN 153 Query: 432 GG----DMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCD 265 D GLRLA++KVNS+F APC TSILYPADGGNMHCFTA T CAVLDVLGPPYCD Sbjct: 154 ANISDRDCTGLRLAKLKVNSKFKAPCKTSILYPADGGNMHCFTAKTACAVLDVLGPPYCD 213 Query: 264 SEGRHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 EGRHCQYY DFP + SV G+ V EE++S YAWL+ERE+PED+TV GALY+GP +V Sbjct: 214 PEGRHCQYYCDFPFANISVNGLSVP-EEQQSEYAWLKEREKPEDLTVAGALYSGPNLV 270 >ref|XP_004232237.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Solanum lycopersicum] Length = 269 Score = 356 bits (914), Expect = 1e-95 Identities = 172/274 (62%), Positives = 199/274 (72%), Gaps = 7/274 (2%) Frame = -1 Query: 891 MGIDQNVSKPKGKVYXXXXXXXXXXXXXXXXXRLYETCKEVFADCGPGVVPSPDKVELLK 712 M I++NVS+P+G+ Y RLYETCKE FA+CGPGVVPS +K+E LK Sbjct: 1 MRIEKNVSEPRGREYSDSKKNRRRQRMISPVQRLYETCKETFANCGPGVVPSAEKIERLK 60 Query: 711 AALDNMSEVDVGLNPNMPFFMADATDRPPRITYIHIYDCDKFSIGIFCLPPSGVIPLHNH 532 LD M+ DVGL PNMP+F + DRPP ITY+H+++CDKFSIGIFCLPPS VIPLH+H Sbjct: 61 EVLDTMAGADVGLRPNMPYFKSTRYDRPPTITYLHLHECDKFSIGIFCLPPSAVIPLHDH 120 Query: 531 PQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ-------GGDMGGLRLAQVKVNSEFTA 373 P MTVFSKLLFG MHIKSYDWVDN A + G G+RLA+VK+NS F A Sbjct: 121 PGMTVFSKLLFGEMHIKSYDWVDNLPADPTPVAKPLDNGLGESTTGIRLAKVKINSAFRA 180 Query: 372 PCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGRHCQYYLDFPLDSYSVEGVDV 193 PC TSILYPADGGNMHCF A T CAVLDVLGPPYCD EGRHCQYY DFP S SV Sbjct: 181 PCKTSILYPADGGNMHCFKAKTACAVLDVLGPPYCDPEGRHCQYYCDFPFSSISV----- 235 Query: 192 AMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 EE+K YAWL+ERE+P+D+T+VGALY GPK+V Sbjct: 236 -TEEQKGGYAWLKEREKPDDLTLVGALYKGPKMV 268 >ref|XP_006338477.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Solanum tuberosum] Length = 269 Score = 353 bits (906), Expect = 9e-95 Identities = 172/274 (62%), Positives = 198/274 (72%), Gaps = 7/274 (2%) Frame = -1 Query: 891 MGIDQNVSKPKGKVYXXXXXXXXXXXXXXXXXRLYETCKEVFADCGPGVVPSPDKVELLK 712 M ID+NV + +G+ Y RLYETCKE FA+CGPGVVPS +K+E LK Sbjct: 1 MRIDKNVCERRGREYSDSKKNRRRQRMISPVQRLYETCKETFANCGPGVVPSAEKIERLK 60 Query: 711 AALDNMSEVDVGLNPNMPFFMADATDRPPRITYIHIYDCDKFSIGIFCLPPSGVIPLHNH 532 LD M+ DVGL PNMP+F + DRPP ITY+H+++CDKFSIGIFCLPPS VIPLH+H Sbjct: 61 EVLDTMAGADVGLRPNMPYFKSIRYDRPPTITYLHLHECDKFSIGIFCLPPSAVIPLHDH 120 Query: 531 PQMTVFSKLLFGTMHIKSYDWVDNAHASASA-------EQGGDMGGLRLAQVKVNSEFTA 373 P MTVFSKLLFG MHIKSYDWVDN A + G G+RLA+VK+NS F A Sbjct: 121 PGMTVFSKLLFGEMHIKSYDWVDNLPAEPTPLAKPLDNGLGDSTTGIRLAKVKMNSAFRA 180 Query: 372 PCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGRHCQYYLDFPLDSYSVEGVDV 193 PC TSILYPADGGNMHCFTA T CAVLDVLGPPYCD EGRHCQYY DFP + SV Sbjct: 181 PCKTSILYPADGGNMHCFTAKTACAVLDVLGPPYCDPEGRHCQYYYDFPFSNISVP---- 236 Query: 192 AMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 EE+K YAWL+ERE+P+D+TVVGALY GPK+V Sbjct: 237 --EEQKGDYAWLKEREKPDDLTVVGALYKGPKMV 268 >ref|XP_004233351.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Solanum lycopersicum] Length = 263 Score = 352 bits (903), Expect = 2e-94 Identities = 161/234 (68%), Positives = 190/234 (81%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 LY TCK+VF +C PGVVPSP+ VEL+K+ LD M+E DVGL PNMP+F + +D+PP+ITY Sbjct: 33 LYRTCKQVFTNCKPGVVPSPENVELVKSVLDKMTEADVGLRPNMPYFKSTVSDKPPKITY 92 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 +H+++CDKFSIGIFCLPP VIPLHNHP MTVFSKLLFG MHIKSYDW DN ++ Sbjct: 93 LHLHECDKFSIGIFCLPPKAVIPLHNHPGMTVFSKLLFGKMHIKSYDWADNLLPKSTTP- 151 Query: 432 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 253 GD GLRLA++KVNS+F APC TSILYPADGGNMHCFTA T CAVLDVLGPPYCD +GR Sbjct: 152 -GDCTGLRLAKLKVNSKFKAPCKTSILYPADGGNMHCFTAKTACAVLDVLGPPYCDPDGR 210 Query: 252 HCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 HCQYY DFP + SV + EEE+S YAWL+ERE+PED+TV GALY+GP +V Sbjct: 211 HCQYYYDFPFANMSVNDF-LVPEEEQSEYAWLKEREKPEDLTVAGALYSGPNLV 263 >gb|EMJ16977.1| hypothetical protein PRUPE_ppa009667mg [Prunus persica] Length = 282 Score = 337 bits (863), Expect = 9e-90 Identities = 156/237 (65%), Positives = 186/237 (78%), Gaps = 3/237 (1%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 LY+TCK+VF+ CG G+VPSP+ ++ L++ LD M DVGL P +P+F R P ITY Sbjct: 45 LYQTCKDVFSFCGAGIVPSPEDIQRLRSVLDTMKPADVGLTPELPYFRMTVARRTPAITY 104 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 +H+++C+KFS+GIFCLPPSGV+PLHNHP MTVFSKLLFGTMHIKSYDWV +A S Sbjct: 105 LHLHECEKFSMGIFCLPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVADATEDKSTSA 164 Query: 432 GGDMG---GLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDS 262 G+RLA+VKV+++FTAPCNTSILYPADGGNMHCFTA+T CAVLDVLGPPY D Sbjct: 165 NPSPATPPGVRLAKVKVDADFTAPCNTSILYPADGGNMHCFTAVTACAVLDVLGPPYSDP 224 Query: 261 EGRHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 +GRHCQYYLDFP +SV+GV VA EEEK YAWLQE E+PED+ V GA Y GPKIV Sbjct: 225 DGRHCQYYLDFPFSHFSVDGVSVA-EEEKEGYAWLQEIEKPEDLAVDGAKYRGPKIV 280 >gb|EOY03391.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 310 Score = 333 bits (854), Expect = 1e-88 Identities = 160/238 (67%), Positives = 184/238 (77%), Gaps = 4/238 (1%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 L++TCK+VFA G G+VP+PDK+E L+A LD + DVGL P MPFF T R P ITY Sbjct: 72 LFDTCKDVFALAGTGIVPTPDKIEQLRAVLDQIQPADVGLTPQMPFFSLPVTRRAPPITY 131 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWV----DNAHASA 445 HI++C+KFS+GIFCLPPSGV+PLHNHP MTVFSKLLFGTMHIKSYDWV NA A Sbjct: 132 QHIHECEKFSMGIFCLPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPSNASAVV 191 Query: 444 SAEQGGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCD 265 + Q +RLA+VKV+S+FTAPC+ SILYPADGGNMHCFTA+T CAVLDVLGPPY D Sbjct: 192 APSQTVQHREVRLAKVKVDSDFTAPCSASILYPADGGNMHCFTAVTACAVLDVLGPPYSD 251 Query: 264 SEGRHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 EGRHC YY D+P SV+GV VA EEEK YAWLQERE PED+ VVGA Y GP+IV Sbjct: 252 PEGRHCTYYFDYPFTKLSVDGVTVA-EEEKDKYAWLQEREEPEDLAVVGAPYTGPEIV 308 >gb|EOY03392.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 309 Score = 332 bits (850), Expect = 3e-88 Identities = 159/237 (67%), Positives = 185/237 (78%), Gaps = 3/237 (1%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 L++TCK+VFA G G+VP+PDK+E L+A LD + DVGL P MPFF T R P ITY Sbjct: 72 LFDTCKDVFALAGTGIVPTPDKIEQLRAVLDQIQPADVGLTPQMPFFSLPVTRRAPPITY 131 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 HI++C+KFS+GIFCLPPSGV+PLHNHP MTVFSKLLFGTMHIKSYDWV + ++ASA Sbjct: 132 QHIHECEKFSMGIFCLPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPSNASAVV 191 Query: 432 GGDM---GGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDS 262 +RLA+VKV+S+FTAPC+ SILYPADGGNMHCFTA+T CAVLDVLGPPY D Sbjct: 192 APSQMQHREVRLAKVKVDSDFTAPCSASILYPADGGNMHCFTAVTACAVLDVLGPPYSDP 251 Query: 261 EGRHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 EGRHC YY D+P SV+GV VA EEEK YAWLQERE PED+ VVGA Y GP+IV Sbjct: 252 EGRHCTYYFDYPFTKLSVDGVTVA-EEEKDKYAWLQEREEPEDLAVVGAPYTGPEIV 307 >gb|ACV71019.1| UPA19 [Capsicum annuum] Length = 276 Score = 327 bits (837), Expect = 9e-87 Identities = 159/242 (65%), Positives = 188/242 (77%), Gaps = 9/242 (3%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMAD-ATDRPPRIT 616 LY+TCK VFA+C PGVVPS + VE +KA LD M+ DVGL NMP+F + ++DRPP+IT Sbjct: 35 LYKTCKLVFANCRPGVVPSMENVERVKAVLDKMTLADVGLRRNMPYFKSTVSSDRPPKIT 94 Query: 615 YIHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASA- 439 Y+H+++CDKFS+GIFCLPP VIPLHNHP MTVFSKLLFG MHIKSYDW DN ++ Sbjct: 95 YLHLHECDKFSMGIFCLPPKAVIPLHNHPGMTVFSKLLFGKMHIKSYDWADNLLPESTPN 154 Query: 438 ----EQGGDMG--GLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGP 277 + G G G RLA++KVNS+F APC TSILYPADGGNMHCFTA T CAVLDVLGP Sbjct: 155 ANNFDNGAGYGNTGPRLAKLKVNSKFRAPCKTSILYPADGGNMHCFTAKTACAVLDVLGP 214 Query: 276 PYCDSEGRHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERER-PEDVTVVGALYNGP 100 PYCD EGRHCQYY DFP SV+G+ V EE++S Y WL ERE+ PED+TV GALY+GP Sbjct: 215 PYCDPEGRHCQYYYDFPFADLSVDGLSVP-EEQQSEYXWLIEREKLPEDLTVAGALYSGP 273 Query: 99 KI 94 K+ Sbjct: 274 KL 275 >ref|XP_004147465.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Cucumis sativus] Length = 288 Score = 325 bits (833), Expect = 3e-86 Identities = 154/240 (64%), Positives = 187/240 (77%), Gaps = 6/240 (2%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 LYETCK+VFA G G+VPS + +E L+A LD M VDVGL+P+MP+F ++ R P ITY Sbjct: 47 LYETCKKVFASSGTGIVPSSEDIERLQAVLDKMKPVDVGLSPDMPYFWTTSSQRTPPITY 106 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDW-----VDNAHAS 448 +H+Y+ +KFS+GIFCLPPSGVIPLHNHP MTVFSKLLFGTMHIK+YDW V+ A A Sbjct: 107 LHLYENNKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKAYDWAEAGAVNGASAC 166 Query: 447 ASAEQG-GDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPY 271 G +RLA+VKV+++FTAPC++SILYPADGGNMHCFTA+T CAVLDVLGPPY Sbjct: 167 VDTSSGTAPSRSVRLAKVKVDADFTAPCDSSILYPADGGNMHCFTAVTACAVLDVLGPPY 226 Query: 270 CDSEGRHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 D +GRHC YYLDFP +SV+ + V E E+ SYAWL+ERE+PED+ VGALY GPKIV Sbjct: 227 SDPDGRHCSYYLDFPFTEFSVDRISVP-EAERESYAWLEEREQPEDLAAVGALYEGPKIV 285 >ref|XP_002282633.1| PREDICTED: 2-aminoethanethiol dioxygenase isoform 2 [Vitis vinifera] gi|296082863|emb|CBI22164.3| unnamed protein product [Vitis vinifera] Length = 279 Score = 325 bits (833), Expect = 3e-86 Identities = 155/237 (65%), Positives = 183/237 (77%), Gaps = 3/237 (1%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 LYETCKEVF+ CG G+VP P VE L + L++M DVGLNP M F +A D P+ITY Sbjct: 42 LYETCKEVFSSCGAGIVPPPGDVEKLASVLNSMKLEDVGLNPEMSCFRTEAPDEAPKITY 101 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 +H+Y+C+KFSIGIFCLPPSGVIPLHNHP MTVFSKLLFG+MHIKSYDW + + SA Sbjct: 102 LHLYECEKFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGSMHIKSYDWAVGSPCNPSANA 161 Query: 432 GGDM---GGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDS 262 G++LA+VKV+++FTAPCN+SILYPADGGNMH FTA+T CAVLDVLGPPY D Sbjct: 162 NPSQIQHPGVQLAKVKVDADFTAPCNSSILYPADGGNMHRFTALTACAVLDVLGPPYSDP 221 Query: 261 EGRHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 EGR C YY DFP ++SV+GV V EEE+ YAWLQERE+ ED VVGA+YNGP IV Sbjct: 222 EGRDCTYYFDFPFTNFSVDGVSVP-EEEREGYAWLQEREKLEDFAVVGAVYNGPMIV 277 >gb|EOY03393.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 304 Score = 322 bits (825), Expect = 2e-85 Identities = 155/237 (65%), Positives = 180/237 (75%), Gaps = 3/237 (1%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 L++TCK+VFA G G+VP+PDK+E L+A LD + DVGL P MPFF T R P ITY Sbjct: 72 LFDTCKDVFALAGTGIVPTPDKIEQLRAVLDQIQPADVGLTPQMPFFSLPVTRRAPPITY 131 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 HI++C+KFS+GIFCLPPSGV+PLHNHP MTVFSKLLFGTMHIKSYDWV + ++ASA Sbjct: 132 QHIHECEKFSMGIFCLPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPSNASAVV 191 Query: 432 GGDM---GGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDS 262 +RLA+VKV+S+FTAPC+ SILYPADGGNMHCFTA+T CAVLDVLGPPY D Sbjct: 192 APSQMQHREVRLAKVKVDSDFTAPCSASILYPADGGNMHCFTAVTACAVLDVLGPPYSDP 251 Query: 261 EGRHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 EGRHC YY D+P SV EEEK YAWLQERE PED+ VVGA Y GP+IV Sbjct: 252 EGRHCTYYFDYPFTKLSV------AEEEKDKYAWLQEREEPEDLAVVGAPYTGPEIV 302 >ref|XP_002282644.1| PREDICTED: 2-aminoethanethiol dioxygenase isoform 3 [Vitis vinifera] Length = 268 Score = 320 bits (821), Expect = 6e-85 Identities = 156/234 (66%), Positives = 182/234 (77%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 LYETCKEVF+ CG G+VP P VE L + L++M DVGLNP M F +A D P+ITY Sbjct: 42 LYETCKEVFSSCGAGIVPPPGDVEKLASVLNSMKLEDVGLNPEMSCFRTEAPDEAPKITY 101 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 +H+Y+C+KFSIGIFCLPPSGVIPLHNHP MTVFSKLLFG+MHIKSYDW A S Q Sbjct: 102 LHLYECEKFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGSMHIKSYDW-----AVGSPFQ 156 Query: 432 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 253 G++LA+VKV+++FTAPCN+SILYPADGGNMH FTA+T CAVLDVLGPPY D EGR Sbjct: 157 ---HPGVQLAKVKVDADFTAPCNSSILYPADGGNMHRFTALTACAVLDVLGPPYSDPEGR 213 Query: 252 HCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 C YY DFP ++SV+GV V EEE+ YAWLQERE+ ED VVGA+YNGP IV Sbjct: 214 DCTYYFDFPFTNFSVDGVSVP-EEEREGYAWLQEREKLEDFAVVGAVYNGPMIV 266 >ref|XP_003554358.1| PREDICTED: 2-aminoethanethiol dioxygenase-like isoform 1 [Glycine max] Length = 281 Score = 320 bits (819), Expect = 1e-84 Identities = 152/240 (63%), Positives = 186/240 (77%), Gaps = 6/240 (2%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 L+ETCK VFA G G VP + ++ L++ LD + DVGL P+MP+F AT R PRITY Sbjct: 44 LFETCKVVFASAGTGFVPPHEDIDELQSVLDGIKPEDVGLRPDMPYFRTSATQRVPRITY 103 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASA---- 445 +HIY+C+KFS+GIFCLPPSGVIPLHNHP MTVFSKLLFGTMHIKSYDWV ++ + Sbjct: 104 LHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDSPPESPTTL 163 Query: 444 --SAEQGGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPY 271 S QG +M RLA+VKV+++FTAPCN SILYP DGGN+HCFTA+T CAVLDVLGPPY Sbjct: 164 KPSENQGPEM---RLAKVKVDADFTAPCNPSILYPEDGGNLHCFTAVTACAVLDVLGPPY 220 Query: 270 CDSEGRHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 D+EGRHC YY DFP ++SV+G+ + EEEK++Y WLQER+ ED+ V G +YNGPKIV Sbjct: 221 SDAEGRHCTYYHDFPFSNFSVDGLSIP-EEEKNAYEWLQERDELEDLEVNGKMYNGPKIV 279 >ref|XP_002517479.1| Protein C10orf22, putative [Ricinus communis] gi|223543490|gb|EEF45021.1| Protein C10orf22, putative [Ricinus communis] Length = 288 Score = 319 bits (817), Expect = 2e-84 Identities = 153/237 (64%), Positives = 183/237 (77%), Gaps = 1/237 (0%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 LY+TCK+VF+ GPGVVP+PDK+E L+A LD ++ DVGL+P MP+F R P I Y Sbjct: 55 LYDTCKDVFSIGGPGVVPAPDKIEKLRAVLDVITPEDVGLHPEMPYFRLPVAGRAPPIRY 114 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 +HI++C+KFSIGIFC PPSGVIPLHNHP MTVFSKLLFG MHIKSYDWVD + SA Sbjct: 115 LHIHECNKFSIGIFCFPPSGVIPLHNHPGMTVFSKLLFGKMHIKSYDWVDEDSVNGSAVV 174 Query: 432 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 253 + +RLA+VK++S+FTAPCN ILYP DGGNMHCFTA T CAVLDVLGPPY D EGR Sbjct: 175 --NPSEVRLAKVKIDSDFTAPCNPCILYPVDGGNMHCFTAATACAVLDVLGPPYSDPEGR 232 Query: 252 HCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQER-ERPEDVTVVGALYNGPKIVIK 85 HC YY DFP ++SV+GV + EEE+ YAWLQER ++P+D +VG LY GPKIV K Sbjct: 233 HCTYYNDFPFANFSVDGVSLP-EEEREGYAWLQERTKQPDDFKMVGELYRGPKIVKK 288 >ref|XP_003543744.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Glycine max] Length = 281 Score = 317 bits (811), Expect = 9e-84 Identities = 151/240 (62%), Positives = 184/240 (76%), Gaps = 6/240 (2%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 L+ETCK VFA G G VP + ++ L++ LD + DVGL P+MP+F AT R PRITY Sbjct: 44 LFETCKVVFASAGTGFVPPHEDIDELQSVLDGIKPEDVGLRPDMPYFRTSATQRVPRITY 103 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASA---- 445 +HIY+C+KFS+GIFCLPPSGVIPLHNHP MTVFSKLLFGTMHIKSYDWV + + Sbjct: 104 LHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDLPPESPTTI 163 Query: 444 --SAEQGGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPY 271 S QG +M RLA+VKV+++FTAPCN SILYP DGGN+HCFTA+T CAVLDVLGPPY Sbjct: 164 KPSENQGPEM---RLAKVKVDADFTAPCNPSILYPEDGGNLHCFTAVTACAVLDVLGPPY 220 Query: 270 CDSEGRHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 D+EGRHC YY +FP ++S +G+ + EEEK++Y WLQERE ED+ V G +YNGPKIV Sbjct: 221 SDAEGRHCTYYHNFPFSNFSADGLSIP-EEEKNAYEWLQEREELEDLEVNGKMYNGPKIV 279 >ref|XP_002305324.2| hypothetical protein POPTR_0004s13450g [Populus trichocarpa] gi|550340933|gb|EEE85835.2| hypothetical protein POPTR_0004s13450g [Populus trichocarpa] Length = 287 Score = 313 bits (803), Expect = 8e-83 Identities = 151/236 (63%), Positives = 174/236 (73%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 L+ TC EVF C G++PS D ++ LKA LDN DVGL P MP F A R P I Y Sbjct: 60 LFNTCNEVFDSCSTGIIPSSDNIQKLKAVLDNFKPADVGLFPEMPHFQASVAGRTPVIRY 119 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 +H+++CDKFS+GIFCLPPSGVIPLHNHP MTVFSKLLFGTMHIKSYDWV + AS S + Sbjct: 120 LHLHECDKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVADVPASKSKQT 179 Query: 432 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 253 +RLA+VKVNS+ TAPCNTSILYP DGGNMHCFTA+T CAVLDVLGPPY +GR Sbjct: 180 -----EVRLAKVKVNSKLTAPCNTSILYPTDGGNMHCFTAVTACAVLDVLGPPYSAPDGR 234 Query: 252 HCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIVIK 85 HCQYYLDFP ++S V + +K +AWLQERE PED+T VG LY GP IV K Sbjct: 235 HCQYYLDFPFANFSGTMVHL---HKKEGHAWLQERETPEDLTFVGELYGGPVIVEK 287 >ref|NP_198805.1| uncharacterized protein [Arabidopsis thaliana] gi|21536502|gb|AAM60834.1| unknown [Arabidopsis thaliana] gi|27808558|gb|AAO24559.1| At5g39890 [Arabidopsis thaliana] gi|110736241|dbj|BAF00091.1| hypothetical protein [Arabidopsis thaliana] gi|332007105|gb|AED94488.1| uncharacterized protein AT5G39890 [Arabidopsis thaliana] Length = 276 Score = 312 bits (799), Expect = 2e-82 Identities = 149/234 (63%), Positives = 180/234 (76%), Gaps = 1/234 (0%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 L++TCK+VFAD G VPS + +E+L+A LD + DVG+NP M +F + T R P +TY Sbjct: 50 LFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYFRSTVTGRSPLVTY 109 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 +HIY C +FSI IFCLPPSGVIPLHNHP+MTVFSKLLFGTMHIKSYDWV ++ +S Sbjct: 110 LHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDWVPDSPQPSS--- 166 Query: 432 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 253 RLA+VKV+S+FTAPC+TSILYPADGGNMHCFTA T CAVLDV+GPPY D GR Sbjct: 167 -----DTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPPYSDPAGR 221 Query: 252 HCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQER-ERPEDVTVVGALYNGPKI 94 HC YY D+P S+SV+GV VA EEEK YAWL+ER E+PED+TV +Y+GP I Sbjct: 222 HCTYYFDYPFSSFSVDGVVVA-EEEKEGYAWLKEREEKPEDLTVTALMYSGPTI 274 >dbj|BAB10214.1| unnamed protein product [Arabidopsis thaliana] Length = 270 Score = 312 bits (799), Expect = 2e-82 Identities = 149/234 (63%), Positives = 180/234 (76%), Gaps = 1/234 (0%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 L++TCK+VFAD G VPS + +E+L+A LD + DVG+NP M +F + T R P +TY Sbjct: 44 LFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYFRSTVTGRSPLVTY 103 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 +HIY C +FSI IFCLPPSGVIPLHNHP+MTVFSKLLFGTMHIKSYDWV ++ +S Sbjct: 104 LHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDWVPDSPQPSS--- 160 Query: 432 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEGR 253 RLA+VKV+S+FTAPC+TSILYPADGGNMHCFTA T CAVLDV+GPPY D GR Sbjct: 161 -----DTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPPYSDPAGR 215 Query: 252 HCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQER-ERPEDVTVVGALYNGPKI 94 HC YY D+P S+SV+GV VA EEEK YAWL+ER E+PED+TV +Y+GP I Sbjct: 216 HCTYYFDYPFSSFSVDGVVVA-EEEKEGYAWLKEREEKPEDLTVTALMYSGPTI 268 >ref|XP_006405584.1| hypothetical protein EUTSA_v10027872mg [Eutrema salsugineum] gi|557106722|gb|ESQ47037.1| hypothetical protein EUTSA_v10027872mg [Eutrema salsugineum] Length = 290 Score = 310 bits (795), Expect = 7e-82 Identities = 150/235 (63%), Positives = 181/235 (77%), Gaps = 2/235 (0%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADAT-DRPPRIT 616 L+ETC +VFAD G+VPS + +E+L+A LD + DV ++PNMP+F + + DR P +T Sbjct: 63 LFETCNKVFADGKSGIVPSQENIEMLRAVLDKIKPEDVDVSPNMPYFRSKSVGDRSPIVT 122 Query: 615 YIHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAE 436 Y+HIY C KFS+GIFCLPPSGVIPLHNHP+MTVFSKLLFGTMHIKSYDWV ++ +S Sbjct: 123 YLHIYKCHKFSMGIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDWVADSPQPSS-- 180 Query: 435 QGGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPYCDSEG 256 RLA+VKV+SEFTAPC+TSILYPADGGNMHCFTA T CAVLDVLGPPY D G Sbjct: 181 ------DTRLAKVKVDSEFTAPCDTSILYPADGGNMHCFTAKTACAVLDVLGPPYSDPAG 234 Query: 255 RHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQER-ERPEDVTVVGALYNGPKI 94 RHC YY D+P +SV+GV VA EEEK Y WL+ER E PED+TV+ +Y+GP I Sbjct: 235 RHCTYYFDYPFSRFSVDGVAVA-EEEKERYEWLKEREEEPEDLTVMAMMYSGPTI 288 >ref|XP_002323838.1| hypothetical protein POPTR_0017s11560g [Populus trichocarpa] gi|222866840|gb|EEF03971.1| hypothetical protein POPTR_0017s11560g [Populus trichocarpa] Length = 241 Score = 310 bits (795), Expect = 7e-82 Identities = 150/235 (63%), Positives = 173/235 (73%), Gaps = 1/235 (0%) Frame = -1 Query: 792 LYETCKEVFADCGPGVVPSPDKVELLKAALDNMSEVDVGLNPNMPFFMADATDRPPRITY 613 LY TC EVF C G++PSPD ++ LKA LD+ DVGL+P MP F A P I Y Sbjct: 10 LYNTCNEVFDSCSAGIIPSPDNIQKLKAVLDDFKPADVGLSPEMPHFRASVAGETPVIRY 69 Query: 612 IHIYDCDKFSIGIFCLPPSGVIPLHNHPQMTVFSKLLFGTMHIKSYDWVDNAHASASAEQ 433 I+I++C+KFSIGIFCLPPS IPLHNHP MTVFSKLLFGTMHIKSYDWV + S SA Q Sbjct: 70 IYIHECEKFSIGIFCLPPSSAIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPPSTSAVQ 129 Query: 432 GGDMGGLRLAQVKVNSEFTAPCNTSILYPADGGNMHCFTAMTPCAVLDVLGPPY-CDSEG 256 RLA+VKVNS FTAPCNTSILYP DGGNMHCFTA+T CAVLDVLGPPY DS+G Sbjct: 130 ----PEARLAEVKVNSNFTAPCNTSILYPTDGGNMHCFTAVTACAVLDVLGPPYGSDSDG 185 Query: 255 RHCQYYLDFPLDSYSVEGVDVAMEEEKSSYAWLQERERPEDVTVVGALYNGPKIV 91 RHCQ+Y DFP + SV+G+ + E K +AWLQER++PED+ VVG LY P V Sbjct: 186 RHCQFYFDFPFSNISVDGLSLP-EGGKEGFAWLQERKKPEDLIVVGELYGDPTTV 239