BLASTX nr result
ID: Ephedra26_contig00017174
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Ephedra26_contig00017174 (1251 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|ADE76354.1| unknown [Picea sitchensis] 222 3e-55 gb|EOY03392.1| Uncharacterized protein isoform 2 [Theobroma cacao] 221 5e-55 gb|EOY03391.1| Uncharacterized protein isoform 1 [Theobroma cacao] 221 6e-55 gb|EOY03393.1| Uncharacterized protein isoform 3 [Theobroma cacao] 218 3e-54 ref|XP_003543744.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 218 4e-54 gb|EMJ16977.1| hypothetical protein PRUPE_ppa009667mg [Prunus pe... 217 9e-54 ref|XP_003554358.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 214 5e-53 gb|ESW23204.1| hypothetical protein PHAVU_004G026900g [Phaseolus... 214 8e-53 ref|XP_003621141.1| 2-aminoethanethiol dioxygenase [Medicago tru... 211 4e-52 ref|XP_002517479.1| Protein C10orf22, putative [Ricinus communis... 211 5e-52 ref|XP_002871649.1| hypothetical protein ARALYDRAFT_488353 [Arab... 209 1e-51 ref|XP_004489380.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 209 3e-51 gb|EXC32620.1| 2-aminoethanethiol dioxygenase [Morus notabilis] 208 3e-51 ref|NP_197016.1| uncharacterized protein [Arabidopsis thaliana] ... 208 4e-51 ref|XP_006842138.1| hypothetical protein AMTR_s00078p00120410 [A... 205 3e-50 ref|XP_002272019.1| PREDICTED: 2-aminoethanethiol dioxygenase [V... 205 3e-50 ref|XP_006400029.1| hypothetical protein EUTSA_v10014207mg [Eutr... 204 8e-50 ref|XP_003534459.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 202 2e-49 ref|XP_006587759.1| PREDICTED: 2-aminoethanethiol dioxygenase-li... 201 5e-49 gb|ESW11590.1| hypothetical protein PHAVU_008G043100g [Phaseolus... 201 5e-49 >gb|ADE76354.1| unknown [Picea sitchensis] Length = 275 Score = 222 bits (565), Expect = 3e-55 Identities = 121/288 (42%), Positives = 169/288 (58%), Gaps = 1/288 (0%) Frame = +3 Query: 87 SVVQQLYLLCVETFSVPCRDYTPD-AIHKLHSFLDRIRPADLGIKEPLQLQESEKTKPLG 263 S VQ LY +C ETFS P AI +L S LD I+P D+G+ E + E G Sbjct: 26 SAVQNLYEVCNETFSSSAVPVPPQRAIQRLRSVLDTIKPVDVGLNEDV----FENDHGYG 81 Query: 264 VRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLPPSGVLPLHNHPGM 443 G S G ++ + +A P+TY+H+YECDRFSIGIFCLP S V+P HNHPGM Sbjct: 82 F---FGPSLWRGRHSRIVAR---WAAPVTYLHLYECDRFSIGIFCLPASAVIPFHNHPGM 135 Query: 444 TVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSELAKTKVDRIYTAPCEPTILCPKRG 623 TVLSKLL+GS+++++YD V P + +S P+ LA+ +VD ++T+PC+ ++L P G Sbjct: 136 TVLSKLLFGSMYIKAYDWVD--PINTETNSNPSQLRLARLEVDNVFTSPCDTSVLYPTSG 193 Query: 624 GTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRISADNGPAEPNDRQECSRSP 803 G IHS AVT+CA+LDVL PPYSD +GR+CTYY +P + D +D Q C+ Sbjct: 194 GNIHSFRAVTSCAVLDVLGPPYSDIEGRNCTYYSEYPYSSLPDDGNTIPDDDDQGCA--- 250 Query: 804 ESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEYRGPKIDS 947 WLEE + P++F++RG Y+GP+I++ Sbjct: 251 -----------------------WLEEIKRPDEFIVRGAPYKGPQIEA 275 >gb|EOY03392.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 309 Score = 221 bits (563), Expect = 5e-55 Identities = 125/306 (40%), Positives = 170/306 (55%), Gaps = 9/306 (2%) Frame = +3 Query: 51 RRNKENGVKASLSV--VQQLYLLCVETFSVPCRDY--TPDAIHKLHSFLDRIRPADLGIK 218 RR K+ + A++ V VQ+L+ C + F++ TPD I +L + LD+I+PAD+G+ Sbjct: 53 RRPKKTTMPAAVVVSPVQRLFDTCKDVFALAGTGIVPTPDKIEQLRAVLDQIQPADVGLT 112 Query: 219 EPLQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFC 398 + T+ APPITY H++EC++FS+GIFC Sbjct: 113 PQMPFFSLPVTRR--------------------------APPITYQHIHECEKFSMGIFC 146 Query: 399 LPPSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSE-----LAKT 563 LPPSGVLPLHNHPGMTV SKLL+G++H++SYD V D P++ P+ + LAK Sbjct: 147 LPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPSNASAVVAPSQMQHREVRLAKV 206 Query: 564 KVDRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHR 743 KVD +TAPC +IL P GG +H TAVTACA+LDVL PPYSD +GRHCTYY +P + Sbjct: 207 KVDSDFTAPCSASILYPADGGNMHCFTAVTACAVLDVLGPPYSDPEGRHCTYYFDYPFTK 266 Query: 744 ISADNGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDE 923 +S D + E+ D+ Y WL+E E PED + G Sbjct: 267 LSVDGVTV-----------------AEEEKDK---------YAWLQEREEPEDLAVVGAP 300 Query: 924 YRGPKI 941 Y GP+I Sbjct: 301 YTGPEI 306 >gb|EOY03391.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 310 Score = 221 bits (562), Expect = 6e-55 Identities = 125/307 (40%), Positives = 170/307 (55%), Gaps = 10/307 (3%) Frame = +3 Query: 51 RRNKENGVKASLSV--VQQLYLLCVETFSVPCRDY--TPDAIHKLHSFLDRIRPADLGIK 218 RR K+ + A++ V VQ+L+ C + F++ TPD I +L + LD+I+PAD+G+ Sbjct: 53 RRPKKTTMPAAVVVSPVQRLFDTCKDVFALAGTGIVPTPDKIEQLRAVLDQIQPADVGLT 112 Query: 219 EPLQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFC 398 + T+ APPITY H++EC++FS+GIFC Sbjct: 113 PQMPFFSLPVTRR--------------------------APPITYQHIHECEKFSMGIFC 146 Query: 399 LPPSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANS------ELAK 560 LPPSGVLPLHNHPGMTV SKLL+G++H++SYD V D P++ P+ + LAK Sbjct: 147 LPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPSNASAVVAPSQTVQHREVRLAK 206 Query: 561 TKVDRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCH 740 KVD +TAPC +IL P GG +H TAVTACA+LDVL PPYSD +GRHCTYY +P Sbjct: 207 VKVDSDFTAPCSASILYPADGGNMHCFTAVTACAVLDVLGPPYSDPEGRHCTYYFDYPFT 266 Query: 741 RISADNGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGD 920 ++S D + E+ D+ Y WL+E E PED + G Sbjct: 267 KLSVDGVTV-----------------AEEEKDK---------YAWLQEREEPEDLAVVGA 300 Query: 921 EYRGPKI 941 Y GP+I Sbjct: 301 PYTGPEI 307 >gb|EOY03393.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 304 Score = 218 bits (556), Expect = 3e-54 Identities = 124/306 (40%), Positives = 169/306 (55%), Gaps = 9/306 (2%) Frame = +3 Query: 51 RRNKENGVKASLSV--VQQLYLLCVETFSVPCRDY--TPDAIHKLHSFLDRIRPADLGIK 218 RR K+ + A++ V VQ+L+ C + F++ TPD I +L + LD+I+PAD+G+ Sbjct: 53 RRPKKTTMPAAVVVSPVQRLFDTCKDVFALAGTGIVPTPDKIEQLRAVLDQIQPADVGLT 112 Query: 219 EPLQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFC 398 + T+ APPITY H++EC++FS+GIFC Sbjct: 113 PQMPFFSLPVTRR--------------------------APPITYQHIHECEKFSMGIFC 146 Query: 399 LPPSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSE-----LAKT 563 LPPSGVLPLHNHPGMTV SKLL+G++H++SYD V D P++ P+ + LAK Sbjct: 147 LPPSGVLPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPSNASAVVAPSQMQHREVRLAKV 206 Query: 564 KVDRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHR 743 KVD +TAPC +IL P GG +H TAVTACA+LDVL PPYSD +GRHCTYY +P + Sbjct: 207 KVDSDFTAPCSASILYPADGGNMHCFTAVTACAVLDVLGPPYSDPEGRHCTYYFDYPFTK 266 Query: 744 ISADNGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDE 923 +S + E+ D+ Y WL+E E PED + G Sbjct: 267 LSV----------------------AEEEKDK---------YAWLQEREEPEDLAVVGAP 295 Query: 924 YRGPKI 941 Y GP+I Sbjct: 296 YTGPEI 301 >ref|XP_003543744.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Glycine max] Length = 281 Score = 218 bits (555), Expect = 4e-54 Identities = 123/304 (40%), Positives = 165/304 (54%), Gaps = 7/304 (2%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSVPCRDYTP--DAIHKLHSFLDRIRPADLGIKEP 224 RRN+ K VQ+L+ C F+ + P + I +L S LD I+P D+G++ Sbjct: 29 RRNRRRQRKKP--PVQKLFETCKVVFASAGTGFVPPHEDIDELQSVLDGIKPEDVGLRPD 86 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 + + T+ + P ITY+H+YEC++FS+GIFCLP Sbjct: 87 MPYFRTSATQRV--------------------------PRITYLHIYECEKFSMGIFCLP 120 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSE-----LAKTKV 569 PSGV+PLHNHPGMTV SKLL+G++H++SYD V D P + P + P+ ++ LAK KV Sbjct: 121 PSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDLPPESPTTIKPSENQGPEMRLAKVKV 180 Query: 570 DRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRIS 749 D +TAPC P+IL P+ GG +H TAVTACA+LDVL PPYSD +GRHCTYY P S Sbjct: 181 DADFTAPCNPSILYPEDGGNLHCFTAVTACAVLDVLGPPYSDAEGRHCTYYHNFPFSNFS 240 Query: 750 ADNGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEYR 929 AD G + P + + Y WL+E E ED + G Y Sbjct: 241 AD-GLSIPEEEKNA-------------------------YEWLQEREELEDLEVNGKMYN 274 Query: 930 GPKI 941 GPKI Sbjct: 275 GPKI 278 >gb|EMJ16977.1| hypothetical protein PRUPE_ppa009667mg [Prunus persica] Length = 282 Score = 217 bits (552), Expect = 9e-54 Identities = 128/297 (43%), Positives = 161/297 (54%), Gaps = 11/297 (3%) Frame = +3 Query: 84 LSVVQQLYLLCVETFS------VPCRDYTPDAIHKLHSFLDRIRPADLGIKEPLQLQESE 245 +S VQ+LY C + FS VP +P+ I +L S LD ++PAD+G+ L Sbjct: 39 MSPVQRLYQTCKDVFSFCGAGIVP----SPEDIQRLRSVLDTMKPADVGLTPELPY---- 90 Query: 246 KTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLPPSGVLPL 425 R V A+RT P ITY+H++EC++FS+GIFCLPPSGVLPL Sbjct: 91 ------FRMTV---------ARRT-------PAITYLHLHECEKFSMGIFCLPPSGVLPL 128 Query: 426 HNHPGMTVLSKLLYGSLHLRSYDIVTDYPAD-----IPKSSCPANSELAKTKVDRIYTAP 590 HNHPGMTV SKLL+G++H++SYD V D D P + P LAK KVD +TAP Sbjct: 129 HNHPGMTVFSKLLFGTMHIKSYDWVADATEDKSTSANPSPATPPGVRLAKVKVDADFTAP 188 Query: 591 CEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRISADNGPAE 770 C +IL P GG +H TAVTACA+LDVL PPYSD DGRHC YY P S D Sbjct: 189 CNTSILYPADGGNMHCFTAVTACAVLDVLGPPYSDPDGRHCQYYLDFPFSHFSVDG---- 244 Query: 771 PNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEYRGPKI 941 S ++ + Y WL+E E PED + G +YRGPKI Sbjct: 245 ----------------------VSVAEEEKEGYAWLQEIEKPEDLAVDGAKYRGPKI 279 >ref|XP_003554358.1| PREDICTED: 2-aminoethanethiol dioxygenase-like isoform 1 [Glycine max] Length = 281 Score = 214 bits (546), Expect = 5e-53 Identities = 121/304 (39%), Positives = 164/304 (53%), Gaps = 7/304 (2%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSVPCRDYTP--DAIHKLHSFLDRIRPADLGIKEP 224 RRN+ K VQ+L+ C F+ + P + I +L S LD I+P D+G++ Sbjct: 29 RRNRRRQRKKP--PVQKLFETCKVVFASAGTGFVPPHEDIDELQSVLDGIKPEDVGLRPD 86 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 + + T+ + P ITY+H+YEC++FS+GIFCLP Sbjct: 87 MPYFRTSATQRV--------------------------PRITYLHIYECEKFSMGIFCLP 120 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSE-----LAKTKV 569 PSGV+PLHNHPGMTV SKLL+G++H++SYD V D P + P + P+ ++ LAK KV Sbjct: 121 PSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDSPPESPTTLKPSENQGPEMRLAKVKV 180 Query: 570 DRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRIS 749 D +TAPC P+IL P+ GG +H TAVTACA+LDVL PPYSD +GRHCTYY P S Sbjct: 181 DADFTAPCNPSILYPEDGGNLHCFTAVTACAVLDVLGPPYSDAEGRHCTYYHDFPFSNFS 240 Query: 750 ADNGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEYR 929 D G + P + + Y WL+E + ED + G Y Sbjct: 241 VD-GLSIPEEEKNA-------------------------YEWLQERDELEDLEVNGKMYN 274 Query: 930 GPKI 941 GPKI Sbjct: 275 GPKI 278 >gb|ESW23204.1| hypothetical protein PHAVU_004G026900g [Phaseolus vulgaris] Length = 281 Score = 214 bits (544), Expect = 8e-53 Identities = 123/304 (40%), Positives = 160/304 (52%), Gaps = 7/304 (2%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSVPCRDYTPDA--IHKLHSFLDRIRPADLGIKEP 224 RRN+ K VQ L+ C F+ + P I KL S LD IRP D+G++ Sbjct: 29 RRNRRRERKKP--PVQMLFETCKVVFASGGTGFVPPLRDIEKLRSVLDGIRPEDVGLRPD 86 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 + + ++ + P I Y+H+YEC++FS+GIFCLP Sbjct: 87 MPYFRTSASQRV--------------------------PKIQYLHIYECEKFSMGIFCLP 120 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSE-----LAKTKV 569 PSGV+PLHNHPGMTV SKLL+G++H++SYD V D P + PK P ++ LAK KV Sbjct: 121 PSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDMPPESPKIINPPENQAPEMRLAKIKV 180 Query: 570 DRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRIS 749 D +TAPC P+IL P+ GG +H TAVTACA LDVL PPYSD +GRHCTYY P S Sbjct: 181 DADFTAPCNPSILYPEDGGNMHCFTAVTACAFLDVLGPPYSDSEGRHCTYYHNFPFSNFS 240 Query: 750 ADNGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEYR 929 D G + P + + Y WL+E E ED ++G Y Sbjct: 241 VD-GLSIPEEEKSA-------------------------YEWLQEREELEDLEVKGKMYS 274 Query: 930 GPKI 941 GPKI Sbjct: 275 GPKI 278 >ref|XP_003621141.1| 2-aminoethanethiol dioxygenase [Medicago truncatula] gi|355496156|gb|AES77359.1| 2-aminoethanethiol dioxygenase [Medicago truncatula] Length = 272 Score = 211 bits (538), Expect = 4e-52 Identities = 124/300 (41%), Positives = 160/300 (53%), Gaps = 3/300 (1%) Frame = +3 Query: 48 PRRNKEN-GVKASLSVVQQLYLLCVETFSVPCRDYTPDAIH--KLHSFLDRIRPADLGIK 218 PR+N+ + + ++ VQ+L+L C F+ P + H L S L I+P DLG+K Sbjct: 23 PRKNRRHLRRRTEMTPVQKLFLACKHVFANAAHGIVPSSQHIEMLRSVLAGIKPEDLGLK 82 Query: 219 EPLQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFC 398 + +N +GG P ITY+H+YEC++FS+GIFC Sbjct: 83 PDMPYF--------------------------SNINGG-TPKITYLHIYECEKFSMGIFC 115 Query: 399 LPPSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSELAKTKVDRI 578 LPPSGV+PLHNHPGMTV SKLL+G++H++SYD D PAD+ ++ P LAK KVD Sbjct: 116 LPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWAGDLPADVSQTQIP-EKRLAKIKVDAD 174 Query: 579 YTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRISADN 758 +TAPC P+IL P GG +H TAVTACA+LDVL PPYSD DGRHC YYR P N Sbjct: 175 FTAPCNPSILYPDDGGNMHCFTAVTACAVLDVLGPPYSDPDGRHCAYYRSFP-----FSN 229 Query: 759 GPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEYRGPK 938 P E E E+ D Y WL+E E PE + Y K Sbjct: 230 FPVEGISIPE-----------EEKKD----------YEWLQEREKPESLQVIVKMYSSSK 268 >ref|XP_002517479.1| Protein C10orf22, putative [Ricinus communis] gi|223543490|gb|EEF45021.1| Protein C10orf22, putative [Ricinus communis] Length = 288 Score = 211 bits (537), Expect = 5e-52 Identities = 123/291 (42%), Positives = 157/291 (53%), Gaps = 3/291 (1%) Frame = +3 Query: 78 ASLSVVQQLYLLCVETFSV--PCRDYTPDAIHKLHSFLDRIRPADLGIKEPLQLQESEKT 251 A +S VQ+LY C + FS+ P PD I KL + LD I P D+G+ + Sbjct: 47 AVVSPVQKLYDTCKDVFSIGGPGVVPAPDKIEKLRAVLDVITPEDVGLHPEMPY------ 100 Query: 252 KPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLPPSGVLPLHN 431 R V G APPI Y+H++EC++FSIGIFC PPSGV+PLHN Sbjct: 101 ----FRLPVA----------------GRAPPIRYLHIHECNKFSIGIFCFPPSGVIPLHN 140 Query: 432 HPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSELAKTKVDRIYTAPCEPTILC 611 HPGMTV SKLL+G +H++SYD V + + P+ LAK K+D +TAPC P IL Sbjct: 141 HPGMTVFSKLLFGKMHIKSYDWVDEDSVNGSAVVNPSEVRLAKVKIDSDFTAPCNPCILY 200 Query: 612 PKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRISADNGPAEPNDRQEC 791 P GG +H TA TACA+LDVL PPYSD +GRHCTYY P S D G + P + +E Sbjct: 201 PVDGGNMHCFTAATACAVLDVLGPPYSDPEGRHCTYYNDFPFANFSVD-GVSLPEEERE- 258 Query: 792 SRSPESYDRSSEQDDRSPESDQCQMYVWLEE-GEVPEDFVIRGDEYRGPKI 941 Y WL+E + P+DF + G+ YRGPKI Sbjct: 259 ------------------------GYAWLQERTKQPDDFKMVGELYRGPKI 285 >ref|XP_002871649.1| hypothetical protein ARALYDRAFT_488353 [Arabidopsis lyrata subsp. lyrata] gi|297317486|gb|EFH47908.1| hypothetical protein ARALYDRAFT_488353 [Arabidopsis lyrata subsp. lyrata] Length = 289 Score = 209 bits (533), Expect = 1e-51 Identities = 121/302 (40%), Positives = 165/302 (54%), Gaps = 4/302 (1%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSVPCRDYTP--DAIHKLHSFLDRIRPADLGIKEP 224 RR K + ++ V++L+ C E FS P D I +L LD ++P D+G+ Sbjct: 40 RRKKIDSPADEITAVRRLFNTCKEVFSNGGPGVVPSEDKIQQLREILDDMKPEDVGLAPT 99 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 + R G+ TR+ +PPITY+H+++CD+FSIGIFCLP Sbjct: 100 MPY----------FRPNTGLETRS-------------SPPITYLHLHQCDQFSIGIFCLP 136 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSELAKTKVDRIYT 584 PSGV+PLHNHPGMTV SKLL+G++H++SYD V D P PK + LAK KVD +T Sbjct: 137 PSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDTPMRDPK------TWLAKLKVDSTFT 190 Query: 585 APCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRISADNGP 764 APC +IL P+ GG +H TA TACA+LDVL PPY + +GRHCTY+ P + Sbjct: 191 APCNTSILYPEDGGNMHRFTAKTACAVLDVLGPPYCNPEGRHCTYFLEFPFDQF------ 244 Query: 765 AEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEE-GEVPEDFV-IRGDEYRGPK 938 SSE DD ++ + Y WL+E + PED + G YRGPK Sbjct: 245 ------------------SSEDDDILRSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPK 286 Query: 939 ID 944 ++ Sbjct: 287 VE 288 >ref|XP_004489380.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Cicer arietinum] Length = 282 Score = 209 bits (531), Expect = 3e-51 Identities = 122/304 (40%), Positives = 161/304 (52%), Gaps = 7/304 (2%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSVPCRDYTPDA--IHKLHSFLDRIRPADLGIKEP 224 RRN+ K + VQ+L+ C E F P I KL S LD I+P D+ +K Sbjct: 29 RRNRRRQKKTT-PPVQKLFETCKEVFESVETGIVPPTQDIDKLRSVLDGIKPEDVDLKPD 87 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 + R +++ P ITY+H+YEC++FS+GIFCLP Sbjct: 88 MPY----------------------FRENASHRR----PKITYLHIYECEKFSMGIFCLP 121 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSE-----LAKTKV 569 PSGV+PLHNHPGMTV SKLL+G++H++SYD V D P + P P+ S+ LAK KV Sbjct: 122 PSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDLPPESPTIVKPSESQIPELRLAKIKV 181 Query: 570 DRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRIS 749 D +TAPC P+IL P+ GG +H TAVTACA LDVL PPYSD +GRHCTYY +P S Sbjct: 182 DDDFTAPCNPSILYPEDGGNLHCFTAVTACAFLDVLGPPYSDFEGRHCTYYTNYPFSNFS 241 Query: 750 ADNGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEYR 929 + G + P + ++ Y WL+E + ED + G Y Sbjct: 242 VE-GLSIPEEEKKA-------------------------YEWLQEKDQLEDLKVEGKMYS 275 Query: 930 GPKI 941 GP I Sbjct: 276 GPTI 279 >gb|EXC32620.1| 2-aminoethanethiol dioxygenase [Morus notabilis] Length = 316 Score = 208 bits (530), Expect = 3e-51 Identities = 125/314 (39%), Positives = 168/314 (53%), Gaps = 17/314 (5%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSVPCRDYTP--DAIHKLHSFLDRIRPADLGIKEP 224 R+N+ K +S VQ+L+ +C E F+ P + I +L S LD ++P D+G+ Sbjct: 29 RKNRRRYKK--MSPVQKLFEMCKEVFTAGATGVVPPPEDIQRLQSVLDVMKPEDVGLTPE 86 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 L RA G T P ITY+H++EC+ FS+GIFCLP Sbjct: 87 LPY----------FRANAGSRT----------------PAITYLHLHECENFSMGIFCLP 120 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADI------PKSSCPANSELAKTK 566 PSGV+PLHNHPGMTV SKLL+G++H++SYD V D P++ + + ++ LAK K Sbjct: 121 PSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDVPSNTSATVNSSQDTTTSDVRLAKVK 180 Query: 567 VDRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRI 746 VD +TAPC +IL P GG +H TAVTACA+LDVL PPYSD DGRHCTYY P Sbjct: 181 VDSDFTAPCNASILYPADGGNMHCFTAVTACAVLDVLGPPYSDPDGRHCTYYHDRPFSDF 240 Query: 747 SADNGPAEPNDRQEC-SRSPESYDRSS-------EQDDRSPESDQCQMYVWLEEGEV-PE 899 S + S P S D + ++ + + WL+E E+ PE Sbjct: 241 SGTLAIFLLGSNENVHSFLPLPNSEFSTLVLFGISVDGVAVPEEEKESHAWLQEREILPE 300 Query: 900 DFVIRGDEYRGPKI 941 D + G YRGPKI Sbjct: 301 DLAVVGAPYRGPKI 314 >ref|NP_197016.1| uncharacterized protein [Arabidopsis thaliana] gi|7671481|emb|CAB89322.1| putative protein [Arabidopsis thaliana] gi|30725348|gb|AAP37696.1| At5g15120 [Arabidopsis thaliana] gi|110736659|dbj|BAF00293.1| hypothetical protein [Arabidopsis thaliana] gi|332004736|gb|AED92119.1| uncharacterized protein AT5G15120 [Arabidopsis thaliana] Length = 293 Score = 208 bits (529), Expect = 4e-51 Identities = 119/302 (39%), Positives = 165/302 (54%), Gaps = 4/302 (1%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSVPCRDYTP--DAIHKLHSFLDRIRPADLGIKEP 224 RR K + ++ V++L+ C E FS P D I +L LD ++P D+G+ Sbjct: 44 RRKKIDSPADGITAVRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPT 103 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 + R G+ R+ +PPITY+H+++CD+FSIGIFCLP Sbjct: 104 MPY----------FRPNSGVEARS-------------SPPITYLHLHQCDQFSIGIFCLP 140 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSELAKTKVDRIYT 584 PSGV+PLHNHPGMTV SKLL+G++H++SYD V D P K + LAK KVD +T Sbjct: 141 PSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDAPMRDSK------TRLAKLKVDSTFT 194 Query: 585 APCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRISADNGP 764 APC +IL P+ GG +H TA+TACA+LDVL PPY + +GRHCTY+ P ++ Sbjct: 195 APCNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNPEGRHCTYFLEFPLDKL------ 248 Query: 765 AEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEE-GEVPEDFV-IRGDEYRGPK 938 SSE DD ++ + Y WL+E + PED + G YRGPK Sbjct: 249 ------------------SSEDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPK 290 Query: 939 ID 944 ++ Sbjct: 291 VE 292 >ref|XP_006842138.1| hypothetical protein AMTR_s00078p00120410 [Amborella trichopoda] gi|548844187|gb|ERN03813.1| hypothetical protein AMTR_s00078p00120410 [Amborella trichopoda] Length = 273 Score = 205 bits (522), Expect = 3e-50 Identities = 116/302 (38%), Positives = 159/302 (52%), Gaps = 5/302 (1%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSVPCRDYTPDAIHKLHSFLDRIRPADLGIKEPLQ 230 ++ + KA + VQ+L+ +C + F+ +P + +L S LD ++P+D+G+ E + Sbjct: 22 KKTRRKHKKAMPTAVQRLFEICNDVFAGAGSVPSPPQVERLQSVLDSMKPSDVGLNELMP 81 Query: 231 LQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLPPS 410 E+EK + G+ PPITY+HVYECD FSIGIFCLPPS Sbjct: 82 YFEAEKNE-------------------------GY-PPITYLHVYECDNFSIGIFCLPPS 115 Query: 411 GVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTD-----YPADIPKSSCPANSELAKTKVDR 575 GV+PLHNHP MTV SKLL+GS+H++S+D +PA K+ ++ LAK KVD Sbjct: 116 GVIPLHNHPNMTVFSKLLFGSMHIKSFDWAPPPFDAVWPAK-AKAETTSSVRLAKVKVDS 174 Query: 576 IYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRISAD 755 + APC+ +IL P GG +H+ A TACA+LDV PPY+D GRHCTY+ P S D Sbjct: 175 DFNAPCKTSILYPTSGGNMHTFHAQTACAVLDVFGPPYNDSKGRHCTYFHEFPYPSFSGD 234 Query: 756 NGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEYRGP 935 + N + Y WLEE E P + G EY GP Sbjct: 235 AVSVQENGGE---------------------------YAWLEEIERPGSLKVVGAEYEGP 267 Query: 936 KI 941 KI Sbjct: 268 KI 269 >ref|XP_002272019.1| PREDICTED: 2-aminoethanethiol dioxygenase [Vitis vinifera] gi|297740513|emb|CBI30695.3| unnamed protein product [Vitis vinifera] Length = 244 Score = 205 bits (522), Expect = 3e-50 Identities = 117/286 (40%), Positives = 160/286 (55%) Frame = +3 Query: 84 LSVVQQLYLLCVETFSVPCRDYTPDAIHKLHSFLDRIRPADLGIKEPLQLQESEKTKPLG 263 + VVQ+LY C E+FSV + +A+ K+ S LD ++P+++G+++ QL K G Sbjct: 1 MPVVQKLYNACKESFSVD-GPLSEEALGKVRSILDDMKPSNVGLEQEAQLARGWKGSMHG 59 Query: 264 VRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLPPSGVLPLHNHPGM 443 K RNG + PPI Y+H++ECDRFSIGIFC+PPS ++PLHNHPGM Sbjct: 60 ANGK---KVRNGSHQ--------YPPPIKYLHLHECDRFSIGIFCMPPSSIIPLHNHPGM 108 Query: 444 TVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSELAKTKVDRIYTAPCEPTILCPKRG 623 TVLSKLLYG+LH++SYD + D+P ++ + + AK D +APC TIL P G Sbjct: 109 TVLSKLLYGTLHVKSYDWL-----DLPGTADLSQARPAKLVRDCEMSAPCGTTILYPTNG 163 Query: 624 GTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRISADNGPAEPNDRQECSRSP 803 G IH A+T CA+ DVL+PPYS +DGRHC+Y+R P + P Q C P Sbjct: 164 GNIHCFKAITPCALFDVLSPPYSSEDGRHCSYFRKSPRKDL--------PGIDQLCGIKP 215 Query: 804 ESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEYRGPKI 941 VWLEE + PE+ V+ +Y GP I Sbjct: 216 SE-------------------VVWLEEIQPPENVVVLRGQYEGPII 242 >ref|XP_006400029.1| hypothetical protein EUTSA_v10014207mg [Eutrema salsugineum] gi|557101119|gb|ESQ41482.1| hypothetical protein EUTSA_v10014207mg [Eutrema salsugineum] Length = 304 Score = 204 bits (518), Expect = 8e-50 Identities = 116/301 (38%), Positives = 160/301 (53%), Gaps = 3/301 (0%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSVPCRDYTP--DAIHKLHSFLDRIRPADLGIKEP 224 R+ ++ + ++ V++L+ C E FS P D I +L LD ++P D+G+ Sbjct: 55 RKKTDSSPEDEITAVRRLFNTCKEVFSDGGPGIVPSEDKIQQLRQILDNMKPEDVGLTPT 114 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 + R G+ G +PPITY+H+++CD+FSIGIFCLP Sbjct: 115 MPY----------FRPNAGLGN-------------GSSPPITYLHLHQCDQFSIGIFCLP 151 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANSELAKTKVDRIYT 584 PSGV+PLHNHPGMTV SKLL+G++H++SYD V D P PK + LAK K+D Sbjct: 152 PSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDAPMKDPK------TRLAKVKMDSTLN 205 Query: 585 APCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRISADNGP 764 APC +IL P+ GG +H TA TACA+LDVL PPY + +GRHCTY+ P Sbjct: 206 APCNASILYPEDGGNMHRFTAKTACAVLDVLGPPYCNPEGRHCTYFLDFPIEIF------ 259 Query: 765 AEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEE-GEVPEDFVIRGDEYRGPKI 941 SSE+DD + + WL+E + PED + G YRGPK+ Sbjct: 260 ------------------SSEEDDVLRGEMGKESHAWLQERDDNPEDLNVVGALYRGPKV 301 Query: 942 D 944 D Sbjct: 302 D 302 >ref|XP_003534459.1| PREDICTED: 2-aminoethanethiol dioxygenase-like isoform X1 [Glycine max] Length = 287 Score = 202 bits (514), Expect = 2e-49 Identities = 122/305 (40%), Positives = 161/305 (52%), Gaps = 8/305 (2%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSV--PCRDYTPDAIHKLHSFLDRIRPADLGIKEP 224 RRN+ + + +S Q+L+ C E F+ P +P I L S L I+ D+G+K Sbjct: 33 RRNRRHRQR-KMSPGQKLFQTCNEVFASTGPGIVPSPQNIEMLLSVLGGIKQEDVGLKPE 91 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 + S + RT P ITY+H+YEC FS+GIFCLP Sbjct: 92 MPFFSSNNPR-------------------RT-------PKITYLHIYECKEFSMGIFCLP 125 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANS------ELAKTK 566 P GV+PLHNHPGMTV SKLL+G++H++SYD V D P +P P++ LAK K Sbjct: 126 PCGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDLPPHMPTIVKPSSETLTPDMRLAKVK 185 Query: 567 VDRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRI 746 VD + APC+P+IL P GG +H TAVTACA+LDVL PPYSD DGRHCTYY+ P Sbjct: 186 VDADFNAPCDPSILYPADGGNMHWFTAVTACAVLDVLGPPYSDPDGRHCTYYQNFPFSNY 245 Query: 747 SADNGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEY 926 S D S +++R+ Y WL+E E PE+ + + Y Sbjct: 246 SVDG-------------------LSIPEEERT-------AYEWLQEKEKPENLKVVVNMY 279 Query: 927 RGPKI 941 GPKI Sbjct: 280 SGPKI 284 >ref|XP_006587759.1| PREDICTED: 2-aminoethanethiol dioxygenase-like isoform X2 [Glycine max] Length = 286 Score = 201 bits (511), Expect = 5e-49 Identities = 120/305 (39%), Positives = 160/305 (52%), Gaps = 8/305 (2%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSV--PCRDYTPDAIHKLHSFLDRIRPADLGIKEP 224 RRN+ + + +S Q+L+ C E F+ P +P I L S L I+ D+G+K Sbjct: 33 RRNRRHRQR-KMSPGQKLFQTCNEVFASTGPGIVPSPQNIEMLLSVLGGIKQEDVGLKPE 91 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 + S + RT P ITY+H+YEC FS+GIFCLP Sbjct: 92 MPFFSSNNPR-------------------RT-------PKITYLHIYECKEFSMGIFCLP 125 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADIPKSSCPANS------ELAKTK 566 P GV+PLHNHPGMTV SKLL+G++H++SYD V D P +P P++ LAK K Sbjct: 126 PCGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDLPPHMPTIVKPSSETLTPDMRLAKVK 185 Query: 567 VDRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRI 746 VD + APC+P+IL P GG +H TAVTACA+LDVL PPYSD DGRHCTYY Sbjct: 186 VDADFNAPCDPSILYPADGGNMHWFTAVTACAVLDVLGPPYSDPDGRHCTYY-------- 237 Query: 747 SADNGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEY 926 +++ S+ D S ++ Y WL+E E PE+ + + Y Sbjct: 238 -------------------QNFPFSNYSDGLSIPEEERTAYEWLQEKEKPENLKVVVNMY 278 Query: 927 RGPKI 941 GPKI Sbjct: 279 SGPKI 283 >gb|ESW11590.1| hypothetical protein PHAVU_008G043100g [Phaseolus vulgaris] Length = 279 Score = 201 bits (511), Expect = 5e-49 Identities = 118/305 (38%), Positives = 162/305 (53%), Gaps = 8/305 (2%) Frame = +3 Query: 51 RRNKENGVKASLSVVQQLYLLCVETFSVPCRDYTPDAIH--KLHSFLDRIRPADLGIKEP 224 R+N+ ++ +S+ Q+L+ C + F+ P H L S LD I D+G++ Sbjct: 27 RKNRRQRLR-KMSIGQRLFQTCNQVFASTSPGIVPSPQHIEMLLSVLDGISHEDVGLRPD 85 Query: 225 LQLQESEKTKPLGVRAKVGMSTRNGIRAKRTNKHGGFAPPITYVHVYECDRFSIGIFCLP 404 + TNK P ITY+H+YEC++FS+GIFCLP Sbjct: 86 MP-------------------------CFNTNKR---TPKITYLHIYECEQFSMGIFCLP 117 Query: 405 PSGVLPLHNHPGMTVLSKLLYGSLHLRSYDIVTDYPADI------PKSSCPANSELAKTK 566 PSGV+PLHNHPGMTV SKLL+G++H++SYD VTD P + ++S ++ LAK K Sbjct: 118 PSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVTDLPPHMSTMVKPSETSQTSDMRLAKVK 177 Query: 567 VDRIYTAPCEPTILCPKRGGTIHSLTAVTACAILDVLAPPYSDKDGRHCTYYRLHPCHRI 746 VD + APC+P++L P GG +H TAVTACA+LDVL PPYSD DGR CTYY+ P Sbjct: 178 VDAEFDAPCDPSVLYPNDGGNMHWFTAVTACAVLDVLGPPYSDPDGRDCTYYQNFPFSNY 237 Query: 747 SADNGPAEPNDRQECSRSPESYDRSSEQDDRSPESDQCQMYVWLEEGEVPEDFVIRGDEY 926 S D G + P + + + Y WL+E E PE+ + Y Sbjct: 238 SVD-GISIPEEER-------------------------KTYEWLQEKEKPENLKVVVKMY 271 Query: 927 RGPKI 941 GPKI Sbjct: 272 SGPKI 276