BLASTX nr result

ID: Cinnamomum23_contig00019596

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00019596
         (963 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010929085.1| PREDICTED: 2-aminoethanethiol dioxygenase is...   385   e-104
ref|XP_010929084.1| PREDICTED: 2-aminoethanethiol dioxygenase is...   385   e-104
ref|XP_009395267.1| PREDICTED: probable 2-aminoethanethiol dioxy...   370   e-100
ref|XP_009395266.1| PREDICTED: probable 2-aminoethanethiol dioxy...   370   e-100
ref|XP_009395265.1| PREDICTED: probable 2-aminoethanethiol dioxy...   370   e-100
ref|XP_002274302.2| PREDICTED: 2-aminoethanethiol dioxygenase [V...   370   1e-99
ref|XP_010261985.1| PREDICTED: probable 2-aminoethanethiol dioxy...   344   5e-92
ref|XP_007037732.1| Uncharacterized protein isoform 1 [Theobroma...   343   1e-91
ref|XP_006485740.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   342   2e-91
ref|XP_011623281.1| PREDICTED: 2-aminoethanethiol dioxygenase [A...   341   4e-91
ref|XP_007037733.1| Uncharacterized protein isoform 2 [Theobroma...   341   4e-91
gb|KHG13152.1| 2-aminoethanethiol dioxygenase [Gossypium arboreum]    338   3e-90
ref|XP_012488993.1| PREDICTED: plant cysteine oxidase 3 isoform ...   338   3e-90
gb|KJB39982.1| hypothetical protein B456_007G040900 [Gossypium r...   338   3e-90
ref|XP_012092920.1| PREDICTED: 2-aminoethanethiol dioxygenase is...   338   4e-90
gb|KDP20052.1| hypothetical protein JCGZ_05821 [Jatropha curcas]      338   4e-90
ref|XP_007210646.1| hypothetical protein PRUPE_ppa019247mg [Prun...   337   6e-90
ref|XP_008244076.1| PREDICTED: 2-aminoethanethiol dioxygenase [P...   337   1e-89
ref|XP_007037734.1| Uncharacterized protein isoform 3 [Theobroma...   337   1e-89
ref|XP_006440863.1| hypothetical protein CICLE_v10021506mg [Citr...   336   1e-89

>ref|XP_010929085.1| PREDICTED: 2-aminoethanethiol dioxygenase isoform X2 [Elaeis
           guineensis]
          Length = 256

 Score =  385 bits (988), Expect = e-104
 Identities = 185/254 (72%), Positives = 209/254 (82%)
 Frame = -2

Query: 899 MPKASPVQALYDLCKKTFTPTGNAAPSQAFAKLSSLLDTIGPSDVGLKEDNSEDDHGLGF 720
           M K S VQ LY+LCK+TF+P+G +  SQA  KL++LLDTI P++VGLK+DN EDD G GF
Sbjct: 1   MAKGSSVQVLYELCKRTFSPSGASPSSQAIRKLAALLDTISPAEVGLKDDNLEDDGGHGF 60

Query: 719 FGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVLSKVL 540
           FG N F+N  S R ARWAQPITYLHIYEC+SF+IGIFCLPTSSVIPLHDHP MTVLSK+L
Sbjct: 61  FGPNSFKN--SARTARWAQPITYLHIYECDSFSIGIFCLPTSSVIPLHDHPGMTVLSKML 118

Query: 539 YGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNLHCFT 360
           YGSMHVKAYDWIEPA   R  EP+  PVRLAKL  D VLTAPC TTVLYP+SGGNLHCFT
Sbjct: 119 YGSMHVKAYDWIEPARTMRSQEPDSFPVRLAKLHKDTVLTAPCPTTVLYPKSGGNLHCFT 178

Query: 359 AITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYAWLEMIEAPDD 180
           A+T CAVLDVLAPPYSE AGR CTYY D+PYSSF T N L  ++REEDYAWLE I+ PDD
Sbjct: 179 AVTSCAVLDVLAPPYSEEAGRVCTYYHDYPYSSFTTDNTLGENEREEDYAWLEAIDVPDD 238

Query: 179 LCIRPGKYDGPSVQ 138
           L +R G+Y GP+VQ
Sbjct: 239 LYMRSGRYAGPAVQ 252


>ref|XP_010929084.1| PREDICTED: 2-aminoethanethiol dioxygenase isoform X1 [Elaeis
           guineensis]
          Length = 264

 Score =  385 bits (988), Expect = e-104
 Identities = 185/254 (72%), Positives = 209/254 (82%)
 Frame = -2

Query: 899 MPKASPVQALYDLCKKTFTPTGNAAPSQAFAKLSSLLDTIGPSDVGLKEDNSEDDHGLGF 720
           M K S VQ LY+LCK+TF+P+G +  SQA  KL++LLDTI P++VGLK+DN EDD G GF
Sbjct: 1   MAKGSSVQVLYELCKRTFSPSGASPSSQAIRKLAALLDTISPAEVGLKDDNLEDDGGHGF 60

Query: 719 FGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVLSKVL 540
           FG N F+N  S R ARWAQPITYLHIYEC+SF+IGIFCLPTSSVIPLHDHP MTVLSK+L
Sbjct: 61  FGPNSFKN--SARTARWAQPITYLHIYECDSFSIGIFCLPTSSVIPLHDHPGMTVLSKML 118

Query: 539 YGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNLHCFT 360
           YGSMHVKAYDWIEPA   R  EP+  PVRLAKL  D VLTAPC TTVLYP+SGGNLHCFT
Sbjct: 119 YGSMHVKAYDWIEPARTMRSQEPDSFPVRLAKLHKDTVLTAPCPTTVLYPKSGGNLHCFT 178

Query: 359 AITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYAWLEMIEAPDD 180
           A+T CAVLDVLAPPYSE AGR CTYY D+PYSSF T N L  ++REEDYAWLE I+ PDD
Sbjct: 179 AVTSCAVLDVLAPPYSEEAGRVCTYYHDYPYSSFTTDNTLGENEREEDYAWLEAIDVPDD 238

Query: 179 LCIRPGKYDGPSVQ 138
           L +R G+Y GP+VQ
Sbjct: 239 LYMRSGRYAGPAVQ 252


>ref|XP_009395267.1| PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X3 [Musa
           acuminata subsp. malaccensis]
          Length = 259

 Score =  370 bits (950), Expect = e-100
 Identities = 175/257 (68%), Positives = 211/257 (82%), Gaps = 3/257 (1%)
 Frame = -2

Query: 899 MPKASPVQALYDLCKKTFTPTGNAA---PSQAFAKLSSLLDTIGPSDVGLKEDNSEDDHG 729
           M + S VQALY+LCKKTF+P+  A+   P+ A  K+++LLDTI P +VGLK D+ EDD G
Sbjct: 1   MARGSSVQALYELCKKTFSPSAAASSPPPTSAIRKIAALLDTISPVEVGLKADDLEDDRG 60

Query: 728 LGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVLS 549
            GFFG +I+++  STR+ARWAQPITYLHIYEC SF+IGIFCLPTSSVIPLHDHP MTVLS
Sbjct: 61  HGFFGSSIYKH--STRVARWAQPITYLHIYECNSFSIGIFCLPTSSVIPLHDHPGMTVLS 118

Query: 548 KVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNLH 369
           K+LYGSMHVK+YDWIEP C+ R  +P+  PVRLAKL  D VLTAPC TTVL+PRSGGNLH
Sbjct: 119 KILYGSMHVKSYDWIEPGCVTRSNKPDDFPVRLAKLHMDTVLTAPCPTTVLFPRSGGNLH 178

Query: 368 CFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYAWLEMIEA 189
           CFTA+T CAVLDVLAPPYSE AGR CTYY D+PYS+F   + + +++ E++YAWLE IEA
Sbjct: 179 CFTAVTSCAVLDVLAPPYSEEAGRCCTYYHDYPYSTFTPVSRILVNENEDEYAWLEAIEA 238

Query: 188 PDDLCIRPGKYDGPSVQ 138
           PDDL +R G+Y GP+VQ
Sbjct: 239 PDDLHMRSGRYTGPAVQ 255


>ref|XP_009395266.1| PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X2 [Musa
           acuminata subsp. malaccensis]
          Length = 267

 Score =  370 bits (950), Expect = e-100
 Identities = 175/257 (68%), Positives = 211/257 (82%), Gaps = 3/257 (1%)
 Frame = -2

Query: 899 MPKASPVQALYDLCKKTFTPTGNAA---PSQAFAKLSSLLDTIGPSDVGLKEDNSEDDHG 729
           M + S VQALY+LCKKTF+P+  A+   P+ A  K+++LLDTI P +VGLK D+ EDD G
Sbjct: 1   MARGSSVQALYELCKKTFSPSAAASSPPPTSAIRKIAALLDTISPVEVGLKADDLEDDRG 60

Query: 728 LGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVLS 549
            GFFG +I+++  STR+ARWAQPITYLHIYEC SF+IGIFCLPTSSVIPLHDHP MTVLS
Sbjct: 61  HGFFGSSIYKH--STRVARWAQPITYLHIYECNSFSIGIFCLPTSSVIPLHDHPGMTVLS 118

Query: 548 KVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNLH 369
           K+LYGSMHVK+YDWIEP C+ R  +P+  PVRLAKL  D VLTAPC TTVL+PRSGGNLH
Sbjct: 119 KILYGSMHVKSYDWIEPGCVTRSNKPDDFPVRLAKLHMDTVLTAPCPTTVLFPRSGGNLH 178

Query: 368 CFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYAWLEMIEA 189
           CFTA+T CAVLDVLAPPYSE AGR CTYY D+PYS+F   + + +++ E++YAWLE IEA
Sbjct: 179 CFTAVTSCAVLDVLAPPYSEEAGRCCTYYHDYPYSTFTPVSRILVNENEDEYAWLEAIEA 238

Query: 188 PDDLCIRPGKYDGPSVQ 138
           PDDL +R G+Y GP+VQ
Sbjct: 239 PDDLHMRSGRYTGPAVQ 255


>ref|XP_009395265.1| PREDICTED: probable 2-aminoethanethiol dioxygenase isoform X1 [Musa
           acuminata subsp. malaccensis]
          Length = 283

 Score =  370 bits (950), Expect = e-100
 Identities = 175/257 (68%), Positives = 211/257 (82%), Gaps = 3/257 (1%)
 Frame = -2

Query: 899 MPKASPVQALYDLCKKTFTPTGNAA---PSQAFAKLSSLLDTIGPSDVGLKEDNSEDDHG 729
           M + S VQALY+LCKKTF+P+  A+   P+ A  K+++LLDTI P +VGLK D+ EDD G
Sbjct: 1   MARGSSVQALYELCKKTFSPSAAASSPPPTSAIRKIAALLDTISPVEVGLKADDLEDDRG 60

Query: 728 LGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVLS 549
            GFFG +I+++  STR+ARWAQPITYLHIYEC SF+IGIFCLPTSSVIPLHDHP MTVLS
Sbjct: 61  HGFFGSSIYKH--STRVARWAQPITYLHIYECNSFSIGIFCLPTSSVIPLHDHPGMTVLS 118

Query: 548 KVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNLH 369
           K+LYGSMHVK+YDWIEP C+ R  +P+  PVRLAKL  D VLTAPC TTVL+PRSGGNLH
Sbjct: 119 KILYGSMHVKSYDWIEPGCVTRSNKPDDFPVRLAKLHMDTVLTAPCPTTVLFPRSGGNLH 178

Query: 368 CFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYAWLEMIEA 189
           CFTA+T CAVLDVLAPPYSE AGR CTYY D+PYS+F   + + +++ E++YAWLE IEA
Sbjct: 179 CFTAVTSCAVLDVLAPPYSEEAGRCCTYYHDYPYSTFTPVSRILVNENEDEYAWLEAIEA 238

Query: 188 PDDLCIRPGKYDGPSVQ 138
           PDDL +R G+Y GP+VQ
Sbjct: 239 PDDLHMRSGRYTGPAVQ 255


>ref|XP_002274302.2| PREDICTED: 2-aminoethanethiol dioxygenase [Vitis vinifera]
           gi|297734135|emb|CBI15382.3| unnamed protein product
           [Vitis vinifera]
          Length = 251

 Score =  370 bits (949), Expect = 1e-99
 Identities = 179/253 (70%), Positives = 203/253 (80%)
 Frame = -2

Query: 893 KASPVQALYDLCKKTFTPTGNAAPSQAFAKLSSLLDTIGPSDVGLKEDNSEDDHGLGFFG 714
           K++ +QALYDLCKKTF+P+G   PSQA  KLSSLLDTIGP+DVGL+EDN EDD G G FG
Sbjct: 4   KSTSIQALYDLCKKTFSPSGTPPPSQAIHKLSSLLDTIGPADVGLREDNPEDDRGHGIFG 63

Query: 713 ENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVLSKVLYG 534
            N F      R+ARWAQPITYL I+EC SFT+ IFC PTSSVIPLHDHP MTVLSKVLYG
Sbjct: 64  LNGFN-----RIARWAQPITYLDIFECNSFTMCIFCFPTSSVIPLHDHPGMTVLSKVLYG 118

Query: 533 SMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNLHCFTAI 354
           S+HVKAYDW+EPA IQ+   P Y  VRLAKL  D VLTAP  T++LYP+SGGNLH FTAI
Sbjct: 119 SLHVKAYDWVEPARIQKGKGPGYFTVRLAKLAVDKVLTAPVGTSILYPKSGGNLHYFTAI 178

Query: 353 TPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYAWLEMIEAPDDLC 174
           TPCAVLDVLAPPY E +GR CTYY D+PYSSF+TGN  E+S +EEDYAWL  IE PDDL 
Sbjct: 179 TPCAVLDVLAPPYQEASGRKCTYYHDYPYSSFSTGNEAEISGKEEDYAWLAEIETPDDLY 238

Query: 173 IRPGKYDGPSVQV 135
           +R G Y GP++QV
Sbjct: 239 MRQGVYAGPAIQV 251


>ref|XP_010261985.1| PREDICTED: probable 2-aminoethanethiol dioxygenase [Nelumbo
           nucifera]
          Length = 251

 Score =  344 bits (883), Expect = 5e-92
 Identities = 165/255 (64%), Positives = 195/255 (76%), Gaps = 1/255 (0%)
 Frame = -2

Query: 899 MPKASPVQALYDLCKKTFTPTGNAAPSQAFAKLSSLLDTIGPSDVGLKEDNSEDDHGLGF 720
           M K S VQA+YDLCKKTFT +G  A  Q+  KLSSLLD IGPSD GLKEDN+EDD G G 
Sbjct: 1   MTKNSSVQAIYDLCKKTFTSSGIPASPQSIQKLSSLLDKIGPSDAGLKEDNTEDDRGHGV 60

Query: 719 FGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVLSKVL 540
           FG N +      R  RW QPITYL IY+ +SFT+ IFCLPTSSVIPLHDHP MTVLSKVL
Sbjct: 61  FGLNHY-----DRAVRWTQPITYLDIYQSDSFTMCIFCLPTSSVIPLHDHPGMTVLSKVL 115

Query: 539 YGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNLHCFT 360
           YGSMHVKAYDW+EPA + +   P Y PVRLAKL  D V+TAPC T++LYP+SGGNLHCFT
Sbjct: 116 YGSMHVKAYDWVEPAYVLKSRGPGYPPVRLAKLTVDKVITAPCGTSILYPKSGGNLHCFT 175

Query: 359 AITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSK-REEDYAWLEMIEAPD 183
            +TPCAV D+L PPY E AGR CTYY D+P+SSF+  N   +S  +EE+YAWLE ++ P+
Sbjct: 176 GVTPCAVFDILTPPYQEAAGRKCTYYHDYPFSSFSMENGFRISDGKEEEYAWLEEMDTPN 235

Query: 182 DLCIRPGKYDGPSVQ 138
           DL +R G+Y GP++Q
Sbjct: 236 DLYMRQGEYKGPAIQ 250


>ref|XP_007037732.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508774977|gb|EOY22233.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 282

 Score =  343 bits (879), Expect = 1e-91
 Identities = 169/265 (63%), Positives = 196/265 (73%), Gaps = 2/265 (0%)
 Frame = -2

Query: 923 PHSPAHASMPKASP-VQALYDLCKKTFTPTG-NAAPSQAFAKLSSLLDTIGPSDVGLKED 750
           P    H +M   SP VQ L+DLCK TFTP+G  +A  Q   KL SLLDT GP+D+GLKE+
Sbjct: 27  PKQKLHMAMNTTSPKVQLLFDLCKTTFTPSGLPSASPQPIRKLCSLLDTFGPADIGLKEE 86

Query: 749 NSEDDHGLGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDH 570
           + +DD G GFFG N        R+ARWAQPIT+L IYEC+SFT+ +FC PTSSVIPLHDH
Sbjct: 87  SPDDDRGHGFFGLN--------RVARWAQPITFLDIYECDSFTMCVFCFPTSSVIPLHDH 138

Query: 569 PEMTVLSKVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYP 390
           P MTV SKVLYGSMHVKAYDW+EP CI+   EP Y  VRLA+L  D V TAPC T+VLYP
Sbjct: 139 PGMTVFSKVLYGSMHVKAYDWVEPVCIKESREPGYPQVRLARLAVDKVSTAPCGTSVLYP 198

Query: 389 RSGGNLHCFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYA 210
           ++GGNLHCFTA+TPCAVLDVLAPPY E  GR CTYY D+PYS+F  G  +   K EEDYA
Sbjct: 199 KTGGNLHCFTAVTPCAVLDVLAPPYREDIGRKCTYYIDYPYSTFGNGTEISNGK-EEDYA 257

Query: 209 WLEMIEAPDDLCIRPGKYDGPSVQV 135
           WL  IE PDDL +R G Y GP++QV
Sbjct: 258 WLAEIETPDDLYMREGVYVGPAIQV 282


>ref|XP_006485740.1| PREDICTED: 2-aminoethanethiol dioxygenase-like isoform X1 [Citrus
           sinensis]
          Length = 288

 Score =  342 bits (877), Expect = 2e-91
 Identities = 166/259 (64%), Positives = 194/259 (74%), Gaps = 1/259 (0%)
 Frame = -2

Query: 908 HASMPKASPVQALYDLCKKTFTPTGNAAPS-QAFAKLSSLLDTIGPSDVGLKEDNSEDDH 732
           H     +S VQ LYD CKKTFTP+G   PS QA   L SLLDT+GP+DVGL+E +S+DD 
Sbjct: 35  HMERKNSSKVQGLYDFCKKTFTPSGTPPPSSQAVRDLCSLLDTVGPADVGLEEQSSDDDR 94

Query: 731 GLGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVL 552
           GLGF G      Y   R+ARWAQPITYL IYEC+SFT+ IFC PTS+VIPLHDHP MTVL
Sbjct: 95  GLGFSGL-----YGLNRVARWAQPITYLDIYECDSFTMCIFCFPTSAVIPLHDHPGMTVL 149

Query: 551 SKVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNL 372
           SKVLYGSMHVKAYDW+EPA  Q    P Y PVRLAKL  D VLT    T+VLYP+SGGNL
Sbjct: 150 SKVLYGSMHVKAYDWVEPARFQETKGPGYRPVRLAKLATDKVLTPQYGTSVLYPKSGGNL 209

Query: 371 HCFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYAWLEMIE 192
           HCFTA+TPCAVLD+L PPY+E AGR CTYY D+P+ +F+  N  E+S  +E+YAWL  I+
Sbjct: 210 HCFTAVTPCAVLDILTPPYNEDAGRKCTYYVDYPFPTFSAVNGAEVSNEKEEYAWLSEID 269

Query: 191 APDDLCIRPGKYDGPSVQV 135
            PDDL +RPG Y GP++ V
Sbjct: 270 TPDDLYMRPGVYAGPAILV 288


>ref|XP_011623281.1| PREDICTED: 2-aminoethanethiol dioxygenase [Amborella trichopoda]
          Length = 248

 Score =  341 bits (875), Expect = 4e-91
 Identities = 172/251 (68%), Positives = 194/251 (77%), Gaps = 1/251 (0%)
 Frame = -2

Query: 887 SPVQALYDLCKKTFTP-TGNAAPSQAFAKLSSLLDTIGPSDVGLKEDNSEDDHGLGFFGE 711
           S VQALYDLCKKTFT  T    PS +  KLS+L+D   P DVGL+ED+SED  G G FG+
Sbjct: 4   SSVQALYDLCKKTFTSSTPTPPPSHSIRKLSALMDAFEPEDVGLREDSSED-RGHGCFGQ 62

Query: 710 NIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVLSKVLYGS 531
           N   N  STRLARWAQPITYLHI+EC +FTIGIFCLPTSS IPLHDHP MTV SKVLYGS
Sbjct: 63  NRLMNNLSTRLARWAQPITYLHIHECNNFTIGIFCLPTSSGIPLHDHPGMTVWSKVLYGS 122

Query: 530 MHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNLHCFTAIT 351
           MHVK+YDW+EPACI+     +   +RLAKL+ DNVLTAPC T+VL+PRSGGN+HCFTAIT
Sbjct: 123 MHVKSYDWVEPACIRTEANSQ---LRLAKLRVDNVLTAPCETSVLFPRSGGNIHCFTAIT 179

Query: 350 PCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYAWLEMIEAPDDLCI 171
            CAVLDVLAPPYSE AGR+CTYY D+P SS   G  L     EE +AWLE +EAPDD  I
Sbjct: 180 SCAVLDVLAPPYSEAAGRNCTYYNDYPLSSLENGFDLH---DEESFAWLEEVEAPDDFYI 236

Query: 170 RPGKYDGPSVQ 138
           R GKY GP+VQ
Sbjct: 237 RSGKYKGPAVQ 247


>ref|XP_007037733.1| Uncharacterized protein isoform 2 [Theobroma cacao]
           gi|508774978|gb|EOY22234.1| Uncharacterized protein
           isoform 2 [Theobroma cacao]
          Length = 294

 Score =  341 bits (875), Expect = 4e-91
 Identities = 168/264 (63%), Positives = 195/264 (73%), Gaps = 2/264 (0%)
 Frame = -2

Query: 923 PHSPAHASMPKASP-VQALYDLCKKTFTPTG-NAAPSQAFAKLSSLLDTIGPSDVGLKED 750
           P    H +M   SP VQ L+DLCK TFTP+G  +A  Q   KL SLLDT GP+D+GLKE+
Sbjct: 27  PKQKLHMAMNTTSPKVQLLFDLCKTTFTPSGLPSASPQPIRKLCSLLDTFGPADIGLKEE 86

Query: 749 NSEDDHGLGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDH 570
           + +DD G GFFG N        R+ARWAQPIT+L IYEC+SFT+ +FC PTSSVIPLHDH
Sbjct: 87  SPDDDRGHGFFGLN--------RVARWAQPITFLDIYECDSFTMCVFCFPTSSVIPLHDH 138

Query: 569 PEMTVLSKVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYP 390
           P MTV SKVLYGSMHVKAYDW+EP CI+   EP Y  VRLA+L  D V TAPC T+VLYP
Sbjct: 139 PGMTVFSKVLYGSMHVKAYDWVEPVCIKESREPGYPQVRLARLAVDKVSTAPCGTSVLYP 198

Query: 389 RSGGNLHCFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYA 210
           ++GGNLHCFTA+TPCAVLDVLAPPY E  GR CTYY D+PYS+F  G  +   K EEDYA
Sbjct: 199 KTGGNLHCFTAVTPCAVLDVLAPPYREDIGRKCTYYIDYPYSTFGNGTEISNGK-EEDYA 257

Query: 209 WLEMIEAPDDLCIRPGKYDGPSVQ 138
           WL  IE PDDL +R G Y GP++Q
Sbjct: 258 WLAEIETPDDLYMREGVYVGPAIQ 281


>gb|KHG13152.1| 2-aminoethanethiol dioxygenase [Gossypium arboreum]
          Length = 284

 Score =  338 bits (868), Expect = 3e-90
 Identities = 169/267 (63%), Positives = 200/267 (74%), Gaps = 4/267 (1%)
 Frame = -2

Query: 923 PHSPAHASMPK--ASPVQALYDLCKKTFTPTG-NAAPS-QAFAKLSSLLDTIGPSDVGLK 756
           P    H +M    A  VQ LYDLCK TFTP+G +++PS Q   K+ SLLDT GP+DVGLK
Sbjct: 27  PKQKLHMAMNNTTAPKVQLLYDLCKTTFTPSGLSSSPSPQPIHKICSLLDTFGPADVGLK 86

Query: 755 EDNSEDDHGLGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLH 576
           E++ +DD G GFFG N        R+ RWAQPITYL I+EC+SFT+ +FC PTSSVIPLH
Sbjct: 87  EESPDDDRGHGFFGLN--------RVTRWAQPITYLDIHECDSFTMCVFCFPTSSVIPLH 138

Query: 575 DHPEMTVLSKVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVL 396
           DHP MTVLSKVLYGSMHVKAYDW+EP+CI+   EP    VRLA+L AD VLTAPC T++L
Sbjct: 139 DHPGMTVLSKVLYGSMHVKAYDWVEPSCIKESQEPGCPQVRLARLAADKVLTAPCGTSIL 198

Query: 395 YPRSGGNLHCFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREED 216
           YP++GGNLHCFTA+TPCAVLDVLAPPY E  GR CTYY D+PYS+F  G  +   K EE+
Sbjct: 199 YPKTGGNLHCFTAVTPCAVLDVLAPPYREDLGRKCTYYVDYPYSAFGNGAQISNGK-EEE 257

Query: 215 YAWLEMIEAPDDLCIRPGKYDGPSVQV 135
           YAWL  IE PDDL +R G Y GPS++V
Sbjct: 258 YAWLAEIETPDDLYMRSGVYVGPSIRV 284


>ref|XP_012488993.1| PREDICTED: plant cysteine oxidase 3 isoform X2 [Gossypium
           raimondii]
          Length = 290

 Score =  338 bits (867), Expect = 3e-90
 Identities = 170/267 (63%), Positives = 199/267 (74%), Gaps = 4/267 (1%)
 Frame = -2

Query: 923 PHSPAHASMPK--ASPVQALYDLCKKTFTPTG-NAAPS-QAFAKLSSLLDTIGPSDVGLK 756
           P    H +M    A  VQ LYDLCK TFTP+G +++PS Q   KL SLLDT GP+DVGLK
Sbjct: 33  PKQQLHMAMNNTTAPKVQLLYDLCKTTFTPSGLSSSPSPQPIHKLCSLLDTFGPADVGLK 92

Query: 755 EDNSEDDHGLGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLH 576
           E++ +DD G GFFG N        R+ RWAQPITYL I+EC+SFT+ IFC PTSSVIPLH
Sbjct: 93  EESPDDDRGHGFFGLN--------RVTRWAQPITYLDIHECDSFTMCIFCFPTSSVIPLH 144

Query: 575 DHPEMTVLSKVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVL 396
           DHP MTV SKVLYGSMHVKAYDW+EP+CI+   EP    VRLA+L AD VLTAPC T++L
Sbjct: 145 DHPGMTVFSKVLYGSMHVKAYDWVEPSCIKESQEPGCPQVRLARLAADKVLTAPCGTSIL 204

Query: 395 YPRSGGNLHCFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREED 216
           YP++GGNLHCFTA+TPCAVLDVLAPPY E  GR CTYY D+PYS+F  G  +   K EE+
Sbjct: 205 YPKTGGNLHCFTAVTPCAVLDVLAPPYREDLGRKCTYYVDYPYSAFGNGAQISNGK-EEE 263

Query: 215 YAWLEMIEAPDDLCIRPGKYDGPSVQV 135
           YAWL  IE PDDL +R G Y GPS++V
Sbjct: 264 YAWLAEIETPDDLYMRSGVYVGPSIRV 290


>gb|KJB39982.1| hypothetical protein B456_007G040900 [Gossypium raimondii]
          Length = 321

 Score =  338 bits (867), Expect = 3e-90
 Identities = 170/267 (63%), Positives = 199/267 (74%), Gaps = 4/267 (1%)
 Frame = -2

Query: 923 PHSPAHASMPK--ASPVQALYDLCKKTFTPTG-NAAPS-QAFAKLSSLLDTIGPSDVGLK 756
           P    H +M    A  VQ LYDLCK TFTP+G +++PS Q   KL SLLDT GP+DVGLK
Sbjct: 64  PKQQLHMAMNNTTAPKVQLLYDLCKTTFTPSGLSSSPSPQPIHKLCSLLDTFGPADVGLK 123

Query: 755 EDNSEDDHGLGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLH 576
           E++ +DD G GFFG N        R+ RWAQPITYL I+EC+SFT+ IFC PTSSVIPLH
Sbjct: 124 EESPDDDRGHGFFGLN--------RVTRWAQPITYLDIHECDSFTMCIFCFPTSSVIPLH 175

Query: 575 DHPEMTVLSKVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVL 396
           DHP MTV SKVLYGSMHVKAYDW+EP+CI+   EP    VRLA+L AD VLTAPC T++L
Sbjct: 176 DHPGMTVFSKVLYGSMHVKAYDWVEPSCIKESQEPGCPQVRLARLAADKVLTAPCGTSIL 235

Query: 395 YPRSGGNLHCFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREED 216
           YP++GGNLHCFTA+TPCAVLDVLAPPY E  GR CTYY D+PYS+F  G  +   K EE+
Sbjct: 236 YPKTGGNLHCFTAVTPCAVLDVLAPPYREDLGRKCTYYVDYPYSAFGNGAQISNGK-EEE 294

Query: 215 YAWLEMIEAPDDLCIRPGKYDGPSVQV 135
           YAWL  IE PDDL +R G Y GPS++V
Sbjct: 295 YAWLAEIETPDDLYMRSGVYVGPSIRV 321


>ref|XP_012092920.1| PREDICTED: 2-aminoethanethiol dioxygenase isoform X1 [Jatropha
           curcas]
          Length = 291

 Score =  338 bits (866), Expect = 4e-90
 Identities = 165/268 (61%), Positives = 198/268 (73%), Gaps = 4/268 (1%)
 Frame = -2

Query: 932 PLHPHSPAHASMPKASP---VQALYDLCKKTFTPTGNAAPSQAFAKLSSLLDTIGPSDVG 762
           P   +  +H +M + SP   VQALYDLCK TFTP+   + S A  KL SLLDT+ P+DVG
Sbjct: 27  PFAVYFRSHVNMVEQSPSTKVQALYDLCKNTFTPSEIPSSSPAINKLCSLLDTVRPADVG 86

Query: 761 LKEDNSEDDHGLGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIP 582
           LKE+N +DD G G FG N     + +R ARWAQPITY+ IYEC+SFT+ IFC PTSSVIP
Sbjct: 87  LKEENPDDDRGHGIFGLN-----RLSRAARWAQPITYIDIYECDSFTMCIFCFPTSSVIP 141

Query: 581 LHDHPEMTVLSKVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATT 402
           LHDHP MTV SK+LYGS+HVKAYDW+EP CI    E    PV+LAKL  D VLTAPC T+
Sbjct: 142 LHDHPGMTVFSKILYGSLHVKAYDWVEPTCILEGKESGNPPVKLAKLAVDKVLTAPCGTS 201

Query: 401 VLYPRSGGNLHCFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSK-R 225
           +LYP+SGGNLHCFTA+TPCAVLD+L P Y E  GR C+YY D+PYS F++GN  E+   +
Sbjct: 202 ILYPKSGGNLHCFTAVTPCAVLDILTPSYREDVGRKCSYYHDYPYSPFSSGNGSELGDGK 261

Query: 224 EEDYAWLEMIEAPDDLCIRPGKYDGPSV 141
           EEDYAWL  IE PD+L +RPG Y GP+V
Sbjct: 262 EEDYAWLAEIETPDNLYMRPGIYTGPAV 289


>gb|KDP20052.1| hypothetical protein JCGZ_05821 [Jatropha curcas]
          Length = 288

 Score =  338 bits (866), Expect = 4e-90
 Identities = 165/268 (61%), Positives = 198/268 (73%), Gaps = 4/268 (1%)
 Frame = -2

Query: 932 PLHPHSPAHASMPKASP---VQALYDLCKKTFTPTGNAAPSQAFAKLSSLLDTIGPSDVG 762
           P   +  +H +M + SP   VQALYDLCK TFTP+   + S A  KL SLLDT+ P+DVG
Sbjct: 24  PFAVYFRSHVNMVEQSPSTKVQALYDLCKNTFTPSEIPSSSPAINKLCSLLDTVRPADVG 83

Query: 761 LKEDNSEDDHGLGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIP 582
           LKE+N +DD G G FG N     + +R ARWAQPITY+ IYEC+SFT+ IFC PTSSVIP
Sbjct: 84  LKEENPDDDRGHGIFGLN-----RLSRAARWAQPITYIDIYECDSFTMCIFCFPTSSVIP 138

Query: 581 LHDHPEMTVLSKVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATT 402
           LHDHP MTV SK+LYGS+HVKAYDW+EP CI    E    PV+LAKL  D VLTAPC T+
Sbjct: 139 LHDHPGMTVFSKILYGSLHVKAYDWVEPTCILEGKESGNPPVKLAKLAVDKVLTAPCGTS 198

Query: 401 VLYPRSGGNLHCFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSK-R 225
           +LYP+SGGNLHCFTA+TPCAVLD+L P Y E  GR C+YY D+PYS F++GN  E+   +
Sbjct: 199 ILYPKSGGNLHCFTAVTPCAVLDILTPSYREDVGRKCSYYHDYPYSPFSSGNGSELGDGK 258

Query: 224 EEDYAWLEMIEAPDDLCIRPGKYDGPSV 141
           EEDYAWL  IE PD+L +RPG Y GP+V
Sbjct: 259 EEDYAWLAEIETPDNLYMRPGIYTGPAV 286


>ref|XP_007210646.1| hypothetical protein PRUPE_ppa019247mg [Prunus persica]
           gi|462406381|gb|EMJ11845.1| hypothetical protein
           PRUPE_ppa019247mg [Prunus persica]
          Length = 250

 Score =  337 bits (865), Expect = 6e-90
 Identities = 166/255 (65%), Positives = 198/255 (77%), Gaps = 2/255 (0%)
 Frame = -2

Query: 893 KASPVQALYDLCKKTFTPTGNAAPSQA-FAKLSSLLDTIGPSDVGLKEDNSEDDHGLGFF 717
           K+S VQALY+LC+  FTP+G+  PS A   KL S+LDT+ P+DVGLKE+N +DD G GFF
Sbjct: 4   KSSKVQALYELCQNMFTPSGSPPPSSAAINKLCSVLDTMSPADVGLKEENLDDDRGHGFF 63

Query: 716 GENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVLSKVLY 537
           G       Q  R+ARW QPITYL IYEC+SFT+ IFC PTSSVIPLHDHP MTV SKVLY
Sbjct: 64  GLE-----QLNRVARWTQPITYLDIYECDSFTMCIFCFPTSSVIPLHDHPGMTVFSKVLY 118

Query: 536 GSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNLHCFTA 357
           GS+HV+AYDW+EPA  Q    P Y PVRLAKL  D VLTAPC T+VLYPR+GGNLH FTA
Sbjct: 119 GSLHVRAYDWVEPA--QESKGPNYFPVRLAKLAVDKVLTAPCGTSVLYPRNGGNLHYFTA 176

Query: 356 ITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSK-REEDYAWLEMIEAPDD 180
           +TPCAVLD+L PPY E AGR CTYYRD+PY++FATGN +++   +EE+YAWL   E PD+
Sbjct: 177 VTPCAVLDILTPPYREDAGRKCTYYRDYPYTAFATGNGIKIEDGKEEEYAWLAETE-PDN 235

Query: 179 LCIRPGKYDGPSVQV 135
           L +RPG Y GP++QV
Sbjct: 236 LYMRPGNYTGPTIQV 250


>ref|XP_008244076.1| PREDICTED: 2-aminoethanethiol dioxygenase [Prunus mume]
          Length = 279

 Score =  337 bits (863), Expect = 1e-89
 Identities = 164/254 (64%), Positives = 196/254 (77%), Gaps = 1/254 (0%)
 Frame = -2

Query: 893 KASPVQALYDLCKKTFTPTGNAAPSQAFAKLSSLLDTIGPSDVGLKEDNSEDDHGLGFFG 714
           K+S VQALY+LC+  FTP+G+   S A  KL S+LDT+ P+DVGLKE+N +DD G GFFG
Sbjct: 34  KSSKVQALYELCQNMFTPSGSPPSSAAINKLCSVLDTMSPTDVGLKEENLDDDRGHGFFG 93

Query: 713 ENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVLSKVLYG 534
                  Q  R+ARWAQPITY  IYEC+SFT+ IFC PTSSVIPLHDHP MTV SKVLYG
Sbjct: 94  LE-----QLNRVARWAQPITYFDIYECDSFTMCIFCFPTSSVIPLHDHPGMTVFSKVLYG 148

Query: 533 SMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNLHCFTAI 354
           S+HV+AYDW+EPA  Q    P Y PVRLAKL  D VLTAPC T+VLYPR+GGNLH FTA+
Sbjct: 149 SLHVRAYDWVEPA--QESKGPNYFPVRLAKLAVDKVLTAPCGTSVLYPRNGGNLHYFTAV 206

Query: 353 TPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSK-REEDYAWLEMIEAPDDL 177
           TPCAVLD+L PPY E AGR CTYYRD+PY++FATGN +++   +EE+YAWL     PD+L
Sbjct: 207 TPCAVLDILTPPYREDAGRKCTYYRDYPYTAFATGNGIKIEDGKEEEYAWLAE-TVPDNL 265

Query: 176 CIRPGKYDGPSVQV 135
            +RPG Y GP++QV
Sbjct: 266 YMRPGNYTGPTIQV 279


>ref|XP_007037734.1| Uncharacterized protein isoform 3 [Theobroma cacao]
           gi|508774979|gb|EOY22235.1| Uncharacterized protein
           isoform 3 [Theobroma cacao]
          Length = 278

 Score =  337 bits (863), Expect = 1e-89
 Identities = 168/265 (63%), Positives = 195/265 (73%), Gaps = 2/265 (0%)
 Frame = -2

Query: 923 PHSPAHASMPKASP-VQALYDLCKKTFTPTG-NAAPSQAFAKLSSLLDTIGPSDVGLKED 750
           P    H +M   SP VQ L+DLCK TFTP+G  +A  Q   KL SLLDT GP+D+GLKE+
Sbjct: 27  PKQKLHMAMNTTSPKVQLLFDLCKTTFTPSGLPSASPQPIRKLCSLLDTFGPADIGLKEE 86

Query: 749 NSEDDHGLGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDH 570
           + +DD G GFFG N        R+ARWAQPIT+L IYEC+SFT+ +FC PTSSVIPLHDH
Sbjct: 87  SPDDDRGHGFFGLN--------RVARWAQPITFLDIYECDSFTMCVFCFPTSSVIPLHDH 138

Query: 569 PEMTVLSKVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYP 390
           P MTV SKVLYGSMHVKAYDW+EP CI+   E    PVRLA+L  D V TAPC T+VLYP
Sbjct: 139 PGMTVFSKVLYGSMHVKAYDWVEPVCIKESRE----PVRLARLAVDKVSTAPCGTSVLYP 194

Query: 389 RSGGNLHCFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYA 210
           ++GGNLHCFTA+TPCAVLDVLAPPY E  GR CTYY D+PYS+F  G  +   K EEDYA
Sbjct: 195 KTGGNLHCFTAVTPCAVLDVLAPPYREDIGRKCTYYIDYPYSTFGNGTEISNGK-EEDYA 253

Query: 209 WLEMIEAPDDLCIRPGKYDGPSVQV 135
           WL  IE PDDL +R G Y GP++QV
Sbjct: 254 WLAEIETPDDLYMREGVYVGPAIQV 278


>ref|XP_006440863.1| hypothetical protein CICLE_v10021506mg [Citrus clementina]
           gi|557543125|gb|ESR54103.1| hypothetical protein
           CICLE_v10021506mg [Citrus clementina]
          Length = 286

 Score =  336 bits (862), Expect = 1e-89
 Identities = 163/259 (62%), Positives = 193/259 (74%), Gaps = 1/259 (0%)
 Frame = -2

Query: 908 HASMPKASPVQALYDLCKKTFTPTGNAAPS-QAFAKLSSLLDTIGPSDVGLKEDNSEDDH 732
           H     +S VQ LYD CKKTFTP+G   PS QA   L SLLDT+GP+DVGL+E +S+DD 
Sbjct: 35  HMERKNSSKVQGLYDFCKKTFTPSGTPPPSSQAVRDLCSLLDTVGPADVGLEEQSSDDDR 94

Query: 731 GLGFFGENIFRNYQSTRLARWAQPITYLHIYECESFTIGIFCLPTSSVIPLHDHPEMTVL 552
           GLGF G      Y   R+ARWAQPITYL IYEC+SFT+ IFC PTS+VIPLHDHP MTV 
Sbjct: 95  GLGFSGL-----YGLNRVARWAQPITYLDIYECDSFTMCIFCFPTSAVIPLHDHPGMTVF 149

Query: 551 SKVLYGSMHVKAYDWIEPACIQRIGEPEYLPVRLAKLKADNVLTAPCATTVLYPRSGGNL 372
           SKVLYGSMHVKAYDW+EPA  Q    P Y PVRLAKL  D +LT    T++LYP+SGGN+
Sbjct: 150 SKVLYGSMHVKAYDWVEPARFQETKGPGYRPVRLAKLATDKILTPQYGTSILYPKSGGNM 209

Query: 371 HCFTAITPCAVLDVLAPPYSEVAGRHCTYYRDHPYSSFATGNMLEMSKREEDYAWLEMIE 192
           HCFTA+TPCAVLD+L PPY+E AGR CTYY D+P+S+F+  N  +  K  E+YAWL  I+
Sbjct: 210 HCFTAVTPCAVLDILTPPYNEDAGRKCTYYIDYPFSTFSAVNGADNEK--EEYAWLSEID 267

Query: 191 APDDLCIRPGKYDGPSVQV 135
            PDDL +RPG Y GP++QV
Sbjct: 268 TPDDLYMRPGVYAGPAIQV 286

