BLASTX nr result

ID: Catharanthus22_contig00016561 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00016561
         (1511 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003624687.1| 2-aminoethanethiol dioxygenase [Medicago tru...   328   5e-87
gb|AFK44841.1| unknown [Medicago truncatula]                          326   1e-86
gb|EMJ05897.1| hypothetical protein PRUPE_ppa009537mg [Prunus pe...   325   2e-86
ref|XP_004149110.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   322   3e-85
ref|XP_006600234.1| PREDICTED: uncharacterized protein LOC100819...   321   5e-85
ref|NP_001241359.1| uncharacterized protein LOC100819405 [Glycin...   319   2e-84
ref|XP_004493204.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   318   3e-84
ref|XP_002275517.1| PREDICTED: 2-aminoethanethiol dioxygenase [V...   318   4e-84
ref|XP_003554039.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   316   1e-83
ref|XP_003554041.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   315   3e-83
ref|XP_004489672.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   313   2e-82
ref|XP_002319186.2| hypothetical protein POPTR_0013s06040g [Popu...   310   8e-82
gb|EOY10511.1| Uncharacterized protein isoform 1 [Theobroma cacao]    309   2e-81
ref|XP_003548690.1| PREDICTED: 2-aminoethanethiol dioxygenase [G...   307   7e-81
gb|EXB36268.1| 2-aminoethanethiol dioxygenase [Morus notabilis]       301   4e-79
ref|XP_006443485.1| hypothetical protein CICLE_v10021575mg [Citr...   300   8e-79
ref|XP_002325431.2| hypothetical protein POPTR_0019s05510g [Popu...   300   1e-78
gb|ESW33986.1| hypothetical protein PHAVU_001G114800g [Phaseolus...   294   6e-77
gb|ESW33987.1| hypothetical protein PHAVU_001G114800g [Phaseolus...   294   8e-77
ref|XP_004303011.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   294   8e-77

>ref|XP_003624687.1| 2-aminoethanethiol dioxygenase [Medicago truncatula]
            gi|87162727|gb|ABD28522.1| Cupin, RmlC-type [Medicago
            truncatula] gi|355499702|gb|AES80905.1|
            2-aminoethanethiol dioxygenase [Medicago truncatula]
          Length = 283

 Score =  328 bits (840), Expect = 5e-87
 Identities = 163/237 (68%), Positives = 182/237 (76%), Gaps = 2/237 (0%)
 Frame = +3

Query: 456  ALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNAR 635
            ALQ LF SC++ FKG GTVPSP DV KLCHILD M PEDVGLS DLQFFKP  +I+ N R
Sbjct: 46   ALQELFDSCKQTFKGPGTVPSPRDVHKLCHILDNMKPEDVGLSRDLQFFKPGNIIKENQR 105

Query: 636  VTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQ 815
            VTY T+YKCDNFSLCI FLP   VIPLHNHPGMTVFSKLLLG MHIKSYDWVD  +++  
Sbjct: 106  VTYTTVYKCDNFSLCIFFLPERGVIPLHNHPGMTVFSKLLLGQMHIKSYDWVDH-EASHN 164

Query: 816  TLTP--KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDG 989
             L P  K+RLA++KAN  F APC+TSVLYPT+GGNIHEF AITPCAVLDV+GPPYSKEDG
Sbjct: 165  LLQPSSKLRLAKLKANKTFTAPCDTSVLYPTTGGNIHEFTAITPCAVLDVIGPPYSKEDG 224

Query: 990  RDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVAD 1160
            RDCSYYKD PY     EE  G VK+++D SYG L          MD I+YLGP + D
Sbjct: 225  RDCSYYKDYPYNAFPNEEKIGEVKDKDD-SYGLLEEIDMPENCQMDGIEYLGPPIDD 280


>gb|AFK44841.1| unknown [Medicago truncatula]
          Length = 283

 Score =  326 bits (836), Expect = 1e-86
 Identities = 163/237 (68%), Positives = 181/237 (76%), Gaps = 2/237 (0%)
 Frame = +3

Query: 456  ALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNAR 635
            ALQ LF SC++ FKG GTVPSP DV KLCHILD M PEDVGLS DLQFFKP  +I+ N R
Sbjct: 46   ALQELFDSCKQTFKGPGTVPSPRDVHKLCHILDNMKPEDVGLSRDLQFFKPGNIIKENQR 105

Query: 636  VTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQ 815
            VTY T+YKCDNFSLCI FLP   VIPLHNHPGMTVFSKLLLG MHIKSYDWVD  ++   
Sbjct: 106  VTYTTVYKCDNFSLCIFFLPERGVIPLHNHPGMTVFSKLLLGQMHIKSYDWVDH-EAFHN 164

Query: 816  TLTP--KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDG 989
             L P  K+RLA++KAN  F APC+TSVLYPT+GGNIHEF AITPCAVLDV+GPPYSKEDG
Sbjct: 165  LLQPSSKLRLAKLKANKTFTAPCDTSVLYPTTGGNIHEFTAITPCAVLDVIGPPYSKEDG 224

Query: 990  RDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVAD 1160
            RDCSYYKD PY     EE  G VK+++D SYG L          MD I+YLGP + D
Sbjct: 225  RDCSYYKDYPYNAFPNEEKIGEVKDKDD-SYGLLEEIDMPENCQMDGIEYLGPPIDD 280


>gb|EMJ05897.1| hypothetical protein PRUPE_ppa009537mg [Prunus persica]
          Length = 287

 Score =  325 bits (834), Expect = 2e-86
 Identities = 157/240 (65%), Positives = 180/240 (75%)
 Frame = +3

Query: 447  PILALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIER 626
            P   LQ+LF+SC++VFKG GTVPSP+DV  LC ILD M PEDVGLS DLQFFKP+ V++ 
Sbjct: 49   PPTVLQQLFVSCRQVFKGPGTVPSPHDVHNLCSILDKMRPEDVGLSRDLQFFKPKTVVQG 108

Query: 627  NARVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDS 806
              RVTY TIY+C NFSLC LF+PA+ VIPLHNHP MTVFSKLLLG MHIKSYDWVDPV+S
Sbjct: 109  TPRVTYTTIYECSNFSLCCLFIPATGVIPLHNHPEMTVFSKLLLGKMHIKSYDWVDPVNS 168

Query: 807  NDQTLTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKED 986
            +  T  P++RLA++KA+ +F +PCNTSVLYPT GGNIH F AITPCAVLDVLGPPYSKED
Sbjct: 169  DGSTPAPQLRLAKLKADSVFTSPCNTSVLYPTEGGNIHAFTAITPCAVLDVLGPPYSKED 228

Query: 987  GRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVADAA 1166
             RDCSYYKD PY      E S  V E     YGWL          MD I YLGP+V + +
Sbjct: 229  DRDCSYYKDHPYAAYSNGEAS--VTEGNGDCYGWLEEIEMPENSEMDKIPYLGPQVTETS 286


>ref|XP_004149110.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Cucumis sativus]
          Length = 278

 Score =  322 bits (825), Expect = 3e-85
 Identities = 154/236 (65%), Positives = 182/236 (77%)
 Frame = +3

Query: 453  LALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNA 632
            +ALQ LF+SC+EVFKG GTVP P DV+KLC ILD M  EDVGLS  LQFFKP   ++ + 
Sbjct: 43   MALQELFVSCREVFKGPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSP 102

Query: 633  RVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSND 812
            RVTY TIYKCDNFSLCI FLPA+ VIPLHNHPGMTVFSKLLLG MHIKSYDWVDP +S+D
Sbjct: 103  RVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDD 162

Query: 813  QTLTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGR 992
                 + RLA++KA+ +F +PC+TSVLYPTSGGNIH F AITPCAVLDVLGPPYS EDGR
Sbjct: 163  TAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGR 222

Query: 993  DCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVAD 1160
            DCSYYK+ PY +    ++ G+ +E++   YGWL          MD I+YLGP++ D
Sbjct: 223  DCSYYKEHPYASFPNGDM-GLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICD 277


>ref|XP_006600234.1| PREDICTED: uncharacterized protein LOC100819405 isoform X1 [Glycine
            max]
          Length = 299

 Score =  321 bits (823), Expect = 5e-85
 Identities = 161/239 (67%), Positives = 182/239 (76%), Gaps = 2/239 (0%)
 Frame = +3

Query: 456  ALQRLFLSCQEVFKGHG-TVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNA 632
            ALQ LF+SC+E FKG G TVPSP DVQKLCHILD+M PEDVGL  DLQFFKP  +++ N 
Sbjct: 62   ALQELFVSCRETFKGPGGTVPSPQDVQKLCHILDSMKPEDVGLRSDLQFFKPENIVKENQ 121

Query: 633  RVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSND 812
            RVT  TIY C+NFSLC+ FLPA  VIPLHNHP MTVFSKLLLG MHIKSYDWVD   S++
Sbjct: 122  RVTCTTIYSCENFSLCLFFLPAKGVIPLHNHPEMTVFSKLLLGQMHIKSYDWVDSEVSHN 181

Query: 813  QTLTP-KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDG 989
                P + RLAR+KAN++F APC+TSVLYP SGGNIHEF AITPCAVLDVLGPPYSK+DG
Sbjct: 182  LLHQPSQFRLARLKANNVFTAPCDTSVLYPQSGGNIHEFTAITPCAVLDVLGPPYSKDDG 241

Query: 990  RDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVADAA 1166
            RDCS+Y+D PY      E SG VKEE D SYGWL          MD I+YLGP + + A
Sbjct: 242  RDCSFYRDHPYTAFPNGE-SGKVKEEND-SYGWLEEIEMPENSQMDGIEYLGPPIIETA 298


>ref|NP_001241359.1| uncharacterized protein LOC100819405 [Glycine max]
            gi|255641533|gb|ACU21040.1| unknown [Glycine max]
          Length = 301

 Score =  319 bits (818), Expect = 2e-84
 Identities = 160/240 (66%), Positives = 182/240 (75%), Gaps = 3/240 (1%)
 Frame = +3

Query: 456  ALQRLFLSCQEVFKGHG-TVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNA 632
            ALQ LF+SC+E FKG G TVPSP DVQKLCHILD+M PEDVGL  DLQFFKP  +++ N 
Sbjct: 62   ALQELFVSCRETFKGPGGTVPSPQDVQKLCHILDSMKPEDVGLRSDLQFFKPENIVKENQ 121

Query: 633  RVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSND 812
            RVT  TIY C+NFSLC+ FLPA  VIPLHNHP MTVFSKLLLG MHIKSYDWVD   S++
Sbjct: 122  RVTCTTIYSCENFSLCLFFLPAKGVIPLHNHPEMTVFSKLLLGQMHIKSYDWVDSEVSHN 181

Query: 813  QTLTP-KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDG 989
                P + RLAR+KAN++F APC+TSVLYP SGGNIHEF AITPCAVLDVLGPPYSK+DG
Sbjct: 182  LLHQPSQFRLARLKANNVFTAPCDTSVLYPQSGGNIHEFTAITPCAVLDVLGPPYSKDDG 241

Query: 990  RDCSYYKDTPYKTL-LKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVADAA 1166
            RDCS+Y+D PY      +  SG VKEE D SYGWL          MD I+YLGP + + A
Sbjct: 242  RDCSFYRDHPYTAFPTADGESGKVKEEND-SYGWLEEIEMPENSQMDGIEYLGPPIIETA 300


>ref|XP_004493204.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Cicer arietinum]
          Length = 286

 Score =  318 bits (816), Expect = 3e-84
 Identities = 158/236 (66%), Positives = 177/236 (75%), Gaps = 1/236 (0%)
 Frame = +3

Query: 456  ALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNAR 635
            ALQ LF SC++ FKG  TVPSP DV KLCHILD M PEDVGLS DLQFFKP  +I+ N R
Sbjct: 49   ALQELFGSCKQTFKGINTVPSPQDVHKLCHILDNMKPEDVGLSRDLQFFKPGNIIKENQR 108

Query: 636  VTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQ 815
            VTY T+YKCDNFSLCI FLP   VIPLHNHPGMTVFSKLLLG MHIKSYDWVD   +++ 
Sbjct: 109  VTYTTVYKCDNFSLCIFFLPERGVIPLHNHPGMTVFSKLLLGQMHIKSYDWVDSEATHNL 168

Query: 816  TLTP-KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGR 992
               P K+RLA++KAND+F APC+TSVLYPT+GGNIHEF AITPCAVLDV+GPPYSKEDGR
Sbjct: 169  LQQPSKLRLAKLKANDVFTAPCDTSVLYPTTGGNIHEFTAITPCAVLDVIGPPYSKEDGR 228

Query: 993  DCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVAD 1160
            DCSYYKD P      EE    V ++ D SY  L          MD I+YLGP + D
Sbjct: 229  DCSYYKDHPCDAFPNEEEIAKVNDKND-SYALLEEIEMPENCQMDGIEYLGPPIND 283


>ref|XP_002275517.1| PREDICTED: 2-aminoethanethiol dioxygenase [Vitis vinifera]
            gi|296085895|emb|CBI31219.3| unnamed protein product
            [Vitis vinifera]
          Length = 275

 Score =  318 bits (815), Expect = 4e-84
 Identities = 153/239 (64%), Positives = 180/239 (75%), Gaps = 2/239 (0%)
 Frame = +3

Query: 456  ALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNAR 635
            +LQRLF++C++VFKG GTVP P DV KLCHILD M PEDVGLS D+ FFK ++  +   +
Sbjct: 36   SLQRLFVACRDVFKGLGTVPQPIDVTKLCHILDNMRPEDVGLSKDIPFFKAKRAAQGIPK 95

Query: 636  VTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQ 815
            VT AT+YKC+ FSLCI FLP   VIPLHNHPGMTVFSKLLLG+MHIKSYDWVDPV S+  
Sbjct: 96   VTCATVYKCEEFSLCIFFLPPRAVIPLHNHPGMTVFSKLLLGSMHIKSYDWVDPVGSDSS 155

Query: 816  TLTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGRD 995
            +   K+RLAR+KA+ +F APCNTSVLYPTSGGNIH F AITPCAVLDVLGPPYSK+DGRD
Sbjct: 156  SPPSKLRLARLKADSVFTAPCNTSVLYPTSGGNIHAFTAITPCAVLDVLGPPYSKKDGRD 215

Query: 996  CSYYKDTPYKTLLKEELSGVVKE--EEDSSYGWLXXXXXXXXXXMDVIKYLGPKVADAA 1166
            CSYYKD+PY      E     +E  EE+  YGWL          MD  +YLGP++ D +
Sbjct: 216  CSYYKDSPYTPFSNGEARTRKEEDGEEEERYGWLEEVEMPEDSKMDWTEYLGPQIIDTS 274


>ref|XP_003554039.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Glycine max]
          Length = 281

 Score =  316 bits (810), Expect = 1e-83
 Identities = 153/239 (64%), Positives = 178/239 (74%), Gaps = 2/239 (0%)
 Frame = +3

Query: 438  QISPILALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQV 617
            ++S    L +LF SC+E FKG GTVPSP DVQ+L HILD M PEDVGLS DLQFFKP  +
Sbjct: 41   ELSVSKTLHQLFDSCREAFKGPGTVPSPQDVQRLTHILDNMKPEDVGLSRDLQFFKPGNI 100

Query: 618  IERNARVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDP 797
            ++ N RVTY T+YKCDNFSLCI F+P   VIPLHNHP MTVFSKLLLG MHIKSYDWVDP
Sbjct: 101  VKENQRVTYTTVYKCDNFSLCIFFIPEGGVIPLHNHPDMTVFSKLLLGLMHIKSYDWVDP 160

Query: 798  VDSNDQTLTP--KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPP 971
              S+D  L P  ++RLA +K + +F + C+TSVLYPT+GGNIHEF AITPCAVLDV+GPP
Sbjct: 161  EASDDNMLQPQSQLRLAMLKVDKVFTSSCDTSVLYPTTGGNIHEFTAITPCAVLDVIGPP 220

Query: 972  YSKEDGRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGP 1148
            YSKEDGRDCSYY+D PY     E + G  KEE D SY WL          M+ ++YLGP
Sbjct: 221  YSKEDGRDCSYYRDHPYTCFPNERIIGEAKEEND-SYTWLEEIEMPENSEMNGVEYLGP 278


>ref|XP_003554041.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Glycine max]
          Length = 281

 Score =  315 bits (807), Expect = 3e-83
 Identities = 152/239 (63%), Positives = 178/239 (74%), Gaps = 2/239 (0%)
 Frame = +3

Query: 438  QISPILALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQV 617
            ++S    L +LF SC+E FKG GTVPSP DV++L HILD M PEDVGLS DLQFFKP  +
Sbjct: 41   ELSVSKTLHQLFDSCREAFKGPGTVPSPQDVKRLTHILDNMKPEDVGLSRDLQFFKPGNI 100

Query: 618  IERNARVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDP 797
            ++ N RVTY T+YKCDNFSLCI F+P   VIPLHNHP MTVFSKLLLG MHIKSYDWV+P
Sbjct: 101  VKENQRVTYTTVYKCDNFSLCIFFIPEGGVIPLHNHPDMTVFSKLLLGLMHIKSYDWVEP 160

Query: 798  VDSNDQTLTP--KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPP 971
              S+D  L P  ++RLAR+K + +F + C TSVLYPT+GGNIHEF AITPCAVLDV+GPP
Sbjct: 161  EASDDNMLQPQSQLRLARLKVDKVFTSSCGTSVLYPTTGGNIHEFTAITPCAVLDVIGPP 220

Query: 972  YSKEDGRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGP 1148
            YSKEDGRDCSYY+D PY     E + G  KEE D SY WL          M+ ++YLGP
Sbjct: 221  YSKEDGRDCSYYRDHPYTCFPNERIIGEAKEEND-SYTWLEEIEMPENSEMNGVEYLGP 278


>ref|XP_004489672.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Cicer arietinum]
          Length = 293

 Score =  313 bits (801), Expect = 2e-82
 Identities = 150/237 (63%), Positives = 180/237 (75%), Gaps = 1/237 (0%)
 Frame = +3

Query: 459  LQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARV 638
            LQ+LF+SC+E FKG  T+PS   V KLCHILD M PEDVGLS DLQFFK   +++ N RV
Sbjct: 58   LQKLFVSCKETFKGPDTIPSTQHVHKLCHILDNMKPEDVGLSKDLQFFKSEYIVKENPRV 117

Query: 639  TYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQT 818
            TY TIYKCDNFSLCI FLP+  VIPLHNHPGMTVFSKLLLG MHIKSYDWVDP  S++  
Sbjct: 118  TYTTIYKCDNFSLCIFFLPSKGVIPLHNHPGMTVFSKLLLGQMHIKSYDWVDPEVSHNLL 177

Query: 819  LTP-KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGRD 995
              P ++R+A++KAN +F +PC+TSVLYP +GGNIHEF AITPCAVLDV+GPPYSK+DGRD
Sbjct: 178  QQPSQLRMAKLKANKVFTSPCDTSVLYPKTGGNIHEFTAITPCAVLDVIGPPYSKDDGRD 237

Query: 996  CSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVADAA 1166
            CSYYKD  Y      E++ +  +EE+ SY WL          MD I+YLGP + ++A
Sbjct: 238  CSYYKDHLYTAFPNGEIAEL--KEENESYAWLEEIEMPENSQMDGIEYLGPPIIESA 292


>ref|XP_002319186.2| hypothetical protein POPTR_0013s06040g [Populus trichocarpa]
            gi|118485411|gb|ABK94562.1| unknown [Populus trichocarpa]
            gi|550325071|gb|EEE95109.2| hypothetical protein
            POPTR_0013s06040g [Populus trichocarpa]
          Length = 278

 Score =  310 bits (795), Expect = 8e-82
 Identities = 152/284 (53%), Positives = 192/284 (67%)
 Frame = +3

Query: 315  MSIEAVEVAGLVGPRKDFIGQVNEXXXXXXXXXXXXXXXXXQISPILALQRLFLSCQEVF 494
            M+IEA      V PR++    VN                  + +P +ALQ L++SC+EVF
Sbjct: 1    MTIEAT-----VEPRREPTAHVNRLGFAKRPTKRKRSKKTKKCAPTMALQDLYVSCKEVF 55

Query: 495  KGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARVTYATIYKCDNFS 674
            KG GTVP   DV++LCH+LD M  ED GLS  L+FF P+  +    RVTY  +Y+CD FS
Sbjct: 56   KGPGTVPLHQDVKRLCHMLDNMKLEDFGLSCKLEFFNPKAAVRGTPRVTYTIVYECDKFS 115

Query: 675  LCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQTLTPKMRLARMKA 854
            +C+ FLPA+ VIPLHNHPGMTVFSKLL+GTMH+KSYDWVDP  +++     ++RLA+++A
Sbjct: 116  MCVFFLPATAVIPLHNHPGMTVFSKLLMGTMHVKSYDWVDPPATDEPDSPAQVRLAKLEA 175

Query: 855  NDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGRDCSYYKDTPYKTLL 1034
            + +F APC+TSVLYPT+GGNIH+F AITPCAVLDVLGPPYS EDGRDCSYYKD PY    
Sbjct: 176  DSVFTAPCHTSVLYPTTGGNIHQFTAITPCAVLDVLGPPYSNEDGRDCSYYKDFPYTAFP 235

Query: 1035 KEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVADAA 1166
              E+    +EEE   Y WL          M VIKYLGP+V D++
Sbjct: 236  NGEMGS--EEEEGDCYAWLEEITVPENLQMFVIKYLGPQVDDSS 277


>gb|EOY10511.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 287

 Score =  309 bits (791), Expect = 2e-81
 Identities = 150/236 (63%), Positives = 173/236 (73%)
 Frame = +3

Query: 459  LQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARV 638
            L  LF++C+EVFKG G VP P+DV KLC ILD M PEDVGLS +LQFFK R  +    RV
Sbjct: 52   LPELFVACREVFKGPGNVPPPSDVDKLCSILDRMKPEDVGLSKNLQFFKARGAVTGTPRV 111

Query: 639  TYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQT 818
            TY TIY+CD FSLCI FLP   VIPLHNHPGMTVFSKLLLG MHIKSYDWVDPV S D  
Sbjct: 112  TYTTIYQCDEFSLCIFFLPEKAVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPVHSEDPV 171

Query: 819  LTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGRDC 998
               + RLAR+KA+ +F APC+TSVLYPT+GGNIH+F AITPCAVLDVLGPPYSKED RDC
Sbjct: 172  PPSQPRLARLKADSVFTAPCDTSVLYPTAGGNIHQFTAITPCAVLDVLGPPYSKEDDRDC 231

Query: 999  SYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVADAA 1166
            SYY+D P       E + V +E E   +GWL          MD I+YLGP++A+ +
Sbjct: 232  SYYRDVPCSAFPNGETT-VSEEVEGDLFGWLEEIQVPENSKMDRIEYLGPQIAETS 286


>ref|XP_003548690.1| PREDICTED: 2-aminoethanethiol dioxygenase [Glycine max]
          Length = 282

 Score =  307 bits (787), Expect = 7e-81
 Identities = 152/239 (63%), Positives = 178/239 (74%), Gaps = 2/239 (0%)
 Frame = +3

Query: 438  QISPILALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQV 617
            ++S    LQ+LF SC+EVFKG GTVPSP DVQ+L HIL+ M PEDVGLS DLQFFK    
Sbjct: 42   ELSVSKTLQQLFDSCREVFKGPGTVPSPQDVQRLRHILNNMKPEDVGLSRDLQFFKSGNK 101

Query: 618  IERNARVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDP 797
            ++   RVTY T+YKC+NFSLCI F+P   VIPLHNHP MTVFSKLLLG MHIKSYDWV  
Sbjct: 102  VKEKQRVTYTTVYKCNNFSLCIFFIPEGGVIPLHNHPDMTVFSKLLLGLMHIKSYDWVVH 161

Query: 798  VDSNDQTLTP--KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPP 971
              S+D  L P  ++RLA++KA+ +F + C+TSVLYPT+GGNIHEF AITPCAVLDV+GPP
Sbjct: 162  EASDDNLLQPQSQLRLAKLKADKVFTSSCDTSVLYPTTGGNIHEFTAITPCAVLDVIGPP 221

Query: 972  YSKEDGRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGP 1148
            YSKEDGRDCSYY+D PY +   E + G  KEE D SY WL          MD I+YLGP
Sbjct: 222  YSKEDGRDCSYYRDHPYASFPNERIIGEAKEEND-SYAWLEEIEMPENSEMDGIEYLGP 279


>gb|EXB36268.1| 2-aminoethanethiol dioxygenase [Morus notabilis]
          Length = 293

 Score =  301 bits (772), Expect = 4e-79
 Identities = 150/242 (61%), Positives = 180/242 (74%), Gaps = 8/242 (3%)
 Frame = +3

Query: 453  LALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNA 632
            + LQ LF SC++VFKG GTVP PNDV K+C IL+ M  EDVGLS DLQFFKP  ++ +  
Sbjct: 42   VTLQDLFFSCRQVFKGPGTVPLPNDVLKICRILEKMKAEDVGLSSDLQFFKPNSIVPKGT 101

Query: 633  --RVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVD---- 794
              RVTY TIYKC +FSLC+ FLPA+ VIPLHNHPGMTVFSKLLLGTMHIKSYDWVD    
Sbjct: 102  PPRVTYTTIYKCIDFSLCLFFLPANGVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDHHAS 161

Query: 795  --PVDSNDQTLTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGP 968
                D + Q+ + ++RLA++K +++F APCNTSVLYPT+GGNIH F AITPCAVLDVLGP
Sbjct: 162  AISKDDSSQS-SSQLRLAKLKTDNVFTAPCNTSVLYPTTGGNIHAFTAITPCAVLDVLGP 220

Query: 969  PYSKEDGRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGP 1148
            PYS EDGRDC+YYKD P+ +      +G  K EE +SYGWL          MD I+YLGP
Sbjct: 221  PYSTEDGRDCTYYKDYPHSSY----SNGENKLEEGASYGWLEEIEMPENSQMDWIEYLGP 276

Query: 1149 KV 1154
            ++
Sbjct: 277  QI 278


>ref|XP_006443485.1| hypothetical protein CICLE_v10021575mg [Citrus clementina]
            gi|568850955|ref|XP_006479161.1| PREDICTED:
            2-aminoethanethiol dioxygenase-like [Citrus sinensis]
            gi|557545747|gb|ESR56725.1| hypothetical protein
            CICLE_v10021575mg [Citrus clementina]
          Length = 280

 Score =  300 bits (769), Expect = 8e-79
 Identities = 145/236 (61%), Positives = 171/236 (72%)
 Frame = +3

Query: 453  LALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNA 632
            +ALQRLFLSC++VF+G GTVP+P+ VQ LC ILD M PEDVGLS  LQ    +  ++   
Sbjct: 45   MALQRLFLSCKDVFRGPGTVPAPSHVQMLCSILDEMKPEDVGLSSKLQLLSAKDAMKGTP 104

Query: 633  RVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSND 812
             VT  TIYKC NFSLC+ FLP + VIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVD+ND
Sbjct: 105  IVTSTTIYKCQNFSLCLFFLPPTAVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDAND 164

Query: 813  QTLTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGR 992
                 K RLA++ A+  F APCNTSVLYPT+GGNIHEF AIT CAVLDVLGPPYSK+DGR
Sbjct: 165  SAAPTKPRLAKLIADSDFTAPCNTSVLYPTTGGNIHEFTAITTCAVLDVLGPPYSKDDGR 224

Query: 993  DCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVAD 1160
            DCSYYK+ P   +   E     +++   S  WL          MD I+YLGP++ +
Sbjct: 225  DCSYYKELPLPAVPNGENQEAKEDDGGESCRWLEEIGVPENSHMDEIEYLGPQIIE 280


>ref|XP_002325431.2| hypothetical protein POPTR_0019s05510g [Populus trichocarpa]
            gi|550316870|gb|EEE99812.2| hypothetical protein
            POPTR_0019s05510g [Populus trichocarpa]
          Length = 236

 Score =  300 bits (767), Expect = 1e-78
 Identities = 143/235 (60%), Positives = 177/235 (75%)
 Frame = +3

Query: 459  LQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARV 638
            L  LF+SC+++FKG  TVP P D+++LC+ILD M PEDVGLS +LQFFK +  ++   RV
Sbjct: 2    LHNLFVSCRQMFKGPDTVPLPEDIKRLCNILDNMKPEDVGLSSELQFFKTKAAVKGTPRV 61

Query: 639  TYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQT 818
            TY TIYKC++FSLCI FLPA+ VIPLHNHPGMTVFSKLLLG MHIK+YD VDP  ++   
Sbjct: 62   TYTTIYKCNDFSLCIFFLPANAVIPLHNHPGMTVFSKLLLGKMHIKAYDLVDPPRADGPD 121

Query: 819  LTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGRDC 998
               ++RLA+++A+ +  APCNTSVLYPT+GGNIH+F AITPCAVLDVLGPPYSKE  RDC
Sbjct: 122  TPIQLRLAKLEADSVLTAPCNTSVLYPTTGGNIHQFTAITPCAVLDVLGPPYSKEGDRDC 181

Query: 999  SYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVADA 1163
            SYYKD PY  L   E+   +K+EE S Y WL          MD I+YLGP+V ++
Sbjct: 182  SYYKDFPYTALSNGEME--LKKEEGSCYAWLEETEVPENSKMDGIEYLGPQVDES 234


>gb|ESW33986.1| hypothetical protein PHAVU_001G114800g [Phaseolus vulgaris]
          Length = 280

 Score =  294 bits (753), Expect = 6e-77
 Identities = 144/232 (62%), Positives = 170/232 (73%), Gaps = 2/232 (0%)
 Frame = +3

Query: 459  LQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARV 638
            L++LF SC E FKG  TVPSP DVQ+L HILD M  EDVGL+ DLQFFKP  +IE N RV
Sbjct: 47   LRQLFHSCTETFKGPDTVPSPQDVQRLRHILDNMKAEDVGLNRDLQFFKPGNIIE-NQRV 105

Query: 639  TYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQT 818
            TY T++KCDNFSLCI F+P   +IPLHNHPGMTV SKLL+G MHIKSYDWV+P  S D  
Sbjct: 106  TYTTVFKCDNFSLCIFFIPEGGIIPLHNHPGMTVLSKLLIGLMHIKSYDWVEPEVSKDNL 165

Query: 819  LTP--KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGR 992
            L    ++RLA++KA+ ++   C+TSVLYPT+GGNIHEF AITPCAV DV+GPPYSK+D R
Sbjct: 166  LEQPSQLRLAKLKADKMYTTSCDTSVLYPTTGGNIHEFSAITPCAVFDVIGPPYSKKDDR 225

Query: 993  DCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGP 1148
            DCSYYKD P  +  KE +  V +E E  SY WL          MD I+YLGP
Sbjct: 226  DCSYYKDHPCTSSPKERIGEVKEENEKDSYAWLEEIEMPENSEMDGIEYLGP 277


>gb|ESW33987.1| hypothetical protein PHAVU_001G114800g [Phaseolus vulgaris]
          Length = 281

 Score =  294 bits (752), Expect = 8e-77
 Identities = 144/233 (61%), Positives = 170/233 (72%), Gaps = 3/233 (1%)
 Frame = +3

Query: 459  LQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARV 638
            L++LF SC E FKG  TVPSP DVQ+L HILD M  EDVGL+ DLQFFKP  +IE N RV
Sbjct: 47   LRQLFHSCTETFKGPDTVPSPQDVQRLRHILDNMKAEDVGLNRDLQFFKPGNIIE-NQRV 105

Query: 639  TYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQT 818
            TY T++KCDNFSLCI F+P   +IPLHNHPGMTV SKLL+G MHIKSYDWV+P  S D  
Sbjct: 106  TYTTVFKCDNFSLCIFFIPEGGIIPLHNHPGMTVLSKLLIGLMHIKSYDWVEPEVSKDNL 165

Query: 819  L---TPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDG 989
            L   +  +RLA++KA+ ++   C+TSVLYPT+GGNIHEF AITPCAV DV+GPPYSK+D 
Sbjct: 166  LEQPSQSVRLAKLKADKMYTTSCDTSVLYPTTGGNIHEFSAITPCAVFDVIGPPYSKKDD 225

Query: 990  RDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGP 1148
            RDCSYYKD P  +  KE +  V +E E  SY WL          MD I+YLGP
Sbjct: 226  RDCSYYKDHPCTSSPKERIGEVKEENEKDSYAWLEEIEMPENSEMDGIEYLGP 278


>ref|XP_004303011.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Fragaria vesca subsp.
            vesca]
          Length = 286

 Score =  294 bits (752), Expect = 8e-77
 Identities = 147/238 (61%), Positives = 175/238 (73%), Gaps = 3/238 (1%)
 Frame = +3

Query: 456  ALQRLFLSCQEVFKG--HGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERN 629
            ALQRLF+SC++VFKG  +GT+P P+ VQ+L  +LD + P+DVGLS DLQFFKP   ++  
Sbjct: 53   ALQRLFVSCKDVFKGLGNGTLPLPHQVQELRSVLDKIRPQDVGLSNDLQFFKPNTRVKGT 112

Query: 630  ARVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWV-DPVDS 806
             RVTY TIYKC NFSLC  F+PA+ VIPLHNHPGMTVFSKLLLG MHIKSYD V DP   
Sbjct: 113  PRVTYTTIYKCSNFSLCCFFIPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDLVDDPTKK 172

Query: 807  NDQTLTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKED 986
            N      ++RLA++KA+ +F APCNTSVLYPT+GGNIH F AITPCAVLDVLGPPYSK+D
Sbjct: 173  NSD--GSQLRLAKLKADSVFTAPCNTSVLYPTTGGNIHAFTAITPCAVLDVLGPPYSKQD 230

Query: 987  GRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXXMDVIKYLGPKVAD 1160
            GRDCSYY+D PY        +  V +EE   YGWL          MD I+YLGP++ D
Sbjct: 231  GRDCSYYRDHPYAAY----PNATVTQEEGHYYGWLEEIEVPPNSEMDGIEYLGPQIID 284


Top