BLASTX nr result

ID: Catharanthus23_contig00018690 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00018690
         (1486 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003624687.1| 2-aminoethanethiol dioxygenase [Medicago tru...   328   5e-87
gb|EMJ05897.1| hypothetical protein PRUPE_ppa009537mg [Prunus pe...   326   1e-86
gb|AFK44841.1| unknown [Medicago truncatula]                          326   1e-86
ref|XP_004149110.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   323   2e-85
ref|XP_006600234.1| PREDICTED: uncharacterized protein LOC100819...   322   3e-85
ref|NP_001241359.1| uncharacterized protein LOC100819405 [Glycin...   320   1e-84
ref|XP_004493204.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   319   2e-84
ref|XP_002275517.1| PREDICTED: 2-aminoethanethiol dioxygenase [V...   319   2e-84
ref|XP_004489672.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   313   9e-83
ref|XP_003554039.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   313   9e-83
ref|XP_002319186.2| hypothetical protein POPTR_0013s06040g [Popu...   313   2e-82
ref|XP_003554041.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   312   2e-82
gb|EOY10511.1| Uncharacterized protein isoform 1 [Theobroma cacao]    310   1e-81
ref|XP_003548690.1| PREDICTED: 2-aminoethanethiol dioxygenase [G...   305   4e-80
gb|EXB36268.1| 2-aminoethanethiol dioxygenase [Morus notabilis]       304   7e-80
ref|XP_006443485.1| hypothetical protein CICLE_v10021575mg [Citr...   301   5e-79
ref|XP_002325431.2| hypothetical protein POPTR_0019s05510g [Popu...   300   8e-79
ref|XP_004303011.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   296   2e-77
gb|ESW33987.1| hypothetical protein PHAVU_001G114800g [Phaseolus...   293   2e-76
gb|ESW33986.1| hypothetical protein PHAVU_001G114800g [Phaseolus...   293   2e-76

>ref|XP_003624687.1| 2-aminoethanethiol dioxygenase [Medicago truncatula]
            gi|87162727|gb|ABD28522.1| Cupin, RmlC-type [Medicago
            truncatula] gi|355499702|gb|AES80905.1|
            2-aminoethanethiol dioxygenase [Medicago truncatula]
          Length = 283

 Score =  328 bits (840), Expect = 5e-87
 Identities = 162/236 (68%), Positives = 182/236 (77%), Gaps = 1/236 (0%)
 Frame = -3

Query: 1043 ALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNAR 864
            ALQ LF SC++ FKG GTVPSP DV KLCHILD M PEDVGLS DLQFFKP  +I+ N R
Sbjct: 46   ALQELFDSCKQTFKGPGTVPSPRDVHKLCHILDNMKPEDVGLSRDLQFFKPGNIIKENQR 105

Query: 863  VTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQ 684
            VTY T+YKCDNFSLCI FLP   VIPLHNHPGMTVFSKLLLG MHIKSYDWVD   S++ 
Sbjct: 106  VTYTTVYKCDNFSLCIFFLPERGVIPLHNHPGMTVFSKLLLGQMHIKSYDWVDHEASHNL 165

Query: 683  TS-TPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGR 507
               + K+RLA++KAN  F APC+TSVLYPT+GGNIHEF AITPCAVLDV+GPPYSKEDGR
Sbjct: 166  LQPSSKLRLAKLKANKTFTAPCDTSVLYPTTGGNIHEFTAITPCAVLDVIGPPYSKEDGR 225

Query: 506  DCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVAD 339
            DCSYYKD PY     EE  G VK+++D SYG L         +MD I+YLGP + D
Sbjct: 226  DCSYYKDYPYNAFPNEEKIGEVKDKDD-SYGLLEEIDMPENCQMDGIEYLGPPIDD 280


>gb|EMJ05897.1| hypothetical protein PRUPE_ppa009537mg [Prunus persica]
          Length = 287

 Score =  326 bits (836), Expect = 1e-86
 Identities = 158/240 (65%), Positives = 181/240 (75%)
 Frame = -3

Query: 1052 PILALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIER 873
            P   LQ+LF+SC++VFKG GTVPSP+DV  LC ILD M PEDVGLS DLQFFKP+ V++ 
Sbjct: 49   PPTVLQQLFVSCRQVFKGPGTVPSPHDVHNLCSILDKMRPEDVGLSRDLQFFKPKTVVQG 108

Query: 872  NARVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDS 693
              RVTY TIY+C NFSLC LF+PA+ VIPLHNHP MTVFSKLLLG MHIKSYDWVDPV+S
Sbjct: 109  TPRVTYTTIYECSNFSLCCLFIPATGVIPLHNHPEMTVFSKLLLGKMHIKSYDWVDPVNS 168

Query: 692  NDQTSTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKED 513
            +  T  P++RLA++KA+ +F +PCNTSVLYPT GGNIH F AITPCAVLDVLGPPYSKED
Sbjct: 169  DGSTPAPQLRLAKLKADSVFTSPCNTSVLYPTEGGNIHAFTAITPCAVLDVLGPPYSKED 228

Query: 512  GRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVADAA 333
             RDCSYYKD PY      E S  V E     YGWL         EMD I YLGP+V + +
Sbjct: 229  DRDCSYYKDHPYAAYSNGEAS--VTEGNGDCYGWLEEIEMPENSEMDKIPYLGPQVTETS 286


>gb|AFK44841.1| unknown [Medicago truncatula]
          Length = 283

 Score =  326 bits (836), Expect = 1e-86
 Identities = 162/236 (68%), Positives = 180/236 (76%), Gaps = 1/236 (0%)
 Frame = -3

Query: 1043 ALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNAR 864
            ALQ LF SC++ FKG GTVPSP DV KLCHILD M PEDVGLS DLQFFKP  +I+ N R
Sbjct: 46   ALQELFDSCKQTFKGPGTVPSPRDVHKLCHILDNMKPEDVGLSRDLQFFKPGNIIKENQR 105

Query: 863  VTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVD-PVDSND 687
            VTY T+YKCDNFSLCI FLP   VIPLHNHPGMTVFSKLLLG MHIKSYDWVD     N 
Sbjct: 106  VTYTTVYKCDNFSLCIFFLPERGVIPLHNHPGMTVFSKLLLGQMHIKSYDWVDHEAFHNL 165

Query: 686  QTSTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGR 507
               + K+RLA++KAN  F APC+TSVLYPT+GGNIHEF AITPCAVLDV+GPPYSKEDGR
Sbjct: 166  LQPSSKLRLAKLKANKTFTAPCDTSVLYPTTGGNIHEFTAITPCAVLDVIGPPYSKEDGR 225

Query: 506  DCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVAD 339
            DCSYYKD PY     EE  G VK+++D SYG L         +MD I+YLGP + D
Sbjct: 226  DCSYYKDYPYNAFPNEEKIGEVKDKDD-SYGLLEEIDMPENCQMDGIEYLGPPIDD 280


>ref|XP_004149110.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Cucumis sativus]
          Length = 278

 Score =  323 bits (827), Expect = 2e-85
 Identities = 155/236 (65%), Positives = 183/236 (77%)
 Frame = -3

Query: 1046 LALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNA 867
            +ALQ LF+SC+EVFKG GTVP P DV+KLC ILD M  EDVGLS  LQFFKP   ++ + 
Sbjct: 43   MALQELFVSCREVFKGPGTVPLPCDVEKLCRILDNMKAEDVGLSSSLQFFKPNVPVKGSP 102

Query: 866  RVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSND 687
            RVTY TIYKCDNFSLCI FLPA+ VIPLHNHPGMTVFSKLLLG MHIKSYDWVDP +S+D
Sbjct: 103  RVTYTTIYKCDNFSLCIFFLPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPTNSDD 162

Query: 686  QTSTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGR 507
                 + RLA++KA+ +F +PC+TSVLYPTSGGNIH F AITPCAVLDVLGPPYS EDGR
Sbjct: 163  TAQPCEKRLAKLKADAVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVLGPPYSMEDGR 222

Query: 506  DCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVAD 339
            DCSYYK+ PY +    ++ G+ +E++   YGWL         EMD I+YLGP++ D
Sbjct: 223  DCSYYKEHPYASFPNGDM-GLGEEDQGEGYGWLEEIEVPENSEMDGIEYLGPQICD 277


>ref|XP_006600234.1| PREDICTED: uncharacterized protein LOC100819405 isoform X1 [Glycine
            max]
          Length = 299

 Score =  322 bits (825), Expect = 3e-85
 Identities = 161/239 (67%), Positives = 183/239 (76%), Gaps = 2/239 (0%)
 Frame = -3

Query: 1043 ALQRLFLSCQEVFKGHG-TVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNA 867
            ALQ LF+SC+E FKG G TVPSP DVQKLCHILD+M PEDVGL  DLQFFKP  +++ N 
Sbjct: 62   ALQELFVSCRETFKGPGGTVPSPQDVQKLCHILDSMKPEDVGLRSDLQFFKPENIVKENQ 121

Query: 866  RVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSND 687
            RVT  TIY C+NFSLC+ FLPA  VIPLHNHP MTVFSKLLLG MHIKSYDWVD   S++
Sbjct: 122  RVTCTTIYSCENFSLCLFFLPAKGVIPLHNHPEMTVFSKLLLGQMHIKSYDWVDSEVSHN 181

Query: 686  QTSTP-KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDG 510
                P + RLAR+KAN++F APC+TSVLYP SGGNIHEF AITPCAVLDVLGPPYSK+DG
Sbjct: 182  LLHQPSQFRLARLKANNVFTAPCDTSVLYPQSGGNIHEFTAITPCAVLDVLGPPYSKDDG 241

Query: 509  RDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVADAA 333
            RDCS+Y+D PY      E SG VKEE D SYGWL         +MD I+YLGP + + A
Sbjct: 242  RDCSFYRDHPYTAFPNGE-SGKVKEEND-SYGWLEEIEMPENSQMDGIEYLGPPIIETA 298


>ref|NP_001241359.1| uncharacterized protein LOC100819405 [Glycine max]
            gi|255641533|gb|ACU21040.1| unknown [Glycine max]
          Length = 301

 Score =  320 bits (820), Expect = 1e-84
 Identities = 160/240 (66%), Positives = 183/240 (76%), Gaps = 3/240 (1%)
 Frame = -3

Query: 1043 ALQRLFLSCQEVFKGHG-TVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNA 867
            ALQ LF+SC+E FKG G TVPSP DVQKLCHILD+M PEDVGL  DLQFFKP  +++ N 
Sbjct: 62   ALQELFVSCRETFKGPGGTVPSPQDVQKLCHILDSMKPEDVGLRSDLQFFKPENIVKENQ 121

Query: 866  RVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSND 687
            RVT  TIY C+NFSLC+ FLPA  VIPLHNHP MTVFSKLLLG MHIKSYDWVD   S++
Sbjct: 122  RVTCTTIYSCENFSLCLFFLPAKGVIPLHNHPEMTVFSKLLLGQMHIKSYDWVDSEVSHN 181

Query: 686  QTSTP-KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDG 510
                P + RLAR+KAN++F APC+TSVLYP SGGNIHEF AITPCAVLDVLGPPYSK+DG
Sbjct: 182  LLHQPSQFRLARLKANNVFTAPCDTSVLYPQSGGNIHEFTAITPCAVLDVLGPPYSKDDG 241

Query: 509  RDCSYYKDTPYKTL-LKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVADAA 333
            RDCS+Y+D PY      +  SG VKEE D SYGWL         +MD I+YLGP + + A
Sbjct: 242  RDCSFYRDHPYTAFPTADGESGKVKEEND-SYGWLEEIEMPENSQMDGIEYLGPPIIETA 300


>ref|XP_004493204.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Cicer arietinum]
          Length = 286

 Score =  319 bits (818), Expect = 2e-84
 Identities = 158/236 (66%), Positives = 178/236 (75%), Gaps = 1/236 (0%)
 Frame = -3

Query: 1043 ALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNAR 864
            ALQ LF SC++ FKG  TVPSP DV KLCHILD M PEDVGLS DLQFFKP  +I+ N R
Sbjct: 49   ALQELFGSCKQTFKGINTVPSPQDVHKLCHILDNMKPEDVGLSRDLQFFKPGNIIKENQR 108

Query: 863  VTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQ 684
            VTY T+YKCDNFSLCI FLP   VIPLHNHPGMTVFSKLLLG MHIKSYDWVD   +++ 
Sbjct: 109  VTYTTVYKCDNFSLCIFFLPERGVIPLHNHPGMTVFSKLLLGQMHIKSYDWVDSEATHNL 168

Query: 683  TSTP-KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGR 507
               P K+RLA++KAND+F APC+TSVLYPT+GGNIHEF AITPCAVLDV+GPPYSKEDGR
Sbjct: 169  LQQPSKLRLAKLKANDVFTAPCDTSVLYPTTGGNIHEFTAITPCAVLDVIGPPYSKEDGR 228

Query: 506  DCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVAD 339
            DCSYYKD P      EE    V ++ D SY  L         +MD I+YLGP + D
Sbjct: 229  DCSYYKDHPCDAFPNEEEIAKVNDKND-SYALLEEIEMPENCQMDGIEYLGPPIND 283


>ref|XP_002275517.1| PREDICTED: 2-aminoethanethiol dioxygenase [Vitis vinifera]
            gi|296085895|emb|CBI31219.3| unnamed protein product
            [Vitis vinifera]
          Length = 275

 Score =  319 bits (817), Expect = 2e-84
 Identities = 153/239 (64%), Positives = 181/239 (75%), Gaps = 2/239 (0%)
 Frame = -3

Query: 1043 ALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNAR 864
            +LQRLF++C++VFKG GTVP P DV KLCHILD M PEDVGLS D+ FFK ++  +   +
Sbjct: 36   SLQRLFVACRDVFKGLGTVPQPIDVTKLCHILDNMRPEDVGLSKDIPFFKAKRAAQGIPK 95

Query: 863  VTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQ 684
            VT AT+YKC+ FSLCI FLP   VIPLHNHPGMTVFSKLLLG+MHIKSYDWVDPV S+  
Sbjct: 96   VTCATVYKCEEFSLCIFFLPPRAVIPLHNHPGMTVFSKLLLGSMHIKSYDWVDPVGSDSS 155

Query: 683  TSTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGRD 504
            +   K+RLAR+KA+ +F APCNTSVLYPTSGGNIH F AITPCAVLDVLGPPYSK+DGRD
Sbjct: 156  SPPSKLRLARLKADSVFTAPCNTSVLYPTSGGNIHAFTAITPCAVLDVLGPPYSKKDGRD 215

Query: 503  CSYYKDTPYKTLLKEELSGVVKE--EEDSSYGWLXXXXXXXXXEMDVIKYLGPKVADAA 333
            CSYYKD+PY      E     +E  EE+  YGWL         +MD  +YLGP++ D +
Sbjct: 216  CSYYKDSPYTPFSNGEARTRKEEDGEEEERYGWLEEVEMPEDSKMDWTEYLGPQIIDTS 274


>ref|XP_004489672.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Cicer arietinum]
          Length = 293

 Score =  313 bits (803), Expect = 9e-83
 Identities = 150/237 (63%), Positives = 181/237 (76%), Gaps = 1/237 (0%)
 Frame = -3

Query: 1040 LQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARV 861
            LQ+LF+SC+E FKG  T+PS   V KLCHILD M PEDVGLS DLQFFK   +++ N RV
Sbjct: 58   LQKLFVSCKETFKGPDTIPSTQHVHKLCHILDNMKPEDVGLSKDLQFFKSEYIVKENPRV 117

Query: 860  TYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQT 681
            TY TIYKCDNFSLCI FLP+  VIPLHNHPGMTVFSKLLLG MHIKSYDWVDP  S++  
Sbjct: 118  TYTTIYKCDNFSLCIFFLPSKGVIPLHNHPGMTVFSKLLLGQMHIKSYDWVDPEVSHNLL 177

Query: 680  STP-KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGRD 504
              P ++R+A++KAN +F +PC+TSVLYP +GGNIHEF AITPCAVLDV+GPPYSK+DGRD
Sbjct: 178  QQPSQLRMAKLKANKVFTSPCDTSVLYPKTGGNIHEFTAITPCAVLDVIGPPYSKDDGRD 237

Query: 503  CSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVADAA 333
            CSYYKD  Y      E++ +  +EE+ SY WL         +MD I+YLGP + ++A
Sbjct: 238  CSYYKDHLYTAFPNGEIAEL--KEENESYAWLEEIEMPENSQMDGIEYLGPPIIESA 292


>ref|XP_003554039.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Glycine max]
          Length = 281

 Score =  313 bits (803), Expect = 9e-83
 Identities = 153/239 (64%), Positives = 178/239 (74%), Gaps = 2/239 (0%)
 Frame = -3

Query: 1061 QVSPILALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQV 882
            ++S    L +LF SC+E FKG GTVPSP DVQ+L HILD M PEDVGLS DLQFFKP  +
Sbjct: 41   ELSVSKTLHQLFDSCREAFKGPGTVPSPQDVQRLTHILDNMKPEDVGLSRDLQFFKPGNI 100

Query: 881  IERNARVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDP 702
            ++ N RVTY T+YKCDNFSLCI F+P   VIPLHNHP MTVFSKLLLG MHIKSYDWVDP
Sbjct: 101  VKENQRVTYTTVYKCDNFSLCIFFIPEGGVIPLHNHPDMTVFSKLLLGLMHIKSYDWVDP 160

Query: 701  VDSNDQTSTP--KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPP 528
              S+D    P  ++RLA +K + +F + C+TSVLYPT+GGNIHEF AITPCAVLDV+GPP
Sbjct: 161  EASDDNMLQPQSQLRLAMLKVDKVFTSSCDTSVLYPTTGGNIHEFTAITPCAVLDVIGPP 220

Query: 527  YSKEDGRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGP 351
            YSKEDGRDCSYY+D PY     E + G  KEE D SY WL         EM+ ++YLGP
Sbjct: 221  YSKEDGRDCSYYRDHPYTCFPNERIIGEAKEEND-SYTWLEEIEMPENSEMNGVEYLGP 278


>ref|XP_002319186.2| hypothetical protein POPTR_0013s06040g [Populus trichocarpa]
            gi|118485411|gb|ABK94562.1| unknown [Populus trichocarpa]
            gi|550325071|gb|EEE95109.2| hypothetical protein
            POPTR_0013s06040g [Populus trichocarpa]
          Length = 278

 Score =  313 bits (801), Expect = 2e-82
 Identities = 153/284 (53%), Positives = 194/284 (68%)
 Frame = -3

Query: 1184 MSIEAVEVAGLVGPRKDFIGQVNEXXXXXXXXXXXXXXXXIQVSPILALQRLFLSCQEVF 1005
            M+IEA      V PR++    VN                  + +P +ALQ L++SC+EVF
Sbjct: 1    MTIEAT-----VEPRREPTAHVNRLGFAKRPTKRKRSKKTKKCAPTMALQDLYVSCKEVF 55

Query: 1004 KGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARVTYATIYKCDNFS 825
            KG GTVP   DV++LCH+LD M  ED GLS  L+FF P+  +    RVTY  +Y+CD FS
Sbjct: 56   KGPGTVPLHQDVKRLCHMLDNMKLEDFGLSCKLEFFNPKAAVRGTPRVTYTIVYECDKFS 115

Query: 824  LCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQTSTPKMRLARMKA 645
            +C+ FLPA+ VIPLHNHPGMTVFSKLL+GTMH+KSYDWVDP  +++  S  ++RLA+++A
Sbjct: 116  MCVFFLPATAVIPLHNHPGMTVFSKLLMGTMHVKSYDWVDPPATDEPDSPAQVRLAKLEA 175

Query: 644  NDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGRDCSYYKDTPYKTLL 465
            + +F APC+TSVLYPT+GGNIH+F AITPCAVLDVLGPPYS EDGRDCSYYKD PY    
Sbjct: 176  DSVFTAPCHTSVLYPTTGGNIHQFTAITPCAVLDVLGPPYSNEDGRDCSYYKDFPYTAFP 235

Query: 464  KEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVADAA 333
              E+    +EEE   Y WL         +M VIKYLGP+V D++
Sbjct: 236  NGEMGS--EEEEGDCYAWLEEITVPENLQMFVIKYLGPQVDDSS 277


>ref|XP_003554041.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Glycine max]
          Length = 281

 Score =  312 bits (800), Expect = 2e-82
 Identities = 152/239 (63%), Positives = 178/239 (74%), Gaps = 2/239 (0%)
 Frame = -3

Query: 1061 QVSPILALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQV 882
            ++S    L +LF SC+E FKG GTVPSP DV++L HILD M PEDVGLS DLQFFKP  +
Sbjct: 41   ELSVSKTLHQLFDSCREAFKGPGTVPSPQDVKRLTHILDNMKPEDVGLSRDLQFFKPGNI 100

Query: 881  IERNARVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDP 702
            ++ N RVTY T+YKCDNFSLCI F+P   VIPLHNHP MTVFSKLLLG MHIKSYDWV+P
Sbjct: 101  VKENQRVTYTTVYKCDNFSLCIFFIPEGGVIPLHNHPDMTVFSKLLLGLMHIKSYDWVEP 160

Query: 701  VDSNDQTSTP--KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPP 528
              S+D    P  ++RLAR+K + +F + C TSVLYPT+GGNIHEF AITPCAVLDV+GPP
Sbjct: 161  EASDDNMLQPQSQLRLARLKVDKVFTSSCGTSVLYPTTGGNIHEFTAITPCAVLDVIGPP 220

Query: 527  YSKEDGRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGP 351
            YSKEDGRDCSYY+D PY     E + G  KEE D SY WL         EM+ ++YLGP
Sbjct: 221  YSKEDGRDCSYYRDHPYTCFPNERIIGEAKEEND-SYTWLEEIEMPENSEMNGVEYLGP 278


>gb|EOY10511.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 287

 Score =  310 bits (793), Expect = 1e-81
 Identities = 150/236 (63%), Positives = 174/236 (73%)
 Frame = -3

Query: 1040 LQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARV 861
            L  LF++C+EVFKG G VP P+DV KLC ILD M PEDVGLS +LQFFK R  +    RV
Sbjct: 52   LPELFVACREVFKGPGNVPPPSDVDKLCSILDRMKPEDVGLSKNLQFFKARGAVTGTPRV 111

Query: 860  TYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQT 681
            TY TIY+CD FSLCI FLP   VIPLHNHPGMTVFSKLLLG MHIKSYDWVDPV S D  
Sbjct: 112  TYTTIYQCDEFSLCIFFLPEKAVIPLHNHPGMTVFSKLLLGKMHIKSYDWVDPVHSEDPV 171

Query: 680  STPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGRDC 501
               + RLAR+KA+ +F APC+TSVLYPT+GGNIH+F AITPCAVLDVLGPPYSKED RDC
Sbjct: 172  PPSQPRLARLKADSVFTAPCDTSVLYPTAGGNIHQFTAITPCAVLDVLGPPYSKEDDRDC 231

Query: 500  SYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVADAA 333
            SYY+D P       E + V +E E   +GWL         +MD I+YLGP++A+ +
Sbjct: 232  SYYRDVPCSAFPNGETT-VSEEVEGDLFGWLEEIQVPENSKMDRIEYLGPQIAETS 286


>ref|XP_003548690.1| PREDICTED: 2-aminoethanethiol dioxygenase [Glycine max]
          Length = 282

 Score =  305 bits (780), Expect = 4e-80
 Identities = 152/239 (63%), Positives = 178/239 (74%), Gaps = 2/239 (0%)
 Frame = -3

Query: 1061 QVSPILALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQV 882
            ++S    LQ+LF SC+EVFKG GTVPSP DVQ+L HIL+ M PEDVGLS DLQFFK    
Sbjct: 42   ELSVSKTLQQLFDSCREVFKGPGTVPSPQDVQRLRHILNNMKPEDVGLSRDLQFFKSGNK 101

Query: 881  IERNARVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDP 702
            ++   RVTY T+YKC+NFSLCI F+P   VIPLHNHP MTVFSKLLLG MHIKSYDWV  
Sbjct: 102  VKEKQRVTYTTVYKCNNFSLCIFFIPEGGVIPLHNHPDMTVFSKLLLGLMHIKSYDWVVH 161

Query: 701  VDSNDQTSTP--KMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPP 528
              S+D    P  ++RLA++KA+ +F + C+TSVLYPT+GGNIHEF AITPCAVLDV+GPP
Sbjct: 162  EASDDNLLQPQSQLRLAKLKADKVFTSSCDTSVLYPTTGGNIHEFTAITPCAVLDVIGPP 221

Query: 527  YSKEDGRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGP 351
            YSKEDGRDCSYY+D PY +   E + G  KEE D SY WL         EMD I+YLGP
Sbjct: 222  YSKEDGRDCSYYRDHPYASFPNERIIGEAKEEND-SYAWLEEIEMPENSEMDGIEYLGP 279


>gb|EXB36268.1| 2-aminoethanethiol dioxygenase [Morus notabilis]
          Length = 293

 Score =  304 bits (778), Expect = 7e-80
 Identities = 150/241 (62%), Positives = 182/241 (75%), Gaps = 7/241 (2%)
 Frame = -3

Query: 1046 LALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNA 867
            + LQ LF SC++VFKG GTVP PNDV K+C IL+ M  EDVGLS DLQFFKP  ++ +  
Sbjct: 42   VTLQDLFFSCRQVFKGPGTVPLPNDVLKICRILEKMKAEDVGLSSDLQFFKPNSIVPKGT 101

Query: 866  --RVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVD---- 705
              RVTY TIYKC +FSLC+ FLPA+ VIPLHNHPGMTVFSKLLLGTMHIKSYDWVD    
Sbjct: 102  PPRVTYTTIYKCIDFSLCLFFLPANGVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDHHAS 161

Query: 704  PVDSNDQT-STPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPP 528
             +  +D + S+ ++RLA++K +++F APCNTSVLYPT+GGNIH F AITPCAVLDVLGPP
Sbjct: 162  AISKDDSSQSSSQLRLAKLKTDNVFTAPCNTSVLYPTTGGNIHAFTAITPCAVLDVLGPP 221

Query: 527  YSKEDGRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPK 348
            YS EDGRDC+YYKD P+ +      +G  K EE +SYGWL         +MD I+YLGP+
Sbjct: 222  YSTEDGRDCTYYKDYPHSSY----SNGENKLEEGASYGWLEEIEMPENSQMDWIEYLGPQ 277

Query: 347  V 345
            +
Sbjct: 278  I 278


>ref|XP_006443485.1| hypothetical protein CICLE_v10021575mg [Citrus clementina]
            gi|568850955|ref|XP_006479161.1| PREDICTED:
            2-aminoethanethiol dioxygenase-like [Citrus sinensis]
            gi|557545747|gb|ESR56725.1| hypothetical protein
            CICLE_v10021575mg [Citrus clementina]
          Length = 280

 Score =  301 bits (771), Expect = 5e-79
 Identities = 145/236 (61%), Positives = 172/236 (72%)
 Frame = -3

Query: 1046 LALQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNA 867
            +ALQRLFLSC++VF+G GTVP+P+ VQ LC ILD M PEDVGLS  LQ    +  ++   
Sbjct: 45   MALQRLFLSCKDVFRGPGTVPAPSHVQMLCSILDEMKPEDVGLSSKLQLLSAKDAMKGTP 104

Query: 866  RVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSND 687
             VT  TIYKC NFSLC+ FLP + VIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVD+ND
Sbjct: 105  IVTSTTIYKCQNFSLCLFFLPPTAVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDAND 164

Query: 686  QTSTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGR 507
              +  K RLA++ A+  F APCNTSVLYPT+GGNIHEF AIT CAVLDVLGPPYSK+DGR
Sbjct: 165  SAAPTKPRLAKLIADSDFTAPCNTSVLYPTTGGNIHEFTAITTCAVLDVLGPPYSKDDGR 224

Query: 506  DCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVAD 339
            DCSYYK+ P   +   E     +++   S  WL          MD I+YLGP++ +
Sbjct: 225  DCSYYKELPLPAVPNGENQEAKEDDGGESCRWLEEIGVPENSHMDEIEYLGPQIIE 280


>ref|XP_002325431.2| hypothetical protein POPTR_0019s05510g [Populus trichocarpa]
            gi|550316870|gb|EEE99812.2| hypothetical protein
            POPTR_0019s05510g [Populus trichocarpa]
          Length = 236

 Score =  300 bits (769), Expect = 8e-79
 Identities = 143/235 (60%), Positives = 179/235 (76%)
 Frame = -3

Query: 1040 LQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARV 861
            L  LF+SC+++FKG  TVP P D+++LC+ILD M PEDVGLS +LQFFK +  ++   RV
Sbjct: 2    LHNLFVSCRQMFKGPDTVPLPEDIKRLCNILDNMKPEDVGLSSELQFFKTKAAVKGTPRV 61

Query: 860  TYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQT 681
            TY TIYKC++FSLCI FLPA+ VIPLHNHPGMTVFSKLLLG MHIK+YD VDP  ++   
Sbjct: 62   TYTTIYKCNDFSLCIFFLPANAVIPLHNHPGMTVFSKLLLGKMHIKAYDLVDPPRADGPD 121

Query: 680  STPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGRDC 501
            +  ++RLA+++A+ +  APCNTSVLYPT+GGNIH+F AITPCAVLDVLGPPYSKE  RDC
Sbjct: 122  TPIQLRLAKLEADSVLTAPCNTSVLYPTTGGNIHQFTAITPCAVLDVLGPPYSKEGDRDC 181

Query: 500  SYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVADA 336
            SYYKD PY  L   E+   +K+EE S Y WL         +MD I+YLGP+V ++
Sbjct: 182  SYYKDFPYTALSNGEME--LKKEEGSCYAWLEETEVPENSKMDGIEYLGPQVDES 234


>ref|XP_004303011.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Fragaria vesca subsp.
            vesca]
          Length = 286

 Score =  296 bits (757), Expect = 2e-77
 Identities = 149/238 (62%), Positives = 177/238 (74%), Gaps = 3/238 (1%)
 Frame = -3

Query: 1043 ALQRLFLSCQEVFKG--HGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERN 870
            ALQRLF+SC++VFKG  +GT+P P+ VQ+L  +LD + P+DVGLS DLQFFKP   ++  
Sbjct: 53   ALQRLFVSCKDVFKGLGNGTLPLPHQVQELRSVLDKIRPQDVGLSNDLQFFKPNTRVKGT 112

Query: 869  ARVTYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWV-DPVDS 693
             RVTY TIYKC NFSLC  F+PA+ VIPLHNHPGMTVFSKLLLG MHIKSYD V DP   
Sbjct: 113  PRVTYTTIYKCSNFSLCCFFIPATGVIPLHNHPGMTVFSKLLLGKMHIKSYDLVDDPTKK 172

Query: 692  NDQTSTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKED 513
            N   S  ++RLA++KA+ +F APCNTSVLYPT+GGNIH F AITPCAVLDVLGPPYSK+D
Sbjct: 173  NSDGS--QLRLAKLKADSVFTAPCNTSVLYPTTGGNIHAFTAITPCAVLDVLGPPYSKQD 230

Query: 512  GRDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGPKVAD 339
            GRDCSYY+D PY        +  V +EE   YGWL         EMD I+YLGP++ D
Sbjct: 231  GRDCSYYRDHPYAAY----PNATVTQEEGHYYGWLEEIEVPPNSEMDGIEYLGPQIID 284


>gb|ESW33987.1| hypothetical protein PHAVU_001G114800g [Phaseolus vulgaris]
          Length = 281

 Score =  293 bits (749), Expect = 2e-76
 Identities = 144/233 (61%), Positives = 171/233 (73%), Gaps = 3/233 (1%)
 Frame = -3

Query: 1040 LQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARV 861
            L++LF SC E FKG  TVPSP DVQ+L HILD M  EDVGL+ DLQFFKP  +IE N RV
Sbjct: 47   LRQLFHSCTETFKGPDTVPSPQDVQRLRHILDNMKAEDVGLNRDLQFFKPGNIIE-NQRV 105

Query: 860  TYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSND-- 687
            TY T++KCDNFSLCI F+P   +IPLHNHPGMTV SKLL+G MHIKSYDWV+P  S D  
Sbjct: 106  TYTTVFKCDNFSLCIFFIPEGGIIPLHNHPGMTVLSKLLIGLMHIKSYDWVEPEVSKDNL 165

Query: 686  -QTSTPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDG 510
             +  +  +RLA++KA+ ++   C+TSVLYPT+GGNIHEF AITPCAV DV+GPPYSK+D 
Sbjct: 166  LEQPSQSVRLAKLKADKMYTTSCDTSVLYPTTGGNIHEFSAITPCAVFDVIGPPYSKKDD 225

Query: 509  RDCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGP 351
            RDCSYYKD P  +  KE +  V +E E  SY WL         EMD I+YLGP
Sbjct: 226  RDCSYYKDHPCTSSPKERIGEVKEENEKDSYAWLEEIEMPENSEMDGIEYLGP 278


>gb|ESW33986.1| hypothetical protein PHAVU_001G114800g [Phaseolus vulgaris]
          Length = 280

 Score =  293 bits (749), Expect = 2e-76
 Identities = 144/232 (62%), Positives = 170/232 (73%), Gaps = 2/232 (0%)
 Frame = -3

Query: 1040 LQRLFLSCQEVFKGHGTVPSPNDVQKLCHILDAMMPEDVGLSGDLQFFKPRQVIERNARV 861
            L++LF SC E FKG  TVPSP DVQ+L HILD M  EDVGL+ DLQFFKP  +IE N RV
Sbjct: 47   LRQLFHSCTETFKGPDTVPSPQDVQRLRHILDNMKAEDVGLNRDLQFFKPGNIIE-NQRV 105

Query: 860  TYATIYKCDNFSLCILFLPASTVIPLHNHPGMTVFSKLLLGTMHIKSYDWVDPVDSNDQT 681
            TY T++KCDNFSLCI F+P   +IPLHNHPGMTV SKLL+G MHIKSYDWV+P  S D  
Sbjct: 106  TYTTVFKCDNFSLCIFFIPEGGIIPLHNHPGMTVLSKLLIGLMHIKSYDWVEPEVSKDNL 165

Query: 680  --STPKMRLARMKANDIFRAPCNTSVLYPTSGGNIHEFRAITPCAVLDVLGPPYSKEDGR 507
                 ++RLA++KA+ ++   C+TSVLYPT+GGNIHEF AITPCAV DV+GPPYSK+D R
Sbjct: 166  LEQPSQLRLAKLKADKMYTTSCDTSVLYPTTGGNIHEFSAITPCAVFDVIGPPYSKKDDR 225

Query: 506  DCSYYKDTPYKTLLKEELSGVVKEEEDSSYGWLXXXXXXXXXEMDVIKYLGP 351
            DCSYYKD P  +  KE +  V +E E  SY WL         EMD I+YLGP
Sbjct: 226  DCSYYKDHPCTSSPKERIGEVKEENEKDSYAWLEEIEMPENSEMDGIEYLGP 277


Top