BLASTX nr result

ID: Angelica22_contig00010230 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00010230
         (1212 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003543744.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   324   3e-86
ref|XP_003554358.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   323   5e-86
ref|XP_003554359.1| PREDICTED: 2-aminoethanethiol dioxygenase-li...   321   2e-85
ref|XP_002870743.1| predicted protein [Arabidopsis lyrata subsp....   317   3e-84
ref|NP_198805.1| uncharacterized protein [Arabidopsis thaliana] ...   316   7e-84

>ref|XP_003543744.1| PREDICTED: 2-aminoethanethiol dioxygenase-like [Glycine max]
          Length = 281

 Score =  324 bits (830), Expect = 3e-86
 Identities = 160/281 (56%), Positives = 193/281 (68%), Gaps = 10/281 (3%)
 Frame = -2

Query: 1076 EKLPGQKKGKDYMEXXXXXXR----------QKKLLPVQVLYDTCQQVFADCGPGIVPGP 927
            E+    +KG+D+ E                 Q+K  PVQ L++TC+ VFA  G G VP  
Sbjct: 4    ERTLADRKGRDFCELPRETIASSNSRRNRRRQRKKPPVQKLFETCKVVFASAGTGFVPPH 63

Query: 926  EKIQLLKTVLDGISGADVGVNPNMPFFREQETEGLPTITYLHIHECDKFSIGIFCLPPSG 747
            E I  L++VLDGI   DVG+ P+MP+FR   T+ +P ITYLHI+EC+KFS+GIFCLPPSG
Sbjct: 64   EDIDELQSVLDGIKPEDVGLRPDMPYFRTSATQRVPRITYLHIYECEKFSMGIFCLPPSG 123

Query: 746  VLPLHNHPEMTVFSKLLFGTMHIKSLDWVNGIPSNKVPDGDSSNANGALSSEARLAKIKV 567
            V+PLHNHP MTVFSKLLFGTMHIKS DWV  +P         S   G    E RLAK+KV
Sbjct: 124  VIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDLPPESPTTIKPSENQG---PEMRLAKVKV 180

Query: 566  DSDFIAPCDTSILYPTDGGNMHCFTAVTQCAVLDVLGPPYNDAEGRHCAYYIEHPFDRIP 387
            D+DF APC+ SILYP DGGN+HCFTAVT CAVLDVLGPPY+DAEGRHC YY   PF    
Sbjct: 181  DADFTAPCNPSILYPEDGGNLHCFTAVTACAVLDVLGPPYSDAEGRHCTYYHNFPFSNFS 240

Query: 386  ANGISVAEEEKEGYEWLQERDKPEGLTVYGVPYSGPPIAKT 264
            A+G+S+ EEEK  YEWLQER++ E L V G  Y+GP I ++
Sbjct: 241  ADGLSIPEEEKNAYEWLQEREELEDLEVNGKMYNGPKIVES 281


>ref|XP_003554358.1| PREDICTED: 2-aminoethanethiol dioxygenase-like isoform 1 [Glycine
            max]
          Length = 281

 Score =  323 bits (828), Expect = 5e-86
 Identities = 160/281 (56%), Positives = 192/281 (68%), Gaps = 10/281 (3%)
 Frame = -2

Query: 1076 EKLPGQKKGKDYMEXXXXXXR----------QKKLLPVQVLYDTCQQVFADCGPGIVPGP 927
            E+    +KG+D+ E                 Q+K  PVQ L++TC+ VFA  G G VP  
Sbjct: 4    ERTLADRKGRDFCELPRETIASSNSRRNRRRQRKKPPVQKLFETCKVVFASAGTGFVPPH 63

Query: 926  EKIQLLKTVLDGISGADVGVNPNMPFFREQETEGLPTITYLHIHECDKFSIGIFCLPPSG 747
            E I  L++VLDGI   DVG+ P+MP+FR   T+ +P ITYLHI+EC+KFS+GIFCLPPSG
Sbjct: 64   EDIDELQSVLDGIKPEDVGLRPDMPYFRTSATQRVPRITYLHIYECEKFSMGIFCLPPSG 123

Query: 746  VLPLHNHPEMTVFSKLLFGTMHIKSLDWVNGIPSNKVPDGDSSNANGALSSEARLAKIKV 567
            V+PLHNHP MTVFSKLLFGTMHIKS DWV   P         S   G    E RLAK+KV
Sbjct: 124  VIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDSPPESPTTLKPSENQG---PEMRLAKVKV 180

Query: 566  DSDFIAPCDTSILYPTDGGNMHCFTAVTQCAVLDVLGPPYNDAEGRHCAYYIEHPFDRIP 387
            D+DF APC+ SILYP DGGN+HCFTAVT CAVLDVLGPPY+DAEGRHC YY + PF    
Sbjct: 181  DADFTAPCNPSILYPEDGGNLHCFTAVTACAVLDVLGPPYSDAEGRHCTYYHDFPFSNFS 240

Query: 386  ANGISVAEEEKEGYEWLQERDKPEGLTVYGVPYSGPPIAKT 264
             +G+S+ EEEK  YEWLQERD+ E L V G  Y+GP I ++
Sbjct: 241  VDGLSIPEEEKNAYEWLQERDELEDLEVNGKMYNGPKIVES 281


>ref|XP_003554359.1| PREDICTED: 2-aminoethanethiol dioxygenase-like isoform 2 [Glycine
            max]
          Length = 276

 Score =  321 bits (823), Expect = 2e-85
 Identities = 158/281 (56%), Positives = 192/281 (68%), Gaps = 10/281 (3%)
 Frame = -2

Query: 1076 EKLPGQKKGKDYMEXXXXXXR----------QKKLLPVQVLYDTCQQVFADCGPGIVPGP 927
            E+    +KG+D+ E                 Q+K  PVQ L++TC+ VFA  G G VP  
Sbjct: 4    ERTLADRKGRDFCELPRETIASSNSRRNRRRQRKKPPVQKLFETCKVVFASAGTGFVPPH 63

Query: 926  EKIQLLKTVLDGISGADVGVNPNMPFFREQETEGLPTITYLHIHECDKFSIGIFCLPPSG 747
            E I  L++VLDGI   DVG+ P+MP+FR   T+ +P ITYLHI+EC+KFS+GIFCLPPSG
Sbjct: 64   EDIDELQSVLDGIKPEDVGLRPDMPYFRTSATQRVPRITYLHIYECEKFSMGIFCLPPSG 123

Query: 746  VLPLHNHPEMTVFSKLLFGTMHIKSLDWVNGIPSNKVPDGDSSNANGALSSEARLAKIKV 567
            V+PLHNHP MTVFSKLLFGTMHIKS DW        V D    +      SE+ LAK+KV
Sbjct: 124  VIPLHNHPGMTVFSKLLFGTMHIKSYDW--------VVDSPPESPTTLKPSESELAKVKV 175

Query: 566  DSDFIAPCDTSILYPTDGGNMHCFTAVTQCAVLDVLGPPYNDAEGRHCAYYIEHPFDRIP 387
            D+DF APC+ SILYP DGGN+HCFTAVT CAVLDVLGPPY+DAEGRHC YY + PF    
Sbjct: 176  DADFTAPCNPSILYPEDGGNLHCFTAVTACAVLDVLGPPYSDAEGRHCTYYHDFPFSNFS 235

Query: 386  ANGISVAEEEKEGYEWLQERDKPEGLTVYGVPYSGPPIAKT 264
             +G+S+ EEEK  YEWLQERD+ E L V G  Y+GP I ++
Sbjct: 236  VDGLSIPEEEKNAYEWLQERDELEDLEVNGKMYNGPKIVES 276


>ref|XP_002870743.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297316579|gb|EFH47002.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 284

 Score =  317 bits (813), Expect = 3e-84
 Identities = 156/249 (62%), Positives = 184/249 (73%), Gaps = 2/249 (0%)
 Frame = -2

Query: 1013 QKKLL-PVQVLYDTCQQVFADCGPGIVPGPEKIQLLKTVLDGISGADVGVNPNMPFFREQ 837
            QK L+ PVQ L+DTC++VFA+   G VP  E I++L+ VLD I+  DVGV+P MPFFR +
Sbjct: 48   QKTLICPVQKLFDTCKKVFANGKSGTVPSQENIEMLRAVLDVITPEDVGVSPKMPFFRSK 107

Query: 836  ETEGLPTITYLHIHECDKFSIGIFCLPPSGVLPLHNHPEMTVFSKLLFGTMHIKSLDWVN 657
             T   P +TYLHI+ C +FSI IFCLPPSGV+PLHNHPEMTVFSKLLFGT+HIKS DWV 
Sbjct: 108  VTGSSPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTVHIKSYDWVA 167

Query: 656  GIPSNKVPDGDSSNANGALSSEARLAKIKVDSDFIAPCDTSILYPTDGGNMHCFTAVTQC 477
              P                SS+ RLAK+KVDSDF APCDTSILYP DGGNMHCFTA T C
Sbjct: 168  DSPQP--------------SSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTAC 213

Query: 476  AVLDVLGPPYNDAEGRHCAYYIEHPFDRIPANGISVAEEEKEGYEWLQER-DKPEGLTVY 300
            AVLDVLGPPY+D  GRHC YY ++PF     +G++VAEEEKEGY WL+ER ++PE LTV 
Sbjct: 214  AVLDVLGPPYSDPAGRHCTYYFDYPFSSFSVDGVAVAEEEKEGYAWLKEREEEPEDLTVS 273

Query: 299  GVPYSGPPI 273
             + YSGP I
Sbjct: 274  AMMYSGPTI 282


>ref|NP_198805.1| uncharacterized protein [Arabidopsis thaliana]
            gi|21536502|gb|AAM60834.1| unknown [Arabidopsis thaliana]
            gi|27808558|gb|AAO24559.1| At5g39890 [Arabidopsis
            thaliana] gi|110736241|dbj|BAF00091.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332007105|gb|AED94488.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 276

 Score =  316 bits (810), Expect = 7e-84
 Identities = 155/245 (63%), Positives = 179/245 (73%), Gaps = 1/245 (0%)
 Frame = -2

Query: 1004 LLPVQVLYDTCQQVFADCGPGIVPGPEKIQLLKTVLDGISGADVGVNPNMPFFREQETEG 825
            + PVQ L+DTC++VFAD   G VP  E I++L+ VLD I   DVGVNP M +FR   T  
Sbjct: 44   ICPVQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYFRSTVTGR 103

Query: 824  LPTITYLHIHECDKFSIGIFCLPPSGVLPLHNHPEMTVFSKLLFGTMHIKSLDWVNGIPS 645
             P +TYLHI+ C +FSI IFCLPPSGV+PLHNHPEMTVFSKLLFGTMHIKS DWV   P 
Sbjct: 104  SPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDWV---PD 160

Query: 644  NKVPDGDSSNANGALSSEARLAKIKVDSDFIAPCDTSILYPTDGGNMHCFTAVTQCAVLD 465
            +  P           SS+ RLAK+KVDSDF APCDTSILYP DGGNMHCFTA T CAVLD
Sbjct: 161  SPQP-----------SSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLD 209

Query: 464  VLGPPYNDAEGRHCAYYIEHPFDRIPANGISVAEEEKEGYEWLQER-DKPEGLTVYGVPY 288
            V+GPPY+D  GRHC YY ++PF     +G+ VAEEEKEGY WL+ER +KPE LTV  + Y
Sbjct: 210  VIGPPYSDPAGRHCTYYFDYPFSSFSVDGVVVAEEEKEGYAWLKEREEKPEDLTVTALMY 269

Query: 287  SGPPI 273
            SGP I
Sbjct: 270  SGPTI 274


Top