BLASTX nr result

ID: Dioscorea21_contig00023416 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00023416
         (1247 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266459.1| PREDICTED: cysteine synthase 2 [Vitis vinife...   596   e-168
emb|CAN80713.1| hypothetical protein VITISV_024163 [Vitis vinifera]   594   e-167
ref|XP_003558493.1| PREDICTED: cysteine synthase 2-like [Brachyp...   581   e-163
ref|XP_002298863.1| predicted protein [Populus trichocarpa] gi|2...   578   e-162
ref|NP_175984.1| cysteine synthase A [Arabidopsis thaliana] gi|4...   573   e-161

>ref|XP_002266459.1| PREDICTED: cysteine synthase 2 [Vitis vinifera]
            gi|296089907|emb|CBI39726.3| unnamed protein product
            [Vitis vinifera]
          Length = 421

 Score =  596 bits (1537), Expect = e-168
 Identities = 297/390 (76%), Positives = 332/390 (85%), Gaps = 10/390 (2%)
 Frame = +1

Query: 94   RRRMSRASRKGEKKGLIAAVGNTPLIRINSLSDATGCEILGKAEFLNPGGSVKDRVAVKI 273
            R+     S+K  ++GLI A+GNTPLIRINSLS+ATGCEILGKAEFLNPGGSVKDRVAVKI
Sbjct: 29   RKTHKPISKKKPRRGLIEAIGNTPLIRINSLSEATGCEILGKAEFLNPGGSVKDRVAVKI 88

Query: 274  IEEALDAGALVPGGVVTEGSAGSTAISLATVAPSFGCKCHVVIPDDAAIEKSEILEALGA 453
            IEEAL++G L PGGVVTEGSAGSTAISLATVAP++GCKCHVVIPDD AIEKS+ILEALGA
Sbjct: 89   IEEALESGQLAPGGVVTEGSAGSTAISLATVAPAYGCKCHVVIPDDVAIEKSQILEALGA 148

Query: 454  TVERVRPVSITHRDHYVNIARRRASEANILAT----------NGYRQCNGHTKEAGEFPA 603
            TVERVRPVSITH+DHYVN+ARRRA EAN  A+          +G    NGH  E  +  +
Sbjct: 149  TVERVRPVSITHKDHYVNVARRRALEANEFASKHGKYIGMDADGLVPANGHISEEEKQNS 208

Query: 604  VSMADCKGGFFADQFENLANFRAHYEGTGPEIWEQTGGNLHGFIXXXXXXXXXXXISRFL 783
            V  ++CKGGFFADQFENLANFRAHYEGTGPEIWEQT G+LH F+           +SR L
Sbjct: 209  VFSSNCKGGFFADQFENLANFRAHYEGTGPEIWEQTSGDLHAFVAAAGTGGTLAGVSRSL 268

Query: 784  KEKDPRIKCFLIDPPGSGLYNKVKRGVMYTKEEAEGRRLKNPFDTITEGIGINRLTMNFM 963
            +EK P IKCFLIDPPGSGL+NK+ RGVMYT+EEAEGRRLKNPFDTITEGIGINRLT NF+
Sbjct: 269  QEKSPNIKCFLIDPPGSGLFNKITRGVMYTREEAEGRRLKNPFDTITEGIGINRLTQNFL 328

Query: 964  MAELDGAFRGTDKEAVEMSRYLLKNDGLFVGSSSAMNCVGAVRLARALGPGHTIVTILCD 1143
            MAELDGAF GTD EAVEMSRYLLKNDGLFVGSSSAMNCVGAVR+A+++GPGHTIVTILCD
Sbjct: 329  MAELDGAFHGTDMEAVEMSRYLLKNDGLFVGSSSAMNCVGAVRVAQSIGPGHTIVTILCD 388

Query: 1144 SGIRHLSKFHNSQYLSDHGLTPSAAGLEFL 1233
            SG+RHLSKF++SQYLS HGLTP+A GLEFL
Sbjct: 389  SGMRHLSKFYDSQYLSQHGLTPTATGLEFL 418


>emb|CAN80713.1| hypothetical protein VITISV_024163 [Vitis vinifera]
          Length = 421

 Score =  594 bits (1531), Expect = e-167
 Identities = 296/390 (75%), Positives = 331/390 (84%), Gaps = 10/390 (2%)
 Frame = +1

Query: 94   RRRMSRASRKGEKKGLIAAVGNTPLIRINSLSDATGCEILGKAEFLNPGGSVKDRVAVKI 273
            R+     S+K  ++GLI A+GNTPLIRINSLS+ATGCEILGKAEFLNPGGSVKDRVAVKI
Sbjct: 29   RKTHKPISKKKPRRGLIEAIGNTPLIRINSLSEATGCEILGKAEFLNPGGSVKDRVAVKI 88

Query: 274  IEEALDAGALVPGGVVTEGSAGSTAISLATVAPSFGCKCHVVIPDDAAIEKSEILEALGA 453
            IEEAL++G L PGGVV EGSAGSTAISLATVAP++GCKCHVVIPDD AIEKS+ILEALGA
Sbjct: 89   IEEALESGQLAPGGVVXEGSAGSTAISLATVAPAYGCKCHVVIPDDVAIEKSQILEALGA 148

Query: 454  TVERVRPVSITHRDHYVNIARRRASEANILAT----------NGYRQCNGHTKEAGEFPA 603
            TVERVRPVSITH+DHYVN+ARRRA EAN  A+          +G    NGH  E  +  +
Sbjct: 149  TVERVRPVSITHKDHYVNVARRRALEANEFASKHGKYIGMDADGLVPANGHISEEEKQNS 208

Query: 604  VSMADCKGGFFADQFENLANFRAHYEGTGPEIWEQTGGNLHGFIXXXXXXXXXXXISRFL 783
            V  ++CKGGFFADQFENLANFRAHYEGTGPEIWEQT G+LH F+           +SR L
Sbjct: 209  VFSSNCKGGFFADQFENLANFRAHYEGTGPEIWEQTSGDLHAFVAAAGTGGTLAGVSRSL 268

Query: 784  KEKDPRIKCFLIDPPGSGLYNKVKRGVMYTKEEAEGRRLKNPFDTITEGIGINRLTMNFM 963
            +EK P IKCFLIDPPGSGL+NK+ RGVMYT+EEAEGRRLKNPFDTITEGIGINRLT NF+
Sbjct: 269  QEKSPNIKCFLIDPPGSGLFNKITRGVMYTREEAEGRRLKNPFDTITEGIGINRLTQNFL 328

Query: 964  MAELDGAFRGTDKEAVEMSRYLLKNDGLFVGSSSAMNCVGAVRLARALGPGHTIVTILCD 1143
            MAELDGAF GTD EAVEMSRYLLKNDGLFVGSSSAMNCVGAVR+A+++GPGHTIVTILCD
Sbjct: 329  MAELDGAFHGTDMEAVEMSRYLLKNDGLFVGSSSAMNCVGAVRVAQSIGPGHTIVTILCD 388

Query: 1144 SGIRHLSKFHNSQYLSDHGLTPSAAGLEFL 1233
            SG+RHLSKF++SQYLS HGLTP+A GLEFL
Sbjct: 389  SGMRHLSKFYDSQYLSQHGLTPTATGLEFL 418


>ref|XP_003558493.1| PREDICTED: cysteine synthase 2-like [Brachypodium distachyon]
          Length = 439

 Score =  581 bits (1497), Expect = e-163
 Identities = 296/423 (69%), Positives = 343/423 (81%), Gaps = 26/423 (6%)
 Frame = +1

Query: 46   VFSCYVLF-SNGSKLSWRRRMSRASRKGEKKGLIAAVGNTPLIRINSLSDATGCEILGKA 222
            +F+ Y+L   +GSK  W R    + R+  +KGL+ A+GNTPLIRINSLSDATGCEILGKA
Sbjct: 18   LFAYYLLLHKSGSKFPWSRTTGASGRRTRRKGLVEAIGNTPLIRINSLSDATGCEILGKA 77

Query: 223  EFLNPGGSVKDRVAVKIIEEALDAGALVPGGVVTEGSAGSTAISLATVAPSFGCKCHVVI 402
            EFLNPGGSVKDRVAVKIIEEAL +G LV GGVVTEGSAGSTAISLATVAP++GC+CHVVI
Sbjct: 78   EFLNPGGSVKDRVAVKIIEEALKSGDLVCGGVVTEGSAGSTAISLATVAPAYGCRCHVVI 137

Query: 403  PDDAAIEKSEILEALGATVERVRPVSITHRDHYVNIARRRASEANILAT---NGYRQCNG 573
            PDDAA+EKS+I+EALGA VERVRPVSITHRDH+VNIARRRA EANI +T   +  RQ NG
Sbjct: 138  PDDAAVEKSQIIEALGAIVERVRPVSITHRDHFVNIARRRALEANIASTQIESNDRQTNG 197

Query: 574  ---------HTKEA-------------GEFPAVSMADCKGGFFADQFENLANFRAHYEGT 687
                     HTK+              G++  +S  D KGGFFADQFENLAN+RAHYE T
Sbjct: 198  SAYVKTKMLHTKQTNGSAHANTELSSTGKYCPIS--DSKGGFFADQFENLANYRAHYEWT 255

Query: 688  GPEIWEQTGGNLHGFIXXXXXXXXXXXISRFLKEKDPRIKCFLIDPPGSGLYNKVKRGVM 867
            GPEIWEQT G +H F+           +SR+LKEK+  ++CFL+DPPGSGL+NKV RGVM
Sbjct: 256  GPEIWEQTKGTIHAFVAAAGTGGTIAGVSRYLKEKNRNVQCFLMDPPGSGLFNKVTRGVM 315

Query: 868  YTKEEAEGRRLKNPFDTITEGIGINRLTMNFMMAELDGAFRGTDKEAVEMSRYLLKNDGL 1047
            YTKEEAEG+RLKNPFDTITEGIGINR+T NFMMAELDGA+RGTD+EAVEMSR+LL+ DGL
Sbjct: 316  YTKEEAEGKRLKNPFDTITEGIGINRVTRNFMMAELDGAYRGTDREAVEMSRFLLRKDGL 375

Query: 1048 FVGSSSAMNCVGAVRLARALGPGHTIVTILCDSGIRHLSKFHNSQYLSDHGLTPSAAGLE 1227
            F+GSSSAMNCVGA R+A+ LGPGHTIVTILCDSG+RHLSKF N +YL++HGLTP+A GLE
Sbjct: 376  FLGSSSAMNCVGAARVAQDLGPGHTIVTILCDSGMRHLSKFFNDEYLANHGLTPTATGLE 435

Query: 1228 FLD 1236
            FLD
Sbjct: 436  FLD 438


>ref|XP_002298863.1| predicted protein [Populus trichocarpa] gi|222846121|gb|EEE83668.1|
            predicted protein [Populus trichocarpa]
          Length = 424

 Score =  578 bits (1490), Expect = e-162
 Identities = 292/384 (76%), Positives = 327/384 (85%), Gaps = 10/384 (2%)
 Frame = +1

Query: 112  ASRKGEKKGLIAAVGNTPLIRINSLSDATGCEILGKAEFLNPGGSVKDRVAVKIIEEALD 291
            +  K  + GLI AVGNTPLIRINSLS+ATGCEILGK EFLNPGGSVKDRVAVKIIEEAL+
Sbjct: 39   SKNKKPRNGLIHAVGNTPLIRINSLSEATGCEILGKCEFLNPGGSVKDRVAVKIIEEALE 98

Query: 292  AGALVPGGVVTEGSAGSTAISLATVAPSFGCKCHVVIPDDAAIEKSEILEALGATVERVR 471
            +G LV GGVVTEGSAGSTAISLATVAP++GCKCHVVIPDD AIEKS+ILEALGATVERVR
Sbjct: 99   SGQLVCGGVVTEGSAGSTAISLATVAPAYGCKCHVVIPDDVAIEKSQILEALGATVERVR 158

Query: 472  PVSITHRDHYVNIARRRASEANILATNGYR----------QCNGHTKEAGEFPAVSMADC 621
            PVSITHRDHYVNIARRRA EAN LA+   +          Q NG   +  +  ++  + C
Sbjct: 159  PVSITHRDHYVNIARRRALEANELASKLRKTEKIDGKVLEQINGCISDGEKKGSIFSSYC 218

Query: 622  KGGFFADQFENLANFRAHYEGTGPEIWEQTGGNLHGFIXXXXXXXXXXXISRFLKEKDPR 801
             GGFFADQFENLANFRAHY+GTGPEIWEQ+G +L  F+           IS FL+EK+P 
Sbjct: 219  SGGFFADQFENLANFRAHYQGTGPEIWEQSGCSLDSFVAAAGTGGTVAGISSFLQEKNPN 278

Query: 802  IKCFLIDPPGSGLYNKVKRGVMYTKEEAEGRRLKNPFDTITEGIGINRLTMNFMMAELDG 981
            IKCFLIDPPGSGL+NKV RGVMYT+EEAEG+RLKNPFDTITEGIGINRLT NF MA+LDG
Sbjct: 279  IKCFLIDPPGSGLFNKVTRGVMYTREEAEGKRLKNPFDTITEGIGINRLTQNFKMAKLDG 338

Query: 982  AFRGTDKEAVEMSRYLLKNDGLFVGSSSAMNCVGAVRLARALGPGHTIVTILCDSGIRHL 1161
            A+RGTDKEAVEMSRYLLKNDGLF+GSSSAMNCVGAVR+A++LGPGHTIVTILCDSG+RHL
Sbjct: 339  AYRGTDKEAVEMSRYLLKNDGLFLGSSSAMNCVGAVRVAQSLGPGHTIVTILCDSGMRHL 398

Query: 1162 SKFHNSQYLSDHGLTPSAAGLEFL 1233
            SKFH++QYLS+HGLTP+A GLEFL
Sbjct: 399  SKFHDAQYLSEHGLTPTATGLEFL 422


>ref|NP_175984.1| cysteine synthase A [Arabidopsis thaliana] gi|46518441|gb|AAS99702.1|
            At1g55880 [Arabidopsis thaliana]
            gi|110741637|dbj|BAE98765.1| hypothetical protein
            [Arabidopsis thaliana] gi|332195192|gb|AEE33313.1|
            cysteine synthase A [Arabidopsis thaliana]
          Length = 421

 Score =  573 bits (1478), Expect = e-161
 Identities = 290/392 (73%), Positives = 330/392 (84%), Gaps = 14/392 (3%)
 Frame = +1

Query: 100  RMSRASRKGEK----KGLIAAVGNTPLIRINSLSDATGCEILGKAEFLNPGGSVKDRVAV 267
            R+S   ++ EK     GL+ A+GNTPLIRINSLS+ATGCEILGK EFLNPGGSVKDRVAV
Sbjct: 27   RLSEKKKRKEKLTMRNGLVDAIGNTPLIRINSLSEATGCEILGKCEFLNPGGSVKDRVAV 86

Query: 268  KIIEEALDAGALVPGGVVTEGSAGSTAISLATVAPSFGCKCHVVIPDDAAIEKSEILEAL 447
            KII+EAL++G L PGG+VTEGSAGSTAISLATVAP++GCKCHVVIPDDAAIEKS+I+EAL
Sbjct: 87   KIIQEALESGKLFPGGIVTEGSAGSTAISLATVAPAYGCKCHVVIPDDAAIEKSQIIEAL 146

Query: 448  GATVERVRPVSITHRDHYVNIARRRASEANILA--------TNGYRQ--CNGHTKEAGEF 597
            GA+VERVRPVSITH+DHYVNIARRRA EAN LA        TNG  Q   NG T E  + 
Sbjct: 147  GASVERVRPVSITHKDHYVNIARRRADEANELASKRRLGSETNGIHQEKTNGCTVEEVKE 206

Query: 598  PAVSMADCKGGFFADQFENLANFRAHYEGTGPEIWEQTGGNLHGFIXXXXXXXXXXXISR 777
            P++      GGFFADQFENLAN+RAHYEGTGPEIW QT GN+  F+           +SR
Sbjct: 207  PSLFSDSVTGGFFADQFENLANYRAHYEGTGPEIWHQTQGNIDAFVAAAGTGGTLAGVSR 266

Query: 778  FLKEKDPRIKCFLIDPPGSGLYNKVKRGVMYTKEEAEGRRLKNPFDTITEGIGINRLTMN 957
            FL++K+ R+KCFLIDPPGSGLYNKV RGVMYT+EEAEGRRLKNPFDTITEGIGINRLT N
Sbjct: 267  FLQDKNERVKCFLIDPPGSGLYNKVTRGVMYTREEAEGRRLKNPFDTITEGIGINRLTKN 326

Query: 958  FMMAELDGAFRGTDKEAVEMSRYLLKNDGLFVGSSSAMNCVGAVRLARALGPGHTIVTIL 1137
            F+MA+LDG FRGTDKEAVEMSR+LLKNDGLFVGSSSAMNCVGAVR+A+ LGPGHTIVTIL
Sbjct: 327  FLMAKLDGGFRGTDKEAVEMSRFLLKNDGLFVGSSSAMNCVGAVRVAQTLGPGHTIVTIL 386

Query: 1138 CDSGIRHLSKFHNSQYLSDHGLTPSAAGLEFL 1233
            CDSG+RHLSKFH+ +YL+ +GL+P+A GLEFL
Sbjct: 387  CDSGMRHLSKFHDPKYLNLYGLSPTAIGLEFL 418

