BLASTX nr result

ID: Dioscorea21_contig00019309

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00019309
         (961 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002532597.1| strictosidine synthase, putative [Ricinus co...   325   1e-86
ref|XP_002308895.1| predicted protein [Populus trichocarpa] gi|2...   309   6e-82
gb|EAY82206.1| hypothetical protein OsI_37409 [Oryza sativa Indi...   309   6e-82
ref|XP_002323253.1| predicted protein [Populus trichocarpa] gi|2...   309   7e-82
gb|ABA95779.2| Strictosidine synthase family protein, expressed ...   309   7e-82

>ref|XP_002532597.1| strictosidine synthase, putative [Ricinus communis]
           gi|223527685|gb|EEF29794.1| strictosidine synthase,
           putative [Ricinus communis]
          Length = 375

 Score =  325 bits (832), Expect = 1e-86
 Identities = 156/307 (50%), Positives = 218/307 (71%), Gaps = 8/307 (2%)
 Frame = -3

Query: 959 GPYTGVSDGRIFKWSPEHHQWTQFAVSSGYMDEECAGSQ--DKEKEHICGRPLGLEFNNK 786
           GPYTG+SDGRI +W     +W  FAV+S Y D  C G      + EHICGRPLGL FN  
Sbjct: 67  GPYTGISDGRIIRWEEHEQRWIDFAVTSLYRDG-CEGPHVDQYQMEHICGRPLGLCFNES 125

Query: 785 TGDLFVADAYKGLLKATQDERILKPVVTSAEGGT-LGFTNSLDIDQNSGVIYFSDSSINF 609
            GDL+VADAY GLLK  +D  +   + T  +      FTNSLD+D +S  +YF+DSS  +
Sbjct: 126 NGDLYVADAYMGLLKVGRDGGLATTIATHGDDDIPFNFTNSLDVDPSSSALYFTDSSSRY 185

Query: 608 QRRQFMKAIITGDRTGRVMKYDPEEEKVEVLINGLAFANGVALSRDGSFLLIVETTECRI 429
           QRR+++ AI++GD++GR+++YDPE++KV +L+  L+F NGVALS+DG+F+LI ETT CR+
Sbjct: 186 QRREYIYAILSGDKSGRLLRYDPEDKKVRILLGNLSFPNGVALSKDGNFILIAETTTCRV 245

Query: 428 LKYKLKEKS---VQVLVKLPGFPDNIKRSPRGGYWVAMHSRRKKVVQWALSVSWIRRMIP 258
           LKY +K      ++V  ++PGFPDNIKRSPRGGYWVA++SRR K ++W LS  WI   + 
Sbjct: 246 LKYWIKTSKAGILEVFAQVPGFPDNIKRSPRGGYWVAINSRRDKFLEWVLSHPWIGNSLI 305

Query: 257 YLPFDLHKLSELMERWRGGALAMRIGDDGEVLEVLD--NRFKFISEVHERNGTLWIGSVL 84
            LPFDL K+  ++ ++RG  +A+R+ ++G++LEV +  NRFK +SEV E++G LWIGS+ 
Sbjct: 306 KLPFDLMKIYSILGKYRGTGMAVRLDENGDILEVFEDRNRFKTLSEVMEKDGKLWIGSIN 365

Query: 83  VPFASLY 63
           +PF   Y
Sbjct: 366 LPFVGRY 372


>ref|XP_002308895.1| predicted protein [Populus trichocarpa] gi|222854871|gb|EEE92418.1|
           predicted protein [Populus trichocarpa]
          Length = 368

 Score =  309 bits (792), Expect = 6e-82
 Identities = 152/308 (49%), Positives = 209/308 (67%), Gaps = 7/308 (2%)
 Frame = -3

Query: 959 GPYTGVSDGRIFKWSPEHHQWTQFAVSSGYMDEECAGSQDKEKEHICGRPLGLEFNNKTG 780
           GPY  +SDGRI KW      WT FAV+S      C        EHICGRPLGL F+   G
Sbjct: 62  GPYASLSDGRIVKWQGNRKGWTDFAVASPNR-YACKQQPFAHTEHICGRPLGLCFDETHG 120

Query: 779 DLFVADAYKGLLKATQDERILKPVVTSAEGGTLGFTNSLDIDQNSGVIYFSDSSINFQRR 600
           DL++ADAY GLL+      +   +VT A+G  L FTN LDIDQ+SG IYF+DSS  +QRR
Sbjct: 121 DLYIADAYMGLLRVGTQGGLATKIVTHAQGIPLRFTNGLDIDQSSGAIYFTDSSSQYQRR 180

Query: 599 QFMKAIITGDRTGRVMKYDPEEEKVEVLINGLAFANGVALSRDGSFLLIVETTECRILKY 420
           Q++  +++GD++GR+MKYDP  ++V VL++ L F NGVALS+DG+F+L+ ETT CRIL+Y
Sbjct: 181 QYLSVVLSGDKSGRLMKYDPVNKQVRVLLSNLTFPNGVALSKDGNFILLAETTRCRILRY 240

Query: 419 KLKEK---SVQVLVKLPGFPDNIKRSPRGGYWVAMHSRRKKVVQWALSVSWIRRMIPYLP 249
            +K     +V+V  +L GFPDNIKRSPRGGYWV M+SRR+K+ +   S  WI  ++  LP
Sbjct: 241 WIKTSKAGTVEVFAQLQGFPDNIKRSPRGGYWVGMNSRREKLSELLFSYPWIGNVLLKLP 300

Query: 248 FDLHKLSELMERWRGGALAMRIGDDGEVLEVLDNR----FKFISEVHERNGTLWIGSVLV 81
            D+  L   + ++RG  LA+R+ ++G++LEV ++      K ISEV E++G LWIGS+ +
Sbjct: 301 LDIAMLQSTLSKYRGSGLAVRLSENGDILEVFEDNDGDGLKSISEVMEKDGRLWIGSIAL 360

Query: 80  PFASLYKL 57
           PFA  Y++
Sbjct: 361 PFAGRYRI 368


>gb|EAY82206.1| hypothetical protein OsI_37409 [Oryza sativa Indica Group]
          Length = 371

 Score =  309 bits (792), Expect = 6e-82
 Identities = 145/312 (46%), Positives = 216/312 (69%), Gaps = 11/312 (3%)
 Frame = -3

Query: 959 GPYTGVSDGRIFKW-SPEHHQWTQFAVSSGYMDEECAGSQDKEKEHICGRPLGLEFNNKT 783
           GPYT VSDGR+ KW  P   +W + + S   + + C GS+D ++E  CGRPLGL+FN+KT
Sbjct: 60  GPYTSVSDGRVLKWLPPPERRWVEHSCSVPELLDSCRGSKDTKREQECGRPLGLKFNSKT 119

Query: 782 GDLFVADAYKGLLKATQDERILKPVVTSAEGGTLGFTNSLDIDQNSGVIYFSDSSINFQR 603
           G+L+VADAY GL   +  E + +P+V         F+N ++ID  +GVIYF+++S  FQR
Sbjct: 120 GELYVADAYLGLRVVSPGENVSRPLVPKWTESPFSFSNGVEIDHETGVIYFTETSTRFQR 179

Query: 602 RQFMKAIITGDRTGRVMKYDPEEEKVEVLINGLAFANGVALSRDGSFLLIVETTECRILK 423
           R+F+  +ITGD TGR++KYDP+E KVEVL++GL F NG+A+S DGS+LL+ ETT  +IL+
Sbjct: 180 REFLNIVITGDNTGRLLKYDPKENKVEVLVDGLCFPNGLAMSNDGSYLLLAETTTGKILR 239

Query: 422 YKL---KEKSVQVLVKLPGFPDNIKRSPRGGYWVAMHSRRKKVVQWALSVSWIRRMIPYL 252
           Y +   K  +++ +V+LPGFPDNIK SPRGG+WV +H++R K+ +W++S  W+R++I  L
Sbjct: 240 YWIKTPKASTIEEVVQLPGFPDNIKMSPRGGFWVGLHAKRGKIAEWSISYPWLRKVILKL 299

Query: 251 PFD-LHKLSELMERWRGGALAMRIGDDGEVLEVLD------NRFKFISEVHERNGTLWIG 93
           P   + +++  +  +    +A+R+ +DG+ +E +         FK ISEV E+NG LWIG
Sbjct: 300 PAQRIQRITSFLTGFGRQVIALRLSEDGKTIEAMSVHGDVRKLFKSISEVEEKNGNLWIG 359

Query: 92  SVLVPFASLYKL 57
           SVL PF  LY++
Sbjct: 360 SVLSPFLGLYRI 371


>ref|XP_002323253.1| predicted protein [Populus trichocarpa] gi|222867883|gb|EEF05014.1|
           predicted protein [Populus trichocarpa]
          Length = 349

 Score =  309 bits (791), Expect = 7e-82
 Identities = 151/309 (48%), Positives = 213/309 (68%), Gaps = 8/309 (2%)
 Frame = -3

Query: 959 GPYTGVSDGRIFKWSPEHHQWTQFAVSSGYMDEECAGSQDKEK-EHICGRPLGLEFNNKT 783
           GPYT +SDGRI KW  +  +W  FAV+S   D  C G  D  + EH+CGRPLG  F+   
Sbjct: 42  GPYTSLSDGRIIKWQGDKKRWIDFAVTSPNRDG-CGGPHDHHQMEHVCGRPLGSCFDETH 100

Query: 782 GDLFVADAYKGLLKATQDERILKPVVTSAEGGTLGFTNSLDIDQNSGVIYFSDSSINFQR 603
           GDL++ADAY GLL+   +  +   + T A+G    FTNSLDIDQ+SG IYF+DSS  +QR
Sbjct: 101 GDLYIADAYMGLLRVGPEGGLATKIATHAQGIPFRFTNSLDIDQSSGAIYFTDSSTQYQR 160

Query: 602 RQFMKAIITGDRTGRVMKYDPEEEKVEVLINGLAFANGVALSRDGSFLLIVETTECRILK 423
           R ++  +++GD++GR+MKYD   ++V VL+  L F NGVALS DGSF+L+ ETT CRIL+
Sbjct: 161 RDYLSVVLSGDKSGRLMKYDTASKQVTVLLKNLTFPNGVALSTDGSFVLLAETTSCRILR 220

Query: 422 YKLKEK---SVQVLVKLPGFPDNIKRSPRGGYWVAMHSRRKKVVQWALSVSWIRRMIPYL 252
           Y +K     +++V  +L GFPDNIKRSPRGGYWV ++S+R+K+ +   S  WI +++  L
Sbjct: 221 YWIKTSKAGALEVFAQLQGFPDNIKRSPRGGYWVGINSKREKLSELLFSYPWIGKVLLKL 280

Query: 251 PFDLHKLSELMERWRGGALAMRIGDDGEVLEVLD----NRFKFISEVHERNGTLWIGSVL 84
           P D+ K    + ++RGG LA+R+ ++G+++EV +    NR K ISEV E++G LWIGS+ 
Sbjct: 281 PLDITKFQTALAKYRGGGLAVRLSENGDIVEVFEDRDGNRLKSISEVMEKDGKLWIGSID 340

Query: 83  VPFASLYKL 57
           +PFA  +KL
Sbjct: 341 LPFAGRFKL 349


>gb|ABA95779.2| Strictosidine synthase family protein, expressed [Oryza sativa
            Japonica Group] gi|215694000|dbj|BAG89199.1| unnamed
            protein product [Oryza sativa Japonica Group]
          Length = 430

 Score =  309 bits (791), Expect = 7e-82
 Identities = 146/313 (46%), Positives = 217/313 (69%), Gaps = 12/313 (3%)
 Frame = -3

Query: 959  GPYTGVSDGRIFKWSPEHHQWTQF--AVSSGYMDEECAGSQDKEKEHICGRPLGLEFNNK 786
            GPYTGVSDGR+ KW P   +W +   AV   +M + C GS+D ++E  CGRPLGL+FN+K
Sbjct: 118  GPYTGVSDGRVLKWLPLERRWVEHSSAVIEPHMLDSCRGSKDTKREQECGRPLGLKFNSK 177

Query: 785  TGDLFVADAYKGLLKATQDERILKPVVTSAEGGTLGFTNSLDIDQNSGVIYFSDSSINFQ 606
            TG+L+VADAY GL   +  E + +P+V         F+N ++ID  +GVIYF+++S  FQ
Sbjct: 178  TGELYVADAYLGLRVVSPGENVSRPLVPKWTESPFSFSNGVEIDHETGVIYFTETSTRFQ 237

Query: 605  RRQFMKAIITGDRTGRVMKYDPEEEKVEVLINGLAFANGVALSRDGSFLLIVETTECRIL 426
            RR+F+  +ITGD TGR++KYDP+E KVEVL++GL F NG+A+S DGS+LL+ ETT  +IL
Sbjct: 238  RREFLNIVITGDNTGRLLKYDPKENKVEVLVDGLCFPNGLAMSNDGSYLLLAETTTGKIL 297

Query: 425  KYKL---KEKSVQVLVKLPGFPDNIKRSPRGGYWVAMHSRRKKVVQWALSVSWIRRMIPY 255
            +Y +   K  +++ +V+L GFPDNIK SPRGG+WV +H++R K+ +W++S  W+R++I  
Sbjct: 298  RYWIKTPKASTIEEVVQLHGFPDNIKMSPRGGFWVGLHAKRGKIAEWSISYPWLRKVILK 357

Query: 254  LPFD-LHKLSELMERWRGGALAMRIGDDGEVLEVLD------NRFKFISEVHERNGTLWI 96
            LP   + +++  +  +    +A+R+ +DG+ +E +         FK ISEV E++G LWI
Sbjct: 358  LPAQRIQRITSFLTGFGRQVIALRLSEDGKTIEAMSVHGDVRKLFKSISEVEEKDGNLWI 417

Query: 95   GSVLVPFASLYKL 57
            GSVL PF  LY++
Sbjct: 418  GSVLSPFLGLYRI 430

