BLASTX nr result

ID: Dioscorea21_contig00002470

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00002470
         (1852 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269466.2| PREDICTED: eukaryotic translation initiation...   382   e-103
ref|XP_002326844.1| predicted protein [Populus trichocarpa] gi|2...   373   e-100
gb|ABK95855.1| unknown [Populus trichocarpa]                          371   e-100
ref|XP_002302506.1| predicted protein [Populus trichocarpa] gi|2...   370   e-100
ref|XP_002460845.1| hypothetical protein SORBIDRAFT_02g036110 [S...   361   4e-97

>ref|XP_002269466.2| PREDICTED: eukaryotic translation initiation factor 4G-like [Vitis
            vinifera]
          Length = 1935

 Score =  382 bits (982), Expect = e-103
 Identities = 197/283 (69%), Positives = 219/283 (77%), Gaps = 4/283 (1%)
 Frame = +3

Query: 1014 VISQIFDKALTEPTFCEMYADFCFHLSSELPDFSENNEKITFKRLLLNKCXXXXXXXXXX 1193
            VISQIFDKAL EPTFCEMYA+FCFHL+ ELPDFSE+NEKITFKRLLLNKC          
Sbjct: 1318 VISQIFDKALMEPTFCEMYANFCFHLARELPDFSEDNEKITFKRLLLNKCQEEFERGERE 1377

Query: 1194 XXXXDKAXXXXXXXXXXXXXXXXXXXXXXXMLGNIRLIGELYKKKMLTERIMHECIKKLL 1373
                ++A                       MLGNIRLIGELYKK+MLTERIMHECIKKLL
Sbjct: 1378 QEEANRADEEGEIKQSEEEREEKRIKARRRMLGNIRLIGELYKKRMLTERIMHECIKKLL 1437

Query: 1374 GQYPNPDEEDIEALCKLMSTIGQMIDHPKAKEHMDAYFDMMLKLSTNPKLSSRLRFMLKD 1553
            GQY NPDEEDIE+LCKLMSTIG+MIDHPKAKEHMD YFD M KLS N KLSSR+RFMLKD
Sbjct: 1438 GQYQNPDEEDIESLCKLMSTIGEMIDHPKAKEHMDVYFDRMAKLSNNMKLSSRVRFMLKD 1497

Query: 1554 AIDLRKNRWQQRRKIEGPKKIEEVHRDVAHERQAQASRLARGPVISSTPRRGA-AVDFGP 1730
            AIDLRKN+WQQRRK+EGPKKIEEVHRD A ERQAQASRL+RGP ++S+ RRGA  +DFGP
Sbjct: 1498 AIDLRKNKWQQRRKVEGPKKIEEVHRDAAQERQAQASRLSRGPSMNSSTRRGAPPMDFGP 1557

Query: 1731 RGSTILSSPPQQMGGSRVFP---VQARGPQDVRMDDRHPFESK 1850
            RGST+LSSP  QMGG R  P   V+  G QDVR++DR  +ES+
Sbjct: 1558 RGSTMLSSPNSQMGGFRGLPSPQVRGFGAQDVRLEDRQSYESR 1600



 Score =  196 bits (499), Expect(2) = 6e-52
 Identities = 109/194 (56%), Positives = 133/194 (68%), Gaps = 5/194 (2%)
 Frame = +2

Query: 269  GNEATGRKKYSRDFLMTFSEQCTELPAGFEIGYGIANALMSGPVSASYAVDREPHPSPGR 448
            GN   G KKYSRDFL+TF++QC +LP GFEI   IA ALM   ++ S+ +DR+ +PSPGR
Sbjct: 1085 GNGVLG-KKYSRDFLLTFADQCNDLPEGFEITSDIAEALMISNINMSHLIDRDSYPSPGR 1143

Query: 449  STDRSPKAPWVDRR-IVVGDDDKWTKFPG--SSGPGL--DLAHGISAVSFRPGQGVNHGV 613
              DR       DRR   V DDDKW+K PG  SSG  L  D+ +G + V FR  QG N+GV
Sbjct: 1144 IVDRQAGGSRPDRRGSGVVDDDKWSKLPGPFSSGRDLRPDIGYGGNVVGFRSVQGGNYGV 1203

Query: 614  LRNPRGQLSSQYAGGILSGPLQAMASPGGVPQNGIDADRWHRAPSAQRGLVPSPQTPSQV 793
            LRNPRGQ + QY GGILSGP+Q+M S GG  +N  DADRW RA   Q+GL+PSPQT  Q 
Sbjct: 1204 LRNPRGQSTMQYVGGILSGPMQSMGSQGG-QRNSPDADRWQRATGFQKGLIPSPQTSVQ- 1261

Query: 794  MHKATRKYEVGKVS 835
            MH+A +KYEVGK +
Sbjct: 1262 MHRAEKKYEVGKAT 1275



 Score = 36.2 bits (82), Expect(2) = 6e-52
 Identities = 22/50 (44%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
 Frame = +3

Query: 24   SKDISTTVDTKNMSVNDLNKEAVVNAE-DKVSKGEVDDWEDAADISTLKL 170
            S + ++  + K +S  D  +E VV ++  +  K E DDWEDAADIST KL
Sbjct: 1018 SSESTSAGNVKQVSA-DAGQEDVVGSDIGEQPKAEPDDWEDAADISTPKL 1066


>ref|XP_002326844.1| predicted protein [Populus trichocarpa] gi|222835159|gb|EEE73594.1|
            predicted protein [Populus trichocarpa]
          Length = 1166

 Score =  373 bits (957), Expect = e-100
 Identities = 191/281 (67%), Positives = 215/281 (76%), Gaps = 2/281 (0%)
 Frame = +3

Query: 1014 VISQIFDKALTEPTFCEMYADFCFHLSSELPDFSENNEKITFKRLLLNKCXXXXXXXXXX 1193
            VISQIFDKAL EPTFCEMYA+FCFHL++ELP+  E++EK+TFKRLLLNKC          
Sbjct: 553  VISQIFDKALMEPTFCEMYANFCFHLAAELPELIEDDEKVTFKRLLLNKCQEEFERGERE 612

Query: 1194 XXXXDKAXXXXXXXXXXXXXXXXXXXXXXXMLGNIRLIGELYKKKMLTERIMHECIKKLL 1373
                +KA                       MLGNIRLIGELYKK+MLTERIMHECIKKLL
Sbjct: 613  QEEANKADEEGEIKKSDEEREEQRIKARRRMLGNIRLIGELYKKRMLTERIMHECIKKLL 672

Query: 1374 GQYPNPDEEDIEALCKLMSTIGQMIDHPKAKEHMDAYFDMMLKLSTNPKLSSRLRFMLKD 1553
            GQY NPDEED+E+LCKLMSTIG+MIDHPKAK HMDAYFDMM KLS N KLSSR+RFMLKD
Sbjct: 673  GQYQNPDEEDVESLCKLMSTIGEMIDHPKAKVHMDAYFDMMAKLSNNMKLSSRVRFMLKD 732

Query: 1554 AIDLRKNRWQQRRKIEGPKKIEEVHRDVAHERQAQASRLARGPVISSTPRRGAAVDFGPR 1733
            AIDLRKN+WQQRRK+EGPKKIEEVHRD A ERQ Q SRLAR P ++S+PRRG  +DFGPR
Sbjct: 733  AIDLRKNKWQQRRKVEGPKKIEEVHRDAAQERQLQTSRLARNPGMNSSPRRG-PMDFGPR 791

Query: 1734 GSTILSSPPQQMGGSRVFPVQAR--GPQDVRMDDRHPFESK 1850
            GST+LSSP   MGG R FP Q R  G QDVR +DR  +E++
Sbjct: 792  GSTMLSSPNAHMGGFRGFPSQVRGHGNQDVRHEDRQSYEAR 832



 Score =  182 bits (462), Expect(2) = 4e-47
 Identities = 103/194 (53%), Positives = 125/194 (64%), Gaps = 5/194 (2%)
 Frame = +2

Query: 269 GNEATGRKKYSRDFLMTFSEQCTELPAGFEIGYGIANALMSGPVSASYAVDREPHPSPGR 448
           GN  T  KKYSRDFL+ FSEQ + LP GF I   IA AL    V+ S+  D + +PSP R
Sbjct: 322 GNANTA-KKYSRDFLLKFSEQFSNLPEGFVITSDIAEALS---VNVSHPADLDSYPSPAR 377

Query: 449 STDRSPKAPWVDRRIVVGDDDKWTKFPGSSGPG----LDLAHGISAVSFRPGQGVNHGVL 616
             DRS     + R   + DD +W+K PG  GPG    LD+ +G +A SFRP  G NHGVL
Sbjct: 378 VMDRSNSGSRIGRGSGMVDDGRWSKQPGPFGPGRDLHLDMGYGPNA-SFRPVAGGNHGVL 436

Query: 617 RNPRGQLSSQYAGGILSGPLQAMASPGGVPQNGIDADRWHRA-PSAQRGLVPSPQTPSQV 793
           RNPR Q   QYAGGILSGP+Q+    GG+ + G DAD+W R+  S  +GL+PSP TP Q 
Sbjct: 437 RNPRAQSPGQYAGGILSGPVQSTGLQGGMQRGGSDADKWQRSVSSVYKGLIPSPHTPLQT 496

Query: 794 MHKATRKYEVGKVS 835
           MHKA RKYEVGKV+
Sbjct: 497 MHKAERKYEVGKVA 510



 Score = 34.3 bits (77), Expect(2) = 4e-47
 Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
 Frame = +3

Query: 6   PSDLSESKDISTTVDTKNMSVNDLNKEA--VVNAEDKVSKGEVDDWEDAADISTLKL 170
           P +  E+   S   ++ +  +N    +A  V +   + +K E DDWEDAAD+ST KL
Sbjct: 248 PEEKKENVISSEVTESTSPILNQTPADALQVDSVASEKNKAEPDDWEDAADMSTPKL 304


>gb|ABK95855.1| unknown [Populus trichocarpa]
          Length = 670

 Score =  371 bits (952), Expect = e-100
 Identities = 190/281 (67%), Positives = 214/281 (76%), Gaps = 2/281 (0%)
 Frame = +3

Query: 1014 VISQIFDKALTEPTFCEMYADFCFHLSSELPDFSENNEKITFKRLLLNKCXXXXXXXXXX 1193
            VISQIFDKAL EPTFCEMYA+FCFHL++ELP+  E++EK+TFKRLLLNKC          
Sbjct: 57   VISQIFDKALMEPTFCEMYANFCFHLAAELPELIEDDEKVTFKRLLLNKCQEEFERGERE 116

Query: 1194 XXXXDKAXXXXXXXXXXXXXXXXXXXXXXXMLGNIRLIGELYKKKMLTERIMHECIKKLL 1373
                +KA                       MLGNIRLIGELYKK+MLTERIMHECIKKLL
Sbjct: 117  QEEANKADEEGEIKKSDEEREEQRIKARRRMLGNIRLIGELYKKRMLTERIMHECIKKLL 176

Query: 1374 GQYPNPDEEDIEALCKLMSTIGQMIDHPKAKEHMDAYFDMMLKLSTNPKLSSRLRFMLKD 1553
            GQY NPDEED+E+LCKLMSTIG+MIDHPKAK HMDAYFDMM KLS N KLSSR+RFMLKD
Sbjct: 177  GQYQNPDEEDVESLCKLMSTIGEMIDHPKAKVHMDAYFDMMAKLSNNMKLSSRVRFMLKD 236

Query: 1554 AIDLRKNRWQQRRKIEGPKKIEEVHRDVAHERQAQASRLARGPVISSTPRRGAAVDFGPR 1733
            AIDLRKN+WQQRRK+EGPKKIEEVHRD A ERQ Q SRLAR P ++S+PRRG  +DFGPR
Sbjct: 237  AIDLRKNKWQQRRKVEGPKKIEEVHRDAAQERQLQTSRLARNPGMNSSPRRG-PMDFGPR 295

Query: 1734 GSTILSSPPQQMGGSRVFPVQAR--GPQDVRMDDRHPFESK 1850
            GST+LSSP   MGG R FP Q R  G  DVR +DR  +E++
Sbjct: 296  GSTMLSSPNAHMGGFRGFPSQVRGHGNHDVRHEDRQSYEAR 336


>ref|XP_002302506.1| predicted protein [Populus trichocarpa] gi|222844232|gb|EEE81779.1|
            predicted protein [Populus trichocarpa]
          Length = 797

 Score =  370 bits (951), Expect = e-100
 Identities = 188/281 (66%), Positives = 214/281 (76%), Gaps = 2/281 (0%)
 Frame = +3

Query: 1014 VISQIFDKALTEPTFCEMYADFCFHLSSELPDFSENNEKITFKRLLLNKCXXXXXXXXXX 1193
            VISQIFDKAL EPTFCEMYA+FCFHL++ELP+ +E+NEK+TFKR+LLNKC          
Sbjct: 340  VISQIFDKALMEPTFCEMYANFCFHLAAELPELTEDNEKVTFKRILLNKCQEEFERGERE 399

Query: 1194 XXXXDKAXXXXXXXXXXXXXXXXXXXXXXXMLGNIRLIGELYKKKMLTERIMHECIKKLL 1373
                +KA                       MLGNIRLIGELYKK+MLTERIMHECIKKLL
Sbjct: 400  QEEANKADEEGEIKQSEEEREEKRIKARRRMLGNIRLIGELYKKRMLTERIMHECIKKLL 459

Query: 1374 GQYPNPDEEDIEALCKLMSTIGQMIDHPKAKEHMDAYFDMMLKLSTNPKLSSRLRFMLKD 1553
            GQY NPDEED+EALCKLMSTIG+MIDHPKAKEHMD YFDMM KLS N KLSSR+RFMLKD
Sbjct: 460  GQYQNPDEEDLEALCKLMSTIGEMIDHPKAKEHMDVYFDMMAKLSNNMKLSSRVRFMLKD 519

Query: 1554 AIDLRKNRWQQRRKIEGPKKIEEVHRDVAHERQAQASRLARGPVISSTPRRGAAVDFGPR 1733
            +IDLRKN+WQQRRK+EGPKKIEEVHRD A ERQ Q SRLAR P I+ +PRRG  +DFGPR
Sbjct: 520  SIDLRKNKWQQRRKVEGPKKIEEVHRDAAQERQLQTSRLARNPGINPSPRRG-PMDFGPR 578

Query: 1734 GSTILSSPPQQMGGSRVFPVQAR--GPQDVRMDDRHPFESK 1850
            GST+L S   QMGG R FP Q R  G QDVR +++  +E++
Sbjct: 579  GSTMLPSLNAQMGGFRGFPTQVRGHGTQDVRFEEKQSYEAR 619



 Score =  189 bits (480), Expect(2) = 7e-51
 Identities = 103/187 (55%), Positives = 122/187 (65%), Gaps = 5/187 (2%)
 Frame = +2

Query: 290 KKYSRDFLMTFSEQCTELPAGFEIGYGIANALMSGPVSASYAVDREPHPSPGRSTDRSPK 469
           KKYSRDFL+ FSEQCT+LP GF+I   IA +LM   V  S+  DR+P PSP R  DRS  
Sbjct: 113 KKYSRDFLLKFSEQCTDLPGGFQIPSDIAGSLMG--VGVSHLADRDPCPSPARVMDRSNS 170

Query: 470 APWVDRR-IVVGDDDKWTKFPGSSGPGLDLAHGISA---VSFRPGQGVNHGVLRNPRGQL 637
              +DRR   + DD +W+K PG SGPG DL   IS    V FRP  G N+G LRNPR Q 
Sbjct: 171 GSRIDRRGSGIVDDGRWSKQPGPSGPGRDLHLDISYGANVGFRPVAGGNYGALRNPRAQS 230

Query: 638 SSQYAGGILSGPLQAMASPGGVPQNGIDADRWHRAP-SAQRGLVPSPQTPSQVMHKATRK 814
              Y GGILSGP+Q+M   GG+ + G+DADRW RA     +G   SPQTP Q MHKA +K
Sbjct: 231 PVHYGGGILSGPMQSMGPQGGLQRGGLDADRWQRAAIFVHKGSFSSPQTPLQTMHKAEKK 290

Query: 815 YEVGKVS 835
           YEVGKV+
Sbjct: 291 YEVGKVT 297



 Score = 40.0 bits (92), Expect(2) = 7e-51
 Identities = 19/45 (42%), Positives = 27/45 (60%)
 Frame = +3

Query: 36  STTVDTKNMSVNDLNKEAVVNAEDKVSKGEVDDWEDAADISTLKL 170
           ST+ + K    + L  + V + +   +K E DDWEDA D+STLKL
Sbjct: 43  STSPNLKQAPADALQVQTVASEKSMQNKAEPDDWEDATDMSTLKL 87


>ref|XP_002460845.1| hypothetical protein SORBIDRAFT_02g036110 [Sorghum bicolor]
            gi|241924222|gb|EER97366.1| hypothetical protein
            SORBIDRAFT_02g036110 [Sorghum bicolor]
          Length = 825

 Score =  361 bits (926), Expect = 4e-97
 Identities = 181/280 (64%), Positives = 208/280 (74%), Gaps = 1/280 (0%)
 Frame = +3

Query: 1014 VISQIFDKALTEPTFCEMYADFCFHLSSELPDFSENNEKITFKRLLLNKCXXXXXXXXXX 1193
            VISQIFDKAL EPTFCEMYA+FC HL+  LPDFSE+NEKITFKRLLLNKC          
Sbjct: 249  VISQIFDKALMEPTFCEMYANFCSHLAGALPDFSEDNEKITFKRLLLNKCQEEFERGERE 308

Query: 1194 XXXXDKAXXXXXXXXXXXXXXXXXXXXXXXMLGNIRLIGELYKKKMLTERIMHECIKKLL 1373
                DK                        MLGNIRLIGELYKKKMLTERIMHECIKKL 
Sbjct: 309  EAEADKTEEEGEIKQTKEEREEKRIRARRRMLGNIRLIGELYKKKMLTERIMHECIKKLF 368

Query: 1374 GQYPNPDEEDIEALCKLMSTIGQMIDHPKAKEHMDAYFDMMLKLSTNPKLSSRLRFMLKD 1553
            G + NPDEE+IEALCKLMSTIG+MIDH KAKEHMDAYF  M  +ST+ KLSSR+RFML+D
Sbjct: 369  GNHDNPDEENIEALCKLMSTIGEMIDHVKAKEHMDAYFARMENMSTSQKLSSRVRFMLRD 428

Query: 1554 AIDLRKNRWQQRRKIEGPKKIEEVHRDVAHERQAQASRLARGPVISSTPRRGA-AVDFGP 1730
            +IDLR+N+WQQRRK+EGPKKIEEVHRD A ER AQ+SRL RGP +SS PRRGA  +D+GP
Sbjct: 429  SIDLRRNKWQQRRKVEGPKKIEEVHRDAAQERHAQSSRLGRGPAVSSVPRRGAPPMDYGP 488

Query: 1731 RGSTILSSPPQQMGGSRVFPVQARGPQDVRMDDRHPFESK 1850
            RGS+ L+SP    G  R  P Q+RG QD+R ++RH F+++
Sbjct: 489  RGSSALASPSSHQGSIRGMPPQSRGSQDIRYEERHQFDNR 528



 Score =  182 bits (463), Expect = 2e-43
 Identities = 100/191 (52%), Positives = 123/191 (64%), Gaps = 4/191 (2%)
 Frame = +2

Query: 275 EATGRKKYSRDFLMTFSEQCTELPAGFEIGYGIANALMSGPVSASYAVDREPHPSPGRST 454
           EA GRKKYSRDFL+T  +  T LP GF++   + NA+M+     SY VDREPHPSPGR  
Sbjct: 26  EANGRKKYSRDFLLTLQQHWTGLPVGFKMNEAV-NAIMNNLAGKSYVVDREPHPSPGRGL 84

Query: 455 DRSPKAPWVDRRIVVGDDDKWTKFPGSSGPG----LDLAHGISAVSFRPGQGVNHGVLRN 622
           DR       DRR     DD+WTK      PG    +DLA+G S +++R   G NHGVLRN
Sbjct: 85  DRPTSRG--DRRGAAMADDRWTKAGIPLSPGRDVHMDLANGPSIINYRGSSGGNHGVLRN 142

Query: 623 PRGQLSSQYAGGILSGPLQAMASPGGVPQNGIDADRWHRAPSAQRGLVPSPQTPSQVMHK 802
           PRGQ S+QY GG+L+GP+ ++     V ++G DADRW      Q+GL+PSP TP Q MHK
Sbjct: 143 PRGQPSNQYGGGLLAGPMHSVGPQ--VSRSGSDADRWQ-----QKGLMPSPVTPMQAMHK 195

Query: 803 ATRKYEVGKVS 835
           A RKY VGKVS
Sbjct: 196 AERKYVVGKVS 206

