BLASTX nr result

ID: Dioscorea21_contig00017354 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00017354
         (1550 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002273827.1| PREDICTED: uncharacterized protein LOC100252...   543   e-152
ref|XP_002531235.1| conserved hypothetical protein [Ricinus comm...   541   e-151
ref|XP_003534761.1| PREDICTED: uncharacterized protein LOC100785...   523   e-146
ref|XP_003528554.1| PREDICTED: uncharacterized protein LOC100792...   520   e-145
ref|XP_004145479.1| PREDICTED: uncharacterized protein LOC101207...   520   e-145

>ref|XP_002273827.1| PREDICTED: uncharacterized protein LOC100252719 [Vitis vinifera]
          Length = 427

 Score =  543 bits (1398), Expect = e-152
 Identities = 267/414 (64%), Positives = 328/414 (79%), Gaps = 6/414 (1%)
 Frame = -2

Query: 1432 KTVILHPFPGFWWIQQKGV-YWTAASATPAFQSSVRKDKFLEVPQIAWGLNNQKIAFARA 1256
            +  +L  F GF  + Q         S +   +  +RK+KFLEVPQI WGLNNQKIAFARA
Sbjct: 16   RAAMLPTFSGFGEVAQNSFRVINKGSLSSDSELQMRKNKFLEVPQIVWGLNNQKIAFARA 75

Query: 1255 CLTARLLNRTLLMPSLSASLFYKEVELLEAVPFDKIFQFERFDTMCNGFIQLGRYSDLLN 1076
            CLTAR++ RTLLMPSLSASLFYKE++LL+ + FDK+FQFERF+++CNGF++LGRYSDL N
Sbjct: 76   CLTARMMKRTLLMPSLSASLFYKEIDLLQPISFDKVFQFERFNSLCNGFVRLGRYSDLSN 135

Query: 1075 QTKPFELQKGSGRKWTKEKDLHQLEQCKEDSVDKFELIRIVGKNPFLWHDHWPVTDYAKI 896
            +T+ FELQKGSGRKWT E+DL QL +  ++  D +E+IRI+GKNPFLWHDHWPV DYAK+
Sbjct: 136  RTQVFELQKGSGRKWTIERDLDQLREFSKEPYDGYEVIRILGKNPFLWHDHWPVKDYAKV 195

Query: 895  FECLSFVXXXXXEAVKVISRIREVGTKARS-----ENFAHGHSSGSLIQSVPYIAVHMRV 731
            F+CL  V     EA KV+S+IRE+G K  S     +N ++  S  SL   +PYIAVHMR+
Sbjct: 196  FDCLVLVEEISKEADKVVSKIREMGRKVGSKAVFSQNASNSESPSSL--PMPYIAVHMRI 253

Query: 730  EKDWMIHCKKLEQRLNINQICSNKTEITQRVAKISVLQKPIVVYLAVADSLLEDNSILSG 551
            EKDWMIHCKKLEQR NI+QICS+K EI  RV  I+ L+ P++VYLAVADSLLEDNSIL+G
Sbjct: 254  EKDWMIHCKKLEQRFNISQICSSKEEIIGRVGNIAGLKTPMIVYLAVADSLLEDNSILNG 313

Query: 550  WQQGLLPFEKKKLGVWDIYKKYPYLVQSAIDYEVCLRADVFIGNSYSTFSSLIVLERTLK 371
            W++GLLPFEKKKLGV  IY KYPYL QSAIDYEVCLRA+VF+GNS+STFSSLI LERTLK
Sbjct: 314  WKEGLLPFEKKKLGVMGIYNKYPYLFQSAIDYEVCLRANVFVGNSFSTFSSLIALERTLK 373

Query: 370  LIKIGVTRSCGREARLPSYAYNVVGKDGGPQSWMTDFSASSLQSISYGSNNVSC 209
            +IK+G+T SCG++   PSYAYN++G+  GPQ WMTD S SSL +ISYG+ +VSC
Sbjct: 374  MIKMGITTSCGKDVTWPSYAYNILGELNGPQRWMTDMSDSSLHAISYGTKDVSC 427


>ref|XP_002531235.1| conserved hypothetical protein [Ricinus communis]
            gi|223529172|gb|EEF31149.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 446

 Score =  541 bits (1394), Expect = e-151
 Identities = 272/431 (63%), Positives = 332/431 (77%), Gaps = 4/431 (0%)
 Frame = -2

Query: 1489 NMNSMVWKXXXXXXXXXXLKTVILHPFPGFWWIQQKGV-YWTAASATPAFQSSVRKDKFL 1313
            N+NS+  K          L+ V+L  F  +  I QK +    + S +      VRKDKFL
Sbjct: 16   NLNSVTCKCLSLVVILLVLRVVLLSSFSDYGRINQKDIDLIPSRSLSLDSDYGVRKDKFL 75

Query: 1312 EVPQIAWGLNNQKIAFARACLTARLLNRTLLMPSLSASLFYKEVELLEAVPFDKIFQFER 1133
            EVPQI WGLNNQKIAFARACLTAR+LNRTLLMP LSASLFYKE++ L+ + FDKIFQFER
Sbjct: 76   EVPQIVWGLNNQKIAFARACLTARMLNRTLLMPKLSASLFYKEIDRLQPISFDKIFQFER 135

Query: 1132 FDTMCNGFIQLGRYSDLLNQTKPFELQKGSGRKWTKEKDLHQLEQCKEDSVDKFELIRIV 953
            F+++CNGF+QLG+YSD+ N +  +ELQKGSGR+WT E+DL QL Q  +D  +  E+IRIV
Sbjct: 136  FNSLCNGFVQLGQYSDVRNHSGVYELQKGSGRRWTVERDLDQLRQFIQDPYNGHEVIRIV 195

Query: 952  GKNPFLWHDHWPVTDYAKIFECLSFVXXXXXEAVKVISRIREVGTKARS--ENFAHGHSS 779
            GKNPFLWHDHWPV DYA++FECL  V     EA KVIS+IREVG + RS  EN ++   S
Sbjct: 196  GKNPFLWHDHWPVKDYARVFECLVLVEEIEKEAAKVISKIREVGREVRSNIENPSNSSDS 255

Query: 778  -GSLIQSVPYIAVHMRVEKDWMIHCKKLEQRLNINQICSNKTEITQRVAKISVLQKPIVV 602
             GS +Q+VPY+AVHMR+E DWMIHCKKLE+R NI+QICS+K EI +RV  I  ++ P VV
Sbjct: 256  DGSSLQAVPYVAVHMRIEIDWMIHCKKLERRSNISQICSSKEEIMERVGNIVGMKTPSVV 315

Query: 601  YLAVADSLLEDNSILSGWQQGLLPFEKKKLGVWDIYKKYPYLVQSAIDYEVCLRADVFIG 422
            YLAVADSLLED SIL+GW+ GLLPFEKKKLGV  IYKKYPYL+QSAIDYEVCLRAD+F G
Sbjct: 316  YLAVADSLLEDPSILTGWKHGLLPFEKKKLGVDGIYKKYPYLIQSAIDYEVCLRADIFFG 375

Query: 421  NSYSTFSSLIVLERTLKLIKIGVTRSCGREARLPSYAYNVVGKDGGPQSWMTDFSASSLQ 242
            NS+STFSSLI LERT K+I++GVT +CG + R PSYAYN++G+  GPQ WMT+ S + LQ
Sbjct: 376  NSFSTFSSLIALERTQKMIRMGVTNTCGMDVRWPSYAYNILGESNGPQRWMTNMSDARLQ 435

Query: 241  SISYGSNNVSC 209
            +ISYG+N + C
Sbjct: 436  AISYGTNTIYC 446


>ref|XP_003534761.1| PREDICTED: uncharacterized protein LOC100785035 [Glycine max]
          Length = 438

 Score =  523 bits (1348), Expect = e-146
 Identities = 252/414 (60%), Positives = 318/414 (76%), Gaps = 6/414 (1%)
 Frame = -2

Query: 1432 KTVILHPFPGFWWIQQKGVYWTAASATPAFQSSVRKDKFLEVPQIAWGLNNQKIAFARAC 1253
            + ++   FPGF  I+   + +  A     F   +R+DKFL VPQ+ WGLNNQKIAFARAC
Sbjct: 26   RALLFASFPGFGGIEWGNLVYLRAPLLN-FDFGIRQDKFLVVPQLVWGLNNQKIAFARAC 84

Query: 1252 LTARLLNRTLLMPSLSASLFYKEVELLEAVPFDKIFQFERFDTMCNGFIQLGRYSDLLNQ 1073
            LTARLLNRTLLMPSLSASLFYKE++LL+ + FDK+FQFE+F+ +C GF++LGRYSD+LN+
Sbjct: 85   LTARLLNRTLLMPSLSASLFYKEIDLLQPISFDKVFQFEKFNALCRGFVRLGRYSDVLNR 144

Query: 1072 TKPFELQKGSGRKWTKEKDLHQLEQCKEDSVDKFELIRIVGKNPFLWHDHWPVTDYAKIF 893
            T+  E++KGSGR+WT E+DL QL++  E S D +E+IRI+GKNPFLWHDHWPV DYA+IF
Sbjct: 145  TEVLEMEKGSGRRWTVERDLSQLKEHSEGSFDDYEIIRIIGKNPFLWHDHWPVKDYARIF 204

Query: 892  ECLSFVXXXXXEAVKVISRIREVG------TKARSENFAHGHSSGSLIQSVPYIAVHMRV 731
            ECL        EA +V+SRIR VG      T+A     +   S GS  Q +P++AVHMR+
Sbjct: 205  ECLDLTEEIAKEADRVVSRIRAVGREVTGNTEAVEVKSSIITSDGSSFQPLPFVAVHMRI 264

Query: 730  EKDWMIHCKKLEQRLNINQICSNKTEITQRVAKISVLQKPIVVYLAVADSLLEDNSILSG 551
            E DWMIHCK LE+RLN N+ICS K  I +RV  I+ L+ P+VVYLAVAD LL ++SIL G
Sbjct: 265  EIDWMIHCKNLERRLNTNRICSGKKAIVERVRNIAGLKTPVVVYLAVADKLLNNSSILDG 324

Query: 550  WQQGLLPFEKKKLGVWDIYKKYPYLVQSAIDYEVCLRADVFIGNSYSTFSSLIVLERTLK 371
            W++G LPFEKKKLGV  IYKKYPYL+QSAIDYEVCL+AD+F+GNS+STFSSLIVLERT K
Sbjct: 325  WEEGFLPFEKKKLGVVGIYKKYPYLIQSAIDYEVCLKADIFVGNSFSTFSSLIVLERTQK 384

Query: 370  LIKIGVTRSCGREARLPSYAYNVVGKDGGPQSWMTDFSASSLQSISYGSNNVSC 209
            +I++ VT  CG   R PSYAYN+ G+  GP  W+T+ S SSLQ+ISYG+N++SC
Sbjct: 385  MIRMSVTNMCGENVRWPSYAYNIPGESNGPMRWVTNMSESSLQAISYGTNHISC 438


>ref|XP_003528554.1| PREDICTED: uncharacterized protein LOC100792558 [Glycine max]
          Length = 430

 Score =  520 bits (1340), Expect = e-145
 Identities = 253/410 (61%), Positives = 320/410 (78%), Gaps = 2/410 (0%)
 Frame = -2

Query: 1432 KTVILHPFPGFWWIQQKGVYWTAASATPAFQSSVRKDKFLEVPQIAWGLNNQKIAFARAC 1253
            + +++ P PGF  ++     +   + +P+F   VR+DKFLEVPQI WGLNNQKIAFARAC
Sbjct: 24   RALLVPPSPGFGGVEWSNFVYIR-NHSPSFGVGVRQDKFLEVPQIVWGLNNQKIAFARAC 82

Query: 1252 LTARLLNRTLLMPSLSASLFYKEVELLEAVPFDKIFQFERFDTMCNGFIQLGRYSDLLNQ 1073
             TAR +NR LLMPSLSASLFYKE++LL+ + FD++FQ+++F+ +C+GF+QLGRYSDL NQ
Sbjct: 83   HTARTMNRILLMPSLSASLFYKEIDLLQPISFDRVFQYDKFNALCSGFVQLGRYSDLSNQ 142

Query: 1072 TKPFELQKGSGRKWTKEKDLHQLEQCKEDSVDKFELIRIVGKNPFLWHDHWPVTDYAKIF 893
            T+  E+QKGSGRKWT E+DL QL    +   +  E+IRIVGKNPFLWHDHWPV DYAK+F
Sbjct: 143  TRVLEMQKGSGRKWTVERDLDQLRDYSKGEFEDHEVIRIVGKNPFLWHDHWPVKDYAKVF 202

Query: 892  ECLSFVXXXXXEAVKVISRIREVG--TKARSENFAHGHSSGSLIQSVPYIAVHMRVEKDW 719
            ECL  +     EA +V+SRIR VG  T++ SE+    + S S  Q +PY+AVHMRVE DW
Sbjct: 203  ECLVLIDEIGREADRVVSRIRAVGRETQSNSESVELENDSSS-FQPLPYVAVHMRVEIDW 261

Query: 718  MIHCKKLEQRLNINQICSNKTEITQRVAKISVLQKPIVVYLAVADSLLEDNSILSGWQQG 539
            MIHCKKLEQRLN NQICS+K EI +RVA I  L+ P VVYLAVAD LL+++S+L GW++G
Sbjct: 262  MIHCKKLEQRLNTNQICSSKKEIMERVANIKGLKTPSVVYLAVADKLLQNSSVLEGWEEG 321

Query: 538  LLPFEKKKLGVWDIYKKYPYLVQSAIDYEVCLRADVFIGNSYSTFSSLIVLERTLKLIKI 359
             LP+EKKKLGV  IYKKYPYL+QSAIDYEVCLRAD+F+GNS+STFSSLIVLERT K+I +
Sbjct: 322  FLPYEKKKLGVDGIYKKYPYLIQSAIDYEVCLRADIFVGNSFSTFSSLIVLERTQKIITM 381

Query: 358  GVTRSCGREARLPSYAYNVVGKDGGPQSWMTDFSASSLQSISYGSNNVSC 209
            GV   CG++   PSYAYN+ G+  GP  W+T+ S S+LQ ISYG+N++SC
Sbjct: 382  GV-NMCGKDVTWPSYAYNIQGESNGPMRWVTNMSHSTLQEISYGTNHISC 430


>ref|XP_004145479.1| PREDICTED: uncharacterized protein LOC101207020 [Cucumis sativus]
            gi|449527777|ref|XP_004170886.1| PREDICTED:
            uncharacterized LOC101207020 [Cucumis sativus]
          Length = 446

 Score =  520 bits (1339), Expect = e-145
 Identities = 266/411 (64%), Positives = 315/411 (76%), Gaps = 5/411 (1%)
 Frame = -2

Query: 1426 VILHP-FPGFWWIQQKGVYWTAASATPAFQSS--VRKDKFLEVPQIAWGLNNQKIAFARA 1256
            VIL P    F  I++ G+     + +P + S   +R DKFLEVPQI WGLNNQKIAFARA
Sbjct: 36   VILFPTISSFGRIEENGLV-VVRNLSPLYGSDFGIRVDKFLEVPQIVWGLNNQKIAFARA 94

Query: 1255 CLTARLLNRTLLMPSLSASLFYKEVELLEAVPFDKIFQFERFDTMCNGFIQLGRYSDLLN 1076
            CLTAR+LNRTLLMPSLSASLFYKEVE LE + FDKIFQFE F++ CNGF++LGRY D+ N
Sbjct: 95   CLTARMLNRTLLMPSLSASLFYKEVERLEPIFFDKIFQFEEFNSRCNGFVRLGRYMDISN 154

Query: 1075 QTKPFELQKGSGRKWTKEKDLHQLEQCKEDSVDKFELIRIVGKNPFLWHDHWPVTDYAKI 896
            QTKP EL KGSGRKWT E+DL QLE+  ++  D+ E+I IVGKNPFLWHDHWPV DYAKI
Sbjct: 155  QTKPIELLKGSGRKWTIERDLEQLEEYSKEPFDQSEVITIVGKNPFLWHDHWPVKDYAKI 214

Query: 895  FECLSFVXXXXXEAVKVISRIREVGTKARS--ENFAHGHSSGSLIQSVPYIAVHMRVEKD 722
            FECL  V     E  KVISRIREVG+K RS  ++ A    S + +Q +PY+AVHMR+E D
Sbjct: 215  FECLVLVDEIEKEVDKVISRIREVGSKVRSKFDSDATVVKSENSLQPMPYVAVHMRIEID 274

Query: 721  WMIHCKKLEQRLNINQICSNKTEITQRVAKISVLQKPIVVYLAVADSLLEDNSILSGWQQ 542
            WMIHCKKLEQR  INQICS+K EI  RV  I  ++ P VVYLAVADSLL D+SIL GW++
Sbjct: 275  WMIHCKKLEQRSRINQICSSKEEIMNRVGNILEMKVPTVVYLAVADSLLNDSSILKGWKE 334

Query: 541  GLLPFEKKKLGVWDIYKKYPYLVQSAIDYEVCLRADVFIGNSYSTFSSLIVLERTLKLIK 362
            GLLPFEKKKLG+  IYKKYPYL+QSAIDYEVCLRADVF+GNS+STFSSL+VL RT KL+K
Sbjct: 335  GLLPFEKKKLGIDKIYKKYPYLIQSAIDYEVCLRADVFVGNSFSTFSSLVVLGRTQKLMK 394

Query: 361  IGVTRSCGREARLPSYAYNVVGKDGGPQSWMTDFSASSLQSISYGSNNVSC 209
              V   C      PSYAYN++G   GP+ W+++ S  SL++ISYGSN+ SC
Sbjct: 395  TDVVDLCDTNLSWPSYAYNILGDSNGPRKWISNMSDISLKNISYGSNDTSC 445

