BLASTX nr result

ID: Dioscorea21_contig00021020 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00021020
         (1398 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN67425.1| hypothetical protein VITISV_006652 [Vitis vinifera]   303   7e-80
emb|CBI19029.3| unnamed protein product [Vitis vinifera]              286   1e-74
ref|XP_003561795.1| PREDICTED: uncharacterized protein LOC100837...   256   1e-65
ref|XP_002513798.1| hypothetical protein RCOM_1032100 [Ricinus c...   249   1e-63
ref|XP_003589591.1| Replication factor C large subunit [Medicago...   237   5e-60

>emb|CAN67425.1| hypothetical protein VITISV_006652 [Vitis vinifera]
          Length = 1170

 Score =  303 bits (776), Expect = 7e-80
 Identities = 181/433 (41%), Positives = 248/433 (57%), Gaps = 12/433 (2%)
 Frame = -2

Query: 1295 ASSSLPEHEQVGQLVADNVGSNRNSTIYLPFRCSNSIVDMETEKDHMLIKERLDSYNLRR 1116
            A+  +P H Q+ + +  N   N          C  +I  M   +   L++E +  Y L  
Sbjct: 198  ANEQVPYH-QLSKNMEGNQEGNHIGFFTGDSGCGRNIDAMPPSR---LLQESMMPYYLGC 253

Query: 1115 GNFPESSLWTDKYQPENTLELCGNSKSIRILSEWLKSWDE-----GPRSNRSFNSSVKDG 951
            GN PE SLW +KYQPE  +E+CGN +S+++LSEWL  W E       ++       ++D 
Sbjct: 254  GNQPEDSLWINKYQPEKAIEVCGNGESVKLLSEWLHLWHEKDSQSSKKATGGDKCIMQDS 313

Query: 950  HHSSYDIESYMDDADDEAIRKNVLLLTGPIGSGKSAAIHACTKEQGFEVIEVNTSDVRSG 771
             +S Y  +S   D D+    KNVLL+TGP+GSGKSAAI+AC KEQGF +IE+NTS +RSG
Sbjct: 314  DNSFYGSDSD-SDLDEGTGLKNVLLVTGPVGSGKSAAIYACAKEQGFRIIEINTSGLRSG 372

Query: 770  ALMKQKFGDSIEASAITPSLCLKDNPIGERKKII----PALLSGXXXXXXXXDSFSQRTS 603
             ++KQ+ G+++E+  +  SL   +NPIG + K I    PAL +G               S
Sbjct: 373  TVVKQRIGEALESHGLKRSL---ENPIGSQSKHIMKSFPALPNGTATQEFESKVIELIPS 429

Query: 602  ENCQQGKIATGGTQKLVEKKNINS--RKVKGSIQXXXXXXXXXXXDMGFISAVLQLAEKT 429
             + +    A G  +K + KKN  +  R    ++            D G I+A+ QLAE  
Sbjct: 430  SDEEDSHDAIGTPEKHIHKKNRTACDRGETITLILFEDVDITFPEDRGLIAAIQQLAETA 489

Query: 428  KKPIILTSNDKNPILP-HLDPLIVDFTVPLSEDLLSHAQMVCAVEGIHISSELLERLISY 252
            K+PIILTSN  NP+LP +LD L V FT+P  ++LL HA MVCA E  +I   L+ER I Y
Sbjct: 490  KRPIILTSNSNNPVLPDNLDRLEVCFTLPSPKELLCHAYMVCAAEKTNIQPWLIERFIEY 549

Query: 251  CRGDIRKMLMFLQFWCQGTRCQIDESKRCTYNPLPFDVSAAHLIMPRVIPWDFPCDLSEK 72
            C+GDIRK LM LQFWCQG R + D      Y PL FD+ A H I+P++IPWDFP  LSE 
Sbjct: 550  CQGDIRKTLMHLQFWCQGKRYRQDRKAHKIYGPLSFDLEAGHQILPKIIPWDFPSQLSEL 609

Query: 71   IEEEISGSFSLFE 33
            +E+EI+ S S  E
Sbjct: 610  VEKEIAKSLSKME 622


>emb|CBI19029.3| unnamed protein product [Vitis vinifera]
          Length = 919

 Score =  286 bits (731), Expect = 1e-74
 Identities = 168/389 (43%), Positives = 226/389 (58%), Gaps = 23/389 (5%)
 Frame = -2

Query: 1130 YNLRRGNFPESSLWTDKYQPENTLELCGNSKSIRILSEWLKSWDE-----GPRSNRSFNS 966
            Y L  GN PE SLW +KYQPE  +E+CGN +S+++LSEWL  W E       ++      
Sbjct: 23   YYLGCGNQPEDSLWINKYQPEKAIEVCGNGESVKLLSEWLHLWHEKDSQSSKKATGGDKC 82

Query: 965  SVKDGHHSSYDIESYMDDADDEAIRKNVLLLTGPIG-----------SGKSAAIHACTKE 819
             ++D  +S Y  +S   D D+    KNVLL+TGP+G           SGKSAAI+AC KE
Sbjct: 83   IMQDSDNSFYGSDSD-SDLDEGTGLKNVLLVTGPVGVYTHSISTAIFSGKSAAIYACAKE 141

Query: 818  QGFEVIEVNTSDVRSGALMKQKFGDSIEASAITPSLCLKDNPIGERKKII----PALLSG 651
            QGF +IE+NTS +RSG ++KQ+ G+++E+  +  SL   +NPIG + K I    PAL +G
Sbjct: 142  QGFRIIEINTSGLRSGTVVKQRIGEALESHGLKRSL---ENPIGSQSKHIMKSFPALPNG 198

Query: 650  XXXXXXXXDSFSQRTSENCQQGKIATGGTQKLVEKKNINS--RKVKGSIQXXXXXXXXXX 477
                           S + +      G  +K + KKN  +  R    ++           
Sbjct: 199  TATQEFESKVIELIPSSDEEDSHDDIGTPEKHIHKKNRTACDRGETITLILFEDVDITFP 258

Query: 476  XDMGFISAVLQLAEKTKKPIILTSNDKNPILP-HLDPLIVDFTVPLSEDLLSHAQMVCAV 300
             D G I+A+ QLAE  K+PIILTSN  NP+LP +LD L V FT+P  ++LL HA MVCA 
Sbjct: 259  EDRGLIAAIQQLAETAKRPIILTSNSNNPVLPDNLDRLEVCFTLPSLKELLCHAYMVCAA 318

Query: 299  EGIHISSELLERLISYCRGDIRKMLMFLQFWCQGTRCQIDESKRCTYNPLPFDVSAAHLI 120
            E  +I   L+ER I YC+GDIRK LM LQFWCQG R +  +     Y PL FD+ A H I
Sbjct: 319  EKTNIQPWLIERFIEYCQGDIRKTLMHLQFWCQGKRYRQGQKAHKIYGPLSFDLDAGHQI 378

Query: 119  MPRVIPWDFPCDLSEKIEEEISGSFSLFE 33
            +P++IPWDFP  LSE +E+EI+ S S  E
Sbjct: 379  LPKIIPWDFPSQLSELVEKEIAKSLSKME 407


>ref|XP_003561795.1| PREDICTED: uncharacterized protein LOC100837229 [Brachypodium
            distachyon]
          Length = 1272

 Score =  256 bits (653), Expect = 1e-65
 Identities = 165/470 (35%), Positives = 247/470 (52%), Gaps = 7/470 (1%)
 Frame = -2

Query: 1394 ESSIKPLELDTACHSQRHPNQMLAKPNFITSAVASSSLPEHEQVGQLVADNVGSNRNSTI 1215
            E  +KPL +++ C  + HP + L + N   +  +   LP    V          ++ S++
Sbjct: 302  EGFVKPLTIESNC-KRIHPYK-LVEQNVADNTASRMDLPSFSNVQS-------ESKLSSL 352

Query: 1214 YLPFRCSNSIVDMETEKDHMLIKERL---DSYNLRRGNFPESSLWTDKYQPENTLELCGN 1044
             + F    S++  +         ER+    S  L++   P   LWTDKY+PE  +++CGN
Sbjct: 353  NIHFD-DESLLAYDASHHFGKHPERILQGCSEVLQKCCQPAYDLWTDKYRPETAVQVCGN 411

Query: 1043 SKSIRILSEWLKSWDEGPRSNRS---FNSSVKDGHHSSYDIESYMDDADDEAIRKNVLLL 873
             + ++ LSEWLK WDE    N+     N S+ DG+      ES  D +++ +  +NVLL+
Sbjct: 412  MEHVKFLSEWLKGWDEKGHKNKQNGVTNGSINDGYCQD---ESDTDCSEEASDFENVLLI 468

Query: 872  TGPIGSGKSAAIHACTKEQGFEVIEVNTSDVRSGALMKQKFGDSIEASAITPSLCLKDNP 693
            TGP+G GKSAA+ AC +EQGF VIEVNTSD R+GA +KQKF ++ ++  +      +   
Sbjct: 469  TGPVGCGKSAAVFACAREQGFNVIEVNTSDTRNGAYVKQKFEEATKSHGLEKWSQEEVTT 528

Query: 692  IGERKKIIPALLSGXXXXXXXXDSFSQRTSENCQQGKIATGGTQKLVEKKNINSRKVKGS 513
                  + P   SG         S S   S  C     ++    K V  K +   +   +
Sbjct: 529  PPRNDSLDPT--SGIPDRTEYNQSIS--CSVKCYSSSKSSDEAPKQVMNKTLILFEDVDT 584

Query: 512  IQXXXXXXXXXXXDMGFISAVLQLAEKTKKPIILTSNDKNPILPHL-DPLIVDFTVPLSE 336
            +            D GFIS +L++AE TK PIILTSN K+P LPHL D L++DF  P S 
Sbjct: 585  V---------FDEDRGFISTILKIAETTKWPIILTSNKKDPSLPHLLDQLVLDFKYPSSG 635

Query: 335  DLLSHAQMVCAVEGIHISSELLERLISYCRGDIRKMLMFLQFWCQGTRCQIDESKRCTYN 156
            +LLSH  M+C  EG+++++  L+ +I+ C GDIR+  + LQFW QG     + S +C   
Sbjct: 636  ELLSHVGMICKSEGVNVTASQLKYIINACLGDIRRTTLLLQFWYQGKHQYTERSNKCLSG 695

Query: 155  PLPFDVSAAHLIMPRVIPWDFPCDLSEKIEEEISGSFSLFENLCLEDAKE 6
            P   D+ A H  +PR++PWDFPC LSE +  EI       +N+ L D K+
Sbjct: 696  PFSLDLDAIHSTVPRMLPWDFPCKLSETVCMEIE------KNILLADEKK 739


>ref|XP_002513798.1| hypothetical protein RCOM_1032100 [Ricinus communis]
            gi|223546884|gb|EEF48381.1| hypothetical protein
            RCOM_1032100 [Ricinus communis]
          Length = 1247

 Score =  249 bits (636), Expect = 1e-63
 Identities = 164/474 (34%), Positives = 253/474 (53%), Gaps = 9/474 (1%)
 Frame = -2

Query: 1397 FESSIKPLELDTACHSQRHPNQMLAKPNFITSAVASSSLPEHEQVGQLVADNVGSNRNST 1218
            FE + K L  D           +L   N  ++  + +++    QV QLV        +  
Sbjct: 249  FECTGKSLCFDEFPSVSNPSGSLLCPDNLSSNKASPTAVSLDVQVDQLVKHAEDYQVDEV 308

Query: 1217 IYLPFRCSNSIVDMETEKDHMLIKERLDSYNLRRGNFPESSLWTDKYQPENTLELCGNSK 1038
            + +     NS      E+     +ER  S++L   N  +S LWTDKYQP+ + ELCGN  
Sbjct: 309  VLISGCERNS-----DEQQSQYPQERAASFHLGCANQLDSRLWTDKYQPKKSTELCGNDD 363

Query: 1037 SIRILSEWLKSW-DEGPRSNRSF----NSSVKDGHHSSYDIESYMDDADDEAIRKNVLLL 873
            S++ILSEWL++W   G ++NR         ++D  ++ +  +S  ++  +E   KNVLL+
Sbjct: 364  SVKILSEWLRTWCRRGRQANRDEPGGDECDIQDPDYNCFRDDSDSENISEEGSFKNVLLI 423

Query: 872  TGPIGSGKSAAIHACTKEQGFEVIEVNTSDVRSGALMKQKFGDSIEASAITPSLCLKDNP 693
            TGP+GSGKSAAI+AC KEQGF V+E + S+ R+GALMK++FG ++E+ +   S  L+   
Sbjct: 424  TGPVGSGKSAAIYACAKEQGFRVLEASASECRNGALMKERFG-ALESQSTLDSQLLQ--- 479

Query: 692  IGERKKIIPALLSGXXXXXXXXDSFSQRTSENCQQGKIATGGTQKLVEKKNINSRKVKGS 513
                   I    S          + +      C QG++       L E  +I   + +G 
Sbjct: 480  --WFVNFIHFFSSSTWNLICCFLALA------CGQGQLK---PLILFEDVDIVFAEDRG- 527

Query: 512  IQXXXXXXXXXXXDMGFISAVLQLAEKTKKPIILTSNDKNPILP-HLDPLIVDFTVPLSE 336
                            FISA+ Q+A+K K P+ILTSN   P LP +LD L + F +PL +
Sbjct: 528  ----------------FISAIQQIADKIKGPVILTSNSNKPFLPDNLDRLELCFKMPLEK 571

Query: 335  DLLSHAQMVCAVEGIHISSELLERLISYCRGDIRKMLMFLQFWCQG---TRCQIDESKRC 165
            +LL H  MVC+ E +++   L+E LI +C+ DIRK +M LQFWCQG   T+ ++ E +R 
Sbjct: 572  ELLQHLCMVCSAEKVNVQPRLIEHLIDFCQRDIRKTIMHLQFWCQGEQFTKGRVSEVQRL 631

Query: 164  TYNPLPFDVSAAHLIMPRVIPWDFPCDLSEKIEEEISGSFSLFENLCLEDAKEH 3
            + +PLPFD+ A + I P++IPW+FP  L+E + +EI+ S    E++      +H
Sbjct: 632  S-SPLPFDLEAGYQIFPKMIPWEFPSQLAELVMKEIAMSLCTMEDVIENQFDDH 684


>ref|XP_003589591.1| Replication factor C large subunit [Medicago truncatula]
            gi|355478639|gb|AES59842.1| Replication factor C large
            subunit [Medicago truncatula]
          Length = 1178

 Score =  237 bits (605), Expect = 5e-60
 Identities = 159/481 (33%), Positives = 238/481 (49%), Gaps = 26/481 (5%)
 Frame = -2

Query: 1394 ESSIKPLELDTACHSQRHPNQMLAKPNFITSAVASSSLP-EHEQVGQLVADNVGSNRNST 1218
            E S++PL  D   HS    +      N ++   +   LP + E + +++ +N   +    
Sbjct: 172  EDSVEPLNFDNF-HSGVKSSSTSISQNALS--YSDDKLPTQSEHMMEMLPENSAVDNE-- 226

Query: 1217 IYLPFRCSNSIVDME-------------TEKDHMLIKERLDSYNLRRGNFPESSLWTDKY 1077
               P +  ++IVD+E             T K       R+ S+     +  ESSLW  KY
Sbjct: 227  ---PAKPEDAIVDLEMDEASTISGQACNTGKSDAEPPSRMSSFCQSCEDKAESSLWIHKY 283

Query: 1076 QPENTLELCGNSKSIRILSEWLKSWDEGPRSNRSFNSSVK-------DGHHSSYDIESYM 918
            +P    E+CGN +S+  L +WL  W E    NR  +S+         DG ++    +   
Sbjct: 284  KPTKASEVCGNDESLNFLRDWLHHWHERRYQNRKDSSNKDQTDIPDGDGDYNCAGCDYAS 343

Query: 917  DDADDEAIRKNVLLLTGPIGSGKSAAIHACTKEQGFEVIEVNTSDVRSGALMKQKFGDSI 738
             D  +E   KNVLL+TGP+GSGKSAA++AC +EQGFEV+E+NTSD R+   +KQ FGD++
Sbjct: 344  KDVSEEGSLKNVLLITGPVGSGKSAAVYACAQEQGFEVLELNTSDCRNATAVKQYFGDAL 403

Query: 737  EASAITPSLCLKDNPIGERKKIIPALLSGXXXXXXXXDSFSQRTSENC---QQGKIATGG 567
             +  +     L ++ +G +KK +  L +                 E       G    GG
Sbjct: 404  GSQCVKS---LVEHTVGSQKKTLKLLQAPASPNVKEAKEMDHDVIEMITLSDDGAHGPGG 460

Query: 566  T-QKLVEKKNINSRKVKGSIQXXXXXXXXXXXDMGFISAVLQLAEKTKKPIILTSNDKNP 390
            T QKL    N  +     ++            D G I+A+  +AE  K PIILTSN K  
Sbjct: 461  TSQKLHAIDNTLTSDAVQTLILVEDVDILFPEDRGCIAAIQHIAETAKGPIILTSNSKKA 520

Query: 389  ILPH-LDPLIVDFTVPLSEDLLSHAQMVCAVEGIHISSELLERLISYCRGDIRKMLMFLQ 213
             LP+    L V F++PL ++LL H   VCA E +  +  L+E+ I  C  DIRK ++ LQ
Sbjct: 521  GLPNNFCRLHVSFSLPLPDELLRHLFTVCATEEVDANPLLMEKFIQSCDRDIRKTILHLQ 580

Query: 212  FWCQGTRCQIDESKRCTYNPLPFDVSAAHLIMPRVIPWDFPCDLSEKIEEEISGSFSLFE 33
            FW Q  +   D+  +  Y  LPFD+ A H I+P++IPW FP +LS+ IE E++ S +  E
Sbjct: 581  FWFQNKKYSKDKKVQTLYGSLPFDLEAGHKILPKMIPWSFPSELSKLIENEVTKSIATME 640

Query: 32   N 30
            N
Sbjct: 641  N 641

