BLASTX nr result

ID: Dioscorea21_contig00024091 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00024091
         (1010 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN60238.1| hypothetical protein VITISV_032906 [Vitis vinifera]   348   1e-93
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]         342   8e-92
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...   342   8e-92
gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum ...   341   2e-91
emb|CAA69272.1| lectin receptor kinase [Arabidopsis thaliana]         339   5e-91

>emb|CAN60238.1| hypothetical protein VITISV_032906 [Vitis vinifera]
          Length = 1430

 Score =  348 bits (893), Expect = 1e-93
 Identities = 180/345 (52%), Positives = 228/345 (66%), Gaps = 9/345 (2%)
 Frame = -2

Query: 1009 REFTAPYSPQQNGTVERKNRTVVEMARSLLRSGGMPKRFWGEAVGTAVYLINRSPTRALP 830
            RE T PYSP+QNG  ERKNRTVVEMARS++ +  +   FW E V TAVYL+N SPT+A+ 
Sbjct: 699  RELTTPYSPEQNGVAERKNRTVVEMARSMMXAKNLSNHFWAEGVATAVYLLNISPTKAVL 758

Query: 829  NRTPHEAWYGFKPIVSHLRVFGCLAYALIPSQKLQKLNDKSEKCVFIGYCLESKAYKLFN 650
            NRTP+EAWYG KP VSHL+VFG +AY L  S    KL++KS KC+FIGYC +SK YKL+N
Sbjct: 759  NRTPYEAWYGRKPWVSHLKVFGSVAYTLXBSHNRSKLDEKSVKCIFIGYCSQSKGYKLYN 818

Query: 649  PVSCKVIVSRDVVFHEYFRWNWEANSDEAPRLLIEEDIEATGD---------AGAVNTHH 497
            PVS K+IVSR+VVF E     W  + D A   +  E   A  +              +H 
Sbjct: 819  PVSGKIIVSRNVVFDEKASXTWRVSEDGALVEISSESEMAQSEDQQPSVQIPXSPTPSHS 878

Query: 496  ETSSFEDQGHTDINAGDTPPRKTRLLEDVYNSCTFALCANPPDSFSNAVKIKGWKSAMDS 317
             +S       +  ++ +TPPRK R L D+Y + T  L    P +F  AV+ + W SAM  
Sbjct: 879  PSSPNLSXSSSSQSSEETPPRKFRSLRDIYET-TQVLFVADPTTFEEAVEKEEWCSAMKE 937

Query: 316  EMESIEKNDTWFLCELPAGKQAVGLKWVYKPKMNAQGEVVRLKARLVAKGYSQRQGVDYD 137
            E+ +IEKN+TW L ELP  K  +G+KWV++ K  A G + + KARLVAKGY+Q+ GVDYD
Sbjct: 938  EIAAIEKNETWELVELPEDKNVIGVKWVFRTKYLADGSIQKHKARLVAKGYAQQHGVDYD 997

Query: 136  EVFSPVARLDTVRLVLALAAHAGWPVFHFDVKSAFLNGEIQEEVY 2
            + FSPVAR +TVR +LALAAH  W V+ FDVKSAFLNGE+ EEVY
Sbjct: 998  DTFSPVARFETVRTLLALAAHMHWCVYQFDVKSAFLNGELVEEVY 1042


>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]
          Length = 1352

 Score =  342 bits (877), Expect = 8e-92
 Identities = 175/348 (50%), Positives = 235/348 (67%), Gaps = 12/348 (3%)
 Frame = -2

Query: 1009 REFTAPYSPQQNGTVERKNRTVVEMARSLLRSGGMPKRFWGEAVGTAVYLINRSPTRALP 830
            R+ T P SPQQNG VERKNRT++EMARS+L+S  +PK  W EAV  AVYL+NRSPT+++ 
Sbjct: 617  RQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVS 676

Query: 829  NRTPHEAWYGFKPIVSHLRVFGCLAYALIPSQKLQKLNDKSEKCVFIGYCLESKAYKLFN 650
             +TP EAW G KP VSHLRVFG +A+A +P +K  KL+DKSEK +FIGY   SK YKL+N
Sbjct: 677  GKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYN 736

Query: 649  PVSCKVIVSRDVVFHEYFRWNWEANSDEAPRL--LIEEDIEATGDAGAVNTHHETSSFED 476
            P + K I+SR++VF E   W+W +N ++        E++ E T +           +   
Sbjct: 737  PDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPEPTREEPPSEEPTTPPTSPT 796

Query: 475  QGHTDINAGDTPPRKTRLLEDVYN----------SCTFALCANPPDSFSNAVKIKGWKSA 326
                + ++ +  PR  R ++++Y            C FA C   P  F  A++ K W++A
Sbjct: 797  SSQIEESSSERTPR-FRSIQELYEVTENQENLTLFCLFAEC--EPMDFQKAIEKKTWRNA 853

Query: 325  MDSEMESIEKNDTWFLCELPAGKQAVGLKWVYKPKMNAQGEVVRLKARLVAKGYSQRQGV 146
            MD E++SI+KNDTW L  LP G +A+G+KWVYK K N++GEV R KARLVAKGYSQR G+
Sbjct: 854  MDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRVGI 913

Query: 145  DYDEVFSPVARLDTVRLVLALAAHAGWPVFHFDVKSAFLNGEIQEEVY 2
            DYDEVF+PVARL+TVRL+++LAA   W +   DVKSAFLNG+++EEVY
Sbjct: 914  DYDEVFAPVARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVY 961


>gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 1352

 Score =  342 bits (877), Expect = 8e-92
 Identities = 175/348 (50%), Positives = 235/348 (67%), Gaps = 12/348 (3%)
 Frame = -2

Query: 1009 REFTAPYSPQQNGTVERKNRTVVEMARSLLRSGGMPKRFWGEAVGTAVYLINRSPTRALP 830
            R+ T P SPQQNG VERKNRT++EMARS+L+S  +PK  W EAV  AVYL+NRSPT+++ 
Sbjct: 617  RQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVS 676

Query: 829  NRTPHEAWYGFKPIVSHLRVFGCLAYALIPSQKLQKLNDKSEKCVFIGYCLESKAYKLFN 650
             +TP EAW G KP VSHLRVFG +A+A +P +K  KL+DKSEK +FIGY   SK YKL+N
Sbjct: 677  GKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYN 736

Query: 649  PVSCKVIVSRDVVFHEYFRWNWEANSDEAPRL--LIEEDIEATGDAGAVNTHHETSSFED 476
            P + K I+SR++VF E   W+W +N ++        E++ E T +           +   
Sbjct: 737  PDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDEPEPTREEPPSEEPTTPPTSPT 796

Query: 475  QGHTDINAGDTPPRKTRLLEDVYN----------SCTFALCANPPDSFSNAVKIKGWKSA 326
                + ++ +  PR  R ++++Y            C FA C   P  F  A++ K W++A
Sbjct: 797  SSQIEESSSERTPR-FRSIQELYEVTENQENLTLFCLFAEC--EPMDFQKAIEKKTWRNA 853

Query: 325  MDSEMESIEKNDTWFLCELPAGKQAVGLKWVYKPKMNAQGEVVRLKARLVAKGYSQRQGV 146
            MD E++SI+KNDTW L  LP G +A+G+KWVYK K N++GEV R KARLVAKGYSQR G+
Sbjct: 854  MDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRVGI 913

Query: 145  DYDEVFSPVARLDTVRLVLALAAHAGWPVFHFDVKSAFLNGEIQEEVY 2
            DYDEVF+PVARL+TVRL+++LAA   W +   DVKSAFLNG+++EEVY
Sbjct: 914  DYDEVFAPVARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVY 961


>gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1333

 Score =  341 bits (874), Expect = 2e-91
 Identities = 179/352 (50%), Positives = 237/352 (67%), Gaps = 16/352 (4%)
 Frame = -2

Query: 1009 REFTAPYSPQQNGTVERKNRTVVEMARSLLRSGGMPKRFWGEAVGTAVYLINRSPTRALP 830
            RE TAPY+P+QNG  ERKNRTVVEMARS L++ G+P  FWGEAV T VY +N SPT+ + 
Sbjct: 593  RELTAPYTPEQNGVAERKNRTVVEMARSSLKAKGLPDYFWGEAVATVVYFLNISPTKDVW 652

Query: 829  NRTPHEAWYGFKPIVSHLRVFGCLAYALIPSQKLQKLNDKSEKCVFIGYCLESKAYKLFN 650
            N TP EAW G KP VSHLR+FGC+AYAL+      KL++KS KC+F+GY L+SKAY+L+N
Sbjct: 653  NTTPLEAWNGKKPRVSHLRIFGCIAYALVNFHS--KLDEKSTKCIFVGYSLQSKAYRLYN 710

Query: 649  PVSCKVIVSRDVVFHEYFRWNWEANSDEAPRLLIEEDIEATGDAG-AVNTHHETSSFEDQ 473
            P+S KVI+SR+VVF+E   WN+ + +  +   L+  D E+  D G + N+   +SS    
Sbjct: 711  PISGKVIISRNVVFNEDVSWNFNSGNMMSNIQLLPTDEESAVDFGNSPNSSPVSSSVSSP 770

Query: 472  -------GHTDINAGDTPPRKT--------RLLEDVYNSCTFALCANPPDSFSNAVKIKG 338
                      + +    P R++        +    V  SC FAL  + P  +  AV+   
Sbjct: 771  IAPSTTVAPDESSVEPIPLRRSTREKKPNPKYSNTVNTSCQFALLVSDPICYEEAVEQSE 830

Query: 337  WKSAMDSEMESIEKNDTWFLCELPAGKQAVGLKWVYKPKMNAQGEVVRLKARLVAKGYSQ 158
            WK+AM  E+++IE+N TW L + P GK  +GLKWV++ K NA G + + KARLVAKGYSQ
Sbjct: 831  WKNAMIEEIQAIERNSTWELVDAPEGKNVIGLKWVFRTKYNADGSIQKHKARLVAKGYSQ 890

Query: 157  RQGVDYDEVFSPVARLDTVRLVLALAAHAGWPVFHFDVKSAFLNGEIQEEVY 2
            +QGVD+DE FSPVAR +TVR+VLALAA    PV+ FDVKSAFLNG+++EEVY
Sbjct: 891  QQGVDFDETFSPVARFETVRVVLALAAQLHLPVYQFDVKSAFLNGDLEEEVY 942


>emb|CAA69272.1| lectin receptor kinase [Arabidopsis thaliana]
          Length = 623

 Score =  339 bits (870), Expect = 5e-91
 Identities = 173/348 (49%), Positives = 233/348 (66%), Gaps = 12/348 (3%)
 Frame = -2

Query: 1009 REFTAPYSPQQNGTVERKNRTVVEMARSLLRSGGMPKRFWGEAVGTAVYLINRSPTRALP 830
            R+ T P SPQQNG  ERKNRT++EMARS+L+S  +PK  W EAV  AVYL+NRSPT+++ 
Sbjct: 114  RQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVS 173

Query: 829  NRTPHEAWYGFKPIVSHLRVFGCLAYALIPSQKLQKLNDKSEKCVFIGYCLESKAYKLFN 650
             +TP EAW G KP VSHLRVFG +A+A +P +K  KL+DKSEK +FIGY   SK YKL+N
Sbjct: 174  GKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRNKLDDKSEKYIFIGYDNNSKGYKLYN 233

Query: 649  PVSCKVIVSRDVVFHEYFRWNWEANSDEAPRL--LIEEDIEATGDAGAVNTHHETSSFED 476
            P + K I+SR++VF E   W+W +N ++        E+  E T +           +   
Sbjct: 234  PDTKKTIISRNIVFDEEGEWDWNSNEEDYNFFPHFEEDKPEPTREEPPSEEPTTPPTSPT 293

Query: 475  QGHTDINAGDTPPRKTRLLEDVYN----------SCTFALCANPPDSFSNAVKIKGWKSA 326
                + ++ +  PR  R ++++Y            C FA C   P  F  A++ K W++A
Sbjct: 294  SSQIEESSSERTPR-FRSIQELYEVTENQENLTLFCLFAEC--EPMDFQEAIEKKTWRNA 350

Query: 325  MDSEMESIEKNDTWFLCELPAGKQAVGLKWVYKPKMNAQGEVVRLKARLVAKGYSQRQGV 146
            MD E++SI+KNDTW L  LP G +A+G+KWVYK K N++GEV R KARLVAKGYSQR G+
Sbjct: 351  MDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRAGI 410

Query: 145  DYDEVFSPVARLDTVRLVLALAAHAGWPVFHFDVKSAFLNGEIQEEVY 2
            DYDE+F+PVARL+TVRL+++LAA   W +   DVKSAFLNG+++EEVY
Sbjct: 411  DYDEIFAPVARLETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVY 458


Top