BLASTX nr result

ID: Dioscorea21_contig00004150 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00004150
         (1745 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274609.1| PREDICTED: uncharacterized protein LOC100248...   496   e-138
ref|XP_003535374.1| PREDICTED: uncharacterized protein LOC100794...   473   e-131
ref|XP_002522002.1| metalloprotease m41 ftsh, putative [Ricinus ...   471   e-130
ref|XP_003555576.1| PREDICTED: uncharacterized protein LOC100817...   469   e-129
ref|XP_002884441.1| predicted protein [Arabidopsis lyrata subsp....   457   e-126

>ref|XP_002274609.1| PREDICTED: uncharacterized protein LOC100248755 [Vitis vinifera]
            gi|298204855|emb|CBI34162.3| unnamed protein product
            [Vitis vinifera]
          Length = 1320

 Score =  496 bits (1278), Expect = e-138
 Identities = 255/511 (49%), Positives = 345/511 (67%), Gaps = 12/511 (2%)
 Frame = +1

Query: 247  LLDLARKPLAILIFSATVSFSPFVSGPLPAIAAPTLTST----------IEDELEIKNGD 396
            L+    +P+   +F   V F P     +PAIAAP  +            +E+  E+K+ D
Sbjct: 99   LVQCIARPIVFAVFCIAVGFFPTGRFQVPAIAAPVASDVMWKKKESGKVLEETKELKSKD 158

Query: 397  HEFSGYTXXXXXXXXXXXXXXXXXKSGNGDMNLVREALKMVKKKRREIQGEVLEKLNAEV 576
            H++S  T                 +SG  DM  V   L+ VK K+ E+Q E++ +L AE+
Sbjct: 159  HKYSDCTRSLLEVVSGLLRSIEEVRSGKADMKKVEAVLREVKLKKEELQEEIMNELYAEL 218

Query: 577  REWKREKAEVIKKSGEVLDMGFAARKERDRILKEDEVDGGTGXXXXXXXXXXXXXSVAED 756
            RE KREK  +  +S E++DM   A++E DR+L +   DG                S  ++
Sbjct: 219  RELKREKDGLSDRSEEIVDMVVKAKREHDRLLGKASGDGKK--IKEQIARLEESMSRLDE 276

Query: 757  EYNLLWEKVGEIEDRISRKETMTYSVAIRELSFIERECDILVDRCKIQLKDGDLAEQPMS 936
            EY  +WE++GEIEDRI R++TM  S+ IRELSFI RE + LV   + ++K G     P  
Sbjct: 277  EYAKIWERIGEIEDRILRRDTMAMSIGIRELSFITRESEQLVASFRREMKLGRTNSVPQG 336

Query: 937  YSSRLSKSDIQKELENAHKDYWEQLLLPTVLEAED-SEIFANDTIQSFALNVKRILEESQ 1113
             +++LS+SDIQK+LE A ++YWEQ++LP++LE ED   +F  D++  F L++K+ L+ES+
Sbjct: 337  SATKLSRSDIQKDLETAQREYWEQMILPSILEIEDLGPLFYRDSMD-FVLHIKQALKESR 395

Query: 1114 HMQRNLEAHFRQKLKKFGDEKRFLVHTP-EEALKGFPEVELKWMFGAKEVVVPKAASLHL 1290
             MQRN+EA  R+ +++FGDEKRF+V+TP +E +KGFPE+ELKWMFG KEVVVPKA S HL
Sbjct: 396  EMQRNMEARVRKNMRRFGDEKRFVVNTPTDEVVKGFPEIELKWMFGDKEVVVPKAISFHL 455

Query: 1291 FHGWKKWREEAKGNLKKDLLENEEYYKQYIANRQERILLDRERVMTRTWYNDERNRWEID 1470
            FHGWKKWREEAK +LK+ LLEN +  KQY+A RQE ILLDR+RV+ +TW+++E++RWE+D
Sbjct: 456  FHGWKKWREEAKADLKRTLLENVDLGKQYVAQRQEHILLDRDRVVAKTWFSEEKSRWEMD 515

Query: 1471 PVAVPYVISKKLIESARIRHDWAVMYIALKGDDKEYLVDIKELDLLFEDFGGFDGLYTKM 1650
            P+AVPY +SKKL+E ARIRHDWA MYIALKGDDKEY VDIKE ++LFED GGFDGLY KM
Sbjct: 516  PMAVPYAVSKKLVEHARIRHDWAAMYIALKGDDKEYYVDIKEFEVLFEDLGGFDGLYLKM 575

Query: 1651 LACGIPTTVQVMWISFSELDIHQQFLLASRL 1743
            LA GIPT V +M I FSEL+  +QF L  RL
Sbjct: 576  LAAGIPTAVHLMRIPFSELNFREQFFLIMRL 606


>ref|XP_003535374.1| PREDICTED: uncharacterized protein LOC100794385 [Glycine max]
          Length = 1246

 Score =  473 bits (1216), Expect = e-131
 Identities = 249/505 (49%), Positives = 337/505 (66%), Gaps = 6/505 (1%)
 Frame = +1

Query: 247  LLDLARKPLAILIFSATVSFSPFVSGPLP----AIAAP-TLTSTIEDELEIKNGDHEFSG 411
            ++ +  K L   +F   V FS   +   P    AIAAP T  +    E + +   H++S 
Sbjct: 26   IIRIITKKLVRALFCFAVGFSALGAFHAPPPAFAIAAPWTYWAKRGTEEKERAKSHQYSD 85

Query: 412  YTXXXXXXXXXXXXXXXXXKSGNGDMNLVREALKMVKKKRREIQGEVLEKLNAEVREWKR 591
             T                 ++GNGD++    AL+ VK K+ E++ E+  +L   ++  +R
Sbjct: 86   CTDRLLETVSFLLKTVDEVRNGNGDVSEAEAALEAVKSKKEEMRKEINGRLYPALKRLRR 145

Query: 592  EKAEVIKKSGEVLDMGFAARKERDRILKEDEVDGGTGXXXXXXXXXXXXXSVAEDEYNLL 771
            E+  + K+SGE++     A  E D+ LK                       V EDEYN +
Sbjct: 146  ERKALWKRSGEIVGEILNAMAEYDK-LKAKVAANEKENENARMKELEESVGVMEDEYNGV 204

Query: 772  WEKVGEIEDRISRKETMTYSVAIRELSFIERECDILVDRCKIQLKDGDLAEQPMSYSSRL 951
            WE+VGEIEDRISR+ET+  S  +RE++FIEREC+ LV+R K ++K+ D    P    +RL
Sbjct: 205  WERVGEIEDRISREETVALSYGVREINFIERECEQLVERFKREVKNKDFKSLPTGSVTRL 264

Query: 952  SKSDIQKELENAHKDYWEQLLLPTVLEAEDSEIFANDTIQSFALNVKRILEESQHMQRNL 1131
            SKS IQK+LE  H+   EQ++LP++L+ ED   F ++   +FA  + R L++S+  QRNL
Sbjct: 265  SKSAIQKDLETVHRKQAEQIILPSILDVEDLGPFFHEDSINFAQCLTRSLKDSREKQRNL 324

Query: 1132 EAHFRQKLKKFGDEKRFLVHTPEE-ALKGFPEVELKWMFGAKEVVVPKAASLHLFHGWKK 1308
            EA  R+K+KKFG EKR ++++PEE  +KGFPEVELKWMFG KEVV+PKA  LHL+HGWKK
Sbjct: 325  EAQIRKKMKKFGKEKRSIIYSPEEEVVKGFPEVELKWMFGNKEVVLPKAVGLHLYHGWKK 384

Query: 1309 WREEAKGNLKKDLLENEEYYKQYIANRQERILLDRERVMTRTWYNDERNRWEIDPVAVPY 1488
            WREEAK NLK++L+++ E+ +QY+A RQERILLDR+RV++RTWYN+E++RWEIDPVAVPY
Sbjct: 385  WREEAKANLKQNLIKDAEFGRQYVAERQERILLDRDRVVSRTWYNEEKSRWEIDPVAVPY 444

Query: 1489 VISKKLIESARIRHDWAVMYIALKGDDKEYLVDIKELDLLFEDFGGFDGLYTKMLACGIP 1668
             +SKKLIE  RIRHDW  MYIALKG+D+E+ VDIKE ++LFED GGFDGLY KMLACGIP
Sbjct: 445  AVSKKLIEHVRIRHDWGAMYIALKGEDEEFYVDIKEYEMLFEDLGGFDGLYMKMLACGIP 504

Query: 1669 TTVQVMWISFSELDIHQQFLLASRL 1743
            T V +MWI FSEL+I QQFLL  R+
Sbjct: 505  TAVHLMWIPFSELNIRQQFLLILRV 529


>ref|XP_002522002.1| metalloprotease m41 ftsh, putative [Ricinus communis]
            gi|223538806|gb|EEF40406.1| metalloprotease m41 ftsh,
            putative [Ricinus communis]
          Length = 1312

 Score =  471 bits (1211), Expect = e-130
 Identities = 247/504 (49%), Positives = 332/504 (65%), Gaps = 11/504 (2%)
 Frame = +1

Query: 265  KPLAILIFSATVSFSPFVSGPLPAIAAPTLTSTI----EDELEIKNGD------HEFSGY 414
            +P+   +F   + F    S P  A  A  + S +    + E E K  +      HE+S Y
Sbjct: 91   RPIVYALFCIAIGFCSVGSFPAYAAVAEQVASEVIELKKKEKEKKLNEEKYSKGHEYSDY 150

Query: 415  TXXXXXXXXXXXXXXXXXKSGNGDMNLVREALKMVKKKRREIQGEVLEKLNAEVREWKRE 594
            +                 +  NGD   V  ALK VK K+  +QG++LE L +EVRE K+E
Sbjct: 151  SRNLLAEVSVLLKCIEETRRRNGDSEEVDLALKAVKAKKEGLQGQILEGLYSEVRELKKE 210

Query: 595  KAEVIKKSGEVLDMGFAARKERDRILKEDEVDGGTGXXXXXXXXXXXXXSVAEDEYNLLW 774
            K  + K++ ++LD G  AR+E + +        G                V E+EY+ +W
Sbjct: 211  KESLEKRADKILDEGLKARREYETL--------GINAEKGRMEELEERMGVIEEEYSGVW 262

Query: 775  EKVGEIEDRISRKETMTYSVAIRELSFIERECDILVDRCKIQLKDGDLAEQPMSYSSRLS 954
            EKVGEIED I R+ETM  SV IREL FIEREC+ LV R   +++         S  ++LS
Sbjct: 263  EKVGEIEDAILRRETMAMSVGIRELCFIERECEELVKRFNQEMRRKSKESPRSSSITKLS 322

Query: 955  KSDIQKELENAHKDYWEQLLLPTVLEAEDSEIFANDTIQSFALNVKRILEESQHMQRNLE 1134
            KS+IQ+ELE A +   EQ +LPT++E +      +  + +F++ +K+ L++S+ +Q++LE
Sbjct: 323  KSEIQRELETAQRKLLEQKILPTLVEVDGFGPLFDQDLVNFSICIKQGLKDSRKLQKDLE 382

Query: 1135 AHFRQKLKKFGDEKRFLVHTP-EEALKGFPEVELKWMFGAKEVVVPKAASLHLFHGWKKW 1311
            A  R+K+KKFGDEKR +V TP  E +KGFPEVELKWMFG KEV+VPKA  LHL+HGWKKW
Sbjct: 383  ARVRKKMKKFGDEKRLIVMTPANEVVKGFPEVELKWMFGNKEVLVPKAIRLHLYHGWKKW 442

Query: 1312 REEAKGNLKKDLLENEEYYKQYIANRQERILLDRERVMTRTWYNDERNRWEIDPVAVPYV 1491
            RE+AK NLK++LLE+ ++ KQY+A  QERILLDR+RV+++TWYN+E+NRWE+DP+AVPY 
Sbjct: 443  REDAKANLKRNLLEDVDFAKQYVAQIQERILLDRDRVVSKTWYNEEKNRWEMDPIAVPYA 502

Query: 1492 ISKKLIESARIRHDWAVMYIALKGDDKEYLVDIKELDLLFEDFGGFDGLYTKMLACGIPT 1671
            +SKKL+E ARIRHDW  MY+ALK DDKEY VDIKE D+L+EDFGGFDGLY KMLA  IPT
Sbjct: 503  VSKKLVEHARIRHDWGAMYLALKADDKEYYVDIKEFDMLYEDFGGFDGLYMKMLAQDIPT 562

Query: 1672 TVQVMWISFSELDIHQQFLLASRL 1743
             V +MWI FSEL++HQQFLL +RL
Sbjct: 563  AVHLMWIPFSELNLHQQFLLIARL 586


>ref|XP_003555576.1| PREDICTED: uncharacterized protein LOC100817872 [Glycine max]
          Length = 1274

 Score =  469 bits (1207), Expect = e-129
 Identities = 248/518 (47%), Positives = 338/518 (65%), Gaps = 5/518 (0%)
 Frame = +1

Query: 205  IVSTSRTPPNPKLQLLDLARKPLAILIFSATVSFSPFVSGPLP----AIAAPTLTSTIED 372
            I   +++ P+P     D+  K L   +F   V FS   +   P    AIAAP        
Sbjct: 46   ITFAAKSTPSPND---DVLFKRLVRALFCFAVGFSALGAFRAPPPAFAIAAPWTYWGKRG 102

Query: 373  ELEIKNGDHEFSGYTXXXXXXXXXXXXXXXXXKSGNGDMNLVREALKMVKKKRREIQGEV 552
              + +   H++S  T                 + GNG++N V  AL+ VK K+ E++ E+
Sbjct: 103  AEKERAKSHQYSDCTDRLLETVSFLLKTVDEVREGNGEVNEVEAALESVKSKKEELRKEI 162

Query: 553  LEKLNAEVREWKREKAEVIKKSGEVLDMGFAARKERDRILKEDEVDGGTGXXXXXXXXXX 732
              +L   ++  +RE+  + K+SGE++     A  E +++  +     G            
Sbjct: 163  NGRLYPALKRLRRERKALWKRSGEIVGEILKATAEYEKLKVKV---AGNEKENARMKELE 219

Query: 733  XXXSVAEDEYNLLWEKVGEIEDRISRKETMTYSVAIRELSFIERECDILVDRCKIQLKDG 912
                V EDEYN +WE+VGEIEDRISR+ET+  S  +RE++FIEREC+ LV+R K ++K+ 
Sbjct: 220  ESVGVMEDEYNGVWERVGEIEDRISREETVALSYGVREINFIERECEQLVERFKREIKNK 279

Query: 913  DLAEQPMSYSSRLSKSDIQKELENAHKDYWEQLLLPTVLEAEDSEIFANDTIQSFALNVK 1092
            D    P    +RLSKS IQK+LE  H+   EQ++LP++L+ ED   F ++   +FA  + 
Sbjct: 280  DFKSLPTGSVTRLSKSVIQKDLETVHRKQAEQIILPSILDVEDLWPFFHEDSINFAQRLT 339

Query: 1093 RILEESQHMQRNLEAHFRQKLKKFGDEKRFLVHTPEE-ALKGFPEVELKWMFGAKEVVVP 1269
            R L++S+  QRNLEA  R+K+KKFG EK  ++++PEE  +KGFPEVELKWMFG KEVV+P
Sbjct: 340  RSLKDSREKQRNLEAQIRKKMKKFGKEKHSIIYSPEEEVVKGFPEVELKWMFGNKEVVLP 399

Query: 1270 KAASLHLFHGWKKWREEAKGNLKKDLLENEEYYKQYIANRQERILLDRERVMTRTWYNDE 1449
            KA  LHL+HGWKKWREEAK NLK++L+++ E+ +QY+A RQERILLDR+RV++RTWYN+ 
Sbjct: 400  KAVGLHLYHGWKKWREEAKANLKQNLIKDAEFGRQYVAERQERILLDRDRVVSRTWYNEG 459

Query: 1450 RNRWEIDPVAVPYVISKKLIESARIRHDWAVMYIALKGDDKEYLVDIKELDLLFEDFGGF 1629
            +NRWEIDPVAVPY +SKKLIE  RIRHDW  MYI LKG+D+E+ VDIKE ++LFED GGF
Sbjct: 460  KNRWEIDPVAVPYAVSKKLIEHVRIRHDWGAMYITLKGEDEEFYVDIKEYEMLFEDLGGF 519

Query: 1630 DGLYTKMLACGIPTTVQVMWISFSELDIHQQFLLASRL 1743
            DGLY KMLACGIPT V +MWI FSEL+I QQFLL  R+
Sbjct: 520  DGLYMKMLACGIPTAVHLMWIPFSELNIRQQFLLILRV 557


>ref|XP_002884441.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297330281|gb|EFH60700.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 1294

 Score =  457 bits (1177), Expect = e-126
 Identities = 235/509 (46%), Positives = 334/509 (65%), Gaps = 10/509 (1%)
 Frame = +1

Query: 247  LLDLARKPLAILIFSATVSFSPFVSGPLPAIAAPTLTSTI---------EDELEIKNGDH 399
            ++    KPL  ++F   + FSP  S   PA+A P ++  I         E E+ +K  DH
Sbjct: 107  VIQFVSKPLVYVLFCIAIGFSPIHSFQAPALAVPFVSDVIWKKKKETLKEKEVVLKAVDH 166

Query: 400  EFSGYTXXXXXXXXXXXXXXXXXKSGNGDMNLVREALKMVKKKRREIQGEVLEKLNAEVR 579
            EFS YT                 +  NGD+  V  AL  VK ++ ++Q E++  L  ++R
Sbjct: 167  EFSDYTRRLLETVSVLLKTIDKVRKENGDVAEVGTALDTVKVEKEKLQKEIMTGLYRDMR 226

Query: 580  EWKREKAEVIKKSGEVLDMGFAARKERDRILKEDEVDGGTGXXXXXXXXXXXXXSVAEDE 759
              ++E+  ++K++  ++D     +KE +++L++   +                  + E E
Sbjct: 227  RLRKERDVLMKRADGIVDEALRLKKESEKLLRKGARE--------KVEKLEESVDIMETE 278

Query: 760  YNLLWEKVGEIEDRISRKETMTYSVAIRELSFIERECDILVDRCKIQLKDGDLAEQPMSY 939
            YN +WE++ EI D I +KET T S  +REL FIEREC  LV     +         P S 
Sbjct: 279  YNKIWERIDEIVDIILKKETTTLSFGVRELIFIERECVELVKSFNRETNQKSSESAPESS 338

Query: 940  SSRLSKSDIQKELENAHKDYWEQLLLPTVLEAEDSEIFANDTIQSFALNVKRILEESQHM 1119
             ++LS+S+I++EL NA + + EQ++LP VLE E+ + F +     F+L +K+ LEES+ +
Sbjct: 339  ITKLSRSEIKQELVNAQRKHLEQMILPNVLELEEVDPFFDRDSVDFSLRIKKRLEESKKL 398

Query: 1120 QRNLEAHFRQKLKKFGDEKRFLVHTPE-EALKGFPEVELKWMFGAKEVVVPKAASLHLFH 1296
            QR+L+   R+++KKFG+EK F+  TP  EA+KGFPE E+KWMFG KEVVVPKA  LHL H
Sbjct: 399  QRDLQNRIRKRMKKFGEEKLFVQKTPVGEAVKGFPEAEVKWMFGDKEVVVPKAIQLHLRH 458

Query: 1297 GWKKWREEAKGNLKKDLLENEEYYKQYIANRQERILLDRERVMTRTWYNDERNRWEIDPV 1476
            GWKKW+EEAK +LK+ LLE+ ++ KQYIA RQE++LLDR+RV+++TWYN+++NRWE+DP+
Sbjct: 459  GWKKWQEEAKADLKQKLLEDVDFGKQYIAQRQEQVLLDRDRVVSKTWYNEDKNRWEMDPM 518

Query: 1477 AVPYVISKKLIESARIRHDWAVMYIALKGDDKEYLVDIKELDLLFEDFGGFDGLYTKMLA 1656
            AVPY +S+KLI+SARIRHD+AVMY+ALKGDDKEY VDIKE ++LFE FGGFD LY KMLA
Sbjct: 519  AVPYAVSRKLIDSARIRHDYAVMYVALKGDDKEYYVDIKEYEMLFEKFGGFDALYLKMLA 578

Query: 1657 CGIPTTVQVMWISFSELDIHQQFLLASRL 1743
            CGIPT+V +MWI  SEL + QQFLLA+R+
Sbjct: 579  CGIPTSVHLMWIPMSELSLQQQFLLATRV 607

