BLASTX nr result

ID: Dioscorea21_contig00005845 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00005845
         (1352 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002264528.1| PREDICTED: galacturonokinase [Vitis vinifera...   507   e-141
gb|AEY11272.1| GALK [Morus alba var. multicaulis]                     497   e-138
ref|XP_004149677.1| PREDICTED: galacturonokinase-like [Cucumis s...   486   e-135
ref|NP_187681.2| galactokinase [Arabidopsis thaliana] gi|7530444...   480   e-133
ref|XP_002884808.1| GHMP kinase family protein [Arabidopsis lyra...   478   e-132

>ref|XP_002264528.1| PREDICTED: galacturonokinase [Vitis vinifera]
            gi|296090474|emb|CBI40670.3| unnamed protein product
            [Vitis vinifera]
          Length = 436

 Score =  507 bits (1305), Expect = e-141
 Identities = 262/407 (64%), Positives = 311/407 (76%), Gaps = 4/407 (0%)
 Frame = +3

Query: 144  SWPSTAEVNAVKERVVQMSGGNIGDVRIVVSPYRICPLGAHIDHQGGIVSAMTINKGIIL 323
            SWPS  E++ V++ V +M+G N  +VR+VVSPYRICPLGAHIDHQGG+VSA+T+NKGI+L
Sbjct: 5    SWPSQEELDRVRKVVAEMAGRNSKEVRVVVSPYRICPLGAHIDHQGGVVSAVTVNKGILL 64

Query: 324  GFIPSDDGQIILQSGQFDGDVRFKVDVSQLPRSSTATQE----NGASKSCNKELDWGCYA 491
            GFIPS D Q++LQSGQF G+VRF+VD  Q PR S    +    NG+SKS  +E DWG YA
Sbjct: 65   GFIPSGDSQVLLQSGQFKGEVRFRVDEIQHPRHSALKNDKIITNGSSKS-KEECDWGRYA 123

Query: 492  TGAVYALQRRGFHLDKGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXENANSLIVSQTDN 671
             GA+YALQ R  HL +GIIGFI                         ENAN+L VS  +N
Sbjct: 124  RGALYALQSRENHLSQGIIGFINGSEGLDSSGLSSSAATGIAYLLALENANNLTVSPMEN 183

Query: 672  IELDRLIENEYLGLRNGILDQSAVLLSKYGCLMRMNCKTKEHNLVRLFEMEGNQRLNGHG 851
            IE DRLIEN YLGLRNGILDQSA+LLS YGCL  MNCKTKEH LVR  ++  NQ  +   
Sbjct: 184  IEYDRLIENGYLGLRNGILDQSAILLSSYGCLTFMNCKTKEHKLVRP-KLLKNQEADMLK 242

Query: 852  AYKILLAFSGLKHALANNPGYNSRVSECQEAARLLLCASGDEDVEPLLCNVDPSTYEAHK 1031
            ++KILLA SGLKHAL NNPGYN+RV+EC+EAAR+LL ASG++ +EPLL NV+P  YEAHK
Sbjct: 243  SFKILLALSGLKHALTNNPGYNNRVAECEEAARVLLHASGNDKLEPLLSNVEPEAYEAHK 302

Query: 1032 AGLEPNLAKRAKHYFTENFRVKEGLKAWASGELETFGKLISASGLSSIENYECGCEPMIQ 1211
              LE  LA+RA+HYF+EN RV +GL+AWASG LE FGKLI++SGLSSI+NYECG EP+IQ
Sbjct: 303  GKLEATLARRAEHYFSENMRVIKGLEAWASGNLEDFGKLITSSGLSSIKNYECGAEPLIQ 362

Query: 1212 LYQILVRAPGVYGARFSGAGFRGCCLALVDADRAEEAASFVKLEYPK 1352
            LY+ILVRAPGVYGARFSGAGFRGCC+A VDA RA EAASFV+ EY K
Sbjct: 363  LYEILVRAPGVYGARFSGAGFRGCCIAFVDASRAVEAASFVRDEYYK 409


>gb|AEY11272.1| GALK [Morus alba var. multicaulis]
          Length = 431

 Score =  497 bits (1280), Expect = e-138
 Identities = 250/406 (61%), Positives = 305/406 (75%), Gaps = 3/406 (0%)
 Frame = +3

Query: 144  SWPSTAEVNAVKERVVQMSGGNIGDVRIVVSPYRICPLGAHIDHQGGIVSAMTINKGIIL 323
            SWPS +E+N V+E V +M+G    +VR+V SPYRICPLGAHIDHQGG VSAMTINKGI+L
Sbjct: 5    SWPSQSELNEVREIVSKMAGRGTEEVRVVASPYRICPLGAHIDHQGGTVSAMTINKGILL 64

Query: 324  GFIPSDDGQIILQSGQFDGDVRFKVDVSQLPRSSTATQENGASKSCNK---ELDWGCYAT 494
            GF+PS D Q++L+SGQF G+VRF VD +Q    + A      +   +K   E +WG Y  
Sbjct: 65   GFVPSGDSQVVLRSGQFKGEVRFSVDEAQDSGHANAMNNKIDANDSSKIRDECNWGNYPR 124

Query: 495  GAVYALQRRGFHLDKGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXENANSLIVSQTDNI 674
            GA+YALQR+G HL +G+IG+IC                        ENAN+L+V+  +NI
Sbjct: 125  GALYALQRKGNHLSQGLIGYICGSEGLDCSGLSSSAAVGVACLLALENANNLMVTPEENI 184

Query: 675  ELDRLIENEYLGLRNGILDQSAVLLSKYGCLMRMNCKTKEHNLVRLFEMEGNQRLNGHGA 854
            E DRLIENEYLGL+NGILDQSAVLLSKYG L+ MNCKTKEH L++      N+ +  H A
Sbjct: 185  EYDRLIENEYLGLKNGILDQSAVLLSKYGYLLCMNCKTKEHKLIK------NENIEPHTA 238

Query: 855  YKILLAFSGLKHALANNPGYNSRVSECQEAARLLLCASGDEDVEPLLCNVDPSTYEAHKA 1034
            YKILLAFSGLKHAL NNPGYN RVSECQEAAR+L  ASG   VEPLL +++P  Y+ HK 
Sbjct: 239  YKILLAFSGLKHALTNNPGYNHRVSECQEAARILSHASGIGKVEPLLSDIEPEAYQRHKN 298

Query: 1035 GLEPNLAKRAKHYFTENFRVKEGLKAWASGELETFGKLISASGLSSIENYECGCEPMIQL 1214
             L+PN+AKRA+HYF+EN RV +GL+ WASG LE  G+LI+ASGLSSI+NYECG EP+IQL
Sbjct: 299  KLQPNIAKRAEHYFSENLRVNKGLEFWASGNLEDLGRLITASGLSSIKNYECGSEPLIQL 358

Query: 1215 YQILVRAPGVYGARFSGAGFRGCCLALVDADRAEEAASFVKLEYPK 1352
            Y+IL+RAPGV+GARFSGAGFRGCCLALVD++ A+EAASFV+ EY K
Sbjct: 359  YEILLRAPGVFGARFSGAGFRGCCLALVDSNHADEAASFVRREYRK 404


>ref|XP_004149677.1| PREDICTED: galacturonokinase-like [Cucumis sativus]
            gi|449507367|ref|XP_004163011.1| PREDICTED:
            galacturonokinase-like [Cucumis sativus]
          Length = 437

 Score =  486 bits (1250), Expect = e-135
 Identities = 248/407 (60%), Positives = 297/407 (72%), Gaps = 4/407 (0%)
 Frame = +3

Query: 144  SWPSTAEVNAVKERVVQMSGGNIGDVRIVVSPYRICPLGAHIDHQGGIVSAMTINKGIIL 323
            SWPS  E+N +K  V +MS  +  DVR+VVSPYRICPLGAHIDHQGG VSAM INKG++L
Sbjct: 5    SWPSEEELNGIKTIVSEMSKRSKEDVRVVVSPYRICPLGAHIDHQGGNVSAMAINKGVLL 64

Query: 324  GFIPSDDGQIILQSGQFDGDVRFKVDVSQLPRSST----ATQENGASKSCNKELDWGCYA 491
            GF+PS D Q++L+S QF GDV F+VD    P   +     T ENG +K   ++ +WG YA
Sbjct: 65   GFVPSGDVQVVLRSAQFKGDVNFRVDEKLYPNHCSNKKEGTNENGHAK-LQEDNNWGRYA 123

Query: 492  TGAVYALQRRGFHLDKGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXENANSLIVSQTDN 671
             GAVYALQ +   L +GIIG+I                         ENAN+L +S T+N
Sbjct: 124  RGAVYALQEKEHCLSQGIIGYIYGSDGLDSSGLSSSAAVGLAYLLALENANNLTISPTEN 183

Query: 672  IELDRLIENEYLGLRNGILDQSAVLLSKYGCLMRMNCKTKEHNLVRLFEMEGNQRLNGHG 851
            IE DRLIEN YLGLRNGILDQSA+LLS YGCL+ MNCKTK+  L+R  +ME + +     
Sbjct: 184  IEYDRLIENGYLGLRNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLDMESSLKSEKQK 243

Query: 852  AYKILLAFSGLKHALANNPGYNSRVSECQEAARLLLCASGDEDVEPLLCNVDPSTYEAHK 1031
             Y+ILLAFSGLK AL NNPGYN RV+ECQEAA++LL ASG+  +EPLLCNVD   Y+AHK
Sbjct: 244  EYQILLAFSGLKQALTNNPGYNHRVAECQEAAKILLNASGNSHMEPLLCNVDQEAYKAHK 303

Query: 1032 AGLEPNLAKRAKHYFTENFRVKEGLKAWASGELETFGKLISASGLSSIENYECGCEPMIQ 1211
            + LEPNLAKRA+HYF+EN RV +GL+AWASG LE FGKLI+ SG SSI NYECG EP++Q
Sbjct: 304  SQLEPNLAKRAEHYFSENTRVLQGLEAWASGRLEDFGKLIADSGRSSIVNYECGAEPLVQ 363

Query: 1212 LYQILVRAPGVYGARFSGAGFRGCCLALVDADRAEEAASFVKLEYPK 1352
            LY+IL+RAPGV GARFSGAGFRGCCLALVD + A EAA FV+ EY K
Sbjct: 364  LYEILLRAPGVCGARFSGAGFRGCCLALVDVEYATEAAEFVRTEYMK 410


>ref|NP_187681.2| galactokinase [Arabidopsis thaliana]
            gi|75304441|sp|Q8VYG2.1|GALAK_ARATH RecName:
            Full=Galacturonokinase; AltName: Full=D-galacturonic
            acid-1-P kinase gi|18175773|gb|AAL59925.1| putative
            galactokinase [Arabidopsis thaliana]
            gi|20465755|gb|AAM20366.1| putative galactokinase
            [Arabidopsis thaliana] gi|215276406|gb|ACJ65066.1|
            D-galacturonic acid-1-P kinase [Arabidopsis thaliana]
            gi|332641423|gb|AEE74944.1| galactokinase [Arabidopsis
            thaliana]
          Length = 424

 Score =  480 bits (1235), Expect = e-133
 Identities = 252/405 (62%), Positives = 302/405 (74%), Gaps = 2/405 (0%)
 Frame = +3

Query: 144  SWPSTAEVNAVKERVVQMSGGNIGDVRIVVSPYRICPLGAHIDHQGGIVSAMTINKGIIL 323
            SWP+ +E+N++KE V QMSG + G+VR+VV+PYRICPLGAHIDHQGG VSAMTINKGI+L
Sbjct: 2    SWPTDSELNSIKEAVAQMSGRDKGEVRVVVAPYRICPLGAHIDHQGGTVSAMTINKGILL 61

Query: 324  GFIPSDDGQIILQSGQFDGDVRFKVDVSQLPRSSTATQENGASK-SCNKELD-WGCYATG 497
            GF+PS D Q+ L+S QF+G+V F+VD  Q P       +NGAS  S +KE   WG YA G
Sbjct: 62   GFVPSGDTQVQLRSAQFEGEVCFRVDEIQHPIG--LANKNGASTPSPSKEKSIWGTYARG 119

Query: 498  AVYALQRRGFHLDKGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXENANSLIVSQTDNIE 677
            AVYALQ    +L +GIIG++                         ENAN L VS T+NIE
Sbjct: 120  AVYALQSSKKNLKQGIIGYLSGSNGLDSSGLSSSAAVGVAYLLALENANELTVSPTENIE 179

Query: 678  LDRLIENEYLGLRNGILDQSAVLLSKYGCLMRMNCKTKEHNLVRLFEMEGNQRLNGHGAY 857
             DRLIEN YLGLRNGILDQSA+LLS YGCL  M+CKT +H LV+  E+E          +
Sbjct: 180  YDRLIENGYLGLRNGILDQSAILLSNYGCLTYMDCKTLDHELVQAPELEK--------PF 231

Query: 858  KILLAFSGLKHALANNPGYNSRVSECQEAARLLLCASGDEDVEPLLCNVDPSTYEAHKAG 1037
            +ILLAFSGL+ AL  NPGYN RVSECQEAA++LL ASG+ ++EP LCNV+ + YEAHK  
Sbjct: 232  RILLAFSGLRQALTTNPGYNLRVSECQEAAKVLLTASGNSELEPTLCNVEHAVYEAHKHE 291

Query: 1038 LEPNLAKRAKHYFTENFRVKEGLKAWASGELETFGKLISASGLSSIENYECGCEPMIQLY 1217
            L+P LAKRA+HYF+EN RV +G +AWASG LE FGKLISASGLSSIENYECG EP+IQLY
Sbjct: 292  LKPVLAKRAEHYFSENMRVIKGREAWASGNLEEFGKLISASGLSSIENYECGAEPLIQLY 351

Query: 1218 QILVRAPGVYGARFSGAGFRGCCLALVDADRAEEAASFVKLEYPK 1352
            +IL++APGVYGARFSGAGFRGCCLA VDA++AE AAS+VK EY K
Sbjct: 352  KILLKAPGVYGARFSGAGFRGCCLAFVDAEKAEAAASYVKDEYEK 396


>ref|XP_002884808.1| GHMP kinase family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297330648|gb|EFH61067.1| GHMP kinase family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 424

 Score =  478 bits (1231), Expect = e-132
 Identities = 251/405 (61%), Positives = 301/405 (74%), Gaps = 2/405 (0%)
 Frame = +3

Query: 144  SWPSTAEVNAVKERVVQMSGGNIGDVRIVVSPYRICPLGAHIDHQGGIVSAMTINKGIIL 323
            SWP+ +E+ ++KE V QMSG + G+VR+VV+PYRICPLGAHIDHQGG VSAMTINKGI+L
Sbjct: 2    SWPTDSELISIKEAVAQMSGRDKGEVRVVVAPYRICPLGAHIDHQGGTVSAMTINKGILL 61

Query: 324  GFIPSDDGQIILQSGQFDGDVRFKVDVSQLPRSSTATQENGASK-SCNKELD-WGCYATG 497
            GF+PS D Q+ L+S QF+G+V F+VD  Q P       +NGAS  S +KE   WG YA G
Sbjct: 62   GFVPSGDTQVQLRSAQFEGEVCFRVDEIQHPIG--LANKNGASTPSPSKEKSIWGTYARG 119

Query: 498  AVYALQRRGFHLDKGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXENANSLIVSQTDNIE 677
            AVYALQ    +L +GI+G++                         ENAN L VS T+NIE
Sbjct: 120  AVYALQTSKKNLKQGIVGYLSGSNGLDSSGLSSSAAVGVAYLLALENANELTVSPTENIE 179

Query: 678  LDRLIENEYLGLRNGILDQSAVLLSKYGCLMRMNCKTKEHNLVRLFEMEGNQRLNGHGAY 857
             DRLIEN YLGLRNGILDQSA+LLS YGCL  M+CKT +H LV+  E+E          +
Sbjct: 180  YDRLIENRYLGLRNGILDQSAILLSSYGCLTYMDCKTMDHELVQAPELEK--------PF 231

Query: 858  KILLAFSGLKHALANNPGYNSRVSECQEAARLLLCASGDEDVEPLLCNVDPSTYEAHKAG 1037
            KILLAFSGL+ AL  NPGYN RVSECQEAA++LL ASG+ ++EP LCNV+ + YEAHK  
Sbjct: 232  KILLAFSGLRQALTTNPGYNLRVSECQEAAKVLLTASGNSELEPTLCNVEHAVYEAHKHE 291

Query: 1038 LEPNLAKRAKHYFTENFRVKEGLKAWASGELETFGKLISASGLSSIENYECGCEPMIQLY 1217
            L+P LAKRA+HYF+EN RV +G +AWASG LE FGKLISASGLSSIENYECG EP+IQLY
Sbjct: 292  LKPVLAKRAEHYFSENMRVIKGREAWASGNLEEFGKLISASGLSSIENYECGAEPLIQLY 351

Query: 1218 QILVRAPGVYGARFSGAGFRGCCLALVDADRAEEAASFVKLEYPK 1352
            +IL++APGVYGARFSGAGFRGCCLA VDA++AE AAS+VK EY K
Sbjct: 352  KILLKAPGVYGARFSGAGFRGCCLAFVDAEKAEAAASYVKDEYEK 396

