BLASTX nr result
ID: Akebia27_contig00026344
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00026344 (1029 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|AHC32019.1| galacturonokinase [Camellia sinensis]                 355   2e-95
ref|XP_006361588.1| PREDICTED: LOW QUALITY PROTEIN: galacturonok...  349   1e-93
ref|XP_004242872.1| PREDICTED: galacturonokinase-like isoform 2 ...  349   1e-93
ref|XP_002264528.1| PREDICTED: galacturonokinase [Vitis vinifera...  342   1e-91
ref|XP_007014469.1| Galacturonic acid kinase isoform 2 [Theobrom...  340   4e-91
gb|EXB40717.1| hypothetical protein L484_007300 [Morus notabilis]    338   2e-90
ref|XP_004242871.1| PREDICTED: galacturonokinase-like isoform 1 ...  338   2e-90
ref|XP_004299776.1| PREDICTED: galacturonokinase-like [Fragaria ...  334   4e-89
gb|AEY11272.1| GALK [Morus alba var. multicaulis]                    333   7e-89
ref|XP_007014468.1| Galacturonic acid kinase isoform 1 [Theobrom...  332   1e-88
ref|XP_006850937.1| hypothetical protein AMTR_s00025p00189780 [A...  323   5e-86
ref|XP_002514384.1| galactokinase, putative [Ricinus communis] g...  322   1e-85
gb|EYU25741.1| hypothetical protein MIMGU_mgv1a006821mg [Mimulus...  321   3e-85
ref|XP_004149677.1| PREDICTED: galacturonokinase-like [Cucumis s...  318   2e-84
ref|XP_006453202.1| hypothetical protein CICLE_v10008332mg [Citr...  318   2e-84
ref|XP_006381135.1| hypothetical protein POPTR_0006s06980g [Popu...  314   3e-83
ref|XP_003525312.1| PREDICTED: galacturonokinase-like isoform X1...  312   2e-82
ref|XP_003630628.1| Galactokinase [Medicago truncatula] gi|35552...  312   2e-82
ref|XP_003630627.1| Galactokinase [Medicago truncatula] gi|35552...
312 2e-82 ref|XP_007014471.1| Galacturonic acid kinase isoform 4 [Theobrom... 311 4e-82 >gb|AHC32019.1| galacturonokinase [Camellia sinensis] Length = 436 Score = 355 bits (911), Expect = 2e-95 Identities = 183/284 (64%), Positives = 205/284 (72%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 MG WPS+ ELD +RK+VAEM SPYRICPLGAHIDHQGGTVSAMTI++ Sbjct: 1 MGELPWPSKAELDGLRKMVAEMAGKGTEKVGVVVSPYRICPLGAHIDHQGGTVSAMTINR 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF+ SGD QVLL SGQF+GEVRF +DEI+HP H + K + S EE NWG Sbjct: 61 GILLGFVPSGDSQVLLCSGQFKGEVRFSVDEIKHPKHFVKENDKINGSGSSKQQEECNWG 120 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 NYARGA+YALQ GN LTQGI+GF+C SANNL +SP Sbjct: 121 NYARGAIYALQSRGNHLTQGIVGFVCGSEDLDSSGLSSSAAVGIAYLLAFESANNLAMSP 180 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 ENIE DRLIENEYLGL+NGILDQSAILLSSYGCLTCMNCKTKEHKL+HP K Q N E+ Sbjct: 181 AENIEYDRLIENEYLGLKNGILDQSAILLSSYGCLTCMNCKTKEHKLVHPPKFQ-NHETG 239 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGN 1019 KAYK+LLAFSGLKQALT+NPGYN RV EC+ AAR LL +SGN Sbjct: 240 IKKAYKVLLAFSGLKQALTSNPGYNHRVAECQAAARFLLKSSGN 283 >ref|XP_006361588.1| PREDICTED: LOW QUALITY PROTEIN: galacturonokinase-like [Solanum tuberosum] Length = 444 Score = 349 bits (895), Expect = 1e-93 Identities = 183/289 (63%), Positives = 207/289 (71%), Gaps = 2/289 (0%) Frame = +3 Query: 168 MGVSS--WPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTI 341 MGV S WPSE ELD++RK VAE+ SPYRICPLGAHIDHQGG VSAMTI Sbjct: 1 MGVVSGHWPSESELDKIRKKVAELSGRDAQEVRVVVSPYRICPLGAHIDHQGGAVSAMTI 60 Query: 342 DKGILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENN 521 +KGILLGF+ S D QV L+SGQF+GEVR RIDE+Q P H++ T+G +D S EE Sbjct: 61 NKGILLGFVPSDDTQVTLQSGQFEGEVRLRIDEVQLPKHMSETNGLTEQMDSSTPQEECK 120 Query: 522 WGNYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLIL 701 WGNYARGA+YALQ GN L GI GFIC SAN L++ Sbjct: 121 
WGNYARGAIYALQSKGNHLKTGITGFICGSEGLDSSGLSSSAAVGIAYLLAFESANGLVV 180 Query: 702 SPTENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQE 881 SPTENIE DRLIENEYLGL+NGILDQSAILLSSYGCLT MNCKT +HKLIHP +Q N E Sbjct: 181 SPTENIEYDRLIENEYLGLKNGILDQSAILLSSYGCLTFMNCKTIKHKLIHPPTVQNNHE 240 Query: 882 SKDLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 + AYKILLAFSGLKQALTTNPGYNRRV EC+EAA+ILL ASG+ ++ Sbjct: 241 GELGNAYKILLAFSGLKQALTTNPGYNRRVAECQEAAKILLQASGDEEM 289 >ref|XP_004242872.1| PREDICTED: galacturonokinase-like isoform 2 [Solanum lycopersicum] Length = 444 Score = 349 bits (895), Expect = 1e-93 Identities = 183/289 (63%), Positives = 207/289 (71%), Gaps = 2/289 (0%) Frame = +3 Query: 168 MGVSS--WPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTI 341 MGV S WPSE ELD++R VAE+ SPYRICPLGAHIDHQGGTVSAMTI Sbjct: 1 MGVFSGHWPSESELDKIRNKVAELSGRDAQEVMVVVSPYRICPLGAHIDHQGGTVSAMTI 60 Query: 342 DKGILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENN 521 +KGILLGF+ S D QV L+SGQF+GEVR RIDE+Q P H+ GT+G +D S EE Sbjct: 61 NKGILLGFVPSDDTQVTLQSGQFEGEVRLRIDEVQLPKHMYGTNGLTEQMDSSPPQEEWK 120 Query: 522 WGNYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLIL 701 WGNYARGA+YALQ GN L GI GFIC SAN L++ Sbjct: 121 WGNYARGAIYALQSKGNHLKTGITGFICGSEGLDSSGLSSSAAVGVAYLLAFESANGLVV 180 Query: 702 SPTENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQE 881 SPTENIE DRLIENEYLGL+NGILDQSAILLSSYGCLT MNCKT +HKLIHP ++ N E Sbjct: 181 SPTENIEYDRLIENEYLGLKNGILDQSAILLSSYGCLTFMNCKTIKHKLIHPPTVENNHE 240 Query: 882 SKDLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 + AYKILLAFSGLKQALTTNPGYNRRV EC+EAA+ILL ASG+ ++ Sbjct: 241 GEFGNAYKILLAFSGLKQALTTNPGYNRRVAECQEAAKILLQASGDEEM 289 >ref|XP_002264528.1| PREDICTED: galacturonokinase [Vitis vinifera] gi|296090474|emb|CBI40670.3| unnamed protein product [Vitis vinifera] Length = 436 Score = 342 bits (877), Expect = 1e-91 Identities = 183/283 (64%), Positives = 202/283 (71%) Frame = +3 Query: 180 SWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDKGILL 359 SWPS+ 
ELDRVRK+VAEM SPYRICPLGAHIDHQGG VSA+T++KGILL Sbjct: 5 SWPSQEELDRVRKVVAEMAGRNSKEVRVVVSPYRICPLGAHIDHQGGVVSAVTVNKGILL 64 Query: 360 GFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWGNYAR 539 GFI SGD QVLL+SGQF+GEVRFR+DEIQHP H + K S EE +WG YAR Sbjct: 65 GFIPSGDSQVLLQSGQFKGEVRFRVDEIQHPRHSALKNDKIITNGSSKSKEECDWGRYAR 124 Query: 540 GALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSPTENI 719 GALYALQ N L+QGIIGFI +ANNL +SP ENI Sbjct: 125 GALYALQSRENHLSQGIIGFINGSEGLDSSGLSSSAATGIAYLLALENANNLTVSPMENI 184 Query: 720 ELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESKDLKA 899 E DRLIEN YLGLRNGILDQSAILLSSYGCLT MNCKTKEHKL+ P KL KNQE+ LK+ Sbjct: 185 EYDRLIENGYLGLRNGILDQSAILLSSYGCLTFMNCKTKEHKLVRP-KLLKNQEADMLKS 243 Query: 900 YKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 +KILLA SGLK ALT NPGYN RV EC EAAR+LL ASGN +L Sbjct: 244 FKILLALSGLKHALTNNPGYNNRVAECEEAARVLLHASGNDKL 286 >ref|XP_007014469.1| Galacturonic acid kinase isoform 2 [Theobroma cacao] gi|508784832|gb|EOY32088.1| Galacturonic acid kinase isoform 2 [Theobroma cacao] Length = 437 Score = 340 bits (873), Expect = 4e-91 Identities = 177/287 (61%), Positives = 202/287 (70%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 M SWP++ ELD++R IV+EM SPYRICPLGAHIDHQGG VSAMTI+K Sbjct: 1 MAAMSWPTQDELDKIRGIVSEMAGKGTEDVRVVVSPYRICPLGAHIDHQGGIVSAMTINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF+ SG+ QV LRSGQF+GEVRFR++E Q P H + V S +E WG Sbjct: 61 GILLGFVPSGNTQVALRSGQFKGEVRFRVNETQQPRHRISKGEEIKVDKSSPSPQECYWG 120 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 YA GALYALQ GN L QGIIG+IC SANNL +SP Sbjct: 121 RYAIGALYALQSRGNHLAQGIIGYICGSEGLDSSGLSSSAAVGVAYLLALESANNLTVSP 180 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 TENIE DR+IENEYLGLRNGILDQSAILLSS+GCLT MNCKT EHKLIHPL K+ E++ Sbjct: 181 TENIEYDRVIENEYLGLRNGILDQSAILLSSHGCLTYMNCKTTEHKLIHPLNFLKDHETE 240 Query: 888 
DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 K YKILLAFSGL+QALT+NPGYN RV EC+EAA+ILL ASGNG+L Sbjct: 241 PQKGYKILLAFSGLRQALTSNPGYNSRVAECQEAAKILLHASGNGEL 287 >gb|EXB40717.1| hypothetical protein L484_007300 [Morus notabilis] Length = 432 Score = 338 bits (868), Expect = 2e-90 Identities = 177/287 (61%), Positives = 202/287 (70%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 MG SWPS+ EL+ VR+IV++M SPYRICPLGAHIDHQGGTVSAMTI+K Sbjct: 1 MGGFSWPSQSELNEVREIVSKMAGRGTEEVRVVASPYRICPLGAHIDHQGGTVSAMTINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF+ SGD QV+LRSGQF+GEVRF +DE Q H + K D S I +E NWG Sbjct: 61 GILLGFVPSGDSQVVLRSGQFKGEVRFSVDEAQDSGHANAMNNKIDANDSSKIRDECNWG 120 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 NY RGALYALQR GN L+QG+IG IC +ANNL+++P Sbjct: 121 NYPRGALYALQRKGNHLSQGLIGHICGSEGLDCSGLSSSAAVGVACLLALENANNLMVTP 180 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 ENIE DRLIENEYLGL+NGILDQSAILLS YGCL CMNCKTKEHKLI KN+ + Sbjct: 181 EENIEYDRLIENEYLGLKNGILDQSAILLSKYGCLLCMNCKTKEHKLI------KNENIE 234 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 AYKILLAFSGLK ALT NPGYNRRV+EC+EAARIL ASG G++ Sbjct: 235 PHTAYKILLAFSGLKHALTNNPGYNRRVSECQEAARILTHASGVGKV 281 >ref|XP_004242871.1| PREDICTED: galacturonokinase-like isoform 1 [Solanum lycopersicum] Length = 460 Score = 338 bits (868), Expect = 2e-90 Identities = 183/305 (60%), Positives = 207/305 (67%), Gaps = 18/305 (5%) Frame = +3 Query: 168 MGVSS--WPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTI 341 MGV S WPSE ELD++R VAE+ SPYRICPLGAHIDHQGGTVSAMTI Sbjct: 1 MGVFSGHWPSESELDKIRNKVAELSGRDAQEVMVVVSPYRICPLGAHIDHQGGTVSAMTI 60 Query: 342 DKGILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENN 521 +KGILLGF+ S D QV L+SGQF+GEVR RIDE+Q P H+ GT+G +D S EE Sbjct: 61 NKGILLGFVPSDDTQVTLQSGQFEGEVRLRIDEVQLPKHMYGTNGLTEQMDSSPPQEEWK 120 Query: 522 
WGNYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLIL 701 WGNYARGA+YALQ GN L GI GFIC SAN L++ Sbjct: 121 WGNYARGAIYALQSKGNHLKTGITGFICGSEGLDSSGLSSSAAVGVAYLLAFESANGLVV 180 Query: 702 SPTENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCK----------------T 833 SPTENIE DRLIENEYLGL+NGILDQSAILLSSYGCLT MNCK T Sbjct: 181 SPTENIEYDRLIENEYLGLKNGILDQSAILLSSYGCLTFMNCKILCSLENSDNYRPHVQT 240 Query: 834 KEHKLIHPLKLQKNQESKDLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDAS 1013 +HKLIHP ++ N E + AYKILLAFSGLKQALTTNPGYNRRV EC+EAA+ILL AS Sbjct: 241 IKHKLIHPPTVENNHEGEFGNAYKILLAFSGLKQALTTNPGYNRRVAECQEAAKILLQAS 300 Query: 1014 GNGQL 1028 G+ ++ Sbjct: 301 GDEEM 305 >ref|XP_004299776.1| PREDICTED: galacturonokinase-like [Fragaria vesca subsp. vesca] Length = 429 Score = 334 bits (856), Expect = 4e-89 Identities = 175/285 (61%), Positives = 203/285 (71%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 MG SSWPSE +L+ +R+IV+EM SPYRICPLGAHIDHQGG VSAMTI++ Sbjct: 1 MGGSSWPSETQLNEIREIVSEMSGRGREQVRVVASPYRICPLGAHIDHQGGIVSAMTINR 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF+ SGD QV+LRSGQF+G VRFRIDE+ P + + K EE++WG Sbjct: 61 GILLGFVPSGDNQVILRSGQFKGRVRFRIDEVSCPMNNGNDASKRD--------EESDWG 112 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 +YARGA+YALQ L QGIIG+IC +ANNLI+SP Sbjct: 113 SYARGAVYALQSKKTCLVQGIIGYICGTEGMDSSGVSSSAAVGVAYLMALENANNLIVSP 172 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 ENIE DRLIENE+ GLRNGILDQSAILLSSYG L CMNCKTKEHKL+HP KL KN E++ Sbjct: 173 EENIEFDRLIENEFRGLRNGILDQSAILLSSYGSLLCMNCKTKEHKLVHPPKLGKNHETE 232 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNG 1022 +AYKILLAFSGLKQALT N GYNRRV EC+EAA +LL+ASGNG Sbjct: 233 WQEAYKILLAFSGLKQALTENSGYNRRVGECQEAATVLLNASGNG 277 >gb|AEY11272.1| GALK [Morus alba var. 
multicaulis] Length = 431 Score = 333 bits (854), Expect = 7e-89 Identities = 174/287 (60%), Positives = 201/287 (70%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 MG SWPS+ EL+ VR+IV++M SPYRICPLGAHIDHQGGTVSAMTI+K Sbjct: 1 MGGFSWPSQSELNEVREIVSKMAGRGTEEVRVVASPYRICPLGAHIDHQGGTVSAMTINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF+ SGD QV+LRSGQF+GEVRF +DE Q H + K D S I +E NWG Sbjct: 61 GILLGFVPSGDSQVVLRSGQFKGEVRFSVDEAQDSGHANAMNNKIDANDSSKIRDECNWG 120 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 NY RGALYALQR GN L+QG+IG+IC +ANNL+++P Sbjct: 121 NYPRGALYALQRKGNHLSQGLIGYICGSEGLDCSGLSSSAAVGVACLLALENANNLMVTP 180 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 ENIE DRLIENEYLGL+NGILDQSA+LLS YG L CMNCKTKEHKLI KN+ + Sbjct: 181 EENIEYDRLIENEYLGLKNGILDQSAVLLSKYGYLLCMNCKTKEHKLI------KNENIE 234 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 AYKILLAFSGLK ALT NPGYN RV+EC+EAARIL ASG G++ Sbjct: 235 PHTAYKILLAFSGLKHALTNNPGYNHRVSECQEAARILSHASGIGKV 281 >ref|XP_007014468.1| Galacturonic acid kinase isoform 1 [Theobroma cacao] gi|508784831|gb|EOY32087.1| Galacturonic acid kinase isoform 1 [Theobroma cacao] Length = 447 Score = 332 bits (852), Expect = 1e-88 Identities = 177/297 (59%), Positives = 202/297 (68%), Gaps = 10/297 (3%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 M SWP++ ELD++R IV+EM SPYRICPLGAHIDHQGG VSAMTI+K Sbjct: 1 MAAMSWPTQDELDKIRGIVSEMAGKGTEDVRVVVSPYRICPLGAHIDHQGGIVSAMTINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF+ SG+ QV LRSGQF+GEVRFR++E Q P H + V S +E WG Sbjct: 61 GILLGFVPSGNTQVALRSGQFKGEVRFRVNETQQPRHRISKGEEIKVDKSSPSPQECYWG 120 Query: 528 NYARGALYALQRGGNQLTQ----------GIIGFICXXXXXXXXXXXXXXXXXXXXXXXX 677 YA GALYALQ GN L Q GIIG+IC Sbjct: 121 RYAIGALYALQSRGNHLAQVFNKFHSYLQGIIGYICGSEGLDSSGLSSSAAVGVAYLLAL 180 Query: 678 
XSANNLILSPTENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHP 857 SANNL +SPTENIE DR+IENEYLGLRNGILDQSAILLSS+GCLT MNCKT EHKLIHP Sbjct: 181 ESANNLTVSPTENIEYDRVIENEYLGLRNGILDQSAILLSSHGCLTYMNCKTTEHKLIHP 240 Query: 858 LKLQKNQESKDLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 L K+ E++ K YKILLAFSGL+QALT+NPGYN RV EC+EAA+ILL ASGNG+L Sbjct: 241 LNFLKDHETEPQKGYKILLAFSGLRQALTSNPGYNSRVAECQEAAKILLHASGNGEL 297 >ref|XP_006850937.1| hypothetical protein AMTR_s00025p00189780 [Amborella trichopoda] gi|548854608|gb|ERN12518.1| hypothetical protein AMTR_s00025p00189780 [Amborella trichopoda] Length = 431 Score = 323 bits (829), Expect = 5e-86 Identities = 175/287 (60%), Positives = 196/287 (68%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 MG +WPSE E DRVRK V SPYRICPLGAHIDHQGGTVSAMTI++ Sbjct: 1 MGTFTWPSEQEFDRVRKAVVATSGCDEGDVRVAVSPYRICPLGAHIDHQGGTVSAMTINR 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF+ SGD +VLL+S QF GEVRFRIDEI+ P +L K HV EE WG Sbjct: 61 GILLGFVPSGDSKVLLQSAQFAGEVRFRIDEIKSPRYLVDL--KNHVKSD----EECGWG 114 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 NYARGALYALQ GG L QGIIG+IC SANN+ +SP Sbjct: 115 NYARGALYALQAGGKHLHQGIIGYICGSEGLDSSGLSSSAAVGIAYLLAFESANNISVSP 174 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 +NIELDRLIEN YLGL+NGILDQSAILLS+YGCLTC+NCKTK++ LI + QE K Sbjct: 175 IDNIELDRLIENGYLGLKNGILDQSAILLSNYGCLTCINCKTKDYTLIKHPQWHGGQEIK 234 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 K +KILLAFSGLK ALT+ PGYN RV ECREAARILL +SGNG L Sbjct: 235 RSKPFKILLAFSGLKHALTSKPGYNSRVAECREAARILLSSSGNGSL 281 >ref|XP_002514384.1| galactokinase, putative [Ricinus communis] gi|223546481|gb|EEF47980.1| galactokinase, putative [Ricinus communis] Length = 431 Score = 322 bits (826), Expect = 1e-85 Identities = 170/285 (59%), Positives = 195/285 (68%), Gaps = 1/285 (0%) Frame = +3 Query: 177 
SSWPSEGELDRVRKIVAEMCXXXXXXXXXXX-SPYRICPLGAHIDHQGGTVSAMTIDKGI 353 S WPSE EL+ +R++V+ M SPYRICPLGAHIDHQGG VSAMTI+KG+ Sbjct: 6 SCWPSEDELNEIREVVSAMSSGTSPEQVRVVVSPYRICPLGAHIDHQGGIVSAMTINKGV 65 Query: 354 LLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWGNY 533 LLGF+ SGDPQV+LRS QF+GEVRF +DE+Q+ + G + D + E++NWGN+ Sbjct: 66 LLGFVPSGDPQVILRSAQFRGEVRFSVDEVQYSRPI-GKKDENRATDSQKVREDSNWGNF 124 Query: 534 ARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSPTE 713 ARGALYALQ GN + QGI G+I SANNL PT Sbjct: 125 ARGALYALQSRGNSIIQGITGYISGSEDFDRSGLSSSAAVGVAYLLALESANNLTFPPTV 184 Query: 714 NIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESKDL 893 NIE DR+IENEYLGLRNGILDQSAILLSS+GCLTCMNCKTKEHKLIHP KL L Sbjct: 185 NIEYDRIIENEYLGLRNGILDQSAILLSSHGCLTCMNCKTKEHKLIHPSKL--------L 236 Query: 894 KAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 K YKIL+AFSGLK ALT NPGYN RV EC+EAAR LL ASGN L Sbjct: 237 KPYKILVAFSGLKDALTNNPGYNSRVAECQEAARFLLKASGNDNL 281 >gb|EYU25741.1| hypothetical protein MIMGU_mgv1a006821mg [Mimulus guttatus] Length = 430 Score = 321 bits (823), Expect = 3e-85 Identities = 174/284 (61%), Positives = 193/284 (67%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 MG WPSE E++ +RK VAEMC SPYRICPLGAHIDHQGG VSAMTI+K Sbjct: 1 MGAMCWPSESEINVIRKKVAEMCGRNSEEVKIVASPYRICPLGAHIDHQGGIVSAMTINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF+ S D QV+++SGQF+GEVRFR+DE Q P + V EE WG Sbjct: 61 GILLGFVPSYDSQVIIQSGQFEGEVRFRVDEEQLPKINDSKEYSSSV-------EECRWG 113 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 NYARGALYAL++ GN L QGIIGFI SAN L +SP Sbjct: 114 NYARGALYALRKKGNNLNQGIIGFISGAEGLDSSGLSSSAAVGVAYLLAFESANKLTISP 173 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 TENIE DRLIENEYLGL+NGILDQSAILLSSYGCLT MNCKTKEHKL+ K + Sbjct: 174 TENIEYDRLIENEYLGLKNGILDQSAILLSSYGCLTRMNCKTKEHKLVQCPKSSGIHNKE 233 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGN 1019 K 
+KILLAFSGLKQAL TNPGYN RV+ECREAARILL ASGN Sbjct: 234 TDKRFKILLAFSGLKQALITNPGYNSRVSECREAARILLKASGN 277 >ref|XP_004149677.1| PREDICTED: galacturonokinase-like [Cucumis sativus] gi|449507367|ref|XP_004163011.1| PREDICTED: galacturonokinase-like [Cucumis sativus] Length = 437 Score = 318 bits (816), Expect = 2e-84 Identities = 165/287 (57%), Positives = 199/287 (69%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 MG SWPSE EL+ ++ IV+EM SPYRICPLGAHIDHQGG VSAM I+K Sbjct: 1 MGKPSWPSEEELNGIKTIVSEMSKRSKEDVRVVVSPYRICPLGAHIDHQGGNVSAMAINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 G+LLGF+ SGD QV+LRS QF+G+V FR+DE +P H + + + + E+NNWG Sbjct: 61 GVLLGFVPSGDVQVVLRSAQFKGDVNFRVDEKLYPNHCSNKKEGTNENGHAKLQEDNNWG 120 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 YARGA+YALQ + L+QGIIG+I +ANNL +SP Sbjct: 121 RYARGAVYALQEKEHCLSQGIIGYIYGSDGLDSSGLSSSAAVGLAYLLALENANNLTISP 180 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 TENIE DRLIEN YLGLRNGILDQSAILLSSYGCL MNCKTK+ KLI PL ++ + +S+ Sbjct: 181 TENIEYDRLIENGYLGLRNGILDQSAILLSSYGCLLHMNCKTKDFKLIRPLDMESSLKSE 240 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 K Y+ILLAFSGLKQALT NPGYN RV EC+EAA+ILL+ASGN + Sbjct: 241 KQKEYQILLAFSGLKQALTNNPGYNHRVAECQEAAKILLNASGNSHM 287 >ref|XP_006453202.1| hypothetical protein CICLE_v10008332mg [Citrus clementina] gi|568840713|ref|XP_006474310.1| PREDICTED: galacturonokinase-like isoform X1 [Citrus sinensis] gi|557556428|gb|ESR66442.1| hypothetical protein CICLE_v10008332mg [Citrus clementina] Length = 437 Score = 318 bits (815), Expect = 2e-84 Identities = 166/287 (57%), Positives = 198/287 (68%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 MG SWP+E +L +R V+EM SPYRICPLGAHIDHQGGTVSAMTI+K Sbjct: 1 MGEFSWPTEDQLKEMRNKVSEMSGRDAEEVRVVVSPYRICPLGAHIDHQGGTVSAMTINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 
GILLGF+ SGD +V+LRSGQF GEVRFRIDEIQ P + + D + I EE WG Sbjct: 61 GILLGFVPSGDTEVVLRSGQFDGEVRFRIDEIQQPTNSVKKHHAVYASDSAKIKEECKWG 120 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 NYARGALYALQ GN LT+GIIG+IC SAN++ ++P Sbjct: 121 NYARGALYALQSRGNILTEGIIGYICGSDNLDSSGLSSSAAVGIAYLLALESANDMNVTP 180 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 +NIE DRLIEN YLGLRNGILDQSAILLS YGCL CM+CK+KE+++I P + Q E++ Sbjct: 181 LDNIEYDRLIENGYLGLRNGILDQSAILLSRYGCLMCMDCKSKEYEIIQPREPQNGGETE 240 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 K+YKILLAFSGL+ ALT NPGYN RV EC+EAA+ LL ASG ++ Sbjct: 241 FKKSYKILLAFSGLRCALTNNPGYNCRVAECQEAAKFLLCASGKAEM 287 >ref|XP_006381135.1| hypothetical protein POPTR_0006s06980g [Populus trichocarpa] gi|550335670|gb|ERP58932.1| hypothetical protein POPTR_0006s06980g [Populus trichocarpa] Length = 401 Score = 314 bits (805), Expect = 3e-83 Identities = 168/283 (59%), Positives = 196/283 (69%) Frame = +3 Query: 180 SWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDKGILL 359 SWP E EL+ +++IV+ M SPYRICPLGAHIDHQGGTVSAMTI+KGILL Sbjct: 5 SWPIENELNEIKEIVSAMAGRGPEEVRVVVSPYRICPLGAHIDHQGGTVSAMTINKGILL 64 Query: 360 GFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWGNYAR 539 GFI S D +V+LRSGQF+GEVRF +DE+Q P + G++H D + E NWGN+AR Sbjct: 65 GFIPSDDTEVILRSGQFKGEVRFSVDEVQQPRPIR-KKGESHATDSPKLQEAGNWGNFAR 123 Query: 540 GALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSPTENI 719 GA+YALQ G LTQGI G+IC +ANNL ++PTENI Sbjct: 124 GAVYALQSRGISLTQGITGYICGSEGLDSSGLSSSAAAGVAYLLAFETANNLTMTPTENI 183 Query: 720 ELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESKDLKA 899 E DRLIENEYLGL+NGILDQSAILLSS+G LT MNCKTKEHKL+ P Q N + K+ Sbjct: 184 EYDRLIENEYLGLKNGILDQSAILLSSHGFLTHMNCKTKEHKLV-PSPKQSNFQ----KS 238 Query: 900 YKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 YKILLAFSGL+ ALT +PGYN RV EC EAARILL ASGN L Sbjct: 239 YKILLAFSGLRNALTNSPGYNLRVVECHEAARILLKASGNDNL 281 >ref|XP_003525312.1| PREDICTED: galacturonokinase-like 
isoform X1 [Glycine max] gi|571456834|ref|XP_006580491.1| PREDICTED: galacturonokinase-like isoform X2 [Glycine max] Length = 431 Score = 312 bits (799), Expect = 2e-82 Identities = 161/284 (56%), Positives = 201/284 (70%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 M WPS+ EL+ +R+ V+++ SPYRICPLGAHIDHQGGTV+AMTI+K Sbjct: 1 MASRCWPSDAELNELRERVSKIVDLNKEEVRVVVSPYRICPLGAHIDHQGGTVAAMTINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF SG QV++RSGQF+GEV+FR+DEIQ P K+ D S + E+ NWG Sbjct: 61 GILLGFAPSGSNQVVIRSGQFEGEVKFRVDEIQQP------KDKSLDKDSSELQEQCNWG 114 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 YARGA+YALQ GN L++GIIG+IC AN+L++SP Sbjct: 115 RYARGAVYALQSRGNNLSKGIIGYICGSEGLDSSGLSSSAAVGVACLMALQYANDLVISP 174 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 TENI+ DRLIENEYLGL+NGI+DQSAILLSS+GCL CMNCKTK++KL++ K+ + ES Sbjct: 175 TENIDYDRLIENEYLGLKNGIMDQSAILLSSHGCLMCMNCKTKDYKLVYRPKVLEYNESG 234 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGN 1019 + KA +ILLA SGLKQAL NPGYN+RV ECREAA+ILL+ASG+ Sbjct: 235 EPKATRILLALSGLKQALMNNPGYNKRVAECREAAQILLEASGD 278 >ref|XP_003630628.1| Galactokinase [Medicago truncatula] gi|355524650|gb|AET05104.1| Galactokinase [Medicago truncatula] Length = 308 Score = 312 bits (799), Expect = 2e-82 Identities = 161/284 (56%), Positives = 198/284 (69%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 M S WPS+ EL+ +R+ V++M SPYRICPLGAHIDHQGGTV AMTI+K Sbjct: 1 MAGSCWPSDTELNEMREKVSQMAKVKKEDVRVVVSPYRICPLGAHIDHQGGTVLAMTINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF SG + ++RSGQFQGEV+FR+D+IQ P T + S E+ NWG Sbjct: 61 GILLGFTPSGSDEFVIRSGQFQGEVKFRVDDIQQPVQTTKIKNDNMAENSSEPQEQCNWG 120 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 YARGA+YALQ G+ +++GIIG+I AN+L++SP Sbjct: 121 
RYARGAVYALQNRGHNISKGIIGYIRGSDGLDSSGLSSSAAVGVAYLLALEHANDLVISP 180 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 TENIE DRLIENEYLGL+NGI+DQSAILLS +GCL CMNCKTKE+KLIH +Q ++S+ Sbjct: 181 TENIEYDRLIENEYLGLKNGIMDQSAILLSRHGCLMCMNCKTKEYKLIHRPTVQDYKKSE 240 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGN 1019 KA K+LLA SGLKQALTTNPGYNRRV EC+EAA+ILL+ASG+ Sbjct: 241 QPKAAKMLLALSGLKQALTTNPGYNRRVAECKEAAQILLEASGD 284 >ref|XP_003630627.1| Galactokinase [Medicago truncatula] gi|355524649|gb|AET05103.1| Galactokinase [Medicago truncatula] Length = 437 Score = 312 bits (799), Expect = 2e-82 Identities = 161/284 (56%), Positives = 198/284 (69%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 M S WPS+ EL+ +R+ V++M SPYRICPLGAHIDHQGGTV AMTI+K Sbjct: 1 MAGSCWPSDTELNEMREKVSQMAKVKKEDVRVVVSPYRICPLGAHIDHQGGTVLAMTINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF SG + ++RSGQFQGEV+FR+D+IQ P T + S E+ NWG Sbjct: 61 GILLGFTPSGSDEFVIRSGQFQGEVKFRVDDIQQPVQTTKIKNDNMAENSSEPQEQCNWG 120 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 YARGA+YALQ G+ +++GIIG+I AN+L++SP Sbjct: 121 RYARGAVYALQNRGHNISKGIIGYIRGSDGLDSSGLSSSAAVGVAYLLALEHANDLVISP 180 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 TENIE DRLIENEYLGL+NGI+DQSAILLS +GCL CMNCKTKE+KLIH +Q ++S+ Sbjct: 181 TENIEYDRLIENEYLGLKNGIMDQSAILLSRHGCLMCMNCKTKEYKLIHRPTVQDYKKSE 240 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGN 1019 KA K+LLA SGLKQALTTNPGYNRRV EC+EAA+ILL+ASG+ Sbjct: 241 QPKAAKMLLALSGLKQALTTNPGYNRRVAECKEAAQILLEASGD 284 >ref|XP_007014471.1| Galacturonic acid kinase isoform 4 [Theobroma cacao] gi|508784834|gb|EOY32090.1| Galacturonic acid kinase isoform 4 [Theobroma cacao] Length = 423 Score = 311 bits (796), Expect = 4e-82 Identities = 167/287 (58%), Positives = 192/287 (66%) Frame = +3 Query: 168 MGVSSWPSEGELDRVRKIVAEMCXXXXXXXXXXXSPYRICPLGAHIDHQGGTVSAMTIDK 347 M SWP++ ELD++R 
IV+EM SPYRICPLGAHIDHQGG VSAMTI+K Sbjct: 1 MAAMSWPTQDELDKIRGIVSEMAGKGTEDVRVVVSPYRICPLGAHIDHQGGIVSAMTINK 60 Query: 348 GILLGFIASGDPQVLLRSGQFQGEVRFRIDEIQHPFHLTGTSGKAHVIDPSIIHEENNWG 527 GILLGF+ SG+ QV LRSGQF+GEVRFR++E Q P H + V S +E WG Sbjct: 61 GILLGFVPSGNTQVALRSGQFKGEVRFRVNETQQPRHRISKGEEIKVDKSSPSPQECYWG 120 Query: 528 NYARGALYALQRGGNQLTQGIIGFICXXXXXXXXXXXXXXXXXXXXXXXXXSANNLILSP 707 YA GALYALQ GN L QGIIG+IC SANNL +SP Sbjct: 121 RYAIGALYALQSRGNHLAQGIIGYICGSEGLDSSGLSSSAAVGVAYLLALESANNLTVSP 180 Query: 708 TENIELDRLIENEYLGLRNGILDQSAILLSSYGCLTCMNCKTKEHKLIHPLKLQKNQESK 887 TENIE DR+IENEYLGLRNGILDQSAILLSS+GCLT MNC K+ E++ Sbjct: 181 TENIEYDRVIENEYLGLRNGILDQSAILLSSHGCLTYMNC--------------KDHETE 226 Query: 888 DLKAYKILLAFSGLKQALTTNPGYNRRVNECREAARILLDASGNGQL 1028 K YKILLAFSGL+QALT+NPGYN RV EC+EAA+ILL ASGNG+L Sbjct: 227 PQKGYKILLAFSGLRQALTSNPGYNSRVAECQEAAKILLHASGNGEL 273