BLASTX nr result
ID: Angelica27_contig00010163
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00010163
         (2818 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

XP_017230532.1 PREDICTED: hydroxyproline O-galactosyltransferase...  1233   0.0
KZN11226.1 hypothetical protein DCAR_003882 [Daucus carota subsp...  1221   0.0
XP_017235846.1 PREDICTED: hydroxyproline O-galactosyltransferase...  1151   0.0
XP_017235847.1 PREDICTED: hydroxyproline O-galactosyltransferase...  1135   0.0
KZN07032.1 hypothetical protein DCAR_007869 [Daucus carota subsp...   996   0.0
XP_007213608.1 hypothetical protein PRUPE_ppa002345mg [Prunus pe...   988   0.0
XP_017978397.1 PREDICTED: hydroxyproline O-galactosyltransferase...   986   0.0
XP_008225535.1 PREDICTED: hydroxyproline O-galactosyltransferase...   986   0.0
XP_012454305.1 PREDICTED: probable beta-1,3-galactosyltransferas...   982   0.0
XP_017649307.1 PREDICTED: hydroxyproline O-galactosyltransferase...   980   0.0
XP_016701253.1 PREDICTED: hydroxyproline O-galactosyltransferase...   979   0.0
XP_016725906.1 PREDICTED: hydroxyproline O-galactosyltransferase...   978   0.0
XP_016701254.1 PREDICTED: hydroxyproline O-galactosyltransferase...   978   0.0
XP_012072180.1 PREDICTED: probable beta-1,3-galactosyltransferas...   977   0.0
KHG03949.1 putative beta-1,3-galactosyltransferase 20 -like prot...   977   0.0
XP_018847685.1 PREDICTED: hydroxyproline O-galactosyltransferase...   976   0.0
XP_009362277.1 PREDICTED: hydroxyproline O-galactosyltransferase...   976   0.0
XP_009360949.1 PREDICTED: hydroxyproline O-galactosyltransferase...   974   0.0
XP_002531052.1 PREDICTED: probable beta-1,3-galactosyltransferas...
974 0.0 OMO65814.1 hypothetical protein COLO4_31004 [Corchorus olitorius] 971 0.0 >XP_017230532.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2-like [Daucus carota subsp. sativus] Length = 679 Score = 1233 bits (3189), Expect = 0.0 Identities = 599/680 (88%), Positives = 627/680 (92%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKRSKSE +RFKL++ LFGIAALYL+FICYKFP+FF++ GVLSGDDG+DKLGSSGF+ Sbjct: 1 MKRSKSEQLLGRRFKLTHLLFGIAALYLIFICYKFPEFFKTGGVLSGDDGYDKLGSSGFM 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYGR 792 D NGDI+SKP LRSVSEETIHRRLD+ENEVVPFI+PGK L DK GLPP+K FKN YGR Sbjct: 61 DV-NGDIISKPKLRSVSEETIHRRLDDENEVVPFIMPGKALIDKNKGLPPMKPFKNHYGR 119 Query: 793 IASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPSW 972 IAS+ILRQMN+TNDLSVLERMADEAWTLGLKAWEEVK+Y GKE DQNSILEVKQESCPSW Sbjct: 120 IASEILRQMNKTNDLSVLERMADEAWTLGLKAWEEVKHYDGKEADQNSILEVKQESCPSW 179 Query: 973 VSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQFM 1152 VSSSGKEL KGDQ+MFLPCGLAAGSSITVVGTPKYAHHEY+PTQAK+R +SLILVSQFM Sbjct: 180 VSSSGKELAKGDQIMFLPCGLAAGSSITVVGTPKYAHHEYLPTQAKVRNGNSLILVSQFM 239 Query: 1153 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 1332 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD Sbjct: 240 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 299 Query: 1333 MRVDGYLRCEKWSRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRAGL 1512 M VDGYLRCEKWSRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRAGL Sbjct: 300 MLVDGYLRCEKWSRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRAGL 359 Query: 1513 DGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVLEM 1692 DGFH+ISGGRH+TSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVLEM Sbjct: 360 DGFHIISGGRHITSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVLEM 419 Query: 1693 SPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNPRK 1872 SPKWK+QPLP+GPIPLFIGVLSATNHFAERMAVRKSWM RFFVALNPRK Sbjct: 420 SPKWKSQPLPNGPIPLFIGVLSATNHFAERMAVRKSWMQASAIKSSIVAVRFFVALNPRK 479 
Query: 1873 EVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIRVE 2052 EVNAVLKQEAAYFGD VILPFLDRYELVVLKTIAICEYGVQNA+AAHIMK DDDTFIRV+ Sbjct: 480 EVNAVLKQEAAYFGDIVILPFLDRYELVVLKTIAICEYGVQNATAAHIMKGDDDTFIRVD 539 Query: 2053 TVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKDIA 2232 TVLKELDKV+ KSLYMGNLNLLHRPLRSGKWAVS ANGPGYIIS+DIA Sbjct: 540 TVLKELDKVSVKKSLYMGNLNLLHRPLRSGKWAVSYEEYPEEVYPPYANGPGYIISEDIA 599 Query: 2233 KYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQSP 2412 KYIVSQHV Q LKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQSP Sbjct: 600 KYIVSQHVNQKLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQSP 659 Query: 2413 RQMICLWDKLVRGQARCCNF 2472 RQMICLWDKL RGQA CCNF Sbjct: 660 RQMICLWDKLARGQAGCCNF 679 >KZN11226.1 hypothetical protein DCAR_003882 [Daucus carota subsp. sativus] Length = 699 Score = 1221 bits (3158), Expect = 0.0 Identities = 599/700 (85%), Positives = 627/700 (89%), Gaps = 20/700 (2%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKRSKSE +RFKL++ LFGIAALYL+FICYKFP+FF++ GVLSGDDG+DKLGSSGF+ Sbjct: 1 MKRSKSEQLLGRRFKLTHLLFGIAALYLIFICYKFPEFFKTGGVLSGDDGYDKLGSSGFM 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYGR 792 D NGDI+SKP LRSVSEETIHRRLD+ENEVVPFI+PGK L DK GLPP+K FKN YGR Sbjct: 61 DV-NGDIISKPKLRSVSEETIHRRLDDENEVVPFIMPGKALIDKNKGLPPMKPFKNHYGR 119 Query: 793 IASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPSW 972 IAS+ILRQMN+TNDLSVLERMADEAWTLGLKAWEEVK+Y GKE DQNSILEVKQESCPSW Sbjct: 120 IASEILRQMNKTNDLSVLERMADEAWTLGLKAWEEVKHYDGKEADQNSILEVKQESCPSW 179 Query: 973 VSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQFM 1152 VSSSGKEL KGDQ+MFLPCGLAAGSSITVVGTPKYAHHEY+PTQAK+R +SLILVSQFM Sbjct: 180 VSSSGKELAKGDQIMFLPCGLAAGSSITVVGTPKYAHHEYLPTQAKVRNGNSLILVSQFM 239 Query: 1153 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 1332 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD Sbjct: 240 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 299 
Query: 1333 MRVDGYLRCEKWSRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRAGL 1512 M VDGYLRCEKWSRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRAGL Sbjct: 300 MLVDGYLRCEKWSRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRAGL 359 Query: 1513 DGFHVISGGRHVTSFPYRT--------------------GFTLEDATGLAIKGDVDVHSV 1632 DGFH+ISGGRH+TSFPYRT GFTLEDATGLAIKGDVDVHSV Sbjct: 360 DGFHIISGGRHITSFPYRTGFTLEDATGLAIKGDVDVHSGFTLEDATGLAIKGDVDVHSV 419 Query: 1633 FATSLPTSHPSFSPQRVLEMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXX 1812 FATSLPTSHPSFSPQRVLEMSPKWK+QPLP+GPIPLFIGVLSATNHFAERMAVRKSWM Sbjct: 420 FATSLPTSHPSFSPQRVLEMSPKWKSQPLPNGPIPLFIGVLSATNHFAERMAVRKSWMQA 479 Query: 1813 XXXXXXXXXXRFFVALNPRKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGV 1992 RFFVALNPRKEVNAVLKQEAAYFGD VILPFLDRYELVVLKTIAICEYGV Sbjct: 480 SAIKSSIVAVRFFVALNPRKEVNAVLKQEAAYFGDIVILPFLDRYELVVLKTIAICEYGV 539 Query: 1993 QNASAAHIMKSDDDTFIRVETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXX 2172 QNA+AAHIMK DDDTFIRV+TVLKELDKV+ KSLYMGNLNLLHRPLRSGKWAVS Sbjct: 540 QNATAAHIMKGDDDTFIRVDTVLKELDKVSVKKSLYMGNLNLLHRPLRSGKWAVSYEEYP 599 Query: 2173 XXXXXXXANGPGYIISKDIAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHS 2352 ANGPGYIIS+DIAKYIVSQHV Q LKLFKMEDVSMGMWVEQFNSSRPVEYSHS Sbjct: 600 EEVYPPYANGPGYIISEDIAKYIVSQHVNQKLKLFKMEDVSMGMWVEQFNSSRPVEYSHS 659 Query: 2353 WKFCQYGCVENYFTAHYQSPRQMICLWDKLVRGQARCCNF 2472 WKFCQYGCVENYFTAHYQSPRQMICLWDKL RGQA CCNF Sbjct: 660 WKFCQYGCVENYFTAHYQSPRQMICLWDKLARGQAGCCNF 699 >XP_017235846.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2-like isoform X1 [Daucus carota subsp. 
sativus] Length = 680 Score = 1151 bits (2977), Expect = 0.0 Identities = 550/680 (80%), Positives = 606/680 (89%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR KSE GV+R +LS+ L G+A+LYLL+ICYKFP FF+SVG+LSGDD +++LGSS FV Sbjct: 1 MKRLKSEQLGVRRLRLSHLLLGMASLYLLYICYKFPGFFKSVGMLSGDDSYNRLGSSDFV 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYGR 792 + NGDIL+KP+LRSVS++TIHRRLDNEN+ VP I+PGKV++D NG+PP+K FK+ YGR Sbjct: 61 NVNNGDILTKPLLRSVSKDTIHRRLDNENDFVPLIVPGKVVEDNSNGVPPMKPFKSHYGR 120 Query: 793 IASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPSW 972 I D LRQMNRTNDLSVLE MADEAWTLGLKAWEEV NY+GKE ++SILE ++E+CPSW Sbjct: 121 ITGDTLRQMNRTNDLSVLENMADEAWTLGLKAWEEVDNYNGKEFGESSILEGRKETCPSW 180 Query: 973 VSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQFM 1152 VS+S +EL GDQVMFLPCGLAAGSSITVVGTPKYAH+EY+P QA +RTA S ILVSQFM Sbjct: 181 VSTSEEELANGDQVMFLPCGLAAGSSITVVGTPKYAHNEYIPRQANVRTADSFILVSQFM 240 Query: 1153 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 1332 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWG GQRCDGLPSK DDD Sbjct: 241 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGTGQRCDGLPSKSDDD 300 Query: 1333 MRVDGYLRCEKWSRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRAGL 1512 M VDGYLRCEKW RNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPF EGKMFVLTLRAGL Sbjct: 301 MLVDGYLRCEKWMRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFVEGKMFVLTLRAGL 360 Query: 1513 DGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVLEM 1692 DGFHVI+GGRHVTSFPYRTG TLE+ATGLAIKGDVD+HSVFATSLPTSHPSFSPQRVL+M Sbjct: 361 DGFHVITGGRHVTSFPYRTGLTLEEATGLAIKGDVDIHSVFATSLPTSHPSFSPQRVLDM 420 Query: 1693 SPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNPRK 1872 S KWK+QPL +GP LFIGVLSATNHFAERMAVRK+WM RFFVALNPRK Sbjct: 421 SEKWKSQPLLNGPTQLFIGVLSATNHFAERMAVRKTWMQASAIKSSIVVVRFFVALNPRK 480 Query: 1873 EVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIRVE 2052 EVNAVLKQEAAYFGD VILPF+DRYELVVLKTIAICE+GVQNA+AA+IMK DDDTFIRV+ Sbjct: 481 
EVNAVLKQEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNATAAYIMKCDDDTFIRVD 540 Query: 2053 TVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKDIA 2232 TV+KELDKV+ + LYMGNLNLLHRPLRSGKWAVS ANGPGYIISK+IA Sbjct: 541 TVMKELDKVSGKRPLYMGNLNLLHRPLRSGKWAVSYEEYPQEVYPPYANGPGYIISKEIA 600 Query: 2233 KYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQSP 2412 KYI+S+HV +LKLFKMEDVSMGMWVEQFNSS PV+YSHSWKFCQYGC+ENY+TAHYQSP Sbjct: 601 KYIISKHVDGDLKLFKMEDVSMGMWVEQFNSSTPVQYSHSWKFCQYGCLENYYTAHYQSP 660 Query: 2413 RQMICLWDKLVRGQARCCNF 2472 RQMICLWDKLVRGQARCCNF Sbjct: 661 RQMICLWDKLVRGQARCCNF 680 >XP_017235847.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2-like isoform X2 [Daucus carota subsp. sativus] Length = 675 Score = 1135 bits (2936), Expect = 0.0 Identities = 545/680 (80%), Positives = 601/680 (88%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR KSE GV+R +LS+ L G+A+LYLL+ICYKFP FF+SVG+LSGDD +++LGSS FV Sbjct: 1 MKRLKSEQLGVRRLRLSHLLLGMASLYLLYICYKFPGFFKSVGMLSGDDSYNRLGSSDFV 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYGR 792 + NGDIL+KP+LRSVS++TIHRRLDNEN+ VP I+PGKV++D NG+PP+K FK+ YGR Sbjct: 61 NVNNGDILTKPLLRSVSKDTIHRRLDNENDFVPLIVPGKVVEDNSNGVPPMKPFKSHYGR 120 Query: 793 IASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPSW 972 I D LRQMNRTNDLSVLE MADEAWTLGLKAWEEV NY+GKE ++SILE ++E+CPSW Sbjct: 121 ITGDTLRQMNRTNDLSVLENMADEAWTLGLKAWEEVDNYNGKEFGESSILEGRKETCPSW 180 Query: 973 VSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQFM 1152 VS+S +EL GDQVMFLPCGLAAGSSITVVGTPKYAH+EY+P QA +RTA S ILVSQFM Sbjct: 181 VSTSEEELANGDQVMFLPCGLAAGSSITVVGTPKYAHNEYIPRQANVRTADSFILVSQFM 240 Query: 1153 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 1332 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWG GQRCDGLPSK DDD Sbjct: 241 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGTGQRCDGLPSKSDDD 300 Query: 1333 MRVDGYLRCEKWSRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRAGL 1512 M VDGYLRCEKW RNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPF 
EGKMFVLTLRAGL Sbjct: 301 MLVDGYLRCEKWMRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFVEGKMFVLTLRAGL 360 Query: 1513 DGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVLEM 1692 DGFHVI+GGRHVTSFPYRTG TLE+ATGLAIKGDVD+HSVFATSLPTSHPSFSPQRVL+M Sbjct: 361 DGFHVITGGRHVTSFPYRTGLTLEEATGLAIKGDVDIHSVFATSLPTSHPSFSPQRVLDM 420 Query: 1693 SPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNPRK 1872 S KWK+QPL +GP LFIGVLSATNHFAERMAVRK+WM RFFVALNPRK Sbjct: 421 SEKWKSQPLLNGPTQLFIGVLSATNHFAERMAVRKTWMQASAIKSSIVVVRFFVALNPRK 480 Query: 1873 EVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIRVE 2052 EVNAVLKQEAAYFGD VILPF+DRYELVVLKTIAICE+GVQNA+AA+IMK DDDTFIRV+ Sbjct: 481 EVNAVLKQEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNATAAYIMKCDDDTFIRVD 540 Query: 2053 TVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKDIA 2232 TV+KELDKV+ + LYMGNLNLLHRPLRSGKWAVS ANGPGYIISK+IA Sbjct: 541 TVMKELDKVSGKRPLYMGNLNLLHRPLRSGKWAVSYEEYPQEVYPPYANGPGYIISKEIA 600 Query: 2233 KYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQSP 2412 KYI+S+HV +LK DVSMGMWVEQFNSS PV+YSHSWKFCQYGC+ENY+TAHYQSP Sbjct: 601 KYIISKHVDGDLK-----DVSMGMWVEQFNSSTPVQYSHSWKFCQYGCLENYYTAHYQSP 655 Query: 2413 RQMICLWDKLVRGQARCCNF 2472 RQMICLWDKLVRGQARCCNF Sbjct: 656 RQMICLWDKLVRGQARCCNF 675 >KZN07032.1 hypothetical protein DCAR_007869 [Daucus carota subsp. 
sativus] Length = 1244 Score = 996 bits (2576), Expect = 0.0 Identities = 488/636 (76%), Positives = 541/636 (85%), Gaps = 22/636 (3%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR KSE GV+R +LS+ L G+A+LYLL+ICYKFP FF+SVG+LSGDD +++LGSS FV Sbjct: 1 MKRLKSEQLGVRRLRLSHLLLGMASLYLLYICYKFPGFFKSVGMLSGDDSYNRLGSSDFV 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYGR 792 + NGDIL+KP+LRSVS++TIHRRLDNEN+ VP I+PGKV++D NG+PP+K FK+ YGR Sbjct: 61 NVNNGDILTKPLLRSVSKDTIHRRLDNENDFVPLIVPGKVVEDNSNGVPPMKPFKSHYGR 120 Query: 793 IASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPSW 972 I D LRQMNRTNDLSVLE MADEAWTLGLKAWEEV NY+GKE ++SILE ++E+CPSW Sbjct: 121 ITGDTLRQMNRTNDLSVLENMADEAWTLGLKAWEEVDNYNGKEFGESSILEGRKETCPSW 180 Query: 973 VSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQFM 1152 VS+S +EL GDQVMFLPCGLAAGSSITVVGTPKYAH+EY+P QA +RTA S ILVSQFM Sbjct: 181 VSTSEEELANGDQVMFLPCGLAAGSSITVVGTPKYAHNEYIPRQANVRTADSFILVSQFM 240 Query: 1153 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 1332 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWG GQRCDGLPSK DDD Sbjct: 241 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGTGQRCDGLPSKSDDD 300 Query: 1333 MRVDGYLRCEKWSRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRAGL 1512 M VDGYLRCEKW RNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPF EGKMFVLTLRAGL Sbjct: 301 MLVDGYLRCEKWMRNDNRDKESRTTSWFQRFIGRAKKPEVTWPFPFVEGKMFVLTLRAGL 360 Query: 1513 DGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVLEM 1692 DGFHVI+GGRHVTSFPYRTG TLE+ATGLAIKGDVD+HSVFATSLPTSHPSFSPQRVL+M Sbjct: 361 DGFHVITGGRHVTSFPYRTGLTLEEATGLAIKGDVDIHSVFATSLPTSHPSFSPQRVLDM 420 Query: 1693 SPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNPRK 1872 S KWK+QPL +GP LFIGVLSATNHFAERMAVRK+WM RFFVALNPRK Sbjct: 421 SEKWKSQPLLNGPTQLFIGVLSATNHFAERMAVRKTWMQASAIKSSIVVVRFFVALNPRK 480 Query: 1873 EVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIRVE 2052 EVNAVLKQEAAYFGD VILPF+DRYELVVLKTIAICE+GVQNA+AA+IMK DDDTFIRV+ Sbjct: 481 
EVNAVLKQEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNATAAYIMKCDDDTFIRVD 540 Query: 2053 TVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVS----------------------XXX 2166 TV+KELDKV+ + LYMGNLNLLHRPLRSGKWAVS Sbjct: 541 TVMKELDKVSGKRPLYMGNLNLLHRPLRSGKWAVSYELKNFVFSFLATVYLKNDILMKQE 600 Query: 2167 XXXXXXXXXANGPGYIISKDIAKYIVSQHVKQNLKL 2274 ANGPGYIISK+IAKYI+S+HV +LKL Sbjct: 601 YPQEVYPPYANGPGYIISKEIAKYIISKHVDGDLKL 636 Score = 59.7 bits (143), Expect(2) = 3e-11 Identities = 28/37 (75%), Positives = 34/37 (91%) Frame = +1 Query: 2194 ANGPGYIISKDIAKYIVSQHVKQNLKLFKMEDVSMGM 2304 ANGPGYIISK+IA+ I+S+HV +LKLFKMEDVSMG+ Sbjct: 668 ANGPGYIISKEIAQDIISKHVDGDLKLFKMEDVSMGI 704 Score = 39.7 bits (91), Expect(2) = 3e-11 Identities = 20/26 (76%), Positives = 22/26 (84%) Frame = +3 Query: 2397 SLSISEANDMSVGQIGKRSGSMLQLL 2474 SL ISEANDMSVGQ G+ S S+LQLL Sbjct: 705 SLPISEANDMSVGQTGQGSSSLLQLL 730 >XP_007213608.1 hypothetical protein PRUPE_ppa002345mg [Prunus persica] ONI11082.1 hypothetical protein PRUPE_4G086100 [Prunus persica] Length = 684 Score = 988 bits (2553), Expect = 0.0 Identities = 472/684 (69%), Positives = 563/684 (82%), Gaps = 4/684 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR K E +RFKL + LF +AALYL+FI KFP+F E +SGDDG+ L + Sbjct: 1 MKRLKIEPSVARRFKLQHLLFALAALYLIFISVKFPQFLEIAKAMSGDDGYVGLDLAKVQ 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYGR 792 D+++GD LSKP+ SV ++T HR+L+++++ P + L++KK+ PI+ +++YGR Sbjct: 61 DSQDGD-LSKPLFSSVYKDTFHRKLEDQSQDAPVRPSKEPLEEKKSESKPIRPLQHRYGR 119 Query: 793 IASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPSW 972 I +ILRQ NRTN+LSVLERMADEAWTLGL AWEEV + GKE+ ++SI+E K ESCPSW Sbjct: 120 ITGEILRQRNRTNELSVLERMADEAWTLGLNAWEEVDKHDGKEIGESSIVEGKPESCPSW 179 Query: 973 VSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQFM 1152 +S SG+EL GD++MFLPCGLAAGSS+TVVGT YAH EYVP AK+R +++VSQFM Sbjct: 180 LSMSGEELAMGDKLMFLPCGLAAGSSVTVVGTSHYAHQEYVPQLAKLRRGDGIVMVSQFM 239 Query: 1153 
VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 1332 VELQGLKSV GEDPPKILHLNPRL+GDWSH PVIEHNTCYRMQWG+ QRCDGLPSK ++D Sbjct: 240 VELQGLKSVDGEDPPKILHLNPRLKGDWSHRPVIEHNTCYRMQWGSAQRCDGLPSKNNED 299 Query: 1333 MRVDGYLRCEKWSRND---NRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLR 1503 M VDGY RCEKW RND +++ +++TTSWF+RFIGR +KPEVTWPFPF EG++F+LT+R Sbjct: 300 MLVDGYGRCEKWMRNDMVDSKESKTKTTSWFKRFIGREQKPEVTWPFPFTEGRLFILTIR 359 Query: 1504 AGLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRV 1683 AG+DGFH+ GGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSV+ATSLP SHPSFSPQRV Sbjct: 360 AGVDGFHISVGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVYATSLPASHPSFSPQRV 419 Query: 1684 LEMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALN 1863 LEMS KWKA+PLP P+ LFIGVLSATNHFAERMAVRK+WM RFFVALN Sbjct: 420 LEMSEKWKARPLPKSPVRLFIGVLSATNHFAERMAVRKTWMQSSVIKSSDVVVRFFVALN 479 Query: 1864 PRKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFI 2043 PRKEVNAVLK+EAAYFGD VILPF+DRYELVVLKTI+ICE+GVQN +AA+IMK DDDTF+ Sbjct: 480 PRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTISICEFGVQNVTAAYIMKCDDDTFV 539 Query: 2044 RVETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISK 2223 RV+TVLKE++ ++ KSLYMGNLNLLHRPLRSGKWAV+ ANGPGYIIS Sbjct: 540 RVDTVLKEIEGISSKKSLYMGNLNLLHRPLRSGKWAVTYEEWPEEVYPPYANGPGYIISI 599 Query: 2224 DIAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSS-RPVEYSHSWKFCQYGCVENYFTAH 2400 DIAK+++SQH ++L+LFKMEDVSMGMWVEQFNSS V+YSH+WKFCQYGC+ENY+TAH Sbjct: 600 DIAKFVISQHGSRSLRLFKMEDVSMGMWVEQFNSSMATVQYSHNWKFCQYGCMENYYTAH 659 Query: 2401 YQSPRQMICLWDKLVRGQARCCNF 2472 YQSPRQMICLWDKL RG+ +CCNF Sbjct: 660 YQSPRQMICLWDKLARGRVQCCNF 683 >XP_017978397.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2 [Theobroma cacao] EOY27579.1 Galactosyltransferase family protein isoform 1 [Theobroma cacao] Length = 682 Score = 986 bits (2550), Expect = 0.0 Identities = 478/682 (70%), Positives = 556/682 (81%), Gaps = 2/682 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR KSE +RFKLS+FL GI LYL+FI +KFP F E VLSGD +D+L 
Sbjct: 1 MKRVKSELSTGRRFKLSHFLLGIGGLYLIFIAFKFPHFLEIAAVLSGDGSYDELDGKVVG 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRL-DNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYG 789 D + D L+KP++ SV ++T HR+L DN N+ P + L++ K L PIK +++YG Sbjct: 61 DVNDAD-LNKPLVNSVYKDTFHRKLEDNLNQDAPLRPSKEPLEEGKGRLQPIKPLQHRYG 119 Query: 790 RIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPS 969 RI +I+R+MN+T+DLSVLERMADEAWTLGLKAWEEV + GK++ QNS+ + K ESCPS Sbjct: 120 RITGEIMRRMNKTSDLSVLERMADEAWTLGLKAWEEVDKFDGKKIGQNSLFDGKPESCPS 179 Query: 970 WVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQF 1149 W+S SG++L GD++MFLPCGL AGSSITVVGTP+YAH E+VP A++R L++VSQF Sbjct: 180 WLSVSGEDLASGDRLMFLPCGLKAGSSITVVGTPRYAHQEFVPQLARLRLGDGLVMVSQF 239 Query: 1150 MVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDD 1329 MVELQGLKSV GEDPPKILHLNPRL+GDWSH PVIEHNTCYRMQWG QRCDGL SK D+ Sbjct: 240 MVELQGLKSVDGEDPPKILHLNPRLKGDWSHRPVIEHNTCYRMQWGTAQRCDGLRSKDDE 299 Query: 1330 DMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRA 1506 DM VDG+ RCEKW R+D D KES+TTSWF+RFIGR +KPEVTWPFPFAEG++F+LTLRA Sbjct: 300 DMLVDGHRRCEKWIRDDVADSKESKTTSWFKRFIGREQKPEVTWPFPFAEGRLFILTLRA 359 Query: 1507 GLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVL 1686 +DG+H+ GGRHVTSFPYRTGF+LEDATGLAIKGDVDVHSV+ATSLPTSHPSFSPQRVL Sbjct: 360 AVDGYHINVGGRHVTSFPYRTGFSLEDATGLAIKGDVDVHSVYATSLPTSHPSFSPQRVL 419 Query: 1687 EMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNP 1866 EMSPKWKA PLP I LFIGVLSATNHFAERMAVRK+WM RFFVALN Sbjct: 420 EMSPKWKAYPLPRRSIQLFIGVLSATNHFAERMAVRKTWMQSSAIKSSNVVVRFFVALNT 479 Query: 1867 RKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIR 2046 RKEVNAVLK+EAAYFGD VILPF+DRYELVVLKTIAICE+GVQN SAA+IMK DDDTF+R Sbjct: 480 RKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVSAAYIMKCDDDTFVR 539 Query: 2047 VETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKD 2226 V+TVLKE+D ++ KSLYMGNLNLLHRPLR+GKWAV+ ANGPGYIIS D Sbjct: 540 VDTVLKEIDGISPKKSLYMGNLNLLHRPLRNGKWAVTYEEWPEEVYPPYANGPGYIISSD 599 Query: 2227 
IAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQ 2406 IAK+I+SQH + L+LFKMEDVSMGMWVEQFNSS V+YSH+WKFCQYGC+ +Y+TAHYQ Sbjct: 600 IAKFIISQHGNRKLRLFKMEDVSMGMWVEQFNSSTTVQYSHNWKFCQYGCMVDYYTAHYQ 659 Query: 2407 SPRQMICLWDKLVRGQARCCNF 2472 SPRQMICLWDKL RG+A CCNF Sbjct: 660 SPRQMICLWDKLSRGRAHCCNF 681 >XP_008225535.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2 [Prunus mume] Length = 684 Score = 986 bits (2548), Expect = 0.0 Identities = 473/684 (69%), Positives = 563/684 (82%), Gaps = 4/684 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR K E +RFKL + LF +AALYL+FI KFP+F E +SGDDG+ L + Sbjct: 1 MKRLKIEPSVARRFKLQHLLFALAALYLVFISVKFPQFLEIAKAMSGDDGYVGLDLAKVQ 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYGR 792 D+++GD LSKP+ SV ++T HR+L+++++ P + L++KK+ PI+ +++YGR Sbjct: 61 DSQDGD-LSKPLFSSVYKDTFHRKLEDQSQDAPVRPSKEPLEEKKSESKPIRPLQHRYGR 119 Query: 793 IASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPSW 972 I +ILRQ NRTN+LSVLERMADEAWTLGL AWEEV + GK ++SI+E K ESCPSW Sbjct: 120 ITGEILRQRNRTNELSVLERMADEAWTLGLNAWEEVDKHDGKVTGESSIVEGKPESCPSW 179 Query: 973 VSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQFM 1152 +S SG+EL GD++MFLPCGLAAGSS+TVVGT YAH EYVP AK+R +++VSQFM Sbjct: 180 LSMSGEELAMGDKLMFLPCGLAAGSSVTVVGTSHYAHQEYVPQLAKLRRGDGIVMVSQFM 239 Query: 1153 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 1332 VELQGLKSV GEDPPKILHLNPRL+GDWSH PVIEHNTCYRMQWG+ QRCDGLPSK ++D Sbjct: 240 VELQGLKSVDGEDPPKILHLNPRLKGDWSHRPVIEHNTCYRMQWGSAQRCDGLPSKNNED 299 Query: 1333 MRVDGYLRCEKWSRND---NRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLR 1503 M VDGY RCEKW RND +++ +++TTSWF+RFIGR +KPEVTWPFPF EG++F+LT+R Sbjct: 300 MLVDGYGRCEKWMRNDMVDSKESKTKTTSWFKRFIGREQKPEVTWPFPFTEGRLFILTIR 359 Query: 1504 AGLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRV 1683 AG+DGFH+ GGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSV+ATSLP+SHPSFSPQRV Sbjct: 360 AGVDGFHISVGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVYATSLPSSHPSFSPQRV 419 Query: 1684 
LEMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALN 1863 LEMS KWKA+PLP PI LFIGVLSATNHFAERMAVRK+WM RFFVALN Sbjct: 420 LEMSEKWKARPLPKSPIRLFIGVLSATNHFAERMAVRKTWMQSSVIKSSNVVVRFFVALN 479 Query: 1864 PRKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFI 2043 PRKEVNAVLK+EAAYFGD VILPF+DRYELVVLKTI+ICE+GVQN +AA+IMK DDDTF+ Sbjct: 480 PRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTISICEFGVQNVTAAYIMKCDDDTFV 539 Query: 2044 RVETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISK 2223 RV+TVLKE++ ++ KSLYMGNLNLLHRPLRSGKWAV+ ANGPGYIIS Sbjct: 540 RVDTVLKEIEGISSEKSLYMGNLNLLHRPLRSGKWAVTYEEWPEEVYPPYANGPGYIISI 599 Query: 2224 DIAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSS-RPVEYSHSWKFCQYGCVENYFTAH 2400 DIAK+++SQH ++L+LFKMEDVSMGMWVEQFNSS V+YSH+WKFCQYGC+ENY+TAH Sbjct: 600 DIAKFVISQHGSRSLRLFKMEDVSMGMWVEQFNSSMATVQYSHNWKFCQYGCMENYYTAH 659 Query: 2401 YQSPRQMICLWDKLVRGQARCCNF 2472 YQSPRQMICLWDKL RG+A+CCNF Sbjct: 660 YQSPRQMICLWDKLARGRAQCCNF 683 >XP_012454305.1 PREDICTED: probable beta-1,3-galactosyltransferase 20 [Gossypium raimondii] KJB69891.1 hypothetical protein B456_011G048300 [Gossypium raimondii] Length = 682 Score = 982 bits (2538), Expect = 0.0 Identities = 472/682 (69%), Positives = 556/682 (81%), Gaps = 2/682 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR K E +RFKLS+FL G+ LYL+FI +KFP F E VLSG+D +D Sbjct: 1 MKRVKCEIPTGRRFKLSHFLLGLGGLYLIFIAFKFPYFLEIAAVLSGEDSYDGWNGKVVG 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNE-NEVVPFILPGKVLKDKKNGLPPIKSFKNQYG 789 D +GD LSKP++ SV ++ HR+L++E + P + L++ ++G+ PIK K+ YG Sbjct: 61 DVNDGD-LSKPLVNSVYKDMFHRKLEDELTQDAPMRPTEEPLEEGRDGVQPIKPLKHLYG 119 Query: 790 RIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPS 969 RI +++R+MN+T++LSVLERMADEAWTLGLKAWEEV + GKE+ Q+S+ + K ESCPS Sbjct: 120 RITGEVMRRMNKTSELSVLERMADEAWTLGLKAWEEVDEFDGKEIGQSSLFDGKPESCPS 179 Query: 970 WVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQF 1149 W+S +G++L GD++MFLPCGL AGSSITVVGTP+YAH EYVP AK R + L++VSQF Sbjct: 180 
WLSVNGEDLASGDRLMFLPCGLKAGSSITVVGTPRYAHQEYVPQLAKTRGGNGLVMVSQF 239 Query: 1150 MVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDD 1329 MVELQGL+SV GE PPKILHLNPRL+GDWS PVIEHNTCYRM WG QRCDGLPSK D+ Sbjct: 240 MVELQGLQSVDGEAPPKILHLNPRLKGDWSRKPVIEHNTCYRMHWGTAQRCDGLPSKDDE 299 Query: 1330 DMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRA 1506 DM VDGY RCEKW RND D KES+TTSWF RFIGRA+KPEVTWPFPFAEG++F+LTLRA Sbjct: 300 DMLVDGYRRCEKWIRNDVVDSKESKTTSWFGRFIGRAQKPEVTWPFPFAEGRLFILTLRA 359 Query: 1507 GLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVL 1686 G+DG+H+I GGRHVTSF YRTGF+LEDATGLAIKGDVD+HSV+ATSLPTSHPSFSPQRVL Sbjct: 360 GVDGYHIIVGGRHVTSFAYRTGFSLEDATGLAIKGDVDIHSVYATSLPTSHPSFSPQRVL 419 Query: 1687 EMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNP 1866 EMSPKWKA PLP I LFIG+LSATNHFAERMAVR++WM RFFVALNP Sbjct: 420 EMSPKWKASPLPKRSIRLFIGILSATNHFAERMAVRQTWMQSSAIKSLDVVVRFFVALNP 479 Query: 1867 RKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIR 2046 RKEVNAVLK+EAAYFGD VILPF+DRYELVVLKTIAICE+GVQN +AA+IMK DDDTF+R Sbjct: 480 RKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAAYIMKCDDDTFVR 539 Query: 2047 VETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKD 2226 V++VLK++D+++ +SLYMGNLNLLHRPLR+GKWAV+ ANGPGYIIS D Sbjct: 540 VDSVLKQIDRISPKRSLYMGNLNLLHRPLRNGKWAVTYEEWPEEVYPPYANGPGYIISSD 599 Query: 2227 IAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQ 2406 IAK+IVSQH Q L+LFKMEDVSMGMWVEQFN S V+YSHSWKFCQYGC+ +Y+TAHYQ Sbjct: 600 IAKFIVSQHADQKLRLFKMEDVSMGMWVEQFNQSTTVQYSHSWKFCQYGCMVDYYTAHYQ 659 Query: 2407 SPRQMICLWDKLVRGQARCCNF 2472 SPRQMICLWDKL RGQARCCNF Sbjct: 660 SPRQMICLWDKLSRGQARCCNF 681 >XP_017649307.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2 [Gossypium arboreum] Length = 682 Score = 980 bits (2534), Expect = 0.0 Identities = 471/682 (69%), Positives = 555/682 (81%), Gaps = 2/682 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR K E +RFKLS+FL G+ LYL+FI +KFP F E VLSG+D +D Sbjct: 
1 MKRVKCEIPTGRRFKLSHFLLGLGGLYLIFIAFKFPYFLEIAAVLSGEDSYDGWNGKVVG 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNE-NEVVPFILPGKVLKDKKNGLPPIKSFKNQYG 789 D +GD LSKP++ SV ++ HR+L++E + P + L++ ++G+ PIK K+ YG Sbjct: 61 DVNDGD-LSKPLVNSVYKDMFHRKLEDELTQDAPMRPTEEPLEEGRDGVQPIKPLKHLYG 119 Query: 790 RIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPS 969 RI +++R+MN+T++LSVLERMADEAWTLGLKAWEEV + GKE+ Q+S+ + K ESCPS Sbjct: 120 RITGEVMRRMNKTSELSVLERMADEAWTLGLKAWEEVDEFDGKEIGQSSLFDGKPESCPS 179 Query: 970 WVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQF 1149 W+S +G++L GD++MFLPCGL AGSSITVVGTP+YAH EYVP AK R + L++VSQF Sbjct: 180 WLSVNGEDLASGDRLMFLPCGLKAGSSITVVGTPRYAHQEYVPQLAKTRGGNGLVMVSQF 239 Query: 1150 MVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDD 1329 MVELQGL+SV GE PPKILHLNPRL+GDWS PVIEHNTCYRM WG QRCDGLPSK D+ Sbjct: 240 MVELQGLQSVDGEAPPKILHLNPRLKGDWSRKPVIEHNTCYRMHWGTAQRCDGLPSKDDE 299 Query: 1330 DMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRA 1506 DM VDGY RCEKW RND D KES+TTSWF RFIGRA+KPEVTWPFPFAEG++F+LTLRA Sbjct: 300 DMLVDGYRRCEKWIRNDVVDSKESKTTSWFGRFIGRAQKPEVTWPFPFAEGRLFILTLRA 359 Query: 1507 GLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVL 1686 G+DG+H+I GGRHVTSF YRTGF+LEDATGLAIKGDVD+HSV+ATSLPTSHPSFSPQRVL Sbjct: 360 GVDGYHIIVGGRHVTSFAYRTGFSLEDATGLAIKGDVDIHSVYATSLPTSHPSFSPQRVL 419 Query: 1687 EMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNP 1866 EMSPKWKA PLP I LFIG+LSATNHFAERMAVR++WM RFFVALNP Sbjct: 420 EMSPKWKASPLPKRSIRLFIGILSATNHFAERMAVRQTWMQSSAIKSLDVVVRFFVALNP 479 Query: 1867 RKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIR 2046 RKEVN VLK+EAAYFGD VILPF+DRYELVVLKTIAICE+GVQN +AA+IMK DDDTF+R Sbjct: 480 RKEVNGVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAAYIMKCDDDTFVR 539 Query: 2047 VETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKD 2226 V++VLK++D+++ +SLYMGNLNLLHRPLR+GKWAV+ ANGPGYIIS D Sbjct: 540 VDSVLKQIDRISPKRSLYMGNLNLLHRPLRNGKWAVTYEEWPEEVYPPYANGPGYIISSD 599 Query: 2227 
IAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQ 2406 IAK+IVSQH Q L+LFKMEDVSMGMWVEQFN S V+YSHSWKFCQYGC+ +Y+TAHYQ Sbjct: 600 IAKFIVSQHADQKLRLFKMEDVSMGMWVEQFNQSTTVQYSHSWKFCQYGCMVDYYTAHYQ 659 Query: 2407 SPRQMICLWDKLVRGQARCCNF 2472 SPRQMICLWDKL RGQARCCNF Sbjct: 660 SPRQMICLWDKLSRGQARCCNF 681 >XP_016701253.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2-like isoform X1 [Gossypium hirsutum] Length = 682 Score = 979 bits (2530), Expect = 0.0 Identities = 471/682 (69%), Positives = 555/682 (81%), Gaps = 2/682 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR K E +RFKLS+FL G+ LYL+FI +KFP F E VLS +D +D Sbjct: 1 MKRVKCEIPTGRRFKLSHFLLGLGGLYLIFIAFKFPYFLEIAAVLSREDSYDGWNGKVVG 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNE-NEVVPFILPGKVLKDKKNGLPPIKSFKNQYG 789 D +GD LSKP++ SV ++ HR+L++E + P + L++ ++G+ PIK K+ YG Sbjct: 61 DVNDGD-LSKPLVNSVYKDMFHRKLEDELTQDAPMRPTEEPLEEGRDGVQPIKPLKHLYG 119 Query: 790 RIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPS 969 RI +++R+MN+T++LSVLERMADEAWTLGLKAWEEV + GKE+ Q+S+ + K ESCPS Sbjct: 120 RITGEVMRRMNKTSELSVLERMADEAWTLGLKAWEEVDEFDGKEIGQSSLFDGKPESCPS 179 Query: 970 WVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQF 1149 W+S +G++L GD++MFLPCGL AGSSITVVGTP+YAH EYVP AK R + L++VSQF Sbjct: 180 WLSVNGEDLASGDRLMFLPCGLKAGSSITVVGTPRYAHQEYVPQLAKTRGGNGLVMVSQF 239 Query: 1150 MVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDD 1329 MVELQGL+SV GE PPKILHLNPRL+GDWS PVIEHNTCYRM WG QRCDGLPSK D+ Sbjct: 240 MVELQGLQSVDGEAPPKILHLNPRLKGDWSRKPVIEHNTCYRMHWGTAQRCDGLPSKDDE 299 Query: 1330 DMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRA 1506 DM VDGY RCEKW RND D KES+TTSWF RFIGRA+KPEVTWPFPFAEG++F+LTLRA Sbjct: 300 DMLVDGYRRCEKWIRNDVVDSKESKTTSWFGRFIGRAQKPEVTWPFPFAEGRLFILTLRA 359 Query: 1507 GLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVL 1686 G+DG+H+I GGRHVTSF YRTGF+LEDATGLAIKGDVD+HSV+ATSLPTSHPSFSPQRVL Sbjct: 360 
GVDGYHIIVGGRHVTSFAYRTGFSLEDATGLAIKGDVDIHSVYATSLPTSHPSFSPQRVL 419 Query: 1687 EMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNP 1866 EMSPKWKA PLP I LFIG+LSATNHFAERMAVR++WM RFFVALNP Sbjct: 420 EMSPKWKASPLPKRSIRLFIGILSATNHFAERMAVRQTWMQSSAIKSLDVVVRFFVALNP 479 Query: 1867 RKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIR 2046 RKEVNAVLK+EAAYFGD VILPF+DRYELVVLKTIAICE+GVQN +AA+IMK DDDTF+R Sbjct: 480 RKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAAYIMKCDDDTFVR 539 Query: 2047 VETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKD 2226 V++VLK++D+++ +SLYMGNLNLLHRPLR+GKWAV+ ANGPGYIIS D Sbjct: 540 VDSVLKQIDRISPKRSLYMGNLNLLHRPLRNGKWAVTYEEWPEEVYPPYANGPGYIISSD 599 Query: 2227 IAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQ 2406 IAK+IVSQH Q L+LFKMEDVSMGMWVEQFN S V+YSHSWKFCQYGC+ +Y+TAHYQ Sbjct: 600 IAKFIVSQHADQKLRLFKMEDVSMGMWVEQFNQSTTVQYSHSWKFCQYGCMVDYYTAHYQ 659 Query: 2407 SPRQMICLWDKLVRGQARCCNF 2472 SPRQMICLWDKL RGQARCCNF Sbjct: 660 SPRQMICLWDKLSRGQARCCNF 681 >XP_016725906.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2-like [Gossypium hirsutum] Length = 682 Score = 978 bits (2528), Expect = 0.0 Identities = 470/682 (68%), Positives = 554/682 (81%), Gaps = 2/682 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR K E +RFKLS+FL G+ LYL+FI +KFP F E VLS +D +D Sbjct: 1 MKRVKCEIPSGRRFKLSHFLLGLGGLYLIFIAFKFPYFLEIAAVLSREDSYDGWNGKVVG 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNE-NEVVPFILPGKVLKDKKNGLPPIKSFKNQYG 789 D +GD LSKP++ SV ++ HR+L++E + P + L++ ++G+ PIK K+ YG Sbjct: 61 DVNDGD-LSKPLVNSVYKDMFHRKLEDELTQDAPMRPTEEPLEEGRDGVQPIKPLKHLYG 119 Query: 790 RIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPS 969 RI +++R+MN+T++LSVLERMADEAWTLGLKAWEEV + GKE+ Q+S+ + K ESCPS Sbjct: 120 RITGEVMRRMNKTSELSVLERMADEAWTLGLKAWEEVDEFDGKEIGQSSLFDGKPESCPS 179 Query: 970 WVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQF 1149 W+S +G++L GD++MFLPCGL AGSSITVVGTP+YAH EYVP AK R + L++VSQF Sbjct: 180 
WLSVNGEDLASGDRLMFLPCGLKAGSSITVVGTPRYAHQEYVPQLAKTRGGNGLVMVSQF 239 Query: 1150 MVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDD 1329 MVELQGL+SV GE PPKILHLNPRL+GDWS PVIEHNTCYRM WG QRCDGLPSK D+ Sbjct: 240 MVELQGLQSVDGEAPPKILHLNPRLKGDWSRKPVIEHNTCYRMHWGTAQRCDGLPSKDDE 299 Query: 1330 DMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRA 1506 DM VDGY RCEKW RND D KES+TTSWF RFIGRA+KPEVTWPFPFAEG++F+LTLRA Sbjct: 300 DMLVDGYRRCEKWIRNDVVDSKESKTTSWFGRFIGRAQKPEVTWPFPFAEGRLFILTLRA 359 Query: 1507 GLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVL 1686 G+DG+H+I GGRHVTSF YRTGF+LEDATGLAIKGDVD+HSV+ATSLPTSHPSFSPQRVL Sbjct: 360 GVDGYHIIVGGRHVTSFAYRTGFSLEDATGLAIKGDVDIHSVYATSLPTSHPSFSPQRVL 419 Query: 1687 EMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNP 1866 EMSPKWKA PLP I LFIG+LSATNHFAERMAVR++WM RFFVALNP Sbjct: 420 EMSPKWKASPLPKRSIRLFIGILSATNHFAERMAVRQTWMQSSAIKSLDVVVRFFVALNP 479 Query: 1867 RKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIR 2046 RKEVN VLK+EAAYFGD VILPF+DRYELVVLKTIAICE+GVQN +AA+IMK DDDTF+R Sbjct: 480 RKEVNGVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAAYIMKCDDDTFVR 539 Query: 2047 VETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKD 2226 V++VLK++D+++ +SLYMGNLNLLHRPLR+GKWAV+ ANGPGYIIS D Sbjct: 540 VDSVLKQIDRISPKRSLYMGNLNLLHRPLRNGKWAVTYEEWPEEVYPPYANGPGYIISSD 599 Query: 2227 IAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQ 2406 IAK+IVSQH Q L+LFKMEDVSMGMWVEQFN S V+YSHSWKFCQYGC+ +Y+TAHYQ Sbjct: 600 IAKFIVSQHADQKLRLFKMEDVSMGMWVEQFNQSTTVQYSHSWKFCQYGCMVDYYTAHYQ 659 Query: 2407 SPRQMICLWDKLVRGQARCCNF 2472 SPRQMICLWDKL RGQARCCNF Sbjct: 660 SPRQMICLWDKLSRGQARCCNF 681 >XP_016701254.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2-like isoform X2 [Gossypium hirsutum] Length = 682 Score = 978 bits (2527), Expect = 0.0 Identities = 471/682 (69%), Positives = 554/682 (81%), Gaps = 2/682 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR K E +RFKLS+FL G+ LYL+FI +KFP F E 
VLS +D +D Sbjct: 1 MKRVKCEIPTGRRFKLSHFLLGLGGLYLIFIAFKFPYFLEIAAVLSREDSYDGWNGKVVG 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNE-NEVVPFILPGKVLKDKKNGLPPIKSFKNQYG 789 D +GD LSKP++ SV ++ HR+L++E + P + L++ ++G+ PIK K+ YG Sbjct: 61 DVNDGD-LSKPLVNSVYKDMFHRKLEDELTQDAPMRPTEEPLEEGRDGVQPIKPLKHLYG 119 Query: 790 RIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPS 969 RI +++R+MN+T++LSVLERMADEAWTLGLKAWEEV + GKE+ Q+S+ + K ESCPS Sbjct: 120 RITGEVMRRMNKTSELSVLERMADEAWTLGLKAWEEVDEFDGKEIGQSSLFDGKPESCPS 179 Query: 970 WVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQF 1149 W+S +G++L GD++MFLPCGL AGSSITVVGTP+YAH EYVP AK R + L++VSQF Sbjct: 180 WLSVNGEDLASGDRLMFLPCGLKAGSSITVVGTPRYAHQEYVPQLAKTRGGNGLVMVSQF 239 Query: 1150 MVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDD 1329 MVELQGL+SV GE PPKILHLNPRL+GDWS PVIEHNTCYRM WG QRCDGLPSK D+ Sbjct: 240 MVELQGLQSVDGEAPPKILHLNPRLKGDWSRKPVIEHNTCYRMHWGTAQRCDGLPSKDDE 299 Query: 1330 DMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRA 1506 DM VDGY RCEKW RND D KES+TTSWF RFIGRA+KPEVTWPFPFAEG++F LTLRA Sbjct: 300 DMLVDGYRRCEKWIRNDVVDSKESKTTSWFGRFIGRAQKPEVTWPFPFAEGRLFALTLRA 359 Query: 1507 GLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVL 1686 G+DG+H+I GGRHVTSF YRTGF+LEDATGLAIKGDVD+HSV+ATSLPTSHPSFSPQRVL Sbjct: 360 GVDGYHIIVGGRHVTSFAYRTGFSLEDATGLAIKGDVDIHSVYATSLPTSHPSFSPQRVL 419 Query: 1687 EMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNP 1866 EMSPKWKA PLP I LFIG+LSATNHFAERMAVR++WM RFFVALNP Sbjct: 420 EMSPKWKASPLPKRSIRLFIGILSATNHFAERMAVRQTWMQSSAIKSLDVVVRFFVALNP 479 Query: 1867 RKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIR 2046 RKEVNAVLK+EAAYFGD VILPF+DRYELVVLKTIAICE+GVQN +AA+IMK DDDTF+R Sbjct: 480 RKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAAYIMKCDDDTFVR 539 Query: 2047 VETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKD 2226 V++VLK++D+++ +SLYMGNLNLLHRPLR+GKWAV+ ANGPGYIIS D Sbjct: 540 VDSVLKQIDRISPKRSLYMGNLNLLHRPLRNGKWAVTYEEWPEEVYPPYANGPGYIISSD 599 Query: 2227 
IAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQ 2406 IAK+IVSQH Q L+LFKMEDVSMGMWVEQFN S V+YSHSWKFCQYGC+ +Y+TAHYQ Sbjct: 600 IAKFIVSQHADQKLRLFKMEDVSMGMWVEQFNQSTTVQYSHSWKFCQYGCMVDYYTAHYQ 659 Query: 2407 SPRQMICLWDKLVRGQARCCNF 2472 SPRQMICLWDKL RGQARCCNF Sbjct: 660 SPRQMICLWDKLSRGQARCCNF 681 >XP_012072180.1 PREDICTED: probable beta-1,3-galactosyltransferase 20 [Jatropha curcas] Length = 682 Score = 977 bits (2526), Expect = 0.0 Identities = 471/682 (69%), Positives = 552/682 (80%), Gaps = 2/682 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR KSE +RFKLS FL GI ALYL+F+ +KFP F E +LSGDD + L ++ Sbjct: 1 MKRLKSEPPNGRRFKLSQFLLGIGALYLVFLAFKFPHFLEIAAMLSGDDSYVGLDTAATE 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRL-DNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYG 789 D ++ D LS+P SV ++T HR+L DN+N+ P + + L++ K PIK +++YG Sbjct: 61 DVEDSD-LSRPFFGSVYKDTFHRKLEDNQNQNAPTMPDKEPLEEVKGMNKPIKPHQHRYG 119 Query: 790 RIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPS 969 RI +I+R+ NRT+ LSVLE +ADEAWTLGLKAWEEV+ Y GKE+ QNS+ E K +SCPS Sbjct: 120 RITGEIMRRRNRTSGLSVLETLADEAWTLGLKAWEEVQKYDGKEIGQNSVYEGKLDSCPS 179 Query: 970 WVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQF 1149 WVS SG+EL G+++MFLPCGL+AGSSITVVGTP YAH EYVP A+ R+ + VSQF Sbjct: 180 WVSISGEELASGEKMMFLPCGLSAGSSITVVGTPHYAHEEYVPQLARFRSGDGNVKVSQF 239 Query: 1150 MVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDD 1329 M+ELQGLK+V GEDPPKILHLNPRLRGDWS PVIEHNTCYRMQWG QRCDGLPSK D+ Sbjct: 240 MIELQGLKAVDGEDPPKILHLNPRLRGDWSKQPVIEHNTCYRMQWGTAQRCDGLPSKKDE 299 Query: 1330 DMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRA 1506 DM VDG++RCEKW RND D KES+TTSWF+RFIGR +KPEVTWPFPF EGK+F+LTLRA Sbjct: 300 DMLVDGFMRCEKWMRNDIVDSKESKTTSWFKRFIGREQKPEVTWPFPFVEGKLFILTLRA 359 Query: 1507 GLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVL 1686 G+DG+H+ GGRHV+SF YR GFTLEDATGLAIKGD+DVHSV+ATSLP+SHPSFSPQRVL Sbjct: 360 GVDGYHINVGGRHVSSFAYRPGFTLEDATGLAIKGDIDVHSVYATSLPSSHPSFSPQRVL 419 Query: 
1687 EMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNP 1866 EMS KWKA P P PI LFIG+LSATNHFAERMAVRK+WM RFFVALNP Sbjct: 420 EMSDKWKALPSPKNPIQLFIGILSATNHFAERMAVRKTWMQASPIKSSKVVVRFFVALNP 479 Query: 1867 RKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIR 2046 RKEVN VLK+EAAYFGD VILPF+DRYELVVLKT+AICE+GVQN SAA+IMK DDDTF+R Sbjct: 480 RKEVNVVLKKEAAYFGDIVILPFMDRYELVVLKTVAICEFGVQNVSAAYIMKCDDDTFVR 539 Query: 2047 VETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKD 2226 VETVLKE+ V+ KSLYMGNLNLLHRPLR+GKWAV+ ANGP Y+IS D Sbjct: 540 VETVLKEISGVSPKKSLYMGNLNLLHRPLRTGKWAVTFEEWPESVYPPYANGPAYVISSD 599 Query: 2227 IAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQ 2406 IAK+I++QH ++L+LFKMEDVSMGMWVEQFNSS+PV+YSH+WKFCQYGC+ENY+TAHYQ Sbjct: 600 IAKFIIAQHGNRSLRLFKMEDVSMGMWVEQFNSSKPVQYSHNWKFCQYGCMENYYTAHYQ 659 Query: 2407 SPRQMICLWDKLVRGQARCCNF 2472 SPRQMICLWDKL RG A CCNF Sbjct: 660 SPRQMICLWDKLARGGAHCCNF 681 >KHG03949.1 putative beta-1,3-galactosyltransferase 20 -like protein [Gossypium arboreum] Length = 682 Score = 977 bits (2526), Expect = 0.0 Identities = 469/682 (68%), Positives = 555/682 (81%), Gaps = 2/682 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR K E +RFKLS+FL G+ LYL+FI +KFP F E VLSG+D +D Sbjct: 1 MKRVKCEIPTGRRFKLSHFLLGLGGLYLIFIAFKFPYFLEIAAVLSGEDSYDGWNGKVVG 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNE-NEVVPFILPGKVLKDKKNGLPPIKSFKNQYG 789 D +GD LSKP++ SV ++ HR+L++E + P + L++ ++G+ PIK K+ YG Sbjct: 61 DVNDGD-LSKPLVNSVYKDMFHRKLEDELTQDAPMRPTEEPLEEGRDGVQPIKPLKHLYG 119 Query: 790 RIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPS 969 RI +++R+MN+T++LSVLERMADEAWTLGLKAWEEV + GKE+ Q+S+ + K ESCPS Sbjct: 120 RITGEVMRRMNKTSELSVLERMADEAWTLGLKAWEEVDEFDGKEIGQSSLFDGKPESCPS 179 Query: 970 WVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQF 1149 W+S +G++L GD++MFLPCGL AGSSITVVGTP+YAH EYVP A+ R + L++VSQF Sbjct: 180 WLSVNGEDLASGDRLMFLPCGLKAGSSITVVGTPRYAHQEYVPQLARTRGGNGLVMVSQF 239 Query: 1150 
MVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDD 1329 MVELQGL+SV GE PPKILHLNPRL+GDWS PVIEHNTCYRM WG QRCDGLPSK D+ Sbjct: 240 MVELQGLQSVDGEAPPKILHLNPRLKGDWSRKPVIEHNTCYRMHWGTAQRCDGLPSKDDE 299 Query: 1330 DMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRA 1506 DM VDGY RCEKW R+D D KES+TTSWF RFIGRA+KPEVTWPFPFAEG++F+LTLRA Sbjct: 300 DMLVDGYRRCEKWIRDDVVDSKESKTTSWFGRFIGRAQKPEVTWPFPFAEGRLFILTLRA 359 Query: 1507 GLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVL 1686 G+DG+H+I GGRHVTSF YRTGF+LEDATGLAIKGDVD+HSV+ATSLPTSHPSFSPQRVL Sbjct: 360 GVDGYHIIVGGRHVTSFAYRTGFSLEDATGLAIKGDVDIHSVYATSLPTSHPSFSPQRVL 419 Query: 1687 EMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNP 1866 EMSPKWKA PLP I LFIG+LSATNHFAERMAVR++WM RFFVALNP Sbjct: 420 EMSPKWKASPLPKRSIRLFIGILSATNHFAERMAVRQTWMQSSAIKSLDVVVRFFVALNP 479 Query: 1867 RKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIR 2046 RKEVN VLK+EAAYFGD VILPF+DRYELVVLKTIAICE+GVQN +AA+IMK DDDTF+R Sbjct: 480 RKEVNGVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAAYIMKCDDDTFVR 539 Query: 2047 VETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKD 2226 V++VLK++D+++ +SLYMGNLNLLHRPLR+GKWAV+ ANGPGYIIS D Sbjct: 540 VDSVLKQIDRISPKRSLYMGNLNLLHRPLRNGKWAVTYEEWPEEVYPPYANGPGYIISSD 599 Query: 2227 IAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQ 2406 IAK+IVSQH Q L+LFKMEDVSMGMWVEQFN S V+YSHSWKFCQYGC+ +Y+TAHYQ Sbjct: 600 IAKFIVSQHADQKLRLFKMEDVSMGMWVEQFNQSTTVQYSHSWKFCQYGCMVDYYTAHYQ 659 Query: 2407 SPRQMICLWDKLVRGQARCCNF 2472 SPRQMICLWDKL RGQARCCNF Sbjct: 660 SPRQMICLWDKLSRGQARCCNF 681 >XP_018847685.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2 [Juglans regia] XP_018847686.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2 [Juglans regia] Length = 682 Score = 976 bits (2524), Expect = 0.0 Identities = 479/685 (69%), Positives = 557/685 (81%), Gaps = 5/685 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR KSE G +RFKLS+FL GIA LYL+FI +KFP F E +LSGDD + 
+G+ G + Sbjct: 1 MKRPKSEPPGARRFKLSHFLLGIAVLYLVFISFKFPHFLEIAAMLSGDDSY--VGTDGTM 58 Query: 613 --DAKNGDILSKPILRSVSEETIHRRL-DNENEVVPFILPGKVLKDKKNGLPPIKSFKNQ 783 D+++ D LSKP SV ++ HR+L DN+N+ PF + L++KK+ PIK +++ Sbjct: 59 RGDSEDPD-LSKPFFTSVYKDAFHRKLEDNQNQDAPFRPSQEPLEEKKSASRPIKPLQHR 117 Query: 784 YGRIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESC 963 YGRI +I+++ NRT DLSVLERMADEAWTLGLKAWEE+ KE ++SILE K ESC Sbjct: 118 YGRITGEIMKRRNRTIDLSVLERMADEAWTLGLKAWEELDKVDEKETGESSILEGKPESC 177 Query: 964 PSWVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVS 1143 PSW+S SG+EL KGD++M LPCGLAAGSS+TVVGTP YAH EYVP AK+R S++++VS Sbjct: 178 PSWISISGEEL-KGDRLMILPCGLAAGSSVTVVGTPHYAHQEYVPQLAKLRGGSAMVMVS 236 Query: 1144 QFMVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKG 1323 QFMVELQGLK V GE+PPKILHLNPRL+GDWS PVIEHNTCYRMQWG QRCDGLPS Sbjct: 237 QFMVELQGLKVVDGEEPPKILHLNPRLKGDWSKRPVIEHNTCYRMQWGKAQRCDGLPSNN 296 Query: 1324 DDDMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTL 1500 DDDM VDG+ RCEKW RND D KES+TTSWF+RFIGR +KPEVTWPFPF EG++F+LTL Sbjct: 297 DDDMLVDGHGRCEKWMRNDIVDSKESKTTSWFKRFIGREQKPEVTWPFPFVEGRLFILTL 356 Query: 1501 RAGLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQR 1680 RAG+DG+H+ GGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSV+ATSLPTSHPSFSP R Sbjct: 357 RAGVDGYHISVGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVYATSLPTSHPSFSPHR 416 Query: 1681 VLEMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVAL 1860 VLE S KWK PLP +PLF+GVLSA NHFAERMAVRK+WM RFFVAL Sbjct: 417 VLEFSEKWKVNPLPKSKVPLFVGVLSAPNHFAERMAVRKTWMQTSAIKSSDVVVRFFVAL 476 Query: 1861 NPRKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTF 2040 NPRKEVNAVLK+EAAYFGD VILPF+DRYELVVLKTIAICE+GVQN +AA+IMK DDDTF Sbjct: 477 NPRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVTAAYIMKCDDDTF 536 Query: 2041 IRVETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIIS 2220 +RV+TVLKE++ ++ KSLYMGNLNLLHRPLRSGKWAV+ ANGPGY+IS Sbjct: 537 VRVDTVLKEIEGISSNKSLYMGNLNLLHRPLRSGKWAVTYEEWPEEVYPPYANGPGYVIS 596 Query: 2221 
KDIAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSR-PVEYSHSWKFCQYGCVENYFTA 2397 DIAKYI+SQH ++L+LFKMEDVSMGMWVEQFNSS+ V+YSHSWKFCQYGC+ENYFTA Sbjct: 597 IDIAKYIISQHGNRSLRLFKMEDVSMGMWVEQFNSSKAAVQYSHSWKFCQYGCLENYFTA 656 Query: 2398 HYQSPRQMICLWDKLVRGQARCCNF 2472 HYQSPRQMICLW L RG+A CCNF Sbjct: 657 HYQSPRQMICLWGNLARGRAHCCNF 681 >XP_009362277.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2-like [Pyrus x bretschneideri] Length = 679 Score = 976 bits (2522), Expect = 0.0 Identities = 470/684 (68%), Positives = 552/684 (80%), Gaps = 4/684 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR K E +RFKL + LFGIAALYL+FI +KFP+F E LSGDDG+ L V Sbjct: 1 MKRLKVEPSVARRFKLQHLLFGIAALYLVFISFKFPQFLEIANTLSGDDGYVGLKLVEDV 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYGR 792 D LSKP+ S ++T HR+L+++N+ P + L++ K G PIK ++QYGR Sbjct: 61 D------LSKPLFSSAYKDTFHRKLEDQNQDAPVRPSKEPLEESKGGSKPIKPLQHQYGR 114 Query: 793 IASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPSW 972 I +I+R+ NRTN+LSVLERMADEAWTLGL AWEEV+ GKE+ ++SI+E K ESCPSW Sbjct: 115 ITGEIMRRRNRTNELSVLERMADEAWTLGLNAWEEVRQLDGKEIGESSIIEGKPESCPSW 174 Query: 973 VSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQFM 1152 +S SG++L GD++MFLPCGLAAGSSITVVGTP YAH EYVP AK+R ++VSQFM Sbjct: 175 LSMSGEDLATGDKLMFLPCGLAAGSSITVVGTPHYAHKEYVPQLAKLRRGDGTVMVSQFM 234 Query: 1153 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 1332 VELQGLKSV GEDPPKILHLNPRL+GDWS PVIEHNTCYRMQWG QRCDGLP K ++D Sbjct: 235 VELQGLKSVDGEDPPKILHLNPRLKGDWSQQPVIEHNTCYRMQWGTAQRCDGLPFKSNED 294 Query: 1333 MRVDGYLRCEKWSRND---NRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLR 1503 M VDG+ RCEKW RND +++ +++TTSWF+RFIGR +KPEVTWPFPF EG++F+LT+R Sbjct: 295 MLVDGFGRCEKWMRNDMADSKESKTKTTSWFKRFIGREQKPEVTWPFPFTEGRLFILTIR 354 Query: 1504 AGLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRV 1683 AG+DGFHV GGRH+TSFPYR GFTL+DATGLAIKGDVD HSV+ATSLPTSHPSFSPQRV Sbjct: 355 AGVDGFHVSVGGRHLTSFPYRMGFTLQDATGLAIKGDVDAHSVYATSLPTSHPSFSPQRV 
414 Query: 1684 LEMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALN 1863 LEM+ KWKA+PLP PI LFIGVLSATNHFAERMAVRK+WM RFFVALN Sbjct: 415 LEMAEKWKARPLPKSPIRLFIGVLSATNHFAERMAVRKTWMQSSSIKSSNVVVRFFVALN 474 Query: 1864 PRKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFI 2043 PRKEVN +LK+EAAYFGD VILPF+D YELVVLKTI+ICE+G QN +AA+IMK DDDTFI Sbjct: 475 PRKEVNQMLKKEAAYFGDIVILPFMDHYELVVLKTISICEFGAQNVTAAYIMKCDDDTFI 534 Query: 2044 RVETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISK 2223 RV+TVLKE++ + KSLYMGNLNLLHRPLRSGKWAV+ ANGPGY+IS Sbjct: 535 RVDTVLKEIEGIPSKKSLYMGNLNLLHRPLRSGKWAVTYEEWPEEVYPPYANGPGYVISI 594 Query: 2224 DIAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRP-VEYSHSWKFCQYGCVENYFTAH 2400 DIAK+I+SQHV ++L+LFKMEDVSMGMWVEQFNS+ V+YSH+WKFCQYGC++NYFTAH Sbjct: 595 DIAKFIISQHVSRSLRLFKMEDVSMGMWVEQFNSTTAVVQYSHNWKFCQYGCMDNYFTAH 654 Query: 2401 YQSPRQMICLWDKLVRGQARCCNF 2472 YQSPRQMICLWDKL RGQARCCNF Sbjct: 655 YQSPRQMICLWDKLARGQARCCNF 678 >XP_009360949.1 PREDICTED: hydroxyproline O-galactosyltransferase GALT2-like [Pyrus x bretschneideri] Length = 678 Score = 974 bits (2517), Expect = 0.0 Identities = 472/683 (69%), Positives = 549/683 (80%), Gaps = 3/683 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR K E +RFKL + LFGIAALYL+FI +KFP+F E LSGDD +D L V Sbjct: 1 MKRLKIEPSVARRFKLQHLLFGIAALYLVFISFKFPQFLEIAKTLSGDDSYDGLELVEDV 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRLDNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYGR 792 D LSKP+ SV ++T HR+L+++N+ P + L++ K G PIK ++QYGR Sbjct: 61 D------LSKPLFNSVYKDTFHRKLEDQNQDAPVRPRKEPLEESKGGSKPIKPLQHQYGR 114 Query: 793 IASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPSW 972 I +I+R+ NRTN+LSVLERMADEAWTLGL AWEEV GKE+ ++SI+E K ESCPSW Sbjct: 115 ITGEIMRRRNRTNELSVLERMADEAWTLGLSAWEEVGKLDGKEIGESSIIEGKPESCPSW 174 Query: 973 VSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQFM 1152 +S G+EL GD++MFLPCGLAAGSSITVVGT +YAH EYVP AK+R ++VSQFM Sbjct: 175 LSMRGEELATGDKLMFLPCGLAAGSSITVVGTSRYAHEEYVPQLAKLRRGDGTVMVSQFM 234 
Query: 1153 VELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDDD 1332 VELQGLKSV GEDPPKILHLNPRL+GDWS PVIEHNTCYRMQWG QRCDGLPSK ++D Sbjct: 235 VELQGLKSVDGEDPPKILHLNPRLKGDWSKQPVIEHNTCYRMQWGTAQRCDGLPSKSNED 294 Query: 1333 MRVDGYLRCEKWSRN--DNRDKESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRA 1506 M VDGY RCEKW RN D+++ +++TTSWF+RFIGR +KPEVTWPFPF EG++FVLT+RA Sbjct: 295 MLVDGYGRCEKWMRNGMDSKESKTKTTSWFKRFIGREQKPEVTWPFPFTEGRLFVLTIRA 354 Query: 1507 GLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVL 1686 G+DGFH GGRHVTSFPYR GFTL+DATGLAIKGDVDVHSV+ATSLP SHPSFSPQRVL Sbjct: 355 GVDGFHTSVGGRHVTSFPYRMGFTLQDATGLAIKGDVDVHSVYATSLPASHPSFSPQRVL 414 Query: 1687 EMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNP 1866 EM+ KWKA+PLP P LFIGVLSATNHFAERMAVRK+WM RFFVALNP Sbjct: 415 EMAEKWKARPLPKRPFQLFIGVLSATNHFAERMAVRKTWMQSSAIKSSNVVVRFFVALNP 474 Query: 1867 RKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIR 2046 RKEVN +LK+EAAYFGD VILPF+DRYELVVLKTI+ICE+G QN +AA+IMK DDDTF+R Sbjct: 475 RKEVNQMLKKEAAYFGDIVILPFMDRYELVVLKTISICEFGSQNVTAAYIMKCDDDTFVR 534 Query: 2047 VETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKD 2226 V+TVLKE+ ++ KSLYMGNLNLLHRPLRSGKWAV+ ANGPGY+IS D Sbjct: 535 VDTVLKEIVGISSKKSLYMGNLNLLHRPLRSGKWAVTYEEWPEEVYPPYANGPGYVISID 594 Query: 2227 IAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRP-VEYSHSWKFCQYGCVENYFTAHY 2403 IAK+I+SQHV L+LFKMEDVSMGMWVEQFNS+ V+YSH+WKFCQYGC+ENYFTAHY Sbjct: 595 IAKFIISQHVSHKLRLFKMEDVSMGMWVEQFNSTTAIVQYSHNWKFCQYGCMENYFTAHY 654 Query: 2404 QSPRQMICLWDKLVRGQARCCNF 2472 QSPRQMICLWDKL RGQA+CCNF Sbjct: 655 QSPRQMICLWDKLARGQAQCCNF 677 >XP_002531052.1 PREDICTED: probable beta-1,3-galactosyltransferase 20 [Ricinus communis] EEF31313.1 transferase, transferring glycosyl groups, putative [Ricinus communis] Length = 683 Score = 974 bits (2517), Expect = 0.0 Identities = 472/683 (69%), Positives = 557/683 (81%), Gaps = 3/683 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR KSE +R KLS+FL GI ALYL+F+ +KFP F E 
+LSGDD + L + Sbjct: 1 MKRLKSEPPSGRRCKLSHFLLGIGALYLVFLAFKFPHFLEIAAMLSGDDSYVGLDGALVE 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRL-DNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYG 789 D ++ + L+KP+ SV ++T HR+L DN+N+ P + + L++ K PIK ++ YG Sbjct: 61 DMEDSE-LTKPLFSSVYKDTFHRKLEDNQNQNAPRMPSKEPLEEVKGESKPIKPLQHPYG 119 Query: 790 RIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSG-KEVDQNSILEVKQESCP 966 RI +IL++ NRT+DLS+LERMADEAWTLGLKAWEEV+ Y KE+ QNS+ + K E CP Sbjct: 120 RITGEILKRRNRTSDLSILERMADEAWTLGLKAWEEVEKYDDEKEIGQNSVYDGKTEPCP 179 Query: 967 SWVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQ 1146 SWVS G EL+ +++MFLPCGLAAGSSIT+VGTP YAH EYVP A++R +++VSQ Sbjct: 180 SWVSMKGAELSGEEKMMFLPCGLAAGSSITLVGTPHYAHQEYVPQLARLRNGDGIVMVSQ 239 Query: 1147 FMVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGD 1326 FM+ELQGLK+V GEDPPKILHLNPRLRGDWS PVIEHNTCYRMQWG QRCDGLPSK D Sbjct: 240 FMIELQGLKAVDGEDPPKILHLNPRLRGDWSKQPVIEHNTCYRMQWGTAQRCDGLPSKKD 299 Query: 1327 DDMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLR 1503 +DM VDG+LRCEKW RND D KES+TTSWF+RFIGR +KPEVTWPFPFAEG++F+LTLR Sbjct: 300 EDMLVDGFLRCEKWMRNDIVDSKESKTTSWFKRFIGREQKPEVTWPFPFAEGRLFILTLR 359 Query: 1504 AGLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRV 1683 AG+DG+H+ GG HVTSFPYR GFTLEDATGLAIKG+VDVHS++ATSLP+SHP+FSPQRV Sbjct: 360 AGVDGYHINVGGLHVTSFPYRPGFTLEDATGLAIKGEVDVHSIYATSLPSSHPNFSPQRV 419 Query: 1684 LEMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALN 1863 LEMS KWKA PLP PI LFIG+LSATNHFAERMAVRK+WM RFFVAL+ Sbjct: 420 LEMSEKWKAHPLPKIPIRLFIGILSATNHFAERMAVRKTWMQSSSIKSSSVVVRFFVALS 479 Query: 1864 PRKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFI 2043 PRKEVNAVLK+EAAYFGD VILPF+DRYELVVLKTIAICE+GVQN SAA+IMK DDDTF+ Sbjct: 480 PRKEVNAVLKKEAAYFGDIVILPFMDRYELVVLKTIAICEFGVQNVSAAYIMKCDDDTFV 539 Query: 2044 RVETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISK 2223 RVETVLKE+D ++ KSLYMGNLNLLHRPLRSGKWAV+ ANGPGY+IS Sbjct: 540 RVETVLKEIDGISSKKSLYMGNLNLLHRPLRSGKWAVTFEEWPEAVYPPYANGPGYVISY 599 Query: 2224 
DIAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHY 2403 DIAK+IV+QH ++L+LFKMEDVSMGMWVEQFNSSR V+YSH+WKFCQYGC+ENY+TAHY Sbjct: 600 DIAKFIVAQHGNRSLRLFKMEDVSMGMWVEQFNSSRTVQYSHNWKFCQYGCMENYYTAHY 659 Query: 2404 QSPRQMICLWDKLVRGQARCCNF 2472 QSPRQMICLWDKL RG+A+CCNF Sbjct: 660 QSPRQMICLWDKLSRGRAQCCNF 682 >OMO65814.1 hypothetical protein COLO4_31004 [Corchorus olitorius] Length = 682 Score = 971 bits (2511), Expect = 0.0 Identities = 472/682 (69%), Positives = 552/682 (80%), Gaps = 2/682 (0%) Frame = +1 Query: 433 MKRSKSEHFGVKRFKLSYFLFGIAALYLLFICYKFPKFFESVGVLSGDDGFDKLGSSGFV 612 MKR KSE +RFKLS+FL GI LYL+FI +KFP F E VLS DD +D L Sbjct: 1 MKRVKSELPTGRRFKLSHFLLGIGVLYLIFIAFKFPHFLEIAAVLSVDDSYDGLDGKVAG 60 Query: 613 DAKNGDILSKPILRSVSEETIHRRL-DNENEVVPFILPGKVLKDKKNGLPPIKSFKNQYG 789 D + D LSKP++ SV ++ HR+L DN N+ P + L+++K + PIK +++YG Sbjct: 61 DVNDAD-LSKPLVNSVYKDAFHRKLEDNLNQDAPLRPSKEPLEERKGKVQPIKPLQHKYG 119 Query: 790 RIASDILRQMNRTNDLSVLERMADEAWTLGLKAWEEVKNYSGKEVDQNSILEVKQESCPS 969 RI +I+R+MN+T++LSVLERMADEAWTLGLKAWEEV + K++ NS+ E K ESCPS Sbjct: 120 RITGEIMRRMNKTSELSVLERMADEAWTLGLKAWEEVDKFDAKDIGLNSLFEGKPESCPS 179 Query: 970 WVSSSGKELTKGDQVMFLPCGLAAGSSITVVGTPKYAHHEYVPTQAKMRTASSLILVSQF 1149 W+S SG++L GD++MFLPCGL AGSSITVVGTP +AH E+VP AK+R L+ VSQF Sbjct: 180 WLSISGEDLASGDRLMFLPCGLKAGSSITVVGTPHHAHQEFVPQLAKLRLNDGLVNVSQF 239 Query: 1150 MVELQGLKSVVGEDPPKILHLNPRLRGDWSHLPVIEHNTCYRMQWGAGQRCDGLPSKGDD 1329 MVELQGLKSV GEDPPKILHLNPRL+GDWSH PVIEHNTCYRMQWG RCDGLPSK D+ Sbjct: 240 MVELQGLKSVDGEDPPKILHLNPRLKGDWSHHPVIEHNTCYRMQWGTAHRCDGLPSKDDE 299 Query: 1330 DMRVDGYLRCEKWSRNDNRD-KESRTTSWFQRFIGRAKKPEVTWPFPFAEGKMFVLTLRA 1506 DM VDG RCEKW R+D D KES+TTSWF+RFIGR +KPEVTWPFPFAEG++F+LTLRA Sbjct: 300 DMLVDGNRRCEKWIRDDVADSKESKTTSWFKRFIGREQKPEVTWPFPFAEGRLFILTLRA 359 Query: 1507 GLDGFHVISGGRHVTSFPYRTGFTLEDATGLAIKGDVDVHSVFATSLPTSHPSFSPQRVL 1686 G+DG+H+ GGRHVTSFPYRTGF+LEDATGLAIKGDVDV SV+ATSLPTSHPSFSPQRVL Sbjct: 360 GVDGYHINVGGRHVTSFPYRTGFSLEDATGLAIKGDVDVRSVYATSLPTSHPSFSPQRVL 419 Query: 1687 
EMSPKWKAQPLPSGPIPLFIGVLSATNHFAERMAVRKSWMXXXXXXXXXXXXRFFVALNP 1866 EMSPKWKA PLP + LFIGVLSATNHFAERMAVRK+WM RFFVALNP Sbjct: 420 EMSPKWKAFPLPKRSVKLFIGVLSATNHFAERMAVRKTWMQSSPIKSLDVVVRFFVALNP 479 Query: 1867 RKEVNAVLKQEAAYFGDTVILPFLDRYELVVLKTIAICEYGVQNASAAHIMKSDDDTFIR 2046 RKEVNAVLK+EAAYFGD VILPF+DRYELVVLKTI+ICE+GVQN SAA+IMK DDDTF+R Sbjct: 480 RKEVNAVLKEEAAYFGDIVILPFMDRYELVVLKTISICEFGVQNVSAAYIMKCDDDTFVR 539 Query: 2047 VETVLKELDKVAQTKSLYMGNLNLLHRPLRSGKWAVSXXXXXXXXXXXXANGPGYIISKD 2226 V+TVLKE+D ++ KSLYMGNLNLLHRPLR+GKWAV+ ANGPGY+IS D Sbjct: 540 VDTVLKEIDGISPKKSLYMGNLNLLHRPLRNGKWAVTYEEWPEEVYPPYANGPGYVISID 599 Query: 2227 IAKYIVSQHVKQNLKLFKMEDVSMGMWVEQFNSSRPVEYSHSWKFCQYGCVENYFTAHYQ 2406 IAK+I+SQH + L+LFKMEDVSMGMWVEQFNSS+ V+YSH+WKFCQYGC+ +Y+TAHYQ Sbjct: 600 IAKFIISQHGDRKLRLFKMEDVSMGMWVEQFNSSKTVQYSHNWKFCQYGCMVDYYTAHYQ 659 Query: 2407 SPRQMICLWDKLVRGQARCCNF 2472 SPRQMICLW+KL RG+A CCNF Sbjct: 660 SPRQMICLWEKLSRGRAHCCNF 681