BLASTX nr result
ID: Akebia23_contig00016989
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00016989 (2834 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

emb|CBI18781.3| unnamed protein product [Vitis vinifera]             978   0.0
emb|CAN70213.1| hypothetical protein VITISV_038741 [Vitis vinifera]  973   0.0
ref|XP_002272372.2| PREDICTED: uncharacterized protein LOC100258...  972   0.0
gb|EXB64628.1| putative galacturonosyltransferase 4 [Morus notab...  951   0.0
ref|XP_007026430.1| Galacturonosyltransferase 4 isoform 1 [Theob...  939   0.0
ref|XP_007026431.1| Galacturonosyltransferase 4 isoform 2 [Theob...  937   0.0
ref|XP_002525229.1| Glycosyltransferase QUASIMODO1, putative [Ri...  934   0.0
ref|XP_004293423.1| PREDICTED: probable galacturonosyltransferas...  923   0.0
ref|XP_007213869.1| hypothetical protein PRUPE_ppa018681mg [Prun...  916   0.0
ref|XP_006350232.1| PREDICTED: probable galacturonosyltransferas...  912   0.0
ref|XP_006857626.1| hypothetical protein AMTR_s00061p00126570 [A...  911   0.0
ref|XP_006467237.1| PREDICTED: probable galacturonosyltransferas...  910   0.0
ref|XP_006398488.1| hypothetical protein EUTSA_v10000819mg [Eutr...  909   0.0
ref|XP_006350235.1| PREDICTED: probable galacturonosyltransferas...  908   0.0
ref|XP_006449976.1| hypothetical protein CICLE_v10014426mg [Citr...  908   0.0
ref|XP_004236640.1| PREDICTED: probable galacturonosyltransferas...  906   0.0
gb|EYU33824.1| hypothetical protein MIMGU_mgv1a002625mg [Mimulus...  905   0.0
ref|XP_004499343.1| PREDICTED: probable galacturonosyltransferas...  898   0.0
ref|XP_006600275.1| PREDICTED: probable galacturonosyltransferas...
896 0.0 ref|XP_006584115.1| PREDICTED: probable galacturonosyltransferas... 892 0.0 >emb|CBI18781.3| unnamed protein product [Vitis vinifera] Length = 638 Score = 978 bits (2529), Expect = 0.0 Identities = 488/658 (74%), Positives = 548/658 (83%), Gaps = 11/658 (1%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLG-SF-ISLDSRNEFIEDVSTLSFGGEIRKLNVLP 2298 M+ RK VLFLL+VTVL+PIVLYTD LG SF S + +EF EDV+ L+ GG KLN+LP Sbjct: 1 MIKRKTVLFLLLVTVLSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNLLP 60 Query: 2297 QESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT-------- 2142 QESS TLKEPIGIVYSDN S ++++S +L L S EHKTRVLS T Sbjct: 61 QESSTTLKEPIGIVYSDND------SLDVDESAADLQLGGSVEHKTRVLSTTYEEGDRSQ 114 Query: 2141 -ENPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGK 1965 ENPI+QV DG + D LQ + + + GQQS GK Sbjct: 115 RENPIRQVTDG-----KDDNLQRGSELTSHNASQNSETEH----------GQQSAQTSGK 159 Query: 1964 TNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKE 1785 + EP K + +P + V+ DARV+ LKDQLIRAKV+L L ATRNN HFIRELR R+KE Sbjct: 160 GDHKEPVKTRNEKPIDQTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARMKE 219 Query: 1784 VQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVH 1605 VQRALGDATKDSELP+NAY+KLK MEQTLAKGKQIQDDCAAV+KKLRAILHS EEQLRVH Sbjct: 220 VQRALGDATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLRVH 279 Query: 1604 KKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDN 1425 KKQTM+LTQL AKTLPKGLHCLPLRLSTEYY+L+S++QQFPN++KLEDP L+HYALFSDN Sbjct: 280 KKQTMYLTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFSDN 339 Query: 1424 ILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWL 1245 ILAAAVVVNSTV++AK+P+ HVFHIV+DRLNYAAMRMWFLANPPGKATIQVQNI+EFTWL Sbjct: 340 ILAAAVVVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFTWL 399 Query: 1244 NSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKV 1065 NSSYSPVLKQLGS SMIDYYFK HR+NSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKV Sbjct: 400 NSSYSPVLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKV 459 Query: 1064 
LFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACG 885 LFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD ACG Sbjct: 460 LFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHACG 519 Query: 884 WAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHV 705 WAYGMN+FDL++WKKQ+ITEVYHTWQKLN RQLWKLGTLPPGLITFW RTFPIDRSWHV Sbjct: 520 WAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTFPIDRSWHV 579 Query: 704 LGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531 LGLGYNP+VN++EI+RAAVIHYNGN+KPWLEIG+PK++ YWAKF D+D YLRDCNIN Sbjct: 580 LGLGYNPSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNIN 637 >emb|CAN70213.1| hypothetical protein VITISV_038741 [Vitis vinifera] Length = 759 Score = 973 bits (2516), Expect = 0.0 Identities = 486/660 (73%), Positives = 546/660 (82%), Gaps = 11/660 (1%) Frame = -1 Query: 2477 KTMMVRKPVLFLLVVTVLAPIVLYTDRLG-SF-ISLDSRNEFIEDVSTLSFGGEIRKLNV 2304 K M+ RK VLFLL+VTV +PIVLYTD LG SF S + +EF EDV+ L+ GG KLN+ Sbjct: 120 KEMIKRKTVLFLLLVTVXSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNL 179 Query: 2303 LPQESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT------ 2142 LPQESS TLKEPIGIVYSDN S ++++S +L L S EHKTR LS T Sbjct: 180 LPQESSTTLKEPIGIVYSDND------SLDVDESAADLQLGGSVEHKTRXLSTTYEEGDR 233 Query: 2141 ---ENPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNAD 1971 ENPI+QV DG + D LQ + + + GQQS Sbjct: 234 SQRENPIRQVTDG-----KDDSLQRGSELTSHNASQNSETEH----------GQQSAQTS 278 Query: 1970 GKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRV 1791 GK + EP K + +P + V+ DARV+ LKDQLIRAKV+L L ATRNN HFIRELR R+ Sbjct: 279 GKGDHKEPVKTRNEKPIDQTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARM 338 Query: 1790 KEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLR 1611 KEVQRALGDATKDSELP+NAY+KLK MEQTLAKGKQIQDDCAAV+KKLRAILHS EEQLR Sbjct: 339 KEVQRALGDATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLR 398 Query: 1610 VHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFS 1431 VHKKQTM+LTQL AKTLPKGLHCLPLRLSTEYY+L+S++QQFPN++KLEDP L+HYALFS Sbjct: 399 
VHKKQTMYLTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFS 458 Query: 1430 DNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFT 1251 DNILAAAVVVNSTV++AK+P+ HVFHIV+DRLNYAAMRMWFLANPPGKATIQVQNI+EFT Sbjct: 459 DNILAAAVVVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFT 518 Query: 1250 WLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLN 1071 WLNSSYSPVLKQLGS SMIDYYFK HR+NSDSNLKFRNPKYLSILNHLRFYLPEIFPKLN Sbjct: 519 WLNSSYSPVLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLN 578 Query: 1070 KVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRA 891 KVLFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD A Sbjct: 579 KVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHA 638 Query: 890 CGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSW 711 CGWAYGMN+FDL++WKKQ+ITEVYHTWQKLN RQLWKLGTLPPGLITFW RT PIDRSW Sbjct: 639 CGWAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTXPIDRSW 698 Query: 710 HVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531 HVLGLGYNP+VN++EI+RAAVIHYNGN+KPWLEIG+PK++ YWAKF D+D YLRDCNIN Sbjct: 699 HVLGLGYNPSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNIN 758 >ref|XP_002272372.2| PREDICTED: uncharacterized protein LOC100258406 [Vitis vinifera] Length = 1286 Score = 972 bits (2513), Expect = 0.0 Identities = 484/652 (74%), Positives = 544/652 (83%), Gaps = 11/652 (1%) Frame = -1 Query: 2453 VLFLLVVTVLAPIVLYTDRLG-SF-ISLDSRNEFIEDVSTLSFGGEIRKLNVLPQESSNT 2280 +LFLL+VTVL+PIVLYTD LG SF S + +EF EDV+ L+ GG KLN+LPQESS T Sbjct: 655 LLFLLLVTVLSPIVLYTDTLGRSFKTSFSAADEFDEDVTALTLGGVDAKLNLLPQESSTT 714 Query: 2279 LKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT---------ENPIK 2127 LKEPIGIVYSDN S ++++S +L L S EHKTRVLS T ENPI+ Sbjct: 715 LKEPIGIVYSDND------SLDVDESAADLQLGGSVEHKTRVLSTTYEEGDRSQRENPIR 768 Query: 2126 QVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTNSDEP 1947 QV DG + D LQ + + + GQQS GK + EP Sbjct: 769 QVTDG-----KDDNLQRGSELTSHNASQNSETEH----------GQQSAQTSGKGDHKEP 813 Query: 1946 
PKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRALG 1767 K + +P + V+ DARV+ LKDQLIRAKV+L L ATRNN HFIRELR R+KEVQRALG Sbjct: 814 VKTRNEKPIDQTVILDARVQQLKDQLIRAKVFLSLSATRNNAHFIRELRARMKEVQRALG 873 Query: 1766 DATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQTMF 1587 DATKDSELP+NAY+KLK MEQTLAKGKQIQDDCAAV+KKLRAILHS EEQLRVHKKQTM+ Sbjct: 874 DATKDSELPKNAYEKLKGMEQTLAKGKQIQDDCAAVVKKLRAILHSAEEQLRVHKKQTMY 933 Query: 1586 LTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAAAV 1407 LTQL AKTLPKGLHCLPLRLSTEYY+L+S++QQFPN++KLEDP L+HYALFSDNILAAAV Sbjct: 934 LTQLTAKTLPKGLHCLPLRLSTEYYNLDSAQQQFPNQDKLEDPRLFHYALFSDNILAAAV 993 Query: 1406 VVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSYSP 1227 VVNSTV++AK+P+ HVFHIV+DRLNYAAMRMWFLANPPGKATIQVQNI+EFTWLNSSYSP Sbjct: 994 VVNSTVSNAKDPSKHVFHIVSDRLNYAAMRMWFLANPPGKATIQVQNIDEFTWLNSSYSP 1053 Query: 1226 VLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDD 1047 VLKQLGS SMIDYYFK HR+NSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDD Sbjct: 1054 VLKQLGSPSMIDYYFKGHRSNSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDD 1113 Query: 1046 IVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMN 867 IVVQ+DLTGLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFD ACGWAYGMN Sbjct: 1114 IVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDSHACGWAYGMN 1173 Query: 866 VFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLGLGYN 687 +FDL++WKKQ+ITEVYHTWQKLN RQLWKLGTLPPGLITFW RTFPIDRSWHVLGLGYN Sbjct: 1174 IFDLDQWKKQHITEVYHTWQKLNHDRQLWKLGTLPPGLITFWKRTFPIDRSWHVLGLGYN 1233 Query: 686 PTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531 P+VN++EI+RAAVIHYNGN+KPWLEIG+PK++ YWAKF D+D YLRDCNIN Sbjct: 1234 PSVNRREIERAAVIHYNGNLKPWLEIGMPKFRNYWAKFADFDNEYLRDCNIN 1285 >gb|EXB64628.1| putative galacturonosyltransferase 4 [Morus notabilis] Length = 657 Score = 951 bits (2458), Expect = 0.0 Identities = 475/671 (70%), Positives = 541/671 (80%), Gaps = 24/671 (3%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLD-SRNEFIEDVSTLSFGGEIRKLNVLPQ 2295 MMVR V+ +L VTV+APIVLYTDRLG+F S S 
NEF+EDV+T+ Sbjct: 1 MMVRNVVIGMLFVTVIAPIVLYTDRLGTFQSYSASTNEFVEDVTTV-------------- 46 Query: 2294 ESSNTLKEPIGIVYSDNSRNSTL--------FSDEIEDSVEELPLAESTEH-KTRVLSAT 2142 E S +KEPIGIVYSDNS S S + +S ++ L +S EH RVLS T Sbjct: 47 EPSTKIKEPIGIVYSDNSNQSLPNSGDAVKESSTDTSNSEQDWQLGDSMEHVSARVLSTT 106 Query: 2141 --------ENPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAG-- 1992 EN I++V D +EG++ + L I GE KGE +G Sbjct: 107 NDENNSRKENAIREVTDRDQEGDQ-ETLDIVDGEGETKGEAIDAEVKEIQQKVDDGSGDT 165 Query: 1991 ----QQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNN 1824 +Q+ + + EP K + + N V+PDARVRHLKDQL+RA+VYL L ATRNN Sbjct: 166 EVKPEQTTETSSRVDKREPRKTRPEKQNDRTVIPDARVRHLKDQLVRARVYLSLPATRNN 225 Query: 1823 PHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLR 1644 PHF RELR+R+KEVQRALGDA+KDSELPRNAYD+LKAMEQ+LAKGKQIQDDCAA +KKLR Sbjct: 226 PHFTRELRVRMKEVQRALGDASKDSELPRNAYDRLKAMEQSLAKGKQIQDDCAAAVKKLR 285 Query: 1643 AILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLE 1464 A+LHSTEEQLRVHKKQT+FLTQL AKTLPKGLHCLPLRL+TEYYSLN SEQ FPNE+KLE Sbjct: 286 AMLHSTEEQLRVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNYSEQHFPNEDKLE 345 Query: 1463 DPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKA 1284 DP LYHYALFSDN+LAAAVVVNST+ HAK+P+ HVFHIVTDRLNYAAMRMWFL NPPGKA Sbjct: 346 DPQLYHYALFSDNVLAAAVVVNSTITHAKDPSKHVFHIVTDRLNYAAMRMWFLVNPPGKA 405 Query: 1283 TIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLR 1104 T+QVQNIEEFTWLNSSYSPVLKQLGSQSMI+YYF+ HRA+SDSNLKFRNPKYLSILNHLR Sbjct: 406 TVQVQNIEEFTWLNSSYSPVLKQLGSQSMINYYFRTHRASSDSNLKFRNPKYLSILNHLR 465 Query: 1103 FYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSN 924 FYLP+IFPKL+KVLF+DDDIVVQ+DLT LWSL+LKG VNGAVETCGESFHRFDRYLNFSN Sbjct: 466 FYLPQIFPKLDKVLFVDDDIVVQKDLTALWSLDLKGNVNGAVETCGESFHRFDRYLNFSN 525 Query: 923 PLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITF 744 PLISKNFDPRACGWAYGMN+FDL+EWK+Q IT+VYH+WQKLN RQLWKLGTLPPGLITF Sbjct: 526 PLISKNFDPRACGWAYGMNIFDLKEWKRQQITDVYHSWQKLNHDRQLWKLGTLPPGLITF 585 Query: 743 
WNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDY 564 W RT+P+DRSWHVLGLGYNP V QK+I+RAAVIHYNGNMKPWLEIGIPKY+ YWAK+VDY Sbjct: 586 WKRTYPLDRSWHVLGLGYNPNVGQKDIERAAVIHYNGNMKPWLEIGIPKYRNYWAKYVDY 645 Query: 563 DQVYLRDCNIN 531 DQ+YLR+CN+N Sbjct: 646 DQLYLRECNLN 656 >ref|XP_007026430.1| Galacturonosyltransferase 4 isoform 1 [Theobroma cacao] gi|508781796|gb|EOY29052.1| Galacturonosyltransferase 4 isoform 1 [Theobroma cacao] Length = 626 Score = 939 bits (2426), Expect = 0.0 Identities = 473/656 (72%), Positives = 533/656 (81%), Gaps = 9/656 (1%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292 M VR VL LL VTV+API LYTDR+ +F S +F++DV+T + G+ R+LNVLPQE Sbjct: 1 MKVRHLVLGLLSVTVIAPIFLYTDRVATFNPSSSGRDFLDDVATFTLLGDTRRLNVLPQE 60 Query: 2291 SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHK-TRVLSATE-------- 2139 +S +KEP GIVYSD+S NS + E+ EHK TRVLSAT+ Sbjct: 61 TSTAIKEPAGIVYSDHSNNSFR------------KVTETREHKSTRVLSATDEERQPQLH 108 Query: 2138 NPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTN 1959 NPI+QV D N + L + + G + Q + N D K + Sbjct: 109 NPIRQVTDPA-PANLTTPLDSHPNASHHLGTKLEQQPT-----------QLAGNIDQKEH 156 Query: 1958 SDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQ 1779 SD +++A P DA+VRHLKDQLIRAKVYL L A ++N H RELRLR+KEV Sbjct: 157 SDNKT-SRLAEP------VDAQVRHLKDQLIRAKVYLSLPAIKSNQHVTRELRLRIKEVS 209 Query: 1778 RALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKK 1599 RALGDATKDS+LP+NA+DKLKAMEQ+L KGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKK Sbjct: 210 RALGDATKDSDLPKNAFDKLKAMEQSLEKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKK 269 Query: 1598 QTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNIL 1419 QTMFLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+Q F N+EKLEDP LYHYALFSDN+L Sbjct: 270 QTMFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQNFLNQEKLEDPRLYHYALFSDNVL 329 Query: 1418 AAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNS 1239 AAAVVVNSTV+HAK P+NHVFHIVTDRLNYAAMRMWFL NPPGKATIQVQNIEEFTWLNS Sbjct: 330 AAAVVVNSTVSHAKHPSNHVFHIVTDRLNYAAMRMWFLNNPPGKATIQVQNIEEFTWLNS 389 
Query: 1238 SYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 1059 SYSPVLKQLGS SMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF Sbjct: 390 SYSPVLKQLGSPSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 449 Query: 1058 LDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWA 879 LDDDIVV++D++GLWSL+LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFDP ACGWA Sbjct: 450 LDDDIVVRKDISGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPHACGWA 509 Query: 878 YGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLG 699 YGMN+FDLEEW++QNITEVYH WQKLN RQLWKLGTLPPGLITFW RT+P+DRSWHVLG Sbjct: 510 YGMNIFDLEEWRRQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKRTYPLDRSWHVLG 569 Query: 698 LGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531 LGYNP VNQ+E++RAAVIHYNGN+KPWLEIGIPKYK YWAK+VDYD +YLRDCNIN Sbjct: 570 LGYNPNVNQREVERAAVIHYNGNLKPWLEIGIPKYKNYWAKYVDYDNMYLRDCNIN 625 >ref|XP_007026431.1| Galacturonosyltransferase 4 isoform 2 [Theobroma cacao] gi|508781797|gb|EOY29053.1| Galacturonosyltransferase 4 isoform 2 [Theobroma cacao] Length = 624 Score = 937 bits (2423), Expect = 0.0 Identities = 473/656 (72%), Positives = 532/656 (81%), Gaps = 9/656 (1%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292 M VR VL LL VTV+API LYTDR+ +F S +F++DV+T + G+ R+LNVLPQE Sbjct: 1 MKVRHLVLGLLSVTVIAPIFLYTDRVATFNPSSSGRDFLDDVATFTLLGDTRRLNVLPQE 60 Query: 2291 SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHK-TRVLSATE-------- 2139 +S +KEP GIVYSD+S NS E+ EHK TRVLSAT+ Sbjct: 61 TSTAIKEPAGIVYSDHSNNSFR--------------KETREHKSTRVLSATDEERQPQLH 106 Query: 2138 NPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTN 1959 NPI+QV D N + L + + G + Q + N D K + Sbjct: 107 NPIRQVTDPA-PANLTTPLDSHPNASHHLGTKLEQQPT-----------QLAGNIDQKEH 154 Query: 1958 SDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQ 1779 SD +++A P DA+VRHLKDQLIRAKVYL L A ++N H RELRLR+KEV Sbjct: 155 SDNKT-SRLAEP------VDAQVRHLKDQLIRAKVYLSLPAIKSNQHVTRELRLRIKEVS 207 Query: 1778 RALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKK 
1599 RALGDATKDS+LP+NA+DKLKAMEQ+L KGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKK Sbjct: 208 RALGDATKDSDLPKNAFDKLKAMEQSLEKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKK 267 Query: 1598 QTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNIL 1419 QTMFLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+Q F N+EKLEDP LYHYALFSDN+L Sbjct: 268 QTMFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQNFLNQEKLEDPRLYHYALFSDNVL 327 Query: 1418 AAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNS 1239 AAAVVVNSTV+HAK P+NHVFHIVTDRLNYAAMRMWFL NPPGKATIQVQNIEEFTWLNS Sbjct: 328 AAAVVVNSTVSHAKHPSNHVFHIVTDRLNYAAMRMWFLNNPPGKATIQVQNIEEFTWLNS 387 Query: 1238 SYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 1059 SYSPVLKQLGS SMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF Sbjct: 388 SYSPVLKQLGSPSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLF 447 Query: 1058 LDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWA 879 LDDDIVV++D++GLWSL+LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFDP ACGWA Sbjct: 448 LDDDIVVRKDISGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPHACGWA 507 Query: 878 YGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLG 699 YGMN+FDLEEW++QNITEVYH WQKLN RQLWKLGTLPPGLITFW RT+P+DRSWHVLG Sbjct: 508 YGMNIFDLEEWRRQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKRTYPLDRSWHVLG 567 Query: 698 LGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531 LGYNP VNQ+E++RAAVIHYNGN+KPWLEIGIPKYK YWAK+VDYD +YLRDCNIN Sbjct: 568 LGYNPNVNQREVERAAVIHYNGNLKPWLEIGIPKYKNYWAKYVDYDNMYLRDCNIN 623 >ref|XP_002525229.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis] gi|223535526|gb|EEF37195.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis] Length = 647 Score = 934 bits (2415), Expect = 0.0 Identities = 462/650 (71%), Positives = 529/650 (81%), Gaps = 3/650 (0%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTD-RLGSFISLDSRNEFIEDVSTLSFGGEIRK-LNVLP 2298 M +R V+ +L+VTV+API+LYTD R +F S S EF+EDV++L+ G+ R LNVLP Sbjct: 3 MKLRNLVVGMLLVTVIAPIILYTDNRFSTFNSSSSTTEFLEDVASLTLSGDSRDHLNVLP 62 Query: 2297 QESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHK-TRVLSATENPIKQV 
2121 QES++ LKEPIGIVY+DNS S + I+ LP ++ EHK TRVLSAT + + Sbjct: 63 QESTSLLKEPIGIVYTDNSTISPPHTSTIQFHSSPLP-QDTREHKSTRVLSATNDQHQSQ 121 Query: 2120 NDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTNSDEPPK 1941 D + + K ++ + QQS K PPK Sbjct: 122 TDTIIRQVTNQQASRTTDANNKNSKQNPSDGGSQNAVV-----QQSSLTSEKVTEKGPPK 176 Query: 1940 NKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRALGDA 1761 ++ + + +PDARVR L+DQLIRAKVYL L +T+NNPHF RELRLR+KEVQR LGDA Sbjct: 177 SRTDKQTAQTPVPDARVRQLRDQLIRAKVYLSLPSTKNNPHFTRELRLRIKEVQRVLGDA 236 Query: 1760 TKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQTMFLT 1581 TKDS+LP+NA DKLKAM+Q+LAKGKQ+QDDCA+V+KKLRA+LHS+EEQLRVHKKQTMFLT Sbjct: 237 TKDSDLPKNANDKLKAMDQSLAKGKQVQDDCASVVKKLRAMLHSSEEQLRVHKKQTMFLT 296 Query: 1580 QLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAAAVVV 1401 QL AKTLPKGLHC PLRL+ EYYSLNSS+QQFPN+EKLEDP LYHYALFSDN+LAAAVVV Sbjct: 297 QLTAKTLPKGLHCFPLRLTNEYYSLNSSQQQFPNQEKLEDPQLYHYALFSDNVLAAAVVV 356 Query: 1400 NSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSYSPVL 1221 NST+ HAK+P+ HVFHIVTDRLNYAAMRMWFL NPPG+ATIQVQNIEE TWLNSSYSPVL Sbjct: 357 NSTITHAKDPSKHVFHIVTDRLNYAAMRMWFLVNPPGQATIQVQNIEELTWLNSSYSPVL 416 Query: 1220 KQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIV 1041 KQLGSQSMIDYYF+ HRANSDSNLK+RNPKYLSILNHLRFYLPEIFP LNKVLFLDDDIV Sbjct: 417 KQLGSQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIFPMLNKVLFLDDDIV 476 Query: 1040 VQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMNVF 861 VQ+DLTGLWSL+LKG VNGAVETCGE FHRFDRYLNFSNPLISKNFDP ACGWAYGMNVF Sbjct: 477 VQKDLTGLWSLDLKGNVNGAVETCGERFHRFDRYLNFSNPLISKNFDPHACGWAYGMNVF 536 Query: 860 DLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLGLGYNPT 681 DL++WK+QNIT VYHTWQKLN R LWKLGTLPPGLITFW +T+ IDRSWHVLGLGYNP Sbjct: 537 DLDQWKRQNITGVYHTWQKLNHDRLLWKLGTLPPGLITFWKQTYSIDRSWHVLGLGYNPN 596 Query: 680 VNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531 VNQ+EI+RAAVIHYNGN+KPWLEIGI KY+ YWAK+VDYD VYLR+CNIN Sbjct: 597 VNQREIERAAVIHYNGNLKPWLEIGISKYRNYWAKYVDYDHVYLRECNIN 646 
>ref|XP_004293423.1| PREDICTED: probable galacturonosyltransferase 4-like [Fragaria vesca subsp. vesca] Length = 654 Score = 923 bits (2385), Expect = 0.0 Identities = 464/663 (69%), Positives = 524/663 (79%), Gaps = 16/663 (2%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGS-----------FISLDSRNEFIEDVSTLSFGG 2325 MMVR V+ LL VTV+API+LYTDRLGS FIS +++EF+EDV+ F Sbjct: 1 MMVRNVVMILLFVTVIAPIILYTDRLGSIHTSSSSSSFPFISA-AQDEFVEDVTAFPFNA 59 Query: 2324 EIR-KLNVLPQESSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAE----STEHKT 2160 +LN+LPQE S TLKEPIG+VYSDNS S + E + S ST Sbjct: 60 HSGGRLNLLPQELS-TLKEPIGVVYSDNSTESFPETKESQASTNHSHQVSARVLSTTTNE 118 Query: 2159 RVLSATENPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSV 1980 + LS +NPI QV + +GN+ + + G + Q+S Sbjct: 119 QDLSQKDNPIIQVTQTLDQGNQL--------LAAESGAKTATSEKKTDNASQNTLNQKST 170 Query: 1979 NADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELR 1800 K + E K + E + D RVRHLKDQLIRA+VYL L A RNNP F RE+R Sbjct: 171 QTSIKVDQRESVKTVSVKNIHETTITDGRVRHLKDQLIRARVYLSLPAARNNPQFAREIR 230 Query: 1799 LRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEE 1620 LR+KEVQRAL DA+KDS+LPRNA D+LKAMEQTLAKGKQIQDDCAA++KKLRA+LHS +E Sbjct: 231 LRIKEVQRALVDASKDSDLPRNANDRLKAMEQTLAKGKQIQDDCAAMVKKLRAMLHSMDE 290 Query: 1619 QLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYA 1440 QLRVHKKQTMFLTQL AKT+PKGLHCLPLRL+TEYYSLNSS+ FPN+E+LEDP +YHYA Sbjct: 291 QLRVHKKQTMFLTQLTAKTVPKGLHCLPLRLTTEYYSLNSSQMNFPNQERLEDPLMYHYA 350 Query: 1439 LFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIE 1260 +FSDN+LA AVVVNSTV HAK+PA HVFHIVTDRLNYAAMRMWFL NPPG+ATIQVQNIE Sbjct: 351 IFSDNVLATAVVVNSTVTHAKDPAKHVFHIVTDRLNYAAMRMWFLVNPPGQATIQVQNIE 410 Query: 1259 EFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFP 1080 EFTWLNSSYSPVLKQLGS SMIDYYF+ HR++SDSNLKFRNPKYLSILNHLRFYLPEIFP Sbjct: 411 EFTWLNSSYSPVLKQLGSASMIDYYFRTHRSSSDSNLKFRNPKYLSILNHLRFYLPEIFP 470 Query: 1079 KLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFD 900 KLNKVLFLDDDIVV++DLTGLWSL+LKG 
VNGAVETCGESFHRFDRYLNFSNPLISKNFD Sbjct: 471 KLNKVLFLDDDIVVRKDLTGLWSLDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFD 530 Query: 899 PRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPID 720 P ACGWAYGMNVFDLE+WKKQNITEVYH WQKLN RQLWKLGTLPPGLITFW T+P+D Sbjct: 531 PHACGWAYGMNVFDLEQWKKQNITEVYHRWQKLNHDRQLWKLGTLPPGLITFWKHTYPLD 590 Query: 719 RSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDC 540 RSWHVLGLGYNP+V+QKEIDRAAVIHYNGNMKPWLEIGIPKY+ YWAK+VDYD Y+R+C Sbjct: 591 RSWHVLGLGYNPSVSQKEIDRAAVIHYNGNMKPWLEIGIPKYRSYWAKYVDYDHKYMREC 650 Query: 539 NIN 531 NIN Sbjct: 651 NIN 653 >ref|XP_007213869.1| hypothetical protein PRUPE_ppa018681mg [Prunus persica] gi|462409734|gb|EMJ15068.1| hypothetical protein PRUPE_ppa018681mg [Prunus persica] Length = 659 Score = 916 bits (2367), Expect = 0.0 Identities = 465/679 (68%), Positives = 522/679 (76%), Gaps = 32/679 (4%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292 MMVR V+ +L VTV+API+LYTDRLGSF VS+ S +LN+LPQE Sbjct: 1 MMVRNVVMVMLFVTVIAPIILYTDRLGSF-----------QVSSSSC-----RLNLLPQE 44 Query: 2291 SSNTLKEPIGIVYSDNSRNSTL----FSDEIEDSVEELPLAESTEH-KTRVLSAT----- 2142 SS TLKEP+G+VYSDNS NS S S ++ P +S EH RVLS T Sbjct: 45 SSTTLKEPVGVVYSDNSTNSYPETRGSSAHPNHSHKDGPSVDSMEHVSARVLSTTNDQNL 104 Query: 2141 ---ENPIKQVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNAD 1971 +NPI+QV + +GN Q + K G K +QS Sbjct: 105 SQTDNPIRQVTQTLEQGN-----QFMSDLHAKGGGASEQSIDNASQTTEIKNERQSTQTS 159 Query: 1970 GKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRV 1791 + + +P K + N E +PD RVRHLKDQLIRAKVYL L ATRNNPHF RELRLR+ Sbjct: 160 SRVDQRKPKKTMTEKQNDETAVPDVRVRHLKDQLIRAKVYLSLPATRNNPHFTRELRLRI 219 Query: 1790 KEVQRALGDATK-------------------DSELPRNAYDKLKAMEQTLAKGKQIQDDC 1668 KEV++ G + + +AYDKLKAMEQTL KGKQIQDDC Sbjct: 220 KEVKKHFGRQPRILTCQGIFTPSDQVLGSGPSIHVVCDAYDKLKAMEQTLTKGKQIQDDC 279 Query: 1667 AAVIKKLRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQ 1488 AA++KKLRA+LHS EEQLRVH+KQTMFLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+Q Sbjct: 280 
AAMVKKLRAMLHSMEEQLRVHRKQTMFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQQV 339 Query: 1487 FPNEEKLEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWF 1308 FPN+EKLEDP LYHYALFSDN+LAAAVVVNST+ HAK+PANHVFHIVTDRLNYAAMRMWF Sbjct: 340 FPNQEKLEDPLLYHYALFSDNVLAAAVVVNSTITHAKDPANHVFHIVTDRLNYAAMRMWF 399 Query: 1307 LANPPGKATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKY 1128 L N PGKATIQVQNIEEFTWLNSSYSPVLKQLGS SMI+YYF+ HRANSDSNLKFRNPKY Sbjct: 400 LVNSPGKATIQVQNIEEFTWLNSSYSPVLKQLGSASMINYYFRTHRANSDSNLKFRNPKY 459 Query: 1127 LSILNHLRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRF 948 LSILNHLRFYLPE+FPKLNKVLFLDDD+VVQ+DLTGLW+L+LKG VNGAVETCGESFHRF Sbjct: 460 LSILNHLRFYLPEVFPKLNKVLFLDDDVVVQKDLTGLWALDLKGNVNGAVETCGESFHRF 519 Query: 947 DRYLNFSNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGT 768 DRYLNFSNPLISKNFD RACGWAYGMN+FDLEEWKKQNITEVYH WQ+LN RQLWKLGT Sbjct: 520 DRYLNFSNPLISKNFDARACGWAYGMNIFDLEEWKKQNITEVYHRWQELNHDRQLWKLGT 579 Query: 767 LPPGLITFWNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKG 588 LPPGLITFW RT+P+DRSWHVLGLGYNP+VNQKEIDRAAVIHYNGNMKPWLEIGIPKY+ Sbjct: 580 LPPGLITFWKRTYPLDRSWHVLGLGYNPSVNQKEIDRAAVIHYNGNMKPWLEIGIPKYRN 639 Query: 587 YWAKFVDYDQVYLRDCNIN 531 YW K+VDYD +Y+R+CNIN Sbjct: 640 YWVKYVDYDHMYMRECNIN 658 >ref|XP_006350232.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1 [Solanum tuberosum] gi|565367133|ref|XP_006350233.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X2 [Solanum tuberosum] gi|565367135|ref|XP_006350234.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X3 [Solanum tuberosum] Length = 680 Score = 912 bits (2358), Expect = 0.0 Identities = 457/675 (67%), Positives = 534/675 (79%), Gaps = 27/675 (4%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISL--DSRNEFIEDVSTLSFGGEIRKLNVLP 2298 M +RKPVLFLL+VTV APIVLYTD LG++ + SR EFIED+ST +FGG++R LNVLP Sbjct: 3 MKLRKPVLFLLLVTVFAPIVLYTDTLGTYFTSPSSSRTEFIEDLSTFTFGGDVRPLNVLP 62 Query: 2297 QESSNTLKEPIGIVYSDNSRNST-----LFSDEIEDSVEELPLAESTEHKTRVLSATE-- 2139 QESS +LKEP G 
VYS+NS +S S E +L AES +H+T S+ + Sbjct: 63 QESSTSLKEPRGDVYSENSSHSLSNASDTLSSEDARKTRQLTEAESMKHQTATGSSNDGV 122 Query: 2138 ------NPIKQVNDGVREGNESDGLQIN------------KSIGEKKGEERTNXXXXXXX 2013 + I QV + E ++D ++I +KK + Sbjct: 123 EVAMNGSHISQVTANLHEPQQTDKTSPKLVSAGKNESIAMETISKKKTSPTDSNQTLDST 182 Query: 2012 XXXXKAGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGAT 1833 + Q++V GK S E + K N +IV PDARVR LKDQLIRAKVYL L AT Sbjct: 183 KTETRHDQRTVQTSGKFVSGETARGKDEERNVQIVPPDARVRQLKDQLIRAKVYLSLSAT 242 Query: 1832 RNNPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIK 1653 R+NPHFIRELRLR+KEV RALG+ATKDS+L R+A +KLKAMEQTLAKGKQIQDDCA ++K Sbjct: 243 RSNPHFIRELRLRIKEVLRALGEATKDSDLSRSANEKLKAMEQTLAKGKQIQDDCATIVK 302 Query: 1652 KLRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEE 1473 KLRA+LHS EEQLRVHKKQT++LT L AKTLPKGLHCLPLRLSTEY+ LNSS+Q FP++E Sbjct: 303 KLRAMLHSAEEQLRVHKKQTLYLTHLTAKTLPKGLHCLPLRLSTEYFKLNSSQQHFPHQE 362 Query: 1472 KLEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPP 1293 LE+P LYHYALFSDNILAAAVVVNSTV+HAK+P+ HVFHIVTDRLN+AAMRMWFLANPP Sbjct: 363 NLENPKLYHYALFSDNILAAAVVVNSTVSHAKDPSKHVFHIVTDRLNFAAMRMWFLANPP 422 Query: 1292 GKATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILN 1113 AT+ VQN+EEFTWLNSSYSPVLKQL SQSMIDYYF++ RA+SD N+KFRNPKYLSI+N Sbjct: 423 KYATVDVQNVEEFTWLNSSYSPVLKQLNSQSMIDYYFRS-RADSDPNVKFRNPKYLSIMN 481 Query: 1112 HLRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLN 933 HLRFYLPEIFPKL+KVLFLDDDIVVQ+DL GLWSL+LKGKV G VETCGESFHRFDRYLN Sbjct: 482 HLRFYLPEIFPKLDKVLFLDDDIVVQKDLGGLWSLDLKGKVIGVVETCGESFHRFDRYLN 541 Query: 932 FSNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGL 753 FSNPLISKNFDPRACGWA+GMN+ DL +W++QNITEVYH+WQ N RQLWKLGTLPPGL Sbjct: 542 FSNPLISKNFDPRACGWAFGMNIIDLNQWRRQNITEVYHSWQNRNHERQLWKLGTLPPGL 601 Query: 752 ITFWNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKF 573 ITFW RT+ +DRSWHVLGLGYNP V+QK+I RAAVIHYNGN+KPWLEI IPK++ YW+KF Sbjct: 602 ITFWKRTYALDRSWHVLGLGYNPNVSQKDIQRAAVIHYNGNLKPWLEISIPKFRDYWSKF 661 Query: 572 
VDYDQVYLRDCNINR 528 VDYDQ +LR+CNIN+ Sbjct: 662 VDYDQAFLRECNINK 676 >ref|XP_006857626.1| hypothetical protein AMTR_s00061p00126570 [Amborella trichopoda] gi|548861722|gb|ERN19093.1| hypothetical protein AMTR_s00061p00126570 [Amborella trichopoda] Length = 672 Score = 911 bits (2354), Expect = 0.0 Identities = 453/671 (67%), Positives = 535/671 (79%), Gaps = 24/671 (3%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292 M R PVL LL +VLAPIVLYTDRLGSF S ++ F E+ S +++G +I KL VLPQE Sbjct: 1 MKFRMPVLLLLCFSVLAPIVLYTDRLGSFSSSIAKAGFSEEFSPINYGRDINKLKVLPQE 60 Query: 2291 SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSATENPIKQVNDG 2112 S N LKEP G+VY + S S + E + + +S V + E I +V+ Sbjct: 61 SVNALKEPSGVVYLSDKDPSEAISVKEEPKMARSRVLQSNVKPLEVETHIEQVIDKVHRE 120 Query: 2111 VREGNESDG--------------LQINKS-IGEKK----GEERTNXXXXXXXXXXXKAGQ 1989 + G E G LQ N+ IG K+ G + + A + Sbjct: 121 EKNGQEIAGDSQAETIEESQQVLLQSNEQKIGAKREEQFGHQDASIKEEIGLSSRTDAEK 180 Query: 1988 QSVNA----DGKTNSDEPPKNKMARPN-GEIVMPDARVRHLKDQLIRAKVYLGLGATRNN 1824 Q + GK++ D P + R N + MPDARV HL+DQLI+AKVYL LG TR+N Sbjct: 181 QEPDKPEIESGKSDPDGPSQPSPERQNDNKKPMPDARVHHLRDQLIKAKVYLSLGTTRSN 240 Query: 1823 PHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLR 1644 PHFI+ELR+R++EVQRALGDATKDSELPR AYDKLKAME+TLAKGKQIQDDCAAVIKKLR Sbjct: 241 PHFIKELRVRIREVQRALGDATKDSELPRGAYDKLKAMEETLAKGKQIQDDCAAVIKKLR 300 Query: 1643 AILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLE 1464 AILHSTEEQLRVHKKQ+MFL QL+AKTLPKGLHCLPLRL+TEYYSLNS++QQFPN+EKLE Sbjct: 301 AILHSTEEQLRVHKKQSMFLMQLSAKTLPKGLHCLPLRLTTEYYSLNSTQQQFPNQEKLE 360 Query: 1463 DPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKA 1284 +P++YHYALFSDN+LAAAVVVNSTV++A++P NHVFHIVTDRLNYAAMRMWF+ANPPGKA Sbjct: 361 NPNIYHYALFSDNVLAAAVVVNSTVSNARDPRNHVFHIVTDRLNYAAMRMWFIANPPGKA 420 Query: 1283 TIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLR 1104 TIQVQ++EEFTWLNSSYSPVLKQLGS SMIDYYF+ HRAN DSNLK+RNPKYLSILNHLR Sbjct: 421 
TIQVQSVEEFTWLNSSYSPVLKQLGSTSMIDYYFRTHRANPDSNLKYRNPKYLSILNHLR 480 Query: 1103 FYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSN 924 FY+PEIFPKL+KVLFLDDDIVVQRDLT LW ++LKGK+NGAVETC ESFHRFDRYLNFSN Sbjct: 481 FYMPEIFPKLHKVLFLDDDIVVQRDLTQLWKIDLKGKINGAVETCRESFHRFDRYLNFSN 540 Query: 923 PLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITF 744 PLISKNF+ ACGWA+GMN+FDL+EWKKQ ITE+YH+WQKLN RQLWKLGTLPPGLITF Sbjct: 541 PLISKNFEAHACGWAFGMNIFDLKEWKKQEITEIYHSWQKLNNDRQLWKLGTLPPGLITF 600 Query: 743 WNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDY 564 +NRTFP++R WHVLGLGY+P+VNQ++I RAA IHYNGN+KPWLEIG+PK++GYW K+++Y Sbjct: 601 YNRTFPLNRGWHVLGLGYDPSVNQRDIQRAAAIHYNGNLKPWLEIGLPKFRGYWQKYINY 660 Query: 563 DQVYLRDCNIN 531 +Q YL+DCNIN Sbjct: 661 NQPYLQDCNIN 671 >ref|XP_006467237.1| PREDICTED: probable galacturonosyltransferase 4-like [Citrus sinensis] Length = 646 Score = 910 bits (2351), Expect = 0.0 Identities = 453/654 (69%), Positives = 526/654 (80%), Gaps = 7/654 (1%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRL-GSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQ 2295 M R V+ +L TVLAPI+++T S+ S EF+ED++ + GG+ R LN+LPQ Sbjct: 1 MKTRNLVVGMLCATVLAPILIFTSTFKDSYPSSSESGEFLEDLTAFTVGGDARHLNLLPQ 60 Query: 2294 ESSNTL--KEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKT-RVLSATENPIKQ 2124 ESS TL K+PI +V SD + S S EHK+ RVLSAT N + Q Sbjct: 61 ESSTTLSLKQPI-LVISDKIAQHSAHSQSQSQG--------SWEHKSARVLSATTNGLDQ 111 Query: 2123 --VNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNAD-GKTNSD 1953 ++ +R+ + QINK +++ + N QQ + G Sbjct: 112 SKTDNPIRQVTDLTKTQINKHADQEQIKASDNHISAHHSQILDTKHQQESSLTYGVLEKK 171 Query: 1952 EPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRA 1773 EP K + + PD RVR LKDQLI+AKVYL L A RNN +F+RELRLR+KEVQRA Sbjct: 172 EPTKINNEKQTEQTTPPDFRVRQLKDQLIKAKVYLSLPAMRNNANFVRELRLRIKEVQRA 231 Query: 1772 LGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQT 1593 LGDATKDS+LPR A D+LKAMEQ+LAKGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKKQT Sbjct: 232 LGDATKDSDLPRIANDRLKAMEQSLAKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKKQT 291 
Query: 1592 MFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAA 1413 +FLTQL AKTLPKGLHCLPLRL+TEYY+LNSS++ FPN+EKLEDP L+HYALFSDN+LAA Sbjct: 292 LFLTQLTAKTLPKGLHCLPLRLTTEYYTLNSSQRHFPNQEKLEDPRLFHYALFSDNVLAA 351 Query: 1412 AVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSY 1233 AVVVNSTV HAK P+NHVFHIVTDRLNYAAMRMWFLANPPG+AT+QVQNIEEFTWLNSSY Sbjct: 352 AVVVNSTVTHAKHPSNHVFHIVTDRLNYAAMRMWFLANPPGRATVQVQNIEEFTWLNSSY 411 Query: 1232 SPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLD 1053 SPVLKQL SQSMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPE+FP+LNKVLFLD Sbjct: 412 SPVLKQLNSQSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEVFPRLNKVLFLD 471 Query: 1052 DDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYG 873 DD+VVQ+DL+GLWS++LKGKVNGAVETCGE+FHRFDRYLNFSNPLISKNFDPRACGWAYG Sbjct: 472 DDVVVQKDLSGLWSIDLKGKVNGAVETCGETFHRFDRYLNFSNPLISKNFDPRACGWAYG 531 Query: 872 MNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLGLG 693 MN+FDL+EW++QNIT+VYHTWQK+N RQLWKLGTLPPGLITFW RT+P+DR WHVLGLG Sbjct: 532 MNIFDLDEWRRQNITDVYHTWQKMNHDRQLWKLGTLPPGLITFWKRTYPLDRFWHVLGLG 591 Query: 692 YNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531 YNP+VNQ++I+RAAVIHYNGNMKPWLEI IPKY+ YW K VDYDQ+YLR+CNIN Sbjct: 592 YNPSVNQRDIERAAVIHYNGNMKPWLEINIPKYRNYWTKHVDYDQLYLRECNIN 645 >ref|XP_006398488.1| hypothetical protein EUTSA_v10000819mg [Eutrema salsugineum] gi|557099577|gb|ESQ39941.1| hypothetical protein EUTSA_v10000819mg [Eutrema salsugineum] Length = 631 Score = 909 bits (2350), Expect = 0.0 Identities = 439/645 (68%), Positives = 520/645 (80%), Gaps = 1/645 (0%) Frame = -1 Query: 2462 RKPVLFLLVVTVLAPIVLYTD-RLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQESS 2286 R VLF L++TV API+LYTD SF + S+ +F+EDV+ L+F + +LN+LP+ES Sbjct: 5 RNLVLFFLLLTVAAPILLYTDPSSASFKTPFSKRDFLEDVTALTFNSDENRLNLLPRESP 64 Query: 2285 NTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSATENPIKQVNDGVR 2106 ++ +G+VYS + +S+ E D + L+ + + S TE+PIKQV DG Sbjct: 65 EVVRGVVGVVYSKQNSDSSR-RQEARDQLSARVLSTTDDDNQ---SQTEDPIKQVTDGAS 120 Query: 2105 
EGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTNSDEPPKNKMAR 1926 E ++ + + + + + Q + GK + EP + Sbjct: 121 EMDKPNDMHASDDNSQNREGMHV---------------QLTQQTSGKVDEQEPKSFGGEK 165 Query: 1925 PNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRALGDATKDSE 1746 G +VMPD +V+HLKDQLIRAKVYL L A + N HF+RELRLR+KEVQRAL DATKDS+ Sbjct: 166 ERGNVVMPDTQVKHLKDQLIRAKVYLSLPAAKANAHFVRELRLRIKEVQRALSDATKDSD 225 Query: 1745 LPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQTMFLTQLAAK 1566 LP+NA +KLKAMEQTLAKGKQIQDDC+ V+KKLRA+LHS EEQLRVHKKQTMFLTQL AK Sbjct: 226 LPKNAVEKLKAMEQTLAKGKQIQDDCSTVVKKLRAMLHSAEEQLRVHKKQTMFLTQLTAK 285 Query: 1565 TLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAAAVVVNSTVN 1386 T+PKGLHCLPLRL+T+YY+LNSSEQQFPN+E LED LYHYALFSDN+LA +VVVNST+ Sbjct: 286 TIPKGLHCLPLRLTTDYYALNSSEQQFPNQENLEDNQLYHYALFSDNVLATSVVVNSTIT 345 Query: 1385 HAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSYSPVLKQLGS 1206 +AK P+ HVFHIVTDRLNYAAMRMWFL NPPGKATIQVQN+EEFTWLNSSYSPVLKQL S Sbjct: 346 NAKHPSKHVFHIVTDRLNYAAMRMWFLDNPPGKATIQVQNVEEFTWLNSSYSPVLKQLSS 405 Query: 1205 QSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIVVQRDL 1026 QSMIDYYF+AH NSD+NLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIVVQ+DL Sbjct: 406 QSMIDYYFRAHHTNSDTNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLDDDIVVQKDL 465 Query: 1025 TGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMNVFDLEEW 846 +GLWS++LKG VNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMN+FDL+EW Sbjct: 466 SGLWSVDLKGNVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYGMNIFDLDEW 525 Query: 845 KKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLGLGYNPTVNQKE 666 KKQNITEVYH WQ LN+GR+LWKLGTLPPGLITFW RT+P+DR WH+LGLGYNP+VNQ++ Sbjct: 526 KKQNITEVYHRWQTLNEGRELWKLGTLPPGLITFWRRTYPLDRKWHILGLGYNPSVNQRD 585 Query: 665 IDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531 I+R AVIHYNGN+KPWLEIGIP+Y+G+WAK VDY+ VYLR+CNIN Sbjct: 586 IERGAVIHYNGNLKPWLEIGIPRYRGFWAKHVDYEHVYLRECNIN 630 >ref|XP_006350235.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X4 [Solanum tuberosum] Length = 679 Score = 908 bits (2347), Expect = 0.0 
Identities = 456/674 (67%), Positives = 536/674 (79%), Gaps = 26/674 (3%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISL--DSRNEFIEDVSTLSFGGEIRKLNVLP 2298 M +RKPVLFLL+VTV APIVLYTD LG++ + SR EFIED+ST +FGG++R LNVLP Sbjct: 3 MKLRKPVLFLLLVTVFAPIVLYTDTLGTYFTSPSSSRTEFIEDLSTFTFGGDVRPLNVLP 62 Query: 2297 QESSNTLKEPIGIVYSDNSRNSTLFSDEI---EDSVEELPLAE-STEHKTRVLSATE--- 2139 QESS +LKEP G VYS+NS +S + + ED+ + L E S +H+T S+ + Sbjct: 63 QESSTSLKEPRGDVYSENSSHSLSNASDTLSSEDARKTRQLTEESMKHQTATGSSNDGVE 122 Query: 2138 -----NPIKQVNDGVREGNESDGLQIN------------KSIGEKKGEERTNXXXXXXXX 2010 + I QV + E ++D ++I +KK + Sbjct: 123 VAMNGSHISQVTANLHEPQQTDKTSPKLVSAGKNESIAMETISKKKTSPTDSNQTLDSTK 182 Query: 2009 XXXKAGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATR 1830 + Q++V GK S E + K N +IV PDARVR LKDQLIRAKVYL L ATR Sbjct: 183 TETRHDQRTVQTSGKFVSGETARGKDEERNVQIVPPDARVRQLKDQLIRAKVYLSLSATR 242 Query: 1829 NNPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKK 1650 +NPHFIRELRLR+KEV RALG+ATKDS+L R+A +KLKAMEQTLAKGKQIQDDCA ++KK Sbjct: 243 SNPHFIRELRLRIKEVLRALGEATKDSDLSRSANEKLKAMEQTLAKGKQIQDDCATIVKK 302 Query: 1649 LRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEK 1470 LRA+LHS EEQLRVHKKQT++LT L AKTLPKGLHCLPLRLSTEY+ LNSS+Q FP++E Sbjct: 303 LRAMLHSAEEQLRVHKKQTLYLTHLTAKTLPKGLHCLPLRLSTEYFKLNSSQQHFPHQEN 362 Query: 1469 LEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPG 1290 LE+P LYHYALFSDNILAAAVVVNSTV+HAK+P+ HVFHIVTDRLN+AAMRMWFLANPP Sbjct: 363 LENPKLYHYALFSDNILAAAVVVNSTVSHAKDPSKHVFHIVTDRLNFAAMRMWFLANPPK 422 Query: 1289 KATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNH 1110 AT+ VQN+EEFTWLNSSYSPVLKQL SQSMIDYYF++ RA+SD N+KFRNPKYLSI+NH Sbjct: 423 YATVDVQNVEEFTWLNSSYSPVLKQLNSQSMIDYYFRS-RADSDPNVKFRNPKYLSIMNH 481 Query: 1109 LRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNF 930 LRFYLPEIFPKL+KVLFLDDDIVVQ+DL GLWSL+LKGKV G VETCGESFHRFDRYLNF Sbjct: 482 LRFYLPEIFPKLDKVLFLDDDIVVQKDLGGLWSLDLKGKVIGVVETCGESFHRFDRYLNF 541 Query: 929 
SNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLI 750 SNPLISKNFDPRACGWA+GMN+ DL +W++QNITEVYH+WQ N RQLWKLGTLPPGLI Sbjct: 542 SNPLISKNFDPRACGWAFGMNIIDLNQWRRQNITEVYHSWQNRNHERQLWKLGTLPPGLI 601 Query: 749 TFWNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFV 570 TFW RT+ +DRSWHVLGLGYNP V+QK+I RAAVIHYNGN+KPWLEI IPK++ YW+KFV Sbjct: 602 TFWKRTYALDRSWHVLGLGYNPNVSQKDIQRAAVIHYNGNLKPWLEISIPKFRDYWSKFV 661 Query: 569 DYDQVYLRDCNINR 528 DYDQ +LR+CNIN+ Sbjct: 662 DYDQAFLRECNINK 675 >ref|XP_006449976.1| hypothetical protein CICLE_v10014426mg [Citrus clementina] gi|557552587|gb|ESR63216.1| hypothetical protein CICLE_v10014426mg [Citrus clementina] Length = 646 Score = 908 bits (2347), Expect = 0.0 Identities = 453/654 (69%), Positives = 525/654 (80%), Gaps = 7/654 (1%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRL-GSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQ 2295 M R V+ +L TV API+++T S+ S EF+ED++ + GG+ R LN+LPQ Sbjct: 1 MKTRNLVVGMLCATVFAPILIFTSTFKDSYPSSSESGEFLEDLTAFTVGGDARHLNLLPQ 60 Query: 2294 ESSNTL--KEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKT-RVLSATENPIKQ 2124 ESS TL K+PI +V SD + S S EHK+ RVLSAT N + Q Sbjct: 61 ESSTTLSLKQPI-LVISDKIAQHSAHSQSQSQG--------SWEHKSARVLSATTNGLDQ 111 Query: 2123 --VNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQ-SVNADGKTNSD 1953 ++ +R+ + INK +++ + N QQ S G Sbjct: 112 SKTDNPIRQVTDLTKTPINKHADQEQIKASDNHISAHHSQILDTKHQQESSQTYGVLEKK 171 Query: 1952 EPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLRVKEVQRA 1773 EP K + + PD RVR LKDQLI+AKVYL L ATRNN +F+RELRLR+KEVQRA Sbjct: 172 EPTKINNEKQTEQTAPPDFRVRQLKDQLIKAKVYLSLPATRNNANFVRELRLRIKEVQRA 231 Query: 1772 LGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQLRVHKKQT 1593 LGDA+KDS+LPR A D+LKAMEQ+LAKGKQIQDDCAAV+KKLRA+LHSTEEQLRVHKKQT Sbjct: 232 LGDASKDSDLPRIANDRLKAMEQSLAKGKQIQDDCAAVVKKLRAMLHSTEEQLRVHKKQT 291 Query: 1592 MFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALFSDNILAA 1413 +FLTQL AKTLPKGLHCLPLRL+TEYYSLNSS++ FPN+EKLEDP L+HYALFSDN+LAA Sbjct: 292 
LFLTQLTAKTLPKGLHCLPLRLTTEYYSLNSSQRYFPNQEKLEDPRLFHYALFSDNVLAA 351 Query: 1412 AVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEFTWLNSSY 1233 AVVVNSTV HAK P+NHVFHIVTDRLNYAAMRMWFLANPPG+AT+QVQNIEEFTWLNSSY Sbjct: 352 AVVVNSTVTHAKHPSNHVFHIVTDRLNYAAMRMWFLANPPGRATVQVQNIEEFTWLNSSY 411 Query: 1232 SPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKLNKVLFLD 1053 SPVLKQL SQSMIDYYF+AHRANSDSNLKFRNPKYLSILNHLRFYLPE+FP+LNKVLFLD Sbjct: 412 SPVLKQLNSQSMIDYYFRAHRANSDSNLKFRNPKYLSILNHLRFYLPEVFPRLNKVLFLD 471 Query: 1052 DDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPRACGWAYG 873 DD+VVQ+DL+GLWS++LKGKVNGAVETCGE+FHRFDRYLNFSNPLISKNFDPRACGWAYG Sbjct: 472 DDVVVQKDLSGLWSIDLKGKVNGAVETCGETFHRFDRYLNFSNPLISKNFDPRACGWAYG 531 Query: 872 MNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRSWHVLGLG 693 MN+FDL+EW++QNIT+VYHTWQK+N RQLWKLGTLPPGLITFW RT+P+DR WHVLGLG Sbjct: 532 MNIFDLDEWRRQNITDVYHTWQKMNHDRQLWKLGTLPPGLITFWKRTYPLDRFWHVLGLG 591 Query: 692 YNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNIN 531 YNP+VNQ++I+RAAVIHYNGNMKPWLEI IPKY+ YW K VDYDQ+YLR+CNIN Sbjct: 592 YNPSVNQRDIERAAVIHYNGNMKPWLEINIPKYRNYWTKHVDYDQLYLRECNIN 645 >ref|XP_004236640.1| PREDICTED: probable galacturonosyltransferase 4-like [Solanum lycopersicum] Length = 680 Score = 906 bits (2342), Expect = 0.0 Identities = 457/676 (67%), Positives = 532/676 (78%), Gaps = 28/676 (4%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISL--DSRNEFIEDVSTLSFGGEIRKLNVLP 2298 M +RKPVLFLL+VTV APIVLYTD LG++ + SR EFIED+ST +FGG++R LNVLP Sbjct: 3 MKLRKPVLFLLLVTVFAPIVLYTDTLGTYFTSPSSSRTEFIEDLSTFTFGGDVRPLNVLP 62 Query: 2297 QESSNTLKEPIGIVYSDNSRNS------TLFSDEIEDSVEELPLAESTEHKTRVLSATE- 2139 QESS +LKEP G VYS+NS + TL S++ + +L AES +H+T S+ + Sbjct: 63 QESSTSLKEPRGDVYSENSSQTISNASDTLGSEDARKT-RQLTEAESLKHQTATGSSNDG 121 Query: 2138 -------NPIKQVNDGVREGNESDGLQ-----------INKSIGEKKGEERTNXXXXXXX 2013 N I QV D + E ++D I KK T+ Sbjct: 122 VEVAMNGNHISQVTDNLHEPQQTDKTSPKLVSAGKDESIAMETNSKKKTSSTDPNQTLDS 181 Query: 2012 
XXXXKA-GQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGA 1836 Q +V GK S E + K N +IV PDARVR LKDQLIRAKVYL L A Sbjct: 182 TKTETRHDQHTVQTSGKVVSGETARGKDEERNAQIVPPDARVRQLKDQLIRAKVYLSLSA 241 Query: 1835 TRNNPHFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVI 1656 TR+NPHFIRELRLR+KE RALG+ATKDS+L R+A +KLKAMEQTLAKGKQIQDDCA ++ Sbjct: 242 TRSNPHFIRELRLRIKESLRALGEATKDSDLSRSANEKLKAMEQTLAKGKQIQDDCATIV 301 Query: 1655 KKLRAILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNE 1476 KKLRA+LHS EEQLRVHKKQT++LT L AKTLPKGLHCLPLRLSTEY+ LNSS+Q FP++ Sbjct: 302 KKLRAMLHSAEEQLRVHKKQTLYLTHLTAKTLPKGLHCLPLRLSTEYFKLNSSQQHFPHQ 361 Query: 1475 EKLEDPSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANP 1296 E LE+P LYHYALFSDNILAAAVVVNSTV+HAK+P+ HVFHIVTDRLN+AAMRMWFLAN Sbjct: 362 ENLENPKLYHYALFSDNILAAAVVVNSTVSHAKDPSKHVFHIVTDRLNFAAMRMWFLANQ 421 Query: 1295 PGKATIQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSIL 1116 P AT+ VQ++EEFTWLNSSYSPVLKQL SQSMIDYYF++ RA+SD N+KFRNPKYLSI+ Sbjct: 422 PKYATVDVQSVEEFTWLNSSYSPVLKQLNSQSMIDYYFRS-RADSDPNVKFRNPKYLSIM 480 Query: 1115 NHLRFYLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYL 936 NHLRFYLPEIFPKL+KVLFLDDDIVVQ+DL GLWSL+LKGKV G VETCGESFHRFDRYL Sbjct: 481 NHLRFYLPEIFPKLDKVLFLDDDIVVQKDLGGLWSLDLKGKVIGVVETCGESFHRFDRYL 540 Query: 935 NFSNPLISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPG 756 NFSNPLIS+NFDPRACGWA+GMN+ DL EW++QNITEVYH+WQ N RQLWKLGTLPPG Sbjct: 541 NFSNPLISENFDPRACGWAFGMNIIDLNEWRRQNITEVYHSWQNRNHERQLWKLGTLPPG 600 Query: 755 LITFWNRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAK 576 LITFW RT+ +DRSWHVLGLGYNP V+QK+I RAAVIHYNGN+KPWLEI IPK++ YW+K Sbjct: 601 LITFWKRTYALDRSWHVLGLGYNPNVSQKDIQRAAVIHYNGNLKPWLEISIPKFRDYWSK 660 Query: 575 FVDYDQVYLRDCNINR 528 FVDYDQ +LR+CNIN+ Sbjct: 661 FVDYDQTFLRECNINK 676 >gb|EYU33824.1| hypothetical protein MIMGU_mgv1a002625mg [Mimulus guttatus] Length = 653 Score = 905 bits (2338), Expect = 0.0 Identities = 449/660 (68%), Positives = 522/660 (79%), Gaps = 16/660 (2%) Frame = -1 Query: 2465 
VRKPVLFLLVVTVLAPIVLYTDRLGSFIS-LDSRNEFIEDVSTLSFGGEIRKLNVLPQES 2289 +RKPVLFLL+VTV APIVLYTD LG + + SRNEF+ED ST +F GE+R LNVLPQES Sbjct: 5 LRKPVLFLLLVTVFAPIVLYTDTLGLYSTPSSSRNEFMEDGSTFTFAGEVRPLNVLPQES 64 Query: 2288 SNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT------ENPIK 2127 S TLKEP+G+VYS+NS ++ E + ESTE KT LS + ENPI+ Sbjct: 65 STTLKEPLGVVYSENSIEASSNKSEESTRITRQLTEESTEDKTTNLSGSSGGSKDENPIR 124 Query: 2126 QVNDGVREGNESDGLQINKSIGEKKGEERTNXXXXXXXXXXXKAGQQSVNADGKTNSDEP 1947 QV V E G + + + E N Q V ++ + E Sbjct: 125 QVISTVHEDEVGTGKEKSNKPQLHENTEIENR-------------QDDVTSENVSEKKEL 171 Query: 1946 PKNK---------MARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFIRELRLR 1794 + K +R N V+ DARVR LKDQLI+ +VYL L ATRNNPHFIR+LRLR Sbjct: 172 KRIKHSSRTREEVKSRQNERAVLSDARVRQLKDQLIQGRVYLSLSATRNNPHFIRDLRLR 231 Query: 1793 VKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILHSTEEQL 1614 +KEVQR LG+ATKDSELPRNA +K+KAMEQTL KGKQIQDDCAAV+KKLRA+LH EEQL Sbjct: 232 IKEVQRVLGEATKDSELPRNANEKMKAMEQTLLKGKQIQDDCAAVVKKLRAMLHLAEEQL 291 Query: 1613 RVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSLYHYALF 1434 R HKKQ +FLT L AKT+PKGLHC PLRLS+EY+ LNSS++ F N+E LE+P LYHYALF Sbjct: 292 RAHKKQALFLTHLTAKTVPKGLHCFPLRLSSEYFMLNSSQRDFSNKENLENPKLYHYALF 351 Query: 1433 SDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQVQNIEEF 1254 SDN+LAAAVVVNST+ HAK+P+ HVFH+VTDRLNYAAM+MWFLANPPGKATIQVQN+EEF Sbjct: 352 SDNVLAAAVVVNSTITHAKDPSKHVFHVVTDRLNYAAMKMWFLANPPGKATIQVQNVEEF 411 Query: 1253 TWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLPEIFPKL 1074 TWLNSSYSPVLKQL S+SMIDYYFK RA SDSNLK+RNPKYLSI+NHLRFYLPEIFPKL Sbjct: 412 TWLNSSYSPVLKQLSSRSMIDYYFKGKRAESDSNLKYRNPKYLSIMNHLRFYLPEIFPKL 471 Query: 1073 NKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLISKNFDPR 894 +KVLFLDDDIVVQ+DL+G++SLNLKGKV G VETCGE+FHRFDRYLNFSNP+ISKNFDPR Sbjct: 472 DKVLFLDDDIVVQKDLSGIFSLNLKGKVIGVVETCGETFHRFDRYLNFSNPIISKNFDPR 531 Query: 893 ACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRTFPIDRS 714 ACGWA+GMN+FDL+EW+KQNITEVYH WQ LN+ R LWKLGTLPPGLITF NRT+ +D+S Sbjct: 
532 ACGWAFGMNIFDLDEWRKQNITEVYHKWQNLNEDRLLWKLGTLPPGLITFSNRTYALDKS 591 Query: 713 WHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVYLRDCNI 534 WHVLGLGYNP V K+I+RAAVIHYNGN+KPWLEIG+PK++ YWAKFVDYD YLR+CNI Sbjct: 592 WHVLGLGYNPNVPLKDIERAAVIHYNGNLKPWLEIGLPKFRNYWAKFVDYDHQYLRECNI 651 >ref|XP_004499343.1| PREDICTED: probable galacturonosyltransferase 4-like [Cicer arietinum] Length = 658 Score = 898 bits (2320), Expect = 0.0 Identities = 451/668 (67%), Positives = 521/668 (77%), Gaps = 23/668 (3%) Frame = -1 Query: 2462 RKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGG-EIRKLNVLPQESS 2286 R V LL +TV+ PI+LYTDRL F + +EFI+DV+ + GG + LN+LPQE+S Sbjct: 5 RNIVFLLLCITVVTPILLYTDRLTDFNYPSAEHEFIQDVTAFAVGGAKSSHLNLLPQETS 64 Query: 2285 NTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEHKTRVLSAT--------ENPI 2130 LKEPIG+VYS+++ N ++ LP E TRVLSAT +NPI Sbjct: 65 TILKEPIGVVYSEDTSN-----------IKSLPQREHV--LTRVLSATNEEDWSKGDNPI 111 Query: 2129 KQVNDGVREGNESDGLQ--------------INKSIGEKKGEERTNXXXXXXXXXXXKAG 1992 K + DGV+ N+S L+ I+ + K + +N K G Sbjct: 112 KLLTDGVKPINQSSYLEKADITGGSVNGEDAIDVDDNDGKLTKSSNASDQVSETILTKQG 171 Query: 1991 QQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFI 1812 +Q + K N+ + + NG+ DARVR LKDQLI+AKVYL L A RNNPH Sbjct: 172 KQRTGSSSKGNNKGTILQETTKHNGQ-TPSDARVRKLKDQLIQAKVYLSLQAVRNNPHLT 230 Query: 1811 RELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILH 1632 RELRLRVKEV R LGDA+KDS+LPRNA +++K+MEQ+L KG+QIQDDCA +KKLRA+LH Sbjct: 231 RELRLRVKEVSRTLGDASKDSDLPRNANERMKSMEQSLMKGRQIQDDCATSVKKLRAMLH 290 Query: 1631 STEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSL 1452 S+E+QLRVHKKQT FLTQL AKTLPKGLHCLPLRL+TEYY+LNSS+QQFPN+EKLEDP L Sbjct: 291 SSEDQLRVHKKQTSFLTQLTAKTLPKGLHCLPLRLTTEYYNLNSSQQQFPNQEKLEDPGL 350 Query: 1451 YHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQV 1272 YHYA+FSDNILA AVVVNST HAK+ + HVFHIVTDRLNYAAMRMWFLANPPGKA IQV Sbjct: 351 YHYAIFSDNILATAVVVNSTAAHAKDASKHVFHIVTDRLNYAAMRMWFLANPPGKAAIQV 410 Query: 1271 
QNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLP 1092 QNIE+FTWLNSSYSPVLKQLGS SMIDYYFK HRA SDSNLKFRNPKYLS+LNHLRFYLP Sbjct: 411 QNIEDFTWLNSSYSPVLKQLGSPSMIDYYFKTHRATSDSNLKFRNPKYLSMLNHLRFYLP 470 Query: 1091 EIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLIS 912 EIFPKL KVLFLDDD+VVQ+DLTGLWS++LKG VNGAVETC ESFHRFDRYLNFSNPL++ Sbjct: 471 EIFPKLKKVLFLDDDVVVQKDLTGLWSIDLKGNVNGAVETCAESFHRFDRYLNFSNPLVA 530 Query: 911 KNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRT 732 +NFDPRACGWAYGMNVFDL WKKQNITEVYH WQKLN RQLWKLGTLPPGLITFW RT Sbjct: 531 RNFDPRACGWAYGMNVFDLVGWKKQNITEVYHNWQKLNHDRQLWKLGTLPPGLITFWKRT 590 Query: 731 FPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVY 552 FP++RSWHVLGLGYNP VNQK+I+RAAVIHYNGNMKPWLEI IPK++ YW K+VDYD VY Sbjct: 591 FPLNRSWHVLGLGYNPNVNQKDIERAAVIHYNGNMKPWLEISIPKFRAYWTKYVDYDIVY 650 Query: 551 LRDCNINR 528 LR+CNIN+ Sbjct: 651 LRECNINQ 658 >ref|XP_006600275.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1 [Glycine max] gi|571532515|ref|XP_006600276.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X2 [Glycine max] Length = 661 Score = 896 bits (2316), Expect = 0.0 Identities = 447/667 (67%), Positives = 524/667 (78%), Gaps = 20/667 (2%) Frame = -1 Query: 2471 MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292 ++ R VL LL +T +APIVL+TDRLG+F + EFIE V+ + LN+LPQE Sbjct: 2 VVTRNIVLLLLSITFVAPIVLFTDRLGTFKYPFAEQEFIEAVTAFVSAADSGHLNLLPQE 61 Query: 2291 SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEH-KTRVLSAT--------E 2139 SS KEPIG+VY++++ N+ E+ + L A+ EH RVLSAT E Sbjct: 62 SSTVFKEPIGLVYTEDTSNT-------ENLLHGLHFAKPGEHVSARVLSATNDEGQTKGE 114 Query: 2138 NPIKQVNDGVREGNESDGLQINKSIGEK-KGEERTNXXXXXXXXXXXK----------AG 1992 NPIK V DG+ +GN++ + + G+ GE+ + Sbjct: 115 NPIKLVTDGINQGNQNSYMVKADTTGDSVNGEDAIDVDDNDGKLAKSSDLVSETTDTKQE 174 Query: 1991 QQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNPHFI 1812 Q+ + + + EP ++ + N + PDARV+ LKDQLI+A+VYL L A R+NPH Sbjct: 175 
QEHIKSSSQVTQKEPILSEADKHNDQ-TPPDARVQQLKDQLIQARVYLSLQAVRSNPHLT 233 Query: 1811 RELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRAILH 1632 RELRLRVKEV R LGDA+KDS+LPRNA +++KAMEQTL KG+QIQ+DCAA +KKLRA+LH Sbjct: 234 RELRLRVKEVSRTLGDASKDSDLPRNANERMKAMEQTLMKGRQIQNDCAAAVKKLRAMLH 293 Query: 1631 STEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLEDPSL 1452 STEEQL VHKKQT+FLTQL AKTLPKGLHCLPLRL+TEYYSLN+S+QQF N++KLEDP L Sbjct: 294 STEEQLHVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNTSQQQFRNQQKLEDPRL 353 Query: 1451 YHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKATIQV 1272 YHYA+FSDNILA AVVVNSTV HAK+ + HVFHIVTDRLNYAAMRMWFL NPP KATIQV Sbjct: 354 YHYAIFSDNILATAVVVNSTVAHAKDTSKHVFHIVTDRLNYAAMRMWFLVNPPQKATIQV 413 Query: 1271 QNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRFYLP 1092 QNIE+FTWLNSSYSPVLKQLGS SMID+YFK HRA+SDSNLKFRNPKYLSILNHLRFYLP Sbjct: 414 QNIEDFTWLNSSYSPVLKQLGSPSMIDFYFKTHRASSDSNLKFRNPKYLSILNHLRFYLP 473 Query: 1091 EIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNPLIS 912 EIFPKLNKVLFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGE FHRFDRYLNFSNPLI+ Sbjct: 474 EIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLNFSNPLIA 533 Query: 911 KNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFWNRT 732 KNFDPRACGWAYGMNVFDL +WK+QNIT+VYH WQK+N RQLWKLGTLPPGLITFW RT Sbjct: 534 KNFDPRACGWAYGMNVFDLVQWKRQNITDVYHKWQKMNHDRQLWKLGTLPPGLITFWKRT 593 Query: 731 FPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYDQVY 552 F + RSWHVLGLGYNP +NQKEI+RAAVIHYNGNMKPWLEI IPK++GYW K+VDY+ VY Sbjct: 594 FQLHRSWHVLGLGYNPNINQKEIERAAVIHYNGNMKPWLEISIPKFRGYWTKYVDYNLVY 653 Query: 551 LRDCNIN 531 LR+CNIN Sbjct: 654 LRECNIN 660 >ref|XP_006584115.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X1 [Glycine max] gi|571468064|ref|XP_006584116.1| PREDICTED: probable galacturonosyltransferase 4-like isoform X2 [Glycine max] Length = 664 Score = 892 bits (2304), Expect = 0.0 Identities = 446/670 (66%), Positives = 523/670 (78%), Gaps = 23/670 (3%) Frame = -1 Query: 2471 
MMVRKPVLFLLVVTVLAPIVLYTDRLGSFISLDSRNEFIEDVSTLSFGGEIRKLNVLPQE 2292 ++ R VL LL +T +APIVLYTDR G+F + EFI+ V+ + LN+LPQE Sbjct: 2 VVTRNIVLLLLSITFVAPIVLYTDRFGTFKYPFAEQEFIDAVTAFVSAADSGHLNLLPQE 61 Query: 2291 SSNTLKEPIGIVYSDNSRNSTLFSDEIEDSVEELPLAESTEH-KTRVLSAT--------E 2139 +S KEPIG+VY++++ N+ ++ + L A+ EH RVLSAT E Sbjct: 62 TSTVFKEPIGLVYTEDAANT-------KNLLHGLHFAKPGEHVSARVLSATKDEGQTKGE 114 Query: 2138 NPIKQVNDGVREGNESDGL--------------QINKSIGEKKGEERTNXXXXXXXXXXX 2001 NPIK V DG+ +GN++ L I+ + K + ++ Sbjct: 115 NPIKLVTDGINQGNQNSYLVKADITGDSVNGEDAIDVDDNDGKLAKSSDASDLASETMDT 174 Query: 2000 KAGQQSVNADGKTNSDEPPKNKMARPNGEIVMPDARVRHLKDQLIRAKVYLGLGATRNNP 1821 K QQ + + + + + K A + + PDARVR+LKDQLI+ +VYL L A RNNP Sbjct: 175 KQEQQHIKSSSQV-TQKGSKLSEADKHIDQTPPDARVRYLKDQLIQVRVYLSLQAVRNNP 233 Query: 1820 HFIRELRLRVKEVQRALGDATKDSELPRNAYDKLKAMEQTLAKGKQIQDDCAAVIKKLRA 1641 H RELRLRVKEV R LGDA+KDS+LPRNA +++KAMEQTL KG+QIQ+DCAA +KKLRA Sbjct: 234 HLTRELRLRVKEVSRTLGDASKDSDLPRNANERMKAMEQTLMKGRQIQNDCAAAVKKLRA 293 Query: 1640 ILHSTEEQLRVHKKQTMFLTQLAAKTLPKGLHCLPLRLSTEYYSLNSSEQQFPNEEKLED 1461 +LHSTEEQL VHKKQT+FLTQL AKTLPKGLHCLPLRL+TEYYSLN+S+QQ PN++KLE+ Sbjct: 294 MLHSTEEQLHVHKKQTLFLTQLTAKTLPKGLHCLPLRLTTEYYSLNTSQQQLPNQQKLEN 353 Query: 1460 PSLYHYALFSDNILAAAVVVNSTVNHAKEPANHVFHIVTDRLNYAAMRMWFLANPPGKAT 1281 P LYHYA+FSDNILA AVVVNSTV HAK+ +NHVFHIVTDRLNYAAMRMWFL NPP KAT Sbjct: 354 PRLYHYAIFSDNILATAVVVNSTVAHAKDTSNHVFHIVTDRLNYAAMRMWFLVNPPKKAT 413 Query: 1280 IQVQNIEEFTWLNSSYSPVLKQLGSQSMIDYYFKAHRANSDSNLKFRNPKYLSILNHLRF 1101 IQVQNIE+FTWLNSSYSPVLKQLGS SM+D+YFK HRA+SDSNLKFRNPKYLSILNHLRF Sbjct: 414 IQVQNIEDFTWLNSSYSPVLKQLGSPSMVDFYFKTHRASSDSNLKFRNPKYLSILNHLRF 473 Query: 1100 YLPEIFPKLNKVLFLDDDIVVQRDLTGLWSLNLKGKVNGAVETCGESFHRFDRYLNFSNP 921 YLPEIFPKLNKVLFLDDDIVVQ+DLTGLWS++LKG VNGAVETCGE FHRFDRYLNFSNP Sbjct: 474 YLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLNFSNP 533 Query: 920 LISKNFDPRACGWAYGMNVFDLEEWKKQNITEVYHTWQKLNQGRQLWKLGTLPPGLITFW 741 I+KNFDPRACGWAYGMNVFDL +WK+QNITEVYH WQKLN RQLWKLGTLPPGLITFW Sbjct: 534 
HIAKNFDPRACGWAYGMNVFDLVQWKRQNITEVYHNWQKLNHDRQLWKLGTLPPGLITFW 593 Query: 740 NRTFPIDRSWHVLGLGYNPTVNQKEIDRAAVIHYNGNMKPWLEIGIPKYKGYWAKFVDYD 561 RTF ++RSWHVLGLGYNP +NQKEI+RAAVIHYNGNMKPWLEI PK++GYW K+VDYD Sbjct: 594 KRTFQLNRSWHVLGLGYNPNINQKEIERAAVIHYNGNMKPWLEISFPKFRGYWTKYVDYD 653 Query: 560 QVYLRDCNIN 531 VYLR+CNIN Sbjct: 654 LVYLRECNIN 663