BLASTX nr result
ID: Rheum21_contig00007194
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00007194
         (2521 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ08397.1| hypothetical protein PRUPE_ppa002860mg [Prunus pe...   653   0.0
ref|XP_006429684.1| hypothetical protein CICLE_v10011265mg [Citr...   635   e-179
ref|XP_006481281.1| PREDICTED: probable galacturonosyltransferas...   631   e-178
ref|XP_006429685.1| hypothetical protein CICLE_v10011265mg [Citr...   630   e-178
gb|EOY03195.1| Glycosyltransferase, CAZy family GT8, putative is...   621   e-175
gb|EOY03194.1| Glycosyltransferase, CAZy family GT8, putative is...   621   e-175
emb|CAQ58617.1| transferase, transferring glycosyl groups / unkn...   617   e-173
gb|EXC35198.1| putative galacturonosyltransferase 7 [Morus notab...   615   e-173
ref|XP_006411083.1| hypothetical protein EUTSA_v10016387mg [Eutr...   611   e-172
ref|XP_006411082.1| hypothetical protein EUTSA_v10016387mg [Eutr...   609   e-171
ref|XP_002323701.2| glycosyl transferase family 8 family protein...   608   e-171
ref|XP_002881608.1| GAUT7/LGT7 [Arabidopsis lyrata subsp. lyrata...   608   e-171
ref|XP_004305055.1| PREDICTED: probable galacturonosyltransferas...   607   e-171
ref|XP_003623702.1| hypothetical protein MTR_7g074680 [Medicago ...   607   e-171
ref|XP_006293843.1| hypothetical protein CARUB_v10022827mg [Caps...   606   e-170
ref|XP_004147522.1| PREDICTED: probable galacturonosyltransferas...   606   e-170
ref|NP_565893.1| alpha-1,4-galacturonosyltransferase [Arabidopsi...   606   e-170
ref|XP_004163983.1| PREDICTED: probable galacturonosyltransferas...   605   e-170
ref|XP_003551632.2| PREDICTED: probable galacturonosyltransferas... 
598 e-168 ref|XP_004492632.1| PREDICTED: probable galacturonosyltransferas... 597 e-168 >gb|EMJ08397.1| hypothetical protein PRUPE_ppa002860mg [Prunus persica] Length = 626 Score = 653 bits (1684), Expect = 0.0 Identities = 347/629 (55%), Positives = 422/629 (67%), Gaps = 14/629 (2%) Frame = -2 Query: 2367 KKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVTLRISQ 2188 K+RWK LSMLVPL+FLLGLHNGFHS + ++P + L Sbjct: 13 KRRWKGLVIAVLGLVFLSMLVPLLFLLGLHNGFHSPG--------SEQQSSPSIGLGGYG 64 Query: 2187 RKINHNMWMNL*--------ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQPTRSP 2032 KI NL +D +F D L + S H Sbjct: 65 TKIVIRDASNLSEGDRSNHVDDLVKQFAPTLSKDILKNISHPAENETKSPSAMHD-NEEE 123 Query: 2031 PGQSGKKWISKTDKVKESKPAGSVSEAKV------VGESEISCELKYGSYCLWRREYRED 1870 G S E+ P S + V +S SCELK+GSYCLWR ++RED Sbjct: 124 KGFSAPPHADLQSPPIENNPKAGASVQIIDYAKGGVDQSGKSCELKFGSYCLWREQHRED 183 Query: 1869 MKDSMVKTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLM 1690 MKDSMVK LKD LFVARAYYPSIAKLP DKLSRE+RQ+IQ+ ERV+SE+T DADLPP + Sbjct: 184 MKDSMVKRLKDHLFVARAYYPSIAKLPSQDKLSREMRQNIQEVERVLSESTTDADLPPQI 243 Query: 1689 DKKLQKMDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKS 1510 KKLQ+M +I++AKS ++CNNVDKK RQ+ DLTEDE +FH +QS FLYQLAVQTMPKS Sbjct: 244 GKKLQRMQAAIARAKSFHVDCNNVDKKLRQIYDLTEDEANFHMRQSVFLYQLAVQTMPKS 303 Query: 1509 LHCLSMRLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKES 1330 LHCLSMRLTVEYFR P D + S+A KY+D+ L HYVIFS N+LAS+VVINSTVMHAKES Sbjct: 304 LHCLSMRLTVEYFRSPFDDTEASLADKYIDRALQHYVIFSTNVLASSVVINSTVMHAKES 363 Query: 1329 RKLVFHILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRVS 1150 KLVFH+LTD++NYFAMKLWF RNT+K + ++VLN+E + Q SLP EFRVS Sbjct: 364 GKLVFHVLTDEENYFAMKLWFFRNTYKEATIEVLNMERLDLNNQKL---QFSLPVEFRVS 420 Query: 1149 FQSAKLSNMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGK 970 S RTEY+S FS+ HY LP+IF VQ+DLSALWN+NMEGK Sbjct: 421 HSVDAQS----RTEYLSTFSHLHYRLPEIFQNLEKVVVLDDDVVVQQDLSALWNLNMEGK 476 Query: 969 VNAAAEHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELN 790 VNAA + C V L LLRSYLG N F+ 
SCAWMSGLNV+DL +WR+L+L+ T++ FV+E++ Sbjct: 477 VNAAVQFCSVKLSLLRSYLGENSFNKNSCAWMSGLNVIDLVKWRELDLTETYQKFVKEVS 536 Query: 789 TEGGSDRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPW 610 T+ + AVA HASLLTFQD +YPLDGSW LSGLGHDY++++ + +AVLHYNG MKPW Sbjct: 537 TQEAQNEAVALHASLLTFQDLIYPLDGSWALSGLGHDYNVDVYPIRNAAVLHYNGKMKPW 596 Query: 609 LELGIPKFKNYWTKYLNRDDQFLTDCNVN 523 LELGIPK+K YW ++NR+DQFLTDCN N Sbjct: 597 LELGIPKYKGYWKNFVNREDQFLTDCNWN 625 >ref|XP_006429684.1| hypothetical protein CICLE_v10011265mg [Citrus clementina] gi|568855375|ref|XP_006481282.1| PREDICTED: probable galacturonosyltransferase 7-like isoform X3 [Citrus sinensis] gi|557531741|gb|ESR42924.1| hypothetical protein CICLE_v10011265mg [Citrus clementina] Length = 623 Score = 635 bits (1639), Expect = e-179 Identities = 333/622 (53%), Positives = 428/622 (68%), Gaps = 6/622 (0%) Frame = -2 Query: 2367 KKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPV--TLRI 2194 K+RW+ LSMLVPL FLLGLHNGFHS PV T + Sbjct: 19 KRRWRSLVIGVLFLVILSMLVPLAFLLGLHNGFHSPNPNPN--------GYVPVHKTSIV 70 Query: 2193 SQRKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQPTR--SPPGQS 2020 S KI + N + + +F + + D+ H SPP Sbjct: 71 SDLKI----YDKYENSETFNYAEDVRSNFPDGAKTETSDMSATDTSHHSKVTPVSPPAVP 126 Query: 2019 GKKWISKTDKVKESKPAGSVSEAKVVGESEI-SCELKYGSYCLWRREYREDMKDSMVKTL 1843 SK AG+V+++ G E +CELK+GSYCLWRRE+RE+MKD+MVK L Sbjct: 127 -----QSLPNTSNSKIAGTVADSGRGGVDENENCELKFGSYCLWRREHREEMKDTMVKKL 181 Query: 1842 KDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKKLQKMDV 1663 KD+LFVARAYYPSIAKLP DKL+R LRQ+IQ+ ERV+SE+ D DLPP ++KK+Q+M+ Sbjct: 182 KDQLFVARAYYPSIAKLPSQDKLTRALRQNIQEVERVLSESATDVDLPPGIEKKIQRMEA 241 Query: 1662 SISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHCLSMRLT 1483 +I+KAKS+ ++C+NVDKKFRQ++D+T DE +FH KQSAFLYQLAVQTMPKSLHCLSMRLT Sbjct: 242 AITKAKSVPVDCSNVDKKFRQILDMTNDEANFHMKQSAFLYQLAVQTMPKSLHCLSMRLT 301 Query: 1482 VEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKLVFHILT 1303 VEYF+ P++ M+ S A ++ D +L+HYVIFS N+LAS+V+INSTV+ A+E++ VFH+LT Sbjct: 302 
VEYFKSPSVVMELSQADRFSDPSLHHYVIFSTNVLASSVLINSTVLCARENKNQVFHVLT 361 Query: 1302 DKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQD-TTTSHLSLPEEFRVSFQSAKLSN 1126 D QNYFAMKLWF RNTFK + VQVLNIE + D H+ LP E+RVS S + Sbjct: 362 DGQNYFAMKLWFFRNTFKEATVQVLNIEQLNLESHDKAILIHMFLPVEYRVSLLSVDGPS 421 Query: 1125 MPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKVNAAAEHC 946 + + +Y+S+FS+ HY+LP+IF VQ+DLSALW+INM GKVN A + C Sbjct: 422 IHSKMQYISVFSHLHYLLPEIFQSLTKVVVLDDDVVVQKDLSALWDINMGGKVNGAVQSC 481 Query: 945 DVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELNTEGGSDRA 766 V+LG L+SYLG N +D SCAWMSGLN+VDLARWR+L+L+ T++ VRE++ S A Sbjct: 482 SVSLGQLKSYLGENSYDKNSCAWMSGLNIVDLARWRELDLTKTYQRLVREVSMGEESKEA 541 Query: 765 VASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPWLELGIPKF 586 VA SLLTFQD VY LDG W LSGLGHDY LNI+A+ K+AVLHYNGNMKPWLELGIP++ Sbjct: 542 VALRGSLLTFQDLVYALDGVWALSGLGHDYGLNIEAIKKAAVLHYNGNMKPWLELGIPRY 601 Query: 585 KNYWTKYLNRDDQFLTDCNVNP 520 K +W K+LN++DQ L++CNV+P Sbjct: 602 KKFWKKFLNQEDQLLSECNVHP 623 >ref|XP_006481281.1| PREDICTED: probable galacturonosyltransferase 7-like isoform X2 [Citrus sinensis] Length = 642 Score = 631 bits (1628), Expect = e-178 Identities = 331/632 (52%), Positives = 430/632 (68%), Gaps = 16/632 (2%) Frame = -2 Query: 2367 KKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXX------RVDLEDTV----- 2221 K+RW+ LSMLVPL FLLGLHNGFHS + + D Sbjct: 19 KRRWRSLVIGVLFLVILSMLVPLAFLLGLHNGFHSPNPNPNGYVPVHKTSISDLKIYDKY 78 Query: 2220 -NTPPVTLRISQRKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQP 2044 N+ + R + N +L + + +F + + D+ H Sbjct: 79 ENSETFNYAENYRSTHIN---DLVKKLAPNISKDVRSNFPDGAKTETSDMSATDTSHHSK 135 Query: 2043 TR--SPPGQSGKKWISKTDKVKESKPAGSVSEAKVVGESEI-SCELKYGSYCLWRREYRE 1873 SPP SK AG+V+++ G E +CELK+GSYCLWRRE+RE Sbjct: 136 VTPVSPPAVP-----QSLPNTSNSKIAGTVADSGRGGVDENENCELKFGSYCLWRREHRE 190 Query: 1872 DMKDSMVKTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPL 1693 +MKD+MVK LKD+LFVARAYYPSIAKLP DKL+R LRQ+IQ+ ERV+SE+ D DLPP Sbjct: 191 EMKDTMVKKLKDQLFVARAYYPSIAKLPSQDKLTRALRQNIQEVERVLSESATDVDLPPG 250 
Query: 1692 MDKKLQKMDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPK 1513 ++KK+Q+M+ +I+KAKS+ ++C+NVDKKFRQ++D+T DE +FH KQSAFLYQLAVQTMPK Sbjct: 251 IEKKIQRMEAAITKAKSVPVDCSNVDKKFRQILDMTNDEANFHMKQSAFLYQLAVQTMPK 310 Query: 1512 SLHCLSMRLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKE 1333 SLHCLSMRLTVEYF+ P++ M+ S A ++ D +L+HYVIFS N+LAS+V+INSTV+ A+E Sbjct: 311 SLHCLSMRLTVEYFKSPSVVMELSQADRFSDPSLHHYVIFSTNVLASSVLINSTVLCARE 370 Query: 1332 SRKLVFHILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQD-TTTSHLSLPEEFR 1156 ++ VFH+LTD QNYFAMKLWF RNTFK + VQVLNIE + D H+ LP E+R Sbjct: 371 NKNQVFHVLTDGQNYFAMKLWFFRNTFKEATVQVLNIEQLNLESHDKAILIHMFLPVEYR 430 Query: 1155 VSFQSAKLSNMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINME 976 VS S ++ + +Y+S+FS+ HY+LP+IF VQ+DLSALW+INM Sbjct: 431 VSLLSVDGPSIHSKMQYISVFSHLHYLLPEIFQSLTKVVVLDDDVVVQKDLSALWDINMG 490 Query: 975 GKVNAAAEHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRE 796 GKVN A + C V+LG L+SYLG N +D SCAWMSGLN+VDLARWR+L+L+ T++ VRE Sbjct: 491 GKVNGAVQSCSVSLGQLKSYLGENSYDKNSCAWMSGLNIVDLARWRELDLTKTYQRLVRE 550 Query: 795 LNTEGGSDRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMK 616 ++ S AVA SLLTFQD VY LDG W LSGLGHDY LNI+A+ K+AVLHYNGNMK Sbjct: 551 VSMGEESKEAVALRGSLLTFQDLVYALDGVWALSGLGHDYGLNIEAIKKAAVLHYNGNMK 610 Query: 615 PWLELGIPKFKNYWTKYLNRDDQFLTDCNVNP 520 PWLELGIP++K +W K+LN++DQ L++CNV+P Sbjct: 611 PWLELGIPRYKKFWKKFLNQEDQLLSECNVHP 642 >ref|XP_006429685.1| hypothetical protein CICLE_v10011265mg [Citrus clementina] gi|568855371|ref|XP_006481280.1| PREDICTED: probable galacturonosyltransferase 7-like isoform X1 [Citrus sinensis] gi|557531742|gb|ESR42925.1| hypothetical protein CICLE_v10011265mg [Citrus clementina] Length = 643 Score = 630 bits (1626), Expect = e-178 Identities = 331/633 (52%), Positives = 429/633 (67%), Gaps = 17/633 (2%) Frame = -2 Query: 2367 KKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHS-------------TXXXXXRVDLED 2227 K+RW+ LSMLVPL FLLGLHNGFHS T + Sbjct: 19 KRRWRSLVIGVLFLVILSMLVPLAFLLGLHNGFHSPNPNPNGYVPVHKTSIVSDLKIYDK 78 Query: 2226 
TVNTPPVTLRISQRKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQ 2047 N+ + R + N +L + + +F + + D+ H Sbjct: 79 YENSETFNYAENYRSTHIN---DLVKKLAPNISKDVRSNFPDGAKTETSDMSATDTSHHS 135 Query: 2046 PTR--SPPGQSGKKWISKTDKVKESKPAGSVSEAKVVGESEI-SCELKYGSYCLWRREYR 1876 SPP SK AG+V+++ G E +CELK+GSYCLWRRE+R Sbjct: 136 KVTPVSPPAVP-----QSLPNTSNSKIAGTVADSGRGGVDENENCELKFGSYCLWRREHR 190 Query: 1875 EDMKDSMVKTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPP 1696 E+MKD+MVK LKD+LFVARAYYPSIAKLP DKL+R LRQ+IQ+ ERV+SE+ D DLPP Sbjct: 191 EEMKDTMVKKLKDQLFVARAYYPSIAKLPSQDKLTRALRQNIQEVERVLSESATDVDLPP 250 Query: 1695 LMDKKLQKMDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMP 1516 ++KK+Q+M+ +I+KAKS+ ++C+NVDKKFRQ++D+T DE +FH KQSAFLYQLAVQTMP Sbjct: 251 GIEKKIQRMEAAITKAKSVPVDCSNVDKKFRQILDMTNDEANFHMKQSAFLYQLAVQTMP 310 Query: 1515 KSLHCLSMRLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAK 1336 KSLHCLSMRLTVEYF+ P++ M+ S A ++ D +L+HYVIFS N+LAS+V+INSTV+ A+ Sbjct: 311 KSLHCLSMRLTVEYFKSPSVVMELSQADRFSDPSLHHYVIFSTNVLASSVLINSTVLCAR 370 Query: 1335 ESRKLVFHILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQD-TTTSHLSLPEEF 1159 E++ VFH+LTD QNYFAMKLWF RNTFK + VQVLNIE + D H+ LP E+ Sbjct: 371 ENKNQVFHVLTDGQNYFAMKLWFFRNTFKEATVQVLNIEQLNLESHDKAILIHMFLPVEY 430 Query: 1158 RVSFQSAKLSNMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINM 979 RVS S ++ + +Y+S+FS+ HY+LP+IF VQ+DLSALW+INM Sbjct: 431 RVSLLSVDGPSIHSKMQYISVFSHLHYLLPEIFQSLTKVVVLDDDVVVQKDLSALWDINM 490 Query: 978 EGKVNAAAEHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVR 799 GKVN A + C V+LG L+SYLG N +D SCAWMSGLN+VDLARWR+L+L+ T++ VR Sbjct: 491 GGKVNGAVQSCSVSLGQLKSYLGENSYDKNSCAWMSGLNIVDLARWRELDLTKTYQRLVR 550 Query: 798 ELNTEGGSDRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNM 619 E++ S AVA SLLTFQD VY LDG W LSGLGHDY LNI+A+ K+AVLHYNGNM Sbjct: 551 EVSMGEESKEAVALRGSLLTFQDLVYALDGVWALSGLGHDYGLNIEAIKKAAVLHYNGNM 610 Query: 618 KPWLELGIPKFKNYWTKYLNRDDQFLTDCNVNP 520 KPWLELGIP++K +W K+LN++DQ L++CNV+P Sbjct: 611 KPWLELGIPRYKKFWKKFLNQEDQLLSECNVHP 643 >gb|EOY03195.1| Glycosyltransferase, CAZy 
family GT8, putative isoform 2 [Theobroma cacao] Length = 610 Score = 621 bits (1601), Expect = e-175 Identities = 339/630 (53%), Positives = 431/630 (68%), Gaps = 12/630 (1%) Frame = -2 Query: 2373 PPKKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVTLRI 2194 P K+RW+ LSMLVPL FLLGLHNGFHS + L+ T + + I Sbjct: 14 PAKRRWRGLAIGVLFLVVLSMLVPLGFLLGLHNGFHSAGI----MPLQHTSSPGDRSSHI 69 Query: 2193 SQ--RKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQPTRSPPGQS 2020 RK+ + ++ ++KG ++ + T P Sbjct: 70 DSLVRKLGPTL-----------------------QKDILKGFINEAKNETSSTNVTPKNQ 106 Query: 2019 GKKWISKTDKVKESKPAGSVS----EAKVVG---ESEISCELKYGSYCLWRREYREDMKD 1861 +K I +V ++S +A + G ESE CELKYGSYC+W E RE+MKD Sbjct: 107 QRKGIPVPPQVLLQPLTINISSISDKAGMKGHLDESEGLCELKYGSYCIWHEENREEMKD 166 Query: 1860 SMVKTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKK 1681 S VK LKD+LFVARAY+PSIAK+P KLSRELRQ+IQ+ ERV+SE+T DADLPP ++KK Sbjct: 167 SKVKKLKDQLFVARAYFPSIAKVPAQSKLSRELRQNIQELERVLSESTTDADLPPEIEKK 226 Query: 1680 LQKMDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHC 1501 ++M+ +I++AKS+ ++CNNVDKK RQ+ DLTEDE +FH KQSAFLYQLAVQTMPKSLHC Sbjct: 227 SRRMEAAIARAKSVSVDCNNVDKKLRQIFDLTEDEANFHMKQSAFLYQLAVQTMPKSLHC 286 Query: 1500 LSMRLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKL 1321 LSMRLTVEYF+D + D + + +K+ D TL HYVIFSNN++AS+VVINSTVMHA+ES L Sbjct: 287 LSMRLTVEYFKDHSFDKE--LPEKFSDPTLQHYVIFSNNVIASSVVINSTVMHARESMNL 344 Query: 1320 VFHILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQD-TTTSHLSLPEEFRVSFQ 1144 VFH+LTD QNYFAMKLWFL+NTFK++ +QVLNIE D T SHL+LP EFRVSF Sbjct: 345 VFHVLTDGQNYFAMKLWFLKNTFKDAVIQVLNIEHLNSEYYDKATLSHLTLPVEFRVSFH 404 Query: 1143 SAKLSNMPH-RTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKV 967 S+ + H RT+Y+S+FS+SHY+LP+IF VQ+DLSAL +++M GKV Sbjct: 405 SSDNAPAIHDRTQYLSIFSHSHYLLPEIFRNLEKVVVLDDDVVVQQDLSALRSLDMAGKV 464 Query: 966 NAAAEHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRE-LN 790 A + C V LG LRSYLG + FD SC+WMSGLNV+DL WR+L +S T+ V+E ++ Sbjct: 465 
IGAVQICSVRLGQLRSYLGRSSFDKNSCSWMSGLNVIDLVMWRELGISETYWKLVKEKVS 524 Query: 789 TEGGSDRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPW 610 + GS A ASLLTFQD VY LD WVLSGLGHDY LNI+ + K+AVLHYNGNMKPW Sbjct: 525 MKEGS----ALLASLLTFQDLVYALDSVWVLSGLGHDYGLNIEGIEKAAVLHYNGNMKPW 580 Query: 609 LELGIPKFKNYWTKYLNRDDQFLTDCNVNP 520 L+LGIPK+K YW K+LN++DQFL++CNVNP Sbjct: 581 LDLGIPKYKAYWKKFLNQEDQFLSECNVNP 610 >gb|EOY03194.1| Glycosyltransferase, CAZy family GT8, putative isoform 1 [Theobroma cacao] Length = 611 Score = 621 bits (1601), Expect = e-175 Identities = 339/630 (53%), Positives = 431/630 (68%), Gaps = 12/630 (1%) Frame = -2 Query: 2373 PPKKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVTLRI 2194 P K+RW+ LSMLVPL FLLGLHNGFHS + L+ T + + I Sbjct: 14 PAKRRWRGLAIGVLFLVVLSMLVPLGFLLGLHNGFHSAAGI---MPLQHTSSPGDRSSHI 70 Query: 2193 SQ--RKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQPTRSPPGQS 2020 RK+ + ++ ++KG ++ + T P Sbjct: 71 DSLVRKLGPTL-----------------------QKDILKGFINEAKNETSSTNVTPKNQ 107 Query: 2019 GKKWISKTDKVKESKPAGSVS----EAKVVG---ESEISCELKYGSYCLWRREYREDMKD 1861 +K I +V ++S +A + G ESE CELKYGSYC+W E RE+MKD Sbjct: 108 QRKGIPVPPQVLLQPLTINISSISDKAGMKGHLDESEGLCELKYGSYCIWHEENREEMKD 167 Query: 1860 SMVKTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKK 1681 S VK LKD+LFVARAY+PSIAK+P KLSRELRQ+IQ+ ERV+SE+T DADLPP ++KK Sbjct: 168 SKVKKLKDQLFVARAYFPSIAKVPAQSKLSRELRQNIQELERVLSESTTDADLPPEIEKK 227 Query: 1680 LQKMDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHC 1501 ++M+ +I++AKS+ ++CNNVDKK RQ+ DLTEDE +FH KQSAFLYQLAVQTMPKSLHC Sbjct: 228 SRRMEAAIARAKSVSVDCNNVDKKLRQIFDLTEDEANFHMKQSAFLYQLAVQTMPKSLHC 287 Query: 1500 LSMRLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKL 1321 LSMRLTVEYF+D + D + + +K+ D TL HYVIFSNN++AS+VVINSTVMHA+ES L Sbjct: 288 LSMRLTVEYFKDHSFDKE--LPEKFSDPTLQHYVIFSNNVIASSVVINSTVMHARESMNL 345 Query: 1320 VFHILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQD-TTTSHLSLPEEFRVSFQ 1144 VFH+LTD QNYFAMKLWFL+NTFK++ +QVLNIE D T SHL+LP EFRVSF Sbjct: 346 
VFHVLTDGQNYFAMKLWFLKNTFKDAVIQVLNIEHLNSEYYDKATLSHLTLPVEFRVSFH 405 Query: 1143 SAKLSNMPH-RTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKV 967 S+ + H RT+Y+S+FS+SHY+LP+IF VQ+DLSAL +++M GKV Sbjct: 406 SSDNAPAIHDRTQYLSIFSHSHYLLPEIFRNLEKVVVLDDDVVVQQDLSALRSLDMAGKV 465 Query: 966 NAAAEHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRE-LN 790 A + C V LG LRSYLG + FD SC+WMSGLNV+DL WR+L +S T+ V+E ++ Sbjct: 466 IGAVQICSVRLGQLRSYLGRSSFDKNSCSWMSGLNVIDLVMWRELGISETYWKLVKEKVS 525 Query: 789 TEGGSDRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPW 610 + GS A ASLLTFQD VY LD WVLSGLGHDY LNI+ + K+AVLHYNGNMKPW Sbjct: 526 MKEGS----ALLASLLTFQDLVYALDSVWVLSGLGHDYGLNIEGIEKAAVLHYNGNMKPW 581 Query: 609 LELGIPKFKNYWTKYLNRDDQFLTDCNVNP 520 L+LGIPK+K YW K+LN++DQFL++CNVNP Sbjct: 582 LDLGIPKYKAYWKKFLNQEDQFLSECNVNP 611 >emb|CAQ58617.1| transferase, transferring glycosyl groups / unknown protein [Vitis vinifera] Length = 541 Score = 617 bits (1590), Expect = e-173 Identities = 306/484 (63%), Positives = 375/484 (77%), Gaps = 8/484 (1%) Frame = -2 Query: 1947 VVGESEISCELKYGSYCLWRREYREDMKDSMVKTLKDRLFVARAYYPSIAKLPDHDKLSR 1768 VV ESE SCELK+GSYCLWR+E+REDMKD MVK LKDRLFVARAYYPS+AKLP HDKLSR Sbjct: 58 VVDESEKSCELKFGSYCLWRQEHREDMKDMMVKKLKDRLFVARAYYPSVAKLPAHDKLSR 117 Query: 1767 ELRQSIQDFERVMSEATVDADLPPLMDKKLQKMDVSISKAKSLIMECNNVDKKFRQLVDL 1588 EL+Q+IQ+ ERV+SEA+ DA+LPP + KKL +M+V+I++AKS+ ++CNNVDKK RQ++D+ Sbjct: 118 ELKQNIQELERVLSEASTDAELPPQIGKKLTRMEVAITRAKSITVDCNNVDKKLRQILDM 177 Query: 1587 TEDETSFHAKQSAFLYQLAVQTMPKSLHCLSMRLTVEYFRDPALDMDDSVAQKYLDQTLY 1408 TEDE FH KQSAFLYQLA+ T PKS HCLSMRLTVEYF+ P LDM+ +KY++ Sbjct: 178 TEDEADFHMKQSAFLYQLAIHTTPKSHHCLSMRLTVEYFKSPPLDMEVQQDEKYMNPASQ 237 Query: 1407 HYVIFSNNILASTVVINSTVMHAKESRKLVFHILTDKQNYFAMKLWFLRNTFKNSAVQVL 1228 HYVIFS N+LASTVVINSTVMH +ES VFH++TD QNYFAMKLWF RNTF+ + VQVL Sbjct: 238 HYVIFSKNVLASTVVINSTVMHTEESGNQVFHVVTDGQNYFAMKLWFSRNTFRQAMVQVL 297 Query: 1227 NIEDERHSTQD-TTTSHLSLPEEFRVSFQSA-KLSNMPHRTEYMSLFSYSHYVLPDIFHX 1054 NIED D T LSLP+EFR+S+ SA L 
RTEY+S+FS+SHY+LP+IF Sbjct: 298 NIEDLNLDHHDEATLLDLSLPQEFRISYGSANNLPTSSMRTEYLSIFSHSHYLLPEIFQN 357 Query: 1053 XXXXXXXXXXXXVQRDLSALWNINMEGKVNAAAEHCDVTLGLLRSYLGVNGFDGKSCAWM 874 VQ+DLSALW+INMEGKVN A E C V LG L+SYLG G D SCAWM Sbjct: 358 LKKVVILDDDIVVQQDLSALWSINMEGKVNGAVEFCRVRLGELKSYLGEKGVDEHSCAWM 417 Query: 873 SGLNVVDLARWRDLNLSGTFR------SFVRELNTEGGSDRAVASHASLLTFQDQVYPLD 712 SGLN++DL RWR+ +++G +R S V++L+ S VA ASLL+FQD VY LD Sbjct: 418 SGLNIIDLVRWREQDVTGLYRRLVQEVSHVQKLSMGEESLGHVALRASLLSFQDLVYALD 477 Query: 711 GSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPWLELGIPKFKNYWTKYLNRDDQFLTDC 532 +WV SGLGH+Y L+ +A+ ++AVLHYNGNMKPWLELGIPK++NYW K+LN D+Q+LT+C Sbjct: 478 DTWVFSGLGHNYHLDTQAIKRAAVLHYNGNMKPWLELGIPKYRNYWRKFLNLDEQYLTEC 537 Query: 531 NVNP 520 NVNP Sbjct: 538 NVNP 541 >gb|EXC35198.1| putative galacturonosyltransferase 7 [Morus notabilis] Length = 626 Score = 615 bits (1586), Expect = e-173 Identities = 327/630 (51%), Positives = 428/630 (67%), Gaps = 12/630 (1%) Frame = -2 Query: 2373 PPKKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHS--------TXXXXXRVDLEDTVN 2218 P K+RW+ LSMLVPLVFLLG HNGF S + R ++D+ Sbjct: 13 PTKRRWRGLVLGVLGLVLLSMLVPLVFLLGFHNGFQSPGFVSEQSSASNPIRGYIKDSSR 72 Query: 2217 TPPVTLRISQRKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQPTR 2038 P Q + ++ L S + + KE+ + G D + + + Sbjct: 73 DTPDLSEGDQSRHVDDLVRRLAPTLSKDIFKKS-----KPKEETIGGVTVHDDVPRKASP 127 Query: 2037 SPPGQSGKKWISKT-DKVKESKPAGSVSEAKVVGESEISCELKYGSYCLWRREYREDMKD 1861 +P + + +S T +K + P K V ES CELKYGS+CLWR+E++E+MKD Sbjct: 128 APAKKVPR--VSPTINKTRADGPTHITKNPKYVDESGKQCELKYGSFCLWRQEHKEEMKD 185 Query: 1860 SMVKTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKK 1681 SMVK LKD+LFVARAYYP+IAKLP DKLSRE++Q+IQ+FER++SE + DADLP + KK Sbjct: 186 SMVKKLKDKLFVARAYYPTIAKLPAQDKLSREMKQNIQEFERILSETSTDADLPSQVQKK 245 Query: 1680 LQKMDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHC 1501 LQKMD I++AKS ++CNNVDKK RQ+ D+TEDE +FH +QS+FLYQLAVQTMPKSLHC Sbjct: 246 LQKMDAVIARAKSFPVDCNNVDKKLRQIFDMTEDEANFHMRQSSFLYQLAVQTMPKSLHC 305 Query: 1500 
LSMRLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKL 1321 LSMRLTV+YF+ P+ D++ S+ +KY+D L HYVIFS N+LAS+ VINSTVMHAKES Sbjct: 306 LSMRLTVDYFKSPS-DVELSLTEKYMDPALQHYVIFSKNVLASSAVINSTVMHAKESVNQ 364 Query: 1320 VFHILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRVSFQS 1141 VFH+LT+ QNY+AMK WF+RNT+K + V+VLNIE Q+ LSLP EFRVSF S Sbjct: 365 VFHVLTNGQNYYAMKQWFIRNTYKEATVRVLNIEALNLENQNL---ELSLPVEFRVSFHS 421 Query: 1140 AKLSNMP---HRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGK 970 + N P RTEY+S FS+SHY+LP IF VQ+DLSALW++NM GK Sbjct: 422 --VDNPPVAQMRTEYLSTFSHSHYLLPQIFQNLKRVVVLDDDVIVQQDLSALWSLNMGGK 479 Query: 969 VNAAAEHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELN 790 VN A + C V L LL+SYLG FD SC WMSGLNV+DL +WR+++L+ T+ ++EL+ Sbjct: 480 VNGAVQMCSVRLNLLKSYLGERSFDKNSCVWMSGLNVIDLDKWREVDLTETYGRLLKELS 539 Query: 789 TEGGSDRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPW 610 G AV ASLL+FQD +Y LD +W LSGLG+DY L+IKA+ ++AVLHYNGNMKPW Sbjct: 540 MGEGLSEAV---ASLLSFQDLIYVLDDAWALSGLGYDYGLDIKAIKRAAVLHYNGNMKPW 596 Query: 609 LELGIPKFKNYWTKYLNRDDQFLTDCNVNP 520 L+LGIPK+++YW + N++DQFL++CNV+P Sbjct: 597 LDLGIPKYRHYWKNFRNQEDQFLSECNVSP 626 >ref|XP_006411083.1| hypothetical protein EUTSA_v10016387mg [Eutrema salsugineum] gi|557112252|gb|ESQ52536.1| hypothetical protein EUTSA_v10016387mg [Eutrema salsugineum] Length = 620 Score = 611 bits (1576), Expect = e-172 Identities = 328/624 (52%), Positives = 419/624 (67%), Gaps = 8/624 (1%) Frame = -2 Query: 2367 KKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVTLRISQ 2188 K+RWK LSMLVPL FLLGLHNGFHS V P + S Sbjct: 16 KRRWKVLVIGVLVLVILSMLVPLAFLLGLHNGFHSPGF----------VTVQPASPFESL 65 Query: 2187 RKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGS------GSKDSMQHQPTRSPPG 2026 +IN ++ L K + GS S DS + SP Sbjct: 66 SRINATKHSQRDLSDRVDDVLHKINPVLPKKSDINVGSRDMNRTSSSDSKKKGLPVSPAV 125 Query: 2025 QSGKKWISKTDKVKESKPA-GSVSEAKVVGESEISCELKYGSYCLWRREYREDMKDSMVK 1849 + +KT K G+++ A E++ +CE+KYGSYCLWR E +E MKD+ VK Sbjct: 126 
VANPSPANKTKTEASYKGVQGAIANAD---ETQKTCEVKYGSYCLWREENKEPMKDAKVK 182 Query: 1848 TLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKKLQKM 1669 +KD LFVARAYYPSIAK+P KL+R+++Q+IQ+FE+++SE++ DADLPP +DKK QKM Sbjct: 183 HMKDLLFVARAYYPSIAKMPSQTKLTRDMKQNIQEFEKILSESSADADLPPQVDKKFQKM 242 Query: 1668 DVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHCLSMR 1489 + ISKAKS ++CNNVDKK RQ++DLTEDE SFH KQS FLYQLAVQTMPKSLHCLSMR Sbjct: 243 EAVISKAKSFPVDCNNVDKKLRQILDLTEDEASFHMKQSVFLYQLAVQTMPKSLHCLSMR 302 Query: 1488 LTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKLVFHI 1309 LTVEYF+ +LD++DS +K+ D +L H+VI S+NILAS+VVINSTV+HA+ES+ VFH+ Sbjct: 303 LTVEYFKSASLDIEDS--EKFSDPSLLHFVIISDNILASSVVINSTVLHARESKNFVFHV 360 Query: 1308 LTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRVSFQSAKLS 1129 LTD+QNYFAMK WF+RN K + +QVLNIE D + LSLP EFRVSF S S Sbjct: 361 LTDEQNYFAMKQWFIRNPCKQATIQVLNIE---KLELDNSDLKLSLPAEFRVSFPSGDNS 417 Query: 1128 -NMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKVNAAAE 952 + +RT Y+SLFS SHY+LP +FH VQRDLS LW+++MEGKVN A + Sbjct: 418 ASQQNRTHYLSLFSQSHYLLPKLFHKLEKVVILDDDVVVQRDLSPLWDLDMEGKVNGAVK 477 Query: 951 HCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELNTEGGSD 772 C V LG L+S L FD +C WMSGLNV+DLARWR+L +S T++ F +E++ S Sbjct: 478 SCSVRLGQLKS-LKRGNFDTNACLWMSGLNVIDLARWRELGVSETYQKFYKEMSGGEESR 536 Query: 771 RAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPWLELGIP 592 A+A ASLLTFQD+VY L+ W LSGLG+DY +N + + +A+LHYNGNMKPWLELGIP Sbjct: 537 EAIALQASLLTFQDKVYALEDKWALSGLGYDYYINTQTIKNAAILHYNGNMKPWLELGIP 596 Query: 591 KFKNYWTKYLNRDDQFLTDCNVNP 520 ++K+YW K+LNR+D+FL+DCNVNP Sbjct: 597 QYKSYWRKHLNREDRFLSDCNVNP 620 >ref|XP_006411082.1| hypothetical protein EUTSA_v10016387mg [Eutrema salsugineum] gi|557112251|gb|ESQ52535.1| hypothetical protein EUTSA_v10016387mg [Eutrema salsugineum] Length = 621 Score = 609 bits (1571), Expect = e-171 Identities = 329/625 (52%), Positives = 419/625 (67%), Gaps = 9/625 (1%) Frame = -2 Query: 2367 
KKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVTLRISQ 2188 K+RWK LSMLVPL FLLGLHNGFHS V P + S Sbjct: 16 KRRWKVLVIGVLVLVILSMLVPLAFLLGLHNGFHSPGF----------VTVQPASPFESL 65 Query: 2187 RKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGS------GSKDSMQHQPTRSPPG 2026 +IN ++ L K + GS S DS + SP Sbjct: 66 SRINATKHSQRDLSDRVDDVLHKINPVLPKKSDINVGSRDMNRTSSSDSKKKGLPVSPAV 125 Query: 2025 QSGKKWISKTDKVKESKPA-GSVSEAKVVGESEISCELKYGSYCLWRREYREDMKDSMVK 1849 + +KT K G+++ A E++ +CE+KYGSYCLWR E +E MKD+ VK Sbjct: 126 VANPSPANKTKTEASYKGVQGAIANAD---ETQKTCEVKYGSYCLWREENKEPMKDAKVK 182 Query: 1848 TLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKKLQKM 1669 +KD LFVARAYYPSIAK+P KL+R+++Q+IQ+FE+++SE++ DADLPP +DKK QKM Sbjct: 183 HMKDLLFVARAYYPSIAKMPSQTKLTRDMKQNIQEFEKILSESSADADLPPQVDKKFQKM 242 Query: 1668 DVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHCLSMR 1489 + ISKAKS ++CNNVDKK RQ++DLTEDE SFH KQS FLYQLAVQTMPKSLHCLSMR Sbjct: 243 EAVISKAKSFPVDCNNVDKKLRQILDLTEDEASFHMKQSVFLYQLAVQTMPKSLHCLSMR 302 Query: 1488 LTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKLVFHI 1309 LTVEYF+ +LD++DS +K+ D +L H+VI S+NILAS+VVINSTV+HA+ES+ VFH+ Sbjct: 303 LTVEYFKSASLDIEDS--EKFSDPSLLHFVIISDNILASSVVINSTVLHARESKNFVFHV 360 Query: 1308 LTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRVSFQSAKLS 1129 LTD+QNYFAMK WF+RN K + +QVLNIE D + LSLP EFRVSF S S Sbjct: 361 LTDEQNYFAMKQWFIRNPCKQATIQVLNIE---KLELDNSDLKLSLPAEFRVSFPSGDNS 417 Query: 1128 -NMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKVNAAAE 952 + +RT Y+SLFS SHY+LP +FH VQRDLS LW+++MEGKVN A + Sbjct: 418 ASQQNRTHYLSLFSQSHYLLPKLFHKLEKVVILDDDVVVQRDLSPLWDLDMEGKVNGAVK 477 Query: 951 HCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELNTEGGSD 772 C V LG L+S L FD +C WMSGLNV+DLARWR+L +S T++ F +E + G Sbjct: 478 SCSVRLGQLKS-LKRGNFDTNACLWMSGLNVIDLARWRELGVSETYQKFYKEQMSGGEES 536 Query: 771 R-AVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPWLELGI 595 R A+A ASLLTFQD+VY L+ W LSGLG+DY +N + + +A+LHYNGNMKPWLELGI Sbjct: 537 
REAIALQASLLTFQDKVYALEDKWALSGLGYDYYINTQTIKNAAILHYNGNMKPWLELGI 596 Query: 594 PKFKNYWTKYLNRDDQFLTDCNVNP 520 P++K+YW K+LNR+D+FL+DCNVNP Sbjct: 597 PQYKSYWRKHLNREDRFLSDCNVNP 621 >ref|XP_002323701.2| glycosyl transferase family 8 family protein [Populus trichocarpa] gi|550321552|gb|EEF05462.2| glycosyl transferase family 8 family protein [Populus trichocarpa] Length = 620 Score = 608 bits (1568), Expect = e-171 Identities = 330/624 (52%), Positives = 421/624 (67%), Gaps = 8/624 (1%) Frame = -2 Query: 2367 KKRWKXXXXXXXXXXXLSMLVPLVFLLGL-HNGFHSTXXXXXRVDLEDTVNTPPVTLRIS 2191 K+RW+ LSMLVPLVFLLGL HNGFHST + L PP +++ Sbjct: 12 KRRWRCLVIGVLFLVLLSMLVPLVFLLGLYHNGFHSTGNSLQQ-HLSLFHPPPPSQIQLP 70 Query: 2190 QRKINHNMWMNL*ND*SLEF----QRXXXXDFLSTK-EKVVKGSGSKDSMQHQPTRSPPG 2026 + NL + +L F ++ FLS + + K S S H R Sbjct: 71 FHFFCCFLLSNLTDTYTLYFLLNTRQPDLFFFLSHQMNSITKLCHSSSSAGHLSDRQTSS 130 Query: 2025 QSGKKWISKTDKVKESKPAGSVSEAKVVGESEISCELKYGSYCLWRREYREDMKDSMVKT 1846 S I+K + V ESE CEL++G YC WR E+RE+MKD MVK Sbjct: 131 ASAVYEITKHKR-------------NAVEESE-KCELRFGGYCHWRDEHRENMKDFMVKK 176 Query: 1845 LKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKKLQKMD 1666 LKD+LFVARAYYPSIAKLP +KL+ EL+Q+IQ+ ER++SE++ DADLPP + KKLQKM+ Sbjct: 177 LKDQLFVARAYYPSIAKLPSQEKLTHELKQNIQELERILSESSTDADLPPQIQKKLQKME 236 Query: 1665 VSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHCLSMRL 1486 ISKAK+ ++CNNVDKK RQ++DLTE+ET+FH KQSAFLYQLAVQTMPK LHCLSMRL Sbjct: 237 NVISKAKTFPVDCNNVDKKLRQILDLTEEETNFHMKQSAFLYQLAVQTMPKGLHCLSMRL 296 Query: 1485 TVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKLVFHIL 1306 VEYF+ A D + ++++Y D +L HYV+FS N+LA++VVINST +HA+ES LVFH+L Sbjct: 297 IVEYFKSSAHDKEFPLSERYSDPSLQHYVVFSTNVLAASVVINSTAVHARESGNLVFHVL 356 Query: 1305 TDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQD-TTTSHLSLPEEFRVSFQSAKLS 1129 TD NY+AMKLWFLRNT+K +AVQVLNIE+ D +SLP E+RVSF + Sbjct: 357 TDGLNYYAMKLWFLRNTYKEAAVQVLNIENVTLKYYDKEVLKSMSLPVEYRVSFPTVTNP 416 Query: 1128 NMPH-RTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKVNAAAE 952 H RTEY+S+FS++HY+LP IF 
VQRDLS LWN+NM KVN A + Sbjct: 417 PASHLRTEYVSVFSHTHYLLPYIFEKLKRVVVLDDDVVVQRDLSDLWNLNMGRKVNGALQ 476 Query: 951 HCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELNTEGGSD 772 C V LG LRSYLG + FD SCAWMSGLNV+DL RWR+L+L+ T+ +E++ SD Sbjct: 477 LCSVQLGQLRSYLGKSIFDKTSCAWMSGLNVIDLVRWRELDLTKTYWKLGQEVSKGTESD 536 Query: 771 RAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPWLELGIP 592 +VA SLLTFQD VYPLDG+W LSGLGHDY ++++A+ K++VLH+NG MKPWLE+GIP Sbjct: 537 ESVALSTSLLTFQDLVYPLDGAWALSGLGHDYGIDVQAIKKASVLHFNGQMKPWLEVGIP 596 Query: 591 KFKNYWTKYLNRDDQFLTDCNVNP 520 K+K+YW ++LNR DQ L +CNVNP Sbjct: 597 KYKHYWKRFLNRHDQLLVECNVNP 620 >ref|XP_002881608.1| GAUT7/LGT7 [Arabidopsis lyrata subsp. lyrata] gi|297327447|gb|EFH57867.1| GAUT7/LGT7 [Arabidopsis lyrata subsp. lyrata] Length = 617 Score = 608 bits (1567), Expect = e-171 Identities = 329/625 (52%), Positives = 420/625 (67%), Gaps = 9/625 (1%) Frame = -2 Query: 2367 KKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVTLRISQ 2188 K+RWK LSMLVPL FLLGLHNGFHS V P + S Sbjct: 13 KRRWKVLVIGVLVLVILSMLVPLAFLLGLHNGFHSPGF----------VTVQPASSFESF 62 Query: 2187 RKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGS------GSKDSMQHQPTRSPPG 2026 +IN ++ L K + GS DS + SP Sbjct: 63 TRINATKHTQRDVSERVDEVLQKINPVLPKKSDINVGSRDMNVTSGTDSKKRGLPVSPTV 122 Query: 2025 QSGKKWISKTDKVKESKPAGSVSEAKVVGESEI--SCELKYGSYCLWRREYREDMKDSMV 1852 + +KT +S+ + + KVV E +CE+KYGSYCLWR E +E MKD+ V Sbjct: 123 VANPSPANKT----KSEASYEGVQRKVVSGDETWRTCEVKYGSYCLWREENKEPMKDTKV 178 Query: 1851 KTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKKLQK 1672 K +KD+LFVARAYYPSIAK+P KL+R+++Q+IQ+FER++SE++ DADLPP +DKKLQK Sbjct: 179 KQMKDQLFVARAYYPSIAKMPSQSKLTRDMKQNIQEFERILSESSQDADLPPQVDKKLQK 238 Query: 1671 MDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHCLSM 1492 M+ I+KAKS ++CNNVDKK RQ++DLTEDE SFH KQS FLYQLAVQTMPKSLHCLSM Sbjct: 239 MEAVIAKAKSFPVDCNNVDKKLRQILDLTEDEASFHMKQSVFLYQLAVQTMPKSLHCLSM 298 Query: 1491 RLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKLVFH 1312 RLTVE+F+ +L +D +++K+ D +L H+VI 
S+NILAS+VVINSTV+HA++S+ VFH Sbjct: 299 RLTVEHFKSASL--EDPISEKFSDPSLLHFVIISDNILASSVVINSTVVHARDSKNFVFH 356 Query: 1311 ILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRVSFQSAK- 1135 +LTD+QNYFAMK WF+RN K S VQVLNIE D + LSLP EFRVSF S Sbjct: 357 VLTDEQNYFAMKQWFVRNPCKQSTVQVLNIE---KLELDDSDMKLSLPAEFRVSFPSGDL 413 Query: 1134 LSNMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKVNAAA 955 L++ +RT Y+SLFS SHY+LP +F VQ++LS LW+++MEGKVN A Sbjct: 414 LASQQNRTHYLSLFSQSHYLLPKLFDKLEKVVVLDDDVVVQQNLSPLWDLDMEGKVNGAV 473 Query: 954 EHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELNTEGGS 775 + C V LG L+S L FD +C WMSGLNVVDLARWR+L +S T++ + +E++ S Sbjct: 474 KLCTVRLGQLKS-LKRGNFDTNACLWMSGLNVVDLARWRELGVSETYQKYYKEMSGGDES 532 Query: 774 DRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPWLELGI 595 A+A ASLLTFQDQVY LD W LSGLG+DY +N +A+ +A+LHYNGNMKPWLELGI Sbjct: 533 SEAIALQASLLTFQDQVYALDDKWALSGLGYDYYINAEAIKNAAILHYNGNMKPWLELGI 592 Query: 594 PKFKNYWTKYLNRDDQFLTDCNVNP 520 PK+KNYW K+LNR+D+FL+DCNVNP Sbjct: 593 PKYKNYWRKHLNREDRFLSDCNVNP 617 >ref|XP_004305055.1| PREDICTED: probable galacturonosyltransferase 7-like [Fragaria vesca subsp. 
vesca] Length = 559 Score = 607 bits (1566), Expect = e-171 Identities = 328/615 (53%), Positives = 417/615 (67%) Frame = -2 Query: 2367 KKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVTLRISQ 2188 KKRWK LSMLVPL+FLLG+HNGF + N+P T I Sbjct: 6 KKRWKGLVVGVLGLVLLSMLVPLLFLLGIHNGFQGYG--------SEQPNSPTHTTPIII 57 Query: 2187 RKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQPTRSPPGQSGKKW 2008 ++ S++E VK KD ++H +PP K Sbjct: 58 KE--------------------------SSEEDHVKHV--KDIVKHF---APPIFKNSK- 85 Query: 2007 ISKTDKVKESKPAGSVSEAKVVGESEISCELKYGSYCLWRREYREDMKDSMVKTLKDRLF 1828 + K + + +K + ES +C++K+GSYCLWR+E++EDMKD MVK LKD LF Sbjct: 86 VEKVEMIDYAKGG--------IDESGKTCQVKFGSYCLWRQEHKEDMKDFMVKKLKDSLF 137 Query: 1827 VARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKKLQKMDVSISKA 1648 VARAY+P+IAKLP K S E++Q+IQ+ E+V+SE+T DADLP ++KKL +M I+KA Sbjct: 138 VARAYFPTIAKLPSQRKFSSEMKQNIQELEKVLSESTTDADLPSQIEKKLARMQAVIAKA 197 Query: 1647 KSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHCLSMRLTVEYFR 1468 K+ +EC+NVDKK RQ+ D+T DE FH KQS FLYQLAVQTMPKSLHCLSMRLTVE+FR Sbjct: 198 KTFPVECHNVDKKLRQIFDMTADEAHFHMKQSVFLYQLAVQTMPKSLHCLSMRLTVEFFR 257 Query: 1467 DPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKLVFHILTDKQNY 1288 DP DM +S+A KY D L HYVIFS N+LAS+VVINSTVMHAKES KLVFH+LTD+QNY Sbjct: 258 DPVYDM-ESLASKYNDPALQHYVIFSTNVLASSVVINSTVMHAKESGKLVFHVLTDQQNY 316 Query: 1287 FAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRVSFQSAKLSNMPHRTE 1108 FA+KLWF RNT+K + VQVLN+E+ D LSLP EFRVSF + RTE Sbjct: 317 FALKLWFYRNTYKEAIVQVLNLEE-----IDNRKLFLSLPVEFRVSFGIGAEA----RTE 367 Query: 1107 YMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKVNAAAEHCDVTLGL 928 Y+S+FS+SHY+LP+IF VQ+DLSALW++NMEGKVNAA + C V L Sbjct: 368 YLSIFSHSHYLLPEIFQKLEKVVVLDEDVVVQQDLSALWSLNMEGKVNAAVQLCSVKL-- 425 Query: 927 LRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELNTEGGSDRAVASHAS 748 S L + FD SCAWMSGLNV+DL +WR+L+L+ T+R FV E +T+ G + A A HAS Sbjct: 426 --SNLLKSSFDKTSCAWMSGLNVIDLVKWRELDLTETYRRFVNEASTQEGQNEAAALHAS 483 Query: 747 
LLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPWLELGIPKFKNYWTK 568 LLTF+D +YPLD W LSGLG+DY++++ A++K+AVLHYNG MKPWL+LGIPK+K +W + Sbjct: 484 LLTFKDLIYPLDSVWALSGLGNDYNIDMSAVTKAAVLHYNGKMKPWLDLGIPKYKVFWKR 543 Query: 567 YLNRDDQFLTDCNVN 523 +LNR+DQFLTDCNVN Sbjct: 544 FLNREDQFLTDCNVN 558 >ref|XP_003623702.1| hypothetical protein MTR_7g074680 [Medicago truncatula] gi|124360299|gb|ABN08312.1| Glycosyl transferase, family 8 [Medicago truncatula] gi|355498717|gb|AES79920.1| hypothetical protein MTR_7g074680 [Medicago truncatula] Length = 645 Score = 607 bits (1566), Expect = e-171 Identities = 320/639 (50%), Positives = 432/639 (67%), Gaps = 19/639 (2%) Frame = -2 Query: 2382 YPLPPKKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTP--P 2209 Y +P K+RW+ LSMLVPLVFLLGLHN FH++ + + NTP P Sbjct: 13 YGVPAKRRWRGLIIAVLGLVILSMLVPLVFLLGLHNSFHTSGY------IYEQRNTPSSP 66 Query: 2208 VTLRISQRKINHNMWMNL*ND*S------LEFQRXXXXDFLSTKEKVVK-GSGSKDSMQH 2050 + ++ + H + + S +F+ D L K K G + + +H Sbjct: 67 NIIEYNRHDVRHKEDKSEGDKTSHVKELITKFEPTLPKDVLKNYSKGDKNGIVNTNEEKH 126 Query: 2049 QPTRSPPGQSGKKWI-----SKTDKVKESKPAGS--VSEAKV--VGESEISCELKYGSYC 1897 + ++PP + + T KV K + V+ K E+ SCEL YGSYC Sbjct: 127 RGVKTPPPLPPNAALQSPPTTNTPKVHNPKHGRTEQVTHPKTSSADETGTSCELTYGSYC 186 Query: 1896 LWRREYREDMKDSMVKTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEAT 1717 LW++E++E MKD+MVK LKD+LFVARAYYPSIAKLP DKLSR+L+QSIQ+ E V+SE++ Sbjct: 187 LWQQEHKEVMKDAMVKKLKDQLFVARAYYPSIAKLPAQDKLSRQLKQSIQELEHVLSESS 246 Query: 1716 VDADLPPLMDKKLQKMDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQ 1537 DADLPPL++ K ++MDV+I++AKS+ + C+NVDKKFRQL DLTEDE FH KQSAFLY+ Sbjct: 247 TDADLPPLVETKSERMDVAIARAKSVPVVCDNVDKKFRQLYDLTEDEADFHRKQSAFLYK 306 Query: 1536 LAVQTMPKSLHCLSMRLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVIN 1357 L V TMPKS HCL+++LTVEYF+ + D +++ ++K+ D +L+HYVIFSNN+LA++VVIN Sbjct: 307 LNVLTMPKSFHCLALKLTVEYFKS-SHDEEEADSEKFEDSSLHHYVIFSNNVLAASVVIN 365 Query: 1356 STVMHAKESRKLVFHILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHL 1177 STV HAK SR VFH+L+D QNY+AMKLWF RN + +AVQVLN+E 
+ + L Sbjct: 366 STVTHAKVSRNQVFHVLSDGQNYYAMKLWFKRNNYGEAAVQVLNVEHLEMDSLKDNSLQL 425 Query: 1176 SLPEEFRVSFQSAKLSNM-PHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLS 1000 SLPEEFRVSF+S +M RTEY+S+FS+SHY+LPDIF +QRDLS Sbjct: 426 SLPEEFRVSFRSYDNPSMGQFRTEYISIFSHSHYLLPDIFSKLKKVVVLDDDVVIQRDLS 485 Query: 999 ALWNINMEGKVNAAAEHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSG 820 +LWN++M KVN A + C V LG L+ YLG GF SCAWMSGLN++DL RWR+ L+ Sbjct: 486 SLWNLDMGEKVNGAVQFCSVRLGQLKGYLGEKGFSHNSCAWMSGLNIIDLVRWREFGLTQ 545 Query: 819 TFRSFVRELNTEGGSDRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAV 640 T++ ++EL+ + GS A A ASLL F++++YPL+ SWV SGLGHDY ++ ++ + V Sbjct: 546 TYKRLIKELSVQKGSTTAAAWPASLLAFENKIYPLNESWVRSGLGHDYKIDSNSIKSAPV 605 Query: 639 LHYNGNMKPWLELGIPKFKNYWTKYLNRDDQFLTDCNVN 523 LHYNG MKPWL+LGIP +K+YW KYLN++DQ L++CNVN Sbjct: 606 LHYNGKMKPWLDLGIPNYKSYWKKYLNKEDQLLSECNVN 644 >ref|XP_006293843.1| hypothetical protein CARUB_v10022827mg [Capsella rubella] gi|482562551|gb|EOA26741.1| hypothetical protein CARUB_v10022827mg [Capsella rubella] Length = 620 Score = 606 bits (1563), Expect = e-170 Identities = 326/623 (52%), Positives = 416/623 (66%), Gaps = 7/623 (1%) Frame = -2 Query: 2367 KKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVTLRISQ 2188 K+RWK LSMLVPL FLLGLHNGFHS V P + S Sbjct: 17 KRRWKVLVIGVLVLVILSMLVPLAFLLGLHNGFHSPGF----------VTVQPASNFESF 66 Query: 2187 RKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQPTR------SPPG 2026 +IN ++ L K + GS + + SP Sbjct: 67 TRINATKHTQRDVSERVDEVLQKINPVLPKKSDINVGSSDMNGTSGSDIKIRGIPGSPTV 126 Query: 2025 QSGKKWISKTDKVKESKPAGSVSEAKVVGESEISCELKYGSYCLWRREYREDMKDSMVKT 1846 + +KT V K G+ + E+ +CE+KYGSYCLWR E +E MKD+ VK Sbjct: 127 VANPSPANKTKIVASGK--GTQRKIASTDETWRTCEVKYGSYCLWREENKEAMKDAKVKQ 184 Query: 1845 LKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKKLQKMD 1666 +KD+LFVARAYYPSIAK+P +KL+R+++Q+IQ+FER++SE++ DADLPP ++KKLQKM+ Sbjct: 185 MKDQLFVARAYYPSIAKMPSQNKLTRDMKQNIQEFERILSESSQDADLPPQVEKKLQKME 244 Query: 1665 VSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHCLSMRL 1486 
I+KAKS ++CNNVDKK RQ++DLTEDE SFH KQS FLYQLAVQTMPKSLHCLSMRL Sbjct: 245 AVIAKAKSFPVDCNNVDKKLRQILDLTEDEASFHMKQSVFLYQLAVQTMPKSLHCLSMRL 304 Query: 1485 TVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKLVFHIL 1306 TVE+F+ +L +D +++K+ D +L+H+VI S+NILAS+VVINSTV+HA +SR VFH+L Sbjct: 305 TVEHFKSASL--EDPISEKFSDPSLFHFVIISDNILASSVVINSTVLHAMDSRNFVFHVL 362 Query: 1305 TDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRVSFQSAK-LS 1129 TD+QNYFAMK WF+RN K S VQVLNIE D + LSLP EFRVSF S L+ Sbjct: 363 TDEQNYFAMKQWFVRNPCKQSTVQVLNIE---KLELDDSDMKLSLPAEFRVSFPSGDLLA 419 Query: 1128 NMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKVNAAAEH 949 + +RT Y+SLFS SHY+LP +F VQRDLS LW+++MEGKVN A + Sbjct: 420 SQQNRTHYLSLFSQSHYLLPKLFAKLKKVVILDDDVVVQRDLSPLWDLDMEGKVNGAVKS 479 Query: 948 CDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELNTEGGSDR 769 C V LG L G FD +C WMSGLNVVDLARWR+L +S T++ F +E++ S Sbjct: 480 CTVRLGQLSLKRG--SFDNNACLWMSGLNVVDLARWRELGVSETYQKFYKEMSGGDESSE 537 Query: 768 AVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPWLELGIPK 589 A+A ASLLTFQD+VY LD W LSGLG+D+ +N +A+ +AVLHYNGNMKPWLELGIPK Sbjct: 538 AIALQASLLTFQDKVYALDDKWALSGLGYDHYVNAQAIKNAAVLHYNGNMKPWLELGIPK 597 Query: 588 FKNYWTKYLNRDDQFLTDCNVNP 520 +KNYW K+L+R+D+FL+DCNVNP Sbjct: 598 YKNYWRKHLSREDRFLSDCNVNP 620 >ref|XP_004147522.1| PREDICTED: probable galacturonosyltransferase 7-like [Cucumis sativus] Length = 612 Score = 606 bits (1562), Expect = e-170 Identities = 323/624 (51%), Positives = 418/624 (66%), Gaps = 3/624 (0%) Frame = -2 Query: 2382 YPLPPKKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVT 2203 Y P K+RW+ LSMLVPLVFLLGL+NGFH+ D N+ P Sbjct: 14 YGFPAKRRWRGLVIGVLGLVILSMLVPLVFLLGLYNGFHTAGYA------SDPQNSKPGF 67 Query: 2202 LRISQRKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQPTRSPPGQ 2023 + + L D FQ+ T + + + +P PP + Sbjct: 68 QPSHVDDVIRKLGPTLPKD---VFQKYAIEPKKETVDFIHESQ--------EPKGLPPPK 116 Query: 2022 SGKKWISKTDKVKESKPAGSVSEAK---VVGESEISCELKYGSYCLWRREYREDMKDSMV 1852 + K +K G V V ES CE K+GSYC+WR+E+RE +KDSMV Sbjct: 117 
VDA--LPKHTHENSTKVGGRVQPTDRMTAVDESGKPCEWKFGSYCIWRQEHREVIKDSMV 174 Query: 1851 KTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKKLQK 1672 K LKD+LFVARAYYP+IAKLP +L++E++Q+IQ+ ERV+SE+T D DLP ++KK K Sbjct: 175 KKLKDQLFVARAYYPTIAKLPTQSQLTQEMKQNIQELERVLSESTTDLDLPLQIEKKSLK 234 Query: 1671 MDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHCLSM 1492 M+ +I+KAKS ++CNNVDKK RQ+ D+TEDE +FH KQSAFL+QLAVQTMPKS+HCLSM Sbjct: 235 MEATIAKAKSFPVDCNNVDKKLRQIFDMTEDEANFHMKQSAFLFQLAVQTMPKSMHCLSM 294 Query: 1491 RLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKLVFH 1312 +LTVEYFR + ++ S A+KY D TL HY+IFSNNILAS+VVINSTV ++KESR VFH Sbjct: 295 QLTVEYFRIYSTKLELSQAEKYSDPTLNHYIIFSNNILASSVVINSTVSNSKESRNQVFH 354 Query: 1311 ILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRVSFQSAKL 1132 +LTD QNYFAM LWFLRN+++ +AV+V+N+E + + T LP+EFR+SF++ Sbjct: 355 VLTDGQNYFAMNLWFLRNSYEEAAVEVINVEQLKLDDHENVT--FVLPQEFRISFRTLTH 412 Query: 1131 SNMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKVNAAAE 952 S RTEY+S+FS+ HY+LP+IF VQRDLSALW+++M+GKVN AA+ Sbjct: 413 S----RTEYISMFSHLHYLLPEIFKNLDKVVVLEDDVIVQRDLSALWSLDMDGKVNGAAQ 468 Query: 951 HCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELNTEGGSD 772 C V LG L+S LG NG+ C WMSGLNV+DLA+WR+L+LS TFRS VREL +GGS Sbjct: 469 CCHVRLGELKSILGENGYVQNDCTWMSGLNVIDLAKWRELDLSQTFRSLVRELTMQGGST 528 Query: 771 RAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPWLELGIP 592 AVA ASLLTFQ +Y LD SW L GLGHDY LN++ + +A LHYNG +KPWLELGIP Sbjct: 529 DAVALRASLLTFQSLIYALDDSWSLYGLGHDYKLNVQDVENAATLHYNGYLKPWLELGIP 588 Query: 591 KFKNYWTKYLNRDDQFLTDCNVNP 520 K+K YW K+L+R+D FL+ CN+NP Sbjct: 589 KYKAYWKKFLDREDPFLSKCNINP 612 >ref|NP_565893.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana] gi|334184793|ref|NP_001189702.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana] gi|75216987|sp|Q9ZVI7.2|GAUT7_ARATH RecName: Full=Probable galacturonosyltransferase 7; AltName: Full=Like glycosyl transferase 7 gi|15293097|gb|AAK93659.1| unknown protein [Arabidopsis thaliana] 
gi|20197396|gb|AAC67353.2| expressed protein [Arabidopsis thaliana] gi|20259303|gb|AAM14387.1| unknown protein [Arabidopsis thaliana] gi|330254468|gb|AEC09562.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana] gi|330254469|gb|AEC09563.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana] Length = 619 Score = 606 bits (1562), Expect = e-170 Identities = 328/630 (52%), Positives = 424/630 (67%), Gaps = 14/630 (2%) Frame = -2 Query: 2367 KKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHS----TXXXXXRVDLEDTVNTPPVTL 2200 K+RWK LSMLVPL FLLGLHNGFHS T + +N T Sbjct: 15 KRRWKVLVIGVLVLVILSMLVPLAFLLGLHNGFHSPGFVTVQPASSFESFTRINATKHTQ 74 Query: 2199 R-ISQR------KINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQPT 2041 R +S+R KIN + ++ + V + DS + Sbjct: 75 RDVSERVDEVLQKINPVL---------------PKKSDINVGSRDVNATSGTDSKKRGLP 119 Query: 2040 RSPPGQSGKKWISKTDKVKESKPAGSVSEAKVVGESEI--SCELKYGSYCLWRREYREDM 1867 SP + +KT +S+ + + + K+V E +CE+KYGSYCLWR E +E M Sbjct: 120 VSPTVVANPSPANKT----KSEASYTGVQRKIVSGDETWRTCEVKYGSYCLWREENKEPM 175 Query: 1866 KDSMVKTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMD 1687 KD+ VK +KD+LFVARAYYPSIAK+P KL+R+++Q+IQ+FER++SE++ DADLPP +D Sbjct: 176 KDAKVKQMKDQLFVARAYYPSIAKMPSQSKLTRDMKQNIQEFERILSESSQDADLPPQVD 235 Query: 1686 KKLQKMDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSL 1507 KKLQKM+ I+KAKS ++CNNVDKK RQ++DLTEDE SFH KQS FLYQLAVQTMPKSL Sbjct: 236 KKLQKMEAVIAKAKSFPVDCNNVDKKLRQILDLTEDEASFHMKQSVFLYQLAVQTMPKSL 295 Query: 1506 HCLSMRLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESR 1327 HCLSMRLTVE+F+ +L +D +++K+ D +L H+VI S+NILAS+VVINSTV+HA++S+ Sbjct: 296 HCLSMRLTVEHFKSDSL--EDPISEKFSDPSLLHFVIISDNILASSVVINSTVVHARDSK 353 Query: 1326 KLVFHILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRVSF 1147 VFH+LTD+QNYFAMK WF+RN K S VQVLNIE D + LSL EFRVSF Sbjct: 354 NFVFHVLTDEQNYFAMKQWFIRNPCKQSTVQVLNIE---KLELDDSDMKLSLSAEFRVSF 410 Query: 1146 QSAK-LSNMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGK 970 S L++ +RT Y+SLFS SHY+LP +F VQRDLS LW+++MEGK Sbjct: 411 
PSGDLLASQQNRTHYLSLFSQSHYLLPKLFDKLEKVVILDDDVVVQRDLSPLWDLDMEGK 470 Query: 969 VNAAAEHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELN 790 VN A + C V LG LRS L FD +C WMSGLNVVDLARWR L +S T++ + +E++ Sbjct: 471 VNGAVKSCTVRLGQLRS-LKRGNFDTNACLWMSGLNVVDLARWRALGVSETYQKYYKEMS 529 Query: 789 TEGGSDRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPW 610 + S A+A ASLLTFQDQVY LD W LSGLG+DY +N +A+ +A+LHYNGNMKPW Sbjct: 530 SGDESSEAIALQASLLTFQDQVYALDDKWALSGLGYDYYINAQAIKNAAILHYNGNMKPW 589 Query: 609 LELGIPKFKNYWTKYLNRDDQFLTDCNVNP 520 LELGIP +KNYW ++L+R+D+FL+DCNVNP Sbjct: 590 LELGIPNYKNYWRRHLSREDRFLSDCNVNP 619 >ref|XP_004163983.1| PREDICTED: probable galacturonosyltransferase 7-like [Cucumis sativus] Length = 612 Score = 605 bits (1561), Expect = e-170 Identities = 323/624 (51%), Positives = 418/624 (66%), Gaps = 3/624 (0%) Frame = -2 Query: 2382 YPLPPKKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVT 2203 Y P K+RW+ LSMLVPLVFLLGL+NGFH+ D N+ P Sbjct: 14 YGFPAKRRWRGLVIGVLGLVILSMLVPLVFLLGLYNGFHTAGYA------SDPQNSKPGF 67 Query: 2202 LRISQRKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGSGSKDSMQHQPTRSPPGQ 2023 + + L D FQ+ T + + + +P PP + Sbjct: 68 QPSHVDDVIRKLGPTLPKD---VFQKYAIEPKKETVDFIHESQ--------EPKGLPPPK 116 Query: 2022 SGKKWISKTDKVKESKPAGSVSEAK---VVGESEISCELKYGSYCLWRREYREDMKDSMV 1852 + K +K G V V ES CE K+GSYC+WR+E+RE +KDSMV Sbjct: 117 VDA--LPKHTHENSTKVGGRVQPTDRMTAVDESGKPCEWKFGSYCIWRQEHREVIKDSMV 174 Query: 1851 KTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPLMDKKLQK 1672 K LKD+LFVARAYYP+IAKLP +L++E++Q+IQ+ ERV+SE+T D DLP ++KK K Sbjct: 175 KKLKDQLFVARAYYPTIAKLPTQSQLTQEMKQNIQELERVLSESTTDLDLPLQIEKKSLK 234 Query: 1671 MDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPKSLHCLSM 1492 M+ +I+KAKS ++CNNVDKK RQ+ D+TEDE +FH KQSAFL+QLAVQTMPKS+HCLSM Sbjct: 235 MEATIAKAKSFPVDCNNVDKKLRQIFDMTEDEANFHMKQSAFLFQLAVQTMPKSMHCLSM 294 Query: 1491 RLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKESRKLVFH 1312 +LTVEYFR + ++ S A+KY D TL HY+IFSNNILAS+VVINSTV ++KESR VFH Sbjct: 295 
QLTVEYFRIYSTKLELSQAEKYSDPTLNHYIIFSNNILASSVVINSTVSNSKESRNQVFH 354 Query: 1311 ILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRVSFQSAKL 1132 +LTD QNYFAM LWFLRN+++ +AV+V+N+E + + T LP+EFR+SF++ Sbjct: 355 VLTDGQNYFAMNLWFLRNSYEEAAVEVINVEQLKLDDHENVT--FVLPQEFRISFRTLTH 412 Query: 1131 SNMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINMEGKVNAAAE 952 S RTEY+S+FS+ HY+LP+IF VQRDLSALW+++M+GKVN AA+ Sbjct: 413 S----RTEYISMFSHLHYLLPEIFKNLDKVVVLEDDVIVQRDLSALWSLDMDGKVNGAAQ 468 Query: 951 HCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRELNTEGGSD 772 C V LG L+S LG NG+ C WMSGLNV+DLA+WR+L+LS TFRS VREL +GGS Sbjct: 469 CCHVRLGELKSILGENGYVQNDCTWMSGLNVIDLAKWRELDLSQTFRSLVRELTMQGGST 528 Query: 771 RAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMKPWLELGIP 592 AVA ASLLTFQ +Y LD SW L GLGHDY LN++ + +A LHYNG +KPWLELGIP Sbjct: 529 DAVALRASLLTFQSLIYALDDSWSLYGLGHDYKLNVQDVENAATLHYNGYLKPWLELGIP 588 Query: 591 KFKNYWTKYLNRDDQFLTDCNVNP 520 K+K YW K+L+R+D FL+ CN+NP Sbjct: 589 KYKAYWKKFLDREDLFLSKCNINP 612 >ref|XP_003551632.2| PREDICTED: probable galacturonosyltransferase 7-like [Glycine max] Length = 627 Score = 598 bits (1542), Expect = e-168 Identities = 307/632 (48%), Positives = 428/632 (67%), Gaps = 12/632 (1%) Frame = -2 Query: 2382 YPLPPKKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVT 2203 Y +P K+RW+ LSMLVPLVFLLGLHNGFHS+ + ++T + Sbjct: 17 YGVPAKRRWRGLVIAVLGLVILSMLVPLVFLLGLHNGFHSSGYIYEQ---KNTPSNEKSL 73 Query: 2202 LRISQRKINHNMWMNL*ND*SLEFQRXXXXDFLSTK-------EKVVKGSGSKDSMQHQP 2044 R + + HN E ++ + L TK + + K + S + + Sbjct: 74 ERYDRHDVGHN---------ESEGEQSSHVEDLITKFEPTLPKDVLKKYTREGKSDKQRG 124 Query: 2043 TRSPPGQSGKKWISKTDKVKESKPAGSVSEAK-----VVGESEISCELKYGSYCLWRREY 1879 +R+PP K + ++ S +G + + E SCEL +GSYCLW++E+ Sbjct: 125 SRAPP-----KGVLQSPPTSNSPRSGQIEQVNNPKTSSTDEGGKSCELTFGSYCLWQQEH 179 Query: 1878 REDMKDSMVKTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLP 1699 R++MKD++VK LKD+LFVARAYYPS+AKLP +DKLSR+L+Q+IQ+ E ++SE+T DADLP Sbjct: 180 RQEMKDALVKKLKDQLFVARAYYPSLAKLPANDKLSRQLKQNIQEMEHMLSESTTDADLP 
239 Query: 1698 PLMDKKLQKMDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTM 1519 P+ + +KM+ +I++ KS+ + C+NVDKK RQ+ DLTEDE +FH KQSAFLY+L VQTM Sbjct: 240 PVAESYSKKMEKTITRVKSIPVVCDNVDKKLRQIFDLTEDEANFHMKQSAFLYKLNVQTM 299 Query: 1518 PKSLHCLSMRLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHA 1339 PKS HCLS++LTVEYF+ D + + +K++D +L+HYVIFSNN+LA++VVINSTV HA Sbjct: 300 PKSHHCLSLKLTVEYFKSSHND-EKADEEKFIDSSLHHYVIFSNNVLAASVVINSTVFHA 358 Query: 1338 KESRKLVFHILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEF 1159 KES LVFH+LTD +NY+A+KLWFLRN +K +AVQVLN+E + +Q LSLPEEF Sbjct: 359 KESSNLVFHVLTDGENYYAIKLWFLRNHYKEAAVQVLNVELD---SQKENPLLLSLPEEF 415 Query: 1158 RVSFQSAKLSNMPHRTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINM 979 R+SF+ N RTEY+S+FS SHY+LP +F +Q+DLSALWNI++ Sbjct: 416 RISFRDNPSRNRI-RTEYLSIFSDSHYLLPHLFSNLNKVVVLDDDVVIQQDLSALWNIDL 474 Query: 978 EGKVNAAAEHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVR 799 KVN A + C V LG L+SYLG GF SCAWMSGLN++DL RWR+L L+ T+R ++ Sbjct: 475 GHKVNGAVQFCSVKLGKLKSYLGEKGFSQNSCAWMSGLNIIDLVRWRELGLTQTYRKLIK 534 Query: 798 ELNTEGGSDRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNM 619 E + GS +A ASLLTF++++YPL+ SWV+SG+GHDY + + + ++VLHYNG M Sbjct: 535 EFTMQEGSVEGIAWRASLLTFENEIYPLNESWVVSGMGHDYTIGTQPIKTASVLHYNGKM 594 Query: 618 KPWLELGIPKFKNYWTKYLNRDDQFLTDCNVN 523 KPWL+LGIP++K+YW K+LN++D L++CNVN Sbjct: 595 KPWLDLGIPQYKSYWKKFLNKEDHLLSECNVN 626 >ref|XP_004492632.1| PREDICTED: probable galacturonosyltransferase 7-like, partial [Cicer arietinum] Length = 627 Score = 597 bits (1539), Expect = e-168 Identities = 310/631 (49%), Positives = 419/631 (66%), Gaps = 11/631 (1%) Frame = -2 Query: 2382 YPLPPKKRWKXXXXXXXXXXXLSMLVPLVFLLGLHNGFHSTXXXXXRVDLEDTVNTPPVT 2203 Y +P K+RW+ LSMLVPLVFLLGLHNGFHS+ + + Sbjct: 5 YGVPAKRRWRGFVIAVLGLVILSMLVPLVFLLGLHNGFHSSGYIYEQRSTPSSQKGLERY 64 Query: 2202 LRISQRKINHNMWMNL*ND*SLEFQRXXXXDFLSTKEKVVKGS----GSKDSMQH----- 2050 R +++ ++ D +F+ D L + + K G+ D Sbjct: 65 DRHDEKQSEGEKSSHV-QDLITKFEPTLPKDVLDSYARGDKNGTVSRGASDEKHKGVKAP 123 Query: 2049 
-QPTRSPPGQSGKKWISKTDKVKESKPAGSVSEAKVVGESEISCELKYGSYCLWRREYRE 1873 P PP + + ++V K K SCEL YGSYCLW++E++E Sbjct: 124 PNPVPQPPPAFNNPKVDRIEQVAHPKTNSPDENGK-------SCELTYGSYCLWQQEHKE 176 Query: 1872 DMKDSMVKTLKDRLFVARAYYPSIAKLPDHDKLSRELRQSIQDFERVMSEATVDADLPPL 1693 MKD+MVK LKD+LFVARAYYPSIAKLP DKLSR+L+Q+IQ+ E V+SE++ DADLPPL Sbjct: 177 VMKDAMVKKLKDQLFVARAYYPSIAKLPAQDKLSRQLKQNIQELEHVLSESSTDADLPPL 236 Query: 1692 MDKKLQKMDVSISKAKSLIMECNNVDKKFRQLVDLTEDETSFHAKQSAFLYQLAVQTMPK 1513 ++ K + M+++I+KAKS+ + C+NVDKK RQ+ DLTEDE FH KQSAFLY+L VQTMPK Sbjct: 237 VETKSENMEIAIAKAKSVPVVCDNVDKKLRQIYDLTEDEAEFHMKQSAFLYRLNVQTMPK 296 Query: 1512 SLHCLSMRLTVEYFRDPALDMDDSVAQKYLDQTLYHYVIFSNNILASTVVINSTVMHAKE 1333 S HCL+++LTVEYF+ + + +++ ++K+ D +L+HYVIFSNN+LA++VVINSTV HAK Sbjct: 297 SFHCLALKLTVEYFKS-SHNEEEADSEKFEDSSLHHYVIFSNNVLAASVVINSTVTHAKV 355 Query: 1332 SRKLVFHILTDKQNYFAMKLWFLRNTFKNSAVQVLNIEDERHSTQDTTTSHLSLPEEFRV 1153 SR VFH+L+D QNY+AMKLWF RN ++ +AVQVLN+E + LSLPEEFRV Sbjct: 356 SRNQVFHVLSDGQNYYAMKLWFRRNNYREAAVQVLNVEHLEMDSLKDNPLQLSLPEEFRV 415 Query: 1152 SFQSAKLSNMPH-RTEYMSLFSYSHYVLPDIFHXXXXXXXXXXXXXVQRDLSALWNINME 976 SF+S +M RTEY+S+FS+SHY+LPDIF +Q+DLSALWN++M Sbjct: 416 SFRSYDNPSMGQFRTEYVSIFSHSHYLLPDIFSKLKKVVVLDDDIVIQQDLSALWNLDMG 475 Query: 975 GKVNAAAEHCDVTLGLLRSYLGVNGFDGKSCAWMSGLNVVDLARWRDLNLSGTFRSFVRE 796 KVN A + C V LG L+SYLG F SCAWMSGLNV+DL RWR+L L+ T++ ++E Sbjct: 476 EKVNGAVQFCSVRLGQLKSYLGEKSFGQNSCAWMSGLNVIDLVRWRELGLTKTYKRLIKE 535 Query: 795 LNTEGGSDRAVASHASLLTFQDQVYPLDGSWVLSGLGHDYDLNIKALSKSAVLHYNGNMK 616 L+ + GS A ASLLTF++++YPL+ SWV SGLGH Y ++ ++ + VLHYNG MK Sbjct: 536 LSAQKGSTATAAWPASLLTFENKIYPLNESWVQSGLGHAYKIDSNSIKTAPVLHYNGKMK 595 Query: 615 PWLELGIPKFKNYWTKYLNRDDQFLTDCNVN 523 PWL+LGIP +K+YW K+LN++DQ L++CNVN Sbjct: 596 PWLDLGIPNYKSYWKKFLNKEDQLLSECNVN 626