BLASTX nr result
ID: Cocculus23_contig00014553
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00014553
         (1533 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268...   197   1e-47
emb|CBI20855.3| unnamed protein product [Vitis vinifera]              188   5e-45
ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g...   188   5e-45
ref|XP_002309547.2| hypothetical protein POPTR_0006s25540g [Popu...   187   1e-44
emb|CBI28020.3| unnamed protein product [Vitis vinifera]              187   1e-44
ref|XP_002280595.2| PREDICTED: probable glycosyltransferase At5g...   185   5e-44
emb|CAN76867.1| hypothetical protein VITISV_012309 [Vitis vinifera]   185   5e-44
ref|XP_002324801.2| hypothetical protein POPTR_0018s00290g [Popu...   182   3e-43
gb|EXB59797.1| putative glycosyltransferase [Morus notabilis]         180   2e-42
ref|XP_006476045.1| PREDICTED: probable glycosyltransferase At5g...   178   5e-42
ref|XP_006451252.1| hypothetical protein CICLE_v10007703mg [Citr...   177   9e-42
gb|EYU27286.1| hypothetical protein MIMGU_mgv1a002540mg [Mimulus...   176   3e-41
ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]...   176   3e-41
ref|XP_006353481.1| PREDICTED: probable glycosyltransferase At5g...   175   4e-41
ref|XP_007204228.1| hypothetical protein PRUPE_ppa002755mg [Prun...   175   4e-41
ref|XP_006450684.1| hypothetical protein CICLE_v10007698mg [Citr...   174   7e-41
ref|XP_007012125.1| Exostosin family protein, putative isoform 2...   174   1e-40
ref|XP_004251626.1| PREDICTED: probable glycosyltransferase At5g...
174 1e-40 gb|EXB59796.1| putative glycosyltransferase [Morus notabilis] 172 3e-40 ref|XP_007013073.1| Exostosin family protein [Theobroma cacao] g... 172 4e-40 >ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268163 [Vitis vinifera] Length = 738 Score = 197 bits (500), Expect = 1e-47 Identities = 119/269 (44%), Positives = 150/269 (55%), Gaps = 14/269 (5%) Frame = +2 Query: 761 NNGESVDD----DIQLQMEKAGIRGXXXXXXXXXXXXIVASTNLTLLRKSDPNVASNDVP 928 N G S D+ D L G G + + N T + K N +++ Sbjct: 221 NEGISTDNIVKADASLTPSTPGSLGTTFKSHLLASPGVDSLFNTTYIEKMASNGNASNHL 280 Query: 929 INTNVSSTGKQATETLSKDEKAVSLQSGPVILNNSPKMTRDRVLKRNRMKKP----IXXX 1096 T++SS GK E LSKDE + LQS LNN+ MT + K+ + + P Sbjct: 281 TATDISSVGKPEKEILSKDENLLVLQSDLADLNNNSAMTSNPGRKKMQSEMPPKSVTSIY 340 Query: 1097 XXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIKKNGELDASLFKNVSMFKR 1276 W S DQE+L AK Q++NAP +K + EL A LF+NVSMFKR Sbjct: 341 DMNRRLVRHRASSRAMRPRWASPRDQEMLAAKLQIQNAPRVKNDPELHAPLFRNVSMFKR 400 Query: 1277 SYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAHLFYL 1456 SYELMERILKVYVY +GEKPIFHQP LKG+YASEGWFM+LME NK FVVKDPR+A LFY+ Sbjct: 401 SYELMERILKVYVYKDGEKPIFHQPILKGLYASEGWFMKLMERNKHFVVKDPRQAQLFYM 460 Query: 1457 PFSAKVLRYTL------DRNNLSNFLGSY 1525 PFS+++L Y L +R NL +L Y Sbjct: 461 PFSSRMLEYKLYVRNSHNRTNLRQYLKQY 489 >emb|CBI20855.3| unnamed protein product [Vitis vinifera] Length = 618 Score = 188 bits (478), Expect = 5e-45 Identities = 109/260 (41%), Positives = 148/260 (56%), Gaps = 6/260 (2%) Frame = +2 Query: 767 GESVDDDIQLQMEKAGIRGXXXXXXXXXXXXIVASTNLTLLRKSDPNVASNDVPINTNVS 946 G+ +DD+ L +++ G I++S+N T L DP+ + + S Sbjct: 139 GKIQEDDMALLSQRSERSGVGLISPLPALPQIISSSNTTSLTNLDPH----PITLPPERS 194 Query: 947 STGKQATETLSKDEKAVSLQSGPVILNNSPKMTRDRVLKRNRMKKPIXXXXXXXXXXXXX 1126 S + A TL+KDEKA + Q + L+N ++ + R + Sbjct: 195 SVEEDAAHTLNKDEKAETSQKD-LTLSNRSSISVPALETRPELPAVTTISEMNDLLVQSR 253 Query: 1127 XXXXXXXXHWPSVCDQELLYAKSQVENAPIIKKNGELDASLFKNVSMFKRSYELMERILK 1306 W S D+ELLYAKSQ+ENAPIIK + L ASL++NVS+FKRSYELME LK Sbjct: 254 
ASSRSMKPRWSSAVDKELLYAKSQIENAPIIKNDPGLHASLYRNVSVFKRSYELMENTLK 313 Query: 1307 VYVYGEGEKPIFHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAHLFYLPFSAKVLRYT 1486 VY Y EGE+P+FHQP +KGIYASEGWFM+LM+ NK+FV K+ RKAHLFYLPFS+ +L Sbjct: 314 VYTYREGERPVFHQPPIKGIYASEGWFMKLMQANKKFVTKNGRKAHLFYLPFSSLMLEEA 373 Query: 1487 L------DRNNLSNFLGSYV 1528 L R NL +L +Y+ Sbjct: 374 LYVPNSHSRKNLEQYLKNYL 393 >ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera] Length = 675 Score = 188 bits (478), Expect = 5e-45 Identities = 109/260 (41%), Positives = 148/260 (56%), Gaps = 6/260 (2%) Frame = +2 Query: 767 GESVDDDIQLQMEKAGIRGXXXXXXXXXXXXIVASTNLTLLRKSDPNVASNDVPINTNVS 946 G+ +DD+ L +++ G I++S+N T L DP+ + + S Sbjct: 167 GKIQEDDMALLSQRSERSGVGLISPLPALPQIISSSNTTSLTNLDPH----PITLPPERS 222 Query: 947 STGKQATETLSKDEKAVSLQSGPVILNNSPKMTRDRVLKRNRMKKPIXXXXXXXXXXXXX 1126 S + A TL+KDEKA + Q + L+N ++ + R + Sbjct: 223 SVEEDAAHTLNKDEKAETSQKD-LTLSNRSSISVPALETRPELPAVTTISEMNDLLVQSR 281 Query: 1127 XXXXXXXXHWPSVCDQELLYAKSQVENAPIIKKNGELDASLFKNVSMFKRSYELMERILK 1306 W S D+ELLYAKSQ+ENAPIIK + L ASL++NVS+FKRSYELME LK Sbjct: 282 ASSRSMKPRWSSAVDKELLYAKSQIENAPIIKNDPGLHASLYRNVSVFKRSYELMENTLK 341 Query: 1307 VYVYGEGEKPIFHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAHLFYLPFSAKVLRYT 1486 VY Y EGE+P+FHQP +KGIYASEGWFM+LM+ NK+FV K+ RKAHLFYLPFS+ +L Sbjct: 342 VYTYREGERPVFHQPPIKGIYASEGWFMKLMQANKKFVTKNGRKAHLFYLPFSSLMLEEA 401 Query: 1487 L------DRNNLSNFLGSYV 1528 L R NL +L +Y+ Sbjct: 402 LYVPNSHSRKNLEQYLKNYL 421 >ref|XP_002309547.2| hypothetical protein POPTR_0006s25540g [Populus trichocarpa] gi|550337072|gb|EEE93070.2| hypothetical protein POPTR_0006s25540g [Populus trichocarpa] Length = 705 Score = 187 bits (475), Expect = 1e-44 Identities = 101/227 (44%), Positives = 135/227 (59%), Gaps = 9/227 (3%) Frame = +2 Query: 875 NLTLLRKSDPNVASNDVPINTNVSSTGKQATETLSKDEKAVSLQSGPVILNNSPKMTRDR 1054 N+TL ++P+ ++ VPI +N S K A +L D K + +L+N+P +T Sbjct: 230 NITLQMNAEPSTIAHIVPIESNTSKVDKDAAPSLENDGKTGDQKKDLTLLHNNPSVTSFP 289 Query: 1055 
VLKRNRMK---KPIXXXXXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIKK 1225 +K+ + + WPSV DQELL AKSQ++NAPI++ Sbjct: 290 EVKKEPQTPSLEVVSISEMKNLQLQRWSSPNSRRPRWPSVVDQELLNAKSQIQNAPIVEN 349 Query: 1226 NGELDASLFKNVSMFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELMEG 1405 + L A L+ N+SMFK+SYELME ILKVY+Y EGE PIFHQP L GIYASEGWFM+L+EG Sbjct: 350 DPVLYAPLYWNISMFKKSYELMEDILKVYIYKEGEMPIFHQPLLNGIYASEGWFMKLLEG 409 Query: 1406 NKQFVVKDPRKAHLFYLPFSAKVLRYTL------DRNNLSNFLGSYV 1528 NK+FV KD +KAHLFYLPFS++ L L NL +L Y+ Sbjct: 410 NKKFVTKDSKKAHLFYLPFSSRYLEIRLYVPNSHSHKNLIEYLKKYL 456 >emb|CBI28020.3| unnamed protein product [Vitis vinifera] Length = 665 Score = 187 bits (474), Expect = 1e-44 Identities = 107/220 (48%), Positives = 132/220 (60%), Gaps = 10/220 (4%) Frame = +2 Query: 896 SDPNVASNDVPINTNVSSTGKQATETLSKDEKAVSLQSGPVILNNSPKMTRDRVLKRNRM 1075 S N+ D + S+ G E LSKDE + LQS LNN+ MT + K+ + Sbjct: 257 STDNIVKADASLTP--STPGSLEKEILSKDENLLVLQSDLADLNNNSAMTSNPGRKKMQS 314 Query: 1076 KKP----IXXXXXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIKKNGELDA 1243 + P W S DQE+L AK Q++NAP +K + EL A Sbjct: 315 EMPPKSVTSIYDMNRRLVRHRASSRAMRPRWASPRDQEMLAAKLQIQNAPRVKNDPELHA 374 Query: 1244 SLFKNVSMFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELMEGNKQFVV 1423 LF+NVSMFKRSYELMERILKVYVY +GEKPIFHQP LKG+YASEGWFM+LME NK FVV Sbjct: 375 PLFRNVSMFKRSYELMERILKVYVYKDGEKPIFHQPILKGLYASEGWFMKLMERNKHFVV 434 Query: 1424 KDPRKAHLFYLPFSAKVLRYTL------DRNNLSNFLGSY 1525 KDPR+A LFY+PFS+++L Y L +R NL +L Y Sbjct: 435 KDPRQAQLFYMPFSSRMLEYKLYVRNSHNRTNLRQYLKQY 474 >ref|XP_002280595.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis vinifera] gi|297738776|emb|CBI28021.3| unnamed protein product [Vitis vinifera] Length = 665 Score = 185 bits (469), Expect = 5e-44 Identities = 135/381 (35%), Positives = 191/381 (50%), Gaps = 8/381 (2%) Frame = +2 Query: 389 LPYKNVLFSLPSVKSGNE---GELTISDGSNYTDPSSVVLRPVNNSKGSDLRKGKTEQES 559 LP N L P+VK G+ TI S + S V+ VNNS SDL E E+ Sbjct: 37 LPSMNTLTLSPTVKGSVSMMVGDATILKNSISAN-SYVIRTVVNNSDASDL-----EDEA 90 Query: 560 
GVELNGRYTNDGD-DSKLQXXXXXXXXXXXXXXXXXXXXXTM-LQNAKFPNNGSMQD--K 727 ++ + +DGD D ++ +M ++N + +N + + Sbjct: 91 DMDYHLASDDDGDLDYSVEMHKEKNSDNEFILEKGVGLDKSMTVRNVRHTDNSPKEKAIE 150 Query: 728 FRE-PEEAYLLENNGESVDDDIQLQMEKAGIRGXXXXXXXXXXXXIVASTNLTLLRKSDP 904 FR P E + +N +DDD KA + + + K Sbjct: 151 FRHGPLEHLKISDNNFKIDDD-----RKASTSLTIGEGSNRDGLVSLPLVSPGISSKGTR 205 Query: 905 NVASNDVPINTNVSSTGKQATETLSKDEKAVSLQSGPVILNNSPKMTRDRVLKRNRMKKP 1084 N+ ++ + + S K E KD+ LQ+ V L+N+ + D + R R KP Sbjct: 206 NLDADSRTSDLSTVSNVKHVMEA-EKDKNTNLLQTVSVPLDNNYTIA-DISITRRRGMKP 263 Query: 1085 IXXXXXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIKKNGELDASLFKNVS 1264 W S D+ELL A+S+++NAP+I+ L AS+++NVS Sbjct: 264 TTISKMNLLLLQSAVSSYSMRPRWSSPRDRELLSARSEIQNAPVIRNTPGLYASVYRNVS 323 Query: 1265 MFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAH 1444 MFKRSYELMER+LK+Y+Y EGEKPIFHQP L+GIYASEGWFM+L+EGNK+FVV+DPRKAH Sbjct: 324 MFKRSYELMERVLKIYIYREGEKPIFHQPRLRGIYASEGWFMKLIEGNKRFVVRDPRKAH 383 Query: 1445 LFYLPFSAKVLRYTLDRNNLS 1507 LFY+PFS+K+LR N S Sbjct: 384 LFYVPFSSKMLRTVFYEQNSS 404 >emb|CAN76867.1| hypothetical protein VITISV_012309 [Vitis vinifera] Length = 1908 Score = 185 bits (469), Expect = 5e-44 Identities = 135/381 (35%), Positives = 191/381 (50%), Gaps = 8/381 (2%) Frame = +2 Query: 389 LPYKNVLFSLPSVKSGNE---GELTISDGSNYTDPSSVVLRPVNNSKGSDLRKGKTEQES 559 LP N L P+VK G+ TI S + S V+ VNNS SDL E E+ Sbjct: 37 LPSMNTLTLSPTVKGSVSMMVGDATILKNSISAN-SYVIRTVVNNSDASDL-----EDEA 90 Query: 560 GVELNGRYTNDGD-DSKLQXXXXXXXXXXXXXXXXXXXXXTM-LQNAKFPNNGSMQD--K 727 ++ + +DGD D ++ +M ++N + +N + + Sbjct: 91 DMDYHLASDDDGDLDYSVEMHKEKNSDNEFILEKGVGLDKSMTVRNVRHTDNSPKEKAIE 150 Query: 728 FRE-PEEAYLLENNGESVDDDIQLQMEKAGIRGXXXXXXXXXXXXIVASTNLTLLRKSDP 904 FR P E + +N +DDD KA + + + K Sbjct: 151 FRHGPLEHLKISDNNFKIDDD-----RKASTSLTIGEGSNRDGLVSLPLVSPGISSKGTR 205 Query: 905 NVASNDVPINTNVSSTGKQATETLSKDEKAVSLQSGPVILNNSPKMTRDRVLKRNRMKKP 1084 N+ ++ + + S K E KD+ LQ+ V L+N+ + D + R R KP Sbjct: 206 NLDADSRTSDLSTVSNVKHVMEA-EKDKNTNLLQTVSVPLDNNYTIA-DISITRRRGMKP 263 
Query: 1085 IXXXXXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIKKNGELDASLFKNVS 1264 W S D+ELL A+S+++NAP+I+ L AS+++NVS Sbjct: 264 TTISKMNLLLLQSAVSSYSMRPRWSSPRDRELLSARSEIQNAPVIRNTPGLYASVYRNVS 323 Query: 1265 MFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAH 1444 MFKRSYELMER+LK+Y+Y EGEKPIFHQP L+GIYASEGWFM+L+EGNK+FVV+DPRKAH Sbjct: 324 MFKRSYELMERVLKIYIYREGEKPIFHQPRLRGIYASEGWFMKLIEGNKRFVVRDPRKAH 383 Query: 1445 LFYLPFSAKVLRYTLDRNNLS 1507 LFY+PFS+K+LR N S Sbjct: 384 LFYVPFSSKMLRTVFYEQNSS 404 Score = 174 bits (440), Expect = 1e-40 Identities = 86/130 (66%), Positives = 102/130 (78%), Gaps = 6/130 (4%) Frame = +2 Query: 1154 WPSVCDQELLYAKSQVENAPIIKKNGELDASLFKNVSMFKRSYELMERILKVYVYGEGEK 1333 W S DQE+L AK Q++NAP +K + EL A LF+NVSMFKRSYELMERILKVYVY +GEK Sbjct: 1016 WASPRDQEMLAAKLQIQNAPRVKNDPELHAPLFRNVSMFKRSYELMERILKVYVYKDGEK 1075 Query: 1334 PIFHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAHLFYLPFSAKVLRYTL------DR 1495 PIFHQP LKG+YASEGWFM+LME NK FVVKDPR+A LFY+PFS+++L Y L +R Sbjct: 1076 PIFHQPILKGLYASEGWFMKLMERNKXFVVKDPRQAQLFYMPFSSRMLEYKLYVRNSHNR 1135 Query: 1496 NNLSNFLGSY 1525 NL +L Y Sbjct: 1136 TNLRQYLKQY 1145 >ref|XP_002324801.2| hypothetical protein POPTR_0018s00290g [Populus trichocarpa] gi|550317697|gb|EEF03366.2| hypothetical protein POPTR_0018s00290g [Populus trichocarpa] Length = 707 Score = 182 bits (463), Expect = 3e-43 Identities = 99/228 (43%), Positives = 135/228 (59%), Gaps = 9/228 (3%) Frame = +2 Query: 872 TNLTLLRKSDPNVASNDVPINTNVSSTGKQATETLSKDEKAVSLQSGPVILNNSPKMTRD 1051 TN+ + R ++P+ + VP+ +N S T K A+ L D KA + L N+ +T Sbjct: 231 TNIAIPRNAEPSTLAPVVPVESNTSKTDKDASHGLENDGKAGEQLNNSTSLQNNTSVTSV 290 Query: 1052 RVLKRN-RMKKP--IXXXXXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIK 1222 R +K+ P I WPS DQELL AKSQ++ AP+++ Sbjct: 291 REVKKEPHTPSPAVISISEMNNLQLQSWSSPISRRPRWPSAVDQELLNAKSQIQKAPLVE 350 Query: 1223 KNGELDASLFKNVSMFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELME 1402 + L A L++N+SMFK+SYELME ILKVY+Y EGE+PI HQ LKGIYASEGWFM+L+E Sbjct: 351 
SDSMLYAPLYRNISMFKKSYELMEDILKVYIYKEGERPILHQAPLKGIYASEGWFMKLLE 410 Query: 1403 GNKQFVVKDPRKAHLFYLPFSAKVLRYTL------DRNNLSNFLGSYV 1528 NK+FV KDP+K+HLFYLPFS++ L L NL +L +Y+ Sbjct: 411 TNKKFVTKDPKKSHLFYLPFSSRNLEVNLYVPNSHSHKNLIQYLKNYL 458 >gb|EXB59797.1| putative glycosyltransferase [Morus notabilis] Length = 637 Score = 180 bits (456), Expect = 2e-42 Identities = 96/203 (47%), Positives = 125/203 (61%), Gaps = 4/203 (1%) Frame = +2 Query: 932 NTNVSSTGKQATETLSKDEKAVSLQSGPVILNNSPKMTRDRVLKRNRMKKPIXXXXXXXX 1111 N +V K +T+ K + VI N+ ++ + KKP Sbjct: 190 NLSVGGLEKNVEQTMKTQPKEGKTEL--VIPNSEDSTIASTLMPKRWDKKPTTISQMNTL 247 Query: 1112 XXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIKKNGELDASLFKNVSMFKRSYELM 1291 W SV D+ELL AK ++ENAPII+ + EL A +F+NVS FKRSYELM Sbjct: 248 LLRSPLSTHSTRPRWSSVRDRELLSAKLEIENAPIIRNSPELSAFVFRNVSKFKRSYELM 307 Query: 1292 ERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAHLFYLPFSAK 1471 ER+LKVY+Y EGEKP+FHQP ++GIYASEGWFM+LME NK+FVV+DPRKAHLFYLPFS+K Sbjct: 308 ERMLKVYIYREGEKPVFHQPYMRGIYASEGWFMKLMEANKKFVVRDPRKAHLFYLPFSSK 367 Query: 1472 VLRYTLD----RNNLSNFLGSYV 1528 +LR T + + + +L SYV Sbjct: 368 LLRTTFENSKGKKDFEKYLKSYV 390 >ref|XP_006476045.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X2 [Citrus sinensis] Length = 663 Score = 178 bits (452), Expect = 5e-42 Identities = 106/228 (46%), Positives = 136/228 (59%), Gaps = 8/228 (3%) Frame = +2 Query: 869 STNLTLLRKSDPNVASNDVPINTNVSSTGKQATETLSKDEKAVSLQSGPVILNNSPKMTR 1048 S+N+TL N+ S + I++N SST K AT L K EK S + NS + Sbjct: 182 SSNITL---QGANI-STPITIHSNSSSTDKDATPALDKIEKPAQ-SSLNTLGENSSGVDV 236 Query: 1049 DRVLKRNRMKKP--IXXXXXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIK 1222 + K+ + P I W S DQE+LYA+SQ+ENAP++K Sbjct: 237 PKENKKPEIPTPAVITIAEMKNMLLQNRASYRSMRPRWSSAVDQEMLYARSQIENAPLLK 296 Query: 1223 KNGELDASLFKNVSMFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELME 1402 + EL A L++NVS FKRSYELME LKVYVY EG++PI H+P LKGIYASEGWFM+ +E Sbjct: 297 NDHELYAPLYRNVSRFKRSYELMEETLKVYVYKEGQRPILHEPVLKGIYASEGWFMKQLE 356 Query: 1403 
GNKQFVVKDPRKAHLFYLPFSAKVLRYTL------DRNNLSNFLGSYV 1528 NKQFV KD RKAHLFYLPFS+++L TL + NL +L +YV Sbjct: 357 ANKQFVTKDSRKAHLFYLPFSSRMLEETLYVQNSHNHKNLIQYLRNYV 404 >ref|XP_006451252.1| hypothetical protein CICLE_v10007703mg [Citrus clementina] gi|568883068|ref|XP_006494322.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform X1 [Citrus sinensis] gi|557554478|gb|ESR64492.1| hypothetical protein CICLE_v10007703mg [Citrus clementina] Length = 647 Score = 177 bits (450), Expect = 9e-42 Identities = 105/223 (47%), Positives = 138/223 (61%), Gaps = 8/223 (3%) Frame = +2 Query: 887 LRKSDPNVASNDVPINTNVSSTG--KQATETLSKDEKAVSLQSGPVILNNSPKMTRDRVL 1060 + K D N +++ N+SST +Q TET + K VS Q I N+ +L Sbjct: 180 VEKLDVNSTASESISAANLSSTADVRQTTETQPMNPK-VSKQPPASIPTNNLSAADISIL 238 Query: 1061 KRNRMKKPIXXXXXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIKKNGELD 1240 KR ++P SV D+ELL AK ++ENAP+ EL Sbjct: 239 KRWN-RRPTSISKMDLLLLQSRVSSRSMRPSSSSVRDRELLSAKVEIENAPVSWNTPELH 297 Query: 1241 ASLFKNVSMFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELMEGNKQFV 1420 AS+F+NVS+FKRSYELME +LKVY+Y EGEKPIFHQP ++GIYASEGWFM+LMEGN++FV Sbjct: 298 ASVFRNVSIFKRSYELMESLLKVYIYKEGEKPIFHQPIMRGIYASEGWFMKLMEGNRKFV 357 Query: 1421 VKDPRKAHLFYLPFSAKVLRYTL------DRNNLSNFLGSYVK 1531 V+DPRKAHLFYLPFS+++LR L + +L N+L +YVK Sbjct: 358 VRDPRKAHLFYLPFSSQMLRIALSEQKLQNHQDLQNYLKTYVK 400 >gb|EYU27286.1| hypothetical protein MIMGU_mgv1a002540mg [Mimulus guttatus] Length = 661 Score = 176 bits (446), Expect = 3e-41 Identities = 101/224 (45%), Positives = 133/224 (59%), Gaps = 17/224 (7%) Frame = +2 Query: 905 NVASNDVPINTNVSSTGKQATETLSKDEKAVSLQSGPVILNN----------SPKMTRDR 1054 +V S+ + I + VS+T L + K +G + +P +R Sbjct: 207 SVTSSPLLIESQVSTTSSAEGHILMVNNKLSDSTNGSSVKKKMRCDMPPKTVTPVNEMER 266 Query: 1055 VLKRNRMKKPIXXXXXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIKKNGE 1234 +L RNR + W S DQE+L AK ++E+ PI+ + E Sbjct: 267 ILVRNRARS------------------RAMRPRWSSERDQEILTAKLKIESPPILNNDPE 308 Query: 1235 LDASLFKNVSMFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELME-GNK 1411 L A 
LF+N+SMFKRSYELMER+LKVYVY EGEKPIFHQP LKG+YASEGWFM+LME GNK Sbjct: 309 LYAPLFRNISMFKRSYELMERVLKVYVYKEGEKPIFHQPILKGLYASEGWFMKLMEGGNK 368 Query: 1412 QFVVKDPRKAHLFYLPFSAKVLRYTL------DRNNLSNFLGSY 1525 +F+VKDPRKAHLFY+PFS+++L YTL +R NL ++L Y Sbjct: 369 RFLVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRHYLKDY 412 >ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana] gi|332005353|gb|AED92736.1| Exostosin family protein [Arabidopsis thaliana] gi|591401784|gb|AHL38619.1| glycosyltransferase, partial [Arabidopsis thaliana] Length = 610 Score = 176 bits (445), Expect = 3e-41 Identities = 98/225 (43%), Positives = 133/225 (59%), Gaps = 8/225 (3%) Frame = +2 Query: 881 TLLRKSDPNVASNDVPI-NTNVSSTGKQATETLSKDEKAVSLQSGPVILNNSPKMTRDRV 1057 T+++K + ++N + N V S + LS S SG L S K+++ + Sbjct: 139 TVMQKESVSTSNNGYQVQNVTVQSQKNVKSSILSGGSSIASPASGNSSLLVSKKVSKKKK 198 Query: 1058 LKRNRMKKPIXXXXXXXXXXXXXXXXXXXXX-HWPSVCDQELLYAKSQVENAPIIKKNGE 1234 ++ + K + W S D+E+L A+ ++ENAP+ K E Sbjct: 199 MRCDLPPKSVTTIDEMNRILARHRRTSRAMRPRWSSRRDEEILTARKEIENAPVAKLERE 258 Query: 1235 LDASLFKNVSMFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELMEGNKQ 1414 L +F+NVS+FKRSYELMERILKVYVY EG +PIFH P LKG+YASEGWFM+LMEGNKQ Sbjct: 259 LYPPIFRNVSLFKRSYELMERILKVYVYKEGNRPIFHTPILKGLYASEGWFMKLMEGNKQ 318 Query: 1415 FVVKDPRKAHLFYLPFSAKVLRYTL------DRNNLSNFLGSYVK 1531 + VKDPRKAHL+Y+PFSA++L YTL +R NL FL Y + Sbjct: 319 YTVKDPRKAHLYYMPFSARMLEYTLYVRNSHNRTNLRQFLKEYTE 363 >ref|XP_006353481.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1 [Solanum tuberosum] gi|565373856|ref|XP_006353482.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X2 [Solanum tuberosum] Length = 674 Score = 175 bits (444), Expect = 4e-41 Identities = 84/130 (64%), Positives = 105/130 (80%), Gaps = 6/130 (4%) Frame = +2 Query: 1154 WPSVCDQELLYAKSQVENAPIIKKNGELDASLFKNVSMFKRSYELMERILKVYVYGEGEK 1333 W S D+E+L A+ Q+ENAP+++ + EL A F+N+SMFKRSYELMERILKVYVY EGEK Sbjct: 296 WSSERDKEILAARLQIENAPLLRNDRELYAPAFRNMSMFKRSYELMERILKVYVYKEGEK 355 Query: 1334 
PIFHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAHLFYLPFSAKVLRYTL------DR 1495 PIFHQP +KG+YASEGWFM+LMEGN +FVVKDPRKAHLFYLPFS+++L ++L +R Sbjct: 356 PIFHQPIMKGLYASEGWFMKLMEGNNRFVVKDPRKAHLFYLPFSSRMLEHSLYVHNSHNR 415 Query: 1496 NNLSNFLGSY 1525 NL +L Y Sbjct: 416 TNLRQYLKDY 425 >ref|XP_007204228.1| hypothetical protein PRUPE_ppa002755mg [Prunus persica] gi|462399759|gb|EMJ05427.1| hypothetical protein PRUPE_ppa002755mg [Prunus persica] Length = 636 Score = 175 bits (444), Expect = 4e-41 Identities = 99/214 (46%), Positives = 129/214 (60%), Gaps = 9/214 (4%) Frame = +2 Query: 914 SNDVPINTNVSSTG--KQATETLSKDEKAVSLQSGPVILNNSPKMTRDRVLKRNRMKKPI 1087 S + P N S G KQ TET + +K Q PV LN + MT +LK+ +P Sbjct: 176 STESPETLNADSKGNVKQTTETQIEHQKTELWQPVPVTLNGNSTMTSISILKKWN-PRPT 234 Query: 1088 XXXXXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPIIKKNGELDASLFKNVSM 1267 S D+EL AK ++ENAPII+ N L AS+F+N+S Sbjct: 235 SLSQMNALLLRIPVSSPSMSPRRYSTRDRELQSAKLEIENAPIIRNNPGLSASVFRNLSK 294 Query: 1268 FKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAHL 1447 F RSY+LM+ +LKVY+Y EGEKP+FHQP ++GIYASEGWFM+L+EGNK+FVV+DP+KAHL Sbjct: 295 FIRSYDLMDHMLKVYIYKEGEKPVFHQPLMRGIYASEGWFMKLVEGNKKFVVRDPKKAHL 354 Query: 1448 FYLPFSAKVLRYTLDRNNLSN-------FLGSYV 1528 FYLPF + +LR TL N+ N +L SYV Sbjct: 355 FYLPFDSHMLRLTLSGQNVKNGKKVLEKYLKSYV 388 >ref|XP_006450684.1| hypothetical protein CICLE_v10007698mg [Citrus clementina] gi|557553910|gb|ESR63924.1| hypothetical protein CICLE_v10007698mg [Citrus clementina] Length = 652 Score = 174 bits (442), Expect = 7e-41 Identities = 116/290 (40%), Positives = 154/290 (53%), Gaps = 8/290 (2%) Frame = +2 Query: 683 LQNAKFPNNGSMQDKFREPEEAYLLENNGESVDDDIQLQMEKAGIRGXXXXXXXXXXXXI 862 +QNA NG +K RE E++++ N+ G + Sbjct: 135 VQNA---GNGPGPEKGRESEQSFIQRNDS-----------------GGAGLSPIPVSPVM 174 Query: 863 VASTNLTLLRKSDPNVASNDVPINTNVSSTGKQATETLSKDEKAVSLQSGPVILNNSPKM 1042 S+N+TL N+ S + I++N SST K AT L K EK S + NS + Sbjct: 175 DLSSNITL---QGANI-STPITIHSNSSSTDKDATPALDKIEKPAQ-SSLNTLGENSSGV 229 Query: 1043 
TRDRVLKRNRMKKP--IXXXXXXXXXXXXXXXXXXXXXHWPSVCDQELLYAKSQVENAPI 1216 + K+ + P I S DQE+LYA+SQ+ENAP+ Sbjct: 230 DVPKENKKPEIPTPAVITIAEMKNMLLQNRASYRSMSPRLSSAVDQEMLYARSQIENAPL 289 Query: 1217 IKKNGELDASLFKNVSMFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMEL 1396 +K + EL A L++NVS FKRSYELME LKVYVY EG++PI H+P LKGIYASEGWFM+ Sbjct: 290 LKNDHELYAPLYRNVSRFKRSYELMEETLKVYVYKEGQRPILHEPVLKGIYASEGWFMKQ 349 Query: 1397 MEGNKQFVVKDPRKAHLFYLPFSAKVLRYTL------DRNNLSNFLGSYV 1528 +E NKQFV KD RKAHLFYLPFS+++L TL + NL +L +YV Sbjct: 350 LEANKQFVTKDSRKAHLFYLPFSSRMLEETLYVQNSHNHKNLIQYLRNYV 399 >ref|XP_007012125.1| Exostosin family protein, putative isoform 2 [Theobroma cacao] gi|508782488|gb|EOY29744.1| Exostosin family protein, putative isoform 2 [Theobroma cacao] Length = 788 Score = 174 bits (441), Expect = 1e-40 Identities = 100/229 (43%), Positives = 135/229 (58%), Gaps = 8/229 (3%) Frame = +2 Query: 866 ASTNLTLLRKSDPNVASNDVPINTNVSSTGKQATETLSKDEKAVSLQSG-PVILNNSPKM 1042 +STN TL + N+ + V +N++ SS + T + K+EK +++ +NS Sbjct: 313 SSTNKTLENDVETNIQTPVVSVNSSTSSLEQHVTPSFDKNEKVEEIKNNFTTSSDNSSPT 372 Query: 1043 TRDRVLKRNRMKKPIXXXXXXXXXXXXXXXXXXXXX-HWPSVCDQELLYAKSQVENAPII 1219 +V K+ M + W S DQ LL A+SQ+ENAPI+ Sbjct: 373 NTPKVGKKPEMPPALTTIADMNNLFYQSRVSYYSKTPRWSSGADQVLLNARSQIENAPIV 432 Query: 1220 KKNGELDASLFKNVSMFKRSYELMERILKVYVYGEGEKPIFHQPSLKGIYASEGWFMELM 1399 K + L A LF+NVSMFKRSYELME LKVYVY EG++PI H P LKGIYASEGWFM+ + Sbjct: 433 KNDPRLYAPLFRNVSMFKRSYELMESTLKVYVYQEGKRPIVHTPILKGIYASEGWFMKQL 492 Query: 1400 EGNKQFVVKDPRKAHLFYLPFSAKVLRYTL------DRNNLSNFLGSYV 1528 E NK+FV K+PR+AHLFYLPFS+++L TL + NL +L +YV Sbjct: 493 EANKKFVTKNPREAHLFYLPFSSRMLEETLYVPDSHNHKNLIEYLKNYV 541 >ref|XP_004251626.1| PREDICTED: probable glycosyltransferase At5g03795-like [Solanum lycopersicum] Length = 674 Score = 174 bits (441), Expect = 1e-40 Identities = 83/130 (63%), Positives = 105/130 (80%), Gaps = 6/130 (4%) Frame = +2 Query: 1154 WPSVCDQELLYAKSQVENAPIIKKNGELDASLFKNVSMFKRSYELMERILKVYVYGEGEK 1333 W S D+E+L A+ Q+ENAP+I+ + E+ A 
F+N+SMFKRSYELMERIL+VYVY EGEK Sbjct: 296 WSSERDKEILAARLQIENAPLIRNDREIYAPAFRNMSMFKRSYELMERILRVYVYKEGEK 355 Query: 1334 PIFHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAHLFYLPFSAKVLRYTL------DR 1495 PIFHQP +KG+YASEGWFM+LMEGN +FVVKDPRKAHLFYLPFS+++L ++L +R Sbjct: 356 PIFHQPIMKGLYASEGWFMKLMEGNNKFVVKDPRKAHLFYLPFSSRMLEHSLYVRNSHNR 415 Query: 1496 NNLSNFLGSY 1525 NL +L Y Sbjct: 416 TNLRQYLKDY 425 >gb|EXB59796.1| putative glycosyltransferase [Morus notabilis] Length = 669 Score = 172 bits (437), Expect = 3e-40 Identities = 103/242 (42%), Positives = 138/242 (57%), Gaps = 21/242 (8%) Frame = +2 Query: 863 VASTNLTL-LRKSD----------PNVASNDVPINTNVSSTGKQATETLSKDEKAVSLQS 1009 + + N+ L L+KSD P +S D +N + S+T + VS QS Sbjct: 189 IRTENIDLRLKKSDGGLDSPFQPSPLASSADALVNASFSTTSTSS----------VSEQS 238 Query: 1010 GPVILNNSPKMTRDRVLKRNRMKKP----IXXXXXXXXXXXXXXXXXXXXXHWPSVCDQE 1177 G +I NN + +K+ R P W SV D+E Sbjct: 239 GLLITNNHSAIATTPGVKKMRCNMPPKSITTFQEMNQILVRHRAKSRSLRPRWSSVRDKE 298 Query: 1178 LLYAKSQVENAPIIKKNGELDASLFKNVSMFKRSYELMERILKVYVYGEGEKPIFHQPSL 1357 +L K Q+ENAP+ + EL A LF+NVSMFKRSYELMER LKVYVY +G+KPIFHQP + Sbjct: 299 ILAMKPQIENAPLAMNDQELYAPLFRNVSMFKRSYELMERTLKVYVYKDGDKPIFHQPIM 358 Query: 1358 KGIYASEGWFMELMEGNKQFVVKDPRKAHLFYLPFSAKVLRYTL------DRNNLSNFLG 1519 KG+YASEGWFM+LME N+++VVKDPR+AHLFY+PFS+++L + L +R NL +L Sbjct: 359 KGLYASEGWFMKLMERNRRYVVKDPRRAHLFYMPFSSRMLEHVLYVRNSHNRTNLRQYLK 418 Query: 1520 SY 1525 Y Sbjct: 419 EY 420 >ref|XP_007013073.1| Exostosin family protein [Theobroma cacao] gi|508783436|gb|EOY30692.1| Exostosin family protein [Theobroma cacao] Length = 736 Score = 172 bits (436), Expect = 4e-40 Identities = 85/130 (65%), Positives = 103/130 (79%), Gaps = 6/130 (4%) Frame = +2 Query: 1160 SVCDQELLYAKSQVENAPIIKKNGELDASLFKNVSMFKRSYELMERILKVYVYGEGEKPI 1339 SV DQE A+SQ+E+AP+I + EL A LF+NVSMFKRSYELMER LKVYVY G+KPI Sbjct: 360 SVRDQETFAARSQIESAPVIVNDQELYAPLFRNVSMFKRSYELMERTLKVYVYKNGKKPI 419 Query: 1340 FHQPSLKGIYASEGWFMELMEGNKQFVVKDPRKAHLFYLPFSAKVLRYTL------DRNN 1501 FH P 
LKG+YASEGWFM+LM+GNK+FVVKDPR+AHLFY+PFS+++L YTL +R N Sbjct: 420 FHLPILKGLYASEGWFMKLMQGNKRFVVKDPRRAHLFYMPFSSRMLEYTLYVRNSHNRTN 479 Query: 1502 LSNFLGSYVK 1531 L FL Y + Sbjct: 480 LRQFLKDYTE 489