BLASTX nr result
ID: Ephedra25_contig00009764
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00009764
         (2364 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, p...   520   e-144
ref|XP_006842991.1| hypothetical protein AMTR_s00076p00109920 [A...   518   e-144
ref|XP_006287442.1| hypothetical protein CARUB_v10000648mg [Caps...   518   e-144
gb|EOY23193.1| Glycosyltransferase isoform 1 [Theobroma cacao]        518   e-144
gb|EMJ21936.1| hypothetical protein PRUPE_ppa023179mg [Prunus pe...   515   e-143
ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arab...   512   e-142
ref|XP_006394641.1| hypothetical protein EUTSA_v10003948mg [Eutr...   511   e-142
ref|XP_002321919.2| hypothetical protein POPTR_0015s13090g [Popu...   511   e-142
gb|EXB29382.1| hypothetical protein L484_001025 [Morus notabilis]     511   e-142
ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolo...   509   e-141
gb|EOY23194.1| Glycosyltransferase isoform 2 [Theobroma cacao]        509   e-141
ref|XP_006290867.1| hypothetical protein CARUB_v10016976mg [Caps...   508   e-141
ref|XP_004140839.1| PREDICTED: protein O-glucosyltransferase 1-l...   507   e-140
gb|AED99886.1| glycosyltransferase [Panax notoginseng]                506   e-140
ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-l...   506   e-140
ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana] ...   506   e-140
gb|EMJ21654.1| hypothetical protein PRUPE_ppa005169mg [Prunus pe...   505   e-140
ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, p...   504   e-140
ref|XP_002875936.1| hypothetical protein ARALYDRAFT_485256 [Arab...
504 e-140 ref|XP_006404195.1| hypothetical protein EUTSA_v10010269mg [Eutr... 503 e-139 >ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, putative [Ricinus communis] gi|223549902|gb|EEF51389.1| KDEL motif-containing protein 1 precursor, putative [Ricinus communis] Length = 506 Score = 520 bits (1339), Expect = e-144 Identities = 235/447 (52%), Positives = 317/447 (70%), Gaps = 9/447 (2%) Frame = -3 Query: 1780 SSNPIHKPR---VVNFNCTSREPCKARKNLSKKTLEPNPNK-----CPDFFMFIHEDLRP 1625 S+ P+ KP V+ NC + + T +PN+ CP++F +IHEDLRP Sbjct: 58 STVPLEKPDNRLVIPLNCHALNLTRTCPTDYPSTSSQDPNRSSPPTCPEYFRWIHEDLRP 117 Query: 1624 WKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGIL 1445 W TGIT E +E A TANFRL I++G Y+ + +KSFQTRDVFT+WG +QL+ YPG + Sbjct: 118 WVRTGITRETMERAKATANFRLVILNGTAYLEMYEKSFQTRDVFTLWGILQLLRKYPGRV 177 Query: 1444 PDVDLMFDCVDWPVIGK-KYDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKP 1268 PD+++MFDCVDWPV+ Y SS+ PPPLFRYC ++E LDI PDWS+WGW E N KP Sbjct: 178 PDLEMMFDCVDWPVVKSVDYSGSSAISPPPLFRYCGNDETLDIVFPDWSYWGWVETNIKP 237 Query: 1267 WEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWI 1088 WE +VKD+ +GN+R W++R P AYWKGNP VA+ R +LM+CN S HDWNARLY QDW+ Sbjct: 238 WEKIVKDLKEGNQRSKWKEREPYAYWKGNPNVAETRLDLMKCNVSQEHDWNARLYTQDWV 297 Query: 1087 RESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMP 908 RES+QGYKQS+LANQC+HRYKIY+EGSAWSVS K I+ACDS TLIV P YYDFF+R LMP Sbjct: 298 RESQQGYKQSDLANQCNHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTRGLMP 357 Query: 907 GRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQ 728 HYWPI+ D KC+SIKFAVDWGN H ++A+A+G+A S F++EDLK+ VYDYMFH+L++ Sbjct: 358 NHHYWPIKEDDKCKSIKFAVDWGNSHKQKAQAIGKAASDFIQEDLKMDYVYDYMFHLLNE 417 Query: 727 YSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQ 548 Y++L+ +KP++P+ A ++C+++ C + + K+ M S EGP +PC + S D Sbjct: 418 YARLLTFKPTIPQNATKLCAETMACPAD--GLAKKLMMDSMVEGPADTSPCTMPSSYDPS 475 Query: 547 RLDQWNKMRAKALRKVAKMEEESWSKE 467 L + + A++++ E + W + Sbjct: 476 SLYNVTREKVNAIKQIELWENKHWENQ 502 >ref|XP_006842991.1| 
hypothetical protein AMTR_s00076p00109920 [Amborella trichopoda] gi|548845188|gb|ERN04666.1| hypothetical protein AMTR_s00076p00109920 [Amborella trichopoda] Length = 496 Score = 518 bits (1335), Expect = e-144 Identities = 239/431 (55%), Positives = 313/431 (72%), Gaps = 2/431 (0%) Frame = -3 Query: 1750 VNFNCTSRE--PCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANR 1577 +N NC+S P + +NL +P + CPD+F +IHEDL+PWK TGIT EMVE A R Sbjct: 73 INTNCSSLPWPPFPSIQNL-----DPPTSTCPDYFRWIHEDLKPWKGTGITQEMVERARR 127 Query: 1576 TANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIG 1397 TA FRL ++DG++Y+ K++Q RD FTIWG +QL Y G +PD+DLMFDCVDWPV+ Sbjct: 128 TATFRLLVIDGKVYVERYAKAYQCRDDFTIWGMLQLFRRYSGRVPDLDLMFDCVDWPVV- 186 Query: 1396 KKYDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINW 1217 K++D PPPLFRYC D + LDI PDWSFWGW E+N +PWE L+KD+ GNK+I W Sbjct: 187 KRWDYRGRVVPPPLFRYCGDKDSLDIVFPDWSFWGWPEINIEPWEALLKDLDDGNKKIKW 246 Query: 1216 EKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCD 1037 R P AYWKGNP+VAD RK+L++CN + DWNAR+Y+QDWI+ES+QGYK+SNLANQC Sbjct: 247 MNRDPTAYWKGNPYVADTRKDLLKCNVTETQDWNARVYVQDWIKESQQGYKESNLANQCT 306 Query: 1036 HRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIK 857 HRYKIY+EGSAWSVS K I+ACDS TL+VTP YYDF +RALMP HYWPI+ D KC SIK Sbjct: 307 HRYKIYIEGSAWSVSEKYILACDSPTLLVTPHYYDFVTRALMPTHHYWPIKGDDKCRSIK 366 Query: 856 FAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKE 677 +AVDWGN H ++A+A+G+ SSF+ ED+K++ VYDYMFH+L +YSKL++YKP+VPE A + Sbjct: 367 YAVDWGNSHKQKAQAIGKTASSFILEDVKMAYVYDYMFHLLSEYSKLLRYKPTVPEKAVQ 426 Query: 676 VCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVA 497 CS+S C + + ++FM +S + P+ PC L P + L + +A A+++V Sbjct: 427 YCSESMACPAKGN--YEKFMKESFVKVPSDSEPCILPPPFEPPALQLLLRRKANAIKQVE 484 Query: 496 KMEEESWSKEK 464 E+ S K K Sbjct: 485 TWEQNSRKKTK 495 >ref|XP_006287442.1| hypothetical protein CARUB_v10000648mg [Capsella rubella] gi|482556148|gb|EOA20340.1| hypothetical protein CARUB_v10000648mg [Capsella rubella] Length = 544 Score = 
518 bits (1334), Expect = e-144 Identities = 237/423 (56%), Positives = 307/423 (72%), Gaps = 1/423 (0%) Frame = -3 Query: 1732 SREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTI 1553 +++P A N T P CPD+F +IHEDLRPW TGIT E +E AN+TANFRL I Sbjct: 120 NKDPTTASFN-DDDTNHPPTATCPDYFRWIHEDLRPWARTGITREALERANKTANFRLAI 178 Query: 1552 VDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIGKKYDESSS 1373 V G++Y+ Q +FQTRDVFTIWGF+QL+ YPG +PD++LMFDCVDWPV+ Sbjct: 179 VGGKVYVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRAAEFAGVD 238 Query: 1372 SP-PPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAA 1196 +P PPPLFRYC + E LDI PDWSFWGWAEVN KPWE L+K++ +GN++INW R P A Sbjct: 239 APSPPPLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNEKINWINREPYA 298 Query: 1195 YWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYV 1016 YWKGNP VA+ R++LM+CN S H+WNARLY QDWI+ES++GYKQS+LANQC HRYKIY+ Sbjct: 299 YWKGNPVVAETRQDLMKCNVSEEHEWNARLYAQDWIKESKEGYKQSDLANQCHHRYKIYI 358 Query: 1015 EGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGN 836 EGSAWSVS K I+ACDS TL+V P YYDFF+R L+P HYWP+R KC SIKFAVDWGN Sbjct: 359 EGSAWSVSEKYILACDSMTLLVKPHYYDFFTRGLLPAHHYWPVREKDKCRSIKFAVDWGN 418 Query: 835 QHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEY 656 H ++A+ +G+A S F++++LK+ VYDYM+H+L++YSKL+++KP VP A E+CS++ Sbjct: 419 SHIQKAQDIGKAASEFIQQELKMDYVYDYMYHLLNEYSKLLQFKPEVPPNAVEICSETMA 478 Query: 655 CSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESW 476 C+ RS E++FM +S + P PC L P D L K + ++ ME + W Sbjct: 479 CT--RSGNERKFMTESLVKHPAESGPCALPPPYDPVSLYSVAKRKQSTTARILHMEMKYW 536 Query: 475 SKE 467 SK+ Sbjct: 537 SKQ 539 >gb|EOY23193.1| Glycosyltransferase isoform 1 [Theobroma cacao] Length = 522 Score = 518 bits (1333), Expect = e-144 Identities = 235/437 (53%), Positives = 316/437 (72%), Gaps = 6/437 (1%) Frame = -3 Query: 1756 RVVNFNCTSREPCKARKNLSKKTLEPNPNK-----CPDFFMFIHEDLRPWKETGITLEMV 1592 R + NCT+R +A +E P+ CPD+F +IHEDLRPW TGI+++M+ Sbjct: 88 
RDIPLNCTARNLTRACPTNDPTAIEEEPDSSLNAMCPDYFRWIHEDLRPWAYTGISMDML 147 Query: 1591 EMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVD 1412 + A +TANFRL +V+GR Y+ ++SFQTRDVFT+WG +QL+ YPG +PD+DLMFDCVD Sbjct: 148 KRAEKTANFRLVVVNGRAYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVD 207 Query: 1411 WPVIGKK-YDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKG 1235 WPVI Y +++ PPPLFRYC D+E LDI PDWSFWGW E+N KPW L+ D+ +G Sbjct: 208 WPVIKTSDYGGPNATTPPPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEG 267 Query: 1234 NKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSN 1055 NKR+ WE R P AYWKGNP VA R++L++CN S DW AR+Y QDW RES+QGYKQS+ Sbjct: 268 NKRMGWEGREPHAYWKGNPNVATTRQDLLKCNVSDKQDWGARVYAQDWARESQQGYKQSD 327 Query: 1054 LANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDK 875 LANQC HR+KIY+EGSAWSVS K I+ACDS TL+V P+YYDFF+R+L P RHYWPI+ D Sbjct: 328 LANQCIHRFKIYIEGSAWSVSEKYILACDSLTLLVKPRYYDFFTRSLEPMRHYWPIKDDD 387 Query: 874 KCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSV 695 KC SIK AVDWGN H ++A+A+G+A S F+KE LK+ VYDYMFH+L++Y+KL++YKP+V Sbjct: 388 KCRSIKHAVDWGNGHQQEAQAIGKAASEFIKEGLKMDYVYDYMFHLLNEYAKLLRYKPTV 447 Query: 694 PEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAK 515 P A E+CS++ C E ++K+FM +S +GP+ +PC + P D L + Sbjct: 448 PRKAVELCSETMACPAE--GLQKKFMMESMVKGPSVTSPCTMPPPYDPASLYALLSKKEN 505 Query: 514 ALRKVAKMEEESWSKEK 464 ++++V + E++ W +K Sbjct: 506 SIKQVEEWEKKFWEMQK 522 >gb|EMJ21936.1| hypothetical protein PRUPE_ppa023179mg [Prunus persica] Length = 502 Score = 515 bits (1327), Expect = e-143 Identities = 228/406 (56%), Positives = 307/406 (75%), Gaps = 1/406 (0%) Frame = -3 Query: 1681 PNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTR 1502 P+P CP++F +IHEDLRPW TGIT EMVE ANRTANF+ IV+G+ Y+ +K+FQTR Sbjct: 95 PSPPTCPEYFRWIHEDLRPWARTGITREMVERANRTANFKFVIVNGKAYVEQYEKAFQTR 154 Query: 1501 DVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHL 1325 DVFT+WGF+QL+ YPG +PD++LMFDCVDWPVI +Y +++ PPPLFRYC+D+ L Sbjct: 155 
DVFTVWGFLQLLRRYPGQVPDLELMFDCVDWPVIPSHEYSGPNATAPPPLFRYCADDNTL 214 Query: 1324 DIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQ 1145 DI PDWSFWGWAE+N +PWE L +++ +GNKR W +R P AYWKGNP +A+ R++L++ Sbjct: 215 DIVFPDWSFWGWAEINIRPWEVLFEELKEGNKRKTWLEREPYAYWKGNPDIAETRQDLIK 274 Query: 1144 CNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDS 965 CN S HDWNARLY QDW RES++GY +S+LA+QC HRYKIY+EGSAWSVS K I+ACDS Sbjct: 275 CNVSEEHDWNARLYAQDWDRESKEGYNKSDLASQCIHRYKIYIEGSAWSVSEKYILACDS 334 Query: 964 ATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFM 785 TLIV P+YYDFF+R LMP HYWPI+ D KC SIKF+VDWGN H ++A+A+G+A S+ + Sbjct: 335 VTLIVKPRYYDFFTRRLMPVEHYWPIKDDDKCRSIKFSVDWGNTHRRKAQAIGKASSNLI 394 Query: 784 KEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSA 605 +E+LK+ VYDYMFH+L++Y+KL+++KP+VP+ A E+CS++ C E + EK+FM QS Sbjct: 395 QEELKMEYVYDYMFHLLNEYAKLLQFKPTVPKKAVELCSEAMACQAEGT--EKKFMLQSL 452 Query: 604 TEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467 +GP PC + P D L + + ++++V E W + Sbjct: 453 VKGPAVSEPCAMPPPYDPSSLFAVLRRKENSIKQVETWERNYWESQ 498 >ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp. lyrata] gi|297317912|gb|EFH48334.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp. 
lyrata] Length = 543 Score = 512 bits (1318), Expect = e-142 Identities = 229/424 (54%), Positives = 305/424 (71%), Gaps = 1/424 (0%) Frame = -3 Query: 1735 TSREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLT 1556 +++ P A T P CPD+F +IHEDLRPW TGIT E +E A +TANFRL Sbjct: 117 SNKYPTTASFGEDDDTNHPPNATCPDYFRWIHEDLRPWSSTGITREALERAKKTANFRLA 176 Query: 1555 IVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDES 1379 I+DG++Y+ Q +FQTRDVFTIWGF+QL+ YPG +PD++LMFDCVDWPV+ ++ + Sbjct: 177 IIDGKIYVEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVKASEFTGA 236 Query: 1378 SSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPA 1199 ++ PPPLFRYC + E LDI PDWSFWGWAEVN KPWE L+K++ +GN+R W R P Sbjct: 237 NAPSPPPLFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNQRTKWINREPY 296 Query: 1198 AYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIY 1019 AYWKGNP VA+ R++LM+CN S H+WNARLY+QDWI+ES +GYKQS+LA+QC HRYKIY Sbjct: 297 AYWKGNPMVAETRQDLMKCNVSEEHEWNARLYVQDWIKESNEGYKQSDLASQCHHRYKIY 356 Query: 1018 VEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWG 839 +EGSAWSVS K I+ACDS TL+V P YYDFF+R L+P HYWP+R KC SIKFAVDWG Sbjct: 357 IEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWG 416 Query: 838 NQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSE 659 N H ++A+ +G+A S F++ +LK+ VYDYM+H+L +YSKL+++KP +P+ A E+CS++ Sbjct: 417 NSHIQKAQDIGKAASDFIQHELKMDYVYDYMYHLLTEYSKLLRFKPEIPQNAAEICSETM 476 Query: 658 YCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEES 479 C RS E++FM +S + P PC + P D L K + ++ + E + Sbjct: 477 AC--PRSGNERKFMTESFVKHPAESGPCAMPPPYDPALLYGVVKRKQSTNMRILQWEMKY 534 Query: 478 WSKE 467 WSK+ Sbjct: 535 WSKQ 538 >ref|XP_006394641.1| hypothetical protein EUTSA_v10003948mg [Eutrema salsugineum] gi|557091280|gb|ESQ31927.1| hypothetical protein EUTSA_v10003948mg [Eutrema salsugineum] Length = 545 Score = 511 bits (1317), Expect = e-142 Identities = 227/401 (56%), Positives = 300/401 (74%), Gaps = 1/401 (0%) Frame = -3 Query: 1666 
CPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTI 1487 CPD+F +IHEDLRPW++TGIT E +E A +TANFRL IV G++Y+ Q +FQTRDVFTI Sbjct: 142 CPDYFRWIHEDLRPWEKTGITREALERAKKTANFRLAIVGGKLYVEKFQDAFQTRDVFTI 201 Query: 1486 WGFIQLMELYPGILPDVDLMFDCVDWPVIGKKYDESSSSP-PPPLFRYCSDNEHLDIPLP 1310 WGF+QL+ YPG +PD++LMFDCVDWPV+ ++SP PPPLFRYC + E LDI P Sbjct: 202 WGFLQLLRRYPGKIPDLELMFDCVDWPVVKAANFAGANSPSPPPLFRYCGNEETLDIVFP 261 Query: 1309 DWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASH 1130 DWSFWGW+EVN KPWE L+K++ +GN++ NW R P AYWKGNP VA+ R++LM+CN S Sbjct: 262 DWSFWGWSEVNIKPWESLLKELREGNEKTNWINREPYAYWKGNPLVAETRQDLMKCNVSE 321 Query: 1129 GHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIV 950 H+WNARLY QDWIRES++GYKQS+LA+QC HR+KIY+EGSAWSVS K I+ACDS TL+V Sbjct: 322 EHEWNARLYAQDWIRESKEGYKQSDLASQCHHRFKIYIEGSAWSVSEKYILACDSVTLLV 381 Query: 949 TPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLK 770 P YYDFF+R L+P HYWP+R KC SIKFAV WGN H ++A+ +G+A S F++++LK Sbjct: 382 KPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVHWGNSHIQKAQDIGKAASEFIQQELK 441 Query: 769 ISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPN 590 + VYDYMFH+L +YSKL+++KP +P+ AKE+CS++ C RS E++FM +S + P Sbjct: 442 MDYVYDYMFHLLTEYSKLLQFKPEIPQNAKEICSETMAC--PRSGNERKFMTESLVKHPA 499 Query: 589 SIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467 PC + P D K + A ++ + E + WSK+ Sbjct: 500 QTGPCAMPPPYDPASFYAVVKRKQSAATRILQWEMKYWSKQ 540 >ref|XP_002321919.2| hypothetical protein POPTR_0015s13090g [Populus trichocarpa] gi|550322617|gb|EEF06046.2| hypothetical protein POPTR_0015s13090g [Populus trichocarpa] Length = 506 Score = 511 bits (1317), Expect = e-142 Identities = 231/427 (54%), Positives = 306/427 (71%), Gaps = 1/427 (0%) Frame = -3 Query: 1744 FNCTSREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANF 1565 FN T + P N + P+ + CP+ F +IHEDLRPW TGI+ +MVE A RTANF Sbjct: 78 FNPTRKCPLNYPTNTQEGPDRPSVSTCPEHFRWIHEDLRPWAHTGISRDMVERAKRTANF 137 Query: 1564 RLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKY 1388 RL IV+G+ Y+ 
+KSFQTRD FT+WG IQL+ YPG LPD+D+MFDCVDWPVI Y Sbjct: 138 RLVIVNGKAYMERYRKSFQTRDTFTVWGIIQLLRKYPGKLPDLDMMFDCVDWPVIRSSDY 197 Query: 1387 DESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKR 1208 +++ PP LFRYC D++ LD+ PDWSFWGW E+N KPWE L D+ +GNK W +R Sbjct: 198 SGPNATSPPALFRYCGDDDSLDVVFPDWSFWGWPEINIKPWESLSNDLKEGNKITKWMER 257 Query: 1207 VPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRY 1028 P AYWKGNP VA R++LM+C+AS DWNAR+Y QDWI+ES+QGY+QSNLANQC H+Y Sbjct: 258 EPYAYWKGNPSVAATRQDLMKCHASETQDWNARVYAQDWIKESQQGYQQSNLANQCVHKY 317 Query: 1027 KIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAV 848 KIY+EGSAWSVS K I+ACDS TL+V P YYDFF+R+L+P RHYWPI+ D KC SIKFAV Sbjct: 318 KIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRSLVPNRHYWPIKEDDKCRSIKFAV 377 Query: 847 DWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCS 668 +WGN H+++A+AMG+A S F++EDLK+ VYDYMFH+L++Y+KL+ +KP++P A E+C+ Sbjct: 378 EWGNNHSEEAQAMGKAASEFIQEDLKMDYVYDYMFHLLNEYAKLLTFKPTIPGRAIELCA 437 Query: 667 DSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKME 488 ++ C + +EK+FM S P +PC + P D L + ++++V E Sbjct: 438 EAMACPA--NGLEKKFMMDSMVMSPADTSPCTMPPPYDPLSLHSVFQRNGNSIKQVESWE 495 Query: 487 EESWSKE 467 +E W + Sbjct: 496 KEYWDNQ 502 >gb|EXB29382.1| hypothetical protein L484_001025 [Morus notabilis] Length = 515 Score = 511 bits (1315), Expect = e-142 Identities = 228/401 (56%), Positives = 299/401 (74%), Gaps = 1/401 (0%) Frame = -3 Query: 1666 CPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTI 1487 CPD+F +I+EDLRPW TGI+ +MVE A RTANFRL IV+G+ Y+ QK+FQTRDVFT+ Sbjct: 113 CPDYFRWIYEDLRPWAYTGISRDMVERAKRTANFRLVIVNGKAYVETFQKAFQTRDVFTL 172 Query: 1486 WGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDIPLP 1310 WG +QL+ YPG +PD++LMFDCVDWPV+ K Y ++ PPPLFRYC D+ LDI P Sbjct: 173 WGILQLLRKYPGRVPDLELMFDCVDWPVVLSKAYSGPDATTPPPLFRYCGDDSTLDIVFP 232 Query: 1309 DWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASH 1130 DWSFWGW E N KPWE L+K++ +GNK+ W +R AYWKGNP VA R++L++CN S Sbjct: 233 
DWSFWGWPETNIKPWEALLKELEEGNKKSKWVEREAYAYWKGNPVVAATRQDLLKCNVSD 292 Query: 1129 GHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIV 950 DWNARLY QDW++ES++GYKQS+LANQC HRYKIY+EGSAWSVS K I+ACDS TLIV Sbjct: 293 KQDWNARLYAQDWLKESKEGYKQSDLANQCIHRYKIYIEGSAWSVSEKYILACDSVTLIV 352 Query: 949 TPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLK 770 P YYDFF+R L+P +HYWPI+ D KC SIKFAVDWGN H K+AK++G+A S F+++DLK Sbjct: 353 KPHYYDFFTRGLVPMQHYWPIKDDDKCRSIKFAVDWGNSHKKKAKSIGKAASRFIQDDLK 412 Query: 769 ISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPN 590 + VYDYMFH+L++Y+KL+K+KPS+PE A E CS+S C+ E + K+FM +S +GP Sbjct: 413 MEYVYDYMFHLLNEYAKLLKFKPSIPEKAVEFCSESMACTAE--GIGKKFMMESMVKGPA 470 Query: 589 SIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467 +PC + PS + L + + + +V + + W + Sbjct: 471 DSSPCTMPPSYNPSSLYSLIQKKTSLIEQVEMWQNKYWENQ 511 >ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera] gi|302143884|emb|CBI22745.3| unnamed protein product [Vitis vinifera] Length = 525 Score = 509 bits (1312), Expect = e-141 Identities = 227/406 (55%), Positives = 304/406 (74%), Gaps = 1/406 (0%) Frame = -3 Query: 1681 PNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTR 1502 P+P +CP +F +I+ DLRPW ++GIT EMVE A RTA F+L I++GR Y+ Q++FQTR Sbjct: 120 PSPPECPHYFRWIYGDLRPWMKSGITREMVERAKRTATFKLVILNGRAYVEKYQRAFQTR 179 Query: 1501 DVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHL 1325 DVFT+WG +QL+ YPG +PD++LMFDCVDWPVI +Y +++ PPPLFRYC D+ L Sbjct: 180 DVFTLWGILQLLRRYPGKVPDLELMFDCVDWPVIQSNEYRGPNATAPPPLFRYCGDDATL 239 Query: 1324 DIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQ 1145 DI PDWSFWGW E+N KPWE L+KD+ +GNKR W +R P AYWKGNP VA R +L++ Sbjct: 240 DIVFPDWSFWGWPEINIKPWESLLKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDLLK 299 Query: 1144 CNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDS 965 CN S DWNAR+Y QDWI ES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS Sbjct: 300 CNVSDKQDWNARVYTQDWILESQEGYKQSDLASQCIHRYKIYIEGSAWSVSQKYILACDS 359 Query: 964 
ATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFM 785 TL+V P YYDFF+R+LMP HYWPIR D KC SIKFAVDWGN+H ++A+++G+A S F+ Sbjct: 360 VTLLVKPHYYDFFTRSLMPVHHYWPIREDDKCRSIKFAVDWGNRHKQKAQSIGKAASDFI 419 Query: 784 KEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSA 605 +EDLK+ NVYDYMFH+L++Y+KL+K+KP+VPE A E+CS+ C E ++K+FM +S Sbjct: 420 QEDLKMDNVYDYMFHLLNEYAKLLKFKPTVPEKAVELCSERMGCGAE--GLKKKFMMESM 477 Query: 604 TEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467 + P +PC + P L + + ++++V E++ W + Sbjct: 478 VKYPMDASPCTMPPPFSPLELQTFLNRKVNSIKQVEAWEKKFWENQ 523 >gb|EOY23194.1| Glycosyltransferase isoform 2 [Theobroma cacao] Length = 498 Score = 509 bits (1311), Expect = e-141 Identities = 230/407 (56%), Positives = 302/407 (74%), Gaps = 6/407 (1%) Frame = -3 Query: 1756 RVVNFNCTSREPCKARKNLSKKTLEPNPNK-----CPDFFMFIHEDLRPWKETGITLEMV 1592 R + NCT+R +A +E P+ CPD+F +IHEDLRPW TGI+++M+ Sbjct: 88 RDIPLNCTARNLTRACPTNDPTAIEEEPDSSLNAMCPDYFRWIHEDLRPWAYTGISMDML 147 Query: 1591 EMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVD 1412 + A +TANFRL +V+GR Y+ ++SFQTRDVFT+WG +QL+ YPG +PD+DLMFDCVD Sbjct: 148 KRAEKTANFRLVVVNGRAYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVD 207 Query: 1411 WPVIGKK-YDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKG 1235 WPVI Y +++ PPPLFRYC D+E LDI PDWSFWGW E+N KPW L+ D+ +G Sbjct: 208 WPVIKTSDYGGPNATTPPPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEG 267 Query: 1234 NKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSN 1055 NKR+ WE R P AYWKGNP VA R++L++CN S DW AR+Y QDW RES+QGYKQS+ Sbjct: 268 NKRMGWEGREPHAYWKGNPNVATTRQDLLKCNVSDKQDWGARVYAQDWARESQQGYKQSD 327 Query: 1054 LANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDK 875 LANQC HR+KIY+EGSAWSVS K I+ACDS TL+V P+YYDFF+R+L P RHYWPI+ D Sbjct: 328 LANQCIHRFKIYIEGSAWSVSEKYILACDSLTLLVKPRYYDFFTRSLEPMRHYWPIKDDD 387 Query: 874 KCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSV 695 KC SIK AVDWGN H ++A+A+G+A S F+KE LK+ VYDYMFH+L++Y+KL++YKP+V Sbjct: 388 
KCRSIKHAVDWGNGHQQEAQAIGKAASEFIKEGLKMDYVYDYMFHLLNEYAKLLRYKPTV 447 Query: 694 PEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPD 554 P A E+CS++ C E ++K+FM +S +GP+ +PC + P D Sbjct: 448 PRKAVELCSETMACPAE--GLQKKFMMESMVKGPSVTSPCTMPPPYD 492 >ref|XP_006290867.1| hypothetical protein CARUB_v10016976mg [Capsella rubella] gi|482559574|gb|EOA23765.1| hypothetical protein CARUB_v10016976mg [Capsella rubella] Length = 539 Score = 508 bits (1309), Expect = e-141 Identities = 225/404 (55%), Positives = 298/404 (73%), Gaps = 1/404 (0%) Frame = -3 Query: 1675 PNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDV 1496 P CPD+F +IHEDLRPW++TGIT E +E AN TA FRL I+DGR+Y+ +++FQTRDV Sbjct: 133 PATCPDYFRWIHEDLRPWEKTGITREALERANATAIFRLAIIDGRIYVENFREAFQTRDV 192 Query: 1495 FTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDI 1319 FTIWGF+QL+ YPG +PD++LMFDCVDWPV+ ++Y PPPLFRYC+++E LDI Sbjct: 193 FTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAEEYSGVDKPSPPPLFRYCANDETLDI 252 Query: 1318 PLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCN 1139 PDWS+WGWAEVN KPWE L+KD+S+GN+R W R P AYWKGNP VA+ R +LM+CN Sbjct: 253 VFPDWSYWGWAEVNIKPWESLLKDLSEGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCN 312 Query: 1138 ASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSAT 959 S +DW ARLY QDW++ES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS T Sbjct: 313 LSEEYDWKARLYKQDWLKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVT 372 Query: 958 LIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKE 779 L+V P YYDFF+R + PG HYWP++ D KC SIKFAVDWGN H ++A+ +G+ S F+++ Sbjct: 373 LMVKPHYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASEFVQQ 432 Query: 778 DLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATE 599 +LK+ VYDYMFH+L QYSKL+++KP +P+ + EVCS++ C R E++FM +S + Sbjct: 433 ELKMDYVYDYMFHLLTQYSKLLRFKPEIPQNSTEVCSETMAC--PRDGNERKFMMESLVK 490 Query: 598 GPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467 P PC + P D K R ++ + E + W K+ Sbjct: 491 RPAETGPCAMPPPYDPASFYSVLKRRQSTTSRIEQWESKYWRKQ 534 >ref|XP_004140839.1| PREDICTED: protein 
O-glucosyltransferase 1-like [Cucumis sativus] Length = 538 Score = 507 bits (1305), Expect = e-140 Identities = 240/451 (53%), Positives = 311/451 (68%), Gaps = 15/451 (3%) Frame = -3 Query: 1774 NPIHKPR---------VVNFNCTSREPCKARKNLSKKTLEP-NP----NKCPDFFMFIHE 1637 NP H+PR +FN + C A + T E NP + CPD+F +IHE Sbjct: 86 NPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQNPPSSSSACPDYFRWIHE 145 Query: 1636 DLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELY 1457 DLRPW TGIT +E RTANFRL I++G+ Y+ +KSFQTRD FT+WG +QL+ Y Sbjct: 146 DLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGILQLLRRY 205 Query: 1456 PGILPDVDLMFDCVDWPVIGKKYDESSSSP-PPPLFRYCSDNEHLDIPLPDWSFWGWAEV 1280 PG +PD+DLMFDCVDWPVI + + P PPPLFRYC D+ DI PDWSFWGW E+ Sbjct: 206 PGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWSFWGWPEI 265 Query: 1279 NTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYI 1100 N KPWE L+KDI +GNKRI W+ R P AYWKGNP VAD RK+L++CN S DWNAR++ Sbjct: 266 NIKPWEPLLKDIKEGNKRIPWKSREPYAYWKGNPEVADTRKDLIKCNVSDQQDWNARVFA 325 Query: 1099 QDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSR 920 QDW +ES++GYKQS+L+NQC HRYKIY+EGSAWSVS K I+ACDS TLIV P YYDFF+R Sbjct: 326 QDWTKESQEGYKQSDLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTR 385 Query: 919 ALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFH 740 LMP HYWP++ D KC+SIKFAVDWGN H ++A+A+G+A SSF++E+LK+ VYDYMFH Sbjct: 386 GLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFH 445 Query: 739 MLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPS 560 +L +YSKL+ +KP++P A E+CS++ C E + K+FM +S + P PC + P Sbjct: 446 LLSEYSKLLTFKPTLPPNAIELCSEAMACPAE--GLTKKFMTESLVKRPAESNPCTMPPP 503 Query: 559 PDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467 D L + ++++V K E W+ + Sbjct: 504 YDPASLHFVLSRKENSIKQVEKWETSFWNTQ 534 >gb|AED99886.1| glycosyltransferase [Panax notoginseng] Length = 546 Score = 506 bits (1303), Expect = e-140 Identities = 227/401 (56%), Positives = 296/401 (73%), Gaps = 1/401 (0%) Frame = -3 Query: 1675 
PNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDV 1496 P CP++F +I+EDLRPW+ETGIT EMVE A RTANFRL I++GR Y+ +QKSFQ+RDV Sbjct: 143 PVSCPEYFRWIYEDLRPWRETGITREMVERARRTANFRLVILNGRAYVETHQKSFQSRDV 202 Query: 1495 FTIWGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDI 1319 FT+WG +QL+ +YPG +PD+DLMFDCVDWPVI + Y +++ PPPLFRYC+D+ LDI Sbjct: 203 FTLWGILQLLRMYPGKVPDLDLMFDCVDWPVIISRFYHGPNATAPPPLFRYCADDSTLDI 262 Query: 1318 PLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCN 1139 PDW+FWGW E+N KPW L+KD+ +GN W R P AYWKGNP VA R +L++CN Sbjct: 263 VFPDWTFWGWPEINIKPWGSLLKDLKEGNTGTQWMDREPYAYWKGNPIVAKTRMDLLKCN 322 Query: 1138 ASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSAT 959 S DWNAR+Y DW RES+ GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS T Sbjct: 323 VSDKQDWNARVYAXDWARESQLGYKQSDLASQCIHRYKIYIEGSAWSVSEKYILACDSVT 382 Query: 958 LIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKE 779 L V P+YYDFF+R LMP HYWPIR D KC SIKFAVDWGN H ++A ++G+ S+F++E Sbjct: 383 LXVKPRYYDFFTRGLMPVHHYWPIRDDDKCRSIKFAVDWGNNHKQKAHSIGKEASNFIQE 442 Query: 778 DLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATE 599 DLK+ VYDYMFH+L++Y+KL++YKP+VP A E+CS++ C E K+FM +S + Sbjct: 443 DLKMDYVYDYMFHLLNEYAKLLRYKPTVPPKAVELCSETMACPAE--GFTKKFMMESIVK 500 Query: 598 GPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESW 476 GP +PC + P D L + + ++++V E+ W Sbjct: 501 GPTDKSPCVMQPPYDPPTLHSVLRRKENSIKQVENWEKLYW 541 >ref|XP_004157225.1| PREDICTED: protein O-glucosyltransferase 1-like [Cucumis sativus] Length = 538 Score = 506 bits (1302), Expect = e-140 Identities = 240/451 (53%), Positives = 310/451 (68%), Gaps = 15/451 (3%) Frame = -3 Query: 1774 NPIHKPR---------VVNFNCTSREPCKARKNLSKKTLEP-NP----NKCPDFFMFIHE 1637 NP H+PR +FN + C A + T E NP + CPD+F +IHE Sbjct: 86 NPNHQPRRPQVEFTLHCASFNNITPGACPAHYPTNWTTDEDQNPPSSSSACPDYFRWIHE 145 Query: 1636 DLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELY 1457 DLRPW TGIT +E RTANFRL I++G+ Y+ +KSFQTRD FT+WG +QL+ Y Sbjct: 146 
DLRPWARTGITRATLEAGQRTANFRLLILNGKAYVETYKKSFQTRDTFTVWGILQLLRRY 205 Query: 1456 PGILPDVDLMFDCVDWPVIGKKYDESSSSP-PPPLFRYCSDNEHLDIPLPDWSFWGWAEV 1280 PG +PD+DLMFDCVDWPVI + + P PPPLFRYC D+ DI PDWSFWGW E+ Sbjct: 206 PGKVPDLDLMFDCVDWPVILTSHFSGPNGPTPPPLFRYCGDDATFDIVFPDWSFWGWPEI 265 Query: 1279 NTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYI 1100 N KPWE L+KDI +GNKRI W+ R P AYWKGNP VAD RK+L++CN S DWNAR++ Sbjct: 266 NIKPWEPLLKDIKEGNKRIPWKSRQPYAYWKGNPEVADTRKDLIKCNVSDQQDWNARVFA 325 Query: 1099 QDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSR 920 QDW +ES++GYKQSNL+NQC HRYKIY+EGSAWSVS K I+ACDS TLIV P YYDFF+R Sbjct: 326 QDWTKESQEGYKQSNLSNQCLHRYKIYIEGSAWSVSEKYILACDSVTLIVKPHYYDFFTR 385 Query: 919 ALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFH 740 LMP HYWP++ D KC+SIKFAVDWGN H ++A+A+G+A SSF++E+LK+ VYDYMFH Sbjct: 386 GLMPVHHYWPVKDDDKCKSIKFAVDWGNSHKQKAQAIGKAASSFIQEELKMDYVYDYMFH 445 Query: 739 MLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPS 560 +L +YSKL+ +KP++P A E+CS++ C E + K+FM +S + P PC + Sbjct: 446 LLSEYSKLLTFKPTLPPNAIELCSEAMACPAE--GLTKKFMTESLVKRPAESNPCTMPSP 503 Query: 559 PDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467 D L + ++++V K E W+ + Sbjct: 504 YDPASLHFVLSRKENSIKQVEKWETSFWNTQ 534 >ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana] gi|10176852|dbj|BAB10058.1| unnamed protein product [Arabidopsis thaliana] gi|48310551|gb|AAT41837.1| At5g23850 [Arabidopsis thaliana] gi|62320258|dbj|BAD94534.1| putative protein [Arabidopsis thaliana] gi|332005839|gb|AED93222.1| uncharacterized protein AT5G23850 [Arabidopsis thaliana] Length = 542 Score = 506 bits (1302), Expect = e-140 Identities = 227/409 (55%), Positives = 297/409 (72%), Gaps = 1/409 (0%) Frame = -3 Query: 1690 TLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSF 1511 T P CPD+F +IHEDLRPW TGIT E +E A +TA FRL IV G++Y+ Q +F Sbjct: 131 TNHPPTATCPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGKIYVEKFQDAF 190 Query: 1510 QTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIGKKYDESSSSP-PPPLFRYCSDN 1334 
QTRDVFTIWGF+QL+ YPG +PD++LMFDCVDWPV+ +++P PPPLFRYC + Sbjct: 191 QTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSPPPLFRYCGNE 250 Query: 1333 EHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKE 1154 E LDI PDWSFWGWAEVN KPWE L+K++ +GN+R W R P AYWKGNP VA+ R++ Sbjct: 251 ETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKGNPMVAETRQD 310 Query: 1153 LMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMA 974 LM+CN S H+WNARLY QDWI+ES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+A Sbjct: 311 LMKCNVSEEHEWNARLYAQDWIKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILA 370 Query: 973 CDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGS 794 CDS TL+V P YYDFF+R L+P HYWP+R KC SIKFAVDWGN H ++A+ +G+A S Sbjct: 371 CDSVTLLVKPHYYDFFTRGLLPAHHYWPVREHDKCRSIKFAVDWGNSHIQKAQDIGKAAS 430 Query: 793 SFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMF 614 F+++DLK+ VYDYM+H+L +YSKL+++KP +P A E+CS++ C RS E++FM Sbjct: 431 DFIQQDLKMDYVYDYMYHLLTEYSKLLQFKPEIPRNAVEICSETMACL--RSGNERKFMT 488 Query: 613 QSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467 +S + P PC + P D + K + ++ + E + WSK+ Sbjct: 489 ESLVKQPADSGPCAMPPPYDPATYYEVVKRKQSTNMRILQWEMKYWSKQ 537 >gb|EMJ21654.1| hypothetical protein PRUPE_ppa005169mg [Prunus persica] Length = 474 Score = 505 bits (1301), Expect = e-140 Identities = 228/419 (54%), Positives = 303/419 (72%), Gaps = 1/419 (0%) Frame = -3 Query: 1720 CKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGR 1541 C N + P P CP++F +IHEDLRPW TGIT +M++ A RTANF+L IV+G+ Sbjct: 54 CTRLLNSRQDPDRPLPPTCPEYFRWIHEDLRPWAHTGITRDMIQRAKRTANFKLVIVNGK 113 Query: 1540 MYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLMFDCVDWPVIGKK-YDESSSSPP 1364 Y+ QKSFQTRDVFT+WG +QL+ YPG +PD++LMFDCVDWPVI Y +++ P Sbjct: 114 AYVEKYQKSFQTRDVFTMWGILQLLRRYPGQVPDLELMFDCVDWPVISSNDYSGPNATAP 173 Query: 1363 PPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKG 1184 PPLFRYC D+ LDI PDWSFWGWAE+N PWE L+KD+ +GNKR W R P AYWKG Sbjct: 174 PPLFRYCGDDNSLDIVFPDWSFWGWAEINIMPWEVLLKDLEEGNKRRRWIDRAPYAYWKG 233 Query: 1183 
NPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSA 1004 NP VA R++L++CN S DWNAR+Y QDW+RES +GYKQS+LA+QC RYKIY+EGSA Sbjct: 234 NPSVAATRQDLLKCNVSDQQDWNARVYAQDWLRESSEGYKQSDLASQCVDRYKIYIEGSA 293 Query: 1003 WSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTK 824 WSVS K I+ACDS TLIV P+YYDFF+R+LMP HYWPI+ D KC SIKFAVDWGN H + Sbjct: 294 WSVSDKYILACDSVTLIVKPRYYDFFTRSLMPVHHYWPIKDDDKCRSIKFAVDWGNSHKQ 353 Query: 823 QAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGE 644 +A+A+G+A S ++E+LK+ VYDYMFH+L++Y+KL+++KP++P A E+CS++ C + Sbjct: 354 KAQAIGKAASKLIQEELKMDYVYDYMFHLLNEYAKLLQFKPTIPRKAIELCSEAMACQAQ 413 Query: 643 RSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467 + EK+FM +S +GP PC + P L + A ++++V E++ W + Sbjct: 414 GT--EKKFMMESMVKGPAVSNPCTMPPPYGPASLFAVLRRNANSIKQVETWEKKYWENQ 470 >ref|XP_002510788.1| KDEL motif-containing protein 1 precursor, putative [Ricinus communis] gi|223549903|gb|EEF51390.1| KDEL motif-containing protein 1 precursor, putative [Ricinus communis] Length = 528 Score = 504 bits (1299), Expect = e-140 Identities = 229/442 (51%), Positives = 313/442 (70%), Gaps = 2/442 (0%) Frame = -3 Query: 1783 NSSNPIHKP-RVVNFNCTSREPCKARKNLSKKTLEPNPNKCPDFFMFIHEDLRPWKETGI 1607 N+ N I+ P FN T P ++ P+ + CP+++ +I+EDLRPW TGI Sbjct: 86 NALNKINIPLNCAAFNLTRTCPSNYPTTFTENPDRPSVSACPEYYRWIYEDLRPWARTGI 145 Query: 1606 TLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGILPDVDLM 1427 + +MVE A TANFRL IV+G+ Y+ +++FQTRDVFT+WG +QL+ YPG +PD++LM Sbjct: 146 SRDMVERAKTTANFRLVIVNGKAYVEKYRRAFQTRDVFTLWGILQLLRRYPGKVPDLELM 205 Query: 1426 FDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKPWEHLVK 1250 FDCVDWPVI Y ++ PPPLFRYC D++ LD+ PDWSFWGW+E+N KPWE L++ Sbjct: 206 FDCVDWPVIKSSNYSGPNAMAPPPLFRYCGDDDTLDVVFPDWSFWGWSEINIKPWERLLR 265 Query: 1249 DISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWIRESEQG 1070 ++ +GN++ W +R P AYWKGNP VA+ R++LM+CN S DWNAR+Y QDWI+E +QG Sbjct: 266 ELKEGNEKRRWMEREPYAYWKGNPAVAETRQDLMKCNVSEQQDWNARVYAQDWIKELQQG 325 Query: 1069 
YKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMPGRHYWP 890 YKQSNLA+QC HRYKIY+EGSAWSVS K I+ACDS TL+V P YYDFF+R+L P HYWP Sbjct: 326 YKQSNLASQCMHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRSLRPIHHYWP 385 Query: 889 IRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQYSKLMK 710 I+ KC SIKFAVDWGN H ++A+A+G+A S F++E+LK+ VYDYMFH+L++Y+KL+ Sbjct: 386 IKDYDKCRSIKFAVDWGNNHKQKAQAIGKAASEFIQEELKMDYVYDYMFHLLNEYAKLLT 445 Query: 709 YKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQRLDQWN 530 +KP +P A E+CS+S C + +EK FM +S +GP PC + P D L Sbjct: 446 FKPVIPRKAVELCSESMACPA--NGIEKEFMMESMVQGPAETNPCIMLPPYDPSALHSIF 503 Query: 529 KMRAKALRKVAKMEEESWSKEK 464 + + ++R+V E+ W K+K Sbjct: 504 RRKENSIRQVELWEKMYWDKQK 525 >ref|XP_002875936.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp. lyrata] gi|297321774|gb|EFH52195.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp. lyrata] Length = 539 Score = 504 bits (1297), Expect = e-140 Identities = 221/401 (55%), Positives = 297/401 (74%), Gaps = 1/401 (0%) Frame = -3 Query: 1666 CPDFFMFIHEDLRPWKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTI 1487 CPD+F +IHEDLRPW++TGIT E +E AN TANFRL I++GR+Y+ +++FQTRDVFTI Sbjct: 136 CPDYFRWIHEDLRPWEKTGITREALERANATANFRLAIINGRIYVEKFREAFQTRDVFTI 195 Query: 1486 WGFIQLMELYPGILPDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDIPLP 1310 WGF+QL+ YPG +PD++LMFDCVDWPV+ ++ PPPPLFRYC+++E LDI P Sbjct: 196 WGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDIVFP 255 Query: 1309 DWSFWGWAEVNTKPWEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASH 1130 DWS+WGWAEVN KPWE L+K++ +GN+R W R P AYWKGNP VA+ R +LM+CN S Sbjct: 256 DWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLSE 315 Query: 1129 GHDWNARLYIQDWIRESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIV 950 +DW ARLY QDW++ES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS TL+V Sbjct: 316 EYDWKARLYKQDWVKESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLLV 375 Query: 949 TPKYYDFFSRALMPGRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLK 770 P YYDFF+R + PG HYWP++ D KC SIKFAVDWGN H 
++A+ +G+ S F++++LK Sbjct: 376 KPHYYDFFTRGMFPGHHYWPVKEDDKCRSIKFAVDWGNLHMRKAQDIGKKASEFVQQELK 435 Query: 769 ISNVYDYMFHMLDQYSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPN 590 + VYDYMFH+L QYSKL+++KP +P+ + E+CS++ C R E++FM +S + P Sbjct: 436 MDYVYDYMFHLLIQYSKLLRFKPEIPQNSTELCSEAMAC--PRDGNERKFMMESLVKHPA 493 Query: 589 SIAPCQLDPSPDGQRLDQWNKMRAKALRKVAKMEEESWSKE 467 PC + P D K R ++ + E + W K+ Sbjct: 494 ETGPCAMPPPYDPASFYSVLKRRQSTTSRIEQWESKYWRKQ 534 >ref|XP_006404195.1| hypothetical protein EUTSA_v10010269mg [Eutrema salsugineum] gi|557105314|gb|ESQ45648.1| hypothetical protein EUTSA_v10010269mg [Eutrema salsugineum] Length = 543 Score = 503 bits (1294), Expect = e-139 Identities = 230/447 (51%), Positives = 308/447 (68%), Gaps = 15/447 (3%) Frame = -3 Query: 1762 KPRVVNFNCTS-------------REPCKARKNLSKKTLEPNPNK-CPDFFMFIHEDLRP 1625 KP+ NC + R P R + E +P CPD+F +IHEDLRP Sbjct: 94 KPKEFTLNCAAFSGNETVITCPRNRYPTSLRSGAREDDPERSPPATCPDYFRWIHEDLRP 153 Query: 1624 WKETGITLEMVEMANRTANFRLTIVDGRMYILINQKSFQTRDVFTIWGFIQLMELYPGIL 1445 W++TGIT E +E AN TANFRL I++GR+Y+ +++FQTRDVFTIWGF+QL+ YPG + Sbjct: 154 WEKTGITREALERANATANFRLAIINGRIYVEKFREAFQTRDVFTIWGFVQLLRRYPGKI 213 Query: 1444 PDVDLMFDCVDWPVI-GKKYDESSSSPPPPLFRYCSDNEHLDIPLPDWSFWGWAEVNTKP 1268 PD++LMFDCVDWPV+ ++ PPPLFRYC +NE LDI PDWS+WGWAEVN KP Sbjct: 214 PDLELMFDCVDWPVVKAAEFAGVDQLTPPPLFRYCGNNETLDIVFPDWSYWGWAEVNIKP 273 Query: 1267 WEHLVKDISKGNKRINWEKRVPAAYWKGNPFVADVRKELMQCNASHGHDWNARLYIQDWI 1088 WE L+K++ +GN+R W R P AYWKGNP VA+ R++LM+CN S +DW ARLY QDW+ Sbjct: 274 WESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRQDLMKCNVSEDYDWKARLYPQDWV 333 Query: 1087 RESEQGYKQSNLANQCDHRYKIYVEGSAWSVSLKNIMACDSATLIVTPKYYDFFSRALMP 908 RES++GYKQS+LA+QC HRYKIY+EGSAWSVS K I+ACDS TL+V P YYDFF+R + P Sbjct: 334 RESKEGYKQSDLASQCHHRYKIYIEGSAWSVSEKYILACDSVTLLVKPHYYDFFTRGMFP 393 Query: 907 GRHYWPIRLDKKCESIKFAVDWGNQHTKQAKAMGRAGSSFMKEDLKISNVYDYMFHMLDQ 728 G HYWP++ D KC SIKFAVD+GN H +A+ +G+ S F++++LK+ VYDYM+H+L Q Sbjct: 394 GHHYWPVKEDDKCRSIKFAVDFGNLHMLKAQDIGKKASEFVQQELKMDYVYDYMYHLLTQ 453 
Query: 727 YSKLMKYKPSVPEGAKEVCSDSEYCSGERSRVEKRFMFQSATEGPNSIAPCQLDPSPDGQ 548 YSKL+++KP +P+ A E+CS++ C R E++FM +S + P PC + P D Sbjct: 454 YSKLLRFKPKIPQNATELCSEAMAC--PRDGNERKFMMESLVKRPAETGPCAMPPPYDPA 511 Query: 547 RLDQWNKMRAKALRKVAKMEEESWSKE 467 K R ++ + E + W K+ Sbjct: 512 SFYSVLKRRQSTTSRIEQWESKYWRKQ 538