BLASTX nr result

ID: Cocculus22_contig00012517 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00012517
         (1924 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007048603.1| Uncharacterized protein isoform 1 [Theobroma...   577   e-162
ref|XP_002275105.2| PREDICTED: heparan-alpha-glucosaminide N-ace...   572   e-160
ref|XP_002303734.2| hypothetical protein POPTR_0003s15750g [Popu...   548   e-153
ref|XP_002299365.2| hypothetical protein POPTR_0001s12610g [Popu...   546   e-153
ref|XP_007139131.1| hypothetical protein PHAVU_008G004100g [Phas...   541   e-151
ref|XP_004489126.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   540   e-151
ref|XP_004489125.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   540   e-151
ref|XP_003532336.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   540   e-151
ref|XP_004489127.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   535   e-149
ref|XP_006487659.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   535   e-149
ref|XP_004299204.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   535   e-149
ref|XP_006423878.1| hypothetical protein CICLE_v10028441mg [Citr...   532   e-148
ref|XP_007226448.1| hypothetical protein PRUPE_ppa024277mg [Prun...   531   e-148
gb|AFK42154.1| unknown [Lotus japonicus]                              531   e-148
ref|XP_003552737.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   531   e-148
gb|EYU28584.1| hypothetical protein MIMGU_mgv1a006117mg [Mimulus...   529   e-147
gb|EYU26066.1| hypothetical protein MIMGU_mgv1a018021mg, partial...   523   e-146
ref|XP_004141153.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   515   e-143
gb|AFW71633.1| hypothetical protein ZEAMMB73_862609 [Zea mays]        509   e-141
gb|ACL53164.1| unknown [Zea mays] gi|413937084|gb|AFW71635.1| hy...   509   e-141
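
Note on reading the table above: each row ends with a bit score and an E value, and this legacy report format prints E values such as 1e-162 as "e-162", which most numeric parsers reject. The following is a minimal sketch, not part of the BLAST output, of how one row could be split into its fields. The function name `parse_summary_line` and the regular expression are illustrative assumptions, not an NCBI API.

```python
import re

# Hypothetical pattern for one row of the hit table:
# free-text description, then bit score, then E value at end of line.
SUMMARY_RE = re.compile(
    r"^(?P<desc>\S.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)

def parse_summary_line(line):
    """Return (sequence id, bit score, E value) or None if the line is not a hit row."""
    m = SUMMARY_RE.match(line)
    if m is None:
        return None
    evalue_text = m.group("evalue")
    # Legacy BLAST drops the leading "1" (e.g. "e-162"); restore it for float().
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    seq_id = m.group("desc").split()[0]
    return seq_id, float(m.group("bits")), float(evalue_text)

row = "ref|XP_007048603.1| Uncharacterized protein isoform 1 [Theobroma...   577   e-162"
print(parse_summary_line(row))
```

The table header itself does not match the pattern, because "(bits)" is not numeric, so it is skipped without special handling.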

>ref|XP_007048603.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508700864|gb|EOX92760.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 480

 Score =  577 bits (1488), Expect = e-162
 Identities = 284/434 (65%), Positives = 338/434 (77%)
 Frame = +3

Query: 234  EDGYTDKEMESVRQQHSHYSHSLKIKEGNXXXXXXXXXANVLMADAEDASKRRLVSLDVF 413
            +D   DKEM  +  Q S  + S K +EG                  + A ++RLVSLDVF
Sbjct: 37   DDDDDDKEMGQLALQISETTISNKGEEGPLTPISNSTHLGRRQQQPQHA-QQRLVSLDVF 95

Query: 414  RGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLGLTYKKASCRVLA 593
            RGLT+VLMILVDD GG+LPAINHSPWNG+TLAD+VMPFFLFIVGVSLGLTYK+ SCRV A
Sbjct: 96   RGLTIVLMILVDDVGGLLPAINHSPWNGLTLADYVMPFFLFIVGVSLGLTYKRLSCRVTA 155

Query: 594  TKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIAAAYLLAALCEIW 773
            T+KA LRALKL VLGL LQGG+FHGLN+LTYGV+++ +R MGILQRIA AYL+AA+CEIW
Sbjct: 156  TRKAILRALKLLVLGLFLQGGFFHGLNNLTYGVDIQQMRLMGILQRIAIAYLVAAICEIW 215

Query: 774  LKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIPSESSSIAPRTFS 953
            LK D  V S+++LL K+RFQW+  +               PDWE+QIP  +SS AP+ FS
Sbjct: 216  LKGDHHVKSELNLLKKHRFQWVAALALTIIYISLLYGLYVPDWEYQIPVATSSSAPKFFS 275

Query: 954  VKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDYGPLPPNAPSWCQ 1133
            VKCGVRGDTGPACN VGMIDR++LGI+HLYR+PV+ RT+QCSI+SPDYGPLP +APSWCQ
Sbjct: 276  VKCGVRGDTGPACNVVGMIDRKILGIKHLYRKPVFERTKQCSINSPDYGPLPSDAPSWCQ 335

Query: 1134 APFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGLVALGFILDFTGM 1313
            APFDPEGLLSSVMA+VTC            FK H+DR+ LW+ ++SGL+ LG  LDF GM
Sbjct: 336  APFDPEGLLSSVMAMVTCLVGLHYGQIIVHFKDHRDRIRLWLISSSGLLVLGLALDFFGM 395

Query: 1314 HVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMHALMIYILAACNL 1493
            HVNKALYT SYMCVTAG AGFLFA IY+LVD+C YRR TLVLEWMG HALMIYILAACN+
Sbjct: 396  HVNKALYTFSYMCVTAGAAGFLFAGIYLLVDICGYRRMTLVLEWMGKHALMIYILAACNI 455

Query: 1494 LPVVLQGFYWKKPE 1535
            +P+++QGFYWK+P+
Sbjct: 456  VPIIIQGFYWKQPQ 469
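
Note on the alignment above: in BLASTX the Query coordinates are nucleotide positions and the Sbjct coordinates are amino-acid positions, so the query span should be three times the number of aligned codons. The sketch below, which is not part of the BLAST output, checks that arithmetic for this first alignment using only the values printed in its header and coordinate columns. The helper name `hsp_summary` is hypothetical, it handles forward frames only, and the use of integer truncation for the percentages is an assumption that happens to reproduce the printed 65% and 77% here.

```python
def hsp_summary(q_start, q_end, frame, identities, positives, align_len):
    """Derive simple consistency figures for one forward-frame BLASTX alignment."""
    if frame <= 0:
        raise ValueError("this sketch only handles forward frames +1..+3")
    nt_span = q_end - q_start + 1  # query coordinates are inclusive
    return {
        "codons": nt_span // 3,
        "whole_codons": nt_span % 3 == 0,
        # In forward frame f, codons start at nucleotides f, f+3, f+6, ...
        "in_frame": (q_start - frame) % 3 == 0,
        # Assumption: percentages are truncated, not rounded.
        "pct_identity": 100 * identities // align_len,
        "pct_positive": 100 * positives // align_len,
    }

# Values copied from the alignment above: Query 234..1535, Frame = +3,
# Identities = 284/434, Positives = 338/434.
first_hsp = hsp_summary(234, 1535, 3, 284, 338, 434)
print(first_hsp)
```

For this alignment the query span of 1302 nucleotides corresponds to 434 codons, which equals the alignment length in the Identities field.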


>ref|XP_002275105.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            [Vitis vinifera] gi|296085565|emb|CBI29297.3| unnamed
            protein product [Vitis vinifera]
          Length = 444

 Score =  572 bits (1473), Expect = e-160
 Identities = 279/387 (72%), Positives = 311/387 (80%)
 Frame = +3

Query: 375  DASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSL 554
            +ASKRRLVSLDVFRGLTV +MILVDDAGG+LPAINHSPWNG+TLADFVMPFFLFIVGVSL
Sbjct: 47   NASKRRLVSLDVFRGLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 106

Query: 555  GLTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRI 734
             L YK  S   LATK A +RALKL V GL LQGGYFHGLN+LTYGV++E IR  GILQRI
Sbjct: 107  ALAYKNLSSGYLATKMAVVRALKLLVFGLFLQGGYFHGLNNLTYGVDIEQIRLAGILQRI 166

Query: 735  AAAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQI 914
            A AY LAA+CEIWLK D  V S  SLL KY+FQW VV+               PDWE+ I
Sbjct: 167  AVAYFLAAVCEIWLKGDSNVKSGSSLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYSI 226

Query: 915  PSESSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPD 1094
            PSE+SS A + F VKCGVR DTGPACNAVGMIDR VLGIQHLY+RP+Y R +QCSI+SPD
Sbjct: 227  PSETSSSALKIFKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSPD 286

Query: 1095 YGPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASG 1274
            YGPLPPNAP+WCQAPFDPEGLLSSVMAIVTC            FK HKDR+L W+  +S 
Sbjct: 287  YGPLPPNAPTWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSSC 346

Query: 1275 LVALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGM 1454
            L+ LGF LDF GMHVNKALYT+SYMCVTAG AG LFA IY++VDM  YRRPT+V+EWMGM
Sbjct: 347  LLVLGFALDFFGMHVNKALYTLSYMCVTAGAAGILFAGIYLMVDMYGYRRPTIVMEWMGM 406

Query: 1455 HALMIYILAACNLLPVVLQGFYWKKPE 1535
            HALMIYILAACN+LPV LQGFYW++P+
Sbjct: 407  HALMIYILAACNILPVFLQGFYWRRPQ 433


>ref|XP_002303734.2| hypothetical protein POPTR_0003s15750g [Populus trichocarpa]
            gi|550343268|gb|EEE78713.2| hypothetical protein
            POPTR_0003s15750g [Populus trichocarpa]
          Length = 464

 Score =  548 bits (1412), Expect = e-153
 Identities = 263/387 (67%), Positives = 307/387 (79%)
 Frame = +3

Query: 372  EDASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVS 551
            +   ++RLVSLDVFRGLTV LMILVDDAGGVLPAINHSPWNG+TLAD VMPFFLF+VGVS
Sbjct: 66   QQQQQQRLVSLDVFRGLTVALMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFMVGVS 125

Query: 552  LGLTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQR 731
            LGLTYKK   + +AT+KA LRALKL V+GL LQGG+ HGLNDLT+GV++  IRWMGILQR
Sbjct: 126  LGLTYKKLPSKAVATRKAILRALKLLVIGLFLQGGFLHGLNDLTFGVDMVQIRWMGILQR 185

Query: 732  IAAAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQ 911
            IA  YL+ A+CEIWLK D+ V S +S+L KY+ QW  V+               PDWE++
Sbjct: 186  IAIGYLIGAMCEIWLKGDNHVASGLSMLRKYQLQWGAVVVLVSLYLSLLYGLYVPDWEYE 245

Query: 912  IPSESSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSP 1091
            IP  +SS +P+ F VKCGVRG TG ACNAVGMIDR VLGIQHLYR+P+Y RT+ CSI+SP
Sbjct: 246  IPVAASSSSPKIFRVKCGVRGTTGSACNAVGMIDRTVLGIQHLYRKPIYARTKACSINSP 305

Query: 1092 DYGPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTAS 1271
            DYGPLPP+APSWCQAPFDPEGLLSSVMAIVTC            FK HKDR+L W+  ++
Sbjct: 306  DYGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRILHWMVPST 365

Query: 1272 GLVALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMG 1451
              V LG +LD +GMHVNKALYT SYMCVTAG AG +F  IY+LVD+C +RRPTLVLEWMG
Sbjct: 366  CFVVLGLVLDLSGMHVNKALYTFSYMCVTAGAAGIVFTGIYMLVDVCGFRRPTLVLEWMG 425

Query: 1452 MHALMIYILAACNLLPVVLQGFYWKKP 1532
            MHALMI+ILA  N+LPVV+QGFYWK+P
Sbjct: 426  MHALMIFILATSNVLPVVMQGFYWKQP 452


>ref|XP_002299365.2| hypothetical protein POPTR_0001s12610g [Populus trichocarpa]
            gi|550347105|gb|EEE84170.2| hypothetical protein
            POPTR_0001s12610g [Populus trichocarpa]
          Length = 453

 Score =  546 bits (1408), Expect = e-153
 Identities = 277/462 (59%), Positives = 328/462 (70%), Gaps = 2/462 (0%)
 Frame = +3

Query: 153  MGAYELIKGEEKGGADLTKTNSCFLSVEDGYTDKEMESVRQQHSHY--SHSLKIKEGNXX 326
            M  Y  IKG+E+   D  K  S   + E G+ D + E+V +  S    +HS +       
Sbjct: 1    MAVYTPIKGQEEDNGDRNK--SLKSNDEGGHVDDD-EAVDENKSFILPTHSRQ------- 50

Query: 327  XXXXXXXANVLMADAEDASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTL 506
                         D +   ++RLVSLDVFRGLTV LMILVDDAGGVLPAINHSPWNG+TL
Sbjct: 51   -----------QDDVQRQKQQRLVSLDVFRGLTVALMILVDDAGGVLPAINHSPWNGLTL 99

Query: 507  ADFVMPFFLFIVGVSLGLTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTY 686
            AD VMPFFLFIVGVSLGLTYKK SC+ +AT+KA LR LKL ++GL LQGG+ HGLNDLTY
Sbjct: 100  ADVVMPFFLFIVGVSLGLTYKKLSCKAVATRKAILRTLKLLIIGLFLQGGFLHGLNDLTY 159

Query: 687  GVNLEYIRWMGILQRIAAAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXX 866
            GV++  IRWMGILQRIA  YL+ A+CEIWLK  + V S +S+L KY+FQW  V+      
Sbjct: 160  GVDMTQIRWMGILQRIAIGYLVGAMCEIWLKGGNHVTSGLSMLRKYQFQWAAVLMFVTIY 219

Query: 867  XXXXXXXXXPDWEFQIPSESSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYR 1046
                     PDWE+QIP  +S+  P+ F VKCGVRG TGPACNA GMIDR +LGIQHLYR
Sbjct: 220  LSLLYGLHVPDWEYQIPVAASASTPKIFPVKCGVRGHTGPACNAGGMIDRTILGIQHLYR 279

Query: 1047 RPVYGRTEQCSISSPDYGPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXF 1226
            +P+Y RT+ CSI+SP YGPLPP+APSWCQAPFDPEGLLSSVMAIVTC            F
Sbjct: 280  KPIYARTKPCSINSPGYGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHF 339

Query: 1227 KGHKDRLLLWVGTASGLVALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVD 1406
            K HKDR L W+  ++  + LG +LD  GMHVNKALYT SYMCVTAG AG +F  IY+LVD
Sbjct: 340  KEHKDRTLHWMVPSTCFLVLGLVLDLLGMHVNKALYTFSYMCVTAGAAGIVFTGIYLLVD 399

Query: 1407 MCEYRRPTLVLEWMGMHALMIYILAACNLLPVVLQGFYWKKP 1532
            +C +R P LVLEWMGMHAL+I+ LA  N+LPVVLQGFYWK+P
Sbjct: 400  VCGFRWPMLVLEWMGMHALLIFTLATSNILPVVLQGFYWKQP 441


>ref|XP_007139131.1| hypothetical protein PHAVU_008G004100g [Phaseolus vulgaris]
            gi|561012264|gb|ESW11125.1| hypothetical protein
            PHAVU_008G004100g [Phaseolus vulgaris]
          Length = 460

 Score =  541 bits (1395), Expect = e-151
 Identities = 262/381 (68%), Positives = 301/381 (79%)
 Frame = +3

Query: 390  RLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLGLTYK 569
            RLVSLDVFRGLTV LMILVDDAGG++PA+NHSPWNG+TLAD+VMPFFLFIVGVSL LTYK
Sbjct: 69   RLVSLDVFRGLTVALMILVDDAGGLIPALNHSPWNGLTLADYVMPFFLFIVGVSLALTYK 128

Query: 570  KASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIAAAYL 749
            K SCRV A++KA LRALKL VLGL LQGGYFH +NDLTYGV+++ IRWMGILQRIA AYL
Sbjct: 129  KLSCRVDASRKAGLRALKLLVLGLFLQGGYFHRVNDLTYGVDIKQIRWMGILQRIALAYL 188

Query: 750  LAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIPSESS 929
            +AALCEIWLK+DD V+S  SLL KYR+QW+V                 PDWE+QI +E S
Sbjct: 189  VAALCEIWLKSDDSVNSGPSLLRKYRYQWVVAFMISFLYLCLLYGLYVPDWEYQIQTEPS 248

Query: 930  SIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDYGPLP 1109
            S  P+TFSVKCG RGD GPACNAVGMIDR VLGIQHLYRRP+Y R  +CSI+SP+YGPLP
Sbjct: 249  S-EPKTFSVKCGRRGDIGPACNAVGMIDRTVLGIQHLYRRPIYARMPECSINSPNYGPLP 307

Query: 1110 PNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGLVALG 1289
            P+AP+WCQAPFDPEGLLSSVMAIVTC            FK H+ R++ W+   S LV  G
Sbjct: 308  PDAPAWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHRVRIIYWMIPTSCLVVFG 367

Query: 1290 FILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMHALMI 1469
              LD  GM +NK LY++SY CVTAG AG LF  IY++VD+C YRR T+ +EWMGMHALMI
Sbjct: 368  LALDLFGMDINKVLYSLSYTCVTAGAAGILFVGIYLMVDVCGYRRMTIFMEWMGMHALMI 427

Query: 1470 YILAACNLLPVVLQGFYWKKP 1532
            YILAACN+ P+ LQGFYW  P
Sbjct: 428  YILAACNVFPIFLQGFYWGSP 448


>ref|XP_004489126.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            isoform X2 [Cicer arietinum]
          Length = 460

 Score =  540 bits (1391), Expect = e-151
 Identities = 264/449 (58%), Positives = 321/449 (71%), Gaps = 5/449 (1%)
 Frame = +3

Query: 201  LTKTNSCFLSVEDGYTDKEMESVRQQ-HSHYSHSL----KIKEGNXXXXXXXXXANVLMA 365
            +T+      S E+   D+++E   QQ H+  S S+      K+ N               
Sbjct: 1    MTRNYEAIKSFEENENDEDLEMGSQQKHNQISDSIIKLNNNKKKNKIEECSMQKQQHQQQ 60

Query: 366  DAEDASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVG 545
              +     RL+SLDVFRGLTV LMI+VDD GG++PA+NHSPWNG+T+ADFVMPFFLFIVG
Sbjct: 61   QQQQPQSHRLLSLDVFRGLTVALMIVVDDIGGLVPALNHSPWNGLTIADFVMPFFLFIVG 120

Query: 546  VSLGLTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGIL 725
            V+L  TYKK SC+V AT+KA LRALKL  LG+ LQGGYFH +ND+T+GV+L+ IRWMGIL
Sbjct: 121  VALAFTYKKPSCKVDATRKAILRALKLLALGIFLQGGYFHRVNDMTFGVDLKQIRWMGIL 180

Query: 726  QRIAAAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWE 905
            QRIA AYL+AALCEIWLK +DIV+S  SL  KYR+QW + +               PDWE
Sbjct: 181  QRIAVAYLIAALCEIWLKREDIVNSASSLFRKYRYQWALALFLSFIYLCLLYGMYVPDWE 240

Query: 906  FQIPSESSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSIS 1085
            +++P+E SS+ P+ FSVKCGVR DTGPACN VGMIDR++LGIQHLYRRP+Y RT +CSI+
Sbjct: 241  YEVPTEPSSV-PKIFSVKCGVRADTGPACNVVGMIDRKILGIQHLYRRPIYARTPECSIN 299

Query: 1086 SPDYGPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGT 1265
            SPD GPLPP+APSWCQAPFDPEGLLSSVMAIVTC            FK H+ R++ W+  
Sbjct: 300  SPDSGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIILHFKDHRIRIIYWMIP 359

Query: 1266 ASGLVALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEW 1445
             S LV  G  LD  GMH+NK LY++SY CVTAG AG LF  IY++VD+C Y R T VLEW
Sbjct: 360  TSCLVVSGLALDLLGMHLNKVLYSLSYTCVTAGAAGILFTGIYLMVDVCGYSRMTSVLEW 419

Query: 1446 MGMHALMIYILAACNLLPVVLQGFYWKKP 1532
            MGMHALMI+ILAACN+ P+ LQGFYW  P
Sbjct: 420  MGMHALMIFILAACNIFPIFLQGFYWGNP 448


>ref|XP_004489125.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            isoform X1 [Cicer arietinum]
          Length = 463

 Score =  540 bits (1391), Expect = e-151
 Identities = 264/449 (58%), Positives = 321/449 (71%), Gaps = 5/449 (1%)
 Frame = +3

Query: 201  LTKTNSCFLSVEDGYTDKEMESVRQQ-HSHYSHSL----KIKEGNXXXXXXXXXANVLMA 365
            +T+      S E+   D+++E   QQ H+  S S+      K+ N               
Sbjct: 1    MTRNYEAIKSFEENENDEDLEMGSQQKHNQISDSIIKLNNNKKKNKIEECSMQKQQHQQQ 60

Query: 366  DAEDASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVG 545
              +     RL+SLDVFRGLTV LMI+VDD GG++PA+NHSPWNG+T+ADFVMPFFLFIVG
Sbjct: 61   QQQQPQSHRLLSLDVFRGLTVALMIVVDDIGGLVPALNHSPWNGLTIADFVMPFFLFIVG 120

Query: 546  VSLGLTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGIL 725
            V+L  TYKK SC+V AT+KA LRALKL  LG+ LQGGYFH +ND+T+GV+L+ IRWMGIL
Sbjct: 121  VALAFTYKKPSCKVDATRKAILRALKLLALGIFLQGGYFHRVNDMTFGVDLKQIRWMGIL 180

Query: 726  QRIAAAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWE 905
            QRIA AYL+AALCEIWLK +DIV+S  SL  KYR+QW + +               PDWE
Sbjct: 181  QRIAVAYLIAALCEIWLKREDIVNSASSLFRKYRYQWALALFLSFIYLCLLYGMYVPDWE 240

Query: 906  FQIPSESSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSIS 1085
            +++P+E SS+ P+ FSVKCGVR DTGPACN VGMIDR++LGIQHLYRRP+Y RT +CSI+
Sbjct: 241  YEVPTEPSSV-PKIFSVKCGVRADTGPACNVVGMIDRKILGIQHLYRRPIYARTPECSIN 299

Query: 1086 SPDYGPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGT 1265
            SPD GPLPP+APSWCQAPFDPEGLLSSVMAIVTC            FK H+ R++ W+  
Sbjct: 300  SPDSGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIILHFKDHRIRIIYWMIP 359

Query: 1266 ASGLVALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEW 1445
             S LV  G  LD  GMH+NK LY++SY CVTAG AG LF  IY++VD+C Y R T VLEW
Sbjct: 360  TSCLVVSGLALDLLGMHLNKVLYSLSYTCVTAGAAGILFTGIYLMVDVCGYSRMTSVLEW 419

Query: 1446 MGMHALMIYILAACNLLPVVLQGFYWKKP 1532
            MGMHALMI+ILAACN+ P+ LQGFYW  P
Sbjct: 420  MGMHALMIFILAACNIFPIFLQGFYWGNP 448


>ref|XP_003532336.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            [Glycine max]
          Length = 463

 Score =  540 bits (1391), Expect = e-151
 Identities = 261/381 (68%), Positives = 301/381 (79%)
 Frame = +3

Query: 390  RLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLGLTYK 569
            RLVSLDVFRGLTV LMILVDDAGG++PA+NHSPWNG+TLAD+VMPFFLFIVGVSL LTYK
Sbjct: 72   RLVSLDVFRGLTVALMILVDDAGGLIPALNHSPWNGLTLADYVMPFFLFIVGVSLALTYK 131

Query: 570  KASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIAAAYL 749
            K SC V A++KA+LRALKL VLGL LQGGYFH +NDLTYGV+L+ IRWMGILQRI  AYL
Sbjct: 132  KLSCGVDASRKASLRALKLLVLGLFLQGGYFHRVNDLTYGVDLKQIRWMGILQRIGVAYL 191

Query: 750  LAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIPSESS 929
            +AALCEIWLK+DD V+S  SLL KYR+QW V +               PDW +QI +E S
Sbjct: 192  VAALCEIWLKSDDTVNSGPSLLRKYRYQWAVALILSFLYLCLLYGLYVPDWVYQIQTEPS 251

Query: 930  SIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDYGPLP 1109
            S  P+TFSVKCGVRG+TGPACNAVGMIDR +LGI HLY+RP+Y R  +CSI+SP+YGPLP
Sbjct: 252  S-EPKTFSVKCGVRGNTGPACNAVGMIDRTILGIHHLYQRPIYARMPECSINSPNYGPLP 310

Query: 1110 PNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGLVALG 1289
            P+AP+WCQAPFDPEGLLSSVMAIVTC            FK H+ R++ W+   S LV  G
Sbjct: 311  PDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHFKDHRVRIIYWMIPTSCLVVFG 370

Query: 1290 FILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMHALMI 1469
              LD  GMH+NK LY++SY CVTAG AG LF  IY++VD+C  RR TLVLEWMGMHALMI
Sbjct: 371  LALDLFGMHINKVLYSLSYTCVTAGAAGILFVGIYLMVDVCGCRRMTLVLEWMGMHALMI 430

Query: 1470 YILAACNLLPVVLQGFYWKKP 1532
            YILAACN+ P+ LQGFYW  P
Sbjct: 431  YILAACNVFPIFLQGFYWGSP 451


>ref|XP_004489127.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            isoform X3 [Cicer arietinum]
          Length = 465

 Score =  535 bits (1378), Expect = e-149
 Identities = 264/451 (58%), Positives = 321/451 (71%), Gaps = 7/451 (1%)
 Frame = +3

Query: 201  LTKTNSCFLSVEDGYTDKEMESVRQQ-HSHYSHSL----KIKEGNXXXXXXXXXANVLMA 365
            +T+      S E+   D+++E   QQ H+  S S+      K+ N               
Sbjct: 1    MTRNYEAIKSFEENENDEDLEMGSQQKHNQISDSIIKLNNNKKKNKIEECSMQKQQHQQQ 60

Query: 366  DAEDASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVG 545
              +     RL+SLDVFRGLTV LMI+VDD GG++PA+NHSPWNG+T+ADFVMPFFLFIVG
Sbjct: 61   QQQQPQSHRLLSLDVFRGLTVALMIVVDDIGGLVPALNHSPWNGLTIADFVMPFFLFIVG 120

Query: 546  VSLGLTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGIL 725
            V+L  TYKK SC+V AT+KA LRALKL  LG+ LQGGYFH +ND+T+GV+L+ IRWMGIL
Sbjct: 121  VALAFTYKKPSCKVDATRKAILRALKLLALGIFLQGGYFHRVNDMTFGVDLKQIRWMGIL 180

Query: 726  QRIAAAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWE 905
            QRIA AYL+AALCEIWLK +DIV+S  SL  KYR+QW + +               PDWE
Sbjct: 181  QRIAVAYLIAALCEIWLKREDIVNSASSLFRKYRYQWALALFLSFIYLCLLYGMYVPDWE 240

Query: 906  FQIPSESSSIAPRTFSV--KCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCS 1079
            +++P+E SS+ P+ FSV  KCGVR DTGPACN VGMIDR++LGIQHLYRRP+Y RT +CS
Sbjct: 241  YEVPTEPSSV-PKIFSVSVKCGVRADTGPACNVVGMIDRKILGIQHLYRRPIYARTPECS 299

Query: 1080 ISSPDYGPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWV 1259
            I+SPD GPLPP+APSWCQAPFDPEGLLSSVMAIVTC            FK H+ R++ W+
Sbjct: 300  INSPDSGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIILHFKDHRIRIIYWM 359

Query: 1260 GTASGLVALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVL 1439
               S LV  G  LD  GMH+NK LY++SY CVTAG AG LF  IY++VD+C Y R T VL
Sbjct: 360  IPTSCLVVSGLALDLLGMHLNKVLYSLSYTCVTAGAAGILFTGIYLMVDVCGYSRMTSVL 419

Query: 1440 EWMGMHALMIYILAACNLLPVVLQGFYWKKP 1532
            EWMGMHALMI+ILAACN+ P+ LQGFYW  P
Sbjct: 420  EWMGMHALMIFILAACNIFPIFLQGFYWGNP 450


>ref|XP_006487659.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            [Citrus sinensis]
          Length = 447

 Score =  535 bits (1377), Expect = e-149
 Identities = 250/384 (65%), Positives = 305/384 (79%)
 Frame = +3

Query: 384  KRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLGLT 563
            +RRL+SLDVFRGLTV LMILVDD GG+LPAINHSPWNG+TLADFVMPFFLFIVGVSL LT
Sbjct: 53   QRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALT 112

Query: 564  YKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIAAA 743
            YK   C+V+AT+KA LRAL LF+LG+ LQGG+FHG+N+L YGV++  IRWMG+LQRIA A
Sbjct: 113  YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIA 172

Query: 744  YLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIPSE 923
            YL+AALCEIWLK D  V S++SL  KYR  W+V +               PDW+++ P E
Sbjct: 173  YLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVE 232

Query: 924  SSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDYGP 1103
            +SS +P  F+V CGVRG TGPACNAVG+IDR++LGIQHLYR+P+Y RT+QCSI+SPDYGP
Sbjct: 233  TSSSSPWIFNVTCGVRGSTGPACNAVGVIDRKILGIQHLYRKPIYSRTKQCSINSPDYGP 292

Query: 1104 LPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGLVA 1283
            +P +APSWCQAPFDPEGLLSSVMA VTC            FK H+DR+L W+  +S L+ 
Sbjct: 293  MPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIG 352

Query: 1284 LGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMHAL 1463
            LG  LDF GMH+NKALY++SY C+TAG +G L A IY +VD+  +RR T+V EWMG+HAL
Sbjct: 353  LGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVDVQGHRRVTMVFEWMGLHAL 412

Query: 1464 MIYILAACNLLPVVLQGFYWKKPE 1535
            MIYIL ACN+LPV+LQGFYW++P+
Sbjct: 413  MIYILVACNILPVLLQGFYWRQPQ 436


>ref|XP_004299204.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            [Fragaria vesca subsp. vesca]
          Length = 475

 Score =  535 bits (1377), Expect = e-149
 Identities = 254/388 (65%), Positives = 306/388 (78%)
 Frame = +3

Query: 372  EDASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVS 551
            +   ++RLVSLDVFRGLTVVLMILVD+AGG++PAINH+PWNG+TLAD VMPFFLF+VGVS
Sbjct: 78   QQPQQQRLVSLDVFRGLTVVLMILVDEAGGLVPAINHAPWNGLTLADLVMPFFLFMVGVS 137

Query: 552  LGLTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQR 731
            L L YKK SCR +ATKKA LR LKL  LGL LQGG+FHG+ +LT+GV++  +RWMGILQR
Sbjct: 138  LSLVYKKLSCRAIATKKALLRGLKLLALGLFLQGGFFHGIKELTFGVDIAKMRWMGILQR 197

Query: 732  IAAAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQ 911
            IA  YL+AALCEIWLK D+ V S  SLL KY+ Q ++ +               PDWE+Q
Sbjct: 198  IAIGYLVAALCEIWLKGDETVTSGSSLLRKYKLQLVMAVMITVTYLSLLYGLHVPDWEYQ 257

Query: 912  IPSESSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSP 1091
            + +  SS AP+TFSVKCGVRGDTGPACNAVGMIDR++LG+QHLYRRP+Y RT QCSI+SP
Sbjct: 258  VSTGPSS-APKTFSVKCGVRGDTGPACNAVGMIDRKLLGLQHLYRRPIYSRTAQCSINSP 316

Query: 1092 DYGPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTAS 1271
            DYG LP +APSWCQAPFDPEG+LSS+MAIVTC            FK H++R+L W  +++
Sbjct: 317  DYGRLPAHAPSWCQAPFDPEGVLSSLMAIVTCLVGLHYGHIINHFKDHRNRVLHWTISST 376

Query: 1272 GLVALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMG 1451
             LV LG  LD  GMH+NKALYT SYMC+TAG AG +FA IY++VD+C YRRPT+VLEWMG
Sbjct: 377  SLVVLGLALDLFGMHINKALYTFSYMCMTAGSAGIIFAGIYLMVDVCGYRRPTIVLEWMG 436

Query: 1452 MHALMIYILAACNLLPVVLQGFYWKKPE 1535
             HAL I++L ACNLLPV+L GFYW KP+
Sbjct: 437  KHALAIFVLIACNLLPVILHGFYWGKPQ 464


>ref|XP_006423878.1| hypothetical protein CICLE_v10028441mg [Citrus clementina]
            gi|557525812|gb|ESR37118.1| hypothetical protein
            CICLE_v10028441mg [Citrus clementina]
          Length = 447

 Score =  532 bits (1371), Expect = e-148
 Identities = 249/384 (64%), Positives = 304/384 (79%)
 Frame = +3

Query: 384  KRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLGLT 563
            +RRL+SLDVFRGLTV LMILVDD GG+LPAINHSPWNG+TLADFVMPFFLFIVGVSL LT
Sbjct: 53   QRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALT 112

Query: 564  YKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIAAA 743
            YK   C+V+AT+KA  RAL LF+LG+ LQGG+FHG+N+L YGV++  IRWMG+LQRIA +
Sbjct: 113  YKNFPCKVVATRKAIPRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIS 172

Query: 744  YLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIPSE 923
            YL+AALCEIWLK D  V S++SL  KYR  W+V +               PDW+++ P E
Sbjct: 173  YLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVE 232

Query: 924  SSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDYGP 1103
            +SS +P  F+V CGVRG TGPACNAVGMIDR++LGIQHLYR+P+Y RT+QCSI+SPDYGP
Sbjct: 233  TSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGP 292

Query: 1104 LPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGLVA 1283
            +P +APSWCQAPFDPEGLLSSVMA VTC            FK H+DR+L W+  +S L+ 
Sbjct: 293  MPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIG 352

Query: 1284 LGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMHAL 1463
            LG  LDF GMH+NKALY++SY C+TAG +G L A IY +VD+  +RR T+V EWMG+HAL
Sbjct: 353  LGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVDVQGHRRVTMVFEWMGLHAL 412

Query: 1464 MIYILAACNLLPVVLQGFYWKKPE 1535
            MIYIL ACN+LPV+LQGFYW++P+
Sbjct: 413  MIYILVACNILPVLLQGFYWRQPQ 436


>ref|XP_007226448.1| hypothetical protein PRUPE_ppa024277mg [Prunus persica]
            gi|462423384|gb|EMJ27647.1| hypothetical protein
            PRUPE_ppa024277mg [Prunus persica]
          Length = 468

 Score =  531 bits (1369), Expect = e-148
 Identities = 252/384 (65%), Positives = 294/384 (76%)
 Frame = +3

Query: 384  KRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLGLT 563
            ++RLVSLDVFRG TV +MILVDD GG+LPAINHSPWNG+TLAD VMPFFLF+VGVSL LT
Sbjct: 86   QQRLVSLDVFRGSTVAIMILVDDVGGILPAINHSPWNGLTLADLVMPFFLFMVGVSLSLT 145

Query: 564  YKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIAAA 743
            YKK SC  +AT+K  LR LKL  LGL LQGGYFHG+ DLT+GV++E +RWMGILQRIA A
Sbjct: 146  YKKMSCGTVATRKTVLRTLKLLALGLFLQGGYFHGIKDLTFGVDIEQMRWMGILQRIAIA 205

Query: 744  YLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIPSE 923
            Y +AALCEIWLK DD V+S  SLL KYRFQW   +               PDWE+Q    
Sbjct: 206  YFVAALCEIWLKGDDNVNSGRSLLRKYRFQWSAALIITVLYLSLLYGLHVPDWEYQ---- 261

Query: 924  SSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDYGP 1103
                    F VKCGV GDTGPACNAVGMIDR++LG++HLYRRP+Y RTEQCSI+SPD GP
Sbjct: 262  --------FLVKCGVWGDTGPACNAVGMIDRKILGLRHLYRRPIYARTEQCSINSPDNGP 313

Query: 1104 LPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGLVA 1283
            LP +APSWCQAPFDPEGLLSS+MAIVTC            FK H+DR+L W  ++S L+ 
Sbjct: 314  LPADAPSWCQAPFDPEGLLSSMMAIVTCLVGLHYGHIIVHFKSHRDRILRWSISSSSLII 373

Query: 1284 LGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMHAL 1463
            LG  LD  GMH+NK LYT SYMC+TAG AG LF  IY++VD+C YRRPT+V+EWMGMHAL
Sbjct: 374  LGLALDLLGMHINKPLYTFSYMCITAGSAGILFTAIYLMVDVCGYRRPTIVMEWMGMHAL 433

Query: 1464 MIYILAACNLLPVVLQGFYWKKPE 1535
            MI++L ACNLLPV++ GFYW KP+
Sbjct: 434  MIFVLVACNLLPVIIHGFYWGKPQ 457


>gb|AFK42154.1| unknown [Lotus japonicus]
          Length = 467

 Score =  531 bits (1369), Expect = e-148
 Identities = 254/385 (65%), Positives = 297/385 (77%)
 Frame = +3

Query: 378  ASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLG 557
            +  +RLVS+DVFRGLTV LMILVDDAGG+LPA+NHSPW+G+T+ADFVMP FLFIVG+SL 
Sbjct: 72   SQSQRLVSIDVFRGLTVALMILVDDAGGLLPALNHSPWDGLTIADFVMPLFLFIVGLSLA 131

Query: 558  LTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIA 737
            LTYKK SC V+AT+KA LRALKL  LGL LQGGYFH +NDLT+GV+++ IR MGILQRIA
Sbjct: 132  LTYKKLSCPVIATRKAILRALKLLALGLFLQGGYFHRINDLTFGVDMKQIRLMGILQRIA 191

Query: 738  AAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIP 917
             AYLL ALCEIWLK DDIV S  SLL KYR+QW V                 PDWE+QIP
Sbjct: 192  IAYLLTALCEIWLKCDDIVKSGSSLLRKYRYQWAVAFVLSGFYLCLLYGLYVPDWEYQIP 251

Query: 918  SESSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDY 1097
            ++SSS+ P+TFSVKCGV  DTGPACN VGMIDR++LGIQHLYRRP+Y R  +CSI+SPDY
Sbjct: 252  TDSSSV-PKTFSVKCGVWADTGPACNVVGMIDRKILGIQHLYRRPIYARMPECSINSPDY 310

Query: 1098 GPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGL 1277
            GPLPP+AP+WCQAPFDPEGLLSSVMAIVTC            +K H+ R++ W+   S L
Sbjct: 311  GPLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHYKDHRVRIIHWMIPTSCL 370

Query: 1278 VALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMH 1457
            +  GF L   GMHVNK LY+ SY CVTAG AG L   IY++VD+C Y R T V+EWMG H
Sbjct: 371  IVFGFALHLFGMHVNKVLYSFSYTCVTAGAAGILLVAIYLMVDVCGYSRVTKVMEWMGKH 430

Query: 1458 ALMIYILAACNLLPVVLQGFYWKKP 1532
            ALMIY+LAACN+ P+ LQGFYW  P
Sbjct: 431  ALMIYVLAACNIFPIFLQGFYWGNP 455


>ref|XP_003552737.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            [Glycine max]
          Length = 461

 Score =  531 bits (1369), Expect = e-148
 Identities = 254/381 (66%), Positives = 300/381 (78%)
 Frame = +3

Query: 390  RLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLGLTYK 569
            RLVSLDVFRGLTV LMILVDDAGG++PA+NHSPWNG+TLAD+VMPFFLFIVGVSL L+YK
Sbjct: 70   RLVSLDVFRGLTVALMILVDDAGGLIPALNHSPWNGLTLADYVMPFFLFIVGVSLALSYK 129

Query: 570  KASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIAAAYL 749
            K SC V A++KA+LRALKL  LGL LQGGYFH +NDLT+GV+++ IRWMGILQRIA AYL
Sbjct: 130  KLSCGVDASRKASLRALKLLALGLFLQGGYFHRVNDLTFGVDIKQIRWMGILQRIAVAYL 189

Query: 750  LAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIPSESS 929
            + ALCEIWLK+DD V+S  SLL KYR+QW V +               PDW +QI +E S
Sbjct: 190  VVALCEIWLKSDDTVNSGPSLLRKYRYQWAVALILSFLYLCLLYGLYVPDWVYQIQTEPS 249

Query: 930  SIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDYGPLP 1109
            +  P+TFSVKCGVRG+TGPACN VGMIDR +LGIQHLY+RP+Y R  +CSI+SP+YGPLP
Sbjct: 250  A-EPKTFSVKCGVRGNTGPACNVVGMIDRMILGIQHLYKRPIYARMPECSINSPNYGPLP 308

Query: 1110 PNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGLVALG 1289
            P+AP+WCQAPFDPEGLLSSVMAIVTC            FK H+ R++ W+   S L+  G
Sbjct: 309  PDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHFKDHRVRIIYWMIPTSCLLVFG 368

Query: 1290 FILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMHALMI 1469
              LD  GMH+NK LY++SY CVTAG AG LF  IY++VD+C  RR TLV+EWMGMHALMI
Sbjct: 369  LALDLFGMHINKVLYSLSYTCVTAGAAGVLFVGIYLMVDVCGCRRMTLVMEWMGMHALMI 428

Query: 1470 YILAACNLLPVVLQGFYWKKP 1532
            YILAACN+ P+ LQGFYW  P
Sbjct: 429  YILAACNVFPIFLQGFYWGSP 449


>gb|EYU28584.1| hypothetical protein MIMGU_mgv1a006117mg [Mimulus guttatus]
          Length = 456

 Score =  529 bits (1363), Expect = e-147
 Identities = 260/460 (56%), Positives = 326/460 (70%)
 Frame = +3

Query: 153  MGAYELIKGEEKGGADLTKTNSCFLSVEDGYTDKEMESVRQQHSHYSHSLKIKEGNXXXX 332
            M +Y+LIK   +GG DL   N+C + ++D   D ++ES  Q+ ++ S S           
Sbjct: 1    MASYQLIK---EGGLDLKGNNNCTVRIDD---DDDVESAFQRINNQSSSSSSSS------ 48

Query: 333  XXXXXANVLMADAEDASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLAD 512
                 ++ L   A      RLVSLDVFRGLTVVLMI+VDDAGG++P+INHSPWNG+TLAD
Sbjct: 49   ----PSHRLRPKACTQPSSRLVSLDVFRGLTVVLMIIVDDAGGIIPSINHSPWNGLTLAD 104

Query: 513  FVMPFFLFIVGVSLGLTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGV 692
            FVMPFFLF+VGVSLGL YK   CR  A++KA  R  K  +LG+ LQGGYFHG+N+LTYGV
Sbjct: 105  FVMPFFLFMVGVSLGLVYKNMPCRTTASRKAIFRTAKFLILGVFLQGGYFHGINNLTYGV 164

Query: 693  NLEYIRWMGILQRIAAAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXX 872
            ++  IRWMGILQRIA AYL+ A+CEIWL+ DD V S +SLL KY++ W++V         
Sbjct: 165  DMGQIRWMGILQRIAIAYLVGAMCEIWLRNDDKVGSGLSLLKKYQWHWVMVFMLTTVYLV 224

Query: 873  XXXXXXXPDWEFQIPSESSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRP 1052
                   PDW +Q+P  +S    +   VKCGVRG+TGPACNA GMIDR +LG+QHLYR+P
Sbjct: 225  LLYGLYVPDWSYQVPLSASFEVKK---VKCGVRGNTGPACNAAGMIDRMILGVQHLYRKP 281

Query: 1053 VYGRTEQCSISSPDYGPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKG 1232
            +Y RT+QCS +SPDYGPLPPNAPSWCQAPFDPEGLLS+VMA+ TC            FK 
Sbjct: 282  IYARTQQCSTNSPDYGPLPPNAPSWCQAPFDPEGLLSTVMALATCLIGVQYGHVIVHFKD 341

Query: 1233 HKDRLLLWVGTASGLVALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMC 1412
            HK RLL W+  + G + LG + +  GMH+NKALY+ SYMCVT GVAG L A IY++VD+ 
Sbjct: 342  HKYRLLQWLVPSLGFIILGVLCNIFGMHINKALYSFSYMCVTTGVAGILLASIYLVVDVY 401

Query: 1413 EYRRPTLVLEWMGMHALMIYILAACNLLPVVLQGFYWKKP 1532
             YRR T+VLEWMGM+AL+IY+L ACN+LP++LQGFYW  P
Sbjct: 402  GYRRCTMVLEWMGMNALVIYVLVACNILPLILQGFYWNHP 441


>gb|EYU26066.1| hypothetical protein MIMGU_mgv1a018021mg, partial [Mimulus guttatus]
          Length = 430

 Score =  523 bits (1348), Expect = e-146
 Identities = 243/381 (63%), Positives = 295/381 (77%)
 Frame = +3

Query: 390  RLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLGLTYK 569
            RLVSLDVFRGLTVVLMI+VDDAGG++P+INHSPWNG+TLADFVMPFFLF+VGVSLGL YK
Sbjct: 46   RLVSLDVFRGLTVVLMIIVDDAGGIIPSINHSPWNGLTLADFVMPFFLFMVGVSLGLVYK 105

Query: 570  KASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIAAAYL 749
               CR  A++KA  R +K  +LG+ LQGGYFHG+N+LTYGV++  IRWMGILQRIA AYL
Sbjct: 106  NMPCRATASRKAIFRTVKFLILGVFLQGGYFHGINNLTYGVDMGQIRWMGILQRIAIAYL 165

Query: 750  LAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIPSESS 929
            + A+CEIWL+ D+ V S +SLL KY++ W++V                PDW +Q+P  +S
Sbjct: 166  VGAMCEIWLRNDEKVGSGLSLLKKYQWHWVMVFMLTTVYLVLLYGLYVPDWSYQVPLSAS 225

Query: 930  SIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDYGPLP 1109
            S       VKCGVRG+TGPACNA GMIDR +LGIQHLYR+P+Y RT+QCSI+SPDYGPLP
Sbjct: 226  SEVRELVKVKCGVRGNTGPACNAAGMIDRVILGIQHLYRKPIYARTQQCSINSPDYGPLP 285

Query: 1110 PNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGLVALG 1289
            PNAPSWCQAPFDPEGLLS+VMA+ TC            FK HK RLL W+  +SG + LG
Sbjct: 286  PNAPSWCQAPFDPEGLLSTVMALATCLIGVQYGHVIVHFKDHKYRLLQWLVPSSGFIILG 345

Query: 1290 FILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMHALMI 1469
             + +  GMH+NKALY+ SYMCVT GVAG L A IY++VD+  YRR T+VLEWMGM+AL+I
Sbjct: 346  VLCNIFGMHINKALYSFSYMCVTTGVAGILLASIYLVVDVYGYRRFTMVLEWMGMNALVI 405

Query: 1470 YILAACNLLPVVLQGFYWKKP 1532
            Y+L ACN+LP++LQGFYW  P
Sbjct: 406  YVLVACNILPLILQGFYWNHP 426


>ref|XP_004141153.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            [Cucumis sativus]
          Length = 494

 Score =  515 bits (1326), Expect = e-143
 Identities = 246/384 (64%), Positives = 298/384 (77%), Gaps = 2/384 (0%)
 Frame = +3

Query: 390  RLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLGLTYK 569
            RLVSLDVFRG+TV LMI+VD AGGV+PAINHSPW+G+TLAD VMPFFLFIVGVSL L YK
Sbjct: 100  RLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYK 159

Query: 570  KASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIAAAYL 749
            K   R +AT+KA LR LKL  LGL LQGG+ HG+N+LTYGV+++ IRWMGILQRIA AY 
Sbjct: 160  KIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYF 219

Query: 750  LAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIPSESS 929
            LAALCEIWLK  D V+S+ +L  KY+ Q +  +               PDWE+Q+PS ++
Sbjct: 220  LAALCEIWLKGSDYVNSETALRRKYQLQLVAAVVLTMLYLALSYGLYVPDWEYQVPSLTT 279

Query: 930  S--IAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDYGP 1103
            S   +P+ FSVKCG RGDTGPACNAVGMIDR++ GIQHLY+RP+Y RTEQCSI++PDYGP
Sbjct: 280  SDVASPKIFSVKCGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGP 339

Query: 1104 LPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGLVA 1283
            LPP+APSWCQAPFDPEGLLS+VMA+VTC            FK H+DR+L W+  +S L+ 
Sbjct: 340  LPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIV 399

Query: 1284 LGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMHAL 1463
            L   LDF GMH+NK LYTVSYM VTAG AG LF  IY++VD+  +RR  +V+EWMG HAL
Sbjct: 400  LAIGLDFLGMHINKVLYTVSYMSVTAGAAGLLFTGIYLMVDVYSWRRMNVVMEWMGKHAL 459

Query: 1464 MIYILAACNLLPVVLQGFYWKKPE 1535
            +IY+LAACN+LPV+LQGFY  +P+
Sbjct: 460  VIYVLAACNVLPVILQGFYLGQPQ 483


>gb|AFW71633.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
          Length = 441

 Score =  509 bits (1312), Expect = e-141
 Identities = 245/385 (63%), Positives = 294/385 (76%)
 Frame = +3

Query: 378  ASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLG 557
            A ++RL SLDVFRG+TV+LMI+VDDAGG LPA+NHSPW+GVT+ADF+MPFFLFIVGVSL 
Sbjct: 45   ARQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLT 104

Query: 558  LTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIA 737
            L YK+   RV AT+KA LRALKLF LGLVLQGG+FHG++ LT+GV+L  IR MGILQRIA
Sbjct: 105  LAYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIA 164

Query: 738  AAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIP 917
             AYLLAA+CEIWLK DD VDS   LL +YR+Q  V +               PDWE+QI 
Sbjct: 165  IAYLLAAVCEIWLKGDDDVDSGYGLLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQIA 224

Query: 918  SESSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDY 1097
               SS   ++FSVKCGVRGDTGPACNAVGM+DR VLGI HLYRRPVY RT++CSI  P+ 
Sbjct: 225  GPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYPEN 284

Query: 1098 GPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGL 1277
            GPLPP+APSWCQAPFDPEGLLSSVMAIVTC            F+ H+ R+  W+  +  +
Sbjct: 285  GPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRIASWLVPSFSM 344

Query: 1278 VALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMH 1457
            +AL F++DF GM +NK LYT+SY   TAG AG LFA IY LVD+  +RRPT+ +EWMG H
Sbjct: 345  LALAFVMDFVGMRMNKPLYTMSYTLATAGAAGLLFAGIYALVDLYGFRRPTIAMEWMGKH 404

Query: 1458 ALMIYILAACNLLPVVLQGFYWKKP 1532
            ALMIY+L ACN+LP+ ++GFYW+ P
Sbjct: 405  ALMIYVLVACNILPMFIRGFYWRDP 429


>gb|ACL53164.1| unknown [Zea mays] gi|413937084|gb|AFW71635.1| hypothetical protein
            ZEAMMB73_862609 [Zea mays]
          Length = 482

 Score =  509 bits (1312), Expect = e-141
 Identities = 245/385 (63%), Positives = 294/385 (76%)
 Frame = +3

Query: 378  ASKRRLVSLDVFRGLTVVLMILVDDAGGVLPAINHSPWNGVTLADFVMPFFLFIVGVSLG 557
            A ++RL SLDVFRG+TV+LMI+VDDAGG LPA+NHSPW+GVT+ADF+MPFFLFIVGVSL 
Sbjct: 86   ARQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLT 145

Query: 558  LTYKKASCRVLATKKATLRALKLFVLGLVLQGGYFHGLNDLTYGVNLEYIRWMGILQRIA 737
            L YK+   RV AT+KA LRALKLF LGLVLQGG+FHG++ LT+GV+L  IR MGILQRIA
Sbjct: 146  LAYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIA 205

Query: 738  AAYLLAALCEIWLKTDDIVDSQVSLLNKYRFQWLVVIXXXXXXXXXXXXXXXPDWEFQIP 917
             AYLLAA+CEIWLK DD VDS   LL +YR+Q  V +               PDWE+QI 
Sbjct: 206  IAYLLAAVCEIWLKGDDDVDSGYGLLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQIA 265

Query: 918  SESSSIAPRTFSVKCGVRGDTGPACNAVGMIDREVLGIQHLYRRPVYGRTEQCSISSPDY 1097
               SS   ++FSVKCGVRGDTGPACNAVGM+DR VLGI HLYRRPVY RT++CSI  P+ 
Sbjct: 266  GPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYPEN 325

Query: 1098 GPLPPNAPSWCQAPFDPEGLLSSVMAIVTCXXXXXXXXXXXXFKGHKDRLLLWVGTASGL 1277
            GPLPP+APSWCQAPFDPEGLLSSVMAIVTC            F+ H+ R+  W+  +  +
Sbjct: 326  GPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRIASWLVPSFSM 385

Query: 1278 VALGFILDFTGMHVNKALYTVSYMCVTAGVAGFLFAVIYVLVDMCEYRRPTLVLEWMGMH 1457
            +AL F++DF GM +NK LYT+SY   TAG AG LFA IY LVD+  +RRPT+ +EWMG H
Sbjct: 386  LALAFVMDFVGMRMNKPLYTMSYTLATAGAAGLLFAGIYALVDLYGFRRPTIAMEWMGKH 445

Query: 1458 ALMIYILAACNLLPVVLQGFYWKKP 1532
            ALMIY+L ACN+LP+ ++GFYW+ P
Sbjct: 446  ALMIYVLVACNILPMFIRGFYWRDP 470

