BLASTX nr result

ID: Akebia27_contig00020442 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00020442
         (1610 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007220406.1| hypothetical protein PRUPE_ppa007793mg [Prun...   582   e-163
ref|XP_002511939.1| conserved hypothetical protein [Ricinus comm...   581   e-163
ref|XP_002279469.1| PREDICTED: UDP-galactose:fucoside alpha-3-ga...   581   e-163
ref|XP_004138472.1| PREDICTED: UDP-galactose:fucoside alpha-3-ga...   580   e-163
gb|EXB52709.1| UDP-galactose:fucoside alpha-3-galactosyltransfer...   578   e-162
ref|XP_002273155.1| PREDICTED: UDP-galactose:fucoside alpha-3-ga...   575   e-161
emb|CBI31422.3| unnamed protein product [Vitis vinifera]              571   e-160
ref|XP_002301385.2| hypothetical protein POPTR_0002s16760g [Popu...   570   e-160
ref|XP_007051669.1| Nucleotide-diphospho-sugar transferase famil...   569   e-159
ref|XP_006842583.1| hypothetical protein AMTR_s00077p00159550 [A...   565   e-158
ref|XP_006444918.1| hypothetical protein CICLE_v10020808mg [Citr...   565   e-158
ref|XP_002320168.2| hypothetical protein POPTR_0014s08810g [Popu...   564   e-158
emb|CAN69309.1| hypothetical protein VITISV_003084 [Vitis vinifera]   549   e-153
ref|XP_006396288.1| hypothetical protein EUTSA_v10028748mg [Eutr...   539   e-150
ref|XP_004299518.1| PREDICTED: uncharacterized protein LOC101304...   531   e-148
gb|AAZ94713.1| putative alpha 1,3-xylosyltransferase [Linum usit...   527   e-147
ref|XP_004504649.1| PREDICTED: uncharacterized protein LOC101506...   521   e-145
ref|NP_849279.1| rhamnogalacturonan II specific xylosyltransfera...   521   e-145
ref|XP_002872902.1| hypothetical protein ARALYDRAFT_912111 [Arab...   520   e-144
ref|XP_003531111.1| PREDICTED: UDP-D-xylose:L-fucose alpha-1,3-D...   514   e-143

>ref|XP_007220406.1| hypothetical protein PRUPE_ppa007793mg [Prunus persica]
            gi|462416868|gb|EMJ21605.1| hypothetical protein
            PRUPE_ppa007793mg [Prunus persica]
          Length = 355

 Score =  582 bits (1499), Expect = e-163
 Identities = 273/356 (76%), Positives = 308/356 (86%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            MSS LYQR   H  LS+PYPISPRSS +  +S S+FS T                PW+ +
Sbjct: 1    MSSFLYQRPI-HNPLSDPYPISPRSSSNSQKSYSIFSPTALLVLLSLMVVMGVFFPWVGM 59

Query: 494  PEGLFSGNKNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQE 673
             E LFS  K S+ KW+DYTL QAVSFVA+NGTVIVCAVSQPYLPFLNNWLISI+RQKHQ+
Sbjct: 60   RESLFSVTKPSISKWRDYTLAQAVSFVAQNGTVIVCAVSQPYLPFLNNWLISITRQKHQD 119

Query: 674  KVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILEL 853
            KVLVIAEDYATLYKVN++WPGHAVL+PPA D+QTAHKFGS+GFFNFTSRRPRHLLHILEL
Sbjct: 120  KVLVIAEDYATLYKVNERWPGHAVLVPPALDSQTAHKFGSQGFFNFTSRRPRHLLHILEL 179

Query: 854  GYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSCM 1033
            GYNVMYNDVDMVWLADPFPYL+G HDVYFTDDM AVKPL HS +LP PGKKGRTYICSCM
Sbjct: 180  GYNVMYNDVDMVWLADPFPYLKGKHDVYFTDDMTAVKPLYHSHDLPPPGKKGRTYICSCM 239

Query: 1034 IFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFPS 1213
            IFLRPT+GAK VMKKWIEE++DQPWS+  K+NDQPAFNWAL++   +VD+YLLPQAAFP+
Sbjct: 240  IFLRPTSGAKLVMKKWIEEMKDQPWSRAKKANDQPAFNWALDKLANKVDLYLLPQAAFPT 299

Query: 1214 GGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            GGLYFKN+TWVQETKG+HVIIHNNYI GF+KKIKRFHD+GLWLVDDH+LESPLG++
Sbjct: 300  GGLYFKNKTWVQETKGMHVIIHNNYILGFEKKIKRFHDYGLWLVDDHALESPLGRI 355


>ref|XP_002511939.1| conserved hypothetical protein [Ricinus communis]
            gi|223549119|gb|EEF50608.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 356

 Score =  581 bits (1498), Expect = e-163
 Identities = 274/357 (76%), Positives = 307/357 (85%), Gaps = 1/357 (0%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            M+S L+QR   H  LS+PYP+SPR+S +  R IS+FSRT                PW ++
Sbjct: 1    MTSFLHQRPL-HNSLSDPYPLSPRNSANSQRQISIFSRTGLIVLFSLLLILGVFVPWTEL 59

Query: 494  PEGLFSGNK-NSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQ 670
            P G+FS  K +S+ KW+ YTL QA SFVA+NGTVIVCAVSQPYLPFLNNWLISI+RQKHQ
Sbjct: 60   PNGIFSATKQSSVAKWRQYTLPQAASFVAQNGTVIVCAVSQPYLPFLNNWLISITRQKHQ 119

Query: 671  EKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILE 850
            +KVLVIAEDYATLYKVN+KWPGHAVL+PPAPD+QTAHKFGS+GFFNFTSRRPRHLLH+LE
Sbjct: 120  DKVLVIAEDYATLYKVNEKWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPRHLLHLLE 179

Query: 851  LGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSC 1030
            LGYNVMYNDVDMVWL DPF YL+G HDVYFTDDMAAVKPL+HS +LP PGKKGRTYICSC
Sbjct: 180  LGYNVMYNDVDMVWLGDPFIYLEGKHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 239

Query: 1031 MIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFP 1210
            MIFL PT GAK VMKKWI+ELQ QPWSK  K+NDQPAFNWALN+T GQVD+YLLPQAAFP
Sbjct: 240  MIFLHPTVGAKLVMKKWIKELQAQPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQAAFP 299

Query: 1211 SGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            +GGLYFKN+TWV+ETKG HVIIHNNYITGF+KKIKRF DFGLWLVDDH+ ESPLGKL
Sbjct: 300  TGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAQESPLGKL 356


>ref|XP_002279469.1| PREDICTED: UDP-galactose:fucoside alpha-3-galactosyltransferase
            [Vitis vinifera] gi|297745375|emb|CBI40455.3| unnamed
            protein product [Vitis vinifera]
          Length = 361

 Score =  581 bits (1498), Expect = e-163
 Identities = 281/363 (77%), Positives = 309/363 (85%), Gaps = 7/363 (1%)
 Frame = +2

Query: 314  MSSILYQRQQQ-----HQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXX 478
            MS+ ++QRQQQ     H LLSNP PISPR+ +   R IS+F RT                
Sbjct: 1    MSTSIHQRQQQQPHHHHHLLSNPNPISPRNPLDWRRPISIFGRTGLLILLTLMVVVGVLL 60

Query: 479  PWIDI--PEGLFSGNKNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISI 652
            P + +  P+GL S  + S+ KW+DYTL QA  FVAKNGTVIVCAVSQPYLPFLNNWLISI
Sbjct: 61   PTMKMRMPDGLLS--RASVSKWRDYTLAQAAEFVAKNGTVIVCAVSQPYLPFLNNWLISI 118

Query: 653  SRQKHQEKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRH 832
            +RQKHQ+KVLVIAEDYATLY VNQKWPGHAVL+PPAPDAQTAHKFGS GFFNFTSRRPRH
Sbjct: 119  ARQKHQDKVLVIAEDYATLYTVNQKWPGHAVLVPPAPDAQTAHKFGSMGFFNFTSRRPRH 178

Query: 833  LLHILELGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGR 1012
            LL+ILELGYNVMYNDVDMVWLADPFPYLQG HDVYFTDDMAAVKPLNHS +LP PGKKGR
Sbjct: 179  LLNILELGYNVMYNDVDMVWLADPFPYLQGKHDVYFTDDMAAVKPLNHSHDLPPPGKKGR 238

Query: 1013 TYICSCMIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLL 1192
            TYICSCMIF+RPTNGAK VMKKWIEELQ QPWS+  KSNDQPAFNWALNRT G+VD+YLL
Sbjct: 239  TYICSCMIFMRPTNGAKLVMKKWIEELQAQPWSRAKKSNDQPAFNWALNRTAGEVDLYLL 298

Query: 1193 PQAAFPSGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPL 1372
            PQAAFP+GGLYFKN+TWVQETKG++VIIHNNYITGF+KKIKRF D+GLWLVDDH+ ESPL
Sbjct: 299  PQAAFPTGGLYFKNKTWVQETKGMNVIIHNNYITGFEKKIKRFQDYGLWLVDDHAQESPL 358

Query: 1373 GKL 1381
            GKL
Sbjct: 359  GKL 361


>ref|XP_004138472.1| PREDICTED: UDP-galactose:fucoside alpha-3-galactosyltransferase-like
            [Cucumis sativus] gi|449495244|ref|XP_004159776.1|
            PREDICTED: UDP-galactose:fucoside
            alpha-3-galactosyltransferase-like [Cucumis sativus]
          Length = 355

 Score =  580 bits (1495), Expect = e-163
 Identities = 270/356 (75%), Positives = 308/356 (86%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            M S+L+QR   H  L+N YP+SPRSS S  RS S+FS                  PW++I
Sbjct: 1    MPSVLHQRST-HSSLANSYPLSPRSSSSSERSFSIFSPINLLALLSLIVILGVFLPWMNI 59

Query: 494  PEGLFSGNKNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQE 673
             E +FS +K S  KW++Y+L +A SFVA+NGTVIVCAVSQPYLPFLNNWLIS+SRQKH E
Sbjct: 60   QESIFSSSKVSNSKWREYSLAEAASFVARNGTVIVCAVSQPYLPFLNNWLISLSRQKHHE 119

Query: 674  KVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILEL 853
            KVLVIAEDYATLYKVN++WPGHAVL+PPAPDAQTAHKFGS+GFFNFTSRRPRHLLHILEL
Sbjct: 120  KVLVIAEDYATLYKVNERWPGHAVLVPPAPDAQTAHKFGSQGFFNFTSRRPRHLLHILEL 179

Query: 854  GYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSCM 1033
            GYNVMYNDVDMVWLADPFPYLQG+HDVYFTDDMAAVKPL+HS +LP PGKKGRTYICSCM
Sbjct: 180  GYNVMYNDVDMVWLADPFPYLQGNHDVYFTDDMAAVKPLHHSHDLPPPGKKGRTYICSCM 239

Query: 1034 IFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFPS 1213
            IFLRPT+GAK VM+KWIEEL+ QPWSK  K+NDQPAFNWALN+T G+VD+YLLPQ+AFP+
Sbjct: 240  IFLRPTSGAKLVMRKWIEELKAQPWSKAKKANDQPAFNWALNKTAGEVDLYLLPQSAFPT 299

Query: 1214 GGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            GGLYFKNE+WVQETKG+HVIIHNNYITGF+KKIKRF +F LW VDDH+LESPLG++
Sbjct: 300  GGLYFKNESWVQETKGMHVIIHNNYITGFEKKIKRFREFNLWYVDDHTLESPLGRI 355


>gb|EXB52709.1| UDP-galactose:fucoside alpha-3-galactosyltransferase [Morus
            notabilis]
          Length = 359

 Score =  578 bits (1491), Expect = e-162
 Identities = 275/358 (76%), Positives = 306/358 (85%), Gaps = 2/358 (0%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            MSS L+QR   H  LSNPYP+SPRSS S  RS SLFS T                PW  +
Sbjct: 1    MSSFLHQRPI-HNPLSNPYPLSPRSSSSSQRSFSLFSPTALLLLLSLLVLLGVFLPWAGV 59

Query: 494  PEGLFSGNKNSLLKWKDYTLVQAVSFVAKNG--TVIVCAVSQPYLPFLNNWLISISRQKH 667
            P+G+FS  K S+ KW+ YTL QA SFVA+NG  TVIVCAVSQPYLPFL+NWLISISRQKH
Sbjct: 60   PDGIFSSVKPSITKWRHYTLSQAASFVARNGNGTVIVCAVSQPYLPFLSNWLISISRQKH 119

Query: 668  QEKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHIL 847
            Q+KVLVIAEDYATLYKVN+ WPGHAVL+PPAP++QTAHKFGS+GFFNFTSRRPRHLL IL
Sbjct: 120  QDKVLVIAEDYATLYKVNELWPGHAVLVPPAPESQTAHKFGSQGFFNFTSRRPRHLLQIL 179

Query: 848  ELGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICS 1027
            ELGYNVMYNDVDMVWLADPFPYL+G+HDVYFTDDMA VKPLNHS  LP PGKKGRTYICS
Sbjct: 180  ELGYNVMYNDVDMVWLADPFPYLEGNHDVYFTDDMAQVKPLNHSHELPPPGKKGRTYICS 239

Query: 1028 CMIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAF 1207
            CMIFLRPTNGAK VMKKWIEE+Q+QPWSK  KSNDQPAFNWALN+T G+V +YLLPQAAF
Sbjct: 240  CMIFLRPTNGAKLVMKKWIEEMQEQPWSKTKKSNDQPAFNWALNKTAGEVGLYLLPQAAF 299

Query: 1208 PSGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            P+GGLYFKN+TWVQETK +HVIIHNNYITGF+KKIKRF ++GLWLV+DH+ ESPLG+L
Sbjct: 300  PTGGLYFKNKTWVQETKSMHVIIHNNYITGFEKKIKRFREYGLWLVEDHADESPLGRL 357


>ref|XP_002273155.1| PREDICTED: UDP-galactose:fucoside alpha-3-galactosyltransferase
            [Vitis vinifera]
          Length = 360

 Score =  575 bits (1483), Expect = e-161
 Identities = 278/360 (77%), Positives = 305/360 (84%), Gaps = 4/360 (1%)
 Frame = +2

Query: 314  MSSI--LYQRQQQHQ--LLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXP 481
            MSSI    Q QQQHQ  LLS+ YPIS RSS +  RSIS+F RT                P
Sbjct: 1    MSSIHQRQQLQQQHQQHLLSDSYPISARSSPNWGRSISVFGRTGLLVLLTLVVVLGVVLP 60

Query: 482  WIDIPEGLFSGNKNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQ 661
             +  P+GLF G+K S+ KW++YTL +AV F AKNGT+IVCAVSQPYLPFLNNWLISISRQ
Sbjct: 61   GMRAPDGLFGGSKVSVSKWREYTLEEAVPFAAKNGTLIVCAVSQPYLPFLNNWLISISRQ 120

Query: 662  KHQEKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLH 841
            KHQ+KVLVIAEDYATLY VN +WPGHAVL+PPAPDAQ AHKFGS+GFFNFTSRRPRHLL+
Sbjct: 121  KHQDKVLVIAEDYATLYAVNDRWPGHAVLVPPAPDAQVAHKFGSQGFFNFTSRRPRHLLY 180

Query: 842  ILELGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYI 1021
            ILELGYNVMYNDVDMVWLADPFPYLQG HDVYFTDDM AVKPLNHS +LP PGKKGRTYI
Sbjct: 181  ILELGYNVMYNDVDMVWLADPFPYLQGDHDVYFTDDMTAVKPLNHSHDLPPPGKKGRTYI 240

Query: 1022 CSCMIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQA 1201
            CSCMIF+RPT+GAK VMK WIEELQ QPWS   KSNDQPAFNWALNRT  QVD+YLLPQ 
Sbjct: 241  CSCMIFMRPTDGAKLVMKDWIEELQAQPWSNAKKSNDQPAFNWALNRTAAQVDLYLLPQV 300

Query: 1202 AFPSGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            AFP+GGLYFKN+TWVQETKGLHVIIHNNYITGF+KKIKRF DFGLWLVDD++LESPLG++
Sbjct: 301  AFPTGGLYFKNQTWVQETKGLHVIIHNNYITGFEKKIKRFRDFGLWLVDDYALESPLGRI 360


>emb|CBI31422.3| unnamed protein product [Vitis vinifera]
          Length = 1331

 Score =  571 bits (1472), Expect = e-160
 Identities = 270/356 (75%), Positives = 300/356 (84%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            + S  +Q +    LLS+ YPIS RSS +  RSIS+F RT                P +  
Sbjct: 976  LKSTPFQAEIHQHLLSDSYPISARSSPNWGRSISVFGRTGLLVLLTLVVVLGVVLPGMRA 1035

Query: 494  PEGLFSGNKNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQE 673
            P+GLF G+K S+ KW++YTL +AV F AKNGT+IVCAVSQPYLPFLNNWLISISRQKHQ+
Sbjct: 1036 PDGLFGGSKVSVSKWREYTLEEAVPFAAKNGTLIVCAVSQPYLPFLNNWLISISRQKHQD 1095

Query: 674  KVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILEL 853
            KVLVIAEDYATLY VN +WPGHAVL+PPAPDAQ AHKFGS+GFFNFTSRRPRHLL+ILEL
Sbjct: 1096 KVLVIAEDYATLYAVNDRWPGHAVLVPPAPDAQVAHKFGSQGFFNFTSRRPRHLLYILEL 1155

Query: 854  GYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSCM 1033
            GYNVMYNDVDMVWLADPFPYLQG HDVYFTDDM AVKPLNHS +LP PGKKGRTYICSCM
Sbjct: 1156 GYNVMYNDVDMVWLADPFPYLQGDHDVYFTDDMTAVKPLNHSHDLPPPGKKGRTYICSCM 1215

Query: 1034 IFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFPS 1213
            IF+RPT+GAK VMK WIEELQ QPWS   KSNDQPAFNWALNRT  QVD+YLLPQ AFP+
Sbjct: 1216 IFMRPTDGAKLVMKDWIEELQAQPWSNAKKSNDQPAFNWALNRTAAQVDLYLLPQVAFPT 1275

Query: 1214 GGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            GGLYFKN+TWVQETKGLHVIIHNNYITGF+KKIKRF DFGLWLVDD++LESPLG++
Sbjct: 1276 GGLYFKNQTWVQETKGLHVIIHNNYITGFEKKIKRFRDFGLWLVDDYALESPLGRI 1331


>ref|XP_002301385.2| hypothetical protein POPTR_0002s16760g [Populus trichocarpa]
            gi|550345173|gb|EEE80658.2| hypothetical protein
            POPTR_0002s16760g [Populus trichocarpa]
          Length = 355

 Score =  570 bits (1468), Expect = e-160
 Identities = 275/357 (77%), Positives = 301/357 (84%), Gaps = 1/357 (0%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            MS+ L+QR   +  LS+P P+SPR S S  R ISLFSRT                PW   
Sbjct: 1    MSTFLHQRPL-YSPLSDPDPLSPRQSSSSQRQISLFSRTGLIALLSLLLILGVILPWTGT 59

Query: 494  PEGLFSGNKN-SLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQ 670
            P  +FS  K  SL KW+ YTL QAVSFVAKN TVIVCAVSQPYLPFL+NWLISISRQKHQ
Sbjct: 60   PS-IFSATKPASLTKWQQYTLSQAVSFVAKNKTVIVCAVSQPYLPFLSNWLISISRQKHQ 118

Query: 671  EKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILE 850
            +KVLVIAEDYATLYKVN+KWPGHAVL+PPAPD+QTAHKFGS+GFFNFTSRRPRHLLHILE
Sbjct: 119  DKVLVIAEDYATLYKVNEKWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPRHLLHILE 178

Query: 851  LGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSC 1030
            LGYNVMYNDVDMVWL DPFPYL+G+HDVYFTDDMAAVKPL HS +LP PGKKGRTYICSC
Sbjct: 179  LGYNVMYNDVDMVWLQDPFPYLEGNHDVYFTDDMAAVKPLGHSHDLPPPGKKGRTYICSC 238

Query: 1031 MIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFP 1210
            MIF+ PT+GAK V+KKWIEELQ QPWSK  KSNDQPAFNWALN+T GQVD+YLLPQ AFP
Sbjct: 239  MIFMHPTDGAKLVLKKWIEELQAQPWSKTKKSNDQPAFNWALNKTAGQVDLYLLPQTAFP 298

Query: 1211 SGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            +GGLYFKN+TWVQETKG H IIHNNYITGF+KKIKRFHD+GLWLVD H+ ESPLGKL
Sbjct: 299  TGGLYFKNQTWVQETKGKHAIIHNNYITGFEKKIKRFHDYGLWLVDGHASESPLGKL 355


>ref|XP_007051669.1| Nucleotide-diphospho-sugar transferase family protein isoform 1
            [Theobroma cacao] gi|508703930|gb|EOX95826.1|
            Nucleotide-diphospho-sugar transferase family protein
            isoform 1 [Theobroma cacao]
          Length = 357

 Score =  569 bits (1467), Expect = e-159
 Identities = 272/358 (75%), Positives = 301/358 (84%), Gaps = 2/358 (0%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            MS+ L+QR   H   SN YPISPR S +  R IS+FS T                PW  +
Sbjct: 1    MSAFLHQRPI-HNPFSNAYPISPRPSSAFQRPISIFSPTGLIILLSLMVILGVFLPWSGM 59

Query: 494  PEGLFSGN--KNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKH 667
            P+ +FS +   +SL KW+DYTL +A SFVAKNGTVIVCAVSQPYLPFLNNWLISI+RQKH
Sbjct: 60   PQSMFSNSIKASSLSKWRDYTLAEAASFVAKNGTVIVCAVSQPYLPFLNNWLISITRQKH 119

Query: 668  QEKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHIL 847
            Q+KVLVIAEDYATLYKVN+KWPGHAVL+PPAPD+QTAHKFGS+GFFNFTSRRPRHLL IL
Sbjct: 120  QDKVLVIAEDYATLYKVNEKWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPRHLLQIL 179

Query: 848  ELGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICS 1027
            ELGYNVMYNDVDMVWL DPF YL+G+HDVYFTDDMA VKP NHS +LP PGKKGRTYICS
Sbjct: 180  ELGYNVMYNDVDMVWLGDPFRYLEGNHDVYFTDDMAVVKPPNHSHDLPPPGKKGRTYICS 239

Query: 1028 CMIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAF 1207
            CMIFLRPT+GAK VMK+WIEELQ QPWSK  K+NDQPAFNWALNRT GQVD+ LLPQ AF
Sbjct: 240  CMIFLRPTDGAKLVMKEWIEELQAQPWSKAKKANDQPAFNWALNRTAGQVDLCLLPQTAF 299

Query: 1208 PSGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            P+GGLYFKN+TWVQETKG HVIIHNNYITGF+KKIKRF D+GLWLVDDH LESPLG+L
Sbjct: 300  PTGGLYFKNQTWVQETKGTHVIIHNNYITGFEKKIKRFRDYGLWLVDDHFLESPLGRL 357


>ref|XP_006842583.1| hypothetical protein AMTR_s00077p00159550 [Amborella trichopoda]
            gi|548844669|gb|ERN04258.1| hypothetical protein
            AMTR_s00077p00159550 [Amborella trichopoda]
          Length = 355

 Score =  565 bits (1456), Expect = e-158
 Identities = 271/355 (76%), Positives = 301/355 (84%), Gaps = 3/355 (0%)
 Frame = +2

Query: 326  LYQRQQQHQLLSNP--YPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDIPE 499
            L+QRQQQ    SNP  +PISPR      RSISL S T                PW+ +P+
Sbjct: 4    LHQRQQQQ---SNPTKFPISPRFYGVQPRSISLVSSTGLLILLFLMILFGILFPWLGLPK 60

Query: 500  GLFSGNKNSLLKWKDYTLVQAVSFVAKNG-TVIVCAVSQPYLPFLNNWLISISRQKHQEK 676
             LFSGN +SL KWKDYTL QAV+FV KNG TVIVCAVS+PYLPFL+NW+ISISRQKHQ+K
Sbjct: 61   SLFSGNNSSLSKWKDYTLAQAVAFVGKNGGTVIVCAVSKPYLPFLSNWVISISRQKHQDK 120

Query: 677  VLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILELG 856
            VLVIAEDYATLY+VN++WPGHAVL+PPAPD+QTAHKFGS+GFFNFTSRRP+HLL ILELG
Sbjct: 121  VLVIAEDYATLYEVNRRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPQHLLQILELG 180

Query: 857  YNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSCMI 1036
            +N +YNDVDMVW+ADPFPY +G+HDVYFTDDMAAVKP NHS +LPAPGKKGRTYICSCMI
Sbjct: 181  FNALYNDVDMVWMADPFPYFKGNHDVYFTDDMAAVKPPNHSHDLPAPGKKGRTYICSCMI 240

Query: 1037 FLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFPSG 1216
            FLRPT GAK VMKKWIEELQ QPWS +TK+NDQPAFNWALN+T GQVD+YLLPQ  FPSG
Sbjct: 241  FLRPTPGAKLVMKKWIEELQVQPWSTKTKTNDQPAFNWALNKTAGQVDLYLLPQTGFPSG 300

Query: 1217 GLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            GLYFKN+TWV ETKG HVIIHNNYITGFDKKIKRF DFGLWLVDDH+ ESPLG L
Sbjct: 301  GLYFKNQTWVDETKGKHVIIHNNYITGFDKKIKRFRDFGLWLVDDHAHESPLGGL 355


>ref|XP_006444918.1| hypothetical protein CICLE_v10020808mg [Citrus clementina]
            gi|568876280|ref|XP_006491209.1| PREDICTED:
            UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase
            MGP4-like [Citrus sinensis] gi|557547180|gb|ESR58158.1|
            hypothetical protein CICLE_v10020808mg [Citrus
            clementina]
          Length = 358

 Score =  565 bits (1455), Expect = e-158
 Identities = 268/359 (74%), Positives = 301/359 (83%), Gaps = 3/359 (0%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSIS---HTRSISLFSRTXXXXXXXXXXXXXXXXPW 484
            MS  L+QR   H  L NPYP+SPR+S++       + + +RT                PW
Sbjct: 1    MSQWLHQRPL-HNPLPNPYPLSPRNSMTFQFQRPMLLVLNRTTLLVLLSLLVVLGVILPW 59

Query: 485  IDIPEGLFSGNKNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQK 664
               P  +F    +SL KW+DYTL QA SFVAKNGT+IVCAVSQPYLPFLNNWLISISRQK
Sbjct: 60   TGTPGFMFPNATSSLAKWRDYTLSQAASFVAKNGTIIVCAVSQPYLPFLNNWLISISRQK 119

Query: 665  HQEKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHI 844
            HQ++VLVIAEDYATLYKVN +WPGHAVL+PPAPD+QTAHKFGS+GFFNFTSRRP HLLHI
Sbjct: 120  HQDQVLVIAEDYATLYKVNGRWPGHAVLVPPAPDSQTAHKFGSQGFFNFTSRRPCHLLHI 179

Query: 845  LELGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYIC 1024
            LELGYNVMYNDVDMVWL DPFPYLQG HDVYFTDDMAAVKPL+HS +LP PGKKGRTYIC
Sbjct: 180  LELGYNVMYNDVDMVWLKDPFPYLQGDHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYIC 239

Query: 1025 SCMIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAA 1204
            SCMI+LRPT+GAK VMKKWIEELQ +PWSK  K+NDQPAFNWALN+T GQVD+YLLPQ+A
Sbjct: 240  SCMIYLRPTDGAKLVMKKWIEELQAEPWSKAKKANDQPAFNWALNKTAGQVDLYLLPQSA 299

Query: 1205 FPSGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            FP+GGLYFKN+TWV+ETKG HVIIHNNYITGF+KKIKRF DFGLWLVDDH++ESPLGKL
Sbjct: 300  FPTGGLYFKNQTWVEETKGKHVIIHNNYITGFEKKIKRFRDFGLWLVDDHAVESPLGKL 358


>ref|XP_002320168.2| hypothetical protein POPTR_0014s08810g [Populus trichocarpa]
            gi|550323793|gb|EEE98483.2| hypothetical protein
            POPTR_0014s08810g [Populus trichocarpa]
          Length = 355

 Score =  564 bits (1454), Expect = e-158
 Identities = 271/357 (75%), Positives = 302/357 (84%), Gaps = 1/357 (0%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            M + L+QR   H  LS+PYP+SPR S S  R ISLFSRT                PW   
Sbjct: 1    MPTFLHQRPL-HGTLSDPYPLSPRQSSSSQRQISLFSRTGLIAILSLLLILGVILPWTGT 59

Query: 494  PEGLFSGNKN-SLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQ 670
            P  +FS  K  SL KW+ YTL QAV+FVAKN TVIVCAVSQPYLPFL+NWLISISRQKHQ
Sbjct: 60   PS-IFSATKPASLAKWQQYTLPQAVAFVAKNKTVIVCAVSQPYLPFLSNWLISISRQKHQ 118

Query: 671  EKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILE 850
            +KVLVIAEDYATLY VN++WPGHAVL+PPAPD+Q+AHKFGS+GFFNFTSRRPRHLLHILE
Sbjct: 119  DKVLVIAEDYATLYNVNERWPGHAVLVPPAPDSQSAHKFGSQGFFNFTSRRPRHLLHILE 178

Query: 851  LGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSC 1030
            LGY+VMYNDVDMVWL DPF YL+G+HDVYFTDDMAAVKPL+HS +LP PGKKGRTYICSC
Sbjct: 179  LGYDVMYNDVDMVWLGDPFRYLEGNHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 238

Query: 1031 MIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFP 1210
            MIF+RPT+GAK VMKKWIEEL+ QPWSK  K+NDQPAFNWALN+T GQVD+YLLPQAAFP
Sbjct: 239  MIFMRPTDGAKLVMKKWIEELKAQPWSKTRKANDQPAFNWALNKTAGQVDLYLLPQAAFP 298

Query: 1211 SGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            +GGLYFKN+TWVQETKG HVIIHNNYITGF+KKIKRF D+ LWLVDDH+ ESPLGKL
Sbjct: 299  TGGLYFKNQTWVQETKGKHVIIHNNYITGFEKKIKRFRDYSLWLVDDHASESPLGKL 355


>emb|CAN69309.1| hypothetical protein VITISV_003084 [Vitis vinifera]
          Length = 309

 Score =  549 bits (1414), Expect = e-153
 Identities = 256/297 (86%), Positives = 277/297 (93%)
 Frame = +2

Query: 491  IPEGLFSGNKNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQ 670
            +P+GL S  + S+ KW+DYTL QA  FVAKNGTVIVCAVSQPYLPFLNNWLISI+RQKHQ
Sbjct: 15   MPDGLLS--RASVSKWRDYTLAQAAEFVAKNGTVIVCAVSQPYLPFLNNWLISIARQKHQ 72

Query: 671  EKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILE 850
            +KVLVIAEDYATLY VNQKWPGHAVL+PPAPDAQTAHKFGS GFFNFTSRRPRHLL+ILE
Sbjct: 73   DKVLVIAEDYATLYTVNQKWPGHAVLVPPAPDAQTAHKFGSMGFFNFTSRRPRHLLNILE 132

Query: 851  LGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSC 1030
            LGYNVMYNDVDMVWLADPFPYLQG HDVYFTDDMAAVKPLNHS +LP PGKKGRTYICSC
Sbjct: 133  LGYNVMYNDVDMVWLADPFPYLQGKHDVYFTDDMAAVKPLNHSHDLPPPGKKGRTYICSC 192

Query: 1031 MIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFP 1210
            MIF+RPTNGAK VMKKWIEELQ QPWS+  KSNDQPAFNWALNRT G+VD+YLLPQAAFP
Sbjct: 193  MIFMRPTNGAKLVMKKWIEELQAQPWSRAKKSNDQPAFNWALNRTAGEVDLYLLPQAAFP 252

Query: 1211 SGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            +GGLYFKN+TWVQETKG++VIIHNNYITGF+KKIKRF D+GLWLVDDH+ ESPLGKL
Sbjct: 253  TGGLYFKNKTWVQETKGMNVIIHNNYITGFEKKIKRFQDYGLWLVDDHAQESPLGKL 309


>ref|XP_006396288.1| hypothetical protein EUTSA_v10028748mg [Eutrema salsugineum]
            gi|557097305|gb|ESQ37741.1| hypothetical protein
            EUTSA_v10028748mg [Eutrema salsugineum]
          Length = 365

 Score =  539 bits (1388), Expect = e-150
 Identities = 262/365 (71%), Positives = 292/365 (80%), Gaps = 6/365 (1%)
 Frame = +2

Query: 305  LQEMSSILYQRQQQHQLLSNPYPISPRSSISHT-RSISLFSRTXXXXXXXXXXXXXXXXP 481
            + +    L+QR  Q+   SNP+  SP S+ S + R ISL SR                 P
Sbjct: 1    MAQQQQFLHQRPIQNPF-SNPFSSSPLSNSSASNRPISLLSRNGLLLLLALLVILGVFLP 59

Query: 482  WIDIPEGLF-----SGNKNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLI 646
            W   P   F     S + +S  KW++Y+L QA  F AKNGTVIVCAVS PYLPFLNNWLI
Sbjct: 60   WAGSPLFPFPNRSSSSSSSSHSKWREYSLAQAAKFAAKNGTVIVCAVSYPYLPFLNNWLI 119

Query: 647  SISRQKHQEKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRP 826
            S+SRQKHQ+KVLVIAEDYATLYKVN+KWPGHAVLIPPA D+QTAHKFGS+GFFNFTSRRP
Sbjct: 120  SVSRQKHQDKVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTSRRP 179

Query: 827  RHLLHILELGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKK 1006
            +HLLHILELGYNVMYNDVDMVWL DPF YL+G HD YF DDM A+KPL+HS +LP PGKK
Sbjct: 180  QHLLHILELGYNVMYNDVDMVWLQDPFQYLEGSHDAYFMDDMTAIKPLDHSHDLPPPGKK 239

Query: 1007 GRTYICSCMIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMY 1186
            GRTYICSCMIFLRPTNGAK +MKKWIEELQDQPWSK  K+NDQPAFNWALN+T  QVD+Y
Sbjct: 240  GRTYICSCMIFLRPTNGAKLLMKKWIEELQDQPWSKAKKANDQPAFNWALNKTAHQVDLY 299

Query: 1187 LLPQAAFPSGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLES 1366
            LL QAAFP+GGLYFKN+TWV+ETKG HVIIHNNYI GFDKKIKRF DFGLWLVDDH+LES
Sbjct: 300  LLSQAAFPTGGLYFKNQTWVKETKGKHVIIHNNYILGFDKKIKRFRDFGLWLVDDHALES 359

Query: 1367 PLGKL 1381
            PLGKL
Sbjct: 360  PLGKL 364


>ref|XP_004299518.1| PREDICTED: uncharacterized protein LOC101304000 [Fragaria vesca
            subsp. vesca]
          Length = 354

 Score =  531 bits (1369), Expect = e-148
 Identities = 251/361 (69%), Positives = 294/361 (81%), Gaps = 5/361 (1%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPIS-----PRSSISHTRSISLFSRTXXXXXXXXXXXXXXXX 478
            MS+ L+QR   H   S+P+P+S     PR   S++  I+L                    
Sbjct: 1    MSAFLHQRPL-HNPFSDPHPLSQPPSNPRKPFSYSNPITLL------VLLCLLIIMGVFL 53

Query: 479  PWIDIPEGLFSGNKNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISR 658
            PW  + +GLFS    S LKW+DYTL QA +FVA+NGT+IVCAVS+PYLPFLNNWLISI+R
Sbjct: 54   PWTSMQQGLFSITNKSQLKWRDYTLAQAAAFVAQNGTLIVCAVSEPYLPFLNNWLISIAR 113

Query: 659  QKHQEKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLL 838
            QKHQ+KVLVIAEDY TLYKVN++WPGHAVLIPPA DA+ AHKFGS+GFFNFTSRRPRHLL
Sbjct: 114  QKHQDKVLVIAEDYGTLYKVNERWPGHAVLIPPALDAKAAHKFGSQGFFNFTSRRPRHLL 173

Query: 839  HILELGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTY 1018
            +ILELGY+VMYNDVDMVWLADPFPYL G+HDVYFTDDM  VKPLNHS +LP  GKKGRTY
Sbjct: 174  NILELGYSVMYNDVDMVWLADPFPYLVGNHDVYFTDDMTPVKPLNHSHDLPPIGKKGRTY 233

Query: 1019 ICSCMIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQ 1198
            ICSCMIFLRPT+GAK +MKKWIEE+Q +PWS+  K+NDQPAFNWAL++   Q D+YLLPQ
Sbjct: 234  ICSCMIFLRPTSGAKLIMKKWIEEMQQEPWSRAKKANDQPAFNWALDKNAAQADLYLLPQ 293

Query: 1199 AAFPSGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGK 1378
            AAFP+GGLYFKN+TWV+ETKG+HVIIHNNYI GF+KKIKRFHD+ LWLVDDH+ ESPLG+
Sbjct: 294  AAFPTGGLYFKNKTWVKETKGMHVIIHNNYILGFEKKIKRFHDYDLWLVDDHADESPLGR 353

Query: 1379 L 1381
            +
Sbjct: 354  I 354


>gb|AAZ94713.1| putative alpha 1,3-xylosyltransferase [Linum usitatissimum]
          Length = 357

 Score =  527 bits (1357), Expect = e-147
 Identities = 257/357 (71%), Positives = 290/357 (81%), Gaps = 1/357 (0%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            M++ L+QR  Q        P S   S    R ISLFSR                 PW D+
Sbjct: 1    MTTSLHQRPFQPAFSHRFLPASLHYSNLPQRPISLFSRAGLLSLLALMLILGVIVPWADM 60

Query: 494  PEGLFSGNK-NSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQ 670
            P G+FS NK +S+ +W+ YTL QA SFVAKNGT+IVCAVSQ YLPFLNNWLISISRQK Q
Sbjct: 61   PGGIFSVNKASSVNQWRHYTLPQAASFVAKNGTLIVCAVSQAYLPFLNNWLISISRQKRQ 120

Query: 671  EKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILE 850
            + VLVIAEDYATL KVN++WPGHAVLIPPA D+Q AHKFGS+GFFNFT+RRP+HLL+ILE
Sbjct: 121  DMVLVIAEDYATLDKVNERWPGHAVLIPPALDSQAAHKFGSQGFFNFTARRPQHLLNILE 180

Query: 851  LGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSC 1030
            LGY+VMYNDVDMVWL DPF YL+G HDVYFTDDMAAVKPL+HS +LP PGKKGRTYICSC
Sbjct: 181  LGYSVMYNDVDMVWLGDPFTYLRGLHDVYFTDDMAAVKPLDHSHDLPPPGKKGRTYICSC 240

Query: 1031 MIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFP 1210
            MIFLRPT+GAK VMKKWIEELQ QPWS+  K+NDQPAFNWAL +T GQVD+YLLPQ+AFP
Sbjct: 241  MIFLRPTDGAKLVMKKWIEELQAQPWSRAKKANDQPAFNWALMKTTGQVDVYLLPQSAFP 300

Query: 1211 SGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            +GGLYFKN+TWVQ TKG HVIIHNNYI GF+KKIKRF D+GLWLVDDH+ ESPLGKL
Sbjct: 301  TGGLYFKNKTWVQGTKGKHVIIHNNYIVGFEKKIKRFRDYGLWLVDDHAKESPLGKL 357


>ref|XP_004504649.1| PREDICTED: uncharacterized protein LOC101506544 [Cicer arietinum]
          Length = 355

 Score =  521 bits (1343), Expect = e-145
 Identities = 246/356 (69%), Positives = 286/356 (80%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            MS  L+QR   H   SNP+PIS  SS +  + IS+FS T                PW+ +
Sbjct: 1    MSGFLHQRSL-HNPFSNPFPISSSSSTNSKKPISIFSPTILLSLLSLIVVLGVFSPWLGM 59

Query: 494  PEGLFSGNKNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQE 673
            P+ LF  +K S+ KW  YTL QA++FVAKNGTVIVC VSQPYLPFLNNWLISI+ QK Q+
Sbjct: 60   PQNLFFTSKPSISKWGHYTLDQALTFVAKNGTVIVCIVSQPYLPFLNNWLISITMQKRQD 119

Query: 674  KVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILEL 853
             VLVIAEDY +LYKVN+ WPGHAVLIPP  D + AHKFGS+GFFNFT+RRP HLL ILEL
Sbjct: 120  MVLVIAEDYPSLYKVNELWPGHAVLIPPVLDVEAAHKFGSKGFFNFTARRPSHLLKILEL 179

Query: 854  GYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSCM 1033
            GY+VMYNDVDMVWLADPFPYLQG+HDVYFTDDM A+KPL+HS +LP PG KGR YICSCM
Sbjct: 180  GYSVMYNDVDMVWLADPFPYLQGNHDVYFTDDMTAIKPLDHSHDLPPPGNKGRPYICSCM 239

Query: 1034 IFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFPS 1213
            IFLRPT+GAK ++KKW+EELQ +PWS+  KSNDQPAFNWAL + V +VD+YLLPQAAFP+
Sbjct: 240  IFLRPTDGAKLILKKWLEELQLEPWSRTKKSNDQPAFNWALMKNVKEVDLYLLPQAAFPT 299

Query: 1214 GGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            GGLYFKN+TWV+ETKG HVIIHNNYI GF+KKIKRF D+G WLVD+H+ ESPLG L
Sbjct: 300  GGLYFKNKTWVKETKGKHVIIHNNYIVGFEKKIKRFRDYGFWLVDEHANESPLGGL 355


>ref|NP_849279.1| rhamnogalacturonan II specific xylosyltransferase [Arabidopsis
            thaliana] gi|75181726|sp|Q9M146.1|MGP4_ARATH RecName:
            Full=UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase
            MGP4; AltName: Full=Protein MALE GAMETOPHYTE DEFECTIVE 4;
            AltName: Full=Rhamnogalacturonan xylosyltransferase MGP4
            gi|7267619|emb|CAB80931.1| hypothetical protein
            [Arabidopsis thaliana] gi|193885155|gb|ACF28391.1|
            At4g01220 [Arabidopsis thaliana]
            gi|332656595|gb|AEE81995.1| rhamnogalacturonan II
            specific xylosyltransferase [Arabidopsis thaliana]
            gi|591401972|gb|AHL38713.1| glycosyltransferase, partial
            [Arabidopsis thaliana]
          Length = 360

 Score =  521 bits (1342), Expect = e-145
 Identities = 257/356 (72%), Positives = 286/356 (80%), Gaps = 4/356 (1%)
 Frame = +2

Query: 326  LYQRQQQHQLLSNPYPISPRSSIS-HTRSISLFSRTXXXXXXXXXXXXXXXXPWIDIPEG 502
            L+QR  Q+   +NP+  SP S+ S   R ISL SR                 PW   P  
Sbjct: 7    LHQRPIQNPF-TNPFSSSPLSTSSISNRPISLLSRNGLLLLLALLVILGVFLPWAGSP-- 63

Query: 503  LF-SGNK--NSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQE 673
            LF S NK   S  KW+DY+L QAV FVAKNGTVIVCAVS PYLPFLNNWLIS+SRQKHQ+
Sbjct: 64   LFPSPNKLSPSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQD 123

Query: 674  KVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILEL 853
            +VLVIAEDYATLYKVN+KWPGHAVLIPPA D+QTAHKFGS+GFFNFT+RRP+HLL ILEL
Sbjct: 124  QVLVIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILEL 183

Query: 854  GYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSCM 1033
            GYNVMYNDVDMVWL DPF YL+G HD YF DDM A+KPL+HS +LP PGKKGRTYICSCM
Sbjct: 184  GYNVMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYICSCM 243

Query: 1034 IFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFPS 1213
            IFLRPTNGAK +MKKWIEEL+ QPWS+  K+NDQP FNWALN+T  QVDMYLL QAAFP+
Sbjct: 244  IFLRPTNGAKLLMKKWIEELETQPWSRAKKANDQPGFNWALNKTANQVDMYLLSQAAFPT 303

Query: 1214 GGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            GGLYFKN+TWV+ETKG H IIHNNYI GF+KKIKRF DF LWLVDDH+ ESPLGKL
Sbjct: 304  GGLYFKNKTWVKETKGKHAIIHNNYIVGFEKKIKRFRDFNLWLVDDHASESPLGKL 359


>ref|XP_002872902.1| hypothetical protein ARALYDRAFT_912111 [Arabidopsis lyrata subsp.
            lyrata] gi|297318739|gb|EFH49161.1| hypothetical protein
            ARALYDRAFT_912111 [Arabidopsis lyrata subsp. lyrata]
          Length = 356

 Score =  520 bits (1338), Expect = e-144
 Identities = 251/353 (71%), Positives = 283/353 (80%), Gaps = 3/353 (0%)
 Frame = +2

Query: 332  QRQQQHQL-LSNPYPISPRSSIS-HTRSISLFSRTXXXXXXXXXXXXXXXXPWIDIPEGL 505
            Q++  HQ  + NP+  SP S+ S   R ISL SR                 PW   P   
Sbjct: 3    QQKFLHQRPIQNPFSSSPLSNSSISNRPISLLSRNGLLILLALLVILGVFLPWAGSPLFP 62

Query: 506  FSGN-KNSLLKWKDYTLVQAVSFVAKNGTVIVCAVSQPYLPFLNNWLISISRQKHQEKVL 682
            F     +S  KW+DY+L QAV FVAKNGTVIVCAVS PYLPFLNNWLIS+SRQKHQ++VL
Sbjct: 63   FPNRLSSSQSKWRDYSLPQAVKFVAKNGTVIVCAVSYPYLPFLNNWLISVSRQKHQDQVL 122

Query: 683  VIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILELGYN 862
            VIAEDYATLYKVN+KWPGHAVLIPPA D+QTAHKFGS+GFFNFT+RRP+HLL ILELGYN
Sbjct: 123  VIAEDYATLYKVNEKWPGHAVLIPPALDSQTAHKFGSQGFFNFTARRPQHLLEILELGYN 182

Query: 863  VMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSCMIFL 1042
            VMYNDVDMVWL DPF YL+G HD YF DDM A+KPL+HS +LP PGKKGRTYICSCMIFL
Sbjct: 183  VMYNDVDMVWLQDPFQYLEGKHDAYFMDDMTAIKPLDHSHDLPPPGKKGRTYICSCMIFL 242

Query: 1043 RPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFPSGGL 1222
            RPTNGAK +MKKWIEELQ QPWS+  K+NDQP FNWALN+T  QVD+Y+L QAAFP+GGL
Sbjct: 243  RPTNGAKLLMKKWIEELQTQPWSRAKKANDQPGFNWALNKTAHQVDLYMLSQAAFPTGGL 302

Query: 1223 YFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            YFKN+TWV+ETKG HVIIHNNYI GF+KKIKRF DF LWLVDDH+ ESPLGK+
Sbjct: 303  YFKNKTWVKETKGKHVIIHNNYIVGFEKKIKRFRDFNLWLVDDHASESPLGKV 355


>ref|XP_003531111.1| PREDICTED: UDP-D-xylose:L-fucose alpha-1,3-D-xylosyltransferase
            MGP4-like [Glycine max]
          Length = 356

 Score =  514 bits (1324), Expect = e-143
 Identities = 247/357 (69%), Positives = 280/357 (78%), Gaps = 1/357 (0%)
 Frame = +2

Query: 314  MSSILYQRQQQHQLLSNPYPISPRSSISHTRSISLFSRTXXXXXXXXXXXXXXXXPWIDI 493
            MSS L+QR       SNP+P S  SS +  +S+S+   T                PW+  
Sbjct: 1    MSSFLHQRSSLQNPFSNPFPASTPSSSNSKKSLSIMGPTTLLALISLIVILGVFCPWVGF 60

Query: 494  PEGLFSGNKNSLLKWKDYTLVQAVSFVAKNGT-VIVCAVSQPYLPFLNNWLISISRQKHQ 670
            P+G F        KW  YTL QA+SFVAKNG+ VIVC VSQPYLPFLNNWLISIS QK Q
Sbjct: 61   PQG-FPFTPTPTSKWAHYTLEQALSFVAKNGSSVIVCIVSQPYLPFLNNWLISISMQKRQ 119

Query: 671  EKVLVIAEDYATLYKVNQKWPGHAVLIPPAPDAQTAHKFGSEGFFNFTSRRPRHLLHILE 850
            + VLVIAEDYA+L +VN  WPGHAVLIPP  DA+ AHKFGS+GFFNFT+RRP HLL ILE
Sbjct: 120  DMVLVIAEDYASLDRVNLLWPGHAVLIPPVLDAEAAHKFGSQGFFNFTARRPSHLLKILE 179

Query: 851  LGYNVMYNDVDMVWLADPFPYLQGHHDVYFTDDMAAVKPLNHSSNLPAPGKKGRTYICSC 1030
            LGY+VMYNDVDMVWLADPFPYLQG+HDVYFTDDM A+KPLNHS +LP PGKKGR YICSC
Sbjct: 180  LGYSVMYNDVDMVWLADPFPYLQGNHDVYFTDDMTAIKPLNHSHDLPPPGKKGRPYICSC 239

Query: 1031 MIFLRPTNGAKYVMKKWIEELQDQPWSKRTKSNDQPAFNWALNRTVGQVDMYLLPQAAFP 1210
            MIFLRPTNGAK +++KWIEELQ QPWSK  KSNDQPAFNWAL +   +VD+YLLPQAAFP
Sbjct: 240  MIFLRPTNGAKLILRKWIEELQIQPWSKTVKSNDQPAFNWALMKNAKEVDLYLLPQAAFP 299

Query: 1211 SGGLYFKNETWVQETKGLHVIIHNNYITGFDKKIKRFHDFGLWLVDDHSLESPLGKL 1381
            +GGLYFKN+ WV+ETKG+HVIIHNNYI GF+KKIKRF D+GLWLVDDH+ ESPLG L
Sbjct: 300  TGGLYFKNKAWVKETKGMHVIIHNNYIVGFEKKIKRFRDYGLWLVDDHAHESPLGGL 356


Top