BLASTX nr result

ID: Mentha26_contig00017396 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00017396
         (1842 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38924.1| hypothetical protein MIMGU_mgv1a000673mg [Mimulus...   466   e-128
ref|XP_006339650.1| PREDICTED: uncharacterized protein LOC102591...   417   e-113
ref|XP_004229962.1| PREDICTED: uncharacterized protein LOC101246...   416   e-113
ref|XP_007051668.1| Glycosyl transferase family 1 protein isofor...   414   e-113
ref|XP_007051667.1| Glycosyl transferase family 1 protein isofor...   414   e-113
ref|XP_002301386.2| glycosyltransferase family protein [Populus ...   408   e-111
gb|EYU32192.1| hypothetical protein MIMGU_mgv1a000786mg [Mimulus...   407   e-111
gb|EXB52710.1| hypothetical protein L484_022487 [Morus notabilis]     406   e-110
ref|XP_007220285.1| hypothetical protein PRUPE_ppa000692mg [Prun...   405   e-110
ref|XP_002320170.1| glycosyltransferase family protein [Populus ...   402   e-109
gb|EPS70431.1| hypothetical protein M569_04330 [Genlisea aurea]       398   e-108
ref|XP_002276292.2| PREDICTED: uncharacterized protein LOC100262...   394   e-107
emb|CBI40456.3| unnamed protein product [Vitis vinifera]              394   e-107
emb|CAN69310.1| hypothetical protein VITISV_003086 [Vitis vinifera]   384   e-104
ref|XP_004159777.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   383   e-103
ref|XP_004138457.1| PREDICTED: uncharacterized protein LOC101212...   383   e-103
ref|XP_002872903.1| glycosyltransferase family protein 1 [Arabid...   382   e-103
ref|XP_004306713.1| PREDICTED: uncharacterized protein LOC101302...   381   e-103
ref|XP_006444916.1| hypothetical protein CICLE_v10018649mg [Citr...   379   e-102
ref|XP_002511940.1| transferase, transferring glycosyl groups, p...   376   e-101

>gb|EYU38924.1| hypothetical protein MIMGU_mgv1a000673mg [Mimulus guttatus]
          Length = 1023

 Score =  466 bits (1198), Expect = e-128
 Identities = 244/420 (58%), Positives = 298/420 (70%), Gaps = 23/420 (5%)
 Frame = +1

Query: 652  FWGQRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGR---- 819
            F GQR RSRF RLV FK++DY+QLI                 LPG   ++  K G     
Sbjct: 22   FSGQRSRSRFTRLVFFKRVDYLQLICGVATLFFFVFLFQVFFLPGEDGNNNNKSGNNKIN 81

Query: 820  --IDERGSR---ELSFLKDLDFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKK 984
              +   G     EL FLK+LDFGEDLKFEPL+I  KFRK+      G  S+ V RFGY+K
Sbjct: 82   DLVGGNGGAVFDELLFLKELDFGEDLKFEPLRISEKFRKN------GDLSKMVARFGYRK 135

Query: 985  PKLALVFADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVIS-A 1161
            PK+ALVFADL VD HQILMVTVATAL EIGYEIEVFS E+GP    WRE+G+P+ VI+ +
Sbjct: 136  PKIALVFADLVVDHHQILMVTVATALLEIGYEIEVFSTENGPAQATWREIGVPIRVIATS 195

Query: 1162 DENMKFSVDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASN 1341
            D+N+  SVDWLNY GI+VNSL +VG L  LMQEPFK++PLVW IHE TL++RLR YV+S 
Sbjct: 196  DDNINCSVDWLNYDGILVNSLKSVGFLSCLMQEPFKNIPLVWMIHEHTLASRLRTYVSSG 255

Query: 1342 QTEMVNSWRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASFKNG 1521
            Q+E+V++W++ F RATVVV+PNY LP+ YS CDPGNYF+IPGSP+  W+A K +A   N 
Sbjct: 256  QSELVDTWKRFFSRATVVVFPNYILPIEYSICDPGNYFVIPGSPEEAWKADKQLALPNNN 315

Query: 1522 ------------FSIAIVGSQLLYRGLWLEHAFILQSLYPVFTDFTNSTSHLKIFI-LAG 1662
                        F IA+VGSQL Y+G+WLEHAF+LQ+LYP+ T F +S+S L+I I L G
Sbjct: 316  NLRSELDFRQDDFVIAVVGSQLSYKGVWLEHAFVLQALYPILTHFEDSSSRLRIIIVLGG 375

Query: 1663 DSTSNYSRAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            DSTSNYS  +ETIAL L YPNETVK V    N + V++ ADLVIYGSFL+EH+FPDILLK
Sbjct: 376  DSTSNYSTTLETIALKLGYPNETVKRVSADRNTNTVINTADLVIYGSFLDEHSFPDILLK 435


>ref|XP_006339650.1| PREDICTED: uncharacterized protein LOC102591393 [Solanum tuberosum]
          Length = 1038

 Score =  417 bits (1071), Expect = e-113
 Identities = 220/414 (53%), Positives = 284/414 (68%), Gaps = 18/414 (4%)
 Frame = +1

Query: 655  WGQRY-RSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDER 831
            +GQR  RSRFAR +  KKI+Y+Q I                 LPGSV +      +  E 
Sbjct: 28   FGQRQVRSRFARFLFVKKINYLQWICTVAVFFFFVVLFQML-LPGSVMEKSGNLTQDSEV 86

Query: 832  GSRELSFLKDL---DFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKKPKLALV 1002
            G  +L+ LK+L   DFGED+KFEPLK++AKF  +A ++N  V+SR V RFGY+KPKLALV
Sbjct: 87   GYGDLALLKELGGLDFGEDIKFEPLKLLAKFHDEAVEANGTVASRTVVRFGYRKPKLALV 146

Query: 1003 FADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFS 1182
            FA+L VD +QI+MV VA AL+EIGYEIEV SLEDGPV  +W++VG+P+ +++ D + K S
Sbjct: 147  FANLLVDPYQIMMVNVAAALREIGYEIEVLSLEDGPVRSIWKDVGVPVIIMNTDGHTKIS 206

Query: 1183 VDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNS 1362
            +DWLNY G++VNSL AV +L  +MQEPFK+VPLVWTI+E TL++RL+QY++S Q + V++
Sbjct: 207  LDWLNYDGLLVNSLEAVNVLSCVMQEPFKNVPLVWTINELTLASRLKQYISSGQNDFVDN 266

Query: 1363 WRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASFKNG------- 1521
            WRKVF RA VVV+PNY LP+ YS CD GNYF+IPGSPK  WE    MA   +        
Sbjct: 267  WRKVFSRANVVVFPNYILPIGYSVCDAGNYFVIPGSPKEAWEVDSFMAVSNDNLRAKMDY 326

Query: 1522 ----FSIAIVGSQLLYRGLWLEHAFILQSLYPVFTDFT---NSTSHLKIFILAGDSTSNY 1680
                F I +VGS LLY+GLWLE A +LQ+L PVF + T   NS SH KI +L   S +NY
Sbjct: 327  APEDFVIVVVGSHLLYKGLWLEQALVLQALLPVFPELTNDGNSNSHFKIVVLTEGSNTNY 386

Query: 1681 SRAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            S AVE IA +L YP   VK +   E+ +  LS+ADLVIY SF EE +FP+ L+K
Sbjct: 387  SVAVEAIARNLRYPEGMVKHIAPAEDTERTLSVADLVIYASFREEQSFPNTLVK 440


>ref|XP_004229962.1| PREDICTED: uncharacterized protein LOC101246380 [Solanum
            lycopersicum]
          Length = 1038

 Score =  416 bits (1070), Expect = e-113
 Identities = 221/414 (53%), Positives = 286/414 (69%), Gaps = 18/414 (4%)
 Frame = +1

Query: 655  WGQRY-RSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDER 831
            +GQR  RSRFAR +  KKI+Y+Q I                 LPGSV +         E 
Sbjct: 28   FGQRQVRSRFARFLFVKKINYLQWICTVAVFFFFVVLFQML-LPGSVMEKSGNLTLDSEV 86

Query: 832  GSRELSFLKDL---DFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKKPKLALV 1002
            G  +L+ LK+L   DFGED+KFEPLK++AKFR++A ++N  V+SR V RFGY+KPKLALV
Sbjct: 87   GYGDLALLKELGGLDFGEDIKFEPLKLLAKFREEAVEANGTVASRIVVRFGYRKPKLALV 146

Query: 1003 FADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFS 1182
            F++LSVD +QI+MV VA AL+EIGYEIEV SLEDGPV  +W+++G+P+ +++ D + K S
Sbjct: 147  FSNLSVDPYQIMMVNVAAALREIGYEIEVLSLEDGPVRSIWKDIGVPVIIMNTDGHTKIS 206

Query: 1183 VDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNS 1362
            +DWLNY G++VNSL AV +L  +MQEPFK+VPLVWTI+E TL++RL+QY++S Q + V++
Sbjct: 207  LDWLNYDGLLVNSLEAVNVLSCVMQEPFKNVPLVWTINELTLASRLKQYMSSGQNDFVDN 266

Query: 1363 WRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASFKN-------- 1518
            WRKVF RA VVV+PNY LP+ YS CD GNYF+IPGSPK  WE    MA   +        
Sbjct: 267  WRKVFSRANVVVFPNYILPIGYSVCDAGNYFVIPGSPKEAWEVDTFMAVSNDDLRAKMDY 326

Query: 1519 ---GFSIAIVGSQLLYRGLWLEHAFILQSLYPVFTDFT---NSTSHLKIFILAGDSTSNY 1680
                F I +VGSQLLY+GLWLE A +LQ+L PVF +     NS SH KI +L   S +NY
Sbjct: 327  AAEDFVIVVVGSQLLYKGLWLEQALVLQALLPVFPELMNDGNSNSHFKIVVLTEGSNTNY 386

Query: 1681 SRAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            S AVE IA +L YP   VK +   E+ +  LS+ADLVIY SF EE +FP+ LLK
Sbjct: 387  SVAVEAIARNLRYPEGMVKHIAPAEDTERTLSVADLVIYASFREEPSFPNTLLK 440


>ref|XP_007051668.1| Glycosyl transferase family 1 protein isoform 2 [Theobroma cacao]
            gi|508703929|gb|EOX95825.1| Glycosyl transferase family 1
            protein isoform 2 [Theobroma cacao]
          Length = 686

 Score =  414 bits (1065), Expect = e-113
 Identities = 223/414 (53%), Positives = 292/414 (70%), Gaps = 21/414 (5%)
 Frame = +1

Query: 664  RYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGR-KFGRIDERGSR 840
            R RSRF+R +LFKK+DY+Q I                 LPGSV D  +  F    +    
Sbjct: 25   RPRSRFSRFLLFKKLDYLQWICTVVVFLFFVVFFQMY-LPGSVMDKSQDSFLEDKDLVYG 83

Query: 841  ELSFLKD---LDFGEDLKFEPLKIVAKFRKDAEDSNVGVSS---RNVTRFGYKKPKLALV 1002
            EL +LK+   LDFGED++ EP K++ KF+++ +  N+  SS   R+  RF Y+KP+LALV
Sbjct: 84   ELRYLKEMGGLDFGEDIRLEPRKLLEKFQRENKVLNLESSSGFNRSQHRFQYRKPQLALV 143

Query: 1003 FADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFS 1182
            FADL VD  Q+LMVT+ATAL+EIGY I+V+SLEDGPV  VW+ +G+P++V+  + N +  
Sbjct: 144  FADLLVDPQQLLMVTIATALREIGYAIQVYSLEDGPVHNVWQSIGVPVSVLQVNSN-EIG 202

Query: 1183 VDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNS 1362
            VDWLNY GI+V+SL A G+  S MQEPFKS+PL+WTIHE+TL+ R RQ+ +S Q E+VN+
Sbjct: 203  VDWLNYDGILVSSLEAKGVFSSFMQEPFKSIPLIWTIHERTLAVRSRQFTSSGQIELVNN 262

Query: 1363 WRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASFKN------GF 1524
            W+KVF RATVVV+PNY LP+ YSA D GNY++IPGSP   W+ + +M  +K+      G+
Sbjct: 263  WKKVFSRATVVVFPNYALPMIYSAFDTGNYYVIPGSPAEAWKGENAMNLYKDNQRVKMGY 322

Query: 1525 S-----IAIVGSQLLYRGLWLEHAFILQSLYPVFTDF---TNSTSHLKIFILAGDSTSNY 1680
                  IAIVGSQ +YRGLWLEHA +LQ+L P+FTDF   TNS SH KI IL+GDSTSNY
Sbjct: 323  GPDEVLIAIVGSQFMYRGLWLEHAIVLQALLPLFTDFSSDTNSNSHPKIIILSGDSTSNY 382

Query: 1681 SRAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            S AVE I  +L YP+  VK V +  + D+VLSM D+VIYGSFLEE +FP+IL+K
Sbjct: 383  SMAVERITHNLKYPSGVVKHVAVDGDVDSVLSMTDIVIYGSFLEEPSFPEILIK 436


>ref|XP_007051667.1| Glycosyl transferase family 1 protein isoform 1 [Theobroma cacao]
            gi|508703928|gb|EOX95824.1| Glycosyl transferase family 1
            protein isoform 1 [Theobroma cacao]
          Length = 1026

 Score =  414 bits (1065), Expect = e-113
 Identities = 223/414 (53%), Positives = 292/414 (70%), Gaps = 21/414 (5%)
 Frame = +1

Query: 664  RYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGR-KFGRIDERGSR 840
            R RSRF+R +LFKK+DY+Q I                 LPGSV D  +  F    +    
Sbjct: 25   RPRSRFSRFLLFKKLDYLQWICTVVVFLFFVVFFQMY-LPGSVMDKSQDSFLEDKDLVYG 83

Query: 841  ELSFLKD---LDFGEDLKFEPLKIVAKFRKDAEDSNVGVSS---RNVTRFGYKKPKLALV 1002
            EL +LK+   LDFGED++ EP K++ KF+++ +  N+  SS   R+  RF Y+KP+LALV
Sbjct: 84   ELRYLKEMGGLDFGEDIRLEPRKLLEKFQRENKVLNLESSSGFNRSQHRFQYRKPQLALV 143

Query: 1003 FADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFS 1182
            FADL VD  Q+LMVT+ATAL+EIGY I+V+SLEDGPV  VW+ +G+P++V+  + N +  
Sbjct: 144  FADLLVDPQQLLMVTIATALREIGYAIQVYSLEDGPVHNVWQSIGVPVSVLQVNSN-EIG 202

Query: 1183 VDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNS 1362
            VDWLNY GI+V+SL A G+  S MQEPFKS+PL+WTIHE+TL+ R RQ+ +S Q E+VN+
Sbjct: 203  VDWLNYDGILVSSLEAKGVFSSFMQEPFKSIPLIWTIHERTLAVRSRQFTSSGQIELVNN 262

Query: 1363 WRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASFKN------GF 1524
            W+KVF RATVVV+PNY LP+ YSA D GNY++IPGSP   W+ + +M  +K+      G+
Sbjct: 263  WKKVFSRATVVVFPNYALPMIYSAFDTGNYYVIPGSPAEAWKGENAMNLYKDNQRVKMGY 322

Query: 1525 S-----IAIVGSQLLYRGLWLEHAFILQSLYPVFTDF---TNSTSHLKIFILAGDSTSNY 1680
                  IAIVGSQ +YRGLWLEHA +LQ+L P+FTDF   TNS SH KI IL+GDSTSNY
Sbjct: 323  GPDEVLIAIVGSQFMYRGLWLEHAIVLQALLPLFTDFSSDTNSNSHPKIIILSGDSTSNY 382

Query: 1681 SRAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            S AVE I  +L YP+  VK V +  + D+VLSM D+VIYGSFLEE +FP+IL+K
Sbjct: 383  SMAVERITHNLKYPSGVVKHVAVDGDVDSVLSMTDIVIYGSFLEEPSFPEILIK 436


>ref|XP_002301386.2| glycosyltransferase family protein [Populus trichocarpa]
            gi|550345174|gb|EEE80659.2| glycosyltransferase family
            protein [Populus trichocarpa]
          Length = 984

 Score =  408 bits (1049), Expect = e-111
 Identities = 216/413 (52%), Positives = 283/413 (68%), Gaps = 20/413 (4%)
 Frame = +1

Query: 664  RYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSV---SDDGRKFGRIDERG 834
            R RSR +R +LFKK+DY+Q I                 LPGSV   S+ G    R  E  
Sbjct: 35   RPRSRLSRFLLFKKLDYIQWICTVAVFLFFVVLFQMF-LPGSVVEKSELGSSPWRGMELV 93

Query: 835  SRELSFLKD---LDFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKKPKLALVF 1005
            +++L +LK+   LDFGED+KFEP KI+ KFRK+  + N+  ++  ++RF Y+KP+LALVF
Sbjct: 94   NKDLLYLKEIGGLDFGEDIKFEPSKILQKFRKENREMNMPFTNGTLSRFPYRKPQLALVF 153

Query: 1006 ADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFSV 1185
            ADL VD  Q+LMVTVATALQEIGY I V++L DGPV  +W+ +G P+ +I     ++ +V
Sbjct: 154  ADLLVDPQQLLMVTVATALQEIGYTIHVYTLRDGPVQNIWKSMGYPVTIIQMSHKLEIAV 213

Query: 1186 DWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNSW 1365
            DWLNY GI+VNSL    ++   MQEPFKSVPL+WTIHE+ L+ R RQY +S Q E++N W
Sbjct: 214  DWLNYDGILVNSLETRSVISCFMQEPFKSVPLIWTIHERALAIRSRQYTSSWQIELLNDW 273

Query: 1366 RKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASF------KNGFS 1527
            RK F RATVVV+PN+ LP+ YSA D GNY++IPGSP  VWEA  +MA +      K G+ 
Sbjct: 274  RKAFNRATVVVFPNHVLPMMYSAFDAGNYYVIPGSPAEVWEADTTMALYNDDIRVKMGYE 333

Query: 1528 -----IAIVGSQLLYRGLWLEHAFILQSLYPVFTDF---TNSTSHLKIFILAGDSTSNYS 1683
                 IA+VGSQ LYRGLWLEHA +L++L P+  DF   +NS SHLKI +L+GDST NYS
Sbjct: 334  PTDIVIAVVGSQFLYRGLWLEHALVLKALLPLLQDFPLDSNSISHLKIIVLSGDSTGNYS 393

Query: 1684 RAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
             AVE IA++L+YP  TVK   +  +  + LS  DLVIYGSFLEE +FP+ L++
Sbjct: 394  AAVEAIAVNLSYPRGTVKHFAVDGDVSSALSAVDLVIYGSFLEEQSFPEFLVR 446


>gb|EYU32192.1| hypothetical protein MIMGU_mgv1a000786mg [Mimulus guttatus]
          Length = 986

 Score =  407 bits (1047), Expect = e-111
 Identities = 230/440 (52%), Positives = 289/440 (65%), Gaps = 6/440 (1%)
 Frame = +1

Query: 541  MGFQESRQLLKRDHGFQXXXXXXXXXXXXXXXXXXXXFWGQRYRSRFARLVLFKKIDYVQ 720
            MG  E+R  LKRDH F                           RSRFARL+LF KIDY+Q
Sbjct: 1    MGSLENRPPLKRDHLFHSSSC-------------------SSVRSRFARLLLFNKIDYLQ 41

Query: 721  LISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDER-----GSRELSFLKDLDFGEDLK 885
            LI                 LPGS +++       D+       + +LSFLK+L FGEDLK
Sbjct: 42   LICAVSVSFFFVFLFQVFFLPGSAANEEEM--NYDKAHYLFTNNTDLSFLKELGFGEDLK 99

Query: 886  FEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKKPKLALVFADLSVDSHQILMVTVATALQ 1065
            F+PLK++ KFR  A+  N   +S  V      KPKLALVFAD+ VDSHQILMVT+ATAL+
Sbjct: 100  FQPLKLLDKFRNGAKYFNGSFASTGVIL----KPKLALVFADMWVDSHQILMVTIATALR 155

Query: 1066 EIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFSVDWLNYHGIIVNSLGAVGLLP 1245
            E GYE EVFSLE+GPV  VW+EVG  + VI+ADEN  F +DWLNY GI+VNSL A G+L 
Sbjct: 156  ETGYEFEVFSLEEGPVYAVWKEVGFRVRVINADENTNFGIDWLNYDGILVNSLKAAGVLS 215

Query: 1246 SLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNSWRKVFQRATVVVYPNYYLPVA 1425
            SLMQEPFK VP++WTIHEQ L+ RL     S QT++V++WRK+F RAT VV+PNY LP+A
Sbjct: 216  SLMQEPFKHVPVIWTIHEQELALRL-----SGQTQLVDNWRKLFGRATAVVFPNYILPMA 270

Query: 1426 YSACDPGNYFIIPGSP-KAVWEAKKSMASFKNGFSIAIVGSQLLYRGLWLEHAFILQSLY 1602
            YSACDPGNYF+IPG P +A         + KN F +A+VGSQLLY+GL LE+A +L++L 
Sbjct: 271  YSACDPGNYFVIPGPPAEACNTVHNGNRNRKNNFVVAVVGSQLLYKGLLLENALVLKALL 330

Query: 1603 PVFTDFTNSTSHLKIFILAGDSTSNYSRAVETIALSLNYPNETVKLVPIYENADAVLSMA 1782
            P+    +N+ S LKI +L G+STS +  AVETIA +LNYPN TV  + +  N D V+  A
Sbjct: 331  PLLEKGSNN-SRLKILVLIGNSTSKFGTAVETIAQNLNYPNGTVNHIGVDGNTDNVVRDA 389

Query: 1783 DLVIYGSFLEEHAFPDILLK 1842
            D++IYGSFLEE+ FP+IL K
Sbjct: 390  DILIYGSFLEENIFPEILSK 409


>gb|EXB52710.1| hypothetical protein L484_022487 [Morus notabilis]
          Length = 1040

 Score =  406 bits (1043), Expect = e-110
 Identities = 214/412 (51%), Positives = 285/412 (69%), Gaps = 18/412 (4%)
 Frame = +1

Query: 661  QRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDERGSR 840
            QR RSRF+R  LFKK+DY+Q I                 LPGSV +   K  R +E  S 
Sbjct: 34   QRQRSRFSRFFLFKKLDYLQWICTVAVFLFFVVLFQMF-LPGSVVEKSIKTHRDEEFSSG 92

Query: 841  ELSFLKD---LDFGEDLKFEPLKIVAKFRKDAEDSNVGVS-SRNVTRFGYKKPKLALVFA 1008
            +L FLK+   LDFGED++FEP K++ KFR++ ++ N+  + +R+  R+ +KKP+LALVFA
Sbjct: 93   DLFFLKEYGILDFGEDIRFEPSKVLEKFRRENKEVNLSHAFNRSRLRYPHKKPQLALVFA 152

Query: 1009 DLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFSVD 1188
            DL VDS Q+LMVTVA ALQEIGYEI+V+SLE GPV G+WR +G+P+++I A +    +VD
Sbjct: 153  DLLVDSQQLLMVTVAAALQEIGYEIQVYSLEGGPVHGIWRNLGVPVSIIQACDPADVTVD 212

Query: 1189 WLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNSWR 1368
            WL Y GI+VNS  A  +    +QEPFKS+PLVWTIH++ L+ R R Y ++ Q E++N W+
Sbjct: 213  WLIYDGILVNSFEAKDMFSCFVQEPFKSLPLVWTIHDRALATRSRNYTSNKQIELLNDWK 272

Query: 1369 KVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVW------EAKKSMASFKNGFS- 1527
            + F R+TVVV+PNY LP+ YS  D GN+F+IPGSP   W      E++K     K G+  
Sbjct: 273  RAFNRSTVVVFPNYVLPMIYSTFDSGNFFVIPGSPAEAWKIETLMESEKDYLRAKMGYGH 332

Query: 1528 ----IAIVGSQLLYRGLWLEHAFILQSLYPVFTDFT---NSTSHLKIFILAGDSTSNYSR 1686
                I IVGS+LLYRGLWLEH+ +LQ+L+P+  DF+   NS SHLKI +L+GD TSNYS 
Sbjct: 333  EDIVITIVGSELLYRGLWLEHSIVLQALFPLLEDFSSDENSFSHLKIIVLSGDPTSNYSS 392

Query: 1687 AVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            AVE IAL+L YPN  V  VP+   AD VL+ +D+VIYGS +EE +FPDIL+K
Sbjct: 393  AVEAIALNLKYPNGIVNHVPMDAEADNVLTASDVVIYGSSVEEQSFPDILIK 444


>ref|XP_007220285.1| hypothetical protein PRUPE_ppa000692mg [Prunus persica]
            gi|462416747|gb|EMJ21484.1| hypothetical protein
            PRUPE_ppa000692mg [Prunus persica]
          Length = 1034

 Score =  405 bits (1042), Expect = e-110
 Identities = 215/413 (52%), Positives = 286/413 (69%), Gaps = 19/413 (4%)
 Frame = +1

Query: 661  QRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDERGSR 840
            QR RS+F+R +L KK+DY+Q I                 LPGSV +  R   +  E  S 
Sbjct: 31   QRPRSKFSRFLLIKKLDYLQWICTVAVFLFFVVLFQMF-LPGSVVEKSRVLMKNVELNSE 89

Query: 841  ELSFLKDL---DFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTR--FGYKKPKLALVF 1005
            +L FLK+L   DFGED++FEP K++ KF+K+A ++++  S+ N TR  FGY+KP+LALVF
Sbjct: 90   DLRFLKELGLLDFGEDIRFEPSKLLEKFQKEAREASL-TSAMNRTRQHFGYRKPQLALVF 148

Query: 1006 ADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFSV 1185
            ADLSV S Q+LMVTVA ALQEIGY   V+SLEDGPV  VWR +G+P+ +I   +  + ++
Sbjct: 149  ADLSVASQQLLMVTVAAALQEIGYAFSVYSLEDGPVHDVWRSLGVPVTIIQTYDQSELNI 208

Query: 1186 DWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNSW 1365
            DWLNY GI+VNSL A G+    +QEPFKS+P++WTIHEQ L+ R R+Y ++ Q E+ N W
Sbjct: 209  DWLNYDGILVNSLEAKGIFSCFVQEPFKSLPILWTIHEQALATRSRKYSSNRQIELFNDW 268

Query: 1366 RKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASFKN------GFS 1527
            +++F R+TVVV+PNY+LP+AYS  D GN+F+IPGSP    +A   M   KN      G+ 
Sbjct: 269  KRLFSRSTVVVFPNYFLPMAYSVFDAGNFFVIPGSPAEACKADSIMVLDKNHLLAKMGYG 328

Query: 1528 -----IAIVGSQLLYRGLWLEHAFILQSLYPVFTDF---TNSTSHLKIFILAGDSTSNYS 1683
                 I IVGSQ LYRGLWLEH+ +L+++ P+  DF    NS SHLKI +L+GDSTSNYS
Sbjct: 329  SEDVVITIVGSQFLYRGLWLEHSIVLRAVLPLLEDFPLDNNSYSHLKIIVLSGDSTSNYS 388

Query: 1684 RAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
              VE IA +L YP+  VK V +   AD+VLS++D+VIYGSFLEE +FPDIL+K
Sbjct: 389  SVVEAIAYNLKYPSGIVKHVAVDMAADSVLSISDVVIYGSFLEEQSFPDILIK 441


>ref|XP_002320170.1| glycosyltransferase family protein [Populus trichocarpa]
            gi|222860943|gb|EEE98485.1| glycosyltransferase family
            protein [Populus trichocarpa]
          Length = 990

 Score =  402 bits (1034), Expect = e-109
 Identities = 216/413 (52%), Positives = 280/413 (67%), Gaps = 20/413 (4%)
 Frame = +1

Query: 664  RYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSV---SDDGRKFGRIDERG 834
            R RS F+R + FKK+DY+Q I                 LPGSV   S+ G    R  E  
Sbjct: 35   RPRSSFSRFLRFKKLDYIQWICTVAVFLFFVVLFQMF-LPGSVVEKSELGSSPWRGMELV 93

Query: 835  SRELSFLKD---LDFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKKPKLALVF 1005
             ++L +LK+   LDFGED+KF+P KI+  FRK+  + N+  S+R ++RF Y+KP+LALVF
Sbjct: 94   DKDLWYLKEIGGLDFGEDIKFQPSKILQHFRKENREMNMSFSNRTLSRFPYRKPQLALVF 153

Query: 1006 ADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFSV 1185
            ADL VD HQ+LMVTVATALQEIGY I V+SL DGP   +W+ +  P+N+I     M+ +V
Sbjct: 154  ADLLVDPHQLLMVTVATALQEIGYTIHVYSLGDGPAQSIWKSMRSPVNIIQISHKMEIAV 213

Query: 1186 DWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNSW 1365
            DWLNY GI+VNSL    +    MQEPFKSVPL+WTI+E+TL+   RQY +S Q E++  W
Sbjct: 214  DWLNYDGILVNSLETKSVFSCFMQEPFKSVPLIWTINERTLATHSRQYTSSWQIELLYDW 273

Query: 1366 RKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASF------KNGFS 1527
            RK F RATVVV+PN+ LP+ YSA D GNY++IPGSP  +WE + +MA +      K G+ 
Sbjct: 274  RKAFNRATVVVFPNHVLPMMYSAFDTGNYYVIPGSPADIWETETTMALYNDEIHVKMGYE 333

Query: 1528 -----IAIVGSQLLYRGLWLEHAFILQSLYPVFTDFT---NSTSHLKIFILAGDSTSNYS 1683
                 IAIVGSQ LYRGLWLEHA +L++L P+F +F+   NS SHLKI IL+GD T NYS
Sbjct: 334  PDDIVIAIVGSQFLYRGLWLEHALVLKALLPLFAEFSLDNNSKSHLKIIILSGDPTGNYS 393

Query: 1684 RAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
             AVE IA +L+YP  TVK   + ++  + L  ADLVIYGSFLEE +FP+IL+K
Sbjct: 394  VAVEAIAANLSYPRGTVKHFAVDDDVGSPLGAADLVIYGSFLEEQSFPEILVK 446


>gb|EPS70431.1| hypothetical protein M569_04330 [Genlisea aurea]
          Length = 1000

 Score =  398 bits (1023), Expect = e-108
 Identities = 199/362 (54%), Positives = 256/362 (70%), Gaps = 15/362 (4%)
 Frame = +1

Query: 796  DDGRKFGRIDERGSR----ELSFLKDLDFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNV 963
            +DGR   RI     +    +LS LK+LDFGED+ FEP+ ++AKF+K + +S     S N+
Sbjct: 50   EDGRNLRRIPNIFKKIAVGDLSLLKELDFGEDVSFEPVNLLAKFQKHSNESKGSYVSFNI 109

Query: 964  TRFGYKKPKLALVFADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLP 1143
             R+GY+KPKLAL FADL VDSH ILM+T+A ALQ IGYEIEV SLEDGP   VWREVG P
Sbjct: 110  VRYGYRKPKLALAFADLRVDSHHILMLTLAAALQSIGYEIEVLSLEDGPGNAVWREVGFP 169

Query: 1144 LNVISADENMKFSVDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLR 1323
            + VI A +N+ F VDWLN++G++VNS+ AV  + SLMQ+PF+ VPLVWTIHE  L+ R R
Sbjct: 170  IRVIEAAQNLMFPVDWLNFNGVLVNSVKAVDAVYSLMQDPFRDVPLVWTIHEHELALRFR 229

Query: 1324 QYVASNQTEMVNSWRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEA---- 1491
             YV++ Q  + ++W+K F RA+VVV+PN+ LP+AYSACDPGNYF+IPGS    WE     
Sbjct: 230  DYVSNGQVNLFDNWKKFFARASVVVFPNHILPMAYSACDPGNYFVIPGSSMEAWEVGEVT 289

Query: 1492 --KKSMAS-----FKNGFSIAIVGSQLLYRGLWLEHAFILQSLYPVFTDFTNSTSHLKIF 1650
              KK   S     F+  F +AIVGS L+Y+G WLEHA +L++L+P    F+ S +HLKI 
Sbjct: 290  KDKKDNTSAVGKDFETFFVVAIVGSSLVYKGRWLEHALVLKALHPFLRSFSGSGTHLKIV 349

Query: 1651 ILAGDSTSNYSRAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPD 1830
            IL G ST +YS  VETI  +L YPN TV+ V   EN D +L  +D+V+YGSFLEEH FP+
Sbjct: 350  ILTGSSTPDYSSVVETIVENLKYPNGTVEHVVGDENVDDILRRSDVVLYGSFLEEHTFPE 409

Query: 1831 IL 1836
            IL
Sbjct: 410  IL 411


>ref|XP_002276292.2| PREDICTED: uncharacterized protein LOC100262009 [Vitis vinifera]
          Length = 1026

 Score =  394 bits (1013), Expect = e-107
 Identities = 212/412 (51%), Positives = 274/412 (66%), Gaps = 18/412 (4%)
 Frame = +1

Query: 661  QRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDERGSR 840
            QR   RF+R + F K+DY+Q +                 LPG + +   +  +  E G  
Sbjct: 27   QRPIVRFSRFLFFGKLDYLQWVCTVAVFCFFVVLFQMF-LPGLIMEKSGESLKNMENGYG 85

Query: 841  ELSFLKD---LDFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKKPKLALVFAD 1011
            +LSF+K+   LDFGE ++FEP K++ KF+K+A++ N+  +SR   RFGY+KP+LALVF D
Sbjct: 86   DLSFIKNIGGLDFGEGIRFEPSKLLQKFQKEADEVNLSSASRLRHRFGYRKPQLALVFPD 145

Query: 1012 LSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFSVDW 1191
            L VD  Q+LMVTVA+AL E+GY I+V+SLEDGPV  +WR VG P+ +I ++      VDW
Sbjct: 146  LLVDPQQLLMVTVASALLEMGYTIQVYSLEDGPVNAIWRNVGFPVTIIRSNAKSAAVVDW 205

Query: 1192 LNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNSWRK 1371
            LNY GIIVNSL A G++   +QEPFKS+PL+WTI E TL+ RLRQY  + + E+VN W+K
Sbjct: 206  LNYDGIIVNSLEARGVVSCFVQEPFKSLPLIWTIPEGTLATRLRQYNLTGKIELVNDWKK 265

Query: 1372 VFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASFKNG---------- 1521
            VF RAT VV+PNY LP+ YS  D GNYF+IPGSP   WE    MAS ++           
Sbjct: 266  VFNRATAVVFPNYVLPMIYSTFDSGNYFVIPGSPAQAWEVDNFMASHRDSPRVKMGYGPD 325

Query: 1522 -FSIAIVGSQLLYRGLWLEHAFILQSLYPVFTDF---TNSTSHLKIFILAGDSTSNYSRA 1689
             F IA+V SQ LY+GLWLEHA ILQ+L P+  +F    NS SHLKI I +G+S +NYS A
Sbjct: 326  DFVIALVRSQFLYKGLWLEHALILQALLPLVAEFPVDNNSNSHLKILITSGNSANNYSVA 385

Query: 1690 VETIALSLNYPNETVKLVPI-YENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            VE IAL L YP   VK + I    AD VL+ AD+VIYGSFLEE +FPDIL+K
Sbjct: 386  VEAIALKLRYPKGVVKHIAIDVGEADNVLAAADIVIYGSFLEEQSFPDILIK 437


>emb|CBI40456.3| unnamed protein product [Vitis vinifera]
          Length = 1026

 Score =  394 bits (1013), Expect = e-107
 Identities = 212/412 (51%), Positives = 274/412 (66%), Gaps = 18/412 (4%)
 Frame = +1

Query: 661  QRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDERGSR 840
            QR   RF+R + F K+DY+Q +                 LPG + +   +  +  E G  
Sbjct: 27   QRPIVRFSRFLFFGKLDYLQWVCTVAVFCFFVVLFQMF-LPGLIMEKSGESLKNMENGYG 85

Query: 841  ELSFLKD---LDFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKKPKLALVFAD 1011
            +LSF+K+   LDFGE ++FEP K++ KF+K+A++ N+  +SR   RFGY+KP+LALVF D
Sbjct: 86   DLSFIKNIGGLDFGEGIRFEPSKLLQKFQKEADEVNLSSASRLRHRFGYRKPQLALVFPD 145

Query: 1012 LSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFSVDW 1191
            L VD  Q+LMVTVA+AL E+GY I+V+SLEDGPV  +WR VG P+ +I ++      VDW
Sbjct: 146  LLVDPQQLLMVTVASALLEMGYTIQVYSLEDGPVNAIWRNVGFPVTIIRSNAKSAAVVDW 205

Query: 1192 LNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNSWRK 1371
            LNY GIIVNSL A G++   +QEPFKS+PL+WTI E TL+ RLRQY  + + E+VN W+K
Sbjct: 206  LNYDGIIVNSLEARGVVSCFVQEPFKSLPLIWTIPEGTLATRLRQYNLTGKIELVNDWKK 265

Query: 1372 VFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASFKNG---------- 1521
            VF RAT VV+PNY LP+ YS  D GNYF+IPGSP   WE    MAS ++           
Sbjct: 266  VFNRATAVVFPNYVLPMIYSTFDSGNYFVIPGSPAQAWEVDNFMASHRDSPRVKMGYGPD 325

Query: 1522 -FSIAIVGSQLLYRGLWLEHAFILQSLYPVFTDF---TNSTSHLKIFILAGDSTSNYSRA 1689
             F IA+V SQ LY+GLWLEHA ILQ+L P+  +F    NS SHLKI I +G+S +NYS A
Sbjct: 326  DFVIALVRSQFLYKGLWLEHALILQALLPLVAEFPVDNNSNSHLKILITSGNSANNYSVA 385

Query: 1690 VETIALSLNYPNETVKLVPI-YENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            VE IAL L YP   VK + I    AD VL+ AD+VIYGSFLEE +FPDIL+K
Sbjct: 386  VEAIALKLRYPKGVVKHIAIDVGEADNVLAAADIVIYGSFLEEQSFPDILIK 437


>emb|CAN69310.1| hypothetical protein VITISV_003086 [Vitis vinifera]
          Length = 1040

 Score =  384 bits (986), Expect = e-104
 Identities = 212/426 (49%), Positives = 273/426 (64%), Gaps = 32/426 (7%)
 Frame = +1

Query: 661  QRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDERGSR 840
            QR   RF+R + F K+DY+Q +                 LPG + +   +  +  E G  
Sbjct: 27   QRPIVRFSRFLFFGKLDYLQWVCTVAVFCFFVVLFQMF-LPGLIMEKSGESLKNMENGYG 85

Query: 841  ELSFLKD---LDFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKKPKLALVFAD 1011
            +LSF+K    LDFGE ++FEP K++ KF+K+A++ N+  +SR   RFGY+KP+LALVF D
Sbjct: 86   DLSFIKKIGGLDFGEGIRFEPSKLLQKFQKEADEVNLSSASRLRHRFGYRKPQLALVFPD 145

Query: 1012 LSVDSHQILMVTVATALQEIGYEIE--------------VFSLEDGPVGGVWREVGLPLN 1149
            L VD  Q+LMVTVA+AL E+GY I+              V+SLEDGPV  +WR VG P+ 
Sbjct: 146  LLVDPQQLLMVTVASALLEMGYTIQALPYLVSIYVAWIQVYSLEDGPVNAIWRNVGFPVT 205

Query: 1150 VISADENMKFSVDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQY 1329
            +I ++      VDWLNY GIIVNSL A G++   +QEPFKS+PL+WTI E TL+ RLRQY
Sbjct: 206  IIRSNAKSAAVVDWLNYDGIIVNSLEARGVVSCFVQEPFKSLPLIWTIPEGTLATRLRQY 265

Query: 1330 VASNQTEMVNSWRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMAS 1509
              + + E+VN W+KVF RAT VV+PNY LP+ YS  D GNYF+IPGSP   WE    MAS
Sbjct: 266  NLTGKIELVNDWKKVFNRATAVVFPNYVLPMIYSTFDSGNYFVIPGSPAQAWEVDNFMAS 325

Query: 1510 FKNG-----------FSIAIVGSQLLYRGLWLEHAFILQSLYPVFTDF---TNSTSHLKI 1647
             ++            F IA+V SQ LY+GLWLEHA ILQ+L P+  +F    NS SHLKI
Sbjct: 326  HRDSPRVKMGYGPDDFVIALVRSQFLYKGLWLEHALILQALLPLVAEFPVDNNSNSHLKI 385

Query: 1648 FILAGDSTSNYSRAVETIALSLNYPNETVKLVPI-YENADAVLSMADLVIYGSFLEEHAF 1824
             I +G+S +NYS AVE IAL L YP   VK + I    AD VL+ AD+VIYGSFLEE +F
Sbjct: 386  LITSGNSANNYSVAVEAIALKLRYPKGVVKHIAIDVGEADNVLAAADIVIYGSFLEEQSF 445

Query: 1825 PDILLK 1842
            PDIL+K
Sbjct: 446  PDILIK 451


>ref|XP_004159777.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101212216 [Cucumis
            sativus]
          Length = 1037

 Score =  383 bits (984), Expect = e-103
 Identities = 197/411 (47%), Positives = 277/411 (67%), Gaps = 17/411 (4%)
 Frame = +1

Query: 661  QRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDERGSR 840
            QR RSRF+R + F+KIDY+Q I                 LPGSV +      +  E+   
Sbjct: 31   QRPRSRFSRFLFFRKIDYLQWICTVAVFFFFVVLFQMF-LPGSVVEKSEVALKDVEKSLG 89

Query: 841  ELSFLKDL---DFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKKPKLALVFAD 1011
            +L FLK+L   DFGED++FEP K++ KF+K+A +++    +R  +RFGY+KP+LALVF+D
Sbjct: 90   DLKFLKELGMLDFGEDIRFEPSKLLGKFKKEAREADFSSFNRTRSRFGYRKPQLALVFSD 149

Query: 1012 LSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFSVDW 1191
            L VDS+Q+LMVT+A+ALQEIGY  +V+SL+ GP   VWR++G+P+ +I + +  +  VDW
Sbjct: 150  LLVDSYQVLMVTIASALQEIGYVFQVYSLQGGPANDVWRQMGVPVTLIQSCDETEVMVDW 209

Query: 1192 LNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNSWRK 1371
            LNY GI+V+SLG   +    +QEPFKS+PL+WTIHE+ L+ R + Y +    +++N W++
Sbjct: 210  LNYDGILVHSLGVKDVFSCYLQEPFKSLPLIWTIHEEALAIRSQNYASDGLLDILNDWKR 269

Query: 1372 VFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMAS------FKNGFS-- 1527
            VF  +TVVV+PNY +P+ YSA D GN+F+IP  P    EA+  + S       K G++  
Sbjct: 270  VFNHSTVVVFPNYVMPMIYSAYDSGNFFVIPSFPAEALEAEIDVTSDADNLRAKMGYAND 329

Query: 1528 ---IAIVGSQLLYRGLWLEHAFILQSLYPVFTDFT---NSTSHLKIFILAGDSTSNYSRA 1689
               IAIVGSQ LYRG+WLEHA +LQ++ P+  +F+   +S S LKIF+L+GDS SNY+ A
Sbjct: 330  DLVIAIVGSQFLYRGMWLEHAMVLQAMLPLLHEFSFYEHSNSRLKIFVLSGDSNSNYTMA 389

Query: 1690 VETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            VE IA  L YP   VK  P+  ++D  LSMADLVIYGS LEE +FP +L+K
Sbjct: 390  VEAIAQRLEYPRSVVKHFPVAADSDKALSMADLVIYGSCLEEQSFPKVLVK 440


>ref|XP_004138457.1| PREDICTED: uncharacterized protein LOC101212216 [Cucumis sativus]
          Length = 1037

 Score =  383 bits (984), Expect = e-103
 Identities = 197/411 (47%), Positives = 277/411 (67%), Gaps = 17/411 (4%)
 Frame = +1

Query: 661  QRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDERGSR 840
            QR RSRF+R + F+KIDY+Q I                 LPGSV +      +  E+   
Sbjct: 31   QRPRSRFSRFLFFRKIDYLQWICTVAVFFFFVVLFQMF-LPGSVVEKSEVALKDVEKSLG 89

Query: 841  ELSFLKDL---DFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVTRFGYKKPKLALVFAD 1011
            +L FLK+L   DFGED++FEP K++ KF+K+A +++    +R  +RFGY+KP+LALVF+D
Sbjct: 90   DLKFLKELGMLDFGEDIRFEPSKLLGKFKKEAREADFSSFNRTRSRFGYRKPQLALVFSD 149

Query: 1012 LSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFSVDW 1191
            L VDS+Q+LMVT+A+ALQEIGY  +V+SL+ GP   VWR++G+P+ +I + +  +  VDW
Sbjct: 150  LLVDSYQVLMVTIASALQEIGYVFQVYSLQGGPANDVWRQMGVPVTLIQSCDETEVMVDW 209

Query: 1192 LNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNSWRK 1371
            LNY GI+V+SLG   +    +QEPFKS+PL+WTIHE+ L+ R + Y +    +++N W++
Sbjct: 210  LNYDGILVHSLGVKDVFSCYLQEPFKSLPLIWTIHEEALAIRSQNYASDGLLDILNDWKR 269

Query: 1372 VFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMAS------FKNGFS-- 1527
            VF  +TVVV+PNY +P+ YSA D GN+F+IP  P    EA+  + S       K G++  
Sbjct: 270  VFNHSTVVVFPNYVMPMIYSAYDSGNFFVIPSFPAEALEAEIDVTSDADNLRAKMGYAND 329

Query: 1528 ---IAIVGSQLLYRGLWLEHAFILQSLYPVFTDFT---NSTSHLKIFILAGDSTSNYSRA 1689
               IAIVGSQ LYRG+WLEHA +LQ++ P+  +F+   +S S LKIF+L+GDS SNY+ A
Sbjct: 330  DLVIAIVGSQFLYRGMWLEHAMVLQAMLPLLHEFSFYEHSNSRLKIFVLSGDSNSNYTMA 389

Query: 1690 VETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            VE IA  L YP   VK  P+  ++D  LSMADLVIYGS LEE +FP +L+K
Sbjct: 390  VEAIAQRLEYPRSVVKHFPVAADSDKALSMADLVIYGSCLEEQSFPKVLVK 440


>ref|XP_002872903.1| glycosyltransferase family protein 1 [Arabidopsis lyrata subsp.
            lyrata] gi|297318740|gb|EFH49162.1| glycosyltransferase
            family protein 1 [Arabidopsis lyrata subsp. lyrata]
          Length = 1018

 Score =  382 bits (980), Expect = e-103
 Identities = 203/408 (49%), Positives = 270/408 (66%), Gaps = 11/408 (2%)
 Frame = +1

Query: 652  FWGQRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDE- 828
            F+ QR RSR +R  L K  +Y+Q IS                LPG V D   K     E 
Sbjct: 31   FFLQRNRSRLSRFFLLKSFNYLQWISSICVFFFFVVLFQMF-LPGLVIDKSDKPWTSKEI 89

Query: 829  -----RGSRELSFLKDLDFGEDLKFEPLKIVAKFRKDAEDSNVGVSSRNVT--RFGYKKP 987
                  G RE  FL   DFG+D++FEP K++ KF+++A   N   SS N T  RFG++KP
Sbjct: 90   LPPDLLGFREKGFL---DFGDDVRFEPTKLLMKFQREANGLNFTSSSLNTTLQRFGFRKP 146

Query: 988  KLALVFADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADE 1167
            KLALVFADL  D  Q+LMV+++ ALQEIGY IEV+SLEDGPV  +WR++G+P+ ++  + 
Sbjct: 147  KLALVFADLLADPEQVLMVSLSKALQEIGYAIEVYSLEDGPVNSIWRKMGVPVTILKTNH 206

Query: 1168 NMKFSVDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQT 1347
                 +DWL+Y GIIVNSL A  +    MQEPFKS+PL+W I+E+TL+ R RQY +  QT
Sbjct: 207  ASSCVIDWLSYDGIIVNSLRAKSMFTCFMQEPFKSLPLIWVINEETLAVRSRQYNSIGQT 266

Query: 1348 EMVNSWRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKK-SMASFKNGF 1524
            E++N W+K+F RA+VVV+ NY LP+ Y+  D GN+++IPGSP+ VW+AK       K+  
Sbjct: 267  ELLNDWKKIFSRASVVVFHNYLLPILYTEFDAGNFYVIPGSPEDVWKAKNLEFPPQKDDV 326

Query: 1525 SIAIVGSQLLYRGLWLEHAFILQSLYPVFTD--FTNSTSHLKIFILAGDSTSNYSRAVET 1698
             I+IVGSQ LY+G WLEHA +LQ+L P+F      + TSHLKI +L G+S SNYS A+ET
Sbjct: 327  VISIVGSQFLYKGQWLEHALLLQALRPLFPGNYLESDTSHLKIIVLGGESASNYSVAIET 386

Query: 1699 IALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            I+ +L YP + VK V I  N D +L  +DLVIYGSFLEE +FP+IL+K
Sbjct: 387  ISQNLTYPKDAVKHVSIAGNVDKILESSDLVIYGSFLEEQSFPEILMK 434


>ref|XP_004306713.1| PREDICTED: uncharacterized protein LOC101302584 [Fragaria vesca
            subsp. vesca]
          Length = 1039

 Score =  381 bits (978), Expect = e-103
 Identities = 203/418 (48%), Positives = 279/418 (66%), Gaps = 24/418 (5%)
 Frame = +1

Query: 661  QRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVSDDGRKFGRIDERGSR 840
            QR RSRF+R ++ KK+DY+  I                 LPGSV +   K G + ++ + 
Sbjct: 33   QRPRSRFSRFLILKKLDYLLWICTVAVFLFFVVLFQMF-LPGSVVE---KSGSLLQKKNV 88

Query: 841  ELS-----FLKDL---DFGEDLKFEPLKIVAKFRKDAEDSNVGVS-SRNVTRFGYKKPKL 993
            EL      F+K+L   DFGED++FEP K++ KFRK+  ++++    +R +  FG +KP+L
Sbjct: 89   ELDYGDLRFVKELGLLDFGEDIRFEPSKLLEKFRKEGREASLSSGFNRTLQHFGLRKPQL 148

Query: 994  ALVFADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENM 1173
            ALVFADL  DSHQ+ MVTVA ALQEIGYE+ V+SLEDGP  G W+ +G+P+ +I   +  
Sbjct: 149  ALVFADLLFDSHQLQMVTVAAALQEIGYELWVYSLEDGPARGAWKSLGVPVTIIQTCDQP 208

Query: 1174 KFSVDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEM 1353
            K  VDWLNY+GI+V+SL A G+    +QEPFKS+P++WTIHE+ L+ R R+Y +S+Q E+
Sbjct: 209  KIVVDWLNYNGILVSSLEAKGIFSCFVQEPFKSLPVIWTIHEEALATRSRKYSSSSQIEL 268

Query: 1354 VNSWRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEA-------------K 1494
            +N W++VF R+TVVV+PNY+LP+ YS  D GN+F+IPGSP    +              +
Sbjct: 269  LNDWKRVFNRSTVVVFPNYFLPMIYSTLDAGNFFVIPGSPAEACKTDSDSIVALDIDNLQ 328

Query: 1495 KSMASFKNGFSIAIVGSQLLYRGLWLEHAFILQSLYPVFTDF--TNSTSHLKIFILAGDS 1668
             S  +      I IVGS+ LYRGLWLEH+ +L++L P+  DF   N++SHLKI +L+GDS
Sbjct: 329  GSAGNEPENVVITIVGSKFLYRGLWLEHSIVLRALLPLLEDFLLDNNSSHLKIIVLSGDS 388

Query: 1669 TSNYSRAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            TSNYS  VE IA +L YP+  VK   I  +AD VLS + LVIYGSFLEE +FPDIL+K
Sbjct: 389  TSNYSSVVEAIAYNLKYPSGIVKHAAIDVDADNVLSTSHLVIYGSFLEEQSFPDILIK 446


>ref|XP_006444916.1| hypothetical protein CICLE_v10018649mg [Citrus clementina]
            gi|568876282|ref|XP_006491210.1| PREDICTED:
            uncharacterized protein LOC102628793 [Citrus sinensis]
            gi|557547178|gb|ESR58156.1| hypothetical protein
            CICLE_v10018649mg [Citrus clementina]
          Length = 1038

 Score =  379 bits (972), Expect = e-102
 Identities = 205/414 (49%), Positives = 278/414 (67%), Gaps = 20/414 (4%)
 Frame = +1

Query: 661  QRYRSRFARLVLFKKIDYVQLISXXXXXXXXXXXXXXXXLPGSVS--DDGRKFGRIDERG 834
            QR RSRF+R + FKK+DY+  I                 LPGSV+  D+ +   R  ++ 
Sbjct: 32   QRNRSRFSRFLFFKKLDYLLWICTVAVFLFFVVIFQLF-LPGSVTVMDESQGSLRDFDKV 90

Query: 835  SRELSFLKD---LDFGEDLKFEPLKIVAKFRKDAEDSNV-GVSSRNVTRFGYKKPKLALV 1002
              +L FLK+   LDFGE++ F PLK++ KF+ + +D N+  V  R + RFGY+KP+LALV
Sbjct: 91   PADLMFLKEMGLLDFGEEVTFLPLKLMEKFQSEDKDVNLTSVFHRKLHRFGYRKPQLALV 150

Query: 1003 FADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGVWREVGLPLNVISADENMKFS 1182
            F DL +D  Q+ MVT+A AL+EIGY I+V+SLEDG    VWR +G+P+ ++         
Sbjct: 151  FPDLLIDPQQLQMVTIAIALREIGYAIQVYSLEDGRAHEVWRNIGVPVAILQTGREKASF 210

Query: 1183 VDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQTLSARLRQYVASNQTEMVNS 1362
            V+WLNY GI+VNSL A  ++ ++MQEPFKS+PLVWTIHE TL+ R R Y +S Q E++N 
Sbjct: 211  VNWLNYDGILVNSLEAKVVISNIMQEPFKSLPLVWTIHEGTLATRARNYASSGQLELLND 270

Query: 1363 WRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAVWEAKKSMASF------KNGF 1524
            W+KVF RATVVV+P+Y LP+ YSA D GNY++IPGSP   WEA  +M  +      K GF
Sbjct: 271  WKKVFNRATVVVFPDYVLPMMYSAFDAGNYYVIPGSPAKAWEADTNMDLYNDTVRVKMGF 330

Query: 1525 S-----IAIVGSQLLYRGLWLEHAFILQSLYPVFTDFT---NSTSHLKIFILAGDSTSNY 1680
                  IAIVG+Q +YRGLWLEHA IL++L P+F++ +    S S +K+ IL+GDSTSNY
Sbjct: 331  KPDDLVIAIVGTQFMYRGLWLEHALILRALLPLFSEVSVENESNSPIKVMILSGDSTSNY 390

Query: 1681 SRAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYGSFLEEHAFPDILLK 1842
            S  +E IA +L+YP   VK +    + D+VL+ AD+VIYGSFLEE  FP+IL+K
Sbjct: 391  SVVIEAIAHNLHYPLGVVKHIAAEGDVDSVLNTADVVIYGSFLEEQTFPEILVK 444


>ref|XP_002511940.1| transferase, transferring glycosyl groups, putative [Ricinus
            communis] gi|223549120|gb|EEF50609.1| transferase,
            transferring glycosyl groups, putative [Ricinus communis]
          Length = 935

 Score =  376 bits (966), Expect = e-101
 Identities = 194/374 (51%), Positives = 263/374 (70%), Gaps = 19/374 (5%)
 Frame = +1

Query: 778  LPGSVSDDGRKFGRIDERGSRELSFLK---DLDFGEDLKFEPLKIVAKFRKDAEDSNVGV 948
            LPGS+ D      +  E    +L +LK    LDFGED++F+PLK++ KF+K+  + N+  
Sbjct: 18   LPGSMIDKSEVSLKKLEIVPGDLLYLKAMGTLDFGEDVQFQPLKLLEKFQKENREVNLTS 77

Query: 949  SSRNVT--RFGYKKPKLALVFADLSVDSHQILMVTVATALQEIGYEIEVFSLEDGPVGGV 1122
            S+ N T  RFGY+KP+LALVFADL  D  Q+LMVTVATALQEIGY I+VFS+ DGPV  +
Sbjct: 78   SAFNRTLLRFGYRKPQLALVFADLLADPQQLLMVTVATALQEIGYAIQVFSVNDGPVHDI 137

Query: 1123 WREVGLPLNVISADENMKFSVDWLNYHGIIVNSLGAVGLLPSLMQEPFKSVPLVWTIHEQ 1302
            W+ +G+P+ +   +  M+ +VDWL +  IIVNSL A  + P  MQEPFKS+PL+WTIHE+
Sbjct: 138  WKRIGVPVTIFQTNHKMEIAVDWLIFDSIIVNSLEAKVVFPCFMQEPFKSIPLIWTIHEK 197

Query: 1303 TLSARLRQYVASNQTEMVNSWRKVFQRATVVVYPNYYLPVAYSACDPGNYFIIPGSPKAV 1482
            TL  R RQY+++ Q E+V+ W++VF RATVVV+PN+ LP+ YSA D  NY++IPGSP  V
Sbjct: 198  TLGIRSRQYISNGQIELVSDWKRVFNRATVVVFPNHVLPMMYSAFDAENYYVIPGSPAEV 257

Query: 1483 WEAKKSMASFKNGFS-----------IAIVGSQLLYRGLWLEHAFILQSLYPVFTDFT-- 1623
            WEA+   A +K+              IAIVGSQ LYRGLWLEHA ILQ+L P+F+DF+  
Sbjct: 258  WEAEAMAAVYKDSIRMKMGYRPDDIIIAIVGSQFLYRGLWLEHALILQALSPLFSDFSFD 317

Query: 1624 -NSTSHLKIFILAGDSTSNYSRAVETIALSLNYPNETVKLVPIYENADAVLSMADLVIYG 1800
             NS  HLKI +L+G+STSNYS A+E IA++L+YP   VK + I  +  + L+ AD+V YG
Sbjct: 318  DNSNPHLKIIVLSGNSTSNYSVAIEAIAINLHYPIGAVKHIAIDGDVGSFLTAADIVTYG 377

Query: 1801 SFLEEHAFPDILLK 1842
            SF +  +FP++L+K
Sbjct: 378  SFHDGQSFPEMLMK 391

