BLASTX nr result

ID: Zingiber24_contig00001679 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber24_contig00001679
         (1104 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolo...   271   3e-70
gb|EXC27339.1| hypothetical protein L484_001075 [Morus notabilis]     267   5e-69
ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arab...   266   1e-68
ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana] ...   265   2e-68
gb|EXC27334.1| hypothetical protein L484_001070 [Morus notabilis]     265   3e-68
ref|XP_006394641.1| hypothetical protein EUTSA_v10003948mg [Eutr...   261   3e-67
ref|XP_006287442.1| hypothetical protein CARUB_v10000648mg [Caps...   261   3e-67
ref|XP_006290867.1| hypothetical protein CARUB_v10016976mg [Caps...   261   5e-67
gb|AED99886.1| glycosyltransferase [Panax notoginseng]                260   6e-67
gb|EXB29382.1| hypothetical protein L484_001025 [Morus notabilis]     259   1e-66
ref|XP_006404195.1| hypothetical protein EUTSA_v10010269mg [Eutr...   259   1e-66
ref|XP_004234394.1| PREDICTED: O-glucosyltransferase rumi homolo...   259   1e-66
ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, p...   259   1e-66
gb|EOY23195.1| Glycosyltransferase isoform 3 [Theobroma cacao]        259   2e-66
gb|EOY23194.1| Glycosyltransferase isoform 2 [Theobroma cacao]        259   2e-66
gb|EOY23193.1| Glycosyltransferase isoform 1 [Theobroma cacao]        259   2e-66
gb|EOY24688.1| Glycosyltransferase isoform 1 [Theobroma cacao]        258   2e-66
ref|XP_002875936.1| hypothetical protein ARALYDRAFT_485256 [Arab...   258   2e-66
ref|XP_006599594.1| PREDICTED: KDEL motif-containing protein 2-l...   258   3e-66
dbj|BAE99650.1| hypothetical protein [Arabidopsis thaliana]           258   3e-66

>ref|XP_002268245.1| PREDICTED: O-glucosyltransferase rumi homolog [Vitis vinifera]
            gi|302143884|emb|CBI22745.3| unnamed protein product
            [Vitis vinifera]
          Length = 525

 Score =  271 bits (694), Expect = 3e-70
 Identities = 122/221 (55%), Positives = 159/221 (71%), Gaps = 6/221 (2%)
 Frame = +1

Query: 460  KTQPIPIT--LSCESPTAT-VCERRSTPMPTPHLTTSYNSPSCPEYFRWIHDDLRHWKST 630
            K  P+ I   L+C +   T  C R      +P      + P CP YFRWI+ DLR W  +
Sbjct: 83   KKPPVKIEYPLNCSAGNLTRTCPRNYPTAFSPEDPDRPSPPECPHYFRWIYGDLRPWMKS 142

Query: 631  GITREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLD 810
            GITREM+E A   ATF+LV+L+GR YV++Y   F TR+VF+LWGILQL+ RYPG+VPDL+
Sbjct: 143  GITREMVERAKRTATFKLVILNGRAYVEKYQRAFQTRDVFTLWGILQLLRRYPGKVPDLE 202

Query: 811  LMFNCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWASL 981
            LMF+C+D P ++S EY   N++ PPP+F YC +D T DI+FPDWSFWGWPEINIKPW SL
Sbjct: 203  LMFDCVDWPVIQSNEYRGPNATAPPPLFRYCGDDATLDIVFPDWSFWGWPEINIKPWESL 262

Query: 982  LKEMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
            LK++KE N+  +W++REPYA+W+GNP +   R DLLKCNV+
Sbjct: 263  LKDLKEGNKRSRWMEREPYAYWKGNPAVAATRLDLLKCNVS 303


>gb|EXC27339.1| hypothetical protein L484_001075 [Morus notabilis]
          Length = 476

 Score =  267 bits (683), Expect = 5e-69
 Identities = 123/231 (53%), Positives = 164/231 (70%), Gaps = 12/231 (5%)
 Frame = +1

Query: 448  LQHSKTQPIPITLSCESPTATVCERRSTPMPTPHLTTSYN---------SPSCPEYFRWI 600
            +  S  Q + I L+C +  AT    R+ P    + TT++N          P+CP+YFRWI
Sbjct: 74   ISESPPQKVEIPLNCTAYEAT----RTRPS---NYTTAHNIQDDPDRPLPPTCPDYFRWI 126

Query: 601  HDDLRHWKSTGITREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVN 780
            ++DLR W  TGI+R+ +E A   A FRLV+++G+ YV+ Y   F TR+VF+LWGILQL+ 
Sbjct: 127  YEDLRPWAHTGISRDTVERAKPTADFRLVIVNGKAYVETYRRSFQTRDVFTLWGILQLLQ 186

Query: 781  RYPGRVPDLDLMFNCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWDILFPDWSFWGWP 951
            RYPGRVPDLDLMFNC D P + S  Y   N+ +PPP+FHYC ND T DI+FPDWSFWGWP
Sbjct: 187  RYPGRVPDLDLMFNCGDLPLILSKAYRKANAKSPPPLFHYCANDNTLDIVFPDWSFWGWP 246

Query: 952  EINIKPWASLLKEMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
            E+NIKPW  LLKE++E N++ KW+DR+PYA+W+GNP +  +R +LLKCNV+
Sbjct: 247  EVNIKPWEPLLKELEEGNKKSKWVDRQPYAYWKGNPDVSRSRRNLLKCNVS 297


>ref|XP_002872075.1| hypothetical protein ARALYDRAFT_910396 [Arabidopsis lyrata subsp.
            lyrata] gi|297317912|gb|EFH48334.1| hypothetical protein
            ARALYDRAFT_910396 [Arabidopsis lyrata subsp. lyrata]
          Length = 543

 Score =  266 bits (679), Expect = 1e-68
 Identities = 128/255 (50%), Positives = 173/255 (67%), Gaps = 12/255 (4%)
 Frame = +1

Query: 376  SSEQVYQQTINPPNAKRPDRQTSNLQHSKTQPIPITLSCESPTATVCERRSTPMPTPHLT 555
            ++ Q   QTI+P   K P   T   Q  K +    TL C S   T     S   PT   T
Sbjct: 74   TTTQTQTQTISP---KYPRPTTVITQSPKPE---FTLHC-SANETTASCPSNKYPT---T 123

Query: 556  TSY------NSP---SCPEYFRWIHDDLRHWKSTGITREMIENAHEYATFRLVVLDGRVY 708
             S+      N P   +CP+YFRWIH+DLR W STGITRE +E A + A FRL ++DG++Y
Sbjct: 124  ASFGEDDDTNHPPNATCPDYFRWIHEDLRPWSSTGITREALERAKKTANFRLAIIDGKIY 183

Query: 709  VQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLDLMFNCMDQPAVKSFEY---NSSTPPP 879
            V+++   F TR+VF++WG LQL+ +YPG++PDL+LMF+C+D P VK+ E+   N+ +PPP
Sbjct: 184  VEKFQDAFQTRDVFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVKASEFTGANAPSPPP 243

Query: 880  VFHYCNNDKTWDILFPDWSFWGWPEINIKPWASLLKEMKEANEEMKWIDREPYAFWRGNP 1059
            +F YC N++T DI+FPDWSFWGW E+NIKPW SLLKE++E N+  KWI+REPYA+W+GNP
Sbjct: 244  LFRYCGNEETLDIVFPDWSFWGWAEVNIKPWESLLKELREGNQRTKWINREPYAYWKGNP 303

Query: 1060 HMGGNRFDLLKCNVT 1104
             +   R DL+KCNV+
Sbjct: 304  MVAETRQDLMKCNVS 318


>ref|NP_197774.1| uncharacterized protein [Arabidopsis thaliana]
            gi|10176852|dbj|BAB10058.1| unnamed protein product
            [Arabidopsis thaliana] gi|48310551|gb|AAT41837.1|
            At5g23850 [Arabidopsis thaliana]
            gi|62320258|dbj|BAD94534.1| putative protein [Arabidopsis
            thaliana] gi|332005839|gb|AED93222.1| uncharacterized
            protein AT5G23850 [Arabidopsis thaliana]
          Length = 542

 Score =  265 bits (678), Expect = 2e-68
 Identities = 123/243 (50%), Positives = 165/243 (67%), Gaps = 7/243 (2%)
 Frame = +1

Query: 397  QTINPPNAKRPDRQTSNLQHSKTQPIPITLSCESPTATV-CERRSTPMPTPHLTTSYNSP 573
            QTI P   K P   T   Q  K +    TL C +   T  C     P  T       N P
Sbjct: 81   QTITP---KYPRPTTVITQSPKPE---FTLHCSANETTASCPSNKYPTTTSFEDDDTNHP 134

Query: 574  ---SCPEYFRWIHDDLRHWKSTGITREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRN 744
               +CP+YFRWIH+DLR W  TGITRE +E A + ATFRL ++ G++YV+++   F TR+
Sbjct: 135  PTATCPDYFRWIHEDLRPWSRTGITREALERAKKTATFRLAIVGGKIYVEKFQDAFQTRD 194

Query: 745  VFSLWGILQLVNRYPGRVPDLDLMFNCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWD 915
            VF++WG LQL+ +YPG++PDL+LMF+C+D P V++ E+   N+ +PPP+F YC N++T D
Sbjct: 195  VFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRATEFAGANAPSPPPLFRYCGNEETLD 254

Query: 916  ILFPDWSFWGWPEINIKPWASLLKEMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKC 1095
            I+FPDWSFWGW E+NIKPW SLLKE++E NE  KWI+REPYA+W+GNP +   R DL+KC
Sbjct: 255  IVFPDWSFWGWAEVNIKPWESLLKELREGNERTKWINREPYAYWKGNPMVAETRQDLMKC 314

Query: 1096 NVT 1104
            NV+
Sbjct: 315  NVS 317


>gb|EXC27334.1| hypothetical protein L484_001070 [Morus notabilis]
          Length = 511

 Score =  265 bits (676), Expect = 3e-68
 Identities = 115/214 (53%), Positives = 156/214 (72%), Gaps = 3/214 (1%)
 Frame = +1

Query: 472  IPITLSCESPTATVCERRSTPMPTPHLTTSYNSPSCPEYFRWIHDDLRHWKSTGITREMI 651
            IP+  +   PT T     +T           + P+CP+YFRWI++DLR W  TGI+R+M+
Sbjct: 77   IPLNCTAYDPTRTCPSNYTTAHNKQDDLDRPSPPTCPDYFRWIYEDLRPWAHTGISRDMV 136

Query: 652  ENAHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLDLMFNCMD 831
            E A   A FRLV+++G+ YV+ Y   F TR++F+LWGILQL+ RYPGRVPDLDLMFNC D
Sbjct: 137  ERAKPTADFRLVIVNGKAYVETYRRSFQTRDIFTLWGILQLLRRYPGRVPDLDLMFNCGD 196

Query: 832  QPAVKSFEY---NSSTPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWASLLKEMKEA 1002
             P + S  Y   N+++PPP+FHYC +D T DI+FPDWSFWGWPE+NIKPW  LLKE++E 
Sbjct: 197  LPLILSKSYSGANATSPPPLFHYCADDYTLDIVFPDWSFWGWPEVNIKPWEPLLKELEEG 256

Query: 1003 NEEMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
            N++ KW+DR+P+A+W+GNP++  +R DLLKC V+
Sbjct: 257  NKKSKWVDRQPHAYWKGNPNVSPSRQDLLKCKVS 290


>ref|XP_006394641.1| hypothetical protein EUTSA_v10003948mg [Eutrema salsugineum]
            gi|557091280|gb|ESQ31927.1| hypothetical protein
            EUTSA_v10003948mg [Eutrema salsugineum]
          Length = 545

 Score =  261 bits (668), Expect = 3e-67
 Identities = 115/220 (52%), Positives = 157/220 (71%), Gaps = 8/220 (3%)
 Frame = +1

Query: 469  PIPITLSCE-SPTATVCERRSTPMPTPHL----TTSYNSPSCPEYFRWIHDDLRHWKSTG 633
            P   TL C  + T   C R + P          T S  + +CP+YFRWIH+DLR W+ TG
Sbjct: 101  PREFTLHCSGNETTGTCPRNNYPTTVSFKEDDSTHSSTTATCPDYFRWIHEDLRPWEKTG 160

Query: 634  ITREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLDL 813
            ITRE +E A + A FRL ++ G++YV+++   F TR+VF++WG LQL+ RYPG++PDL+L
Sbjct: 161  ITREALERAKKTANFRLAIVGGKLYVEKFQDAFQTRDVFTIWGFLQLLRRYPGKIPDLEL 220

Query: 814  MFNCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWASLL 984
            MF+C+D P VK+  +   NS +PPP+F YC N++T DI+FPDWSFWGW E+NIKPW SLL
Sbjct: 221  MFDCVDWPVVKAANFAGANSPSPPPLFRYCGNEETLDIVFPDWSFWGWSEVNIKPWESLL 280

Query: 985  KEMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
            KE++E NE+  WI+REPYA+W+GNP +   R DL+KCNV+
Sbjct: 281  KELREGNEKTNWINREPYAYWKGNPLVAETRQDLMKCNVS 320


>ref|XP_006287442.1| hypothetical protein CARUB_v10000648mg [Capsella rubella]
            gi|482556148|gb|EOA20340.1| hypothetical protein
            CARUB_v10000648mg [Capsella rubella]
          Length = 544

 Score =  261 bits (668), Expect = 3e-67
 Identities = 120/243 (49%), Positives = 167/243 (68%), Gaps = 8/243 (3%)
 Frame = +1

Query: 400  TINPPNAKRPDRQTSNLQHSKTQPIPITLSCES--PTATVCERRSTPMPTPHLTTSYNSP 573
            TI+P   K P   T   Q+ K Q    TL C +   T   C +   P          N P
Sbjct: 83   TISP---KYPRPATVITQNPKPQ---FTLHCSANETTGNTCPKNKDPTTASFNDDDTNHP 136

Query: 574  ---SCPEYFRWIHDDLRHWKSTGITREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRN 744
               +CP+YFRWIH+DLR W  TGITRE +E A++ A FRL ++ G+VYV+++   F TR+
Sbjct: 137  PTATCPDYFRWIHEDLRPWARTGITREALERANKTANFRLAIVGGKVYVEKFQDAFQTRD 196

Query: 745  VFSLWGILQLVNRYPGRVPDLDLMFNCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWD 915
            VF++WG LQL+ +YPG++PDL+LMF+C+D P V++ E+   ++ +PPP+F YC N++T D
Sbjct: 197  VFTIWGFLQLLRKYPGKIPDLELMFDCVDWPVVRAAEFAGVDAPSPPPLFRYCGNEETLD 256

Query: 916  ILFPDWSFWGWPEINIKPWASLLKEMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKC 1095
            I+FPDWSFWGW E+NIKPW SLLKE++E NE++ WI+REPYA+W+GNP +   R DL+KC
Sbjct: 257  IVFPDWSFWGWAEVNIKPWESLLKELREGNEKINWINREPYAYWKGNPVVAETRQDLMKC 316

Query: 1096 NVT 1104
            NV+
Sbjct: 317  NVS 319


>ref|XP_006290867.1| hypothetical protein CARUB_v10016976mg [Capsella rubella]
            gi|482559574|gb|EOA23765.1| hypothetical protein
            CARUB_v10016976mg [Capsella rubella]
          Length = 539

 Score =  261 bits (666), Expect = 5e-67
 Identities = 115/218 (52%), Positives = 157/218 (72%), Gaps = 10/218 (4%)
 Frame = +1

Query: 481  TLSCESPTAT---VCERRSTPMPTPHLTTSYNS----PSCPEYFRWIHDDLRHWKSTGIT 639
            TL+C + +      C R S P       TS+ S     +CP+YFRWIH+DLR W+ TGIT
Sbjct: 104  TLNCAAFSGNDTVTCPRNSYP-------TSFRSNAEPATCPDYFRWIHEDLRPWEKTGIT 156

Query: 640  REMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLDLMF 819
            RE +E A+  A FRL ++DGR+YV+ +   F TR+VF++WG +QL+ RYPG++PDL+LMF
Sbjct: 157  REALERANATAIFRLAIIDGRIYVENFREAFQTRDVFTIWGFVQLLRRYPGKIPDLELMF 216

Query: 820  NCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWASLLKE 990
            +C+D P VK+ EY   +  +PPP+F YC ND+T DI+FPDWS+WGW E+NIKPW SLLK+
Sbjct: 217  DCVDWPVVKAEEYSGVDKPSPPPLFRYCANDETLDIVFPDWSYWGWAEVNIKPWESLLKD 276

Query: 991  MKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
            + E N+  KWIDREPYA+W+GNP +   R DL+KCN++
Sbjct: 277  LSEGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCNLS 314


>gb|AED99886.1| glycosyltransferase [Panax notoginseng]
          Length = 546

 Score =  260 bits (665), Expect = 6e-67
 Identities = 112/180 (62%), Positives = 142/180 (78%), Gaps = 3/180 (1%)
 Frame = +1

Query: 574  SCPEYFRWIHDDLRHWKSTGITREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNVFS 753
            SCPEYFRWI++DLR W+ TGITREM+E A   A FRLV+L+GR YV+ +   F +R+VF+
Sbjct: 145  SCPEYFRWIYEDLRPWRETGITREMVERARRTANFRLVILNGRAYVETHQKSFQSRDVFT 204

Query: 754  LWGILQLVNRYPGRVPDLDLMFNCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWDILF 924
            LWGILQL+  YPG+VPDLDLMF+C+D P + S  Y   N++ PPP+F YC +D T DI+F
Sbjct: 205  LWGILQLLRMYPGKVPDLDLMFDCVDWPVIISRFYHGPNATAPPPLFRYCADDSTLDIVF 264

Query: 925  PDWSFWGWPEINIKPWASLLKEMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
            PDW+FWGWPEINIKPW SLLK++KE N   +W+DREPYA+W+GNP +   R DLLKCNV+
Sbjct: 265  PDWTFWGWPEINIKPWGSLLKDLKEGNTGTQWMDREPYAYWKGNPIVAKTRMDLLKCNVS 324


>gb|EXB29382.1| hypothetical protein L484_001025 [Morus notabilis]
          Length = 515

 Score =  259 bits (662), Expect = 1e-66
 Identities = 119/223 (53%), Positives = 159/223 (71%), Gaps = 12/223 (5%)
 Frame = +1

Query: 472  IPITLSCESPTATVCERRSTPMPTPHLTTSYNS---------PSCPEYFRWIHDDLRHWK 624
            IP+  S  SPT      R+ P   P   T+YN          P+CP+YFRWI++DLR W 
Sbjct: 78   IPLNCSAYSPT------RTCPANYP---TTYNKQDDLDRPLLPTCPDYFRWIYEDLRPWA 128

Query: 625  STGITREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPD 804
             TGI+R+M+E A   A FRLV+++G+ YV+ +   F TR+VF+LWGILQL+ +YPGRVPD
Sbjct: 129  YTGISRDMVERAKRTANFRLVIVNGKAYVETFQKAFQTRDVFTLWGILQLLRKYPGRVPD 188

Query: 805  LDLMFNCMDQPAVKSFEYN---SSTPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWA 975
            L+LMF+C+D P V S  Y+   ++TPPP+F YC +D T DI+FPDWSFWGWPE NIKPW 
Sbjct: 189  LELMFDCVDWPVVLSKAYSGPDATTPPPLFRYCGDDSTLDIVFPDWSFWGWPETNIKPWE 248

Query: 976  SLLKEMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
            +LLKE++E N++ KW++RE YA+W+GNP +   R DLLKCNV+
Sbjct: 249  ALLKELEEGNKKSKWVEREAYAYWKGNPVVAATRQDLLKCNVS 291


>ref|XP_006404195.1| hypothetical protein EUTSA_v10010269mg [Eutrema salsugineum]
            gi|557105314|gb|ESQ45648.1| hypothetical protein
            EUTSA_v10010269mg [Eutrema salsugineum]
          Length = 543

 Score =  259 bits (662), Expect = 1e-66
 Identities = 118/241 (48%), Positives = 163/241 (67%), Gaps = 12/241 (4%)
 Frame = +1

Query: 418  AKRPDRQTSNLQHSKTQPIPITLSCES----PTATVCERRSTPMPT---PHLTTSYNSP- 573
            A  P    S    ++ +P   TL+C +     T   C R   P              SP 
Sbjct: 78   AVSPKHPQSTKLITEEKPKEFTLNCAAFSGNETVITCPRNRYPTSLRSGAREDDPERSPP 137

Query: 574  -SCPEYFRWIHDDLRHWKSTGITREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNVF 750
             +CP+YFRWIH+DLR W+ TGITRE +E A+  A FRL +++GR+YV+++   F TR+VF
Sbjct: 138  ATCPDYFRWIHEDLRPWEKTGITREALERANATANFRLAIINGRIYVEKFREAFQTRDVF 197

Query: 751  SLWGILQLVNRYPGRVPDLDLMFNCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWDIL 921
            ++WG +QL+ RYPG++PDL+LMF+C+D P VK+ E+   +  TPPP+F YC N++T DI+
Sbjct: 198  TIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQLTPPPLFRYCGNNETLDIV 257

Query: 922  FPDWSFWGWPEINIKPWASLLKEMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCNV 1101
            FPDWS+WGW E+NIKPW SLLKE++E N+  KWIDREPYA+W+GNP +   R DL+KCNV
Sbjct: 258  FPDWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRQDLMKCNV 317

Query: 1102 T 1104
            +
Sbjct: 318  S 318


>ref|XP_004234394.1| PREDICTED: O-glucosyltransferase rumi homolog [Solanum lycopersicum]
          Length = 514

 Score =  259 bits (662), Expect = 1e-66
 Identities = 109/187 (58%), Positives = 145/187 (77%), Gaps = 3/187 (1%)
 Frame = +1

Query: 553  TTSYNSPSCPEYFRWIHDDLRHWKSTGITREMIENAHEYATFRLVVLDGRVYVQEYYGKF 732
            T+S   P+CP+YFRWI+DDL HW+ TGIT+EM+  A   A FRLV+++GR YV+ Y+  F
Sbjct: 101  TSSSPPPTCPDYFRWIYDDLWHWRETGITKEMVMRAKRTADFRLVIVNGRAYVETYHKAF 160

Query: 733  MTRNVFSLWGILQLVNRYPGRVPDLDLMFNCMDQPAVKSFEY---NSSTPPPVFHYCNND 903
             +R+ F+LWGILQ++ RYPG+VPDLDLMF+C+D P +K+  Y    +  PPP+F YC ND
Sbjct: 161  QSRDTFTLWGILQMLRRYPGKVPDLDLMFDCVDWPVLKTEFYRHPKAPVPPPLFRYCGND 220

Query: 904  KTWDILFPDWSFWGWPEINIKPWASLLKEMKEANEEMKWIDREPYAFWRGNPHMGGNRFD 1083
             + DI+FPDWSFWGWPEINIKPW +L K++K+ NE+MKW +REPYA+W+GNP +   R D
Sbjct: 221  SSLDIVFPDWSFWGWPEINIKPWETLSKDLKKGNEKMKWTEREPYAYWKGNPVVAETRRD 280

Query: 1084 LLKCNVT 1104
            LLKCN +
Sbjct: 281  LLKCNAS 287


>ref|XP_002510787.1| KDEL motif-containing protein 1 precursor, putative [Ricinus
            communis] gi|223549902|gb|EEF51389.1| KDEL
            motif-containing protein 1 precursor, putative [Ricinus
            communis]
          Length = 506

 Score =  259 bits (662), Expect = 1e-66
 Identities = 113/212 (53%), Positives = 151/212 (71%), Gaps = 3/212 (1%)
 Frame = +1

Query: 478  ITLSCESPTATVCERRSTPMPTPHLTTSYNSPSCPEYFRWIHDDLRHWKSTGITREMIEN 657
            I L+C +   T       P  +       + P+CPEYFRWIH+DLR W  TGITRE +E 
Sbjct: 71   IPLNCHALNLTRTCPTDYPSTSSQDPNRSSPPTCPEYFRWIHEDLRPWVRTGITRETMER 130

Query: 658  AHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLDLMFNCMDQP 837
            A   A FRLV+L+G  Y++ Y   F TR+VF+LWGILQL+ +YPGRVPDL++MF+C+D P
Sbjct: 131  AKATANFRLVILNGTAYLEMYEKSFQTRDVFTLWGILQLLRKYPGRVPDLEMMFDCVDWP 190

Query: 838  AVKSFEYNSS---TPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWASLLKEMKEANE 1008
             VKS +Y+ S   +PPP+F YC ND+T DI+FPDWS+WGW E NIKPW  ++K++KE N+
Sbjct: 191  VVKSVDYSGSSAISPPPLFRYCGNDETLDIVFPDWSYWGWVETNIKPWEKIVKDLKEGNQ 250

Query: 1009 EMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
              KW +REPYA+W+GNP++   R DL+KCNV+
Sbjct: 251  RSKWKEREPYAYWKGNPNVAETRLDLMKCNVS 282


>gb|EOY23195.1| Glycosyltransferase isoform 3 [Theobroma cacao]
          Length = 485

 Score =  259 bits (661), Expect = 2e-66
 Identities = 115/212 (54%), Positives = 152/212 (71%), Gaps = 3/212 (1%)
 Frame = +1

Query: 478  ITLSCESPTATVCERRSTPMPTPHLTTSYNSPSCPEYFRWIHDDLRHWKSTGITREMIEN 657
            I L+C +   T     + P        S  +  CP+YFRWIH+DLR W  TGI+ +M++ 
Sbjct: 90   IPLNCTARNLTRACPTNDPTAIEEEPDSSLNAMCPDYFRWIHEDLRPWAYTGISMDMLKR 149

Query: 658  AHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLDLMFNCMDQP 837
            A + A FRLVV++GR YVQ Y   F TR+VF+LWGILQL+ RYPG+VPDLDLMF+C+D P
Sbjct: 150  AEKTANFRLVVVNGRAYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVDWP 209

Query: 838  AVKSFEY---NSSTPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWASLLKEMKEANE 1008
             +K+ +Y   N++TPPP+F YC +D+T DI+FPDWSFWGWPEINIKPW  LL ++ E N+
Sbjct: 210  VIKTSDYGGPNATTPPPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEGNK 269

Query: 1009 EMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
             M W  REP+A+W+GNP++   R DLLKCNV+
Sbjct: 270  RMGWEGREPHAYWKGNPNVATTRQDLLKCNVS 301


>gb|EOY23194.1| Glycosyltransferase isoform 2 [Theobroma cacao]
          Length = 498

 Score =  259 bits (661), Expect = 2e-66
 Identities = 115/212 (54%), Positives = 152/212 (71%), Gaps = 3/212 (1%)
 Frame = +1

Query: 478  ITLSCESPTATVCERRSTPMPTPHLTTSYNSPSCPEYFRWIHDDLRHWKSTGITREMIEN 657
            I L+C +   T     + P        S  +  CP+YFRWIH+DLR W  TGI+ +M++ 
Sbjct: 90   IPLNCTARNLTRACPTNDPTAIEEEPDSSLNAMCPDYFRWIHEDLRPWAYTGISMDMLKR 149

Query: 658  AHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLDLMFNCMDQP 837
            A + A FRLVV++GR YVQ Y   F TR+VF+LWGILQL+ RYPG+VPDLDLMF+C+D P
Sbjct: 150  AEKTANFRLVVVNGRAYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVDWP 209

Query: 838  AVKSFEY---NSSTPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWASLLKEMKEANE 1008
             +K+ +Y   N++TPPP+F YC +D+T DI+FPDWSFWGWPEINIKPW  LL ++ E N+
Sbjct: 210  VIKTSDYGGPNATTPPPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEGNK 269

Query: 1009 EMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
             M W  REP+A+W+GNP++   R DLLKCNV+
Sbjct: 270  RMGWEGREPHAYWKGNPNVATTRQDLLKCNVS 301


>gb|EOY23193.1| Glycosyltransferase isoform 1 [Theobroma cacao]
          Length = 522

 Score =  259 bits (661), Expect = 2e-66
 Identities = 115/212 (54%), Positives = 152/212 (71%), Gaps = 3/212 (1%)
 Frame = +1

Query: 478  ITLSCESPTATVCERRSTPMPTPHLTTSYNSPSCPEYFRWIHDDLRHWKSTGITREMIEN 657
            I L+C +   T     + P        S  +  CP+YFRWIH+DLR W  TGI+ +M++ 
Sbjct: 90   IPLNCTARNLTRACPTNDPTAIEEEPDSSLNAMCPDYFRWIHEDLRPWAYTGISMDMLKR 149

Query: 658  AHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLDLMFNCMDQP 837
            A + A FRLVV++GR YVQ Y   F TR+VF+LWGILQL+ RYPG+VPDLDLMF+C+D P
Sbjct: 150  AEKTANFRLVVVNGRAYVQRYRRSFQTRDVFTLWGILQLLRRYPGKVPDLDLMFDCVDWP 209

Query: 838  AVKSFEY---NSSTPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWASLLKEMKEANE 1008
             +K+ +Y   N++TPPP+F YC +D+T DI+FPDWSFWGWPEINIKPW  LL ++ E N+
Sbjct: 210  VIKTSDYGGPNATTPPPLFRYCKDDETLDIVFPDWSFWGWPEINIKPWVPLLNDLMEGNK 269

Query: 1009 EMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
             M W  REP+A+W+GNP++   R DLLKCNV+
Sbjct: 270  RMGWEGREPHAYWKGNPNVATTRQDLLKCNVS 301


>gb|EOY24688.1| Glycosyltransferase isoform 1 [Theobroma cacao]
          Length = 516

 Score =  258 bits (660), Expect = 2e-66
 Identities = 117/219 (53%), Positives = 157/219 (71%), Gaps = 6/219 (2%)
 Frame = +1

Query: 466  QPIPITLSCESP---TATVCERRSTPMPTPHLTTSYNSPSCPEYFRWIHDDLRHWKSTGI 636
            Q I I L C S    T T          T  L  S N   CP+YFRWIH+DLR WK++GI
Sbjct: 74   QKIEIPLGCTSSKNQTQTCPTNYPKTFQTEDLDPSSNHV-CPDYFRWIHEDLRPWKTSGI 132

Query: 637  TREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLDLM 816
            TR+M+E A+  ATFRLV++ G+ YV+ Y     TR+VF++WG+LQL+ +YPGR+PDL++M
Sbjct: 133  TRDMVERANRTATFRLVIIGGKAYVENYRKAIQTRDVFTIWGVLQLLRKYPGRLPDLEIM 192

Query: 817  FNCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWASLLK 987
            F+  D+P V+S +Y   N++ PPP+F YC + +T DI+FPDWSFWGW EINIKPW S+LK
Sbjct: 193  FDTEDKPVVRSRDYRGPNATGPPPLFRYCGDKETLDIVFPDWSFWGWAEINIKPWHSILK 252

Query: 988  EMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
            ++++ N + KWIDREPYA+W+GNP + G R DLLKCNV+
Sbjct: 253  DVRQGNNQTKWIDREPYAYWKGNPFVDGKRQDLLKCNVS 291


>ref|XP_002875936.1| hypothetical protein ARALYDRAFT_485256 [Arabidopsis lyrata subsp.
            lyrata] gi|297321774|gb|EFH52195.1| hypothetical protein
            ARALYDRAFT_485256 [Arabidopsis lyrata subsp. lyrata]
          Length = 539

 Score =  258 bits (660), Expect = 2e-66
 Identities = 105/182 (57%), Positives = 145/182 (79%), Gaps = 3/182 (1%)
 Frame = +1

Query: 568  SPSCPEYFRWIHDDLRHWKSTGITREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNV 747
            S +CP+YFRWIH+DLR W+ TGITRE +E A+  A FRL +++GR+YV+++   F TR+V
Sbjct: 133  SATCPDYFRWIHEDLRPWEKTGITREALERANATANFRLAIINGRIYVEKFREAFQTRDV 192

Query: 748  FSLWGILQLVNRYPGRVPDLDLMFNCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWDI 918
            F++WG +QL+ RYPG++PDL+LMF+C+D P VK+ E+   +   PPP+F YC ND+T DI
Sbjct: 193  FTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDI 252

Query: 919  LFPDWSFWGWPEINIKPWASLLKEMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCN 1098
            +FPDWS+WGW E+NIKPW SLLKE++E N+  KWIDREPYA+W+GNP +   R DL+KCN
Sbjct: 253  VFPDWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCN 312

Query: 1099 VT 1104
            ++
Sbjct: 313  LS 314


>ref|XP_006599594.1| PREDICTED: KDEL motif-containing protein 2-like [Glycine max]
          Length = 534

 Score =  258 bits (659), Expect = 3e-66
 Identities = 115/217 (52%), Positives = 153/217 (70%), Gaps = 4/217 (1%)
 Frame = +1

Query: 466  QPIPITLSCESPTAT-VCERRSTPMPTPHLTTSYNSPSCPEYFRWIHDDLRHWKSTGITR 642
            +PI I L+C +   T  C     P+P    +   +S +CPEYFRWIH+DLR W  TGIT+
Sbjct: 95   KPIEIPLNCTAYNLTRTCSTNQFPIPENDQSHP-SSATCPEYFRWIHEDLRPWARTGITQ 153

Query: 643  EMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNVFSLWGILQLVNRYPGRVPDLDLMFN 822
            +M+E A E A F+LV+L G+ Y++ Y   + TR+VFS+WGILQL+ RYPG++PDL+LMF+
Sbjct: 154  DMVERAKETANFKLVILKGKAYLETYEKAYQTRDVFSIWGILQLLRRYPGKIPDLELMFD 213

Query: 823  CMDQPAVKSFEYNS---STPPPVFHYCNNDKTWDILFPDWSFWGWPEINIKPWASLLKEM 993
            C+D P V S  YN      PPP+F YC ND T DI+FPDWSFWGW E+NIKPW  LL E+
Sbjct: 214  CVDWPVVLSDRYNGPNVEQPPPLFRYCGNDATLDIVFPDWSFWGWAEVNIKPWEILLTEL 273

Query: 994  KEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCNVT 1104
            KE  + + W++REPYA+W+GNP +   R DL+KCNV+
Sbjct: 274  KEGTKRIPWLNREPYAYWKGNPVVAETRQDLMKCNVS 310


>dbj|BAE99650.1| hypothetical protein [Arabidopsis thaliana]
          Length = 433

 Score =  258 bits (659), Expect = 3e-66
 Identities = 105/182 (57%), Positives = 145/182 (79%), Gaps = 3/182 (1%)
 Frame = +1

Query: 568  SPSCPEYFRWIHDDLRHWKSTGITREMIENAHEYATFRLVVLDGRVYVQEYYGKFMTRNV 747
            S +CP+YFRWIH+DLR W+ TGITRE +E A+  A FRL +++GR+YV+++   F TR+V
Sbjct: 27   SATCPDYFRWIHEDLRPWEKTGITREALERANATAIFRLAIINGRIYVEKFREAFQTRDV 86

Query: 748  FSLWGILQLVNRYPGRVPDLDLMFNCMDQPAVKSFEY---NSSTPPPVFHYCNNDKTWDI 918
            F++WG +QL+ RYPG++PDL+LMF+C+D P VK+ E+   +   PPP+F YC ND+T DI
Sbjct: 87   FTIWGFVQLLRRYPGKIPDLELMFDCVDWPVVKAAEFAGVDQPPPPPLFRYCANDETLDI 146

Query: 919  LFPDWSFWGWPEINIKPWASLLKEMKEANEEMKWIDREPYAFWRGNPHMGGNRFDLLKCN 1098
            +FPDWS+WGW E+NIKPW SLLKE++E N+  KWIDREPYA+W+GNP +   R DL+KCN
Sbjct: 147  VFPDWSYWGWAEVNIKPWESLLKELREGNQRTKWIDREPYAYWKGNPTVAETRLDLMKCN 206

Query: 1099 VT 1104
            ++
Sbjct: 207  LS 208

