BLASTX nr result

ID: Mentha24_contig00024514

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00024514
         (744 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU44778.1| hypothetical protein MIMGU_mgv1a008498mg [Mimulus...   412   e-113
ref|XP_004232318.1| PREDICTED: uncharacterized protein LOC101257...   380   e-103
ref|XP_006338588.1| PREDICTED: uncharacterized protein LOC102600...   376   e-102
ref|XP_002275670.1| PREDICTED: uncharacterized protein LOC100259...   363   4e-98
ref|XP_007031552.1| Core-2/I-branching beta-1,6-N-acetylglucosam...   351   1e-94
ref|XP_007031549.1| Core-2/I-branching beta-1,6-N-acetylglucosam...   351   1e-94
ref|XP_007215567.1| hypothetical protein PRUPE_ppa007456mg [Prun...   351   2e-94
ref|XP_007031550.1| Core-2/I-branching beta-1,6-N-acetylglucosam...   347   3e-93
ref|XP_006289626.1| hypothetical protein CARUB_v10003187mg [Caps...   341   2e-91
ref|XP_002873676.1| hypothetical protein ARALYDRAFT_909422 [Arab...   340   4e-91
ref|XP_004142791.1| PREDICTED: uncharacterized protein LOC101222...   339   5e-91
ref|XP_006470273.1| PREDICTED: uncharacterized protein LOC102626...   338   1e-90
ref|XP_006446563.1| hypothetical protein CICLE_v10015712mg [Citr...   338   1e-90
ref|XP_006399968.1| hypothetical protein EUTSA_v10015269mg [Eutr...   333   5e-89
ref|XP_007031553.1| Core-2/I-branching beta-1,6-N-acetylglucosam...   332   6e-89
ref|NP_196959.2| core-2/I-branching beta-1,6-N-acetylglucosaminy...   332   6e-89
dbj|BAD43086.1| putative protein [Arabidopsis thaliana]               332   6e-89
ref|XP_004304653.1| PREDICTED: uncharacterized protein LOC101304...   332   8e-89
ref|XP_007150933.1| hypothetical protein PHAVU_004G007000g [Phas...   329   7e-88
emb|CBI34881.3| unnamed protein product [Vitis vinifera]              329   7e-88

>gb|EYU44778.1| hypothetical protein MIMGU_mgv1a008498mg [Mimulus guttatus]
          Length = 371

 Score =  412 bits (1059), Expect = e-113
 Identities = 193/238 (81%), Positives = 214/238 (89%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQKPKIAFMF 209
           MKRRS+  KFHHRW+RKF A LL   C ATFLLME+E+S+IK+ + ISP L+KPKIAF+F
Sbjct: 1   MKRRSSLQKFHHRWRRKFFALLLFLFCVATFLLMEAEHSRIKLLTFISPPLRKPKIAFLF 60

Query: 210 IARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDWG 389
           IARNRIPLD+VWDVFFQ D ENRFSIYVHSRPGFLL +A TRS FFLNRQ+NDS+QV+WG
Sbjct: 61  IARNRIPLDMVWDVFFQGDAENRFSIYVHSRPGFLLNTATTRSTFFLNRQINDSIQVEWG 120

Query: 390 EATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNKE 569
           EA+MIQAERILL+HAL DP NERFLFLSDSCIPLYNFSYTYDYIMST  SFVDSFAD+KE
Sbjct: 121 EASMIQAERILLQHALMDPFNERFLFLSDSCIPLYNFSYTYDYIMSTSTSFVDSFADSKE 180

Query: 570 SRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743
           SRYNP+MHPVI VD+WRKGSQWV+LTRKHA IVVEDET+F  FQLHCKRKSLPEFWRD
Sbjct: 181 SRYNPRMHPVIHVDNWRKGSQWVILTRKHAAIVVEDETIFPTFQLHCKRKSLPEFWRD 238


>ref|XP_004232318.1| PREDICTED: uncharacterized protein LOC101257325 [Solanum
           lycopersicum]
          Length = 374

 Score =  380 bits (975), Expect = e-103
 Identities = 179/240 (74%), Positives = 203/240 (84%), Gaps = 1/240 (0%)
 Frame = +3

Query: 27  RMKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLIS-PRLQKPKIAF 203
           ++ RRSNS K  HRWK K  A +L    F T +LME++Y+KI+M +L+S P+LQ PKIAF
Sbjct: 4   KLIRRSNSQKSQHRWKIKVFAMMLFVFFFGTLVLMETQYNKIRMLALLSAPQLQNPKIAF 63

Query: 204 MFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVD 383
           +FIARNR+PLDIVWD FFQ D ENRFSI VHSRPGFLL    TRS +FLNRQ+NDS+QVD
Sbjct: 64  LFIARNRLPLDIVWDAFFQGDKENRFSILVHSRPGFLLNKVTTRSAYFLNRQMNDSIQVD 123

Query: 384 WGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADN 563
           WGEA+MIQAERILL+HAL DP+NERF+FLSDSCIPLYNFSYTYDYIMSTPNSFVDSFAD 
Sbjct: 124 WGEASMIQAERILLQHALMDPLNERFVFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADT 183

Query: 564 KESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743
           KE RYNPKMHP+I V SWRKGSQW VL RKHA IVV+DE +F MFQLHCK+K LPEFWRD
Sbjct: 184 KEGRYNPKMHPIIPVQSWRKGSQWAVLNRKHADIVVKDEILFPMFQLHCKKKPLPEFWRD 243


>ref|XP_006338588.1| PREDICTED: uncharacterized protein LOC102600190 [Solanum tuberosum]
          Length = 374

 Score =  376 bits (965), Expect = e-102
 Identities = 177/240 (73%), Positives = 203/240 (84%), Gaps = 1/240 (0%)
 Frame = +3

Query: 27  RMKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLIS-PRLQKPKIAF 203
           ++ RRSNS K  HRWK K  A +L    F T +LME++Y++I+M +L+S P++Q PKIAF
Sbjct: 4   KLIRRSNSQKSQHRWKIKVFAMMLFIFFFGTLVLMETQYNRIRMLALLSAPQVQNPKIAF 63

Query: 204 MFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVD 383
           +FIARNR+PLDIVWD FFQ D EN+FSI VHSRPGFLL    TRS +FLNRQ+NDS+QVD
Sbjct: 64  LFIARNRLPLDIVWDAFFQGDKENKFSILVHSRPGFLLNKVTTRSAYFLNRQMNDSIQVD 123

Query: 384 WGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADN 563
           WGEATMIQAERILL+HAL DP+NERF+FLSDSCIPLYNFSYTYDYIMSTPNSFVDSFAD 
Sbjct: 124 WGEATMIQAERILLQHALMDPLNERFVFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADT 183

Query: 564 KESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743
           KE RYNPKM P+ISV SWRKGSQW VL RKHA IVV+DE +F MFQLHCK+K LPEFWRD
Sbjct: 184 KEGRYNPKMDPIISVQSWRKGSQWAVLNRKHADIVVKDEILFPMFQLHCKKKPLPEFWRD 243


>ref|XP_002275670.1| PREDICTED: uncharacterized protein LOC100259507 [Vitis vinifera]
          Length = 367

 Score =  363 bits (931), Expect = 4e-98
 Identities = 172/239 (71%), Positives = 202/239 (84%), Gaps = 1/239 (0%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKM-RSLISPRLQKPKIAFM 206
           MKR+  S    ++WKR   A LL+  CF + +L++++YS+I+M  S+ SP LQ+PKIAF+
Sbjct: 1   MKRKQKS---QYKWKRNLFAMLLLGFCFGSLVLLQTQYSRIRMFASMPSPFLQRPKIAFL 57

Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386
           FIARNR+PLD+VWD FF+ + EN+FSI+VHSRPGFLL  A TRSV+FLNRQ+NDS+QVDW
Sbjct: 58  FIARNRLPLDVVWDAFFRDEKENKFSIFVHSRPGFLLNKATTRSVYFLNRQLNDSIQVDW 117

Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNK 566
           GEA+MIQAERILL+ AL DP+NERF+FLSDSCIPLYNFSY YDYIMST  SFVDSFAD K
Sbjct: 118 GEASMIQAERILLRSALLDPLNERFVFLSDSCIPLYNFSYIYDYIMSTSTSFVDSFADTK 177

Query: 567 ESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743
           E RYNPKM PVI V +WRKGSQWVVLTRKHA IVVED+TVF MFQ HCKRKSLPEFWRD
Sbjct: 178 EGRYNPKMDPVIPVHNWRKGSQWVVLTRKHAQIVVEDDTVFPMFQQHCKRKSLPEFWRD 236


>ref|XP_007031552.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family
           protein isoform 4, partial [Theobroma cacao]
           gi|508710581|gb|EOY02478.1| Core-2/I-branching
           beta-1,6-N-acetylglucosaminyltransferase family protein
           isoform 4, partial [Theobroma cacao]
          Length = 289

 Score =  351 bits (901), Expect = 1e-94
 Identities = 169/239 (70%), Positives = 197/239 (82%), Gaps = 1/239 (0%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQ-KPKIAFM 206
           MKR++   K   RWKRK LA LL+A C A+  LME++YS+I   + +  R   KPKIAF+
Sbjct: 4   MKRKAVQQKSQIRWKRKVLAALLVAFCLASLALMETQYSRIVSLASLRHRFAVKPKIAFL 63

Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386
           FIARNR+PLD+VWD FF+ + +NRFSIYVHSRPGFL   A TRS +FLNRQVNDS+QVDW
Sbjct: 64  FIARNRLPLDMVWDAFFKGE-DNRFSIYVHSRPGFLFNKATTRSSYFLNRQVNDSIQVDW 122

Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNK 566
           GEA+MI+AERILL+HALTDP NERF+F+SDSCIPLYNFSY YDYIMST  SFVDSFAD K
Sbjct: 123 GEASMIEAERILLRHALTDPFNERFVFVSDSCIPLYNFSYMYDYIMSTSTSFVDSFADTK 182

Query: 567 ESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743
           E RYNPKM+P+I V +WRKGSQWVVLTRKHA +V+ D TVF MFQ HCKR+SLPEFWRD
Sbjct: 183 EGRYNPKMNPIIPVYNWRKGSQWVVLTRKHAEVVINDTTVFPMFQEHCKRRSLPEFWRD 241


>ref|XP_007031549.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family
           protein isoform 1 [Theobroma cacao]
           gi|508710578|gb|EOY02475.1| Core-2/I-branching
           beta-1,6-N-acetylglucosaminyltransferase family protein
           isoform 1 [Theobroma cacao]
          Length = 372

 Score =  351 bits (901), Expect = 1e-94
 Identities = 169/239 (70%), Positives = 197/239 (82%), Gaps = 1/239 (0%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQ-KPKIAFM 206
           MKR++   K   RWKRK LA LL+A C A+  LME++YS+I   + +  R   KPKIAF+
Sbjct: 4   MKRKAVQQKSQIRWKRKVLAALLVAFCLASLALMETQYSRIVSLASLRHRFAVKPKIAFL 63

Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386
           FIARNR+PLD+VWD FF+ + +NRFSIYVHSRPGFL   A TRS +FLNRQVNDS+QVDW
Sbjct: 64  FIARNRLPLDMVWDAFFKGE-DNRFSIYVHSRPGFLFNKATTRSSYFLNRQVNDSIQVDW 122

Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNK 566
           GEA+MI+AERILL+HALTDP NERF+F+SDSCIPLYNFSY YDYIMST  SFVDSFAD K
Sbjct: 123 GEASMIEAERILLRHALTDPFNERFVFVSDSCIPLYNFSYMYDYIMSTSTSFVDSFADTK 182

Query: 567 ESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743
           E RYNPKM+P+I V +WRKGSQWVVLTRKHA +V+ D TVF MFQ HCKR+SLPEFWRD
Sbjct: 183 EGRYNPKMNPIIPVYNWRKGSQWVVLTRKHAEVVINDTTVFPMFQEHCKRRSLPEFWRD 241


>ref|XP_007215567.1| hypothetical protein PRUPE_ppa007456mg [Prunus persica]
           gi|462411717|gb|EMJ16766.1| hypothetical protein
           PRUPE_ppa007456mg [Prunus persica]
          Length = 367

 Score =  351 bits (900), Expect = 2e-94
 Identities = 171/240 (71%), Positives = 196/240 (81%), Gaps = 2/240 (0%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATF-LLMESEYSKIK-MRSLISPRLQKPKIAF 203
           MKR++   K  H+WKRK    LL+  CF T  LLM ++YS+I  + SL +   Q PK+AF
Sbjct: 1   MKRKAGPQKAQHKWKRKLFVALLMGFCFGTLVLLMHTQYSRIMTLASLHTQLTQPPKVAF 60

Query: 204 MFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVD 383
           +FIARNR+PLD++WDVFFQ   E+RFSIYVHSRPGFL   A TRSVFFLNRQVNDS+QVD
Sbjct: 61  LFIARNRLPLDLLWDVFFQGG-ESRFSIYVHSRPGFLFNKATTRSVFFLNRQVNDSIQVD 119

Query: 384 WGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADN 563
           WGEA+MI+AERILLKHAL DP+N+RF FLSDSCIPLY+FSY YDYIMST  SFVDSFAD 
Sbjct: 120 WGEASMIEAERILLKHALEDPLNQRFAFLSDSCIPLYSFSYIYDYIMSTRTSFVDSFADT 179

Query: 564 KESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743
           KE RYNPKM+PVI V +WRKGSQWVVLTRKHA +VV+D TVF MFQ HCK KSLPEFWRD
Sbjct: 180 KEGRYNPKMNPVIPVHNWRKGSQWVVLTRKHAEVVVKDNTVFPMFQQHCKTKSLPEFWRD 239


>ref|XP_007031550.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family
           protein isoform 2 [Theobroma cacao]
           gi|508710579|gb|EOY02476.1| Core-2/I-branching
           beta-1,6-N-acetylglucosaminyltransferase family protein
           isoform 2 [Theobroma cacao]
          Length = 381

 Score =  347 bits (889), Expect = 3e-93
 Identities = 169/240 (70%), Positives = 197/240 (82%), Gaps = 2/240 (0%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQ-KPKIAFM 206
           MKR++   K   RWKRK LA LL+A C A+  LME++YS+I   + +  R   KPKIAF+
Sbjct: 4   MKRKAVQQKSQIRWKRKVLAALLVAFCLASLALMETQYSRIVSLASLRHRFAVKPKIAFL 63

Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386
           FIARNR+PLD+VWD FF+ + +NRFSIYVHSRPGFL   A TRS +FLNRQVNDS+QVDW
Sbjct: 64  FIARNRLPLDMVWDAFFKGE-DNRFSIYVHSRPGFLFNKATTRSSYFLNRQVNDSIQVDW 122

Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVD-SFADN 563
           GEA+MI+AERILL+HALTDP NERF+F+SDSCIPLYNFSY YDYIMST  SFVD SFAD 
Sbjct: 123 GEASMIEAERILLRHALTDPFNERFVFVSDSCIPLYNFSYMYDYIMSTSTSFVDSSFADT 182

Query: 564 KESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743
           KE RYNPKM+P+I V +WRKGSQWVVLTRKHA +V+ D TVF MFQ HCKR+SLPEFWRD
Sbjct: 183 KEGRYNPKMNPIIPVYNWRKGSQWVVLTRKHAEVVINDTTVFPMFQEHCKRRSLPEFWRD 242


>ref|XP_006289626.1| hypothetical protein CARUB_v10003187mg [Capsella rubella]
           gi|482558332|gb|EOA22524.1| hypothetical protein
           CARUB_v10003187mg [Capsella rubella]
          Length = 371

 Score =  341 bits (874), Expect = 2e-91
 Identities = 161/241 (66%), Positives = 199/241 (82%), Gaps = 3/241 (1%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKI--KMRSLISPRL-QKPKIA 200
           MK++++  K  +RWKRK  A L+ A C  TF+ +++ +S I   + SL  PRL QKP+IA
Sbjct: 1   MKKKASHQKLLYRWKRKVYATLMFAFCLGTFVFIQARFSGITASLHSLKKPRLHQKPQIA 60

Query: 201 FMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQV 380
           F+FIARNR+PL+ VWD FFQ + + +FSIYVHSRPGF+L+ A TRS FFL+RQVNDS+QV
Sbjct: 61  FLFIARNRLPLEFVWDAFFQGE-DGKFSIYVHSRPGFILSEATTRSKFFLDRQVNDSIQV 119

Query: 381 DWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFAD 560
           DWGE+TMI+AER+LL+HAL DP N RF+FLSDSCIPLY+FSYTY+YIMSTP SFVDSFAD
Sbjct: 120 DWGESTMIEAERVLLRHALRDPFNHRFVFLSDSCIPLYSFSYTYNYIMSTPTSFVDSFAD 179

Query: 561 NKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWR 740
            K+SRYNP+M+P+I V +WRKGSQWVVL RKH+ IVV D +VF MFQ HC+RKSLPEFWR
Sbjct: 180 TKDSRYNPRMNPIIPVHNWRKGSQWVVLNRKHSEIVVNDTSVFPMFQQHCRRKSLPEFWR 239

Query: 741 D 743
           D
Sbjct: 240 D 240


>ref|XP_002873676.1| hypothetical protein ARALYDRAFT_909422 [Arabidopsis lyrata subsp.
           lyrata] gi|297319513|gb|EFH49935.1| hypothetical protein
           ARALYDRAFT_909422 [Arabidopsis lyrata subsp. lyrata]
          Length = 369

 Score =  340 bits (871), Expect = 4e-91
 Identities = 161/241 (66%), Positives = 199/241 (82%), Gaps = 3/241 (1%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKI--KMRSLISPRL-QKPKIA 200
           MK++ +  K  +RWKRK  A L+ A C  TF+ +++ ++ I   + SL  PRL QKP+IA
Sbjct: 1   MKKKVSQQKLLYRWKRKVYATLMFAFCLGTFVFIQARFNGITASLDSLKKPRLDQKPQIA 60

Query: 201 FMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQV 380
           F+FIARNR+PL++VWD FFQ + + +FSIYVHSRPGF+L+ A TRS FFL+RQVNDS+QV
Sbjct: 61  FLFIARNRLPLELVWDAFFQGE-DGKFSIYVHSRPGFVLSEATTRSKFFLDRQVNDSIQV 119

Query: 381 DWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFAD 560
           DWGE+TMI+AER+LL+HAL DP N RF+FLSDSCIPLY+FSYTY+YIMSTP SFVDSFAD
Sbjct: 120 DWGESTMIEAERVLLRHALRDPFNHRFVFLSDSCIPLYSFSYTYNYIMSTPTSFVDSFAD 179

Query: 561 NKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWR 740
            K+SRYNP+M+P+I V +WRKGSQWVVL RKHA IVV D +VF MFQ HC+RKSLPEFWR
Sbjct: 180 TKDSRYNPRMNPIIPVHNWRKGSQWVVLNRKHAEIVVNDTSVFPMFQQHCRRKSLPEFWR 239

Query: 741 D 743
           D
Sbjct: 240 D 240


>ref|XP_004142791.1| PREDICTED: uncharacterized protein LOC101222566 [Cucumis sativus]
           gi|449483780|ref|XP_004156689.1| PREDICTED:
           uncharacterized LOC101222566 [Cucumis sativus]
          Length = 366

 Score =  339 bits (870), Expect = 5e-91
 Identities = 162/241 (67%), Positives = 192/241 (79%), Gaps = 3/241 (1%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQK---PKIA 200
           MK++    K   +W+RK    LL   CF + ++M+S Y ++ M + +    Q    PK+A
Sbjct: 1   MKQKVAQRKALFKWRRKLAFVLLFVFCFGSLVMMQSRYGRVMMLASLHLHPQSAHGPKVA 60

Query: 201 FMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQV 380
           F+FIARNR+PLDIVWDVFFQ + EN+FSI+VHSRPGFL   A TRS +FLNRQVNDS+QV
Sbjct: 61  FLFIARNRLPLDIVWDVFFQ-EGENKFSIFVHSRPGFLFNKATTRSTYFLNRQVNDSIQV 119

Query: 381 DWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFAD 560
           DWGEA+MI+AERILL+HALTD  N+RF+FLSDSC+PLYNFSYTYDY+MST  SFVDSFAD
Sbjct: 120 DWGEASMIEAERILLRHALTDSSNQRFVFLSDSCVPLYNFSYTYDYVMSTSTSFVDSFAD 179

Query: 561 NKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWR 740
            KE RYNPKM PVI V +WRKGSQWVVLTRKHA +VV+D TVF MFQ HCKRKSLPEFWR
Sbjct: 180 TKEGRYNPKMDPVIPVQNWRKGSQWVVLTRKHAKVVVKDITVFPMFQQHCKRKSLPEFWR 239

Query: 741 D 743
           D
Sbjct: 240 D 240


>ref|XP_006470273.1| PREDICTED: uncharacterized protein LOC102626456 [Citrus sinensis]
          Length = 374

 Score =  338 bits (867), Expect = 1e-90
 Identities = 163/242 (67%), Positives = 197/242 (81%), Gaps = 4/242 (1%)
 Frame = +3

Query: 30  MKRR---SNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPR-LQKPKI 197
           MKR+       KF+++WKRK  A +L+  CF + +LM+ +Y++I     + PR +QKPKI
Sbjct: 1   MKRKVVYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRIMS---LRPRFVQKPKI 57

Query: 198 AFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQ 377
           AF+FIARNR+PL++VWD FF+ + E+RFSIYVHSRPGFL +   TRS++FL+RQVNDS+Q
Sbjct: 58  AFLFIARNRLPLEMVWDKFFKGE-ESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQ 116

Query: 378 VDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFA 557
           VDWG A+MI+AERILL+HAL DP N+RF+FLSDSCIPLYNFSYTY+YIMST  SFVDSFA
Sbjct: 117 VDWGGASMIEAERILLRHALADPFNDRFVFLSDSCIPLYNFSYTYNYIMSTSTSFVDSFA 176

Query: 558 DNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFW 737
           D KE RYNPKM PVI V +WRKGSQW VLTRKHA IVV D TVF MFQ HCKRKSLPEFW
Sbjct: 177 DTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFW 236

Query: 738 RD 743
           R+
Sbjct: 237 RE 238


>ref|XP_006446563.1| hypothetical protein CICLE_v10015712mg [Citrus clementina]
           gi|557549174|gb|ESR59803.1| hypothetical protein
           CICLE_v10015712mg [Citrus clementina]
          Length = 361

 Score =  338 bits (867), Expect = 1e-90
 Identities = 163/242 (67%), Positives = 197/242 (81%), Gaps = 4/242 (1%)
 Frame = +3

Query: 30  MKRR---SNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPR-LQKPKI 197
           MKR+       KF+++WKRK  A +L+  CF + +LM+ +Y++I     + PR +QKPKI
Sbjct: 1   MKRKVVYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRIMS---LRPRFVQKPKI 57

Query: 198 AFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQ 377
           AF+FIARNR+PL++VWD FF+ + E+RFSIYVHSRPGFL +   TRS++FL+RQVNDS+Q
Sbjct: 58  AFLFIARNRLPLEMVWDKFFKGE-ESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQ 116

Query: 378 VDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFA 557
           VDWG A+MI+AERILL+HAL DP N+RF+FLSDSCIPLYNFSYTY+YIMST  SFVDSFA
Sbjct: 117 VDWGGASMIEAERILLRHALADPFNDRFVFLSDSCIPLYNFSYTYNYIMSTSTSFVDSFA 176

Query: 558 DNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFW 737
           D KE RYNPKM PVI V +WRKGSQW VLTRKHA IVV D TVF MFQ HCKRKSLPEFW
Sbjct: 177 DTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFW 236

Query: 738 RD 743
           R+
Sbjct: 237 RE 238


>ref|XP_006399968.1| hypothetical protein EUTSA_v10015269mg [Eutrema salsugineum]
           gi|557101058|gb|ESQ41421.1| hypothetical protein
           EUTSA_v10015269mg [Eutrema salsugineum]
          Length = 365

 Score =  333 bits (853), Expect = 5e-89
 Identities = 160/239 (66%), Positives = 192/239 (80%), Gaps = 1/239 (0%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRL-QKPKIAFM 206
           MKR+++  K  +RWKRK  A L+ A C  +F  +++ YS I   + + PRL QKP+IAF+
Sbjct: 1   MKRKASQHKLLNRWKRKVFATLIFAFCLGSFAFIQARYSGIT--ASLKPRLDQKPQIAFL 58

Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386
           FIARNR+PL+ VWD FF  + + +FSI+VHSRPGF+L+ A TRS FFL+RQVNDS+QVDW
Sbjct: 59  FIARNRLPLESVWDAFFMGE-DGKFSIFVHSRPGFILSEATTRSKFFLDRQVNDSIQVDW 117

Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNK 566
           GEATMI+AERILL+HAL DP N RF+FLSDSCIPLY+FSYTY+YIMSTP SFVDSFAD K
Sbjct: 118 GEATMIEAERILLRHALRDPFNHRFVFLSDSCIPLYSFSYTYNYIMSTPTSFVDSFADTK 177

Query: 567 ESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743
           +SRYNP+M P+I V  WRKGSQWVVL RKHA IVV D  V  MFQ HC+RKSLPEFWRD
Sbjct: 178 DSRYNPRMSPIIPVHHWRKGSQWVVLNRKHAEIVVNDTIVLPMFQQHCRRKSLPEFWRD 236


>ref|XP_007031553.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family
           protein isoform 5 [Theobroma cacao]
           gi|508710582|gb|EOY02479.1| Core-2/I-branching
           beta-1,6-N-acetylglucosaminyltransferase family protein
           isoform 5 [Theobroma cacao]
          Length = 253

 Score =  332 bits (852), Expect = 6e-89
 Identities = 161/233 (69%), Positives = 189/233 (81%), Gaps = 1/233 (0%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQ-KPKIAFM 206
           MKR++   K   RWKRK LA LL+A C A+  LME++YS+I   + +  R   KPKIAF+
Sbjct: 4   MKRKAVQQKSQIRWKRKVLAALLVAFCLASLALMETQYSRIVSLASLRHRFAVKPKIAFL 63

Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386
           FIARNR+PLD+VWD FF+ + +NRFSIYVHSRPGFL   A TRS +FLNRQVNDS+QVDW
Sbjct: 64  FIARNRLPLDMVWDAFFKGE-DNRFSIYVHSRPGFLFNKATTRSSYFLNRQVNDSIQVDW 122

Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNK 566
           GEA+MI+AERILL+HALTDP NERF+F+SDSCIPLYNFSY YDYIMST  SFVDSFAD K
Sbjct: 123 GEASMIEAERILLRHALTDPFNERFVFVSDSCIPLYNFSYMYDYIMSTSTSFVDSFADTK 182

Query: 567 ESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSL 725
           E RYNPKM+P+I V +WRKGSQWVVLTRKHA +V+ D TVF MFQ HCK K +
Sbjct: 183 EGRYNPKMNPIIPVYNWRKGSQWVVLTRKHAEVVINDTTVFPMFQEHCKAKEI 235


>ref|NP_196959.2| core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family
           protein [Arabidopsis thaliana]
           gi|209863160|gb|ACI88738.1| At5g14550 [Arabidopsis
           thaliana] gi|332004663|gb|AED92046.1| core-2/I-branching
           beta-1,6-N-acetylglucosaminyltransferase family protein
           [Arabidopsis thaliana] gi|591401812|gb|AHL38633.1|
           glycosyltransferase, partial [Arabidopsis thaliana]
          Length = 377

 Score =  332 bits (852), Expect = 6e-89
 Identities = 158/248 (63%), Positives = 198/248 (79%), Gaps = 10/248 (4%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMR---------SLISPRL 182
           MK++ +  K  +RWKRK  A L+ A CF TF+ +++ ++ I+ R         SL  PRL
Sbjct: 1   MKKKVSQQKLLYRWKRKVYATLMFAFCFGTFVFIQARFASIQARFNRISASLDSLKKPRL 60

Query: 183 -QKPKIAFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQ 359
            Q+P+IAF+FIARNR+PL+ VWD FF+ + + +FSIYVHSRPGF+L  A TRS +FL+RQ
Sbjct: 61  DQRPQIAFLFIARNRLPLEFVWDAFFKGE-DGKFSIYVHSRPGFVLNEATTRSKYFLDRQ 119

Query: 360 VNDSVQVDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNS 539
           +NDS+QVDWGE+TMI+AER+LL+HAL D  N RF+FLSDSCIPLY+FSYTY+YIMSTP S
Sbjct: 120 LNDSIQVDWGESTMIEAERVLLRHALRDSFNHRFVFLSDSCIPLYSFSYTYNYIMSTPTS 179

Query: 540 FVDSFADNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRK 719
           FVDSFAD K+SRYNP+M+P+I V +WRKGSQWVVL RKHA IVV D +VF MFQ HC+RK
Sbjct: 180 FVDSFADTKDSRYNPRMNPIIPVRNWRKGSQWVVLNRKHAEIVVNDTSVFPMFQQHCRRK 239

Query: 720 SLPEFWRD 743
           SLPEFWRD
Sbjct: 240 SLPEFWRD 247


>dbj|BAD43086.1| putative protein [Arabidopsis thaliana]
          Length = 377

 Score =  332 bits (852), Expect = 6e-89
 Identities = 158/248 (63%), Positives = 198/248 (79%), Gaps = 10/248 (4%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMR---------SLISPRL 182
           MK++ +  K  +RWKRK  A L+ A CF TF+ +++ ++ I+ R         SL  PRL
Sbjct: 1   MKKKVSQQKLLYRWKRKVYATLMFAFCFGTFVFIQARFASIQARFNRISASLDSLKKPRL 60

Query: 183 -QKPKIAFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQ 359
            Q+P+IAF+FIARNR+PL+ VWD FF+ + + +FSIYVHSRPGF+L  A TRS +FL+RQ
Sbjct: 61  DQRPQIAFLFIARNRLPLEFVWDAFFKGE-DGKFSIYVHSRPGFVLNEATTRSKYFLDRQ 119

Query: 360 VNDSVQVDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNS 539
           +NDS+QVDWGE+TMI+AER+LL+HAL D  N RF+FLSDSCIPLY+FSYTY+YIMSTP S
Sbjct: 120 LNDSIQVDWGESTMIEAERVLLRHALRDSFNHRFVFLSDSCIPLYSFSYTYNYIMSTPTS 179

Query: 540 FVDSFADNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRK 719
           FVDSFAD K+SRYNP+M+P+I V +WRKGSQWVVL RKHA IVV D +VF MFQ HC+RK
Sbjct: 180 FVDSFADTKDSRYNPRMNPIIPVRNWRKGSQWVVLNRKHAEIVVNDTSVFPMFQQHCRRK 239

Query: 720 SLPEFWRD 743
           SLPEFWRD
Sbjct: 240 SLPEFWRD 247


>ref|XP_004304653.1| PREDICTED: uncharacterized protein LOC101304206 [Fragaria vesca
           subsp. vesca]
          Length = 359

 Score =  332 bits (851), Expect = 8e-89
 Identities = 164/242 (67%), Positives = 193/242 (79%), Gaps = 4/242 (1%)
 Frame = +3

Query: 30  MKRRSNSGKFHHRWKRKFLAFLLIAICFATF-LLMESEYSKIKMRSLISPRL---QKPKI 197
           MKR + S     +WKRK    LL+ +CF T  LLM + YS+I   + I P++   Q+PKI
Sbjct: 1   MKRATKS---QFKWKRK----LLMCLCFGTLVLLMHTHYSRIMTLASIRPQMNTIQRPKI 53

Query: 198 AFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQ 377
           AF+FIARNR+PLD++WDVFFQ   E++FSIYVHSRPGFL     TRS FFLNRQVNDS+Q
Sbjct: 54  AFLFIARNRLPLDMLWDVFFQGG-ESKFSIYVHSRPGFLFNKVTTRSDFFLNRQVNDSIQ 112

Query: 378 VDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFA 557
           VDWGEATMI+AERILLKHAL DP+N+RF F+SDSCIPLY+F Y YDY+MST  SFVDSFA
Sbjct: 113 VDWGEATMIEAERILLKHALEDPLNQRFAFVSDSCIPLYSFKYIYDYVMSTRTSFVDSFA 172

Query: 558 DNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFW 737
           D K+ RYNPKM P+I V +WRKGSQW VLTRKHA +VV+D TVF MFQL+CKRKSLPEFW
Sbjct: 173 DTKDGRYNPKMDPIIPVQNWRKGSQWAVLTRKHAEVVVKDNTVFPMFQLYCKRKSLPEFW 232

Query: 738 RD 743
           RD
Sbjct: 233 RD 234


>ref|XP_007150933.1| hypothetical protein PHAVU_004G007000g [Phaseolus vulgaris]
           gi|561024242|gb|ESW22927.1| hypothetical protein
           PHAVU_004G007000g [Phaseolus vulgaris]
          Length = 363

 Score =  329 bits (843), Expect = 7e-88
 Identities = 154/233 (66%), Positives = 187/233 (80%), Gaps = 6/233 (2%)
 Frame = +3

Query: 63  HRWKRKFLAFLLIAICFATFLLMESEYSKIK-----MRSLIS-PRLQKPKIAFMFIARNR 224
           HRWK+K  A + +  C  +FL M++ Y+ +       R  +S P +Q PKIAF+FIARNR
Sbjct: 8   HRWKKKLFALIFVVFCLGSFLFMQTRYNHVVGLVSLQRHFVSKPEVQSPKIAFLFIARNR 67

Query: 225 IPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDWGEATMI 404
           +PL+IVWD FF+   + +FSI+VH RPGFLL  A TRS +FLNRQVN+SVQV+WGEA+MI
Sbjct: 68  LPLEIVWDAFFRGG-DGKFSIFVHCRPGFLLNKATTRSPYFLNRQVNNSVQVEWGEASMI 126

Query: 405 QAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNKESRYNP 584
           +AER+LL+HAL+DP+N+RF+FLSDSCIPLYNF+YTYDYIMS   SFVDSFAD KE RYNP
Sbjct: 127 EAERVLLRHALSDPLNDRFVFLSDSCIPLYNFTYTYDYIMSASTSFVDSFADTKEGRYNP 186

Query: 585 KMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743
           KM PVI V +WRKGSQW VLTRKHA +VVEDETVF MFQ +CK+K LPEFWRD
Sbjct: 187 KMDPVIPVYNWRKGSQWAVLTRKHAKVVVEDETVFPMFQKYCKKKPLPEFWRD 239


>emb|CBI34881.3| unnamed protein product [Vitis vinifera]
          Length = 324

 Score =  329 bits (843), Expect = 7e-88
 Identities = 154/191 (80%), Positives = 171/191 (89%)
 Frame = +3

Query: 171 SPRLQKPKIAFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFL 350
           SP LQ+PKIAF+FIARNR+PLD+VWD FF+ + EN+FSI+VHSRPGFLL  A TRSV+FL
Sbjct: 3   SPFLQRPKIAFLFIARNRLPLDVVWDAFFRDEKENKFSIFVHSRPGFLLNKATTRSVYFL 62

Query: 351 NRQVNDSVQVDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMST 530
           NRQ+NDS+QVDWGEA+MIQAERILL+ AL DP+NERF+FLSDSCIPLYNFSY YDYIMST
Sbjct: 63  NRQLNDSIQVDWGEASMIQAERILLRSALLDPLNERFVFLSDSCIPLYNFSYIYDYIMST 122

Query: 531 PNSFVDSFADNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHC 710
             SFVDSFAD KE RYNPKM PVI V +WRKGSQWVVLTRKHA IVVED+TVF MFQ HC
Sbjct: 123 STSFVDSFADTKEGRYNPKMDPVIPVHNWRKGSQWVVLTRKHAQIVVEDDTVFPMFQQHC 182

Query: 711 KRKSLPEFWRD 743
           KRKSLPEFWRD
Sbjct: 183 KRKSLPEFWRD 193

