BLASTX nr result
ID: Mentha24_contig00024514
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00024514 (744 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|EYU44778.1| hypothetical protein MIMGU_mgv1a008498mg [Mimulus...   412   e-113
ref|XP_004232318.1| PREDICTED: uncharacterized protein LOC101257...   380   e-103
ref|XP_006338588.1| PREDICTED: uncharacterized protein LOC102600...   376   e-102
ref|XP_002275670.1| PREDICTED: uncharacterized protein LOC100259...   363   4e-98
ref|XP_007031552.1| Core-2/I-branching beta-1,6-N-acetylglucosam...   351   1e-94
ref|XP_007031549.1| Core-2/I-branching beta-1,6-N-acetylglucosam...   351   1e-94
ref|XP_007215567.1| hypothetical protein PRUPE_ppa007456mg [Prun...   351   2e-94
ref|XP_007031550.1| Core-2/I-branching beta-1,6-N-acetylglucosam...   347   3e-93
ref|XP_006289626.1| hypothetical protein CARUB_v10003187mg [Caps...   341   2e-91
ref|XP_002873676.1| hypothetical protein ARALYDRAFT_909422 [Arab...   340   4e-91
ref|XP_004142791.1| PREDICTED: uncharacterized protein LOC101222...   339   5e-91
ref|XP_006470273.1| PREDICTED: uncharacterized protein LOC102626...   338   1e-90
ref|XP_006446563.1| hypothetical protein CICLE_v10015712mg [Citr...   338   1e-90
ref|XP_006399968.1| hypothetical protein EUTSA_v10015269mg [Eutr...   333   5e-89
ref|XP_007031553.1| Core-2/I-branching beta-1,6-N-acetylglucosam...   332   6e-89
ref|NP_196959.2| core-2/I-branching beta-1,6-N-acetylglucosaminy...   332   6e-89
dbj|BAD43086.1| putative protein [Arabidopsis thaliana]               332   6e-89
ref|XP_004304653.1| PREDICTED: uncharacterized protein LOC101304...   332   8e-89
ref|XP_007150933.1| hypothetical protein PHAVU_004G007000g [Phas...
329 7e-88 emb|CBI34881.3| unnamed protein product [Vitis vinifera] 329 7e-88 >gb|EYU44778.1| hypothetical protein MIMGU_mgv1a008498mg [Mimulus guttatus] Length = 371 Score = 412 bits (1059), Expect = e-113 Identities = 193/238 (81%), Positives = 214/238 (89%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQKPKIAFMF 209 MKRRS+ KFHHRW+RKF A LL C ATFLLME+E+S+IK+ + ISP L+KPKIAF+F Sbjct: 1 MKRRSSLQKFHHRWRRKFFALLLFLFCVATFLLMEAEHSRIKLLTFISPPLRKPKIAFLF 60 Query: 210 IARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDWG 389 IARNRIPLD+VWDVFFQ D ENRFSIYVHSRPGFLL +A TRS FFLNRQ+NDS+QV+WG Sbjct: 61 IARNRIPLDMVWDVFFQGDAENRFSIYVHSRPGFLLNTATTRSTFFLNRQINDSIQVEWG 120 Query: 390 EATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNKE 569 EA+MIQAERILL+HAL DP NERFLFLSDSCIPLYNFSYTYDYIMST SFVDSFAD+KE Sbjct: 121 EASMIQAERILLQHALMDPFNERFLFLSDSCIPLYNFSYTYDYIMSTSTSFVDSFADSKE 180 Query: 570 SRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743 SRYNP+MHPVI VD+WRKGSQWV+LTRKHA IVVEDET+F FQLHCKRKSLPEFWRD Sbjct: 181 SRYNPRMHPVIHVDNWRKGSQWVILTRKHAAIVVEDETIFPTFQLHCKRKSLPEFWRD 238 >ref|XP_004232318.1| PREDICTED: uncharacterized protein LOC101257325 [Solanum lycopersicum] Length = 374 Score = 380 bits (975), Expect = e-103 Identities = 179/240 (74%), Positives = 203/240 (84%), Gaps = 1/240 (0%) Frame = +3 Query: 27 RMKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLIS-PRLQKPKIAF 203 ++ RRSNS K HRWK K A +L F T +LME++Y+KI+M +L+S P+LQ PKIAF Sbjct: 4 KLIRRSNSQKSQHRWKIKVFAMMLFVFFFGTLVLMETQYNKIRMLALLSAPQLQNPKIAF 63 Query: 204 MFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVD 383 +FIARNR+PLDIVWD FFQ D ENRFSI VHSRPGFLL TRS +FLNRQ+NDS+QVD Sbjct: 64 LFIARNRLPLDIVWDAFFQGDKENRFSILVHSRPGFLLNKVTTRSAYFLNRQMNDSIQVD 123 Query: 384 WGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADN 563 WGEA+MIQAERILL+HAL DP+NERF+FLSDSCIPLYNFSYTYDYIMSTPNSFVDSFAD Sbjct: 124 WGEASMIQAERILLQHALMDPLNERFVFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADT 183 Query: 564 
KESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743 KE RYNPKMHP+I V SWRKGSQW VL RKHA IVV+DE +F MFQLHCK+K LPEFWRD Sbjct: 184 KEGRYNPKMHPIIPVQSWRKGSQWAVLNRKHADIVVKDEILFPMFQLHCKKKPLPEFWRD 243 >ref|XP_006338588.1| PREDICTED: uncharacterized protein LOC102600190 [Solanum tuberosum] Length = 374 Score = 376 bits (965), Expect = e-102 Identities = 177/240 (73%), Positives = 203/240 (84%), Gaps = 1/240 (0%) Frame = +3 Query: 27 RMKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLIS-PRLQKPKIAF 203 ++ RRSNS K HRWK K A +L F T +LME++Y++I+M +L+S P++Q PKIAF Sbjct: 4 KLIRRSNSQKSQHRWKIKVFAMMLFIFFFGTLVLMETQYNRIRMLALLSAPQVQNPKIAF 63 Query: 204 MFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVD 383 +FIARNR+PLDIVWD FFQ D EN+FSI VHSRPGFLL TRS +FLNRQ+NDS+QVD Sbjct: 64 LFIARNRLPLDIVWDAFFQGDKENKFSILVHSRPGFLLNKVTTRSAYFLNRQMNDSIQVD 123 Query: 384 WGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADN 563 WGEATMIQAERILL+HAL DP+NERF+FLSDSCIPLYNFSYTYDYIMSTPNSFVDSFAD Sbjct: 124 WGEATMIQAERILLQHALMDPLNERFVFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADT 183 Query: 564 KESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743 KE RYNPKM P+ISV SWRKGSQW VL RKHA IVV+DE +F MFQLHCK+K LPEFWRD Sbjct: 184 KEGRYNPKMDPIISVQSWRKGSQWAVLNRKHADIVVKDEILFPMFQLHCKKKPLPEFWRD 243 >ref|XP_002275670.1| PREDICTED: uncharacterized protein LOC100259507 [Vitis vinifera] Length = 367 Score = 363 bits (931), Expect = 4e-98 Identities = 172/239 (71%), Positives = 202/239 (84%), Gaps = 1/239 (0%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKM-RSLISPRLQKPKIAFM 206 MKR+ S ++WKR A LL+ CF + +L++++YS+I+M S+ SP LQ+PKIAF+ Sbjct: 1 MKRKQKS---QYKWKRNLFAMLLLGFCFGSLVLLQTQYSRIRMFASMPSPFLQRPKIAFL 57 Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386 FIARNR+PLD+VWD FF+ + EN+FSI+VHSRPGFLL A TRSV+FLNRQ+NDS+QVDW Sbjct: 58 FIARNRLPLDVVWDAFFRDEKENKFSIFVHSRPGFLLNKATTRSVYFLNRQLNDSIQVDW 117 Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNK 566 GEA+MIQAERILL+ AL 
DP+NERF+FLSDSCIPLYNFSY YDYIMST SFVDSFAD K Sbjct: 118 GEASMIQAERILLRSALLDPLNERFVFLSDSCIPLYNFSYIYDYIMSTSTSFVDSFADTK 177 Query: 567 ESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743 E RYNPKM PVI V +WRKGSQWVVLTRKHA IVVED+TVF MFQ HCKRKSLPEFWRD Sbjct: 178 EGRYNPKMDPVIPVHNWRKGSQWVVLTRKHAQIVVEDDTVFPMFQQHCKRKSLPEFWRD 236 >ref|XP_007031552.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 4, partial [Theobroma cacao] gi|508710581|gb|EOY02478.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 4, partial [Theobroma cacao] Length = 289 Score = 351 bits (901), Expect = 1e-94 Identities = 169/239 (70%), Positives = 197/239 (82%), Gaps = 1/239 (0%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQ-KPKIAFM 206 MKR++ K RWKRK LA LL+A C A+ LME++YS+I + + R KPKIAF+ Sbjct: 4 MKRKAVQQKSQIRWKRKVLAALLVAFCLASLALMETQYSRIVSLASLRHRFAVKPKIAFL 63 Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386 FIARNR+PLD+VWD FF+ + +NRFSIYVHSRPGFL A TRS +FLNRQVNDS+QVDW Sbjct: 64 FIARNRLPLDMVWDAFFKGE-DNRFSIYVHSRPGFLFNKATTRSSYFLNRQVNDSIQVDW 122 Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNK 566 GEA+MI+AERILL+HALTDP NERF+F+SDSCIPLYNFSY YDYIMST SFVDSFAD K Sbjct: 123 GEASMIEAERILLRHALTDPFNERFVFVSDSCIPLYNFSYMYDYIMSTSTSFVDSFADTK 182 Query: 567 ESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743 E RYNPKM+P+I V +WRKGSQWVVLTRKHA +V+ D TVF MFQ HCKR+SLPEFWRD Sbjct: 183 EGRYNPKMNPIIPVYNWRKGSQWVVLTRKHAEVVINDTTVFPMFQEHCKRRSLPEFWRD 241 >ref|XP_007031549.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 1 [Theobroma cacao] gi|508710578|gb|EOY02475.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 1 [Theobroma cacao] Length = 372 Score = 351 bits (901), Expect = 1e-94 Identities = 169/239 (70%), Positives = 197/239 (82%), Gaps = 1/239 (0%) Frame = +3 Query: 30 
MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQ-KPKIAFM 206 MKR++ K RWKRK LA LL+A C A+ LME++YS+I + + R KPKIAF+ Sbjct: 4 MKRKAVQQKSQIRWKRKVLAALLVAFCLASLALMETQYSRIVSLASLRHRFAVKPKIAFL 63 Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386 FIARNR+PLD+VWD FF+ + +NRFSIYVHSRPGFL A TRS +FLNRQVNDS+QVDW Sbjct: 64 FIARNRLPLDMVWDAFFKGE-DNRFSIYVHSRPGFLFNKATTRSSYFLNRQVNDSIQVDW 122 Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNK 566 GEA+MI+AERILL+HALTDP NERF+F+SDSCIPLYNFSY YDYIMST SFVDSFAD K Sbjct: 123 GEASMIEAERILLRHALTDPFNERFVFVSDSCIPLYNFSYMYDYIMSTSTSFVDSFADTK 182 Query: 567 ESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743 E RYNPKM+P+I V +WRKGSQWVVLTRKHA +V+ D TVF MFQ HCKR+SLPEFWRD Sbjct: 183 EGRYNPKMNPIIPVYNWRKGSQWVVLTRKHAEVVINDTTVFPMFQEHCKRRSLPEFWRD 241 >ref|XP_007215567.1| hypothetical protein PRUPE_ppa007456mg [Prunus persica] gi|462411717|gb|EMJ16766.1| hypothetical protein PRUPE_ppa007456mg [Prunus persica] Length = 367 Score = 351 bits (900), Expect = 2e-94 Identities = 171/240 (71%), Positives = 196/240 (81%), Gaps = 2/240 (0%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATF-LLMESEYSKIK-MRSLISPRLQKPKIAF 203 MKR++ K H+WKRK LL+ CF T LLM ++YS+I + SL + Q PK+AF Sbjct: 1 MKRKAGPQKAQHKWKRKLFVALLMGFCFGTLVLLMHTQYSRIMTLASLHTQLTQPPKVAF 60 Query: 204 MFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVD 383 +FIARNR+PLD++WDVFFQ E+RFSIYVHSRPGFL A TRSVFFLNRQVNDS+QVD Sbjct: 61 LFIARNRLPLDLLWDVFFQGG-ESRFSIYVHSRPGFLFNKATTRSVFFLNRQVNDSIQVD 119 Query: 384 WGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADN 563 WGEA+MI+AERILLKHAL DP+N+RF FLSDSCIPLY+FSY YDYIMST SFVDSFAD Sbjct: 120 WGEASMIEAERILLKHALEDPLNQRFAFLSDSCIPLYSFSYIYDYIMSTRTSFVDSFADT 179 Query: 564 KESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743 KE RYNPKM+PVI V +WRKGSQWVVLTRKHA +VV+D TVF MFQ HCK KSLPEFWRD Sbjct: 180 KEGRYNPKMNPVIPVHNWRKGSQWVVLTRKHAEVVVKDNTVFPMFQQHCKTKSLPEFWRD 239 >ref|XP_007031550.1| Core-2/I-branching 
beta-1,6-N-acetylglucosaminyltransferase family protein isoform 2 [Theobroma cacao] gi|508710579|gb|EOY02476.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 2 [Theobroma cacao] Length = 381 Score = 347 bits (889), Expect = 3e-93 Identities = 169/240 (70%), Positives = 197/240 (82%), Gaps = 2/240 (0%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQ-KPKIAFM 206 MKR++ K RWKRK LA LL+A C A+ LME++YS+I + + R KPKIAF+ Sbjct: 4 MKRKAVQQKSQIRWKRKVLAALLVAFCLASLALMETQYSRIVSLASLRHRFAVKPKIAFL 63 Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386 FIARNR+PLD+VWD FF+ + +NRFSIYVHSRPGFL A TRS +FLNRQVNDS+QVDW Sbjct: 64 FIARNRLPLDMVWDAFFKGE-DNRFSIYVHSRPGFLFNKATTRSSYFLNRQVNDSIQVDW 122 Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVD-SFADN 563 GEA+MI+AERILL+HALTDP NERF+F+SDSCIPLYNFSY YDYIMST SFVD SFAD Sbjct: 123 GEASMIEAERILLRHALTDPFNERFVFVSDSCIPLYNFSYMYDYIMSTSTSFVDSSFADT 182 Query: 564 KESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743 KE RYNPKM+P+I V +WRKGSQWVVLTRKHA +V+ D TVF MFQ HCKR+SLPEFWRD Sbjct: 183 KEGRYNPKMNPIIPVYNWRKGSQWVVLTRKHAEVVINDTTVFPMFQEHCKRRSLPEFWRD 242 >ref|XP_006289626.1| hypothetical protein CARUB_v10003187mg [Capsella rubella] gi|482558332|gb|EOA22524.1| hypothetical protein CARUB_v10003187mg [Capsella rubella] Length = 371 Score = 341 bits (874), Expect = 2e-91 Identities = 161/241 (66%), Positives = 199/241 (82%), Gaps = 3/241 (1%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKI--KMRSLISPRL-QKPKIA 200 MK++++ K +RWKRK A L+ A C TF+ +++ +S I + SL PRL QKP+IA Sbjct: 1 MKKKASHQKLLYRWKRKVYATLMFAFCLGTFVFIQARFSGITASLHSLKKPRLHQKPQIA 60 Query: 201 FMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQV 380 F+FIARNR+PL+ VWD FFQ + + +FSIYVHSRPGF+L+ A TRS FFL+RQVNDS+QV Sbjct: 61 FLFIARNRLPLEFVWDAFFQGE-DGKFSIYVHSRPGFILSEATTRSKFFLDRQVNDSIQV 119 Query: 381 DWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFAD 560 DWGE+TMI+AER+LL+HAL DP N 
RF+FLSDSCIPLY+FSYTY+YIMSTP SFVDSFAD Sbjct: 120 DWGESTMIEAERVLLRHALRDPFNHRFVFLSDSCIPLYSFSYTYNYIMSTPTSFVDSFAD 179 Query: 561 NKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWR 740 K+SRYNP+M+P+I V +WRKGSQWVVL RKH+ IVV D +VF MFQ HC+RKSLPEFWR Sbjct: 180 TKDSRYNPRMNPIIPVHNWRKGSQWVVLNRKHSEIVVNDTSVFPMFQQHCRRKSLPEFWR 239 Query: 741 D 743 D Sbjct: 240 D 240 >ref|XP_002873676.1| hypothetical protein ARALYDRAFT_909422 [Arabidopsis lyrata subsp. lyrata] gi|297319513|gb|EFH49935.1| hypothetical protein ARALYDRAFT_909422 [Arabidopsis lyrata subsp. lyrata] Length = 369 Score = 340 bits (871), Expect = 4e-91 Identities = 161/241 (66%), Positives = 199/241 (82%), Gaps = 3/241 (1%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKI--KMRSLISPRL-QKPKIA 200 MK++ + K +RWKRK A L+ A C TF+ +++ ++ I + SL PRL QKP+IA Sbjct: 1 MKKKVSQQKLLYRWKRKVYATLMFAFCLGTFVFIQARFNGITASLDSLKKPRLDQKPQIA 60 Query: 201 FMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQV 380 F+FIARNR+PL++VWD FFQ + + +FSIYVHSRPGF+L+ A TRS FFL+RQVNDS+QV Sbjct: 61 FLFIARNRLPLELVWDAFFQGE-DGKFSIYVHSRPGFVLSEATTRSKFFLDRQVNDSIQV 119 Query: 381 DWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFAD 560 DWGE+TMI+AER+LL+HAL DP N RF+FLSDSCIPLY+FSYTY+YIMSTP SFVDSFAD Sbjct: 120 DWGESTMIEAERVLLRHALRDPFNHRFVFLSDSCIPLYSFSYTYNYIMSTPTSFVDSFAD 179 Query: 561 NKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWR 740 K+SRYNP+M+P+I V +WRKGSQWVVL RKHA IVV D +VF MFQ HC+RKSLPEFWR Sbjct: 180 TKDSRYNPRMNPIIPVHNWRKGSQWVVLNRKHAEIVVNDTSVFPMFQQHCRRKSLPEFWR 239 Query: 741 D 743 D Sbjct: 240 D 240 >ref|XP_004142791.1| PREDICTED: uncharacterized protein LOC101222566 [Cucumis sativus] gi|449483780|ref|XP_004156689.1| PREDICTED: uncharacterized LOC101222566 [Cucumis sativus] Length = 366 Score = 339 bits (870), Expect = 5e-91 Identities = 162/241 (67%), Positives = 192/241 (79%), Gaps = 3/241 (1%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQK---PKIA 200 MK++ K +W+RK LL CF + ++M+S Y ++ M + + Q 
PK+A Sbjct: 1 MKQKVAQRKALFKWRRKLAFVLLFVFCFGSLVMMQSRYGRVMMLASLHLHPQSAHGPKVA 60 Query: 201 FMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQV 380 F+FIARNR+PLDIVWDVFFQ + EN+FSI+VHSRPGFL A TRS +FLNRQVNDS+QV Sbjct: 61 FLFIARNRLPLDIVWDVFFQ-EGENKFSIFVHSRPGFLFNKATTRSTYFLNRQVNDSIQV 119 Query: 381 DWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFAD 560 DWGEA+MI+AERILL+HALTD N+RF+FLSDSC+PLYNFSYTYDY+MST SFVDSFAD Sbjct: 120 DWGEASMIEAERILLRHALTDSSNQRFVFLSDSCVPLYNFSYTYDYVMSTSTSFVDSFAD 179 Query: 561 NKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWR 740 KE RYNPKM PVI V +WRKGSQWVVLTRKHA +VV+D TVF MFQ HCKRKSLPEFWR Sbjct: 180 TKEGRYNPKMDPVIPVQNWRKGSQWVVLTRKHAKVVVKDITVFPMFQQHCKRKSLPEFWR 239 Query: 741 D 743 D Sbjct: 240 D 240 >ref|XP_006470273.1| PREDICTED: uncharacterized protein LOC102626456 [Citrus sinensis] Length = 374 Score = 338 bits (867), Expect = 1e-90 Identities = 163/242 (67%), Positives = 197/242 (81%), Gaps = 4/242 (1%) Frame = +3 Query: 30 MKRR---SNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPR-LQKPKI 197 MKR+ KF+++WKRK A +L+ CF + +LM+ +Y++I + PR +QKPKI Sbjct: 1 MKRKVVYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRIMS---LRPRFVQKPKI 57 Query: 198 AFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQ 377 AF+FIARNR+PL++VWD FF+ + E+RFSIYVHSRPGFL + TRS++FL+RQVNDS+Q Sbjct: 58 AFLFIARNRLPLEMVWDKFFKGE-ESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQ 116 Query: 378 VDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFA 557 VDWG A+MI+AERILL+HAL DP N+RF+FLSDSCIPLYNFSYTY+YIMST SFVDSFA Sbjct: 117 VDWGGASMIEAERILLRHALADPFNDRFVFLSDSCIPLYNFSYTYNYIMSTSTSFVDSFA 176 Query: 558 DNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFW 737 D KE RYNPKM PVI V +WRKGSQW VLTRKHA IVV D TVF MFQ HCKRKSLPEFW Sbjct: 177 DTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFW 236 Query: 738 RD 743 R+ Sbjct: 237 RE 238 >ref|XP_006446563.1| hypothetical protein CICLE_v10015712mg [Citrus clementina] gi|557549174|gb|ESR59803.1| hypothetical protein CICLE_v10015712mg [Citrus 
clementina] Length = 361 Score = 338 bits (867), Expect = 1e-90 Identities = 163/242 (67%), Positives = 197/242 (81%), Gaps = 4/242 (1%) Frame = +3 Query: 30 MKRR---SNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPR-LQKPKI 197 MKR+ KF+++WKRK A +L+ CF + +LM+ +Y++I + PR +QKPKI Sbjct: 1 MKRKVVYQQQQKFNYKWKRKVFAAILLGFCFGSLVLMQCQYTRIMS---LRPRFVQKPKI 57 Query: 198 AFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQ 377 AF+FIARNR+PL++VWD FF+ + E+RFSIYVHSRPGFL + TRS++FL+RQVNDS+Q Sbjct: 58 AFLFIARNRLPLEMVWDKFFKGE-ESRFSIYVHSRPGFLFSKGTTRSIYFLDRQVNDSIQ 116 Query: 378 VDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFA 557 VDWG A+MI+AERILL+HAL DP N+RF+FLSDSCIPLYNFSYTY+YIMST SFVDSFA Sbjct: 117 VDWGGASMIEAERILLRHALADPFNDRFVFLSDSCIPLYNFSYTYNYIMSTSTSFVDSFA 176 Query: 558 DNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFW 737 D KE RYNPKM PVI V +WRKGSQW VLTRKHA IVV D TVF MFQ HCKRKSLPEFW Sbjct: 177 DTKEGRYNPKMAPVIPVHNWRKGSQWAVLTRKHAEIVVNDTTVFPMFQQHCKRKSLPEFW 236 Query: 738 RD 743 R+ Sbjct: 237 RE 238 >ref|XP_006399968.1| hypothetical protein EUTSA_v10015269mg [Eutrema salsugineum] gi|557101058|gb|ESQ41421.1| hypothetical protein EUTSA_v10015269mg [Eutrema salsugineum] Length = 365 Score = 333 bits (853), Expect = 5e-89 Identities = 160/239 (66%), Positives = 192/239 (80%), Gaps = 1/239 (0%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRL-QKPKIAFM 206 MKR+++ K +RWKRK A L+ A C +F +++ YS I + + PRL QKP+IAF+ Sbjct: 1 MKRKASQHKLLNRWKRKVFATLIFAFCLGSFAFIQARYSGIT--ASLKPRLDQKPQIAFL 58 Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386 FIARNR+PL+ VWD FF + + +FSI+VHSRPGF+L+ A TRS FFL+RQVNDS+QVDW Sbjct: 59 FIARNRLPLESVWDAFFMGE-DGKFSIFVHSRPGFILSEATTRSKFFLDRQVNDSIQVDW 117 Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNK 566 GEATMI+AERILL+HAL DP N RF+FLSDSCIPLY+FSYTY+YIMSTP SFVDSFAD K Sbjct: 118 GEATMIEAERILLRHALRDPFNHRFVFLSDSCIPLYSFSYTYNYIMSTPTSFVDSFADTK 177 Query: 567 
ESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 743 +SRYNP+M P+I V WRKGSQWVVL RKHA IVV D V MFQ HC+RKSLPEFWRD Sbjct: 178 DSRYNPRMSPIIPVHHWRKGSQWVVLNRKHAEIVVNDTIVLPMFQQHCRRKSLPEFWRD 236 >ref|XP_007031553.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 5 [Theobroma cacao] gi|508710582|gb|EOY02479.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 5 [Theobroma cacao] Length = 253 Score = 332 bits (852), Expect = 6e-89 Identities = 161/233 (69%), Positives = 189/233 (81%), Gaps = 1/233 (0%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMRSLISPRLQ-KPKIAFM 206 MKR++ K RWKRK LA LL+A C A+ LME++YS+I + + R KPKIAF+ Sbjct: 4 MKRKAVQQKSQIRWKRKVLAALLVAFCLASLALMETQYSRIVSLASLRHRFAVKPKIAFL 63 Query: 207 FIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDW 386 FIARNR+PLD+VWD FF+ + +NRFSIYVHSRPGFL A TRS +FLNRQVNDS+QVDW Sbjct: 64 FIARNRLPLDMVWDAFFKGE-DNRFSIYVHSRPGFLFNKATTRSSYFLNRQVNDSIQVDW 122 Query: 387 GEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNK 566 GEA+MI+AERILL+HALTDP NERF+F+SDSCIPLYNFSY YDYIMST SFVDSFAD K Sbjct: 123 GEASMIEAERILLRHALTDPFNERFVFVSDSCIPLYNFSYMYDYIMSTSTSFVDSFADTK 182 Query: 567 ESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSL 725 E RYNPKM+P+I V +WRKGSQWVVLTRKHA +V+ D TVF MFQ HCK K + Sbjct: 183 EGRYNPKMNPIIPVYNWRKGSQWVVLTRKHAEVVINDTTVFPMFQEHCKAKEI 235 >ref|NP_196959.2| core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein [Arabidopsis thaliana] gi|209863160|gb|ACI88738.1| At5g14550 [Arabidopsis thaliana] gi|332004663|gb|AED92046.1| core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein [Arabidopsis thaliana] gi|591401812|gb|AHL38633.1| glycosyltransferase, partial [Arabidopsis thaliana] Length = 377 Score = 332 bits (852), Expect = 6e-89 Identities = 158/248 (63%), Positives = 198/248 (79%), Gaps = 10/248 (4%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMR---------SLISPRL 182 MK++ 
+ K +RWKRK A L+ A CF TF+ +++ ++ I+ R SL PRL Sbjct: 1 MKKKVSQQKLLYRWKRKVYATLMFAFCFGTFVFIQARFASIQARFNRISASLDSLKKPRL 60 Query: 183 -QKPKIAFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQ 359 Q+P+IAF+FIARNR+PL+ VWD FF+ + + +FSIYVHSRPGF+L A TRS +FL+RQ Sbjct: 61 DQRPQIAFLFIARNRLPLEFVWDAFFKGE-DGKFSIYVHSRPGFVLNEATTRSKYFLDRQ 119 Query: 360 VNDSVQVDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNS 539 +NDS+QVDWGE+TMI+AER+LL+HAL D N RF+FLSDSCIPLY+FSYTY+YIMSTP S Sbjct: 120 LNDSIQVDWGESTMIEAERVLLRHALRDSFNHRFVFLSDSCIPLYSFSYTYNYIMSTPTS 179 Query: 540 FVDSFADNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRK 719 FVDSFAD K+SRYNP+M+P+I V +WRKGSQWVVL RKHA IVV D +VF MFQ HC+RK Sbjct: 180 FVDSFADTKDSRYNPRMNPIIPVRNWRKGSQWVVLNRKHAEIVVNDTSVFPMFQQHCRRK 239 Query: 720 SLPEFWRD 743 SLPEFWRD Sbjct: 240 SLPEFWRD 247 >dbj|BAD43086.1| putative protein [Arabidopsis thaliana] Length = 377 Score = 332 bits (852), Expect = 6e-89 Identities = 158/248 (63%), Positives = 198/248 (79%), Gaps = 10/248 (4%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATFLLMESEYSKIKMR---------SLISPRL 182 MK++ + K +RWKRK A L+ A CF TF+ +++ ++ I+ R SL PRL Sbjct: 1 MKKKVSQQKLLYRWKRKVYATLMFAFCFGTFVFIQARFASIQARFNRISASLDSLKKPRL 60 Query: 183 -QKPKIAFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQ 359 Q+P+IAF+FIARNR+PL+ VWD FF+ + + +FSIYVHSRPGF+L A TRS +FL+RQ Sbjct: 61 DQRPQIAFLFIARNRLPLEFVWDAFFKGE-DGKFSIYVHSRPGFVLNEATTRSKYFLDRQ 119 Query: 360 VNDSVQVDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNS 539 +NDS+QVDWGE+TMI+AER+LL+HAL D N RF+FLSDSCIPLY+FSYTY+YIMSTP S Sbjct: 120 LNDSIQVDWGESTMIEAERVLLRHALRDSFNHRFVFLSDSCIPLYSFSYTYNYIMSTPTS 179 Query: 540 FVDSFADNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRK 719 FVDSFAD K+SRYNP+M+P+I V +WRKGSQWVVL RKHA IVV D +VF MFQ HC+RK Sbjct: 180 FVDSFADTKDSRYNPRMNPIIPVRNWRKGSQWVVLNRKHAEIVVNDTSVFPMFQQHCRRK 239 Query: 720 SLPEFWRD 743 SLPEFWRD Sbjct: 240 SLPEFWRD 247 >ref|XP_004304653.1| PREDICTED: uncharacterized protein LOC101304206 [Fragaria vesca subsp. 
vesca] Length = 359 Score = 332 bits (851), Expect = 8e-89 Identities = 164/242 (67%), Positives = 193/242 (79%), Gaps = 4/242 (1%) Frame = +3 Query: 30 MKRRSNSGKFHHRWKRKFLAFLLIAICFATF-LLMESEYSKIKMRSLISPRL---QKPKI 197 MKR + S +WKRK LL+ +CF T LLM + YS+I + I P++ Q+PKI Sbjct: 1 MKRATKS---QFKWKRK----LLMCLCFGTLVLLMHTHYSRIMTLASIRPQMNTIQRPKI 53 Query: 198 AFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQ 377 AF+FIARNR+PLD++WDVFFQ E++FSIYVHSRPGFL TRS FFLNRQVNDS+Q Sbjct: 54 AFLFIARNRLPLDMLWDVFFQGG-ESKFSIYVHSRPGFLFNKVTTRSDFFLNRQVNDSIQ 112 Query: 378 VDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFA 557 VDWGEATMI+AERILLKHAL DP+N+RF F+SDSCIPLY+F Y YDY+MST SFVDSFA Sbjct: 113 VDWGEATMIEAERILLKHALEDPLNQRFAFVSDSCIPLYSFKYIYDYVMSTRTSFVDSFA 172 Query: 558 DNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFW 737 D K+ RYNPKM P+I V +WRKGSQW VLTRKHA +VV+D TVF MFQL+CKRKSLPEFW Sbjct: 173 DTKDGRYNPKMDPIIPVQNWRKGSQWAVLTRKHAEVVVKDNTVFPMFQLYCKRKSLPEFW 232 Query: 738 RD 743 RD Sbjct: 233 RD 234 >ref|XP_007150933.1| hypothetical protein PHAVU_004G007000g [Phaseolus vulgaris] gi|561024242|gb|ESW22927.1| hypothetical protein PHAVU_004G007000g [Phaseolus vulgaris] Length = 363 Score = 329 bits (843), Expect = 7e-88 Identities = 154/233 (66%), Positives = 187/233 (80%), Gaps = 6/233 (2%) Frame = +3 Query: 63 HRWKRKFLAFLLIAICFATFLLMESEYSKIK-----MRSLIS-PRLQKPKIAFMFIARNR 224 HRWK+K A + + C +FL M++ Y+ + R +S P +Q PKIAF+FIARNR Sbjct: 8 HRWKKKLFALIFVVFCLGSFLFMQTRYNHVVGLVSLQRHFVSKPEVQSPKIAFLFIARNR 67 Query: 225 IPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFLNRQVNDSVQVDWGEATMI 404 +PL+IVWD FF+ + +FSI+VH RPGFLL A TRS +FLNRQVN+SVQV+WGEA+MI Sbjct: 68 LPLEIVWDAFFRGG-DGKFSIFVHCRPGFLLNKATTRSPYFLNRQVNNSVQVEWGEASMI 126 Query: 405 QAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMSTPNSFVDSFADNKESRYNP 584 +AER+LL+HAL+DP+N+RF+FLSDSCIPLYNF+YTYDYIMS SFVDSFAD KE RYNP Sbjct: 127 EAERVLLRHALSDPLNDRFVFLSDSCIPLYNFTYTYDYIMSASTSFVDSFADTKEGRYNP 186 Query: 585 KMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHCKRKSLPEFWRD 
743 KM PVI V +WRKGSQW VLTRKHA +VVEDETVF MFQ +CK+K LPEFWRD Sbjct: 187 KMDPVIPVYNWRKGSQWAVLTRKHAKVVVEDETVFPMFQKYCKKKPLPEFWRD 239 >emb|CBI34881.3| unnamed protein product [Vitis vinifera] Length = 324 Score = 329 bits (843), Expect = 7e-88 Identities = 154/191 (80%), Positives = 171/191 (89%) Frame = +3 Query: 171 SPRLQKPKIAFMFIARNRIPLDIVWDVFFQSDTENRFSIYVHSRPGFLLTSAMTRSVFFL 350 SP LQ+PKIAF+FIARNR+PLD+VWD FF+ + EN+FSI+VHSRPGFLL A TRSV+FL Sbjct: 3 SPFLQRPKIAFLFIARNRLPLDVVWDAFFRDEKENKFSIFVHSRPGFLLNKATTRSVYFL 62 Query: 351 NRQVNDSVQVDWGEATMIQAERILLKHALTDPINERFLFLSDSCIPLYNFSYTYDYIMST 530 NRQ+NDS+QVDWGEA+MIQAERILL+ AL DP+NERF+FLSDSCIPLYNFSY YDYIMST Sbjct: 63 NRQLNDSIQVDWGEASMIQAERILLRSALLDPLNERFVFLSDSCIPLYNFSYIYDYIMST 122 Query: 531 PNSFVDSFADNKESRYNPKMHPVISVDSWRKGSQWVVLTRKHAGIVVEDETVFSMFQLHC 710 SFVDSFAD KE RYNPKM PVI V +WRKGSQWVVLTRKHA IVVED+TVF MFQ HC Sbjct: 123 STSFVDSFADTKEGRYNPKMDPVIPVHNWRKGSQWVVLTRKHAQIVVEDDTVFPMFQQHC 182 Query: 711 KRKSLPEFWRD 743 KRKSLPEFWRD Sbjct: 183 KRKSLPEFWRD 193
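
Reading the summary table and the alignment headers above programmatically: the following is a minimal sketch, not part of the BLAST output itself. It assumes each summary row ends with a bit score followed by an E-value, and that this legacy report format writes E-values such as "e-113" with the leading "1" omitted (so "e-113" is read as 1e-113). The names parse_evalue, parse_hit_row and percent_identity are hypothetical helpers introduced here for illustration only.

```python
import re

# Assumed row layout: "<id> <description> <bits> <E-value>".
# The E-value may be "e-113" (leading 1 omitted), "4e-98", or a plain decimal.
HIT_ROW = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>(?:\d+(?:\.\d+)?)?e-\d+|\d+(?:\.\d+)?)$"
)
IDENTITIES = re.compile(r"Identities = (\d+)/(\d+)")


def parse_evalue(text):
    """Convert an E-value string to float, restoring an omitted leading 1."""
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_row(line):
    """Return a dict for one summary row, or None if the line is not a row."""
    match = HIT_ROW.match(line.strip())
    if match is None:
        return None
    return {
        "id": match.group("id"),
        "description": match.group("desc"),
        "bits": float(match.group("bits")),
        "evalue": parse_evalue(match.group("evalue")),
    }


def percent_identity(header):
    """Compute percent identity from an 'Identities = a/b (..%)' header."""
    match = IDENTITIES.search(header)
    if match is None:
        return None
    identical, aligned = int(match.group(1)), int(match.group(2))
    return 100.0 * identical / aligned


if __name__ == "__main__":
    row = parse_hit_row(
        "gb|EYU44778.1| hypothetical protein MIMGU_mgv1a008498mg [Mimulus... 412 e-113"
    )
    print(row["id"], row["bits"], row["evalue"])
    print(percent_identity("Identities = 193/238 (81%), Positives = 214/238 (89%)"))
```

The row whose score wraps onto the next line (the XP_007150933.1 entry) would not match this pattern as a single line, so a caller would need to rejoin wrapped rows first. For anything beyond a quick look, an established BLAST parser is likely the safer choice.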