BLASTX nr result
ID: Mentha29_contig00015843
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00015843
         (636 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU27722.1| hypothetical protein MIMGU_mgv1a008710mg [Mimulus...  309   4e-82
ref|XP_002264137.1| PREDICTED: uncharacterized protein LOC100262...  267   2e-69
ref|XP_006348151.1| PREDICTED: uncharacterized protein LOC102578...  267   2e-69
ref|XP_006348150.1| PREDICTED: uncharacterized protein LOC102578...  267   2e-69
ref|XP_004232690.1| PREDICTED: uncharacterized protein LOC101246...  261   1e-67
ref|XP_002518435.1| conserved hypothetical protein [Ricinus comm...  246   5e-63
gb|EXB77042.1| hypothetical protein L484_014168 [Morus notabilis]    245   7e-63
ref|XP_002317140.2| hypothetical protein POPTR_0011s01410g [Popu...  242   8e-62
ref|XP_007214167.1| hypothetical protein PRUPE_ppa018994mg [Prun...  238   1e-60
ref|XP_007026153.1| Core-2/I-branching beta-1,6-N-acetylglucosam...  237   2e-60
ref|XP_007026152.1| Core-2/I-branching beta-1,6-N-acetylglucosam...  237   2e-60
ref|XP_007026150.1| Core-2/I-branching beta-1,6-N-acetylglucosam...  237   2e-60
ref|XP_007026149.1| Core-2/I-branching beta-1,6-N-acetylglucosam...  237   2e-60
ref|XP_006305062.1| hypothetical protein CARUB_v10009428mg [Caps...  237   3e-60
ref|XP_004134777.1| PREDICTED: uncharacterized protein LOC101222...  233   3e-59
ref|NP_172658.2| core-2/I-branching beta-1,6-N-acetylglucosaminy...  233   4e-59
gb|AAC17624.1| Contains similarity to hypothetical protein gb|U9...  233   4e-59
ref|XP_002892665.1| hypothetical protein ARALYDRAFT_312224 [Arab...  233   4e-59
ref|XP_007026151.1| Core-2/I-branching beta-1,6-N-acetylglucosam...  233   5e-59
ref|XP_006417284.1| hypothetical protein EUTSA_v10007925mg [Eutr...  228   1e-57

>gb|EYU27722.1| hypothetical protein MIMGU_mgv1a008710mg [Mimulus guttatus]
          Length = 365

 Score = 309 bits (792), Expect = 4e-82
 Identities = 155/193 (80%), Positives = 170/193 (88%), Gaps = 5/193 (2%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQQYSQS-----TAALPRKS 237
           MTKK  AS+KPGLSMRHVLCLGWKL ILVS+ LCV+AFLRIQQYSQS     +  LPR++
Sbjct: 1   MTKKGYASLKPGLSMRHVLCLGWKLLILVSLILCVWAFLRIQQYSQSMGSSASVVLPRRT 60

Query: 238 RSLVYDFSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTRS 417
           R   Y F G+PKIAFLFLVRKNLPLDFLWESFFENVD+A +SIYIHSEPGF+FDE TTR
Sbjct: 61  RVSDYHFRGDPKIAFLFLVRKNLPLDFLWESFFENVDKAKYSIYIHSEPGFLFDESTTRP 120

Query: 418 AIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIYNY 597
            IFFNRQL+NSIKVAWGE SMI+AER+LFEEAL+DPANQRFVLLSDSCVPLYNFSYIYNY
Sbjct: 121 -IFFNRQLKNSIKVAWGEESMIEAERLLFEEALQDPANQRFVLLSDSCVPLYNFSYIYNY 179

Query: 598 VMGSPRSFVDSFL 636
           +  SPRSFVDSFL
Sbjct: 180 LQNSPRSFVDSFL 192

>ref|XP_002264137.1| PREDICTED: uncharacterized protein LOC100262450 [Vitis vinifera]
 gi|302144098|emb|CBI23203.3| unnamed protein product [Vitis vinifera]
          Length = 380

 Score = 267 bits (683), Expect = 2e-69
 Identities = 138/191 (72%), Positives = 155/191 (81%), Gaps = 3/191 (1%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQQYSQ-STAALPRKSRSL- 246
           MTKKA     P  S+RHV   GWKL ILVSVALCV A LR+Q  S+ S+ +LP +
Sbjct: 1   MTKKA-----PSFSIRHVFWFGWKLVILVSVALCVLALLRLQSNSELSSISLPPQGPRFY 55

Query: 247 -VYDFSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTRSAI 423
            V  + GNPKIAFLFLVR++LPLDFLW SFFEN D ANFSIYIHS+PGFVFDE T+RS
Sbjct: 56  RVSVYQGNPKIAFLFLVRRSLPLDFLWGSFFENADAANFSIYIHSQPGFVFDETTSRSRF 115

Query: 424 FFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIYNYVM 603
           F+NRQL NSI+VAWGE+SMIQAER+LFE ALEDPANQRFVLLSDSCVPLYNFSYIYNY+M
Sbjct: 116 FYNRQLSNSIQVAWGESSMIQAERLLFEAALEDPANQRFVLLSDSCVPLYNFSYIYNYMM 175

Query: 604 GSPRSFVDSFL 636
            SPRS+VDSFL
Sbjct: 176 ASPRSYVDSFL 186

>ref|XP_006348151.1| PREDICTED: uncharacterized protein LOC102578773 isoform X2 [Solanum tuberosum]
          Length = 391

 Score = 267 bits (682), Expect = 2e-69
 Identities = 134/197 (68%), Positives = 157/197 (79%), Gaps = 9/197 (4%)
 Frame = +1

Query: 73  MTKKAQASVK----PGLSMRHVLCLGWKLAILVSVALCVFAFLRIQQYSQSTAALPRK-- 234
           M KK+ A++      G+S+R+VL L WKL +LVS+ LCV AFL++Q YS S + L
Sbjct: 1   MKKKSAAAMAMAATAGMSVRNVLWLWWKLLVLVSLTLCVLAFLKLQNYSLSDSELSSSTS 60

Query: 235 ---SRSLVYDFSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEF 405
              SRS   D++GNPK+AFLFLVR+NLPLDFLW +FFEN D  NFSIY+HSEPGFVFDE
Sbjct: 61  SISSRSRAVDYTGNPKVAFLFLVRRNLPLDFLWGNFFENADTGNFSIYVHSEPGFVFDES 120

Query: 406 TTRSAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSY 585
           TTRS  FFNRQL NSIKVAWGE+SMIQAE++L   AL+DPANQRFVLLSDSCVPLYNFS+
Sbjct: 121 TTRSTFFFNRQLTNSIKVAWGESSMIQAEKLLLGAALDDPANQRFVLLSDSCVPLYNFSF 180

Query: 586 IYNYVMGSPRSFVDSFL 636
           IYNY+M SPRSFVDSFL
Sbjct: 181 IYNYLMASPRSFVDSFL 197

>ref|XP_006348150.1| PREDICTED: uncharacterized protein LOC102578773 isoform X1 [Solanum tuberosum]
          Length = 428

 Score = 267 bits (682), Expect = 2e-69
 Identities = 134/197 (68%), Positives = 157/197 (79%), Gaps = 9/197 (4%)
 Frame = +1

Query: 73  MTKKAQASVK----PGLSMRHVLCLGWKLAILVSVALCVFAFLRIQQYSQSTAALPRK-- 234
           M KK+ A++      G+S+R+VL L WKL +LVS+ LCV AFL++Q YS S + L
Sbjct: 38  MKKKSAAAMAMAATAGMSVRNVLWLWWKLLVLVSLTLCVLAFLKLQNYSLSDSELSSSTS 97

Query: 235 ---SRSLVYDFSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEF 405
              SRS   D++GNPK+AFLFLVR+NLPLDFLW +FFEN D  NFSIY+HSEPGFVFDE
Sbjct: 98  SISSRSRAVDYTGNPKVAFLFLVRRNLPLDFLWGNFFENADTGNFSIYVHSEPGFVFDES 157

Query: 406 TTRSAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSY 585
           TTRS  FFNRQL NSIKVAWGE+SMIQAE++L   AL+DPANQRFVLLSDSCVPLYNFS+
Sbjct: 158 TTRSTFFFNRQLTNSIKVAWGESSMIQAEKLLLGAALDDPANQRFVLLSDSCVPLYNFSF 217

Query: 586 IYNYVMGSPRSFVDSFL 636
           IYNY+M SPRSFVDSFL
Sbjct: 218 IYNYLMASPRSFVDSFL 234

>ref|XP_004232690.1| PREDICTED: uncharacterized protein LOC101246782 [Solanum lycopersicum]
          Length = 391

 Score = 261 bits (667), Expect = 1e-67
 Identities = 131/199 (65%), Positives = 159/199 (79%), Gaps = 11/199 (5%)
 Frame = +1

Query: 73  MTKKAQASVK----PGLSMRHVLCLGWKLAILVSVALCVFAFLRIQQYS-------QSTA 219
           M KK+ A++      G+S+R+VL L WKL +LVS+ +CV AFL++Q YS        ST+
Sbjct: 1   MKKKSAAAMAMAATAGMSVRNVLWLWWKLLVLVSLTICVLAFLKLQNYSLSDSELSSSTS 60

Query: 220 ALPRKSRSLVYDFSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFD 399
           ++  +SR+L Y  +GNPK+AFLFLVR+NLPLDFLW +FFEN D  NFSIY+HSEPGFVFD
Sbjct: 61  SISSRSRALYY--TGNPKVAFLFLVRRNLPLDFLWGNFFENADPGNFSIYVHSEPGFVFD 118

Query: 400 EFTTRSAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNF 579
           E TTRS  F+NRQL NSIKVAWGE+SMI AE++L   AL+DPANQRFVLLSDSCVPLYNF
Sbjct: 119 ESTTRSTFFYNRQLTNSIKVAWGESSMIHAEKLLLGAALDDPANQRFVLLSDSCVPLYNF 178

Query: 580 SYIYNYVMGSPRSFVDSFL 636
           S+IYNY+M SPRSFVDSFL
Sbjct: 179 SFIYNYLMASPRSFVDSFL 197

>ref|XP_002518435.1| conserved hypothetical protein [Ricinus communis]
 gi|223542280|gb|EEF43822.1| conserved hypothetical protein [Ricinus communis]
          Length = 405

 Score = 246 bits (627), Expect = 5e-63
 Identities = 129/203 (63%), Positives = 151/203 (74%), Gaps = 15/203 (7%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQ----QYSQSTAALPRKSR 240
           MTKKA     P +  RHV+ LGWKL I++SV+LCVFA LR+      YS  +++    S
Sbjct: 14  MTKKA-----PPVPPRHVIWLGWKLVIILSVSLCVFALLRLHFQSDHYSSPSSSSSSSSS 68

Query: 241 SLVY-----------DFSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPG 387
           S  Y           +F G PK+AFLFLVR++LPLDFLW SFFEN D A+FSI+IHS PG
Sbjct: 69  SSFYRPRSRLSRANLEFHGPPKLAFLFLVRQDLPLDFLWGSFFENADVASFSIFIHSSPG 128

Query: 388 FVFDEFTTRSAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVP 567
           F FDE TTRS  F+ RQL+NSI+VAWGE+SMI+AER+L   ALEDPANQRFVLLSDSCVP
Sbjct: 129 FEFDESTTRSHFFYGRQLKNSIQVAWGESSMIEAERLLLSAALEDPANQRFVLLSDSCVP 188

Query: 568 LYNFSYIYNYVMGSPRSFVDSFL 636
           LYNFSYIY+YVM SPRSFVDSFL
Sbjct: 189 LYNFSYIYSYVMASPRSFVDSFL 211

>gb|EXB77042.1| hypothetical protein L484_014168 [Morus notabilis]
          Length = 362

 Score = 245 bits (626), Expect = 7e-63
 Identities = 128/196 (65%), Positives = 148/196 (75%), Gaps = 7/196 (3%)
 Frame = +1

Query: 70  AMTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQQ-----YSQSTAALPRK 234
           +MTKK+     P ++ RHVL L WKL +++SV LC+ A  R+       YS S++     +
Sbjct: 6   SMTKKS-----PPVATRHVLWLSWKLVVILSVFLCLLALFRLHSQPGFPYSPSSSISSAR 60

Query: 235 SRSLVYD--FSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFT 408
           SR L  D  F+G PKIAFLFL R+NLPLDF WESFFEN D ANFSIY+HS PG  FDE T
Sbjct: 61  SR-LYRDNVFAGPPKIAFLFLARRNLPLDFFWESFFENADAANFSIYVHSAPGLAFDEST 119

Query: 409 TRSAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYI 588
           TRS  F  RQLRNSI+V WGE++MIQAER+L E ALEDPANQRFVLLSDSCVPLYNFSYI
Sbjct: 120 TRSHFFHGRQLRNSIQVGWGESTMIQAERLLLEAALEDPANQRFVLLSDSCVPLYNFSYI 179

Query: 589 YNYVMGSPRSFVDSFL 636
           YNY+M SPRSFVDSFL
Sbjct: 180 YNYLMASPRSFVDSFL 195

>ref|XP_002317140.2| hypothetical protein POPTR_0011s01410g [Populus trichocarpa]
 gi|550327319|gb|EEE97752.2| hypothetical protein POPTR_0011s01410g [Populus trichocarpa]
          Length = 386

 Score = 242 bits (617), Expect = 8e-62
 Identities = 128/194 (65%), Positives = 149/194 (76%), Gaps = 6/194 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGL---SMRHVLCLGWKLAILVSVALCVFAFLRIQQYSQSTAALPRK--- 234
           MTKK+  S+ P L   S R V+  GWKL I++S+ LCVFA  RI   S     L R+
Sbjct: 1   MTKKS--SLLPILLQQSRRRVIWSGWKLVIILSMGLCVFALFRIHLSSPPETLLSRRRSF 58

Query: 235 SRSLVYDFSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTR 414
           SR +V  FSG PK+AFLFLVR+ LPLDFLW SFFEN D  NFSI++HSEPGF FDE TTR
Sbjct: 59  SREVV--FSGPPKVAFLFLVRRGLPLDFLWGSFFENADTGNFSIHVHSEPGFEFDESTTR 116

Query: 415 SAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIYN 594
           S  F+ RQL+NSI+V WGE+SMI+AER+L + ALEDPANQRFVLLSDSCVPLYNFSYIY+
Sbjct: 117 SHFFYGRQLKNSIQVIWGESSMIEAERLLLDAALEDPANQRFVLLSDSCVPLYNFSYIYS 176

Query: 595 YVMGSPRSFVDSFL 636
           Y+M SPRSFVDSFL
Sbjct: 177 YLMASPRSFVDSFL 190

>ref|XP_007214167.1| hypothetical protein PRUPE_ppa018994mg [Prunus persica]
 gi|462410032|gb|EMJ15366.1| hypothetical protein PRUPE_ppa018994mg [Prunus persica]
          Length = 383

 Score = 238 bits (606), Expect = 1e-60
 Identities = 122/194 (62%), Positives = 145/194 (74%), Gaps = 6/194 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQQ----YSQSTAALPRKSR 240
           MTKK+     P +  RHVL   W+L +++S+ LCV AF ++      YS  ++    +SR
Sbjct: 1   MTKKS-----PPIPARHVLRFSWQLVVILSITLCVLAFFKLHSQPDLYSSPSSLSIARSR 55

Query: 241 SLVY--DFSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTR 414
              +  +FSG PKIAFLFL R++LPLDFLW SFFE+ D  NFSIYIHS PGF FDE TTR
Sbjct: 56  VSRHGNNFSGPPKIAFLFLARRSLPLDFLWGSFFESADMPNFSIYIHSAPGFSFDESTTR 115

Query: 415 SAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIYN 594
           S  F+ RQL NSI+V WGE+SMI+AER+LF  ALEDPANQRFVLLSDSCVPLYNFSYIYN
Sbjct: 116 SHFFYGRQLTNSIQVGWGESSMIEAERLLFATALEDPANQRFVLLSDSCVPLYNFSYIYN 175

Query: 595 YVMGSPRSFVDSFL 636
           Y+M SPRSFVDSFL
Sbjct: 176 YLMASPRSFVDSFL 189

>ref|XP_007026153.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 5, partial [Theobroma cacao]
 gi|590626382|ref|XP_007026154.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 5, partial [Theobroma cacao]
 gi|508781519|gb|EOY28775.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 5, partial [Theobroma cacao]
 gi|508781520|gb|EOY28776.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 5, partial [Theobroma cacao]
          Length = 266

 Score = 237 bits (605), Expect = 2e-60
 Identities = 129/194 (66%), Positives = 143/194 (73%), Gaps = 6/194 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQ----QYSQSTAALPRKSR 240
           M KK  A V      R VL LGWKL IL+SVALC  A LR+       S ++ + P + R
Sbjct: 1   MMKKLPAPVPA----RQVLWLGWKLVILLSVALCFVALLRLHFSPDLSSPNSLSRPARVR 56

Query: 241 SLVYD--FSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTR 414
           S +    F G PKIAFLFL R NLPLDFLW SFFEN D ANFSIYIHS PGFVFDE TTR
Sbjct: 57  SRISGGTFDGIPKIAFLFLARFNLPLDFLWGSFFENADVANFSIYIHSAPGFVFDESTTR 116

Query: 415 SAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIYN 594
           S  F++RQL NSI+V WGE+SMI+AER+L E ALEDPANQRFVLLSDSCVPLYNFSYIY
Sbjct: 117 SLFFYDRQLTNSIQVIWGESSMIEAERLLLESALEDPANQRFVLLSDSCVPLYNFSYIYR 176

Query: 595 YVMGSPRSFVDSFL 636
           Y+M S RSFVDSFL
Sbjct: 177 YLMSSSRSFVDSFL 190

>ref|XP_007026152.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 4 [Theobroma cacao]
 gi|508781518|gb|EOY28774.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 4 [Theobroma cacao]
          Length = 284

 Score = 237 bits (605), Expect = 2e-60
 Identities = 129/194 (66%), Positives = 143/194 (73%), Gaps = 6/194 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQ----QYSQSTAALPRKSR 240
           M KK  A V      R VL LGWKL IL+SVALC  A LR+       S ++ + P + R
Sbjct: 1   MMKKLPAPVPA----RQVLWLGWKLVILLSVALCFVALLRLHFSPDLSSPNSLSRPARVR 56

Query: 241 SLVYD--FSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTR 414
           S +    F G PKIAFLFL R NLPLDFLW SFFEN D ANFSIYIHS PGFVFDE TTR
Sbjct: 57  SRISGGTFDGIPKIAFLFLARFNLPLDFLWGSFFENADVANFSIYIHSAPGFVFDESTTR 116

Query: 415 SAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIYN 594
           S  F++RQL NSI+V WGE+SMI+AER+L E ALEDPANQRFVLLSDSCVPLYNFSYIY
Sbjct: 117 SLFFYDRQLTNSIQVIWGESSMIEAERLLLESALEDPANQRFVLLSDSCVPLYNFSYIYR 176

Query: 595 YVMGSPRSFVDSFL 636
           Y+M S RSFVDSFL
Sbjct: 177 YLMSSSRSFVDSFL 190

>ref|XP_007026150.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 2 [Theobroma cacao]
 gi|508781516|gb|EOY28772.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 2 [Theobroma cacao]
          Length = 384

 Score = 237 bits (605), Expect = 2e-60
 Identities = 129/194 (66%), Positives = 143/194 (73%), Gaps = 6/194 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQ----QYSQSTAALPRKSR 240
           M KK  A V      R VL LGWKL IL+SVALC  A LR+       S ++ + P + R
Sbjct: 1   MMKKLPAPVPA----RQVLWLGWKLVILLSVALCFVALLRLHFSPDLSSPNSLSRPARVR 56

Query: 241 SLVYD--FSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTR 414
           S +    F G PKIAFLFL R NLPLDFLW SFFEN D ANFSIYIHS PGFVFDE TTR
Sbjct: 57  SRISGGTFDGIPKIAFLFLARFNLPLDFLWGSFFENADVANFSIYIHSAPGFVFDESTTR 116

Query: 415 SAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIYN 594
           S  F++RQL NSI+V WGE+SMI+AER+L E ALEDPANQRFVLLSDSCVPLYNFSYIY
Sbjct: 117 SLFFYDRQLTNSIQVIWGESSMIEAERLLLESALEDPANQRFVLLSDSCVPLYNFSYIYR 176

Query: 595 YVMGSPRSFVDSFL 636
           Y+M S RSFVDSFL
Sbjct: 177 YLMSSSRSFVDSFL 190

>ref|XP_007026149.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 1 [Theobroma cacao]
 gi|508781515|gb|EOY28771.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 1 [Theobroma cacao]
          Length = 282

 Score = 237 bits (605), Expect = 2e-60
 Identities = 129/194 (66%), Positives = 143/194 (73%), Gaps = 6/194 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQ----QYSQSTAALPRKSR 240
           M KK  A V      R VL LGWKL IL+SVALC  A LR+       S ++ + P + R
Sbjct: 1   MMKKLPAPVPA----RQVLWLGWKLVILLSVALCFVALLRLHFSPDLSSPNSLSRPARVR 56

Query: 241 SLVYD--FSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTR 414
           S +    F G PKIAFLFL R NLPLDFLW SFFEN D ANFSIYIHS PGFVFDE TTR
Sbjct: 57  SRISGGTFDGIPKIAFLFLARFNLPLDFLWGSFFENADVANFSIYIHSAPGFVFDESTTR 116

Query: 415 SAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIYN 594
           S  F++RQL NSI+V WGE+SMI+AER+L E ALEDPANQRFVLLSDSCVPLYNFSYIY
Sbjct: 117 SLFFYDRQLTNSIQVIWGESSMIEAERLLLESALEDPANQRFVLLSDSCVPLYNFSYIYR 176

Query: 595 YVMGSPRSFVDSFL 636
           Y+M S RSFVDSFL
Sbjct: 177 YLMSSSRSFVDSFL 190

>ref|XP_006305062.1| hypothetical protein CARUB_v10009428mg [Capsella rubella]
 gi|482573773|gb|EOA37960.1| hypothetical protein CARUB_v10009428mg [Capsella rubella]
          Length = 384

 Score = 237 bits (604), Expect = 3e-60
 Identities = 119/195 (61%), Positives = 145/195 (74%), Gaps = 7/195 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQQYSQSTAALPR-----KS 237
           MT+K Q  ++P LS R  + LGWKL I  S ALC+ A LRIQ    S A LP      +S
Sbjct: 1   MTRKPQPQIQPPLSRRGFVWLGWKLVIAFSAALCLLALLRIQLQYHSVATLPSPLSVARS 60

Query: 238 RSLVYDFSGN--PKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTT 411
            +L+ ++SG+  PK+AFLFL R++LPLDF+W+ FF+ VD ANFSIY+HS PGFVF+E TT
Sbjct: 61  HTLLREYSGDRRPKLAFLFLARRDLPLDFMWDRFFKGVDHANFSIYVHSLPGFVFNEDTT 120

Query: 412 RSAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIY 591
           RS  F+NRQL NSIKV WGE+SMI AER+L   ALED +NQRFVLLSD C PLY+F YIY
Sbjct: 121 RSQYFYNRQLNNSIKVVWGESSMIAAERLLLASALEDQSNQRFVLLSDRCAPLYDFGYIY 180

Query: 592 NYVMGSPRSFVDSFL 636
            Y++ SPRSFVDSFL
Sbjct: 181 RYLISSPRSFVDSFL 195

>ref|XP_004134777.1| PREDICTED: uncharacterized protein LOC101222689 [Cucumis sativus]
 gi|449479497|ref|XP_004155615.1| PREDICTED: uncharacterized protein LOC101225507 [Cucumis sativus]
          Length = 382

 Score = 233 bits (595), Expect = 3e-59
 Identities = 117/177 (66%), Positives = 135/177 (76%), Gaps = 4/177 (2%)
 Frame = +1

Query: 118 RHVLCLGWKLAILVSVALCVFAFLRIQQYSQST----AALPRKSRSLVYDFSGNPKIAFL 285
           R +    WKL +  S+ALC+FA + +     +T    A+L R+ R     F G PKIAFL
Sbjct: 11  RSLFWFSWKLLVTFSLALCIFALVSLHSSPSTTDLASASLSRRLRPPSDSFLGRPKIAFL 70

Query: 286 FLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTRSAIFFNRQLRNSIKVAW 465
           FL R+NLPLDFLW SFFEN D ANFSIYIHS PGFVFDE TTRS  FF RQL NSI+VAW
Sbjct: 71  FLTRRNLPLDFLWGSFFENGDVANFSIYIHSAPGFVFDESTTRSHFFFGRQLENSIQVAW 130

Query: 466 GEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIYNYVMGSPRSFVDSFL 636
           G++SMI AER+L E ALEDPANQRF+LLSDSCVPLYNFSYIY+Y+M SP+SFVDSFL
Sbjct: 131 GKSSMIAAERLLLEAALEDPANQRFILLSDSCVPLYNFSYIYSYLMASPKSFVDSFL 187

>ref|NP_172658.2| core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein [Arabidopsis thaliana]
 gi|26450342|dbj|BAC42287.1| unknown protein [Arabidopsis thaliana]
 gi|28827514|gb|AAO50601.1| unknown protein [Arabidopsis thaliana]
 gi|332190698|gb|AEE28819.1| core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein [Arabidopsis thaliana]
 gi|591402450|gb|AHL38952.1| glycosyltransferase, partial [Arabidopsis thaliana]
          Length = 383

 Score = 233 bits (594), Expect = 4e-59
 Identities = 122/195 (62%), Positives = 151/195 (77%), Gaps = 7/195 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRH-VLCLGWKLAILVSVALCVFAFLRIQ-QYSQ-STAALP---RK 234
           MTKK+Q  + P LS R  V+ LGWKL I  SVALC+ A LRIQ QY+  +T + P    +
Sbjct: 1   MTKKSQPQIPPPLSRRGGVVWLGWKLVIAFSVALCLLALLRIQLQYNSFTTLSFPLSVAR 60

Query: 235 SRSLVYDFSGN-PKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTT 411
           S++ ++ +SG+ PK+AFLFL R++LPLDF+W+ FF+ VD ANFSIYIHS PGFVF+E TT
Sbjct: 61  SQTPLHKYSGDRPKLAFLFLARRDLPLDFMWDRFFKGVDHANFSIYIHSVPGFVFNEETT 120

Query: 412 RSAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIY 591
           RS  F+NRQL NSIKV WGE+SMI+AER+L   ALED +NQRFVLLSD C PLY+F YIY
Sbjct: 121 RSQYFYNRQLNNSIKVVWGESSMIEAERLLLASALEDHSNQRFVLLSDRCAPLYDFGYIY 180

Query: 592 NYVMGSPRSFVDSFL 636
            Y++ SPRSFVDSFL
Sbjct: 181 KYLISSPRSFVDSFL 195

>gb|AAC17624.1| Contains similarity to hypothetical protein gb|U95973 from A. thaliana [Arabidopsis thaliana]
          Length = 364

 Score = 233 bits (594), Expect = 4e-59
 Identities = 122/195 (62%), Positives = 151/195 (77%), Gaps = 7/195 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRH-VLCLGWKLAILVSVALCVFAFLRIQ-QYSQ-STAALP---RK 234
           MTKK+Q  + P LS R  V+ LGWKL I  SVALC+ A LRIQ QY+  +T + P    +
Sbjct: 1   MTKKSQPQIPPPLSRRGGVVWLGWKLVIAFSVALCLLALLRIQLQYNSFTTLSFPLSVAR 60

Query: 235 SRSLVYDFSGN-PKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTT 411
           S++ ++ +SG+ PK+AFLFL R++LPLDF+W+ FF+ VD ANFSIYIHS PGFVF+E TT
Sbjct: 61  SQTPLHKYSGDRPKLAFLFLARRDLPLDFMWDRFFKGVDHANFSIYIHSVPGFVFNEETT 120

Query: 412 RSAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIY 591
           RS  F+NRQL NSIKV WGE+SMI+AER+L   ALED +NQRFVLLSD C PLY+F YIY
Sbjct: 121 RSQYFYNRQLNNSIKVVWGESSMIEAERLLLASALEDHSNQRFVLLSDRCAPLYDFGYIY 180

Query: 592 NYVMGSPRSFVDSFL 636
            Y++ SPRSFVDSFL
Sbjct: 181 KYLISSPRSFVDSFL 195

>ref|XP_002892665.1| hypothetical protein ARALYDRAFT_312224 [Arabidopsis lyrata subsp. lyrata]
 gi|297338507|gb|EFH68924.1| hypothetical protein ARALYDRAFT_312224 [Arabidopsis lyrata subsp. lyrata]
          Length = 383

 Score = 233 bits (594), Expect = 4e-59
 Identities = 120/195 (61%), Positives = 147/195 (75%), Gaps = 7/195 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRH-VLCLGWKLAILVSVALCVFAFLRIQQYSQSTAALPR-----K 234
           MT+K+Q  ++P LS R  V+ LGWKL I  SVALC+ A LRIQ    S   LP      +
Sbjct: 1   MTRKSQPQIQPPLSRRGGVVWLGWKLVIAFSVALCLLALLRIQLQYNSDTTLPSPLSVAR 60

Query: 235 SRSLVYDFSGN-PKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTT 411
           S++ ++ +SG+ PK+AFLFL R++LPLDF+W+ FF+ VD ANFSIYIHS PGFVF+E TT
Sbjct: 61  SQTPLHKYSGDRPKLAFLFLARRDLPLDFMWDRFFKGVDHANFSIYIHSLPGFVFNEETT 120

Query: 412 RSAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIY 591
           RS  F+NRQL NSIKV WGE+SMI AER+L   ALED +NQRFVLLSD C PLY+F YIY
Sbjct: 121 RSQYFYNRQLNNSIKVVWGESSMIAAERLLLASALEDHSNQRFVLLSDRCAPLYDFGYIY 180

Query: 592 NYVMGSPRSFVDSFL 636
            Y++ SPRSFVDSFL
Sbjct: 181 RYLISSPRSFVDSFL 195

>ref|XP_007026151.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 3 [Theobroma cacao]
 gi|508781517|gb|EOY28773.1| Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein isoform 3 [Theobroma cacao]
          Length = 269

 Score = 233 bits (593), Expect = 5e-59
 Identities = 129/195 (66%), Positives = 143/195 (73%), Gaps = 7/195 (3%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQ----QYSQSTAALPRKSR 240
           M KK  A V      R VL LGWKL IL+SVALC  A LR+       S ++ + P + R
Sbjct: 1   MMKKLPAPVPA----RQVLWLGWKLVILLSVALCFVALLRLHFSPDLSSPNSLSRPARVR 56

Query: 241 SLVYD--FSGNPKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTR 414
           S +    F G PKIAFLFL R NLPLDFLW SFFEN D ANFSIYIHS PGFVFDE TTR
Sbjct: 57  SRISGGTFDGIPKIAFLFLARFNLPLDFLWGSFFENADVANFSIYIHSAPGFVFDESTTR 116

Query: 415 SAIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSD-SCVPLYNFSYIY 591
           S  F++RQL NSI+V WGE+SMI+AER+L E ALEDPANQRFVLLSD SCVPLYNFSYIY
Sbjct: 117 SLFFYDRQLTNSIQVIWGESSMIEAERLLLESALEDPANQRFVLLSDSSCVPLYNFSYIY 176

Query: 592 NYVMGSPRSFVDSFL 636
            Y+M S RSFVDSFL
Sbjct: 177 RYLMSSSRSFVDSFL 191

>ref|XP_006417284.1| hypothetical protein EUTSA_v10007925mg [Eutrema salsugineum]
 gi|557095055|gb|ESQ35637.1| hypothetical protein EUTSA_v10007925mg [Eutrema salsugineum]
          Length = 381

 Score = 228 bits (581), Expect = 1e-57
 Identities = 119/193 (61%), Positives = 142/193 (73%), Gaps = 5/193 (2%)
 Frame = +1

Query: 73  MTKKAQASVKPGLSMRHVLCLGWKLAILVSVALCVFAFLRIQ-QYSQSTAALPRK---SR 240
           MT+K+Q   +  +S R V+ LGWKL I  SVALC+ A LRI  QY   T   P     S
Sbjct: 1   MTRKSQPQAQHSVSRRGVVWLGWKLVIAFSVALCLLALLRIHLQYHSVTTLAPLSVAGSH 60

Query: 241 SLVYDFSGN-PKIAFLFLVRKNLPLDFLWESFFENVDRANFSIYIHSEPGFVFDEFTTRS 417
             +  FSG+ PK+AFLFL R++LPLDFLW+ FF+  D+ANFSIYIHS PGFVF+E TTRS
Sbjct: 61  ISLRKFSGDSPKVAFLFLARRDLPLDFLWDRFFKGADQANFSIYIHSVPGFVFNEDTTRS 120

Query: 418 AIFFNRQLRNSIKVAWGEASMIQAERILFEEALEDPANQRFVLLSDSCVPLYNFSYIYNY 597
             F++RQL NSIKV WGE+SMI AER+L   ALED +NQRFVLLSD CVPLY+F YIY Y
Sbjct: 121 QYFYDRQLNNSIKVIWGESSMIAAERLLLASALEDQSNQRFVLLSDRCVPLYDFGYIYRY 180

Query: 598 VMGSPRSFVDSFL 636
           ++ SPRSFVDSFL
Sbjct: 181 LISSPRSFVDSFL 193