BLASTX nr result

ID: Mentha29_contig00020433 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00020433
         (738 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU20439.1| hypothetical protein MIMGU_mgv1a003679mg [Mimulus...   119   1e-24
emb|CBI20855.3| unnamed protein product [Vitis vinifera]               76   1e-11
gb|EYU20438.1| hypothetical protein MIMGU_mgv1a003254mg [Mimulus...    75   2e-11
ref|XP_002324801.2| hypothetical protein POPTR_0018s00290g [Popu...    74   4e-11
ref|XP_002309547.2| hypothetical protein POPTR_0006s25540g [Popu...    72   2e-10
ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g...    71   4e-10
ref|XP_002516500.1| catalytic, putative [Ricinus communis] gi|22...    69   1e-09
ref|XP_004498468.1| PREDICTED: probable glycosyltransferase At5g...    69   2e-09
ref|XP_007225154.1| hypothetical protein PRUPE_ppa002395mg [Prun...    69   2e-09
ref|XP_006476045.1| PREDICTED: probable glycosyltransferase At5g...    65   2e-08
ref|XP_006476044.1| PREDICTED: probable glycosyltransferase At5g...    65   2e-08
gb|EPS66570.1| hypothetical protein M569_08206 [Genlisea aurea]        65   2e-08
ref|XP_004498470.1| PREDICTED: probable glycosyltransferase At5g...    65   2e-08
ref|XP_007012125.1| Exostosin family protein, putative isoform 2...    64   7e-08
ref|XP_003527839.1| PREDICTED: probable glycosyltransferase At5g...    64   7e-08
ref|XP_004245169.1| PREDICTED: probable glycosyltransferase At5g...    63   9e-08
ref|XP_007012124.1| Exostosin family protein, putative isoform 1...    63   1e-07
ref|XP_006450684.1| hypothetical protein CICLE_v10007698mg [Citr...    61   4e-07
ref|XP_006412431.1| hypothetical protein EUTSA_v10024793mg [Eutr...    60   6e-07
ref|XP_003527840.1| PREDICTED: probable glycosyltransferase At5g...    60   6e-07

>gb|EYU20439.1| hypothetical protein MIMGU_mgv1a003679mg [Mimulus guttatus]
          Length = 570

 Score =  119 bits (297), Expect = 1e-24
 Identities = 88/252 (34%), Positives = 129/252 (51%), Gaps = 8/252 (3%)
 Frame = +1

Query: 1   VQHIELPYGYVMSSLLSIGRXXXXXXXXXXXXXXXXXXDGSGVVFENVTKDEKELGNGKN 180
           VQ+IELPYGYV+SS LS  +                          N  K      +  N
Sbjct: 20  VQYIELPYGYVLSSFLSFTKSR-----------------------SNSAKTNHARDSSVN 56

Query: 181 STDSLSSDDYVDQHSSTDFLETNHPRNESLAAE-TEIEHVHALPVSSYTSVRNSSAESPQ 357
            T   SSD++V              RN+S   E   ++H + + +  Y S +NSS+   Q
Sbjct: 57  ITS--SSDEHV-------------VRNKSSTPELAPVQHGYNVSIPKYISSKNSSSVDLQ 101

Query: 358 GPNAAPSSITEYNGSNKSTASTVEDPALSVQNKEKTSLGPSQLSTEVNSSAIRRFPE--L 531
             NAAP++  + + ++++T   V +  L+  N++   +  S LS  VNSS +        
Sbjct: 102 LTNAAPTNTKKKDRNSQTTIPQVANMVLN--NEKPKKIAQSDLSASVNSSFVNGTSSKMR 159

Query: 532 KRFKGAPSVVVPISKMNDLLSESRASYHSTKARWPSQADKELLYAKKLIETADSVKKN-- 705
           +RFKG PS VVP+S+MN +L+ESR S+ S K RWPS+ DKELL A+  IE+A  V+ +  
Sbjct: 160 RRFKGPPSRVVPMSEMNHMLTESRLSFRSVKPRWPSEVDKELLNARTQIESAQIVETSPH 219

Query: 706 ---NVYRNFSAF 732
              +VYRNFS+F
Sbjct: 220 IDVSVYRNFSSF 231


>emb|CBI20855.3| unnamed protein product [Vitis vinifera]
          Length = 618

 Score = 76.3 bits (186), Expect = 1e-11
 Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 30/276 (10%)
 Frame = +1

Query: 1   VQHIELPYGYVMSSLLSIG------RXXXXXXXXXXXXXXXXXXDGSGVVFENVT----- 147
           VQ+ ELPYG V+SSL S G      +                  +G    F +V      
Sbjct: 30  VQYFELPYGDVLSSLFSAGDIPAPGKTSLPSSDSFNAETMEGNNEGPKNDFASVMNGALD 89

Query: 148 ------KDEKELGNGK-NSTDSLSSDDYVDQHSSTDFLETNHPRNESLAAETEIEHVHAL 306
                 +D K +   K N++ + S+     +H S+ +LE     + S   + + + +  L
Sbjct: 90  KSFGLDEDNKNVTVEKVNNSGNRSALKNASKHESSLYLENITADSNSSLGKIQEDDMALL 149

Query: 307 PVSSYTSVRNSSAESPQGPNAAPSSITE-------YNGSNKSTASTVEDPALSVQNKEKT 465
              S  S     +  P  P    SS T        +  +     S+VE+ A    NK++ 
Sbjct: 150 SQRSERSGVGLISPLPALPQIISSSNTTSLTNLDPHPITLPPERSSVEEDAAHTLNKDEK 209

Query: 466 SLGPSQLSTEVNSSAIRRFPELKRFKGAPSVVVPISKMNDLLSESRASYHSTKARWPSQA 645
           +    +  T  N S+I   P L+     P+V   IS+MNDLL +SRAS  S K RW S  
Sbjct: 210 AETSQKDLTLSNRSSIS-VPALETRPELPAVTT-ISEMNDLLVQSRASSRSMKPRWSSAV 267

Query: 646 DKELLYAKKLIETADSVKKN-----NVYRNFSAFKK 738
           DKELLYAK  IE A  +K +     ++YRN S FK+
Sbjct: 268 DKELLYAKSQIENAPIIKNDPGLHASLYRNVSVFKR 303


>gb|EYU20438.1| hypothetical protein MIMGU_mgv1a003254mg [Mimulus guttatus]
          Length = 597

 Score = 75.1 bits (183), Expect = 2e-11
 Identities = 67/257 (26%), Positives = 104/257 (40%), Gaps = 11/257 (4%)
 Frame = +1

Query: 1   VQHIELPYGYVMSSLLSIGRXXXXXXXXXXXXXXXXXXDGSG----VVFENVTKDEKELG 168
           +Q  ELPY  V SSL    +                    +       F +      ++ 
Sbjct: 30  IQFFELPYTNVFSSLFPTEKSQTPPLESYPSTNSSSSRSPTNSRIPTYFSSTNSTNLDVA 89

Query: 169 NGKNSTDSLSSDDYVDQHSSTDFLETNHPRNESLAAETEIEHVHALPVSSYTSVRNSSAE 348
           NG ++T   S  +  D+  S         +N   +   +I+     P   +  + ++S  
Sbjct: 90  NGASNTTKKSDRNDFDREKS---------KNSDDSLNDDIDPEDESPFKDFPEMDHNSTV 140

Query: 349 S--PQGPNAAPSSITEYNGSNKSTASTVEDPALSVQNKEKTSLGPSQLSTEVNSSAIRRF 522
           +  P   +       E +  +  T+S +            TS G  +   + N  A +  
Sbjct: 141 AWIPLNDSFPLEKSRETHHESLETSSPI------------TSTGKDEKVNDENPKAKKSA 188

Query: 523 PELKRFKGAPSVVVPISKMNDLLSESRASYHSTKARWPSQADKELLYAKKLIETADSVKK 702
           P  K  + A   VV ISKMND+L +SR SY   K +WPS AD+ELL   +LIE A  + K
Sbjct: 189 PVTKETEPA---VVTISKMNDMLKQSRVSYKPVKTKWPSTADQELLSMTRLIENASFIGK 245

Query: 703 N-----NVYRNFSAFKK 738
           +     N+YRNFS FK+
Sbjct: 246 DPHFDANLYRNFSEFKR 262


>ref|XP_002324801.2| hypothetical protein POPTR_0018s00290g [Populus trichocarpa]
           gi|550317697|gb|EEF03366.2| hypothetical protein
           POPTR_0018s00290g [Populus trichocarpa]
          Length = 707

 Score = 74.3 bits (181), Expect = 4e-11
 Identities = 64/223 (28%), Positives = 110/223 (49%), Gaps = 22/223 (9%)
 Frame = +1

Query: 136 ENVTKDEKELGNGKNSTDSLSSDDYVDQHSSTDFLETNHPRNESLAAETEI----EHVHA 303
           E +    K LG   +  +S S +  +DQ+ ++  LE NH  N S + ET+     E++ +
Sbjct: 148 EGIKGLNKSLGIDNHGRES-SPEQLLDQNENST-LELNHSGNGSASIETDRSLFRENITS 205

Query: 304 LPVSSYTSVRNSSAESPQGP------------NAAPSSITEYNGSNKSTASTVEDPALSV 447
              ++ TS    +  +P  P            NA PS++        +T+ T +D +  +
Sbjct: 206 TSENTGTSQAGITPIAPALPPVDSPTNIAIPRNAEPSTLAPVVPVESNTSKTDKDASHGL 265

Query: 448 QNKEKTSLGPSQLSTEVNSSAIRRFPELKRFKGAPS-VVVPISKMNDLLSESRASYHSTK 624
           +N  K     +  ++  N++++    E+K+    PS  V+ IS+MN+L  +S +S  S +
Sbjct: 266 ENDGKAGEQLNNSTSLQNNTSVTSVREVKKEPHTPSPAVISISEMNNLQLQSWSSPISRR 325

Query: 625 ARWPSQADKELLYAKKLIETADSVKKNN-----VYRNFSAFKK 738
            RWPS  D+ELL AK  I+ A  V+ ++     +YRN S FKK
Sbjct: 326 PRWPSAVDQELLNAKSQIQKAPLVESDSMLYAPLYRNISMFKK 368


>ref|XP_002309547.2| hypothetical protein POPTR_0006s25540g [Populus trichocarpa]
           gi|550337072|gb|EEE93070.2| hypothetical protein
           POPTR_0006s25540g [Populus trichocarpa]
          Length = 705

 Score = 72.4 bits (176), Expect = 2e-10
 Identities = 63/205 (30%), Positives = 96/205 (46%), Gaps = 6/205 (2%)
 Frame = +1

Query: 142 VTKDEKELGNGKNSTDSLSSDDYVDQHSSTDFLETNHPRNESLAAETEIEHVHALPVSSY 321
           V +    LGNG    ++  S    D  S ++ +  +  R   +A E        LPV S 
Sbjct: 176 VMEPVNSLGNGSAPQETERSLSREDVTSISENIGASDARIAPIAPEL-------LPVDSP 228

Query: 322 TSVRNSSAESPQGPNAAPSSITEYNGSNKSTASTVEDPALSVQNKEKTSLGPSQLSTEVN 501
            ++           NA PS+I        +T+   +D A S++N  KT      L+   N
Sbjct: 229 PNITLQM-------NAEPSTIAHIVPIESNTSKVDKDAAPSLENDGKTGDQKKDLTLLHN 281

Query: 502 SSAIRRFPELKRFKGAPSV-VVPISKMNDLLSESRASYHSTKARWPSQADKELLYAKKLI 678
           + ++  FPE+K+    PS+ VV IS+M +L  +  +S +S + RWPS  D+ELL AK  I
Sbjct: 282 NPSVTSFPEVKKEPQTPSLEVVSISEMKNLQLQRWSSPNSRRPRWPSVVDQELLNAKSQI 341

Query: 679 ETADSVKKNNV-----YRNFSAFKK 738
           + A  V+ + V     Y N S FKK
Sbjct: 342 QNAPIVENDPVLYAPLYWNISMFKK 366


>ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
          Length = 675

 Score = 70.9 bits (172), Expect = 4e-10
 Identities = 48/112 (42%), Positives = 64/112 (57%), Gaps = 5/112 (4%)
 Frame = +1

Query: 418 STVEDPALSVQNKEKTSLGPSQLSTEVNSSAIRRFPELKRFKGAPSVVVPISKMNDLLSE 597
           S+VE+ A    NK++ +    +  T  N S+I   P L+     P+V   IS+MNDLL +
Sbjct: 222 SSVEEDAAHTLNKDEKAETSQKDLTLSNRSSIS-VPALETRPELPAVTT-ISEMNDLLVQ 279

Query: 598 SRASYHSTKARWPSQADKELLYAKKLIETADSVKKN-----NVYRNFSAFKK 738
           SRAS  S K RW S  DKELLYAK  IE A  +K +     ++YRN S FK+
Sbjct: 280 SRASSRSMKPRWSSAVDKELLYAKSQIENAPIIKNDPGLHASLYRNVSVFKR 331


>ref|XP_002516500.1| catalytic, putative [Ricinus communis] gi|223544320|gb|EEF45841.1|
           catalytic, putative [Ricinus communis]
          Length = 456

 Score = 69.3 bits (168), Expect = 1e-09
 Identities = 43/121 (35%), Positives = 65/121 (53%), Gaps = 6/121 (4%)
 Frame = +1

Query: 394 NGSNKSTASTVEDPALSVQNKEKTS-LGPSQLSTEVNSSAIRRFPELKRFKGAPSVVVPI 570
           N  +  T ST+        NKE  S L  S  S   + S+  +   +K+ K  P+ V  I
Sbjct: 7   NNRSSDTVSTI--------NKEGNSGLVSSNSSVSSSDSSASKASAMKKSKKPPTRVFSI 58

Query: 571 SKMNDLLSESRASYHSTKARWPSQADKELLYAKKLIETADSVKKNNV-----YRNFSAFK 735
           S+MND L +SRAS++S +  WP + D++L++A+  IE A  VK + V     YRN S F+
Sbjct: 59  SQMNDFLRQSRASFNSVRPHWPLEVDQQLMFARSQIENAPGVKNDTVLYAPIYRNVSMFE 118

Query: 736 K 738
           +
Sbjct: 119 R 119


>ref|XP_004498468.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
           [Cicer arietinum] gi|502124301|ref|XP_004498469.1|
           PREDICTED: probable glycosyltransferase At5g03795-like
           isoform X2 [Cicer arietinum]
          Length = 612

 Score = 68.9 bits (167), Expect = 2e-09
 Identities = 76/269 (28%), Positives = 108/269 (40%), Gaps = 23/269 (8%)
 Frame = +1

Query: 1   VQHIELPYGYVMSSLLSIGRXXXXXXXXXXXXXXXXXXDGSGVVFENVTKDEKELGNGKN 180
           +Q++ELPYG V+ SL S  +                    S  +  NVT     + N  N
Sbjct: 30  LQYLELPYGNVLLSLFSSDKIPTSQSTI------------SQTIVTNVT-----IFNESN 72

Query: 181 STDSLSSDDYVDQHSSTDF-LETNHPRN------ESLAAETEI--EHVHALPVSSYTSVR 333
           STD+       D      F LE   P+N      ES   ET I  E  +  P+S++    
Sbjct: 73  STDATK-----DTMPIIGFVLEPEWPQNKSEKFHESGNDETNIHQEEQNITPLSTHEHKS 127

Query: 334 NSSAESPQGPNAAPSSITEYNGSNKSTASTVEDPALSVQNK-----EKTSLGPSQLSTEV 498
                    P  AP+++    G   + +  +    LS  +      +K S  PSQ    +
Sbjct: 128 GRLDPYSPSPENAPTNLAPPFGFKTNVSPNITTAMLSNDDNIANSIKKESFRPSQNGGNI 187

Query: 499 NSSAIRRFPELKRFKGAPSVVVP----ISKMNDLLSESRASYHSTKARWPSQADKELLYA 666
              +      + +        +P    IS+MN LL +S ASY S K RW S  D+ELL A
Sbjct: 188 QGKSSSISSNVPKENQESHTQIPEVTTISEMNKLLFQSHASYRSMKPRWFSDVDQELLQA 247

Query: 667 KKLIETADSVKKNN-----VYRNFSAFKK 738
           +  IE A  VK N      +Y N S FK+
Sbjct: 248 RSEIENAPIVKNNQNLYGPIYHNVSMFKR 276


>ref|XP_007225154.1| hypothetical protein PRUPE_ppa002395mg [Prunus persica]
           gi|462422090|gb|EMJ26353.1| hypothetical protein
           PRUPE_ppa002395mg [Prunus persica]
          Length = 678

 Score = 68.6 bits (166), Expect = 2e-09
 Identities = 64/217 (29%), Positives = 102/217 (47%), Gaps = 13/217 (5%)
 Frame = +1

Query: 127 VVFENVTKDEKELGNGKNSTDSLSSDDYVDQHSSTD--FLETNHPRNESLAAETEIEHVH 300
           ++ EN+   E            +SS   V++ ++TD  +LE     NE+   +  +    
Sbjct: 135 IIVENIKPLETNFAQEGGREPEVSS---VEKKNTTDNTYLE-GRIGNENNTVDV-VNSTA 189

Query: 301 ALPVSSYTSVRNSSAESPQGPNAAPSSITEYNGS-----NKSTASTVEDPALSVQNKEKT 465
            LPVSS      +S+     P+ AP+      G+     + +  S  +D     +  E +
Sbjct: 190 GLPVSSPAPPMMNSS-----PSTAPAIFETNVGAPIKSVDSNVTSVEKDRTTPSEKTENS 244

Query: 466 SLGPSQLSTEVNSSAIRRFPELKRFKGAPSV-VVPISKMNDLLSESRASYHSTKARWPSQ 642
               S L+   ++S++ R PE+K     P + V  IS MN+LL +SRASY+S  A+W S 
Sbjct: 245 EQLHSDLNQTEHNSSMTRVPEVKIEPEVPILDVYSISDMNNLLLQSRASYNSMLAQWSSP 304

Query: 643 ADKELLYAKKLIETADSVKKNN-----VYRNFSAFKK 738
           AD+EL Y    IE A  +K +      +YRN S FK+
Sbjct: 305 ADQELQYVASQIENAPIIKSDPTLYALLYRNLSVFKR 341


>ref|XP_006476045.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X2
           [Citrus sinensis]
          Length = 663

 Score = 65.5 bits (158), Expect = 2e-08
 Identities = 64/210 (30%), Positives = 107/210 (50%), Gaps = 14/210 (6%)
 Frame = +1

Query: 151 DEKELGNGKNST-DSLSSDDYVD-----QHSSTDFLETNHPRNESLAAETEIEHVHALPV 312
           +EK  G  KNST D++ +   V      + S   F++    RN+ +  ++    +  +PV
Sbjct: 120 NEKLEGLNKNSTVDTVQNAGNVPGPEKGRESEQSFIQ----RNDIMGGDSGGVGLSPIPV 175

Query: 313 SSYTSVRNSSAESPQGPNAAPSSITEYNGSNKSTASTVEDPALSVQNKEKTSLGPSQLST 492
           S    +  SS  + QG N + + IT ++ S+    ST +D   ++   EK    P+Q S 
Sbjct: 176 SPVMDL--SSNITLQGANIS-TPITIHSNSS----STDKDATPALDKIEK----PAQSSL 224

Query: 493 EV---NSSAIRRFPELKRFKGAPSVVVPISKMNDLLSESRASYHSTKARWPSQADKELLY 663
                NSS +    E K+ +     V+ I++M ++L ++RASY S + RW S  D+E+LY
Sbjct: 225 NTLGENSSGVDVPKENKKPEIPTPAVITIAEMKNMLLQNRASYRSMRPRWSSAVDQEMLY 284

Query: 664 AKKLIETADSVKKNN-----VYRNFSAFKK 738
           A+  IE A  +K ++     +YRN S FK+
Sbjct: 285 ARSQIENAPLLKNDHELYAPLYRNVSRFKR 314


>ref|XP_006476044.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
           [Citrus sinensis]
          Length = 670

 Score = 65.5 bits (158), Expect = 2e-08
 Identities = 64/210 (30%), Positives = 107/210 (50%), Gaps = 14/210 (6%)
 Frame = +1

Query: 151 DEKELGNGKNST-DSLSSDDYVD-----QHSSTDFLETNHPRNESLAAETEIEHVHALPV 312
           +EK  G  KNST D++ +   V      + S   F++    RN+ +  ++    +  +PV
Sbjct: 120 NEKLEGLNKNSTVDTVQNAGNVPGPEKGRESEQSFIQ----RNDIMGGDSGGVGLSPIPV 175

Query: 313 SSYTSVRNSSAESPQGPNAAPSSITEYNGSNKSTASTVEDPALSVQNKEKTSLGPSQLST 492
           S    +  SS  + QG N + + IT ++ S+    ST +D   ++   EK    P+Q S 
Sbjct: 176 SPVMDL--SSNITLQGANIS-TPITIHSNSS----STDKDATPALDKIEK----PAQSSL 224

Query: 493 EV---NSSAIRRFPELKRFKGAPSVVVPISKMNDLLSESRASYHSTKARWPSQADKELLY 663
                NSS +    E K+ +     V+ I++M ++L ++RASY S + RW S  D+E+LY
Sbjct: 225 NTLGENSSGVDVPKENKKPEIPTPAVITIAEMKNMLLQNRASYRSMRPRWSSAVDQEMLY 284

Query: 664 AKKLIETADSVKKNN-----VYRNFSAFKK 738
           A+  IE A  +K ++     +YRN S FK+
Sbjct: 285 ARSQIENAPLLKNDHELYAPLYRNVSRFKR 314


>gb|EPS66570.1| hypothetical protein M569_08206 [Genlisea aurea]
          Length = 231

 Score = 65.1 bits (157), Expect = 2e-08
 Identities = 43/105 (40%), Positives = 64/105 (60%), Gaps = 5/105 (4%)
 Frame = +1

Query: 439 LSVQNKEKTSLGPSQ-LSTEVNSSAIRRFPELKRFKGAPSVVVPISKMNDLLSESRASYH 615
           +S Q   K S G +  +  E NS+A    PELK         + +S+M D+L +SR+SY 
Sbjct: 139 ISSQAPPKISNGNAPPVPMEENSTA----PELK--------TMSVSQMKDMLLQSRSSYP 186

Query: 616 STKARWPSQADKELLYAKKLIETADSVKKN----NVYRNFSAFKK 738
           S K +WPS AD+EL +AK LI TA +++++    ++YRNFS FK+
Sbjct: 187 SPKPKWPSIADQELFHAKTLITTAPNIEQDPRIVSLYRNFSEFKR 231


>ref|XP_004498470.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X3
           [Cicer arietinum]
          Length = 553

 Score = 65.1 bits (157), Expect = 2e-08
 Identities = 75/258 (29%), Positives = 111/258 (43%), Gaps = 12/258 (4%)
 Frame = +1

Query: 1   VQHIELPYGYVMSSLLSIGRXXXXXXXXXXXXXXXXXXDGSGVVFENVTKDEKELGNGKN 180
           +Q++ELPYG V+ SL S  +                    S  +  NVT     + N  N
Sbjct: 30  LQYLELPYGNVLLSLFSSDKIPTSQSTI------------SQTIVTNVT-----IFNESN 72

Query: 181 STDSLSSDDYVDQHSSTDF-LETNHPRNESLAAETEIEHVHALPVSSYTSVRNSSAESPQ 357
           STD+       D      F LE   P+N+S   E  ++     P ++ T++       P 
Sbjct: 73  STDATK-----DTMPIIGFVLEPEWPQNKS---EKFLDPYSPSPENAPTNLA-----PPF 119

Query: 358 G--PNAAPSSITEYNGSNKSTASTVEDPALSV-QNKEKTSLGPSQLSTEV---NSSAIRR 519
           G   N +P+  T    ++ + A++++  +    QN        S +S+ V   N  +  +
Sbjct: 120 GFKTNVSPNITTAMLSNDDNIANSIKKESFRPSQNGGNIQGKSSSISSNVPKENQESHTQ 179

Query: 520 FPELKRFKGAPSVVVPISKMNDLLSESRASYHSTKARWPSQADKELLYAKKLIETADSVK 699
            PE          V  IS+MN LL +S ASY S K RW S  D+ELL A+  IE A  VK
Sbjct: 180 IPE----------VTTISEMNKLLFQSHASYRSMKPRWFSDVDQELLQARSEIENAPIVK 229

Query: 700 KNN-----VYRNFSAFKK 738
            N      +Y N S FK+
Sbjct: 230 NNQNLYGPIYHNVSMFKR 247


>ref|XP_007012125.1| Exostosin family protein, putative isoform 2 [Theobroma cacao]
           gi|508782488|gb|EOY29744.1| Exostosin family protein,
           putative isoform 2 [Theobroma cacao]
          Length = 788

 Score = 63.5 bits (153), Expect = 7e-08
 Identities = 58/231 (25%), Positives = 99/231 (42%), Gaps = 39/231 (16%)
 Frame = +1

Query: 163 LGNGKNSTDSLSSDDYVD--QHSSTDFLETNHPRNESLAAETEIEHV------------H 300
           L  G  S+   S++ +VD  ++S+ D+ E+ +      A++TE                +
Sbjct: 221 LDEGSTSSRESSTEQFVDLNKNSTVDYAESFNKTVAEEASKTEESFSLKNDTIDVNTSNN 280

Query: 301 ALPVSSYTSVRNSSAESPQGPNAAPSSITEYNGS--------------------NKSTAS 420
            +   ++TS   S+  S  G  +   ++T  N S                    N ST+S
Sbjct: 281 NIGNGNFTSSAESTGSSDTGLGSPLPALTPTNSSTNKTLENDVETNIQTPVVSVNSSTSS 340

Query: 421 TVEDPALSVQNKEKTSLGPSQLSTEVNSSAIRRFPELKRFKGAPSVVVPISKMNDLLSES 600
             +    S    EK     +  +T  ++S+    P++ +    P  +  I+ MN+L  +S
Sbjct: 341 LEQHVTPSFDKNEKVEEIKNNFTTSSDNSSPTNTPKVGKKPEMPPALTTIADMNNLFYQS 400

Query: 601 RASYHSTKARWPSQADKELLYAKKLIETADSVKKN-----NVYRNFSAFKK 738
           R SY+S   RW S AD+ LL A+  IE A  VK +      ++RN S FK+
Sbjct: 401 RVSYYSKTPRWSSGADQVLLNARSQIENAPIVKNDPRLYAPLFRNVSMFKR 451


>ref|XP_003527839.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
           [Glycine max] gi|571459562|ref|XP_006581450.1|
           PREDICTED: probable glycosyltransferase At5g03795-like
           isoform X2 [Glycine max]
          Length = 637

 Score = 63.5 bits (153), Expect = 7e-08
 Identities = 65/222 (29%), Positives = 97/222 (43%), Gaps = 14/222 (6%)
 Frame = +1

Query: 115 DGSGVVFENVTKDEKELGNGKNSTDSLSSDDYVDQHSSTDFLETNHPRN-ESLAAETE-- 285
           D +     N T+  +E G   N+   L ++   +   S  F ETN     ES+       
Sbjct: 83  DENAFEIANETRTSEEKGTVSNT--GLITEPGRESSRSLGFDETNESSTVESIEISNNGS 140

Query: 286 -IEHVHALPVSSYTSVRNSSAESPQGPN--AAPSSITEYNGSNKSTASTVEDPALSVQNK 456
             E      +S Y +  +SS      P   A P S TE + +  S  S+ +        +
Sbjct: 141 ATEQTGKFGLSIYNNTISSSPSHAIIPTNLAPPLSPTEVSPNITSPMSSNDYDETDFAEE 200

Query: 457 EKTSLGPSQLSTEVNSSAIRRFPELKRFKGAP-SVVVPISKMNDLLSESRASYHSTKARW 633
           E+      + +   N+S+I   P+  +    P   V  IS+MN+LL ++RASY S + RW
Sbjct: 201 ERFKPSKDEFNIVGNNSSINSVPKETKGSQIPLPEVTTISEMNELLLQNRASYRSMRPRW 260

Query: 634 PSQADKELLYAKKLIETADSVKKNNV-------YRNFSAFKK 738
            S  D+ELL A+  IE A  V  NNV       +RN S FK+
Sbjct: 261 SSAVDQELLQARLEIENAPIV--NNVENLYAPLFRNISRFKR 300


>ref|XP_004245169.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform 1
           [Solanum lycopersicum] gi|460399281|ref|XP_004245170.1|
           PREDICTED: probable glycosyltransferase At5g03795-like
           isoform 2 [Solanum lycopersicum]
          Length = 647

 Score = 63.2 bits (152), Expect = 9e-08
 Identities = 69/295 (23%), Positives = 115/295 (38%), Gaps = 49/295 (16%)
 Frame = +1

Query: 1   VQHIELPYGYVMSSLLSIGRXXXXXXXXXXXXXXXXXXDGSGVVFENVTKDEKELGN--- 171
           +Q+   PYGY +SS+ +                     D SG  F +   +E E G+   
Sbjct: 30  IQYFGFPYGYALSSIFTAN---------GGQISSSQRVDQSGTKFSDANNEEVEDGSMPP 80

Query: 172 -GKNSTDSLSSDDYVDQHSSTDFLETNHP---------RNESLAAETEIEHVHALPVSSY 321
             + S D  +  + +D    + F ++            RN SL  E  ++  + L  S+ 
Sbjct: 81  MNERSGDGDTLTEDIDPEDESPFKDSKLDNKSNVETLGRNSSLPPEKAVDSENDLQASNG 140

Query: 322 TSVRNSS--AESPQGPNAAPSSITEYNGSNKSTASTVEDPAL------------------ 441
           TS  + S   ++  G + +P+ +   +     T  ++  P L                  
Sbjct: 141 TSESSLSRVVDTDGGGSISPAPMEAKSWEISPTVLSIAPPPLVVTPQVNLDAKKEAPLIT 200

Query: 442 SVQN-----------KEKTSLGPSQLSTEVNSSAIRRFPELKRFKGAPSVVVPISKMNDL 588
           S QN           +E  +L P Q  T+   +   + P +K        VV I++MN +
Sbjct: 201 SYQNVSEKEGNTGHLRESDNL-PVQKHTDHAPTVGHKIPVMKESDKPIDSVVSIAEMNVM 259

Query: 589 LSESRASYHSTKARWPSQADKELLYAKKLIETADSVKKN-----NVYRNFSAFKK 738
             E+R S+HS   RW S  D+ELL+AK +IE +            +YRN S F +
Sbjct: 260 QQEARTSFHSMIPRWSSDVDQELLHAKNVIENSPLAGNEPGLYAPLYRNMSVFMR 314


>ref|XP_007012124.1| Exostosin family protein, putative isoform 1 [Theobroma cacao]
           gi|508782487|gb|EOY29743.1| Exostosin family protein,
           putative isoform 1 [Theobroma cacao]
          Length = 802

 Score = 62.8 bits (151), Expect = 1e-07
 Identities = 58/230 (25%), Positives = 98/230 (42%), Gaps = 39/230 (16%)
 Frame = +1

Query: 163 LGNGKNSTDSLSSDDYVD--QHSSTDFLETNHPRNESLAAETEIEHV------------H 300
           L  G  S+   S++ +VD  ++S+ D+ E+ +      A++TE                +
Sbjct: 221 LDEGSTSSRESSTEQFVDLNKNSTVDYAESFNKTVAEEASKTEESFSLKNDTIDVNTSNN 280

Query: 301 ALPVSSYTSVRNSSAESPQGPNAAPSSITEYNGS--------------------NKSTAS 420
            +   ++TS   S+  S  G  +   ++T  N S                    N ST+S
Sbjct: 281 NIGNGNFTSSAESTGSSDTGLGSPLPALTPTNSSTNKTLENDVETNIQTPVVSVNSSTSS 340

Query: 421 TVEDPALSVQNKEKTSLGPSQLSTEVNSSAIRRFPELKRFKGAPSVVVPISKMNDLLSES 600
             +    S    EK     +  +T  ++S+    P++ +    P  +  I+ MN+L  +S
Sbjct: 341 LEQHVTPSFDKNEKVEEIKNNFTTSSDNSSPTNTPKVGKKPEMPPALTTIADMNNLFYQS 400

Query: 601 RASYHSTKARWPSQADKELLYAKKLIETADSVKKN-----NVYRNFSAFK 735
           R SY+S   RW S AD+ LL A+  IE A  VK +      ++RN S FK
Sbjct: 401 RVSYYSKTPRWSSGADQVLLNARSQIENAPIVKNDPRLYAPLFRNVSMFK 450


>ref|XP_006450684.1| hypothetical protein CICLE_v10007698mg [Citrus clementina]
           gi|557553910|gb|ESR63924.1| hypothetical protein
           CICLE_v10007698mg [Citrus clementina]
          Length = 652

 Score = 61.2 bits (147), Expect = 4e-07
 Identities = 74/288 (25%), Positives = 114/288 (39%), Gaps = 43/288 (14%)
 Frame = +1

Query: 4   QHIELPYGYVMSSLLSIGRXXXXXXXXXXXXXXXXXXDGSGVVFENVTKDEKELGNGKNS 183
           Q+ ELPY +V+SS+ S G+                         E+ ++   +  NG NS
Sbjct: 31  QYFELPYDHVLSSVFSTGKVPAPAVENNSLVTGGP---------ESKSEIASDTANGLNS 81

Query: 184 TDSLSSDDYV---------DQHSSTDFL--ETNH--PRNESLAAETEIEHVHALPVSSYT 324
           T + +  +           D +   DF   E NH  P  E L    +   V  +  +   
Sbjct: 82  TGTHNVHEMANDTRTSKAEDANLQDDFYDGEDNHEEPMTEKLEELNKNSTVDTVQNAGNG 141

Query: 325 SVRNSSAESPQ---------GPNAAP---------SSITEYNGSNKSTASTVEDPALSVQ 450
                  ES Q         G   +P         SS     G+N ST  T+   + S  
Sbjct: 142 PGPEKGRESEQSFIQRNDSGGAGLSPIPVSPVMDLSSNITLQGANISTPITIHSNSSSTD 201

Query: 451 NKEKTSLG----PSQLSTEV---NSSAIRRFPELKRFKGAPSVVVPISKMNDLLSESRAS 609
                +L     P+Q S      NSS +    E K+ +     V+ I++M ++L ++RAS
Sbjct: 202 KDATPALDKIEKPAQSSLNTLGENSSGVDVPKENKKPEIPTPAVITIAEMKNMLLQNRAS 261

Query: 610 YHSTKARWPSQADKELLYAKKLIETADSVKKNN-----VYRNFSAFKK 738
           Y S   R  S  D+E+LYA+  IE A  +K ++     +YRN S FK+
Sbjct: 262 YRSMSPRLSSAVDQEMLYARSQIENAPLLKNDHELYAPLYRNVSRFKR 309


>ref|XP_006412431.1| hypothetical protein EUTSA_v10024793mg [Eutrema salsugineum]
           gi|557113601|gb|ESQ53884.1| hypothetical protein
           EUTSA_v10024793mg [Eutrema salsugineum]
          Length = 566

 Score = 60.5 bits (145), Expect = 6e-07
 Identities = 57/215 (26%), Positives = 96/215 (44%), Gaps = 10/215 (4%)
 Frame = +1

Query: 124 GVVFENVTKDEKELGNGKNSTDSLSSDDYVDQHSSTDFLETNHPRNESLAAET----EIE 291
           G +F   ++D+  + +   ST+ +         S T+ L  +  R+  +A E     E +
Sbjct: 27  GAIFSLPSEDKFPISSINGSTEPIRP------LSGTEKLNFSSSRSVEVAKEERTGLEED 80

Query: 292 HVHALPVSSYTSVRNSSAESPQGPNAAPSSITEYNGSNKSTASTVEDPALSVQNKEKTSL 471
           H+     S      +S  E  +        +   + SN S +  VED  +  +N     +
Sbjct: 81  HIIGSDKSESIESHDSFIEDAEDKETLELLLRTGSSSNGSYSDIVEDADIVNENSRNVEI 140

Query: 472 GPSQLSTEVNSSAIRRFPELKRFKGAP-SVVVPISKMNDLLSESRASYHSTKARWPSQAD 648
             S+  T V++ +    PE+KR      S VVPI++M +LL +SR S+ S K +  S  D
Sbjct: 141 LESKSDTSVDNLS----PEVKRLMNVSNSGVVPITEMMNLLHQSRTSHVSLKLKRSSAVD 196

Query: 649 KELLYAKKLIETADSVKKN-----NVYRNFSAFKK 738
            ELL+A+  IE    V+ +      +Y N S FK+
Sbjct: 197 TELLFARTQIENPPMVENDPLLHGPLYWNLSMFKR 231


>ref|XP_003527840.1| PREDICTED: probable glycosyltransferase At5g03795-like isoform X1
           [Glycine max] gi|571459560|ref|XP_006581449.1|
           PREDICTED: probable glycosyltransferase At5g03795-like
           isoform X2 [Glycine max]
          Length = 633

 Score = 60.5 bits (145), Expect = 6e-07
 Identities = 64/217 (29%), Positives = 93/217 (42%), Gaps = 11/217 (5%)
 Frame = +1

Query: 121 SGVVFENVTKDEKELGNGKNSTDSLSSDDYVDQHSSTDFLETNHPRN-ESLAAETEIEHV 297
           S  V E  T +EK+  +G      L  +       S  F ET      +S     E  H 
Sbjct: 85  SATVNETRTSEEKDNVSGTGLISELGRES----SRSLGFNETEESSTVQSTRISNESTHA 140

Query: 298 HALPVSSYTSVRNSSAESPQGPN--AAPSSITEYNGSNKSTASTVEDPALSVQNKEKTSL 471
             L +SSY    + S      P   A P S T+ + +     S+ +         EK   
Sbjct: 141 GNLGLSSYNDTISHSPSRAIIPTNLAPPLSPTKVSPNITPPMSSNDHEETDFAEDEKLRP 200

Query: 472 GPSQLSTEVNSSAIRRFPELKRFKGAPSV---VVPISKMNDLLSESRASYHSTKARWPSQ 642
               ++   ++S I      K+ KG+      V  IS+MN+LL ++RAS+HS + RW S 
Sbjct: 201 VQDDVNILRHNSPINSVAP-KKTKGSQKPLPEVTTISEMNELLLQNRASFHSERPRWSSI 259

Query: 643 ADKELLYAKKLIETADSVKKN-NVY----RNFSAFKK 738
            D+ELL A+  IE A  V  + N+Y    RN S FK+
Sbjct: 260 VDQELLQARSEIENAQIVNDDVNLYAPLFRNVSRFKR 296

