BLASTX nr result

ID: Cephaelis21_contig00011144 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00011144
         (1231 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAK55746.1| UDP-glucose glucosyltransferase [Gardenia jasmin...   512   e-143
dbj|BAF75901.1| tetrahydroxychalcone 2'-glucosyltransferase [Cat...   460   e-127
dbj|BAD29721.1| UDP-glucose glucosyltransferase [Catharanthus ro...   442   e-121
dbj|BAF96582.1| lignan glucosyltransferase [Sesamum indicum]          441   e-121
dbj|BAF96581.1| lignan glucosyltransferase [Sesamum alatum]           441   e-121
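
The summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself: the names `SUMMARY_LINE`, `parse_evalue`, and `parse_summary_line` are hypothetical helpers introduced here for illustration. It assumes the legacy layout shown above (identifier, description, bit score, E-value, separated by whitespace) and that an E-value printed without a mantissa, such as `e-143`, stands for `1e-143`.

```python
import re

# Hypothetical pattern for one line of the legacy BLAST hit summary table:
# identifier, free-text description, bit score, E-value.
SUMMARY_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")


def parse_evalue(text):
    """Convert an E-value string to float.

    Legacy BLAST prints very small E-values without a mantissa ("e-143"),
    which float() rejects, so a leading "1" is added in that case.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_summary_line(line):
    """Return (seq_id, description, bit_score, e_value), or None if no match."""
    match = SUMMARY_LINE.match(line)
    if match is None:
        return None
    seq_id, description, bits, evalue = match.groups()
    return seq_id, description, float(bits), parse_evalue(evalue)


hit = parse_summary_line(
    "dbj|BAF96582.1| lignan glucosyltransferase [Sesamum indicum]          441   e-121"
)
print(hit)
```

Descriptions truncated with `...` in the table are kept as printed; the full description is only available from the `>` header line of each alignment.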

>dbj|BAK55746.1| UDP-glucose glucosyltransferase [Gardenia jasminoides]
          Length = 477

 Score =  512 bits (1319), Expect = e-143
 Identities = 245/359 (68%), Positives = 294/359 (81%)
 Frame = -1

Query: 1231 CTTMIDVANEFGAPSYVFFASGAAMLGLCFHMQSLRDNFNEDVTEFKNSNGEMHVPTYIN 1052
            C++MIDVANEFG PSYVF+ SGAAMLGL  H+QSLRD+F EDVT ++NS  E+ VPTYIN
Sbjct: 120  CSSMIDVANEFGVPSYVFYTSGAAMLGLMLHLQSLRDDFGEDVTNYENSKVELAVPTYIN 179

Query: 1051 XXXXXXXXXVFFDKNGGCHMFLNQIKRYRETKGILVNTSFELESHAVQALHNDETIPPVY 872
                       FD  GG +MFLN  KR+RETKGI++N+ FELESHA+QAL ND+TIPPVY
Sbjct: 180  PVPVKVLPSRLFDMEGGGNMFLNLTKRFRETKGIVINSFFELESHAIQALSNDKTIPPVY 239

Query: 871  PIGPLLNPNKNCGQNQETDKMIMEWLDLQPDSSVVFICFGTIGCFDGDQVKEIADALEHS 692
            P+GP+L+  ++ GQNQET+ MI +WLD+QPDSSVVF+CFG+ GCFDG QVKEIA ALE S
Sbjct: 240  PVGPILDLKESNGQNQETE-MITKWLDIQPDSSVVFLCFGSRGCFDGGQVKEIACALESS 298

Query: 691  GYRFLWSIRRPPPKEKIELPSDYENLEEVLPEGFIKRTAEVGKVLGWAPQAAVLSHPAIG 512
            GYRFLWS+RRPPPK K E P DYENLEE LPEGF++RTAEVGKV+GWAPQAA+LSHPA+G
Sbjct: 299  GYRFLWSLRRPPPKGKFESPGDYENLEEALPEGFLQRTAEVGKVIGWAPQAAILSHPAVG 358

Query: 511  GFVSHCGWNSILESVWFGVPVATWPLYSEQQVNAFQMLKDLDIAVEVKMEFRKDFKAESS 332
             FVSHCGWNS LESVWFGVP+ATWPLY+EQQVNAF +LKDL +AV++KM+F+      S+
Sbjct: 359  CFVSHCGWNSTLESVWFGVPMATWPLYAEQQVNAFLLLKDLGMAVDIKMDFKSTSFEPST 418

Query: 331  EILSADFIESRIRHLMDSENEVRRKAKEMKEKSRSARSEGGSSTASLKYFIDEVFKNIP 155
            EI++AD IE  I+HLMD ENE+R+K KE KEKSR + SEGG S+ASL  F+D +  NIP
Sbjct: 419  EIVAADLIEKAIKHLMDPENEIRKKVKEKKEKSRLSLSEGGPSSASLGRFLDALIDNIP 477
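
For a translated search the query coordinates are in nucleotides while the alignment is in amino acids. The sketch below illustrates the conversion for the alignment above, assuming three nucleotides per aligned residue and no gaps in the query; `translated_length` is a hypothetical helper name, not a BLAST function.

```python
def translated_length(query_start, query_end):
    """Number of amino acids covered by a nucleotide query range.

    Coordinates are 1-based and inclusive. On a minus-strand frame
    (Frame = -1 here) the start is larger than the end, so the absolute
    difference is used.
    """
    span = abs(query_start - query_end) + 1
    if span % 3 != 0:
        raise ValueError("range does not cover a whole number of codons")
    return span // 3


# First hit: the query runs from nucleotide 1231 down to 155.
print(translated_length(1231, 155))
```

For this hit the result equals the denominator of the Identities line (359), which is expected when the alignment contains no gaps.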


>dbj|BAF75901.1| tetrahydroxychalcone 2'-glucosyltransferase [Catharanthus roseus]
          Length = 476

 Score =  460 bits (1184), Expect = e-127
 Identities = 219/354 (61%), Positives = 274/354 (77%)
 Frame = -1

Query: 1231 CTTMIDVANEFGAPSYVFFASGAAMLGLCFHMQSLRDNFNEDVTEFKNSNGEMHVPTYIN 1052
            CT+MIDVANEFG PSYV++ SGAAMLGL  H Q LRD+ NED+ E+K+ + +  VPTYIN
Sbjct: 122  CTSMIDVANEFGVPSYVYYTSGAAMLGLVLHFQHLRDDLNEDIIEYKDKDTDFTVPTYIN 181

Query: 1051 XXXXXXXXXVFFDKNGGCHMFLNQIKRYRETKGILVNTSFELESHAVQALHNDETIPPVY 872
                     V FD   G  +FL+Q KRYRETKGI++NT  ELESH+V AL  D  IPPVY
Sbjct: 182  PLHSKVLPSVLFDNEEGSKLFLDQAKRYRETKGIIINTFLELESHSVTALSEDPNIPPVY 241

Query: 871  PIGPLLNPNKNCGQNQETDKMIMEWLDLQPDSSVVFICFGTIGCFDGDQVKEIADALEHS 692
              GP+LN      Q  E   +I++WL+LQP+SSVVF+CFG+ G F  +QVKEIA ALE+S
Sbjct: 242  TAGPILNLKSEASQESE---LILKWLNLQPESSVVFLCFGSYGSFSAEQVKEIAIALENS 298

Query: 691  GYRFLWSIRRPPPKEKIELPSDYENLEEVLPEGFIKRTAEVGKVLGWAPQAAVLSHPAIG 512
            G+RFLWS+RRPPP+ K+E PS+YENLEE+LPEGF+KRTAE GK++GWAPQ  VLSH A+G
Sbjct: 299  GHRFLWSLRRPPPEGKMEPPSEYENLEEILPEGFLKRTAETGKIIGWAPQIEVLSHSAVG 358

Query: 511  GFVSHCGWNSILESVWFGVPVATWPLYSEQQVNAFQMLKDLDIAVEVKMEFRKDFKAESS 332
            GFVSHCGWNS LESVW GVP+ATWP+Y+EQQ+NAF+M+KDL++AVE+K+++R++    +S
Sbjct: 359  GFVSHCGWNSTLESVWCGVPMATWPIYAEQQLNAFEMVKDLEMAVEIKIDYRREVWTTNS 418

Query: 331  EILSADFIESRIRHLMDSENEVRRKAKEMKEKSRSARSEGGSSTASLKYFIDEV 170
            EIL AD IE RIR LMD EN++R K KEM+ KS S   EGGSS +S++ FID V
Sbjct: 419  EILGADLIEERIRCLMDPENKIRSKVKEMQRKSSSTLKEGGSSWSSIRRFIDSV 472


>dbj|BAD29721.1| UDP-glucose glucosyltransferase [Catharanthus roseus]
          Length = 480

 Score =  442 bits (1136), Expect = e-121
 Identities = 210/358 (58%), Positives = 281/358 (78%)
 Frame = -1

Query: 1231 CTTMIDVANEFGAPSYVFFASGAAMLGLCFHMQSLRDNFNEDVTEFKNSNGEMHVPTYIN 1052
            CT MIDVANEF  P+Y+F+ + AAMLGL  H+QSLRD+F +++ ++K+S  E+ +P+Y N
Sbjct: 125  CTNMIDVANEFRVPTYLFYTTTAAMLGLVLHLQSLRDDFAQNLADYKDSISELSIPSYKN 184

Query: 1051 XXXXXXXXXVFFDKNGGCHMFLNQIKRYRETKGILVNTSFELESHAVQALHNDETIPPVY 872
                     + FDK    ++FLN  KRYRE KGI++NT  +LES+A++ L  DET+PPVY
Sbjct: 185  PVPVNILPSIVFDKGESSNVFLNHAKRYREMKGIIINTFLDLESYALENLTEDETLPPVY 244

Query: 871  PIGPLLNPNKNCGQNQETDKMIMEWLDLQPDSSVVFICFGTIGCFDGDQVKEIADALEHS 692
             +GP+LN   +  Q+ E + +I+EWLDLQP+SSVVF+CFG+ G FD +QVKEIA ALEHS
Sbjct: 245  AVGPILNVKGSHNQDNEVE-VILEWLDLQPNSSVVFLCFGSRGYFDKEQVKEIAYALEHS 303

Query: 691  GYRFLWSIRRPPPKEKIELPSDYENLEEVLPEGFIKRTAEVGKVLGWAPQAAVLSHPAIG 512
            GYRFLWS+R+PP   K+   +++ NLEE+LPEGF +R+AE+GKV+GWAPQ  VLSHPA+G
Sbjct: 304  GYRFLWSLRQPPSPGKVA--TEFGNLEELLPEGFFQRSAEIGKVIGWAPQVQVLSHPAVG 361

Query: 511  GFVSHCGWNSILESVWFGVPVATWPLYSEQQVNAFQMLKDLDIAVEVKMEFRKDFKAESS 332
            GFVSHCGWNS LES+WFGVP+ATWPLY+EQQ NAFQ++KDL++AVE+K+++RK+F A + 
Sbjct: 362  GFVSHCGWNSTLESIWFGVPMATWPLYAEQQGNAFQLVKDLEMAVEIKIDYRKNFFASTE 421

Query: 331  EILSADFIESRIRHLMDSENEVRRKAKEMKEKSRSARSEGGSSTASLKYFIDEVFKNI 158
            +I+ AD IE+ IR LMD ENEVR K KEMKE+SR A  EGGSS  S+++FI+++ K I
Sbjct: 422  DIVKADEIEAGIRRLMDPENEVRNKVKEMKERSRVAIVEGGSSYTSMQWFIEDMKKTI 479


>dbj|BAF96582.1| lignan glucosyltransferase [Sesamum indicum]
          Length = 476

 Score =  441 bits (1134), Expect = e-121
 Identities = 212/359 (59%), Positives = 272/359 (75%), Gaps = 1/359 (0%)
 Frame = -1

Query: 1231 CTTMIDVANEFGAPSYVFFASGAAMLGLCFHMQSLRDNFNEDVTEFKNSNGEMHVPTYIN 1052
            CTTMIDVANE G P+Y+FF+SG+A LGL FH+QSLRD+ N DV E+KNS+  + +PTY+N
Sbjct: 124  CTTMIDVANELGVPTYMFFSSGSATLGLMFHLQSLRDDNNVDVMEYKNSDAAISIPTYVN 183

Query: 1051 XXXXXXXXXVFFDKNGGCHMFLNQIKRYRETKGILVNTSFELESHAVQALHNDETIPPVY 872
                       F+++ G   FL+  KR+RETKGI+VNT  E E+H +++L +D+ IPPVY
Sbjct: 184  PVPVAVWPSPVFEEDSG---FLDFAKRFRETKGIIVNTFLEFETHQIRSLSDDKKIPPVY 240

Query: 871  PIGPLLNPNKN-CGQNQETDKMIMEWLDLQPDSSVVFICFGTIGCFDGDQVKEIADALEH 695
            P+GP+L  ++N   Q +E    IM WLD QPDSSVVF+CFGT GC +GDQVKEIA ALE+
Sbjct: 241  PVGPILQADENKIEQEKEKHAEIMRWLDKQPDSSVVFLCFGTHGCLEGDQVKEIAVALEN 300

Query: 694  SGYRFLWSIRRPPPKEKIELPSDYENLEEVLPEGFIKRTAEVGKVLGWAPQAAVLSHPAI 515
            SG+RFLWS+R+PPPKEK+E P +YEN EEVLPEGF+ RT ++GKV+GWAPQ AVLSHPA+
Sbjct: 301  SGHRFLWSLRKPPPKEKVEFPGEYENSEEVLPEGFLGRTTDMGKVIGWAPQMAVLSHPAV 360

Query: 514  GGFVSHCGWNSILESVWFGVPVATWPLYSEQQVNAFQMLKDLDIAVEVKMEFRKDFKAES 335
            GGFVSHCGWNS+LESVW GVP+A WPL +EQQ NAF ++K+ ++AVE+KM    D+K  +
Sbjct: 361  GGFVSHCGWNSVLESVWCGVPMAVWPLSAEQQANAFLLVKEFEMAVEIKM----DYKKNA 416

Query: 334  SEILSADFIESRIRHLMDSENEVRRKAKEMKEKSRSARSEGGSSTASLKYFIDEVFKNI 158
            + I+  + IE  IR LMD ENE+R K + +KEKSR A  EGGSS   LK F++ V  NI
Sbjct: 417  NVIVGTETIEEAIRQLMDPENEIRVKVRALKEKSRMALMEGGSSYNYLKRFVENVVNNI 475


>dbj|BAF96581.1| lignan glucosyltransferase [Sesamum alatum]
          Length = 476

 Score =  441 bits (1134), Expect = e-121
 Identities = 212/359 (59%), Positives = 272/359 (75%), Gaps = 1/359 (0%)
 Frame = -1

Query: 1231 CTTMIDVANEFGAPSYVFFASGAAMLGLCFHMQSLRDNFNEDVTEFKNSNGEMHVPTYIN 1052
            CTTMIDVANE G P+Y+FF+SG+A LGL FH+QSLRD+ N DV E+KNS+  + +PTY+N
Sbjct: 124  CTTMIDVANELGVPTYMFFSSGSATLGLMFHLQSLRDDNNVDVMEYKNSDAAISIPTYVN 183

Query: 1051 XXXXXXXXXVFFDKNGGCHMFLNQIKRYRETKGILVNTSFELESHAVQALHNDETIPPVY 872
                       F+++ G   FL+  KR+RETKGI+VNT  E E+H +++L +D+ IPPVY
Sbjct: 184  PVPVAVWPSQVFEEDSG---FLDFAKRFRETKGIIVNTFLEFETHQIRSLSDDKKIPPVY 240

Query: 871  PIGPLLNPNKN-CGQNQETDKMIMEWLDLQPDSSVVFICFGTIGCFDGDQVKEIADALEH 695
            P+GP+L  ++N   Q +E    IM WLD QPDSSVVF+CFGT GC +GDQVKEIA ALE+
Sbjct: 241  PVGPILQADENKIEQEKEKHAEIMRWLDKQPDSSVVFLCFGTHGCLEGDQVKEIAVALEN 300

Query: 694  SGYRFLWSIRRPPPKEKIELPSDYENLEEVLPEGFIKRTAEVGKVLGWAPQAAVLSHPAI 515
            SG+RFLWS+R+PPPKEK+E P +YEN EEVLPEGF+ RT ++GKV+GWAPQ AVLSHPA+
Sbjct: 301  SGHRFLWSLRKPPPKEKVEFPGEYENSEEVLPEGFLGRTTDMGKVIGWAPQMAVLSHPAV 360

Query: 514  GGFVSHCGWNSILESVWFGVPVATWPLYSEQQVNAFQMLKDLDIAVEVKMEFRKDFKAES 335
            GGFVSHCGWNS+LESVW GVP+A WPL +EQQ NAF ++K+ ++AVE+KM    D+K  +
Sbjct: 361  GGFVSHCGWNSVLESVWCGVPMAVWPLSAEQQANAFLLVKEFEMAVEIKM----DYKKNA 416

Query: 334  SEILSADFIESRIRHLMDSENEVRRKAKEMKEKSRSARSEGGSSTASLKYFIDEVFKNI 158
            + I+  + IE  IR LMD ENE+R K + +KEKSR A  EGGSS   LK F++ V  NI
Sbjct: 417  NVIVGTETIEEAIRQLMDPENEIRVKVRALKEKSRMALMEGGSSYNYLKRFVENVVNNI 475
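
The per-hit statistics lines can be checked with simple arithmetic. This is a minimal sketch with hypothetical names (`STATS`, `parse_stats`, `percent`). It assumes the percentages are truncated to an integer rather than rounded, which is consistent with every value printed in this report (for example 294/359 is 81.9% but is shown as 81%).

```python
import re

# Matches the "name = n/d" pairs on an Identities/Positives/Gaps line.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")


def parse_stats(line):
    """Return {name: (count, alignment_length)} for one statistics line."""
    return {name: (int(n), int(d)) for name, n, d in STATS.findall(line)}


def percent(count, length):
    """Integer percentage, truncated (assumed to mirror the report)."""
    return 100 * count // length


line = " Identities = 212/359 (59%), Positives = 272/359 (75%), Gaps = 1/359 (0%)"
stats = parse_stats(line)
for name, (count, length) in stats.items():
    print(name, count, length, percent(count, length))
```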