BLASTX nr result

ID: Cimicifuga21_contig00012250 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00012250
         (2173 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002283249.1| PREDICTED: probable beta-1,4-xylosyltransfer...   600   e-169
emb|CBI21374.3| unnamed protein product [Vitis vinifera]              589   e-166
emb|CAI94901.1| glycosyltransferase [Citrus trifoliata]               579   e-163
ref|XP_002310709.1| glycosyl transferase, CAZy family GT43 [Popu...   576   e-161
ref|XP_002306485.1| predicted protein [Populus trichocarpa] gi|2...   566   e-158

>ref|XP_002283249.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX14H-like [Vitis
            vinifera]
          Length = 513

 Score =  600 bits (1546), Expect = e-169
 Identities = 316/516 (61%), Positives = 367/516 (71%), Gaps = 6/516 (1%)
 Frame = -3

Query: 1964 MKFSILQQSFTNRRSNSFRTTNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGXXXXX 1785
            MK S LQQS+TNRRSNSFR                                  LG     
Sbjct: 1    MKLSALQQSYTNRRSNSFRAAGGLDSSVDGSGKSPAAIFWLVLHGLCCLISLVLGFRFSR 60

Query: 1784 XXXXXXXSTTTDHITS-----TPFFKTDLETL-TIHRQSPPFSSFANLEASLNXXXXXXX 1623
                   ST ++  TS     TPF  T  +   ++  Q+ P S    L  +         
Sbjct: 61   LVFFLFFSTASNGGTSGLYPSTPFLGTTADIAGSLSFQANP-SPNLELPPNRTAGGISSS 119

Query: 1622 XXXXXRHGILIRPWPHPNPTEVMKAHRIIERVQREQRVQYGIKNPKNLIVITPTYSRTFQ 1443
                 RHGI IRPWPHPNP EVMKAHRIIERVQREQ++Q+GIKNP+ +IV+TPTY RTFQ
Sbjct: 120  RVVVGRHGIRIRPWPHPNPDEVMKAHRIIERVQREQKLQFGIKNPRTVIVVTPTYVRTFQ 179

Query: 1442 TLHLTGLMHSLMLVPYELVWIVVEAGGTSNETALILEKSKLQTIHIGIDQKMPISWDERH 1263
            TLHLTGLMHSLM VPY+L+WIV+EAGGT+NETA +L KS L+TIHIG D++MP SW++RH
Sbjct: 180  TLHLTGLMHSLMNVPYDLIWIVIEAGGTTNETASLLAKSGLRTIHIGFDRRMPNSWEDRH 239

Query: 1262 RLEARMRFRGLRVVREARLDGIVMFADDSNMHSMELFDEIQSVKWFGAVSVGILAHSGNS 1083
            RLEA+MR R LR+VRE +LDGI+MF DDSNMHSMELFDEIQ VKW GAVSVGILAHSGN+
Sbjct: 240  RLEAQMRLRALRIVREEKLDGILMFGDDSNMHSMELFDEIQKVKWIGAVSVGILAHSGNT 299

Query: 1082 VESSSMTNKEEDEENIPMPVQGPACNSSGQLVGWHTFNSLPYVEKSATYIDDSATVLPRK 903
             E SS+ +K+ +EEN+P PVQGPACNSS +LVGWH FNSLPYV   ATYIDD ATVLPRK
Sbjct: 300  DELSSVAHKKAEEENLPPPVQGPACNSSEKLVGWHIFNSLPYVGNGATYIDDRATVLPRK 359

Query: 902  LEWAGFVLNSRLVWKEAEDKPEWVRDLDTLADDGDAVETPLSLLKDASFIEPLGNCGRKV 723
            LEW+GFVLNSRL+WK AED+PEWV+DLD L    + +E+PLSLLKD S +EPLG+CGRKV
Sbjct: 360  LEWSGFVLNSRLLWKAAEDRPEWVKDLDKLDGVREEIESPLSLLKDPSMVEPLGSCGRKV 419

Query: 722  LLWWLRVEARADSKFPPGWIIDPPLEIIVPAKRTPWPDAPPELPSDERLNGIGEPTEKRI 543
            LLWWLRVEAR DSKFP  WIIDPPLE+ VPAKRTPWPDAPPELPS+ +   I E TEKR 
Sbjct: 420  LLWWLRVEARTDSKFPARWIIDPPLEVTVPAKRTPWPDAPPELPSNVKEISIQEHTEKRH 479

Query: 542  PKTGRASRSKHGPRNKKKRDSSVVVDTQGSGGRHEE 435
             K+ RASRSKH  R+K+K +S    D Q S    EE
Sbjct: 480  AKS-RASRSKHSSRSKRKHESR-TADPQVSSKVSEE 513


>emb|CBI21374.3| unnamed protein product [Vitis vinifera]
          Length = 475

 Score =  589 bits (1519), Expect = e-166
 Identities = 285/390 (73%), Positives = 328/390 (84%)
 Frame = -3

Query: 1604 HGILIRPWPHPNPTEVMKAHRIIERVQREQRVQYGIKNPKNLIVITPTYSRTFQTLHLTG 1425
            HGI IRPWPHPNP EVMKAHRIIERVQREQ++Q+GIKNP+ +IV+TPTY RTFQTLHLTG
Sbjct: 88   HGIRIRPWPHPNPDEVMKAHRIIERVQREQKLQFGIKNPRTVIVVTPTYVRTFQTLHLTG 147

Query: 1424 LMHSLMLVPYELVWIVVEAGGTSNETALILEKSKLQTIHIGIDQKMPISWDERHRLEARM 1245
            LMHSLM VPY+L+WIV+EAGGT+NETA +L KS L+TIHIG D++MP SW++RHRLEA+M
Sbjct: 148  LMHSLMNVPYDLIWIVIEAGGTTNETASLLAKSGLRTIHIGFDRRMPNSWEDRHRLEAQM 207

Query: 1244 RFRGLRVVREARLDGIVMFADDSNMHSMELFDEIQSVKWFGAVSVGILAHSGNSVESSSM 1065
            R R LR+VRE +LDGI+MF DDSNMHSMELFDEIQ VKW GAVSVGILAHSGN+ E SS+
Sbjct: 208  RLRALRIVREEKLDGILMFGDDSNMHSMELFDEIQKVKWIGAVSVGILAHSGNTDELSSV 267

Query: 1064 TNKEEDEENIPMPVQGPACNSSGQLVGWHTFNSLPYVEKSATYIDDSATVLPRKLEWAGF 885
             +K+ +EEN+P PVQGPACNSS +LVGWH FNSLPYV   ATYIDD ATVLPRKLEW+GF
Sbjct: 268  AHKKAEEENLPPPVQGPACNSSEKLVGWHIFNSLPYVGNGATYIDDRATVLPRKLEWSGF 327

Query: 884  VLNSRLVWKEAEDKPEWVRDLDTLADDGDAVETPLSLLKDASFIEPLGNCGRKVLLWWLR 705
            VLNSRL+WK AED+PEWV+DLD L    + +E+PLSLLKD S +EPLG+CGRKVLLWWLR
Sbjct: 328  VLNSRLLWKAAEDRPEWVKDLDKLDGVREEIESPLSLLKDPSMVEPLGSCGRKVLLWWLR 387

Query: 704  VEARADSKFPPGWIIDPPLEIIVPAKRTPWPDAPPELPSDERLNGIGEPTEKRIPKTGRA 525
            VEAR DSKFP  WIIDPPLE+ VPAKRTPWPDAPPELPS+ +   I E TEKR  K+ RA
Sbjct: 388  VEARTDSKFPARWIIDPPLEVTVPAKRTPWPDAPPELPSNVKEISIQEHTEKRHAKS-RA 446

Query: 524  SRSKHGPRNKKKRDSSVVVDTQGSGGRHEE 435
            SRSKH  R+K+K +S    D Q S    EE
Sbjct: 447  SRSKHSSRSKRKHESR-TADPQVSSKVSEE 475


>emb|CAI94901.1| glycosyltransferase [Citrus trifoliata]
          Length = 507

 Score =  579 bits (1493), Expect = e-163
 Identities = 312/507 (61%), Positives = 360/507 (71%), Gaps = 3/507 (0%)
 Frame = -3

Query: 1964 MKFSILQQSFTNRRSNSFRTTNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGXXXXX 1785
            MK S LQQS+ +RRSNSFR +                                LG     
Sbjct: 1    MKLSALQQSYLSRRSNSFRGSAPLDSSSDSAIKSPAAIFWLVLHGLCCLISLVLGFRFSR 60

Query: 1784 XXXXXXXSTTTDHITSTPFFKTDLETLTIHRQSPPFSSFANLEASL--NXXXXXXXXXXX 1611
                   ST+T   ++T  +      L     +P  SS   +E  +              
Sbjct: 61   LVFFFIFSTSTT--STTNLYTAPFRNLASDITTPFVSSSTPVEIPVLNRTTPNSNSRVVV 118

Query: 1610 XRHGILIRPWPHPNPTEVMKAHRIIERVQREQRVQYGIKNPKNLIVITPTYSRTFQTLHL 1431
             RHGI IRPWPHPNPTEVMKAH+IIERVQREQR   G+KNP+ LIV+TPTY RTFQTLHL
Sbjct: 119  GRHGIRIRPWPHPNPTEVMKAHKIIERVQREQRAHVGVKNPRTLIVVTPTYVRTFQTLHL 178

Query: 1430 TGLMHSLMLVPYELVWIVVEAGGTSNETALILEKSKLQTIHIGIDQKMPISWDERHRLEA 1251
            TG+MHSLMLVPY+LVWIVVEA G +NETA ++ KSKL+TIH+G+DQKMP SW  RH+LEA
Sbjct: 179  TGVMHSLMLVPYDLVWIVVEARGVTNETASLIAKSKLRTIHVGVDQKMPASWGGRHQLEA 238

Query: 1250 RMRFRGLRVVREARLDGIVMFADDSNMHSMELFDEIQSVKWFGAVSVGILAHSGNSVESS 1071
            +MR R LR+VRE +LDGIVMFADDSNMHSMELFDEIQ+VKWFGAVSVGILA +GN  ESS
Sbjct: 239  KMRLRALRIVREEKLDGIVMFADDSNMHSMELFDEIQNVKWFGAVSVGILALAGNQDESS 298

Query: 1070 SMTNKEEDEENIPMPVQGPACNSSGQLVGWHTFNSLPYVEKSATYIDDSATVLPRKLEWA 891
            S+   EE  EN  MPVQGPACNSS  + GWHTFN+ PY   SATYIDD ATVLPRKLEWA
Sbjct: 299  SVI-MEEGGENTAMPVQGPACNSSNNVAGWHTFNT-PYARTSATYIDDRATVLPRKLEWA 356

Query: 890  GFVLNSRLVWKEAEDKPEWVRDLDTLADDGDAVETPLSLLKDASFIEPLGNCGRKVLLWW 711
            GFVLNSRL+WKEA+DKPEWV DLD L D  + +E+PLSLLKD S +EPLGNCGR+VL+WW
Sbjct: 357  GFVLNSRLLWKEAKDKPEWVNDLD-LLDGLEDIESPLSLLKDQSMVEPLGNCGRQVLVWW 415

Query: 710  LRVEARADSKFPPGWIIDPPLEIIVPAKRTPWPDAPPELPSDER-LNGIGEPTEKRIPKT 534
            LRVEAR+DSKFPPG IIDPPLEI VP+KRTPWPDAPPELPS+E+ L GI E T K  PK 
Sbjct: 416  LRVEARSDSKFPPGGIIDPPLEITVPSKRTPWPDAPPELPSNEKVLVGIQEHTVKHTPK- 474

Query: 533  GRASRSKHGPRNKKKRDSSVVVDTQGS 453
             R+SRSK   R+K+K ++  VVD Q S
Sbjct: 475  NRSSRSKRSSRSKRKHETK-VVDMQAS 500


>ref|XP_002310709.1| glycosyl transferase, CAZy family GT43 [Populus trichocarpa]
            gi|222853612|gb|EEE91159.1| glycosyl transferase, CAZy
            family GT43 [Populus trichocarpa]
            gi|333951815|gb|AEG25425.1| glycosyltransferase GT43C
            [Populus trichocarpa]
          Length = 510

 Score =  576 bits (1484), Expect = e-161
 Identities = 308/522 (59%), Positives = 376/522 (72%), Gaps = 11/522 (2%)
 Frame = -3

Query: 1964 MKFSILQQSFTNRRSNSFRTTNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXL---GXX 1794
            MKFS+LQQS+ NRRS SFR ++                               +      
Sbjct: 1    MKFSLLQQSYNNRRSGSFRGSSAPLDSSPDNTIKSPAAIFWLFLHGICCLISLVLGFRFS 60

Query: 1793 XXXXXXXXXXSTTTDHITSTPFFKTDLETLTIHRQSPPFSSFANLEASLNXXXXXXXXXX 1614
                      STTT    +TPF     +T  I   S P ++ AN    +N          
Sbjct: 61   RLVFFFLFSTSTTTTLYVTTPFHPLS-KTSDI---SNPLTNSANDLPVINKTVSSRVVVG 116

Query: 1613 XXRHGILIRPWPHPNPTEVMKAHRIIERVQREQRVQYGIKNPKNLIVITPTYSRTFQTLH 1434
               HGI IRPWPHPNP+EV+KAH+IIERVQREQ  Q+G+K+P++LIV+TPTY RTFQTLH
Sbjct: 117  R--HGIRIRPWPHPNPSEVIKAHQIIERVQREQSNQFGVKSPRSLIVVTPTYVRTFQTLH 174

Query: 1433 LTGLMHSLMLVPYELVWIVVEAGGTSNETALILEKSKLQTIHIGIDQKMPISWDERHRLE 1254
            +TG+MHSLML+PY++VWIVVEAGG +NETALI+ KS ++T+HIG +QKMP SW+ RHRLE
Sbjct: 175  MTGVMHSLMLLPYDVVWIVVEAGGVTNETALIIAKSGVKTLHIGFNQKMPNSWEGRHRLE 234

Query: 1253 ARMRFRGLRVVREARLDGIVMFADDSNMHSMELFDEIQSVKWFGAVSVGILAHSGNSVE- 1077
             +MR R LRVVRE ++DGIVMFADDSNMHSMELFDEIQ+VKWFGAVSVGIL HSG + E 
Sbjct: 235  TKMRLRALRVVREEKMDGIVMFADDSNMHSMELFDEIQNVKWFGAVSVGILVHSGGADET 294

Query: 1076 ----SSSMTNKEEDEENIP---MPVQGPACNSSGQLVGWHTFNSLPYVEKSATYIDDSAT 918
                +++M +KE  EEN+P   +PVQGPACN+S +LVGWHTFNSLPY  KSA YIDD AT
Sbjct: 295  LLTAAAAMVDKEA-EENLPNPVVPVQGPACNASNKLVGWHTFNSLPYEGKSAVYIDDRAT 353

Query: 917  VLPRKLEWAGFVLNSRLVWKEAEDKPEWVRDLDTLADDGDAVETPLSLLKDASFIEPLGN 738
            VLPRKLEWAGF+LNSRL+WKEAEDKPEWV+D+D + ++   +E PL+LLKD S +EPLG+
Sbjct: 354  VLPRKLEWAGFMLNSRLLWKEAEDKPEWVKDMDLVDEN---IENPLALLKDPSMVEPLGS 410

Query: 737  CGRKVLLWWLRVEARADSKFPPGWIIDPPLEIIVPAKRTPWPDAPPELPSDERLNGIGEP 558
            CGR+VLLWWLRVEARADSKFPPGWIIDPPLEI VP+KRTPWPDAPPELPS+E+++   E 
Sbjct: 411  CGRQVLLWWLRVEARADSKFPPGWIIDPPLEITVPSKRTPWPDAPPELPSNEKISVNQEQ 470

Query: 557  TEKRIPKTGRASRSKHGPRNKKKRDSSVVVDTQGSGGRHEEE 432
            T KR  KT R+ RSK   R+K+K +  V+ +TQ S  RH E+
Sbjct: 471  TAKRSSKT-RSPRSKRSSRSKRKHE-VVLAETQVS-ARHSEQ 509


>ref|XP_002306485.1| predicted protein [Populus trichocarpa] gi|222855934|gb|EEE93481.1|
            predicted protein [Populus trichocarpa]
            gi|333951817|gb|AEG25426.1| glycosyltransferase GT43D
            [Populus trichocarpa]
          Length = 503

 Score =  566 bits (1458), Expect = e-158
 Identities = 304/515 (59%), Positives = 366/515 (71%), Gaps = 4/515 (0%)
 Frame = -3

Query: 1964 MKFSILQQSFTNRRSNSFRTTNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXL-GXXXX 1788
            MK S+LQQS+ NRRS SFR ++                               + G    
Sbjct: 1    MKLSMLQQSYMNRRSASFRGSSAPLDSSTDNTIKSPAAIFWLLLHGFCCLISLVLGFRFS 60

Query: 1787 XXXXXXXXSTTTDHITSTPFFKTDLETLTIHRQSPPFSSFANLEASLNXXXXXXXXXXXX 1608
                    ST+T   T+T +  T L  LT  + +   +        +N            
Sbjct: 61   RLVFFFLFSTST---TTTLYIATPLPHLT--KTNNNINDLPLEIPVINKTLSSSSRVVVG 115

Query: 1607 RHGILIRPWPHPNPTEVMKAHRIIERVQREQRVQYGIKNPKNLIVITPTYSRTFQTLHLT 1428
            RHGI IRPWPHPNP+EVMKAH+IIE VQREQR Q+G+K+P+ LIV+TPTY RTFQTLHLT
Sbjct: 116  RHGIRIRPWPHPNPSEVMKAHQIIETVQREQRTQFGVKSPRTLIVVTPTYVRTFQTLHLT 175

Query: 1427 GLMHSLMLVPYELVWIVVEAGGTSNETALILEKSKLQTIHIGIDQKMPISWDERHRLEAR 1248
            G+MHSLMLVPY++VWIVVEAGG +NETA I+ KS ++T HIG  QKMP SW+ RH+LE +
Sbjct: 176  GVMHSLMLVPYDVVWIVVEAGGATNETASIIAKSSIKTFHIGFTQKMPNSWEGRHKLETK 235

Query: 1247 MRFRGLRVVREARLDGIVMFADDSNMHSMELFDEIQSVKWFGAVSVGILAHSGNSVESSS 1068
            MR R LRVVRE  +DGIVMFADDSNMHSMELFDEIQ+VKWFGAVSVGILAHSG   ESSS
Sbjct: 236  MRLRALRVVREEMMDGIVMFADDSNMHSMELFDEIQNVKWFGAVSVGILAHSGGGGESSS 295

Query: 1067 MTNKEEDEENI---PMPVQGPACNSSGQLVGWHTFNSLPYVEKSATYIDDSATVLPRKLE 897
               +++ + N+    MPVQGPACN+S +LVGWHTF+SLPY  KSA YIDD ATVLPRKLE
Sbjct: 296  AVAEKDVKPNLSNPAMPVQGPACNASNKLVGWHTFDSLPYEGKSAVYIDDRATVLPRKLE 355

Query: 896  WAGFVLNSRLVWKEAEDKPEWVRDLDTLADDGDAVETPLSLLKDASFIEPLGNCGRKVLL 717
            WAGFVLNSRL+ KEA+DKPEWV+DLD + ++   +E+PL+LLKD S +EPLG+CGR+VLL
Sbjct: 356  WAGFVLNSRLLLKEAQDKPEWVKDLDLVDEN---IESPLALLKDPSMVEPLGSCGRQVLL 412

Query: 716  WWLRVEARADSKFPPGWIIDPPLEIIVPAKRTPWPDAPPELPSDERLNGIGEPTEKRIPK 537
            WWLRVEARADSKFPPGWIIDPPLEI VP+KRTPWPDAPPELPS+++L    E T KR PK
Sbjct: 413  WWLRVEARADSKFPPGWIIDPPLEITVPSKRTPWPDAPPELPSNKKLTINQEQTIKRSPK 472

Query: 536  TGRASRSKHGPRNKKKRDSSVVVDTQGSGGRHEEE 432
            T R+ RSK   R  K++  + +V+TQ S  RH E+
Sbjct: 473  T-RSPRSK---RRSKRKHEAKLVETQVS-TRHSEQ 502

