BLASTX nr result

ID: Coptis23_contig00016041

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00016041
         (2066 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003633626.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alph...   610   e-172
ref|XP_003554642.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   604   e-170
ref|XP_003520801.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   603   e-170
ref|XP_002319516.1| predicted protein [Populus trichocarpa] gi|2...   598   e-168
ref|XP_004144775.1| PREDICTED: heparan-alpha-glucosaminide N-ace...   578   e-162

>ref|XP_003633626.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
            N-acetyltransferase-like [Vitis vinifera]
          Length = 499

 Score =  610 bits (1574), Expect = e-172
 Identities = 294/433 (67%), Positives = 338/433 (78%), Gaps = 1/433 (0%)
 Frame = -2

Query: 1858 TRVKEMESSNAEHPMMIMEENQQMKEADQVTKEKSKRVASLDIFRGLTVAMMILVDDAGG 1679
            T +K+    N +H ++I +     +E  Q    K+KR+ASLDIFRGLTVA+MILVDDAGG
Sbjct: 32   TSIKDDAPDN-QHRLIISDSGFPPEERPQ----KTKRLASLDIFRGLTVALMILVDDAGG 86

Query: 1678 EWQMIGHAPWNGCNLADFVMPFFLFIVGMAIALAFKRIPNRLLSIKRVIIRTIKLILWGL 1499
            EW MIGHAPWNGCNLADFVMPFFLFIVG+AIALA KRIP+RL++IK+V +RT+KL+ WGL
Sbjct: 87   EWPMIGHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRLMAIKKVTLRTLKLLFWGL 146

Query: 1498 ILQGGYSHAPDELTYGVDMKRIRWCGILQRIALSYLVVAILEISTKDGQAKDLLPGWQPV 1319
            +LQG ++  PD+LTYGVDMK+IRWCGILQ IAL+YLVVA+LEI+TK  QAKDL PG   +
Sbjct: 147  LLQGSFTQDPDKLTYGVDMKKIRWCGILQXIALAYLVVALLEITTKKAQAKDLSPGQFSI 206

Query: 1318 LRQYHWHWLAGACVLVVYLSVIYGAYVPDWHFTVYNMKSPDYGKTFNVKCGIRGTLDPPC 1139
             + Y WHWL GACVL+VY++V YG YVPDWHFTV++  S DYGK   V CG RG LDPPC
Sbjct: 207  FKLYCWHWLMGACVLIVYMAVSYGTYVPDWHFTVHDRDSADYGKVLTVACGARGKLDPPC 266

Query: 1138 NAVGYIDRVVLGTNHMYQHPAWKRSKVCTESSPYEGPFRKDAASWCQAPFEPEGXXXXXX 959
            N VGYIDR +LG NHMYQHPAW RSK C E SP +GPFRKDA SWC APFEPEG      
Sbjct: 267  NVVGYIDREILGMNHMYQHPAWTRSKACNEYSPDKGPFRKDAPSWCYAPFEPEGILSSIS 326

Query: 958  XXXXXXIGVHFGHVLVHIKDHSDRLKHWXXXXXXXXXXGIVLHFTNAIPLNKQLYSFSYV 779
                  IGVHFGHVL+H+K HSDRLKHW          GI LHFT AIPLNKQLY+FSYV
Sbjct: 327  AILSTIIGVHFGHVLMHLKGHSDRLKHWVVMGFALLVLGITLHFTGAIPLNKQLYTFSYV 386

Query: 778  CLTAGAAAWVFSTFYVLVDILGWRHLFLPLEWIGMNAMLVYVMAAEGIFAGFINGWYYED 599
            C+T+GAAA VFS FY+LVD+ G R L LPLEWIGMNAMLVYVMAAEG+FA FINGWYY D
Sbjct: 387  CVTSGAAALVFSFFYILVDVWGMRFLCLPLEWIGMNAMLVYVMAAEGVFAKFINGWYYGD 446

Query: 598  PHNTLI-YFYSHI 563
            PHNTLI +   HI
Sbjct: 447  PHNTLINWIQQHI 459


>ref|XP_003554642.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            [Glycine max]
          Length = 464

 Score =  604 bits (1557), Expect = e-170
 Identities = 283/429 (65%), Positives = 338/429 (78%)
 Frame = -2

Query: 1843 MESSNAEHPMMIMEENQQMKEADQVTKEKSKRVASLDIFRGLTVAMMILVDDAGGEWQMI 1664
            M     EH + + EE   +  +D+    K+KRVASLDIFRGLTVA+MILVDDAGG+W MI
Sbjct: 1    MAEIKGEHSLNVSEE---LPLSDK-NLPKTKRVASLDIFRGLTVALMILVDDAGGQWPMI 56

Query: 1663 GHAPWNGCNLADFVMPFFLFIVGMAIALAFKRIPNRLLSIKRVIIRTIKLILWGLILQGG 1484
            GHAPWNGCNLADFVMPFFLFIVGMAI LA KRIPNRLL++K+VI+RT+KL+ WGL+LQGG
Sbjct: 57   GHAPWNGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLLLQGG 116

Query: 1483 YSHAPDELTYGVDMKRIRWCGILQRIALSYLVVAILEISTKDGQAKDLLPGWQPVLRQYH 1304
            +SHAPD LTYGVDMK IRWCGILQRIAL+YLVVA++EI ++  QA+D  P    + + Y+
Sbjct: 117  FSHAPDNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSAQARDPEPTHLSIFKLYY 176

Query: 1303 WHWLAGACVLVVYLSVIYGAYVPDWHFTVYNMKSPDYGKTFNVKCGIRGTLDPPCNAVGY 1124
            WHWL GAC+L VYL+++YG +VPDW FTV+N  S   G T  V CG+RG LDPPCNAVGY
Sbjct: 177  WHWLVGACILAVYLALLYGIHVPDWQFTVHNPDSIYNGTTLTVTCGVRGKLDPPCNAVGY 236

Query: 1123 IDRVVLGTNHMYQHPAWKRSKVCTESSPYEGPFRKDAASWCQAPFEPEGXXXXXXXXXXX 944
            IDR V+G NHMY+ PAW+RS+ CTE+SPYEGPF+K+A SWC APFEPEG           
Sbjct: 237  IDREVIGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISAILST 296

Query: 943  XIGVHFGHVLVHIKDHSDRLKHWXXXXXXXXXXGIVLHFTNAIPLNKQLYSFSYVCLTAG 764
             IG+HFGHVL+H++DH  RLKHW          G++LHFT+AIPLNKQLY+ SYVC+T+G
Sbjct: 297  IIGLHFGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVCVTSG 356

Query: 763  AAAWVFSTFYVLVDILGWRHLFLPLEWIGMNAMLVYVMAAEGIFAGFINGWYYEDPHNTL 584
            AAA +FS FY++VDI G   LFLPL+WIGMNAMLVYVMAAEGIFAGFINGWYY DPHNTL
Sbjct: 357  AAALLFSAFYIMVDIWGLTFLFLPLKWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTL 416

Query: 583  IYFYSHITF 557
            +Y+     F
Sbjct: 417  VYWIQKHVF 425


>ref|XP_003520801.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            [Glycine max]
          Length = 465

 Score =  603 bits (1556), Expect = e-170
 Identities = 284/429 (66%), Positives = 335/429 (78%)
 Frame = -2

Query: 1843 MESSNAEHPMMIMEENQQMKEADQVTKEKSKRVASLDIFRGLTVAMMILVDDAGGEWQMI 1664
            M     EH + +   +Q++ E       K+KRVASLDIFRGLTVA+MILVDDAG +W MI
Sbjct: 1    MAEIKGEHSLNV---SQELPEVSDKNLPKTKRVASLDIFRGLTVALMILVDDAGEQWPMI 57

Query: 1663 GHAPWNGCNLADFVMPFFLFIVGMAIALAFKRIPNRLLSIKRVIIRTIKLILWGLILQGG 1484
            GHAPWNGCNLADFVMPFFLFIVGMAI LA KRIPNRLL++K+VI+RT+KL+ WGL+LQGG
Sbjct: 58   GHAPWNGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLLLQGG 117

Query: 1483 YSHAPDELTYGVDMKRIRWCGILQRIALSYLVVAILEISTKDGQAKDLLPGWQPVLRQYH 1304
            +SHAPD LTYGVDMK IRWCGILQRIAL+YLVVA++EI ++  QA+D  P    +   Y+
Sbjct: 118  FSHAPDNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSTQARDPEPTHLSIFNLYY 177

Query: 1303 WHWLAGACVLVVYLSVIYGAYVPDWHFTVYNMKSPDYGKTFNVKCGIRGTLDPPCNAVGY 1124
            WHWL GAC+LVVYL+++YG +VPDW FTV+N  S   G T  V CG+RG LDPPCNAVGY
Sbjct: 178  WHWLVGACILVVYLALLYGIHVPDWGFTVHNPDSIYNGTTLTVTCGVRGKLDPPCNAVGY 237

Query: 1123 IDRVVLGTNHMYQHPAWKRSKVCTESSPYEGPFRKDAASWCQAPFEPEGXXXXXXXXXXX 944
            IDR VLG NHMY+ PAW+RS+ CTE+SPYEGPF+K+A SWC APFEPEG           
Sbjct: 238  IDREVLGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISAILST 297

Query: 943  XIGVHFGHVLVHIKDHSDRLKHWXXXXXXXXXXGIVLHFTNAIPLNKQLYSFSYVCLTAG 764
             IG+HFGHVL+H++DH  RLKHW          G++LHFT+AIPLNKQLY+ SYVC+T+G
Sbjct: 298  IIGLHFGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVCVTSG 357

Query: 763  AAAWVFSTFYVLVDILGWRHLFLPLEWIGMNAMLVYVMAAEGIFAGFINGWYYEDPHNTL 584
            AAA +FS FY+ VDI G   LFLPL+WIGMNAMLVYVMAAEGIFAGFINGWYY DPHNTL
Sbjct: 358  AAALLFSAFYITVDIWGLTFLFLPLKWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTL 417

Query: 583  IYFYSHITF 557
            IY+     F
Sbjct: 418  IYWIQKHVF 426


>ref|XP_002319516.1| predicted protein [Populus trichocarpa] gi|222857892|gb|EEE95439.1|
            predicted protein [Populus trichocarpa]
          Length = 468

 Score =  598 bits (1543), Expect = e-168
 Identities = 277/392 (70%), Positives = 322/392 (82%)
 Frame = -2

Query: 1750 RVASLDIFRGLTVAMMILVDDAGGEWQMIGHAPWNGCNLADFVMPFFLFIVGMAIALAFK 1571
            RVASLDI+RGLTVA+MILVDDAGGEW  IGHAPWNGCNLADFVMPFFLFIVGMAI LAFK
Sbjct: 32   RVASLDIYRGLTVALMILVDDAGGEWPKIGHAPWNGCNLADFVMPFFLFIVGMAIPLAFK 91

Query: 1570 RIPNRLLSIKRVIIRTIKLILWGLILQGGYSHAPDELTYGVDMKRIRWCGILQRIALSYL 1391
            RI +R  +++RVI+RT+KL+ WG++LQGG+SHAPD+LTYGVDMK+IRWCGILQRIA +YL
Sbjct: 92   RITSRHHAVRRVIVRTLKLLFWGIMLQGGFSHAPDKLTYGVDMKKIRWCGILQRIAFAYL 151

Query: 1390 VVAILEISTKDGQAKDLLPGWQPVLRQYHWHWLAGACVLVVYLSVIYGAYVPDWHFTVYN 1211
            VVA++EI TK  Q ++L PGW  + + Y   WL GAC+LV+YL+VIYG YVP W FTV +
Sbjct: 152  VVALMEIFTKKKQTRELPPGWLSIYKLYSSQWLMGACILVIYLAVIYGTYVPHWQFTVND 211

Query: 1210 MKSPDYGKTFNVKCGIRGTLDPPCNAVGYIDRVVLGTNHMYQHPAWKRSKVCTESSPYEG 1031
              S DYGK F V+C +RG LDPPCNAVG+IDR +LG NHMYQHPAWKRS+ CTE+SPYEG
Sbjct: 212  RDSADYGKVFTVECAVRGKLDPPCNAVGFIDREILGINHMYQHPAWKRSEACTENSPYEG 271

Query: 1030 PFRKDAASWCQAPFEPEGXXXXXXXXXXXXIGVHFGHVLVHIKDHSDRLKHWXXXXXXXX 851
            PFR  A SWC+APFEPEG            IGVHFGHVLV+++ H+ RLKHW        
Sbjct: 272  PFRTSAPSWCKAPFEPEGILSSISAVLSTIIGVHFGHVLVYMRGHAARLKHWIVMGFALL 331

Query: 850  XXGIVLHFTNAIPLNKQLYSFSYVCLTAGAAAWVFSTFYVLVDILGWRHLFLPLEWIGMN 671
              G+VLHFT+AIPLNKQLY+FSYVC+T+GAAA VFS+ Y LVDI GW+ +F PL WIGMN
Sbjct: 332  ILGLVLHFTHAIPLNKQLYTFSYVCVTSGAAALVFSSIYALVDIWGWKCIFQPLAWIGMN 391

Query: 670  AMLVYVMAAEGIFAGFINGWYYEDPHNTLIYF 575
            AMLVYVMAAEGIFAGFINGWYY DPHNTLIY+
Sbjct: 392  AMLVYVMAAEGIFAGFINGWYYNDPHNTLIYW 423


>ref|XP_004144775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
            [Cucumis sativus] gi|449490878|ref|XP_004158735.1|
            PREDICTED: heparan-alpha-glucosaminide
            N-acetyltransferase-like [Cucumis sativus]
          Length = 490

 Score =  578 bits (1490), Expect = e-162
 Identities = 273/395 (69%), Positives = 319/395 (80%)
 Frame = -2

Query: 1759 KSKRVASLDIFRGLTVAMMILVDDAGGEWQMIGHAPWNGCNLADFVMPFFLFIVGMAIAL 1580
            KSKR+ASLDIFRGLTVA+MILVDDAGGEW MIGHAPW GCNLADFVMPFFLFIVGMAIAL
Sbjct: 51   KSKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIAL 110

Query: 1579 AFKRIPNRLLSIKRVIIRTIKLILWGLILQGGYSHAPDELTYGVDMKRIRWCGILQRIAL 1400
            A KRIPN+L++I++V +RT+KL+ WGL+LQGGYSHAPD+LTYGVD+++IR  GILQRIAL
Sbjct: 111  ALKRIPNQLMAIEKVTLRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIAL 170

Query: 1399 SYLVVAILEISTKDGQAKDLLPGWQPVLRQYHWHWLAGACVLVVYLSVIYGAYVPDWHFT 1220
            +YLVVA +E+ ++  Q+         + + Y W+WL GAC+LVVY +++YG YVPDW FT
Sbjct: 171  AYLVVAFVEVLSRKTQSNVQPFNHFSIFKSYFWNWLVGACILVVYFALLYGIYVPDWQFT 230

Query: 1219 VYNMKSPDYGKTFNVKCGIRGTLDPPCNAVGYIDRVVLGTNHMYQHPAWKRSKVCTESSP 1040
            V + +S  YG+ F V CG+RG LDPPCNAVGYIDR VLG NH+Y HPAW+RS+ CTE+SP
Sbjct: 231  VTDSESVYYGRNFTVACGVRGNLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSP 290

Query: 1039 YEGPFRKDAASWCQAPFEPEGXXXXXXXXXXXXIGVHFGHVLVHIKDHSDRLKHWXXXXX 860
            Y G FR +A SWC APFEPEG            IGVHFGHVL+H +DHS RLK W     
Sbjct: 291  YAGSFRDNAPSWCFAPFEPEGILSSISAILSTIIGVHFGHVLIHFQDHSARLKQWVTMGF 350

Query: 859  XXXXXGIVLHFTNAIPLNKQLYSFSYVCLTAGAAAWVFSTFYVLVDILGWRHLFLPLEWI 680
                 G+VLHFT+AIPLNKQLY+FSYVC+T+GAAA VFS FY LVDI G R LFLPLEWI
Sbjct: 351  TLLILGLVLHFTHAIPLNKQLYTFSYVCVTSGAAALVFSVFYTLVDIWGLRPLFLPLEWI 410

Query: 679  GMNAMLVYVMAAEGIFAGFINGWYYEDPHNTLIYF 575
            GMNAMLVYVMAA GIFAGFINGWYY+DPHNTLIY+
Sbjct: 411  GMNAMLVYVMAAAGIFAGFINGWYYDDPHNTLIYW 445

