BLASTX nr result

ID: Coptis25_contig00026519

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00026519
         (1180 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002302889.1| predicted protein [Populus trichocarpa] gi|2...   434   e-119
ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   432   e-119
ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-lik...   429   e-118
gb|ACU19258.1| unknown [Glycine max]                                  425   e-116
ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana] gi|3763917...   425   e-116

>ref|XP_002302889.1| predicted protein [Populus trichocarpa] gi|222844615|gb|EEE82162.1|
           predicted protein [Populus trichocarpa]
          Length = 287

 Score =  434 bits (1115), Expect = e-119
 Identities = 211/284 (74%), Positives = 239/284 (84%), Gaps = 23/284 (8%)
 Frame = +3

Query: 126 SMKIVFGLLTFVTVGMIIGSLFQLAFITRLEESSGG-----------------------S 236
           SMKIVFGLL FVT GMI+G+ FQLAFI +LE+S G                        S
Sbjct: 4   SMKIVFGLLAFVTAGMIVGAFFQLAFILKLEDSYGTKFPSFKRVRKLQSDAYLQLPRGIS 63

Query: 237 HWINDKEAEVLRLGFVKPEIVSWSPRIIVLHNFLSMEECDYLRAIAKPRLRFSTVVDTKT 416
           HW ND EA VLR+G+VKPEI+SWSPRIIVLH+FLS EECDYLRA+AKPRLR STVVD KT
Sbjct: 64  HWDNDTEAAVLRIGYVKPEIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKT 123

Query: 417 GKGIKSDVRTSSGMFLTSGERKYSIIQAIEKRISVYSQIPVENGELIQVLRYEKSELYRP 596
           GKGI+S VRTSSGMFL+S E+ Y ++QAIEKRISVYSQ+P+ENGELIQVLRYEK++ Y+P
Sbjct: 124 GKGIESKVRTSSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKP 183

Query: 597 HHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGTGECSCGRETKKGMSVKPVK 776
           HHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAG+G+CSCG +   G+SVKP+K
Sbjct: 184 HHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGKCSCGGKVVDGLSVKPIK 243

Query: 777 GDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRSTF 908
           G+AVLFWSMGLDGQSDP+SIHGGCEVL+G KWSATKWMRQR+TF
Sbjct: 244 GNAVLFWSMGLDGQSDPSSIHGGCEVLSGVKWSATKWMRQRATF 287


>ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 287

 Score =  432 bits (1111), Expect = e-119
 Identities = 210/280 (75%), Positives = 237/280 (84%), Gaps = 23/280 (8%)
 Frame = +3

Query: 126 SMKIVFGLLTFVTVGMIIGSLFQLAFITRLEESSGGSH---------------------- 239
           +M+IVFGLLTFVTVGMIIG+L QLA I RLE+S G                         
Sbjct: 4   AMRIVFGLLTFVTVGMIIGALSQLAIIRRLEDSHGTDSLPFSRLRGLDTDRHLQLPRGIP 63

Query: 240 -WINDKEAEVLRLGFVKPEIVSWSPRIIVLHNFLSMEECDYLRAIAKPRLRFSTVVDTKT 416
            W NDKEAEVLRLG+VKPE+++WSPRII+LHNFLSMEECDYLRAIA PRL  S VVDTKT
Sbjct: 64  FWNNDKEAEVLRLGYVKPEVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKT 123

Query: 417 GKGIKSDVRTSSGMFLTSGERKYSIIQAIEKRISVYSQIPVENGELIQVLRYEKSELYRP 596
           GKGIKSDVRTSSGMFL   ERKY ++QAIEKRISVYSQIP+ENGEL+QVLRYEK++ Y+P
Sbjct: 124 GKGIKSDVRTSSGMFLNPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKP 183

Query: 597 HHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGTGECSCGRETKKGMSVKPVK 776
           HHDYFSDTFNLKRGGQR+ATMLMYLSDN+EGGETYFP+AG+GECSCG +  KG+SVKP+K
Sbjct: 184 HHDYFSDTFNLKRGGQRIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVKGLSVKPIK 243

Query: 777 GDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQ 896
           G+AVLFWSMGLDGQSDPNS+HGGCEV++GEKWSATKWMRQ
Sbjct: 244 GNAVLFWSMGLDGQSDPNSVHGGCEVISGEKWSATKWMRQ 283


>ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
          Length = 290

 Score =  429 bits (1103), Expect = e-118
 Identities = 208/287 (72%), Positives = 238/287 (82%), Gaps = 23/287 (8%)
 Frame = +3

Query: 114 IIMFSMKIVFGLLTFVTVGMIIGSLFQLAFITRLEESSGGS------------------- 236
           ++   M+IVFGLLTFVTVGMIIG+L QLAF+ RLE+S G                     
Sbjct: 1   MVSSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLP 60

Query: 237 ----HWINDKEAEVLRLGFVKPEIVSWSPRIIVLHNFLSMEECDYLRAIAKPRLRFSTVV 404
               +WINDKEAE+LRLG+VKPE+VSWSPRIIVLHNFLS +ECDYL+ IA  RL  STVV
Sbjct: 61  RGFPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVV 120

Query: 405 DTKTGKGIKSDVRTSSGMFLTSGERKYSIIQAIEKRISVYSQIPVENGELIQVLRYEKSE 584
           DTKTGKG+KSD RTSSGMFL+  E+ + ++QAIEKRISVYSQ+PVENGELIQVLRYEK++
Sbjct: 121 DTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQ 180

Query: 585 LYRPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGTGECSCGRETKKGMSV 764
            Y+PHHDYFSDTFNLKRGGQR+ATMLMYLS+N+EGGETYFP AG+GECSCG +T  G+SV
Sbjct: 181 FYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSV 240

Query: 765 KPVKGDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRST 905
           KP KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQ+ST
Sbjct: 241 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKST 287


>gb|ACU19258.1| unknown [Glycine max]
          Length = 287

 Score =  425 bits (1093), Expect = e-116
 Identities = 206/280 (73%), Positives = 236/280 (84%), Gaps = 23/280 (8%)
 Frame = +3

Query: 126 SMKIVFGLLTFVTVGMIIGSLFQLAFITRLEESSGGSH---------------------- 239
           +M+IVFGLLTFVTVGMIIG+L QLA I RLE+S G                         
Sbjct: 4   AMRIVFGLLTFVTVGMIIGALSQLAIIRRLEDSYGTDSLPFRRLRGLDTDRHLQLPRGVP 63

Query: 240 -WINDKEAEVLRLGFVKPEIVSWSPRIIVLHNFLSMEECDYLRAIAKPRLRFSTVVDTKT 416
            W NDKEAE+LRLG+VKPE+++WSPRII+LHNFLSMEECDYLRA+A PRL  STVVDTKT
Sbjct: 64  FWNNDKEAEILRLGYVKPEVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKT 123

Query: 417 GKGIKSDVRTSSGMFLTSGERKYSIIQAIEKRISVYSQIPVENGELIQVLRYEKSELYRP 596
           GKGIKSDVRTSSGMFL S ERKY ++QAIEKRISVYSQIP+ENGEL+QVLRYEK++ Y+P
Sbjct: 124 GKGIKSDVRTSSGMFLNSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKP 183

Query: 597 HHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGTGECSCGRETKKGMSVKPVK 776
            HDYF DTFNLKRGGQ +ATMLMYLSDN+EGGETYFP+AG+GECSCG +  KG+SVKP+K
Sbjct: 184 RHDYFFDTFNLKRGGQGIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVKGLSVKPIK 243

Query: 777 GDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQ 896
           G+AVLFWSMGLDGQSDPNS+HGGCEV++GEKWSATKW+RQ
Sbjct: 244 GNAVLFWSMGLDGQSDPNSVHGGCEVISGEKWSATKWLRQ 283


>ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana] gi|3763917|gb|AAC64297.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|20197628|gb|AAM15158.1| hypothetical protein
           [Arabidopsis thaliana] gi|26450452|dbj|BAC42340.1|
           unknown protein [Arabidopsis thaliana]
           gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis
           thaliana] gi|330255112|gb|AEC10206.1| P4H isoform 1
           [Arabidopsis thaliana]
          Length = 283

 Score =  425 bits (1093), Expect = e-116
 Identities = 207/279 (74%), Positives = 236/279 (84%), Gaps = 19/279 (6%)
 Frame = +3

Query: 126 SMKIVFGLLTFVTVGMIIGSLFQLAFITRLEESSGG-------------------SHWIN 248
           +MKIVFGLLTFVTVGM+IGSL QLAFI RLE+S G                    S W N
Sbjct: 4   AMKIVFGLLTFVTVGMVIGSLLQLAFINRLEDSYGTGFPSLRGLRGQNTRYLRDVSRWAN 63

Query: 249 DKEAEVLRLGFVKPEIVSWSPRIIVLHNFLSMEECDYLRAIAKPRLRFSTVVDTKTGKGI 428
           DK+AE+LR+G VKPE+VSWSPRIIVLH+FLS EEC+YL+AIA+PRL+ STVVD KTGKG+
Sbjct: 64  DKDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGV 123

Query: 429 KSDVRTSSGMFLTSGERKYSIIQAIEKRISVYSQIPVENGELIQVLRYEKSELYRPHHDY 608
           KSDVRTSSGMFLT  ER Y IIQAIEKRI+V+SQ+P ENGELIQVLRYE  + Y+PHHDY
Sbjct: 124 KSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDY 183

Query: 609 FSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGTGECSCGRETKKGMSVKPVKGDAV 788
           F+DTFNLKRGGQRVATMLMYL+D+VEGGETYFP+AG G+C+CG +  KG+SVKP KGDAV
Sbjct: 184 FADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMKGISVKPTKGDAV 243

Query: 789 LFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRST 905
           LFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQ++T
Sbjct: 244 LFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQKAT 282

