BLASTX nr result

ID: Angelica23_contig00007683 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00007683
         (1882 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   449   e-124
gb|ACU19258.1| unknown [Glycine max]                                  448   e-123
ref|XP_002302889.1| predicted protein [Populus trichocarpa] gi|2...   447   e-123
ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-lik...   445   e-122
ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana] gi|3763917...   444   e-122

>ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 287

 Score =  449 bits (1156), Expect = e-124
 Identities = 214/281 (76%), Positives = 246/281 (87%)
 Frame = +3

Query: 618  PTMKIVFGLLTLVTLGMILGALVQLAFIRRLEDSTVSPFPSFRRKHVSGNFGNFKLARGF 797
            P M+IVFGLLT VT+GMI+GAL QLA IRRLEDS  +    F R        + +L RG 
Sbjct: 3    PAMRIVFGLLTFVTVGMIIGALSQLAIIRRLEDSHGTDSLPFSRLRGLDTDRHLQLPRGI 62

Query: 798  SPWANDKDAITLRVGYVKPEIISWSPRIIVLRNFLSMEECDYLRALARPRLQVSTVVDAK 977
              W NDK+A  LR+GYVKPE+++WSPRII+L NFLSMEECDYLRA+A PRL +S VVD K
Sbjct: 63   PFWNNDKEAEVLRLGYVKPEVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTK 122

Query: 978  TGKGIKSDVRTSSGMFLNSKEKKYPMIQAIEKRISVYSQIPIENGELIQVLRYEKHQFYK 1157
            TGKGIKSDVRTSSGMFLN +E+KYPM+QAIEKRISVYSQIPIENGEL+QVLRYEK+Q+YK
Sbjct: 123  TGKGIKSDVRTSSGMFLNPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYK 182

Query: 1158 PHHDYFSDTFNLKRGGQRVATMLMYLTDNVEGGETFFPMAGSGECSCGGRMVKGMCVKPN 1337
            PHHDYFSDTFNLKRGGQR+ATMLMYL+DN+EGGET+FP+AGSGECSCGG++VKG+ VKP 
Sbjct: 183  PHHDYFSDTFNLKRGGQRIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVKGLSVKPI 242

Query: 1338 KGDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQ 1460
            KG+AVLFWSMGLDGQSDPNS+HGGCEV++GEKWSATKWMRQ
Sbjct: 243  KGNAVLFWSMGLDGQSDPNSVHGGCEVISGEKWSATKWMRQ 283


>gb|ACU19258.1| unknown [Glycine max]
          Length = 287

 Score =  448 bits (1152), Expect = e-123
 Identities = 215/281 (76%), Positives = 246/281 (87%)
 Frame = +3

Query: 618  PTMKIVFGLLTLVTLGMILGALVQLAFIRRLEDSTVSPFPSFRRKHVSGNFGNFKLARGF 797
            P M+IVFGLLT VT+GMI+GAL QLA IRRLEDS  +    FRR        + +L RG 
Sbjct: 3    PAMRIVFGLLTFVTVGMIIGALSQLAIIRRLEDSYGTDSLPFRRLRGLDTDRHLQLPRGV 62

Query: 798  SPWANDKDAITLRVGYVKPEIISWSPRIIVLRNFLSMEECDYLRALARPRLQVSTVVDAK 977
              W NDK+A  LR+GYVKPE+++WSPRII+L NFLSMEECDYLRALA PRL +STVVD K
Sbjct: 63   PFWNNDKEAEILRLGYVKPEVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTK 122

Query: 978  TGKGIKSDVRTSSGMFLNSKEKKYPMIQAIEKRISVYSQIPIENGELIQVLRYEKHQFYK 1157
            TGKGIKSDVRTSSGMFLNSKE+KYPM+QAIEKRISVYSQIPIENGEL+QVLRYEK+Q+YK
Sbjct: 123  TGKGIKSDVRTSSGMFLNSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYK 182

Query: 1158 PHHDYFSDTFNLKRGGQRVATMLMYLTDNVEGGETFFPMAGSGECSCGGRMVKGMCVKPN 1337
            P HDYF DTFNLKRGGQ +ATMLMYL+DN+EGGET+FP+AGSGECSCGG++VKG+ VKP 
Sbjct: 183  PRHDYFFDTFNLKRGGQGIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVKGLSVKPI 242

Query: 1338 KGDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQ 1460
            KG+AVLFWSMGLDGQSDPNS+HGGCEV++GEKWSATKW+RQ
Sbjct: 243  KGNAVLFWSMGLDGQSDPNSVHGGCEVISGEKWSATKWLRQ 283


>ref|XP_002302889.1| predicted protein [Populus trichocarpa] gi|222844615|gb|EEE82162.1|
            predicted protein [Populus trichocarpa]
          Length = 287

 Score =  447 bits (1151), Expect = e-123
 Identities = 217/283 (76%), Positives = 248/283 (87%)
 Frame = +3

Query: 621  TMKIVFGLLTLVTLGMILGALVQLAFIRRLEDSTVSPFPSFRRKHVSGNFGNFKLARGFS 800
            +MKIVFGLL  VT GMI+GA  QLAFI +LEDS  + FPSF+R     +    +L RG S
Sbjct: 4    SMKIVFGLLAFVTAGMIVGAFFQLAFILKLEDSYGTKFPSFKRVRKLQSDAYLQLPRGIS 63

Query: 801  PWANDKDAITLRVGYVKPEIISWSPRIIVLRNFLSMEECDYLRALARPRLQVSTVVDAKT 980
             W ND +A  LR+GYVKPEIISWSPRIIVL +FLS EECDYLRALA+PRL++STVVD KT
Sbjct: 64   HWDNDTEAAVLRIGYVKPEIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKT 123

Query: 981  GKGIKSDVRTSSGMFLNSKEKKYPMIQAIEKRISVYSQIPIENGELIQVLRYEKHQFYKP 1160
            GKGI+S VRTSSGMFL+S+EK Y ++QAIEKRISVYSQ+PIENGELIQVLRYEK+Q+YKP
Sbjct: 124  GKGIESKVRTSSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKP 183

Query: 1161 HHDYFSDTFNLKRGGQRVATMLMYLTDNVEGGETFFPMAGSGECSCGGRMVKGMCVKPNK 1340
            HHDYFSDTFNLKRGGQRVATMLMYL+DNVEGGET+FPMAGSG+CSCGG++V G+ VKP K
Sbjct: 184  HHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGKCSCGGKVVDGLSVKPIK 243

Query: 1341 GDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRST 1469
            G+AVLFWSMGLDGQSDP+SIHGGCEVL+G KWSATKWMRQR+T
Sbjct: 244  GNAVLFWSMGLDGQSDPSSIHGGCEVLSGVKWSATKWMRQRAT 286


>ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
          Length = 290

 Score =  445 bits (1145), Expect = e-122
 Identities = 211/282 (74%), Positives = 244/282 (86%)
 Frame = +3

Query: 624  MKIVFGLLTLVTLGMILGALVQLAFIRRLEDSTVSPFPSFRRKHVSGNFGNFKLARGFSP 803
            M+IVFGLLT VT+GMI+GAL+QLAF+RRLEDS  + F    R H +      +L RGF  
Sbjct: 6    MRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPN 65

Query: 804  WANDKDAITLRVGYVKPEIISWSPRIIVLRNFLSMEECDYLRALARPRLQVSTVVDAKTG 983
            W NDK+A  LR+GYVKPE++SWSPRIIVL NFLS +ECDYL+ +A  RL++STVVD KTG
Sbjct: 66   WINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTG 125

Query: 984  KGIKSDVRTSSGMFLNSKEKKYPMIQAIEKRISVYSQIPIENGELIQVLRYEKHQFYKPH 1163
            KG+KSD RTSSGMFL+  EK +PM+QAIEKRISVYSQ+P+ENGELIQVLRYEK+QFYKPH
Sbjct: 126  KGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPH 185

Query: 1164 HDYFSDTFNLKRGGQRVATMLMYLTDNVEGGETFFPMAGSGECSCGGRMVKGMCVKPNKG 1343
            HDYFSDTFNLKRGGQR+ATMLMYL++N+EGGET+FP AGSGECSCGG+ V G+ VKP KG
Sbjct: 186  HDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVPGLSVKPAKG 245

Query: 1344 DAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRST 1469
            DAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQ+ST
Sbjct: 246  DAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKST 287


>ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana] gi|3763917|gb|AAC64297.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|20197628|gb|AAM15158.1| hypothetical protein
            [Arabidopsis thaliana] gi|26450452|dbj|BAC42340.1|
            unknown protein [Arabidopsis thaliana]
            gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis
            thaliana] gi|330255112|gb|AEC10206.1| P4H isoform 1
            [Arabidopsis thaliana]
          Length = 283

 Score =  444 bits (1141), Expect = e-122
 Identities = 213/285 (74%), Positives = 246/285 (86%)
 Frame = +3

Query: 618  PTMKIVFGLLTLVTLGMILGALVQLAFIRRLEDSTVSPFPSFRRKHVSGNFGNFKLARGF 797
            P MKIVFGLLT VT+GM++G+L+QLAFI RLEDS  + FPS R         N +  R  
Sbjct: 3    PAMKIVFGLLTFVTVGMVIGSLLQLAFINRLEDSYGTGFPSLRGLRGQ----NTRYLRDV 58

Query: 798  SPWANDKDAITLRVGYVKPEIISWSPRIIVLRNFLSMEECDYLRALARPRLQVSTVVDAK 977
            S WANDKDA  LR+G VKPE++SWSPRIIVL +FLS EEC+YL+A+ARPRLQVSTVVD K
Sbjct: 59   SRWANDKDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVK 118

Query: 978  TGKGIKSDVRTSSGMFLNSKEKKYPMIQAIEKRISVYSQIPIENGELIQVLRYEKHQFYK 1157
            TGKG+KSDVRTSSGMFL   E+ YP+IQAIEKRI+V+SQ+P ENGELIQVLRYE  QFYK
Sbjct: 119  TGKGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYK 178

Query: 1158 PHHDYFSDTFNLKRGGQRVATMLMYLTDNVEGGETFFPMAGSGECSCGGRMVKGMCVKPN 1337
            PHHDYF+DTFNLKRGGQRVATMLMYLTD+VEGGET+FP+AG G+C+CGG+++KG+ VKP 
Sbjct: 179  PHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMKGISVKPT 238

Query: 1338 KGDAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRSTS 1472
            KGDAVLFWSMGLDGQSDP SIHGGCEVL+GEKWSATKWMRQ++TS
Sbjct: 239  KGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQKATS 283

