BLASTX nr result
ID: Cephaelis21_contig00007176
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00007176
         (1414 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-lik...   278   e-115
ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana] gi|3763917...   270   e-115
ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   275   e-115
gb|ACU19258.1| unknown [Glycine max]                                  265   e-114
ref|XP_002302889.1| predicted protein [Populus trichocarpa] gi|2...   270   e-113

>ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
          Length = 290

 Score = 278 bits (712), Expect(2) = e-115
 Identities = 137/188 (72%), Positives = 146/188 (77%)
 Frame = -3

Query: 845 KSSVRTSSGMFLSHEERKYPIIQAIEKRISVYSQVPVENGELIQVLRYEKNQFYRPHHDY 666
           KS  RTSSGMFLSH E+ +P++QAIEKRISVYSQVPVENGELIQVLRYEKNQFY+PHHDY
Sbjct: 129 KSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDY 188

Query: 665 FSDTFNLKRGGQRVATMLMYLSNNVEGGETYFPMXXXXXXXXXXXXXXXXXXXXXXGETY 486
           FSDTFNLKRGGQR+ATMLMYLS N+EGGET                             Y
Sbjct: 189 FSDTFNLKRGGQRIATMLMYLSENIEGGET-----------------------------Y 219

Query: 485 FPMAGTGECSCGGKMVKGLCVKPSKGDAVLFWSMGLDGQSDQYSIHGGCEVLGGEKWSAT 306
           FP AG+GECSCGGK V GL VKP+KGDAVLFWSMGLDGQSD  SIHGGCEVL GEKWSAT
Sbjct: 220 FPKAGSGECSCGGKTVPGLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSAT 279

Query: 305 KWMRQKET 282
           KWMRQK T
Sbjct: 280 KWMRQKST 287

 Score = 166 bits (419), Expect(2) = e-115
 Identities = 85/132 (64%), Positives = 102/132 (77%), Gaps = 2/132 (1%)
 Frame = -2

Query: 1305 STRMKIVFGLLTFVTVGMIIGALLQLAFIRRLEDSYGSA--TSFRRTLEGRTGSRSLTRG 1132
            S++M+IVFGLLTFVTVGMIIGALLQLAF+RRLEDS G+    + R           L RG
Sbjct: 3    SSQMRIVFGLLTFVTVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRG 62

Query: 1131 YSHWAYDKDAVILRVGYVKPEVISWSPRIILFHNFLSPEECDYLRAISLPRLQTSTVVDA 952
            + +W  DK+A ILR+GYVKPEV+SWSPRII+ HNFLS +ECDYL+ I+L RL+ STVVD
Sbjct: 63   FPNWINDKEAEILRLGYVKPEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDT 122

Query: 951  KTGKGIKSSVRT 916
            KTGKG+KS  RT
Sbjct: 123  KTGKGVKSDFRT 134

>ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
 gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
 gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
 gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
 gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
 gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
          Length = 283

 Score = 270 bits (690), Expect(2) = e-115
 Identities = 132/189 (69%), Positives = 148/189 (78%)
 Frame = -3

Query: 845 KSSVRTSSGMFLSHEERKYPIIQAIEKRISVYSQVPVENGELIQVLRYEKNQFYRPHHDY 666
           KS VRTSSGMFL+H ER YPIIQAIEKRI+V+SQVP ENGELIQVLRYE  QFY+PHHDY
Sbjct: 124 KSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDY 183

Query: 665 FSDTFNLKRGGQRVATMLMYLSNNVEGGETYFPMXXXXXXXXXXXXXXXXXXXXXXGETY 486
           F+DTFNLKRGGQRVATMLMYL+++VEGGET                             Y
Sbjct: 184 FADTFNLKRGGQRVATMLMYLTDDVEGGET-----------------------------Y 214

Query: 485 FPMAGTGECSCGGKMVKGLCVKPSKGDAVLFWSMGLDGQSDQYSIHGGCEVLGGEKWSAT 306
           FP+AG G+C+CGGK++KG+ VKP+KGDAVLFWSMGLDGQSD  SIHGGCEVL GEKWSAT
Sbjct: 215 FPLAGDGDCTCGGKIMKGISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSAT 274

Query: 305 KWMRQKETN 279
           KWMRQK T+
Sbjct: 275 KWMRQKATS 283

 Score = 174 bits (440), Expect(2) = e-115
 Identities = 90/127 (70%), Positives = 104/127 (81%)
 Frame = -2

Query: 1296 MKIVFGLLTFVTVGMIIGALLQLAFIRRLEDSYGSATSFRRTLEGRTGSRSLTRGYSHWA 1117
            MKIVFGLLTFVTVGM+IG+LLQLAFI RLEDSYG+     R L G+  +R L R  S WA
Sbjct: 5    MKIVFGLLTFVTVGMVIGSLLQLAFINRLEDSYGTGFPSLRGLRGQN-TRYL-RDVSRWA 62

Query: 1116 YDKDAVILRVGYVKPEVISWSPRIILFHNFLSPEECDYLRAISLPRLQTSTVVDAKTGKG 937
             DKDA +LR+G VKPEV+SWSPRII+ H+FLSPEEC+YL+AI+ PRLQ STVVD KTGKG
Sbjct: 63   NDKDAELLRIGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKG 122

Query: 936  IKSSVRT 916
            +KS VRT
Sbjct: 123  VKSDVRT 129

>ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 287

 Score = 275 bits (702), Expect(2) = e-115
 Identities = 131/185 (70%), Positives = 149/185 (80%)
 Frame = -3

Query: 845 KSSVRTSSGMFLSHEERKYPIIQAIEKRISVYSQVPVENGELIQVLRYEKNQFYRPHHDY 666
           KS VRTSSGMFL+ +ERKYP++QAIEKRISVYSQ+P+ENGEL+QVLRYEKNQ+Y+PHHDY
Sbjct: 128 KSDVRTSSGMFLNPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDY 187

Query: 665 FSDTFNLKRGGQRVATMLMYLSNNVEGGETYFPMXXXXXXXXXXXXXXXXXXXXXXGETY 486
           FSDTFNLKRGGQR+ATMLMYLS+N+EGGET                             Y
Sbjct: 188 FSDTFNLKRGGQRIATMLMYLSDNIEGGET-----------------------------Y 218

Query: 485 FPMAGTGECSCGGKMVKGLCVKPSKGDAVLFWSMGLDGQSDQYSIHGGCEVLGGEKWSAT 306
           FP+AG+GECSCGGK+VKGL VKP KG+AVLFWSMGLDGQSD  S+HGGCEV+ GEKWSAT
Sbjct: 219 FPLAGSGECSCGGKLVKGLSVKPIKGNAVLFWSMGLDGQSDPNSVHGGCEVISGEKWSAT 278

Query: 305 KWMRQ 291
           KWMRQ
Sbjct: 279 KWMRQ 283

 Score = 167 bits (423), Expect(2) = e-115
 Identities = 89/129 (68%), Positives = 99/129 (76%), Gaps = 2/129 (1%)
 Frame = -2

Query: 1296 MKIVFGLLTFVTVGMIIGALLQLAFIRRLEDSYGSATSFRRTLEGRTGSR--SLTRGYSH 1123
            M+IVFGLLTFVTVGMIIGAL QLA IRRLEDS+G+ +     L G    R   L RG
Sbjct: 5    MRIVFGLLTFVTVGMIIGALSQLAIIRRLEDSHGTDSLPFSRLRGLDTDRHLQLPRGIPF 64

Query: 1122 WAYDKDAVILRVGYVKPEVISWSPRIILFHNFLSPEECDYLRAISLPRLQTSTVVDAKTG 943
            W  DK+A +LR+GYVKPEV++WSPRIIL HNFLS EECDYLRAI+LPRL  S VVD KTG
Sbjct: 65   WNNDKEAEVLRLGYVKPEVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTG 124

Query: 942  KGIKSSVRT 916
            KGIKS VRT
Sbjct: 125  KGIKSDVRT 133

>gb|ACU19258.1| unknown [Glycine max]
          Length = 287

 Score = 265 bits (678), Expect(2) = e-114
 Identities = 127/185 (68%), Positives = 146/185 (78%)
 Frame = -3

Query: 845 KSSVRTSSGMFLSHEERKYPIIQAIEKRISVYSQVPVENGELIQVLRYEKNQFYRPHHDY 666
           KS VRTSSGMFL+ +ERKYP++QAIEKRISVYSQ+P+ENGEL+QVLRYEKNQ+Y+P HDY
Sbjct: 128 KSDVRTSSGMFLNSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDY 187

Query: 665 FSDTFNLKRGGQRVATMLMYLSNNVEGGETYFPMXXXXXXXXXXXXXXXXXXXXXXGETY 486
           F DTFNLKRGGQ +ATMLMYLS+N+EGGET                             Y
Sbjct: 188 FFDTFNLKRGGQGIATMLMYLSDNIEGGET-----------------------------Y 218

Query: 485 FPMAGTGECSCGGKMVKGLCVKPSKGDAVLFWSMGLDGQSDQYSIHGGCEVLGGEKWSAT 306
           FP+AG+GECSCGGK+VKGL VKP KG+AVLFWSMGLDGQSD  S+HGGCEV+ GEKWSAT
Sbjct: 219 FPLAGSGECSCGGKLVKGLSVKPIKGNAVLFWSMGLDGQSDPNSVHGGCEVISGEKWSAT 278

Query: 305 KWMRQ 291
           KW+RQ
Sbjct: 279 KWLRQ 283

 Score = 173 bits (438), Expect(2) = e-114
 Identities = 92/129 (71%), Positives = 101/129 (78%), Gaps = 2/129 (1%)
 Frame = -2

Query: 1296 MKIVFGLLTFVTVGMIIGALLQLAFIRRLEDSYGSATSFRRTLEGRTGSR--SLTRGYSH 1123
            M+IVFGLLTFVTVGMIIGAL QLA IRRLEDSYG+ +   R L G    R   L RG
Sbjct: 5    MRIVFGLLTFVTVGMIIGALSQLAIIRRLEDSYGTDSLPFRRLRGLDTDRHLQLPRGVPF 64

Query: 1122 WAYDKDAVILRVGYVKPEVISWSPRIILFHNFLSPEECDYLRAISLPRLQTSTVVDAKTG 943
            W  DK+A ILR+GYVKPEV++WSPRIIL HNFLS EECDYLRA++LPRL  STVVD KTG
Sbjct: 65   WNNDKEAEILRLGYVKPEVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTG 124

Query: 942  KGIKSSVRT 916
            KGIKS VRT
Sbjct: 125  KGIKSDVRT 133

>ref|XP_002302889.1| predicted protein [Populus trichocarpa]
 gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
          Length = 287

 Score = 270 bits (690), Expect(2) = e-113
 Identities = 134/188 (71%), Positives = 147/188 (78%)
 Frame = -3

Query: 845 KSSVRTSSGMFLSHEERKYPIIQAIEKRISVYSQVPVENGELIQVLRYEKNQFYRPHHDY 666
           +S VRTSSGMFLS EE+ Y ++QAIEKRISVYSQVP+ENGELIQVLRYEKNQ+Y+PHHDY
Sbjct: 128 ESKVRTSSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKPHHDY 187

Query: 665 FSDTFNLKRGGQRVATMLMYLSNNVEGGETYFPMXXXXXXXXXXXXXXXXXXXXXXGETY 486
           FSDTFNLKRGGQRVATMLMYLS+NVEGGETY
Sbjct: 188 FSDTFNLKRGGQRVATMLMYLSDNVEGGETY----------------------------- 218

Query: 485 FPMAGTGECSCGGKMVKGLCVKPSKGDAVLFWSMGLDGQSDQYSIHGGCEVLGGEKWSAT 306
           FPMAG+G+CSCGGK+V GL VKP KG+AVLFWSMGLDGQSD  SIHGGCEVL G KWSAT
Sbjct: 219 FPMAGSGKCSCGGKVVDGLSVKPIKGNAVLFWSMGLDGQSDPSSIHGGCEVLSGVKWSAT 278

Query: 305 KWMRQKET 282
           KWMRQ+ T
Sbjct: 279 KWMRQRAT 286

 Score = 167 bits (423), Expect(2) = e-113
 Identities = 88/135 (65%), Positives = 106/135 (78%), Gaps = 2/135 (1%)
 Frame = -2

Query: 1314 MASSTRMKIVFGLLTFVTVGMIIGALLQLAFIRRLEDSYGSA-TSFRRTLEGRTGSR-SL 1141
            MASS  MKIVFGLL FVT GMI+GA  QLAFI +LEDSYG+   SF+R  + ++ +   L
Sbjct: 1    MASS--MKIVFGLLAFVTAGMIVGAFFQLAFILKLEDSYGTKFPSFKRVRKLQSDAYLQL 58

Query: 1140 TRGYSHWAYDKDAVILRVGYVKPEVISWSPRIILFHNFLSPEECDYLRAISLPRLQTSTV 961
             RG SHW  D +A +LR+GYVKPE+ISWSPRII+ H+FLS EECDYLRA++ PRL+ STV
Sbjct: 59   PRGISHWDNDTEAAVLRIGYVKPEIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTV 118

Query: 960  VDAKTGKGIKSSVRT 916
            VD KTGKGI+S VRT
Sbjct: 119  VDVKTGKGIESKVRT 133