BLASTX nr result
ID: Dioscorea21_contig00007339
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00007339
         (3329 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [S...   459   e-126
ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative...   459   e-126
ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family ...   459   e-126
ref|NP_001169835.1| uncharacterized protein LOC100383727 precurs...   458   e-126
gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]                        452   e-124

>ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
 gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
          Length = 303

 Score = 459 bits (1181), Expect = e-126
 Identities = 218/276 (78%), Positives = 238/276 (86%), Gaps = 3/276 (1%)
 Frame = +2

Query: 2228 AAPNAAQFFDPTRVIQLSWIPRAFLYKRFLSNEECDHLIALAKDKLEKSMVADNDSGKSV 2407
            AA   A  FDP+RV+QLSW PRAFL+K FLS+ ECDHLI LAKDKLEKSMVADN+SGKSV
Sbjct: 27   AASRGAGSFDPSRVVQLSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSV 86

Query: 2408 MSEVRTSSGMFLEKRQDEIVSNIETRLAAWTLLPEENGESIQILHYENGEKYEPHFDYFH 2587
             SEVRTSSGMFLEK+QDE+V  IE R+AAWT LP ENGESIQILHY+NGEKYEPH+DYFH
Sbjct: 87   QSEVRTSSGMFLEKKQDEVVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFH 146

Query: 2588 DKANQELGGHRIATVLMYLSNVSKGGETIFPNSEGKLSQPKDETWSDCAKNGYAVKPAKG 2767
            DK NQ LGGHRIATVLMYLSNV KGGETIFPN+EGKL QPKD+TWSDCA+NGYAVKP KG
Sbjct: 147  DKNNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKG 206

Query: 2768 DALLFFSLHPDATTDTSSLHGSCPVIEGEKWSATKWIHVRSFE---KIERSSDTCADDNE 2938
            DALLFFSLHPDATTD+ SLHGSCPVIEG+KWSATKWIHVRSF+   K   SSD C DDN
Sbjct: 207  DALLFFSLHPDATTDSESLHGSCPVIEGQKWSATKWIHVRSFDLPVKQPGSSDGCEDDNV 266

Query: 2939 LCALWASAGECEKNPNYMVGSKGAVGYCRKSCNACS 3046
            LC  WA+ GEC KNPNYMVG+K A G+CRKSC  C+
Sbjct: 267  LCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVCA 302

>ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 311

 Score = 459 bits (1180), Expect = e-126
 Identities = 219/291 (75%), Positives = 246/291 (84%), Gaps = 2/291 (0%)
 Frame = +2

Query: 2183 NPSHSSTLRLPNDRRAAPNAAQFFDPTRVIQLSWIPRAFLYKRFLSNEECDHLIALAKDK 2362
            N    S LRL    +    +++ FDPTRV QLSW PRAFLYK FLS EECDHLI LA+DK
Sbjct: 25   NEIQGSVLRL----KKGVVSSRIFDPTRVTQLSWHPRAFLYKGFLSYEECDHLIDLARDK 80

Query: 2363 LEKSMVADNDSGKSVMSEVRTSSGMFLEKRQDEIVSNIETRLAAWTLLPEENGESIQILH 2542
            LEKSMVADN+SGKS+ SEVRTSSGMF+ K QDEIV++IE R+AAWT LPEENGES+QILH
Sbjct: 81   LEKSMVADNESGKSIESEVRTSSGMFIAKAQDEIVADIEARIAAWTFLPEENGESMQILH 140

Query: 2543 YENGEKYEPHFDYFHDKANQELGGHRIATVLMYLSNVSKGGETIFPNSEGKLSQPKDETW 2722
            YE+G+KYEPHFDYFHDKANQELGGHR+ATVLMYLSNV KGGET+FPN+EGKLSQPK+++W
Sbjct: 141  YEHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSW 200

Query: 2723 SDCAKNGYAVKPAKGDALLFFSLHPDATTDTSSLHGSCPVIEGEKWSATKWIHVRSFEKI 2902
            SDCAK GYAVKP KGDALLFFSLHPDATTD+ SLHGSCPVIEGEKWSATKWIHVRSFEK
Sbjct: 201  SDCAKGGYAVKPEKGDALLFFSLHPDATTDSDSLHGSCPVIEGEKWSATKWIHVRSFEKS 260

Query: 2903 --ERSSDTCADDNELCALWASAGECEKNPNYMVGSKGAVGYCRKSCNACSS 3049
              +     C D+N+ C LWA AGEC+KNP YM+GS GA GYCRKSC  C+S
Sbjct: 261  FKQLGKGDCVDENDHCPLWAKAGECKKNPLYMIGSGGANGYCRKSCKVCTS 311

>ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus trichocarpa]
 gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus trichocarpa]
          Length = 308

 Score = 459 bits (1180), Expect = e-126
 Identities = 216/268 (80%), Positives = 237/268 (88%), Gaps = 2/268 (0%)
 Frame = +2

Query: 2252 FDPTRVIQLSWIPRAFLYKRFLSNEECDHLIALAKDKLEKSMVADNDSGKSVMSEVRTSS 2431
            FDPTRV QLSW PRAFLYK FLS+EECDHL+ LA+DKLEKSMVADN+SGKS+ SEVRTSS
Sbjct: 41   FDPTRVTQLSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESEVRTSS 100

Query: 2432 GMFLEKRQDEIVSNIETRLAAWTLLPEENGESIQILHYENGEKYEPHFDYFHDKANQELG 2611
            GMF+ K QDEIV +IE R+AAWT LP+ENGESIQILHYE+G+KYEPHFDYFHDKANQELG
Sbjct: 101  GMFIGKSQDEIVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELG 160

Query: 2612 GHRIATVLMYLSNVSKGGETIFPNSEGKLSQPKDETWSDCAKNGYAVKPAKGDALLFFSL 2791
            GHR+ TVLMYLSNV KGGET+FPNSEGK  QPKD++WSDCAKNGYAVKP KGDALLFFSL
Sbjct: 161  GHRVVTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCAKNGYAVKPQKGDALLFFSL 220

Query: 2792 HPDATTDTSSLHGSCPVIEGEKWSATKWIHVRSFEKI--ERSSDTCADDNELCALWASAG 2965
            HPDATTDT+SLHGSCPVIEGEKWSATKWIHVRSFEK     +S  C D+NE C LWA AG
Sbjct: 221  HPDATTDTNSLHGSCPVIEGEKWSATKWIHVRSFEKSLKHAASGGCIDENENCPLWAKAG 280

Query: 2966 ECEKNPNYMVGSKGAVGYCRKSCNACSS 3049
            EC+KNP YMVGS+G+ G CRKSC  CSS
Sbjct: 281  ECQKNPVYMVGSEGSYGSCRKSCKVCSS 308

>ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
 gi|224031897|gb|ACN35024.1| unknown [Zea mays]
 gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
 gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
          Length = 299

 Score = 458 bits (1179), Expect = e-126
 Identities = 217/276 (78%), Positives = 239/276 (86%), Gaps = 3/276 (1%)
 Frame = +2

Query: 2228 AAPNAAQFFDPTRVIQLSWIPRAFLYKRFLSNEECDHLIALAKDKLEKSMVADNDSGKSV 2407
            AA   A  FDP+RV+QLSW PRAFL+K FLS+ ECDHLIALAKDKLEKSMVADN+SGKSV
Sbjct: 23   AASRGAGSFDPSRVVQLSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSV 82

Query: 2408 MSEVRTSSGMFLEKRQDEIVSNIETRLAAWTLLPEENGESIQILHYENGEKYEPHFDYFH 2587
             SEVRTSSGMFLE++QDE+V+ IE R++AWT LP ENGESIQILHY+NGEKYEPH+DYFH
Sbjct: 83   QSEVRTSSGMFLERKQDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFH 142

Query: 2588 DKANQELGGHRIATVLMYLSNVSKGGETIFPNSEGKLSQPKDETWSDCAKNGYAVKPAKG 2767
            DK NQ LGGHRIATVLMYLSNV KGGETIFPN+EGKL QPKD TWSDCA+NGYAVKP KG
Sbjct: 143  DKKNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKG 202

Query: 2768 DALLFFSLHPDATTDTSSLHGSCPVIEGEKWSATKWIHVRSFE---KIERSSDTCADDNE 2938
            DALLFFSLHPDATTD+ SLHGSCPVIEG+KWSATKWIHVRSF+   K   SSD C DDN
Sbjct: 203  DALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDLPVKQPGSSDGCEDDNI 262

Query: 2939 LCALWASAGECEKNPNYMVGSKGAVGYCRKSCNACS 3046
            LC  WA+ GEC KNPNYMVG+K A G+CRKSC  C+
Sbjct: 263  LCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVCA 298

>gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score = 452 bits (1164), Expect = e-124
 Identities = 214/276 (77%), Positives = 237/276 (85%), Gaps = 3/276 (1%)
 Frame = +2

Query: 2228 AAPNAAQFFDPTRVIQLSWIPRAFLYKRFLSNEECDHLIALAKDKLEKSMVADNDSGKSV 2407
            AA   A  FDP+RV+QLSW PRAFL+K FL + ECDHLIALAKDKLEKSMVADN SGKSV
Sbjct: 22   AASRGAGSFDPSRVVQLSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSV 81

Query: 2408 MSEVRTSSGMFLEKRQDEIVSNIETRLAAWTLLPEENGESIQILHYENGEKYEPHFDYFH 2587
             SEVRTSSGMFLEK+QDE+V+ IE R++AWT LP ENGE+IQILHY+NGEKYEPH+DYFH
Sbjct: 82   QSEVRTSSGMFLEKKQDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFH 141

Query: 2588 DKANQELGGHRIATVLMYLSNVSKGGETIFPNSEGKLSQPKDETWSDCAKNGYAVKPAKG 2767
            DK NQ LGGHRIATVLMYLSNV KGGETIFPN+EGKL QPKD+TWSDCA+NGYAVKP KG
Sbjct: 142  DKNNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKG 201

Query: 2768 DALLFFSLHPDATTDTSSLHGSCPVIEGEKWSATKWIHVRSFE---KIERSSDTCADDNE 2938
            DALLFFSLHPD+TTD+ SLHGSCPVIEG+KWSATKWIHVRSF+   K    SD C DDN
Sbjct: 202  DALLFFSLHPDSTTDSDSLHGSCPVIEGQKWSATKWIHVRSFDLTVKQPGPSDGCEDDNV 261

Query: 2939 LCALWASAGECEKNPNYMVGSKGAVGYCRKSCNACS 3046
            LC  WA+ GEC KNPNYMVG+K A G+CRKSC  C+
Sbjct: 262  LCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVCA 297