BLASTX nr result
ID: Sinomenium21_contig00022751
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Sinomenium21_contig00022751 (819 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006841992.1| hypothetical protein AMTR_s00144p00069890 [A... 342 7e-92 ref|XP_006466497.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 341 2e-91 ref|XP_006426046.1| hypothetical protein CICLE_v10027070mg [Citr... 339 8e-91 ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 335 9e-90 ref|XP_006412300.1| hypothetical protein EUTSA_v10025904mg [Eutr... 335 1e-89 dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum] 335 2e-89 ref|XP_006364423.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 333 5e-89 ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 332 8e-89 ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. ly... 331 2e-88 ref|XP_006486947.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 330 4e-88 ref|XP_006284244.1| hypothetical protein CARUB_v10005407mg [Caps... 330 5e-88 ref|XP_004233448.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 330 5e-88 gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana] 329 7e-88 ref|XP_006422858.1| hypothetical protein CICLE_v10028993mg [Citr... 329 9e-88 ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family pro... 329 9e-88 gb|EXB76669.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notab... 328 1e-87 ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative... 328 1e-87 ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative... 327 3e-87 emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera] 327 3e-87 ref|XP_007198985.1| hypothetical protein PRUPE_ppa023428mg [Prun... 324 3e-86 >ref|XP_006841992.1| hypothetical protein AMTR_s00144p00069890 [Amborella trichopoda] gi|548844029|gb|ERN03667.1| hypothetical protein AMTR_s00144p00069890 [Amborella trichopoda] Length = 271 Score = 342 bits (878), Expect = 7e-92 Identities = 167/257 (64%), Positives = 206/257 (80%) Frame = -3 Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593 M+ K KG WG +KL P VFL CSV S+DI + R+ R+ E+ D+ Sbjct: 1 MRGKGKGSWGLSSKLEFPTVFLLCSVFFLAGIFGTMILSRDIRD-QSRTRRLQEAFDEFS 59 Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413 E+F +P+G+TGDSS+ +IP QVLSWKP A+YFP+FA+ +QC +I++A+ +L PS+LAF Sbjct: 60 EDFTTMPHGETGDSSIDTIPFQVLSWKPRALYFPKFATKEQCQHVIKIAQSSLVPSSLAF 119 Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233 R+GET ENTKGIRTSSGTF+S++EDK+GILD IE+KIAK T++P +HGEAFNILRYEIGQ Sbjct: 120 REGETKENTKGIRTSSGTFVSASEDKSGILDLIEEKIAKVTMLPRTHGEAFNILRYEIGQ 179 Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53 RY SH+DAF+PAEYGPQKSQRVASFLLYLSDVE GGETMFP+ENGLN+ S YDYKKCVGL Sbjct: 180 RYHSHFDAFNPAEYGPQKSQRVASFLLYLSDVEGGGETMFPFENGLNVHSEYDYKKCVGL 239 Query: 52 TVKPRQGDGLLFYSLLP 2 VKP+ GDGLLFYS+ P Sbjct: 240 KVKPQLGDGLLFYSVYP 256 >ref|XP_006466497.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Citrus sinensis] Length = 289 Score = 341 bits (875), Expect = 2e-91 Identities = 167/253 (66%), Positives = 203/253 (80%) Frame = -3 Query: 760 SKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFE 581 +K W +K+ LPFVFL C SQD+T+ARP S+R++ESV D E++ Sbjct: 7 NKANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARP-SARVVESVKD---EYK 62 Query: 580 AIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGE 401 +P+G+ GD SV +IP QVLSW P A+YFP FA+ +QC SII MAK+NL+PSTLA RKGE Sbjct: 63 WMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE 122 Query: 400 TTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLS 221 T +NT+GIRTSSG FIS+AED++G LD IE+KIAK T++P HGEAFNILRY+IGQ+Y S Sbjct: 123 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRIHGEAFNILRYKIGQKYNS 182 Query: 220 HYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKP 41 HYDAFDP EYGPQKSQRVASFL+YL+D+EEGGETMFP+ENG+N D SYDY+KC+GL VKP Sbjct: 183 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 242 Query: 40 RQGDGLLFYSLLP 2 RQGDGLLFYSLLP Sbjct: 243 RQGDGLLFYSLLP 255 >ref|XP_006426046.1| hypothetical protein CICLE_v10027070mg [Citrus clementina] gi|557528036|gb|ESR39286.1| hypothetical protein CICLE_v10027070mg [Citrus clementina] Length = 289 Score = 339 bits (869), Expect = 8e-91 Identities = 167/253 (66%), Positives = 202/253 (79%) Frame = -3 Query: 760 SKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFE 581 +K W +K+ LPFVFL C SQD+T ARP S+R++ESV D E+E Sbjct: 7 NKANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTVARP-SARVVESVKD---EYE 62 Query: 580 AIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGE 401 +P+G+ GD SV +IP QVLSW P A+YFP FA+ +QC SII MAK+NL+PSTLA RKGE Sbjct: 63 WMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE 122 Query: 400 TTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLS 221 T +NT+GIRTSSG FIS+AED++G LD IE+KIAK T++P +GEAFNILRY+IGQ+Y S Sbjct: 123 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNS 182 Query: 220 HYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKP 41 HYDAFDP EYGPQKSQRVASFL+YL+D+EEGGETMFP+ENG+N D SYDY+KC+GL VKP Sbjct: 183 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 242 Query: 40 RQGDGLLFYSLLP 2 RQGDGLLFYSLLP Sbjct: 243 RQGDGLLFYSLLP 255 >ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera] gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera] Length = 284 Score = 335 bits (860), Expect = 9e-90 Identities = 173/257 (67%), Positives = 204/257 (79%) Frame = -3 Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593 MK K KG W KLGL +F+ S SQD+ R + R+LESV Sbjct: 1 MKGKGKGVWR--PKLGLLLLFISWSFFFLAGLFGSMLFSQDVNGVRSQP-RLLESV---- 53 Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413 EE+ +P+G+TG+SSV IP QVLSWKP A+YFPRFA+A+QC SIIEMAK +L+PSTLA Sbjct: 54 EEYSPMPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLAL 113 Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233 R+GET E+TKG RTSSGTFIS++EDKTGILDF+E+KIAKAT+IP SHGEAFNILRYEIGQ Sbjct: 114 RQGETDESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQ 173 Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53 RY SHYDAF+PAEYGPQ SQRVASFLLYLSDVEEGGETMFP+E+ LN+ + YDYKKC+GL Sbjct: 174 RYNSHYDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGL 233 Query: 52 TVKPRQGDGLLFYSLLP 2 VKP++GDGLLFYS+ P Sbjct: 234 KVKPQRGDGLLFYSVFP 250 >ref|XP_006412300.1| hypothetical protein EUTSA_v10025904mg [Eutrema salsugineum] gi|557113470|gb|ESQ53753.1| hypothetical protein EUTSA_v10025904mg [Eutrema salsugineum] Length = 288 Score = 335 bits (859), Expect = 1e-89 Identities = 165/244 (67%), Positives = 195/244 (79%) Frame = -3 Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554 KLGL V +FCS+ SQ++ RPR R+LE V++ +EE +P+G TG+ Sbjct: 12 KLGLATVIIFCSLCFLIGFYGSTLLSQNVPGVRPRL-RMLEMVENGEEEAGLMPHGVTGE 70 Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374 SV SIP QVLSWKP AIYFP FASA+QC +IIE AK+NLKPS LA RKGET E+TKG R Sbjct: 71 ESVGSIPFQVLSWKPRAIYFPNFASAEQCQTIIERAKINLKPSALALRKGETAESTKGTR 130 Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194 TSSGTFIS++E+ TG LDF+E+KIA+AT+IP +HGEAFNILRYE+GQ+Y SHYD F+P E Sbjct: 131 TSSGTFISASEESTGALDFVERKIARATMIPRTHGEAFNILRYELGQKYDSHYDVFNPTE 190 Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14 YGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG NM S YDYKKCVGL VKPR+GDGLLFY Sbjct: 191 YGPQPSQRIASFLLYLSDVEEGGETMFPFENGANMGSGYDYKKCVGLKVKPRRGDGLLFY 250 Query: 13 SLLP 2 S+ P Sbjct: 251 SVFP 254 >dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum] Length = 286 Score = 335 bits (858), Expect = 2e-89 Identities = 172/258 (66%), Positives = 200/258 (77%), Gaps = 1/258 (0%) Frame = -3 Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXS-QDITSARPRSSRILESVDDD 596 MK + + GW + LP VFL C S QD+ S RPR R LESV + Sbjct: 1 MKSRGRFGWW---SVRLPSVFLLCLFFFLLGFFGSALFSHQDVPSVRPRP-RFLESVYQE 56 Query: 595 DEEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLA 416 D F+ +P G+TG+ S+ SIP QVLSW P A+YFP FAS +QC SII+MAK N++PS+LA Sbjct: 57 D--FDPLPIGETGEHSLISIPFQVLSWFPRALYFPNFASIEQCQSIIKMAKANMEPSSLA 114 Query: 415 FRKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIG 236 R GET E TKGIRTSSGTFIS++EDKTGILD IE+KIAKAT+IP +HGEAFN+LRYEIG Sbjct: 115 LRTGETEETTKGIRTSSGTFISASEDKTGILDLIEEKIAKATMIPKTHGEAFNVLRYEIG 174 Query: 235 QRYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVG 56 QRY SHYDAFDPA+YGPQKSQR ASFLLYLSDVEEGGET+FPYENG NMD+SYD+ KC+G Sbjct: 175 QRYQSHYDAFDPAQYGPQKSQRAASFLLYLSDVEEGGETVFPYENGQNMDASYDFSKCIG 234 Query: 55 LTVKPRQGDGLLFYSLLP 2 L VKPR+GDGLLFYSL P Sbjct: 235 LKVKPRRGDGLLFYSLFP 252 >ref|XP_006364423.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform X1 [Solanum tuberosum] Length = 288 Score = 333 bits (854), Expect = 5e-89 Identities = 171/258 (66%), Positives = 202/258 (78%), Gaps = 1/258 (0%) Frame = -3 Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQ-DITSARPRSSRILESVDDD 596 MK + K +G LGLP VFL C SQ D+ + R SR+LESVD + Sbjct: 1 MKSRGKSVFGGWWNLGLPSVFLLCLFFFLLGLFASALFSQQDVPNVR---SRVLESVDLE 57 Query: 595 DEEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLA 416 ++F+A+P G TGD S SIP QVLSW P A+YFP FAS +QC II++AK +L+PS+LA Sbjct: 58 -KDFDALPTGVTGDDSFTSIPFQVLSWFPRALYFPNFASIEQCQGIIKIAKASLEPSSLA 116 Query: 415 FRKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIG 236 RKGET E TKGIRTSSGTFIS++EDKTGILD IE+KIA+AT+IP +HGEAFN+LRYEIG Sbjct: 117 LRKGETEETTKGIRTSSGTFISASEDKTGILDLIEEKIARATMIPKTHGEAFNVLRYEIG 176 Query: 235 QRYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVG 56 QRY SHYDAFDPA+YGPQKSQRVASFLLYLSDVEEGGET+FP+E+ NMD +YDY KC+G Sbjct: 177 QRYQSHYDAFDPAQYGPQKSQRVASFLLYLSDVEEGGETVFPFESAQNMDGNYDYSKCIG 236 Query: 55 LTVKPRQGDGLLFYSLLP 2 L VKPR+GDGLLFYSLLP Sbjct: 237 LKVKPRRGDGLLFYSLLP 254 >ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera] gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera] Length = 285 Score = 332 bits (852), Expect = 8e-89 Identities = 160/257 (62%), Positives = 198/257 (77%) Frame = -3 Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593 MK K+KG W GTKLGLP VFLFC Q+ +S+ PR R++ ++ Sbjct: 1 MKSKAKGKWRFGTKLGLPVVFLFCLFFFLAGLFGSGLLPQEFSSSEPR--RLIR----EE 54 Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413 +++ + +G++G+ SV SIP QVLSW+P A+YFP FA+++QC SII MAK NL PST+A Sbjct: 55 TDYDPLAHGESGEDSVTSIPFQVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVAL 114 Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233 R GE NT+GIRTSSG FIS++EDKTG LD IEQKIA+ +IP +HGEAFN+LRYEIGQ Sbjct: 115 RVGEIRGNTEGIRTSSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQ 174 Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53 RY SHYDAFDPAEYGPQKS R+A+FL+YLSDVEEGGETMFP+ENGLNMD YD+++C+GL Sbjct: 175 RYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETMFPFENGLNMDKDYDFQRCIGL 234 Query: 52 TVKPRQGDGLLFYSLLP 2 VKP QGDGLLFYS+ P Sbjct: 235 KVKPHQGDGLLFYSMFP 251 >ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata] gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata] Length = 288 Score = 331 bits (849), Expect = 2e-88 Identities = 161/244 (65%), Positives = 196/244 (80%) Frame = -3 Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554 KLGL V +FCS+ SQ++ +PR R+LE V++ +E+ ++P+G TG+ Sbjct: 12 KLGLATVIVFCSLCFLVGFYGSTLLSQNVPRVKPRL-RMLEMVENGEEDTGSMPHGVTGE 70 Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374 SV SIP QVLSW+P AIYFP FA+A+QC +IIE AKVNLKPS LA RKGET ENTKG R Sbjct: 71 ESVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTR 130 Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194 TSSGTFIS++ED TG LDF+E+KIA+AT+IP SHGE+FNILRYE+GQ+Y SHYD F+P E Sbjct: 131 TSSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTE 190 Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14 YGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG NM + YDYK+C+GL VKPR+GDGLLFY Sbjct: 191 YGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFY 250 Query: 13 SLLP 2 S+ P Sbjct: 251 SVFP 254 >ref|XP_006486947.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Citrus sinensis] Length = 286 Score = 330 bits (846), Expect = 4e-88 Identities = 168/257 (65%), Positives = 200/257 (77%) Frame = -3 Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593 MK K+K TKLGLP L CS S+D+ S RP+ R LE V+ ++ Sbjct: 1 MKGKAKRS---STKLGLPTALLLCSFFFLAGFYGSTLLSRDVPSIRPKL-RTLEVVEKEN 56 Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413 E +P+G+TGD+S+ SIP QVLSW+P A+YFP FASA+QC SII AK LKPS LA Sbjct: 57 ES--GLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114 Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233 R+GET E+TKG RTSSGTFIS++EDKTGIL+ IE KIA+AT++P +HGEAFN+LRYEIGQ Sbjct: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174 Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53 +Y SHYDAF+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG+ +DS YDYKKCVGL Sbjct: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCVGL 234 Query: 52 TVKPRQGDGLLFYSLLP 2 VKPR+GDGLLFYSL P Sbjct: 235 KVKPRRGDGLLFYSLFP 251 >ref|XP_006284244.1| hypothetical protein CARUB_v10005407mg [Capsella rubella] gi|565444836|ref|XP_006284245.1| hypothetical protein CARUB_v10005407mg [Capsella rubella] gi|482552949|gb|EOA17142.1| hypothetical protein CARUB_v10005407mg [Capsella rubella] gi|482552950|gb|EOA17143.1| hypothetical protein CARUB_v10005407mg [Capsella rubella] Length = 288 Score = 330 bits (845), Expect = 5e-88 Identities = 158/244 (64%), Positives = 195/244 (79%) Frame = -3 Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554 KLGL V +FCS+ SQD+ +PR R+LE ++D +E ++P+G TGD Sbjct: 12 KLGLATVIVFCSLCFLVGFYGSTLLSQDVPRVKPRL-RMLEVMEDGGDEAASMPHGVTGD 70 Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374 SV SIP QVLSW+P AIYFP FA+A+QC +II+ AK+NLKPS LA RKGET ENTKG R Sbjct: 71 ESVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIDRAKINLKPSALALRKGETAENTKGTR 130 Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194 TSSGTF+S++E+ TG LDF+E+KIA+AT+IP +HGE+FNILRYE+GQ+Y SHYD F+P E Sbjct: 131 TSSGTFVSASEESTGALDFVEKKIARATMIPRTHGESFNILRYELGQKYDSHYDVFNPTE 190 Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14 YGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG NM + YDYK+C+GL VKPR+GDGLLFY Sbjct: 191 YGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFY 250 Query: 13 SLLP 2 S+ P Sbjct: 251 SVFP 254 >ref|XP_004233448.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum lycopersicum] Length = 288 Score = 330 bits (845), Expect = 5e-88 Identities = 170/258 (65%), Positives = 201/258 (77%), Gaps = 1/258 (0%) Frame = -3 Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQ-DITSARPRSSRILESVDDD 596 MK + K G LGLP VFL C SQ D+ + R SR+LESVD + Sbjct: 1 MKSRGKSVIGGWWNLGLPSVFLLCLFFFLLGLFASVLFSQQDVPNVR---SRVLESVDLE 57 Query: 595 DEEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLA 416 ++F+ +P G +GD S SIP QVLSW P A+YFP FAS +QC SII++AK +L+PS+LA Sbjct: 58 -KDFDPLPTGVSGDDSFTSIPFQVLSWFPRALYFPNFASIEQCQSIIKIAKTSLEPSSLA 116 Query: 415 FRKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIG 236 RKGET E TKGIRTSSGTFIS++EDKTGILD IE+KIA+AT+IP +HGEAFN+LRYEIG Sbjct: 117 LRKGETEETTKGIRTSSGTFISASEDKTGILDLIEEKIARATMIPKTHGEAFNVLRYEIG 176 Query: 235 QRYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVG 56 QRY SHYDAFDPA+YGPQKSQRVASFLLYLSDVEEGGET+FP+E+ NMD +YDY KC+G Sbjct: 177 QRYQSHYDAFDPAQYGPQKSQRVASFLLYLSDVEEGGETVFPFESAQNMDGTYDYSKCIG 236 Query: 55 LTVKPRQGDGLLFYSLLP 2 L VKPR+GDGLLFYSLLP Sbjct: 237 LKVKPRRGDGLLFYSLLP 254 >gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana] Length = 288 Score = 329 bits (844), Expect = 7e-88 Identities = 160/244 (65%), Positives = 195/244 (79%) Frame = -3 Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554 KLGL V +FCS+ SQ++ +PR R+LE V++ +EE ++P+G TG+ Sbjct: 12 KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRL-RMLEMVENGEEEAGSMPHGVTGE 70 Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374 S+ SIP QVLSW+P AIYFP FA+A+QC +IIE AKVNLKPS LA RKGET ENTKG R Sbjct: 71 ESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTR 130 Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194 TSSGTFIS++E+ TG LDF+E+KIA+AT+IP SHGE+FNILRYE+GQ+Y SHYD F+P E Sbjct: 131 TSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTE 190 Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14 YGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG NM YDYK+C+GL VKPR+GDGLLFY Sbjct: 191 YGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFY 250 Query: 13 SLLP 2 S+ P Sbjct: 251 SVFP 254 >ref|XP_006422858.1| hypothetical protein CICLE_v10028993mg [Citrus clementina] gi|557524792|gb|ESR36098.1| hypothetical protein CICLE_v10028993mg [Citrus clementina] Length = 286 Score = 329 bits (843), Expect = 9e-88 Identities = 167/257 (64%), Positives = 200/257 (77%) Frame = -3 Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593 MK K+K TKLGLP L CS S+D+ S RP+ R LE V+ ++ Sbjct: 1 MKGKAKRS---STKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKL-RTLEVVEKEN 56 Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413 E +P+G+TGD+S+ SIP QVLSW+P A+YFP FASA+QC SII AK LKPS LA Sbjct: 57 ES--GLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114 Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233 R+GET E+TKG RTSSGTFIS++EDKTGIL+ IE KIA+AT++P +HGEAFN+LRYEIGQ Sbjct: 115 RQGETVESTKGTRTSSGTFISASEDKTGILESIEHKIARATMLPQTHGEAFNVLRYEIGQ 174 Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53 +Y SHYDAF+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG+ +DS YDYKKC+GL Sbjct: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234 Query: 52 TVKPRQGDGLLFYSLLP 2 VKPR+GDGLLFYSL P Sbjct: 235 KVKPRRGDGLLFYSLFP 251 >ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana] gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana] gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana] gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana] gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana] Length = 288 Score = 329 bits (843), Expect = 9e-88 Identities = 159/244 (65%), Positives = 195/244 (79%) Frame = -3 Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554 KLGL V +FCS+ SQ++ +PR R+L+ V++ +EE ++P+G TG+ Sbjct: 12 KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRL-RMLDMVENGEEEASSMPHGVTGE 70 Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374 S+ SIP QVLSW+P AIYFP FA+A+QC +IIE AKVNLKPS LA RKGET ENTKG R Sbjct: 71 ESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTR 130 Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194 TSSGTFIS++E+ TG LDF+E+KIA+AT+IP SHGE+FNILRYE+GQ+Y SHYD F+P E Sbjct: 131 TSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTE 190 Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14 YGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG NM YDYK+C+GL VKPR+GDGLLFY Sbjct: 191 YGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFY 250 Query: 13 SLLP 2 S+ P Sbjct: 251 SVFP 254 >gb|EXB76669.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notabilis] Length = 286 Score = 328 bits (842), Expect = 1e-87 Identities = 165/245 (67%), Positives = 193/245 (78%) Frame = -3 Query: 736 TKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTG 557 TK G P FL C + SQD+ RP SR+LESV +D + +P+G+TG Sbjct: 10 TKFGPPTAFLLCFLSFLAGFFFSNLLSQDVPGVRP-GSRVLESVGNDGGG-DLMPFGETG 67 Query: 556 DSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGI 377 DSS IP QVLSWKP A+YFP FA+A+QC SIIEMAK NL PSTLA RKGET E+TKG Sbjct: 68 DSSFQVIPFQVLSWKPRALYFPGFATAEQCQSIIEMAKSNLMPSTLALRKGETDESTKGT 127 Query: 376 RTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPA 197 RTSSGTFIS++EDKTGILD IEQKIA+AT++P +HGEAFNILRY IGQ+Y SHYDAF+PA Sbjct: 128 RTSSGTFISASEDKTGILDAIEQKIARATMLPTTHGEAFNILRYNIGQKYDSHYDAFNPA 187 Query: 196 EYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLF 17 EYGPQKSQR+ASFLLYLSDV+EGGETMFP+ENG +D +DY+KC GL VKPR+GDGLLF Sbjct: 188 EYGPQKSQRIASFLLYLSDVDEGGETMFPFENGEKIDMGFDYRKCTGLKVKPRRGDGLLF 247 Query: 16 YSLLP 2 YS+ P Sbjct: 248 YSVFP 252 >ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis] gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis] Length = 290 Score = 328 bits (842), Expect = 1e-87 Identities = 166/258 (64%), Positives = 201/258 (77%), Gaps = 1/258 (0%) Frame = -3 Query: 772 MKPK-SKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDD 596 MK K SKG W +KLGLP VFL C SQ++ + R R L+ V ++ Sbjct: 1 MKAKGSKGKWSIKSKLGLPVVFLSCLFFFLAGLFASNLISQNVNGDKNR--RQLQWVKEE 58 Query: 595 DEEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLA 416 E++ +P G TGD + IP QVLSWKP A+YFP FA+A+QC S+I MAK NL PSTLA Sbjct: 59 IIEYDLLPSGDTGDDYLTVIPFQVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLA 118 Query: 415 FRKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIG 236 RKGET ENTKGIRTSSG F+S++EDKTG+LD IE+KIA+AT++P ++GEAFNILRYEIG Sbjct: 119 LRKGETEENTKGIRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIG 178 Query: 235 QRYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVG 56 Q+Y SHYDAF+PAEYGPQKSQRVASFLLYLSDVEEGGETMFP+EN L++D SYD++KC+G Sbjct: 179 QKYNSHYDAFNPAEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVDESYDFEKCIG 238 Query: 55 LTVKPRQGDGLLFYSLLP 2 L V+PR+GDGLLFYSL P Sbjct: 239 LQVRPRRGDGLLFYSLFP 256 >ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis] gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis] Length = 286 Score = 327 bits (838), Expect = 3e-87 Identities = 166/244 (68%), Positives = 194/244 (79%) Frame = -3 Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554 KLGLP V L CSV SQD+ +PR R+LE D+ E+ +A+P G TG+ Sbjct: 12 KLGLPAVILVCSVFFVAGFYASTLISQDVPVIKPRL-RMLEVTDE--EKHQAMPRGVTGE 68 Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374 S + SIP QVLSWKP A+YFP FA+ +QC +IIEMAK+ LKPS LA RKGET E+TKG R Sbjct: 69 SYIESIPFQVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGETAESTKGTR 128 Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194 TSSGTF+S++ED TG LDFIE KIA+AT+IP SHGEAFNILRYEIGQ+Y SHYD+F+PAE Sbjct: 129 TSSGTFLSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPAE 188 Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14 YGPQ SQRVASFLLYLSDVE+GGETMFP+ENG+ + S YDYKKC GL VKPRQGDG+LFY Sbjct: 189 YGPQMSQRVASFLLYLSDVEKGGETMFPFENGVKISSVYDYKKCAGLKVKPRQGDGILFY 248 Query: 13 SLLP 2 SLLP Sbjct: 249 SLLP 252 >emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera] Length = 276 Score = 327 bits (838), Expect = 3e-87 Identities = 171/257 (66%), Positives = 200/257 (77%) Frame = -3 Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593 MK K KG W KLGL +F+ S SQ R+LESV Sbjct: 1 MKGKGKGVWR--PKLGLLLLFISWSFFFLAGLFGSMLFSQP---------RLLESV---- 45 Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413 EE+ +P+G+TG+SSV IP QVLSWKP A+YFPRFA+A+QC SIIEMAK +L+PSTLA Sbjct: 46 EEYSPMPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLAL 105 Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233 R+GET E+TKG RTSSGTFIS++EDKTGILDF+E+KIAKAT+IP SHGEAFNILRYEIGQ Sbjct: 106 RQGETDESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQ 165 Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53 RY SHYDAF+PAEYGPQ SQRVASFLLYLSDVEEGGETMFP+E+ LN+ + YDYKKC+GL Sbjct: 166 RYNSHYDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGL 225 Query: 52 TVKPRQGDGLLFYSLLP 2 VKP++GDGLLFYS+ P Sbjct: 226 KVKPQRGDGLLFYSVFP 242 >ref|XP_007198985.1| hypothetical protein PRUPE_ppa023428mg [Prunus persica] gi|462394385|gb|EMJ00184.1| hypothetical protein PRUPE_ppa023428mg [Prunus persica] Length = 290 Score = 324 bits (830), Expect = 3e-86 Identities = 167/261 (63%), Positives = 197/261 (75%), Gaps = 4/261 (1%) Frame = -3 Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCS----VXXXXXXXXXXXXSQDITSARPRSSRILESV 605 MK K+K K GLP VFL CS V S SR L+S Sbjct: 1 MKVKAKSP---KAKFGLPAVFLLCSLFFFVGLFTFTLLSHVSFPQFLSGPRSVSRTLQS- 56 Query: 604 DDDDEEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPS 425 +D E+ +P G+TGDS + SIP QVLSWKP A+YFPRFA+A+QC S+IEMAK L+PS Sbjct: 57 -EDGEDHGPMPQGETGDSFIQSIPFQVLSWKPRALYFPRFATAEQCESVIEMAKTKLRPS 115 Query: 424 TLAFRKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRY 245 LA RKGETTE+TKG RTSSGTFIS++ED+TGIL+ IE+KIA+AT++P +HGEAFN+LRY Sbjct: 116 ALALRKGETTESTKGTRTSSGTFISASEDETGILEIIEEKIARATMLPRTHGEAFNVLRY 175 Query: 244 EIGQRYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKK 65 EIGQ+Y SHYDAF+P+EYG QKSQR ASFLLYLSDVEEGGETMFP+ENGL+M SYDYKK Sbjct: 176 EIGQKYDSHYDAFNPSEYGQQKSQRFASFLLYLSDVEEGGETMFPFENGLHMGMSYDYKK 235 Query: 64 CVGLTVKPRQGDGLLFYSLLP 2 C+GL V PRQGDGLLFYS+LP Sbjct: 236 CIGLKVMPRQGDGLLFYSVLP 256