BLASTX nr result

ID: Sinomenium21_contig00022751 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00022751
         (819 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006841992.1| hypothetical protein AMTR_s00144p00069890 [A...   342   7e-92
ref|XP_006466497.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   341   2e-91
ref|XP_006426046.1| hypothetical protein CICLE_v10027070mg [Citr...   339   8e-91
ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   335   9e-90
ref|XP_006412300.1| hypothetical protein EUTSA_v10025904mg [Eutr...   335   1e-89
dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]              335   2e-89
ref|XP_006364423.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   333   5e-89
ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   332   8e-89
ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. ly...   331   2e-88
ref|XP_006486947.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   330   4e-88
ref|XP_006284244.1| hypothetical protein CARUB_v10005407mg [Caps...   330   5e-88
ref|XP_004233448.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   330   5e-88
gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]      329   7e-88
ref|XP_006422858.1| hypothetical protein CICLE_v10028993mg [Citr...   329   9e-88
ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family pro...   329   9e-88
gb|EXB76669.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notab...   328   1e-87
ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative...   328   1e-87
ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative...   327   3e-87
emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]   327   3e-87
ref|XP_007198985.1| hypothetical protein PRUPE_ppa023428mg [Prun...   324   3e-86

>ref|XP_006841992.1| hypothetical protein AMTR_s00144p00069890 [Amborella trichopoda]
           gi|548844029|gb|ERN03667.1| hypothetical protein
           AMTR_s00144p00069890 [Amborella trichopoda]
          Length = 271

 Score =  342 bits (878), Expect = 7e-92
 Identities = 167/257 (64%), Positives = 206/257 (80%)
 Frame = -3

Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593
           M+ K KG WG  +KL  P VFL CSV            S+DI   + R+ R+ E+ D+  
Sbjct: 1   MRGKGKGSWGLSSKLEFPTVFLLCSVFFLAGIFGTMILSRDIRD-QSRTRRLQEAFDEFS 59

Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413
           E+F  +P+G+TGDSS+ +IP QVLSWKP A+YFP+FA+ +QC  +I++A+ +L PS+LAF
Sbjct: 60  EDFTTMPHGETGDSSIDTIPFQVLSWKPRALYFPKFATKEQCQHVIKIAQSSLVPSSLAF 119

Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233
           R+GET ENTKGIRTSSGTF+S++EDK+GILD IE+KIAK T++P +HGEAFNILRYEIGQ
Sbjct: 120 REGETKENTKGIRTSSGTFVSASEDKSGILDLIEEKIAKVTMLPRTHGEAFNILRYEIGQ 179

Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53
           RY SH+DAF+PAEYGPQKSQRVASFLLYLSDVE GGETMFP+ENGLN+ S YDYKKCVGL
Sbjct: 180 RYHSHFDAFNPAEYGPQKSQRVASFLLYLSDVEGGGETMFPFENGLNVHSEYDYKKCVGL 239

Query: 52  TVKPRQGDGLLFYSLLP 2
            VKP+ GDGLLFYS+ P
Sbjct: 240 KVKPQLGDGLLFYSVYP 256


>ref|XP_006466497.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Citrus
           sinensis]
          Length = 289

 Score =  341 bits (875), Expect = 2e-91
 Identities = 167/253 (66%), Positives = 203/253 (80%)
 Frame = -3

Query: 760 SKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFE 581
           +K  W   +K+ LPFVFL C              SQD+T+ARP S+R++ESV D   E++
Sbjct: 7   NKANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTAARP-SARVVESVKD---EYK 62

Query: 580 AIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGE 401
            +P+G+ GD SV +IP QVLSW P A+YFP FA+ +QC SII MAK+NL+PSTLA RKGE
Sbjct: 63  WMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE 122

Query: 400 TTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLS 221
           T +NT+GIRTSSG FIS+AED++G LD IE+KIAK T++P  HGEAFNILRY+IGQ+Y S
Sbjct: 123 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRIHGEAFNILRYKIGQKYNS 182

Query: 220 HYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKP 41
           HYDAFDP EYGPQKSQRVASFL+YL+D+EEGGETMFP+ENG+N D SYDY+KC+GL VKP
Sbjct: 183 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 242

Query: 40  RQGDGLLFYSLLP 2
           RQGDGLLFYSLLP
Sbjct: 243 RQGDGLLFYSLLP 255


>ref|XP_006426046.1| hypothetical protein CICLE_v10027070mg [Citrus clementina]
           gi|557528036|gb|ESR39286.1| hypothetical protein
           CICLE_v10027070mg [Citrus clementina]
          Length = 289

 Score =  339 bits (869), Expect = 8e-91
 Identities = 167/253 (66%), Positives = 202/253 (79%)
 Frame = -3

Query: 760 SKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFE 581
           +K  W   +K+ LPFVFL C              SQD+T ARP S+R++ESV D   E+E
Sbjct: 7   NKANWSLKSKIELPFVFLACLFFFLAGLLGSSLLSQDVTVARP-SARVVESVKD---EYE 62

Query: 580 AIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGE 401
            +P+G+ GD SV +IP QVLSW P A+YFP FA+ +QC SII MAK+NL+PSTLA RKGE
Sbjct: 63  WMPHGQAGDDSVTNIPFQVLSWMPRALYFPNFATPEQCKSIINMAKLNLRPSTLALRKGE 122

Query: 400 TTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLS 221
           T +NT+GIRTSSG FIS+AED++G LD IE+KIAK T++P  +GEAFNILRY+IGQ+Y S
Sbjct: 123 TVDNTQGIRTSSGVFISAAEDESGTLDLIEEKIAKVTMLPRINGEAFNILRYKIGQKYNS 182

Query: 220 HYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKP 41
           HYDAFDP EYGPQKSQRVASFL+YL+D+EEGGETMFP+ENG+N D SYDY+KC+GL VKP
Sbjct: 183 HYDAFDPQEYGPQKSQRVASFLVYLTDLEEGGETMFPFENGMNADGSYDYQKCIGLKVKP 242

Query: 40  RQGDGLLFYSLLP 2
           RQGDGLLFYSLLP
Sbjct: 243 RQGDGLLFYSLLP 255


>ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
           gi|296083079|emb|CBI22483.3| unnamed protein product
           [Vitis vinifera]
          Length = 284

 Score =  335 bits (860), Expect = 9e-90
 Identities = 173/257 (67%), Positives = 204/257 (79%)
 Frame = -3

Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593
           MK K KG W    KLGL  +F+  S             SQD+   R +  R+LESV    
Sbjct: 1   MKGKGKGVWR--PKLGLLLLFISWSFFFLAGLFGSMLFSQDVNGVRSQP-RLLESV---- 53

Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413
           EE+  +P+G+TG+SSV  IP QVLSWKP A+YFPRFA+A+QC SIIEMAK +L+PSTLA 
Sbjct: 54  EEYSPMPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLAL 113

Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233
           R+GET E+TKG RTSSGTFIS++EDKTGILDF+E+KIAKAT+IP SHGEAFNILRYEIGQ
Sbjct: 114 RQGETDESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQ 173

Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53
           RY SHYDAF+PAEYGPQ SQRVASFLLYLSDVEEGGETMFP+E+ LN+ + YDYKKC+GL
Sbjct: 174 RYNSHYDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGL 233

Query: 52  TVKPRQGDGLLFYSLLP 2
            VKP++GDGLLFYS+ P
Sbjct: 234 KVKPQRGDGLLFYSVFP 250


>ref|XP_006412300.1| hypothetical protein EUTSA_v10025904mg [Eutrema salsugineum]
           gi|557113470|gb|ESQ53753.1| hypothetical protein
           EUTSA_v10025904mg [Eutrema salsugineum]
          Length = 288

 Score =  335 bits (859), Expect = 1e-89
 Identities = 165/244 (67%), Positives = 195/244 (79%)
 Frame = -3

Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554
           KLGL  V +FCS+            SQ++   RPR  R+LE V++ +EE   +P+G TG+
Sbjct: 12  KLGLATVIIFCSLCFLIGFYGSTLLSQNVPGVRPRL-RMLEMVENGEEEAGLMPHGVTGE 70

Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374
            SV SIP QVLSWKP AIYFP FASA+QC +IIE AK+NLKPS LA RKGET E+TKG R
Sbjct: 71  ESVGSIPFQVLSWKPRAIYFPNFASAEQCQTIIERAKINLKPSALALRKGETAESTKGTR 130

Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194
           TSSGTFIS++E+ TG LDF+E+KIA+AT+IP +HGEAFNILRYE+GQ+Y SHYD F+P E
Sbjct: 131 TSSGTFISASEESTGALDFVERKIARATMIPRTHGEAFNILRYELGQKYDSHYDVFNPTE 190

Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14
           YGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG NM S YDYKKCVGL VKPR+GDGLLFY
Sbjct: 191 YGPQPSQRIASFLLYLSDVEEGGETMFPFENGANMGSGYDYKKCVGLKVKPRRGDGLLFY 250

Query: 13  SLLP 2
           S+ P
Sbjct: 251 SVFP 254


>dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
          Length = 286

 Score =  335 bits (858), Expect = 2e-89
 Identities = 172/258 (66%), Positives = 200/258 (77%), Gaps = 1/258 (0%)
 Frame = -3

Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXS-QDITSARPRSSRILESVDDD 596
           MK + + GW     + LP VFL C              S QD+ S RPR  R LESV  +
Sbjct: 1   MKSRGRFGWW---SVRLPSVFLLCLFFFLLGFFGSALFSHQDVPSVRPRP-RFLESVYQE 56

Query: 595 DEEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLA 416
           D  F+ +P G+TG+ S+ SIP QVLSW P A+YFP FAS +QC SII+MAK N++PS+LA
Sbjct: 57  D--FDPLPIGETGEHSLISIPFQVLSWFPRALYFPNFASIEQCQSIIKMAKANMEPSSLA 114

Query: 415 FRKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIG 236
            R GET E TKGIRTSSGTFIS++EDKTGILD IE+KIAKAT+IP +HGEAFN+LRYEIG
Sbjct: 115 LRTGETEETTKGIRTSSGTFISASEDKTGILDLIEEKIAKATMIPKTHGEAFNVLRYEIG 174

Query: 235 QRYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVG 56
           QRY SHYDAFDPA+YGPQKSQR ASFLLYLSDVEEGGET+FPYENG NMD+SYD+ KC+G
Sbjct: 175 QRYQSHYDAFDPAQYGPQKSQRAASFLLYLSDVEEGGETVFPYENGQNMDASYDFSKCIG 234

Query: 55  LTVKPRQGDGLLFYSLLP 2
           L VKPR+GDGLLFYSL P
Sbjct: 235 LKVKPRRGDGLLFYSLFP 252


>ref|XP_006364423.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform X1
           [Solanum tuberosum]
          Length = 288

 Score =  333 bits (854), Expect = 5e-89
 Identities = 171/258 (66%), Positives = 202/258 (78%), Gaps = 1/258 (0%)
 Frame = -3

Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQ-DITSARPRSSRILESVDDD 596
           MK + K  +G    LGLP VFL C              SQ D+ + R   SR+LESVD +
Sbjct: 1   MKSRGKSVFGGWWNLGLPSVFLLCLFFFLLGLFASALFSQQDVPNVR---SRVLESVDLE 57

Query: 595 DEEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLA 416
            ++F+A+P G TGD S  SIP QVLSW P A+YFP FAS +QC  II++AK +L+PS+LA
Sbjct: 58  -KDFDALPTGVTGDDSFTSIPFQVLSWFPRALYFPNFASIEQCQGIIKIAKASLEPSSLA 116

Query: 415 FRKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIG 236
            RKGET E TKGIRTSSGTFIS++EDKTGILD IE+KIA+AT+IP +HGEAFN+LRYEIG
Sbjct: 117 LRKGETEETTKGIRTSSGTFISASEDKTGILDLIEEKIARATMIPKTHGEAFNVLRYEIG 176

Query: 235 QRYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVG 56
           QRY SHYDAFDPA+YGPQKSQRVASFLLYLSDVEEGGET+FP+E+  NMD +YDY KC+G
Sbjct: 177 QRYQSHYDAFDPAQYGPQKSQRVASFLLYLSDVEEGGETVFPFESAQNMDGNYDYSKCIG 236

Query: 55  LTVKPRQGDGLLFYSLLP 2
           L VKPR+GDGLLFYSLLP
Sbjct: 237 LKVKPRRGDGLLFYSLLP 254


>ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
           gi|296087348|emb|CBI33722.3| unnamed protein product
           [Vitis vinifera]
          Length = 285

 Score =  332 bits (852), Expect = 8e-89
 Identities = 160/257 (62%), Positives = 198/257 (77%)
 Frame = -3

Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593
           MK K+KG W  GTKLGLP VFLFC               Q+ +S+ PR  R++     ++
Sbjct: 1   MKSKAKGKWRFGTKLGLPVVFLFCLFFFLAGLFGSGLLPQEFSSSEPR--RLIR----EE 54

Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413
            +++ + +G++G+ SV SIP QVLSW+P A+YFP FA+++QC SII MAK NL PST+A 
Sbjct: 55  TDYDPLAHGESGEDSVTSIPFQVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVAL 114

Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233
           R GE   NT+GIRTSSG FIS++EDKTG LD IEQKIA+  +IP +HGEAFN+LRYEIGQ
Sbjct: 115 RVGEIRGNTEGIRTSSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQ 174

Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53
           RY SHYDAFDPAEYGPQKS R+A+FL+YLSDVEEGGETMFP+ENGLNMD  YD+++C+GL
Sbjct: 175 RYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETMFPFENGLNMDKDYDFQRCIGL 234

Query: 52  TVKPRQGDGLLFYSLLP 2
            VKP QGDGLLFYS+ P
Sbjct: 235 KVKPHQGDGLLFYSMFP 251


>ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
           gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis
           lyrata subsp. lyrata]
          Length = 288

 Score =  331 bits (849), Expect = 2e-88
 Identities = 161/244 (65%), Positives = 196/244 (80%)
 Frame = -3

Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554
           KLGL  V +FCS+            SQ++   +PR  R+LE V++ +E+  ++P+G TG+
Sbjct: 12  KLGLATVIVFCSLCFLVGFYGSTLLSQNVPRVKPRL-RMLEMVENGEEDTGSMPHGVTGE 70

Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374
            SV SIP QVLSW+P AIYFP FA+A+QC +IIE AKVNLKPS LA RKGET ENTKG R
Sbjct: 71  ESVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTR 130

Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194
           TSSGTFIS++ED TG LDF+E+KIA+AT+IP SHGE+FNILRYE+GQ+Y SHYD F+P E
Sbjct: 131 TSSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTE 190

Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14
           YGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG NM + YDYK+C+GL VKPR+GDGLLFY
Sbjct: 191 YGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFY 250

Query: 13  SLLP 2
           S+ P
Sbjct: 251 SVFP 254


>ref|XP_006486947.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Citrus
           sinensis]
          Length = 286

 Score =  330 bits (846), Expect = 4e-88
 Identities = 168/257 (65%), Positives = 200/257 (77%)
 Frame = -3

Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593
           MK K+K      TKLGLP   L CS             S+D+ S RP+  R LE V+ ++
Sbjct: 1   MKGKAKRS---STKLGLPTALLLCSFFFLAGFYGSTLLSRDVPSIRPKL-RTLEVVEKEN 56

Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413
           E    +P+G+TGD+S+ SIP QVLSW+P A+YFP FASA+QC SII  AK  LKPS LA 
Sbjct: 57  ES--GLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114

Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233
           R+GET E+TKG RTSSGTFIS++EDKTGIL+ IE KIA+AT++P +HGEAFN+LRYEIGQ
Sbjct: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174

Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53
           +Y SHYDAF+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG+ +DS YDYKKCVGL
Sbjct: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCVGL 234

Query: 52  TVKPRQGDGLLFYSLLP 2
            VKPR+GDGLLFYSL P
Sbjct: 235 KVKPRRGDGLLFYSLFP 251


>ref|XP_006284244.1| hypothetical protein CARUB_v10005407mg [Capsella rubella]
           gi|565444836|ref|XP_006284245.1| hypothetical protein
           CARUB_v10005407mg [Capsella rubella]
           gi|482552949|gb|EOA17142.1| hypothetical protein
           CARUB_v10005407mg [Capsella rubella]
           gi|482552950|gb|EOA17143.1| hypothetical protein
           CARUB_v10005407mg [Capsella rubella]
          Length = 288

 Score =  330 bits (845), Expect = 5e-88
 Identities = 158/244 (64%), Positives = 195/244 (79%)
 Frame = -3

Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554
           KLGL  V +FCS+            SQD+   +PR  R+LE ++D  +E  ++P+G TGD
Sbjct: 12  KLGLATVIVFCSLCFLVGFYGSTLLSQDVPRVKPRL-RMLEVMEDGGDEAASMPHGVTGD 70

Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374
            SV SIP QVLSW+P AIYFP FA+A+QC +II+ AK+NLKPS LA RKGET ENTKG R
Sbjct: 71  ESVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIDRAKINLKPSALALRKGETAENTKGTR 130

Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194
           TSSGTF+S++E+ TG LDF+E+KIA+AT+IP +HGE+FNILRYE+GQ+Y SHYD F+P E
Sbjct: 131 TSSGTFVSASEESTGALDFVEKKIARATMIPRTHGESFNILRYELGQKYDSHYDVFNPTE 190

Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14
           YGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG NM + YDYK+C+GL VKPR+GDGLLFY
Sbjct: 191 YGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFY 250

Query: 13  SLLP 2
           S+ P
Sbjct: 251 SVFP 254


>ref|XP_004233448.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum
           lycopersicum]
          Length = 288

 Score =  330 bits (845), Expect = 5e-88
 Identities = 170/258 (65%), Positives = 201/258 (77%), Gaps = 1/258 (0%)
 Frame = -3

Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQ-DITSARPRSSRILESVDDD 596
           MK + K   G    LGLP VFL C              SQ D+ + R   SR+LESVD +
Sbjct: 1   MKSRGKSVIGGWWNLGLPSVFLLCLFFFLLGLFASVLFSQQDVPNVR---SRVLESVDLE 57

Query: 595 DEEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLA 416
            ++F+ +P G +GD S  SIP QVLSW P A+YFP FAS +QC SII++AK +L+PS+LA
Sbjct: 58  -KDFDPLPTGVSGDDSFTSIPFQVLSWFPRALYFPNFASIEQCQSIIKIAKTSLEPSSLA 116

Query: 415 FRKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIG 236
            RKGET E TKGIRTSSGTFIS++EDKTGILD IE+KIA+AT+IP +HGEAFN+LRYEIG
Sbjct: 117 LRKGETEETTKGIRTSSGTFISASEDKTGILDLIEEKIARATMIPKTHGEAFNVLRYEIG 176

Query: 235 QRYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVG 56
           QRY SHYDAFDPA+YGPQKSQRVASFLLYLSDVEEGGET+FP+E+  NMD +YDY KC+G
Sbjct: 177 QRYQSHYDAFDPAQYGPQKSQRVASFLLYLSDVEEGGETVFPFESAQNMDGTYDYSKCIG 236

Query: 55  LTVKPRQGDGLLFYSLLP 2
           L VKPR+GDGLLFYSLLP
Sbjct: 237 LKVKPRRGDGLLFYSLLP 254


>gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
          Length = 288

 Score =  329 bits (844), Expect = 7e-88
 Identities = 160/244 (65%), Positives = 195/244 (79%)
 Frame = -3

Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554
           KLGL  V +FCS+            SQ++   +PR  R+LE V++ +EE  ++P+G TG+
Sbjct: 12  KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRL-RMLEMVENGEEEAGSMPHGVTGE 70

Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374
            S+ SIP QVLSW+P AIYFP FA+A+QC +IIE AKVNLKPS LA RKGET ENTKG R
Sbjct: 71  ESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTR 130

Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194
           TSSGTFIS++E+ TG LDF+E+KIA+AT+IP SHGE+FNILRYE+GQ+Y SHYD F+P E
Sbjct: 131 TSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTE 190

Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14
           YGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG NM   YDYK+C+GL VKPR+GDGLLFY
Sbjct: 191 YGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFY 250

Query: 13  SLLP 2
           S+ P
Sbjct: 251 SVFP 254


>ref|XP_006422858.1| hypothetical protein CICLE_v10028993mg [Citrus clementina]
           gi|557524792|gb|ESR36098.1| hypothetical protein
           CICLE_v10028993mg [Citrus clementina]
          Length = 286

 Score =  329 bits (843), Expect = 9e-88
 Identities = 167/257 (64%), Positives = 200/257 (77%)
 Frame = -3

Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593
           MK K+K      TKLGLP   L CS             S+D+ S RP+  R LE V+ ++
Sbjct: 1   MKGKAKRS---STKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKL-RTLEVVEKEN 56

Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413
           E    +P+G+TGD+S+ SIP QVLSW+P A+YFP FASA+QC SII  AK  LKPS LA 
Sbjct: 57  ES--GLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114

Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233
           R+GET E+TKG RTSSGTFIS++EDKTGIL+ IE KIA+AT++P +HGEAFN+LRYEIGQ
Sbjct: 115 RQGETVESTKGTRTSSGTFISASEDKTGILESIEHKIARATMLPQTHGEAFNVLRYEIGQ 174

Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53
           +Y SHYDAF+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG+ +DS YDYKKC+GL
Sbjct: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234

Query: 52  TVKPRQGDGLLFYSLLP 2
            VKPR+GDGLLFYSL P
Sbjct: 235 KVKPRRGDGLLFYSLFP 251


>ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana] gi|17381226|gb|AAL36425.1| unknown protein
           [Arabidopsis thaliana] gi|20465827|gb|AAM20018.1|
           unknown protein [Arabidopsis thaliana]
           gi|21592377|gb|AAM64328.1| putative dioxygenase
           [Arabidopsis thaliana] gi|332660892|gb|AEE86292.1|
           oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Arabidopsis thaliana]
          Length = 288

 Score =  329 bits (843), Expect = 9e-88
 Identities = 159/244 (65%), Positives = 195/244 (79%)
 Frame = -3

Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554
           KLGL  V +FCS+            SQ++   +PR  R+L+ V++ +EE  ++P+G TG+
Sbjct: 12  KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRL-RMLDMVENGEEEASSMPHGVTGE 70

Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374
            S+ SIP QVLSW+P AIYFP FA+A+QC +IIE AKVNLKPS LA RKGET ENTKG R
Sbjct: 71  ESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTR 130

Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194
           TSSGTFIS++E+ TG LDF+E+KIA+AT+IP SHGE+FNILRYE+GQ+Y SHYD F+P E
Sbjct: 131 TSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTE 190

Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14
           YGPQ SQR+ASFLLYLSDVEEGGETMFP+ENG NM   YDYK+C+GL VKPR+GDGLLFY
Sbjct: 191 YGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFY 250

Query: 13  SLLP 2
           S+ P
Sbjct: 251 SVFP 254


>gb|EXB76669.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notabilis]
          Length = 286

 Score =  328 bits (842), Expect = 1e-87
 Identities = 165/245 (67%), Positives = 193/245 (78%)
 Frame = -3

Query: 736 TKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTG 557
           TK G P  FL C +            SQD+   RP  SR+LESV +D    + +P+G+TG
Sbjct: 10  TKFGPPTAFLLCFLSFLAGFFFSNLLSQDVPGVRP-GSRVLESVGNDGGG-DLMPFGETG 67

Query: 556 DSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGI 377
           DSS   IP QVLSWKP A+YFP FA+A+QC SIIEMAK NL PSTLA RKGET E+TKG 
Sbjct: 68  DSSFQVIPFQVLSWKPRALYFPGFATAEQCQSIIEMAKSNLMPSTLALRKGETDESTKGT 127

Query: 376 RTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPA 197
           RTSSGTFIS++EDKTGILD IEQKIA+AT++P +HGEAFNILRY IGQ+Y SHYDAF+PA
Sbjct: 128 RTSSGTFISASEDKTGILDAIEQKIARATMLPTTHGEAFNILRYNIGQKYDSHYDAFNPA 187

Query: 196 EYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLF 17
           EYGPQKSQR+ASFLLYLSDV+EGGETMFP+ENG  +D  +DY+KC GL VKPR+GDGLLF
Sbjct: 188 EYGPQKSQRIASFLLYLSDVDEGGETMFPFENGEKIDMGFDYRKCTGLKVKPRRGDGLLF 247

Query: 16  YSLLP 2
           YS+ P
Sbjct: 248 YSVFP 252


>ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
           gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha
           subunit, putative [Ricinus communis]
          Length = 290

 Score =  328 bits (842), Expect = 1e-87
 Identities = 166/258 (64%), Positives = 201/258 (77%), Gaps = 1/258 (0%)
 Frame = -3

Query: 772 MKPK-SKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDD 596
           MK K SKG W   +KLGLP VFL C              SQ++   + R  R L+ V ++
Sbjct: 1   MKAKGSKGKWSIKSKLGLPVVFLSCLFFFLAGLFASNLISQNVNGDKNR--RQLQWVKEE 58

Query: 595 DEEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLA 416
             E++ +P G TGD  +  IP QVLSWKP A+YFP FA+A+QC S+I MAK NL PSTLA
Sbjct: 59  IIEYDLLPSGDTGDDYLTVIPFQVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLA 118

Query: 415 FRKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIG 236
            RKGET ENTKGIRTSSG F+S++EDKTG+LD IE+KIA+AT++P ++GEAFNILRYEIG
Sbjct: 119 LRKGETEENTKGIRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIG 178

Query: 235 QRYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVG 56
           Q+Y SHYDAF+PAEYGPQKSQRVASFLLYLSDVEEGGETMFP+EN L++D SYD++KC+G
Sbjct: 179 QKYNSHYDAFNPAEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVDESYDFEKCIG 238

Query: 55  LTVKPRQGDGLLFYSLLP 2
           L V+PR+GDGLLFYSL P
Sbjct: 239 LQVRPRRGDGLLFYSLFP 256


>ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
           gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha
           subunit, putative [Ricinus communis]
          Length = 286

 Score =  327 bits (838), Expect = 3e-87
 Identities = 166/244 (68%), Positives = 194/244 (79%)
 Frame = -3

Query: 733 KLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDDEEFEAIPYGKTGD 554
           KLGLP V L CSV            SQD+   +PR  R+LE  D+  E+ +A+P G TG+
Sbjct: 12  KLGLPAVILVCSVFFVAGFYASTLISQDVPVIKPRL-RMLEVTDE--EKHQAMPRGVTGE 68

Query: 553 SSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAFRKGETTENTKGIR 374
           S + SIP QVLSWKP A+YFP FA+ +QC +IIEMAK+ LKPS LA RKGET E+TKG R
Sbjct: 69  SYIESIPFQVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGETAESTKGTR 128

Query: 373 TSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQRYLSHYDAFDPAE 194
           TSSGTF+S++ED TG LDFIE KIA+AT+IP SHGEAFNILRYEIGQ+Y SHYD+F+PAE
Sbjct: 129 TSSGTFLSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPAE 188

Query: 193 YGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGLTVKPRQGDGLLFY 14
           YGPQ SQRVASFLLYLSDVE+GGETMFP+ENG+ + S YDYKKC GL VKPRQGDG+LFY
Sbjct: 189 YGPQMSQRVASFLLYLSDVEKGGETMFPFENGVKISSVYDYKKCAGLKVKPRQGDGILFY 248

Query: 13  SLLP 2
           SLLP
Sbjct: 249 SLLP 252


>emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
          Length = 276

 Score =  327 bits (838), Expect = 3e-87
 Identities = 171/257 (66%), Positives = 200/257 (77%)
 Frame = -3

Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCSVXXXXXXXXXXXXSQDITSARPRSSRILESVDDDD 593
           MK K KG W    KLGL  +F+  S             SQ          R+LESV    
Sbjct: 1   MKGKGKGVWR--PKLGLLLLFISWSFFFLAGLFGSMLFSQP---------RLLESV---- 45

Query: 592 EEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPSTLAF 413
           EE+  +P+G+TG+SSV  IP QVLSWKP A+YFPRFA+A+QC SIIEMAK +L+PSTLA 
Sbjct: 46  EEYSPMPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLAL 105

Query: 412 RKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRYEIGQ 233
           R+GET E+TKG RTSSGTFIS++EDKTGILDF+E+KIAKAT+IP SHGEAFNILRYEIGQ
Sbjct: 106 RQGETDESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQ 165

Query: 232 RYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKKCVGL 53
           RY SHYDAF+PAEYGPQ SQRVASFLLYLSDVEEGGETMFP+E+ LN+ + YDYKKC+GL
Sbjct: 166 RYNSHYDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGL 225

Query: 52  TVKPRQGDGLLFYSLLP 2
            VKP++GDGLLFYS+ P
Sbjct: 226 KVKPQRGDGLLFYSVFP 242


>ref|XP_007198985.1| hypothetical protein PRUPE_ppa023428mg [Prunus persica]
           gi|462394385|gb|EMJ00184.1| hypothetical protein
           PRUPE_ppa023428mg [Prunus persica]
          Length = 290

 Score =  324 bits (830), Expect = 3e-86
 Identities = 167/261 (63%), Positives = 197/261 (75%), Gaps = 4/261 (1%)
 Frame = -3

Query: 772 MKPKSKGGWGHGTKLGLPFVFLFCS----VXXXXXXXXXXXXSQDITSARPRSSRILESV 605
           MK K+K       K GLP VFL CS    V                 S     SR L+S 
Sbjct: 1   MKVKAKSP---KAKFGLPAVFLLCSLFFFVGLFTFTLLSHVSFPQFLSGPRSVSRTLQS- 56

Query: 604 DDDDEEFEAIPYGKTGDSSVYSIPSQVLSWKPLAIYFPRFASAKQCNSIIEMAKVNLKPS 425
            +D E+   +P G+TGDS + SIP QVLSWKP A+YFPRFA+A+QC S+IEMAK  L+PS
Sbjct: 57  -EDGEDHGPMPQGETGDSFIQSIPFQVLSWKPRALYFPRFATAEQCESVIEMAKTKLRPS 115

Query: 424 TLAFRKGETTENTKGIRTSSGTFISSAEDKTGILDFIEQKIAKATLIPASHGEAFNILRY 245
            LA RKGETTE+TKG RTSSGTFIS++ED+TGIL+ IE+KIA+AT++P +HGEAFN+LRY
Sbjct: 116 ALALRKGETTESTKGTRTSSGTFISASEDETGILEIIEEKIARATMLPRTHGEAFNVLRY 175

Query: 244 EIGQRYLSHYDAFDPAEYGPQKSQRVASFLLYLSDVEEGGETMFPYENGLNMDSSYDYKK 65
           EIGQ+Y SHYDAF+P+EYG QKSQR ASFLLYLSDVEEGGETMFP+ENGL+M  SYDYKK
Sbjct: 176 EIGQKYDSHYDAFNPSEYGQQKSQRFASFLLYLSDVEEGGETMFPFENGLHMGMSYDYKK 235

Query: 64  CVGLTVKPRQGDGLLFYSLLP 2
           C+GL V PRQGDGLLFYS+LP
Sbjct: 236 CIGLKVMPRQGDGLLFYSVLP 256