BLASTX nr result
ID: Catharanthus22_contig00003483
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00003483
         (933 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006356692.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   389   e-106
ref|XP_004241065.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   388   e-105
dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]        387   e-105
ref|XP_004229482.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   386   e-105
ref|XP_006365272.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   381   e-103
ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like prot...   377   e-102
gb|EXC05706.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notab...   376   e-102
ref|XP_004296772.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   376   e-102
ref|XP_006419737.1| hypothetical protein CICLE_v10005535mg [Citr...   375   e-101
gb|EOY06341.1| 2-oxoglutarate and Fe(II)-dependent oxygenase sup...   375   e-101
ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative...   374   e-101
ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like prot...   372   e-101
ref|XP_002312720.1| oxidoreductase family protein [Populus trich...   371   e-100
gb|EMJ24403.1| hypothetical protein PRUPE_ppa009336mg [Prunus pe...   370   e-100
ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   369   e-100
ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alph...   369   1e-99
ref|NP_001241485.1| uncharacterized protein LOC100783075 precurs...   367   3e-99
ref|NP_001276206.1| uncharacterized protein LOC100818794 precurs...
367 5e-99 ref|NP_001242363.1| uncharacterized protein LOC100796794 precurs... 366 6e-99 ref|XP_004508327.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 363 7e-98 >ref|XP_006356692.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum tuberosum] Length = 295 Score = 389 bits (1000), Expect = e-106 Identities = 189/257 (73%), Positives = 215/257 (83%), Gaps = 7/257 (2%) Frame = +3 Query: 183 FLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAV 362 FL ++ +++SS SAIINPSK KQISW+PRAFVY GFLTDEEC+HLISLAKSELKRSAV Sbjct: 12 FLIFIIAFIHESSSSAIINPSKSKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAV 71 Query: 363 ADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQK 521 ADN SG+SK SEVRTSSGMFI PKENGE+IQVLRYE QK Sbjct: 72 ADNESGKSKHSEVRTSSGMFISKAKDPIVSGIEDKIATWTFLPKENGEEIQVLRYEEGQK 131 Query: 522 YDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDC 701 Y+PHYDYF DKVN+ARGGHR ATVLMYL+DVEKGGETVFP AEE+ RRRS+ AD+ LS+C Sbjct: 132 YEPHYDYFVDKVNIARGGHRFATVLMYLTDVEKGGETVFPKAEESHRRRSMAADDSLSEC 191 Query: 702 AKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGS 881 AKKG+AVKPRKGDALLF+SLHP+A PDP SLHGGCPV++GEKWSATKWIHVD+FDKT+G+ Sbjct: 192 AKKGIAVKPRKGDALLFYSLHPNATPDPISLHGGCPVLQGEKWSATKWIHVDSFDKTVGT 251 Query: 882 SDSCTDANENCERWAAL 932 +CTDA+ENCERWAAL Sbjct: 252 DGNCTDADENCERWAAL 268 >ref|XP_004241065.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum lycopersicum] Length = 295 Score = 388 bits (997), Expect = e-105 Identities = 189/257 (73%), Positives = 215/257 (83%), Gaps = 7/257 (2%) Frame = +3 Query: 183 FLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAV 362 FL ++ ++S+ SAIINPSK KQISW+PRAFVY GFLTDEEC+HLISLAKSELKRSAV Sbjct: 12 FLIFIIAFTHESTSSAIINPSKSKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAV 71 Query: 363 ADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQK 521 ADN SGESK SEVRTSSGMFI PKENGE+IQVLRYE QK Sbjct: 72 ADNESGESKHSEVRTSSGMFISKAKDPIVSGIEDKIATWTFLPKENGEEIQVLRYEEGQK 131 Query: 522 
YDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDC 701 Y+PHYDYF DKVN+ARGGHR+ATVLMYL+DVEKGGETVFP AEE+ RRRS+ AD+ LS+C Sbjct: 132 YEPHYDYFVDKVNIARGGHRLATVLMYLTDVEKGGETVFPKAEESHRRRSMAADDSLSEC 191 Query: 702 AKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGS 881 AKKG+AVKPRKGDALLFFSL+P+A PDP SLHGGCPV++GEKWSATKWIHVD+FDKT+G+ Sbjct: 192 AKKGIAVKPRKGDALLFFSLYPNATPDPISLHGGCPVLQGEKWSATKWIHVDSFDKTVGT 251 Query: 882 SDSCTDANENCERWAAL 932 +CTDA+ENCERWAAL Sbjct: 252 DGNCTDADENCERWAAL 268 >dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum] Length = 294 Score = 387 bits (994), Expect = e-105 Identities = 188/257 (73%), Positives = 213/257 (82%), Gaps = 7/257 (2%) Frame = +3 Query: 183 FLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAV 362 FL ++ V +SS SAIINPSK KQISW+PRAFVY GFLTDEEC+HLISLAKSELKRSAV Sbjct: 11 FLLFIIAFVRESSSSAIINPSKAKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAV 70 Query: 363 ADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQK 521 ADN SG SK SEVRTSSGMFIP PKENGE+IQVLRYE QK Sbjct: 71 ADNESGNSKTSEVRTSSGMFIPKAKDPIVSGIEEKIATWTFLPKENGEEIQVLRYEEGQK 130 Query: 522 YDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDC 701 Y+PHYDYF DKVN+ARGGHR+ATVLMYL++VEKGGETVFP AEE+ RRRS+IAD+ LS+C Sbjct: 131 YEPHYDYFVDKVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSEC 190 Query: 702 AKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGS 881 AKKG+ VKPRKGDALLF+SLHP+A PDP SLHGGCPVI+GEKWSATKWIHVD+FDKT+ + Sbjct: 191 AKKGIPVKPRKGDALLFYSLHPNATPDPLSLHGGCPVIQGEKWSATKWIHVDSFDKTVDT 250 Query: 882 SDSCTDANENCERWAAL 932 +C+D +ENCERWAAL Sbjct: 251 EGNCSDRDENCERWAAL 267 >ref|XP_004229482.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum lycopersicum] Length = 295 Score = 386 bits (991), Expect = e-105 Identities = 194/273 (71%), Positives = 217/273 (79%), Gaps = 8/273 (2%) Frame = +3 Query: 138 LKLFSSMIKFWQLILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECD 317 + +FS + F F + + V SS SAIINPSKVKQISW+PRAFVY GFLTDEEC+ Sbjct: 1 
MNIFSQIFTF-----FFFLIVVFVTKSSCSAIINPSKVKQISWKPRAFVYEGFLTDEECN 55 Query: 318 HLISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKE 476 HLISLAK ELKRSAVADN SGESKLSEVRTSSGMFI PKE Sbjct: 56 HLISLAKKELKRSAVADNESGESKLSEVRTSSGMFISKAKDPIVTGIEEKIATWTFLPKE 115 Query: 477 NGEDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEET 656 NGEDIQVLRYE Q+Y+PHYDYFTDKVN+ RGGHR+ATVLMYLSDVEKGGETVFP AE + Sbjct: 116 NGEDIQVLRYEEGQRYEPHYDYFTDKVNIVRGGHRLATVLMYLSDVEKGGETVFPEAEVS 175 Query: 657 SRRRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSA 836 +RRRS+ AD+ LS+CAK+G+AVKPRKGDALLFFSLHP+A+PDP SLHGGCPV+EGEKWSA Sbjct: 176 TRRRSMAADDSLSECAKRGIAVKPRKGDALLFFSLHPNAVPDPMSLHGGCPVMEGEKWSA 235 Query: 837 TKWIHVDNFDKTLGSSDS-CTDANENCERWAAL 932 TKWIHVD+FDKT+ S C D NENCERWAAL Sbjct: 236 TKWIHVDSFDKTVDSEGGHCADHNENCERWAAL 268 >ref|XP_006365272.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum tuberosum] Length = 295 Score = 381 bits (979), Expect = e-103 Identities = 192/265 (72%), Positives = 213/265 (80%), Gaps = 9/265 (3%) Frame = +3 Query: 165 FWQLILFLSIALSI-VNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKS 341 F Q+ FL +++ V SS SAIINPSKVKQISW+PRAFVY GFLTDEEC+HL+SLAK Sbjct: 4 FSQIFTFLFFLIAVFVTKSSCSAIINPSKVKQISWKPRAFVYEGFLTDEECNHLVSLAKK 63 Query: 342 ELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVL 500 ELKRSAVADN SGESKLSEVRTSSGMFI P ENGEDIQVL Sbjct: 64 ELKRSAVADNDSGESKLSEVRTSSGMFISKAKDPIVTGIEEKIATWTFLPTENGEDIQVL 123 Query: 501 RYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIA 680 RYE Q+Y+PH+DYFTDKVN+ RGGHR+ATVLMYLSDVEKGGET FP AE ++RRRS+ A Sbjct: 124 RYEEGQRYEPHHDYFTDKVNIVRGGHRLATVLMYLSDVEKGGETAFPEAEVSTRRRSMAA 183 Query: 681 DEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDN 860 D LS+CAKKG+AVKPRKGDALLFFSLHP+A+PDP SLHGGCPVIEGEKWSATKWIHVD+ Sbjct: 184 DNSLSECAKKGIAVKPRKGDALLFFSLHPNAVPDPMSLHGGCPVIEGEKWSATKWIHVDS 243 Query: 861 FDKTLGSSDS-CTDANENCERWAAL 932 FDKT+ S C D NENCERWAAL Sbjct: 244 FDKTVESEGGHCADHNENCERWAAL 268 >ref|XP_003594052.1| Prolyl 
4-hydroxylase alpha subunit-like protein [Medicago truncatula] gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago truncatula] Length = 301 Score = 377 bits (969), Expect = e-102 Identities = 186/260 (71%), Positives = 211/260 (81%), Gaps = 7/260 (2%) Frame = +3 Query: 174 LILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKR 353 L+L I ++ + + SAII+P+KVKQ+SW+PRAFVY+GFLTD ECDHLIS+AKSELKR Sbjct: 15 LLLICKIHFALGSYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKR 74 Query: 354 SAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEP 512 SAVADNLSGESKLSEVRTSSGMFI PKENGEDIQVLRYE Sbjct: 75 SAVADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEH 134 Query: 513 RQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDL 692 QKYDPHYDYF DKVN+ARGGHR+ATVLMYL++V KGGETVFPNAEE+ R + DEDL Sbjct: 135 GQKYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDL 194 Query: 693 SDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKT 872 S+C KKGVAVKPR+GDALLFFSLHP+AIPD SLH GCPVIEGEKWSATKWIHVD+FDKT Sbjct: 195 SECGKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKT 254 Query: 873 LGSSDSCTDANENCERWAAL 932 +G+ CTD +E+CERWAAL Sbjct: 255 VGAGGDCTDQHESCERWAAL 274 >gb|EXC05706.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notabilis] Length = 300 Score = 376 bits (966), Expect = e-102 Identities = 191/273 (69%), Positives = 214/273 (78%), Gaps = 14/273 (5%) Frame = +3 Query: 156 MIKFW-QLILFLSIALSIVNDSSGS------AIINPSKVKQISWRPRAFVYRGFLTDEEC 314 M K W QL LFL S ++SS S +IINPSKVKQ+SW+PRAFVY GFLTD EC Sbjct: 1 MSKLWVQLFLFLLSISSSFHESSSSYAGSAASIINPSKVKQVSWKPRAFVYEGFLTDLEC 60 Query: 315 DHLISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PK 473 DHLISLAKSELKRSAVADN+SG+SKLSEVRTSSGMFIP PK Sbjct: 61 DHLISLAKSELKRSAVADNVSGKSKLSEVRTSSGMFIPKAKDPIVAGIEDKISTWTFLPK 120 Query: 474 ENGEDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEE 653 ENGED+QVLRYE QKYDPHYDYF DKVN+ARGGHRIATVLMYL+DV KGGETVFP+AEE Sbjct: 121 
ENGEDMQVLRYEHGQKYDPHYDYFADKVNIARGGHRIATVLMYLTDVVKGGETVFPSAEE 180 Query: 654 TSRRRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWS 833 + ++ D+DLS+CAKKG+AVKPR+GDALLFFSL P+A+PD SLH GCPVIEGEKWS Sbjct: 181 SHHHKASTTDDDLSECAKKGIAVKPRRGDALLFFSLLPTAVPDTISLHAGCPVIEGEKWS 240 Query: 834 ATKWIHVDNFDKTLGSSDSCTDANENCERWAAL 932 ATKWIHVD+FDK L + CTD NE+CERWAAL Sbjct: 241 ATKWIHVDSFDKDLSAGGKCTDQNESCERWAAL 273 >ref|XP_004296772.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Fragaria vesca subsp. vesca] Length = 294 Score = 376 bits (965), Expect = e-102 Identities = 185/259 (71%), Positives = 209/259 (80%), Gaps = 7/259 (2%) Frame = +3 Query: 177 ILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRS 356 + FLS+ LS+ + +S + +NPSKVKQISW PRAFVY G L++ ECDHLIS+AKSELKRS Sbjct: 10 LCFLSLLLSLTS-ASATFTVNPSKVKQISWNPRAFVYEGLLSELECDHLISIAKSELKRS 68 Query: 357 AVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPR 515 AVADNLSG+SKLSEVRTSSGMFIP PKENGEDIQVLRYEP Sbjct: 69 AVADNLSGQSKLSEVRTSSGMFIPKAKDHIVAGIEDKLATWTFLPKENGEDIQVLRYEPG 128 Query: 516 QKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLS 695 QKY+PHYDYF DKVN+ARGGHRIATVLMYL+DV KGGETVFP AEE RR++ + D LS Sbjct: 129 QKYEPHYDYFADKVNIARGGHRIATVLMYLTDVAKGGETVFPLAEEVHRRKASVPDASLS 188 Query: 696 DCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTL 875 DCAKKG+AVKPR+GDALLFFSLHP+AIPD SLH GCPVIEGEKWSATKWIHVD+FD L Sbjct: 189 DCAKKGIAVKPRRGDALLFFSLHPNAIPDENSLHAGCPVIEGEKWSATKWIHVDSFDNIL 248 Query: 876 GSSDSCTDANENCERWAAL 932 + +CTD NE+CERWAAL Sbjct: 249 DTGGNCTDLNESCERWAAL 267 >ref|XP_006419737.1| hypothetical protein CICLE_v10005535mg [Citrus clementina] gi|557521610|gb|ESR32977.1| hypothetical protein CICLE_v10005535mg [Citrus clementina] Length = 296 Score = 375 bits (964), Expect = e-101 Identities = 186/256 (72%), Positives = 206/256 (80%), Gaps = 7/256 (2%) Frame = +3 Query: 186 LSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAVA 365 LS +L I S +AIINPSKVKQISW+PRAFVY GFLTD ECDHLI+LAKS+LKRSAVA Sbjct: 
14 LSFSLLIRKSFSSTAIINPSKVKQISWKPRAFVYEGFLTDLECDHLINLAKSQLKRSAVA 73 Query: 366 DNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQKY 524 DNLSGESKLS+VRTSSG FIP PKENGEDIQVLRYE QKY Sbjct: 74 DNLSGESKLSDVRTSSGTFIPKGKDAIIAGIEDKIATWTFLPKENGEDIQVLRYEHGQKY 133 Query: 525 DPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDCA 704 +PHYDYF+DKVN+ RGGHR+ATVLMYLSDV KGGETVFPNAEE RRR+ ++DLS+CA Sbjct: 134 EPHYDYFSDKVNIVRGGHRLATVLMYLSDVAKGGETVFPNAEEPPRRRTPATNDDLSECA 193 Query: 705 KKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGSS 884 KKG+AVKPR+GDALLFFSLH +AIPDP SLH GCPVIEGEKWSATKWIHVD+FDK + Sbjct: 194 KKGIAVKPRRGDALLFFSLHTNAIPDPVSLHSGCPVIEGEKWSATKWIHVDSFDKIVEEG 253 Query: 885 DSCTDANENCERWAAL 932 CTD N +CERWAAL Sbjct: 254 GDCTDNNASCERWAAL 269 >gb|EOY06341.1| 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein [Theobroma cacao] Length = 287 Score = 375 bits (964), Expect = e-101 Identities = 182/243 (74%), Positives = 203/243 (83%), Gaps = 7/243 (2%) Frame = +3 Query: 225 SAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAVADNLSGESKLSEVR 404 S+IINP+K KQ+SW+PRAFVY GFLTD ECDHLISLAKSELKRSAVADN+SG+SKLSEVR Sbjct: 33 SSIINPAKAKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNVSGKSKLSEVR 92 Query: 405 TSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQKYDPHYDYFTDKVNV 563 TSSGMFI PKENGEDIQVLRYE QKYDPHYDYF DKVN+ Sbjct: 93 TSSGMFISKGKDPIVAGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDPHYDYFVDKVNI 152 Query: 564 ARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDCAKKGVAVKPRKGDA 743 ARGGHRIATVLMYL+DV KGGET+FP AEE+SRR++ D+DLS+CAKKG+AVKPR+GDA Sbjct: 153 ARGGHRIATVLMYLTDVTKGGETIFPQAEESSRRKTPATDDDLSECAKKGIAVKPRRGDA 212 Query: 744 LLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGSSDSCTDANENCERW 923 LLFFSL P+AIPDP SLH GCPVIEGEKWSATKWIHVD+FDK L + +CTD NE+CERW Sbjct: 213 LLFFSLSPTAIPDPSSLHAGCPVIEGEKWSATKWIHVDSFDKNLEAGGNCTDLNESCERW 272 Query: 924 AAL 932 AAL Sbjct: 273 AAL 275 >ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis] gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase 
alpha subunit, putative [Ricinus communis] Length = 297 Score = 374 bits (959), Expect = e-101 Identities = 184/260 (70%), Positives = 209/260 (80%), Gaps = 7/260 (2%) Frame = +3 Query: 174 LILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKR 353 L++ L S S ++II+PSKVKQ+SW+PRAFVY GFLTD ECDHLISLAKSELKR Sbjct: 11 LLISLIFHKSSSYPGSPTSIIDPSKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKR 70 Query: 354 SAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEP 512 SAVADN SG+SKLSEVRTSSGMFI PKENGED+QVLRYE Sbjct: 71 SAVADNESGKSKLSEVRTSSGMFIAKGKDPIIAGIEEKISTWTFLPKENGEDLQVLRYEH 130 Query: 513 RQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDL 692 QKYDPHYDYF DK+N+ARGGHR+ATVLMYLSDV KGGETVFPNAEE RR++ + EDL Sbjct: 131 GQKYDPHYDYFADKINIARGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDL 190 Query: 693 SDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKT 872 S+CAKKG++VKPR+GDALLFFSLHP+AIPDP SLH GCPVIEGEKWSATKWIHVD+FDK Sbjct: 191 SECAKKGISVKPRRGDALLFFSLHPTAIPDPNSLHAGCPVIEGEKWSATKWIHVDSFDKN 250 Query: 873 LGSSDSCTDANENCERWAAL 932 + + +CTD NE+CERWAAL Sbjct: 251 IEAGGNCTDKNESCERWAAL 270 >ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago truncatula] gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago truncatula] Length = 303 Score = 372 bits (956), Expect = e-101 Identities = 186/262 (70%), Positives = 211/262 (80%), Gaps = 9/262 (3%) Frame = +3 Query: 174 LILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKR 353 L+L I ++ + + SAII+P+KVKQ+SW+PRAFVY+GFLTD ECDHLIS+AKSELKR Sbjct: 15 LLLICKIHFALGSYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKR 74 Query: 354 SAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEP 512 SAVADNLSGESKLSEVRTSSGMFI PKENGEDIQVLRYE Sbjct: 75 SAVADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEH 134 Query: 513 RQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAE--ETSRRRSVIADE 686 QKYDPHYDYF DKVN+ARGGHR+ATVLMYL++V KGGETVFPNAE E+ R + DE Sbjct: 135 
GQKYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDE 194 Query: 687 DLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFD 866 DLS+C KKGVAVKPR+GDALLFFSLHP+AIPD SLH GCPVIEGEKWSATKWIHVD+FD Sbjct: 195 DLSECGKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFD 254 Query: 867 KTLGSSDSCTDANENCERWAAL 932 KT+G+ CTD +E+CERWAAL Sbjct: 255 KTVGAGGDCTDQHESCERWAAL 276 >ref|XP_002312720.1| oxidoreductase family protein [Populus trichocarpa] gi|222852540|gb|EEE90087.1| oxidoreductase family protein [Populus trichocarpa] Length = 300 Score = 371 bits (953), Expect = e-100 Identities = 181/260 (69%), Positives = 211/260 (81%), Gaps = 7/260 (2%) Frame = +3 Query: 174 LILFLSIALSIVNDSSGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKR 353 L +F + SI + S+IINP+KVKQ+SW+PRAFVY GFLTD ECDHLISLAKSELKR Sbjct: 14 LSIFSILHKSISYPGTSSSIINPAKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKR 73 Query: 354 SAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEP 512 SAVADN SG+SKLSEVRTSSGMFI P+ENGEDIQVLRYE Sbjct: 74 SAVADNESGKSKLSEVRTSSGMFITKAKDPIVAGIEDKIATWTFLPRENGEDIQVLRYEH 133 Query: 513 RQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDL 692 QKYDPHYDYF+DKVN+ARGGHR+ATVLMYL+DVEKGGETVFP+AEE RR++ ++ EDL Sbjct: 134 GQKYDPHYDYFSDKVNIARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDL 193 Query: 693 SDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKT 872 S+CA+KG+AVKPR+GDALLFFSL+P+A+PD S+H GCPVIEGEKWSATKWIHVD+FDK Sbjct: 194 SECARKGIAVKPRRGDALLFFSLYPTAVPDTSSIHAGCPVIEGEKWSATKWIHVDSFDKN 253 Query: 873 LGSSDSCTDANENCERWAAL 932 L + +CTD NE+C RWAAL Sbjct: 254 LEAGGNCTDQNESCGRWAAL 273 >gb|EMJ24403.1| hypothetical protein PRUPE_ppa009336mg [Prunus persica] Length = 297 Score = 370 bits (951), Expect = e-100 Identities = 184/270 (68%), Positives = 207/270 (76%), Gaps = 11/270 (4%) Frame = +3 Query: 156 MIKFWQLILFLSIALSIVNDSSGSA----IINPSKVKQISWRPRAFVYRGFLTDEECDHL 323 M + W + F LSI + S ++ +NPSKV+QISW PRAFVY G LTD ECDHL Sbjct: 1 MTRVWLQLFFFFFLLSISSSSYAASPHTFTVNPSKVRQISWNPRAFVYEGLLTDAECDHL 60 Query: 324 
ISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENG 482 IS+AKSELKRSAVADNLSG+SKLSEVRTSSGMFIP PKENG Sbjct: 61 ISIAKSELKRSAVADNLSGQSKLSEVRTSSGMFIPKAKDPIVAGIEDKIATWTFLPKENG 120 Query: 483 EDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSR 662 EDIQVLRYEP QKY+PHYDYF DKVN+ARGGHRIATVLMYL+DV +GGETVFP AE SR Sbjct: 121 EDIQVLRYEPGQKYEPHYDYFADKVNIARGGHRIATVLMYLTDVTRGGETVFPEAEVPSR 180 Query: 663 RRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATK 842 R++ D LS+CAKKG+AVKPR+GDALLFFSL P A+PD SLH GCPVIEGEKWSATK Sbjct: 181 RKASEVDHSLSECAKKGIAVKPRRGDALLFFSLTPHAVPDENSLHAGCPVIEGEKWSATK 240 Query: 843 WIHVDNFDKTLGSSDSCTDANENCERWAAL 932 WIHVD+FDK L +S +C D NE+CERWAAL Sbjct: 241 WIHVDSFDKNLDASGNCADLNESCERWAAL 270 >ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis vinifera] gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera] Length = 298 Score = 369 bits (947), Expect = e-100 Identities = 185/271 (68%), Positives = 215/271 (79%), Gaps = 12/271 (4%) Frame = +3 Query: 156 MIKFWQLILFLSIALSIVNDSSGSAI-----INPSKVKQISWRPRAFVYRGFLTDEECDH 320 M+ Q +L L I+ +I+ SS A ++ +KV+QISW+PRAFVY GFL++EECDH Sbjct: 1 MVSSLQFLLLLWISSTILEFSSSYADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDH 60 Query: 321 LISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKEN 479 LISLAKSELKRSAVADN+SG+S+LSEVRTSSGMFI PK+N Sbjct: 61 LISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDN 120 Query: 480 GEDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETS 659 GED+QVLRYEP QKYD HYDYF DKVN+ARGGHRIATVLMYLSDV KGGETVFP AEE S Sbjct: 121 GEDMQVLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPS 180 Query: 660 RRRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSAT 839 RR+ + ++DLS+CA+KG+AVKPRKGDALLFFSLHP+AIPDP SLHGGCPVIEGEKWSAT Sbjct: 181 RRKPLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSAT 240 Query: 840 KWIHVDNFDKTLGSSDSCTDANENCERWAAL 932 KWIHVD+FDK L +CTD N++CERWAAL Sbjct: 241 KWIHVDSFDKILKPGGNCTDENDSCERWAAL 271 >ref|XP_003546111.1| 
PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1 [Glycine max] Length = 301 Score = 369 bits (946), Expect = 1e-99 Identities = 180/245 (73%), Positives = 198/245 (80%), Gaps = 7/245 (2%) Frame = +3 Query: 219 SGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAVADNLSGESKLSE 398 S SAII+PSKVKQ+SW+PRAFVY GFLT+ ECDHLIS+AKSELKRSAVADNLSGESKLSE Sbjct: 30 SASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSE 89 Query: 399 VRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQKYDPHYDYFTDKV 557 VRTSSGMFIP PKENGEDIQVLRYE QKYDPHYDYF DKV Sbjct: 90 VRTSSGMFIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYFADKV 149 Query: 558 NVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDCAKKGVAVKPRKG 737 N+ARGGHR+ATVLMYL+DV KGGETVFPNAEE+ R R EDLS+CA+KG+AVKPR+G Sbjct: 150 NIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECAQKGIAVKPRRG 209 Query: 738 DALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGSSDSCTDANENCE 917 DALLFFSL+P+AIPD SLH GCPVIEGEKWSATKWIHVD+FDK + C D ENC+ Sbjct: 210 DALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIHVDSFDKMVADGGDCNDKQENCD 269 Query: 918 RWAAL 932 RWA L Sbjct: 270 RWATL 274 >ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max] gi|571532068|ref|XP_006600167.1| PREDICTED: uncharacterized protein LOC100783075 isoform X1 [Glycine max] gi|255645457|gb|ACU23224.1| unknown [Glycine max] Length = 298 Score = 367 bits (942), Expect = 3e-99 Identities = 185/270 (68%), Positives = 209/270 (77%), Gaps = 9/270 (3%) Frame = +3 Query: 150 SSMIKFWQLILFLSIALSIVNDSSGSA--IINPSKVKQISWRPRAFVYRGFLTDEECDHL 323 SS + F +L +S + +GSA I+NPSKVKQISW+PRAFVY GFLTD ECDHL Sbjct: 2 SSRVWFLLFLLLISKCHQVWGSYAGSASSIVNPSKVKQISWKPRAFVYEGFLTDLECDHL 61 Query: 324 ISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENG 482 ISLAKSELKRSAVADNLSGES+LS+VRTSSGMFI PKENG Sbjct: 62 ISLAKSELKRSAVADNLSGESQLSDVRTSSGMFISKNKDPIISGIEDKISSWTFLPKENG 121 Query: 483 EDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSR 662 EDIQVLRYE QKYDPHYDYFTDKVN+ARGGHRIATVLMYL++V KGGETVFP+AEE R Sbjct: 122 
EDIQVLRYEHGQKYDPHYDYFTDKVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPR 181 Query: 663 RRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATK 842 RR DLS+CAKKG+AVKP +GDALLFFSLH +A PD SLH GCPVIEGEKWSATK Sbjct: 182 RRGTETSSDLSECAKKGIAVKPHRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATK 241 Query: 843 WIHVDNFDKTLGSSDSCTDANENCERWAAL 932 WIHVD+FDKT+G+ C+D + +CERWA+L Sbjct: 242 WIHVDSFDKTVGAGGDCSDHHVSCERWASL 271 >ref|NP_001276206.1| uncharacterized protein LOC100818794 precursor [Glycine max] gi|255641919|gb|ACU21228.1| unknown [Glycine max] Length = 301 Score = 367 bits (941), Expect = 5e-99 Identities = 178/245 (72%), Positives = 201/245 (82%), Gaps = 7/245 (2%) Frame = +3 Query: 219 SGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAVADNLSGESKLSE 398 S SAII+PSKVKQ+SW+PRAFVY GFLT+ ECDHLIS+AKSELKRSAVADNLSGESKLSE Sbjct: 30 SASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSE 89 Query: 399 VRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQKYDPHYDYFTDKV 557 VRTSSGMFIP PKENGEDIQVLRYE QKYDPHYDYF DKV Sbjct: 90 VRTSSGMFIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKV 149 Query: 558 NVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDCAKKGVAVKPRKG 737 N+ARGGHR+ATVLMYL+DV KGGETVFP+AEE+ R + +E+LS+CA+KG+AVKPR+G Sbjct: 150 NIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRG 209 Query: 738 DALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGSSDSCTDANENCE 917 DALLFFSL+P+AIPD SLH GCPVIEGEKWSAT+WIHVD+FDK +G C D +ENCE Sbjct: 210 DALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATEWIHVDSFDKVVGDGGDCNDKHENCE 269 Query: 918 RWAAL 932 RWA L Sbjct: 270 RWATL 274 >ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max] gi|255641119|gb|ACU20838.1| unknown [Glycine max] Length = 297 Score = 366 bits (940), Expect = 6e-99 Identities = 180/245 (73%), Positives = 199/245 (81%), Gaps = 7/245 (2%) Frame = +3 Query: 219 SGSAIINPSKVKQISWRPRAFVYRGFLTDEECDHLISLAKSELKRSAVADNLSGESKLSE 398 S S++INPSKVKQISW+PRAFVY GFLTD ECDHLISLAKSELKRSAVADNLSGES+LS+ Sbjct: 26 
SASSVINPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSD 85 Query: 399 VRTSSGMFIPXXXXXXXXXXXXX-------PKENGEDIQVLRYEPRQKYDPHYDYFTDKV 557 VRTSSGMFI PKENGEDIQV RYE QKYDPHYDYFTDKV Sbjct: 86 VRTSSGMFISKNKDPIVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDYFTDKV 145 Query: 558 NVARGGHRIATVLMYLSDVEKGGETVFPNAEETSRRRSVIADEDLSDCAKKGVAVKPRKG 737 N+ARGGHRIATVLMYL+DV KGGETVFP+AEE RRR DLS+CAKKG+AVKPR+G Sbjct: 146 NIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRG 205 Query: 738 DALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATKWIHVDNFDKTLGSSDSCTDANENCE 917 DALLFFSLH +A PD SLH GCPVIEGEKWSATKWIHVD+FDKT+G+ C+D + +CE Sbjct: 206 DALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIHVDSFDKTVGAGGDCSDNHVSCE 265 Query: 918 RWAAL 932 RWA+L Sbjct: 266 RWASL 270 >ref|XP_004508327.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cicer arietinum] Length = 297 Score = 363 bits (931), Expect = 7e-98 Identities = 184/270 (68%), Positives = 209/270 (77%), Gaps = 11/270 (4%) Frame = +3 Query: 156 MIKFWQLILFLSIALSIVNDSS----GSAIINPSKVKQISWRPRAFVYRGFLTDEECDHL 323 MIK W L+L + I+ + SS S+IINPSKVKQISW PRAFVY+GFLTD ECDHL Sbjct: 1 MIKVWFLLLLVLISQTDEVHSSYAGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHL 60 Query: 324 ISLAKSELKRSAVADNLSGESKLSEVRTSSGMFIPXXXXXXXXXXXXX-------PKENG 482 ISLAKSELKRSAVADNLSG+SKLS+VRTSSGMFI PKENG Sbjct: 61 ISLAKSELKRSAVADNLSGDSKLSDVRTSSGMFISKNKDPIVAGIEDKISSWTFLPKENG 120 Query: 483 EDIQVLRYEPRQKYDPHYDYFTDKVNVARGGHRIATVLMYLSDVEKGGETVFPNAEETSR 662 EDIQVLRYE QKYDPHYDYFTDKVN+A+GGHR TVLMYL++V KGGET+FP A+E R Sbjct: 121 EDIQVLRYEHGQKYDPHYDYFTDKVNIAQGGHRFVTVLMYLTNVTKGGETMFPVAKEPPR 180 Query: 663 RRSVIADEDLSDCAKKGVAVKPRKGDALLFFSLHPSAIPDPESLHGGCPVIEGEKWSATK 842 RR DLS+CAKKG+AVKPR+GDALLFFSLH +A PD SLH GCPVIEGEKWSATK Sbjct: 181 RRGSETSSDLSECAKKGIAVKPRRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATK 240 Query: 843 WIHVDNFDKTLGSSDSCTDANENCERWAAL 932 WIHVD+FDK +G+ C+D +E+CERWA+L Sbjct: 241 WIHVDSFDKNVGAGGGCSDQHESCERWASL 270
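
Note on reading the hit list: each row in the "Sequences producing significant alignments" list near the top of this report carries a database identifier, a truncated description, a bit score, and an E-value, and very small E-values are printed without a mantissa (for example e-106, meaning 1e-106). The following is a minimal Python sketch of how one such row could be parsed. The names ROW, parse_evalue, and parse_hit are hypothetical and not part of BLAST, and the row layout is assumed from the rows shown in this report only.

```python
import re

# Assumed row layout (taken from this report, not from a BLAST specification):
#   <db|accession|> <description...> <bit score> <E-value>
ROW = re.compile(
    r"^(?P<id>\S+\|)\s+"                                  # e.g. ref|XP_006356692.1|
    r"(?P<desc>.*?)\s+"                                   # description, possibly truncated
    r"(?P<bits>\d+)\s+"                                   # bit score
    r"(?P<evalue>(?:\d+(?:\.\d+)?)?e-\d+|\d+(?:\.\d+)?)$"  # e-106, 1e-99, 0.0, ...
)


def parse_evalue(text):
    """Convert an E-value string to float, adding the implied mantissa of 1."""
    return float("1" + text) if text.startswith("e") else float(text)


def parse_hit(line):
    """Return a dict for one hit-summary row, or None if the line is not a row."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    return {
        "id": match["id"],
        "description": match["desc"],
        "bits": int(match["bits"]),
        "evalue": parse_evalue(match["evalue"]),
    }


if __name__ == "__main__":
    row = "ref|XP_006356692.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 389 e-106"
    print(parse_hit(row))
```

Because the pattern only requires whitespace between fields, it should accept rows whether or not the original column padding survived; lines that do not end in a score and an E-value are rejected rather than guessed at.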